101
|
Toda Y, Wakatsuki H, Aoike T, Kajiya-Kanegae H, Yamasaki M, Yoshioka T, Ebana K, Hayashi T, Nakagawa H, Hasegawa T, Iwata H. Predicting biomass of rice with intermediate traits: Modeling method combining crop growth models and genomic prediction models. PLoS One 2020; 15:e0233951. [PMID: 32559220 PMCID: PMC7304626 DOI: 10.1371/journal.pone.0233951] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2019] [Accepted: 05/15/2020] [Indexed: 11/30/2022] Open
Abstract
Genomic prediction (GP) is expected to become a powerful technology for accelerating the genetic improvement of complex crop traits. Several GP models have been proposed to enhance their applications in plant breeding, including environmental effects and genotype-by-environment interactions (G×E). In this study, we proposed a two-step model for plant biomass prediction wherein environmental information and growth-related traits were considered. First, the growth-related traits were predicted by GP. Second, the biomass was predicted from the GP-predicted values and environmental data using machine learning or crop growth modeling. We applied the model to a 2-year-old field trial dataset of recombinant inbred lines of japonica rice and evaluated the prediction accuracy with training and testing data by cross-validation performed over two years. Therefore, the proposed model achieved an equivalent or a higher correlation between the observed and predicted values (0.53 and 0.65 for each year, respectively) than the model in which biomass was directly predicted by GP (0.40 and 0.65 for each year, respectively). This result indicated that including growth-related traits enhanced accuracy of biomass prediction. Our findings are expected to contribute to the spread of the use of GP in crop breeding by enabling more precise prediction of environmental effects on crop traits.
Collapse
Affiliation(s)
- Yusuke Toda
- Department of Agricultural and Environmental Biology, Graduate School of Agricultural and Life Science, The University of Tokyo, Tokyo, Japan
| | - Hitomi Wakatsuki
- Institute for Agro-Environmental Sciences, National Agriculture and Food Research Organization (NARO), Ibaraki, Japan
| | - Toru Aoike
- Department of Agricultural and Environmental Biology, Graduate School of Agricultural and Life Science, The University of Tokyo, Tokyo, Japan
| | | | - Masanori Yamasaki
- Food Resources Education and Research Center, Graduate School of Agricultural Science, Kobe University, Hyogo, Japan
| | - Takuma Yoshioka
- Food Resources Education and Research Center, Graduate School of Agricultural Science, Kobe University, Hyogo, Japan
| | | | | | - Hiroshi Nakagawa
- Research Center for Agricultural Information Technology, NARO, Ibaraki, Japan
| | | | - Hiroyoshi Iwata
- Department of Agricultural and Environmental Biology, Graduate School of Agricultural and Life Science, The University of Tokyo, Tokyo, Japan
| |
Collapse
|
102
|
Tsai HY, Cericola F, Edriss V, Andersen JR, Orabi J, Jensen JD, Jahoor A, Janss L, Jensen J. Use of multiple traits genomic prediction, genotype by environment interactions and spatial effect to improve prediction accuracy in yield data. PLoS One 2020; 15:e0232665. [PMID: 32401769 PMCID: PMC7219756 DOI: 10.1371/journal.pone.0232665] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2019] [Accepted: 04/20/2020] [Indexed: 11/24/2022] Open
Abstract
Genomic selection has been extensively implemented in plant breeding schemes. Genomic selection incorporates dense genome-wide markers to predict the breeding values for important traits based on information from genotype and phenotype records on traits of interest in a reference population. To date, most relevant investigations have been performed using single trait genomic prediction models (STGP). However, records for several traits at once are usually documented for breeding lines in commercial breeding programs. By incorporating benefits from genetic characterizations of correlated phenotypes, multiple trait genomic prediction (MTGP) may be a useful tool for improving prediction accuracy in genetic evaluations. The objective of this study was to test whether the use of MTGP and including proper modeling of spatial effects can improve the prediction accuracy of breeding values in commercial barley and wheat breeding lines. We genotyped 1,317 spring barley and 1,325 winter wheat lines from a commercial breeding program with the Illumina 9K barley and 15K wheat SNP-chip (respectively) and phenotyped them across multiple years and locations. Results showed that the MTGP approach increased correlations between future performance and estimated breeding value of yields by 7% in barley and by 57% in wheat relative to using the STGP approach for each trait individually. Analyses combining genomic data, pedigree information, and proper modeling of spatial effects further increased the prediction accuracy by 4% in barley and 3% in wheat relative to the model using genomic relationships only. The prediction accuracy for yield in wheat and barley yield trait breeding, were improved by combining MTGP and spatial effects in the model.
Collapse
Affiliation(s)
- Hsin-Yuan Tsai
- Center for Quantitative Genetics and Genomics, Aarhus University, Tjele, Denmark
- Department of Marine Biotechnology and Resources, National Sun Yat-Sen University, Kaohsiung, Taiwan
- * E-mail:
| | | | | | | | | | | | - Ahmed Jahoor
- Nordic Seed, Galten, Denmark
- Department of Plant Breeding, Swedish University of Agricultural Sciences, Alnarp, Sweden
| | - Luc Janss
- Center for Quantitative Genetics and Genomics, Aarhus University, Tjele, Denmark
| | - Just Jensen
- Center for Quantitative Genetics and Genomics, Aarhus University, Tjele, Denmark
| |
Collapse
|
103
|
Lozada DN, Ward BP, Carter AH. Gains through selection for grain yield in a winter wheat breeding program. PLoS One 2020; 15:e0221603. [PMID: 32343696 PMCID: PMC7188280 DOI: 10.1371/journal.pone.0221603] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2019] [Accepted: 03/26/2020] [Indexed: 11/19/2022] Open
Abstract
Increased genetic gain for complex traits in plant breeding programs can be achieved through different selection strategies. The objective of this study was to compare potential gains for grain yield in a winter wheat breeding program through estimating response to selection R values across several selection approaches including phenotypic (PS), marker-based (MS), genomic (GS), and a combination of PS and GS (PS+GS). Ten populations of Washington State University (WSU) winter wheat breeding lines including a diversity panel and F5 and double haploid lines evaluated from 2015 to 2019 growing seasons for grain yield in Lind and Pullman, WA, USA were used in the study. Selection was conducted by selecting the top 20% of lines based on observed yield (PS strategy), genomic estimated breeding values (GS), presence of yield "enhancing" alleles of the most significant single nucleotide polymorphism (SNP) markers identified from genome-wide association mapping (MS), and high observed yield and estimated breeding values (PS+GS). Overall, PS compared to other individual selection strategies (MS and GS) showed the highest mean response (R = 0.61) within the same environment. When combined with GS, a 23% improvement in R for yield was observed, indicating that gains could be improved by complementing traditional PS with GS within the same environment. Validating selection strategies in different environments resulted in low to negative R values indicating the effects of genotype-by-environment interactions for grain yield. MS was not successful in terms of R relative to the other selection approaches; using this strategy resulted in a significant (P < 0.05) decrease in response to selection compared with the other approaches. An integrated PS+GS approach could result in optimal genetic gain within the same environment, whereas a PS strategy might be a viable option for grain yield validated in different environments. Altogether, we demonstrated that gains through increased response to selection for yield could be achieved in the WSU winter wheat breeding program by implementing different selection strategies either exclusively or in combination.
Collapse
Affiliation(s)
- Dennis N. Lozada
- Crop and Soil Sciences Department, Washington State University, Pullman, WA, United States of America
| | - Brian P. Ward
- USDA-ARS Plant Science Research Unit, Raleigh, NC, United States of America
| | - Arron H. Carter
- Crop and Soil Sciences Department, Washington State University, Pullman, WA, United States of America
| |
Collapse
|
104
|
Prakapenka D, Wang C, Liang Z, Bian C, Tan C, Da Y. GVCHAP: A Computing Pipeline for Genomic Prediction and Variance Component Estimation Using Haplotypes and SNP Markers. Front Genet 2020; 11:282. [PMID: 32318093 PMCID: PMC7154123 DOI: 10.3389/fgene.2020.00282] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2019] [Accepted: 03/09/2020] [Indexed: 01/05/2023] Open
Abstract
Haplotype prediction models open many possibilities to improve the accuracy of genomic selection but require more data processing and computing time than single-SNP prediction models. To facilitate haplotype analysis for genomic prediction and estimation using structural and functional genomic information, we developed a computing pipeline to implement haplotype analysis with capabilities for preparation of input data for haplotype analysis, genomic prediction and estimation using GVCHAP, and analysis of GVCHAP results. Data preparation includes utility programs for haplotype imputing; defining haplotype blocks by a fixed number of SNPs, a fixed distance in base pairs per block, or user defined block lengths based on structural or functional genomic information or a mixture of both types of information; and defining haplotype genotypes within each haplotype block. GVCHAP is the main program for genomic prediction and estimation, calculates GREML (genomic restricted maximum likelihood) estimates of variance components and heritabilities, and calculates GBLUP (genomic best linear unbiased prediction) for additive and dominance values of single SNPs as well as additive values of haplotypes with reliability estimates for training and validation populations. A two-step strategy and a method of multi-node processing are implemented to remove the computing bottleneck due to the creation of genomic relationship matrices for large samples. The analysis of GVCHAP results includes calculation of observed prediction accuracies from validation studies and preparation of input files for graphical visualization of heritability estimates of haplotype blocks as well as estimates of SNP effects and heritabilities. The entire pipeline provides an efficient and versatile computing tool for identifying the most accurate haplotype model among many candidate haplotype models utilizing structural and functional genomic information for genomic selection.
Collapse
Affiliation(s)
- Dzianis Prakapenka
- Department of Animal Science, University of Minnesota, Saint Paul, MN, United States
| | - Chunkao Wang
- Department of Animal Science, University of Minnesota, Saint Paul, MN, United States
| | - Zuoxiang Liang
- Department of Animal Science, University of Minnesota, Saint Paul, MN, United States
| | - Cheng Bian
- Department of Animal Science, University of Minnesota, Saint Paul, MN, United States.,State Key Laboratory for Agrobiotechnology, China Agricultural University, Beijing, China
| | - Cheng Tan
- Department of Animal Science, University of Minnesota, Saint Paul, MN, United States.,National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, China
| | - Yang Da
- Department of Animal Science, University of Minnesota, Saint Paul, MN, United States
| |
Collapse
|
105
|
Yan J, Zou D, Li C, Zhang Z, Song S, Wang X. SR4R: An Integrative SNP Resource for Genomic Breeding and Population Research in Rice. GENOMICS, PROTEOMICS & BIOINFORMATICS 2020; 18:173-185. [PMID: 32619768 PMCID: PMC7646087 DOI: 10.1016/j.gpb.2020.03.002] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/05/2019] [Revised: 03/24/2020] [Accepted: 03/25/2020] [Indexed: 12/16/2022]
Abstract
The information commons for rice (IC4R) database is a collection of 18 million single nucleotide polymorphisms (SNPs) identified by resequencing of 5152 rice accessions. Although IC4R offers ultra-high density rice variation map, these raw SNPs are not readily usable for the public. To satisfy different research utilizations of SNPs for population genetics, evolutionary analysis, association studies, and genomic breeding in rice, raw genotypic data of these 18 million SNPs were processed by unified bioinformatics pipelines. The outcomes were used to develop a daughter database of IC4R - SnpReady for Rice (SR4R). SR4R presents four reference SNP panels, including 2,097,405 hapmapSNPs after data filtration and genotype imputation, 156,502 tagSNPs selected from linkage disequilibrium-based redundancy removal, 1180 fixedSNPs selected from genes exhibiting selective sweep signatures, and 38 barcodeSNPs selected from DNA fingerprinting simulation. SR4R thus offers a highly efficient rice variation map that combines reduced SNP redundancy with extensive data describing the genetic diversity of rice populations. In addition, SR4R provides rice researchers with a web interface that enables them to browse all four SNP panels, use online toolkits, as well as retrieve the original data and scripts for a variety of population genetics analyses on local computers. SR4R is freely available to academic users at http://sr4r.ic4r.org/.
Collapse
Affiliation(s)
- Jun Yan
- Department of Crop Genomics and Bioinformatics, College of Agronomy and Biotechnology, China Agricultural University, Beijing 100094, China
| | - Dong Zou
- China National Center for Bioinformation, Beijing 100101, China; National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100101, China
| | - Chen Li
- Rice Research Institute, Guangdong Academy of Agricultural Sciences, Guangzhou 510640, China
| | - Zhang Zhang
- China National Center for Bioinformation, Beijing 100101, China; National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100101, China
| | - Shuhui Song
- China National Center for Bioinformation, Beijing 100101, China; National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100101, China.
| | - Xiangfeng Wang
- Department of Crop Genomics and Bioinformatics, College of Agronomy and Biotechnology, China Agricultural University, Beijing 100094, China.
| |
Collapse
|
106
|
Srivastava RK, Singh RB, Pujarula VL, Bollam S, Pusuluri M, Chellapilla TS, Yadav RS, Gupta R. Genome-Wide Association Studies and Genomic Selection in Pearl Millet: Advances and Prospects. Front Genet 2020; 10:1389. [PMID: 32180790 PMCID: PMC7059752 DOI: 10.3389/fgene.2019.01389] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2019] [Accepted: 12/19/2019] [Indexed: 11/13/2022] Open
Abstract
Pearl millet is a climate-resilient, drought-tolerant crop capable of growing in marginal environments of arid and semi-arid regions globally. Pearl millet is a staple food for more than 90 million people living in poverty and can address the triple burden of malnutrition substantially. It remained a neglected crop until the turn of the 21st century, and much emphasis has been placed since then on the development of various genetic and genomic resources for whole-genome scan studies, such as the genome-wide association studies (GWAS) and genomic selection (GS). This was facilitated by the advent of sequencing-based genotyping, such as genotyping-by-sequencing (GBS), RAD-sequencing, and whole-genome re-sequencing (WGRS) in pearl millet. To carry out GWAS and GS, a world association mapping panel called the Pearl Millet inbred Germplasm Association Panel (PMiGAP) was developed at ICRISAT in partnership with Aberystwyth University. This panel consisted of germplasm lines, landraces, and breeding lines from 27 countries and was re-sequenced using the WGRS approach. It has a repository of circa 29 million genome-wide SNPs. PMiGAP has been used to map traits related to drought tolerance, grain Fe and Zn content, nitrogen use efficiency, components of endosperm starch, grain yield, etc. Genomic selection in pearl millet was jump-started recently by WGRS, RAD, and tGBS (tunable genotyping-by-sequencing) approaches for the PMiGAP and hybrid parental lines. Using multi-environment phenotyping of various training populations, initial attempts have been made to develop genomic selection models. This mini review discusses advances and prospects in GWAS and GS for pearl millet.
Collapse
Affiliation(s)
- Rakesh K Srivastava
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad, India
| | - Ram B Singh
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad, India
| | - Vijaya Lakshmi Pujarula
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad, India
| | - Srikanth Bollam
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad, India
| | - Madhu Pusuluri
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad, India
| | - Tara Satyavathi Chellapilla
- All India Coordinated Research Project on Pearl Millet (AICRP-PM), Indian Council of Agricultural Research (ICAR), Jodhpur, India
| | - Rattan S Yadav
- Institute of Biological, Environmental & Rural Sciences (IBERS), Aberystwyth University, Gogerddan, United Kingdom
| | - Rajeev Gupta
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad, India
| |
Collapse
|
107
|
Potential of Genome-Wide Association Studies and Genomic Selection to Improve Productivity and Quality of Commercial Timber Species in Tropical Rainforest, a Case Study of Shorea platyclados. FORESTS 2020. [DOI: 10.3390/f11020239] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
Shorea platyclados (Dark Red Meranti) is a commercially important timber tree species in Southeast Asia. However, its stocks have dramatically declined due, inter alia, to excessive logging, insufficient natural regeneration and a slow recovery rate. Thus, there is a need to promote enrichment planting and develop effective technique to support its rehabilitation and improve timber production through implementation of Genome-Wide Association Studies (GWAS) and Genomic Selection (GS). To assist such efforts, plant materials were collected from a half-sib progeny population in Sari Bumi Kusuma forest concession, Kalimantan, Indonesia. Using 5900 markers in sequences obtained from 356 individuals, we detected high linkage disequilibrium (LD) extending up to >145 kb, suggesting that associations between phenotypic traits and markers in LD can be more easily and feasibly detected with GWAS than with analysis of quantitative trait loci (QTLs). However, the detection power of GWAS seems low, since few single nucleotide polymorphisms linked to any focal traits were detected with a stringent false discovery rate, indicating that the species’ phenotypic traits are mostly under polygenic quantitative control. Furthermore, Machine Learning provided higher prediction accuracies than Bayesian methods. We also found that stem diameter, branch diameter ratio and wood density were more predictable than height, clear bole, branch angle and wood stiffness traits. Our study suggests that GS has potential for improving the productivity and quality of S. platyclados, and our genomic heritability estimates may improve the selection of traits to target in future breeding of this species.
Collapse
|
108
|
Xu Y, Liu X, Fu J, Wang H, Wang J, Huang C, Prasanna BM, Olsen MS, Wang G, Zhang A. Enhancing Genetic Gain through Genomic Selection: From Livestock to Plants. PLANT COMMUNICATIONS 2020; 1:100005. [PMID: 33404534 PMCID: PMC7747995 DOI: 10.1016/j.xplc.2019.100005] [Citation(s) in RCA: 88] [Impact Index Per Article: 22.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
Although long-term genetic gain has been achieved through increasing use of modern breeding methods and technologies, the rate of genetic gain needs to be accelerated to meet humanity's demand for agricultural products. In this regard, genomic selection (GS) has been considered most promising for genetic improvement of the complex traits controlled by many genes each with minor effects. Livestock scientists pioneered GS application largely due to livestock's significantly higher individual values and the greater reduction in generation interval that can be achieved in GS. Large-scale application of GS in plants can be achieved by refining field management to improve heritability estimation and prediction accuracy and developing optimum GS models with the consideration of genotype-by-environment interaction and non-additive effects, along with significant cost reduction. Moreover, it would be more effective to integrate GS with other breeding tools and platforms for accelerating the breeding process and thereby further enhancing genetic gain. In addition, establishing an open-source breeding network and developing transdisciplinary approaches would be essential in enhancing breeding efficiency for small- and medium-sized enterprises and agricultural research systems in developing countries. New strategies centered on GS for enhancing genetic gain need to be developed.
Collapse
Affiliation(s)
- Yunbi Xu
- Institute of Crop Science/CIMMYT-China, Chinese Academy of Agricultural Sciences, Beijing 100081, China
- CIMMYT-China Tropical Maize Research Center, Foshan University, Foshan 528231, China
- CIMMYT-China Specialty Maize Research Center, Shanghai Academy of Agricultural Sciences, Shanghai 201400, China
| | - Xiaogang Liu
- Institute of Crop Science/CIMMYT-China, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Junjie Fu
- Institute of Crop Science/CIMMYT-China, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Hongwu Wang
- Institute of Crop Science/CIMMYT-China, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Jiankang Wang
- Institute of Crop Science/CIMMYT-China, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Changling Huang
- Institute of Crop Science/CIMMYT-China, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Boddupalli M. Prasanna
- CIMMYT (International Maize and Wheat Improvement Center), ICRAF Campus, United Nations Avenue, Nairobi, Kenya
| | - Michael S. Olsen
- CIMMYT (International Maize and Wheat Improvement Center), ICRAF Campus, United Nations Avenue, Nairobi, Kenya
| | - Guoying Wang
- Institute of Crop Science/CIMMYT-China, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Aimin Zhang
- Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China
| |
Collapse
|
109
|
Cui Y, Li R, Li G, Zhang F, Zhu T, Zhang Q, Ali J, Li Z, Xu S. Hybrid breeding of rice via genomic selection. PLANT BIOTECHNOLOGY JOURNAL 2020; 18:57-67. [PMID: 31124256 PMCID: PMC6920338 DOI: 10.1111/pbi.13170] [Citation(s) in RCA: 54] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/26/2018] [Revised: 04/27/2019] [Accepted: 05/12/2019] [Indexed: 05/04/2023]
Abstract
Hybrid breeding is the main strategy for improving productivity in many crops, especially in rice and maize. Genomic hybrid breeding is a technology that uses whole-genome markers to predict future hybrids. Predicted superior hybrids are then field evaluated and released as new hybrid cultivars after their superior performances are confirmed. This will increase the opportunity of selecting true superior hybrids with minimum costs. Here, we used genomic best linear unbiased prediction to perform hybrid performance prediction using an existing rice population of 1495 hybrids. Replicated 10-fold cross-validations showed that the prediction abilities on ten agronomic traits ranged from 0.35 to 0.92. Using the 1495 rice hybrids as a training sample, we predicted six agronomic traits of 100 hybrids derived from half diallel crosses involving 21 parents that are different from the parents of the hybrids in the training sample. The prediction abilities were relatively high, varying from 0.54 (yield) to 0.92 (grain length). We concluded that the current population of 1495 hybrids can be used to predict hybrids from seemingly unrelated parents. Eventually, we used this training population to predict all potential hybrids of cytoplasm male sterile lines from 3000 rice varieties from the 3K Rice Genome Project. Using a breeding index combining 10 traits, we identified the top and bottom 200 predicted hybrids. SNP genotypes of the training population and parameters estimated from this training population are available for general uses and further validation in genomic hybrid prediction of all potential hybrids generated from all varieties of rice.
Collapse
Affiliation(s)
- Yanru Cui
- Hebei Agricultural UniversityBaodingChina
- Department of Botany and Plant SciencesUniversity of CaliforniaRiversideCAUSA
| | - Ruidong Li
- Department of Botany and Plant SciencesUniversity of CaliforniaRiversideCAUSA
| | - Guangwei Li
- National Key Laboratory of Crop Genetic Improvement and National Centre of Plant Gene Research (Wuhan)Huazhong Agricultural UniversityWuhanChina
| | - Fan Zhang
- Institute of Crop Science/National Key Facility for Crop Gene Resource and Genetic ImprovementChinese Academy of Agricultural SciencesBeijingChina
| | - Tiantian Zhu
- Department of Botany and Plant SciencesUniversity of CaliforniaRiversideCAUSA
| | - Qifa Zhang
- National Key Laboratory of Crop Genetic Improvement and National Centre of Plant Gene Research (Wuhan)Huazhong Agricultural UniversityWuhanChina
| | - Jauhar Ali
- International Rice Research InstituteMetro ManilaPhilippines
| | - Zhikang Li
- Institute of Crop Science/National Key Facility for Crop Gene Resource and Genetic ImprovementChinese Academy of Agricultural SciencesBeijingChina
- Anhui Agricultural UniversityHefeiChina
| | - Shizhong Xu
- Department of Botany and Plant SciencesUniversity of CaliforniaRiversideCAUSA
| |
Collapse
|
110
|
Azodi CB, Pardo J, VanBuren R, de Los Campos G, Shiu SH. Transcriptome-Based Prediction of Complex Traits in Maize. THE PLANT CELL 2020; 32:139-151. [PMID: 31641024 PMCID: PMC6961623 DOI: 10.1105/tpc.19.00332] [Citation(s) in RCA: 47] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/06/2019] [Revised: 09/24/2019] [Accepted: 10/21/2019] [Indexed: 05/11/2023]
Abstract
The ability to predict traits from genome-wide sequence information (i.e., genomic prediction) has improved our understanding of the genetic basis of complex traits and transformed breeding practices. Transcriptome data may also be useful for genomic prediction. However, it remains unclear how well transcript levels can predict traits, particularly when traits are scored at different development stages. Using maize (Zea mays) genetic markers and transcript levels from seedlings to predict mature plant traits, we found that transcript and genetic marker models have similar performance. When the transcripts and genetic markers with the greatest weights (i.e., the most important) in those models were used in one joint model, performance increased. Furthermore, genetic markers important for predictions were not close to or identified as regulatory variants for important transcripts. These findings demonstrate that transcript levels are useful for predicting traits and that their predictive power is not simply due to genetic variation in the transcribed genomic regions. Finally, genetic marker models identified only 1 of 14 benchmark flowering-time genes, while transcript models identified 5. These data highlight that, in addition to being useful for genomic prediction, transcriptome data can provide a link between traits and variation that cannot be readily captured at the sequence level.
Collapse
Affiliation(s)
- Christina B Azodi
- Department of Plant Biology, Michigan State University, East Lansing, Michigan 48824
- The DOE Great Lakes Bioenergy Research Center, Michigan State University, East Lansing, Michigan, 48824
| | - Jeremy Pardo
- Department of Plant Biology, Michigan State University, East Lansing, Michigan 48824
- Plant Resilience Institute, Michigan State University, East Lansing, Michigan 48824
| | - Robert VanBuren
- Plant Resilience Institute, Michigan State University, East Lansing, Michigan 48824
- Department of Horticulture, Michigan State University, East Lansing, Michigan 48824
| | - Gustavo de Los Campos
- Epidemiology and Biostatistics and Statistics and Probability Departments, Michigan State University, East Lansing, Michigan 48824
| | - Shin-Han Shiu
- Department of Plant Biology, Michigan State University, East Lansing, Michigan 48824
- The DOE Great Lakes Bioenergy Research Center, Michigan State University, East Lansing, Michigan, 48824
- Department of Computational Mathematics, Science, and Engineering, Michigan State University, East Lansing, Michigan 48824
| |
Collapse
|
111
|
Kumar A, Kumar R, Sengupta D, Das SN, Pandey MK, Bohra A, Sharma NK, Sinha P, Sk H, Ghazi IA, Laha GS, Sundaram RM. Deployment of Genetic and Genomic Tools Toward Gaining a Better Understanding of Rice- Xanthomonas oryzae pv. oryzae Interactions for Development of Durable Bacterial Blight Resistant Rice. FRONTIERS IN PLANT SCIENCE 2020; 11:1152. [PMID: 32849710 PMCID: PMC7417518 DOI: 10.3389/fpls.2020.01152] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/26/2020] [Accepted: 07/15/2020] [Indexed: 05/05/2023]
Abstract
Rice is the most important food crop worldwide and sustainable rice production is important for ensuring global food security. Biotic stresses limit rice production significantly and among them, bacterial blight (BB) disease caused by Xanthomonas oryzae pv. oryzae (Xoo) is very important. BB reduces rice yields severely in the highly productive irrigated and rainfed lowland ecosystems and in recent years; the disease is spreading fast to other rice growing ecosystems as well. Being a vascular pathogen, Xoo interferes with a range of physiological and biochemical exchange processes in rice. The response of rice to Xoo involves specific interactions between resistance (R) genes of rice and avirulence (Avr) genes of Xoo, covering most of the resistance genes except the recessive ones. The genetic basis of resistance to BB in rice has been studied intensively, and at least 44 genes conferring resistance to BB have been identified, and many resistant rice cultivars and hybrids have been developed and released worldwide. However, the existence and emergence of new virulent isolates of Xoo in the realm of a rapidly changing climate necessitates identification of novel broad-spectrum resistance genes and intensification of gene-deployment strategies. This review discusses about the origin and occurrence of BB in rice, interactions between Xoo and rice, the important roles of resistance genes in plant's defense response, the contribution of rice resistance genes toward development of disease resistance varieties, identification and characterization of novel, and broad-spectrum BB resistance genes from wild species of Oryza and also presents a perspective on potential strategies to achieve the goal of sustainable disease management.
Collapse
Affiliation(s)
- Anirudh Kumar
- Department of Botany, Indira Gandhi National Tribal University (IGNTU), Amarkantak, India
- *Correspondence: Raman Meenakshi Sundaram, ; Anirudh Kumar,
| | - Rakesh Kumar
- Department of Life Science, Central University of Karnataka, Kalaburagi, India
| | - Debashree Sengupta
- Department of Plant Sciences, School of Life Sciences, University of Hyderabad (UoH), Hyderabad, India
| | - Subha Narayan Das
- Department of Botany, Indira Gandhi National Tribal University (IGNTU), Amarkantak, India
| | - Manish K. Pandey
- Department of Biotechnology, ICAR-Indian Institute of Rice Research (IIRR), Hyderabad, India
| | - Abhishek Bohra
- ICAR-Crop Improvement Division, Indian Institute of Pulses Research (IIPR), Kanpur, India
| | - Naveen K. Sharma
- Department of Botany, Indira Gandhi National Tribal University (IGNTU), Amarkantak, India
| | - Pragya Sinha
- Department of Biotechnology, ICAR-Indian Institute of Rice Research (IIRR), Hyderabad, India
| | - Hajira Sk
- Department of Biotechnology, ICAR-Indian Institute of Rice Research (IIRR), Hyderabad, India
| | - Irfan Ahmad Ghazi
- Department of Plant Sciences, School of Life Sciences, University of Hyderabad (UoH), Hyderabad, India
| | - Gouri Sankar Laha
- Department of Biotechnology, ICAR-Indian Institute of Rice Research (IIRR), Hyderabad, India
| | - Raman Meenakshi Sundaram
- Department of Biotechnology, ICAR-Indian Institute of Rice Research (IIRR), Hyderabad, India
- *Correspondence: Raman Meenakshi Sundaram, ; Anirudh Kumar,
| |
Collapse
|
112
|
Liu Y, Wang D, He F, Wang J, Joshi T, Xu D. Phenotype Prediction and Genome-Wide Association Study Using Deep Convolutional Neural Network of Soybean. Front Genet 2019; 10:1091. [PMID: 31824557 PMCID: PMC6883005 DOI: 10.3389/fgene.2019.01091] [Citation(s) in RCA: 54] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2019] [Accepted: 10/09/2019] [Indexed: 12/21/2022] Open
Abstract
Genomic selection uses single-nucleotide polymorphisms (SNPs) to predict quantitative phenotypes for enhancing traits in breeding populations and has been widely used to increase breeding efficiency for plants and animals. Existing statistical methods rely on a prior distribution assumption of imputed genotype effects, which may not fit experimental datasets. Emerging deep learning technology could serve as a powerful machine learning tool to predict quantitative phenotypes without imputation and also to discover potential associated genotype markers efficiently. We propose a deep-learning framework using convolutional neural networks (CNNs) to predict the quantitative traits from SNPs and also to investigate genotype contributions to the trait using saliency maps. The missing values of SNPs are treated as a new genotype for the input of the deep learning model. We tested our framework on both simulation data and experimental datasets of soybean. The results show that the deep learning model can bypass the imputation of missing values and achieve more accurate results for predicting quantitative phenotypes than currently available other well-known statistical methods. It can also effectively and efficiently identify significant markers of SNPs and SNP combinations associated in genome-wide association study.
Collapse
Affiliation(s)
- Yang Liu
- Institute of Data Science and Informatics, University of Missouri, Columbia, MO, United States.,Department of Electrical Engineer and Computer Science, University of Missouri, Columbia, MO, United States
| | - Duolin Wang
- Department of Electrical Engineer and Computer Science, University of Missouri, Columbia, MO, United States.,Christopher S. Bond Life Science Center, University of Missouri, Columbia, MO, United States
| | - Fei He
- Christopher S. Bond Life Science Center, University of Missouri, Columbia, MO, United States.,Department of Computer Science and Information Technology, Northeast Normal University, Changchun, China
| | - Juexin Wang
- Department of Electrical Engineer and Computer Science, University of Missouri, Columbia, MO, United States.,Christopher S. Bond Life Science Center, University of Missouri, Columbia, MO, United States
| | - Trupti Joshi
- Institute of Data Science and Informatics, University of Missouri, Columbia, MO, United States.,Christopher S. Bond Life Science Center, University of Missouri, Columbia, MO, United States.,Department of Health Management and Informatics, School of Medicine, University of Missouri, Columbia, MO, United States
| | - Dong Xu
- Institute of Data Science and Informatics, University of Missouri, Columbia, MO, United States.,Department of Electrical Engineer and Computer Science, University of Missouri, Columbia, MO, United States.,Christopher S. Bond Life Science Center, University of Missouri, Columbia, MO, United States
| |
Collapse
|
113
|
Qin J, Shi A, Song Q, Li S, Wang F, Cao Y, Ravelombola W, Song Q, Yang C, Zhang M. Genome Wide Association Study and Genomic Selection of Amino Acid Concentrations in Soybean Seeds. FRONTIERS IN PLANT SCIENCE 2019; 10:1445. [PMID: 31803203 PMCID: PMC6873630 DOI: 10.3389/fpls.2019.01445] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/30/2019] [Accepted: 10/17/2019] [Indexed: 05/15/2023]
Abstract
Soybean is a major source of protein for human consumption and animal feed. Releasing new cultivars with high nutritional value is one of the major goals in soybean breeding. To achieve this goal, genome-wide association studies of seed amino acid contents were conducted based on 249 soybean accessions from China, US, Japan, and South Korea. The accessions were evaluated for 15 amino acids and genotyped by sequencing. Significant genetic variation was observed for amino acids among the accessions. Among the 231 single nucleotide polymorphisms (SNPs) significantly associated with variations in amino acid contents, fifteen SNPs localized near 14 candidate genes involving in amino acid metabolism. The amino acids were classified into two groups with five in one group and seven amino acids in the other. Correlation coefficients among the amino acids within each group were high and positive, but the correlation coefficients of amino acids between the two groups were negative. Twenty-five SNP markers associated with multiple amino acids can be used to simultaneously improve multi-amino acid concentration in soybean. Genomic selection analysis of amino acid concentration showed that selection efficiency of amino acids based on the markers significantly associated with all 15 amino acids was higher than that based on random markers or markers only associated with individual amino acid. The identified markers could facilitate selection of soybean varieties with improved seed quality.
Collapse
Affiliation(s)
- Jun Qin
- National Soybean Improvement Center Shijiazhuang Sub-Center, North China Key Laboratory of Biology and Genetic Improvement of Soybean, Ministry of Agriculture, Laboratory of Crop Genetics and Breeding of Hebei, Cereal & Oil Crop Institute, Hebei Academy of Agricultural and Forestry Sciences, Shijiazhuang, China
- Department of Horticulture, University of Arkansas, Fayetteville, AR, United States
| | - Ainong Shi
- Department of Horticulture, University of Arkansas, Fayetteville, AR, United States
| | - Qijian Song
- Soybean Genomics and Improvement Lab, USDA-ARS, Beltsville, MD, United States
| | - Song Li
- Crop and Soil Environmental Science, Virginia Tech, Blacksburg, VA, United States
| | - Fengmin Wang
- National Soybean Improvement Center Shijiazhuang Sub-Center, North China Key Laboratory of Biology and Genetic Improvement of Soybean, Ministry of Agriculture, Laboratory of Crop Genetics and Breeding of Hebei, Cereal & Oil Crop Institute, Hebei Academy of Agricultural and Forestry Sciences, Shijiazhuang, China
| | - Yinghao Cao
- Bioinformatics Center, Allife Medical Science and Technology Co., Ltd, Beijing, China
| | - Waltram Ravelombola
- Department of Horticulture, University of Arkansas, Fayetteville, AR, United States
| | - Qi Song
- Crop and Soil Environmental Science, Virginia Tech, Blacksburg, VA, United States
| | - Chunyan Yang
- National Soybean Improvement Center Shijiazhuang Sub-Center, North China Key Laboratory of Biology and Genetic Improvement of Soybean, Ministry of Agriculture, Laboratory of Crop Genetics and Breeding of Hebei, Cereal & Oil Crop Institute, Hebei Academy of Agricultural and Forestry Sciences, Shijiazhuang, China
| | - Mengchen Zhang
- National Soybean Improvement Center Shijiazhuang Sub-Center, North China Key Laboratory of Biology and Genetic Improvement of Soybean, Ministry of Agriculture, Laboratory of Crop Genetics and Breeding of Hebei, Cereal & Oil Crop Institute, Hebei Academy of Agricultural and Forestry Sciences, Shijiazhuang, China
| |
Collapse
|
114
|
Rana N, Rahim MS, Kaur G, Bansal R, Kumawat S, Roy J, Deshmukh R, Sonah H, Sharma TR. Applications and challenges for efficient exploration of omics interventions for the enhancement of nutritional quality in rice (Oryza sativa L.). Crit Rev Food Sci Nutr 2019; 60:3304-3320. [DOI: 10.1080/10408398.2019.1685454] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Affiliation(s)
- Nitika Rana
- National Agri-Food Biotechnology Institute (NABI), Mohali, Punjab, India
| | | | - Gazaldeep Kaur
- National Agri-Food Biotechnology Institute (NABI), Mohali, Punjab, India
| | - Ruchi Bansal
- National Agri-Food Biotechnology Institute (NABI), Mohali, Punjab, India
| | - Surbhi Kumawat
- National Agri-Food Biotechnology Institute (NABI), Mohali, Punjab, India
| | - Joy Roy
- National Agri-Food Biotechnology Institute (NABI), Mohali, Punjab, India
| | - Rupesh Deshmukh
- National Agri-Food Biotechnology Institute (NABI), Mohali, Punjab, India
| | - Humira Sonah
- National Agri-Food Biotechnology Institute (NABI), Mohali, Punjab, India
| | - Tilak Raj Sharma
- National Agri-Food Biotechnology Institute (NABI), Mohali, Punjab, India
| |
Collapse
|
115
|
Azodi CB, Bolger E, McCarren A, Roantree M, de Los Campos G, Shiu SH. Benchmarking Parametric and Machine Learning Models for Genomic Prediction of Complex Traits. G3 (BETHESDA, MD.) 2019; 9:3691-3702. [PMID: 31533955 PMCID: PMC6829122 DOI: 10.1534/g3.119.400498] [Citation(s) in RCA: 75] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/03/2019] [Accepted: 09/09/2019] [Indexed: 12/21/2022]
Abstract
The usefulness of genomic prediction in crop and livestock breeding programs has prompted efforts to develop new and improved genomic prediction algorithms, such as artificial neural networks and gradient tree boosting. However, the performance of these algorithms has not been compared in a systematic manner using a wide range of datasets and models. Using data of 18 traits across six plant species with different marker densities and training population sizes, we compared the performance of six linear and six non-linear algorithms. First, we found that hyperparameter selection was necessary for all non-linear algorithms and that feature selection prior to model training was critical for artificial neural networks when the markers greatly outnumbered the number of training lines. Across all species and trait combinations, no one algorithm performed best, however predictions based on a combination of results from multiple algorithms (i.e., ensemble predictions) performed consistently well. While linear and non-linear algorithms performed best for a similar number of traits, the performance of non-linear algorithms vary more between traits. Although artificial neural networks did not perform best for any trait, we identified strategies (i.e., feature selection, seeded starting weights) that boosted their performance to near the level of other algorithms. Our results highlight the importance of algorithm selection for the prediction of trait values.
Collapse
Affiliation(s)
| | - Emily Bolger
- Department of Mathematics, Moravian College, Bethlehem, PA
| | - Andrew McCarren
- Insight Centre for Data Analytics, School of Computing, Dublin City University, Dublin 9, Ireland
| | - Mark Roantree
- Insight Centre for Data Analytics, School of Computing, Dublin City University, Dublin 9, Ireland
| | - Gustavo de Los Campos
- Department of Epidemiology & Biostatistics
- Department of Statistics & Probability
- Institute for Quantitative Health Science and Engineering, and
| | - Shin-Han Shiu
- Department of Plant Biology
- Department of Computational, Mathematics, Science, and Engineering, Michigan State University, East Lansing, MI, 48824
| |
Collapse
|
116
|
Lozada DN, Mason RE, Sarinelli JM, Brown-Guedira G. Accuracy of genomic selection for grain yield and agronomic traits in soft red winter wheat. BMC Genet 2019; 20:82. [PMID: 31675927 PMCID: PMC6823964 DOI: 10.1186/s12863-019-0785-1] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2018] [Accepted: 10/18/2019] [Indexed: 12/23/2022] Open
Abstract
BACKGROUND Genomic selection has the potential to increase genetic gains by using molecular markers as predictors of breeding values of individuals. This study evaluated the accuracy of predictions for grain yield, heading date, plant height, and yield components in soft red winter wheat under different prediction scenarios. Response to selection for grain yield was also compared across different selection strategies- phenotypic, marker-based, genomic, combination of phenotypic and genomic, and random selections. RESULTS Genomic selection was implemented through a ridge regression best linear unbiased prediction model in two scenarios- cross-validations and independent predictions. Accuracy for cross-validations was assessed using a diverse panel under different marker number, training population size, relatedness between training and validation populations, and inclusion of fixed effect in the model. The population in the first scenario was then trained and used to predict grain yield of biparental populations for independent validations. Using subsets of significant markers from association mapping increased accuracy by 64-70% for grain yield but resulted in lower accuracy for traits with high heritability such as plant height. Increasing size of training population resulted in an increase in accuracy, with maximum values reached when ~ 60% of the lines were used as a training panel. Predictions using related subpopulations also resulted in higher accuracies. Inclusion of major growth habit genes as fixed effect in the model caused increase in grain yield accuracy under a cross-validation procedure. Independent predictions resulted in accuracy ranging between - 0.14 and 0.43, dependent on the grouping of site-year data for the training and validation populations. Genomic selection was "superior" to marker-based selection in terms of response to selection for yield. Supplementing phenotypic with genomic selection resulted in approximately 10% gain in response compared to using phenotypic selection alone. CONCLUSIONS Our results showed the effects of different factors on accuracy for yield and agronomic traits. Among the factors studied, training population size and relatedness between training and validation population had the greatest impact on accuracy. Ultimately, combining phenotypic with genomic selection would be relevant for accelerating genetic gains for yield in winter wheat.
Collapse
Affiliation(s)
- Dennis N Lozada
- Crop, Soil and Environmental Sciences Department, University of Arkansas, Fayetteville, AR, 72701, USA.
- Present Address: Department of Crop and Soil Sciences, Washington State University, Pullman, WA, 99164, USA.
| | - R Esten Mason
- Crop, Soil and Environmental Sciences Department, University of Arkansas, Fayetteville, AR, 72701, USA
| | - Jose Martin Sarinelli
- GDM Seeds Inc, Marion, AR, 72364, USA
- Department of Crop and Soil Sciences, North Carolina State University, Raleigh, NC, 27607, USA
| | - Gina Brown-Guedira
- USDA-ARS Plant Science Research and Department of Crop and Soil Sciences, North Carolina State University, Raleigh, NC, 27607, USA
| |
Collapse
|
117
|
Grinberg NF, Orhobor OI, King RD. An evaluation of machine-learning for predicting phenotype: studies in yeast, rice, and wheat. Mach Learn 2019; 109:251-277. [PMID: 32174648 PMCID: PMC7048706 DOI: 10.1007/s10994-019-05848-5] [Citation(s) in RCA: 47] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2015] [Revised: 09/17/2019] [Accepted: 09/19/2019] [Indexed: 11/01/2022]
Abstract
In phenotype prediction the physical characteristics of an organism are predicted from knowledge of its genotype and environment. Such studies, often called genome-wide association studies, are of the highest societal importance, as they are of central importance to medicine, crop-breeding, etc. We investigated three phenotype prediction problems: one simple and clean (yeast), and the other two complex and real-world (rice and wheat). We compared standard machine learning methods; elastic net, ridge regression, lasso regression, random forest, gradient boosting machines (GBM), and support vector machines (SVM), with two state-of-the-art classical statistical genetics methods; genomic BLUP and a two-step sequential method based on linear regression. Additionally, using the clean yeast data, we investigated how performance varied with the complexity of the biological mechanism, the amount of observational noise, the number of examples, the amount of missing data, and the use of different data representations. We found that for almost all the phenotypes considered, standard machine learning methods outperformed the methods from classical statistical genetics. On the yeast problem, the most successful method was GBM, followed by lasso regression, and the two statistical genetics methods; with greater mechanistic complexity GBM was best, while in simpler cases lasso was superior. In the wheat and rice studies the best two methods were SVM and BLUP. The most robust method in the presence of noise, missing data, etc. was random forests. The classical statistical genetics method of genomic BLUP was found to perform well on problems where there was population structure. This suggests that standard machine learning methods need to be refined to include population structure information when this is present. We conclude that the application of machine learning methods to phenotype prediction problems holds great promise, but that determining which methods is likely to perform well on any given problem is elusive and non-trivial.
Collapse
Affiliation(s)
- Nastasiya F. Grinberg
- School of Computer Science, University of Manchester, Oxford Road, Manchester, M13 9PL UK
- Present Address: Department of Medicine, Cambridge Institute of Therapeutic Immunology & Infectious Disease, Jeffrey Cheah Biomedical Centre, Cambridge Biomedical Campus, University of Cambridge, Cambridge, CB2 0AW UK
| | | | - Ross D. King
- Department of Biology and Biological Engineering, Division of Systems and Synthetic Biology, Chalmers University of Technology, Kemivägen 10, SE-412 96 Gothenburg, Sweden
| |
Collapse
|
118
|
Evaluation of genomic selection methods for predicting fiber quality traits in Upland cotton. Mol Genet Genomics 2019; 295:67-79. [DOI: 10.1007/s00438-019-01599-z] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2019] [Accepted: 07/29/2019] [Indexed: 01/25/2023]
|
119
|
Bajgain P, Zhang X, Anderson JA. Genome-Wide Association Study of Yield Component Traits in Intermediate Wheatgrass and Implications in Genomic Selection and Breeding. G3 (BETHESDA, MD.) 2019; 9:2429-2439. [PMID: 31147390 PMCID: PMC6686922 DOI: 10.1534/g3.119.400073] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/13/2019] [Accepted: 05/23/2019] [Indexed: 11/18/2022]
Abstract
Intermediate wheatgrass (Thinopyrum intermedium, IWG) is a perennial grain crop with high biomass and grain yield, long seeds, and resistance to pests and diseases. It also reduces soil erosion, nitrate and mineral leaching into underground water tables, and sequesters carbon in its roots. The domestication timeline of IWG as a grain crop spans only 3 decades, hence it lags annual grain crops in yield and seed characteristics. One approach to improve its agronomic traits is by using molecular markers to uncover marker-trait associations. In this study, we performed association mapping on IWG breeding germplasm from the third recurrent selection cycle at the University of Minnesota. The IWG population was phenotyped in St Paul, MN in 2017 and 2018, and in Crookston, MN in 2018 for grain yield, seed length, width and weight, spike length and weight, and number of spikelets per spike. Strong positive correlations were observed among most trait pairs, with correlations as high as 0.76. Genotyping using high throughput sequencing identified 8,899 high-quality genome-wide SNPs which were combined with phenotypic data in association mapping to discover regions associated with the yield component traits. We detected 154 genetic loci associated with these traits of which 19 were shared between at least two traits. Prediction of breeding values using significant loci as fixed effects in genomic selection model improved predictive abilities by up to 14%. Genetic mapping of agronomic traits followed by using genomic selection to predict breeding values can assist breeders in selecting superior genotypes to accelerate IWG domestication.
Collapse
Affiliation(s)
- Prabin Bajgain
- Department of Agronomy & Plant Genetics, University of Minnesota, St. Paul, MN and
| | - Xiaofei Zhang
- Department of Horticultural Science, North Carolina State University, Raleigh, NC
| | - James A Anderson
- Department of Agronomy & Plant Genetics, University of Minnesota, St. Paul, MN and
| |
Collapse
|
120
|
de C Lara LA, Santos MF, Jank L, Chiari L, Vilela MDM, Amadeu RR, Dos Santos JPR, Pereira GDS, Zeng ZB, Garcia AAF. Genomic Selection with Allele Dosage in Panicum maximum Jacq. G3 (BETHESDA, MD.) 2019; 9:2463-2475. [PMID: 31171567 PMCID: PMC6686918 DOI: 10.1534/g3.118.200986] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/19/2018] [Accepted: 05/23/2019] [Indexed: 12/21/2022]
Abstract
Genomic selection is an efficient approach to get shorter breeding cycles in recurrent selection programs and greater genetic gains with selection of superior individuals. Despite advances in genotyping techniques, genetic studies for polyploid species have been limited to a rough approximation of studies in diploid species. The major challenge is to distinguish the different types of heterozygotes present in polyploid populations. In this work, we evaluated different genomic prediction models applied to a recurrent selection population of 530 genotypes of Panicum maximum, an autotetraploid forage grass. We also investigated the effect of the allele dosage in the prediction, i.e., considering tetraploid (GS-TD) or diploid (GS-DD) allele dosage. A longitudinal linear mixed model was fitted for each one of the six phenotypic traits, considering different covariance matrices for genetic and residual effects. A total of 41,424 genotyping-by-sequencing markers were obtained using 96-plex and Pst1 restriction enzyme, and quantitative genotype calling was performed. Six predictive models were generalized to tetraploid species and predictive ability was estimated by a replicated fivefold cross-validation process. GS-TD and GS-DD models were performed considering 1,223 informative markers. Overall, GS-TD data yielded higher predictive abilities than with GS-DD data. However, different predictive models had similar predictive ability performance. In this work, we provide bioinformatic and modeling guidelines to consider tetraploid dosage and observed that genomic selection may lead to additional gains in recurrent selection program of P. maximum.
Collapse
Affiliation(s)
- Letícia A de C Lara
- Luiz de Queiroz College of Agriculture / University of São Paulo (ESALQ/USP), Piracicaba, SP, Brazil
| | | | - Liana Jank
- Embrapa Beef Cattle, Campo Grande, MS, Brazil, and
| | | | | | - Rodrigo R Amadeu
- Luiz de Queiroz College of Agriculture / University of São Paulo (ESALQ/USP), Piracicaba, SP, Brazil
| | - Jhonathan P R Dos Santos
- Luiz de Queiroz College of Agriculture / University of São Paulo (ESALQ/USP), Piracicaba, SP, Brazil
| | | | | | - Antonio Augusto F Garcia
- Luiz de Queiroz College of Agriculture / University of São Paulo (ESALQ/USP), Piracicaba, SP, Brazil
| |
Collapse
|
121
|
Arbelaez JD, Dwiyanti MS, Tandayu E, Llantada K, Jarana A, Ignacio JC, Platten JD, Cobb J, Rutkoski JE, Thomson MJ, Kretzschmar T. 1k-RiCA (1K-Rice Custom Amplicon) a novel genotyping amplicon-based SNP assay for genetics and breeding applications in rice. RICE (NEW YORK, N.Y.) 2019; 12:55. [PMID: 31350673 PMCID: PMC6660535 DOI: 10.1186/s12284-019-0311-0] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/26/2019] [Accepted: 07/02/2019] [Indexed: 05/04/2023]
Abstract
BACKGROUND While a multitude of genotyping platforms have been developed for rice, the majority of them have not been optimized for breeding where cost, turnaround time, throughput and ease of use, relative to density and informativeness are critical parameters of their utility. With that in mind we report the development of the 1K-Rice Custom Amplicon, or 1k-RiCA, a robust custom sequencing-based amplicon panel of ~ 1000-SNPs that are uniformly distributed across the rice genome, designed to be highly informative within indica rice breeding pools, and tailored for genomic prediction in elite indica rice breeding programs. RESULTS Empirical validation tests performed on the 1k-RiCA showed average marker call rates of 95% with marker repeatability and concordance rates of 99%. These technical properties were not affected when two common DNA extraction protocols were used. The average distance between SNPs in the 1k-RiCA was 1.5 cM, similar to the theoretical distance which would be expected between 1,000 uniformly distributed markers across the rice genome. The average minor allele frequencies on a panel of indica lines was 0.36 and polymorphic SNPs estimated on pairwise comparisons between indica by indica accessions and indica by japonica accessions were on average 430 and 450 respectively. The specific design parameters of the 1k-RiCA allow for a detailed view of genetic relationships and unambiguous molecular IDs within indica accessions and good cost vs. marker-density balance for genomic prediction applications in elite indica germplasm. Predictive abilities of Genomic Selection models for flowering time, grain yield, and plant height were on average 0.71, 0.36, and 0.65 respectively based on cross-validation analysis. Furthermore the inclusion of important trait markers associated with 11 different genes and QTL adds value to parental selection in crossing schemes and marker-assisted selection in forward breeding applications. CONCLUSIONS This study validated the marker quality and robustness of the 1k-RiCA genotypic platform for genotyping populations derived from indica rice subpopulation for genetic and breeding purposes including MAS and genomic selection. The 1k-RiCA has proven to be an alternative cost-effective genotyping system for breeding applications.
Collapse
Affiliation(s)
- Juan David Arbelaez
- International Rice Research Institute, DAPO Box 7777, 1301 Los Baños, Metro Manila Philippines
| | | | - Erwin Tandayu
- International Rice Research Institute, DAPO Box 7777, 1301 Los Baños, Metro Manila Philippines
| | - Krizzel Llantada
- International Rice Research Institute, DAPO Box 7777, 1301 Los Baños, Metro Manila Philippines
| | - Annalhea Jarana
- International Rice Research Institute, DAPO Box 7777, 1301 Los Baños, Metro Manila Philippines
| | - John Carlos Ignacio
- International Rice Research Institute, DAPO Box 7777, 1301 Los Baños, Metro Manila Philippines
| | - John Damien Platten
- International Rice Research Institute, DAPO Box 7777, 1301 Los Baños, Metro Manila Philippines
| | - Joshua Cobb
- International Rice Research Institute, DAPO Box 7777, 1301 Los Baños, Metro Manila Philippines
| | - Jessica Elaine Rutkoski
- International Rice Research Institute, DAPO Box 7777, 1301 Los Baños, Metro Manila Philippines
| | - Michael J. Thomson
- Department of Soil and Crop Sciences, Texas A&M University, College Station, Houston, TX 77843 USA
| | - Tobias Kretzschmar
- Southern Cross Plant Sciences, Southern Cross University, PO Box 157, Lismore, NSW 2480 Australia
| |
Collapse
|
122
|
Dalal A, Bourstein R, Haish N, Shenhar I, Wallach R, Moshelion M. Dynamic Physiological Phenotyping of Drought-Stressed Pepper Plants Treated With "Productivity-Enhancing" and "Survivability-Enhancing" Biostimulants. FRONTIERS IN PLANT SCIENCE 2019; 10:905. [PMID: 31379898 PMCID: PMC6654182 DOI: 10.3389/fpls.2019.00905] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/20/2018] [Accepted: 06/26/2019] [Indexed: 05/02/2023]
Abstract
The improvement of crop productivity under abiotic stress is one of the biggest challenges faced by the agricultural scientific community. Despite extensive research, the research-to-commercial transfer rate of abiotic stress-resistant crops remains very low. This is mainly due to the complexity of genotype × environment interactions and in particular, the ability to quantify the dynamic plant physiological response profile to a dynamic environment. Most existing phenotyping facilities collect information using robotics and automated image acquisition and analysis. However, their ability to directly measure the physiological properties of the whole plant is limited. We demonstrate a high-throughput functional phenotyping system (HFPS) that enables comparing plants' dynamic responses to different ambient conditions in dynamic environments due to its direct and simultaneous measurement of yield-related physiological traits of plants under several treatments. The system is designed as one-to-one (1:1) plant-[sensors+controller] units, i.e., each individual plant has its own personalized sensor, controller and irrigation valves that enable (i) monitoring water-relation kinetics of each plant-environment response throughout the plant's life cycle with high spatiotemporal resolution, (ii) a truly randomized experimental design due to multiple independent treatment scenarios for every plant, and (iii) reduction of artificial ambient perturbations due to the immobility of the plants or other objects. In addition, we propose two new resilience-quantifying-related traits that can also be phenotyped using the HFPS: transpiration recovery rate and night water reabsorption. We use the HFPS to screen the effects of two commercial biostimulants (a seaweed extract -ICL-SW, and a metabolite formula - ICL-NewFo1) on Capsicum annuum under different irrigation regimes. Biostimulants are considered an alternative approach to improving crop productivity. However, their complex mode of action necessitates cost-effective pre-field phenotyping. The combination of two types of treatment (biostimulants and drought) enabled us to evaluate the precision and resolution of the system in investigating the effect of biostimulants on drought tolerance. We analyze and discuss plant behavior at different stages, and assess the penalty and trade-off between productivity and resilience. In this test case, we suggest a protocol for the screening of biostimulants' physiological mechanisms of action.
Collapse
Affiliation(s)
- Ahan Dalal
- The Robert H. Smith Faculty of Agriculture, Food and Environment, Institute of Plant Sciences and Genetics in Agriculture, The Hebrew University of Jerusalem, Rehovot, Israel
| | - Ronny Bourstein
- The Robert H. Smith Faculty of Agriculture, Food and Environment, Institute of Plant Sciences and Genetics in Agriculture, The Hebrew University of Jerusalem, Rehovot, Israel
- The Robert H. Smith Faculty of Agriculture, Food and Environment, The Department of Soil and Water Sciences, The Hebrew University of Jerusalem, Rehovot, Israel
| | - Nadav Haish
- The Robert H. Smith Faculty of Agriculture, Food and Environment, Institute of Plant Sciences and Genetics in Agriculture, The Hebrew University of Jerusalem, Rehovot, Israel
| | - Itamar Shenhar
- The Robert H. Smith Faculty of Agriculture, Food and Environment, Institute of Plant Sciences and Genetics in Agriculture, The Hebrew University of Jerusalem, Rehovot, Israel
| | - Rony Wallach
- The Robert H. Smith Faculty of Agriculture, Food and Environment, The Department of Soil and Water Sciences, The Hebrew University of Jerusalem, Rehovot, Israel
| | - Menachem Moshelion
- The Robert H. Smith Faculty of Agriculture, Food and Environment, Institute of Plant Sciences and Genetics in Agriculture, The Hebrew University of Jerusalem, Rehovot, Israel
| |
Collapse
|
123
|
Frouin J, Labeyrie A, Boisnard A, Sacchi GA, Ahmadi N. Genomic prediction offers the most effective marker assisted breeding approach for ability to prevent arsenic accumulation in rice grains. PLoS One 2019; 14:e0217516. [PMID: 31194746 PMCID: PMC6563978 DOI: 10.1371/journal.pone.0217516] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2019] [Accepted: 05/13/2019] [Indexed: 12/28/2022] Open
Abstract
The high concentration of arsenic (As) in rice grains, in a large proportion of the rice growing areas, is a critical issue. This study explores the feasibility of conventional (QTL-based) marker-assisted selection and genomic selection to improve the ability of rice to prevent As uptake and accumulation in the edible grains. A japonica diversity panel (RP) of 228 accessions phenotyped for As concentration in the flag leaf (FL-As) and in the dehulled grain (CG-As), and genotyped at 22,370 SNP loci, was used to map QTLs by association analysis (GWAS) and to train genomic prediction models. Similar phenotypic and genotypic data from 95 advanced breeding lines (VP) with japonica genetic backgrounds, was used to validate related QTLs mapped in the RP through GWAS and to evaluate the predictive ability of across populations (RP-VP) genomic estimate of breeding value (GEBV) for As exclusion. Several QTLs for FL-As and CG-As with a low-medium individual effect were detected in the RP, of which some colocalized with known QTLs and candidate genes. However, less than 10% of those QTLs could be validated in the VP without loosening colocalization parameters. Conversely, the average predictive ability of across populations GEBV was rather high, 0.43 for FL-As and 0.48 for CG-As, ensuring genetic gains per time unit close to phenotypic selection. The implications of the limited robustness of the GWAS results and the rather high predictive ability of genomic prediction are discussed for breeding rice for significantly low arsenic uptake and accumulation in the edible grains.
Collapse
Affiliation(s)
- Julien Frouin
- CIRAD, UMR AGAP, Montpellier, France
- AGAP, Univ Montpellier, CIRAD, INRA, Montpellier SupAgro, Montpellier, France
| | - Axel Labeyrie
- CIRAD, UMR AGAP, Montpellier, France
- AGAP, Univ Montpellier, CIRAD, INRA, Montpellier SupAgro, Montpellier, France
| | | | | | - Nourollah Ahmadi
- CIRAD, UMR AGAP, Montpellier, France
- AGAP, Univ Montpellier, CIRAD, INRA, Montpellier SupAgro, Montpellier, France
| |
Collapse
|
124
|
QTLian breeding for climate resilience in cereals: progress and prospects. Funct Integr Genomics 2019; 19:685-701. [PMID: 31093800 DOI: 10.1007/s10142-019-00684-1] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2018] [Revised: 04/05/2019] [Accepted: 04/30/2019] [Indexed: 10/26/2022]
Abstract
The ever-rising population of the twenty-first century together with the prevailing challenges, such as deteriorating quality of arable land and water, has placed a big challenge for plant breeders to satisfy human needs for food under erratic weather patterns. Rice, wheat, and maize are the major staple crops consumed globally. Drought, waterlogging, heat, salinity, and mineral toxicity are the key abiotic stresses drastically affecting crop yield. Conventional plant breeding approaches towards abiotic stress tolerance have gained success to limited extent, due to the complex (multigenic) nature of these stresses. Progress in breeding climate-resilient crop plants has gained momentum in the last decade, due to improved understanding of the physiochemical and molecular basis of various stresses. A good number of genes have been characterized for adaptation to various stresses. In the era of novel molecular markers, mapping of QTLs has emerged as viable solution for breeding crops tolerant to abiotic stresses. Therefore, molecular breeding-based development and deployment of high-yielding climate-resilient crop cultivars together with climate-smart agricultural practices can pave the path to enhanced crop yields for smallholder farmers in areas vulnerable to the climate change. Advances in fine mapping and expression studies integrated with cheaper prices offer new avenues for the plant breeders engaged in climate-resilient plant breeding, and thereby, hope persists to ensure food security in the era of climate change.
Collapse
|
125
|
An R Package for Bayesian Analysis of Multi-environment and Multi-trait Multi-environment Data for Genome-Based Prediction. G3-GENES GENOMES GENETICS 2019; 9:1355-1369. [PMID: 30819822 PMCID: PMC6505148 DOI: 10.1534/g3.119.400126] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Abstract
Evidence that genomic selection (GS) is a technology that is revolutionizing plant breeding continues to grow. However, it is very well documented that its success strongly depends on statistical models, which are used by GS to perform predictions of candidate genotypes that were not phenotyped. Because there is no universally better model for prediction and models for each type of response variable are needed (continuous, binary, ordinal, count, etc.), an active area of research aims to develop statistical models for the prediction of univariate and multivariate traits in GS. However, most of the models developed so far are for univariate and continuous (Gaussian) traits. Therefore, to overcome the lack of multivariate statistical models for genome-based prediction by improving the original version of the BMTME, we propose an improved Bayesian multi-trait and multi-environment (BMTME) R package for analyzing breeding data with multiple traits and multiple environments. We also introduce Bayesian multi-output regressor stacking (BMORS) functions that are considerably efficient in terms of computational resources. The package allows parameter estimation and evaluates the prediction performance of multi-trait and multi-environment data in a reliable, efficient and user-friendly way. We illustrate the use of the BMTME with real toy datasets to show all the facilities that the software offers the user. However, for large datasets, the BME() and BMTME() functions of the BMTME R package are very intense in terms of computing time; on the other hand, less intensive computing is required with BMORS functions BMORS() and BMORS_Env() that are also included in the BMTME package.
Collapse
|
126
|
Bhandari A, Bartholomé J, Cao-Hamadoun TV, Kumari N, Frouin J, Kumar A, Ahmadi N. Selection of trait-specific markers and multi-environment models improve genomic predictive ability in rice. PLoS One 2019; 14:e0208871. [PMID: 31059529 PMCID: PMC6502484 DOI: 10.1371/journal.pone.0208871] [Citation(s) in RCA: 35] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2018] [Accepted: 04/15/2019] [Indexed: 12/29/2022] Open
Abstract
Developing high yielding rice varieties that are tolerant to drought stress is crucial for the sustainable livelihood of rice farmers in rainfed rice cropping ecosystems. Genomic selection (GS) promises to be an effective breeding option for these complex traits. We evaluated the effectiveness of two rather new options in the implementation of GS: trait and environment-specific marker selection and the use of multi-environment prediction models. A reference population of 280 rainfed lowland accessions endowed with 215k SNP markers data was phenotyped under a favorable and two managed drought environments. Trait-specific SNP subsets (28k) were selected for each trait under each environment, using results of GWAS performed with the complete genotype dataset. Performances of single-environment and multi-environment genomic prediction models were compared using kernel regression based methods (GBLUP and RKHS) under two cross validation scenarios: availability (CV2) or not (CV1) of phenotypic data for the validation set, in one of the environments. Trait-specific marker selection strategy achieved predictive ability (PA) of genomic prediction up to 22% higher than markers selected on the bases of neutral linkage disequilibrium (LD). Tolerance to drought stress was up to 32% better predicted by multi-environment models (especially RKHS based models) under CV2 strategy. Under the less favorable CV1 strategy, the multi-environment models achieved similar PA than the single-environment predictions. We also showed that reasonable PA could be obtained with as few as 3,000 SNP markers, even in a population of low LD extent, provided marker selection is based on pairwise LD. The implications of these findings for breeding for drought tolerance are discussed. The most resource sparing option would be accurate phenotyping of the reference population in a favorable environment and under a managed drought, while the candidate population would be phenotyped only under one of those environments.
Collapse
Affiliation(s)
- Aditi Bhandari
- International Rice Research Institute, Los Banos, Philippines
- Banasthali University, Banasthali Vidyapith, India
| | - Jérôme Bartholomé
- CIRAD, UMR AGAP, Montpellier, France
- AGAP, Univ Montpellier, CIRAD, INRA, Montpellier SupAgro, Montpellier, France
| | - Tuong-Vi Cao-Hamadoun
- CIRAD, UMR AGAP, Montpellier, France
- AGAP, Univ Montpellier, CIRAD, INRA, Montpellier SupAgro, Montpellier, France
| | | | - Julien Frouin
- CIRAD, UMR AGAP, Montpellier, France
- AGAP, Univ Montpellier, CIRAD, INRA, Montpellier SupAgro, Montpellier, France
| | - Arvind Kumar
- International Rice Research Institute, Los Banos, Philippines
| | - Nourollah Ahmadi
- CIRAD, UMR AGAP, Montpellier, France
- AGAP, Univ Montpellier, CIRAD, INRA, Montpellier SupAgro, Montpellier, France
| |
Collapse
|
127
|
Juanillas V, Dereeper A, Beaume N, Droc G, Dizon J, Mendoza JR, Perdon JP, Mansueto L, Triplett L, Lang J, Zhou G, Ratharanjan K, Plale B, Haga J, Leach JE, Ruiz M, Thomson M, Alexandrov N, Larmande P, Kretzschmar T, Mauleon RP. Rice Galaxy: an open resource for plant science. Gigascience 2019; 8:giz028. [PMID: 31107941 PMCID: PMC6527052 DOI: 10.1093/gigascience/giz028] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2018] [Revised: 08/29/2018] [Accepted: 02/12/2019] [Indexed: 01/16/2023] Open
Abstract
BACKGROUND Rice molecular genetics, breeding, genetic diversity, and allied research (such as rice-pathogen interaction) have adopted sequencing technologies and high-density genotyping platforms for genome variation analysis and gene discovery. Germplasm collections representing rice diversity, improved varieties, and elite breeding materials are accessible through rice gene banks for use in research and breeding, with many having genome sequences and high-density genotype data available. Combining phenotypic and genotypic information on these accessions enables genome-wide association analysis, which is driving quantitative trait loci discovery and molecular marker development. Comparative sequence analyses across quantitative trait loci regions facilitate the discovery of novel alleles. Analyses involving DNA sequences and large genotyping matrices for thousands of samples, however, pose a challenge to non-computer savvy rice researchers. FINDINGS The Rice Galaxy resource has shared datasets that include high-density genotypes from the 3,000 Rice Genomes project and sequences with corresponding annotations from 9 published rice genomes. The Rice Galaxy web server and deployment installer includes tools for designing single-nucleotide polymorphism assays, analyzing genome-wide association studies, population diversity, rice-bacterial pathogen diagnostics, and a suite of published genomic prediction methods. A prototype Rice Galaxy compliant to Open Access, Open Data, and Findable, Accessible, Interoperable, and Reproducible principles is also presented. CONCLUSIONS Rice Galaxy is a freely available resource that empowers the plant research community to perform state-of-the-art analyses and utilize publicly available big datasets for both fundamental and applied science.
Collapse
Affiliation(s)
- Venice Juanillas
- International Rice Research Institute, DAPO Box 7777, Metro Manila 1301, Philippines
| | - Alexis Dereeper
- Institut de recherche pour le développement (IRD), University of Montpellier, DIADE, IPME, Montpellier, France
| | - Nicolas Beaume
- International Rice Research Institute, DAPO Box 7777, Metro Manila 1301, Philippines
| | - Gaetan Droc
- CIRAD, UMR AGAP, F-34398 Montpellier, France
| | - Joshua Dizon
- International Rice Research Institute, DAPO Box 7777, Metro Manila 1301, Philippines
| | - John Robert Mendoza
- Advanced Science and Technology Institute, Department of Science and Technology, Quezon City, Philippines
| | - Jon Peter Perdon
- Advanced Science and Technology Institute, Department of Science and Technology, Quezon City, Philippines
| | - Locedie Mansueto
- International Rice Research Institute, DAPO Box 7777, Metro Manila 1301, Philippines
| | - Lindsay Triplett
- Department of Bioagricultural Sciences and Pest Management, Colorado State University, Fort Collins, CO 80523-1177, USA
| | - Jillian Lang
- Department of Bioagricultural Sciences and Pest Management, Colorado State University, Fort Collins, CO 80523-1177, USA
| | - Gabriel Zhou
- Indiana University, 107 S Indiana Ave, Bloomington, IN 47405, USA
| | | | - Beth Plale
- Indiana University, 107 S Indiana Ave, Bloomington, IN 47405, USA
| | - Jason Haga
- National Institute of Advanced Industrial Science and Technology, AIST Tsukuba Central 1,1-1-1 Umezono, Tsukuba, Ibaraki 305-8560, Japan
| | - Jan E Leach
- Department of Bioagricultural Sciences and Pest Management, Colorado State University, Fort Collins, CO 80523-1177, USA
| | - Manuel Ruiz
- CIRAD, UMR AGAP, F-34398 Montpellier, France
| | - Michael Thomson
- International Rice Research Institute, DAPO Box 7777, Metro Manila 1301, Philippines
- Department of Soil and Crop Sciences, Texas A&M University, Houston, TX, USA
| | - Nickolai Alexandrov
- International Rice Research Institute, DAPO Box 7777, Metro Manila 1301, Philippines
| | - Pierre Larmande
- Institut de recherche pour le développement (IRD), University of Montpellier, DIADE, IPME, Montpellier, France
| | - Tobias Kretzschmar
- International Rice Research Institute, DAPO Box 7777, Metro Manila 1301, Philippines
- Southern Cross Plant Science, Southern Cross University, Lismore, Australia
| | - Ramil P Mauleon
- International Rice Research Institute, DAPO Box 7777, Metro Manila 1301, Philippines
- Southern Cross Plant Science, Southern Cross University, Lismore, Australia
| |
Collapse
|
128
|
Wang DR, Guadagno CR, Mao X, Mackay DS, Pleban JR, Baker RL, Weinig C, Jannink JL, Ewers BE. A framework for genomics-informed ecophysiological modeling in plants. JOURNAL OF EXPERIMENTAL BOTANY 2019; 70:2561-2574. [PMID: 30825375 PMCID: PMC6487588 DOI: 10.1093/jxb/erz090] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/14/2018] [Accepted: 02/18/2019] [Indexed: 05/06/2023]
Abstract
Dynamic process-based plant models capture complex physiological response across time, carrying the potential to extend simulations out to novel environments and lend mechanistic insight to observed phenotypes. Despite the translational opportunities for varietal crop improvement that could be unlocked by linking natural genetic variation to first principles-based modeling, these models are challenging to apply to large populations of related individuals. Here we use a combination of model development, experimental evaluation, and genomic prediction in Brassica rapa L. to set the stage for future large-scale process-based modeling of intraspecific variation. We develop a new canopy growth submodel for B. rapa within the process-based model Terrestrial Regional Ecosystem Exchange Simulator (TREES), test input parameters for feasibility of direct estimation with observed phenotypes across cultivated morphotypes and indirect estimation using genomic prediction on a recombinant inbred line population, and explore model performance on an in silico population under non-stressed and mild water-stressed conditions. We find evidence that the updated whole-plant model has the capacity to distill genotype by environment interaction (G×E) into tractable components. The framework presented offers a means to link genetic variation with environment-modulated plant response and serves as a stepping stone towards large-scale prediction of unphenotyped, genetically related individuals under untested environmental scenarios.
Collapse
Affiliation(s)
- Diane R Wang
- Geography Department, University at Buffalo, Buffalo, NY, USA
| | | | - Xiaowei Mao
- Plant Breeding and Genetics Section, Cornell University, Ithaca, NY, USA
| | - D Scott Mackay
- Geography Department, University at Buffalo, Buffalo, NY, USA
| | | | | | - Cynthia Weinig
- Botany Department, University of Wyoming, Laramie, WY, USA
| | - Jean-Luc Jannink
- Plant Breeding and Genetics Section, Cornell University, Ithaca, NY, USA
- USDA-ARS, Ithaca, NY, USA
| | - Brent E Ewers
- Botany Department, University of Wyoming, Laramie, WY, USA
| |
Collapse
|
129
|
Budhlakoti N, Mishra DC, Rai A, Lal SB, Chaturvedi KK, Kumar RR. A Comparative Study of Single-Trait and Multi-Trait Genomic Selection. J Comput Biol 2019; 26:1100-1112. [PMID: 30994361 DOI: 10.1089/cmb.2019.0032] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023] Open
Abstract
In recent years of animal and plant breeding research, genomic selection (GS) became a choice for selection of appropriate candidate for breeding as it significantly contributes to enhance the genetic gain. Various studies related to GS have been carried out in the recent past. These studies were mostly confined to single trait. Although GS methods based on single trait have not performed very well in cases like pleiotropy, missing data and when the trait under study has low heritability. Gradually, some studies were carried out to explore the possibility of methods for GS based on multiple traits in the view of overcoming the above-mentioned problems in the method of single-trait GS (STGS). Currently, multi-trait-based GS methods are getting importance as it exploits the information of correlated structure among response. In this study, we have compared various methods related to STGS, such as stepwise regression, ridge regression, least absolute shrinkage and selection operator (LASSO), Bayesian, best linear unbiased prediction, and support vector machine, and multi-trait-based GS methods, such as multivariate regression with covariance estimation, conditional Gaussian graphical models, mixed model, and LASSO. In almost all cases, multi-trait-based methods are found to be more accurate. Based on the results of this study, it may be concluded that multi-trait-based methods have great potential to increase genetic gain as they utilize the correlation among the response variable as extra information, which contributes to estimate breeding value more precisely. This study is a comprehensive review of the methods of GS right from single trait to multiple traits and comparisons among these two classes.
Collapse
Affiliation(s)
- Neeraj Budhlakoti
- ICAR-Indian Agricultural Statistics Research Institute, Pusa, New Delhi, India
| | | | - Anil Rai
- ICAR-Indian Agricultural Statistics Research Institute, Pusa, New Delhi, India
| | - S B Lal
- ICAR-Indian Agricultural Statistics Research Institute, Pusa, New Delhi, India
| | | | - Rajeev Ranjan Kumar
- ICAR-Indian Agricultural Statistics Research Institute, Pusa, New Delhi, India
| |
Collapse
|
130
|
Wang S, Wei J, Li R, Qu H, Chater JM, Ma R, Li Y, Xie W, Jia Z. Identification of optimal prediction models using multi-omic data for selecting hybrid rice. Heredity (Edinb) 2019; 123:395-406. [PMID: 30911139 DOI: 10.1038/s41437-019-0210-6] [Citation(s) in RCA: 31] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2018] [Revised: 02/22/2019] [Accepted: 02/25/2019] [Indexed: 11/09/2022] Open
Abstract
Genomic prediction benefits hybrid rice breeding by increasing selection intensity and accelerating breeding cycles. With the rapid advancement of technology, other omic data, such as metabolomic data and transcriptomic data, are readily available for predicting breeding values for agronomically important traits. In this study, the best prediction strategies were determined for yield, 1000 grain weight, number of grains per panicle, and number of tillers per plant of hybrid rice (derived from recombinant inbred lines) by comprehensively evaluating all possible combinations of omic datasets with different prediction methods. It was demonstrated that, in rice, the predictions using a combination of genomic and metabolomic data generally produce better results than single-omics predictions or predictions based on other combined omic data. Best linear unbiased prediction (BLUP) appears to be the most efficient prediction method compared to the other commonly used approaches, including least absolute shrinkage and selection operator (LASSO), stochastic search variable selection (SSVS), support vector machines with radial basis function and epsilon regression (SVM-R(EPS)), support vector machines with radial basis function and nu regression (SVM-R(NU)), support vector machines with polynomial kernel and epsilon regression (SVM-P(EPS)), support vector machines with polynomial kernel and nu regression (SVM-P(NU)) and partial least squares regression (PLS). This study has provided guidelines for selection of hybrid rice in terms of which types of omic datasets and which method should be used to achieve higher trait predictability. The answer to these questions will benefit academic research and will also greatly reduce the operative cost for the industry which specializes in breeding and selection.
Collapse
Affiliation(s)
- Shibo Wang
- Department of Botany & Plant Sciences, University of California, Riverside, CA, USA
| | - Julong Wei
- College of Animal Science and Technology, Nanjing Agricultural University, Nanjing, China
| | - Ruidong Li
- Department of Botany & Plant Sciences, University of California, Riverside, CA, USA
| | - Han Qu
- Department of Botany & Plant Sciences, University of California, Riverside, CA, USA
| | - John M Chater
- Department of Botany & Plant Sciences, University of California, Riverside, CA, USA
| | - Renyuan Ma
- Department of Mathematics, Bowdoin College, Brunswick, ME, USA
| | - Yonghao Li
- Department of Neuroscience, University of British Columbia, Vancouver, BC, Canada
| | - Weibo Xie
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, China
| | - Zhenyu Jia
- Department of Botany & Plant Sciences, University of California, Riverside, CA, USA.
| |
Collapse
|
131
|
Voss-Fels KP, Cooper M, Hayes BJ. Accelerating crop genetic gains with genomic selection. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2019; 132:669-686. [PMID: 30569365 DOI: 10.1007/s00122-018-3270-8] [Citation(s) in RCA: 120] [Impact Index Per Article: 24.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/09/2018] [Accepted: 12/12/2018] [Indexed: 05/05/2023]
Abstract
Genomic prediction based on additive genetic effects can accelerate genetic gain. There are opportunities for further improvement by including non-additive effects that access untapped sources of genetic diversity. Several studies have reported a worrying gap between the projected global future demand for plant-based products and the current annual rates of production increase, indicating that enhancing the rate of genetic gain might be critical for future food security. Therefore, new breeding technologies and strategies are required to significantly boost genetic improvement of future crop cultivars. Genomic selection (GS) has delivered considerable genetic gain in animal breeding and is becoming an essential component of many modern plant breeding programmes as well. In this paper, we review the lessons learned from implementing GS in livestock and the impact of GS on crop breeding, and discuss important features for the success of GS under different breeding scenarios. We highlight major challenges associated with GS including rapid genotyping, phenotyping, genotype-by-environment interaction and non-additivity and give examples for opportunities to overcome these issues. Finally, the potential of combining GS with other modern technologies in order to maximise the rate of crop genetic improvement is discussed, including the potential of increasing prediction accuracy by integration of crop growth models in GS frameworks.
Collapse
Affiliation(s)
- Kai Peter Voss-Fels
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia, QLD, 4072, Australia
| | - Mark Cooper
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia, QLD, 4072, Australia
| | - Ben John Hayes
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia, QLD, 4072, Australia.
| |
Collapse
|
132
|
Hao Y, Wang H, Yang X, Zhang H, He C, Li D, Li H, Wang G, Wang J, Fu J. Genomic Prediction using Existing Historical Data Contributing to Selection in Biparental Populations: A Study of Kernel Oil in Maize. THE PLANT GENOME 2019; 12. [PMID: 30951098 DOI: 10.3835/plantgenome2018.05.0025] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/05/2023]
Abstract
Maize ( L.) kernel oil provides high-quality nutrition for animal feed and human health. A certain number of maize breeding programs seek to enhance oil concentration and composition. Genomic selection (GS), which entails selection based on genomic estimated breeding values (GEBVs), has proven to be efficient in breeding programs. Here, we estimate the robustness of predictions for the oil traits of maize kernels in biparental recombination inbred lines (RILs) using a GS model built based on an association population. Most statistical models, including ridge regression-best linear unbiased prediction (RR-BLUP), showed high prediction accuracy in the training population through a cross validation procedure. The training population size was more important than marker density and a statistical model for prediction performance. Using the optimized GS model, prediction of the biparental RIL population showed medium-high prediction accuracy (0.68) compared with prediction using only oil associated markers ( = 0.43). The potential to apply the GS model to another RIL population that is genetically less related to the training population was also examined, showing promising prediction accuracy in the top selected lines. Our results proved that genomic prediction using existing data is robust for the prediction of polygenic traits with moderate to high heritability.
Collapse
|
133
|
Rice B, Lipka AE. Evaluation of RR-BLUP Genomic Selection Models that Incorporate Peak Genome-Wide Association Study Signals in Maize and Sorghum. THE PLANT GENOME 2019; 12. [PMID: 30951091 DOI: 10.3835/plantgenome2018.07.0052] [Citation(s) in RCA: 55] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/02/2023]
Abstract
Certain agronomic crop traits are complex and thus governed by many small-effect loci. Statistical models typically used in a genome-wide association study (GWAS) and genomic selection (GS) quantify these signals by assessing genomic marker contributions in linkage disequilibrium (LD) with these loci to trait variation. These models have been used in separate quantitative genetics contexts until recently, when, in published studies, the predictive ability of GS models that include peak associated markers from a GWAS as fixed-effect covariates was assessed. Previous work suggests that such models could be useful for predicting traits controlled by several large-effect and many small-effect genes. We expand this work by evaluating simulated traits from diversity panels in maize ( L.) and sorghum [ (L.) Moench] using ridge-regression best linear unbiased prediction (RR-BLUP) models that include fixed-effect covariates tagging peak GWAS signals. The ability of such covariates to increase GS prediction accuracy in the RR-BLUP model under a wide variety of genetic architectures and genomic backgrounds was quantified. Of the 216 genetic architectures that we simulated, we identified 60 where the addition of fixed-effect covariates boosted prediction accuracy. However, for the majority of the simulated data, no increase or a decrease in prediction accuracy was observed. We also noted several instances where the inclusion of fixed-effect covariates increased both the variability of prediction accuracies and the bias of the genomic estimated breeding values. We therefore recommend that the performance of such a GS model be explored on a trait-by-trait basis prior to its implementation into a breeding program.
Collapse
|
134
|
Ward BP, Brown-Guedira G, Kolb FL, Van Sanford DA, Tyagi P, Sneller CH, Griffey CA. Genome-wide association studies for yield-related traits in soft red winter wheat grown in Virginia. PLoS One 2019; 14:e0208217. [PMID: 30794545 PMCID: PMC6386437 DOI: 10.1371/journal.pone.0208217] [Citation(s) in RCA: 43] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2018] [Accepted: 02/05/2019] [Indexed: 01/19/2023] Open
Abstract
Grain yield is a trait of paramount importance in the breeding of all cereals. In wheat (Triticum aestivum L.), yield has steadily increased since the Green Revolution, though the current rate of increase is not forecasted to keep pace with demand due to growing world population and increasing affluence. While several genome-wide association studies (GWAS) on yield and related component traits have been performed in wheat, the previous lack of a reference genome has made comparisons between studies difficult. In this study, a GWAS for yield and yield-related traits was carried out on a population of 322 soft red winter wheat lines across a total of four rain-fed environments in the state of Virginia using single-nucleotide polymorphism (SNP) marker data generated by a genotyping-by-sequencing (GBS) protocol. Two separate mixed linear models were used to identify significant marker-trait associations (MTAs). The first was a single-locus model utilizing a leave-one-chromosome-out approach to estimating kinship. The second was a sub-setting kinship estimation multi-locus method (FarmCPU). The single-locus model identified nine significant MTAs for various yield-related traits, while the FarmCPU model identified 74 significant MTAs. The availability of the wheat reference genome allowed for the description of MTAs in terms of both genetic and physical positions, and enabled more extensive post-GWAS characterization of significant MTAs. The results indicate a number of promising candidate genes contributing to grain yield, including an ortholog of the rice aberrant panicle organization (APO1) protein and a gibberellin oxidase protein (GA2ox-A1) affecting the trait grains per square meter, an ortholog of the Arabidopsis thaliana mother of flowering time and terminal flowering 1 (MFT) gene affecting the trait seeds per square meter, and a B2 heat stress response protein affecting the trait seeds per head.
Collapse
Affiliation(s)
- Brian P. Ward
- Department Of Crop and Soil Environmental Sciences, Virginia Tech, Blacksburg, Virginia, United States of America
| | - Gina Brown-Guedira
- Eastern Regional Small Grains Genotyping Laboratory, USDA-ARS, Raleigh, North Carolina, United States of America
| | - Frederic L. Kolb
- Department of Crop Sciences, University of Illinois, Urbana, Illinois, United States of America
| | - David A. Van Sanford
- Department of Plant and Soil Sciences, University of Kentucky, Lexington, Kentucky, United States of America
| | - Priyanka Tyagi
- Department of Crop and Soil Sciences, North Carolina State University, Raleigh, North Carolina, United States of America
| | - Clay H. Sneller
- Ohio Agricultural Research and Development Center, The Ohio State University, Wooster, Ohio, United States of America
| | - Carl A. Griffey
- Department Of Crop and Soil Environmental Sciences, Virginia Tech, Blacksburg, Virginia, United States of America
| |
Collapse
|
135
|
Sant’ Anna IDC, Cabral Ferreira RAD, Nascimento M, Silva GN, Carneiro VQ, Cruz CD, Oliveira MS, Chagas FE. Multigenerational prediction of genetic values using genome-enabled prediction. PLoS One 2019; 14:e0210531. [PMID: 30653561 PMCID: PMC6336252 DOI: 10.1371/journal.pone.0210531] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2018] [Accepted: 12/26/2018] [Indexed: 11/19/2022] Open
Abstract
The identification of elite individuals is a critical component of most breeding programs. However, the achievement of this goal is limited by the high cost of phenotyping and experimental research. A significant benefit of genomic selection (GS) to plant breeding is the identification of elite individuals without the need for phenotyping. This study aimed to propose different calibration strategies using combinations between generations from different genetic backgrounds to improve the reliability of GS and to investigate the effects of LD in different types of mating systems: outcrossing (An) self-pollination (Sn) and hybridization (Hn). For this purpose, we simulated a genome with 10 linkage groups. In each group, two QTL were simulated. Subsequently, an F2 population was created, followed by four generations of inbreeding (S1 to S4, H1 to H 4, A1, to A4,). Quantitative traits were simulated in three scenarios considering three degrees of dominance (d/a = 0, 0.5 and 1) and two broad sense heritabilities (h2 = 0.30 and 0.70), totaling six genetic architectures. To evaluate prediction reliability, a model (RR-BLUP) was trained in one generation and used to predict the following generations of mating systems. For example, the marker effects estimated in the F2 population were used to estimate the expected genomic breeding value (GEBV) in populations S1 through A4. The squared correlation between the GEBV and the true genetic value were used to measure the reliability of the predictions. Independently of the population used to estimate the marker effect, reliability showed the lowest values in the scenario where d = 1. For any scenario, the use of the multigenerational prediction methodology improved the reliability of GS.
Collapse
Affiliation(s)
| | | | - Moysés Nascimento
- Department of Statistics, Federal University of Viçosa, Viçosa, Minas Gerais, Brazil
| | - Gabi Nunes Silva
- Department of Statistics, Federal University of Rondônia, Ji-Paraná, Rondônia, Brazil
| | | | - Cosme Damião Cruz
- Department of General Biology, Federal University of Viçosa, Viçosa, Minas Gerais, Brazil
| | | | - Francyse Edith Chagas
- Department of General Biology, Federal University of Viçosa, Viçosa, Minas Gerais, Brazil
| |
Collapse
|
136
|
He L, Xiao J, Rashid KY, Jia G, Li P, Yao Z, Wang X, Cloutier S, You FM. Evaluation of Genomic Prediction for Pasmo Resistance in Flax. Int J Mol Sci 2019; 20:E359. [PMID: 30654497 PMCID: PMC6359301 DOI: 10.3390/ijms20020359] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2018] [Revised: 01/06/2019] [Accepted: 01/11/2019] [Indexed: 02/06/2023] Open
Abstract
Pasmo (Septoria linicola) is a fungal disease causing major losses in seed yield and quality and stem fibre quality in flax. Pasmo resistance (PR) is quantitative and has low heritability. To improve PR breeding efficiency, the accuracy of genomic prediction (GP) was evaluated using a diverse worldwide core collection of 370 accessions. Four marker sets, including three defined by 500, 134 and 67 previously identified quantitative trait loci (QTL) and one of 52,347 PR-correlated genome-wide single nucleotide polymorphisms, were used to build ridge regression best linear unbiased prediction (RR-BLUP) models using pasmo severity (PS) data collected from field experiments performed during five consecutive years. With five-fold random cross-validation, GP accuracy as high as 0.92 was obtained from the models using the 500 QTL when the average PS was used as the training dataset. GP accuracy increased with training population size, reaching values >0.9 with training population size greater than 185. Linear regression of the observed PS with the number of positive-effect QTL in accessions provided an alternative GP approach with an accuracy of 0.86. The results demonstrate the GP models based on marker information from all identified QTL and the 5-year PS average is highly effective for PR prediction.
Collapse
Affiliation(s)
- Liqiang He
- Ottawa Research and Development Centre, Agriculture and Agri-Food Canada, Ottawa, ON K1A 0C6, Canada.
- State Key Laboratory of Crop Genetics and Germplasm Enhancement, College of Agriculture, Nanjing Agricultural University/JiangSu Collaborative Innovation Center for Modern Crop Production, Nanjing 210095, China.
| | - Jin Xiao
- State Key Laboratory of Crop Genetics and Germplasm Enhancement, College of Agriculture, Nanjing Agricultural University/JiangSu Collaborative Innovation Center for Modern Crop Production, Nanjing 210095, China.
| | - Khalid Y Rashid
- Morden Research and Development Centre, Agriculture and Agri-Food Canada, Morden, MB R6M 1Y5, Canada.
| | - Gaofeng Jia
- Crop Development Centre, University of Saskatchewan, Saskatoon, SK S7N 5A8, Canada.
| | - Pingchuan Li
- Morden Research and Development Centre, Agriculture and Agri-Food Canada, Morden, MB R6M 1Y5, Canada.
| | - Zhen Yao
- Morden Research and Development Centre, Agriculture and Agri-Food Canada, Morden, MB R6M 1Y5, Canada.
| | - Xiue Wang
- State Key Laboratory of Crop Genetics and Germplasm Enhancement, College of Agriculture, Nanjing Agricultural University/JiangSu Collaborative Innovation Center for Modern Crop Production, Nanjing 210095, China.
| | - Sylvie Cloutier
- Ottawa Research and Development Centre, Agriculture and Agri-Food Canada, Ottawa, ON K1A 0C6, Canada.
| | - Frank M You
- Ottawa Research and Development Centre, Agriculture and Agri-Food Canada, Ottawa, ON K1A 0C6, Canada.
- State Key Laboratory of Crop Genetics and Germplasm Enhancement, College of Agriculture, Nanjing Agricultural University/JiangSu Collaborative Innovation Center for Modern Crop Production, Nanjing 210095, China.
| |
Collapse
|
137
|
Genotype Imputation in Winter Wheat Using First-Generation Haplotype Map SNPs Improves Genome-Wide Association Mapping and Genomic Prediction of Traits. G3-GENES GENOMES GENETICS 2019; 9:125-133. [PMID: 30420469 PMCID: PMC6325902 DOI: 10.1534/g3.118.200664] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Genome-wide single nucleotide polymorphism (SNP) variation allows for the capture of haplotype structure in populations and prediction of unobserved genotypes based on inferred regions of identity-by-descent (IBD). Here we have used a first-generation wheat haplotype map created by targeted re-sequencing of low-copy genomic regions in the reference panel of 62 lines to impute marker genotypes in a diverse panel of winter wheat cultivars from the U.S. Great Plains. The IBD segments between the reference population and winter wheat cultivars were identified based on SNP genotyped using the 90K iSelect wheat array and genotyping by sequencing (GBS). A genome-wide association study and genomic prediction of resistance to stripe rust in winter wheat cultivars showed that an increase in marker density achieved by imputation improved both the power and precision of trait mapping and prediction. The majority of the most significant marker-trait associations belonged to imputed genotypes. With the vast amount of SNP variation data accumulated for wheat in recent years, the presented imputation framework will greatly improve prediction accuracy in breeding populations and increase resolution of trait mapping hence, facilitate cross-referencing of genotype datasets available across different wheat populations.
Collapse
|
138
|
Ikeogu UN, Akdemir D, Wolfe MD, Okeke UG, Chinedozi A, Jannink JL, Egesi CN. Genetic Correlation, Genome-Wide Association and Genomic Prediction of Portable NIRS Predicted Carotenoids in Cassava Roots. FRONTIERS IN PLANT SCIENCE 2019; 10:1570. [PMID: 31867030 PMCID: PMC6904298 DOI: 10.3389/fpls.2019.01570] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/27/2019] [Accepted: 11/08/2019] [Indexed: 05/21/2023]
Abstract
Random forests (RF) was used to correlate spectral responses to known wet chemistry carotenoid concentrations including total carotenoid content (TCC), all-trans β-carotene (ATBC), violaxanthin (VIO), lutein (LUT), 15-cis beta-carotene (15CBC), 13-cis beta-carotene (13CBC), alpha-carotene (AC), 9-cis beta-carotene (9CBC), and phytoene (PHY) from laboratory analysis of 173 cassava root samples in Columbia. The cross-validated correlations between the actual and estimated carotenoid values using RF ranged from 0.62 in PHY to 0.97 in ATBC. The developed models were used to evaluate the carotenoids of 594 cassava clones with spectral information collected across three locations in a national breeding program (NRCRI, Umudike), Nigeria. Both populations contained cassava clones characterized as white and yellow. The NRCRI evaluated phenotypes were used to assess the genetic correlations, conduct genome-wide association studies (GWAS), and genomic predictions. Estimates of genetic correlation showed various levels of the relationship among the carotenoids. The associations between TCC and the individual carotenoids were all significant (P < 0.001) with high positive values (r > 0.75, except in LUT and PHY where r < 0.3). The GWAS revealed significant genomic regions on chromosomes 1, 2, 4, 13, 14, and 15 associated with variation in at least one of the carotenoids. One of the identified candidate genes, phytoene synthase (PSY) has been widely reported for variation in TCC in cassava. On average, genomic prediction accuracies from the single-trait genomic best linear unbiased prediction (GBLUP) and RF as well as from a multiple-trait GBLUP model ranged from ∼0.2 in LUT and PHY to 0.52 in TCC. The multiple-trait GBLUP model gave slightly higher accuracies than the single trait GBLUP and RF models. This study is one of the initial attempts in understanding the genetic basis of individual carotenoids and demonstrates the usefulness of NIRS in cassava improvement.
Collapse
Affiliation(s)
- Ugochukwu N. Ikeogu
- Plant Breeding and Genetics Section, Cornell University, Ithaca, NY, United States
- Biotechnology Department, National Root Crops Research Institute, Umudike, Nigeria
- *Correspondence: Ugochukwu N. Ikeogu,
| | - Deniz Akdemir
- Cornell University Statistical Consulting Unit (CSCU), Cornell University, Ithaca, NY, United States
| | - Marnin D. Wolfe
- Plant Breeding and Genetics Section, Cornell University, Ithaca, NY, United States
| | - Uche G. Okeke
- Plant Breeding and Genetics Section, Cornell University, Ithaca, NY, United States
| | - Amaefula Chinedozi
- Biotechnology Department, National Root Crops Research Institute, Umudike, Nigeria
| | - Jean-Luc Jannink
- Plant Breeding and Genetics Section, Cornell University, Ithaca, NY, United States
- Plant, Soil and Nutrition Research, Robert W. Holley Center for Agriculture & Health, Agricultural Research Service, United States Department of Agriculture (USDA), Ithaca, NY, United States
| | - Chiedozie N. Egesi
- Plant Breeding and Genetics Section, Cornell University, Ithaca, NY, United States
- Biotechnology Department, National Root Crops Research Institute, Umudike, Nigeria
- Cassava Breeding Department, International Institute of Tropical Agriculture (IITA), Ibadan, Nigeria
| |
Collapse
|
139
|
Morton MJL, Awlia M, Al‐Tamimi N, Saade S, Pailles Y, Negrão S, Tester M. Salt stress under the scalpel - dissecting the genetics of salt tolerance. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2019; 97:148-163. [PMID: 30548719 PMCID: PMC6850516 DOI: 10.1111/tpj.14189] [Citation(s) in RCA: 123] [Impact Index Per Article: 24.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/18/2018] [Revised: 11/28/2018] [Accepted: 11/30/2018] [Indexed: 05/08/2023]
Abstract
Salt stress limits the productivity of crops grown under saline conditions, leading to substantial losses of yield in saline soils and under brackish and saline irrigation. Salt tolerant crops could alleviate these losses while both increasing irrigation opportunities and reducing agricultural demands on dwindling freshwater resources. However, despite significant efforts, progress towards this goal has been limited, largely because of the genetic complexity of salt tolerance for agronomically important yield-related traits. Consequently, the focus is shifting to the study of traits that contribute to overall tolerance, thus breaking down salt tolerance into components that are more genetically tractable. Greater consideration of the plasticity of salt tolerance mechanisms throughout development and across environmental conditions furthers this dissection. The demand for more sophisticated and comprehensive methodologies is being met by parallel advances in high-throughput phenotyping and sequencing technologies that are enabling the multivariate characterisation of vast germplasm resources. Alongside steady improvements in statistical genetics models, forward genetics approaches for elucidating salt tolerance mechanisms are gaining momentum. Subsequent quantitative trait locus and gene validation has also become more accessible, most recently through advanced techniques in molecular biology and genomic analysis, facilitating the translation of findings to the field. Besides fuelling the improvement of established crop species, this progress also facilitates the domestication of naturally salt tolerant orphan crops. Taken together, these advances herald a promising era of discovery for research into the genetics of salt tolerance in plants.
Collapse
Affiliation(s)
- Mitchell J. L. Morton
- Division of Biological and Environmental Sciences and EngineeringKing Abdullah University of Science and Technology (KAUST)Thuwal23955‐6900Kingdom of Saudi Arabia
| | - Mariam Awlia
- Division of Biological and Environmental Sciences and EngineeringKing Abdullah University of Science and Technology (KAUST)Thuwal23955‐6900Kingdom of Saudi Arabia
| | - Nadia Al‐Tamimi
- Division of Biological and Environmental Sciences and EngineeringKing Abdullah University of Science and Technology (KAUST)Thuwal23955‐6900Kingdom of Saudi Arabia
| | - Stephanie Saade
- Division of Biological and Environmental Sciences and EngineeringKing Abdullah University of Science and Technology (KAUST)Thuwal23955‐6900Kingdom of Saudi Arabia
| | - Yveline Pailles
- Division of Biological and Environmental Sciences and EngineeringKing Abdullah University of Science and Technology (KAUST)Thuwal23955‐6900Kingdom of Saudi Arabia
| | - Sónia Negrão
- Division of Biological and Environmental Sciences and EngineeringKing Abdullah University of Science and Technology (KAUST)Thuwal23955‐6900Kingdom of Saudi Arabia
| | - Mark Tester
- Division of Biological and Environmental Sciences and EngineeringKing Abdullah University of Science and Technology (KAUST)Thuwal23955‐6900Kingdom of Saudi Arabia
| |
Collapse
|
140
|
Liu X, Wang H, Hu X, Li K, Liu Z, Wu Y, Huang C. Improving Genomic Selection With Quantitative Trait Loci and Nonadditive Effects Revealed by Empirical Evidence in Maize. FRONTIERS IN PLANT SCIENCE 2019; 10:1129. [PMID: 31620155 PMCID: PMC6759780 DOI: 10.3389/fpls.2019.01129] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/06/2019] [Accepted: 08/15/2019] [Indexed: 05/20/2023]
Abstract
Genomic selection (GS), a tool developed for molecular breeding, is used by plant breeders to improve breeding efficacy by shortening the breeding cycle and to facilitate the selection of candidate lines for creating hybrids without phenotyping in various environments. Association and linkage mapping have been widely used to explore and detect candidate genes in order to understand the genetic mechanisms of quantitative traits. In the current study, phenotypic and genotypic data from three experimental populations, including data on six agronomic traits (e.g., plant height, ear height, ear length, ear diameter, grain yield per plant, and hundred-kernel weight), were used to evaluate the effect of trait-relevant markers (TRMs) on prediction accuracy estimation. Integrating information from mapping into a statistical model can efficiently improve prediction performance compared with using stochastically selected markers to perform GS. The prediction accuracy can reach plateau when a total of 500-1,000 TRMs are utilized in GS. The prediction accuracy can be significantly enhanced by including nonadditive effects and TRMs in the GS model when genotypic data with high proportions of heterozygous alleles and complex agronomic traits with high proportion of nonadditive variancein phenotypic variance are used to perform GS. In addition, taking information on population structure into account can slightly improve prediction performance when the genetic relationship between the training and testing sets is influenced by population stratification due to different allele frequencies. In conclusion, GS is a useful approach for prescreening candidate lines, and the empirical evidence provided by the current study for TRMs and nonadditive effects can inform plant breeding and in turn contribute to the improvement of selection efficiency in practical GS-assisted breeding programs.
Collapse
|
141
|
Jimenez R, Molina L, Zarei I, Lapis JR, Chavez R, Cuevas RPO, Sreenivasulu N. Method Development of Near-Infrared Spectroscopy Approaches for Nondestructive and Rapid Estimation of Total Protein in Brown Rice Flour. Methods Mol Biol 2019; 1892:109-135. [PMID: 30397803 DOI: 10.1007/978-1-4939-8914-0_7] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Rice varietal development and improvement programs are constantly seeking means to shorten the breeding cycle in order to deliver new, consumer-acceptable rice varieties to farmers and to consumers. Advances in molecular biology technologies have enabled breeders to use high-throughput genotyping to screen breeding lines. However, current phenotyping technologies, particularly for rice cooking and eating properties, have yet to match the efficiency of genotyping methodologies. A high-throughput and cost-effective phenotyping suite is essential because without phenotype, the value of genotypic information cannot be maximized. In this book chapter, we explore the application of near-infrared spectroscopy (NIRS), a high-throughput and nondestructive approach in characterizing rice grains, primarily describing method development and validation, instrument calibration, upgrading, and maintenance. We then focus on estimating protein content (PC) in brown rice as a case study because (1) PC is an attribute that contributes to the cooking behavior and the eating properties of cooked rice; and (2) proteins contain chemical bonds that can easily be detected by NIRS.
Collapse
Affiliation(s)
- Rosario Jimenez
- International Rice Research Institute, Los Baños, Laguna, Philippines
| | - Lilia Molina
- International Rice Research Institute, Los Baños, Laguna, Philippines
| | - Iman Zarei
- International Rice Research Institute, Los Baños, Laguna, Philippines
| | | | - Ruben Chavez
- International Rice Research Institute, Los Baños, Laguna, Philippines
| | | | - Nese Sreenivasulu
- International Rice Research Institute, Los Baños, Laguna, Philippines.
| |
Collapse
|
142
|
Carpenter MA, Goulden DS, Woods CJ, Thomson SJ, Kenel F, Frew TJ, Cooper RD, Timmerman-Vaughan GM. Genomic Selection for Ascochyta Blight Resistance in Pea. FRONTIERS IN PLANT SCIENCE 2018; 9:1878. [PMID: 30619430 PMCID: PMC6306417 DOI: 10.3389/fpls.2018.01878] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/25/2018] [Accepted: 12/05/2018] [Indexed: 05/02/2023]
Abstract
Genomic selection (GS) is a breeding tool, which is rapidly gaining popularity for plant breeding, particularly for traits that are difficult to measure. One such trait is ascochyta blight resistance in pea (Pisum sativum L.), which is difficult to assay because it is strongly influenced by the environment and depends on the natural occurrence of multiple pathogens. Here we report a study of the efficacy of GS for predicting ascochyta blight resistance in pea, as represented by ascochyta blight disease score (ASC), and using nucleotide polymorphism data acquired through genotyping-by-sequencing. The effects on prediction accuracy of different GS models and different thresholds for missing genotypic data (which modified the number of single nucleotide polymorphisms used in the analysis) were compared using cross-validation. Additionally, the inclusion of marker × environment interactions in a genomic best linear unbiased prediction (GBLUP) model was evaluated. Finally, different ways of combining trait data from two field trials using bivariate, spatial, and single-stage analyses were compared to results obtained using a mean value. The best prediction accuracy achieved for ASC was 0.56, obtained using GBLUP analysis with a mean value for ASC and data quality threshold of 70% (i.e., missing SNP data in <30% of lines). GBLUP and Bayesian Reproducing kernel Hilbert spaces regression (RKHS) performed slightly better than the other models trialed, whereas different missing data thresholds made minimal differences to prediction accuracy. The prediction accuracies of individual, randomly selected, testing/training partitions were highly variable, highlighting the effect that the choice of training population has on prediction accuracy. The inclusion of marker × environment interactions did not increase the prediction accuracy for lines which had not been phenotyped, but did improve the results of prediction across environments. GS is potentially useful for pea breeding programs pursuing ascochyta blight resistance, both for predicting breeding values for lines that have not been phenotyped, and for providing enhanced estimated breeding values for lines for which trait data is available.
Collapse
Affiliation(s)
- Margaret A. Carpenter
- The New Zealand Institute for Plant & Food Research Limited, Christchurch, New Zealand
| | | | | | | | | | | | | | | |
Collapse
|
143
|
Gerard D, Ferrão LFV, Garcia AAF, Stephens M. Genotyping Polyploids from Messy Sequencing Data. Genetics 2018; 210:789-807. [PMID: 30185430 PMCID: PMC6218231 DOI: 10.1534/genetics.118.301468] [Citation(s) in RCA: 80] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2018] [Accepted: 08/21/2018] [Indexed: 12/30/2022] Open
Abstract
Detecting and quantifying the differences in individual genomes (i.e., genotyping), plays a fundamental role in most modern bioinformatics pipelines. Many scientists now use reduced representation next-generation sequencing (NGS) approaches for genotyping. Genotyping diploid individuals using NGS is a well-studied field, and similar methods for polyploid individuals are just emerging. However, there are many aspects of NGS data, particularly in polyploids, that remain unexplored by most methods. Our contributions in this paper are fourfold: (i) We draw attention to, and then model, common aspects of NGS data: sequencing error, allelic bias, overdispersion, and outlying observations. (ii) Many datasets feature related individuals, and so we use the structure of Mendelian segregation to build an empirical Bayes approach for genotyping polyploid individuals. (iii) We develop novel models to account for preferential pairing of chromosomes, and harness these for genotyping. (iv) We derive oracle genotyping error rates that may be used for read depth suggestions. We assess the accuracy of our method in simulations, and apply it to a dataset of hexaploid sweet potato (Ipomoea batatas). An R package implementing our method is available at https://cran.r-project.org/package=updog.
Collapse
Affiliation(s)
- David Gerard
- Department of Mathematics and Statistics, American University, Washington, DC 20016
| | | | - Antonio Augusto Franco Garcia
- Department of Genetics, Luiz de Queiroz College of Agriculture, University of São Paulo, Piracicaba, 13418-900, Brazil
| | - Matthew Stephens
- Department of Human Genetics, University of Chicago, Illinois 60637
- Department of Statistics, University of Chicago, Illinois 60637
| |
Collapse
|
144
|
Rife TW, Graybosch RA, Poland JA. Genomic Analysis and Prediction within a US Public Collaborative Winter Wheat Regional Testing Nursery. THE PLANT GENOME 2018; 11. [PMID: 30512033 DOI: 10.3835/plantgenome2018.01.0004] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/13/2023]
Abstract
The development of inexpensive, whole-genome profiling enables a transition to allele-based breeding using genomic prediction models. These models consider alleles shared between lines to predict phenotypes and select new lines based on estimated breeding values. This approach can leverage highly unbalanced datasets that are common to breeding programs. The Southern Regional Performance Nursery (SRPN) is a public nursery established by the USDA-ARS in 1931 to characterize performance and quality of near-release wheat ( L.) varieties from breeding programs in the US Central Plains. New entries are submitted annually and can be re-entered only once. The trial is grown at >30 locations each year and lines are evaluated for grain yield, disease resistance, and agronomic traits. Overall genetic gain is measured across years by including common check cultivars for comparison. We have generated whole-genome profiles via genotyping-by-sequencing (GBS) for 939 SPRN entries dating back to 1992 to explore the potential use of the nursery as a genomic selection (GS) training population (TP). The GS prediction models across years (average = 0.33) outperformed year-to-year phenotypic correlation for yield ( = 0.27) for a majority of the years evaluated, suggesting that genomic selection has the potential to outperform low heritability selection on yield in these highly variable environments. We also examined the predictability of programs using both program-specific and whole-set TPs. Generally, the predictability of a program was similar with both approaches. These results suggest that wheat breeding programs can collaboratively leverage the immense datasets that are generated from regional testing networks.
Collapse
|
145
|
Ma W, Qiu Z, Song J, Li J, Cheng Q, Zhai J, Ma C. A deep convolutional neural network approach for predicting phenotypes from genotypes. PLANTA 2018; 248:1307-1318. [PMID: 30101399 DOI: 10.1007/s00425-018-2976-9] [Citation(s) in RCA: 79] [Impact Index Per Article: 13.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/09/2018] [Accepted: 07/11/2018] [Indexed: 05/21/2023]
Abstract
Deep learning is a promising technology to accurately select individuals with high phenotypic values based on genotypic data. Genomic selection (GS) is a promising breeding strategy by which the phenotypes of plant individuals are usually predicted based on genome-wide markers of genotypes. In this study, we present a deep learning method, named DeepGS, to predict phenotypes from genotypes. Using a deep convolutional neural network, DeepGS uses hidden variables that jointly represent features in genotypes when making predictions; it also employs convolution, sampling and dropout strategies to reduce the complexity of high-dimensional genotypic data. We used a large GS dataset to train DeepGS and compared its performance with other methods. The experimental results indicate that DeepGS can be used as a complement to the commonly used RR-BLUP in the prediction of phenotypes from genotypes. The complementarity between DeepGS and RR-BLUP can be utilized using an ensemble learning approach for more accurately selecting individuals with high phenotypic values, even for the absence of outlier individuals and subsets of genotypic markers. The source codes of DeepGS and the ensemble learning approach have been packaged into Docker images for facilitating their applications in different GS programs.
Collapse
Affiliation(s)
- Wenlong Ma
- State Key Laboratory of Crop Stress Biology for Arid Areas, Center of Bioinformatics, College of Life Sciences, Northwest A&F University, Yangling, 712100, Shaanxi, China
- Key Laboratory of Biology and Genetics Improvement of Maize in Arid Area of Northwest Region, Ministry of Agriculture, Northwest A&F University, Yangling, 712100, Shaanxi, China
| | - Zhixu Qiu
- State Key Laboratory of Crop Stress Biology for Arid Areas, Center of Bioinformatics, College of Life Sciences, Northwest A&F University, Yangling, 712100, Shaanxi, China
- Biomass Energy Center for Arid and Semi-arid Lands, Northwest A&F University, Shaanxi, 712100, Yangling, China
| | - Jie Song
- State Key Laboratory of Crop Stress Biology for Arid Areas, Center of Bioinformatics, College of Life Sciences, Northwest A&F University, Yangling, 712100, Shaanxi, China
- Key Laboratory of Biology and Genetics Improvement of Maize in Arid Area of Northwest Region, Ministry of Agriculture, Northwest A&F University, Yangling, 712100, Shaanxi, China
| | - Jiajia Li
- State Key Laboratory of Crop Stress Biology for Arid Areas, Center of Bioinformatics, College of Life Sciences, Northwest A&F University, Yangling, 712100, Shaanxi, China
- Biomass Energy Center for Arid and Semi-arid Lands, Northwest A&F University, Shaanxi, 712100, Yangling, China
| | - Qian Cheng
- State Key Laboratory of Crop Stress Biology for Arid Areas, Center of Bioinformatics, College of Life Sciences, Northwest A&F University, Yangling, 712100, Shaanxi, China
- Biomass Energy Center for Arid and Semi-arid Lands, Northwest A&F University, Shaanxi, 712100, Yangling, China
| | - Jingjing Zhai
- State Key Laboratory of Crop Stress Biology for Arid Areas, Center of Bioinformatics, College of Life Sciences, Northwest A&F University, Yangling, 712100, Shaanxi, China
- Key Laboratory of Biology and Genetics Improvement of Maize in Arid Area of Northwest Region, Ministry of Agriculture, Northwest A&F University, Yangling, 712100, Shaanxi, China
| | - Chuang Ma
- State Key Laboratory of Crop Stress Biology for Arid Areas, Center of Bioinformatics, College of Life Sciences, Northwest A&F University, Yangling, 712100, Shaanxi, China.
- Key Laboratory of Biology and Genetics Improvement of Maize in Arid Area of Northwest Region, Ministry of Agriculture, Northwest A&F University, Yangling, 712100, Shaanxi, China.
| |
Collapse
|
146
|
Quero G, Gutiérrez L, Monteverde E, Blanco P, Pérez de Vida F, Rosas J, Fernández S, Garaycochea S, McCouch S, Berberian N, Simondi S, Bonnecarrère V. Genome-Wide Association Study Using Historical Breeding Populations Discovers Genomic Regions Involved in High-Quality Rice. THE PLANT GENOME 2018; 11:170076. [PMID: 30512035 DOI: 10.3835/plantgenome2017.08.0076] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
Rice ( L.) is one of the most important staple food crops in the world; however, there has recently been a shift in consumer demand for higher grain quality. Therefore, understanding the genetic architecture of grain quality has become a key objective of rice breeding programs. Genome-wide association studies (GWAS) using large diversity panels have successfully identified genomic regions associated with complex traits in diverse crop species. Our main objective was to identify genomic regions associated with grain quality and to identify and characterize favorable haplotypes for selection. We used two locally adapted rice breeding populations and historical phenotypic data for three rice quality traits: yield after milling, percentage of head rice recovery, and percentage of chalky grain. We detected 22 putative quantitative trait loci (QTL) in the same genomic regions as starch synthesis, starch metabolism, and cell wall synthesis-related genes are found. Additionally, we found a genomic region on chromosome 6 in the population that was associated with all quality traits and we identified favorable haplotypes. Furthermore, this region is linked to the gene that codes for a starch branching enzyme I, which is implicated in starch granule formation. In , we also found two putative QTL linked to , , and . Our study provides an insight into the genetic basis of rice grain chalkiness, yield after milling, and head rice, identifying favorable haplotypes and molecular markers for selection in breeding programs.
Collapse
|
147
|
Liu R, Gong J, Xiao X, Zhang Z, Li J, Liu A, Lu Q, Shang H, Shi Y, Ge Q, Iqbal MS, Deng X, Li S, Pan J, Duan L, Zhang Q, Jiang X, Zou X, Hafeez A, Chen Q, Geng H, Gong W, Yuan Y. GWAS Analysis and QTL Identification of Fiber Quality Traits and Yield Components in Upland Cotton Using Enriched High-Density SNP Markers. FRONTIERS IN PLANT SCIENCE 2018; 9:1067. [PMID: 30283462 PMCID: PMC6157485 DOI: 10.3389/fpls.2018.01067] [Citation(s) in RCA: 54] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/03/2018] [Accepted: 07/02/2018] [Indexed: 05/18/2023]
Abstract
It is of great importance to identify quantitative trait loci (QTL) controlling fiber quality traits and yield components for future marker-assisted selection (MAS) and candidate gene function identifications. In this study, two kinds of traits in 231 F6:8 recombinant inbred lines (RILs), derived from an intraspecific cross between Xinluzao24, a cultivar with elite fiber quality, and Lumianyan28, a cultivar with wide adaptability and high yield potential, were measured in nine environments. This RIL population was genotyped by 122 SSR and 4729 SNP markers, which were also used to construct the genetic map. The map covered 2477.99 cM of hirsutum genome, with an average marker interval of 0.51 cM between adjacent markers. As a result, a total of 134 QTLs for fiber quality traits and 122 QTLs for yield components were detected, with 2.18-24.45 and 1.68-28.27% proportions of the phenotypic variance explained by each QTL, respectively. Among these QTLs, 57 were detected in at least two environments, named stable QTLs. A total of 209 and 139 quantitative trait nucleotides (QTNs) were associated with fiber quality traits and yield components by four multilocus genome-wide association studies methods, respectively. Among these QTNs, 74 were detected by at least two algorithms or in two environments. The candidate genes harbored by 57 stable QTLs were compared with the ones associated with QTN, and 35 common candidate genes were found. Among these common candidate genes, four were possibly "pleiotropic." This study provided important information for MAS and candidate gene functional studies.
Collapse
Affiliation(s)
- Ruixian Liu
- Xinjiang Research Base, State Key Laboratory of Cotton Biology, Xinjiang Agricultural University, Urumqi, China
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
| | - Juwu Gong
- Xinjiang Research Base, State Key Laboratory of Cotton Biology, Xinjiang Agricultural University, Urumqi, China
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
| | - Xianghui Xiao
- Xinjiang Research Base, State Key Laboratory of Cotton Biology, Xinjiang Agricultural University, Urumqi, China
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
| | - Zhen Zhang
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
| | - Junwen Li
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
| | - Aiying Liu
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
| | - Quanwei Lu
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
- School of Biotechnology and Food Engineering, Anyang Institute of Technology, Anyang, China
| | - Haihong Shang
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
| | - Yuzhen Shi
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
| | - Qun Ge
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
| | - Muhammad S. Iqbal
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
| | - Xiaoying Deng
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
| | - Shaoqi Li
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
| | - Jingtao Pan
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
| | - Li Duan
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
| | - Qi Zhang
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
| | - Xiao Jiang
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
| | - Xianyan Zou
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
| | - Abdul Hafeez
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
| | - Quanjia Chen
- Xinjiang Research Base, State Key Laboratory of Cotton Biology, Xinjiang Agricultural University, Urumqi, China
| | - Hongwei Geng
- Xinjiang Research Base, State Key Laboratory of Cotton Biology, Xinjiang Agricultural University, Urumqi, China
| | - Wankui Gong
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
| | - Youlu Yuan
- Xinjiang Research Base, State Key Laboratory of Cotton Biology, Xinjiang Agricultural University, Urumqi, China
- State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
| |
Collapse
|
148
|
Covarrubias-Pazaran G, Schlautman B, Diaz-Garcia L, Grygleski E, Polashock J, Johnson-Cicalese J, Vorsa N, Iorizzo M, Zalapa J. Multivariate GBLUP Improves Accuracy of Genomic Selection for Yield and Fruit Weight in Biparental Populations of Vaccinium macrocarpon Ait. FRONTIERS IN PLANT SCIENCE 2018; 9:1310. [PMID: 30258453 PMCID: PMC6144488 DOI: 10.3389/fpls.2018.01310] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/19/2018] [Accepted: 08/20/2018] [Indexed: 05/10/2023]
Abstract
The development of high-throughput genotyping has made genome-wide association (GWAS) and genomic selection (GS) applications possible for both model and non-model species. The exploitation of genome-assisted approaches could greatly benefit breeding efforts in American cranberry (Vaccinium macrocarpon) and other minor crops. Using biparental populations with different degrees of relatedness, we evaluated multiple GS methods for total yield (TY) and mean fruit weight (MFW). Specifically, we compared predictive ability (PA) differences between univariate and multivariate genomic best linear unbiased predictors (GBLUP and MGBLUP, respectively). We found that MGBLUP provided higher predictive ability (PA) than GBLUP, in scenarios with medium genetic correlation (8-17% increase with corg~0.6) and high genetic correlations (25-156% with corg~0.9), but found no increase when genetic correlation was low. In addition, we found that only a few hundred single nucleotide polymorphism (SNP) markers are needed to reach a plateau in PA for both traits in the biparental populations studied (in full linkage disequilibrium). We observed that higher resemblance among individuals in the training (TP) and validation (VP) populations provided greater PA. Although multivariate GS methods are available, genetic correlations and other factors need to be carefully considered when applying these methods for genetic improvement.
Collapse
Affiliation(s)
| | | | - Luis Diaz-Garcia
- Department of Horticulture, University of Wisconsin Madison, Madison, WI, United States
- Instituto Nacional de Investigaciones, Forestales, Agrícolas y Pecuarias, Campo Experimental Pabellón, Aguascalientes, Mexico
| | | | - James Polashock
- Genetic Improvement of Fruits and Vegetables Laboratory, USDA-ARS, Chatsworth, NJ, United States
| | - Jennifer Johnson-Cicalese
- Blueberry and Cranberry Research and Extension Center, Rutgers University, Chatsworth, NJ, United States
| | - Nicholi Vorsa
- Blueberry and Cranberry Research and Extension Center, Rutgers University, Chatsworth, NJ, United States
| | - Massimo Iorizzo
- Department of Horticulture Sciences, Plants for Human Health Institute, North Carolina State University, Kannapolis, NC, United States
| | - Juan Zalapa
- Vegetable Crops Research Unit, USDA-ARS, University of Wisconsin, Madison, WI, United States
| |
Collapse
|
149
|
Hiraoka Y, Fukatsu E, Mishima K, Hirao T, Teshima KM, Tamura M, Tsubomura M, Iki T, Kurita M, Takahashi M, Watanabe A. Potential of Genome-Wide Studies in Unrelated Plus Trees of a Coniferous Species, Cryptomeria japonica (Japanese Cedar). FRONTIERS IN PLANT SCIENCE 2018; 9:1322. [PMID: 30254658 PMCID: PMC6141754 DOI: 10.3389/fpls.2018.01322] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/24/2018] [Accepted: 08/22/2018] [Indexed: 06/08/2023]
Abstract
A genome-wide association study (GWAS) was conducted on more than 30,000 single nucleotide polymorphisms (SNPs) in unrelated first-generation plus tree genotypes from three populations of Japanese cedar Cryptomeria japonica D. Don with genomic prediction for traits of growth, wood properties and male fecundity. Among the assessed populations, genetic characteristics including the extent of linkage disequilibrium (LD) and genetic structure differed and these differences are considered to be due to differences in genetic background. Through population-independent GWAS, several significant SNPs found close to the regions associated with each of these traits and shared in common across the populations were identified. The accuracies of genomic predictions were dependent on the traits and populations and reflected the genetic architecture of traits and genetic characteristics. Prediction accuracies using SNPs selected based on GWAS results were similar to those using all SNPs for several combinations of traits and populations. We discussed the application of genome-wide studies for C. japonica improvement.
Collapse
Affiliation(s)
- Yuichiro Hiraoka
- Forest Tree Breeding Center, Forestry and Forest Products Research Institute, Hitachi, Japan
| | - Eitaro Fukatsu
- Kyushu Regional Breeding Office, Forest Tree Breeding Center, Forestry and Forest Products Research Institute, Kumamoto, Japan
| | - Kentaro Mishima
- Forest Tree Breeding Center, Forestry and Forest Products Research Institute, Hitachi, Japan
| | - Tomonori Hirao
- Forest Bio-Research Center, Forestry and Forest Products Research Institute, Hitachi, Japan
| | | | - Miho Tamura
- Faculty of Agriculture, Kyushu University, Fukuoka, Japan
| | - Miyoko Tsubomura
- Forest Tree Breeding Center, Forestry and Forest Products Research Institute, Hitachi, Japan
| | - Taiichi Iki
- Tohoku Regional Breeding Office, Forest Tree Breeding Center, Forestry and Forest Products Research Institute, Takizawa, Japan
| | - Manabu Kurita
- Kyushu Regional Breeding Office, Forest Tree Breeding Center, Forestry and Forest Products Research Institute, Kumamoto, Japan
| | - Makoto Takahashi
- Forest Tree Breeding Center, Forestry and Forest Products Research Institute, Hitachi, Japan
| | | |
Collapse
|
150
|
Voss-Fels KP, Qian L, Gabur I, Obermeier C, Hickey LT, Werner CR, Kontowski S, Frisch M, Friedt W, Snowdon RJ, Gottwald S. Genetic insights into underground responses to Fusarium graminearum infection in wheat. Sci Rep 2018; 8:13153. [PMID: 30177750 PMCID: PMC6120866 DOI: 10.1038/s41598-018-31544-w] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2018] [Accepted: 08/19/2018] [Indexed: 11/16/2022] Open
Abstract
The ongoing global intensification of wheat production will likely be accompanied by a rising pressure of Fusarium diseases. While utmost attention was given to Fusarium head blight (FHB) belowground plant infections of the pathogen have largely been ignored. The current knowledge about the impact of soil borne Fusarium infection on plant performance and the underlying genetic mechanisms for resistance remain very limited. Here, we present the first large-scale investigation of Fusarium root rot (FRR) resistance using a diverse panel of 215 international wheat lines. We obtained data for a total of 21 resistance-related traits, including large-scale Real-time PCR experiments to quantify fungal spread. Association mapping and subsequent haplotype analyses discovered a number of highly conserved genomic regions associated with resistance, and revealed a significant effect of allele stacking on the stembase discoloration. Resistance alleles were accumulated in European winter wheat germplasm, implying indirect prior selection for improved FRR resistance in elite breeding programs. Our results give first insights into the genetic basis of FRR resistance in wheat and demonstrate how molecular parameters can successfully be explored in genomic prediction. Ongoing work will help to further improve our understanding of the complex interactions of genetic factors influencing FRR resistance.
Collapse
Affiliation(s)
- Kai P Voss-Fels
- Department of Plant Breeding, IFZ Research Centre for Biosystems, Land Use and Nutrition, Justus Liebig University, Heinrich-Buff-Ring 26-32, 35392, Giessen, Germany.
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia, QLD, 4072, Australia.
| | - Lunwen Qian
- Department of Plant Breeding, IFZ Research Centre for Biosystems, Land Use and Nutrition, Justus Liebig University, Heinrich-Buff-Ring 26-32, 35392, Giessen, Germany
- Collaborative Innovation Center of Grain and Oil Crops in South China, Hunan Agricultural University, Changsha, 410128, P.R. China
| | - Iulian Gabur
- Department of Plant Breeding, IFZ Research Centre for Biosystems, Land Use and Nutrition, Justus Liebig University, Heinrich-Buff-Ring 26-32, 35392, Giessen, Germany
| | - Christian Obermeier
- Department of Plant Breeding, IFZ Research Centre for Biosystems, Land Use and Nutrition, Justus Liebig University, Heinrich-Buff-Ring 26-32, 35392, Giessen, Germany
| | - Lee T Hickey
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia, QLD, 4072, Australia
| | - Christian R Werner
- Department of Plant Breeding, IFZ Research Centre for Biosystems, Land Use and Nutrition, Justus Liebig University, Heinrich-Buff-Ring 26-32, 35392, Giessen, Germany
| | - Stefan Kontowski
- W. von Borries-Eckendorf GmbH & Co. KG, Hovedisser Str. 92, 33818, Leopoldshöhe, Germany
| | - Matthias Frisch
- Institute for Agronomy and Plant Breeding II, IFZ Research Centre for Biosystems, Land Use and Nutrition, Justus Liebig University, Heinrich-Buff-Ring 26-32, 35392, Giessen, Germany
| | - Wolfgang Friedt
- Department of Plant Breeding, IFZ Research Centre for Biosystems, Land Use and Nutrition, Justus Liebig University, Heinrich-Buff-Ring 26-32, 35392, Giessen, Germany
| | - Rod J Snowdon
- Department of Plant Breeding, IFZ Research Centre for Biosystems, Land Use and Nutrition, Justus Liebig University, Heinrich-Buff-Ring 26-32, 35392, Giessen, Germany
| | - Sven Gottwald
- Department of Plant Breeding, IFZ Research Centre for Biosystems, Land Use and Nutrition, Justus Liebig University, Heinrich-Buff-Ring 26-32, 35392, Giessen, Germany
| |
Collapse
|