1
|
Tello J, Ibáñez J. Review: Status and prospects of association mapping in grapevine. PLANT SCIENCE : AN INTERNATIONAL JOURNAL OF EXPERIMENTAL PLANT BIOLOGY 2023; 327:111539. [PMID: 36410567 DOI: 10.1016/j.plantsci.2022.111539] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/04/2022] [Revised: 11/15/2022] [Accepted: 11/17/2022] [Indexed: 06/16/2023]
Abstract
Thanks to current advances in sequencing technologies, novel bioinformatics tools, and efficient modeling solutions, association mapping has become a widely accepted approach to unravel the link between genotype and phenotype diversity in numerous crops. In grapevine, this strategy has been used in the last decades to understand the genetic basis of traits of agronomic interest (fruit quality, crop yield, biotic and abiotic resistance), of special relevance nowadays to improve crop resilience to cope with future climate scenarios. Genome-wide association studies have identified many putative causative loci for different traits, some of them overlapping well-known causal genes identified by conventional quantitative trait loci studies in biparental progenies, and/or validated by functional approaches. In addition, candidate-gene association studies have been useful to pinpoint the causal mutation underlying phenotypic variation for several traits of high interest in breeding programs (like berry color, seedlessness, and muscat flavor), information that has been used to develop highly informative and useful markers already in use in marker-assisted selection processes. Thus, association mapping has proved to represent a valuable step towards high quality and sustainable grape production. This review summarizes current applications of association mapping in grapevine research and discusses future prospects in view of current viticulture challenges.
Collapse
Affiliation(s)
- Javier Tello
- Instituto de Ciencias de la Vid y del Vino (CSIC, UR, Gobierno de La Rioja), Logroño 26007, Spain.
| | - Javier Ibáñez
- Instituto de Ciencias de la Vid y del Vino (CSIC, UR, Gobierno de La Rioja), Logroño 26007, Spain
| |
Collapse
|
2
|
Meng Y, Du Q, Du H, Wang Q, Wang L, Du L, Liu P. Analysis of chemotypes and their markers in leaves of core collections of Eucommia ulmoides using metabolomics. FRONTIERS IN PLANT SCIENCE 2023; 13:1029907. [PMID: 36699853 PMCID: PMC9868706 DOI: 10.3389/fpls.2022.1029907] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/28/2022] [Accepted: 12/02/2022] [Indexed: 06/17/2023]
Abstract
The leaves of Eucommia ulmoides contain various active compunds and nutritional components, and have successively been included as raw materials in the Chinese Pharmacopoeia, the Health Food Raw Material Catalogue, and the Feed Raw Material Catalogue. Core collections of E. ulmoides had been constructed from the conserved germplasm resources basing on molecular markers and morphological traits, however, the metabolite diversity and variation in this core population were little understood. Metabolite profiles of E. ulmoides leaves of 193 core collections were comprehensively characterized by GC-MS and LC-MS/MS based non-targeted metabolomics in present study. Totally 1,100 metabolites were identified and that belonged to 18 categories, and contained 120 active ingredients for traditional Chinese medicine (TCM) and 85 disease-resistant metabolites. Four leaf chemotypes of the core collections were established by integrated uses of unsupervised self-organizing map (SOM), supervised orthogonal partial least squares discriminant analysis (OPLS-DA) and random forest (RF) statistical methods, 30, 23, 43, and 23 chemomarkers were screened corresponding to the four chemotypes, respectively. The morphological markers for the chemotypes were obtained by weighted gene co-expression network analysis (WGCNA) between the chenomarkers and the morphological traits, with leaf length (LL), chlorophyll reference value (CRV), leaf dentate height (LDH), and leaf thickness (LT) corresponding to chemotypes I, II, III, and IV, respectively. Contents of quercetin-3-O-pentosidine, isoquercitrin were closely correlated to LL, leaf area (LA), and leaf perimeter (LP), suggesting the quercetin derivatives might influence the growth and development of E. ulmoides leaf shape.
Collapse
Affiliation(s)
- Yide Meng
- Research Institute of Non-Timber Forestry, Chinese Academy of Forestry, Zhengzhou, China
| | - Qingxin Du
- Research Institute of Non-Timber Forestry, Chinese Academy of Forestry, Zhengzhou, China
| | - Hongyan Du
- Research Institute of Non-Timber Forestry, Chinese Academy of Forestry, Zhengzhou, China
| | - Qi Wang
- Research Institute of Non-Timber Forestry, Chinese Academy of Forestry, Zhengzhou, China
| | - Lu Wang
- Research Institute of Non-Timber Forestry, Chinese Academy of Forestry, Zhengzhou, China
| | - Lanying Du
- Research Institute of Non-Timber Forestry, Chinese Academy of Forestry, Zhengzhou, China
| | - Panfeng Liu
- Research Institute of Non-Timber Forestry, Chinese Academy of Forestry, Zhengzhou, China
- Key Laboratory of Non-timber Forest Germplasm Enhancement & Utilization of National Forestry and Grassland Administration, Zhengzhou, China
| |
Collapse
|
3
|
Mahmood U, Li X, Fan Y, Chang W, Niu Y, Li J, Qu C, Lu K. Multi-omics revolution to promote plant breeding efficiency. FRONTIERS IN PLANT SCIENCE 2022; 13:1062952. [PMID: 36570904 PMCID: PMC9773847 DOI: 10.3389/fpls.2022.1062952] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/06/2022] [Accepted: 11/24/2022] [Indexed: 06/17/2023]
Abstract
Crop production is the primary goal of agricultural activities, which is always taken into consideration. However, global agricultural systems are coming under increasing pressure from the rising food demand of the rapidly growing world population and changing climate. To address these issues, improving high-yield and climate-resilient related-traits in crop breeding is an effective strategy. In recent years, advances in omics techniques, including genomics, transcriptomics, proteomics, and metabolomics, paved the way for accelerating plant/crop breeding to cope with the changing climate and enhance food production. Optimized omics and phenotypic plasticity platform integration, exploited by evolving machine learning algorithms will aid in the development of biological interpretations for complex crop traits. The precise and progressive assembly of desire alleles using precise genome editing approaches and enhanced breeding strategies would enable future crops to excel in combating the changing climates. Furthermore, plant breeding and genetic engineering ensures an exclusive approach to developing nutrient sufficient and climate-resilient crops, the productivity of which can sustainably and adequately meet the world's food, nutrition, and energy needs. This review provides an overview of how the integration of omics approaches could be exploited to select crop varieties with desired traits.
Collapse
Affiliation(s)
- Umer Mahmood
- Integrative Science Center of Germplasm Creation in Western China (Chongqing) Science City and Southwest University, College of Agronomy and Biotechnology, Southwest University, Chongqing, China
| | - Xiaodong Li
- Integrative Science Center of Germplasm Creation in Western China (Chongqing) Science City and Southwest University, College of Agronomy and Biotechnology, Southwest University, Chongqing, China
| | - Yonghai Fan
- Integrative Science Center of Germplasm Creation in Western China (Chongqing) Science City and Southwest University, College of Agronomy and Biotechnology, Southwest University, Chongqing, China
| | - Wei Chang
- Integrative Science Center of Germplasm Creation in Western China (Chongqing) Science City and Southwest University, College of Agronomy and Biotechnology, Southwest University, Chongqing, China
| | - Yue Niu
- Integrative Science Center of Germplasm Creation in Western China (Chongqing) Science City and Southwest University, College of Agronomy and Biotechnology, Southwest University, Chongqing, China
| | - Jiana Li
- Integrative Science Center of Germplasm Creation in Western China (Chongqing) Science City and Southwest University, College of Agronomy and Biotechnology, Southwest University, Chongqing, China
- Academy of Agricultural Sciences, Southwest University, Chongqing, China
- Engineering Research Center of South Upland Agriculture, Ministry of Education, Chongqing, China
| | - Cunmin Qu
- Integrative Science Center of Germplasm Creation in Western China (Chongqing) Science City and Southwest University, College of Agronomy and Biotechnology, Southwest University, Chongqing, China
- Academy of Agricultural Sciences, Southwest University, Chongqing, China
- Engineering Research Center of South Upland Agriculture, Ministry of Education, Chongqing, China
| | - Kun Lu
- Integrative Science Center of Germplasm Creation in Western China (Chongqing) Science City and Southwest University, College of Agronomy and Biotechnology, Southwest University, Chongqing, China
- Academy of Agricultural Sciences, Southwest University, Chongqing, China
- Engineering Research Center of South Upland Agriculture, Ministry of Education, Chongqing, China
| |
Collapse
|
4
|
Eragam A, Shukla V, Kola VS, Latha P, Akkareddy S, Kommana ML, Ramireddy E, Vemireddy LR. Yield-associated putative gene regulatory networks in Oryza sativa L. subsp. indica and their association with high-yielding genotypes. Mol Biol Rep 2022; 49:7649-7663. [PMID: 35612779 DOI: 10.1007/s11033-022-07581-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2022] [Accepted: 05/06/2022] [Indexed: 11/26/2022]
Abstract
BACKGROUND With the increase in population and economies of developing countries in Asia and Africa, the research towards securing future food demands is an imminent need. Among japonica and indica genotypes, indica rice varieties are largely cultivated across the globe. However, our present understanding of yield-contributing gene information stems mainly from japonica and studies on the yield potential of indica genotypes are limited. METHODS AND RESULTS In the present study, yield contributing orthologous genes previously characterized from japonica varieties were identified in the indica genome and analysed with binGO tool for GO biological processes categorization. Transcription factor binding site enrichment analysis in the promoters of yield-related genes of indica was performed with MEME-AME tool that revealed putative common TF regulators are enriched in flower development, two-component signalling and water deprivation biological processes. Gene regulatory networks revealed important TF-target interactions that might govern yield-related traits. Some of the identified candidate genes were validated by qRT-PCR analysis for their expression and association with yield-related traits among 16 widely cultivated popular indica genotypes. Further, SNP-metabolite-trait association analysis was performed using high-yielding indica variety Rasi. This resulted in the identification of putative SNP variations in TF regulators and targeted yield genes significantly linked with metabolite accumulation. CONCLUSIONS The study suggests some of the high yielding indica genotypes such as Ravi003, Rasi and Kavya could be used as potential donors in breeding programs based on yield gene expression analysis and SNP-metabolites associations.
Collapse
Affiliation(s)
- Aparna Eragam
- Department of Molecular Biology and Biotechnology, S.V. Agricultural College, Acharya NG Ranga Agricultural University (ANGRAU), Tirupati, 517502, India
- Biology Division, Indian Institute of Science Education and Research Tirupati (IISER Tirupati), Tirupati, 517507, India
| | - Vishnu Shukla
- Biology Division, Indian Institute of Science Education and Research Tirupati (IISER Tirupati), Tirupati, 517507, India
| | - Vijaya Sudhakararao Kola
- Biology Division, Indian Institute of Science Education and Research Tirupati (IISER Tirupati), Tirupati, 517507, India
| | - P Latha
- Regional Agricultural Research Station (RARS), ANGRAU, Tirupati, India
| | | | - Madhavi L Kommana
- Department of Genetics and Plant Breeding, S.V. Agricultural College, Acharya NG Ranga Agricultural University (ANGRAU), Tirupati, 517502, India
| | - Eswarayya Ramireddy
- Biology Division, Indian Institute of Science Education and Research Tirupati (IISER Tirupati), Tirupati, 517507, India.
| | - Lakshminarayana R Vemireddy
- Department of Molecular Biology and Biotechnology, S.V. Agricultural College, Acharya NG Ranga Agricultural University (ANGRAU), Tirupati, 517502, India.
| |
Collapse
|
5
|
Iqbal Z, Iqbal MS, Khan MIR, Ansari MI. Toward Integrated Multi-Omics Intervention: Rice Trait Improvement and Stress Management. FRONTIERS IN PLANT SCIENCE 2021; 12:741419. [PMID: 34721467 PMCID: PMC8554098 DOI: 10.3389/fpls.2021.741419] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/14/2021] [Accepted: 09/20/2021] [Indexed: 05/04/2023]
Abstract
Rice (Oryza sativa) is an imperative staple crop for nearly half of the world's population. Challenging environmental conditions encompassing abiotic and biotic stresses negatively impact the quality and yield of rice. To assure food supply for the unprecedented ever-growing world population, the improvement of rice as a crop is of utmost importance. In this era, "omics" techniques have been comprehensively utilized to decipher the regulatory mechanisms and cellular intricacies in rice. Advancements in omics technologies have provided a strong platform for the reliable exploration of genetic resources involved in rice trait development. Omics disciplines like genomics, transcriptomics, proteomics, and metabolomics have significantly contributed toward the achievement of desired improvements in rice under optimal and stressful environments. The present review recapitulates the basic and applied multi-omics technologies in providing new orchestration toward the improvement of rice desirable traits. The article also provides a catalog of current scenario of omics applications in comprehending this imperative crop in relation to yield enhancement and various environmental stresses. Further, the appropriate databases in the field of data science to analyze big data, and retrieve relevant information vis-à-vis rice trait improvement and stress management are described.
Collapse
Affiliation(s)
- Zahra Iqbal
- Molecular Crop Research Unit, Department of Biochemistry, Chulalongkorn University, Bangkok, Thailand
| | | | | | | |
Collapse
|
6
|
Metabolomics for Crop Breeding: General Considerations. Genes (Basel) 2021; 12:genes12101602. [PMID: 34680996 PMCID: PMC8535592 DOI: 10.3390/genes12101602] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2021] [Revised: 10/05/2021] [Accepted: 10/08/2021] [Indexed: 12/16/2022] Open
Abstract
The development of new, more productive varieties of agricultural crops is becoming an increasingly difficult task. Modern approaches for the identification of beneficial alleles and their use in elite cultivars, such as quantitative trait loci (QTL) mapping and marker-assisted selection (MAS), are effective but insufficient for keeping pace with the improvement of wheat or other crops. Metabolomics is a powerful but underutilized approach that can assist crop breeding. In this review, basic methodological information is summarized, and the current strategies of applications of metabolomics related to crop breeding are explored using recent examples. We briefly describe classes of plant metabolites, cellular localization of metabolic pathways, and the strengths and weaknesses of the main metabolomics technique. Among the commercialized genetically modified crops, about 50 with altered metabolic enzyme activities have been identified in the International Service for the Acquisition of Agri-biotech Applications (ISAAA) database. These plants are reviewed as encouraging examples of the application of knowledge of biochemical pathways. Based on the recent examples of metabolomic studies, we discuss the performance of metabolic markers, the integration of metabolic and genomic data in metabolic QTLs (mQTLs) and metabolic genome-wide association studies (mGWAS). The elucidation of metabolic pathways and involved genes will help in crop breeding and the introgression of alleles of wild relatives in a more targeted manner.
Collapse
|
7
|
Kang H, Zhao D, Xiang H, Li J, Zhao G, Li H. Large-scale transcriptome sequencing in broiler chickens to identify candidate genes for breast muscle weight and intramuscular fat content. Genet Sel Evol 2021; 53:66. [PMID: 34399688 PMCID: PMC8369645 DOI: 10.1186/s12711-021-00656-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2021] [Accepted: 07/15/2021] [Indexed: 01/10/2023] Open
Abstract
BACKGROUND In broiler production, breast muscle weight and intramuscular fat (IMF) content are important economic traits. Understanding the genetic mechanisms that underlie these traits is essential to implement effective genetic improvement programs. To date, genome-wide association studies (GWAS) and gene expression analyses have been performed to identify candidate genes for these traits. However, GWAS mainly detect associations at the DNA level, while differential expression analyses usually have low power because they are typically based on small sample sizes. To detect candidate genes for breast muscle weight and IMF contents (intramuscular fat percentage and relative content of triglycerides, cholesterol, and phospholipids), we performed association analyses based on breast muscle transcriptomic data on approximately 400 Tiannong partridge chickens at slaughter age. RESULTS First, by performing an extensive simulation study, we evaluated the statistical properties of association analyses of gene expression levels and traits based on the linear mixed model (LMM) and three regularized linear regression models, i.e., least absolute shrinkage and selection operator (LASSO), ridge regression (RR), and elastic net (EN). The results show that LMM, LASSO and EN with tuning parameters that are determined based on the one standard error rule exhibited the lowest type I error rates. Using results from all three models, we detected 43 candidate genes with expression levels that were associated with breast muscle weight. In addition, candidate genes were detected for intramuscular fat percentage (1), triglyceride content (2), cholesterol content (1), and phospholipid content (1). Many of the identified genes have been demonstrated to play roles in the development and metabolism of skeletal muscle or adipocyte. Moreover, weighted gene co-expression network analyses revealed that many candidate genes were harbored by gene co-expression modules, which were also significantly correlated with the traits of interest. The results of Gene Ontology and Kyoto Encyclopedia of Genes and Genomes enrichment analyses indicated that these modules are involved in muscle development and contraction, and in lipid metabolism. CONCLUSIONS Our study provides valuable insight into the transcriptomic bases of breast muscle weight and IMF contents in Chinese indigenous yellow broilers. Our findings could be useful for the genetic improvement of these traits in broiler chickens.
Collapse
Affiliation(s)
- Huimin Kang
- Guangdong Provincial Key Laboratory of Animal Molecular Design and Precise Breeding; Key Laboratory of Animal Molecular Design and Precise Breeding of Guangdong Higher Education Institutes; School of Life Science and Engineering, Foshan University, #33 Guang-yun-lu, Shishan, Nanhai, Foshan, 528231, Guangdong, People's Republic of China
| | - Di Zhao
- Guangdong Provincial Key Laboratory of Animal Molecular Design and Precise Breeding; Key Laboratory of Animal Molecular Design and Precise Breeding of Guangdong Higher Education Institutes; School of Life Science and Engineering, Foshan University, #33 Guang-yun-lu, Shishan, Nanhai, Foshan, 528231, Guangdong, People's Republic of China
| | - Hai Xiang
- Guangdong Provincial Key Laboratory of Animal Molecular Design and Precise Breeding; Key Laboratory of Animal Molecular Design and Precise Breeding of Guangdong Higher Education Institutes; School of Life Science and Engineering, Foshan University, #33 Guang-yun-lu, Shishan, Nanhai, Foshan, 528231, Guangdong, People's Republic of China
| | - Jing Li
- Guangdong Provincial Key Laboratory of Animal Molecular Design and Precise Breeding; Key Laboratory of Animal Molecular Design and Precise Breeding of Guangdong Higher Education Institutes; School of Life Science and Engineering, Foshan University, #33 Guang-yun-lu, Shishan, Nanhai, Foshan, 528231, Guangdong, People's Republic of China
| | - Guiping Zhao
- Guangdong Provincial Key Laboratory of Animal Molecular Design and Precise Breeding; Key Laboratory of Animal Molecular Design and Precise Breeding of Guangdong Higher Education Institutes; School of Life Science and Engineering, Foshan University, #33 Guang-yun-lu, Shishan, Nanhai, Foshan, 528231, Guangdong, People's Republic of China. .,Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, No. 2 Yuanmingyuan West Road, Beijing, 100193, People's Republic of China.
| | - Hua Li
- Guangdong Provincial Key Laboratory of Animal Molecular Design and Precise Breeding; Key Laboratory of Animal Molecular Design and Precise Breeding of Guangdong Higher Education Institutes; School of Life Science and Engineering, Foshan University, #33 Guang-yun-lu, Shishan, Nanhai, Foshan, 528231, Guangdong, People's Republic of China. .,Guangdong Tinoo's Foods Group Co., Ltd, Jiangkou, Feilaixia, Qingcheng, Qingyuan, 511827, Guangdong, People's Republic of China.
| |
Collapse
|
8
|
Zhang J, Chen M, Wen Y, Zhang Y, Lu Y, Wang S, Chen J. A Fast Multi-Locus Ridge Regression Algorithm for High-Dimensional Genome-Wide Association Studies. Front Genet 2021; 12:649196. [PMID: 33854527 PMCID: PMC8041068 DOI: 10.3389/fgene.2021.649196] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2021] [Accepted: 03/01/2021] [Indexed: 11/13/2022] Open
Abstract
The mixed linear model (MLM) has been widely used in genome-wide association study (GWAS) to dissect quantitative traits in human, animal, and plant genetics. Most methodologies consider all single nucleotide polymorphism (SNP) effects as random effects under the MLM framework, which fail to detect the joint minor effect of multiple genetic markers on a trait. Therefore, polygenes with minor effects remain largely unexplored in today’s big data era. In this study, we developed a new algorithm under the MLM framework, which is called the fast multi-locus ridge regression (FastRR) algorithm. The FastRR algorithm first whitens the covariance matrix of the polygenic matrix K and environmental noise, then selects potentially related SNPs among large scale markers, which have a high correlation with the target trait, and finally analyzes the subset variables using a multi-locus deshrinking ridge regression for true quantitative trait nucleotide (QTN) detection. Results from the analyses of both simulated and real data show that the FastRR algorithm is more powerful for both large and small QTN detection, more accurate in QTN effect estimation, and has more stable results under various polygenic backgrounds. Moreover, compared with existing methods, the FastRR algorithm has the advantage of high computing speed. In conclusion, the FastRR algorithm provides an alternative algorithm for multi-locus GWAS in high dimensional genomic datasets.
Collapse
Affiliation(s)
- Jin Zhang
- College of Science, Nanjing Agricultural University, Nanjing, China.,Postdoctoral Research Station of Crop Science, Nanjing Agricultural University, Nanjing, China
| | - Min Chen
- College of Science, Nanjing Agricultural University, Nanjing, China
| | - Yangjun Wen
- College of Science, Nanjing Agricultural University, Nanjing, China
| | - Yin Zhang
- College of Science, Nanjing Agricultural University, Nanjing, China
| | - Yunan Lu
- College of Science, Nanjing Agricultural University, Nanjing, China
| | - Shengmeng Wang
- College of Science, Nanjing Agricultural University, Nanjing, China
| | - Juncong Chen
- College of Finance, Nanjing Agricultural University, Nanjing, China
| |
Collapse
|
9
|
Wang M, Li R, Xu S. Deshrinking ridge regression for genome-wide association studies. Bioinformatics 2021; 36:4154-4162. [PMID: 32379866 DOI: 10.1093/bioinformatics/btaa345] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2019] [Revised: 04/21/2020] [Accepted: 04/29/2020] [Indexed: 11/15/2022] Open
Abstract
MOTIVATION Genome-wide association studies (GWAS) are still the primary steps toward gene discovery. The urgency is more obvious in the big data era when GWAS are conducted simultaneously for thousand traits, e.g. transcriptomic and metabolomic traits. Efficient mixed model association (EMMA) and genome-wide efficient mixed model association (GEMMA) are the widely used methods for GWAS. An algorithm with high computational efficiency is badly needed. It is interesting to note that the test statistics of the ordinary ridge regression (ORR) have the same patterns across the genome as those obtained from the EMMA method. However, ORR has never been used for GWAS due to its severe shrinkage on the estimated effects and the test statistics. RESULTS We introduce a degree of freedom for each marker effect obtained from ORR and use it to deshrink both the estimated effect and the standard error so that the Wald test of ORR is brought back to the same level as that of EMMA. The new method is called deshrinking ridge regression (DRR). By evaluating the methods under three different model sizes (small, medium and large), we demonstrate that DRR is more generalized for all model sizes than EMMA, which only works for medium and large models. Furthermore, DRR detect all markers in a simultaneous manner instead of scanning one marker at a time. As a result, the computational time complexity of DRR is much simpler than EMMA and about m (number of genetic variants) times simpler than that of GEMMA when the sample size is way smaller than the number of markers. CONTACT shizhong.xu@ucr.edu. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Meiyue Wang
- Department of Botany and Plant Sciences, University of California, Riverside, CA 92521, USA
| | - Ruidong Li
- Department of Botany and Plant Sciences, University of California, Riverside, CA 92521, USA
| | - Shizhong Xu
- Department of Botany and Plant Sciences, University of California, Riverside, CA 92521, USA
| |
Collapse
|
10
|
Dale R, Oswald S, Jalihal A, LaPorte MF, Fletcher DM, Hubbard A, Shiu SH, Nelson ADL, Bucksch A. Overcoming the Challenges to Enhancing Experimental Plant Biology With Computational Modeling. FRONTIERS IN PLANT SCIENCE 2021; 12:687652. [PMID: 34354723 PMCID: PMC8329482 DOI: 10.3389/fpls.2021.687652] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/29/2021] [Accepted: 06/01/2021] [Indexed: 05/10/2023]
Abstract
The study of complex biological systems necessitates computational modeling approaches that are currently underutilized in plant biology. Many plant biologists have trouble identifying or adopting modeling methods to their research, particularly mechanistic mathematical modeling. Here we address challenges that limit the use of computational modeling methods, particularly mechanistic mathematical modeling. We divide computational modeling techniques into either pattern models (e.g., bioinformatics, machine learning, or morphology) or mechanistic mathematical models (e.g., biochemical reactions, biophysics, or population models), which both contribute to plant biology research at different scales to answer different research questions. We present arguments and recommendations for the increased adoption of modeling by plant biologists interested in incorporating more modeling into their research programs. As some researchers find math and quantitative methods to be an obstacle to modeling, we provide suggestions for easy-to-use tools for non-specialists and for collaboration with specialists. This may especially be the case for mechanistic mathematical modeling, and we spend some extra time discussing this. Through a more thorough appreciation and awareness of the power of different kinds of modeling in plant biology, we hope to facilitate interdisciplinary, transformative research.
Collapse
Affiliation(s)
- Renee Dale
- Donald Danforth Plant Science Center, St. Louis, MO, United States
- *Correspondence: Renee Dale
| | - Scott Oswald
- Warnell School of Forestry and Natural Resources and Institute of Bioinformatics, University of Georgia, Athens, GA, United States
| | - Amogh Jalihal
- Department of Systems Biology, Harvard Medical School, Boston, MA, United States
| | - Mary-Francis LaPorte
- Department of Plant Sciences, University of California, Davis, Davis, CA, United States
| | - Daniel M. Fletcher
- Bioengineering Sciences Research Group, Department of Mechanical Engineering, School of Engineering, Faculty of Engineering and Physical Sciences, University of Southampton, Southampton, United Kingdom
| | - Allen Hubbard
- Donald Danforth Plant Science Center, St. Louis, MO, United States
| | - Shin-Han Shiu
- Department of Plant Biology and Department of Computational Mathematics, Science, and Engineering, Michigan State University, East Lansing, MI, United States
| | | | - Alexander Bucksch
- Warnell School of Forestry and Natural Resources and Institute of Bioinformatics, University of Georgia, Athens, GA, United States
- Department of Plant Biology, University of Georgia, Athens, GA, United States
- Institute of Bioinformatics, University of Georgia, Athens, GA, United States
| |
Collapse
|
11
|
Sen P, Lamichhane S, Mathema VB, McGlinchey A, Dickens AM, Khoomrung S, Orešič M. Deep learning meets metabolomics: a methodological perspective. Brief Bioinform 2020; 22:1531-1542. [PMID: 32940335 DOI: 10.1093/bib/bbaa204] [Citation(s) in RCA: 45] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2020] [Revised: 08/08/2020] [Accepted: 08/10/2020] [Indexed: 12/15/2022] Open
Abstract
Deep learning (DL), an emerging area of investigation in the fields of machine learning and artificial intelligence, has markedly advanced over the past years. DL techniques are being applied to assist medical professionals and researchers in improving clinical diagnosis, disease prediction and drug discovery. It is expected that DL will help to provide actionable knowledge from a variety of 'big data', including metabolomics data. In this review, we discuss the applicability of DL to metabolomics, while presenting and discussing several examples from recent research. We emphasize the use of DL in tackling bottlenecks in metabolomics data acquisition, processing, metabolite identification, as well as in metabolic phenotyping and biomarker discovery. Finally, we discuss how DL is used in genome-scale metabolic modelling and in interpretation of metabolomics data. The DL-based approaches discussed here may assist computational biologists with the integration, prediction and drawing of statistical inference about biological outcomes, based on metabolomics data.
Collapse
Affiliation(s)
- Partho Sen
- Turku Bioscience Centre, University of Turku and Åbo Akademi University, 20520 Turku, Finland.,School of Medical Sciences, Örebro University, 702 81 Örebro, Sweden
| | - Santosh Lamichhane
- Turku Bioscience Centre, University of Turku and Åbo Akademi University, 20520 Turku, Finland
| | - Vivek B Mathema
- Metabolomics and Systems Biology, Department of Biochemistry, and Siriraj Metabolomics and Phenomics Center, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok 10700, Thailand
| | - Aidan McGlinchey
- School of Medical Sciences, Örebro University, 702 81 Örebro, Sweden
| | - Alex M Dickens
- Turku Bioscience Centre, University of Turku and Åbo Akademi University, 20520 Turku, Finland
| | - Sakda Khoomrung
- Metabolomics and Systems Biology, Department of Biochemistry, and Siriraj Metabolomics and Phenomics Center, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok 10700, Thailand.,Center for Innovation in Chemistry (PERCH), Faculty of Science, Mahidol University, Rama 6 Road, Bangkok 10400, Thailand
| | - Matej Orešič
- Turku Bioscience Centre, University of Turku and Åbo Akademi University, 20520 Turku, Finland.,School of Medical Sciences, Örebro University, 702 81 Örebro, Sweden
| |
Collapse
|
12
|
Li R, Wang S, Cui Y, Qu H, Chater JM, Zhang L, Wei J, Wang M, Xu Y, Yu L, Lu J, Feng Y, Zhou R, Huang Y, Ma R, Zhu J, Zhong W, Jia Z. Extended application of genomic selection to screen multiomics data for prognostic signatures of prostate cancer. Brief Bioinform 2020; 22:5902820. [PMID: 32898860 DOI: 10.1093/bib/bbaa197] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2020] [Revised: 06/29/2020] [Accepted: 08/02/2020] [Indexed: 12/30/2022] Open
Abstract
Prognostic tests using expression profiles of several dozen genes help provide treatment choices for prostate cancer (PCa). However, these tests require improvement to meet the clinical need for resolving overtreatment, which continues to be a pervasive problem in PCa management. Genomic selection (GS) methodology, which utilizes whole-genome markers to predict agronomic traits, was adopted in this study for PCa prognosis. We leveraged The Cancer Genome Atlas (TCGA) database to evaluate the prediction performance of six GS methods and seven omics data combinations, which showed that the Best Linear Unbiased Prediction (BLUP) model outperformed the other methods regarding predictability and computational efficiency. Leveraging the BLUP-HAT method, an accelerated version of BLUP, we demonstrated that using expression data of a large number of disease-relevant genes and with an integration of other omics data (i.e. miRNAs) significantly increased outcome predictability when compared with panels consisting of a small number of genes. Finally, we developed a novel stepwise forward selection BLUP-HAT method to facilitate searching multiomics data for predictor variables with prognostic potential. The new method was applied to the TCGA data to derive mRNA and miRNA expression signatures for predicting relapse-free survival of PCa, which were validated in six independent cohorts. This is a transdisciplinary adoption of the highly efficient BLUP-HAT method and its derived algorithms to analyze multiomics data for PCa prognosis. The results demonstrated the efficacy and robustness of the new methodology in developing prognostic models in PCa, suggesting a potential utility in managing other types of cancer.
Collapse
Affiliation(s)
- Ruidong Li
- University of California, Riverside, USA
| | - Shibo Wang
- University of California, Riverside, USA
| | - Yanru Cui
- Hebei Agricultural University, China
| | - Han Qu
- University of California, Riverside, USA
| | | | - Le Zhang
- University of California, Riverside, USA
| | | | | | | | - Lei Yu
- University of California, Riverside, USA
| | | | | | - Rui Zhou
- South China University of Technology, China
| | | | | | - Jianguo Zhu
- Guizhou Provincial People's Hospital, Guizhou, China
| | - Weide Zhong
- South China University of Technology, Guangzhou, China
| | - Zhenyu Jia
- University of California, Riverside, USA
| |
Collapse
|
13
|
Sun S, Wang X, Wang K, Cui X. Dissection of complex traits of tomato in the post-genome era. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2020; 133:1763-1776. [PMID: 31745578 DOI: 10.1007/s00122-019-03478-y] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/19/2019] [Accepted: 11/09/2019] [Indexed: 06/10/2023]
Abstract
We present the main advances of dissection of complex traits in tomato by omics, the genes identified to control complex traits and the application of CRISPR/Cas9 in tomato breeding. Complex traits are believed to be under the control of multiple genes, each with different effects and interaction with environmental factors. Advance development of sequencing and molecular technologies has enabled the recognition of the genomic structure of most organisms and the identification of a nearly limitless number of markers that have made it to accelerate the speed of QTL identification and gene cloning. Meanwhile, multiomics have been used to identify the genetic variations among different tomato species, determine the expression profiles of genes in different tissues and at distinct developmental stages, and detect metabolites in different pathways and processes. The combination of these data facilitates to reveal mechanism underlying complex traits. Moreover, mutants generated by mutagens and genome editing provide relatively rich genetic variation for deciphering the complex traits and exploiting them in tomato breeding. In this article, we present the main advances of complex trait dissection in tomato by omics since the release of the tomato genome sequence in 2012. We provide further insight into some tomato complex traits because of the causal genetic variations discovered so far and explore the utilization of CRISPR/Cas9 for the modification of tomato complex traits.
Collapse
Affiliation(s)
- Shuai Sun
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops of the Ministry of Agriculture, Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
- Sino-Dutch Joint Laboratory of Horticultural Genomics, Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Xiaotian Wang
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops of the Ministry of Agriculture, Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
- Sino-Dutch Joint Laboratory of Horticultural Genomics, Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Ketao Wang
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops of the Ministry of Agriculture, Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
- Sino-Dutch Joint Laboratory of Horticultural Genomics, Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Xia Cui
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops of the Ministry of Agriculture, Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, 100081, China.
- Sino-Dutch Joint Laboratory of Horticultural Genomics, Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, 100081, China.
| |
Collapse
|
14
|
Liu P, Luo J, Zheng Q, Chen Q, Zhai N, Xu S, Xu Y, Jin L, Xu G, Lu X, Xu G, Wang G, Shao J, Xu H, Cao P, Zhou H, Wang X. Integrating transcriptome and metabolome reveals molecular networks involved in genetic and environmental variation in tobacco. DNA Res 2020; 27:dsaa006. [PMID: 32324848 PMCID: PMC7320822 DOI: 10.1093/dnares/dsaa006] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2020] [Accepted: 04/16/2020] [Indexed: 12/23/2022] Open
Abstract
Tobacco (Nicotiana tabacum) is one of the most widely cultivated commercial non-food crops with significant social and economic impacts. Here we profiled transcriptome and metabolome from 54 tobacco samples (2-3 replicates; n = 151 in total) collected from three varieties (i.e. genetic factor), three locations (i.e. environmental factor), and six developmental stages (i.e. developmental process). We identified 3,405 differentially expressed (DE) genes (DEGs) and 371 DE metabolites, respectively. We used quantitative real-time PCR to validate 20 DEGs, and confirmed 18/20 (90%) DEGs between three locations and 16/20 (80%) with the same trend across developmental stages. We then constructed nine co-expression gene modules and four co-expression metabolite modules , and defined seven de novo regulatory networks, including nicotine- and carotenoid-related regulatory networks. A novel two-way Pearson correlation approach was further proposed to integrate co-expression gene and metabolite modules to identify joint gene-metabolite relations. Finally, we further integrated DE and network results to prioritize genes by its functional importance and identified a top-ranked novel gene, LOC107773232, as a potential regulator involved in the carotenoid metabolism pathway. Thus, the results and systems-biology approaches provide a new avenue to understand the molecular mechanisms underlying complex genetic and environmental perturbations in tobacco.
Collapse
Affiliation(s)
- Pingping Liu
- China Tobacco Gene Research Center, Zhengzhou Tobacco Research Institute of CNTC, Zhengzhou 450001, China
| | - Jie Luo
- Central Laboratory of Zhejiang Academy of Agricultural Sciences, Zhejiang Academy of Agricultural Sciences, Hangzhou 310021, China
| | - Qingxia Zheng
- China Tobacco Gene Research Center, Zhengzhou Tobacco Research Institute of CNTC, Zhengzhou 450001, China
| | - Qiansi Chen
- China Tobacco Gene Research Center, Zhengzhou Tobacco Research Institute of CNTC, Zhengzhou 450001, China
| | - Niu Zhai
- China Tobacco Gene Research Center, Zhengzhou Tobacco Research Institute of CNTC, Zhengzhou 450001, China
| | - Shengchun Xu
- Central Laboratory of Zhejiang Academy of Agricultural Sciences, Zhejiang Academy of Agricultural Sciences, Hangzhou 310021, China
| | - Yalong Xu
- China Tobacco Gene Research Center, Zhengzhou Tobacco Research Institute of CNTC, Zhengzhou 450001, China
| | - Lifeng Jin
- China Tobacco Gene Research Center, Zhengzhou Tobacco Research Institute of CNTC, Zhengzhou 450001, China
| | - Guoyun Xu
- China Tobacco Gene Research Center, Zhengzhou Tobacco Research Institute of CNTC, Zhengzhou 450001, China
| | - Xin Lu
- Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian 116023, China
| | - Guowang Xu
- Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian 116023, China
| | - Gangjun Wang
- Central Laboratory of Zhejiang Academy of Agricultural Sciences, Zhejiang Academy of Agricultural Sciences, Hangzhou 310021, China
| | - Jianfeng Shao
- Central Laboratory of Zhejiang Academy of Agricultural Sciences, Zhejiang Academy of Agricultural Sciences, Hangzhou 310021, China
| | - Hai‐Ming Xu
- Institute of Crop Science and Institute of Bioinformatics, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 310058, China
| | - Peijian Cao
- China Tobacco Gene Research Center, Zhengzhou Tobacco Research Institute of CNTC, Zhengzhou 450001, China
| | - Huina Zhou
- China Tobacco Gene Research Center, Zhengzhou Tobacco Research Institute of CNTC, Zhengzhou 450001, China
| | - Xusheng Wang
- Department of Biology, University of North Dakota, Grand Forks, ND, 58202, USA
| |
Collapse
|
15
|
Price EJ, Drapal M, Perez‐Fons L, Amah D, Bhattacharjee R, Heider B, Rouard M, Swennen R, Becerra Lopez‐Lavalle LA, Fraser PD. Metabolite database for root, tuber, and banana crops to facilitate modern breeding in understudied crops. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2020; 101:1258-1268. [PMID: 31845400 PMCID: PMC7383867 DOI: 10.1111/tpj.14649] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/17/2019] [Revised: 11/09/2019] [Accepted: 11/28/2019] [Indexed: 05/06/2023]
Abstract
Roots, tubers, and bananas (RTB) are vital staples for food security in the world's poorest nations. A major constraint to current RTB breeding programmes is limited knowledge on the available diversity due to lack of efficient germplasm characterization and structure. In recent years large-scale efforts have begun to elucidate the genetic and phenotypic diversity of germplasm collections and populations and, yet, biochemical measurements have often been overlooked despite metabolite composition being directly associated with agronomic and consumer traits. Here we present a compound database and concentration range for metabolites detected in the major RTB crops: banana (Musa spp.), cassava (Manihot esculenta), potato (Solanum tuberosum), sweet potato (Ipomoea batatas), and yam (Dioscorea spp.), following metabolomics-based diversity screening of global collections held within the CGIAR institutes. The dataset including 711 chemical features provides a valuable resource regarding the comparative biochemical composition of each RTB crop and highlights the potential diversity available for incorporation into crop improvement programmes. Particularly, the tropical crops cassava, sweet potato and banana displayed more complex compositional metabolite profiles with representations of up to 22 chemical classes (unknowns excluded) than that of potato, for which only metabolites from 10 chemical classes were detected. Additionally, over 20% of biochemical signatures remained unidentified for every crop analyzed. Integration of metabolomics with the on-going genomic and phenotypic studies will enhance 'omics-wide associations of molecular signatures with agronomic and consumer traits via easily quantifiable biochemical markers to aid gene discovery and functional characterization.
Collapse
Affiliation(s)
- Elliott J. Price
- Royal Holloway University of London, SurreyTW20 0EXEghamUnited Kingdom
- Present address:
Masaryk UniversityBrno‐Bohunice625 00Czech Republic
| | - Margit Drapal
- Royal Holloway University of London, SurreyTW20 0EXEghamUnited Kingdom
| | - Laura Perez‐Fons
- Royal Holloway University of London, SurreyTW20 0EXEghamUnited Kingdom
| | - Delphine Amah
- International Institute of Tropical AgriculturePMB 5320IbadanNigeria
| | | | | | - Mathieu Rouard
- Bioversity InternationalParc Scientifique Agropolis II34397MontpellierFrance
| | - Rony Swennen
- Laboratory of Tropical Crop ImprovementDivision of Crop BiotechnicsKU LeuvenB‐3001LeuvenBelgium
- Bioversity InternationalWillem De Croylaan 42B‐3001LeuvenBelgium
- International Institute of Tropical Agriculture. C/0 The Nelson Mandela African Institution of Science and TechnologyP.O. Box 44ArushaTanzania
| | | | - Paul D. Fraser
- Royal Holloway University of London, SurreyTW20 0EXEghamUnited Kingdom
| |
Collapse
|
16
|
Wei J, Xie W, Li R, Wang S, Qu H, Ma R, Zhou X, Jia Z. Analysis of trait heritability in functionally partitioned rice genomes. Heredity (Edinb) 2019; 124:485-498. [PMID: 31253955 DOI: 10.1038/s41437-019-0244-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2019] [Revised: 06/05/2019] [Accepted: 06/08/2019] [Indexed: 01/10/2023] Open
Abstract
Knowledge of the genetic architecture of importantly agronomical traits can speed up genetic improvement in cultivated rice (Oryza sativa L.). Many recent investigations have leveraged genome-wide association studies (GWAS) to identify single nucleotide polymorphisms (SNPs), associated with agronomic traits in various rice populations. The reported trait-relevant SNPs appear to be arbitrarily distributed along the genome, including genic and nongenic regions. Whether the SNPs in different genomic regions play different roles in trait heritability and which region is more responsible for phenotypic variation remains opaque. We analyzed a natural rice population of 524 accessions with 3,616,597 SNPs to compare the genetic contributions of functionally distinct genomic regions for five agronomic traits, i.e., yield, heading date, plant height, grain length, and grain width. An analysis of heritability in the functionally partitioned rice genome showed that regulatory or intergenic regions account for the most trait heritability. A close look at the trait-associated SNPs (TASs) indicated that the majority of the TASs are located in nongenic regions, and the genetic effects of the TASs in nongenic regions are generally greater than those in genic regions. We further compared the predictabilities using the genetic variants from genic regions with those using nongenic regions. The results revealed that nongenic regions play a more important role than genic regions in trait heritability in rice, which is consistent with findings in humans and maize. This conclusion not only offers clues for basic research to disclose genetics behind these agronomic traits, but also provides a new perspective to facilitate genomic selection in rice.
Collapse
Affiliation(s)
- Julong Wei
- College of Animal Science and Technology, Nanjing Agricultural University, Nanjing, Jiangsu, China.,Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA
| | - Weibo Xie
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, Hubei, China
| | - Ruidong Li
- Department of Botany & Plant Sciences, University of California (Riverside), Riverside, CA, USA
| | - Shibo Wang
- Department of Botany & Plant Sciences, University of California (Riverside), Riverside, CA, USA
| | - Han Qu
- Department of Botany & Plant Sciences, University of California (Riverside), Riverside, CA, USA
| | - Renyuan Ma
- Department of Botany & Plant Sciences, University of California (Riverside), Riverside, CA, USA.,Department of Mathematics, Bowdoin College, Brunswick, ME, USA
| | - Xiang Zhou
- Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA
| | - Zhenyu Jia
- Department of Botany & Plant Sciences, University of California (Riverside), Riverside, CA, USA.
| |
Collapse
|
17
|
Wang S, Wei J, Li R, Qu H, Chater JM, Ma R, Li Y, Xie W, Jia Z. Identification of optimal prediction models using multi-omic data for selecting hybrid rice. Heredity (Edinb) 2019; 123:395-406. [PMID: 30911139 DOI: 10.1038/s41437-019-0210-6] [Citation(s) in RCA: 32] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2018] [Revised: 02/22/2019] [Accepted: 02/25/2019] [Indexed: 11/09/2022] Open
Abstract
Genomic prediction benefits hybrid rice breeding by increasing selection intensity and accelerating breeding cycles. With the rapid advancement of technology, other omic data, such as metabolomic data and transcriptomic data, are readily available for predicting breeding values for agronomically important traits. In this study, the best prediction strategies were determined for yield, 1000 grain weight, number of grains per panicle, and number of tillers per plant of hybrid rice (derived from recombinant inbred lines) by comprehensively evaluating all possible combinations of omic datasets with different prediction methods. It was demonstrated that, in rice, the predictions using a combination of genomic and metabolomic data generally produce better results than single-omics predictions or predictions based on other combined omic data. Best linear unbiased prediction (BLUP) appears to be the most efficient prediction method compared to the other commonly used approaches, including least absolute shrinkage and selection operator (LASSO), stochastic search variable selection (SSVS), support vector machines with radial basis function and epsilon regression (SVM-R(EPS)), support vector machines with radial basis function and nu regression (SVM-R(NU)), support vector machines with polynomial kernel and epsilon regression (SVM-P(EPS)), support vector machines with polynomial kernel and nu regression (SVM-P(NU)) and partial least squares regression (PLS). This study has provided guidelines for selection of hybrid rice in terms of which types of omic datasets and which method should be used to achieve higher trait predictability. The answer to these questions will benefit academic research and will also greatly reduce the operative cost for the industry which specializes in breeding and selection.
Collapse
Affiliation(s)
- Shibo Wang
- Department of Botany & Plant Sciences, University of California, Riverside, CA, USA
| | - Julong Wei
- College of Animal Science and Technology, Nanjing Agricultural University, Nanjing, China
| | - Ruidong Li
- Department of Botany & Plant Sciences, University of California, Riverside, CA, USA
| | - Han Qu
- Department of Botany & Plant Sciences, University of California, Riverside, CA, USA
| | - John M Chater
- Department of Botany & Plant Sciences, University of California, Riverside, CA, USA
| | - Renyuan Ma
- Department of Mathematics, Bowdoin College, Brunswick, ME, USA
| | - Yonghao Li
- Department of Neuroscience, University of British Columbia, Vancouver, BC, Canada
| | - Weibo Xie
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, China
| | - Zhenyu Jia
- Department of Botany & Plant Sciences, University of California, Riverside, CA, USA.
| |
Collapse
|
18
|
Thorwarth P, Liu G, Ebmeyer E, Schacht J, Schachschneider R, Kazman E, Reif JC, Würschum T, Longin CFH. Dissecting the genetics underlying the relationship between protein content and grain yield in a large hybrid wheat population. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2019; 132:489-500. [PMID: 30456718 DOI: 10.1007/s00122-018-3236-x] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/12/2018] [Accepted: 11/07/2018] [Indexed: 05/13/2023]
Abstract
Additive and dominance effect QTL for grain yield and protein content display antagonistic pleiotropic effects, making genomic selection based on the index grain protein deviation a promising method to alleviate the negative correlation between these traits in wheat breeding. Grain yield and quality-related traits such as protein content and sedimentation volume are key traits in wheat breeding. In this study, we used a large population of 1604 hybrids and their 135 parental components to investigate the genetics and metabolomics underlying the negative relationship of grain yield and quality, and evaluated approaches for their joint improvement. We identified a total of nine trait-associated metabolites and show that prediction using genomic data alone resulted in the highest prediction ability for all traits. We dissected the genetic architecture of grain yield and quality-determining traits and show results of the first mapping of the derived trait grain protein deviation. Further, we provide a genetic analysis of the antagonistic relation of grain yield and protein content and dissect the mode of gene action (pleiotropy vs linkage) of identified QTL. Lastly, we demonstrate that the composition of the training set for genomic prediction is crucial when considering different quality classes in wheat breeding.
Collapse
Affiliation(s)
- Patrick Thorwarth
- State Plant Breeding Institute, University of Hohenheim, 70593, Stuttgart, Germany
| | - Guozheng Liu
- BASF Agricultural Solutions Seed GmbH, OT Gatersleben, Am Schwabeplan 8, 06466, Seeland, Germany
| | | | | | | | | | - Jochen Christoph Reif
- Department of Breeding Research, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), 06466, Gatersleben, Germany
| | - Tobias Würschum
- State Plant Breeding Institute, University of Hohenheim, 70593, Stuttgart, Germany
| | | |
Collapse
|
19
|
Cui Y, Zhang F, Zhou Y. The Application of Multi-Locus GWAS for the Detection of Salt-Tolerance Loci in Rice. FRONTIERS IN PLANT SCIENCE 2018; 9:1464. [PMID: 30337936 PMCID: PMC6180169 DOI: 10.3389/fpls.2018.01464] [Citation(s) in RCA: 53] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/31/2018] [Accepted: 09/14/2018] [Indexed: 05/18/2023]
Abstract
Improving the salt-tolerance of direct-seeding rice at the seed germination stage is a major goal of breeders. Efficiently identifying salt tolerance loci will help researchers develop effective rice breeding strategies. In this study, six multi-locus genome-wide association studies (GWAS) methods (mrMLM, FASTmrMLM, FASTmrEMMA, pLARmEB, pKWmEB, and ISIS EM-BLASSO) were applied to identify quantitative trait nucleotides (QTNs) for the salt tolerance traits of 478 rice accessions with 162,529 SNPs at the seed germination stage. Among the 371 QTNs detected by the six methods, 56 were identified by at least three methods. Among these 56 QTNs, 12, 6, 7, 4, 13, 12, and 12 were found to be associated with SSI-GI, SSI-VI, SSI-MGT, SSI-IR-24h, SSI-IR-48h, SSI-GR-5d, and SSI-GR-10d, respectively. Additionally, 66 candidate genes were identified in the vicinity of the 56 QTNs, and two of these genes (LOC_Os01g45760 and LOC_Os10g04860) are involved in auxin biosynthesis according to the enriched GO terms and KEGG pathways. This information will be useful for identifying the genes responsible for rice salt tolerance. A comparison of the six methods revealed that ISIS EM-BLASSO identified the most co-detected QTNs and performed best, with the smallest residual errors and highest computing speed, followed by FASTmrMLM, pLARmEB, mrMLM, pKWmEB, and FASTmrEMMA. Although multi-locus GWAS methods are superior to single-locus GWAS methods, their utility for identifying QTNs may be enhanced by adding a bin analysis to the models or by developing a hybrid method that merges the results from different methods.
Collapse
Affiliation(s)
| | - Fan Zhang
- *Correspondence: Fan Zhang, Yongli Zhou,
| | | |
Collapse
|