1
|
Crop modeling suggests limited transpiration would increase yield of sorghum across drought-prone regions of the United States. FRONTIERS IN PLANT SCIENCE 2024; 14:1283339. [PMID: 38348164 PMCID: PMC10859530 DOI: 10.3389/fpls.2023.1283339] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/25/2023] [Accepted: 12/21/2023] [Indexed: 02/15/2024]
Abstract
Breeding sorghum to withstand droughts is pivotal to secure crop production in regions vulnerable to water scarcity. Limited transpiration (LT) restricts water demand at high vapor pressure deficit, saving water for use in critical periods later in the growing season. Here we evaluated the hypothesis that LT would increase sorghum grain yield in the United States. We used a process-based crop model, APSIM, which simulates interactions of genotype, environment, and management (G × E × M). In this study, the G component includes the LT trait (GT) and maturity group (GM), the EW component entails water deficit patterns, and the MP component represents different planting dates. Simulations were conducted over 33 years (1986-2018) for representative locations across the US sorghum belt (Kansas, Texas, and Colorado) for three planting dates and maturity groups. The interaction of GT x EW indicated a higher impact of LT sorghum on grain for late drought (LD), mid-season drought (MD), and early drought (ED, 8%), than on well-watered (WW) environments (4%). Thus, significant impacts of LT can be achieved in western regions of the sorghum belt. The lack of interaction of GT × GM × MP suggested that an LT sorghum would increase yield by around 8% across maturity groups and planting dates. Otherwise, the interaction GM × MP revealed that specific combinations are better suited across geographical regions. Overall, the findings suggest that breeding for LT would increase sorghum yield in the drought-prone areas of the US without tradeoffs.
Collapse
|
2
|
Current challenges and future of agricultural genomes to phenomes in the USA. Genome Biol 2024; 25:8. [PMID: 38172911 PMCID: PMC10763150 DOI: 10.1186/s13059-023-03155-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2023] [Accepted: 12/21/2023] [Indexed: 01/05/2024] Open
Abstract
Dramatic improvements in measuring genetic variation across agriculturally relevant populations (genomics) must be matched by improvements in identifying and measuring relevant trait variation in such populations across many environments (phenomics). Identifying the most critical opportunities and challenges in genome to phenome (G2P) research is the focus of this paper. Previously (Genome Biol, 23(1):1-11, 2022), we laid out how Agricultural Genome to Phenome Initiative (AG2PI) will coordinate activities with USA federal government agencies expand public-private partnerships, and engage with external stakeholders to achieve a shared vision of future the AG2PI. Acting on this latter step, AG2PI organized the "Thinking Big: Visualizing the Future of AG2PI" two-day workshop held September 9-10, 2022, in Ames, Iowa, co-hosted with the United State Department of Agriculture's National Institute of Food and Agriculture (USDA NIFA). During the meeting, attendees were asked to use their experience and curiosity to review the current status of agricultural genome to phenome (AG2P) work and envision the future of the AG2P field. The topic summaries composing this paper are distilled from two 1.5-h small group discussions. Challenges and solutions identified across multiple topics at the workshop were explored. We end our discussion with a vision for the future of agricultural progress, identifying two areas of innovation needed: (1) innovate in genetic improvement methods development and evaluation and (2) innovate in agricultural research processes to solve societal problems. To address these needs, we then provide six specific goals that we recommend be implemented immediately in support of advancing AG2P research.
Collapse
|
3
|
Impact of genotype-calling methodologies on genome-wide association and genomic prediction in polyploids. THE PLANT GENOME 2023; 16:e20401. [PMID: 37903749 DOI: 10.1002/tpg2.20401] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/29/2023] [Revised: 09/17/2023] [Accepted: 09/23/2023] [Indexed: 11/01/2023]
Abstract
Discovery and analysis of genetic variants underlying agriculturally important traits are key to molecular breeding of crops. Reduced representation approaches have provided cost-efficient genotyping using next-generation sequencing. However, accurate genotype calling from next-generation sequencing data is challenging, particularly in polyploid species due to their genome complexity. Recently developed Bayesian statistical methods implemented in available software packages, polyRAD, EBG, and updog, incorporate error rates and population parameters to accurately estimate allelic dosage across any ploidy. We used empirical and simulated data to evaluate the three Bayesian algorithms and demonstrated their impact on the power of genome-wide association study (GWAS) analysis and the accuracy of genomic prediction. We further incorporated uncertainty in allelic dosage estimation by testing continuous genotype calls and comparing their performance to discrete genotypes in GWAS and genomic prediction. We tested the genotype-calling methods using data from two autotetraploid species, Miscanthus sacchariflorus and Vaccinium corymbosum, and performed GWAS and genomic prediction. In the empirical study, the tested Bayesian genotype-calling algorithms differed in their downstream effects on GWAS and genomic prediction, with some showing advantages over others. Through subsequent simulation studies, we observed that at low read depth, polyRAD was advantageous in its effect on GWAS power and limit of false positives. Additionally, we found that continuous genotypes increased the accuracy of genomic prediction, by reducing genotyping error, particularly at low sequencing depth. Our results indicate that by using the Bayesian algorithm implemented in polyRAD and continuous genotypes, we can accurately and cost-efficiently implement GWAS and genomic prediction in polyploid crops.
Collapse
|
4
|
Exploring the Relationship Between Pattern-Triggered Immunity and Quantitative Resistance to Xanthomonas vasicola pv. vasculorum in Maize. PHYTOPATHOLOGY 2023; 113:2127-2133. [PMID: 36853191 DOI: 10.1094/phyto-09-22-0357-sa] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]
Abstract
Bacterial leaf streak (BLS) of maize is an emerging foliar disease of maize in the Americas. It is caused by the gram-negative nonvascular bacterium Xanthomonas vasicola pv. vasculorum. There are no chemical controls available for BLS, and thus, host resistance is crucial for managing X. vasicola pv. vasculorum. The objective of this study was to examine the genetic determinants of resistance to X. vasicola pv. vasculorum in maize, as well as the relationship between other defense-related traits and BLS resistance. Specifically, we examined the correlations among BLS severity, severity for three fungal diseases, flg-22 response, hypersensitive response, and auricle color. We conducted quantitative trait locus (QTL) mapping for X. vasicola pv. vasculorum resistance using the maize recombinant inbred line population Z003 (B73 × CML228). We detected three QTLs for BLS resistance. In addition to the disease resistance QTL, we detected a single QTL for auricle color. We observed significant, yet weak, correlations among BLS severity, levels of pattern-triggered immunity response and leaf flecking. These results will be useful for understanding resistance to X. vasicola pv. vasculorum and mitigating the impact of BLS on maize yields.
Collapse
|
5
|
Regularized multi-trait multi-locus linear mixed models for genome-wide association studies and genomic selection in crops. BMC Bioinformatics 2023; 24:399. [PMID: 37884874 PMCID: PMC10604903 DOI: 10.1186/s12859-023-05519-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2022] [Accepted: 10/03/2023] [Indexed: 10/28/2023] Open
Abstract
BACKGROUND We consider two key problems in genomics involving multiple traits: multi-trait genome wide association studies (GWAS), where the goal is to detect genetic variants associated with the traits; and multi-trait genomic selection (GS), where the emphasis is on accurately predicting trait values. Multi-trait linear mixed models build on the linear mixed model to jointly model multiple traits. Existing estimation methods, however, are limited to the joint analysis of a small number of genotypes; in fact, most approaches consider one SNP at a time. Estimating multi-dimensional genetic and environment effects also results in considerable computational burden. Efficient approaches that incorporate regularization into multi-trait linear models (no random effects) have been recently proposed to identify genomic loci associated with multiple traits (Yu et al. in Multitask learning using task clustering with applications to predictive modeling and GWAS of plant varieties. arXiv:1710.01788 , 2017; Yu et al in Front Big Data 2:27, 2019), but these ignore population structure and familial relatedness (Yu et al in Nat Genet 38:203-208, 2006). RESULTS This work addresses this gap by proposing a novel class of regularized multi-trait linear mixed models along with scalable approaches for estimation in the presence of high-dimensional genotypes and a large number of traits. We evaluate the effectiveness of the proposed methods using datasets in maize and sorghum diversity panels, and demonstrate benefits in both achieving high prediction accuracy in GS and in identifying relevant marker-trait associations. CONCLUSIONS The proposed regularized multivariate linear mixed models are relevant for both GWAS and GS. We hope that they will facilitate agronomy-related research in plant biology and crop breeding endeavors.
Collapse
|
6
|
Importance of genetic architecture in marker selection decisions for genomic prediction. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2023; 136:220. [PMID: 37819415 DOI: 10.1007/s00122-023-04469-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/28/2023] [Accepted: 09/25/2023] [Indexed: 10/13/2023]
Abstract
KEY MESSAGE We demonstrate potential for improved multi-environment genomic prediction accuracy using structural variant markers. However, the degree of observed improvement is highly dependent on the genetic architecture of the trait. Breeders commonly use genetic markers to predict the performance of untested individuals as a way to improve the efficiency of breeding programs. These genomic prediction models have almost exclusively used single nucleotide polymorphisms (SNPs) as their source of genetic information, even though other types of markers exist, such as structural variants (SVs). Given that SVs are associated with environmental adaptation and not all of them are in linkage disequilibrium to SNPs, SVs have the potential to bring additional information to multi-environment prediction models that are not captured by SNPs alone. Here, we evaluated different marker types (SNPs and/or SVs) on prediction accuracy across a range of genetic architectures for simulated traits across multiple environments. Our results show that SVs can improve prediction accuracy, but it is highly dependent on the genetic architecture of the trait and the relative gain in accuracy is minimal. When SVs are the only causative variant type, 70% of the time SV predictors outperform SNP predictors. However, the improvement in accuracy in these instances is only 1.5% on average. Further simulations with predictors in varying degrees of LD with causative variants of different types (e.g., SNPs, SVs, SNPs and SVs) showed that prediction accuracy increased as linkage disequilibrium between causative variants and predictors increased regardless of the marker type. This study demonstrates that knowing the genetic architecture of a trait in deciding what markers to use in large-scale genomic prediction modeling in a breeding program is more important than what types of markers to use.
Collapse
|
7
|
Inhibition of ethylene involved in resistance to E. turcicum in an exotic-derived double haploid maize population. FRONTIERS IN PLANT SCIENCE 2023; 14:1272951. [PMID: 37868313 PMCID: PMC10587583 DOI: 10.3389/fpls.2023.1272951] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/04/2023] [Accepted: 09/19/2023] [Indexed: 10/24/2023]
Abstract
Northern corn leaf blight (NCLB) is an economically important disease of maize. While the genetic architecture of NCLB has been well characterized, the pathogen is known to overcome currently deployed resistance genes, and the role of hormones in resistance to NCLB is an area of active research. The objectives of the study were (i) to identify significant markers associated with resistance to NCLB, (ii) to identify metabolic pathways associated with NCLB resistance, and (iii) to examine role of ethylene in resistance to NCLB. We screened 252 lines from the exotic-derived double haploid BGEM maize population for resistance to NCLB in both field and greenhouse environments. We used a genome wide association study (GWAS) and stepwise regression to identify four markers associated with resistance, followed by a pathway association study tool (PAST) to identify important metabolic pathways associated with disease severity and incubation period. The ethylene synthesis pathway was significant for disease severity and incubation period. We conducted a greenhouse assay in which we inhibited ethylene to examine the role of ethylene in resistance to NCLB. We observed a significant increase in incubation period and a significant decrease in disease severity between plants treated with the ethylene inhibitor and mock-treated plants. Our study confirms the potential of the BGEM population as a source of novel alleles for resistance. We also confirm the role of ethylene in resistance to NCLB and contribute to the growing body of literature on ethylene and disease resistance in monocots.
Collapse
|
8
|
Linking genetic and environmental factors through marker effect networks to understand trait plasticity. Genetics 2023; 224:iyad103. [PMID: 37246567 DOI: 10.1093/genetics/iyad103] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2023] [Revised: 05/19/2023] [Accepted: 05/24/2023] [Indexed: 05/30/2023] Open
Abstract
Understanding how plants adapt to specific environmental changes and identifying genetic markers associated with phenotypic plasticity can help breeders develop plant varieties adapted to a rapidly changing climate. Here, we propose the use of marker effect networks as a novel method to identify markers associated with environmental adaptability. These marker effect networks are built by adapting commonly used software for building gene coexpression networks with marker effects across growth environments as the input data into the networks. To demonstrate the utility of these networks, we built networks from the marker effects of ∼2,000 nonredundant markers from 400 maize hybrids across 9 environments. We demonstrate that networks can be generated using this approach, and that the markers that are covarying are rarely in linkage disequilibrium, thus representing higher biological relevance. Multiple covarying marker modules associated with different weather factors throughout the growing season were identified within the marker effect networks. Finally, a factorial test of analysis parameters demonstrated that marker effect networks are relatively robust to these options, with high overlap in modules associated with the same weather factors across analysis parameters. This novel application of network analysis provides unique insights into phenotypic plasticity and specific environmental factors that modulate the genome.
Collapse
|
9
|
A first look at the ability to use genomic prediction for improving the ratooning ability of sugarcane. FRONTIERS IN PLANT SCIENCE 2023; 14:1205999. [PMID: 37600177 PMCID: PMC10433174 DOI: 10.3389/fpls.2023.1205999] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/14/2023] [Accepted: 07/03/2023] [Indexed: 08/22/2023]
Abstract
The sugarcane ratooning ability (RA) is the most important target trait for breeders seeking to enhance the profitability of sugarcane production by reducing the planting cost. Understanding the genetics governing the RA could help breeders by identifying molecular markers that could be used for genomics-assisted breeding (GAB). A replicated field trial was conducted for three crop cycles (plant cane, first ratoon, and second ratoon) using 432 sugarcane clones and used for conducting genome-wide association and genomic prediction of five sugar and yield component traits of the RA. The RA traits for economic index (EI), stalk population (SP), stalk weight (SW), tonns of cane per hectare (TCH), and tonns of sucrose per hectare (TSH) were estimated from the yield and sugar data. A total of six putative quantitative trait loci and eight nonredundant single-nucleotide polymorphism (SNP) markers were associated with all five tested RA traits and appear to be unique. Seven putative candidate genes were colocated with significant SNPs associated with the five RA traits. The genomic prediction accuracies for those tested traits were moderate and ranged from 0.21 to 0.36. However, the models fitting fixed effects for the most significant associated markers for each respective trait did not give any advantages over the standard models without fixed effects. As a result of this study, more robust markers could be used in the future for clone selection in sugarcane, potentially helping resolve the genetic control of the RA in sugarcane.
Collapse
|
10
|
Genome-wide association analysis for emergence of deeply sown rice ( Oryza sativa) reveals novel aus-specific phytohormone candidate genes for adaptation to dry-direct seeding in the field. FRONTIERS IN PLANT SCIENCE 2023; 14:1172816. [PMID: 37377815 PMCID: PMC10291202 DOI: 10.3389/fpls.2023.1172816] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/23/2023] [Accepted: 05/23/2023] [Indexed: 06/29/2023]
Abstract
Dry direct-seeded rice (dry-DSR) is typically sown deeply to circumvent the need for irrigation, and thus seedling emergence is a crucial trait affecting plant stand and yield. To breed elite cultivars that use less water and are climate-resilient, an understanding of the genomic regions and underlying genes that confer emergence in deeply sown dry-DSR would be highly advantageous. A combined diversity panel of 470 rice accessions (RDP1 plus aus subset of 3K RGP) was evaluated with 2.9 million single nucleotide polymorphisms (SNPs) to identify associations with dry-DSR traits in the field and component traits in a controlled-environment experiment. Using genome-wide association study (GWAS) analyses, we identified 18 unique QTLs on chromosomes 1, 2, 4, 5, 6, 7, 9, 10, and 11, explaining phenotypic variance ranging from 2.6% to 17.8%. Three QTLs, namely, qSOE-1.1, qEMERG-AUS-1.2, and qEMERG-AUS-7.1, were co-located with previously reported QTLs for mesocotyl length. Among the identified QTLs, half were associated with the emergence of aus, and six were unique to the aus genetic group. Based on functional annotation, we identified eleven compelling candidate genes that primarily regulate phytohormone pathways such as cytokinin, auxin, gibberellic acid, and jasmonic acid. Prior studies indicated that these phytohormones play a critical role in mesocotyl length under deep sowing. This study provides new insight into the importance of aus and indica as desirable genetic resources to mine favorable alleles for deep-sowing tolerance in rice. The candidate genes and marker-tagged desirable alleles identified in this study should benefit rice breeding programs directly.
Collapse
|
11
|
Reductions in leaf area index, pod production, seed size, and harvest index drive yield loss to high temperatures in soybean. JOURNAL OF EXPERIMENTAL BOTANY 2023; 74:1629-1641. [PMID: 36571807 DOI: 10.1093/jxb/erac503] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/29/2022] [Accepted: 12/23/2022] [Indexed: 06/17/2023]
Abstract
Improvements in genetics, technology, and agricultural intensification have increased soybean yields; however, adverse climate conditions may prevent these gains from being fully realized in the future. Higher growing season temperatures reduce soybean yields in key production regions including the US Midwest, and better understanding of the developmental and physiological mechanisms that constrain soybean yield under high temperature conditions is needed. This study tested the response of two soybean cultivars to four elevated temperature treatments (+1.7, +2.6, +3.6, and +4.8 °C) in the field over three growing seasons and identified threshold temperatures for response and linear versus non-linear trait responses to temperature. Yield declined non-linearly to temperature, with decreases apparent when canopy temperature exceeded 20.9 °C for the locally adapted cultivar and 22.7°C for a cultivar adapted to more southern locations. While stem node number increased with increasing temperature, leaf area index decreased substantially. Pod production, seed size, and harvest index significantly decreased with increasing temperature. The seasonal average temperature of even the mildest treatment exceeded the threshold temperatures for yield loss, emphasizing the importance of improving temperature tolerance in soybean germplasm with intensifying climate change.
Collapse
|
12
|
Genetic mapping of sorghum resistance to an Illinois isolate of Colletotrichum sublineola. THE PLANT GENOME 2022; 15:e20243. [PMID: 35822435 DOI: 10.1002/tpg2.20243] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/09/2022] [Accepted: 06/03/2022] [Indexed: 06/15/2023]
Abstract
Anthracnose leaf blight (ALB) is an economically important disease of sorghum [Sorghum bicolor (L.) Moench] caused by the fungal pathogen Colletotrichum sublineola Henn. ex Sacc. & Trotter. Although qualitative and quantitative resistance have been identified for ALB, the usefulness of resistance loci differs depending on the pathogen pathotype. Identifying resistance effective against unique pathogen pathotypes is critical to managing ALB, as the disease is managed primarily through the deployment of host resistance. We isolated C. sublineola from ALB-infected leaves collected in Illinois and found that the strain was a novel pathotype, as it produced a unique combination of virulence against a set of differential lines. Using this isolate, we inoculated 579 temperate-adapted sorghum conversion lines in 2019 and 2020. We then conducted a genome-wide association study (GWAS) and a metabolic pathway analysis using the Pathway Associated Study Tool (PAST). We identified 47 significant markers distributed across all chromosomes except chromosome 8. We identified 32 candidate genes based on physical proximity with significant markers, some of which have a known role in host defense. We identified 47 pathways associated with ALB resistance, indicating a role for secondary metabolism in defense to ALB. Our results are important to improve the understanding of the genetic basis of ALB resistance in sorghum and highlight the importance of developing durable resistance to ALB.
Collapse
|
13
|
A multi-trait multi-locus stepwise approach for conducting GWAS on correlated traits. THE PLANT GENOME 2022; 15:e20200. [PMID: 35307964 DOI: 10.1002/tpg2.20200] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/13/2021] [Accepted: 02/14/2022] [Indexed: 06/14/2023]
Abstract
The ability to accurately quantify the simultaneous effect of multiple genomic loci on multiple traits is now possible due to current and emerging high-throughput genotyping and phenotyping technologies. To date, most efforts to quantify these genotype-to-phenotype relationships have focused on either multi-trait models that test a single marker at a time or multi-locus models that quantify associations with a single trait. Therefore, the purpose of this study was to compare the performance of a multi-trait, multi-locus stepwise (MSTEP) model selection procedure we developed to (a) a commonly used multi-trait single-locus model and (b) a univariate multi-locus model. We used real marker data in maize (Zea mays L.) and soybean (Glycine max L.) to simulate multiple traits controlled by various combinations of pleiotropic and nonpleiotropic quantitative trait nucleotides (QTNs). In general, we found that both multi-trait models outperformed the univariate multi-locus model, especially when analyzing a trait of low heritability. For traits controlled by either a combination of pleiotropic and nonpleiotropic QTNs or a large number of QTNs (i.e., 50), our MSTEP model often outperformed at least one of the two alternative models. When applied to the analysis of two tocochromanol-related traits in maize grain, MSTEP identified the same peak-associated marker that has been reported in a previous study. We therefore conclude that MSTEP is a useful addition to the suite of statistical models that are commonly used to gain insight into the genetic architecture of agronomically important traits.
Collapse
|
14
|
Assessment of two statistical approaches for variance genome-wide association studies in plants. Heredity (Edinb) 2022; 129:93-102. [PMID: 35538221 PMCID: PMC9338250 DOI: 10.1038/s41437-022-00541-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2021] [Revised: 04/28/2022] [Accepted: 04/29/2022] [Indexed: 11/09/2022] Open
Abstract
Genomic loci that control the variance of agronomically important traits are increasingly important due to the profusion of unpredictable environments arising from climate change. The ability to identify such variance-controlling loci in association studies will be critical for future breeding efforts. Two statistical approaches that have already been used in the variance genome-wide association study (vGWAS) paradigm are the Brown-Forsythe test (BFT) and the double generalized linear model (DGLM). To ensure that these approaches are deployed as effectively as possible, it is critical to study the factors that influence their ability to identify variance-controlling loci. We used genome-wide marker data in maize (Zea mays L.) and Arabidopsis thaliana to simulate traits controlled by epistasis, genotype by environment (GxE) interactions, and variance quantitative trait nucleotides (vQTNs). We then quantified true and false positive detection rates of the BFT and DGLM across all simulated traits. We also conducted a vGWAS using both the BFT and DGLM on plant height in a maize diversity panel. The observed true positive detection rates at the maximum sample size considered (N = 2815) suggest that both of these vGWAS approaches are capable of identifying epistasis and GxE for sufficiently large sample sizes. We also noted that the DGLM decisively outperformed the BFT for simulated traits controlled by vQTNs at sample sizes of N = 500. Although we conclude that there are still certain aspects of vGWAS approaches that need further refinement, this study suggests that the BFT and DGLM are capable of identifying variance-controlling loci in current state-of-the-art plant or agronomic data sets.
Collapse
|
15
|
A population-level statistic for assessing Mendelian behavior of genotyping-by-sequencing data from highly duplicated genomes. BMC Bioinformatics 2022; 23:101. [PMID: 35317727 PMCID: PMC8939213 DOI: 10.1186/s12859-022-04635-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2021] [Accepted: 03/10/2022] [Indexed: 12/02/2022] Open
Abstract
Background Given the economic and environmental importance of allopolyploids and other species with highly duplicated genomes, there is a need for methods to distinguish paralogs, i.e. duplicate sequences within a genome, from Mendelian loci, i.e. single copy sequences that pair at meiosis. The ratio of observed to expected heterozygosity is an effective tool for filtering loci but requires genotyping to be performed first at a high computational cost, whereas counting the number of sequence tags detected per genotype is computationally quick but very ineffective in inbred or polyploid populations. Therefore, new methods are needed for filtering paralogs. Results We introduce a novel statistic, Hind/HE, that uses the probability that two reads sampled from a genotype will belong to different alleles, instead of observed heterozygosity. The expected value of Hind/HE is the same across all loci in a dataset, regardless of read depth or allele frequency. In contrast to methods based on observed heterozygosity, it can be estimated and used for filtering loci prior to genotype calling. In addition to filtering paralogs, it can be used to filter loci with null alleles or high overdispersion, and identify individuals with unexpected ploidy and hybrid status. We demonstrate that the statistic is useful at read depths as low as five to 10, well below the depth needed for accurate genotype calling in polyploid and outcrossing species. Conclusions Our methodology for estimating Hind/HE across loci and individuals, as well as determining reasonable thresholds for filtering loci, is implemented in polyRAD v1.6, available at https://github.com/lvclark/polyRAD. In large sequencing datasets, we anticipate that the ability to filter markers and identify problematic individuals prior to genotype calling will save researchers considerable computational time. Supplementary Information The online version contains supplementary material available at 10.1186/s12859-022-04635-9.
Collapse
|
16
|
Multiomics approach reveals a role of translational machinery in shaping maize kernel amino acid composition. PLANT PHYSIOLOGY 2022; 188:111-133. [PMID: 34618082 PMCID: PMC8774818 DOI: 10.1093/plphys/kiab390] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/04/2020] [Accepted: 07/21/2021] [Indexed: 06/13/2023]
Abstract
Maize (Zea mays) seeds are a good source of protein, despite being deficient in several essential amino acids. However, eliminating the highly abundant but poorly balanced seed storage proteins has revealed that the regulation of seed amino acids is complex and does not rely on only a handful of proteins. In this study, we used two complementary omics-based approaches to shed light on the genes and biological processes that underlie the regulation of seed amino acid composition. We first conducted a genome-wide association study to identify candidate genes involved in the natural variation of seed protein-bound amino acids. We then used weighted gene correlation network analysis to associate protein expression with seed amino acid composition dynamics during kernel development and maturation. We found that almost half of the proteome was significantly reduced during kernel development and maturation, including several translational machinery components such as ribosomal proteins, which strongly suggests translational reprogramming. The reduction was significantly associated with a decrease in several amino acids, including lysine and methionine, pointing to their role in shaping the seed amino acid composition. When we compared the candidate gene lists generated from both approaches, we found a nonrandom overlap of 80 genes. A functional analysis of these genes showed a tight interconnected cluster dominated by translational machinery genes, especially ribosomal proteins, further supporting the role of translation dynamics in shaping seed amino acid composition. These findings strongly suggest that seed biofortification strategies that target the translation machinery dynamics should be considered and explored further.
Collapse
|
17
|
Interpretation of Manhattan Plots and Other Outputs of Genome-Wide Association Studies. Methods Mol Biol 2022; 2481:63-80. [PMID: 35641759 DOI: 10.1007/978-1-0716-2237-7_5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
With increasing marker density, estimation of recombination rate between a marker and a causal mutation using linkage analysis becomes less important. Instead, linkage disequilibrium (LD) becomes the major indicator for gene mapping through genome-wide association studies (GWAS). In addition to the linkage between the marker and the causal mutation, many other factors may contribute to the LD, including population structure and cryptic relationships among individuals. As statistical methods and software evolve to improve statistical power and computing speed in GWAS, the corresponding outputs must also evolve to facilitate the interpretation of input data, the analytical process, and final association results. In this chapter, our descriptions focus on (1) considerations in creating a Manhattan plot displaying the strength of LD and locations of markers across a genome; (2) criteria for genome-wide significance threshold and the different appearance of Manhattan plots in single-locus and multiple-locus models; (3) exploration of population structure and kinship among individuals; (4) quantile-quantile (QQ) plot; (5) LD decay across the genome and LD between the associated markers and their neighbors; (6) exploration of individual and marker information on Manhattan and QQ plots via interactive visualization using HTML. The ultimate objective of this chapter is to help users to connect input data to GWAS outputs to balance power and false positives, and connect GWAS outputs to the selection of candidate genes using LD extent.
Collapse
|
18
|
Optical topometry and machine learning to rapidly phenotype stomatal patterning traits for maize QTL mapping. PLANT PHYSIOLOGY 2021; 187:1462-1480. [PMID: 34618057 PMCID: PMC8566313 DOI: 10.1093/plphys/kiab299] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/09/2020] [Accepted: 05/26/2021] [Indexed: 05/03/2023]
Abstract
Stomata are adjustable pores on leaf surfaces that regulate the tradeoff of CO2 uptake with water vapor loss, thus having critical roles in controlling photosynthetic carbon gain and plant water use. The lack of easy, rapid methods for phenotyping epidermal cell traits have limited discoveries about the genetic basis of stomatal patterning. A high-throughput epidermal cell phenotyping pipeline is presented here and used for quantitative trait loci (QTL) mapping in field-grown maize (Zea mays). The locations and sizes of stomatal complexes and pavement cells on images acquired by an optical topometer from mature leaves were automatically determined. Computer estimated stomatal complex density (SCD; R2 = 0.97) and stomatal complex area (SCA; R2 = 0.71) were strongly correlated with human measurements. Leaf gas exchange traits were genetically correlated with the dimensions and proportions of stomatal complexes (rg = 0.39-0.71) but did not correlate with SCD. Heritability of epidermal traits was moderate to high (h2 = 0.42-0.82) across two field seasons. Thirty-six QTL were consistently identified for a given trait in both years. Twenty-four clusters of overlapping QTL for multiple traits were identified, with univariate versus multivariate single marker analysis providing evidence consistent with pleiotropy in multiple cases. Putative orthologs of genes known to regulate stomatal patterning in Arabidopsis (Arabidopsis thaliana) were located within some, but not all, of these regions. This study demonstrates how discovery of the genetic basis for stomatal patterning can be accelerated in maize, a C4 model species where these processes are poorly understood.
Collapse
|
19
|
An Assessment of the Factors Influencing the Prediction Accuracy of Genomic Prediction Models Across Multiple Environments. Front Genet 2021; 12:689319. [PMID: 34367248 PMCID: PMC8343134 DOI: 10.3389/fgene.2021.689319] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2021] [Accepted: 06/07/2021] [Indexed: 11/13/2022] Open
Abstract
The effects of climate change create formidable challenges for breeders striving to produce sufficient food quantities in rapidly changing environments. It is therefore critical to investigate the ability of multi-environment genomic prediction (GP) models to predict genomic estimated breeding values (GEBVs) in extreme environments. Exploration of the impact of training set composition on the accuracy of such GEBVs is also essential. Accordingly, we examined the influence of the number of training environments and the use of environmental covariates (ECs) in GS models on four subsets of n = 500 lines of the soybean nested association mapping (SoyNAM) panel grown in nine environments in the US-North Central Region. The ensuing analyses provided insights into the influence of both of these factors for predicting grain yield in the most and the least extreme of these environments. We found that only a subset of the available environments was needed to obtain the highest observed prediction accuracies. The inclusion of ECs in the GP model did not substantially increase prediction accuracies relative to competing models, and instead more often resulted in negative prediction accuracies. Combined with the overall low prediction accuracies for grain yield in the most extreme environment, our findings highlight weaknesses in current GP approaches for prediction in extreme environments, and point to specific areas on which to focus future research efforts.
Collapse
|
20
|
Genomic Prediction of Yield Traits in Single-Cross Hybrid Rice ( Oryza sativa L.). Front Genet 2021; 12:692870. [PMID: 34276796 PMCID: PMC8278103 DOI: 10.3389/fgene.2021.692870] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2021] [Accepted: 06/09/2021] [Indexed: 11/16/2022] Open
Abstract
Hybrid rice varieties can outyield the best inbred varieties by 15 – 30% with appropriate management. However, hybrid rice requires more inputs and management than inbred rice to realize a yield advantage in high-yielding environments. The development of stress-tolerant hybrid rice with lowered input requirements could increase hybrid rice yield relative to production costs. We used genomic prediction to evaluate the combining abilities of 564 stress-tolerant lines used to develop Green Super Rice with 13 male sterile lines of the International Rice Research Institute for yield-related traits. We also evaluated the performance of their F1 hybrids. We identified male sterile lines with good combining ability as well as F1 hybrids with potential further use in product development. For yield per plant, accuracies of genomic predictions of hybrid genetic values ranged from 0.490 to 0.822 in cross-validation if neither parent or up to both parents were included in the training set, and both general and specific combining abilities were modeled. The accuracy of phenotypic selection for hybrid yield per plant was 0.682. The accuracy of genomic predictions of male GCA for yield per plant was 0.241, while the accuracy of phenotypic selection was 0.562. At the observed accuracies, genomic prediction of hybrid genetic value could allow improved identification of high-performing single crosses. In a reciprocal recurrent genomic selection program with an accelerated breeding cycle, observed male GCA genomic prediction accuracies would lead to similar rates of genetic gain as phenotypic selection. It is likely that prediction accuracies of male GCA could be improved further by targeted expansion of the training set. Additionally, we tested the correlation of parental genetic distance with mid-parent heterosis in the phenotyped hybrids. We found the average mid-parent heterosis for yield per plant to be consistent with existing literature values at 32.0%. In the overall population of study, parental genetic distance was significantly negatively correlated with mid-parent heterosis for yield per plant (r = −0.131) and potential yield (r = −0.092), but within female families the correlations were non-significant and near zero. As such, positive parental genetic distance was not reliably associated with positive mid-parent heterosis.
Collapse
|
21
|
Eleven biosynthetic genes explain the majority of natural variation in carotenoid levels in maize grain. THE PLANT CELL 2021; 33:882-900. [PMID: 33681994 PMCID: PMC8226291 DOI: 10.1093/plcell/koab032] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/18/2020] [Accepted: 01/26/2021] [Indexed: 05/03/2023]
Abstract
Vitamin A deficiency remains prevalent in parts of Asia, Latin America, and sub-Saharan Africa where maize (Zea mays) is a food staple. Extensive natural variation exists for carotenoids in maize grain. Here, to understand its genetic basis, we conducted a joint linkage and genome-wide association study of the US maize nested association mapping panel. Eleven of the 44 detected quantitative trait loci (QTL) were resolved to individual genes. Six of these were correlated expression and effect QTL (ceeQTL), showing strong correlations between RNA-seq expression abundances and QTL allelic effect estimates across six stages of grain development. These six ceeQTL also had the largest percentage of phenotypic variance explained, and in major part comprised the three to five loci capturing the bulk of genetic variation for each trait. Most of these ceeQTL had strongly correlated QTL allelic effect estimates across multiple traits. These findings provide an in-depth genome-level understanding of the genetic and molecular control of carotenoids in plants. In addition, these findings provide a roadmap to accelerate breeding for provitamin A and other priority carotenoid traits in maize grain that should be readily extendable to other cereals.
Collapse
|
22
|
Diversifying maize genomic selection models. MOLECULAR BREEDING : NEW STRATEGIES IN PLANT IMPROVEMENT 2021; 41:33. [PMID: 37309328 PMCID: PMC10236107 DOI: 10.1007/s11032-021-01221-4] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/16/2020] [Accepted: 03/07/2021] [Indexed: 06/14/2023]
Abstract
Genomic selection (GS) is one of the most powerful tools available for maize breeding. Its use of genome-wide marker data to estimate breeding values translates to increased genetic gains with fewer breeding cycles. In this review, we cover the history of GS and highlight particular milestones during its adaptation to maize breeding. We discuss how GS can be applied to developing superior maize inbreds and hybrids. Additionally, we characterize refinements in GS models that could enable the encapsulation of non-additive genetic effects, genotype by environment interactions, and multiple levels of the biological hierarchy, all of which could ultimately result in more accurate predictions of breeding values. Finally, we suggest the stages in a maize breeding program where it would be beneficial to apply GS. Given the current sophistication of high-throughput phenotypic, genotypic, and other -omic level data currently available to the maize community, now is the time to explore the implications of their incorporation into GS models and thus ensure that genetic gains are being achieved as quickly and efficiently as possible.
Collapse
|
23
|
HAPPI GWAS: Holistic Analysis with Pre- and Post-Integration GWAS. Bioinformatics 2021; 36:4655-4657. [PMID: 32579187 DOI: 10.1093/bioinformatics/btaa589] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2020] [Revised: 05/29/2020] [Accepted: 06/16/2020] [Indexed: 12/20/2022] Open
Abstract
MOTIVATION Advanced publicly available sequencing data from large populations have enabled informative genome-wide association studies (GWAS) that associate SNPs with phenotypic traits of interest. Many publicly available tools able to perform GWAS have been developed in response to increased demand. However, these tools lack a comprehensive pipeline that includes both pre-GWAS analysis, such as outlier removal, data transformation and calculation of Best Linear Unbiased Predictions or Best Linear Unbiased Estimates. In addition, post-GWAS analysis, such as haploblock analysis and candidate gene identification, is lacking. RESULTS Here, we present Holistic Analysis with Pre- and Post-Integration (HAPPI) GWAS, an open-source GWAS tool able to perform pre-GWAS, GWAS and post-GWAS analysis in an automated pipeline using the command-line interface. AVAILABILITY AND IMPLEMENTATION HAPPI GWAS is written in R for any Unix-like operating systems and is available on GitHub (https://github.com/Angelovici-Lab/HAPPI.GWAS.git). SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
|
24
|
How Well Can Multivariate and Univariate GWAS Distinguish Between True and Spurious Pleiotropy? Front Genet 2021; 11:602526. [PMID: 33584799 PMCID: PMC7873880 DOI: 10.3389/fgene.2020.602526] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2020] [Accepted: 12/11/2020] [Indexed: 11/13/2022] Open
Abstract
Quantification of the simultaneous contributions of loci to multiple traits, a phenomenon called pleiotropy, is facilitated by the increased availability of high-throughput genotypic and phenotypic data. To understand the prevalence and nature of pleiotropy, the ability of multivariate and univariate genome-wide association study (GWAS) models to distinguish between pleiotropic and non-pleiotropic loci in linkage disequilibrium (LD) first needs to be evaluated. Therefore, we used publicly available maize and soybean genotypic data to simulate multiple pairs of traits that were either (i) controlled by quantitative trait nucleotides (QTNs) on separate chromosomes, (ii) controlled by QTNs in various degrees of LD with each other, or (iii) controlled by a single pleiotropic QTN. We showed that multivariate GWAS could not distinguish between QTNs in LD and a single pleiotropic QTN. In contrast, a unique QTN detection rate pattern was observed for univariate GWAS whenever the simulated QTNs were in high LD or pleiotropic. Collectively, these results suggest that multivariate and univariate GWAS should both be used to infer whether or not causal mutations underlying peak GWAS associations are pleiotropic. Therefore, we recommend that future studies use a combination of multivariate and univariate GWAS models, as both models could be useful for identifying and narrowing down candidate loci with potential pleiotropic effects for downstream biological experiments.
Collapse
|
25
|
Single-cell RNA sequencing of developing maize ears facilitates functional analysis and trait candidate gene discovery. Dev Cell 2021; 56:557-568.e6. [PMID: 33400914 DOI: 10.1016/j.devcel.2020.12.015] [Citation(s) in RCA: 102] [Impact Index Per Article: 34.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2020] [Revised: 10/31/2020] [Accepted: 12/15/2020] [Indexed: 12/30/2022]
Abstract
Crop productivity depends on activity of meristems that produce optimized plant architectures, including that of the maize ear. A comprehensive understanding of development requires insight into the full diversity of cell types and developmental domains and the gene networks required to specify them. Until now, these were identified primarily by morphology and insights from classical genetics, which are limited by genetic redundancy and pleiotropy. Here, we investigated the transcriptional profiles of 12,525 single cells from developing maize ears. The resulting developmental atlas provides a single-cell RNA sequencing (scRNA-seq) map of an inflorescence. We validated our results by mRNA in situ hybridization and by fluorescence-activated cell sorting (FACS) RNA-seq, and we show how these data may facilitate genetic studies by predicting genetic redundancy, integrating transcriptional networks, and identifying candidate genes associated with crop yield traits.
Collapse
|
26
|
simplePHENOTYPES: SIMulation of pleiotropic, linked and epistatic phenotypes. BMC Bioinformatics 2020; 21:491. [PMID: 33129253 PMCID: PMC7603745 DOI: 10.1186/s12859-020-03804-y] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2020] [Accepted: 10/08/2020] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Advances in genotyping and phenotyping techniques have enabled the acquisition of a great amount of data. Consequently, there is an interest in multivariate statistical analyses that identify genomic regions likely to contain causal mutations affecting multiple traits (i.e., pleiotropy). As the demand for multivariate analyses increases, it is imperative that optimal tools are available to assess their performance. To facilitate the testing and validation of these multivariate approaches, we developed simplePHENOTYPES, an R/CRAN package that simulates pleiotropy, partial pleiotropy, and spurious pleiotropy in a wide range of genetic architectures, including additive, dominance and epistatic models. RESULTS We illustrate simplePHENOTYPES' ability to simulate thousands of phenotypes in less than one minute. We then provide two vignettes illustrating how to simulate sets of correlated traits in simplePHENOTYPES. Finally, we demonstrate the use of results from simplePHENOTYPES in a standard GWAS software, as well as the equivalence of simulated phenotypes from simplePHENOTYPES and other packages with similar capabilities. CONCLUSIONS simplePHENOTYPES is a R/CRAN package that makes it possible to simulate multiple traits controlled by loci with varying degrees of pleiotropy. Its ability to interface with both commonly-used marker data formats and downstream quantitative genetics software and packages should facilitate a rigorous assessment of both existing and emerging statistical GWAS and GS approaches. simplePHENOTYPES is also available at https://github.com/samuelbfernandes/simplePHENOTYPES .
Collapse
|
27
|
Genetic variation associated with PPO-inhibiting herbicide tolerance in sorghum. PLoS One 2020; 15:e0233254. [PMID: 33052910 PMCID: PMC7556536 DOI: 10.1371/journal.pone.0233254] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2020] [Accepted: 08/26/2020] [Indexed: 11/29/2022] Open
Abstract
Herbicide application is crucial for weed management in most crop production systems, but for sorghum herbicide options are limited. Sorghum is sensitive to residual protoporphyrinogen oxidase (PPO)-inhibiting herbicides, such as fomesafen, and a long re-entry period is required before sorghum can be planted after its application. Improving sorghum for tolerance to such residual herbicides would allow for increased sorghum production and the expansion of herbicide options for growers. In this study, we observed sorghum tolerance to residual fomesafen. To investigate the underlying tolerance mechanism a genome-wide association mapping study was conducted using field-collected sorghum biomass panel (SBP) data, and a greenhouse assay was developed to confirm the field phenotypes. A total of 26 significant SNPs (FDR<0.05), spanning a 215.3 kb region on chromosome 3, were detected. The ten most significant SNPs included two in genic regions (Sobic.003G136800, and Sobic.003G136900) and eight SNPs in the intergenic region encompassing the genes Sobic.003G136700, Sobic.003G136800, Sobic.003G137000, Sobic.003G136900, and Sobic.003G137100. The gene Sobic.003G137100 (PPXI), which encodes the PPO1 enzyme, one of the targets of PPO-inhibiting herbicides, was located 12kb downstream of the significant SNP S03_13152838. We found that PPXI is highly conserved in sorghum and expression does not significantly differ between tolerant and sensitive sorghum lines. Our results suggest that PPXI most likely does not underlie the observed herbicide tolerance. Instead, the mechanism underlying herbicide tolerance in the SBP is likely metabolism-based resistance, possibly regulated by the action of multiple genes. Further research is necessary to confirm candidate genes and their functions.
Collapse
|
28
|
CorrigendumMulti-Trait Genome-Wide Association Studies Reveal Loci Associated with Maize Inflorescence and Leaf Architecture. PLANT & CELL PHYSIOLOGY 2020; 61:1530. [PMID: 32812044 DOI: 10.1093/pcp/pcaa078] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
|
29
|
Multi-Trait Genome-Wide Association Studies Reveal Loci Associated with Maize Inflorescence and Leaf Architecture. PLANT & CELL PHYSIOLOGY 2020; 61:1427-1437. [PMID: 32186727 DOI: 10.1093/pcp/pcaa039] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/24/2020] [Accepted: 03/17/2020] [Indexed: 05/23/2023]
Abstract
Maize inflorescence is a complex phenotype that involves the physical and developmental interplay of multiple traits. Given the evidence that genes could pleiotropically contribute to several of these traits, we used publicly available maize data to assess the ability of multivariate genome-wide association study (GWAS) approaches to identify pleiotropic quantitative trait loci (pQTL). Our analysis of 23 publicly available inflorescence and leaf-related traits in a diversity panel of n = 281 maize lines genotyped with 376,336 markers revealed that the two multivariate GWAS approaches we tested were capable of identifying pQTL in genomic regions coinciding with similar associations found in previous studies. We then conducted a parallel simulation study on the same individuals, where it was shown that multivariate GWAS approaches yielded a higher true-positive quantitative trait nucleotide (QTN) detection rate than comparable univariate approaches for all evaluated simulation settings except for when the correlated simulated traits had a heritability of 0.9. We therefore conclude that the implementation of state-of-the-art multivariate GWAS approaches is a useful tool for dissecting pleiotropy and their more widespread implementation could facilitate the discovery of genes and other biological mechanisms underlying maize inflorescence.
Collapse
|
30
|
Exploration of Life-Course Factors Influencing Phenotypic Outcomes in Crops. PLANT & CELL PHYSIOLOGY 2020; 61:1381-1383. [PMID: 32603418 PMCID: PMC7434585 DOI: 10.1093/pcp/pcaa087] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/03/2020] [Accepted: 06/22/2020] [Indexed: 05/06/2023]
|
31
|
Population Structure Analyses Provide Insight into the Source Populations Underlying Rural Isolated Communities in Illinois. Hum Biol 2020; 91:31-47. [PMID: 32073243 DOI: 10.13110/humanbiology.91.1.05] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2019] [Accepted: 11/12/2019] [Indexed: 11/18/2022]
Abstract
We have previously hypothesized that relatively small and isolated rural communities may experience founder effects, defined as the genetic ramifications of small population sizes at the time of a community's establishment. To explore this, we used an Illumina Infinium Omni2.5Exome-8 chip to collect data from 157 individuals from four Illinois communities, three rural and one urban. Genetic diversity estimates of 999,259 autosomal markers suggested that the reduction in heterozygosity due to shared ancestry was approximately 0, indicating a randomly mating population. An eigenanalysis, which is similar to a principal component analysis but run on a genetic coancestry matrix, conducted in the SNPRelate R package revealed that most of these individuals formed one cluster, with a few putative outliers obscuring population variation. An additional eigenanalysis on the same markers in a combined data set including the 2,504 individuals in the 1000 Genomes database found that most of the 157 Illinois individuals clustered into one group in close proximity to individuals of European descent. A final eigenanalysis of the Illinois individuals with the 503 individuals of European descent (within the 1000 Genomes Project) revealed two clusters of individuals and likely two source populations; one British and one consisting of multiple European subpopulations. We therefore demonstrate the feasibility of examining genetic relatedness across Illinois populations and assessing the number of source populations using publicly available databases. When assessed, population structure information can contribute to the understanding of genetic history in rural populations.
Collapse
|
32
|
Training Population Optimization for Genomic Selection in Miscanthus. G3 (BETHESDA, MD.) 2020; 10:2465-2476. [PMID: 32457095 PMCID: PMC7341128 DOI: 10.1534/g3.120.401402] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/27/2020] [Accepted: 05/20/2020] [Indexed: 01/08/2023]
Abstract
Miscanthus is a perennial grass with potential for lignocellulosic ethanol production. To ensure its utility for this purpose, breeding efforts should focus on increasing genetic diversity of the nothospecies Miscanthus × giganteus (M×g) beyond the single clone used in many programs. Germplasm from the corresponding parental species M. sinensis (Msi) and M. sacchariflorus (Msa) could theoretically be used as training sets for genomic prediction of M×g clones with optimal genomic estimated breeding values for biofuel traits. To this end, we first showed that subpopulation structure makes a substantial contribution to the genomic selection (GS) prediction accuracies within a 538-member diversity panel of predominately Msi individuals and a 598-member diversity panels of Msa individuals. We then assessed the ability of these two diversity panels to train GS models that predict breeding values in an interspecific diploid 216-member M×g F2 panel. Low and negative prediction accuracies were observed when various subsets of the two diversity panels were used to train these GS models. To overcome the drawback of having only one interspecific M×g F2 panel available, we also evaluated prediction accuracies for traits simulated in 50 simulated interspecific M×g F2 panels derived from different sets of Msi and diploid Msa parents. The results revealed that genetic architectures with common causal mutations across Msi and Msa yielded the highest prediction accuracies. Ultimately, these results suggest that the ideal training set should contain the same causal mutations segregating within interspecific M×g populations, and thus efforts should be undertaken to ensure that individuals in the training and validation sets are as closely related as possible.
Collapse
|
33
|
Abstract
BACKGROUND The functional genome of agronomically important plant species remains largely unexplored, yet presents a virtually untapped resource for targeted crop improvement. Functional elements of regulatory DNA revealed through profiles of chromatin accessibility can be harnessed for fine-tuning gene expression to optimal phenotypes in specific environments. RESULT Here, we investigate the non-coding regulatory space in the maize (Zea mays) genome during early reproductive development of pollen- and grain-bearing inflorescences. Using an assay for differential sensitivity of chromatin to micrococcal nuclease (MNase) digestion, we profile accessible chromatin and nucleosome occupancy in these largely undifferentiated tissues and classify at least 1.6% of the genome as accessible, with the majority of MNase hypersensitive sites marking proximal promoters, but also 3' ends of maize genes. This approach maps regulatory elements to footprint-level resolution. Integration of complementary transcriptome profiles and transcription factor occupancy data are used to annotate regulatory factors, such as combinatorial transcription factor binding motifs and long non-coding RNAs, that potentially contribute to organogenesis, including tissue-specific regulation between male and female inflorescence structures. Finally, genome-wide association studies for inflorescence architecture traits based solely on functional regions delineated by MNase hypersensitivity reveals new SNP-trait associations in known regulators of inflorescence development as well as new candidates. CONCLUSIONS These analyses provide a comprehensive look into the cis-regulatory landscape during inflorescence differentiation in a major cereal crop, which ultimately shapes architecture and influences yield potential.
Collapse
|
34
|
mGWAS Uncovers Gln-Glucosinolate Seed-Specific Interaction and its Role in Metabolic Homeostasis. PLANT PHYSIOLOGY 2020; 183:483-500. [PMID: 32317360 PMCID: PMC7271782 DOI: 10.1104/pp.20.00039] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/15/2020] [Accepted: 03/26/2020] [Indexed: 05/04/2023]
Abstract
Gln is a key player in plant metabolism. It is one of the major free amino acids that is transported into the developing seed and is central for nitrogen metabolism. However, Gln natural variation and its regulation and interaction with other metabolic processes in seeds remain poorly understood. To investigate the latter, we performed a metabolic genome-wide association study (mGWAS) of Gln-related traits measured from the dry seeds of the Arabidopsis (Arabidopsis thaliana) diversity panel using all potential ratios between Gln and the other members of the Glu family as traits. This semicombinatorial approach yielded multiple candidate genes that, upon further analysis, revealed an unexpected association between the aliphatic glucosinolates (GLS) and the Gln-related traits. This finding was confirmed by an independent quantitative trait loci mapping and statistical analysis of the relationships between the Gln-related traits and the presence of specific GLS in seeds. Moreover, an analysis of Arabidopsis mutants lacking GLS showed an extensive seed-specific impact on Gln levels and composition that manifested early in seed development. The elimination of GLS in seeds was associated with a large effect on seed nitrogen and sulfur homeostasis, which conceivably led to the Gln response. This finding indicates that both Gln and GLS play key roles in shaping the seed metabolic homeostasis. It also implies that select secondary metabolites might have key functions in primary seed metabolism. Finally, our study shows that an mGWAS performed on dry seeds can uncover key metabolic interactions that occur early in seed development.
Collapse
|
35
|
Healthy Planet, Healthy Youth: A Food Systems Education and Promotion Intervention to Improve Adolescent Diet Quality and Reduce Food Waste. Nutrients 2019; 11:E1869. [PMID: 31405231 PMCID: PMC6723537 DOI: 10.3390/nu11081869] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2019] [Revised: 08/02/2019] [Accepted: 08/08/2019] [Indexed: 02/07/2023] Open
Abstract
Emerging evidence suggests a link between young people's interest in alternative food production practices and dietary quality. The primary purpose of this study was to examine the impact of a student-driven sustainable food systems education and promotion intervention on adolescent school lunch selection, consumption, and waste behaviors. Sixth grade science teachers at two middle schools (n = 268 students) implemented a standards-based curriculum on sustainable food systems, addressing the environmental impacts of food choices and food waste. The cumulating curriculum activity required the 6th grade students to share their food systems knowledge with their 7th and 8th grade counterparts (n = 426) through a cafeteria promotional campaign to discourage food waste. School-wide monthly plate waste assessments were used to evaluate changes in vegetable consumption and overall plate waste using a previously validated digital photography method. At baseline, the intervention students consumed significantly less vegetables relative to the control group (47.1% and 71.8% of vegetables selected, respectively (p = 0.006). This disparity was eliminated after the intervention with the intervention group consuming 69.4% and the control consuming 68.1% of selected vegetables (p = 0.848). At five months follow up, the intervention group wasted significantly less salad bar vegetables compared to the control group (24.2 g and 50.1 g respectively (p = 0.029). These findings suggest that food systems education can be used to promote improved dietary behaviors among adolescent youth.
Collapse
|
36
|
Abstract
Goss's bacterial wilt and leaf blight is one of the most important foliar diseases of maize ( L.). To date, neither large-effect resistance genes, nor practical chemical controls exist to manage the disease. Thus, the importance of discovering durable host resistance necessitates additional genetic mapping for this disease. Unfortunately, because of the biology of the pathogen and the highly significant genotype-by-environment interaction effect observed with Goss's wilt, consistent phenotyping across multiple years poses a hurdle for genetic studies and conventional breeding methods. The objective of this study was to perform a genome-wide association study (GWAS) to identify regions of the genome associated with Goss's wilt resistance as well as to use genomic prediction models to evaluate the utility of genomic selection (GS) in predicting Goss's wilt phenotypes in a panel of diverse maize lines. Using genome-wide association mapping, we were unable to identify any variants significantly associated with Goss's wilt. However, using genomic prediction we were able to train a model with an accuracy of 0.69. Taken together, this suggests that resistance to Goss's wilt is highly polygenic. In addition, when evaluating the accuracy of our prediction model under reduced marker density, it was shown that only 10,000 single nucleotide polymorphisms (SNPs), or ∼20% of our total marker set, was necessary to achieve prediction accuracies similar to the full marker set. This is the first report of genomic prediction for a bacterial disease of maize, and these results highlight the potential of GS for disease resistance in maize.
Collapse
|
37
|
Evaluation of RR-BLUP Genomic Selection Models that Incorporate Peak Genome-Wide Association Study Signals in Maize and Sorghum. THE PLANT GENOME 2019; 12. [PMID: 30951091 DOI: 10.3835/plantgenome2018.07.0052] [Citation(s) in RCA: 52] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/02/2023]
Abstract
Certain agronomic crop traits are complex and thus governed by many small-effect loci. Statistical models typically used in a genome-wide association study (GWAS) and genomic selection (GS) quantify these signals by assessing genomic marker contributions in linkage disequilibrium (LD) with these loci to trait variation. These models have been used in separate quantitative genetics contexts until recently, when, in published studies, the predictive ability of GS models that include peak associated markers from a GWAS as fixed-effect covariates was assessed. Previous work suggests that such models could be useful for predicting traits controlled by several large-effect and many small-effect genes. We expand this work by evaluating simulated traits from diversity panels in maize ( L.) and sorghum [ (L.) Moench] using ridge-regression best linear unbiased prediction (RR-BLUP) models that include fixed-effect covariates tagging peak GWAS signals. The ability of such covariates to increase GS prediction accuracy in the RR-BLUP model under a wide variety of genetic architectures and genomic backgrounds was quantified. Of the 216 genetic architectures that we simulated, we identified 60 where the addition of fixed-effect covariates boosted prediction accuracy. However, for the majority of the simulated data, no increase or a decrease in prediction accuracy was observed. We also noted several instances where the inclusion of fixed-effect covariates increased both the variability of prediction accuracies and the bias of the genomic estimated breeding values. We therefore recommend that the performance of such a GS model be explored on a trait-by-trait basis prior to its implementation into a breeding program.
Collapse
|
38
|
Stability Analysis of Kernel Quality Traits in Exotic-Derived Doubled Haploid Maize Lines. THE PLANT GENOME 2019; 12. [PMID: 30951103 DOI: 10.3835/plantgenome2017.12.0114] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/10/2023]
Abstract
Variation in kernel composition across maize ( L.) germplasm is affected by a combination of the plant's genotype, the environment in which it is grown, and the interaction between these two elements. Adapting exotic germplasm to the US Corn Belt is highly dependent on the plant's genotype, the environment where it is grown, and the interaction between these components. Phenotypic plasticity is ill-defined when specific exotic germplasm is moved over large latitudinal distances and for the adapted variants being created. Reduced plasticity (or stability) is desired for the adapted variants, as it allows for a more rapid implementation into breeding programs throughout the Corn Belt. Here, doubled haploid lines derived from exotic maize and adapted through backcrossing exotic germplasm to elite adapted lines were used in conjunction with genome-wide association studies to explore stability in four kernel composition traits. Genotypes demonstrated a response to environments that paralleled the mean response of all genotypes used across all traits, with protein content and kernel density exhibiting the highest levels of Type II stability. Genes such as , , and were identified as potential candidates within quantitative trait locus regions. The findings within this study aid in validating previously identified genomic regions and identified novel genomic regions affecting kernel quality traits.
Collapse
|
39
|
An assessment of the performance of the logistic mixed model for analyzing binary traits in maize and sorghum diversity panels. PLoS One 2018; 13:e0207752. [PMID: 30462727 PMCID: PMC6248992 DOI: 10.1371/journal.pone.0207752] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2018] [Accepted: 11/06/2018] [Indexed: 11/18/2022] Open
Abstract
The logistic mixed model (LMM) is well-suited for the genome-wide association study (GWAS) of binary agronomic traits because it can include fixed and random effects that account for spurious associations. The recent implementation of a computationally efficient model fitting and testing approach now makes it practical to use the LMM to search for markers associated with such binary traits on a genome-wide scale. Therefore, the purpose of this work was to assess the applicability of the LMM for GWAS in crop diversity panels. We dichotomized three publicly available quantitative traits in a maize diversity panel and two quantitative traits in a sorghum diversity panel, and them performed a GWAS using both the LMM and the unified mixed linear model (MLM) on these dichotomized traits. Our results suggest that the LMM is capable of identifying statistically significant marker-trait associations in the same genomic regions highlighted in previous studies, and this ability is consistent across both diversity panels. We also show how subpopulation structure in the maize diversity panel can underscore the LMM’s superior control for spurious associations compared to the unified MLM. These results suggest that the LMM is a viable model to use for the GWAS of binary traits in crop diversity panels and we therefore encourage its broader implementation in the agronomic research community.
Collapse
|
40
|
An assessment of true and false positive detection rates of stepwise epistatic model selection as a function of sample size and number of markers. Heredity (Edinb) 2018; 122:660-671. [PMID: 30443009 PMCID: PMC6462028 DOI: 10.1038/s41437-018-0162-2] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2018] [Revised: 10/19/2018] [Accepted: 10/28/2018] [Indexed: 12/21/2022] Open
Abstract
Association studies have been successful at identifying genomic regions associated with important traits, but routinely employ models that only consider the additive contribution of an individual marker. Because quantitative trait variability typically arises from multiple additive and non-additive sources, utilization of statistical approaches that include main and two-way interaction marker effects of several loci in one model could lead to unprecedented characterization of these sources. Here we examine the ability of one such approach, called the Stepwise Procedure for constructing an Additive and Epistatic Multi-Locus model (SPAEML), to detect additive and epistatic signals simulated using maize and human marker data. Our results revealed that SPAEML was capable of detecting quantitative trait nucleotides (QTNs) at sample sizes as low as n = 300 and consistently specifying signals as additive and epistatic for larger sizes. Sample size and minor allele frequency had a major influence on SPAEML's ability to distinguish between additive and epistatic signals, while the number of markers tested did not. We conclude that SPAEML is a useful approach for providing further elucidation of the additive and epistatic sources contributing to trait variability when applied to a small subset of genome-wide markers located within specific genomic regions identified using a priori analyses.
Collapse
|
41
|
Genetic Architecture of Soybean Yield and Agronomic Traits. G3 (BETHESDA, MD.) 2018; 8:3367-3375. [PMID: 30131329 PMCID: PMC6169381 DOI: 10.1534/g3.118.200332] [Citation(s) in RCA: 55] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/16/2018] [Accepted: 08/16/2018] [Indexed: 01/31/2023]
Abstract
Soybean is the world's leading source of vegetable protein and demand for its seed continues to grow. Breeders have successfully increased soybean yield, but the genetic architecture of yield and key agronomic traits is poorly understood. We developed a 40-mating soybean nested association mapping (NAM) population of 5,600 inbred lines that were characterized by single nucleotide polymorphism (SNP) markers and six agronomic traits in field trials in 22 environments. Analysis of the yield, agronomic, and SNP data revealed 23 significant marker-trait associations for yield, 19 for maturity, 15 for plant height, 17 for plant lodging, and 29 for seed mass. A higher frequency of estimated positive yield alleles was evident from elite founder parents than from exotic founders, although unique desirable alleles from the exotic group were identified, demonstrating the value of expanding the genetic base of US soybean breeding.
Collapse
|
42
|
Association Mapping of Flowering and Height Traits in Germplasm Enhancement of Maize Doubled Haploid (GEM-DH) Lines. THE PLANT GENOME 2018; 11. [PMID: 30025021 DOI: 10.3835/plantgenome2017.09.0083] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/10/2023]
Abstract
Flowering and height related traits are extensively studied in maize for three main reasons: 1) easily obtained phenotypic measurements, 2) highly heritable, and 3) importance of these traits to adaptation and grain yield. However, variation in flowering and height traits is extensive and findings from previous studies are genotype specific. Herein, a diverse panel of exotic derived doubled haploid lines, in conjunction with genome-wide association analysis, is used to further explore adaptation related trait variation of exotic germplasm for potential use in adapting exotic germplasm to the U.S. Corn-Belt. Phenotypes for the association panel were obtained from six locations across the central-U.S. and genotyping was performed using the genotyping-by-sequencing method. Nineteen flowering time candidate genes were found for three flowering traits. Eighteen candidate genes were found for four height related traits, with the majority of the candidate genes relating to plant hormones auxin and gibberellin. A single gene was discovered for ear height that also had effects on -like flowering gene expression levels. Findings will be used to inform future research efforts of the USDA Germplasm Enhancement of Maize project and eventually aid in the rapid adaptation of exotic germplasm to temperate U.S. environments.
Collapse
|
43
|
Selinene Volatiles Are Essential Precursors for Maize Defense Promoting Fungal Pathogen Resistance. PLANT PHYSIOLOGY 2017; 175:1455-1468. [PMID: 28931629 PMCID: PMC5664469 DOI: 10.1104/pp.17.00879] [Citation(s) in RCA: 45] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/06/2017] [Accepted: 09/18/2017] [Indexed: 05/21/2023]
Abstract
To ensure food security, maize (Zea mays) is a model crop for understanding useful traits underlying stress resistance. In contrast to foliar biochemicals, root defenses limiting the spread of disease remain poorly described. To better understand belowground defenses in the field, we performed root metabolomic profiling and uncovered unexpectedly high levels of the sesquiterpene volatile β-selinene and the corresponding nonvolatile antibiotic derivative β-costic acid. The application of metabolite-based quantitative trait locus mapping using biparental populations, genome-wide association studies, and near-isogenic lines enabled the identification of terpene synthase21 (ZmTps21) on chromosome 9 as a β-costic acid pathway candidate gene. Numerous closely examined β-costic acid-deficient inbred lines were found to harbor Zmtps21 pseudogenes lacking conserved motifs required for farnesyl diphosphate cyclase activity. For biochemical validation, a full-length ZmTps21 was cloned, heterologously expressed in Escherichia coli, and demonstrated to cyclize farnesyl diphosphate, yielding β-selinene as the dominant product. Consistent with microbial defense pathways, ZmTps21 transcripts strongly accumulate following fungal elicitation. Challenged field roots containing functional ZmTps21 alleles displayed β-costic acid levels over 100 μg g-1 fresh weight, greatly exceeding in vitro concentrations required to inhibit the growth of five different fungal pathogens and rootworm larvae (Diabrotica balteata). In vivo disease resistance assays, using ZmTps21 and Zmtps21 near-isogenic lines, further support the endogenous antifungal role of selinene-derived metabolites. Involved in the biosynthesis of nonvolatile antibiotics, ZmTps21 exists as a useful gene for germplasm improvement programs targeting optimized biotic stress resistance.
Collapse
|
44
|
Novel Loci Underlie Natural Variation in Vitamin E Levels in Maize Grain. THE PLANT CELL 2017; 29:2374-2392. [PMID: 28970338 PMCID: PMC5774569 DOI: 10.1105/tpc.17.00475] [Citation(s) in RCA: 62] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/19/2017] [Revised: 08/23/2017] [Accepted: 09/29/2017] [Indexed: 05/19/2023]
Abstract
Tocopherols, tocotrienols, and plastochromanols (collectively termed tocochromanols) are lipid-soluble antioxidants synthesized by all plants. Their dietary intake, primarily from seed oils, provides vitamin E and other health benefits. Tocochromanol biosynthesis has been dissected in the dicot Arabidopsis thaliana, which has green, photosynthetic seeds, but our understanding of tocochromanol accumulation in major crops, whose seeds are nonphotosynthetic, remains limited. To understand the genetic control of tocochromanols in grain, we conducted a joint linkage and genome-wide association study in the 5000-line U.S. maize (Zea mays) nested association mapping panel. Fifty-two quantitative trait loci for individual and total tocochromanols were identified, and of the 14 resolved to individual genes, six encode novel activities affecting tocochromanols in plants. These include two chlorophyll biosynthetic enzymes that explain the majority of tocopherol variation, which was not predicted given that, like most major cereal crops, maize grain is nonphotosynthetic. This comprehensive assessment of natural variation in vitamin E levels in maize establishes the foundation for improving tocochromanol and vitamin E content in seeds of maize and other major cereal crops.
Collapse
|
45
|
Brassinosteroid and gibberellin control of seedling traits in maize (Zea mays L.). PLANT SCIENCE : AN INTERNATIONAL JOURNAL OF EXPERIMENTAL PLANT BIOLOGY 2017; 263:132-141. [PMID: 28818369 DOI: 10.1016/j.plantsci.2017.07.011] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/08/2017] [Revised: 07/09/2017] [Accepted: 07/11/2017] [Indexed: 05/24/2023]
Abstract
In this study, we established two doubled haploid (DH) libraries with a total of 207 DH lines. We applied BR and GA inhibitors to all DH lines at seedling stage and measured seedling BR and GA inhibitor responses. Moreover, we evaluated field traits for each DH line (untreated). We conducted genome-wide association studies (GWAS) with 62,049 genome wide SNPs to explore the genetic control of seedling traits by BR and GA. In addition, we correlate seedling stage hormone inhibitor response with field traits. Large variation for BR and GA inhibitor response and field traits was observed across these DH lines. Seedling stage BR and GA inhibitor response was significantly correlate with yield and flowering time. Using three different GWAS approaches to balance false positive/negatives, multiple SNPs were discovered to be significantly associated with BR/GA inhibitor responses with some localized within gene models. SNPs from gene model GRMZM2G013391 were associated with GA inhibitor response across all three GWAS models. This gene is expressed in roots and shoots and was shown to regulate GA signaling. These results show that BRs and GAs have a great impact for controlling seedling growth. Gene models from GWAS results could be targets for seeding traits improvement.
Collapse
|
46
|
Abstract
Vertical growth of plants is a dynamic process that is influenced by genetic and environmental factors and has a pronounced effect on overall plant architecture and biomass composition. We have performed six controlled growth trials of an interspecific Setaria italica x Setaria viridis recombinant inbred line population to assess how the genetic architecture of plant height is influenced by developmental queues, water availability and planting density. The non-destructive nature of plant height measurements has enabled us to monitor height throughout the plant life cycle in both field and controlled environments. We find that plant height is reduced under water limitation and high density planting and affected by growth environment (field vs. growth chamber). The results support a model where plant height is a heritable, polygenic trait and that the major genetic loci that influence plant height function independent of growth environment. The identity and contribution of loci that influence height changes dynamically throughout development and the reduction of growth observed in water limited environments is a consequence of delayed progression through the genetic program which establishes plant height in Setaria. In this population, alleles inherited from the weedy S. viridis parent act to increase plant height early, whereas a larger number of small effect alleles inherited from the domesticated S. italica parent collectively act to increase plant height later in development. Growth is a dynamic process that responds to a changing environment. Most of the methods that we have for measuring are static and collecting information throughout an organisms lifecycle is labor and cost prohibitive. Advances in imaging and robotics technology have enabled novel approaches to understanding how plants adapt to the environment. Using the model grass Setaria and new methods for measuring parameters from images, we investigate the genetic architecture of plant height in response to water availability and planting density. Height is one of the most influential components of plant architecture, determining tradeoffs between competition and resource allocation and is an important trait for boosting yields. The non-destructive nature of plant height measurements has enabled us to monitor growth throughout the plant life cycle in both field and controlled environments. We identified several loci controlling height in a population derived from a wild strain of Setaria viridis and its domesticated relative Setaria italica, as well as the developmental time in which these loci act. In this population, alleles inherited from the wild parent act to increase plant height early, whereas a larger number of small effect alleles inherited from the domesticated parent collectively act to increase plant height later in development.
Collapse
|
47
|
NLR locus-mediated trade-off between abiotic and biotic stress adaptation in Arabidopsis. NATURE PLANTS 2017; 3:17072. [PMID: 28548656 DOI: 10.1038/nplants.2017.72] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/20/2016] [Accepted: 04/24/2017] [Indexed: 05/23/2023]
Abstract
Osmotic stress caused by drought, salt or cold decreases plant fitness. Acquired stress tolerance defines the ability of plants to withstand stress following an initial exposure1. We found previously that acquired osmotolerance after salt stress is widespread among Arabidopsis thaliana accessions2. Here, we identify ACQOS as the locus responsible for ACQUIRED OSMOTOLERANCE. Of its five haplotypes, only plants carrying group 1 ACQOS are impaired in acquired osmotolerance. ACQOS is identical to VICTR, encoding a nucleotide-binding leucine-rich repeat (NLR) protein3. In the absence of osmotic stress, group 1 ACQOS contributes to bacterial resistance. In its presence, ACQOS causes detrimental autoimmunity, thereby reducing osmotolerance. Analysis of natural variation at the ACQOS locus suggests that functional and non-functional ACQOS alleles are being maintained due to a trade-off between biotic and abiotic stress adaptation. Thus, polymorphism in certain plant NLR genes might be influenced by competing environmental stresses.
Collapse
|
48
|
Engaging rural communities in genetic research: challenges and opportunities. J Community Genet 2017; 8:209-219. [PMID: 28477297 DOI: 10.1007/s12687-017-0304-x] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2016] [Accepted: 04/27/2017] [Indexed: 12/13/2022] Open
Abstract
Statistical analyses of health and disease in rural communities is frequently limited by low sample counts. Still, some studies indicate increased risk for some diseases even after adjustment for known risk factors. It has been hypothesized that the context of community formation in rural areas facilitates the propagation of genetic founder effects-potentially impacting disease susceptibility. However, outright examination of genetic diversity in such communities has not been performed. Our objective was to engage otherwise research-inexperienced rural communities of largely European descent in genomic research in the context of cancer susceptibility. From September 2015 to February 2016, we implemented a systematic process of progressive community engagement. This iterative method sought project buy-in from first the town mayor, then village council. If approved by both, a focus group of community members examined how residents might view the research, informed consent and specimen collection, and issues of privacy. We were successful in engaging three of the four communities approached for the research project. There was universal enthusiasm for the project by all mayors and village councils. The focus groups' main point of discussion involved wording in the informed consent, with little concern regarding the research question or privacy. Perhaps contrary to popular thought, we found each community we approached to be both welcoming and enthusiastic about collaborating in research on genomic diversity. The systematic method of engagement did much to preserve community respect and autonomy and facilitated buy-in.
Collapse
|
49
|
Gibberellins Promote Brassinosteroids Action and Both Increase Heterosis for Plant Height in Maize ( Zea mays L.). FRONTIERS IN PLANT SCIENCE 2017; 8:1039. [PMID: 28676808 PMCID: PMC5477294 DOI: 10.3389/fpls.2017.01039] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/03/2017] [Accepted: 05/30/2017] [Indexed: 05/20/2023]
Abstract
Brassinosteroids (BRs) and Gibberellins (GAs) are two classes of plant hormones affecting plant height (PHT). Thus, manipulation of BR and GA levels or signaling enables optimization of crop grain and biomass yields. We established backcross (BC) families, selected for increased PHT, in two elite maize inbred backgrounds. Various exotic accessions used in the germplasm enhancement in maize project served as donors. BC1-derived doubled haploid lines in the same two elite maize inbred backgrounds established without selection for plant height were included for comparison. We conducted genome-wide association studies to explore the genetic control of PHT by BR and GA. In addition, we used BR and GA inhibitors to compare the relationship between PHT, BR, and GA in inbred lines and heterozygotes from a physiological and biological perspective. A total of 73 genomic loci were discovered to be associated with PHT, with seven co-localized with GA, and two co-localized with BR candidate genes. PHT determined in field trials was significantly correlated with seedling stage BR and GA inhibitor responses. However, this observation was only true for maize heterozygotes, not for inbred lines. Path analysis results suggest that heterozygosity increases GA levels, which in turn promote BR levels. Thus, at least part of heterosis for PHT in maize can be explained by increased GA and BR levels, and seedling stage hormone inhibitor response is promising to predict heterosis for PHT.
Collapse
|
50
|
Genomic Prediction of Single Crosses in the Early Stages of a Maize Hybrid Breeding Pipeline. G3 (BETHESDA, MD.) 2016; 6:3443-3453. [PMID: 27646704 PMCID: PMC5100843 DOI: 10.1534/g3.116.031286] [Citation(s) in RCA: 58] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/17/2016] [Accepted: 08/12/2016] [Indexed: 11/18/2022]
Abstract
Prediction of single-cross performance has been a major goal of plant breeders since the beginning of hybrid breeding. Recently, genomic prediction has shown to be a promising approach, but only limited studies have examined the accuracy of predicting single-cross performance. Moreover, no studies have examined the potential of predicting single crosses among random inbreds derived from a series of biparental families, which resembles the structure of germplasm comprising the initial stages of a hybrid maize breeding pipeline. The main objectives of this study were to evaluate the potential of genomic prediction for identifying superior single crosses early in the hybrid breeding pipeline and optimize its application. To accomplish these objectives, we designed and analyzed a novel population of single crosses representing the Iowa Stiff Stalk synthetic/non-Stiff Stalk heterotic pattern commonly used in the development of North American commercial maize hybrids. The performance of single crosses was predicted using parental combining ability and covariance among single crosses. Prediction accuracies were estimated using cross-validation and ranged from 0.28 to 0.77 for grain yield, 0.53 to 0.91 for plant height, and 0.49 to 0.94 for staygreen, depending on the number of tested parents of the single cross and genomic prediction method used. The genomic estimated general and specific combining abilities showed an advantage over genomic covariances among single crosses when one or both parents of the single cross were untested. Overall, our results suggest that genomic prediction of single crosses in the early stages of a hybrid breeding pipeline holds great potential to redesign hybrid breeding and increase its efficiency.
Collapse
|