1
|
Souza R, Rouf Mian MA, Vaughn JN, Li Z. Introgression of a Danbaekkong high-protein allele across different genetic backgrounds in soybean. Front Plant Sci 2023; 14:1308731. [PMID: 38173927 PMCID: PMC10761420 DOI: 10.3389/fpls.2023.1308731] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/07/2023] [Accepted: 11/28/2023] [Indexed: 01/05/2024]
Abstract
Soybean meal is a major component of livestock feed due to its high content and quality of protein. Understanding the genetic control of protein is essential to develop new cultivars with improved meal protein. Previously, a genomic region on chromosome 20 significantly associated with elevated protein content was identified in the cultivar Danbaekkong. The present research aimed to introgress the Danbaekkong high-protein allele into elite lines with different genetic backgrounds by developing and deploying robust DNA markers. A multiparent population consisting of 10 F5-derived populations with a total of 1,115 recombinant inbred lines (RILs) was developed using "Benning HP" as the donor parent of the Danbaekkong high-protein allele. A new functional marker targeting the 321-bp insertion in the gene Glyma.20g085100 was developed and used to track the Danbaekkong high-protein allele across the different populations and enable assessment of its effect and stability. Across all populations, the high-protein allele consistently increased the content, with an increase of 3.3% in seed protein. A total of 103 RILs were selected from the multiparent population for yield testing in five environments to assess the impact of the high-protein allele on yield and to enable the selection of new breeding lines with high protein and high yield. The results indicated that the high-protein allele impacts yield negatively in general; however, it is possible to select high-yielding lines with high protein content. An analysis of inheritance of the Chr 20 high-protein allele in Danbaekkong indicated that it originated from a Glycine soja line (PI 163453) and is the same as other G. soja lines studied. A survey of the distribution of the allele across 79 G. soja accessions and 35 Glycine max ancestors of North American soybean cultivars showed that the high-protein allele is present in all G. soja lines evaluated but not in any of the 35 North American soybean ancestors. These results demonstrate that G. soja accessions are a valuable source of favorable alleles for improvement of protein composition.
Collapse
Affiliation(s)
- Renan Souza
- Department of Crop and Soil Sciences, University of Georgia, Athens, GA, United States
| | - M. A. Rouf Mian
- Soybean and Nitrogen Fixation Research Unit, United States Department of Agriculture - Agricultural Research Service (USDA-ARS), Raleigh, NC, United States
| | - Justin N. Vaughn
- Department of Crop and Soil Sciences, University of Georgia, Athens, GA, United States
- Genomics and Bioinformatics Research Unit, United States Department of Agriculture - Agricultural Research Service (USDA-ARS), Athens, GA, United States
| | - Zenglu Li
- Department of Crop and Soil Sciences, University of Georgia, Athens, GA, United States
| |
Collapse
|
2
|
Affiliation(s)
- Jake C. Fountain
- Department of Biochemistry, Molecular Biology, Entomology, and Plant Pathology, Mississippi State University, Mississippi State, Mississippi, USA
| | | | - Justin N. Vaughn
- USDA-ARS, Genomics and Bioinformatics Research Unit, Stoneville, Mississippi, USA
| | - Baozhu Guo
- USDA-ARS, Crop Genetics and Breeding Research Unit, Tifton, Georgia, USA
| |
Collapse
|
3
|
Pan Z, Bajsa‐Hirschel J, Vaughn JN, Rimando AM, Baerson SR, Duke SO. In vivo assembly of the sorgoleone biosynthetic pathway and its impact on agroinfiltrated leaves of Nicotiana benthamiana. New Phytol 2021; 230:683-697. [PMID: 33460457 PMCID: PMC8048663 DOI: 10.1111/nph.17213] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/30/2020] [Accepted: 12/17/2020] [Indexed: 06/12/2023]
Abstract
Sorgoleone, a hydrophobic compound exuded from root hair cells of Sorghum spp., accounts for much of the allelopathic activity of the genus. The enzymes involved in the biosynthesis of this compound have been identified and functionally characterized. Here, we report the successful assembly of the biosynthetic pathway and the significant impact of in vivo synthesized sorgoleone on the heterologous host Nicotiana benthamiana. A multigene DNA construct was prepared for the expression of genes required for sorgoleone biosynthesis in planta and deployed in N. benthamiana leaf tissues via Agrobacterium-mediated transient expression. RNA-sequencing was conducted to investigate the effects of sorgoleone, via expression of its biosynthesis pathway, on host gene expression. The production of sorgoleone in agroinfiltrated leaves as detected by gas chromatography/mass spectrometry (GC/MS) resulted in the formation of necrotic lesions, indicating that the compound caused severe phytotoxicity to these tissues. RNA-sequencing profiling revealed significant changes in gene expression in the leaf tissues expressing the pathway during the formation of sorgoleone-induced necrotic lesions. Transcriptome analysis suggested that the compound produced in vivo impaired the photosynthetic system as a result of downregulated gene expression for the photosynthesis apparatus and elevated expression of proteasomal genes which may play a major role in the phytotoxicity of sorgoleone.
Collapse
Affiliation(s)
- Zhiqiang Pan
- Natural Products Utilization Research UnitUS Department of Agriculture, Agricultural Research ServiceUniversityMS38677USA
| | - Joanna Bajsa‐Hirschel
- Natural Products Utilization Research UnitUS Department of Agriculture, Agricultural Research ServiceUniversityMS38677USA
| | - Justin N. Vaughn
- Genomics and Bioinformatics Research UnitUSDA, ARSAthensGA30605USA
| | - Agnes M. Rimando
- Natural Products Utilization Research UnitUS Department of Agriculture, Agricultural Research ServiceUniversityMS38677USA
| | - Scott R. Baerson
- Natural Products Utilization Research UnitUS Department of Agriculture, Agricultural Research ServiceUniversityMS38677USA
| | - Stephen O. Duke
- Natural Products Utilization Research UnitUS Department of Agriculture, Agricultural Research ServiceUniversityMS38677USA
| |
Collapse
|
4
|
Vaughn JN, Korani W, Stein JC, Edwards JD, Peterson DG, Simpson SA, Youngblood RC, Grimwood J, Chougule K, Ware DH, McClung AM, Scheffler BE. Gene disruption by structural mutations drives selection in US rice breeding over the last century. PLoS Genet 2021; 17:e1009389. [PMID: 33735256 PMCID: PMC7971508 DOI: 10.1371/journal.pgen.1009389] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2020] [Accepted: 01/28/2021] [Indexed: 12/30/2022] Open
Abstract
The genetic basis of general plant vigor is of major interest to food producers, yet the trait is recalcitrant to genetic mapping because of the number of loci involved, their small effects, and linkage. Observations of heterosis in many crops suggests that recessive, malfunctioning versions of genes are a major cause of poor performance, yet we have little information on the mutational spectrum underlying these disruptions. To address this question, we generated a long-read assembly of a tropical japonica rice (Oryza sativa) variety, Carolina Gold, which allowed us to identify structural mutations (>50 bp) and orient them with respect to their ancestral state using the outgroup, Oryza glaberrima. Supporting prior work, we find substantial genome expansion in the sativa branch. While transposable elements (TEs) account for the largest share of size variation, the majority of events are not directly TE-mediated. Tandem duplications are the most common source of insertions and are highly enriched among 50-200bp mutations. To explore the relative impact of various mutational classes on crop fitness, we then track these structural events over the last century of US rice improvement using 101 resequenced varieties. Within this material, a pattern of temporary hybridization between medium and long-grain varieties was followed by recent divergence. During this long-term selection, structural mutations that impact gene exons have been removed at a greater rate than intronic indels and single-nucleotide mutations. These results support the use of ab initio estimates of mutational burden, based on structural data, as an orthogonal predictor in genomic selection. Some crop varieties have superior performance across years and environments. In hybrids, harmful mutations in one parent are masked by the ancestral alleles in the other parent, resulting in increased vigor. Unfortunately, these mutations are very difficult to identify precisely because, individually, they only have a small effect. In this study, we use long-read sequencing to characterize the entire mutational spectrum between two rice varieties. We then track these mutations through the last century of rice breeding. We show that large structural mutations in exons are selected against at a greater rate than any other mutational class. These findings illuminate the nature of deleterious alleles and will guide attempts to predict variety vigor based solely on genomic information.
Collapse
Affiliation(s)
- Justin N. Vaughn
- USDA-ARS, Genomics and Bioinformatics Research Unit, Stoneville, Mississippi, United States of America
- University of Georgia, Athens, Institute of Plant Breeding, Genetics, and Genomics, Athens, Georgia, United States of America
- * E-mail: (JNV); (BES)
| | - Walid Korani
- University of Georgia, Athens, Institute of Plant Breeding, Genetics, and Genomics, Athens, Georgia, United States of America
| | - Joshua C. Stein
- Cold Spring Harbor Laboratory, Cold Springs Harbor, New York, United States of America
| | - Jeremy D. Edwards
- USDA-ARS, Dale Bumpers National Rice Research Center, Stuttgart, Arkansas, United States of America
| | - Daniel G. Peterson
- Mississippi State University, Institute for Genomics, Biocomputing & Biotechnology, Starkville, Mississippi, United States of America
| | - Sheron A. Simpson
- USDA-ARS, Genomics and Bioinformatics Research Unit, Stoneville, Mississippi, United States of America
| | - Ramey C. Youngblood
- Mississippi State University, Institute for Genomics, Biocomputing & Biotechnology, Starkville, Mississippi, United States of America
| | - Jane Grimwood
- Hudson-Alpha Institute for Biotechnology, Huntsville, Alabama, United States of America
| | - Kapeel Chougule
- Cold Spring Harbor Laboratory, Cold Springs Harbor, New York, United States of America
| | - Doreen H. Ware
- Cold Spring Harbor Laboratory, Cold Springs Harbor, New York, United States of America
- USDA-ARS, Robert W. Holley Center for Agriculture and Health, Ithaca, New York, United States of America
| | - Anna M. McClung
- USDA-ARS, Dale Bumpers National Rice Research Center, Stuttgart, Arkansas, United States of America
| | - Brian E. Scheffler
- USDA-ARS, Genomics and Bioinformatics Research Unit, Stoneville, Mississippi, United States of America
- * E-mail: (JNV); (BES)
| |
Collapse
|
5
|
Stewart-Brown BB, Vaughn JN, Carter TE, Li Z. Characterizing the impact of an exotic soybean line on elite cultivar development. PLoS One 2020; 15:e0235434. [PMID: 32649700 PMCID: PMC7351202 DOI: 10.1371/journal.pone.0235434] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2020] [Accepted: 06/15/2020] [Indexed: 11/18/2022] Open
Abstract
The genetic diversity of North American soybean cultivars has been largely influenced by a small number of ancestors. High yielding breeding lines that possess exotic pedigrees have been developed, but identifying beneficial exotic alleles has been difficult as a result of complex interactions of yield alleles with genetic backgrounds and environments as well as the highly quantitative nature of yield. PI 416937 has been utilized in the development of many high yielding lines that have been entered into the USDA Southern States Uniform Tests over the past ~20 years. The primary goal of this research was to identify genomic regions under breeding selection from PI 416937 and introduce a methodology for identifying and potentially utilizing beneficial diversity from lines prevalent in the ancestry of elite cultivars. Utilizing SoySNP50K Infinium BeadChips, 52 high yielding PI 416937-derived lines as well as their parents were genotyped to identify PI 416937 alleles under breeding selection. Nine genomic regions across three chromosomes and 17 genomic regions across seven chromosomes were identified where PI 416937 alleles were under positive or negative selection. Minimal significant associations between PI 416937 alleles and yield were observed in replicated yield trials of five RIL populations, highlighting the difficulty of consistently detecting yield associations.
Collapse
Affiliation(s)
- Benjamin B. Stewart-Brown
- Department of Crop and Soil Sciences, Institute of Plant Breeding, Genetics and Genomics, University of Georgia, Athens, GA, United States of America
| | - Justin N. Vaughn
- Genomics and Bioinformatics Research Unit, USDA-ARS, Athens, GA, United States of America
| | - Thomas E. Carter
- Soybean & Nitrogen Fixation Unit, USDA-ARS, Raleigh, NC, United States of America
| | - Zenglu Li
- Department of Crop and Soil Sciences, Institute of Plant Breeding, Genetics and Genomics, University of Georgia, Athens, GA, United States of America
| |
Collapse
|
6
|
Díaz-Tielas C, Graña E, Sánchez-Moreiras AM, Reigosa MJ, Vaughn JN, Pan Z, Bajsa-Hirschel J, Duke MV, Duke SO. Transcriptome responses to the natural phytotoxin t-chalcone in Arabidopsis thaliana L. Pest Manag Sci 2019; 75:2490-2504. [PMID: 30868714 DOI: 10.1002/ps.5405] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/30/2019] [Revised: 03/04/2019] [Accepted: 03/13/2019] [Indexed: 06/09/2023]
Abstract
BACKGROUND New modes of action are needed for herbicides. The flavonoid synthesis intermediate t-chalcone causes apoptosis-like symptoms in roots and bleaching of shoots of Arabidospsis, suggesting a unique mode of action as a phytotoxin. RESULTS Using RNA-Seq, transcriptome changes were monitored in Arabidopsis seedlings during the first 24 h of exposure (at 1, 3, 6, 12 and 24 h) to 21 μm t-chalcone (I50 dose), examining effects on roots and shoots separately. Expression of 892 and 1000 genes was affected in roots and shoots, respectively. According to biological classification, many of the affected genes were transcription factors and genes associated with oxidative stress, heat shock proteins, xenobiotic detoxification, ABA and auxin biosynthesis, and primary metabolic processess. These are secondary effects found with most phytotoxins. Potent phytotoxins usually act by inhibiting enzymes of primary metabolism. KEGG pathway analysis of transcriptome results from the first 3 h of t-chalcone exposure indicated several potential primary metabolism target sites for t-chalcone. Of these, p-hydroxyphenylpyruvate dioxygenase (HPPD) and tyrosine amino transferase were consistent with the bleaching effect of the phytotoxin. Supplementation studies with Lemna paucicostata and Arabidiopsis supported HPPD as the target, although in vitro enzyme inhibition was not found. CONCLUSIONS t-Chalcone is possibly a protoxin that is converted to a HPPD inhibitor in vivo. © 2019 Society of Chemical Industry.
Collapse
Affiliation(s)
- Carla Díaz-Tielas
- Department of Plant Biology and Soil Science, University of Vigo, Vigo, Spain
| | - Elisa Graña
- Department of Plant Biology and Soil Science, University of Vigo, Vigo, Spain
| | | | - Manuel J Reigosa
- Department of Plant Biology and Soil Science, University of Vigo, Vigo, Spain
| | - Justin N Vaughn
- Genomics and Bioinformatics Research Unit, USDA, ARS, Athens, GA, USA
| | - Zhiqiang Pan
- Natural Products Utilization Research Unit, USDA, ARS, Oxford, MS, USA
| | | | - Mary V Duke
- Genomics and Bioinformatics Research, USDA, ARS, Stoneville, MS, USA
| | - Stephen O Duke
- Natural Products Utilization Research Unit, USDA, ARS, Oxford, MS, USA
| |
Collapse
|
7
|
Stewart-Brown BB, Song Q, Vaughn JN, Li Z. Genomic Selection for Yield and Seed Composition Traits Within an Applied Soybean Breeding Program. G3 (Bethesda) 2019; 9:2253-2265. [PMID: 31088906 PMCID: PMC6643879 DOI: 10.1534/g3.118.200917] [Citation(s) in RCA: 32] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/22/2018] [Accepted: 05/10/2019] [Indexed: 01/09/2023]
Abstract
Genomic selection (GS) has become viable for selection of quantitative traits for which marker-assisted selection has often proven less effective. The potential of GS for soybean was characterized using 483 elite breeding lines, genotyped with BARCSoySNP6K iSelect BeadChips. Cross validation was performed using RR-BLUP and predictive abilities (rMP) of 0.81, 0.71, and 0.26 for protein, oil, and yield, were achieved at the largest tested training set size. Minimal differences were observed when comparing different marker densities and there appeared to be inflation in rMP due to population structure. For comparison purposes, two additional methods to predict breeding values for lines of four bi-parental populations within the GS dataset were tested. The first method predicted within each bi-parental population (WP method) and utilized a training set of full-sibs of the validation set. The second method utilized a training set of all remaining breeding lines except for full-sibs of the validation set to predict across populations (AP method). The AP method is more practical as the WP method would likely delay the breeding cycle and leverage smaller training sets. Averaging across populations for protein and oil content, rMP for the AP method (0.55, 0.30) approached rMP for the WP method (0.60, 0.52). Though comparable, rMP for yield was low for both AP and WP methods (0.12, 0.13). Based on increases in rMP as training sets increased and the effectiveness of WP vs. AP method, the AP method could potentially improve with larger training sets and increased relatedness between training and validation sets.
Collapse
Affiliation(s)
- Benjamin B Stewart-Brown
- Institute of Plant Breeding, Genetics and Genomics and Dep. of Crop and Soil Sci., University of Georgia, Athens, GA 30602
| | - Qijian Song
- Soybean Genomics and Improvement Lab, USDA-ARS, Beltsville, MD 20705
| | - Justin N Vaughn
- Genomics and Bioinformatics Research Unit, USDA-ARS, Center for Applied Genetic Technologies, Athens, GA 30602
| | - Zenglu Li
- Institute of Plant Breeding, Genetics and Genomics and Dep. of Crop and Soil Sci., University of Georgia, Athens, GA 30602
| |
Collapse
|
8
|
Korani W, Vaughn JN. Crossword: A data-driven simulation language for the design of genetic-mapping experiments and breeding strategies. Sci Rep 2019; 9:4386. [PMID: 30867436 PMCID: PMC6416259 DOI: 10.1038/s41598-018-38348-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2018] [Accepted: 12/17/2018] [Indexed: 11/15/2022] Open
Abstract
Quantitative genetic simulations can save time and resources by optimizing the logistics of an experiment. Current tools are difficult to use by those unfamiliar with programming, and these tools rarely address the actual genetic structure of the population under study. Here, we introduce crossword, which utilizes the widely available re-sequencing and genomics data to create more realistic simulations and to reduce user burden. The software was written in R, to simplify installation and implementation. Because crossword is a domain-specific language, it allows complex and unique simulations to be performed, but the language is supported by a graphical interface that guides users through functions and options. We first show crossword’s utility in QTL-seq design, where its output accurately reflects empirical data. By introducing the concept of levels to reflect family relatedness, crossword can simulate a broad range of breeding programs and crops. Using levels, we further illustrate crossword’s capabilities by examining the effect of family size and number of selfing generations on phenotyping accuracy and genomic selection. Additionally, we explore the ramifications of large phenotypic difference between parents in a QTL mapping cross, a scenario that is common in crop genetics but often difficult to simulate.
Collapse
Affiliation(s)
- Walid Korani
- Center for Applied Genetic Technologies, The University of Georgia, Athens, GA, 30602, USA
| | - Justin N Vaughn
- United States Department of Agriculture, Athens, GA, 30602, USA.
| |
Collapse
|
9
|
Frailey DC, Chaluvadi SR, Vaughn JN, Coatney CG, Bennetzen JL. Gene loss and genome rearrangement in the plastids of five Hemiparasites in the family Orobanchaceae. BMC Plant Biol 2018; 18:30. [PMID: 29409454 PMCID: PMC5801802 DOI: 10.1186/s12870-018-1249-x] [Citation(s) in RCA: 46] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/15/2017] [Accepted: 01/30/2018] [Indexed: 05/09/2023]
Abstract
BACKGROUND The chloroplast genomes (plastome) of most plants are highly conserved in structure, gene content, and gene order. Parasitic plants, including those that are fully photosynthetic, often contain plastome rearrangements. These most notably include gene deletions that result in a smaller plastome size. The nature of gene loss and genome structural rearrangement has been investigated in several parasitic plants, but their timing and contributions to the adaptation of these parasites requires further investigation, especially among the under-studied hemi-parasites. RESULTS De novo sequencing, assembly and annotation of the chloroplast genomes of five photosynthetic parasites from the family Orobanchaceae were employed to investigate plastome dynamics. Four had major structural rearrangements, including gene duplications and gene losses, that differentiated the taxa. The facultative parasite Aureolaria virginica had the most similar genome content to its close non-parasitic relative, Lindenbergia philippensis, with similar genome size and organization, and no differences in gene content. In contrast, the facultative parasite Buchnera americana and three obligate parasites in the genus Striga all had enlargements of their plastomes, primarily caused by expansion within the large inverted repeats (IRs) that are a standard plastome feature. Some of these IR increases were shared by multiple investigated species, but others were unique to particular lineages. Gene deletions and pseudogenization were also both shared and lineage-specific, with particularly frequent and independent loss of the ndh genes involved in electron recycling. CONCLUSIONS Five new plastid genomes were fully assembled and compared. The results indicate that plastome instability is common in parasitic plants, even those that retain the need to perform essential plastid functions like photosynthesis. Gene losses were slow and not identical across taxa, suggesting that different lineages had different uses or needs for some of their plastome gene content, including genes involved in some aspects of photosynthesis. Recent repeat region extensions, some unique to terminal species branches, were observed after the divergence of the Buchnera/Striga clade, suggesting that this otherwise rare event has some special value in this lineage.
Collapse
Affiliation(s)
| | | | - Justin N. Vaughn
- Department of Genetics, University of Georgia, Athens, GA 30677 USA
| | | | | |
Collapse
|
10
|
Vaughn JN, Li Z. Genomic Signatures of North American Soybean Improvement Inform Diversity Enrichment Strategies and Clarify the Impact of Hybridization. G3 (Bethesda) 2016; 6:2693-705. [PMID: 27402364 PMCID: PMC5015928 DOI: 10.1534/g3.116.029215] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/11/2016] [Accepted: 06/14/2016] [Indexed: 11/18/2022]
Abstract
Crop improvement represents a long-running experiment in artificial selection on a complex trait, namely yield. How such selection relates to natural populations is unclear, but the analysis of domesticated populations could offer insights into the relative role of selection, drift, and recombination in all species facing major shifts in selective regimes. Because of the extreme autogamy exhibited by soybean (Glycine max), many "immortalized" genotypes of elite varieties spanning the last century have been preserved and characterized using ∼50,000 single nucleotide polymorphic (SNP) markers. Also due to autogamy, the history of North American soybean breeding can be roughly divided into pre- and posthybridization eras, allowing for direct interrogation of the role of recombination in improvement and selection. Here, we report on genome-wide characterization of the structure and history of North American soybean populations and the signature of selection in these populations. Supporting previous work, we find that maturity defines population structure. Though the diversity of North American ancestors is comparable to available landraces, prehybridization line selections resulted in a clonal structure that dominated early breeding and explains many of the reductions in diversity found in the initial generations of soybean hybridization. The rate of allele frequency change does not deviate sharply from neutral expectation, yet some regions bare hallmarks of strong selection, suggesting a highly variable range of selection strengths biased toward weak effects. We also discuss the importance of haplotypes as units of analysis when complex traits fall under novel selection regimes.
Collapse
Affiliation(s)
- Justin N Vaughn
- Center for Applied Genetic Technologies, University of Georgia, Athens, Georgia 30602 Department of Crop and Soil Science, University of Georgia, Athens, Georgia 30602
| | - Zenglu Li
- Center for Applied Genetic Technologies, University of Georgia, Athens, Georgia 30602 Department of Crop and Soil Science, University of Georgia, Athens, Georgia 30602
| |
Collapse
|
11
|
Shin JH, Vaughn JN, Abdel-Haleem H, Chavarro C, Abernathy B, Kim KD, Jackson SA, Li Z. Transcriptomic changes due to water deficit define a general soybean response and accession-specific pathways for drought avoidance. BMC Plant Biol 2015; 15:26. [PMID: 25644024 PMCID: PMC4322458 DOI: 10.1186/s12870-015-0422-8] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/26/2014] [Accepted: 01/12/2015] [Indexed: 05/04/2023]
Abstract
BACKGROUND Among abiotic stresses, drought is the most common reducer of crop yields. The slow-wilting soybean genotype PI 416937 is somewhat robust to water deficit and has been used previously to map the trait in a bi-parental population. Since drought stress response is a complex biological process, whole genome transcriptome analysis was performed to obtain a deeper understanding of the drought response in soybean. RESULTS Contrasting data from PI 416937 and the cultivar 'Benning', we developed a classification system to identify genes that were either responding to water-deficit in both genotypes or that had a genotype x environment (GxE) response. In spite of very different wilting phenotypes, 90% of classifiable genes had either constant expression in both genotypes (33%) or very similar response profiles (E genes, 57%). By further classifying E genes based on expression profiles, we were able to discern the functional specificity of transcriptional responses at particular stages of water-deficit, noting both the well-known reduction in photosynthesis genes as well as the less understood up-regulation of the protein transport pathway. Two percent of classifiable genes had a well-defined GxE response, many of which are located within slow-wilting QTLs. We consider these strong candidates for possible causal genes underlying PI 416937's unique drought avoidance strategy. CONCLUSIONS There is a general and functionally significant transcriptional response to water deficit that involves not only known pathways, such as down-regulation of photosynthesis, but also up-regulation of protein transport and chromatin remodeling. Genes that show a genotypic difference are more likely to show an environmental response than genes that are constant between genotypes. In this study, at least five genes that clearly exhibited a genotype x environment response fell within known QTL and are very good candidates for further research into slow-wilting.
Collapse
Affiliation(s)
- Jin Hee Shin
- Center for Applied Genetic Technologies & Department of Crop and Soil Science, University of Georgia, Athens, GA, 30602, USA.
| | - Justin N Vaughn
- Center for Applied Genetic Technologies & Department of Crop and Soil Science, University of Georgia, Athens, GA, 30602, USA.
| | - Hussein Abdel-Haleem
- Center for Applied Genetic Technologies & Department of Crop and Soil Science, University of Georgia, Athens, GA, 30602, USA.
| | - Carolina Chavarro
- Center for Applied Genetic Technologies & Department of Crop and Soil Science, University of Georgia, Athens, GA, 30602, USA.
| | - Brian Abernathy
- Center for Applied Genetic Technologies & Department of Crop and Soil Science, University of Georgia, Athens, GA, 30602, USA.
| | - Kyung Do Kim
- Center for Applied Genetic Technologies & Department of Crop and Soil Science, University of Georgia, Athens, GA, 30602, USA.
| | - Scott A Jackson
- Center for Applied Genetic Technologies & Department of Crop and Soil Science, University of Georgia, Athens, GA, 30602, USA.
| | - Zenglu Li
- Center for Applied Genetic Technologies & Department of Crop and Soil Science, University of Georgia, Athens, GA, 30602, USA.
| |
Collapse
|
12
|
Vaughn JN, Chaluvadi SR, Tushar, Rangan L, Bennetzen JL. Whole plastome sequences from five ginger species facilitate marker development and define limits to barcode methodology. PLoS One 2014; 9:e108581. [PMID: 25333869 PMCID: PMC4204815 DOI: 10.1371/journal.pone.0108581] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2014] [Accepted: 08/28/2014] [Indexed: 11/19/2022] Open
Abstract
Plants from the Zingiberaceae family are a key source of spices and herbal medicines. Species identification within this group is critical in the search for known and possibly novel bioactive compounds. To facilitate precise characterization of this group, we have sequenced chloroplast genomes from species representing five major groups within Zingiberaceae. Generally, the structure of these genomes is similar to the basal angiosperm excepting an expansion of 3 kb associated with the inverted repeat A region. Portions of this expansion appear to be shared across the entire Zingiberales order, which includes gingers and bananas. We used whole plastome alignment information to develop DNA barcodes that would maximize the ability to differentiate species within the Zingiberaceae. Our computation pipeline identified regions of high variability that were flanked by highly conserved regions used for primer design. This approach yielded hitherto unexploited regions of variability. These theoretically optimal barcodes were tested on a range of species throughout the family and were found to amplify and differentiate genera and, in some cases, species. Still, though these barcodes were specifically optimized for the Zingiberaceae, our data support the emerging consensus that whole plastome sequences are needed for robust species identification and phylogenetics within this family.
Collapse
Affiliation(s)
- Justin N. Vaughn
- Department of Genetics, University of Georgia, Athens, Georgia, United States of America
| | - Srinivasa R. Chaluvadi
- Department of Genetics, University of Georgia, Athens, Georgia, United States of America
| | - Tushar
- Department of Biotechnology, Indian Institute of Technology Guwahati, Assam, India
| | - Latha Rangan
- Department of Biotechnology, Indian Institute of Technology Guwahati, Assam, India
| | - Jeffrey L. Bennetzen
- Department of Genetics, University of Georgia, Athens, Georgia, United States of America
| |
Collapse
|
13
|
Vaughn JN, Nelson RL, Song Q, Cregan PB, Li Z. The genetic architecture of seed composition in soybean is refined by genome-wide association scans across multiple populations. G3 (Bethesda) 2014; 4:2283-94. [PMID: 25246241 PMCID: PMC4232554 DOI: 10.1534/g3.114.013433] [Citation(s) in RCA: 47] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/24/2014] [Accepted: 09/15/2014] [Indexed: 12/21/2022]
Abstract
Soybean oil and meal are major contributors to world-wide food production. Consequently, the genetic basis for soybean seed composition has been intensely studied using family-based mapping. Population-based mapping approaches, in the form of genome-wide association (GWA) scans, have been able to resolve loci controlling moderately complex quantitative traits (QTL) in numerous crop species. Yet, it is still unclear how soybean's unique population history will affect GWA scans. Using one of the populations in this study, we simulated phenotypes resulting from a range of genetic architectures. We found that with a heritability of 0.5, ∼100% and ∼33% of the 4 and 20 simulated QTL can be recovered, respectively, with a false-positive rate of less than ∼6×10(-5) per marker tested. Additionally, we demonstrated that combining information from multi-locus mixed models and compressed linear-mixed models improves QTL identification and interpretation. We applied these insights to exploring seed composition in soybean, refining the linkage group I (chromosome 20) protein QTL and identifying additional oil QTL that may allow some decoupling of highly correlated oil and protein phenotypes. Because the value of protein meal is closely related to its essential amino acid profile, we attempted to identify QTL underlying methionine, threonine, cysteine, and lysine content. Multiple QTL were found that have not been observed in family-based mapping studies, and each trait exhibited associations across multiple populations. Chromosomes 1 and 8 contain strong candidate alleles for essential amino acid increases. Overall, we present these and additional data that will be useful in determining breeding strategies for the continued improvement of soybean's nutrient portfolio.
Collapse
Affiliation(s)
- Justin N Vaughn
- Center for Applied Genetic Technologies and Department of Crop and Soil Sciences, University of Georgia, Athens, Georgia 30602
| | - Randall L Nelson
- Soybean Maize Germplasm, Pathology, and Genetics Research Unit, USDA, Agricultural Research Service, and Department of Crop Sciences University of Illinois, Urbana, Illinois 61801
| | - Qijian Song
- Soybean Genomics and Improvement Laboratory, USDA, Agricultural Research Service, Beltsville, Maryland 20705
| | - Perry B Cregan
- Soybean Genomics and Improvement Laboratory, USDA, Agricultural Research Service, Beltsville, Maryland 20705
| | - Zenglu Li
- Center for Applied Genetic Technologies and Department of Crop and Soil Sciences, University of Georgia, Athens, Georgia 30602
| |
Collapse
|
14
|
von Arnim AG, Jia Q, Vaughn JN. Regulation of plant translation by upstream open reading frames. Plant Sci 2014; 214:1-12. [PMID: 24268158 DOI: 10.1016/j.plantsci.2013.09.006] [Citation(s) in RCA: 127] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/17/2013] [Revised: 09/08/2013] [Accepted: 09/10/2013] [Indexed: 05/08/2023]
Abstract
We review the evidence that upstream open reading frames (uORFs) function as RNA sequence elements for post-transcriptional control of gene expression, specifically translation. uORFs are highly abundant in the genomes of angiosperms. Their negative effect on translation is often attenuated by ribosomal translation reinitiation, a process whose molecular biochemistry is still being investigated. Certain uORFs render translation responsive to small molecules, thus offering a path for metabolic control of gene expression in evolution and synthetic biology. In some cases, uORFs form modular logic gates in signal transduction. uORFs thus provide eukaryotes with a functionality analogous to, or comparable to, riboswitches and attenuators in prokaryotes. uORFs exist in many genes regulating development and point toward translational control of development. While many uORFs appear to be poorly conserved, and the number of genes with conserved-peptide uORFs is modest, many mRNAs have a conserved pattern of uORFs. Evolutionarily, the gain and loss of uORFs may be a widespread mechanism that diversifies gene expression patterns. Last but not least, this review includes a dedicated uORF database for Arabidopsis.
Collapse
Affiliation(s)
- Albrecht G von Arnim
- Department of Biochemistry, Cellular and Molecular Biology, The University of Tennessee, Knoxville, TN 37996-0840, USA; Graduate School of Genome Science and Technology, The University of Tennessee, Knoxville, TN 37996-0840, USA.
| | | | | |
Collapse
|
15
|
Abstract
The sequence elements that mediate post-transcriptional gene regulation often reside in the 5' and 3' untranslated regions (UTRs) of mRNAs. Using six different families of dicotyledonous plants, we developed a comparative transcriptomics pipeline for the identification and annotation of deeply conserved regulatory sequences in the 5' and 3' UTRs. Our approach was robust to confounding effects of poor UTR alignability and rampant paralogy in plants. In the 3' UTR, motifs resembling PUMILIO-binding sites form a prominent group of conserved motifs. Additionally, Expansins, one of the few plant mRNA families known to be localized to specific subcellular sites, possess a core conserved RCCCGC motif. In the 5' UTR, one major subset of motifs consists of purine-rich repeats. A distinct and substantial fraction possesses upstream AUG start codons. Half of the AUG containing motifs reveal hidden protein-coding potential in the 5' UTR, while the other half point to a peptide-independent function related to translation. Among the former, we added four novel peptides to the small catalog of conserved-peptide uORFs. Among the latter, our case studies document patterns of uORF evolution that include gain and loss of uORFs, switches in uORF reading frame, and switches in uORF length and position. In summary, nearly three hundred post-transcriptional elements show evidence of purifying selection across the eudicot branch of flowering plants, indicating a regulatory function spanning at least 70 million years. Some of these sequences have experimental precedent, but many are novel and encourage further exploration.
Collapse
Affiliation(s)
- Justin N. Vaughn
- Department of Biochemistry, Cellular and Molecular Biology, The University of Tennessee, Knoxville, Tennessee 37996, USA
| | - Sally R. Ellingson
- Graduate School of Genome Science and Technology, The University of Tennessee, Knoxville, Tennessee 37996, USA
| | - Flavio Mignone
- Dipartimento di Chimica Strutturale e Stereochimica Inorganica, Università degli Studi di Milano, 20133 Milano, Italy
| | - Albrecht von Arnim
- Department of Biochemistry, Cellular and Molecular Biology, The University of Tennessee, Knoxville, Tennessee 37996, USA
- Graduate School of Genome Science and Technology, The University of Tennessee, Knoxville, Tennessee 37996, USA
- Corresponding author.E-mail .
| |
Collapse
|
16
|
Roy B, Vaughn JN, Kim BH, Zhou F, Gilchrist MA, Von Arnim AG. The h subunit of eIF3 promotes reinitiation competence during translation of mRNAs harboring upstream open reading frames. RNA 2010; 16:748-61. [PMID: 20179149 PMCID: PMC2844622 DOI: 10.1261/rna.2056010] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
Upstream open reading frames (uORFs) are protein coding elements in the 5' leader of messenger RNAs. uORFs generally inhibit translation of the main ORF because ribosomes that perform translation elongation suffer either permanent or conditional loss of reinitiation competence. After conditional loss, reinitiation competence may be regained by, at the minimum, reacquisition of a fresh methionyl-tRNA. The conserved h subunit of Arabidopsis eukaryotic initiation factor 3 (eIF3) mitigates the inhibitory effects of certain uORFs. Here, we define more precisely how this occurs, by combining gene expression data from mutated 5' leaders of Arabidopsis AtbZip11 (At4g34590) and yeast GCN4 with a computational model of translation initiation in wild-type and eif3h mutant plants. Of the four phylogenetically conserved uORFs in AtbZip11, three are inhibitory to translation, while one is anti-inhibitory. The mutation in eIF3h has no major effect on uORF start codon recognition. Instead, eIF3h supports efficient reinitiation after uORF translation. Modeling suggested that the permanent loss of reinitiation competence during uORF translation occurs at a faster rate in the mutant than in the wild type. Thus, eIF3h ensures that a fraction of uORF-translating ribosomes retain their competence to resume scanning. Experiments using the yeast GCN4 leader provided no evidence that eIF3h fosters tRNA reaquisition. Together, these results attribute a specific molecular function in translation initiation to an individual eIF3 subunit in a multicellular eukaryote.
Collapse
Affiliation(s)
- Bijoyita Roy
- Department of Biochemistry and Cellular and Molecular Biology, The University of Tennessee, Knoxville, Tennessee 37996, USA
| | | | | | | | | | | |
Collapse
|
17
|
Kim BH, Cai X, Vaughn JN, von Arnim AG. On the functions of the h subunit of eukaryotic initiation factor 3 in late stages of translation initiation. Genome Biol 2007; 8:R60. [PMID: 17439654 PMCID: PMC1896003 DOI: 10.1186/gb-2007-8-4-r60] [Citation(s) in RCA: 71] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2006] [Revised: 01/15/2007] [Accepted: 04/17/2007] [Indexed: 12/29/2022] Open
Abstract
Reporter transgene assays and comparative polysome-microarray analysis reveal that the intact h subunit of Arabidopsis eIF3 contributes to efficient translation initiation on mRNA leader sequences harbouring multiple uORFs. Background The eukaryotic translation initiation factor 3 (eIF3) has multiple roles during the initiation of translation of cytoplasmic mRNAs. How individual subunits of eIF3 contribute to the translation of specific mRNAs remains poorly understood, however. This is true in particular for those subunits that are not conserved in budding yeast, such as eIF3h. Results Working with stable reporter transgenes in Arabidopsis thaliana mutants, it was demonstrated that the h subunit of eIF3 contributes to the efficient translation initiation of mRNAs harboring upstream open reading frames (uORFs) in their 5' leader sequence. uORFs, which can function as devices for translational regulation, are present in over 30% of Arabidopsis mRNAs, and are enriched among mRNAs for transcriptional regulators and protein modifying enzymes. Microarray comparisons of polysome loading in wild-type and eif3h mutant seedlings revealed that eIF3h generally helps to maintain efficient polysome loading of mRNAs harboring multiple uORFs. In addition, however, eIF3h also boosted the polysome loading of mRNAs with long leaders or coding sequences. Moreover, the relative polysome loading of certain functional groups of mRNAs, including ribosomal proteins, was actually increased in the eif3h mutant, suggesting that regulons of translational control can be revealed by mutations in generic translation initiation factors. Conclusion The intact eIF3h protein contributes to efficient translation initiation on 5' leader sequences harboring multiple uORFs, although mRNA features independent of uORFs are also implicated.
Collapse
Affiliation(s)
- Byung-Hoon Kim
- Department of Biochemistry, Cellular and Molecular Biology, The University of Tennessee, Knoxville, TN 37996-0840, USA
| | - Xue Cai
- Department of Biochemistry, Cellular and Molecular Biology, The University of Tennessee, Knoxville, TN 37996-0840, USA
- Department of Cell Biology, The University of Oklahoma Health Sciences Center, Stanton L Young Blvd, Oklahoma City, OK 73104, USA
| | - Justin N Vaughn
- Department of Biochemistry, Cellular and Molecular Biology, The University of Tennessee, Knoxville, TN 37996-0840, USA
| | - Albrecht G von Arnim
- Department of Biochemistry, Cellular and Molecular Biology, The University of Tennessee, Knoxville, TN 37996-0840, USA
| |
Collapse
|