1
|
Shu M, Yates TB, John C, Harman-Ware AE, Happs RM, Bryant N, Jawdy SS, Ragauskas AJ, Tuskan GA, Muchero W, Chen JG. Providing biological context for GWAS results using eQTL regulatory and co-expression networks in Populus. THE NEW PHYTOLOGIST 2024. [PMID: 39169686 DOI: 10.1111/nph.20026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/19/2024] [Accepted: 07/16/2024] [Indexed: 08/23/2024]
Abstract
Our study utilized genome-wide association studies (GWAS) to link nucleotide variants to traits in Populus trichocarpa, a species with rapid linkage disequilibrium decay. The aim was to overcome the challenge of interpreting statistical associations at individual loci without sufficient biological context, which often leads to reliance solely on gene annotations from unrelated model organisms. We employed an integrative approach that included GWAS targeting multiple traits using three individual techniques for lignocellulose phenotyping, expression quantitative trait loci (eQTL) analysis to construct transcriptional regulatory networks around each candidate locus and co-expression analysis to provide biological context for these networks, using lignocellulose biosynthesis in Populus trichocarpa as a case study. The research identified three candidate genes potentially involved in lignocellulose formation, including one previously recognized gene (Potri.005G116800/VND1, a critical regulator of secondary cell wall formation) and two genes (Potri.012G130000/AtSAP9 and Potri.004G202900/BIC1) with newly identified putative roles in lignocellulose biosynthesis. Our integrative approach offers a framework for providing biological context to loci associated with trait variation, facilitating the discovery of new genes and regulatory networks.
Collapse
Affiliation(s)
- Mengjun Shu
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, 37831, TN, USA
- Center for Bioenergy Innovation, Oak Ridge National Laboratory, Oak Ridge, 37831, TN, USA
| | - Timothy B Yates
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, 37831, TN, USA
- Center for Bioenergy Innovation, Oak Ridge National Laboratory, Oak Ridge, 37831, TN, USA
| | - Cai John
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, 37831, TN, USA
- Center for Bioenergy Innovation, Oak Ridge National Laboratory, Oak Ridge, 37831, TN, USA
- Department of Chemical and Biomolecular Engineering, University of Tennessee, Knoxville, 37996, TN, USA
| | - Anne E Harman-Ware
- Renewable Resources and Enabling Sciences Center, National Renewable Energy Laboratory, Golden, 80401, CO, USA
| | - Renee M Happs
- Renewable Resources and Enabling Sciences Center, National Renewable Energy Laboratory, Golden, 80401, CO, USA
| | - Nathan Bryant
- Department of Chemical and Biomolecular Engineering, University of Tennessee, Knoxville, 37996, TN, USA
| | - Sara S Jawdy
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, 37831, TN, USA
- Center for Bioenergy Innovation, Oak Ridge National Laboratory, Oak Ridge, 37831, TN, USA
| | - Arthur J Ragauskas
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, 37831, TN, USA
- Center for Bioenergy Innovation, Oak Ridge National Laboratory, Oak Ridge, 37831, TN, USA
- Department of Chemical and Biomolecular Engineering, University of Tennessee, Knoxville, 37996, TN, USA
| | - Gerald A Tuskan
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, 37831, TN, USA
- Center for Bioenergy Innovation, Oak Ridge National Laboratory, Oak Ridge, 37831, TN, USA
| | - Wellington Muchero
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, 37831, TN, USA
- Center for Bioenergy Innovation, Oak Ridge National Laboratory, Oak Ridge, 37831, TN, USA
| | - Jin-Gui Chen
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, 37831, TN, USA
- Center for Bioenergy Innovation, Oak Ridge National Laboratory, Oak Ridge, 37831, TN, USA
| |
Collapse
|
2
|
Happs R, Hanes RJ, Bartling AW, Field JL, Harman-Ware AE, Clark RJ, Pendergast TH, Devos KM, Webb EG, Missaoui A, Xu Y, Makaju S, Shrestha V, Mazarei M, Stewart CN, Millwood RJ, Davison BH. Economic and Sustainability Impacts of Yield and Composition Variation in Bioenergy Crops: Switchgrass ( Panicum virgatum L.). ACS SUSTAINABLE CHEMISTRY & ENGINEERING 2024; 12:1897-1910. [PMID: 38333206 PMCID: PMC10848292 DOI: 10.1021/acssuschemeng.3c05770] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/08/2023] [Revised: 12/20/2023] [Accepted: 12/20/2023] [Indexed: 02/10/2024]
Abstract
Economically viable production of biobased products and fuels requires high-yielding, high-quality, sustainable process-advantaged crops, developed using bioengineering or advanced breeding approaches. Understanding which crop phenotypic traits have the largest impact on biofuel economics and sustainability outcomes is important for the targeted feedstock crop development. Here, we evaluated biomass yield and cell-wall composition traits across a large natural variant population of switchgrass (Panicum virgatum L.) grown across three common garden sites. Samples from 331 switchgrass genotypes were collected and analyzed for carbohydrate and lignin components. Considering plant survival and biomass after multiple years of growth, we found that 84 of the genotypes analyzed may be suited for commercial production in the southeastern U.S. These genotypes show a range of growth and compositional traits across the population that are apparently independent of each other. We used these data to conduct techno-economic analyses and life cycle assessments evaluating the performance of each switchgrass genotype under a standard cellulosic ethanol process model with pretreatment, added enzymes, and fermentation. We find that switchgrass yield per area is the largest economic driver of the minimum fuel selling price (MSFP), ethanol yield per hectare, global warming potential (GWP), and cumulative energy demand (CED). At any yield, the carbohydrate content is significant but of secondary importance. Water use follows similar trends but has more variability due to an increased dependence on the biorefinery model. Analyses presented here highlight the primary importance of plant yield and the secondary importance of carbohydrate content when selecting a feedstock that is both economical and sustainable.
Collapse
Affiliation(s)
- Renee
M. Happs
- Renewable
Resources and Enabling Sciences Center, National Renewable Energy Laboratory, Golden, Colorado 80401, United States
- Center
for Bioenergy Innovation, Oak Ridge National
Laboratory, Oak Ridge, Tennessee 37830, United States
| | - Rebecca J. Hanes
- Strategic
Energy Analysis Center, National Renewable
Energy Laboratory, Golden, Colorado 80401, United States
- Center
for Bioenergy Innovation, Oak Ridge National
Laboratory, Oak Ridge, Tennessee 37830, United States
| | - Andrew W. Bartling
- Catalytic
Carbon and Transformation Center, National
Renewable Energy Laboratory, Golden, Colorado 80401, United States
- Center
for Bioenergy Innovation, Oak Ridge National
Laboratory, Oak Ridge, Tennessee 37830, United States
| | - John L. Field
- Environmental
Sciences Division, Oak Ridge National Laboratory, Oak Ridge, Tennessee 37830, United States
- Center
for Bioenergy Innovation, Oak Ridge National
Laboratory, Oak Ridge, Tennessee 37830, United States
| | - Anne E. Harman-Ware
- Renewable
Resources and Enabling Sciences Center, National Renewable Energy Laboratory, Golden, Colorado 80401, United States
- Center
for Bioenergy Innovation, Oak Ridge National
Laboratory, Oak Ridge, Tennessee 37830, United States
| | - Robin J. Clark
- Environmental
Sciences Division, Oak Ridge National Laboratory, Oak Ridge, Tennessee 37830, United States
- Center
for Bioenergy Innovation, Oak Ridge National
Laboratory, Oak Ridge, Tennessee 37830, United States
| | - Thomas H. Pendergast
- Institute
of Plant Breeding, Genetics and Genomics, University of Georgia, Athens, Georgia 30602, United States
- Department
of Crop and Soil Sciences, University of
Georgia, Athens, Georgia 30602, United States
- Department
of Plant Biology, University of Georgia, Athens, Georgia 30602, United States
- Center
for Bioenergy Innovation, Oak Ridge National
Laboratory, Oak Ridge, Tennessee 37830, United States
| | - Katrien M. Devos
- Institute
of Plant Breeding, Genetics and Genomics, University of Georgia, Athens, Georgia 30602, United States
- Department
of Crop and Soil Sciences, University of
Georgia, Athens, Georgia 30602, United States
- Department
of Plant Biology, University of Georgia, Athens, Georgia 30602, United States
- Center
for Bioenergy Innovation, Oak Ridge National
Laboratory, Oak Ridge, Tennessee 37830, United States
| | - Erin G. Webb
- Environmental
Sciences Division, Oak Ridge National Laboratory, Oak Ridge, Tennessee 37830, United States
- Center
for Bioenergy Innovation, Oak Ridge National
Laboratory, Oak Ridge, Tennessee 37830, United States
| | - Ali Missaoui
- Institute
of Plant Breeding, Genetics and Genomics, University of Georgia, Athens, Georgia 30602, United States
- Department
of Crop and Soil Sciences, University of
Georgia, Athens, Georgia 30602, United States
- Center
for Bioenergy Innovation, Oak Ridge National
Laboratory, Oak Ridge, Tennessee 37830, United States
| | - Yaping Xu
- Department
of Plant Sciences, University of Tennessee
Knoxville, Knoxville, Tennessee 37919, United States
- Center
for Bioenergy Innovation, Oak Ridge National
Laboratory, Oak Ridge, Tennessee 37830, United States
| | - Shiva Makaju
- Institute
of Plant Breeding, Genetics and Genomics, University of Georgia, Athens, Georgia 30602, United States
- Department
of Crop and Soil Sciences, University of
Georgia, Athens, Georgia 30602, United States
- Center
for Bioenergy Innovation, Oak Ridge National
Laboratory, Oak Ridge, Tennessee 37830, United States
| | - Vivek Shrestha
- Department
of Plant Sciences, University of Tennessee
Knoxville, Knoxville, Tennessee 37919, United States
- Center
for Bioenergy Innovation, Oak Ridge National
Laboratory, Oak Ridge, Tennessee 37830, United States
| | - Mitra Mazarei
- Department
of Plant Sciences, University of Tennessee
Knoxville, Knoxville, Tennessee 37919, United States
- Center
for Bioenergy Innovation, Oak Ridge National
Laboratory, Oak Ridge, Tennessee 37830, United States
| | - Charles Neal Stewart
- Department
of Plant Sciences, University of Tennessee
Knoxville, Knoxville, Tennessee 37919, United States
- Center
for Bioenergy Innovation, Oak Ridge National
Laboratory, Oak Ridge, Tennessee 37830, United States
| | - Reginald J. Millwood
- Department
of Plant Sciences, University of Tennessee
Knoxville, Knoxville, Tennessee 37919, United States
- Center
for Bioenergy Innovation, Oak Ridge National
Laboratory, Oak Ridge, Tennessee 37830, United States
| | - Brian H. Davison
- Biosciences
Division, Oak Ridge National Laboratory, Oak Ridge, Tennessee 37830, United States
- Center
for Bioenergy Innovation, Oak Ridge National
Laboratory, Oak Ridge, Tennessee 37830, United States
| |
Collapse
|
3
|
Abeyratne CR, Macaya-Sanz D, Zhou R, Barry KW, Daum C, Haiby K, Lipzen A, Stanton B, Yoshinaga Y, Zane M, Tuskan GA, DiFazio SP. High-resolution mapping reveals hotspots and sex-biased recombination in Populus trichocarpa. G3 (BETHESDA, MD.) 2023; 13:jkac269. [PMID: 36250890 PMCID: PMC9836356 DOI: 10.1093/g3journal/jkac269] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/10/2022] [Accepted: 09/28/2022] [Indexed: 12/14/2022]
Abstract
Fine-scale meiotic recombination is fundamental to the outcome of natural and artificial selection. Here, dense genetic mapping and haplotype reconstruction were used to estimate recombination for a full factorial Populus trichocarpa cross of 7 males and 7 females. Genomes of the resulting 49 full-sib families (N = 829 offspring) were resequenced, and high-fidelity biallelic SNP/INDELs and pedigree information were used to ascertain allelic phase and impute progeny genotypes to recover gametic haplotypes. The 14 parental genetic maps contained 1,820 SNP/INDELs on average that covered 376.7 Mb of physical length across 19 chromosomes. Comparison of parental and progeny haplotypes allowed fine-scale demarcation of cross-over regions, where 38,846 cross-over events in 1,658 gametes were observed. Cross-over events were positively associated with gene density and negatively associated with GC content and long-terminal repeats. One of the most striking findings was higher rates of cross-overs in males in 8 out of 19 chromosomes. Regions with elevated male cross-over rates had lower gene density and GC content than windows showing no sex bias. High-resolution analysis identified 67 candidate cross-over hotspots spread throughout the genome. DNA sequence motifs enriched in these regions showed striking similarity to those of maize, Arabidopsis, and wheat. These findings, and recombination estimates, will be useful for ongoing efforts to accelerate domestication of this and other biomass feedstocks, as well as future studies investigating broader questions related to evolutionary history, perennial development, phenology, wood formation, vegetative propagation, and dioecy that cannot be studied using annual plant model systems.
Collapse
Affiliation(s)
| | - David Macaya-Sanz
- Department of Forest Ecology & Genetics, CIFOR-INIA, CSIC, Madrid 28040, Spain
| | - Ran Zhou
- Warnell School of Forestry and Natural Resources, Department of Genetics, and Department of Plant Biology, University of Georgia, Athens, GA 30602, USA
| | - Kerrie W Barry
- Department of Energy Joint Genome Institute, Berkeley, CA 94720, USA
| | - Christopher Daum
- Department of Energy Joint Genome Institute, Berkeley, CA 94720, USA
| | | | - Anna Lipzen
- Department of Energy Joint Genome Institute, Berkeley, CA 94720, USA
| | | | - Yuko Yoshinaga
- Department of Energy Joint Genome Institute, Berkeley, CA 94720, USA
| | - Matthew Zane
- Department of Energy Joint Genome Institute, Berkeley, CA 94720, USA
| | - Gerald A Tuskan
- Biosciences Division, Center for Bioenergy Innovation, Oak Ridge National Laboratory, Oak Ridge, TN 37830, USA
| | - Stephen P DiFazio
- Department of Biology, West Virginia University, Morgantown, WV 26506, USA
| |
Collapse
|
4
|
Harman-Ware AE, Happs RM, Macaya-Sanz D, Doeppke C, Muchero W, DiFazio SP. Abundance of Major Cell Wall Components in Natural Variants and Pedigrees of Populus trichocarpa. FRONTIERS IN PLANT SCIENCE 2022; 13:757810. [PMID: 35185975 PMCID: PMC8850957 DOI: 10.3389/fpls.2022.757810] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/12/2021] [Accepted: 01/04/2022] [Indexed: 06/14/2023]
Abstract
The rapid analysis of biopolymers including lignin and sugars in lignocellulosic biomass cell walls is essential for the analysis of the large sample populations needed for identifying heritable genetic variation in biomass feedstocks for biofuels and bioproducts. In this study, we reported the analysis of cell wall lignin content, syringyl/guaiacyl (S/G) ratio, as well as glucose and xylose content by high-throughput pyrolysis-molecular beam mass spectrometry (py-MBMS) for >3,600 samples derived from hundreds of accessions of Populus trichocarpa from natural populations, as well as pedigrees constructed from 14 parents (7 × 7). Partial Least Squares (PLS) regression models were built from the samples of known sugar composition previously determined by hydrolysis followed by nuclear magnetic resonance (NMR) analysis. Key spectral features positively correlated with glucose content consisted of m/z 126, 98, and 69, among others, deriving from pyrolyzates such as hydroxymethylfurfural, maltol, and other sugar-derived species. Xylose content positively correlated primarily with many lignin-derived ions and to a lesser degree with m/z 114, deriving from a lactone produced from xylose pyrolysis. Models were capable of predicting glucose and xylose contents with an average error of less than 4%, and accuracy was significantly improved over previously used methods. The differences in the models constructed from the two sample sets varied in training sample number, but the genetic and compositional uniformity of the pedigree set could be a potential driver in the slightly better performance of that model in comparison with the natural variants. Broad-sense heritability of glucose and xylose composition using these data was 0.32 and 0.34, respectively. In summary, we have demonstrated the use of a single high-throughput method to predict sugar and lignin composition in thousands of poplar samples to estimate the heritability and phenotypic plasticity of traits necessary to develop optimized feedstocks for bioenergy applications.
Collapse
Affiliation(s)
- Anne E. Harman-Ware
- Renewable Resources and Enabling Sciences Center, Center for Bioenergy Innovation, National Renewable Energy Laboratory, Golden, CO, United States
| | - Renee M. Happs
- Renewable Resources and Enabling Sciences Center, Center for Bioenergy Innovation, National Renewable Energy Laboratory, Golden, CO, United States
| | - David Macaya-Sanz
- Department of Forest Ecology and Genetics, INIA-CIFOR, Madrid, Spain
- Department of Biology, West Virginia University, Morgantown, WV, United States
| | - Crissa Doeppke
- Renewable Resources and Enabling Sciences Center, Center for Bioenergy Innovation, National Renewable Energy Laboratory, Golden, CO, United States
| | - Wellington Muchero
- Biosciences Division, Center for Bioenergy Innovation, Oak Ridge National Laboratory, Oak Ridge, TN, United States
| | - Stephen P. DiFazio
- Department of Biology, West Virginia University, Morgantown, WV, United States
| |
Collapse
|
5
|
Nag A, Gerritsen A, Doeppke C, Harman-Ware AE. Machine Learning-Based Classification of Lignocellulosic Biomass from Pyrolysis-Molecular Beam Mass Spectrometry Data. Int J Mol Sci 2021; 22:ijms22084107. [PMID: 33921121 PMCID: PMC8071563 DOI: 10.3390/ijms22084107] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2021] [Revised: 04/09/2021] [Accepted: 04/09/2021] [Indexed: 12/04/2022] Open
Abstract
High-throughput analysis of biomass is necessary to ensure consistent and uniform feedstocks for agricultural and bioenergy applications and is needed to inform genomics and systems biology models. Pyrolysis followed by mass spectrometry such as molecular beam mass spectrometry (py-MBMS) analyses are becoming increasingly popular for the rapid analysis of biomass cell wall composition and typically require the use of different data analysis tools depending on the need and application. Here, the authors report the py-MBMS analysis of several types of lignocellulosic biomass to gain an understanding of spectral patterns and variation with associated biomass composition and use machine learning approaches to classify, differentiate, and predict biomass types on the basis of py-MBMS spectra. Py-MBMS spectra were also corrected for instrumental variance using generalized linear modeling (GLM) based on the use of select ions relative abundances as spike-in controls. Machine learning classification algorithms e.g., random forest, k-nearest neighbor, decision tree, Gaussian Naïve Bayes, gradient boosting, and multilayer perceptron classifiers were used. The k-nearest neighbors (k-NN) classifier generally performed the best for classifications using raw spectral data, and the decision tree classifier performed the worst. After normalization of spectra to account for instrumental variance, all the classifiers had comparable and generally acceptable performance for predicting the biomass types, although the k-NN and decision tree classifiers were not as accurate for prediction of specific sample types. Gaussian Naïve Bayes (GNB) and extreme gradient boosting (XGB) classifiers performed better than the k-NN and the decision tree classifiers for the prediction of biomass mixtures. The data analysis workflow reported here could be applied and extended for comparison of biomass samples of varying types, species, phenotypes, and/or genotypes or subjected to different treatments, environments, etc. to further elucidate the sources of spectral variance, patterns, and to infer compositional information based on spectral analysis, particularly for analysis of data without a priori knowledge of the feedstock composition or identity.
Collapse
Affiliation(s)
- Ambarish Nag
- Computational Science Center, National Renewable Energy Laboratory, 15013 Denver West Pkwy, Golden, CO 80401, USA; (A.N.); (A.G.)
| | - Alida Gerritsen
- Computational Science Center, National Renewable Energy Laboratory, 15013 Denver West Pkwy, Golden, CO 80401, USA; (A.N.); (A.G.)
| | - Crissa Doeppke
- Renewable Resources and Enabling Sciences Center, National Renewable Energy Laboratory, 15013 Denver West Pkwy, Golden, CO 80401, USA;
| | - Anne E. Harman-Ware
- Renewable Resources and Enabling Sciences Center, National Renewable Energy Laboratory, 15013 Denver West Pkwy, Golden, CO 80401, USA;
- Correspondence:
| |
Collapse
|