1
Mendoza-Revilla J, Trop E, Gonzalez L, Roller M, Dalla-Torre H, de Almeida BP, Richard G, Caton J, Lopez Carranza N, Skwark M, Laterre A, Beguir K, Pierrot T, Lopez M. A foundational large language model for edible plant genomes. Commun Biol 2024; 7:835. [PMID: 38982288 PMCID: PMC11233511 DOI: 10.1038/s42003-024-06465-2] [Received: 11/21/2023] [Accepted: 06/17/2024] [Indexed: 07/11/2024] Open
Abstract
Significant progress has been made in the field of plant genomics, as demonstrated by the increased use of high-throughput methodologies that enable the characterization of multiple genome-wide molecular phenotypes. These findings have provided valuable insights into plant traits and their underlying genetic mechanisms, particularly in model plant species. Nonetheless, effectively leveraging them to make accurate predictions represents a critical step in crop genomic improvement. We present AgroNT, a foundational large language model trained on genomes from 48 plant species with a predominant focus on crop species. We show that AgroNT can obtain state-of-the-art predictions for regulatory annotations, promoter/terminator strength, tissue-specific gene expression, and prioritize functional variants. We conduct a large-scale in silico saturation mutagenesis analysis on cassava to evaluate the regulatory impact of over 10 million mutations and provide their predicted effects as a resource for variant characterization. Finally, we propose the use of the diverse datasets compiled here as the Plants Genomic Benchmark (PGB), providing a comprehensive benchmark for deep learning-based methods in plant genomic research. The pre-trained AgroNT model is publicly available on HuggingFace at https://huggingface.co/InstaDeepAI/agro-nucleotide-transformer-1b for future research purposes.
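The in silico saturation mutagenesis mentioned in this abstract amounts to enumerating every single-nucleotide substitution of a reference sequence and comparing a score for each mutant against the reference. A minimal Python sketch of that enumeration follows. The `gc_fraction` scorer is a hypothetical toy stand-in for a model prediction and is not AgroNT's interface; the function names are illustrative assumptions.

```python
from typing import Callable, Dict, Iterator, Tuple

BASES = "ACGT"


def saturation_mutants(seq: str) -> Iterator[Tuple[int, str, str, str]]:
    """Yield (position, ref_base, alt_base, mutant_sequence) for every substitution."""
    seq = seq.upper()
    for pos, ref in enumerate(seq):
        for alt in BASES:
            if alt != ref:
                yield pos, ref, alt, seq[:pos] + alt + seq[pos + 1:]


def gc_fraction(seq: str) -> float:
    """Toy scoring function (assumption): fraction of G or C bases."""
    return sum(base in "GC" for base in seq) / len(seq)


def mutation_effects(
    seq: str, score: Callable[[str], float]
) -> Dict[Tuple[int, str, str], float]:
    """Map each (position, ref, alt) to score(mutant) - score(reference)."""
    ref_score = score(seq.upper())
    return {
        (pos, ref, alt): score(mutant) - ref_score
        for pos, ref, alt, mutant in saturation_mutants(seq)
    }
```

A sequence of length n over A/C/G/T yields 3n mutants, so scanning regulatory regions at scale is dominated by the cost of the scoring function rather than the enumeration itself.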

2
Leduque B, Edera A, Vitte C, Quadrana L. Simultaneous profiling of chromatin accessibility and DNA methylation in complete plant genomes using long-read sequencing. Nucleic Acids Res 2024; 52:6285-6297. [PMID: 38676941 PMCID: PMC11194078 DOI: 10.1093/nar/gkae306] [Received: 12/03/2023] [Revised: 03/29/2024] [Accepted: 04/10/2024] [Indexed: 04/29/2024] Open
Abstract
Epigenetic regulations, including chromatin accessibility, nucleosome positioning and DNA methylation intricately shape genome function. However, current chromatin profiling techniques relying on short-read sequencing technologies fail to characterise highly repetitive genomic regions and cannot detect multiple chromatin features simultaneously. Here, we performed Simultaneous Accessibility and DNA Methylation Sequencing (SAM-seq) of purified plant nuclei. Thanks to the use of long-read nanopore sequencing, SAM-seq enables high-resolution profiling of m6A-tagged chromatin accessibility together with endogenous cytosine methylation in plants. Analysis of naked genomic DNA revealed significant sequence preference biases of m6A-MTases, controllable through a normalisation step. By applying SAM-seq to Arabidopsis and maize nuclei we obtained fine-grained accessibility and DNA methylation landscapes genome-wide. We uncovered crosstalk between chromatin accessibility and DNA methylation within nucleosomes of genes, TEs, and centromeric repeats. SAM-seq also detects DNA footprints over cis-regulatory regions. Furthermore, using the single-molecule information provided by SAM-seq we identified extensive cellular heterogeneity at chromatin domains with antagonistic chromatin marks, suggesting that bivalency reflects cell-specific regulations. SAM-seq is a powerful approach to simultaneously study multiple epigenetic features over unique and repetitive sequences, opening new opportunities for the investigation of epigenetic mechanisms.
Affiliation(s)
- Basile Leduque
  - Institute of Plant Sciences Paris-Saclay, Centre Nationale de la Recherche Scientifique, Institute National de la Recherche Agronomique, Université Evry, Université Paris-Saclay, Orsay, France
- Alejandro Edera
  - Institute of Plant Sciences Paris-Saclay, Centre Nationale de la Recherche Scientifique, Institute National de la Recherche Agronomique, Université Evry, Université Paris-Saclay, Orsay, France
- Clémentine Vitte
  - Université Paris-Saclay, INRAE, CNRS, AgroParisTech, GQE – Le Moulon, Gif-sur-Yvette, France
- Leandro Quadrana
  - Institute of Plant Sciences Paris-Saclay, Centre Nationale de la Recherche Scientifique, Institute National de la Recherche Agronomique, Université Evry, Université Paris-Saclay, Orsay, France

3
Galli M, Chen Z, Ghandour T, Chaudhry A, Gregory J, Li M, Zhang X, Dong Y, Song G, Walley JW, Chuck G, Whipple C, Kaeppler HF, Huang SSC, Gallavotti A. Transcription factor binding site divergence across maize inbred lines drives transcriptional and phenotypic variation. bioRxiv: The Preprint Server for Biology 2024:2024.05.31.596834. [PMID: 38895211 PMCID: PMC11185568 DOI: 10.1101/2024.05.31.596834] [Indexed: 06/21/2024]
Abstract
Regulatory elements are important constituents of plant genomes that have shaped ancient and modern crops. Their identification, function, and diversity in crop genomes however are poorly characterized, thus limiting our ability to harness their power for further agricultural advances using induced or natural variation. Here, we use DNA affinity purification-sequencing (DAP-seq) to map transcription factor (TF) binding events for 200 maize TFs belonging to 30 distinct families and heterodimer pairs in two distinct inbred lines historically used for maize hybrid plant production, providing empirical binding site annotation for 5.3% of the maize genome. TF binding site comparison in B73 and Mo17 inbreds reveals widespread differences, driven largely by structural variation, that correlate with gene expression changes. TF binding site presence-absence variation helps clarify complex QTL such as vgt1, an important determinant of maize flowering time, and DICE, a distal enhancer involved in herbivore resistance. Modification of TF binding regions via CRISPR-Cas9 mediated editing alters target gene expression and phenotype. Our functional catalog of maize TF binding events enables collective and comparative TF binding analysis, and highlights its value for agricultural improvement.
Affiliation(s)
- Mary Galli
  - Waksman Institute of Microbiology, Rutgers University, Piscataway, NJ, 08854-8020, USA
- Zongliang Chen
  - Waksman Institute of Microbiology, Rutgers University, Piscataway, NJ, 08854-8020, USA
- Tara Ghandour
  - Center for Genomics and Systems Biology, Department of Biology, New York University, New York, NY 10003, USA
- Amina Chaudhry
  - Waksman Institute of Microbiology, Rutgers University, Piscataway, NJ, 08854-8020, USA
- Jason Gregory
  - Waksman Institute of Microbiology, Rutgers University, Piscataway, NJ, 08854-8020, USA
- Miaomiao Li
  - Center for Genomics and Systems Biology, Department of Biology, New York University, New York, NY 10003, USA
- Xuan Zhang
  - Department of Genetics, University of Georgia, Athens, GA, USA
- Yinxin Dong
  - Department of Genetics, University of Georgia, Athens, GA, USA
- Gaoyuan Song
  - Department of Plant Pathology, Entomology, and Microbiology, Iowa State University, Ames, IA, 50011
- Justin W. Walley
  - Department of Plant Pathology, Entomology, and Microbiology, Iowa State University, Ames, IA, 50011
- George Chuck
  - Plant Gene Expression Center, Albany, CA 94710, USA
- Clinton Whipple
  - Department of Biology, Brigham Young University, 4102 LSB, Provo, UT 84602, USA
- Heidi F. Kaeppler
  - Department of Agronomy, University of Wisconsin, Madison, WI, USA
  - Wisconsin Crop Innovation Center, University of Wisconsin, Middleton, WI, USA
- Shao-shan Carol Huang
  - Center for Genomics and Systems Biology, Department of Biology, New York University, New York, NY 10003, USA
- Andrea Gallavotti
  - Waksman Institute of Microbiology, Rutgers University, Piscataway, NJ, 08854-8020, USA
  - Department of Plant Biology, Rutgers University, New Brunswick, NJ, 08901, USA

4
Hsieh JWA, Lin PY, Wang CT, Lee YJ, Chang P, Lu RJH, Chen PY, Wang CJR. Establishing an optimized ATAC-seq protocol for the maize. Frontiers in Plant Science 2024; 15:1370618. [PMID: 38863553 PMCID: PMC11165127 DOI: 10.3389/fpls.2024.1370618] [Received: 01/15/2024] [Accepted: 05/07/2024] [Indexed: 06/13/2024]
Abstract
The advent of next-generation sequencing in crop improvement offers unprecedented insights into the chromatin landscape closely linked to gene activity governing key traits in plant development and adaptation. Particularly in maize, its dynamic chromatin structure is found to collaborate with massive transcriptional variations across tissues and developmental stages, implying intricate regulatory mechanisms, which highlights the importance of integrating chromatin information into breeding strategies for precise gene controls. The depiction of maize chromatin architecture using Assay for Transposase Accessible Chromatin with high-throughput sequencing (ATAC-seq) provides great opportunities to investigate cis-regulatory elements, which is crucial for crop improvement. In this context, we developed an easy-to-implement ATAC-seq protocol for maize with fewer nuclei and simple equipment. We demonstrate a streamlined ATAC-seq protocol with four key steps for maize in which nuclei purification can be achieved without cell sorting and using only a standard bench-top centrifuge. Our protocol, coupled with the bioinformatic analysis, including validation by read length periodicity, key metrics, and correlation with transcript abundance, provides a precise and efficient assessment of the maize chromatin landscape. Beyond its application to maize, our testing design holds the potential to be applied to other crops or other tissues, especially for those with limited size and amount, establishing a robust foundation for chromatin structure studies in diverse crop species.
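The validation by read length periodicity mentioned in this abstract is commonly done by inspecting the fragment-length distribution, where short nucleosome-free fragments and longer mono-nucleosome fragments form separate peaks. A minimal Python sketch of such a summary follows. The cutoffs (`nfr_max`, `mono_range`) are assumptions chosen for illustration and are not taken from the protocol, which may define its metrics differently.

```python
from collections import Counter
from typing import Dict, Iterable, Tuple


def fragment_length_profile(
    lengths: Iterable[int],
    nfr_max: int = 100,                        # assumed nucleosome-free cutoff (bp)
    mono_range: Tuple[int, int] = (180, 247),  # assumed mono-nucleosome window (bp)
) -> Dict[str, object]:
    """Summarise fragment lengths into a histogram and two size-class fractions."""
    counts = Counter(lengths)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no fragment lengths supplied")
    # Fragments shorter than nfr_max are counted as nucleosome-free.
    nfr = sum(n for length, n in counts.items() if length < nfr_max)
    # Fragments inside mono_range (inclusive) are counted as mono-nucleosomal.
    mono = sum(
        n for length, n in counts.items()
        if mono_range[0] <= length <= mono_range[1]
    )
    return {
        "nfr_fraction": nfr / total,
        "mono_fraction": mono / total,
        "histogram": dict(sorted(counts.items())),
    }
```

In practice the lengths would come from the insert sizes of properly paired alignments; plotting the returned histogram is what makes the periodic pattern visible.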
Affiliation(s)
- Jo-Wei Allison Hsieh
  - Institute of Plant and Microbial Biology, Academia Sinica, Taipei, Taiwan
  - Genome and Systems Biology Degree Program, Academia Sinica and National Taiwan University, Taipei, Taiwan
- Pei-Yu Lin
  - Institute of Plant and Microbial Biology, Academia Sinica, Taipei, Taiwan
- Chi-Ting Wang
  - Institute of Plant and Microbial Biology, Academia Sinica, Taipei, Taiwan
- Yi-Jing Lee
  - Institute of Plant and Microbial Biology, Academia Sinica, Taipei, Taiwan
- Pearl Chang
  - Institute of Plant and Microbial Biology, Academia Sinica, Taipei, Taiwan
  - Department of Tropical Agriculture and International Cooperation/Department of Biological Science and Technology, National Pingtung University of Science and Technology, Pingtung, Taiwan
- Rita Jui-Hsien Lu
  - Institute of Plant and Microbial Biology, Academia Sinica, Taipei, Taiwan
- Pao-Yang Chen
  - Institute of Plant and Microbial Biology, Academia Sinica, Taipei, Taiwan
  - Genome and Systems Biology Degree Program, Academia Sinica and National Taiwan University, Taipei, Taiwan

5
Hu G, Grover CE, Vera DL, Lung PY, Girimurugan SB, Miller ER, Conover JL, Ou S, Xiong X, Zhu D, Li D, Gallagher JP, Udall JA, Sui X, Zhang J, Bass HW, Wendel JF. Evolutionary Dynamics of Chromatin Structure and Duplicate Gene Expression in Diploid and Allopolyploid Cotton. Mol Biol Evol 2024; 41:msae095. [PMID: 38758089 PMCID: PMC11140268 DOI: 10.1093/molbev/msae095] [Received: 11/02/2023] [Revised: 04/10/2024] [Accepted: 05/10/2024] [Indexed: 05/18/2024] Open
Abstract
Polyploidy is a prominent mechanism of plant speciation and adaptation, yet the mechanistic understandings of duplicated gene regulation remain elusive. Chromatin structure dynamics are suggested to govern gene regulatory control. Here, we characterized genome-wide nucleosome organization and chromatin accessibility in allotetraploid cotton, Gossypium hirsutum (AADD, 2n = 4X = 52), relative to its two diploid parents (AA or DD genome) and their synthetic diploid hybrid (AD), using DNS-seq. The larger A-genome exhibited wider average nucleosome spacing in diploids, and this intergenomic difference diminished in the allopolyploid but not hybrid. Allopolyploidization also exhibited increased accessibility at promoters genome-wide and synchronized cis-regulatory motifs between subgenomes. A prominent cis-acting control was inferred for chromatin dynamics and demonstrated by transposable element removal from promoters. Linking accessibility to gene expression patterns, we found distinct regulatory effects for hybridization and later allopolyploid stages, including nuanced establishment of homoeolog expression bias and expression level dominance. Histone gene expression and nucleosome organization are coordinated through chromatin accessibility. Our study demonstrates the capability to track high-resolution chromatin structure dynamics and reveals their role in the evolution of cis-regulatory landscapes and duplicate gene expression in polyploids, illuminating regulatory ties to subgenomic asymmetry and dominance.
Affiliation(s)
- Guanjing Hu
  - State Key Laboratory of Cotton Bio-breeding and Integrated, Chinese Academy of Agricultural Sciences, Institute of Cotton Research, Anyang 455000, China
  - Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Chinese Academy of Agricultural Sciences, Agricultural Genomics Institute at Shenzhen, Shenzhen 518120, China
- Corrinne E Grover
  - Department of Ecology, Evolution and Organismal Biology, Iowa State University, Ames, IA 50011, USA
- Daniel L Vera
  - Department of Biological Science, Florida State University, Tallahassee, FL 32306, USA
- Pei-Yau Lung
  - Department of Statistics, Florida State University, Tallahassee, FL 32306, USA
- Emma R Miller
  - Department of Ecology, Evolution and Organismal Biology, Iowa State University, Ames, IA 50011, USA
- Justin L Conover
  - Department of Ecology, Evolution and Organismal Biology, Iowa State University, Ames, IA 50011, USA
  - Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85721, USA
  - Department of Molecular and Cellular Biology, University of Arizona, Tucson, AZ 85721, USA
- Shujun Ou
  - Department of Molecular Genetics, Ohio State University, Columbus, OH 43210, USA
- Xianpeng Xiong
  - Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Chinese Academy of Agricultural Sciences, Agricultural Genomics Institute at Shenzhen, Shenzhen 518120, China
- De Zhu
  - Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Chinese Academy of Agricultural Sciences, Agricultural Genomics Institute at Shenzhen, Shenzhen 518120, China
- Dongming Li
  - Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Chinese Academy of Agricultural Sciences, Agricultural Genomics Institute at Shenzhen, Shenzhen 518120, China
  - Zhengzhou Research Base, State Key Laboratory of Cotton Biology, School of Agricultural Sciences, Zhengzhou University, Zhengzhou 450000, China
- Joseph P Gallagher
  - Forage Seed and Cereal Research Unit, USDA/Agricultural Research Service, Corvallis, OR 97331, USA
- Joshua A Udall
  - Crop Germplasm Research Unit, USDA/Agricultural Research Service, College Station, TX 77845, USA
- Xin Sui
  - Department of Statistics, Florida State University, Tallahassee, FL 32306, USA
- Jinfeng Zhang
  - Department of Statistics, Florida State University, Tallahassee, FL 32306, USA
- Hank W Bass
  - Department of Biological Science, Florida State University, Tallahassee, FL 32306, USA
- Jonathan F Wendel
  - Department of Ecology, Evolution and Organismal Biology, Iowa State University, Ames, IA 50011, USA

6
Paterson AH, Queitsch C. Genome organization and botanical diversity. The Plant Cell 2024; 36:1186-1204. [PMID: 38382084 PMCID: PMC11062460 DOI: 10.1093/plcell/koae045] [Received: 10/16/2023] [Revised: 02/07/2024] [Accepted: 02/07/2024] [Indexed: 02/23/2024]
Abstract
The rich diversity of angiosperms, both the planet's dominant flora and the cornerstone of agriculture, is integrally intertwined with a distinctive evolutionary history. Here, we explore the interplay between angiosperm genome organization and botanical diversity, empowered by genomic approaches ranging from genetic linkage mapping to analysis of gene regulation. Commonality in the genetic hardware of plants has enabled robust comparative genomics that has provided a broad picture of angiosperm evolution and implicated both general processes and specific elements in contributing to botanical diversity. We argue that the hardware of plant genomes, both in content and in dynamics, has been shaped by selection for rather substantial differences in gene regulation between plants and animals such as maize and human, organisms of comparable genome size and gene number. Their distinctive genome content and dynamics may reflect in part the indeterminate development of plants that puts strikingly different demands on gene regulation than in animals. Repeated polyploidization of plant genomes and multiplication of individual genes together with extensive rearrangement and differential retention provide rich raw material for selection of morphological and/or physiological variations conferring fitness in specific niches, whether natural or artificial. These findings exemplify the burgeoning information available to employ in increasing knowledge of plant biology and in modifying selected plants to better meet human needs.
Affiliation(s)
- Andrew H Paterson
  - Plant Genome Mapping Laboratory, University of Georgia, Athens, GA, USA
- Christine Queitsch
  - Department of Genome Sciences, University of Washington, Seattle, WA, USA

7
Gomez-Cano F, Rodriguez J, Zhou P, Chu YH, Magnusson E, Gomez-Cano L, Krishnan A, Springer NM, de Leon N, Grotewold E. Prioritizing Metabolic Gene Regulators through Multi-Omic Network Integration in Maize. bioRxiv: The Preprint Server for Biology 2024:2024.02.26.582075. [PMID: 38464086 PMCID: PMC10925184 DOI: 10.1101/2024.02.26.582075] [Indexed: 03/12/2024]
Abstract
Elucidating gene regulatory networks (GRNs) is a major area of study within plant systems biology. Phenotypic traits are intricately linked to specific gene expression profiles. These expression patterns arise primarily from regulatory connections between sets of transcription factors (TFs) and their target genes. In this study, we integrated publicly available co-expression networks derived from more than 6,000 RNA-seq samples, 283 protein-DNA interaction assays, and 16 million SNPs used to identify expression quantitative trait loci (eQTL), to construct TF-target networks. In total, we analyzed ~4.6M interactions to generate four distinct types of TF-target networks: co-expression, protein-DNA interaction (PDI), trans-expression quantitative trait loci (trans-eQTL), and cis-eQTL combined with PDIs. To improve the functional annotation of TFs based on their target genes, we implemented three different strategies to integrate these four types of networks. We subsequently evaluated the effectiveness of our method through loss-of-function mutant and random networks. The multi-network integration allowed us to identify transcriptional regulators of hormone-, metabolic- and development-related processes. Finally, using the topological properties of the fully integrated network, we identified potentially functionally redundant TF paralogs. Our findings retrieved functions previously documented for numerous TFs and revealed novel functions that are crucial for informing the design of future experiments. The approach described here lays the foundation for the integration of multi-omic datasets in maize and other plant systems.
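The abstract does not say which three integration strategies were used, so the following Python sketch only illustrates one generic way to combine several TF-target edge lists: count how many networks support each edge and keep edges above a threshold. The function names, the `min_support` parameter, and the network labels in the usage note are assumptions for illustration, not the authors' method.

```python
from collections import Counter
from typing import Dict, Iterable, Set, Tuple

Edge = Tuple[str, str]  # (transcription factor, target gene)


def edge_support(networks: Dict[str, Iterable[Edge]]) -> Counter:
    """Count, for each TF-target edge, how many networks contain it."""
    support: Counter = Counter()
    for edges in networks.values():
        # De-duplicate within a network so each network votes at most once.
        support.update(set(edges))
    return support


def integrate(networks: Dict[str, Iterable[Edge]], min_support: int = 1) -> Set[Edge]:
    """Keep edges found in at least `min_support` networks.

    min_support=1 gives the union; min_support=len(networks) gives the intersection.
    """
    support = edge_support(networks)
    return {edge for edge, n in support.items() if n >= min_support}
```

With four input networks keyed, for example, as `coexpression`, `pdi`, `trans_eqtl`, and `cis_eqtl_pdi`, raising `min_support` trades coverage for confidence in each retained TF-target pair.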
Affiliation(s)
- Fabio Gomez-Cano
  - Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI 48824-6473, USA
  - Current address: Department of Molecular, Cellular, and Development Biology, University of Michigan, Ann Arbor, MI, 48109, USA
- Jonas Rodriguez
  - Department of Plant and Agroecosystem Sciences, University of Wisconsin Madison, Madison, WI 53706, USA
- Peng Zhou
  - Department of Plant and Microbial Biology, University of Minnesota, Saint Paul, MN 55108
- Yi-Hsuan Chu
  - Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI 48824-6473, USA
- Erika Magnusson
  - Department of Plant and Microbial Biology, University of Minnesota, Saint Paul, MN 55108
- Lina Gomez-Cano
  - Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI 48824-6473, USA
- Arjun Krishnan
  - Department of Biomedical Informatics, University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA
- Nathan M Springer
  - Department of Plant and Microbial Biology, University of Minnesota, Saint Paul, MN 55108
  - Current address: Global Breeding, Bayer Crop Sciences, Chesterfield MO 63017, USA
- Natalia de Leon
  - Department of Plant and Agroecosystem Sciences, University of Wisconsin Madison, Madison, WI 53706, USA
- Erich Grotewold
  - Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI 48824-6473, USA

8
Zhou Y, Kathiresan N, Yu Z, Rivera LF, Yang Y, Thimma M, Manickam K, Chebotarov D, Mauleon R, Chougule K, Wei S, Gao T, Green CD, Zuccolo A, Xie W, Ware D, Zhang J, McNally KL, Wing RA. A high-performance computational workflow to accelerate GATK SNP detection across a 25-genome dataset. BMC Biol 2024; 22:13. [PMID: 38273258 PMCID: PMC10809545 DOI: 10.1186/s12915-024-01820-5] [Received: 08/10/2023] [Accepted: 01/09/2024] [Indexed: 01/27/2024] Open
Abstract
BACKGROUND: Single-nucleotide polymorphisms (SNPs) are the most widely used form of molecular genetic variation studies. As reference genomes and resequencing data sets expand exponentially, tools must be in place to call SNPs at a similar pace. The genome analysis toolkit (GATK) is one of the most widely used SNP calling software tools publicly available, but unfortunately, high-performance computing versions of this tool have yet to become widely available and affordable.
RESULTS: Here we report an open-source high-performance computing genome variant calling workflow (HPC-GVCW) for GATK that can run on multiple computing platforms from supercomputers to desktop machines. We benchmarked HPC-GVCW on multiple crop species for performance and accuracy, with results comparable to previously published reports (using GATK alone). Finally, we used HPC-GVCW in production mode to call SNPs on a "subpopulation aware" 16-genome rice reference panel with ~3000 resequenced rice accessions. The entire process took ~16 weeks and resulted in the identification of an average of 27.3 M SNPs/genome and the discovery of ~2.3 million novel SNPs that were not present in the flagship reference genome for rice (i.e., IRGSP RefSeq).
CONCLUSIONS: This study developed an open-source pipeline (HPC-GVCW) to run GATK on HPC platforms, which significantly improved the speed at which SNPs can be called. The workflow is widely applicable as demonstrated successfully for four major crop species with genomes ranging in size from 400 Mb to 2.4 Gb. Using HPC-GVCW in production mode to call SNPs on a 25 multi-crop-reference genome data set produced over 1.1 billion SNPs that were publicly released for functional and breeding studies. For rice, many novel SNPs were identified and were found to reside within genes and open chromatin regions that are predicted to have functional consequences. Combined, our results demonstrate the usefulness of combining a high-performance SNP calling architecture solution with a subpopulation-aware reference genome panel for rapid SNP discovery and public deployment.
Affiliation(s)
- Yong Zhou
  - Center for Desert Agriculture (CDA), Biological and Environmental Sciences & Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
  - Arizona Genomics Institute (AGI), School of Plant Sciences, University of Arizona, Tucson, AZ, 85721, USA
- Nagarajan Kathiresan
  - KAUST Supercomputing Laboratory (KSL), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
- Zhichao Yu
  - Center for Desert Agriculture (CDA), Biological and Environmental Sciences & Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
  - National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, 430070, China
- Luis F Rivera
  - Center for Desert Agriculture (CDA), Biological and Environmental Sciences & Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
- Yujian Yang
  - National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, 430070, China
- Manjula Thimma
  - Center for Desert Agriculture (CDA), Biological and Environmental Sciences & Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
- Keerthana Manickam
  - Center for Desert Agriculture (CDA), Biological and Environmental Sciences & Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
- Dmytro Chebotarov
  - International Rice Research Institute (IRRI), Los Baños, Laguna, 4031, Philippines
- Ramil Mauleon
  - International Rice Research Institute (IRRI), Los Baños, Laguna, 4031, Philippines
- Kapeel Chougule
  - Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11724, USA
- Sharon Wei
  - Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11724, USA
- Tingting Gao
  - National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, 430070, China
- Carl D Green
  - Information Technology Department, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
- Andrea Zuccolo
  - Center for Desert Agriculture (CDA), Biological and Environmental Sciences & Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
  - Crop Science Research Center (CSRC), Scuola Superiore Sant'Anna, Pisa, 56127, Italy
- Weibo Xie
  - National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, 430070, China
- Doreen Ware
  - Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11724, USA
  - USDA ARS NEA Plant, Soil & Nutrition Laboratory Research Unit, Ithaca, NY, 14853, USA
- Jianwei Zhang
  - National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, 430070, China
- Kenneth L McNally
  - International Rice Research Institute (IRRI), Los Baños, Laguna, 4031, Philippines
- Rod A Wing
  - Center for Desert Agriculture (CDA), Biological and Environmental Sciences & Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
  - Arizona Genomics Institute (AGI), School of Plant Sciences, University of Arizona, Tucson, AZ, 85721, USA
  - International Rice Research Institute (IRRI), Los Baños, Laguna, 4031, Philippines

9
Minow MAA, Marand AP, Schmitz RJ. Leveraging Single-Cell Populations to Uncover the Genetic Basis of Complex Traits. Annu Rev Genet 2023; 57:297-319. [PMID: 37562412 PMCID: PMC10775913 DOI: 10.1146/annurev-genet-022123-110824] [Indexed: 08/12/2023]
Abstract
The ease and throughput of single-cell genomics have steadily improved, and its current trajectory suggests that surveying single-cell populations will become routine. We discuss the merger of quantitative genetics with single-cell genomics and emphasize how this synergizes with advantages intrinsic to plants. Single-cell population genomics provides increased detection resolution when mapping variants that control molecular traits, including gene expression or chromatin accessibility. Additionally, single-cell population genomics reveals the cell types in which variants act and, when combined with organism-level phenotype measurements, unveils which cellular contexts impact higher-order traits. Emerging technologies, notably multiomics, can facilitate the measurement of both genetic changes and genomic traits in single cells, enabling single-cell genetic experiments. The implementation of single-cell genetics will advance the investigation of the genetic architecture of complex molecular traits and provide new experimental paradigms to study eukaryotic genetics.
Affiliation(s)
- Mark A A Minow
  - Department of Genetics, University of Georgia, Athens, Georgia, USA
- Robert J Schmitz
  - Department of Genetics, University of Georgia, Athens, Georgia, USA

10
Li M, Feng Y, Han Q, Yang Y, Shi Y, Zheng D, Zhang W. Genomic variations combined with epigenetic modifications rewire open chromatin in rice. Plant Physiology 2023; 193:1880-1896. [PMID: 37539937 DOI: 10.1093/plphys/kiad440] [Received: 03/24/2023] [Revised: 07/14/2023] [Accepted: 07/19/2023] [Indexed: 08/05/2023]
Abstract
Cis-regulatory elements (CREs) fine-tune gene transcription in eukaryotes. CREs with sequence variations play vital roles in driving plant or crop domestication. However, how global sequence and structural variations (SVs) are responsible for multilevel changes between indica and japonica rice (Oryza sativa) is still not fully elucidated. To address this, we conducted multiomic studies using MNase hypersensitivity sequencing (MH-seq) in combination with RNA sequencing (RNA-seq), chromatin immunoprecipitation sequencing (ChIP-seq), and bisulfite sequencing (BS-seq) between the japonica rice variety Nipponbare (NIP) and indica rice variety 93-11. We found that differential MNase hypersensitive sites (MHSs) exhibited some distinct intrinsic genomic sequence features between NIP and 93-11. Notably, through MHS-genome-wide association studies (GWAS) integration, we found that key sequence variations may be associated with differences of agronomic traits between NIP and 93-11, which is partly achieved by MHSs harboring CREs. In addition, SV-derived differential MHSs caused by transposable element (TE) insertion, especially by noncommon TEs among rice varieties, were associated with genes with distinct functions, indicating that TE-driven gene neo- or subfunctionalization is mediated by changes of chromatin openness. This study thus provides insights into how sequence and genomic SVs control agronomic traits of NIP and 93-11; it also provides genome-editing targets for molecular breeding aiming at improving favorable agronomic properties.
Affiliation(s)
- Mengqi Li
- State Key Laboratory of Crop Genetics and Germplasm Enhancement and Utilization, CIC-MCP, Nanjing Agricultural University, No.1 Weigang, Nanjing, Jiangsu 210095, China
| | - Yilong Feng
- State Key Laboratory of Crop Genetics and Germplasm Enhancement and Utilization, CIC-MCP, Nanjing Agricultural University, No.1 Weigang, Nanjing, Jiangsu 210095, China
| | - Qi Han
- State Key Laboratory of Crop Genetics and Germplasm Enhancement and Utilization, CIC-MCP, Nanjing Agricultural University, No.1 Weigang, Nanjing, Jiangsu 210095, China
| | - Ying Yang
- State Key Laboratory of Crop Genetics and Germplasm Enhancement and Utilization, CIC-MCP, Nanjing Agricultural University, No.1 Weigang, Nanjing, Jiangsu 210095, China
| | - Yining Shi
- State Key Laboratory of Crop Genetics and Germplasm Enhancement and Utilization, CIC-MCP, Nanjing Agricultural University, No.1 Weigang, Nanjing, Jiangsu 210095, China
| | - Dongyang Zheng
- State Key Laboratory of Crop Genetics and Germplasm Enhancement and Utilization, CIC-MCP, Nanjing Agricultural University, No.1 Weigang, Nanjing, Jiangsu 210095, China
| | - Wenli Zhang
- State Key Laboratory of Crop Genetics and Germplasm Enhancement and Utilization, CIC-MCP, Nanjing Agricultural University, No.1 Weigang, Nanjing, Jiangsu 210095, China
|
11
|
Shu G, Wang A, Wang X, Ding J, Chen R, Gao F, Wang A, Li T, Wang Y. Identification of southern corn rust resistance QTNs in Chinese summer maize germplasm via multi-locus GWAS and post-GWAS analysis. FRONTIERS IN PLANT SCIENCE 2023; 14:1221395. [PMID: 37810381 PMCID: PMC10552154 DOI: 10.3389/fpls.2023.1221395] [Received: 05/12/2023] [Accepted: 08/15/2023] [Indexed: 10/10/2023]
Abstract
Southern corn rust (SCR), caused by Puccinia polysora Underw., is a major disease leading to severe yield losses in the China Summer Corn Belt. Using six multi-locus GWAS methods, we identified a set of SCR resistance QTNs from a diversity panel of 140 inbred lines collected from the China Summer Corn Belt. Thirteen QTNs on chromosomes 1, 2, 4, 5, 6, and 8 were grouped into three types of allele effects, and their associations with SCR phenotypes were verified by post-GWAS case-control sampling and allele/haplotype effect analysis. Relative resistance (RRR) and relative susceptibility (RRs) catering to its inbred carrier were estimated from single QTNs and QTN-QTN combos, and epistatic effects were estimated for QTN-QTN combos. By transcriptomic annotation, a set of candidate genes was predicted to be involved in transcriptional regulation (S5_145, Zm00001d01613, transcription factor GTE4), phosphorylation (S8_123, Zm00001d010672, Pgk2- phosphoglycerate kinase 2), and temperature stress response (S6_164a/S6_164b, Zm00001d038806, hsp101, and S5_211, Zm00001d017978, cellulase25). The breeding implications of these findings are discussed.
Affiliation(s)
- Guoping Shu
- Center of Biotechnology, Beijing Lantron Seed, LongPing High-tech Corp., Zhengzhou, Henan, China
| | - Aifang Wang
- Center of Biotechnology, Beijing Lantron Seed, LongPing High-tech Corp., Zhengzhou, Henan, China
| | - Xingchuan Wang
- Henan LongPing-Lantron AgriScience & Technology Co., LTD, Zhengzhou, LongPing High-tech Corp., Zhengzhou, Henan, China
| | - Junqiang Ding
- College of Agronomy, Henan Agricultural University, Zhengzhou, Henan, China
| | - Ruijie Chen
- Henan LongPing-Lantron AgriScience & Technology Co., LTD, Zhengzhou, LongPing High-tech Corp., Zhengzhou, Henan, China
| | - Fei Gao
- Henan LongPing-Lantron AgriScience & Technology Co., LTD, Zhengzhou, LongPing High-tech Corp., Zhengzhou, Henan, China
| | - Aifen Wang
- Henan LongPing-Lantron AgriScience & Technology Co., LTD, Zhengzhou, LongPing High-tech Corp., Zhengzhou, Henan, China
| | - Ting Li
- Center of Biotechnology, Beijing Lantron Seed, LongPing High-tech Corp., Zhengzhou, Henan, China
| | - Yibo Wang
- Henan LongPing-Lantron AgriScience & Technology Co., LTD, Zhengzhou, LongPing High-tech Corp., Zhengzhou, Henan, China
|
12
|
Zhang J, Xiong H, Burguener GF, Vasquez-Gross H, Liu Q, Debernardi JM, Akhunova A, Garland-Campbell K, Kianian SF, Brown-Guedira G, Pozniak C, Faris JD, Akhunov E, Dubcovsky J. Sequencing 4.3 million mutations in wheat promoters to understand and modify gene expression. Proc Natl Acad Sci U S A 2023; 120:e2306494120. [PMID: 37703281 PMCID: PMC10515147 DOI: 10.1073/pnas.2306494120] [Received: 04/24/2023] [Accepted: 08/03/2023] [Indexed: 09/15/2023] Open
Abstract
Wheat is an important contributor to global food security, and further improvements are required to feed a growing human population. Functional genetics and genomics tools can help us to understand the function of different genes and to engineer beneficial changes. In this study, we used a promoter capture assay to sequence 2-kb regions upstream of all high-confidence annotated genes from 1,513 mutagenized plants from the tetraploid wheat variety Kronos. We identified 4.3 million induced mutations with an accuracy of 99.8%, resulting in a mutation density of 41.9 mutations per kb. We also remapped Kronos exome capture reads to Chinese Spring RefSeq v1.1, identified 4.7 million mutations, and predicted their effects on annotated genes. Using these predictions, we identified 59% more nonsynonymous substitutions and 49% more truncation mutations than in the original study. To show the biological value of the promoter dataset, we selected two mutations within the promoter of the VRN-A1 vernalization gene. Both mutations, located within transcription factor binding sites, significantly altered VRN-A1 expression, and one reduced the number of spikelets per spike. These publicly available sequenced mutant datasets provide rapid and inexpensive access to induced variation in the promoters and coding regions of most wheat genes. These mutations can be used to understand and modulate gene expression and phenotypes for both basic and commercial applications, where limited governmental regulations can facilitate deployment. These mutant collections, together with gene editing, provide valuable tools to accelerate functional genetic studies in this economically important crop.
Affiliation(s)
- Junli Zhang
- Department of Plant Sciences, University of California, Davis, CA 95616
| | - Hongchun Xiong
- Department of Plant Sciences, University of California, Davis, CA 95616
- Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Germán F. Burguener
- Department of Plant Sciences, University of California, Davis, CA 95616
- HHMI, Chevy Chase, MD 20815
| | - Hans Vasquez-Gross
- Department of Plant Sciences, University of California, Davis, CA 95616
- Nevada Bioinformatics Center, University of Nevada, Reno, NV 89557
| | - Qiujie Liu
- Department of Plant Sciences, University of California, Davis, CA 95616
- HHMI, Chevy Chase, MD 20815
| | - Juan M. Debernardi
- Department of Plant Sciences, University of California, Davis, CA 95616
- HHMI, Chevy Chase, MD 20815
| | - Alina Akhunova
- Department of Plant Pathology, Kansas State University, Manhattan, KS 66506
| | - Kimberly Garland-Campbell
- United States Department of Agriculture - Agricultural Research Service, Wheat Health, Genetics and Quality Research Unit, Pullman, WA 99164
| | - Shahryar F. Kianian
- United States Department of Agriculture - Agricultural Research Service, Cereal Disease Laboratory, Saint Paul, MN 55108-6086
| | - Gina Brown-Guedira
- United States Department of Agriculture - Agricultural Research Service, Plant Science Research Unit, Raleigh, NC 27695
| | - Curtis Pozniak
- Crop Development Centre, University of Saskatchewan, Saskatoon S7N 5A8, Canada
| | - Justin D. Faris
- United States Department of Agriculture - Agricultural Research Service, Cereal Crops Research Unit, Northern Crop Science Laboratory, Fargo, ND 58102
| | - Eduard Akhunov
- Department of Plant Pathology, Kansas State University, Manhattan, KS 66506
| | - Jorge Dubcovsky
- Department of Plant Sciences, University of California, Davis, CA 95616
- HHMI, Chevy Chase, MD 20815
|
13
|
Xu C, Song LY, Zhou Y, Ma DN, Ding QS, Guo ZJ, Li J, Song SW, Zhang LD, Zheng HL. Integration of eQTL and GWAS analysis uncovers a genetic regulation of natural ionomic variation in Arabidopsis. PLANT CELL REPORTS 2023; 42:1473-1485. [PMID: 37516984 DOI: 10.1007/s00299-023-03042-5] [Received: 12/27/2022] [Accepted: 06/12/2023] [Indexed: 08/01/2023]
Abstract
KEY MESSAGE This study provided important insights into the genetic architecture of variations in the A. thaliana leaf ionome in a cell-type-specific manner. The functional interpretation of trait-associated variants by expression quantitative trait loci (eQTL) analysis is usually performed in bulk tissue samples. However, the regulation of gene expression is context-dependent, for example cell-type-specific. In this study, we estimated cell-type abundances from 728 bulk tissue samples using a single-cell RNA-sequencing dataset, and performed cis-eQTL mapping to identify cell-type-interaction eQTLs (cis-eQTLs(ci)) in A. thaliana. We also performed genome-wide association study (GWAS) analyses for 999 accessions to identify the genetic basis of variations in the A. thaliana leaf ionome. As a result, a total of 5,664 unique eQTL genes and 15,038 unique cis-eQTLs(ci) were significant. The majority (62.83%) of cis-eQTLs(ci) were cell-type-specific eQTLs. Using colocalization, we uncovered one gene of interest, AT2G25590, in phloem cells, encoding a plant Tudor-like protein with possible chromatin-associated functions, which colocalized with the most significant cis-eQTL(ci) of a Mo-related locus (Chr2:10,908,806:A:C; P = 3.27 × 10^-27). Furthermore, we prioritized eight target genes associated with AT2G25590, which were previously reported to regulate the concentration of Mo in A. thaliana. This study revealed the genetic regulation of ionomic variations and provided a foundation for further studies on the molecular mechanisms by which genetic variants control the A. thaliana ionome.
Affiliation(s)
- Chaoqun Xu
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, 361104, China
| | - Ling-Yu Song
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, 361104, China
| | - Ying Zhou
- School of Medicine, National Institute for Data Science in Health and Medicine, Xiamen University, Xiamen, 361102, China
| | - Dong-Na Ma
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, 361104, China
- National Engineering Research Center of Cereal Fermentation and Food Biomanufacturing, School of Food Science and Technology, Jiangnan University, Wuxi, 214122, Jiangsu, China
| | - Qian-Su Ding
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, 361104, China
| | - Ze-Jun Guo
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, 361104, China
| | - Jing Li
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, 361104, China
| | - Shi-Wei Song
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, 361104, China
| | - Lu-Dan Zhang
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, 361104, China
| | - Hai-Lei Zheng
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, 361104, China.
|
14
|
Shi L, Su J, Cho MJ, Song H, Dong X, Liang Y, Zhang Z. Promoter editing for the genetic improvement of crops. JOURNAL OF EXPERIMENTAL BOTANY 2023; 74:4349-4366. [PMID: 37204916 DOI: 10.1093/jxb/erad175] [Received: 03/02/2023] [Accepted: 05/06/2023] [Indexed: 05/21/2023]
Abstract
Gene expression plays a fundamental role in the regulation of agronomically important traits in crop plants. The genetic manipulation of plant promoters through genome editing has emerged as an effective strategy to create favorable traits in crops by altering the expression pattern of the pertinent genes. Promoter editing can be applied in a directed manner, where nucleotide sequences associated with favorable traits are precisely generated. Alternatively, promoter editing can also be exploited as a random mutagenic approach to generate novel genetic variations within a designated promoter, from which elite alleles are selected based on their phenotypic effects. Pioneering studies have demonstrated the potential of promoter editing in engineering agronomically important traits as well as in mining novel promoter alleles valuable for plant breeding. In this review, we provide an update on the application of promoter editing in crops for increased yield, enhanced tolerance to biotic and abiotic stresses, and improved quality. We also discuss several remaining technical bottlenecks and how this strategy may be better employed for the genetic improvement of crops in the future.
Affiliation(s)
- Lu Shi
- Jiangsu Key Laboratory for Food Quality and Safety-State Key Laboratory Cultivation Base, Ministry of Science and Technology, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, China
| | - Jing Su
- State Key Laboratory of Crop Genetics & Germplasm Enhancement and Utilization, Province and Ministry Co-sponsored Collaborative Innovation Center for Modern Crop Production, Jiangsu Engineering Research Center for Plant Genome Editing, Nanjing Agricultural University, Nanjing 210095, China
| | - Myeong-Je Cho
- Innovative Genomics Institute, University of California, Berkeley, CA 94704, USA
| | - Hao Song
- State Key Laboratory of Crop Genetics & Germplasm Enhancement and Utilization, Province and Ministry Co-sponsored Collaborative Innovation Center for Modern Crop Production, Jiangsu Engineering Research Center for Plant Genome Editing, Nanjing Agricultural University, Nanjing 210095, China
| | - Xiaoou Dong
- State Key Laboratory of Crop Genetics & Germplasm Enhancement and Utilization, Province and Ministry Co-sponsored Collaborative Innovation Center for Modern Crop Production, Jiangsu Engineering Research Center for Plant Genome Editing, Nanjing Agricultural University, Nanjing 210095, China
- Hainan Yazhou Bay Seed Laboratory, Sanya 572025, China
- Zhongshan Biological Breeding Laboratory, No. 50 Zhongling Street, Nanjing, Jiangsu 210014, China
| | - Ying Liang
- Jiangsu Key Laboratory for Food Quality and Safety-State Key Laboratory Cultivation Base, Ministry of Science and Technology, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, China
| | - Zhiyong Zhang
- Jiangsu Key Laboratory for Food Quality and Safety-State Key Laboratory Cultivation Base, Ministry of Science and Technology, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, China
|
15
|
Ouyang K, Liang Q, Miao L, Zhang Z, Li Z. Genome-wide mapping of DNase I hypersensitive sites in pineapple leaves. Front Genet 2023; 14:1086554. [PMID: 37470036 PMCID: PMC10352800 DOI: 10.3389/fgene.2023.1086554] [Received: 11/01/2022] [Accepted: 06/21/2023] [Indexed: 07/21/2023] Open
Abstract
Pineapple [Ananas comosus (L.) Merr.] is the most economically important crop possessing crassulacean acid metabolism (CAM) photosynthesis, which achieves higher water use efficiency through nocturnal opening and diurnal closure of stomata. To provide novel insights into the diel regulatory landscape in pineapple leaves, we performed genome-wide mapping of DNase I hypersensitive sites (DHSs) in pineapple leaves at night (2 a.m.) and day (10 a.m.) using a simplified DNase-seq method. As a result, a total of 33,340 and 28,753 DHSs were found in green-tip tissue, and 29,597 and 40,068 were identified in white-base tissue, at 2 a.m. and 10 a.m., respectively. We observed that the majority of pineapple genes were occupied by fewer than two DHSs with lengths shorter than 1 kb, and the promoter DHSs tended to lie close to the transcription start site (>77% of promoter DHSs within 1 kb). In addition, more intergenic DHSs were identified around transcription factors or transcription co-regulators (TFs/TCs) than around other functional genes, indicating complex regulatory contexts around TFs/TCs. Through combined analysis of tissue-preferential DHSs and genes, we found 839 and 888 coordinately changed genes in green-tip tissue at 2 a.m. and 10 a.m. (AcG2 and AcG10), respectively. Furthermore, AcG2-specific, AcG10-specific, and commonly accessible DHSs were dissected from the total photosynthetic-preferential DHSs, and the regulatory networks indicated that genes preferentially expressed in photosynthetic tissues are dynamically regulated through multiple cis-regulatory elements. Interestingly, binding motifs of several cycling TFs were identified in the DHSs of key CAM genes, revealing circadian regulation of the coordinated diurnal expression of CAM genes. Our results provide a chromatin regulatory landscape in pineapple leaves during the day and night. This will provide important information to assist with deciphering the circadian regulation of CAM photosynthesis.
Affiliation(s)
- Kai Ouyang
- Key Laboratory of Genetics, Breeding and Multiple Utilization of Crops, Ministry of Education, Key Laboratory of Biological Breeding for Fujian and Taiwan Crops, Ministry of Agriculture and Rural Affairs, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Center for Genomics and Biotechnology, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Qifu Liang
- Fujian Key Laboratory of Agro-Products Quality and Safety, Institute of Quality Standards and Testing Technology for Agro-Products, Fujian Academy of Agricultural Sciences, Fuzhou, Fujian, China
| | - Li Miao
- Key Laboratory of Genetics, Breeding and Multiple Utilization of Crops, Ministry of Education, Key Laboratory of Biological Breeding for Fujian and Taiwan Crops, Ministry of Agriculture and Rural Affairs, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Center for Genomics and Biotechnology, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Zhiliang Zhang
- Key Laboratory of Genetics, Breeding and Multiple Utilization of Crops, Ministry of Education, Key Laboratory of Biological Breeding for Fujian and Taiwan Crops, Ministry of Agriculture and Rural Affairs, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Center for Genomics and Biotechnology, Fujian Agriculture and Forestry University, Fuzhou, China
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Innovative Academy for Seed Design, Chinese Academy of Sciences, Beijing, China
| | - Zhanjie Li
- Key Laboratory of Genetics, Breeding and Multiple Utilization of Crops, Ministry of Education, Key Laboratory of Biological Breeding for Fujian and Taiwan Crops, Ministry of Agriculture and Rural Affairs, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Center for Genomics and Biotechnology, Fujian Agriculture and Forestry University, Fuzhou, China
|
16
|
Debernardi JM, Burguener G, Bubb K, Liu Q, Queitsch C, Dubcovsky J. Optimization of ATAC-seq in wheat seedling roots using INTACT-isolated nuclei. BMC PLANT BIOLOGY 2023; 23:270. [PMID: 37211599 DOI: 10.1186/s12870-023-04281-0] [Received: 09/13/2022] [Accepted: 05/12/2023] [Indexed: 05/23/2023]
Abstract
BACKGROUND The genetic information contained in the genome of an organism is organized in genes and regulatory elements that control gene expression. The genomes of multiple plant species have already been sequenced and their gene repertoires have been annotated; however, cis-regulatory elements remain less characterized, limiting our understanding of genome functionality. These elements act as open platforms for recruiting both positive- and negative-acting transcription factors, and as such, chromatin accessibility is an important signature for their identification. RESULTS In this work we developed a transgenic INTACT [isolation of nuclei tagged in specific cell types] system in tetraploid wheat for nuclei purification. Then, we combined the INTACT system with the assay for transposase-accessible chromatin with sequencing [ATAC-seq] to identify open chromatin regions in wheat root tip samples. Our ATAC-seq results showed a large enrichment of open chromatin regions in intergenic and promoter regions, which is expected for regulatory elements and is similar to ATAC-seq results obtained in other plant species. In addition, root ATAC-seq peaks showed a significant overlap with previously published ATAC-seq data from wheat leaf protoplasts, indicating high reproducibility between the two experiments and a large overlap between open chromatin regions in root and leaf tissues. Importantly, we observed overlap between ATAC-seq peaks and cis-regulatory elements that have been functionally validated in wheat, and a good correlation between normalized accessibility and gene expression levels. CONCLUSIONS We have developed and validated an INTACT system in tetraploid wheat that allows rapid and high-quality nuclei purification from root tips. Those nuclei were successfully used to perform ATAC-seq experiments that revealed open chromatin regions in the wheat genome that will be useful for identifying cis-regulatory elements. The INTACT system presented here will facilitate the development of ATAC-seq datasets in other tissues, growth stages, and under different growing conditions to generate a more complete landscape of the accessible DNA regions in the wheat genome.
Affiliation(s)
- Juan M Debernardi
- University of California, Davis, CA, 95616, USA.
- Howard Hughes Medical Institute, Chevy Chase, MD, 20815, USA.
| | - German Burguener
- University of California, Davis, CA, 95616, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, 20815, USA
| | - Kerry Bubb
- Dept. of Biology, University of Washington, Seattle, WA, 98195, USA
| | - Qiujie Liu
- University of California, Davis, CA, 95616, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, 20815, USA
| | | | - Jorge Dubcovsky
- University of California, Davis, CA, 95616, USA.
- Howard Hughes Medical Institute, Chevy Chase, MD, 20815, USA.
|
17
|
Hartwig T, Banf M, Prietsch GP, Zhu JY, Mora-Ramírez I, Schippers JHM, Snodgrass SJ, Seetharam AS, Huettel B, Kolkman JM, Yang J, Engelhorn J, Wang ZY. Hybrid allele-specific ChIP-seq analysis identifies variation in brassinosteroid-responsive transcription factor binding linked to traits in maize. Genome Biol 2023; 24:108. [PMID: 37158941 PMCID: PMC10165856 DOI: 10.1186/s13059-023-02909-w] [Received: 02/06/2022] [Accepted: 03/23/2023] [Indexed: 05/10/2023] Open
Abstract
BACKGROUND Genetic variation in regulatory sequences that alter transcription factor (TF) binding is a major cause of phenotypic diversity. Brassinosteroid is a growth hormone that has major effects on plant phenotypes. Genetic variation in brassinosteroid-responsive cis-elements likely contributes to trait variation. Pinpointing such regulatory variations and quantitative genomic analysis of the variation in TF-target binding, however, remains challenging. How variation in transcriptional targets of signaling pathways such as the brassinosteroid pathway contributes to phenotypic variation is an important question to be investigated with innovative approaches. RESULTS Here, we use a hybrid allele-specific chromatin binding sequencing (HASCh-seq) approach and identify variations in target binding of the brassinosteroid-responsive TF ZmBZR1 in maize. HASCh-seq in the B73xMo17 F1s identifies thousands of target genes of ZmBZR1. Allele-specific ZmBZR1 binding (ASB) has been observed for 18.3% of target genes and is enriched in promoter and enhancer regions. About a quarter of the ASB sites correlate with sequence variation in BZR1-binding motifs and another quarter correlate with haplotype-specific DNA methylation, suggesting that both genetic and epigenetic variations contribute to the high level of variation in ZmBZR1 occupancy. Comparison with GWAS data shows linkage of hundreds of ASB loci to important yield and disease-related traits. CONCLUSION Our study provides a robust method for analyzing genome-wide variations of TF occupancy and identifies genetic and epigenetic variations of the brassinosteroid response transcription network in maize.
Affiliation(s)
- Thomas Hartwig
- Department of Plant Biology, Carnegie Institution for Science, 260 Panama Street, Stanford, CA, 94305, USA.
- Heinrich-Heine University, Universitätsstraße 1, Düsseldorf, NRW, 40225, Germany.
- Max Planck Institute for Plant Breeding Research, Carl-von-Linné-Weg 10, Cologne, NRW, 50829, Germany.
| | - Michael Banf
- Department of Plant Biology, Carnegie Institution for Science, 260 Panama Street, Stanford, CA, 94305, USA
| | - Gisele Passaia Prietsch
- Department of Plant Biology, Carnegie Institution for Science, 260 Panama Street, Stanford, CA, 94305, USA
| | - Jia-Ying Zhu
- Leibniz-Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Corrensstraße 3, Seeland, SA, 06466, Germany
| | - Isabel Mora-Ramírez
- Leibniz-Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Corrensstraße 3, Seeland, SA, 06466, Germany
| | - Jos H M Schippers
- Leibniz-Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Corrensstraße 3, Seeland, SA, 06466, Germany
| | - Samantha J Snodgrass
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, 339A Bessey Hall, Ames, IA, 50011, USA
| | - Arun S Seetharam
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, 339A Bessey Hall, Ames, IA, 50011, USA
| | - Bruno Huettel
- Max Planck Institute for Plant Breeding Research, Carl-von-Linné-Weg 10, Cologne, NRW, 50829, Germany
| | - Judith M Kolkman
- School of Integrative Plant Science, Plant Pathology and Plant-Microbe Biology Section, Cornell University, 413 Bradfield Hall, Ithaca, NY, 14853, USA
| | - Jinliang Yang
- Department of Agronomy and Horticulture, University of Nebraska-Lincoln, 363 Keim Hall, Lincoln, NE, 68583, USA
| | - Julia Engelhorn
- Heinrich-Heine University, Universitätsstraße 1, Düsseldorf, NRW, 40225, Germany
- Max Planck Institute for Plant Breeding Research, Carl-von-Linné-Weg 10, Cologne, NRW, 50829, Germany
| | - Zhi-Yong Wang
- Department of Plant Biology, Carnegie Institution for Science, 260 Panama Street, Stanford, CA, 94305, USA.
|
18
|
Sun G, Yu H, Wang P, Lopez-Guerrero M, Mural RV, Mizero ON, Grzybowski M, Song B, van Dijk K, Schachtman DP, Zhang C, Schnable JC. A role for heritable transcriptomic variation in maize adaptation to temperate environments. Genome Biol 2023; 24:55. [PMID: 36964601 PMCID: PMC10037803 DOI: 10.1186/s13059-023-02891-3] [Received: 01/28/2022] [Accepted: 03/06/2023] [Indexed: 03/26/2023] Open
Abstract
Background Transcription bridges genetic information and phenotypes. Here, we evaluated how changes in transcriptional regulation enable maize (Zea mays), a crop originally domesticated in the tropics, to adapt to temperate environments. Result We generated 572 unique RNA-seq datasets from the roots of 340 maize genotypes. Genes involved in core processes such as cell division, chromosome organization and cytoskeleton organization showed lower heritability of gene expression, while genes involved in anti-oxidation activity exhibited higher expression heritability. An expression genome-wide association study (eGWAS) identified 19,602 expression quantitative trait loci (eQTLs) associated with the expression of 11,444 genes. A GWAS for alternative splicing identified 49,897 splicing QTLs (sQTLs) for 7614 genes. Genes harboring both cis-eQTLs and cis-sQTLs in linkage disequilibrium were disproportionately likely to encode transcription factors or were annotated as responding to one or more stresses. Independent component analysis of gene expression data identified loci regulating co-expression modules involved in oxidation reduction, response to water deprivation, plastid biogenesis, protein biogenesis, and plant-pathogen interaction. Several genes involved in cell proliferation, flower development, DNA replication, and gene silencing showed lower gene expression variation explained by genetic factors between temperate and tropical maize lines. A GWAS of 27 previously published phenotypes identified several candidate genes overlapping with genomic intervals showing signatures of selection during adaptation to temperate environments. Conclusion Our results illustrate how maize transcriptional regulatory networks enable changes in transcriptional regulation to adapt to temperate regions. Supplementary information The online version contains supplementary material available at 10.1186/s13059-023-02891-3.
Affiliation(s)
- Guangchao Sun
  - Quantitative Life Sciences Initiative, University of Nebraska-Lincoln, Lincoln, USA (grid.24434.35; ISNI 0000 0004 1937 0060)
  - Center for Plant Science Innovation, University of Nebraska-Lincoln, Lincoln, USA (grid.24434.35; ISNI 0000 0004 1937 0060)
  - Department of Agronomy and Horticulture, University of Nebraska-Lincoln, Lincoln, USA (grid.24434.35; ISNI 0000 0004 1937 0060)
- Huihui Yu
  - Center for Plant Science Innovation, University of Nebraska-Lincoln, Lincoln, USA (grid.24434.35; ISNI 0000 0004 1937 0060)
  - School of Biological Sciences, University of Nebraska-Lincoln, Lincoln, USA (grid.24434.35; ISNI 0000 0004 1937 0060)
- Peng Wang
  - Department of Agronomy and Horticulture, University of Nebraska-Lincoln, Lincoln, USA (grid.24434.35; ISNI 0000 0004 1937 0060)
- Martha Lopez-Guerrero
  - Department of Biochemistry, University of Nebraska-Lincoln, Lincoln, USA (grid.24434.35; ISNI 0000 0004 1937 0060)
- Ravi V. Mural
  - Quantitative Life Sciences Initiative, University of Nebraska-Lincoln, Lincoln, USA (grid.24434.35; ISNI 0000 0004 1937 0060)
  - Center for Plant Science Innovation, University of Nebraska-Lincoln, Lincoln, USA (grid.24434.35; ISNI 0000 0004 1937 0060)
  - Department of Agronomy and Horticulture, University of Nebraska-Lincoln, Lincoln, USA (grid.24434.35; ISNI 0000 0004 1937 0060)
- Olivier N. Mizero
  - Quantitative Life Sciences Initiative, University of Nebraska-Lincoln, Lincoln, USA (grid.24434.35; ISNI 0000 0004 1937 0060)
  - Center for Plant Science Innovation, University of Nebraska-Lincoln, Lincoln, USA (grid.24434.35; ISNI 0000 0004 1937 0060)
  - Department of Agronomy and Horticulture, University of Nebraska-Lincoln, Lincoln, USA (grid.24434.35; ISNI 0000 0004 1937 0060)
- Marcin Grzybowski
  - Quantitative Life Sciences Initiative, University of Nebraska-Lincoln, Lincoln, USA (grid.24434.35; ISNI 0000 0004 1937 0060)
  - Center for Plant Science Innovation, University of Nebraska-Lincoln, Lincoln, USA (grid.24434.35; ISNI 0000 0004 1937 0060)
  - Department of Agronomy and Horticulture, University of Nebraska-Lincoln, Lincoln, USA (grid.24434.35; ISNI 0000 0004 1937 0060)
- Baoxing Song
  - Institute for Genomic Diversity, Cornell University, Ithaca, USA (grid.5386.8; ISNI 0000 0004 1936 877X)
- Karin van Dijk
  - Department of Biochemistry, University of Nebraska-Lincoln, Lincoln, USA (grid.24434.35; ISNI 0000 0004 1937 0060)
- Daniel P. Schachtman
  - Center for Plant Science Innovation, University of Nebraska-Lincoln, Lincoln, USA (grid.24434.35; ISNI 0000 0004 1937 0060)
  - Department of Agronomy and Horticulture, University of Nebraska-Lincoln, Lincoln, USA (grid.24434.35; ISNI 0000 0004 1937 0060)
- Chi Zhang
  - Center for Plant Science Innovation, University of Nebraska-Lincoln, Lincoln, USA (grid.24434.35; ISNI 0000 0004 1937 0060)
  - School of Biological Sciences, University of Nebraska-Lincoln, Lincoln, USA (grid.24434.35; ISNI 0000 0004 1937 0060)
- James C. Schnable
  - Quantitative Life Sciences Initiative, University of Nebraska-Lincoln, Lincoln, USA (grid.24434.35; ISNI 0000 0004 1937 0060)
  - Center for Plant Science Innovation, University of Nebraska-Lincoln, Lincoln, USA (grid.24434.35; ISNI 0000 0004 1937 0060)
  - Department of Agronomy and Horticulture, University of Nebraska-Lincoln, Lincoln, USA (grid.24434.35; ISNI 0000 0004 1937 0060)
|
19
|
Zhang J, Debernardi JM, Burguener GF, Choulet F, Paux E, O'Connor L, Enk J, Dubcovsky J. A second-generation capture panel for cost-effective sequencing of genome regulatory regions in wheat and relatives. THE PLANT GENOME 2023; 16:e20296. [PMID: 36484157 DOI: 10.1002/tpg2.20296] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/14/2022] [Accepted: 11/11/2022] [Indexed: 05/10/2023]
Abstract
As genome resources for wheat (Triticum L.) expand at a rapid pace, it is important to update targeted sequencing tools to incorporate improved sequence assemblies and regions of previously unknown significance. Here, we developed an updated regulatory region enrichment capture for wheat and other Triticeae species. The core target space includes sequences from 2-Kbp upstream of each gene predicted in the Chinese Spring wheat genome (IWGSC RefSeq Annotation v1.0) and regions of open chromatin identified with an assay for transposase-accessible chromatin using sequencing from wheat leaf and root samples. To improve specificity, we aggressively filtered candidate repetitive sequences using a combination of nucleotide basic local alignment search tool (BLASTN) searches to the Triticeae Repetitive Sequence Database (TREP), identification of regions with read over-coverage from previous target enrichment experiments, and k-mer frequency analyses. The final design comprises 216.5 Mbp of predicted hybridization space in hexaploid wheat and showed increased specificity and coverage of targeted sequences relative to previous protocols. Test captures on hexaploid and tetraploid wheat and other diploid cereals show that the assay has broad potential utility for cost-effective promoter and open chromatin resequencing and general-purpose genotyping of various Triticeae species.
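The core target definition in this abstract (2 kbp upstream of each annotated gene) can be sketched in a few lines. The following Python is a minimal illustration, not the authors' design pipeline: the `Gene` record, the `upstream_window` helper, and the 1-based inclusive coordinate convention are assumptions made for the example, and the repeat-filtering and open-chromatin steps described above are not modeled.

```python
from typing import NamedTuple, Optional, Tuple


class Gene(NamedTuple):
    """Hypothetical gene record using 1-based, inclusive coordinates."""
    chrom: str
    start: int
    end: int
    strand: str  # "+" or "-"


def upstream_window(gene: Gene, flank: int = 2000,
                    chrom_len: Optional[int] = None) -> Optional[Tuple[str, int, int]]:
    """Return the strand-aware interval of `flank` bp upstream of a gene.

    For "+" genes the window precedes `start`; for "-" genes it follows `end`.
    The window is clipped to the chromosome; None is returned if nothing remains.
    """
    if gene.strand == "+":
        lo, hi = gene.start - flank, gene.start - 1
    elif gene.strand == "-":
        lo, hi = gene.end + 1, gene.end + flank
    else:
        raise ValueError(f"unknown strand: {gene.strand!r}")
    lo = max(lo, 1)
    if chrom_len is not None:
        hi = min(hi, chrom_len)
    if lo > hi:
        return None
    return (gene.chrom, lo, hi)


if __name__ == "__main__":
    print(upstream_window(Gene("chr1A", 5001, 8000, "+")))  # ('chr1A', 3001, 5000)
    print(upstream_window(Gene("chr1A", 5001, 8000, "-")))  # ('chr1A', 8001, 10000)
```

Handling the strand explicitly matters because "upstream" lies at higher coordinates for genes on the reverse strand; a real design would read gene models from an annotation file rather than hand-built records.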
Affiliation(s)
- Junli Zhang
  - Dep. of Plant Sciences, Univ. of California, Davis, CA, 95616, USA
- Jacob Enk
  - Daicel Arbor Biosciences, Ann Arbor, MI, 48103, USA
- Jorge Dubcovsky
  - Dep. of Plant Sciences, Univ. of California, Davis, CA, 95616, USA
  - Howard Hughes Medical Institute, Chevy Chase, MD, 20815, USA
|
20
|
Pei H, Li Y, Liu Y, Liu P, Zhang J, Ren X, Lu Z. Chromatin accessibility landscapes revealed the subgenome-divergent regulation networks during wheat grain development. ABIOTECH 2023; 4:8-19. [PMID: 37220536 PMCID: PMC10199822 DOI: 10.1007/s42994-023-00095-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/30/2022] [Accepted: 01/22/2023] [Indexed: 05/25/2023]
Abstract
Development of wheat (Triticum aestivum L.) grain mainly depends on the processes of starch synthesis and storage protein accumulation, which are critical for grain yield and quality. However, the regulatory network underlying the transcriptional and physiological changes of grain development is still not clear. Here, we combined ATAC-seq and RNA-seq to discover the chromatin accessibility and gene expression dynamics during these processes. We found that chromatin accessibility changes are tightly associated with differential gene expression, and that the proportion of distal accessible chromatin regions (ACRs) increased gradually during grain development. Specific transcription factor (TF) binding sites were enriched at different stages and were diversified among the 3 subgenomes. We further predicted the potential interactions between key TFs and genes related to starch and storage protein biosynthesis and found that different copies of some key TFs played diversified roles. Overall, our findings provide numerous resources and illustrate the regulatory network during wheat grain development, which should shed light on the improvement of wheat yield and quality. Supplementary Information: The online version contains supplementary material available at 10.1007/s42994-023-00095-8.
Affiliation(s)
- Hongcui Pei
  - Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100081 China
- Yushan Li
  - Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100081 China
- Yanhong Liu
  - Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100081 China
- Pan Liu
  - Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100081 China
- Jialin Zhang
  - Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100081 China
- Xueni Ren
  - Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100081 China
- Zefu Lu
  - Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100081 China
|
21
|
Zhan W, Cui L, Guo G, Zhang Y. Genome-wide identification and functional analysis of the TCP gene family in rye (Secale cereale L.). Gene X 2023; 854:147104. [PMID: 36509294 DOI: 10.1016/j.gene.2022.147104] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 10/29/2022] [Revised: 11/20/2022] [Accepted: 12/05/2022] [Indexed: 12/13/2022]
Abstract
TEOSINTE BRANCHED1/CYCLOIDEA/PCF (TCP) proteins are plant-specific transcription factors that play significant roles in plant growth, development, and stress response. Rye is a high-value crop with strong resistance to adverse environments. However, the functions of TCP proteins in rye are rarely reported. Based on a genome-wide analysis, the present study identified 26 TCP genes (ScTCPs) in rye. Mapping showed an uneven distribution of the ScTCP genes on the seven rye chromosomes and detected three pairs of tandemly duplicated genes. Phylogenetic analysis divided these genes into PCF (Proliferating Cell Factors), CIN (CINCINNATA), and CYC (CYCLOIDEA)/TB1 (Teosinte Branched1) classes, which showed the highest homology between rye and wheat genes. Analysis of miRNA targeting sites identified five ScTCP genes as potential targets of miRNA319. Analysis of promoter cis-acting elements indicated that ScTCPs were regulated by light signals. Further analysis of the gene expression patterns and functional annotations suggested a role for a few ScTCPs in grain development and stress response. In addition, two TB1 homologous genes (ScTCP9 and ScTCP10) were identified in the ScTCP family. Synteny analysis showed that TB1 orthologous gene pairs existed before the ancestral divergence. Finally, yeast two-hybrid and luciferase complementation imaging assays showed that ScTCP9, localized in the nucleus, interacts with ScFT (Flowering locus T), indicating a role in regulating flowering time. Taken together, this comprehensive study of ScTCPs provides important information for further research on gene function and crop improvement.
Affiliation(s)
- Weimin Zhan
  - College of Agronomy, Henan Agricultural University, Zhengzhou 450002, China
- Lianhua Cui
  - College of Agronomy, Henan Agricultural University, Zhengzhou 450002, China
- Guanghui Guo
  - State Key Laboratory of Crop Stress Adaptation and Improvement, College of Agriculture, Henan University, Kaifeng 475004, China
- Yanpei Zhang
  - College of Agronomy, Henan Agricultural University, Zhengzhou 450002, China
|
22
|
Padgitt-Cobb LK, Pitra NJ, Matthews PD, Henning JA, Hendrix DA. An improved assembly of the "Cascade" hop (Humulus lupulus) genome uncovers signatures of molecular evolution and refines time of divergence estimates for the Cannabaceae family. HORTICULTURE RESEARCH 2023; 10:uhac281. [PMID: 36818366 PMCID: PMC9930403 DOI: 10.1093/hr/uhac281] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 05/17/2022] [Revised: 12/22/2022] [Indexed: 06/16/2023]
Abstract
We present a chromosome-level assembly of the Cascade hop (Humulus lupulus L. var. lupulus) genome. The hop genome is large (2.8 Gb) and complex, and early attempts at assembly were fragmented. Recent advances have made assembly of the hop genome more tractable, transforming the extent of investigation that can occur. The chromosome-level assembly of Cascade was developed by scaffolding the previously reported Cascade assembly generated with PacBio long-read sequencing and polishing with Illumina short-read DNA sequencing. We developed gene models and repeat annotations and used a controlled bi-parental mapping population to identify significant sex-associated markers. We assessed molecular evolution in gene sequences, gene family expansion and contraction, and time of divergence from Cannabis sativa and other closely related plant species using Bayesian inference. We identified the putative sex chromosome in the female genome based on significant sex-associated markers from the bi-parental mapping population. While the estimate of repeat content (~64%) is similar to the estimate for the hemp genome, syntenic blocks in hop contain a greater percentage of LTRs. Hop is enriched for disease resistance-associated genes in syntenic gene blocks and expanded gene families. The Cascade chromosome-level assembly will inform cultivation strategies and serve to deepen our understanding of the hop genomic landscape, benefiting hop researchers and the Cannabaceae genomics community.
Affiliation(s)
- Lillian K Padgitt-Cobb
  - Department of Biochemistry and Biophysics, Oregon State University, Corvallis, Oregon, USA
- Nicholi J Pitra
  - Department of Research and Development, Hopsteiner, S.S. Steiner, Inc., 1 West Washington Avenue, Yakima, Washington 98903, USA
- Paul D Matthews
  - Department of Research and Development, Hopsteiner, S.S. Steiner, Inc., 1 West Washington Avenue, Yakima, Washington 98903, USA
|
23
|
Lasky JR, Josephs EB, Morris GP. Genotype-environment associations to reveal the molecular basis of environmental adaptation. THE PLANT CELL 2023; 35:125-138. [PMID: 36005926 PMCID: PMC9806588 DOI: 10.1093/plcell/koac267] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 05/12/2022] [Accepted: 08/23/2022] [Indexed: 06/14/2023]
Abstract
A fundamental goal in plant biology is to identify and understand the variation underlying plants' adaptation to their environment. Climate change has given new urgency to this goal, as society aims to accelerate adaptation of ecologically important plant species, endangered plant species, and crops to hotter, less predictable climates. In the pre-genomic era, identifying adaptive alleles was painstaking work, leveraging genetics, molecular biology, physiology, and ecology. Now, the rise of genomics and new computational approaches may facilitate this research. Genotype-environment associations (GEAs) use statistical associations between allele frequency and environment of origin to test the hypothesis that allelic variation at a given gene is adapted to local environments. Researchers may scan the genome for GEAs to generate hypotheses on adaptive genetic variants (environmental genome-wide association studies). Despite the rapid adoption of these methods, many important questions remain about the interpretation of GEA findings, which arise from fundamental unanswered questions on the genetic architecture of adaptation and limitations inherent to association-based analyses. We outline strategies to ground GEAs in the underlying hypotheses of genetic architecture and better test GEA-generated hypotheses using genetics and ecophysiology. We provide recommendations for new users who seek to learn about the molecular basis of adaptation. When combined with a rigorous hypothesis testing framework, GEAs may facilitate our understanding of the molecular basis of climate adaptation for plant improvement.
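The basic idea behind a genotype-environment association, a statistical link between allele frequency and environment of origin, can be illustrated with a toy calculation. The sketch below is a deliberately naive example and not a method from the review: it computes a plain Pearson correlation between per-population allele frequencies and one environmental variable, using made-up values. It ignores population structure, which real GEA analyses typically have to account for, and the `pearson_r` helper and all data are hypothetical.

```python
import math
from typing import Sequence


def pearson_r(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation coefficient between two equal-length samples."""
    n = len(x)
    if n != len(y) or n < 3:
        raise ValueError("need two samples of equal length with at least 3 points")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant sample")
    return sxy / math.sqrt(sxx * syy)


if __name__ == "__main__":
    # Hypothetical data: frequency of one allele in five populations and the
    # mean annual temperature (degrees C) at each population's site of origin.
    allele_freq = [0.10, 0.20, 0.40, 0.70, 0.90]
    temperature = [5.0, 8.0, 12.0, 18.0, 22.0]
    r = pearson_r(allele_freq, temperature)
    print(f"allele frequency vs. temperature: r = {r:.3f}")
```

A strong correlation at a single locus is only a starting hypothesis; as the abstract stresses, such signals need follow-up testing with genetics and ecophysiology.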
Affiliation(s)
- Jesse R Lasky
  - Department of Biology, Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Emily B Josephs
  - Department of Plant Biology; Ecology, Evolution, and Behavior Program, Michigan State University, East Lansing, Michigan 48824, USA
- Geoffrey P Morris
  - Department of Soil and Crop Sciences; Cell and Molecular Biology Program, Colorado State University, Fort Collins, Colorado 80526, USA
|
24
|
Zhang A, Li X, Zhao H, Jiang J, Zhang W. Genome-Wide Identification of Open Chromatin in Plants Using MH-Seq. Methods Mol Biol 2023; 2594:29-43. [PMID: 36264486 DOI: 10.1007/978-1-0716-2815-7_3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/16/2023]
Abstract
Functional cis-regulatory elements (CREs) act as precise transcriptional switches for fine-tuning gene transcription. Identification of CREs is critical for understanding regulatory mechanisms of gene expression associated with various biological processes in eukaryotes. It is well known that CREs reside in open chromatin that exhibits hypersensitivity to enzyme cleavage and physical shearing. Currently, high-throughput methodologies, such as DNase-seq, ATAC-seq, and FAIRE-seq, have been widely applied in mapping open chromatin in various eukaryotic genomes. More recently, differential MNase (micrococcal nuclease) treatment has been successfully employed to map open chromatin in addition to profiling the nucleosome landscape in both mammalian and plant species. We have developed an MNase hypersensitivity sequencing (MH-seq) technique in plants. The MH-seq procedure includes plant nuclei fixation and purification, differential treatments of purified nuclei with MNase, specific recovery of MNase-trimmed small DNA fragments 20-100 bp in length, and MH-seq library construction followed by Illumina sequencing and data analysis. MH-seq has been successfully applied for global identification of open chromatin in both Arabidopsis thaliana and maize. It has been proven to be an attractive alternative for profiling open chromatin. Thus, MH-seq is expected to be valuable in probing chromatin accessibility on a genome-wide scale for other plants with sequenced genomes. Moreover, MH-seq data allow footprinting assays to be implemented to unveil binding sites of transcription factors.
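In the protocol, recovery of the 20 to 100 bp fragments is a wet-lab size-selection step. As a purely illustrative in silico analogue (an assumption for this example, not part of the published procedure), the same size window could be applied to aligned fragments before downstream analysis. The `select_small_fragments` helper and the half-open `(chrom, start, end)` tuples are hypothetical.

```python
from typing import List, Tuple

Fragment = Tuple[str, int, int]  # (chrom, start, end), 0-based half-open


def select_small_fragments(fragments: List[Fragment],
                           min_len: int = 20,
                           max_len: int = 100) -> List[Fragment]:
    """Keep fragments whose length lies within [min_len, max_len] bp."""
    kept = []
    for chrom, start, end in fragments:
        length = end - start  # half-open interval, so no +1
        if min_len <= length <= max_len:
            kept.append((chrom, start, end))
    return kept


if __name__ == "__main__":
    # Hypothetical fragments with lengths 15, 60, 100 and 147 bp.
    frags = [("chr1", 100, 115), ("chr1", 200, 260),
             ("chr2", 50, 150), ("chr2", 300, 447)]
    for frag in select_small_fragments(frags):
        print(frag)
```

In this toy input the 147 bp fragment, roughly the length of DNA wrapped around one nucleosome, is discarded along with the 15 bp fragment, while the 60 bp and 100 bp fragments are kept.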
Affiliation(s)
- Aicen Zhang
  - State Key Laboratory for Crop Genetics and Germplasm Enhancement, JCIC-MCP, CIC-MCP, Nanjing Agricultural University, Nanjing, Jiangsu, P. R. China
- Xinxu Li
  - State Key Laboratory for Crop Genetics and Germplasm Enhancement, JCIC-MCP, CIC-MCP, Nanjing Agricultural University, Nanjing, Jiangsu, P. R. China
- Hainan Zhao
  - Department of Plant Biology, Michigan State University, East Lansing, MI, USA
- Jiming Jiang
  - Department of Plant Biology, Michigan State University, East Lansing, MI, USA
  - Department of Horticulture, Michigan State University, East Lansing, MI, USA
- Wenli Zhang
  - State Key Laboratory for Crop Genetics and Germplasm Enhancement, JCIC-MCP, CIC-MCP, Nanjing Agricultural University, Nanjing, Jiangsu, P. R. China
|
25
|
Hernandes-Lopes J, Yassitepe JEDCT, Koltun A, Pauwels L, Silva VCHD, Dante RA, Gerhardt IR, Arruda P. Genome editing in maize: Toward improving complex traits in a global crop. Genet Mol Biol 2023; 46:e20220217. [PMID: 36880696 PMCID: PMC9990078 DOI: 10.1590/1678-4685-gmb-2022-0217] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 07/10/2022] [Accepted: 12/23/2022] [Indexed: 03/08/2023]
Abstract
Recent advances in genome editing have enormously enhanced the effort to develop biotechnology crops for more sustainable food production. CRISPR/Cas, the most versatile genome-editing tool, has shown the potential to create genome modifications that range from gene knockout and gene expression pattern modulations to allele-specific changes in order to design superior genotypes harboring multiple improved agronomic traits. However, a frequent bottleneck is the delivery of CRISPR/Cas to crops that are less amenable to transformation and regeneration. Several technologies have recently been proposed to overcome transformation recalcitrance, including HI-Edit/IMGE and ectopic/transient expression of genes encoding morphogenic regulators. These technologies allow the eroding of the barriers that make crops inaccessible for genome editing. In this review, we discuss the advances in genome editing in crops with a particular focus on the use of technologies to improve complex traits such as water use efficiency, drought stress, and yield in maize.
Affiliation(s)
- José Hernandes-Lopes
  - Universidade Estadual de Campinas, Genomics for Climate Change Research Center (GCCRC), Campinas, SP, Brazil
  - Universidade Estadual de Campinas, Centro de Biologia Molecular e Engenharia Genética, Campinas, SP, Brazil
- Juliana Erika de Carvalho Teixeira Yassitepe
  - Universidade Estadual de Campinas, Genomics for Climate Change Research Center (GCCRC), Campinas, SP, Brazil
  - Universidade Estadual de Campinas, Centro de Biologia Molecular e Engenharia Genética, Campinas, SP, Brazil
  - Embrapa Agricultura Digital, Campinas, SP, Brazil
- Alessandra Koltun
  - Universidade Estadual de Campinas, Genomics for Climate Change Research Center (GCCRC), Campinas, SP, Brazil
  - Universidade Estadual de Campinas, Centro de Biologia Molecular e Engenharia Genética, Campinas, SP, Brazil
- Laurens Pauwels
  - Ghent University, Department of Plant Biotechnology and Bioinformatics, Ghent, Belgium
  - VIB, Center for Plant Systems Biology, Ghent, Belgium
- Viviane Cristina Heinzen da Silva
  - Universidade Estadual de Campinas, Genomics for Climate Change Research Center (GCCRC), Campinas, SP, Brazil
  - Universidade Estadual de Campinas, Centro de Biologia Molecular e Engenharia Genética, Campinas, SP, Brazil
- Ricardo Augusto Dante
  - Universidade Estadual de Campinas, Genomics for Climate Change Research Center (GCCRC), Campinas, SP, Brazil
  - Universidade Estadual de Campinas, Centro de Biologia Molecular e Engenharia Genética, Campinas, SP, Brazil
  - Embrapa Agricultura Digital, Campinas, SP, Brazil
- Isabel Rodrigues Gerhardt
  - Universidade Estadual de Campinas, Genomics for Climate Change Research Center (GCCRC), Campinas, SP, Brazil
  - Universidade Estadual de Campinas, Centro de Biologia Molecular e Engenharia Genética, Campinas, SP, Brazil
  - Embrapa Agricultura Digital, Campinas, SP, Brazil
- Paulo Arruda
  - Universidade Estadual de Campinas, Genomics for Climate Change Research Center (GCCRC), Campinas, SP, Brazil
  - Universidade Estadual de Campinas, Centro de Biologia Molecular e Engenharia Genética, Campinas, SP, Brazil
  - Universidade Estadual de Campinas, Instituto de Biologia, Departamento de Genética, Evolução, Microbiologia e Imunologia e Evolução, Campinas, SP, Brazil
|
26
|
Wang G, Li X, Shen W, Li MW, Huang M, Zhang J, Li H. The chromatin accessibility landscape of pistils and anthers in rice. PLANT PHYSIOLOGY 2022; 190:2797-2811. [PMID: 36149297 PMCID: PMC9706442 DOI: 10.1093/plphys/kiac448] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/28/2022] [Accepted: 09/04/2022] [Indexed: 06/16/2023]
Abstract
Transcription activation is tightly associated with the openness of chromatin and allows direct contact between transcriptional regulators and their targeted DNA for gene expression. However, there are limited studies on the annotation of open chromatin regions (OCRs) in rice (Oryza sativa), especially those in reproductive organs. Here, we characterized OCRs in rice pistils and anthers with an assay for transposase-accessible chromatin using sequencing. Despite a large overlap, we found more OCRs in pistils than in anthers. These OCRs were enriched in gene transcription start sites (TSSs) and showed tight associations with gene expression. Transcription factor (TF) binding motifs were enriched at these OCRs as validated by TF chromatin immunoprecipitation followed by sequencing. Pistil-specific OCRs provided potential regulatory networks by binding directly to the targets, indicating that pistil-specific OCRs may be indicators of cis-regulatory elements in regulating pistil development, which are absent in anthers. We also found that open chromatin of pistils and anthers responded differently to low temperature (LT). These data offer a comprehensive overview of OCRs regulating reproductive organ development and LT responses in rice.
Affiliation(s)
- Guanqun Wang
  - College of Life Sciences and Oceanography, Shenzhen University, Shenzhen 518000, China
  - State Key Laboratory of Agrobiotechnology, School of Life Sciences, The Chinese University of Hong Kong, Shatin 999077, Hong Kong
- Xiaozheng Li
  - College of Life Sciences and Oceanography, Shenzhen University, Shenzhen 518000, China
- Wei Shen
  - State Key Laboratory of Agrobiotechnology, School of Life Sciences, The Chinese University of Hong Kong, Shatin 999077, Hong Kong
- Man-Wah Li
  - State Key Laboratory of Agrobiotechnology, School of Life Sciences, The Chinese University of Hong Kong, Shatin 999077, Hong Kong
- Mingkun Huang
  - Lushan Botanical Garden Jiangxi Province, Chinese Academy of Sciences, Jiujiang 332900, China
- Jianhua Zhang
  - State Key Laboratory of Agrobiotechnology, School of Life Sciences, The Chinese University of Hong Kong, Shatin 999077, Hong Kong
  - Department of Biology, Hong Kong Baptist University, Kowloon 999077, Hong Kong
- Haoxuan Li
  - State Key Laboratory of Agrobiotechnology, School of Life Sciences, The Chinese University of Hong Kong, Shatin 999077, Hong Kong
  - Department of Biology, Hong Kong Baptist University, Kowloon 999077, Hong Kong
|
27
|
Pei H, Teng W, Gao L, Gao H, Ren X, Liu Y, Jia J, Tong Y, Wang Y, Lu Z. Low-affinity SPL binding sites contribute to subgenome expression divergence in allohexaploid wheat. SCIENCE CHINA LIFE SCIENCES 2022; 66:819-834. [PMID: 36417050 DOI: 10.1007/s11427-022-2202-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/02/2022] [Accepted: 09/22/2022] [Indexed: 11/24/2022]
Abstract
Expression divergence caused by genetic variation and crosstalk among subgenomes of the allohexaploid bread wheat (Triticum aestivum L., BBAADD) is hypothesized to increase its adaptability and/or plasticity. However, the molecular basis of expression divergence remains unclear. Squamosa promoter-binding protein-like (SPL) transcription factors are critical for a wide array of biological processes. In this study, we constructed expression regulatory networks by combining DAP-seq for 40 SPLs, ATAC-seq, and RNA-seq. Our findings indicate that a group of low-affinity SPL binding regions (SBRs) were targeted by diverse SPLs and caused different sequence preferences around the core GTAC motif. The SBRs, including the low-affinity ones, are evolutionarily conserved and enriched for GWAS signals related to important agricultural traits. However, those SBRs are highly diversified among the cis-regulatory regions (CREs) of syntenic genes, with less than 8% of SBRs coexisting in triad genes, suggesting that CRE variations are critical for subgenome differentiation. Knockout of the TaSPL7A/B/D and TaSPL15A/B/D subfamilies further showed that both high- and low-affinity SBRs played critical roles in the differential expression of genes regulating tiller number and spike size. Our results provide baseline data for downstream networks of SPLs and wheat improvement and reveal that CRE variations are critical sources of subgenome divergence in allohexaploid wheat.
|
28
|
Genome-wide chromatin accessibility analysis unveils open chromatin convergent evolution during polyploidization in cotton. Proc Natl Acad Sci U S A 2022; 119:e2209743119. [PMID: 36279429 PMCID: PMC9636936 DOI: 10.1073/pnas.2209743119] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Indexed: 11/18/2022]
Abstract
Allopolyploidization, resulting in divergent genomes in the same cell, is believed to trigger a “genome shock”, leading to broad genetic and epigenetic changes. However, little is understood about chromatin and gene-expression dynamics as underlying driving forces during allopolyploidization. Here, we examined the genome-wide DNase I-hypersensitive site (DHS) and its variations in domesticated allotetraploid cotton (Gossypium hirsutum and Gossypium barbadense, AADD) and its extant AA (Gossypium arboreum) and DD (Gossypium raimondii) progenitors. We observed distinct DHS distributions between G. arboreum and G. raimondii. In contrast, the DHSs of the two subgenomes of G. hirsutum and G. barbadense showed a convergent distribution. This convergent distribution of DHS was also present in the wild allotetraploids Gossypium darwinii and G. hirsutum var. yucatanense, but absent from a resynthesized hybrid of G. arboreum and G. raimondii, suggesting that it may be a common feature in polyploids, and not a consequence of domestication after polyploidization. We revealed that putative cis-regulatory elements (CREs) derived from polyploidization-related DHSs were dominated by several families, including Dof, ERF48, and BPC1. Strikingly, 56.6% of polyploidization-related DHSs were derived from transposable elements (TEs). Moreover, we observed positive correlations between DHS accessibility and the histone marks H3K4me3, H3K27me3, H3K36me3, H3K27ac, and H3K9ac, indicating that coordinated interplay among histone modifications, TEs, and CREs drives the DHS landscape dynamics under polyploidization. Collectively, these findings advance our understanding of the regulatory architecture in plants and underscore the complexity of regulome evolution during polyploidization.
|
29
|
Hämälä T, Ning W, Kuittinen H, Aryamanesh N, Savolainen O. Environmental response in gene expression and DNA methylation reveals factors influencing the adaptive potential of Arabidopsis lyrata. eLife 2022; 11:83115. [PMID: 36306157 PMCID: PMC9616567 DOI: 10.7554/elife.83115] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 08/31/2022] [Accepted: 10/12/2022] [Indexed: 11/13/2022]
Abstract
Understanding what factors influence plastic and genetic variation is valuable for predicting how organisms respond to changes in the selective environment. Here, using gene expression and DNA methylation as molecular phenotypes, we study environmentally induced variation among Arabidopsis lyrata plants grown at lowland and alpine field sites. Our results show that gene expression is highly plastic, as many more genes are differentially expressed between the field sites than between populations. These environmentally responsive genes evolve under strong selective constraint – the strength of purifying selection on the coding sequence is high, while the rate of adaptive evolution is low. We find, however, that positive selection on cis-regulatory variants has likely contributed to the maintenance of genetically variable environmental responses, but such variants segregate only between distantly related populations. In contrast to gene expression, DNA methylation at genic regions is largely insensitive to the environment, and plastic methylation changes are not associated with differential gene expression. Besides genes, we detect environmental effects at transposable elements (TEs): TEs at the high-altitude field site have higher expression and methylation levels, suggestive of a broad-scale TE activation. Compared to the lowland population, plants native to the alpine environment harbor an excess of recent TE insertions, and we observe that specific TE families are enriched within environmentally responsive genes. Our findings provide insight into selective forces shaping plastic and genetic variation. We also highlight how plastic responses at TEs can rapidly create novel heritable variation in stressful conditions.
Affiliation(s)
- Tuomas Hämälä
  - Department of Ecology and Genetics, University of Oulu, Oulu, Finland
- Weixuan Ning
  - Department of Ecology and Genetics, University of Oulu, Oulu, Finland
- Helmi Kuittinen
  - Department of Ecology and Genetics, University of Oulu, Oulu, Finland
- Nader Aryamanesh
  - Department of Ecology and Genetics, University of Oulu, Oulu, Finland
- Outi Savolainen
  - Department of Ecology and Genetics, University of Oulu, Oulu, Finland
|
30
|
Abeyratne CR, Macaya-Sanz D, Zhou R, Barry KW, Daum C, Haiby K, Lipzen A, Stanton B, Yoshinaga Y, Zane M, Tuskan GA, DiFazio SP. High-resolution mapping reveals hotspots and sex-biased recombination in Populus trichocarpa. G3 (BETHESDA, MD.) 2022; 13:6762080. [PMID: 36250890 PMCID: PMC9836356 DOI: 10.1093/g3journal/jkac269] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/10/2022] [Accepted: 09/28/2022] [Indexed: 12/14/2022]
Abstract
Fine-scale meiotic recombination is fundamental to the outcome of natural and artificial selection. Here, dense genetic mapping and haplotype reconstruction were used to estimate recombination for a full factorial Populus trichocarpa cross of 7 males and 7 females. Genomes of the resulting 49 full-sib families (N = 829 offspring) were resequenced, and high-fidelity biallelic SNP/INDELs and pedigree information were used to ascertain allelic phase and impute progeny genotypes to recover gametic haplotypes. The 14 parental genetic maps contained 1,820 SNP/INDELs on average that covered 376.7 Mb of physical length across 19 chromosomes. Comparison of parental and progeny haplotypes allowed fine-scale demarcation of cross-over regions, where 38,846 cross-over events in 1,658 gametes were observed. Cross-over events were positively associated with gene density and negatively associated with GC content and long-terminal repeats. One of the most striking findings was higher rates of cross-overs in males in 8 out of 19 chromosomes. Regions with elevated male cross-over rates had lower gene density and GC content than windows showing no sex bias. High-resolution analysis identified 67 candidate cross-over hotspots spread throughout the genome. DNA sequence motifs enriched in these regions showed striking similarity to those of maize, Arabidopsis, and wheat. These findings, and recombination estimates, will be useful for ongoing efforts to accelerate domestication of this and other biomass feedstocks, as well as future studies investigating broader questions related to evolutionary history, perennial development, phenology, wood formation, vegetative propagation, and dioecy that cannot be studied using annual plant model systems.
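The counts reported in this abstract can be cross-checked with simple arithmetic. The short Python sketch below uses only numbers stated above; the per-gamete and per-chromosome averages are derived here for illustration and are not figures reported by the authors. The per-chromosome value is a naive average that treats all 19 chromosomes alike.

```python
# Figures taken from the abstract.
n_males = 7
n_females = 7
n_offspring = 829
n_crossovers = 38_846
n_gametes_reported = 1_658
n_chromosomes = 19

# A full factorial cross pairs every male with every female.
n_families = n_males * n_females

# Each offspring derives from one maternal and one paternal gamete.
n_gametes = 2 * n_offspring

# Derived averages (illustrative, not reported in the abstract).
crossovers_per_gamete = n_crossovers / n_gametes
crossovers_per_chromosome = crossovers_per_gamete / n_chromosomes

print(n_families)                           # 49
print(n_gametes == n_gametes_reported)      # True
print(round(crossovers_per_gamete, 2))      # 23.43
print(round(crossovers_per_chromosome, 2))  # 1.23
```

The family and gamete counts agree with the 49 full-sib families and 1,658 gametes stated in the abstract, and the data imply a little over one cross-over per chromosome per gamete on average.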
Affiliation(s)
- David Macaya-Sanz
  - Department of Forest Ecology & Genetics, CIFOR-INIA, CSIC, Madrid 28040, Spain
- Ran Zhou
  - Warnell School of Forestry and Natural Resources, Department of Genetics, and Department of Plant Biology, University of Georgia, Athens, GA 30602, USA
- Kerrie W Barry
  - Department of Energy Joint Genome Institute, Berkeley, CA 94720, USA
- Christopher Daum
  - Department of Energy Joint Genome Institute, Berkeley, CA 94720, USA
- Anna Lipzen
  - Department of Energy Joint Genome Institute, Berkeley, CA 94720, USA
- Yuko Yoshinaga
  - Department of Energy Joint Genome Institute, Berkeley, CA 94720, USA
- Matthew Zane
  - Department of Energy Joint Genome Institute, Berkeley, CA 94720, USA
- Gerald A Tuskan
  - Biosciences Division, Center for Bioenergy Innovation, Oak Ridge National Laboratory, Oak Ridge, TN 37830, USA
- Stephen P DiFazio
  - Corresponding author: Department of Biology, West Virginia University, Morgantown, WV 26506, USA.
|
31
|
Ramstein GP, Buckler ES. Prediction of evolutionary constraint by genomic annotations improves functional prioritization of genomic variants in maize. Genome Biol 2022; 23:183. [PMID: 36050782 PMCID: PMC9438327 DOI: 10.1186/s13059-022-02747-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/18/2022] [Accepted: 08/15/2022] [Indexed: 11/10/2022] Open
Abstract
Background: Crop improvement through cross-population genomic prediction and genome editing requires identification of causal variants at high resolution, within fewer than hundreds of base pairs. Most genetic mapping studies have generally lacked such resolution. In contrast, evolutionary approaches can detect genetic effects at high resolution, but they are limited by shifting selection, missing data, and low depth of multiple-sequence alignments. Here we use genomic annotations to accurately predict nucleotide conservation across angiosperms, as a proxy for fitness effect of mutations. Results: Using only sequence analysis, we annotate nonsynonymous mutations in 25,824 maize gene models, with information from bioinformatics and deep learning. Our predictions are validated by experimental information: within-species conservation, chromatin accessibility, and gene expression. According to gene ontology and pathway enrichment analyses, predicted nucleotide conservation points to genes in central carbon metabolism. Importantly, it improves genomic prediction for fitness-related traits such as grain yield, in elite maize panels, by stringent prioritization of fewer than 1% of single-site variants. Conclusions: Our results suggest that predicting nucleotide conservation across angiosperms may effectively prioritize sites most likely to impact fitness-related traits in crops, without being limited by shifting selection, missing data, and low depth of multiple-sequence alignments. Our approach, Prediction of mutation Impact by Calibrated Nucleotide Conservation (PICNC), could be useful to select polymorphisms for accurate genomic prediction, and candidate mutations for efficient base editing. The trained PICNC models and predicted nucleotide conservation at protein-coding SNPs in maize are publicly available in CyVerse (10.25739/hybz-2957). Supplementary Information: The online version contains supplementary material available at 10.1186/s13059-022-02747-2.
Affiliation(s)
- Guillaume P Ramstein
  - Center for Quantitative Genetics and Genomics, Aarhus University, 8000, Aarhus, Denmark
  - Institute for Genomic Diversity, Cornell University, Ithaca, NY, 14853, USA
- Edward S Buckler
  - Institute for Genomic Diversity, Cornell University, Ithaca, NY, 14853, USA
  - USDA-ARS, Ithaca, NY, 14853, USA
|
32
|
Mo Y, Jiao Y. Advances and applications of single-cell omics technologies in plant research. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2022; 110:1551-1563. [PMID: 35426954 DOI: 10.1111/tpj.15772] [Citation(s) in RCA: 23] [Impact Index Per Article: 11.5] [Received: 01/26/2022] [Revised: 04/08/2022] [Accepted: 04/11/2022] [Indexed: 06/14/2023]
Abstract
Single-cell sequencing approaches reveal the intracellular dynamics of individual cells and answer biological questions with high-dimensional catalogs of millions of cells, including genomics, transcriptomics, chromatin accessibility, epigenomics, and proteomics data across species. These emerging yet thriving technologies have been fully embraced by the field of plant biology, with a constantly expanding portfolio of applications. Here, we introduce the current technical advances used for single-cell omics, especially single-cell genome and transcriptome sequencing. Firstly, we overview methods for protoplast and nucleus isolation and genome and transcriptome amplification. Subsequently, we use well-executed benchmarking studies to highlight advances made through the application of single-cell omics techniques. Looking forward, we offer a glimpse of additional hurdles and future opportunities that will introduce broad adoption of single-cell sequencing with revolutionary perspectives in plant biology.
Affiliation(s)
- Yajin Mo
  - State Key Laboratory of Protein and Plant Gene Research, Peking-Tsinghua Center for Life Sciences, Center for Quantitative Biology, School of Life Sciences, Peking University, Beijing, 100871, China
  - School of Life Sciences, Tsinghua University, Beijing, 100084, China
- Yuling Jiao
  - State Key Laboratory of Protein and Plant Gene Research, Peking-Tsinghua Center for Life Sciences, Center for Quantitative Biology, School of Life Sciences, Peking University, Beijing, 100871, China
  - State Key Laboratory of Plant Genomics and National Center for Plant Gene Research (Beijing), Institute of Genetics and Developmental Biology, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Beijing, 100101, China
|
33
|
Sork VL, Cokus SJ, Fitz-Gibbon ST, Zimin AV, Puiu D, Garcia JA, Gugger PF, Henriquez CL, Zhen Y, Lohmueller KE, Pellegrini M, Salzberg SL. High-quality genome and methylomes illustrate features underlying evolutionary success of oaks. Nat Commun 2022; 13:2047. [PMID: 35440538 PMCID: PMC9018854 DOI: 10.1038/s41467-022-29584-y] [Citation(s) in RCA: 23] [Impact Index Per Article: 11.5] [Received: 04/12/2021] [Accepted: 03/11/2022] [Indexed: 02/01/2023] Open
Abstract
The genus Quercus, which emerged ∼55 million years ago during globally warm temperatures, diversified into ∼450 extant species. We present a high-quality de novo genome assembly of a California endemic oak, Quercus lobata, revealing features consistent with oak evolutionary success. Effective population size remained large throughout history despite declining since early Miocene. Analysis of 39,373 mapped protein-coding genes outlined copious duplications consistent with genetic and phenotypic diversity, both by retention of genes created during the ancient γ whole genome hexaploid duplication event and by tandem duplication within families, including numerous resistance genes and a very large block of duplicated DUF247 genes, which have been found to be associated with self-incompatibility in grasses. An additional surprising finding is that subcontext-specific patterns of DNA methylation associated with transposable elements reveal broadly-distributed heterochromatin in intergenic regions, similar to grasses. Collectively, these features promote genetic and phenotypic variation that would facilitate adaptability to changing environments.
Affiliation(s)
- Victoria L Sork
  - Department of Ecology and Evolutionary Biology, University of California, Los Angeles, CA, 90095-1438, USA.
  - Institute of the Environment and Sustainability, University of California, Los Angeles, CA, 90095, USA.
- Shawn J Cokus
  - Department of Molecular, Cell, and Developmental Biology, University of California, Los Angeles, CA, 90095-7239, USA
- Sorel T Fitz-Gibbon
  - Department of Ecology and Evolutionary Biology, University of California, Los Angeles, CA, 90095-1438, USA
- Aleksey V Zimin
  - Center for Computational Biology, Whiting School of Engineering, Johns Hopkins University, Baltimore, MD, 21218, USA
  - Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, 21218, USA
- Daniela Puiu
  - Center for Computational Biology, Whiting School of Engineering, Johns Hopkins University, Baltimore, MD, 21218, USA
- Jesse A Garcia
  - Department of Ecology and Evolutionary Biology, University of California, Los Angeles, CA, 90095-1438, USA
- Paul F Gugger
  - Appalachian Laboratory, University of Maryland Center for Environmental Science, Frostburg, MD, 21532, USA
- Claudia L Henriquez
  - Department of Ecology and Evolutionary Biology, University of California, Los Angeles, CA, 90095-1438, USA
- Ying Zhen
  - Department of Ecology and Evolutionary Biology, University of California, Los Angeles, CA, 90095-1438, USA
- Kirk E Lohmueller
  - Department of Ecology and Evolutionary Biology, University of California, Los Angeles, CA, 90095-1438, USA
  - Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, CA, 90095, USA
- Matteo Pellegrini
  - Department of Molecular, Cell, and Developmental Biology, University of California, Los Angeles, CA, 90095-7239, USA
- Steven L Salzberg
  - Center for Computational Biology, Whiting School of Engineering, Johns Hopkins University, Baltimore, MD, 21218, USA
  - Departments of Biomedical Engineering, Computer Science, and Biostatistics, Johns Hopkins University, Baltimore, MD, 21218, USA
|
34
|
Gage JL, Mali S, McLoughlin F, Khaipho-Burch M, Monier B, Bailey-Serres J, Vierstra RD, Buckler ES. Variation in upstream open reading frames contributes to allelic diversity in maize protein abundance. Proc Natl Acad Sci U S A 2022; 119:e2112516119. [PMID: 35349347 PMCID: PMC9169109 DOI: 10.1073/pnas.2112516119] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Received: 07/14/2021] [Accepted: 02/22/2022] [Indexed: 11/18/2022] Open
Abstract
Significance: Proteins are the machinery that executes essential cellular functions. However, measuring their abundance within an organism can be difficult and resource-intensive. Cells use a variety of mechanisms to control protein synthesis from mRNA, including short open reading frames (uORFs) that lie upstream of the main coding sequence. Ribosomes can preferentially translate uORFs instead of the main coding sequence, leading to reduced translation of the main protein. In this study, we show that uORF sequence variation between individuals can lead to different rates of protein translation and thus variable protein abundances. We also demonstrate that natural variation in uORFs occurs frequently and can be linked to whole-plant phenotypes, indicating that uORF sequence variation likely contributes to plant adaptation.
Affiliation(s)
- Joseph L. Gage
  - Institute for Genomic Diversity, Cornell University, Ithaca, NY 14853
  - Department of Crop and Soil Sciences, North Carolina State University, Raleigh, NC 27695
- Sujina Mali
  - Department of Biology, Washington University in St. Louis, St. Louis, MO 63130
- Fionn McLoughlin
  - Department of Biology, Washington University in St. Louis, St. Louis, MO 63130
- Merritt Khaipho-Burch
  - Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY 14853
- Brandon Monier
  - Institute for Genomic Diversity, Cornell University, Ithaca, NY 14853
- Julia Bailey-Serres
  - Department of Botany and Plant Sciences, Center for Plant Cell Biology, University of California, Riverside, CA 92521
- Richard D. Vierstra
  - Department of Biology, Washington University in St. Louis, St. Louis, MO 63130
- Edward S. Buckler
  - Institute for Genomic Diversity, Cornell University, Ithaca, NY 14853
  - Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY 14853
  - Agricultural Research Service, US Department of Agriculture, Ithaca, NY 14853
|
35
|
Li Y, Ruperao P, Batley J, Edwards D, Martin W, Hobson K, Sutton T. Genomic prediction of preliminary yield trials in chickpea: Effect of functional annotation of SNPs and environment. THE PLANT GENOME 2022; 15:e20166. [PMID: 34786880 DOI: 10.1002/tpg2.20166] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 05/26/2021] [Accepted: 09/14/2021] [Indexed: 06/13/2023]
Abstract
Achieving yield potential in chickpea (Cicer arietinum L.) is limited by many constraints that include biotic and abiotic stresses. Combining next-generation sequencing technology with advanced statistical modeling has the potential to increase genetic gain efficiently. Whole genome resequencing data was obtained from 315 advanced chickpea breeding lines from the Australian chickpea breeding program resulting in more than 298,000 single nucleotide polymorphisms (SNPs) discovered. Analysis of population structure revealed a distinct group of breeding lines with many alleles that are absent from recently released Australian cultivars. Genome-wide association studies (GWAS) using these Australian breeding lines identified 20 SNPs significantly associated with grain yield in multiple field environments. A reduced level of nucleotide diversity and extended linkage disequilibrium suggested that some regions in these chickpea genomes may have been through selective breeding for yield or other traits. A large introgression segment that was introduced from C. echinospermum for phytophthora root rot resistance was identified on chromosome 6, yet it also has the unintended consequence of reducing yield due to linkage drag. We further investigated the effect of genotype by environment interaction on genomic prediction of yield. We found that the training set had better prediction accuracy when phenotyped under conditions relevant to the targeted environments. We also investigated the effect of SNP functional annotation on prediction accuracy using different subsets of SNPs based on their genomic locations: regulatory regions, exome, and alternative splice sites. Compared with the whole SNP dataset, a subset of SNPs did not significantly decrease prediction accuracy for grain yield despite consisting of a smaller number of SNPs.
Affiliation(s)
- Yongle Li
  - School of Agriculture, Food and Wine, The Univ. of Adelaide, Adelaide, SA, 5064, Australia
- Pradeep Ruperao
  - Statistics, Bioinformatics and Data Management, ICRISAT, Hyderabad, 502324, India
- Jacqueline Batley
  - School of Biological Sciences, The Univ. of Western Australia, Perth, WA, 6001, Australia
- David Edwards
  - School of Biological Sciences, The Univ. of Western Australia, Perth, WA, 6001, Australia
- William Martin
  - Dep. of Agriculture and Fisheries, Warwick, Qld, 4370, Australia
- Kristy Hobson
  - NSW Dep. of Primary Industries, Tamworth, NSW, 2340, Australia
- Tim Sutton
  - School of Agriculture, Food and Wine, The Univ. of Adelaide, Adelaide, SA, 5064, Australia
  - South Australian Research and Development Institute, Adelaide, SA, 5064, Australia
|
36
|
Zanini SF, Bayer PE, Wells R, Snowdon RJ, Batley J, Varshney RK, Nguyen HT, Edwards D, Golicz AA. Pangenomics in crop improvement-from coding structural variations to finding regulatory variants with pangenome graphs. THE PLANT GENOME 2022; 15:e20177. [PMID: 34904403 DOI: 10.1002/tpg2.20177] [Citation(s) in RCA: 21] [Impact Index Per Article: 10.5] [Received: 05/14/2021] [Accepted: 10/07/2021] [Indexed: 05/15/2023]
Abstract
Since the first reported crop pangenome in 2014, advances in high-throughput and cost-effective DNA sequencing technologies facilitated multiple such studies including the pangenomes of oilseed rape (Brassica napus L.), soybean [Glycine max (L.) Merr.], rice (Oryza sativa L.), wheat (Triticum aestivum L.), and barley (Hordeum vulgare L.). Compared with single-reference genomes, pangenomes provide a more accurate representation of the genetic variation present in a species. By combining the genomic data of multiple accessions, pangenomes allow for the detection and annotation of complex DNA polymorphisms such as structural variations (SVs), one of the major determinants of genetic diversity within a species. In this review we summarize the current literature on crop pangenomics, focusing on their application to find candidate SVs involved in traits of agronomic interest. We then highlight the potential of pangenomes in the discovery and functional characterization of noncoding regulatory sequences and their variations. We conclude with a summary and outlook on innovative data structures representing the complete content of plant pangenomes including annotations of coding and noncoding elements and outcomes of transcriptomic and epigenomic experiments.
Affiliation(s)
- Silvia F Zanini
  - Dep. of Plant Breeding, IFZ Research Centre for Biosystems, Land Use and Nutrition, Justus Liebig Univ. Giessen, Giessen, 35392, Germany
- Philipp E Bayer
  - School of Biological Sciences and Institute of Agriculture, Univ. of Western Australia, Perth, Western Australia, Australia
- Rachel Wells
  - Dep. of Crop Genetics, John Innes Centre, Norwich Research Park, Norwich, NR47UH, UK
- Rod J Snowdon
  - Dep. of Plant Breeding, IFZ Research Centre for Biosystems, Land Use and Nutrition, Justus Liebig Univ. Giessen, Giessen, 35392, Germany
- Jacqueline Batley
  - School of Biological Sciences and Institute of Agriculture, Univ. of Western Australia, Perth, Western Australia, Australia
- Rajeev K Varshney
  - Center of Excellence in Genomics & Systems Biology, International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru, India
  - State Agricultural Biotechnology Centre, Centre for Crop Food Innovation, Food Futures Institute, Murdoch Univ., Murdoch, WA, Australia
- Henry T Nguyen
  - Division of Plant Sciences, Univ. of Missouri, Columbia, MO, USA
- David Edwards
  - School of Biological Sciences and Institute of Agriculture, Univ. of Western Australia, Perth, Western Australia, Australia
- Agnieszka A Golicz
  - Dep. of Plant Breeding, IFZ Research Centre for Biosystems, Land Use and Nutrition, Justus Liebig Univ. Giessen, Giessen, 35392, Germany
|
37
|
Schmitz RJ, Grotewold E, Stam M. Cis-regulatory sequences in plants: Their importance, discovery, and future challenges. THE PLANT CELL 2022; 34:718-741. [PMID: 34918159 PMCID: PMC8824567 DOI: 10.1093/plcell/koab281] [Citation(s) in RCA: 110] [Impact Index Per Article: 55.0] [Received: 06/04/2021] [Accepted: 10/20/2021] [Indexed: 05/19/2023]
Abstract
The identification and characterization of cis-regulatory DNA sequences and how they function to coordinate responses to developmental and environmental cues is of paramount importance to plant biology. Key to these regulatory processes are cis-regulatory modules (CRMs), which include enhancers and silencers. Despite the extraordinary advances in high-quality sequence assemblies and genome annotations, the identification and understanding of CRMs, and how they regulate gene expression, lag significantly behind. This is especially true for their distinguishing characteristics and activity states. Here, we review the current knowledge on CRMs and breakthrough technologies enabling identification, characterization, and validation of CRMs; we compare the genomic distributions of CRMs with respect to their target genes between different plant species, and discuss the role of transposable elements harboring CRMs in the evolution of gene expression. This is an exciting time to study cis-regulomes in plants; however, significant existing challenges need to be overcome to fully understand and appreciate the role of CRMs in plant biology and in crop improvement.
Affiliation(s)
- Robert J Schmitz
  - Department of Genetics, University of Georgia, Athens, Georgia 30602, USA
- Erich Grotewold
  - Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, Michigan 48824, USA
|
38
|
Dai X, Tu X, Du B, Dong P, Sun S, Wang X, Sun J, Li G, Lu T, Zhong S, Li P. Chromatin and regulatory differentiation between bundle sheath and mesophyll cells in maize. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2022; 109:675-692. [PMID: 34783109 DOI: 10.1111/tpj.15586] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Received: 08/15/2021] [Revised: 11/04/2021] [Accepted: 11/09/2021] [Indexed: 06/13/2023]
Abstract
C4 plants partition photosynthesis enzymes between the bundle sheath (BS) and the mesophyll (M) cells for the better delivery of CO2 to RuBisCO and to reduce photorespiration. To better understand how C4 photosynthesis is regulated at the transcriptional level, we performed RNA-seq, ATAC-seq, ChIP-seq and Bisulfite-seq (BS-seq) on BS and M cells isolated from maize leaves. By integrating differentially expressed genes with chromatin features, we found that chromatin accessibility coordinates with epigenetic features, especially H3K27me3 modification and CHH methylation, to regulate cell type-preferentially enriched gene expression. Not only the chromatin-accessible regions (ACRs) proximal to the genes (pACRs) but also the distal ACRs (dACRs) are determinants of cell type-preferentially enriched expression. We further identified cell type-preferentially enriched motifs, e.g. AAAG for BS cells and TGACC/T for M cells, and determined their corresponding transcription factors: DOFs and WRKYs. The complex interaction between cis and trans factors in the preferential expression of C4 genes was also observed. Interestingly, cell type-preferentially enriched gene expression can be fine-tuned by the coordination of multiple chromatin features. Such coordination may be critical in ensuring the cell type-specific function of key C4 genes. Based on the observed cell type-preferentially enriched expression pattern and coordinated chromatin features, we predicted a set of functionally unknown genes, e.g. Zm00001d042050 and Zm00001d040659, to be potential key C4 genes. Our findings provide deep insight into the architectures associated with C4 gene expression and could serve as a valuable resource to further identify the regulatory mechanisms present in C4 species.
Affiliation(s)
- Xiuru Dai
  - State Key Laboratory of Crop Biology, College of Agronomic Sciences, Shandong Agricultural University, Tai'an, Shandong, 271018, China
- Xiaoyu Tu
  - Joint Center for Single Cell Biology, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai, 200240, China
- Baijuan Du
  - State Key Laboratory of Crop Biology, College of Agronomic Sciences, Shandong Agricultural University, Tai'an, Shandong, 271018, China
- Pengfei Dong
  - State Key Laboratory of Agrobiotechnology, School of Life Sciences, The Chinese University of Hong Kong, Hong Kong, China
- Shilei Sun
  - State Key Laboratory of Crop Biology, College of Agronomic Sciences, Shandong Agricultural University, Tai'an, Shandong, 271018, China
- Xianglan Wang
  - State Key Laboratory of Crop Biology, College of Agronomic Sciences, Shandong Agricultural University, Tai'an, Shandong, 271018, China
- Jing Sun
  - Biotechnology Research Institute/National Key Facility for Gene Resources and Gene Improvement, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
- Gang Li
  - State Key Laboratory of Crop Biology, College of Agronomic Sciences, Shandong Agricultural University, Tai'an, Shandong, 271018, China
- Tiegang Lu
  - Biotechnology Research Institute/National Key Facility for Gene Resources and Gene Improvement, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
- Silin Zhong
  - State Key Laboratory of Agrobiotechnology, School of Life Sciences, The Chinese University of Hong Kong, Hong Kong, China
- Pinghua Li
  - State Key Laboratory of Crop Biology, College of Agronomic Sciences, Shandong Agricultural University, Tai'an, Shandong, 271018, China
|
39
|
Marand AP, Schmitz RJ. Single-cell analysis of cis-regulatory elements. CURRENT OPINION IN PLANT BIOLOGY 2022; 65:102094. [PMID: 34390932 DOI: 10.1016/j.pbi.2021.102094] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Received: 06/04/2021] [Revised: 07/06/2021] [Accepted: 07/14/2021] [Indexed: 06/13/2023]
Abstract
Plant tissues and organs are composed of functionally discrete cell types that are all defined by the same genome sequence. Cell-type variation in part arises from differential accessibility of cis-regulatory elements that encode the blueprints for transcriptional programs underlying cell identity and function. Owing to technical limitations, the role of cis-regulatory elements in cell identity maintenance, differentiation, and functional specialization has remained relatively unexplored in plant systems. Single-cell profiling has emerged as a powerful tool to circumvent these past obstacles by enabling unbiased charting of transcriptional and cis-regulatory states at the resolution of individual cells. Here, we review state-of-the-art single-cell approaches and analytical frameworks that have paved the way for establishing the link between cellular phenotypic variation and cis-regulatory mechanisms in plants.
Affiliation(s)
- Robert J Schmitz
  - Department of Genetics, University of Georgia, Athens, GA 30602, USA.
|
40
|
Catlin NS, Josephs EB. The important contribution of transposable elements to phenotypic variation and evolution. CURRENT OPINION IN PLANT BIOLOGY 2022; 65:102140. [PMID: 34883307 DOI: 10.1016/j.pbi.2021.102140] [Citation(s) in RCA: 23] [Impact Index Per Article: 11.5] [Received: 08/01/2021] [Revised: 10/04/2021] [Accepted: 10/13/2021] [Indexed: 06/13/2023]
Abstract
Transposable elements (TEs) are responsible for significant genomic variation in plants. Our understanding of the evolutionary forces shaping TE polymorphism has lagged behind other mutations because of the difficulty of accurately identifying TE polymorphism in short-read population genomic data. However, new approaches allow us to quantify TE polymorphisms in population datasets and address fundamental questions about the evolution of these polymorphisms. Here, we discuss how insertional biases shape where, when, and how often TEs insert throughout the genome. Next, we examine mechanisms by which TEs can affect phenotype. Finally, we evaluate current evidence for selection on TE polymorphisms. All together, it is clear that TEs are important, but underappreciated, contributors to intraspecific phenotypic variation, and that understanding the dynamics governing TE polymorphism is crucial for evolutionary biologists interested in the maintenance of variation.
Affiliation(s)
- Nathan S Catlin
  - Department of Plant Biology, Michigan State University, East Lansing, MI, 48824, USA; Ecology, Evolution, and Behavior Program, Michigan State University, East Lansing, MI, 48824, USA.
- Emily B Josephs
  - Department of Plant Biology, Michigan State University, East Lansing, MI, 48824, USA; Ecology, Evolution, and Behavior Program, Michigan State University, East Lansing, MI, 48824, USA
|
41
|
Epigenome guided crop improvement: current progress and future opportunities. Emerg Top Life Sci 2022; 6:141-151. [PMID: 35072210 PMCID: PMC9023013 DOI: 10.1042/etls20210258] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 10/28/2021] [Revised: 12/14/2021] [Accepted: 01/04/2022] [Indexed: 12/22/2022]
Abstract
Epigenomics encompasses a broad field of study, including the investigation of chromatin states, chromatin modifications and their impact on gene regulation; as well as the phenomena of epigenetic inheritance. The epigenome is a multi-modal layer of information superimposed on DNA sequences, instructing their usage in gene expression. As such, it is an emerging focus of efforts to improve crop performance. Broadly, this might be divided into avenues that leverage chromatin information to better annotate and decode plant genomes, and into complementary strategies that aim to identify and select for heritable epialleles that control crop traits independent of underlying genotype. In this review, we focus on the first approach, which we term ‘epigenome guided’ improvement. This encompasses the use of chromatin profiles to enhance our understanding of the composition and structure of complex crop genomes. We discuss the current progress and future prospects towards integrating this epigenomic information into crop improvement strategies; in particular for CRISPR/Cas9 gene editing and precision genome engineering. We also highlight some specific opportunities and challenges for grain and horticultural crops.
|
42
|
Schmitz RJ, Marand AP, Zhang X, Mosher RA, Turck F, Chen X, Axtell MJ, Zhong X, Brady SM, Megraw M, Meyers BC. Quality control and evaluation of plant epigenomics data. THE PLANT CELL 2022; 34:503-513. [PMID: 34648025 PMCID: PMC8773985 DOI: 10.1093/plcell/koab255] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Received: 08/02/2021] [Accepted: 10/08/2021] [Indexed: 05/22/2023]
Abstract
Epigenomics is the study of molecular signatures associated with discrete regions within genomes, many of which are important for a wide range of nuclear processes. The ability to profile the epigenomic landscape associated with genes, repetitive regions, transposons, transcription, differential expression, cis-regulatory elements, and 3D chromatin interactions has vastly improved our understanding of plant genomes. However, many epigenomic and single-cell genomic assays are challenging to perform in plants, leading to a wide range of data quality issues; thus, the data require rigorous evaluation prior to downstream analyses and interpretation. In this commentary, we provide considerations for the evaluation of plant epigenomics and single-cell genomics data quality with the aim of improving the quality and utility of studies using those data across diverse plant species.
Affiliation(s)
- Robert J Schmitz
  - Department of Genetics, University of Georgia, Athens, Georgia 30602, USA
- Alexandre P Marand
  - Department of Genetics, University of Georgia, Athens, Georgia 30602, USA
- Xuan Zhang
  - Department of Genetics, University of Georgia, Athens, Georgia 30602, USA
- Rebecca A Mosher
  - School of Plant Sciences, University of Arizona, Tucson, Arizona 85721, USA
- Franziska Turck
  - Department of Plant Developmental Biology, Max Planck Institute for Plant Breeding Research, Köln, Germany
- Xuemei Chen
  - Department of Botany and Plant Sciences, University of California, Riverside, California 92521, USA
- Michael J Axtell
  - Department of Biology and Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, Pennsylvania 16801, USA
- Xuehua Zhong
  - Wisconsin Institute for Discovery & Laboratory of Genetics, University of Wisconsin-Madison, Madison, Wisconsin 53715, USA
- Siobhan M Brady
  - Department of Plant Biology and Genome Center, University of California Davis, Davis, California 95616, USA
- Molly Megraw
  - Department of Botany and Plant Pathology, Center for Quantitative Life Sciences, Oregon State University, Corvallis, Oregon 97331 USA
- Blake C Meyers
  - Donald Danforth Plant Science Center, St Louis, Missouri 63132, USA
  - Division of Plant Sciences, University of Missouri, Columbia, Missouri 65211, USA
|
43
|
Genomic interrogation of a MAGIC population highlights genetic factors controlling fiber quality traits in cotton. Commun Biol 2022; 5:60. [PMID: 35039628 PMCID: PMC8764025 DOI: 10.1038/s42003-022-03022-7] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Received: 07/08/2021] [Accepted: 12/21/2021] [Indexed: 02/05/2023] Open
Abstract
Cotton (Gossypium hirsutum L.) fiber is the most important resource of natural and renewable fiber for the textile industry. However, the understanding of genetic components and their genome-wide interactions controlling fiber quality remains fragmentary. Here, we sequenced a multiple-parent advanced-generation inter-cross (MAGIC) population, consisting of 550 individuals created by inter-crossing 11 founders, and established a mosaic genome map through tracing the origin of haplotypes that share identity-by-descent (IBD). We performed two complementary GWAS methods—SNP-based GWAS (sGWAS) and IBD-based haplotype GWAS (hGWAS). A total of 25 sQTLs and 14 hQTLs related to cotton fiber quality were identified, of which 26 were novel QTLs. Two major QTLs detected by both GWAS methods were responsible for fiber strength and length. The gene Ghir_D11G020400 (GhZF14) encoding the MATE efflux family protein was identified as a novel candidate gene for fiber length. Beyond the additive QTLs, we detected prevalent epistatic interactions that contributed to the genetics of fiber quality, pinpointing another layer for trait variance. This study provides new targets for future molecular design breeding of superior fiber quality.
Wang and colleagues use a complementary GWAS approach to identify genetic loci associated with cotton fiber quality. Using a multiparent advanced-generation inter-cross population, 26 new QTLs related to cotton fiber quality were found.
|
44
|
Horvath R, Josephs EB, Pesquet E, Stinchcombe JR, Wright SI, Scofield D, Slotte T. Selection on Accessible Chromatin Regions in Capsella grandiflora. Mol Biol Evol 2021; 38:5563-5575. [PMID: 34498072 PMCID: PMC8662636 DOI: 10.1093/molbev/msab270] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Indexed: 12/25/2022] Open
Abstract
Accurate estimates of genome-wide rates and fitness effects of new mutations are essential for an improved understanding of molecular evolutionary processes. Although eukaryotic genomes generally contain a large noncoding fraction, functional noncoding regions and fitness effects of mutations in such regions are still incompletely characterized. A promising approach to characterize functional noncoding regions relies on identifying accessible chromatin regions (ACRs) tightly associated with regulatory DNA. Here, we applied this approach to identify and estimate selection on ACRs in Capsella grandiflora, a crucifer species ideal for population genomic quantification of selection due to its favorable population demography. We describe a population-wide ACR distribution based on ATAC-seq data for leaf samples of 16 individuals from a natural population. We use population genomic methods to estimate fitness effects and proportions of positively selected fixations (α) in ACRs and find that intergenic ACRs harbor a considerable fraction of weakly deleterious new mutations, as well as a significantly higher proportion of strongly deleterious mutations than comparable inaccessible intergenic regions. ACRs are enriched for expression quantitative trait loci (eQTL) and depleted of transposable element insertions, as expected if intergenic ACRs are under selection because they harbor regulatory regions. By integrating empirical identification of intergenic ACRs with analyses of eQTL and population genomic analyses of selection, we demonstrate that intergenic regulatory regions are an important source of nearly neutral mutations. These results improve our understanding of selection on noncoding regions and the role of nearly neutral mutations for evolutionary processes in outcrossing Brassicaceae species.
Affiliation(s)
- Robert Horvath
  - Department of Ecology, Environment and Plant Sciences, Science for Life Laboratory, Stockholm University, Stockholm, Sweden
- Emily B Josephs
  - Department of Plant Biology, Michigan State University, Lansing, MI, USA
- Edouard Pesquet
  - Department of Ecology, Environment and Plant Sciences, Stockholm University, Stockholm, Sweden
- John R Stinchcombe
  - Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, ON, Canada
- Stephen I Wright
  - Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, ON, Canada
- Douglas Scofield
  - Department of Ecology and Genetics, Uppsala University, Uppsala, Sweden
- Tanja Slotte
  - Department of Ecology, Environment and Plant Sciences, Science for Life Laboratory, Stockholm University, Stockholm, Sweden
|
45
|
Minow MAA, Lukens L, Rossi V, Colasanti J. Patterns of stability and change in the maize genome: a case study of small RNA transcriptomes in two recombinant inbred lines and their progenitors. Genome 2021; 65:1-12. [PMID: 34597524 DOI: 10.1139/gen-2021-0040] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/22/2022]
Abstract
Small RNAs (sRNAs) are epigenetic regulators of eukaryotic genes and transposable elements (TEs). Diverse sRNA expression patterns exist within a species, but how this diversity arises is not well understood. To provide a window into the dynamics of maize sRNA patterning, sRNA and mRNA transcriptomes were examined in two related Zea mays recombinant inbred lines (RILs) and their inbred parents. Analysis of these RILs revealed that most clusters of sRNA expression retained the parental sRNA expression level. However, expression states that differ from the parental allele were also observed, predominantly reflecting decreases in sRNA expression. When RIL sRNA expression differed from the parental allele, the new state was frequently similar between the two RILs, and similar to the expression state found at the allele in the other parent. Novel sRNA expression patterns, distinct from those of either parent, were rare. Additionally, examination of sRNA expression over TEs revealed one TE family, Gyma, which showed consistent enrichment for RIL sRNA expression differences compared to those found in parental alleles. These findings provide insights into how sRNA silencing might evolve over generations and suggest that further investigation into the molecular nature of sRNA trans regulators is warranted.
Affiliation(s)
- Mark A A Minow
  - Department of Molecular and Cellular Biology, University of Guelph, Guelph, Ontario, Canada
- Lewis Lukens
  - Plant Agriculture Department, University of Guelph, Guelph, Ontario, Canada
- Vincenzo Rossi
  - Council for Agricultural Research and Economics, Research Centre for Cereal and Industrial Crops, I-24126 Bergamo, Italy
- Joseph Colasanti
  - Department of Molecular and Cellular Biology, University of Guelph, Guelph, Ontario, Canada
|
46
|
Stitzer MC, Anderson SN, Springer NM, Ross-Ibarra J. The genomic ecosystem of transposable elements in maize. PLoS Genet 2021; 17:e1009768. [PMID: 34648488 PMCID: PMC8547701 DOI: 10.1371/journal.pgen.1009768] [Citation(s) in RCA: 30] [Impact Index Per Article: 10.0] [Received: 07/17/2019] [Revised: 10/26/2021] [Accepted: 08/10/2021] [Indexed: 12/16/2022] Open
Abstract
Transposable elements (TEs) constitute the majority of flowering plant DNA, reflecting their tremendous success in subverting, avoiding, and surviving the defenses of their host genomes to ensure their selfish replication. More than 85% of the sequence of the maize genome can be ascribed to past transposition, providing a major contribution to the structure of the genome. Evidence from individual loci has informed our understanding of how transposition has shaped the genome, and a number of individual TE insertions have been causally linked to dramatic phenotypic changes. Genome-wide analyses in maize and other taxa have frequently represented TEs as a relatively homogeneous class of fragmentary relics of past transposition, obscuring their evolutionary history and interaction with their host genome. Using an updated annotation of structurally intact TEs in the maize reference genome, we investigate the family-level dynamics of TEs in maize. Integrating a variety of data, from descriptors of individual TEs like coding capacity, expression, and methylation, as well as similar features of the sequence they inserted into, we model the relationship between attributes of the genomic environment and the survival of TE copies and families. In contrast to the wholesale relegation of all TEs to a single category of junk DNA, these differences reveal a diversity of survival strategies of TE families. Together these generate a rich ecology of the genome, with each TE family representing the evolution of a distinct ecological niche. We conclude that while the impact of transposition is highly family- and context-dependent, a family-level understanding of the ecology of TEs in the genome can refine our ability to predict the role of TEs in generating genetic and phenotypic diversity.
Affiliation(s)
- Michelle C. Stitzer
  - Center for Population Biology and Department of Evolution and Ecology, University of California, Davis, California, United States of America
- Sarah N. Anderson
  - Department of Plant and Microbial Biology, University of Minnesota, Saint Paul, Minnesota, United States of America
- Nathan M. Springer
  - Department of Plant and Microbial Biology, University of Minnesota, Saint Paul, Minnesota, United States of America
- Jeffrey Ross-Ibarra
  - Center for Population Biology and Department of Evolution and Ecology, University of California, Davis, California, United States of America
  - Genome Center, University of California, Davis, California, United States of America
|
47
|
Lozano R, Booth GT, Omar BY, Li B, Buckler ES, Lis JT, Del Carpio DP, Jannink JL. RNA polymerase mapping in plants identifies intergenic regulatory elements enriched in causal variants. G3: Genes, Genomes, Genetics 2021; 11:6364897. [PMID: 34499719 PMCID: PMC8527479 DOI: 10.1093/g3journal/jkab273] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 05/03/2021] [Accepted: 07/04/2021] [Indexed: 12/14/2022]
Abstract
Control of gene expression is fundamental at every level of cell function. Promoter-proximal pausing and divergent transcription at promoters and enhancers, which are prominent features in animals, have only been studied in a handful of research experiments in plants. PRO-Seq analysis in cassava (Manihot esculenta) identified peaks of transcriptionally engaged RNA polymerase at both the 5' and 3' end of genes, consistent with paused or slowly moving Polymerase. In addition, we identified divergent transcription at intergenic sites. A full genome search for bi-directional transcription using an algorithm for enhancer detection developed in mammals (dREG) identified many intergenic regulatory element (IRE) candidates. These sites showed distinct patterns of methylation and nucleotide conservation based on genomic evolutionary rate profiling (GERP). SNPs within these IRE candidates explained significantly more variation in fitness and root composition than SNPs in chromosomal segments randomly ascertained from the same intergenic distribution, strongly suggesting a functional importance of these sites. Maize GRO-Seq data showed RNA polymerase occupancy at IREs consistent with patterns in cassava. Furthermore, these IREs in maize significantly overlapped with sites previously identified on the basis of open chromatin, histone marks, and methylation, and were enriched for reported eQTL. Our results suggest that bidirectional transcription can identify intergenic genomic regions in plants that play an important role in transcription regulation and whose identification has the potential to aid crop improvement.
Affiliation(s)
- Roberto Lozano
  - Plant Breeding and Genetics, School of Integrative Plant Science, Cornell University, Ithaca, NY 14853, USA
- Gregory T Booth
  - Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853, USA
- Bo Li
  - State Key Laboratory of Plant Genomics and National Center for Plant Gene Research, Institute of Genetics and Developmental Biology, Chinese Academy of Science, Beijing 100101, China
- Edward S Buckler
  - Plant Breeding and Genetics, School of Integrative Plant Science, Cornell University, Ithaca, NY 14853, USA
  - Institute for Genomic Diversity, Cornell University, Ithaca, NY 14853, USA
  - United States Department of Agriculture, Agricultural Research Service (USDA-ARS) R.W. Holley Center for Agriculture and Health, Ithaca, NY 14853, USA
- John T Lis
  - Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853, USA
- Dunia Pino Del Carpio
  - Plant Breeding and Genetics, School of Integrative Plant Science, Cornell University, Ithaca, NY 14853, USA
- Jean-Luc Jannink
  - Plant Breeding and Genetics, School of Integrative Plant Science, Cornell University, Ithaca, NY 14853, USA
  - United States Department of Agriculture, Agricultural Research Service (USDA-ARS) R.W. Holley Center for Agriculture and Health, Ithaca, NY 14853, USA
|
48
|
Schwope R, Magris G, Miculan M, Paparelli E, Celii M, Tocci A, Marroni F, Fornasiero A, De Paoli E, Morgante M. Open chromatin in grapevine marks candidate CREs and with other chromatin features correlates with gene expression. The Plant Journal 2021; 107:1631-1647. [PMID: 34219317 PMCID: PMC8518642 DOI: 10.1111/tpj.15404] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Received: 08/07/2020] [Revised: 06/24/2021] [Accepted: 06/25/2021] [Indexed: 05/14/2023]
Abstract
Vitis vinifera is an economically important crop and a useful model in which to study chromatin dynamics. In contrast to the small and relatively simple genome of Arabidopsis thaliana, grapevine contains a complex genome of 487 Mb that exhibits extensive colonization by transposable elements. We used Hi-C, ChIP-seq and ATAC-seq to measure how chromatin features correlate to the expression of 31 845 grapevine genes. ATAC-seq revealed the presence of more than 16 000 open chromatin regions, of which we characterize nearly 5000 as possible distal enhancer candidates that occur in intergenic space > 2 kb from the nearest transcription start site (TSS). A motif search identified more than 480 transcription factor (TF) binding sites in these regions, with those for TCP family proteins in greatest abundance. These open chromatin regions are typically within 15 kb from their nearest promoter, and a gene ontology analysis indicated that their nearest genes are significantly enriched for TF activity. The presence of a candidate cis-regulatory element (cCRE) > 2 kb upstream of the TSS, location in the active nuclear compartment as determined by Hi-C, and the enrichment of H3K4me3, H3K4me1 and H3K27ac at the gene are correlated with gene expression. Taken together, these results suggest that regions of intergenic open chromatin identified by ATAC-seq can be considered potential candidates for cis-regulatory regions in V. vinifera. Our findings enhance the characterization of a valuable agricultural crop, and help to clarify the understanding of unique plant biology.
Affiliation(s)
- Rachel Schwope
  - Dipartimento di Scienze Agroalimentari, Ambientali e Animali (DI4A), Udine, I-33100, Italy
  - Istituto di Genomica Applicata, Udine, I-33100, Italy
- Gabriele Magris
  - Dipartimento di Scienze Agroalimentari, Ambientali e Animali (DI4A), Udine, I-33100, Italy
  - Istituto di Genomica Applicata, Udine, I-33100, Italy
- Mara Miculan
  - Dipartimento di Scienze Agroalimentari, Ambientali e Animali (DI4A), Udine, I-33100, Italy
  - Istituto di Genomica Applicata, Udine, I-33100, Italy
  - Present address: Institute of Life Sciences, Scuola Superiore Sant'Anna Pisa, Pisa, 56127, Italy
- Eleonora Paparelli
  - Dipartimento di Scienze Agroalimentari, Ambientali e Animali (DI4A), Udine, I-33100, Italy
  - Istituto di Genomica Applicata, Udine, I-33100, Italy
  - Present address: IGA Technology Services, Udine, I-33100, Italy
- Mirko Celii
  - Dipartimento di Scienze Agroalimentari, Ambientali e Animali (DI4A), Udine, I-33100, Italy
  - Istituto di Genomica Applicata, Udine, I-33100, Italy
  - Present address: Center for Desert Agriculture, Biological and Environmental Sciences & Engineering Division (BESE), KAUST, Thuwal, Makkah, Saudi Arabia
- Aldo Tocci
  - Dipartimento di Scienze Agroalimentari, Ambientali e Animali (DI4A), Udine, I-33100, Italy
  - Istituto di Genomica Applicata, Udine, I-33100, Italy
  - Scuola Internazionale Superiore di Studi Avanzati, Trieste, Friuli-Venezia Giulia, Italy
- Fabio Marroni
  - Dipartimento di Scienze Agroalimentari, Ambientali e Animali (DI4A), Udine, I-33100, Italy
  - Istituto di Genomica Applicata, Udine, I-33100, Italy
- Alice Fornasiero
  - Dipartimento di Scienze Agroalimentari, Ambientali e Animali (DI4A), Udine, I-33100, Italy
  - Istituto di Genomica Applicata, Udine, I-33100, Italy
  - Present address: Center for Desert Agriculture, Biological and Environmental Sciences & Engineering Division (BESE), KAUST, Thuwal, Makkah, Saudi Arabia
- Emanuele De Paoli
  - Dipartimento di Scienze Agroalimentari, Ambientali e Animali (DI4A), Udine, I-33100, Italy
- Michele Morgante
  - Dipartimento di Scienze Agroalimentari, Ambientali e Animali (DI4A), Udine, I-33100, Italy
  - Istituto di Genomica Applicata, Udine, I-33100, Italy
|
49
|
Noshay JM, Marand AP, Anderson SN, Zhou P, Mejia Guerra MK, Lu Z, O'Connor CH, Crisp PA, Hirsch CN, Schmitz RJ, Springer NM. Assessing the regulatory potential of transposable elements using chromatin accessibility profiles of maize transposons. Genetics 2021; 217:1-13. [PMID: 33683350 DOI: 10.1093/genetics/iyaa003] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Received: 09/29/2020] [Accepted: 11/02/2020] [Indexed: 11/14/2022] Open
Abstract
Transposable elements (TEs) have the potential to create regulatory variation both through the disruption of existing DNA regulatory elements and through the creation of novel DNA regulatory elements. In a species with a large genome, such as maize, many TEs interspersed with genes create opportunities for significant allelic variation due to TE presence/absence polymorphisms among individuals. We used information on putative regulatory elements in combination with knowledge about TE polymorphisms in maize to identify TE insertions that interrupt existing accessible chromatin regions (ACRs) in B73 as well as examples of polymorphic TEs that contain ACRs among four inbred lines of maize including B73, Mo17, W22, and PH207. The TE insertions in three other assembled maize genomes (Mo17, W22, or PH207) that interrupt ACRs that are present in the B73 genome can trigger changes to the chromatin, suggesting the potential for both genetic and epigenetic influences of these insertions. Nearly 20% of the ACRs located over 2 kb from the nearest gene are located within an annotated TE. These are regions of unmethylated DNA that show evidence for functional importance similar to ACRs that are not present within TEs. Using a large panel of maize genotypes, we tested if there is an association between the presence of TE insertions that interrupt, or carry, an ACR and the expression of nearby genes. While most TE polymorphisms are not associated with expression for nearby genes, the TEs that carry ACRs exhibit enrichment for being associated with higher expression of nearby genes, suggesting that these TEs may contribute novel regulatory elements. These analyses highlight the potential for a subset of TEs to rewire transcriptional responses in eukaryotic genomes.
Affiliation(s)
- Jaclyn M Noshay
  - Department of Plant and Microbial Biology, University of Minnesota, 140 Gortner Laboratory, 1479 Gortner Avenue, St. Paul, MN 55108, USA
- Alexandre P Marand
  - Department of Genetics, University of Georgia, 120 W Green St, Athens, GA 30602, USA
- Sarah N Anderson
  - Department of Genetics, Development, and Cell Biology, Iowa State University, 2437 Pammel Dr, Ames, IA 50011, USA
- Peng Zhou
  - Department of Plant and Microbial Biology, University of Minnesota, 140 Gortner Laboratory, 1479 Gortner Avenue, St. Paul, MN 55108, USA
- Zefu Lu
  - Department of Genetics, University of Georgia, 120 W Green St, Athens, GA 30602, USA
- Christine H O'Connor
  - Department of Agronomy and Plant Genetics, University of Minnesota, 1994 Upper Buford Circle, 411 Borlaug Hall, St. Paul, MN 55108, USA
- Peter A Crisp
  - School of Agriculture and Food Sciences, The University of Queensland, Harley Teakle Building, Keyhold Rd, St Lucia QLD 4067, Australia
- Candice N Hirsch
  - Department of Agronomy and Plant Genetics, University of Minnesota, 1994 Upper Buford Circle, 411 Borlaug Hall, St. Paul, MN 55108, USA
- Robert J Schmitz
  - Department of Genetics, University of Georgia, 120 W Green St, Athens, GA 30602, USA
- Nathan M Springer
  - Department of Plant and Microbial Biology, University of Minnesota, 140 Gortner Laboratory, 1479 Gortner Avenue, St. Paul, MN 55108, USA
|
50
|
Savadel SD, Hartwig T, Turpin ZM, Vera DL, Lung PY, Sui X, Blank M, Frommer WB, Dennis JH, Zhang J, Bass HW. The native cistrome and sequence motif families of the maize ear. PLoS Genet 2021; 17:e1009689. [PMID: 34383745 PMCID: PMC8360572 DOI: 10.1371/journal.pgen.1009689] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Received: 10/02/2020] [Accepted: 06/30/2021] [Indexed: 01/22/2023] Open
Abstract
Elucidating the transcriptional regulatory networks that underlie growth and development requires robust ways to define the complete set of transcription factor (TF) binding sites. Although TF-binding sites are known to be generally located within accessible chromatin regions (ACRs), pinpointing these DNA regulatory elements globally remains challenging. Current approaches primarily identify binding sites for a single TF (e.g. ChIP-seq), or globally detect ACRs but lack the resolution to consistently define TF-binding sites (e.g. DNAse-seq, ATAC-seq). To address this challenge, we developed MNase-defined cistrome-Occupancy Analysis (MOA-seq), a high-resolution (< 30 bp), high-throughput, and genome-wide strategy to globally identify putative TF-binding sites within ACRs. We used MOA-seq on developing maize ears as a proof of concept, able to define a cistrome of 145,000 MOA footprints (MFs). While a substantial majority (76%) of the known ATAC-seq ACRs intersected with the MFs, only a minority of MFs overlapped with the ATAC peaks, indicating that the majority of MFs were novel and not detected by ATAC-seq. MFs were associated with promoters and significantly enriched for TF-binding and long-range chromatin interaction sites, including for the well-characterized FASCIATED EAR4, KNOTTED1, and TEOSINTE BRANCHED1. Importantly, the MOA-seq strategy improved the spatial resolution of TF-binding prediction and allowed us to identify 215 motif families collectively distributed over more than 100,000 non-overlapping, putatively-occupied binding sites across the genome. Our study presents a simple, efficient, and high-resolution approach to identify putative TF footprints and binding motifs genome-wide, to ultimately define a native cistrome atlas.
Understanding gene regulation remains a central goal of modern biology. Delineating the full set of regulatory DNA elements that orchestrate this regulation requires information at two scales; the broad landscape of accessible chromatin, and the site-specific binding of transcription factors (TFs) at discrete cis-regulatory DNA elements. Here we describe a single assay that uses micrococcal nuclease (MNase) as a structural probe to simultaneously reveal regions of accessible chromatin in addition to high-resolution footprints with signatures of TF-occupied cis-elements. We have used maize developing ear tissue as proof of concept, showing the method detects known TF-binding sites. This genome-wide assay not only defines chromatin landscapes, but crucially enables global discovery and mapping of sequence motifs underlying small footprints of ~30 bp to produce an atlas of candidate TF occupancy.
Affiliation(s)
- Savannah D. Savadel
  - Department of Biological Science, Florida State University, Tallahassee, Florida, United States of America
- Thomas Hartwig
  - Institute for Molecular Physiologie, Heinrich-Heine-Universität, Düsseldorf, Germany
  - Independent research groups, Max Planck Institute for Plant Breeding Research, Cologne, Germany
- Zachary M. Turpin
  - Department of Biological Science, Florida State University, Tallahassee, Florida, United States of America
- Daniel L. Vera
  - Department of Biological Science, Florida State University, Tallahassee, Florida, United States of America
- Pei-Yau Lung
  - Department of Statistics, Florida State University, Tallahassee, Florida, United States of America
- Xin Sui
  - Department of Statistics, Florida State University, Tallahassee, Florida, United States of America
- Max Blank
  - Institute for Molecular Physiologie, Heinrich-Heine-Universität, Düsseldorf, Germany
  - Independent research groups, Max Planck Institute for Plant Breeding Research, Cologne, Germany
- Wolf B. Frommer
  - Institute for Molecular Physiologie, Heinrich-Heine-Universität, Düsseldorf, Germany
  - Independent research groups, Max Planck Institute for Plant Breeding Research, Cologne, Germany
- Jonathan H. Dennis
  - Department of Biological Science, Florida State University, Tallahassee, Florida, United States of America
- Jinfeng Zhang
  - Department of Statistics, Florida State University, Tallahassee, Florida, United States of America
- Hank W. Bass
  - Department of Biological Science, Florida State University, Tallahassee, Florida, United States of America
|