1. Song B, Buckler ES, Wang H, Wu Y, Rees E, Kellogg EA, Gates DJ, Khaipho-Burch M, Bradbury PJ, Ross-Ibarra J, Hufford MB, Romay MC. Conserved noncoding sequences provide insights into regulatory sequence and loss of gene expression in maize. Genome Res 2021; 31:1245-1257. [PMID: 34045362 PMCID: PMC8256870 DOI: 10.1101/gr.266528.120] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Received: 05/27/2020] [Accepted: 05/21/2021] [Indexed: 01/16/2023]
Abstract
Thousands of species will be sequenced in the next few years; however, understanding how their genomes work, without an unlimited budget, requires both molecular and novel evolutionary approaches. We developed a sensitive sequence alignment pipeline to identify conserved noncoding sequences (CNSs) in the Andropogoneae tribe (multiple crop species descended from a common ancestor ∼18 million years ago). The Andropogoneae share similar physiology while being tremendously genomically diverse, harboring a broad range of ploidy levels, structural variation, and transposons. These contribute to the potential of Andropogoneae as a powerful system for studying CNSs and are factors we leverage to understand the function of maize CNSs. We found that 86% of CNSs were comprised of annotated features, including introns, UTRs, putative cis-regulatory elements, chromatin loop anchors, noncoding RNA (ncRNA) genes, and several transposable element superfamilies. CNSs were enriched in active regions of DNA replication in the early S phase of the mitotic cell cycle and showed different DNA methylation ratios compared to the genome-wide background. More than half of putative cis-regulatory sequences (identified via other methods) overlapped with CNSs detected in this study. Variants in CNSs were associated with gene expression levels, and CNS absence contributed to loss of gene expression. Furthermore, the evolution of CNSs was associated with the functional diversification of duplicated genes in the context of maize subgenomes. Our results provide a quantitative understanding of the molecular processes governing the evolution of CNSs in maize.
Affiliation(s)
- Baoxing Song
- Institute for Genomic Diversity, Cornell University, Ithaca, New York 14853, USA
- Edward S Buckler
- Institute for Genomic Diversity, Cornell University, Ithaca, New York 14853, USA
- Section of Plant Breeding and Genetics, Cornell University, Ithaca, New York 14853, USA
- Agricultural Research Service, United States Department of Agriculture, Ithaca, New York 14853, USA
- Hai Wang
- Institute for Genomic Diversity, Cornell University, Ithaca, New York 14853, USA
- National Maize Improvement Center, Key Laboratory of Crop Heterosis and Utilization, Joint Laboratory for International Cooperation in Crop Molecular Breeding, China Agricultural University, Beijing 100193, China
- Yaoyao Wu
- Institute for Genomic Diversity, Cornell University, Ithaca, New York 14853, USA
- Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518124, China
- Evan Rees
- Section of Plant Breeding and Genetics, Cornell University, Ithaca, New York 14853, USA
- Daniel J Gates
- Department of Evolution and Ecology, University of California Davis, Davis, California 95616, USA
- Merritt Khaipho-Burch
- Section of Plant Breeding and Genetics, Cornell University, Ithaca, New York 14853, USA
- Peter J Bradbury
- Agricultural Research Service, United States Department of Agriculture, Ithaca, New York 14853, USA
- Jeffrey Ross-Ibarra
- Department of Evolution and Ecology, University of California Davis, Davis, California 95616, USA
- Center for Population Biology and Genome Center, University of California Davis, Davis, California 95616, USA
- Matthew B Hufford
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, Iowa 50011, USA
- M Cinta Romay
- Institute for Genomic Diversity, Cornell University, Ithaca, New York 14853, USA
2. Reverse Chromatin Immunoprecipitation (R-ChIP) enables investigation of the upstream regulators of plant genes. Commun Biol 2020; 3:770. [PMID: 33318632 PMCID: PMC7736860 DOI: 10.1038/s42003-020-01500-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Received: 11/28/2019] [Accepted: 11/13/2020] [Indexed: 02/02/2023] [Open Access]
Abstract
DNA binding proteins carry out important and diverse functions in the cell, including gene regulation, but identifying these proteins is technically challenging. In the present study, we developed a technique to capture DNA-associated proteins called reverse chromatin immunoprecipitation (R-ChIP). This technology uses a set of specific DNA probes labeled with biotin to isolate chromatin, and the DNA-associated proteins are then identified using mass spectrometry. Using R-ChIP, we identified 439 proteins that potentially bind to the promoter of the Arabidopsis thaliana gene AtCAT3 (AT1G20620). According to functional annotation, we randomly selected 5 transcription factors from these candidates, including bZIP1664, TEM1, bHLH106, BTF3, and HAT1, to verify whether they in fact bind to the AtCAT3 promoter. The binding of these 5 transcription factors was confirmed using chromatin immunoprecipitation quantitative real-time PCR and electrophoretic mobility shift assays. In addition, we improved the R-ChIP method using plants in which the DNA of interest had been transiently introduced, which does not require the T-DNA integration, and showed that this substantially improved the protein capture efficiency. These results together demonstrate that R-ChIP has a wide application to characterize chromatin composition and isolate upstream regulators of a specific gene.
3. Rich-Griffin C, Eichmann R, Reitz MU, Hermann S, Woolley-Allen K, Brown PE, Wiwatdirekkul K, Esteban E, Pasha A, Kogel KH, Provart NJ, Ott S, Schäfer P. Regulation of Cell Type-Specific Immunity Networks in Arabidopsis Roots. Plant Cell 2020; 32:2742-2762. [PMID: 32699170 PMCID: PMC7474276 DOI: 10.1105/tpc.20.00154] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Received: 03/02/2020] [Revised: 07/07/2020] [Accepted: 07/20/2020] [Indexed: 05/04/2023]
Abstract
While root diseases are among the most devastating stresses in global crop production, our understanding of root immunity is still limited relative to our knowledge of immune responses in leaves. Considering that root performance is based on the concerted functions of its different cell types, we undertook a cell type-specific transcriptome analysis to identify gene networks activated in epidermis, cortex, and pericycle cells of Arabidopsis (Arabidopsis thaliana) roots challenged with two immunity elicitors, the bacterial flagellin-derived flg22 and the endogenous Pep1 peptide. Our analyses revealed distinct immunity gene networks in each cell type. To further substantiate our understanding of regulatory patterns underlying these cell type-specific immunity networks, we developed a tool to analyze paired transcription factor binding motifs in the promoters of cell type-specific genes. Our study points toward a connection between cell identity and cell type-specific immunity networks that might guide cell types in launching immune response according to the functional capabilities of each cell type.
Affiliation(s)
- Ruth Eichmann
- School of Life Sciences, University of Warwick, Coventry CV4 7AL, United Kingdom
- Institute of Molecular Botany, Ulm University, 89069 Ulm, Germany
- Marco U Reitz
- School of Life Sciences, University of Warwick, Coventry CV4 7AL, United Kingdom
- Sophie Hermann
- Institute of Phytopathology, Justus Liebig University, 35392 Giessen, Germany
- Paul E Brown
- Bioinformatics Research Technology Platform, University of Warwick, Coventry CV4 7AL, United Kingdom
- Kate Wiwatdirekkul
- Department of Computer Science, University of Warwick, Coventry CV4 7AL, United Kingdom
- Eddi Esteban
- Department of Cell and Systems Biology/Centre for the Analysis of Genome Evolution and Function, University of Toronto, Toronto, Ontario M5S 3B2, Canada
- Asher Pasha
- Department of Cell and Systems Biology/Centre for the Analysis of Genome Evolution and Function, University of Toronto, Toronto, Ontario M5S 3B2, Canada
- Karl-Heinz Kogel
- Institute of Phytopathology, Justus Liebig University, 35392 Giessen, Germany
- Nicholas J Provart
- Department of Cell and Systems Biology/Centre for the Analysis of Genome Evolution and Function, University of Toronto, Toronto, Ontario M5S 3B2, Canada
- Sascha Ott
- Department of Computer Science, University of Warwick, Coventry CV4 7AL, United Kingdom
- Patrick Schäfer
- School of Life Sciences, University of Warwick, Coventry CV4 7AL, United Kingdom
- Institute of Molecular Botany, Ulm University, 89069 Ulm, Germany
- Warwick Integrative Synthetic Biology Centre, University of Warwick, Coventry CV4 7AL, United Kingdom
4. Kulkarni SR, Jones DM, Vandepoele K. Enhanced Maps of Transcription Factor Binding Sites Improve Regulatory Networks Learned from Accessible Chromatin Data. Plant Physiol 2019; 181:412-425. [PMID: 31345953 PMCID: PMC6776849 DOI: 10.1104/pp.19.00605] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Received: 05/20/2019] [Accepted: 07/12/2019] [Indexed: 05/05/2023]
Abstract
Determining where transcription factors (TFs) bind in genomes provides insight into which transcriptional programs are active across organs, tissue types, and environmental conditions. Recent advances in high-throughput profiling of regulatory DNA have yielded large amounts of information about chromatin accessibility. Interpreting the functional significance of these data sets requires knowledge of which regulators are likely to bind these regions. This can be achieved by using information about TF-binding preferences, or motifs, to identify TF-binding events that are likely to be functional. Although different approaches exist to map motifs to DNA sequences, a systematic evaluation of these tools in plants is missing. Here, we compare four motif-mapping tools widely used in the Arabidopsis (Arabidopsis thaliana) research community and evaluate their performance using chromatin immunoprecipitation data sets for 40 TFs. Downstream gene regulatory network (GRN) reconstruction was found to be sensitive to the motif mapper used. We further show that the low recall of Find Individual Motif Occurrences, one of the most frequently used motif-mapping tools, can be overcome by using an Ensemble approach, which combines results from different mapping tools. Several examples are provided demonstrating how the Ensemble approach extends our view on transcriptional control for TFs active in different biological processes. Finally, a protocol is presented to effectively derive more complete cell type-specific GRNs through the integrative analysis of open chromatin regions, known binding site information, and expression data sets. This approach will pave the way to increase our understanding of GRNs in different cellular conditions.
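As a rough illustration of the Ensemble idea in this abstract, the sketch below merges hits from several motif mappers and compares recall against a validated set. The tool names, the `(tf, chrom, start, end)` hit layout, and the simple union rule are assumptions made for illustration; the abstract does not state how the published Ensemble combines results.

```python
from collections import defaultdict

# Hypothetical hit layout: (tf, chrom, start, end). Not the output format of any real tool.
Hit = tuple[str, str, int, int]


def ensemble_hits(hits_by_tool: dict[str, list[Hit]]) -> dict[Hit, set[str]]:
    """Take the union of hits across tools, recording which tools support each site."""
    support: dict[Hit, set[str]] = defaultdict(set)
    for tool, hits in hits_by_tool.items():
        for hit in hits:
            support[hit].add(tool)
    return dict(support)


def recall(predicted: set[Hit], validated: set[Hit]) -> float:
    """Fraction of validated sites (for example, ChIP-supported) that were predicted."""
    if not validated:
        return 0.0
    return len(predicted & validated) / len(validated)


# Toy data: tool_a misses one validated site that tool_b finds.
toy = {
    "tool_a": [("TF1", "Chr1", 100, 110)],
    "tool_b": [("TF1", "Chr1", 100, 110), ("TF1", "Chr1", 500, 510)],
}
truth = {("TF1", "Chr1", 100, 110), ("TF1", "Chr1", 500, 510)}
merged = ensemble_hits(toy)

print(recall(set(toy["tool_a"]), truth))  # recall of a single tool
print(recall(set(merged), truth))         # recall of the union
```

A union can only raise recall; it may also add false positives, so a real pipeline would likely filter the merged sites further (for example, by open chromatin, as the abstract suggests).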
Affiliation(s)
- Shubhada R Kulkarni
- Ghent University, Department of Plant Biotechnology and Bioinformatics, 9052 Ghent, Belgium
- VIB Center for Plant Systems Biology, 9052 Ghent, Belgium
- Bioinformatics Institute Ghent, Ghent University, 9052 Ghent, Belgium
- D Marc Jones
- Ghent University, Department of Plant Biotechnology and Bioinformatics, 9052 Ghent, Belgium
- VIB Center for Plant Systems Biology, 9052 Ghent, Belgium
- Bioinformatics Institute Ghent, Ghent University, 9052 Ghent, Belgium
- Klaas Vandepoele
- Ghent University, Department of Plant Biotechnology and Bioinformatics, 9052 Ghent, Belgium
- VIB Center for Plant Systems Biology, 9052 Ghent, Belgium
- Bioinformatics Institute Ghent, Ghent University, 9052 Ghent, Belgium
5. Kulkarni SR, Vaneechoutte D, Van de Velde J, Vandepoele K. TF2Network: predicting transcription factor regulators and gene regulatory networks in Arabidopsis using publicly available binding site information. Nucleic Acids Res 2018; 46:e31. [PMID: 29272447 PMCID: PMC5888541 DOI: 10.1093/nar/gkx1279] [Citation(s) in RCA: 77] [Impact Index Per Article: 15.4] [Received: 10/25/2017] [Accepted: 12/18/2017] [Indexed: 12/16/2022] [Open Access]
Abstract
A gene regulatory network (GRN) is a collection of regulatory interactions between transcription factors (TFs) and their target genes. GRNs control different biological processes and have been instrumental in understanding the organization and complexity of gene regulation. Although various experimental methods have been used to map GRNs in Arabidopsis thaliana, their limited throughput combined with the large number of TFs means that our knowledge about regulating TFs is incomplete for many genes. We introduce TF2Network, a tool that exploits the vast amount of TF binding site information and enables the delineation of GRNs by detecting potential regulators for a set of co-expressed or functionally related genes. Validation using two experimental benchmarks reveals that TF2Network predicts the correct regulator in 75–92% of the test sets. Furthermore, our tool is robust to noise in the input gene sets, has a low false discovery rate, and performs better at recovering correct regulators than other plant tools. TF2Network is accessible through a web interface where GRNs are interactively visualized and annotated with various types of experimental functional information. TF2Network was used to perform systematic functional and regulatory gene annotations, identifying new TFs involved in circadian rhythm and stress response.
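To make "detecting potential regulators for a set of co-expressed genes" concrete, here is a minimal sketch that ranks TFs by how strongly their predicted targets are over-represented in an input gene set. The hypergeometric test, the function names, and the toy data are all assumptions for illustration; the abstract does not say which statistic TF2Network uses.

```python
from math import comb


def hypergeom_sf(k: int, M: int, n: int, N: int) -> float:
    """P(X >= k) when drawing N genes from M, of which n are targets of the TF."""
    total = comb(M, N)
    # math.comb returns 0 when the lower argument exceeds the upper, so impossible terms vanish.
    return sum(comb(n, i) * comb(M - n, N - i) for i in range(k, min(n, N) + 1)) / total


def rank_regulators(gene_set, targets_by_tf, background):
    """Return (tf, overlap, p_value) tuples, most enriched TF first."""
    universe = set(background)
    query = set(gene_set) & universe
    ranked = []
    for tf, targets in targets_by_tf.items():
        tf_targets = set(targets) & universe
        overlap = len(query & tf_targets)
        p = hypergeom_sf(overlap, len(universe), len(tf_targets), len(query))
        ranked.append((tf, overlap, p))
    return sorted(ranked, key=lambda row: row[2])


# Toy data (hypothetical gene and TF names).
background = {f"g{i}" for i in range(20)}
gene_set = {"g0", "g1", "g2", "g3"}
targets_by_tf = {
    "TF_A": {"g0", "g1", "g2", "g3", "g4"},              # covers the whole gene set
    "TF_B": {"g0"} | {f"g{i}" for i in range(10, 19)},   # mostly unrelated targets
}

for tf, overlap, p in rank_regulators(gene_set, targets_by_tf, background):
    print(tf, overlap, f"{p:.4g}")
```

With many TFs tested at once, the p-values would need multiple-testing correction before being read as a false discovery rate.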
Affiliation(s)
- Shubhada R Kulkarni
- Ghent University, Department of Plant Biotechnology and Bioinformatics, Technologiepark 927, 9052 Ghent, Belgium
- VIB Center for Plant Systems Biology, Technologiepark 927, 9052 Ghent, Belgium
- Bioinformatics Institute Ghent, Ghent University, Technologiepark 927, 9052 Ghent, Belgium
- Dries Vaneechoutte
- Ghent University, Department of Plant Biotechnology and Bioinformatics, Technologiepark 927, 9052 Ghent, Belgium
- VIB Center for Plant Systems Biology, Technologiepark 927, 9052 Ghent, Belgium
- Bioinformatics Institute Ghent, Ghent University, Technologiepark 927, 9052 Ghent, Belgium
- Jan Van de Velde
- Ghent University, Department of Plant Biotechnology and Bioinformatics, Technologiepark 927, 9052 Ghent, Belgium
- VIB Center for Plant Systems Biology, Technologiepark 927, 9052 Ghent, Belgium
- Bioinformatics Institute Ghent, Ghent University, Technologiepark 927, 9052 Ghent, Belgium
- Klaas Vandepoele
- Ghent University, Department of Plant Biotechnology and Bioinformatics, Technologiepark 927, 9052 Ghent, Belgium
- VIB Center for Plant Systems Biology, Technologiepark 927, 9052 Ghent, Belgium
- Bioinformatics Institute Ghent, Ghent University, Technologiepark 927, 9052 Ghent, Belgium
- To whom correspondence should be addressed. Tel: +32 9 3313822; Fax: +32 9 3313809;
6. Vannozzi A, Wong DCJ, Höll J, Hmmam I, Matus JT, Bogs J, Ziegler T, Dry I, Barcaccia G, Lucchin M. Combinatorial Regulation of Stilbene Synthase Genes by WRKY and MYB Transcription Factors in Grapevine (Vitis vinifera L.). Plant Cell Physiol 2018; 59:1043-1059. [PMID: 29529275 DOI: 10.1093/pcp/pcy045] [Citation(s) in RCA: 69] [Impact Index Per Article: 11.5] [Received: 11/14/2017] [Accepted: 02/20/2018] [Indexed: 05/20/2023]
Abstract
Stilbene synthase (STS) is the key enzyme leading to the biosynthesis of resveratrol. Recently we reported two R2R3-MYB transcription factor (TF) genes that regulate the stilbene biosynthetic pathway in grapevine: VviMYB14 and VviMYB15. These genes are strongly co-expressed with STS genes under a range of stress and developmental conditions, in agreement with the specific activation of STS promoters by these TFs. Genome-wide gene co-expression analysis using two separate transcriptome compendia based on microarray and RNA sequencing data revealed that WRKY TFs were the top TF family correlated with STS genes. On the basis of correlation frequency, four WRKY genes, namely VviWRKY03, VviWRKY24, VviWRKY43 and VviWRKY53, were further shortlisted and functionally validated. Expression analyses under both unstressed and stressed conditions, together with promoter-luciferase reporter assays, suggested different hierarchies for these TFs in the regulation of the stilbene biosynthetic pathway. In particular, VviWRKY24 seems to act as a singular effector in the activation of the VviSTS29 promoter, while VviWRKY03 acts through a combinatorial effect with VviMYB14, suggesting that these two regulators may interact at the protein level as previously reported in other species.
Affiliation(s)
- Alessandro Vannozzi
- Department of Agronomy, Food, Natural resources, Animals, and Environment (DAFNAE), University of Padova, Legnaro 35020, Italy
- Darren Chern Jan Wong
- Ecology and Evolution, Research School of Biology, Australian National University Acton, ACT 2601, Australia
- Janine Höll
- Centre for Organismal Studies Heidelberg, University of Heidelberg, Heidelberg D-69120, Germany
- Ibrahim Hmmam
- Department of Agronomy, Food, Natural resources, Animals, and Environment (DAFNAE), University of Padova, Legnaro 35020, Italy
- José Tomás Matus
- Centre for Research in Agricultural Genomics (CRAG) CSIC-IRTA-UAB-UB, Barcelona 08034, Spain
- Jochen Bogs
- Centre for Organismal Studies Heidelberg, University of Heidelberg, Heidelberg D-69120, Germany
- Tobias Ziegler
- Centre for Organismal Studies Heidelberg, University of Heidelberg, Heidelberg D-69120, Germany
- Ian Dry
- CSIRO Agriculture & Food, Urrbrae, SA 5064, Australia
- Gianni Barcaccia
- Department of Agronomy, Food, Natural resources, Animals, and Environment (DAFNAE), University of Padova, Legnaro 35020, Italy
- Margherita Lucchin
- Department of Agronomy, Food, Natural resources, Animals, and Environment (DAFNAE), University of Padova, Legnaro 35020, Italy
7. Bekiaris PS, Tekath T, Staiger D, Danisman S. Computational exploration of cis-regulatory modules in rhythmic expression data using the "Exploration of Distinctive CREs and CRMs" (EDCC) and "CRM Network Generator" (CNG) programs. PLoS One 2018; 13:e0190421. [PMID: 29298348 PMCID: PMC5752016 DOI: 10.1371/journal.pone.0190421] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Received: 06/01/2017] [Accepted: 12/14/2017] [Indexed: 11/19/2022] [Open Access]
Abstract
Understanding the effect of cis-regulatory elements (CRE) and clusters of CREs, which are called cis-regulatory modules (CRM), in eukaryotic gene expression is a challenge of computational biology. We developed two programs that allow simple, fast and reliable analysis of candidate CREs and CRMs that may affect specific gene expression and that determine positional features between individual CREs within a CRM. The first program, "Exploration of Distinctive CREs and CRMs" (EDCC), correlates candidate CREs and CRMs with specific gene expression patterns. For pairs of CREs, EDCC also determines positional preferences of the single CREs in relation to each other and to the transcriptional start site. The second program, "CRM Network Generator" (CNG), prioritizes these positional preferences using a neural network and thus allows unbiased rating of the positional preferences that were determined by EDCC. We tested these programs with data from a microarray study of circadian gene expression in Arabidopsis thaliana. Analyzing more than 1.5 million pairwise CRE combinations, we found 22 candidate combinations, of which several contained known clock promoter elements together with elements that had not been identified as relevant to circadian gene expression before. CNG analysis further identified positional preferences of these CRE pairs, hinting at positional information that may be relevant for circadian gene expression. Future wet lab experiments will have to determine which of these combinations confer daytime specific circadian gene expression.
Affiliation(s)
- Tobias Tekath
- RNA Biology and Molecular Physiology, Faculty of Biology, Bielefeld University, Bielefeld, Germany
- Dorothee Staiger
- RNA Biology and Molecular Physiology, Faculty of Biology, Bielefeld University, Bielefeld, Germany
- Selahattin Danisman
- RNA Biology and Molecular Physiology, Faculty of Biology, Bielefeld University, Bielefeld, Germany
8. Van de Velde J, Van Bel M, Vaneechoutte D, Vandepoele K. A Collection of Conserved Noncoding Sequences to Study Gene Regulation in Flowering Plants. Plant Physiol 2016; 171:2586-98. [PMID: 27261064 PMCID: PMC4972296 DOI: 10.1104/pp.16.00821] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Received: 05/18/2016] [Accepted: 05/31/2016] [Indexed: 05/03/2023]
Abstract
Transcription factors (TFs) regulate gene expression by binding cis-regulatory elements, of which the identification remains an ongoing challenge owing to the prevalence of large numbers of nonfunctional TF binding sites. Powerful comparative genomics methods, such as phylogenetic footprinting, can be used for the detection of conserved noncoding sequences (CNSs), which are functionally constrained and can greatly help in reducing the number of false-positive elements. In this study, we applied a phylogenetic footprinting approach for the identification of CNSs in 10 dicot plants, yielding 1,032,291 CNSs associated with 243,187 genes. To annotate CNSs with TF binding sites, we made use of binding site information for 642 TFs originating from 35 TF families in Arabidopsis (Arabidopsis thaliana). In three species, the identified CNSs were evaluated using TF chromatin immunoprecipitation sequencing data, resulting in significant overlap for the majority of data sets. To identify ultraconserved CNSs, we included genomes of additional plant families and identified 715 binding sites for 501 genes conserved in dicots, monocots, mosses, and green algae. Additionally, we found that genes that are part of conserved mini-regulons have a higher coherence in their expression profile than other divergent gene pairs. All identified CNSs were integrated in the PLAZA 3.0 Dicots comparative genomics platform (http://bioinformatics.psb.ugent.be/plaza/versions/plaza_v3_dicots/) together with new functionalities facilitating the exploration of conserved cis-regulatory elements and their associated genes. The availability of this data set in a user-friendly platform enables the exploration of functional noncoding DNA to study gene regulation in a variety of plant species, including crops.
Affiliation(s)
- Jan Van de Velde
- Department of Plant Systems Biology, Vlaams Instituut voor Biotechnologie, B-9052 Ghent, Belgium (J.V.d.V., M.V.B., D.V., K.V.); and Department of Plant Biotechnology and Bioinformatics, Ghent University, B-9052 Ghent, Belgium (J.V.d.V., M.V.B., D.V., K.V.)
- Michiel Van Bel
- Department of Plant Systems Biology, Vlaams Instituut voor Biotechnologie, B-9052 Ghent, Belgium (J.V.d.V., M.V.B., D.V., K.V.); and Department of Plant Biotechnology and Bioinformatics, Ghent University, B-9052 Ghent, Belgium (J.V.d.V., M.V.B., D.V., K.V.)
- Dries Vaneechoutte
- Department of Plant Systems Biology, Vlaams Instituut voor Biotechnologie, B-9052 Ghent, Belgium (J.V.d.V., M.V.B., D.V., K.V.); and Department of Plant Biotechnology and Bioinformatics, Ghent University, B-9052 Ghent, Belgium (J.V.d.V., M.V.B., D.V., K.V.)
- Klaas Vandepoele
- Department of Plant Systems Biology, Vlaams Instituut voor Biotechnologie, B-9052 Ghent, Belgium (J.V.d.V., M.V.B., D.V., K.V.); and Department of Plant Biotechnology and Bioinformatics, Ghent University, B-9052 Ghent, Belgium (J.V.d.V., M.V.B., D.V., K.V.)
9. Deokar AA, Tar'an B. Genome-Wide Analysis of the Aquaporin Gene Family in Chickpea (Cicer arietinum L.). Front Plant Sci 2016; 7:1802. [PMID: 27965700 PMCID: PMC5126082 DOI: 10.3389/fpls.2016.01802] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.9] [Received: 08/09/2016] [Accepted: 11/15/2016] [Indexed: 05/18/2023]
Abstract
Aquaporins (AQPs) are essential membrane proteins that play a critical role in the transport of water and many other solutes across cell membranes. In this study, a comprehensive genome-wide analysis identified 40 AQP genes in chickpea (Cicer arietinum L.). A complete overview of the chickpea AQP (CaAQP) gene family is presented, including their chromosomal locations, gene structure, phylogeny, gene duplication, conserved functional motifs, gene expression, and conserved promoter motifs. To understand AQP evolution, a comparative analysis of chickpea AQPs with AQP orthologs from soybean, Medicago, common bean, and Arabidopsis was performed. The chickpea AQP genes were found on all of the chickpea chromosomes, except chromosome 7, with a maximum of six genes on chromosome 6, and a minimum of one gene on chromosome 5. Gene duplication analysis indicated that the expansion of the chickpea AQP gene family might have been due to segmental and tandem duplications. CaAQPs were grouped into four subfamilies including 15 NOD26-like intrinsic proteins (NIPs), 13 tonoplast intrinsic proteins (TIPs), eight plasma membrane intrinsic proteins (PIPs), and four small basic intrinsic proteins (SIPs) based on sequence similarities and phylogenetic position. Gene structure analysis revealed a highly conserved exon-intron pattern within CaAQP subfamilies supporting the CaAQP family classification. Functional prediction based on conserved Ar/R selectivity filters, Froger's residues, and specificity-determining positions suggested wide differences in substrate specificity among the subfamilies of CaAQPs. Expression analysis of the AQP genes indicated that some of the genes are tissue-specific, whereas a few other AQP genes showed differential expression in response to biotic and abiotic stresses.
Promoter profiling of CaAQP genes for conserved cis-acting regulatory elements revealed enrichment of cis-elements involved in circadian control, light response, defense and stress responsiveness reflecting their varying pattern of gene expression and potential involvement in biotic and abiotic stress responses. The current study presents the first detailed genome-wide analysis of the AQP gene family in chickpea and provides valuable information for further functional analysis to infer the role of AQP in the adaptation of chickpea in diverse environmental conditions.
10. He L, Su C, Wang Y, Wei Z. ATDOF5.8 protein is the upstream regulator of ANAC069 and is responsive to abiotic stress. Biochimie 2015; 110:17-24. [PMID: 25572919 DOI: 10.1016/j.biochi.2014.12.017] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Received: 05/23/2014] [Accepted: 12/26/2014] [Indexed: 11/18/2022]
Abstract
ANAC069 encodes a plasma membrane-bound NAC protein that integrates auxin and salt signals to regulate Arabidopsis thaliana seed germination. However, the regulatory network of ANAC069 has not been revealed. Here, we report that ANAC069 is a direct target gene of the ATDOF5.8 transcription factor. The promoter of ANAC069 contains seven DOF motifs, a high enrichment of this motif. Yeast one-hybrid (Y1H) assays showed that a Dof protein, ATDOF5.8, binds to the DOF motifs. The interaction between ATDOF5.8 and the promoter of ANAC069 was further confirmed by transient expression assays in tobacco leaves. Chromatin immunoprecipitation (ChIP) results suggested that the binding of ATDOF5.8 to DOF motifs in the promoter of ANAC069 occurs in vivo in Arabidopsis to regulate the expression of ANAC069. Moreover, ATDOF5.8 and ANAC069 share similar expression patterns in response to salt, drought and ABA treatment. Taken together, the results suggested a regulatory network model mediated by ANAC069: under abiotic stress conditions, ATDOF5.8 regulates the expression of ANAC069, and the activated ANAC069 binds to NAC recognition sequences or other motifs to regulate the expression of genes containing these motifs in their promoters.
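Counting motif occurrences in a promoter, as described in this abstract, can be sketched as a scan of both strands. The example assumes the commonly described Dof core sequence AAAG (CTTT on the opposite strand); the abstract does not give the exact motif definition used in the paper, and the promoter below is a made-up toy sequence, not the ANAC069 promoter.

```python
import re

# Assumption: Dof core motif taken as AAAG; the paper's exact definition may differ.
DOF_CORE = "AAAG"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq: str) -> str:
    """Reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def find_motif(promoter: str, motif: str = DOF_CORE) -> list[tuple[int, str]]:
    """Return sorted (0-based position, strand) for every occurrence, overlaps included.

    Note: a palindromic motif would be reported once per strand at the same position.
    """
    seq = promoter.upper()
    hits = []
    for strand, pattern in (("+", motif), ("-", revcomp(motif))):
        # A zero-width lookahead lets re.finditer report overlapping matches.
        for match in re.finditer(f"(?={re.escape(pattern)})", seq):
            hits.append((match.start(), strand))
    return sorted(hits)


toy_promoter = "TTAAAGCCCTTTGGAAAGT"  # hypothetical sequence for illustration
hits = find_motif(toy_promoter)
print(hits)       # positions and strands of each core motif
print(len(hits))  # total motif count across both strands
```

A four-base core occurs often by chance, so a raw count only becomes evidence of enrichment when compared with a background expectation.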
Affiliation(s)
- Lin He
- State Key Laboratory of Tree Genetics and Breeding, Northeast Forestry University, 26 Hexing Road, 150040 Harbin, China; Agricultural College of Heilongjiang Bayi Agricultural University, 163319 Daqing, China
- Chang Su
- State Key Laboratory of Tree Genetics and Breeding, Northeast Forestry University, 26 Hexing Road, 150040 Harbin, China; Agricultural College of Heilongjiang Bayi Agricultural University, 163319 Daqing, China
- Yucheng Wang
- State Key Laboratory of Tree Genetics and Breeding, Northeast Forestry University, 26 Hexing Road, 150040 Harbin, China; Agricultural College of Heilongjiang Bayi Agricultural University, 163319 Daqing, China.
- Zhigang Wei
- State Key Laboratory of Tree Genetics and Breeding, Northeast Forestry University, 26 Hexing Road, 150040 Harbin, China; Agricultural College of Heilongjiang Bayi Agricultural University, 163319 Daqing, China.
11. Heyndrickx KS, Van de Velde J, Wang C, Weigel D, Vandepoele K. A functional and evolutionary perspective on transcription factor binding in Arabidopsis thaliana. Plant Cell 2014; 26:3894-910. [PMID: 25361952 PMCID: PMC4247581 DOI: 10.1105/tpc.114.130591] [Citation(s) in RCA: 56] [Impact Index Per Article: 5.6] [Received: 08/06/2014] [Revised: 10/07/2014] [Accepted: 10/12/2014] [Indexed: 05/19/2023]
Abstract
Understanding the mechanisms underlying gene regulation is paramount to comprehend the translation from genotype to phenotype. The two are connected by gene expression, and it is generally thought that variation in transcription factor (TF) function is an important determinant of phenotypic evolution. We analyzed publicly available genome-wide chromatin immunoprecipitation experiments for 27 TFs in Arabidopsis thaliana and constructed an experimental network containing 46,619 regulatory interactions and 15,188 target genes. We identified hub targets and highly occupied target (HOT) regions, which are enriched for genes involved in development, stimulus responses, signaling, and gene regulatory processes in the currently profiled network. We provide several lines of evidence that TF binding at plant HOT regions is functional, in contrast to that in animals, and not merely the result of accessible chromatin. HOT regions harbor specific DNA motifs, are enriched for differentially expressed genes, and are often conserved across crucifers and dicots, even though they are not under higher levels of purifying selection than non-HOT regions. Distal bound regions are under purifying selection as well and are enriched for a chromatin state showing regulation by the Polycomb repressive complex. Gene expression complexity is positively correlated with the total number of bound TFs, revealing insights in the regulatory code for genes with different expression breadths. The integration of noncanonical and canonical DNA motif information yields new hypotheses on cobinding and tethering between specific TFs involved in flowering and light regulation.
Affiliation(s)
- Ken S Heyndrickx: Department of Plant Systems Biology, VIB, 9052 Gent, Belgium; Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052 Gent, Belgium
- Jan Van de Velde: Department of Plant Systems Biology, VIB, 9052 Gent, Belgium; Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052 Gent, Belgium
- Congmao Wang: Department of Molecular Biology, Max Planck Institute for Developmental Biology, 72076 Tübingen, Germany
- Detlef Weigel: Department of Molecular Biology, Max Planck Institute for Developmental Biology, 72076 Tübingen, Germany
- Klaas Vandepoele: Department of Plant Systems Biology, VIB, 9052 Gent, Belgium; Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052 Gent, Belgium
|
12
|
Van de Velde J, Heyndrickx KS, Vandepoele K. Inference of transcriptional networks in Arabidopsis through conserved noncoding sequence analysis. The Plant Cell 2014; 26:2729-45. [PMID: 24989046 PMCID: PMC4145110 DOI: 10.1105/tpc.114.127001] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.2] [Indexed: 05/05/2023]
Abstract
Transcriptional regulation plays an important role in establishing gene expression profiles during development or in response to (a)biotic stimuli. Transcription factor binding sites (TFBSs) are the functional elements that determine transcriptional activity, and the identification of individual TFBS in genome sequences is a major goal to inferring regulatory networks. We have developed a phylogenetic footprinting approach for the identification of conserved noncoding sequences (CNSs) across 12 dicot plants. Whereas both alignment and non-alignment-based techniques were applied to identify functional motifs in a multispecies context, our method accounts for incomplete motif conservation as well as high sequence divergence between related species. We identified 69,361 footprints associated with 17,895 genes. Through the integration of known TFBS obtained from the literature and experimental studies, we used the CNSs to compile a gene regulatory network in Arabidopsis thaliana containing 40,758 interactions, of which two-thirds act through binding events located in DNase I hypersensitive sites. This network shows significant enrichment toward in vivo targets of known regulators, and its overall quality was confirmed using five different biological validation metrics. Finally, through the integration of detailed expression and function information, we demonstrate how static CNSs can be converted into condition-dependent regulatory networks, offering opportunities for regulatory gene annotation.
Affiliation(s)
- Jan Van de Velde: Department of Plant Systems Biology, VIB, B-9052 Ghent, Belgium; Department of Plant Biotechnology and Bioinformatics, Ghent University, B-9052 Ghent, Belgium
- Ken S Heyndrickx: Department of Plant Systems Biology, VIB, B-9052 Ghent, Belgium; Department of Plant Biotechnology and Bioinformatics, Ghent University, B-9052 Ghent, Belgium
- Klaas Vandepoele: Department of Plant Systems Biology, VIB, B-9052 Ghent, Belgium; Department of Plant Biotechnology and Bioinformatics, Ghent University, B-9052 Ghent, Belgium
|
13
|
House MA, Griswold CK, Lukens LN. Evidence for selection on gene expression in cultivated rice (Oryza sativa). Mol Biol Evol 2014; 31:1514-25. [PMID: 24659814 DOI: 10.1093/molbev/msu110] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Indexed: 12/24/2022] Open
Abstract
Artificial selection has been used throughout plant domestication and breeding to develop crops that are adapted to diverse environments. Here, we investigate whether gene regulatory changes have been widespread targets of lineage-specific selection in cultivated lines Minghui 63 and Zhenshan 97 of rice, Oryza sativa. A line experiencing positive selection for either an increase or a decrease in genes' transcript abundances is expected to have an overabundance of expression quantitative trait locus (eQTL) alleles that increase or decrease those genes' expression, respectively. Results indicate that several genes that share Gene Ontology terms or are members of the same coexpression module have eQTL alleles from one parent that consistently increase gene expression relative to the second parent. A second line of evidence for lineage-specific selection is an overabundance of cis-trans pairs of eQTL alleles that affect gene expression in the same direction (are reinforcing). Across all cis-trans pairs of eQTL, including pairs that both weakly and strongly affect gene expression, there is no evidence for selection. However, the frequency of genes with reinforcing eQTL increases with eQTL strength. Therefore, there is evidence that eQTL with strong effects were positively selected during rice cultivation. Among 41 cis-trans pairs with strong trans eQTL, 31 have reinforcing eQTL. Several of the candidate genes under positive selection accurately predict phenotypic differences between Minghui 63 and Zhenshan 97. Overall, our results suggest that positive selection for regulatory alleles may be a key factor in plant improvement.
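The count reported above (31 reinforcing pairs among 41 cis-trans pairs with strong trans eQTL) can be compared with a simple null model. The following is a minimal sketch, assuming that without selection a pair is equally likely to be reinforcing or opposing (p = 0.5). Both this null and the function name `binomial_upper_tail` are illustrative assumptions and not necessarily the test used by the authors.

```python
from math import comb


def binomial_upper_tail(successes: int, trials: int, p: float = 0.5) -> float:
    """Exact P(X >= successes) for X ~ Binomial(trials, p)."""
    return sum(
        comb(trials, k) * p**k * (1 - p) ** (trials - k)
        for k in range(successes, trials + 1)
    )


# Counts are taken from the abstract; p = 0.5 is an assumed null model.
p_value = binomial_upper_tail(31, 41)
print(f"one-sided p = {p_value:.4g}")
```

A small one-sided probability under this assumed null would be consistent with the abstract's conclusion that strong-effect eQTL show an excess of reinforcing pairs.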
Affiliation(s)
- Megan A House: Department of Plant Agriculture, University of Guelph, Guelph, Ontario, Canada
- Cortland K Griswold: Department of Integrative Biology, University of Guelph, Guelph, Ontario, Canada
- Lewis N Lukens: Department of Plant Agriculture, University of Guelph, Guelph, Ontario, Canada
|
14
|
Downs GS, Liseron-Monfils C, Lukens LN. Regulatory motifs identified from a maize developmental coexpression network. Genome 2014; 57:181-4. [DOI: 10.1139/gen-2013-0177] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Indexed: 02/05/2023]
Abstract
Transcriptional control is an important determinant of plant development, and distinct modules of coordinated genes characterize the maize developmental transcriptome. Upstream regulatory sequences are often the primary factors that control gene expression pattern and abundance. Here, we identify 244 regulatory motifs that are significantly enriched within 24 gene expression modules previously constructed from transcript abundances of 34 876 Zea mays (maize) gene models from embryogenesis to senescence. Within modules, we identify motifs that have not been characterized. In addition, we identify motifs similar to experimentally verified motifs, and the functions of these motifs overlap with predicted module functions. This work demonstrates the power of transcript-level coexpression modules to identify both variants of known regulatory motifs and novel motifs that control a species’ developmental transcriptome.
Affiliation(s)
- Gregory S. Downs: Department of Plant Agriculture, University of Guelph, Guelph, ON N1G 2W1, Canada
- Christophe Liseron-Monfils: Department of Plant Agriculture, University of Guelph, Guelph, ON N1G 2W1, Canada; Cold Spring Harbor Laboratory, 1 Bungtown Road, Cold Spring Harbor, NY 11724, USA
- Lewis N. Lukens: Department of Plant Agriculture, University of Guelph, Guelph, ON N1G 2W1, Canada
|
15
|
Hosseini P, Ovcharenko I, Matthews BF. Using an ensemble of statistical metrics to quantify large sets of plant transcription factor binding sites. Plant Methods 2013; 9:12. [PMID: 23578135 PMCID: PMC3639912 DOI: 10.1186/1746-4811-9-12] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/21/2012] [Accepted: 03/28/2013] [Indexed: 05/07/2023]
Abstract
BACKGROUND: From initial seed germination through reproduction, plants continuously reprogram their transcriptional repertoire to facilitate growth and development. This dynamic is mediated by a diverse but inextricably-linked catalog of regulatory proteins called transcription factors (TFs). Statistically quantifying TF binding site (TFBS) abundance in promoters of differentially expressed genes can be used to identify binding site patterns in promoters that are closely related to stress-response. Output from today's transcriptomic assays necessitates statistically-oriented software to handle large promoter-sequence sets in a computationally tractable fashion.
RESULTS: We present Marina, an open-source software for identifying over-represented TFBSs from amongst large sets of promoter sequences, using an ensemble of 7 statistical metrics and binding-site profiles. Through software comparison, we show that Marina can identify considerably more over-represented plant TFBSs compared to a popular software alternative.
CONCLUSIONS: Marina was used to identify over-represented TFBSs in a two time-point RNA-Seq study exploring the transcriptomic interplay between soybean (Glycine max) and soybean rust (Phakopsora pachyrhizi). Marina identified numerous abundant TFBSs recognized by transcription factors that are associated with defense-response such as WRKY, HY5 and MYB2. Comparing results from Marina to that of a popular software alternative suggests that regardless of the number of promoter-sequences, Marina is able to identify significantly more over-represented TFBSs.
Affiliation(s)
- Parsa Hosseini: Department of Bioinformatics and Computational Biology, George Mason University, Manassas, Virginia, USA; Computational Biology Branch, National Center for Biotechnology Information, National Institutes of Health, Bethesda, Maryland, USA; Soybean Genomics and Improvement Laboratory, United States Department of Agriculture, Beltsville, Maryland, USA
- Ivan Ovcharenko: Computational Biology Branch, National Center for Biotechnology Information, National Institutes of Health, Bethesda, Maryland, USA
- Benjamin F Matthews: Soybean Genomics and Improvement Laboratory, United States Department of Agriculture, Beltsville, Maryland, USA
|
16
|
Seidl MF, Wang RP, Van den Ackerveken G, Govers F, Snel B. Bioinformatic inference of specific and general transcription factor binding sites in the plant pathogen Phytophthora infestans. PLoS One 2012; 7:e51295. [PMID: 23251489 PMCID: PMC3520976 DOI: 10.1371/journal.pone.0051295] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Received: 07/13/2012] [Accepted: 11/01/2012] [Indexed: 11/19/2022] Open
Abstract
Plant infection by oomycete pathogens is a complex process. It requires precise expression of a plethora of genes in the pathogen that contribute to a successful interaction with the host. Whereas much effort has been made to uncover the molecular systems underlying this infection process, mechanisms of transcriptional regulation of the genes involved remain largely unknown. We performed the first systematic de-novo DNA motif discovery analysis in Phytophthora. To this end, we utilized the genome sequence of the late blight pathogen Phytophthora infestans and two related Phytophthora species (P. ramorum and P. sojae), as well as genome-wide in planta gene expression data to systematically predict 19 conserved DNA motifs. This catalog describes common eukaryotic promoter elements whose functionality is supported by the presence of orthologs of known general transcription factors. Together with strong functional enrichment of the common promoter elements towards effector genes involved in pathogenicity, we obtained a new and expanded picture of the promoter structure in P. infestans. More intriguingly, we identified specific DNA motifs that are either highly abundant or whose presence is significantly correlated with gene expression levels during infection. Several of these motifs are observed upstream of genes encoding transporters, RXLR effectors, but also transcriptional regulators. Motifs that are observed upstream of known pathogenicity-related genes are potentially important binding sites for transcription factors. Our analyses add substantial knowledge to the as of yet virtually unexplored question regarding general and specific gene regulation in this important class of pathogens. We propose hypotheses on the effects of cis-regulatory motifs on the gene regulation of pathogenicity-related genes and pinpoint motifs that are prime targets for further experimental validation.
Affiliation(s)
- Michael F Seidl: Theoretical Biology and Bioinformatics, Department of Biology, Utrecht University, Utrecht, The Netherlands
|
17
|
Ding J, Li X, Hu H. Systematic prediction of cis-regulatory elements in the Chlamydomonas reinhardtii genome using comparative genomics. Plant Physiology 2012; 160:613-23. [PMID: 22915576 PMCID: PMC3461543 DOI: 10.1104/pp.112.200840] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Indexed: 05/04/2023]
Abstract
Chlamydomonas reinhardtii is one of the most important microalgae model organisms and has been widely studied toward the understanding of chloroplast functions and various cellular processes. Further exploitation of C. reinhardtii as a model system to elucidate various molecular mechanisms and pathways requires systematic study of gene regulation. However, there is a general lack of genome-scale gene regulation study, such as global cis-regulatory element (CRE) identification, in C. reinhardtii. Recently, large-scale genomic data in microalgae species have become available, which enable the development of efficient computational methods to systematically identify CREs and characterize their roles in microalgae gene regulation. Here, we performed in silico CRE identification at the whole genome level in C. reinhardtii using a comparative genomics-based method. We predicted a large number of CREs in C. reinhardtii that are consistent with experimentally verified CREs. We also discovered that a large percentage of these CREs form combinations and have the potential to work together for coordinated gene regulation in C. reinhardtii. Multiple lines of evidence from literature, gene transcriptional profiles, and gene annotation resources support our prediction. The predicted CREs will serve, to our knowledge, as the first large-scale collection of CREs in C. reinhardtii to facilitate further experimental study of microalgae gene regulation. The accompanying software tool and the predictions in C. reinhardtii are also made available through a Web-accessible database (http://hulab.ucf.edu/research/projects/Microalgae/sdcre/motifcomb.html).
|
18
|
Baxter L, Jironkin A, Hickman R, Moore J, Barrington C, Krusche P, Dyer NP, Buchanan-Wollaston V, Tiskin A, Beynon J, Denby K, Ott S. Conserved noncoding sequences highlight shared components of regulatory networks in dicotyledonous plants. The Plant Cell 2012; 24:3949-65. [PMID: 23110901 PMCID: PMC3517229 DOI: 10.1105/tpc.112.103010] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.0] [Indexed: 05/05/2023]
Abstract
Conserved noncoding sequences (CNSs) in DNA are reliable pointers to regulatory elements controlling gene expression. Using a comparative genomics approach with four dicotyledonous plant species (Arabidopsis thaliana, papaya [Carica papaya], poplar [Populus trichocarpa], and grape [Vitis vinifera]), we detected hundreds of CNSs upstream of Arabidopsis genes. Distinct positioning, length, and enrichment for transcription factor binding sites suggest these CNSs play a functional role in transcriptional regulation. The enrichment of transcription factors within the set of genes associated with CNS is consistent with the hypothesis that together they form part of a conserved transcriptional network whose function is to regulate other transcription factors and control development. We identified a set of promoters where regulatory mechanisms are likely to be shared between the model organism Arabidopsis and other dicots, providing areas of focus for further research.
Affiliation(s)
- Laura Baxter: Warwick Systems Biology Centre, University of Warwick, Coventry CV4 7AL, United Kingdom
- Aleksey Jironkin: Warwick Systems Biology Centre, University of Warwick, Coventry CV4 7AL, United Kingdom
- Richard Hickman: Warwick Systems Biology Centre, University of Warwick, Coventry CV4 7AL, United Kingdom
- Jay Moore: Warwick Systems Biology Centre, University of Warwick, Coventry CV4 7AL, United Kingdom
- Peter Krusche: Warwick Systems Biology Centre, University of Warwick, Coventry CV4 7AL, United Kingdom
- Nigel P. Dyer: Molecular Organisation and Assembly in Cells Doctoral Training Centre, University of Warwick, Coventry CV4 7AL, United Kingdom
- Vicky Buchanan-Wollaston: Warwick Systems Biology Centre, University of Warwick, Coventry CV4 7AL, United Kingdom; School of Life Sciences, University of Warwick, Coventry CV4 7AL, United Kingdom
- Alexander Tiskin: Department of Computer Science, University of Warwick, Coventry CV4 7AL, United Kingdom
- Jim Beynon: Warwick Systems Biology Centre, University of Warwick, Coventry CV4 7AL, United Kingdom; School of Life Sciences, University of Warwick, Coventry CV4 7AL, United Kingdom
- Katherine Denby: Warwick Systems Biology Centre, University of Warwick, Coventry CV4 7AL, United Kingdom; School of Life Sciences, University of Warwick, Coventry CV4 7AL, United Kingdom
- Sascha Ott (corresponding author): Warwick Systems Biology Centre, University of Warwick, Coventry CV4 7AL, United Kingdom
|
19
|
Berendzen KW, Weiste C, Wanke D, Kilian J, Harter K, Dröge-Laser W. Bioinformatic cis-element analyses performed in Arabidopsis and rice disclose bZIP- and MYB-related binding sites as potential AuxRE-coupling elements in auxin-mediated transcription. BMC Plant Biology 2012; 12:125. [PMID: 22852874 PMCID: PMC3438128 DOI: 10.1186/1471-2229-12-125] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.4] [Received: 10/11/2011] [Accepted: 07/11/2012] [Indexed: 05/05/2023]
Abstract
BACKGROUND: In higher plants, a diverse array of developmental and growth-related processes is regulated by the plant hormone auxin. Recent publications have proposed that besides the well-characterized Auxin Response Factors (ARFs) that bind Auxin Response Elements (AuxREs), also members of the bZIP- and MYB-transcription factor (TF) families participate in transcriptional control of auxin-regulated genes via bZIP Response Elements (ZREs) or Myb Response Elements (MREs), respectively.
RESULTS: Applying a novel bioinformatic algorithm, we demonstrate on a genome-wide scale that singular motifs or composite modules of AuxREs, ZREs, MREs but also of MYC2 related elements are significantly enriched in promoters of auxin-inducible genes. Despite considerable, species-specific differences in the genome structure in terms of the GC content, this enrichment is generally conserved in dicot (Arabidopsis thaliana) and monocot (Oryza sativa) model plants. Moreover, an enrichment of defined composite modules has been observed in selected auxin-related gene families. Consistently, a bipartite module, which encompasses a bZIP-associated G-box Related Element (GRE) and an AuxRE motif, has been found to be highly enriched. Making use of transient reporter studies in protoplasts, these findings were experimentally confirmed, demonstrating that GREs functionally interact with AuxREs in regulating auxin-mediated transcription.
CONCLUSIONS: Using genome-wide bioinformatic analyses, evolutionary conserved motifs have been defined which potentially function as AuxRE-dependent coupling elements to establish auxin-specific expression patterns. Based on these findings, experimental approaches can be designed to broaden our understanding of combinatorial, auxin-controlled gene regulation.
Affiliation(s)
- Kenneth W Berendzen: Zentrum für Molekularbiologie der Pflanzen, Pflanzenphysiologie, Universität Tübingen, Auf der Morgenstelle 1, 72076, Tübingen, Germany
- Christoph Weiste: Julius-von-Sachs-Institut, Pharmazeutische Biologie, Universität Würzburg, Julius-von-Sachs-Platz 2, 97082, Würzburg, Germany
- Dierk Wanke: Zentrum für Molekularbiologie der Pflanzen, Pflanzenphysiologie, Universität Tübingen, Auf der Morgenstelle 1, 72076, Tübingen, Germany
- Joachim Kilian: Zentrum für Molekularbiologie der Pflanzen, Pflanzenphysiologie, Universität Tübingen, Auf der Morgenstelle 1, 72076, Tübingen, Germany
- Klaus Harter: Zentrum für Molekularbiologie der Pflanzen, Pflanzenphysiologie, Universität Tübingen, Auf der Morgenstelle 1, 72076, Tübingen, Germany
- Wolfgang Dröge-Laser: Julius-von-Sachs-Institut, Pharmazeutische Biologie, Universität Würzburg, Julius-von-Sachs-Platz 2, 97082, Würzburg, Germany
|
20
|
Heyndrickx KS, Vandepoele K. Systematic identification of functional plant modules through the integration of complementary data sources. Plant Physiology 2012; 159:884-901. [PMID: 22589469 PMCID: PMC3387714 DOI: 10.1104/pp.112.196725] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.2] [Indexed: 05/02/2023]
Abstract
A major challenge is to unravel how genes interact and are regulated to exert specific biological functions. The integration of genome-wide functional genomics data, followed by the construction of gene networks, provides a powerful approach to identify functional gene modules. Large-scale expression data, functional gene annotations, experimental protein-protein interactions, and transcription factor-target interactions were integrated to delineate modules in Arabidopsis (Arabidopsis thaliana). The different experimental input data sets showed little overlap, demonstrating the advantage of combining multiple data types to study gene function and regulation. In the set of 1,563 modules covering 13,142 genes, most modules displayed strong coexpression, but functional and cis-regulatory coherence was less prevalent. Highly connected hub genes showed a significant enrichment toward embryo lethality and evidence for cross talk between different biological processes. Comparative analysis revealed that 58% of the modules showed conserved coexpression across multiple plants. Using module-based functional predictions, 5,562 genes were annotated, and an evaluation experiment disclosed that, based on 197 recently experimentally characterized genes, 38.1% of these functions could be inferred through the module context. Examples of confirmed genes of unknown function related to cell wall biogenesis, xylem and phloem pattern formation, cell cycle, hormone stimulus, and circadian rhythm highlight the potential to identify new gene functions. The module-based predictions offer new biological hypotheses for functionally unknown genes in Arabidopsis (1,701 genes) and six other plant species (43,621 genes). Furthermore, the inferred modules provide new insights into the conservation of coexpression and coregulation as well as a starting point for comparative functional annotation.
|
21
|
From plant gene regulatory grids to network dynamics. Biochimica et Biophysica Acta-Gene Regulatory Mechanisms 2012; 1819:454-65. [DOI: 10.1016/j.bbagrm.2012.02.016] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.8] [Received: 11/12/2011] [Revised: 02/15/2012] [Accepted: 02/16/2012] [Indexed: 11/19/2022]
|
22
|
Petrov V, Vermeirssen V, De Clercq I, Van Breusegem F, Minkov I, Vandepoele K, Gechev TS. Identification of cis-regulatory elements specific for different types of reactive oxygen species in Arabidopsis thaliana. Gene 2012; 499:52-60. [DOI: 10.1016/j.gene.2012.02.035] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.3] [Received: 11/18/2011] [Revised: 02/09/2012] [Accepted: 02/19/2012] [Indexed: 10/28/2022]
|
23
|
Salvini M, Sani E, Fambrini M, Pistelli L, Pucciariello C, Pugliesi C. Molecular analysis of a sunflower gene encoding an homologous of the B subunit of a CAAT binding factor. Mol Biol Rep 2012; 39:6449-65. [PMID: 22359114 DOI: 10.1007/s11033-012-1463-9] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Received: 03/08/2011] [Accepted: 01/23/2012] [Indexed: 10/28/2022]
Abstract
A genomic DNA fragment containing the complete LEAFY COTYLEDON1-LIKE (HaL1L) gene was retrieved by chromosome walking. Its sequence was confirmed and elongated by screening a sunflower genomic DNA BAC Library. HaL1L, whose cDNA had already been sequenced and characterized, encodes a NF-YB subunit of a CCAAT box-binding factor (NF-Y) involved in the early stages of zygotic and somatic embryogenesis in the Helianthus genus. In the HaL1L 5'-flanking region, elements specific to a putative TATA-box promoter and two "CG isles" were identified. An investigation of the methylation status of these CG rich DNA regions showed that differentially methylated cytosines were recognizable in the DNA of embryos on the fifth day after pollination in comparison to leaf DNA, suggesting that during plant development epigenetic regulation of HaL1L transcription was achieved by methylating cytosine residues. We also searched the HaL1L nucleotide sequence for cis-regulatory elements able to interact with other transcription factors (TFs) involved in the HaL1L regulation. Of the elements identified, one of the most intriguing is WUSATA, the target sequence for the WUSCHEL (WUS) TF, which may be part of a complex regulation network controlling embryo development. In this article, we show that the WUSATA target site, located in the intron of HaL1L, is able to bind the TF WUS. Interestingly, we found auxin and abscisic acid responsive motifs in the HaL1L promoter region, suggesting that this gene may additionally be under hormonal control. Finally, the presence of a cytoplasmic polyadenylation signal downstream to the coding region indicates that this gene may also be controlled at the translation level by temporarily making the pre-synthesized HaL1L mRNA unavailable for protein synthesis.
|
24
|
Ding J, Hu H, Li X. Thousands of cis-regulatory sequence combinations are shared by Arabidopsis and poplar. Plant Physiology 2012; 158:145-55. [PMID: 22058225 PMCID: PMC3252106 DOI: 10.1104/pp.111.186080] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Indexed: 05/08/2023]
Abstract
The identification of cis-regulatory modules (CRMs) can greatly advance our understanding of gene regulatory mechanisms. Despite the existence of binding sites of more than three transcription factors (TFs) in a CRM, studies in plants often consider only the cooccurrence of binding sites of one or two TFs. In addition, CRM studies in plants are limited to combinations of only a few families of TFs. It is thus not clear how widespread plant TFs work together, which TFs work together to regulate plant genes, and how the combinations of these TFs are shared by different plants. To fill these gaps, we applied a frequent pattern-mining-based approach to identify frequently used cis-regulatory sequence combinations in the promoter sequences of two plant species, Arabidopsis (Arabidopsis thaliana) and poplar (Populus trichocarpa). A cis-regulatory sequence here corresponds to a DNA motif bound by a TF. We identified 18,638 combinations composed of two to six cis-regulatory sequences that are shared by the two plant species. In addition, with known cis-regulatory sequence combinations, gene function annotation, gene expression data, and known functional gene sets, we showed that the functionality of at least 96.8% and 65.2% of these shared combinations in Arabidopsis are partially supported, under a false discovery rate of 0.1 and 0.05, respectively. Finally, we discovered that 796 of the 18,638 combinations might relate to functions that are important in bioenergy research. Our work will facilitate the study of gene transcriptional regulation in plants.
|
25
|
Timmerhaus G, Hanke ST, Buchta K, Rensing SA. Prediction and validation of promoters involved in the abscisic acid response in Physcomitrella patens. Molecular Plant 2011; 4:713-29. [PMID: 21398384 DOI: 10.1093/mp/ssr009] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Indexed: 05/13/2023]
Abstract
Detection of cis-regulatory elements, such as transcription factor binding sites (TFBS), through utilization of ortholog conservation is possible only if genomic data from closely related organisms are available. An alternative approach is the detection of TFBS based on their overrepresentation in promoters of co-regulated genes. However, this approach usually suffers from a high rate of false-positive prediction. Here, we have conducted a case study using promoters of genes known to be strongly induced by the phytohormone abscisic acid (ABA) in the model plant Physcomitrella patens, a moss. Putative TFBS were detected using three de novo motif detection tools in a strict consensus approach. The resulting motifs were validated using data from microarray expression profiling and were able to predict ABA-induced genes with high specificity (90.48%) at mediocre sensitivity (33.33%). In addition, 27 genes predicted to contain ABA-responsive TFBS were validated using real-time PCR. Here, a total of 37% of the genes could be shown to be induced upon ABA treatment, while 70% were found to be regulated by ABA. We conclude that the consensus approach for motif detection using co-regulation information can be used to identify genes that are regulated under a given stimulus. In terms of evolution, we find that the ABA response has apparently been conserved since the first land plants on the level of families involved in transcriptional regulation.
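The specificity and sensitivity quoted above follow the standard confusion-matrix definitions. The sketch below illustrates them with hypothetical counts, chosen only so that the ratios reproduce the reported percentages. The abstract does not give the underlying counts, so the numbers and function names here are assumptions.

```python
def sensitivity(tp: int, fn: int) -> float:
    """Fraction of truly induced genes that were predicted: TP / (TP + FN)."""
    return tp / (tp + fn)


def specificity(tn: int, fp: int) -> float:
    """Fraction of non-induced genes correctly left unpredicted: TN / (TN + FP)."""
    return tn / (tn + fp)


# Hypothetical counts (not from the paper), picked to match the reported rates.
tp, fn = 7, 14   # 7 of 21 induced genes predicted
tn, fp = 19, 2   # 19 of 21 non-induced genes correctly excluded

print(f"sensitivity = {100 * sensitivity(tp, fn):.2f}%")
print(f"specificity = {100 * specificity(tn, fp):.2f}%")
```

High specificity with low sensitivity means the motifs rarely flag a non-induced gene but miss most induced ones, which matches the abstract's description of a strict consensus approach.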
Affiliation(s)
- Gerrit Timmerhaus: Faculty of Biology, University of Freiburg, Hauptstr. 1, 79104 Freiburg, Germany
|
26
|
Van Leene J, Hollunder J, Eeckhout D, Persiau G, Van De Slijke E, Stals H, Van Isterdael G, Verkest A, Neirynck S, Buffel Y, De Bodt S, Maere S, Laukens K, Pharazyn A, Ferreira PCG, Eloy N, Renne C, Meyer C, Faure JD, Steinbrenner J, Beynon J, Larkin JC, Van de Peer Y, Hilson P, Kuiper M, De Veylder L, Van Onckelen H, Inzé D, Witters E, De Jaeger G. Targeted interactomics reveals a complex core cell cycle machinery in Arabidopsis thaliana. Mol Syst Biol 2011; 6:397. [PMID: 20706207 PMCID: PMC2950081 DOI: 10.1038/msb.2010.53] [Citation(s) in RCA: 257] [Impact Index Per Article: 19.8] [Received: 11/20/2009] [Accepted: 06/12/2010] [Indexed: 12/27/2022] Open
Abstract
A protein interactome focused towards cell proliferation was mapped comprising 857 interactions among 393 proteins, leading to many new insights in plant cell cycle regulation. A comprehensive view on heterodimeric cyclin-dependent kinase (CDK)/cyclin complexes in plants is obtained, in relation with their regulators. Over 100 new candidate cell cycle proteins were predicted.
The basic underlying mechanisms that govern the cell cycle are conserved among all eukaryotes. Peculiar for plants, however, is that their genome contains a collection of cell cycle regulatory genes that is intriguingly large (Vandepoele et al, 2002; Menges et al, 2005) compared to other eukaryotes. Arabidopsis thaliana (Arabidopsis) encodes 71 genes in five regulatory classes versus only 15 in yeast and 23 in human. Despite the discovery of numerous cell cycle genes, little is known about the protein complex machinery that steers plant cell division. Therefore, we applied a tandem affinity purification (TAP) approach coupled with mass spectrometry (MS) on Arabidopsis cell suspension cultures to isolate and analyze protein complexes involved in the cell cycle. This approach allowed us to successfully map a first draft of the basic cell cycle complex machinery of Arabidopsis, providing many new insights into plant cell division. To map the interactome, we relied on a streamlined platform comprising generic Gateway-based vectors with high cloning flexibility, the fast generation of transgenic suspension cultures, TAP adapted for plant cells, and matrix-assisted laser desorption ionization (MALDI) tandem-MS for the identification of purified proteins (Van Leene et al, 2007, 2008). Complexes for 102 cell cycle proteins were analyzed using this approach, leading to a non-redundant data set of 857 interactions among 393 proteins (Figure 1A). Two subspaces were identified in this data set, domain I1, containing interactions confirmed in at least two independent experimental repeats or in the reciprocal purification experiment, and domain I2 consisting of uniquely observed interactions. Several observations underlined the quality of both domains. All tested reverse purifications found the original interaction, and 150 known or predicted interactions were confirmed, meaning that a large number of new interactions was also revealed.
An in-depth computational analysis revealed enrichment for many cell cycle-related features among the proteins of the network (Figure 1B), and many protein pairs were coregulated at the transcriptional level (Figure 1C). Through integration of known cell cycle-related features, more than 100 new candidate cell cycle proteins were predicted (Figure 1D). Besides common qualities of both interactome domains, their real significance appeared through mutual differences exposing two subspaces in the cell cycle interactome: a central regulatory network of stable complexes that are repeatedly isolated and represent core regulatory units, and a peripheral network comprising transient interactions identified less frequently, which are involved in other aspects of the process, such as crosstalk between core complexes or connections with other pathways. To evaluate the biological relevance of the cell cycle interactome in plants, we validated interactions from both domains by a transient split-luciferase assay in Arabidopsis plants (Marion et al, 2008), further sustaining the hypothesis-generating power of the data set to understand plant growth. With respect to insights into the cell cycle physiology, the interactome was subdivided according to the functional classes of the baits and core protein complexes were extracted, covering cyclin-dependent kinase (CDK)/cyclin core complexes together with their positive and negative regulation networks, DNA replication complexes, the anaphase-promoting complex, and spindle checkpoint complexes. The data imply that mitotic A- and B-type cyclins exclusively form heterodimeric complexes with the plant-specific B-type CDKs and not with CDKA;1, whereas D-type cyclins seem to associate with CDKA;1. Besides the extraction of complexes previously shown in other organisms, our data also suggested many new functional links; for example, the link coupling cell division with the regulation of transcript splicing. 
The association of negative regulators of CDK/cyclin complexes with transcription factors suggests that their role in reallocation is not solely targeted to CDK/cyclin complexes. New members of the Siamese-related inhibitory proteins were identified, and for the first time potential inhibitors of plant-specific mitotic B-type CDKs have been found in plants. New evidence that the E2F–DP–RBR network is not only active at G1-to-S, but also at the G2-to-M transition is provided and many complexes involved in DNA replication or repair were isolated. For the first time, a plant APC has been isolated biochemically, identifying three potential new plant-specific APC interactors, and finally, complexes involved in the spindle checkpoint were isolated mapping many new but specific interactions. Finally, to get a general view on the complex machinery, modules of interacting cyclins and core cell cycle regulators were ranked along the cell cycle phases according to the transcript expression peak of the cyclins, showing an assorted set of CDK–cyclin complexes with high regulatory differentiation (Figure 4). Even within the same subfamily (e.g. cyclin A3, B1, B2, D3, and D4), cyclins differ not only in their functional time frame but also in the type and number of CDKs, inhibitors, and scaffolding proteins they bind, further indicating their functional diversification. According to our interaction data, at least 92 different variants of CDK–cyclin complexes are found in Arabidopsis. In conclusion, these results reflect how several rounds of gene duplication (Sterck et al, 2007) led to the evolution of a large set of cyclin paralogs and a myriad of regulators, resulting in a significant jump in the complexity of the cell cycle machinery that could accommodate unique plant-specific features such as an indeterminate mode of postembryonic development. 
Through their extensive regulation and connection with a myriad of up- and downstream pathways, the core cell cycle complexes might offer the plant a flexible toolkit to fine-tune cell proliferation in response to an ever-changing environment. Cell proliferation is the main driving force for plant growth. Although genome sequence analysis revealed a high number of cell cycle genes in plants, little is known about the molecular complexes steering cell division. In a targeted proteomics approach, we mapped the core complex machinery at the heart of the Arabidopsis thaliana cell cycle control. Besides a central regulatory network of core complexes, we distinguished a peripheral network that links the core machinery to up- and downstream pathways. Over 100 new candidate cell cycle proteins were predicted and an in-depth biological interpretation demonstrated the hypothesis-generating power of the interaction data. The data set provided a comprehensive view on heterodimeric cyclin-dependent kinase (CDK)–cyclin complexes in plants. For the first time, inhibitory proteins of plant-specific B-type CDKs were discovered and the anaphase-promoting complex was characterized and extended. Important conclusions were that mitotic A- and B-type cyclins form complexes with the plant-specific B-type CDKs and not with CDKA;1, and that D-type cyclins and S-phase-specific A-type cyclins seem to be associated exclusively with CDKA;1. Furthermore, we could show that plants have evolved a combinatorial toolkit consisting of at least 92 different CDK–cyclin complex variants, which strongly underscores the functional diversification among the large family of cyclins and reflects the pivotal role of cell cycle regulation in the developmental plasticity of plants.
|
27
|
Bernard V, Lecharny A, Brunaud V. Improved detection of motifs with preferential location in promoters. Genome 2011; 53:739-52. [PMID: 20924423 DOI: 10.1139/g10-042] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
Many transcription factor binding sites (TFBSs) involved in gene expression regulation are preferentially located relative to the transcription start site. This property is exploited in in silico prediction approaches, one of which involves studying the local overrepresentation of motifs using a sliding window to scan promoters with considerable accuracy. Nevertheless, the consequences of the choice of the sliding window size have never before been analysed. We propose an automatic adaptation of this size to each motif distribution profile. This approach allows a better characterization of the topological constraints of the motifs and the lists of genes containing them. Moreover, our approach allowed us to highlight a nonconstant frequency of occurrence of spurious motifs that could be counter-selected close to their functional area. Therefore, to improve the accuracy of in silico prediction of TFBSs and the sensitivity of the promoter cartography, we propose, in addition to automatic adaptation of window size, consideration of the nonconstant frequency of motifs in promoters.
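The sliding-window idea described in this abstract can be sketched as follows. This is a fixed-window baseline only, not the authors' method: the automatic adaptation of window size that the paper proposes is not reproduced here. The function names, the z-score cutoff, and the example positions are all assumptions made for illustration.

```python
from statistics import mean, pstdev


def window_counts(positions, start, end, window, step):
    """Count motif occurrences in each window [left, left + window) along the promoter.

    Positions are given relative to the transcription start site (negative = upstream).
    """
    counts = []
    left = start
    while left + window <= end:
        n = sum(1 for p in positions if left <= p < left + window)
        counts.append((left, n))
        left += step
    return counts


def enriched_windows(counts, z_cutoff=2.0):
    """Return windows whose count lies at least z_cutoff standard deviations above the mean."""
    values = [n for _, n in counts]
    mu, sd = mean(values), pstdev(values)
    if sd == 0:
        return []
    return [(left, n) for left, n in counts if (n - mu) / sd >= z_cutoff]


# Hypothetical motif positions pooled over many promoters: a cluster near -30
# plus a few scattered background occurrences.
positions = [-35, -34, -33, -32, -31, -30, -29, -28, -480, -400, -310, -220, -150, -90]
counts = window_counts(positions, start=-500, end=0, window=50, step=50)
print(enriched_windows(counts))
```

With a fixed window, the result depends on the chosen size: a window much wider than the cluster dilutes the peak, while a much narrower one splits it, which is the sensitivity the abstract says had not been analysed before.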
Affiliation(s)
- Virginie Bernard
- Unité de Recherche en Génomique Végétale (URGV), UMR INRA 1165 - CNRS 8114 - UEVE, 91057 Evry CEDEX, France
|
28
|
Gaspin C, Rami JF, Lescure B. Distribution of short interstitial telomere motifs in two plant genomes: putative origin and function. BMC PLANT BIOLOGY 2010; 10:283. [PMID: 21171996 PMCID: PMC3022908 DOI: 10.1186/1471-2229-10-283] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/23/2009] [Accepted: 12/20/2010] [Indexed: 05/05/2023]
Abstract
BACKGROUND Short interstitial telomere motifs (telo boxes) are short sequences identical to plant telomere repeat units. They are observed within the 5' region of several genes over-expressed in cycling cells. In synergy with various cis-acting elements, these motifs participate in the activation of expression. Here, we have analysed the distribution of telo boxes within Arabidopsis thaliana and Oryza sativa genomes and their association with genes involved in the biogenesis of the translational apparatus. RESULTS Our analysis showed that the distribution of the telo box (AAACCCTA) in different genomic regions of A. thaliana and O. sativa is not random. As is also the case for plant microsatellites, they are preferentially located in the 5' flanking regions of genes, mainly within the 5' UTR, and distributed as a gradient along the direction of transcription. As previously reported in Arabidopsis, a conserved topological association of telo boxes with site II or TEF cis-acting elements is observed in almost all promoters of genes encoding ribosomal proteins in O. sativa. Such a conserved promoter organization can be found in other genes involved in the biogenesis of the translational machinery including rRNA processing proteins and snoRNAs. Strikingly, the association of telo boxes with site II motifs or TEF boxes is conserved in promoters of genes harbouring snoRNA clusters nested within an intron as well as in the 5' flanking regions of non-intronic snoRNA genes. Thus, the search for associations between telo boxes and site II motifs or TEF box in plant genomes could provide a useful tool for characterizing new cryptic RNA pol II promoters. CONCLUSIONS The data reported in this work support the model previously proposed for the spreading of telo boxes within plant genomes and provide new insights into a putative process for the acquisition of microsatellites in plants. 
The association of telo boxes with site II or TEF cis-acting elements appears to be an essential feature of plant genes involved in the biogenesis of ribosomes and clearly indicates that most plant snoRNAs are RNA pol II products.
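A minimal sketch of how occurrences of the telo box (AAACCCTA, as given in the abstract) might be located on both strands of a sequence. The function names and the example sequence are hypothetical; the paper's actual scanning procedure is not described in this abstract.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def find_motif(seq: str, motif: str = "AAACCCTA"):
    """Return sorted (position, strand) pairs for exact motif matches on both strands.

    Positions are 0-based offsets on the forward strand; overlapping hits are reported.
    """
    seq = seq.upper()
    hits = []
    for strand, pattern in (("+", motif), ("-", reverse_complement(motif))):
        start = seq.find(pattern)
        while start != -1:
            hits.append((start, strand))
            start = seq.find(pattern, start + 1)
    return sorted(hits)


# Hypothetical 5' flanking sequence with one telo box on each strand.
example = "GG" + "AAACCCTA" + "TT" + "TAGGGTTT" + "CC"
print(find_motif(example))
```

Tallying such hits by genomic region (5' UTR, intron, intergenic) is one way to test whether the distribution departs from random, as the abstract reports.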
Affiliation(s)
- Christine Gaspin
- INRA Toulouse, UBIA & Plateforme Bioinformatique, UR 875, Chemin de Borde Rouge, Auzeville BP 52627, 31326 Castanet-Tolosan, France
- Jean-François Rami
- Centre de coopération internationale en recherche agronomique pour le développement (CIRAD). UMR Développement et Amélioration des Plantes, TA A96/3, Avenue Agropolis, 34398 Montpellier Cedex 5, France
- Bernard Lescure
- Laboratoire Interactions Plantes-Microorganismes (LIPM), UMR 441-2594 (INRA-CNRS), BP 52627, Chemin de Borde Rouge, Auzeville BP 52627, 31326 Castanet-Tolosan, France
|
29
|
Takahashi N, Quimbaya M, Schubert V, Lammens T, Vandepoele K, Schubert I, Matsui M, Inzé D, Berx G, De Veylder L. The MCM-binding protein ETG1 aids sister chromatid cohesion required for postreplicative homologous recombination repair. PLoS Genet 2010; 6:e1000817. [PMID: 20090939 PMCID: PMC2806904 DOI: 10.1371/journal.pgen.1000817] [Citation(s) in RCA: 52] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2009] [Accepted: 12/16/2009] [Indexed: 11/22/2022] Open
Abstract
The DNA replication process represents a source of DNA stress that causes potentially spontaneous genome damage. This effect might be strengthened by mutations in crucial replication factors, requiring the activation of DNA damage checkpoints to enable DNA repair before anaphase onset. Here, we demonstrate that depletion of the evolutionarily conserved minichromosome maintenance helicase-binding protein ETG1 of Arabidopsis thaliana resulted in a stringent late G2 cell cycle arrest. This arrest correlated with a partial loss of sister chromatid cohesion. The lack-of-cohesion phenotype was intensified in plants without functional CTF18, a replication fork factor needed for cohesion establishment. The synergistic effect of the etg1 and ctf18 mutants on sister chromatid cohesion strengthened the impact on plant growth of the replication stress caused by ETG1 deficiency because of inefficient DNA repair. We conclude that the ETG1 replication factor is required for efficient cohesion and that cohesion establishment is essential for proper development of plants suffering from endogenous DNA stress. Cohesion defects observed upon knockdown of its human counterpart suggest an equally important developmental role for the orthologous mammalian ETG1 protein. DNA replication is a highly complex process and the source of potential DNA damage. It is of utmost importance that the damaged DNA is repaired before cells proceed through mitosis, because the genome holds all the information required for correct development. DNA replication results in two identical sister chromatids. A trick applied by cells to overcome damaged DNA is homologous recombination, using the undamaged copy of the sister chromatid as a template to repair the damaged one. This process is aided by keeping the two sister chromatids in close proximity after the replication process by the deposition of a molecular glue, called cohesin. 
In the present work, we identified the Arabidopsis thaliana ETG1 protein as a novel evolutionarily conserved replication factor that is needed for maintaining the sister chromatids physically aligned. In plants without ETG1, DNA damage builds up due to inefficient DNA repair. As a consequence, cell division is impaired with a huge impact on plant growth, highlighting the importance of cohesin for the correct development of eukaryotic organisms. Cohesion phenotypes observed upon the depletion of the orthologous human ETG1 protein indicate equally prominent roles for this particular factor during mammalian development.
Affiliation(s)
- Naoki Takahashi
- Department of Plant Systems Biology, Flanders Institute for Biotechnology (VIB), Gent, Belgium
- Department of Plant Biotechnology and Genetics, Ghent University, Gent, Belgium
- Plant Functional Genomics Research Group, RIKEN Plant Science Center, Yokohama, Kanagawa, Japan
- Mauricio Quimbaya
- Department of Plant Systems Biology, Flanders Institute for Biotechnology (VIB), Gent, Belgium
- Department of Plant Biotechnology and Genetics, Ghent University, Gent, Belgium
- Department for Molecular Biomedical Research, Molecular and Cellular Oncology Unit, Flanders Institute for Biotechnology (VIB), Gent, Belgium
- Department of Biomedical Molecular Biology, Ghent University, Gent, Belgium
- Veit Schubert
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Gatersleben, Germany
- Tim Lammens
- Department of Plant Systems Biology, Flanders Institute for Biotechnology (VIB), Gent, Belgium
- Department of Plant Biotechnology and Genetics, Ghent University, Gent, Belgium
- Klaas Vandepoele
- Department of Plant Systems Biology, Flanders Institute for Biotechnology (VIB), Gent, Belgium
- Department of Plant Biotechnology and Genetics, Ghent University, Gent, Belgium
- Ingo Schubert
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Gatersleben, Germany
- Minami Matsui
- Plant Functional Genomics Research Group, RIKEN Plant Science Center, Yokohama, Kanagawa, Japan
- Dirk Inzé
- Department of Plant Systems Biology, Flanders Institute for Biotechnology (VIB), Gent, Belgium
- Department of Plant Biotechnology and Genetics, Ghent University, Gent, Belgium
- Geert Berx
- Department for Molecular Biomedical Research, Molecular and Cellular Oncology Unit, Flanders Institute for Biotechnology (VIB), Gent, Belgium
- Department of Biomedical Molecular Biology, Ghent University, Gent, Belgium
- Lieven De Veylder
- Department of Plant Systems Biology, Flanders Institute for Biotechnology (VIB), Gent, Belgium
- Department of Plant Biotechnology and Genetics, Ghent University, Gent, Belgium
|
30
|
Nero D, Katari MS, Kelfer J, Tranchina D, Coruzzi GM. In silico evaluation of predicted regulatory interactions in Arabidopsis thaliana. BMC Bioinformatics 2009; 10:435. [PMID: 20025756 PMCID: PMC2803859 DOI: 10.1186/1471-2105-10-435] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2009] [Accepted: 12/21/2009] [Indexed: 01/18/2023] Open
Abstract
Background Prediction of transcriptional regulatory mechanisms in Arabidopsis has become increasingly critical with the explosion of genomic data now available for both gene expression and gene sequence composition. We have shown in previous work [1] that a combination of correlation measurements and cis-regulatory element (CRE) detection methods is effective in predicting targets for candidate transcription factors for specific case studies which were validated. However, to date there has been no quantitative assessment as to which correlation measures or CRE detection methods, used alone or in combination, are most effective in predicting TF→target relationships on a genome-wide scale. Results We tested several widely used methods, based on correlation (Pearson and Spearman Rank correlation) and cis-regulatory element (CRE) detection (≥1 CRE or CRE over-representation), to determine which of these methods individually or in combination is the most effective by various measures for making regulatory predictions. To predict the regulatory targets of a transcription factor (TF) of interest, we applied these methods to microarray expression data for genes that were regulated over treatment and control conditions in wild type (WT) plants. Because the chosen data sets included identical experimental conditions used on TF over-expressor or T-DNA knockout plants, we were able to test the TF→target predictions made using microarray data from WT plants against microarray data from mutant/transgenic plants. For each method, or combination of methods, we computed sensitivity, specificity, positive and negative predictive value, and the F-measure of balance between sensitivity and positive predictive value (precision). This analysis revealed that the ≥1 CRE and Spearman correlation (used alone or in combination) were the most balanced CRE detection and correlation methods, respectively, with regard to their power to accurately predict regulatory-target interactions.
Conclusion These findings provide an approach and guidance for researchers interested in predicting transcriptional regulatory mechanisms using microarray data that they generate (or microarray data that is publicly available) combined with CRE detection in promoter sequence data.
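To make the compared quantities concrete, the sketch below implements Pearson correlation, Spearman rank correlation (Pearson applied to average ranks), and the F-measure as the harmonic mean of precision and sensitivity. It is a plain-Python illustration under assumed inputs, not the authors' pipeline; the expression values are hypothetical.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def ranks(values):
    """1-based ranks, with tied values sharing the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average_rank
        i = j + 1
    return result


def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the rank vectors."""
    return pearson(ranks(x), ranks(y))


def f_measure(tp, fp, fn):
    """Harmonic mean of precision (positive predictive value) and sensitivity."""
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical expression profiles: the target rises monotonically but not linearly with the TF.
tf_expr = [1, 2, 3, 4, 5]
target_expr = [1, 4, 9, 16, 25]
print(pearson(tf_expr, target_expr), spearman(tf_expr, target_expr))
```

Because Spearman depends only on rank order, it scores a monotonic but non-linear TF→target relationship as perfect, whereas Pearson does not.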
Affiliation(s)
- Damion Nero
- Department of Biology, New York University, Center for Genomics and Systems Biology, New York, NY 10003, USA.
|
31
|
Unravelling cis-regulatory elements in the genome of the smallest photosynthetic eukaryote: phylogenetic footprinting in Ostreococcus. J Mol Evol 2009; 69:249-59. [PMID: 19693423 DOI: 10.1007/s00239-009-9271-0] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2009] [Revised: 07/17/2009] [Accepted: 07/27/2009] [Indexed: 10/20/2022]
Abstract
We used a phylogenetic footprinting approach, adapted to high levels of divergence, to estimate the level of constraint in intergenic regions of the extremely gene dense Ostreococcus algae genomes (Chlorophyta, Prasinophyceae). We first benchmarked our method against the Saccharomyces sensu stricto genome data and found that the proportion of conserved non-coding sites was consistent with those obtained with methods using calibration by the neutral substitution rate. We then applied our method to the complete genomes of Ostreococcus tauri and O. lucimarinus, which are the most divergent species from the same genus sequenced so far. We found that 77% of intergenic regions in Ostreococcus still contain some phylogenetic footprints, as compared to 88% for Saccharomyces, corresponding to an average rate of constraint on intergenic region of 17% and 30%, respectively. A comparison with some known functional cis-regulatory elements enabled us to investigate whether some transcriptional regulatory pathways were conserved throughout the green lineage. Strikingly, the size of the phylogenetic footprints depends on gene orientation of neighboring genes, and appears to be genus-specific. In Ostreococcus, 5' intergenic regions contain four times more conserved sites than 3' intergenic regions, whereas in yeast a higher frequency of constrained sites in intergenic regions between genes on the same DNA strand suggests a higher frequency of bidirectional regulatory elements. The phylogenetic footprinting approach can be used despite high levels of divergence in the ultrasmall Ostreococcus algae, to decipher structure of constrained regulatory motifs, and identify putative regulatory pathways conserved within the green lineage.
|
32
|
Wang X, Haberer G, Mayer KFX. Discovery of cis-elements between sorghum and rice using co-expression and evolutionary conservation. BMC Genomics 2009; 10:284. [PMID: 19558665 PMCID: PMC2714861 DOI: 10.1186/1471-2164-10-284] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2008] [Accepted: 06/26/2009] [Indexed: 01/29/2023] Open
Abstract
BACKGROUND The spatiotemporal regulation of gene expression largely depends on the presence and absence of cis-regulatory sites in the promoter. In the economically highly important grass family, our knowledge of transcription factor binding sites and transcriptional networks is still very limited. With the completion of the sorghum genome and the available rice genome sequence, comparative promoter analyses now allow genome-scale detection of conserved cis-elements. RESULTS In this study, we identified thousands of phylogenetic footprints conserved between orthologous rice and sorghum upstream regions that are supported by co-expression information derived from three different rice expression data sets. In a complementary approach, cis-motifs were discovered by their highly conserved co-occurrence in syntenic promoter pairs. Sequence conservation and matches to known plant motifs support our findings. Expression similarities of gene pairs positively correlate with the number of motifs that are shared by gene pairs and corroborate the importance of similar promoter architectures for concerted regulation. This strongly suggests that these motifs function in the regulation of transcript levels in rice and, presumably also in sorghum. CONCLUSION Our work provides the first large-scale collection of cis-elements for rice and sorghum and can serve as a paradigm for cis-element analysis through comparative genomics in grasses in general.
Affiliation(s)
- Xi Wang
- MIPS/IBIS Institute of Bioinformatics and System Biology, Helmholtz Center Munich, Neuherberg, Germany.
|
33
|
Vandepoele K, Quimbaya M, Casneuf T, De Veylder L, Van de Peer Y. Unraveling transcriptional control in Arabidopsis using cis-regulatory elements and coexpression networks. PLANT PHYSIOLOGY 2009; 150:535-46. [PMID: 19357200 PMCID: PMC2689962 DOI: 10.1104/pp.109.136028] [Citation(s) in RCA: 160] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/22/2009] [Accepted: 04/02/2009] [Indexed: 05/17/2023]
Abstract
Analysis of gene expression data generated by high-throughput microarray transcript profiling experiments has demonstrated that genes with an overall similar expression pattern are often enriched for similar functions. This guilt-by-association principle can be applied to define modular gene programs, identify cis-regulatory elements, or predict gene functions for unknown genes based on their coexpression neighborhood. We evaluated the potential to use Gene Ontology (GO) enrichment of a gene's coexpression neighborhood as a tool to predict its function but found overall low sensitivity scores (13%-34%). This indicates that for many functional categories, coexpression alone performs poorly to infer known biological gene functions. However, integration of cis-regulatory elements shows that 46% of the gene coexpression neighborhoods are enriched for one or more motifs, providing a valuable complementary source to functionally annotate genes. Through the integration of coexpression data, GO annotations, and a set of known cis-regulatory elements combined with a novel set of evolutionarily conserved plant motifs, we could link many genes and motifs to specific biological functions. Application of our coexpression framework extended with cis-regulatory element analysis on transcriptome data from the cell cycle-related transcription factor OBP1 yielded several coexpressed modules associated with specific cis-regulatory elements. Moreover, our analysis strongly suggests a feed-forward regulatory interaction between OBP1 and the E2F pathway. The ATCOECIS resource (http://bioinformatics.psb.ugent.be/ATCOECIS/) makes it possible to query coexpression data and GO and cis-regulatory element annotations and to submit user-defined gene sets for motif analysis, providing an access point to unravel the regulatory code underlying transcriptional control in Arabidopsis (Arabidopsis thaliana).
Affiliation(s)
- Klaas Vandepoele
- Department of Plant Systems Biology, Flanders Institute for Biotechnology, B-9052 Ghent, Belgium
|
34
|
Freeling M, Subramaniam S. Conserved noncoding sequences (CNSs) in higher plants. CURRENT OPINION IN PLANT BIOLOGY 2009; 12:126-32. [PMID: 19249238 DOI: 10.1016/j.pbi.2009.01.005] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/17/2008] [Revised: 01/22/2009] [Accepted: 01/22/2009] [Indexed: 05/09/2023]
Abstract
Plant conserved noncoding sequences (CNSs)--a specific category of phylogenetic footprint--have been shown experimentally to function. No plant CNS is conserved to the extent that ultraconserved noncoding sequences are conserved in vertebrates. Plant CNSs are enriched in known transcription factor or other cis-acting binding sites, and are usually clustered around genes. Genes that encode transcription factors and/or those that respond to stimuli are particularly CNS-rich. Only rarely could this function involve small RNA binding. Some transcribed CNSs encode short translation products as a form of negative control. Approximately 4% of Arabidopsis gene content is estimated to be both CNS-rich and occupies a relatively long stretch of chromosome: Bigfoot genes (long phylogenetic footprints). We discuss a 'DNA-templated protein assembly' idea that might help explain Bigfoot gene CNSs.
Affiliation(s)
- Michael Freeling
- Department of Plant and Microbial Biology, University of California, Berkeley, CA 94720, USA.
|
35
|
Hervé C, Dabos P, Bardet C, Jauneau A, Auriac MC, Ramboer A, Lacout F, Tremousaygue D. In vivo interference with AtTCP20 function induces severe plant growth alterations and deregulates the expression of many genes important for development. PLANT PHYSIOLOGY 2009; 149:1462-77. [PMID: 19091878 PMCID: PMC2649380 DOI: 10.1104/pp.108.126136] [Citation(s) in RCA: 87] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/30/2008] [Accepted: 12/04/2008] [Indexed: 05/19/2023]
Abstract
AtTCP20 is a transcription factor belonging to the Arabidopsis (Arabidopsis thaliana) TCP-P subfamily, characterized by its capacity to bind to site II motifs (TGGGCY). Our aim was to understand the role of AtTCP20 in plant development. The expression pattern of a translational fusion of Prom(TCP20):CDS20GUSGFP suggested a function for AtTCP20 in several plant organs and stages of development. The role of AtTCP20 was challenged in planta by inducing expression of AtTCP20 proteins fused with either a transcriptional activator domain (VP16) or a repressor domain (EAR). Expression of both modified proteins led to severe developmental phenotypes. In-depth analysis suggested that AtTCP20 may participate in the regulation of cell expansion, cell division, and cell differentiation. Gene expression profiling in roots and hypocotyls revealed that 252 genes were down-regulated in both organs after induction of the AtTCP20EAR repressor gene. Site II motifs (TGGGCY) were underrepresented in their promoters. Conversely, GG(A/T)CCC sequences related to binding sites identified for TCP proteins in rice (Oryza sativa) were overrepresented, and a TCP20 fusion protein was shown to bind to these sequences in vitro. Gene ontology indicated that many targeted genes were involved in cell wall biogenesis and modification during expansion and also encoded numerous transcription factors controlling plant development. Our results are consistent with the previous proposal that AtTCP20 is involved in cell division and growth coordination. Moreover, they further suggest that AtTCP20 also contributes to cell expansion control and indicate a different involvement of this protein in plant morphogenesis depending on the organ and the developmental stage.
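The two motifs named in this abstract use degenerate positions: site II is TGGGCY (Y = C or T), and GG(A/T)CCC can be written GGWCCC in IUPAC notation (W = A or T). A minimal sketch of counting such sites in a promoter with a regular expression follows; the helper names and test sequences are hypothetical, and only the given strand is scanned.

```python
import re

# Subset of the IUPAC nucleotide codes, mapped to regex character classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "W": "[AT]", "S": "[CG]",
    "K": "[GT]", "M": "[AC]", "N": "[ACGT]",
}


def iupac_to_regex(motif: str) -> str:
    """Translate an IUPAC motif such as 'TGGGCY' into a regex string."""
    return "".join(IUPAC[base] for base in motif.upper())


def count_sites(promoter: str, motif: str) -> int:
    """Count (possibly overlapping) motif matches on the given strand."""
    # A zero-width lookahead lets overlapping occurrences each be counted once.
    lookahead = re.compile(f"(?=({iupac_to_regex(motif)}))")
    return len(lookahead.findall(promoter.upper()))


print(count_sites("AATGGGCTAATGGGCCAA", "TGGGCY"))
print(count_sites("GGACCCGGTCCC", "GGWCCC"))
```

Comparing such per-promoter counts between the down-regulated gene set and a background set is the kind of tally that underlies statements about under- or overrepresentation.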
Affiliation(s)
- Christine Hervé
- Laboratoire des Interactions Plantes Microorganismes, INRA 441, 31326 Castanet Tolosan cedex, France
|
36
|
Abstract
Despite recent elucidation of the three-dimensional structure of major photosynthetic complexes, our understanding of light energy conversion in plant chloroplasts and microalgae under physiological conditions requires exploring the dynamics of photosynthesis. The photosynthetic apparatus is a flexible molecular machine that can acclimate to metabolic and light fluctuations in a matter of seconds and minutes. On a longer time scale, changes in environmental cues trigger acclimation responses that elicit intracellular signaling between the nucleo-cytosol and chloroplast resulting in modification of the biogenesis of the photosynthetic machinery. Here we attempt to integrate well-established knowledge on the functional flexibility of light-harvesting and electron transfer processes, which has greatly benefited from genetic approaches, with data derived from the wealth of recent transcriptomic and proteomic studies of acclimation responses in photosynthetic eukaryotes.
Affiliation(s)
- Stephan Eberhard
- Université Pierre et Marie Curie, Institut de Biologie Physico-Chimique, F-75005 Paris, France
37
Alvim Kamei CL, Boruc J, Vandepoele K, Van den Daele H, Maes S, Russinova E, Inzé D, De Veylder L. The PRA1 gene family in Arabidopsis. PLANT PHYSIOLOGY 2008; 147:1735-49. [PMID: 18583532 PMCID: PMC2492607 DOI: 10.1104/pp.108.122226] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.5] [Received: 04/29/2008] [Accepted: 06/25/2008] [Indexed: 05/18/2023]
Abstract
Prenylated Rab acceptor 1 (PRA1) domain proteins are small transmembrane proteins that regulate vesicle trafficking as receptors of Rab GTPases and the vacuolar soluble N-ethylmaleimide-sensitive factor attachment receptor protein VAMP2. However, little is known about PRA1 family members in plants. Sequence analysis revealed that higher plants, compared with animals and primitive plants, possess an expanded family of PRA1 domain-containing proteins. The Arabidopsis (Arabidopsis thaliana) PRA1 (AtPRA1) proteins were found to homodimerize and heterodimerize in a manner corresponding to their phylogenetic distribution. Different AtPRA1 family members displayed distinct expression patterns, with a preference for vascular cells and expanding or developing tissues. AtPRA1 genes were significantly coexpressed with Rab GTPases and genes encoding vesicle transport proteins, suggesting an involvement in the vesicle trafficking process similar to that of their animal counterparts. Correspondingly, AtPRA1 proteins were localized in the endoplasmic reticulum, Golgi apparatus, and endosomes/prevacuolar compartments, hinting at a function in both secretory and endocytic intracellular trafficking pathways. Taken together, our data reveal a high functional diversity of AtPRA1 proteins, probably dealing with the various demands of the complex trafficking system.
Affiliation(s)
- Claire Lessa Alvim Kamei
- Instituto de Bioquímica Médica, Universidade Federal do Rio de Janeiro, Rio de Janeiro, RJ 21941-590, Brazil
38
Krishnan A, Pereira A. Integrative approaches for mining transcriptional regulatory programs in Arabidopsis. BRIEFINGS IN FUNCTIONAL GENOMICS AND PROTEOMICS 2008; 7:264-74. [DOI: 10.1093/bfgp/eln035] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Indexed: 01/04/2023]
39
Horan K, Jang C, Bailey-Serres J, Mittler R, Shelton C, Harper JF, Zhu JK, Cushman JC, Gollery M, Girke T. Annotating genes of known and unknown function by large-scale coexpression analysis. PLANT PHYSIOLOGY 2008; 147:41-57. [PMID: 18354039 PMCID: PMC2330292 DOI: 10.1104/pp.108.117366] [Citation(s) in RCA: 88] [Impact Index Per Article: 5.5] [Received: 02/03/2008] [Accepted: 03/10/2008] [Indexed: 05/17/2023]
Abstract
About 40% of the proteins encoded in eukaryotic genomes are proteins of unknown function (PUFs). Their functional characterization remains one of the main challenges in modern biology. In this study we identified the PUF encoding genes from Arabidopsis (Arabidopsis thaliana) using a combination of sequence similarity, domain-based, and empirical approaches. Large-scale gene expression analyses of 1,310 publicly available Affymetrix chips were performed to associate the identified PUF genes with regulatory networks and biological processes of known function. To generate quality results, the study was restricted to expression sets with replicated samples. First, genome-wide clustering and gene function enrichment analysis of clusters allowed us to associate 1,541 PUF genes with tightly coexpressed genes for proteins of known function (PKFs). Over 70% of them could be assigned to more specific biological process annotations than the ones available in the current Gene Ontology release. The most highly overrepresented functional categories in the obtained clusters were ribosome assembly, photosynthesis, and cell wall pathways. Interestingly, the majority of the PUF genes appeared to be controlled by the same regulatory networks as most PKF genes, because clusters enriched in PUF genes were extremely rare. Second, large-scale analysis of differentially expressed genes was applied to identify a comprehensive set of abiotic stress-response genes. This analysis resulted in the identification of 269 PKF and 104 PUF genes that responded to a wide variety of abiotic stresses, whereas 608 PKF and 206 PUF genes responded predominantly to specific stress treatments. The provided coexpression and differentially expressed gene data represent an important resource for guiding future functional characterization experiments of PUF and PKF genes. 
Finally, the public Plant Gene Expression Database (http://bioweb.ucr.edu/PED) was developed as part of this project to provide efficient access and mining tools for the vast gene expression data of this study.
Affiliation(s)
- Kevin Horan
- Department of Botany and Plant Sciences, University of California, Riverside, CA 92521, USA
40
Lee HG, Lee HS, Jeon SH, Chung TH, Lim YS, Huh WK. High-resolution analysis of condition-specific regulatory modules in Saccharomyces cerevisiae. Genome Biol 2008; 9:R2. [PMID: 18171483 PMCID: PMC2395236 DOI: 10.1186/gb-2008-9-1-r2] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Received: 07/18/2007] [Revised: 10/15/2007] [Accepted: 01/03/2008] [Indexed: 01/11/2023]
Abstract
A novel approach for identifying condition-specific regulatory modules in yeast reveals functionally distinct coregulated submodules. We present an approach for identifying condition-specific regulatory modules by using separate units of gene expression profiles along with ChIP-chip and motif data from Saccharomyces cerevisiae. By investigating the unique and common features of the obtained condition-specific modules, we detected several important properties of transcriptional network reorganization. Our approach reveals the functionally distinct coregulated submodules embedded in a coexpressed gene module and provides an effective method for identifying various condition-specific regulatory events at high resolution.
Affiliation(s)
- Hun-Goo Lee
- School of Biological Sciences and Research Center for Functional Cellulomics, Institute of Microbiology, Seoul National University, Seoul 151-747, Republic of Korea
41
Freeling M, Rapaka L, Lyons E, Pedersen B, Thomas BC. G-boxes, bigfoot genes, and environmental response: characterization of intragenomic conserved noncoding sequences in Arabidopsis. THE PLANT CELL 2007; 19:1441-57. [PMID: 17496117 PMCID: PMC1913728 DOI: 10.1105/tpc.107.050419] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.6] [Received: 01/18/2007] [Revised: 03/10/2007] [Accepted: 04/19/2007] [Indexed: 05/15/2023]
Abstract
A tetraploidy left Arabidopsis thaliana with 6358 pairs of homoeologs that, when aligned, generated 14,944 intragenomic conserved noncoding sequences (CNSs). Our previous work assembled these phylogenetic footprints into a database. We show that known transcription factor (TF) binding motifs, including the G-box, are overrepresented in these CNSs. A total of 254 genes spanning long lengths of CNS-rich chromosomes (Bigfoot) dominate this database. Therefore, we made subdatabases: one containing Bigfoot genes and the other containing genes with three to five CNSs (Smallfoot). Bigfoot genes are generally TFs that respond to signals, with their modal CNS positioned 3.1 kb 5' from the ATG. Smallfoot genes encode components of signal transduction machinery, the cytoskeleton, or involve transcription. We queried each subdatabase with each possible 7-nucleotide sequence. Among hundreds of hits, most were purified from CNSs, and almost all of those significantly enriched in CNSs had no experimental history. The 7-mers in CNSs are not 5'- to 3'-oriented in Bigfoot genes but are often oriented in Smallfoot genes. CNSs with one G-box tend to have two G-boxes. CNSs were shared with the homoeolog only and with no other gene, suggesting that binding site turnover impedes detection. Bigfoot genes may function in adaptation to environmental change.
Affiliation(s)
- Michael Freeling
- Department of Plant and Microbial Biology, University of California, Berkeley, California 94720, USA.
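The abstract above mentions querying the CNS subdatabases with every possible 7-nucleotide sequence (4^7 = 16,384 words). The sketch below illustrates one way such an exhaustive k-mer scan could be set up in Python. It is only an illustration under stated assumptions: the abstract does not describe the scoring statistic, so the pseudocount-smoothed frequency ratio used here, along with all function names and inputs, is hypothetical.

```python
from collections import Counter
from itertools import product

K = 7  # word length used in the abstract


def all_kmers(k=K):
    """Yield every possible DNA word of length k (4**k words)."""
    return ("".join(p) for p in product("ACGT", repeat=k))


def count_kmers(seqs, k=K):
    """Count overlapping k-mers, skipping windows with ambiguous bases."""
    counts = Counter()
    for seq in seqs:
        seq = seq.upper()
        for i in range(len(seq) - k + 1):
            kmer = seq[i:i + k]
            if set(kmer) <= set("ACGT"):
                counts[kmer] += 1
    return counts


def enrichment(cns_seqs, background_seqs, k=K, pseudocount=1.0):
    """Ratio of smoothed k-mer frequencies: CNS set versus background.

    Assumed scoring scheme for illustration; values above 1 mean the
    word is relatively more frequent in the CNS set.
    """
    cns = count_kmers(cns_seqs, k)
    bg = count_kmers(background_seqs, k)
    cns_total = sum(cns.values()) + pseudocount * 4 ** k
    bg_total = sum(bg.values()) + pseudocount * 4 ** k
    return {
        kmer: ((cns[kmer] + pseudocount) / cns_total)
        / ((bg[kmer] + pseudocount) / bg_total)
        for kmer in all_kmers(k)
    }
```

Sorting the returned dictionary by value would give a ranked list of candidate words; a real analysis would also need a significance test and a suitable background model.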