1
|
Bruno L, Ronchini M, Binelli G, Muto A, Chiappetta A, Bitonti MB, Gerola P. A Study of GUS Expression in Arabidopsis as a Tool for the Evaluation of Gene Evolution, Function and the Role of Expression Derived from Gene Duplication. PLANTS (BASEL, SWITZERLAND) 2023; 12:2051. [PMID: 37653968 PMCID: PMC10221982 DOI: 10.3390/plants12102051] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/09/2023] [Revised: 05/17/2023] [Accepted: 05/18/2023] [Indexed: 09/02/2023]
Abstract
Gene duplication played a fundamental role in eukaryote evolution and different copies of a given gene can be present in extant species, often with expressions and functions differentiated during evolution. We assume that, when such differentiation occurs in a gene copy, this may be indicated by its maintenance in all the derived species. To verify this hypothesis, we compared the histological expression domains of the three β-glucuronidase genes (AtGUS) present in Arabidopsis thaliana with the GUS evolutionary tree in angiosperms. We found that AtGUS gene expression overlaps in the shoot apex, the floral bud and the root hairs. In the root apex, AtGUS3 expression differs completely from AtGUS1 and AtGUS2, whose transcripts are present in the root cap meristem and columella, in the staminal cell niche, in the epidermis and in the proximal cortex. Conversely, AtGUS3 transcripts are limited to the old border-like cells of calyptra and those found along the protodermal cell line. The GUS evolutionary tree reveals that the two main clusters (named GUS1 and GUS3) originate from a duplication event predating angiosperm radiation. AtGUS3 belongs to the GUS3 cluster, while AtGUS1 and AtGUS2, which originate from a duplication event that occurred in an ancestor of the Brassicaceae family, are found together in the GUS1 cluster. There is another, previously undescribed cluster, called GUS4, originating from a very ancient duplication event. While the copy of GUS4 has been lost in many species, copies of GUS3 and GUS1 have been conserved in all species examined.
Collapse
Affiliation(s)
- Leonardo Bruno
- Dipartimento di Biologia, Ecologia e Scienze della Terra, Università della Calabria, Arcavacata di Rende, 87036 Cosenza, Italy; (A.M.); (A.C.); (M.B.B.)
| | - Matteo Ronchini
- Dipartimento di Scienze Teoriche e Applicate, Università degli Studi dell’Insubria, 21100 Varese, Italy; (M.R.); (P.G.)
| | - Giorgio Binelli
- Dipartimento di Biotecnologie e Scienze della Vita, Università degli Studi dell’Insubria, 21100 Varese, Italy;
| | - Antonella Muto
- Dipartimento di Biologia, Ecologia e Scienze della Terra, Università della Calabria, Arcavacata di Rende, 87036 Cosenza, Italy; (A.M.); (A.C.); (M.B.B.)
| | - Adriana Chiappetta
- Dipartimento di Biologia, Ecologia e Scienze della Terra, Università della Calabria, Arcavacata di Rende, 87036 Cosenza, Italy; (A.M.); (A.C.); (M.B.B.)
| | - Maria Beatrice Bitonti
- Dipartimento di Biologia, Ecologia e Scienze della Terra, Università della Calabria, Arcavacata di Rende, 87036 Cosenza, Italy; (A.M.); (A.C.); (M.B.B.)
| | - Paolo Gerola
- Dipartimento di Scienze Teoriche e Applicate, Università degli Studi dell’Insubria, 21100 Varese, Italy; (M.R.); (P.G.)
| |
Collapse
|
2
|
Oh DH, Kowalski KP, Quach QN, Wijesinghege C, Tanford P, Dassanayake M, Clay K. Novel genome characteristics contribute to the invasiveness of Phragmites australis (common reed). Mol Ecol 2021; 31:1142-1159. [PMID: 34839548 PMCID: PMC9300010 DOI: 10.1111/mec.16293] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2021] [Revised: 10/12/2021] [Accepted: 11/15/2021] [Indexed: 11/06/2022]
Abstract
The rapid invasion of the non‐native Phragmites australis (Poaceae, subfamily Arundinoideae) is a major threat to native wetland ecosystems in North America and elsewhere. We describe the first reference genome for P. australis and compare invasive (ssp. australis) and native (ssp. americanus) genotypes collected from replicated populations across the Laurentian Great Lakes to deduce genomic bases driving its invasive success. Here, we report novel genomic features including a Phragmites lineage‐specific whole genome duplication, followed by gene loss and preferential retention of genes associated with transcription factors and regulatory functions in the remaining duplicates. Comparative transcriptomic analyses revealed that genes associated with biotic stress and defence responses were expressed at a higher basal level in invasive genotypes, but native genotypes showed a stronger induction of defence responses when challenged by a fungal endophyte. The reference genome and transcriptomes, combined with previous ecological and environmental data, add to our understanding of mechanisms leading to invasiveness and support the development of novel, genomics‐assisted management approaches for invasive Phragmites.
Collapse
Affiliation(s)
- Dong-Ha Oh
- Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana, USA
| | - Kurt P Kowalski
- U.S. Geological Survey, Great Lakes Science Center, Ann Arbor, Michigan, USA
| | - Quynh N Quach
- Department of Ecology & Evolutionary Biology, Tulane University, New Orleans, Louisiana, USA
| | - Chathura Wijesinghege
- Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana, USA
| | - Philippa Tanford
- Department of Ecology & Evolutionary Biology, Tulane University, New Orleans, Louisiana, USA.,Department of Biology, Washington University in St. Louis, St. Louis, Missouri, USA
| | - Maheshi Dassanayake
- Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana, USA
| | - Keith Clay
- Department of Ecology & Evolutionary Biology, Tulane University, New Orleans, Louisiana, USA.,Department of Biology, Indiana University, Bloomington, Indiana, USA
| |
Collapse
|
3
|
Liu Z, Suarez Duran HG, Harnvanichvech Y, Stephenson MJ, Schranz ME, Nelson D, Medema MH, Osbourn A. Drivers of metabolic diversification: how dynamic genomic neighbourhoods generate new biosynthetic pathways in the Brassicaceae. THE NEW PHYTOLOGIST 2020; 227:1109-1123. [PMID: 31769874 PMCID: PMC7383575 DOI: 10.1111/nph.16338] [Citation(s) in RCA: 38] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/16/2019] [Accepted: 11/17/2019] [Indexed: 05/11/2023]
Abstract
Plants produce an array of specialized metabolites with important ecological functions. The mechanisms underpinning the evolution of new biosynthetic pathways are not well-understood. Here, we exploit available genome sequence resources to investigate triterpene biosynthesis across the Brassicaceae. Oxidosqualene cyclases (OSCs) catalyze the first committed step in triterpene biosynthesis. Systematic analysis of 13 sequenced Brassicaceae genomes was performed to identify all OSC genes. The genome neighbourhoods (GNs) around a total of 163 OSC genes were investigated to identify Pfam domains significantly enriched in these regions. All-vs-all comparisons of OSC neighbourhoods and phylogenomic analysis were used to investigate the sequence similarity and evolutionary relationships of the numerous candidate triterpene biosynthetic gene clusters (BGCs) observed. Functional analysis of three representative BGCs was carried out and their triterpene pathway products were elucidated. Our results indicate that plant genomes are remarkably plastic, and that dynamic GNs generate new biosynthetic pathways in different Brassicaceae lineages by shuffling the genes encoding a core palette of triterpene-diversifying enzymes, presumably in response to strong environmental selection pressure. These results illuminate a genomic basis for diversification of plant-specialized metabolism through natural combinatorics of enzyme families, which can be mimicked using synthetic biology to engineer diverse bioactive molecules.
Collapse
Affiliation(s)
- Zhenhua Liu
- Department of Metabolic BiologyJohn Innes CentreNorwich Research Park, Colney LaneNorwichNR4 7UHUK
| | | | - Yosapol Harnvanichvech
- Bioinformatics GroupWageningen UniversityDroevendaalsesteeg 16708PBWageningenthe Netherlands
| | - Michael J. Stephenson
- Department of Metabolic BiologyJohn Innes CentreNorwich Research Park, Colney LaneNorwichNR4 7UHUK
| | - M. Eric Schranz
- Biosystematics GroupWageningen UniversityDroevendaalsesteeg 16708PBWageningenthe Netherlands
| | - David Nelson
- Department of Microbiology, Immunology and BiochemistryUniversity of Tennessee858 Madison Avenue, Suite G01MemphisTN38163USA
| | - Marnix H. Medema
- Bioinformatics GroupWageningen UniversityDroevendaalsesteeg 16708PBWageningenthe Netherlands
| | - Anne Osbourn
- Department of Metabolic BiologyJohn Innes CentreNorwich Research Park, Colney LaneNorwichNR4 7UHUK
| |
Collapse
|
4
|
Joly-Lopez Z, Platts AE, Gulko B, Choi JY, Groen SC, Zhong X, Siepel A, Purugganan MD. An inferred fitness consequence map of the rice genome. NATURE PLANTS 2020; 6:119-130. [PMID: 32042156 PMCID: PMC7446671 DOI: 10.1038/s41477-019-0589-3] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/18/2019] [Accepted: 12/20/2019] [Indexed: 05/04/2023]
Abstract
The extent to which sequence variation impacts plant fitness is poorly understood. High-resolution maps detailing the constraint acting on the genome, especially in regulatory sites, would be beneficial as functional annotation of noncoding sequences remains sparse. Here, we present a fitness consequence (fitCons) map for rice (Oryza sativa). We inferred fitCons scores (ρ) for 246 inferred genome classes derived from nine functional genomic and epigenomic datasets, including chromatin accessibility, messenger RNA/small RNA transcription, DNA methylation, histone modifications and engaged RNA polymerase activity. These were integrated with genome-wide polymorphism and divergence data from 1,477 rice accessions and 11 reference genome sequences in the Oryzeae. We found ρ to be multimodal, with ~9% of the rice genome falling into classes where more than half of the bases would probably have a fitness consequence if mutated. Around 2% of the rice genome showed evidence of weak negative selection, frequently at candidate regulatory sites, including a novel set of 1,000 potentially active enhancer elements. This fitCons map provides perspective on the evolutionary forces associated with genome diversity, aids in genome annotation and can guide crop breeding programs.
Collapse
Affiliation(s)
- Zoé Joly-Lopez
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York, NY, USA
| | - Adrian E Platts
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York, NY, USA
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | - Brad Gulko
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | - Jae Young Choi
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York, NY, USA
| | - Simon C Groen
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York, NY, USA
| | - Xuehua Zhong
- Laboratory of Genetics and Wisconsin Institute for Discovery, University of Wisconsin-Madison, Madison, WI, USA
| | - Adam Siepel
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | - Michael D Purugganan
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York, NY, USA.
- Center for Genomics and Systems Biology, NYU Abu Dhabi Research Institute, NYU Abu Dhabi, Abu Dhabi, United Arab Emirates.
| |
Collapse
|
5
|
Zhang Y, Ngu DW, Carvalho D, Liang Z, Qiu Y, Roston RL, Schnable JC. Differentially Regulated Orthologs in Sorghum and the Subgenomes of Maize. THE PLANT CELL 2017; 29:1938-1951. [PMID: 28733421 PMCID: PMC5590507 DOI: 10.1105/tpc.17.00354] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/05/2017] [Revised: 07/05/2017] [Accepted: 07/18/2017] [Indexed: 05/19/2023]
Abstract
Identifying interspecies changes in gene regulation, one of the two primary sources of phenotypic variation, is challenging on a genome-wide scale. The use of paired time-course data on cold-responsive gene expression in maize (Zea mays) and sorghum (Sorghum bicolor) allowed us to identify differentially regulated orthologs. While the majority of cold-responsive transcriptional regulation of conserved gene pairs is species specific, the initial transcriptional responses to cold appear to be more conserved than later responses. In maize, the promoters of genes with conserved transcriptional responses to cold tend to contain more micrococcal nuclease hypersensitive sites in their promoters, a proxy for open chromatin. Genes with conserved patterns of transcriptional regulation between the two species show lower ratios of nonsynonymous to synonymous substitutions. Genes involved in lipid metabolism, known to be involved in cold acclimation, tended to show consistent regulation in both species. Genes with species-specific cold responses did not cluster in particular pathways nor were they enriched in particular functional categories. We propose that cold-responsive transcriptional regulation in individual species may not be a reliable marker for function, while a core set of genes involved in perceiving and responding to cold stress are subject to functionally constrained cold-responsive regulation across the grass tribe Andropogoneae.
Collapse
Affiliation(s)
- Yang Zhang
- Center for Plant Science Innovation, University of Nebraska-Lincoln, Lincoln, Nebraska 68588
- Department of Agronomy and Horticulture, University of Nebraska-Lincoln, Lincoln, Nebraska 68588
| | - Daniel W Ngu
- Center for Plant Science Innovation, University of Nebraska-Lincoln, Lincoln, Nebraska 68588
- Department of Agronomy and Horticulture, University of Nebraska-Lincoln, Lincoln, Nebraska 68588
| | - Daniel Carvalho
- Center for Plant Science Innovation, University of Nebraska-Lincoln, Lincoln, Nebraska 68588
- Department of Agronomy and Horticulture, University of Nebraska-Lincoln, Lincoln, Nebraska 68588
| | - Zhikai Liang
- Center for Plant Science Innovation, University of Nebraska-Lincoln, Lincoln, Nebraska 68588
- Department of Agronomy and Horticulture, University of Nebraska-Lincoln, Lincoln, Nebraska 68588
| | - Yumou Qiu
- Department of Statistics, University of Nebraska-Lincoln, Lincoln, Nebraska 68588
| | - Rebecca L Roston
- Center for Plant Science Innovation, University of Nebraska-Lincoln, Lincoln, Nebraska 68588
- Department of Biochemistry, University of Nebraska-Lincoln, Lincoln, Nebraska 68588
| | - James C Schnable
- Center for Plant Science Innovation, University of Nebraska-Lincoln, Lincoln, Nebraska 68588
- Department of Agronomy and Horticulture, University of Nebraska-Lincoln, Lincoln, Nebraska 68588
| |
Collapse
|
6
|
Van de Velde J, Van Bel M, Vaneechoutte D, Vandepoele K. A Collection of Conserved Noncoding Sequences to Study Gene Regulation in Flowering Plants. PLANT PHYSIOLOGY 2016; 171:2586-98. [PMID: 27261064 PMCID: PMC4972296 DOI: 10.1104/pp.16.00821] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/18/2016] [Accepted: 05/31/2016] [Indexed: 05/03/2023]
Abstract
Transcription factors (TFs) regulate gene expression by binding cis-regulatory elements, of which the identification remains an ongoing challenge owing to the prevalence of large numbers of nonfunctional TF binding sites. Powerful comparative genomics methods, such as phylogenetic footprinting, can be used for the detection of conserved noncoding sequences (CNSs), which are functionally constrained and can greatly help in reducing the number of false-positive elements. In this study, we applied a phylogenetic footprinting approach for the identification of CNSs in 10 dicot plants, yielding 1,032,291 CNSs associated with 243,187 genes. To annotate CNSs with TF binding sites, we made use of binding site information for 642 TFs originating from 35 TF families in Arabidopsis (Arabidopsis thaliana). In three species, the identified CNSs were evaluated using TF chromatin immunoprecipitation sequencing data, resulting in significant overlap for the majority of data sets. To identify ultraconserved CNSs, we included genomes of additional plant families and identified 715 binding sites for 501 genes conserved in dicots, monocots, mosses, and green algae. Additionally, we found that genes that are part of conserved mini-regulons have a higher coherence in their expression profile than other divergent gene pairs. All identified CNSs were integrated in the PLAZA 3.0 Dicots comparative genomics platform (http://bioinformatics.psb.ugent.be/plaza/versions/plaza_v3_dicots/) together with new functionalities facilitating the exploration of conserved cis-regulatory elements and their associated genes. The availability of this data set in a user-friendly platform enables the exploration of functional noncoding DNA to study gene regulation in a variety of plant species, including crops.
Collapse
Affiliation(s)
- Jan Van de Velde
- Department of Plant Systems Biology, Vlaams Instituut voor Biotechnologie, B-9052 Ghent, Belgium (J.V.d.V., M.V.B., D.V., K.V.); andDepartment of Plant Biotechnology and Bioinformatics, Ghent University, B-9052 Ghent, Belgium (J.V.d.V., M.V.B., D.V., K.V.)
| | - Michiel Van Bel
- Department of Plant Systems Biology, Vlaams Instituut voor Biotechnologie, B-9052 Ghent, Belgium (J.V.d.V., M.V.B., D.V., K.V.); andDepartment of Plant Biotechnology and Bioinformatics, Ghent University, B-9052 Ghent, Belgium (J.V.d.V., M.V.B., D.V., K.V.)
| | - Dries Vaneechoutte
- Department of Plant Systems Biology, Vlaams Instituut voor Biotechnologie, B-9052 Ghent, Belgium (J.V.d.V., M.V.B., D.V., K.V.); andDepartment of Plant Biotechnology and Bioinformatics, Ghent University, B-9052 Ghent, Belgium (J.V.d.V., M.V.B., D.V., K.V.)
| | - Klaas Vandepoele
- Department of Plant Systems Biology, Vlaams Instituut voor Biotechnologie, B-9052 Ghent, Belgium (J.V.d.V., M.V.B., D.V., K.V.); andDepartment of Plant Biotechnology and Bioinformatics, Ghent University, B-9052 Ghent, Belgium (J.V.d.V., M.V.B., D.V., K.V.)
| |
Collapse
|
7
|
Hoffmann RD, Palmgren M. Purifying selection acts on coding and non-coding sequences of paralogous genes in Arabidopsis thaliana. BMC Genomics 2016; 17:456. [PMID: 27296049 PMCID: PMC4906602 DOI: 10.1186/s12864-016-2803-2] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2015] [Accepted: 05/27/2016] [Indexed: 01/13/2023] Open
Abstract
Background Whole-genome duplications in the ancestors of many diverse species provided the genetic material for evolutionary novelty. Several models explain the retention of paralogous genes. However, how these models are reflected in the evolution of coding and non-coding sequences of paralogous genes is unknown. Results Here, we analyzed the coding and non-coding sequences of paralogous genes in Arabidopsis thaliana and compared these sequences with those of orthologous genes in Arabidopsis lyrata. Paralogs with lower expression than their duplicate had more nonsynonymous substitutions, were more likely to fractionate, and exhibited less similar expression patterns with their orthologs in the other species. Also, lower-expressed genes had greater tissue specificity. Orthologous conserved non-coding sequences in the promoters, introns, and 3′ untranslated regions were less abundant at lower-expressed genes compared to their higher-expressed paralogs. A gene ontology (GO) term enrichment analysis showed that paralogs with similar expression levels were enriched in GO terms related to ribosomes, whereas paralogs with different expression levels were enriched in terms associated with stress responses. Conclusions Loss of conserved non-coding sequences in one gene of a paralogous gene pair correlates with reduced expression levels that are more tissue specific. Together with increased mutation rates in the coding sequences, this suggests that similar forces of purifying selection act on coding and non-coding sequences. We propose that coding and non-coding sequences evolve concurrently following gene duplication. Electronic supplementary material The online version of this article (doi:10.1186/s12864-016-2803-2) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Robert D Hoffmann
- Center for Membrane Pumps in Cells and Disease - PUMPKIN, Danish National Research Foundation, Department of Plant and Environmental Sciences, University of Copenhagen, 1871, Frederiksberg C, Denmark.
| | - Michael Palmgren
- Center for Membrane Pumps in Cells and Disease - PUMPKIN, Danish National Research Foundation, Department of Plant and Environmental Sciences, University of Copenhagen, 1871, Frederiksberg C, Denmark
| |
Collapse
|
8
|
Abstract
BACKGROUND The major mechanism driving cellular differentiation and organism development is the regulation of gene expression. Cis-acting enhancers and silencers have key roles in controlling gene transcription. The genomic era allowed the transition from single gene analysis to the investigation of full transcriptomes. This transition increased the complexity of the analyses and the difficulty in the interpretation of the results. In this context, there is demand for new tools aimed at the creation of gene networks that can facilitate the interpretation of Next Generation Sequencing (NGS) data. RESULTS Arabidopsis Motif Scanner (AMS) is a Windows application that runs on local computers. It was developed to build gene networks by identifying the positions of cis-regulatory elements in the model plant Arabidopsis thaliana and by providing an easy interface to assess and evaluate gene relationships. Its major innovative feature is to combine the cis-regulatory element positions, NGS and DNA Chip Arrays expression data, Arabidopsis annotations and gene interactions for the identification of gene networks regulated by transcription factors. In studies focused on transcription factors function, the software uses the expression data and binding site motifs in the regulative gene regions to predict direct target genes. Additionally, AMS utilizes DNA-protein and protein-protein interaction data to facilitate the identification of the metabolic pathways regulated by the transcription factor of interest. CONCLUSIONS Arabidopsis Motif Scanner is a new tool that helps researchers to unravel gene relations and functions. In fact, it facilitates studies focused on the effects and the impact that transcription factors have on the transcriptome by correlating the position of cis-acting elements, gene expression data and interactions.
Collapse
|
9
|
Freeling M, Scanlon MJ, Fowler JE. Fractionation and subfunctionalization following genome duplications: mechanisms that drive gene content and their consequences. Curr Opin Genet Dev 2015; 35:110-8. [PMID: 26657818 DOI: 10.1016/j.gde.2015.11.002] [Citation(s) in RCA: 66] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2015] [Revised: 11/09/2015] [Accepted: 11/09/2015] [Indexed: 12/11/2022]
Abstract
A gene's duplication relaxes selection. Loss of duplicate, low-function DNA (fractionation) sometimes follows, mostly by deletion in plants, but mostly via the pseudogene pathway in fish and other clades with smaller population sizes. Subfunctionalization--the founding term of the Xfunctionalization lexicon--while not the general cause of differences in duplicate gene retention, becomes primary as the number of a gene's cis-regulatory sites increases. Balanced gene drive explains retention for the average gene. Both maintenance-of-balance and subfunctionalization drive gene content nonrandomly, and currently fall outside of our accepted Theory of Evolution. The 'typical' mutation encountered by a gene duplicate is not a neutral loss-of-function; dominant mutations (Muller's lexicon; these are not neutral) abound, and confound X functionalization terms like 'neofunctionalization'. Confusion of words may cause confusion of thought. As with many plants, fish tetraploidies provide a higher throughput surrogate-genetic method to infer function from human and other vertebrate ENCODE-like regulatory sites.
Collapse
Affiliation(s)
- Michael Freeling
- Department of Plant and Microbial Biology, Univ. California, Berkeley, CA 94720, United States.
| | - Michael J Scanlon
- Section of Plant Biology, Cornell University, Ithaca, NY 14853, United States
| | - John E Fowler
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR 97331, United States
| |
Collapse
|
10
|
Arsovski AA, Pradinuk J, Guo XQ, Wang S, Adams KL. Evolution of Cis-Regulatory Elements and Regulatory Networks in Duplicated Genes of Arabidopsis. PLANT PHYSIOLOGY 2015; 169:2982-91. [PMID: 26474639 PMCID: PMC4677880 DOI: 10.1104/pp.15.00717] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/19/2015] [Accepted: 10/16/2015] [Indexed: 05/22/2023]
Abstract
Plant genomes contain large numbers of duplicated genes that contribute to the evolution of new functions. Following duplication, genes can exhibit divergence in their coding sequence and their expression patterns. Changes in the cis-regulatory element landscape can result in changes in gene expression patterns. High-throughput methods developed recently can identify potential cis-regulatory elements on a genome-wide scale. Here, we use a recent comprehensive data set of DNase I sequencing-identified cis-regulatory binding sites (footprints) at single-base-pair resolution to compare binding sites and network connectivity in duplicated gene pairs in Arabidopsis (Arabidopsis thaliana). We found that duplicated gene pairs vary greatly in their cis-regulatory element architecture, resulting in changes in regulatory network connectivity. Whole-genome duplicates (WGDs) have approximately twice as many footprints in their promoters left by potential regulatory proteins than do tandem duplicates (TDs). The WGDs have a greater average number of footprint differences between paralogs than TDs. The footprints, in turn, result in more regulatory network connections between WGDs and other genes, forming denser, more complex regulatory networks than shown by TDs. When comparing regulatory connections between duplicates, WGDs had more pairs in which the two genes are either partially or fully diverged in their network connections, but fewer genes with no network connections than the TDs. There is evidence of younger TDs and WGDs having fewer unique connections compared with older duplicates. This study provides insights into cis-regulatory element evolution and network divergence in duplicated genes.
Collapse
Affiliation(s)
- Andrej A Arsovski
- Department of Botany, University of British Columbia, Vancouver, Canada V6T 1Z4
| | - Julian Pradinuk
- Department of Botany, University of British Columbia, Vancouver, Canada V6T 1Z4
| | - Xu Qiu Guo
- Department of Botany, University of British Columbia, Vancouver, Canada V6T 1Z4
| | - Sishuo Wang
- Department of Botany, University of British Columbia, Vancouver, Canada V6T 1Z4
| | - Keith L Adams
- Department of Botany, University of British Columbia, Vancouver, Canada V6T 1Z4
| |
Collapse
|
11
|
CNMS: The preferred genic markers for comparative genomic, molecular phylogenetic, functional genetic diversity and differential gene regulatory expression analyses in chickpea. J Biosci 2015; 40:579-92. [PMID: 26333404 DOI: 10.1007/s12038-015-9545-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
Abstract
The intra/inter-genomic comparative mapping-based phylogenetic footprinting identified 5 paralogous and 656 orthologous genome-wide CNMS markers in the upstream sequences of chickpea genes. These CNMS markers revealed a high-degree of gene-based syntenic relationship between chickpea and Medicago genomes while minimum between chickpea and Vitis genomes. The time of divergence and duplication estimated using CNMS markers highlight the expected phylogenetic relationships between chickpea and six dicot (legume) species as well as occurrence of ancient genome (approximately 53 Mya) with small-scale recent segmental (approximately 10 Mya) duplication events in chickpea. A wider level of functional molecular diversity (14 to 88 percent) and admixed population genetic structure was detected among desi, kabuli and wild genotypes by genic CNMS markers at a genome-wide scale suggesting their utility in large-scale genetic analysis in chickpea. The subfunctionalization at the cis-regulatory element region and TFBS (transcription factor binding site) motif levels in the upstream sequences of CNMS marker-associated orthologous genes than the paralogues was predominant. Functional constraint might have considerable effect on these CNMScontaining regulatory elements controlling consistent orthologous gene expression in dicots. A rapid subfunctionalization based on diverge differential expression of paralogous CNMS marker-associated genes particularly those that underwent recent small-scale segmental duplication events in chickpea was apparent. The differential regulation of expression and subfunctionalization potential of Ultra CNMS marker-associated genes suggest their utility in deciphering the complex gene regulatory function as well as identification and targeted mapping of potential genes/QTLs governing vital agronomic traits in chickpea. The gene-based CNMS markers with desirable inherent genetic attributes like higher degree of comparative genome mapping, functional genetic diversity and differential gene regulatory expression potential can significantly propel the genomics-assisted chickpea crop improvement.
Collapse
|
12
|
Berke L, Snel B. The histone modification H3K27me3 is retained after gene duplication and correlates with conserved noncoding sequences in Arabidopsis. Genome Biol Evol 2014; 6:572-9. [PMID: 24567304 PMCID: PMC3971591 DOI: 10.1093/gbe/evu040] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
The histone modification H3K27me3 is involved in repression of transcription and plays a crucial role in developmental transitions in both animals and plants. It is deposited by PRC2 (Polycomb repressive complex 2), a conserved protein complex. In Arabidopsis thaliana, H3K27me3 is found at 15% of all genes. These tend to encode transcription factors and other regulators important for development. However, it is not known how PRC2 is recruited to target loci nor how this set of target genes arose during Arabidopsis evolution. To resolve the latter, we integrated A. thaliana gene families with five independent genome-wide H3K27me3 data sets. Gene families were either significantly enriched or depleted of H3K27me3, showing a strong impact of shared ancestry to H3K27me3 distribution. To quantify this, we performed ancestral state reconstruction of H3K27me3 on phylogenetic trees of gene families. The set of H3K27me3-marked genes changed less than expected by chance, suggesting that H3K27me3 was retained after gene duplication. This retention suggests that the PRC2-recruiting signal could be encoded in the DNA and also conserved among certain duplicated genes. Indeed, H3K27me3-marked genes were overrepresented among paralogs sharing conserved noncoding sequences (CNSs) that are enriched with transcription factor binding sites. The association of upstream CNSs with H3K27me3-marked genes represents the first genome-wide connection between H3K27me3 and potential regulatory elements in plants. Thus, we propose that CNSs likely function as part of the PRC2 recruitment in plants.
Collapse
Affiliation(s)
- Lidija Berke
- Theoretical Biology and Bioinformatics, Department of Biology, Faculty of Science, Utrecht University, The Netherlands
| | | |
Collapse
|
13
|
Burgess D, Freeling M. The most deeply conserved noncoding sequences in plants serve similar functions to those in vertebrates despite large differences in evolutionary rates. THE PLANT CELL 2014; 26:946-61. [PMID: 24681619 PMCID: PMC4001403 DOI: 10.1105/tpc.113.121905] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/12/2023]
Abstract
In vertebrates, conserved noncoding elements (CNEs) are functionally constrained sequences that can show striking conservation over >400 million years of evolutionary distance and frequently are located megabases away from target developmental genes. Conserved noncoding sequences (CNSs) in plants are much shorter, and it has been difficult to detect conservation among distantly related genomes. In this article, we show not only that CNS sequences can be detected throughout the eudicot clade of flowering plants, but also that a subset of 37 CNSs can be found in all flowering plants (diverging ∼170 million years ago). These CNSs are functionally similar to vertebrate CNEs, being highly associated with transcription factor and development genes and enriched in transcription factor binding sites. Some of the most highly conserved sequences occur in genes encoding RNA binding proteins, particularly the RNA splicing-associated SR genes. Differences in sequence conservation between plants and animals are likely to reflect differences in the biology of the organisms, with plants being much more able to tolerate genomic deletions and whole-genome duplication events due, in part, to their far greater fecundity compared with vertebrates.
Collapse
|
14
|
Su Z, Ma X, Guo H, Sukiran NL, Guo B, Assmann SM, Ma H. Flower development under drought stress: morphological and transcriptomic analyses reveal acute responses and long-term acclimation in Arabidopsis. THE PLANT CELL 2013; 25:3785-807. [PMID: 24179129 PMCID: PMC3877795 DOI: 10.1105/tpc.113.115428] [Citation(s) in RCA: 117] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/25/2013] [Revised: 09/30/2013] [Accepted: 10/14/2013] [Indexed: 05/06/2023]
Abstract
Drought dramatically affects plant growth and crop yield, but previous studies primarily examined responses to drought during vegetative development. Here, to study responses to drought during reproductive development, we grew Arabidopsis thaliana plants with limited water, under conditions that allowed the plants to initiate and complete reproduction. Drought treatment from just after the onset of flowering to seed maturation caused an early arrest of floral development and sterility. After acclimation, plants showed reduced fertility that persisted throughout reproductive development. Floral defects included abnormal anther development, lower pollen viability, reduced filament elongation, ovule abortion, and failure of flowers to open. Drought also caused differential expression of 4153 genes, including flowering time genes flowering locus t, suppressor of overexpression of CO1, and leafy, genes regulating anther and pistil development, and stress-related transcription factors. Mutant phenotypes of hypersensitivity to drought and fewer differentially expressed genes suggest that dehydration response element B1A may have an important function in drought response in flowers. A more severe filament elongation defect under drought in myb21 plants demonstrated that appropriate stamen development requires MYB domain protein 21 under drought conditions. Our study reveals a regulatory cascade in reproductive responses and acclimation under drought.
Collapse
Affiliation(s)
- Zhao Su
- Department of Biology and the Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, Pennsylvania 16802
| | - Xuan Ma
- Department of Biology and the Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, Pennsylvania 16802
- Intercollege Graduate Program in Cell and Developmental Biology, Pennsylvania State University, University Park, Pennsylvania 16802
| | - Huihong Guo
- Department of Biology and the Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, Pennsylvania 16802
- College of Biological Science and Biotechnology, Beijing Forestry University, Beijing 100083, China
| | - Noor Liyana Sukiran
- Department of Biology and the Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, Pennsylvania 16802
| | - Bin Guo
- Department of Biology and the Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, Pennsylvania 16802
- State Key Laboratory of Genetic Engineering and Institute of Plant Biology, Institute of Genetics, Center for Evolutionary Biology, School of Life Sciences, Fudan University, Shanghai 200433, China
| | - Sarah M. Assmann
- Department of Biology and the Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, Pennsylvania 16802
| | - Hong Ma
- Department of Biology and the Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, Pennsylvania 16802
- Intercollege Graduate Program in Cell and Developmental Biology, Pennsylvania State University, University Park, Pennsylvania 16802
- State Key Laboratory of Genetic Engineering and Institute of Plant Biology, Institute of Genetics, Center for Evolutionary Biology, School of Life Sciences, Fudan University, Shanghai 200433, China
| |
Collapse
|
15
|
Subramaniam S, Wang X, Freeling M, Pires JC. The fate of Arabidopsis thaliana homeologous CNSs and their motifs in the Paleohexaploid Brassica rapa. Genome Biol Evol 2013; 5:646-60. [PMID: 23493633 PMCID: PMC3641636 DOI: 10.1093/gbe/evt035] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Following polyploidy, duplicate genes are often deleted, and if they are not, then duplicate regulatory regions are sometimes lost. By what mechanism is this loss and what is the chance that such a loss removes function? To explore these questions, we followed individual Arabidopsis thaliana–A. thaliana conserved noncoding sequences (CNSs) into the Brassica ancestor, through a paleohexaploidy and into Brassica rapa. Thus, a single Brassicaceae CNS has six potential orthologous positions in B. rapa; a single Arabidopsis CNS has three potential homeologous positions. We reasoned that a CNS, if present on a singlet Brassica gene, would be unlikely to lose function compared with a more redundant CNS, and this is the case. Redundant CNSs go nondetectable often. Using this logic, each mechanism of CNS loss was assigned a metric of functionality. By definition, proved deletions do not function as sequence. Our results indicated that CNSs that go nondetectable by base substitution or large insertion are almost certainly still functional (redundancy does not matter much to their detectability frequency), whereas those lost by inferred deletion or indels are approximately 75% likely to be nonfunctional. Overall, an average nondetectable, once-redundant CNS more than 30 bp in length has a 72% chance of being nonfunctional, and that makes sense because 97% of them sort to a molecular mechanism with “deletion” in its description, but base substitutions do cause loss. Similarly, proved-functional G-boxes go undetectable by deletion 82% of the time. Fractionation mutagenesis is a procedure that uses polyploidy as a mutagenic agent to genetically alter RNA expression profiles, and then to construct testable hypotheses as to the function of the lost regulatory site. We show fractionation mutagenesis to be a “deletion machine” in the Brassica lineage.
Collapse
|
16
|
Spangler JB, Feltus FA. Conserved Non-Coding Sequences are Associated with Rates of mRNA Decay in Arabidopsis. FRONTIERS IN PLANT SCIENCE 2013; 4:129. [PMID: 23675377 PMCID: PMC3650315 DOI: 10.3389/fpls.2013.00129] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/18/2013] [Accepted: 04/19/2013] [Indexed: 06/02/2023]
Abstract
Steady-state mRNA levels are tightly regulated through a combination of transcriptional and post-transcriptional control mechanisms. The discovery of cis-acting DNA elements that encode these control mechanisms is of high importance. We have investigated the influence of conserved non-coding sequences (CNSs), DNA patterns retained after an ancient whole genome duplication event, on the breadth of gene expression and the rates of mRNA decay in Arabidopsis thaliana. The absence of CNSs near α duplicate genes was associated with a decrease in breadth of gene expression and slower mRNA decay rates while the presence CNSs near α duplicates was associated with an increase in breadth of gene expression and faster mRNA decay rates. The observed difference in mRNA decay rate was fastest in genes with CNSs in both non-transcribed and transcribed regions, albeit through an unknown mechanism. This study supports the notion that some Arabidopsis CNSs regulate the steady-state mRNA levels through post-transcriptional control mechanisms and that CNSs also play a role in controlling the breadth of gene expression.
Collapse
Affiliation(s)
- Jacob B. Spangler
- Department of Genetics and Biochemistry, Clemson UniversityClemson, SC, USA
| | - Frank Alex Feltus
- Department of Genetics and Biochemistry, Clemson UniversityClemson, SC, USA
- Plant and Environmental Sciences, Clemson UniversityClemson, SC, USA
| |
Collapse
|
17
|
Turco G, Schnable JC, Pedersen B, Freeling M. Automated conserved non-coding sequence (CNS) discovery reveals differences in gene content and promoter evolution among grasses. FRONTIERS IN PLANT SCIENCE 2013; 4:170. [PMID: 23874343 PMCID: PMC3708275 DOI: 10.3389/fpls.2013.00170] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/04/2013] [Accepted: 05/13/2013] [Indexed: 05/07/2023]
Abstract
Conserved non-coding sequences (CNS) are islands of non-coding sequence that, like protein coding exons, show less divergence in sequence between related species than functionless DNA. Several CNSs have been demonstrated experimentally to function as cis-regulatory regions. However, the specific functions of most CNSs remain unknown. Previous searches for CNS in plants have either anchored on exons and only identified nearby sequences or required years of painstaking manual annotation. Here we present an open source tool that can accurately identify CNSs between any two related species with sequenced genomes, including both those immediately adjacent to exons and distal sequences separated by >12 kb of non-coding sequence. We have used this tool to characterize new motifs, associate CNSs with additional functions, and identify previously undetected genes encoding RNA and protein in the genomes of five grass species. We provide a list of 15,363 orthologous CNSs conserved across all grasses tested. We were also able to identify regulatory sequences present in the common ancestor of grasses that have been lost in one or more extant grass lineages. Lists of orthologous gene pairs and associated CNSs are provided for reference inbred lines of arabidopsis, Japonica rice, foxtail millet, sorghum, brachypodium, and maize.
Collapse
Affiliation(s)
| | - James C. Schnable
- *Correspondence: James C. Schnable and Michael Freeling, Department of Plant and Microbial Biology, University of California, 111 Koshland Hall, Berkeley, CA 94720, USA e-mail: ;
| | | | - Michael Freeling
- *Correspondence: James C. Schnable and Michael Freeling, Department of Plant and Microbial Biology, University of California, 111 Koshland Hall, Berkeley, CA 94720, USA e-mail: ;
| |
Collapse
|
18
|
Contribution of the epigenetic mark H3K27me3 to functional divergence after whole genome duplication in Arabidopsis. Genome Biol 2012; 13:R94. [PMID: 23034476 PMCID: PMC3491422 DOI: 10.1186/gb-2012-13-10-r94] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2012] [Accepted: 10/03/2012] [Indexed: 12/21/2022] Open
Abstract
Background Following gene duplication, retained paralogs undergo functional divergence, which is reflected in changes in DNA sequence and expression patterns. The extent of divergence is influenced by several factors, including protein function. We examine whether an epigenetic modification, trimethylation of histone H3 at lysine 27 (H3K27me3), could be a factor in the evolution of expression patterns after gene duplication. Whereas in animals this repressive mark for transcription is deposited on long regions of DNA, in plants its localization is gene-specific. Because of this and a well-annotated recent whole-genome duplication, Arabidopsis thaliana is uniquely suited for studying the potential association of H3K27me3 with the evolutionary fate of genes. Results Paralogous pairs with H3K27me3 show the highest coding sequence divergence, which can be explained by their low expression levels. Interestingly, they also show the highest similarity in expression patterns and upstream regulatory regions, while paralogous pairs where only one gene is an H3K27me3 target show the highest divergence in expression patterns and upstream regulatory sequence. These trends in divergence of expression and upstream regions are especially pronounced for transcription factors. Conclusions After duplication, a histone modification can be associated with a particular fate of paralogs: H3K27me3 is linked to lower expression divergence yet higher coding sequence divergence. Our results show that H3K27me3 constrains expression divergence after duplication. Moreover, its association with higher conservation of upstream regions provides a potential mechanism for the conserved H3K27me3 targeting of the paralogs.
Collapse
|
19
|
Baxter L, Jironkin A, Hickman R, Moore J, Barrington C, Krusche P, Dyer NP, Buchanan-Wollaston V, Tiskin A, Beynon J, Denby K, Ott S. Conserved noncoding sequences highlight shared components of regulatory networks in dicotyledonous plants. THE PLANT CELL 2012; 24:3949-65. [PMID: 23110901 PMCID: PMC3517229 DOI: 10.1105/tpc.112.103010] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/05/2023]
Abstract
Conserved noncoding sequences (CNSs) in DNA are reliable pointers to regulatory elements controlling gene expression. Using a comparative genomics approach with four dicotyledonous plant species (Arabidopsis thaliana, papaya [Carica papaya], poplar [Populus trichocarpa], and grape [Vitis vinifera]), we detected hundreds of CNSs upstream of Arabidopsis genes. Distinct positioning, length, and enrichment for transcription factor binding sites suggest these CNSs play a functional role in transcriptional regulation. The enrichment of transcription factors within the set of genes associated with CNS is consistent with the hypothesis that together they form part of a conserved transcriptional network whose function is to regulate other transcription factors and control development. We identified a set of promoters where regulatory mechanisms are likely to be shared between the model organism Arabidopsis and other dicots, providing areas of focus for further research.
Collapse
Affiliation(s)
- Laura Baxter
- Warwick Systems Biology Centre, University of Warwick, Coventry CV4 7AL, United Kingdom
| | - Aleksey Jironkin
- Warwick Systems Biology Centre, University of Warwick, Coventry CV4 7AL, United Kingdom
| | - Richard Hickman
- Warwick Systems Biology Centre, University of Warwick, Coventry CV4 7AL, United Kingdom
| | - Jay Moore
- Warwick Systems Biology Centre, University of Warwick, Coventry CV4 7AL, United Kingdom
| | | | - Peter Krusche
- Warwick Systems Biology Centre, University of Warwick, Coventry CV4 7AL, United Kingdom
| | - Nigel P. Dyer
- Molecular Organisation and Assembly in Cells Doctoral Training Centre, University of Warwick, Coventry CV4 7AL, United Kingdom
| | - Vicky Buchanan-Wollaston
- Warwick Systems Biology Centre, University of Warwick, Coventry CV4 7AL, United Kingdom
- School of Life Sciences, University of Warwick, Coventry CV4 7AL, United Kingdom
| | - Alexander Tiskin
- Department of Computer Science, University of Warwick, Coventry CV4 7AL, United Kingdom
| | - Jim Beynon
- Warwick Systems Biology Centre, University of Warwick, Coventry CV4 7AL, United Kingdom
- School of Life Sciences, University of Warwick, Coventry CV4 7AL, United Kingdom
| | - Katherine Denby
- Warwick Systems Biology Centre, University of Warwick, Coventry CV4 7AL, United Kingdom
- School of Life Sciences, University of Warwick, Coventry CV4 7AL, United Kingdom
| | - Sascha Ott
- Warwick Systems Biology Centre, University of Warwick, Coventry CV4 7AL, United Kingdom
- Address correspondence to
| |
Collapse
|
20
|
Heyndrickx KS, Vandepoele K. Systematic identification of functional plant modules through the integration of complementary data sources. PLANT PHYSIOLOGY 2012; 159:884-901. [PMID: 22589469 PMCID: PMC3387714 DOI: 10.1104/pp.112.196725] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/02/2023]
Abstract
A major challenge is to unravel how genes interact and are regulated to exert specific biological functions. The integration of genome-wide functional genomics data, followed by the construction of gene networks, provides a powerful approach to identify functional gene modules. Large-scale expression data, functional gene annotations, experimental protein-protein interactions, and transcription factor-target interactions were integrated to delineate modules in Arabidopsis (Arabidopsis thaliana). The different experimental input data sets showed little overlap, demonstrating the advantage of combining multiple data types to study gene function and regulation. In the set of 1,563 modules covering 13,142 genes, most modules displayed strong coexpression, but functional and cis-regulatory coherence was less prevalent. Highly connected hub genes showed a significant enrichment toward embryo lethality and evidence for cross talk between different biological processes. Comparative analysis revealed that 58% of the modules showed conserved coexpression across multiple plants. Using module-based functional predictions, 5,562 genes were annotated, and an evaluation experiment disclosed that, based on 197 recently experimentally characterized genes, 38.1% of these functions could be inferred through the module context. Examples of confirmed genes of unknown function related to cell wall biogenesis, xylem and phloem pattern formation, cell cycle, hormone stimulus, and circadian rhythm highlight the potential to identify new gene functions. The module-based predictions offer new biological hypotheses for functionally unknown genes in Arabidopsis (1,701 genes) and six other plant species (43,621 genes). Furthermore, the inferred modules provide new insights into the conservation of coexpression and coregulation as well as a starting point for comparative functional annotation.
Collapse
|
21
|
Parizot B, Roberts I, Raes J, Beeckman T, De Smet I. In silico analyses of pericycle cell populations reinforce their relation with associated vasculature in Arabidopsis. Philos Trans R Soc Lond B Biol Sci 2012; 367:1479-88. [PMID: 22527390 PMCID: PMC3321678 DOI: 10.1098/rstb.2011.0227] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
In Arabidopsis, lateral root initiation occurs in a subset of pericycle cells at the xylem pole that will divide asymmetrically to give rise to a new lateral root organ. While lateral roots never develop at the phloem pole, it is unclear how the interaction with xylem and phloem poles determines the distinct pericycle identities with different competences. Nevertheless, pericycle cells at these poles are marked by differences in size, by ultrastructural features and by specific proteins and gene expression. Here, we provide transcriptional evidence that pericycle cells are intimately associated with their vascular tissue instead of being a separate concentric layer. This has implications for the identification of cell- and tissue-specific promoters that are necessary to drive and/or alter gene expression locally, avoiding pleiotropic effects. We were able to identify a small set of genes that display specific expression in the phloem or xylem pole pericycle cells, and we were able to identify motifs that are likely to drive expression in either one of those tissues.
Collapse
Affiliation(s)
- Boris Parizot
- Department of Plant Systems Biology, VIB, Technologiepark 927, 9052 Ghent, Belgium
- Department of Plant Biotechnology and Genetics, Ghent University, Technologiepark 927, 9052 Ghent, Belgium
| | - Ianto Roberts
- Department of Plant Systems Biology, VIB, Technologiepark 927, 9052 Ghent, Belgium
- Department of Plant Biotechnology and Genetics, Ghent University, Technologiepark 927, 9052 Ghent, Belgium
| | - Jeroen Raes
- Department of Plant Systems Biology, VIB, Technologiepark 927, 9052 Ghent, Belgium
- Department of Plant Biotechnology and Genetics, Ghent University, Technologiepark 927, 9052 Ghent, Belgium
| | - Tom Beeckman
- Department of Plant Systems Biology, VIB, Technologiepark 927, 9052 Ghent, Belgium
- Department of Plant Biotechnology and Genetics, Ghent University, Technologiepark 927, 9052 Ghent, Belgium
| | - Ive De Smet
- Division of Plant and Crop Sciences, School of Biosciences, University of Nottingham, Loughborough LE12 5RD, UK
| |
Collapse
|
22
|
Schnable JC, Freeling M, Lyons E. Genome-wide analysis of syntenic gene deletion in the grasses. Genome Biol Evol 2012; 4:265-77. [PMID: 22275519 PMCID: PMC3318446 DOI: 10.1093/gbe/evs009] [Citation(s) in RCA: 116] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
The grasses, Poaceae, are one of the largest and most successful angiosperm families. Like many radiations of flowering plants, the divergence of the major grass lineages was preceded by a whole-genome duplication (WGD), although these events are not rare for flowering plants. By combining identification of syntenic gene blocks with measures of gene pair divergence and different frequencies of ancient gene loss, we have separated the two subgenomes present in modern grasses. Reciprocal loss of duplicated genes or genomic regions has been hypothesized to reproductively isolate populations and, thus, speciation. However, in contrast to previous studies in yeast and teleost fishes, we found very little evidence of reciprocal loss of homeologous genes between the grasses, suggesting that post-WGD gene loss may not be the cause of the grass radiation. The sets of homeologous and orthologous genes and predicted locations of deleted genes identified in this study, as well as links to the CoGe comparative genomics web platform for analyzing pan-grass syntenic regions, are provided along with this paper as a resource for the grass genetics community.
Collapse
Affiliation(s)
- James C Schnable
- Department of Plant and Microbial Biology, University of California-Berkeley, CA, USA
| | | | | |
Collapse
|
23
|
Spangler JB, Subramaniam S, Freeling M, Feltus FA. Evidence of function for conserved noncoding sequences in Arabidopsis thaliana. THE NEW PHYTOLOGIST 2012; 193:241-252. [PMID: 21955124 DOI: 10.1111/j.1469-8137.2011.03916.x] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/09/2023]
Abstract
• Whole genome duplication events provide a lineage with a large reservoir of genes that can be molded by evolutionary forces into phenotypes that fit alternative environments. A well-studied whole genome duplication, the α-event, occurred in an ancestor of the model plant Arabidopsis thaliana. Retained segments of the α-event have been defined in recent years in the form of duplicate protein coding sequences (α-pairs) and associated conserved noncoding DNA sequences (CNSs). Our aim was to identify any association between CNSs and α-pair co-functionality at the gene expression level. • Here, we tested for correlation between CNS counts and α-pair co-expression and expression intensity across nine expression datasets: aerial tissue, flowers, leaves, roots, rosettes, seedlings, seeds, shoots and whole plants. • We provide evidence for a putative regulatory role of the CNSs. The association of CNSs with α-pair co-expression and expression intensity varied by gene function, subgene position and the presence of transcription factor binding motifs. A range of possible CNS regulatory mechanisms, including intron-mediated enhancement, messenger RNA fold stability and transcriptional regulation, are discussed. • This study provides a framework to understand how CNS motifs are involved in the maintenance of gene expression after a whole genome duplication event.
Collapse
Affiliation(s)
- Jacob B Spangler
- Department of Genetics & Biochemistry, Clemson University, Clemson, SC 29634, USA
| | - Sabarinath Subramaniam
- Department of Plant and Microbial Biology, University of California Berkeley, Berkeley, CA 94720, USA
| | - Michael Freeling
- Department of Plant and Microbial Biology, University of California Berkeley, Berkeley, CA 94720, USA
| | - F Alex Feltus
- Department of Genetics & Biochemistry, Clemson University, Clemson, SC 29634, USA
| |
Collapse
|
24
|
Tang H, Lyons E. Unleashing the genome of brassica rapa. FRONTIERS IN PLANT SCIENCE 2012; 3:172. [PMID: 22866056 PMCID: PMC3408644 DOI: 10.3389/fpls.2012.00172] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/31/2012] [Accepted: 07/12/2012] [Indexed: 05/06/2023]
Abstract
The completion and release of the Brassica rapa genome is of great benefit to researchers of the Brassicas, Arabidopsis, and genome evolution. While its lineage is closely related to the model organism Arabidopsis thaliana, the Brassicas experienced a whole genome triplication subsequent to their divergence. This event contemporaneously created three copies of its ancestral genome, which had diploidized through the process of homeologous gene loss known as fractionation. By the fractionation of homeologous gene content and genetic regulatory binding sites, Brassica's genome is well placed to use comparative genomic techniques to identify syntenic regions, homeologous gene duplications, and putative regulatory sequences. Here, we use the comparative genomics platform CoGe to perform several different genomic analyses with which to study structural changes of its genome and dynamics of various genetic elements. Starting with whole genome comparisons, the Brassica paleohexaploidy is characterized, syntenic regions with A. thaliana are identified, and the TOC1 gene in the circadian rhythm pathway from A. thaliana is used to find duplicated orthologs in B. rapa. These TOC1 genes are further analyzed to identify conserved non-coding sequences that contain cis-acting regulatory elements and promoter sequences previously implicated in circadian rhythmicity. Each "cookbook style" analysis includes a step-by-step walk-through with links to CoGe to quickly reproduce each step of the analytical process.
Collapse
Affiliation(s)
| | - Eric Lyons
- iPlant Collaborative, School of Plant Sciences, University of ArizonaTucson, AZ, USA
- *Correspondence: Eric Lyons, iPlant Collaborative, School of Plant Sciences, University of Arizona, Keating Bioresearch Building, 1657 E. Helen St. Tucson, AZ 85745, USA. e-mail:
| |
Collapse
|
25
|
Zhang W, Wu Y, Schnable JC, Zeng Z, Freeling M, Crawford GE, Jiang J. High-resolution mapping of open chromatin in the rice genome. Genome Res 2011; 22:151-62. [PMID: 22110044 DOI: 10.1101/gr.131342.111] [Citation(s) in RCA: 175] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Gene expression is controlled by the complex interaction of transcription factors binding to promoters and other regulatory DNA elements. One common characteristic of the genomic regions associated with regulatory proteins is a pronounced sensitivity to DNase I digestion. We generated genome-wide high-resolution maps of DNase I hypersensitive (DH) sites from both seedling and callus tissues of rice (Oryza sativa). Approximately 25% of the DH sites from both tissues were found in putative promoters, indicating that the vast majority of the gene regulatory elements in rice are not located in promoter regions. We found 58% more DH sites in the callus than in the seedling. For DH sites detected in both the seedling and callus, 31% displayed significantly different levels of DNase I sensitivity within the two tissues. Genes that are differentially expressed in the seedling and callus were frequently associated with DH sites in both tissues. The DNA sequences contained within the DH sites were hypomethylated, consistent with what is known about active gene regulatory elements. Interestingly, tissue-specific DH sites located in the promoters showed a higher level of DNA methylation than the average DNA methylation level of all the DH sites located in the promoters. A distinct elevation of H3K27me3 was associated with intergenic DH sites. These results suggest that epigenetic modifications play a role in the dynamic changes of the numbers and DNase I sensitivity of DH sites during development.
Collapse
Affiliation(s)
- Wenli Zhang
- Department of Horticulture, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | | | | | | | | | | | | |
Collapse
|
26
|
Reineke AR, Bornberg-Bauer E, Gu J. Evolutionary divergence and limits of conserved non-coding sequence detection in plant genomes. Nucleic Acids Res 2011; 39:6029-43. [PMID: 21470961 PMCID: PMC3152334 DOI: 10.1093/nar/gkr179] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2010] [Revised: 02/22/2011] [Accepted: 03/15/2011] [Indexed: 12/17/2022] Open
Abstract
The discovery of regulatory motifs embedded in upstream regions of plants is a particularly challenging bioinformatics task. Previous studies have shown that motifs in plants are short compared with those found in vertebrates. Furthermore, plant genomes have undergone several diversification mechanisms such as genome duplication events which impact the evolution of regulatory motifs. In this article, a systematic phylogenomic comparison of upstream regions is conducted to further identify features of the plant regulatory genomes, the component of genomes regulating gene expression, to enable future de novo discoveries. The findings highlight differences in upstream region properties between major plant groups and the effects of divergence times and duplication events. First, clear differences in upstream region evolution can be detected between monocots and dicots, thus suggesting that a separation of these groups should be made when searching for novel regulatory motifs, particularly since universal motifs such as the TATA box are rare. Second, investigating the decay rate of significantly aligned regions suggests that a divergence time of ~100 mya sets a limit for reliable conserved non-coding sequence (CNS) detection. Insights presented here will set a framework to help identify embedded motifs of functional relevance by understanding the limits of bioinformatics detection for CNSs.
Collapse
Affiliation(s)
| | | | - Jenny Gu
- Institute for Evolution and Biodiversity, University of Münster, Hüfferstrasse 1, 48149, Münster, Germany
| |
Collapse
|
27
|
Schnable JC, Pedersen BS, Subramaniam S, Freeling M. Dose-sensitivity, conserved non-coding sequences, and duplicate gene retention through multiple tetraploidies in the grasses. FRONTIERS IN PLANT SCIENCE 2011; 2:2. [PMID: 22645525 PMCID: PMC3355796 DOI: 10.3389/fpls.2011.00002] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/09/2011] [Accepted: 02/19/2011] [Indexed: 05/08/2023]
Abstract
Whole genome duplications, or tetraploidies, are an important source of increased gene content. Following whole genome duplication, duplicate copies of many genes are lost from the genome. This loss of genes is biased both in the classes of genes deleted and the subgenome from which they are lost. Many or all classes are genes preferentially retained as duplicate copies are engaged in dose sensitive protein-protein interactions, such that deletion of any one duplicate upsets the status quo of subunit concentrations, and presumably lowers fitness as a result. Transcription factors are also preferentially retained following every whole genome duplications studied. This has been explained as a consequence of protein-protein interactions, just as for other highly retained classes of genes. We show that the quantity of conserved noncoding sequences (CNSs) associated with genes predicts the likelihood of their retention as duplicate pairs following whole genome duplication. As many CNSs likely represent binding sites for transcriptional regulators, we propose that the likelihood of gene retention following tetraploidy may also be influenced by dose-sensitive protein-DNA interactions between the regulatory regions of CNS-rich genes - nicknamed bigfoot genes - and the proteins that bind to them. Using grass genomes, we show that differential loss of CNSs from one member of a pair following the pre-grass tetraploidy reduces its chance of retention in the subsequent maize lineage tetraploidy.
Collapse
Affiliation(s)
- James C. Schnable
- Department of Plant and Microbial Biology, University of California BerkeleyBerkeley, CA, USA
| | - Brent S. Pedersen
- Department of Plant and Microbial Biology, University of California BerkeleyBerkeley, CA, USA
| | - Sabarinath Subramaniam
- Department of Plant and Microbial Biology, University of California BerkeleyBerkeley, CA, USA
| | - Michael Freeling
- Department of Plant and Microbial Biology, University of California BerkeleyBerkeley, CA, USA
- *Correspondence: Michael Freeling, Department of Plant and Microbial Biology, University of California-Berkeley, 111 Koshland Hall, Berkeley, CA 94720, United States of America e-mail:
| |
Collapse
|
28
|
Paterson AH, Freeling M, Tang H, Wang X. Insights from the comparison of plant genome sequences. ANNUAL REVIEW OF PLANT BIOLOGY 2010; 61:349-72. [PMID: 20441528 DOI: 10.1146/annurev-arplant-042809-112235] [Citation(s) in RCA: 117] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/18/2023]
Abstract
The next decade will see essentially completed sequences for multiple branches of virtually all angiosperm clades that include major crops and/or botanical models. These sequences will provide a powerful framework for relating genome-level events to aspects of morphological and physiological variation that have contributed to the colonization of much of the planet by angiosperms. Clarification of the fundamental angiosperm gene set, its arrangement, lineage-specific variations in gene repertoire and arrangement, and the fates of duplicated gene pairs will advance knowledge of functional and regulatory diversity and perhaps shed light on adaptation by lineages to whole-genome duplication, which is a distinguishing feature of angiosperm evolution. Better understanding of the relationships among angiosperm genomes promises to provide a firm foundation upon which to base translational genomics: the leveraging of hard-won structural and functional genomic information from crown botanical models to dissect novel and, in some cases, economically important features in many additional organisms.
Collapse
Affiliation(s)
- Andrew H Paterson
- Department of Plant Biology, University of Georgia, Athens, Georgia.
| | | | | | | |
Collapse
|
29
|
Priest HD, Filichkin SA, Mockler TC. Cis-regulatory elements in plant cell signaling. CURRENT OPINION IN PLANT BIOLOGY 2009; 12:643-649. [PMID: 19717332 DOI: 10.1016/j.pbi.2009.07.016] [Citation(s) in RCA: 80] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/28/2009] [Revised: 06/30/2009] [Accepted: 07/21/2009] [Indexed: 05/26/2023]
Abstract
Plant cell signaling pathways are in part dependent on transcriptional regulatory networks comprising circuits of transcription factors (TFs) and regulatory DNA elements that control the expression of target genes. Here, we describe experimental and bioinformatic approaches for identifying potential cis-regulatory elements. We also discuss recent integrative genomics studies aimed at elucidating the functions of cis-regulatory elements in aspects of plant biology, including the circadian clock, interactions with the environment, stress responses, and regulation of growth and development by phytohormones. Finally, we discuss emerging technologies and approaches that offer great potential for accelerating the discovery and functional characterization of cis-elements and interacting TFs--which will help realize the promise of systems biology.
Collapse
Affiliation(s)
- Henry D Priest
- Department of Botany and Plant Pathology and Center for Genome Research and Biocomputing, Oregon State University, Corvallis, OR 97331, USA
| | | | | |
Collapse
|
30
|
Jurkuta RJ, Kaplinsky NJ, Spindel JE, Barton MK. Partitioning the apical domain of the Arabidopsis embryo requires the BOBBER1 NudC domain protein. THE PLANT CELL 2009; 21:1957-71. [PMID: 19648297 PMCID: PMC2729608 DOI: 10.1105/tpc.108.065284] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/05/2009] [Revised: 06/24/2009] [Accepted: 07/09/2009] [Indexed: 05/20/2023]
Abstract
The apical domain of the embryo is partitioned into distinct regions that will give rise to the cotyledons and the shoot apical meristem. In this article, we describe a novel screen to identify Arabidopsis thaliana embryo arrest mutants that are defective in this partitioning, and we describe the phenotype of one such mutant, bobber1. bobber1 mutants arrest at the globular stage of development, they express the meristem-specific SHOOTMERISTEMLESS gene throughout the top half of the embryo, and they fail to express the AINTEGUMENTA transcript normally found in cotyledons. Thus, BOBBER1 is required to limit the extent of the meristem domain and/or to promote the development of the cotyledon domains. Based on expression of early markers for apical development, bobber1 mutants differentiate protodermis and undergo normal early apical development. Consistent with a role for auxin in cotyledon development, BOBBER1 mutants fail to express localized maxima of the DR5:green fluorescent protein reporter. BOBBER1 encodes a protein with homology to the Aspergillus nidulans protein NUDC that has similarity to protein chaperones, indicating a possible role for BOBBER1 in synthesis or transport of proteins involved in patterning the Arabidopsis embryo.
Collapse
Affiliation(s)
- Rebecca Joy Jurkuta
- Department of Genetics, University of Wisconsin, Madison, Wisconsin 53706, USA
| | | | | | | |
Collapse
|
31
|
Freeling M, Subramaniam S. Conserved noncoding sequences (CNSs) in higher plants. CURRENT OPINION IN PLANT BIOLOGY 2009; 12:126-32. [PMID: 19249238 DOI: 10.1016/j.pbi.2009.01.005] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/17/2008] [Revised: 01/22/2009] [Accepted: 01/22/2009] [Indexed: 05/09/2023]
Abstract
Plant conserved noncoding sequences (CNSs)--a specific category of phylogenetic footprint--have been shown experimentally to function. No plant CNS is conserved to the extent that ultraconserved noncoding sequences are conserved in vertebrates. Plant CNSs are enriched in known transcription factor or other cis-acting binding sites, and are usually clustered around genes. Genes that encode transcription factors and/or those that respond to stimuli are particularly CNS-rich. Only rarely could this function involve small RNA binding. Some transcribed CNSs encode short translation products as a form of negative control. Approximately 4% of Arabidopsis gene content is estimated to be both CNS-rich and occupies a relatively long stretch of chromosome: Bigfoot genes (long phylogenetic footprints). We discuss a 'DNA-templated protein assembly' idea that might help explain Bigfoot gene CNSs.
Collapse
Affiliation(s)
- Michael Freeling
- Department of Plant and Microbial Biology, University of California, Berkeley, CA 94720, USA.
| | | |
Collapse
|
32
|
Isolation and characterization of conserved non-coding sequences among rice (Oryza sativa L.) paralogous regions. Mol Genet Genomics 2008; 281:11-8. [DOI: 10.1007/s00438-008-0388-4] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2008] [Accepted: 09/14/2008] [Indexed: 01/07/2023]
|
33
|
Abstract
Comparative genomics is a powerful tool for gaining insight into genomic function and evolution. However, in plants, sequence data that would enable detailed comparisons of both coding and noncoding regions have been limited in availability. Here we report the generation and analysis of sequences for an unduplicated conserved syntenic segment (CSS) in the genomes of five members of the agriculturally important plant family Solanaceae. This CSS includes a 105-kb region of tomato chromosome 2 and orthologous regions of the potato, eggplant, pepper, and petunia genomes. With a total neutral divergence of 0.73-0.78 substitutions/site, these sequences are similar enough that most noncoding regions can be aligned, yet divergent enough to be informative about evolutionary dynamics and selective pressures. The CSS contains 17 distinct genes with generally conserved order and orientation, but with numerous small-scale differences between species. Our analysis indicates that the last common ancestor of these species lived approximately 27-36 million years ago, that more than one-third of short genomic segments (5-15 bp) are under selection, and that more than two-thirds of selected bases fall in noncoding regions. In addition, we identify genes under positive selection and analyze hundreds of conserved noncoding elements. This analysis provides a window into 30 million years of plant evolution in the absence of polyploidization.
Collapse
|
34
|
Schütze K, Harter K, Chaban C. Post-translational regulation of plant bZIP factors. TRENDS IN PLANT SCIENCE 2008; 13:247-55. [PMID: 18424222 DOI: 10.1016/j.tplants.2008.03.002] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/19/2007] [Revised: 02/14/2008] [Accepted: 03/21/2008] [Indexed: 05/07/2023]
Abstract
The post-translational regulation of transcription factors plays an important role in the control of gene expression in eukaryotes. The mechanisms of regulation include not only factor modifications but also regulated protein-protein interaction, protein degradation and intracellular partitioning. In plants, the basic-region leucine zipper (bZIP) transcription factors contribute to many transcriptional response pathways. Despite this, little is known about their post-translational regulation. Recent findings suggest that plant bZIP factors are under the control of various partially signal-induced and reversible post-translational mechanisms that are crucial for the control of their function. However, the fact that, to date, only a few plant bZIPs have been analyzed with respect to post-translational regulation indicates that we have just identified the tip of an iceberg.
Collapse
Affiliation(s)
- Katia Schütze
- Zentrum für Molekularbiologie der Pflanzen, Pflanzenphysiologie, Universität Tübingen, Auf der Morgenstelle 1, Tübingen, Germany
| | | | | |
Collapse
|
35
|
Lyons E, Freeling M. How to usefully compare homologous plant genes and chromosomes as DNA sequences. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2008; 53:661-73. [PMID: 18269575 DOI: 10.1111/j.1365-313x.2007.03326.x] [Citation(s) in RCA: 308] [Impact Index Per Article: 19.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/19/2023]
Abstract
There are four sequenced and publicly available plant genomes to date. With many more slated for completion, one challenge will be to use comparative genomic methods to detect novel evolutionary patterns in plant genomes. This research requires sequence alignment algorithms to detect regions of similarity within and among genomes. However, different alignment algorithms are optimized for identifying different types of homologous sequences. This review focuses on plant genome evolution and provides a tutorial for using several sequence alignment algorithms and visualization tools to detect useful patterns of conservation: conserved non-coding sequences, false positive noise, subfunctionalization, synteny, annotation errors, inversions and local duplications. Our tutorial encourages the reader to experiment online with the reviewed tools as a companion to the text.
Collapse
Affiliation(s)
- Eric Lyons
- Department of Plant and Microbial Biology, University of California, Berkeley, Berkeley, CA 94720, USA.
| | | |
Collapse
|