1
|
Gorjifard S, Jores T, Tonnies J, Mueth NA, Bubb K, Wrightsman T, Buckler ES, Fields S, Cuperus JT, Queitsch C. Arabidopsis and maize terminator strength is determined by GC content, polyadenylation motifs and cleavage probability. Nat Commun 2024; 15:5868. [PMID: 38997252 PMCID: PMC11245536 DOI: 10.1038/s41467-024-50174-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2023] [Accepted: 07/03/2024] [Indexed: 07/14/2024] Open
Abstract
The 3' end of a gene, often called a terminator, modulates mRNA stability, localization, translation, and polyadenylation. Here, we adapted Plant STARR-seq, a massively parallel reporter assay, to measure the activity of over 50,000 terminators from the plants Arabidopsis thaliana and Zea mays. We characterize thousands of plant terminators, including many that outperform bacterial terminators commonly used in plants. Terminator activity is species-specific, differing in tobacco leaf and maize protoplast assays. While recapitulating known biology, our results reveal the relative contributions of polyadenylation motifs to terminator strength. We built a computational model to predict terminator strength and used it to conduct in silico evolution that generated optimized synthetic terminators. Additionally, we discover alternative polyadenylation sites across tens of thousands of terminators; however, the strongest terminators tend to have a dominant cleavage site. Our results establish features of plant terminator function and identify strong naturally occurring and synthetic terminators.
Collapse
Affiliation(s)
- Sayeh Gorjifard
- Department of Genome Sciences, University of Washington, Seattle, WA, 98195, USA
| | - Tobias Jores
- Department of Genome Sciences, University of Washington, Seattle, WA, 98195, USA
| | - Jackson Tonnies
- Department of Genome Sciences, University of Washington, Seattle, WA, 98195, USA
- Graduate Program in Biology, University of Washington, Seattle, WA, 98195, USA
| | - Nicholas A Mueth
- Department of Genome Sciences, University of Washington, Seattle, WA, 98195, USA
| | - Kerry Bubb
- Department of Genome Sciences, University of Washington, Seattle, WA, 98195, USA
| | - Travis Wrightsman
- Section of Plant Breeding and Genetics, Cornell University, Ithaca, NY, 14853, USA
| | - Edward S Buckler
- Section of Plant Breeding and Genetics, Cornell University, Ithaca, NY, 14853, USA
- Agricultural Research Service, United States Department of Agriculture, Ithaca, NY, 14853, USA
- Institute for Genomic Diversity, Cornell University, Ithaca, NY, 14853, USA
| | - Stanley Fields
- Department of Genome Sciences, University of Washington, Seattle, WA, 98195, USA
- Department of Medicine, University of Washington, Seattle, WA, 98195, USA
| | - Josh T Cuperus
- Department of Genome Sciences, University of Washington, Seattle, WA, 98195, USA
| | - Christine Queitsch
- Department of Genome Sciences, University of Washington, Seattle, WA, 98195, USA.
| |
Collapse
|
2
|
Phan NT, Orjuela J, Danchin EGJ, Klopp C, Perfus‐Barbeoch L, Kozlowski DK, Koutsovoulos GD, Lopez‐Roques C, Bouchez O, Zahm M, Besnard G, Bellafiore S. Genome structure and content of the rice root-knot nematode ( Meloidogyne graminicola). Ecol Evol 2020; 10:11006-11021. [PMID: 33144944 PMCID: PMC7593179 DOI: 10.1002/ece3.6680] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2020] [Revised: 07/04/2020] [Accepted: 07/17/2020] [Indexed: 12/15/2022] Open
Abstract
Discovered in the 1960s, Meloidogyne graminicola is a root-knot nematode species considered as a major threat to rice production. Yet, its origin, genomic structure, and intraspecific diversity are poorly understood. So far, such studies have been limited by the unavailability of a sufficiently complete and well-assembled genome. In this study, using a combination of Oxford Nanopore Technologies and Illumina sequencing data, we generated a highly contiguous reference genome (283 scaffolds with an N50 length of 294 kb, totaling 41.5 Mb). The completeness scores of our assembly are among the highest currently published for Meloidogyne genomes. We predicted 10,284 protein-coding genes spanning 75.5% of the genome. Among them, 67 are identified as possibly originating from horizontal gene transfers (mostly from bacteria), which supposedly contribute to nematode infection, nutrient processing, and plant defense manipulation. Besides, we detected 575 canonical transposable elements (TEs) belonging to seven orders and spanning 2.61% of the genome. These TEs might promote genomic plasticity putatively related to the evolution of M. graminicola parasitism. This high-quality genome assembly constitutes a major improvement regarding previously available versions and represents a valuable molecular resource for future phylogenomic studies of Meloidogyne species. In particular, this will foster comparative genomic studies to trace back the evolutionary history of M. graminicola and its closest relatives.
Collapse
Affiliation(s)
- Ngan Thi Phan
- IRD‐CIRAD‐University of MontpellierUMR Interactions Plantes Microorganismes Environnement (IPME)MontpellierFrance
| | - Julie Orjuela
- IRD‐CIRAD‐University of MontpellierUMR Interactions Plantes Microorganismes Environnement (IPME)MontpellierFrance
| | | | - Christophe Klopp
- Plateforme BioInfo GenotoulUR875INRAECastanet‐Tolosan cedexFrance
| | | | - Djampa K. Kozlowski
- Institut Sophia AgrobiotechINRAECNRSUniversité Côte d’AzurSophia AntipolisFrance
| | | | | | | | - Margot Zahm
- Plateforme BioInfo GenotoulUR875INRAECastanet‐Tolosan cedexFrance
| | | | - Stéphane Bellafiore
- IRD‐CIRAD‐University of MontpellierUMR Interactions Plantes Microorganismes Environnement (IPME)MontpellierFrance
| |
Collapse
|
3
|
Wu X, Gaffney B, Hunt AG, Li QQ. Genome-wide determination of poly(A) sites in Medicago truncatula: evolutionary conservation of alternative poly(A) site choice. BMC Genomics 2014; 15:615. [PMID: 25048171 PMCID: PMC4117952 DOI: 10.1186/1471-2164-15-615] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2014] [Accepted: 07/15/2014] [Indexed: 11/16/2022] Open
Abstract
Background Alternative polyadenylation (APA) plays an important role in the post-transcriptional regulation of gene expression. Little is known about how APA sites may evolve in homologous genes in different plant species. To this end, comparative studies of APA sites in different organisms are needed. In this study, a collection of poly(A) sites in Medicago truncatula, a model system for legume plants, has been generated and compared with APA sites in Arabidopsis thaliana. Results The poly(A) tags from a deep-sequencing protocol were mapped to the annotated M. truncatula genome, and the identified poly(A) sites used to update the annotations of 14,203 genes. The results show that 64% of M. truncatula genes possess more than one poly(A) site, comparable to the percentages reported for Arabidopsis and rice. In addition, the poly(A) signals associated with M. truncatula genes were similar to those seen in Arabidopsis and other plants. The 3′-UTR lengths are correlated in pairs of orthologous genes between M. truncatula and Arabidopsis. Very little conservation of intronic poly(A) sites was found between Arabidopsis and M. truncatula, which suggests that such sites are likely to be species-specific in plants. In contrast, there is a greater conservation of CDS-localized poly(A) sites in these two species. A sizeable number of M. truncatula antisense poly(A) sites were found. A high percentage of the associated target genes possess Arabidopsis orthologs that are also associated with antisense sites. This is suggestive of important roles for antisense regulation of these target genes. Conclusions Our results reveal some distinct patterns of sense and antisense poly(A) sites in Arabidopsis and M. truncatula. In so doing, this study lends insight into general evolutionary trends of alternative polyadenylation in plants. Electronic supplementary material The online version of this article (doi:10.1186/1471-2164-15-615) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
| | | | - Arthur G Hunt
- Department of Plant and Soil Sciences, University of Kentucky, Lexington, KY, USA.
| | | |
Collapse
|
4
|
Catania F, Lynch M. A simple model to explain evolutionary trends of eukaryotic gene architecture and expression: how competition between splicing and cleavage/polyadenylation factors may affect gene expression and splice-site recognition in eukaryotes. Bioessays 2013; 35:561-70. [PMID: 23568225 DOI: 10.1002/bies.201200127] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
Enormous phylogenetic variation exists in the number and sizes of introns in protein-coding genes. Although some consideration has been given to the underlying role of the population-genetic environment in defining such patterns, the influence of the intracellular environment remains virtually unexplored. Drawing from observations on interactions between co-transcriptional processes involved in splicing and mRNA 3'-end formation, a mechanistic model is proposed for splice-site recognition that challenges the commonly accepted intron- and exon-definition models. Under the suggested model, splicing factors that outcompete 3'-end processing factors for access to intronic binding sites concurrently favor the recruitment of 3'-end processing factors at the pre-mRNA tail. This hypothesis sheds new light on observations such as the intron-mediated enhancement of gene expression and the negative correlation between intron length and levels of gene expression.
Collapse
Affiliation(s)
- Francesco Catania
- Institute for Evolution and Biodiversity, University of Münster, Münster, Germany.
| | | |
Collapse
|
5
|
Mathew LG, Maloney B, Takeda N, Mason HS. Spurious polyadenylation of Norovirus Narita 104 capsid protein mRNA in transgenic plants. PLANT MOLECULAR BIOLOGY 2011; 75:263-75. [PMID: 21203799 DOI: 10.1007/s11103-010-9725-1] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/23/2010] [Accepted: 12/22/2010] [Indexed: 05/30/2023]
Abstract
Noroviruses are members of the family Caliciviridae, and cause a highly communicable gastroenteritis in humans. We explored the potential to develop a plant-based vaccine against Narita 104 virus, a Genogroup II Norovirus. In stably transgenic potato, we obtained very poor expression of Narita 104 virus capsid protein (NaVCP) despite the use of a strong constitutive promoter (dual enhancer 35S) driving the native coding sequence. We identified potentially detrimental sequence motifs that could mediate aberrant mRNA processing via spurious polyadenylation signals. Northern blots and RT-PCR analysis of total RNA revealed truncated transcripts that suggested premature polyadenylation. Site-directed mutagenesis to remove one potential polyadenylation near-upstream element resulted in an increased expression of NaVCP when transiently expressed in leaves of Nicotiana benthamiana. Further, cloning of the truncated cDNAs from transgenic NaVCP potato plants and transiently transfected N. benthamiana allowed us to identify at least ten different truncated transcripts resulting from premature polyadenylation of full length NaVCP transcripts. Comparative studies using real time PCR analysis from cDNA samples revealed lower accumulation of full length transcripts of NaVCP as compared to those from a gene encoding Norwalk Virus capsid protein (a related Genogroup I Norovirus) in transiently transfected plants. These findings provide evidence for impaired expression of NaVCP in transgenic plants mediated by spurious polyadenylation signals, and demonstrate the need to scrupulously search for potential polyadenylation signals in order to improve transgene expression in plants.
Collapse
Affiliation(s)
- Lolita G Mathew
- Center for Infectious Diseases and Vaccinology (CIDV), The Biodesign Institute at Arizona State University, 1001 South McAllister Avenue, Tempe, AZ 85287-5401, USA
| | | | | | | |
Collapse
|
6
|
Abstract
The systems for mRNA surveillance, capping, and cleavage/polyadenylation are proposed to play pivotal roles in the physical establishment and distribution of spliceosomal introns along a transcript.
Collapse
|
7
|
Kim MJ, Kim H, Shin JS, Chung CH, Ohlrogge JB, Suh MC. Seed-specific expression of sesame microsomal oleic acid desaturase is controlled by combinatorial properties between negative cis-regulatory elements in the SeFAD2 promoter and enhancers in the 5'-UTR intron. Mol Genet Genomics 2006; 276:351-68. [PMID: 16862401 DOI: 10.1007/s00438-006-0148-2] [Citation(s) in RCA: 67] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2006] [Accepted: 06/26/2006] [Indexed: 10/24/2022]
Abstract
The regulation of genes involved in primary lipid metabolism in plants is much less well understood than that in many other pathways in plant biology. In the investigation reported here, we have characterized transcriptional regulatory mechanisms controlling seed-specific FAD2 expression in sesame (Sesamum indicum). FAD2 codes for extra-plastidial FAD2 desaturase, which catalyzes the conversion of oleic acid to linoleic acid. Promoter analysis of the sesame FAD2 gene (SeFAD2) using the beta-glucuronidase (GUS) reporter system demonstrated that the - 660 to - 180 promoter region functions as a negative cis-element in the seed-specific expression of the SeFAD2 gene. Sesame and Arabidopsis FAD2 genes harbor one large intron within their 5'-untranslated region. These introns conferred up to 100-fold enhancement of GUS expression in transgenic Arabidopsis tissues as compared with intron-less controls. Prerequisite cis-elements for the SeFAD2 intron-mediated enhancement of gene expression and the promoter-like activity of SeFAD2 intron were identified. SeFAD2 transcripts were induced by abscisic acid (ABA) in developing sesame seeds, and the - 660 to - 548 and - 179 to - 53 regions in the SeFAD2 promoter were implicated in ABA-responsive signaling. Theses observations indicate that an intron-mediated regulatory mechanism is involved in controlling not only the seed-specific expression of the SeFAD2 gene but also the expression of plant FAD2 genes, which are essential for the synthesis of polyunsaturated fatty acids.
Collapse
Affiliation(s)
- Mi Jung Kim
- School of Life Sciences and Biotechnology, Korea University, Seoul, 136-701, South Korea
| | | | | | | | | | | |
Collapse
|
8
|
Chung BYW, Simons C, Firth AE, Brown CM, Hellens RP. Effect of 5'UTR introns on gene expression in Arabidopsis thaliana. BMC Genomics 2006; 7:120. [PMID: 16712733 PMCID: PMC1482700 DOI: 10.1186/1471-2164-7-120] [Citation(s) in RCA: 144] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2006] [Accepted: 05/19/2006] [Indexed: 11/02/2022] Open
Abstract
BACKGROUND The majority of introns in gene transcripts are found within the coding sequences (CDSs). A small but significant fraction of introns are also found to reside within the untranslated regions (5'UTRs and 3'UTRs) of expressed sequences. Alignment of the whole genome and expressed sequence tags (ESTs) of the model plant Arabidopsis thaliana has identified introns residing in both coding and non-coding regions of the genome. RESULTS A bioinformatic analysis revealed some interesting observations: (1) the density of introns in 5'UTRs is similar to that in CDSs but much higher than that in 3'UTRs; (2) the 5'UTR introns are preferentially located close to the initiating ATG codon; (3) introns in the 5'UTRs are, on average, longer than introns in the CDSs and 3'UTRs; and (4) 5'UTR introns have a different nucleotide composition to that of CDS and 3'UTR introns. Furthermore, we show that the 5'UTR intron of the A. thaliana EF1alpha-A3 gene affects the gene expression and the size of the 5'UTR intron influences the level of gene expression. CONCLUSION Introns within the 5'UTR show specific features that distinguish them from introns that reside within the coding sequence and the 3'UTR. In the EF1alpha-A3 gene, the presence of a long intron in the 5'UTR is sufficient to enhance gene expression in plants in a size dependent manner.
Collapse
Affiliation(s)
- Betty YW Chung
- Biochemistry Department, University of Otago, Dunedin, New Zealand
- Bioscience Institute, University College Cork, Cork, Ireland
| | - Cas Simons
- HortResearch, Auckland, New Zealand
- Institute of Molecular Biosciences, Brisbane, Australia
| | - Andrew E Firth
- Biochemistry Department, University of Otago, Dunedin, New Zealand
| | - Chris M Brown
- Biochemistry Department, University of Otago, Dunedin, New Zealand
| | | |
Collapse
|
9
|
Kimura T, Tanaka Y, Hanada K, Takio S, Saito A. Cryptic polyadenylation of transcripts of an RNA virus gene introduced into tobacco plants. Biosci Biotechnol Biochem 2005; 69:2005-8. [PMID: 16244460 DOI: 10.1271/bbb.69.2005] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023]
Abstract
We constructed an expression vector for the coat protein (CP) gene and the 3' untranslated region (3' UTR) of RNA virus (sweet potato feathery mottle virus severe strain (SPFMV-S)) lacking a foreign terminator. Out of seven transgenic tobacco plants, expression of the transgene was observed in six plants. RT-PCR analysis revealed that the transcripts had a poly(A) tail, and in most of them, polyadenylation occurred on the 5' side of the 3' UTR. These results suggest that the viral sequence contains a cryptic polyadenylation signal that permits 3'-end processing of the transcripts.
Collapse
Affiliation(s)
- Takashi Kimura
- National Agricultural Research Center for Kyushu Okinawa Region, National Agriculture and Bio-oriented Research Organization, Kikuchi, Kumamoto, Japan.
| | | | | | | | | |
Collapse
|
10
|
Abraham D, Löfstedt C, Picimbon JF. Molecular characterization and evolution of pheromone binding protein genes in Agrotis moths. INSECT BIOCHEMISTRY AND MOLECULAR BIOLOGY 2005; 35:1100-11. [PMID: 16102416 DOI: 10.1016/j.ibmb.2005.05.002] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/03/2005] [Accepted: 05/06/2005] [Indexed: 05/04/2023]
Abstract
Pheromone-binding proteins (PBPs) are soluble transporter proteins that increase the capture and the solubilization of pheromone molecules in the lymph surrounding the olfactory receptors. A polymerase chain reaction-based method was used to identify PBP genes in Agrotis species for an evolutionary genomic study of noctuid moth PBPs. From genomic DNA we determined the structure of different PBP genes in the two closely related species, Agrotis ipsilon and A. segetum. In all, we clearly identified four genes (Aips-1, Aips-2, Aseg-1 and Aseg-2) that represent two distinct PBP orthology groups. We found that the four genes have the same exon-intron structure and that they comprise three exons and two introns but differ in length mainly in the second intron. The three exons of Aseg-2 and Aips-2 have the same lengths but both intron 1 and intron 2 differ in length between the genes. In contrast, Aips-1 and Aseg-1 show dissimilarity only in the length of intron 2. Interestingly, introns 1 and 2 are inserted in the same positions in the Aips-1, Aips-2, Aseg-1 and Aseg-2 genes. These findings show that the Agrotis PBP genes have common ancestry and probably originate from gene duplication before the speciation of ipsilon and segetum. We found that expression of Aips-1/Aseg-1 and Aips-2/Aseg-2 is antennal-specific, but expression is not restricted to the male antennae.
Collapse
Affiliation(s)
- David Abraham
- Department of Ecology, Lund University, Ecology building, SE-22362, Sweden
| | | | | |
Collapse
|
11
|
Simpson CG, Jennings SN, Clark GP, Thow G, Brown JWS. Dual functionality of a plant U-rich intronic sequence element. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2004; 37:82-91. [PMID: 14675434 DOI: 10.1046/j.1365-313x.2003.01941.x] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
In potato invertase genes, the constitutively included, 9-nucleotide (nt)-long mini-exon requires a strong branchpoint and U-rich polypyrimidine tract for inclusion. The strength of these splicing signals was demonstrated by greatly enhanced splicing of a poorly spliced intron and by their ability to support splicing of an artificial mini-exon, following their introduction. Plant introns also require a second splicing signal, UA-rich intronic elements, for efficient intron splicing. Mutation of the branchpoint caused loss of mini-exon inclusion without loss of splicing enhancement, showing that the same U-rich sequence can function as either a polypyrimidine tract or a UA-rich intronic element. The distinction between the splicing signals depended on intron context (the presence or absence of an upstream, adjacent and functional branchpoint), and on the sequence context of the U-rich elements. Polypyrimidine tracts tolerated C residues while UA-rich intronic elements tolerated As. Thus, in plant introns, U-rich splicing elements can have dual roles as either a general plant U-rich splicing signal or a polypyrimidine tract. Finally, overexpression of two different U-rich binding proteins enhanced intron recognition significantly. These results highlight the importance of co-operation between splicing signals, the importance of other nucleotides within U-rich elements for optimal binding of competing splicing factors and effects on splicing efficiency of U-rich binding proteins.
Collapse
Affiliation(s)
- Craig G Simpson
- Gene Expression, Scottish Crop Research Institute, Invergowrie, Dundee, DD2 5DA Scotland, UK
| | | | | | | | | |
Collapse
|
12
|
Menossi M, Rabaneda F, Puigdomènech P, Martínez-Izquierdo JA. Analysis of regulatory elements of the promoter and the 3' untranslated region of the maize Hrgp gene coding for a cell wall protein. PLANT CELL REPORTS 2003; 21:916-923. [PMID: 12789511 DOI: 10.1007/s00299-003-0602-0] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/07/2002] [Revised: 01/25/2003] [Accepted: 01/27/2003] [Indexed: 05/24/2023]
Abstract
Hydroxyproline-rich glycoproteins (HRGP) are structural components of the plant cell wall. Hrgp genes from maize and related species have a conserved 500 bp sequence in the 5'-flanking region, and all Hrgp genes from monocots have an intron located in the 3' untranslated region. To study the role of these conserved regions, several deletions of the Hrgp gene were fused to the beta-glucuronidase ( GUS) gene and used to transform maize tissues by particle bombardment. The overall pattern of GUS activity directed by sequential deletions of the Hrgp promoter was different in embryos and young shoots. In embryos, the activity of the full-length Hrgp promoter was in the same range as that of the p35SI promoter construct, based on the strong 35S promoter, whereas in the fast-growing young shoots it was 20 times higher. A putative silencer element specific for young shoots was found in the -1,076/-700 promoter region. Other major cis elements for Hrgp expression are probably located in the regions spanning -699/-510 and -297/-160. Sequences close to the initial ATG and mRNA leader were also important since deletion of the region -52/+16 caused a 75% reduction in promoter activity. The presence of the Hrgp intron in the 3' untranslated region changed the levels of GUS activity directed by the Hrgp and the 35S promoters. This pattern of activity was complex, and was dependent on the promoter and cell type analysed.
Collapse
Affiliation(s)
- M Menossi
- Centro de Biologia Molecular e Engenharia Genética, Universidade Estadual de Campinas (UNICAMP), CxP 6010, CEP 13083-970, Campinas SP, Brazil.
| | | | | | | |
Collapse
|
13
|
Abstract
Debate over the mechanisms responsible for the phylogenetic and genomic distribution of introns has proceeded largely without consideration of the population-genetic forces influencing the establishment and retention of novel genetic elements. However, a simple model incorporating random genetic drift and weak mutation pressure against intron-containing alleles yields predictions consistent with a diversity of observations: (i) the rarity of introns in unicellular organisms with large population sizes, and their expansion after the origin of multicellular organisms with reduced population sizes; (ii) the relationship between intron abundance and the stringency of splice-site requirements; (iii) the tendency for introns to be more numerous and longer in regions of low recombination; and (iv) the bias toward phase-0 introns. This study provides a second example of a mechanism whereby genomic complexity originates passively as a "pathological" response to small population size, and raises difficulties for the idea that ancient introns played a major role in the origin of genes by exon shuffling.
Collapse
Affiliation(s)
- Michael Lynch
- Department of Biology, Indiana University, Bloomington, IN 47405, USA.
| |
Collapse
|
14
|
Isshiki M, Nakajima M, Satoh H, Shimamoto K. dull: rice mutants with tissue-specific effects on the splicing of the waxy pre-mRNA. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2000; 23:451-460. [PMID: 10972871 DOI: 10.1046/j.1365-313x.2000.00803.x] [Citation(s) in RCA: 38] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
In the endosperm of japonica rice, du-1 and du-2 mutations cause the reduction of amylose contents. It was previously shown that the Wx(b) allele of rice, which is predominantly distributed in japonica rice, has a mutation in the 5' splice site of intron 1 resulting in the creation of two weak 5' splice sites within exon 1. In du-1 and du-2 mutants, spliced Wx(b) transcripts were highly reduced, whereas the processing of transcripts derived from three other genes highly expressed in endosperm was not apparently influenced. Results of competitive RT-PCR analysis indicate that transcripts spliced at the two newly created 5' splice sites were equally affected in these two mutants. Genetic and molecular analyses of the effects of du-1 and du-2 on Wx(a) pre-mRNA with normal splice sites indicate that these two mutations do not affect the processing of Wx(a) pre-mRNA after splicing, suggesting that du-1 and du-2 are mutations of genes required for the efficient splicing of mutated Wx(b) pre-mRNA. Furthermore, du-1 and du-2 showed differential effects in endosperm and pollen. Although both mutations caused similar effects on the splicing of Wx(a) transcripts in endosperm, du-1 caused higher reduction of Wx(b) mRNA in pollen than in endosperm, while du-2 had a lesser effect in pollen than in endosperm. Based on these results, we propose that the du-1 and du-2 loci of rice encode tissue-specifically regulated splicing factors that are involved in alternative splicing of pre-mRNA in rice.
Collapse
Affiliation(s)
- M Isshiki
- Laboratory of Plant Molecular Genetics, Nara Institute of Science and Technology, 8916-5 Takayama, Ikoma 630-0101, Japan
| | | | | | | |
Collapse
|
15
|
Lambermon MH, Simpson GG, Wieczorek Kirk DA, Hemmings-Mieszczak M, Klahre U, Filipowicz W. UBP1, a novel hnRNP-like protein that functions at multiple steps of higher plant nuclear pre-mRNA maturation. EMBO J 2000; 19:1638-49. [PMID: 10747031 PMCID: PMC310232 DOI: 10.1093/emboj/19.7.1638] [Citation(s) in RCA: 89] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/1999] [Revised: 02/02/2000] [Accepted: 02/04/2000] [Indexed: 01/06/2023] Open
Abstract
Efficient splicing of higher plant pre-mRNAs depends on AU- or U-rich sequences in introns. Moreover, AU-rich sequences present in 3'-untranslated regions (3'-UTRs) may play a role in 3' end processing of plant mRNAs. Here, we describe the cloning and characterization of a Nicotiana plumbaginifolia nuclear protein that can be cross-linked to U-rich intron and 3'-UTR sequences in vitro, and associates with nuclear poly(A)(+) RNA in vivo. The protein, UBP1, strongly enhances the splicing of otherwise inefficiently processed introns when overexpressed in protoplasts. It also increases the accumulation of reporter mRNAs that contain suboptimal introns or are intronless. The enhanced accumulation is apparently due to UBP1 interacting with the 3'-UTR and protecting mRNA from exonucleolytic degradation. The effect on mRNA accumulation but not on mRNA splicing was found to be promoter specific. The fact that these effects of UBP1 can be separated suggests that they represent two independent activities. The properties of UBP1 indicate that it is an hnRNP protein that functions at multiple steps to facilitate the nuclear maturation of plant pre-mRNAs.
Collapse
Affiliation(s)
- M H Lambermon
- Friedrich Miescher-Institut, PO Box 2543, 4002 Basel, Switzerland
| | | | | | | | | | | |
Collapse
|
16
|
Grec S, Wang Y, Le Guen L, Negrouk V, Boutry M. Cryptic polyadenylation sites within the coding sequence of three yeast genes expressed in tobacco. Gene 2000; 242:87-95. [PMID: 10721700 DOI: 10.1016/s0378-1119(99)00544-2] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Three yeast genes, MIP (mitochondrial DNA polymerase) and two genes, YCF1 (yeast cadmium factor 1) and PDR5 (pleiotropic drug resistance 5), conferring multidrug resistance, were provided with the cauliflower mosaic virus 35S transcription promoter and introduced into tobacco using an Agrobacterium tumefaciens T-DNA-derived vector. Transcripts of each gene much shorter than those expected were found in the transgenic plants. RT-PCR and S1 nuclease mapping of the PDR5 and MIP transcripts demonstrated the presence of one (PDR5), or several close (MIP), cryptic polyadenylation site(s) within the coding sequence of these yeast genes. Possible sequences involved in polyadenylation are discussed.
Collapse
Affiliation(s)
- S Grec
- Unité de Biochimie Physiologique, Université Catholique de Louvain, Louvain la Neuve, Belgium
| | | | | | | | | |
Collapse
|
17
|
Golovkin M, Reddy AS. An SC35-like protein and a novel serine/arginine-rich protein interact with Arabidopsis U1-70K protein. J Biol Chem 1999; 274:36428-38. [PMID: 10593939 DOI: 10.1074/jbc.274.51.36428] [Citation(s) in RCA: 103] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
The U1 small nuclear ribonucleoprotein 70-kDa protein, a U1 small nuclear ribonucleoprotein-specific protein, has been shown to have multiple roles in nuclear precursor mRNA processing in animals. By using the C-terminal arginine-rich region of Arabidopsis U1-70K protein in the yeast two-hybrid system, we have identified an SC35-like (SR33) and a novel plant serine/arginine-rich (SR) protein (SR45) that interact with the plant U1-70K. The SR33 and SR45 proteins share several features with SR proteins including modular domains typical of splicing factors in the SR family of proteins. However, both plant SR proteins are rich in proline, and SR45, unlike most animal SR proteins, has two distinct arginine/serine-rich domains separated by an RNA recognition motif. By using coprecipitation assays we confirmed the interaction of plant U1-70K with SR33 and SR45 proteins. Furthermore, in vivo and in vitro protein-protein interaction experiments have shown that SR33 protein interacts with itself and with SR45 protein but not with two other members (SRZ21 and SRZ22) of the SR family that are known to interact with the Arabidopsis full-length U-70K only. A Clk/Sty protein kinase (AFC-2) from Arabidopsis phosphorylated four SR proteins (SR33, SR45, SRZ21, and SRZ22). Coprecipitation studies have confirmed the interaction of SR proteins with AFC2 kinase, and the interaction between AFC2 and SR33 is modulated by the phosphorylation status of these proteins. These and our previous results suggest that the plant U1-70K interacts with at least four distinct members of the SR family including SR45 with its two arginine/serine-rich domains, and the interaction between the SR proteins and AFC2 is modulated by phosphorylation. The interaction of plant U1-70K with a novel set of proteins suggests the early stages of spliceosome assembly, and intron recognition in plants is likely to be different from animals.
Collapse
Affiliation(s)
- M Golovkin
- Department of Biology and Program in Cell and Molecular Biology, Colorado State University, Fort Collins, Colorado 80523, USA
| | | |
Collapse
|
18
|
Lal S, Choi JH, Shaw JR, Hannah LC. A splice site mutant of maize activates cryptic splice sites, elicits intron inclusion and exon exclusion, and permits branch point elucidation. PLANT PHYSIOLOGY 1999; 121:411-8. [PMID: 10517832 PMCID: PMC59403 DOI: 10.1104/pp.121.2.411] [Citation(s) in RCA: 22] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/23/1999] [Accepted: 06/25/1999] [Indexed: 05/17/2023]
Abstract
DNA sequence analysis of the bt2-7503 mutant allele of the maize brittle-2 gene revealed a point mutation in the 5' terminal sequence of intron 3 changing GT to AT. This lesion completely abolishes use of this splice site, activates two cryptic splice sites, and alters the splicing pattern from extant splice sites. One activated donor site, located nine nt 5' to the normal splice donor site, begins with the dinucleotide GC. While non-consensus, this sequence still permits both trans-esterification reactions of pre-mRNA splicing. A second cryptic site located 23 nt 5' to the normal splice site and beginning with GA, undergoes the first trans-esterification reaction leading to lariat formation, but lacks the ability to participate in the second reaction. Accumulation of this splicing intermediate and use of an innovative reverse transcriptase-polymerase chain reaction technique (J. Vogel, R.H. Wolfgang, T. Borner [1997] Nucleic Acids Res 25: 2030-2031) led to the identification of 3' intron sequences needed for lariat formation. In most splicing reactions, neither cryptic site is recognized. Most mature transcripts include intron 3, while the second most frequent class lacks exon 3. Traditionally, the former class of transcripts is taken as evidence for the intron definition of splicing, while the latter class has given credence to the exon definition of splicing.
Collapse
Affiliation(s)
- S Lal
- Program in Plant Molecular and Cellular Biology and Horticultural Sciences, 1143 Fifield Hall, P.O. Box 110690, University of Florida, Gainesville, Florida 32611-0690, USA
| | | | | | | |
Collapse
|
19
|
Lal S, Choi JH. The AG dinucleotide terminating introns is important but not always required for pre-mRNA splicing in the maize endosperm. PLANT PHYSIOLOGY 1999; 120:65-72. [PMID: 10318684 PMCID: PMC59270 DOI: 10.1104/pp.120.1.65] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/16/1998] [Accepted: 01/25/1999] [Indexed: 05/18/2023]
Abstract
Previous RNA analysis of lesions within the 15 intron-containing Sh2 (shrunken2) gene of maize (Zea mays) revealed that the majority of these mutants affect RNA splicing. Here we decipher further two of these mutants, sh2-i (shrunken2 intermediate phenotype) and sh2-7460. Each harbors a G-to-A transition in the terminal nucleotide of an intron, hence destroying the invariant AG found at the terminus of virtually all nuclear introns. Consequences of the mutations, however, differ dramatically. In sh2-i the mutant site is recognized as an authentic splice site in approximately 10% of the primary transcripts processed in the maize endosperm. The other transcripts exhibited exon skipping and lacked exon 3. A G-to-A transition in the terminus of an intron was also found in the mutant sh2-7460, in this case intron 12. The lesion activates a cryptic acceptor site downstream 22 bp within exon 13. In addition, approximately 50% of sh2-7460 transcripts contain intron 2 and 3 sequences.
Collapse
Affiliation(s)
- S Lal
- Program in Plant Molecular and Cellular Biology and Horticultural Sciences, 1143 Fifield Hall, P.O. Box 110690, University of Florida, Gainesville, Florida 32611-0690, USA
| | | |
Collapse
|
20
|
Domon C, Lorković ZJ, Valcárcel J, Filipowicz W. Multiple forms of the U2 small nuclear ribonucleoprotein auxiliary factor U2AF subunits expressed in higher plants. J Biol Chem 1998; 273:34603-10. [PMID: 9852132 DOI: 10.1074/jbc.273.51.34603] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Requirements for intron recognition during pre-mRNA splicing in plants differ from those in vertebrates and yeast. Plant introns contain neither conserved branch points nor distinct 3' splice site-proximal polypyrimidine tracts characteristic of the yeast and vertebrate introns, respectively. However, they are strongly enriched in U residues throughout the intron, property essential for splicing. To understand the roles of different sequence elements in splicing, we are characterizing proteins involved in intron recognition in plants. In this work we show that Nicotiana plumbaginifolia, a dicotyledonous plant, contains two genes encoding different homologs of the large 50-65-kDa subunit of the polypyrimidine tract binding factor U2AF, characterized previously in animals and Schizosaccharomyces pombe. Both plant U2AF65 isoforms, referred to as NpU2AF65a and NpU2AF65b, support splicing of an adenovirus pre-mRNA in HeLa cell nuclear extracts depleted of the endogenous U2AF factor. Both proteins interact with RNA fragments containing plant introns and show affinity for poly(U) and, to a lesser extend, poly(C) and poly(G). The branch point or the 3' splice site regions do not contribute significantly to intron recognition by NpU2AF65. The existence of multiple isoforms of U2AF may be quite general in plants because two genes expressing U2AF65 have been identified in Arabidopsis, and different isoforms of the U2AF small subunit are expressed in rice.
Collapse
Affiliation(s)
- C Domon
- Friedrich Miescher-Institut, CH-4002 Basel, Switzerland
| | | | | | | |
Collapse
|
21
|
Justesen AF, Jespersen HM, Welinder KG. Analysis of two incompletely spliced Arabidopsis cDNAs encoding novel types of peroxidase. BIOCHIMICA ET BIOPHYSICA ACTA 1998; 1443:149-54. [PMID: 9838086 DOI: 10.1016/s0167-4781(98)00205-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
Most known class III peroxidase genes contain three introns at conserved positions. Two Arabidopsis cDNAs (ESTs), encoding novel type peroxidases ATP9a and ATP15a were sequenced, and found to contain inserts for intron 2. PCR and sequence analysis of genomic DNA revealed that the atp9a gene contains all three introns, whereas atp15a contains only introns 2 and 3. The ATP15a cDNA intron contained a single base substitution reducing the splicing potential significantly as compared with the genomic sequence. The putative enzymes share essential catalytic and structural features with horseradish peroxidase, despite a pair-wise sequence identity of only 40-45% among the three.
Collapse
Affiliation(s)
- A F Justesen
- Department of Protein Chemistry, Institute of Molecular Biology, University of Copenhagen, Oster Farimagsgade 2A, DK-1353 Copenhagen K, Denmark
| | | | | |
Collapse
|
22
|
Ostergaard L, Pedersen AG, Jespersen HM, Brunak S, Welinder KG. Computational analyses and annotations of the Arabidopsis peroxidase gene family. FEBS Lett 1998; 433:98-102. [PMID: 9738941 DOI: 10.1016/s0014-5793(98)00849-7] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Classical heme-containing plant peroxidases have been ascribed a wide variety of functional roles related to development, defense, lignification, and hormonal signaling. More than 40 peroxidase genes are now known in Arabidopsis thaliana for which functional association is complicated by a general lack of peroxidase substrate specificity. Computational analysis was performed on 30 near full-length Arabidopsis peroxidase cDNAs for annotation of start codons and signal peptide cleavage sites. A compositional analysis revealed that 23 of the 30 peroxidase cDNAs have 5' untranslated regions containing 40-71% adenine, a rare feature observed also in cDNAs which predominantly encode stress-induced proteins, and which may indicate translational regulation.
Collapse
Affiliation(s)
- L Ostergaard
- Department of Protein Chemistry, Institute of Molecular Biology, University of Copenhagen, Denmark
| | | | | | | | | |
Collapse
|
23
|
Diehn SH, Chiu WL, De Rocher EJ, Green PJ. Premature polyadenylation at multiple sites within a Bacillus thuringiensis toxin gene-coding region. PLANT PHYSIOLOGY 1998; 117:1433-43. [PMID: 9701599 PMCID: PMC34907 DOI: 10.1104/pp.117.4.1433] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/11/1997] [Accepted: 05/09/1998] [Indexed: 05/20/2023]
Abstract
Some foreign genes introduced into plants are poorly expressed, even when transcription is controlled by a strong promoter. Perhaps the best examples of this problem are the cry genes of Bacillus thuringiensis (B.t.), which encode the insecticidal proteins commonly referred to as B.t. toxins. As a step toward overcoming such problems most effectively, we sought to elucidate the mechanisms limiting the expression of a typical B.t.-toxin gene, cryIA(c), which accumulates very little mRNA in tobacco (Nicotiana tabacum) cells. Most cell lines transformed with the cryIA(c) B.t.-toxin gene accumulate short, polyadenylated transcripts. The abundance of these transcripts can be increased by treating the cells with cycloheximide, a translation inhibitor that can stabilize many unstable transcripts. Using a series of hybridizations, reverse-transcriptase polymerase chain reactions, and RNase-H-digestion experiments, poly(A+) addition sites were identified in the B.t.-toxin-coding region corresponding to the short transcripts. A fourth polyadenylation site was identified using a chimeric gene. These results demonstrate for the first time to our knowledge that premature polyadenylation can limit the expression of a foreign gene in plants. Moreover, this work emphasizes that further study of the fundamental principles governing polyadenylation in plants will have basic as well as applied significance.
Collapse
Affiliation(s)
- S H Diehn
- Michigan State University-Department of Energy Plant Research Laboratory, East Lansing, 48824, USA
| | | | | | | |
Collapse
|
24
|
Simpson CG, McQuade C, Lyon J, Brown JW. Characterization of exon skipping mutants of the COP1 gene from Arabidopsis. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 1998; 15:125-131. [PMID: 9744100 DOI: 10.1046/j.1365-313x.1998.00184.x] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2023]
Abstract
The removal of introns from pre-mRNA requires accurate recognition and selection of the intron splice sites. Mutations which alter splice site selection and which lead to skipping of specific exons are indicative of intron/exon recognition mechanisms involving an exon definition process. In this paper, three independent mutants to the COP1 gene in Arabidopsis which show exon skipping were identified and the mutations which alter the normal splicing pattern were characterized. The mutation in cop1-1 was a G-->A change 4 nt upstream from the 3' splice site of intron 5, while the mutation in cop1-2 was a G-->A at the first nucleotide of intron 6, abolishing the conserved G within the 5' splice site consensus. The effect of these mutations was skipping of exon 6. The mutation in cop1-8 was G-->A in the final nucleotide of intron 10 abolishing the conserved G within the 3' splice site consensus and leading to skipping of exon 11. The splicing patterns surrounding exons 6 and 11 of COP1 in these three mutant lines of Arabidopsis provide evidence for exon definition mechanisms operating in plant splicing.
Collapse
Affiliation(s)
- C G Simpson
- Cell and Molecular Genetics Department, Scottish Crop Research Institute, Invergowrie, Dundee, UK
| | | | | | | |
Collapse
|
25
|
Brendel V, Kleffe J, Carle-Urioste JC, Walbot V. Prediction of splice sites in plant pre-mRNA from sequence properties. J Mol Biol 1998; 276:85-104. [PMID: 9514728 DOI: 10.1006/jmbi.1997.1523] [Citation(s) in RCA: 22] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Heterologous introns are often inaccurately or inefficiently processed in higher plants. The precise features that distinguish the process of pre-mRNA splicing in plants from splicing in yeast and mammals are unclear. One contributing factor is the prominent base compositional contrast between U-rich plant introns and flanking G + C-rich exons. Inclusion of this contrast factor in recently developed statistical methods for splice site prediction from sequence inspection significantly improved prediction accuracy. We applied the prediction tools to re-analyze experimental data on splice site selection and splicing efficiency for native and more than 170 mutated plant introns. In almost all cases, the experimentally determined preferred sites correspond to the highest scoring sites predicted by the model. In native genes, about 90% of splice sites are the locally highest scoring sites within the bounds of the flanking exon and intron. We propose that, in most cases, local context (about 50 bases upstream and downstream from a potential intron end) is sufficient to account for intrinsic splice site strength, and that competition for transacting factors determines splice site selection in vivo. We suggest that computer-aided splice site prediction can be a powerful tool for experimental design and interpretation.
Collapse
Affiliation(s)
- V Brendel
- Department of Mathematics, Stanford University, CA 94305-2125, USA
| | | | | | | |
Collapse
|
26
|
Abstract
Cyclins are the regulatory subunits of cyclin-dependent protein kinases. In investigations of the expression of a cyclin gene during maize endosperm development, we detected a cyclin transcript with a 63-bp deletion in the region encoding the conserved 'cyclin box' where cyclin interacts with p34cdc2, the catalytic domain of the cyclin-dependent protein kinase. Analysis of cDNA and genomic sequences, and other observations, indicated that the deletion was caused by alternative splicing of a retained intron in the normally spliced transcript. Whereas the normally spliced cyclin RNA was mitotically functional, as indicated by its ability to promote maturation of Xenopus oocytes, the alternatively spliced transcript was unable to promote maturation. In addition to maize endosperm, the alternatively spliced cyclin was detected in apical meristem, mature leaf, root tip and mature root.
Collapse
MESH Headings
- Alternative Splicing
- Amino Acid Sequence
- Animals
- Base Sequence
- Cell Cycle/genetics
- Cloning, Molecular
- Cyclins/genetics
- Cyclins/metabolism
- Gene Expression Regulation, Plant
- Genes, Plant
- Humans
- Introns
- Molecular Sequence Data
- Oocytes/metabolism
- Plant Leaves/metabolism
- Plant Roots/metabolism
- Plant Stems/metabolism
- Polymerase Chain Reaction
- RNA Processing, Post-Transcriptional
- RNA, Plant/genetics
- RNA, Plant/metabolism
- Sequence Alignment
- Sequence Analysis, DNA
- Sequence Deletion
- Sequence Homology, Amino Acid
- Sequence Homology, Nucleic Acid
- Transcription, Genetic
- Xenopus/genetics
- Zea mays/genetics
Collapse
Affiliation(s)
- Y Sun
- Department of Soil, Crop and Atmospheric Sciences, Cornell University, Ithaca, NY 14853, USA
| | | | | | | |
Collapse
|
27
|
Macknight R, Bancroft I, Page T, Lister C, Schmidt R, Love K, Westphal L, Murphy G, Sherson S, Cobbett C, Dean C. FCA, a gene controlling flowering time in Arabidopsis, encodes a protein containing RNA-binding domains. Cell 1997; 89:737-45. [PMID: 9182761 DOI: 10.1016/s0092-8674(00)80256-1] [Citation(s) in RCA: 314] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
A strong promoter of the transition to flowering in Arabidopsis is encoded by FCA. FCA has been cloned and shown to encode a protein containing two RNA-binding domains and a WW protein interaction domain. This suggests that FCA functions in the posttranscriptional regulation of transcripts involved in the flowering process. The FCA transcript is alternatively spliced with only one form encoding the entire FCA protein. Plants carrying the FCA gene fused to the strong constitutive 35S promoter flowered earlier, and the ratio and abundance of the different FCA transcripts were altered. Thus, FCA appears to be a component of a posttranscriptional cascade involved in the control of flowering time.
Collapse
Affiliation(s)
- R Macknight
- Department of Molecular Genetics, John Innes Centre, Norwich Research Park, Colney, United Kingdom
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
28
|
Kleffe J, Hermann K, Vahrson W, Wittig B, Brendel V. Logitlinear models for the prediction of splice sites in plant pre-mRNA sequences. Nucleic Acids Res 1996; 24:4709-18. [PMID: 8972857 PMCID: PMC146321 DOI: 10.1093/nar/24.23.4709] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023] Open
Abstract
Pre-mRNA splicing in plants, while generally similar to the processes in vertebrates and yeast, is thought to involve plant specific cis-acting elements. Both monocot and dicot introns are typically strongly enriched in U nucleotides, and AU- or U-rich segments are thought to be involved in intron recognition, splice site selection, and splicing efficiency. We have applied logitlinear models to find optimal combinations of splice site variables for the purpose of separating true splice sites from a large excess of potential sites. It is shown that plant splice site prediction from sequence inspection is greatly improved when compositional contrast between exons and introns is considered in addition to degree of matching to the splice site consensus (signal quality). The best model involves subclassification of splice sites according to the identity of the base immediately upstream of the GU and AG signals and gives substantial performance gains compared with conventional profile methods.
Collapse
Affiliation(s)
- J Kleffe
- Freie Universität Berlin, Institut für Molekularbiologie und Biochemie, Germany
| | | | | | | | | |
Collapse
|
29
|
Abstract
Our understanding of how the 3' ends of mRNAs are formed in plants is rudimentary compared to what we know about this process in other eukaryotes. The salient features of plant pre-mRNAs that signal cleavage and polyadenylation remain obscure, and the biochemical mechanism is as yet wholly uncharacterized. Nevertheless, despite the lack of universally conserved cis-acting motifs, a common underlying architecture is emerging from functional analyses of plant poly(A) signals, allowing meaningful comparison with components of poly(A) signals in other eukaryotes. A plant poly(A) signal consists of one or more near-upstream elements (NUE), each directing processing at a poly(A) site a short distance downstream of it, and an extensive far-upstream element (FUE) that enhances processing efficiency at all sites. By analogy with other systems, a model for a plant 3'-end processing complex can be proposed. Plant poly(A) polymerases have been isolated and partially characterised. These, together with hints that some processing factors are conserved in different organisms, opens promising avenues toward initial characterisation of the trans-acting factors involved in 3'-end formation of mRNAs in higher plants.
Collapse
Affiliation(s)
- H M Rothnie
- Friedrich Miescher-Institut, Basel, Switzerland
| |
Collapse
|
30
|
Simpson GG, Filipowicz W. Splicing of precursors to mRNA in higher plants: mechanism, regulation and sub-nuclear organisation of the spliceosomal machinery. PLANT MOLECULAR BIOLOGY 1996; 32:1-41. [PMID: 8980472 DOI: 10.1007/bf00039375] [Citation(s) in RCA: 82] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/07/2023]
Abstract
The removal of introns from pre-mRNA transcripts and the concomitant ligation of exons is known as pre-mRNA splicing. It is a fundamental aspect of constitutive eukaryotic gene expression and an important level at which gene expression is regulated. The process is governed by multiple cis-acting elements of limited sequence content and particular spatial constraints, and is executed by a dynamic ribonucleoprotein complex termed the spliceosome. The mechanism and regulation of pre-mRNA splicing, and the sub-nuclear organisation of the spliceosomal machinery in higher plants is reviewed here. Heterologous introns are often not processed in higher plants indicating that, although highly conserved, the process of pre-mRNA splicing in plants exhibits significant differences that distinguish it from splicing in yeast and mammals. A fundamental distinguishing feature is the presence of and requirement for AU or U-rich intron sequence in higher-plant pre-mRNA splicing. In this review we document the properties of higher-plant introns and trans-acting spliceosomal components and discuss the means by which these elements combine to determine the accuracy and efficiency of pre-mRNA processing. We also detail examples of how introns can effect regulated gene expression by affecting the nature and abundance of mRNA in plants and list the effects of environmental stresses on splicing. Spliceosomal components exhibit a distinct pattern of organisation in higher-plant nuclei. Effective probes that reveal this pattern have only recently become available, but the domains in which spliceosomal components concentrate were identified in plant nuclei as enigmatic structures some sixty years ago. The organisation of spliceosomal components in plant nuclei is reviewed and these recent observations are unified with previous cytochemical and ultrastructural studies of plant ribonuleoprotein domains.
Collapse
Affiliation(s)
- G G Simpson
- Friedrich Miescher-Institut, Basel, Switzerland
| | | |
Collapse
|
31
|
Gniadkowski M, Hemmings-Mieszczak M, Klahre U, Liu HX, Filipowicz W. Characterization of intronic uridine-rich sequence elements acting as possible targets for nuclear proteins during pre-mRNA splicing in Nicotiana plumbaginifolia. Nucleic Acids Res 1996; 24:619-27. [PMID: 8604302 PMCID: PMC145670 DOI: 10.1093/nar/24.4.619] [Citation(s) in RCA: 43] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023] Open
Abstract
Introns of nuclear pre-mRNAs in dicotyledonous plants, unlike introns in vertebrates or yeast, are distinctly rich in A+U nucleotides and this feature is essential for their processing. In order to define more precisely sequence elements important for intron recognition in plants, we investigated the effects of short insertions, either U-rich or A-rich, on splicing of synthetic introns in transfected protoplast of Nicotiana plumbaginifolia. It was found that insertions of U-rich (sequence UUUUUAU) but not A-rich (AUAAAAA) segments can activate splicing of a GC-rich synthetic infron, and that U-rich segments, or multimers thereof, can function irrespective of the site of insertion within the intron. Insertions of multiple U-rich segments, either at the same or different locations, generally had an additive, stimulatory effect on splicing. Mutational analysis showed that replacement of one or two U residues in the UUUUUAU sequence with A or C residues had only a small effect on splicing, but replacement with G residues was strongly inhibitory. Proteins that interact with fragments of natural and synthetic pre-mRNAs in vitro were identified in nuclear extracts of N.plumbaginifolia by UV cross- linking. The profile of cross-linked plant proteins was considerably less complex than that obtained with a HeLa cell nuclear extract. Two major cross-linkable plant proteins had apparent molecular mass of 50 and 54 kDa and showed affinity for oligouridilates present in synGC introns or for poly(U).
Collapse
Affiliation(s)
- M Gniadkowski
- Friedrich Miescher Institute, Ch-4002 Basel, Switzerland
| | | | | | | | | |
Collapse
|
32
|
Lagarias DM, Wu SH, Lagarias JC. Atypical phytochrome gene structure in the green alga Mesotaenium caldariorum. PLANT MOLECULAR BIOLOGY 1995; 29:1127-1142. [PMID: 8616213 DOI: 10.1007/bf00020457] [Citation(s) in RCA: 47] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2023]
Abstract
The phytochrome photoreceptor in the green alga Mesotaenium caldariorum is encoded by a small family of highly related genes. DNA sequence analysis of two of the algal phytochrome genes indicates an atypical gene structure with numerous long introns. The two genes, termed mesphy1a and mesphy1b, encode polypeptides which differ by one amino acid in the region of overlap that was sequenced. RT-PCR studies have established the intron-exon junctions of both genes and show that both are expressed. RNA blot analysis indicates a single transcript of ca. 4.1 kb in length. The deduced amino acid sequence of the mesphy1b gene reveals that the photoreceptor consists of 1142 amino acids, with an overall structure similar to other phytochromes. Phylogenetic analyses indicate that the algal phytochrome falls into a distinct subfamily with other lower plant phytochromes. Profile analysis of an internal repeat found within the central hinge region of the phytochrome polypeptide indicates an evolutionary relatedness to the photoactive yellow protein from the purple bacterium Ectothiorhodospira halophila, to several bacterial sensor kinase family members, and to a family of eukaryotic regulatory proteins which includes the period clock (per) and single-minded (sim) gene products of Drosophila. Since mutations which alter phytochrome activity cluster within the region delimited by these direct repeats (P.H. Quail et al., Science 268 (1995): 675-680), this conserved motif may play an important role in the signal transducing function of these disparate protein families.
Collapse
Affiliation(s)
- D M Lagarias
- Section of Molecular and Cellular Biology, University of California, Davis 95616, USA
| | | | | |
Collapse
|
33
|
Li Q, Hunt AG. A near-upstream element in a plant polyadenylation signal consists of more than six nucleotides. PLANT MOLECULAR BIOLOGY 1995; 28:927-34. [PMID: 7640363 DOI: 10.1007/bf00042076] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/21/2023]
Abstract
A plant polyadenylation signal consists of three distinct components: a far-upstream element (FUE) that can control utilization of several polyadenylation sites, one or more near-upstream elements (NUEs) that control utilization of each site in a transcription unit, and polyadenylation site (CSs) themselves. NUEs have previously been suggested to be related to the mammalian polyadenylation signal AAUAAA. However, many plant genes do not contain AAUAAA-like motifs near their polyadenylation sites. To better understand the nature of NUEs, we conducted a systematic analysis of the NUE for one polyadenylation site (site 1) in the pea rbcS-E9 gene; this NUE lacks an AAUAAA motif. Linker substitution studies showed that the NUE for site 1 in this gene resides in the sequence AAAUGGAAA. Single-nucleotide substitutions in this domain had modest effects on the functioning of this NUE. Replacement of part of this sequence with the sequence AAUAAA increased the efficiency of this NUE. However, alteration of nucleotides immediately 3' of the AAUAAA reversed this effect. Our results indicate that the NUE for site 1 consists of as many as 9 nucleotides, that these 9 bases do not include an element that is intolerant of single base changes, that the sequence AAUAAA can function as a NUE for site 1, and that sequences flanking AAUAAA can affect the efficiency of functioning as a NUE.
Collapse
Affiliation(s)
- Q Li
- Plant Physiology/Biochemistry/Molecular Biology Program, University of Kentucky, Lexington 40546-0091, USA
| | | |
Collapse
|
34
|
Klahre U, Hemmings-Mieszczak M, Filipowicz W. Extreme heterogeneity of polyadenylation sites in mRNAs encoding chloroplast RNA-binding proteins in Nicotiana plumbaginifolia. PLANT MOLECULAR BIOLOGY 1995; 28:569-74. [PMID: 7632924 DOI: 10.1007/bf00020402] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/21/2023]
Abstract
We have previously characterized nuclear cDNA clones encoding two RNA binding proteins, CP-RBP30 and CP-RBP-31, which are targeted to chloroplasts in Nicotiana plumbaginifolia. In this report we describe the analysis of the 3'-untranslated regions (3'-UTRs) in 22 CP-RBP30 and 8 CP-RBP31 clones which reveals that mRNAs encoding both proteins have a very complex polyadenylation pattern. Fourteen distinct poly(A) sites were identified among CP-RBP30 clones and four sites among the CP-RBP31 clones. The authenticity of the sites was confirmed by RNase A/T1 mapping of N. plumbaginifolia RNA. CP-RBP30 provides an extreme example of the heterogeneity known to be a feature of mRNA polyadenylation in higher plants. Using PCR we have demonstrated that CP-RBP genes in N. plumbaginifolia and N. sylvestris, in addition to the previously described introns interrupting the coding region, contain an intron located in the 3' non-coding part of the gene. In the case of the CP-RBP31, we have identified one polyadenylation event occurring in this intron.
Collapse
Affiliation(s)
- U Klahre
- Friedrich Miescher-Institut, Basel, Switzerland
| | | | | |
Collapse
|
35
|
Carle-Urioste JC, Ko CH, Benito MI, Walbot V. In vivo analysis of intron processing using splicing-dependent reporter gene assays. PLANT MOLECULAR BIOLOGY 1994; 26:1785-1795. [PMID: 7858217 DOI: 10.1007/bf00019492] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2023]
Abstract
The mechanisms of intron recognition and processing have been well-studied in mammals and yeast, but in plants the biochemistry of splicing is not known and the rules for intron recognition are not clearly defined. To increase understanding of intron processing in plants, we have constructed new pairs of vectors, pSuccess and pFail, to assess the efficiency of splicing in maize cells. In the pFail series we use translation of pre-mRNA to monitor the amount of unspliced RNA. We inserted an ATG codon in the Bz2 (Bronze-2) intron in frame with luciferase: this construct will express luciferase activity only when splicing fails. In the pSuccess series the spliced message is monitored by inserting an ATG upstream of the Bz2 intron in frame with luciferase: this construct will express luciferase activity only when splicing succeeds. We show here, using both the wild-type Bz2 intron and the same intron with splice site mutations, that the efficiency of splicing can be estimated by the ratio between the luciferase activities of the vector pairs. We also show that mutations in the unique U-rich motif inside the intron can modulate splicing. In addition, a GC-rich insertion in the first exon increases the efficiency of splicing, suggesting that exons also play an important role in intron recognition and/or processing.
Collapse
Affiliation(s)
- J C Carle-Urioste
- Department of Biological Sciences, Stanford University, CA 94305-5020
| | | | | | | |
Collapse
|
36
|
Luehrsen KR, Walbot V. Addition of A- and U-rich sequence increases the splicing efficiency of a deleted form of a maize intron. PLANT MOLECULAR BIOLOGY 1994; 24:449-63. [PMID: 8123788 DOI: 10.1007/bf00024113] [Citation(s) in RCA: 20] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/08/2023]
Abstract
Plant introns are generally short (< 200 nt) and AU-rich, and an elevated AU content is necessary for efficient splicing. Further, an intron in some plant genes enhances gene expression by a post-transcriptional mechanism that results in an increase of cytoplasmic mRNA. The specific intron features responsible for efficient splicing and enhancement are not well characterized in plants. Internal deletions of up to 80% of two maize introns, Adh1 intron 1 and maize actin 3, indicate that large segments of these introns are dispensable for normal function. However, extensive deletion (> 75%) of Adh1 intron 1 diminishes both intron enhancement and splicing efficiency. This finding suggests that there are internal sequence motifs required for intron function, and that these motifs are redundant. We attempted to repair a deletion-impaired Adh1 intron 1 variant by adding back either oligomers of defined sequence content or fragments of maize internal intron sequence. The addition of AU-rich oligomers improved splicing efficiency and in one example, a U-rich oligomer activated a cryptic 3' splice acceptor. We also found that replacing the region proximal to the Adh1 intron 1 3' acceptor with U-rich sequence improved splicing. We found that adding G- and C-rich oligomers did not improve intron function, but a C-rich oligomer activated a cryptic 3' acceptor. The addition of internal intron sequence to an impaired intron improved splicing, and in one case, resulted in the activation of a cryptic 3' acceptor. We present evidence that U-rich sequence immediately upstream of the 3' splice junction increases splicing efficiency and contributes to, but does not uniquely specify, 3' acceptor AG choice.
Collapse
Affiliation(s)
- K R Luehrsen
- Dept. of Biological Sciences, Stanford University, CA 94305-5020
| | | |
Collapse
|