1
|
Zhukova M, Schedl P, Shidlovskii YV. The role of secondary structures in the functioning of 3' untranslated regions of mRNA: A review of functions of 3' UTRs' secondary structures and hypothetical involvement of secondary structures in cytoplasmic polyadenylation in Drosophila. Bioessays 2024; 46:e2300099. [PMID: 38161240 DOI: 10.1002/bies.202300099] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2023] [Revised: 12/11/2023] [Accepted: 12/12/2023] [Indexed: 01/03/2024]
Abstract
3' untranslated regions (3' UTRs) of mRNAs have many functions, including mRNA processing and transport, translational regulation, and mRNA degradation and stability. These different functions require cis-elements in 3' UTRs that can be either sequence motifs or RNA structures. Here we review the role of secondary structures in the functioning of 3' UTRs and discuss some of the trans-acting factors that interact with these secondary structures in eukaryotic organisms. We propose potential participation of 3'-UTR secondary structures in cytoplasmic polyadenylation in the model organism Drosophila melanogaster. Because the secondary structures of 3' UTRs are essential for post-transcriptional regulation of gene expression, their disruption leads to a wide range of disorders, including cancer and cardiovascular diseases. Trans-acting factors, such as STAU1 and nucleolin, which interact with 3'-UTR secondary structures of target transcripts, influence the pathogenesis of neurodegenerative diseases and tumor metastasis, suggesting that they are possible therapeutic targets.
Collapse
Affiliation(s)
- Mariya Zhukova
- Laboratory of Gene Expression Regulation in Development, Russian Academy of Sciences, Institute of Gene Biology, Moscow, Russia
| | - Paul Schedl
- Laboratory of Gene Expression Regulation in Development, Russian Academy of Sciences, Institute of Gene Biology, Moscow, Russia
- Department of Molecular Biology, Princeton University, Princeton, New Jersey, USA
| | - Yulii V Shidlovskii
- Laboratory of Gene Expression Regulation in Development, Russian Academy of Sciences, Institute of Gene Biology, Moscow, Russia
- Department of Biology and General Genetics, Sechenov First Moscow State Medical University (Sechenov University), Moscow, Russia
| |
Collapse
|
2
|
Stroup EK, Ji Z. Delineating yeast cleavage and polyadenylation signals using deep learning. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.10.10.561764. [PMID: 37873420 PMCID: PMC10592759 DOI: 10.1101/2023.10.10.561764] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/25/2023]
Abstract
3'-end cleavage and polyadenylation is an essential process for eukaryotic mRNA maturation. In yeast species, the polyadenylation signals that recruit the processing machinery are degenerate and remain poorly characterized compared to well-defined regulatory elements in mammals. Especially, recent deep sequencing experiments showed extensive cleavage heterogeneity for some mRNAs in Saccharomyces cerevisiae and uncovered the polyA motif differences between S. cerevisiae vs. Schizosaccharomyces pombe . The findings raised the fundamental question of how polyadenylation signals are formed in yeast. Here we addressed this question by developing deep learning models to deconvolute degenerate cis -regulatory elements and quantify their positional importance in mediating yeast polyA site formation, cleavage heterogeneity, and strength. In S. cerevisiae , cleavage heterogeneity is promoted by the depletion of U-rich elements around polyA sites as well as multiple occurrences of upstream UA-rich elements. Sites with high cleavage heterogeneity show overall lower strength. The site strength and tandem site distances modulate alternative polyadenylation (APA) under the diauxic stress. Finally, we developed a deep learning model to reveal the distinct motif configuration of S. pombe polyA sites which show more precise cleavage than S. cerevisiae . Altogether, our deep learning models provide unprecedented insights into polyA site formation across yeast species.
Collapse
|
3
|
Muñoz O, Lore M, Jagannathan S. The long and short of EJC-independent nonsense-mediated RNA decay. Biochem Soc Trans 2023; 51:1121-1129. [PMID: 37145092 DOI: 10.1042/bst20221131] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2023] [Revised: 04/21/2023] [Accepted: 04/25/2023] [Indexed: 05/06/2023]
Abstract
Nonsense-mediated RNA decay (NMD) plays a dual role as an RNA surveillance mechanism against aberrant transcripts containing premature termination codons and as a gene regulatory mechanism for normal physiological transcripts. This dual function is possible because NMD recognizes its substrates based on the functional definition of a premature translation termination event. An efficient mode of NMD target recognition involves the presence of exon-junction complexes (EJCs) downstream of the terminating ribosome. A less efficient, but highly conserved, mode of NMD is triggered by long 3' untranslated regions (UTRs) that lack EJCs (termed EJC-independent NMD). While EJC-independent NMD plays an important regulatory role across organisms, our understanding of its mechanism, especially in mammalian cells, is incomplete. This review focuses on EJC-independent NMD and discusses the current state of knowledge and factors that contribute to the variability in the efficiency of this mechanism.
Collapse
Affiliation(s)
- Oscar Muñoz
- Molecular Biology Graduate Program, University of Colorado Anschutz Medical Campus, Aurora, CO 80045, U.S.A
| | - Mlana Lore
- Molecular Biology Graduate Program, University of Colorado Anschutz Medical Campus, Aurora, CO 80045, U.S.A
| | - Sujatha Jagannathan
- Department of Biochemistry and Molecular Genetics, University of Colorado Anschutz Medical Campus, Aurora, CO 80045, U.S.A
- RNA Bioscience Initiative, University of Colorado Anschutz Medical Campus, Aurora, CO 80045, U.S.A
| |
Collapse
|
4
|
Architectural and functional details of CF IA proteins involved in yeast 3'-end pre-mRNA processing and its significance for eukaryotes: A concise review. Int J Biol Macromol 2021; 193:387-400. [PMID: 34699898 DOI: 10.1016/j.ijbiomac.2021.10.129] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2021] [Revised: 10/04/2021] [Accepted: 10/18/2021] [Indexed: 11/22/2022]
Abstract
In eukaryotes, maturation of pre-mRNA relies on its precise 3'-end processing. This processing involves co-transcriptional steps regulated by sequence elements and other proteins. Although, it holds tremendous importance, defect in the processing machinery will result in erroneous pre-mRNA maturation leading to defective translation. Remarkably, more than 20 proteins in humans and yeast share homology and execute this processing. The defects in this processing are associated with various diseases in humans. We shed light on the CF IA subunit of yeast Saccharomyces cerevisiae that contains four proteins (Pcf11, Clp1, Rna14 and Rna15) involved in this processing. Structural details of various domains of CF IA and their roles during 3'-end processing, like cleavage and polyadenylation at 3'-UTR of pre-mRNA and other cellular events are explained. Further, the chronological development and important discoveries associated with 3'-end processing are summarized. Moreover, the mammalian homologues of yeast CF IA proteins, along with their key roles are described. This knowledge would be helpful for better comprehension of the mechanism associated with this marvel; thus opening up vast avenues in this area.
Collapse
|
5
|
Cheperegin SE, Sannikova EP, Malysheva AV, Klebanov FA, Kozlov DG. Highly Active Modified Variants of Recombinant Phospholipase А2 from Streptomyces violaceoruber for Effective Expression in Yeasts. APPL BIOCHEM MICRO+ 2020. [DOI: 10.1134/s0003683820070029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
6
|
Sanborn MA, Li T, Victor K, Siegfried H, Fung C, Rothman AL, Srikiatkhachorn A, Fernandez S, Ellison D, Jarman RG, Friberg H, Maljkovic Berry I, Currier JR, Waickman AT. Analysis of cell-associated DENV RNA by oligo(dT) primed 5' capture scRNAseq. Sci Rep 2020; 10:9047. [PMID: 32493997 PMCID: PMC7270085 DOI: 10.1038/s41598-020-65939-5] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2020] [Accepted: 05/12/2020] [Indexed: 01/12/2023] Open
Abstract
Dengue is one of the most widespread vector-borne viral diseases in the world. However, the size, heterogeneity, and temporal dynamics of the cell-associated viral reservoir during acute dengue virus (DENV) infection remains unclear. In this study, we analyzed cells infected in vitro with DENV and PBMC from an individual experiencing a natural DENV infection utilizing 5’ capture single cell RNA sequencing (scRNAseq). Both positive- and negative-sense DENV RNA was detected in reactions containing either an oligo(dT) primer alone, or in reactions supplemented with a DENV-specific primer. The addition of a DENV-specific primer did not increase the total amount of DENV RNA captured or the fraction of cells identified as containing DENV RNA. However, inclusion of a DENV-specific cDNA primer did increase the viral genome coverage immediately 5’ to the primer binding site. Furthermore, while the majority of intracellular DENV sequence captured in this analysis mapped to the 5’ end of the viral genome, distinct patterns of enhanced coverage within the DENV polyprotein coding region were observed. The 5’ capture scRNAseq analysis of PBMC not only recapitulated previously published reports by detecting virally infected memory and naïve B cells, but also identified cell-associated genomic variants not observed in contemporaneous serum samples. These results demonstrate that oligo(dT) primed 5’ capture scRNAseq can detect DENV RNA and quantify virus-infected cells in physiologically relevant conditions, and provides insight into viral sequence variability within infected cells.
Collapse
Affiliation(s)
- Mark A Sanborn
- Viral Diseases Branch, Walter Reed Army Institute of Research, Silver Spring, MD, USA
| | - Tao Li
- Viral Diseases Branch, Walter Reed Army Institute of Research, Silver Spring, MD, USA
| | - Kaitlin Victor
- Viral Diseases Branch, Walter Reed Army Institute of Research, Silver Spring, MD, USA
| | - Hayden Siegfried
- Viral Diseases Branch, Walter Reed Army Institute of Research, Silver Spring, MD, USA
| | - Christian Fung
- Viral Diseases Branch, Walter Reed Army Institute of Research, Silver Spring, MD, USA
| | - Alan L Rothman
- Department of Cell and Molecular Biology, Institute for Immunology and Informatics, University of Rhode Island, Providence, RI, USA
| | - Anon Srikiatkhachorn
- Department of Cell and Molecular Biology, Institute for Immunology and Informatics, University of Rhode Island, Providence, RI, USA.,Faculty of Medicine, King Mongkut's Institute of Technology Ladkrabang, Bangkok, Thailand
| | - Stefan Fernandez
- Department of Virology, Armed Forces Research Institute of Medical Sciences, Bangkok, Thailand
| | - Damon Ellison
- Viral Diseases Branch, Walter Reed Army Institute of Research, Silver Spring, MD, USA
| | - Richard G Jarman
- Viral Diseases Branch, Walter Reed Army Institute of Research, Silver Spring, MD, USA
| | - Heather Friberg
- Viral Diseases Branch, Walter Reed Army Institute of Research, Silver Spring, MD, USA
| | - Irina Maljkovic Berry
- Viral Diseases Branch, Walter Reed Army Institute of Research, Silver Spring, MD, USA
| | - Jeffrey R Currier
- Viral Diseases Branch, Walter Reed Army Institute of Research, Silver Spring, MD, USA
| | - Adam T Waickman
- Viral Diseases Branch, Walter Reed Army Institute of Research, Silver Spring, MD, USA.
| |
Collapse
|
7
|
Ipa1 Is an RNA Polymerase II Elongation Factor that Facilitates Termination by Maintaining Levels of the Poly(A) Site Endonuclease Ysh1. Cell Rep 2020; 26:1919-1933.e5. [PMID: 30759400 PMCID: PMC7236606 DOI: 10.1016/j.celrep.2019.01.051] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2018] [Revised: 12/05/2018] [Accepted: 01/15/2019] [Indexed: 02/08/2023] Open
Abstract
The yeast protein Ipa1 was recently discovered to interact with the Ysh1
endonuclease of the prem-RNA cleavage and polyadenylation (C/P) machinery, and
Ipa1 mutation impairs 3′end processing. We report that Ipa1 globally
promotes proper transcription termination and poly(A) site selection, but with
variable effects on genes depending upon the specific configurations of
polyadenylation signals. Our findings suggest that the role of Ipa1 in
termination is mediated through interaction with Ysh1, since Ipa1 mutation leads
to decrease in Ysh1 and poor recruitment of the C/P complex to a transcribed
gene. The Ipa1 association with transcriptionally active chromatin resembles
that of elongation factors, and the mutant shows defective Pol II elongation
kinetics in vivo. Ysh1 overexpression in the Ipa1 mutant
rescues the termination defect, but not the mutant’s sensitivity to
6-azauracil, an indicator of defective elongation. Our findings support a model
in which an Ipa1/Ysh1 complex helps coordinate transcription elongation and
3′ end processing. The essential, uncharacterized Ipa1 protein was recently discovered to
interact with the Ysh1 endonuclease of the pre-mRNA cleavage and polyadenylation
machinery. Pearson et al. propose that the Ipa1/Ysh1 interaction provides the
cell with a means to coordinate and regulate transcription elongation with
3′ end processing in accordance with the cell’s needs.
Collapse
|
8
|
Roy KR, Chanfreau GF. Robust mapping of polyadenylated and non-polyadenylated RNA 3' ends at nucleotide resolution by 3'-end sequencing. Methods 2020; 176:4-13. [PMID: 31128237 PMCID: PMC6874744 DOI: 10.1016/j.ymeth.2019.05.016] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2019] [Revised: 04/10/2019] [Accepted: 05/20/2019] [Indexed: 10/26/2022] Open
Abstract
3'-end poly(A)+ sequencing is an efficient and economical method for global measurement of mRNA levels and alternative poly(A) site usage. A common method involves oligo(dT)19V reverse-transcription (RT)-based library preparation and high-throughput sequencing with a custom primer ending in (dT)19. While the majority of library products have the first sequenced nucleotide reflect the bona fide poly(A) site (pA), a substantial fraction of sequencing reads arise from various mis-priming events. These can result in incorrect pA site calls anywhere from several nucleotides downstream to several kilobases upstream from the bona fide pA site. While these mis-priming events can be mitigated by increasing annealing stringency (e.g. increasing temperature from 37 °C to 42 °C), they still persist at an appreciable level (∼10%) and computational methods must be used to prevent artifactual calls. Here we present a bioinformatics workflow for precise mapping of poly(A)+ 3' ends and handling of artifacts due to oligo(dT) mis-priming and sample polymorphisms. We test pA site calling with three different read mapping programs (STAR, BWA, and BBMap), and show that the way in which each handles terminal mismatches and soft clipping has a substantial impact on identifying correct pA sites, with BWA requiring the least post-processing to correct artifacts. We demonstrate the use of this pipeline for mapping pA sites in the model eukaryote S. cerevisiae, and further apply this technology to non-polyadenylated transcripts by employing in vitro polyadenylation prior to library prep (IVP-seq). As proof of principle, we show that a fraction of tRNAs harbor CCU 3' tails instead of the canonical CCA tail, and globally identify 3' ends of splicing intermediates arising from inefficiently spliced transcripts.
Collapse
Affiliation(s)
- Kevin R Roy
- Department of Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, CA 90095-1569, United States; Molecular Biology Institute, University of California, Los Angeles, Los Angeles, CA 90095-1570, United States
| | - Guillaume F Chanfreau
- Department of Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, CA 90095-1569, United States; Molecular Biology Institute, University of California, Los Angeles, Los Angeles, CA 90095-1570, United States.
| |
Collapse
|
9
|
Mechanism of ribosome stalling during translation of a poly(A) tail. Nat Struct Mol Biol 2019; 26:1132-1140. [PMID: 31768042 PMCID: PMC6900289 DOI: 10.1038/s41594-019-0331-x] [Citation(s) in RCA: 87] [Impact Index Per Article: 17.4] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2019] [Accepted: 10/10/2019] [Indexed: 12/23/2022]
Abstract
Faulty or damaged mRNAs are detected by the cell when translating ribosomes stall during elongation and trigger pathways of mRNA decay, nascent protein degradation, and ribosome recycling. The most common mRNA defect in eukaryotes is probably inappropriate poly-adenylation at near-cognate sites within the coding region. How ribosomes stall selectively when they encounter poly(A) is unclear. Here, we use biochemical and structural approaches in mammalian systems to show that poly-lysine, encoded by poly(A), favors a peptidyl-tRNA conformation sub-optimal for peptide bond formation. This conformation partially slows elongation, permitting poly(A) mRNA in the ribosome’s decoding center to adopt an rRNA-stabilized single-stranded helix. The reconfigured decoding center clashes with incoming aminoacyl-tRNA, thereby precluding elongation. Thus, coincidence detection of poly-lysine in the exit tunnel and poly(A) in the decoding center allows ribosomes to detect aberrant mRNAs selectively, stall elongation, and trigger downstream quality control pathways essential for cellular homeostasis.
Collapse
|
10
|
Matsuyama T. Recent developments in terminator technology in Saccharomyces cerevisiae. J Biosci Bioeng 2019; 128:655-661. [PMID: 31324384 DOI: 10.1016/j.jbiosc.2019.06.006] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2019] [Revised: 06/07/2019] [Accepted: 06/07/2019] [Indexed: 11/26/2022]
Abstract
Metabolically engineered microorganisms that produce useful organic compounds will be helpful for realizing a sustainable society. The budding yeast Saccharomyces cerevisiae has high utility as a metabolic engineering platform because of its high fermentation ability, non-pathogenicity, and ease of handling. When producing yeast strains that produce exogenous compounds, it is a prerequisite to control the expression of exogenous enzyme-encoding genes. Terminator region in a gene expression cassette, as well as promoter region, could be used to improve metabolically engineered yeasts by increasing or decreasing the expression of the target enzyme-encoding genes. The findings on terminators have grown rapidly in the last decade, so an overview of these findings should provide a foothold for new developments.
Collapse
|
11
|
Zhao Z, Wu X, Ji G, Liang C, Li QQ. Genome-Wide Comparative Analyses of Polyadenylation Signals in Eukaryotes Suggest a Possible Origin of the AAUAAA Signal. Int J Mol Sci 2019; 20:ijms20040958. [PMID: 30813258 PMCID: PMC6413133 DOI: 10.3390/ijms20040958] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2019] [Revised: 02/18/2019] [Accepted: 02/19/2019] [Indexed: 01/09/2023] Open
Abstract
Pre-mRNA cleavage and polyadenylation is an essential step for almost all mRNA in eukaryotes. The cis-elements around the poly(A) sites, however, are very diverse among different organisms. We characterized the poly(A) signals of seven different species, and compared them with that of four well-studied organisms. We found that ciliates do not show any dominant poly(A) signal; a triplet (UAA) and tetramers (UAAA and GUAA) are dominant in diatoms and red alga, respectively; and green alga Ostreococcus uses UGUAA as its poly(A) signal. Spikemoss and moss use conserved AAUAAA signals that are similar to other land plants. Our analysis suggests that the first two bases (NN in NNUAAA) are likely degenerated whereas UAAA appears to be the core motif. Combined with other published results, it is suggested that the highly conserved poly(A) signal AAUAAA may be derived from UAA with an intermediate, putative UAAA, following a pathway of UAA→UAAA→AAUAAA.
Collapse
Affiliation(s)
- Zhixin Zhao
- College of Biopharmaceutical and Food Engineering, Shangluo University, Shangluo 726000, China.
- Department of Biology, Miami University, Oxford, OH 45056, USA.
| | - Xiaohui Wu
- Department of Automation, Xiamen University, Xiamen 361005, China.
| | - Guoli Ji
- Department of Automation, Xiamen University, Xiamen 361005, China.
| | - Chun Liang
- Department of Biology, Miami University, Oxford, OH 45056, USA.
| | - Qingshun Quinn Li
- Department of Biology, Miami University, Oxford, OH 45056, USA.
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, and College of the Environment and Ecology, Xiamen University, Xiamen 361102, China.
- Graduate College of Biomedical Sciences, Western University of Health Sciences, Pomona, CA 91766, USA.
| |
Collapse
|
12
|
Zhou Z, Dang Y, Zhou M, Yuan H, Liu Y. Codon usage biases co-evolve with transcription termination machinery to suppress premature cleavage and polyadenylation. eLife 2018; 7:33569. [PMID: 29547124 PMCID: PMC5869017 DOI: 10.7554/elife.33569] [Citation(s) in RCA: 42] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2017] [Accepted: 03/15/2018] [Indexed: 12/13/2022] Open
Abstract
Codon usage biases are found in all genomes and influence protein expression levels. The codon usage effect on protein expression was thought to be mainly due to its impact on translation. Here, we show that transcription termination is an important driving force for codon usage bias in eukaryotes. Using Neurospora crassa as a model organism, we demonstrated that introduction of rare codons results in premature transcription termination (PTT) within open reading frames and abolishment of full-length mRNA. PTT is a wide-spread phenomenon in Neurospora, and there is a strong negative correlation between codon usage bias and PTT events. Rare codons lead to the formation of putative poly(A) signals and PTT. A similar role for codon usage bias was also observed in mouse cells. Together, these results suggest that codon usage biases co-evolve with the transcription termination machinery to suppress premature termination of transcription and thus allow for optimal gene expression.
Collapse
Affiliation(s)
- Zhipeng Zhou
- Department of Physiology, The University of Texas Southwestern Medical Center, Dallas, United States
| | - Yunkun Dang
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming, China.,Center for Life Science, School of Life Sciences, Yunnan University, Kunming, China
| | - Mian Zhou
- State Key Laboratory of Bioreactor Engineering, East China University of Science and Technology, Shanghai, China
| | - Haiyan Yuan
- Department of Physiology, The University of Texas Southwestern Medical Center, Dallas, United States
| | - Yi Liu
- Department of Physiology, The University of Texas Southwestern Medical Center, Dallas, United States
| |
Collapse
|
13
|
Yeh HS, Zhang W, Yong J. Analyses of alternative polyadenylation: from old school biochemistry to high-throughput technologies. BMB Rep 2018; 50:201-207. [PMID: 28148393 PMCID: PMC5437964 DOI: 10.5483/bmbrep.2017.50.4.019] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2017] [Indexed: 01/08/2023] Open
Abstract
Alternations in usage of polyadenylation sites during transcription termination yield transcript isoforms from a gene. Recent findings of transcriptome-wide alternative polyadenylation (APA) as a molecular response to changes in biology position APA not only as a molecular event of early transcriptional termination but also as a cellular regulatory step affecting various biological pathways. With the development of high-throughput profiling technologies at a single nucleotide level and their applications targeted to the 3'-end of mRNAs, dynamics in the landscape of mRNA 3'-end is measureable at a global scale. In this review, methods and technologies that have been adopted to study APA events are discussed. In addition, various bioinformatics algorithms for APA isoform analysis using publicly available RNA-seq datasets are introduced. [BMB Reports 2017; 50(4): 201-207].
Collapse
Affiliation(s)
- Hsin-Sung Yeh
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, Minnesota 55455, USA
| | - Wei Zhang
- Department of Computer Science and Engineering, University of Minnesota, Minneapolis, Minnesota 55455, USA
| | - Jeongsik Yong
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, Minnesota 55455, USA
| |
Collapse
|
14
|
Lejeune F. Nonsense-mediated mRNA decay at the crossroads of many cellular pathways. BMB Rep 2018; 50:175-185. [PMID: 28115040 PMCID: PMC5437961 DOI: 10.5483/bmbrep.2017.50.4.015] [Citation(s) in RCA: 49] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2017] [Indexed: 12/22/2022] Open
Abstract
Nonsense-mediated mRNA decay (NMD) is a surveillance mechanism ensuring the fast decay of mRNAs harboring a premature termination codon (PTC). As a quality control mechanism, NMD distinguishes PTCs from normal termination codons in order to degrade PTC-carrying mRNAs only. For this, NMD is connected to various other cell processes which regulate or activate it under specific cell conditions or in response to mutations, mis-regulations, stresses, or particular cell programs. These cell processes and their connections with NMD are the focus of this review, which aims both to illustrate the complexity of the NMD mechanism and its regulation and to highlight the cellular consequences of NMD inhibition.
Collapse
Affiliation(s)
- Fabrice Lejeune
- University Lille, UMR8161 - M3T - Mechanisms of Tumorigenesis and Target Therapies; CNRS, UMR 8161, 3Institut Pasteur de Lille, F-59000 Lille, France
| |
Collapse
|
15
|
Janke R, Kong J, Braberg H, Cantin G, Yates JR, Krogan NJ, Heyer WD. Nonsense-mediated decay regulates key components of homologous recombination. Nucleic Acids Res 2016; 44:5218-30. [PMID: 27001511 PMCID: PMC4914092 DOI: 10.1093/nar/gkw182] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2016] [Revised: 03/08/2016] [Accepted: 03/09/2016] [Indexed: 12/29/2022] Open
Abstract
Cells frequently experience DNA damage that requires repair by homologous recombination (HR). Proteins involved in HR are carefully coordinated to ensure proper and efficient repair without interfering with normal cellular processes. In Saccharomyces cerevisiae, Rad55 functions in the early steps of HR and is regulated in response to DNA damage through phosphorylation by the Mec1 and Rad53 kinases of the DNA damage response. To further identify regulatory processes that target HR, we performed a high-throughput genetic interaction screen with RAD55 phosphorylation site mutants. Genes involved in the mRNA quality control process, nonsense-mediated decay (NMD), were found to genetically interact with rad55 phospho-site mutants. Further characterization revealed that RAD55 transcript and protein levels are regulated by NMD. Regulation of HR by NMD extends to multiple targets beyond RAD55, including RAD51, RAD54 and RAD57 Finally, we demonstrate that loss of NMD results in an increase in recombination rates and resistance to the DNA damaging agent methyl methanesulfonate, suggesting this pathway negatively regulates HR under normal growth conditions.
Collapse
Affiliation(s)
- Ryan Janke
- Department of Microbiology & Molecular Genetics, University of California, Davis, CA 95616-8665, USA
| | - Jeremy Kong
- Department of Microbiology & Molecular Genetics, University of California, Davis, CA 95616-8665, USA
| | - Hannes Braberg
- Department of Cellular & Molecular Pharmacology, University of California, San Francisco, San Francisco, CA 94158-2517, USA
| | - Greg Cantin
- Department of Cell Biology, SR-11, Scripps Research institute, La Jolla, CA 92307, USA
| | - John R Yates
- Department of Cell Biology, SR-11, Scripps Research institute, La Jolla, CA 92307, USA
| | - Nevan J Krogan
- Department of Cellular & Molecular Pharmacology, University of California, San Francisco, San Francisco, CA 94158-2517, USA California Institute for Quantitative Biosciences, QB3, San Francisco, CA 94158-2517, USA J. David Gladstone Institute, San Francisco, CA, 94158-2517, USA
| | - Wolf-Dietrich Heyer
- Department of Microbiology & Molecular Genetics, University of California, Davis, CA 95616-8665, USA Department of Molecular & Cellular Biology University of California, Davis, CA 95616-8665, USA
| |
Collapse
|
16
|
Yeh HS, Yong J. Alternative Polyadenylation of mRNAs: 3'-Untranslated Region Matters in Gene Expression. Mol Cells 2016; 39:281-5. [PMID: 26912084 PMCID: PMC4844933 DOI: 10.14348/molcells.2016.0035] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2016] [Accepted: 02/11/2016] [Indexed: 12/26/2022] Open
Abstract
Almost all of eukaryotic mRNAs are subjected to polyadenylation during mRNA processing. Recent discoveries showed that many of these mRNAs contain more than one polyadenylation sites in their 3' untranslated regions (UTR) and that alternative polyadenylation (APA) is prevalent among these genes. Many biological processes such as differentiation, proliferation, and tumorigenesis have been correlated to global APA events in the 3' UTR of mRNAs, suggesting that these APA events are tightly regulated and may play important physiological roles. In this review, recent discoveries in the physiological roles of APA events, as well as the known and proposed mechanisms are summarized. Perspective for future directions is also discussed.
Collapse
Affiliation(s)
- Hsin-Sung Yeh
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, Minnesota 55455,
USA
| | - Jeongsik Yong
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, Minnesota 55455,
USA
| |
Collapse
|
17
|
Abstract
Messenger RNA polyadenylation is one of the essential processing steps during eukaryotic gene expression. The site of polyadenylation [poly(A) site] marks the end of a transcript, which is also the end of a gene in most cases. A computation program that is able to recognize poly(A) sites would not only be useful for genome annotation in finding genes ends, but also for predicting alternative poly(A) sites. PASS [Poly(A) Site Sleuth] and PAC [Poly(A) site Classifier] were developed to predict poly(A) sites in plants. PASS was built based on the Generalized Hidden Markov Model (GHMM), which consists of four functional modules: input model, poly(A) site recognition module, graphic process module, and output module. PAC is a classification model, integrating several features that define the poly(A) sites including K-gram pattern, Z-curve, position-specific scoring matrix, and first-order inhomogeneous Markov sub-model. PAC can be used to predict poly(A) sites from species whose polyadenylation profile is unknown. The result of PASS and PAC is an output of a few files with one of them containing the score or probability of being a poly(A) site for each position of a given sequence. While the models were built mostly based on poly(A) profile data from Arabidopsis, it is also functional in other higher plants since their profiles are quite similar.
Collapse
Affiliation(s)
- Xiaohui Wu
- Department of Automation, Xiamen University, 422 Siming South Road, Xiamen, Fujian, 361005, China,
| | | | | |
Collapse
|
18
|
Shalem O, Sharon E, Lubliner S, Regev I, Lotan-Pompan M, Yakhini Z, Segal E. Systematic dissection of the sequence determinants of gene 3' end mediated expression control. PLoS Genet 2015; 11:e1005147. [PMID: 25875337 PMCID: PMC4398552 DOI: 10.1371/journal.pgen.1005147] [Citation(s) in RCA: 51] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2014] [Accepted: 03/17/2015] [Indexed: 01/09/2023] Open
Abstract
The 3'end genomic region encodes a wide range of regulatory process including mRNA stability, 3' end processing and translation. Here, we systematically investigate the sequence determinants of 3' end mediated expression control by measuring the effect of 13,000 designed 3' end sequence variants on constitutive expression levels in yeast. By including a high resolution scanning mutagenesis of more than 200 native 3' end sequences in this designed set, we found that most mutations had only a mild effect on expression, and that the vast majority (~90%) of strongly effecting mutations localized to a single positive TA-rich element, similar to a previously described 3' end processing efficiency element, and resulted in up to ten-fold decrease in expression. Measurements of 3' UTR lengths revealed that these mutations result in mRNAs with aberrantly long 3'UTRs, confirming the role for this element in 3' end processing. Interestingly, we found that other sequence elements that were previously described in the literature to be part of the polyadenylation signal had a minor effect on expression. We further characterize the sequence specificities of the TA-rich element using additional synthetic 3' end sequences and show that its activity is sensitive to single base pair mutations and strongly depends on the A/T content of the surrounding sequences. Finally, using a computational model, we show that the strength of this element in native 3' end sequences can explain some of their measured expression variability (R = 0.41). Together, our results emphasize the importance of efficient 3' end processing for endogenous protein levels and contribute to an improved understanding of the sequence elements involved in this process.
Collapse
Affiliation(s)
- Ophir Shalem
- Department of Computer Science and Applied Mathematics, The Weizmann Institute of Science, Rehovot, Israel
- Department of Molecular Cell Biology, The Weizmann Institute of Science, Rehovot, Israel
| | - Eilon Sharon
- Department of Computer Science and Applied Mathematics, The Weizmann Institute of Science, Rehovot, Israel
- Department of Molecular Cell Biology, The Weizmann Institute of Science, Rehovot, Israel
| | - Shai Lubliner
- Department of Computer Science and Applied Mathematics, The Weizmann Institute of Science, Rehovot, Israel
- Department of Molecular Cell Biology, The Weizmann Institute of Science, Rehovot, Israel
| | - Ifat Regev
- Department of Computer Science and Applied Mathematics, The Weizmann Institute of Science, Rehovot, Israel
- Department of Molecular Cell Biology, The Weizmann Institute of Science, Rehovot, Israel
| | - Maya Lotan-Pompan
- Department of Computer Science and Applied Mathematics, The Weizmann Institute of Science, Rehovot, Israel
- Department of Molecular Cell Biology, The Weizmann Institute of Science, Rehovot, Israel
| | - Zohar Yakhini
- Department of Computer Science, Technion, Haifa, Israel
- Agilent Laboratories, Tel Aviv, Israel
| | - Eran Segal
- Department of Computer Science and Applied Mathematics, The Weizmann Institute of Science, Rehovot, Israel
- Department of Molecular Cell Biology, The Weizmann Institute of Science, Rehovot, Israel
- * E-mail:
| |
Collapse
|
19
|
Release factor eRF3 mediates premature translation termination on polylysine-stalled ribosomes in Saccharomyces cerevisiae. Mol Cell Biol 2014; 34:4062-76. [PMID: 25154418 DOI: 10.1128/mcb.00799-14] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Ribosome stalling is an important incident enabling the cellular quality control machinery to detect aberrant mRNA. Saccharomyces cerevisiae Hbs1-Dom34 and Ski7 are homologs of the canonical release factor eRF3-eRF1, which recognize stalled ribosomes, promote ribosome release, and induce the decay of aberrant mRNA. Polyadenylated nonstop mRNA encodes aberrant proteins containing C-terminal polylysine segments which cause ribosome stalling due to electrostatic interaction with the ribosomal exit tunnel. Here we describe a novel mechanism, termed premature translation termination, which releases C-terminally truncated translation products from ribosomes stalled on polylysine segments. Premature termination during polylysine synthesis was abolished when ribosome stalling was prevented due to the absence of the ribosomal protein Asc1. In contrast, premature termination was enhanced, when the general rate of translation elongation was lowered. The unconventional termination event was independent of Hbs1-Dom34 and Ski7, but it was dependent on eRF3. Moreover, premature termination during polylysine synthesis was strongly increased in the absence of the ribosome-bound chaperones ribosome-associated complex (RAC) and Ssb (Ssb1 and Ssb2). On the basis of the data, we suggest a model in which eRF3-eRF1 can catalyze the release of nascent polypeptides even though the ribosomal A-site contains a sense codon when the rate of translation is abnormally low.
Collapse
|
20
|
Ottoz DSM, Rudolf F, Stelling J. Inducible, tightly regulated and growth condition-independent transcription factor in Saccharomyces cerevisiae. Nucleic Acids Res 2014; 42:e130. [PMID: 25034689 PMCID: PMC4176152 DOI: 10.1093/nar/gku616] [Citation(s) in RCA: 92] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The precise control of gene expression is essential in basic biological research as well as in biotechnological applications. Most regulated systems available in yeast enable only the overexpression of the target gene, excluding the possibility of intermediate or weak expression. Moreover, these systems are frequently toxic or depend on growth conditions. We constructed a heterologous transcription factor that overcomes these limitations. Our system is a fusion of the bacterial LexA DNA-binding protein, the human estrogen receptor (ER) and an activation domain (AD). The activity of this chimera, called LexA-ER-AD, is tightly regulated by the hormone β-estradiol. The selection of the AD proved to be crucial to avoid toxic effects and to define the range of activity that can be precisely tuned with β-estradiol. As our system is based on a heterologous DNA-binding domain, induction in different metabolic contexts is possible. Additionally, by controlling the number of LexA-binding sites in the target promoter, one can scale the expression levels up or down. Overall, our LexA-ER-AD system is a valuable tool to precisely control gene expression in different experimental contexts without toxic side effects.
Collapse
Affiliation(s)
- Diana S M Ottoz
- ETH Zurich and Swiss Institute of Bioinformatics, D-BSSE, Mattenstrasse 26, 4058 Basel, Switzerland Life Science Zurich PhD Program on Molecular and Translational Biomedicine, Zurich, Switzerland Competence Center for Systems Physiology and Metabolic Diseases, ETH Zurich, 8093 Zurich, Switzerland
| | - Fabian Rudolf
- ETH Zurich and Swiss Institute of Bioinformatics, D-BSSE, Mattenstrasse 26, 4058 Basel, Switzerland
| | - Joerg Stelling
- ETH Zurich and Swiss Institute of Bioinformatics, D-BSSE, Mattenstrasse 26, 4058 Basel, Switzerland Competence Center for Systems Physiology and Metabolic Diseases, ETH Zurich, 8093 Zurich, Switzerland
| |
Collapse
|
21
|
Ji G, Guan J, Zeng Y, Li QQ, Wu X. Genome-wide identification and predictive modeling of polyadenylation sites in eukaryotes. Brief Bioinform 2014; 16:304-13. [DOI: 10.1093/bib/bbu011] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
|
22
|
A novel protein, Pho92, has a conserved YTH domain and regulates phosphate metabolism by decreasing the mRNA stability of PHO4 in Saccharomyces cerevisiae. Biochem J 2014; 457:391-400. [PMID: 24206186 DOI: 10.1042/bj20130862] [Citation(s) in RCA: 45] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Abstract
The homologue of human YTHDF2, Ydr374c (Pho92), is the only protein that has a YTH (YT521-B homology) domain in Saccharomyces cerevisiae. Based on microarray analysis, genes involved in the phosphate signal transduction (PHO) pathway were up-regulated in the Δpho92 strain, as were genes regulated by Pho4, which is an important transcription factor in the PHO pathway. To identify the exact mechanism of Pho92 action with respect to phosphate metabolism, we investigated the effect of Pho92 on PHO4 expression. The half-life of PHO4 mRNA was increased in the Δpho92 strain; this phenotype was also observed in the deletion mutants UPF1 and POP2, which are components of the NMD (nonsense-mediated decay) pathway and the Pop2-Ccr4-Not deadenylase complex respectively. Pho92 interacts physically with Pop2 of the Pop2-Ccr4-Not deadenylase complex. Furthermore, Pho92 binding to the 3'-UTR of PHO4 was dependent on the phosphate concentration. Deletion of the PHO4 3'-UTR resulted in PHO4 mRNA resistance to Pho92-dependent degradation. The results of the present study indicate that Pho92 regulates Pho4 expression at the post-transcriptional level via the regulation of mRNA stability. Taken together, Pho92 participates in cellular phosphate metabolism, specifically via the regulation of PHO4 mRNA stability by binding to the 3'-UTR in a phosphate-dependent manner.
Collapse
|
23
|
Tanaka M, Tokuoka M, Gomi K. Effects of codon optimization on the mRNA levels of heterologous genes in filamentous fungi. Appl Microbiol Biotechnol 2014; 98:3859-67. [PMID: 24682479 DOI: 10.1007/s00253-014-5609-7] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2014] [Revised: 02/07/2014] [Accepted: 02/10/2014] [Indexed: 10/25/2022]
Abstract
Filamentous fungi, particularly Aspergillus species, have recently attracted attention as host organisms for recombinant protein production. Because the secretory yields of heterologous proteins are generally low compared with those of homologous proteins or proteins from closely related fungal species, several strategies to produce substantial amounts of recombinant proteins have been conducted. Codon optimization is a powerful tool for improving the production levels of heterologous proteins. Although codon optimization is generally believed to improve the translation efficiency of heterologous genes without affecting their mRNA levels, several studies have indicated that codon optimization causes an increase in the steady-state mRNA levels of heterologous genes in filamentous fungi. However, the mechanism that determines the low mRNA levels when native heterologous genes are expressed was poorly understood. We recently showed that the transcripts of heterologous genes are polyadenylated prematurely within the coding region and that the heterologous gene transcripts can be stabilized significantly by codon optimization, which is probably attributable to the prevention of premature polyadenylation in Aspergillus oryzae. In this review, we describe the detailed mechanism of premature polyadenylation and the rapid degradation of mRNA transcripts derived from heterologous genes in filamentous fungi.
Collapse
Affiliation(s)
- Mizuki Tanaka
- Department of Bioindustrial Informatics and Genomics, Laboratory of Bioindustrial Genomics, Graduate School of Agricultural Science, Tohoku University, 1-1 Tsutsumidori-Amamiyamachi, Aoba-ku, Sendai, 981-8555, Japan,
| | | | | |
Collapse
|
24
|
Genome-block expression-assisted association studies discover malaria resistance genes in Anopheles gambiae. Proc Natl Acad Sci U S A 2013; 110:20675-80. [PMID: 24297936 DOI: 10.1073/pnas.1321024110] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
The malaria parasite-resistance island (PRI) of the African mosquito vector, Anopheles gambiae, was mapped to five genomic regions containing 80 genes, using coexpression patterns of genomic blocks. High-throughput sequencing identified 347 nonsynonymous single-nucleotide polymorphisms within these genes in mosquitoes from malaria-endemic areas in Kenya. Direct association studies between nonsynonymous single-nucleotide polymorphisms and Plasmodium falciparum infection identified three naturally occurring genetic variations in each of three genes (An. gambiae adenosine deaminase, fibrinogen-related protein 30, and fibrinogen-related protein 1) that were associated significantly with parasite infection. A role for these genes in the resistance phenotype was confirmed by RNA interference knockdown assays. Silencing fibrinogen-related protein 30 increased parasite infection significantly, whereas ablation of fibrinogen-related protein 1 transcripts resulted in mosquitoes nearly free of parasites. The discovered genes and single-nucleotide polymorphisms are anticipated to be useful in the development of tools for malaria control in endemic areas in Africa.
Collapse
|
25
|
Schlackow M, Marguerat S, Proudfoot NJ, Bähler J, Erban R, Gullerova M. Genome-wide analysis of poly(A) site selection in Schizosaccharomyces pombe. RNA (NEW YORK, N.Y.) 2013; 19:1617-1631. [PMID: 24152550 PMCID: PMC3884648 DOI: 10.1261/rna.040675.113] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/17/2013] [Accepted: 09/17/2013] [Indexed: 05/31/2023]
Abstract
Polyadenylation of pre-mRNAs, a critical step in eukaryotic gene expression, is mediated by cis elements collectively called the polyadenylation signal. Genome-wide analysis of such polyadenylation signals was missing in fission yeast, even though it is an important model organism. We demonstrate that the canonical AATAAA motif is the most frequent and functional polyadenylation signal in Schizosaccharomyces pombe. Using analysis of RNA-Seq data sets from cells grown under various physiological conditions, we identify 3' UTRs for nearly 90% of the yeast genes. Heterogeneity of cleavage sites is common, as is alternative polyadenylation within and between conditions. We validated the computationally identified sequence elements likely to promote polyadenylation by functional assays, including qRT-PCR and 3'RACE analysis. The biological importance of the AATAAA motif is underlined by functional analysis of the genes containing it. Furthermore, it has been shown that convergent genes require trans elements, like cohesin for efficient transcription termination. Here we show that convergent genes lacking cohesin (on chromosome 2) are generally associated with longer overlapping mRNA transcripts. Our bioinformatic and experimental genome-wide results are summarized and can be accessed and customized in a user-friendly database Pomb(A).
Collapse
MESH Headings
- 3' Untranslated Regions
- Base Sequence
- Chromosome Mapping
- Chromosomes, Fungal/genetics
- Gene Expression Regulation, Fungal
- Genes, Fungal
- Genome, Fungal
- Molecular Sequence Annotation
- Molecular Sequence Data
- Polyadenylation
- RNA Cleavage
- RNA, Fungal/genetics
- RNA, Fungal/metabolism
- RNA, Messenger/genetics
- RNA, Messenger/metabolism
- Schizosaccharomyces/genetics
- Schizosaccharomyces/metabolism
- Sequence Analysis, RNA
- Transcription Termination, Genetic
Collapse
Affiliation(s)
- Margarita Schlackow
- Mathematical Institute, University of Oxford, Oxford, Oxfordshire OX1 3LB, United Kingdom
| | - Samuel Marguerat
- Department of Genetics, Evolution and Environment, University College London, London WC1E 6BT, United Kingdom
| | - Nicholas J. Proudfoot
- Sir William Dunn School of Pathology, University of Oxford, Oxford, Oxfordshire OX1 3RE, United Kingdom
| | - Jürg Bähler
- Department of Genetics, Evolution and Environment, University College London, London WC1E 6BT, United Kingdom
| | - Radek Erban
- Mathematical Institute, University of Oxford, Oxford, Oxfordshire OX1 3LB, United Kingdom
| | - Monika Gullerova
- Sir William Dunn School of Pathology, University of Oxford, Oxford, Oxfordshire OX1 3RE, United Kingdom
| |
Collapse
|
26
|
Abstract
Non-coding transcripts play an important role in gene expression regulation in all species, including budding and fission yeast. Such regulatory transcripts include intergenic ncRNA (non-coding RNA), 5′ and 3′ UTRs, introns and antisense transcripts. In the present review, we discuss advantages and limitations of recently developed sequencing techniques, such as ESTs, DNA microarrays, RNA-Seq (RNA sequencing), DRS (direct RNA sequencing) and TIF-Seq (transcript isoform sequencing). We provide an overview of methods applied in yeast and how each of them has contributed to our knowledge of gene expression regulation and transcription.
Collapse
|
27
|
|
28
|
Yamanishi M, Ito Y, Kintaka R, Imamura C, Katahira S, Ikeuchi A, Moriya H, Matsuyama T. A genome-wide activity assessment of terminator regions in Saccharomyces cerevisiae provides a ″terminatome″ toolbox. ACS Synth Biol 2013; 2:337-47. [PMID: 23654277 DOI: 10.1021/sb300116y] [Citation(s) in RCA: 92] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
The terminator regions of eukaryotes encode functional elements in the 3' untranslated region (3'-UTR) that influence the 3'-end processing of mRNA, mRNA stability, and translational efficiency, which can modulate protein production. However, the contribution of these terminator regions to gene expression remains unclear, and therefore their utilization in metabolic engineering or synthetic genetic circuits has been limited. Here, we comprehensively evaluated the activity of 5302 terminator regions from a total of 5880 genes in the budding yeast Saccharomyces cerevisiae by inserting each terminator region downstream of the P TDH3 - green fluorescent protein (GFP) reporter gene and measuring the fluorescent intensity of GFP. Terminator region activities relative to that of the PGK1 standard terminator ranged from 0.036 to 2.52, with a mean of 0.87. We thus could isolate the most and least active terminator regions. The activities of the terminator regions showed a positive correlation with mRNA abundance, indicating that the terminator region is a determinant of mRNA abundance. The least active terminator regions tended to encode longer 3'-UTRs, suggesting the existence of active degradation mechanisms for those mRNAs. The terminator regions of ribosomal protein genes tended to be the most active, suggesting the existence of a common regulator of those genes. The ″terminatome″ (the genome-wide set of terminator regions) thus not only provides valuable information to understand the modulatory roles of terminator regions on gene expression but also serves as a useful toolbox for the development of metabolically and genetically engineered yeast.
Collapse
Affiliation(s)
| | | | - Reiko Kintaka
- Research Core for Interdisciplinary
Sciences, Okayama University, 3-1-1 Tsushima-Naka,
Kita-ku, Okayama, 700-8530, Japan
| | | | | | | | - Hisao Moriya
- Research Core for Interdisciplinary
Sciences, Okayama University, 3-1-1 Tsushima-Naka,
Kita-ku, Okayama, 700-8530, Japan
| | | |
Collapse
|
29
|
Abstract
Systemic response to DNA damage and other stresses is a complex process that includes changes in the regulation and activity of nearly all stages of gene expression. One gene regulatory mechanism used by eukaryotes is selection among alternative transcript isoforms that differ in polyadenylation [poly(A)] sites, resulting in changes either to the coding sequence or to portions of the 3' UTR that govern translation, stability, and localization. To determine the extent to which this means of regulation is used in response to DNA damage, we conducted a global analysis of poly(A) site usage in Saccharomyces cerevisiae after exposure to the UV mimetic, 4-nitroquinoline 1-oxide (4NQO). Two thousand thirty-one genes were found to have significant variation in poly(A) site distributions following 4NQO treatment, with a strong bias toward loss of short transcripts, including many with poly(A) sites located within the protein coding sequence (CDS). We further explored one possible mechanism that could contribute to the widespread differences in mRNA isoforms. The change in poly(A) site profile was associated with an inhibition of cleavage and polyadenylation in cell extract and a decrease in the levels of several key subunits in the mRNA 3'-end processing complex. Sequence analysis identified differences in the cis-acting elements that flank putatively suppressed and enhanced poly(A) sites, suggesting a mechanism that could discriminate between variable and constitutive poly(A) sites. Our analysis indicates that variation in mRNA length is an important part of the regulatory response to DNA damage.
Collapse
|
30
|
RRM-RNA recognition: NMR or crystallography…and new findings. Curr Opin Struct Biol 2012; 23:100-8. [PMID: 23253355 DOI: 10.1016/j.sbi.2012.11.006] [Citation(s) in RCA: 145] [Impact Index Per Article: 12.1] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2012] [Revised: 11/02/2012] [Accepted: 11/20/2012] [Indexed: 12/27/2022]
Abstract
To characterize protein-RNA recognition at the molecular level, structural biology has turned out to be an indispensable approach. Detailed and direct insights into the mechanism of RNA binding and specificity have emerged from protein-RNA structures, especially from the most abundant RNA recognition motif (RRM). Although this protein domain has a very conserved α-β fold, it can recognize a large number of different RNA sequences and shapes and can be involved in a multitude of biological processes. Here, we report on recent single and multiple RRM-RNA structures and point out those features that provide novel insights into the mechanism of RNA recognition by RRMs. We further outline inherent problems to both NMR spectroscopy and X-ray crystallography methods and review recent strategies that emphasize the need to use both methodologies for more rapid and accurate structure determinations.
Collapse
|
31
|
Wu X, Ji G, Zeng Y. In silico prediction of mRNA poly(A) sites in Chlamydomonas reinhardtii. Mol Genet Genomics 2012; 287:895-907. [PMID: 23108961 DOI: 10.1007/s00438-012-0725-5] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2012] [Accepted: 10/20/2012] [Indexed: 12/31/2022]
Abstract
Accurately predicting polyadenylation [poly(A)] sites is important for defining the end of genes and understanding gene regulation mechanisms. Alternative polyadenylation (APA) has been demonstrated to play an important role in transcriptome diversity and regulating gene expression. To accurately predict poly(A) and APA sites in Chlamydomonas reinhardtii, a green alga that can be used to produce renewable energy, we proposed a novel model that integrated five methods for representing the features of these sites with a combined classifier. We presented a new grouping method based on pattern assembly to classify the poly(A) sites into four groups. We used five methods, involving the predicted RNA secondary structure, the term frequency-inverse document frequency weight, first-order Markov chain, pentamer ratio and a position weight matrix, to generate the feature space. We then developed a heuristic method to form the combined classifier by weighting multiple classifiers to predict poly(A) sites in each group. The high specificity and sensitivity of this model were demonstrated by testing the four groups of poly(A) sites and the intronic APA sites. The average prediction performance was approximately 8 % higher than the performance of a previous prediction model. For the group without any conserved patterns, the prediction accuracy was 9 % higher than for the accuracy with the previous technique. However, the prediction efficiency of this group was still significantly lower than that of the other groups, indicating the importance of identifying additional signal patterns for poly(A) site prediction. We also predicted the alternative poly(A) sites in introns with good accuracy. This prediction model was designed to be easily expanded with new classifiers or new features. Therefore, this model is applicable to new data or other species. Our model will be useful both in genome annotation because it predicts the end of a mature transcript and in genetic engineering because it enables researchers to eliminate undesirable poly(A) sites.
Collapse
Affiliation(s)
- Xiaohui Wu
- Department of Automation, Xiamen University, Xiamen 361000, China.
| | | | | |
Collapse
|
32
|
Abstract
Prions are agents of analog, protein conformation-based inheritance that can confer beneficial phenotypes to cells, especially under stress. Combined with genetic variation, prion-mediated inheritance can be channeled into prion-independent genomic inheritance. Latest screening shows that prions are common, at least in fungi. Thus, there is non-negligible flow of information from proteins to the genome in modern cells, in a direct violation of the Central Dogma of molecular biology. The prion-mediated heredity that violates the Central Dogma appears to be a specific, most radical manifestation of the widespread assimilation of protein (epigenetic) variation into genetic variation. The epigenetic variation precedes and facilitates genetic adaptation through a general 'look-ahead effect' of phenotypic mutations. This direction of the information flow is likely to be one of the important routes of environment-genome interaction and could substantially contribute to the evolution of complex adaptive traits.
Collapse
Affiliation(s)
- Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA.
| |
Collapse
|
33
|
Abstract
Polyadenylation [poly(A)] signals (PAS) are a defining feature of eukaryotic protein-coding genes. The central sequence motif AAUAAA was identified in the mid-1970s and subsequently shown to require flanking, auxiliary elements for both 3'-end cleavage and polyadenylation of premessenger RNA (pre-mRNA) as well as to promote downstream transcriptional termination. More recent genomic analysis has established the generality of the PAS for eukaryotic mRNA. Evidence for the mechanism of mRNA 3'-end formation is outlined, as is the way this RNA processing reaction communicates with RNA polymerase II to terminate transcription. The widespread phenomenon of alternative poly(A) site usage and how this interrelates with pre-mRNA splicing is then reviewed. This shows that gene expression can be drastically affected by how the message is ended. A central theme of this review is that while genomic analysis provides generality for the importance of PAS selection, detailed mechanistic understanding still requires the direct analysis of specific genes by genetic and biochemical approaches.
Collapse
Affiliation(s)
- Nick J Proudfoot
- Sir William Dunn School of Pathology, University of Oxford, Oxford OX1 3RE, United Kingdom.
| |
Collapse
|
34
|
Tian B, Graber JH. Signals for pre-mRNA cleavage and polyadenylation. WILEY INTERDISCIPLINARY REVIEWS-RNA 2011; 3:385-96. [PMID: 22012871 DOI: 10.1002/wrna.116] [Citation(s) in RCA: 159] [Impact Index Per Article: 12.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Abstract
Pre-mRNA cleavage and polyadenylation is an essential step for 3' end formation of almost all protein-coding transcripts in eukaryotes. The reaction, involving cleavage of nascent mRNA followed by addition of a polyadenylate or poly(A) tail, is controlled by cis-acting elements in the pre-mRNA surrounding the cleavage site. Experimental and bioinformatic studies in the past three decades have elucidated conserved and divergent elements across eukaryotes, from yeast to human. Here we review histories and current models of these elements in a broad range of species.
Collapse
Affiliation(s)
- Bin Tian
- UMDNJ-New Jersey Medical School, Newark, NJ, USA.
| | | |
Collapse
|
35
|
Copper tolerance of Saccharomyces cerevisiae nonsense-mediated mRNA decay mutants. Curr Genet 2011; 57:421-30. [PMID: 21918884 DOI: 10.1007/s00294-011-0356-0] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2011] [Revised: 08/23/2011] [Accepted: 08/28/2011] [Indexed: 10/17/2022]
Abstract
The eukaryotic nonsense-mediated mRNA (NMD) is a specialized pathway that leads to the recognition and rapid degradation of mRNAs with premature termination codons, and importantly some natural mRNAs as well. Natural mRNAs with atypically long 3'-untranslated regions (UTRs) are degraded by NMD in Saccharomyces cerevisiae. A number of S. cerevisiae mRNAs undergo alternative 3'-end processing producing mRNA isoforms that differ in their 3'-UTR lengths. Some of these alternatively 3'-end processed mRNA isoforms have atypically long 3'-UTRs and would be likely targets for NMD-mediated degradation. Here, we investigated the role NMD plays in the regulation of expression of CTR2, which encodes a vacuolar membrane copper transporter. CTR2 pre-mRNA undergoes alternative 3'-end processing to produce two mRNA isoforms with 300-nt and 2-kb 3'-UTRs. We show that both CTR2 mRNA isoforms are differentially regulated by NMD. The regulation of CTR2 mRNA by NMD has physiological consequences, since nmd mutants are more tolerant to toxic levels of copper relative to wild-type yeast cells and the copper tolerance of nmd mutants is dependent on the presence of CTR2.
Collapse
|
36
|
Doyle CE, Donaldson ME, Morrison EN, Saville BJ. Ustilago maydis transcript features identified through full-length cDNA analysis. Mol Genet Genomics 2011; 286:143-59. [PMID: 21750919 DOI: 10.1007/s00438-011-0634-z] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2011] [Accepted: 06/28/2011] [Indexed: 12/13/2022]
Abstract
Ustilago maydis is the model for investigating basidiomycete biotrophic plant pathogens. To further the annotation of its genome, 12,943 full-length cDNA sequences were used to construct databases for the promoter and untranslated regions of U. maydis genes. A subset of clones was sequenced to determine full cDNA sequences. These and the original ESTs were assembled into contigs representing 3,058, or 45%, of the predicted U. maydis genes. The new sequencing allowed the confirmation of 2,842 gene models, 690 of which contain an intron. The use of full-length cDNA clone sequences ensured that untranslated regions were physically linked to the open reading frames (ORFs), not merely aligned upstream of the start of transcription. Identified sequence features include: (1) over 500 potential short upstream ORFs, (2) 95 gene models that require further annotation, (3) one new potential ORF, (4) varying GC content in different gene regions, (5) a WebLogo motif for the start of translation, (6) the correlation of UTR length with transcript representation in cDNA libraries and with gene function categories, (7) a relationship between natural antisense transcripts and UTR length that differs from that of Saccharomyces cerevisiae, (8) a potential relationship between DNA replication and the control of transcription, and (9) new insights regarding mechanisms for the control of transcription and mRNA maturation in U. maydis.
Collapse
Affiliation(s)
- Colleen E Doyle
- Environmental and Life Sciences Graduate Program, Trent University, Peterborough, ON K9J 7B8, Canada
| | | | | | | |
Collapse
|
37
|
Tanaka M, Sakai Y, Yamada O, Shintani T, Gomi K. In silico analysis of 3'-end-processing signals in Aspergillus oryzae using expressed sequence tags and genomic sequencing data. DNA Res 2011; 18:189-200. [PMID: 21586533 PMCID: PMC3111234 DOI: 10.1093/dnares/dsr011] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
To investigate 3'-end-processing signals in Aspergillus oryzae, we created a nucleotide sequence data set of the 3'-untranslated region (3' UTR) plus 100 nucleotides (nt) sequence downstream of the poly(A) site using A. oryzae expressed sequence tags and genomic sequencing data. This data set comprised 1065 sequences derived from 1042 unique genes. The average 3' UTR length in A. oryzae was 241 nt, which is greater than that in yeast but similar to that in plants. The 3' UTR and 100 nt sequence downstream of the poly(A) site is notably U-rich, while the region located 15-30 nt upstream of the poly(A) site is markedly A-rich. The most frequently found hexanucleotide in this A-rich region is AAUGAA, although this sequence accounts for only 6% of all transcripts. These data suggested that A. oryzae has no highly conserved sequence element equivalent to AAUAAA, a mammalian polyadenylation signal. We identified that putative 3'-end-processing signals in A. oryzae, while less well conserved than those in mammals, comprised four sequence elements: the furthest upstream U-rich element, A-rich sequence, cleavage site, and downstream U-rich element flanking the cleavage site. Although these putative 3'-end-processing signals are similar to those in yeast and plants, some notable differences exist between them.
Collapse
Affiliation(s)
- Mizuki Tanaka
- Laboratory of Bioindustrial Genomics, Department of Bioindustrial Informatics and Genomics, Graduate School of Agricultural Science, Tohoku University, 1-1 Tsutsumidori-Amamiyamachi, Aoba-ku, Sendai 981-8555, Japan
| | | | | | | | | |
Collapse
|
38
|
Yang Q, Doublié S. Structural biology of poly(A) site definition. WILEY INTERDISCIPLINARY REVIEWS-RNA 2011; 2:732-47. [PMID: 21823232 DOI: 10.1002/wrna.88] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
3' processing is an essential step in the maturation of all messenger RNAs (mRNAs) and is a tightly coupled two-step reaction: endonucleolytic cleavage at the poly(A) site is followed by the addition of a poly(A) tail, except for metazoan histone mRNAs, which are cleaved but not polyadenylated. The recognition of a poly(A) site is coordinated by the sequence elements in the mRNA 3' UTR and associated protein factors. In mammalian cells, three well-studied sequence elements, UGUA, AAUAAA, and GU-rich, are recognized by three multisubunit factors: cleavage factor I(m) (CFI(m) ), cleavage and polyadenylation specificity factor (CPSF), and cleavage stimulation factor (CstF), respectively. In the yeast Saccharomyces cerevisiae, UA repeats and A-rich sequence elements are recognized by Hrp1p and cleavage factor IA. Structural studies of protein-RNA complexes have helped decipher the mechanisms underlying sequence recognition and shed light on the role of protein factors in poly(A) site selection and 3' processing machinery assembly. In this review we focus on the interactions between the mRNA cis-elements and the protein factors (CFI(m) , CPSF, CstF, and homologous factors from yeast and other eukaryotes) that define the poly(A) site. WIREs RNA 2011 2 732-747 DOI: 10.1002/wrna.88 For further resources related to this article, please visit the WIREs website.
Collapse
Affiliation(s)
- Qin Yang
- Department of Microbiology and Molecular Genetics, University of Vermont, Burlington, USA
| | | |
Collapse
|
39
|
Ezeokonkwo C, Zhelkovsky A, Lee R, Bohm A, Moore CL. A flexible linker region in Fip1 is needed for efficient mRNA polyadenylation. RNA (NEW YORK, N.Y.) 2011; 17:652-664. [PMID: 21282348 PMCID: PMC3062176 DOI: 10.1261/rna.2273111] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/18/2010] [Accepted: 12/21/2010] [Indexed: 05/30/2023]
Abstract
Synthesis of the poly(A) tail of mRNA in Saccharomyces cerevisiae requires recruitment of the polymerase Pap1 to the 3' end of cleaved pre-mRNA. This is made possible by the tethering of Pap1 to the Cleavage/Polyadenylation Factor (CPF) by Fip1. We have recently reported that Fip1 is an unstructured protein in solution, and proposed that it might maintain this conformation as part of CPF, when bound to Pap1. However, the role that this feature of Fip1 plays in 3' end processing has not been investigated. We show here that Fip1 has a flexible linker in the middle of the protein, and that removal or replacement of the linker affects the efficiency of polyadenylation. However, the point of tethering is not crucial, as a fusion protein of Pap1 and Fip1 is fully functional in cells lacking genes encoding the essential individual proteins, and directly tethering Pap1 to RNA increases the rate of poly(A) addition. We also find that the linker region of Fip1 provides a platform for critical interactions with other parts of the processing machinery. Our results indicate that the Fip1 linker, through its flexibility and protein/protein interactions, allows Pap1 to reach the 3' end of the cleaved RNA and efficiently initiate poly(A) addition.
Collapse
Affiliation(s)
- Chukwudi Ezeokonkwo
- Department of Biochemistry, Tufts University School of Medicine and the Sackler Graduate School of Biomedical Sciences, Boston, Massachusetts 02111, USA
| | | | | | | | | |
Collapse
|
40
|
Ji G, Wu X, Shen Y, Huang J, Quinn Li Q. A classification-based prediction model of messenger RNA polyadenylation sites. J Theor Biol 2010; 265:287-96. [DOI: 10.1016/j.jtbi.2010.05.015] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2009] [Revised: 03/21/2010] [Accepted: 05/13/2010] [Indexed: 12/30/2022]
|
41
|
Pedrini N, Zhang S, Juárez MP, Keyhani NO. Molecular characterization and expression analysis of a suite of cytochrome P450 enzymes implicated in insect hydrocarbon degradation in the entomopathogenic fungus Beauveria bassiana. Microbiology (Reading) 2010; 156:2549-2557. [DOI: 10.1099/mic.0.039735-0] [Citation(s) in RCA: 64] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The insect epicuticle or waxy layer comprises a heterogeneous mixture of lipids that include abundant levels of long-chain alkanes, alkenes, wax esters and fatty acids. This structure represents the first barrier against microbial attack and for broad-host-range insect pathogens, such as Beauveria bassiana, it is the initial interface mediating the host–pathogen interaction, since these organisms do not require any specialized mode of entry and infect target hosts via the cuticle. B. bassiana is able to grow on straight chain alkanes up to n-C33 as a sole source of carbon and energy. The cDNA and genomic sequences, including putative regulatory elements, for eight cytochrome P450 enzymes, postulated to be involved in alkane and insect epicuticle degradation, were isolated and characterized. Expression studies using a range of alkanes as well as an insect-derived epicuticular extract from the blood-sucking bug Triatomas infestans revealed a differential expression pattern for the P450 genes examined, and suggest that B. bassiana contains a series of hydrocarbon-assimilating enzymes with overlapping specificity in order to target the surface lipids of insect hosts. Phylogenetic analysis of the translated ORFs of the sequences revealed that the enzyme which displayed the highest levels of induction on both alkanes and the insect epicuticular extract represents the founding member of a new cytochrome P450 family, with three of the other sequences assigned as the first members of new P450 subfamilies. The remaining four proteins clustered with known P450 families whose members include alkane monooxygenases.
Collapse
Affiliation(s)
- Nicolás Pedrini
- Instituto de Investigaciones Bioquímicas de La Plata, CONICET, Facultad de Ciencias Médicas, UNLP, Calles 60 y 120 (1900), La Plata, Argentina
- Department of Microbiology and Cell Science, University of Florida, Gainesville, FL 32611, USA
| | - Shizhu Zhang
- Department of Microbiology and Cell Science, University of Florida, Gainesville, FL 32611, USA
| | - M. Patricia Juárez
- Instituto de Investigaciones Bioquímicas de La Plata, CONICET, Facultad de Ciencias Médicas, UNLP, Calles 60 y 120 (1900), La Plata, Argentina
| | - Nemat O. Keyhani
- Department of Microbiology and Cell Science, University of Florida, Gainesville, FL 32611, USA
| |
Collapse
|
42
|
Leeper TC, Qu X, Lu C, Moore C, Varani G. Novel protein-protein contacts facilitate mRNA 3'-processing signal recognition by Rna15 and Hrp1. J Mol Biol 2010; 401:334-49. [PMID: 20600122 DOI: 10.1016/j.jmb.2010.06.032] [Citation(s) in RCA: 44] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2009] [Revised: 06/10/2010] [Accepted: 06/10/2010] [Indexed: 12/22/2022]
Abstract
Precise 3'-end processing of mRNA is essential for correct gene expression, yet in yeast, 3'-processing signals consist of multiple ambiguous sequence elements. Two neighboring elements upstream of the cleavage site are particularly important for the accuracy (positioning element) and efficiency (efficiency element) of 3'-processing and are recognized by the RNA-binding proteins Rna15 and Hrp1, respectively. In vivo, these interactions are strengthened by the scaffolding protein Rna14 that stabilizes their association. The NMR structure of the 34 -kDa ternary complex of the RNA recognition motif (RRM) domains of Hrp1 and Rna15 bound to this pair of RNA elements was determined by residual dipolar coupling and paramagnetic relaxation experiments. It reveals how each of the proteins binds to RNA and introduces a novel class of protein-protein contact in regions of previously unknown function. These interdomain contacts had previously been overlooked in other multi-RRM structures, although a careful analysis suggests that they may be frequently present. Mutations in the regions of these contacts disrupt 3'-end processing, suggesting that they may structurally organize the ribonucleoprotein complexes responsible for RNA processing.
Collapse
Affiliation(s)
- Thomas C Leeper
- Department of Chemistry, University of Washington, Seattle, WA 98195-1700, USA.
| | | | | | | | | |
Collapse
|
43
|
Pancevac C, Goldstone DC, Ramos A, Taylor IA. Structure of the Rna15 RRM-RNA complex reveals the molecular basis of GU specificity in transcriptional 3'-end processing factors. Nucleic Acids Res 2010; 38:3119-32. [PMID: 20097654 PMCID: PMC2875009 DOI: 10.1093/nar/gkq002] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open
Abstract
Rna15 is a core subunit of cleavage factor IA (CFIA), an essential transcriptional 3′-end processing factor from Saccharomyces cerevisiae. CFIA is required for polyA site selection/cleavage targeting RNA sequences that surround polyadenylation sites in the 3′-UTR of RNA polymerase-II transcripts. RNA recognition by CFIA is mediated by an RNA recognition motif (RRM) contained in the Rna15 subunit of the complex. We show here that Rna15 has a strong and unexpected preference for GU containing RNAs and reveal the molecular basis for a base selectivity mechanism that accommodates G or U but discriminates against C and A bases. This mode of base selectivity is rather different to that observed in other RRM-RNA structures and is structurally conserved in CstF64, the mammalian counterpart of Rna15. Our observations provide evidence for a highly conserved mechanism of base recognition amongst the 3′-end processing complexes that interact with the U-rich or U/G-rich elements at 3′-end cleavage/polyadenylation sites.
Collapse
Affiliation(s)
- Christina Pancevac
- Division of Molecular Structure, MRC National Institute for Medical Research, The Ridgeway, Mill Hill, London NW7 1AA, UK
| | | | | | | |
Collapse
|
44
|
Unravelling cis-regulatory elements in the genome of the smallest photosynthetic eukaryote: phylogenetic footprinting in Ostreococcus. J Mol Evol 2009; 69:249-59. [PMID: 19693423 DOI: 10.1007/s00239-009-9271-0] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2009] [Revised: 07/17/2009] [Accepted: 07/27/2009] [Indexed: 10/20/2022]
Abstract
We used a phylogenetic footprinting approach, adapted to high levels of divergence, to estimate the level of constraint in intergenic regions of the extremely gene dense Ostreococcus algae genomes (Chlorophyta, Prasinophyceae). We first benchmarked our method against the Saccharomyces sensu stricto genome data and found that the proportion of conserved non-coding sites was consistent with those obtained with methods using calibration by the neutral substitution rate. We then applied our method to the complete genomes of Ostreococcus tauri and O. lucimarinus, which are the most divergent species from the same genus sequenced so far. We found that 77% of intergenic regions in Ostreococcus still contain some phylogenetic footprints, as compared to 88% for Saccharomyces, corresponding to an average rate of constraint on intergenic region of 17% and 30%, respectively. A comparison with some known functional cis-regulatory elements enabled us to investigate whether some transcriptional regulatory pathways were conserved throughout the green lineage. Strikingly, the size of the phylogenetic footprints depends on gene orientation of neighboring genes, and appears to be genus-specific. In Ostreococcus, 5' intergenic regions contain four times more conserved sites than 3' intergenic regions, whereas in yeast a higher frequency of constrained sites in intergenic regions between genes on the same DNA strand suggests a higher frequency of bidirectional regulatory elements. The phylogenetic footprinting approach can be used despite high levels of divergence in the ultrasmall Ostreococcus algae, to decipher structure of constrained regulatory motifs, and identify putative regulatory pathways conserved within the green lineage.
Collapse
|
45
|
Kebaara BW, Atkin AL. Long 3'-UTRs target wild-type mRNAs for nonsense-mediated mRNA decay in Saccharomyces cerevisiae. Nucleic Acids Res 2009; 37:2771-8. [PMID: 19270062 PMCID: PMC2685090 DOI: 10.1093/nar/gkp146] [Citation(s) in RCA: 87] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023] Open
Abstract
The nonsense-mediated mRNA decay (NMD) pathway, present in most eukaryotic cells, is a specialized pathway that leads to the recognition and rapid degradation of mRNAs with premature termination codons and, importantly, some wild-type mRNAs. Earlier studies demonstrated that aberrant mRNAs with artificially extended 3′-untranslated regions (3′-UTRs) are degraded by NMD. However, the extent to which wild-type mRNAs with long 3′-UTRs are degraded by NMD is not known. We used a global approach to identify wild-type mRNAs in Saccharomyces cerevisiae that have longer than expected 3′-UTRs, and of these mRNAs tested, 91% were degraded by NMD. We demonstrate for the first time that replacement of the natural, long 3′-UTR from wild-type PGA1 mRNA, which encodes a protein that is important for cell wall biosynthesis, with a short 3′-UTR renders it immune to NMD. The natural PGA1 3′-UTR is sufficient to target a NMD insensitive mRNA for decay by the NMD pathway. Finally, we show that nmd mutants are sensitive to Calcofluor White, which suggests that the regulation of PGA1 and other cell wall biosynthesis proteins by NMD is physiologically significant.
Collapse
Affiliation(s)
- Bessie W Kebaara
- School of Biological Sciences, University of Nebraska-Lincoln, Lincoln, NE 68588, USA
| | | |
Collapse
|
46
|
Kaplan N, Moore IK, Fondufe-Mittendorf Y, Gossett AJ, Tillo D, Field Y, LeProust EM, Hughes TR, Lieb JD, Widom J, Segal E. The DNA-encoded nucleosome organization of a eukaryotic genome. Nature 2008; 458:362-6. [PMID: 19092803 DOI: 10.1038/nature07667] [Citation(s) in RCA: 879] [Impact Index Per Article: 54.9] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2008] [Accepted: 11/26/2008] [Indexed: 11/09/2022]
Abstract
Nucleosome organization is critical for gene regulation. In living cells this organization is determined by multiple factors, including the action of chromatin remodellers, competition with site-specific DNA-binding proteins, and the DNA sequence preferences of the nucleosomes themselves. However, it has been difficult to estimate the relative importance of each of these mechanisms in vivo, because in vivo nucleosome maps reflect the combined action of all influencing factors. Here we determine the importance of nucleosome DNA sequence preferences experimentally by measuring the genome-wide occupancy of nucleosomes assembled on purified yeast genomic DNA. The resulting map, in which nucleosome occupancy is governed only by the intrinsic sequence preferences of nucleosomes, is similar to in vivo nucleosome maps generated in three different growth conditions. In vitro, nucleosome depletion is evident at many transcription factor binding sites and around gene start and end sites, indicating that nucleosome depletion at these sites in vivo is partly encoded in the genome. We confirm these results with a micrococcal nuclease-independent experiment that measures the relative affinity of nucleosomes for approximately 40,000 double-stranded 150-base-pair oligonucleotides. Using our in vitro data, we devise a computational model of nucleosome sequence preferences that is significantly correlated with in vivo nucleosome occupancy in Caenorhabditis elegans. Our results indicate that the intrinsic DNA sequence preferences of nucleosomes have a central role in determining the organization of nucleosomes in vivo.
Collapse
Affiliation(s)
- Noam Kaplan
- Department of Computer Science and Applied Mathematics, Weizmann Institute of Science, Rehovot 76100, Israel
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
47
|
Unique features of nuclear mRNA poly(A) signals and alternative polyadenylation in Chlamydomonas reinhardtii. Genetics 2008; 179:167-76. [PMID: 18493049 DOI: 10.1534/genetics.108.088971] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
To understand nuclear mRNA polyadenylation mechanisms in the model alga Chlamydomonas reinhardtii, we generated a data set of 16,952 in silico-verified poly(A) sites from EST sequencing traces based on Chlamydomonas Genome Assembly v.3.1. Analysis of this data set revealed a unique and complex polyadenylation signal profile that is setting Chlamydomonas apart from other organisms. In contrast to the high-AU content in the 3'-UTRs of other organisms, Chlamydomonas shows a high-guanylate content that transits to high-cytidylate around the poly(A) site. The average length of the 3'-UTR is 595 nucleotides (nt), significantly longer than that of Arabidopsis and rice. The dominant poly(A) signal, UGUAA, was found in 52% of the near-upstream elements, and its occurrence may be positively correlated with higher gene expression levels. The UGUAA signal also exists in Arabidopsis and in some mammalian genes but mainly in the far-upstream elements, suggesting a shift in function. The C-rich region after poly(A) sites with unique signal elements is a characteristic downstream element that is lacking in higher plants. We also found a high level of alternative polyadenylation in the Chlamydomonas genome, with a range of up to 33% of the 4057 genes analyzed having at least two unique poly(A) sites and approximately 1% of these genes having poly(A) sites residing in predicted coding sequences, introns, and 5'-UTRs. These potentially contribute to transcriptome diversity and gene expression regulation.
Collapse
|
48
|
Van Bogaert INA, De Maeseneire SL, Develter D, Soetaert W, Vandamme EJ. Cloning and characterisation of the glyceraldehyde 3-phosphate dehydrogenase gene of Candida bombicola and use of its promoter. J Ind Microbiol Biotechnol 2008; 35:1085-92. [DOI: 10.1007/s10295-008-0386-x] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2008] [Accepted: 06/04/2008] [Indexed: 11/25/2022]
|
49
|
Gill EE, Becnel JJ, Fast NM. ESTs from the microsporidian Edhazardia aedis. BMC Genomics 2008; 9:296. [PMID: 18570666 PMCID: PMC2474625 DOI: 10.1186/1471-2164-9-296] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2007] [Accepted: 06/20/2008] [Indexed: 11/10/2022] Open
Abstract
Background Microsporidia are a group of parasites related to fungi that infect a wide variety of animals and have gained recognition from the medical community in the past 20 years due to their ability to infect immuno-compromised humans. Microsporidian genomes range in size from 2.3 to 19.5 Mbp, but almost all of our knowledge comes from species that have small genomes (primarily from the human parasite Encephalitozoon cuniculi and the locust parasite Antonospora locustae). We have conducted an EST survey of the mosquito parasite Edhazardia aedis, which has an estimated genome size several times that of more well-studied species. The only other microsporidian EST project is from A. locustae, and serves as a basis for comparison with E. aedis. Results The spore transcriptomes of A. locustae and E. aedis were compared and the numbers of unique transcripts that belong to each COG (Clusters of Orthologous Groups of proteins) category differ by at most 5%. The transcripts themselves have widely varying start sites and encode a number of proteins that have not been found in other microsporidia examined to date. However, E. aedis seems to lack the multi-gene transcripts present in A. locustae and E. cuniculi. We also present the first documented case of transcription of a transposable element in microsporidia. Conclusion Although E. aedis and A. locustae are distantly related, have very disparate life cycles and contain genomes estimated to be vastly different sizes, their patterns of transcription are similar. The architecture of the ancestral microsporidian genome is unknown, but the presence of genes in E. aedis that have not been found in other microsporidia suggests that extreme genome reduction and compaction is lineage specific and not typical of all microsporidia.
Collapse
Affiliation(s)
- Erin E Gill
- Department of Botany, University of British Columbia, Vancouver, BC, V6T 1Z4, Canada.
| | | | | |
Collapse
|
50
|
Shen Y, Ji G, Haas BJ, Wu X, Zheng J, Reese GJ, Li QQ. Genome level analysis of rice mRNA 3'-end processing signals and alternative polyadenylation. Nucleic Acids Res 2008; 36:3150-61. [PMID: 18411206 PMCID: PMC2396415 DOI: 10.1093/nar/gkn158] [Citation(s) in RCA: 116] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2007] [Revised: 03/18/2008] [Accepted: 03/19/2008] [Indexed: 12/24/2022] Open
Abstract
The position of a poly(A) site of eukaryotic mRNA is determined by sequence signals in pre-mRNA and a group of polyadenylation factors. To reveal rice poly(A) signals at a genome level, we constructed a dataset of 55 742 authenticated poly(A) sites and characterized the poly(A) signals. This resulted in identifying the typical tripartite cis-elements, including FUE, NUE and CE, as previously observed in Arabidopsis. The average size of the 3'-UTR was 289 nucleotides. When mapped to the genome, however, 15% of these poly(A) sites were found to be located in the currently annotated intergenic regions. Moreover, an extensive alternative polyadenylation profile was evident where 50% of the genes analyzed had more than one unique poly(A) site (excluding microheterogeneity sites), and 13% had four or more poly(A) sites. About 4% of the analyzed genes possessed alternative poly(A) sites at their introns, 5'-UTRs, or protein coding regions. The authenticity of these alternative poly(A) sites was partially confirmed using MPSS data. Analysis of nucleotide profile and signal patterns indicated that there may be a different set of poly(A) signals for those poly(A) sites found in the coding regions. Based on the features of rice poly(A) signals, an updated algorithm termed PASS-Rice was designed to predict poly(A) sites.
Collapse
Affiliation(s)
- Yingjia Shen
- Department of Botany, Miami University, Oxford, OH 45056, USA, Department of Automation, Xiamen University, Xiamen, Fujian, China 361005, The Genome Research Institute, Rockville, MD 20850 and IT Research Computing Support Group, Miami University, Oxford, OH 45056, USA
| | - Guoli Ji
- Department of Botany, Miami University, Oxford, OH 45056, USA, Department of Automation, Xiamen University, Xiamen, Fujian, China 361005, The Genome Research Institute, Rockville, MD 20850 and IT Research Computing Support Group, Miami University, Oxford, OH 45056, USA
| | - Brian J. Haas
- Department of Botany, Miami University, Oxford, OH 45056, USA, Department of Automation, Xiamen University, Xiamen, Fujian, China 361005, The Genome Research Institute, Rockville, MD 20850 and IT Research Computing Support Group, Miami University, Oxford, OH 45056, USA
| | - Xiaohui Wu
- Department of Botany, Miami University, Oxford, OH 45056, USA, Department of Automation, Xiamen University, Xiamen, Fujian, China 361005, The Genome Research Institute, Rockville, MD 20850 and IT Research Computing Support Group, Miami University, Oxford, OH 45056, USA
| | - Jianti Zheng
- Department of Botany, Miami University, Oxford, OH 45056, USA, Department of Automation, Xiamen University, Xiamen, Fujian, China 361005, The Genome Research Institute, Rockville, MD 20850 and IT Research Computing Support Group, Miami University, Oxford, OH 45056, USA
| | - Greg J. Reese
- Department of Botany, Miami University, Oxford, OH 45056, USA, Department of Automation, Xiamen University, Xiamen, Fujian, China 361005, The Genome Research Institute, Rockville, MD 20850 and IT Research Computing Support Group, Miami University, Oxford, OH 45056, USA
| | - Qingshun Quinn Li
- Department of Botany, Miami University, Oxford, OH 45056, USA, Department of Automation, Xiamen University, Xiamen, Fujian, China 361005, The Genome Research Institute, Rockville, MD 20850 and IT Research Computing Support Group, Miami University, Oxford, OH 45056, USA
| |
Collapse
|