1
|
Murari E, Meadows D, Cuda N, Mangone M. A comprehensive analysis of 3'UTRs in Caenorhabditis elegans. Nucleic Acids Res 2024; 52:7523-7538. [PMID: 38917330 PMCID: PMC11260456 DOI: 10.1093/nar/gkae543] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2024] [Revised: 04/29/2024] [Accepted: 06/11/2024] [Indexed: 06/27/2024] Open
Abstract
3'Untranslated regions (3'UTRs) are essential portions of genes containing elements necessary for pre-mRNA 3'end processing and are involved in post-transcriptional gene regulation. Despite their importance, they remain poorly characterized in eukaryotes. Here, we have used a multi-pronged approach to extract and curate 3'UTR data from 11533 publicly available datasets, corresponding to the entire collection of Caenorhabditis elegans transcriptomes stored in the NCBI repository from 2009 to 2023. We have also performed high throughput cloning pipelines to identify and validate rare 3'UTR isoforms and incorporated and manually curated 3'UTR isoforms from previously published datasets. This updated C. elegans 3'UTRome (v3) is the most comprehensive resource in any metazoan to date, covering 97.4% of the 20362 experimentally validated protein-coding genes with refined and updated 3'UTR boundaries for 23489 3'UTR isoforms. We also used this novel dataset to identify and characterize sequence elements involved in pre-mRNA 3'end processing and update miRNA target predictions. This resource provides important insights into the 3'UTR formation, function, and regulation in eukaryotes.
Collapse
Affiliation(s)
- Emma Murari
- The Biodesign Institute at Arizona State University, 1001 S McAllister Ave, Tempe, AZ, USA
- School of Life Sciences, Arizona State University, 427 E Tyler Mall, Tempe, AZ, USA
| | - Dalton Meadows
- The Biodesign Institute at Arizona State University, 1001 S McAllister Ave, Tempe, AZ, USA
- School of Life Sciences, Arizona State University, 427 E Tyler Mall, Tempe, AZ, USA
| | - Nicholas Cuda
- The Biodesign Institute at Arizona State University, 1001 S McAllister Ave, Tempe, AZ, USA
- School of Life Sciences, Arizona State University, 427 E Tyler Mall, Tempe, AZ, USA
| | - Marco Mangone
- The Biodesign Institute at Arizona State University, 1001 S McAllister Ave, Tempe, AZ, USA
| |
Collapse
|
2
|
Zhukova M, Schedl P, Shidlovskii YV. The role of secondary structures in the functioning of 3' untranslated regions of mRNA: A review of functions of 3' UTRs' secondary structures and hypothetical involvement of secondary structures in cytoplasmic polyadenylation in Drosophila. Bioessays 2024; 46:e2300099. [PMID: 38161240 DOI: 10.1002/bies.202300099] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2023] [Revised: 12/11/2023] [Accepted: 12/12/2023] [Indexed: 01/03/2024]
Abstract
3' untranslated regions (3' UTRs) of mRNAs have many functions, including mRNA processing and transport, translational regulation, and mRNA degradation and stability. These different functions require cis-elements in 3' UTRs that can be either sequence motifs or RNA structures. Here we review the role of secondary structures in the functioning of 3' UTRs and discuss some of the trans-acting factors that interact with these secondary structures in eukaryotic organisms. We propose potential participation of 3'-UTR secondary structures in cytoplasmic polyadenylation in the model organism Drosophila melanogaster. Because the secondary structures of 3' UTRs are essential for post-transcriptional regulation of gene expression, their disruption leads to a wide range of disorders, including cancer and cardiovascular diseases. Trans-acting factors, such as STAU1 and nucleolin, which interact with 3'-UTR secondary structures of target transcripts, influence the pathogenesis of neurodegenerative diseases and tumor metastasis, suggesting that they are possible therapeutic targets.
Collapse
Affiliation(s)
- Mariya Zhukova
- Laboratory of Gene Expression Regulation in Development, Russian Academy of Sciences, Institute of Gene Biology, Moscow, Russia
| | - Paul Schedl
- Laboratory of Gene Expression Regulation in Development, Russian Academy of Sciences, Institute of Gene Biology, Moscow, Russia
- Department of Molecular Biology, Princeton University, Princeton, New Jersey, USA
| | - Yulii V Shidlovskii
- Laboratory of Gene Expression Regulation in Development, Russian Academy of Sciences, Institute of Gene Biology, Moscow, Russia
- Department of Biology and General Genetics, Sechenov First Moscow State Medical University (Sechenov University), Moscow, Russia
| |
Collapse
|
3
|
Zhang Q, Tian B. The emerging theme of 3'UTR mRNA isoform regulation in reprogramming of cell metabolism. Biochem Soc Trans 2023; 51:1111-1119. [PMID: 37171086 PMCID: PMC10771799 DOI: 10.1042/bst20221128] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2023] [Revised: 03/26/2023] [Accepted: 04/19/2023] [Indexed: 05/13/2023]
Abstract
The 3' untranslated region (3'UTR) of mRNA plays a key role in the post-transcriptional regulation of gene expression. Most eukaryotic protein-coding genes express 3'UTR isoforms owing to alternative cleavage and polyadenylation (APA). The 3'UTR isoform expression profile of a cell changes in cell proliferation, differentiation, and stress conditions. Here, we review the emerging theme of regulation of 3'UTR isoforms in cell metabolic reprogramming, focusing on cell growth and autophagy responses through the mTOR pathway. We discuss regulatory events that converge on the Cleavage Factor I complex, a master regulator of APA in 3'UTRs, and recent understandings of isoform-specific m6A modification and endomembrane association in determining differential metabolic fates of 3'UTR isoforms.
Collapse
Affiliation(s)
- Qiang Zhang
- Gene Expression and Regulation Program and Center for Systems and Computational Biology, The Wistar Institute, Philadelphia, PA 19104, U.S.A
| | - Bin Tian
- Gene Expression and Regulation Program and Center for Systems and Computational Biology, The Wistar Institute, Philadelphia, PA 19104, U.S.A
| |
Collapse
|
4
|
Wen H, Chen W, Chen Y, Wei G, Ni T. Integrative analysis of Iso-Seq and RNA-seq reveals dynamic changes of alternative promoter, alternative splicing and alternative polyadenylation during Angiotensin II-induced senescence in rat primary aortic endothelial cells. Front Genet 2023; 14:1064624. [PMID: 36741323 PMCID: PMC9892061 DOI: 10.3389/fgene.2023.1064624] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2022] [Accepted: 01/10/2023] [Indexed: 01/21/2023] Open
Abstract
In eukaryotes, alternative promoter (AP), alternative splicing (AS), and alternative polyadenylation (APA) are three crucial regulatory mechanisms that modulate message RNA (mRNA) diversity. Although AP, AS and APA are involved in diverse biological processess, whether they have dynamic changes in Angiotensin II (Ang II) induced senescence in rat primary aortic endothelial cells (RAECs), an important cellular model for studying cardiovascular disease, remains unclear. Here we integrated both PacBio single-molecule long-read isoform sequencing (Iso-Seq) and Illumina short-read RNA sequencing (RNA-seq) to analyze the changes of AP, AS and APA in Ang II-induced senescent RAECs. Iso-Seq generated 36,278 isoforms from 10,145 gene loci and 65.81% of these isoforms are novel, which were further cross-validated by public data obtained by other techonologies such as CAGE, PolyA-Seq and 3'READS. APA contributed most to novel isoforms, followed by AS and AP. Further investigation showed that AP, AS and APA could all contribute to the regulation of isoform, but AS has more dynamic changes compared to AP and APA upon Ang II stimulation. Genes undergoing AP, AS and APA in Ang II-treated cells are enriched in various pathways related to aging or senescence, suggesting that these molecular changes are involved in functional alterations during Ang II-induced senescence. Together, the present study largely improved the annotation of rat genome and revealed gene expression changes at isoform level, extending the understanding of the complexity of gene regulation in Ang II-treated RAECs, and also provided novel clues for discovering the regulatory mechanism undelying Ang II caused vascular senescence and diseases.
Collapse
Affiliation(s)
- Haimei Wen
- Collaborative Innovation Center of Genetics and Development, School of Life Sciences, Human Phenome Institute, Fudan University, Shanghai, China,Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, China
| | - Wei Chen
- Collaborative Innovation Center of Genetics and Development, School of Life Sciences, Human Phenome Institute, Fudan University, Shanghai, China
| | - Yu Chen
- Collaborative Innovation Center of Genetics and Development, School of Life Sciences, Human Phenome Institute, Fudan University, Shanghai, China
| | - Gang Wei
- Collaborative Innovation Center of Genetics and Development, School of Life Sciences, Human Phenome Institute, Fudan University, Shanghai, China,Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, China,*Correspondence: Ting Ni, ; Gang Wei,
| | - Ting Ni
- Collaborative Innovation Center of Genetics and Development, School of Life Sciences, Human Phenome Institute, Fudan University, Shanghai, China,*Correspondence: Ting Ni, ; Gang Wei,
| |
Collapse
|
5
|
Binzel DW, Li X, Burns N, Khan E, Lee WJ, Chen LC, Ellipilli S, Miles W, Ho YS, Guo P. Thermostability, Tunability, and Tenacity of RNA as Rubbery Anionic Polymeric Materials in Nanotechnology and Nanomedicine-Specific Cancer Targeting with Undetectable Toxicity. Chem Rev 2021; 121:7398-7467. [PMID: 34038115 DOI: 10.1021/acs.chemrev.1c00009] [Citation(s) in RCA: 37] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
RNA nanotechnology is the bottom-up self-assembly of nanometer-scale architectures, resembling LEGOs, composed mainly of RNA. The ideal building material should be (1) versatile and controllable in shape and stoichiometry, (2) spontaneously self-assemble, and (3) thermodynamically, chemically, and enzymatically stable with a long shelf life. RNA building blocks exhibit each of the above. RNA is a polynucleic acid, making it a polymer, and its negative-charge prevents nonspecific binding to negatively charged cell membranes. The thermostability makes it suitable for logic gates, resistive memory, sensor set-ups, and NEM devices. RNA can be designed and manipulated with a level of simplicity of DNA while displaying versatile structure and enzyme activity of proteins. RNA can fold into single-stranded loops or bulges to serve as mounting dovetails for intermolecular or domain interactions without external linking dowels. RNA nanoparticles display rubber- and amoeba-like properties and are stretchable and shrinkable through multiple repeats, leading to enhanced tumor targeting and fast renal excretion to reduce toxicities. It was predicted in 2014 that RNA would be the third milestone in pharmaceutical drug development. The recent approval of several RNA drugs and COVID-19 mRNA vaccines by FDA suggests that this milestone is being realized. Here, we review the unique properties of RNA nanotechnology, summarize its recent advancements, describe its distinct attributes inside or outside the body and discuss potential applications in nanotechnology, medicine, and material science.
Collapse
Affiliation(s)
- Daniel W Binzel
- Center for RNA Nanobiotechnology and Nanomedicine, College of Pharmacy, Dorothy M. Davis Heart and Lung Research Institute, James Comprehensive Cancer Center, College of Medicine, The Ohio State University, Columbus, Ohio 43210, United States
| | - Xin Li
- Center for RNA Nanobiotechnology and Nanomedicine, College of Pharmacy, Dorothy M. Davis Heart and Lung Research Institute, James Comprehensive Cancer Center, College of Medicine, The Ohio State University, Columbus, Ohio 43210, United States
| | - Nicolas Burns
- Center for RNA Nanobiotechnology and Nanomedicine, College of Pharmacy, Dorothy M. Davis Heart and Lung Research Institute, James Comprehensive Cancer Center, College of Medicine, The Ohio State University, Columbus, Ohio 43210, United States
| | - Eshan Khan
- Department of Cancer Biology and Genetics, The Ohio State University Comprehensive Cancer Center, College of Medicine, Center for RNA Biology, The Ohio State University, Columbus, Ohio 43210, United States
| | - Wen-Jui Lee
- TMU Research Center of Cancer Translational Medicine, School of Medical Laboratory Science and Biotechnology, College of Medical Science and Technology, Graduate Institute of Medical Sciences, College of Medicine, Taipei Medical University, Department of Laboratory Medicine, Taipei Medical University Hospital, Taipei 110, Taiwan
| | - Li-Ching Chen
- TMU Research Center of Cancer Translational Medicine, School of Medical Laboratory Science and Biotechnology, College of Medical Science and Technology, Graduate Institute of Medical Sciences, College of Medicine, Taipei Medical University, Department of Laboratory Medicine, Taipei Medical University Hospital, Taipei 110, Taiwan
| | - Satheesh Ellipilli
- Center for RNA Nanobiotechnology and Nanomedicine, College of Pharmacy, Dorothy M. Davis Heart and Lung Research Institute, James Comprehensive Cancer Center, College of Medicine, The Ohio State University, Columbus, Ohio 43210, United States
| | - Wayne Miles
- Department of Cancer Biology and Genetics, The Ohio State University Comprehensive Cancer Center, College of Medicine, Center for RNA Biology, The Ohio State University, Columbus, Ohio 43210, United States
| | - Yuan Soon Ho
- TMU Research Center of Cancer Translational Medicine, School of Medical Laboratory Science and Biotechnology, College of Medical Science and Technology, Graduate Institute of Medical Sciences, College of Medicine, Taipei Medical University, Department of Laboratory Medicine, Taipei Medical University Hospital, Taipei 110, Taiwan
| | - Peixuan Guo
- Center for RNA Nanobiotechnology and Nanomedicine, College of Pharmacy, Dorothy M. Davis Heart and Lung Research Institute, James Comprehensive Cancer Center, College of Medicine, The Ohio State University, Columbus, Ohio 43210, United States
| |
Collapse
|
6
|
Chen M, Wei R, Wei G, Xu M, Su Z, Zhao C, Ni T. Systematic evaluation of the effect of polyadenylation signal variants on the expression of disease-associated genes. Genome Res 2021; 31:890-899. [PMID: 33875481 PMCID: PMC8092010 DOI: 10.1101/gr.270256.120] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2020] [Accepted: 03/02/2021] [Indexed: 01/18/2023]
Abstract
Single nucleotide variants (SNVs) within polyadenylation signals (PASs), a specific six-nucleotide sequence required for mRNA maturation, can impair RNA-level gene expression and cause human diseases. However, there is a lack of genome-wide investigation and systematic confirmation tools for identifying PAS variants. Here, we present a computational strategy to integrate the most reliable resources for discovering distinct genomic features of PAS variants and also develop a credible and convenient experimental tool to validate the effect of PAS variants on expression of disease-associated genes. This approach will greatly accelerate the deciphering of PAS variation-related human diseases.
Collapse
Affiliation(s)
- Meng Chen
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center of Genetics and Development, Human Phenome Institute, School of Life Sciences and Eye & ENT Hospital, Fudan University, Shanghai, 200438, China.,Eye Institute, Eye & ENT Hospital, Shanghai Medical College, Fudan University, Shanghai, 200031, China.,NHC Key Laboratory of Myopia (Fudan University), Key Laboratory of Myopia, Chinese Academy of Medical Sciences, and Shanghai Key Laboratory of Visual Impairment and Restoration (Fudan University), Shanghai, 200031, China
| | - Ran Wei
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center of Genetics and Development, Human Phenome Institute, School of Life Sciences and Huashan Hospital, Fudan University, Shanghai, 200438, China.,Department of Pathology, Fudan University Shanghai Cancer Center, Department of Oncology, Shanghai Medical College, Fudan University, Shanghai, 200032, China
| | - Gang Wei
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center of Genetics and Development, Human Phenome Institute, School of Life Sciences and Huashan Hospital, Fudan University, Shanghai, 200438, China.,MOE Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, 200438, China
| | - Mingqing Xu
- Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders (Ministry of Education), Collaborative Innovation Center of Genetics and Development, Shanghai Jiao Tong University, Shanghai, 200030, China
| | - Zhixi Su
- Singlera Genomics (Shanghai) Limited, Shanghai, 201318, China
| | - Chen Zhao
- Eye Institute, Eye & ENT Hospital, Shanghai Medical College, Fudan University, Shanghai, 200031, China.,NHC Key Laboratory of Myopia (Fudan University), Key Laboratory of Myopia, Chinese Academy of Medical Sciences, and Shanghai Key Laboratory of Visual Impairment and Restoration (Fudan University), Shanghai, 200031, China
| | - Ting Ni
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center of Genetics and Development, Human Phenome Institute, School of Life Sciences and Huashan Hospital, Fudan University, Shanghai, 200438, China.,Shanghai Engineering Research Center of Industrial Microorganisms, School of Life Sciences, Fudan University, Shanghai, 200438, China
| |
Collapse
|
7
|
Yu H, Dai Z. SANPolyA: a deep learning method for identifying Poly(A) signals. Bioinformatics 2020; 36:2393-2400. [PMID: 31904817 DOI: 10.1093/bioinformatics/btz970] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2019] [Revised: 12/05/2019] [Accepted: 01/01/2020] [Indexed: 12/21/2022] Open
Abstract
MOTIVATION Polyadenylation plays a regulatory role in transcription. The recognition of polyadenylation signal (PAS) motif sequence is an important step in polyadenylation. In the past few years, some statistical machine learning-based and deep learning-based methods have been proposed for PAS identification. Although these methods predict PAS with success, there is room for their improvement on PAS identification. RESULTS In this study, we proposed a deep neural network-based computational method, called SANPolyA, for identifying PAS in human and mouse genomes. SANPolyA requires no manually crafted sequence features. We compared our method SANPolyA with several previous PAS identification methods on several PAS benchmark datasets. Our results showed that SANPolyA outperforms the state-of-art methods. SANPolyA also showed good performance on leave-one-motif-out evaluation. AVAILABILITY AND IMPLEMENTATION https://github.com/yuht4/SANPolyA. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
| | - Zhiming Dai
- School of Data and Computer Science.,Guangdong Province Key Laboratory of Big Data Analysis and Processing, Sun Yat-Sen University, Guangzhou 510006, China
| |
Collapse
|
8
|
Nourse J, Spada S, Danckwardt S. Emerging Roles of RNA 3'-end Cleavage and Polyadenylation in Pathogenesis, Diagnosis and Therapy of Human Disorders. Biomolecules 2020; 10:biom10060915. [PMID: 32560344 PMCID: PMC7356254 DOI: 10.3390/biom10060915] [Citation(s) in RCA: 36] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2020] [Revised: 06/10/2020] [Accepted: 06/13/2020] [Indexed: 12/11/2022] Open
Abstract
A crucial feature of gene expression involves RNA processing to produce 3′ ends through a process termed 3′ end cleavage and polyadenylation (CPA). This ensures the nascent RNA molecule can exit the nucleus and be translated to ultimately give rise to a protein which can execute a function. Further, alternative polyadenylation (APA) can produce distinct transcript isoforms, profoundly expanding the complexity of the transcriptome. CPA is carried out by multi-component protein complexes interacting with multiple RNA motifs and is tightly coupled to transcription, other steps of RNA processing, and even epigenetic modifications. CPA and APA contribute to the maintenance of a multitude of diverse physiological processes. It is therefore not surprising that disruptions of CPA and APA can lead to devastating disorders. Here, we review potential CPA and APA mechanisms involving both loss and gain of function that can have tremendous impacts on health and disease. Ultimately we highlight the emerging diagnostic and therapeutic potential CPA and APA offer.
Collapse
Affiliation(s)
- Jamie Nourse
- Institute for Clinical Chemistry and Laboratory Medicine, University Medical Center of the Johannes Gutenberg University, 55131 Mainz, Germany; (J.N.); (S.S.)
- Center for Thrombosis and Hemostasis (CTH), University Medical Center of the Johannes Gutenberg University, 55131 Mainz, Germany
| | - Stefano Spada
- Institute for Clinical Chemistry and Laboratory Medicine, University Medical Center of the Johannes Gutenberg University, 55131 Mainz, Germany; (J.N.); (S.S.)
- Center for Thrombosis and Hemostasis (CTH), University Medical Center of the Johannes Gutenberg University, 55131 Mainz, Germany
| | - Sven Danckwardt
- Institute for Clinical Chemistry and Laboratory Medicine, University Medical Center of the Johannes Gutenberg University, 55131 Mainz, Germany; (J.N.); (S.S.)
- Center for Thrombosis and Hemostasis (CTH), University Medical Center of the Johannes Gutenberg University, 55131 Mainz, Germany
- German Center for Cardiovascular Research (DZHK), Rhine-Main, Germany
- Correspondence:
| |
Collapse
|
9
|
Sun Y, Hamilton K, Tong L. Recent molecular insights into canonical pre-mRNA 3'-end processing. Transcription 2020; 11:83-96. [PMID: 32522085 DOI: 10.1080/21541264.2020.1777047] [Citation(s) in RCA: 39] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023] Open
Abstract
The majority of eukaryotic messenger RNA precursors (pre-mRNAs) undergo cleavage and polyadenylation at their 3' end. This canonical 3'-end processing depends on sequence elements in the pre-mRNA as well as a mega-dalton protein machinery. The cleavage site in mammalian pre-mRNAs is located between an upstream poly(A) signal, most frequently an AAUAAA hexamer, and a GU-rich downstream sequence element. This review will summarize recent advances from the studies on this canonical 3'-end processing machinery. They have revealed the molecular mechanism for the recognition of the poly(A) signal and provided the first glimpse into the overall architecture of the machinery. The studies also show that the machinery is highly dynamic conformationally, and extensive re-arrangements are necessary for its activation. Inhibitors targeting the active site of the CPSF73 nuclease of this machinery have anti-cancer, anti-inflammatory and anti-protozoal effects, indicating that CPSF73 and pre-mRNA 3'-end processing in general are attractive targets for drug discovery. ABBREVIATIONS APA: alternative polyadenylation; β-CASP: metallo-β-lactamase-associated CPSF Artemis SNM1/PSO2; CTD: C-terminal domain; CF: cleavage factor; CPF: cleavage and polyadenylation factor; CPSF: cleavage and polyadenylation specificity factor; CstF: cleavage stimulation factor; DSE: downstream element; HAT: half a TPR; HCC: histone pre-mRNA cleavage complex; mCF: mammalian cleavage factor; mPSF: mammalian polyadenylation specificity factor; mRNA: messenger RNA; nt: nucleotide; NTD: N-terminal domain; PAP: polyadenylate polymerase; PAS: polyadenylation signal; PIM: mPSF interaction motif; Poly(A): polyadenylation, polyadenylate; Pol II: RNA polymerase II; pre-mRNA: messenger RNA precursor; RRM: RNA recognition module, RNA recognition motif; snRNP: small nuclear ribonucleoprotein; TPR: tetratricopeptide repeat; UTR: untranslated region; ZF: zinc finger.
Collapse
Affiliation(s)
- Yadong Sun
- Department of Biological Sciences, Columbia University , New York, NY, USA
| | - Keith Hamilton
- Department of Biological Sciences, Columbia University , New York, NY, USA
| | - Liang Tong
- Department of Biological Sciences, Columbia University , New York, NY, USA
| |
Collapse
|
10
|
The C. elegans 3' UTRome v2 resource for studying mRNA cleavage and polyadenylation, 3'-UTR biology, and miRNA targeting. Genome Res 2019; 29:2104-2116. [PMID: 31744903 PMCID: PMC6886508 DOI: 10.1101/gr.254839.119] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2019] [Accepted: 10/10/2019] [Indexed: 12/11/2022]
Abstract
3′ Untranslated regions (3′ UTRs) of mRNAs emerged as central regulators of cellular function because they contain important but poorly characterized cis-regulatory elements targeted by a multitude of regulatory factors. The model nematode Caenorhabditis elegans is ideal to study these interactions because it possesses a well-defined 3′ UTRome. To improve its annotation, we have used a genome-wide bioinformatics approach to download raw transcriptome data for 1088 transcriptome data sets corresponding to the entire collection of C. elegans trancriptomes from 2015 to 2018 from the Sequence Read Archive at the NCBI. We then extracted and mapped high-quality 3′-UTR data at ultradeep coverage. Here, we describe and release to the community the updated version of the worm 3′ UTRome, which we named 3′ UTRome v2. This resource contains high-quality 3′-UTR data mapped at single-base ultraresolution for 23,084 3′-UTR isoform variants corresponding to 14,788 protein-coding genes and is updated to the latest release of WormBase. We used this data set to study and probe principles of mRNA cleavage and polyadenylation in C. elegans. The worm 3′ UTRome v2 represents the most comprehensive and high-resolution 3′-UTR data set available in C. elegans and provides a novel resource to investigate the mRNA cleavage and polyadenylation reaction, 3′-UTR biology, and miRNA targeting in a living organism.
Collapse
|
11
|
Bogard N, Linder J, Rosenberg AB, Seelig G. A Deep Neural Network for Predicting and Engineering Alternative Polyadenylation. Cell 2019; 178:91-106.e23. [PMID: 31178116 PMCID: PMC6599575 DOI: 10.1016/j.cell.2019.04.046] [Citation(s) in RCA: 102] [Impact Index Per Article: 20.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2018] [Revised: 03/18/2019] [Accepted: 04/29/2019] [Indexed: 12/22/2022]
Abstract
Alternative polyadenylation (APA) is a major driver of transcriptome diversity in human cells. Here, we use deep learning to predict APA from DNA sequence alone. We trained our model (APARENT, APA REgression NeT) on isoform expression data from over 3 million APA reporters. APARENT's predictions are highly accurate when tasked with inferring APA in synthetic and human 3'UTRs. Visualizing features learned across all network layers reveals that APARENT recognizes sequence motifs known to recruit APA regulators, discovers previously unknown sequence determinants of 3' end processing, and integrates these features into a comprehensive, interpretable, cis-regulatory code. We apply APARENT to forward engineer functional polyadenylation signals with precisely defined cleavage position and isoform usage and validate predictions experimentally. Finally, we use APARENT to quantify the impact of genetic variants on APA. Our approach detects pathogenic variants in a wide range of disease contexts, expanding our understanding of the genetic origins of disease.
Collapse
Affiliation(s)
- Nicholas Bogard
- Department of Electrical & Computer Engineering, University of Washington, Seattle, WA 98195, USA
| | - Johannes Linder
- Paul G. Allen School of Computer Science & Engineering, University of Washington, Seattle, WA 98195, USA
| | - Alexander B Rosenberg
- Department of Electrical & Computer Engineering, University of Washington, Seattle, WA 98195, USA
| | - Georg Seelig
- Department of Electrical & Computer Engineering, University of Washington, Seattle, WA 98195, USA; Paul G. Allen School of Computer Science & Engineering, University of Washington, Seattle, WA 98195, USA.
| |
Collapse
|
12
|
A comprehensive analysis of core polyadenylation sequences and regulation by microRNAs in a set of cancer predisposition genes. Gene 2019; 712:143943. [PMID: 31229581 DOI: 10.1016/j.gene.2019.143943] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2019] [Revised: 06/18/2019] [Accepted: 06/20/2019] [Indexed: 12/27/2022]
Abstract
Two core polyadenylation elements (CPE) located in the 3' untranslated region of eukaryotic pre-mRNAs play an essential role in their processing: the polyadenylation signal (PAS) AAUAAA and the cleavage site (CS), preferentially a CA dinucleotide. Herein, we characterized PAS and CS sequences in a set of cancer predisposition genes (CPGs) and performed an in silico investigation of microRNAs (miRNAs) regulation to identify potential tumor-suppressive and oncogenic miRNAs. NCBI and alternative polyadenylation databases were queried to characterize CPE sequences in 117 CPGs, including 81 and 17 known tumor suppressor genes and oncogenes, respectively. miRNA-mediated regulation analysis was performed using predicted and validated data sources. Based on NCBI analyses, we did not find an established PAS in 21 CPGs, and verified that the majority of PAS already described (74.4%) had the canonical sequence AAUAAA. Interestingly, "AA" dinucleotide was the most common CS (37.5%) associated with this set of genes. Approximately 90% of CPGs exhibited evidence of alternative polyadenylation (more than one functional PAS). Finally, the mir-192 family was significantly overrepresented as regulator of tumor suppressor genes (P < 0.01), which suggests a potential oncogenic function. Overall, this study provides a landscape of CPE in CPGs, which might be useful in development of future molecular analyses covering these frequently neglected regulatory sequences.
Collapse
|
13
|
Wang R, Zheng D, Yehia G, Tian B. A compendium of conserved cleavage and polyadenylation events in mammalian genes. Genome Res 2018; 28:1427-1441. [PMID: 30143597 PMCID: PMC6169888 DOI: 10.1101/gr.237826.118] [Citation(s) in RCA: 62] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2018] [Accepted: 08/08/2018] [Indexed: 12/22/2022]
Abstract
Cleavage and polyadenylation is essential for 3' end processing of almost all eukaryotic mRNAs. Recent studies have shown widespread alternative cleavage and polyadenylation (APA) events leading to mRNA isoforms with different 3' UTRs and/or coding sequences. Here, we present a compendium of conserved cleavage and polyadenylation sites (PASs) in mammalian genes, based on approximately 1.2 billion 3' end sequencing reads from more than 360 human, mouse, and rat samples. We show that ∼80% of mammalian mRNA genes contain at least one conserved PAS, and ∼50% have conserved APA events. PAS conservation generally reduces promiscuous 3' end processing, stabilizing gene expression levels across species. Conservation of APA correlates with gene age, gene expression features, and gene functions. Genes with certain functions, such as cell morphology, cell proliferation, and mRNA metabolism, are particularly enriched with conserved APA events. Whereas tissue-specific genes typically have a low APA rate, brain-specific genes tend to evolve APA. In addition, we show enrichment of mRNA destabilizing motifs in alternative 3' UTR sequences, leading to substantial differences in mRNA stability between 3' UTR isoforms. Using conserved PASs, we reveal sequence motifs surrounding APA sites and a preference of adenosine at the cleavage site. Furthermore, we show that mutations of U-rich motifs around the PAS often accompany APA profile differences between species. Analysis of lncRNA PASs indicates a mechanism of PAS fixation through evolution of A-rich motifs. Taken together, our results present a comprehensive view of PAS evolution in mammals, and a phylogenic perspective on APA functions.
Collapse
Affiliation(s)
- Ruijia Wang
- Department of Microbiology, Biochemistry and Molecular Genetics, Rutgers New Jersey Medical School, Newark, New Jersey 07103, USA
- Rutgers Cancer Institute of New Jersey, Newark, New Jersey 07103, USA
| | - Dinghai Zheng
- Department of Microbiology, Biochemistry and Molecular Genetics, Rutgers New Jersey Medical School, Newark, New Jersey 07103, USA
- Rutgers Cancer Institute of New Jersey, Newark, New Jersey 07103, USA
| | - Ghassan Yehia
- Genome Editing Core Facility, Rutgers University, New Brunswick, New Jersey 08901, USA
| | - Bin Tian
- Department of Microbiology, Biochemistry and Molecular Genetics, Rutgers New Jersey Medical School, Newark, New Jersey 07103, USA
- Rutgers Cancer Institute of New Jersey, Newark, New Jersey 07103, USA
| |
Collapse
|
14
|
Wang R, Zheng D, Yehia G, Tian B. A compendium of conserved cleavage and polyadenylation events in mammalian genes. Genome Res 2018. [PMID: 30143597 DOI: 10.1101/gr.237826.118.28] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/17/2023]
Abstract
Cleavage and polyadenylation is essential for 3' end processing of almost all eukaryotic mRNAs. Recent studies have shown widespread alternative cleavage and polyadenylation (APA) events leading to mRNA isoforms with different 3' UTRs and/or coding sequences. Here, we present a compendium of conserved cleavage and polyadenylation sites (PASs) in mammalian genes, based on approximately 1.2 billion 3' end sequencing reads from more than 360 human, mouse, and rat samples. We show that ∼80% of mammalian mRNA genes contain at least one conserved PAS, and ∼50% have conserved APA events. PAS conservation generally reduces promiscuous 3' end processing, stabilizing gene expression levels across species. Conservation of APA correlates with gene age, gene expression features, and gene functions. Genes with certain functions, such as cell morphology, cell proliferation, and mRNA metabolism, are particularly enriched with conserved APA events. Whereas tissue-specific genes typically have a low APA rate, brain-specific genes tend to evolve APA. In addition, we show enrichment of mRNA destabilizing motifs in alternative 3' UTR sequences, leading to substantial differences in mRNA stability between 3' UTR isoforms. Using conserved PASs, we reveal sequence motifs surrounding APA sites and a preference of adenosine at the cleavage site. Furthermore, we show that mutations of U-rich motifs around the PAS often accompany APA profile differences between species. Analysis of lncRNA PASs indicates a mechanism of PAS fixation through evolution of A-rich motifs. Taken together, our results present a comprehensive view of PAS evolution in mammals, and a phylogenic perspective on APA functions.
Collapse
Affiliation(s)
- Ruijia Wang
- Department of Microbiology, Biochemistry and Molecular Genetics, Rutgers New Jersey Medical School, Newark, New Jersey 07103, USA
- Rutgers Cancer Institute of New Jersey, Newark, New Jersey 07103, USA
| | - Dinghai Zheng
- Department of Microbiology, Biochemistry and Molecular Genetics, Rutgers New Jersey Medical School, Newark, New Jersey 07103, USA
- Rutgers Cancer Institute of New Jersey, Newark, New Jersey 07103, USA
| | - Ghassan Yehia
- Genome Editing Core Facility, Rutgers University, New Brunswick, New Jersey 08901, USA
| | - Bin Tian
- Department of Microbiology, Biochemistry and Molecular Genetics, Rutgers New Jersey Medical School, Newark, New Jersey 07103, USA
- Rutgers Cancer Institute of New Jersey, Newark, New Jersey 07103, USA
| |
Collapse
|
15
|
Targeting the Polyadenylation Signal of Pre-mRNA: A New Gene Silencing Approach for Facioscapulohumeral Dystrophy. Int J Mol Sci 2018; 19:ijms19051347. [PMID: 29751519 PMCID: PMC5983732 DOI: 10.3390/ijms19051347] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2018] [Revised: 04/27/2018] [Accepted: 04/30/2018] [Indexed: 02/07/2023] Open
Abstract
Facioscapulohumeral dystrophy (FSHD) is characterized by the contraction of the D4Z4 array located in the sub-telomeric region of the chromosome 4, leading to the aberrant expression of the DUX4 transcription factor and the mis-regulation of hundreds of genes. Several therapeutic strategies have been proposed among which the possibility to target the polyadenylation signal to silence the causative gene of the disease. Indeed, defects in mRNA polyadenylation leads to an alteration of the transcription termination, a disruption of mRNA transport from the nucleus to the cytoplasm decreasing the mRNA stability and translation efficiency. This review discusses the polyadenylation mechanisms, why alternative polyadenylation impacts gene expression, and how targeting polyadenylation signal may be a potential therapeutic approach for FSHD.
Collapse
|
16
|
Ha KCH, Blencowe BJ, Morris Q. QAPA: a new method for the systematic analysis of alternative polyadenylation from RNA-seq data. Genome Biol 2018; 19:45. [PMID: 29592814 PMCID: PMC5874996 DOI: 10.1186/s13059-018-1414-4] [Citation(s) in RCA: 115] [Impact Index Per Article: 19.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2017] [Accepted: 02/28/2018] [Indexed: 12/21/2022] Open
Abstract
Alternative polyadenylation (APA) affects most mammalian genes. The genome-wide investigation of APA has been hampered by an inability to reliably profile it using conventional RNA-seq. We describe 'Quantification of APA' (QAPA), a method that infers APA from conventional RNA-seq data. QAPA is faster and more sensitive than other methods. Application of QAPA reveals discrete, temporally coordinated APA programs during neurogenesis and that there is little overlap between genes regulated by alternative splicing and those by APA. Modeling of these data uncovers an APA sequence code. QAPA thus enables the discovery and characterization of programs of regulated APA using conventional RNA-seq.
Collapse
Affiliation(s)
- Kevin C H Ha
- Department of Molecular Genetics, University of Toronto, 1 King's College Circle, Toronto, ON, M5A 1A8, Canada.,Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, 160 College Street, Toronto, ON, M5S 3E1, Canada
| | - Benjamin J Blencowe
- Department of Molecular Genetics, University of Toronto, 1 King's College Circle, Toronto, ON, M5A 1A8, Canada. .,Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, 160 College Street, Toronto, ON, M5S 3E1, Canada.
| | - Quaid Morris
- Department of Molecular Genetics, University of Toronto, 1 King's College Circle, Toronto, ON, M5A 1A8, Canada. .,Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, 160 College Street, Toronto, ON, M5S 3E1, Canada. .,Department of Computer Science, University of Toronto, 10 King's College Road, Toronto, ON, M5S 3G4, Canada. .,Vector Institute, 661 University Avenue, Toronto, ON, M5G 1M1, Canada.
| |
Collapse
|
17
|
Magana-Mora A, Kalkatawi M, Bajic VB. Omni-PolyA: a method and tool for accurate recognition of Poly(A) signals in human genomic DNA. BMC Genomics 2017; 18:620. [PMID: 28810905 PMCID: PMC5558757 DOI: 10.1186/s12864-017-4033-7] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2017] [Accepted: 08/07/2017] [Indexed: 01/06/2023] Open
Abstract
BACKGROUND Polyadenylation is a critical stage of RNA processing during the formation of mature mRNA, and is present in most of the known eukaryote protein-coding transcripts and many long non-coding RNAs. The correct identification of poly(A) signals (PAS) not only helps to elucidate the 3'-end genomic boundaries of a transcribed DNA region and gene regulatory mechanisms but also gives insight into the multiple transcript isoforms resulting from alternative PAS. Although progress has been made in the in-silico prediction of genomic signals, the recognition of PAS in DNA genomic sequences remains a challenge. RESULTS In this study, we analyzed human genomic DNA sequences for the 12 most common PAS variants. Our analysis has identified a set of features that helps in the recognition of true PAS, which may be involved in the regulation of the polyadenylation process. The proposed features, in combination with a recognition model, resulted in a novel method and tool, Omni-PolyA. Omni-PolyA combines several machine learning techniques such as different classifiers in a tree-like decision structure and genetic algorithms for deriving a robust classification model. We performed a comparison between results obtained by state-of-the-art methods, deep neural networks, and Omni-PolyA. Results show that Omni-PolyA significantly reduced the average classification error rate by 35.37% in the prediction of the 12 considered PAS variants relative to the state-of-the-art results. CONCLUSIONS The results of our study demonstrate that Omni-PolyA is currently the most accurate model for the prediction of PAS in human and can serve as a useful complement to other PAS recognition methods. Omni-PolyA is publicly available as an online tool accessible at www.cbrc.kaust.edu.sa/omnipolya/ .
Collapse
Affiliation(s)
- Arturo Magana-Mora
- Computational Bioscience Research Center, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
| | - Manal Kalkatawi
- Computational Bioscience Research Center, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
| | - Vladimir B Bajic
- Computational Bioscience Research Center, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia.
| |
Collapse
|
18
|
Wu X, Bartel DP. Widespread Influence of 3'-End Structures on Mammalian mRNA Processing and Stability. Cell 2017; 169:905-917.e11. [PMID: 28525757 DOI: 10.1016/j.cell.2017.04.036] [Citation(s) in RCA: 93] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2017] [Revised: 03/13/2017] [Accepted: 04/25/2017] [Indexed: 11/28/2022]
Abstract
The physiological relevance of structures within mammalian mRNAs has been elusive, as these mRNAs are less folded in cells than in vitro and have predicted secondary structures no more stable than those of random sequences. Here, we investigate the possibility that mRNA structures facilitate the 3'-end processing of thousands of human mRNAs by juxtaposing poly(A) signals (PASs) and cleavage sites that are otherwise too far apart. We find that RNA structures are predicted to be more prevalent within these extended 3'-end regions than within PAS-upstream regions and indeed are substantially more folded within cells, as determined by intracellular probing. Analyses of thousands of ectopically expressed variants demonstrate that this folding both enhances processing and increases mRNA metabolic stability. Even folds with predicted stabilities resembling those of random sequences can enhance processing. Structure-controlled processing can also regulate neighboring gene expression. Thus, RNA structure has widespread roles in mammalian mRNA biogenesis and metabolism.
Collapse
Affiliation(s)
- Xuebing Wu
- Howard Hughes Medical Institute and Whitehead Institute for Biomedical Research, Cambridge, MA 02142, USA; Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - David P Bartel
- Howard Hughes Medical Institute and Whitehead Institute for Biomedical Research, Cambridge, MA 02142, USA; Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.
| |
Collapse
|
19
|
Peart N, Wagner EJ. A distal auxiliary element facilitates cleavage and polyadenylation of Dux4 mRNA in the pathogenic haplotype of FSHD. Hum Genet 2017; 136:1291-1301. [PMID: 28540412 DOI: 10.1007/s00439-017-1813-8] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2017] [Accepted: 05/14/2017] [Indexed: 01/24/2023]
Abstract
The degenerative muscle disorder facioscapulohumeral dystrophy (FSHD) is thought to be caused by the inappropriate expression of the Double Homeobox 4 (Dux4) protein in muscle cells leading to apoptosis. Expression of Dux4 in the major form of FSHD is a function of two contributing molecular changes: contractions in the D4Z4 microsatellite repeat region where Dux4 is located and an SNP present within a region downstream of the D4Z4. This SNP provides a functional, yet non-consensus polyadenylation signal (PAS) is used for the Dux4 mRNA 3' end processing. Surprisingly, the sequences flanking the Dux4 PAS do not resemble a typical cleavage and polyadenylation landscape with no recognizable downstream sequence element and a suboptimal cleavage site. Here, we conducted a systematic analysis of the cis-acting elements that govern Dux4 cleavage and polyadenylation. Using a transcriptional read-through reporter, we determined that sequences downstream of the SNP located within the β-satellite region are critical for Dux4 cleavage and polyadenylation. We also demonstrate the feasibility of using antisense oligonucleotides to target these sequences as a means to reduce Dux4 expression. Our results underscore the complexity of the region immediately downstream of the D4Z4 and uncover a previously unknown function for the β-satellite region in Dux4 cleavage and polyadenylation.
Collapse
Affiliation(s)
- Natoya Peart
- Department of Biochemistry and Molecular Biology, The University of Texas Medical Branch at Galveston, Galveston, USA
- Graduate Program in Biochemistry and Molecular Biology, The University of Texas Graduate School of Biomedical Sciences, Houston, TX, USA
- Department of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Eric J Wagner
- Department of Biochemistry and Molecular Biology, The University of Texas Medical Branch at Galveston, Galveston, USA.
| |
Collapse
|
20
|
Ustyantsev IG, Golubchikova JS, Borodulina OR, Kramerov DA. Canonical and noncanonical RNA polyadenylation. Mol Biol 2017. [DOI: 10.1134/s0026893317010186] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
21
|
Rangel L, Lospitao E, Ruiz-Sáenz A, Alonso MA, Correas I. Alternative polyadenylation in a family of paralogous EPB41 genes generates protein 4.1 diversity. RNA Biol 2016; 14:236-244. [PMID: 27981895 DOI: 10.1080/15476286.2016.1270003] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Alternative polyadenylation (APA) is a step in mRNA 3'-end processing that contributes to the complexity of the transcriptome by generating isoforms that differ in either their coding sequence or their 3'-untranslated regions (UTRs). The EPB41 genes, EPB41, EPB41L2, EPB41L3 and EPB41L1, encode an impressively complex array of structural adaptor proteins (designated 4.1R, 4.1G, 4.1B and 4.1N, respectively) by using alternative transcriptional promoters and tissue-specific alternative pre-mRNA splicing. The great variety of 4.1 proteins mainly results from 5'-end and internal processing of the EPB41 pre-mRNAs. Thus, 4.1 proteins can vary in their N-terminal extensions but all contain a highly homologous C-terminal domain (CTD). Here we study a new group of EPB41-related mRNAs that originate by APA and lack the exons encoding the CTD characteristic of prototypical 4.1 proteins, thereby encoding a new type of 4.1 protein. For the EPB41 gene, this type of processing was observed in all 11 human tissues analyzed. Comparative genomic analysis of EPB41 indicates that APA is conserved in various mammals. In addition, we show that APA also functions for the EPB41L2, EPB41L3 and EPB41L1 genes, but in a more restricted manner in the case of the latter 2 than it does for the EPB41 and EPB41L2 genes. Our study shows alternative polyadenylation to be an additional mechanism for the generation of 4.1 protein diversity in the already complex EPB41-related genes. Understanding the diversity of EPB41 RNA processing is essential for a full appreciation of the many 4.1 proteins expressed in normal and pathological tissues.
Collapse
Affiliation(s)
- Laura Rangel
- a Departamento de Biología Molecular , Universidad Autónoma de Madrid (UAM), Centro de Biología Molecular Severo Ochoa, Consejo Superior de Investigaciones Científicas (CSIC), Nicolás Cabrera , Cantoblanco, Madrid , Spain
| | - Eva Lospitao
- a Departamento de Biología Molecular , Universidad Autónoma de Madrid (UAM), Centro de Biología Molecular Severo Ochoa, Consejo Superior de Investigaciones Científicas (CSIC), Nicolás Cabrera , Cantoblanco, Madrid , Spain
| | - Ana Ruiz-Sáenz
- a Departamento de Biología Molecular , Universidad Autónoma de Madrid (UAM), Centro de Biología Molecular Severo Ochoa, Consejo Superior de Investigaciones Científicas (CSIC), Nicolás Cabrera , Cantoblanco, Madrid , Spain
| | - Miguel A Alonso
- a Departamento de Biología Molecular , Universidad Autónoma de Madrid (UAM), Centro de Biología Molecular Severo Ochoa, Consejo Superior de Investigaciones Científicas (CSIC), Nicolás Cabrera , Cantoblanco, Madrid , Spain
| | - Isabel Correas
- a Departamento de Biología Molecular , Universidad Autónoma de Madrid (UAM), Centro de Biología Molecular Severo Ochoa, Consejo Superior de Investigaciones Científicas (CSIC), Nicolás Cabrera , Cantoblanco, Madrid , Spain
| |
Collapse
|
22
|
Characterization of the Role of Hexamer AGUAAA and Poly(A) Tail in Coronavirus Polyadenylation. PLoS One 2016; 11:e0165077. [PMID: 27760233 PMCID: PMC5070815 DOI: 10.1371/journal.pone.0165077] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2016] [Accepted: 10/05/2016] [Indexed: 01/21/2023] Open
Abstract
Similar to eukaryotic mRNA, the positive-strand coronavirus genome of ~30 kilobases is 5’-capped and 3’-polyadenylated. It has been demonstrated that the length of the coronaviral poly(A) tail is not static but regulated during infection; however, little is known regarding the factors involved in coronaviral polyadenylation and its regulation. Here, we show that during infection, the level of coronavirus poly(A) tail lengthening depends on the initial length upon infection and that the minimum length to initiate lengthening may lie between 5 and 9 nucleotides. By mutagenesis analysis, it was found that (i) the hexamer AGUAAA and poly(A) tail are two important elements responsible for synthesis of the coronavirus poly(A) tail and may function in concert to accomplish polyadenylation and (ii) the function of the hexamer AGUAAA in coronaviral polyadenylation is position dependent. Based on these findings, we propose a process for how the coronaviral poly(A) tail is synthesized and undergoes variation. Our results provide the first genetic evidence to gain insight into coronaviral polyadenylation.
Collapse
|
23
|
Ogorodnikov A, Kargapolova Y, Danckwardt S. Processing and transcriptome expansion at the mRNA 3' end in health and disease: finding the right end. Pflugers Arch 2016; 468:993-1012. [PMID: 27220521 PMCID: PMC4893057 DOI: 10.1007/s00424-016-1828-3] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2016] [Accepted: 04/19/2016] [Indexed: 01/09/2023]
Abstract
The human transcriptome is highly dynamic, with each cell type, tissue, and organ system expressing an ensemble of transcript isoforms that give rise to considerable diversity. Apart from alternative splicing affecting the "body" of the transcripts, extensive transcriptome diversification occurs at the 3' end. Transcripts differing at the 3' end can have profound physiological effects by encoding proteins with distinct functions or regulatory properties or by affecting the mRNA fate via the inclusion or exclusion of regulatory elements (such as miRNA or protein binding sites). Importantly, the dynamic regulation at the 3' end is associated with various (patho)physiological processes, including the immune regulation but also tumorigenesis. Here, we recapitulate the mechanisms of constitutive mRNA 3' end processing and review the current understanding of the dynamically regulated diversity at the transcriptome 3' end. We illustrate the medical importance by presenting examples that are associated with perturbations of this process and indicate resulting implications for molecular diagnostics as well as potentially arising novel therapeutic strategies.
Collapse
Affiliation(s)
- Anton Ogorodnikov
- Institute for Clinical Chemistry and Laboratory Medicine, University Medical Center Mainz, Langenbeckstr 1, 55131, Mainz, Germany
- Center for Thrombosis and Hemostasis (CTH), University Medical Center Mainz, Langenbeckstr 1, 55131, Mainz, Germany
| | - Yulia Kargapolova
- Institute for Clinical Chemistry and Laboratory Medicine, University Medical Center Mainz, Langenbeckstr 1, 55131, Mainz, Germany
- Center for Thrombosis and Hemostasis (CTH), University Medical Center Mainz, Langenbeckstr 1, 55131, Mainz, Germany
| | - Sven Danckwardt
- Institute for Clinical Chemistry and Laboratory Medicine, University Medical Center Mainz, Langenbeckstr 1, 55131, Mainz, Germany.
- Center for Thrombosis and Hemostasis (CTH), University Medical Center Mainz, Langenbeckstr 1, 55131, Mainz, Germany.
- German Center for Cardiovascular Research (DZHK), Langenbeckstr 1, 55131, Mainz, Germany.
| |
Collapse
|
24
|
Ni T, Majerciak V, Zheng ZM, Zhu J. PA-seq for Global Identification of RNA Polyadenylation Sites of Kaposi's Sarcoma-Associated Herpesvirus Transcripts. ACTA ACUST UNITED AC 2016; 41:14E.7.1-14E.7.18. [PMID: 27153384 DOI: 10.1002/cpmc.1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
Kaposi's sarcoma-associated herpesvirus (KSHV) is a human oncovirus linked to the development of several malignancies in immunocompromised patients. Like other herpesviruses, KSHV has a large DNA genome encoding more than 100 distinct gene products. Despite being transcribed and processed by cellular machinery, the structure and organization of KSHV genes in the virus genome differ from what is observed in cellular genes from the human genome. A typical feature of KSHV expression is the production of polycistronic transcripts initiated from different promoters but sharing the same polyadenylation site (pA site). This represents a challenge in determination of the 3' end of individual viral transcripts. Such information is critical for generation of a virus transcriptional map for genetic studies. Here we present PA-seq, a high-throughput method for genome-wide analysis of pA sites of KSHV transcripts in B lymphocytes with latent or lytic KSHV infection. Besides identification of all viral pA sites, PA-seq also provides quantitative information about the levels of viral transcripts associated with each pA site, making it possible to determine the relative expression levels of viral genes at various stages of infection. Due to the indiscriminate nature of PA-seq, the pA sites of host transcripts are also concurrently mapped in the testing samples. Therefore, this technology can simultaneously estimate the expression changes of host genes and RNA polyadenylation upon KSHV infection. © 2016 by John Wiley & Sons, Inc.
Collapse
Affiliation(s)
- Ting Ni
- Ministry of Education (MOE) Key Laboratory of Contemporary Anthropology and State Key Laboratory of Genetic Engineering, Collaborative Innovation Center of Genetics and Development, School of Life Sciences, Fudan University, Shanghai, People's Republic of China.,These authors should be considered co-first authors
| | - Vladimir Majerciak
- Tumor Virus RNA Biology Section, Gene Regulation and Chromosome Biology Laboratory, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Frederick, Maryland.,These authors should be considered co-first authors
| | - Zhi-Ming Zheng
- Tumor Virus RNA Biology Section, Gene Regulation and Chromosome Biology Laboratory, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Frederick, Maryland
| | - Jun Zhu
- Systems Biology Center, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, Maryland.,Corresponding author
| |
Collapse
|
25
|
Alenina N, Böhme I, Bader M, Walther T. Multiple non-coding exons and alternative splicing in the mouse Mas protooncogene. Gene 2015; 568:155-64. [PMID: 26003294 DOI: 10.1016/j.gene.2015.05.043] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2015] [Revised: 04/23/2015] [Accepted: 05/16/2015] [Indexed: 10/23/2022]
Abstract
The Mas protooncogene encodes a G protein-coupled receptor with the common seven transmembrane domains, expressed mainly in the testis and brain. We provided evidence that Mas is a functional angiotensin-(1-7) receptor and can interact with the angiotensin II type 1 (AT1) receptor. The gene is transcriptionally regulated during development in the brain and testis, but its structure was unresolved. In this study we used 5'- and 3'-RACE, RT-PCR, and RNase-protection assays to elucidate the complete Mas gene structure and organization. We identified 12 exons in the mouse Mas gene with 11 in the 5' untranslated mRNA, which can be alternatively spliced. We also showed that Mas transcription can start from 4 tissue-specific promoters, whereby testis-specific Mas mRNA is transcribed from two upstream promoters, and the expression of Mas in the brain starts from two downstream promoters. Alternative splicing and multiple promoter usage result in at least 12 Mas transcripts in which different 5' untranslated regions are fused to a common coding sequence. Moreover, termination of Mas mRNA is regulated by two different polyadenylation signals. The gene spans approximately 27 kb, and the longest detected mRNA contains 2,451 bp. Thus, our results characterize the Mas protooncogene as the gene with the most complex gene structure of all described members of the gene family coding for G protein-coupled receptors.
Collapse
Affiliation(s)
- Natalia Alenina
- Max-Delbrück-Center for Molecular Medicine (MDC), Robert-Rössle-Straße 10, 13092 Berlin-Buch, Germany; Federal University of Minas Gerais (UFMG), ICB, 6627 Belo Horizonte, MG, Brasil
| | - Ilka Böhme
- Centre for Perinatal Medicine, University Medical Centre Leipzig, Liebigstraße 20a, 04103 Leipzig, Germany
| | - Michael Bader
- Max-Delbrück-Center for Molecular Medicine (MDC), Robert-Rössle-Straße 10, 13092 Berlin-Buch, Germany; Federal University of Minas Gerais (UFMG), ICB, 6627 Belo Horizonte, MG, Brasil; Charité University Medicine Berlin, Charitéplatz 1, 10117 Berlin, Germany
| | - Thomas Walther
- Centre for Perinatal Medicine, University Medical Centre Leipzig, Liebigstraße 20a, 04103 Leipzig, Germany; Department of Pharmacology and Therapeutics, 2nd Floor, Western Road, University College Cork, Cork, Ireland.
| |
Collapse
|
26
|
Hollerer I, Grund K, Hentze MW, Kulozik AE. mRNA 3'end processing: A tale of the tail reaches the clinic. EMBO Mol Med 2014; 6:16-26. [PMID: 24408965 PMCID: PMC3936486 DOI: 10.1002/emmm.201303300] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
Recent advances reveal mRNA 3′end processing as a highly regulated process that fine-tunes posttranscriptional gene expression. This process can affect the site and/or the efficiency of 3′end processing, controlling the quality and the quantity of substrate mRNAs. The regulation of 3′end processing plays a central role in fundamental physiology such as blood coagulation and innate immunity. In addition, errors in mRNA 3′end processing have been associated with a broad spectrum of human diseases, including cancer. We summarize and discuss the paradigmatic shift in the understanding of 3′end processing as a mechanism of posttranscriptional gene regulation that has reached clinical medicine.
Collapse
Affiliation(s)
- Ina Hollerer
- Department of Pediatric Oncology, Hematology and Immunology, University of Heidelberg, Heidelberg, Germany
| | | | | | | |
Collapse
|
27
|
Laishram RS. Poly(A) polymerase (PAP) diversity in gene expression--star-PAP vs canonical PAP. FEBS Lett 2014; 588:2185-97. [PMID: 24873880 PMCID: PMC6309179 DOI: 10.1016/j.febslet.2014.05.029] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2014] [Revised: 05/02/2014] [Accepted: 05/15/2014] [Indexed: 01/09/2023]
Abstract
Almost all eukaryotic mRNAs acquire a poly(A) tail at the 3'-end by a concerted RNA processing event: cleavage and polyadenylation. The canonical PAP, PAPα, was considered the only nuclear PAP involved in general polyadenylation of mRNAs. A phosphoinositide-modulated nuclear PAP, Star-PAP, was then reported to regulate a select set of mRNAs in the cell. In addition, several non-canonical PAPs have been identified with diverse cellular functions. Further, canonical PAP itself exists in multiple isoforms thus illustrating the diversity of PAPs. In this review, we compare two nuclear PAPs, Star-PAP and PAPα with a general overview of PAP diversity in the cell. Emerging evidence suggests distinct niches of target pre-mRNAs for the two PAPs and that modulation of these PAPs regulates distinct cellular functions.
Collapse
Affiliation(s)
- Rakesh S Laishram
- Cancer Research Program, Rajiv Gandhi Centre for Biotechnology, Thiruvananthapuram 695014, India.
| |
Collapse
|
28
|
Jalkanen AL, Coleman SJ, Wilusz J. Determinants and implications of mRNA poly(A) tail size--does this protein make my tail look big? Semin Cell Dev Biol 2014; 34:24-32. [PMID: 24910447 DOI: 10.1016/j.semcdb.2014.05.018] [Citation(s) in RCA: 96] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2014] [Accepted: 05/31/2014] [Indexed: 12/22/2022]
Abstract
While the phenomenon of polyadenylation has been well-studied, the dynamics of poly(A) tail size and its impact on transcript function and cell biology are less well-appreciated. The goal of this review is to encourage readers to view the poly(A) tail as a dynamic, changeable aspect of a transcript rather than a simple static entity that marks the 3' end of an mRNA. This could open up new angles of regulation in the post-transcriptional control of gene expression throughout development, differentiation and cancer.
Collapse
Affiliation(s)
- Aimee L Jalkanen
- Department of Microbiology, Immunology and Pathology, Colorado State University, Fort Collins, CO 80523, USA
| | - Stephen J Coleman
- Department of Microbiology, Immunology and Pathology, Colorado State University, Fort Collins, CO 80523, USA
| | - Jeffrey Wilusz
- Department of Microbiology, Immunology and Pathology, Colorado State University, Fort Collins, CO 80523, USA.
| |
Collapse
|
29
|
Zheng D, Tian B. RNA-binding proteins in regulation of alternative cleavage and polyadenylation. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2014; 825:97-127. [PMID: 25201104 DOI: 10.1007/978-1-4939-1221-6_3] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Almost all eukaryotic pre-mRNAs are processed at the 3' end by the cleavage and polyadenylation (C/P) reaction, which preludes termination of transcription and gives rise to the poly(A) tail of mature mRNA. Genomic studies in recent years have indicated that most eukaryotic mRNA genes have multiple cleavage and polyadenylation sites (pAs), leading to alternative cleavage and polyadenylation (APA) products. APA isoforms generally differ in their 3' untranslated regions (3' UTRs), but can also have different coding sequences (CDSs). APA expands the repertoire of transcripts expressed from the genome, and is highly regulated under various physiological and pathological conditions. Growing lines of evidence have shown that RNA-binding proteins (RBPs) play important roles in regulation of APA. Some RBPs are part of the machinery for C/P; others influence pA choice through binding to adjacent regions. In this chapter, we review cis elements and trans factors involved in C/P, the significance of APA, and increasingly elucidated roles of RBPs in APA regulation. We also discuss analysis of APA using transcriptome-wide techniques as well as molecular biology approaches.
Collapse
Affiliation(s)
- Dinghai Zheng
- Department of Biochemistry and Molecular Biology, University of Medicine and Dentistry of New Jersey (UMDNJ)-New Jersey Medical School, 185 South Orange Ave., Newark, NJ, 07103, USA
| | | |
Collapse
|
30
|
Hafez D, Ni T, Mukherjee S, Zhu J, Ohler U. Genome-wide identification and predictive modeling of tissue-specific alternative polyadenylation. Bioinformatics 2013; 29:i108-16. [PMID: 23812974 PMCID: PMC3694680 DOI: 10.1093/bioinformatics/btt233] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Motivation: Pre-mRNA cleavage and polyadenylation are essential steps for 3′-end maturation and subsequent stability and degradation of mRNAs. This process is highly controlled by cis-regulatory elements surrounding the cleavage/polyadenylation sites (polyA sites), which are frequently constrained by sequence content and position. More than 50% of human transcripts have multiple functional polyA sites, and the specific use of alternative polyA sites (APA) results in isoforms with variable 3′-untranslated regions, thus potentially affecting gene regulation. Elucidating the regulatory mechanisms underlying differential polyA preferences in multiple cell types has been hindered both by the lack of suitable data on the precise location of cleavage sites, as well as of appropriate tests for determining APAs with significant differences across multiple libraries. Results: We applied a tailored paired-end RNA-seq protocol to specifically probe the position of polyA sites in three human adult tissue types. We specified a linear-effects regression model to identify tissue-specific biases indicating regulated APA; the significance of differences between tissue types was assessed by an appropriately designed permutation test. This combination allowed to identify highly specific subsets of APA events in the individual tissue types. Predictive models successfully classified constitutive polyA sites from a biologically relevant background (auROC = 99.6%), as well as tissue-specific regulated sets from each other. We found that the main cis-regulatory elements described for polyadenylation are a strong, and highly informative, hallmark for constitutive sites only. Tissue-specific regulated sites were found to contain other regulatory motifs, with the canonical polyadenylation signal being nearly absent at brain-specific polyA sites. Together, our results contribute to the understanding of the diversity of post-transcriptional gene regulation. Availability: Raw data are deposited on SRA, accession numbers: brain SRX208132, kidney SRX208087 and liver SRX208134. Processed datasets as well as model code are published on our website: http://www.genome.duke.edu/labs/ohler/research/UTR/ Contact:uwe.ohler@duke.edu
Collapse
Affiliation(s)
- Dina Hafez
- Department of Computer Science, Duke University, Durham, NC 27708, USA
| | | | | | | | | |
Collapse
|
31
|
Li XQ, Du D. RNA polyadenylation sites on the genomes of microorganisms, animals, and plants. PLoS One 2013; 8:e79511. [PMID: 24260238 PMCID: PMC3832601 DOI: 10.1371/journal.pone.0079511] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2013] [Accepted: 09/29/2013] [Indexed: 01/15/2023] Open
Abstract
Pre–messenger RNA (mRNA) 3′-end cleavage and subsequent polyadenylation strongly regulate gene expression. In comparison with the upstream or downstream motifs, relatively little is known about the feature differences of polyadenylation [poly(A)] sites among major kingdoms. We suspect that the precise poly(A) sites are very selective, and we therefore mapped mRNA poly(A) sites on complete and nearly complete genomes using mRNA sequences available in the National Center for Biotechnology Information (NCBI) Nucleotide database. In this paper, we describe the mRNA nucleotide [i.e., the poly(A) tail attachment position] that is directly in attachment with the poly(A) tail and the pre-mRNA nucleotide [i.e., the poly(A) tail starting position] that corresponds to the first adenosine of the poly(A) tail in the 29 most-mapped species (2 fungi, 2 protists, 18 animals, and 7 plants). The most representative pre-mRNA dinucleotides covering these two positions were UA, CA, and GA in 17, 10, and 2 of the species, respectively. The pre-mRNA nucleotide at the poly(A) tail starting position was typically an adenosine [i.e., A-type poly(A) sites], sometimes a uridine, and occasionally a cytidine or guanosine. The order was U>C>G at the attachment position but A>>U>C≥G at the starting position. However, in comparison with the mRNA nucleotide composition (base composition), the poly(A) tail attachment position selected C over U in plants and both C and G over U in animals, in both A-type and non-A-type poly(A) sites. Animals, dicot plants, and monocot plants had clear differences in C/G ratios at the poly(A) tail attachment position of the non-A-type poly(A) sites. This study of poly(A) site evolution indicated that the two positions within poly(A) sites had distinct nucleotide compositions and were different among kingdoms.
Collapse
Affiliation(s)
- Xiu-Qing Li
- Molecular Genetics Laboratory, Potato Research Centre, Agriculture and Agri-Food Canada, Fredericton, New Brunswick, Canada
- * E-mail:
| | - Donglei Du
- Quantitative Methods Research Group, Faculty of Business Administration, University of New Brunswick, Fredericton, New Brunswick, Canada
| |
Collapse
|
32
|
Nucleosome distribution near the 3' ends of genes in the human genome. Biosci Biotechnol Biochem 2013; 77:2051-5. [PMID: 24096667 DOI: 10.1271/bbb.130399] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
By systematic analysis of high-throughput sequencing datasets from the human genome, we found that protein-coding genes have a specific chromatin structure near transcription termination sites relative to non-coding genes, one related to polyadenylation. Nucleosome was depleted near the site of cleavage/polyadenylation (polyA site) regardless of its relative position in the gene. DNA sequence plays an improtant role in nucleosome distribution, and conservative sequence elements and the protein binding to them are major determinants in causing nucleosome depletion near polyA sites. Furthermore, nucleosome occupancy was regulated by gene transcription and RNA polymerase II (RNAPII) occupancy. Our results reveal influences on nucleosome occupancy near polyadenylation sites and constitute evidence indicating that nucleosome distribution regulates 3' end processing of protein-coding genes.
Collapse
|
33
|
Functional premature polyadenylation signals and aberrant splicing within a recombinant protein coding sequence limit expression. Protein Expr Purif 2013; 92:14-20. [PMID: 23994311 DOI: 10.1016/j.pep.2013.08.011] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2013] [Revised: 07/19/2013] [Accepted: 08/19/2013] [Indexed: 11/20/2022]
Abstract
Recombinant glycoproteins can be produced at high levels in permanently transfected mammalian cells using expression vectors with strong viral promoters. CHO-K1 cell lines developed to produce the recombinant complement activator blocking protein, CAB-2 (a fusion of membrane co-factor protein, MCP, and decay accelerating factor, DAF), showed unexpectedly low expression. Northern blot analysis revealed that in addition to the expected 2300 base CAB-2 mRNA species, these cell lines expressed 790 and 1500 base mRNA species accounting for ~50% and ~10% of the total CAB-2 mRNA, respectively. RT-PCR studies established that the 1500 base species resulted from aberrant splicing from within the DAF region of the CAB-2 coding sequence to a site within the 3' untranslated region. 3' RACE analysis confirmed that the 790 base species resulted from premature polyadenylation at an AATAAA site within the MCP coding region of CAB-2. Another prematurely polyadenylated species, not observed on Northern blots, was observed in the DAF region by 3' RACE. Analysis of human tissues and cell lines revealed that these internal polyadenylation signals in native MCP and DAF coding regions also generated prematurely polyadenylated mRNAs. Genetic modification of these functional RNA processing elements within the CAB-2 gene eliminated the aberrant mRNA species and significantly increased recombinant CAB-2 expression. These results illustrate that protein expression can be limited by aberrant mRNA processing and demonstrate the importance of identifying and eliminating these mRNA processing signals from within coding DNA to maximize recombinant protein expression.
Collapse
|
34
|
Michalova E, Vojtesek B, Hrstka R. Impaired pre-mRNA processing and altered architecture of 3' untranslated regions contribute to the development of human disorders. Int J Mol Sci 2013; 14:15681-94. [PMID: 23896598 PMCID: PMC3759880 DOI: 10.3390/ijms140815681] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2013] [Revised: 06/21/2013] [Accepted: 06/24/2013] [Indexed: 11/16/2022] Open
Abstract
The biological fate of each mRNA and consequently, the protein to be synthesised, is highly dependent on the nature of the 3' untranslated region. Despite its non-coding character, the 3' UTR may affect the final mRNA stability, the localisation, the export from the nucleus and the translation efficiency. The conserved regulatory sequences within 3' UTRs and the specific elements binding to them enable gene expression control at the posttranscriptional level and all these processes reflect the actual state of the cell including proliferation, differentiation, cellular stress or tumourigenesis. Through this article, we briefly outline how the alterations in the establishment and final architecture of 3' UTRs may contribute to the development of various disorders in humans.
Collapse
Affiliation(s)
- Eva Michalova
- Regional Centre for Applied Molecular Oncology, Masaryk Memorial Cancer Institute, Zluty kopec 7, Brno 656 53, Czech Republic; E-Mails: (E.M.); (B.V.)
| | - Borivoj Vojtesek
- Regional Centre for Applied Molecular Oncology, Masaryk Memorial Cancer Institute, Zluty kopec 7, Brno 656 53, Czech Republic; E-Mails: (E.M.); (B.V.)
| | - Roman Hrstka
- Regional Centre for Applied Molecular Oncology, Masaryk Memorial Cancer Institute, Zluty kopec 7, Brno 656 53, Czech Republic; E-Mails: (E.M.); (B.V.)
| |
Collapse
|
35
|
Schrom EM, Moschall R, Hartl MJ, Weitner H, Fecher D, Langemeier J, Bohne J, Wöhrl BM, Bodem J. U1snRNP-mediated suppression of polyadenylation in conjunction with the RNA structure controls poly (A) site selection in foamy viruses. Retrovirology 2013; 10:55. [PMID: 23718736 PMCID: PMC3694450 DOI: 10.1186/1742-4690-10-55] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2012] [Accepted: 05/21/2013] [Indexed: 11/13/2022] Open
Abstract
Background During reverse transcription, retroviruses duplicate the long terminal repeats (LTRs). These identical LTRs carry both promoter regions and functional polyadenylation sites. To express full-length transcripts, retroviruses have to suppress polyadenylation in the 5′LTR and activate polyadenylation in the 3′LTR. Foamy viruses have a unique LTR structure with respect to the location of the major splice donor (MSD), which is located upstream of the polyadenylation signal. Results Here, we describe the mechanisms of foamy viruses regulating polyadenylation. We show that binding of the U1 small nuclear ribonucleoprotein (U1snRNP) to the MSD suppresses polyadenylation at the 5′LTR. In contrast, polyadenylation at the 3′LTR is achieved by adoption of a different RNA structure at the MSD region, which blocks U1snRNP binding and furthers RNA cleavage and subsequent polyadenylation. Conclusion Recently, it was shown that U1snRNP is able to suppress the usage of intronic cryptic polyadenylation sites in the cellular genome. Foamy viruses take advantage of this surveillance mechanism to suppress premature polyadenylation at the 5’end of their RNA. At the 3’end, Foamy viruses use a secondary structure to presumably block access of U1snRNP and thereby activate polyadenylation at the end of the genome. Our data reveal a contribution of U1snRNP to cellular polyadenylation site selection and to the regulation of gene expression.
Collapse
Affiliation(s)
- Eva-Maria Schrom
- Institute of Virology and Immunobiology, University of Würzburg, Würzburg, Germany
| | | | | | | | | | | | | | | | | |
Collapse
|
36
|
Abstract
Cellular and viral preRNAs are extensively cotranscriptionally modified. These modifications include the processing of the 3' end. Most preRNAs are polyadenylated, which is required for nuclear export, RNA stability, and efficient translation. Integrated retroviral genomes are flanked by 3' and 5' long terminal repeats (LTRs). Both LTRs are identical on the nucleotide level, but 3' processing has to be limited to the 3'LTR. Otherwise, polyadenylation at the 5'LTR would result in prematurely terminated, noncoding viral RNAs. Retroviruses have developed a variety of different mechanisms to restrict polyadenylation to the 3'LTR, although the overall structure of the LTRs is similar among all retroviruses. In general, these mechanisms can be divided into three main groups: (1) activation of polyadenylation only at the 3' end by encoding the essential polyadenylation signal in the unique 3 region; (2) suppression of polyadenylation at the 5'LTR by downstream elements such as the major splice donor; and (3) the usage of weak polyadenylation sites, which results in some premature polyadenylated noncoding RNAs and in read-through transcripts at the 3'LTR. All these mechanisms exhibit intrinsic problems, and retroviruses have evolved additional regulatory elements to promote polyadenylation at the 3'LTR only. In this review, we describe the molecular regulation of retroviral polyadenylation and highlight the different mechanisms used for polyadenylation control.
Collapse
Affiliation(s)
- Eva-Maria Schrom
- Universität Würzburg, Institut für Virologie und Immunbiologie, Würzburg, Germany
| | | | | | | |
Collapse
|
37
|
Dominski Z, Carpousis AJ, Clouet-d'Orval B. Emergence of the β-CASP ribonucleases: highly conserved and ubiquitous metallo-enzymes involved in messenger RNA maturation and degradation. BIOCHIMICA ET BIOPHYSICA ACTA-GENE REGULATORY MECHANISMS 2013; 1829:532-51. [PMID: 23403287 DOI: 10.1016/j.bbagrm.2013.01.010] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/19/2012] [Revised: 01/18/2013] [Accepted: 01/22/2013] [Indexed: 01/05/2023]
Abstract
The β-CASP ribonucleases, which are found in the three domains of life, have in common a core of 460 residues containing seven conserved sequence motifs involved in the tight binding of two catalytic zinc ions. A hallmark of these enzymes is their ability to catalyze both endo- and exo-ribonucleolytic degradation. Exo-ribonucleolytic degradation proceeds in the 5' to 3' direction and is sensitive to the phosphorylation state of the 5' end of a transcript. Recent phylogenomic analyses have shown that the β-CASP ribonucleases can be partitioned into two major subdivisions that correspond to orthologs of eukaryal CPSF73 and bacterial RNase J. We discuss the known functions of the CPSF73 and RNase J orthologs, their association into complexes, and their structure as it relates to mechanism of action. Eukaryal CPSF73 is part of a large multiprotein complex that is involved in the maturation of the 3' end of RNA Polymerase II transcripts and the polyadenylation of messenger RNA. RNase J1 and J2 are paralogs in Bacillus subtilis that are involved in the degradation of messenger RNA and the maturation of non-coding RNA. RNase J1 and J2 co-purify as a heteromeric complex and there is recent evidence that they interact with other enzymes to form a bacterial RNA degradosome. Finally, we speculate on the evolutionary origin of β-CASP ribonucleases and on their functions in Archaea. Orthologs of CPSF73 with endo- and exo-ribonuclease activity are strictly conserved throughout the archaea suggesting a role for these enzymes in the maturation and/or degradation of messenger RNA. This article is part of a Special Issue entitled: RNA Decay mechanisms.
Collapse
Affiliation(s)
- Zbigniew Dominski
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC, USA
| | | | | |
Collapse
|
38
|
Hon CC, Weber C, Sismeiro O, Proux C, Koutero M, Deloger M, Das S, Agrahari M, Dillies MA, Jagla B, Coppee JY, Bhattacharya A, Guillen N. Quantification of stochastic noise of splicing and polyadenylation in Entamoeba histolytica. Nucleic Acids Res 2012; 41:1936-52. [PMID: 23258700 PMCID: PMC3561952 DOI: 10.1093/nar/gks1271] [Citation(s) in RCA: 60] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Alternative splicing and polyadenylation were observed pervasively in eukaryotic messenger RNAs. These alternative isoforms could either be consequences of physiological regulation or stochastic noise of RNA processing. To quantify the extent of stochastic noise in splicing and polyadenylation, we analyzed the alternative usage of splicing and polyadenylation sites in Entamoeba histolytica using RNA-Seq. First, we identified a large number of rarely spliced alternative junctions and then showed that the occurrence of these alternative splicing events is correlated with splicing site sequence, occurrence of constitutive splicing events and messenger RNA abundance. Our results implied the majority of these alternative splicing events are likely to be stochastic error of splicing machineries, and we estimated the corresponding error rates. Second, we observed extensive microheterogeneity of polyadenylation cleavage sites, and the extent of such microheterogeneity is correlated with the occurrence of constitutive cleavage events, suggesting most of such microheterogeneity is likely to be stochastic. Overall, we only observed a small fraction of alternative splicing and polyadenylation isoforms that are unlikely to be solely stochastic, implying the functional relevance of alternative splicing and polyadenylation in E. histolytica is limited. Lastly, we revised the gene models and annotated their 3′UTR in AmoebaDB, providing valuable resources to the community.
Collapse
Affiliation(s)
- Chung-Chau Hon
- Institut Pasteur, Unité Biologie Cellulaire du Parasitisme, Département Biologie cellulaire et infection, F-75015 Paris, France, INSERM U786, F-75015 Paris, France.
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
39
|
Derti A, Garrett-Engele P, Macisaac KD, Stevens RC, Sriram S, Chen R, Rohl CA, Johnson JM, Babak T. A quantitative atlas of polyadenylation in five mammals. Genome Res 2012; 22:1173-83. [PMID: 22454233 PMCID: PMC3371698 DOI: 10.1101/gr.132563.111] [Citation(s) in RCA: 474] [Impact Index Per Article: 39.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
We developed PolyA-seq, a strand-specific and quantitative method for high-throughput sequencing of 3′ ends of polyadenylated transcripts, and used it to globally map polyadenylation (polyA) sites in 24 matched tissues in human, rhesus, dog, mouse, and rat. We show that PolyA-seq is as accurate as existing RNA sequencing (RNA-seq) approaches for digital gene expression (DGE), enabling simultaneous mapping of polyA sites and quantitative measurement of their usage. In human, we confirmed 158,533 known sites and discovered 280,857 novel sites (FDR < 2.5%). On average 10% of novel human sites were also detected in matched tissues in other species. Most novel sites represent uncharacterized alternative polyA events and extensions of known transcripts in human and mouse, but primarily delineate novel transcripts in the other three species. A total of 69.1% of known human genes that we detected have multiple polyA sites in their 3′UTRs, with 49.3% having three or more. We also detected polyadenylation of noncoding and antisense transcripts, including constitutive and tissue-specific primary microRNAs. The canonical polyA signal was strongly enriched and positionally conserved in all species. In general, usage of polyA sites is more similar within the same tissues across different species than within a species. These quantitative maps of polyA usage in evolutionarily and functionally related samples constitute a resource for understanding the regulatory mechanisms underlying alternative polyadenylation.
Collapse
Affiliation(s)
- Adnan Derti
- Department of Informatics IT, Merck and Co., Inc., Boston, Massachusetts 02115, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
40
|
Ruepp MD, Schümperli D, Barabino SML. mRNA 3' end processing and more--multiple functions of mammalian cleavage factor I-68. WILEY INTERDISCIPLINARY REVIEWS-RNA 2012; 2:79-91. [PMID: 21956970 DOI: 10.1002/wrna.35] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
The formation of defined 3(') ends is an important step in the biogenesis of mRNAs. In eukaryotic cells, all mRNA 3(') ends are generated by endonucleolytic cleavage of primary transcripts in reactions that are essentially posttranscriptional. Nevertheless, 3(') end formation is tightly connected to transcription in vivo, and a link with mRNA export to the cytoplasm has been postulated. Here, we briefly review the current knowledge about the two types of mRNA 3(') end processing reactions, cleavage/polyadenylation and histone RNA processing. We then focus on factors shared between these two reactions. In particular, we discuss evidence for new functions of the mammalian cleavage factor I subunit CF I(m) 68 in histone RNA 3(') processing and in the export of mature mRNAs from the nucleus to the cytoplasm.
Collapse
Affiliation(s)
- Marc-David Ruepp
- Institute of Cell Biology, University of Bern, Bern, Switzerland
| | | | | |
Collapse
|
41
|
Abstract
Polyadenylation [poly(A)] signals (PAS) are a defining feature of eukaryotic protein-coding genes. The central sequence motif AAUAAA was identified in the mid-1970s and subsequently shown to require flanking, auxiliary elements for both 3'-end cleavage and polyadenylation of premessenger RNA (pre-mRNA) as well as to promote downstream transcriptional termination. More recent genomic analysis has established the generality of the PAS for eukaryotic mRNA. Evidence for the mechanism of mRNA 3'-end formation is outlined, as is the way this RNA processing reaction communicates with RNA polymerase II to terminate transcription. The widespread phenomenon of alternative poly(A) site usage and how this interrelates with pre-mRNA splicing is then reviewed. This shows that gene expression can be drastically affected by how the message is ended. A central theme of this review is that while genomic analysis provides generality for the importance of PAS selection, detailed mechanistic understanding still requires the direct analysis of specific genes by genetic and biochemical approaches.
Collapse
Affiliation(s)
- Nick J Proudfoot
- Sir William Dunn School of Pathology, University of Oxford, Oxford OX1 3RE, United Kingdom.
| |
Collapse
|
42
|
Tian B, Graber JH. Signals for pre-mRNA cleavage and polyadenylation. WILEY INTERDISCIPLINARY REVIEWS-RNA 2011; 3:385-96. [PMID: 22012871 DOI: 10.1002/wrna.116] [Citation(s) in RCA: 159] [Impact Index Per Article: 12.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Abstract
Pre-mRNA cleavage and polyadenylation is an essential step for 3' end formation of almost all protein-coding transcripts in eukaryotes. The reaction, involving cleavage of nascent mRNA followed by addition of a polyadenylate or poly(A) tail, is controlled by cis-acting elements in the pre-mRNA surrounding the cleavage site. Experimental and bioinformatic studies in the past three decades have elucidated conserved and divergent elements across eukaryotes, from yeast to human. Here we review histories and current models of these elements in a broad range of species.
Collapse
Affiliation(s)
- Bin Tian
- UMDNJ-New Jersey Medical School, Newark, NJ, USA.
| | | |
Collapse
|
43
|
Qi Y, Ma Y, He R, Wang N, Ruan Q, Ji Y, Li M, Sun Z, Ren G. Characterization of 3' termini of human cytomegalovirus UL138-UL145 transcripts in a clinical strain. Microbiol Immunol 2011; 55:95-9. [PMID: 21204946 DOI: 10.1111/j.1348-0421.2010.00294.x] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
Abstract
The functions of some proteins encoded by human cytomegalovirus (HCMV) UL/b' genes have been studied; however, systematic analysis of the transcripts for this region is still insufficient. The results of both rapid amplification of cDNA ends (RACE) and cDNA library screening in this study proved that 3' termini of all transcripts in the UL138-UL145 region were located approximately 20 bp downstream from each potential poly (A) signal, which were at the positions of nucleotides 7184, 9954 and 12848 in the UL/b' sequence of the H strain, respectively. Thus, there were at least two large families of polycistronic transcripts in this gene region. The first family of 3'-coterminal transcripts contained UL139, UL140 and UL141 genes, and the second one consisted of UL142, UL143, UL144 and UL145 genes. The 3'-coterminal characterization further confirmed that multiple uses of polyadenylation signals were commonly used by HCMV to utilize genetic information.
Collapse
Affiliation(s)
- Ying Qi
- Virus Laboratory, Shengjing Hospital, China Medical University, No. 36 Sanhao Street, Heping District, Shenyang, Liaoning 110004, China
| | | | | | | | | | | | | | | | | |
Collapse
|
44
|
Tanaka M, Sakai Y, Yamada O, Shintani T, Gomi K. In silico analysis of 3'-end-processing signals in Aspergillus oryzae using expressed sequence tags and genomic sequencing data. DNA Res 2011; 18:189-200. [PMID: 21586533 PMCID: PMC3111234 DOI: 10.1093/dnares/dsr011] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
To investigate 3'-end-processing signals in Aspergillus oryzae, we created a nucleotide sequence data set of the 3'-untranslated region (3' UTR) plus 100 nucleotides (nt) sequence downstream of the poly(A) site using A. oryzae expressed sequence tags and genomic sequencing data. This data set comprised 1065 sequences derived from 1042 unique genes. The average 3' UTR length in A. oryzae was 241 nt, which is greater than that in yeast but similar to that in plants. The 3' UTR and 100 nt sequence downstream of the poly(A) site is notably U-rich, while the region located 15-30 nt upstream of the poly(A) site is markedly A-rich. The most frequently found hexanucleotide in this A-rich region is AAUGAA, although this sequence accounts for only 6% of all transcripts. These data suggested that A. oryzae has no highly conserved sequence element equivalent to AAUAAA, a mammalian polyadenylation signal. We identified that putative 3'-end-processing signals in A. oryzae, while less well conserved than those in mammals, comprised four sequence elements: the furthest upstream U-rich element, A-rich sequence, cleavage site, and downstream U-rich element flanking the cleavage site. Although these putative 3'-end-processing signals are similar to those in yeast and plants, some notable differences exist between them.
Collapse
Affiliation(s)
- Mizuki Tanaka
- Laboratory of Bioindustrial Genomics, Department of Bioindustrial Informatics and Genomics, Graduate School of Agricultural Science, Tohoku University, 1-1 Tsutsumidori-Amamiyamachi, Aoba-ku, Sendai 981-8555, Japan
| | | | | | | | | |
Collapse
|
45
|
Khaladkar M, Smyda M, Hannenhalli S. Epigenomic and RNA structural correlates of polyadenylation. RNA Biol 2011; 8:529-37. [PMID: 21508683 DOI: 10.4161/rna.8.3.15194] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023] Open
Abstract
Polyadenylation (poly(A)) of mRNA plays a critical role in regulating gene expression. Identifying the sequence, structural, and epigenomic determinants of poly(A) site usage is an important long term goal. Several cis elements that mediate poly(A) regulation have been identified. Highly used poly(A) sites are also known to have a greater nucleosome occupancy in the immediate downstream. However, a detailed exploration of additional epigenomic and mRNA structural correlates of poly(A) site usage has not been reported. Importantly, functional interaction between sequence, structure, and the epigenome in determining the poly(A) site usage is not known. We show that highly used poly(A) sites are positively associated with an mRNA structure that is energetically more favorable and one that better exposes a critical polyadenylation cis element. In exploring potential interplay between RNA and chromatin structure, we found that a stronger nucleosome occupancy downstream of poly(A) site strongly correlated with (1) a more favorable mRNA structure, and (2) a greater accumulation of RNA Polymerase II (PolII) at the poly(A) site. Further analysis suggested a causal relationship pointing from PolII accumulation to a stable RNA structure. Additionally, we found that distinct patterns of histone modifications characterize poly(A) sites and these epigenetic patterns alone can distinguish true poly(A) sites with ~76% accuracy and also discriminate between high and low usage poly(A) sites with ~74% accuracy. Our results suggest a causative link between chromatin structure and mRNA structure whereby a compacted chromatin downstream of the poly(A) site slows down the elongating transcript, thus facilitating the folding of nascent mRNA in a favorable structure at poly(A) site during transcription. Additionally we report hitherto unknown epigenomic correlates for poly(A) site usage.
Collapse
Affiliation(s)
- Mugdha Khaladkar
- Department of Biology, University of Pennsylvania, Philadelphia, PA, USA
| | | | | |
Collapse
|
46
|
Chan S, Choi EA, Shi Y. Pre-mRNA 3'-end processing complex assembly and function. WILEY INTERDISCIPLINARY REVIEWS-RNA 2010; 2:321-35. [PMID: 21957020 DOI: 10.1002/wrna.54] [Citation(s) in RCA: 112] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Abstract
The 3'-ends of almost all eukaryotic mRNAs are formed in a two-step process, an endonucleolytic cleavage followed by polyadenylation (the addition of a poly-adenosine or poly(A) tail). These reactions take place in the pre-mRNA 3' processing complex, a macromolecular machinery that consists of more than 20 proteins. A general framework for how the pre-mRNA 3' processing complex assembles and functions has emerged from extensive studies over the past several decades using biochemical, genetic, computational, and structural approaches. In this article, we review what we have learned about this important cellular machine and discuss the remaining questions and future challenges.
Collapse
Affiliation(s)
- Serena Chan
- Department of Microbiology and Molecular Genetics, University of California, Irvine, CA, USA
| | | | | |
Collapse
|
47
|
Ruepp MD, Vivarelli S, Pillai RS, Kleinschmidt N, Azzouz TN, Barabino SML, Schümperli D. The 68 kDa subunit of mammalian cleavage factor I interacts with the U7 small nuclear ribonucleoprotein and participates in 3'-end processing of animal histone mRNAs. Nucleic Acids Res 2010; 38:7637-50. [PMID: 20634199 PMCID: PMC2995043 DOI: 10.1093/nar/gkq613] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022] Open
Abstract
Metazoan replication-dependent histone pre-mRNAs undergo a unique 3′-cleavage reaction which does not result in mRNA polyadenylation. Although the cleavage site is defined by histone-specific factors (hairpin binding protein, a 100-kDa zinc-finger protein and the U7 snRNP), a large complex consisting of cleavage/polyadenylation specificity factor, two subunits of cleavage stimulation factor and symplekin acts as the effector of RNA cleavage. Here, we report that yet another protein involved in cleavage/polyadenylation, mammalian cleavage factor I 68-kDa subunit (CF Im68), participates in histone RNA 3′-end processing. CF Im68 was found in a highly purified U7 snRNP preparation. Its interaction with the U7 snRNP depends on the N-terminus of the U7 snRNP protein Lsm11, known to be important for histone RNA processing. In vivo, both depletion and overexpression of CF Im68 cause significant decreases in processing efficiency. In vitro 3′-end processing is slightly stimulated by the addition of low amounts of CF Im68, but inhibited by high amounts or by anti-CF Im68 antibody. Finally, immunoprecipitation of CF Im68 results in a strong enrichment of histone pre-mRNAs. In contrast, the small CF Im subunit, CF Im25, does not appear to be involved in histone RNA processing.
Collapse
Affiliation(s)
- Marc-David Ruepp
- Institute of Cell Biology, University of Bern, CH-3012 Bern, Switzerland
| | | | | | | | | | | | | |
Collapse
|
48
|
Cheng DW, Lin H, Takahashi Y, Walker MA, Civerolo EL, Stenger DC. Transcriptional regulation of the grape cytochrome P450 monooxygenase gene CYP736B expression in response to Xylella fastidiosa infection. BMC PLANT BIOLOGY 2010; 10:135. [PMID: 20591199 PMCID: PMC3095286 DOI: 10.1186/1471-2229-10-135] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/27/2009] [Accepted: 07/01/2010] [Indexed: 05/18/2023]
Abstract
BACKGROUND Plant cytochrome P450 monooxygenases (CYP) mediate synthesis and metabolism of many physiologically important primary and secondary compounds that are related to plant defense against a range of pathogenic microbes and insects. To determine if cytochrome P450 monooxygenases are involved in defense response to Xylella fastidiosa (Xf) infection, we investigated expression and regulatory mechanisms of the cytochrome P450 monooxygenase CYP736B gene in both disease resistant and susceptible grapevines. RESULTS Cloning of genomic DNA and cDNA revealed that the CYP736B gene was composed of two exons and one intron with GT as a donor site and AG as an acceptor site. CYP736B transcript was up-regulated in PD-resistant plants and down-regulated in PD-susceptible plants 6 weeks after Xf inoculation. However, CYP736B expression was very low in stem tissues at all evaluated time points. 5'RACE and 3'RACE sequence analyses revealed that there were three candidate transcription start sites (TSS) in the upstream region and three candidate polyadenylation (PolyA) sites in the downstream region of CYP736B. Usage frequencies of each transcription initiation site and each polyadenylation site varied depending on plant genotype, developmental stage, tissue, and treatment. These results demonstrate that expression of CYP736B is regulated developmentally and in response to Xf infection at both transcriptional and post-transcriptional levels. Multiple transcription start and polyadenylation sites contribute to regulation of CYP736B expression. CONCLUSIONS This report provides evidence that the cytochrome P450 monooxygenase CYP736B gene is involved in defense response at a specific stage of Xf infection in grapevines; multiple transcription initiation and polyadenylation sites exist for CYP736B in grapevine; and coordinative and selective use of transcription initiation and polyadenylation sites play an important role in regulation of CYP736B expression during growth, development and response to Xf infection.
Collapse
Affiliation(s)
- Davis W Cheng
- San Joaquin Valley Agricultural Science Center, USDA-ARS 9611 South Riverbend Avenue, Parlier, CA 93648, USA
- Department of Biology, California State University, Fresno, CA 93740, USA
| | - Hong Lin
- San Joaquin Valley Agricultural Science Center, USDA-ARS 9611 South Riverbend Avenue, Parlier, CA 93648, USA
| | - Yuri Takahashi
- Department of Viticulture and Enology, University of California, Davis, CA 95616, USA
- Department of Food sciences, Ehime Women's College, Uwajima, Ehime, 798-0025 Japan
| | - M Andrew Walker
- Department of Viticulture and Enology, University of California, Davis, CA 95616, USA
| | - Edwin L Civerolo
- San Joaquin Valley Agricultural Science Center, USDA-ARS 9611 South Riverbend Avenue, Parlier, CA 93648, USA
| | - Drake C Stenger
- San Joaquin Valley Agricultural Science Center, USDA-ARS 9611 South Riverbend Avenue, Parlier, CA 93648, USA
| |
Collapse
|
49
|
Newnham CM, Hall-Pogar T, Liang S, Wu J, Tian B, Hu J, Lutz CS. Alternative polyadenylation of MeCP2: Influence of cis-acting elements and trans-acting factors. RNA Biol 2010; 7:361-72. [PMID: 20400852 DOI: 10.4161/rna.7.3.11564] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
The human MeCP2 gene encodes a ubiquitously expressed methyl CpG binding protein. Mutations in this gene cause a neurodevelopmental disorder called Rett Syndrome (RS). Mutations identified in the coding region of MeCP2 account for approximately 65% of all RS cases. However, 35% of all patients do not show mutations in the coding region of MeCP2, suggesting that mutations in non-coding regions likely exist that affect MeCP2 expression rather than protein function. The gene is unusual in that is has a >8.5 kb 3' untranslated region (3' UTR), and the size of the 3'UTR is differentially regulated in various tissues because of distinct polyadenylation signals. We have identified putative cis-acting auxiliary regulatory elements that play a role in alternative polyadenylation of MeCP2 using an in vivo polyadenylation reporter assay and in a luciferase assay. These cis-acting auxiliary elements are found both upstream and downstream of the core CPSF binding sites. Mutation of one of these cis-acting auxiliary elements, a G-rich element (GRS) significantly reduced MeCP2 polyadenylation efficiency in vivo. We further investigated what trans-acting factor(s) might be binding to this cis-acting element and found that hnRNP F protein binds specifically to the element. We next investigated the MeCP2 3' UTRs by performing quantitative real-time PCR; the data suggest that altered RNA stability is not a major factor in differential MeCP2 3' UTR usage. In sum, the mechanism(s) of regulated alternative 3'UTR usage of MeCP2 are complex, and insight into these mechanisms will aid our understanding of the factors that influence MeCP2 expression.
Collapse
Affiliation(s)
- Catherine M Newnham
- Department of Physiology and Experimental Medicine, University of Toronto, Toronto, ON, Canada
| | | | | | | | | | | | | |
Collapse
|
50
|
Moraes KCM. RNA surveillance: molecular approaches in transcript quality control and their implications in clinical diseases. Mol Med 2010; 16:53-68. [PMID: 19829759 PMCID: PMC2761007 DOI: 10.2119/molmed.2009.00026] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2009] [Accepted: 10/06/2009] [Indexed: 11/06/2022] Open
Abstract
Production of mature mRNAs that encode functional proteins involves highly complex pathways of synthesis, processing and surveillance. At numerous steps during the maturation process, the mRNA transcript undergoes scrutiny by cellular quality control machinery. This extensive RNA surveillance ensures that only correctly processed mature mRNAs are translated and precludes production of aberrant transcripts that could encode mutant or possibly deleterious proteins. Recent advances in elucidating the molecular mechanisms of mRNA processing have demonstrated the existence of an integrated network of events, and have revealed that a variety of human diseases are caused by disturbances in the well-coordinated molecular equilibrium of these events. From a medical perspective, both loss and gain of function are relevant, and a considerable number of different diseases exemplify the importance of the mechanistic function of RNA surveillance in a cell. Here, mechanistic hallmarks of mRNA processing steps are reviewed, highlighting the medical relevance of their deregulation and how the understanding of such mechanisms can contribute to the development of therapeutic strategies.
Collapse
Affiliation(s)
- Karen C M Moraes
- Molecular Biology Laboratory, IP&D, Universidade do Vale do Paraíba, São José dos Campos, São Paulo, CEP-12244-000, Brazil.
| |
Collapse
|