1
|
Ding J, Su Y, Liu Y, Xu Y, Yang D, Wang X, Hao S, Zhou H, Li H. The role of CSTF2 in cancer: from technology to clinical application. Cell Cycle 2023; 22:2622-2636. [PMID: 38166492 PMCID: PMC10936678 DOI: 10.1080/15384101.2023.2299624] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2023] [Revised: 12/03/2023] [Accepted: 12/20/2023] [Indexed: 01/04/2024] Open
Abstract
A protein called cleavage-stimulating factor subunit 2 (CSTF2, additionally called CSTF-64) binds RNA and is needed for the cleavage and polyadenylation of mRNA. CSTF2 is an important component subunit of the cleavage stimulating factor (CSTF), which is located on the X chromosome and encodes 557 amino acids. There is compelling evidence linking elevated CSTF2 expression to the pathological advancement of cancer and on its impact on the clinical aspects of the disease. The progression of cancers, including hepatocellular carcinoma, melanoma, prostate cancer, breast cancer, and pancreatic cancer, is correlated with the upregulation of CSTF2 expression. This review provides a fresh perspective on the investigation of the associations between CSTF2 and various malignancies and highlights current studies on the regulation of CSTF2. In particular, the mechanism of action and potential clinical applications of CSTF2 in cancer suggest that CSTF2 can serve as a new biomarker and individualized treatment target for a variety of cancer types.
Collapse
Affiliation(s)
- Jiaxiang Ding
- Clinical Trial Center of the First Affiliated Hospital of Bengbu Medical University, Bengbu, Anhui, China
- School of Public Foundation, Bengbu Medical University, Bengbu, Anhui, China
| | - Yue Su
- Clinical Trial Center of the First Affiliated Hospital of Bengbu Medical University, Bengbu, Anhui, China
- School of Public Foundation, Bengbu Medical University, Bengbu, Anhui, China
| | - Youru Liu
- The People’s Hospital of Bozhou, Bozhou, Anhui, China
| | - Yuanyuan Xu
- Clinical Trial Center of the First Affiliated Hospital of Bengbu Medical University, Bengbu, Anhui, China
- School of Pharmacy, Bengbu Medical University, Bengbu, Anhui, China
| | - Dashuai Yang
- Inflammation and Immune Mediated Diseases Laboratory of Anhui Province; School of Pharmacy, Anhui Medical University, Hefei, Anhui, China
| | - Xuefeng Wang
- Clinical Trial Center of the First Affiliated Hospital of Bengbu Medical University, Bengbu, Anhui, China
- School of Pharmacy, Bengbu Medical University, Bengbu, Anhui, China
| | - Shuli Hao
- The People’s Hospital of Bozhou, Bozhou, Anhui, China
| | - Huan Zhou
- Clinical Trial Center of the First Affiliated Hospital of Bengbu Medical University, Bengbu, Anhui, China
- School of Public Foundation, Bengbu Medical University, Bengbu, Anhui, China
- School of Pharmacy, Bengbu Medical University, Bengbu, Anhui, China
| | - Hongtao Li
- Clinical Trial Center of the First Affiliated Hospital of Bengbu Medical University, Bengbu, Anhui, China
| |
Collapse
|
2
|
Slight Variations in the Sequence Downstream of the Polyadenylation Signal Significantly Increase Transgene Expression in HEK293T and CHO Cells. Int J Mol Sci 2022; 23:ijms232415485. [PMID: 36555130 PMCID: PMC9779314 DOI: 10.3390/ijms232415485] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2022] [Revised: 11/28/2022] [Accepted: 12/01/2022] [Indexed: 12/13/2022] Open
Abstract
Compared to transcription initiation, much less is known about transcription termination. In particular, large-scale mutagenesis studies have, so far, primarily concentrated on promoter and enhancer, but not terminator sequences. Here, we used a massively parallel reporter assay (MPRA) to systematically analyze the influence of short (8 bp) sequence variants (mutations) located downstream of the polyadenylation signal (PAS) on the steady-state mRNA level of the upstream gene, employing an eGFP reporter and human HEK293T cells as a model system. In total, we evaluated 227,755 mutations located at different overlapping positions within +17..+56 bp downstream of the PAS for their ability to regulate the reporter gene expression. We found that the positions +17..+44 bp downstream of the PAS are more essential for gene upregulation than those located more distal to the PAS, and that the mutation sequences ensuring high levels of eGFP mRNA expression are extremely T-rich. Next, we validated the positive effect of a couple of mutations identified in the MPRA screening on the eGFP and luciferase protein expression. The most promising mutation increased the expression of the reporter proteins 13-fold and sevenfold on average in HEK293T and CHO cells, respectively. Overall, these findings might be useful for further improving the efficiency of production of therapeutic products, e.g., recombinant antibodies.
Collapse
|
3
|
Zhang W, Wan Y, Zhang Y, Liu Q, Zhu X. CSTF2 Acts as a Prognostic Marker Correlated with Immune Infiltration in Hepatocellular Carcinoma. Cancer Manag Res 2022; 14:2691-2709. [PMID: 36117731 PMCID: PMC9481280 DOI: 10.2147/cmar.s359545] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2022] [Accepted: 09/04/2022] [Indexed: 11/23/2022] Open
Abstract
Background Cleavage stimulation factor 2 (CSTF2) encodes a nuclear protein that is implicated in the development of various cancers. However, the role of CSTF2 in hepatocellular carcinoma (HCC) has not been understood. This study aims to explore the function of CSTF2 in HCC. Methods The expression, diagnostic capability, prognostic value, and immune cell effect of CSTF2 in HCC were explored using various databases. The expression level of CSTF2 were validated in our cell lines. The effect of CSTF2 on hepatocarcinogenesis was explored by CSTF2 silencing. Results CSTF2 expression was significantly elevated in HCC and correlated with multiple clinicopathological characteristics. CSTF2 exhibited good diagnostic capability in discriminating HCC samples from nontumorous samples. High CSTF2 expression was significantly related to poor overall survival. Univariate and multivariate Cox regression analyses suggested that CSTF2 expression was an independent risk factor for HCC. These results were validated in ICGC cohorts. In addition, the nomogram based on CSTF2 showed better predictive performance than the AJCC staging system in TCGA and ICGC cohorts. Functional enrichment analysis revealed that CSTF2-related genes were involved in DNA/RNA processing and the cell cycle. In addition, we found that CSTF2 expression was closely related to the levels of various infiltrating immune cells, especially neutrophils. Moreover, some immune checkpoints had positive relationships with CSTF2 expression. CSTF2 silencing inhibited proliferation, invasion and migration, and promoted apoptosis in HepG2 cells. Western blotting analysis revealed that CSTF2 silencing inactivated the Wnt/β-catenin signaling pathway. Conclusion High CSTF2 expression not only correlates with unfavorable outcomes but also affects immune cell infiltration and immune checkpoint expression in HCC. CSTF2 silencing can alleviate the malignant phenotypes of hepatic cancer cell by inactivating the Wnt/β-catenin signaling pathway. These results indicate that CSTF2 can serve as a promising prognostic marker and therapeutic target for HCC patients.
Collapse
Affiliation(s)
- Wang Zhang
- Departments of Gastroenterology and Hepatology, First Affiliated Hospital of Nanchang University, Nanchang, People's Republic of China
| | - Yipeng Wan
- Departments of Gastroenterology and Hepatology, First Affiliated Hospital of Nanchang University, Nanchang, People's Republic of China
| | - Yue Zhang
- Departments of Gastroenterology and Hepatology, First Affiliated Hospital of Nanchang University, Nanchang, People's Republic of China
| | - Qi Liu
- Departments of Gastroenterology and Hepatology, First Affiliated Hospital of Nanchang University, Nanchang, People's Republic of China
| | - Xuan Zhu
- Departments of Gastroenterology and Hepatology, First Affiliated Hospital of Nanchang University, Nanchang, People's Republic of China
| |
Collapse
|
4
|
Aptardi predicts polyadenylation sites in sample-specific transcriptomes using high-throughput RNA sequencing and DNA sequence. Nat Commun 2021; 12:1652. [PMID: 33712618 PMCID: PMC7955126 DOI: 10.1038/s41467-021-21894-x] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2020] [Accepted: 02/18/2021] [Indexed: 02/01/2023] Open
Abstract
Annotation of polyadenylation sites from short-read RNA sequencing alone is a challenging computational task. Other algorithms rooted in DNA sequence predict potential polyadenylation sites; however, in vivo expression of a particular site varies based on a myriad of conditions. Here, we introduce aptardi (alternative polyadenylation transcriptome analysis from RNA-Seq data and DNA sequence information), which leverages both DNA sequence and RNA sequencing in a machine learning paradigm to predict expressed polyadenylation sites. Specifically, as input aptardi takes DNA nucleotide sequence, genome-aligned RNA-Seq data, and an initial transcriptome. The program evaluates these initial transcripts to identify expressed polyadenylation sites in the biological sample and refines transcript 3'-ends accordingly. The average precision of the aptardi model is twice that of a standard transcriptome assembler. In particular, the recall of the aptardi model (the proportion of true polyadenylation sites detected by the algorithm) is improved by over three-fold. Also, the model-trained using the Human Brain Reference RNA commercial standard-performs well when applied to RNA-sequencing samples from different tissues and different mammalian species. Finally, aptardi's input is simple to compile and its output is easily amenable to downstream analyses such as quantitation and differential expression.
Collapse
|
5
|
Ntini E, Vang Ørom UA. Targeting Polyadenylation for Retention of RNA at Chromatin. Methods Mol Biol 2021; 2161:51-58. [PMID: 32681505 DOI: 10.1007/978-1-0716-0680-3_5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Abstract
The various steps of RNA polymerase II transcription, including transcription initiation, splicing, and termination, are interlinked and tightly coordinated. Efficient 3'end processing is defined by sequence motifs emerging in the nascent-transcribed RNA strand and the cotranscriptional binding of regulatory proteins. The processing of a mature 3'end consists of cleavage and polyadenylation and is coupled with RNA polymerase II transcription termination and the dissociation of the nascent RNA transcript from the chromatin-associated transcriptional template. The subcellular and subnuclear topological specificity of the various RNA species is important for their functions. For instance, the formation of RNA-binding protein interactions, critical for the final outcome of gene expression, may require the nucleoplasmic fully spliced and polyadenylated form of an RNA transcript. Thus, interfering with the critical step of transcription termination and 3'end formation provides a means for assaying the functional potential of a given RNA of interest.In this protocol, we describe a method for blocking 3'end processing of the nascent RNA transcript, by using RNase H-inactive antisense oligonucleotides targeting cleavage and polyadenylation, delivered via transient transfection in cell culture.
Collapse
Affiliation(s)
- Evgenia Ntini
- Max Planck Institute for Molecular Genetics, Berlin, Germany. .,Freie Universität Berlin, Berlin, Germany.
| | | |
Collapse
|
6
|
Fine gene expression regulation by minor sequence variations downstream of the polyadenylation signal. Mol Biol Rep 2021; 48:1539-1547. [PMID: 33517473 DOI: 10.1007/s11033-021-06160-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2020] [Accepted: 01/12/2021] [Indexed: 12/22/2022]
Abstract
The termination of transcription is a complex process that substantially contributes to gene regulation in eukaryotes. Previously, it was noted that a single cytosine deletion at the position + 32 bp relative to the single polyadenylation signal AAUAAA (hereafter the dC mutation) causes a 2-fold increase in the transcription level of the upstream eGFP reporter in mouse embryonic stem cells. Here, we analyzed the conservation of this phenomenon in immortalized mouse, human and drosophila cell lines and the influence of the dC mutation on the choice of the pre-mRNA cleavage sites. We have constructed dual-reporter plasmids to accurately measure the effect of the dC and other nearby located mutations on eGFP mRNA level by RT-qPCR. In this way, we found that the dC mutation leads to a 2-fold increase in the expression level of the upstream eGFP reporter gene in cultured mouse and human, but not in drosophila cells. In addition, 3' RACE analysis demonstrated that eGFP pre-mRNAs are cut at multiple positions between + 14 to + 31, and that the most proximal cleavage site becomes almost exclusively utilized in the presence of the dC mutation. We also identified new short sequence variations located within positions + 25.. + 40 and + 33.. + 48 that increase eGFP expression up to ~2-4-fold. Altogether, the positive effect of the dC mutation seems to be conserved in mouse embryonic stem cells, mouse embryonic 3T3 fibroblasts and human HEK293T cells. In the latter cells, the dC mutation appears to be involved in regulating pre-mRNA cleavage site selection. Finally, a multiplexed approach is proposed to identify motifs located downstream of cleavage site(s) that are essential for transcription termination.
Collapse
|
7
|
Dhar S, Datta A, Brosh RM. DNA helicases and their roles in cancer. DNA Repair (Amst) 2020; 96:102994. [PMID: 33137625 DOI: 10.1016/j.dnarep.2020.102994] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2020] [Accepted: 09/28/2020] [Indexed: 12/15/2022]
Abstract
DNA helicases, known for their fundamentally important roles in genomic stability, are high profile players in cancer. Not only are there monogenic helicase disorders with a strong disposition to cancer, it is well appreciated that helicase variants are associated with specific cancers (e.g., breast cancer). Flipping the coin, DNA helicases are frequently overexpressed in cancerous tissues and reduction in helicase gene expression results in reduced proliferation and growth capacity, as well as DNA damage induction and apoptosis of cancer cells. The seminal roles of helicases in the DNA damage and replication stress responses, as well as DNA repair pathways, validate their vital importance in cancer biology and suggest their potential values as targets in anti-cancer therapy. In recent years, many laboratories have characterized the specialized roles of helicase to resolve transcription-replication conflicts, maintain telomeres, mediate cell cycle checkpoints, remodel stalled replication forks, and regulate transcription. In vivo models, particularly mice, have been used to interrogate helicase function and serve as a bridge for preclinical studies that may lead to novel therapeutic approaches. In this review, we will summarize our current knowledge of DNA helicases and their roles in cancer, emphasizing the latest developments.
Collapse
Affiliation(s)
- Srijita Dhar
- Laboratory of Molecular Gerontology, National Institute on Aging, National Institutes of Health, Baltimore, MD 21224, USA
| | - Arindam Datta
- Laboratory of Molecular Gerontology, National Institute on Aging, National Institutes of Health, Baltimore, MD 21224, USA
| | - Robert M Brosh
- Laboratory of Molecular Gerontology, National Institute on Aging, National Institutes of Health, Baltimore, MD 21224, USA.
| |
Collapse
|
8
|
Yang W, Hsu PL, Yang F, Song JE, Varani G. Reconstitution of the CstF complex unveils a regulatory role for CstF-50 in recognition of 3'-end processing signals. Nucleic Acids Res 2019; 46:493-503. [PMID: 29186539 PMCID: PMC5778602 DOI: 10.1093/nar/gkx1177] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2017] [Accepted: 11/13/2017] [Indexed: 01/18/2023] Open
Abstract
Cleavage stimulation factor (CstF) is a highly conserved protein complex composed of three subunits that recognizes G/U-rich sequences downstream of the polyadenylation signal of eukaryotic mRNAs. While CstF has been identified over 25 years ago, the architecture and contribution of each subunit to RNA recognition have not been fully understood. In this study, we provide a structural basis for the recruitment of CstF-50 to CstF via interaction with CstF-77 and establish that the hexameric assembly of CstF creates a high affinity platform to target various G/U-rich sequences. We further demonstrate that CstF-77 boosts the affinity of the CstF-64 RRM to the RNA targets and CstF-50 fine tunes the ability of the complex to recognize G/U sequences of certain lengths and content.
Collapse
Affiliation(s)
- Wen Yang
- Department of Chemistry, University of Washington, Seattle, WA 98195-1700, USA
| | - Peter L Hsu
- Department of Chemistry, University of Washington, Seattle, WA 98195-1700, USA
| | - Fan Yang
- Department of Chemistry, University of Washington, Seattle, WA 98195-1700, USA
| | - Jae-Eun Song
- Department of Chemistry, University of Washington, Seattle, WA 98195-1700, USA
| | - Gabriele Varani
- Department of Chemistry, University of Washington, Seattle, WA 98195-1700, USA
| |
Collapse
|
9
|
Zhu S, Wu X, Fu H, Ye C, Chen M, Jiang Z, Ji G. Modeling of Genome-Wide Polyadenylation Signals in Xenopus tropicalis. Front Genet 2019; 10:647. [PMID: 31333724 PMCID: PMC6616101 DOI: 10.3389/fgene.2019.00647] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2019] [Accepted: 06/18/2019] [Indexed: 12/22/2022] Open
Abstract
Alternative polyadenylation (APA) is an important post-transcriptional modification event to process messenger RNA (mRNA) for transcriptional termination, transport, and translation. In the present study, we characterized poly(A) signals in Xenopus tropicalis using 70,918 highly confident poly(A) sites derived from 16,511 protein-coding genes to understand their roles in the regulation of embryo development and gender difference. We examined potential factors, including the gene length, the number of introns in a gene, and the intron length, that may affect the prevalence of APA. We observed 12 prominent poly(A) signal patterns, which accounted for approximately 92% of total APA sites in Xenopus tropicalis. Among them, three patterns are specific to X. tropicalis, so they are absent in other animals such as humans or mice. We catalogued APA sites based on their genomic regions and developed a bioinformatics pipeline to identify over-represented signal patterns for each class. Then the schema of cis elements for APA sites in each genomic region was proposed. More importantly, APA usage is dramatically dynamic in embryos along five developmental stages and well-coordinated with the maternal-to-zygotic transition event. We used an entropy-based method to identify developmental stage-specific APA sites and identified significant signal patterns around specific sites and constitutive sites. We found that the APA frequency in different genomic regions varies with developmental stages and that those sites located in intron or coding sequence regions contribute most to the dynamics of gene expression during developmental stages. This study deciphers the characteristics and poly(A) signal patterns for both canonical APA sites and non-canonical APA sites across different developmental stages and gender dimorphisms in X. tropicalis, providing new insights into the dynamic regulation of distal and proximal APA.
Collapse
Affiliation(s)
- Sheng Zhu
- Department of Automation, Xiamen University, Xiamen, China.,National Institute for Data Science in Health and Medicine, Xiamen University, Xiamen, China
| | - Xiaohui Wu
- Department of Automation, Xiamen University, Xiamen, China.,National Institute for Data Science in Health and Medicine, Xiamen University, Xiamen, China.,Innovation Center for Cell Signaling Network, Xiamen University, Xiamen, China
| | - Hongjuan Fu
- Department of Automation, Xiamen University, Xiamen, China
| | - Congting Ye
- National Institute for Data Science in Health and Medicine, Xiamen University, Xiamen, China.,Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, China
| | - Moliang Chen
- Department of Automation, Xiamen University, Xiamen, China
| | - Zhihua Jiang
- Department of Animal Sciences and Center for Reproductive Biology, Washington State University, Pullman, WA, United States
| | - Guoli Ji
- Department of Automation, Xiamen University, Xiamen, China.,National Institute for Data Science in Health and Medicine, Xiamen University, Xiamen, China.,Innovation Center for Cell Signaling Network, Xiamen University, Xiamen, China
| |
Collapse
|
10
|
Cheng JK, Morse NJ, Wagner JM, Tucker SK, Alper HS. Design and Evaluation of Synthetic Terminators for Regulating Mammalian Cell Transgene Expression. ACS Synth Biol 2019; 8:1263-1275. [PMID: 31091408 DOI: 10.1021/acssynbio.8b00285] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
Tuning heterologous gene expression in mammalian production hosts has predominantly relied upon engineering the promoter elements driving the transcription of the transgene. Moreover, most regulatory elements have borrowed genetic sequences from viral elements. Here, we generate a set of 10 rational and 30 synthetic terminators derived from nonviral elements and evaluate them in the HT1080 and HEK293 cell lines to demonstrate that they are comparable in terms of tuning gene expression/protein output to the viral SV40 element and often require less sequence footprint. The mode of action of these terminators is determined to be an increase in mRNA half-life. Furthermore, we demonstrate that constructs comprising completely nonviral regulatory elements ( i.e., promoters and terminators) can outperform commonly used, strong viral based elements by nearly 2-fold. Ultimately, this novel set of terminators expanded our genetic toolkit for engineering mammalian host cells.
Collapse
Affiliation(s)
- Joseph K. Cheng
- McKetta Department of Chemical Engineering, The University of Texas at Austin, 200 E Dean Keeton Street, Stop C0400, Austin, Texas 78712, United States
| | - Nicholas J. Morse
- McKetta Department of Chemical Engineering, The University of Texas at Austin, 200 E Dean Keeton Street, Stop C0400, Austin, Texas 78712, United States
| | - James M. Wagner
- McKetta Department of Chemical Engineering, The University of Texas at Austin, 200 E Dean Keeton Street, Stop C0400, Austin, Texas 78712, United States
| | - Scott K. Tucker
- Institute for Cellular and Molecular Biology, The University of Texas at Austin, 2500 Speedway Avenue, Austin, Texas 78712, United States
| | - Hal S. Alper
- McKetta Department of Chemical Engineering, The University of Texas at Austin, 200 E Dean Keeton Street, Stop C0400, Austin, Texas 78712, United States
- Institute for Cellular and Molecular Biology, The University of Texas at Austin, 2500 Speedway Avenue, Austin, Texas 78712, United States
| |
Collapse
|
11
|
Garrido-Lecca A, Saldi T, Blumenthal T. Localization of RNAPII and 3' end formation factor CstF subunits on C. elegans genes and operons. Transcription 2016; 7:96-110. [PMID: 27124504 DOI: 10.1080/21541264.2016.1168509] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Transcription termination is mechanistically coupled to pre-mRNA 3' end formation to prevent transcription much beyond the gene 3' end. C. elegans, however, engages in polycistronic transcription of operons in which 3' end formation between genes is not accompanied by termination. We have performed RNA polymerase II (RNAPII) and CstF ChIP-seq experiments to investigate at a genome-wide level how RNAPII can transcribe through multiple poly-A signals without causing termination. Our data shows that transcription proceeds in some ways as if operons were composed of multiple adjacent single genes. Total RNAPII shows a small peak at the promoter of the gene cluster and a much larger peak at 3' ends. These 3' peaks coincide with maximal phosphorylation of Ser2 within the C-terminal domain (CTD) of RNAPII and maximal localization of the 3' end formation factor CstF. This pattern occurs at all 3' ends including those at internal sites in operons where termination does not occur. Thus the normal mechanism of 3' end formation does not always result in transcription termination. Furthermore, reduction of CstF50 by RNAi did not substantially alter the pattern of CstF64, total RNAPII, or Ser2 phosphorylation at either internal or terminal 3' ends. However, CstF50 RNAi did result in a subtle reduction of CstF64 binding upstream of the site of 3' cleavage, suggesting that the CstF50/CTD interaction may facilitate bringing the 3' end machinery to the transcription complex.
Collapse
Affiliation(s)
- Alfonso Garrido-Lecca
- a Department of Molecular, Cellular, and Developmental Biology , University of Colorado , Boulder , CO , USA
| | - Tassa Saldi
- a Department of Molecular, Cellular, and Developmental Biology , University of Colorado , Boulder , CO , USA
| | - Thomas Blumenthal
- a Department of Molecular, Cellular, and Developmental Biology , University of Colorado , Boulder , CO , USA
| |
Collapse
|
12
|
Youngblood BA, MacDonald CC. CstF-64 is necessary for endoderm differentiation resulting in cardiomyocyte defects. Stem Cell Res 2014; 13:413-21. [PMID: 25460602 DOI: 10.1016/j.scr.2014.09.005] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/07/2014] [Revised: 09/06/2014] [Accepted: 09/16/2014] [Indexed: 12/23/2022] Open
Abstract
Although adult cardiomyocytes have the capacity for cellular regeneration, they are unable to fully repair severely injured hearts. The use of embryonic stem cell (ESC)-derived cardiomyocytes as transplantable heart muscle cells has been proposed as a solution, but is limited by the lack of understanding of the developmental pathways leading to specification of cardiac progenitors. Identification of these pathways will enhance the ability to differentiate cardiomyocytes into a clinical source of transplantable cells. Here, we show that the mRNA 3' end processing protein, CstF-64, is essential for cardiomyocyte differentiation in mouse ESCs. Loss of CstF-64 in mouse ESCs results in loss of differentiation potential toward the endodermal lineage. However, CstF-64 knockout (Cstf2(E6)) cells were able to differentiate into neuronal progenitors, demonstrating that some differentiation pathways were still intact. Markers for mesodermal differentiation were also present, although Cstf2(E6) cells were defective in forming beating cardiomyocytes and expressing cardiac specific markers. Since the extraembryonic endoderm is needed for cardiomyocyte differentiation and endodermal markers were decreased, we hypothesized that endodermal factors were required for efficient cardiomyocyte formation in the Cstf2(E6) cells. Using conditioned medium from the extraembryonic endodermal (XEN) stem cell line we were able to restore cardiomyocyte differentiation in Cstf2(E6) cells, suggesting that CstF-64 has a role in regulating endoderm differentiation that is necessary for cardiac specification and that extraembryonic endoderm signaling is essential for cardiomyocyte development.
Collapse
Affiliation(s)
- Bradford A Youngblood
- Department of Cell Biology & Biochemistry, Texas Tech University Health Sciences Center, 3601 4th Street, Lubbock, TX 79430-6540, USA
| | - Clinton C MacDonald
- Department of Cell Biology & Biochemistry, Texas Tech University Health Sciences Center, 3601 4th Street, Lubbock, TX 79430-6540, USA.
| |
Collapse
|
13
|
Skourti-Stathaki K, Proudfoot NJ. A double-edged sword: R loops as threats to genome integrity and powerful regulators of gene expression. Genes Dev 2014; 28:1384-96. [PMID: 24990962 PMCID: PMC4083084 DOI: 10.1101/gad.242990.114] [Citation(s) in RCA: 394] [Impact Index Per Article: 39.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
R loops are three-stranded nucleic acid structures that comprise nascent RNA hybridized with the DNA template, leaving the nontemplate DNA single-stranded. These structures form naturally during transcription even though their persistent formation can have deleterious effects on genome integrity. Interestingly, an increasing number of studies also suggest that R loops function as potential gene expression regulators. Here, Skourti-Stathaki and Proudfoot review the most recent findings about R loops, highlighting their opposite roles in cellular fitness. R loops are three-stranded nucleic acid structures that comprise nascent RNA hybridized with the DNA template, leaving the nontemplate DNA single-stranded. R loops form naturally during transcription even though their persistent formation can be a risky outcome with deleterious effects on genome integrity. On the other hand, over the last few years, an increasingly strong case has been built for R loops as potential regulators of gene expression. Therefore, understanding their function and regulation under these opposite situations is essential to fully characterize the mechanisms that control genome integrity and gene expression. Here we review recent findings about these interesting structures that highlight their opposite roles in cellular fitness.
Collapse
Affiliation(s)
| | - Nicholas J Proudfoot
- Sir William Dunn School of Pathology, University of Oxford, Oxford OX1 3RE, United Kingdom
| |
Collapse
|
14
|
Li XQ, Du D. Motif types, motif locations and base composition patterns around the RNA polyadenylation site in microorganisms, plants and animals. BMC Evol Biol 2014; 14:162. [PMID: 25052519 PMCID: PMC4360255 DOI: 10.1186/s12862-014-0162-7] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2014] [Accepted: 07/14/2014] [Indexed: 12/22/2022] Open
Abstract
Background The polyadenylation of RNA is critical for gene functioning, but the conserved sequence motifs (often called signal or signature motifs), motif locations and abundances, and base composition patterns around mRNA polyadenylation [poly(A)] sites are still uncharacterized in most species. The evolutionary tendency for poly(A) site selection is still largely unknown. Results We analyzed the poly(A) site regions of 31 species or phyla. Different groups of species showed different poly(A) signal motifs: UUACUU at the poly(A) site in the parasite Trypanosoma cruzi; UGUAAC (approximately 13 bases upstream of the site) in the alga Chlamydomonas reinhardtii; UGUUUG (or UGUUUGUU) at mainly the fourth base downstream of the poly(A) site in the parasite Blastocystis hominis; and AAUAAA at approximately 16 bases and approximately 19 bases upstream of the poly(A) site in animals and plants, respectively. Polyadenylation signal motifs are usually several hundred times more abundant around poly(A) sites than in whole genomes. These predominant motifs usually had very specific locations, whether upstream of, at, or downstream of poly(A) sites, depending on the species or phylum. The poly(A) site was usually an adenosine (A) in all analyzed species except for B. hominis, and there was weak A predominance in C. reinhardtii. Fungi, animals, plants, and the protist Phytophthora infestans shared a general base abundance pattern (or base composition pattern) of “U-rich—A-rich—U-rich—Poly(A) site—U-rich regions”, or U-A-U-A-U for short, with some variation for each kingdom or subkingdom. Conclusion This study identified the poly(A) signal motifs, motif locations, and base composition patterns around mRNA poly(A) sites in protists, fungi, plants, and animals and provided insight into poly(A) site evolution.
Collapse
Affiliation(s)
- Xiu-Qing Li
- Molecular Genetics Laboratory, Potato Research Centre, Agriculture and Agri-Food Canada, 850 Lincoln Road, Fredericton, New Brunswick, E3B 4Z7, Canada.
| | - Donglei Du
- Quantitative Methods Research Group, Faculty of Business Administration, University of New Brunswick, 7 Macaulay Lane, Fredericton, NB, E3B 5A3, Canada.
| |
Collapse
|
15
|
Majerciak V, Ni T, Yang W, Meng B, Zhu J, Zheng ZM. A viral genome landscape of RNA polyadenylation from KSHV latent to lytic infection. PLoS Pathog 2013; 9:e1003749. [PMID: 24244170 PMCID: PMC3828183 DOI: 10.1371/journal.ppat.1003749] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2013] [Accepted: 09/20/2013] [Indexed: 11/30/2022] Open
Abstract
RNA polyadenylation (pA) is one of the major steps in regulation of gene expression at the posttranscriptional level. In this report, a genome landscape of pA sites of viral transcripts in B lymphocytes with Kaposi sarcoma-associated herpesvirus (KSHV) infection was constructed using a modified PA-seq strategy. We identified 67 unique pA sites, of which 55 could be assigned for expression of annotated ∼90 KSHV genes. Among the assigned pA sites, twenty are for expression of individual single genes and the rest for multiple genes (average 2.7 genes per pA site) in cluster-gene loci of the genome. A few novel viral pA sites that could not be assigned to any known KSHV genes are often positioned in the antisense strand to ORF8, ORF21, ORF34, K8 and ORF50, and their associated antisense mRNAs to ORF21, ORF34 and K8 could be verified by 3′RACE. The usage of each mapped pA site correlates to its peak size, the larger (broad and wide) peak size, the more usage and thus, the higher expression of the pA site-associated gene(s). Similar to mammalian transcripts, KSHV RNA polyadenylation employs two major poly(A) signals, AAUAAA and AUUAAA, and is regulated by conservation of cis-elements flanking the mapped pA sites. Moreover, we found two or more alternative pA sites downstream of ORF54, K2 (vIL6), K9 (vIRF1), K10.5 (vIRF3), K11 (vIRF2), K12 (Kaposin A), T1.5, and PAN genes and experimentally validated the alternative polyadenylation for the expression of KSHV ORF54, K11, and T1.5 transcripts. Together, our data provide not only a comprehensive pA site landscape for understanding KSHV genome structure and gene expression, but also the first evidence of alternative polyadenylation as another layer of posttranscriptional regulation in viral gene expression. A genome-wide polyadenylation landscape in the expression of human herpesviruses has not been reported. In this study, we provide the first genome landscape of viral RNA polyadenylation sites in B cells from KSHV latent to lytic infection by using a modified PA-seq protocol and selectively validated by 3′ RACE. We found that KSHV genome contains 67 active pA sites for the expression of its ∼90 genes and a few antisense transcripts. Among the mapped pA sites, a large fraction of them are for the expression of cluster genes and the production of bicistronic or polycistronic transcripts from KSHV genome and only one-third are used for the expression of single genes. We found that the size of individual PA peaks is positively correlated with the usage of corresponding pA site, which is determined by the number of reads within the PA peak from latent to lytic KSHV infection, and the strength of cis-elements surrounding KSHV pA site determines the expression level of viral genes. Lastly, we identified and experimentally validated alternative polyadenylation of KSHV ORF54, T1.5, and K11 during viral lytic infection. To our knowledge, this is the first report on alternative polyadenylation events in KSHV infection.
Collapse
Affiliation(s)
- Vladimir Majerciak
- Tumor Virus RNA Biology Section, Gene Regulation and Chromosome Biology Laboratory, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, Maryland, United States of America
| | - Ting Ni
- DNA Sequencing and Genomics Core, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, Maryland, United States of America
| | - Wenjing Yang
- DNA Sequencing and Genomics Core, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, Maryland, United States of America
| | - Bowen Meng
- Tumor Virus RNA Biology Section, Gene Regulation and Chromosome Biology Laboratory, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, Maryland, United States of America
| | - Jun Zhu
- DNA Sequencing and Genomics Core, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, Maryland, United States of America
- * E-mail: (JZ); (ZMZ)
| | - Zhi-Ming Zheng
- Tumor Virus RNA Biology Section, Gene Regulation and Chromosome Biology Laboratory, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, Maryland, United States of America
- * E-mail: (JZ); (ZMZ)
| |
Collapse
|
16
|
Polyadenylation site-induced decay of upstream transcripts enforces promoter directionality. Nat Struct Mol Biol 2013; 20:923-8. [PMID: 23851456 DOI: 10.1038/nsmb.2640] [Citation(s) in RCA: 205] [Impact Index Per Article: 18.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2013] [Accepted: 06/26/2013] [Indexed: 12/19/2022]
Abstract
Active human promoters produce promoter-upstream transcripts (PROMPTs). Why these RNAs are coupled to decay, whereas their neighboring promoter-downstream mRNAs are not, is unknown. Here high-throughput sequencing demonstrates that PROMPTs generally initiate in the antisense direction closely upstream of the transcription start sites (TSSs) of their associated genes. PROMPT TSSs share features with mRNA-producing TSSs, including stalled RNA polymerase II (RNAPII) and the production of small TSS-associated RNAs. Notably, motif analyses around PROMPT 3' ends reveal polyadenylation (pA)-like signals. Mutagenesis studies demonstrate that PROMPT pA signals are functional but linked to RNA degradation. Moreover, pA signals are under-represented in promoter-downstream versus promoter-upstream regions, thus allowing for more efficient RNAPII progress in the sense direction from gene promoters. We conclude that asymmetric sequence distribution around human gene promoters serves to provide a directional RNA output from an otherwise bidirectional transcription process.
Collapse
|
17
|
Abstract
BACKGROUND Polyadenylation is present in all three domains of life, making it the most conserved post-transcriptional process compared with splicing and 5'-capping. Even though most mammalian poly(A) sites contain a highly conserved hexanucleotide in the upstream region and a far less conserved U/GU-rich sequence in the downstream region, there are many exceptions. Furthermore, poly(A) sites in other species, such as plants and invertebrates, exhibit high deviation from this genomic structure, making the construction of a general poly(A) site recognition model challenging. We surveyed nine poly(A) site prediction methods published between 1999 and 2011. All methods exploit the skewed nucleotide profile across the poly(A) sites, and the highly conserved poly(A) signal as the primary features for recognition. These methods typically use a large number of features, which increases the dimensionality of the models to crippling degrees, and typically are not validated against many kinds of genomes. RESULTS We propose a poly(A) site model that employs minimal features to capture the essence of poly(A) sites, and yet, produces better prediction accuracy across diverse species. Our model consists of three dior-trinucleotide profiles identified through principle component analysis, and the predicted nucleosome occupancy flanking the poly(A) sites. We validated our model using two machine learning methods: logistic regression and linear discriminant analysis. Results show that models achieve 85-92% sensitivity and 85-96% specificity in seven animals and plants. When we applied one model from one species to predict poly(A) sites from other species, the sensitivity scores correlate with phylogenetic distances. CONCLUSIONS A four-feature model geared towards small motifs was sufficient to accurately learn and predict poly(A) sites across eukaryotes.
Collapse
Affiliation(s)
- Eric S Ho
- Department of Molecular Genetics, Microbiology and Immunology, University of Medicine and Dentistry of New Jersey-Robert Wood Johnson Medical School, Piscataway, New Jersey, USA.
| | | | | |
Collapse
|
18
|
Vandenbon A, Kumagai Y, Teraguchi S, Amada KM, Akira S, Standley DM. A Parzen window-based approach for the detection of locally enriched transcription factor binding sites. BMC Bioinformatics 2013; 14:26. [PMID: 23331723 PMCID: PMC3602658 DOI: 10.1186/1471-2105-14-26] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2012] [Accepted: 01/14/2013] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Identification of cis- and trans-acting factors regulating gene expression remains an important problem in biology. Bioinformatics analyses of regulatory regions are hampered by several difficulties. One is that binding sites for regulatory proteins are often not significantly over-represented in the set of DNA sequences of interest, because of high levels of false positive predictions, and because of positional restrictions on functional binding sites with regard to the transcription start site. RESULTS We have developed a novel method for the detection of regulatory motifs based on their local over-representation in sets of regulatory regions. The method makes use of a Parzen window-based approach for scoring local enrichment, and during evaluation of significance it takes into account GC content of sequences. We show that the accuracy of our method compares favourably to that of other methods, and that our method is capable of detecting not only generally over-represented regulatory motifs, but also locally over-represented motifs that are often missed by standard motif detection approaches. Using a number of examples we illustrate the validity of our approach and suggest applications, such as the analysis of weaker binding sites. CONCLUSIONS Our approach can be used to suggest testable hypotheses for wet-lab experiments. It has potential for future analyses, such as the prediction of weaker binding sites. An online application of our approach, called LocaMo Finder (Local Motif Finder), is available at http://sysimm.ifrec.osaka-u.ac.jp/tfbs/locamo/.
Collapse
Affiliation(s)
- Alexis Vandenbon
- Laboratory of Systems Immunology, Immunology Frontier Research Center, Osaka University, Osaka, Japan.
| | | | | | | | | | | |
Collapse
|
19
|
Huang KY, Shin JW, Huang PJ, Ku FM, Lin WC, Lin R, Hsu WM, Tang P. Functional profiling of the Tritrichomonas foetus transcriptome and proteome. Mol Biochem Parasitol 2013; 187:60-71. [DOI: 10.1016/j.molbiopara.2012.12.001] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2012] [Revised: 11/30/2012] [Accepted: 12/02/2012] [Indexed: 01/09/2023]
|
20
|
The τCstF-64 polyadenylation protein controls genome expression in testis. PLoS One 2012; 7:e48373. [PMID: 23110235 PMCID: PMC3482194 DOI: 10.1371/journal.pone.0048373] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2012] [Accepted: 09/24/2012] [Indexed: 12/21/2022] Open
Abstract
The τCstF-64 polyadenylation protein (gene symbol Cstf2t) is a testis-expressed orthologue of CstF-64. Mice in which Cstf2t was knocked out had a phenotype that was only detected in meiotic and postmeiotic male germ cells, giving us the opportunity to examine CstF-64 function in an isolated developmental system. We performed massively parallel clonally amplified sequencing of cDNAs from testes of wild type and Cstf2t−/− mice. These results revealed that loss of τCstF-64 resulted in large-scale changes in patterns of genome expression. We determined that there was a significant overrepresentation of RNAs from introns and intergenic regions in testes of Cstf2t−/− mice, and a concomitant use of more distal polyadenylation sites. We observed this effect particularly in intronless small genes, many of which are expressed retroposons that likely co-evolved with τCstF-64. Finally, we observed overexpression of long interspersed nuclear element (LINE) sequences in Cstf2t−/− testes. These results suggest that τCstF-64 plays a role in 3′ end determination and transcription termination for a large range of germ cell-expressed genes.
Collapse
|
21
|
Grosso AR, de Almeida SF, Braga J, Carmo-Fonseca M. Dynamic transitions in RNA polymerase II density profiles during transcription termination. Genome Res 2012; 22:1447-56. [PMID: 22684278 PMCID: PMC3409258 DOI: 10.1101/gr.138057.112] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Eukaryotic protein-coding genes are transcribed by RNA polymerase II (RNAPII) through a cycle composed of three main phases: initiation, elongation, and termination. Recent studies using chromatin immunoprecipitation coupled to high-throughput sequencing suggest that the density of RNAPII molecules is higher at the 3′-end relative to the gene body. Here we show that this view is biased due to averaging density profiles for “metagene” analysis. Indeed, the majority of genes exhibit little, if any, detectable accumulation of polymerases during transcription termination. Compared with genes with no enrichment, genes that accumulate RNAPII at the 3′-end are shorter, more frequently contain the canonical polyadenylation [poly(A)] signal AATAAA and G-rich motifs in the downstream sequence element, and have higher levels of expression. In 1% to 4% of actively transcribing genes, the RNAPII enriched at the 3′-end is phosphorylated on Ser5, and we provide evidence suggesting that these genes have their promoter and terminator regions juxtaposed. We also found a striking correlation between RNAPII accumulation and nucleosome organization, suggesting that the presence of nucleosomes after the poly(A) site induces pausing of polymerases, leading to their accumulation. Yet we further observe that nucleosome occupancy at the 3′-end of genes is dynamic and correlates with RNAPII density. Taken together, our results provide novel insight to transcription termination, a fundamental process that remains one of the least understood stages of the transcription cycle.
Collapse
Affiliation(s)
- Ana Rita Grosso
- Instituto de Medicina Molecular, Faculdade de Medicina, Universidade de Lisboa, 1649-028 Lisboa, Portugal
| | | | | | | |
Collapse
|
22
|
Clustering of DNA words and biological function: A proof of principle. J Theor Biol 2012; 297:127-36. [DOI: 10.1016/j.jtbi.2011.12.024] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2011] [Revised: 12/20/2011] [Accepted: 12/21/2011] [Indexed: 02/08/2023]
|
23
|
Tian B, Graber JH. Signals for pre-mRNA cleavage and polyadenylation. WILEY INTERDISCIPLINARY REVIEWS-RNA 2011; 3:385-96. [PMID: 22012871 DOI: 10.1002/wrna.116] [Citation(s) in RCA: 159] [Impact Index Per Article: 12.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Abstract
Pre-mRNA cleavage and polyadenylation is an essential step for 3' end formation of almost all protein-coding transcripts in eukaryotes. The reaction, involving cleavage of nascent mRNA followed by addition of a polyadenylate or poly(A) tail, is controlled by cis-acting elements in the pre-mRNA surrounding the cleavage site. Experimental and bioinformatic studies in the past three decades have elucidated conserved and divergent elements across eukaryotes, from yeast to human. Here we review histories and current models of these elements in a broad range of species.
Collapse
Affiliation(s)
- Bin Tian
- UMDNJ-New Jersey Medical School, Newark, NJ, USA.
| | | |
Collapse
|
24
|
Skourti-Stathaki K, Proudfoot NJ, Gromak N. Human senataxin resolves RNA/DNA hybrids formed at transcriptional pause sites to promote Xrn2-dependent termination. Mol Cell 2011; 42:794-805. [PMID: 21700224 PMCID: PMC3145960 DOI: 10.1016/j.molcel.2011.04.026] [Citation(s) in RCA: 563] [Impact Index Per Article: 43.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2010] [Revised: 03/10/2011] [Accepted: 04/18/2011] [Indexed: 11/30/2022]
Abstract
We present a molecular dissection of pause site-dependent transcriptional termination for mammalian RNA polymerase II (Pol II)-transcribed genes. We show that nascent transcripts form RNA/DNA hybrid structures (R-loops) behind elongating Pol II and are especially prevalent over G-rich pause sites positioned downstream of gene poly(A) signals. Senataxin, a helicase protein associated with AOA2/ALS4 neurodegenerative disorders, acts to resolve these R-loop structures and by so doing allows access of the 5′–3′ exonuclease Xrn2 at 3′ cleavage poly(A) sites. This affords 3′ transcript degradation and consequent Pol II termination. In effect, R-loops formed over G-rich pause sites, followed by their resolution by senataxin, are key steps in the termination process.
Collapse
|
25
|
Ho ES, Gunderson SI. Long conserved fragments upstream of Mammalian polyadenylation sites. Genome Biol Evol 2011; 3:654-66. [PMID: 21705472 PMCID: PMC3157836 DOI: 10.1093/gbe/evr053] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/27/2011] [Indexed: 12/02/2022] Open
Abstract
Polyadenylation is a cotranscriptional nuclear RNA processing event involving endonucleolytic cleavage of the nascent, emerging pre-messenger RNA (pre-mRNA) from the RNA polymerase, immediately followed by the polymerization of adenine ribonucleotides, called the poly(A) tail, to the cleaved 3' end of the polyadenylation site (PAS). This apparently simple molecular processing step has been discovered to be connected to transcription and splicing therefore increasing its potential for regulation of gene expression. Here, through a bioinformatic analysis of cis-PAS-regulatory elements in mammals that includes taking advantage of multiple evolutionary time scales, we find unexpected selection pressure much further upstream, up to 200 nt, from the PAS than previously thought. Strikingly, close to 3,000 long (30-500 nt) noncoding conserved fragments (CFs) were discovered in the PAS flanking region of three remotely related mammalian species, human, mouse, and cow. When an even more remote transitional mammal, platypus, was included, still over a thousand CFs were found in the proximity of the PAS. Even though the biological function of these CFs remains unknown, their considerable sizes makes them unlikely to serve as protein recognition sites, which are typically ≤15 nt. By harnessing genome wide DNaseI hypersensitivity data, we have discovered that the presence of CFs correlates with chromatin accessibility. Our study is important in highlighting novel experimental targets, which may provide new understanding about the regulatory aspects of polyadenylation.
Collapse
Affiliation(s)
- Eric S. Ho
- Present address: Department of Molecular Genetics, Microbiology and Immunology, University of Medicine and Dentistry of New Jersey-Robert Wood Johnson Medical School, Piscataway, New Jersey
| | | |
Collapse
|
26
|
Yang Q, Doublié S. Structural biology of poly(A) site definition. WILEY INTERDISCIPLINARY REVIEWS-RNA 2011; 2:732-47. [PMID: 21823232 DOI: 10.1002/wrna.88] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
3' processing is an essential step in the maturation of all messenger RNAs (mRNAs) and is a tightly coupled two-step reaction: endonucleolytic cleavage at the poly(A) site is followed by the addition of a poly(A) tail, except for metazoan histone mRNAs, which are cleaved but not polyadenylated. The recognition of a poly(A) site is coordinated by the sequence elements in the mRNA 3' UTR and associated protein factors. In mammalian cells, three well-studied sequence elements, UGUA, AAUAAA, and GU-rich, are recognized by three multisubunit factors: cleavage factor I(m) (CFI(m) ), cleavage and polyadenylation specificity factor (CPSF), and cleavage stimulation factor (CstF), respectively. In the yeast Saccharomyces cerevisiae, UA repeats and A-rich sequence elements are recognized by Hrp1p and cleavage factor IA. Structural studies of protein-RNA complexes have helped decipher the mechanisms underlying sequence recognition and shed light on the role of protein factors in poly(A) site selection and 3' processing machinery assembly. In this review we focus on the interactions between the mRNA cis-elements and the protein factors (CFI(m) , CPSF, CstF, and homologous factors from yeast and other eukaryotes) that define the poly(A) site. WIREs RNA 2011 2 732-747 DOI: 10.1002/wrna.88 For further resources related to this article, please visit the WIREs website.
Collapse
Affiliation(s)
- Qin Yang
- Department of Microbiology and Molecular Genetics, University of Vermont, Burlington, USA
| | | |
Collapse
|
27
|
|
28
|
Ahmed F, Benedito VA, Zhao PX. Mining Functional Elements in Messenger RNAs: Overview, Challenges, and Perspectives. FRONTIERS IN PLANT SCIENCE 2011; 2:84. [PMID: 22639614 PMCID: PMC3355573 DOI: 10.3389/fpls.2011.00084] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/31/2011] [Accepted: 11/03/2011] [Indexed: 05/03/2023]
Abstract
Eukaryotic messenger RNA (mRNA) contains not only protein-coding regions but also a plethora of functional cis-elements that influence or coordinate a number of regulatory aspects of gene expression, such as mRNA stability, splicing forms, and translation rates. Understanding the rules that apply to each of these element types (e.g., whether the element is defined by primary or higher-order structure) allows for the discovery of novel mechanisms of gene expression as well as the design of transcripts with controlled expression. Bioinformatics plays a major role in creating databases and finding non-evident patterns governing each type of eukaryotic functional element. Much of what we currently know about mRNA regulatory elements in eukaryotes is derived from microorganism and animal systems, with the particularities of plant systems lagging behind. In this review, we provide a general introduction to the most well-known eukaryotic mRNA regulatory motifs (splicing regulatory elements, internal ribosome entry sites, iron-responsive elements, AU-rich elements, zipcodes, and polyadenylation signals) and describe available bioinformatics resources (databases and analysis tools) to analyze eukaryotic transcripts in search of functional elements, focusing on recent trends in bioinformatics methods and tool development. We also discuss future directions in the development of better computational tools based upon current knowledge of these functional elements. Improved computational tools would advance our understanding of the processes underlying gene regulations. We encourage plant bioinformaticians to turn their attention to this subject to help identify novel mechanisms of gene expression regulation using RNA motifs that have potentially evolved or diverged in plant species.
Collapse
Affiliation(s)
- Firoz Ahmed
- Bioinformatics Laboratory, Plant Biology Division, Samuel Roberts Noble FoundationArdmore, OK, USA
| | - Vagner A. Benedito
- Genetics and Developmental Biology, Plant and Soil Sciences Division, West Virginia UniversityMorgantown, WV, USA
| | - Patrick Xuechun Zhao
- Bioinformatics Laboratory, Plant Biology Division, Samuel Roberts Noble FoundationArdmore, OK, USA
- *Correspondence: Patrick Xuechun Zhao, Bioinformatics Laboratory, Plant Biology Division, Samuel Roberts Noble Foundation, 2510 Sam Noble Parkway, Ardmore, OK 73401, USA e-mail:
| |
Collapse
|
29
|
Pancevac C, Goldstone DC, Ramos A, Taylor IA. Structure of the Rna15 RRM-RNA complex reveals the molecular basis of GU specificity in transcriptional 3'-end processing factors. Nucleic Acids Res 2010; 38:3119-32. [PMID: 20097654 PMCID: PMC2875009 DOI: 10.1093/nar/gkq002] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open
Abstract
Rna15 is a core subunit of cleavage factor IA (CFIA), an essential transcriptional 3′-end processing factor from Saccharomyces cerevisiae. CFIA is required for polyA site selection/cleavage targeting RNA sequences that surround polyadenylation sites in the 3′-UTR of RNA polymerase-II transcripts. RNA recognition by CFIA is mediated by an RNA recognition motif (RRM) contained in the Rna15 subunit of the complex. We show here that Rna15 has a strong and unexpected preference for GU containing RNAs and reveal the molecular basis for a base selectivity mechanism that accommodates G or U but discriminates against C and A bases. This mode of base selectivity is rather different to that observed in other RRM-RNA structures and is structurally conserved in CstF64, the mammalian counterpart of Rna15. Our observations provide evidence for a highly conserved mechanism of base recognition amongst the 3′-end processing complexes that interact with the U-rich or U/G-rich elements at 3′-end cleavage/polyadenylation sites.
Collapse
Affiliation(s)
- Christina Pancevac
- Division of Molecular Structure, MRC National Institute for Medical Research, The Ridgeway, Mill Hill, London NW7 1AA, UK
| | | | | | | |
Collapse
|
30
|
Hockert JA, Yeh HJ, MacDonald CC. The hinge domain of the cleavage stimulation factor protein CstF-64 is essential for CstF-77 interaction, nuclear localization, and polyadenylation. J Biol Chem 2009; 285:695-704. [PMID: 19887456 DOI: 10.1074/jbc.m109.061705] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
Because polyadenylation is essential for cell growth, in vivo examination of polyadenylation protein function has been difficult. Here we describe a new in vivo assay that allows structure-function assays on CstF-64, a protein that binds to pre-mRNAs downstream of the cleavage site for accurate and efficient polyadenylation. In this assay (the stem-loop luciferase assay for polyadenylation, SLAP), expression of a luciferase pre-mRNA with a modified downstream sequence element was made dependent upon co-expression of an MS2-CstF-64 fusion protein. We show here that SLAP accurately reflects CstF-64-dependent polyadenylation, confirming the validity of this assay. Using SLAP, we determined that CstF-64 domains involved in RNA binding, interaction with CstF-77 (the "Hinge" domain), and coupling to transcription are critical for polyadenylation. Further, we showed that the Hinge domain is necessary for CstF-64 interaction with CstF-77 and consequent nuclear localization, suggesting that nuclear import of a preformed CstF complex is an essential step in polyadenylation.
Collapse
Affiliation(s)
- J Andrew Hockert
- Department of Cell Biology and Biochemistry, Texas Tech University Health Sciences Center, Lubbock, Texas 79430-6540, USA
| | | | | |
Collapse
|
31
|
Ho ES, Jakubowski CD, Gunderson SI. iTriplet, a rule-based nucleic acid sequence motif finder. Algorithms Mol Biol 2009; 4:14. [PMID: 19874606 PMCID: PMC2784457 DOI: 10.1186/1748-7188-4-14] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2009] [Accepted: 10/29/2009] [Indexed: 12/29/2022] Open
Abstract
Background With the advent of high throughput sequencing techniques, large amounts of sequencing data are readily available for analysis. Natural biological signals are intrinsically highly variable making their complete identification a computationally challenging problem. Many attempts in using statistical or combinatorial approaches have been made with great success in the past. However, identifying highly degenerate and long (>20 nucleotides) motifs still remains an unmet challenge as high degeneracy will diminish statistical significance of biological signals and increasing motif size will cause combinatorial explosion. In this report, we present a novel rule-based method that is focused on finding degenerate and long motifs. Our proposed method, named iTriplet, avoids costly enumeration present in existing combinatorial methods and is amenable to parallel processing. Results We have conducted a comprehensive assessment on the performance and sensitivity-specificity of iTriplet in analyzing artificial and real biological sequences in various genomic regions. The results show that iTriplet is able to solve challenging cases. Furthermore we have confirmed the utility of iTriplet by showing it accurately predicts polyA-site-related motifs using a dual Luciferase reporter assay. Conclusion iTriplet is a novel rule-based combinatorial or enumerative motif finding method that is able to process highly degenerate and long motifs that have resisted analysis by other methods. In addition, iTriplet is distinguished from other methods of the same family by its parallelizability, which allows it to leverage the power of today's readily available high-performance computing systems.
Collapse
|
32
|
Shankarling GS, Coates PW, Dass B, Macdonald CC. A family of splice variants of CstF-64 expressed in vertebrate nervous systems. BMC Mol Biol 2009; 10:22. [PMID: 19284619 PMCID: PMC2660332 DOI: 10.1186/1471-2199-10-22] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2008] [Accepted: 03/12/2009] [Indexed: 01/23/2023] Open
Abstract
Background Alternative splicing and polyadenylation are important mechanisms for creating the proteomic diversity necessary for the nervous system to fulfill its specialized functions. The contribution of alternative splicing to proteomic diversity in the nervous system has been well documented, whereas the role of alternative polyadenylation in this process is less well understood. Since the CstF-64 polyadenylation protein is known to be an important regulator of tissue-specific polyadenylation, we examined its expression in brain and other organs. Results We discovered several closely related splice variants of CstF-64 – collectively called βCstF-64 – that could potentially contribute to proteomic diversity in the nervous system. The βCstF-64 splice variants are found predominantly in the brains of several vertebrate species including mice and humans. The major βCstF-64 variant mRNA is generated by inclusion of two alternate exons (that we call exons 8.1 and 8.2) found between exons 8 and 9 of the CstF-64 gene, and contains an additional 147 nucleotides, encoding 49 additional amino acids. Some variants of βCstF-64 contain only the first alternate exon (exon 8.1) while other variants contain both alternate exons (8.1 and 8.2). In mice, the predominant form of βCstF-64 also contains a deletion of 78 nucleotides from exon 9, although that variant is not seen in any other species examined, including rats. Immunoblot and 2D-PAGE analyses of mouse nuclear extracts indicate that a protein corresponding to βCstF-64 is expressed in brain at approximately equal levels to CstF-64. Since βCstF-64 splice variant family members were found in the brains of all vertebrate species examined (including turtles and fish), this suggests that βCstF-64 has an evolutionarily conserved function in these animals. βCstF-64 was present in both pre- and post-natal mice and in different regions of the nervous system, suggesting an important role for βCstF-64 in neural gene expression throughout development. Finally, experiments in representative cell lines suggest that βCstF-64 is expressed in neurons but not glia. Conclusion This is the first report of a family of splice variants encoding a key polyadenylation protein that is expressed in a nervous system-specific manner. We propose that βCstF-64 contributes to proteomic diversity by regulating alternative polyadenylation of neural mRNAs.
Collapse
Affiliation(s)
- Ganesh S Shankarling
- Department of Cell Biology and Biochemistry, Texas Tech University Health Sciences Center, Lubbock, Texas 79430-6540, USA.
| | | | | | | |
Collapse
|
33
|
Hutchins LN, Murphy SM, Singh P, Graber JH. Position-dependent motif characterization using non-negative matrix factorization. ACTA ACUST UNITED AC 2008; 24:2684-90. [PMID: 18852176 PMCID: PMC2639279 DOI: 10.1093/bioinformatics/btn526] [Citation(s) in RCA: 70] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Abstract
Motivation:Cis-acting regulatory elements are frequently constrained by both sequence content and positioning relative to a functional site, such as a splice or polyadenylation site. We describe an approach to regulatory motif analysis based on non-negative matrix factorization (NMF). Whereas existing pattern recognition algorithms commonly focus primarily on sequence content, our method simultaneously characterizes both positioning and sequence content of putative motifs. Results: Tests on artificially generated sequences show that NMF can faithfully reproduce both positioning and content of test motifs. We show how the variation of the residual sum of squares can be used to give a robust estimate of the number of motifs or patterns in a sequence set. Our analysis distinguishes multiple motifs with significant overlap in sequence content and/or positioning. Finally, we demonstrate the use of the NMF approach through characterization of biologically interesting datasets. Specifically, an analysis of mRNA 3′-processing (cleavage and polyadenylation) sites from a broad range of higher eukaryotes reveals a conserved core pattern of three elements. Contact:joel.graber@jax.org Supplementary information:Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Lucie N Hutchins
- Center for Genome Dynamics, The Jackson Laboratory, Bar Harbor, ME 04609, USA
| | | | | | | |
Collapse
|
34
|
Lee JY, Ji Z, Tian B. Phylogenetic analysis of mRNA polyadenylation sites reveals a role of transposable elements in evolution of the 3'-end of genes. Nucleic Acids Res 2008; 36:5581-90. [PMID: 18757892 PMCID: PMC2553571 DOI: 10.1093/nar/gkn540] [Citation(s) in RCA: 90] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
mRNA polyadenylation is an essential step for the maturation of almost all eukaryotic mRNAs, and is tightly coupled with termination of transcription in defining the 3′-end of genes. Large numbers of human and mouse genes harbor alternative polyadenylation sites [poly(A) sites] that lead to mRNA variants containing different 3′-untranslated regions (UTRs) and/or encoding distinct protein sequences. Here, we examined the conservation and divergence of different types of alternative poly(A) sites across human, mouse, rat and chicken. We found that the 3′-most poly(A) sites tend to be more conserved than upstream ones, whereas poly(A) sites located upstream of the 3′-most exon, also termed intronic poly(A) sites, tend to be much less conserved. Genes with longer evolutionary history are more likely to have alternative polyadenylation, suggesting gain of poly(A) sites through evolution. We also found that nonconserved poly(A) sites are associated with transposable elements (TEs) to a much greater extent than conserved ones, albeit less frequently utilized. Different classes of TEs have different characteristics in their association with poly(A) sites via exaptation of TE sequences into polyadenylation elements. Our results establish a conservation pattern for alternative poly(A) sites in several vertebrate species, and indicate that the 3′-end of genes can be dynamically modified by TEs through evolution.
Collapse
Affiliation(s)
- Ju Youn Lee
- Graduate School of Biomedical Sciences and Department of Biochemistry and Molecular Biology, New Jersey Medical School, University of Medicine and Dentistry of New Jersey, Newark, NJ 07103, USA
| | | | | |
Collapse
|
35
|
Tian B. Alternative Polyadenylation in the Human Genome: Evolution. ELS 2008. [DOI: 10.1002/9780470015902.a0020768] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/01/2023]
|
36
|
Shen Y, Ji G, Haas BJ, Wu X, Zheng J, Reese GJ, Li QQ. Genome level analysis of rice mRNA 3'-end processing signals and alternative polyadenylation. Nucleic Acids Res 2008; 36:3150-61. [PMID: 18411206 PMCID: PMC2396415 DOI: 10.1093/nar/gkn158] [Citation(s) in RCA: 116] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2007] [Revised: 03/18/2008] [Accepted: 03/19/2008] [Indexed: 12/24/2022] Open
Abstract
The position of a poly(A) site of eukaryotic mRNA is determined by sequence signals in pre-mRNA and a group of polyadenylation factors. To reveal rice poly(A) signals at a genome level, we constructed a dataset of 55 742 authenticated poly(A) sites and characterized the poly(A) signals. This resulted in identifying the typical tripartite cis-elements, including FUE, NUE and CE, as previously observed in Arabidopsis. The average size of the 3'-UTR was 289 nucleotides. When mapped to the genome, however, 15% of these poly(A) sites were found to be located in the currently annotated intergenic regions. Moreover, an extensive alternative polyadenylation profile was evident where 50% of the genes analyzed had more than one unique poly(A) site (excluding microheterogeneity sites), and 13% had four or more poly(A) sites. About 4% of the analyzed genes possessed alternative poly(A) sites at their introns, 5'-UTRs, or protein coding regions. The authenticity of these alternative poly(A) sites was partially confirmed using MPSS data. Analysis of nucleotide profile and signal patterns indicated that there may be a different set of poly(A) signals for those poly(A) sites found in the coding regions. Based on the features of rice poly(A) signals, an updated algorithm termed PASS-Rice was designed to predict poly(A) sites.
Collapse
Affiliation(s)
- Yingjia Shen
- Department of Botany, Miami University, Oxford, OH 45056, USA, Department of Automation, Xiamen University, Xiamen, Fujian, China 361005, The Genome Research Institute, Rockville, MD 20850 and IT Research Computing Support Group, Miami University, Oxford, OH 45056, USA
| | - Guoli Ji
- Department of Botany, Miami University, Oxford, OH 45056, USA, Department of Automation, Xiamen University, Xiamen, Fujian, China 361005, The Genome Research Institute, Rockville, MD 20850 and IT Research Computing Support Group, Miami University, Oxford, OH 45056, USA
| | - Brian J. Haas
- Department of Botany, Miami University, Oxford, OH 45056, USA, Department of Automation, Xiamen University, Xiamen, Fujian, China 361005, The Genome Research Institute, Rockville, MD 20850 and IT Research Computing Support Group, Miami University, Oxford, OH 45056, USA
| | - Xiaohui Wu
- Department of Botany, Miami University, Oxford, OH 45056, USA, Department of Automation, Xiamen University, Xiamen, Fujian, China 361005, The Genome Research Institute, Rockville, MD 20850 and IT Research Computing Support Group, Miami University, Oxford, OH 45056, USA
| | - Jianti Zheng
- Department of Botany, Miami University, Oxford, OH 45056, USA, Department of Automation, Xiamen University, Xiamen, Fujian, China 361005, The Genome Research Institute, Rockville, MD 20850 and IT Research Computing Support Group, Miami University, Oxford, OH 45056, USA
| | - Greg J. Reese
- Department of Botany, Miami University, Oxford, OH 45056, USA, Department of Automation, Xiamen University, Xiamen, Fujian, China 361005, The Genome Research Institute, Rockville, MD 20850 and IT Research Computing Support Group, Miami University, Oxford, OH 45056, USA
| | - Qingshun Quinn Li
- Department of Botany, Miami University, Oxford, OH 45056, USA, Department of Automation, Xiamen University, Xiamen, Fujian, China 361005, The Genome Research Institute, Rockville, MD 20850 and IT Research Computing Support Group, Miami University, Oxford, OH 45056, USA
| |
Collapse
|
37
|
Sartini BL, Wang H, Wang W, Millette CF, Kilpatrick DL. Pre-Messenger RNA Cleavage Factor I (CFIm): Potential Role in Alternative Polyadenylation During Spermatogenesis1. Biol Reprod 2008; 78:472-82. [DOI: 10.1095/biolreprod.107.064774] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open
|
38
|
Feng Z, Wu CF, Zhou X, Kuang J. Alternative polyadenylation produces two major transcripts of Alix. Arch Biochem Biophys 2007; 465:328-35. [PMID: 17673164 PMCID: PMC4104816 DOI: 10.1016/j.abb.2007.06.025] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2007] [Revised: 06/21/2007] [Accepted: 06/21/2007] [Indexed: 01/23/2023]
Abstract
The mammalian adaptor protein Alix participates in multiple cellular processes. Since mouse Alix cDNA detects two distinct transcripts of approximately 3.5 and approximately 7.0 kb in various mouse tissues, it is possible that there exist isoforms of Alix protein that perform varied biological functions. In this study, we first demonstrate that four different anti-Alix monoclonal antibodies immunoblot the single Alix protein in nine different mouse tissues. We then show that the two transcripts of 3.2 and 6.4 kb are widely expressed in various human tissues and cell lines. These two transcripts are generated from the same Alix gene localizing at 3p22.3 via alternative polyadenylation, thus containing an identical open reading frame. However, the 3.2-kb transcript is much more active in translation than the 6.4-kb transcript in a randomly selected cell line. These results eliminate the possibility that the two transcript variants encode different isoforms of Alix protein and suggest that alternative polyadenylation is one of the mechanisms controlling Alix protein expression.
Collapse
Affiliation(s)
| | | | - Xi Zhou
- Departments of Experimental Therapeutics, The University of Texas M. D. Anderson Cancer Center, Houston, TX 77030
| | - Jian Kuang
- Departments of Experimental Therapeutics, The University of Texas M. D. Anderson Cancer Center, Houston, TX 77030
| |
Collapse
|
39
|
Graber JH, Salisbury J, Hutchins LN, Blumenthal T. C. elegans sequences that control trans-splicing and operon pre-mRNA processing. RNA (NEW YORK, N.Y.) 2007; 13:1409-26. [PMID: 17630324 PMCID: PMC1950753 DOI: 10.1261/rna.596707] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/12/2007] [Accepted: 05/17/2007] [Indexed: 05/04/2023]
Abstract
Many mRNAs in Caenorhabditis elegans are generated through a trans-splicing reaction that adds one of two classes of spliced leader RNA to an independently transcribed pre-mRNA. SL1 leaders are spliced mostly to pre-mRNAs from genes with outrons, intron-like sequences at the 5'-ends of the pre-mRNAs. In contrast, SL2 leaders are nearly exclusively trans-spliced to genes that occur downstream in polycistronic pre-mRNAs produced from operons. Operon pre-mRNA processing requires separation into individual transcripts, which is accomplished by 3'-processing of upstream genes and spliced leader trans-splicing to the downstream genes. We used a novel computational analysis, based on nonnegative matrix factorization, to identify and characterize significant differences in the cis-acting sequence elements that differentiate various types of functional site, including internal versus terminal 3'-processing sites, and SL1 versus SL2 trans-splicing sites. We describe several key elements, including the U-rich (Ur) element that couples 3'-processing with SL2 trans-splicing, and a novel outron (Ou) element that occurs upstream of SL1 trans-splicing sites. Finally, we present models of the distinct classes of trans-splicing reaction, including SL1 trans-splicing at the outron, SL2 trans-splicing in standard operons, competitive SL1-SL2 trans-splicing in operons with large intergenic separation, and SL1 trans-splicing in SL1-type operons, which have no intergenic separation.
Collapse
|
40
|
MacMorris M, Kumar M, Lasda E, Larsen A, Kraemer B, Blumenthal T. A novel family of C. elegans snRNPs contains proteins associated with trans-splicing. RNA (NEW YORK, N.Y.) 2007; 13:511-20. [PMID: 17283210 PMCID: PMC1831854 DOI: 10.1261/rna.426707] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/05/2006] [Accepted: 01/02/2007] [Indexed: 05/13/2023]
Abstract
In many Caenorhabditis elegans pre-mRNAs, the RNA sequence between the 5' cap and the first 3' splice site is replaced by trans-splicing a short spliced leader (SL) from the Sm snRNP, SL1. C. elegans also utilizes a similar Sm snRNP, SL2, to trans-splice at sites between genes in polycistronic pre-mRNAs from operons. How do SL1 and SL2 snRNPs function in different contexts? Here we show that the SL1 snRNP contains a complex of SL75p and SL21p, which are homologs of novel proteins previously reported in the Ascaris SL snRNP. Interestingly, we show that the SL2 snRNP does not contain these proteins. However, SL75p and SL26p, a paralog of SL21p, are components of another Sm snRNP that contains a novel snRNA species, Sm Y. Knockdown of SL75p is lethal. However, knockdown of either SL21p or SL26p alone leads to cold-sensitive sterility, whereas knockdown of both SL21p and SL26p is lethal. This suggests that these two proteins have overlapping functions even though they are associated with different classes of snRNP. These phenotypic relationships, along with the association of SL26p with SL75p, imply that, like the SL1 RNA/Sm/SL75p/SL21p complex, the Sm Y/Sm/SL75p/SL26p complex is associated with trans-splicing.
Collapse
MESH Headings
- Alternative Splicing
- Amino Acid Sequence
- Animals
- Caenorhabditis elegans/genetics
- Genes, Helminth
- Molecular Sequence Data
- Operon
- RNA Splicing
- RNA, Helminth/chemistry
- RNA, Helminth/genetics
- RNA, Helminth/metabolism
- RNA, Spliced Leader/metabolism
- RNA-Binding Proteins/chemistry
- RNA-Binding Proteins/metabolism
- Ribonucleoproteins, Small Nuclear/chemistry
- Ribonucleoproteins, Small Nuclear/classification
- Ribonucleoproteins, Small Nuclear/genetics
- Ribonucleoproteins, Small Nuclear/metabolism
- Sequence Homology, Amino Acid
- Trans-Splicing
Collapse
Affiliation(s)
- Margaret MacMorris
- Department of Biochemistry and Molecular Genetics, University of Colorado School of Medicine, Aurora, CO 80045, USA
| | | | | | | | | | | |
Collapse
|
41
|
Liu D, Brockman JM, Dass B, Hutchins LN, Singh P, McCarrey JR, MacDonald CC, Graber JH. Systematic variation in mRNA 3'-processing signals during mouse spermatogenesis. Nucleic Acids Res 2006; 35:234-46. [PMID: 17158511 PMCID: PMC1802579 DOI: 10.1093/nar/gkl919] [Citation(s) in RCA: 103] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023] Open
Abstract
Gene expression and processing during mouse male germ cell maturation (spermatogenesis) is highly specialized. Previous reports have suggested that there is a high incidence of alternative 3′-processing in male germ cell mRNAs, including reduced usage of the canonical polyadenylation signal, AAUAAA. We used EST libraries generated from mouse testicular cells to identify 3′-processing sites used at various stages of spermatogenesis (spermatogonia, spermatocytes and round spermatids) and testicular somatic Sertoli cells. We assessed differences in 3′-processing characteristics in the testicular samples, compared to control sets of widely used 3′-processing sites. Using a new method for comparison of degenerate regulatory elements between sequence samples, we identified significant changes in the use of putative 3′-processing regulatory sequence elements in all spermatogenic cell types. In addition, we observed a trend towards truncated 3′-untranslated regions (3′-UTRs), with the most significant differences apparent in round spermatids. In contrast, Sertoli cells displayed a much smaller trend towards 3′-UTR truncation and no significant difference in 3′-processing regulatory sequences. Finally, we identified a number of genes encoding mRNAs that were specifically subject to alternative 3′-processing during meiosis and postmeiotic development. Our results highlight developmental differences in polyadenylation site choice and in the elements that likely control them during spermatogenesis.
Collapse
Affiliation(s)
- Donglin Liu
- The Jackson Laboratory, 600 Main StreetBar Harbor, ME 04609, USA
| | - J. Michael Brockman
- The Jackson Laboratory, 600 Main StreetBar Harbor, ME 04609, USA
- Bioinformatics Program, Boston University24 Cummington Street, Boston, MA 02215, USA
| | - Brinda Dass
- Department of Cell Biology and Biochemistry, Texas Tech University Health Sciences CenterLubbock, TX 79430, USA
| | | | - Priyam Singh
- The Jackson Laboratory, 600 Main StreetBar Harbor, ME 04609, USA
- Bioinformatics Program, Boston University24 Cummington Street, Boston, MA 02215, USA
| | - John R. McCarrey
- Department of Biology, University of Texas at San AntonioSan Antonio, TX 78249, USA
| | - Clinton C. MacDonald
- Department of Cell Biology and Biochemistry, Texas Tech University Health Sciences CenterLubbock, TX 79430, USA
| | - Joel H. Graber
- The Jackson Laboratory, 600 Main StreetBar Harbor, ME 04609, USA
- Bioinformatics Program, Boston University24 Cummington Street, Boston, MA 02215, USA
- To whom correspondence should be addressed. Tel: +1 207 288 6847; Fax: +1 207 288 6073;
| |
Collapse
|
42
|
Abstract
Pcf11 and Clp1 are subunits of cleavage factor IA (CFIA), an essential polyadenylation factor in Saccahromyces cerevisiae. We have determined the structure of a ternary complex of Clp1 together with ATP and the Clp1-binding region of Pcf11. Clp1 contains three domains, a small N-terminal β sandwich domain, a C-terminal domain containing a novel α/β-fold and a central domain that binds ATP. The arrangement of the nucleotide binding site is similar to that observed in SIMIBI-class ATPase subunits found in other multisubunit macromolecular complexes. However, despite this similarity, nucleotide hydrolysis does not occur. The Pcf11 binding site is also located in the central domain where three highly conserved residues in Pcf11 mediate many of the protein–protein interactions. We propose that this conserved Clp1–Pcf11 interaction is responsible for maintaining a tight coupling between the Clp1 nucleotide binding subunit and the other components of the polyadenylation machinery. Moreover, we suggest that this complex represents a stabilized ATP bound form of Clp1 that requires the participation of other non-CFIA processing factors in order to initiate timely ATP hydrolysis during 3′ end processing.
Collapse
Affiliation(s)
| | | | - Ian A. Taylor
- To whom correspondence should be addressed. Tel: ++44 020 88162552; Fax: ++44 020 88162580;
| |
Collapse
|