1
|
Shen A, Hencel K, Parker M, Scott R, Skukan R, Adesina A, Metheringham C, Miska E, Nam Y, Haerty W, Simpson G, Akay A. U6 snRNA m6A modification is required for accurate and efficient splicing of C. elegans and human pre-mRNAs. Nucleic Acids Res 2024; 52:9139-9160. [PMID: 38808663 PMCID: PMC11347140 DOI: 10.1093/nar/gkae447] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Revised: 05/08/2024] [Accepted: 05/28/2024] [Indexed: 05/30/2024] Open
Abstract
pre-mRNA splicing is a critical feature of eukaryotic gene expression. Both cis- and trans-splicing rely on accurately recognising splice site sequences by spliceosomal U snRNAs and associated proteins. Spliceosomal snRNAs carry multiple RNA modifications with the potential to affect different stages of pre-mRNA splicing. Here, we show that the conserved U6 snRNA m6A methyltransferase METT-10 is required for accurate and efficient cis- and trans-splicing of C. elegans pre-mRNAs. The absence of METT-10 in C. elegans and METTL16 in humans primarily leads to alternative splicing at 5' splice sites with an adenosine at +4 position. In addition, METT-10 is required for splicing of weak 3' cis- and trans-splice sites. We identified a significant overlap between METT-10 and the conserved splicing factor SNRNP27K in regulating 5' splice sites with +4A. Finally, we show that editing endogenous 5' splice site +4A positions to +4U restores splicing to wild-type positions in a mett-10 mutant background, supporting a direct role for U6 snRNA m6A modification in 5' splice site recognition. We conclude that the U6 snRNA m6A modification is important for accurate and efficient pre-mRNA splicing.
Collapse
Affiliation(s)
- Aykut Shen
- School of Biological Sciences, University of East Anglia, NR4 7TJ Norwich, UK
| | - Katarzyna Hencel
- School of Biological Sciences, University of East Anglia, NR4 7TJ Norwich, UK
| | - Matthew T Parker
- School of Life Sciences, University of Dundee, Dow Street, Dundee DD1 5EH, UK
| | - Robyn Scott
- Department of Biochemistry, University of Texas Southwestern Medical Center, Dallas, TX, USA
| | - Roberta Skukan
- School of Biological Sciences, University of East Anglia, NR4 7TJ Norwich, UK
| | | | | | - Eric A Miska
- Wellcome/CRUK Gurdon Institute, University of Cambridge, Tennis Court Rd, Cambridge CB2 1QN, UK
| | - Yunsun Nam
- Department of Biochemistry, University of Texas Southwestern Medical Center, Dallas, TX, USA
- Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, TX, USA
- Simmons Comprehensive Cancer Center, University of Texas Southwestern Medical Center, Dallas, TX, USA
| | - Wilfried Haerty
- School of Biological Sciences, University of East Anglia, NR4 7TJ Norwich, UK
- Earlham Institute, Norwich Research Park, Norwich, UK
| | - Gordon G Simpson
- School of Life Sciences, University of Dundee, Dow Street, Dundee DD1 5EH, UK
- Cell & Molecular Sciences, James Hutton Institute, Invergowrie, DD2 5DA, UK
| | - Alper Akay
- School of Biological Sciences, University of East Anglia, NR4 7TJ Norwich, UK
| |
Collapse
|
2
|
Lugano DI, Barrett LN, Chaput D, Park MA, Westerheide SD. CCAR-1 works together with the U2AF large subunit UAF-1 to regulate alternative splicing. RNA Biol 2024; 21:1-11. [PMID: 38126797 PMCID: PMC10761121 DOI: 10.1080/15476286.2023.2289707] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/17/2023] [Indexed: 12/23/2023] Open
Abstract
The Cell Division Cycle and Apoptosis Regulator (CCAR) protein family members have recently emerged as regulators of alternative splicing and transcription, as well as having other key physiological functions. For example, mammalian CCAR2/DBC1 forms a complex with the zinc factor protein ZNF326 to integrate alternative splicing with RNA polymerase II transcriptional elongation in AT-rich regions of the DNA. Additionally, Caenorhabditis elegans CCAR-1, a homolog to mammalian CCAR2, facilitates the alternative splicing of the perlecan unc-52 gene. However, much about the CCAR family's role in alternative splicing is unknown. Here, we have examined the role of CCAR-1 in genome-wide alternative splicing in Caenorhabditis elegans and have identified new alternative splicing targets of CCAR-1 using RNA sequencing. Also, we found that CCAR-1 interacts with the spliceosome factors UAF-1 and UAF-2 using mass spectrometry, and that knockdown of ccar-1 affects alternative splicing patterns, motility, and proteostasis of UAF-1 mutant worms. Collectively, we demonstrate the role of CCAR-1 in regulating global alternative splicing in C. elegans and in conjunction with UAF-1.
Collapse
Affiliation(s)
- Doreen I. Lugano
- Department of Cell Biology, Microbiology, and Molecular Biology, University of South Florida, Tampa, FL, USA
| | - Lindsey N. Barrett
- Department of Cell Biology, Microbiology, and Molecular Biology, University of South Florida, Tampa, FL, USA
| | - Dale Chaput
- Department of Cell Biology, Microbiology, and Molecular Biology, University of South Florida, Tampa, FL, USA
| | - Margaret A. Park
- Department of Cell Biology, Microbiology, and Molecular Biology, University of South Florida, Tampa, FL, USA
| | - Sandy D. Westerheide
- Department of Cell Biology, Microbiology, and Molecular Biology, University of South Florida, Tampa, FL, USA
| |
Collapse
|
3
|
Shen A, Hencel K, Parker MT, Scott R, Skukan R, Adesina AS, Metheringham CL, Miska EA, Nam Y, Haerty W, Simpson GG, Akay A. U6 snRNA m6A modification is required for accurate and efficient cis- and trans-splicing of C. elegans mRNAs. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.09.16.558044. [PMID: 37745402 PMCID: PMC10516052 DOI: 10.1101/2023.09.16.558044] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/26/2023]
Abstract
pre-mRNA splicing is a critical feature of eukaryotic gene expression. Many eukaryotes use cis-splicing to remove intronic sequences from pre-mRNAs. In addition to cis-splicing, many organisms use trans-splicing to replace the 5' ends of mRNAs with a non-coding spliced-leader RNA. Both cis- and trans-splicing rely on accurately recognising splice site sequences by spliceosomal U snRNAs and associated proteins. Spliceosomal snRNAs carry multiple RNA modifications with the potential to affect different stages of pre-mRNA splicing. Here, we show that m6A modification of U6 snRNA A43 by the RNA methyltransferase METT-10 is required for accurate and efficient cis- and trans-splicing of C. elegans pre-mRNAs. The absence of U6 snRNA m6A modification primarily leads to alternative splicing at 5' splice sites. Furthermore, weaker 5' splice site recognition by the unmodified U6 snRNA A43 affects splicing at 3' splice sites. U6 snRNA m6A43 and the splicing factor SNRNP27K function to recognise an overlapping set of 5' splice sites with an adenosine at +4 position. Finally, we show that U6 snRNA m6A43 is required for efficient SL trans-splicing at weak 3' trans-splice sites. We conclude that the U6 snRNA m6A modification is important for accurate and efficient cis- and trans-splicing in C. elegans.
Collapse
Affiliation(s)
- Aykut Shen
- School of Biological Sciences, University of East Anglia, NR4 7TJ, Norwich
| | - Katarzyna Hencel
- School of Biological Sciences, University of East Anglia, NR4 7TJ, Norwich
- These authors contributed equally
| | - Matthew T Parker
- School of Life Sciences, University of Dundee, Dow Street, Dundee, DD1 5EH, UK
- These authors contributed equally
| | - Robyn Scott
- Department of Biochemistry, University of Texas Southwestern Medical Center, Dallas, TX, USA
| | - Roberta Skukan
- School of Biological Sciences, University of East Anglia, NR4 7TJ, Norwich
| | | | | | - Eric A Miska
- Wellcome/CRUK Gurdon Institute, University of Cambridge, Tennis Court Rd, Cambridge, CB2 1QN, UK
| | - Yunsun Nam
- Department of Biochemistry, University of Texas Southwestern Medical Center, Dallas, TX, USA
- Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, TX, USA
- Simmons Comprehensive Cancer Center, University of Texas Southwestern Medical Center, Dallas, TX, USA
| | - Wilfried Haerty
- School of Biological Sciences, University of East Anglia, NR4 7TJ, Norwich
- Earlham Institute, Norwich Research Park, Norwich, UK
| | - Gordon G Simpson
- School of Life Sciences, University of Dundee, Dow Street, Dundee, DD1 5EH, UK
- Cell & Molecular Sciences, James Hutton Institute, Invergowrie, DD2 5DA, UK
| | - Alper Akay
- School of Biological Sciences, University of East Anglia, NR4 7TJ, Norwich
| |
Collapse
|
4
|
Young AF, Schmidt HF, Sundaram MV. Molecular lesions in alleles of the Caenorhabditis elegans lin-11 gene. MICROPUBLICATION BIOLOGY 2022; 2022:10.17912/micropub.biology.000589. [PMID: 35800971 PMCID: PMC9253893 DOI: 10.17912/micropub.biology.000589] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/03/2022] [Revised: 06/12/2022] [Accepted: 06/15/2022] [Indexed: 11/06/2022]
Abstract
The LIM homeodomain transcription factor LIN-11 is a key regulator of vulva, uterine, and neuron development in C. elegans. Multiple alleles of lin-11 are available, but none had been sequenced. We found that the reference allele, n389, is a 15900 bp deletion that also affects two other protein-coding genes, ZC247.1 and ZC247.2. The frequently used n566 allele is a 288bp deletion located in an intron and affecting the splice acceptor site.
Collapse
Affiliation(s)
- Adrie F. Young
- Department of Genetics, University of Pennsylvania Perelman School of Medicine
| | - Helen F. Schmidt
- Department of Genetics, University of Pennsylvania Perelman School of Medicine
| | - Meera V. Sundaram
- Department of Genetics, University of Pennsylvania Perelman School of Medicine
,
Correspondence to: Meera V. Sundaram (
)
| |
Collapse
|
5
|
Watabe E, Togo-Ohno M, Ishigami Y, Wani S, Hirota K, Kimura-Asami M, Hasan S, Takei S, Fukamizu A, Suzuki Y, Suzuki T, Kuroyanagi H. m 6 A-mediated alternative splicing coupled with nonsense-mediated mRNA decay regulates SAM synthetase homeostasis. EMBO J 2021; 40:e106434. [PMID: 34152017 DOI: 10.15252/embj.2020106434] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2020] [Revised: 04/29/2021] [Accepted: 05/07/2021] [Indexed: 12/24/2022] Open
Abstract
Alternative splicing of pre-mRNAs can regulate gene expression levels by coupling with nonsense-mediated mRNA decay (NMD). In order to elucidate a repertoire of mRNAs regulated by alternative splicing coupled with NMD (AS-NMD) in an organism, we performed long-read RNA sequencing of poly(A)+ RNAs from an NMD-deficient mutant strain of Caenorhabditis elegans, and obtained full-length sequences for mRNA isoforms from 259 high-confidence AS-NMD genes. Among them are the S-adenosyl-L-methionine (SAM) synthetase (sams) genes sams-3 and sams-4. SAM synthetase activity autoregulates sams gene expression through AS-NMD in a negative feedback loop. We furthermore find that METT-10, the orthologue of human U6 snRNA methyltransferase METTL16, is required for the splicing regulation in␣vivo, and specifically methylates the invariant AG dinucleotide at the distal 3' splice site (3'SS) in␣vitro. Direct RNA sequencing coupled with machine learning confirms m6 A modification of endogenous sams mRNAs. Overall, these results indicate that homeostasis of SAM synthetase in C. elegans is maintained by alternative splicing regulation through m6 A modification at the 3'SS of the sams genes.
Collapse
Affiliation(s)
- Eichi Watabe
- Laboratory of Gene Expression, Medical Research Institute, Tokyo Medical and Dental University (TMDU), Bunkyo-ku, Tokyo, Japan
| | - Marina Togo-Ohno
- Laboratory of Gene Expression, Medical Research Institute, Tokyo Medical and Dental University (TMDU), Bunkyo-ku, Tokyo, Japan
| | - Yuma Ishigami
- Department of Chemistry and Biotechnology, Graduate School of Engineering, University of Tokyo, Bunkyo-ku, Tokyo, Japan
| | - Shotaro Wani
- Laboratory of Gene Expression, Medical Research Institute, Tokyo Medical and Dental University (TMDU), Bunkyo-ku, Tokyo, Japan
| | - Keiko Hirota
- Life Science Center for Survival Dynamics, Tsukuba Advanced Research Alliance (TARA), University of Tsukuba, Tsukuba-shi, Ibaraki, Japan
| | - Mariko Kimura-Asami
- Laboratory of Gene Expression, Medical Research Institute, Tokyo Medical and Dental University (TMDU), Bunkyo-ku, Tokyo, Japan
| | - Sharmin Hasan
- Laboratory of Gene Expression, Medical Research Institute, Tokyo Medical and Dental University (TMDU), Bunkyo-ku, Tokyo, Japan
| | - Satomi Takei
- Laboratory of Gene Expression, Medical Research Institute, Tokyo Medical and Dental University (TMDU), Bunkyo-ku, Tokyo, Japan
| | - Akiyoshi Fukamizu
- Life Science Center for Survival Dynamics, Tsukuba Advanced Research Alliance (TARA), University of Tsukuba, Tsukuba-shi, Ibaraki, Japan
| | - Yutaka Suzuki
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, University of Tokyo, Kashiwa-shi, Chiba, Japan
| | - Tsutomu Suzuki
- Department of Chemistry and Biotechnology, Graduate School of Engineering, University of Tokyo, Bunkyo-ku, Tokyo, Japan
| | - Hidehito Kuroyanagi
- Laboratory of Gene Expression, Medical Research Institute, Tokyo Medical and Dental University (TMDU), Bunkyo-ku, Tokyo, Japan.,Department of Biochemistry, Graduate School of Medicine, University of the Ryukyus, Nishihara-cho, Okinawa, Japan
| |
Collapse
|
6
|
Warnasooriya C, Feeney CF, Laird KM, Ermolenko DN, Kielkopf CL. A splice site-sensing conformational switch in U2AF2 is modulated by U2AF1 and its recurrent myelodysplasia-associated mutation. Nucleic Acids Res 2020; 48:5695-5709. [PMID: 32343311 PMCID: PMC7261175 DOI: 10.1093/nar/gkaa293] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2019] [Revised: 04/09/2020] [Accepted: 04/17/2020] [Indexed: 02/02/2023] Open
Abstract
An essential heterodimer of the U2AF1 and U2AF2 pre-mRNA splicing factors nucleates spliceosome assembly at polypyrimidine (Py) signals preceding the major class of 3′ splice sites. U2AF1 frequently acquires an S34F-encoding mutation among patients with myelodysplastic syndromes (MDS). The influence of the U2AF1 subunit and its S34F mutation on the U2AF2 conformations remains unknown. Here, we employ single molecule Förster resonance energy transfer (FRET) to determine the influence of wild-type or S34F-substituted U2AF1 on the conformational dynamics of U2AF2 and its splice site RNA complexes. In the absence of RNA, the U2AF1 subunit stabilizes a high FRET value, which by structure-guided mutagenesis corresponds to a closed conformation of the tandem U2AF2 RNA recognition motifs (RRMs). When the U2AF heterodimer is bound to a strong, uridine-rich splice site, U2AF2 switches to a lower FRET value characteristic of an open, side-by-side arrangement of the RRMs. Remarkably, the U2AF heterodimer binds weak, uridine-poor Py tracts as a mixture of closed and open U2AF2 conformations, which are modulated by the S34F mutation. Shifts between open and closed U2AF2 may underlie U2AF1-dependent splicing of degenerate Py tracts and contribute to a subset of S34F-dysregulated splicing events in MDS patients.
Collapse
Affiliation(s)
- Chandani Warnasooriya
- Department of Biochemistry and Biophysics and Center for RNA Biology, University of Rochester School of Medicine and Dentistry, Rochester, NY 14642, USA
| | - Callen F Feeney
- Department of Biochemistry and Biophysics and Center for RNA Biology, University of Rochester School of Medicine and Dentistry, Rochester, NY 14642, USA
| | - Kholiswa M Laird
- Department of Biochemistry and Biophysics and Center for RNA Biology, University of Rochester School of Medicine and Dentistry, Rochester, NY 14642, USA
| | - Dmitri N Ermolenko
- Department of Biochemistry and Biophysics and Center for RNA Biology, University of Rochester School of Medicine and Dentistry, Rochester, NY 14642, USA
| | - Clara L Kielkopf
- Department of Biochemistry and Biophysics and Center for RNA Biology, University of Rochester School of Medicine and Dentistry, Rochester, NY 14642, USA
| |
Collapse
|
7
|
Arribere JA, Kuroyanagi H, Hundley HA. mRNA Editing, Processing and Quality Control in Caenorhabditis elegans. Genetics 2020; 215:531-568. [PMID: 32632025 PMCID: PMC7337075 DOI: 10.1534/genetics.119.301807] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2019] [Accepted: 05/03/2020] [Indexed: 02/06/2023] Open
Abstract
While DNA serves as the blueprint of life, the distinct functions of each cell are determined by the dynamic expression of genes from the static genome. The amount and specific sequences of RNAs expressed in a given cell involves a number of regulated processes including RNA synthesis (transcription), processing, splicing, modification, polyadenylation, stability, translation, and degradation. As errors during mRNA production can create gene products that are deleterious to the organism, quality control mechanisms exist to survey and remove errors in mRNA expression and processing. Here, we will provide an overview of mRNA processing and quality control mechanisms that occur in Caenorhabditis elegans, with a focus on those that occur on protein-coding genes after transcription initiation. In addition, we will describe the genetic and technical approaches that have allowed studies in C. elegans to reveal important mechanistic insight into these processes.
Collapse
Affiliation(s)
| | - Hidehito Kuroyanagi
- Laboratory of Gene Expression, Medical Research Institute, Tokyo Medical and Dental University, Tokyo 113-8510, Japan, and
| | - Heather A Hundley
- Medical Sciences Program, Indiana University School of Medicine-Bloomington, Indiana 47405
| |
Collapse
|
8
|
Nguyen H, Das U, Xie J. Genome-wide evolution of wobble base-pairing nucleotides of branchpoint motifs with increasing organismal complexity. RNA Biol 2019; 17:311-324. [PMID: 31814500 DOI: 10.1080/15476286.2019.1697548] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022] Open
Abstract
How have the branchpoint motifs evolved in organisms of different complexity? Here we identified and examined the consensus motifs (R1C2T3R4A5Y6, R: A or G, Y: C or T) of 898 fungal genomes. In Ascomycota unicellular yeasts, the G4/A4 ratio is mostly (98%) below 0.125 but increases sharply in multicellular species by about 40 times on average, and in the more complex Basidiomycota, it increases further by about 7 times. The global G4 increase is consistent with A4 to G4 transitions in evolution. Of the G4/A4-interacting amino acids of the branchpoint binding protein MSL5 (SF1) and the HSH155 (SF3B1), as well as the 5' splice sites (SS) and U2 snRNA genes, the 5' SS G3/A3 co-vary with the G4 to some extent. However, corresponding increase of the G4-complementary GCAGTA-U2 gene is rare, suggesting wobble-base pairing between the G4-containing branchpoint motif and GTAGTA-U2 in most of these species. Interestingly, the G4/A4 ratio correlates well with the abundance of alternative splicing in the two phyla, and G4 enriched significantly at the alternative 3' SS of genes in RNA metabolism, kinases and membrane proteins. Similar wobble nucleotides also enriched at the 3' SS of multicellular fungi with only thousands of protein-coding genes. Thus, branchpoint motifs have evolved U2-complementarity in unicellular Ascomycota yeasts, but have gradually gained more wobble base-pairing nucleotides in fungi of higher complexity, likely to destabilize branchpoint motif-U2 interaction and/or branchpoint A protrusion for alternative splicing. This implies an important role of relaxing the branchpoint signals in the multicellularity and further complexity of fungi.
Collapse
Affiliation(s)
- Hai Nguyen
- Department of Physiology & Pathophysiology, Max Rady College of Medicine, Rady Faculty of Health Sciences, University of Manitoba, Winnipeg, Canada.,Department of Applied Computer Sciences, University of Winnipeg, Winnipeg, Canada
| | - Urmi Das
- Department of Physiology & Pathophysiology, Max Rady College of Medicine, Rady Faculty of Health Sciences, University of Manitoba, Winnipeg, Canada
| | - Jiuyong Xie
- Department of Physiology & Pathophysiology, Max Rady College of Medicine, Rady Faculty of Health Sciences, University of Manitoba, Winnipeg, Canada
| |
Collapse
|
9
|
Nguyen H, Xie J. Widespread Separation of the Polypyrimidine Tract From 3' AG by G Tracts in Association With Alternative Exons in Metazoa and Plants. Front Genet 2019; 9:741. [PMID: 30693020 PMCID: PMC6339879 DOI: 10.3389/fgene.2018.00741] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2018] [Accepted: 12/22/2018] [Indexed: 12/23/2022] Open
Abstract
At the end of introns, the polypyrimidine tract (Py) is often close to the 3′ AG in a consensus (Y)20NCAGgt in humans. Interestingly, we have found that they could also be separated by purine-rich elements including G tracts in thousands of human genes. These regulatory elements between the Py and 3′ AG (REPA) mainly regulate alternative 3′ splice sites (3′ SS) and intron retention. Here we show their widespread distribution and special properties across kingdoms. The purine-rich 3′ SS are found in up to about 60% of the introns among more than 1,000 species/lineages by whole genome analysis, and up to 18% of these introns contain the REPA G-tracts (REPAG) in about 0.6 million of 3′ SS in total. In particular, they are significantly enriched over their 3′ SS and genome backgrounds in metazoa and plants, and highly associated with alternative splicing of genes in diverse functional clusters. Cryptic splice sites harboring such G- and the other purine-triplets tend to be enriched (2–9 folds over the disrupted canonical 3′ SS) and aberrantly used in cancer patients carrying mutations of the SF3B1 or U2AF35, factors critical for branch point (BP) or 3′ AG recognition, respectively. Moreover, the REPAGs are significantly associated with reduced occurrences of BP motifs between the −24 and −4 positions, in particular absent between the −7 and −5 positions in several model organisms examined. The more distant BPs are associated with increased occurrences of alternative splicing in humans and zebrafish. The REPAGs appear to have evolved in a species- or phylum-specific way. Thus, there is widespread separation of the Py and 3′ AG by REPAGs that have evolved differentially. This special 3′ SS arrangement likely contributes to the generation of diverse transcript or protein isoforms in biological functions or diseases through alternative or aberrant splicing.
Collapse
Affiliation(s)
- Hai Nguyen
- Department of Physiology and Pathophysiology, Max Rady College of Medicine, Rady Faculty of Health Sciences, University of Manitoba, Winnipeg, MB, Canada.,Department of Applied Computer Sciences, University of Winnipeg, Winnipeg, MB, Canada
| | - Jiuyong Xie
- Department of Physiology and Pathophysiology, Max Rady College of Medicine, Rady Faculty of Health Sciences, University of Manitoba, Winnipeg, MB, Canada
| |
Collapse
|
10
|
Newman MA, Ji F, Fischer SEJ, Anselmo A, Sadreyev RI, Ruvkun G. The surveillance of pre-mRNA splicing is an early step in C. elegans RNAi of endogenous genes. Genes Dev 2018; 32:670-681. [PMID: 29739806 PMCID: PMC6004069 DOI: 10.1101/gad.311514.118] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2018] [Accepted: 04/03/2018] [Indexed: 01/03/2023]
Abstract
Here, Newman et al. provide insight into the role of mRNA splicing in C. elegans RNAi and show that viable null mutations in U1 and U2 snRNP-specific splicing factor genes cause defects in RNAi. They also demonstrate that RNAi targeted transcripts are tightly bound to spliceosomes, which suggests multiple layers of regulation by the spliceosome at early steps of small RNA-mediated gene silencing. RNAi pathways detect and silence foreign nucleic acids such as viruses as well as endogenous genes in many species. The phylogenetic profile across eukaryotes of proteins that mediate key steps in RNAi is correlated with the profiles of multiple mRNA splicing proteins and with intron number, suggesting that RNAi may surveil mRNA splicing to detect the divergent or absent introns of viruses. Here we examine the role of mRNA splicing in Caenorhabditis elegans RNAi. We found that viable null mutations in U1 and U2 small nuclear ribonucleic protein (snRNP)-specific splicing factor genes cause defects in RNAi. The U1A ortholog rnp-2 is required for normal ERGO-1 Argonaute class 26G siRNA biogenesis, trans-splicing of the eri-6/7 transcript, and targeting of poorly conserved gene transcripts by WAGO Argonaute class 22G siRNAs. We found that gene transcripts engaged by the siRNA-generating machinery are poorly conserved, possess few introns, and often have introns that are divergent from introns with strong consensus splicing sites found in highly conserved genes. We present biochemical evidence that RNAi targeted transcripts are tightly bound to spliceosomes. These findings suggest multiple layers of regulation by the spliceosome at early steps of small RNA-mediated gene silencing.
Collapse
Affiliation(s)
- Martin A Newman
- Department of Molecular Biology, Massachusetts General Hospital, Boston, Massachusetts 02114, USA.,Department of Genetics, Harvard Medical School, Boston, Massachusetts 02115, USA
| | - Fei Ji
- Department of Molecular Biology, Massachusetts General Hospital, Boston, Massachusetts 02114, USA.,Department of Genetics, Harvard Medical School, Boston, Massachusetts 02115, USA
| | - Sylvia E J Fischer
- Department of Molecular Biology, Massachusetts General Hospital, Boston, Massachusetts 02114, USA.,Department of Genetics, Harvard Medical School, Boston, Massachusetts 02115, USA
| | - Anthony Anselmo
- Department of Molecular Biology, Massachusetts General Hospital, Boston, Massachusetts 02114, USA.,Department of Genetics, Harvard Medical School, Boston, Massachusetts 02115, USA
| | - Ruslan I Sadreyev
- Department of Molecular Biology, Massachusetts General Hospital, Boston, Massachusetts 02114, USA.,Department of Pathology, Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts 02114, USA
| | - Gary Ruvkun
- Department of Molecular Biology, Massachusetts General Hospital, Boston, Massachusetts 02114, USA.,Department of Genetics, Harvard Medical School, Boston, Massachusetts 02115, USA
| |
Collapse
|
11
|
Nguyen H, Das U, Wang B, Xie J. The matrices and constraints of GT/AG splice sites of more than 1000 species/lineages. Gene 2018; 660:92-101. [PMID: 29588184 DOI: 10.1016/j.gene.2018.03.031] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2017] [Revised: 03/08/2018] [Accepted: 03/12/2018] [Indexed: 12/11/2022]
Abstract
To provide a resource for the splice sites (SS) of different species, we calculated the matrices of nucleotide compositions of about 38 million splice sites from >1000 species/lineages. The matrices are enriched of aGGTAAGT (5'SS) or (Y)6N(C/t)AG(g/a)t (3'SS) overall; however, they are quite diverse among hundreds of species. The diverse matrices remain prominent even under sequence selection pressures, suggesting the existence of diverse constraints as well as U snRNAs and other spliceosomal factors and/or their interactions with the splice sites. Using an algorithm to measure and compare the splice site constraints across all species, we demonstrate their distinct differences quantitatively. As an example of the resource's application to answering specific questions, we confirm that high constraints of particular positions are significantly associated with transcriptome-wide, increased occurrences of alternative splicing when uncommon nucleotides are present. More interestingly, the abundance of alternative splicing in 16 species correlates with the average constraint index of splice sites in a bell curve. This resource will allow users to assess specific sequences/splice sites against the consensus of every Ensembl-annotated species, and to explore the evolutionary changes or relationship to alternative splicing and transcriptome diversity. Web-search or update features are also included.
Collapse
Affiliation(s)
- Hai Nguyen
- Department of Physiology & Pathophysiology, Max Rady College of Medicine, Rady Faculty of Health Sciences, University of Manitoba, Winnipeg, MB R3E 0J9, Canada; University of Winnipeg, Winnipeg, MB R3B 2E9, Canada
| | - Urmi Das
- Department of Physiology & Pathophysiology, Max Rady College of Medicine, Rady Faculty of Health Sciences, University of Manitoba, Winnipeg, MB R3E 0J9, Canada
| | - Benjamin Wang
- Department of Physiology & Pathophysiology, Max Rady College of Medicine, Rady Faculty of Health Sciences, University of Manitoba, Winnipeg, MB R3E 0J9, Canada; University of Illinois Urbana-Champaign, IL, USA
| | - Jiuyong Xie
- Department of Physiology & Pathophysiology, Max Rady College of Medicine, Rady Faculty of Health Sciences, University of Manitoba, Winnipeg, MB R3E 0J9, Canada.
| |
Collapse
|
12
|
N-Ethyl-N-Nitrosourea (ENU) Mutagenesis Reveals an Intronic Residue Critical for Caenorhabditis elegans 3' Splice Site Function in Vivo. G3-GENES GENOMES GENETICS 2016; 6:1751-6. [PMID: 27172199 PMCID: PMC4889670 DOI: 10.1534/g3.116.028662] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Metazoan introns contain a polypyrimidine tract immediately upstream of the AG dinucleotide that defines the 3' splice site. In the nematode Caenorhabditis elegans, 3' splice sites are characterized by a highly conserved UUUUCAG/R octamer motif. While the conservation of pyrimidines in this motif is strongly suggestive of their importance in pre-mRNA splicing, in vivo evidence in support of this is lacking. In an N-ethyl-N-nitrosourea (ENU) mutagenesis screen in Caenorhabditis elegans, we have isolated a strain containing a point mutation in the octamer motif of a 3' splice site in the daf-12 gene. This mutation, a single base T-to-G transversion at the -5 position relative to the splice site, causes a strong daf-12 loss-of-function phenotype by abrogating splicing. The resulting transcript is predicted to encode a truncated DAF-12 protein generated by translation into the retained intron, which contains an in-frame stop codon. Other than the perfectly conserved AG dinucleotide at the site of splicing, G at the -5 position of the octamer motif is the most uncommon base in C. elegans 3' splice sites, occurring at closely paired sites where the better match to the splicing consensus is a few bases downstream. Our results highlight both the biological importance of the highly conserved -5 uridine residue in the C. elegans 3' splice site octamer motif as well as the utility of using ENU as a mutagen to study the function of polypyrimidine tracts and other AU- or AT-rich motifs in vivo.
Collapse
|
13
|
Sohail M, Xie J. Diverse regulation of 3' splice site usage. Cell Mol Life Sci 2015; 72:4771-93. [PMID: 26370726 PMCID: PMC11113787 DOI: 10.1007/s00018-015-2037-5] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2015] [Revised: 08/12/2015] [Accepted: 09/03/2015] [Indexed: 01/13/2023]
Abstract
The regulation of splice site (SS) usage is important for alternative pre-mRNA splicing and thus proper expression of protein isoforms in cells; its disruption causes diseases. In recent years, an increasing number of novel regulatory elements have been found within or nearby the 3'SS in mammalian genes. The diverse elements recruit a repertoire of trans-acting factors or form secondary structures to regulate 3'SS usage, mostly at the early steps of spliceosome assembly. Their mechanisms of action mainly include: (1) competition between the factors for RNA elements, (2) steric hindrance between the factors, (3) direct interaction between the factors, (4) competition between two splice sites, or (5) local RNA secondary structures or longer range loops, according to the mode of protein/RNA interactions. Beyond the 3'SS, chromatin remodeling/transcription, posttranslational modifications of trans-acting factors and upstream signaling provide further layers of regulation. Evolutionarily, some of the 3'SS elements seem to have emerged in mammalian ancestors. Moreover, other possibilities of regulation such as that by non-coding RNA remain to be explored. It is thus likely that there are more diverse elements/factors and mechanisms that influence the choice of an intron end. The diverse regulation likely contributes to a more complex but refined transcriptome and proteome in mammals.
Collapse
Affiliation(s)
- Muhammad Sohail
- Department of Physiology and Pathophysiology, College of Medicine, Faculty of Health Sciences, University of Manitoba, Winnipeg, MB, R3E 0J9, Canada
| | - Jiuyong Xie
- Department of Physiology and Pathophysiology, College of Medicine, Faculty of Health Sciences, University of Manitoba, Winnipeg, MB, R3E 0J9, Canada.
- Department of Biochemistry and Medical Genetics, College of Medicine, Faculty of Health Sciences, University of Manitoba, Winnipeg, MB, R3E 0J9, Canada.
| |
Collapse
|
14
|
Gao X, Teng Y, Luo J, Huang L, Li M, Zhang Z, Ma YC, Ma L. The survival motor neuron gene smn-1 interacts with the U2AF large subunit gene uaf-1 to regulate Caenorhabditis elegans lifespan and motor functions. RNA Biol 2015; 11:1148-60. [PMID: 25483032 DOI: 10.4161/rna.36100] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023] Open
Abstract
Spinal muscular atrophy (SMA), the most frequent human congenital motor neuron degenerative disease, is caused by loss-of-function mutations in the highly conserved survival motor neuron gene SMN1. Mutations in SMN could affect several molecular processes, among which aberrant pre-mRNA splicing caused by defective snRNP biogenesis is hypothesized as a major cause of SMA. To date little is known about the interactions of SMN with other splicing factor genes and how SMN affects splicing in vivo. The nematode Caenorhabditis elegans carries a single ortholog of SMN, smn-1, and has been used as a model for studying the molecular functions of SMN. We analyzed RNA splicing of reporter genes in an smn-1 deletion mutant and found that smn-1 is required for efficient splicing at weak 3' splice sites. Genetic studies indicate that the defective lifespan and motor functions of the smn-1 deletion mutants could be significantly improved by mutations of the splicing factor U2AF large subunit gene uaf-1. In smn-1 mutants we detected a reduced expression of U1 and U5 snRNAs and an increased expression of U2, U4 and U6 snRNAs. Our study verifies an essential role of smn-1 for RNA splicing in vivo, identifies the uaf-1 gene as a potential genetic modifier of smn-1 mutants, and suggests that SMN-1 has multifaceted effects on the expression of spliceosomal snRNAs.
Collapse
Affiliation(s)
- Xiaoyang Gao
- a State Key Laboratory of Medical Genetics; School of Life Sciences ; Central South University ; Changsha , Hunan , China
| | | | | | | | | | | | | | | |
Collapse
|
15
|
Genome-wide identification of zero nucleotide recursive splicing in Drosophila. Nature 2015; 521:376-9. [PMID: 25970244 PMCID: PMC4529404 DOI: 10.1038/nature14475] [Citation(s) in RCA: 138] [Impact Index Per Article: 15.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2014] [Accepted: 04/16/2015] [Indexed: 01/23/2023]
Abstract
Recursive splicing is a process in which large introns are removed in multiple steps by resplicing at ratchet points - 5′ splice sites recreated after splicing1. Recursive splicing was first identified in the Drosophila Ultrabithorax (Ubx) gene1 and only three additional Drosophila genes have since been experimentally shown to undergo recursive splicing2,3. Here, we identify 197 zero nucleotide exon ratchet points in 130 introns of 115 Drosophila genes from total RNA sequencing data generated from developmental time points, dissected tissues, and cultured cells. The sequential nature of recursive splicing was confirmed by identification of lariat introns generated by splicing to and from the ratchet points. We also show that recursive splicing is a constitutive process, that depletion of U2AF inhibits recursive splicing, and that the sequence and function of ratchet points are evolutionarily conserved in Drosophila. Finally, we identified four recursively spliced human genes, one of which is also recursively spliced in Drosophila. Together these results indicate that recursive splicing is commonly used in Drosophila, occurs in human and provides insight into the mechanisms by which some large introns are removed.
Collapse
|
16
|
Ragle JM, Katzman S, Akers TF, Barberan-Soler S, Zahler AM. Coordinated tissue-specific regulation of adjacent alternative 3' splice sites in C. elegans. Genome Res 2015; 25:982-94. [PMID: 25922281 PMCID: PMC4484395 DOI: 10.1101/gr.186783.114] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2014] [Accepted: 04/27/2015] [Indexed: 12/30/2022]
Abstract
Adjacent alternative 3′ splice sites, those separated by ≤18 nucleotides, provide a unique problem in the study of alternative splicing regulation; there is overlap of the cis-elements that define the adjacent sites. Identification of the intron's 3′ end depends upon sequence elements that define the branchpoint, polypyrimidine tract, and terminal AG dinucleotide. Starting with RNA-seq data from germline-enriched and somatic cell-enriched Caenorhabditis elegans samples, we identify hundreds of introns with adjacent alternative 3′ splice sites. We identify 203 events that undergo tissue-specific alternative splicing. For these, the regulation is monodirectional, with somatic cells preferring to splice at the distal 3′ splice site (furthest from the 5′ end of the intron) and germline cells showing a distinct shift toward usage of the adjacent proximal 3′ splice site (closer to the 5′ end of the intron). Splicing patterns in somatic cells follow C. elegans consensus rules of 3′ splice site definition; a short stretch of pyrimidines preceding an AG dinucleotide. Splicing in germline cells occurs at proximal 3′ splice sites that lack a preceding polypyrimidine tract, and in three instances the germline-specific site lacks the AG dinucleotide. We provide evidence that use of germline-specific proximal 3′ splice sites is conserved across Caenorhabditis species. We propose that there are differences between germline and somatic cells in the way that the basal splicing machinery functions to determine the intron terminus.
Collapse
Affiliation(s)
- James Matthew Ragle
- Department of Molecular, Cell, and Developmental Biology and The Center for Molecular Biology of RNA, University of California Santa Cruz, Santa Cruz, California 95064, USA
| | - Sol Katzman
- Center for Biomolecular Science and Engineering, University of California Santa Cruz, Santa Cruz, California 95064, USA
| | - Taylor F Akers
- Department of Molecular, Cell, and Developmental Biology and The Center for Molecular Biology of RNA, University of California Santa Cruz, Santa Cruz, California 95064, USA
| | - Sergio Barberan-Soler
- Gene Regulation, Stem Cells, and Cancer Program, Centre for Genomic Regulation (CRG), 08003 Barcelona, Spain
| | - Alan M Zahler
- Department of Molecular, Cell, and Developmental Biology and The Center for Molecular Biology of RNA, University of California Santa Cruz, Santa Cruz, California 95064, USA
| |
Collapse
|
17
|
Structure-guided U2AF65 variant improves recognition and splicing of a defective pre-mRNA. Proc Natl Acad Sci U S A 2014; 111:17420-5. [PMID: 25422459 DOI: 10.1073/pnas.1412743111] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
Purine interruptions of polypyrimidine (Py) tract splice site signals contribute to human genetic diseases. The essential splicing factor U2AF(65) normally recognizes a Py tract consensus sequence preceding the major class of 3' splice sites. We found that neurofibromatosis- or retinitis pigmentosa-causing mutations in the 5' regions of Py tracts severely reduce U2AF(65) affinity. Conversely, we identified a preferred binding site of U2AF(65) for purine substitutions in the 3' regions of Py tracts. Based on a comparison of new U2AF(65) structures bound to either A- or G-containing Py tracts with previously identified pyrimidine-containing structures, we expected to find that a D231V amino acid change in U2AF(65) would specify U over other nucleotides. We found that the crystal structure of the U2AF(65)-D231V variant confirms favorable packing between the engineered valine and a target uracil base. The D231V amino acid change restores U2AF(65) affinity for two mutated splice sites that cause human genetic diseases and successfully promotes splicing of a defective retinitis pigmentosa-causing transcript. We conclude that reduced U2AF(65) binding is a molecular consequence of disease-relevant mutations, and that a structure-guided U2AF(65) variant is capable of manipulating gene expression in eukaryotic cells.
Collapse
|
18
|
McInerney-Leo AM, Sparrow DB, Harris JE, Gardiner BB, Marshall MS, O'Reilly VC, Shi H, Brown MA, Leo PJ, Zankl A, Dunwoodie SL, Duncan EL. Compound heterozygous mutations in RIPPLY2 associated with vertebral segmentation defects. Hum Mol Genet 2014; 24:1234-42. [DOI: 10.1093/hmg/ddu534] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
|
19
|
Kuroyanagi H, Takei S, Suzuki Y. Comprehensive analysis of mutually exclusive alternative splicing in C. elegans. WORM 2014; 3:e28459. [PMID: 25254147 PMCID: PMC4165533 DOI: 10.4161/worm.28459] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/20/2013] [Revised: 02/28/2014] [Accepted: 03/06/2014] [Indexed: 12/03/2022]
Abstract
Mutually exclusive selection of one exon in a cluster of exons is a rare form of alternative pre-mRNA splicing, yet suggests strict regulation. However, the repertoires of regulation mechanisms for the mutually exclusive (ME) splicing in vivo are still unknown. Here, we experimentally explore putative ME exons in C. elegans to demonstrate that 29 ME exon clusters in 27 genes are actually selected in a mutually exclusive manner. Twenty-two of the clusters consist of homologous ME exons. Five clusters have too short intervening introns to be excised between the ME exons. Fidelity of ME splicing relies at least in part on nonsense-mediated mRNA decay for 14 clusters. These results thus characterize all the repertoires of ME splicing in this organism.
Collapse
Affiliation(s)
- Hidehito Kuroyanagi
- Laboratory of Gene Expression; Medical Research Institute; Tokyo Medical and Dental University; Bunkyo-ku, Tokyo, Japan
| | - Satomi Takei
- Laboratory of Gene Expression; Medical Research Institute; Tokyo Medical and Dental University; Bunkyo-ku, Tokyo, Japan
| | - Yutaka Suzuki
- Department of Medical Genome Science; University of Tokyo; Kashiwa, Chiba, Japan
| |
Collapse
|
20
|
Kuroyanagi H, Watanabe Y, Suzuki Y, Hagiwara M. Position-dependent and neuron-specific splicing regulation by the CELF family RNA-binding protein UNC-75 in Caenorhabditis elegans. Nucleic Acids Res 2013; 41:4015-25. [PMID: 23416545 PMCID: PMC3627589 DOI: 10.1093/nar/gkt097] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
A large fraction of protein-coding genes in metazoans undergo alternative pre-mRNA splicing in tissue- or cell-type-specific manners. Recent genome-wide approaches have identified many putative-binding sites for some of tissue-specific trans-acting splicing regulators. However, the mechanisms of splicing regulation in vivo remain largely unknown. To elucidate the modes of splicing regulation by the neuron-specific CELF family RNA-binding protein UNC-75 in Caenorhabditis elegans, we performed deep sequencing of poly(A)+ RNAs from the unc-75(+)- and unc-75-mutant worms and identified more than 20 cassette and mutually exclusive exons repressed or activated by UNC-75. Motif searches revealed that (G/U)UGUUGUG stretches are enriched in the upstream and downstream introns of the UNC-75-repressed and -activated exons, respectively. Recombinant UNC-75 protein specifically binds to RNA fragments carrying the (G/U)UGUUGUG stretches in vitro. Bi-chromatic fluorescence alternative splicing reporters revealed that the UNC-75-target exons are regulated in tissue-specific and (G/U)UGUUGUG element-dependent manners in vivo. The unc-75 mutation affected the splicing reporter expression specifically in the nervous system. These results indicate that UNC-75 regulates alternative splicing of its target exons in neuron-specific and position-dependent manners through the (G/U)UGUUGUG elements in C. elegans. This study thus reveals the repertoire of target events for the CELF family in the living organism.
Collapse
Affiliation(s)
- Hidehito Kuroyanagi
- Laboratory of Gene Expression, Graduate School of Biomedical Science, Tokyo Medical and Dental University, Bunkyo-ku, Tokyo 113-8510, Japan.
| | | | | | | |
Collapse
|
21
|
Jenkins JL, Agrawal AA, Gupta A, Green MR, Kielkopf CL. U2AF65 adapts to diverse pre-mRNA splice sites through conformational selection of specific and promiscuous RNA recognition motifs. Nucleic Acids Res 2013; 41:3859-73. [PMID: 23376934 PMCID: PMC3616741 DOI: 10.1093/nar/gkt046] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Degenerate splice site sequences mark the intron boundaries of pre-mRNA transcripts in multicellular eukaryotes. The essential pre-mRNA splicing factor U2AF65 is faced with the paradoxical tasks of accurately targeting polypyrimidine (Py) tracts preceding 3′ splice sites while adapting to both cytidine and uridine nucleotides with nearly equivalent frequencies. To understand how U2AF65 recognizes degenerate Py tracts, we determined six crystal structures of human U2AF65 bound to cytidine-containing Py tracts. As deoxy-ribose backbones were required for co-crystallization with these Py tracts, we also determined two baseline structures of U2AF65 bound to the deoxy-uridine counterparts and compared the original, RNA-bound structure. Local structural changes suggest that the N-terminal RNA recognition motif 1 (RRM1) is more promiscuous for cytosine-containing Py tracts than the C-terminal RRM2. These structural differences between the RRMs were reinforced by the specificities of wild-type and site-directed mutant U2AF65 for region-dependent cytosine- and uracil-containing RNA sites. Small-angle X-ray scattering analyses further demonstrated that Py tract variations select distinct inter-RRM spacings from a pre-existing ensemble of U2AF65 conformations. Our results highlight both local and global conformational selection as a means for universal 3′ splice site recognition by U2AF65.
Collapse
Affiliation(s)
- Jermaine L Jenkins
- Center for RNA Biology and Department of Biochemistry and Biophysics, University of Rochester School of Medicine and Dentistry, Rochester, NY 14642, USA
| | | | | | | | | |
Collapse
|
22
|
Ohno G, Ono K, Togo M, Watanabe Y, Ono S, Hagiwara M, Kuroyanagi H. Muscle-specific splicing factors ASD-2 and SUP-12 cooperatively switch alternative pre-mRNA processing patterns of the ADF/cofilin gene in Caenorhabditis elegans. PLoS Genet 2012; 8:e1002991. [PMID: 23071450 PMCID: PMC3469465 DOI: 10.1371/journal.pgen.1002991] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2012] [Accepted: 08/10/2012] [Indexed: 11/18/2022] Open
Abstract
Pre-mRNAs are often processed in complex patterns in tissue-specific manners to produce a variety of protein isoforms from single genes. However, mechanisms orchestrating the processing of the entire transcript are not well understood. Muscle-specific alternative pre-mRNA processing of the unc-60 gene in Caenorhabditis elegans, encoding two tissue-specific isoforms of ADF/cofilin with distinct biochemical properties in regulating actin organization, provides an excellent in vivo model of complex and tissue-specific pre-mRNA processing; it consists of a single first exon and two separate series of downstream exons. Here we visualize the complex muscle-specific processing pattern of the unc-60 pre-mRNA with asymmetric fluorescence reporter minigenes. By disrupting juxtaposed CUAAC repeats and UGUGUG stretch in intron 1A, we demonstrate that these elements are required for retaining intron 1A, as well as for switching the processing patterns of the entire pre-mRNA from non-muscle-type to muscle-type. Mutations in genes encoding muscle-specific RNA-binding proteins ASD-2 and SUP-12 turned the colour of the unc-60 reporter worms. ASD-2 and SUP-12 proteins specifically and cooperatively bind to CUAAC repeats and UGUGUG stretch in intron 1A, respectively, to form a ternary complex in vitro. Immunohistochemical staining and RT-PCR analyses demonstrate that ASD-2 and SUP-12 are also required for switching the processing patterns of the endogenous unc-60 pre-mRNA from UNC-60A to UNC-60B in muscles. Furthermore, systematic analyses of partially spliced RNAs reveal the actual orders of intron removal for distinct mRNA isoforms. Taken together, our results demonstrate that muscle-specific splicing factors ASD-2 and SUP-12 cooperatively promote muscle-specific processing of the unc-60 gene, and provide insight into the mechanisms of complex pre-mRNA processing; combinatorial regulation of a single splice site by two tissue-specific splicing regulators determines the binary fate of the entire transcript.
Collapse
Affiliation(s)
- Genta Ohno
- Laboratory of Gene Expression, Graduate School of Biomedical Science, Tokyo Medical and Dental University, Tokyo, Japan
- Department of Functional Genomics, Medical Research Institute, Tokyo Medical and Dental University, Tokyo, Japan
- Research Fellowship for Young Scientists, Japan Society for the Promotion of Science (JSPS), Tokyo, Japan
| | - Kanako Ono
- Department of Pathology, Emory University, Atlanta, Georgia, United States of America
| | - Marina Togo
- Laboratory of Gene Expression, Graduate School of Biomedical Science, Tokyo Medical and Dental University, Tokyo, Japan
| | - Yohei Watanabe
- Laboratory of Gene Expression, Graduate School of Biomedical Science, Tokyo Medical and Dental University, Tokyo, Japan
| | - Shoichiro Ono
- Department of Pathology, Emory University, Atlanta, Georgia, United States of America
| | - Masatoshi Hagiwara
- Laboratory of Gene Expression, Graduate School of Biomedical Science, Tokyo Medical and Dental University, Tokyo, Japan
- Department of Functional Genomics, Medical Research Institute, Tokyo Medical and Dental University, Tokyo, Japan
- Graduate School of Medicine, Kyoto University, Kyoto, Japan
| | - Hidehito Kuroyanagi
- Laboratory of Gene Expression, Graduate School of Biomedical Science, Tokyo Medical and Dental University, Tokyo, Japan
- Department of Functional Genomics, Medical Research Institute, Tokyo Medical and Dental University, Tokyo, Japan
- Precursory Research for Embryonic Science and Technology (PRESTO), Japan Science and Technology Agency (JST), Kawaguchi, Saitama, Japan
| |
Collapse
|
23
|
Wang C, Wilson-Berry L, Schedl T, Hansen D. TEG-1 CD2BP2 regulates stem cell proliferation and sex determination in the C. elegans germ line and physically interacts with the UAF-1 U2AF65 splicing factor. Dev Dyn 2012; 241:505-21. [PMID: 22275078 PMCID: PMC3466600 DOI: 10.1002/dvdy.23735] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/03/2012] [Indexed: 12/28/2022] Open
Abstract
BACKGROUND For a stem cell population to exist over an extended period, a balance must be maintained between self-renewing (proliferating) and differentiating daughter cells. Within the Caenorhabditis elegans germ line, this balance is controlled by a genetic regulatory pathway, which includes the canonical Notch signaling pathway. RESULTS Genetic screens identified the gene teg-1 as being involved in regulating the proliferation versus differentiation decision in the C. elegans germ line. Cloning of TEG-1 revealed that it is a homolog of mammalian CD2BP2, which has been implicated in a number of cellular processes, including in U4/U6.U5 tri-snRNP formation in the pre-mRNA splicing reaction. The position of teg-1 in the genetic pathway regulating the proliferation versus differentiation decision, its single mutant phenotype, and its enrichment in nuclei, all suggest TEG-1 also functions as a splicing factor. TEG-1, as well as its human homolog, CD2BP2, directly bind to UAF-1 U2AF65, a component of the U2 auxiliary factor. CONCLUSIONS TEG-1 functions as a splicing factor and acts to regulate the proliferation versus meiosis decision. The interaction of TEG-1 CD2BP2 with UAF-1 U2AF65, combined with its previously described function in U4/U6.U5 tri-snRNP, suggests that TEG-1 CD2BP2 functions in two distinct locations in the splicing cascade.
Collapse
Affiliation(s)
- Chris Wang
- University of Calgary, Department of Biological Sciences, Alberta, Calgary, Canada
| | | | | | | |
Collapse
|
24
|
Ma L, Tan Z, Teng Y, Hoersch S, Horvitz HR. In vivo effects on intron retention and exon skipping by the U2AF large subunit and SF1/BBP in the nematode Caenorhabditis elegans. RNA (NEW YORK, N.Y.) 2011; 17:2201-2211. [PMID: 22033331 PMCID: PMC3222132 DOI: 10.1261/rna.027458.111] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/23/2011] [Accepted: 09/27/2011] [Indexed: 05/31/2023]
Abstract
The in vivo analysis of the roles of splicing factors in regulating alternative splicing in animals remains a challenge. Using a microarray-based screen, we identified a Caenorhabditis elegans gene, tos-1, that exhibited three of the four major types of alternative splicing: intron retention, exon skipping, and, in the presence of U2AF large subunit mutations, the use of alternative 3' splice sites. Mutations in the splicing factors U2AF large subunit and SF1/BBP altered the splicing of tos-1. 3' splice sites of the retained intron or before the skipped exon regulate the splicing pattern of tos-1. Our study provides in vivo evidence that intron retention and exon skipping can be regulated largely by the identities of 3' splice sites.
Collapse
Affiliation(s)
- Long Ma
- State Key Laboratory of Medical Genetics, School of Biological Sciences and Technology, Central South University, Changsha, Hunan 410078, China
| | - Zhiping Tan
- Center for Clinical Gene Diagnosis and Therapy, The Second Xiangya Hospital, State Key Laboratory of Medical Genetics, Central South University, Changsha 410078, China
| | - Yanling Teng
- State Key Laboratory of Medical Genetics, School of Biological Sciences and Technology, Central South University, Changsha, Hunan 410078, China
| | - Sebastian Hoersch
- Koch Institute for Integrative Cancer Research, MIT, Cambridge, Massachusetts 02139, USA
- Max Delbrück Center for Molecular Medicine, 13125 Berlin, Germany
| | - H. Robert Horvitz
- Department of Biology, Howard Hughes Medical Institute, MIT, Cambridge, Massachusetts 02139, USA
| |
Collapse
|
25
|
Branch point identification and sequence requirements for intron splicing in Plasmodium falciparum. EUKARYOTIC CELL 2011; 10:1422-8. [PMID: 21926333 DOI: 10.1128/ec.05193-11] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
Splicing of mRNA is an ancient and evolutionarily conserved process in eukaryotic organisms, but intron-exon structures vary. Plasmodium falciparum has an extreme AT nucleotide bias (>80%), providing a unique opportunity to investigate how evolutionary forces have acted on intron structures. In this study, we developed an in vivo luciferase reporter splicing assay and employed it in combination with lariat isolation and sequencing to characterize 5' and 3' splicing requirements and experimentally determine the intron branch point in P. falciparum. This analysis indicates that P. falciparum mRNAs have canonical 5' and 3' splice sites. However, the 5' consensus motif is weakly conserved and tolerates nucleotide substitution, including the fifth nucleotide in the intron, which is more typically a G nucleotide in most eukaryotes. In comparison, the 3' splice site has a strong eukaryotic consensus sequence and adjacent polypyrimidine tract. In four different P. falciparum pre-mRNAs, multiple branch points per intron were detected, with some at U instead of the typical A residue. A weak branch point consensus was detected among 18 identified branch points. This analysis indicates that P. falciparum retains many consensus eukaryotic splice site features, despite having an extreme codon bias, and possesses flexibility in branch point nucleophilic attack.
Collapse
|
26
|
|
27
|
Wang F, Huang S, Ma L. Caenorhabditis elegans operons contain a higher proportion of genes with multiple transcripts and use 3' splice sites differentially. PLoS One 2010; 5:e12456. [PMID: 20805997 PMCID: PMC2929210 DOI: 10.1371/journal.pone.0012456] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2010] [Accepted: 08/05/2010] [Indexed: 01/23/2023] Open
Abstract
RNA splicing generates multiple transcript isoforms from a single gene and enhances the complexity of eukaryotic gene expression. In some eukaryotes, operon exists as an ancient regulatory mechanism of gene expression that requires strict positional and regulatory relationships among its genes. It remains unknown whether operonic genes generate transcript isoforms in a similar manner as non-operonic genes do, the expression of which is less likely limited by their positions and relationships with surrounding genes. We analyzed the number of transcript isoforms of Caenorhabditis elegans operonic genes and found that C. elegans operons contain a much higher proportion of genes with multiple transcript isoforms than non-operonic genes do. For genes that express multiple transcript isoforms, there is no apparent difference between the number of isoforms in operonic and non-operonic genes. C. elegans operonic genes also have a different preference of the 20 most common 3′ splice sites compared to non-operonic genes. Our analyses suggest that C. elegans operons enhance expression complexity by increasing the proportion of genes that express multiple transcript isoforms and maintain splicing efficiency by differential use of common 3′ splice sites.
Collapse
Affiliation(s)
- Fei Wang
- State Key Laboratory of Medical Genetics, Central South University, Changsha, China
| | - Shi Huang
- State Key Laboratory of Medical Genetics, Central South University, Changsha, China
| | - Long Ma
- State Key Laboratory of Medical Genetics, Central South University, Changsha, China
- * E-mail:
| |
Collapse
|
28
|
Sinha R, Zimmer AD, Bolte K, Lang D, Reski R, Platzer M, Rensing SA, Backofen R. Identification and characterization of NAGNAG alternative splicing in the moss Physcomitrella patens. BMC PLANT BIOLOGY 2010; 10:76. [PMID: 20426810 PMCID: PMC3095350 DOI: 10.1186/1471-2229-10-76] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/15/2009] [Accepted: 04/28/2010] [Indexed: 05/05/2023]
Abstract
BACKGROUND Alternative splicing (AS) involving tandem acceptors that are separated by three nucleotides (NAGNAG) is an evolutionarily widespread class of AS, which is well studied in Homo sapiens (human) and Mus musculus (mouse). It has also been shown to be common in the model seed plants Arabidopsis thaliana and Oryza sativa (rice). In one of the first studies involving sequence-based prediction of AS in plants, we performed a genome-wide identification and characterization of NAGNAG AS in the model plant Physcomitrella patens, a moss. RESULTS Using Sanger data, we found 295 alternatively used NAGNAG acceptors in P. patens. Using 31 features and training and test datasets of constitutive and alternative NAGNAGs, we trained a classifier to predict the splicing outcome at NAGNAG tandem splice sites (alternative splicing, constitutive at the first acceptor, or constitutive at the second acceptor). Our classifier achieved a balanced specificity and sensitivity of >or= 89%. Subsequently, a classifier trained exclusively on data well supported by transcript evidence was used to make genome-wide predictions of NAGNAG splicing outcomes. By generation of more transcript evidence from a next-generation sequencing platform (Roche 454), we found additional evidence for NAGNAG AS, with altogether 664 alternative NAGNAGs being detected in P. patens using all currently available transcript evidence. The 454 data also enabled us to validate the predictions of the classifier, with 64% (80/125) of the well-supported cases of AS being predicted correctly. CONCLUSION NAGNAG AS is just as common in the moss P. patens as it is in the seed plants A. thaliana and O. sativa (but not conserved on the level of orthologous introns), and can be predicted with high accuracy. The most informative features are the nucleotides in the NAGNAG and in its immediate vicinity, along with the splice sites scores, as found earlier for NAGNAG AS in animals. Our results suggest that the mechanism behind NAGNAG AS in plants is similar to that in animals and is largely dependent on the splice site and its immediate neighborhood.
Collapse
Affiliation(s)
- Rileen Sinha
- Bioinformatics group, University of Freiburg, Georges-Koehler-Allee 106, 79110 Freiburg, Germany
- Centre for Biological Signalling Studies (bioss), University of Freiburg, Albertstr. 19, 79104 Freiburg, Germany
| | - Andreas D Zimmer
- Faculty of Biology, University of Freiburg, Hauptstrasse 1, 79104 Freiburg, Germany
- Plant Biotechnology, Faculty of Biology, University of Freiburg, Schaenzlestrasse 1, 79104 Freiburg, Germany
| | - Kathrin Bolte
- Faculty of Biology, University of Freiburg, Hauptstrasse 1, 79104 Freiburg, Germany
- Freiburg Initiative for Systems Biology (FRISYS), University of Freiburg, Schaenzlestrasse 1, 79104 Freiburg, Germany
- Philipps-Universität Marburg, Laboratorium für Zellbiologie, Karl-von-Frisch Str., 35032 Marburg, Germany
| | - Daniel Lang
- Faculty of Biology, University of Freiburg, Hauptstrasse 1, 79104 Freiburg, Germany
- Plant Biotechnology, Faculty of Biology, University of Freiburg, Schaenzlestrasse 1, 79104 Freiburg, Germany
| | - Ralf Reski
- Plant Biotechnology, Faculty of Biology, University of Freiburg, Schaenzlestrasse 1, 79104 Freiburg, Germany
- Freiburg Initiative for Systems Biology (FRISYS), University of Freiburg, Schaenzlestrasse 1, 79104 Freiburg, Germany
- Centre for Biological Signalling Studies (bioss), University of Freiburg, Albertstr. 19, 79104 Freiburg, Germany
| | - Matthias Platzer
- Genome Analysis, Leibniz Institute for Age Research - Fritz Lipmann Institute, Beutenbergstr. 11, 07745 Jena, Germany
| | - Stefan A Rensing
- Faculty of Biology, University of Freiburg, Hauptstrasse 1, 79104 Freiburg, Germany
- Freiburg Initiative for Systems Biology (FRISYS), University of Freiburg, Schaenzlestrasse 1, 79104 Freiburg, Germany
- Centre for Biological Signalling Studies (bioss), University of Freiburg, Albertstr. 19, 79104 Freiburg, Germany
| | - Rolf Backofen
- Bioinformatics group, University of Freiburg, Georges-Koehler-Allee 106, 79110 Freiburg, Germany
- Freiburg Initiative for Systems Biology (FRISYS), University of Freiburg, Schaenzlestrasse 1, 79104 Freiburg, Germany
- Centre for Biological Signalling Studies (bioss), University of Freiburg, Albertstr. 19, 79104 Freiburg, Germany
| |
Collapse
|
29
|
Ma L, Horvitz HR. Mutations in the Caenorhabditis elegans U2AF large subunit UAF-1 alter the choice of a 3' splice site in vivo. PLoS Genet 2009; 5:e1000708. [PMID: 19893607 PMCID: PMC2762039 DOI: 10.1371/journal.pgen.1000708] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2009] [Accepted: 10/05/2009] [Indexed: 11/18/2022] Open
Abstract
The removal of introns from eukaryotic RNA transcripts requires the activities of five multi-component ribonucleoprotein complexes and numerous associated proteins. The lack of mutations affecting splicing factors essential for animal survival has limited the study of the in vivo regulation of splicing. From a screen for suppressors of the Caenorhabditis elegans unc-93(e1500) rubberband Unc phenotype, we identified mutations in genes that encode the C. elegans orthologs of two splicing factors, the U2AF large subunit (UAF-1) and SF1/BBP (SFA-1). The uaf-1(n4588) mutation resulted in temperature-sensitive lethality and caused the unc-93 RNA transcript to be spliced using a cryptic 3′ splice site generated by the unc-93(e1500) missense mutation. The sfa-1(n4562) mutation did not cause the utilization of this cryptic 3′ splice site. We isolated four uaf-1(n4588) intragenic suppressors that restored the viability of uaf-1 mutants at 25°C. These suppressors differentially affected the recognition of the cryptic 3′ splice site and implicated a small region of UAF-1 between the U2AF small subunit-interaction domain and the first RNA recognition motif in affecting the choice of 3′ splice site. We constructed a reporter for unc-93 splicing and using site-directed mutagenesis found that the position of the cryptic splice site affects its recognition. We also identified nucleotides of the endogenous 3′ splice site important for recognition by wild-type UAF-1. Our genetic and molecular analyses suggested that the phenotypic suppression of the unc-93(e1500) Unc phenotype by uaf-1(n4588) and sfa-1(n4562) was likely caused by altered splicing of an unknown gene. Our observations provide in vivo evidence that UAF-1 can act in regulating 3′ splice-site choice and establish a system that can be used to investigate the in vivo regulation of RNA splicing in C. elegans. Eukaryotic genes contain intervening intronic sequences that must be removed from pre-mRNA transcripts by RNA splicing to generate functional messenger RNAs. While studying genes that encode and control a presumptive muscle potassium channel complex in the nematode Caenorhabditis elegans, we found that mutations in two splicing factors, the U2AF large subunit and SF1/BBP suppress the rubberband Unc phenotype caused by a rare missense mutation in the gene unc-93. Mutations affecting the U2AF large subunit caused the recognition of a cryptic 3′ splice site generated by the unc-93 mutation, providing in vivo evidence that the U2AF large subunit can affect splice-site selection. By contrast, an SF1/BBP mutation that suppressed the rubberband Unc phenotype did not cause splicing using this cryptic 3′ splice site. Our genetic studies identified a region of the U2AF large subunit important for its effect on 3′ splice-site choice. Our mutagenesis analysis of in vivo transgene splicing identified a positional effect on weak 3′ splice site selection and nucleotides of the endogenous 3′ splice site important for recognition. The system we have defined should facilitate future in vivo analyses of pre–mRNA splicing.
Collapse
Affiliation(s)
- Long Ma
- Howard Hughes Medical Institute, Department of Biology, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - H. Robert Horvitz
- Howard Hughes Medical Institute, Department of Biology, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
- * E-mail:
| |
Collapse
|
30
|
Sinha R, Nikolajewa S, Szafranski K, Hiller M, Jahn N, Huse K, Platzer M, Backofen R. Accurate prediction of NAGNAG alternative splicing. Nucleic Acids Res 2009; 37:3569-79. [PMID: 19359358 PMCID: PMC2699507 DOI: 10.1093/nar/gkp220] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
Abstract
Alternative splicing (AS) involving NAGNAG tandem acceptors is an evolutionarily widespread class of AS. Recent predictions of alternative acceptor usage reported better results for acceptors separated by larger distances, than for NAGNAGs. To improve the latter, we aimed at the use of Bayesian networks (BN), and extensive experimental validation of the predictions. Using carefully constructed training and test datasets, a balanced sensitivity and specificity of ≥92% was achieved. A BN trained on the combined dataset was then used to make predictions, and 81% (38/47) of the experimentally tested predictions were verified. Using a BN learned on human data on six other genomes, we show that while the performance for the vertebrate genomes matches that achieved on human data, there is a slight drop for Drosophila and worm. Lastly, using the prediction accuracy according to experimental validation, we estimate the number of yet undiscovered alternative NAGNAGs. State of the art classifiers can produce highly accurate prediction of AS at NAGNAGs, indicating that we have identified the major features of the ‘NAGNAG-splicing code’ within the splice site and its immediate neighborhood. Our results suggest that the mechanism behind NAGNAG AS is simple, stochastic, and conserved among vertebrates and beyond.
Collapse
Affiliation(s)
- Rileen Sinha
- Leibniz Institute for Age Research - Fritz Lipmann Institute, Genome Analysis, Beutenbergstrasse 11, 07745 Jena, Germany
| | | | | | | | | | | | | | | |
Collapse
|
31
|
Hiller M, Platzer M. Widespread and subtle: alternative splicing at short-distance tandem sites. Trends Genet 2008; 24:246-55. [PMID: 18394746 DOI: 10.1016/j.tig.2008.03.003] [Citation(s) in RCA: 52] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2008] [Revised: 03/05/2008] [Accepted: 03/06/2008] [Indexed: 12/11/2022]
Abstract
Alternative splicing at donor or acceptor sites located just a few nucleotides apart is widespread in many species. It results in subtle changes in the transcripts and often in the encoded proteins. Several of these tandem splice events contribute to the repertoire of functionally different proteins, whereas many are neutral or deleterious. Remarkably, some of the functional events are differentially spliced in tissues or developmental stages, whereas others exhibit constant splicing ratios, indicating that function is not always associated with differential splicing. Stochastic splice site selection seems to play a major role in these processes. Here, we review recent progress in understanding functional and evolutionary aspects as well as the mechanism of splicing at short-distance tandem sites.
Collapse
Affiliation(s)
- Michael Hiller
- Bioinformatics Group, Albert-Ludwigs-University Freiburg, 79110 Freiburg, Germany.
| | | |
Collapse
|
32
|
Hiller M, Szafranski K, Huse K, Backofen R, Platzer M. Selection against tandem splice sites affecting structured protein regions. BMC Evol Biol 2008; 8:89. [PMID: 18366714 PMCID: PMC2279118 DOI: 10.1186/1471-2148-8-89] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2007] [Accepted: 03/21/2008] [Indexed: 01/10/2023] Open
Abstract
BACKGROUND Alternative selection of splice sites in tandem donors and acceptors is a major mode of alternative splicing. Here, we analyzed whether in-frame tandem sites leading to subtle mRNA insertions/deletions of 3, 6, or 9 nucleotides are under natural selection. RESULTS We found multiple lines of evidence that the human protein coding sequences are under selection against such in-frame tandem splice events, indicating that these events are often deleterious. The strength of selection is not homogeneous within the coding sequence as protein regions that fold into a fixed 3D structure (intrinsically ordered) are under stronger selection, especially against sites with a strong minor splice site. Investigating structures of functional protein domains, we found that tandem acceptors are preferentially located at the domain surface and outside structural elements such as helices and sheets. Using three-species comparisons, we estimate that more than half of all mutations that create NAGNAG acceptors in the coding region have been eliminated by selection. CONCLUSION We estimate that ~2,400 introns are under selection against possessing a tandem site.
Collapse
Affiliation(s)
- Michael Hiller
- Bioinformatics Group, Albert-Ludwigs-University Freiburg, Georges-Koehler-Allee 106, 79110 Freiburg, Germany.
| | | | | | | | | |
Collapse
|
33
|
Ohno G, Hagiwara M, Kuroyanagi H. STAR family RNA-binding protein ASD-2 regulates developmental switching of mutually exclusive alternative splicing in vivo. Genes Dev 2008; 22:360-74. [PMID: 18230701 PMCID: PMC2216695 DOI: 10.1101/gad.1620608] [Citation(s) in RCA: 61] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2007] [Accepted: 12/04/2007] [Indexed: 11/25/2022]
Abstract
Alternative splicing of pre-mRNAs greatly contributes to the spatiotemporal diversity of gene expression in metazoans. However, the molecular basis of developmental regulation and the precise sequence of alternative pre-mRNA processing in vivo are poorly understood. In the present study, we focus on the developmental switching of the mutually exclusive alternative splicing of the let-2 gene of Caenorhabditis elegans from the exon 9 form in embryos to the exon 10 form in adults. By visualizing the usage of the let-2 mutually exclusive exons through differential expression of green fluorescent protein (GFP) and red fluorescent protein (RFP), we isolated several switching-defective mutants and identified the alternative splicing defective-2 (asd-2) gene, encoding a novel member of the evolutionarily conserved STAR (signal transduction activators of RNA) family of RNA-binding proteins. Comparison of the amounts of partially spliced let-2 RNAs in synchronized wild-type and asd-2 mutant worms suggested that either of the introns downstream from the let-2 mutually exclusive exons is removed prior to the removal of the upstream ones, and that asd-2 promotes biased excision of intron 10 in the late larval stages. We propose that the developmental switching between alternative sequences of intron removal determines the ratio between the mature let-2 mRNA isoforms.
Collapse
Affiliation(s)
- Genta Ohno
- Laboratory of Gene Expression, School of Biomedical Science, Tokyo Medical and Dental University, Tokyo 113-8510, Japan
| | - Masatoshi Hagiwara
- Laboratory of Gene Expression, School of Biomedical Science, Tokyo Medical and Dental University, Tokyo 113-8510, Japan
- Department of Functional Genomics, Medical Research Institute, Tokyo Medical and Dental University, Tokyo 113-8510, Japan
| | - Hidehito Kuroyanagi
- Laboratory of Gene Expression, School of Biomedical Science, Tokyo Medical and Dental University, Tokyo 113-8510, Japan
- Department of Functional Genomics, Medical Research Institute, Tokyo Medical and Dental University, Tokyo 113-8510, Japan
| |
Collapse
|
34
|
Schwartz SH, Silva J, Burstein D, Pupko T, Eyras E, Ast G. Large-scale comparative analysis of splicing signals and their corresponding splicing factors in eukaryotes. Genome Res 2007; 18:88-103. [PMID: 18032728 DOI: 10.1101/gr.6818908] [Citation(s) in RCA: 136] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Introns are among the hallmarks of eukaryotic genes. Splicing of introns is directed by three main splicing signals: the 5' splice site (5'ss), the branch site (BS), and the polypyrimdine tract/3'splice site (PPT-3'ss). To study the evolution of these splicing signals, we have conducted a systematic comparative analysis of these signals in over 1.2 million introns from 22 eukaryotes. Our analyses suggest that all these signals have dramatically evolved: The PPT is weak among most fungi, intermediate in plants and protozoans, and strongest in metazoans. Within metazoans it shows a gradual strengthening from Caenorhabditis elegans to human. The 5'ss and the BS were found to be degenerate among most organisms, but highly conserved among some fungi. A maximum parsimony-based algorithm for reconstructing ancestral position-specific scoring matrices suggested that the ancestral 5'ss and BS were degenerate, as in metazoans. To shed light on the evolutionary variation in splicing signals, we have analyzed the evolutionary changes in the factors that bind these signals. Our analysis reveals coevolution of splicing signals and their corresponding splicing factors: The strength of the PPT is correlated to changes in key residues in its corresponding splicing factor U2AF2; limited correlation was found between changes in the 5'ss and U1 snRNA that binds it; but not between the BS and U2 snRNA. Thus, although the basic ability of eukaryotes to splice introns has remained conserved throughout evolution, the splicing signals and their corresponding splicing factors have considerably evolved, uniquely shaping the splicing mechanisms of different organisms.
Collapse
Affiliation(s)
- Schraga H Schwartz
- Department of Human Molecular Genetics and Biochemistry, Sackler Faculty of Medicine, Tel-Aviv University, Ramat Aviv 69978, Israel
| | | | | | | | | | | |
Collapse
|
35
|
The Fox-1 family and SUP-12 coordinately regulate tissue-specific alternative splicing in vivo. Mol Cell Biol 2007; 27:8612-21. [PMID: 17923701 DOI: 10.1128/mcb.01508-07] [Citation(s) in RCA: 77] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Many pre-mRNAs are alternatively spliced in a tissue-specific manner in multicellular organisms. The Fox-1 family of RNA-binding proteins regulate alternative splicing by either activating or repressing exon inclusion through specific binding to UGCAUG stretches. However, the precise cellular contexts that determine the action of the Fox-1 family in vivo remain to be elucidated. We have recently demonstrated that ASD-1 and FOX-1, members of the Fox-1 family in Caenorhabditis elegans, regulate tissue-specific alternative splicing of the fibroblast growth factor receptor gene, egl-15, which eventually determines the ligand specificity of the receptor in vivo. Here we report that another RNA-binding protein, SUP-12, coregulates the egl-15 alternative splicing. By screening for mutants defective in the muscle-specific expression of our alternative splicing reporter, we identified the muscle-specific RNA-binding protein SUP-12. We identified juxtaposed conserved stretches as the cis elements responsible for the regulation. The Fox-1 family and the SUP-12 proteins form a stable complex with egl-15 RNA, depending on the cis elements. Furthermore, the asd-1; sup-12 double mutant is defective in sex myoblast migration, phenocopying the isoform-specific egl-15(5A) mutant. These results establish an in vivo model that coordination of the two families of RNA-binding proteins regulates tissue-specific alternative splicing of a specific target gene.
Collapse
|
36
|
Graber JH, Salisbury J, Hutchins LN, Blumenthal T. C. elegans sequences that control trans-splicing and operon pre-mRNA processing. RNA (NEW YORK, N.Y.) 2007; 13:1409-26. [PMID: 17630324 PMCID: PMC1950753 DOI: 10.1261/rna.596707] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/12/2007] [Accepted: 05/17/2007] [Indexed: 05/04/2023]
Abstract
Many mRNAs in Caenorhabditis elegans are generated through a trans-splicing reaction that adds one of two classes of spliced leader RNA to an independently transcribed pre-mRNA. SL1 leaders are spliced mostly to pre-mRNAs from genes with outrons, intron-like sequences at the 5'-ends of the pre-mRNAs. In contrast, SL2 leaders are nearly exclusively trans-spliced to genes that occur downstream in polycistronic pre-mRNAs produced from operons. Operon pre-mRNA processing requires separation into individual transcripts, which is accomplished by 3'-processing of upstream genes and spliced leader trans-splicing to the downstream genes. We used a novel computational analysis, based on nonnegative matrix factorization, to identify and characterize significant differences in the cis-acting sequence elements that differentiate various types of functional site, including internal versus terminal 3'-processing sites, and SL1 versus SL2 trans-splicing sites. We describe several key elements, including the U-rich (Ur) element that couples 3'-processing with SL2 trans-splicing, and a novel outron (Ou) element that occurs upstream of SL1 trans-splicing sites. Finally, we present models of the distinct classes of trans-splicing reaction, including SL1 trans-splicing at the outron, SL2 trans-splicing in standard operons, competitive SL1-SL2 trans-splicing in operons with large intergenic separation, and SL1 trans-splicing in SL1-type operons, which have no intergenic separation.
Collapse
|
37
|
Sridharan V, Singh R. A conditional role of U2AF in splicing of introns with unconventional polypyrimidine tracts. Mol Cell Biol 2007; 27:7334-44. [PMID: 17709389 PMCID: PMC2168890 DOI: 10.1128/mcb.00627-07] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Recognition of polypyrimidine (Py) tracts typically present between the branch point and the 3' splice site by the large subunit of the essential splicing factor U2AF is a key early step in pre-mRNA splicing. Diverse intronic sequence arrangements exist, however, including 3' splice sites lacking recognizable Py tracts, which raises the question of how general the requirement for U2AF is for various intron architectures. Our analysis of fission yeast introns in vivo has unexpectedly revealed that whereas introns lacking Py tracts altogether remain dependent on both subunits of U2AF, introns with long Py tracts, unconventionally positioned upstream of branch points, are unaffected by U2AF inactivation. Nevertheless, mutation of these Py tracts causes strong dependence on the large subunit U2AF59. We also find that Py tract diversity influences the requirement for the conserved C-terminal domain of U2AF59 (RNA recognition motif 3), which has been implicated in protein-protein interactions with other splicing factors. Together, these results suggest that in addition to Py tract binding by U2AF, supplementary mechanisms of U2AF recruitment and 3' splice site identification exist to accommodate diverse intron architectures, which have gone unappreciated in biochemical studies of model pre-mRNAs.
Collapse
Affiliation(s)
- Vinod Sridharan
- Department of Molecular, Cellular and Developmental Biology, University of Colorado at Boulder, Boulder, CO 80309, USA
| | | |
Collapse
|
38
|
Rekha TS, Mitra CK. Comparative analysis of splice site regions by information content. GENOMICS PROTEOMICS & BIOINFORMATICS 2007; 4:230-7. [PMID: 17531798 PMCID: PMC5054069 DOI: 10.1016/s1672-0229(07)60003-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
We have applied concepts from information theory for a comparative analysis of donor (gt) and acceptor (ag) splice site regions in the genes of five different organisms by calculating their mutual information content (relative entropy) over a selected block of nucleotides. A similar pattern that the information content decreases as the block size increases was observed for both regions in all the organisms studied. This result suggests that the information required for splicing might be contained in the consensus of ~6–8 nt at both regions. We assume from our study that even though the nucleotides are showing some degrees of conservation in the flanking regions of the splice sites, certain level of variability is still tolerated, which leads the splicing process to occur normally even if the extent of base pairing is not fully satisfied. We also suggest that this variability can be compensated by recognizing different splice sites with different spliceosomal factors.
Collapse
|
39
|
MacMorris M, Kumar M, Lasda E, Larsen A, Kraemer B, Blumenthal T. A novel family of C. elegans snRNPs contains proteins associated with trans-splicing. RNA (NEW YORK, N.Y.) 2007; 13:511-20. [PMID: 17283210 PMCID: PMC1831854 DOI: 10.1261/rna.426707] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/05/2006] [Accepted: 01/02/2007] [Indexed: 05/13/2023]
Abstract
In many Caenorhabditis elegans pre-mRNAs, the RNA sequence between the 5' cap and the first 3' splice site is replaced by trans-splicing a short spliced leader (SL) from the Sm snRNP, SL1. C. elegans also utilizes a similar Sm snRNP, SL2, to trans-splice at sites between genes in polycistronic pre-mRNAs from operons. How do SL1 and SL2 snRNPs function in different contexts? Here we show that the SL1 snRNP contains a complex of SL75p and SL21p, which are homologs of novel proteins previously reported in the Ascaris SL snRNP. Interestingly, we show that the SL2 snRNP does not contain these proteins. However, SL75p and SL26p, a paralog of SL21p, are components of another Sm snRNP that contains a novel snRNA species, Sm Y. Knockdown of SL75p is lethal. However, knockdown of either SL21p or SL26p alone leads to cold-sensitive sterility, whereas knockdown of both SL21p and SL26p is lethal. This suggests that these two proteins have overlapping functions even though they are associated with different classes of snRNP. These phenotypic relationships, along with the association of SL26p with SL75p, imply that, like the SL1 RNA/Sm/SL75p/SL21p complex, the Sm Y/Sm/SL75p/SL26p complex is associated with trans-splicing.
Collapse
MESH Headings
- Alternative Splicing
- Amino Acid Sequence
- Animals
- Caenorhabditis elegans/genetics
- Genes, Helminth
- Molecular Sequence Data
- Operon
- RNA Splicing
- RNA, Helminth/chemistry
- RNA, Helminth/genetics
- RNA, Helminth/metabolism
- RNA, Spliced Leader/metabolism
- RNA-Binding Proteins/chemistry
- RNA-Binding Proteins/metabolism
- Ribonucleoproteins, Small Nuclear/chemistry
- Ribonucleoproteins, Small Nuclear/classification
- Ribonucleoproteins, Small Nuclear/genetics
- Ribonucleoproteins, Small Nuclear/metabolism
- Sequence Homology, Amino Acid
- Trans-Splicing
Collapse
Affiliation(s)
- Margaret MacMorris
- Department of Biochemistry and Molecular Genetics, University of Colorado School of Medicine, Aurora, CO 80045, USA
| | | | | | | | | | | |
Collapse
|
40
|
Vorechovský I. Aberrant 3' splice sites in human disease genes: mutation pattern, nucleotide structure and comparison of computational tools that predict their utilization. Nucleic Acids Res 2006; 34:4630-41. [PMID: 16963498 PMCID: PMC1636351 DOI: 10.1093/nar/gkl535] [Citation(s) in RCA: 76] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The frequency distribution of mutation-induced aberrant 3' splice sites (3'ss) in exons and introns is more complex than for 5' splice sites, largely owing to sequence constraints upstream of intron/exon boundaries. As a result, prediction of their localization remains a challenging task. Here, nucleotide sequences of previously reported 218 aberrant 3'ss activated by disease-causing mutations in 131 human genes were compared with their authentic counterparts using currently available splice site prediction tools. Each tested algorithm distinguished authentic 3'ss from cryptic sites more effectively than from de novo sites. The best discrimination between aberrant and authentic 3'ss was achieved by the maximum entropy model. Almost one half of aberrant 3'ss was activated by AG-creating mutations and approximately 95% of the newly created AGs were selected in vivo. The overall nucleotide structure upstream of aberrant 3'ss was characterized by higher purine content than for authentic sites, particularly in position -3, that may be compensated by more stringent requirements for positive and negative nucleotide signatures centred around position -11. A newly developed online database of aberrant 3'ss will facilitate identification of splicing mutations in a gene or phenotype of interest and future optimization of splice site prediction tools.
Collapse
Affiliation(s)
- Igor Vorechovský
- University of Southampton School of Medicine, Division of Human Genetics, Mailpoint 808, Southampton SO16 6YD, UK
| |
Collapse
|
41
|
Sheth N, Roca X, Hastings ML, Roeder T, Krainer AR, Sachidanandam R. Comprehensive splice-site analysis using comparative genomics. Nucleic Acids Res 2006; 34:3955-67. [PMID: 16914448 PMCID: PMC1557818 DOI: 10.1093/nar/gkl556] [Citation(s) in RCA: 286] [Impact Index Per Article: 15.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2006] [Revised: 07/13/2006] [Accepted: 07/17/2006] [Indexed: 11/12/2022] Open
Abstract
We have collected over half a million splice sites from five species-Homo sapiens, Mus musculus, Drosophila melanogaster, Caenorhabditis elegans and Arabidopsis thaliana-and classified them into four subtypes: U2-type GT-AG and GC-AG and U12-type GT-AG and AT-AC. We have also found new examples of rare splice-site categories, such as U12-type introns without canonical borders, and U2-dependent AT-AC introns. The splice-site sequences and several tools to explore them are available on a public website (SpliceRack). For the U12-type introns, we find several features conserved across species, as well as a clustering of these introns on genes. Using the information content of the splice-site motifs, and the phylogenetic distance between them, we identify: (i) a higher degree of conservation in the exonic portion of the U2-type splice sites in more complex organisms; (ii) conservation of exonic nucleotides for U12-type splice sites; (iii) divergent evolution of C.elegans 3' splice sites (3'ss) and (iv) distinct evolutionary histories of 5' and 3'ss. Our study proves that the identification of broad patterns in naturally-occurring splice sites, through the analysis of genomic datasets, provides mechanistic and evolutionary insights into pre-mRNA splicing.
Collapse
Affiliation(s)
- Nihar Sheth
- Cold Spring Harbor Laboratory1 Bungtown Road, Cold Spring Harbor, NY 11724, USA
| | - Xavier Roca
- Cold Spring Harbor Laboratory1 Bungtown Road, Cold Spring Harbor, NY 11724, USA
| | | | - Ted Roeder
- Cold Spring Harbor Laboratory1 Bungtown Road, Cold Spring Harbor, NY 11724, USA
| | - Adrian R. Krainer
- Cold Spring Harbor Laboratory1 Bungtown Road, Cold Spring Harbor, NY 11724, USA
| | - Ravi Sachidanandam
- Cold Spring Harbor Laboratory1 Bungtown Road, Cold Spring Harbor, NY 11724, USA
| |
Collapse
|
42
|
Patton JR, Padgett RW. Pseudouridine modification in Caenorhabditis elegans spliceosomal snRNAs: unique modifications are found in regions involved in snRNA-snRNA interactions. BMC Mol Biol 2005; 6:20. [PMID: 16236171 PMCID: PMC1276797 DOI: 10.1186/1471-2199-6-20] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2005] [Accepted: 10/19/2005] [Indexed: 01/25/2023] Open
Abstract
Background Pseudouridine (Ψ) is an abundant modified nucleoside in RNA and a number of studies have shown that the presence of Ψ affects RNA structure and function. The positions of Ψ in spliceosomal small nuclear RNAs (snRNAs) have been determined for a number of species but not for the snRNAs from Caenorhabditis elegans (C. elegans), a popular experimental model system of development. Results As a prelude to determining the function of or requirement for this modification in snRNAs, we have mapped the positions of Ψ in U1, U2, U4, U5, and U6 snRNAs from worms using a specific primer extension method. As with other species, C. elegans U2 snRNA has the greatest number of Ψ residues, with nine, located in the 5' half of the U2 snRNA. U5 snRNA has three Ψs, in or near the loop of the large stem-loop that dominates the structure of this RNA. U6 and U1 snRNAs each have one Ψ, and two Ψ residues were found in U4 snRNA. Conclusion The total number of Ψs found in the snRNAs of C. elegans is significantly higher than the minimal amount found in yeasts but it is lower than that seen in sequenced vertebrate snRNAs. When the actual sites of modification on C. elegans snRNAs are compared with other sequenced snRNAs most of the positions correspond to modifications found in other species. However, two of the positions modified on C. elegans snRNAs are unique, one at position 28 on U2 snRNA and one at position 62 on U4 snRNA. Both of these modifications are in regions of these snRNAs that interact with U6 snRNA either in the spliceosome or in the U4/U6 small nuclear ribonucleoprotein particle (snRNP) and the presence of Ψ may be involved in strengthening the intermolecular association of the snRNAs.
Collapse
Affiliation(s)
- Jeffrey R Patton
- Department of Pathology and Microbiology, University of South Carolina School of Medicine Columbia, SC 29208 USA
| | - Richard W Padgett
- Waksman Institute, Department of Molecular Biology and Biochemistry and Cancer Institute of New Jersey, Rutgers University, Piscataway, NJ 08854 USA
| |
Collapse
|