1
|
Mucaki EJ, Shirley BC, Rogan PK. Expression Changes Confirm Genomic Variants Predicted to Result in Allele-Specific, Alternative mRNA Splicing. Front Genet 2020; 11:109. [PMID: 32211018 PMCID: PMC7066660 DOI: 10.3389/fgene.2020.00109] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2019] [Accepted: 01/30/2020] [Indexed: 12/11/2022] Open
Abstract
Splice isoform structure and abundance can be affected by either noncoding or masquerading coding variants that alter the structure or abundance of transcripts. When these variants are common in the population, these nonconstitutive transcripts are sufficiently frequent so as to resemble naturally occurring, alternative mRNA splicing. Prediction of the effects of such variants has been shown to be accurate using information theory-based methods. Single nucleotide polymorphisms (SNPs) predicted to significantly alter natural and/or cryptic splice site strength were shown to affect gene expression. Splicing changes for known SNP genotypes were confirmed in HapMap lymphoblastoid cell lines with gene expression microarrays and custom designed q-RT-PCR or TaqMan assays. The majority of these SNPs (15 of 22) as well as an independent set of 24 variants were then subjected to RNAseq analysis using the ValidSpliceMut web beacon (http://validsplicemut.cytognomix.com), which is based on data from the Cancer Genome Atlas and International Cancer Genome Consortium. SNPs from different genes analyzed with gene expression microarray and q-RT-PCR exhibited significant changes in affected splice site use. Thirteen SNPs directly affected exon inclusion and 10 altered cryptic site use. Homozygous SNP genotypes resulting in stronger splice sites exhibited higher levels of processed mRNA than alleles associated with weaker sites. Four SNPs exhibited variable expression among individuals with the same genotypes, masking statistically significant expression differences between alleles. Genome-wide information theory and expression analyses (RNAseq) in tumor exomes and genomes confirmed splicing effects for 7 of the HapMap SNP and 14 SNPs identified from tumor genomes. q-RT-PCR resolved rare splice isoforms with read abundance too low for statistical significance in ValidSpliceMut. Nevertheless, the web-beacon provides evidence of unanticipated splicing outcomes, for example, intron retention due to compromised recognition of constitutive splice sites. Thus, ValidSpliceMut and q-RT-PCR represent complementary resources for identification of allele-specific, alternative splicing.
Collapse
Affiliation(s)
- Eliseos J Mucaki
- Department of Biochemistry, University of Western Ontario, London, ON, Canada
| | | | - Peter K Rogan
- Department of Biochemistry, University of Western Ontario, London, ON, Canada.,CytoGnomix, London, ON, Canada.,Department of Oncology University of Western Ontario, London, ON, Canada.,Department of Computer Science, University of Western Ontario, London, ON, Canada
| |
Collapse
|
2
|
Bolisetty MT, Beemon KL. Splicing of internal large exons is defined by novel cis-acting sequence elements. Nucleic Acids Res 2012; 40:9244-54. [PMID: 22790982 PMCID: PMC3467050 DOI: 10.1093/nar/gks652] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open
Abstract
Human internal exons have an average size of 147 nt, and most are <300 nt. This small size is thought to facilitate exon definition. A small number of large internal exons have been identified and shown to be alternatively spliced. We identified 1115 internal exons >1000 nt in the human genome; these were found in 5% of all protein-coding genes, and most were expressed and translated. Surprisingly, 40% of these were expressed at levels similar to the flanking exons, suggesting they were constitutively spliced. While all of the large exons had strong splice sites, the constitutively spliced large exons had a higher ratio of splicing enhancers/silencers and were more conserved across mammals than the alternatively spliced large exons. We asked if large exons contain specific sequences that promote splicing and identified 38 sequences enriched in the large exons relative to small exons. The consensus sequence is C-rich with a central invariant CA dinucleotide. Mutation of these sequences in a candidate large exon indicated that these are important for recognition of large exons by the splicing machinery. We propose that these sequences are large exon splicing enhancers (LESEs).
Collapse
Affiliation(s)
- Mohan T Bolisetty
- Department of Biology, Johns Hopkins University, Baltimore, MD 21218, USA.
| | | |
Collapse
|
3
|
Ke S, Chasin LA. Context-dependent splicing regulation: exon definition, co-occurring motif pairs and tissue specificity. RNA Biol 2011; 8:384-8. [PMID: 21444999 DOI: 10.4161/rna.8.3.14458] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
Splicing is a crucial process in gene expression in higher organisms because: 1) most vertebrate genes contain introns; and 2) alternative splicing is primarily responsible for increasing proteomic complexity and functional diversity. Intron definition, the coordination across an intron, is a mandatory step in the splicing process. However, exon definition, the coordination across an exon, is also thought to be required for the splicing of most vertebrate exons. Recent investigations of exon definition complexes provide insights into splicing dynamics. That splicing regulators act in a context-dependent mode is supported by a large collection of evidence. Splicing contexts generally can be classified as cis-element and trans-element contexts. A widespread cis-element context is defined by co-occurring motif pairs to which splicing regulatory factors bind to direct specific molecular interactions. Splicing regulation is also coordinated by trans-element contexts as exemplified by tissue specific splicing, where alternative exons can be coordinately regulated by a few splicing factors, the expression and/or activity of which are concertedly higher or lower in the corresponding tissues.
Collapse
Affiliation(s)
- Shengdong Ke
- Department of Biological Sciences, Columbia University, New York, NY, USA
| | | |
Collapse
|
4
|
Cristea S, Gregory PD, Urnov FD, Cost GJ. Dissection of splicing regulation at an endogenous locus by zinc-finger nuclease-mediated gene editing. PLoS One 2011; 6:e16961. [PMID: 21347446 PMCID: PMC3035666 DOI: 10.1371/journal.pone.0016961] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2010] [Accepted: 01/18/2011] [Indexed: 01/22/2023] Open
Abstract
Sequences governing RNA splicing are difficult to study in situ due to the great difficulty of traditional targeted mutagenesis. Zinc-finger nuclease (ZFN) technology allows for the rapid and efficient introduction of site-specific mutations into mammalian chromosomes. Using a ZFN pair along with a donor plasmid to manipulate the outcomes of DNA repair, we introduced several discrete, targeted mutations into the fourth intron of the endogenous BAX gene in Chinese hamster ovary cells. Putative lariat branch points, the polypyrimidine tract, and the splice acceptor site were targeted. We recovered numerous otherwise isogenic clones carrying the intended mutations and analyzed the effect of each on BAX pre-mRNA splicing. Mutation of one of three possible branch points, the polypyrimidine tract, and the splice acceptor site all caused exclusion of exon five from BAX mRNA. Interestingly, these exon-skipping mutations allowed usage of cryptic splice acceptor sites within intron four. These data demonstrate that ZFN-mediated gene editing is a highly effective tool for dissection of pre-mRNA splicing regulatory sequences in their endogenous context.
Collapse
Affiliation(s)
- Sandra Cristea
- Sangamo BioSciences, Richmond, California, United States of America
| | | | | | | |
Collapse
|
5
|
Lu ZX, Jiang P, Cai JJ, Xing Y. Context-dependent robustness to 5' splice site polymorphisms in human populations. Hum Mol Genet 2010; 20:1084-96. [PMID: 21224255 DOI: 10.1093/hmg/ddq553] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023] Open
Abstract
There has been growing evidence for extensive diversity of alternative splicing in human populations. Genetic variants within the 5' splice site can cause splicing differences among human individuals and constitute an important class of human disease mutations. In this study, we explored whether natural variations of splicing could reveal important signals of 5' splice site recognition. In seven lymphoblastoid cell lines of Asian, European and African ancestry, we identified 1174 single nucleotide polymorphisms (SNPs) within the consensus 5' splice site. We selected 129 SNPs predicted to significantly alter the splice site activity, and quantitatively examined their splicing impact in the seven individuals. Surprisingly, outside of the essential GT dinucleotide position, only ∼14% of the tested SNPs altered splicing. Bioinformatic and minigene analyses identified signals that could modify the impact of 5' splice site polymorphisms, most notably a strong 3' splice site and the presence of intronic motifs downstream of the 5' splice site. Strikingly, we found that the poly-G run, a known intronic splicing enhancer, was the most significantly enriched motif downstream of exons unaffected by 5' splice site SNPs. In TRIM62, the upstream 3' splice site and downstream intronic poly-G runs functioned redundantly to protect an exon from its 5' splice site polymorphism. Collectively, our study reveals widespread context-dependent robustness to 5' splice site polymorphisms in human transcriptomes. Consequently, certain exons are more susceptible to 5' splice site mutations. Additionally, our work demonstrates that genetic diversity of alternative splicing can provide significant insights into the splicing code of mammalian cells.
Collapse
Affiliation(s)
- Zhi-xiang Lu
- Department of Internal Medicine, University of Iowa, 3294 CBRB, 285 Newton Rd, Iowa City, IA 52242, USA
| | | | | | | |
Collapse
|
6
|
Ke S, Chasin LA. Intronic motif pairs cooperate across exons to promote pre-mRNA splicing. Genome Biol 2010; 11:R84. [PMID: 20704715 PMCID: PMC2945786 DOI: 10.1186/gb-2010-11-8-r84] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2010] [Revised: 07/02/2010] [Accepted: 08/12/2010] [Indexed: 02/06/2023] Open
Abstract
BACKGROUND A very early step in splice site recognition is exon definition, a process that is as yet poorly understood. Communication between the two ends of an exon is thought to be required for this step. We report genome-wide evidence for exons being defined through the combinatorial activity of motifs located in flanking intronic regions. RESULTS Strongly co-occurring motifs were found to specifically reside in four intronic regions surrounding a large number of human exons. These paired motifs occur around constitutive and alternative exons but not pseudo exons. Most co-occurring motifs are limited to intronic regions within 100 nucleotides of the exon. They are preferentially associated with weaker exons. Their pairing is conserved in evolution and they exhibit a lower frequency of single nucleotide polymorphism when paired. Paired motifs display specificity with respect to distance from the exon borders and in constitutive versus alternative splicing. Many resemble binding sites for heterogeneous nuclear ribonucleoproteins. Specific pairs are associated with tissue-specific genes, the higher expression of which coincides with that of the pertinent RNA binding proteins. Tested pairs acted synergistically to enhance exon inclusion, and this enhancement was found to be exon-specific. CONCLUSIONS The exon-flanking sequence pairs identified here by genomic analysis promote exon inclusion and may play a role in the exon definition step in pre-mRNA splicing. We propose a model in which multiple concerted interactions are required between exonic sequences and flanking intronic sequences to effect exon definition.
Collapse
Affiliation(s)
- Shengdong Ke
- Department of Biological Sciences, Columbia University, 1212 Amsterdam Ave, MC 2433, New York, NY 10027, USA.
| | | |
Collapse
|
7
|
Ke S, Zhang XHF, Chasin LA. Positive selection acting on splicing motifs reflects compensatory evolution. Genome Res 2008; 18:533-43. [PMID: 18204002 DOI: 10.1101/gr.070268.107] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
We have used comparative genomics to characterize the evolutionary behavior of predicted splicing regulatory motifs. Using base substitution rates in intronic regions as a calibrator for neutral change, we found a strong avoidance of synonymous substitutions that disrupt predicted exonic splicing enhancers or create predicted exonic splicing silencers. These results attest to the functionality of the hexameric motif set used and suggest that they are subject to purifying selection. We also found that synonymous substitutions in constitutive exons tend to create exonic splicing enhancers and to disrupt exonic splicing silencers, implying positive selection for these splicing promoting events. We present evidence that this positive selection is the result of splicing-positive events compensating for splicing-negative events as well as for mutations that weaken splice-site sequences. Such compensatory events include nonsynonymous mutations, synonymous mutations, and mutations at splice sites. Compensation was also seen from the fact that orthologous exons tend to maintain the same number of predicted splicing motifs. Our data fit a splicing compensation model of exon evolution, in which selection for splicing-positive mutations takes place to counter the effect of an ongoing splicing-negative mutational process, with the exon as a whole being conserved as a unit of splicing. In the course of this analysis, we observed that synonymous positions in general are conserved relative to intronic sequences, suggesting that messenger RNA molecules are rich in sequence information for functions beyond protein coding and splicing.
Collapse
Affiliation(s)
- Shengdong Ke
- Department of Biological Sciences Columbia University New York, New York 10027, USA
| | | | | |
Collapse
|
8
|
Kurmangaliyev YZ, Gelfand MS. Computational analysis of splicing errors and mutations in human transcripts. BMC Genomics 2008; 9:13. [PMID: 18194514 PMCID: PMC2234086 DOI: 10.1186/1471-2164-9-13] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2007] [Accepted: 01/14/2008] [Indexed: 01/10/2023] Open
Abstract
Background Most retained introns found in human cDNAs generated by high-throughput sequencing projects seem to result from underspliced transcripts, and thus they capture intermediate steps of pre-mRNA splicing. On the other hand, mutations in splice sites cause exon skipping of the respective exon or activation of pre-existing cryptic sites. Both types of events reflect properties of the splicing mechanism. Results The retained introns were significantly shorter than constitutive ones, and skipped exons are shorter than exons with cryptic sites. Both donor and acceptor splice sites of retained introns were weaker than splice sites of constitutive introns. The authentic acceptor sites affected by mutations were significantly weaker in exons with activated cryptic sites than in skipped exons. The distance from a mutated splice site to the nearest equivalent site is significantly shorter in cases of activated cryptic sites compared to exon skipping events. The prevalence of retained introns within genes monotonically increased in the 5'-to-3' direction (more retained introns close to the 3'-end), consistent with the model of co-transcriptional splicing. The density of exonic splicing enhancers was higher, and the density of exonic splicing silencers lower in retained introns compared to constitutive ones and in exons with cryptic sites compared to skipped exons. Conclusion Thus the analysis of retained introns in human cDNA, exons skipped due to mutations in splice sites and exons with cryptic sites produced results consistent with the intron definition mechanism of splicing of short introns, co-transcriptional splicing, dependence of splicing efficiency on the splice site strength and the density of candidate exonic splicing enhancers and silencers. These results are consistent with other, recently published analyses.
Collapse
Affiliation(s)
- Yerbol Z Kurmangaliyev
- Institute for Information Transmission Problems (the Kharkevich Institute) RAS, Bolshoi Karetny pereulok 19, Moscow, 127994, Russia.
| | | |
Collapse
|
9
|
Abstract
Accurate and efficient splicing of eukaryotic pre-mRNAs requires recognition by trans-acting factors of a complex array of cis-acting RNA elements. Here, we developed a generalized Bayesian network to model the coevolution of splicing cis elements in diverse eukaryotic taxa. Cross-exon but not cross-intron compensatory interactions between the 5' splice site (5'ss) and 3' splice site (3'ss) were observed in human/mouse, indicating that the exon is the primary evolutionary unit in mammals. Studied plants, fungi, and invertebrates exhibited exclusively cross-intron interactions, suggesting that intron definition drives evolution in these organisms. In mammals, 5'ss strength and the strength of several classes of exonic splicing silencers (ESSs) evolved in a correlated way, whereas specific exonic splicing enhancers (ESEs), including motifs associated with hTra2, SRp55, and SRp20, evolved in a compensatory manner relative to the 5'ss and 3'ss. Interactions between specific ESS or ESE motifs were not observed, suggesting that elements bound by different factors are not commonly interchangeable. Thus, the splicing elements defining exons coevolve in a way that preserves overall exon strength, allowing specific elements to substitute for loss or weakening of others.
Collapse
|
10
|
Lai Y, Yue Y, Liu M, Duan D. Synthetic intron improves transduction efficiency of trans-splicing adeno-associated viral vectors. Hum Gene Ther 2007; 17:1036-42. [PMID: 17007565 PMCID: PMC2431463 DOI: 10.1089/hum.2006.17.1036] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open
Abstract
Trans-splicing adeno-associated viral (AAV) vectors hold great promise in many gene therapy applications. We have shown that rational selection of the gene-splitting site in a therapeutic target gene can lead to extremely efficient trans-splicing vectors [Lai, Y., Yue, Y., Liu, M., Ghosh, A., Engelhardt, J.F., Chamberlain, J.S., and Duan, D. (2005). Nat. Biotechnol. 23, 1435-1439]. Our original strategy requires the screening of endogenous introns that are capable of overcoming the mRNA accumulation barrier. To further develop transsplicing vectors, we have tested whether the use of a generic synthetic intron can bypass the labor-intensive intron-screening process. Two previously characterized exon/intron/exon junctions (60/60/61 and 63/63/64, respectively) in the 6 kb minidystrophin gene were used as templates to represent highly efficient (60/60/61) and relatively poor (63/63/64) gene-splitting sites. We compared RNA production from the reconstituted viral genome and transduction efficiency of the trans-splicing vectors in dystrophin-null mdx mouse skeletal muscle. Our results suggest that a synthetic intron can successfully overcome the mRNA accumulation barrier at the exon 63/64 junction. Furthermore, when the gene was split at the exon 63/64 junction, the synthetic intronbased vectors performed better than the endogenous intron-based vectors. When the gene was split at the exon 60/61 junction, we observed only nominal improvement in mRNA production. Nevertheless, vectors based on the exon 60/61 junction remain the best set in transduction efficiency. Taken together, our results suggest that optimizing intron sequence may boost the transduction efficiency of trans-splicing AAV vectors.
Collapse
Affiliation(s)
- Yi Lai
- Department of Molecular Microbiology and Immunology, University of Missouri, Columbia 65212, USA
| | | | | | | |
Collapse
|
11
|
Lei H, Vorechovsky I. Identification of splicing silencers and enhancers in sense Alus: a role for pseudoacceptors in splice site repression. Mol Cell Biol 2005; 25:6912-20. [PMID: 16055705 PMCID: PMC1190243 DOI: 10.1128/mcb.25.16.6912-6920.2005] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023] Open
Abstract
Auxiliary splicing signals in introns play an important role in splice site selection, but these elements are poorly understood. We show that a subset of serine/arginine (SR)-rich proteins activate a cryptic 3' splice site in a sense Alu repeat located in intron 4 of the human LST1 gene. Utilization of this cryptic splice site is controlled by juxtaposed Alu-derived splicing silencers and enhancers between closely linked short tandem repeats TNFd and TNFe. Systematic mutagenesis of these elements showed that AG dinucleotides that were not preceded by purine residues were critical for repressing exon inclusion of a chimeric splicing reporter. Since the splice acceptor-like sequences are present in excess in exonic splicing silencers, these signals may contribute to inhibition of a large number of pseudosites in primate genomes.
Collapse
Affiliation(s)
- Haixin Lei
- University of Southampton School of Medicine, Division of Human Genetics, Duthie Building, UK
| | | |
Collapse
|
12
|
Zhang XHF, Kangsamaksin T, Chao MSP, Banerjee JK, Chasin LA. Exon inclusion is dependent on predictable exonic splicing enhancers. Mol Cell Biol 2005; 25:7323-32. [PMID: 16055740 PMCID: PMC1190244 DOI: 10.1128/mcb.25.16.7323-7332.2005] [Citation(s) in RCA: 93] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open
Abstract
We have previously formulated a list of approximately 2,000 RNA octamers as putative exonic splicing enhancers (PESEs) based on a statistical comparison of human exonic and nonexonic sequences (X. H. Zhang and L. A. Chasin, Genes Dev. 18:1241-1250, 2004). When inserted into a poorly spliced test exon, all eight tested octamers stimulated splicing, a result consistent with their identification as exonic splicing enhancers (ESEs). Here we present a much more stringent test of the validity of this list of PESEs. Twenty-two naturally occurring examples of nonoverlapping PESEs or PESE clusters were identified in six mammalian exons; five of the six exons tested are constitutively spliced. Each of the 22 individual PESEs or PESE clusters was disrupted by site-directed mutagenesis, usually by a single-base substitution. Eighteen of the 22 disruptions (82%) resulted in decreased splicing efficiency. In contrast, 24 control mutations had little or no effect on splicing. This high rate of success suggests that most PESEs function as ESEs in their natural context. Like most exons, these exons contain several PESEs. Since knocking out any one of several could produce a severalfold decrease in splicing efficiency, we conclude that there is little redundancy among ESEs in an exon and that they must work in concert to optimize splicing.
Collapse
Affiliation(s)
- Xiang H-F Zhang
- Department of Biological Sciences, Columbia University, New York, NY 10027, USA
| | | | | | | | | |
Collapse
|
13
|
Zhang XHF, Leslie CS, Chasin LA. Dichotomous splicing signals in exon flanks. Genome Res 2005; 15:768-79. [PMID: 15930489 PMCID: PMC1142467 DOI: 10.1101/gr.3217705] [Citation(s) in RCA: 60] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2004] [Accepted: 03/04/2005] [Indexed: 02/06/2023]
Abstract
Intronic elements flanking the splice-site consensus sequences are thought to play a role in pre-mRNA splicing. However, the generality of this role, the catalog of effective sequences, and the mechanisms involved are still lacking. Using molecular genetic tests, we first showed that the approximately 50-nt intronic flanking sequences of exons beyond the splice-site consensus are generally important for splicing. We then went on to characterize exon flank sequences on a genomic scale. The G+C content of flanks displayed a bimodal distribution reflecting an exaggeration of this base composition in flanks relative to the gene as a whole. We divided all exons into two classes according to their flank G+C content and used computational and statistical methods to define pentamers of high relative abundance and phylogenetic conservation in exon flanks. Upstream pentamers were often common to the two classes, whereas downstream pentamers were totally different. Upstream and downstream pentamers were often identical around low G+C exons, and in contrast, were often complementary around high G+C exons. In agreement with this complementarity, predicted base pairing was more frequent between the flanks of high G+C exons. Pseudo exons did not exhibit this behavior, but rather tended to form base pairs between flanks and exon bodies. We conclude that most exons require signals in their immediate flanks for efficient splicing. G+C content is a sequence feature correlated with many genetic and genomic attributes. We speculate that there may be different mechanisms for splice site recognition depending on G+C content.
Collapse
Affiliation(s)
- Xiang H-F Zhang
- Department of Biological Sciences, Columbia University, New York, New York 10027, USA
| | | | | |
Collapse
|
14
|
Zhang XHF, Heller KA, Hefter I, Leslie CS, Chasin LA. Sequence information for the splicing of human pre-mRNA identified by support vector machine classification. Genome Res 2003; 13:2637-50. [PMID: 14656968 PMCID: PMC403805 DOI: 10.1101/gr.1679003] [Citation(s) in RCA: 94] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2003] [Accepted: 09/10/2003] [Indexed: 12/23/2022]
Abstract
Vertebrate pre-mRNA transcripts contain many sequences that resemble splice sites on the basis of agreement to the consensus,yet these more numerous false splice sites are usually completely ignored by the cellular splicing machinery. Even at the level of exon definition,pseudo exons defined by such false splices sites outnumber real exons by an order of magnitude. We used a support vector machine to discover sequence information that could be used to distinguish real exons from pseudo exons. This machine learning tool led to the definition of potential branch points,an extended polypyrimidine tract,and C-rich and TG-rich motifs in a region limited to 50 nt upstream of constitutively spliced exons. C-rich sequences were also found in a region extending to 80 nt downstream of exons,along with G-triplet motifs. In addition,it was shown that combinations of three bases within the splice donor consensus sequence were more effective than consensus values in distinguishing real from pseudo splice sites; two-way base combinations were optimal for distinguishing 3' splice sites. These data also suggest that interactions between two or more of these elements may contribute to exon recognition,and provide candidate sequences for assessment as intronic splicing enhancers.
Collapse
Affiliation(s)
- Xiang H-F Zhang
- Department of Biological Sciences, Columbia University, New York, New York 10027, USA
| | | | | | | | | |
Collapse
|
15
|
Takahara K, Schwarze U, Imamura Y, Hoffman GG, Toriello H, Smith LT, Byers PH, Greenspan DS. Order of intron removal influences multiple splice outcomes, including a two-exon skip, in a COL5A1 acceptor-site mutation that results in abnormal pro-alpha1(V) N-propeptides and Ehlers-Danlos syndrome type I. Am J Hum Genet 2002; 71:451-65. [PMID: 12145749 PMCID: PMC379186 DOI: 10.1086/342099] [Citation(s) in RCA: 80] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2002] [Accepted: 05/22/2002] [Indexed: 11/03/2022] Open
Abstract
Ehlers-Danlos syndrome (EDS) type I (the classical variety) is a dominantly inherited, genetically heterogeneous connective-tissue disorder. Mutations in the COL5A1 and COL5A2 genes, which encode type V collagen, have been identified in several individuals. Most mutations affect either the triple-helical domain of the protein or the expression of one COL5A1 allele. We identified a novel splice-acceptor mutation (IVS4-2A-->G) in the N-propeptide-encoding region of COL5A1, in one patient with EDS type I. The outcome of this mutation was complex: In the major product, both exons 5 and 6 were skipped; other products included a small amount in which only exon 5 was skipped and an even smaller amount in which cryptic acceptor sites within exon 5 were used. All products were in frame. Pro-alpha1(V) chains with abnormal N-propeptides were secreted and were incorporated into extracellular matrix, and the mutation resulted in dramatic alterations in collagen fibril structure. The two-exon skip occurred in transcripts in which intron 5 was removed rapidly relative to introns 4 and 6, leaving a large (270 nt) composite exon that can be skipped in its entirety. The transcripts in which only exon 5 was skipped were derived from those in which intron 6 was removed prior to intron 5. The use of cryptic acceptor sites in exon 5 occurred in transcripts in which intron 4 was removed subsequent to introns 5 and 6. These findings suggest that the order of intron removal plays an important role in the outcome of splice-site mutations and provide a model that explains why multiple products derive from a mutation at a single splice site.
Collapse
Affiliation(s)
- Kazuhiko Takahara
- Department of Pathology and Laboratory Medicine, University of Wisconsin, Madison; Departments of Pathology and Medicine, University of Washington, Seattle; and Spectrum Health Genetics Services, Grand Rapids, MI
| | - Ulrike Schwarze
- Department of Pathology and Laboratory Medicine, University of Wisconsin, Madison; Departments of Pathology and Medicine, University of Washington, Seattle; and Spectrum Health Genetics Services, Grand Rapids, MI
| | - Yasutada Imamura
- Department of Pathology and Laboratory Medicine, University of Wisconsin, Madison; Departments of Pathology and Medicine, University of Washington, Seattle; and Spectrum Health Genetics Services, Grand Rapids, MI
| | - Guy G. Hoffman
- Department of Pathology and Laboratory Medicine, University of Wisconsin, Madison; Departments of Pathology and Medicine, University of Washington, Seattle; and Spectrum Health Genetics Services, Grand Rapids, MI
| | - Helga Toriello
- Department of Pathology and Laboratory Medicine, University of Wisconsin, Madison; Departments of Pathology and Medicine, University of Washington, Seattle; and Spectrum Health Genetics Services, Grand Rapids, MI
| | - Lynne T. Smith
- Department of Pathology and Laboratory Medicine, University of Wisconsin, Madison; Departments of Pathology and Medicine, University of Washington, Seattle; and Spectrum Health Genetics Services, Grand Rapids, MI
| | - Peter H. Byers
- Department of Pathology and Laboratory Medicine, University of Wisconsin, Madison; Departments of Pathology and Medicine, University of Washington, Seattle; and Spectrum Health Genetics Services, Grand Rapids, MI
| | - Daniel S. Greenspan
- Department of Pathology and Laboratory Medicine, University of Wisconsin, Madison; Departments of Pathology and Medicine, University of Washington, Seattle; and Spectrum Health Genetics Services, Grand Rapids, MI
| |
Collapse
|
16
|
Mount SM. Genomic sequence, splicing, and gene annotation. Am J Hum Genet 2000; 67:788-92. [PMID: 10986039 PMCID: PMC1287883 DOI: 10.1086/303098] [Citation(s) in RCA: 77] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2000] [Accepted: 08/15/2000] [Indexed: 11/03/2022] Open
Affiliation(s)
- S M Mount
- Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, MD 20742, USA.
| |
Collapse
|
17
|
Abstract
Mammalian genes are characterized by relatively small exons surrounded by variable lengths of intronic sequence. Sequences similar to the splice signals that define the 5' and 3' boundaries of these exons are also present in abundance throughout the surrounding introns. What causes the real sites to be distinguished from the multitude of pseudosites in pre-mRNA is unclear. Much progress has been made in defining additional sequence elements that enhance the use of particular sites. Less work has been done on sequences that repress the use of particular splice sites. To find additional examples of sequences that inhibit splicing, we searched human genomic DNA libraries for sequences that would inhibit the inclusion of a constitutively spliced exon. Genetic selection experiments suggested that such sequences were common, and we subsequently tested randomly chosen restriction fragments of about 100 bp. When inserted into the central exon of a three-exon minigene, about one in three inhibited inclusion, revealing a high frequency of inhibitory elements in human DNA. In contrast, only 1 in 27 Escherichia coli DNA fragments was inhibitory. Several previously identified silencing elements derived from alternatively spliced exons functioned weakly in this constitutively spliced exon. In contrast, a high-affinity site for U2AF65 strongly inhibited exon inclusion. Together, our results suggest that splicing occurs in a background of repression and, since many of our inhibitors contain splice like signals, we suggest that repression of some pseudosites may occur through an inhibitory arrangement of these sites.
Collapse
Affiliation(s)
- W G Fairbrother
- Department of Biological Sciences, Columbia University, New York, New York 10027, USA
| | | |
Collapse
|
18
|
Abstract
Splice site consensus sequences alone are insufficient to dictate the recognition of real constitutive splice sites within the typically large transcripts of higher eukaryotes, and large numbers of pseudoexons flanked by pseudosplice sites with good matches to the consensus sequences can be easily designated. In an attempt to identify elements that prevent pseudoexon splicing, we have systematically altered known splicing signals, as well as immediately adjacent flanking sequences, of an arbitrarily chosen pseudoexon from intron 1 of the human hprt gene. The substitution of a 5' splice site that perfectly matches the 5' consensus combined with mutation to match the CAG/G sequence of the 3' consensus failed to get this model pseudoexon included as the central exon in a dhfr minigene context. Provision of a real 3' splice site and a consensus 5' splice site and removal of an upstream inhibitory sequence were necessary and sufficient to confer splicing on the pseudoexon. This activated context also supported the splicing of a second pseudoexon sequence containing no apparent enhancer. Thus, both the 5' splice site sequence and the polypyrimidine tract of the pseudoexon are defective despite their good agreement with the consensus. On the other hand, the pseudoexon body did not exert a negative influence on splicing. The introduction into the pseudoexon of a sequence selected for binding to ASF/SF2 or its replacement with beta-globin exon 2 only partially reversed the effect of the upstream negative element and the defective polypyrimidine tract. These results support the idea that exon-bridging enhancers are not a prerequisite for constitutive exon definition and suggest that intrinsically defective splice sites and negative elements play important roles in distinguishing the real splicing signal from the vast number of false splicing signals.
Collapse
Affiliation(s)
- H Sun
- Department of Biological Sciences, Columbia University, New York, New York 10027, USA
| | | |
Collapse
|
19
|
Ohno K, Brengman JM, Felice KJ, Cornblath DR, Engel AG. Congenital end-plate acetylcholinesterase deficiency caused by a nonsense mutation and an A-->G splice-donor-site mutation at position +3 of the collagenlike-tail-subunit gene (COLQ): how does G at position +3 result in aberrant splicing? Am J Hum Genet 1999; 65:635-44. [PMID: 10441569 PMCID: PMC1377969 DOI: 10.1086/302551] [Citation(s) in RCA: 64] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022] Open
Abstract
Congenital end-plate acetylcholinesterase (AChE) deficiency (CEAD), the cause of a disabling myasthenic syndrome, arises from defects in the COLQ gene, which encodes the AChE triple-helical collagenlike-tail subunit that anchors catalytic subunits of AChE to the synaptic basal lamina. Here we describe a patient with CEAD with a nonsense mutation (R315X) and a splice-donor-site mutation at position +3 of intron 16 (IVS16+3A-->G) of COLQ. Because both A and G are consensus nucleotides at the +3 position of splice-donor sites, we constructed a minigene that spans exons 15-17 and harbors IVS16+3A-->G for expression in COS cells. We found that the mutation causes skipping of exon 16. The mutant splice-donor site of intron 16 harbors five discordant nucleotides (at -3, -2, +3, +4, and +6) that do not base-pair with U1 small-nuclear RNA (snRNA), the molecule responsible for splice-donor-site recognition. Versions of the minigene harboring, at either +4 or +6, nucleotides complementary to U1 snRNA restore normal splicing. Analysis of 1,801 native splice-donor sites reveals that presence of a G nucleotide at +3 is associated with preferential usage, at positions +4 to +6, of nucleotides concordant to U1 snRNA. Analysis of 11 disease-associated IVS+3A-->G mutations indicates that, on average, two of three nucleotides at positions +4 to +6 fail to base-pair, and that the nucleotide at +4 never base-pairs, with U1 snRNA. We conclude that, with G at +3, normal splicing generally depends on the concordance that residues at +4 to +6 have with U1 snRNA, but other cis-acting elements may also be important in assuring the fidelity of splicing.
Collapse
Affiliation(s)
- K Ohno
- Department of Neurology, Mayo Clinic, Rochester, MN 55905, USA
| | | | | | | | | |
Collapse
|
20
|
Wu Q, Krainer AR. AT-AC pre-mRNA splicing mechanisms and conservation of minor introns in voltage-gated ion channel genes. Mol Cell Biol 1999; 19:3225-36. [PMID: 10207048 PMCID: PMC84117 DOI: 10.1128/mcb.19.5.3225] [Citation(s) in RCA: 115] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open
Affiliation(s)
- Q Wu
- Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA
| | | |
Collapse
|
21
|
Rae J, Newburger PE, Dinauer MC, Noack D, Hopkins PJ, Kuruto R, Curnutte JT. X-Linked chronic granulomatous disease: mutations in the CYBB gene encoding the gp91-phox component of respiratory-burst oxidase. Am J Hum Genet 1998; 62:1320-31. [PMID: 9585602 PMCID: PMC1377153 DOI: 10.1086/301874] [Citation(s) in RCA: 136] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022] Open
Abstract
Chronic granulomatous disease (CGD) is a hereditary disorder of host defense due to absent or decreased activity of phagocyte NADPH oxidase. The X-linked form of the disease derives from defects in the CYBB gene, which encodes the 91-kD glycoprotein component (termed "gp91-phox") of the oxidase. We have identified the mutations in the CYBB gene responsible for X-linked CGD in 131 consecutive independent kindreds. Screening by SSCP analysis identified mutations in 124 of the kindreds, and sequencing of all exons and intron boundary regions revealed the other seven mutations. We detected 103 different specific mutations; no single mutation appeared in more than seven independent kindreds. The types of mutations included large and small deletions (11%), frameshifts (24%), nonsense mutations (23%), missense mutations (23%), splice-region mutations (17%), and regulatory-region mutations (2%). The distribution of mutations within the CYBB gene exhibited great heterogeneity, with no apparent mutational hot spots. Evaluation of 87 available mothers revealed X-linked carrier status in all but 10. The heterogeneity of mutations and the lack of any predominant genotype indicate that the disease represents many different mutational events, without a founder effect, as is expected for a disorder with a previously lethal phenotype.
Collapse
Affiliation(s)
- J Rae
- Department of Immunology, Genetech, Inc., South San Francisco, CA, USA
| | | | | | | | | | | | | |
Collapse
|
22
|
Luukkonen BG, Séraphin B. The role of branchpoint-3' splice site spacing and interaction between intron terminal nucleotides in 3' splice site selection in Saccharomyces cerevisiae. EMBO J 1997; 16:779-92. [PMID: 9049307 PMCID: PMC1169679 DOI: 10.1093/emboj/16.4.779] [Citation(s) in RCA: 65] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023] Open
Abstract
A conserved 3' splice site YAG is essential for the second step of pre-mRNA splicing but no trans-acting factor recognizing this sequence has been found. A direct, non-Watson-Crick interaction between the intron terminal nucleotides was suggested to affect YAG selection. The mechanism of YAG recognition was proposed to involve 5' to 3' scanning originating from the branchpoint or the polypyrimidine tract. We have constructed a yeast intron harbouring two closely spaced 3' splice sites. Preferential selection of a wild-type site over mutant ones indicated that the two sites are competing. For two identical sequences, the proximal site is selected. As previously observed, an A at the first intron nucleotide spliced most efficiently with a 3' splice site UAC. In this context, UAA or UAU were also more efficient 3' splice sites than UAG and competed more efficiently than the wild-type sequence with a 3' splice site UAC. We observed that a U at the first intron nucleotide is used for splicing in combination with 3' splice sites UAG, UAA or UAU. Our data indicate that the 3' splice site is not primarily selected through an interaction with the first intron nucleotide. Selection of the 3' splice site depends critically on its distance from the branchpoint but does not occur by a simple leaky scanning mechanism.
Collapse
Affiliation(s)
- B G Luukkonen
- European Molecular Biology Laboratory, Heidelberg, Germany
| | | |
Collapse
|
23
|
Jin Y, Dietz HC, Montgomery RA, Bell WR, McIntosh I, Coller B, Bray PF. Glanzmann thrombasthenia. Cooperation between sequence variants in cis during splice site selection. J Clin Invest 1996; 98:1745-54. [PMID: 8878424 PMCID: PMC507612 DOI: 10.1172/jci118973] [Citation(s) in RCA: 50] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023] Open
Abstract
Glanzmann thrombasthenia (GT), an autosomal recessive bleeding disorder, results from abnormalities in the platelet fibrinogen receptor, GP(IIb)-IIIa (integrin alpha(IIb)beta3). A patient with GT was identified as homozygous for a G-->A mutation 6 bp upstream of the GP(IIIa) exon 9 splice donor site. Patient platelet GP(IIIa) transcripts lacked exon 9 despite normal DNA sequence in all of the cis-acting sequences known to regulate splice site selection. In vitro analysis of transcripts generated from mini-gene constructs demonstrated that exon skipping occurred only when the G-->A mutation was cis to a polymorphism 116 bp upstream, providing precedence that two sequence variations in the same exon which do not alter consensus splice sites and do not generate missense or nonsense mutations, can affect splice site selection. The mutant transcript resulted from utilization of a cryptic splice acceptor site and returned the open reading frame. These data support the hypothesis that pre-mRNA secondary structure and allelic sequence variants can influence splicing and provide new insight into the regulated control of RNA processing. In addition, haplotype analysis suggested that the patient has two identical copies of chromosome 17. Markers studied on three other chromosomes suggested this finding was not due to consanguinity. The restricted phenotype in this patient may provide information regarding the expression of potentially imprinted genes on chromosome 17.
Collapse
Affiliation(s)
- Y Jin
- Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205, USA
| | | | | | | | | | | | | |
Collapse
|
24
|
Nonsense but not missense mutations can decrease the abundance of nuclear mRNA for the mouse major urinary protein, while both types of mutations can facilitate exon skipping. Mol Cell Biol 1994. [PMID: 8065364 DOI: 10.1128/mcb.14.9.6326] [Citation(s) in RCA: 73] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
In an effort to understand the mechanisms by which nonsense codons affect RNA metabolism in mammalian cells, nonsense mutations were generated within the gene for the secretory major urinary protein (MUP) of mice. The translation of MUP mRNA normally begins within exon 1 and terminates within exon 6, the penultimate exon. Through the use of Northern (RNA) blot hybridization and assays that couple reverse transcription and PCR, a nonsense mutation within codon 50 of exon 2 or codon 143 of exon 5 was found to reduce the abundance of fully spliced, nuclear MUP mRNA to 10 to 20% of normal without an additional reduction in the abundance of cytoplasmic mRNA. In contrast, a nonsense mutation within codon 172 of exon 5 was found to have no effects on the abundance of MUP mRNA. These findings suggest that a boundary between nonsense mutations that do and do not reduce the abundance of nuclear mRNA exists within the exon preceding the exon that harbors the normal site of translation termination. In this way, the boundary is analogous to the boundary that exists within the penultimate exon of the human gene for the cytosolic enzyme triosephosphate isomerase. Assays for exon skipping, i.e., the removal of an exon as a part of the flanking introns during the process of splicing, reveal that 0.1, 2.0, and 0.1% of MUP mRNA normally lack exon 5, exon 6, and exons 5 plus 6, respectively. Relative to normal, the two nonsense mutations within exon 5 increase the abundance of RNA lacking exon 5 on average 20-fold and increase the abundance of RNA lacking exons 5 plus 6 on average 5-fold. Since only one of these nonsense mutations also reduces the abundance of fully spliced nuclear mRNA to 10 to 20% of normal, the two mechanisms by which a nonsense mutation can alter nuclear RNA metabolism must be distinct. The analysis of missense mutations within codons 143 and 172, some of which retain the nonsense mutation, indicates that the reduction in the abundance of fully spliced nuclear mRNA is dependent upon the premature termination of MUP mRNA translation, whereas skipping is attributable to nonsense mutation-mediated changes in exon 5 structure rather than to the premature termination of translation. The increase in exon 5 skipping by either the nonsense or missense mutations within codon 172 correlates with a decrease in the complementarity of exon 5 to U1 snRNA. This suggests that a 5' splice site may extend as far as 12 nucleotides into the upstream exon, which is, to our knowledge, the largest extension.
Collapse
|
25
|
Belgrader P, Maquat LE. Nonsense but not missense mutations can decrease the abundance of nuclear mRNA for the mouse major urinary protein, while both types of mutations can facilitate exon skipping. Mol Cell Biol 1994; 14:6326-36. [PMID: 8065364 PMCID: PMC359159 DOI: 10.1128/mcb.14.9.6326-6336.1994] [Citation(s) in RCA: 45] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
In an effort to understand the mechanisms by which nonsense codons affect RNA metabolism in mammalian cells, nonsense mutations were generated within the gene for the secretory major urinary protein (MUP) of mice. The translation of MUP mRNA normally begins within exon 1 and terminates within exon 6, the penultimate exon. Through the use of Northern (RNA) blot hybridization and assays that couple reverse transcription and PCR, a nonsense mutation within codon 50 of exon 2 or codon 143 of exon 5 was found to reduce the abundance of fully spliced, nuclear MUP mRNA to 10 to 20% of normal without an additional reduction in the abundance of cytoplasmic mRNA. In contrast, a nonsense mutation within codon 172 of exon 5 was found to have no effects on the abundance of MUP mRNA. These findings suggest that a boundary between nonsense mutations that do and do not reduce the abundance of nuclear mRNA exists within the exon preceding the exon that harbors the normal site of translation termination. In this way, the boundary is analogous to the boundary that exists within the penultimate exon of the human gene for the cytosolic enzyme triosephosphate isomerase. Assays for exon skipping, i.e., the removal of an exon as a part of the flanking introns during the process of splicing, reveal that 0.1, 2.0, and 0.1% of MUP mRNA normally lack exon 5, exon 6, and exons 5 plus 6, respectively. Relative to normal, the two nonsense mutations within exon 5 increase the abundance of RNA lacking exon 5 on average 20-fold and increase the abundance of RNA lacking exons 5 plus 6 on average 5-fold. Since only one of these nonsense mutations also reduces the abundance of fully spliced nuclear mRNA to 10 to 20% of normal, the two mechanisms by which a nonsense mutation can alter nuclear RNA metabolism must be distinct. The analysis of missense mutations within codons 143 and 172, some of which retain the nonsense mutation, indicates that the reduction in the abundance of fully spliced nuclear mRNA is dependent upon the premature termination of MUP mRNA translation, whereas skipping is attributable to nonsense mutation-mediated changes in exon 5 structure rather than to the premature termination of translation. The increase in exon 5 skipping by either the nonsense or missense mutations within codon 172 correlates with a decrease in the complementarity of exon 5 to U1 snRNA. This suggests that a 5' splice site may extend as far as 12 nucleotides into the upstream exon, which is, to our knowledge, the largest extension.
Collapse
Affiliation(s)
- P Belgrader
- Department of Human Genetics, Roswell Park Cancer Institute, Buffalo, New York 14263
| | | |
Collapse
|
26
|
Sequences homologous to 5' splice sites are required for the inhibitory activity of papillomavirus late 3' untranslated regions. Mol Cell Biol 1994. [PMID: 8035806 DOI: 10.1128/mcb.14.8.5278] [Citation(s) in RCA: 64] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Expression of bovine papillomavirus type 1 (BPV-1) late genes is limited to terminally differentiated keratinocytes in an infected epithelium. We have previously shown that although the BPV-1 late polyadenylation site is functional in nonpermissive cells, a 53-nucleotide (nt) fragment of the late 3' untranslated region acts posttranscriptionally to reduce polyadenylated cytoplasmic RNA levels. This 53-nt fragment does not appear to function by destabilizing polyadenylated cytoplasmic RNA (P. A. Furth and C. C. Baker, J. Virol. 65:5806-5812, 1991). In this study, we used site-directed mutagenesis and deletion analysis to demonstrate that the sequence AAG/GUAAGU, which is identical to the consensus 5' splice site sequence, was both necessary and sufficient for the inhibitory activity of the 53-nt fragment. Furthermore, base pairing between the 5' end of the U1 small nuclear RNA and this 5' splice site-like sequence was shown to be required for the inhibitory activity in vivo. We have also further mapped the human papillomavirus type 16 late 3' inhibitory element (I. M. Kennedy, J. K. Haddow, and J. B. Clements, J. Virol. 65:2093-2097, 1991) to a 51-nt region containing four overlapping sequence motifs with partial homology to 5' splice sites. Mutation of each of these motifs demonstrated that only one of these motifs is required for the inhibitory activity. However, the presence of the other motifs may contribute to the full inhibitory activity of the element. No BPV-1 or human papillomavirus type 16 mRNAs which are spliced by using the potential 5' splice sites present in the viral late 3' untranslated regions have been identified. This suggests that the primary function of these 5' splice site-like sequences is the inhibition of late gene expression. The most likely mechanism of action of these elements is reduction of polyadenylation efficiency, perhaps through interference with 3'-terminal exon definition.
Collapse
|
27
|
Furth PA, Choe WT, Rex JH, Byrne JC, Baker CC. Sequences homologous to 5' splice sites are required for the inhibitory activity of papillomavirus late 3' untranslated regions. Mol Cell Biol 1994; 14:5278-89. [PMID: 8035806 PMCID: PMC359047 DOI: 10.1128/mcb.14.8.5278-5289.1994] [Citation(s) in RCA: 85] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023] Open
Abstract
Expression of bovine papillomavirus type 1 (BPV-1) late genes is limited to terminally differentiated keratinocytes in an infected epithelium. We have previously shown that although the BPV-1 late polyadenylation site is functional in nonpermissive cells, a 53-nucleotide (nt) fragment of the late 3' untranslated region acts posttranscriptionally to reduce polyadenylated cytoplasmic RNA levels. This 53-nt fragment does not appear to function by destabilizing polyadenylated cytoplasmic RNA (P. A. Furth and C. C. Baker, J. Virol. 65:5806-5812, 1991). In this study, we used site-directed mutagenesis and deletion analysis to demonstrate that the sequence AAG/GUAAGU, which is identical to the consensus 5' splice site sequence, was both necessary and sufficient for the inhibitory activity of the 53-nt fragment. Furthermore, base pairing between the 5' end of the U1 small nuclear RNA and this 5' splice site-like sequence was shown to be required for the inhibitory activity in vivo. We have also further mapped the human papillomavirus type 16 late 3' inhibitory element (I. M. Kennedy, J. K. Haddow, and J. B. Clements, J. Virol. 65:2093-2097, 1991) to a 51-nt region containing four overlapping sequence motifs with partial homology to 5' splice sites. Mutation of each of these motifs demonstrated that only one of these motifs is required for the inhibitory activity. However, the presence of the other motifs may contribute to the full inhibitory activity of the element. No BPV-1 or human papillomavirus type 16 mRNAs which are spliced by using the potential 5' splice sites present in the viral late 3' untranslated regions have been identified. This suggests that the primary function of these 5' splice site-like sequences is the inhibition of late gene expression. The most likely mechanism of action of these elements is reduction of polyadenylation efficiency, perhaps through interference with 3'-terminal exon definition.
Collapse
Affiliation(s)
- P A Furth
- Laboratory of Tumor Virus Biology, National Cancer Institute, Bethesda, Maryland 20892
| | | | | | | | | |
Collapse
|
28
|
Chanfreau G, Legrain P, Dujon B, Jacquier A. Interaction between the first and last nucleotides of pre-mRNA introns is a determinant of 3' splice site selection in S. cerevisiae. Nucleic Acids Res 1994; 22:1981-7. [PMID: 8029003 PMCID: PMC308110 DOI: 10.1093/nar/22.11.1981] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023] Open
Abstract
The splicing of group II and nuclear pre-mRNAs introns occurs via a similar splicing pathway and some of the RNA-RNA interactions involved in these splicing reactions show structural similarities. Recently, genetic analyses performed in a group II intron and the yeast nuclear actin gene suggested that non Watson-Crick interactions between intron boundaries are important for the second splicing step efficiency in both classes of introns. We here show that, in the yeast nuclear rp51A intron, a G to A mutation at the first position activates cryptic 3' splice sites with the sequences UAC/ or UAA/. Moreover, the natural 3' splice site could be reactivated by a G to C substitution of the last intron nucleotide. These results demonstrate that the interaction between the first and last intron nucleotides is a conserved feature of nuclear pre-mRNA splicing in yeast and is involved in the mechanism of 3' splice site selection.
Collapse
Affiliation(s)
- G Chanfreau
- Département de Biologie Moléculaire, Institut Pasteur, Paris, France
| | | | | | | |
Collapse
|
29
|
Will K, Dörk T, Stuhrmann M, Meitinger T, Bertele-Harms R, Tümmler B, Schmidtke J. A novel exon in the cystic fibrosis transmembrane conductance regulator gene activated by the nonsense mutation E92X in airway epithelial cells of patients with cystic fibrosis. J Clin Invest 1994; 93:1852-9. [PMID: 7512993 PMCID: PMC294265 DOI: 10.1172/jci117172] [Citation(s) in RCA: 21] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023] Open
Abstract
Cystic fibrosis (CF) is caused by mutations in the cystic fibrosis transmembrane conductance regulator (CFTR) gene. We report on a novel nonsense mutation that leads to exon skipping and the activation of a cryptic exon. Screening of genomic DNA from 700 German patients with CF uncovered four cases with the nonsense mutation E92X, a G-->T transversion that creates a termination codon and affects the first base of exon 4 of the CFTR gene. Lymphocyte RNA of two CF patients heterozygous for E92X was found to contain the wild type sequence and a differentially spliced isoform lacking exon 4. In RNA derived from nasal epithelial cells of E92X patients, a third fragment of longer size was observed. Sequencing revealed the presence of E92X and an additional 183-bp fragment, inserted between exons 3 and 4. The 183-bp sequence was mapped to intron 3 of the CFTR gene. It is flanked by acceptor and donor splice sites. We conclude that the 183-bp fragment in intron 3 is a cryptic CFTR exon that can be activated in epithelial cells by the presence of the E92X mutation. E92X abolishes correctly spliced CFTR mRNA and leads to severe cystic fibrosis.
Collapse
Affiliation(s)
- K Will
- Abteilung für Humangenetik, Medizinische Hochschule, Hannover, Germany
| | | | | | | | | | | | | |
Collapse
|
30
|
Abstract
Exon sizes in vertebrate genes are, with a few exceptions, limited to less than 300 bases. It has been proposed that this limitation may derive from the exon definition model of splice site recognition. In this model, a downstream donor site enhances splicing at the upstream acceptor site of the same exon. This enhancement may require contact between factors bound to each end of the exon; an exon size limitation would promote such contact. To test the idea that proximity was required for exon definition, we inserted random DNA fragments from Escherichia coli into a central exon in a three-exon dihydrofolate reductase minigene and tested whether the expanded exons were efficiently spliced. DNA from a plasmid library of expanded minigenes was used to transfect a CHO cell deletion mutant lacking the dhfr locus. PCR analysis of DNA isolated from the pooled stable cotransfectant populations displayed a range of DNA insert sizes from 50 to 1,500 nucleotides. A parallel analysis of the RNA from this population by reverse transcription followed by PCR showed a similar size distribution. Central exons as large as 1,400 bases could be spliced into mRNA. We also tested individual plasmid clones containing exon inserts of defined sizes. The largest exon included in mRNA was 1,200 bases in length, well above the 300-base limit implied by the survey of naturally occurring exons. We conclude that a limitation in exon size is not part of the exon definition mechanism.
Collapse
|
31
|
Abstract
Exon sizes in vertebrate genes are, with a few exceptions, limited to less than 300 bases. It has been proposed that this limitation may derive from the exon definition model of splice site recognition. In this model, a downstream donor site enhances splicing at the upstream acceptor site of the same exon. This enhancement may require contact between factors bound to each end of the exon; an exon size limitation would promote such contact. To test the idea that proximity was required for exon definition, we inserted random DNA fragments from Escherichia coli into a central exon in a three-exon dihydrofolate reductase minigene and tested whether the expanded exons were efficiently spliced. DNA from a plasmid library of expanded minigenes was used to transfect a CHO cell deletion mutant lacking the dhfr locus. PCR analysis of DNA isolated from the pooled stable cotransfectant populations displayed a range of DNA insert sizes from 50 to 1,500 nucleotides. A parallel analysis of the RNA from this population by reverse transcription followed by PCR showed a similar size distribution. Central exons as large as 1,400 bases could be spliced into mRNA. We also tested individual plasmid clones containing exon inserts of defined sizes. The largest exon included in mRNA was 1,200 bases in length, well above the 300-base limit implied by the survey of naturally occurring exons. We conclude that a limitation in exon size is not part of the exon definition mechanism.
Collapse
Affiliation(s)
- I T Chen
- Department of Biological Sciences, Columbia University, New York, New York 10027
| | | |
Collapse
|