1
|
Zhang XHF, Leslie CS, Chasin LA. Dichotomous splicing signals in exon flanks. Genome Res 2005; 15:768-79. [PMID: 15930489 PMCID: PMC1142467 DOI: 10.1101/gr.3217705] [Citation(s) in RCA: 60] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2004] [Accepted: 03/04/2005] [Indexed: 02/06/2023]
Abstract
Intronic elements flanking the splice-site consensus sequences are thought to play a role in pre-mRNA splicing. However, the generality of this role, the catalog of effective sequences, and the mechanisms involved are still lacking. Using molecular genetic tests, we first showed that the approximately 50-nt intronic flanking sequences of exons beyond the splice-site consensus are generally important for splicing. We then went on to characterize exon flank sequences on a genomic scale. The G+C content of flanks displayed a bimodal distribution reflecting an exaggeration of this base composition in flanks relative to the gene as a whole. We divided all exons into two classes according to their flank G+C content and used computational and statistical methods to define pentamers of high relative abundance and phylogenetic conservation in exon flanks. Upstream pentamers were often common to the two classes, whereas downstream pentamers were totally different. Upstream and downstream pentamers were often identical around low G+C exons, and in contrast, were often complementary around high G+C exons. In agreement with this complementarity, predicted base pairing was more frequent between the flanks of high G+C exons. Pseudo exons did not exhibit this behavior, but rather tended to form base pairs between flanks and exon bodies. We conclude that most exons require signals in their immediate flanks for efficient splicing. G+C content is a sequence feature correlated with many genetic and genomic attributes. We speculate that there may be different mechanisms for splice site recognition depending on G+C content.
Collapse
Affiliation(s)
- Xiang H-F Zhang
- Department of Biological Sciences, Columbia University, New York, New York 10027, USA
| | | | | |
Collapse
|
2
|
Differential block of U small nuclear ribonucleoprotein particle interactions during in vitro splicing of adenovirus E1A transcripts containing abnormally short introns. Mol Cell Biol 1991. [PMID: 1825346 DOI: 10.1128/mcb.11.3.1258] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
We have studied the consequences of decreasing the donor site-branch site distance on splicing factor-splice site interactions by analyzing alternative splicing of adenovirus E1A pre-mRNAs in vitro. We show that the proximal 13S donor site has a cis-inhibiting effect on the 9S and 12S mRNA reactions when it is brought too close to the common branch site, suggesting that the factor interactions in the common 3' part of the intron are impaired by the U1 small nuclear ribonucleoprotein particle (snRNP) binding to the displaced 13S donor site. Further analysis of the interactions was carried out by studying complex assembly and the accessibility to micrococcal nuclease digestion of 5'-truncated E1A substrates containing only splice sites for the 13S mRNA reaction. A deletion which brings the donor site- branch site distance to 49 nucleotides, which is just below the minimal functional distance, results in a complete block of the U4-U5-U6 snRNP binding, whereas a deletion 15 nucleotides larger results in a severe inhibition of the formation of the U2 snRNP-containing complexes. Sequence accessibility analyses performed by using the last mini-intron-containing transcript demonstrate that the interactions of U2 snRNP with the branch site are strongly impaired whereas the initial bindings of U1 snRNP to the donor site and of specific factors to the 3' splice site are not significantly modified. Our results strongly suggest that the interaction of U1 snRNP with the donor site of a mini-intron is stable enough in vitro to affect the succession of events leading to U2 snRNP binding with the branch site.
Collapse
|
3
|
Himmelspach M, Gattoni R, Gerst C, Chebli K, Stévenin J. Differential block of U small nuclear ribonucleoprotein particle interactions during in vitro splicing of adenovirus E1A transcripts containing abnormally short introns. Mol Cell Biol 1991; 11:1258-69. [PMID: 1825346 PMCID: PMC369397 DOI: 10.1128/mcb.11.3.1258-1269.1991] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
We have studied the consequences of decreasing the donor site-branch site distance on splicing factor-splice site interactions by analyzing alternative splicing of adenovirus E1A pre-mRNAs in vitro. We show that the proximal 13S donor site has a cis-inhibiting effect on the 9S and 12S mRNA reactions when it is brought too close to the common branch site, suggesting that the factor interactions in the common 3' part of the intron are impaired by the U1 small nuclear ribonucleoprotein particle (snRNP) binding to the displaced 13S donor site. Further analysis of the interactions was carried out by studying complex assembly and the accessibility to micrococcal nuclease digestion of 5'-truncated E1A substrates containing only splice sites for the 13S mRNA reaction. A deletion which brings the donor site- branch site distance to 49 nucleotides, which is just below the minimal functional distance, results in a complete block of the U4-U5-U6 snRNP binding, whereas a deletion 15 nucleotides larger results in a severe inhibition of the formation of the U2 snRNP-containing complexes. Sequence accessibility analyses performed by using the last mini-intron-containing transcript demonstrate that the interactions of U2 snRNP with the branch site are strongly impaired whereas the initial bindings of U1 snRNP to the donor site and of specific factors to the 3' splice site are not significantly modified. Our results strongly suggest that the interaction of U1 snRNP with the donor site of a mini-intron is stable enough in vitro to affect the succession of events leading to U2 snRNP binding with the branch site.
Collapse
Affiliation(s)
- M Himmelspach
- Unité 184 de Biologie Moléculaire et de Génie Génétique, Institut National de la Santé et de la Recherche Medicale, Faculté de Médecine, Strasbourg, France
| | | | | | | | | |
Collapse
|
4
|
Krainer AR, Conway GC, Kozak D. The essential pre-mRNA splicing factor SF2 influences 5' splice site selection by activating proximal sites. Cell 1990; 62:35-42. [PMID: 2364434 DOI: 10.1016/0092-8674(90)90237-9] [Citation(s) in RCA: 362] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
SF2 is a 33 kd protein factor required for 5' splice site cleavage and lariat formation during pre-mRNA splicing in HeLa cell extracts. In addition to its essential role in constitutive splicing, SF2 can strongly influence 5' splice site selection. When pre-mRNAs containing multiple cis-competing 5' splice sites are spliced in vitro, high concentrations of purified SF2 promote the use of the 5' splice site closest to the 3' splice site. However, SF2 discriminates properly between authentic and cryptic splice sites. These effects of SF2 on splice site selection may reflect the cellular mechanisms that prevent exon skipping and ensure the accuracy of splicing. In addition, alterations in the concentration or activity of SF2, and of other general splicing factors, may serve to regulate alternative splicing in vivo.
Collapse
Affiliation(s)
- A R Krainer
- Cold Spring Harbor Laboratory, New York 11724
| | | | | |
Collapse
|
5
|
Prats AC, Roy C, Wang PA, Erard M, Housset V, Gabus C, Paoletti C, Darlix JL. cis elements and trans-acting factors involved in dimer formation of murine leukemia virus RNA. J Virol 1990; 64:774-83. [PMID: 2153242 PMCID: PMC249172 DOI: 10.1128/jvi.64.2.774-783.1990] [Citation(s) in RCA: 157] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
The genetic material of all retroviruses examined so far consists of two identical RNA molecules joined at their 5' ends by the dimer linkage structure (DLS). Since the precise location of the DLS as well as the mechanism and role(s) of RNA dimerization remain unclear, we analyzed the dimerization process of Moloney murine leukemia virus (MoMuLV) genomic RNA. For this purpose we derived an in vitro model for RNA dimerization. By using this model, murine leukemia virus RNA was shown to form dimeric molecules. Deletion mutagenesis in the 620-nucleotide leader of MoMuLV RNA showed that the dimer promoting sequences are located within the encapsidation element Psi between positions 215 and 420. Furthermore, hybridization assays in which DNA oligomers were used to probe monomer and dimer forms of MoMuLV RNA indicated that the DLS probably maps between positions 280 and 330 from the RNA 5' end. Also, retroviral nucleocapsid protein was shown to catalyze dimerization of MoMuLV RNA and to be tightly bound to genomic dimer RNA in virions. These results suggest that MoMuLV RNA dimerization and encapsidation are probably controlled by the same cis element, Psi, and trans-acting factor, nucleocapsid protein, and thus might be linked during virion formation.
Collapse
Affiliation(s)
- A C Prats
- Labo-Retro, Centre de Recherche de Biochimie et Génétique Cellulaires du C.N.R.S., Toulouse, France
| | | | | | | | | | | | | | | |
Collapse
|
6
|
Bieth E, Gabus C, Darlix JL. A study of the dimer formation of Rous sarcoma virus RNA and of its effect on viral protein synthesis in vitro. Nucleic Acids Res 1990; 18:119-27. [PMID: 2155394 PMCID: PMC330211 DOI: 10.1093/nar/18.1.119] [Citation(s) in RCA: 150] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
The genetic material of all retroviruses examined so far is an RNA dimer where two identical RNA subunits are joined at their 5' ends by a structure named dimer linkage structure (DLS). Since the precise location and structure of the DLS as well as the mechanism and role(s) of RNA dimerization remain unclear, we analysed the dimerization process of Rous sarcoma virus (RSV) RNA. For this purpose we set up an in vitro model for RSV RNA dimerization. Using this model RSV RNA was shown to form dimeric molecules and this dimerization process was greatly activated by nucleocapsid protein (NCp12) of RSV. Furthermore, RSV RNA dimerization was performed in the presence of complementary 5'32P-DNA oligomers in order to probe the monomer and dimer forms of RSV RNA. Data indicated that the DLS of RSV RNA probably maps between positions 544-564 from the 5' end. In an attempt to define sequences needed for the dimerization of RSV RNA, deletion mutageneses were generated in the 5' 600 nt. The results showed that the dimer promoting sequences probably are located within positions 208-270 and 400-600 from the 5' end and hence possibly encompassing the cis-acting elements needed for the specific encapsidation of RSV genomic RNA. Also it is reported that synthesis of the polyprotein precursor Pr76gag is inhibited upon dimerization of RSV RNA. These results suggest that dimerization and encapsidation of genome length RSV RNA might be linked in the course of virion formation since they appear to be under the control of the same cis elements, E and DLS, and the trans-acting factor nucleocapsid protein NCp12.
Collapse
Affiliation(s)
- E Bieth
- LaboRetro, Centre de Recherches de Biochimie et de Génétique Cellulaires du CNRS, Toulouse, France
| | | | | |
Collapse
|
7
|
Nussinov R. Conserved signals around the 5' splice sites in eukaryotic nuclear precursor mRNAs: G-runs are frequent in the introns and C in the exons near both 5' and 3' splice sites. J Biomol Struct Dyn 1989; 6:985-1000. [PMID: 2590511 DOI: 10.1080/07391102.1989.10506526] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
It is known that the GT doublet is well conserved at the 5' exon/intron splice junction and is frequently embedded in the AGGT quartet. Although only the underlined G is invariable, splicing and ligation are accurately executed. In this work we search for additional conserved potential signals which may aid in 5' splice site recognition. Extensive searches which are not limited to a preconceived consensus sequence are carried out. We investigate the distributions of the 256 quartets in a 1000 nucleotide span around the 5' splice sites in approximately 1700 eukaryotic nuclear precursor mRNAs. Several potential signals are noted. Of particular interest are quartets containing runs of G, e.g., G4, G3T, G3C, G3A and AG3 in the intron immediately downstream and some C-containing quartets in the exon upstream of the 5' splice site. In an analogous calculation, (A)GGG(A) has also been found to be frequent in the intron, 60 nucleotides upstream and (A)CCC(A) in the exon downstream of the 3' splice site. These results are consistent with the recent indications that exon sequences may play a role in efficient splicing. Some models are proposed.
Collapse
Affiliation(s)
- R Nussinov
- Sackler Institute of Molecular Medicine, Tel Aviv University, Ramat Aviv, Israel
| |
Collapse
|
8
|
Abstract
Analysis of a 1000 nucleotide span around 664 primate 5' exon/intron junctions revealed frequent recurrences of G-rich runs downstream of the 5' splice sites. In particular, AGGG, GGGA, GGGG, GGGT and TGGG are frequent at this site. Some C-rich quarters are frequent upstream of the 5' splice site. Similar behaviour of these G- and C-rich quartets is indicated for the 587 rodent introns and for a combined eukaryotic file containing 1688 introns. (A)GGG(A) is also frequent in the introns 60 nucleotides upstream of the 3' splice site, and (A)CCC(A) is frequently found in the exons downstream of the 3' site. The same consistent behaviour of the 3' splice sites is obtained as for the 5' sites, for the primates, rodents and combined eukaryotic file. These results suggest that in addition to the well-conserved 5' and 3' splice sequences, exon as well as intron sequences may play a role in nuclear pre-mRNA splicing.
Collapse
Affiliation(s)
- R Nussinov
- Laboratory of Mathematical Biology, National Cancer Institute, Bethesda, MD 20892
| |
Collapse
|
9
|
Hall KB, Green MR, Redfield AG. Structure of a pre-mRNA branch point/3' splice site region. Proc Natl Acad Sci U S A 1988; 85:704-8. [PMID: 3422452 PMCID: PMC279623 DOI: 10.1073/pnas.85.3.704] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
We have analyzed the solution structure of RNA containing the branch point/3' splice site region of the first intervening sequence (IVS1) of human beta-globin pre-mRNA by nuclease mapping and NMR. Nuclease mapping indicates that there are two distinct structural domains: one contains the branch point region, and the other includes the 3' splice site and second exon sequences. In the branch point domain, the adenosine at which the RNA branch forms appears to be in the loop of a stem/loop structure. The branch point structure does not appear to interact with other parts of the RNA, since its unique nuclease digestion pattern is conserved among transcripts containing the entire intron or only the branch point region. This is confirmed by a comparison of the NMR spectra of two RNA transcripts; a distinct set of resonances appears in the spectra of the RNA containing only branch sequences or including 3' splice site/exon 2 sequences. NMR studies further show that the 3' splice site/exon 2 domain has a lower melting temperature than the branch point domain, suggesting that the two regions are distinct dynamically as well as structurally. Nuclease mapping studies of adenovirus major late IVS1 indicate that this RNA has structural features in common with the human beta-globin transcript.
Collapse
Affiliation(s)
- K B Hall
- Biochemistry Department, Brandeis University, Waltham, MA 02254
| | | | | |
Collapse
|
10
|
Abstract
Several 3' splice signals are known todate. At the 3' splice site an AG doublet is frequently found. Just upstream of the splice site there is a string of 6-11 pyrimidines. More recently it has been found that one of the stages in the splicing process involves formation of a lariat, in which the 5' end of the intron forms a 2'-5' branch with an A residue located 18-37 nucleotides upstream of the 3' splice site. The branching-point consensus is weakly defined and consists of the sequence YNYTRAY, where Y is a pyrimidine, R a purine and N any base. The A in the sixth position is the one with which branching occurs. Here we present the results of extensive searches for additional putative signals around the branching-point consensus and the 3' splice site in rodent nuclear precursor mRNAs. The signals obtained for the over 370 rodent introns are compared with those found in a larger eukaryotic sample containing over 900 nuclear pre-mRNA introns. Of particular interest are GGGA and CCCA. In both analyses GGGA occurs about 60 nucleotides upstream and CCCA is found 3-40 nucleotides downstream from the 3' splice site. A model explaining some of the putative signals discussed here is also proposed. This model involves formation of alternate stem-loop structures around the branching point and 3' splice site. Such signals and structures can possibly aid in protein or nucleoprotein branching point and splice site recognition.
Collapse
Affiliation(s)
- R Nussinov
- Sackler Institute of Molecular Medicine, Tel Aviv University, Ramat Aviv, Israel
| |
Collapse
|