101
|
Splicing in the eukaryotic ancestor: form, function and dysfunction. Trends Ecol Evol 2009; 24:447-55. [DOI: 10.1016/j.tree.2009.04.005] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2008] [Revised: 03/30/2009] [Accepted: 04/01/2009] [Indexed: 12/11/2022]
|
102
|
Pohl AA, Sugnet CW, Clark TA, Smith K, Fujita PA, Cline MS. Affy exon tissues: exon levels in normal tissues in human, mouse and rat. ACTA ACUST UNITED AC 2009; 25:2442-3. [PMID: 19570805 PMCID: PMC2735668 DOI: 10.1093/bioinformatics/btp414] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
SUMMARY Most genes in human, mouse and rat produce more than one transcript isoform. The Affymetrix Exon Array is a tool for studying the many processes that regulate RNA production, with separate probesets measuring RNA levels at known and putative exons. For insights on how exons levels vary between normal tissues, we constructed the Affy Exon Tissues track from tissue data published by Affymetrix. This track reports exon probeset intensities as log ratios relative to median values across the dataset and renders them as colored heat maps, to yield quick visual identification of exons with intensities that vary between normal tissues. AVAILABILITY Affy Exon Tissues track is freely available under the UCSC Genome Browser (http://genome.ucsc.edu/) for human (hg18), mouse (mm8 and mm9), and rat (rn4).
Collapse
Affiliation(s)
- Andrew A Pohl
- Center for Biomolecular Science & Engineering, University of California, Santa Cruz, CA 95064, USA
| | | | | | | | | | | |
Collapse
|
103
|
Abstract
Even though nearly every human gene has at least one alternative splice form, very little is so far known about the structure and function of resulting protein products. It is becoming increasingly clear that a significant fraction of all isoforms are products of noisy selection of splice sites and thus contribute little to actual functional diversity, and may potentially be deleterious. In this study, we examine the impact of alternative splicing on protein sequence and structure in three datasets: alternative splicing events conserved across multiple species, alternative splicing events in genes that are strongly linked to disease and all observed alternative splicing events. We find that the vast majority of all alternative isoforms result in unstable protein conformations. In contrast to that, the small subset of isoforms conserved across species tends to maintain protein structural integrity to a greater extent. Alternative splicing in disease-associated genes produces unstable structures just as frequently as all other genes, indicating that selection to reduce the effects of alternative splicing on this set is not especially pronounced. Overall, the properties of alternative spliced proteins are consistent with the outcome of noisy selection of splice sites by splicing machinery.
Collapse
Affiliation(s)
- Eugene Melamud
- Center for Advanced Research in Biotechnology, University of Maryland Biotechnology Institute, 9600 Gudelsky Drive, Rockville, MD 20850, USA.
| | | |
Collapse
|
104
|
Evolution of alternative splicing regulation: changes in predicted exonic splicing regulators are not associated with changes in alternative splicing levels in primates. PLoS One 2009; 4:e5800. [PMID: 19495418 PMCID: PMC2686173 DOI: 10.1371/journal.pone.0005800] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2009] [Accepted: 05/12/2009] [Indexed: 12/12/2022] Open
Abstract
Alternative splicing is tightly regulated in a spatio-temporal and quantitative manner. This regulation is achieved by a complex interplay between spliceosomal (trans) factors that bind to different sequence (cis) elements. cis-elements reside in both introns and exons and may either enhance or silence splicing. Differential combinations of cis-elements allows for a huge diversity of overall splicing signals, together comprising a complex ‘splicing code’. Many cis-elements have been identified, and their effects on exon inclusion levels demonstrated in reporter systems. However, the impact of interspecific differences in these elements on the evolution of alternative splicing levels has not yet been investigated at genomic level. Here we study the effect of interspecific differences in predicted exonic splicing regulators (ESRs) on exon inclusion levels in human and chimpanzee. For this purpose, we compiled and studied comprehensive datasets of predicted ESRs, identified by several computational and experimental approaches, as well as microarray data for changes in alternative splicing levels between human and chimpanzee. Surprisingly, we found no association between changes in predicted ESRs and changes in alternative splicing levels. This observation holds across different ESR exon positions, exon lengths, and 5′ splice site strengths. We suggest that this lack of association is mainly due to the great importance of context for ESR functionality: many ESR-like motifs in primates may have little or no effect on splicing, and thus interspecific changes at short-time scales may primarily occur in these effectively neutral ESRs. These results underscore the difficulties of using current computational ESR prediction algorithms to identify truly functionally important motifs, and provide a cautionary tale for studies of the effect of SNPs on splicing in human disease.
Collapse
|
105
|
Comparative component analysis of exons with different splicing frequencies. PLoS One 2009; 4:e5387. [PMID: 19404386 PMCID: PMC2671145 DOI: 10.1371/journal.pone.0005387] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2008] [Accepted: 03/31/2009] [Indexed: 12/12/2022] Open
Abstract
Transcriptional isoforms are not just random combinations of exons. What has caused exons to be differentially spliced and whether exons with different splicing frequencies are subjected to divergent regulation by potential elements or splicing signals? Beyond the conventional classification for alternatively spliced exons (ASEs) and constitutively spliced exons (CSEs), we have classified exons from alternatively spliced human genes and their mouse orthologs (12,314 and 5,464, respectively) into four types based on their splicing frequencies. Analysis has indicated that different groups of exons presented divergent compositional and regulatory properties. Interestingly, with the decrease of splicing frequency, exons tend to have greater lengths, higher GC content, and contain more splicing elements and repetitive elements, which seem to imply that the splicing frequency is influenced by such factors. Comparison of non-alternatively spliced (NAS) mouse genes with alternatively spliced human orthologs also suggested that exons with lower splicing frequencies may be newly evolved ones which gained functions with splicing frequencies altered through the evolution. Our findings have revealed for the first time that certain factors may have critical influence on the splicing frequency, suggesting that exons with lower splicing frequencies may originate from old repetitive sequences, with splicing sites altered by mutation, gaining novel functions and become more frequently spliced.
Collapse
|
106
|
Li H, Liu G, Yu J, Cao W, Lobo VG, Xie J. In vivo selection of kinase-responsive RNA elements controlling alternative splicing. J Biol Chem 2009; 284:16191-16201. [PMID: 19386606 DOI: 10.1074/jbc.m900393200] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
Alternative pre-mRNA splicing is often controlled by cell signals, for example, those activating the cAMP-dependent protein kinase (PKA) or the Ca2+/calmodulin-dependent protein kinase IV (CaMKIV). We have shown that CaMKIV regulates alternative splicing through short CA repeats and hnRNP L. Here we use a splicing reporter that shows PKA/CaMKIV promotion of exon inclusion to select from exons containing random 13-nt sequences for RNA elements responsive to the kinases in cultured cells. This selection not only identified both PKA- and CaMKIV-responsive elements that are similar to the CaMKIV-responsive RNA element 1 (CaRRE1) or CA repeats, but also A-rich elements not previously known to respond to these kinases. Consistently, hnRNP L is identified as a factor binding the CA-rich elements. Analyses of the motifs in the highly responsive elements indicate that they are indeed critical for the kinase effect and are enriched in alternative exons. Interestingly, a CAAAAAA motif is sufficient for the PKA/CaMKIV-regulated splicing of the exon 16 of the CaMK kinase beta1 (CaMKK2) transcripts, implying a role of this motif in signaling cross-talk or feedback regulation between these kinases through alternative splicing. Therefore, these experiments identified a group of RNA elements responsive to PKA and CaMKIV from in vivo selection. This also provides an approach for selecting RNA elements similarly responsive to other cell signals controlling alternative splicing.
Collapse
Affiliation(s)
- Hongzhao Li
- From the Department of Physiology, Faculty of Medicine, University of Manitoba, Winnipeg, Manitoba R3E 0J9, Canada
| | - Guodong Liu
- From the Department of Physiology, Faculty of Medicine, University of Manitoba, Winnipeg, Manitoba R3E 0J9, Canada
| | - Jiankun Yu
- From the Department of Physiology, Faculty of Medicine, University of Manitoba, Winnipeg, Manitoba R3E 0J9, Canada
| | - Wenguang Cao
- From the Department of Physiology, Faculty of Medicine, University of Manitoba, Winnipeg, Manitoba R3E 0J9, Canada
| | - Vincent G Lobo
- From the Department of Physiology, Faculty of Medicine, University of Manitoba, Winnipeg, Manitoba R3E 0J9, Canada
| | - Jiuyong Xie
- From the Department of Physiology, Faculty of Medicine, University of Manitoba, Winnipeg, Manitoba R3E 0J9, Canada.
| |
Collapse
|
107
|
A global view of cancer-specific transcript variants by subtractive transcriptome-wide analysis. PLoS One 2009; 4:e4732. [PMID: 19266097 PMCID: PMC2648985 DOI: 10.1371/journal.pone.0004732] [Citation(s) in RCA: 66] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2008] [Accepted: 01/29/2009] [Indexed: 01/18/2023] Open
Abstract
BACKGROUND Alternative pre-mRNA splicing (AS) plays a central role in generating complex proteomes and influences development and disease. However, the regulation and etiology of AS in human tumorigenesis is not well understood. METHODOLOGY/PRINCIPAL FINDINGS A Basic Local Alignment Search Tool database was constructed for the expressed sequence tags (ESTs) from all available databases of human cancer and normal tissues. An insertion or deletion in the alignment of EST/EST was used to identify alternatively spliced transcripts. Alignment of the ESTs with the genomic sequence was further used to confirm AS. Alternatively spliced transcripts in each tissue were then subtractively cross-screened to obtain tissue-specific variants. We systematically identified and characterized cancer/tissue-specific and alternatively spliced variants in the human genome based on a global view. We identified 15,093 cancer-specific variants of 9,989 genes from 27 types of human cancers and 14,376 normal tissue-specific variants of 7,240 genes from 35 normal tissues, which cover the main types of human tumors and normal tissues. Approximately 70% of these transcripts are novel. These data were integrated into a database HCSAS (http://202.114.72.39/database/human.html, pass:68756253). Moreover, we observed that the cancer-specific AS of both oncogenes and tumor suppressor genes are associated with specific cancer types. Cancer shows a preference in the selection of alternative splice-sites and utilization of alternative splicing types. CONCLUSIONS/SIGNIFICANCE These features of human cancer, together with the discovery of huge numbers of novel splice forms for cancer-associated genes, suggest an important and global role of cancer-specific AS during human tumorigenesis. We advise the use of cancer-specific alternative splicing as a potential source of new diagnostic, prognostic, predictive, and therapeutic tools for human cancer. The global view of cancer-specific AS is not only useful for exploring the complexity of the cancer transcriptome but also widens the eyeshot of clinical research.
Collapse
|
108
|
Yeo GW, Coufal NG, Liang TY, Peng GE, Fu XD, Gage FH. An RNA code for the FOX2 splicing regulator revealed by mapping RNA-protein interactions in stem cells. Nat Struct Mol Biol 2009; 16:130-7. [PMID: 19136955 DOI: 10.1038/nsmb.1545] [Citation(s) in RCA: 457] [Impact Index Per Article: 30.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2008] [Accepted: 12/11/2008] [Indexed: 12/13/2022]
Abstract
The elucidation of a code for regulated splicing has been a long-standing goal in understanding the control of post-transcriptional gene expression events that are crucial for cell survival, differentiation and development. We decoded functional RNA elements in vivo by constructing an RNA map for the cell type-specific splicing regulator FOX2 (also known as RBM9) via cross-linking immunoprecipitation coupled with high-throughput sequencing (CLIP-seq) in human embryonic stem cells. The map identified a large cohort of specific FOX2 targets, many of which are themselves splicing regulators, and comparison between the FOX2 binding profile and validated splicing events revealed a general rule for FOX2-regulated exon inclusion or skipping in a position-dependent manner. These findings suggest that FOX2 functions as a critical regulator of a splicing network, and we further show that FOX2 is important for the survival of human embryonic stem cells.
Collapse
Affiliation(s)
- Gene W Yeo
- Crick-Jacobs Center for Theoretical and Computational Biology, Salk Institute, 10010 North Torrey Pines Road, La Jolla, California 92037, USA.
| | | | | | | | | | | |
Collapse
|
109
|
Irimia M, Rukov JL, Roy SW, Vinther J, Garcia-Fernandez J. Quantitative regulation of alternative splicing in evolution and development. Bioessays 2009; 31:40-50. [DOI: 10.1002/bies.080092] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
|
110
|
Determination of tag density required for digital transcriptome analysis: application to an androgen-sensitive prostate cancer model. Proc Natl Acad Sci U S A 2008; 105:20179-84. [PMID: 19088194 DOI: 10.1073/pnas.0807121105] [Citation(s) in RCA: 82] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
High-throughput sequencing has rapidly gained popularity for transcriptome analysis in mammalian cells because of its ability to generate digital and quantitative information on annotated genes and to detect transcripts and mRNA isoforms. Here, we described a double-random priming method for deep sequencing to profile double poly(A)-selected RNA from LNCaP cells before and after androgen stimulation. From approximately 20 million sequence tags, we uncovered 71% of annotated genes and identified hormone-regulated gene expression events that are highly correlated with quantitative real time PCR measurement. A fraction of the sequence tags were mapped to constitutive and alternative splicing events to detect known and new mRNA isoforms expressed in the cell. Finally, curve fitting was used to estimate the number of tags necessary to reach a "saturating" discovery rate among individual applications. This study provides a general guide for analysis of gene expression and alternative splicing by deep sequencing.
Collapse
|
111
|
A postnatal switch of CELF and MBNL proteins reprograms alternative splicing in the developing heart. Proc Natl Acad Sci U S A 2008; 105:20333-8. [PMID: 19075228 DOI: 10.1073/pnas.0809045105] [Citation(s) in RCA: 382] [Impact Index Per Article: 23.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
From a large-scale screen using splicing microarrays and RT-PCR, we identified 63 alternative splicing (AS) events that are coordinated in 3 distinct temporal patterns during mouse heart development. More than half of these splicing transitions are evolutionarily conserved between mouse and chicken. Computational analysis of the introns flanking these splicing events identified enriched and conserved motifs including binding sites for CUGBP and ETR-3-like factors (CELF), muscleblind-like (MBNL) and Fox proteins. We show that CELF proteins are down-regulated >10-fold during heart development, and MBNL1 protein is concomitantly up-regulated nearly 4-fold. Using transgenic and knockout mice, we show that reproducing the embryonic expression patterns for CUGBP1 and MBNL1 in adult heart induces the embryonic splicing patterns for more than half of the developmentally regulated AS transitions. These findings indicate that CELF and MBNL proteins are determinative for a large subset of splicing transitions that occur during postnatal heart development.
Collapse
|
112
|
Kechris K, Yang YH, Yeh RF. Prediction of alternatively skipped exons and splicing enhancers from exon junction arrays. BMC Genomics 2008; 9:551. [PMID: 19021909 PMCID: PMC2631580 DOI: 10.1186/1471-2164-9-551] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2008] [Accepted: 11/20/2008] [Indexed: 12/22/2022] Open
Abstract
Background Alternative splicing of exons in a pre-mRNA transcript is an important mechanism which contributes to protein diversity in human. Arrays for detecting alternative splicing are available using several different probe designs, including those based on exon-junctions. In this work, we introduce a new method for predicting alternatively skipped exons from exon-junction arrays. Predictions based on our method are compared against controls and their sequences are analyzed to identify motifs important for regulating alternative splicing. Results Our comparison of several alternative methods shows that an exon-skipping score based on neighboring junctions best discriminates between positive and negative controls. Sequence analysis of our predicted exons confirms the presence of known splicing regulatory sequences. In addition, we also derive a set of development-related alternatively spliced genes based on fetal versus adult tissue comparisons and find that our predictions are consistent with their functional annotations. Ab initio motif finding algorithms are applied to identify several motifs that may be relevant for splicing during development. Conclusion This work describes a new method for analyzing exon-junction arrays, identifies sequence motifs that are specific for alternative and constitutive splicing and suggests a role for several known splicing factors and their motifs in developmental regulation.
Collapse
Affiliation(s)
- Katerina Kechris
- Department of Biostatistics and Informatics, Colorado School of Public Health, University of Colorado Denver, 4200 East 9th Avenue, B-119, Denver, CO 80262, USA.
| | | | | |
Collapse
|
113
|
Sinha R, Hiller M, Pudimat R, Gausmann U, Platzer M, Backofen R. Improved identification of conserved cassette exons using Bayesian networks. BMC Bioinformatics 2008; 9:477. [PMID: 19014490 PMCID: PMC2621368 DOI: 10.1186/1471-2105-9-477] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2008] [Accepted: 11/12/2008] [Indexed: 12/14/2022] Open
Abstract
Background Alternative splicing is a major contributor to the diversity of eukaryotic transcriptomes and proteomes. Currently, large scale detection of alternative splicing using expressed sequence tags (ESTs) or microarrays does not capture all alternative splicing events. Moreover, for many species genomic data is being produced at a far greater rate than corresponding transcript data, hence in silico methods of predicting alternative splicing have to be improved. Results Here, we show that the use of Bayesian networks (BNs) allows accurate prediction of evolutionary conserved exon skipping events. At a stringent false positive rate of 0.5%, our BN achieves an improved true positive rate of 61%, compared to a previously reported 50% on the same dataset using support vector machines (SVMs). Incorporating several novel discriminative features such as intronic splicing regulatory elements leads to the improvement. Features related to mRNA secondary structure increase the prediction performance, corroborating previous findings that secondary structures are important for exon recognition. Random labelling tests rule out overfitting. Cross-validation on another dataset confirms the increased performance. When using the same dataset and the same set of features, the BN matches the performance of an SVM in earlier literature. Remarkably, we could show that about half of the exons which are labelled constitutive but receive a high probability of being alternative by the BN, are in fact alternative exons according to the latest EST data. Finally, we predict exon skipping without using conservation-based features, and achieve a true positive rate of 29% at a false positive rate of 0.5%. Conclusion BNs can be used to achieve accurate identification of alternative exons and provide clues about possible dependencies between relevant features. The near-identical performance of the BN and SVM when using the same features shows that good classification depends more on features than on the choice of classifier. Conservation based features continue to be the most informative, and hence distinguishing alternative exons from constitutive ones without using conservation based features remains a challenging problem.
Collapse
Affiliation(s)
- Rileen Sinha
- Genome Analysis, Leibniz Institute for Age Research, Fritz Lipmann Institute, Jena, Germany.
| | | | | | | | | | | |
Collapse
|
114
|
Castle JC, Zhang C, Shah JK, Kulkarni AV, Kalsotra A, Cooper TA, Johnson JM. Expression of 24,426 human alternative splicing events and predicted cis regulation in 48 tissues and cell lines. Nat Genet 2008; 40:1416-25. [PMID: 18978788 PMCID: PMC3197713 DOI: 10.1038/ng.264] [Citation(s) in RCA: 242] [Impact Index Per Article: 15.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2008] [Accepted: 09/25/2008] [Indexed: 01/14/2023]
Abstract
Alternative pre–messenger RNA splicing impacts development, physiology, and disease, but its regulation in humans is not well understood, partially due to the limited scale to which the expression of specific splicing events has been measured. We generated the first genome-scale expression compendium of human alternative splicing events using custom whole-transcript microarrays monitoring expression of 24,426 alternative splicing events in 48 diverse human samples. Over 11,700 genes and 9,500 splicing events were differentially expressed, providing a rich resource for studying splicing regulation. An unbiased, systematic screen of 21,760 4-mer to 7-mer words for cis-regulatory motifs identified 143 RNA 'words' enriched near regulated cassette exons, including six clusters of motifs represented by UCUCU, UGCAUG, UGCU, UGUGU, UUUU, and AGGG, which map to trans-acting regulators PTB, Fox, Muscleblind, CELF/CUG-BP, TIA-1, and hnRNP F/H, respectively. Each cluster showed a distinct pattern of genomic location and tissue specificity. For example, UCUCU occurs 110 to 35 nucleotides preceding cassette exons upregulated in brain and striated muscle but depleted in other tissues. UCUCU and UGCAUG appear to have similar function but independent action, occurring 5' and 3', respectively, of 33% of the cassette exons upregulated in skeletal muscle but co-occurring for only 2%.
Collapse
|
115
|
Width of gene expression profile drives alternative splicing. PLoS One 2008; 3:e3587. [PMID: 18974852 PMCID: PMC2575406 DOI: 10.1371/journal.pone.0003587] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2008] [Accepted: 10/09/2008] [Indexed: 01/26/2023] Open
Abstract
Alternative splicing generates an enormous amount of functional and proteomic diversity in metazoan organisms. This process is probably central to the macromolecular and cellular complexity of higher eukaryotes. While most studies have focused on the molecular mechanism triggering and controlling alternative splicing, as well as on its incidence in different species, its maintenance and evolution within populations has been little investigated. Here, we propose to address these questions by comparing the structural characteristics as well as the functional and transcriptional profiles of genes with monomorphic or polymorphic splicing, referred to as MS and PS genes, respectively. We find that MS and PS genes differ particularly in the number of tissues and cell types where they are expressed.We find a striking deficit of PS genes on the sex chromosomes, particularly on the Y chromosome where it is shown not to be due to the observed lower breadth of expression of genes on that chromosome. The development of a simple model of evolution of cis-regulated alternative splicing leads to predictions in agreement with these observations. It further predicts the conditions for the emergence and the maintenance of cis-regulated alternative splicing, which are both favored by the tissue specific expression of splicing variants. We finally propose that the width of the gene expression profile is an essential factor for the acquisition of new transcript isoforms that could later be maintained by a new form of balancing selection.
Collapse
|
116
|
Zhang C, Zhang Z, Castle J, Sun S, Johnson J, Krainer AR, Zhang MQ. Defining the regulatory network of the tissue-specific splicing factors Fox-1 and Fox-2. Genes Dev 2008; 22:2550-63. [PMID: 18794351 DOI: 10.1101/gad.1703108] [Citation(s) in RCA: 241] [Impact Index Per Article: 15.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
The precise regulation of many alternative splicing (AS) events by specific splicing factors is essential to determine tissue types and developmental stages. However, the molecular basis of tissue-specific AS regulation and the properties of splicing regulatory networks (SRNs) are poorly understood. Here we comprehensively predict the targets of the brain- and muscle-specific splicing factor Fox-1 (A2BP1) and its paralog Fox-2 (RBM9) and systematically define the corresponding SRNs genome-wide. Fox-1/2 are conserved from worm to human, and specifically recognize the RNA element UGCAUG. We integrate Fox-1/2-binding specificity with phylogenetic conservation, splicing microarray data, and additional computational and experimental characterization. We predict thousands of Fox-1/2 targets with conserved binding sites, at a false discovery rate (FDR) of approximately 24%, including many validated experimentally, suggesting a surprisingly extensive SRN. The preferred position of the binding sites differs according to AS pattern, and determines either activation or repression of exon recognition by Fox-1/2. Many predicted targets are important for neuromuscular functions, and have been implicated in several genetic diseases. We also identified instances of binding site creation or loss in different vertebrate lineages and human populations, which likely reflect fine-tuning of gene expression regulation during evolution.
Collapse
Affiliation(s)
- Chaolin Zhang
- Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA
| | | | | | | | | | | | | |
Collapse
|
117
|
Identification of nuclear and cytoplasmic mRNA targets for the shuttling protein SF2/ASF. PLoS One 2008; 3:e3369. [PMID: 18841201 PMCID: PMC2556390 DOI: 10.1371/journal.pone.0003369] [Citation(s) in RCA: 91] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2008] [Accepted: 07/31/2008] [Indexed: 12/15/2022] Open
Abstract
The serine and arginine-rich protein family (SR proteins) are highly conserved regulators of pre-mRNA splicing. SF2/ASF, a prototype member of the SR protein family, is a multifunctional RNA binding protein with roles in pre-mRNA splicing, mRNA export and mRNA translation. These observations suggest the intriguing hypothesis that SF2/ASF may couple splicing and translation of specific mRNA targets in vivo. Unfortunately the paucity of endogenous mRNA targets for SF2/ASF has hindered testing of this hypothesis. Here, we identify endogenous mRNAs directly cross-linked to SF2/ASF in different sub-cellular compartments. Cross-Linking Immunoprecipitation (CLIP) captures the in situ specificity of protein-RNA interaction and allows for the simultaneous identification of endogenous RNA targets as well as the locations of binding sites within the RNA transcript. Using the CLIP method we identified 326 binding sites for SF2/ASF in RNA transcripts from 180 protein coding genes. A purine-rich consensus motif was identified in binding sites located within exon sequences but not introns. Furthermore, 72 binding sites were occupied by SF2/ASF in different sub-cellular fractions suggesting that these binding sites may influence the splicing or translational control of endogenous mRNA targets. We demonstrate that ectopic expression of SF2/ASF regulates the splicing and polysome association of transcripts derived from the SFRS1, PABC1, NETO2 and ENSA genes. Taken together the data presented here indicate that SF2/ASF has the capacity to co-regulate the nuclear and cytoplasmic processing of specific mRNAs and provide further evidence that the nuclear history of an mRNA may influence its cytoplasmic fate.
Collapse
|
118
|
Hewetson A, Wright-Pastusek AE, Helmer RA, Wesley KA, Chilton BS. Conservation of inter-protein binding sites in RUSH and RFBP, an ATP11B isoform. Mol Cell Endocrinol 2008; 292:79-86. [PMID: 18584949 PMCID: PMC2575751 DOI: 10.1016/j.mce.2008.05.007] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/24/2008] [Revised: 05/01/2008] [Accepted: 05/06/2008] [Indexed: 11/25/2022]
Abstract
Isoforms of RUSH interact with a RING-finger binding protein (RFBP), which is a splice variant of the Type IV P-type ATPase, ATP11B. Splice arrays and RT-PCR showed that although most splice variants in RUSH and ATP11B are conserved in human and rabbit, the RFBP isoform is specific to rabbit. Interactions between the discontinuous PVITHC-HAKCPL sequence in the RING-domain of RUSH and the KVIRLIKIS sequence in the catalytic loop of RFBP were first identified with pull-down assays. Fine mapping involved probing CLIPS-constrained RING peptides with GST-tagged KVIRLIKIS. When the companion site in RFBP was fine mapped by replacement analysis with MBP-tagged RING, a four-fold increase in binding was noted for the KVIRLDKIS mutant. Direct comparison of splicing events in the RUSH and ATP11B genes between human and rabbit shows high structural stability in these protein interactions sites, which are 100% conserved in all mammalian orthologs.
Collapse
Affiliation(s)
- Aveline Hewetson
- Department of Cell Biology & Biochemistry, Rebecca Sealy Hospital, University of Texas Medical Branch, Galveston, TX 77555
| | - Amber E. Wright-Pastusek
- Texas Tech University Health Sciences Center, Lubbock, TX 79430 and Psych/Behavioral Sciences HS, Rebecca Sealy Hospital, University of Texas Medical Branch, Galveston, TX 77555
| | - Rebecca A. Helmer
- Department of Cell Biology & Biochemistry, Rebecca Sealy Hospital, University of Texas Medical Branch, Galveston, TX 77555
| | - Kerrie A. Wesley
- Department of Cell Biology & Biochemistry, Rebecca Sealy Hospital, University of Texas Medical Branch, Galveston, TX 77555
| | - Beverly S. Chilton
- Department of Cell Biology & Biochemistry, Rebecca Sealy Hospital, University of Texas Medical Branch, Galveston, TX 77555
| |
Collapse
|
119
|
Friedman BA, Stadler MB, Shomron N, Ding Y, Burge CB. Ab initio identification of functionally interacting pairs of cis-regulatory elements. Genome Res 2008; 18:1643-51. [PMID: 18799692 DOI: 10.1101/gr.080085.108] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Cooperatively acting pairs of cis-regulatory elements play important roles in many biological processes. Here, we describe a statistical approach, compositionally orthogonalized co-occurrence analysis (coCOA) that detects pairs of oligonucleotides that preferentially co-occur in pairs of sequence regions, controlling for correlations between the compositions of the analyzed regions. coCOA identified three clusters of oligonucleotide pairs that frequently co-occur at 5' and 3' ends of human and mouse introns. The largest cluster involved GC-rich sequences at the 5' ends of introns that co-occur and are co-conserved with specific AU-rich sequences near intron 3' ends. These motifs are preferentially conserved when they occur together, as measured by a new co-conservation measure, supporting common in vivo function. These motif pairs are also enriched in introns flanking alternative "cassette" exons, suggesting a role in silencing of intervening exons, and we showed that these motifs can cooperatively silence splicing of an intervening exon in a splicing reporter assay. This approach can be easily generalized to problems beyond RNA splicing.
Collapse
Affiliation(s)
- Brad A Friedman
- Department of Biology, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA.
| | | | | | | | | |
Collapse
|
120
|
Copley RR. The animal in the genome: comparative genomics and evolution. Philos Trans R Soc Lond B Biol Sci 2008; 363:1453-61. [PMID: 18192189 PMCID: PMC2614226 DOI: 10.1098/rstb.2007.2235] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
Comparisons between completely sequenced metazoan genomes have generally emphasized how similar their encoded protein content is, even when the comparison is between phyla. Given the manifest differences between phyla and, in particular, intuitive notions that some animals are more complex than others, this creates something of a paradox. Simplistic explanations have included arguments such as increased numbers of genes; greater numbers of protein products produced through alternative splicing; increased numbers of regulatory non-coding RNAs and increased complexity of the cis-regulatory code. An obvious value of complete genome sequences lies in their ability to provide us with inventories of such components. I examine progress being made in linking genome content to the pattern of animal evolution, and argue that the gap between genomic and phenotypic complexity can only be understood through the totality of interacting components.
Collapse
|
121
|
Purdom E, Simpson KM, Robinson MD, Conboy JG, Lapuk AV, Speed TP. FIRMA: a method for detection of alternative splicing from exon array data. Bioinformatics 2008; 24:1707-14. [PMID: 18573797 PMCID: PMC2638867 DOI: 10.1093/bioinformatics/btn284] [Citation(s) in RCA: 98] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2008] [Revised: 05/18/2008] [Accepted: 06/06/2008] [Indexed: 12/22/2022] Open
Abstract
MOTIVATION Analyses of EST data show that alternative splicing is much more widespread than once thought. The advent of exon and tiling microarrays means that researchers now have the capacity to experimentally measure alternative splicing on a genome wide level. New methods are needed to analyze the data from these arrays. RESULTS We present a method, finding isoforms using robust multichip analysis (FIRMA), for detecting differential alternative splicing in exon array data. FIRMA has been developed for Affymetrix exon arrays, but could in principle be extended to other exon arrays, tiling arrays or splice junction arrays. We have evaluated the method using simulated data, and have also applied it to two datasets: a panel of 11 human tissues and a set of 10 pairs of matched normal and tumor colon tissue. FIRMA is able to detect exons in several genes confirmed by reverse transcriptase PCR. AVAILABILITY R code implementing our methods is contributed to the package aroma.affymetrix.
Collapse
Affiliation(s)
- E Purdom
- Department of Statistics, University of California at Berkeley, 367 Evans Hall #3860, Berkeley, CA 94720-3860, USA.
| | | | | | | | | | | |
Collapse
|
122
|
Chen L, Zheng S. Identify alternative splicing events based on position-specific evolutionary conservation. PLoS One 2008; 3:e2806. [PMID: 18665247 PMCID: PMC2467489 DOI: 10.1371/journal.pone.0002806] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2008] [Accepted: 07/07/2008] [Indexed: 11/19/2022] Open
Abstract
The evolution of eukaryotes is accompanied by the increased complexity of alternative splicing which greatly expands genome information. One of the greatest challenges in the post-genome era is a complete revelation of human transcriptome with consideration of alternative splicing. Here, we introduce a comparative genomics approach to systemically identify alternative splicing events based on the differential evolutionary conservation between exons and introns and the high-quality annotation of the ENCODE regions. Specifically, we focus on exons that are included in some transcripts but are completely spliced out for others and we call them conditional exons. First, we characterize distinguishing features among conditional exons, constitutive exons and introns. One of the most important features is the position-specific conservation score. There are dramatic differences in conservation scores between conditional exons and constitutive exons. More importantly, the differences are position-specific. For flanking intronic regions, the differences between conditional exons and constitutive exons are also position-specific. Using the Random Forests algorithm, we can classify conditional exons with high specificities (97% for the identification of conditional exons from intron regions and 95% for the classification of known exons) and fair sensitivities (64% and 32% respectively). We applied the method to the human genome and identified 39,640 introns that actually contain conditional exons and classified 8,813 conditional exons from the current RefSeq exon list. Among those, 31,673 introns containing conditional exons and 5,294 conditional exons classified from known exons cannot be inferred from RefSeq, UCSC or Ensembl annotations. Some of these de novo predictions were experimentally verified.
Collapse
Affiliation(s)
- Liang Chen
- Molecular and Computational Biology, Department of Biological Sciences, University of Southern California, Los Angeles, California, United States of America.
| | | |
Collapse
|
123
|
Barbazuk WB, Fu Y, McGinnis KM. Genome-wide analyses of alternative splicing in plants: opportunities and challenges. Genome Res 2008; 18:1381-92. [PMID: 18669480 DOI: 10.1101/gr.053678.106] [Citation(s) in RCA: 254] [Impact Index Per Article: 15.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Alternative splicing (AS) creates multiple mRNA transcripts from a single gene. While AS is known to contribute to gene regulation and proteome diversity in animals, the study of its importance in plants is in its early stages. However, recently available plant genome and transcript sequence data sets are enabling a global analysis of AS in many plant species. Results of genome analysis have revealed differences between animals and plants in the frequency of alternative splicing. The proportion of plant genes that have one or more alternative transcript isoforms is approximately 20%, indicating that AS in plants is not rare, although this rate is approximately one-third of that observed in human. The majority of plant AS events have not been functionally characterized, but evidence suggests that AS participates in important plant functions, including stress response, and may impact domestication and trait selection. The increasing availability of plant genome sequence data will enable larger comparative analyses that will identify functionally important plant AS events based on their evolutionary conservation, determine the influence of genome duplication on the evolution of AS, and discover plant-specific cis-elements that regulate AS. This review summarizes recent analyses of AS in plants, discusses the importance of further analysis, and suggests directions for future efforts.
Collapse
Affiliation(s)
- W Brad Barbazuk
- Donald Danforth Plant Science Center, St. Louis, Missouri 63132, USA.
| | | | | |
Collapse
|
124
|
Zhang X, Merkler KA, McLean MP. Characterization of regulatory intronic and exonic sequences involved in alternative splicing of scavenger receptor class B gene. Biochem Biophys Res Commun 2008; 372:173-8. [DOI: 10.1016/j.bbrc.2008.05.007] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2008] [Accepted: 05/02/2008] [Indexed: 01/04/2023]
|
125
|
Saltzman AL, Kim YK, Pan Q, Fagnani MM, Maquat LE, Blencowe BJ. Regulation of multiple core spliceosomal proteins by alternative splicing-coupled nonsense-mediated mRNA decay. Mol Cell Biol 2008; 28:4320-30. [PMID: 18443041 PMCID: PMC2447145 DOI: 10.1128/mcb.00361-08] [Citation(s) in RCA: 163] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2008] [Revised: 04/01/2008] [Accepted: 04/17/2008] [Indexed: 01/27/2023] Open
Abstract
Alternative splicing (AS) can regulate gene expression by introducing premature termination codons (PTCs) into spliced mRNA that subsequently elicit transcript degradation by the nonsense-mediated mRNA decay (NMD) pathway. However, the range of cellular functions controlled by this process and the factors required are poorly understood. By quantitative AS microarray profiling, we find that there are significant overlaps among the sets of PTC-introducing AS events affected by individual knockdown of the three core human NMD factors, Up-Frameshift 1 (UPF1), UPF2, and UPF3X/B. However, the levels of some PTC-containing splice variants are less or not detectably affected by the knockdown of UPF2 and/or UPF3X, compared with the knockdown of UPF1. The intron sequences flanking the affected alternative exons are often highly conserved, suggesting important regulatory roles for these AS events. The corresponding genes represent diverse cellular functions, and surprisingly, many encode core spliceosomal proteins and assembly factors. We further show that conserved, PTC-introducing AS events are enriched in genes that encode core spliceosomal proteins. Where tested, altering the expression levels of these core spliceosomal components affects the regulation of PTC-containing splice variants from the corresponding genes. Together, our results show that AS-coupled NMD can have different UPF factor requirements and is likely to regulate many general components of the spliceosome. The results further implicate general spliceosomal components in AS regulation.
Collapse
Affiliation(s)
- Arneet L Saltzman
- Department of Molecular Genetics, Centre for Cellular and Biomolecular Research, 160 College Street, University of Toronto, Toronto, Ontario M5S 3E1, Canada
| | | | | | | | | | | |
Collapse
|
126
|
Lev-Maor G, Goren A, Sela N, Kim E, Keren H, Doron-Faigenboim A, Leibman-Barak S, Pupko T, Ast G. The "alternative" choice of constitutive exons throughout evolution. PLoS Genet 2008; 3:e203. [PMID: 18020709 PMCID: PMC2077895 DOI: 10.1371/journal.pgen.0030203] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2007] [Accepted: 10/01/2007] [Indexed: 12/23/2022] Open
Abstract
Alternative cassette exons are known to originate from two processes—exonization of intronic sequences and exon shuffling. Herein, we suggest an additional mechanism by which constitutively spliced exons become alternative cassette exons during evolution. We compiled a dataset of orthologous exons from human and mouse that are constitutively spliced in one species but alternatively spliced in the other. Examination of these exons suggests that the common ancestors were constitutively spliced. We show that relaxation of the 5′ splice site during evolution is one of the molecular mechanisms by which exons shift from constitutive to alternative splicing. This shift is associated with the fixation of exonic splicing regulatory sequences (ESRs) that are essential for exon definition and control the inclusion level only after the transition to alternative splicing. The effect of each ESR on splicing and the combinatorial effects between two ESRs are conserved from fish to human. Our results uncover an evolutionary pathway that increases transcriptome diversity by shifting exons from constitutive to alternative splicing. Alternative splicing is believed to play a major role in the creation of transcriptomic diversification leading to higher order of organismal complexity, especially in mammals. As much as 80% of human genes generate more than one type of mRNA by alternative splicing. Thus, alternative splicing can bridge the low number of protein coding genes (∼24,500) and the total number of proteins generated in the human proteome (∼90,000). The correlation between the higher order of phenotypic diversity and alternative splicing was recently demonstrated and thus the origin of alternative splicing is of great interest. There are currently two models regarding the origin of alternatively spliced exons—exonization of intronic sequences and exon shuffling. According to these two mechanisms, a protein-coding gene was first established and only then a new alternative exon appeared within it or was added to the gene. Our current study provides evidences for a new mechanism indicating that during evolution constitutively spliced exons became alternatively spliced. Large-scale bioinformatic analyses reveal the magnitude of this process and experimental validation systems provide insights into its mechanisms.
Collapse
Affiliation(s)
- Galit Lev-Maor
- Department of Human Molecular Genetics, Tel Aviv University, Tel Aviv, Israel
| | - Amir Goren
- Department of Human Molecular Genetics, Tel Aviv University, Tel Aviv, Israel
| | - Noa Sela
- Department of Human Molecular Genetics, Tel Aviv University, Tel Aviv, Israel
| | - Eddo Kim
- Department of Human Molecular Genetics, Tel Aviv University, Tel Aviv, Israel
| | - Hadas Keren
- Department of Human Molecular Genetics, Tel Aviv University, Tel Aviv, Israel
| | - Adi Doron-Faigenboim
- Department of Cell Research and Immunology, Tel Aviv University, Tel Aviv, Israel
| | | | - Tal Pupko
- Department of Cell Research and Immunology, Tel Aviv University, Tel Aviv, Israel
| | - Gil Ast
- Department of Human Molecular Genetics, Tel Aviv University, Tel Aviv, Israel
- * To whom correspondence should be addressed. E-mail:
| |
Collapse
|
127
|
Aznarez I, Barash Y, Shai O, He D, Zielenski J, Tsui LC, Parkinson J, Frey BJ, Rommens JM, Blencowe BJ. A systematic analysis of intronic sequences downstream of 5' splice sites reveals a widespread role for U-rich motifs and TIA1/TIAL1 proteins in alternative splicing regulation. Genome Res 2008; 18:1247-58. [PMID: 18456862 DOI: 10.1101/gr.073155.107] [Citation(s) in RCA: 83] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
To identify human intronic sequences associated with 5' splice site recognition, we performed a systematic search for motifs enriched in introns downstream of both constitutive and alternative cassette exons. Significant enrichment was observed for U-rich motifs within 100 nucleotides downstream of 5' splice sites of both classes of exons, with the highest enrichment between positions +6 and +30. Exons adjacent to U-rich intronic motifs contain lower frequencies of exonic splicing enhancers and higher frequencies of exonic splicing silencers, compared with exons not followed by U-rich intronic motifs. These findings motivated us to explore the possibility of a widespread role for U-rich motifs in promoting exon inclusion. Since cytotoxic granule-associated RNA binding protein (TIA1) and TIA1-like 1 (TIAL1; also known as TIAR) were previously shown in vitro to bind to U-rich motifs downstream of 5' splice sites, and to facilitate 5' splice site recognition in vitro and in vivo, we investigated whether these factors function more generally in the regulation of splicing of exons followed by U-rich intronic motifs. Simultaneous knockdown of TIA1 and TIAL1 resulted in increased skipping of 36/41 (88%) of alternatively spliced exons associated with U-rich motifs, but did not affect 32/33 (97%) alternatively spliced exons that are not associated with U-rich motifs. The increase in exon skipping correlated with the proximity of the first U-rich motif and the overall "U-richness" of the adjacent intronic region. The majority of the alternative splicing events regulated by TIA1/TIAL1 are conserved in mouse, and the corresponding genes are associated with diverse cellular functions. Based on our results, we estimate that approximately 15% of alternative cassette exons are regulated by TIA1/TIAL1 via U-rich intronic elements.
Collapse
Affiliation(s)
- Isabel Aznarez
- Program in Genetics and Genome Biology, The Hospital for Sick Children, Toronto, ON, Canada
| | | | | | | | | | | | | | | | | | | |
Collapse
|
128
|
Wang Z, Burge CB. Splicing regulation: from a parts list of regulatory elements to an integrated splicing code. RNA (NEW YORK, N.Y.) 2008; 14:802-13. [PMID: 18369186 PMCID: PMC2327353 DOI: 10.1261/rna.876308] [Citation(s) in RCA: 710] [Impact Index Per Article: 44.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/19/2023]
Abstract
Alternative splicing of pre-mRNAs is a major contributor to both proteomic diversity and control of gene expression levels. Splicing is tightly regulated in different tissues and developmental stages, and its disruption can lead to a wide range of human diseases. An important long-term goal in the splicing field is to determine a set of rules or "code" for splicing that will enable prediction of the splicing pattern of any primary transcript from its sequence. Outside of the core splice site motifs, the bulk of the information required for splicing is thought to be contained in exonic and intronic cis-regulatory elements that function by recruitment of sequence-specific RNA-binding protein factors that either activate or repress the use of adjacent splice sites. Here, we summarize the current state of knowledge of splicing cis-regulatory elements and their context-dependent effects on splicing, emphasizing recent global/genome-wide studies and open questions.
Collapse
Affiliation(s)
- Zefeng Wang
- Department of Pharmacology, School of Medicine, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA.
| | | |
Collapse
|
129
|
Bortfeldt R, Schindler S, Szafranski K, Schuster S, Holste D. Comparative analysis of sequence features involved in the recognition of tandem splice sites. BMC Genomics 2008; 9:202. [PMID: 18447903 PMCID: PMC2423196 DOI: 10.1186/1471-2164-9-202] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2008] [Accepted: 04/30/2008] [Indexed: 01/05/2023] Open
Abstract
Background The splicing of pre-mRNAs is conspicuously often variable and produces multiple alternatively spliced (AS) isoforms that encode different messages from one gene locus. Computational studies uncovered a class of highly similar isoforms, which were related to tandem 5'-splice sites (5'ss) and 3'-splice sites (3'ss), yet with very sparse anecdotal evidence in experimental studies. To compare the types and levels of alternative tandem splice site exons occurring in different human organ systems and cell types, and to study known sequence features involved in the recognition and distinction of neighboring splice sites, we performed large-scale, stringent alignments of cDNA sequences and ESTs to the human and mouse genomes, followed by experimental validation. Results We analyzed alternative 5'ss exons (A5Es) and alternative 3'ss exons (A3Es), derived from transcript sequences that were aligned to assembled genome sequences to infer patterns of AS occurring in several thousands of genes. Comparing the levels of overlapping (tandem) and non-overlapping (competitive) A5Es and A3Es, a clear preference of isoforms was seen for tandem acceptors and donors, with four nucleotides and three to six nucleotides long exon extensions, respectively. A subset of inferred A5E tandem exons was selected and experimentally validated. With the focus on A5Es, we investigated their transcript coverage, sequence conservation and base-paring to U1 snRNA, proximal and distal splice site classification, candidate motifs for cis-regulatory activity, and compared A5Es with A3Es, constitutive and pseudo-exons, in H. sapiens and M. musculus. The results reveal a small but authentic enriched set of tandem splice site preference, with specific distances between proximal and distal 5'ss (3'ss), which showed a marked dichotomy between the levels of in- and out-of-frame splicing for A5Es and A3Es, respectively, identified a number of candidate NMD targets, and allowed a rough estimation of a number of undetected tandem donors based on splice site information. Conclusion This comparative study distinguishes tandem 5'ss and 3'ss, with three to six nucleotides long extensions, as having unusually high proportions of AS, experimentally validates tandem donors in a panel of different human tissues, highlights the dichotomy in the types of AS occurring at tandem splice sites, and elucidates that human alternative exons spliced at overlapping 5'ss posses features of typical splice variants that could well be beneficial for the cell.
Collapse
Affiliation(s)
- Ralf Bortfeldt
- Department of Bioinformatics, Friedrich-Schiller University, Ernst-Abbe-Platz 2, D-07743 Jena, Germany.
| | | | | | | | | |
Collapse
|
130
|
Peng T, Xue C, Bi J, Li T, Wang X, Zhang X, Li Y. Functional importance of different patterns of correlation between adjacent cassette exons in human and mouse. BMC Genomics 2008; 9:191. [PMID: 18439302 PMCID: PMC2432081 DOI: 10.1186/1471-2164-9-191] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2007] [Accepted: 04/26/2008] [Indexed: 12/19/2022] Open
Abstract
Background Alternative splicing expands transcriptome diversity and plays an important role in regulation of gene expression. Previous studies focus on the regulation of a single cassette exon, but recent experiments indicate that multiple cassette exons within a gene may interact with each other. This interaction can increase the potential to generate various transcripts and adds an extra layer of complexity to gene regulation. Several cases of exon interaction have been discovered. However, the extent to which the cassette exons coordinate with each other remains unknown. Results Based on EST data, we employed a metric of correlation coefficients to describe the interaction between two adjacent cassette exons and then categorized these exon pairs into three different groups by their interaction (correlation) patterns. Sequence analysis demonstrates that strongly-correlated groups are more conserved and contain a higher proportion of pairs with reading frame preservation in a combinatorial manner. Multiple genome comparison further indicates that different groups of correlated pairs have different evolutionary courses: (1) The vast majority of positively-correlated pairs are old, (2) most of the weakly-correlated pairs are relatively young, and (3) negatively-correlated pairs are a mixture of old and young events. Conclusion We performed a large-scale analysis of interactions between adjacent cassette exons. Compared with weakly-correlated pairs, the strongly-correlated pairs, including both the positively and negatively correlated ones, show more evidence that they are under delicate splicing control and tend to be functionally important. Additionally, the positively-correlated pairs bear strong resemblance to constitutive exons, which suggests that they may evolve from ancient constitutive exons, while negatively and weakly correlated pairs are more likely to contain newly emerging exons.
Collapse
Affiliation(s)
- Tao Peng
- MOE Key Laboratory of Bioinformatics and Bioinformatics Division, TNLIST/Department of Automation, Tsinghua University, Beijing 100084, PRoC.
| | | | | | | | | | | | | |
Collapse
|
131
|
Hiller M, Szafranski K, Sinha R, Huse K, Nikolajewa S, Rosenstiel P, Schreiber S, Backofen R, Platzer M. Assessing the fraction of short-distance tandem splice sites under purifying selection. RNA (NEW YORK, N.Y.) 2008; 14:616-29. [PMID: 18268022 PMCID: PMC2271360 DOI: 10.1261/rna.883908] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/07/2023]
Abstract
Many alternative splice events result in subtle mRNA changes, and most of them occur at short-distance tandem donor and acceptor sites. The splicing mechanism of such tandem sites likely involves the stochastic selection of either splice site. While tandem splice events are frequent, it is unknown how many are functionally important. Here, we use phylogenetic conservation to address this question, focusing on tandems with a distance of 3-9 nucleotides. We show that previous contradicting results on whether alternative or constitutive tandem motifs are more conserved between species can be explained by a statistical paradox (Simpson's paradox). Applying methods that take biases into account, we found higher conservation of alternative tandems in mouse, dog, and even chicken, zebrafish, and Fugu genomes. We estimated a lower bound for the number of alternative sites that are under purifying (negative) selection. While the absolute number of conserved tandem motifs decreases with the evolutionary distance, the fraction under selection increases. Interestingly, a number of frameshifting tandems are under selection, suggesting a role in regulating mRNA and protein levels via nonsense-mediated decay (NMD). An analysis of the intronic flanks shows that purifying selection also acts on the intronic sequence. We propose that stochastic splice site selection can be an advantageous mechanism that allows constant splice variant ratios in situations where a deviation in this ratio is deleterious.
Collapse
Affiliation(s)
- Michael Hiller
- Bioinformatics Group, Albert-Ludwigs-University Freiburg, 79110 Freiburg, Germany.
| | | | | | | | | | | | | | | | | |
Collapse
|
132
|
Xiao Y, Segal MR. Biological sequence classification utilizing positive and unlabeled data. Bioinformatics 2008; 24:1198-205. [DOI: 10.1093/bioinformatics/btn089] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
|
133
|
Szafranski K, Schindler S, Taudien S, Hiller M, Huse K, Jahn N, Schreiber S, Backofen R, Platzer M. Violating the splicing rules: TG dinucleotides function as alternative 3' splice sites in U2-dependent introns. Genome Biol 2008; 8:R154. [PMID: 17672918 PMCID: PMC2374985 DOI: 10.1186/gb-2007-8-8-r154] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2007] [Revised: 06/14/2007] [Accepted: 08/01/2007] [Indexed: 01/25/2023] Open
Abstract
TG dinucleotides functioning as alternative 3' splice sites were identified and experimentally verified in 36 human genes. Background Despite some degeneracy of sequence signals that govern splicing of eukaryotic pre-mRNAs, it is an accepted rule that U2-dependent introns exhibit the 3' terminal dinucleotide AG. Intrigued by anecdotal evidence for functional non-AG 3' splice sites, we carried out a human genome-wide screen. Results We identified TG dinucleotides functioning as alternative 3' splice sites in 36 human genes. The TG-derived splice variants were experimentally validated with a success rate of 92%. Interestingly, ratios of alternative splice variants are tissue-specific for several introns. TG splice sites and their flanking intron sequences are substantially conserved between orthologous vertebrate genes, even between human and frog, indicating functional relevance. Remarkably, TG splice sites are exclusively found as alternative 3' splice sites, never as the sole 3' splice site for an intron, and we observed a distance constraint for TG-AG splice site tandems. Conclusion Since TGs splice sites are exclusively found as alternative 3' splice sites, the U2 spliceosome apparently accomplishes perfect specificity for 3' AGs at an early splicing step, but may choose 3' TGs during later steps. Given the tiny fraction of TG 3' splice sites compared to the vast amount of non-viable TGs, cis-acting sequence signals must significantly contribute to splice site definition. Thus, we consider TG-AG 3' splice site tandems as promising subjects for studies on the mechanisms of 3' splice site selection.
Collapse
Affiliation(s)
- Karol Szafranski
- Genome Analysis, Leibniz Institute for Age Research - Fritz Lipmann Institute, Beutenbergstr., 07745 Jena, Germany
| | - Stefanie Schindler
- Genome Analysis, Leibniz Institute for Age Research - Fritz Lipmann Institute, Beutenbergstr., 07745 Jena, Germany
| | - Stefan Taudien
- Genome Analysis, Leibniz Institute for Age Research - Fritz Lipmann Institute, Beutenbergstr., 07745 Jena, Germany
| | - Michael Hiller
- Institute of Computer Science, Bioinformatics Group, Albert-Ludwigs-University Freiburg, Georges-Koehler-Allee, 79110 Freiburg, Germany
| | - Klaus Huse
- Genome Analysis, Leibniz Institute for Age Research - Fritz Lipmann Institute, Beutenbergstr., 07745 Jena, Germany
| | - Niels Jahn
- Genome Analysis, Leibniz Institute for Age Research - Fritz Lipmann Institute, Beutenbergstr., 07745 Jena, Germany
| | - Stefan Schreiber
- Institute of Clinical Molecular Biology, Christian Albrechts University Kiel, Schittenhelmstr., 24105 Kiel, Germany
| | - Rolf Backofen
- Institute of Computer Science, Bioinformatics Group, Albert-Ludwigs-University Freiburg, Georges-Koehler-Allee, 79110 Freiburg, Germany
| | - Matthias Platzer
- Genome Analysis, Leibniz Institute for Age Research - Fritz Lipmann Institute, Beutenbergstr., 07745 Jena, Germany
| |
Collapse
|
134
|
Holste D, Ohler U. Strategies for identifying RNA splicing regulatory motifs and predicting alternative splicing events. PLoS Comput Biol 2008; 4:e21. [PMID: 18225947 PMCID: PMC2217580 DOI: 10.1371/journal.pcbi.0040021] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open
Affiliation(s)
- Dirk Holste
- * To whom correspondence should be addressed. E-mail: (UO), (DH)
| | - Uwe Ohler
- * To whom correspondence should be addressed. E-mail: (UO), (DH)
| |
Collapse
|
135
|
Ke S, Zhang XHF, Chasin LA. Positive selection acting on splicing motifs reflects compensatory evolution. Genome Res 2008; 18:533-43. [PMID: 18204002 DOI: 10.1101/gr.070268.107] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
We have used comparative genomics to characterize the evolutionary behavior of predicted splicing regulatory motifs. Using base substitution rates in intronic regions as a calibrator for neutral change, we found a strong avoidance of synonymous substitutions that disrupt predicted exonic splicing enhancers or create predicted exonic splicing silencers. These results attest to the functionality of the hexameric motif set used and suggest that they are subject to purifying selection. We also found that synonymous substitutions in constitutive exons tend to create exonic splicing enhancers and to disrupt exonic splicing silencers, implying positive selection for these splicing promoting events. We present evidence that this positive selection is the result of splicing-positive events compensating for splicing-negative events as well as for mutations that weaken splice-site sequences. Such compensatory events include nonsynonymous mutations, synonymous mutations, and mutations at splice sites. Compensation was also seen from the fact that orthologous exons tend to maintain the same number of predicted splicing motifs. Our data fit a splicing compensation model of exon evolution, in which selection for splicing-positive mutations takes place to counter the effect of an ongoing splicing-negative mutational process, with the exon as a whole being conserved as a unit of splicing. In the course of this analysis, we observed that synonymous positions in general are conserved relative to intronic sequences, suggesting that messenger RNA molecules are rich in sequence information for functions beyond protein coding and splicing.
Collapse
Affiliation(s)
- Shengdong Ke
- Department of Biological Sciences Columbia University New York, New York 10027, USA
| | | | | |
Collapse
|
136
|
Kurmangaliyev YZ, Gelfand MS. Computational analysis of splicing errors and mutations in human transcripts. BMC Genomics 2008; 9:13. [PMID: 18194514 PMCID: PMC2234086 DOI: 10.1186/1471-2164-9-13] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2007] [Accepted: 01/14/2008] [Indexed: 01/10/2023] Open
Abstract
Background Most retained introns found in human cDNAs generated by high-throughput sequencing projects seem to result from underspliced transcripts, and thus they capture intermediate steps of pre-mRNA splicing. On the other hand, mutations in splice sites cause exon skipping of the respective exon or activation of pre-existing cryptic sites. Both types of events reflect properties of the splicing mechanism. Results The retained introns were significantly shorter than constitutive ones, and skipped exons are shorter than exons with cryptic sites. Both donor and acceptor splice sites of retained introns were weaker than splice sites of constitutive introns. The authentic acceptor sites affected by mutations were significantly weaker in exons with activated cryptic sites than in skipped exons. The distance from a mutated splice site to the nearest equivalent site is significantly shorter in cases of activated cryptic sites compared to exon skipping events. The prevalence of retained introns within genes monotonically increased in the 5'-to-3' direction (more retained introns close to the 3'-end), consistent with the model of co-transcriptional splicing. The density of exonic splicing enhancers was higher, and the density of exonic splicing silencers lower in retained introns compared to constitutive ones and in exons with cryptic sites compared to skipped exons. Conclusion Thus the analysis of retained introns in human cDNA, exons skipped due to mutations in splice sites and exons with cryptic sites produced results consistent with the intron definition mechanism of splicing of short introns, co-transcriptional splicing, dependence of splicing efficiency on the splice site strength and the density of candidate exonic splicing enhancers and silencers. These results are consistent with other, recently published analyses.
Collapse
Affiliation(s)
- Yerbol Z Kurmangaliyev
- Institute for Information Transmission Problems (the Kharkevich Institute) RAS, Bolshoi Karetny pereulok 19, Moscow, 127994, Russia.
| | | |
Collapse
|
137
|
Karolchik D, Kuhn RM, Baertsch R, Barber GP, Clawson H, Diekhans M, Giardine B, Harte RA, Hinrichs AS, Hsu F, Kober KM, Miller W, Pedersen JS, Pohl A, Raney BJ, Rhead B, Rosenbloom KR, Smith KE, Stanke M, Thakkapallayil A, Trumbower H, Wang T, Zweig AS, Haussler D, Kent WJ. The UCSC Genome Browser Database: 2008 update. Nucleic Acids Res 2007; 36:D773-9. [PMID: 18086701 PMCID: PMC2238835 DOI: 10.1093/nar/gkm966] [Citation(s) in RCA: 403] [Impact Index Per Article: 23.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023] Open
Abstract
The University of California, Santa Cruz, Genome Browser Database (GBD) provides integrated sequence and annotation data for a large collection of vertebrate and model organism genomes. Seventeen new assemblies have been added to the database in the past year, for a total coverage of 19 vertebrate and 21 invertebrate species as of September 2007. For each assembly, the GBD contains a collection of annotation data aligned to the genomic sequence. Highlights of this year's additions include a 28-species human-based vertebrate conservation annotation, an enhanced UCSC Genes set, and more human variation, MGC, and ENCODE data. The database is optimized for fast interactive performance with a set of web-based tools that may be used to view, manipulate, filter and download the annotation data. New toolset features include the Genome Graphs tool for displaying genome-wide data sets, session saving and sharing, better custom track management, expanded Genome Browser configuration options and a Genome Browser wiki site. The downloadable GBD data, the companion Genome Browser toolset and links to documentation and related information can be found at: http://genome.ucsc.edu/.
Collapse
Affiliation(s)
- D Karolchik
- Center for Biomolecular Science and Engineering, University of California Santa Cruz (UCSC), Santa Cruz, CA 95064, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
138
|
Kashyap L, Sharma RK. Alternative splicing: a paradoxical qudo in eukaryotic genomes. Bioinformation 2007; 2:155-6. [PMID: 21670794 PMCID: PMC2255073 DOI: 10.6026/97320630002155] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2007] [Revised: 12/08/2007] [Accepted: 12/11/2007] [Indexed: 11/23/2022] Open
Abstract
One of the most remarkable observations stemming from the sequencing of genomes of diverse species is that the number of protein-coding genes in an organism does not correlate with its overall cellular complexity. Alternative splicing, a key mechanism for generating protein complexity, has been suggested as one of the major explanation for this discrepancy between the number of genes and genome complexity. Determining the extent and importance of alternative splicing required the confluence of critical advances in data acquisition, improved understanding of biological processes and the development of fast and accurate computational analysis tools. Although many model organisms have now been completely sequenced, we are still very far from understanding the exact frequency of alternative splicing from these sequenced genomes.This paper will highlight some recent progress and future challenges for functional genomics and bioinformatics in this rapidly developing area.
Collapse
Affiliation(s)
- Luv Kashyap
- Department of Biochemistry, Faculty of Life Sciences, Aligarh Muslim University, Aligarh, India
| | - Ravi Kumar Sharma
- Botany Division, Central Drug Research Institute, M G Marg, Lucknow, India
| |
Collapse
|
139
|
Irimia M, Rukov JL, Penny D, Garcia-Fernandez J, Vinther J, Roy SW. Widespread evolutionary conservation of alternatively spliced exons in Caenorhabditis. Mol Biol Evol 2007; 25:375-82. [PMID: 18048400 DOI: 10.1093/molbev/msm262] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Alternative splicing (AS) contributes to increased transcriptome and proteome diversity in various eukaryotic lineages. Previous studies showed low levels of conservation of alternatively spliced (cassette) exons within mammals and within dipterans. We report a strikingly different pattern in Caenorhabditis nematodes-more than 92% of cassette exons from Caenorhabditis elegans are conserved in Caenorhabditis briggsae and/or Caenorhabditis remanei. High levels of conservation extend to minor-form exons (present in a minority of transcripts) and are particularly pronounced for exons showing complex patterns of splicing. The functionality of the vast majority of cassette exons is underscored by various other features. We suggest that differences in conservation between lineages reflect differences in levels of functionality and further suggest that these differences are due to differences in intron length and the strength of consensus boundaries across lineages. Finally, we demonstrate an inverse relationship between AS and gene duplication, suggesting that the latter may be primarily responsible for the emergence of new functional transcripts in nematodes.
Collapse
Affiliation(s)
- Manuel Irimia
- Departament de Genètica, Universitat de Barcelona, Barcelona, Spain.
| | | | | | | | | | | |
Collapse
|
140
|
Abstract
Alternative pre-mRNA splicing has an important role in the control of neuronal gene expression. Many neuronal proteins are structurally diversified through the differential inclusion and exclusion of sequences in the final spliced mRNA. Here, we discuss common mechanisms of splicing regulation and provide examples of how alternative splicing has important roles in neuronal development and mature neuron function. Finally, we describe regulatory proteins that control the splicing of some neuronally expressed transcripts.
Collapse
Affiliation(s)
- Qin Li
- Howard Hughes Medical Institute, University of California, Los Angeles, 6-762 MacDonald Research Laboratories, 675 Charles E. Young Drive South, Los Angeles, California 90095, USA
| | | | | |
Collapse
|
141
|
Leparc GG, Mitra RD. A sensitive procedure to detect alternatively spliced mRNA in pooled-tissue samples. Nucleic Acids Res 2007; 35:e146. [PMID: 18000005 PMCID: PMC2175357 DOI: 10.1093/nar/gkm989] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
One important goal of genomics is to explore the extent of alternative splicing in the transcriptome and generate a comprehensive catalog of splice forms. New computational and experimental approaches have led to an increase in the number of predicted alternatively spliced transcripts; however, validation of these predictions has not kept pace. In this work, we systematically explore different methods for the validation of cassette exons predicted by computational methods or tiling microarrays. Our goal was to find a procedure that is cost effective, sensitive and specific. We examined three ways of priming the reverse transcription (RT) reaction—poly-dT priming, random priming and pooled exon-specific priming. We also examined two strategies for PCR amplification—flanking PCR, which uses primers that hybridize to the constitutive exons flanking the predicted exon, and a semi-nested PCR with a primer that targets the predicted exon. We found that the combination of RT using a pool of gene-specific primers followed by semi-nested PCR resulted in a significant increase in sensitivity over the most commonly used methodology (97% of the test set was detected versus 14%). Our method was also highly specific—no false positives were detected using a test set of true negatives. Finally, we demonstrate that this method is able to detect alternative exons with a high sensitivity from whole-organism RNA, allowing all tissues to be sampled in a single experiment. The protocol developed here is an accurate and cost-effective way to validate predictions of alternative splicing.
Collapse
Affiliation(s)
- Germán Gastón Leparc
- Department of Genetics and Center for Genome Sciences, Washington University in St Louis, 4444 Forest Park Parkway, Campus Box 8510, St Louis, MO 63108, USA
| | | |
Collapse
|
142
|
Clark TA, Schweitzer AC, Chen TX, Staples MK, Lu G, Wang H, Williams A, Blume JE. Discovery of tissue-specific exons using comprehensive human exon microarrays. Genome Biol 2007; 8:R64. [PMID: 17456239 PMCID: PMC1896007 DOI: 10.1186/gb-2007-8-4-r64] [Citation(s) in RCA: 235] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2006] [Revised: 03/09/2007] [Accepted: 04/24/2007] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Higher eukaryotes express a diverse population of messenger RNAs generated by alternative splicing. Large-scale methods for monitoring gene expression must adapt in order to accurately detect the transcript variation generated by this splicing. RESULTS We have designed a high-density oligonucleotide microarray with probesets for more than one million annotated and predicted exons in the human genome. Using these arrays and a simple algorithm that normalizes exon signal to signal from the gene as a whole, we have identified tissue-specific exons from a panel of 16 different normal adult tissues. RT-PCR validation confirms approximately 86% of the predicted tissue-enriched probesets. Pair-wise comparisons between the tissues suggest that as many as 73% of detected genes are differentially alternatively spliced. We also demonstrate how an inclusive exon microarray can be used to discover novel alternative splicing events. As examples, 17 new tissue-specific exons from 11 genes were validated by RT-PCR and sequencing. CONCLUSION In conjunction with a conceptually simple algorithm, comprehensive exon microarrays can detect tissue-specific alternative splicing events. Our data suggest significant expression outside of known exons and well annotated genes and a high frequency of alternative splicing events. In addition, we identified and validated a number of novel exons with tissue-specific splicing patterns. The tissue map data will likely serve as a valuable source of information on the regulation of alternative splicing.
Collapse
Affiliation(s)
- Tyson A Clark
- Affymetrix, Inc., 3420 Central Expressway, Santa Clara, CA 95051, USA
| | | | - Tina X Chen
- Affymetrix, Inc., 3420 Central Expressway, Santa Clara, CA 95051, USA
| | | | - Gang Lu
- Affymetrix, Inc., 3420 Central Expressway, Santa Clara, CA 95051, USA
| | - Hui Wang
- Affymetrix, Inc., 3420 Central Expressway, Santa Clara, CA 95051, USA
| | - Alan Williams
- Affymetrix, Inc., 3420 Central Expressway, Santa Clara, CA 95051, USA
| | - John E Blume
- Affymetrix, Inc., 3420 Central Expressway, Santa Clara, CA 95051, USA
| |
Collapse
|
143
|
Calarco JA, Xing Y, Cáceres M, Calarco JP, Xiao X, Pan Q, Lee C, Preuss TM, Blencowe BJ. Global analysis of alternative splicing differences between humans and chimpanzees. Genes Dev 2007; 21:2963-75. [PMID: 17978102 DOI: 10.1101/gad.1606907] [Citation(s) in RCA: 112] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Abstract
Alternative splicing is a powerful mechanism affording extensive proteomic and regulatory diversity from a limited repertoire of genes. However, the extent to which alternative splicing has contributed to the evolution of primate species-specific characteristics has not been assessed previously. Using comparative genomics and quantitative microarray profiling, we performed the first global analysis of alternative splicing differences between humans and chimpanzees. Surprisingly, 6%-8% of profiled orthologous exons display pronounced splicing level differences in the corresponding tissues from the two species. Little overlap is observed between the genes associated with alternative splicing differences and the genes that display steady-state transcript level differences, indicating that these layers of regulation have evolved rapidly to affect distinct subsets of genes in humans and chimpanzees. The alternative splicing differences we detected are predicted to affect diverse functions including gene expression, signal transduction, cell death, immune defense, and susceptibility to diseases. Differences in expression at the protein level of the major splice variant of Glutathione S-transferase omega-2 (GSTO2), which functions in the protection against oxidative stress and is associated with human aging-related diseases, suggests that this enzyme is less active in human cells compared with chimpanzee cells. The results of this study thus support an important role for alternative splicing in establishing differences between humans and chimpanzees.
Collapse
Affiliation(s)
- John A Calarco
- Banting and Best Department of Medical Research, University of Toronto, Terrence Donnelly Center for Cellular and Biomolecular Research, Toronto, Ontario M5S 3E1, Canada
| | | | | | | | | | | | | | | | | |
Collapse
|
144
|
Jelen N, Ule J, Živin M, Darnell RB. Evolution of Nova-dependent splicing regulation in the brain. PLoS Genet 2007; 3:1838-47. [PMID: 17937501 PMCID: PMC2014790 DOI: 10.1371/journal.pgen.0030173] [Citation(s) in RCA: 69] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2007] [Accepted: 08/24/2007] [Indexed: 01/08/2023] Open
Abstract
A large number of alternative exons are spliced with tissue-specific patterns, but little is known about how such patterns have evolved. Here, we study the conservation of the neuron-specific splicing factors Nova1 and Nova2 and of the alternatively spliced exons they regulate in mouse brain. Whereas Nova RNA binding domains are 94% identical across vertebrate species, Nova-dependent splicing silencer and enhancer elements (YCAY clusters) show much greater divergence, as less than 50% of mouse YCAY clusters are conserved at orthologous positions in the zebrafish genome. To study the relation between the evolution of tissue-specific splicing and YCAY clusters, we compared the brain-specific splicing of Nova-regulated exons in zebrafish, chicken, and mouse. The presence of YCAY clusters in lower vertebrates invariably predicted conservation of brain-specific splicing across species, whereas their absence in lower vertebrates correlated with a loss of alternative splicing. We hypothesize that evolution of Nova-regulated splicing in higher vertebrates proceeds mainly through changes in cis-acting elements, that tissue-specific splicing might in some cases evolve in a single step corresponding to evolution of a YCAY cluster, and that the conservation level of YCAY clusters relates to the functions encoded by the regulated RNAs.
Collapse
Affiliation(s)
- Nejc Jelen
- Laboratory of Molecular Neuro-Oncology, The Rockefeller University, New York, New York, United States of America
- Howard Hughes Medical Institute, The Rockefeller University, New York, New York, United States of America
- Division of Structural Studies, MRC Laboratory of Molecular Biology, Cambridge, United Kingdom
- Brain Research Laboratory, Institute of Pathophysiology, University of Ljubljana, Ljubljana, Slovenia
| | - Jernej Ule
- Division of Structural Studies, MRC Laboratory of Molecular Biology, Cambridge, United Kingdom
| | - Marko Živin
- Brain Research Laboratory, Institute of Pathophysiology, University of Ljubljana, Ljubljana, Slovenia
| | - Robert B Darnell
- Laboratory of Molecular Neuro-Oncology, The Rockefeller University, New York, New York, United States of America
- Howard Hughes Medical Institute, The Rockefeller University, New York, New York, United States of America
| |
Collapse
|
145
|
Zhang C, Hastings ML, Krainer AR, Zhang MQ. Dual-specificity splice sites function alternatively as 5' and 3' splice sites. Proc Natl Acad Sci U S A 2007; 104:15028-33. [PMID: 17848517 PMCID: PMC1986607 DOI: 10.1073/pnas.0703773104] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
As a result of large-scale sequencing projects and recent splicing-microarray studies, estimates of mammalian genes expressing multiple transcripts continue to increase. This expansion of transcript information makes it possible to better characterize alternative splicing events and gain insights into splicing mechanisms and regulation. Here, we describe a class of splice sites that we call dual-specificity splice sites, which we identified through genome-wide, high-quality alignment of mRNA/EST and genome sequences and experimentally verified by RT-PCR. These splice sites can be alternatively recognized as either 5' or 3' splice sites, and the dual splicing is conceptually similar to a pair of mutually exclusive exons separated by a zero-length intron. The dual-splice-site sequences are essentially a composite of canonical 5' and 3' splice-site consensus sequences, with a CAG|GURAG core. The relative use of a dual site as a 5' or 3' splice site can be accurately predicted by assuming competition for specific binding between spliceosomal components involved in recognition of 5' and 3' splice sites, respectively. Dual-specificity splice sites exist in human and mouse, and possibly in other vertebrate species, although most sites are not conserved, suggesting that their origin is recent. We discuss the implications of this unusual splicing pattern for the diverse mechanisms of exon recognition and for gene evolution.
Collapse
Affiliation(s)
- Chaolin Zhang
- *Cold Spring Harbor Laboratory, 1 Bungtown Road, Cold Spring Harbor, NY 11724; and
- Department of Biomedical Engineering, State University of New York, Stony Brook, NY 11794
| | - Michelle L. Hastings
- *Cold Spring Harbor Laboratory, 1 Bungtown Road, Cold Spring Harbor, NY 11724; and
| | - Adrian R. Krainer
- *Cold Spring Harbor Laboratory, 1 Bungtown Road, Cold Spring Harbor, NY 11724; and
- To whom correspondence may be addressed. E-mail:
or
| | - Michael Q. Zhang
- *Cold Spring Harbor Laboratory, 1 Bungtown Road, Cold Spring Harbor, NY 11724; and
- To whom correspondence may be addressed. E-mail:
or
| |
Collapse
|
146
|
He C, Zuo Z, Chen H, Zhang L, Zhou F, Cheng H, Zhou R. Genome-wide detection of testis- and testicular cancer-specific alternative splicing. Carcinogenesis 2007; 28:2484-90. [PMID: 17724370 DOI: 10.1093/carcin/bgm194] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Alternative pre-messenger RNA (mRNA) splicing is a key molecular event that allows for protein diversity and plays important roles in development and disease. Alternative pre-mRNA splicing regulations during spermatogenesis and alternative pre-mRNA splicing etiology in testicular tumorigenesis are yet to be characterized. By genome-wide analysis, here we describe alternative splicing features that distinguish distinctive patterns of alternative pre-mRNA splicing among human testis, testicular cancer and mouse testis. Through computationally subtractive analysis, we detected 80 testis-specific transcript candidates in human testis, 175 in human testicular cancer and 262 in mouse testis, which were integrated into a database. Reverse transcription-polymerase chain reaction confirmed that most of these transcript candidates from mouse testis were testis specific. Around 40% of the transcripts were from unknown/hypothetical genes, which were useful for further functional analysis. These transcripts were not overlapped, indicating lack of evolutionary conservation. Further chromosome mapping showed distinct chromosomal preference of alternative pre-mRNA splicing events. Comparison analysis indicated that alternative pre-mRNA splicing in human testicular tumor shared some characters/trends with those in mouse testis. Moreover, human testicular tumor tended to use rare splice sites and there were also distinct sequences adjacent dominant splice sites between normal testis and testicular tumor. These special features of alternative pre-mRNA splicing in human testicular tumor suggested that testicular tumorigenesis was involved in multiple steps/levels of alternative splicing events. Using alternative splicing as a potential source for new clinical diagnostic, prognostic and therapeutic strategies for treatment of testicular tumors seems to have a bright prospect.
Collapse
Affiliation(s)
- Chunjiang He
- Department of Genetics and Center for Developmental Biology, College of Life Sciences, Wuhan University, Wuhan 430072, P. R. China
| | | | | | | | | | | | | |
Collapse
|
147
|
Yeo GW, Xu X, Liang TY, Muotri AR, Carson CT, Coufal NG, Gage FH. Alternative splicing events identified in human embryonic stem cells and neural progenitors. PLoS Comput Biol 2007; 3:1951-67. [PMID: 17967047 PMCID: PMC2041973 DOI: 10.1371/journal.pcbi.0030196] [Citation(s) in RCA: 102] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2007] [Accepted: 08/24/2007] [Indexed: 02/06/2023] Open
Abstract
Human embryonic stem cells (hESCs) and neural progenitor (NP) cells are excellent models for recapitulating early neuronal development in vitro, and are key to establishing strategies for the treatment of degenerative disorders. While much effort had been undertaken to analyze transcriptional and epigenetic differences during the transition of hESC to NP, very little work has been performed to understand post-transcriptional changes during neuronal differentiation. Alternative RNA splicing (AS), a major form of post-transcriptional gene regulation, is important in mammalian development and neuronal function. Human ESC, hESC-derived NP, and human central nervous system stem cells were compared using Affymetrix exon arrays. We introduced an outlier detection approach, REAP (Regression-based Exon Array Protocol), to identify 1,737 internal exons that are predicted to undergo AS in NP compared to hESC. Experimental validation of REAP-predicted AS events indicated a threshold-dependent sensitivity ranging from 56% to 69%, at a specificity of 77% to 96%. REAP predictions significantly overlapped sets of alternative events identified using expressed sequence tags and evolutionarily conserved AS events. Our results also reveal that focusing on differentially expressed genes between hESC and NP will overlook 14% of potential AS genes. In addition, we found that REAP predictions are enriched in genes encoding serine/threonine kinase and helicase activities. An example is a REAP-predicted alternative exon in the SLK (serine/threonine kinase 2) gene that is differentially included in hESC, but skipped in NP as well as in other differentiated tissues. Lastly, comparative sequence analysis revealed conserved intronic cis-regulatory elements such as the FOX1/2 binding site GCAUG as being proximal to candidate AS exons, suggesting that FOX1/2 may participate in the regulation of AS in NP and hESC. In summary, a new methodology for exon array analysis was introduced, leading to new insights into the complexity of AS in human embryonic stem cells and their transition to neural stem cells.
Collapse
Affiliation(s)
- Gene W Yeo
- Crick-Jacobs Center for Theoretical and Computational Biology, Salk Institute, La Jolla, California, United States of America
- Laboratory of Genetics, Salk Institute, La Jolla, California, United States of America
- * To whom correspondence should be addressed. E-mail: (GWY); (FHG)
| | - Xiangdong Xu
- Laboratory of Genetics, Salk Institute, La Jolla, California, United States of America
| | - Tiffany Y Liang
- Crick-Jacobs Center for Theoretical and Computational Biology, Salk Institute, La Jolla, California, United States of America
| | - Alysson R Muotri
- Laboratory of Genetics, Salk Institute, La Jolla, California, United States of America
| | - Christian T Carson
- Laboratory of Genetics, Salk Institute, La Jolla, California, United States of America
| | - Nicole G Coufal
- Laboratory of Genetics, Salk Institute, La Jolla, California, United States of America
| | - Fred H Gage
- Laboratory of Genetics, Salk Institute, La Jolla, California, United States of America
- * To whom correspondence should be addressed. E-mail: (GWY); (FHG)
| |
Collapse
|
148
|
Boutz PL, Stoilov P, Li Q, Lin CH, Chawla G, Ostrow K, Shiue L, Ares M, Black DL. A post-transcriptional regulatory switch in polypyrimidine tract-binding proteins reprograms alternative splicing in developing neurons. Genes Dev 2007; 21:1636-52. [PMID: 17606642 PMCID: PMC1899473 DOI: 10.1101/gad.1558107] [Citation(s) in RCA: 414] [Impact Index Per Article: 24.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2007] [Accepted: 05/16/2007] [Indexed: 02/06/2023]
Abstract
Many metazoan gene transcripts exhibit neuron-specific splicing patterns, but the developmental control of these splicing events is poorly understood. We show that the splicing of a large group of exons is reprogrammed during neuronal development by a switch in expression between two highly similar polypyrimidine tract-binding proteins, PTB and nPTB (neural PTB). PTB is a well-studied regulator of alternative splicing, but nPTB is a closely related paralog whose functional relationship to PTB is unknown. In the brain, nPTB protein is specifically expressed in post-mitotic neurons, whereas PTB is restricted to neuronal precursor cells (NPC), glia, and other nonneuronal cells. Interestingly, nPTB mRNA transcripts are found in NPCs and other nonneuronal cells, but in these cells nPTB protein expression is repressed. This repression is due in part to PTB-induced alternative splicing of nPTB mRNA, leading to nonsense-mediated decay (NMD). However, we find that even properly spliced mRNA fails to express nPTB protein when PTB is present, indicating contributions from additional post-transcriptional mechanisms. The PTB-controlled repression of nPTB results in a mutually exclusive pattern of expression in the brain, where the loss of PTB in maturing neurons allows the synthesis of nPTB in these cells. To examine the consequences of this switch, we used splicing-sensitive microarrays to identify different sets of exons regulated by PTB, nPTB, or both proteins. During neuronal differentiation, the splicing of these exon sets is altered as predicted from the observed changes in PTB and nPTB expression. These data show that the post-transcriptional switch from PTB to nPTB controls a widespread alternative splicing program during neuronal development.
Collapse
Affiliation(s)
- Paul L. Boutz
- Department of Microbiology, Immunology, and Molecular Genetics, 6-762 MacDonald Research Laboratories, Los Angeles, California 90095, USA
| | - Peter Stoilov
- Department of Microbiology, Immunology, and Molecular Genetics, 6-762 MacDonald Research Laboratories, Los Angeles, California 90095, USA
| | - Qin Li
- Howard Hughes Medical Institute, 6-762 MacDonald Research Laboratories, Los Angeles, California 90095, USA
| | - Chia-Ho Lin
- Department of Microbiology, Immunology, and Molecular Genetics, 6-762 MacDonald Research Laboratories, Los Angeles, California 90095, USA
| | - Geetanjali Chawla
- Howard Hughes Medical Institute, 6-762 MacDonald Research Laboratories, Los Angeles, California 90095, USA
| | - Kristin Ostrow
- Department of Medicine, University of California at San Francisco, San Francisco, California 94143, USA
| | - Lily Shiue
- Sinsheimer Laboratories, Department of Molecular, Cell and Developmental Biology, University of California at Santa Cruz, Santa Cruz, California 95064, USA
| | - Manuel Ares
- Sinsheimer Laboratories, Department of Molecular, Cell and Developmental Biology, University of California at Santa Cruz, Santa Cruz, California 95064, USA
| | - Douglas L. Black
- Department of Microbiology, Immunology, and Molecular Genetics, 6-762 MacDonald Research Laboratories, Los Angeles, California 90095, USA
- Howard Hughes Medical Institute, 6-762 MacDonald Research Laboratories, Los Angeles, California 90095, USA
| |
Collapse
|
149
|
Voelker RB, Berglund JA. A comprehensive computational characterization of conserved mammalian intronic sequences reveals conserved motifs associated with constitutive and alternative splicing. Genes Dev 2007; 17:1023-33. [PMID: 17525134 PMCID: PMC1899113 DOI: 10.1101/gr.6017807] [Citation(s) in RCA: 55] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2006] [Accepted: 04/12/2007] [Indexed: 11/24/2022]
Abstract
Orthologous mammalian introns contain many highly conserved sequences. Of these sequences, many are likely to represent protein binding sites that are under strong positive selection. In order to identify conserved protein binding sites that are important for splicing, we analyzed the composition of intronic sequences that are conserved between human and six eutherian mammals. We focused on all completely conserved sequences of seven or more nucleotides located in the regions adjacent to splice-junctions. We found that these conserved intronic sequences are enriched in specific motifs, and that many of these motifs are statistically associated with either alternative or constitutive splicing. In validation of our methods, we identified several motifs that are known to play important roles in alternative splicing. In addition, we identified several novel motifs containing GCT that are abundant and are associated with alternative splicing. Furthermore, we demonstrate that, for some of these motifs, conservation is a strong indicator of potential functionality since conserved instances are associated with alternative splicing while nonconserved instances are not. A surprising outcome of this analysis was the identification of a large number of AT-rich motifs that are strongly associated with constitutive splicing. Many of these appear to be novel and may represent conserved intronic splicing enhancers (ISEs). Together these data show that conservation provides important insights into the identification and possible roles of cis-acting intronic sequences important for alternative and constitutive splicing.
Collapse
Affiliation(s)
- Rodger B. Voelker
- Institute of Molecular Biology, University of Oregon, Eugene, Oregon 97403, USA
| | - J. Andrew Berglund
- Institute of Molecular Biology, University of Oregon, Eugene, Oregon 97403, USA
| |
Collapse
|
150
|
Zhang Z, Habara Y, Nishiyama A, Oyazato Y, Yagi M, Takeshima Y, Matsuo M. Identification of seven novel cryptic exons embedded in the dystrophin gene and characterization of 14 cryptic dystrophin exons. J Hum Genet 2007; 52:607-617. [PMID: 17579806 DOI: 10.1007/s10038-007-0163-0] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2007] [Accepted: 05/10/2007] [Indexed: 11/29/2022]
Abstract
The dystrophin gene, which is mutated in Duchenne and Becker muscular dystrophy, is characterized by its extremely large introns. Seven cryptic exons from the intronic sequences of the dystrophin gene have been shown to be inserted into the processed mRNA. In this study, we have cloned seven novel cryptic exons embedded in dystrophin introns that were amplified from dystrophin mRNA isolated from lymphocytes. All of these sequences, which ranged in size from 27 to 151 bp, were found to be cryptic exons because they were completely homologous to intronic sequences (introns 1, 18, 29, 63, 67, and 77), and possessed consensus sequences for branch points, splice acceptor sites, and splice donor sites. Compared with the 77 authentic dystrophin exons, the 14 cryptic exons were characterized by (1) lower Shapiro's splicing probability scores for the splice donor and acceptor sites; (2) smaller and larger densities of splicing enhancer and silencer motifs, respectively; (3) a longer distance between the putative branch site and the splice acceptor site; and (4) with one exception, the introduction of premature stop codons into their respective transcripts. These characteristics indicated that the cryptic exons were weaker than the authentic exons. Our results suggested that a mutation deep within an intron that changed these parameters could cause dystrophinopathy. The cryptic exons identified provide areas that should be examined for the detection of mutations in the dystrophin gene, and they may help us to understand the roles of large dystrophin introns.
Collapse
Affiliation(s)
- Zhujun Zhang
- Department of Pediatrics, Graduate School of Medicine, Kobe University, 7-5-1 Kusunokicho, Chuo, Kobe, 650-0017, Japan
| | - Yasuaki Habara
- Department of Pediatrics, Graduate School of Medicine, Kobe University, 7-5-1 Kusunokicho, Chuo, Kobe, 650-0017, Japan
| | - Atsushi Nishiyama
- Department of Pediatrics, Graduate School of Medicine, Kobe University, 7-5-1 Kusunokicho, Chuo, Kobe, 650-0017, Japan
| | - Yoshinobu Oyazato
- Department of Pediatrics, Graduate School of Medicine, Kobe University, 7-5-1 Kusunokicho, Chuo, Kobe, 650-0017, Japan
| | - Mariko Yagi
- Department of Pediatrics, Graduate School of Medicine, Kobe University, 7-5-1 Kusunokicho, Chuo, Kobe, 650-0017, Japan
| | - Yasuhiro Takeshima
- Department of Pediatrics, Graduate School of Medicine, Kobe University, 7-5-1 Kusunokicho, Chuo, Kobe, 650-0017, Japan
| | - Masafumi Matsuo
- Department of Pediatrics, Graduate School of Medicine, Kobe University, 7-5-1 Kusunokicho, Chuo, Kobe, 650-0017, Japan.
| |
Collapse
|