1
|
Roy B, Haupt LM, Griffiths LR. Review: Alternative Splicing (AS) of Genes As An Approach for Generating Protein Complexity. Curr Genomics 2013; 14:182-94. [PMID: 24179441 PMCID: PMC3664468 DOI: 10.2174/1389202911314030004] [Citation(s) in RCA: 70] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2012] [Revised: 02/08/2013] [Accepted: 02/25/2013] [Indexed: 12/22/2022] Open
Abstract
Prior to the completion of the human genome project, the human genome was thought to have a greater number of genes as it seemed structurally and functionally more complex than other simpler organisms. This along with the belief of “one gene, one protein”, were demonstrated to be incorrect. The inequality in the ratio of gene to protein formation gave rise to the theory of alternative splicing (AS). AS is a mechanism by which one gene gives rise to multiple protein products. Numerous databases and online bioinformatic tools are available for the detection and analysis of AS. Bioinformatics provides an important approach to study mRNA and protein diversity by various tools such as expressed sequence tag (EST) sequences obtained from completely processed mRNA. Microarrays and deep sequencing approaches also aid in the detection of splicing events. Initially it was postulated that AS occurred only in about 5% of all genes but was later found to be more abundant. Using bioinformatic approaches, the level of AS in human genes was found to be fairly high with 35-59% of genes having at least one AS form. Our ability to determine and predict AS is important as disorders in splicing patterns may lead to abnormal splice variants resulting in genetic diseases. In addition, the diversity of proteins produced by AS poses a challenge for successful drug discovery and therefore a greater understanding of AS would be beneficial.
Collapse
Affiliation(s)
- Bishakha Roy
- Genomics Research Centre, Griffith Health Institute, Griffith University Gold Coast, Queensland 4222, Australia
| | | | | |
Collapse
|
2
|
Kelemen O, Convertini P, Zhang Z, Wen Y, Shen M, Falaleeva M, Stamm S. Function of alternative splicing. Gene 2013; 514:1-30. [PMID: 22909801 PMCID: PMC5632952 DOI: 10.1016/j.gene.2012.07.083] [Citation(s) in RCA: 514] [Impact Index Per Article: 46.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2012] [Revised: 07/21/2012] [Accepted: 07/30/2012] [Indexed: 12/15/2022]
Abstract
Almost all polymerase II transcripts undergo alternative pre-mRNA splicing. Here, we review the functions of alternative splicing events that have been experimentally determined. The overall function of alternative splicing is to increase the diversity of mRNAs expressed from the genome. Alternative splicing changes proteins encoded by mRNAs, which has profound functional effects. Experimental analysis of these protein isoforms showed that alternative splicing regulates binding between proteins, between proteins and nucleic acids as well as between proteins and membranes. Alternative splicing regulates the localization of proteins, their enzymatic properties and their interaction with ligands. In most cases, changes caused by individual splicing isoforms are small. However, cells typically coordinate numerous changes in 'splicing programs', which can have strong effects on cell proliferation, cell survival and properties of the nervous system. Due to its widespread usage and molecular versatility, alternative splicing emerges as a central element in gene regulation that interferes with almost every biological function analyzed.
Collapse
Affiliation(s)
- Olga Kelemen
- Department of Molecular and Cellular Biochemistry, University of Kentucky, Lexington, Kentucky, United States of America
| | - Paolo Convertini
- Department of Molecular and Cellular Biochemistry, University of Kentucky, Lexington, Kentucky, United States of America
| | - Zhaiyi Zhang
- Department of Molecular and Cellular Biochemistry, University of Kentucky, Lexington, Kentucky, United States of America
| | - Yuan Wen
- Department of Molecular and Cellular Biochemistry, University of Kentucky, Lexington, Kentucky, United States of America
| | - Manli Shen
- Department of Molecular and Cellular Biochemistry, University of Kentucky, Lexington, Kentucky, United States of America
| | - Marina Falaleeva
- Department of Molecular and Cellular Biochemistry, University of Kentucky, Lexington, Kentucky, United States of America
| | - Stefan Stamm
- Department of Molecular and Cellular Biochemistry, University of Kentucky, Lexington, Kentucky, United States of America
| |
Collapse
|
3
|
Funari VA, Voevodski K, Leyfer D, Yerkes L, Cramer D, Tolan DR. Quantitative gene expression profiles in real time from expressed sequence tag databases. Gene Expr 2010; 14:321-36. [PMID: 20635574 PMCID: PMC2954622 DOI: 10.3727/105221610x12717040569820] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
An accumulation of expressed sequence tag (EST) data in the public domain and the availability of bioinformatic programs have made EST gene expression profiling a common practice. However, the utility and validity of using EST databases (e.g., dbEST) has been criticized, particularly for quantitative assessment of gene expression. Problems with EST sequencing errors, library construction, EST annotation, and multiple paralogs make generation of specific and sensitive qualitative arid quantitative expression profiles a concern. In addition, most EST-derived expression data exists in previously assembled databases. The Virtual Northern Blot (VNB) (http: //tlab.bu.edu/vnb.html) allows generation, evaluation, and optimization of expression profiles in real time, which is especially important for alternatively spliced, novel, or poorly characterized genes. Representative gene families with variable nucleotide sequence identity, tissue specificity, and levels of expression (bcl-xl, aldoA, and cyp2d9) are used to assess the quality of VNB's output. The profiles generated by VNB are more sensitive and specific than those constructed with ESTs listed in preindexed databases at UCSC and NCBI. Moreover, quantitative expression profiles produced by VNB are comparable to quantization obtained from Northern blots and qPCR. The VNB pipeline generates real-time gene expression profiles for single-gene queries that are both qualitatively and quantitatively reliable.
Collapse
Affiliation(s)
| | | | - Dimitry Leyfer
- †Bioinformatics Program, Boston University, Boston, MA, USA
| | - Laura Yerkes
- *Biology Department, Boston University, Boston, MA, USA
| | - Donald Cramer
- *Biology Department, Boston University, Boston, MA, USA
| | - Dean R. Tolan
- *Biology Department, Boston University, Boston, MA, USA
- †Bioinformatics Program, Boston University, Boston, MA, USA
| |
Collapse
|
4
|
Gu L, Guo R. Genome-wide detection and analysis of alternative splicing for nucleotide binding site-leucine-rich repeats sequences in rice. J Genet Genomics 2009; 34:247-57. [PMID: 17498622 DOI: 10.1016/s1673-8527(07)60026-5] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2006] [Accepted: 08/03/2006] [Indexed: 11/20/2022]
Abstract
Alternative splicing is a major contributor to genomic complexity and proteome diversity, yet the analysis of alternative splicing for the sequence containing nucleotide binding site and leucine-rich repeats (NBS-LRR) domain has not been explored in rice (Oryza sativa L.). Hidden Markov model (HMM) searches were performed for NBS-LRR domain. 875 NBS-LRR-encoding sequences were obtained from the Institute for Genomic Research (TIGR). All of them were used to blast Knowledge-based Oryza Molecular Biological Encyclopaedia (KOME), TIGR rice gene index (TGI), and Universal Protein Resource (UniProt) to obtain homologous full-length cDNAs (FL-cDNAs), tentative consensus sequences, and protein sequences. Alternative splicing events were detected from genomic alignment of FL-cDNAs, tentative consensus sequences, and protein sequences, which provide valuable information on splice variants of genes. These sequences were aligned to the corresponding BAC sequences using the Spidey and Sim4 programs and each of the proteins was aligned by tBLASTn. Of the 875 NBS-LRR sequences, 119 (13.6%) sequences had alternative splicing where multiple FL-cDNAs, TGI sequences and proteins corresponded to the same gene. 71 intron retention events, 20 exon skipping events, 16 alternative termination events, 25 alternative initiation events, 12 alternative 5' splicing events, and 16 alternative 3' splicing events were identified. Most of these alternative splices were supported by two or more transcripts. The data sets are available at http://www.bioinfor.org Furthermore, the bioinformatics analysis of splice boundaries showed that exon skipping and intron retention did not exhibit strong consensus. This implies a different regulation mechanism that guides the expression of splice isoforms. This article also presents the analysis of the effects of intron retention on proteins. The C-terminal regions of alternative proteins turned out to be more variable than the N-terminal regions. Finally, tissue distribution and protein localization of alternative splicing were explored. The largest categories of tissue distributions for alternative splicing were shoot and callus. More than one-thirds of protein localization for splice forms was plasma membrane and cytoplasm. All the NBS-LRR proteins for splice forms may have important function in disease resistance and activate downstream signaling pathways.
Collapse
Affiliation(s)
- Lianfeng Gu
- College of Agriculture, Guangdong Ocean University, Zhanjiang 524088, China
| | | |
Collapse
|
5
|
Akepati VR, Müller EC, Otto A, Strauss HM, Portwich M, Alexander C. Characterization of OPA1 isoforms isolated from mouse tissues. J Neurochem 2008; 106:372-83. [PMID: 18419770 DOI: 10.1111/j.1471-4159.2008.05401.x] [Citation(s) in RCA: 78] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
OPA1, a nuclear encoded mitochondrial protein causing autosomal dominant optic atrophy, is a key player in mitochondrial fusion and cristae morphology regulation. In the present study, we have compared the OPA1 transcription and translation products of different mouse tissues. Unlike in humans, we found only two exons (4b and 5b) to be involved in alternative splicing. The relative abundance of the resulting four different splice variants is tissue-dependent. Proteolytic cleavage by mitochondrial processing peptidase generates two long forms, isoforms 1 and 7, which lead to three short forms representing the end products after further proteolytic processing. In contrast, isoforms 5 and 8 are directly processed into their corresponding short forms. Short form 1 molecules form 184 kDa dimers, whereas all other isoforms contribute to 285 kDa complexes. Coiled-coil domains of the OPA1 protein specifically homo-associate and may be involved in the formation of these complexes. Furthermore, the region encoded by exon 5b inhibits the self-association of coiled-coil domain-I. Finally, our data pinpoint isoform 1 as the, by far, most abundant isoform in the nervous tissue. We postulate that manipulation of isoform 1 protein levels in relation to the other isoforms induces changes in the mitochondrial network in the cell and therefore, mutations affecting the level of functional isoform 1 could lead to devastating effects on retinal ganglion cells.
Collapse
Affiliation(s)
- Vasudheva Reddy Akepati
- Department of Neurodegeneration, Max-Delbrück-Center for Molecular Medicine, Berlin, Germany
| | | | | | | | | | | |
Collapse
|
6
|
Abstract
In recent years, genome-wide detection of alternative splicing based on Expressed Sequence Tag (EST) sequence alignments with mRNA and genomic sequences has dramatically expanded our understanding of the role of alternative splicing in functional regulation. This chapter reviews the data, methodology, and technical challenges of these genome-wide analyses of alternative splicing, and briefly surveys some of the uses to which such alternative splicing databases have been put. For example, with proper alternative splicing database schema design, it is possible to query genome-wide for alternative splicing patterns that are specific to particular tissues, disease states (e.g., cancer), gender, or developmental stages. EST alignments can be used to estimate exon inclusion or exclusion level of alternatively spliced exons and evolutionary changes for various species can be inferred from exon inclusion level. Such databases can also help automate design of probes for RT-PCR and microarrays, enabling high throughput experimental measurement of alternative splicing.
Collapse
|
7
|
Tang H, Heeley T, Morlec R, Hubbard SJ. Characterising alternate splicing and tissue specific expression in the chicken from ESTs. Cytogenet Genome Res 2007; 117:268-77. [PMID: 17675868 PMCID: PMC2266501 DOI: 10.1159/000103188] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2006] [Accepted: 11/15/2006] [Indexed: 01/19/2023] Open
Abstract
Alternate splicing is believed to produce the greatest diversity in transcriptional complexity and function in eukaryotic species. In this study, we present an analysis of alternative splicing events that occur in the chicken, using the recently sequenced genomic sequence and over 580,000 EST sequences mapped back to the genome. A carefully controlled EST-to-genome mapping pipeline is presented, based around the EXONERATE program using the est2genome model, which also considers several quality control steps to filter out erroneous matches. The data is then used to estimate the level of alternate splicing events with respect to Ensembl predicted transcripts. The EST-genome mappings are characterised at the exon level, in order to classify individual splicing events and provide estimates of novel transcripts not currently annotated by the Ensembl genome database. This is the first large scale analysis of this kind in an avian species, and suggests that chicken displays a similar level of alternate splicing as that found in other higher vertebrates such as human and mouse, both in terms of the number of genes that undergo alternate splicing events, and the average number of transcripts produced per gene. The EST data suggests alternate splicing may occur in some 50-60% of the chicken gene set and with an average of around 2.3 transcripts per gene which undergo this process. The EST data is also used to look at gene and transcript usage in the tissues sequenced in embryonic and adult libraries. Genes which display notable biases were analysed in more detail, including twinfilin-2 and embryonic heavy chain myosin. This also highlights several as yet functionally un-annotated genes which appear to be important in embryonic tissues and also undergo alternate splicing events. The analysis also demonstrates some of the difficulties involved in using EST-based data to annotate transcriptional activity in eukaryotic genes, where a broad spectrum of tissues and a large number of sequenced transcripts are required in order to fully characterise alternate splicing and differential expression.
Collapse
Affiliation(s)
- H Tang
- Faculty of Life Sciences, The University of Manchester, Manchester, UK
| | | | | | | |
Collapse
|
8
|
A new advance in alternative splicing databases: from catalogue to detailed analysis of regulation of expression and function of human alternative splicing variants. BMC Bioinformatics 2007; 8:180. [PMID: 17547750 PMCID: PMC1904244 DOI: 10.1186/1471-2105-8-180] [Citation(s) in RCA: 60] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2006] [Accepted: 06/04/2007] [Indexed: 11/25/2022] Open
Abstract
Background Most human genes produce several transcripts with different exon contents by using alternative promoters, alternative polyadenylation sites and alternative splice sites. Much effort has been devoted to describing known gene transcripts through the development of numerous databases. Nevertheless, owing to the diversity of the transcriptome, there is a need for interactive databases that provide information about the potential function of each splicing variant, as well as its expression pattern. Description After setting up a database in which human and mouse splicing variants were compiled, we developed tools (1) to predict the production of protein isoforms from these transcripts, taking account of the presence of open reading frames and mechanisms that could potentially eliminate transcripts and/or inhibit their translation, i.e. nonsense-mediated mRNA decay and microRNAs; (2) to support studies of the regulation of transcript expression at multiple levels, including transcription and splicing, particularly in terms of tissue specificity; and (3) to assist in experimental analysis of the expression of splicing variants. Importantly, analyses of all features from transcript metabolism to functional protein domains were integrated in a highly interactive, user-friendly web interface that allows the functional and regulatory features of gene transcripts to be assessed rapidly and accurately. Conclusion In addition to identifying the transcripts produced by human and mouse genes, fast DB provides tools for analyzing the putative functions of these transcripts and the regulation of their expression. Therefore, fast DB has achieved an advance in alternative splicing databases by providing resources for the functional interpretation of splicing variants for the human and mouse genomes. Because gene expression studies are increasingly employed in clinical analyses, our web interface has been designed to be as user-friendly as possible and to be readily searchable and intelligible at a glance by the whole biomedical community.
Collapse
|
9
|
Nemazanyy I, Panasyuk G, Breus O, Zhyvoloup A, Filonenko V, Gout IT. Identification of a novel CoA synthase isoform, which is primarily expressed in the brain. Biochem Biophys Res Commun 2006; 341:995-1000. [PMID: 16460672 DOI: 10.1016/j.bbrc.2006.01.051] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2005] [Accepted: 01/09/2006] [Indexed: 10/25/2022]
Abstract
CoA and its derivatives Acetyl-CoA and Acyl-CoA are important players in cellular metabolism and signal transduction. CoA synthase is a bifunctional enzyme which mediates the final stages of CoA biosynthesis. In previous studies, we have reported molecular cloning, biochemical characterization, and subcellular localization of CoA synthase (CoASy). Here, we describe the existence of a novel CoA synthase isoform, which is the product of alternative splicing and possesses a 29aa extension at the N-terminus. We termed it CoASy beta and originally identified CoA synthase, CoASy alpha. The transcript specific for CoASy beta was identified by electronic screening and by RT-PCR analysis of various rat tissues. The existence of this novel isoform was further confirmed by immunoblot analysis with antibodies directed to the N-terminal peptide of CoASy beta. In contrast to CoASy alpha, which shows ubiquitous expression, CoASy beta is primarily expressed in the brain. Using confocal microscopy, we demonstrated that both isoforms are localized on mitochondria. The N-terminal extension does not affect the activity of CoA synthase, but possesses a proline-rich sequence which can bring the enzyme into complexes with signalling proteins containing SH3 or WW domains. The role of this novel isoform in CoA biosynthesis, especially in the brain, requires further elucidation.
Collapse
Affiliation(s)
- Ivan Nemazanyy
- Department of Structure and Function of Nucleic Acids, Institute of Molecular Biology and Genetics, 150 Zabolotnogo St, Kyiv 03680, Ukraine.
| | | | | | | | | | | |
Collapse
|