1
|
Yang Y, Xie Y, Li Z, Diala C, Ali M, Li R, Xu Y, Wu A, Kim P, Hosseini SR, Bi E, Zhao H, Zheng WJ. Systematic characterization of protein structural features of alternative splicing isoforms using AlphaFold 2. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.01.30.578053. [PMID: 38464054 PMCID: PMC10925173 DOI: 10.1101/2024.01.30.578053] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/12/2024]
Abstract
Alternative splicing is an important cellular process in eukaryotes, altering pre-mRNA to yield multiple protein isoforms from a single gene. However, our understanding of the impact of alternative splicing events on protein structures is currently constrained by a lack of sufficient protein structural data. To address this limitation, we employed AlphaFold 2, a cutting-edge protein structure prediction tool, to conduct a comprehensive analysis of alternative splicing for approximately 3,000 human genes, providing valuable insights into its impact on the protein structural. Our investigation employed state of the art high-performance computing infrastructure to systematically characterize structural features in alternatively spliced regions and identified changes in protein structure following alternative splicing events. Notably, we found that alternative splicing tends to alter the structure of residues primarily located in coils and beta-sheets. Our research highlighted a significant enrichment of loops and highly exposed residues within human alternatively spliced regions. Specifically, our examination of the Septin-9 protein revealed potential associations between loops and alternative splicing, providing insights into its evolutionary role. Furthermore, our analysis uncovered two missense mutations in the Tau protein that could influence alternative splicing, potentially contributing to the pathogenesis of Alzheimer's disease. In summary, our work, through a thorough statistical analysis of extensive protein structural data, sheds new light on the intricate relationship between alternative splicing, evolution, and human disease.
Collapse
|
2
|
Skeldal S, Voss LF, Lende J, Pedersen SB, Mølgaard S, Kaas M, Demange P, Bentsen AH, Fuglsang M, Sander MR, Buttenschøn H, Gustafsen C, Madsen P, Glerup S. Alternative splicing regulates adaptor protein binding, trafficking, and activity of the Vps10p domain receptor SorCS2 in neuronal development. J Biol Chem 2023; 299:105102. [PMID: 37507021 PMCID: PMC10463258 DOI: 10.1016/j.jbc.2023.105102] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2023] [Revised: 07/12/2023] [Accepted: 07/16/2023] [Indexed: 07/30/2023] Open
Abstract
The Vps10p domain receptor SorCS2 is crucial for the development and function of the nervous system and essential for brain-derived neurotrophic factor (BDNF)-induced changes in neuronal morphology and plasticity. SorCS2 regulates the subcellular trafficking of the BDNF signaling receptor TrkB as well as selected neurotransmitter receptors in a manner that is dependent on the SorCS2 intracellular domain (ICD). However, the cellular machinery and adaptor protein (AP) interactions that regulate receptor trafficking via the SorCS2 ICD are unknown. We here identify four splice variants of human SorCS2 differing in the insertion of an acidic cluster motif and/or a serine residue within the ICD. We show that each variant undergoes posttranslational proteolytic processing into a one- or two-chain receptor, giving rise to eight protein isoforms, the expression of which differs between neuronal and nonneuronal tissues and is affected by cellular stressors. We found that the only variants without the serine were able to rescue BDNF-induced branching of SorCS2 knockout hippocampal neurons, while variants without the acidic cluster showed increased interactions with clathrin-associated APs AP-1, AP-2, and AP-3. Using yeast two-hybrid screens, we further discovered that all variants bound dynein light chain Tctex-type 3; however, only variants with an acidic cluster motif bound kinesin light chain 1. Accordingly, splice variants showed markedly different trafficking properties and localized to different subcellular compartments. Taken together, our findings demonstrate the existence of eight functional SorCS2 isoforms with differential capacity for interactions with cytosolic ligands dynein light chain Tctex-type 3 and kinesin light chain 1, which potentially allows cell-type specific SorCS2 trafficking and BDNF signaling.
Collapse
Affiliation(s)
- Sune Skeldal
- Department of Biomedicine, Aarhus University, Aarhus C, Denmark
| | | | - Jonas Lende
- Department of Biomedicine, Aarhus University, Aarhus C, Denmark
| | | | - Simon Mølgaard
- Department of Biomedicine, Aarhus University, Aarhus C, Denmark
| | - Mathias Kaas
- Department of Biomedicine, Aarhus University, Aarhus C, Denmark
| | - Perline Demange
- Department of Biomedicine, Aarhus University, Aarhus C, Denmark
| | - Andreas Høiberg Bentsen
- Department of Surgical Gastroenterology and Transplantation, Rigshospitalet, University of Copenhagen, Copenhagen, Denmark
| | - Marie Fuglsang
- Department of Biomedicine, Aarhus University, Aarhus C, Denmark
| | | | - Henriette Buttenschøn
- NIDO | Centre for Research and Education, Gødstrup Hospital, Herning, Denmark; Department of Clinical Medicine, Aarhus University, Aarhus N, Denmark
| | | | - Peder Madsen
- Department of Biomedicine, Aarhus University, Aarhus C, Denmark.
| | - Simon Glerup
- Department of Biomedicine, Aarhus University, Aarhus C, Denmark.
| |
Collapse
|
3
|
Desai J, Francis C, Longo K, Hoss A. OUP accepted manuscript. Nucleic Acids Res 2022; 50:3128-3141. [PMID: 35286381 PMCID: PMC8989546 DOI: 10.1093/nar/gkac155] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2021] [Revised: 02/14/2022] [Accepted: 02/26/2022] [Indexed: 11/13/2022] Open
Abstract
Alternative splicing is frequently involved in the diversification of protein function and can also be modulated for therapeutic purposes. Here we develop a predictive model, called Exon ByPASS (predicting Exon skipping Based on Protein amino acid SequenceS), to assess the criticality of exon inclusion based solely on information contained in the amino acid sequence upstream and downstream of the exon junctions. By focusing on protein sequence, Exon ByPASS predicts exon skipping independent of tissue and species in the absence of any intronic information. We validate model predictions using transcriptomic and proteomic data and show that the model can capture exon skipping in different tissues and species. Additionally, we reveal potential therapeutic opportunities by predicting synthetically skippable exons and neo-junctions arising in cancer cells.
Collapse
Affiliation(s)
- Jigar Desai
- To whom correspondence should be addressed. Tel: +1 704 214 7914;
| | | | | | - Andrew Hoss
- Wave Life Sciences, Cambridge, MA 02138, USA
| |
Collapse
|
4
|
Sulakhe D, D'Souza M, Wang S, Balasubramanian S, Athri P, Xie B, Canzar S, Agam G, Gilliam TC, Maltsev N. Exploring the functional impact of alternative splicing on human protein isoforms using available annotation sources. Brief Bioinform 2020; 20:1754-1768. [PMID: 29931155 DOI: 10.1093/bib/bby047] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2018] [Revised: 05/02/2018] [Indexed: 12/30/2022] Open
Abstract
In recent years, the emphasis of scientific inquiry has shifted from whole-genome analyses to an understanding of cellular responses specific to tissue, developmental stage or environmental conditions. One of the central mechanisms underlying the diversity and adaptability of the contextual responses is alternative splicing (AS). It enables a single gene to encode multiple isoforms with distinct biological functions. However, to date, the functions of the vast majority of differentially spliced protein isoforms are not known. Integration of genomic, proteomic, functional, phenotypic and contextual information is essential for supporting isoform-based modeling and analysis. Such integrative proteogenomics approaches promise to provide insights into the functions of the alternatively spliced protein isoforms and provide high-confidence hypotheses to be validated experimentally. This manuscript provides a survey of the public databases supporting isoform-based biology. It also presents an overview of the potential global impact of AS on the human canonical gene functions, molecular interactions and cellular pathways.
Collapse
Affiliation(s)
- Dinanath Sulakhe
- Department of Human Genetics, University of Chicago, 920 E. 58th Street, Chicago, IL, USA.,Computation Institute, University of Chicago, 5735 S. Ellis Avenue, Chicago, IL, USA
| | - Mark D'Souza
- Department of Human Genetics, University of Chicago, 920 E. 58th Street, Chicago, IL, USA
| | - Sheng Wang
- Department of Human Genetics, University of Chicago, 920 E. 58th Street, Chicago, IL, USA.,Toyota Technological Institute at Chicago, 6045 S. Kenwood Avenue, Chicago, IL, USA
| | - Sandhya Balasubramanian
- Department of Human Genetics, University of Chicago, 920 E. 58th Street, Chicago, IL, USA.,Genentech, Inc. 1 DNA Way, Mail Stop: 35-6J, South San Francisco, CA, USA
| | - Prashanth Athri
- Department of Computer Science and Engineering, Amrita School of Engineering, Bengaluru, Amrita Vishwa Vidyapeetham, Kasavanahalli, Carmelaram P.O., Bengaluru, Karnataka, India
| | - Bingqing Xie
- Department of Human Genetics, University of Chicago, 920 E. 58th Street, Chicago, IL, USA.,Department of Computer Science, Illinois Institute of Technology, Chicago, IL, USA
| | - Stefan Canzar
- Toyota Technological Institute at Chicago, 6045 S. Kenwood Avenue, Chicago, IL, USA.,Gene Center, Ludwig-Maximilians-Universität München, Munich, Germany
| | - Gady Agam
- Department of Computer Science, Illinois Institute of Technology, Chicago, IL, USA
| | - T Conrad Gilliam
- Department of Human Genetics, University of Chicago, 920 E. 58th Street, Chicago, IL, USA.,Computation Institute, University of Chicago, 5735 S. Ellis Avenue, Chicago, IL, USA
| | - Natalia Maltsev
- Department of Human Genetics, University of Chicago, 920 E. 58th Street, Chicago, IL, USA.,Computation Institute, University of Chicago, 5735 S. Ellis Avenue, Chicago, IL, USA
| |
Collapse
|
5
|
Che Y, Fu L. Aberrant expression and regulatory network of splicing factor-SRSF3 in tumors. J Cancer 2020; 11:3502-3511. [PMID: 32284746 PMCID: PMC7150454 DOI: 10.7150/jca.42645] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2019] [Accepted: 02/26/2020] [Indexed: 12/14/2022] Open
Abstract
Alternative splicing facilitates the splicing of precursor RNA into different isoforms. Alternatively spliced transcripts often exhibit antagonistic functions or differential temporal or spatial expression patterns. There is increasing evidence that alternative splicing, especially by the serine-arginine rich (SR) protein family, leads to abnormal expression patterns and is closely related to the development of cancer. SRSF3, also known as SRp20, is a splicing factor. Through alternative splicing, it plays important roles in regulating various biological functions, such as cell cycle, cell proliferation, migration and invasion, under pathological and physiological conditions. Deregulation of SRSF3 is an essential feature of cancers. SRSF3 is also considered a candidate therapeutic target. Therefore, the involvement of abnormal splicing in tumorigenesis and the regulation of splicing factors deserve further analysis and discussion. Here, we summarize the function of SRSF3-regulated alternative transcripts in cancer cell biology at different stages of tumor development and the regulation of SRSF3 in tumorigenesis.
Collapse
Affiliation(s)
- Yingying Che
- Institute of Chronic Disease, Qingdao Municipal Hospital, Qingdao University, Qingdao 266000, China
| | - Lin Fu
- Institute of Chronic Disease, Qingdao Municipal Hospital, Qingdao University, Qingdao 266000, China
| |
Collapse
|
6
|
Identification and Analysis of Micro-Exon Genes in the Rice Genome. Int J Mol Sci 2019; 20:ijms20112685. [PMID: 31159166 PMCID: PMC6600660 DOI: 10.3390/ijms20112685] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2019] [Revised: 05/25/2019] [Accepted: 05/29/2019] [Indexed: 11/24/2022] Open
Abstract
Micro-exons are a kind of exons with lengths no more than 51 nucleotides. They are generally ignored in genome annotation due to the short length, whereas recent studies indicate that they have special splicing properties and important functions. Considering that there has been no genome-wide study of micro-exons in plants up to now, we screened and analyzed genes containing micro-exons in two indica rice varieties in this study. According to the annotation of Zhenshan 97 (ZS97) and Minghui 63 (MH63), ~23% of genes possess micro-exons. We then identified micro-exons from RNA-seq data and found that >65% micro-exons had been annotated and most of novel micro-exons were located in gene regions. About 60% micro-exons were constitutively spliced, and the others were alternatively spliced in different tissues. Besides, we observed that approximately 54% of genes harboring micro-exons tended to be ancient genes, and 13% were Oryza genus-specific. Micro-exon genes were highly conserved in Oryza genus with consistent domains. In particular, the predicted protein structures showed that alternative splicing of in-frame micro-exons led to a local structural recombination, which might affect some core structure of domains, and alternative splicing of frame-shifting micro-exons usually resulted in premature termination of translation by introducing a stop codon or missing functional domains. Overall, our study provided the genome-wide distribution, evolutionary conservation, and potential functions of micro-exons in rice.
Collapse
|
7
|
Angiolini F, Belloni E, Giordano M, Campioni M, Forneris F, Paronetto MP, Lupia M, Brandas C, Pradella D, Di Matteo A, Giampietro C, Jodice G, Luise C, Bertalot G, Freddi S, Malinverno M, Irimia M, Moulton JD, Summerton J, Chiapparino A, Ghilardi C, Giavazzi R, Nyqvist D, Gabellini D, Dejana E, Cavallaro U, Ghigna C. A novel L1CAM isoform with angiogenic activity generated by NOVA2-mediated alternative splicing. eLife 2019; 8:44305. [PMID: 30829570 PMCID: PMC6398979 DOI: 10.7554/elife.44305] [Citation(s) in RCA: 40] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2018] [Accepted: 02/08/2019] [Indexed: 12/13/2022] Open
Abstract
The biological players involved in angiogenesis are only partially defined. Here, we report that endothelial cells (ECs) express a novel isoform of the cell-surface adhesion molecule L1CAM, termed L1-ΔTM. The splicing factor NOVA2, which binds directly to L1CAM pre-mRNA, is necessary and sufficient for the skipping of L1CAM transmembrane domain in ECs, leading to the release of soluble L1-ΔTM. The latter exerts high angiogenic function through both autocrine and paracrine activities. Mechanistically, L1-ΔTM-induced angiogenesis requires fibroblast growth factor receptor-1 signaling, implying a crosstalk between the two molecules. NOVA2 and L1-ΔTM are overexpressed in the vasculature of ovarian cancer, where L1-ΔTM levels correlate with tumor vascularization, supporting the involvement of NOVA2-mediated L1-ΔTM production in tumor angiogenesis. Finally, high NOVA2 expression is associated with poor outcome in ovarian cancer patients. Our results point to L1-ΔTM as a novel, EC-derived angiogenic factor which may represent a target for innovative antiangiogenic therapies. Growing tumors stimulate the formation of new blood vessels to supply the oxygen and nutrients the cancerous cells need to stay alive. Stopping tumors from forming the blood vessels could therefore help us to treat cancer. To do so, we need to understand how different proteins control when and how blood vessels develop. Cells make proteins by first ‘transcribing’ genes to form RNA molecules. In many cases, the RNA then goes through a process called alternative splicing. Proteins known as splicing factors cut out different segments of the RNA molecule and stick together the remaining segments to form templates for protein production. This enables a single gene to produce many different variants of a protein. Angiolini, Belloni, Giordano et al. have now studied mouse and human versions of the cells that line the blood vessels grown by tumors. This revealed that a splicing factor called NOVA2 targets a protein called L1CAM, which is normally responsible for gluing adjacent cells together. Angiolini et al. found that NOVA2 splices L1CAM into a form not seen before. Instead of remaining anchored to cell surfaces, the newly identified form of L1CAM is released into the blood circulation, where it stimulates new blood vessels to grow. Samples taken from the blood vessels of human ovarian tumors showed high levels of both NOVA2 and the modified form of L1CAM, while blood vessels in healthy tissue contain no, or very low levels of both proteins. Therefore, if the new form of L1CAM can be detected in the blood, it could be used to help cancer diagnosis, and to indicate which patients would benefit from treatments that restrict the growth of blood vessels in tumors. Further work is now needed to explore these possibilities.
Collapse
Affiliation(s)
- Francesca Angiolini
- Unit of Gynecological Oncology Research, Program of Gynecological Oncology, IEO, European Institute of Oncology IRCCS, Milan, Italy
| | - Elisa Belloni
- Istituto di Genetica Molecolare, Consiglio Nazionale delle Ricerche, Pavia, Italy
| | - Marco Giordano
- Unit of Gynecological Oncology Research, Program of Gynecological Oncology, IEO, European Institute of Oncology IRCCS, Milan, Italy
| | - Matteo Campioni
- The Armenise-Harvard Laboratory of Structural Biology, Department of Biology and Biotechnology, University of Pavia, Pavia, Italy
| | - Federico Forneris
- The Armenise-Harvard Laboratory of Structural Biology, Department of Biology and Biotechnology, University of Pavia, Pavia, Italy
| | - Maria Paola Paronetto
- Department of Movement, Human and Health Sciences, Università degli Studi di Roma "Foro Italico", Rome, Italy
| | - Michela Lupia
- Unit of Gynecological Oncology Research, Program of Gynecological Oncology, IEO, European Institute of Oncology IRCCS, Milan, Italy
| | - Chiara Brandas
- Istituto di Genetica Molecolare, Consiglio Nazionale delle Ricerche, Pavia, Italy
| | - Davide Pradella
- Istituto di Genetica Molecolare, Consiglio Nazionale delle Ricerche, Pavia, Italy.,Università degli Studi di Pavia, Pavia, Italy
| | - Anna Di Matteo
- Istituto di Genetica Molecolare, Consiglio Nazionale delle Ricerche, Pavia, Italy
| | | | - Giovanna Jodice
- Molecular Medicine Program, IEO, European Institute of Oncology IRCCS, Milan, Italy
| | - Chiara Luise
- Molecular Medicine Program, IEO, European Institute of Oncology IRCCS, Milan, Italy
| | - Giovanni Bertalot
- Molecular Medicine Program, IEO, European Institute of Oncology IRCCS, Milan, Italy
| | - Stefano Freddi
- Molecular Medicine Program, IEO, European Institute of Oncology IRCCS, Milan, Italy
| | | | - Manuel Irimia
- Centre for Genomic Regulation, The Barcelona Institute of Science and Technology, Barcelona, Spain.,Universitat Pompeu Fabra, Barcelona, Spain.,Institució Catalana de Recerca i Estudis Avançats, Barcelona, Spain
| | | | | | - Antonella Chiapparino
- The Armenise-Harvard Laboratory of Structural Biology, Department of Biology and Biotechnology, University of Pavia, Pavia, Italy
| | - Carmen Ghilardi
- Laboratory of Biology and Treatment of Metastasis, IRCCS-Istituto di Ricerche Farmacologiche Mario Negri, Milan, Italy
| | - Raffaella Giavazzi
- Laboratory of Biology and Treatment of Metastasis, IRCCS-Istituto di Ricerche Farmacologiche Mario Negri, Milan, Italy
| | - Daniel Nyqvist
- Division of Vascular Biology, Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Stockholm, Sweden
| | - Davide Gabellini
- Division of Genetics and Cell Biology, IRCCS San Raffaele Scientific Institute, Milan, Italy
| | - Elisabetta Dejana
- FIRC Institute of Molecular Oncology, Milan, Italy.,Rudbeck Laboratory and Science for Life Laboratory, Department of Immunology, Genetics and Pathology, Uppsala University, Uppsala, Sweden
| | - Ugo Cavallaro
- Unit of Gynecological Oncology Research, Program of Gynecological Oncology, IEO, European Institute of Oncology IRCCS, Milan, Italy
| | - Claudia Ghigna
- Istituto di Genetica Molecolare, Consiglio Nazionale delle Ricerche, Pavia, Italy
| |
Collapse
|
8
|
Bush SJ, Chen L, Tovar-Corona JM, Urrutia AO. Alternative splicing and the evolution of phenotypic novelty. Philos Trans R Soc Lond B Biol Sci 2017; 372:rstb.2015.0474. [PMID: 27994117 DOI: 10.1098/rstb.2015.0474] [Citation(s) in RCA: 111] [Impact Index Per Article: 15.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/01/2016] [Indexed: 12/21/2022] Open
Abstract
Alternative splicing, a mechanism of post-transcriptional RNA processing whereby a single gene can encode multiple distinct transcripts, has been proposed to underlie morphological innovations in multicellular organisms. Genes with developmental functions are enriched for alternative splicing events, suggestive of a contribution of alternative splicing to developmental programmes. The role of alternative splicing as a source of transcript diversification has previously been compared to that of gene duplication, with the relationship between the two extensively explored. Alternative splicing is reduced following gene duplication with the retention of duplicate copies higher for genes which were alternatively spliced prior to duplication. Furthermore, and unlike the case for overall gene number, the proportion of alternatively spliced genes has also increased in line with the evolutionary diversification of cell types, suggesting alternative splicing may contribute to the complexity of developmental programmes. Together these observations suggest a prominent role for alternative splicing as a source of functional innovation. However, it is unknown whether the proliferation of alternative splicing events indeed reflects a functional expansion of the transcriptome or instead results from weaker selection acting on larger species, which tend to have a higher number of cell types and lower population sizes.This article is part of the themed issue 'Evo-devo in the genomics era, and the origins of morphological diversity'.
Collapse
Affiliation(s)
- Stephen J Bush
- The Roslin Institute, University of Edinburgh, Edinburgh EH25 9RG, UK
| | - Lu Chen
- West China Second University Hospital, State Key Laboratory of Biotherapy, Sichuan University, Chengdu 610041, People's Republic of China
| | | | - Araxi O Urrutia
- Department of Biology and Biochemistry, University of Bath, Bath BA2 7AY, UK .,Milner Centre for Evolution, University of Bath, Bath BA2 7AY, UK
| |
Collapse
|
9
|
Ghadie MA, Lambourne L, Vidal M, Xia Y. Domain-based prediction of the human isoform interactome provides insights into the functional impact of alternative splicing. PLoS Comput Biol 2017; 13:e1005717. [PMID: 28846689 PMCID: PMC5591010 DOI: 10.1371/journal.pcbi.1005717] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2017] [Revised: 09/08/2017] [Accepted: 08/03/2017] [Indexed: 11/19/2022] Open
Abstract
Alternative splicing is known to remodel protein-protein interaction networks (“interactomes”), yet large-scale determination of isoform-specific interactions remains challenging. We present a domain-based method to predict the isoform interactome from the reference interactome. First, we construct the domain-resolved reference interactome by mapping known domain-domain interactions onto experimentally-determined interactions between reference proteins. Then, we construct the isoform interactome by predicting that an isoform loses an interaction if it loses the domain mediating the interaction. Our prediction framework is of high-quality when assessed by experimental data. The predicted human isoform interactome reveals extensive network remodeling by alternative splicing. Protein pairs interacting with different isoforms of the same gene tend to be more divergent in biological function, tissue expression, and disease phenotype than protein pairs interacting with the same isoforms. Our prediction method complements experimental efforts, and demonstrates that integrating structural domain information with interactomes provides insights into the functional impact of alternative splicing. Protein-protein interaction networks have been extensively used in systems biology to study the role of proteins in cell function and disease. However, current network biology studies typically assume that one gene encodes one protein isoform, ignoring the effect of alternative splicing. Alternative splicing allows a gene to produce multiple protein isoforms, by alternatively selecting distinct regions in the gene to be translated to protein products. Here, we present a computational method to predict and analyze the large-scale effect of alternative splicing on protein-protein interaction networks. Starting with a reference protein-protein interaction network determined by experiments, our method annotates protein-protein interactions with domain-domain interactions, and predicts that a protein isoform loses an interaction if it loses the domain mediating the interaction as a result of alternative splicing. Our predictions reveal the central role of alternative splicing in extensively remodeling the human protein-protein interaction network, and in increasing the functional complexity of the human cell. Our prediction method complements ongoing experimental efforts by predicting isoform-specific interactions for genes not tested yet by experiments and providing insights into the functional impact of alternative splicing.
Collapse
Affiliation(s)
- Mohamed Ali Ghadie
- Department of Bioengineering, McGill University, Montreal, Québec, Canada
| | - Luke Lambourne
- Department of Bioengineering, McGill University, Montreal, Québec, Canada
| | - Marc Vidal
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, Massachusetts, United States of America
- Department of Genetics, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Yu Xia
- Department of Bioengineering, McGill University, Montreal, Québec, Canada
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, Massachusetts, United States of America
- * E-mail:
| |
Collapse
|
10
|
Climente-González H, Porta-Pardo E, Godzik A, Eyras E. The Functional Impact of Alternative Splicing in Cancer. Cell Rep 2017; 20:2215-2226. [DOI: 10.1016/j.celrep.2017.08.012] [Citation(s) in RCA: 358] [Impact Index Per Article: 51.1] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2017] [Revised: 07/15/2017] [Accepted: 07/26/2017] [Indexed: 12/29/2022] Open
|
11
|
Wang J, Dumartin L, Mafficini A, Ulug P, Sangaralingam A, Alamiry NA, Radon TP, Salvia R, Lawlor RT, Lemoine NR, Scarpa A, Chelala C, Crnogorac-Jurcevic T. Splice variants as novel targets in pancreatic ductal adenocarcinoma. Sci Rep 2017; 7:2980. [PMID: 28592875 PMCID: PMC5462735 DOI: 10.1038/s41598-017-03354-z] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2017] [Accepted: 04/26/2017] [Indexed: 12/22/2022] Open
Abstract
Despite a wealth of genomic information, a comprehensive alternative splicing (AS) analysis of pancreatic ductal adenocarcinoma (PDAC) has not been performed yet. In the present study, we assessed whole exome-based transcriptome and AS profiles of 43 pancreas tissues using Affymetrix exon array. The AS analysis of PDAC indicated on average two AS probe-sets (ranging from 1-28) in 1,354 significantly identified protein-coding genes, with skipped exon and alternative first exon being the most frequently utilised. In addition to overrepresented extracellular matrix (ECM)-receptor interaction and focal adhesion that were also seen in transcriptome differential expression (DE) analysis, Fc gamma receptor-mediated phagocytosis and axon guidance AS genes were also highly represented. Of note, the highest numbers of AS probe-sets were found in collagen genes, which encode the characteristically abundant stroma seen in PDAC. We also describe a set of 37 'hypersensitive' genes which were frequently targeted by somatic mutations, copy number alterations, DE and AS, indicating their propensity for multidimensional regulation. We provide the most comprehensive overview of the AS landscape in PDAC with underlying changes in the spliceosomal machinery. We also collate a set of AS and DE genes encoding cell surface proteins, which present promising diagnostic and therapeutic targets in PDAC.
Collapse
Affiliation(s)
- Jun Wang
- Centre for Molecular Oncology, Barts Cancer Institute, Queen Mary University of London, John Vane Science Centre, London, EC1M 6BQ, UK.
| | - Laurent Dumartin
- Centre for Molecular Oncology, Barts Cancer Institute, Queen Mary University of London, John Vane Science Centre, London, EC1M 6BQ, UK
| | - Andrea Mafficini
- ARC-Net Research Centre and Department of Diagnostics and Publich Health, Section of Pathology, University and Hospital Trust of Verona, Verona, Italy
| | - Pinar Ulug
- Centre for Molecular Oncology, Barts Cancer Institute, Queen Mary University of London, John Vane Science Centre, London, EC1M 6BQ, UK
| | - Ajanthah Sangaralingam
- Centre for Molecular Oncology, Barts Cancer Institute, Queen Mary University of London, John Vane Science Centre, London, EC1M 6BQ, UK
| | - Namaa Audi Alamiry
- Centre for Molecular Oncology, Barts Cancer Institute, Queen Mary University of London, John Vane Science Centre, London, EC1M 6BQ, UK
| | - Tomasz P Radon
- Centre for Molecular Oncology, Barts Cancer Institute, Queen Mary University of London, John Vane Science Centre, London, EC1M 6BQ, UK
| | - Roberto Salvia
- ARC-Net Research Centre and Department of Diagnostics and Publich Health, Section of Pathology, University and Hospital Trust of Verona, Verona, Italy
| | - Rita T Lawlor
- ARC-Net Research Centre and Department of Diagnostics and Publich Health, Section of Pathology, University and Hospital Trust of Verona, Verona, Italy
| | - Nicholas R Lemoine
- Centre for Molecular Oncology, Barts Cancer Institute, Queen Mary University of London, John Vane Science Centre, London, EC1M 6BQ, UK
| | - Aldo Scarpa
- ARC-Net Research Centre and Department of Diagnostics and Publich Health, Section of Pathology, University and Hospital Trust of Verona, Verona, Italy
| | - Claude Chelala
- Centre for Molecular Oncology, Barts Cancer Institute, Queen Mary University of London, John Vane Science Centre, London, EC1M 6BQ, UK
| | - Tatjana Crnogorac-Jurcevic
- Centre for Molecular Oncology, Barts Cancer Institute, Queen Mary University of London, John Vane Science Centre, London, EC1M 6BQ, UK.
| |
Collapse
|
12
|
Kwon YJ, Park MJ, Kim SG, Baldwin IT, Park CM. Alternative splicing and nonsense-mediated decay of circadian clock genes under environmental stress conditions in Arabidopsis. BMC PLANT BIOLOGY 2014; 14:136. [PMID: 24885185 PMCID: PMC4035800 DOI: 10.1186/1471-2229-14-136] [Citation(s) in RCA: 53] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/05/2014] [Accepted: 05/14/2014] [Indexed: 05/18/2023]
Abstract
BACKGROUND The circadian clock enables living organisms to anticipate recurring daily and seasonal fluctuations in their growth habitats and synchronize their biology to the environmental cycle. The plant circadian clock consists of multiple transcription-translation feedback loops that are entrained by environmental signals, such as light and temperature. In recent years, alternative splicing emerges as an important molecular mechanism that modulates the clock function in plants. Several clock genes are known to undergo alternative splicing in response to changes in environmental conditions, suggesting that the clock function is intimately associated with environmental responses via the alternative splicing of the clock genes. However, the alternative splicing events of the clock genes have not been studied at the molecular level. RESULTS We systematically examined whether major clock genes undergo alternative splicing under various environmental conditions in Arabidopsis. We also investigated the fates of the RNA splice variants of the clock genes. It was found that the clock genes, including EARLY FLOWERING 3 (ELF3) and ZEITLUPE (ZTL) that have not been studied in terms of alternative splicing, undergo extensive alternative splicing through diverse modes of splicing events, such as intron retention, exon skipping, and selection of alternative 5' splice site. Their alternative splicing patterns were differentially influenced by changes in photoperiod, temperature extremes, and salt stress. Notably, the RNA splice variants of TIMING OF CAB EXPRESSION 1 (TOC1) and ELF3 were degraded through the nonsense-mediated decay (NMD) pathway, whereas those of other clock genes were insensitive to NMD. CONCLUSION Taken together, our observations demonstrate that the major clock genes examined undergo extensive alternative splicing under various environmental conditions, suggesting that alternative splicing is a molecular scheme that underlies the linkage between the clock and environmental stress adaptation in plants. It is also envisioned that alternative splicing of the clock genes plays more complex roles than previously expected.
Collapse
Affiliation(s)
- Young-Ju Kwon
- Department of Chemistry, Seoul National University, Seoul 151-742, Korea
| | - Mi-Jeong Park
- Department of Chemistry, Seoul National University, Seoul 151-742, Korea
| | - Sang-Gyu Kim
- Department of Molecular Ecology, Max Planck Institute for Chemical Ecology, 07745 Jena, Germany
| | - Ian T Baldwin
- Department of Molecular Ecology, Max Planck Institute for Chemical Ecology, 07745 Jena, Germany
| | - Chung-Mo Park
- Department of Chemistry, Seoul National University, Seoul 151-742, Korea
| |
Collapse
|
13
|
Roy B, Haupt LM, Griffiths LR. Review: Alternative Splicing (AS) of Genes As An Approach for Generating Protein Complexity. Curr Genomics 2013; 14:182-94. [PMID: 24179441 PMCID: PMC3664468 DOI: 10.2174/1389202911314030004] [Citation(s) in RCA: 70] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2012] [Revised: 02/08/2013] [Accepted: 02/25/2013] [Indexed: 12/22/2022] Open
Abstract
Prior to the completion of the human genome project, the human genome was thought to have a greater number of genes as it seemed structurally and functionally more complex than other simpler organisms. This along with the belief of “one gene, one protein”, were demonstrated to be incorrect. The inequality in the ratio of gene to protein formation gave rise to the theory of alternative splicing (AS). AS is a mechanism by which one gene gives rise to multiple protein products. Numerous databases and online bioinformatic tools are available for the detection and analysis of AS. Bioinformatics provides an important approach to study mRNA and protein diversity by various tools such as expressed sequence tag (EST) sequences obtained from completely processed mRNA. Microarrays and deep sequencing approaches also aid in the detection of splicing events. Initially it was postulated that AS occurred only in about 5% of all genes but was later found to be more abundant. Using bioinformatic approaches, the level of AS in human genes was found to be fairly high with 35-59% of genes having at least one AS form. Our ability to determine and predict AS is important as disorders in splicing patterns may lead to abnormal splice variants resulting in genetic diseases. In addition, the diversity of proteins produced by AS poses a challenge for successful drug discovery and therefore a greater understanding of AS would be beneficial.
Collapse
Affiliation(s)
- Bishakha Roy
- Genomics Research Centre, Griffith Health Institute, Griffith University Gold Coast, Queensland 4222, Australia
| | | | | |
Collapse
|
14
|
Ochoa-Leyva A, Montero-Morán G, Saab-Rincón G, Brieba LG, Soberón X. Alternative splice variants in TIM barrel proteins from human genome correlate with the structural and evolutionary modularity of this versatile protein fold. PLoS One 2013; 8:e70582. [PMID: 23950966 PMCID: PMC3741200 DOI: 10.1371/journal.pone.0070582] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2013] [Accepted: 06/20/2013] [Indexed: 12/20/2022] Open
Abstract
After the surprisingly low number of genes identified in the human genome, alternative splicing emerged as a major mechanism to generate protein diversity in higher eukaryotes. However, it is still not known if its prevalence along the genome evolution has contributed to the overall functional protein diversity or if it simply reflects splicing noise. The (βα)8 barrel or TIM barrel is one of the most frequent, versatile, and ancient fold encountered among enzymes. Here, we analyze the structural modifications present in TIM barrel proteins from the human genome product of alternative splicing events. We found that 87% of all splicing events involved deletions; most of these events resulted in protein fragments that corresponded to the (βα)2, (βα)4, (βα)5, (βα)6, and (βα)7 subdomains of TIM barrels. Because approximately 7% of all the splicing events involved internal β-strand substitutions, we decided, based on the genomic data, to design β-strand and α-helix substitutions in a well-studied TIM barrel enzyme. The biochemical characterization of one of the chimeric variants suggests that some of the splice variants in the human genome with β-strand substitutions may be evolving novel functions via either the oligomeric state or substrate specificity. We provide results of how the splice variants represent subdomains that correlate with the independently folding and evolving structural units previously reported. This work is the first to observe a link between the structural features of the barrel and a recurrent genetic mechanism. Our results suggest that it is reasonable to expect that a sizeable fraction of splice variants found in the human genome represent structurally viable functional proteins. Our data provide additional support for the hypothesis of the origin of the TIM barrel fold through the assembly of smaller subdomains. We suggest a model of how nature explores new proteins through alternative splicing as a mechanism to diversify the proteins encoded in the human genome.
Collapse
Affiliation(s)
- Adrián Ochoa-Leyva
- Instituto Nacional de Medicina Genómica (INMEGEN), México City, México
- * E-mail: (AOL); (XS)
| | - Gabriela Montero-Morán
- Laboratorio Nacional de Genómica para la Biodiversidad, Centro de Investigación y de Estudios Avanzados del Instituto Politécnico Nacional, Irapuato, Guanajuato, México
| | - Gloria Saab-Rincón
- Departamento de Ingeniería Celular y Biocatálisis, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México
| | - Luis G. Brieba
- Laboratorio Nacional de Genómica para la Biodiversidad, Centro de Investigación y de Estudios Avanzados del Instituto Politécnico Nacional, Irapuato, Guanajuato, México
| | - Xavier Soberón
- Instituto Nacional de Medicina Genómica (INMEGEN), México City, México
- Departamento de Ingeniería Celular y Biocatálisis, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México
- * E-mail: (AOL); (XS)
| |
Collapse
|
15
|
Kelemen O, Convertini P, Zhang Z, Wen Y, Shen M, Falaleeva M, Stamm S. Function of alternative splicing. Gene 2013; 514:1-30. [PMID: 22909801 PMCID: PMC5632952 DOI: 10.1016/j.gene.2012.07.083] [Citation(s) in RCA: 514] [Impact Index Per Article: 46.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2012] [Revised: 07/21/2012] [Accepted: 07/30/2012] [Indexed: 12/15/2022]
Abstract
Almost all polymerase II transcripts undergo alternative pre-mRNA splicing. Here, we review the functions of alternative splicing events that have been experimentally determined. The overall function of alternative splicing is to increase the diversity of mRNAs expressed from the genome. Alternative splicing changes proteins encoded by mRNAs, which has profound functional effects. Experimental analysis of these protein isoforms showed that alternative splicing regulates binding between proteins, between proteins and nucleic acids as well as between proteins and membranes. Alternative splicing regulates the localization of proteins, their enzymatic properties and their interaction with ligands. In most cases, changes caused by individual splicing isoforms are small. However, cells typically coordinate numerous changes in 'splicing programs', which can have strong effects on cell proliferation, cell survival and properties of the nervous system. Due to its widespread usage and molecular versatility, alternative splicing emerges as a central element in gene regulation that interferes with almost every biological function analyzed.
Collapse
Affiliation(s)
- Olga Kelemen
- Department of Molecular and Cellular Biochemistry, University of Kentucky, Lexington, Kentucky, United States of America
| | - Paolo Convertini
- Department of Molecular and Cellular Biochemistry, University of Kentucky, Lexington, Kentucky, United States of America
| | - Zhaiyi Zhang
- Department of Molecular and Cellular Biochemistry, University of Kentucky, Lexington, Kentucky, United States of America
| | - Yuan Wen
- Department of Molecular and Cellular Biochemistry, University of Kentucky, Lexington, Kentucky, United States of America
| | - Manli Shen
- Department of Molecular and Cellular Biochemistry, University of Kentucky, Lexington, Kentucky, United States of America
| | - Marina Falaleeva
- Department of Molecular and Cellular Biochemistry, University of Kentucky, Lexington, Kentucky, United States of America
| | - Stefan Stamm
- Department of Molecular and Cellular Biochemistry, University of Kentucky, Lexington, Kentucky, United States of America
| |
Collapse
|
16
|
Ellis JD, Barrios-Rodiles M, Colak R, Irimia M, Kim T, Calarco JA, Wang X, Pan Q, O'Hanlon D, Kim PM, Wrana JL, Blencowe BJ. Tissue-specific alternative splicing remodels protein-protein interaction networks. Mol Cell 2012; 46:884-92. [PMID: 22749401 DOI: 10.1016/j.molcel.2012.05.037] [Citation(s) in RCA: 293] [Impact Index Per Article: 24.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2012] [Revised: 03/29/2012] [Accepted: 05/07/2012] [Indexed: 01/12/2023]
Abstract
Alternative splicing plays a key role in the expansion of proteomic and regulatory complexity, yet the functions of the vast majority of differentially spliced exons are not known. In this study, we observe that brain and other tissue-regulated exons are significantly enriched in flexible regions of proteins that likely form conserved interaction surfaces. These proteins participate in significantly more interactions in protein-protein interaction (PPI) networks than other proteins. Using LUMIER, an automated PPI assay, we observe that approximately one-third of analyzed neural-regulated exons affect PPIs. Inclusion of these exons stimulated and repressed different partner interactions at comparable frequencies. This assay further revealed functions of individual exons, including a role for a neural-specific exon in promoting an interaction between Bridging Integrator 1 (Bin1)/Amphiphysin II and Dynamin 2 (Dnm2) that facilitates endocytosis. Collectively, our results provide evidence that regulated alternative exons frequently remodel interactions to establish tissue-dependent PPI networks.
Collapse
Affiliation(s)
- Jonathan D Ellis
- Banting and Best Department of Medical Research, Donnelly Centre, University of Toronto, Toronto, ON M5S 3E1, Canada
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
17
|
Menon R, Roy A, Mukherjee S, Belkin S, Zhang Y, Omenn GS. Functional implications of structural predictions for alternative splice proteins expressed in Her2/neu-induced breast cancers. J Proteome Res 2011; 10:5503-11. [PMID: 22003824 DOI: 10.1021/pr200772w] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
Alternative splicing allows a single gene to generate multiple mRNA transcripts, which can be translated into functionally diverse proteins. However, experimentally determined structures of protein splice isoforms are rare, and homology modeling methods are poor at predicting atomic-level structural differences because of high sequence identity. Here we exploit the state-of-the-art structure prediction method I-TASSER to analyze the structural and functional consequences of alternative splicing of proteins differentially expressed in a breast cancer model. We first successfully benchmarked the I-TASSER pipeline for structure modeling of all seven pairs of protein splice isoforms, which are known to have experimentally solved structures. We then modeled three cancer-related variant pairs reported to have opposite functions. In each pair, we observed structural differences in regions where the presence or absence of a motif can directly influence the distinctive functions of the variants. Finally, we applied the method to five splice variants overexpressed in mouse Her2/neu mammary tumor: anxa6, calu, cdc42, ptbp1, and tax1bp3. Despite >75% sequence identity between the variants, structural differences were observed in biologically important regions of these protein pairs. These results demonstrate the feasibility of integrating proteomic analysis with structure-based conformational predictions of differentially expressed alternative splice variants in cancers and other conditions.
Collapse
Affiliation(s)
- Rajasree Menon
- Center for Computational Medicine and Bioinformatics, University of Michigan, 100 Washtenaw Avenue, Ann Arbor, Michigan 48109-2218, United States.
| | | | | | | | | | | |
Collapse
|
18
|
Ghigna C, Valacca C, Biamonti G. Alternative splicing and tumor progression. Curr Genomics 2011; 9:556-70. [PMID: 19516963 PMCID: PMC2694562 DOI: 10.2174/138920208786847971] [Citation(s) in RCA: 133] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2008] [Revised: 08/12/2008] [Accepted: 08/18/2008] [Indexed: 12/15/2022] Open
Abstract
Alternative splicing is a key molecular mechanism for increasing the functional diversity of the eukaryotic proteomes. A large body of experimental data implicates aberrant splicing in various human diseases, including cancer. Both mutations in cis-acting splicing elements and alterations in the expression and/or activity of splicing regulatory factors drastically affect the splicing profile of many cancer-associated genes. In addition, the splicing profile of several cancer-associated genes is altered in particular types of cancer arguing for a direct role of specific splicing isoforms in tumor progression. Deciphering the mechanisms underlying aberrant splicing in cancer may prove crucial to understand how splicing machinery is controlled and integrated with other cellular processes, in particular transcription and signaling pathways. Moreover, the characterization of splicing deregulation in cancer will lead to a better comprehension of malignant transformation. Cancer-associated alternative splicing variants may be new tools for the diagnosis and classification of cancers and could be the targets for innovative therapeutical interventions based on highly selective splicing correction approaches.
Collapse
Affiliation(s)
- Claudia Ghigna
- Istituto di Genetica Molecolare - Consiglio Nazionale delle Ricerche, Via Abbiategrasso 207. 27100 Pavia, Italy
| | | | | |
Collapse
|
19
|
Calarco JA, Zhen M, Blencowe BJ. Networking in a global world: establishing functional connections between neural splicing regulators and their target transcripts. RNA (NEW YORK, N.Y.) 2011; 17:775-91. [PMID: 21415141 PMCID: PMC3078728 DOI: 10.1261/rna.2603911] [Citation(s) in RCA: 57] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/06/2023]
Abstract
Recent genome-wide analyses have indicated that almost all primary transcripts from multi-exon human genes undergo alternative pre-mRNA splicing (AS). Given the prevalence of AS and its importance in expanding proteomic complexity, a major challenge that lies ahead is to determine the functional specificity of isoforms in a cellular context. A significant fraction of alternatively spliced transcripts are regulated in a tissue- or cell-type-specific manner, suggesting that these mRNA variants likely function in the generation of cellular diversity. Complementary to these observations, several tissue-specific splicing factors have been identified, and a number of methodological advances have enabled the identification of large repertoires of target transcripts regulated by these proteins. An emerging theme is that tissue-specific splicing factors regulate coherent sets of splice variants in genes known to function in related biological pathways. This review focuses on the recent progress in our understanding of neural-specific splicing factors and their regulatory networks and outlines existing and emerging strategies for uncovering important biological roles for the isoforms that comprise these networks.
Collapse
Affiliation(s)
- John A Calarco
- Banting and Best Department of Medical Research, Donnelly Centre, University of Toronto, Toronto, Ontario, Canada.
| | | | | |
Collapse
|
20
|
Hegyi H, Kalmar L, Horvath T, Tompa P. Verification of alternative splicing variants based on domain integrity, truncation length and intrinsic protein disorder. Nucleic Acids Res 2010; 39:1208-19. [PMID: 20972208 PMCID: PMC3045584 DOI: 10.1093/nar/gkq843] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023] Open
Abstract
According to current estimations ∼95% of multi-exonic human protein-coding genes undergo alternative splicing (AS). However, for 4000 human proteins in PDB, only 14 human proteins have structures of at least two alternative isoforms. Surveying these structural isoforms revealed that the maximum insertion accommodated by an isoform of a fully ordered protein domain was 5 amino acids, other instances of domain changes involved intrinsic structural disorder. After collecting 505 minor isoforms of human proteins with evidence for their existence we analyzed their length, protein disorder and exposed hydrophobic surface. We found that strict rules govern the selection of alternative splice variants aimed to preserve the integrity of globular domains: alternative splice sites (i) tend to avoid globular domains or (ii) affect them only marginally or (iii) tend to coincide with a location where the exposed hydrophobic surface is minimal or (iv) the protein is disordered. We also observed an inverse correlation between the domain fraction lost and the full length of the minor isoform containing the domain, possibly indicating a buffering effect for the isoform protein counteracting the domain truncation effect. These observations provide the basis for a prediction method (currently under development) to predict the viability of splice variants.
Collapse
Affiliation(s)
- Hedi Hegyi
- Institute of Enzymology, Biological Research Center, Hungarian Academy of Sciences, PO Box 7, 1518 Budapest, Hungary.
| | | | | | | |
Collapse
|
21
|
Kim R, Guo JT. Systematic analysis of short internal indels and their impact on protein folding. BMC STRUCTURAL BIOLOGY 2010; 10:24. [PMID: 20684774 PMCID: PMC2924343 DOI: 10.1186/1472-6807-10-24] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/11/2010] [Accepted: 08/04/2010] [Indexed: 12/03/2022]
Abstract
Background Protein sequence insertions/deletions (indels) can be introduced during evolution or through alternative splicing (AS). Alternative splicing is an important biological phenomenon and is considered as the major means of expanding structural and functional diversity in eukaryotes. Knowledge of the structural changes due to indels is critical to our understanding of the evolution of protein structure and function. In addition, it can help us probe the evolution of alternative splicing and the diversity of functional isoforms. However, little is known about the effects of indels, in particular the ones involving core secondary structures, on the folding of protein structures. The long term goal of our study is to accurately predict the protein AS isoform structures. As a first step towards this goal, we performed a systematic analysis on the structural changes caused by short internal indels through mining highly homologous proteins in Protein Data Bank (PDB). Results We compiled a non-redundant dataset of short internal indels (2-40 amino acids) from highly homologous protein pairs and analyzed the sequence and structural features of the indels. We found that about one third of indel residues are in disordered state and majority of the residues are exposed to solvent, suggesting that these indels are generally located on the surface of proteins. Though naturally occurring indels are fewer than engineered ones in the dataset, there are no statistically significant differences in terms of amino acid frequencies and secondary structure types between the "Natural" indels and "All" indels in the dataset. Structural comparisons show that all the protein pairs with short internal indels in the dataset preserve the structural folds and about 85% of protein pairs have global RMSDs (root mean square deviations) of 2Å or less, suggesting that protein structures tend to be conserved and can tolerate short insertions and deletions. A few pairs with high RMSDs are results of relative domain positions of the proteins, probably due to the intrinsically dynamic nature of the proteins. Conclusions The analysis demonstrated that protein structures have the "plasticity" to tolerate short indels. This study can provide valuable guides in modeling protein AS isoform structures and homologous proteins with indels through placing the indels at the right locations since the accuracy of sequence alignments dictate model qualities in homology modeling.
Collapse
Affiliation(s)
- RyangGuk Kim
- Department of Bioinformatics and Genomics, College of Computing and Informatics, University of North Carolina at Charlotte 9201 University City Blvd, Charlotte, NC 28223 USA
| | | |
Collapse
|
22
|
Lapuk A, Marr H, Jakkula L, Pedro H, Bhattacharya S, Purdom E, Hu Z, Simpson K, Pachter L, Durinck S, Wang N, Parvin B, Fontenay G, Speed T, Garbe J, Stampfer M, Bayandorian H, Dorton S, Clark TA, Schweitzer A, Wyrobek A, Feiler H, Spellman P, Conboy J, Gray JW. Exon-level microarray analyses identify alternative splicing programs in breast cancer. Mol Cancer Res 2010; 8:961-74. [PMID: 20605923 DOI: 10.1158/1541-7786.mcr-09-0528] [Citation(s) in RCA: 98] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Protein isoforms produced by alternative splicing (AS) of many genes have been implicated in several aspects of cancer genesis and progression. These observations motivated a genome-wide assessment of AS in breast cancer. We accomplished this by measuring exon level expression in 31 breast cancer and nonmalignant immortalized cell lines representing luminal, basal, and claudin-low breast cancer subtypes using Affymetrix Human Junction Arrays. We analyzed these data using a computational pipeline specifically designed to detect AS with a low false-positive rate. This identified 181 splice events representing 156 genes as candidates for AS. Reverse transcription-PCR validation of a subset of predicted AS events confirmed 90%. Approximately half of the AS events were associated with basal, luminal, or claudin-low breast cancer subtypes. Exons involved in claudin-low subtype-specific AS were significantly associated with the presence of evolutionarily conserved binding motifs for the tissue-specific Fox2 splicing factor. Small interfering RNA knockdown of Fox2 confirmed the involvement of this splicing factor in subtype-specific AS. The subtype-specific AS detected in this study likely reflects the splicing pattern in the breast cancer progenitor cells in which the tumor arose and suggests the utility of assays for Fox-mediated AS in cancer subtype definition and early detection. These data also suggest the possibility of reducing the toxicity of protein-targeted breast cancer treatments by targeting protein isoforms that are not present in limiting normal tissues.
Collapse
Affiliation(s)
- Anna Lapuk
- Life Sciences Division, Lawrence Berkeley National Laboratory, University of California at Berkeley, Berkeley, California 94720, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
23
|
Uversky VN, Dunker AK. Understanding protein non-folding. BIOCHIMICA ET BIOPHYSICA ACTA 2010; 1804:1231-64. [PMID: 20117254 PMCID: PMC2882790 DOI: 10.1016/j.bbapap.2010.01.017] [Citation(s) in RCA: 901] [Impact Index Per Article: 64.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/21/2009] [Revised: 01/09/2010] [Accepted: 01/21/2010] [Indexed: 02/07/2023]
Abstract
This review describes the family of intrinsically disordered proteins, members of which fail to form rigid 3-D structures under physiological conditions, either along their entire lengths or only in localized regions. Instead, these intriguing proteins/regions exist as dynamic ensembles within which atom positions and backbone Ramachandran angles exhibit extreme temporal fluctuations without specific equilibrium values. Many of these intrinsically disordered proteins are known to carry out important biological functions which, in fact, depend on the absence of a specific 3-D structure. The existence of such proteins does not fit the prevailing structure-function paradigm, which states that a unique 3-D structure is a prerequisite to function. Thus, the protein structure-function paradigm has to be expanded to include intrinsically disordered proteins and alternative relationships among protein sequence, structure, and function. This shift in the paradigm represents a major breakthrough for biochemistry, biophysics and molecular biology, as it opens new levels of understanding with regard to the complex life of proteins. This review will try to answer the following questions: how were intrinsically disordered proteins discovered? Why don't these proteins fold? What is so special about intrinsic disorder? What are the functional advantages of disordered proteins/regions? What is the functional repertoire of these proteins? What are the relationships between intrinsically disordered proteins and human diseases?
Collapse
Affiliation(s)
- Vladimir N Uversky
- Institute for Intrinsically Disordered Protein Research, Center for Computational Biology and Bioinformatics, Department of Biochemistry and Molecular Biology, Indiana University School of Medicine, Indianapolis, IN 46202, USA.
| | | |
Collapse
|
24
|
Kanapin AA, Mulder N, Kuznetsov VA. Projection of gene-protein networks to the functional space of the proteome and its application to analysis of organism complexity. BMC Genomics 2010; 11 Suppl 1:S4. [PMID: 20158875 PMCID: PMC2822532 DOI: 10.1186/1471-2164-11-s1-s4] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
We consider the problem of biological complexity via a projection of protein-coding genes of complex organisms onto the functional space of the proteome. The latter can be defined as a set of all functions committed by proteins of an organism. Alternative splicing (AS) allows an organism to generate diverse mature RNA transcripts from a single mRNA strand and thus it could be one of the key mechanisms of increasing of functional complexity of the organism's proteome and a driving force of biological evolution. Thus, the projection of transcription units (TU) and alternative splice-variant (SV) forms onto proteome functional space could generate new types of relational networks (e.g. SV-protein function networks, SFN) and lead to discoveries of novel evolutionarily conservative functional modules. Such types of networks might provide new reliable characteristics of organism complexity and a better understanding of the evolutionary integration and plasticity of interconnection of genome-transcriptome-proteome functions.
Collapse
|
25
|
Abstract
Even though nearly every human gene has at least one alternative splice form, very little is so far known about the structure and function of resulting protein products. It is becoming increasingly clear that a significant fraction of all isoforms are products of noisy selection of splice sites and thus contribute little to actual functional diversity, and may potentially be deleterious. In this study, we examine the impact of alternative splicing on protein sequence and structure in three datasets: alternative splicing events conserved across multiple species, alternative splicing events in genes that are strongly linked to disease and all observed alternative splicing events. We find that the vast majority of all alternative isoforms result in unstable protein conformations. In contrast to that, the small subset of isoforms conserved across species tends to maintain protein structural integrity to a greater extent. Alternative splicing in disease-associated genes produces unstable structures just as frequently as all other genes, indicating that selection to reduce the effects of alternative splicing on this set is not especially pronounced. Overall, the properties of alternative spliced proteins are consistent with the outcome of noisy selection of splice sites by splicing machinery.
Collapse
Affiliation(s)
- Eugene Melamud
- Center for Advanced Research in Biotechnology, University of Maryland Biotechnology Institute, 9600 Gudelsky Drive, Rockville, MD 20850, USA.
| | | |
Collapse
|
26
|
Kurmangaliyev YZ, Gelfand MS. Alternative splicing tends to involve protein phosphorylation sites. Mol Biol 2009. [DOI: 10.1134/s0026893309030224] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
27
|
Comparative analysis indicates that alternative splicing in plants has a limited role in functional expansion of the proteome. BMC Genomics 2009; 10:154. [PMID: 19358722 PMCID: PMC2674458 DOI: 10.1186/1471-2164-10-154] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2008] [Accepted: 04/09/2009] [Indexed: 11/23/2022] Open
Abstract
Background Alternative splicing (AS) is a widespread phenomenon in higher eukaryotes but the extent to which it leads to functional protein isoforms and to proteome expansion at large is still a matter of debate. In contrast to animal species, for which AS has been studied extensively at the protein and functional level, protein-centered studies of AS in plant species are scarce. Here we investigate the functional impact of AS in dicot and monocot plant species using a comparative approach. Results Detailed comparison of AS events in alternative spliced orthologs from the dicot Arabidopsis thaliana and the monocot Oryza sativa (rice) revealed that the vast majority of AS events in both species do not result from functional conservation. Transcript isoforms that are putative targets for the nonsense-mediated decay (NMD) pathway are as likely to contain conserved AS events as isoforms that are translated into proteins. Similar results were obtained when the same comparison was performed between the two more closely related monocot species rice and Zea mays (maize). Genome-wide computational analysis of functional protein domains encoded in alternatively and constitutively spliced genes revealed that only the RNA recognition motif (RRM) is overrepresented in alternatively spliced genes in all species analyzed. In contrast, three domain types were overrepresented in constitutively spliced genes. AS events were found to be less frequent within than outside predicted protein domains and no domain type was found to be enriched with AS introns. Analysis of AS events that result in the removal of complete protein domains revealed that only a small number of domain types is spliced-out in all species analyzed. Finally, in a substantial fraction of cases where a domain is completely removed, this domain appeared to be a unit of a tandem repeat. Conclusion The results from the ortholog comparisons suggest that the ability of a gene to produce more than one functional protein through AS does not persist during evolution. Cross-species comparison of the results of the protein-domain oriented analyses indicates little correspondence between the analyzed species. Based on the premise that functional genetic features are most likely to be conserved during evolution, we conclude that AS has only a limited role in functional expansion of the proteome in plants.
Collapse
|
28
|
Zhang Y. Protein structure prediction: when is it useful? Curr Opin Struct Biol 2009; 19:145-55. [PMID: 19327982 PMCID: PMC2673339 DOI: 10.1016/j.sbi.2009.02.005] [Citation(s) in RCA: 191] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2008] [Revised: 02/18/2009] [Accepted: 02/19/2009] [Indexed: 10/21/2022]
Abstract
Computationally predicted three-dimensional structure of protein molecules has demonstrated the usefulness in many areas of biomedicine, ranging from approximate family assignments to precise drug screening. For nearly 40 years, however, the accuracy of the predicted models has been dictated by the availability of close structural templates. Progress has recently been achieved in refining low-resolution models closer to the native ones; this has been made possible by combining knowledge-based information from multiple sources of structural templates as well as by improving the energy funnel of physics-based force fields. Unfortunately, there has been no essential progress in the development of techniques for detecting remotely homologous templates and for predicting novel protein structures.
Collapse
Affiliation(s)
- Yang Zhang
- Center for Bioinformatics and Department of Molecular Biosciences, University of Kansas, 2030 Becker Drive, Lawrence, KS 66047, USA.
| |
Collapse
|
29
|
Hirschfeld M, zur Hausen A, Bettendorf H, Jäger M, Stickeler E. Alternative splicing of Cyr61 is regulated by hypoxia and significantly changed in breast cancer. Cancer Res 2009; 69:2082-90. [PMID: 19244129 DOI: 10.1158/0008-5472.can-08-1997] [Citation(s) in RCA: 71] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Hypoxia is known to induce the transcriptional activation of pathways involved in angiogenesis, growth factor signaling, and tissue invasion and is therefore a potential key regulator of tumor growth. Cyr61 (cysteine rich 61) is a secreted, matricellular protein with proangiogenic capabilities and is transcriptionally induced under hypoxic conditions. High expression levels of Cyr61 were already detected in various cancer types and linked to tumor progression and advanced stages in breast cancer. Besides hypoxia, there is some evidence that posttranscriptional pre-mRNA processing could be involved in the regulation of Cyr61 expression, but was thus far not investigated. We studied the expression pattern of Cyr61 mRNA and protein in breast cancer cell lines as well as in matched pairs of noncancerous breast tissue, preinvasive lesions, and invasive breast cancers, respectively. In addition, we analyzed the potential regulatory capability of hypoxia on Cyr61 expression by functional tissue culture experiments. Our study revealed a stage-dependent induction of Cyr61 mRNA and protein in breast cancer tumorigenesis and for the first time alternative splicing of the Cyr61 gene due to intron retention. Breast carcinogenesis was accompanied by a shift from an intron 3 retaining toward an intron 3 skipping mRNA phenotype consecutively leading to processing of the biological active Cyr61 protein. The functional analyses strongly emphasize that hypoxia serves as a specific inducer of alternative Cyr61 splicing toward the intron skipping mRNA isoform with potential biological consequences in tumor cells.
Collapse
Affiliation(s)
- Marc Hirschfeld
- Department of Obstetrics and Gynecology and Institute of Pathology, Freiburg University Medical Center, Freiburg, Germany
| | | | | | | | | |
Collapse
|
30
|
Dunker AK, Oldfield CJ, Meng J, Romero P, Yang JY, Chen JW, Vacic V, Obradovic Z, Uversky VN. The unfoldomics decade: an update on intrinsically disordered proteins. BMC Genomics 2008; 9 Suppl 2:S1. [PMID: 18831774 PMCID: PMC2559873 DOI: 10.1186/1471-2164-9-s2-s1] [Citation(s) in RCA: 386] [Impact Index Per Article: 24.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023] Open
Abstract
BACKGROUND Our first predictor of protein disorder was published just over a decade ago in the Proceedings of the IEEE International Conference on Neural Networks (Romero P, Obradovic Z, Kissinger C, Villafranca JE, Dunker AK (1997) Identifying disordered regions in proteins from amino acid sequence. Proceedings of the IEEE International Conference on Neural Networks, 1: 90-95). By now more than twenty other laboratory groups have joined the efforts to improve the prediction of protein disorder. While the various prediction methodologies used for protein intrinsic disorder resemble those methodologies used for secondary structure prediction, the two types of structures are entirely different. For example, the two structural classes have very different dynamic properties, with the irregular secondary structure class being much less mobile than the disorder class. The prediction of secondary structure has been useful. On the other hand, the prediction of intrinsic disorder has been revolutionary, leading to major modifications of the more than 100 year-old views relating protein structure and function. Experimentalists have been providing evidence over many decades that some proteins lack fixed structure or are disordered (or unfolded) under physiological conditions. In addition, experimentalists are also showing that, for many proteins, their functions depend on the unstructured rather than structured state; such results are in marked contrast to the greater than hundred year old views such as the lock and key hypothesis. Despite extensive data on many important examples, including disease-associated proteins, the importance of disorder for protein function has been largely ignored. Indeed, to our knowledge, current biochemistry books don't present even one acknowledged example of a disorder-dependent function, even though some reports of disorder-dependent functions are more than 50 years old. The results from genome-wide predictions of intrinsic disorder and the results from other bioinformatics studies of intrinsic disorder are demanding attention for these proteins. RESULTS Disorder prediction has been important for showing that the relatively few experimentally characterized examples are members of a very large collection of related disordered proteins that are wide-spread over all three domains of life. Many significant biological functions are now known to depend directly on, or are importantly associated with, the unfolded or partially folded state. Here our goal is to review the key discoveries and to weave these discoveries together to support novel approaches for understanding sequence-function relationships. CONCLUSION Intrinsically disordered protein is common across the three domains of life, but especially common among the eukaryotic proteomes. Signaling sequences and sites of posttranslational modifications are frequently, or very likely most often, located within regions of intrinsic disorder. Disorder-to-order transitions are coupled with the adoption of different structures with different partners. Also, the flexibility of intrinsic disorder helps different disordered regions to bind to a common binding site on a common partner. Such capacity for binding diversity plays important roles in both protein-protein interaction networks and likely also in gene regulation networks. Such disorder-based signaling is further modulated in multicellular eukaryotes by alternative splicing, for which such splicing events map to regions of disorder much more often than to regions of structure. Associating alternative splicing with disorder rather than structure alleviates theoretical and experimentally observed problems associated with the folding of different length, isomeric amino acid sequences. The combination of disorder and alternative splicing is proposed to provide a mechanism for easily "trying out" different signaling pathways, thereby providing the mechanism for generating signaling diversity and enabling the evolution of cell differentiation and multicellularity. Finally, several recent small molecules of interest as potential drugs have been shown to act by blocking protein-protein interactions based on intrinsic disorder of one of the partners. Study of these examples has led to a new approach for drug discovery, and bioinformatics analysis of the human proteome suggests that various disease-associated proteins are very rich in such disorder-based drug discovery targets.
Collapse
Affiliation(s)
- A Keith Dunker
- Center for Computational Biology and Bioinformatics, Indiana University Schools of Medicine and Informatics, Indianapolis, IN 46202, USA
| | - Christopher J Oldfield
- Center for Computational Biology and Bioinformatics, Indiana University School of Informatics, Indianapolis, IN 46202, USA
| | - Jingwei Meng
- Department of Biochemistry and Molecular Biology, Indiana University School of Medicine, Indianapolis, IN 46202, USA
| | - Pedro Romero
- Center for Computational Biology and Bioinformatics, Indiana University School of Informatics, Indianapolis, IN 46202, USA
| | - Jack Y Yang
- Department of Biochemistry and Molecular Biology, Indiana University School of Medicine, Indianapolis, IN 46202, USA
| | - Jessica Walton Chen
- Center for Computational Biology and Bioinformatics, Indiana University School of Informatics, Indianapolis, IN 46202, USA
| | - Vladimir Vacic
- Center for Computational Biology and Bioinformatics, Indiana University School of Informatics, Indianapolis, IN 46202, USA
| | - Zoran Obradovic
- Center for Information Science and Technology, Temple University, Philadelphia, PA 19122, USA
| | - Vladimir N Uversky
- Center for Computational Biology and Bioinformatics, Department of Biochemistry and Molecular Biology, Indiana University School of Medicine, Indianapolis, IN 46202, USA
- Institute for Intrinsically Disordered Protein Research, Indiana University School of Medicine, Indianapolis, IN 46202, USA
- Institute for Biological Instrumentation, Russian Academy of Sciences, 142290 Pushchino, Moscow Region, Russia
| |
Collapse
|
31
|
Dunker AK, Oldfield CJ, Meng J, Romero P, Yang JY, Chen JW, Vacic V, Obradovic Z, Uversky VN. The unfoldomics decade: an update on intrinsically disordered proteins. BMC Genomics 2008. [PMID: 18831774 DOI: 10.1186/1471‐2164‐9‐s2‐s1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Our first predictor of protein disorder was published just over a decade ago in the Proceedings of the IEEE International Conference on Neural Networks (Romero P, Obradovic Z, Kissinger C, Villafranca JE, Dunker AK (1997) Identifying disordered regions in proteins from amino acid sequence. Proceedings of the IEEE International Conference on Neural Networks, 1: 90-95). By now more than twenty other laboratory groups have joined the efforts to improve the prediction of protein disorder. While the various prediction methodologies used for protein intrinsic disorder resemble those methodologies used for secondary structure prediction, the two types of structures are entirely different. For example, the two structural classes have very different dynamic properties, with the irregular secondary structure class being much less mobile than the disorder class. The prediction of secondary structure has been useful. On the other hand, the prediction of intrinsic disorder has been revolutionary, leading to major modifications of the more than 100 year-old views relating protein structure and function. Experimentalists have been providing evidence over many decades that some proteins lack fixed structure or are disordered (or unfolded) under physiological conditions. In addition, experimentalists are also showing that, for many proteins, their functions depend on the unstructured rather than structured state; such results are in marked contrast to the greater than hundred year old views such as the lock and key hypothesis. Despite extensive data on many important examples, including disease-associated proteins, the importance of disorder for protein function has been largely ignored. Indeed, to our knowledge, current biochemistry books don't present even one acknowledged example of a disorder-dependent function, even though some reports of disorder-dependent functions are more than 50 years old. The results from genome-wide predictions of intrinsic disorder and the results from other bioinformatics studies of intrinsic disorder are demanding attention for these proteins. RESULTS Disorder prediction has been important for showing that the relatively few experimentally characterized examples are members of a very large collection of related disordered proteins that are wide-spread over all three domains of life. Many significant biological functions are now known to depend directly on, or are importantly associated with, the unfolded or partially folded state. Here our goal is to review the key discoveries and to weave these discoveries together to support novel approaches for understanding sequence-function relationships. CONCLUSION Intrinsically disordered protein is common across the three domains of life, but especially common among the eukaryotic proteomes. Signaling sequences and sites of posttranslational modifications are frequently, or very likely most often, located within regions of intrinsic disorder. Disorder-to-order transitions are coupled with the adoption of different structures with different partners. Also, the flexibility of intrinsic disorder helps different disordered regions to bind to a common binding site on a common partner. Such capacity for binding diversity plays important roles in both protein-protein interaction networks and likely also in gene regulation networks. Such disorder-based signaling is further modulated in multicellular eukaryotes by alternative splicing, for which such splicing events map to regions of disorder much more often than to regions of structure. Associating alternative splicing with disorder rather than structure alleviates theoretical and experimentally observed problems associated with the folding of different length, isomeric amino acid sequences. The combination of disorder and alternative splicing is proposed to provide a mechanism for easily "trying out" different signaling pathways, thereby providing the mechanism for generating signaling diversity and enabling the evolution of cell differentiation and multicellularity. Finally, several recent small molecules of interest as potential drugs have been shown to act by blocking protein-protein interactions based on intrinsic disorder of one of the partners. Study of these examples has led to a new approach for drug discovery, and bioinformatics analysis of the human proteome suggests that various disease-associated proteins are very rich in such disorder-based drug discovery targets.
Collapse
Affiliation(s)
- A Keith Dunker
- Center for Computational Biology and Bioinformatics, Indiana University Schools of Medicine and Informatics, Indianapolis, IN 46202, USA.
| | | | | | | | | | | | | | | | | |
Collapse
|
32
|
Affiliation(s)
- John Moult
- Center for Advanced Research in Biotechnology, University of Maryland Biotechnology Institute, Rockville, MD 20850, USA.
| |
Collapse
|
33
|
Laskowski RA, Thornton JM. Understanding the molecular machinery of genetics through 3D structures. Nat Rev Genet 2008; 9:141-51. [PMID: 18160966 DOI: 10.1038/nrg2273] [Citation(s) in RCA: 55] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Detailed knowledge of the three-dimensional structures of biological molecules has had an enormous impact on all areas of biological science, including genetics, as structure can reveal the fine details of how molecules perform their biological functions. Here we consider how changes in protein sequence affect the corresponding 3D structure, and describe how structural information about proteins, DNA and chromatin has shed light on gene regulatory mechanisms and the storage and transmission of epigenetic information. Finally, we describe how structure determination is benefiting from the high-throughput technologies of the worldwide structural genomics projects.
Collapse
Affiliation(s)
- Roman A Laskowski
- European Bioinformatics Institute, European Molecular Biology Laboratory, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK.
| | | |
Collapse
|
34
|
Abstract
Alternative splicing is thought to be one of the major sources for functional diversity in higher eukaryotes. Interestingly, when mapping splicing events onto protein structures, about half of the events affect structured and even highly conserved regions i.e. are non-trivial on the structure level. This has led to the controversial hypothesis that such splice variants result in nonsense-mediated mRNA decay or non-functional, unstructured proteins, which do not contribute to the functional diversity of an organism. Here we show in a comprehensive study on alternative splicing that proteins appear to be much more tolerant to structural deletions, insertions and replacements than previously thought. We find literature evidence that such non-trivial splicing isoforms exhibit different functional properties compared to their native counterparts and allow for interesting regulatory patterns on the protein network level. We provide examples that splicing events may represent transitions between different folds in the protein sequence–structure space and explain these links by a common genetic mechanism. Taken together, those findings hint to a more prominent role of splicing in protein structure evolution and to a different view of phenotypic plasticity of protein structures.
Collapse
Affiliation(s)
- Fabian Birzele
- Practical Informatics and Bioinformatics Group, Department of Informatics, Ludwig-Maximilians-University, Amalienstrasse 17, D-80333 Munich, Germany.
| | | | | |
Collapse
|
35
|
Ben-Dov C, Hartmann B, Lundgren J, Valcárcel J. Genome-wide analysis of alternative pre-mRNA splicing. J Biol Chem 2007; 283:1229-33. [PMID: 18024428 DOI: 10.1074/jbc.r700033200] [Citation(s) in RCA: 110] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
Alternative splicing of mRNA precursors allows the synthesis of multiple mRNAs from a single primary transcript, significantly expanding the information content and regulatory possibilities of higher eukaryotic genomes. High-throughput enabling technologies, particularly large-scale sequencing and splicing-sensitive microarrays, are providing unprecedented opportunities to address key questions in this field. The picture emerging from these pioneering studies is that alternative splicing affects most human genes and a significant fraction of the genes in other multicellular organisms, with the potential to greatly influence the evolution of complex genomes. A combinatorial code of regulatory signals and factors can deploy physiologically coherent programs of alternative splicing that are distinct from those regulated at other steps of gene expression. Pre-mRNA splicing and its regulation play important roles in human pathologies, and genome-wide analyses in this area are paving the way for improved diagnostic tools and for the identification of novel and more specific pharmaceutical targets.
Collapse
Affiliation(s)
- Claudia Ben-Dov
- Centre de Regulació Genòmica, Dr. Aiguader 88, 08003 Barcelona, Spain
| | | | | | | |
Collapse
|
36
|
Birzele F, Küffner R, Meier F, Oefinger F, Potthast C, Zimmer R. ProSAS: a database for analyzing alternative splicing in the context of protein structures. Nucleic Acids Res 2007; 36:D63-8. [PMID: 17933774 PMCID: PMC2238869 DOI: 10.1093/nar/gkm793] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
Alternative splicing is known to be one of the major sources for functional diversity in higher eukaryotes. Several splicing isoforms have been characterized in the literature that play important roles in cellular processes like apoptosis or signal transduction pathways. Splicing events can often be detected on the mRNA level by large-scale cDNA or EST experiments and such data is collected and annotated in several databases. Nevertheless, the effects of splicing on the structure of a protein are largely unknown. The ProSAS (Protein Structure and Alternative Splicing) database fills this gap and provides a unified resource for analyzing effects of alternative splicing events in the context of protein structures. ProSAS comprehensively annotates and models protein structures for several Ensembl genomes as well as SwissProt entries harbouring splicing events. Alternative isoforms annotated in Ensembl or SwissProt can be analyzed on the protein structure and protein function level using an intuitive user interface that provides several features and tools for a structure-based analysis of alternative splicing events. The ProSAS database is freely accessible at http://www.bio.ifi.lmu.de/ProSAS.
Collapse
Affiliation(s)
- Fabian Birzele
- Practical Informatics and Bioinformatics Group, Department of Informatics, Ludwig-Maximilians-University, Amalienstrasse 17, D-80333 Munich, Germany
| | | | | | | | | | | |
Collapse
|
37
|
Yang SY, He XY, Miller D. HSD17B10: a gene involved in cognitive function through metabolism of isoleucine and neuroactive steroids. Mol Genet Metab 2007; 92:36-42. [PMID: 17618155 DOI: 10.1016/j.ymgme.2007.06.001] [Citation(s) in RCA: 53] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/10/2007] [Revised: 06/06/2007] [Accepted: 06/06/2007] [Indexed: 12/20/2022]
Abstract
The HSD17B10 gene maps on chromosome Xp11.2, a region highly associated with X-linked mental retardation. This gene encodes HSD10, a mitochondrial multifunctional enzyme that plays a significant part in the metabolism of neuroactive steroids and the degradation of isoleucine. The HSD17B10 gene is composed of six exons and five introns. Its exon 5 is an alternative exon such that there are several HSD17B10 mRNA isoforms in brain. A silent mutation (c.605C-->A) and three missense mutations (c.395C-->G; c.419C-->T; c.771A-->G), respectively, cause the X-linked mental retardation, choreoathetosis, and abnormal behavior (MRXS10) and the hydroxyacyl-CoA dehydrogenase II deficiency. The latter condition seems to be a multifactorial disease due to the disturbance of more than one metabolic pathway by the HSD10 deficiency. HSD10 inactivates the positive modulators of GABAA receptors, and plays a role in the maintenance of GABAergic neuronal function. This working model may account for the mental retardation of these patients. The dehydrogenase activity is slightly inhibited by the binding of amyloid-beta peptide to the loop D of HSD10. Elevated levels of HSD10 were observed in hippocampi of Alzheimer disease patients so this multifunctional enzyme may be related to Alzheimer disease pathogenesis; however, the molecular mechanism of its involvement remains to be ascertained.
Collapse
Affiliation(s)
- Song-Yu Yang
- Department of Neurochemistry, New York State Institute for Basic Research in Developmental Disabilities, 1050 Forest Hill Road, Staten Island, NY 10314, USA.
| | | | | |
Collapse
|
38
|
Zhang J, Wang Z, Wei F, Qiu W, Zhang L, Huang Q. Computational modeling and functional analysis of Herpes simplex virus type-1 thymidine kinase and Escherichia coli cytosine deaminase fusion protein. Biochem Biophys Res Commun 2007; 360:46-50. [PMID: 17588533 DOI: 10.1016/j.bbrc.2007.05.220] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2007] [Accepted: 05/29/2007] [Indexed: 11/23/2022]
Abstract
Herpes simplex virus type-1 thymidine kinase (HSV-1TK) and Escherichia coli cytosine deaminase (CD) fusion protein was designed using InsightII software. The structural rationality of the fusion proteins incorporating a series of flexible linker peptide was analyzed, and a suitable linker peptide was chosen for further investigated. The recombinant plasmid containing the coding regions of HSV-1TK and CD cDNA connected by this linker peptide coding sequence was generated and subsequently transfected into the human embryonic kidney 293 cells (HEK293). The Western blotting indicated that the recombinant fusion protein existed as a dimer with a molecular weight of approximately 90 kDa. The toxicity of the prodrug on the recombinant plasmid-transfected human lung cancer cell line NCIH460 was evaluated, which showed that TKglyCD-expressing cells conferred upon cells prodrug sensitivities equivalent to that observed for each enzyme independently. Most noteworthy, cytotoxicity could be enhanced by concurrently treating TKglyCD-expressing cells with prodrugs GCV and 5-FC. The results indicate that we have successfully constructed a HSV-1TKglyCD fusion gene which might have a potential application for cancer gene therapy.
Collapse
Affiliation(s)
- Jufeng Zhang
- Central Experimental Laboratory, the First People's Hospital, Shanghai Jiaotong University, Shanghai 200080, China
| | | | | | | | | | | |
Collapse
|
39
|
The functional modulation of epigenetic regulators by alternative splicing. BMC Genomics 2007; 8:252. [PMID: 17651478 PMCID: PMC1949830 DOI: 10.1186/1471-2164-8-252] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2007] [Accepted: 07/25/2007] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Epigenetic regulators (histone acetyltransferases, methyltransferases, chromatin-remodelling enzymes, etc) play a fundamental role in the control of gene expression by modifying the local state of chromatin. However, due to their recent discovery, little is yet known about their own regulation. This paper addresses this point, focusing on alternative splicing regulation, a mechanism already known to play an important role in other protein families, e.g. transcription factors, membrane receptors, etc. RESULTS To this end, we compiled the data available on the presence/absence of alternative splicing for a set of 160 different epigenetic regulators, taking advantage of the relatively large amount of unexplored data on alternative splicing available in public databases. We found that 49 % (70 % in human) of these genes express more than one transcript. We then studied their alternative splicing patterns, focusing on those changes affecting the enzyme's domain composition. In general, we found that these sequence changes correspond to different mechanisms, either repressing the enzyme's function (e.g. by creating dominant-negative inhibitors of the functional isoform) or creating isoforms with new functions. CONCLUSION We conclude that alternative splicing of epigenetic regulators can be an important tool for the function modulation of these enzymes. Considering that the latter control the transcriptional state of large sets of genes, we propose that epigenetic regulation of gene expression is itself strongly regulated by alternative splicing.
Collapse
|
40
|
Guo JT, Jaromczyk JW, Xu Y. Analysis of chameleon sequences and their implications in biological processes. Proteins 2007; 67:548-58. [PMID: 17299764 DOI: 10.1002/prot.21285] [Citation(s) in RCA: 49] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Abstract
Chameleon sequences have been implicated in amyloid related diseases. Here we report an analysis of two types of chameleon sequences, chameleon-HS (Helix vs. Strand) and chameleon-HE (Helix vs. Sheet), based on known structures in Protein Data Bank. Our survey shows that the longest chameleon-HS is eight residues while the longest chameleon-HE is seven residues. We have done a detailed analysis on the local and global environment that might contribute to the unique conformation of a chameleon sequence. We found that the existence of chameleon sequences does not present a problem for secondary structure prediction programs, including the first generation prediction programs, such as Chou-Fasman algorithm, and the third generation prediction programs that utilize evolution information. We have also investigated the possible implication of chameleon sequences in structural conservation and functional diversity of alternatively spliced protein isoforms.
Collapse
Affiliation(s)
- Jun-Tao Guo
- Computational Systems Biology Laboratory, Department of Biochemistry and Molecular Biology, University of Georgia, Athens, Georgia 30602, USA
| | | | | |
Collapse
|
41
|
Talavera D, Vogel C, Orozco M, Teichmann SA, de la Cruz X. The (in)dependence of alternative splicing and gene duplication. PLoS Comput Biol 2007; 3:e33. [PMID: 17335345 PMCID: PMC1808492 DOI: 10.1371/journal.pcbi.0030033] [Citation(s) in RCA: 58] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2006] [Accepted: 01/04/2007] [Indexed: 11/18/2022] Open
Abstract
Alternative splicing (AS) and gene duplication (GD) both are processes that diversify the protein repertoire. Recent examples have shown that sequence changes introduced by AS may be comparable to those introduced by GD. In addition, the two processes are inversely correlated at the genomic scale: large gene families are depleted in splice variants and vice versa. All together, these data strongly suggest that both phenomena result in interchangeability between their effects. Here, we tested the extent to which this applies with respect to various protein characteristics. The amounts of AS and GD per gene are anticorrelated even when accounting for different gene functions or degrees of sequence divergence. In contrast, the two processes appear to be independent in their influence on variation in mRNA expression. Further, we conducted a detailed comparison of the effect of sequence changes in both alternative splice variants and gene duplicates on protein structure, in particular the size, location, and types of sequence substitutions and insertions/deletions. We find that, in general, alternative splicing affects protein sequence and structure in a more drastic way than gene duplication and subsequent divergence. Our results reveal an interesting paradox between the anticorrelation of AS and GD at the genomic level, and their impact at the protein level, which shows little or no equivalence in terms of effects on protein sequence, structure, and function. We discuss possible explanations that relate to the order of appearance of AS and GD in a gene family, and to the selection pressure imposed by the environment.
Collapse
Affiliation(s)
- David Talavera
- Molecular Modeling and Bioinformatics Unit, Parc Científic de Barcelona, Barcelona, Spain
- Protein Structure and Modelling Node, Instituto Nacional de Bioinfomática, Genoma España, Parc Científic de Barcelona, Barcelona, Spain
| | - Christine Vogel
- Medical Research Council Laboratory of Molecular Biology, Cambridge, United Kingdom
- Institute for Cellular and Molecular Biology, University of Texas Austin, Austin, Texas, United States of America
- * To whom correspondence should be addressed. E-mail: (XdlC); (CV)
| | - Modesto Orozco
- Molecular Modeling and Bioinformatics Unit, Parc Científic de Barcelona, Barcelona, Spain
- Protein Structure and Modelling Node, Instituto Nacional de Bioinfomática, Genoma España, Parc Científic de Barcelona, Barcelona, Spain
- Departament de Bioquímica i Biologia Molecular, Universitat de Barcelona, Barcelona, Spain
- Institut per a la Recerca i Estudis Avançats (IRCA), Barcelona, Spain
| | - Sarah A Teichmann
- Medical Research Council Laboratory of Molecular Biology, Cambridge, United Kingdom
| | - Xavier de la Cruz
- Molecular Modeling and Bioinformatics Unit, Parc Científic de Barcelona, Barcelona, Spain
- Protein Structure and Modelling Node, Instituto Nacional de Bioinfomática, Genoma España, Parc Científic de Barcelona, Barcelona, Spain
- Computational Biology Program, Barcelona Supercomputing Center, Barcelona, Spain
- * To whom correspondence should be addressed. E-mail: (XdlC); (CV)
| |
Collapse
|
42
|
Technologies for the Global Discovery and Analysis of Alternative Splicing. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2007; 623:64-84. [DOI: 10.1007/978-0-387-77374-2_5] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/11/2023]
|
43
|
Xing Y, Lee C. Relating alternative splicing to proteome complexity and genome evolution. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2007; 623:36-49. [PMID: 18380339 DOI: 10.1007/978-0-387-77374-2_3] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
Prior to genomics, studies of alternative splicing primarily focused on the function and mechanism of alternative splicing in individual genes and exons. This has changed dramatically since the late 1990s. High-throughput genomics technologies, such as EST sequencing and microarrays designed to detect changes in splicing, led to genome-wide discoveries and quantification of alternative splicing in a wide range of species from human to Arabidopsis. Consensus estimates of AS frequency in the human genome grew from less than 5% in mid-1990s to as high as 60-74% now. The rapid growth in sequence and microarray data for alternative splicing has made it possible to look into the global impact of alternative splicing on protein function and evolution of genomes. In this chapter, we review recent research on alternative splicing's impact on proteomic complexity and its role in genome evolution.
Collapse
Affiliation(s)
- Yi Xing
- Department of Internal Medicine, Roy J. and Lucille A. Carver College of Medicine, University of Iowa, Iowa City, USA
| | | |
Collapse
|
44
|
Abstract
Recent analyses of sequence and microarray data have suggested that alternative splicing plays a major role in the generation of proteomic and functional diversity in metazoan organisms. Efforts are now being directed at establishing the full repertoire of functionally relevant transcript variants generated by alternative splicing, the specific roles of such variants in normal and disease physiology, and how alternative splicing is coordinated on a global level to achieve cell- and tissue-specific functions. Recent progress in these areas is summarized in this review.
Collapse
Affiliation(s)
- Benjamin J Blencowe
- Banting and Best Department of Medical Research and Department of Molecular and Medical Genetics, Centre for Cellular and Biomolecular Research, Donnelly CCBR Building, University of Toronto, Toronto, ON M5S 3E1, Canada.
| |
Collapse
|