1
|
Boratkó A, Csortos C. TIMAP, the versatile protein phosphatase 1 regulator in endothelial cells. IUBMB Life 2017; 69:918-928. [DOI: 10.1002/iub.1695] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2017] [Accepted: 10/26/2017] [Indexed: 01/28/2023]
Affiliation(s)
- Anita Boratkó
- Department of Medical Chemistry; Faculty of Medicine, University of Debrecen, Egyetem tér 1; Debrecen Hungary
| | - Csilla Csortos
- Department of Medical Chemistry; Faculty of Medicine, University of Debrecen, Egyetem tér 1; Debrecen Hungary
| |
Collapse
|
2
|
Roy B, Haupt LM, Griffiths LR. Review: Alternative Splicing (AS) of Genes As An Approach for Generating Protein Complexity. Curr Genomics 2013; 14:182-94. [PMID: 24179441 PMCID: PMC3664468 DOI: 10.2174/1389202911314030004] [Citation(s) in RCA: 70] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2012] [Revised: 02/08/2013] [Accepted: 02/25/2013] [Indexed: 12/22/2022] Open
Abstract
Prior to the completion of the human genome project, the human genome was thought to have a greater number of genes as it seemed structurally and functionally more complex than other simpler organisms. This along with the belief of “one gene, one protein”, were demonstrated to be incorrect. The inequality in the ratio of gene to protein formation gave rise to the theory of alternative splicing (AS). AS is a mechanism by which one gene gives rise to multiple protein products. Numerous databases and online bioinformatic tools are available for the detection and analysis of AS. Bioinformatics provides an important approach to study mRNA and protein diversity by various tools such as expressed sequence tag (EST) sequences obtained from completely processed mRNA. Microarrays and deep sequencing approaches also aid in the detection of splicing events. Initially it was postulated that AS occurred only in about 5% of all genes but was later found to be more abundant. Using bioinformatic approaches, the level of AS in human genes was found to be fairly high with 35-59% of genes having at least one AS form. Our ability to determine and predict AS is important as disorders in splicing patterns may lead to abnormal splice variants resulting in genetic diseases. In addition, the diversity of proteins produced by AS poses a challenge for successful drug discovery and therefore a greater understanding of AS would be beneficial.
Collapse
Affiliation(s)
- Bishakha Roy
- Genomics Research Centre, Griffith Health Institute, Griffith University Gold Coast, Queensland 4222, Australia
| | | | | |
Collapse
|
3
|
Abstract
The relationship between sequence polymorphisms and human disease has been studied mostly in terms of effects of single nucleotide polymorphisms (SNPs) leading to single amino acid substitutions that change protein structure and function. However, less attention has been paid to more drastic sequence polymorphisms which cause premature termination of a protein’s sequence or large changes, insertions, or deletions in the sequence. We have analyzed a large set (n = 512) of insertions and deletions (indels) and single nucleotide polymorphisms causing premature termination of translation in disease-related genes. Prediction of protein-destabilization effects was performed by graphical presentation of the locations of polymorphisms in the protein structure, using the Genomes TO Protein (GTOP) database, and manual annotation with a set of specific criteria. Protein-destabilization was predicted for 44.4% of the nonsense SNPs, 32.4% of the frameshifting indels, and 9.1% of the non-frameshifting indels. A prediction of nonsense-mediated decay allowed to infer which truncated proteins would actually be translated as defective proteins. These cases included the proteins linked to diseases inherited dominantly, suggesting a relation between these diseases and toxic aggregation. Our approach would be useful in identifying potentially aggregation-inducing polymorphisms that may have pathological effects.
Collapse
|
4
|
Fukuchi S, Hosoda K, Homma K, Gojobori T, Nishikawa K. Binary classification of protein molecules into intrinsically disordered and ordered segments. BMC STRUCTURAL BIOLOGY 2011; 11:29. [PMID: 21693062 PMCID: PMC3199747 DOI: 10.1186/1472-6807-11-29] [Citation(s) in RCA: 60] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/16/2011] [Accepted: 06/22/2011] [Indexed: 11/17/2022]
Abstract
Background Although structural domains in proteins (SDs) are important, half of the regions in the human proteome are currently left with no SD assignments. These unassigned regions consist not only of novel SDs, but also of intrinsically disordered (ID) regions since proteins, especially those in eukaryotes, generally contain a significant fraction of ID regions. As ID regions can be inferred from amino acid sequences, a method that combines SD and ID region assignments can determine the fractions of SDs and ID regions in any proteome. Results In contrast to other available ID prediction programs that merely identify likely ID regions, the DICHOT system we previously developed classifies the entire protein sequence into SDs and ID regions. Application of DICHOT to the human proteome revealed that residue-wise ID regions constitute 35%, SDs with similarity to PDB structures comprise 52%, while SDs with no similarity to PDB structures account for the remaining 13%. The last group consists of novel structural domains, termed cryptic domains, which serve as good targets of structural genomics. The DICHOT method applied to the proteomes of other model organisms indicated that eukaryotes generally have high ID contents, while prokaryotes do not. In human proteins, ID contents differ among subcellular localizations: nuclear proteins had the highest residue-wise ID fraction (47%), while mitochondrial proteins exhibited the lowest (13%). Phosphorylation and O-linked glycosylation sites were found to be located preferentially in ID regions. As O-linked glycans are attached to residues in the extracellular regions of proteins, the modification is likely to protect the ID regions from proteolytic cleavage in the extracellular environment. Alternative splicing events tend to occur more frequently in ID regions. We interpret this as evidence that natural selection is operating at the protein level in alternative splicing. Conclusions We classified entire regions of proteins into the two categories, SDs and ID regions and thereby obtained various kinds of complete genome-wide statistics. The results of the present study are important basic information for understanding protein structural architectures and have been made publicly available at http://spock.genes.nig.ac.jp/~genome/DICHOT.
Collapse
Affiliation(s)
- Satoshi Fukuchi
- Center for Information Biology & DNA Data Bank of Japan, National Institute of Genetics, Yata 1111, Mishima, Shizuoka 411-8540, Japan.
| | | | | | | | | |
Collapse
|
5
|
Nishikawa K. Natively unfolded proteins: An overview. Biophysics (Nagoya-shi) 2009; 5:53-58. [PMID: 27857579 PMCID: PMC5036634 DOI: 10.2142/biophysics.5.53] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2009] [Accepted: 09/02/2009] [Indexed: 12/01/2022] Open
Abstract
Proteins with wholly or partly denatured structures in vivo are called intrinsically disordered or natively unfolded proteins (NUPs). Functional importance of NUPs was revealed by NMR studies as first reviewed by P. Wright in 1999. Since then, computational analyses on NUPs have also been intensively carried out to predict that approximately one third of eukaryotic proteins are NUPs. I will start this overview with the question why it took so long to identify NUPs as an important subject of protein science, and then move on to several issues such as, whether or not NUPs are specific to eukaryotes, what a particularly higher fraction of NUPs existing in the nucleus means, and what evolutionary implications NUPs have.
Collapse
Affiliation(s)
- Ken Nishikawa
- Department of Bioinformatics, Maebashi Institute of Technology, 460-1 Kamisadori, Maebashi, Gunma 371-0816, Japan
| |
Collapse
|
6
|
Alternative splicing of transcription factors' genes: beyond the increase of proteome diversity. Comp Funct Genomics 2009:905894. [PMID: 19609452 PMCID: PMC2709715 DOI: 10.1155/2009/905894] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/25/2008] [Revised: 04/06/2009] [Accepted: 05/18/2009] [Indexed: 11/29/2022] Open
Abstract
Functional modification of transcription regulators may lead to developmental changes and phenotypical differences between species. In this work, we study the influence of alternative splicing on transcription factors in human and mouse. Our results show that the impact of alternative splicing on transcription factors is similar in both species, meaning that the ways to increase variability should also be similar. However, when looking at the expression patterns of transcription factors, we observe that they tend to diverge regardless of the role of alternative splicing. Finally, we hypothesise that transcription regulation of alternatively spliced transcription factors could play an important role in the phenotypical differences between species, without discarding other phenomena or functional families.
Collapse
|
7
|
Abstract
Even though nearly every human gene has at least one alternative splice form, very little is so far known about the structure and function of resulting protein products. It is becoming increasingly clear that a significant fraction of all isoforms are products of noisy selection of splice sites and thus contribute little to actual functional diversity, and may potentially be deleterious. In this study, we examine the impact of alternative splicing on protein sequence and structure in three datasets: alternative splicing events conserved across multiple species, alternative splicing events in genes that are strongly linked to disease and all observed alternative splicing events. We find that the vast majority of all alternative isoforms result in unstable protein conformations. In contrast to that, the small subset of isoforms conserved across species tends to maintain protein structural integrity to a greater extent. Alternative splicing in disease-associated genes produces unstable structures just as frequently as all other genes, indicating that selection to reduce the effects of alternative splicing on this set is not especially pronounced. Overall, the properties of alternative spliced proteins are consistent with the outcome of noisy selection of splice sites by splicing machinery.
Collapse
Affiliation(s)
- Eugene Melamud
- Center for Advanced Research in Biotechnology, University of Maryland Biotechnology Institute, 9600 Gudelsky Drive, Rockville, MD 20850, USA.
| | | |
Collapse
|
8
|
Fukuchi S, Homma K, Sakamoto S, Sugawara H, Tateno Y, Gojobori T, Nishikawa K. The GTOP database in 2009: updated content and novel features to expand and deepen insights into protein structures and functions. Nucleic Acids Res 2008; 37:D333-7. [PMID: 18987007 PMCID: PMC2686575 DOI: 10.1093/nar/gkn855] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
The Genomes TO Protein Structures and Functions (GTOP) database (http://spock.genes.nig.ac.jp/~genome/gtop.html) freely provides an extensive collection of information on protein structures and functions obtained by application of various computational tools to the amino acid sequences of entirely sequenced genomes. GTOP contains annotations of 3D structures, protein families, functions, and other useful data of a protein of interest in user-friendly ways to give a deep insight into the protein structure. From the initial 1999 version, GTOP has been continually updated to reap the fruits of genome projects and augmented to supply novel information, in particular intrinsically disordered regions. As intrinsically disordered regions constitute a considerable fraction of proteins and often play crucial roles especially in eukaryotes, their assignments give important additional clues to the functionality of proteins. Additionally, we have incorporated the following features into GTOP: a platform independent structural viewer, results of HMM searches against SCOP and Pfam, secondary structure predictions, color display of exon boundaries in eukaryotic proteins, assignments of gene ontology terms, search tools, and master files.
Collapse
Affiliation(s)
- Satoshi Fukuchi
- Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Yata 1111, Mishima, Shizuoka 411-8540, Japan.
| | | | | | | | | | | | | |
Collapse
|
9
|
Robin G, Cowieson NP, Guncar G, Forwood JK, Listwan P, Hume DA, Kobe B, Martin JL, Huber T. A general target selection method for crystallographic proteomics. Methods Mol Biol 2008; 426:27-35. [PMID: 18542855 DOI: 10.1007/978-1-60327-058-8_2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]
Abstract
Increasing the success in obtaining structures and maximizing the value of the structures determined are the two major goals of target selection in structural proteomics. This chapter presents an efficient and flexible target selection procedure supplemented with a Web-based resource that is suitable for small- to large-scale structural genomics projects that use crystallography as the major means of structure determination. Based on three criteria, biological significance, structural novelty, and "crystallizability," the approach first removes (filters) targets that do not meet minimal criteria and then ranks the remaining targets based on their "crystallizability" estimates. This novel procedure was designed to maximize selection efficiency, and its prevailing criteria categories make it suitable for a broad range of structural proteomics projects.
Collapse
Affiliation(s)
- Gautier Robin
- Institute for Molecular Bioscience, University of Queensland, Brisbane, Australia
| | | | | | | | | | | | | | | | | |
Collapse
|
10
|
Artamonova II, Gelfand MS. Comparative Genomics and Evolution of Alternative Splicing: The Pessimists' Science. Chem Rev 2007; 107:3407-30. [PMID: 17645315 DOI: 10.1021/cr068304c] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Irena I Artamonova
- Group of Bioinformatics, Vavilov Institute of General Genetics, RAS, Gubkina 3, Moscow 119991, Russia
| | | |
Collapse
|
11
|
Yura K, Shionyu M, Hagino K, Hijikata A, Hirashima Y, Nakahara T, Eguchi T, Shinoda K, Yamaguchi A, Takahashi KI, Itoh T, Imanishi T, Gojobori T, Go M. Alternative splicing in human transcriptome: Functional and structural influence on proteins. Gene 2006; 380:63-71. [PMID: 16872759 DOI: 10.1016/j.gene.2006.05.015] [Citation(s) in RCA: 55] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2006] [Accepted: 05/03/2006] [Indexed: 11/21/2022]
Abstract
Alternative splicing is a molecular mechanism that produces multiple proteins from a single gene, and is thought to produce variety in proteins translated from a limited number of genes. Here we analyzed how alternative splicing produced variety in protein structure and function, by using human full-length cDNAs on the assumption that all of the alternatively spliced mRNAs were translated to proteins. We found that the length of alternatively spliced amino acid sequences, in most cases, fell into a size shorter than that of average protein domain. We evaluated comprehensively the presumptive three-dimensional structures of the alternatively spliced products to assess the impact of alternative splicing on gene function. We found that more than half of the products encoded proteins which were involved in signal transduction, transcription and translation, and more than half of alternatively spliced regions comprised interaction sites between proteins and their binding partners, including substrates, DNA/RNA, and other proteins. Intriguingly, 67% of the alternatively spliced isoforms showed significant alterations to regions of the protein structural core, which likely resulted in large conformational change. Based on those findings, we speculate that there are a large number of cases that alternative splicing modulates protein networks through significant alteration in protein conformation.
Collapse
Affiliation(s)
- Kei Yura
- Quantum Bioinformatics Team, Center for Computational Science and Engineering, Japan Atomic Energy Agency, 8-1 Umemidai, Kizu-cho, Souraku-gun, Kyoto 619-0215, Japan
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
12
|
Minezaki Y, Homma K, Kinjo AR, Nishikawa K. Human transcription factors contain a high fraction of intrinsically disordered regions essential for transcriptional regulation. J Mol Biol 2006; 359:1137-49. [PMID: 16697407 DOI: 10.1016/j.jmb.2006.04.016] [Citation(s) in RCA: 193] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2006] [Revised: 04/04/2006] [Accepted: 04/05/2006] [Indexed: 12/11/2022]
Abstract
Human transcriptional regulation factors, such as activators, repressors, and enhancer-binding factors are quite different from their prokaryotic counterparts in two respects: the average sequence in human is more than twice as long as that in prokaryotes, while the fraction of sequence aligned to domains of known structure is 31% in human transcription factors (TFs), less than half of that in bacterial TFs (72%). Intrinsically disordered (ID) regions were identified by a disorder-prediction program, and were found to be in good agreement with available experimental data. Analysis of 401 human TFs with experimental evidence from the Swiss-Prot database showed that as high as 49% of the entire sequence of human TFs is occupied by ID regions. More than half of the human TFs consist of a small DNA binding domain (DBD) and long ID regions frequently sandwiching unassigned regions. The remaining TFs have structural domains in addition to DBDs and ID regions. Experimental studies, particularly those with NMR, revealed that the transactivation domains in unbound TFs are usually unstructured, but become structured upon binding to their partners. The sequences of human and mouse TF orthologues are 90.5% identical despite a high incidence of ID regions, probably reflecting important functional roles played by ID regions. In general ID regions occupy a high fraction in TFs of eukaryotes, but not in prokaryotes. Implications of this dichotomy are discussed in connection with their functional roles in transcriptional regulation and evolution.
Collapse
Affiliation(s)
- Yoshiaki Minezaki
- Laboratory of Gene-Product Informatics, Center For Information Biology & DNA Data Bank of Japan, National Institute of Genetics, Yata 1111, Mishima, Shizuoka 411-8540, Japan
| | | | | | | |
Collapse
|
13
|
Fukuchi S, Homma K, Minezaki Y, Nishikawa K. Intrinsically Disordered Loops Inserted into the Structural Domains of Human Proteins. J Mol Biol 2006; 355:845-57. [PMID: 16324711 DOI: 10.1016/j.jmb.2005.10.037] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2005] [Revised: 09/29/2005] [Accepted: 10/12/2005] [Indexed: 10/25/2022]
Abstract
Much attention has been paid recently to proteins with partially or fully disordered structures, which are found to exist mostly in eukaryotes and are involved mainly in pivotal cellular processes such as transcriptional regulation, translation and cellular signal transduction. Long disordered sequences are sometimes inserted within the single structural domains of proteins, forming loops from the molecular surface. Such intrinsically disordered loops (IDLs) either are invisible in X-ray crystallography, or hamper protein crystallization itself due to great flexibility. Perhaps because of this, such long disordered sequences have not been characterized adequately. Here, we propose an informational method that stringently identifies IDLs in the structural domains of proteins using the amino acid sequence alone. A genome-wide survey of human proteins conducted with the method identified 50 IDL-containing proteins, several of which have experimentally determined 3D structures. Similar searches in other entirely sequenced organisms revealed that IDLs are prevalent in eukaryotes, while they are much less so in prokaryotes. As there is a statistically significant coincidence between the boundaries of IDLs and those of exons, we suggest that IDLs were produced mainly by exon addition in eukaryotes. IDLs are almost always located at the surface of proteins and are enriched with hydrophilic residues, and IDL-containing proteins tend to be intracellular. Some of the well-characterized proteins with IDLs illustrate that IDLs play pivotal roles in the switching of intracellular signaling or regulatory functions, suggesting that IDL insertion is an effective way to create functionally different domain variants.
Collapse
Affiliation(s)
- Satoshi Fukuchi
- Center For Information Biology & DNA Data Bank of Japan, National Institute of Genetics, Yata 1111, Mishima Shizuoka 411-8540, Japan.
| | | | | | | |
Collapse
|
14
|
Hiller M, Huse K, Platzer M, Backofen R. Non-EST based prediction of exon skipping and intron retention events using Pfam information. Nucleic Acids Res 2005; 33:5611-21. [PMID: 16204458 PMCID: PMC1243800 DOI: 10.1093/nar/gki870] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2005] [Revised: 08/19/2005] [Accepted: 09/09/2005] [Indexed: 11/12/2022] Open
Abstract
Most of the known alternative splice events have been detected by the comparison of expressed sequence tags (ESTs) and cDNAs. However, not all splice events are represented in EST databases since ESTs have several biases. Therefore, non-EST based approaches are needed to extend our view of a transcriptome. Here, we describe a novel method for the ab initio prediction of alternative splice events that is solely based on the annotation of Pfam domains. Furthermore, we applied this approach in a genome-wide manner to all human RefSeq transcripts and predicted a total of 321 exon skipping and intron retention events. We show that this method is very reliable as 78% (250 of 321) of our predictions are confirmed by ESTs or cDNAs. Subsequent analyses of splice events within Pfam domains revealed a significant preference of alternative exon junctions to be located at the protein surface and to avoid secondary structure elements. Thus, splice events within Pfams are probable to alter the structure and function of a domain which makes them highly interesting for detailed biological investigation. As Pfam domains are annotated in many other species, our strategy to predict exon skipping and intron retention events might be important for species with a lower number of ESTs.
Collapse
Affiliation(s)
- Michael Hiller
- Institute of Computer Science, Friedrich-Schiller-University Jena, Chair for Bioinformatics, Ernst-Abbe-Platz 2, 07743 Jena, Germany.
| | | | | | | |
Collapse
|