Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For:	[Subscribe] [Scholar Register]

Number

Cited by Other Article(s)

Improved prediction of residue flexibility by embedding optimized amino acid grouping into RSA-based linear models. Amino Acids 2014;46:2665-80. [DOI: 10.1007/s00726-014-1817-9] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2014] [Accepted: 07/21/2014] [Indexed: 11/26/2022]

van der Lee R, Buljan M, Lang B, Weatheritt RJ, Daughdrill GW, Dunker AK, Fuxreiter M, Gough J, Gsponer J, Jones D, Kim PM, Kriwacki R, Oldfield CJ, Pappu RV, Tompa P, Uversky VN, Wright P, Babu MM. Classification of intrinsically disordered regions and proteins. Chem Rev 2014;114:6589-631. [PMID: 24773235 PMCID: PMC4095912 DOI: 10.1021/cr400525m] [Citation(s) in RCA: 1440] [Impact Index Per Article: 144.0] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2013] [Indexed: 12/11/2022]

Affiliation(s)

Robin van der Lee MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom Centre for Molecular and Biomolecular Informatics, Radboud University Medical Centre, 6500 HB Nijmegen, The Netherlands
Marija Buljan MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
Benjamin Lang MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
Robert J. Weatheritt MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
Gary W. Daughdrill Department of Cell Biology, Microbiology, and Molecular Biology, University of South Florida, 3720 Spectrum Boulevard, Suite 321, Tampa, Florida 33612, United States
A. Keith Dunker Department of Biochemistry and Molecular Biology, Indiana University School of Medicine, Indianapolis, Indiana 46202, United States
Monika Fuxreiter MTA-DE Momentum Laboratory of Protein Dynamics, Department of Biochemistry and Molecular Biology, University of Debrecen, H-4032 Debrecen, Nagyerdei krt 98, Hungary
Julian Gough Department of Computer Science, University of Bristol, The Merchant Venturers Building, Bristol BS8 1UB, United Kingdom
Joerg Gsponer Department of Biochemistry and Molecular Biology, Centre for High-Throughput Biology, University of British Columbia, Vancouver, British Columbia V6T 1Z4, Canada
David T. Jones Bioinformatics Group, Department of Computer Science, University College London, London, WC1E 6BT, United Kingdom
Philip M. Kim Terrence Donnelly Centre for Cellular and Biomolecular Research, Department of Molecular Genetics, and Department of Computer Science, University of Toronto, Toronto, Ontario M5S 3E1, Canada
Richard W. Kriwacki Department of Structural Biology, St. Jude Children’s Research Hospital, Memphis, Tennessee 38105, United States
Christopher J. Oldfield Department of Biochemistry and Molecular Biology, Indiana University School of Medicine, Indianapolis, Indiana 46202, United States
Rohit V. Pappu Department of Biomedical Engineering and Center for Biological Systems Engineering, Washington University in St. Louis, St. Louis, Missouri 63130, United States
Peter Tompa VIB Department of Structural Biology, Vrije Universiteit Brussel, Brussels, Belgium Institute of Enzymology, Research Centre for Natural Sciences, Hungarian Academy of Sciences, Budapest, Hungary
Vladimir N. Uversky Department of Molecular Medicine and USF Health Byrd Alzheimer’s Research Institute, Morsani College of Medicine, University of South Florida, Tampa, Florida 33612, United States Institute for Biological Instrumentation, Russian Academy of Sciences, Pushchino, Moscow Region, Russia
Peter E. Wright Department of Integrative Structural and Computational Biology and Skaggs Institute of Chemical Biology, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, California 92037, United States
M. Madan Babu MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom

Collapse

Macossay-Castillo M, Kosol S, Tompa P, Pancsa R. Synonymous constraint elements show a tendency to encode intrinsically disordered protein segments. PLoS Comput Biol 2014;10:e1003607. [PMID: 24809503 PMCID: PMC4014394 DOI: 10.1371/journal.pcbi.1003607] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2014] [Accepted: 03/17/2014] [Indexed: 01/22/2023] Open

Pavlović MD, Jandrlić DR, Mitić NS. Epitope distribution in ordered and disordered protein regions. Part B — Ordered regions and disordered binding sites are targets of T- and B-cell immunity. J Immunol Methods 2014;407:90-107. [DOI: 10.1016/j.jim.2014.03.027] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2013] [Revised: 03/31/2014] [Accepted: 03/31/2014] [Indexed: 01/04/2023]

Mitić NS, Pavlović MD, Jandrlić DR. Epitope distribution in ordered and disordered protein regions - part A. T-cell epitope frequency, affinity and hydropathy. J Immunol Methods 2014;406:83-103. [PMID: 24614036 DOI: 10.1016/j.jim.2014.02.012] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2013] [Revised: 02/27/2014] [Accepted: 02/27/2014] [Indexed: 02/08/2023]

Abstract

Highly disordered protein regions are prevalently hydrophilic, extremely sensitive to proteolysis in vitro, and are expected to be under-represented as T-cell epitopes. The aim of this research was to find out whether disorder and hydropathy prediction methods could help in understanding epitope processing and presentation. According to the pan-specific T-cell epitope predictors NetMHCpan and NetMHCIIpan and 9 publicly available disorder predictors, frequency of epitopes presented by human leukocyte antigens (HLA) class-I or -II was found to be more than 2.5 times higher in ordered than in disordered protein regions (depending on the disorder predictor). Both HLA class-I and HLA class-II binding epitopes are prevalently hydrophilic in disordered and prevalently hydrophobic in ordered protein regions, whereas epitopes recognized by HLA class-II alleles are more hydrophobic than those recognized by HLA class-I. As regards both classes of HLA molecules, high-affinity binding epitopes display more hydrophobicity than low affinity-binding epitopes (in both ordered and disordered regions). Epitopes belonging to disordered protein regions were not predicted to have poor affinity to HLA class-II molecules, as expected from disorder intrinsic proteolytic instability. The relation of epitope hydrophobicity and order/disorder location was also valid if alleles were grouped according to the HLA class-I and HLA class-II supertypes, except for the class-I supertype A3 in which the main part of recognized epitopes was prevalently hydrophilic. Regarding specific supertypes, the affinity of epitopes belonging to ordered regions varies only slightly (depending on the disorder predictor) compared to the affinity of epitopes in corresponding disordered regions. The distribution of epitopes in ordered and disordered protein regions has revealed that the curves of order-epitope distribution were convex-like while the curves of disorder-epitope distribution were concave-like. The percentage of prevalently hydrophobic epitopes increases with the enhancement of epitope promiscuity level and moving from disordered to ordered regions. These data suggests that reverse vaccinology, oriented towards promiscuous and high-affinity epitopes, is also oriented towards prevalently hydrophobic, ordered regions. The analysis of predicted and experimentally evaluated epitopes of cancer-testis antigen MAGE-A3 has confirmed that the majority of T-cell epitopes, particularly those that are promiscuous or naturally processed, was located in ordered and disorder/order boundary protein regions overlapping hydrophobic regions.

Collapse

Mannige RV. Dynamic New World: Refining Our View of Protein Structure, Function and Evolution. Proteomes 2014;2:128-153. [PMID: 28250374 PMCID: PMC5302727 DOI: 10.3390/proteomes2010128] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2013] [Revised: 02/12/2014] [Accepted: 02/20/2014] [Indexed: 01/06/2023] Open

Das S, Pal U, Das S, Bagga K, Roy A, Mrigwani A, Maiti NC. Sequence complexity of amyloidogenic regions in intrinsically disordered human proteins. PLoS One 2014;9:e89781. [PMID: 24594841 PMCID: PMC3940659 DOI: 10.1371/journal.pone.0089781] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2013] [Accepted: 01/26/2014] [Indexed: 01/03/2023] Open

Monastyrskyy B, Kryshtafovych A, Moult J, Tramontano A, Fidelis K. Assessment of protein disorder region predictions in CASP10. Proteins 2013;82 Suppl 2:127-37. [PMID: 23946100 DOI: 10.1002/prot.24391] [Citation(s) in RCA: 124] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2013] [Revised: 06/14/2013] [Accepted: 06/18/2013] [Indexed: 12/12/2022]

Kale A, Hire RS, Hadapad AB, D'Souza SF, Kumar V. Interaction between mosquito-larvicidal Lysinibacillus sphaericus binary toxin components: analysis of complex formation. INSECT BIOCHEMISTRY AND MOLECULAR BIOLOGY 2013;43:1045-1054. [PMID: 23974012 DOI: 10.1016/j.ibmb.2013.07.011] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/06/2013] [Revised: 07/19/2013] [Accepted: 07/29/2013] [Indexed: 06/02/2023]

Peng Z, Mizianty MJ, Kurgan L. Genome-scale prediction of proteins with long intrinsically disordered regions. Proteins 2013;82:145-58. [PMID: 23798504 DOI: 10.1002/prot.24348] [Citation(s) in RCA: 86] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2013] [Accepted: 06/06/2013] [Indexed: 12/24/2022]

Abstract

Proteins with long disordered regions (LDRs), defined as having 30 or more consecutive disordered residues, are abundant in eukaryotes, and these regions are recognized as a distinct class of biologically functional domains. LDRs facilitate various cellular functions and are important for target selection in structural genomics. Motivated by the lack of methods that directly predict proteins with LDRs, we designed Super-fast predictor of proteins with Long Intrinsically DisordERed regions (SLIDER). SLIDER utilizes logistic regression that takes an empirically chosen set of numerical features, which consider selected physicochemical properties of amino acids, sequence complexity, and amino acid composition, as its inputs. Empirical tests show that SLIDER offers competitive predictive performance combined with low computational cost. It outperforms, by at least a modest margin, a comprehensive set of modern disorder predictors (that can indirectly predict LDRs) and is 16 times faster compared to the best currently available disorder predictor. Utilizing our time-efficient predictor, we characterized abundance and functional roles of proteins with LDRs over 110 eukaryotic proteomes. Similar to related studies, we found that eukaryotes have many (on average 30.3%) proteins with LDRs with majority of proteomes having between 25 and 40%, where higher abundance is characteristic to proteomes that have larger proteins. Our first-of-its-kind large-scale functional analysis shows that these proteins are enriched in a number of cellular functions and processes including certain binding events, regulation of catalytic activities, cellular component organization, biogenesis, biological regulation, and some metabolic and developmental processes. A webserver that implements SLIDER is available at http://biomine.ece.ualberta.ca/SLIDER/.

Collapse

Light S, Sagit R, Sachenkova O, Ekman D, Elofsson A. Protein Expansion Is Primarily due to Indels in Intrinsically Disordered Regions. Mol Biol Evol 2013;30:2645-53. [DOI: 10.1093/molbev/mst157] [Citation(s) in RCA: 65] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open

Kahali B, Ghosh TC. Disorderness inEscherichia coliproteome: perception of folding fidelity and protein–protein interactions. J Biomol Struct Dyn 2013;31:472-6. [DOI: 10.1080/07391102.2012.706071] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]

Fan X, Kurgan L. Accurate prediction of disorder in protein chains with a comprehensive and empirically designed consensus. J Biomol Struct Dyn 2013;32:448-64. [DOI: 10.1080/07391102.2013.775969] [Citation(s) in RCA: 113] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]

The N-terminal intrinsically disordered domain of Mgm101p is localized to the mitochondrial nucleoid. PLoS One 2013;8:e56465. [PMID: 23418572 PMCID: PMC3572067 DOI: 10.1371/journal.pone.0056465] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2012] [Accepted: 01/14/2013] [Indexed: 01/22/2023] Open

Long indels are disordered: a study of disorder and indels in homologous eukaryotic proteins. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2013;1834:890-7. [PMID: 23333420 DOI: 10.1016/j.bbapap.2013.01.002] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/01/2012] [Revised: 12/30/2012] [Accepted: 01/03/2013] [Indexed: 11/21/2022]

Orosz F. A new protein superfamily: TPPP-like proteins. PLoS One 2012;7:e49276. [PMID: 23166627 PMCID: PMC3498115 DOI: 10.1371/journal.pone.0049276] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2012] [Accepted: 10/08/2012] [Indexed: 12/02/2022] Open

Abstract

The introduction of the term ‘Tubulin Polymerization Promoting Protein (TPPP)-like proteins’ is suggested. They constitute a eukaryotic protein superfamily, characterized by the presence of the p25alpha domain (Pfam05517, IPR008907), and named after the first identified member, TPPP/p25, exhibiting microtubule stabilizing function. TPPP-like proteins can be grouped on the basis of two characteristics: the length of their p25alpha domain, which can be long, short, truncated or partial, and the presence or absence of additional domain(s). TPPPs, in the strict sense, contain no other domains but one long or short p25alpha one (long- and short-type TPPPs, respectively). Proteins possessing truncated p25alpha domain are first described in this paper. They evolved from the long-type TPPPs and can be considered as arthropod-specific paralogs of long-type TPPPs. Phylogenetic analysis shows that the two groups (long-type and truncated TPPPs) split in the common ancestor of arthropods. Incomplete p25alpha domains can be found in multidomain TPPP-like proteins as well. The various subfamilies occur with a characteristic phyletic distribution: e. g., animal genomes/proteomes contain almost without exception long-type TPPPs; the multidomain apicortins occur almost exclusively in apicomplexan parasites. There are no data about the physiological function of these proteins except two human long-type TPPP paralogs which are involved in developmental processes of the brain and the musculoskeletal system, respectively. I predict that the superfamily members containing long or partial p25alpha domain are often intrinsically disordered proteins, while those with short or truncated domain(s) are structurally ordered. Interestingly, members of this superfamily connected or maybe connected to diseases are intrinsically disordered proteins.

Collapse

Bardwell JCA, Jakob U. Conditional disorder in chaperone action. Trends Biochem Sci 2012;37:517-25. [PMID: 23018052 DOI: 10.1016/j.tibs.2012.08.006] [Citation(s) in RCA: 110] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2012] [Revised: 08/17/2012] [Accepted: 08/29/2012] [Indexed: 11/18/2022]

Yruela I, Contreras-Moreira B. Protein disorder in plants: a view from the chloroplast. BMC PLANT BIOLOGY 2012;12:165. [PMID: 22970728 PMCID: PMC3460767 DOI: 10.1186/1471-2229-12-165] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/02/2012] [Accepted: 09/10/2012] [Indexed: 05/08/2023]

Lobanov MY, Sokolovskiy IV, Galzitskaya OV. IsUnstruct: prediction of the residue status to be ordered or disordered in the protein chain by a method based on the Ising model. J Biomol Struct Dyn 2012;31:1034-43. [PMID: 22963167 DOI: 10.1080/07391102.2012.718529] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]

Seeger MA, Zhang Y, Rice SE. Kinesin tail domains are intrinsically disordered. Proteins 2012;80:2437-46. [PMID: 22674872 DOI: 10.1002/prot.24128] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2012] [Revised: 05/22/2012] [Accepted: 05/25/2012] [Indexed: 12/11/2022]

Kovačević JJ. Computational analysis of position-dependent disorder content in DisProt database. GENOMICS PROTEOMICS & BIOINFORMATICS 2012;10:158-65. [PMID: 22917189 PMCID: PMC5056116 DOI: 10.1016/j.gpb.2012.01.002] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/20/2011] [Revised: 01/27/2012] [Accepted: 01/31/2012] [Indexed: 11/27/2022]

Rawat N, Biswas P. Hydrophobic moments, shape, and packing in disordered proteins. J Phys Chem B 2012;116:6326-35. [PMID: 22582807 DOI: 10.1021/jp3016529] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]

Karlin D, Belshaw R. Detecting remote sequence homology in disordered proteins: discovery of conserved motifs in the N-termini of Mononegavirales phosphoproteins. PLoS One 2012;7:e31719. [PMID: 22403617 PMCID: PMC3293882 DOI: 10.1371/journal.pone.0031719] [Citation(s) in RCA: 50] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2011] [Accepted: 01/18/2012] [Indexed: 11/19/2022] Open

Abstract

Paramyxovirinae are a large group of viruses that includes measles virus and parainfluenza viruses. The viral Phosphoprotein (P) plays a central role in viral replication. It is composed of a highly variable, disordered N-terminus and a conserved C-terminus. A second viral protein alternatively expressed, the V protein, also contains the N-terminus of P, fused to a zinc finger. We suspected that, despite their high variability, the N-termini of P/V might all be homologous; however, using standard approaches, we could previously identify sequence conservation only in some Paramyxovirinae. We now compared the N-termini using sensitive sequence similarity search programs, able to detect residual similarities unnoticeable by conventional approaches. We discovered that all Paramyxovirinae share a short sequence motif in their first 40 amino acids, which we called soyuz1. Despite its short length (11-16aa), several arguments allow us to conclude that soyuz1 probably evolved by homologous descent, unlike linear motifs. Conservation across such evolutionary distances suggests that soyuz1 plays a crucial role and experimental data suggest that it binds the viral nucleoprotein to prevent its illegitimate self-assembly. In some Paramyxovirinae, the N-terminus of P/V contains a second motif, soyuz2, which might play a role in blocking interferon signaling. Finally, we discovered that the P of related Mononegavirales contain similarly overlooked motifs in their N-termini, and that their C-termini share a previously unnoticed structural similarity suggesting a common origin. Our results suggest several testable hypotheses regarding the replication of Mononegavirales and suggest that disordered regions with little overall sequence similarity, common in viral and eukaryotic proteins, might contain currently overlooked motifs (intermediate in length between linear motifs and disordered domains) that could be detected simply by comparing orthologous proteins.

Collapse

Lobanov MY, Galzitskaya OV. Occurrence of disordered patterns and homorepeats in eukaryotic and bacterial proteomes. ACTA ACUST UNITED AC 2012;8:327-37. [DOI: 10.1039/c1mb05318c] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]

Peng Z, Mizianty MJ, Xue B, Kurgan L, Uversky VN. More than just tails: intrinsic disorder in histone proteins. MOLECULAR BIOSYSTEMS 2012;8:1886-901. [DOI: 10.1039/c2mb25102g] [Citation(s) in RCA: 85] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]

Ghalwash MF, Dunker AK, Obradović Z. Uncertainty analysis in protein disorder prediction. MOLECULAR BIOSYSTEMS 2011;8:381-91. [PMID: 22101336 DOI: 10.1039/c1mb05373f] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]

Abstract

UNLABELLED

A grand challenge in the proteomics and structural genomics era is the prediction of protein structure, including identification of those proteins that are partially or wholly unstructured. A number of predictors for identification of intrinsically disordered proteins (IDPs) have been developed over the last decade, but none can be taken as a fully reliable on its own. Using a single model for prediction is typically inadequate because prediction based on only the most accurate model ignores model uncertainty. In this paper, we present an empirical method to specify and measure uncertainty associated with disorder predictions. In particular, we analyze the uncertainty in the reference model itself and the uncertainty in data. This is achieved by training a set of models and developing several meta predictors on top of them. The best meta predictor achieved comparable or better results than any other single model, suggesting that incorporating different aspects of protein disorder prediction is important for the disorder prediction task. In addition, the best meta-predictor had more balanced sensitivity and specificity than any individual model. We also assessed the effects of changes in disorder prediction as a function of changes in the protein sequence. For collections of homologous sequences, we found that mutations caused many of the predicted disordered residues to be flipped to be predicted as ordered residues, while the reverse was observed much less frequently. These results suggest that disorder tendencies are more sensitive to allowed mutations than structure tendencies and the conservation of disorder is indeed less stable than conservation of structure.

AVAILABILITY

five meta-predictors and four single models developed for this study will be publicly freely accessible for non-commercial use.

Collapse

Lobanov MY, Galzitskaya OV. Disordered patterns in clustered Protein Data Bank and in eukaryotic and bacterial proteomes. PLoS One 2011;6:e27142. [PMID: 22073276 PMCID: PMC3208572 DOI: 10.1371/journal.pone.0027142] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2011] [Accepted: 10/11/2011] [Indexed: 11/18/2022] Open

Monastyrskyy B, Fidelis K, Moult J, Tramontano A, Kryshtafovych A. Evaluation of disorder predictions in CASP9. Proteins 2011;79 Suppl 10:107-18. [PMID: 21928402 PMCID: PMC3212657 DOI: 10.1002/prot.23161] [Citation(s) in RCA: 105] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2011] [Revised: 07/11/2011] [Accepted: 07/15/2011] [Indexed: 11/10/2022]

Mészáros B, Tóth J, Vértessy BG, Dosztányi Z, Simon I. Proteins with complex architecture as potential targets for drug design: a case study of Mycobacterium tuberculosis. PLoS Comput Biol 2011;7:e1002118. [PMID: 21814507 PMCID: PMC3140968 DOI: 10.1371/journal.pcbi.1002118] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2011] [Accepted: 05/24/2011] [Indexed: 02/04/2023] Open

Orosz F. Apicomplexan apicortins possess a long disordered N-terminal extension. INFECTION GENETICS AND EVOLUTION 2011;11:1037-44. [DOI: 10.1016/j.meegid.2011.03.023] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/03/2010] [Revised: 03/24/2011] [Accepted: 03/25/2011] [Indexed: 01/01/2023]

Song J, Tan H, Boyd SE, Shen H, Mahmood K, Webb GI, Akutsu T, Whisstock JC, Pike RN. Bioinformatic approaches for predicting substrates of proteases. J Bioinform Comput Biol 2011;9:149-78. [PMID: 21328711 DOI: 10.1142/s0219720011005288] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2010] [Revised: 10/08/2010] [Accepted: 10/09/2010] [Indexed: 11/18/2022]

Mészáros B, Simon I, Dosztányi Z. The expanding view of protein-protein interactions: complexes involving intrinsically disordered proteins. Phys Biol 2011;8:035003. [PMID: 21572179 DOI: 10.1088/1478-3975/8/3/035003] [Citation(s) in RCA: 49] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]

Lobanov MY, Galzitskaya OV. The Ising model for prediction of disordered residues from protein sequence alone. Phys Biol 2011;8:035004. [PMID: 21572175 DOI: 10.1088/1478-3975/8/3/035004] [Citation(s) in RCA: 57] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]

Orosz F, Ovádi J. Proteins without 3D structure: definition, detection and beyond. ACTA ACUST UNITED AC 2011;27:1449-54. [PMID: 21493654 DOI: 10.1093/bioinformatics/btr175] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]

Perkins JR, Diboun I, Dessailly BH, Lees JG, Orengo C. Transient protein-protein interactions: structural, functional, and network properties. Structure 2011;18:1233-43. [PMID: 20947012 DOI: 10.1016/j.str.2010.08.007] [Citation(s) in RCA: 370] [Impact Index Per Article: 28.5] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2010] [Revised: 07/13/2010] [Accepted: 08/02/2010] [Indexed: 11/28/2022]

Pavlović-Lažetić GM, Mitić NS, Kovačević JJ, Obradović Z, Malkov SN, Beljanski MV. Bioinformatics analysis of disordered proteins in prokaryotes. BMC Bioinformatics 2011;12:66. [PMID: 21366926 PMCID: PMC3062596 DOI: 10.1186/1471-2105-12-66] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2010] [Accepted: 03/02/2011] [Indexed: 01/06/2023] Open

Abstract

Background

A significant number of proteins have been shown to be intrinsically disordered, meaning that they lack a fixed 3 D structure or contain regions that do not posses a well defined 3 D structure. It has also been proven that a protein's disorder content is related to its function. We have performed an exhaustive analysis and comparison of the disorder content of proteins from prokaryotic organisms (i.e., superkingdoms Archaea and Bacteria) with respect to functional categories they belong to, i.e., Clusters of Orthologous Groups of proteins (COGs) and groups of COGs-Cellular processes (Cp), Information storage and processing (Isp), Metabolism (Me) and Poorly characterized (Pc).

We also analyzed the disorder content of proteins with respect to various genomic, metabolic and ecological characteristics of the organism they belong to. We used correlations and association rule mining in order to identify the most confident associations between specific modalities of the characteristics considered and disorder content.

Results

Bacteria are shown to have a somewhat higher level of protein disorder than archaea, except for proteins in the Me functional group. It is demonstrated that the Isp and Cp functional groups in particular (L-repair function and N-cell motility and secretion COGs of proteins in specific) possess the highest disorder content, while Me proteins, in general, posses the lowest. Disorder fractions have been confirmed to have the lowest level for the so-called order-promoting amino acids and the highest level for the so-called disorder promoters.

For each pair of organism characteristics, specific modalities are identified with the maximum disorder proteins in the corresponding organisms, e.g., high genome size-high GC content organisms, facultative anaerobic-low GC content organisms, aerobic-high genome size organisms, etc. Maximum disorder in archaea is observed for high GC content-low genome size organisms, high GC content-facultative anaerobic or aquatic or mesophilic organisms, etc. Maximum disorder in bacteria is observed for high GC content-high genome size organisms, high genome size-aerobic organisms, etc.

Some of the most reliable association rules mined establish relationships between high GC content and high protein disorder, medium GC content and both medium and low protein disorder, anaerobic organisms and medium protein disorder, Gammaproteobacteria and low protein disorder, etc. A web site Prokaryote Disorder Database has been designed and implemented at the address http://bioinfo.matf.bg.ac.rs/disorder, which contains complete results of the analysis of protein disorder performed for 296 prokaryotic completely sequenced genomes.

Conclusions

Exhaustive disorder analysis has been performed by functional classes of proteins, for a larger dataset of prokaryotic organisms than previously done. Results obtained are well correlated to those previously published, with some extension in the range of disorder level and clear distinction between functional classes of proteins. Wide correlation and association analysis between protein disorder and genomic and ecological characteristics has been performed for the first time. The results obtained give insight into multi-relationships among the characteristics and protein disorder. Such analysis provides for better understanding of the evolutionary process and may be useful for taxon determination. The main drawback of the approach is the fact that the disorder considered has been predicted and not experimentally established.

Collapse

Tamburstuen MV, Reseland JE, Spahr A, Brookes SJ, Kvalheim G, Slaby I, Snead ML, Lyngstadaas SP. Ameloblastin expression and putative autoregulation in mesenchymal cells suggest a role in early bone formation and repair. Bone 2011;48:406-13. [PMID: 20854943 PMCID: PMC4469498 DOI: 10.1016/j.bone.2010.09.007] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/14/2009] [Revised: 08/24/2010] [Accepted: 09/07/2010] [Indexed: 10/19/2022]

Lobanov MY, Furletova EI, Bogatyreva NS, Roytberg MA, Galzitskaya OV. Library of disordered patterns in 3D protein structures. PLoS Comput Biol 2010;6:e1000958. [PMID: 20976197 PMCID: PMC2954861 DOI: 10.1371/journal.pcbi.1000958] [Citation(s) in RCA: 54] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2010] [Accepted: 09/16/2010] [Indexed: 01/11/2023] Open