26
|
Wodak SJ, Malevanets A, MacKinnon SS. The Landscape of Intertwined Associations in Homooligomeric Proteins. Biophys J 2015; 109:1087-100. [PMID: 26340815 DOI: 10.1016/j.bpj.2015.08.010] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2015] [Revised: 06/06/2015] [Accepted: 08/03/2015] [Indexed: 01/22/2023] Open
Abstract
We present an overview of the full repertoire of intertwined associations in homooligomeric proteins. This overview summarizes recent findings on the different categories of intertwined associations in known protein structures, their assembly modes, the properties of their interfaces, and their structural plasticity. Furthermore, the current body of knowledge on the so-called three-dimensional domain-swapped systems is reexamined in the context of the wider landscape of intertwined homooligomers, with a particular focus on the mechanistic aspects that underpin intertwined self-association processes in proteins. Insights gained from this integrated overview into the physical and biological roles of intertwining are highlighted.
Collapse
|
27
|
Janin J, Wodak SJ, Lensink MF, Velankar S. Assessing Structural Predictions of Protein-Protein Recognition: The CAPRI Experiment. REVIEWS IN COMPUTATIONAL CHEMISTRY 2015. [DOI: 10.1002/9781118889886.ch4] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
|
28
|
MacKinnon SS, Wodak SJ. Landscape of intertwined associations in multi-domain homo-oligomeric proteins. J Mol Biol 2014; 427:350-70. [PMID: 25451036 DOI: 10.1016/j.jmb.2014.11.003] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2014] [Revised: 10/31/2014] [Accepted: 11/03/2014] [Indexed: 10/24/2022]
Abstract
This study charts the landscape of multi-domain protein structures that form intertwined homodimers by exchanging structural domains between subunits. A representative dataset of such homodimers was derived from the Protein Data Bank, and their structural and topological properties were compared to those of a representative set of non-intertwined homodimers. Most of the intertwined dimers form closed assemblies with head-to-tail arrangements, where the subunit interface involves contacts between dissimilar domains. In contrast, the non-intertwined dimers form preferentially head-to-head arrangements, where the subunit interface involves contacts between identical domains. Most of these contacts engage only one structural domain from each subunit, leaving the remaining domains free to form other associations. Remarkably, we find that multi-domain proteins closely related to the intertwined homodimers are significantly more likely than relatives of the non-intertwined versions to adopt alternative intramolecular domain arrangements. In ~40% of the intertwined dimers, the plasticity in domain arrangements among relatives affords maintenance of the head-to-head or head-to-tail topology and conservation of the corresponding subunit interface. This property seems to be exploited in several systems to regulate DNA binding. In ~58%, however, intramolecular domain re-arrangements are associated with changes in oligomeric states and poorly conserved interfaces among relatives. This time, the corresponding structural plasticity appears to be exploited by evolution to modulate function by switching between active and inactive states of the protein. Surprisingly, in total, only three systems were found to undergo the classical monomer to intertwined dimer conversion associated with three-dimensional domain swapping.
Collapse
|
29
|
Lensink MF, Wodak SJ. Score_set: A CAPRI benchmark for scoring protein complexes. Proteins 2014; 82:3163-9. [DOI: 10.1002/prot.24678] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2014] [Revised: 08/05/2014] [Accepted: 08/22/2014] [Indexed: 12/26/2022]
|
30
|
Qiao W, Wang W, Laurenti E, Turinsky AL, Wodak SJ, Bader GD, Dick JE, Zandstra PW. Intercellular network structure and regulatory motifs in the human hematopoietic system. Mol Syst Biol 2014; 10:741. [PMID: 25028490 PMCID: PMC4299490 DOI: 10.15252/msb.20145141] [Citation(s) in RCA: 45] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
The hematopoietic system is a distributed tissue that consists of functionally distinct cell types continuously produced through hematopoietic stem cell (HSC) differentiation. Combining genomic and phenotypic data with high-content experiments, we have built a directional cell-cell communication network between 12 cell types isolated from human umbilical cord blood. Network structure analysis revealed that ligand production is cell type dependent, whereas ligand binding is promiscuous. Consequently, additional control strategies such as cell frequency modulation and compartmentalization were needed to achieve specificity in HSC fate regulation. Incorporating the in vitro effects (quiescence, self-renewal, proliferation, or differentiation) of 27 HSC binding ligands into the topology of the cell-cell communication network allowed coding of cell type-dependent feedback regulation of HSC fate. Pathway enrichment analysis identified intracellular regulatory motifs enriched in these cell type- and ligand-coupled responses. This study uncovers cellular mechanisms of hematopoietic cell feedback in HSC fate regulation, provides insight into the design principles of the human hematopoietic system, and serves as a foundation for the analysis of intercellular regulation in multicellular systems.
Collapse
|
31
|
Lensink MF, Moal IH, Bates PA, Kastritis PL, Melquiond ASJ, Karaca E, Schmitz C, van Dijk M, Bonvin AMJJ, Eisenstein M, Jiménez-García B, Grosdidier S, Solernou A, Pérez-Cano L, Pallara C, Fernández-Recio J, Xu J, Muthu P, Praneeth Kilambi K, Gray JJ, Grudinin S, Derevyanko G, Mitchell JC, Wieting J, Kanamori E, Tsuchiya Y, Murakami Y, Sarmiento J, Standley DM, Shirota M, Kinoshita K, Nakamura H, Chavent M, Ritchie DW, Park H, Ko J, Lee H, Seok C, Shen Y, Kozakov D, Vajda S, Kundrotas PJ, Vakser IA, Pierce BG, Hwang H, Vreven T, Weng Z, Buch I, Farkash E, Wolfson HJ, Zacharias M, Qin S, Zhou HX, Huang SY, Zou X, Wojdyla JA, Kleanthous C, Wodak SJ. Blind prediction of interfacial water positions in CAPRI. Proteins 2014; 82:620-32. [PMID: 24155158 PMCID: PMC4582081 DOI: 10.1002/prot.24439] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2013] [Revised: 09/16/2013] [Accepted: 09/26/2013] [Indexed: 12/30/2022]
Abstract
We report the first assessment of blind predictions of water positions at protein-protein interfaces, performed as part of the critical assessment of predicted interactions (CAPRI) community-wide experiment. Groups submitting docking predictions for the complex of the DNase domain of colicin E2 and Im2 immunity protein (CAPRI Target 47), were invited to predict the positions of interfacial water molecules using the method of their choice. The predictions-20 groups submitted a total of 195 models-were assessed by measuring the recall fraction of water-mediated protein contacts. Of the 176 high- or medium-quality docking models-a very good docking performance per se-only 44% had a recall fraction above 0.3, and a mere 6% above 0.5. The actual water positions were in general predicted to an accuracy level no better than 1.5 Å, and even in good models about half of the contacts represented false positives. This notwithstanding, three hotspot interface water positions were quite well predicted, and so was one of the water positions that is believed to stabilize the loop that confers specificity in these complexes. Overall the best interface water predictions was achieved by groups that also produced high-quality docking models, indicating that accurate modelling of the protein portion is a determinant factor. The use of established molecular mechanics force fields, coupled to sampling and optimization procedures also seemed to confer an advantage. Insights gained from this analysis should help improve the prediction of protein-water interactions and their role in stabilizing protein complexes.
Collapse
|
32
|
Turinsky AL, Razick S, Turner B, Donaldson IM, Wodak SJ. Navigating the global protein-protein interaction landscape using iRefWeb. Methods Mol Biol 2014; 1091:315-31. [PMID: 24203342 DOI: 10.1007/978-1-62703-691-7_22] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
iRefWeb is a bioinformatics resource that offers access to a large collection of data on protein-protein interactions in over a thousand organisms. This collection is consolidated from 14 major public databases that curate the scientific literature. The collection is enhanced with a range of versatile data filters and search options that categorize various types of protein-protein interactions and protein complexes. Users of iRefWeb are able to retrieve all curated interactions for a given organism or those involving a given protein (or a list of proteins), narrow down their search results based on different supporting evidence, and assess the reliability of these interactions using various criteria. They may also examine all data and annotations related to any publication that described the interaction-detection experiments. iRefWeb is freely available to the research community worldwide at http://wodaklab.org/iRefWeb .
Collapse
|
33
|
Wodak SJ, Vlasblom J, Turinsky AL, Pu S. Protein–protein interaction networks: the puzzling riches. Curr Opin Struct Biol 2013; 23:941-53. [DOI: 10.1016/j.sbi.2013.08.002] [Citation(s) in RCA: 53] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2013] [Revised: 07/14/2013] [Accepted: 08/08/2013] [Indexed: 12/13/2022]
|
34
|
Wintjens RT, Rooman MJ, Wodak SJ. Identification of Short Turn Motifs in Proteins Using Sequence and Structure Fingerprints. Isr J Chem 2013. [DOI: 10.1002/ijch.199400030] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
35
|
Moretti R, Fleishman SJ, Agius R, Torchala M, Bates PA, Kastritis PL, Rodrigues JPGLM, Trellet M, Bonvin AMJJ, Cui M, Rooman M, Gillis D, Dehouck Y, Moal I, Romero-Durana M, Perez-Cano L, Pallara C, Jimenez B, Fernandez-Recio J, Flores S, Pacella M, Kilambi KP, Gray JJ, Popov P, Grudinin S, Esquivel-Rodríguez J, Kihara D, Zhao N, Korkin D, Zhu X, Demerdash ONA, Mitchell JC, Kanamori E, Tsuchiya Y, Nakamura H, Lee H, Park H, Seok C, Sarmiento J, Liang S, Teraguchi S, Standley DM, Shimoyama H, Terashi G, Takeda-Shitaka M, Iwadate M, Umeyama H, Beglov D, Hall DR, Kozakov D, Vajda S, Pierce BG, Hwang H, Vreven T, Weng Z, Huang Y, Li H, Yang X, Ji X, Liu S, Xiao Y, Zacharias M, Qin S, Zhou HX, Huang SY, Zou X, Velankar S, Janin J, Wodak SJ, Baker D. Community-wide evaluation of methods for predicting the effect of mutations on protein-protein interactions. Proteins 2013; 81:1980-7. [PMID: 23843247 PMCID: PMC4143140 DOI: 10.1002/prot.24356] [Citation(s) in RCA: 79] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2013] [Revised: 06/13/2013] [Accepted: 06/18/2013] [Indexed: 12/25/2022]
Abstract
Community-wide blind prediction experiments such as CAPRI and CASP provide an objective measure of the current state of predictive methodology. Here we describe a community-wide assessment of methods to predict the effects of mutations on protein-protein interactions. Twenty-two groups predicted the effects of comprehensive saturation mutagenesis for two designed influenza hemagglutinin binders and the results were compared with experimental yeast display enrichment data obtained using deep sequencing. The most successful methods explicitly considered the effects of mutation on monomer stability in addition to binding affinity, carried out explicit side-chain sampling and backbone relaxation, evaluated packing, electrostatic, and solvation effects, and correctly identified around a third of the beneficial mutations. Much room for improvement remains for even the best techniques, and large-scale fitness landscapes should continue to provide an excellent test bed for continued evaluation of both existing and new prediction methodologies.
Collapse
|
36
|
Lensink MF, Wodak SJ. Docking, scoring, and affinity prediction in CAPRI. Proteins 2013; 81:2082-95. [DOI: 10.1002/prot.24428] [Citation(s) in RCA: 199] [Impact Index Per Article: 18.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2013] [Revised: 09/12/2013] [Accepted: 09/13/2013] [Indexed: 01/28/2023]
|
37
|
Wodak SJ, Garton M, Malevanets A, McKinnon S. 174 Self-association prompts proteins for new function: The role of altered dynamic properties. J Biomol Struct Dyn 2013. [DOI: 10.1080/07391102.2013.786416] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
38
|
Havugimana PC, Hart GT, Nepusz T, Yang H, Turinsky AL, Li Z, Wang PI, Boutz DR, Fong V, Phanse S, Babu M, Craig SA, Hu P, Wan C, Vlasblom J, Dar VUN, Bezginov A, Clark GW, Wu GC, Wodak SJ, Tillier ERM, Paccanaro A, Marcotte EM, Emili A. A census of human soluble protein complexes. Cell 2012; 150:1068-81. [PMID: 22939629 DOI: 10.1016/j.cell.2012.08.011] [Citation(s) in RCA: 629] [Impact Index Per Article: 52.4] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2012] [Revised: 07/30/2012] [Accepted: 08/10/2012] [Indexed: 12/19/2022]
Abstract
Cellular processes often depend on stable physical associations between proteins. Despite recent progress, knowledge of the composition of human protein complexes remains limited. To close this gap, we applied an integrative global proteomic profiling approach, based on chromatographic separation of cultured human cell extracts into more than one thousand biochemical fractions that were subsequently analyzed by quantitative tandem mass spectrometry, to systematically identify a network of 13,993 high-confidence physical interactions among 3,006 stably associated soluble human proteins. Most of the 622 putative protein complexes we report are linked to core biological processes and encompass both candidate disease genes and unannotated proteins to inform on mechanism. Strikingly, whereas larger multiprotein assemblies tend to be more extensively annotated and evolutionarily conserved, human protein complexes with five or fewer subunits are far more likely to be functionally unannotated or restricted to vertebrates, suggesting more recent functional innovations.
Collapse
|
39
|
Babu M, Vlasblom J, Pu S, Guo X, Graham C, Bean BDM, Burston HE, Vizeacoumar FJ, Snider J, Phanse S, Fong V, Tam YYC, Davey M, Hnatshak O, Bajaj N, Chandran S, Punna T, Christopolous C, Wong V, Yu A, Zhong G, Li J, Stagljar I, Conibear E, Wodak SJ, Emili A, Greenblatt JF. Interaction landscape of membrane-protein complexes in Saccharomyces cerevisiae. Nature 2012; 489:585-9. [PMID: 22940862 DOI: 10.1038/nature11354] [Citation(s) in RCA: 176] [Impact Index Per Article: 14.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2011] [Accepted: 06/27/2012] [Indexed: 01/03/2023]
Abstract
Macromolecular assemblies involving membrane proteins (MPs) serve vital biological roles and are prime drug targets in a variety of diseases. Large-scale affinity purification studies of soluble-protein complexes have been accomplished for diverse model organisms, but no global characterization of MP-complex membership has been described so far. Here we report a complete survey of 1,590 putative integral, peripheral and lipid-anchored MPs from Saccharomyces cerevisiae, which were affinity purified in the presence of non-denaturing detergents. The identities of the co-purifying proteins were determined by tandem mass spectrometry and subsequently used to derive a high-confidence physical interaction map encompassing 1,726 membrane protein-protein interactions and 501 putative heteromeric complexes associated with the various cellular membrane systems. Our analysis reveals unexpected physical associations underlying the membrane biology of eukaryotes and delineates the global topological landscape of the membrane interactome.
Collapse
|
40
|
|
41
|
Fiume M, Smith EJM, Brook A, Strbenac D, Turner B, Mezlini AM, Robinson MD, Wodak SJ, Brudno M. Savant Genome Browser 2: visualization and analysis for population-scale genomics. Nucleic Acids Res 2012; 40:W615-21. [PMID: 22638571 PMCID: PMC3394255 DOI: 10.1093/nar/gks427] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022] Open
Abstract
High-throughput sequencing (HTS) technologies are providing an unprecedented capacity for data generation, and there is a corresponding need for efficient data exploration and analysis capabilities. Although most existing tools for HTS data analysis are developed for either automated (e.g. genotyping) or visualization (e.g. genome browsing) purposes, such tools are most powerful when combined. For example, integration of visualization and computation allows users to iteratively refine their analyses by updating computational parameters within the visual framework in real-time. Here we introduce the second version of the Savant Genome Browser, a standalone program for visual and computational analysis of HTS data. Savant substantially improves upon its predecessor and existing tools by introducing innovative visualization modes and navigation interfaces for several genomic datatypes, and synergizing visual and automated analyses in a way that is powerful yet easy even for non-expert users. We also present a number of plugins that were developed by the Savant Community, which demonstrate the power of integrating visual and automated analyses using Savant. The Savant Genome Browser is freely available (open source) at www.savantbrowser.com.
Collapse
|
42
|
Wodak SJ, Mietchen D, Collings AM, Russell RB, Bourne PE. Topic pages: PLoS Computational Biology meets Wikipedia. PLoS Comput Biol 2012; 8:e1002446. [PMID: 22479174 PMCID: PMC3315447 DOI: 10.1371/journal.pcbi.1002446] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
|
43
|
Malevanets A, Wodak SJ. Multiple replica repulsion technique for efficient conformational sampling of biological systems. Biophys J 2011; 101:951-60. [PMID: 21843487 DOI: 10.1016/j.bpj.2011.06.043] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2011] [Revised: 06/10/2011] [Accepted: 06/21/2011] [Indexed: 11/17/2022] Open
Abstract
Here, we propose a technique for sampling complex molecular systems with many degrees of freedom. The technique, termed "multiple replica repulsion" (MRR), does not suffer from poor scaling with the number of degrees of freedom associated with common replica exchange procedures and does not require sampling at high temperatures. The algorithm involves creation of multiple copies (replicas) of the system, which interact with one another through a repulsive potential that can be applied to the system as a whole or to portions of it. The proposed scheme prevents oversampling of the most populated states and provides accurate descriptions of conformational perturbations typically associated with sampling ground-state energy wells. The performance of MRR is illustrated for three systems of increasing complexity. A two-dimensional toy potential surface is used to probe the sampling efficiency as a function of key parameters of the procedure. MRR simulations of the Met-enkephalin pentapeptide, and the 76-residue protein ubiquitin, performed in presence of explicit water molecules and totaling 32 ns each, investigate the ability of MRR to characterize the conformational landscape of the peptide, and the protein native basin, respectively. Results obtained for the enkephalin peptide reflect more closely the extensive conformational flexibility of this peptide than previously reported simulations. Those obtained for ubiquitin show that conformational ensembles sampled by MRR largely encompass structural fluctuations relevant to biological recognition, which occur on the microsecond timescale, or are observed in crystal structures of ubiquitin complexes with other proteins. MRR thus emerges as a very promising simple and versatile technique for modeling the structural plasticity of complex biological systems.
Collapse
|
44
|
Fleishman SJ, Whitehead TA, Strauch EM, Corn JE, Qin S, Zhou HX, Mitchell JC, Demerdash ON, Takeda-Shitaka M, Terashi G, Moal IH, Li X, Bates PA, Zacharias M, Park H, Ko JS, Lee H, Seok C, Bourquard T, Bernauer J, Poupon A, Azé J, Soner S, Ovali ŞK, Ozbek P, Ben Tal N, Haliloglu T, Hwang H, Vreven T, Pierce BG, Weng Z, Pérez-Cano L, Pons C, Fernández-Recio J, Jiang F, Yang F, Gong X, Cao L, Xu X, Liu B, Wang P, Li C, Wang C, Robert CH, Guharoy M, Liu S, Huang Y, Li L, Guo D, Chen Y, Xiao Y, London N, Itzhaki Z, Schueler-Furman O, Inbar Y, Patapov V, Cohen M, Schreiber G, Tsuchiya Y, Kanamori E, Standley DM, Nakamura H, Kinoshita K, Driggers CM, Hall RG, Morgan JL, Hsu VL, Zhan J, Yang Y, Zhou Y, Kastritis PL, Bonvin AM, Zhang W, Camacho CJ, Kilambi KP, Sircar A, Gray JJ, Ohue M, Uchikoga N, Matsuzaki Y, Ishida T, Akiyama Y, Khashan R, Bush S, Fouches D, Tropsha A, Esquivel-Rodríguez J, Kihara D, Stranges PB, Jacak R, Kuhlman B, Huang SY, Zou X, Wodak SJ, Janin J, Baker D. Community-wide assessment of protein-interface modeling suggests improvements to design methodology. J Mol Biol 2011; 414:289-302. [PMID: 22001016 PMCID: PMC3839241 DOI: 10.1016/j.jmb.2011.09.031] [Citation(s) in RCA: 119] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2011] [Revised: 09/08/2011] [Accepted: 09/16/2011] [Indexed: 11/26/2022]
Abstract
The CAPRI (Critical Assessment of Predicted Interactions) and CASP (Critical Assessment of protein Structure Prediction) experiments have demonstrated the power of community-wide tests of methodology in assessing the current state of the art and spurring progress in the very challenging areas of protein docking and structure prediction. We sought to bring the power of community-wide experiments to bear on a very challenging protein design problem that provides a complementary but equally fundamental test of current understanding of protein-binding thermodynamics. We have generated a number of designed protein-protein interfaces with very favorable computed binding energies but which do not appear to be formed in experiments, suggesting that there may be important physical chemistry missing in the energy calculations. A total of 28 research groups took up the challenge of determining what is missing: we provided structures of 87 designed complexes and 120 naturally occurring complexes and asked participants to identify energetic contributions and/or structural features that distinguish between the two sets. The community found that electrostatics and solvation terms partially distinguish the designs from the natural complexes, largely due to the nonpolar character of the designed interactions. Beyond this polarity difference, the community found that the designed binding surfaces were, on average, structurally less embedded in the designed monomers, suggesting that backbone conformational rigidity at the designed surface is important for realization of the designed function. These results can be used to improve computational design strategies, but there is still much to be learned; for example, one designed complex, which does form in experiments, was classified by all metrics as a nonbinder.
Collapse
|
45
|
Rajendram R, Ferreira JC, Grafodatskaya D, Choufani S, Chiang T, Pu S, Butcher DT, Wodak SJ, Weksberg R. Assessment of methylation level prediction accuracy in methyl-DNA immunoprecipitation and sodium bisulfite based microarray platforms. Epigenetics 2011; 6:410-5. [PMID: 21343703 DOI: 10.4161/epi.6.4.14763] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
In this study, we verified the accuracy of two array methods--methylated DNA immunoprecipitation coupled with CpG island microarrays (MeDIP-CGI-arrays) and sodium bisulfite conversion based microarrays (BC-arrays)--in predicting regional methylation levels as measured by pyrosequencing of bisulfite converted DNA (BC-pyrosequencing). To test the accuracy of these methods we used the Agilent Human CpG island and the Illumina HumanMethylation27 microarrays respectively, and compared microarray outputs to the data from targeted BC-pyrosequencing assays from several genomic regions of corresponding samples. We observed relatively high correlation with BC-pyrosequencing data for both array platforms, R = 0.87 for BC-Array and R = 0.79 for MeDIP-CGI array. However, MeDIP-CGI array were less reliable in predicting intermediate levels of DNA methylation. Several bioinformatics strategies, to ameliorate the performance of the MeDIP-CGI-Arrays did not improve the correlation with BC-pyrosequencing data. The high scalability, low cost and simpler analysis of BC-arrays, together with the recent extended coverage may make them a more versatile methylation analysis tool.
Collapse
|
46
|
Abstract
Protein docking algorithms are assessed by evaluating blind predictions performed during 2007-2009 in Rounds 13-19 of the community-wide experiment on critical assessment of predicted interactions (CAPRI). We evaluated the ability of these algorithms to sample docking poses and to single out specific association modes in 14 targets, representing 11 distinct protein complexes. These complexes play important biological roles in RNA maturation, G-protein signal processing, and enzyme inhibition and function. One target involved protein-RNA interactions not previously considered in CAPRI, several others were hetero-oligomers, or featured multiple interfaces between the same protein pair. For most targets, predictions started from the experimentally determined structures of the free (unbound) components, or from models built from known structures of related or similar proteins. To succeed they therefore needed to account for conformational changes and model inaccuracies. In total, 64 groups and 12 web-servers submitted docking predictions of which 4420 were evaluated. Overall our assessment reveals that 67% of the groups, more than ever before, produced acceptable models or better for at least one target, with many groups submitting multiple high- and medium-accuracy models for two to six targets. Forty-one groups including four web-servers participated in the scoring experiment with 1296 evaluated models. Scoring predictions also show signs of progress evidenced from the large proportion of correct models submitted. But singling out the best models remains a challenge, which also adversely affects the ability to correctly rank docking models. With the increased interest in translating abstract protein interaction networks into realistic models of protein assemblies, the growing CAPRI community is actively developing more efficient and reliable docking and scoring methods for everyone to use.
Collapse
|
47
|
Hao Y, Merkoulovitch A, Vlasblom J, Pu S, Turinsky AL, Roudeva D, Turner B, Greenblatt J, Wodak SJ. OrthoNets: simultaneous visual analysis of orthologs and their interaction neighborhoods across different organisms. Bioinformatics 2011; 27:883-4. [PMID: 21257609 PMCID: PMC3051336 DOI: 10.1093/bioinformatics/btr035] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Motivation: Protein interaction networks contain a wealth of biological information, but their large size often hinders cross-organism comparisons. We present OrthoNets, a Cytoscape plugin that displays protein–protein interaction (PPI) networks from two organisms simultaneously, highlighting orthology relationships and aggregating several types of biomedical annotations. OrthoNets also allows PPI networks derived from experiments to be overlaid on networks extracted from public databases, supporting the identification and verification of new interactors. Any newly identified PPIs can be validated by checking whether their orthologs interact in another organism. Availability: OrthoNets is freely available at http://wodaklab.org/orthonets/. Contact:jim.vlasblom@utoronto.ca
Collapse
|
48
|
Wodak SJ, Vlasblom J, Pu S. High-throughput analyses and curation of protein interactions in yeast. Methods Mol Biol 2011; 759:381-406. [PMID: 21863499 DOI: 10.1007/978-1-61779-173-4_22] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
The yeast Saccharomyces cerevisiae is the model organism in which protein interactions have been most extensively analyzed. The vast majority of these interactions have been characterized by a variety of sophisticated high-throughput techniques probing different aspects of protein association. This chapter summarizes the major techniques, highlights their complementary nature, discusses the data they produce, and highlights some of the biases from which they suffer. A main focus is the key role played by computational methods for processing, analyzing, and validating the large body of noisy data produced by the experimental procedures. It also describes how computational methods are used to extend the coverage and reliability of protein interaction data by integrating information from heterogeneous sources and reviews the current status of literature-curated data on yeast protein interactions stored in specialized databases.
Collapse
|
49
|
Turinsky AL, Turner B, Borja RC, Gleeson JA, Heath M, Pu S, Switzer T, Dong D, Gong Y, On T, Xiong X, Emili A, Greenblatt J, Parkinson J, Zhang Z, Wodak SJ. DAnCER: disease-annotated chromatin epigenetics resource. Nucleic Acids Res 2011; 39:D889-94. [PMID: 20876685 PMCID: PMC3013761 DOI: 10.1093/nar/gkq857] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2010] [Accepted: 09/12/2010] [Indexed: 12/15/2022] Open
Abstract
Chromatin modification (CM) is a set of epigenetic processes that govern many aspects of DNA replication, transcription and repair. CM is carried out by groups of physically interacting proteins, and their disruption has been linked to a number of complex human diseases. CM remains largely unexplored, however, especially in higher eukaryotes such as human. Here we present the DAnCER resource, which integrates information on genes with CM function from five model organisms, including human. Currently integrated are gene functional annotations, Pfam domain architecture, protein interaction networks and associated human diseases. Additional supporting evidence includes orthology relationships across organisms, membership in protein complexes, and information on protein 3D structure. These data are available for 962 experimentally confirmed and manually curated CM genes and for over 5000 genes with predicted CM function on the basis of orthology and domain composition. DAnCER allows visual explorations of the integrated data and flexible query capabilities using a variety of data filters. In particular, disease information and functional annotations are mapped onto the protein interaction networks, enabling the user to formulate new hypotheses on the function and disease associations of a given gene based on those of its interaction partners. DAnCER is freely available at http://wodaklab.org/dancer/.
Collapse
|
50
|
Turinsky AL, Razick S, Turner B, Donaldson IM, Wodak SJ. Literature curation of protein interactions: measuring agreement across major public databases. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2010; 2010:baq026. [PMID: 21183497 PMCID: PMC3011985 DOI: 10.1093/database/baq026] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
Abstract
Literature curation of protein interaction data faces a number of challenges. Although curators increasingly adhere to standard data representations, the data that various databases actually record from the same published information may differ significantly. Some of the reasons underlying these differences are well known, but their global impact on the interactions collectively curated by major public databases has not been evaluated. Here we quantify the agreement between curated interactions from 15 471 publications shared across nine major public databases. Results show that on average, two databases fully agree on 42% of the interactions and 62% of the proteins curated from the same publication. Furthermore, a sizable fraction of the measured differences can be attributed to divergent assignments of organism or splice isoforms, different organism focus and alternative representations of multi-protein complexes. Our findings highlight the impact of divergent curation policies across databases, and should be relevant to both curators and data consumers interested in analyzing protein-interaction data generated by the scientific community. Database URL:http://wodaklab.org/iRefWeb
Collapse
|