51
|
Abstract
Rather than providing a single specific protocol, the inclusive area of seed proteomics is reviewed; methods are described and compared and primary literature citations are provided. The limitations and challenges of proteomics as an approach to study seed biology are emphasized. The proteomic analysis of seeds encounters some specific problems that do not impinge on analyses of other plant cells, tissues, or organs. There are anatomic considerations. Seeds comprise the seed coat, the storage organ(s), and the embryonic axis. Are these to be studied individually or as a composite? The physiological status of the seeds must be considered; developing, mature, or germinating? If mature, are they quiescent or dormant? If mature and quiescent, then orthodox or recalcitrant? The genetic uniformity of the population of seeds being compared must be considered. Finally, seeds are protein-rich and the extreme abundance of the storage proteins results in a study-subject with a dynamic range that spans several orders of magnitude. This represents a problem that must be dealt with if the study involves analysis of proteins that are of "normal" to low abundance. Several different methods of prefractionation are described and the results compared.
Collapse
Affiliation(s)
- Ján A Miernyk
- USDA, Agricultural Research Service, Plant Genetics Research Unit, Department of Biochemistry, Interdisciplinary Plant Group, University of Missouri, Columbia, MO, USA
| |
Collapse
|
52
|
Teo G, Liu G, Zhang J, Nesvizhskii AI, Gingras AC, Choi H. SAINTexpress: improvements and additional features in Significance Analysis of INTeractome software. J Proteomics 2013; 100:37-43. [PMID: 24513533 DOI: 10.1016/j.jprot.2013.10.023] [Citation(s) in RCA: 413] [Impact Index Per Article: 37.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2013] [Revised: 10/05/2013] [Accepted: 10/18/2013] [Indexed: 10/26/2022]
Abstract
UNLABELLED Significance Analysis of INTeractome (SAINT) is a statistical method for probabilistically scoring protein-protein interaction data from affinity purification-mass spectrometry (AP-MS) experiments. The utility of the software has been demonstrated in many protein-protein interaction mapping studies, yet the extensive testing also revealed some practical drawbacks. In this paper, we present a new implementation, SAINTexpress, with simpler statistical model and quicker scoring algorithm, leading to significant improvements in computational speed and sensitivity of scoring. SAINTexpress also incorporates external interaction data to compute supplemental topology-based scores to improve the likelihood of identifying co-purifying protein complexes in a probabilistically objective manner. Overall, these changes are expected to improve the performance and user experience of SAINT across various types of high quality datasets. BIOLOGICAL SIGNIFICANCE We present SAINTexpress, an upgraded implementation of Significance Analysis of INTeractome (SAINT) for filtering high confidence interaction data from affinity purification-mass spectrometry (AP-MS) experiments. SAINTexpress features faster computation and incorporation of external data sources into the scoring, improving the performance and user experience of SAINT across various types of datasets. This article is part of a Special Issue entitled: Can Proteomics Fill the Gap Between Genomics and Phenotypes?
Collapse
Affiliation(s)
- Guoci Teo
- Department of Statistics and Applied Probability, National University of Singapore, Singapore
| | - Guomin Liu
- Lunenfeld-Tanenbaum Research Institute, Mount Sinai Hospital, Toronto, Ontario, Canada
| | - Jianping Zhang
- Lunenfeld-Tanenbaum Research Institute, Mount Sinai Hospital, Toronto, Ontario, Canada
| | - Alexey I Nesvizhskii
- Departments of Pathology and Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Anne-Claude Gingras
- Centre for Systems Biology, Lunenfeld-Tanenbaum Research Institute at Mount Sinai Hospital, ON M5G 1X5, Canada; Department of Molecular Genetics, University of Toronto, ON M5S 1A8, Canada
| | - Hyungwon Choi
- Saw Swee Hock School of Public Health, National University of Singapore and National University Health System, Singapore.
| |
Collapse
|
53
|
Tucker G, Loh PR, Berger B. A sampling framework for incorporating quantitative mass spectrometry data in protein interaction analysis. BMC Bioinformatics 2013; 14:299. [PMID: 24093595 PMCID: PMC3851523 DOI: 10.1186/1471-2105-14-299] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2013] [Accepted: 09/14/2013] [Indexed: 11/15/2022] Open
Abstract
Background Comprehensive protein-protein interaction (PPI) maps are a powerful resource for uncovering the molecular basis of genetic interactions and providing mechanistic insights. Over the past decade, high-throughput experimental techniques have been developed to generate PPI maps at proteome scale, first using yeast two-hybrid approaches and more recently via affinity purification combined with mass spectrometry (AP-MS). Unfortunately, data from both protocols are prone to both high false positive and false negative rates. To address these issues, many methods have been developed to post-process raw PPI data. However, with few exceptions, these methods only analyze binary experimental data (in which each potential interaction tested is deemed either observed or unobserved), neglecting quantitative information available from AP-MS such as spectral counts. Results We propose a novel method for incorporating quantitative information from AP-MS data into existing PPI inference methods that analyze binary interaction data. Our approach introduces a probabilistic framework that models the statistical noise inherent in observations of co-purifications. Using a sampling-based approach, we model the uncertainty of interactions with low spectral counts by generating an ensemble of possible alternative experimental outcomes. We then apply the existing method of choice to each alternative outcome and aggregate results over the ensemble. We validate our approach on three recent AP-MS data sets and demonstrate performance comparable to or better than state-of-the-art methods. Additionally, we provide an in-depth discussion comparing the theoretical bases of existing approaches and identify common aspects that may be key to their performance. Conclusions Our sampling framework extends the existing body of work on PPI analysis using binary interaction data to apply to the richer quantitative data now commonly available through AP-MS assays. This framework is quite general, and many enhancements are likely possible. Fruitful future directions may include investigating more sophisticated schemes for converting spectral counts to probabilities and applying the framework to direct protein complex prediction methods.
Collapse
Affiliation(s)
- George Tucker
- Mathematics Department and Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA.
| | | | | |
Collapse
|
54
|
Chen B, Wu FX. Identifying protein complexes based on multiple topological structures in PPI networks. IEEE Trans Nanobioscience 2013; 12:165-172. [PMID: 23974659 DOI: 10.1109/tnb.2013.2264097] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/09/2024]
Abstract
Various computational algorithms are developed to identify protein complexes based on only one of specific topological structures in protein-protein interaction (PPI) networks, such as cliques, dense subgraphs, core-attachment structures and starlike structures. However, protein complexes exhibit intricate connections in a PPI network. They cannot be fully detected by only single topological structure. In this paper, we propose an algorithm based on multiple topological structures to identify protein complexes from PPI networks. In the proposed algorithm, four single topological structure based algorithms are first employed to identify raw predictions with specific topological structures, respectively. Those raw predictions are trimmed according to their topological information or GO annotations. Similar results are carefully merged before generating final predictions. Numerical experiments are conducted on a yeast PPI network of DIP and a human PPI network of HPRD. The predicted results show that the multiple topological structure based algorithm can not only obtain a more number of predictions, but also generate results with high accuracy in terms of f-score, matching with known protein complexes and functional enrichments with GO.
Collapse
Affiliation(s)
- Bolin Chen
- Division of Biomedical Engineering, University of Saskatchewan, Saskatoon, SK, Canada.
| | | |
Collapse
|
55
|
Mellacheruvu D, Wright Z, Couzens AL, Lambert JP, St-Denis N, Li T, Miteva YV, Hauri S, Sardiu ME, Low TY, Halim VA, Bagshaw RD, Hubner NC, al-Hakim A, Bouchard A, Faubert D, Fermin D, Dunham WH, Goudreault M, Lin ZY, Badillo BG, Pawson T, Durocher D, Coulombe B, Aebersold R, Superti-Furga G, Colinge J, Heck AJR, Choi H, Gstaiger M, Mohammed S, Cristea IM, Bennett KL, Washburn MP, Raught B, Ewing RM, Gingras AC, Nesvizhskii AI. The CRAPome: a contaminant repository for affinity purification-mass spectrometry data. Nat Methods 2013; 10:730-6. [PMID: 23921808 PMCID: PMC3773500 DOI: 10.1038/nmeth.2557] [Citation(s) in RCA: 1149] [Impact Index Per Article: 104.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2013] [Accepted: 05/18/2013] [Indexed: 02/07/2023]
Abstract
Affinity purification coupled with mass spectrometry (AP-MS) is a widely used approach for the identification of protein-protein interactions. However, for any given protein of interest, determining which of the identified polypeptides represent bona fide interactors versus those that are background contaminants (for example, proteins that interact with the solid-phase support, affinity reagent or epitope tag) is a challenging task. The standard approach is to identify nonspecific interactions using one or more negative-control purifications, but many small-scale AP-MS studies do not capture a complete, accurate background protein set when available controls are limited. Fortunately, negative controls are largely bait independent. Hence, aggregating negative controls from multiple AP-MS studies can increase coverage and improve the characterization of background associated with a given experimental protocol. Here we present the contaminant repository for affinity purification (the CRAPome) and describe its use for scoring protein-protein interactions. The repository (currently available for Homo sapiens and Saccharomyces cerevisiae) and computational tools are freely accessible at http://www.crapome.org/.
Collapse
Affiliation(s)
- Dattatreya Mellacheruvu
- Department of Pathology, University of Michigan, Ann Arbor, MI, USA
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Zachary Wright
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Amber L. Couzens
- Centre for Systems Biology, Samuel Lunenfeld Research Institute at Mount Sinai Hospital, Toronto, ON, Canada
| | - Jean-Philippe Lambert
- Centre for Systems Biology, Samuel Lunenfeld Research Institute at Mount Sinai Hospital, Toronto, ON, Canada
| | - Nicole St-Denis
- Centre for Systems Biology, Samuel Lunenfeld Research Institute at Mount Sinai Hospital, Toronto, ON, Canada
| | - Tuo Li
- Department of Molecular Biology, Princeton University, Princeton, NJ, USA
| | - Yana V. Miteva
- Department of Molecular Biology, Princeton University, Princeton, NJ, USA
| | - Simon Hauri
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland
| | | | - Teck Yew Low
- Biomolecular Mass Spectrometry and Proteomics, Bijvoet Center for Biomolecular Research and Utrecht Institute for Pharmaceutical Sciences, Utrecht University, Utrecht, The Netherlands
- Netherlands Proteomics Center, Utrecht, The Netherlands
| | - Vincentius A. Halim
- Biomolecular Mass Spectrometry and Proteomics, Bijvoet Center for Biomolecular Research and Utrecht Institute for Pharmaceutical Sciences, Utrecht University, Utrecht, The Netherlands
- Netherlands Proteomics Center, Utrecht, The Netherlands
- Division of Cell Biology, Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - Richard D. Bagshaw
- Centre for Systems Biology, Samuel Lunenfeld Research Institute at Mount Sinai Hospital, Toronto, ON, Canada
| | - Nina C. Hubner
- Department of Molecular Biology; Faculty of Science; Nijmegen Centre for Molecular Life Sciences; Radboud University; Nijmegen, The Netherlands
| | - Abdallah al-Hakim
- Centre for Systems Biology, Samuel Lunenfeld Research Institute at Mount Sinai Hospital, Toronto, ON, Canada
| | - Annie Bouchard
- Institut de recherches cliniques de Montréal (IRCM), Montréal, QC, Canada
| | - Denis Faubert
- Institut de recherches cliniques de Montréal (IRCM), Montréal, QC, Canada
| | - Damian Fermin
- Department of Pathology, University of Michigan, Ann Arbor, MI, USA
| | - Wade H. Dunham
- Centre for Systems Biology, Samuel Lunenfeld Research Institute at Mount Sinai Hospital, Toronto, ON, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
| | - Marilyn Goudreault
- Centre for Systems Biology, Samuel Lunenfeld Research Institute at Mount Sinai Hospital, Toronto, ON, Canada
| | - Zhen-Yuan Lin
- Centre for Systems Biology, Samuel Lunenfeld Research Institute at Mount Sinai Hospital, Toronto, ON, Canada
| | - Beatriz Gonzalez Badillo
- Centre for Systems Biology, Samuel Lunenfeld Research Institute at Mount Sinai Hospital, Toronto, ON, Canada
| | - Tony Pawson
- Centre for Systems Biology, Samuel Lunenfeld Research Institute at Mount Sinai Hospital, Toronto, ON, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
| | - Daniel Durocher
- Centre for Systems Biology, Samuel Lunenfeld Research Institute at Mount Sinai Hospital, Toronto, ON, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
| | - Benoit Coulombe
- Institut de recherches cliniques de Montréal (IRCM), Montréal, QC, Canada
- Department of Biochemistry, Université de Montréal, Montréal, QC, Canada
| | - Ruedi Aebersold
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland
| | - Giulio Superti-Furga
- CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, Vienna, Austria
| | - Jacques Colinge
- CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, Vienna, Austria
| | - Albert J. R. Heck
- Biomolecular Mass Spectrometry and Proteomics, Bijvoet Center for Biomolecular Research and Utrecht Institute for Pharmaceutical Sciences, Utrecht University, Utrecht, The Netherlands
- Netherlands Proteomics Center, Utrecht, The Netherlands
| | - Hyungwon Choi
- Saw Swee Hock School of Public Health, National University of Singapore, Singapore
| | - Matthias Gstaiger
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland
| | - Shabaz Mohammed
- Biomolecular Mass Spectrometry and Proteomics, Bijvoet Center for Biomolecular Research and Utrecht Institute for Pharmaceutical Sciences, Utrecht University, Utrecht, The Netherlands
- Netherlands Proteomics Center, Utrecht, The Netherlands
| | - Ileana M. Cristea
- Department of Molecular Biology, Princeton University, Princeton, NJ, USA
| | - Keiryn L. Bennett
- CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, Vienna, Austria
| | - Mike P. Washburn
- Stowers Institute for Medical Research, Kansas City, MO, USA
- Department of Pathology & Laboratory Medicine, University of Kansas Medical Center, Kansas City, KS 66160, USA
| | - Brian Raught
- Ontario Cancer Institute, Toronto, ON, Canada
- Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada
| | - Rob M. Ewing
- Center for Proteomics and Bioinformatics, Case Western Reserve University, Cleveland, OH, USA
- Department of Genetics and Genome Science, Case Western Reserve University School of Medicine, Cleveland, OH, USA
| | - Anne-Claude Gingras
- Centre for Systems Biology, Samuel Lunenfeld Research Institute at Mount Sinai Hospital, Toronto, ON, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
| | - Alexey I. Nesvizhskii
- Department of Pathology, University of Michigan, Ann Arbor, MI, USA
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| |
Collapse
|
56
|
Rupakula A, Kruse T, Boeren S, Holliger C, Smidt H, Maillard J. The restricted metabolism of the obligate organohalide respiring bacterium Dehalobacter restrictus: lessons from tiered functional genomics. Philos Trans R Soc Lond B Biol Sci 2013; 368:20120325. [PMID: 23479754 DOI: 10.1098/rstb.2012.0325] [Citation(s) in RCA: 51] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
Dehalobacter restrictus strain PER-K23 is an obligate organohalide respiring bacterium, which displays extremely narrow metabolic capabilities. It grows only via coupling energy conservation to anaerobic respiration of tetra- and trichloroethene with hydrogen as sole electron donor. Dehalobacter restrictus represents the paradigmatic member of the genus Dehalobacter, which in recent years has turned out to be a major player in the bioremediation of an increasing number of organohalides, both in situ and in laboratory studies. The recent elucidation of the D. restrictus genome revealed a rather elaborate genome with predicted pathways that were not suspected from its restricted metabolism, such as a complete corrinoid biosynthetic pathway, the Wood-Ljungdahl (WL) pathway for CO2 fixation, abundant transcriptional regulators and several types of hydrogenases. However, one important feature of the genome is the presence of 25 reductive dehalogenase genes, from which so far only one, pceA, has been characterized on genetic and biochemical levels. This study describes a multi-level functional genomics approach on D. restrictus across three different growth phases. A global proteomic analysis allowed consideration of general metabolic pathways relevant to organohalide respiration, whereas the dedicated genomic and transcriptomic analysis focused on the diversity, composition and expression of genes associated with reductive dehalogenases.
Collapse
Affiliation(s)
- Aamani Rupakula
- Ecole Polytechnique Fédérale de Lausanne (EPFL), School of Architecture, Civil and Environmental Engineering, Laboratory for Environmental Biotechnology, Station 6, 1015 Lausanne, Switzerland
| | | | | | | | | | | |
Collapse
|
57
|
Byron A, Humphries JD, Humphries MJ. Defining the extracellular matrix using proteomics. Int J Exp Pathol 2013; 94:75-92. [PMID: 23419153 DOI: 10.1111/iep.12011] [Citation(s) in RCA: 118] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2012] [Revised: 09/13/2012] [Accepted: 11/16/2012] [Indexed: 12/11/2022] Open
Abstract
The cell microenvironment has a profound influence on the behaviour, growth and survival of cells. The extracellular matrix (ECM) provides not only mechanical and structural support to cells and tissues but also binds soluble ligands and transmembrane receptors to provide spatial coordination of signalling processes. The ability of cells to sense the chemical, mechanical and topographical features of the ECM enables them to integrate complex, multiparametric information into a coherent response to the surrounding microenvironment. Consequently, dysregulation or mutation of ECM components results in a broad range of pathological conditions. Characterization of the composition of ECM derived from various cells has begun to reveal insights into ECM structure and function, and mechanisms of disease. Proteomic methodologies permit the global analysis of subcellular systems, but extracellular and transmembrane proteins present analytical difficulties to proteomic strategies owing to the particular biochemical properties of these molecules. Here, we review advances in proteomic approaches that have been applied to furthering our understanding of the ECM microenvironment. We survey recent studies that have addressed challenges in the analysis of ECM and discuss major outcomes in the context of health and disease. In addition, we summarize efforts to progress towards a systems-level understanding of ECM biology.
Collapse
Affiliation(s)
- Adam Byron
- Wellcome Trust Centre for Cell-Matrix Research, Faculty of Life Sciences, University of Manchester, Manchester, UK
| | | | | |
Collapse
|
58
|
White EA, Howley PM. Proteomic approaches to the study of papillomavirus-host interactions. Virology 2013; 435:57-69. [PMID: 23217616 PMCID: PMC3522865 DOI: 10.1016/j.virol.2012.09.046] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2012] [Accepted: 09/30/2012] [Indexed: 01/22/2023]
Abstract
The identification of interactions between viral and host cellular proteins has provided major insights into papillomavirus research, and these interactions are especially relevant to the role of papillomaviruses in the cancers with which they are associated. Recent advances in mass spectrometry technology and data processing now allow the systematic identification of such interactions. This has led to an improved understanding of the different pathologies associated with the many papillomavirus types, and the diverse nature of these viruses is reflected in the spectrum of interactions with host proteins. Here we review a history of proteomic approaches, particularly as applied to the papillomaviruses, and summarize current techniques. Current proteomic studies on the papillomaviruses use yeast-two-hybrid or affinity purification-mass spectrometry approaches. We detail the advantages and disadvantages of each and describe current examples of papillomavirus proteomic studies, with a particular focus on the HPV E6 and E7 oncoproteins.
Collapse
Affiliation(s)
- Elizabeth A. White
- Department of Microbiology and Immunobiology, Harvard Medical School, NRB Room 950, 77 Avenue Louis Pasteur, Boston, MA 02115
| | - Peter M. Howley
- Department of Microbiology and Immunobiology, Harvard Medical School, NRB Room 950, 77 Avenue Louis Pasteur, Boston, MA 02115
| |
Collapse
|
59
|
Braun P, Aubourg S, Van Leene J, De Jaeger G, Lurin C. Plant protein interactomes. ANNUAL REVIEW OF PLANT BIOLOGY 2013; 64:161-87. [PMID: 23330791 DOI: 10.1146/annurev-arplant-050312-120140] [Citation(s) in RCA: 74] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2023]
Abstract
Protein-protein interactions are a critical element of biological systems, and the analysis of interaction partners can provide valuable hints about unknown functions of a protein. In recent years, several large-scale protein interaction studies have begun to unravel the complex networks through which plant proteins exert their functions. Two major classes of experimental approaches are used for protein interaction mapping: analysis of direct interactions using binary methods such as yeast two-hybrid or split ubiquitin, and analysis of protein complexes through affinity purification followed by mass spectrometry. In addition, bioinformatics predictions can suggest interactions that have evaded detection by other methods or those of proteins that have not been investigated. Here we review the major approaches to construct, analyze, use, and carry out quality control on plant protein interactome networks. We present experimental and computational approaches for large-scale mapping, methods for validation or smaller-scale functional studies, important bioinformatics resources, and findings from recently published large-scale plant interactome network maps.
Collapse
Affiliation(s)
- Pascal Braun
- Department of Plant Systems Biology, Center for Life and Food Sciences Weihenstephan, Technische Universität München (TUM), 85354 Freising-Weihenstephan, Germany.
| | | | | | | | | |
Collapse
|
60
|
Choi H, Liu G, Mellacheruvu D, Tyers M, Gingras AC, Nesvizhskii AI. Analyzing protein-protein interactions from affinity purification-mass spectrometry data with SAINT. ACTA ACUST UNITED AC 2012; Chapter 8:8.15.1-8.15.23. [PMID: 22948729 DOI: 10.1002/0471250953.bi0815s39] [Citation(s) in RCA: 94] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Significance Analysis of INTeractome (SAINT) is a software package for scoring protein-protein interactions based on label-free quantitative proteomics data (e.g., spectral count or intensity) in affinity purification-mass spectrometry (AP-MS) experiments. SAINT allows bench scientists to select bona fide interactions and remove nonspecific interactions in an unbiased manner. However, there is no 'one-size-fits-all' statistical model for every dataset, since the experimental design varies across studies. Key variables include the number of baits, the number of biological replicates per bait, and control purifications. Here we give a detailed account of input data format, control data, selection of high-confidence interactions, and visualization of filtered data. We explain additional options for customizing the statistical model for optimal filtering in specific datasets. We also discuss a graphical user interface of SAINT in connection to the LIMS system ProHits, which can be installed as a virtual machine on Mac OS X or Windows computers.
Collapse
Affiliation(s)
- Hyungwon Choi
- Saw Swee Hock School of Public Health, National University of Singapore, Singapore
| | | | | | | | | | | |
Collapse
|
61
|
Altelaar AFM, Munoz J, Heck AJR. Next-generation proteomics: towards an integrative view of proteome dynamics. Nat Rev Genet 2012. [PMID: 23207911 DOI: 10.1038/nrg3356] [Citation(s) in RCA: 515] [Impact Index Per Article: 42.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Next-generation sequencing allows the analysis of genomes, including those representing disease states. However, the causes of most disorders are multifactorial, and systems-level approaches, including the analysis of proteomes, are required for a more comprehensive understanding. The proteome is extremely multifaceted owing to splicing and protein modifications, and this is further amplified by the interconnectivity of proteins into complexes and signalling networks that are highly divergent in time and space. Proteome analysis heavily relies on mass spectrometry (MS). MS-based proteomics is starting to mature and to deliver through a combination of developments in instrumentation, sample preparation and computational analysis. Here we describe this emerging next generation of proteomics and highlight recent applications.
Collapse
Affiliation(s)
- A F Maarten Altelaar
- Biomolecular Mass Spectrometry and Proteomics, Bijvoet Center for Biomolecular Research and Utrecht Institute for Pharmaceutical Sciences, Utrecht University, Padualaan 8, 3584 CH Utrecht, The Netherlands
| | | | | |
Collapse
|
62
|
Label-free quantitative proteomics trends for protein-protein interactions. J Proteomics 2012; 81:91-101. [PMID: 23153790 DOI: 10.1016/j.jprot.2012.10.027] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2012] [Revised: 10/24/2012] [Accepted: 10/31/2012] [Indexed: 12/14/2022]
Abstract
Understanding protein interactions within the complexity of a living cell is challenging, but techniques coupling affinity purification and mass spectrometry have enabled important progress to be made in the past 15 years. As identification of protein-protein interactions is becoming easier, the quantification of the interaction dynamics is the next frontier. Several quantitative mass spectrometric approaches have been developed to address this issue that vary in their strengths and weaknesses. While isotopic labeling approaches continue to contribute to the identification of regulated interactions, techniques that do not require labeling are becoming increasingly used in the field. Here, we describe the major types of label-free quantification used in interaction proteomics, and discuss the relative merits of data dependent and data independent acquisition approaches in label-free quantification. This article is part of a Special Issue entitled: From protein structures to clinical applications.
Collapse
|
63
|
Dunham WH, Mullin M, Gingras AC. Affinity-purification coupled to mass spectrometry: basic principles and strategies. Proteomics 2012; 12:1576-90. [PMID: 22611051 DOI: 10.1002/pmic.201100523] [Citation(s) in RCA: 234] [Impact Index Per Article: 19.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
Identifying the interactions established by a protein of interest can be a critical step in understanding its function. This is especially true when an unknown protein of interest is demonstrated to physically interact with proteins of known function. While many techniques have been developed to characterize protein-protein interactions, one strategy that has gained considerable momentum over the past decade for identification and quantification of protein-protein interactions, is affinity-purification followed by mass spectrometry (AP-MS). Here, we briefly review the basic principles used in affinity-purification coupled to mass spectrometry, with an emphasis on tools (both biochemical and computational), which enable the discovery and reporting of high quality protein-protein interactions.
Collapse
Affiliation(s)
- Wade H Dunham
- Samuel Lunenfeld Research Institute at Mount Sinai Hospital, Toronto, ON, Canada
| | | | | |
Collapse
|
64
|
Armean IM, Lilley KS, Trotter MWB. Popular computational methods to assess multiprotein complexes derived from label-free affinity purification and mass spectrometry (AP-MS) experiments. Mol Cell Proteomics 2012; 12:1-13. [PMID: 23071097 DOI: 10.1074/mcp.r112.019554] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Advances in sensitivity, resolution, mass accuracy, and throughput have considerably increased the number of protein identifications made via mass spectrometry. Despite these advances, state-of-the-art experimental methods for the study of protein-protein interactions yield more candidate interactions than may be expected biologically owing to biases and limitations in the experimental methodology. In silico methods, which distinguish between true and false interactions, have been developed and applied successfully to reduce the number of false positive results yielded by physical interaction assays. Such methods may be grouped according to: (1) the type of data used: methods based on experiment-specific measurements (e.g., spectral counts or identification scores) versus methods that extract knowledge encoded in external annotations (e.g., public interaction and functional categorisation databases); (2) the type of algorithm applied: the statistical description and estimation of physical protein properties versus predictive supervised machine learning or text-mining algorithms; (3) the type of protein relation evaluated: direct (binary) interaction of two proteins in a cocomplex versus probability of any functional relationship between two proteins (e.g., co-occurrence in a pathway, sub cellular compartment); and (4) initial motivation: elucidation of experimental data by evaluation versus prediction of novel protein-protein interaction, to be experimentally validated a posteriori. This work reviews several popular computational scoring methods and software platforms for protein-protein interactions evaluation according to their methodology, comparative strengths and weaknesses, data representation, accessibility, and availability. The scoring methods and platforms described include: CompPASS, SAINT, Decontaminator, MINT, IntAct, STRING, and FunCoup. References to related work are provided throughout in order to provide a concise but thorough introduction to a rapidly growing interdisciplinary field of investigation.
Collapse
Affiliation(s)
- Irina M Armean
- Cambridge Centre for Proteomics, Department of Biochemistry, University of Cambridge, CB2 1GA, UK
| | | | | |
Collapse
|
65
|
De Las Rivas J, Fontanillo C. Protein-protein interaction networks: unraveling the wiring of molecular machines within the cell. Brief Funct Genomics 2012; 11:489-96. [PMID: 22908212 DOI: 10.1093/bfgp/els036] [Citation(s) in RCA: 57] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023] Open
Abstract
Mapping and understanding of the protein interaction networks with their key modules and hubs can provide deeper insights into the molecular machinery underlying complex phenotypes. In this article, we present the basic characteristics and definitions of protein networks, starting with a distinction of the different types of associations between proteins. We focus the review on protein-protein interactions (PPIs), a subset of associations defined as physical contacts between proteins that occur by selective molecular docking in a particular biological context. We present such definition as opposed to other types of protein associations derived from regulatory, genetic, structural or functional relations. To determine PPIs, a variety of binary and co-complex methods exist; however, not all the technologies provide the same information and data quality. A way of increasing confidence in a given protein interaction is to integrate orthogonal experimental evidences. The use of several complementary methods testing each single interaction assesses the accuracy of PPI data and tries to minimize the occurrence of false interactions. Following this approach there have been important efforts to unify primary databases of experimentally proven PPIs into integrated databases. These meta-databases provide a measure of the confidence of interactions based on the number of experimental proofs that report them. As a conclusion, we can state that integrated information allows the building of more reliable interaction networks. Identification of communities, cliques, modules and hubs by analysing the topological parameters and graph properties of the protein networks allows the discovery of central/critical nodes, which are candidates to regulate cellular flux and dynamics.
Collapse
Affiliation(s)
- Javier De Las Rivas
- Bioinformatics and Functional Genomics Research Group, Cancer Research Center (IBMCC, CSIC/USAL), Salamanca, Spain.
| | | |
Collapse
|
66
|
Kean MJ, Couzens AL, Gingras AC. Mass spectrometry approaches to study mammalian kinase and phosphatase associated proteins. Methods 2012; 57:400-8. [PMID: 22710030 DOI: 10.1016/j.ymeth.2012.06.002] [Citation(s) in RCA: 56] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2011] [Revised: 06/04/2012] [Accepted: 06/08/2012] [Indexed: 12/24/2022] Open
Abstract
Reversible phosphorylation events regulate critical aspects of cellular biology by affecting protein conformation, cellular localization, enzymatic activity and associations with interaction partners. Kinases and phosphatases interact not only with their substrates but also with regulatory subunits and other proteins, including scaffolds. In recent years, affinity purification coupled to mass spectrometry (AP-MS) has proven to be a powerful tool to identify protein-protein interactions (PPIs) involving kinases and phosphatases. In this review we outline general considerations for successful AP-MS, and describe strategies that we have used to characterize the interactions of kinases and phosphatases in human cells.
Collapse
Affiliation(s)
- Michelle J Kean
- Samuel Lunenfeld Research Institute at Mount Sinai Hospital, 600 University Ave., Rm 992, Toronto, ON, Canada M5G 1X5
| | | | | |
Collapse
|