1
|
Prokai L, Zaman K, Prokai-Tatrai K. Mass spectrometry-based retina proteomics. MASS SPECTROMETRY REVIEWS 2023; 42:1032-1062. [PMID: 35670041 PMCID: PMC9730434 DOI: 10.1002/mas.21786] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/28/2022] [Revised: 04/28/2022] [Accepted: 04/29/2022] [Indexed: 06/15/2023]
Abstract
A subfield of neuroproteomics, retina proteomics has experienced a transformative growth since its inception due to methodological advances in enabling chemical, biochemical, and molecular biology techniques. This review focuses on mass spectrometry's contributions to facilitate mammalian and avian retina proteomics to catalog and quantify retinal protein expressions, determine their posttranslational modifications, as well as its applications to study the proteome of the retina in the context of biology, health and diseases, and therapy developments.
Collapse
Affiliation(s)
- Laszlo Prokai
- Department of Pharmacology and Neuroscience, University of North Texas Health Science Center, Fort Worth, Texas, USA
| | - Khadiza Zaman
- Department of Pharmacology and Neuroscience, University of North Texas Health Science Center, Fort Worth, Texas, USA
| | - Katalin Prokai-Tatrai
- Department of Pharmacology and Neuroscience, University of North Texas Health Science Center, Fort Worth, Texas, USA
| |
Collapse
|
2
|
Chase Huizar C, Raphael I, Forsthuber TG. Genomic, proteomic, and systems biology approaches in biomarker discovery for multiple sclerosis. Cell Immunol 2020; 358:104219. [PMID: 33039896 PMCID: PMC7927152 DOI: 10.1016/j.cellimm.2020.104219] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2020] [Revised: 09/13/2020] [Accepted: 09/16/2020] [Indexed: 12/12/2022]
Abstract
Multiple sclerosis (MS) is a neuroinflammatory disorder characterized by autoimmune-mediated inflammatory lesions in CNS leading to myelin damage and axonal loss. MS is a heterogenous disease with variable and unpredictable disease course. Due to its complex nature, MS is difficult to diagnose and responses to specific treatments may vary between individuals. Therefore, there is an indisputable need for biomarkers for early diagnosis, prediction of disease exacerbations, monitoring the progression of disease, and for measuring responses to therapy. Genomic and proteomic studies have sought to understand the molecular basis of MS and find biomarker candidates. Advances in next-generation sequencing and mass-spectrometry techniques have yielded an unprecedented amount of genomic and proteomic data; yet, translation of the results into the clinic has been underwhelming. This has prompted the development of novel data science techniques for exploring these large datasets to identify biologically relevant relationships and ultimately point towards useful biomarkers. Herein we discuss optimization of omics study designs, advances in the generation of omics data, and systems biology approaches aimed at improving biomarker discovery and translation to the clinic for MS.
Collapse
Affiliation(s)
- Carol Chase Huizar
- Department of Biology, University of Texas at San Antonio, San Antonio, TX, USA
| | - Itay Raphael
- Department of Neurological Surgery, University of Pittsburgh, UPMC Children's Hospital, Pittsburgh, PA, USA.
| | - Thomas G Forsthuber
- Department of Biology, University of Texas at San Antonio, San Antonio, TX, USA.
| |
Collapse
|
3
|
Seneviratne CJ, Suriyanarayanan T, Widyarman AS, Lee LS, Lau M, Ching J, Delaney C, Ramage G. Multi-omics tools for studying microbial biofilms: current perspectives and future directions. Crit Rev Microbiol 2020; 46:759-778. [PMID: 33030973 DOI: 10.1080/1040841x.2020.1828817] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
The advent of omics technologies has greatly improved our understanding of microbial biology, particularly in the last two decades. The field of microbial biofilms is, however, relatively new, consolidated in the 1980s. The morphogenic switching by microbes from planktonic to biofilm phenotype confers numerous survival advantages such as resistance to desiccation, antibiotics, biocides, ultraviolet radiation, and host immune responses, thereby complicating treatment strategies for pathogenic microorganisms. Hence, understanding the mechanisms governing the biofilm phenotype can result in efficient treatment strategies directed specifically against molecular markers mediating this process. The application of omics technologies for studying microbial biofilms is relatively less explored and holds great promise in furthering our understanding of biofilm biology. In this review, we provide an overview of the application of omics tools such as transcriptomics, proteomics, and metabolomics as well as multi-omics approaches for studying microbial biofilms in the current literature. We also highlight how the use of omics tools directed at various stages of the biological information flow, from genes to metabolites, can be integrated via multi-omics platforms to provide a holistic view of biofilm biology. Following this, we propose a future artificial intelligence-based multi-omics platform that can predict the pathways associated with different biofilm phenotypes.
Collapse
Affiliation(s)
- Chaminda J Seneviratne
- Singapore Oral Microbiomics Initiative (SOMI), National Dental Research Institute Singapore, National Dental Centre, Singapore, Singapore.,Duke NUS Medical School, Singapore, Singapore
| | - Tanujaa Suriyanarayanan
- Singapore Oral Microbiomics Initiative (SOMI), National Dental Research Institute Singapore, National Dental Centre, Singapore, Singapore.,Duke NUS Medical School, Singapore, Singapore
| | - Armelia Sari Widyarman
- Department of Microbiology, Faculty of Dentistry, Trisakti University, Grogol, West Jakarta, Indonesia
| | - Lye Siang Lee
- Duke-NUS Medical School, Metabolomics Lab, Cardiovascular and Metabolic Disorders, Singapore, Singapore
| | - Matthew Lau
- Singapore Oral Microbiomics Initiative (SOMI), National Dental Research Institute Singapore, National Dental Centre, Singapore, Singapore
| | - Jianhong Ching
- Duke-NUS Medical School, Metabolomics Lab, Cardiovascular and Metabolic Disorders, Singapore, Singapore
| | - Christopher Delaney
- School of Medicine, Dentistry & Nursing, Glasgow Dental Hospital & School, University of Glasgow, Glasgow, UK
| | - Gordon Ramage
- School of Medicine, Dentistry & Nursing, Glasgow Dental Hospital & School, University of Glasgow, Glasgow, UK
| |
Collapse
|
4
|
Aguilan JT, Kulej K, Sidoli S. Guide for protein fold change and p-value calculation for non-experts in proteomics. Mol Omics 2020; 16:573-582. [PMID: 32968743 DOI: 10.1039/d0mo00087f] [Citation(s) in RCA: 64] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Proteomics studies generate tables with thousands of entries. A significant component of being a proteomics scientist is the ability to process these tables to identify regulated proteins. Many bioinformatics tools are freely available for the community, some of which within reach for scientists with limited or no background in programming and statistics. However, proteomics has become popular in most other biological and biomedical disciplines, resulting in more and more studies where data processing is delegated to specialists that are not lead authors of the scientific project. This creates a risk or at least a limiting factor, as the biological interpretation of a dataset is contingent of a third-party specialist transforming data without the input of the project leader. We acknowledge in advance that dedicated scripts and software have a higher level of sophistication; but we hereby claim that the approach we describe makes proteomics data processing immediately accessible to every scientist. In this paper, we describe key steps of the typical data transformation, normalization and statistics in proteomics data analysis using a simple spreadsheet. This manuscript aims to demonstrate to those who are not familiar with the math and statistics behind these workflows that a proteomics dataset can be processed, simplified and interpreted in software like Microsoft Excel. With this, we aim to reach the community of non-specialists in proteomics to find a common language and illustrate the basic steps of -omics data processing.
Collapse
Affiliation(s)
- Jennifer T Aguilan
- Laboratory for Macromolecular Analysis and Proteomics Facility, Albert Einstein College of Medicine, NY 10461, USA.
| | | | | |
Collapse
|
5
|
Brunoro GVF, Carvalho PC, Barbosa VC, Pagnoncelli D, De Moura Gallo CV, Perales J, Zahedi RP, Valente RH, Neves-Ferreira AGDC. Differential proteomic comparison of breast cancer secretome using a quantitative paired analysis workflow. BMC Cancer 2019; 19:365. [PMID: 30999875 PMCID: PMC6474050 DOI: 10.1186/s12885-019-5547-y] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2018] [Accepted: 03/28/2019] [Indexed: 12/22/2022] Open
Abstract
Background Worldwide, breast cancer is the main cause of cancer mortality in women. Most cases originate in mammary ductal cells that produce the nipple aspirate fluid (NAF). In cancer patients, this secretome contains proteins associated with the tumor microenvironment. NAF studies are challenging because of inter-individual variability. We introduced a paired-proteomic shotgun strategy that relies on NAF analysis from both breasts of patients with unilateral breast cancer and extended PatternLab for Proteomics software to take advantage of this setup. Methods The software is based on a peptide-centric approach and uses the binomial distribution to attribute a probability for each peptide as being linked to the disease; these probabilities are propagated to a final protein p-value according to the Stouffer’s Z-score method. Results A total of 1227 proteins were identified and quantified, of which 87 were differentially abundant, being mainly involved in glycolysis (Warburg effect) and immune system activation (activated stroma). Additionally, in the estrogen receptor-positive subgroup, proteins related to the regulation of insulin-like growth factor transport and platelet degranulation displayed higher abundance, confirming the presence of a proliferative microenvironment. Conclusions We debuted a differential bioinformatics workflow for the proteomic analysis of NAF, validating this secretome as a treasure-trove for studying a paired-organ cancer type. Electronic supplementary material The online version of this article (10.1186/s12885-019-5547-y) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Giselle Villa Flor Brunoro
- Laboratory of Toxinology, Oswaldo Cruz Institute, Fiocruz, Av. Brasil 4365, Manguinhos, Rio de Janeiro, 21040-360, Brazil
| | - Paulo Costa Carvalho
- Laboratory for Proteomics and Protein Engineering, Carlos Chagas Institute, Fiocruz, Rua Prof. Algacyr Munhoz Mader 3775, CIC, Paraná, 81350-010, Brazil
| | - Valmir C Barbosa
- Systems Engineering and Computer Science Program, Federal University of Rio de Janeiro, Caixa Postal 68511, Ilha do Fundão, Rio de Janeiro, 21941-972, Brazil
| | - Dante Pagnoncelli
- Laboratory of Applied Molecular Biology, Gynecology Department, Fernandes Figueira Institute, Fiocruz, Av. Rui Barbosa 716, Flamengo, Rio de Janeiro, 22250-020, Brazil
| | - Claudia Vitória De Moura Gallo
- Laboratory of Molecular Biology of Tumors, Department of Genetics, State University of Rio de Janeiro, Rua São Francisco Xavier 524, Maracanã, Rio de Janeiro, 20550-900, Brazil
| | - Jonas Perales
- Laboratory of Toxinology, Oswaldo Cruz Institute, Fiocruz, Av. Brasil 4365, Manguinhos, Rio de Janeiro, 21040-360, Brazil
| | - René Peiman Zahedi
- Leibniz-Institut für Analytische Wissenschaften-ISAS-e.V, Otto-Hahn-Straße 6b, 44227, Dortmund, Germany.,Segal Cancer Proteomics Centre, Lady Davis Institute at the Jewish General Hospital, McGill University, 3755 Chemin de la Côte-Sainte-Catherine, Montréal, H3T 1E2, Canada
| | - Richard Hemmi Valente
- Laboratory of Toxinology, Oswaldo Cruz Institute, Fiocruz, Av. Brasil 4365, Manguinhos, Rio de Janeiro, 21040-360, Brazil
| | | |
Collapse
|
6
|
Peffers MJ, Smagul A, Anderson JR. Proteomic analysis of synovial fluid: current and potential uses to improve clinical outcomes. Expert Rev Proteomics 2019; 16:287-302. [PMID: 30793992 DOI: 10.1080/14789450.2019.1578214] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
INTRODUCTION Synovial fluid (SF) is in close proximity to tissues which are primarily altered during articular disease and has significant potential to better understand the underlying disease pathogeneses of articular pathologies and biomarker discovery. Although development of mass spectrometry-based methods has allowed faster and higher sensitivity techniques, interrogation of the SF proteome has been hindered by its large protein concentration dynamic range, impeding quantification of lower abundant proteins. Areas covered: Recent advances have developed methodologies to reduce the large protein concentration dynamic range of SF and subsequently allow deeper exploration of the SF proteome. This review concentrates on methods to overcome biofluid complexity, mass spectrometry proteomics methodologies, extracellular vesicles proteomics and the application of advances within the field in clinical disease, including osteoarthritis, rheumatoid arthritis, spondyloarthritis and juvenile arthritis. A narrative review was conducted with articles searched using PubMed, 1991-2018. Expert opinion: The SF proteomics field faces various challenges, including the requirement for rigorous and standardised methods of sample collection/storage, the sensitivity and specificity of proteomic assays, techniques to combat the large protein concentration dynamic range and comprehensive data analysis to reduce falsely identified markers. Additionally, there are challenges in developing multi 'omic' integration techniques, with computational integration enhancing analysis.
Collapse
Affiliation(s)
- Mandy Jayne Peffers
- a Comparative Musculoskeletal Biology, Institute of Ageing and Chronic Disease , University of Liverpool , Liverpool , UK
| | - Aibek Smagul
- a Comparative Musculoskeletal Biology, Institute of Ageing and Chronic Disease , University of Liverpool , Liverpool , UK
| | - James Ross Anderson
- a Comparative Musculoskeletal Biology, Institute of Ageing and Chronic Disease , University of Liverpool , Liverpool , UK
| |
Collapse
|
7
|
Pandeswari PB, Sabareesh V. Middle-down approach: a choice to sequence and characterize proteins/proteomes by mass spectrometry. RSC Adv 2018; 9:313-344. [PMID: 35521579 PMCID: PMC9059502 DOI: 10.1039/c8ra07200k] [Citation(s) in RCA: 40] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2018] [Accepted: 12/11/2018] [Indexed: 12/27/2022] Open
Abstract
Owing to rapid growth in the elucidation of genome sequences of various organisms, deducing proteome sequences has become imperative, in order to have an improved understanding of biological processes. Since the traditional Edman method was unsuitable for high-throughput sequencing and also for N-terminus modified proteins, mass spectrometry (MS) based methods, mainly based on soft ionization modes: electrospray ionization and matrix-assisted laser desorption/ionization, began to gain significance. MS based methods were adaptable for high-throughput studies and applicable for sequencing N-terminus blocked proteins/peptides too. Consequently, over the last decade a new discipline called 'proteomics' has emerged, which encompasses the attributes necessary for high-throughput identification of proteins. 'Proteomics' may also be regarded as an offshoot of the classic field, 'biochemistry'. Many protein sequencing and proteomic investigations were successfully accomplished through MS dependent sequence elucidation of 'short proteolytic peptides (typically: 7-20 amino acid residues), which is called the 'shotgun' or 'bottom-up (BU)' approach. While the BU approach continues as a workhorse for proteomics/protein sequencing, attempts to sequence intact proteins without proteolysis, called the 'top-down (TD)' approach started, due to ambiguities in the BU approach, e.g., protein inference problem, identification of proteoforms and the discovery of posttranslational modifications (PTMs). The high-throughput TD approach (TD proteomics) is yet in its infancy. Nevertheless, TD characterization of purified intact proteins has been useful for detecting PTMs. With the hope to overcome the pitfalls of BU and TD strategies, another concept called the 'middle-down (MD)' approach was put forward. Similar to BU, the MD approach also involves proteolysis, but in a restricted manner, to produce 'longer' proteolytic peptides than the ones usually obtained in BU studies, thereby providing better sequence coverage. In this regard, special proteases (OmpT, Sap9, IdeS) have been used, which can cleave proteins to produce longer proteolytic peptides. By reviewing ample evidences currently existing in the literature that is predominantly on PTM characterization of histones and antibodies, herein we highlight salient features of the MD approach. Consequently, we are inclined to claim that the MD concept might have widespread applications in future for various research areas, such as clinical, biopharmaceuticals (including PTM analysis) and even for general/routine characterization of proteins including therapeutic proteins, but not just limited to analysis of histones or antibodies.
Collapse
Affiliation(s)
- P Boomathi Pandeswari
- Advanced Centre for Bio Separation Technology (CBST), Vellore Institute of Technology (VIT) Vellore Tamil Nadu 632014 India
| | - Varatharajan Sabareesh
- Advanced Centre for Bio Separation Technology (CBST), Vellore Institute of Technology (VIT) Vellore Tamil Nadu 632014 India
| |
Collapse
|
8
|
Bittremieux W, Tabb DL, Impens F, Staes A, Timmerman E, Martens L, Laukens K. Quality control in mass spectrometry-based proteomics. MASS SPECTROMETRY REVIEWS 2018; 37:697-711. [PMID: 28802010 DOI: 10.1002/mas.21544] [Citation(s) in RCA: 74] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/04/2017] [Revised: 07/24/2017] [Accepted: 07/24/2017] [Indexed: 05/21/2023]
Abstract
Mass spectrometry is a highly complex analytical technique and mass spectrometry-based proteomics experiments can be subject to a large variability, which forms an obstacle to obtaining accurate and reproducible results. Therefore, a comprehensive and systematic approach to quality control is an essential requirement to inspire confidence in the generated results. A typical mass spectrometry experiment consists of multiple different phases including the sample preparation, liquid chromatography, mass spectrometry, and bioinformatics stages. We review potential sources of variability that can impact the results of a mass spectrometry experiment occurring in all of these steps, and we discuss how to monitor and remedy the negative influences on the experimental results. Furthermore, we describe how specialized quality control samples of varying sample complexity can be incorporated into the experimental workflow and how they can be used to rigorously assess detailed aspects of the instrument performance.
Collapse
Affiliation(s)
- Wout Bittremieux
- Department of Mathematics and Computer Science, University of Antwerp, Antwerp, Belgium
- Biomedical Informatics Research Center Antwerp (Biomina), University of Antwerp/Antwerp University Hospital, Edegem, Belgium
| | - David L Tabb
- Division of Molecular Biology and Human Genetics, Stellenbosch University Faculty of Medicine and Health Sciences, Tygerberg Hospital, Cape Town, South Africa
| | - Francis Impens
- VIB Proteomics Core, Ghent, Belgium
- VIB-UGent Center for Medical Biotechnology, Ghent, Belgium
- Faculty of Medicine and Health Sciences, Department of Biochemistry, Ghent University, Ghent, Belgium
| | - An Staes
- VIB Proteomics Core, Ghent, Belgium
- VIB-UGent Center for Medical Biotechnology, Ghent, Belgium
- Faculty of Medicine and Health Sciences, Department of Biochemistry, Ghent University, Ghent, Belgium
| | - Evy Timmerman
- VIB Proteomics Core, Ghent, Belgium
- VIB-UGent Center for Medical Biotechnology, Ghent, Belgium
- Faculty of Medicine and Health Sciences, Department of Biochemistry, Ghent University, Ghent, Belgium
| | - Lennart Martens
- VIB-UGent Center for Medical Biotechnology, Ghent, Belgium
- Faculty of Medicine and Health Sciences, Department of Biochemistry, Ghent University, Ghent, Belgium
- Bioinformatics Institute Ghent, Ghent University, Zwijnaarde, Belgium
| | - Kris Laukens
- Department of Mathematics and Computer Science, University of Antwerp, Antwerp, Belgium
- Biomedical Informatics Research Center Antwerp (Biomina), University of Antwerp/Antwerp University Hospital, Edegem, Belgium
| |
Collapse
|
9
|
Identification of AHL- and BDSF-Controlled Proteins in Burkholderia cenocepacia by Proteomics. Methods Mol Biol 2018; 1673:193-202. [PMID: 29130174 DOI: 10.1007/978-1-4939-7309-5_15] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
We used comparative proteome analysis to determine the target genes of the two quorum sensing (QS) circuits in the opportunistic pathogen Burkholderia cenocepacia: the N-acyl homoserine lactone (AHL)-based CepIR system and the BDSF (B urkholderia diffusible signal factor, cis-2-dodecenoic acid)-based RpfFR system. In this book chapter, we focus on the description of the practical procedure we currently use in the laboratory to perform a sensitive GeLC-MS/MS shotgun proteomics experiment; we also briefly describe the downstream bioinformatic data analysis.
Collapse
|
10
|
Heyer R, Schallert K, Zoun R, Becher B, Saake G, Benndorf D. Challenges and perspectives of metaproteomic data analysis. J Biotechnol 2017; 261:24-36. [PMID: 28663049 DOI: 10.1016/j.jbiotec.2017.06.1201] [Citation(s) in RCA: 118] [Impact Index Per Article: 16.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2017] [Revised: 06/20/2017] [Accepted: 06/23/2017] [Indexed: 02/07/2023]
Abstract
In nature microorganisms live in complex microbial communities. Comprehensive taxonomic and functional knowledge about microbial communities supports medical and technical application such as fecal diagnostics as well as operation of biogas plants or waste water treatment plants. Furthermore, microbial communities are crucial for the global carbon and nitrogen cycle in soil and in the ocean. Among the methods available for investigation of microbial communities, metaproteomics can approximate the activity of microorganisms by investigating the protein content of a sample. Although metaproteomics is a very powerful method, issues within the bioinformatic evaluation impede its success. In particular, construction of databases for protein identification, grouping of redundant proteins as well as taxonomic and functional annotation pose big challenges. Furthermore, growing amounts of data within a metaproteomics study require dedicated algorithms and software. This review summarizes recent metaproteomics software and addresses the introduced issues in detail.
Collapse
Affiliation(s)
- Robert Heyer
- Otto von Guericke University, Bioprocess Engineering, Universitätsplatz 2, 39106 Magdeburg, Germany.
| | - Kay Schallert
- Otto von Guericke University, Bioprocess Engineering, Universitätsplatz 2, 39106 Magdeburg, Germany.
| | - Roman Zoun
- Otto von Guericke University, Institute for Technical and Business Information Systems, Universitätsplatz 2, 39106 Magdeburg, Germany.
| | - Beatrice Becher
- Otto von Guericke University, Bioprocess Engineering, Universitätsplatz 2, 39106 Magdeburg, Germany.
| | - Gunter Saake
- Otto von Guericke University, Institute for Technical and Business Information Systems, Universitätsplatz 2, 39106 Magdeburg, Germany.
| | - Dirk Benndorf
- Otto von Guericke University, Bioprocess Engineering, Universitätsplatz 2, 39106 Magdeburg, Germany; Max Planck Institute for Dynamics of Complex Technical Systems, Bioprocess Engineering, Sandtorstraße 1, 39106, Magdeburg, Germany.
| |
Collapse
|
11
|
Abstract
Because proteomics experiments are so complex they can readily fail, and do so without clear cause. Using standard experimental design techniques and incorporating quality control can greatly increase the chances of success. This chapter introduces the relevant concepts and provides examples specific to proteomic workflows. Applying these notions to design successful proteomics experiments is straightforward. It can help identify failure causes and greatly increase the likelihood of inter-laboratory reproducibility.
Collapse
Affiliation(s)
- Daniel Ruderman
- Lawrence J. Ellison Institute for Transformative Medicine of USC, Keck School of Medicine of USC, 2250 Alcazar St. CSC-240, Los Angeles, CA, 90033, USA.
| |
Collapse
|
12
|
Zhang C, Liu Y. Retrieving Quantitative Information of Histone PTMs by Mass Spectrometry. Methods Enzymol 2016; 586:165-191. [PMID: 28137562 DOI: 10.1016/bs.mie.2016.10.017] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Abstract
Posttranslational modifications (PTMs) of histones are one of the main research interests in the rapidly growing field of epigenetics. Accurate and precise quantification of these highly complex histone PTMs is critical for understanding the histone code and the biological significance behind it. It nonetheless remains a major analytical challenge. Mass spectrometry (MS) has been proven as a robust tool in retrieving quantitative information of histone PTMs, and a variety of MS-based quantitative strategies have been successfully developed and employed in basic research as well as clinical studies. In this chapter, we provide an overview for quantitative analysis of histone PTMs, often highly flexible and case dependent, as a primer for future experimental designs.
Collapse
Affiliation(s)
- C Zhang
- Baylor College of Medicine, Houston, TX, United States.
| | - Y Liu
- University of Michigan, Ann Arbor, MI, United States.
| |
Collapse
|
13
|
Development of a 45kpsi ultrahigh pressure liquid chromatography instrument for gradient separations of peptides using long microcapillary columns and sub-2μm particles. J Chromatogr A 2016; 1469:60-67. [PMID: 27702615 DOI: 10.1016/j.chroma.2016.09.053] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2016] [Revised: 09/22/2016] [Accepted: 09/23/2016] [Indexed: 11/22/2022]
Abstract
Commercial chromatographic instrumentation for bottom-up proteomics is often inadequate to resolve the number of peptides in many samples. This has inspired a number of complex approaches to increase peak capacity, including various multidimensional approaches, and reliance on advancements in mass spectrometry. One-dimensional reversed phase separations are limited by the pressure capabilities of commercial instruments and prevent the realization of greater separation power in terms of speed and resolution inherent to smaller sorbents and ultrahigh pressure liquid chromatography. Many applications with complex samples could benefit from the increased separation performance of long capillary columns packed with sub-2μm sorbents. Here, we introduce a system that operates at a constant pressure and is capable of separations at pressures up to 45kpsi. The system consists of a commercially available capillary liquid chromatography instrument, for sample management and gradient creation, and is modified with a storage loop and isolated pneumatic amplifier pump for elevated separation pressure. The system's performance is assessed with a complex peptide mixture and a range of microcapillary columns packed with sub-2μm C18 particles.
Collapse
|
14
|
Huang JH, Berkovitch SS, Iaconelli J, Watmuff B, Park H, Chattopadhyay S, McPhie D, Öngür D, Cohen BM, Clish CB, Karmacharya R. Perturbational Profiling of Metabolites in Patient Fibroblasts Implicates α-Aminoadipate as a Potential Biomarker for Bipolar Disorder. MOLECULAR NEUROPSYCHIATRY 2016; 2:97-106. [PMID: 27606323 DOI: 10.1159/000446654] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/28/2015] [Accepted: 05/04/2016] [Indexed: 12/27/2022]
Abstract
Many studies suggest the presence of aberrations in cellular metabolism in bipolar disorder. We studied the metabolome in bipolar disorder to gain insight into cellular pathways that may be dysregulated in bipolar disorder and to discover evidence of novel biomarkers. We measured polar and nonpolar metabolites in fibroblasts from subjects with bipolar I disorder and matched healthy control subjects, under normal conditions and with two physiologic perturbations: low-glucose media and exposure to the stress-mediating hormone dexamethasone. Metabolites that were significantly different between bipolar and control subjects showed distinct separation by principal components analysis methods. The most statistically significant findings were observed in the perturbation experiments. The metabolite with the lowest p value in both the low-glucose and dexamethasone experiments was α-aminoadipate, whose intracellular level was consistently lower in bipolar subjects. Our study implicates α-aminoadipate as a possible biomarker in bipolar disorder that manifests under cellular stress. This is an intriguing finding given the known role of α-aminoadipate in the modulation of kynurenic acid in the brain, especially as abnormal kynurenic acid levels have been implicated in bipolar disorder.
Collapse
Affiliation(s)
- Joanne H Huang
- Center for Experimental Drugs and Diagnostics, Psychiatric and Neurodevelopmental Genetics Unit, Center for Human Genetic Research, Harvard Medical School and Massachusetts General Hospital, Boston, Mass., USA; Chemical Biology Program, Broad Institute of Harvard and MIT, Mass., USA
| | - Shaunna S Berkovitch
- Center for Experimental Drugs and Diagnostics, Psychiatric and Neurodevelopmental Genetics Unit, Center for Human Genetic Research, Harvard Medical School and Massachusetts General Hospital, Boston, Mass., USA; Chemical Biology Program, Broad Institute of Harvard and MIT, Mass., USA
| | - Jonathan Iaconelli
- Center for Experimental Drugs and Diagnostics, Psychiatric and Neurodevelopmental Genetics Unit, Center for Human Genetic Research, Harvard Medical School and Massachusetts General Hospital, Boston, Mass., USA; Chemical Biology Program, Broad Institute of Harvard and MIT, Mass., USA
| | - Bradley Watmuff
- Center for Experimental Drugs and Diagnostics, Psychiatric and Neurodevelopmental Genetics Unit, Center for Human Genetic Research, Harvard Medical School and Massachusetts General Hospital, Boston, Mass., USA; Chemical Biology Program, Broad Institute of Harvard and MIT, Mass., USA
| | - Hyoungjun Park
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, Mass., USA
| | - Shrikanta Chattopadhyay
- MGH Cancer Center, Boston, Mass., USA; Chemical Biology Program, Broad Institute of Harvard and MIT, Mass., USA
| | - Donna McPhie
- Schizophrenia and Bipolar Disorder Program, Harvard Medical School and McLean Hospital, Belmont, Mass., USA
| | - Dost Öngür
- Schizophrenia and Bipolar Disorder Program, Harvard Medical School and McLean Hospital, Belmont, Mass., USA
| | - Bruce M Cohen
- Schizophrenia and Bipolar Disorder Program, Harvard Medical School and McLean Hospital, Belmont, Mass., USA
| | - Clary B Clish
- Chemical Biology Program, Broad Institute of Harvard and MIT, Mass., USA
| | - Rakesh Karmacharya
- Center for Experimental Drugs and Diagnostics, Psychiatric and Neurodevelopmental Genetics Unit, Center for Human Genetic Research, Harvard Medical School and Massachusetts General Hospital, Boston, Mass., USA; Chemical Biology Program, Broad Institute of Harvard and MIT, Mass., USA; Chemical Biology Program, Broad Institute of Harvard and MIT, Mass., USA
| |
Collapse
|
15
|
Hecht ES, McCord JP, Muddiman DC. A Quantitative Glycomics and Proteomics Combined Purification Strategy. J Vis Exp 2016. [PMID: 27023253 PMCID: PMC4828233 DOI: 10.3791/53735] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
There is a growing desire in the biological and clinical sciences to integrate and correlate multiple classes of biomolecules to unravel biology, define pathways, improve treatment, understand disease, and aid biomarker discovery. N-linked glycosylation is one of the most important and robust post-translational modifications on proteins and regulates critical cell functions such as signaling, adhesion, and enzymatic function. Analytical techniques to purify and analyze N-glycans have remained relatively static over the last decade. While accurate and effective, they commonly require significant expertise and resources. Though some high-throughput purification schemes have been developed, they have yet to find widespread adoption and often rely on the enrichment of glycopeptides. One promising method, developed by Thomas-Oates et al., filter aided N-glycan separation (FANGS), was qualitatively demonstrated on tissues. Herein, we adapted FANGS to plasma and coupled it to the individuality normalization when labeling with glycan hydrazide tags strategy in order to achieve accurate relative quantification by liquid chromatography mass spectrometry and enhanced electrospray ionization. Furthermore, we designed new functionality to the protocol by achieving tandem, shotgun proteomics and glycosylation site analysis on hen plasma. We showed that N-glycans purified on filter and derivatized by hydrophobic hydrazide tags were comparable in terms of abundance and class to those by solid phase extraction (SPE); the latter is considered a gold standard in the field. Importantly, the variability in the two protocols was not statistically different. Proteomic data that was collected in-line with glycomic data had the same depth compared to a standard trypsin digest. Peptide deamidation is minimized in the protocol, limiting non-specific deamidation detected at glycosylation motifs. This allowed for direct glycosylation site analysis, though the protocol can accommodate (18)O site labeling as well. Overall, we demonstrated a new in-line high-throughput, unbiased, filter based protocol for quantitative glycomics and proteomics analysis.
Collapse
Affiliation(s)
| | - James P McCord
- Department of Chemistry, North Carolina State University
| | | |
Collapse
|
16
|
Computational Methods in Mass Spectrometry-Based Proteomics. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2016; 939:63-89. [PMID: 27807744 DOI: 10.1007/978-981-10-1503-8_4] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
This chapter introduces computational methods used in mass spectrometry-based proteomics, including those for addressing the critical problems such as peptide identification and protein inference, peptide and protein quantification, characterization of posttranslational modifications (PTMs), and data-independent acquisitions (DIA). The chapter concludes with emerging applications of proteomic techniques, such as metaproteomics, glycoproteomics, and proteogenomics.
Collapse
|
17
|
Lum KK, Cristea IM. Proteomic approaches to uncovering virus-host protein interactions during the progression of viral infection. Expert Rev Proteomics 2016; 13:325-40. [PMID: 26817613 PMCID: PMC4919574 DOI: 10.1586/14789450.2016.1147353] [Citation(s) in RCA: 66] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2015] [Accepted: 01/25/2016] [Indexed: 01/10/2023]
Abstract
The integration of proteomic methods to virology has facilitated a significant breadth of biological insight into mechanisms of virus replication, antiviral host responses and viral subversion of host defenses. Throughout the course of infection, these cellular mechanisms rely heavily on the formation of temporally and spatially regulated virus-host protein-protein interactions. Reviewed here are proteomic-based approaches that have been used to characterize this dynamic virus-host interplay. Specifically discussed are the contribution of integrative mass spectrometry, antibody-based affinity purification of protein complexes, cross-linking and protein array techniques for elucidating complex networks of virus-host protein associations during infection with a diverse range of RNA and DNA viruses. The benefits and limitations of applying proteomic methods to virology are explored, and the contribution of these approaches to important biological discoveries and to inspiring new tractable avenues for the design of antiviral therapeutics is highlighted.
Collapse
Affiliation(s)
- Krystal K Lum
- Department of Molecular Biology, Princeton
University, Princeton, NJ, USA
| | - Ileana M Cristea
- Department of Molecular Biology, Princeton
University, Princeton, NJ, USA
| |
Collapse
|
18
|
Chen Y, Wang F, Xu F, Yang T. Mass Spectrometry-Based Protein Quantification. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2016; 919:255-279. [PMID: 27975224 DOI: 10.1007/978-3-319-41448-5_15] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Quantification of individual proteins and even entire proteomes is an important theme in proteomics research. Quantitative proteomics is an approach to obtain quantitative information about proteins in a sample. Compared to qualitative or semi-quantitative proteomics, this approach can provide more insight into the effects of a specific stimulus, such as a change in the expression level of a protein and its posttranslational modifications, or to a panel of proposed biomarkers in a given disease state. Proteomics methodologies, along with a variety of bioinformatics approaches, are a major tool in quantitative proteomics. As the theory and technological aspects underlying the proteomics methodologies will be extensively described in Chap. 20 , and protein identification as a prerequisite of quantification has been discussed in Chap. 17 , we will focus on the quantitative proteomics bioinformatics algorithms and software tools in this chapter. Our goal is to provide researchers and newcomers a rational framework to select suitable bioinformatics tools for data analysis, interpretation, and integration in protein quantification. Before doing so, a brief overview of quantitative proteomics is provided.
Collapse
Affiliation(s)
- Yun Chen
- School of Pharmacy, Nanjing Medical University, 818 Tian Yuan East Road, Nanjing, 211166, China.
| | - Fuqiang Wang
- School of Pharmacy, Nanjing Medical University, 818 Tian Yuan East Road, Nanjing, 211166, China
| | - Feifei Xu
- School of Pharmacy, Nanjing Medical University, 818 Tian Yuan East Road, Nanjing, 211166, China
| | - Ting Yang
- School of Pharmacy, Nanjing Medical University, 818 Tian Yuan East Road, Nanjing, 211166, China
| |
Collapse
|
19
|
Chan JCY, Zhou L, Chan ECY. The Isotope-Coded Affinity Tag Method for Quantitative Protein Profile Comparison and Relative Quantitation of Cysteine Redox Modifications. ACTA ACUST UNITED AC 2015; 82:23.2.1-23.2.19. [PMID: 26521713 DOI: 10.1002/0471140864.ps2302s82] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
The isotope-coded affinity tag (ICAT) technique has been applied to measure pairwise changes in protein expression through differential stable isotopic labeling of proteins or peptides followed by identification and quantification using a mass spectrometer. Changes in protein expression are observed when the identical peptide from each of two biological conditions is identified and a difference is detected in the measurements comparing the peptide labeled with the heavy isotope to the one with a normal isotopic distribution. This approach allows the simultaneous comparison of the expression of many proteins between two different biological states (e.g., yeast grown on galactose versus glucose, or normal versus cancer cells). Due to the cysteine-specificity of the ICAT reagents, the ICAT technique has also been applied to perform relative quantitation of cysteine redox modifications such as oxidation and nitrosylation. This unit describes both protein quantitation and profiling of cysteine redox modifications using the ICAT technique.
Collapse
Affiliation(s)
| | - Lei Zhou
- Singapore Eye Research Institute, Singapore
| | | |
Collapse
|
20
|
Schilling B, MacLean B, Held JM, Sahu AK, Rardin MJ, Sorensen DJ, Peters T, Wolfe AJ, Hunter CL, MacCoss MJ, Gibson BW. Multiplexed, Scheduled, High-Resolution Parallel Reaction Monitoring on a Full Scan QqTOF Instrument with Integrated Data-Dependent and Targeted Mass Spectrometric Workflows. Anal Chem 2015; 87:10222-9. [PMID: 26398777 PMCID: PMC5677521 DOI: 10.1021/acs.analchem.5b02983] [Citation(s) in RCA: 82] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Recent advances in commercial mass spectrometers with higher resolving power and faster scanning capabilities have expanded their functionality beyond traditional data-dependent acquisition (DDA) to targeted proteomics with higher precision and multiplexing. Using an orthogonal quadrupole time-of flight (QqTOF) LC-MS system, we investigated the feasibility of implementing large-scale targeted quantitative assays using scheduled, high resolution multiple reaction monitoring (sMRM-HR), also referred to as parallel reaction monitoring (sPRM). We assessed the selectivity and reproducibility of PRM, also referred to as parallel reaction monitoring, by measuring standard peptide concentration curves and system suitability assays. By evaluating up to 500 peptides in a single assay, the robustness and accuracy of PRM assays were compared to traditional SRM workflows on triple quadrupole instruments. The high resolution and high mass accuracy of the full scan MS/MS spectra resulted in sufficient selectivity to monitor 6-10 MS/MS fragment ions per target precursor, providing flexibility in postacquisition assay refinement and optimization. The general applicability of the sPRM workflow was assessed in complex biological samples by first targeting 532 peptide precursor ions in a yeast lysate, and then 466 peptide precursors from a previously generated candidate list of differentially expressed proteins in whole cell lysates from E. coli. Lastly, we found that sPRM assays could be rapidly and efficiently developed in Skyline from DDA libraries when acquired on the same QqTOF platform, greatly facilitating their successful implementation. These results establish a robust sPRM workflow on a QqTOF platform to rapidly transition from discovery analysis to highly multiplexed, targeted peptide quantitation.
Collapse
Affiliation(s)
- Birgit Schilling
- Buck Institute for Research on Aging, 8001 Redwood Boulevard, Novato, California 94945, United States
| | - Brendan MacLean
- Department of Genome Sciences, University of Washington School of Medicine, Foege Building S113, 3720 15th Avenue NE, Seattle, Washington 98195, United States
| | - Jason M. Held
- Departments of Medicine and Anesthesiology, Washington University School of Medicine, 660 South Euclid Avenue, St. Louis, Missouri 63110, United States
| | - Alexandria K. Sahu
- Buck Institute for Research on Aging, 8001 Redwood Boulevard, Novato, California 94945, United States
| | - Matthew J. Rardin
- Buck Institute for Research on Aging, 8001 Redwood Boulevard, Novato, California 94945, United States
| | - Dylan J. Sorensen
- Buck Institute for Research on Aging, 8001 Redwood Boulevard, Novato, California 94945, United States
| | - Theodore Peters
- Buck Institute for Research on Aging, 8001 Redwood Boulevard, Novato, California 94945, United States
| | - Alan J. Wolfe
- Department of Microbiology and Immunology, Stritch School of Medicine, Health Sciences Division, Loyola University Chicago, 2160 South First Avenue, Maywood, Illinois 60153, United States
| | | | - Michael J. MacCoss
- Department of Genome Sciences, University of Washington School of Medicine, Foege Building S113, 3720 15th Avenue NE, Seattle, Washington 98195, United States
| | - Bradford W. Gibson
- Buck Institute for Research on Aging, 8001 Redwood Boulevard, Novato, California 94945, United States
- Department of Pharmaceutical Chemistry, University of California, San Francisco, California 94143, United States
| |
Collapse
|
21
|
Fan J, Saha S, Barker G, Heesom KJ, Ghali F, Jones AR, Matthews DA, Bessant C. Galaxy Integrated Omics: Web-based Standards-Compliant Workflows for Proteomics Informed by Transcriptomics. Mol Cell Proteomics 2015; 14:3087-93. [PMID: 26269333 PMCID: PMC4638048 DOI: 10.1074/mcp.o115.048777] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2015] [Indexed: 01/13/2023] Open
Abstract
With the recent advent of RNA-seq technology the proteomics community has begun to generate sample-specific protein databases for peptide and protein identification, an approach we call proteomics informed by transcriptomics (PIT). This approach has gained a lot of interest, particularly among researchers who work with nonmodel organisms or with particularly dynamic proteomes such as those observed in developmental biology and host-pathogen studies. PIT has been shown to improve coverage of known proteins, and to reveal potential novel gene products. However, many groups are impeded in their use of PIT by the complexity of the required data analysis. Necessarily, this analysis requires complex integration of a number of different software tools from at least two different communities, and because PIT has a range of biological applications a single software pipeline is not suitable for all use cases. To overcome these problems, we have created GIO, a software system that uses the well-established Galaxy platform to make PIT analysis available to the typical bench scientist via a simple web interface. Within GIO we provide workflows for four common use cases: a standard search against a reference proteome; PIT protein identification without a reference genome; PIT protein identification using a genome guide; and PIT genome annotation. These workflows comprise individual tools that can be reconfigured and rearranged within the web interface to create new workflows to support additional use cases.
Collapse
Affiliation(s)
- Jun Fan
- From the ‡School of Biological and Chemical Sciences, Queen Mary University of London, Mile End Road, London E1 4NS, UK
| | - Shyamasree Saha
- From the ‡School of Biological and Chemical Sciences, Queen Mary University of London, Mile End Road, London E1 4NS, UK
| | - Gary Barker
- ¶School of Biological Sciences, University of Bristol, University Walk, Bristol. BS8 1TD. UK
| | - Kate J Heesom
- ‖School of Biochemistry, University of Bristol, University Walk, Bristol. BS8 1TD, UK
| | - Fawaz Ghali
- **Institute of Integrative Biology, University of Liverpool, Liverpool, L69 7ZB, UK
| | - Andrew R Jones
- **Institute of Integrative Biology, University of Liverpool, Liverpool, L69 7ZB, UK
| | - David A Matthews
- §School of Cellular and Molecular Medicine, University of Bristol, University Walk, Bristol. BS8 1TD, UK
| | - Conrad Bessant
- From the ‡School of Biological and Chemical Sciences, Queen Mary University of London, Mile End Road, London E1 4NS, UK;
| |
Collapse
|
22
|
Qi D, Zhang H, Fan J, Perkins S, Pisconti A, Simpson DM, Bessant C, Hubbard S, Jones AR. The mzqLibrary--An open source Java library supporting the HUPO-PSI quantitative proteomics standard. Proteomics 2015; 15:3152-62. [PMID: 26037908 PMCID: PMC4973685 DOI: 10.1002/pmic.201400535] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2014] [Revised: 05/02/2015] [Accepted: 05/29/2015] [Indexed: 12/02/2022]
Abstract
The mzQuantML standard has been developed by the Proteomics Standards Initiative for capturing, archiving and exchanging quantitative proteomic data, derived from mass spectrometry. It is a rich XML‐based format, capable of representing data about two‐dimensional features from LC‐MS data, and peptides, proteins or groups of proteins that have been quantified from multiple samples. In this article we report the development of an open source Java‐based library of routines for mzQuantML, called the mzqLibrary, and associated software for visualising data called the mzqViewer. The mzqLibrary contains routines for mapping (peptide) identifications on quantified features, inference of protein (group)‐level quantification values from peptide‐level values, normalisation and basic statistics for differential expression. These routines can be accessed via the command line, via a Java programming interface access or a basic graphical user interface. The mzqLibrary also contains several file format converters, including import converters (to mzQuantML) from OpenMS, Progenesis LC‐MS and MaxQuant, and exporters (from mzQuantML) to other standards or useful formats (mzTab, HTML, csv). The mzqViewer contains in‐built routines for viewing the tables of data (about features, peptides or proteins), and connects to the R statistical library for more advanced plotting options. The mzqLibrary and mzqViewer packages are available from https://code.google.com/p/mzq‐lib/.
Collapse
Affiliation(s)
- Da Qi
- Institute of Integrative Biology, University of Liverpool, Liverpool, UK
| | - Huaizhong Zhang
- The Faculty of Life Sciences, University of Manchester, Manchester, UK
| | - Jun Fan
- The School of Biological and Chemical Sciences, Queen Mary University of London, London, UK
| | - Simon Perkins
- Institute of Integrative Biology, University of Liverpool, Liverpool, UK
| | | | - Deborah M Simpson
- Institute of Integrative Biology, University of Liverpool, Liverpool, UK
| | - Conrad Bessant
- The School of Biological and Chemical Sciences, Queen Mary University of London, London, UK
| | - Simon Hubbard
- The Faculty of Life Sciences, University of Manchester, Manchester, UK
| | - Andrew R Jones
- Institute of Integrative Biology, University of Liverpool, Liverpool, UK
| |
Collapse
|
23
|
Deutsch EW, Albar JP, Binz PA, Eisenacher M, Jones AR, Mayer G, Omenn GS, Orchard S, Vizcaíno JA, Hermjakob H. Development of data representation standards by the human proteome organization proteomics standards initiative. J Am Med Inform Assoc 2015; 22:495-506. [PMID: 25726569 PMCID: PMC4457114 DOI: 10.1093/jamia/ocv001] [Citation(s) in RCA: 45] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2014] [Revised: 09/29/2014] [Accepted: 01/05/2015] [Indexed: 11/22/2022] Open
Abstract
OBJECTIVE To describe the goals of the Proteomics Standards Initiative (PSI) of the Human Proteome Organization, the methods that the PSI has employed to create data standards, the resulting output of the PSI, lessons learned from the PSI's evolution, and future directions and synergies for the group. MATERIALS AND METHODS The PSI has 5 categories of deliverables that have guided the group. These are minimum information guidelines, data formats, controlled vocabularies, resources and software tools, and dissemination activities. These deliverables are produced via the leadership and working group organization of the initiative, driven by frequent workshops and ongoing communication within the working groups. Official standards are subjected to a rigorous document process that includes several levels of peer review prior to release. RESULTS We have produced and published minimum information guidelines describing what information should be provided when making data public, either via public repositories or other means. The PSI has produced a series of standard formats covering mass spectrometer input, mass spectrometer output, results of informatics analysis (both qualitative and quantitative analyses), reports of molecular interaction data, and gel electrophoresis analyses. We have produced controlled vocabularies that ensure that concepts are uniformly annotated in the formats and engaged in extensive software development and dissemination efforts so that the standards can efficiently be used by the community.Conclusion In its first dozen years of operation, the PSI has produced many standards that have accelerated the field of proteomics by facilitating data exchange and deposition to data repositories. We look to the future to continue developing standards for new proteomics technologies and workflows and mechanisms for integration with other omics data types. Our products facilitate the translation of genomics and proteomics findings to clinical and biological phenotypes. The PSI website can be accessed at http://www.psidev.info.
Collapse
Affiliation(s)
| | - Juan Pablo Albar
- Died July 18, 2014 Proteomics Facility, Centro Nacional de Biotecnología - CSIC, Madrid, Spain ProteoRed Consortium, Spanish National Institute of Proteomics, Madrid, Spain
| | - Pierre-Alain Binz
- CHUV Centre Hospitalier Universitaire Vaudois, Lausanne, Switzerland
| | - Martin Eisenacher
- Medizinisches Proteom Center (MPC), Ruhr-Universität Bochum, Bochum, Germany
| | - Andrew R Jones
- Institute of Integrative Biology, University of Liverpool, Liverpool, UK
| | - Gerhard Mayer
- Medizinisches Proteom Center (MPC), Ruhr-Universität Bochum, Bochum, Germany
| | - Gilbert S Omenn
- Institute for Systems Biology, Seattle, USA Department of Computational Medicine & Bioinformatics, University of Michigan, Ann Arbor, USA
| | - Sandra Orchard
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, UK
| | - Juan Antonio Vizcaíno
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, UK
| | - Henning Hermjakob
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, UK
| |
Collapse
|
24
|
Oyarzun P, Ellis JJ, Gonzalez-Galarza FF, Jones AR, Middleton D, Boden M, Kobe B. A bioinformatics tool for epitope-based vaccine design that accounts for human ethnic diversity: application to emerging infectious diseases. Vaccine 2015; 33:1267-73. [PMID: 25629524 DOI: 10.1016/j.vaccine.2015.01.040] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2014] [Revised: 12/11/2014] [Accepted: 01/14/2015] [Indexed: 11/17/2022]
Abstract
BACKGROUND Peptide vaccination based on multiple T-cell epitopes can be used to target well-defined ethnic populations. Because the response to T-cell epitopes is restricted by HLA proteins, the HLA specificity of T-cell epitopes becomes a major consideration for epitope-based vaccine design. We have previously shown that CD4+ T-cell epitopes restricted by 95% of human MHC class II proteins can be predicted with high-specificity. METHODS We describe here the integration of epitope prediction with population coverage and epitope selection algorithms. The population coverage assessment makes use of the Allele Frequency Net Database. We present the computational platform Predivac-2.0 for HLA class II-restricted epitope-based vaccine design, which accounts comprehensively for human genetic diversity. RESULTS We validated the performance of the tool on the identification of promiscuous and immunodominant CD4+ T-cell epitopes from the human immunodeficiency virus (HIV) protein Gag. We further describe an application for epitope-based vaccine design in the context of emerging infectious diseases associated with Lassa, Nipah and Hendra viruses. Putative CD4+ T-cell epitopes were mapped on the surface glycoproteins of these pathogens and are good candidates to be experimentally tested, as they hold potential to provide cognate help in vaccination settings in their respective target populations. CONCLUSION Predivac-2.0 is a novel approach in epitope-based vaccine design, particularly suited to be applied to virus-related emerging infectious diseases, because the geographic distributions of the viruses are well defined and ethnic populations in need of vaccination can be determined ("ethnicity-oriented approach"). Predivac-2.0 is accessible through the website http://predivac.biosci.uq.edu.au/.
Collapse
Affiliation(s)
- Patricio Oyarzun
- School of Chemistry and Molecular Biosciences, Institute for Molecular Bioscience and Australian Infectious Diseases Research Centre, University of Queensland, Australia; Biotechnology Centre, Universidad San Sebastián, Concepción, Chile.
| | - Jonathan J Ellis
- School of Chemistry and Molecular Biosciences, Institute for Molecular Bioscience and Australian Infectious Diseases Research Centre, University of Queensland, Australia
| | | | - Andrew R Jones
- Institute of Integrative Biology, University of Liverpool, United Kingdom
| | - Derek Middleton
- Transplant Immunology Laboratory, Royal Liverpool University Hospital & School of Infection and Host Defence University of Liverpool, United Kingdom
| | - Mikael Boden
- School of Chemistry and Molecular Biosciences, Institute for Molecular Bioscience and Australian Infectious Diseases Research Centre, University of Queensland, Australia; School of Information Technology and Electrical Engineering, University of Queensland, Queensland 4072, Australia
| | - Bostjan Kobe
- School of Chemistry and Molecular Biosciences, Institute for Molecular Bioscience and Australian Infectious Diseases Research Centre, University of Queensland, Australia.
| |
Collapse
|
25
|
Spainhour JCG, Janech MG, Schwacke JH, Velez JCQ, Ramakrishnan V. The application of Gaussian mixture models for signal quantification in MALDI-TOF mass spectrometry of peptides. PLoS One 2014; 9:e111016. [PMID: 25372836 PMCID: PMC4221630 DOI: 10.1371/journal.pone.0111016] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2014] [Accepted: 09/19/2014] [Indexed: 02/07/2023] Open
Abstract
Matrix assisted laser desorption/ionization time-of-flight (MALDI-TOF) coupled with stable isotope standards (SIS) has been used to quantify native peptides. This peptide quantification by MALDI-TOF approach has difficulties quantifying samples containing peptides with ion currents in overlapping spectra. In these overlapping spectra the currents sum together, which modify the peak heights and make normal SIS estimation problematic. An approach using Gaussian mixtures based on known physical constants to model the isotopic cluster of a known compound is proposed here. The characteristics of this approach are examined for single and overlapping compounds. The approach is compared to two commonly used SIS quantification methods for single compound, namely Peak Intensity method and Riemann sum area under the curve (AUC) method. For studying the characteristics of the Gaussian mixture method, Angiotensin II, Angiotensin-2-10, and Angiotenisn-1-9 and their associated SIS peptides were used. The findings suggest, Gaussian mixture method has similar characteristics as the two methods compared for estimating the quantity of isolated isotopic clusters for single compounds. All three methods were tested using MALDI-TOF mass spectra collected for peptides of the renin-angiotensin system. The Gaussian mixture method accurately estimated the native to labeled ratio of several isolated angiotensin peptides (5.2% error in ratio estimation) with similar estimation errors to those calculated using peak intensity and Riemann sum AUC methods (5.9% and 7.7%, respectively). For overlapping angiotensin peptides, (where the other two methods are not applicable) the estimation error of the Gaussian mixture was 6.8%, which is within the acceptable range. In summary, for single compounds the Gaussian mixture method is equivalent or marginally superior compared to the existing methods of peptide quantification and is capable of quantifying overlapping (convolved) peptides within the acceptable margin of error.
Collapse
Affiliation(s)
- John Christian G. Spainhour
- Medical University of South Carolina, Department of Public Health Sciences, Charleston, South Carolina, United States of America
- * E-mail:
| | - Michael G. Janech
- Division of Nephrology, Department of Medicine, Medical University of South Carolina, Charleston, South Carolina, United States of America
| | - John H. Schwacke
- Scientific Research Corporation, North Charleston, South Carolina, United States of America
| | - Juan Carlos Q. Velez
- Division of Nephrology, Department of Medicine, Medical University of South Carolina, Charleston, South Carolina, United States of America
- Ralph H. Johnson Veterans Affairs Medical Center, Charleston, South Carolina, United States of America
| | - Viswanathan Ramakrishnan
- Medical University of South Carolina, Department of Public Health Sciences, Charleston, South Carolina, United States of America
| |
Collapse
|
26
|
MS-GF+ makes progress towards a universal database search tool for proteomics. Nat Commun 2014; 5:5277. [PMID: 25358478 PMCID: PMC5036525 DOI: 10.1038/ncomms6277] [Citation(s) in RCA: 764] [Impact Index Per Article: 76.4] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2014] [Accepted: 09/16/2014] [Indexed: 02/06/2023] Open
Abstract
Mass spectrometry (MS) instruments and experimental protocols are rapidly advancing, but the software tools to analyze tandem mass spectra are lagging behind. We present a database search tool MS-GF+ that is sensitive (it identifies more peptides than most other database search tools) and universal (it works well for diverse types of spectra, different configurations of MS instruments and different experimental protocols). We benchmark MS-GF+ using diverse spectral datasets: (i) spectra of varying fragmentation methods; (ii) spectra of multiple enzyme digests; (iii) spectra of phosphorylated peptides; (iv) spectra of peptides with unusual fragmentation propensities produced by a novel alpha-lytic protease. For all these datasets, MS-GF+ significantly increases the number of identified peptides compared to commonly used methods for peptide identifications. We emphasize that while MS-GF+ is not specifically designed for any particular experimental set-up, it improves upon the performance of tools specifically designed for these applications (e.g., specialized tools for phosphoproteomics).
Collapse
|
27
|
Otto A, Becher D, Schmidt F. Quantitative proteomics in the field of microbiology. Proteomics 2014; 14:547-65. [PMID: 24376008 DOI: 10.1002/pmic.201300403] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2013] [Revised: 11/15/2013] [Accepted: 12/06/2013] [Indexed: 12/11/2022]
Abstract
Quantitative proteomics has become an indispensable analytical tool for microbial research. Modern microbial proteomics covers a wide range of topics in basic and applied research from in vitro characterization of single organisms to unravel the physiological implications of stress/starvation to description of the proteome content of a cell at a given time. With the techniques available, ranging from classical gel-based procedures to modern MS-based quantitative techniques, including metabolic and chemical labeling, as well as label-free techniques, quantitative proteomics is today highly successful in sophisticated settings of high complexity such as host-pathogen interactions, mixed microbial communities, and microbial metaproteomics. In this review, we will focus on the vast range of techniques practically applied in current research with an introduction of the workflows used for quantitative comparisons, a description of the advantages/disadvantages of the various methods, reference to hallmark publications and presentation of applications in current microbial research.
Collapse
Affiliation(s)
- Andreas Otto
- Institute for Microbiology, Ernst Moritz Arndt University Greifswald, Germany
| | | | | |
Collapse
|
28
|
Elo LL, Karjalainen R, Ohman T, Hintsanen P, Nyman TA, Heckman CA, Aittokallio T. Statistical detection of quantitative protein biomarkers provides insights into signaling networks deregulated in acute myeloid leukemia. Proteomics 2014; 14:2443-53. [PMID: 25211154 DOI: 10.1002/pmic.201300460] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2013] [Revised: 07/31/2014] [Accepted: 09/08/2014] [Indexed: 12/12/2022]
Abstract
The increasing coverage and sensitivity of LC-MS/MS-based proteomics have expanded its applications in systems medicine. In particular, label-free quantitation approaches are enabling biomarker discovery in terms of statistical comparison of proteomic profiles across large numbers of clinical samples. However, it still remains poorly understood how much protein markers can add novel insights compared to markers derived from mRNA transcriptomic profiling. Using paired label-free LC-MS/MS and gene expression microarray measurements from primary samples of patients with acute myeloid leukemia (AML), we demonstrate here that while the quantitative proteomic and transcriptomic profiles were highly correlated, in general, the marker panels showing statistically significant expression changes across the disease and healthy groups were profoundly different between protein and mRNA levels. In particular, the proteomic assay enabled unique links to known leukemic processes, which were missed when using the transcriptomic profiling alone, as well as identified additional links to metabolic regulators and chromatin remodelers, such as GPX1, fumarate hydratase, and SET oncogene, which have subsequently been evaluated in independent AML samples. Overall, these results highlighted the complementary and informative view obtained from the quantitative LC-MS/MS approach into the AML deregulated signaling networks.
Collapse
Affiliation(s)
- Laura L Elo
- Department of Mathematics and Statistics, University of Turku, Turku, Finland; Turku Centre for Biotechnology, Turku, Finland
| | | | | | | | | | | | | |
Collapse
|
29
|
Amaya M, Baer A, Voss K, Campbell C, Mueller C, Bailey C, Kehn-Hall K, Petricoin E, Narayanan A. Proteomic strategies for the discovery of novel diagnostic and therapeutic targets for infectious diseases. Pathog Dis 2014; 71:177-89. [PMID: 24488789 PMCID: PMC7108530 DOI: 10.1111/2049-632x.12150] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2013] [Revised: 01/18/2014] [Accepted: 01/23/2014] [Indexed: 12/14/2022] Open
Abstract
Viruses have developed numerous and elegant strategies to manipulate the host cell machinery to establish a productive infectious cycle. The interaction of viral proteins with host proteins plays an important role in infection and pathogenesis, often bypassing traditional host defenses such as the interferon response and apoptosis. Host–viral protein interactions can be studied using a variety of proteomic approaches ranging from genetic and biochemical to large‐scale high‐throughput technologies. Protein interactions between host and viral proteins are greatly influenced by host signal transduction pathways. In this review, we will focus on comparing proteomic information obtained through differing technologies and how their integration can be used to determine the functional aspect of the host response to infection. We will briefly review and evaluate techniques employed to elucidate viral–host interactions with a primary focus on Protein Microarrays (PMA) and Mass Spectrometry (MS) as potential tools in the discovery of novel therapeutic targets. As many potential molecular markers and targets are proteins, proteomic profiling is expected to yield both clearer and more direct answers to functional and pharmacologic questions.
Collapse
Affiliation(s)
- Moushimi Amaya
- National Center for Biodefense and Infectious Diseases, George Mason University, Manassas, VA, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
30
|
Qi D, Krishna R, Jones AR. The jmzQuantML programming interface and validator for the mzQuantML data standard. Proteomics 2014; 14:685-8. [PMID: 24453188 PMCID: PMC4016760 DOI: 10.1002/pmic.201300281] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2013] [Revised: 11/20/2013] [Accepted: 12/20/2013] [Indexed: 11/17/2022]
Abstract
The mzQuantML standard from the HUPO Proteomics Standards Initiative has recently been released, capturing quantitative data about peptides and proteins, following analysis of MS data. We present a Java application programming interface (API) for mzQuantML called jmzQuantML. The API provides robust bridges between Java classes and elements in mzQuantML files and allows random access to any part of the file. The API provides read and write capabilities, and is designed to be embedded in other software packages, enabling mzQuantML support to be added to proteomics software tools (http://code.google.com/p/jmzquantml/). The mzQuantML standard is designed around a multilevel validation system to ensure that files are structurally and semantically correct for different proteomics quantitative techniques. In this article, we also describe a Java software tool (http://code.google.com/p/mzquantml-validator/) for validating mzQuantML files, which is a formal part of the data standard.
Collapse
Affiliation(s)
- Da Qi
- Institute of Integrative Biology, University of Liverpool, UK
| | | | | |
Collapse
|
31
|
Perez-Riverol Y, Wang R, Hermjakob H, Müller M, Vesada V, Vizcaíno JA. Open source libraries and frameworks for mass spectrometry based proteomics: a developer's perspective. BIOCHIMICA ET BIOPHYSICA ACTA 2014; 1844:63-76. [PMID: 23467006 PMCID: PMC3898926 DOI: 10.1016/j.bbapap.2013.02.032] [Citation(s) in RCA: 61] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/01/2012] [Revised: 02/05/2013] [Accepted: 02/22/2013] [Indexed: 12/23/2022]
Abstract
Data processing, management and visualization are central and critical components of a state of the art high-throughput mass spectrometry (MS)-based proteomics experiment, and are often some of the most time-consuming steps, especially for labs without much bioinformatics support. The growing interest in the field of proteomics has triggered an increase in the development of new software libraries, including freely available and open-source software. From database search analysis to post-processing of the identification results, even though the objectives of these libraries and packages can vary significantly, they usually share a number of features. Common use cases include the handling of protein and peptide sequences, the parsing of results from various proteomics search engines output files, and the visualization of MS-related information (including mass spectra and chromatograms). In this review, we provide an overview of the existing software libraries, open-source frameworks and also, we give information on some of the freely available applications which make use of them. This article is part of a Special Issue entitled: Computational Proteomics in the Post-Identification Era. Guest Editors: Martin Eisenacher and Christian Stephan.
Collapse
Affiliation(s)
- Yasset Perez-Riverol
- EMBL Outstation, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
- Department of Proteomics, Center for Genetic Engineering and Biotechnology, Ciudad de la Habana, Cuba
| | - Rui Wang
- EMBL Outstation, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Henning Hermjakob
- EMBL Outstation, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Markus Müller
- Proteome Informatics Group, Swiss Institute of Bioinformatics, CMU - 1, rue Michel Servet CH-1211 Geneva, Switzerland
| | - Vladimir Vesada
- Department of Proteomics, Center for Genetic Engineering and Biotechnology, Ciudad de la Habana, Cuba
| | - Juan Antonio Vizcaíno
- EMBL Outstation, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| |
Collapse
|
32
|
Romanova EV, Dowd SE, Sweedler JV. Quantitation of endogenous peptides using mass spectrometry based methods. Curr Opin Chem Biol 2013; 17:801-8. [PMID: 23790312 DOI: 10.1016/j.cbpa.2013.05.030] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2013] [Accepted: 05/24/2013] [Indexed: 10/26/2022]
Abstract
The mass spectrometry-based 'omics' sub-discipline that focuses on comprehensive, often exploratory, analyses of endogenous peptides involved in cell-to-cell communication is oftentimes referred to as peptidomics. Although the progress in bioanalytical technology development for peptide discovery has been tremendous, perhaps the largest advances have involved robust quantitative mass spectrometric approaches and data mining algorithms. These efforts have accelerated the discovery and validation of biomarkers, functionally important posttranslational modifications, and unexpected molecular interactions, information that aids drug development. In this article we outline the current approaches used in quantitative peptidomics and the technical challenges that stimulate new advances in the field, while also reviewing the newest literature on functional characterizations of endogenous peptides using quantitative mass spectrometry.
Collapse
Affiliation(s)
- Elena V Romanova
- Department of Chemistry and the Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | | | | |
Collapse
|
33
|
Using R and Bioconductor for proteomics data analysis. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2013; 1844:42-51. [PMID: 23692960 DOI: 10.1016/j.bbapap.2013.04.032] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/28/2012] [Revised: 04/09/2013] [Accepted: 04/30/2013] [Indexed: 10/26/2022]
Abstract
This review presents how R, the popular statistical environment and programming language, can be used in the frame of proteomics data analysis. A short introduction to R is given, with special emphasis on some of the features that make R and its add-on packages premium software for sound and reproducible data analysis. The reader is also advised on how to find relevant R software for proteomics. Several use cases are then presented, illustrating data input/output, quality control, quantitative proteomics and data analysis. Detailed code and additional links to extensive documentation are available in the freely available companion package RforProteomics. This article is part of a Special Issue entitled: Computational Proteomics in the Post-Identification Era. Guest Editors: Martin Eisenacher and Christian Stephan.
Collapse
|
34
|
Walzer M, Qi D, Mayer G, Uszkoreit J, Eisenacher M, Sachsenberg T, Gonzalez-Galarza FF, Fan J, Bessant C, Deutsch EW, Reisinger F, Vizcaíno JA, Medina-Aunon JA, Albar JP, Kohlbacher O, Jones AR. The mzQuantML data standard for mass spectrometry-based quantitative studies in proteomics. Mol Cell Proteomics 2013; 12:2332-40. [PMID: 23599424 PMCID: PMC3734589 DOI: 10.1074/mcp.o113.028506] [Citation(s) in RCA: 61] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022] Open
Abstract
The range of heterogeneous approaches available for quantifying protein abundance via mass spectrometry (MS)1 leads to considerable challenges in modeling, archiving, exchanging, or submitting experimental data sets as supplemental material to journals. To date, there has been no widely accepted format for capturing the evidence trail of how quantitative analysis has been performed by software, for transferring data between software packages, or for submitting to public databases. In the context of the Proteomics Standards Initiative, we have developed the mzQuantML data standard. The standard can represent quantitative data about regions in two-dimensional retention time versus mass/charge space (called features), peptides, and proteins and protein groups (where there is ambiguity regarding peptide-to-protein inference), and it offers limited support for small molecule (metabolomic) data. The format has structures for representing replicate MS runs, grouping of replicates (for example, as study variables), and capturing the parameters used by software packages to arrive at these values. The format has the capability to reference other standards such as mzML and mzIdentML, and thus the evidence trail for the MS workflow as a whole can now be described. Several software implementations are available, and we encourage other bioinformatics groups to use mzQuantML as an input, internal, or output format for quantitative software and for structuring local repositories. All project resources are available in the public domain from the HUPO Proteomics Standards Initiative http://www.psidev.info/mzquantml.
Collapse
Affiliation(s)
- Mathias Walzer
- Quantitative Biology Center and Department of Computer Science, Center for Bioinformatics, University of Tübingen, Sand 14, 72076 Tübingen, Germany
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
35
|
Gonzalez-Galarza FF, Qi D, Fan J, Bessant C, Jones AR. A tutorial for software development in quantitative proteomics using PSI standard formats. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2013; 1844:88-97. [PMID: 23584085 PMCID: PMC4008935 DOI: 10.1016/j.bbapap.2013.04.004] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/07/2012] [Revised: 02/22/2013] [Accepted: 04/05/2013] [Indexed: 01/21/2023]
Abstract
The Human Proteome Organisation — Proteomics Standards Initiative (HUPO-PSI) has been working for ten years on the development of standardised formats that facilitate data sharing and public database deposition. In this article, we review three HUPO-PSI data standards — mzML, mzIdentML and mzQuantML, which can be used to design a complete quantitative analysis pipeline in mass spectrometry (MS)-based proteomics. In this tutorial, we briefly describe the content of each data model, sufficient for bioinformaticians to devise proteomics software. We also provide guidance on the use of recently released application programming interfaces (APIs) developed in Java for each of these standards, which makes it straightforward to read and write files of any size. We have produced a set of example Java classes and a basic graphical user interface to demonstrate how to use the most important parts of the PSI standards, available from http://code.google.com/p/psi-standard-formats-tutorial. This article is part of a Special Issue entitled: Computational Proteomics in the Post-Identification Era. Guest Editors: Martin Eisenacher and Christian Stephan. A tutorial to help software developers use PSI standard formats. A description of programming interfaces and tools available. Code snippets and a basic graphical interface to assist understanding.
Collapse
|
36
|
Fontana S, Simona F, Saieva L, Laura S, Taverna S, Simona T, Alessandro R, Riccardo A. Contribution of proteomics to understanding the role of tumor-derived exosomes in cancer progression: state of the art and new perspectives. Proteomics 2013; 13:1581-94. [PMID: 23401131 DOI: 10.1002/pmic.201200398] [Citation(s) in RCA: 73] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2012] [Revised: 10/01/2012] [Accepted: 10/02/2012] [Indexed: 12/14/2022]
Abstract
Exosomes are nanometer-sized vesicles (40-100 nm diameter) of endocytic origin released from different cell types under both normal and pathological conditions. They function as cell free messengers, playing a relevant role in the cell-cell communication that is strongly related to the nature of the molecules (proteins, mRNAs, miRNAs, and lipids) that they transport. Tumor cells actively shed exosomes into their surrounding microenvironment and growing evidence indicates that these vesicles have pleiotropic functions in the regulation of tumor progression, promoting immune escape, tumor invasion, neovascularization, and metastasis. During the last few years remarkable efforts have been made to obtain an accurate definition of the protein content of tumor-derived exosomes (TDEs) by applying MS-based proteomic technologies. To date, TDEs proteomic studies have been mainly utilized to catalog TDEs proteins with the purpose of identifying disease biomarkers. The future challenge for improving our understanding and characterization of TDEs will be the implementation of new systems-driven and proteomic integrative strategies. The aim of this article is to provide an overview of the most characterized exosomes-mediated mechanisms that contribute to the pathogenesis of cancer and to review recent proteomics data that support the protumorigenic role of TDEs.
Collapse
Affiliation(s)
| | - Fontana Simona
- Dipartimento di Biopatologia e Biotecnologie Mediche e Forensi, Sezione di Biologia e Genetica, Università di Palermo, Palermo, Italy.
| | | | | | | | | | | | | |
Collapse
|
37
|
Medina-Aunon JA, Krishna R, Ghali F, Albar JP, Jones AJ. A guide for integration of proteomic data standards into laboratory workflows. Proteomics 2013; 13:480-92. [DOI: 10.1002/pmic.201200268] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2012] [Revised: 08/14/2012] [Accepted: 09/10/2012] [Indexed: 01/28/2023]
Affiliation(s)
| | - Ritesh Krishna
- Institute of Integrative Biology; University of Liverpool; Liverpool; UK
| | - Fawaz Ghali
- Institute of Integrative Biology; University of Liverpool; Liverpool; UK
| | - Juan P. Albar
- Centro Nacional de Biotecnología; CSIC; Madrid; Spain
| | - Andrew J. Jones
- Institute of Integrative Biology; University of Liverpool; Liverpool; UK
| |
Collapse
|
38
|
Jones AR. Bioinformatics challenges and solutions in proteomics as quantitative methods mature. OMICS : A JOURNAL OF INTEGRATIVE BIOLOGY 2012; 16:429-430. [PMID: 22804490 PMCID: PMC3698678 DOI: 10.1089/omi.2012.0051] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/01/2023]
|
39
|
Qi D, Brownridge P, Xia D, Mackay K, Gonzalez-Galarza FF, Kenyani J, Harman V, Beynon RJ, Jones AR. A software toolkit and interface for performing stable isotope labeling and top3 quantification using Progenesis LC-MS. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY 2012; 16:489-95. [PMID: 22888986 DOI: 10.1089/omi.2012.0042] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
Numerous software packages exist to provide support for quantifying peptides and proteins from mass spectrometry (MS) data. However, many support only a subset of experimental methods or instrument types, meaning that laboratories often have to use multiple software packages. The Progenesis LC-MS software package from Nonlinear Dynamics is a software solution for label-free quantitation. However, many laboratories using Progenesis also wish to employ stable isotope-based methods that are not natively supported in Progenesis. We have developed a Java programming interface that can use the output files produced by Progenesis, allowing the basic MS features quantified across replicates to be used in a range of different experimental methods. We have developed post-processing software (the Progenesis Post-Processor) to embed Progenesis in the analysis of stable isotope labeling data and top3 pseudo-absolute quantitation. We have also created export ability to the new data standard, mzQuantML, produced by the Proteomics Standards Initiative to facilitate the development and standardization process. The software is provided to users with a simple graphical user interface for accessing the different features. The underlying programming interface may also be used by Java developers to develop other routines for analyzing data produced by Progenesis.
Collapse
Affiliation(s)
- Da Qi
- Institute of Integrative Biology, University of Liverpool, Merseyside, UK.
| | | | | | | | | | | | | | | | | |
Collapse
|