1
|
König S, Schork K, Eisenacher M. Observations from the Proteomics Bench. Proteomes 2024; 12:6. [PMID: 38390966 PMCID: PMC10885119 DOI: 10.3390/proteomes12010006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2023] [Revised: 01/26/2024] [Accepted: 02/02/2024] [Indexed: 02/24/2024] Open
Abstract
Many challenges in proteomics result from the high-throughput nature of the experiments. This paper first presents pre-analytical problems, which still occur, although the call for standardization in omics has been ongoing for many years. This article also discusses aspects that affect bioinformatic analysis based on three sets of reference data measured with different orbitrap instruments. Despite continuous advances in mass spectrometer technology as well as analysis software, data-set-wise quality control is still necessary, and decoy-based estimation, although challenged by modern instruments, should be utilized. We draw attention to the fact that numerous young researchers perceive proteomics as a mature, readily applicable technology. However, it is important to emphasize that the maximum potential of the technology can only be realized by an educated handling of its limitations.
Collapse
Affiliation(s)
- Simone König
- IZKF Core Unit Proteomics, University of Münster, 48149 Münster, Germany
| | - Karin Schork
- Medizinisches Proteom-Center, Medical Faculty, Ruhr-University Bochum, 44801 Bochum, Germany
- Center for Protein Diagnostics (PRODI), Medical Proteome Analysis, Ruhr-University Bochum, 44801 Bochum, Germany
- Core Unit for Bioinformatics (CUBiMed.RUB), Medical Faculty, Ruhr-University Bochum, 44801 Bochum, Germany
| | - Martin Eisenacher
- Medizinisches Proteom-Center, Medical Faculty, Ruhr-University Bochum, 44801 Bochum, Germany
- Center for Protein Diagnostics (PRODI), Medical Proteome Analysis, Ruhr-University Bochum, 44801 Bochum, Germany
- Core Unit for Bioinformatics (CUBiMed.RUB), Medical Faculty, Ruhr-University Bochum, 44801 Bochum, Germany
| |
Collapse
|
2
|
Deutsch EW, Vizcaíno JA, Jones AR, Binz PA, Lam H, Klein J, Bittremieux W, Perez-Riverol Y, Tabb DL, Walzer M, Ricard-Blum S, Hermjakob H, Neumann S, Mak TD, Kawano S, Mendoza L, Van Den Bossche T, Gabriels R, Bandeira N, Carver J, Pullman B, Sun Z, Hoffmann N, Shofstahl J, Zhu Y, Licata L, Quaglia F, Tosatto SCE, Orchard SE. Proteomics Standards Initiative at Twenty Years: Current Activities and Future Work. J Proteome Res 2023; 22:287-301. [PMID: 36626722 PMCID: PMC9903322 DOI: 10.1021/acs.jproteome.2c00637] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2022] [Indexed: 01/11/2023]
Abstract
The Human Proteome Organization (HUPO) Proteomics Standards Initiative (PSI) has been successfully developing guidelines, data formats, and controlled vocabularies (CVs) for the proteomics community and other fields supported by mass spectrometry since its inception 20 years ago. Here we describe the general operation of the PSI, including its leadership, working groups, yearly workshops, and the document process by which proposals are thoroughly and publicly reviewed in order to be ratified as PSI standards. We briefly describe the current state of the many existing PSI standards, some of which remain the same as when originally developed, some of which have undergone subsequent revisions, and some of which have become obsolete. Then the set of proposals currently being developed are described, with an open call to the community for participation in the forging of the next generation of standards. Finally, we describe some synergies and collaborations with other organizations and look to the future in how the PSI will continue to promote the open sharing of data and thus accelerate the progress of the field of proteomics.
Collapse
Affiliation(s)
- Eric W. Deutsch
- Institute
for Systems Biology, Seattle, Washington 98109, United States
| | - Juan Antonio Vizcaíno
- European
Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Andrew R. Jones
- Institute
of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom
| | - Pierre-Alain Binz
- Clinical
Chemistry Service, Lausanne University Hospital, 1011 976 Lausanne, Switzerland
| | - Henry Lam
- Department
of Chemical and Biological Engineering, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong 999077, P. R. China.
| | - Joshua Klein
- Program for
Bioinformatics, Boston University, Boston, Massachusetts 02215, United States
| | - Wout Bittremieux
- Skaggs
School
of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, California 92093, United States
- Department
of Computer Science, University of Antwerp, 2020 Antwerpen, Belgium
| | - Yasset Perez-Riverol
- European
Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - David L. Tabb
- SA MRC
Centre for TB Research, DST/NRF Centre of Excellence for Biomedical
TB Research, Division of Molecular Biology and Human Genetics, Faculty
of Medicine and Health Sciences, Stellenbosch
University, Cape Town 7602, South Africa
| | - Mathias Walzer
- European
Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Sylvie Ricard-Blum
- Univ.
Lyon, Université Lyon 1, ICBMS, UMR 5246, 69622 Villeurbanne, France
| | - Henning Hermjakob
- European
Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Steffen Neumann
- Bioinformatics
and Scientific Data, Leibniz Institute of
Plant Biochemistry, 06120 Halle, Germany
- German
Centre for Integrative Biodiversity Research (iDiv), 04103 Halle-Jena-Leipzig, Germany
| | - Tytus D. Mak
- Mass Spectrometry
Data Center, National Institute of Standards
and Technology, 100 Bureau Drive, Gaithersburg, Maryland 20899, United
States
| | - Shin Kawano
- Database
Center for Life Science, Joint Support Center for Data Science Research, Research Organization of Information and Systems, Chiba 277-0871, Japan
- Faculty
of Contemporary Society, Toyama University
of International Studies, Toyama 930-1292, Japan
- School
of Frontier Engineering, Kitasato University, Sagamihara 252-0373, Japan
| | - Luis Mendoza
- Institute
for Systems Biology, Seattle, Washington 98109, United States
| | - Tim Van Den Bossche
- VIB-UGent
Center for Medical Biotechnology, VIB, 9052 Ghent, Belgium
- Department
of Biomolecular Medicine, Faculty of Medicine and Health Sciences, Ghent University, 9052 Ghent, Belgium
| | - Ralf Gabriels
- VIB-UGent
Center for Medical Biotechnology, VIB, 9052 Ghent, Belgium
- Department
of Biomolecular Medicine, Faculty of Medicine and Health Sciences, Ghent University, 9052 Ghent, Belgium
| | - Nuno Bandeira
- Skaggs
School
of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, California 92093, United States
- Center
for Computational Mass Spectrometry, Department of Computer Science
and Engineering, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, San Diego 92093-0404, United States
| | - Jeremy Carver
- Center
for Computational Mass Spectrometry, Department of Computer Science
and Engineering, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, San Diego 92093-0404, United States
| | - Benjamin Pullman
- Center
for Computational Mass Spectrometry, Department of Computer Science
and Engineering, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, San Diego 92093-0404, United States
| | - Zhi Sun
- Institute
for Systems Biology, Seattle, Washington 98109, United States
| | - Nils Hoffmann
- Institute
for Bio- and Geosciences (IBG-5), Forschungszentrum
Jülich GmbH, 52428 Jülich, Germany
| | - Jim Shofstahl
- Thermo
Fisher Scientific, 355 River Oaks Parkway, San Jose, California 95134, United States
| | - Yunping Zhu
- National
Center for Protein Sciences (Beijing), Beijing
Institute of Lifeomics, #38, Life Science Park, Changping District, Beijing 102206, China
| | - Luana Licata
- Fondazione
Human Technopole, 20157 Milan, Italy
- Department
of Biology, University of Rome Tor Vergata, 00133 Rome, Italy
| | - Federica Quaglia
- Institute
of Biomembranes, Bioenergetics and Molecular Biotechnologies, National Research Council (CNR-IBIOM), 70126 Bari, Italy
- Department
of Biomedical Sciences, University of Padova, 35131 Padova, Italy
| | | | - Sandra E. Orchard
- European
Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| |
Collapse
|
3
|
Tebani A, Bekri S. [The promise of omics in the precision medicine era]. Rev Med Interne 2022; 43:649-660. [PMID: 36041909 DOI: 10.1016/j.revmed.2022.07.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2022] [Accepted: 07/12/2022] [Indexed: 10/15/2022]
Abstract
The rise of omics technologies that simultaneously measure thousands of molecules in a complex biological sample represents the core of systems biology. These technologies have profoundly impacted biomarkers and therapeutic targets discovery in the precision medicine era. Systems biology aims to perform a systematic probing of complex interactions in biological systems. Powered by high-throughput omics technologies and high-performance computing, systems biology provides relevant, resolving, and multi-scale overviews from cells to populations. Precision medicine takes advantage of these conceptual and technological developments and is based on two main pillars: the generation of multimodal data and their subsequent modeling. High-throughput omics technologies enable the comprehensive and holistic extraction of biological information, while computational capabilities enable multidimensional modeling and, as a result, offer an intuitive and intelligible visualization. Despite their promise, translating these technologies into clinically actionable tools has been slow. In this contribution, we present the most recent multi-omics data generation and analysis strategies and their clinical deployment in the post-genomic era. Furthermore, medical application challenges of omics-based biomarkers are discussed.
Collapse
Affiliation(s)
- A Tebani
- UNIROUEN, Inserm U1245, Department of Metabolic Biochemistry, Normandie University, CHU Rouen, 76000 Rouen, France.
| | - S Bekri
- UNIROUEN, Inserm U1245, Department of Metabolic Biochemistry, Normandie University, CHU Rouen, 76000 Rouen, France
| |
Collapse
|
4
|
Bittremieux W, Bouyssié D, Dorfer V, Locard-Paulet M, Perez-Riverol Y, Schwämmle V, Uszkoreit J, Van Den Bossche T. The European Bioinformatics Community for Mass Spectrometry (EuBIC-MS): an open community for bioinformatics training and research. RAPID COMMUNICATIONS IN MASS SPECTROMETRY : RCM 2021:e9087. [PMID: 33861485 DOI: 10.1002/rcm.9087] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/18/2020] [Revised: 02/13/2021] [Accepted: 03/18/2021] [Indexed: 06/12/2023]
Abstract
The European Bioinformatics Community for Mass Spectrometry (EuBIC-MS; eubic-ms.org) was founded in 2014 to unite European computational mass spectrometry researchers and proteomics bioinformaticians working in academia and industry. EuBIC-MS maintains educational resources (proteomics-academy.org) and organises workshops at national and international conferences on proteomics and mass spectrometry. Furthermore, EuBIC-MS is actively involved in several community initiatives such as the Human Proteome Organization's Proteomics Standards Initiative (HUPO-PSI). Apart from these collaborations, EuBIC-MS has organised two Winter Schools and two Developers' Meetings that have contributed to the strengthening of the European mass spectrometry network and fostered international collaboration in this field, even beyond Europe. Moreover, EuBIC-MS is currently actively developing a community-driven standard dedicated to mass spectrometry data annotation (SDRF-Proteomics) that will facilitate data reuse and collaboration. This manuscript highlights what EuBIC-MS is, what it does, and what it already has achieved. A warm invitation is extended to new researchers at all career stages to join the EuBIC-MS community on its Slack channel (eubic.slack.com).
Collapse
Affiliation(s)
- Wout Bittremieux
- European Bioinformatics Community for Mass Spectrometry, Belgium
- University of California San Diego, La Jolla, CA, USA
- University of Antwerp, Antwerp, Belgium
| | - David Bouyssié
- European Bioinformatics Community for Mass Spectrometry, Belgium
- IPBS, University of Toulouse, CNRS, UPS, Toulouse, France
| | - Viktoria Dorfer
- European Bioinformatics Community for Mass Spectrometry, Belgium
- Bioinformatics Research Group, University of Applied Sciences Upper Austria, Hagenberg, Austria
| | - Marie Locard-Paulet
- European Bioinformatics Community for Mass Spectrometry, Belgium
- Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, Denmark
| | - Yasset Perez-Riverol
- European Bioinformatics Community for Mass Spectrometry, Belgium
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Veit Schwämmle
- European Bioinformatics Community for Mass Spectrometry, Belgium
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, Odense, Denmark
| | - Julian Uszkoreit
- European Bioinformatics Community for Mass Spectrometry, Belgium
- Center for Protein Diagnostics (PRODI), Medical Proteome Analysis, Ruhr University Bochum, Bochum, Germany
- Medical Faculty, Medizinisches Proteom-Center, Ruhr University Bochum, Bochum, Germany
| | - Tim Van Den Bossche
- European Bioinformatics Community for Mass Spectrometry, Belgium
- VIB-UGent Center for Medical Biotechnology, VIB, Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
| |
Collapse
|
5
|
Review of Issues and Solutions to Data Analysis Reproducibility and Data Quality in Clinical Proteomics. Methods Mol Biol 2019. [PMID: 31552637 DOI: 10.1007/978-1-4939-9744-2_15] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2023]
Abstract
In any analytical discipline, data analysis reproducibility is closely interlinked with data quality. In this book chapter focused on mass spectrometry-based proteomics approaches, we introduce how both data analysis reproducibility and data quality can influence each other and how data quality and data analysis designs can be used to increase robustness and improve reproducibility. We first introduce methods and concepts to design and maintain robust data analysis pipelines such that reproducibility can be increased in parallel. The technical aspects related to data analysis reproducibility are challenging, and current ways to increase the overall robustness are multifaceted. Software containerization and cloud infrastructures play an important part.We will also show how quality control (QC) and quality assessment (QA) approaches can be used to spot analytical issues, reduce the experimental variability, and increase confidence in the analytical results of (clinical) proteomics studies, since experimental variability plays a substantial role in analysis reproducibility. Therefore, we give an overview on existing solutions for QC/QA, including different quality metrics, and methods for longitudinal monitoring. The efficient use of both types of approaches undoubtedly provides a way to improve the experimental reliability, reproducibility, and level of consistency in proteomics analytical measurements.
Collapse
|
6
|
Kim T, Chen IR, Parker BL, Humphrey SJ, Crossett B, Cordwell SJ, Yang P, Yang JYH. QCMAP: An Interactive Web-Tool for Performance Diagnosis and Prediction of LC-MS Systems. Proteomics 2019; 19:e1900068. [PMID: 31099962 DOI: 10.1002/pmic.201900068] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2019] [Revised: 05/07/2019] [Indexed: 01/04/2023]
Abstract
The increasing role played by liquid chromatography-mass spectrometry (LC-MS)-based proteomics in biological discovery has led to a growing need for quality control (QC) on the LC-MS systems. While numerous quality control tools have been developed to track the performance of LC-MS systems based on a pre-defined set of performance factors (e.g., mass error, retention time), the precise influence and contribution of the performance factors and their generalization property to different biological samples are not as well characterized. Here, a web-based application (QCMAP) is developed for interactive diagnosis and prediction of the performance of LC-MS systems across different biological sample types. Leveraging on a standardized HeLa cell sample run as QC within a multi-user facility, predictive models are trained on a panel of commonly used performance factors to pinpoint the precise conditions to a (un)satisfactory performance in three LC-MS systems. It is demonstrated that the learned model can be applied to predict LC-MS system performance for brain samples generated from an independent study. By compiling these predictive models into our web-application, QCMAP allows users to benchmark the performance of their LC-MS systems using their own samples and identify key factors for instrument optimization. QCMAP is freely available from: http://shiny.maths.usyd.edu.au/QCMAP/.
Collapse
Affiliation(s)
- Taiyun Kim
- School of Mathematics and Statistics, University of Sydney, NSW, 2006, Australia.,Judith and David Coffey Life Lab, Charles Perkins Centre, University of Sydney, NSW, 2006, Australia
| | - Irene Rui Chen
- School of Mathematics and Statistics, University of Sydney, NSW, 2006, Australia.,Judith and David Coffey Life Lab, Charles Perkins Centre, University of Sydney, NSW, 2006, Australia
| | - Benjamin L Parker
- School of Life and Environmental Sciences, University of Sydney, NSW, 2006, Australia
| | - Sean J Humphrey
- School of Life and Environmental Sciences, University of Sydney, NSW, 2006, Australia
| | - Ben Crossett
- Sydney Mass Spectrometry, University of Sydney, NSW, 2006, Australia
| | - Stuart J Cordwell
- School of Life and Environmental Sciences, University of Sydney, NSW, 2006, Australia.,Sydney Mass Spectrometry, University of Sydney, NSW, 2006, Australia
| | - Pengyi Yang
- School of Mathematics and Statistics, University of Sydney, NSW, 2006, Australia.,Computational Systems Biology Group, Children's Medical Research Institute, Faculty of Medicine and Health, University of Sydney, Westmead, NSW, 2145, Australia
| | - Jean Yee Hwa Yang
- School of Mathematics and Statistics, University of Sydney, NSW, 2006, Australia.,Judith and David Coffey Life Lab, Charles Perkins Centre, University of Sydney, NSW, 2006, Australia
| |
Collapse
|
7
|
Andjelković U, Josić D. Mass spectrometry based proteomics as foodomics tool in research and assurance of food quality and safety. Trends Food Sci Technol 2018. [DOI: 10.1016/j.tifs.2018.04.008] [Citation(s) in RCA: 33] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
|
8
|
Willems S, Bouyssié D, Deforce D, Dorfer V, Gorshkov V, Kopczynski D, Laukens K, Locard-Paulet M, Schwämmle V, Uszkoreit J, Valkenborg D, Vaudel M, Bittremieux W. Proceedings of the EuBIC developer's meeting 2018. J Proteomics 2018; 187:25-27. [PMID: 29864591 DOI: 10.1016/j.jprot.2018.05.015] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2018] [Accepted: 05/27/2018] [Indexed: 11/18/2022]
Abstract
The inaugural European Bioinformatics Community (EuBIC) developer's meeting was held from January 9th to January 12th 2018 in Ghent, Belgium. While the meeting kicked off with an interactive keynote session featuring four internationally renowned experts in the field of computational proteomics, its primary focus were the hands-on hackathon sessions which featured six community-proposed projects revolving around three major topics: Here, we present an overview of the scientific program of the EuBIC developer's meeting and provide a starting point for follow-up on the covered projects.
Collapse
Affiliation(s)
- Sander Willems
- Laboratory of Pharmaceutical Biotechnology, Ghent University, Ghent, Belgium
| | - David Bouyssié
- Institute of Pharmacology and Structural Biology, University of Toulouse, CNRS, UPS, Toulouse, France
| | - Dieter Deforce
- Laboratory of Pharmaceutical Biotechnology, Ghent University, Ghent, Belgium
| | - Viktoria Dorfer
- Bioinformatics Research Group, University of Applied Sciences Upper Austria, Hagenberg, Austria
| | - Vladimir Gorshkov
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, Odense M, Denmark
| | - Dominik Kopczynski
- Leibniz-Institut für Analytische Wissenschaften - ISAS - e.V., Dortmund, Germany
| | - Kris Laukens
- Department of Mathematics and Computer Science, University of Antwerp, Antwerp, Belgium
| | - Marie Locard-Paulet
- Institute of Pharmacology and Structural Biology, University of Toulouse, CNRS, UPS, Toulouse, France
| | - Veit Schwämmle
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, Odense M, Denmark
| | - Julian Uszkoreit
- Medizinisches Proteom-Center, Ruhr University Bochum, Bochum, Germany
| | - Dirk Valkenborg
- Interuniversity Institute for Biostatistics and Statistical Bioinformatics, Hasselt University, Hasselt, Belgium; Centre for Proteomics, University of Antwerp, Antwerp, Belgium
| | - Marc Vaudel
- KG Jebsen Center for Diabetes Research, Department of Clinical Science, University of Bergen, Bergen, Norway; Center for Medical Genetics and Molecular Medicine, Haukeland University Hospital, Bergen, Norway
| | - Wout Bittremieux
- Department of Mathematics and Computer Science, University of Antwerp, Antwerp, Belgium; Department of Genome Sciences, University of Washington, Seattle, WA, USA.
| |
Collapse
|
9
|
Chiva C, Olivella R, Borràs E, Espadas G, Pastor O, Solé A, Sabidó E. QCloud: A cloud-based quality control system for mass spectrometry-based proteomics laboratories. PLoS One 2018; 13:e0189209. [PMID: 29324744 PMCID: PMC5764250 DOI: 10.1371/journal.pone.0189209] [Citation(s) in RCA: 88] [Impact Index Per Article: 14.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2017] [Accepted: 11/21/2017] [Indexed: 01/03/2023] Open
Abstract
The increasing number of biomedical and translational applications in mass spectrometry-based proteomics poses new analytical challenges and raises the need for automated quality control systems. Despite previous efforts to set standard file formats, data processing workflows and key evaluation parameters for quality control, automated quality control systems are not yet widespread among proteomics laboratories, which limits the acquisition of high-quality results, inter-laboratory comparisons and the assessment of variability of instrumental platforms. Here we present QCloud, a cloud-based system to support proteomics laboratories in daily quality assessment using a user-friendly interface, easy setup, automated data processing and archiving, and unbiased instrument evaluation. QCloud supports the most common targeted and untargeted proteomics workflows, it accepts data formats from different vendors and it enables the annotation of acquired data and reporting incidences. A complete version of the QCloud system has successfully been developed and it is now open to the proteomics community (http://qcloud.crg.eu). QCloud system is an open source project, publicly available under a Creative Commons License Attribution-ShareAlike 4.0.
Collapse
Affiliation(s)
- Cristina Chiva
- Proteomics Unit, Centre de Regulació Genòmica (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Barcelona
- Universitat Pompeu Fabra (UPF), Barcelona, Barcelona
| | - Roger Olivella
- Proteomics Unit, Centre de Regulació Genòmica (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Barcelona
- Universitat Pompeu Fabra (UPF), Barcelona, Barcelona
| | - Eva Borràs
- Proteomics Unit, Centre de Regulació Genòmica (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Barcelona
- Universitat Pompeu Fabra (UPF), Barcelona, Barcelona
| | - Guadalupe Espadas
- Proteomics Unit, Centre de Regulació Genòmica (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Barcelona
- Universitat Pompeu Fabra (UPF), Barcelona, Barcelona
| | - Olga Pastor
- Proteomics Unit, Centre de Regulació Genòmica (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Barcelona
- Universitat Pompeu Fabra (UPF), Barcelona, Barcelona
| | - Amanda Solé
- Proteomics Unit, Centre de Regulació Genòmica (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Barcelona
- Universitat Pompeu Fabra (UPF), Barcelona, Barcelona
| | - Eduard Sabidó
- Proteomics Unit, Centre de Regulació Genòmica (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Barcelona
- Universitat Pompeu Fabra (UPF), Barcelona, Barcelona
- * E-mail:
| |
Collapse
|
10
|
Deutsch EW, Orchard S, Binz PA, Bittremieux W, Eisenacher M, Hermjakob H, Kawano S, Lam H, Mayer G, Menschaert G, Perez-Riverol Y, Salek RM, Tabb DL, Tenzer S, Vizcaíno JA, Walzer M, Jones AR. Proteomics Standards Initiative: Fifteen Years of Progress and Future Work. J Proteome Res 2017; 16:4288-4298. [PMID: 28849660 PMCID: PMC5715286 DOI: 10.1021/acs.jproteome.7b00370] [Citation(s) in RCA: 69] [Impact Index Per Article: 9.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2017] [Indexed: 12/21/2022]
Abstract
The Proteomics Standards Initiative (PSI) of the Human Proteome Organization (HUPO) has now been developing and promoting open community standards and software tools in the field of proteomics for 15 years. Under the guidance of the chair, cochairs, and other leadership positions, the PSI working groups are tasked with the development and maintenance of community standards via special workshops and ongoing work. Among the existing ratified standards, the PSI working groups continue to update PSI-MI XML, MITAB, mzML, mzIdentML, mzQuantML, mzTab, and the MIAPE (Minimum Information About a Proteomics Experiment) guidelines with the advance of new technologies and techniques. Furthermore, new standards are currently either in the final stages of completion (proBed and proBAM for proteogenomics results as well as PEFF) or in early stages of design (a spectral library standard format, a universal spectrum identifier, the qcML quality control format, and the Protein Expression Interface (PROXI) web services Application Programming Interface). In this work we review the current status of all of these aspects of the PSI, describe synergies with other efforts such as the ProteomeXchange Consortium, the Human Proteome Project, and the metabolomics community, and provide a look at future directions of the PSI.
Collapse
Affiliation(s)
- Eric W. Deutsch
- Institute
for Systems Biology, Seattle, Washington 98109, United States
| | - Sandra Orchard
- European
Molecular Biology Laboratory, European Bioinformatics
Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Pierre-Alain Binz
- CHUV
Centre Hospitalier Universitaire Vaudois, 1011 Lausanne, Switzerland
| | - Wout Bittremieux
- Department
of Mathematics and Computer Science, University
of Antwerp, Middelheimlaan
1, 2020 Antwerp, Belgium
| | - Martin Eisenacher
- Medizinisches
Proteom Center (MPC), Ruhr-Universität
Bochum, D-44801 Bochum, Germany
| | - Henning Hermjakob
- European
Molecular Biology Laboratory, European Bioinformatics
Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
- State
Key Laboratory of Proteomics, Beijing Proteome Research Center, Beijing
Institute of Radiation Medicine, National
Center for Protein Sciences, Beijing, Beijing 102206, China
| | - Shin Kawano
- Database
Center for Life Science, Joint Support Center for Data Science Research,
Research Organization of Information and Systems, Kashiwa, Chiba 277-0871, Japan
| | - Henry Lam
- Division
of Biomedical Engineering, The Hong Kong
University of Science and Technology, Clear Water Bay, Hong Kong, P. R. China
- Department
of Chemical and Biomolecular Engineering, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong, P. R. China
| | - Gerhard Mayer
- Medizinisches
Proteom Center (MPC), Ruhr-Universität
Bochum, D-44801 Bochum, Germany
| | - Gerben Menschaert
- Lab of Bioinformatics
and Computational Genomics (BioBix), Faculty of Bioscience Engineering, Ghent University, 9000 Ghent, Belgium
| | - Yasset Perez-Riverol
- European
Molecular Biology Laboratory, European Bioinformatics
Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Reza M. Salek
- European
Molecular Biology Laboratory, European Bioinformatics
Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - David L. Tabb
- SA
MRC Centre
for TB Research, DST/NRF Centre of Excellence for Biomedical TB Research,
Division of Molecular Biology and Human Genetics, Faculty of Medicine
and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | - Stefan Tenzer
- Institute
for Immunology, University Medical Center
of the Johannes-Gutenberg University Mainz, 55131 Mainz, Germany
| | - Juan Antonio Vizcaíno
- European
Molecular Biology Laboratory, European Bioinformatics
Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Mathias Walzer
- European
Molecular Biology Laboratory, European Bioinformatics
Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Andrew R. Jones
- Institute
of Integrative Biology, University of Liverpool, South Wirral L64 4AY, United Kingdom
| |
Collapse
|
11
|
Pfeuffer J, Sachsenberg T, Alka O, Walzer M, Fillbrunn A, Nilse L, Schilling O, Reinert K, Kohlbacher O. OpenMS – A platform for reproducible analysis of mass spectrometry data. J Biotechnol 2017; 261:142-148. [DOI: 10.1016/j.jbiotec.2017.05.016] [Citation(s) in RCA: 48] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2017] [Revised: 05/17/2017] [Accepted: 05/22/2017] [Indexed: 10/19/2022]
|
12
|
Meysman P, Titeca K, Eyckerman S, Tavernier J, Goethals B, Martens L, Valkenborg D, Laukens K. Protein complex analysis: From raw protein lists to protein interaction networks. MASS SPECTROMETRY REVIEWS 2017; 36:600-614. [PMID: 26709718 DOI: 10.1002/mas.21485] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/04/2015] [Accepted: 11/17/2015] [Indexed: 06/05/2023]
Abstract
The elucidation of molecular interaction networks is one of the pivotal challenges in the study of biology. Affinity purification-mass spectrometry and other co-complex methods have become widely employed experimental techniques to identify protein complexes. These techniques typically suffer from a high number of false negatives and false positive contaminants due to technical shortcomings and purification biases. To support a diverse range of experimental designs and approaches, a large number of computational methods have been proposed to filter, infer and validate protein interaction networks from experimental pull-down MS data. Nevertheless, this expansion of available methods complicates the selection of the most optimal ones to support systems biology-driven knowledge extraction. In this review, we give an overview of the most commonly used computational methods to process and interpret co-complex results, and we discuss the issues and unsolved problems that still exist within the field. © 2015 Wiley Periodicals, Inc. Mass Spec Rev 36:600-614, 2017.
Collapse
Affiliation(s)
- Pieter Meysman
- Advanced Database Research and Modelling (ADReM), Department of Mathematics and Computer Science, University of Antwerp, Antwerp, Belgium
- Biomedical Informatics Research Center Antwerp (biomina), University of Antwerp/Antwerp University Hospital, Edegem, Belgium
| | - Kevin Titeca
- Department of Medical Protein Research, VIB, B-9000 Ghent, Belgium
- Department of Biochemistry, Ghent University, B-9000 Ghent, Belgium
| | - Sven Eyckerman
- Department of Medical Protein Research, VIB, B-9000 Ghent, Belgium
- Department of Biochemistry, Ghent University, B-9000 Ghent, Belgium
| | - Jan Tavernier
- Department of Medical Protein Research, VIB, B-9000 Ghent, Belgium
- Department of Biochemistry, Ghent University, B-9000 Ghent, Belgium
| | - Bart Goethals
- Advanced Database Research and Modelling (ADReM), Department of Mathematics and Computer Science, University of Antwerp, Antwerp, Belgium
| | - Lennart Martens
- Department of Medical Protein Research, VIB, B-9000 Ghent, Belgium
- Department of Biochemistry, Ghent University, B-9000 Ghent, Belgium
| | - Dirk Valkenborg
- Flemish Institute for Technological Research (VITO), Mol, Belgium
- IBioStat, Hasselt University, Hasselt, Belgium
- CFP-CeProMa, University of Antwerp, Antwerp, Belgium
| | - Kris Laukens
- Advanced Database Research and Modelling (ADReM), Department of Mathematics and Computer Science, University of Antwerp, Antwerp, Belgium
- Biomedical Informatics Research Center Antwerp (biomina), University of Antwerp/Antwerp University Hospital, Edegem, Belgium
| |
Collapse
|
13
|
Barbieri R, Guryev V, Brandsma CA, Suits F, Bischoff R, Horvatovich P. Proteogenomics: Key Driver for Clinical Discovery and Personalized Medicine. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2017; 926:21-47. [PMID: 27686804 DOI: 10.1007/978-3-319-42316-6_3] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
Abstract
Proteogenomics is a multi-omics research field that has the aim to efficiently integrate genomics, transcriptomics and proteomics. With this approach it is possible to identify new patient-specific proteoforms that may have implications in disease development, specifically in cancer. Understanding the impact of a large number of mutations detected at the genomics level is needed to assess the effects at the proteome level. Proteogenomics data integration would help in identifying molecular changes that are persistent across multiple molecular layers and enable better interpretation of molecular mechanisms of disease, such as the causal relationship between single nucleotide polymorphisms (SNPs) and the expression of transcripts and translation of proteins compared to mainstream proteomics approaches. Identifying patient-specific protein forms and getting a better picture of molecular mechanisms of disease opens the avenue for precision and personalized medicine. Proteogenomics is, however, a challenging interdisciplinary science that requires the understanding of sample preparation, data acquisition and processing for genomics, transcriptomics and proteomics. This chapter aims to guide the reader through the technology and bioinformatics aspects of these multi-omics approaches, illustrated with proteogenomics applications having clinical or biological relevance.
Collapse
Affiliation(s)
- Ruggero Barbieri
- Department of Gastroenterology and Hepatology, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands
| | - Victor Guryev
- European Research Institute for the Biology of Ageing, University Medical Center Groningen, Antonius Deusinglaan 1, 9713 AV, Groningen, The Netherlands
| | - Corry-Anke Brandsma
- Department of Pathology & Medical Biology, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands
| | - Frank Suits
- IBM T.J. Watson Research Centre, 1101 Kitchawan Road, Yorktown Heights, New York, 10598, NY, USA
| | - Rainer Bischoff
- Department of Analytical Biochemistry, Research Institute of Pharmacy, University of Groningen, Antonius Deusinglaan 1, 9713 AV, Groningen, The Netherlands
| | - Peter Horvatovich
- Department of Analytical Biochemistry, Research Institute of Pharmacy, University of Groningen, Antonius Deusinglaan 1, 9713 AV, Groningen, The Netherlands.
| |
Collapse
|
14
|
Abstract
In order to achieve more accurate disease prevention, diagnosis, and treatment, clinical and genetic data need extensive and systematically associated study. As one way to achieve precision medicine, a laboratory information management system (LIMS) can effectively associate clinical data in a macrocosmic aspect and genomic data in a microcosmic aspect. This chapter summarizes the application of the LIMS in a clinical data management and implementation mode. It also discusses the principles of a LIMS in clinical data management, as well as the opportunities and challenges in the context of medical informatics.
Collapse
|
15
|
Bittremieux W, Walzer M, Tenzer S, Zhu W, Salek RM, Eisenacher M, Tabb DL. The Human Proteome Organization-Proteomics Standards Initiative Quality Control Working Group: Making Quality Control More Accessible for Biological Mass Spectrometry. Anal Chem 2017; 89:4474-4479. [PMID: 28318237 DOI: 10.1021/acs.analchem.6b04310] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023]
Abstract
To have confidence in results acquired during biological mass spectrometry experiments, a systematic approach to quality control is of vital importance. Nonetheless, until now, only scattered initiatives have been undertaken to this end, and these individual efforts have often not been complementary. To address this issue, the Human Proteome Organization-Proteomics Standards Initiative has established a new working group on quality control at its meeting in the spring of 2016. The goal of this working group is to provide a unifying framework for quality control data. The initial focus will be on providing a community-driven standardized file format for quality control. For this purpose, the previously proposed qcML format will be adapted to support a variety of use cases for both proteomics and metabolomics applications, and it will be established as an official PSI format. An important consideration is to avoid enforcing restrictive requirements on quality control but instead provide the basic technical necessities required to support extensive quality control for any type of mass spectrometry-based workflow. We want to emphasize that this is an open community effort, and we seek participation from all scientists with an interest in this field.
Collapse
Affiliation(s)
- Wout Bittremieux
- Department of Mathematics and Computer Science, University of Antwerp , Middelheimlaan 1, 2020 Antwerp, Belgium.,Biomedical Informatics Research Center Antwerp (biomina), University of Antwerp/Antwerp University Hospital , Wilrijkstraat 10, 2650 Edegem, Belgium
| | - Mathias Walzer
- Department of Computer Science, University of Tübingen , Tübingen 72076, Germany.,Center for Bioinformatics, University of Tübingen , Tübingen 72074, Germany
| | - Stefan Tenzer
- Institute for Immunology, University Medical Center of the Johannes-Gutenberg University Mainz D 55131, Germany
| | - Weimin Zhu
- National Center for Protein Science , No. 38, Science Park Road, Changping District, Beijing 102206, China
| | - Reza M Salek
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI) , Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Martin Eisenacher
- Medical Bioinformatics, Medizinisches Proteom-Center, Ruhr-University Bochum , Bochum 44801, Germany
| | - David L Tabb
- Division of Molecular Biology and Human Genetics, Stellenbosch University Faculty of Medicine and Health Sciences , Tygerberg Hospital, Francie Van Zijl Drive, Cape Town 7505, South Africa
| |
Collapse
|
16
|
A Golden Age for Working with Public Proteomics Data. Trends Biochem Sci 2017; 42:333-341. [PMID: 28118949 PMCID: PMC5414595 DOI: 10.1016/j.tibs.2017.01.001] [Citation(s) in RCA: 66] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2016] [Revised: 12/13/2016] [Accepted: 01/02/2017] [Indexed: 11/23/2022]
Abstract
Data sharing in mass spectrometry (MS)-based proteomics is becoming a common scientific practice, as is now common in the case of other, more mature ‘omics’ disciplines like genomics and transcriptomics. We want to highlight that this situation, unprecedented in the field, opens a plethora of opportunities for data scientists. First, we explain in some detail some of the work already achieved, such as systematic reanalysis efforts. We also explain existing applications of public proteomics data, such as proteogenomics and the creation of spectral libraries and spectral archives. Finally, we discuss the main existing challenges and mention the first attempts to combine public proteomics data with other types of omics data sets. The field of proteomics has matured and diversified substantially over the past 10 years. Proteomics data are increasingly shared through centralized, public repositories. Standardization efforts have ensured that a large proportion of these public data can be read and processed by any interested researcher. Because any proteomics data set is only partially understood, there is great opportunity for (orthogonal) reuse of public data. While public proteomics data has so far remained outside ethics and privacy discussions, recent work indicates that there is an inherent risk.
Collapse
|
17
|
Bekri S. The role of metabolomics in precision medicine. EXPERT REVIEW OF PRECISION MEDICINE AND DRUG DEVELOPMENT 2016. [DOI: 10.1080/23808993.2016.1273067] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Affiliation(s)
- Soumeya Bekri
- Department of Metabolic Biochemistry, Rouen University Hospital, Rouen 76000, France
- Normandie Univ, UNIROUEN, INSERM, CHU Rouen, IRIB, INSERM U1245, Rouen 76000, France
| |
Collapse
|
18
|
Bittremieux W, Valkenborg D, Martens L, Laukens K. Computational quality control tools for mass spectrometry proteomics. Proteomics 2016; 17. [DOI: 10.1002/pmic.201600159] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2016] [Revised: 07/28/2016] [Accepted: 08/19/2016] [Indexed: 12/30/2022]
Affiliation(s)
- Wout Bittremieux
- Department of Mathematics and Computer Science; University of Antwerp; Antwerp Belgium
- Biomedical Informatics Research Center Antwerp (biomina); University of Antwerp/Antwerp, University Hospital; Edegem Belgium
| | - Dirk Valkenborg
- Flemish Institute for Technological Research (VITO); Mol Belgium
- CFP; University of Antwerp; Antwerp Belgium
- I-BioStat; Hasselt University; Diepenbeek Belgium
| | - Lennart Martens
- Medical Biotechnology Center; VIB; Ghent Belgium
- Department of Biochemistry, Faculty of Medicine and Health Sciences; Ghent University; Ghent Belgium
- Bioinformatics Institute Ghent; Ghent University; Zwijnaarde Belgium
| | - Kris Laukens
- Department of Mathematics and Computer Science; University of Antwerp; Antwerp Belgium
- Biomedical Informatics Research Center Antwerp (biomina); University of Antwerp/Antwerp, University Hospital; Edegem Belgium
| |
Collapse
|
19
|
Tebani A, Afonso C, Marret S, Bekri S. Omics-Based Strategies in Precision Medicine: Toward a Paradigm Shift in Inborn Errors of Metabolism Investigations. Int J Mol Sci 2016; 17:ijms17091555. [PMID: 27649151 PMCID: PMC5037827 DOI: 10.3390/ijms17091555] [Citation(s) in RCA: 105] [Impact Index Per Article: 13.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2016] [Revised: 09/06/2016] [Accepted: 09/07/2016] [Indexed: 12/20/2022] Open
Abstract
The rise of technologies that simultaneously measure thousands of data points represents the heart of systems biology. These technologies have had a huge impact on the discovery of next-generation diagnostics, biomarkers, and drugs in the precision medicine era. Systems biology aims to achieve systemic exploration of complex interactions in biological systems. Driven by high-throughput omics technologies and the computational surge, it enables multi-scale and insightful overviews of cells, organisms, and populations. Precision medicine capitalizes on these conceptual and technological advancements and stands on two main pillars: data generation and data modeling. High-throughput omics technologies allow the retrieval of comprehensive and holistic biological information, whereas computational capabilities enable high-dimensional data modeling and, therefore, accessible and user-friendly visualization. Furthermore, bioinformatics has enabled comprehensive multi-omics and clinical data integration for insightful interpretation. Despite their promise, the translation of these technologies into clinically actionable tools has been slow. In this review, we present state-of-the-art multi-omics data analysis strategies in a clinical context. The challenges of omics-based biomarker translation are discussed. Perspectives regarding the use of multi-omics approaches for inborn errors of metabolism (IEM) are presented by introducing a new paradigm shift in addressing IEM investigations in the post-genomic era.
Collapse
Affiliation(s)
- Abdellah Tebani
- Department of Metabolic Biochemistry, Rouen University Hospital, 76031 Rouen, France.
- Normandie University, UNIROUEN, INSERM, CHU Rouen, Laboratoire NeoVasc ERI28, 76000 Rouen, France.
- Normandie University, UNIROUEN, INSA Rouen, CNRS, COBRA, 76000 Rouen, France.
| | - Carlos Afonso
- Normandie University, UNIROUEN, INSA Rouen, CNRS, COBRA, 76000 Rouen, France.
| | - Stéphane Marret
- Normandie University, UNIROUEN, INSERM, CHU Rouen, Laboratoire NeoVasc ERI28, 76000 Rouen, France.
- Department of Neonatal Pediatrics, Intensive Care and Neuropediatrics, Rouen University Hospital, 76031 Rouen, France.
| | - Soumeya Bekri
- Department of Metabolic Biochemistry, Rouen University Hospital, 76031 Rouen, France.
- Normandie University, UNIROUEN, INSERM, CHU Rouen, Laboratoire NeoVasc ERI28, 76000 Rouen, France.
| |
Collapse
|
20
|
Röst HL, Sachsenberg T, Aiche S, Bielow C, Weisser H, Aicheler F, Andreotti S, Ehrlich HC, Gutenbrunner P, Kenar E, Liang X, Nahnsen S, Nilse L, Pfeuffer J, Rosenberger G, Rurik M, Schmitt U, Veit J, Walzer M, Wojnar D, Wolski WE, Schilling O, Choudhary JS, Malmström L, Aebersold R, Reinert K, Kohlbacher O. OpenMS: a flexible open-source software platform for mass spectrometry data analysis. Nat Methods 2016; 13:741-8. [DOI: 10.1038/nmeth.3959] [Citation(s) in RCA: 365] [Impact Index Per Article: 45.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2016] [Accepted: 06/27/2016] [Indexed: 12/28/2022]
|
21
|
Avtonomov D, Raskind A, Nesvizhskii AI. BatMass: a Java Software Platform for LC-MS Data Visualization in Proteomics and Metabolomics. J Proteome Res 2016; 15:2500-9. [PMID: 27306858 PMCID: PMC5583644 DOI: 10.1021/acs.jproteome.6b00021] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
Mass spectrometry (MS) coupled to liquid chromatography (LC) is a commonly used technique in metabolomic and proteomic research. As the size and complexity of LC-MS-based experiments grow, it becomes increasingly more difficult to perform quality control of both raw data and processing results. In a practical setting, quality control steps for raw LC-MS data are often overlooked, and assessment of an experiment's success is based on some derived metrics such as "the number of identified compounds". The human brain interprets visual data much better than plain text, hence the saying "a picture is worth a thousand words". Here, we present the BatMass software package, which allows for performing quick quality control of raw LC-MS data through its fast visualization capabilities. It also serves as a testbed for developers of LC-MS data processing algorithms by providing a data access library for open mass spectrometry file formats and a means of visually mapping processing results back to the original data. We illustrate the utility of BatMass with several use cases of quality control and data exploration.
Collapse
Affiliation(s)
- Dmitry Avtonomov
- Department of Pathology, University of Michigan, Ann Arbor, MI 48109
| | | | - Alexey I. Nesvizhskii
- Department of Pathology, University of Michigan, Ann Arbor, MI 48109
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109
| |
Collapse
|
22
|
Ranninger C, Schmidt LE, Rurik M, Limonciel A, Jennings P, Kohlbacher O, Huber CG. Improving global feature detectabilities through scan range splitting for untargeted metabolomics by high-performance liquid chromatography-Orbitrap mass spectrometry. Anal Chim Acta 2016; 930:13-22. [PMID: 27265900 DOI: 10.1016/j.aca.2016.05.017] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2016] [Revised: 05/13/2016] [Accepted: 05/16/2016] [Indexed: 12/14/2022]
Abstract
Untargeted metabolomics aims at obtaining quantitative information on the highest possible number of low-molecular biomolecules present in a biological sample. Rather small changes in mass spectrometric spectrum acquisition parameters may have a significant influence on the detectabilities of metabolites in untargeted global-scale studies by means of high-performance liquid chromatography-mass spectrometry (HPLC-MS). Employing whole cell lysates of human renal proximal tubule cells, we present a systematic global-scale study of the influence of mass spectrometric scan parameters and post-acquisition data treatment on the number and intensity of metabolites detectable in whole cell lysates. Ion transmission and ion collection efficiencies in an Orbitrap-based mass spectrometer basically depend on the m/z range scanned, which, ideally, requires different instrument settings for the respective mass ranges investigated. Therefore, we split a full scan range of m/z 50-1000 relevant for metabolites into two separate segments (m/z 50-200 and m/z 200-1,000), allowing an independent tuning of the ion transmission parameters for both mass ranges. Three different implementations, involving either scanning from m/z 50-1000 in a single scan, or scanning from m/z 50-200 and from m/z 200-1000 in two alternating scans, or performing two separate HPLC-MS runs with m/z 50-200 and m/z 200-1000 scan ranges were critically assessed. The detected features were subjected to rigorous background filtering and quality control in order to obtain reliable metabolite features for subsequent differential quantification. The most efficient approach in terms of feature number, which forms the basis for statistical analysis, identification, and for generating biological hypotheses, was the separate analysis of two different mass ranges. This lead to an increase in the number of detectable metabolite features, especially in the higher mass range (m/z greater than 400), by 2.5 (negative mode) to 6-fold (positive mode) as compared to analysis involving a single scan range. The total number of features confidently detectable was 560 in positive ion mode, and 436 in negative ion mode.
Collapse
Affiliation(s)
- Christina Ranninger
- Department of Molecular Biology, Division of Chemistry and Bioanalytics, University of Salzburg, Salzburg, Austria.
| | - Lukas E Schmidt
- Department of Molecular Biology, Division of Chemistry and Bioanalytics, University of Salzburg, Salzburg, Austria.
| | - Marc Rurik
- Center for Bioinformatics, University of Tübingen, Tübingen, Germany; Department of Computer Science, University of Tübingen, Germany.
| | - Alice Limonciel
- Department of Physiology and Medical Physics, Division of Physiology, Medical University of Innsbruck, Innsbruck, 6020, Austria.
| | - Paul Jennings
- Department of Physiology and Medical Physics, Division of Physiology, Medical University of Innsbruck, Innsbruck, 6020, Austria.
| | - Oliver Kohlbacher
- Center for Bioinformatics, University of Tübingen, Tübingen, Germany; Department of Computer Science, University of Tübingen, Germany; Quantitative Biology Center, University of Tübingen, Germany; Faculty of Medicine, University of Tübingen, Germany; Biomolecular Interactions, Max Planck Institute for Developmental Biology, Tübingen, Germany.
| | - Christian G Huber
- Department of Molecular Biology, Division of Chemistry and Bioanalytics, University of Salzburg, Salzburg, Austria.
| |
Collapse
|
23
|
Martens L. Public proteomics data: How the field has evolved from sceptical inquiry to the promise of in silico proteomics. EUPA OPEN PROTEOMICS 2016; 11:42-44. [PMID: 29900110 PMCID: PMC5988554 DOI: 10.1016/j.euprot.2016.02.005] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/12/2015] [Revised: 02/13/2016] [Accepted: 02/15/2016] [Indexed: 12/23/2022]
Abstract
Proteomics data sharing moved from validation to re-use. New tools and services make data very easily accessible. Metadata provision can still benefit from improvements. Quality control metrics will soon be reported along with submitted data. Data re-use will enable the advent of actual in silico proteomics.
Collapse
Affiliation(s)
- Lennart Martens
- Department of Medical Protein Research, VIB 9000 Ghent, Belgium.,Department of Biochemistry, Ghent University, 9000 Ghent, Belgium.,Bioinformatics Institute Ghent, Ghent University, 9000 Ghent, Belgium
| |
Collapse
|
24
|
Distler U, Kuharev J, Navarro P, Tenzer S. Label-free quantification in ion mobility–enhanced data-independent acquisition proteomics. Nat Protoc 2016; 11:795-812. [DOI: 10.1038/nprot.2016.042] [Citation(s) in RCA: 176] [Impact Index Per Article: 22.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
|
25
|
Bittremieux W, Meysman P, Martens L, Valkenborg D, Laukens K. Unsupervised Quality Assessment of Mass Spectrometry Proteomics Experiments by Multivariate Quality Control Metrics. J Proteome Res 2016; 15:1300-7. [PMID: 26974716 DOI: 10.1021/acs.jproteome.6b00028] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
Despite many technological and computational advances, the results of a mass spectrometry proteomics experiment are still subject to a large variability. For the understanding and evaluation of how technical variability affects the results of an experiment, several computationally derived quality control metrics have been introduced. However, despite the availability of these metrics, a systematic approach to quality control is often still lacking because the metrics are not fully understood and are hard to interpret. Here, we present a toolkit of powerful techniques to analyze and interpret multivariate quality control metrics to assess the quality of mass spectrometry proteomics experiments. We show how unsupervised techniques applied to these quality control metrics can provide an initial discrimination between low-quality experiments and high-quality experiments prior to manual investigation. Furthermore, we provide a technique to obtain detailed information on the quality control metrics that are related to the decreased performance, which can be used as actionable information to improve the experimental setup. Our toolkit is released as open-source and can be downloaded from https://bitbucket.org/proteinspector/qc_analysis/ .
Collapse
Affiliation(s)
- Wout Bittremieux
- Department of Mathematics and Computer Science, University of Antwerp , 2020 Antwerp, Belgium.,Biomedical Informatics Research Center Antwerp (biomina), University of Antwerp/Antwerp University Hospital , 2650 Edegem, Belgium
| | - Pieter Meysman
- Department of Mathematics and Computer Science, University of Antwerp , 2020 Antwerp, Belgium.,Biomedical Informatics Research Center Antwerp (biomina), University of Antwerp/Antwerp University Hospital , 2650 Edegem, Belgium
| | - Lennart Martens
- Department of Medical Protein Research, VIB , 9000 Ghent, Belgium.,Department of Biochemistry, Faculty of Medicine and Health Sciences, Ghent University , 9000 Ghent, Belgium.,Bioinformatics Institute Ghent, Ghent University , 9000 Ghent, Belgium
| | - Dirk Valkenborg
- Flemish Institute for Technological Research (VITO) , 2400 Mol, Belgium.,CFP, University of Antwerp , 2020 Antwerp, Belgium.,I-BioStat, Hasselt University , 3590 Diepenbeek, Belgium
| | - Kris Laukens
- Department of Mathematics and Computer Science, University of Antwerp , 2020 Antwerp, Belgium.,Biomedical Informatics Research Center Antwerp (biomina), University of Antwerp/Antwerp University Hospital , 2650 Edegem, Belgium
| |
Collapse
|
26
|
Vizcaíno JA, Csordas A, del-Toro N, Dianes JA, Griss J, Lavidas I, Mayer G, Perez-Riverol Y, Reisinger F, Ternent T, Xu QW, Wang R, Hermjakob H. 2016 update of the PRIDE database and its related tools. Nucleic Acids Res 2016; 44:D447-56. [PMID: 26527722 PMCID: PMC4702828 DOI: 10.1093/nar/gkv1145] [Citation(s) in RCA: 2514] [Impact Index Per Article: 314.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2015] [Revised: 10/14/2015] [Accepted: 10/16/2015] [Indexed: 11/18/2022] Open
Abstract
The PRoteomics IDEntifications (PRIDE) database is one of the world-leading data repositories of mass spectrometry (MS)-based proteomics data. Since the beginning of 2014, PRIDE Archive (http://www.ebi.ac.uk/pride/archive/) is the new PRIDE archival system, replacing the original PRIDE database. Here we summarize the developments in PRIDE resources and related tools since the previous update manuscript in the Database Issue in 2013. PRIDE Archive constitutes a complete redevelopment of the original PRIDE, comprising a new storage backend, data submission system and web interface, among other components. PRIDE Archive supports the most-widely used PSI (Proteomics Standards Initiative) data standard formats (mzML and mzIdentML) and implements the data requirements and guidelines of the ProteomeXchange Consortium. The wide adoption of ProteomeXchange within the community has triggered an unprecedented increase in the number of submitted data sets (around 150 data sets per month). We outline some statistics on the current PRIDE Archive data contents. We also report on the status of the PRIDE related stand-alone tools: PRIDE Inspector, PRIDE Converter 2 and the ProteomeXchange submission tool. Finally, we will give a brief update on the resources under development 'PRIDE Cluster' and 'PRIDE Proteomes', which provide a complementary view and quality-scored information of the peptide and protein identification data available in PRIDE Archive.
Collapse
Affiliation(s)
- Juan Antonio Vizcaíno
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Attila Csordas
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Noemi del-Toro
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - José A Dianes
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Johannes Griss
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK Division of Immunology, Allergy and Infectious Diseases, Department of Dermatology, Medical University of Vienna, Austria
| | - Ilias Lavidas
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Gerhard Mayer
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK Medizinisches Proteom Center (MPC), Ruhr-Universität Bochum, D-44801 Bochum, Germany
| | - Yasset Perez-Riverol
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Florian Reisinger
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Tobias Ternent
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Qing-Wei Xu
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK Department of Computer Science and Technology, Hubei University of Education, Wuhan, China
| | - Rui Wang
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Henning Hermjakob
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK National Center for Protein Sciences, Beijing, China
| |
Collapse
|
27
|
Codrea MC, Nahnsen S. Platforms and Pipelines for Proteomics Data Analysis and Management. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2016; 919:203-215. [PMID: 27975218 DOI: 10.1007/978-3-319-41448-5_9] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
Since mass spectrometry was introduced as the core technology for large-scale analysis of the proteome, the speed of data acquisition, dynamic ranges of measurements, and data quality are continuously improving. These improvements are triggered by regular launches of new methodologies and instruments.
Collapse
Affiliation(s)
- Marius Cosmin Codrea
- Quantitative Biology Center (QBiC), University of Tübingen, Auf der Morgenstelle 10, 72076, Tübingen, Germany
| | - Sven Nahnsen
- Quantitative Biology Center (QBiC), University of Tübingen, Auf der Morgenstelle 10, 72076, Tübingen, Germany.
| |
Collapse
|
28
|
Bielow C, Mastrobuoni G, Kempa S. Proteomics Quality Control: Quality Control Software for MaxQuant Results. J Proteome Res 2015; 15:777-87. [PMID: 26653327 DOI: 10.1021/acs.jproteome.5b00780] [Citation(s) in RCA: 112] [Impact Index Per Article: 12.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Mass spectrometry-based proteomics coupled to liquid chromatography has matured into an automatized, high-throughput technology, producing data on the scale of multiple gigabytes per instrument per day. Consequently, an automated quality control (QC) and quality analysis (QA) capable of detecting measurement bias, verifying consistency, and avoiding propagation of error is paramount for instrument operators and scientists in charge of downstream analysis. We have developed an R-based QC pipeline called Proteomics Quality Control (PTXQC) for bottom-up LC-MS data generated by the MaxQuant software pipeline. PTXQC creates a QC report containing a comprehensive and powerful set of QC metrics, augmented with automated scoring functions. The automated scores are collated to create an overview heatmap at the beginning of the report, giving valuable guidance also to nonspecialists. Our software supports a wide range of experimental designs, including stable isotope labeling by amino acids in cell culture (SILAC), tandem mass tags (TMT), and label-free data. Furthermore, we introduce new metrics to score MaxQuant's Match-between-runs (MBR) functionality by which peptide identifications can be transferred across Raw files based on accurate retention time and m/z. Last but not least, PTXQC is easy to install and use and represents the first QC software capable of processing MaxQuant result tables. PTXQC is freely available at https://github.com/cbielow/PTXQC .
Collapse
Affiliation(s)
- Chris Bielow
- Max-Delbrück-Centrum for Molecular Medicine Berlin , Robert-Rössle-Straße 10, 13125 Berlin, Germany.,Berlin Institute of Health , Kapelle-Ufer 2, 10117 Berlin, Germany
| | - Guido Mastrobuoni
- Max-Delbrück-Centrum for Molecular Medicine Berlin , Robert-Rössle-Straße 10, 13125 Berlin, Germany
| | - Stefan Kempa
- Max-Delbrück-Centrum for Molecular Medicine Berlin , Robert-Rössle-Straße 10, 13125 Berlin, Germany.,Berlin Institute of Health , Kapelle-Ufer 2, 10117 Berlin, Germany
| |
Collapse
|
29
|
Nilse L, Sigloch FC, Biniossek ML, Schilling O. Toward improved peptide feature detection in quantitative proteomics using stable isotope labeling. Proteomics Clin Appl 2015; 9:706-14. [DOI: 10.1002/prca.201400173] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2014] [Revised: 02/23/2015] [Accepted: 04/28/2015] [Indexed: 11/11/2022]
Affiliation(s)
- Lars Nilse
- Institute of Molecular Medicine and Cell Research; University of Freiburg; Freiburg Germany
| | - Florian Christoph Sigloch
- Institute of Molecular Medicine and Cell Research; University of Freiburg; Freiburg Germany
- Faculty of Biology; University of Freiburg; Freiburg Germany
| | - Martin L. Biniossek
- Institute of Molecular Medicine and Cell Research; University of Freiburg; Freiburg Germany
| | - Oliver Schilling
- Institute of Molecular Medicine and Cell Research; University of Freiburg; Freiburg Germany
- BIOSS Centre for Biological Signalling Studies; University of Freiburg; Freiburg Germany
| |
Collapse
|
30
|
Campos A, Díaz R, Martínez-Bartolomé S, Sierra J, Gallardo O, Sabidó E, López-Lucendo M, Ignacio Casal J, Pasquarello C, Scherl A, Chiva C, Borras E, Odena A, Elortza F, Azkargorta M, Ibarrola N, Canals F, Albar JP, Oliveira E. Multicenter experiment for quality control of peptide-centric LC-MS/MS analysis - A longitudinal performance assessment with nLC coupled to orbitrap MS analyzers. J Proteomics 2015; 127:264-74. [PMID: 25982386 DOI: 10.1016/j.jprot.2015.05.012] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2015] [Revised: 05/07/2015] [Accepted: 05/11/2015] [Indexed: 11/19/2022]
Abstract
Proteomic technologies based on mass spectrometry (MS) have greatly evolved in the past years, and nowadays it is possible to routinely identify thousands of peptides from complex biological samples in a single LC-MS/MS experiment. Despite the advancements in proteomic technologies, the scientific community still faces important challenges in terms of depth and reproducibility of proteomics analyses. Here, we present a multicenter study designed to evaluate long-term performance of LC-MS/MS platforms within the Spanish Proteomics Facilities Network (ProteoRed-ISCIII). The study was performed under well-established standard operating procedures, and demonstrated that it is possible to attain qualitative and quantitative reproducibility over time. Our study highlights the importance of deploying quality assessment metrics routinely in individual laboratories and in multi-laboratory studies. The mass spectrometry data have been deposited to the ProteomeXchange Consortium with the data set identifier PXD000205.This article is part of a Special Issue entitled: HUPO 2014.
Collapse
Affiliation(s)
- Alex Campos
- ProteoRed-ISCIII, Proteomics Platform, Barcelona Science Park, Barcelona, Spain; Integromics, Madrid, Spain.
| | - Ramón Díaz
- ProteoRed-ISCIII, Proteomics Platform, Barcelona Science Park, Barcelona, Spain
| | | | | | - Oscar Gallardo
- ProteoRed-ISCIII, CSIC/UAB Proteomics Laboratory, Instituto de Investigaciones Biomédicas de Barcelona, Spanish National Research Council, Barcelona, Spain
| | - Eduard Sabidó
- ProteoRed-ISCIII, Proteomics Unit, Universitat Pompeu Fabra (UPF) and Centre de Regulació Genòmica (CRG), Barcelona, Spain
| | - Maria López-Lucendo
- ProteoRed-ISCIII, Proteomics Facility and Functional Proteomics Laboratory, Centro de Investigaciones, Biológicas, Madrid, Spain
| | - J Ignacio Casal
- ProteoRed-ISCIII, Proteomics Facility and Functional Proteomics Laboratory, Centro de Investigaciones, Biológicas, Madrid, Spain
| | | | - Alexander Scherl
- Department of Human Protein Sciences, CMU, University of Geneva, Switzerland
| | - Cristina Chiva
- ProteoRed-ISCIII, Proteomics Unit, Universitat Pompeu Fabra (UPF) and Centre de Regulació Genòmica (CRG), Barcelona, Spain
| | - Eva Borras
- ProteoRed-ISCIII, Proteomics Unit, Universitat Pompeu Fabra (UPF) and Centre de Regulació Genòmica (CRG), Barcelona, Spain
| | - Antonia Odena
- ProteoRed-ISCIII, Proteomics Platform, Barcelona Science Park, Barcelona, Spain
| | - Félix Elortza
- ProteoRed-ISCIII, Proteomics Platform, CIC bioGUNE, CIBERehd, Technology Park of Bizkaia, Derio, Spain
| | - Mikel Azkargorta
- ProteoRed-ISCIII, Proteomics Platform, CIC bioGUNE, CIBERehd, Technology Park of Bizkaia, Derio, Spain
| | - Nieves Ibarrola
- ProteoRed-ISCIII, Centro de Investigación del Cáncer and Instituto de Biología Molecular y Celular del Cáncer, CSIC-University of Salamanca, Salamanca, Spain
| | - Francesc Canals
- ProteoRed-ISCIII, Proteomic Laboratory, Vall d'Hebron Institute of Oncology-VHIO, Vall d'Hebron University Hospital, Barcelona, Spain
| | - Juan P Albar
- ProteoRed-ISCIII, Proteomics Facility, Centro Nacional de Biotecnología - CSIC, Madrid, Spain
| | - Eliandre Oliveira
- ProteoRed-ISCIII, Proteomics Platform, Barcelona Science Park, Barcelona, Spain
| |
Collapse
|
31
|
Deutsch EW, Albar JP, Binz PA, Eisenacher M, Jones AR, Mayer G, Omenn GS, Orchard S, Vizcaíno JA, Hermjakob H. Development of data representation standards by the human proteome organization proteomics standards initiative. J Am Med Inform Assoc 2015; 22:495-506. [PMID: 25726569 PMCID: PMC4457114 DOI: 10.1093/jamia/ocv001] [Citation(s) in RCA: 45] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2014] [Revised: 09/29/2014] [Accepted: 01/05/2015] [Indexed: 11/22/2022] Open
Abstract
OBJECTIVE To describe the goals of the Proteomics Standards Initiative (PSI) of the Human Proteome Organization, the methods that the PSI has employed to create data standards, the resulting output of the PSI, lessons learned from the PSI's evolution, and future directions and synergies for the group. MATERIALS AND METHODS The PSI has 5 categories of deliverables that have guided the group. These are minimum information guidelines, data formats, controlled vocabularies, resources and software tools, and dissemination activities. These deliverables are produced via the leadership and working group organization of the initiative, driven by frequent workshops and ongoing communication within the working groups. Official standards are subjected to a rigorous document process that includes several levels of peer review prior to release. RESULTS We have produced and published minimum information guidelines describing what information should be provided when making data public, either via public repositories or other means. The PSI has produced a series of standard formats covering mass spectrometer input, mass spectrometer output, results of informatics analysis (both qualitative and quantitative analyses), reports of molecular interaction data, and gel electrophoresis analyses. We have produced controlled vocabularies that ensure that concepts are uniformly annotated in the formats and engaged in extensive software development and dissemination efforts so that the standards can efficiently be used by the community.Conclusion In its first dozen years of operation, the PSI has produced many standards that have accelerated the field of proteomics by facilitating data exchange and deposition to data repositories. We look to the future to continue developing standards for new proteomics technologies and workflows and mechanisms for integration with other omics data types. Our products facilitate the translation of genomics and proteomics findings to clinical and biological phenotypes. The PSI website can be accessed at http://www.psidev.info.
Collapse
Affiliation(s)
| | - Juan Pablo Albar
- Died July 18, 2014 Proteomics Facility, Centro Nacional de Biotecnología - CSIC, Madrid, Spain ProteoRed Consortium, Spanish National Institute of Proteomics, Madrid, Spain
| | - Pierre-Alain Binz
- CHUV Centre Hospitalier Universitaire Vaudois, Lausanne, Switzerland
| | - Martin Eisenacher
- Medizinisches Proteom Center (MPC), Ruhr-Universität Bochum, Bochum, Germany
| | - Andrew R Jones
- Institute of Integrative Biology, University of Liverpool, Liverpool, UK
| | - Gerhard Mayer
- Medizinisches Proteom Center (MPC), Ruhr-Universität Bochum, Bochum, Germany
| | - Gilbert S Omenn
- Institute for Systems Biology, Seattle, USA Department of Computational Medicine & Bioinformatics, University of Michigan, Ann Arbor, USA
| | - Sandra Orchard
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, UK
| | - Juan Antonio Vizcaíno
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, UK
| | - Henning Hermjakob
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, UK
| |
Collapse
|
32
|
Bittremieux W, Willems H, Kelchtermans P, Martens L, Laukens K, Valkenborg D. iMonDB: Mass Spectrometry Quality Control through Instrument Monitoring. J Proteome Res 2015; 14:2360-6. [PMID: 25798920 DOI: 10.1021/acs.jproteome.5b00127] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Over the past few years, awareness has risen that for mass-spectrometry-based proteomics methods to mature into everyday analytical and clinical practices, extensive quality assessment is mandatory. A currently overlooked source of qualitative information originates from the mass spectrometer itself. Apart from the actual mass spectral data, raw-data objects also contain parameter settings and sensory information about the mass instrument. This information gives a detailed account of the operation of the instrument, which eventually can be related to observations in mass spectral data. The advantage of instrument information at the lowest level is the high sensitivity to detect emerging defects in a timely fashion. To this end, we introduce the Instrument MONitoring DataBase (iMonDB), which allows us to automatically extract, store, and manage the instrument parameters from raw-data objects into a highly efficient database structure. This enables us to monitor the instrument parameters over a considerable time period. Time course information about the instrument performance is necessary to define the normal range of operation and to detect anomalies that may correlate with instrument failure. The proposed tools foster an additional handle on quality control and are released as open source under the permissive Apache 2.0 license. The tools can be downloaded from https://bitbucket.org/proteinspector/imondb.
Collapse
Affiliation(s)
- Wout Bittremieux
- †Department of Mathematics and Computer Science, University of Antwerp, Middelheimlaan 1, 2020 Antwerp, Belgium.,‡Biomedical Informatics Research Center Antwerp (biomina), University of Antwerp/Antwerp University Hospital, Wilrijkstraat 10, 2650 Edegem, Belgium
| | - Hanny Willems
- §Flemish Institute for Technological Research (VITO), Boeretang 200, BE-2400 Mol, Belgium
| | - Pieter Kelchtermans
- §Flemish Institute for Technological Research (VITO), Boeretang 200, BE-2400 Mol, Belgium.,∥Department of Medical Protein Research, VIB, Albert Baertsoenkaai 3, 9000 Ghent, Belgium.,⊥Department of Biochemistry, Faculty of Medicine and Health Sciences, Ghent University, Albert Baertsoenkaai 3, 9000 Ghent, Belgium
| | - Lennart Martens
- ∥Department of Medical Protein Research, VIB, Albert Baertsoenkaai 3, 9000 Ghent, Belgium.,⊥Department of Biochemistry, Faculty of Medicine and Health Sciences, Ghent University, Albert Baertsoenkaai 3, 9000 Ghent, Belgium
| | - Kris Laukens
- †Department of Mathematics and Computer Science, University of Antwerp, Middelheimlaan 1, 2020 Antwerp, Belgium.,‡Biomedical Informatics Research Center Antwerp (biomina), University of Antwerp/Antwerp University Hospital, Wilrijkstraat 10, 2650 Edegem, Belgium
| | - Dirk Valkenborg
- §Flemish Institute for Technological Research (VITO), Boeretang 200, BE-2400 Mol, Belgium.,#CFP, University of Antwerp, Groenenborgerlaan 171, 2020 Antwerp, Belgium.,▽I-BioStat, Hasselt University, Agoralaan - Building D, 3590 Diepenbeek, Belgium
| |
Collapse
|
33
|
Aiche S, Sachsenberg T, Kenar E, Walzer M, Wiswedel B, Kristl T, Boyles M, Duschl A, Huber CG, Berthold MR, Reinert K, Kohlbacher O. Workflows for automated downstream data analysis and visualization in large-scale computational mass spectrometry. Proteomics 2015; 15:1443-7. [PMID: 25604327 PMCID: PMC4415483 DOI: 10.1002/pmic.201400391] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2014] [Revised: 11/23/2014] [Accepted: 01/16/2015] [Indexed: 12/27/2022]
Abstract
MS-based proteomics and metabolomics are rapidly evolving research fields driven by the development of novel instruments, experimental approaches, and analysis methods. Monolithic analysis tools perform well on single tasks but lack the flexibility to cope with the constantly changing requirements and experimental setups. Workflow systems, which combine small processing tools into complex analysis pipelines, allow custom-tailored and flexible data-processing workflows that can be published or shared with collaborators. In this article, we present the integration of established tools for computational MS from the open-source software framework OpenMS into the workflow engine Konstanz Information Miner (KNIME) for the analysis of large datasets and production of high-quality visualizations. We provide example workflows to demonstrate combined data processing and visualization for three diverse tasks in computational MS: isobaric mass tag based quantitation in complex experimental setups, label-free quantitation and identification of metabolites, and quality control for proteomics experiments.
Collapse
Affiliation(s)
- Stephan Aiche
- Department of Mathematics and Computer Science, Freie Universität Berlin, Berlin, Germany
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
34
|
Oveland E, Muth T, Rapp E, Martens L, Berven FS, Barsnes H. Viewing the proteome: how to visualize proteomics data? Proteomics 2015; 15:1341-55. [PMID: 25504833 DOI: 10.1002/pmic.201400412] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2014] [Revised: 10/23/2014] [Accepted: 12/05/2014] [Indexed: 01/18/2023]
Abstract
Proteomics has become one of the main approaches for analyzing and understanding biological systems. Yet similar to other high-throughput analysis methods, the presentation of the large amounts of obtained data in easily interpretable ways remains challenging. In this review, we present an overview of the different ways in which proteomics software supports the visualization and interpretation of proteomics data. The unique challenges and current solutions for visualizing the different aspects of proteomics data, from acquired spectra via protein identification and quantification to pathway analysis, are discussed, and examples of the most useful visualization approaches are highlighted. Finally, we offer our ideas about future directions for proteomics data visualization.
Collapse
Affiliation(s)
- Eystein Oveland
- Proteomics Unit, Department of Biomedicine, University of Bergen, Bergen, Norway; KG Jebsen Centre for Multiple Sclerosis Research, Department of Clinical Medicine, University of Bergen, Bergen, Norway; Department of Clinical Medicine, University of Bergen, Bergen, Norway
| | | | | | | | | | | |
Collapse
|
35
|
Bereman MS. Tools for monitoring system suitability in LC MS/MS centric proteomic experiments. Proteomics 2014; 15:891-902. [DOI: 10.1002/pmic.201400373] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2014] [Revised: 09/12/2014] [Accepted: 10/13/2014] [Indexed: 11/06/2022]
Affiliation(s)
- Michael S. Bereman
- Department of Biological Sciences, Center for Human Health and the Environment; North Carolina State University; Raleigh NC USA
| |
Collapse
|
36
|
Bittremieux W, Kelchtermans P, Valkenborg D, Martens L, Laukens K. jqcML: An Open-Source Java API for Mass Spectrometry Quality Control Data in the qcML Format. J Proteome Res 2014; 13:3484-7. [DOI: 10.1021/pr401274z] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Affiliation(s)
- Wout Bittremieux
- Department
of Mathematics and Computer Science, University of Antwerp, Middelheimlaan
1, B-2020 Antwerp, Belgium
- Biomedical
Informatics Research Center Antwerp (biomina), University of Antwerp/Antwerp University Hospital, Wilrijkstraat 10, B-2650 Antwerp, Belgium
| | - Pieter Kelchtermans
- Department
of Medical Protein Research, VIB, Albert Baertsoenkaai 3, B-9000 Ghent, Belgium
- Department
of Biochemistry, Faculty of Medicine and Health Sciences, Ghent University, Albert Baertsoenkaai 3, B-9000 Ghent, Belgium
- Flemish Institute for Technological Research (VITO), Boeretang 200, B-2400 Mol, Belgium
| | - Dirk Valkenborg
- Flemish Institute for Technological Research (VITO), Boeretang 200, B-2400 Mol, Belgium
- CFP-CeProMa, University of Antwerp, Groenenborgerlaan 171, B-2020 Antwerp, Belgium
- I-BioStat, Hasselt University, Agoralaan - Building D, B-3590 Diepenbeek, Belgium
| | - Lennart Martens
- Department
of Medical Protein Research, VIB, Albert Baertsoenkaai 3, B-9000 Ghent, Belgium
- Department
of Biochemistry, Faculty of Medicine and Health Sciences, Ghent University, Albert Baertsoenkaai 3, B-9000 Ghent, Belgium
| | - Kris Laukens
- Department
of Mathematics and Computer Science, University of Antwerp, Middelheimlaan
1, B-2020 Antwerp, Belgium
- Biomedical
Informatics Research Center Antwerp (biomina), University of Antwerp/Antwerp University Hospital, Wilrijkstraat 10, B-2650 Antwerp, Belgium
| |
Collapse
|