1
|
Gagaoua M, Suman SP, Purslow PP, Lebret B. The color of fresh pork: Consumers expectations, underlying farm-to-fork factors, myoglobin chemistry and contribution of proteomics to decipher the biochemical mechanisms. Meat Sci 2023; 206:109340. [PMID: 37708621 DOI: 10.1016/j.meatsci.2023.109340] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2023] [Revised: 08/14/2023] [Accepted: 09/06/2023] [Indexed: 09/16/2023]
Abstract
The color of fresh pork is a crucial quality attribute that significantly influences consumer perception and purchase decisions. This review first explores consumer expectations and discrimination regarding pork color, as well as an overview of the underlying factors that, from farm-to-fork, contribute to its variation. Understanding the husbandry factors, peri- and post-mortem factors and consumer preferences is essential for the pork industry to meet market demands effectively. This review then delves into current knowledge of pork myoglobin chemistry, its modifications and pork discoloration. Pork myoglobin, which has certain peculiarities comparted to other meat species, plays a weak role in determining pork color, and a thorough understanding of the biochemical changes it undergoes is crucial to understand and improve color stability. Furthermore, the growing role of proteomics as a high-throughput approach and its application as a powerful research tool in meat research, mainly to decipher the biochemical mechanisms involved in pork color determination and identify protein biomarkers, are highlighted. Based on an integrative muscle biology approach, the available proteomics studies on pork color have enabled us to provide the first repertoire of pork color biomarkers, to shortlist and propose a list of proteins for evaluation, and to provide valuable insights into the interconnected biochemical processes implicated in pork color determination. By highlighting the contributions of proteomics in elucidating the biochemical mechanisms underlying pork color determination, the knowledge gained hold significant potential for the pork industry to effectively meet market demands, enhance product quality, and ensure consistent and appealing pork color.
Collapse
Affiliation(s)
| | - Surendranath P Suman
- Department of Animal and Food Sciences, University of Kentucky, Lexington, KY 40546, United States
| | | | | |
Collapse
|
2
|
Wu L, Hoque A, Lam H. Spectroscape enables real-time query and visualization of a spectral archive in proteomics. Nat Commun 2023; 14:6267. [PMID: 37805652 PMCID: PMC10560257 DOI: 10.1038/s41467-023-42006-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2023] [Accepted: 09/26/2023] [Indexed: 10/09/2023] Open
Abstract
In proteomics, spectral archives organize the enormous amounts of publicly available peptide tandem mass spectra by similarity, offering opportunities for error correction and novel discoveries. Here we adapt an indexing algorithm developed by Facebook for organizing online multimedia resources to tandem mass spectra and achieve practically instantaneous retrieval and clustering of approximate nearest neighbors in a large spectral archive. An interactive web-based graphical user interface enables the user to view a query spectrum in its clustered neighborhood, which facilitates contextual validation of peptide identifications and exploration of the dark proteome.
Collapse
Affiliation(s)
- Long Wu
- Department of Chemical and Biological Engineering, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong
- Department of Electrical and Computer Engineering, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong
| | - Ayman Hoque
- Department of Chemical and Biological Engineering, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong
| | - Henry Lam
- Department of Chemical and Biological Engineering, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong.
| |
Collapse
|
3
|
Wen B, Zhang B. PepQuery2 democratizes public MS proteomics data for rapid peptide searching. Nat Commun 2023; 14:2213. [PMID: 37072382 PMCID: PMC10113256 DOI: 10.1038/s41467-023-37462-4] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2022] [Accepted: 03/17/2023] [Indexed: 04/20/2023] Open
Abstract
We present PepQuery2, which leverages a new tandem mass spectrometry (MS/MS) data indexing approach to enable ultrafast, targeted identification of novel and known peptides in any local or publicly available MS proteomics datasets. The stand-alone version of PepQuery2 allows directly searching more than one billion indexed MS/MS spectra in the PepQueryDB or any public datasets from PRIDE, MassIVE, iProX, or jPOSTrepo, whereas the web version enables users to search datasets in PepQueryDB with a user-friendly interface. We demonstrate the utilities of PepQuery2 in a wide range of applications including detecting proteomic evidence for genomically predicted novel peptides, validating novel and known peptides identified using spectrum-centric database searching, prioritizing tumor-specific antigens, identifying missing proteins, and selecting proteotypic peptides for targeted proteomics experiments. By putting public MS proteomics data directly into the hands of scientists, PepQuery2 opens many new ways to transform these data into useful information for the broad research community.
Collapse
Affiliation(s)
- Bo Wen
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX, 77030, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA
- Department of Genome Sciences, University of Washington, Seattle, WA, 98195, USA
| | - Bing Zhang
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX, 77030, USA.
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA.
| |
Collapse
|
4
|
Claeys T, Menu M, Bouwmeester R, Gevaert K, Martens L. Machine Learning on Large-Scale Proteomics Data Identifies Tissue and Cell-Type Specific Proteins. J Proteome Res 2023; 22:1181-1192. [PMID: 36963412 PMCID: PMC10088018 DOI: 10.1021/acs.jproteome.2c00644] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/26/2023]
Abstract
Using data from 183 public human data sets from PRIDE, a machine learning model was trained to identify tissue and cell-type specific protein patterns. PRIDE projects were searched with ionbot and tissue/cell type annotation was manually added. Data from physiological samples were used to train a Random Forest model on protein abundances to classify samples into tissues and cell types. Subsequently, a one-vs-all classification and feature importance were used to analyze the most discriminating protein abundances per class. Based on protein abundance alone, the model was able to predict tissues with 98% accuracy, and cell types with 99% accuracy. The F-scores describe a clear view on tissue-specific proteins and tissue-specific protein expression patterns. In-depth feature analysis shows slight confusion between physiologically similar tissues, demonstrating the capacity of the algorithm to detect biologically relevant patterns. These results can in turn inform downstream uses, from identification of the tissue of origin of proteins in complex samples such as liquid biopsies, to studying the proteome of tissue-like samples such as organoids and cell lines.
Collapse
Affiliation(s)
- Tine Claeys
- VIB-UGent Center for Medical Biotechnology, VIB, 9052 Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, 9052 Ghent, Belgium
| | - Maxime Menu
- VIB-UGent Center for Medical Biotechnology, VIB, 9052 Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, 9052 Ghent, Belgium
| | - Robbin Bouwmeester
- VIB-UGent Center for Medical Biotechnology, VIB, 9052 Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, 9052 Ghent, Belgium
| | - Kris Gevaert
- VIB-UGent Center for Medical Biotechnology, VIB, 9052 Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, 9052 Ghent, Belgium
| | - Lennart Martens
- VIB-UGent Center for Medical Biotechnology, VIB, 9052 Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, 9052 Ghent, Belgium
| |
Collapse
|
5
|
NeuroLINCS Proteomics: Defining human-derived iPSC proteomes and protein signatures of pluripotency. Sci Data 2023; 10:24. [PMID: 36631473 PMCID: PMC9834231 DOI: 10.1038/s41597-022-01687-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2021] [Accepted: 09/07/2022] [Indexed: 01/13/2023] Open
Abstract
The National Institute of Health (NIH) Library of integrated network-based cellular signatures (LINCS) program is premised on the generation of a publicly available data resource of cell-based biochemical responses or "signatures" to genetic or environmental perturbations. NeuroLINCS uses human inducible pluripotent stem cells (hiPSCs), derived from patients and healthy controls, and differentiated into motor neuron cell cultures. This multi-laboratory effort strives to establish i) robust multi-omic workflows for hiPSC and differentiated neuronal cultures, ii) public annotated data sets and iii) relevant and targetable biological pathways of spinal muscular atrophy (SMA) and amyotrophic lateral sclerosis (ALS). Here, we focus on the proteomics and the quality of the developed workflow of hiPSC lines from 6 individuals, though epigenomics and transcriptomics data are also publicly available. Known and commonly used markers representing 73 proteins were reproducibly quantified with consistent expression levels across all hiPSC lines. Data quality assessments, data levels and metadata of all 6 genetically diverse human iPSCs analysed by DIA-MS are parsable and available as a high-quality resource to the public.
Collapse
|
6
|
Maia GA, Filho VB, Kawagoe EK, Teixeira Soratto TA, Moreira RS, Grisard EC, Wagner G. AnnotaPipeline: An integrated tool to annotate eukaryotic proteins using multi-omics data. Front Genet 2022; 13:1020100. [DOI: 10.3389/fgene.2022.1020100] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2022] [Accepted: 11/11/2022] [Indexed: 11/23/2022] Open
Abstract
Assignment of gene function has been a crucial, laborious, and time-consuming step in genomics. Due to a variety of sequencing platforms that generates increasing amounts of data, manual annotation is no longer feasible. Thus, the need for an integrated, automated pipeline allowing the use of experimental data towards validation of in silico prediction of gene function is of utmost relevance. Here, we present a computational workflow named AnnotaPipeline that integrates distinct software and data types on a proteogenomic approach to annotate and validate predicted features in genomic sequences. Based on FASTA (i) nucleotide or (ii) protein sequences or (iii) structural annotation files (GFF3), users can input FASTQ RNA-seq data, MS/MS data from mzXML or similar formats, as the pipeline uses both transcriptomic and proteomic information to corroborate annotations and validate gene prediction, providing transcription and expression evidence for functional annotation. Reannotation of the available Arabidopsis thaliana, Caenorhabditis elegans, Candida albicans, Trypanosoma cruzi, and Trypanosoma rangeli genomes was performed using the AnnotaPipeline, resulting in a higher proportion of annotated proteins and a reduced proportion of hypothetical proteins when compared to the annotations publicly available for these organisms. AnnotaPipeline is a Unix-based pipeline developed using Python and is available at: https://github.com/bioinformatics-ufsc/AnnotaPipeline.
Collapse
|
7
|
An Optimized Comparative Proteomic Approach as a Tool in Neurodegenerative Disease Research. Cells 2022; 11:cells11172653. [PMID: 36078061 PMCID: PMC9454658 DOI: 10.3390/cells11172653] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2022] [Revised: 08/16/2022] [Accepted: 08/22/2022] [Indexed: 11/25/2022] Open
Abstract
Recent advances in proteomic technologies now allow unparalleled assessment of the molecular composition of a wide range of sample types. However, the application of such technologies and techniques should not be undertaken lightly. Here, we describe why the design of a proteomics experiment itself is only the first step in yielding high-quality, translatable results. Indeed, the effectiveness and/or impact of the majority of contemporary proteomics screens are hindered not by commonly considered technical limitations such as low proteome coverage but rather by insufficient analyses. Proteomic experimentation requires a careful methodological selection to account for variables from sample collection, through to database searches for peptide identification to standardised post-mass spectrometry options directed analysis workflow, which should be adjusted for each study, from determining when and how to filter proteomic data to choosing holistic versus trend-wise analyses for biologically relevant patterns. Finally, we highlight and discuss the difficulties inherent in the modelling and study of the majority of progressive neurodegenerative conditions. We provide evidence (in the context of neurodegenerative research) for the benefit of undertaking a comparative approach through the application of the above considerations in the alignment of publicly available pre-existing data sets to identify potential novel regulators of neuronal stability.
Collapse
|
8
|
Integrated view and comparative analysis of baseline protein expression in mouse and rat tissues. PLoS Comput Biol 2022; 18:e1010174. [PMID: 35714157 PMCID: PMC9246241 DOI: 10.1371/journal.pcbi.1010174] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2021] [Revised: 06/30/2022] [Accepted: 05/05/2022] [Indexed: 11/19/2022] Open
Abstract
The increasingly large amount of proteomics data in the public domain enables, among other applications, the combined analyses of datasets to create comparative protein expression maps covering different organisms and different biological conditions. Here we have reanalysed public proteomics datasets from mouse and rat tissues (14 and 9 datasets, respectively), to assess baseline protein abundance. Overall, the aggregated dataset contained 23 individual datasets, including a total of 211 samples coming from 34 different tissues across 14 organs, comprising 9 mouse and 3 rat strains, respectively.
In all cases, we studied the distribution of canonical proteins between the different organs. The number of canonical proteins per dataset ranged from 273 (tendon) and 9,715 (liver) in mouse, and from 101 (tendon) and 6,130 (kidney) in rat. Then, we studied how protein abundances compared across different datasets and organs for both species. As a key point we carried out a comparative analysis of protein expression between mouse, rat and human tissues. We observed a high level of correlation of protein expression among orthologs between all three species in brain, kidney, heart and liver samples, whereas the correlation of protein expression was generally slightly lower between organs within the same species. Protein expression results have been integrated into the resource Expression Atlas for widespread dissemination.
Collapse
|
9
|
Walzer M, García-Seisdedos D, Prakash A, Brack P, Crowther P, Graham RL, George N, Mohammed S, Moreno P, Papatheodorou I, Hubbard SJ, Vizcaíno JA. Implementing the reuse of public DIA proteomics datasets: from the PRIDE database to Expression Atlas. Sci Data 2022; 9:335. [PMID: 35701420 PMCID: PMC9197839 DOI: 10.1038/s41597-022-01380-9] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2021] [Accepted: 05/12/2022] [Indexed: 11/14/2022] Open
Abstract
The number of mass spectrometry (MS)-based proteomics datasets in the public domain keeps increasing, particularly those generated by Data Independent Acquisition (DIA) approaches such as SWATH-MS. Unlike Data Dependent Acquisition datasets, the re-use of DIA datasets has been rather limited to date, despite its high potential, due to the technical challenges involved. We introduce a (re-)analysis pipeline for public SWATH-MS datasets which includes a combination of metadata annotation protocols, automated workflows for MS data analysis, statistical analysis, and the integration of the results into the Expression Atlas resource. Automation is orchestrated with Nextflow, using containerised open analysis software tools, rendering the pipeline readily available and reproducible. To demonstrate its utility, we reanalysed 10 public DIA datasets from the PRIDE database, comprising 1,278 SWATH-MS runs. The robustness of the analysis was evaluated, and the results compared to those obtained in the original publications. The final expression values were integrated into Expression Atlas, making SWATH-MS experiments more widely available and combining them with expression data originating from other proteomics and transcriptomics datasets.
Collapse
Affiliation(s)
- Mathias Walzer
- European Molecular Biology Laboratory, EMBL-European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, CB10 1SD, United Kingdom.
| | - David García-Seisdedos
- European Molecular Biology Laboratory, EMBL-European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, CB10 1SD, United Kingdom
| | - Ananth Prakash
- European Molecular Biology Laboratory, EMBL-European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, CB10 1SD, United Kingdom
| | - Paul Brack
- Division of Evolution, Infection and Genomics, School of Biological Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester Academic Health Science Centre, Oxford Road, Manchester, M13 9PT, United Kingdom
| | - Peter Crowther
- Melandra Limited, 16 Brook Road, Urmston, Manchester, M41 5RY, United Kingdom
| | - Robert L Graham
- School of Biological Sciences, Chlorine Gardens, Queen's University Belfast, Belfast, BT9 5DL, United Kingdom
| | - Nancy George
- European Molecular Biology Laboratory, EMBL-European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, CB10 1SD, United Kingdom
| | - Suhaib Mohammed
- European Molecular Biology Laboratory, EMBL-European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, CB10 1SD, United Kingdom
| | - Pablo Moreno
- European Molecular Biology Laboratory, EMBL-European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, CB10 1SD, United Kingdom
| | - Irene Papatheodorou
- European Molecular Biology Laboratory, EMBL-European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, CB10 1SD, United Kingdom
| | - Simon J Hubbard
- Division of Evolution, Infection and Genomics, School of Biological Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester Academic Health Science Centre, Oxford Road, Manchester, M13 9PT, United Kingdom
| | - Juan Antonio Vizcaíno
- European Molecular Biology Laboratory, EMBL-European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, CB10 1SD, United Kingdom.
| |
Collapse
|
10
|
Tiberti M, Terkelsen T, Degn K, Beltrame L, Cremers TC, da Piedade I, Di Marco M, Maiani E, Papaleo E. MutateX: an automated pipeline for in silico saturation mutagenesis of protein structures and structural ensembles. Brief Bioinform 2022; 23:6552273. [PMID: 35323860 DOI: 10.1093/bib/bbac074] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2021] [Revised: 01/28/2022] [Accepted: 02/16/2022] [Indexed: 12/26/2022] Open
Abstract
Mutations, which result in amino acid substitutions, influence the stability of proteins and their binding to biomolecules. A molecular understanding of the effects of protein mutations is both of biotechnological and medical relevance. Empirical free energy functions that quickly estimate the free energy change upon mutation (ΔΔG) can be exploited for systematic screenings of proteins and protein complexes. In silico saturation mutagenesis can guide the design of new experiments or rationalize the consequences of known mutations. Often software such as FoldX, while fast and reliable, lack the necessary automation features to apply them in a high-throughput manner. We introduce MutateX, a software to automate the prediction of ΔΔGs associated with the systematic mutation of each residue within a protein, or protein complex to all other possible residue types, using the FoldX energy function. MutateX also supports ΔΔG calculations over protein ensembles, upon post-translational modifications and in multimeric assemblies. At the heart of MutateX lies an automated pipeline engine that handles input preparation, parallelization and outputs publication-ready figures. We illustrate the MutateX protocol applied to different case studies. The results of the high-throughput scan provided by our tools can help in different applications, such as the analysis of disease-associated mutations, to complement experimental deep mutational scans, or assist the design of variants for industrial applications. MutateX is a collection of Python tools that relies on open-source libraries. It is available free of charge under the GNU General Public License from https://github.com/ELELAB/mutatex.
Collapse
Affiliation(s)
- Matteo Tiberti
- Cancer Structural Biology, Danish Cancer Society Research Center, 2100, Copenhagen, Denmark
| | - Thilde Terkelsen
- Cancer Structural Biology, Danish Cancer Society Research Center, 2100, Copenhagen, Denmark
| | - Kristine Degn
- Cancer Systems Biology, Section for Bioinformatics, Department of Health and Technology, Technical University of Denmark, 2800, Lyngby, Denmark
| | - Ludovica Beltrame
- Cancer Structural Biology, Danish Cancer Society Research Center, 2100, Copenhagen, Denmark
| | - Tycho Canter Cremers
- Cancer Structural Biology, Danish Cancer Society Research Center, 2100, Copenhagen, Denmark
| | - Isabelle da Piedade
- Cancer Structural Biology, Danish Cancer Society Research Center, 2100, Copenhagen, Denmark
| | - Miriam Di Marco
- Cancer Structural Biology, Danish Cancer Society Research Center, 2100, Copenhagen, Denmark
| | - Emiliano Maiani
- Cancer Structural Biology, Danish Cancer Society Research Center, 2100, Copenhagen, Denmark
| | - Elena Papaleo
- Cancer Structural Biology, Danish Cancer Society Research Center, 2100, Copenhagen, Denmark.,Cancer Systems Biology, Section for Bioinformatics, Department of Health and Technology, Technical University of Denmark, 2800, Lyngby, Denmark.,Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| |
Collapse
|
11
|
Carrasco-Reinado R, Bermudez-Sauco M, Escobar-Niño A, Cantoral JM, Fernández-Acero FJ. Development of the "Applied Proteomics" Concept for Biotechnology Applications in Microalgae: Example of the Proteome Data in Nannochloropsis gaditana. Mar Drugs 2021; 20:38. [PMID: 35049892 PMCID: PMC8780095 DOI: 10.3390/md20010038] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2021] [Revised: 12/19/2021] [Accepted: 12/26/2021] [Indexed: 11/23/2022] Open
Abstract
Most of the marine ecosystems on our planet are still unknown. Among these ecosystems, microalgae act as a baseline due to their role as primary producers. The estimated millions of species of these microorganisms represent an almost infinite source of potentially active biocomponents offering unlimited biotechnology applications. This review considers current research in microalgae using the "omics" approach, which today is probably the most important biotechnology tool. These techniques enable us to obtain a large volume of data from a single experiment. The specific focus of this review is proteomics as a technique capable of generating a large volume of interesting information in a single proteomics assay, and particularly the concept of applied proteomics. As an example, this concept has been applied to the study of Nannochloropsis gaditana, in which proteomics data generated are transformed into information of high commercial value by identifying proteins with direct applications in the biomedical and agri-food fields, such as the protein designated UCA01 which presents antitumor activity, obtained from N. gaditana.
Collapse
Affiliation(s)
- Rafael Carrasco-Reinado
- Microbiology Laboratory, Institute of Viticulture and Agri-Food Research (IVAGRO), Marine and Environmental Sciences Faculty, University of Cadiz (UCA), 11500 Puerto Real, Spain; (R.C.-R.); (M.B.-S.); (A.E.-N.); (J.M.C.)
| | - María Bermudez-Sauco
- Microbiology Laboratory, Institute of Viticulture and Agri-Food Research (IVAGRO), Marine and Environmental Sciences Faculty, University of Cadiz (UCA), 11500 Puerto Real, Spain; (R.C.-R.); (M.B.-S.); (A.E.-N.); (J.M.C.)
| | - Almudena Escobar-Niño
- Microbiology Laboratory, Institute of Viticulture and Agri-Food Research (IVAGRO), Marine and Environmental Sciences Faculty, University of Cadiz (UCA), 11500 Puerto Real, Spain; (R.C.-R.); (M.B.-S.); (A.E.-N.); (J.M.C.)
| | - Jesús M. Cantoral
- Microbiology Laboratory, Institute of Viticulture and Agri-Food Research (IVAGRO), Marine and Environmental Sciences Faculty, University of Cadiz (UCA), 11500 Puerto Real, Spain; (R.C.-R.); (M.B.-S.); (A.E.-N.); (J.M.C.)
| | - Francisco Javier Fernández-Acero
- Microbiology Laboratory, Institute of Viticulture and Agri-Food Research (IVAGRO), Marine and Environmental Sciences Faculty, University of Cadiz (UCA), 11500 Puerto Real, Spain; (R.C.-R.); (M.B.-S.); (A.E.-N.); (J.M.C.)
| |
Collapse
|
12
|
Chen T, Ma J, Liu Y, Chen Z, Xiao N, Lu Y, Fu Y, Yang C, Li M, Wu S, Wang X, Li D, He F, Hermjakob H, Zhu Y. iProX in 2021: connecting proteomics data sharing with big data. Nucleic Acids Res 2021; 50:D1522-D1527. [PMID: 34871441 PMCID: PMC8728291 DOI: 10.1093/nar/gkab1081] [Citation(s) in RCA: 221] [Impact Index Per Article: 73.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2021] [Revised: 10/16/2021] [Accepted: 10/22/2021] [Indexed: 12/12/2022] Open
Abstract
The rapid development of proteomics studies has resulted in large volumes of experimental data. The emergence of big data platform provides the opportunity to handle these large amounts of data. The integrated proteome resource, iProX (https://www.iprox.cn), which was initiated in 2017, has been greatly improved with an up-to-date big data platform implemented in 2021. Here, we describe the main iProX developments since its first publication in Nucleic Acids Research in 2019. First, a hyper-converged architecture with high scalability supports the submission process. A hadoop cluster can store large amounts of proteomics datasets, and a distributed, RESTful-styled Elastic Search engine can query millions of records within one second. Also, several new features, including the Universal Spectrum Identifier (USI) mechanism proposed by ProteomeXchange, RESTful Web Service API, and a high-efficiency reanalysis pipeline, have been added to iProX for better open data sharing. By the end of August 2021, 1526 datasets had been submitted to iProX, reaching a total data volume of 92.42TB. With the implementation of the big data platform, iProX can support PB-level data storage, hundreds of billions of spectra records, and second-level latency service capabilities that meet the requirements of the fast growing field of proteomics.
Collapse
Affiliation(s)
- Tao Chen
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Jie Ma
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Yi Liu
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Zhiguang Chen
- School of Computer Science and Engineering, Sun Yat-Sen University, Guangzhou 26469, China
| | - Nong Xiao
- School of Computer Science and Engineering, Sun Yat-Sen University, Guangzhou 26469, China
| | - Yutong Lu
- School of Computer Science and Engineering, Sun Yat-Sen University, Guangzhou 26469, China
| | - Yinjin Fu
- School of Computer Science and Engineering, Sun Yat-Sen University, Guangzhou 26469, China
| | - Chunyuan Yang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Mansheng Li
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Songfeng Wu
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Xue Wang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Dongsheng Li
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Fuchu He
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Henning Hermjakob
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China.,European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Yunping Zhu
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China.,Basic Medical School, Anhui Medical University, Anhui 230032, China
| |
Collapse
|
13
|
Keddy KH, Saha S, Okeke IN, Kalule JB, Qamar FN, Kariuki S. Combating Childhood Infections in LMICs: evaluating the contribution of Big Data Big data, biomarkers and proteomics: informing childhood diarrhoeal disease management in Low- and Middle-Income Countries. EBioMedicine 2021; 73:103668. [PMID: 34742129 PMCID: PMC8579132 DOI: 10.1016/j.ebiom.2021.103668] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2021] [Revised: 09/26/2021] [Accepted: 10/20/2021] [Indexed: 01/20/2023] Open
Abstract
Despite efforts to reduce the global burden of childhood diarrhoea, 50% of all cases globally occur in children under five years in Low–Income and Middle- Income Countries (LMICs) and knowledge gaps remain regarding the aetiological diagnosis, introduction of diarrhoeal vaccines, and the role of environmental enteric dysfunction and severe acute malnutrition. Biomarkers may assist in understanding disease processes, from diagnostics, to management of childhood diarrhoea and the sequelae to vaccine development. Proteomics has the potential to assist in the identification of new biomarkers to understand the processes in the development of childhood diarrhoea and to aid in developing new vaccines. Centralised repositories that enable mining of large data sets to better characterise risk factors, the proteome of both the patient and the different diarrhoeal pathogens, and the environment, could inform patient management and vaccine development, providing a systems biological approach to address the burden of childhood diarrhoea in LMICs.
Collapse
Affiliation(s)
- Karen H Keddy
- Tuberculosis Platform, South African Medical Research Council, 1 Soutpansberg Rd, Pretoria, 0001, South Africa.
| | - Senjuti Saha
- Child Health Research Foundation, 23/2 Khilji Road, Mohammadpur, Dhaka 1207, Bangladesh
| | - Iruka N Okeke
- Department of Pharmaceutical Microbiology, Faculty of Pharmacy, University of Ibadan, Oyo State, Nigeria
| | - John Bosco Kalule
- Biotechnical and Diagnostic Sciences, College of Veterinary Medicine Animal Resources and Biosecurity, Makerere University, Uganda
| | - Farah Naz Qamar
- Department of Pediatrics and Child Health. Aga Khan University, Stadoum road Karachi, Pakistan 74800
| | - Samuel Kariuki
- Centre for Microbiology Research, Kenya Medical Research Institute, Off Mbagathi Road, Nairobi, Kenya
| |
Collapse
|
14
|
An integrated landscape of protein expression in human cancer. Sci Data 2021; 8:115. [PMID: 33893311 PMCID: PMC8065022 DOI: 10.1038/s41597-021-00890-2] [Citation(s) in RCA: 28] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2020] [Accepted: 03/12/2021] [Indexed: 12/14/2022] Open
Abstract
Using 11 proteomics datasets, mostly available through the PRIDE database, we assembled a reference expression map for 191 cancer cell lines and 246 clinical tumour samples, across 13 lineages. We found unique peptides identified only in tumour samples despite a much higher coverage in cell lines. These were mainly mapped to proteins related to regulation of signalling receptor activity. Correlations between baseline expression in cell lines and tumours were calculated. We found these to be highly similar across all samples with most similarity found within a given sample type. Integration of proteomics and transcriptomics data showed median correlation across cell lines to be 0.58 (range between 0.43 and 0.66). Additionally, in agreement with previous studies, variation in mRNA levels was often a poor predictor of changes in protein abundance. To our knowledge, this work constitutes the first meta-analysis focusing on cancer-related public proteomics datasets. We therefore also highlight shortcomings and limitations of such studies. All data is available through PRIDE dataset identifier PXD013455 and in Expression Atlas.
Collapse
|
15
|
Caufield JH, Fu J, Wang D, Guevara-Gonzalez V, Wang W, Ping P. A Second Look at FAIR in Proteomic Investigations. J Proteome Res 2021; 20:2182-2186. [PMID: 33719446 DOI: 10.1021/acs.jproteome.1c00177] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Proteomics is, by definition, comprehensive and large-scale, seeking to unravel ome-level protein features with phenotypic information on an entire system, an organ, cells, or organisms. This scope consistently involves and extends beyond single experiments. Multitudinous resources now exist to assist in making the results of proteomics experiments more findable, accessible, interoperable, and reusable (FAIR), yet many tools are awaiting to be adopted by our community. Here we highlight strategies for expanding the impact of proteomics data beyond single studies. We show how linking specific terminologies, identifiers, and text (words) can unify individual data points across a wide spectrum of studies and, more importantly, how this approach may potentially reveal novel relationships. In this effort, we explain how data sets and methods can be rendered more linkable and how this maximizes their value. We also include a discussion on how data linking strategies benefit stakeholders across the proteomics community and beyond.
Collapse
|
16
|
Bandeira N, Deutsch EW, Kohlbacher O, Martens L, Vizcaíno JA. Data Management of Sensitive Human Proteomics Data: Current Practices, Recommendations, and Perspectives for the Future. Mol Cell Proteomics 2021; 20:100071. [PMID: 33711481 PMCID: PMC8056256 DOI: 10.1016/j.mcpro.2021.100071] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2020] [Revised: 03/01/2021] [Accepted: 03/02/2021] [Indexed: 12/12/2022] Open
Abstract
Today it is the norm that all relevant proteomics data that support the conclusions in scientific publications are made available in public proteomics data repositories. However, given the increase in the number of clinical proteomics studies, an important emerging topic is the management and dissemination of clinical, and thus potentially sensitive, human proteomics data. Both in the United States and in the European Union, there are legal frameworks protecting the privacy of individuals. Implementing privacy standards for publicly released research data in genomics and transcriptomics has led to processes to control who may access the data, so-called "controlled access" data. In parallel with the technological developments in the field, it is clear that the privacy risks of sharing proteomics data need to be properly assessed and managed. In our view, the proteomics community must be proactive in addressing these issues. Yet a careful balance must be kept. On the one hand, neglecting to address the potential of identifiability in human proteomics data could lead to reputational damage of the field, while on the other hand, erecting barriers to open access to clinical proteomics data will inevitably reduce reuse of proteomics data and could substantially delay critical discoveries in biomedical research. In order to balance these apparently conflicting requirements for data privacy and efficient use and reuse of research efforts through the sharing of clinical proteomics data, development efforts will be needed at different levels including bioinformatics infrastructure, policymaking, and mechanisms of oversight.
Collapse
Affiliation(s)
- Nuno Bandeira
- Center for Computational Mass Spectrometry, University of California, San Diego (UCSD), La Jolla, California, USA; Department Computer Science and Engineering, University of California, San Diego (UCSD), La Jolla, California, USA; Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego (UCSD), La Jolla, California, USA
| | | | - Oliver Kohlbacher
- Institute for Bioinformatics and Medical Informatics, University of Tübingen, Tübingen, Germany; Quantitative Biology Center, University of Tübingen, Tübingen, Germany; Biomolecular Interactions, Max Planck Institute for Developmental Biology, Tübingen, Germany; Institute for Translational Bioinformatics, University Hospital Tübingen, Tübingen, Germany
| | - Lennart Martens
- VIB-UGent Center for Medical Biotechnology, VIB, Ghent, Belgium; Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
| | - Juan Antonio Vizcaíno
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom.
| |
Collapse
|
17
|
Isolation of Industrial Important Bioactive Compounds from Microalgae. Molecules 2021; 26:molecules26040943. [PMID: 33579001 PMCID: PMC7916812 DOI: 10.3390/molecules26040943] [Citation(s) in RCA: 34] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2020] [Revised: 12/24/2020] [Accepted: 01/05/2021] [Indexed: 12/24/2022] Open
Abstract
Microalgae are known as a rich source of bioactive compounds which exhibit different biological activities. Increased demand for sustainable biomass for production of important bioactive components with various potential especially therapeutic applications has resulted in noticeable interest in algae. Utilisation of microalgae in multiple scopes has been growing in various industries ranging from harnessing renewable energy to exploitation of high-value products. The focuses of this review are on production and the use of value-added components obtained from microalgae with current and potential application in the pharmaceutical, nutraceutical, cosmeceutical, energy and agri-food industries, as well as for bioremediation. Moreover, this work discusses the advantage, potential new beneficial strains, applications, limitations, research gaps and future prospect of microalgae in industry.
Collapse
|
18
|
Bazile J, Jaffrezic F, Dehais P, Reichstadt M, Klopp C, Laloe D, Bonnet M. Molecular signatures of muscle growth and composition deciphered by the meta-analysis of age-related public transcriptomics data. Physiol Genomics 2020; 52:322-332. [PMID: 32657225 DOI: 10.1152/physiolgenomics.00020.2020] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
The lean-to-fat ratio is a major issue in the beef meat industry from both carcass and meat production perspectives. This industrial perspective has motivated meat physiologists to use transcriptomics technologies to decipher mechanisms behind fat deposition within muscle during the time course of muscle growth. However, synthetic biological information from this volume of data remains to be produced to identify mechanisms found in various breeds and rearing practices. We conducted a meta-analysis on 10 transcriptomic data sets stored in public databases, from the longissimus thoracis of five different bovine breeds divergent by age. We updated gene identifiers on the last version of the bovine genome (UCD1.2), and the 715 genes common to the 10 studies were subjected to the meta-analysis. Of the 238 genes differentially expressed (DEG), we identified a transcriptional signature of the dynamic regulation of glycolytic and oxidative metabolisms that agrees with a known shift between those two pathways from the animal puberty. We proposed some master genes of the myogenesis, namely MYOG and MAPK14, as probable regulators of the glycolytic and oxidative metabolisms. We also identified overexpressed genes related to lipid metabolism (APOE, LDLR, MXRA8, and HSP90AA1) that may contribute to the expected enhanced marbling as age increases. Lastly, we proposed a transcriptional signature related to the induction (YBX1) or repression (MAPK14, YWAH, ERBB2) of the commitment of myogenic progenitors into the adipogenic lineage. The relationships between the abundance of the identified mRNA and marbling values remain to be analyzed in a marbling biomarkers discovery perspectives.
Collapse
Affiliation(s)
- Jeanne Bazile
- INRAE, UMR Herbivores, Université Clermont Auvergne, VetAgro Sup, Saint-Genès-Champanelle, France
| | - Florence Jaffrezic
- INRAE, UMR1313 Génétique Animale et Biologie Intégrative, Jouy-en-Josas, France
| | - Patrice Dehais
- Plate-forme bio-informatique Genotoul, Mathématiques et Informatique Appliquées de Toulouse, INRAE, Castanet Tolosan, France.,SIGENAE, GenPhySE, Université de Toulouse, INRAE, ENVT, Castanet Tolosan, France
| | - Matthieu Reichstadt
- INRAE, UMR Herbivores, Université Clermont Auvergne, VetAgro Sup, Saint-Genès-Champanelle, France
| | - Christophe Klopp
- Plate-forme bio-informatique Genotoul, Mathématiques et Informatique Appliquées de Toulouse, INRAE, Castanet Tolosan, France.,SIGENAE, GenPhySE, Université de Toulouse, INRAE, ENVT, Castanet Tolosan, France
| | - Denis Laloe
- INRAE, UMR1313 Génétique Animale et Biologie Intégrative, Jouy-en-Josas, France
| | - Muriel Bonnet
- INRAE, UMR Herbivores, Université Clermont Auvergne, VetAgro Sup, Saint-Genès-Champanelle, France
| |
Collapse
|
19
|
Perez-Riverol Y, Csordas A, Bai J, Bernal-Llinares M, Hewapathirana S, Kundu DJ, Inuganti A, Griss J, Mayer G, Eisenacher M, Pérez E, Uszkoreit J, Pfeuffer J, Sachsenberg T, Yilmaz S, Tiwary S, Cox J, Audain E, Walzer M, Jarnuczak AF, Ternent T, Brazma A, Vizcaíno JA. The PRIDE database and related tools and resources in 2019: improving support for quantification data. Nucleic Acids Res 2020; 47:D442-D450. [PMID: 30395289 PMCID: PMC6323896 DOI: 10.1093/nar/gky1106] [Citation(s) in RCA: 5029] [Impact Index Per Article: 1257.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2018] [Accepted: 10/22/2018] [Indexed: 02/06/2023] Open
Abstract
The PRoteomics IDEntifications (PRIDE) database (https://www.ebi.ac.uk/pride/) is the world’s largest data repository of mass spectrometry-based proteomics data, and is one of the founding members of the global ProteomeXchange (PX) consortium. In this manuscript, we summarize the developments in PRIDE resources and related tools since the previous update manuscript was published in Nucleic Acids Research in 2016. In the last 3 years, public data sharing through PRIDE (as part of PX) has definitely become the norm in the field. In parallel, data re-use of public proteomics data has increased enormously, with multiple applications. We first describe the new architecture of PRIDE Archive, the archival component of PRIDE. PRIDE Archive and the related data submission framework have been further developed to support the increase in submitted data volumes and additional data types. A new scalable and fault tolerant storage backend, Application Programming Interface and web interface have been implemented, as a part of an ongoing process. Additionally, we emphasize the improved support for quantitative proteomics data through the mzTab format. At last, we outline key statistics on the current data contents and volume of downloads, and how PRIDE data are starting to be disseminated to added-value resources including Ensembl, UniProt and Expression Atlas.
Collapse
Affiliation(s)
- Yasset Perez-Riverol
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Attila Csordas
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Jingwen Bai
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Manuel Bernal-Llinares
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Suresh Hewapathirana
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Deepti J Kundu
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Avinash Inuganti
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Johannes Griss
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK.,Division of Immunology, Allergy and Infectious Diseases, Department of Dermatology, Medical University of Vienna, Vienna, 1090, Austria
| | - Gerhard Mayer
- Ruhr University Bochum, Medical Faculty, Medizinisches Proteom-Center, D-44801 Bochum, Germany
| | - Martin Eisenacher
- Ruhr University Bochum, Medical Faculty, Medizinisches Proteom-Center, D-44801 Bochum, Germany
| | - Enrique Pérez
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Julian Uszkoreit
- Ruhr University Bochum, Medical Faculty, Medizinisches Proteom-Center, D-44801 Bochum, Germany
| | - Julianus Pfeuffer
- Applied Bioinformatics, Department for Computer Science, University of Tuebingen, Sand 14, 72076 Tuebingen, Germany
| | - Timo Sachsenberg
- Applied Bioinformatics, Department for Computer Science, University of Tuebingen, Sand 14, 72076 Tuebingen, Germany
| | - Sule Yilmaz
- Computational Systems Biochemistry, Max Planck Institute for Biochemistry, Martinsried, 82152, Germany
| | - Shivani Tiwary
- Computational Systems Biochemistry, Max Planck Institute for Biochemistry, Martinsried, 82152, Germany
| | - Jürgen Cox
- Computational Systems Biochemistry, Max Planck Institute for Biochemistry, Martinsried, 82152, Germany
| | - Enrique Audain
- Department of Congenital Heart Disease and Pediatric Cardiology, Universitätsklinikum Schleswig-Holstein Kiel, Kiel, 24105, Germany
| | - Mathias Walzer
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Andrew F Jarnuczak
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Tobias Ternent
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Alvis Brazma
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Juan Antonio Vizcaíno
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| |
Collapse
|
20
|
Verheggen K, Raeder H, Berven FS, Martens L, Barsnes H, Vaudel M. Anatomy and evolution of database search engines-a central component of mass spectrometry based proteomic workflows. MASS SPECTROMETRY REVIEWS 2020; 39:292-306. [PMID: 28902424 DOI: 10.1002/mas.21543] [Citation(s) in RCA: 60] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/06/2016] [Accepted: 07/05/2017] [Indexed: 06/07/2023]
Abstract
Sequence database search engines are bioinformatics algorithms that identify peptides from tandem mass spectra using a reference protein sequence database. Two decades of development, notably driven by advances in mass spectrometry, have provided scientists with more than 30 published search engines, each with its own properties. In this review, we present the common paradigm behind the different implementations, and its limitations for modern mass spectrometry datasets. We also detail how the search engines attempt to alleviate these limitations, and provide an overview of the different software frameworks available to the researcher. Finally, we highlight alternative approaches for the identification of proteomic mass spectrometry datasets, either as a replacement for, or as a complement to, sequence database search engines.
Collapse
Affiliation(s)
- Kenneth Verheggen
- VIB-UGent Center for Medical Biotechnology, VIB, Ghent, Belgium
- Department of Biochemistry, Ghent University, Ghent, Belgium
- Bioinformatics Institute Ghent, Ghent University, Ghent, Belgium
| | - Helge Raeder
- KG Jebsen Center for Diabetes Research, Department of Clinical Science, University of Bergen, Norway
- Department of Pediatrics, Haukeland University Hospital, Bergen, Norway
| | - Frode S Berven
- Proteomics Unit, Department of Biomedicine, University of Bergen, Norway
| | - Lennart Martens
- VIB-UGent Center for Medical Biotechnology, VIB, Ghent, Belgium
- Department of Biochemistry, Ghent University, Ghent, Belgium
- Bioinformatics Institute Ghent, Ghent University, Ghent, Belgium
| | - Harald Barsnes
- KG Jebsen Center for Diabetes Research, Department of Clinical Science, University of Bergen, Norway
- Proteomics Unit, Department of Biomedicine, University of Bergen, Norway
- Computational Biology Unit, Department of Informatics, University of Bergen, Norway
| | - Marc Vaudel
- KG Jebsen Center for Diabetes Research, Department of Clinical Science, University of Bergen, Norway
- Proteomics Unit, Department of Biomedicine, University of Bergen, Norway
- Center for Medical Genetics and Molecular Medicine, Haukeland University Hospital, Bergen, Norway
| |
Collapse
|
21
|
Deutsch EW, Bandeira N, Sharma V, Perez-Riverol Y, Carver JJ, Kundu DJ, García-Seisdedos D, Jarnuczak AF, Hewapathirana S, Pullman BS, Wertz J, Sun Z, Kawano S, Okuda S, Watanabe Y, Hermjakob H, MacLean B, MacCoss MJ, Zhu Y, Ishihama Y, Vizcaíno JA. The ProteomeXchange consortium in 2020: enabling 'big data' approaches in proteomics. Nucleic Acids Res 2020; 48:D1145-D1152. [PMID: 31686107 PMCID: PMC7145525 DOI: 10.1093/nar/gkz984] [Citation(s) in RCA: 325] [Impact Index Per Article: 81.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2019] [Revised: 10/11/2019] [Accepted: 10/14/2019] [Indexed: 11/24/2022] Open
Abstract
The ProteomeXchange (PX) consortium of proteomics resources (http://www.proteomexchange.org) has standardized data submission and dissemination of mass spectrometry proteomics data worldwide since 2012. In this paper, we describe the main developments since the previous update manuscript was published in Nucleic Acids Research in 2017. Since then, in addition to the four PX existing members at the time (PRIDE, PeptideAtlas including the PASSEL resource, MassIVE and jPOST), two new resources have joined PX: iProX (China) and Panorama Public (USA). We first describe the updated submission guidelines, now expanded to include six members. Next, with current data submission statistics, we demonstrate that the proteomics field is now actively embracing public open data policies. At the end of June 2019, more than 14 100 datasets had been submitted to PX resources since 2012, and from those, more than 9 500 in just the last three years. In parallel, an unprecedented increase of data re-use activities in the field, including 'big data' approaches, is enabling novel research and new data resources. At last, we also outline some of our future plans for the coming years.
Collapse
Affiliation(s)
| | - Nuno Bandeira
- Center for Computational Mass Spectrometry, University of California, San Diego (UCSD), La Jolla, CA 92093, USA
- Department Computer Science and Engineering, University of California, San Diego (UCSD), La Jolla, CA 92093, USA
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego (UCSD), La Jolla, CA 92093, USA
| | | | - Yasset Perez-Riverol
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Jeremy J Carver
- Center for Computational Mass Spectrometry, University of California, San Diego (UCSD), La Jolla, CA 92093, USA
- Department Computer Science and Engineering, University of California, San Diego (UCSD), La Jolla, CA 92093, USA
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego (UCSD), La Jolla, CA 92093, USA
| | - Deepti J Kundu
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - David García-Seisdedos
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Andrew F Jarnuczak
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Suresh Hewapathirana
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Benjamin S Pullman
- Center for Computational Mass Spectrometry, University of California, San Diego (UCSD), La Jolla, CA 92093, USA
- Department Computer Science and Engineering, University of California, San Diego (UCSD), La Jolla, CA 92093, USA
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego (UCSD), La Jolla, CA 92093, USA
| | - Julie Wertz
- Center for Computational Mass Spectrometry, University of California, San Diego (UCSD), La Jolla, CA 92093, USA
- Department Computer Science and Engineering, University of California, San Diego (UCSD), La Jolla, CA 92093, USA
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego (UCSD), La Jolla, CA 92093, USA
| | - Zhi Sun
- Institute for Systems Biology, Seattle, WA 98109, USA
| | - Shin Kawano
- Faculty of Contemporary Society, Toyama University of International Studies, Toyama 930–1292, Japan
- Database Center for Life Science (DBCLS), Joint Support-Center for Data Science Research, Research Organization of Information and Systems, Chiba 277–0871, Japan
| | - Shujiro Okuda
- Niigata University Graduate School of Medical and Dental Sciences, Niigata 951–8510, Japan
| | - Yu Watanabe
- Niigata University Graduate School of Medical and Dental Sciences, Niigata 951–8510, Japan
| | - Henning Hermjakob
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Life Omics, Beijing 102206, China
| | | | | | - Yunping Zhu
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Life Omics, Beijing 102206, China
| | - Yasushi Ishihama
- Graduate School of Pharmaceutical Sciences, Kyoto University, Kyoto 606–8501, Japan
| | - Juan A Vizcaíno
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| |
Collapse
|
22
|
Hulstaert N, Shofstahl J, Sachsenberg T, Walzer M, Barsnes H, Martens L, Perez-Riverol Y. ThermoRawFileParser: Modular, Scalable, and Cross-Platform RAW File Conversion. J Proteome Res 2019; 19:537-542. [PMID: 31755270 DOI: 10.1021/acs.jproteome.9b00328] [Citation(s) in RCA: 94] [Impact Index Per Article: 18.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
The field of computational proteomics is approaching the big data age, driven both by a continuous growth in the number of samples analyzed per experiment as well as by the growing amount of data obtained in each analytical run. In order to process these large amounts of data, it is increasingly necessary to use elastic compute resources such as Linux-based cluster environments and cloud infrastructures. Unfortunately, the vast majority of cross-platform proteomics tools are not able to operate directly on the proprietary formats generated by the diverse mass spectrometers. Here, we present ThermoRawFileParser, an open-source, cross-platform tool that converts Thermo RAW files into open file formats such as MGF and the HUPO-PSI standard file format mzML. To ensure the broadest possible availability and to increase integration capabilities with popular workflow systems such as Galaxy or Nextflow, we have also built Conda package and BioContainers container around ThermoRawFileParser. In addition, we implemented a user-friendly interface (ThermoRawFileParserGUI) for those users not familiar with command-line tools. Finally, we performed a benchmark of ThermoRawFileParser and msconvert to verify that the converted mzML files contain reliable quantitative results.
Collapse
Affiliation(s)
- Niels Hulstaert
- VIB-UGent Center for Medical Biotechnology, VIB , Ghent B-9000 , Belgium.,Department of Biomolecular Medicine , Ghent University , Ghent B-9000 , Belgium
| | - Jim Shofstahl
- Thermo Fisher Scientific , 355 River Oaks Parkway , San Jose , California 95134 , United States
| | - Timo Sachsenberg
- Applied Bioinformatics, Department for Computer Science , University of Tuebingen , Sand 14 , 72076 Tuebingen , Germany
| | - Mathias Walzer
- European Molecular Biology Laboratory , European Bioinformatics Institute (EMBL-EBI) , Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD , United Kingdom
| | - Harald Barsnes
- Computational Biology Unit (CBU), Department of Informatics , University of Bergen , Bergen 5020 , Norway.,Proteomics Unit (PROBE), Department of Biomedicine , University of Bergen , Bergen 5020 , Norway
| | - Lennart Martens
- VIB-UGent Center for Medical Biotechnology, VIB , Ghent B-9000 , Belgium.,Department of Biomolecular Medicine , Ghent University , Ghent B-9000 , Belgium
| | - Yasset Perez-Riverol
- European Molecular Biology Laboratory , European Bioinformatics Institute (EMBL-EBI) , Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD , United Kingdom
| |
Collapse
|
23
|
Noor Z, Ranganathan S. Bioinformatics approaches for improving seminal plasma proteome analysis. Theriogenology 2019; 137:43-49. [PMID: 31186128 DOI: 10.1016/j.theriogenology.2019.05.036] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Abstract
Reproduction efficiency of male animals is one of the key factors influencing the sustainability of livestock. Mass spectrometry (MS) based proteomics has become an important tool for studying seminal plasma proteomes. In this review, we summarize bioinformatics analysis strategies for current proteomics approaches, for identifying novel biomarkers of reproductive robustness.
Collapse
Affiliation(s)
- Zainab Noor
- Department of Molecular Sciences, Macquarie University, Sydney, Australia
| | - Shoba Ranganathan
- Department of Molecular Sciences, Macquarie University, Sydney, Australia.
| |
Collapse
|
24
|
Shao W, Pedrioli PGA, Wolski W, Scurtescu C, Schmid E, Vizcaíno JA, Courcelles M, Schuster H, Kowalewski D, Marino F, Arlehamn CSL, Vaughan K, Peters B, Sette A, Ottenhoff THM, Meijgaarden KE, Nieuwenhuizen N, Kaufmann SHE, Schlapbach R, Castle JC, Nesvizhskii AI, Nielsen M, Deutsch EW, Campbell DS, Moritz RL, Zubarev RA, Ytterberg AJ, Purcell AW, Marcilla M, Paradela A, Wang Q, Costello CE, Ternette N, van Veelen PA, van Els CACM, Heck AJR, de Souza GA, Sollid LM, Admon A, Stevanovic S, Rammensee HG, Thibault P, Perreault C, Bassani-Sternberg M, Aebersold R, Caron E. The SysteMHC Atlas project. Nucleic Acids Res 2019; 46:D1237-D1247. [PMID: 28985418 PMCID: PMC5753376 DOI: 10.1093/nar/gkx664] [Citation(s) in RCA: 106] [Impact Index Per Article: 21.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2017] [Accepted: 07/21/2017] [Indexed: 11/25/2022] Open
Abstract
Mass spectrometry (MS)-based immunopeptidomics investigates the repertoire of peptides presented at the cell surface by major histocompatibility complex (MHC) molecules. The broad clinical relevance of MHC-associated peptides, e.g. in precision medicine, provides a strong rationale for the large-scale generation of immunopeptidomic datasets and recent developments in MS-based peptide analysis technologies now support the generation of the required data. Importantly, the availability of diverse immunopeptidomic datasets has resulted in an increasing need to standardize, store and exchange this type of data to enable better collaborations among researchers, to advance the field more efficiently and to establish quality measures required for the meaningful comparison of datasets. Here we present the SysteMHC Atlas (https://systemhcatlas.org), a public database that aims at collecting, organizing, sharing, visualizing and exploring immunopeptidomic data generated by MS. The Atlas includes raw mass spectrometer output files collected from several laboratories around the globe, a catalog of context-specific datasets of MHC class I and class II peptides, standardized MHC allele-specific peptide spectral libraries consisting of consensus spectra calculated from repeat measurements of the same peptide sequence, and links to other proteomics and immunology databases. The SysteMHC Atlas project was created and will be further expanded using a uniform and open computational pipeline that controls the quality of peptide identifications and peptide annotations. Thus, the SysteMHC Atlas disseminates quality controlled immunopeptidomic information to the public domain and serves as a community resource toward the generation of a high-quality comprehensive map of the human immunopeptidome and the support of consistent measurement of immunopeptidomic sample cohorts.
Collapse
Affiliation(s)
- Wenguang Shao
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich 8093, Switzerland
| | - Patrick G A Pedrioli
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich 8093, Switzerland
| | - Witold Wolski
- Functional Genomics Center Zurich, ETH Zurich and University of Zurich, Zurich 8057, Switzerland
| | | | - Emanuel Schmid
- Scientific IT Services (SIS), ETH Zurich, Zurich 8093, Switzerland
| | - Juan A Vizcaíno
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Mathieu Courcelles
- Institute for Research in Immunology and Cancer, Université de Montréal, Montreal, H3T 1J4, Canada
| | - Heiko Schuster
- Department of Immunology, Interfaculty Institute for Cell Biology, University of Tübingen, Tübingen, 72076, Germany.,German Cancer Consortium (DKTK), DKFZ partner site Tübingen, Tübingen, 72076, Germany
| | - Daniel Kowalewski
- Department of Immunology, Interfaculty Institute for Cell Biology, University of Tübingen, Tübingen, 72076, Germany.,German Cancer Consortium (DKTK), DKFZ partner site Tübingen, Tübingen, 72076, Germany
| | - Fabio Marino
- Ludwig Institute for Cancer Research, University Hospital of Lausanne, Lausanne 1011, Switzerland.,Biomolecular Mass Spectrometry and Proteomics, Bijvoet Center for Biomolecular Research and Utrecht Institute for Pharmaceutical Sciences, Utrecht University, 3584 CH, The Netherlands.,Netherlands Proteomics Centre, Utrecht, 3584 CH, The Netherlands
| | | | - Kerrie Vaughan
- La Jolla Institute for Allergy and Immunology, La Jolla, CA 92037, USA
| | - Bjoern Peters
- La Jolla Institute for Allergy and Immunology, La Jolla, CA 92037, USA
| | - Alessandro Sette
- La Jolla Institute for Allergy and Immunology, La Jolla, CA 92037, USA
| | - Tom H M Ottenhoff
- Department of Infectious Diseases, Leiden University Medical Center, Leiden, 2333 ZA, The Netherlands
| | - Krista E Meijgaarden
- Department of Infectious Diseases, Leiden University Medical Center, Leiden, 2333 ZA, The Netherlands
| | - Natalie Nieuwenhuizen
- Department of Immunology, Max Planck Institute for Infection Biology, Berlin 10117, Germany
| | - Stefan H E Kaufmann
- Department of Immunology, Max Planck Institute for Infection Biology, Berlin 10117, Germany
| | - Ralph Schlapbach
- Functional Genomics Center Zurich, ETH Zurich and University of Zurich, Zurich 8057, Switzerland
| | - John C Castle
- Vaccine Research and Translational Medicine, Agenus Switzerland Inc., 4157 Basel, Switzerland
| | - Alexey I Nesvizhskii
- Department of Pathology, BRCF Metabolomics Core, University of Michigan, Ann Arbor, MI 48109, USA.,Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Morten Nielsen
- Instituto de Investigaciones Biotecnológicas, Universidad Nacional de San Martín, Buenos Aires, 1650, Argentina.,Department of Bio and Health Informatics, Technical University of Denmark, 2800 Kgs. Lyngby, Denmark
| | | | | | | | - Roman A Zubarev
- Division of Physiological Chemistry I, Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Stockholm, SE-171 77, Sweden
| | - Anders Jimmy Ytterberg
- Division of Physiological Chemistry I, Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Stockholm, SE-171 77, Sweden.,Rheumatology Unit, Department of Medicine, Solna, Karolinska Institutet, Stockholm, SE-171 77, Sweden
| | - Anthony W Purcell
- Infection and Immunity Program, Department of Biochemistry and Molecular Biology, Monash Biomedicine Discovery Institute, Monash University, Clayton 3800, Australia
| | - Miguel Marcilla
- Proteomics Unit, Spanish National Biotechnology Centre, Madrid 28049, Spain
| | - Alberto Paradela
- Proteomics Unit, Spanish National Biotechnology Centre, Madrid 28049, Spain
| | - Qi Wang
- Center for Biomedical Mass Spectrometry, Department of Biochemistry, Boston University School of Medicine, Boston, MA 02118, USA
| | - Catherine E Costello
- Center for Biomedical Mass Spectrometry, Department of Biochemistry, Boston University School of Medicine, Boston, MA 02118, USA
| | - Nicola Ternette
- The Jenner Institute, Target Discovery Institute Mass Spectrometry Laboratory, University of Oxford, Oxford, OX3 7FZ, UK
| | - Peter A van Veelen
- Center for Proteomics and Metabolomics, Leiden University Medical Center, Leiden, 2333 ZA, The Netherlands
| | - Cécile A C M van Els
- Centre for Infectious Disease Control, National Institute for Public Health and the Environment, Bilthoven, 3720 BA, The Netherlands
| | - Albert J R Heck
- Biomolecular Mass Spectrometry and Proteomics, Bijvoet Center for Biomolecular Research and Utrecht Institute for Pharmaceutical Sciences, Utrecht University, 3584 CH, The Netherlands.,Netherlands Proteomics Centre, Utrecht, 3584 CH, The Netherlands
| | - Gustavo A de Souza
- Centre for Immune Regulation, Department of Immunology, University of Oslo and Oslo University Hospital-Rikshospitalet, Oslo 0372, Norway.,The Brain Institute, Universidade Federal do Rio Grande do Norte, 59056-450, Natal-RN, Brazil
| | - Ludvig M Sollid
- Centre for Immune Regulation, Department of Immunology, University of Oslo and Oslo University Hospital-Rikshospitalet, Oslo 0372, Norway
| | - Arie Admon
- Department of Biology, Technion, Israel Institute of Technology, Haifa 3200003, Israel
| | - Stefan Stevanovic
- Department of Immunology, Interfaculty Institute for Cell Biology, University of Tübingen, Tübingen, 72076, Germany.,German Cancer Consortium (DKTK), DKFZ partner site Tübingen, Tübingen, 72076, Germany
| | - Hans-Georg Rammensee
- Department of Immunology, Interfaculty Institute for Cell Biology, University of Tübingen, Tübingen, 72076, Germany.,German Cancer Consortium (DKTK), DKFZ partner site Tübingen, Tübingen, 72076, Germany
| | - Pierre Thibault
- Institute for Research in Immunology and Cancer, Université de Montréal, Montreal, H3T 1J4, Canada
| | - Claude Perreault
- Institute for Research in Immunology and Cancer, Université de Montréal, Montreal, H3T 1J4, Canada
| | - Michal Bassani-Sternberg
- Ludwig Institute for Cancer Research, University Hospital of Lausanne, Lausanne 1011, Switzerland
| | - Ruedi Aebersold
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich 8093, Switzerland.,Faculty of Science, University of Zurich, 8006 Zurich, Switzerland
| | - Etienne Caron
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich 8093, Switzerland
| |
Collapse
|
25
|
Milk proteome from in silico data aggregation allows the identification of putative biomarkers of negative energy balance in dairy cows. Sci Rep 2019; 9:9718. [PMID: 31273261 PMCID: PMC6609625 DOI: 10.1038/s41598-019-46142-7] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2018] [Accepted: 06/19/2019] [Indexed: 01/13/2023] Open
Abstract
A better knowledge of the bovine milk proteome and its main drivers is a prerequisite for the modulation of bioactive proteins in milk for human nutrition, as well as for the discovery of biomarkers that are useful in husbandry and veterinary medicine. Milk composition is affected by lactation stage and reflects, in part, the energy balance of dairy cows. We aggregated the cow milk proteins reported in 20 recent proteomics publications to produce an atlas of 4654 unique proteins. A multistep assessment was applied to the milk proteome datasets according to lactation stages and milk fractions, including annotations, pathway analysis and literature mining. Fifty-nine proteins were exclusively detected in milk from early lactation. Among them, we propose six milk proteins as putative biomarkers of negative energy balance based on their implication in metabolic adaptative pathways. These proteins are PCK2, which is a gluconeogenic enzyme; ACAT1 and IVD, which are involved in ketone metabolism; SDHA and UQCRC1, which are related to mitochondrial oxidative metabolism; and LRRC59, which is linked to mammary gland cell proliferation. The cellular origin of these proteins warrants more in-depth research but may constitute part of a molecular signature for metabolic adaptations typical of early lactation.
Collapse
|
26
|
Hernandez-Valladares M, Wangen R, Berven FS, Guldbrandsen A. Protein Post-Translational Modification Crosstalk in Acute Myeloid Leukemia Calls for Action. Curr Med Chem 2019; 26:5317-5337. [PMID: 31241430 DOI: 10.2174/0929867326666190503164004] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2018] [Revised: 11/23/2018] [Accepted: 02/01/2019] [Indexed: 01/24/2023]
Abstract
BACKGROUND Post-translational modification (PTM) crosstalk is a young research field. However, there is now evidence of the extraordinary characterization of the different proteoforms and their interactions in a biological environment that PTM crosstalk studies can describe. Besides gene expression and phosphorylation profiling of acute myeloid leukemia (AML) samples, the functional combination of several PTMs that might contribute to a better understanding of the complexity of the AML proteome remains to be discovered. OBJECTIVE By reviewing current workflows for the simultaneous enrichment of several PTMs and bioinformatics tools to analyze mass spectrometry (MS)-based data, our major objective is to introduce the PTM crosstalk field to the AML research community. RESULTS After an introduction to PTMs and PTM crosstalk, this review introduces several protocols for the simultaneous enrichment of PTMs. Two of them allow a simultaneous enrichment of at least three PTMs when using 0.5-2 mg of cell lysate. We have reviewed many of the bioinformatics tools used for PTM crosstalk discovery as its complex data analysis, mainly generated from MS, becomes challenging for most AML researchers. We have presented several non-AML PTM crosstalk studies throughout the review in order to show how important the characterization of PTM crosstalk becomes for the selection of disease biomarkers and therapeutic targets. CONCLUSION Herein, we have reviewed the advances and pitfalls of the emerging PTM crosstalk field and its potential contribution to unravel the heterogeneity of AML. The complexity of sample preparation and bioinformatics workflows demands a good interaction between experts of several areas.
Collapse
Affiliation(s)
- Maria Hernandez-Valladares
- Department of Clinical Science, Faculty of Medicine, University of Bergen, Jonas Lies vei 87, N-5021 Bergen, Norway.,The Proteomics Unit at the University of Bergen, Department of Biomedicine, Building for Basic Biology, Faculty of Medicine, University of Bergen, Jonas Lies vei 91, N-5009 Bergen, Norway
| | - Rebecca Wangen
- Department of Clinical Science, Faculty of Medicine, University of Bergen, Jonas Lies vei 87, N-5021 Bergen, Norway.,The Proteomics Unit at the University of Bergen, Department of Biomedicine, Building for Basic Biology, Faculty of Medicine, University of Bergen, Jonas Lies vei 91, N-5009 Bergen, Norway.,Department of Internal Medicine, Hematology Section, Haukeland University Hospital, Jonas Lies vei 65, N-5021 Bergen, Norway
| | - Frode S Berven
- The Proteomics Unit at the University of Bergen, Department of Biomedicine, Building for Basic Biology, Faculty of Medicine, University of Bergen, Jonas Lies vei 91, N-5009 Bergen, Norway
| | - Astrid Guldbrandsen
- The Proteomics Unit at the University of Bergen, Department of Biomedicine, Building for Basic Biology, Faculty of Medicine, University of Bergen, Jonas Lies vei 91, N-5009 Bergen, Norway.,Computational Biology Unit, Department of Informatics, Faculty of Mathematics and Natural Sciences, University of Bergen, Thormøhlensgt 55, N-5008 Bergen, Norway
| |
Collapse
|
27
|
Pavkovic M, Pantano L, Gerlach CV, Brutus S, Boswell SA, Everley RA, Shah JV, Sui SH, Vaidya VS. Multi omics analysis of fibrotic kidneys in two mouse models. Sci Data 2019; 6:92. [PMID: 31201317 PMCID: PMC6570759 DOI: 10.1038/s41597-019-0095-5] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2018] [Accepted: 05/07/2019] [Indexed: 12/12/2022] Open
Abstract
Kidney fibrosis represents an urgent unmet clinical need due to the lack of effective therapies and an inadequate understanding of the molecular pathogenesis. We have generated a comprehensive and combined multi-omics dataset (proteomics, mRNA and small RNA transcriptomics) of fibrotic kidneys that is searchable through a user-friendly web application: http://hbcreports.med.harvard.edu/fmm/. Two commonly used mouse models were utilized: a reversible chemical-induced injury model (folic acid (FA) induced nephropathy) and an irreversible surgically-induced fibrosis model (unilateral ureteral obstruction (UUO)). mRNA and small RNA sequencing, as well as 10-plex tandem mass tag (TMT) proteomics were performed with kidney samples from different time points over the course of fibrosis development. The bioinformatics workflow used to process, technically validate, and combine the single omics data will be described. In summary, we present temporal multi-omics data from fibrotic mouse kidneys that are accessible through an interrogation tool (Mouse Kidney Fibromics browser) to provide a searchable transcriptome and proteome for kidney fibrosis researchers. Design Type(s) | transcription profiling design • proteomic profiling design • stimulus or stress design | Measurement Type(s) | transcription profiling assay • protein expression profiling assay | Technology Type(s) | RNA sequencing • mass spectrometry | Factor Type(s) | experimental condition • temporal_instant • biological replicate | Sample Characteristic(s) | Mus musculus • kidney |
Machine-accessible metadata file describing the reported data (ISA-Tab format)
Collapse
Affiliation(s)
- Mira Pavkovic
- Laboratory of Systems Pharmacology, Harvard Medical School, Boston, MA, USA.,Department of Medicine - Renal Division, Brigham and Women's Hospital, Boston, MA, USA
| | - Lorena Pantano
- Bioinformatics Core, Harvard T. H. Chan School of Public Health, Boston, MA, USA
| | - Cory V Gerlach
- Laboratory of Systems Pharmacology, Harvard Medical School, Boston, MA, USA.,Department of Medicine - Renal Division, Brigham and Women's Hospital, Boston, MA, USA.,Department of Environmental Health, Harvard T. H. Chan School of Public Health, Boston, MA, USA
| | - Sergine Brutus
- Laboratory of Systems Pharmacology, Harvard Medical School, Boston, MA, USA.,Department of Environmental Health, Harvard T. H. Chan School of Public Health, Boston, MA, USA.,Department of Systems Biology, Harvard Medical School, Boston, MA, USA
| | - Sarah A Boswell
- Laboratory of Systems Pharmacology, Harvard Medical School, Boston, MA, USA
| | - Robert A Everley
- Laboratory of Systems Pharmacology, Harvard Medical School, Boston, MA, USA
| | - Jagesh V Shah
- Laboratory of Systems Pharmacology, Harvard Medical School, Boston, MA, USA.,Department of Medicine - Renal Division, Brigham and Women's Hospital, Boston, MA, USA
| | - Shannan H Sui
- Bioinformatics Core, Harvard T. H. Chan School of Public Health, Boston, MA, USA
| | - Vishal S Vaidya
- Laboratory of Systems Pharmacology, Harvard Medical School, Boston, MA, USA. .,Department of Medicine - Renal Division, Brigham and Women's Hospital, Boston, MA, USA. .,Department of Environmental Health, Harvard T. H. Chan School of Public Health, Boston, MA, USA.
| |
Collapse
|
28
|
A dynamic view of the proteomic landscape during differentiation of ReNcell VM cells, an immortalized human neural progenitor line. Sci Data 2019; 6:190016. [PMID: 30778261 PMCID: PMC6380223 DOI: 10.1038/sdata.2019.16] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2018] [Accepted: 12/21/2018] [Indexed: 01/25/2023] Open
Abstract
The immortalized human ReNcell VM cell line represents a reproducible and easy-to-propagate cell culture system for studying the differentiation of neural progenitors. To better characterize the starting line and its subsequent differentiation, we assessed protein and phospho-protein levels and cell morphology over a 15-day period during which ReNcell progenitors differentiated into neurons, astrocytes and oligodendrocytes. Five of the resulting datasets measured protein levels or states of phosphorylation based on tandem-mass-tag (TMT) mass spectrometry and four datasets characterized cellular phenotypes using high-content microscopy. Proteomic analysis revealed reproducible changes in pathways responsible for cytoskeletal rearrangement, cell phase transitions, neuronal migration, glial differentiation, neurotrophic signalling and extracellular matrix regulation. Proteomic and imaging data revealed accelerated differentiation in cells treated with the poly-selective CDK and GSK3 inhibitor kenpaullone or the HMG-CoA reductase inhibitor mevastatin, both of which have previously been reported to promote neural differentiation. These data provide in-depth information on the ReNcell progenitor state and on neural differentiation in the presence and absence of drugs, setting the stage for functional studies.
Collapse
|
29
|
Jarnuczak AF, Ternent T, Vizcaíno JA. Quantitative Proteomics Data in the Public Domain: Challenges and Opportunities. Methods Mol Biol 2019; 1977:217-235. [PMID: 30980331 DOI: 10.1007/978-1-4939-9232-4_14] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
Mass spectrometry based proteomics is no longer only a qualitative discipline, and can be successfully employed to obtain a truly multidimensional view of the proteome. In particular, systematic protein expression profiling is now a routine part of many studies in the field and beyond. The large growth in the number of quantitative studies is accompanied by a trend to share publicly the associated analysis results and the underlying raw data. This trend, established and strongly supported by public repositories such as the PRIDE database at the European Bioinformatics Institute, opens up enormous possibilities to explore the data beyond the original publications, for instance by reusing, reanalyzing, and performing different flavors of meta-analysis studies. To help researchers and scientists realize about this potential, here we describe the mainstream public proteomics resources containing quantitative proteomics data, including the processed analysis results and/or the underlying raw data. We then present and discuss the most important points to consider when attempting to (re)use proteomics data in the public domain. We conclude by highlighting potential pitfalls of (re)using quantitative data and discuss some of our own experiences in this context.
Collapse
Affiliation(s)
- Andrew F Jarnuczak
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, UK
| | - Tobias Ternent
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, UK
| | - Juan Antonio Vizcaíno
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, UK.
| |
Collapse
|
30
|
Gustafsson OJR, Winderbaum LJ, Condina MR, Boughton BA, Hamilton BR, Undheim EAB, Becker M, Hoffmann P. Balancing sufficiency and impact in reporting standards for mass spectrometry imaging experiments. Gigascience 2018; 7:5074354. [PMID: 30124809 PMCID: PMC6203951 DOI: 10.1093/gigascience/giy102] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2018] [Revised: 07/24/2018] [Accepted: 08/07/2018] [Indexed: 02/06/2023] Open
Abstract
Reproducibility, or a lack thereof, is an increasingly important topic across many research fields. A key aspect of reproducibility is accurate reporting of both experiments and the resulting data. Herein, we propose a reporting guideline for mass spectrometry imaging (MSI). Previous standards have laid out guidelines sufficient to guarantee a certain quality of reporting; however, they set a high bar and as a consequence can be exhaustive and broad, thus limiting uptake.To help address this lack of uptake, we propose a reporting supplement-Minimum Information About a Mass Spectrometry Imaging Experiment (MIAMSIE)-and its abbreviated reporting standard version, MSIcheck. MIAMSIE is intended to improve author-driven reporting. It is intentionally not exhaustive, but is rather designed for extensibility and could therefore eventually become analogous to existing standards that aim to guarantee reporting quality. Conversely, its abbreviated form MSIcheck is intended as a diagnostic tool focused on key aspects in MSI reporting.We discuss how existing standards influenced MIAMSIE/MSIcheck and how these new approaches could positively impact reporting quality, followed by test implementation of both standards to demonstrate their use. For MIAMSIE, we report on author reviews of four articles and a dataset. For MSIcheck, we show a snapshot review of a one-month subset of the MSI literature that indicated issues with data provision and the reporting of both data analysis steps and calibration settings for MS systems. Although our contribution is MSI specific, we believe the underlying approach could be considered as a general strategy for improving scientific reporting.
Collapse
Affiliation(s)
- Ove J R Gustafsson
- ARC Centre of Excellence in Convergent Bio-Nano Science & Technology (CBNS), University of South Australia, Mawson Lakes, South Australia 5095, Australia
- Future Industries Institute, University of South Australia, Mawson Lakes, South Australia 5095, Australia
| | - Lyron J Winderbaum
- Future Industries Institute, University of South Australia, Mawson Lakes, South Australia 5095, Australia
| | - Mark R Condina
- Future Industries Institute, University of South Australia, Mawson Lakes, South Australia 5095, Australia
| | - Berin A Boughton
- Metabolomics Australia, School of BioSciences, University of Melbourne, Parkville, Victoria 3010, Australia
| | - Brett R Hamilton
- Centre for Microscopy and Microanalysis, University of Queensland, St. Lucia, Queensland 4072, Australia
- Centre for Advanced Imaging, University of Queensland, St. Lucia, Queensland 4072, Australia
| | - Eivind A B Undheim
- Centre for Advanced Imaging, University of Queensland, St. Lucia, Queensland 4072, Australia
| | - Michael Becker
- Boehringer Ingelheim Pharma GmbH & Co. KG, Biberach a.d. Riss 88397, Germany
| | - Peter Hoffmann
- Future Industries Institute, University of South Australia, Mawson Lakes, South Australia 5095, Australia
| |
Collapse
|
31
|
Basilicata MF, Bruel AL, Semplicio G, Valsecchi CIK, Aktaş T, Duffourd Y, Rumpf T, Morton J, Bache I, Szymanski WG, Gilissen C, Vanakker O, Õunap K, Mittler G, van der Burgt I, El Chehadeh S, Cho MT, Pfundt R, Tan TY, Kirchhoff M, Menten B, Vergult S, Lindstrom K, Reis A, Johnson DS, Fryer A, McKay V, Fisher RB, Thauvin-Robinet C, Francis D, Roscioli T, Pajusalu S, Radtke K, Ganesh J, Brunner HG, Wilson M, Faivre L, Kalscheuer VM, Thevenon J, Akhtar A. De novo mutations in MSL3 cause an X-linked syndrome marked by impaired histone H4 lysine 16 acetylation. Nat Genet 2018; 50:1442-1451. [PMID: 30224647 DOI: 10.1038/s41588-018-0220-y] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2017] [Accepted: 08/01/2018] [Indexed: 12/15/2022]
Abstract
The etiological spectrum of ultra-rare developmental disorders remains to be fully defined. Chromatin regulatory mechanisms maintain cellular identity and function, where misregulation may lead to developmental defects. Here, we report pathogenic variations in MSL3, which encodes a member of the chromatin-associated male-specific lethal (MSL) complex responsible for bulk histone H4 lysine 16 acetylation (H4K16ac) in flies and mammals. These variants cause an X-linked syndrome affecting both sexes. Clinical features of the syndrome include global developmental delay, progressive gait disturbance, and recognizable facial dysmorphism. MSL3 mutations affect MSL complex assembly and activity, accompanied by a pronounced loss of H4K16ac levels in vivo. Patient-derived cells display global transcriptome alterations of pathways involved in morphogenesis and cell migration. Finally, we use histone deacetylase inhibitors to rebalance acetylation levels, alleviating some of the molecular and cellular phenotypes of patient cells. Taken together, we characterize a syndrome that allowed us to decipher the developmental importance of MSL3 in humans.
Collapse
Affiliation(s)
- M Felicia Basilicata
- Max Planck Institute of Immunobiology and Epigenetics, Freiburg im Breisgau, Germany
| | - Ange-Line Bruel
- Inserm UMR 1231 GAD, Genetics of Developmental disorders and Centre de Référence Maladies Rares Anomalies du Développement et syndromes malformatifs FHU TRANSLAD, Université de Bourgogne-Franche Comté, Dijon, France
| | - Giuseppe Semplicio
- Max Planck Institute of Immunobiology and Epigenetics, Freiburg im Breisgau, Germany
| | | | - Tuğçe Aktaş
- Max Planck Institute of Immunobiology and Epigenetics, Freiburg im Breisgau, Germany
| | - Yannis Duffourd
- Inserm UMR 1231 GAD, Genetics of Developmental disorders and Centre de Référence Maladies Rares Anomalies du Développement et syndromes malformatifs FHU TRANSLAD, Université de Bourgogne-Franche Comté, Dijon, France
| | - Tobias Rumpf
- Max Planck Institute of Immunobiology and Epigenetics, Freiburg im Breisgau, Germany
| | - Jenny Morton
- West Midlands Regional Clinical Genetics Service and Birmingham Health Partners, Birmingham Women's Hospital NHS Foundation Trust, Birmingham, UK
| | - Iben Bache
- Department of Clinical Genetics, Copenhagen University Hospital, Rigshospitalet, Copenhagen, Denmark.,Wilhelm Johannsen Centre for Functional Genome Research, Department of Cellular and Molecular Medicine, University of Copenhagen, Copenhagen, Denmark
| | - Witold G Szymanski
- Max Planck Institute of Immunobiology and Epigenetics, Freiburg im Breisgau, Germany
| | - Christian Gilissen
- Department of Human Genetics, Radboud University Medical Center, Donders Institute for Brain, Cognition and Behaviour, Nijmegen, The Netherlands
| | - Olivier Vanakker
- Center for Medical Genetics, Ghent University Hospital, Ghent, Belgium
| | - Katrin Õunap
- Department of Clinical Genetics, United Laboratories, Tartu University Hospital and Institute of Clinical Medicine, University of Tartu, Tartu, Estonia
| | - Gerhard Mittler
- Max Planck Institute of Immunobiology and Epigenetics, Freiburg im Breisgau, Germany
| | - Ineke van der Burgt
- Department of Human Genetics, Radboud University Medical Center, Donders Institute for Brain, Cognition and Behaviour, Nijmegen, The Netherlands
| | - Salima El Chehadeh
- Inserm UMR 1231 GAD, Genetics of Developmental disorders and Centre de Référence Maladies Rares Anomalies du Développement et syndromes malformatifs FHU TRANSLAD, Université de Bourgogne-Franche Comté, Dijon, France.,Service de Génétique Médicale, Hôpital de Hautepierre, Strasbourg, France
| | | | - Rolph Pfundt
- Department of Human Genetics, Radboud University Medical Center, Donders Institute for Brain, Cognition and Behaviour, Nijmegen, The Netherlands
| | - Tiong Yang Tan
- Victorian Clinical Genetics Services, Murdoch Children's Research Institute, Royal Children's Hospital, University of Melbourne Department of Paediatrics, Parkville, VIC, Australia
| | - Maria Kirchhoff
- Department of Clinical Genetics, Copenhagen University Hospital, Rigshospitalet, Copenhagen, Denmark
| | - Björn Menten
- Center for Medical Genetics, Ghent University Hospital, Ghent, Belgium
| | - Sarah Vergult
- Center for Medical Genetics, Ghent University Hospital, Ghent, Belgium
| | - Kristin Lindstrom
- Division of Genetics and Metabolism, Phoenix Children's Hospital, Phoenix, AZ, USA
| | - André Reis
- Institute of Human Genetics, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany
| | - Diana S Johnson
- Sheffield Clinical Genetics Service, Sheffield Children's NHS Foundation Trust, Sheffield, UK
| | - Alan Fryer
- Department of Clinical Genetics, Liverpool Women's NHS Foundation Trust, Liverpool, UK
| | - Victoria McKay
- Department of Clinical Genetics, Liverpool Women's NHS Foundation Trust, Liverpool, UK
| | | | - Richard B Fisher
- Northern Genetics Service, Teesside Genetics Unit, The James Cook University Hospital, Middlesbrough, UK
| | - Christel Thauvin-Robinet
- Inserm UMR 1231 GAD, Genetics of Developmental disorders and Centre de Référence Maladies Rares Anomalies du Développement et syndromes malformatifs FHU TRANSLAD, Université de Bourgogne-Franche Comté, Dijon, France
| | - David Francis
- Cytogenetic Laboratory, Victorian Clinical Genetics Services, Murdoch Children's Research Institute, Royal Children's Hospital, Melbourne, Victoria, Australia
| | - Tony Roscioli
- Neuroscience Research Australia, Sydney, New South Wales, Australia.,Prince of Wales Clinical School, University of New South Wales, Sydney, New South Wales, Australia.,Department of Medical Genetics, Sydney Children's Hospital, Sydney, New South Wales, Australia
| | - Sander Pajusalu
- Department of Clinical Genetics, United Laboratories, Tartu University Hospital and Institute of Clinical Medicine, University of Tartu, Tartu, Estonia
| | - Kelly Radtke
- Department of Clinical Genomics, Ambry Genetics, Aliso Viejo, CA, USA
| | - Jaya Ganesh
- Division of Genetics, Cooper University Hospital and Cooper Medical School at Rowan University, Camden, NJ, USA
| | - Han G Brunner
- Department of Human Genetics, Radboud University Medical Center, Donders Institute for Brain, Cognition and Behaviour, Nijmegen, The Netherlands.,Department of Clinical Genetics and School for Oncology and Developmental Biology, Maastricht University Medical Center, Maastricht, The Netherlands
| | - Meredith Wilson
- Department of Clinical Genetics, Children's Hospital at Westmead, Disciplines of Genetic Medicine and Child and Adolescent Health, University of Sydney, Sydney, New South Wales, Australia
| | - Laurence Faivre
- Inserm UMR 1231 GAD, Genetics of Developmental disorders and Centre de Référence Maladies Rares Anomalies du Développement et syndromes malformatifs FHU TRANSLAD, Université de Bourgogne-Franche Comté, Dijon, France
| | - Vera M Kalscheuer
- Research Group Development and Disease, Max Planck Institute for Molecular Genetics, Berlin, Germany
| | - Julien Thevenon
- Inserm UMR 1231 GAD, Genetics of Developmental disorders and Centre de Référence Maladies Rares Anomalies du Développement et syndromes malformatifs FHU TRANSLAD, Université de Bourgogne-Franche Comté, Dijon, France. .,CNRS UMR 5309, INSERM, U1209, Institute of Advanced Biosciences, Université Grenoble-Alpes CHU Grenoble, Grenoble, France.
| | - Asifa Akhtar
- Max Planck Institute of Immunobiology and Epigenetics, Freiburg im Breisgau, Germany.
| |
Collapse
|
32
|
moFF: a robust and automated approach to extract peptide ion intensities. Nat Methods 2018; 13:964-966. [PMID: 27898063 DOI: 10.1038/nmeth.4075] [Citation(s) in RCA: 45] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
|
33
|
Misra BB. Updates on resources, software tools, and databases for plant proteomics in 2016-2017. Electrophoresis 2018; 39:1543-1557. [PMID: 29420853 DOI: 10.1002/elps.201700401] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2017] [Revised: 01/23/2018] [Accepted: 02/02/2018] [Indexed: 11/05/2022]
Abstract
Proteomics data processing, annotation, and analysis can often lead to major hurdles in large-scale high-throughput bottom-up proteomics experiments. Given the recent rise in protein-based big datasets being generated, efforts in in silico tool development occurrences have had an unprecedented increase; so much so, that it has become increasingly difficult to keep track of all the advances in a particular academic year. However, these tools benefit the plant proteomics community in circumventing critical issues in data analysis and visualization, as these continually developing open-source and community-developed tools hold potential in future research efforts. This review will aim to introduce and summarize more than 50 software tools, databases, and resources developed and published during 2016-2017 under the following categories: tools for data pre-processing and analysis, statistical analysis tools, peptide identification tools, databases and spectral libraries, and data visualization and interpretation tools. Intended for a well-informed proteomics community, finally, efforts in data archiving and validation datasets for the community will be discussed as well. Additionally, the author delineates the current and most commonly used proteomics tools in order to introduce novice readers to this -omics discovery platform.
Collapse
Affiliation(s)
- Biswapriya B Misra
- Department of Internal Medicine, Section of Molecular Medicine, Medical Center Boulevard, Winston-Salem, NC, USA
| |
Collapse
|
34
|
Menschaert G, Wang X, Jones AR, Ghali F, Fenyö D, Olexiouk V, Zhang B, Deutsch EW, Ternent T, Vizcaíno JA. The proBAM and proBed standard formats: enabling a seamless integration of genomics and proteomics data. Genome Biol 2018; 19:12. [PMID: 29386051 PMCID: PMC5793360 DOI: 10.1186/s13059-017-1377-x] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2017] [Accepted: 12/07/2017] [Indexed: 01/23/2023] Open
Abstract
On behalf of The Human Proteome Organization (HUPO) Proteomics Standards Initiative, we introduce here two novel standard data formats, proBAM and proBed, that have been developed to address the current challenges of integrating mass spectrometry-based proteomics data with genomics and transcriptomics information in proteogenomics studies. proBAM and proBed are adaptations of the well-defined, widely used file formats SAM/BAM and BED, respectively, and both have been extended to meet the specific requirements entailed by proteomics data. Therefore, existing popular genomics tools such as SAMtools and Bedtools, and several widely used genome browsers, can already be used to manipulate and visualize these formats "out-of-the-box." We also highlight that a number of specific additional software tools, properly supporting the proteomics information available in these formats, are now available providing functionalities such as file generation, file conversion, and data analysis. All the related documentation, including the detailed file format specifications and example files, are accessible at http://www.psidev.info/probam and at http://www.psidev.info/probed .
Collapse
Affiliation(s)
- Gerben Menschaert
- Department of Mathematical Modeling, Statistics and Bioinformatics, Ghent University, Coupure links 653, 9000, Gent, Belgium.
| | - Xiaojing Wang
- Greehey Children's Cancer Research Institute, The University of Texas Health Science Center at San Antonio, San Antonio, TX, USA. .,Department of Epidemiology and Biostatistics, The University of Texas Health Science Center at San Antonio, San Antonio, TX, USA.
| | - Andrew R Jones
- Institute of Integrative Biology, University of Liverpool, Liverpool, UK
| | - Fawaz Ghali
- Institute of Integrative Biology, University of Liverpool, Liverpool, UK.,School of Computing, Mathematics and Digital Technology, Manchester Metropolitan University, Manchester, M1 5GD, UK
| | - David Fenyö
- Department of Biochemistry and Molecular Pharmacology, New York University School of Medicine, New York, NY, USA.,Institute for Systems Genetics, New York University School of Medicine, New York, NY, USA
| | - Volodimir Olexiouk
- Department of Mathematical Modeling, Statistics and Bioinformatics, Ghent University, Coupure links 653, 9000, Gent, Belgium
| | - Bing Zhang
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX, USA.,Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
| | | | - Tobias Ternent
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Juan Antonio Vizcaíno
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK.
| |
Collapse
|
35
|
Spicer RA, Steinbeck C. A lost opportunity for science: journals promote data sharing in metabolomics but do not enforce it. Metabolomics 2018; 14:16. [PMID: 29479297 PMCID: PMC5808091 DOI: 10.1007/s11306-017-1309-5] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/16/2017] [Accepted: 12/08/2017] [Indexed: 12/11/2022]
Abstract
INTRODUCTION Data sharing is being increasingly required by journals and has been heralded as a solution to the 'replication crisis'. OBJECTIVES (i) Review data sharing policies of journals publishing the most metabolomics papers associated with open data and (ii) compare these journals' policies to those that publish the most metabolomics papers. METHODS A PubMed search was used to identify metabolomics papers. Metabolomics data repositories were manually searched for linked publications. RESULTS Journals that support data sharing are not necessarily those with the most papers associated to open metabolomics data. CONCLUSION Further efforts are required to improve data sharing in metabolomics.
Collapse
Affiliation(s)
- Rachel A Spicer
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Christoph Steinbeck
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK.
- Institute for Inorganic and Analytical Chemistry, Friedrich-Schiller-University, Jena, Germany.
| |
Collapse
|
36
|
Söllner JF, Leparc G, Hildebrandt T, Klein H, Thomas L, Stupka E, Simon E. An RNA-Seq atlas of gene expression in mouse and rat normal tissues. Sci Data 2017; 4:170185. [PMID: 29231921 PMCID: PMC5726313 DOI: 10.1038/sdata.2017.185] [Citation(s) in RCA: 78] [Impact Index Per Article: 11.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2017] [Accepted: 10/27/2017] [Indexed: 12/21/2022] Open
Abstract
Gene functionality is closely connected to its expression specificity across tissues and cell types. RNA-Seq is a powerful quantitative tool to explore genome wide expression. The aim of this study is to provide a comprehensive RNA-Seq dataset across the same 13 tissues for mouse and rat, two of the most relevant species for biomedical research. The dataset provides the transcriptome across tissues from three male C57BL6 mice and three male Han Wistar rats. We also describe our bioinformatics pipeline to process and technically validate the data. Principal component analysis shows that tissue samples from both species cluster similarly. We show by comparative genomics that many genes with high sequence identity with respect to their human orthologues also have a highly correlated tissue distribution profile and are in agreement with manually curated literature data for human. In summary, the present study provides a unique resource for comparative genomics and will facilitate the analysis of tissue specificity and cross-species conservation in higher organisms.
Collapse
Affiliation(s)
- Julia F Söllner
- Target Discovery, Research Boehringer Ingelheim Pharma GmbH & CoKG, Biberach 88397, Germany.,Integrative Transcriptomics, Center for Bioinformatics, University of Tübingen, 72076 Tübingen, Germany
| | - German Leparc
- Target Discovery, Research Boehringer Ingelheim Pharma GmbH & CoKG, Biberach 88397, Germany
| | - Tobias Hildebrandt
- Target Discovery, Research Boehringer Ingelheim Pharma GmbH & CoKG, Biberach 88397, Germany
| | - Holger Klein
- Target Discovery, Research Boehringer Ingelheim Pharma GmbH & CoKG, Biberach 88397, Germany
| | - Leo Thomas
- Cardiometabolic Research, Research Boehringer Ingelheim Pharma GmbH & CoKG, Biberach 88397, Germany
| | - Elia Stupka
- Target Discovery, Research Boehringer Ingelheim Pharma GmbH & CoKG, Biberach 88397, Germany
| | - Eric Simon
- Target Discovery, Research Boehringer Ingelheim Pharma GmbH & CoKG, Biberach 88397, Germany
| |
Collapse
|
37
|
Pfeuffer J, Sachsenberg T, Alka O, Walzer M, Fillbrunn A, Nilse L, Schilling O, Reinert K, Kohlbacher O. OpenMS – A platform for reproducible analysis of mass spectrometry data. J Biotechnol 2017; 261:142-148. [DOI: 10.1016/j.jbiotec.2017.05.016] [Citation(s) in RCA: 48] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2017] [Revised: 05/17/2017] [Accepted: 05/22/2017] [Indexed: 10/19/2022]
|
38
|
Merkley ED, Sego LH, Lin A, Leiser OP, Kaiser BLD, Adkins JN, Keim PS, Wagner DM, Kreuzer HW. Protein abundances can distinguish between naturally-occurring and laboratory strains of Yersinia pestis, the causative agent of plague. PLoS One 2017; 12:e0183478. [PMID: 28854255 PMCID: PMC5576697 DOI: 10.1371/journal.pone.0183478] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2016] [Accepted: 08/05/2017] [Indexed: 11/19/2022] Open
Abstract
The rapid pace of bacterial evolution enables organisms to adapt to the laboratory environment with repeated passage and thus diverge from naturally-occurring environmental ("wild") strains. Distinguishing wild and laboratory strains is clearly important for biodefense and bioforensics; however, DNA sequence data alone has thus far not provided a clear signature, perhaps due to lack of understanding of how diverse genome changes lead to convergent phenotypes, difficulty in detecting certain types of mutations, or perhaps because some adaptive modifications are epigenetic. Monitoring protein abundance, a molecular measure of phenotype, can overcome some of these difficulties. We have assembled a collection of Yersinia pestis proteomics datasets from our own published and unpublished work, and from a proteomics data archive, and demonstrated that protein abundance data can clearly distinguish laboratory-adapted from wild. We developed a lasso logistic regression classifier that uses binary (presence/absence) or quantitative protein abundance measures to predict whether a sample is laboratory-adapted or wild that proved to be ~98% accurate, as judged by replicated 10-fold cross-validation. Protein features selected by the classifier accord well with our previous study of laboratory adaptation in Y. pestis. The input data was derived from a variety of unrelated experiments and contained significant confounding variables. We show that the classifier is robust with respect to these variables. The methodology is able to discover signatures for laboratory facility and culture medium that are largely independent of the signature of laboratory adaptation. Going beyond our previous laboratory evolution study, this work suggests that proteomic differences between laboratory-adapted and wild Y. pestis are general, potentially pointing to a process that could apply to other species as well. Additionally, we show that proteomics datasets (even archived data collected for different purposes) contain the information necessary to distinguish wild and laboratory samples. This work has clear applications in biomarker detection as well as biodefense.
Collapse
Affiliation(s)
- Eric D. Merkley
- Chemical and Biological Signature Sciences, Pacific Northwest National Laboratory, Richland, Washington, United States of America
- * E-mail:
| | - Landon H. Sego
- Applied Statistics and Computational Modeling, Pacific Northwest National Laboratory, Richland, Washington, United States of America
| | - Andy Lin
- Chemical and Biological Signature Sciences, Pacific Northwest National Laboratory, Richland, Washington, United States of America
- Department of Genome Sciences, University of Washington, Seattle, Washington, United States of America
| | - Owen P. Leiser
- Pathogen and Microbiome Institute, Northern Arizona University, Flagstaff, Arizona, United States of America
| | - Brooke L. Deatherage Kaiser
- Chemical and Biological Signature Sciences, Pacific Northwest National Laboratory, Richland, Washington, United States of America
| | - Joshua N. Adkins
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington, United States of America
| | - Paul S. Keim
- Pathogen and Microbiome Institute, Northern Arizona University, Flagstaff, Arizona, United States of America
| | - David M. Wagner
- Pathogen and Microbiome Institute, Northern Arizona University, Flagstaff, Arizona, United States of America
| | - Helen W. Kreuzer
- Chemical and Biological Signature Sciences, Pacific Northwest National Laboratory, Richland, Washington, United States of America
| |
Collapse
|
39
|
Vizcaíno JA, Walzer M, Jiménez RC, Bittremieux W, Bouyssié D, Carapito C, Corrales F, Ferro M, Heck AJR, Horvatovich P, Hubalek M, Lane L, Laukens K, Levander F, Lisacek F, Novak P, Palmblad M, Piovesan D, Pühler A, Schwämmle V, Valkenborg D, van Rijswijk M, Vondrasek J, Eisenacher M, Martens L, Kohlbacher O. A community proposal to integrate proteomics activities in ELIXIR. F1000Res 2017; 6. [PMID: 28713550 PMCID: PMC5499783 DOI: 10.12688/f1000research.11751.1] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 06/06/2017] [Indexed: 11/20/2022] Open
Abstract
Computational approaches have been major drivers behind the progress of proteomics in recent years. The aim of this white paper is to provide a framework for integrating computational proteomics into ELIXIR in the near future, and thus to broaden the portfolio of omics technologies supported by this European distributed infrastructure. This white paper is the direct result of a strategy meeting on ‘The Future of Proteomics in ELIXIR’ that took place in March 2017 in Tübingen (Germany), and involved representatives of eleven ELIXIR nodes. These discussions led to a list of priority areas in computational proteomics that would complement existing activities and close gaps in the portfolio of tools and services offered by ELIXIR so far. We provide some suggestions on how these activities could be integrated into ELIXIR’s existing platforms, and how it could lead to a new ELIXIR use case in proteomics. We also highlight connections to the related field of metabolomics, where similar activities are ongoing. This white paper could thus serve as a starting point for the integration of computational proteomics into ELIXIR. Over the next few months we will be working closely with all stakeholders involved, and in particular with other representatives of the proteomics community, to further refine this paper.
Collapse
Affiliation(s)
- Juan Antonio Vizcaíno
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, CB10 1SD, UK
| | - Mathias Walzer
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, CB10 1SD, UK
| | | | - Wout Bittremieux
- Department of Mathematics and Computer Science, University of Antwerp, Antwerp, 2020, Belgium
| | - David Bouyssié
- French Proteomics Infrastructure ProFI, Grenoble, (EDyP U1038, CEA/Inserm/ Grenoble Alpes University) Toulouse (IPBS, Université de Toulouse, CNRS, UPS), Strasbourg (LSMBO, IPHC UMR7178, CNRS-Université de Strasbourg), France
| | - Christine Carapito
- French Proteomics Infrastructure ProFI, Grenoble, (EDyP U1038, CEA/Inserm/ Grenoble Alpes University) Toulouse (IPBS, Université de Toulouse, CNRS, UPS), Strasbourg (LSMBO, IPHC UMR7178, CNRS-Université de Strasbourg), France
| | - Fernando Corrales
- ProteoRed, Proteomics Unit, Centro Nacional de Biotecnología (CSIC), Madrid, 28049, Spain
| | - Myriam Ferro
- French Proteomics Infrastructure ProFI, Grenoble, (EDyP U1038, CEA/Inserm/ Grenoble Alpes University) Toulouse (IPBS, Université de Toulouse, CNRS, UPS), Strasbourg (LSMBO, IPHC UMR7178, CNRS-Université de Strasbourg), France
| | - Albert J R Heck
- Biomolecular Mass Spectrometry and Proteomics, Bijvoet Centre for Biomolecular Research and Utrecht Institute for Pharmaceutical Sciences, University of Utrecht, Utrecht, 3548 CH, Netherlands.,Netherlands Proteomics Center, Utretcht, 3584 CH, Netherlands
| | - Peter Horvatovich
- Analytical Biochemistry, Department of Pharmacy, University of Groningen, Groningen, 9713 AV, Netherlands
| | - Martin Hubalek
- Institute of Organic Chemistry and Biochemistry, Czech Academy of Sciences, Prague 1, 117 20, Czech Republic
| | - Lydie Lane
- CALIPHO Group, SIB Swiss Institute of Bioinformatics, Geneva, 1015, Switzerland.,Department of Human Protein Science, Faculty of Medicine, University of Geneva, Geneva, 1205, Switzerland
| | - Kris Laukens
- Department of Mathematics and Computer Science, University of Antwerp, Antwerp, 2020, Belgium
| | - Fredrik Levander
- National Bioinformatics Infrastructure Sweden (NBIS), SciLifeLab, Department of Immunotechnology, Lund University, Lund, 223 62, Sweden
| | - Frederique Lisacek
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, Geneva, 1015, Switzerland.,Computer Science Department, University of Geneva, Geneva, 1205, Switzerland
| | - Petr Novak
- Institute of Microbiology, Czech Academy of Sciences, Prague 1, 117 20, Czech Republic
| | - Magnus Palmblad
- Center for Proteomics and Metabolomics, Leiden University Medical Center, Leiden, 2333 ZA, Netherlands
| | - Damiano Piovesan
- Department of Biomedical Sciences, University of Padova, Padova, I-35121, Italy
| | - Alfred Pühler
- Center for Biotechnology, Bielefeld University, Bielefeld, 33615, Germany
| | - Veit Schwämmle
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, Odense M, 5230, Denmark
| | - Dirk Valkenborg
- Interuniversity Institute for Biostatistics and Statistical Bioinformatics, Hasselt University, Hasselt, 3500, Belgium.,Center for Proteomics, University of Antwerp, Antwerpen, 2000, Belgium.,Applied Bio & Molecular Systems, VITO, Mol, BE-2400, Belgium
| | - Merlijn van Rijswijk
- Netherlands Metabolomics Centre, Utrecht, 3511 GC, Netherlands.,Dutch Techcentre for Life Sciences / ELIXIR-NL, Utrecht, 3511 GC, Netherlands
| | - Jiri Vondrasek
- Institute of Organic Chemistry and Biochemistry, Czech Academy of Sciences, Prague 1, 117 20, Czech Republic
| | - Martin Eisenacher
- Medical Bioinformatics, Medizinisches Proteom-Center, Ruhr-University Bochum, Bochum, 44801, Germany
| | - Lennart Martens
- VIB-UGent Center for Medical Biotechnology, Ghent, 9052, Belgium.,Department of Biochemistry, Ghent University, Ghent, 9000, Belgium
| | - Oliver Kohlbacher
- Applied Bioinformatics, Department of Computer Science, University of Tübingen, Tübingen, 72074, Germany.,Center for Bioinformatics Tübingen, University of Tübingen, Tübingen, 72074, Germany.,Quantitative Biology Center, University of Tübingen, Tübingen, 72074, Germany.,Biomolecular Interactions, Max Planck Institute for Developmental Biology, Tübingen, 72076, Germany
| |
Collapse
|
40
|
Gupta S, Verheggen K, Tavernier J, Martens L. Unbiased Protein Association Study on the Public Human Proteome Reveals Biological Connections between Co-Occurring Protein Pairs. J Proteome Res 2017; 16:2204-2212. [PMID: 28480704 PMCID: PMC5491052 DOI: 10.1021/acs.jproteome.6b01066] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
![]()
Mass-spectrometry-based, high-throughput
proteomics experiments
produce large amounts of data. While typically acquired to answer
specific biological questions, these data can also be reused in orthogonal
ways to reveal new biological knowledge. We here present a novel method
for such orthogonal data reuse of public proteomics data. Our method
elucidates biological relationships between proteins based on the
co-occurrence of these proteins across human experiments in the PRIDE
database. The majority of the significantly co-occurring protein pairs
that were detected by our method have been successfully mapped to
existing biological knowledge. The validity of our novel method is
substantiated by the extremely few pairs that can be mapped to existing
knowledge based on random associations between the same set of proteins.
Moreover, using literature searches and the STRING database, we were
able to derive meaningful biological associations for unannotated
protein pairs that were detected using our method, further illustrating
that as-yet unknown associations present highly interesting targets
for follow-up analysis.
Collapse
Affiliation(s)
- Surya Gupta
- VIB-UGent Center for Medical Biotechnology, VIB , A. Baertsoenkaai 3, B-9000 Ghent, Belgium.,Department of Biochemistry, Ghent University , B-9000 Ghent, Belgium.,Bioinformatics Institute Ghent, Ghent University , B-9000 Ghent, Belgium
| | - Kenneth Verheggen
- VIB-UGent Center for Medical Biotechnology, VIB , A. Baertsoenkaai 3, B-9000 Ghent, Belgium.,Department of Biochemistry, Ghent University , B-9000 Ghent, Belgium.,Bioinformatics Institute Ghent, Ghent University , B-9000 Ghent, Belgium
| | - Jan Tavernier
- VIB-UGent Center for Medical Biotechnology, VIB , A. Baertsoenkaai 3, B-9000 Ghent, Belgium.,Department of Biochemistry, Ghent University , B-9000 Ghent, Belgium
| | - Lennart Martens
- VIB-UGent Center for Medical Biotechnology, VIB , A. Baertsoenkaai 3, B-9000 Ghent, Belgium.,Department of Biochemistry, Ghent University , B-9000 Ghent, Belgium.,Bioinformatics Institute Ghent, Ghent University , B-9000 Ghent, Belgium
| |
Collapse
|
41
|
El-Rami F, Nelson K, Xu P. Proteomic Approach for Extracting Cytoplasmic Proteins from Streptococcus sanguinis using Mass Spectrometry. ACTA ACUST UNITED AC 2017; 7:50-57. [PMID: 29152022 PMCID: PMC5693382 DOI: 10.5539/jmbr.v7n1p50] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
Streptococcus sanguinis is a commensal and early colonizer of oral cavity as well as an opportunistic pathogen of infectious endocarditis. Extracting the soluble proteome of this bacterium provides deep insights about the physiological dynamic changes under different growth and stress conditions, thus defining "proteomic signatures" as targets for therapeutic intervention. In this protocol, we describe an experimentally verified approach to extract maximal cytoplasmic proteins from Streptococcus sanguinis SK36 strain. A combination of procedures was adopted that broke the thick cell wall barrier and minimized denaturation of the intracellular proteome, using optimized buffers and a sonication step. Extracted proteome was quantitated using Pierce BCA Protein Quantitation assay and protein bands were macroscopically assessed by Coomassie Blue staining. Finally, a high resolution detection of the extracted proteins was conducted through Synapt G2Si mass spectrometer, followed by label-free relative quantification via Progenesis QI. In conclusion, this pipeline for proteomic extraction and analysis of soluble proteins provides a fundamental tool in deciphering the biological complexity of Streptococcus sanguinis.
Collapse
Affiliation(s)
- Fadi El-Rami
- Philips Institute for Oral Health Research, Virginia Commonwealth University, Richmond, Virginia, USA.,Department of Microbiology and Immunology, Virginia Commonwealth University, Richmond, Virginia, USA
| | - Kristina Nelson
- Chemical and Proteomic Mass Spectrometry Core Facility, Department of Chemistry, Virginia Commonwealth University, Richmond, Virginia, USA
| | - Ping Xu
- Philips Institute for Oral Health Research, Virginia Commonwealth University, Richmond, Virginia, USA.,Department of Microbiology and Immunology, Virginia Commonwealth University, Richmond, Virginia, USA
| |
Collapse
|
42
|
A Golden Age for Working with Public Proteomics Data. Trends Biochem Sci 2017; 42:333-341. [PMID: 28118949 PMCID: PMC5414595 DOI: 10.1016/j.tibs.2017.01.001] [Citation(s) in RCA: 66] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2016] [Revised: 12/13/2016] [Accepted: 01/02/2017] [Indexed: 11/23/2022]
Abstract
Data sharing in mass spectrometry (MS)-based proteomics is becoming a common scientific practice, as is now common in the case of other, more mature ‘omics’ disciplines like genomics and transcriptomics. We want to highlight that this situation, unprecedented in the field, opens a plethora of opportunities for data scientists. First, we explain in some detail some of the work already achieved, such as systematic reanalysis efforts. We also explain existing applications of public proteomics data, such as proteogenomics and the creation of spectral libraries and spectral archives. Finally, we discuss the main existing challenges and mention the first attempts to combine public proteomics data with other types of omics data sets. The field of proteomics has matured and diversified substantially over the past 10 years. Proteomics data are increasingly shared through centralized, public repositories. Standardization efforts have ensured that a large proportion of these public data can be read and processed by any interested researcher. Because any proteomics data set is only partially understood, there is great opportunity for (orthogonal) reuse of public data. While public proteomics data has so far remained outside ethics and privacy discussions, recent work indicates that there is an inherent risk.
Collapse
|
43
|
Gligorijević V, Malod-Dognin N, Pržulj N. Integrative methods for analyzing big data in precision medicine. Proteomics 2016; 16:741-58. [PMID: 26677817 DOI: 10.1002/pmic.201500396] [Citation(s) in RCA: 98] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2015] [Revised: 11/16/2015] [Accepted: 12/09/2015] [Indexed: 12/19/2022]
Abstract
We provide an overview of recent developments in big data analyses in the context of precision medicine and health informatics. With the advance in technologies capturing molecular and medical data, we entered the area of "Big Data" in biology and medicine. These data offer many opportunities to advance precision medicine. We outline key challenges in precision medicine and present recent advances in data integration-based methods to uncover personalized information from big data produced by various omics studies. We survey recent integrative methods for disease subtyping, biomarkers discovery, and drug repurposing, and list the tools that are available to domain scientists. Given the ever-growing nature of these big data, we highlight key issues that big data integration methods will face.
Collapse
Affiliation(s)
| | | | - Nataša Pržulj
- Department of Computing, Imperial College London, London, UK
| |
Collapse
|
44
|
Guldbrandsen A, Farag Y, Kroksveen AC, Oveland E, Lereim RR, Opsahl JA, Myhr KM, Berven FS, Barsnes H. CSF-PR 2.0: An Interactive Literature Guide to Quantitative Cerebrospinal Fluid Mass Spectrometry Data from Neurodegenerative Disorders. Mol Cell Proteomics 2016; 16:300-309. [PMID: 27890865 DOI: 10.1074/mcp.o116.064477] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2016] [Revised: 11/18/2016] [Indexed: 01/23/2023] Open
Abstract
The rapidly growing number of biomedical studies supported by mass spectrometry based quantitative proteomics data has made it increasingly difficult to obtain an overview of the current status of the research field. A better way of organizing the biomedical proteomics information from these studies and making it available to the research community is therefore called for. In the presented work, we have investigated scientific publications describing the analysis of the cerebrospinal fluid proteome in relation to multiple sclerosis, Parkinson's disease and Alzheimer's disease. Based on a detailed set of filtering criteria we extracted 85 data sets containing quantitative information for close to 2000 proteins. This information was made available in CSF-PR 2.0 (http://probe.uib.no/csf-pr-2.0), which includes novel approaches for filtering, visualizing and comparing quantitative proteomics information in an interactive and user-friendly environment. CSF-PR 2.0 will be an invaluable resource for anyone interested in quantitative proteomics on cerebrospinal fluid.
Collapse
Affiliation(s)
- Astrid Guldbrandsen
- From the ‡Proteomics Unit (PROBE), Department of Biomedicine, University of Bergen, 5009 Bergen, Norway.,§KG Jebsen Centre for Multiple Sclerosis Research, Department of Clinical Medicine, University of Bergen, 5020 Bergen, Norway
| | - Yehia Farag
- From the ‡Proteomics Unit (PROBE), Department of Biomedicine, University of Bergen, 5009 Bergen, Norway.,§KG Jebsen Centre for Multiple Sclerosis Research, Department of Clinical Medicine, University of Bergen, 5020 Bergen, Norway
| | - Ann Cathrine Kroksveen
- From the ‡Proteomics Unit (PROBE), Department of Biomedicine, University of Bergen, 5009 Bergen, Norway.,§KG Jebsen Centre for Multiple Sclerosis Research, Department of Clinical Medicine, University of Bergen, 5020 Bergen, Norway
| | - Eystein Oveland
- From the ‡Proteomics Unit (PROBE), Department of Biomedicine, University of Bergen, 5009 Bergen, Norway.,§KG Jebsen Centre for Multiple Sclerosis Research, Department of Clinical Medicine, University of Bergen, 5020 Bergen, Norway
| | - Ragnhild R Lereim
- From the ‡Proteomics Unit (PROBE), Department of Biomedicine, University of Bergen, 5009 Bergen, Norway.,§KG Jebsen Centre for Multiple Sclerosis Research, Department of Clinical Medicine, University of Bergen, 5020 Bergen, Norway
| | - Jill A Opsahl
- From the ‡Proteomics Unit (PROBE), Department of Biomedicine, University of Bergen, 5009 Bergen, Norway.,§KG Jebsen Centre for Multiple Sclerosis Research, Department of Clinical Medicine, University of Bergen, 5020 Bergen, Norway
| | - Kjell-Morten Myhr
- §KG Jebsen Centre for Multiple Sclerosis Research, Department of Clinical Medicine, University of Bergen, 5020 Bergen, Norway.,¶Norwegian Multiple Sclerosis Registry and Biobank, Haukeland University Hospital, 5021 Bergen, Norway
| | - Frode S Berven
- From the ‡Proteomics Unit (PROBE), Department of Biomedicine, University of Bergen, 5009 Bergen, Norway; .,§KG Jebsen Centre for Multiple Sclerosis Research, Department of Clinical Medicine, University of Bergen, 5020 Bergen, Norway.,‖Norwegian Multiple Sclerosis Competence Centre, Department of Neurology, Haukeland University Hospital, 5021 Bergen, Norway
| | - Harald Barsnes
- From the ‡Proteomics Unit (PROBE), Department of Biomedicine, University of Bergen, 5009 Bergen, Norway.,**Department of Clinical Science, University of Bergen, 5020 Bergen, Norway.,‡‡Computational Biology Unit, Department of Informatics, University of Bergen, 5020 Bergen, Norway
| |
Collapse
|
45
|
Deutsch EW, Csordas A, Sun Z, Jarnuczak A, Perez-Riverol Y, Ternent T, Campbell DS, Bernal-Llinares M, Okuda S, Kawano S, Moritz RL, Carver JJ, Wang M, Ishihama Y, Bandeira N, Hermjakob H, Vizcaíno JA. The ProteomeXchange consortium in 2017: supporting the cultural change in proteomics public data deposition. Nucleic Acids Res 2016; 45:D1100-D1106. [PMID: 27924013 PMCID: PMC5210636 DOI: 10.1093/nar/gkw936] [Citation(s) in RCA: 659] [Impact Index Per Article: 82.4] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2016] [Accepted: 10/07/2016] [Indexed: 11/13/2022] Open
Abstract
The ProteomeXchange (PX) Consortium of proteomics resources (http://www.proteomexchange.org) was formally started in 2011 to standardize data submission and dissemination of mass spectrometry proteomics data worldwide. We give an overview of the current consortium activities and describe the advances of the past few years. Augmenting the PX founding members (PRIDE and PeptideAtlas, including the PASSEL resource), two new members have joined the consortium: MassIVE and jPOST. ProteomeCentral remains as the common data access portal, providing the ability to search for data sets in all participating PX resources, now with enhanced data visualization components. We describe the updated submission guidelines, now expanded to include four members instead of two. As demonstrated by data submission statistics, PX is supporting a change in culture of the proteomics field: public data sharing is now an accepted standard, supported by requirements for journal submissions resulting in public data release becoming the norm. More than 4500 data sets have been submitted to the various PX resources since 2012. Human is the most represented species with approximately half of the data sets, followed by some of the main model organisms and a growing list of more than 900 diverse species. Data reprocessing activities are becoming more prominent, with both MassIVE and PeptideAtlas releasing the results of reprocessed data sets. Finally, we outline the upcoming advances for ProteomeXchange.
Collapse
Affiliation(s)
| | - Attila Csordas
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Zhi Sun
- Institute for Systems Biology, Seattle, WA 98109, USA
| | - Andrew Jarnuczak
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Yasset Perez-Riverol
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Tobias Ternent
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | | | - Manuel Bernal-Llinares
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Shujiro Okuda
- Niigata University Graduate School of Medical and Dental Sciences, Niigata 951-8510, Japan
| | - Shin Kawano
- Database Center for Life Science, Joint Support-Center for Data Science Research, Research Organization of Information and Systems, Kashiwa 277-0871, Japan
| | | | - Jeremy J Carver
- Center for Computational Mass Spectrometry, University of California, San Diego (UCSD), La Jolla, CA 92093, USA.,Department Computer Science and Engineering, University of California, San Diego (UCSD), La Jolla, CA 92093, USA.,Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego (UCSD), La Jolla, CA 92093, USA
| | - Mingxun Wang
- Center for Computational Mass Spectrometry, University of California, San Diego (UCSD), La Jolla, CA 92093, USA.,Department Computer Science and Engineering, University of California, San Diego (UCSD), La Jolla, CA 92093, USA.,Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego (UCSD), La Jolla, CA 92093, USA
| | - Yasushi Ishihama
- Graduate School of Pharmaceutical Sciences, Kyoto University, Kyoto 606-8501, Japan
| | - Nuno Bandeira
- Center for Computational Mass Spectrometry, University of California, San Diego (UCSD), La Jolla, CA 92093, USA.,Department Computer Science and Engineering, University of California, San Diego (UCSD), La Jolla, CA 92093, USA.,Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego (UCSD), La Jolla, CA 92093, USA
| | - Henning Hermjakob
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK.,National Center for Protein Sciences, Beijing, China
| | - Juan Antonio Vizcaíno
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| |
Collapse
|
46
|
Lohscheider JN, Friso G, van Wijk KJ. Phosphorylation of plastoglobular proteins in Arabidopsis thaliana. JOURNAL OF EXPERIMENTAL BOTANY 2016; 67:3975-84. [PMID: 26962209 PMCID: PMC4915526 DOI: 10.1093/jxb/erw091] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/08/2023]
Abstract
Plastoglobules (PGs) are plastid lipid-protein particles with a small specialized proteome and metabolome. Among the 30 core PG proteins are six proteins of the ancient ABC1 atypical kinase (ABC1K) family and their locations in an Arabidopsis mRNA-based co-expression network suggested central regulatory roles. To identify candidate ABC1K targets and a possible ABC1K hierarchical phosphorylation network within the chloroplast PG proteome, we searched Arabidopsis phosphoproteomics data from publicly available sources. Evaluation of underlying spectra and/or associated information was challenging for a variety of reasons, but supported pSer sites and a few pThr sites in nine PG proteins, including five FIBRILLINS. PG phosphorylation motifs are discussed in the context of possible responsible kinases. The challenges of collection and evaluation of published Arabidopsis phosphorylation data are discussed, illustrating the importance of deposition of all mass spectrometry data in well-organized repositories such as PRIDE and ProteomeXchange. This study provides a starting point for experimental testing of phosho-sites in PG proteins and also suggests that phosphoproteomics studies specifically designed toward the PG proteome and its ABC1K are needed to understand phosphorylation networks in these specialized particles.
Collapse
Affiliation(s)
- Jens N Lohscheider
- Section of Plant Biology, School of Integrated Plant Science (SIPS), Cornell University, Ithaca, NY 14853, USA
| | - Giulia Friso
- Section of Plant Biology, School of Integrated Plant Science (SIPS), Cornell University, Ithaca, NY 14853, USA
| | - Klaas J van Wijk
- Section of Plant Biology, School of Integrated Plant Science (SIPS), Cornell University, Ithaca, NY 14853, USA
| |
Collapse
|
47
|
Maes E, Kelchtermans P, Bittremieux W, De Grave K, Degroeve S, Hooyberghs J, Mertens I, Baggerman G, Ramon J, Laukens K, Martens L, Valkenborg D. Designing biomedical proteomics experiments: state-of-the-art and future perspectives. Expert Rev Proteomics 2016; 13:495-511. [PMID: 27031651 DOI: 10.1586/14789450.2016.1172967] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
With the current expanded technical capabilities to perform mass spectrometry-based biomedical proteomics experiments, an improved focus on the design of experiments is crucial. As it is clear that ignoring the importance of a good design leads to an unprecedented rate of false discoveries which would poison our results, more and more tools are developed to help researchers designing proteomic experiments. In this review, we apply statistical thinking to go through the entire proteomics workflow for biomarker discovery and validation and relate the considerations that should be made at the level of hypothesis building, technology selection, experimental design and the optimization of the experimental parameters.
Collapse
Affiliation(s)
- Evelyne Maes
- a Applied Bio & molecular systems , VITO , Mol , Belgium.,b CFP , University of Antwerp , Antwerp , Belgium
| | - Pieter Kelchtermans
- b CFP , University of Antwerp , Antwerp , Belgium.,c Medical Biotechnology Center , VIB , Ghent , Belgium.,d Department of Biochemistry , Ghent University , Ghent , Belgium.,e Bioinformatics Institute Ghent , Ghent University , Ghent , Belgium
| | - Wout Bittremieux
- f Department of Mathematics and Computer Science , University of Antwerp , Antwerp , Belgium.,g Biomedical Informatics Research Center Antwerp (biomina) , University of Antwerp/Antwerp University Hospital , Antwerp , Belgium
| | - Kurt De Grave
- h Department of Computer Science , KU Leuven , Leuven , Belgium
| | - Sven Degroeve
- c Medical Biotechnology Center , VIB , Ghent , Belgium.,d Department of Biochemistry , Ghent University , Ghent , Belgium.,e Bioinformatics Institute Ghent , Ghent University , Ghent , Belgium
| | - Jef Hooyberghs
- a Applied Bio & molecular systems , VITO , Mol , Belgium
| | - Inge Mertens
- a Applied Bio & molecular systems , VITO , Mol , Belgium.,b CFP , University of Antwerp , Antwerp , Belgium
| | - Geert Baggerman
- a Applied Bio & molecular systems , VITO , Mol , Belgium.,b CFP , University of Antwerp , Antwerp , Belgium
| | - Jan Ramon
- h Department of Computer Science , KU Leuven , Leuven , Belgium.,i INRIA , Lille , France
| | - Kris Laukens
- f Department of Mathematics and Computer Science , University of Antwerp , Antwerp , Belgium.,g Biomedical Informatics Research Center Antwerp (biomina) , University of Antwerp/Antwerp University Hospital , Antwerp , Belgium
| | - Lennart Martens
- c Medical Biotechnology Center , VIB , Ghent , Belgium.,d Department of Biochemistry , Ghent University , Ghent , Belgium.,e Bioinformatics Institute Ghent , Ghent University , Ghent , Belgium
| | - Dirk Valkenborg
- a Applied Bio & molecular systems , VITO , Mol , Belgium.,b CFP , University of Antwerp , Antwerp , Belgium.,j Interuniversity Institute for Biostatistics and statistical Bioinformatics , Hasselt University , Hasselt , Belgium
| |
Collapse
|
48
|
Martens L. Public proteomics data: How the field has evolved from sceptical inquiry to the promise of in silico proteomics. EUPA OPEN PROTEOMICS 2016; 11:42-44. [PMID: 29900110 PMCID: PMC5988554 DOI: 10.1016/j.euprot.2016.02.005] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/12/2015] [Revised: 02/13/2016] [Accepted: 02/15/2016] [Indexed: 12/23/2022]
Abstract
Proteomics data sharing moved from validation to re-use. New tools and services make data very easily accessible. Metadata provision can still benefit from improvements. Quality control metrics will soon be reported along with submitted data. Data re-use will enable the advent of actual in silico proteomics.
Collapse
Affiliation(s)
- Lennart Martens
- Department of Medical Protein Research, VIB 9000 Ghent, Belgium.,Department of Biochemistry, Ghent University, 9000 Ghent, Belgium.,Bioinformatics Institute Ghent, Ghent University, 9000 Ghent, Belgium
| |
Collapse
|
49
|
Vizcaíno JA, Csordas A, del-Toro N, Dianes JA, Griss J, Lavidas I, Mayer G, Perez-Riverol Y, Reisinger F, Ternent T, Xu QW, Wang R, Hermjakob H. 2016 update of the PRIDE database and its related tools. Nucleic Acids Res 2016; 44:D447-56. [PMID: 26527722 PMCID: PMC4702828 DOI: 10.1093/nar/gkv1145] [Citation(s) in RCA: 2514] [Impact Index Per Article: 314.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2015] [Revised: 10/14/2015] [Accepted: 10/16/2015] [Indexed: 11/18/2022] Open
Abstract
The PRoteomics IDEntifications (PRIDE) database is one of the world-leading data repositories of mass spectrometry (MS)-based proteomics data. Since the beginning of 2014, PRIDE Archive (http://www.ebi.ac.uk/pride/archive/) is the new PRIDE archival system, replacing the original PRIDE database. Here we summarize the developments in PRIDE resources and related tools since the previous update manuscript in the Database Issue in 2013. PRIDE Archive constitutes a complete redevelopment of the original PRIDE, comprising a new storage backend, data submission system and web interface, among other components. PRIDE Archive supports the most-widely used PSI (Proteomics Standards Initiative) data standard formats (mzML and mzIdentML) and implements the data requirements and guidelines of the ProteomeXchange Consortium. The wide adoption of ProteomeXchange within the community has triggered an unprecedented increase in the number of submitted data sets (around 150 data sets per month). We outline some statistics on the current PRIDE Archive data contents. We also report on the status of the PRIDE related stand-alone tools: PRIDE Inspector, PRIDE Converter 2 and the ProteomeXchange submission tool. Finally, we will give a brief update on the resources under development 'PRIDE Cluster' and 'PRIDE Proteomes', which provide a complementary view and quality-scored information of the peptide and protein identification data available in PRIDE Archive.
Collapse
Affiliation(s)
- Juan Antonio Vizcaíno
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Attila Csordas
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Noemi del-Toro
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - José A Dianes
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Johannes Griss
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK Division of Immunology, Allergy and Infectious Diseases, Department of Dermatology, Medical University of Vienna, Austria
| | - Ilias Lavidas
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Gerhard Mayer
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK Medizinisches Proteom Center (MPC), Ruhr-Universität Bochum, D-44801 Bochum, Germany
| | - Yasset Perez-Riverol
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Florian Reisinger
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Tobias Ternent
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Qing-Wei Xu
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK Department of Computer Science and Technology, Hubei University of Education, Wuhan, China
| | - Rui Wang
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Henning Hermjakob
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK National Center for Protein Sciences, Beijing, China
| |
Collapse
|
50
|
Vaudel M, Barsnes H, Ræder H, Berven FS. Using Proteomics Bioinformatics Tools and Resources in Proteogenomic Studies. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2016; 926:65-75. [DOI: 10.1007/978-3-319-42316-6_5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
|