101
|
Fernández-Costa C, Martínez-Bartolomé S, McClatchy D, Yates JR. Improving Proteomics Data Reproducibility with a Dual-Search Strategy. Anal Chem 2020; 92:1697-1701. [PMID: 31880919 DOI: 10.1021/acs.analchem.9b04955] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Mass spectrometry-based proteomics is an invaluable tool for addressing important biological questions. Data-dependent acquisition methods effectuate stochastic acquisition of data in complex mixtures, which results in missing identifications across replicates. We developed a search approach that improves the reproducibility of data acquired from any mass spectrometer. In our approach, a spectral library is built from the identification results from a database search, and then, the library is used to research the same data files to obtain the final result. We showed that higher identification and quantification reproducibility is achieved with the dual-search approach than with a typical database search. Four datasets with different complexity were compared: (1) data from a cell lysate study performed in our lab, (2) data from an interactome study performed in our lab, (3) a publicly available extracellular vesicles dataset, and (4) a publicly available phosphoproteomics dataset. Our results show that the dual-search approach can be widely and easily used to improve data quality in proteomics data.
Collapse
Affiliation(s)
- Carolina Fernández-Costa
- Department of Molecular Medicine , The Scripps Research Institute , La Jolla , California 92037 , United States
| | - Salvador Martínez-Bartolomé
- Department of Molecular Medicine , The Scripps Research Institute , La Jolla , California 92037 , United States
| | - Daniel McClatchy
- Department of Molecular Medicine , The Scripps Research Institute , La Jolla , California 92037 , United States
| | - John R Yates
- Department of Molecular Medicine , The Scripps Research Institute , La Jolla , California 92037 , United States
| |
Collapse
|
102
|
Mohammed Y, Palmblad M. Using the Object-Oriented PowerShell for Simple Proteomics Data Analysis. Methods Mol Biol 2020; 2051:389-405. [PMID: 31552639 DOI: 10.1007/978-1-4939-9744-2_17] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Scripting languages such as Python and Bash are appreciated for solving simple, everyday tasks in bioinformatics. A more recent, object-oriented command shell and scripting language, PowerShell, has many attractive features: an object-oriented interactive command line, fluent navigation and manipulation of XML files, ability to explore and consume Web services from the command line, consistent syntax and grammar, rich regular expressions, and advanced output formatting. The key difference between classical command shells and scripting languages, such as bash, and object-oriented ones, such as PowerShell, is that in the latter the result of a command is a structured object with inherited properties and methods rather than a simple stream of characters. Conveniently, PowerShell is included in all new releases of Microsoft Windows and is available for Linux and macOS, making any data processing script portable. In this chapter we demonstrate how PowerShell in particular allows easy interaction with mass spectrometry data in XML formats, connection to Web services for tools such as BLAST, and presentation of results as formatted text or graphics. These features make PowerShell much more than "yet another scripting language."
Collapse
Affiliation(s)
- Yassene Mohammed
- Center for Proteomics and Metabolomics, Leiden University Medical Center, Leiden, The Netherlands.
- University of Victoria-Genome British Columbia Proteomics Centre, Victoria, BC, Canada.
| | - Magnus Palmblad
- Center for Proteomics and Metabolomics, Leiden University Medical Center, Leiden, The Netherlands
| |
Collapse
|
103
|
Lin YM, Chen CT, Chang JM. MS2CNN: predicting MS/MS spectrum based on protein sequence using deep convolutional neural networks. BMC Genomics 2019; 20:906. [PMID: 31874640 PMCID: PMC6929458 DOI: 10.1186/s12864-019-6297-6] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2019] [Accepted: 11/15/2019] [Indexed: 01/22/2023] Open
Abstract
Background Tandem mass spectrometry allows biologists to identify and quantify protein samples in the form of digested peptide sequences. When performing peptide identification, spectral library search is more sensitive than traditional database search but is limited to peptides that have been previously identified. An accurate tandem mass spectrum prediction tool is thus crucial in expanding the peptide space and increasing the coverage of spectral library search. Results We propose MS2CNN, a non-linear regression model based on deep convolutional neural networks, a deep learning algorithm. The features for our model are amino acid composition, predicted secondary structure, and physical-chemical features such as isoelectric point, aromaticity, helicity, hydrophobicity, and basicity. MS2CNN was trained with five-fold cross validation on a three-way data split on the large-scale human HCD MS2 dataset of Orbitrap LC-MS/MS downloaded from the National Institute of Standards and Technology. It was then evaluated on a publicly available independent test dataset of human HeLa cell lysate from LC-MS experiments. On average, our model shows better cosine similarity and Pearson correlation coefficient (0.690 and 0.632) than MS2PIP (0.647 and 0.601) and is comparable with pDeep (0.692 and 0.642). Notably, for the more complex MS2 spectra of 3+ peptides, MS2PIP is significantly better than both MS2PIP and pDeep. Conclusions We showed that MS2CNN outperforms MS2PIP for 2+ and 3+ peptides and pDeep for 3+ peptides. This implies that MS2CNN, the proposed convolutional neural network model, generates highly accurate MS2 spectra for LC-MS/MS experiments using Orbitrap machines, which can be of great help in protein and peptide identifications. The results suggest that incorporating more data for deep learning model may improve performance.
Collapse
Affiliation(s)
- Yang-Ming Lin
- Department of Computer Science, National Chengchi University, 11605, Taipei City, Taiwan
| | - Ching-Tai Chen
- Institute of Information Science, Academia Sinica, 115, Taipei City, Taiwan
| | - Jia-Ming Chang
- Department of Computer Science, National Chengchi University, 11605, Taipei City, Taiwan.
| |
Collapse
|
104
|
den Ridder M, Daran-Lapujade P, Pabst M. Shot-gun proteomics: why thousands of unidentified signals matter. FEMS Yeast Res 2019; 20:5682490. [DOI: 10.1093/femsyr/foz088] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2019] [Accepted: 12/19/2019] [Indexed: 12/14/2022] Open
Abstract
ABSTRACT
Mass spectrometry-based proteomics has become a constitutional part of the multi-omics toolbox in yeast research, advancing fundamental knowledge of molecular processes and guiding decisions in strain and product developmental pipelines. Nevertheless, post-translational protein modifications (PTMs) continue to challenge the field of proteomics. PTMs are not directly encoded in the genome; therefore, they require a sensitive analysis of the proteome itself. In yeast, the relevance of post-translational regulators has already been established, such as for phosphorylation, which can directly affect the reaction rates of metabolic enzymes. Whereas, the selective analysis of single modifications has become a broadly employed technique, the sensitive analysis of a comprehensive set of modifications still remains a challenge. At the same time, a large number of fragmentation spectra in a typical shot-gun proteomics experiment remain unidentified. It has been estimated that a good proportion of those unidentified spectra originates from unexpected modifications or natural peptide variants. In this review, recent advancements in microbial proteomics for unrestricted protein modification discovery are reviewed, and recent research integrating this additional layer of information to elucidate protein interaction and regulation in yeast is briefly discussed.
Collapse
Affiliation(s)
- Maxime den Ridder
- Delft University of Technology, Department of Biotechnology, van der Maasweg 9, 2629 HZ Delft, The Netherlands
| | - Pascale Daran-Lapujade
- Delft University of Technology, Department of Biotechnology, van der Maasweg 9, 2629 HZ Delft, The Netherlands
| | - Martin Pabst
- Delft University of Technology, Department of Biotechnology, van der Maasweg 9, 2629 HZ Delft, The Netherlands
| |
Collapse
|
105
|
Razi A, Davis JH, Hao Y, Jahagirdar D, Thurlow B, Basu K, Jain N, Gomez-Blanco J, Britton RA, Vargas J, Guarné A, Woodson SA, Williamson JR, Ortega J. Role of Era in assembly and homeostasis of the ribosomal small subunit. Nucleic Acids Res 2019; 47:8301-8317. [PMID: 31265110 PMCID: PMC6736133 DOI: 10.1093/nar/gkz571] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2019] [Revised: 06/11/2019] [Accepted: 06/27/2019] [Indexed: 01/23/2023] Open
Abstract
Assembly factors provide speed and directionality to the maturation process of the 30S subunit in bacteria. To gain a more precise understanding of how these proteins mediate 30S maturation, it is important to expand on studies of 30S assembly intermediates purified from bacterial strains lacking particular maturation factors. To reveal the role of the essential protein Era in the assembly of the 30S ribosomal subunit, we analyzed assembly intermediates that accumulated in Era-depleted Escherichia coli cells using quantitative mass spectrometry, high resolution cryo-electron microscopy and in-cell footprinting. Our combined approach allowed for visualization of the small subunit as it assembled and revealed that with the exception of key helices in the platform domain, all other 16S rRNA domains fold even in the absence of Era. Notably, the maturing particles did not stall while waiting for the platform domain to mature and instead re-routed their folding pathway to enable concerted maturation of other structural motifs spanning multiple rRNA domains. We also found that binding of Era to the mature 30S subunit destabilized helix 44 and the decoding center preventing binding of YjeQ, another assembly factor. This work establishes Era’s role in ribosome assembly and suggests new roles in maintaining ribosome homeostasis.
Collapse
Affiliation(s)
- Aida Razi
- Department of Anatomy and Cell Biology, McGill University, Montreal, Quebec H3A 0C7, Canada
| | - Joseph H Davis
- Department of Molecular Biology, The Scripps Research Institute, La Jolla, CA 92037, USA.,Department of Chemistry and The Skaggs Institute for Chemical Biology, The Scripps Research Institute, La Jolla, CA 92037, USA.,Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Yumeng Hao
- T.C. Jenkins Department of Biophysics, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Dushyant Jahagirdar
- Department of Anatomy and Cell Biology, McGill University, Montreal, Quebec H3A 0C7, Canada
| | - Brett Thurlow
- Department of Biochemistry and Biomedical Sciences, McMaster University, Hamilton, Ontario L8S4K1, Canada
| | - Kaustuv Basu
- Department of Anatomy and Cell Biology, McGill University, Montreal, Quebec H3A 0C7, Canada
| | - Nikhil Jain
- Department of Molecular Virology and Microbiology, Baylor College of Medicine,Houston, TX 77030, USA.,Center for Metagenomics and Microbiome Research, Baylor College of Medicine, Houston, TX 77030, USA
| | - Josue Gomez-Blanco
- Department of Anatomy and Cell Biology, McGill University, Montreal, Quebec H3A 0C7, Canada
| | - Robert A Britton
- Department of Molecular Virology and Microbiology, Baylor College of Medicine,Houston, TX 77030, USA.,Center for Metagenomics and Microbiome Research, Baylor College of Medicine, Houston, TX 77030, USA
| | - Javier Vargas
- Department of Anatomy and Cell Biology, McGill University, Montreal, Quebec H3A 0C7, Canada
| | - Alba Guarné
- Department of Biochemistry, McGill University, Montreal, Quebec H3G 0B1 Canada
| | - Sarah A Woodson
- T.C. Jenkins Department of Biophysics, Johns Hopkins University, Baltimore, MD 21218, USA
| | - James R Williamson
- Department of Molecular Biology, The Scripps Research Institute, La Jolla, CA 92037, USA.,Department of Chemistry and The Skaggs Institute for Chemical Biology, The Scripps Research Institute, La Jolla, CA 92037, USA
| | - Joaquin Ortega
- Department of Anatomy and Cell Biology, McGill University, Montreal, Quebec H3A 0C7, Canada
| |
Collapse
|
106
|
Optimization of TripleTOF spectral simulation and library searching for confident localization of phosphorylation sites. PLoS One 2019; 14:e0225885. [PMID: 31790495 PMCID: PMC6886777 DOI: 10.1371/journal.pone.0225885] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2019] [Accepted: 11/14/2019] [Indexed: 12/31/2022] Open
Abstract
Tandem mass spectrometry (MS/MS) has been used in analysis of proteins and their post-translational modifications. A recently developed data analysis method, which simulates MS/MS spectra of phosphopeptides and performs spectral library searching using SpectraST, facilitates confident localization of phosphorylation sites. However, its performance has been evaluated only on MS/MS spectra acquired using Orbitrap HCD mass spectrometers so far. In this study, we have investigated whether this approach would be applicable to another type of mass spectrometers, and optimized the simulation and search conditions to achieve sensitive and confident site localization. Synthetic phosphopeptides and enriched K562 cell phosphopeptides were analyzed using a TripleTOF 6600 mass spectrometer before and after enzymatic dephosphorylation. Dephosphorylated peptides identified by X!Tandem database searching were subjected to spectral simulation of all possible single phosphorylations using SimPhospho software. Phosphopeptides were identified and localized by SpectraST searching against a library of the simulated spectra. Although no synthetic phosphopeptide was localized at 1% false localization rate under the previous conditions, optimization of the spectral simulation and search conditions for the TripleTOF datasets achieved the localization and improved the sensitivity. Furthermore, the optimized conditions enabled sensitive localization of K562 phosphopeptides at 1% false discovery and localization rates. These results suggest that accurate phosphopeptide simulation of TripleTOF MS/MS spectra is possible and the simulated spectral libraries can be used in SpectraST searching for confident localization of phosphorylation sites.
Collapse
|
107
|
Lucaciu R, Pelikan C, Gerner SM, Zioutis C, Köstlbacher S, Marx H, Herbold CW, Schmidt H, Rattei T. A Bioinformatics Guide to Plant Microbiome Analysis. FRONTIERS IN PLANT SCIENCE 2019; 10:1313. [PMID: 31708944 PMCID: PMC6819368 DOI: 10.3389/fpls.2019.01313] [Citation(s) in RCA: 41] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/24/2019] [Accepted: 09/20/2019] [Indexed: 05/18/2023]
Abstract
Recent evidence for intimate relationship of plants with their microbiota shows that plants host individual and diverse microbial communities that are essential for their survival. Understanding their relatedness using genome-based and high-throughput techniques remains a hot topic in microbiome research. Molecular analysis of the plant holobiont necessitates the application of specific sampling and preparatory steps that also consider sources of unwanted information, such as soil, co-amplified plant organelles, human DNA, and other contaminations. Here, we review state-of-the-art and present practical guidelines regarding experimental and computational aspects to be considered in molecular plant-microbiome studies. We discuss sequencing and "omics" techniques with a focus on the requirements needed to adapt these methods to individual research approaches. The choice of primers and sequence databases is of utmost importance for amplicon sequencing, while the assembly and binning of shotgun metagenomic sequences is crucial to obtain quality data. We discuss specific bioinformatic workflows to overcome the limitation of genome database resources and for covering large eukaryotic genomes such as fungi. In transcriptomics, it is necessary to account for the separation of host mRNA or dual-RNAseq data. Metaproteomics approaches provide a snapshot of the protein abundances within a plant tissue which requires the knowledge of complete and well-annotated plant genomes, as well as microbial genomes. Metabolomics offers a powerful tool to detect and quantify small molecules and molecular changes at the plant-bacteria interface if the necessary requirements with regard to (secondary) metabolite databases are considered. We highlight data integration and complementarity which should help to widen our understanding of the interactions among individual players of the plant holobiont in the future.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | - Hannes Schmidt
- Department of Microbiology and Ecosystem Science, University of Vienna, Vienna, Austria
| | - Thomas Rattei
- Department of Microbiology and Ecosystem Science, University of Vienna, Vienna, Austria
| |
Collapse
|
108
|
Bittremieux W, Laukens K, Noble WS. Extremely Fast and Accurate Open Modification Spectral Library Searching of High-Resolution Mass Spectra Using Feature Hashing and Graphics Processing Units. J Proteome Res 2019; 18:3792-3799. [PMID: 31448616 PMCID: PMC6886738 DOI: 10.1021/acs.jproteome.9b00291] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Open modification searching (OMS) is a powerful search strategy to identify peptides with any type of modification. OMS works by using a very wide precursor mass window to allow modified spectra to match against their unmodified variants, after which the modification types can be inferred from the corresponding precursor mass differences. A disadvantage of this strategy, however, is the large computational cost, because each query spectrum has to be compared against a multitude of candidate peptides. We have previously introduced the ANN-SoLo tool for fast and accurate open spectral library searching. ANN-SoLo uses approximate nearest neighbor indexing to speed up OMS by selecting only a limited number of the most relevant library spectra to compare to an unknown query spectrum. Here we demonstrate how this candidate selection procedure can be further optimized using graphics processing units. Additionally, we introduce a feature hashing scheme to convert high-resolution spectra to low-dimensional vectors. On the basis of these algorithmic advances, along with low-level code optimizations, the new version of ANN-SoLo is up to an order of magnitude faster than its initial version. This makes it possible to efficiently perform open searches on a large scale to gain a deeper understanding about the protein modification landscape. We demonstrate the computational efficiency and identification performance of ANN-SoLo based on a large data set of the draft human proteome. ANN-SoLo is implemented in Python and C++. It is freely available under the Apache 2.0 license at https://github.com/bittremieux/ANN-SoLo .
Collapse
Affiliation(s)
- Wout Bittremieux
- Department of Mathematics and Computer Science , University of Antwerp , 2020 Antwerp , Belgium
- Biomedical Informatics Network Antwerpen (biomina) , 2020 Antwerp , Belgium
- Department of Genome Sciences , University of Washington , Seattle , Washington 98195 , United States
| | - Kris Laukens
- Department of Mathematics and Computer Science , University of Antwerp , 2020 Antwerp , Belgium
- Biomedical Informatics Network Antwerpen (biomina) , 2020 Antwerp , Belgium
| | - William Stafford Noble
- Department of Genome Sciences , University of Washington , Seattle , Washington 98195 , United States
- Department of Computer Science and Engineering , University of Washington , Seattle , Washington 98195 , United States
| |
Collapse
|
109
|
O’Bryon I, Tucker AE, Kaiser BLD, Wahl KL, Merkley ED. Constructing a Tandem Mass Spectral Library for Forensic Ricin Identification. J Proteome Res 2019; 18:3926-3935. [DOI: 10.1021/acs.jproteome.9b00377] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Affiliation(s)
- Isabelle O’Bryon
- Chemical and Biological Signature Sciences Group, Pacific Northwest National Laboratory, Richland, Washington 99354, United States
| | - Abigail E. Tucker
- Chemical and Biological Signature Sciences Group, Pacific Northwest National Laboratory, Richland, Washington 99354, United States
| | - Brooke L. D. Kaiser
- Chemical and Biological Signature Sciences Group, Pacific Northwest National Laboratory, Richland, Washington 99354, United States
| | - Karen L. Wahl
- Chemical and Biological Signature Sciences Group, Pacific Northwest National Laboratory, Richland, Washington 99354, United States
| | - Eric D. Merkley
- Chemical and Biological Signature Sciences Group, Pacific Northwest National Laboratory, Richland, Washington 99354, United States
| |
Collapse
|
110
|
Capelli-Peixoto J, Mule SN, Tano FT, Palmisano G, Stolf BS. Proteomics and Leishmaniasis: Potential Clinical Applications. Proteomics Clin Appl 2019; 13:e1800136. [PMID: 31347770 DOI: 10.1002/prca.201800136] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2018] [Revised: 07/02/2019] [Indexed: 02/06/2023]
Abstract
Leishmaniases are diseases caused by protozoan parasites of the genus Leishmania. They are endemic in 98 countries, affect around 12 million people worldwide and may present several distinct clinical forms. Unfortunately, there are only a few drugs available for treatment of leishmaniasis, which are toxic and not always effective. Different parasite species and different clinical forms require optimization of the treatment or more specific therapies, which are not available. The emergence of resistance is also a matter of concern. Besides, diagnosis can sometimes be complicated due to atypical manifestations and associations with other pathologies. In this review, proteomic data are presented and discussed in terms of their application in important issues in leishmaniasis such as parasite resistance to chemotherapy, diagnosis of active disease in patients and dogs, markers for different clinical forms, identification of virulence factors, and their potential use in vaccination. It is shown that proteomics has contributed to the discovery of potential biomarkers for prognosis, diagnosis, therapeutics, monitoring of disease progression, treatment follow-up and identification of vaccine candidates for specific diseases. However, the authors believe its capabilities have not yet been fully explored for routine clinical analysis for several reasons, which will be presented in this review.
Collapse
Affiliation(s)
- Janaína Capelli-Peixoto
- Leishmaniasis laboratory, Institute of Biomedical Sciences, Department of Parasitology, University of São Paulo, São Paulo, Brazil
| | - Simon Ngao Mule
- GlycoProteomics laboratory, Institute of Biomedical Sciences, Department of Parasitology, University of São Paulo, São Paulo, Brazil
| | - Fabia Tomie Tano
- Leishmaniasis laboratory, Institute of Biomedical Sciences, Department of Parasitology, University of São Paulo, São Paulo, Brazil
| | - Giuseppe Palmisano
- GlycoProteomics laboratory, Institute of Biomedical Sciences, Department of Parasitology, University of São Paulo, São Paulo, Brazil
| | - Beatriz Simonsen Stolf
- Leishmaniasis laboratory, Institute of Biomedical Sciences, Department of Parasitology, University of São Paulo, São Paulo, Brazil
| |
Collapse
|
111
|
Ryu SY, Wendt GA, Chandler CE, Ernst RK, Goodlett DR. Model-Based Spectral Library Approach for Bacterial Identification via Membrane Glycolipids. Anal Chem 2019; 91:11482-11487. [PMID: 31369253 DOI: 10.1021/acs.analchem.9b03340] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
By circumventing the need for a pure colony, MALDI-TOF mass spectrometry of bacterial membrane glycolipids (lipid A) has the potential to identify microbes more rapidly than protein-based methods. However, currently available bioinformatics algorithms (e.g., dot products) do not work well with glycolipid mass spectra such as those produced by lipid A, the membrane anchor of lipopolysaccharide. To address this issue, we propose a spectral library approach coupled with a machine learning technique to more accurately identify microbes. Here, we demonstrate the performance of the model-based spectral library approach for microbial identification using approximately a thousand mass spectra collected from multi-drug-resistant bacteria. At false discovery rates < 1%, our approach identified many more bacterial species than the existing approaches such as the Bruker Biotyper and characterized over 97% of their phenotypes accurately. As the diversity in our glycolipid mass spectral library increases, we anticipate that it will provide valuable information to more rapidly treat infected patients.
Collapse
Affiliation(s)
- So Young Ryu
- School of Community Health Sciences , University of Nevada Reno , Reno , Nevada 89557 , United States
| | - George A Wendt
- School of Community Health Sciences , University of Nevada Reno , Reno , Nevada 89557 , United States.,Department of Epidemiology, School of Public Health , University of California Berkeley , Berkeley , California 94720 , United States
| | - Courtney E Chandler
- Department of Microbial Pathogenesis, School of Dentistry , University of Maryland , Baltimore , Maryland 21201 , United States
| | - Robert K Ernst
- Department of Microbial Pathogenesis, School of Dentistry , University of Maryland , Baltimore , Maryland 21201 , United States
| | - David R Goodlett
- Department of Microbial Pathogenesis, School of Dentistry , University of Maryland , Baltimore , Maryland 21201 , United States.,International Centre for Cancer Vaccine Science , University of Gdansk , 80-308 Gdansk , Poland
| |
Collapse
|
112
|
Zhu C, Miller M, Marpaka S, Vaysberg P, Rühlemann MC, Wu G, Heinsen FA, Tempel M, Zhao L, Lieb W, Franke A, Bromberg Y. Functional sequencing read annotation for high precision microbiome analysis. Nucleic Acids Res 2019; 46:e23. [PMID: 29194524 PMCID: PMC5829635 DOI: 10.1093/nar/gkx1209] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2017] [Accepted: 11/27/2017] [Indexed: 01/16/2023] Open
Abstract
The vast majority of microorganisms on Earth reside in often-inseparable environment-specific communities—microbiomes. Meta-genomic/-transcriptomic sequencing could reveal the otherwise inaccessible functionality of microbiomes. However, existing analytical approaches focus on attributing sequencing reads to known genes/genomes, often failing to make maximal use of available data. We created faser (functional annotation of sequencing reads), an algorithm that is optimized to map reads to molecular functions encoded by the read-correspondent genes. The mi-faser microbiome analysis pipeline, combining faser with our manually curated reference database of protein functions, accurately annotates microbiome molecular functionality. mi-faser’s minutes-per-microbiome processing speed is significantly faster than that of other methods, allowing for large scale comparisons. Microbiome function vectors can be compared between different conditions to highlight environment-specific and/or time-dependent changes in functionality. Here, we identified previously unseen oil degradation-specific functions in BP oil-spill data, as well as functional signatures of individual-specific gut microbiome responses to a dietary intervention in children with Prader–Willi syndrome. Our method also revealed variability in Crohn's Disease patient microbiomes and clearly distinguished them from those of related healthy individuals. Our analysis highlighted the microbiome role in CD pathogenicity, demonstrating enrichment of patient microbiomes in functions that promote inflammation and that help bacteria survive it.
Collapse
Affiliation(s)
- Chengsheng Zhu
- Department of Biochemistry and Microbiology, Rutgers University, 76 Lipman Dr, New Brunswick, NJ 08873, USA
| | - Maximilian Miller
- Department of Biochemistry and Microbiology, Rutgers University, 76 Lipman Dr, New Brunswick, NJ 08873, USA.,Department for Bioinformatics and Computational Biology, Technische Universität München, Boltzmannstr. 3, 85748 Garching/Munich, Germany.,TUM Graduate School, Center of Doctoral Studies in Informatics and its Applications (CeDoSIA), Technische Universität München, 85748 Garching/Munich, Germany
| | - Srinayani Marpaka
- Department of Biochemistry and Microbiology, Rutgers University, 76 Lipman Dr, New Brunswick, NJ 08873, USA
| | - Pavel Vaysberg
- Department of Biochemistry and Microbiology, Rutgers University, 76 Lipman Dr, New Brunswick, NJ 08873, USA
| | - Malte C Rühlemann
- Institute of Clinical Molecular Biology, Kiel University, Kiel, Germany
| | - Guojun Wu
- State Key Laboratory of Microbial Metabolism and Ministry of Education Key Laboratory of Systems Biomedicine, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
| | | | - Marie Tempel
- Institue of Epidemiology, Kiel University, Kiel, Germany
| | - Liping Zhao
- Department of Biochemistry and Microbiology, Rutgers University, 76 Lipman Dr, New Brunswick, NJ 08873, USA.,State Key Laboratory of Microbial Metabolism and Ministry of Education Key Laboratory of Systems Biomedicine, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China.,Canadian Institute for Advanced Research, Toronto, Canada
| | - Wolfgang Lieb
- Institue of Epidemiology, Kiel University, Kiel, Germany
| | - Andre Franke
- Institute of Clinical Molecular Biology, Kiel University, Kiel, Germany
| | - Yana Bromberg
- Department of Biochemistry and Microbiology, Rutgers University, 76 Lipman Dr, New Brunswick, NJ 08873, USA.,Department of Genetics, Rutgers University, Human Genetics Institute, Life Sciences Building, 145 Bevier Road, Piscataway, NJ 08854, USA.,Institute for Advanced Study, Technische Universität München (TUM-IAS), Lichtenbergstrasse 2 a, D-85748 Garching, Germany
| |
Collapse
|
113
|
Amon S, Meier-Abt F, Gillet LC, Dimitrieva S, Theocharides APA, Manz MG, Aebersold R. Sensitive Quantitative Proteomics of Human Hematopoietic Stem and Progenitor Cells by Data-independent Acquisition Mass Spectrometry. Mol Cell Proteomics 2019; 18:1454-1467. [PMID: 30975897 PMCID: PMC6601215 DOI: 10.1074/mcp.tir119.001431] [Citation(s) in RCA: 40] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2018] [Revised: 03/08/2019] [Indexed: 12/14/2022] Open
Abstract
Physiological processes in multicellular organisms depend on the function and interactions of specialized cell types operating in context. Some of these cell types are rare and thus obtainable only in minute quantities. For example, tissue-specific stem and progenitor cells are numerically scarce, but functionally highly relevant, and fulfill critical roles in development, tissue maintenance, and disease. Whereas low numbers of cells are routinely analyzed by genomics and transcriptomics, corresponding proteomic analyses have so far not been possible due to methodological limitations. Here we describe a sensitive and robust quantitative technique based on data-independent acquisition mass spectrometry. We quantified the proteome of sets of 25,000 human hematopoietic stem/multipotent progenitor cells (HSC/MPP) and three committed progenitor cell subpopulations of the myeloid differentiation pathway (common myeloid progenitors, megakaryocyte-erythrocyte progenitors, and granulocyte-macrophage progenitors), isolated by fluorescence-activated cell sorting from five healthy donors. On average, 5,851 protein groups were identified per sample. A subset of 4,131 stringently filtered protein groups was quantitatively compared across the 20 samples, defining unique signatures for each subpopulation. A comparison of proteomic and transcriptomic profiles indicated HSC/MPP-specific divergent regulation of biochemical functions such as telomerase maintenance and quiescence-inducing enzymes, including isocitrate dehydrogenases. These are essential for maintaining stemness and were detected at proteome, but not transcriptome, level. The method is equally applicable to almost any rare cell type, including healthy and cancer stem cells or physiologically and pathologically infiltrating cell populations. It thus provides essential new information toward the detailed biochemical understanding of cell development and functionality in health and disease.
Collapse
Affiliation(s)
- Sabine Amon
- From the ‡Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, 8093 Zurich, Switzerland
| | - Fabienne Meier-Abt
- From the ‡Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, 8093 Zurich, Switzerland;; §Hematology, University and University Hospital Zurich, 8091 Zurich, Switzerland
| | - Ludovic C Gillet
- From the ‡Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, 8093 Zurich, Switzerland
| | - Slavica Dimitrieva
- ¶Functional Genomics Center Zurich, ETH Zurich and University of Zurich, 8057 Zurich, Switzerland
| | | | - Markus G Manz
- §Hematology, University and University Hospital Zurich, 8091 Zurich, Switzerland
| | - Ruedi Aebersold
- From the ‡Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, 8093 Zurich, Switzerland;; ‖Faculty of Science, University of Zurich, 8057 Zurich, Switzerland.
| |
Collapse
|
114
|
Comparative analysis of mRNA and protein degradation in prostate tissues indicates high stability of proteins. Nat Commun 2019; 10:2524. [PMID: 31175306 PMCID: PMC6555818 DOI: 10.1038/s41467-019-10513-5] [Citation(s) in RCA: 31] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2019] [Accepted: 05/16/2019] [Indexed: 01/18/2023] Open
Abstract
Deterioration of biomolecules in clinical tissues is an inevitable pre-analytical process, which affects molecular measurements and thus potentially confounds conclusions from cohort analyses. Here, we investigate the degradation of mRNA and protein in 68 pairs of adjacent prostate tissue samples using RNA-Seq and SWATH mass spectrometry, respectively. To objectively quantify the extent of protein degradation, we develop a numerical score, the Proteome Integrity Number (PIN), that faithfully measures the degree of protein degradation. Our results indicate that protein degradation only affects 5.9% of the samples tested and shows negligible correlation with mRNA degradation in the adjacent samples. These findings are confirmed by independent analyses on additional clinical sample cohorts and across different mass spectrometric methods. Overall, the data show that the majority of samples tested are not compromised by protein degradation, and establish the PIN score as a generic and accurate indicator of sample quality for proteomic analyses. Protein degradation in clinical samples is largely unexplored. Here, the authors analyze the transcriptome and proteome of clinical tissue samples and develop an algorithm to assess protein degradation, showing that protein degradation is negligible in most tissue samples and does not correlate with transcript degradation.
Collapse
|
115
|
Zelanis A, Silva DA, Kitano ES, Liberato T, Fukushima I, Serrano SMT, Tashima AK. A first step towards building spectral libraries as complementary tools for snake venom proteome/peptidome studies. COMPARATIVE BIOCHEMISTRY AND PHYSIOLOGY D-GENOMICS & PROTEOMICS 2019; 31:100599. [PMID: 31181499 DOI: 10.1016/j.cbd.2019.100599] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/28/2019] [Revised: 05/29/2019] [Accepted: 05/29/2019] [Indexed: 01/31/2023]
Abstract
Snake venoms are complex mixtures of a large number of distinct proteins and peptides with biological activity. Peptide spectral libraries are compilations of previously identified MS/MS spectra obtained from proteomics experiments. Here we present the generation and use of a Venom Peptidome and a Venom Proteome spectral library for the analysis of venom proteomes and peptidomes from distinct snake species.
Collapse
Affiliation(s)
- André Zelanis
- Functional Proteomics Laboratory, Department of Science and Technology, Universidade Federal de São Paulo (ICT-UNIFESP), São José dos Campos, SP, Brazil.
| | - Débora A Silva
- Laboratório Especial de Toxinologia Aplicada, Center of Toxins, Immune-Response and Cell Signaling (CeTICS), Instituto Butantan, São Paulo, Brazil
| | - Eduardo S Kitano
- Laboratório Especial de Toxinologia Aplicada, Center of Toxins, Immune-Response and Cell Signaling (CeTICS), Instituto Butantan, São Paulo, Brazil; Laboratório de Imunologia, Hospital de Clínicas, Faculdade de Medicina, Universidade de São Paulo (HCFMUSP), São Paulo, Brazil
| | - Tarcísio Liberato
- Functional Proteomics Laboratory, Department of Science and Technology, Universidade Federal de São Paulo (ICT-UNIFESP), São José dos Campos, SP, Brazil
| | - Isabella Fukushima
- Functional Proteomics Laboratory, Department of Science and Technology, Universidade Federal de São Paulo (ICT-UNIFESP), São José dos Campos, SP, Brazil
| | - Solange M T Serrano
- Laboratório Especial de Toxinologia Aplicada, Center of Toxins, Immune-Response and Cell Signaling (CeTICS), Instituto Butantan, São Paulo, Brazil
| | - Alexandre K Tashima
- Laboratório Especial de Toxinologia Aplicada, Center of Toxins, Immune-Response and Cell Signaling (CeTICS), Instituto Butantan, São Paulo, Brazil; Departamento de Bioquímica, Escola Paulista de Medicina, Universidade Federal de São Paulo (EPM-UNIFESP), São Paulo, SP, Brazil
| |
Collapse
|
116
|
Prosit: proteome-wide prediction of peptide tandem mass spectra by deep learning. Nat Methods 2019; 16:509-518. [DOI: 10.1038/s41592-019-0426-7] [Citation(s) in RCA: 340] [Impact Index Per Article: 56.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2018] [Accepted: 04/18/2019] [Indexed: 11/08/2022]
|
117
|
Maes E, Oeyen E, Boonen K, Schildermans K, Mertens I, Pauwels P, Valkenborg D, Baggerman G. The challenges of peptidomics in complementing proteomics in a clinical context. MASS SPECTROMETRY REVIEWS 2019; 38:253-264. [PMID: 30372792 DOI: 10.1002/mas.21581] [Citation(s) in RCA: 33] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/14/2016] [Accepted: 10/01/2018] [Indexed: 06/08/2023]
Abstract
Naturally occurring peptides, including growth factors, hormones, and neurotransmitters, represent an important class of biomolecules and have crucial roles in human physiology. The study of these peptides in clinical samples is therefore as relevant as ever. Compared to more routine proteomics applications in clinical research, peptidomics research questions are more challenging and have special requirements with regard to sample handling, experimental design, and bioinformatics. In this review, we describe the issues that confront peptidomics in a clinical context. After these hurdles are (partially) overcome, peptidomics will be ready for a successful translation into medical practice.
Collapse
Affiliation(s)
- Evelyne Maes
- Flemish Institute for Technological Research (VITO), Mol, Belgium
- Centre for Proteomics, University of Antwerp, Antwerp, Belgium
- Food and Bio-Based Products, AgResearch Ltd., Lincoln, New Zealand
| | - Eline Oeyen
- Flemish Institute for Technological Research (VITO), Mol, Belgium
- Centre for Proteomics, University of Antwerp, Antwerp, Belgium
| | - Kurt Boonen
- Flemish Institute for Technological Research (VITO), Mol, Belgium
- Centre for Proteomics, University of Antwerp, Antwerp, Belgium
| | - Karin Schildermans
- Flemish Institute for Technological Research (VITO), Mol, Belgium
- Centre for Proteomics, University of Antwerp, Antwerp, Belgium
| | - Inge Mertens
- Flemish Institute for Technological Research (VITO), Mol, Belgium
- Centre for Proteomics, University of Antwerp, Antwerp, Belgium
| | - Patrick Pauwels
- Molecular Pathology Unit, Department of Pathology, Antwerp University Hospital, Edegem, Belgium
| | - Dirk Valkenborg
- Flemish Institute for Technological Research (VITO), Mol, Belgium
- Centre for Proteomics, University of Antwerp, Antwerp, Belgium
- Center for Statistics, Hasselt University, Diepenbeek, Belgium
| | - Geert Baggerman
- Flemish Institute for Technological Research (VITO), Mol, Belgium
- Centre for Proteomics, University of Antwerp, Antwerp, Belgium
| |
Collapse
|
118
|
Fert-Bober J, Venkatraman V, Hunter CL, Liu R, Crowgey EL, Pandey R, Holewinski RJ, Stotland A, Berman BP, Van Eyk JE. Mapping Citrullinated Sites in Multiple Organs of Mice Using Hypercitrullinated Library. J Proteome Res 2019; 18:2270-2278. [PMID: 30990720 PMCID: PMC10363406 DOI: 10.1021/acs.jproteome.9b00118] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Protein citrullination (or deimination), an irreversible post-translational modification, has been implicated in several physiological and pathological processes, including gene expression regulation, apoptosis, rheumatoid arthritis, and Alzheimer's disease. Several research studies have been carried out on citrullination under many conditions. However, until now, challenges in sample preparation and data analysis have made it difficult to confidently identify a citrullinated protein and assign the citrullinated site. To overcome these limitations, we generated a mouse hyper-citrullinated spectral library and set up coordinates to confidently identify and validate citrullinated sites. Using this workflow, we detect a four-fold increase in citrullinated proteome coverage across six mouse organs compared with the current state-of-the art techniques. Our data reveal that the subcellular distribution of citrullinated proteins is tissue-type-dependent and that citrullinated targets are involved in fundamental physiological processes, including the metabolic process. These data represent the first report of a hyper-citrullinated library for the mouse and serve as a central resource for exploring the role of citrullination in this organism.
Collapse
Affiliation(s)
- Justyna Fert-Bober
- The Smidt Heart Institute, Department of Medicine, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States
| | - Vidya Venkatraman
- The Smidt Heart Institute, Department of Medicine, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States
| | | | - Ruining Liu
- The Smidt Heart Institute, Department of Medicine, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States
| | - Erin L. Crowgey
- Nemours Biomedical Research, Nemours - Alfred I. duPont Hospital for Children, Wilmington, Delaware 19803, United States
| | - Rakhi Pandey
- The Smidt Heart Institute, Department of Medicine, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States
| | - Ronald J. Holewinski
- The Smidt Heart Institute, Department of Medicine, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States
| | - Aleksandr Stotland
- The Smidt Heart Institute, Department of Medicine, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States
| | - Benjamin P. Berman
- Bioinformatics and Computational Biology Research Center, Department of Biomedical Sciences, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States
| | - Jennifer E. Van Eyk
- The Smidt Heart Institute, Department of Medicine, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States
| |
Collapse
|
119
|
Muth T, Renard BY. Evaluating de novo sequencing in proteomics: already an accurate alternative to database-driven peptide identification? Brief Bioinform 2019; 19:954-970. [PMID: 28369237 DOI: 10.1093/bib/bbx033] [Citation(s) in RCA: 67] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2016] [Indexed: 01/24/2023] Open
Abstract
While peptide identifications in mass spectrometry (MS)-based shotgun proteomics are mostly obtained using database search methods, high-resolution spectrum data from modern MS instruments nowadays offer the prospect of improving the performance of computational de novo peptide sequencing. The major benefit of de novo sequencing is that it does not require a reference database to deduce full-length or partial tag-based peptide sequences directly from experimental tandem mass spectrometry spectra. Although various algorithms have been developed for automated de novo sequencing, the prediction accuracy of proposed solutions has been rarely evaluated in independent benchmarking studies. The main objective of this work is to provide a detailed evaluation on the performance of de novo sequencing algorithms on high-resolution data. For this purpose, we processed four experimental data sets acquired from different instrument types from collision-induced dissociation and higher energy collisional dissociation (HCD) fragmentation mode using the software packages Novor, PEAKS and PepNovo. Moreover, the accuracy of these algorithms is also tested on ground truth data based on simulated spectra generated from peak intensity prediction software. We found that Novor shows the overall best performance compared with PEAKS and PepNovo with respect to the accuracy of correct full peptide, tag-based and single-residue predictions. In addition, the same tool outpaced the commercial competitor PEAKS in terms of running time speedup by factors of around 12-17. Despite around 35% prediction accuracy for complete peptide sequences on HCD data sets, taken as a whole, the evaluated algorithms perform moderately on experimental data but show a significantly better performance on simulated data (up to 84% accuracy). Further, we describe the most frequently occurring de novo sequencing errors and evaluate the influence of missing fragment ion peaks and spectral noise on the accuracy. Finally, we discuss the potential of de novo sequencing for now becoming more widely used in the field.
Collapse
Affiliation(s)
- Thilo Muth
- Research Group Bioinformatics, Robert Koch Institute, Berlin, Germany
| | - Bernhard Y Renard
- Research Group Bioinformatics, Robert Koch Institute, Berlin, Germany
| |
Collapse
|
120
|
Hutchins PD, Russell JD, Coon JJ. Mapping Lipid Fragmentation for Tailored Mass Spectral Libraries. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2019; 30:659-668. [PMID: 30756325 PMCID: PMC6447430 DOI: 10.1007/s13361-018-02125-y] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/28/2018] [Revised: 12/17/2018] [Accepted: 12/17/2018] [Indexed: 05/17/2023]
Abstract
Libraries of simulated lipid fragmentation spectra enable the identification of hundreds of unique lipids from complex lipid extracts, even when the corresponding lipid reference standards do not exist. Often, these in silico libraries are generated through expert annotation of spectra to extract and model fragmentation rules common to a given lipid class. Although useful for a given sample source or instrumental platform, the time-consuming nature of this approach renders it impractical for the growing array of dissociation techniques and instrument platforms. Here, we introduce Library Forge, a unique algorithm capable of deriving lipid fragment mass-to-charge (m/z) and intensity patterns directly from high-resolution experimental spectra with minimal user input. Library Forge exploits the modular construction of lipids to generate m/z transformed spectra in silico which reveal the underlying fragmentation pathways common to a given lipid class. By learning these fragmentation patterns directly from observed spectra, the algorithm increases lipid spectral matching confidence while reducing spectral library development time from days to minutes. We embed the algorithm within the preexisting lipid analysis architecture of LipiDex to integrate automated and robust library generation within a comprehensive LC-MS/MS lipidomics workflow. Graphical Abstract.
Collapse
Affiliation(s)
- Paul D Hutchins
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, 53706, USA
- Genome Center of Wisconsin, Madison, WI, 53706, USA
| | - Jason D Russell
- Genome Center of Wisconsin, Madison, WI, 53706, USA
- Morgridge Institute for Research, Madison, WI, 53715, USA
| | - Joshua J Coon
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, 53706, USA.
- Genome Center of Wisconsin, Madison, WI, 53706, USA.
- Morgridge Institute for Research, Madison, WI, 53715, USA.
- Department of Biomolecular Chemistry, University of Wisconsin-Madison, Madison, WI, 53706, USA.
| |
Collapse
|
121
|
Ammar C, Berchtold E, Csaba G, Schmidt A, Imhof A, Zimmer R. Multi-Reference Spectral Library Yields Almost Complete Coverage of Heterogeneous LC-MS/MS Data Sets. J Proteome Res 2019; 18:1553-1566. [DOI: 10.1021/acs.jproteome.8b00819] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Affiliation(s)
- Constantin Ammar
- Institute of Bioinformatics, Department of Informatics, Ludwig-Maximilians-Universität München, Amalienstrasse 17, 80333 München, Germany
- Graduate School of Quantitative Biosciences (QBM), Ludwig-Maximilians-Universität München, Feodor-Lynen-Strasse 25, 81337 München, Germany
| | - Evi Berchtold
- Institute of Bioinformatics, Department of Informatics, Ludwig-Maximilians-Universität München, Amalienstrasse 17, 80333 München, Germany
| | - Gergely Csaba
- Institute of Bioinformatics, Department of Informatics, Ludwig-Maximilians-Universität München, Amalienstrasse 17, 80333 München, Germany
| | - Andreas Schmidt
- Zentrallabor für Proteinanalytik (Protein Analysis Unit), Ludwig-Maximilians-Universität München, Grosshaderner Strasse 9, 82152 Planegg-Martinsried, Germany
| | - Axel Imhof
- Graduate School of Quantitative Biosciences (QBM), Ludwig-Maximilians-Universität München, Feodor-Lynen-Strasse 25, 81337 München, Germany
- Zentrallabor für Proteinanalytik (Protein Analysis Unit), Ludwig-Maximilians-Universität München, Grosshaderner Strasse 9, 82152 Planegg-Martinsried, Germany
| | - Ralf Zimmer
- Institute of Bioinformatics, Department of Informatics, Ludwig-Maximilians-Universität München, Amalienstrasse 17, 80333 München, Germany
- Graduate School of Quantitative Biosciences (QBM), Ludwig-Maximilians-Universität München, Feodor-Lynen-Strasse 25, 81337 München, Germany
| |
Collapse
|
122
|
Liu Y, Mi Y, Mueller T, Kreibich S, Williams EG, Van Drogen A, Borel C, Frank M, Germain PL, Bludau I, Mehnert M, Seifert M, Emmenlauer M, Sorg I, Bezrukov F, Bena FS, Zhou H, Dehio C, Testa G, Saez-Rodriguez J, Antonarakis SE, Hardt WD, Aebersold R. Multi-omic measurements of heterogeneity in HeLa cells across laboratories. Nat Biotechnol 2019; 37:314-322. [PMID: 30778230 DOI: 10.1038/s41587-019-0037-y] [Citation(s) in RCA: 210] [Impact Index Per Article: 35.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2018] [Accepted: 11/21/2018] [Indexed: 11/09/2022]
Abstract
Reproducibility in research can be compromised by both biological and technical variation, but most of the focus is on removing the latter. Here we investigate the effects of biological variation in HeLa cell lines using a systems-wide approach. We determine the degree of molecular and phenotypic variability across 14 stock HeLa samples from 13 international laboratories. We cultured cells in uniform conditions and profiled genome-wide copy numbers, mRNAs, proteins and protein turnover rates in each cell line. We discovered substantial heterogeneity between HeLa variants, especially between lines of the CCL2 and Kyoto varieties, and observed progressive divergence within a specific cell line over 50 successive passages. Genomic variability has a complex, nonlinear effect on transcriptome, proteome and protein turnover profiles, and proteotype patterns explain the varying phenotypic response of different cell lines to Salmonella infection. These findings have implications for the interpretation and reproducibility of research results obtained from human cultured cells.
Collapse
Affiliation(s)
- Yansheng Liu
- Department of Pharmacology, Yale University School of Medicine, New Haven, CT, USA. .,Yale Cancer Biology Institute, Yale University, West Haven, CT, USA.
| | - Yang Mi
- Heidelberg University, Faculty of Biosciences, Heidelberg, Germany.,Joint Research Center for Computational Biomedicine (JRC-COMBINE), Faculty of Medicine, RWTH Aachen University, Aachen, Germany
| | - Torsten Mueller
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland
| | | | - Evan G Williams
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland
| | - Audrey Van Drogen
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland
| | - Christelle Borel
- Department of Genetic Medicine and Development, University of Geneva Medical School, and University Hospitals of Geneva, Geneva, Switzerland
| | - Max Frank
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland
| | | | - Isabell Bludau
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland
| | - Martin Mehnert
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland
| | - Michael Seifert
- Institute for Medical Informatics and Biometry, Carl Gustav Carus Faculty of Medicine, Technische Universität Dresden, Dresden, Germany.,National Center for Tumor Diseases, Dresden, Germany
| | | | - Isabel Sorg
- Biozentrum, University of Basel, Basel, Switzerland
| | - Fedor Bezrukov
- Department of Genetic Medicine and Development, University of Geneva Medical School, and University Hospitals of Geneva, Geneva, Switzerland
| | | | - Hu Zhou
- Department of Analytical Chemistry and CAS Key Laboratory of Receptor Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, China
| | | | - Giuseppe Testa
- IEO, European Institute of Oncology IRCCS, Milan, Italy.,Department of Oncology and Hemato-Oncology, University of Milan, Milan, Italy
| | - Julio Saez-Rodriguez
- Joint Research Center for Computational Biomedicine (JRC-COMBINE), Faculty of Medicine, RWTH Aachen University, Aachen, Germany.,Institute for Computational Biomedicine, Heidelberg University, Faculty of Medicine, Bioquant Heidelberg, Germany
| | - Stylianos E Antonarakis
- Department of Genetic Medicine and Development, University of Geneva Medical School, and University Hospitals of Geneva, Geneva, Switzerland.,Service of Genetic Medicine, University Hospitals of Geneva, Geneva, Switzerland.,iGE3 Institute of Genetics and Genomics of Geneva, Geneva, Switzerland
| | | | - Ruedi Aebersold
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland. .,Faculty of Science, University of Zurich, Zurich, Switzerland.
| |
Collapse
|
123
|
Development and comparative analysis of yeast protein extraction protocols for mass spectrometry. Anal Biochem 2019; 567:90-95. [DOI: 10.1016/j.ab.2018.10.028] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2018] [Revised: 10/25/2018] [Accepted: 10/29/2018] [Indexed: 11/22/2022]
|
124
|
Borna NN, Kishita Y, Kohda M, Lim SC, Shimura M, Wu Y, Mogushi K, Yatsuka Y, Harashima H, Hisatomi Y, Fushimi T, Ichimoto K, Murayama K, Ohtake A, Okazaki Y. Mitochondrial ribosomal protein PTCD3 mutations cause oxidative phosphorylation defects with Leigh syndrome. Neurogenetics 2019; 20:9-25. [PMID: 30607703 DOI: 10.1007/s10048-018-0561-9] [Citation(s) in RCA: 42] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2018] [Accepted: 12/06/2018] [Indexed: 02/06/2023]
Abstract
Pentatricopeptide repeat domain proteins are a large family of RNA-binding proteins involved in mitochondrial RNA editing, stability, and translation. Mitochondrial translation machinery defects are an expanding group of genetic diseases in humans. We describe a patient who presented with low birth weight, mental retardation, and optic atrophy. Brain MRI showed abnormal bilateral signals at the basal ganglia and brainstem, and the patient was diagnosed as Leigh syndrome. Exome sequencing revealed two potentially loss-of-function variants [c.415-2A>G, and c.1747_1748insCT (p.Phe583Serfs*3)] in PTCD3 (also known as MRPS39). PTCD3, a member of the pentatricopeptide repeat domain protein family, is a component of the small mitoribosomal subunit. The patient had marked decreases in mitochondrial complex I and IV levels and activities, oxygen consumption and ATP biosynthesis, and generalized mitochondrial translation defects in fibroblasts. Quantitative proteomic analysis revealed decreased levels of the small mitoribosomal subunits. Complementation experiments rescued oxidative phosphorylation complex I and IV levels and activities, ATP biosynthesis, and MT-RNR1 rRNA transcript level, providing functional validation of the pathogenicity of identified variants. This is the first report of an association of PTCD3 mutations with Leigh syndrome along with combined oxidative phosphorylation deficiencies caused by defects in the mitochondrial translation machinery.
Collapse
Affiliation(s)
- Nurun Nahar Borna
- Diagnostics and Therapeutics of Intractable Diseases, Intractable Disease Research Center, Graduate School of Medicine, Juntendo University, Hongo 2-1-1, Bunkyo-ku, Tokyo, 113-8421, Japan
| | - Yoshihito Kishita
- Diagnostics and Therapeutics of Intractable Diseases, Intractable Disease Research Center, Graduate School of Medicine, Juntendo University, Hongo 2-1-1, Bunkyo-ku, Tokyo, 113-8421, Japan
| | - Masakazu Kohda
- Diagnostics and Therapeutics of Intractable Diseases, Intractable Disease Research Center, Graduate School of Medicine, Juntendo University, Hongo 2-1-1, Bunkyo-ku, Tokyo, 113-8421, Japan
| | - Sze Chern Lim
- Diagnostics and Therapeutics of Intractable Diseases, Intractable Disease Research Center, Graduate School of Medicine, Juntendo University, Hongo 2-1-1, Bunkyo-ku, Tokyo, 113-8421, Japan
| | - Masaru Shimura
- Department of Metabolism, Chiba Children's Hospital, Midori, Chiba, 266-0007, Japan
| | - Yibo Wu
- Laboratory for Comprehensive Genomic Analysis, RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa, 230-0045, Japan
| | - Kaoru Mogushi
- Diagnostics and Therapeutics of Intractable Diseases, Intractable Disease Research Center, Graduate School of Medicine, Juntendo University, Hongo 2-1-1, Bunkyo-ku, Tokyo, 113-8421, Japan
| | - Yukiko Yatsuka
- Diagnostics and Therapeutics of Intractable Diseases, Intractable Disease Research Center, Graduate School of Medicine, Juntendo University, Hongo 2-1-1, Bunkyo-ku, Tokyo, 113-8421, Japan
| | - Hiroko Harashima
- Department of Pediatrics, Saitama Medical University, Moroyama, Saitama, 350-0495, Japan
| | - Yuichiro Hisatomi
- Department of Pediatrics, Kumamoto City Hospital, Higashi-ku, Kumamoto, 862-8505, Japan
| | - Takuya Fushimi
- Department of Metabolism, Chiba Children's Hospital, Midori, Chiba, 266-0007, Japan
| | - Keiko Ichimoto
- Department of Metabolism, Chiba Children's Hospital, Midori, Chiba, 266-0007, Japan
| | - Kei Murayama
- Department of Metabolism, Chiba Children's Hospital, Midori, Chiba, 266-0007, Japan
| | - Akira Ohtake
- Department of Pediatrics, Saitama Medical University, Moroyama, Saitama, 350-0495, Japan
| | - Yasushi Okazaki
- Diagnostics and Therapeutics of Intractable Diseases, Intractable Disease Research Center, Graduate School of Medicine, Juntendo University, Hongo 2-1-1, Bunkyo-ku, Tokyo, 113-8421, Japan. .,Laboratory for Comprehensive Genomic Analysis, RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa, 230-0045, Japan.
| |
Collapse
|
125
|
Kunath BJ, Minniti G, Skaugen M, Hagen LH, Vaaje-Kolstad G, Eijsink VGH, Pope PB, Arntzen MØ. Metaproteomics: Sample Preparation and Methodological Considerations. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2019; 1073:187-215. [DOI: 10.1007/978-3-030-12298-0_8] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
|
126
|
Noor Z, Adhikari S, Ranganathan S, Mohamedali A. Quantification of Proteins From Proteomic Analysis. ENCYCLOPEDIA OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY 2019:871-890. [DOI: 10.1016/b978-0-12-809633-8.20677-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/03/2025]
|
127
|
Wang L, Li S, Tang H. msCRUSH: Fast Tandem Mass Spectral Clustering Using Locality Sensitive Hashing. J Proteome Res 2018; 18:147-158. [PMID: 30511858 DOI: 10.1021/acs.jproteome.8b00448] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Large-scale proteomics projects often generate massive and highly redundant tandem mass spectra. Spectral clustering algorithms can reduce the redundancy in these data sets and thus speed up database searching for peptide identification, a major bottleneck for proteomic data analysis. The key challenge of spectral clustering is to reduce the redundancy in the MS/MS spectra data while retaining sufficient sensitivity to identify peptides from the clustered spectra. We present the software msCRUSH, which implements a novel spectral clustering algorithm based on the locality sensitive hashing technique. When tested on a large-scale proteomic data set consisting of 23.6 million spectra (including 14.4 million spectra of charge 2+), msCRUSH runs 6.9-11.3 times faster than the state-of-the-art spectral clustering software, PRIDE Cluster, while achieving higher clustering sensitivity and comparable accuracy. Using the consensus spectra reported by msCRUSH, commonly used spectra search engines MSGF+ and Mascot can identify 3 and 1% more unique peptides, respectively, compared with the identification results from the raw MS/MS spectra at the same false discovery rate (1% FDR) of peptide level. msCRUSH is implemented in C++ and is released as open-source software.
Collapse
Affiliation(s)
- Lei Wang
- School of Informatics, Computing and Engineering , Indiana University , Bloomington , Indiana 47408 , United States
| | - Sujun Li
- School of Informatics, Computing and Engineering , Indiana University , Bloomington , Indiana 47408 , United States
| | - Haixu Tang
- School of Informatics, Computing and Engineering , Indiana University , Bloomington , Indiana 47408 , United States
| |
Collapse
|
128
|
Wang J, Luo D, Liang M, Zhang T, Yin X, Zhang Y, Yang X, Liu W. Spectrum-Effect Relationships between High-Performance Liquid Chromatography (HPLC) Fingerprints and the Antioxidant and Anti-Inflammatory Activities of Collagen Peptides. Molecules 2018; 23:E3257. [PMID: 30544714 PMCID: PMC6320860 DOI: 10.3390/molecules23123257] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2018] [Revised: 12/02/2018] [Accepted: 12/06/2018] [Indexed: 01/11/2023] Open
Abstract
A total of 13 batches of collagen peptide samples were extracted, isolated, and purified from chicken sternal cartilage under various process parameters. The fingerprint profiles of 13 batches of collagen peptides were established by high-performance liquid chromatography (HPLC). In addition, the amino acid profiles and molecular weight distributions of collagen peptides were investigated. The in vitro antioxidant activities of the peptide samples were measured using the 2,2'-Azinobis (3-ethylbenzothiazoline-6-sulphonic acid) diammonium salt (ABTS) assay, the 2,2-diphenyl-1-picrylhydrazyl (DPPH) assay, the ferric-reducing antioxidant power (FRAP) assay and an assay of the oxidative damage induced by hydrogen peroxide (H₂O₂) in the degenerative cartilage cells from the knee joint of rat C518 (C518 cell line). The anti-inflammatory activities of the peptide samples were assessed by measuring the inflammatory responses induced by lipopolysaccharides (LPSes) in C518 cells. Subsequently, the spectrum-effect relationships between HPLC fingerprints and the antioxidant and anti-inflammatory activities of collagen peptides were investigated using grey relational analysis (GRA). Fifteen common peaks were obtained from the HPLC fingerprints of collagen peptides. Each collagen peptide sample had a characteristic set of amino acid types and contents. All of the hydrolysates of the collagen peptides were primarily composed of fractions II (500⁻1000 Da) and III (1000⁻3000 Da). Collagen peptides exhibited good scavenging activity on ABTS radical, DPPH radical, and ferric-reducing antioxidant power. Collagen peptides were also effective against H₂O₂-induced cellular oxidative damage in C518 cells. The antioxidant activity of collagen peptides was due to the low molecular weight and the presence of antioxidant and hydrophobic amino acid residues within its sequence. Collagen peptides significantly inhibited the secretion of inflammatory cytokines IL-1β, TNF-α, and PGE2 in C518 cells. The anti-inflammatory activity of collagen peptides may include increased synthesis of the key components of extracellular matrix (ECM) and inhibited apoptosis of chondrocytes. The GRA results showed that peaks 2, 3, and 8 were the main components contributing to the antioxidant activity of the collagen peptides, whereas peaks 11 and 14 were the main components contributing to the anti-inflammatory activity of the collagen peptides. The components of peaks 8 and 14 were identified as GPRGPPGPVGP and VAIQAVLSLYASGR by UPLC-MS/MS. Those identified collagen peptides offer a potential therapeutic strategy for the treatment of osteoarthritis (OA) due to their antioxidative stress and due to them disturbing the catabolism and anabolism processes in arthrodial cartilage.
Collapse
Affiliation(s)
- Junwen Wang
- College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China.
| | - Dan Luo
- National Engineering Research Center for Nanomedicine, Wuhan 430075, China.
| | - Ming Liang
- Infinitus (China) Co., Ltd., Guangzhou 510665, China.
| | - Ting Zhang
- Infinitus (China) Co., Ltd., Guangzhou 510665, China.
| | - Xiquan Yin
- Infinitus (China) Co., Ltd., Guangzhou 510665, China.
| | - Ying Zhang
- Infinitus (China) Co., Ltd., Guangzhou 510665, China.
| | - Xiangliang Yang
- College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China.
- National Engineering Research Center for Nanomedicine, Wuhan 430075, China.
| | - Wei Liu
- College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China.
- National Engineering Research Center for Nanomedicine, Wuhan 430075, China.
| |
Collapse
|
129
|
Deutsch EW, Perez-Riverol Y, Chalkley RJ, Wilhelm M, Tate S, Sachsenberg T, Walzer M, Käll L, Delanghe B, Böcker S, Schymanski EL, Wilmes P, Dorfer V, Kuster B, Volders PJ, Jehmlich N, Vissers JP, Wolan DW, Wang AY, Mendoza L, Shofstahl J, Dowsey AW, Griss J, Salek RM, Neumann S, Binz PA, Lam H, Vizcaíno JA, Bandeira N, Röst H. Expanding the Use of Spectral Libraries in Proteomics. J Proteome Res 2018; 17:4051-4060. [PMID: 30270626 PMCID: PMC6443480 DOI: 10.1021/acs.jproteome.8b00485] [Citation(s) in RCA: 46] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
The 2017 Dagstuhl Seminar on Computational Proteomics provided an opportunity for a broad discussion on the current state and future directions of the generation and use of peptide tandem mass spectrometry spectral libraries. Their use in proteomics is growing slowly, but there are multiple challenges in the field that must be addressed to further increase the adoption of spectral libraries and related techniques. The primary bottlenecks are the paucity of high quality and comprehensive libraries and the general difficulty of adopting spectral library searching into existing workflows. There are several existing spectral library formats, but none captures a satisfactory level of metadata; therefore, a logical next improvement is to design a more advanced, Proteomics Standards Initiative-approved spectral library format that can encode all of the desired metadata. The group discussed a series of metadata requirements organized into three designations of completeness or quality, tentatively dubbed bronze, silver, and gold. The metadata can be organized at four different levels of granularity: at the collection (library) level, at the individual entry (peptide ion) level, at the peak (fragment ion) level, and at the peak annotation level. Strategies for encoding mass modifications in a consistent manner and the requirement for encoding high-quality and commonly seen but as-yet-unidentified spectra were discussed. The group also discussed related topics, including strategies for comparing two spectra, techniques for generating representative spectra for a library, approaches for selection of optimal signature ions for targeted workflows, and issues surrounding the merging of two or more libraries into one. We present here a review of this field and the challenges that the community must address in order to accelerate the adoption of spectral libraries in routine analysis of proteomics datasets.
Collapse
Affiliation(s)
- Eric W. Deutsch
- Institute for Systems Biology, Seattle, Washington, 98109, United States
| | - Yasset Perez-Riverol
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Robert J. Chalkley
- University of California San Francisco, San Francisco, 94158, California, United States
| | - Mathias Wilhelm
- Chair of Proteomics and Bioanalytics, Technical University of Munich, Freising, 85354, Germany
| | | | - Timo Sachsenberg
- Department of Computer Science, Center for Bioinformatics, University of Tübingen, Sand 14, Tübingen, 72076, Germany
| | - Mathias Walzer
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Lukas Käll
- Science for Life Laboratory, School of Engineering Sciences in Chemistry, Biotechnology and Health, KTH − Royal Institute of Technology, Stockholm 114 28, Sweden
| | - Bernard Delanghe
- Thermo Fisher Scientific Bremen, Hanna-Kunath Str. 11, 28199 Bremen, Germany
| | - Sebastian Böcker
- Chair for Bioinformatics, Friedrich-Schiller-University Jena, 07743 Jena, Germany
| | - Emma L. Schymanski
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, 6 avenue du Swing, L-4367 Belvaux, Luxembourg
| | - Paul Wilmes
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, 6 avenue du Swing, L-4367 Belvaux, Luxembourg
| | - Viktoria Dorfer
- University of Applied Sciences Upper Austria, Bioinformatics Research Group, Hagenberg, 4232, Austria
| | - Bernhard Kuster
- Chair of Proteomics and Bioanalytics, Technical University of Munich, Freising, 85354, Germany
- Bavarian Biomolecular Mass Spectrometry Center (BayBioMS), Technical University of Munich, Freising, 85354, Germany
| | | | - Nico Jehmlich
- Helmholtz-Centre for Environmental Research - UFZ, Leipzig, Germany
| | | | - Dennis W. Wolan
- Department of Molecular Medicine, The Scripps Research Institute, 92037, La Jolla, California, United States
| | - Ana Y. Wang
- Department of Molecular Medicine, The Scripps Research Institute, 92037, La Jolla, California, United States
| | - Luis Mendoza
- Institute for Systems Biology, Seattle, Washington, 98109, United States
| | - Jim Shofstahl
- Thermo Fisher Scientific, 355 River Oaks Parkway San Jose, CA 95134
| | - Andrew W. Dowsey
- Department of Population Health Sciences and Bristol Veterinary School, Faculty of Health Sciences, University of Bristol, Bristol BS9 1BN, UK
| | - Johannes Griss
- Division of Immunology, Allergy and Infectious Diseases, Department of Dermatology, Medical University of Vienna, Währinger Gürtel 18-20, Vienna 1090, Austria
| | - Reza M. Salek
- The International Agency for Research on Cancer (IARC), 150 Cours Albert Thomas, 69372 Lyon CEDEX 08, France
| | - Steffen Neumann
- Leibniz Institute of Plant Biochemistry, Department of Stress and Developmental Biology, 06120 Halle, Germany
- German Centre for Integrative Biodiversity Research (iDiv), Halle-Jena-Leipzig, 04103 Leipzig, Germany
| | - Pierre-Alain Binz
- Clinical Chemistry Service, Centre Hospitalier Universitaire Vaudois, 1011 Lausanne, Switzerland
| | - Henry Lam
- Department of Chemical and Biological Engineering, the Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong
| | - Juan Antonio Vizcaíno
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Nuno Bandeira
- Center for Computational Mass Spectrometry, Department of Computer Science and Engineering, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, 92093-0404, USA
| | - Hannes Röst
- The Donnelly Centre, University of Toronto, 160 College St., Toronto, ON, M5S 3E1, Canada
| |
Collapse
|
130
|
Chromatogram libraries improve peptide detection and quantification by data independent acquisition mass spectrometry. Nat Commun 2018; 9:5128. [PMID: 30510204 PMCID: PMC6277451 DOI: 10.1038/s41467-018-07454-w] [Citation(s) in RCA: 340] [Impact Index Per Article: 48.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2018] [Accepted: 10/16/2018] [Indexed: 01/27/2023] Open
Abstract
Data independent acquisition (DIA) mass spectrometry is a powerful technique that is improving the reproducibility and throughput of proteomics studies. Here, we introduce an experimental workflow that uses this technique to construct chromatogram libraries that capture fragment ion chromatographic peak shape and retention time for every detectable peptide in a proteomics experiment. These coordinates calibrate protein databases or spectrum libraries to a specific mass spectrometer and chromatography setup, facilitating DIA-only pipelines and the reuse of global resource libraries. We also present EncyclopeDIA, a software tool for generating and searching chromatogram libraries, and demonstrate the performance of our workflow by quantifying proteins in human and yeast cells. We find that by exploiting calibrated retention time and fragmentation specificity in chromatogram libraries, EncyclopeDIA can detect 20–25% more peptides from DIA experiments than with data dependent acquisition-based spectrum libraries alone. Data-independent acquisition (DIA)-based proteomics often relies on mass spectrum libraries from data-dependent acquisition experiments. Here, the authors present a method to generate DIA-based chromatogram libraries, enabling DIA-only workflows and detecting more peptides than with spectrum libraries alone.
Collapse
|
131
|
Bittremieux W, Meysman P, Noble WS, Laukens K. Fast Open Modification Spectral Library Searching through Approximate Nearest Neighbor Indexing. J Proteome Res 2018; 17:3463-3474. [PMID: 30184435 PMCID: PMC6173621 DOI: 10.1021/acs.jproteome.8b00359] [Citation(s) in RCA: 41] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Open modification searching (OMS) is a powerful search strategy that identifies peptides carrying any type of modification by allowing a modified spectrum to match against its unmodified variant by using a very wide precursor mass window. A drawback of this strategy, however, is that it leads to a large increase in search time. Although performing an open search can be done using existing spectral library search engines by simply setting a wide precursor mass window, none of these tools have been optimized for OMS, leading to excessive runtimes and suboptimal identification results. We present the ANN-SoLo tool for fast and accurate open spectral library searching. ANN-SoLo uses approximate nearest neighbor indexing to speed up OMS by selecting only a limited number of the most relevant library spectra to compare to an unknown query spectrum. This approach is combined with a cascade search strategy to maximize the number of identified unmodified and modified spectra while strictly controlling the false discovery rate as well as a shifted dot product score to sensitively match modified spectra to their unmodified counterparts. ANN-SoLo achieves state-of-the-art performance in terms of speed and the number of identifications. On a previously published human cell line data set, ANN-SoLo confidently identifies more spectra than SpectraST or MSFragger and achieves a speedup of an order of magnitude compared with SpectraST. ANN-SoLo is implemented in Python and C++. It is freely available under the Apache 2.0 license at https://github.com/bittremieux/ANN-SoLo .
Collapse
Affiliation(s)
- Wout Bittremieux
- Department of Mathematics and Computer Science , University of Antwerp , 2020 Antwerp , Belgium
- Biomedical Informatics Network Antwerpen (biomina) , 2020 Antwerp , Belgium
- Department of Genome Sciences , University of Washington , Seattle , Washington 98195 , United States
| | - Pieter Meysman
- Department of Mathematics and Computer Science , University of Antwerp , 2020 Antwerp , Belgium
- Biomedical Informatics Network Antwerpen (biomina) , 2020 Antwerp , Belgium
| | - William Stafford Noble
- Department of Genome Sciences , University of Washington , Seattle , Washington 98195 , United States
- Department of Computer Science and Engineering , University of Washington , Seattle , Washington 98195 , United States
| | - Kris Laukens
- Department of Mathematics and Computer Science , University of Antwerp , 2020 Antwerp , Belgium
- Biomedical Informatics Network Antwerpen (biomina) , 2020 Antwerp , Belgium
| |
Collapse
|
132
|
Assembling the Community-Scale Discoverable Human Proteome. Cell Syst 2018; 7:412-421.e5. [PMID: 30172843 PMCID: PMC6279426 DOI: 10.1016/j.cels.2018.08.004] [Citation(s) in RCA: 108] [Impact Index Per Article: 15.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2017] [Revised: 12/22/2017] [Accepted: 08/03/2018] [Indexed: 01/15/2023]
Abstract
The increasing throughput and sharing of proteomics mass spectrometry data have now yielded over one-third of a million public mass spectrometry runs. However, these discoveries are not continuously aggregated in an open and error-controlled manner, which limits their utility. To facilitate the reusability of these data, we built the MassIVE Knowledge Base (MassIVE-KB), a community-wide, continuously updating knowledge base that aggregates proteomics mass spectrometry discoveries into an open reusable format with full provenance information for community scrutiny. Reusing >31 TB of public human data stored in a mass spectrometry interactive virtual environment (MassIVE), the MassIVE-KB contains >2.1 million precursors from 19,610 proteins (48% larger than before; 97% of the total) and doubles proteome coverage to 6 million amino acids (54% of the proteome) with strict library-scale false discovery controls, thereby providing evidence for 430 proteins for which sufficient protein-level evidence was previously missing. Furthermore, MassIVE-KB can inform experimental design, helps identify and quantify new data, and provides tools for community construction of specialized spectral libraries. Wang et al. introduce MassIVE-KB, a program designed to distill the entire community’s mass spectrometry data into reusable spectral library resources. As a result, the statistically-significant discovery of a peptide or protein in a single researcher’s data will thus be made available to the whole community to support its identification (in shotgun experiments) or quantitative detection (in targeted experiments) in all future analyses.
Collapse
|
133
|
Muth T, Hartkopf F, Vaudel M, Renard BY. A Potential Golden Age to Come-Current Tools, Recent Use Cases, and Future Avenues for De Novo Sequencing in Proteomics. Proteomics 2018; 18:e1700150. [PMID: 29968278 DOI: 10.1002/pmic.201700150] [Citation(s) in RCA: 39] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2018] [Revised: 05/23/2018] [Indexed: 01/15/2023]
Abstract
In shotgun proteomics, peptide and protein identification is most commonly conducted using database search engines, the method of choice when reference protein sequences are available. Despite its widespread use the database-driven approach is limited, mainly because of its static search space. In contrast, de novo sequencing derives peptide sequence information in an unbiased manner, using only the fragment ion information from the tandem mass spectra. In recent years, with the improvements in MS instrumentation, various new methods have been proposed for de novo sequencing. This review article provides an overview of existing de novo sequencing algorithms and software tools ranging from peptide sequencing to sequence-to-protein mapping. Various use cases are described for which de novo sequencing was successfully applied. Finally, limitations of current methods are highlighted and new directions are discussed for a wider acceptance of de novo sequencing in the community.
Collapse
Affiliation(s)
- Thilo Muth
- Bioinformatics Unit (MF 1), Department for Methods Development and Research Infrastructure, Robert Koch Institute, 13353, Berlin, Germany
| | - Felix Hartkopf
- Bioinformatics Unit (MF 1), Department for Methods Development and Research Infrastructure, Robert Koch Institute, 13353, Berlin, Germany
| | - Marc Vaudel
- K.G. Jebsen Center for Diabetes Research, Department of Clinical Science, University of Bergen, 5020, Bergen, Norway.,Center for Medical Genetics and Molecular Medicine, Haukeland University Hospital, 5020, Bergen, Norway
| | - Bernhard Y Renard
- Bioinformatics Unit (MF 1), Department for Methods Development and Research Infrastructure, Robert Koch Institute, 13353, Berlin, Germany
| |
Collapse
|
134
|
Shen J, Pagala VR, Breuer AM, Peng J, Bin Ma, Wang X. Spectral Library Search Improves Assignment of TMT Labeled MS/MS Spectra. J Proteome Res 2018; 17:3325-3331. [PMID: 30096983 DOI: 10.1021/acs.jproteome.8b00594] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Tandem mass tag (TMT)-based liquid chromatography-tandem mass spectrometry (LC-MS/MS) is a proven approach for large-scale multiplexed protein quantification. However, the identification of TMT-labeled peptides is compromised by the labeling during traditional sequence database searches. In this study, we aim to use a spectral library search to increase the sensitivity and specificity of peptide identification for TMT-based MS data. Compared to MS/MS spectra of unlabeled peptides, the spectra of TMT-labeled counterparts usually display intensified b ions, suggesting that TMT labeling can alter product ion patterns during MS/MS fragementation. We compiled a human TMT spectral library of 401,168 unique peptides of high quality from millions of peptide-spectrum matches in tens of profiling projects, matching to 14,048 nonredundant proteins (13,953 genes). A mouse TMT spectral library of similar size was also constructed. The libraries were subsequently appended with decoy spectra to evaluate the false discovery rate, which was validated by a simulated null TMT data set. The performance of the library search was further optimized by removing TMT reporter ions and selecting an appropriate library construction method. Finally, we searched a human TMT data set against the spectral library to demonstrate that the spectral library outperformed the sequence database. Both human and mouse TMT libraries were made publicly available to the research community.
Collapse
Affiliation(s)
- Jianqiao Shen
- Department of Computer Science , University of Waterloo , Waterloo , Ontario N2L 3G1 , Canada
| | | | | | | | - Bin Ma
- Department of Computer Science , University of Waterloo , Waterloo , Ontario N2L 3G1 , Canada
| | | |
Collapse
|
135
|
Perez‐Riverol Y, Vizcaíno JA, Griss J. Future Prospects of Spectral Clustering Approaches in Proteomics. Proteomics 2018; 18:e1700454. [PMID: 29882266 PMCID: PMC6099476 DOI: 10.1002/pmic.201700454] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2018] [Revised: 05/23/2018] [Indexed: 12/14/2022]
Abstract
In this article, current and future applications of spectral clustering are discussed in the context of mass spectrometry-based proteomics approaches. First of all, the main algorithms and tools that can currently be used to perform spectral clustering are introduced. In addition, its main applications and their use in current computational proteomics workflows are explained, including the generation of spectral libraries and spectral archives. Finally, possible future directions for spectral clustering, including its potential use to achieve a deeper coverage of the proteome and the discovery of novel post-translational modifications and single amino acid variants.
Collapse
Affiliation(s)
- Yasset Perez‐Riverol
- European Molecular Biology LaboratoryEuropean Bioinformatics Institute (EMBL‐EBI)Wellcome Trust Genome CampusHinxtonCambridgeCB10 1SDUK
| | - Juan Antonio Vizcaíno
- European Molecular Biology LaboratoryEuropean Bioinformatics Institute (EMBL‐EBI)Wellcome Trust Genome CampusHinxtonCambridgeCB10 1SDUK
| | - Johannes Griss
- European Molecular Biology LaboratoryEuropean Bioinformatics Institute (EMBL‐EBI)Wellcome Trust Genome CampusHinxtonCambridgeCB10 1SDUK
- Division of ImmunologyAllergy and Infectious DiseasesDepartment of DermatologyMedical University of Vienna1090ViennaAustria
| |
Collapse
|
136
|
Development of a Trypanosoma cruzi strain typing assay using MS2 peptide spectral libraries (Tc-STAMS2). PLoS Negl Trop Dis 2018; 12:e0006351. [PMID: 29608573 PMCID: PMC5897034 DOI: 10.1371/journal.pntd.0006351] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2017] [Revised: 04/12/2018] [Accepted: 02/27/2018] [Indexed: 11/24/2022] Open
Abstract
Background Chagas disease also known as American trypanosomiasis is caused by the protozoan Trypanosoma cruzi. Over the last 30 years, Chagas disease has expanded from a neglected parasitic infection of the rural population to an urbanized chronic disease, becoming a potentially emergent global health problem. T. cruzi strains were assigned to seven genetic groups (TcI-TcVI and TcBat), named discrete typing units (DTUs), which represent a set of isolates that differ in virulence, pathogenicity and immunological features. Indeed, diverse clinical manifestations (from asymptomatic to highly severe disease) have been attempted to be related to T.cruzi genetic variability. Due to that, several DTU typing methods have been introduced. Each method has its own advantages and drawbacks such as high complexity and analysis time and all of them are based on genetic signatures. Recently, a novel method discriminated bacterial strains using a peptide identification-free, genome sequence-independent shotgun proteomics workflow. Here, we aimed to develop a Trypanosoma cruzi Strain Typing Assay using MS/MS peptide spectral libraries, named Tc-STAMS2. Methods/Principal findings The Tc-STAMS2 method uses shotgun proteomics combined with spectral library search to assign and discriminate T. cruzi strains independently on the genome knowledge. The method is based on the construction of a library of MS/MS peptide spectra built using genotyped T. cruzi reference strains. For identification, the MS/MS peptide spectra of unknown T. cruzi cells are identified using the spectral matching algorithm SpectraST. The Tc-STAMS2 method allowed correct identification of all DTUs with high confidence. The method was robust towards different sample preparations, length of chromatographic gradients and fragmentation techniques. Moreover, a pilot inter-laboratory study showed the applicability to different MS platforms. Conclusions and significance This is the first study that develops a MS-based platform for T. cruzi strain typing. Indeed, the Tc-STAMS2 method allows T. cruzi strain typing using MS/MS spectra as discriminatory features and allows the differentiation of TcI-TcVI DTUs. Similar to genomic-based strategies, the Tc-STAMS2 method allows identification of strains within DTUs. Its robustness towards different experimental and biological variables makes it a valuable complementary strategy to the current T. cruzi genotyping assays. Moreover, this method can be used to identify DTU-specific features correlated with the strain phenotype. Chagas disease is one of the most important neglected diseases with an estimated number of 6–7 million infected individuals, the majority living in Central and South America. The Trypanosoma cruzi (T.cruzi) protozoan parasite is the etiological agent of Chagas disease. T.cruzi is highly genetically diverse and a new nomenclature assigned each strain to seven genetic groups (TcI-TcVI and Tcbat), named Discrete Typing Units (DTUs), based on their biochemical, immunological and phenotypical characteristics. T.cruzi DTUs have been correlated to diverse clinical outcomes highlighting the importance of molecular epidemiological screens. Despite the development of T.cruzi typing methods based on genetic signatures, each method presents its own advantages and challenges. The work presented here shows the application of mass spectrometry for Trypanosoma cruzi Strain Typing Assay using MS2 peptide spectral libraries (Tc-STAMS2). The novelty of the method is based on the use of peptide fragmentation spectra as strain-specific fingerprints to classify and identify DTUs. Initially, a spectra library is generated from characterized T.cruzi strains. The library is subsequently inspected using MS/MS spectra from unknown strains and confidently assigned to a specific strain in an automated and computationally-driven approach. The Tc-STAMS2 method was challenged to test several variables such as sample type and preparation, instrument setup and identification platform. Tc-STAMS2 provided high confidence and robustness in T.cruzi strain typing. The Tc-STAMS2 method represents a proof-of-concept of a complementary strategy to the current DNA-based T. cruzi genotyping methods. Moreover, the method allows the identification of strain-specific features that could be related to the biology of T.cruzi strains and their clinical outcomes.
Collapse
|
137
|
Ritz D, Sani E, Debiec H, Ronco P, Neri D, Fugmann T. Membranal and Blood-Soluble HLA Class II Peptidome Analyses Using Data-Dependent and Independent Acquisition. Proteomics 2018; 18:e1700246. [PMID: 29314611 DOI: 10.1002/pmic.201700246] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2017] [Revised: 11/29/2017] [Indexed: 12/18/2022]
Abstract
The interaction between HLA class II peptide complexes on antigen-presenting cells and CD4+ T cells is of fundamental importance for anticancer and antipathogen immunity as well as for the maintenance of immunological tolerance. To study CD4+ T cell reactivities, detailed knowledge of the presented peptides is necessary. In recent years, dramatic advances in the characterization of membranal and soluble HLA class I peptidomes could be observed. However, the same is not true for HLA class II peptidomes, where only few studies identify more than hundred peptides. Here we describe a MS-based workflow for the characterization of membranal and soluble HLA class II DR and DQ peptidomes. Using this workflow, we identify a total of 8595 and 3727 HLA class II peptides from Maver-1 and DOHH2 cells, respectively. Based on this data, a motif-based binding predictor is developed and compared to NetMHCIIpan 3.1. We then apply the workflow to human plasma, resulting in the identification of between 34 and 152 HLA-DR and between 100 and 180 HLA-DQ peptides, respectively. Finally, we implement a data-independent acquisition workflow to increase reproducibility and sensitivity of HLA class II peptidome characterizations.
Collapse
Affiliation(s)
- Danilo Ritz
- Philochem AG, Libernstrasse 3, Otelfingen, Switzerland
| | | | - Hanna Debiec
- Inserm UMRS 1155, Hôpital Tenon, Paris, France.,Sorbonne Universités, UPMC Univ Paris 06, Paris, France
| | - Pierre Ronco
- Inserm UMRS 1155, Hôpital Tenon, Paris, France.,Sorbonne Universités, UPMC Univ Paris 06, Paris, France
| | - Dario Neri
- Institute of Pharmaceutical Sciences, ETH Zurich, Zurich, Switzerland
| | - Tim Fugmann
- Philochem AG, Libernstrasse 3, Otelfingen, Switzerland
| |
Collapse
|
138
|
Messner S, Fredriksson L, Lauschke VM, Roessger K, Escher C, Bober M, Kelm JM, Ingelman-Sundberg M, Moritz W. Transcriptomic, Proteomic, and Functional Long-Term Characterization of Multicellular Three-Dimensional Human Liver Microtissues. APPLIED IN VITRO TOXICOLOGY 2018; 4:1-12. [PMID: 32953943 PMCID: PMC7500040 DOI: 10.1089/aivt.2017.0022] [Citation(s) in RCA: 41] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
Three-Dimensional (3D) liver microtissues, specifically prepared from primary human hepatocytes (PHH) in coculture with nonparenchymal cells (NPCs), have been shown to be a valuable tool for in vitro toxicology. However, a lack of thorough characterization on a functional, transcriptomic, and proteomic level of such models during long-term cultivation is evident. By integrating multiple omics technologies, we provide in this study an in-depth long-term characterization of 3D microtissues composed of PHH from three different donors cocultured with primary NPCs. The 3D human liver microtissues (hLiMTs) exhibited stable adenosine triphosphate (ATP) content and albumin secretion over 5 weeks. Histological analysis indicated a healthy liver tissue with polarized expression of bile salt export pump (BSEP) and multidrug resistance protein 2 (MRP2) in a structure reminiscent of bile canaliculi. The 3D microtissues exhibited stable basal and inducible cytochrome P450 activities up to 5 weeks in culture. Analysis of 40,716 transcripts using RNA arrays revealed distinct similarities to native human liver gene expression. Long-term culture showed a stable phenotype up to 5 weeks, with differences in liver gene expression primarily attributed to individual donors. Proteomic profiling of 2200 unique proteins by label-free LC-MS/MS revealed a relatively stable protein expression where only 7.3% were up- or downregulated more than twofold from day 7 to 35 in culture. Taken together, these results suggest that hLiMTs represent a responsive and physiologically relevant in vitro liver model that maintains stable function over 5 weeks and is therefore well suited for repeated-dose toxicity testing.
Collapse
Affiliation(s)
| | - Lisa Fredriksson
- Department of Physiology and Pharmacology, Karolinska Institutet, Stockholm, Sweden
| | - Volker M. Lauschke
- Department of Physiology and Pharmacology, Karolinska Institutet, Stockholm, Sweden
| | | | | | | | | | | | | |
Collapse
|
139
|
Junker S, Maaβ S, Otto A, Michalik S, Morgenroth F, Gerth U, Hecker M, Becher D. Spectral Library Based Analysis of Arginine Phosphorylations in Staphylococcus aureus. Mol Cell Proteomics 2018; 17:335-348. [PMID: 29183913 PMCID: PMC5795395 DOI: 10.1074/mcp.ra117.000378] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2017] [Indexed: 12/19/2022] Open
Abstract
Reversible protein phosphorylation is one of the major mechanisms in the regulation of protein expression and protein activity, controlling physiological functions of the important human pathogen Staphylococcus aureus Phosphorylations at serine, threonine and tyrosine are known to influence for example protein activity in central metabolic pathways and the more energy-rich phosphorylations at histidine, aspartate or cysteine can be found as part of two component system sensor domains or mediating bacterial virulence. In addition to these well-known phosphorylations, the phosphorylation at arginine residues plays an essential role. Hence, the deletion mutant S. aureus COL ΔptpB (protein tyrosine phosphatase B) was studied because the protein PtpB is assumed to be an arginine phosphatase. A gel-free approach was applied to analyze the changes in the phosphoproteome of the deletion mutant ΔptpB and the wild type in growing cells, thereby focusing on the occurrence of phosphorylation on arginine residues. In order to enhance the reliability of identified phosphorylation sites at arginine residues, a subset of arginine phosphorylated peptides was chemically synthesized. Combined spectral libraries based on phosphoenriched samples, synthetic arginine phosphorylated peptides and classical proteome samples provide a sophisticated tool for the analysis of arginine phosphorylations. This way, 212 proteins phosphorylated on serine, threonine, tyrosine or arginine residues were identified within the mutant ΔptpB and 102 in wild type samples. Among them, 207 arginine phosphosites were identified exclusively within the mutant ΔptpB, widely distributed along the whole bacterial metabolism. This identification of putative targets of PtpB allows further investigation of the physiological relevance of arginine phosphorylations and provides the basis for reliable quantification of arginine phosphorylations in bacteria.
Collapse
Affiliation(s)
- Sabryna Junker
- From the ‡Institute for Microbiology, University of Greifswald, Germany
| | - Sandra Maaβ
- From the ‡Institute for Microbiology, University of Greifswald, Germany
| | - Andreas Otto
- From the ‡Institute for Microbiology, University of Greifswald, Germany
| | - Stephan Michalik
- From the ‡Institute for Microbiology, University of Greifswald, Germany
| | | | - Ulf Gerth
- From the ‡Institute for Microbiology, University of Greifswald, Germany
| | - Michael Hecker
- From the ‡Institute for Microbiology, University of Greifswald, Germany
| | - Dörte Becher
- From the ‡Institute for Microbiology, University of Greifswald, Germany
| |
Collapse
|
140
|
Junker S, Maaβ S, Otto A, Michalik S, Morgenroth F, Gerth U, Hecker M, Becher D. Spectral Library Based Analysis of Arginine Phosphorylations in Staphylococcus aureus. MOLECULAR & CELLULAR PROTEOMICS : MCP 2017. [PMID: 29183913 DOI: 10.1074/mcp.ra117.000378.] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
Reversible protein phosphorylation is one of the major mechanisms in the regulation of protein expression and protein activity, controlling physiological functions of the important human pathogen Staphylococcus aureus Phosphorylations at serine, threonine and tyrosine are known to influence for example protein activity in central metabolic pathways and the more energy-rich phosphorylations at histidine, aspartate or cysteine can be found as part of two component system sensor domains or mediating bacterial virulence. In addition to these well-known phosphorylations, the phosphorylation at arginine residues plays an essential role. Hence, the deletion mutant S. aureus COL ΔptpB (protein tyrosine phosphatase B) was studied because the protein PtpB is assumed to be an arginine phosphatase. A gel-free approach was applied to analyze the changes in the phosphoproteome of the deletion mutant ΔptpB and the wild type in growing cells, thereby focusing on the occurrence of phosphorylation on arginine residues. In order to enhance the reliability of identified phosphorylation sites at arginine residues, a subset of arginine phosphorylated peptides was chemically synthesized. Combined spectral libraries based on phosphoenriched samples, synthetic arginine phosphorylated peptides and classical proteome samples provide a sophisticated tool for the analysis of arginine phosphorylations. This way, 212 proteins phosphorylated on serine, threonine, tyrosine or arginine residues were identified within the mutant ΔptpB and 102 in wild type samples. Among them, 207 arginine phosphosites were identified exclusively within the mutant ΔptpB, widely distributed along the whole bacterial metabolism. This identification of putative targets of PtpB allows further investigation of the physiological relevance of arginine phosphorylations and provides the basis for reliable quantification of arginine phosphorylations in bacteria.
Collapse
Affiliation(s)
- Sabryna Junker
- From the ‡Institute for Microbiology, University of Greifswald, Germany
| | - Sandra Maaβ
- From the ‡Institute for Microbiology, University of Greifswald, Germany
| | - Andreas Otto
- From the ‡Institute for Microbiology, University of Greifswald, Germany
| | - Stephan Michalik
- From the ‡Institute for Microbiology, University of Greifswald, Germany
| | | | - Ulf Gerth
- From the ‡Institute for Microbiology, University of Greifswald, Germany
| | - Michael Hecker
- From the ‡Institute for Microbiology, University of Greifswald, Germany
| | - Dörte Becher
- From the ‡Institute for Microbiology, University of Greifswald, Germany
| |
Collapse
|
141
|
Fuchs S, Mehlan H, Bernhardt J, Hennig A, Michalik S, Surmann K, Pané-Farré J, Giese A, Weiss S, Backert L, Herbig A, Nieselt K, Hecker M, Völker U, Mäder U. AureoWiki ̵ The repository of the Staphylococcus aureus research and annotation community. Int J Med Microbiol 2017; 308:558-568. [PMID: 29198880 DOI: 10.1016/j.ijmm.2017.11.011] [Citation(s) in RCA: 66] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2017] [Revised: 11/20/2017] [Accepted: 11/24/2017] [Indexed: 11/28/2022] Open
Abstract
In light of continuously accumulating data and knowledge on major human pathogens, comprehensive and up-to-date sources of easily accessible information are urgently required. The AureoWiki database (http://aureowiki.med.uni-greifswald.de) provides detailed information on the genes and proteins of clinically and experimentally relevant S. aureus strains, currently covering NCTC 8325, COL, Newman, USA300_FPR3757, and N315. By implementing a pan-genome approach, AureoWiki facilitates the transfer of knowledge gained in studies with different S. aureus strains, thus supporting functional annotation and better understanding of this organism. All data related to a given gene or gene product is compiled on a strain-specific gene page. The gene pages contain sequence-based information complemented by data on, for example, protein function and localization, transcriptional regulation, and gene expression. The information provided is connected via links to other databases and published literature. Importantly, orthologous genes of the individual strains, which are linked by a pan-genome gene identifier and a unified gene name, are presented side by side using strain-specific tabs. The respective pan-genome gene page contains an orthologue table for 32 S. aureus strains, a multiple-strain genome viewer, a protein sequence alignment as well as other comparative information. The data collected in AureoWiki is also accessible through various download options in order to support bioinformatics applications. In addition, based on two large-scale gene expression data sets, AureoWiki provides graphical representations of condition-dependent mRNA levels and protein profiles under various laboratory and infection-related conditions.
Collapse
Affiliation(s)
- Stephan Fuchs
- FG13 Nosocomial Pathogens and Antibiotic Resistance, Robert Koch Institute, Wernigerode, Germany; Institute for Microbiology, Ernst-Moritz-Arndt-University Greifswald, Greifswald, Germany
| | - Henry Mehlan
- Interfaculty Institute for Genetics and Functional Genomics, University Medicine Greifswald, Greifswald, Germany
| | - Jörg Bernhardt
- Institute for Microbiology, Ernst-Moritz-Arndt-University Greifswald, Greifswald, Germany
| | - André Hennig
- Center for Bioinformatics Tübingen, University of Tübingen, Tübingen, Germany
| | - Stephan Michalik
- Interfaculty Institute for Genetics and Functional Genomics, University Medicine Greifswald, Greifswald, Germany
| | - Kristin Surmann
- Interfaculty Institute for Genetics and Functional Genomics, University Medicine Greifswald, Greifswald, Germany
| | - Jan Pané-Farré
- Institute for Microbiology, Ernst-Moritz-Arndt-University Greifswald, Greifswald, Germany
| | - Anne Giese
- Institute for Microbiology, Ernst-Moritz-Arndt-University Greifswald, Greifswald, Germany
| | - Stefan Weiss
- Interfaculty Institute for Genetics and Functional Genomics, University Medicine Greifswald, Greifswald, Germany
| | - Linus Backert
- Center for Bioinformatics Tübingen, University of Tübingen, Tübingen, Germany
| | - Alexander Herbig
- Center for Bioinformatics Tübingen, University of Tübingen, Tübingen, Germany
| | - Kay Nieselt
- Center for Bioinformatics Tübingen, University of Tübingen, Tübingen, Germany
| | - Michael Hecker
- Institute for Microbiology, Ernst-Moritz-Arndt-University Greifswald, Greifswald, Germany; ZIK FunGene, Ernst-Moritz-Arndt-University Greifswald and University Medicine Greifswald, Greifswald, Germany
| | - Uwe Völker
- Interfaculty Institute for Genetics and Functional Genomics, University Medicine Greifswald, Greifswald, Germany; ZIK FunGene, Ernst-Moritz-Arndt-University Greifswald and University Medicine Greifswald, Greifswald, Germany
| | - Ulrike Mäder
- Interfaculty Institute for Genetics and Functional Genomics, University Medicine Greifswald, Greifswald, Germany.
| |
Collapse
|
142
|
Systematic proteome and proteostasis profiling in human Trisomy 21 fibroblast cells. Nat Commun 2017; 8:1212. [PMID: 29089484 PMCID: PMC5663699 DOI: 10.1038/s41467-017-01422-6] [Citation(s) in RCA: 74] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2017] [Accepted: 09/15/2017] [Indexed: 12/17/2022] Open
Abstract
Down syndrome (DS) is mostly caused by a trisomy of the entire Chromosome 21 (Trisomy 21, T21). Here, we use SWATH mass spectrometry to quantify protein abundance and protein turnover in fibroblasts from a monozygotic twin pair discordant for T21, and to profile protein expression in 11 unrelated DS individuals and matched controls. The integration of the steady-state and turnover proteomic data indicates that protein-specific degradation of members of stoichiometric complexes is a major determinant of T21 gene dosage outcome, both within and between individuals. This effect is not apparent from genomic and transcriptomic data. The data also reveal that T21 results in extensive proteome remodeling, affecting proteins encoded by all chromosomes. Finally, we find broad, organelle-specific post-transcriptional effects such as significant downregulation of the mitochondrial proteome contributing to T21 hallmarks. Overall, we provide a valuable proteomic resource to understand the origin of DS phenotypic manifestations. Trisomy 21 (T21) is a major cause of Down syndrome but little is known about its impact on the cellular proteome. Here, the authors define the proteome of T21 fibroblasts and its turnover and also map proteomic differences in monozygotic T21-discordant twins, revealing extensive, organelle-specific changes caused by T21.
Collapse
|
143
|
Williams BJ, Ciavarini SJ, Devlin C, Cohn SM, Xie R, Vissers JPC, Martin LB, Caswell A, Langridge JI, Geromanos SJ. Multi-mode acquisition (MMA): An MS/MS acquisition strategy for maximizing selectivity, specificity and sensitivity of DIA product ion spectra. Proteomics 2017; 16:2284-301. [PMID: 27296928 DOI: 10.1002/pmic.201500492] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2015] [Revised: 05/16/2016] [Accepted: 06/10/2016] [Indexed: 01/08/2023]
Abstract
In proteomics studies, it is generally accepted that depth of coverage and dynamic range is limited in data-directed acquisitions. The serial nature of the method limits both sensitivity and the number of precursor ions that can be sampled. To that end, a number of data-independent acquisition (DIA) strategies have been introduced with these methods, for the most part, immune to the sampling issue; nevertheless, some do have other limitations with respect to sensitivity. The major limitation with DIA approaches is interference, i.e., MS/MS spectra are highly chimeric and often incapable of being identified using conventional database search engines. Utilizing each available dimension of separation prior to ion detection, we present a new multi-mode acquisition (MMA) strategy multiplexing both narrowband and wideband DIA acquisitions in a single analytical workflow. The iterative nature of the MMA workflow limits the adverse effects of interference with minimal loss in sensitivity. Qualitative identification can be performed by selected ion chromatograms or conventional database search strategies.
Collapse
Affiliation(s)
| | | | | | | | - Rong Xie
- Waters Corporation, Milford, MA, USA
| | | | | | | | | | | |
Collapse
|
144
|
Next-Generation Proteomics and Its Application to Clinical Breast Cancer Research. THE AMERICAN JOURNAL OF PATHOLOGY 2017; 187:2175-2184. [DOI: 10.1016/j.ajpath.2017.07.003] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/07/2017] [Revised: 07/05/2017] [Accepted: 07/06/2017] [Indexed: 12/17/2022]
|
145
|
Mrzic A, Lermyte F, Vu TN, Valkenborg D, Laukens K. InSourcerer: a high-throughput method to search for unknown metabolite modifications by mass spectrometry. RAPID COMMUNICATIONS IN MASS SPECTROMETRY : RCM 2017; 31:1396-1404. [PMID: 28569011 DOI: 10.1002/rcm.7910] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/19/2016] [Revised: 05/15/2017] [Accepted: 05/22/2017] [Indexed: 06/07/2023]
Abstract
RATIONALE Using mass spectrometry, the analysis of known metabolite structures has become feasible in a systematic high-throughput fashion. Nevertheless, the identification of previously unknown structures remains challenging, partially because many unidentified variants originate from known molecules that underwent unexpected modifications. Here, we present a method for the discovery of unknown metabolite modifications and conjugate metabolite isoforms in a high-throughput fashion. METHODS The method is based on user-controlled in-source fragmentation which is used to induce loss of weakly bound modifications. This is followed by the comparison of product ions from in-source fragmentation and collision-induced dissociation (CID). Diagonal MS2 -MS3 matching allows the detection of unknown metabolite modifications, as well as substructure similarities. As the method relies heavily on the advantages of in-source fragmentation and its ability to 'magically' elucidate unknown modification, we have named it inSourcerer as a portmanteau of in-source and sorcerer. RESULTS The method was evaluated using a set of 15 different cytokinin standards. Product ions from in-source fragmentation and CID were compared. Hierarchical clustering revealed that good matches are due to the presence of common substructures. Plant leaf extract, spiked with a mix of all 15 standards, was used to demonstrate the method's ability to detect these standards in a complex mixture, as well as confidently identify compounds already present in the plant material. CONCLUSIONS Here we present a method that incorporates a classic liquid chromatography/mass spectrometry (LC/MS) workflow with fragmentation models and computational algorithms. The assumptions upon which the concept of the method was built were shown to be valid and the method showed that in-source fragmentation can be used to pinpoint structural similarities and indicate the occurrence of a modification.
Collapse
Affiliation(s)
- Aida Mrzic
- Department of Mathematics and Computer Science, University of Antwerp, Antwerpen, Belgium
- Biomedical Informatics Research Network Antwerpen (biomina), University of Antwerp / Antwerp University Hospital, Antwerpen, Belgium
| | - Frederik Lermyte
- Applied Bio & Molecular Systems, Flemish Institute for Technological Research (VITO), Mol, Belgium
- UA-VITO Center for Proteomics, University of Antwerp, Antwerpen, Belgium
- Department of Chemistry, University of Antwerp, Antwerpen, Belgium
| | - Trung Nghia Vu
- Department of Mathematics and Computer Science, University of Antwerp, Antwerpen, Belgium
- Biomedical Informatics Research Network Antwerpen (biomina), University of Antwerp / Antwerp University Hospital, Antwerpen, Belgium
| | - Dirk Valkenborg
- Applied Bio & Molecular Systems, Flemish Institute for Technological Research (VITO), Mol, Belgium
- UA-VITO Center for Proteomics, University of Antwerp, Antwerpen, Belgium
- Interuniversity Institute for Biostatistics and Statistical Bioinformatics, Hasselt University, Hasselt, Belgium
| | - Kris Laukens
- Department of Mathematics and Computer Science, University of Antwerp, Antwerpen, Belgium
- Biomedical Informatics Research Network Antwerpen (biomina), University of Antwerp / Antwerp University Hospital, Antwerpen, Belgium
| |
Collapse
|
146
|
A global Staphylococcus aureus proteome resource applied to the in vivo characterization of host-pathogen interactions. Sci Rep 2017; 7:9718. [PMID: 28887440 PMCID: PMC5591248 DOI: 10.1038/s41598-017-10059-w] [Citation(s) in RCA: 35] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2017] [Accepted: 07/24/2017] [Indexed: 12/16/2022] Open
Abstract
Data-independent acquisition mass spectrometry promises higher performance in terms of quantification and reproducibility compared to data-dependent acquisition mass spectrometry methods. To enable high-accuracy quantification of Staphylococcus aureus proteins, we have developed a global ion library for data-independent acquisition approaches employing high-resolution time of flight or Orbitrap instruments for this human pathogen. We applied this ion library resource to investigate the time-resolved adaptation of S. aureus to the intracellular niche in human bronchial epithelial cells and in a murine pneumonia model. In epithelial cells, abundance changes for more than 400 S. aureus proteins were quantified, revealing, e.g., the precise temporal regulation of the SigB-dependent stress response and differential regulation of translation, fermentation, and amino acid biosynthesis. Using an in vivo murine pneumonia model, our data-independent acquisition quantification analysis revealed for the first time the in vivo proteome adaptation of S. aureus. From approximately 2.15 × 105 S. aureus cells, 578 proteins were identified. Increased abundance of proteins required for oxidative stress response, amino acid biosynthesis, and fermentation together with decreased abundance of ribosomal proteins and nucleotide reductase NrdEF was observed in post-infection samples compared to the pre-infection state.
Collapse
|
147
|
Shao W, Lam H. Tandem mass spectral libraries of peptides and their roles in proteomics research. MASS SPECTROMETRY REVIEWS 2017; 36:634-648. [PMID: 27403644 DOI: 10.1002/mas.21512] [Citation(s) in RCA: 45] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/30/2015] [Accepted: 05/21/2016] [Indexed: 05/15/2023]
Abstract
Proteomics is a rapidly maturing field aimed at the high-throughput identification and quantification of all proteins in a biological system. The cornerstone of proteomic technology is tandem mass spectrometry of peptides resulting from the digestion of protein mixtures. The fragmentation pattern of each peptide ion is captured in its tandem mass spectrum, which enables its identification and acts as a fingerprint for the peptide. Spectral libraries are simply searchable collections of these fingerprints, which have taken on an increasingly prominent role in proteomic data analysis. This review describes the historical development of spectral libraries in proteomics, details the computational procedures behind library building and searching, surveys the current applications of spectral libraries, and discusses the outstanding challenges. © 2016 Wiley Periodicals, Inc. Mass Spec Rev 36:634-648, 2017.
Collapse
Affiliation(s)
- Wenguang Shao
- Department of Biology, Institute of Molecular Systems Biology, Eidgenössische Technische Hochschule (ETH) Zurich, Zurich, Switzerland
- Division of Biomedical Engineering, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong
| | - Henry Lam
- Division of Biomedical Engineering, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong
- Department of Chemical and Biomolecular Engineering, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong
| |
Collapse
|
148
|
Yu F, Li N, Yu W. Exhaustively Identifying Cross-Linked Peptides with a Linear Computational Complexity. J Proteome Res 2017; 16:3942-3952. [PMID: 28825304 DOI: 10.1021/acs.jproteome.7b00338] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
Chemical cross-linking coupled to mass spectrometry is a powerful tool to study protein-protein interactions and protein conformations. Two linked peptides are ionized and fragmented to produce a tandem mass spectrum. In such an experiment, a tandem mass spectrum contains ions from two peptides. The peptide identification problem becomes a peptide-peptide pair identification problem. Currently, most tools do not search all possible pairs due to the quadratic time complexity. Consequently, missed findings are unavoidable. In our previous work, we developed a tool named ECL to search all pairs of peptides exhaustively. Unfortunately, it is very slow due to the quadratic computational complexity, especially when the database is large. Furthermore, ECL uses a score function without statistical calibration, while researchers1-3 have proposed that it is inappropriate to directly compare uncalibrated scores because different spectra have different random score distributions. Here we propose an advanced version of ECL, named ECL2. It achieves a linear time and space complexity by taking advantage of the additive property of a score function. It can search a data set containing tens of thousands of spectra against a database containing thousands of proteins in a few hours. Comparison with other five state-of-the-art tools shows that ECL2 is much faster than pLink, StavroX, ProteinProspector, and ECL. Kojak is the only one that is faster than ECL2, but Kojak does not exhaustively search all possible peptide pairs. The comparison shows that ECL2 has the highest sensitivity among the state-of-the-art tools. The experiment using a large-scale in vivo cross-linking data set demonstrates that ECL2 is the only tool that can find the peptide-spectrum matches (PSMs) passing the false discovery rate/q-value threshold. The result illustrates that the exhaustive search and a well-calibrated score function are useful to find PSMs from a huge search space.
Collapse
Affiliation(s)
- Fengchao Yu
- Department of Electronic and Computer Engineering, The Hong Kong University of Science and Technology , Hong Kong, China
| | - Ning Li
- Division of Life Science, The Hong Kong University of Science and Technology , Hong Kong, China.,Division of Biomedical Engineering, The Hong Kong University of Science and Technology , Hong Kong, China
| | - Weichuan Yu
- Department of Electronic and Computer Engineering, The Hong Kong University of Science and Technology , Hong Kong, China.,Division of Biomedical Engineering, The Hong Kong University of Science and Technology , Hong Kong, China
| |
Collapse
|
149
|
Blank-Landeshammer B, Kollipara L, Biß K, Pfenninger M, Malchow S, Shuvaev K, Zahedi RP, Sickmann A. Combining De Novo Peptide Sequencing Algorithms, A Synergistic Approach to Boost Both Identifications and Confidence in Bottom-up Proteomics. J Proteome Res 2017; 16:3209-3218. [DOI: 10.1021/acs.jproteome.7b00198] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Affiliation(s)
| | - Laxmikanth Kollipara
- Leibniz-Institut für Analytische Wissenschaften − ISAS − e.V., 44139 Dortmund, Germany
| | - Karsten Biß
- Leibniz-Institut für Analytische Wissenschaften − ISAS − e.V., 44139 Dortmund, Germany
| | - Markus Pfenninger
- Biodiversity
and Climate Research Centre, Senckenberg Gesellschaft für Naturforschung, 60325 Frankfurt am Main, Germany
- Faculty
of Biological Science, Institute for Ecology, Evolution and Diversity,
Department of Molecular Ecology, Goethe University, Max-von-Laue-Straße
9, 60438 Frankfurt
am Main, Germany
| | - Sebastian Malchow
- Leibniz-Institut für Analytische Wissenschaften − ISAS − e.V., 44139 Dortmund, Germany
| | - Konstantin Shuvaev
- Leibniz-Institut für Analytische Wissenschaften − ISAS − e.V., 44139 Dortmund, Germany
| | - René P. Zahedi
- Leibniz-Institut für Analytische Wissenschaften − ISAS − e.V., 44139 Dortmund, Germany
| | - Albert Sickmann
- Leibniz-Institut für Analytische Wissenschaften − ISAS − e.V., 44139 Dortmund, Germany
- Medizinische
Fakultät, Medizinische Proteom-Center (MPC), Ruhr-Universität Bochum, 44801 Bochum, Germany
- Department
of Chemistry, College of Physical Sciences, University of Aberdeen, Aberdeen AB24 3FX, Scotland, United Kingdom
| |
Collapse
|
150
|
Bioinformatics Resources for Interpreting Proteomics Mass Spectrometry Data. Methods Mol Biol 2017. [PMID: 28809010 DOI: 10.1007/978-1-4939-7201-2_19] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2025]
Abstract
Developments in mass spectrometry (MS) instrumentation have supported the advance of a variety of proteomic technologies that have enabled scientists to assess differences between healthy and diseased states. In particular, the ability to identify altered biological processes in a cell has led to the identification of novel drug targets, the development of more effective therapeutic drugs, and the growth of new diagnostic approaches and tools for personalized medicine applications. Nevertheless, large-scale proteomic data generated by modern mass spectrometers are extremely complex and necessitate equally complex bioinformatics tools and computational algorithms for their interpretation. A vast number of commercial and public resources have been developed for this purpose, often leaving the researcher perplexed at the overwhelming list of choices that exist. To address this challenge, the aim of this chapter is to provide a roadmap to the basic steps that are involved in mass spectrometry data acquisition and processing, and to describe the most common tools that are available for placing the results in biological context.
Collapse
|