1
|
Lozano C, Pible O, Eschlimann M, Giraud M, Debroas S, Gaillard JC, Bellanger L, Taysse L, Armengaud J. Universal Identification of Pathogenic Viruses by Liquid Chromatography Coupled with Tandem Mass Spectrometry Proteotyping. Mol Cell Proteomics 2024; 23:100822. [PMID: 39084562 DOI: 10.1016/j.mcpro.2024.100822] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2024] [Revised: 07/24/2024] [Accepted: 07/28/2024] [Indexed: 08/02/2024] Open
Abstract
Accurate and rapid identification of viruses is crucial for an effective medical diagnosis when dealing with infections. Conventional methods, including DNA amplification techniques or lateral-flow assays, are constrained to a specific set of targets to search for. In this study, we introduce a novel tandem mass spectrometry proteotyping-based method that offers a universal approach for the identification of pathogenic viruses and other components, eliminating the need for a priori knowledge of the sample composition. Our protocol relies on a time and cost-efficient peptide sample preparation, followed by an analysis with liquid chromatography coupled to high-resolution tandem mass spectrometry. As a proof of concept, we first assessed our method on publicly available shotgun proteomics datasets obtained from virus preparations and fecal samples of infected individuals. Successful virus identification was achieved with 53 public datasets, spanning 23 distinct viral species. Furthermore, we illustrated the method's capability to discriminate closely related viruses within the same sample, using alphaviruses as an example. The clinical applicability of our method was demonstrated by the accurate detection of the vaccinia virus in spiked saliva, a matrix of paramount clinical significance due to its non-invasive and easily obtainable nature. This innovative approach represents a significant advancement in pathogen detection and paves the way for enhanced diagnostic capabilities.
Collapse
Affiliation(s)
- Clément Lozano
- Département Médicaments et Technologies pour la Santé (DMTS), Université Paris-Saclay, CEA, INRAE, SPI, Bagnols-sur-Cèze, France.
| | - Olivier Pible
- Département Médicaments et Technologies pour la Santé (DMTS), Université Paris-Saclay, CEA, INRAE, SPI, Bagnols-sur-Cèze, France
| | - Marine Eschlimann
- Direction Générale de l'Armement Maîtrise NRBC, Vert-le-Petit, France
| | - Mathieu Giraud
- Direction Générale de l'Armement Maîtrise NRBC, Vert-le-Petit, France
| | - Stéphanie Debroas
- Département Médicaments et Technologies pour la Santé (DMTS), Université Paris-Saclay, CEA, INRAE, SPI, Bagnols-sur-Cèze, France
| | - Jean-Charles Gaillard
- Département Médicaments et Technologies pour la Santé (DMTS), Université Paris-Saclay, CEA, INRAE, SPI, Bagnols-sur-Cèze, France
| | - Laurent Bellanger
- Département Médicaments et Technologies pour la Santé (DMTS), Université Paris-Saclay, CEA, INRAE, SPI, Bagnols-sur-Cèze, France
| | - Laurent Taysse
- Direction Générale de l'Armement Maîtrise NRBC, Vert-le-Petit, France
| | - Jean Armengaud
- Département Médicaments et Technologies pour la Santé (DMTS), Université Paris-Saclay, CEA, INRAE, SPI, Bagnols-sur-Cèze, France.
| |
Collapse
|
2
|
Alves G, Ogurtsov AY, Porterfield H, Maity T, Jenkins LM, Sacks DB, Yu YK. Multiplexing the Identification of Microorganisms via Tandem Mass Tag Labeling Augmented by Interference Removal through a Novel Modification of the Expectation Maximization Algorithm. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2024; 35:1138-1155. [PMID: 38740383 PMCID: PMC11157548 DOI: 10.1021/jasms.3c00445] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/19/2023] [Revised: 04/12/2024] [Accepted: 04/17/2024] [Indexed: 05/16/2024]
Abstract
Having fast, accurate, and broad spectrum methods for the identification of microorganisms is of paramount importance to public health, research, and safety. Bottom-up mass spectrometer-based proteomics has emerged as an effective tool for the accurate identification of microorganisms from microbial isolates. However, one major hurdle that limits the deployment of this tool for routine clinical diagnosis, and other areas of research such as culturomics, is the instrument time required for the mass spectrometer to analyze a single sample, which can take ∼1 h per sample, when using mass spectrometers that are presently used in most institutes. To address this issue, in this study, we employed, for the first time, tandem mass tags (TMTs) in multiplex identifications of microorganisms from multiple TMT-labeled samples in one MS/MS experiment. A difficulty encountered when using TMT labeling is the presence of interference in the measured intensities of TMT reporter ions. To correct for interference, we employed in the proposed method a modified version of the expectation maximization (EM) algorithm that redistributes the signal from ion interference back to the correct TMT-labeled samples. We have evaluated the sensitivity and specificity of the proposed method using 94 MS/MS experiments (covering a broad range of protein concentration ratios across TMT-labeled channels and experimental parameters), containing a total of 1931 true positive TMT-labeled channels and 317 true negative TMT-labeled channels. The results of the evaluation show that the proposed method has an identification sensitivity of 93-97% and a specificity of 100% at the species level. Furthermore, as a proof of concept, using an in-house-generated data set composed of some of the most common urinary tract pathogens, we demonstrated that by using the proposed method the mass spectrometer time required per sample, using a 1 h LC-MS/MS run, can be reduced to 10 and 6 min when samples are labeled with TMT-6 and TMT-10, respectively. The proposed method can also be used along with Orbitrap mass spectrometers that have faster MS/MS acquisition rates, like the recently released Orbitrap Astral mass spectrometer, to further reduce the mass spectrometer time required per sample.
Collapse
Affiliation(s)
- Gelio Alves
- National
Center for Biotechnology Information, National Library of Medicine,
National Institutes of Health, Bethesda, Maryland 20894, United States
| | - Aleksey Y. Ogurtsov
- National
Center for Biotechnology Information, National Library of Medicine,
National Institutes of Health, Bethesda, Maryland 20894, United States
| | - Harry Porterfield
- Department
of Laboratory Medicine, Clinical Center, National Institutes of Health, Bethesda, Maryland 20892, United States
| | - Tapan Maity
- Laboratory
of Cell Biology, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, Maryland 20892, United States
| | - Lisa M. Jenkins
- Laboratory
of Cell Biology, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, Maryland 20892, United States
| | - David B. Sacks
- Department
of Laboratory Medicine, Clinical Center, National Institutes of Health, Bethesda, Maryland 20892, United States
| | - Yi-Kuo Yu
- National
Center for Biotechnology Information, National Library of Medicine,
National Institutes of Health, Bethesda, Maryland 20894, United States
| |
Collapse
|
3
|
Ogurtsov A, Alves G, Rubio A, Joyce B, Andersson B, Karlsson R, Moore ER, Yu YK. MiCId GUI: The Graphical User Interface for MiCId, a Fast Microorganism Classification and Identification Workflow with Accurate Statistics and High Recall. J Comput Biol 2024; 31:175-178. [PMID: 38301204 PMCID: PMC10874827 DOI: 10.1089/cmb.2023.0149] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2024] Open
Abstract
Although many user-friendly workflows exist for identifications of peptides and proteins in mass-spectrometry-based proteomics, there is a need of easy to use, fast, and accurate workflows for identifications of microorganisms, antimicrobial resistant proteins, and biomass estimation. Identification of microorganisms is a computationally demanding task that requires querying thousands of MS/MS spectra in a database containing thousands to tens of thousands of microorganisms. Existing software can't handle such a task in a time efficient manner, taking hours to process a single MS/MS experiment. Another paramount factor to consider is the necessity of accurate statistical significance to properly control the proportion of false discoveries among the identified microorganisms, and antimicrobial-resistant proteins, and to provide robust biomass estimation. Recently, we have developed Microorganism Classification and Identification (MiCId) workflow that assigns accurate statistical significance to identified microorganisms, antimicrobial-resistant proteins, and biomass estimation. MiCId's workflow is also computationally efficient, taking about 6-17 minutes to process a tandem mass-spectrometry (MS/MS) experiment using computer resources that are available in most laptop and desktop computers, making it a portable workflow. To make data analysis accessible to a broader range of users, beyond users familiar with the Linux environment, we have developed a graphical user interface (GUI) for MiCId's workflow. The GUI brings to users all the functionality of MiCId's workflow in a friendly interface along with tools for data analysis, visualization, and to export results.
Collapse
Affiliation(s)
- Aleksey Ogurtsov
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, USA
| | - Gelio Alves
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, USA
| | - Alex Rubio
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, USA
| | - Brendan Joyce
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, USA
| | - Björn Andersson
- Bioinformatics Core Facility, Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden
| | - Roger Karlsson
- Department of Infectious Diseases, Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden
- Nanoxis Consulting AB, Gothenburg, Sweden
| | - Edward R.B. Moore
- Department of Infectious Diseases, Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden
- Culture Collection University of Gothenburg, Sahlgrenska Academy, University of Gothenburg, Sweden
| | - Yi-Kuo Yu
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, USA
| |
Collapse
|
4
|
Mappa C, Alpha-Bazin B, Pible O, Armengaud J. Mix24X, a Lab-Assembled Reference to Evaluate Interpretation Procedures for Tandem Mass Spectrometry Proteotyping of Complex Samples. Int J Mol Sci 2023; 24:8634. [PMID: 37239979 PMCID: PMC10218423 DOI: 10.3390/ijms24108634] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2023] [Revised: 05/09/2023] [Accepted: 05/10/2023] [Indexed: 05/28/2023] Open
Abstract
Correct identification of the microorganisms present in a complex sample is a crucial issue. Proteotyping based on tandem mass spectrometry can help establish an inventory of organisms present in a sample. Evaluation of bioinformatics strategies and tools for mining the recorded datasets is essential to establish confidence in the results obtained and to improve these pipelines in terms of sensitivity and accuracy. Here, we propose several tandem mass spectrometry datasets recorded on an artificial reference consortium comprising 24 bacterial species. This assemblage of environmental and pathogenic bacteria covers 20 different genera and 5 bacterial phyla. The dataset comprises difficult cases, such as the Shigella flexneri species, which is closely related to Escherichia coli, and several highly sequenced clades. Different acquisition strategies simulate real-life scenarios: from rapid survey sampling to exhaustive analysis. We provide access to individual proteomes of each bacterium separately to provide a rational basis for evaluating the assignment strategy of MS/MS spectra when recorded from complex mixtures. This resource should provide an interesting common reference for developers who wish to compare their proteotyping tools and for those interested in evaluating protein assignment when dealing with complex samples, such as microbiomes.
Collapse
Affiliation(s)
- Charlotte Mappa
- Département Médicaments et Technologies pour la Santé (DMTS), Université Paris-Saclay, CEA, INRAE, SPI, 30200 Bagnols-sur-Cèze, France (O.P.)
- Laboratoire Innovations Technologiques Pour la Détection et le Diagnostic (Li2D), Université de Montpellier, 30207 Bagnols sur Cèze, France
| | - Béatrice Alpha-Bazin
- Département Médicaments et Technologies pour la Santé (DMTS), Université Paris-Saclay, CEA, INRAE, SPI, 30200 Bagnols-sur-Cèze, France (O.P.)
| | - Olivier Pible
- Département Médicaments et Technologies pour la Santé (DMTS), Université Paris-Saclay, CEA, INRAE, SPI, 30200 Bagnols-sur-Cèze, France (O.P.)
| | - Jean Armengaud
- Département Médicaments et Technologies pour la Santé (DMTS), Université Paris-Saclay, CEA, INRAE, SPI, 30200 Bagnols-sur-Cèze, France (O.P.)
| |
Collapse
|
5
|
Mappa C, Alpha-Bazin B, Pible O, Armengaud J. Evaluation of the Limit of Detection of Bacteria by Tandem Mass Spectrometry Proteotyping and Phylopeptidomics. Microorganisms 2023; 11:1170. [PMCID: PMC10223342 DOI: 10.3390/microorganisms11051170] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2023] [Revised: 04/20/2023] [Accepted: 04/22/2023] [Indexed: 06/01/2023] Open
Abstract
Shotgun proteomics has proven to be an attractive alternative for identifying a pathogen and characterizing the antimicrobial resistance genes it produces. Because of its performance, proteotyping of microorganisms by tandem mass spectrometry is expected to become an essential tool in modern healthcare. Proteotyping microorganisms that have been isolated from the environment by culturomics is also a cornerstone for the development of new biotechnological applications. Phylopeptidomics is a new strategy that estimates the phylogenetic distances between the organisms present in the sample and calculates the ratio of their shared peptides, thus improving the quantification of their contributions to the biomass. Here, we established the limit of detection of tandem mass spectrometry proteotyping based on MS/MS data recorded for several bacteria. The limit of detection for Salmonella bongori with our experimental set-up is 4 × 104 colony-forming units from a sample volume of 1 mL. This limit of detection is directly related to the amount of protein per cell and therefore depends on the shape and size of the microorganism. We have demonstrated that identification of bacteria by phylopeptidomics is independent of their growth stage and that the limit of detection of the method is not degraded in presence of additional bacteria in the same proportion.
Collapse
Affiliation(s)
- Charlotte Mappa
- Université Paris-Saclay, CEA, INRAE, Département Médicaments et Technologies pour la Santé (DMTS), SPI, 30200 Bagnols-sur-Cèze, France
- Laboratoire Innovations Technologiques pour la Détection et le Diagnostic (Li2D), Université de Montpellier, 30207 Bagnols-sur-Cèze, France
| | - Béatrice Alpha-Bazin
- Université Paris-Saclay, CEA, INRAE, Département Médicaments et Technologies pour la Santé (DMTS), SPI, 30200 Bagnols-sur-Cèze, France
| | - Olivier Pible
- Université Paris-Saclay, CEA, INRAE, Département Médicaments et Technologies pour la Santé (DMTS), SPI, 30200 Bagnols-sur-Cèze, France
| | - Jean Armengaud
- Université Paris-Saclay, CEA, INRAE, Département Médicaments et Technologies pour la Santé (DMTS), SPI, 30200 Bagnols-sur-Cèze, France
| |
Collapse
|