1
|
Yan B, Zeng L, Lu Y, Li M, Lu W, Zhou B, He Q. Rapid bacterial identification through volatile organic compound analysis and deep learning. BMC Bioinformatics 2024; 25:347. [PMID: 39506632 DOI: 10.1186/s12859-024-05967-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2024] [Accepted: 10/22/2024] [Indexed: 11/08/2024] Open
Abstract
BACKGROUND The increasing antimicrobial resistance caused by the improper use of antibiotics poses a significant challenge to humanity. Rapid and accurate identification of microbial species in clinical settings is crucial for precise medication and reducing the development of antimicrobial resistance. This study aimed to explore a method for automatic identification of bacteria using Volatile Organic Compounds (VOCs) analysis and deep learning algorithms. RESULTS AlexNet, where augmentation is applied, produces the best results. The average accuracy rate for single bacterial culture classification reached 99.24% using cross-validation, and the accuracy rates for identifying the three bacteria in randomly mixed cultures were SA:98.6%, EC:98.58% and PA:98.99%, respectively. CONCLUSION This work provides a new approach to quickly identify bacterial microorganisms. Using this method can automatically identify bacteria in GC-IMS detection results, helping clinical doctors quickly detect bacterial species, accurately prescribe medication, thereby controlling epidemics, and minimizing the negative impact of bacterial resistance on society.
Collapse
Affiliation(s)
- Bowen Yan
- Research Department, Daping Hosipital, Army Medical University, Chongqing, 400042, China
| | - Lin Zeng
- Research Department, Daping Hosipital, Army Medical University, Chongqing, 400042, China
| | - Yanyi Lu
- Research Department, Daping Hosipital, Army Medical University, Chongqing, 400042, China
| | - Min Li
- Laboratory Department, Daping Hosipital, Army Medical University, Chongqing, 400042, China
| | - Weiping Lu
- Laboratory Department, Daping Hosipital, Army Medical University, Chongqing, 400042, China
| | - Bangfu Zhou
- Research Department, Daping Hosipital, Army Medical University, Chongqing, 400042, China
| | - Qinghua He
- Research Department, Daping Hosipital, Army Medical University, Chongqing, 400042, China.
| |
Collapse
|
2
|
Chudy SFJ, Phanzu DM, Kolk AHJ, Sopoh GE, Barogui YT, Tzfadia O, Eddyani M, Fissette K, de Jong BC, Brinkman P. Volatile organic compound detection of Buruli ulcer disease: Headspace analysis of Mycobacterium ulcerans and used gauzes of Buruli-compatible ulcers. PLoS Negl Trop Dis 2024; 18:e0012514. [PMID: 39312571 PMCID: PMC11449299 DOI: 10.1371/journal.pntd.0012514] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2024] [Revised: 10/03/2024] [Accepted: 09/05/2024] [Indexed: 09/25/2024] Open
Abstract
Diagnosing Buruli ulcer (BU) is complicated by limited access to the sensitive IS2404 qPCR. Experienced clinicians report a distinct odour of Buruli ulcers. We explored the potential of headspace analysis by thermal desorption-gas chromatography-mass spectrometry to detect volatile organic compounds (VOCs) from Mycobacterium ulcerans both in vitro and clinically. This study was conducted in two phases: a discovery and validation phase. During the discovery phase, VOCs that enable identification of M. ulcerans cultures were determined. During the validation phase, these VOCs were evaluated in clinical samples for which we used gauzes from patients with skin ulcerations in the Democratic Republic of Congo. Seven M. ulcerans headspace samples were compared with four from sterile growth medium and laboratory environmental air. The univariate analysis resulted in the selection of 24 retained VOC fragments and a perfect differentiation between cultures and controls. Sixteen of 24 fragments were identified, resulting in eleven unique compounds, mainly alkanes. Methylcyclohexane was the best performing compound. Based on these 24 fragments, headspace samples originating from gauzes of 50 open skin lesions (12 qPCR positive and 38 negative) were analysed and an AUC of 0.740 (95%-CI 0.583-0.897) was obtained. As this is an experimental study, future research has to confirm whether the identified compounds can serve as novel biomarkers.
Collapse
Affiliation(s)
- Stan F J Chudy
- Department of Respiratory Medicine, Academic Medical Centre, Amsterdam, The Netherlands
| | - Delphin M Phanzu
- Institut Medical Evangélique de Kimpese (IME), Kimpese, Democratic Republic of Congo
- Centre de Recherche en Santé de Kimpese (CRSK), Kimpese, Democratic Republic of Congo
| | - Arend H J Kolk
- Department of Respiratory Medicine, Academic Medical Centre, Amsterdam, The Netherlands
| | - Ghislain E Sopoh
- Centre De Dépistage et de Traitement de l'Ulcère de Buruli (CDTUB), Allada, Benin
| | | | - Oren Tzfadia
- Mycobacteriology Unit, Institute of Tropical Medicine, Antwerp, Belgium
| | | | - Krista Fissette
- Mycobacteriology Unit, Institute of Tropical Medicine, Antwerp, Belgium
| | - Bouke C de Jong
- Mycobacteriology Unit, Institute of Tropical Medicine, Antwerp, Belgium
| | - Paul Brinkman
- Department of Respiratory Medicine, Academic Medical Centre, Amsterdam, The Netherlands
| |
Collapse
|
3
|
Żuchowska K, Filipiak W. Modern approaches for detection of volatile organic compounds in metabolic studies focusing on pathogenic bacteria: Current state of the art. J Pharm Anal 2024; 14:100898. [PMID: 38634063 PMCID: PMC11022102 DOI: 10.1016/j.jpha.2023.11.005] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Revised: 10/03/2023] [Accepted: 11/15/2023] [Indexed: 04/19/2024] Open
Abstract
Pathogenic microorganisms produce numerous metabolites, including volatile organic compounds (VOCs). Monitoring these metabolites in biological matrices (e.g., urine, blood, or breath) can reveal the presence of specific microorganisms, enabling the early diagnosis of infections and the timely implementation of targeted therapy. However, complex matrices only contain trace levels of VOCs, and their constituent components can hinder determination of these compounds. Therefore, modern analytical techniques enabling the non-invasive identification and precise quantification of microbial VOCs are needed. In this paper, we discuss bacterial VOC analysis under in vitro conditions, in animal models and disease diagnosis in humans, including techniques for offline and online analysis in clinical settings. We also consider the advantages and limitations of novel microextraction techniques used to prepare biological samples for VOC analysis, in addition to reviewing current clinical studies on bacterial volatilomes that address inter-species interactions, the kinetics of VOC metabolism, and species- and drug-resistance specificity.
Collapse
Affiliation(s)
- Karolina Żuchowska
- Department of Pharmacodynamics and Molecular Pharmacology, Faculty of Pharmacy, Collegium Medicum in Bydgoszcz, Nicolaus Copernicus University in Torun, 85-089 Bydgoszcz, Poland
| | - Wojciech Filipiak
- Department of Pharmacodynamics and Molecular Pharmacology, Faculty of Pharmacy, Collegium Medicum in Bydgoszcz, Nicolaus Copernicus University in Torun, 85-089 Bydgoszcz, Poland
| |
Collapse
|
4
|
Gaida M, Stefanuto PH, Focant JF. Theoretical modeling and machine learning-based data processing workflows in comprehensive two-dimensional gas chromatography-A review. J Chromatogr A 2023; 1711:464467. [PMID: 37871505 DOI: 10.1016/j.chroma.2023.464467] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2023] [Revised: 10/15/2023] [Accepted: 10/17/2023] [Indexed: 10/25/2023]
Abstract
In recent years, comprehensive two-dimensional gas chromatography (GC × GC) has been gradually gaining prominence as a preferred method for the analysis of complex samples due to its higher peak capacity and resolution power compared to conventional gas chromatography (GC). Nonetheless, to fully benefit from the capabilities of GC × GC, a holistic approach to method development and data processing is essential for a successful and informative analysis. Method development enables the fine-tuning of the chromatographic separation, resulting in high-quality data. While generating such data is pivotal, it does not necessarily guarantee that meaningful information will be extracted from it. To this end, the first part of this manuscript reviews the importance of theoretical modeling in achieving good optimization of the separation conditions, ultimately improving the quality of the chromatographic separation. Multiple theoretical modeling approaches are discussed, with a special focus on thermodynamic-based modeling. The second part of this review highlights the importance of establishing robust data processing workflows, with a special emphasis on the use of advanced data processing tools such as, Machine Learning (ML) algorithms. Three widely used ML algorithms are discussed: Random Forest (RF), Support Vector Machine (SVM), and Partial Least Square-Discriminate Analysis (PLS-DA), highlighting their role in discovery-based analysis.
Collapse
Affiliation(s)
- Meriem Gaida
- Organic and Biological Analytical Chemistry Group (OBiAChem), MolSys Research Unit, Liège University, Belgium
| | - Pierre-Hugues Stefanuto
- Organic and Biological Analytical Chemistry Group (OBiAChem), MolSys Research Unit, Liège University, Belgium
| | - Jean-François Focant
- Organic and Biological Analytical Chemistry Group (OBiAChem), MolSys Research Unit, Liège University, Belgium
| |
Collapse
|
5
|
Gaida M, Cain CN, Synovec RE, Focant JF, Stefanuto PH. Tile-Based Random Forest Analysis for Analyte Discovery in Balanced and Unbalanced GC × GC-TOFMS Data Sets. Anal Chem 2023; 95:13519-13527. [PMID: 37647642 DOI: 10.1021/acs.analchem.3c01872] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/01/2023]
Abstract
In this study, we introduce a new nontargeted tile-based supervised analysis method that combines the four-grid tiling scheme previously established for the Fisher ratio (F-ratio) analysis (FRA) with the estimation of tile hit importance using the machine learning (ML) algorithm Random Forest (RF). This approach is termed tile-based RF analysis. As opposed to the standard tile-based F-ratio analysis, the RF approach can be extended to the analysis of unbalanced data sets, i.e., different numbers of samples per class. Tile-based RF computes out-of-bag (oob) tile hit importance estimates for every summed chromatographic signal within each tile on a per-mass channel basis (m/z). These estimates are then used to rank tile hits in a descending order of importance. In the present investigation, the RF approach was applied for a two-class comparison of stool samples collected from omnivore (O) subjects and stored using two different storage conditions: liquid (Liq) and lyophilized (Lyo). Two final hit lists were generated using balanced (8 vs Eight comparison) and unbalanced (8 vs Nine comparison) data sets and compared to the hit list generated by the standard F-ratio analysis. Similar class-distinguishing analytes (p < 0.01) were discovered by both methods. However, while the FRA discovered a more comprehensive hit list (65 hits), the RF approach strictly discovered hits (31 hits for the balanced data set comparison and 29 hits for the unbalanced data set comparison) with concentration ratios, [OLiq]/[OLyo], greater than 2 (or less than 0.5). This difference is attributed to the more stringent feature selection process used by the RF algorithm. Moreover, our findings suggest that the RF approach is a promising method for identifying class-distinguishing analytes in settings characterized by both high between-class variance and high within-class variance, making it an advantageous method in the study of complex biological matrices.
Collapse
Affiliation(s)
- Meriem Gaida
- Organic and Biological Analytical Chemistry Group, Molecular Systems Research Unit, University of Liège, 4000 Liège, Belgium
| | - Caitlin N Cain
- Department of Chemistry, University of Washington, Seattle, Washington 98195-1700, United States
| | - Robert E Synovec
- Department of Chemistry, University of Washington, Seattle, Washington 98195-1700, United States
| | - Jean-François Focant
- Organic and Biological Analytical Chemistry Group, Molecular Systems Research Unit, University of Liège, 4000 Liège, Belgium
| | - Pierre-Hugues Stefanuto
- Organic and Biological Analytical Chemistry Group, Molecular Systems Research Unit, University of Liège, 4000 Liège, Belgium
| |
Collapse
|
6
|
Boness HVM, de Sá HC, Dos Santos EKP, Canuto GAB. Sample Preparation in Microbial Metabolomics: Advances and Challenges. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2023; 1439:149-183. [PMID: 37843809 DOI: 10.1007/978-3-031-41741-2_7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/17/2023]
Abstract
Microbial metabolomics has gained significant interest as it reflects the physiological state of microorganisms. Due to the great variability of biological organisms, in terms of physicochemical characteristics and variable range of concentration of metabolites, the choice of sample preparation methods is a crucial step in the metabolomics workflow and will reflect on the quality and reliability of the results generated. The procedures applied to the preparation of microbial samples will vary according to the type of microorganism studied, the metabolomics approach (untargeted or targeted), and the analytical platform of choice. This chapter aims to provide an overview of the sample preparation workflow for microbial metabolomics, highlighting the pre-analytical factors associated with cultivation, harvesting, metabolic quenching, and extraction. Discussions focus on obtaining intracellular and extracellular metabolites. Finally, we introduced advanced sample preparation methods based on automated systems.
Collapse
Affiliation(s)
- Heiter V M Boness
- Department of Analytical Chemistry, Institute of Chemistry, Federal University of Bahia, Salvador, BA, Brazil
| | - Hanna C de Sá
- Department of Analytical Chemistry, Institute of Chemistry, Federal University of Bahia, Salvador, BA, Brazil
| | - Emile K P Dos Santos
- Department of Analytical Chemistry, Institute of Chemistry, Federal University of Bahia, Salvador, BA, Brazil
| | - Gisele A B Canuto
- Department of Analytical Chemistry, Institute of Chemistry, Federal University of Bahia, Salvador, BA, Brazil.
| |
Collapse
|