1
|
Rusic D, Kumric M, Seselja Perisin A, Leskur D, Bukic J, Modun D, Vilovic M, Vrdoljak J, Martinovic D, Grahovac M, Bozic J. Tackling the Antimicrobial Resistance "Pandemic" with Machine Learning Tools: A Summary of Available Evidence. Microorganisms 2024; 12:842. [PMID: 38792673 PMCID: PMC11123121 DOI: 10.3390/microorganisms12050842] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2024] [Revised: 04/16/2024] [Accepted: 04/19/2024] [Indexed: 05/26/2024] Open
Abstract
Antimicrobial resistance is recognised as one of the top threats healthcare is bound to face in the future. There have been various attempts to preserve the efficacy of existing antimicrobials, develop new and efficient antimicrobials, manage infections with multi-drug resistant strains, and improve patient outcomes, resulting in a growing mass of routinely available data, including electronic health records and microbiological information that can be employed to develop individualised antimicrobial stewardship. Machine learning methods have been developed to predict antimicrobial resistance from whole-genome sequencing data, forecast medication susceptibility, recognise epidemic patterns for surveillance purposes, or propose new antibacterial treatments and accelerate scientific discovery. Unfortunately, there is an evident gap between the number of machine learning applications in science and the effective implementation of these systems. This narrative review highlights some of the outstanding opportunities that machine learning offers when applied in research related to antimicrobial resistance. In the future, machine learning tools may prove to be superbugs' kryptonite. This review aims to provide an overview of available publications to aid researchers that are looking to expand their work with new approaches and to acquaint them with the current application of machine learning techniques in this field.
Collapse
Affiliation(s)
- Doris Rusic
- Department of Pharmacy, University of Split School of Medicine, Soltanska 2A, 21000 Split, Croatia; (D.R.); (A.S.P.); (D.L.); (J.B.); (D.M.)
| | - Marko Kumric
- Department of Pathophysiology, University of Split School of Medicine, Soltanska 2A, 21000 Split, Croatia; (M.K.); (M.V.); (J.V.); (D.M.)
- Laboratory for Cardiometabolic Research, University of Split School of Medicine, Soltanska 2A, 21000 Split, Croatia
| | - Ana Seselja Perisin
- Department of Pharmacy, University of Split School of Medicine, Soltanska 2A, 21000 Split, Croatia; (D.R.); (A.S.P.); (D.L.); (J.B.); (D.M.)
| | - Dario Leskur
- Department of Pharmacy, University of Split School of Medicine, Soltanska 2A, 21000 Split, Croatia; (D.R.); (A.S.P.); (D.L.); (J.B.); (D.M.)
| | - Josipa Bukic
- Department of Pharmacy, University of Split School of Medicine, Soltanska 2A, 21000 Split, Croatia; (D.R.); (A.S.P.); (D.L.); (J.B.); (D.M.)
| | - Darko Modun
- Department of Pharmacy, University of Split School of Medicine, Soltanska 2A, 21000 Split, Croatia; (D.R.); (A.S.P.); (D.L.); (J.B.); (D.M.)
| | - Marino Vilovic
- Department of Pathophysiology, University of Split School of Medicine, Soltanska 2A, 21000 Split, Croatia; (M.K.); (M.V.); (J.V.); (D.M.)
- Laboratory for Cardiometabolic Research, University of Split School of Medicine, Soltanska 2A, 21000 Split, Croatia
| | - Josip Vrdoljak
- Department of Pathophysiology, University of Split School of Medicine, Soltanska 2A, 21000 Split, Croatia; (M.K.); (M.V.); (J.V.); (D.M.)
- Laboratory for Cardiometabolic Research, University of Split School of Medicine, Soltanska 2A, 21000 Split, Croatia
| | - Dinko Martinovic
- Department of Pathophysiology, University of Split School of Medicine, Soltanska 2A, 21000 Split, Croatia; (M.K.); (M.V.); (J.V.); (D.M.)
- Department of Maxillofacial Surgery, University Hospital of Split, Spinciceva 1, 21000 Split, Croatia
| | - Marko Grahovac
- Department of Pharmacology, University of Split School of Medicine, Soltanska 2A, 21000 Split, Croatia;
| | - Josko Bozic
- Department of Pathophysiology, University of Split School of Medicine, Soltanska 2A, 21000 Split, Croatia; (M.K.); (M.V.); (J.V.); (D.M.)
- Laboratory for Cardiometabolic Research, University of Split School of Medicine, Soltanska 2A, 21000 Split, Croatia
| |
Collapse
|
2
|
Batisti Biffignandi G, Chindelevitch L, Corbella M, Feil EJ, Sassera D, Lees JA. Optimising machine learning prediction of minimum inhibitory concentrations in Klebsiella pneumoniae. Microb Genom 2024; 10:001222. [PMID: 38529944 PMCID: PMC10995625 DOI: 10.1099/mgen.0.001222] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2023] [Accepted: 03/07/2024] [Indexed: 03/27/2024] Open
Abstract
Minimum Inhibitory Concentrations (MICs) are the gold standard for quantitatively measuring antibiotic resistance. However, lab-based MIC determination can be time-consuming and suffers from low reproducibility, and interpretation as sensitive or resistant relies on guidelines which change over time. Genome sequencing and machine learning promise to allow in silico MIC prediction as an alternative approach which overcomes some of these difficulties, albeit the interpretation of MIC is still needed. Nevertheless, precisely how we should handle MIC data when dealing with predictive models remains unclear, since they are measured semi-quantitatively, with varying resolution, and are typically also left- and right-censored within varying ranges. We therefore investigated genome-based prediction of MICs in the pathogen Klebsiella pneumoniae using 4367 genomes with both simulated semi-quantitative traits and real MICs. As we were focused on clinical interpretation, we used interpretable rather than black-box machine learning models, namely, Elastic Net, Random Forests, and linear mixed models. Simulated traits were generated accounting for oligogenic, polygenic, and homoplastic genetic effects with different levels of heritability. Then we assessed how model prediction accuracy was affected when MICs were framed as regression and classification. Our results showed that treating the MICs differently depending on the number of concentration levels of antibiotic available was the most promising learning strategy. Specifically, to optimise both prediction accuracy and inference of the correct causal variants, we recommend considering the MICs as continuous and framing the learning problem as a regression when the number of observed antibiotic concentration levels is large, whereas with a smaller number of concentration levels they should be treated as a categorical variable and the learning problem should be framed as a classification. Our findings also underline how predictive models can be improved when prior biological knowledge is taken into account, due to the varying genetic architecture of each antibiotic resistance trait. Finally, we emphasise that incrementing the population database is pivotal for the future clinical implementation of these models to support routine machine-learning based diagnostics.
Collapse
Affiliation(s)
- Gherard Batisti Biffignandi
- Department of Biology and Biotechnology, University of Pavia, Pavia, Italy
- MRC Centre for Global Infectious Disease Analysis, Imperial College, London, England, UK
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
| | - Leonid Chindelevitch
- MRC Centre for Global Infectious Disease Analysis, Imperial College, London, England, UK
| | - Marta Corbella
- Microbiology and Virology Unit, Fondazione IRCCS Policlinico San Matteo, Pavia, Italy
| | - Edward J. Feil
- The Milner Centre for Evolution, Department of Life Sciences, University of Bath, Bath, UK
| | - Davide Sassera
- Department of Biology and Biotechnology, University of Pavia, Pavia, Italy
- Fondazione IRCCS Policlinico San Matteo, Pavia, Italy
| | - John A. Lees
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
| |
Collapse
|
3
|
Rahman MK, Williams RB, Ajulo S, Levent G, Loneragan GH, Awosile B. Predictive Modeling of Phenotypic Antimicrobial Susceptibility of Selected Beta-Lactam Antimicrobials from Beta-Lactamase Resistance Genes. Antibiotics (Basel) 2024; 13:224. [PMID: 38534659 DOI: 10.3390/antibiotics13030224] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2024] [Revised: 02/23/2024] [Accepted: 02/26/2024] [Indexed: 03/28/2024] Open
Abstract
The outcome of bacterial infection management relies on prompt diagnosis and effective treatment, but conventional antimicrobial susceptibility testing can be slow and labor-intensive. Therefore, this study aims to predict phenotypic antimicrobial susceptibility of selected beta-lactam antimicrobials in the bacteria of the family Enterobacteriaceae from different beta-lactamase resistance genotypes. Using human datasets extracted from the Antimicrobial Testing Leadership and Surveillance (ATLAS) program conducted by Pfizer and retail meat datasets from the National Antimicrobial Resistance Monitoring System for Enteric Bacteria (NARMS), we used a robust or weighted least square multivariable linear regression modeling framework to explore the relationship between antimicrobial susceptibility data of beta-lactam antimicrobials and different types of beta-lactamase resistance genes. In humans, in the presence of the blaCTX-M-1, blaCTX-M-2, blaCTX-M-8/25, and blaCTX-M-9 groups, MICs of cephalosporins significantly increased by values between 0.34-3.07 μg/mL, however, the MICs of carbapenem significantly decreased by values between 0.81-0.87 μg/mL. In the presence of carbapenemase genes (blaKPC, blaNDM, blaIMP, and blaVIM), the MICs of cephalosporin antimicrobials significantly increased by values between 1.06-5.77 μg/mL, while the MICs of carbapenem antimicrobials significantly increased by values between 5.39-67.38 μg/mL. In retail meat, MIC of ceftriaxone increased significantly in the presence of blaCMY-2, blaCTX-M-1, blaCTX-M-55, blaCTX-M-65, and blaSHV-2 by 55.16 μg/mL, 222.70 μg/mL, 250.81 μg/mL, 204.89 μg/mL, and 31.51 μg/mL respectively. MIC of cefoxitin increased significantly in the presence of blaCTX-M-65 and blaTEM-1 by 1.57 μg/mL and 1.04 μg/mL respectively. In the presence of blaCMY-2, MIC of cefoxitin increased by an average of 8.66 μg/mL over 17 years. Compared to E. coli isolates, MIC of cefoxitin in Salmonella enterica isolates decreased significantly by 0.67 μg/mL. On the other hand, MIC of ceftiofur increased in the presence of blaCTX-M-1, blaCTX-M-65, blaSHV-2, and blaTEM-1 by 8.82 μg/mL, 9.11 μg/mL, 8.18 μg/mL, and 1.04 μg/mL respectively. In the presence of blaCMY-2, MIC of ceftiofur increased by an average of 10.20 μg/mL over 14 years. The ability to predict antimicrobial susceptibility of beta-lactam antimicrobials directly from beta-lactamase resistance genes may help reduce the reliance on routine phenotypic testing with higher turnaround times in diagnostic, therapeutic, and surveillance of antimicrobial-resistant bacteria of the family Enterobacteriaceae.
Collapse
Affiliation(s)
- Md Kaisar Rahman
- School of Veterinary Medicine, Texas Tech University, Amarillo, TX 79106, USA
| | - Ryan B Williams
- School of Veterinary Medicine, Texas Tech University, Amarillo, TX 79106, USA
| | - Samuel Ajulo
- School of Veterinary Medicine, Texas Tech University, Amarillo, TX 79106, USA
| | - Gizem Levent
- School of Veterinary Medicine, Texas Tech University, Amarillo, TX 79106, USA
| | - Guy H Loneragan
- School of Veterinary Medicine, Texas Tech University, Amarillo, TX 79106, USA
| | - Babafela Awosile
- School of Veterinary Medicine, Texas Tech University, Amarillo, TX 79106, USA
| |
Collapse
|
4
|
Liu GY, Yu D, Fan MM, Zhang X, Jin ZY, Tang C, Liu XF. Antimicrobial resistance crisis: could artificial intelligence be the solution? Mil Med Res 2024; 11:7. [PMID: 38254241 PMCID: PMC10804841 DOI: 10.1186/s40779-024-00510-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/18/2023] [Accepted: 01/08/2024] [Indexed: 01/24/2024] Open
Abstract
Antimicrobial resistance is a global public health threat, and the World Health Organization (WHO) has announced a priority list of the most threatening pathogens against which novel antibiotics need to be developed. The discovery and introduction of novel antibiotics are time-consuming and expensive. According to WHO's report of antibacterial agents in clinical development, only 18 novel antibiotics have been approved since 2014. Therefore, novel antibiotics are critically needed. Artificial intelligence (AI) has been rapidly applied to drug development since its recent technical breakthrough and has dramatically improved the efficiency of the discovery of novel antibiotics. Here, we first summarized recently marketed novel antibiotics, and antibiotic candidates in clinical development. In addition, we systematically reviewed the involvement of AI in antibacterial drug development and utilization, including small molecules, antimicrobial peptides, phage therapy, essential oils, as well as resistance mechanism prediction, and antibiotic stewardship.
Collapse
Affiliation(s)
- Guang-Yu Liu
- Department of Immunology and Pathogen Biology, School of Basic Medical Sciences, Hangzhou Normal University, Key Laboratory of Aging and Cancer Biology of Zhejiang Province, Key Laboratory of Inflammation and Immunoregulation of Hangzhou, Hangzhou Normal University, Hangzhou, 311121, China
| | - Dan Yu
- National Key Discipline of Pediatrics Key Laboratory of Major Diseases in Children Ministry of Education, Laboratory of Dermatology, Beijing Pediatric Research Institute, Beijing Children's Hospital, Capital Medical University, National Center for Children's Health, Beijing, 100045, China
| | - Mei-Mei Fan
- Department of Immunology and Pathogen Biology, School of Basic Medical Sciences, Hangzhou Normal University, Key Laboratory of Aging and Cancer Biology of Zhejiang Province, Key Laboratory of Inflammation and Immunoregulation of Hangzhou, Hangzhou Normal University, Hangzhou, 311121, China
| | - Xu Zhang
- Robert and Arlene Kogod Center on Aging, Mayo Clinic, Rochester, MN, 55905, USA
- Department of Biochemistry and Molecular Biology, Mayo Clinic, Rochester, MN, 55905, USA
| | - Ze-Yu Jin
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Christoph Tang
- Sir William Dunn School of Pathology, University of Oxford, Oxford, OX1 3RE, UK.
| | - Xiao-Fen Liu
- Institute of Antibiotics, Huashan Hospital, Fudan University, Key Laboratory of Clinical Pharmacology of Antibiotics, National Health Commission of the People's Republic of China, National Clinical Research Centre for Aging and Medicine, Huashan Hospital, Fudan University, Shanghai, 200040, China.
| |
Collapse
|
5
|
Gao Y, Li H, Zhao C, Li S, Yin G, Wang H. Machine learning and feature extraction for rapid antimicrobial resistance prediction of Acinetobacter baumannii from whole-genome sequencing data. Front Microbiol 2024; 14:1320312. [PMID: 38274740 PMCID: PMC10808480 DOI: 10.3389/fmicb.2023.1320312] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2023] [Accepted: 12/22/2023] [Indexed: 01/27/2024] Open
Abstract
Background Whole-genome sequencing (WGS) has contributed significantly to advancements in machine learning methods for predicting antimicrobial resistance (AMR). However, the comparisons of different methods for AMR prediction without requiring prior knowledge of resistance remains to be conducted. Methods We aimed to predict the minimum inhibitory concentrations (MICs) of 13 antimicrobial agents against Acinetobacter baumannii using three machine learning algorithms (random forest, support vector machine, and XGBoost) combined with k-mer features extracted from WGS data. Results A cohort of 339 isolates was used for model construction. The average essential agreement and category agreement of the best models exceeded 90.90% (95%CI, 89.03-92.77%) and 95.29% (95%CI, 94.91-95.67%), respectively; the exceptions being levofloxacin, minocycline and imipenem. The very major error rates ranged from 0.0 to 5.71%. We applied feature selection pipelines to extract the top-ranked 11-mers to optimise training time and computing resources. This approach slightly improved the prediction performance and enabled us to obtain prediction results within 10 min. Notably, when employing these top-ranked 11-mers in an independent test dataset (120 isolates), we achieved an average accuracy of 0.96. Conclusion Our study is the first to demonstrate that AMR prediction for A. baumannii using machine learning methods based on k-mer features has competitive performance over traditional workflows; hence, sequence-based AMR prediction and its application could be further promoted. The k-mer-based workflow developed in this study demonstrated high recall/sensitivity and specificity, making it a dependable tool for MIC prediction in clinical settings.
Collapse
Affiliation(s)
- Yue Gao
- Institute of Medical Technology, Peking University Health Science Center, Beijing, China
- Department of Clinical Laboratory, Peking University People's Hospital, Beijing, China
| | - Henan Li
- Department of Clinical Laboratory, Peking University People's Hospital, Beijing, China
| | - Chunjiang Zhao
- Department of Clinical Laboratory, Peking University People's Hospital, Beijing, China
| | - Shuguang Li
- Department of Clinical Laboratory, Peking University People's Hospital, Beijing, China
| | - Guankun Yin
- Department of Clinical Laboratory, Peking University People's Hospital, Beijing, China
| | - Hui Wang
- Institute of Medical Technology, Peking University Health Science Center, Beijing, China
- Department of Clinical Laboratory, Peking University People's Hospital, Beijing, China
| |
Collapse
|
6
|
Wang X, Patil N, Li F, Wang Z, Zhan H, Schmidt D, Thompson P, Guo Y, Landersdorfer CB, Shen HH, Peleg AY, Li J, Song J. PmxPred: A data-driven approach for the identification of active polymyxin analogues against gram-negative bacteria. Comput Biol Med 2024; 168:107681. [PMID: 37992470 DOI: 10.1016/j.compbiomed.2023.107681] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2023] [Revised: 10/07/2023] [Accepted: 11/06/2023] [Indexed: 11/24/2023]
Abstract
The multidrug-resistant Gram-negative bacteria has evolved into a worldwide threat to human health; over recent decades, polymyxins have re-emerged in clinical practice due to their high activity against multidrug-resistant bacteria. Nevertheless, the nephrotoxicity and neurotoxicity of polymyxins seriously hinder their practical use in the clinic. Based on the quantitative structure-activity relationship (QSAR), analogue design is an efficient strategy for discovering biologically active compounds with fewer adverse effects. To accelerate the polymyxin analogues discovery process and find the polymyxin analogues with high antimicrobial activity against Gram-negative bacteria, here we developed PmxPred, a GCN and catBoost-based machine learning framework. The RDKit descriptors were used for the molecule and residues representation, and the ensemble learning model was utilized for the antimicrobial activity prediction. This framework was trained and evaluated on multiple Gram-negative bacteria datasets, including Acinetobacter baumannii, Escherichia coli, Klebsiella pneumoniae, Pseudomonas aeruginosa and a general Gram-negative bacteria dataset achieving an AUROC of 0.857, 0.880, 0.756, 0.895 and 0.865 on the independent test, respectively. PmxPred outperformed the transfer learning method that trained on 10 million molecules. We interpreted our model well-trained model by analysing the importance of global and residue features. Overall, PmxPred provides a powerful additional tool for predicting active polymyxin analogues, and holds the potential elucidate the mechanisms underlying the antimicrobial activity of polymyxins. The source code is publicly available on GitHub (https://github.com/yanwu20/PmxPred).
Collapse
Affiliation(s)
- Xiaoyu Wang
- Monash Biomedicine Discovery Institute, Monash University, Melbourne, VIC, 3800, Australia; Monash Data Futures Institute, Monash University, Melbourne, VIC, 3800, Australia; Centre to Impact AMR, Monash University, Melbourne, VIC, 3800, Australia
| | - Nitin Patil
- Monash Biomedicine Discovery Institute, Monash University, Melbourne, VIC, 3800, Australia
| | - Fuyi Li
- College of Information Engineering, Northwest A&F University, Yangling, 712100, Shaanxi, China; Department of Microbiology and Immunology, The Peter Doherty Institute for Infection and Immunity, The University of Melbourne, Melbourne, Victoria, Australia
| | - Zhikang Wang
- Monash Biomedicine Discovery Institute, Monash University, Melbourne, VIC, 3800, Australia; Monash Data Futures Institute, Monash University, Melbourne, VIC, 3800, Australia; Centre to Impact AMR, Monash University, Melbourne, VIC, 3800, Australia
| | - Haolan Zhan
- Faculty of Information Technology, Monash University, Melbourne, VIC, 3800, Australia
| | - Daniel Schmidt
- Monash Data Futures Institute, Monash University, Melbourne, VIC, 3800, Australia; Faculty of Information Technology, Monash University, Melbourne, VIC, 3800, Australia
| | - Philip Thompson
- Drug Delivery, Disposition and Dynamics, Monash Institute of Pharmaceutical Sciences, Monash University, Parkville, Victoria, Australia
| | - Yuming Guo
- Department of Epidemiology and Preventive Medicine, School of Public Health and Preventive Medicine, Monash University, Melbourne, VIC, 3004, Australia
| | - Cornelia B Landersdorfer
- Centre to Impact AMR, Monash University, Melbourne, VIC, 3800, Australia; Drug Delivery, Disposition and Dynamics, Monash Institute of Pharmaceutical Sciences, Monash University, Parkville, Victoria, Australia
| | - Hsin-Hui Shen
- Monash Biomedicine Discovery Institute, Monash University, Melbourne, VIC, 3800, Australia; Department of Materials Science and Engineering, Faculty of Engineering, Monash University, Clayton, VIC, 3800, Australia
| | - Anton Y Peleg
- Monash Biomedicine Discovery Institute, Monash University, Melbourne, VIC, 3800, Australia; Centre to Impact AMR, Monash University, Melbourne, VIC, 3800, Australia; Department of Infectious Diseases, Alfred Hospital, Alfred Health, Melbourne, Victoria, Australia
| | - Jian Li
- Monash Biomedicine Discovery Institute, Monash University, Melbourne, VIC, 3800, Australia; Monash Data Futures Institute, Monash University, Melbourne, VIC, 3800, Australia; Centre to Impact AMR, Monash University, Melbourne, VIC, 3800, Australia.
| | - Jiangning Song
- Monash Biomedicine Discovery Institute, Monash University, Melbourne, VIC, 3800, Australia; Monash Data Futures Institute, Monash University, Melbourne, VIC, 3800, Australia; Centre to Impact AMR, Monash University, Melbourne, VIC, 3800, Australia.
| |
Collapse
|
7
|
Ding X, Huang Y, Tian X, Zhao Y, Feng G, Gao Z. Diagnosis, Treatment, and Management of Otitis Media with Artificial Intelligence. Diagnostics (Basel) 2023; 13:2309. [PMID: 37443702 DOI: 10.3390/diagnostics13132309] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2023] [Revised: 06/04/2023] [Accepted: 06/14/2023] [Indexed: 07/15/2023] Open
Abstract
A common infectious disease, otitis media (OM) has a low rate of early diagnosis, which significantly increases the difficulty of treating the disease and the likelihood of serious complications developing including hearing loss, speech impairment, and even intracranial infection. Several areas of healthcare have shown great promise in the application of artificial intelligence (AI) systems, such as the accurate detection of diseases, the automated interpretation of images, and the prediction of patient outcomes. Several articles have reported some machine learning (ML) algorithms such as ResNet, InceptionV3 and Unet, were applied to the diagnosis of OM successfully. The use of these techniques in the OM is still in its infancy, but their potential is enormous. We present in this review important concepts related to ML and AI, describe how these technologies are currently being applied to diagnosing, treating, and managing OM, and discuss the challenges associated with developing AI-assisted OM technologies in the future.
Collapse
Affiliation(s)
- Xin Ding
- Department of Otorhinolaryngology Head and Neck Surgery, The Peaking Union Medical College Hospital, No. 1, Shuaifuyuan, Dongcheng District, Beijing 100010, China
| | - Yu Huang
- Department of Otorhinolaryngology Head and Neck Surgery, The Peaking Union Medical College Hospital, No. 1, Shuaifuyuan, Dongcheng District, Beijing 100010, China
| | - Xu Tian
- Department of Otorhinolaryngology Head and Neck Surgery, The Peaking Union Medical College Hospital, No. 1, Shuaifuyuan, Dongcheng District, Beijing 100010, China
| | - Yang Zhao
- Department of Otorhinolaryngology Head and Neck Surgery, The Peaking Union Medical College Hospital, No. 1, Shuaifuyuan, Dongcheng District, Beijing 100010, China
| | - Guodong Feng
- Department of Otorhinolaryngology Head and Neck Surgery, The Peaking Union Medical College Hospital, No. 1, Shuaifuyuan, Dongcheng District, Beijing 100010, China
| | - Zhiqiang Gao
- Department of Otorhinolaryngology Head and Neck Surgery, The Peaking Union Medical College Hospital, No. 1, Shuaifuyuan, Dongcheng District, Beijing 100010, China
| |
Collapse
|
8
|
Farhat F, Athar MT, Ahmad S, Madsen DØ, Sohail SS. Antimicrobial resistance and machine learning: past, present, and future. Front Microbiol 2023; 14:1179312. [PMID: 37303800 PMCID: PMC10250749 DOI: 10.3389/fmicb.2023.1179312] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2023] [Accepted: 05/15/2023] [Indexed: 06/13/2023] Open
Abstract
Machine learning has become ubiquitous across all industries, including the relatively new application of predicting antimicrobial resistance. As the first bibliometric review in this field, we expect it to inspire further research in this area. The review employs standard bibliometric indicators such as article count, citation count, and the Hirsch index (H-index) to evaluate the relevance and impact of the leading countries, organizations, journals, and authors in this field. VOSviewer and Biblioshiny programs are utilized to analyze citation and co-citation networks, collaboration networks, keyword co-occurrence, and trend analysis. The United States has the highest contribution with 254 articles, accounting for over 37.57% of the total corpus, followed by China (103) and the United Kingdom (78). Among 58 publishers, the top four publishers account for 45% of the publications, with Elsevier leading with 15% of the publications, followed by Springer Nature (12%), MDPI, and Frontiers Media SA with 9% each. Frontiers in Microbiology is the most frequent publication source (33 articles), followed by Scientific Reports (29 articles), PLoS One (17 articles), and Antibiotics (16 articles). The study reveals a substantial increase in research and publications on the use of machine learning to predict antibiotic resistance. Recent research has focused on developing advanced machine learning algorithms that can accurately forecast antibiotic resistance, and a range of algorithms are now being used to address this issue.
Collapse
Affiliation(s)
- Faiza Farhat
- Department of Zoology, Aligarh Muslim University, Aligarh, India
| | - Md Tanwir Athar
- Department of Pharmacognosy and Pharmaceutical Chemistry, College of Dentistry and Pharmacy, Buraydah Colleges, Buraydah, Al-Qassim, Saudi Arabia
| | - Sultan Ahmad
- Department of Computer Science, College of Computer Engineering and Sciences, Prince Sattam bin Abdulaziz University, Al-Kharj, Saudi Arabia
- Department of Computer Science and Engineering, University Center for Research and Development (UCRD), Chandigarh University, Mohali, Punjab, India
| | - Dag Øivind Madsen
- School of Business, University of South-Eastern Norway, Hønefoss, Norway
| | - Shahab Saquib Sohail
- Department of Computer Science and Engineering, Jamia Hamdard University, New Delhi, India
| |
Collapse
|
9
|
Álvarez VE, Quiroga MP, Centrón D. Identification of a Specific Biomarker of Acinetobacter baumannii Global Clone 1 by Machine Learning and PCR Related to Metabolic Fitness of ESKAPE Pathogens. mSystems 2023:e0073422. [PMID: 37184409 DOI: 10.1128/msystems.00734-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/16/2023] Open
Abstract
Since the emergence of high-risk clones worldwide, constant investigations have been undertaken to comprehend the molecular basis that led to their prevalent dissemination in nosocomial settings over time. So far, the complex and multifactorial genetic traits of this type of epidemic clones have allowed only the identification of biomarkers with low specificity. A machine learning algorithm was able to recognize unequivocally a biomarker for early and accurate detection of Acinetobacter baumannii global clone 1 (GC1), one of the most disseminated high-risk clones. A support vector machine model identified the U1 sequence with a length of 367 nucleotides that matched a fragment of the moaCB gene, which encodes the molybdenum cofactor biosynthesis C and B proteins. U1 differentiates specifically between A. baumannii GC1 and non-GC1 strains, becoming a suitable biomarker capable of being translated into clinical settings as a molecular typing method for early diagnosis based on PCR as shown here. Since the metabolic pathways of Mo enzymes have been recognized as putative therapeutic targets for ESKAPE (Enterococcus faecium, Staphylococcus aureus, Klebsiella pneumoniae, Acinetobacter baumannii, Pseudomonas aeruginosa, and Enterobacter species) pathogens, our findings highlight that machine learning can also be useful in knowledge gaps of high-risk clones and provides noteworthy support to the literature to identify relevant nosocomial biomarkers for other multidrug-resistant high-risk clones. IMPORTANCE A. baumannii GC1 is an important high-risk clone that rapidly develops extreme drug resistance in the nosocomial niche. Furthermore, several strains have been identified worldwide in environmental samples, exacerbating the risk of human interactions. Early diagnosis is mandatory to limit its dissemination and to outline appropriate antibiotic stewardship schedules. A region with a length of 367 bp (U1) within the moaCB gene that is not subjected to lateral genetic transfer or to antibiotic pressures was successfully found by a support vector machine model that predicts A. baumannii GC1 strains. At the same time, research on the group of Mo enzymes proposed this metabolic pathway related to the superbug's metabolism as a potential future drug target site for ESKAPE pathogens due to its central role in bacterial fitness during infection. These findings confirm that machine learning used for the identification of biomarkers of high-risk lineages can also serve to identify putative novel therapeutic target sites.
Collapse
Affiliation(s)
- Verónica Elizabeth Álvarez
- Laboratorio de Investigaciones en Mecanismos de Resistencia a Antibióticos (LIMRA), Instituto de Investigaciones en Microbiología y Parasitología Médica, Facultad de Medicina, Universidad de Buenos Aires-Consejo Nacional de Investigaciones Científicas y Tecnológicas (IMPaM, UBA-CONICET), Ciudad Autónoma de Buenos Aires, Argentina
| | - María Paula Quiroga
- Laboratorio de Investigaciones en Mecanismos de Resistencia a Antibióticos (LIMRA), Instituto de Investigaciones en Microbiología y Parasitología Médica, Facultad de Medicina, Universidad de Buenos Aires-Consejo Nacional de Investigaciones Científicas y Tecnológicas (IMPaM, UBA-CONICET), Ciudad Autónoma de Buenos Aires, Argentina
- Nodo de Bioinformática. Instituto de Investigaciones en Microbiología y Parasitología Médica, Facultad de Medicina, Universidad de Buenos Aires-Consejo Nacional de Investigaciones Científicas y Técnicas (IMPaM, UBA-CONICET), Ciudad Autónoma de Buenos Aires, Argentina
| | - Daniela Centrón
- Laboratorio de Investigaciones en Mecanismos de Resistencia a Antibióticos (LIMRA), Instituto de Investigaciones en Microbiología y Parasitología Médica, Facultad de Medicina, Universidad de Buenos Aires-Consejo Nacional de Investigaciones Científicas y Tecnológicas (IMPaM, UBA-CONICET), Ciudad Autónoma de Buenos Aires, Argentina
| |
Collapse
|
10
|
Saber MM, Donner J, Levade I, Acosta N, Parkins MD, Boyle B, Levesque RC, Nguyen D, Shapiro BJ. Single nucleotide variants in Pseudomonas aeruginosa populations from sputum correlate with baseline lung function and predict disease progression in individuals with cystic fibrosis. Microb Genom 2023; 9. [PMID: 37052589 DOI: 10.1099/mgen.0.000981] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/14/2023] Open
Abstract
The severity and progression of lung disease are highly variable across individuals with cystic fibrosis (CF) and are imperfectly predicted by mutations in the human gene CFTR, lung microbiome variation or other clinical factors. The opportunistic pathogen Pseudomonas aeruginosa (Pa) dominates airway infections in most CF adults. Here we hypothesized that within-host genetic variation of Pa populations would be associated with lung disease severity. To quantify Pa genetic variation within CF sputum samples, we used deep amplicon sequencing (AmpliSeq) of 209 Pa genes previously associated with pathogenesis or adaptation to the CF lung. We trained machine learning models using Pa single nucleotide variants (SNVs), microbiome diversity data and clinical factors to classify lung disease severity at the time of sputum sampling, and to predict lung function decline after 5 years in a cohort of 54 adult CF patients with chronic Pa infection. Models using Pa SNVs alone classified lung disease severity with good sensitivity and specificity (area under the receiver operating characteristic curve: AUROC=0.87). Models were less predictive of lung function decline after 5 years (AUROC=0.74) but still significantly better than random. The addition of clinical data, but not sputum microbiome diversity data, yielded only modest improvements in classifying baseline lung function (AUROC=0.92) and predicting lung function decline (AUROC=0.79), suggesting that Pa AmpliSeq data account for most of the predictive value. Our work provides a proof of principle that Pa genetic variation in sputum tracks lung disease severity, moderately predicts lung function decline and could serve as a disease biomarker among CF patients with chronic Pa infections.
Collapse
Affiliation(s)
- Morteza M Saber
- Department of Microbiology and Immunology, McGill University, Montreal, QC, Canada
| | - Jannik Donner
- Department of Medicine, Research Institute of the McGill University Health Centre, Montreal, QC, Canada
| | - Inès Levade
- Department of Medicine, Research Institute of the McGill University Health Centre, Montreal, QC, Canada
| | - Nicole Acosta
- Department of Microbiology, Immunology and Infectious Disease, University of Calgary, Calgary, AB, Canada
| | - Michael D Parkins
- Department of Microbiology, Immunology and Infectious Disease, University of Calgary, Calgary, AB, Canada
- Department of Medicine, University of Calgary, Calgary, AB, Canada
| | - Brian Boyle
- Integrative Systems Biology Institute, University of Laval, Québec, QC, Canada
| | - Roger C Levesque
- Integrative Systems Biology Institute, University of Laval, Québec, QC, Canada
| | - Dao Nguyen
- Department of Microbiology and Immunology, McGill University, Montreal, QC, Canada
- Department of Medicine, Research Institute of the McGill University Health Centre, Montreal, QC, Canada
- Meakins Christie Laboratories, Research Institute of the McGill University Health Centre, Montreal, QC, Canada
| | - B Jesse Shapiro
- Department of Microbiology and Immunology, McGill University, Montreal, QC, Canada
- McGill Genome Centre, Montreal, QC, Canada
| |
Collapse
|
11
|
Yu J, Lin YT, Chen WC, Tseng KH, Lin HH, Tien N, Cho CF, Huang JY, Liang SJ, Ho LC, Hsieh YW, Hsu KC, Ho MW, Hsueh PR, Cho DY. Direct prediction of carbapenem-resistant, carbapenemase-producing, and colistin-resistant Klebsiella pneumoniae isolates from routine MALDI-TOF mass spectra using machine learning and outcome evaluation. Int J Antimicrob Agents 2023; 61:106799. [PMID: 37004755 DOI: 10.1016/j.ijantimicag.2023.106799] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2022] [Revised: 03/14/2023] [Accepted: 03/26/2023] [Indexed: 04/03/2023]
Abstract
The objective of this study was to develop a rapid prediction method for carbapenem-resistant Klebsiella pneumoniae (CRKP) and colistin-resistant K. pneumoniae (ColRKP) based on routine MALDI-TOF mass spectrometry (MS) results in order to formulate a suitable and rapid treatment strategy. In total, 830 CRKP and 1,462 carbapenem-susceptible K. pneumoniae (CSKP) isolates were collected; 54 ColRKP isolates and 1,592 colistin-intermediate K. pneumoniae (ColIKP) isolates were also included. Routine MALDI-TOF MS, antimicrobial susceptibility testing, NG-Test CARBA 5, and resistance gene detection were followed by machine learning (ML). Using the ML model, the accuracy and area under the curve for differentiating CRKP and CSKP were 0.8869 and 0.9551, and those for ColRKP and ColIKP were 0.8361 and 0.8447, respectively. The most important MS features of CRKP and ColRKP were m/z 4520-4529 and m/z 4170-4179, respectively. Of the CRKP isolates, MS m/z 4520-4529 was a potential biomarker for distinguishing KPC from OXA, NDM, IMP, and VIM. Of the 34 patients who received preliminary CRKP ML prediction results (by texting), 24 (70.6%) were confirmed to have CRKP infection. The mortality rate was lower in patients who received antibiotic regimen adjustment based on the preliminary ML prediction (4/14, 28.6%). In conclusion, the proposed model can provide rapid results for differentiating CRKP and CSKP, as well as ColRKP and ColIKP. The combination of ML-based CRKP with preliminary reporting of results can help physicians alter the regimen approximately 24 h earlier, resulting in improved survival of patients with timely antibiotic intervention.
Collapse
|
12
|
Kim JH, Kim Y, Yoo K, Kim M, Kang SS, Kwon YS, Lee JJ. Prediction of Postoperative Pulmonary Edema Risk Using Machine Learning. J Clin Med 2023; 12:jcm12051804. [PMID: 36902590 PMCID: PMC10003313 DOI: 10.3390/jcm12051804] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2023] [Revised: 02/16/2023] [Accepted: 02/22/2023] [Indexed: 02/27/2023] Open
Abstract
Postoperative pulmonary edema (PPE) is a well-known postoperative complication. We hypothesized that a machine learning model could predict PPE risk using pre- and intraoperative data, thereby improving postoperative management. This retrospective study analyzed the medical records of patients aged > 18 years who underwent surgery between January 2011 and November 2021 at five South Korean hospitals. Data from four hospitals (n = 221,908) were used as the training dataset, whereas data from the remaining hospital (n = 34,991) were used as the test dataset. The machine learning algorithms used were extreme gradient boosting, light-gradient boosting machine, multilayer perceptron, logistic regression, and balanced random forest (BRF). The prediction abilities of the machine learning models were assessed using the area under the receiver operating characteristic curve, feature importance, and average precisions of precision-recall curve, precision, recall, f1 score, and accuracy. PPE occurred in 3584 (1.6%) and 1896 (5.4%) patients in the training and test sets, respectively. The BRF model exhibited the best performance (area under the receiver operating characteristic curve: 0.91, 95% confidence interval: 0.84-0.98). However, its precision and f1 score metrics were not good. The five major features included arterial line monitoring, American Society of Anesthesiologists physical status, urine output, age, and Foley catheter status. Machine learning models (e.g., BRF) could predict PPE risk and improve clinical decision-making, thereby enhancing postoperative management.
Collapse
Affiliation(s)
- Jong Ho Kim
- Department of Anesthesiology and Pain Medicine, Chuncheon Sacred Heart Hospital, Hallym University College of Medicine, Chuncheon-si 24253, Republic of Korea
- Institute of New Frontier Research Team, Hallym University College of Medicine, Chuncheon-si 24252, Republic of Korea
| | - Youngmi Kim
- Institute of New Frontier Research Team, Hallym University College of Medicine, Chuncheon-si 24252, Republic of Korea
| | - Kookhyun Yoo
- Department of Anesthesiology and Pain Medicine, Chuncheon Sacred Heart Hospital, Hallym University College of Medicine, Chuncheon-si 24253, Republic of Korea
| | - Minguan Kim
- Department of Anesthesiology and Pain Medicine, Chuncheon Sacred Heart Hospital, Hallym University College of Medicine, Chuncheon-si 24253, Republic of Korea
| | - Seong Sik Kang
- Department of Anesthesiology and Pain Medicine, College of Medicine, Kangwon National University, Chuncheon-si 24341, Republic of Korea
| | - Young-Suk Kwon
- Department of Anesthesiology and Pain Medicine, Chuncheon Sacred Heart Hospital, Hallym University College of Medicine, Chuncheon-si 24253, Republic of Korea
- Institute of New Frontier Research Team, Hallym University College of Medicine, Chuncheon-si 24252, Republic of Korea
- Correspondence: (Y.-S.K.); (J.J.L.); Tel.: +82-33-240-5271 (Y.-S.K. & J.J.L.)
| | - Jae Jun Lee
- Department of Anesthesiology and Pain Medicine, Chuncheon Sacred Heart Hospital, Hallym University College of Medicine, Chuncheon-si 24253, Republic of Korea
- Institute of New Frontier Research Team, Hallym University College of Medicine, Chuncheon-si 24252, Republic of Korea
- Correspondence: (Y.-S.K.); (J.J.L.); Tel.: +82-33-240-5271 (Y.-S.K. & J.J.L.)
| |
Collapse
|
13
|
Li S, Wu J, Ma N, Liu W, Shao M, Ying N, Zhu L. Prediction of genome-wide imipenem resistance features in Klebsiella pneumoniae using machine learning. J Med Microbiol 2023; 72. [PMID: 36753438 DOI: 10.1099/jmm.0.001657] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/09/2023] Open
Abstract
Introduction. The resistance rate of Klebsiella pneumoniae (K. pneumoniae) to imipenem is increasing year by year, and the imipenem resistance mechanism of K. pneumoniae is complex. Therefore, it is urgent to develop new strategies to explore the resistance mechanism of imipenem for its effective and accurate use in clinical practice.Hypothesis/Gap sStatement. Machine learning could identify resistance features and biological process that influence microbial resistance from whole-genome sequencing (WGS) data.Aims. This work aimed to predict imipenem resistance genetic features in K. pneumoniae from whole-genome k-mer features, and analyse their function for understanding its resistance mechanism.Methods. This study analysed WGS data of K. pneumoniae combined with resistance phenotype for imipenem, and established K. pneumoniae to imipenem genotype-phenotype model to predict resistance features using chi-squared test and random forest. An external clinical dataset was used to verify prediction power of resistance features. The potential genes were identified through alignment the resistance features with the K. pneumoniae reference genome using blastn, the functions of potential genes were further analysed to explore its resistance-related signalling pathways with GO and KEGG analysis, the resistance sequence patterns were screened using streme software. Finally, the resistance features were combined and modelled through four machine-learning algorithms (logistic regression, SVM, GBDT and XGBoost) to evaluate their phenotype prediction ability.Results. A total of 16 670 imipenem resistance features were predicted from genotype-phenotype model. The 30 potential genes were identified by annotating the resistance features and corresponded to known antibiotic-related genes (mdtM, dedA, rne, etc.). GO and KEGG pathway analyses indicated the possible association of imipenem resistance with metabolism process and cell membrane. CRYCAGCDN and CGRDAAAN were found from the imipenem resistance features, which were widely presented in the reported β-lactam resistance genes (bla SHV, bla CTX-M, bla TEM, etc.), and YCYAGCMCAST with metabolic functions (organic substance metabolic process, nitrogen compound metabolic process and cellular metabolic process) was identified from the top 50 resistance features. The 25 resistance genes in the training dataset included 19 genes in the external dataset, which verified the accuracy of prediction. The area under curve values of logistics regression, SVM, GBDT and XGBoost were 0.965, 0.966, 0.969 and 0.969, respectively, indicating that the imipenem resistance features have a strong prediction power.Conclusion. Machine-learning methods could effectively predict the imipenem resistance feature in K. pneumoniae, and provide resistance sequence profiles for predicting resistance phenotype and exploring potential resistance mechanisms. It provides an important insight into the potential therapeutic strategies of K. pneumoniae resistance to imipenem, and speed up the application of machine learning in routine diagnosis.
Collapse
Affiliation(s)
- Shanshan Li
- College of Automation, Hangzhou Dianzi University, Hangzhou, Zhejiang, 310018, PR China
| | - Jun Wu
- Lin'an Center for Disease Control and Prevention, Lin'an, 311300, PR China
| | - Nan Ma
- College of Automation, Hangzhou Dianzi University, Hangzhou, Zhejiang, 310018, PR China
| | - Wenjia Liu
- College of Automation, Hangzhou Dianzi University, Hangzhou, Zhejiang, 310018, PR China.,College of Electronics and Information Engineering, Hangzhou Dianzi University, Hangzhou 310018, PR China
| | - Mengjie Shao
- College of Automation, Hangzhou Dianzi University, Hangzhou, Zhejiang, 310018, PR China
| | - Nanjiao Ying
- College of Automation, Hangzhou Dianzi University, Hangzhou, Zhejiang, 310018, PR China.,Institute of Biomedical Engineering and Instrument, Hangzhou Dianzi University, Hangzhou, Zhejiang, 310018, PR China
| | - Lei Zhu
- College of Automation, Hangzhou Dianzi University, Hangzhou, Zhejiang, 310018, PR China.,Institute of Biomedical Engineering and Instrument, Hangzhou Dianzi University, Hangzhou, Zhejiang, 310018, PR China
| |
Collapse
|
14
|
Tharmakulasingam M, Gardner B, La Ragione R, Fernando A. Rectified Classifier Chains for Prediction of Antibiotic Resistance From Multi-Labelled Data With Missing Labels. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023; 20:625-636. [PMID: 35130168 DOI: 10.1109/tcbb.2022.3148577] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
Predicting Antimicrobial Resistance (AMR) from genomic data has important implications for human and animal healthcare, and especially given its potential for more rapid diagnostics and informed treatment choices. With the recent advances in sequencing technologies, applying machine learning techniques for AMR prediction have indicated promising results. Despite this, there are shortcomings in the literature concerning methodologies suitable for multi-drug AMR prediction and especially where samples with missing labels exist. To address this shortcoming, we introduce a Rectified Classifier Chain (RCC) method for predicting multi-drug resistance. This RCC method was tested using annotated features of genomics sequences and compared with similar multi-label classification methodologies. We found that applying the eXtreme Gradient Boosting (XGBoost) base model to our RCC model outperformed the second-best model, XGBoost based binary relevance model, by 3.3% in Hamming accuracy and 7.8% in F1-score. Additionally, we note that in the literature machine learning models applied to AMR prediction typically are unsuitable for identifying biomarkers informative of their decisions; in this study, we show that biomarkers contributing to AMR prediction can also be identified using the proposed RCC method. We expect this can facilitate genome annotation and pave the path towards identifying new biomarkers indicative of AMR.
Collapse
|
15
|
Geurtsen J, de Been M, Weerdenburg E, Zomer A, McNally A, Poolman J. Genomics and pathotypes of the many faces of Escherichia coli. FEMS Microbiol Rev 2022; 46:6617594. [PMID: 35749579 PMCID: PMC9629502 DOI: 10.1093/femsre/fuac031] [Citation(s) in RCA: 41] [Impact Index Per Article: 20.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2021] [Accepted: 06/22/2022] [Indexed: 01/09/2023] Open
Abstract
Escherichia coli is the most researched microbial organism in the world. Its varied impact on human health, consisting of commensalism, gastrointestinal disease, or extraintestinal pathologies, has generated a separation of the species into at least eleven pathotypes (also known as pathovars). These are broadly split into two groups, intestinal pathogenic E. coli (InPEC) and extraintestinal pathogenic E. coli (ExPEC). However, components of E. coli's infinite open accessory genome are horizontally transferred with substantial frequency, creating pathogenic hybrid strains that defy a clear pathotype designation. Here, we take a birds-eye view of the E. coli species, characterizing it from historical, clinical, and genetic perspectives. We examine the wide spectrum of human disease caused by E. coli, the genome content of the bacterium, and its propensity to acquire, exchange, and maintain antibiotic resistance genes and virulence traits. Our portrayal of the species also discusses elements that have shaped its overall population structure and summarizes the current state of vaccine development targeted at the most frequent E. coli pathovars. In our conclusions, we advocate streamlining efforts for clinical reporting of ExPEC, and emphasize the pathogenic potential that exists throughout the entire species.
Collapse
Affiliation(s)
- Jeroen Geurtsen
- Janssen Vaccines and Prevention B.V., 2333 Leiden, the Netherlands
| | - Mark de Been
- Janssen Vaccines and Prevention B.V., 2333 Leiden, the Netherlands
| | | | - Aldert Zomer
- Department of Infectious Diseases and Immunology, Faculty of Veterinary Medicine, Utrecht University, 3584 Utrecht, the Netherlands
| | - Alan McNally
- Institute of Microbiology and Infection, College of Medical and Dental Sciences, University of Birmingham, B15 2TT Birmingham, United Kingdom
| | - Jan Poolman
- Janssen Vaccines and Prevention B.V., 2333 Leiden, the Netherlands
| |
Collapse
|
16
|
Pei N, Sun W, He J, Li Y, Chen X, Liang T, Kristiansen K, Liu W, Li J. Genome-wide association study of Klebsiella pneumoniae identifies variations linked to carbapenems resistance. Front Microbiol 2022; 13:997769. [PMID: 36386631 PMCID: PMC9664935 DOI: 10.3389/fmicb.2022.997769] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2022] [Accepted: 10/10/2022] [Indexed: 01/25/2023] Open
Abstract
Klebsiella pneumoniae (KP) is one of the microorganisms that can acquire carbapenem-resistance (CR), and few antimicrobial therapy options exist for infections caused by Carbapenem-Resistant KP (CRKP). In recent years, with the increase of carbapenem resistance rates, treating CRKP has become a serious public health threat in clinical practice. We have collected 2,035 clinical KP isolates from a tertiary hospital in China. Whole genome sequencing data coupled with their binary antimicrobial susceptibility testing data were obtained to conduct the genome-wide association study using a bayesian-based method, including single nucleotide polymorphisms (SNPs) and genes. We identified 28 and 37 potential maker genes associated with imipenem and meropenem resistance, respectively. Among which 19 of them were selected in both drugs by genome-wide association study (GWAS), 11 genes among them were simultaneously validated in independent datasets. These genes were likely related to biofilm formation, efflux pump, and DNA repairing. Moreover, we identified 13 significant CR related SNPs in imipenem or meropenem, with one SNP located in the non-coding region and validated in the independent datasets. Our study indicates complex mechanisms of carbapenems resistance and further investigation of CRKP-related factors are warranted to better understand their contributions to carbapenems resistance. These identified biomarkers may provide targets for future drug interventions or treatments.
Collapse
Affiliation(s)
- Na Pei
- BGI-Shenzhen, Shenzhen, China,Laboratory of Genomics and Molecular Biomedicine, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | | | - Jingxuan He
- Department of Clinical Laboratory, Xiangya Hospital, Central South University, Changsha, China
| | - Yanming Li
- Department of Clinical Laboratory, Xiangya Hospital, Central South University, Changsha, China
| | - Xia Chen
- Department of Clinical Laboratory, Xiangya Hospital, Central South University, Changsha, China
| | - Tianzhu Liang
- BGI-Shenzhen, Shenzhen, China,Shenzhen Key Laboratory of Unknown Pathogen Identification, Shenzhen, China
| | - Karsten Kristiansen
- BGI-Shenzhen, Shenzhen, China,Laboratory of Genomics and Molecular Biomedicine, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Wenen Liu
- Department of Clinical Laboratory, Xiangya Hospital, Central South University, Changsha, China,*Correspondence: Junhua Li, ; Wenen Liu,
| | - Junhua Li
- BGI-Shenzhen, Shenzhen, China,Shenzhen Key Laboratory of Unknown Pathogen Identification, Shenzhen, China,*Correspondence: Junhua Li, ; Wenen Liu,
| |
Collapse
|
17
|
Machine Learning for Antimicrobial Resistance Prediction: Current Practice, Limitations, and Clinical Perspective. Clin Microbiol Rev 2022; 35:e0017921. [PMID: 35612324 DOI: 10.1128/cmr.00179-21] [Citation(s) in RCA: 26] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
Antimicrobial resistance (AMR) is a global health crisis that poses a great threat to modern medicine. Effective prevention strategies are urgently required to slow the emergence and further dissemination of AMR. Given the availability of data sets encompassing hundreds or thousands of pathogen genomes, machine learning (ML) is increasingly being used to predict resistance to different antibiotics in pathogens based on gene content and genome composition. A key objective of this work is to advocate for the incorporation of ML into front-line settings but also highlight the further refinements that are necessary to safely and confidently incorporate these methods. The question of what to predict is not trivial given the existence of different quantitative and qualitative laboratory measures of AMR. ML models typically treat genes as independent predictors, with no consideration of structural and functional linkages; they also may not be accurate when new mutational variants of known AMR genes emerge. Finally, to have the technology trusted by end users in public health settings, ML models need to be transparent and explainable to ensure that the basis for prediction is clear. We strongly advocate that the next set of AMR-ML studies should focus on the refinement of these limitations to be able to bridge the gap to diagnostic implementation.
Collapse
|
18
|
Coll F, Gouliouris T, Bruchmann S, Phelan J, Raven KE, Clark TG, Parkhill J, Peacock SJ. PowerBacGWAS: a computational pipeline to perform power calculations for bacterial genome-wide association studies. Commun Biol 2022; 5:266. [PMID: 35338232 PMCID: PMC8956664 DOI: 10.1038/s42003-022-03194-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2020] [Accepted: 02/25/2022] [Indexed: 12/14/2022] Open
Abstract
Genome-wide association studies (GWAS) are increasingly being applied to investigate the genetic basis of bacterial traits. However, approaches to perform power calculations for bacterial GWAS are limited. Here we implemented two alternative approaches to conduct power calculations using existing collections of bacterial genomes. First, a sub-sampling approach was undertaken to reduce the allele frequency and effect size of a known and detectable genotype-phenotype relationship by modifying phenotype labels. Second, a phenotype-simulation approach was conducted to simulate phenotypes from existing genetic variants. We implemented both approaches into a computational pipeline (PowerBacGWAS) that supports power calculations for burden testing, pan-genome and variant GWAS; and applied it to collections of Enterococcus faecium, Klebsiella pneumoniae and Mycobacterium tuberculosis. We used this pipeline to determine sample sizes required to detect causal variants of different minor allele frequencies (MAF), effect sizes and phenotype heritability, and studied the effect of homoplasy and population diversity on the power to detect causal variants. Our pipeline and user documentation are made available and can be applied to other bacterial populations. PowerBacGWAS can be used to determine sample sizes required to find statistically significant associations, or the associations detectable with a given sample size. We recommend to perform power calculations using existing genomes of the bacterial species and population of study.
Collapse
Affiliation(s)
- Francesc Coll
- Department of Infection Biology, Faculty of Infectious & Tropical Diseases, London School of Hygiene & Tropical Medicine, London, UK.
| | - Theodore Gouliouris
- Department of Medicine, University of Cambridge, Cambridge, UK
- Cambridge University Hospitals NHS Foundation Trust, Cambridge, UK
| | | | - Jody Phelan
- Department of Infection Biology, Faculty of Infectious & Tropical Diseases, London School of Hygiene & Tropical Medicine, London, UK
| | - Kathy E Raven
- Department of Medicine, University of Cambridge, Cambridge, UK
| | - Taane G Clark
- Department of Infection Biology, Faculty of Infectious & Tropical Diseases, London School of Hygiene & Tropical Medicine, London, UK
- Faculty of Epidemiology and Population Health, Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, UK
| | - Julian Parkhill
- Department of Veterinary Medicine, University of Cambridge, Cambridge, UK
| | | |
Collapse
|
19
|
Wang S, Zhao C, Yin Y, Chen F, Chen H, Wang H. A Practical Approach for Predicting Antimicrobial Phenotype Resistance in Staphylococcus aureus Through Machine Learning Analysis of Genome Data. Front Microbiol 2022; 13:841289. [PMID: 35308374 PMCID: PMC8924536 DOI: 10.3389/fmicb.2022.841289] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2021] [Accepted: 02/11/2022] [Indexed: 11/28/2022] Open
Abstract
With the reduction in sequencing price and acceleration of sequencing speed, it is particularly important to directly link the genotype and phenotype of bacteria. Here, we firstly predicted the minimum inhibitory concentrations of ten antimicrobial agents for Staphylococcus aureus using 466 isolates by directly extracting k-mer from whole genome sequencing data combined with three machine learning algorithms: random forest, support vector machine, and XGBoost. Considering one two-fold dilution, the essential agreement and the category agreement could reach >85% and >90% for most antimicrobial agents. For clindamycin, cefoxitin and trimethoprim-sulfamethoxazole, the essential agreement and the category agreement could reach >91% and >93%, providing important information for clinical treatment. The successful prediction of cefoxitin resistance showed that the model could identify methicillin-resistant S. aureus. The results suggest that small datasets available in large hospitals could bypass the existing basic research and known antimicrobial resistance genes and accurately predict the bacterial phenotype.
Collapse
Affiliation(s)
- Shuyi Wang
- Institute of Medical Technology, Peking University Health Science Center, Beijing, China.,Department of Clinical Laboratory, Peking University People's Hospital, Beijing, China
| | - Chunjiang Zhao
- Department of Clinical Laboratory, Peking University People's Hospital, Beijing, China
| | - Yuyao Yin
- Department of Clinical Laboratory, Peking University People's Hospital, Beijing, China
| | - Fengning Chen
- Institute of Medical Technology, Peking University Health Science Center, Beijing, China.,Department of Clinical Laboratory, Peking University People's Hospital, Beijing, China
| | - Hongbin Chen
- Department of Clinical Laboratory, Peking University People's Hospital, Beijing, China
| | - Hui Wang
- Institute of Medical Technology, Peking University Health Science Center, Beijing, China.,Department of Clinical Laboratory, Peking University People's Hospital, Beijing, China
| |
Collapse
|
20
|
Liu W, Ying N, Mo Q, Li S, Shao M, Sun L, Zhu L. Machine learning for identifying resistance features of Klebsiella pneumoniae using whole-genome sequence single nucleotide polymorphisms. J Med Microbiol 2021; 70. [PMID: 34812714 DOI: 10.1099/jmm.0.001474] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Introduction. Klebsiella pneumoniae, a gram-negative bacterium, is a common pathogen causing nosocomial infection. The drug-resistance rate of K. pneumoniae is increasing year by year, posing a severe threat to public health worldwide. K. pneumoniae has been listed as one of the pathogens causing the global crisis of antimicrobial resistance in nosocomial infections. We need to explore the drug resistance of K. pneumoniae for clinical diagnosis. Single nucleotide polymorphisms (SNPs) are of high density and have rich genetic information in whole-genome sequencing (WGS), which can affect the structure or expression of proteins. SNPs can be used to explore mutation sites associated with bacterial resistance.Hypothesis/Gap Statement. Machine learning methods can detect genetic features associated with the drug resistance of K. pneumoniae from whole-genome SNP data.Aims. This work used Fast Feature Selection (FFS) and Codon Mutation Detection (CMD) machine learning methods to detect genetic features related to drug resistance of K. pneumoniae from whole-genome SNP data.Methods. WGS data on resistance of K. pneumoniae strains to four antibiotics (tetracycline, gentamicin, imipenem, amikacin) were downloaded from the European Nucleotide Archive (ENA). Sequence alignments were performed with MUMmer 3 to complete SNP calling using K. pneumoniae HS11286 chromosome as the reference genome. The FFS algorithm was applied to feature selection of the SNP dataset. The training set was constructed based on mutation sites with mutation frequency >0.995. Based on the original SNP training set, 70% of SNPs were randomly selected from each dataset as the test set to verify the accuracy of the training results. Finally, the resistance genes were obtained by the CMD algorithm and Venny.Results. The number of strains resistant to tetracycline, gentamicin, imipenem and amikacin was 931, 1048, 789 and 203, respectively. Machine learning algorithms were applied to the SNP training set and test set, and 28 and 23 resistance genes were predicted, respectively. The 28 resistance genes in the training set included 22 genes in the test set, which verified the accuracy of gene prediction. Among them, some genes (KPHS_35310, KPHS_18220, KPHS_35880, etc.) corresponded to known resistance genes (Eef2, lpxK, MdtC, etc). Logistic regression classifiers were established based on the identified SNPs in the training set. The area under the curves (AUCs) of the four antibiotics was 0.939, 0.950, 0.912 and 0.935, showing a strong ability to predict bacterial resistance.Conclusion. Machine learning methods can effectively be used to predict resistance genes and associated SNPs. The FFS and CMD algorithms have wide applicability. They can be used for the drug-resistance analysis of any microorganism with genomic variation and phenotypic data. This work lays a foundation for resistance research in clinical applications.
Collapse
Affiliation(s)
- Wenjia Liu
- College of Automation, Hangzhou Dianzi University, Hangzhou, Zhejiang, 310018, PR China
| | - Nanjiao Ying
- College of Automation, Hangzhou Dianzi University, Hangzhou, Zhejiang, 310018, PR China.,Institute of Biomedical Engineering, Hangzhou Dianzi University, Hangzhou, Zhejiang, 310018, PR China
| | - Qiusi Mo
- College of Automation, Hangzhou Dianzi University, Hangzhou, Zhejiang, 310018, PR China
| | - Shanshan Li
- College of Automation, Hangzhou Dianzi University, Hangzhou, Zhejiang, 310018, PR China
| | - Mengjie Shao
- College of Automation, Hangzhou Dianzi University, Hangzhou, Zhejiang, 310018, PR China
| | - Lingli Sun
- Key Laboratory of Microorganism Technology and Bioinformatics Research of Zhejiang Province, Hangzhou, Zhejiang, 310012, PR China.,NMPA Key Laboratory for Testing and Risk Warning of Pharmaceutical Microbiology, Hangzhou, Zhejiang, 310012, PR China
| | - Lei Zhu
- College of Automation, Hangzhou Dianzi University, Hangzhou, Zhejiang, 310018, PR China.,Institute of Biomedical Engineering, Hangzhou Dianzi University, Hangzhou, Zhejiang, 310018, PR China
| |
Collapse
|
21
|
Shi H, Yang D, Tang K, Hu C, Li L, Zhang L, Gong T, Cui Y. Explainable machine learning model for predicting the occurrence of postoperative malnutrition in children with congenital heart disease. Clin Nutr 2021; 41:202-210. [PMID: 34906845 DOI: 10.1016/j.clnu.2021.11.006] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2021] [Revised: 10/25/2021] [Accepted: 11/05/2021] [Indexed: 01/25/2023]
Abstract
BACKGROUND & AIMS Malnutrition is persistent in 50%-75% of children with congenital heart disease (CHD) after surgery, and early prediction is crucial for nutritional intervention. The aim of this study was to develop and validate machine learning (ML) models to predict the malnutrition status of children with CHD. We used explainable ML methods to provide insight into the model's predictions and outcomes. METHODS This prospective cohort study included consecutive children with CHD admitted to the hospital from December 2017 to May 2020. The cohort data were divided into the training and test data sets based on the follow-up time. The outcome of the study was CHD child malnutrition 1 year after surgery, the primary outcome was an underweight status, and the secondary outcomes were stunted and wasting status. We used five ML algorithms with multiple features to construct prediction models, and the performance of these ML models was measured by an area under the receiver operating characteristic curve (AUC) analysis. We also used the permutation importance and SHapley Additive exPlanations (SHAP) to determine the importance of the selected features and interpret the ML models. RESULTS We enrolled 536 children with CHD who underwent complete repair. The proportions of children with an underweight, stunted, or wasting status 1 year after surgery were 18.1% (97/536), 12.1% (65/536), and 17.5% (94/536), respectively. All patients contributed to the generation of 115 useable features, which allowed us to build models to predict malnutrition. Five prediction algorithms were used, and the XGBoost model achieved the greatest AUC in all outcomes. The results obtained from the permutation importance and SHAP analyses showed that the 1-month postoperative WAZ-score, discharge WAZ score and preoperative WAZ score were the top 3 important features in predicting an underweight status in the XGBoost algorithm. Regarding the stunted status, the top 3 important features were the 1-month postoperative HAZ score, discharge HAZ score, and aortic clamping time. Regarding the wasting status, the top 3 important features were the hospital length of stay, formula intake, and discharge WHZ-score. We also used a narrative case report as an example to describe the clinical manifestations and predicted the primary outcomes of two children. CONCLUSIONS We developed an ML model (XGBoost) that provides accurate early predictions of malnutrition 1-year postoperatively in children with CHD. Because the ML model is explainable, it may better enable clinicians to better understand the reasoning underlying the outcome. Our study could aid in determining individual treatment and nutritional follow-up strategies for children with CHD.
Collapse
Affiliation(s)
- Hui Shi
- Guangzhou Women and Children's Medical Center, Institute of Pediatrics, Guangzhou Medical University, No.9 Jinsui Road, Zhujiang Newtown, Tianhe District, Guangzhou, 510623, China; Department of Biostatistics and Epidemiology, School of Public Health, Sun Yat-sen University, Guangzhou, China
| | - Dong Yang
- Guangzhou AID Cloud Technology, No. 68 Huacheng Avenue, Tianhe District, Guangzhou, China
| | - Kaichen Tang
- Guangzhou AID Cloud Technology, No. 68 Huacheng Avenue, Tianhe District, Guangzhou, China
| | - Chunmei Hu
- Cardiac Intensive Care Unit, The Heart Center, Guangzhou Women and Children Medical Center, Guangzhou Medical University, No.9 Jinsui Road, Zhujiang Newtown, Tianhe District, Guangzhou 510623, China
| | - Lijuan Li
- Cardiac Intensive Care Unit, The Heart Center, Guangzhou Women and Children Medical Center, Guangzhou Medical University, No.9 Jinsui Road, Zhujiang Newtown, Tianhe District, Guangzhou 510623, China
| | - Linfang Zhang
- Cardiac Intensive Care Unit, The Heart Center, Guangzhou Women and Children Medical Center, Guangzhou Medical University, No.9 Jinsui Road, Zhujiang Newtown, Tianhe District, Guangzhou 510623, China
| | - Ting Gong
- Cardiac Intensive Care Unit, The Heart Center, Guangzhou Women and Children Medical Center, Guangzhou Medical University, No.9 Jinsui Road, Zhujiang Newtown, Tianhe District, Guangzhou 510623, China
| | - Yanqin Cui
- Cardiac Intensive Care Unit, The Heart Center, Guangzhou Women and Children Medical Center, Guangzhou Medical University, No.9 Jinsui Road, Zhujiang Newtown, Tianhe District, Guangzhou 510623, China; Department of Pediatric Surgery, Guangdong Provincial Key Laboratory of Research in Structural Birth Defect Disease, Guangzhou Women and Children's Medical Center, Guangzhou Medical University, Guangzhou, 510623, Guangdong, China.
| |
Collapse
|
22
|
David L, Brata AM, Mogosan C, Pop C, Czako Z, Muresan L, Ismaiel A, Dumitrascu DI, Leucuta DC, Stanculete MF, Iaru I, Popa SL. Artificial Intelligence and Antibiotic Discovery. Antibiotics (Basel) 2021; 10:antibiotics10111376. [PMID: 34827314 PMCID: PMC8614913 DOI: 10.3390/antibiotics10111376] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2021] [Revised: 11/01/2021] [Accepted: 11/08/2021] [Indexed: 01/13/2023] Open
Abstract
Over recent decades, a new antibiotic crisis has been unfolding due to a decreased research in this domain, a low return of investment for the companies that developed the drug, a lengthy and difficult research process, a low success rate for candidate molecules, an increased use of antibiotics in farms and an overall inappropriate use of antibiotics. This has led to a series of pathogens developing antibiotic resistance, which poses severe threats to public health systems while also driving up the costs of hospitalization and treatment. Moreover, without proper action and collaboration between academic and health institutions, a catastrophic trend might develop, with the possibility of returning to a pre-antibiotic era. Nevertheless, new emerging AI-based technologies have started to enter the field of antibiotic and drug development, offering a new perspective to an ever-growing problem. Cheaper and faster research can be achieved through algorithms that identify hit compounds, thereby further accelerating the development of new antibiotics, which represents a vital step in solving the current antibiotic crisis. The aim of this review is to provide an extended overview of the current artificial intelligence-based technologies that are used for antibiotic discovery, together with their technological and economic impact on the industrial sector.
Collapse
Affiliation(s)
- Liliana David
- 2nd Medical Department, “Iuliu Hatieganu” University of Medicine and Pharmacy, 400000 Cluj-Napoca, Romania; (L.D.); (A.I.); (S.L.P.)
| | - Anca Monica Brata
- Faculty of Environmental Protection, University of Oradea, 410048 Oradea, Romania
- Correspondence:
| | - Cristina Mogosan
- Department of Pharmacology, Physiology and Pathophysiology, Faculty of Pharmacy, “Iuliu Hațieganu” University of Medicine and Pharmacy, 400000 Cluj-Napoca, Romania; (C.M.); (C.P.); (I.I.)
| | - Cristina Pop
- Department of Pharmacology, Physiology and Pathophysiology, Faculty of Pharmacy, “Iuliu Hațieganu” University of Medicine and Pharmacy, 400000 Cluj-Napoca, Romania; (C.M.); (C.P.); (I.I.)
| | - Zoltan Czako
- Department of Computer Science, Technical University of Cluj-Napoca, 400027 Cluj-Napoca, Romania;
| | - Lucian Muresan
- Department of Cardiology, “Emile Muller” Hospital, 68200 Mulhouse, France;
| | - Abdulrahman Ismaiel
- 2nd Medical Department, “Iuliu Hatieganu” University of Medicine and Pharmacy, 400000 Cluj-Napoca, Romania; (L.D.); (A.I.); (S.L.P.)
| | - Dinu Iuliu Dumitrascu
- Department of Anatomy, “Iuliu Hatieganu” University of Medicine and Pharmacy, 400000 Cluj-Napoca, Romania;
| | - Daniel Corneliu Leucuta
- Department of Medical Informatics and Biostatistics, “Iuliu Hatieganu” University of Medicine and Pharmacy, 400349 Cluj-Napoca, Romania;
| | - Mihaela Fadygas Stanculete
- Department of Neurosciences, Discipline of Psychiatry and Pediatric Psychiatry, “Iuliu Hatieganu“ University of Medicine and Pharmacy, 400000 Cluj-Napoca, Romania;
| | - Irina Iaru
- Department of Pharmacology, Physiology and Pathophysiology, Faculty of Pharmacy, “Iuliu Hațieganu” University of Medicine and Pharmacy, 400000 Cluj-Napoca, Romania; (C.M.); (C.P.); (I.I.)
| | - Stefan Lucian Popa
- 2nd Medical Department, “Iuliu Hatieganu” University of Medicine and Pharmacy, 400000 Cluj-Napoca, Romania; (L.D.); (A.I.); (S.L.P.)
| |
Collapse
|
23
|
He S, Leanse LG, Feng Y. Artificial intelligence and machine learning assisted drug delivery for effective treatment of infectious diseases. Adv Drug Deliv Rev 2021; 178:113922. [PMID: 34461198 DOI: 10.1016/j.addr.2021.113922] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2021] [Revised: 07/14/2021] [Accepted: 08/09/2021] [Indexed: 12/23/2022]
Abstract
In the era of antimicrobial resistance, the prevalence of multidrug-resistant microorganisms that resist conventional antibiotic treatment has steadily increased. Thus, it is now unquestionable that infectious diseases are significant global burdens that urgently require innovative treatment strategies. Emerging studies have demonstrated that artificial intelligence (AI) can transform drug delivery to promote effective treatment of infectious diseases. In this review, we propose to evaluate the significance, essential principles, and popular tools of AI in drug delivery for infectious disease treatment. Specifically, we will focus on the achievements and key findings of current research, as well as the applications of AI on drug delivery throughout the whole antimicrobial treatment process, with an emphasis on drug development, treatment regimen optimization, drug delivery system and administration route design, and drug delivery outcome prediction. To that end, the challenges of AI in drug delivery for infectious disease treatments and their current solutions and future perspective will be presented and discussed.
Collapse
Affiliation(s)
- Sheng He
- Boston Children's Hospital, Harvard Medical School, Harvard University, Boston, MA, USA.
| | - Leon G Leanse
- Massachusetts General Hospital, Harvard Medical School, Harvard University, Boston, MA, USA
| | - Yanfang Feng
- Massachusetts General Hospital, Harvard Medical School, Harvard University, Boston, MA, USA.
| |
Collapse
|
24
|
Avershina E, Shapovalova V, Shipulin G. Fighting Antibiotic Resistance in Hospital-Acquired Infections: Current State and Emerging Technologies in Disease Prevention, Diagnostics and Therapy. Front Microbiol 2021; 12:707330. [PMID: 34367112 PMCID: PMC8334188 DOI: 10.3389/fmicb.2021.707330] [Citation(s) in RCA: 45] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2021] [Accepted: 06/29/2021] [Indexed: 12/20/2022] Open
Abstract
Rising antibiotic resistance is a global threat that is projected to cause more deaths than all cancers combined by 2050. In this review, we set to summarize the current state of antibiotic resistance, and to give an overview of the emerging technologies aimed to escape the pre-antibiotic era recurrence. We conducted a comprehensive literature survey of >150 original research and review articles indexed in the Web of Science using "antimicrobial resistance," "diagnostics," "therapeutics," "disinfection," "nosocomial infections," "ESKAPE pathogens" as key words. We discuss the impact of nosocomial infections on the spread of multi-drug resistant bacteria, give an overview over existing and developing strategies for faster diagnostics of infectious diseases, review current and novel approaches in therapy of infectious diseases, and finally discuss strategies for hospital disinfection to prevent MDR bacteria spread.
Collapse
Affiliation(s)
- Ekaterina Avershina
- Department of Biotechnology, Inland Norway University of Applied Sciences, Hamar, Norway
- Laboratory or Postgenomic Technologies, Izmerov Research Institute of Occupational Health, Moscow, Russia
| | - Valeria Shapovalova
- Federal State Budgetary Institution “Centre for Strategic Planning and Management of Biomedical Health Risks” of the Federal Medical Biological Agency, Centre for Strategic Planning of FMBA of Russia, Moscow, Russia
| | - German Shipulin
- Federal State Budgetary Institution “Centre for Strategic Planning and Management of Biomedical Health Risks” of the Federal Medical Biological Agency, Centre for Strategic Planning of FMBA of Russia, Moscow, Russia
| |
Collapse
|
25
|
Applications of Machine Learning to the Problem of Antimicrobial Resistance: an Emerging Model for Translational Research. J Clin Microbiol 2021; 59:e0126020. [PMID: 33536291 DOI: 10.1128/jcm.01260-20] [Citation(s) in RCA: 55] [Impact Index Per Article: 18.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
Antimicrobial resistance (AMR) remains one of the most challenging phenomena of modern medicine. Machine learning (ML) is a subfield of artificial intelligence that focuses on the development of algorithms that learn how to accurately predict outcome variables using large sets of predictor variables that are typically not hand selected and are minimally curated. Models are parameterized using a training data set and then applied to a test data set on which predictive performance is evaluated. The application of ML algorithms to the problem of AMR has garnered increasing interest in the past 5 years due to the exponential growth of experimental and clinical data, heavy investment in computational capacity, improvements in algorithm performance, and increasing urgency for innovative approaches to reducing the burden of disease. Here, we review the current state of research at the intersection of ML and AMR with an emphasis on three domains of work. The first is the prediction of AMR using genomic data. The second is the use of ML to gain insight into the cellular functions disrupted by antibiotics, which forms the basis for understanding mechanisms of action and developing novel anti-infectives. The third focuses on the application of ML for antimicrobial stewardship using data extracted from the electronic health record. Although the use of ML for understanding, diagnosing, treating, and preventing AMR is still in its infancy, the continued growth of data and interest ensures it will become an important tool for future translational research programs.
Collapse
|
26
|
Borelli TC, Lovate GL, Scaranello AFT, Ribeiro LF, Zaramela L, Pereira-dos-Santos FM, Silva-Rocha R, Guazzaroni ME. Combining Functional Genomics and Whole-Genome Sequencing to Detect Antibiotic Resistance Genes in Bacterial Strains Co-Occurring Simultaneously in a Brazilian Hospital. Antibiotics (Basel) 2021; 10:antibiotics10040419. [PMID: 33920372 PMCID: PMC8070361 DOI: 10.3390/antibiotics10040419] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2021] [Revised: 04/01/2021] [Accepted: 04/06/2021] [Indexed: 01/01/2023] Open
Abstract
(1) Background: The rise of multi-antibiotic resistant bacteria represents an emergent threat to human health. Here, we investigate antibiotic resistance mechanisms in bacteria of several species isolated from an intensive care unit in Brazil. (2) Methods: We used whole-genome analysis to identify antibiotic resistance genes (ARGs) and plasmids in 34 strains of Gram-negative and Gram-positive bacteria, providing the first genomic description of Morganella morganii and Ralstonia mannitolilytica clinical isolates from South America. (3) Results: We identified a high abundance of beta-lactamase genes in resistant organisms, including seven extended-spectrum beta-lactamases (OXA-1, OXA-10, CTX-M-1, KPC, TEM, HYDRO, BLP) shared between organisms from different species. Additionally, we identified several ARG-carrying plasmids indicating the potential for a fast transmission of resistance mechanism between bacterial strains. Furthermore, we uncovered two pairs of (near) identical plasmids exhibiting multi-drug resistance. Finally, since many highly resistant strains carry several different ARGs, we used functional genomics to investigate which of them were indeed functional. In this sense, for three bacterial strains (Escherichia coli, Klebsiella pneumoniae, and M. morganii), we identified six beta-lactamase genes out of 15 predicted in silico as those mainly responsible for the resistance mechanisms observed, corroborating the existence of redundant resistance mechanisms in these organisms. (4) Conclusions: Systematic studies similar to the one presented here should help to prevent outbreaks of novel multidrug-resistant bacteria in healthcare facilities.
Collapse
Affiliation(s)
- Tiago Cabral Borelli
- Department of Biology, Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto, Universidade de São Paulo, Av. Bandeirantes 3900, Ribeirão Preto, SP 14049-901, Brazil; (T.C.B.); (G.L.L.); (A.F.T.S.); (L.F.R.)
| | - Gabriel Lencioni Lovate
- Department of Biology, Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto, Universidade de São Paulo, Av. Bandeirantes 3900, Ribeirão Preto, SP 14049-901, Brazil; (T.C.B.); (G.L.L.); (A.F.T.S.); (L.F.R.)
| | - Ana Flavia Tonelli Scaranello
- Department of Biology, Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto, Universidade de São Paulo, Av. Bandeirantes 3900, Ribeirão Preto, SP 14049-901, Brazil; (T.C.B.); (G.L.L.); (A.F.T.S.); (L.F.R.)
| | - Lucas Ferreira Ribeiro
- Department of Biology, Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto, Universidade de São Paulo, Av. Bandeirantes 3900, Ribeirão Preto, SP 14049-901, Brazil; (T.C.B.); (G.L.L.); (A.F.T.S.); (L.F.R.)
| | - Livia Zaramela
- Department of Pediatrics, University of California San Diego, San Diego, CA 92161, USA;
| | - Felipe Marcelo Pereira-dos-Santos
- Department of Cell and Molecular Biology, Faculdade de Medicina de Ribeirão Preto, University of São Paulo, Av. Bandeirantes 3900, Ribeirão Preto, SP 14049-900, Brazil; (F.M.P.-d.-S.); (R.S.-R.)
| | - Rafael Silva-Rocha
- Department of Cell and Molecular Biology, Faculdade de Medicina de Ribeirão Preto, University of São Paulo, Av. Bandeirantes 3900, Ribeirão Preto, SP 14049-900, Brazil; (F.M.P.-d.-S.); (R.S.-R.)
| | - María-Eugenia Guazzaroni
- Department of Biology, Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto, Universidade de São Paulo, Av. Bandeirantes 3900, Ribeirão Preto, SP 14049-901, Brazil; (T.C.B.); (G.L.L.); (A.F.T.S.); (L.F.R.)
- Correspondence:
| |
Collapse
|
27
|
Chen C, Yang D, Gao S, Zhang Y, Chen L, Wang B, Mo Z, Yang Y, Hei Z, Zhou S. Development and performance assessment of novel machine learning models to predict pneumonia after liver transplantation. Respir Res 2021; 22:94. [PMID: 33789673 PMCID: PMC8011203 DOI: 10.1186/s12931-021-01690-3] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2020] [Accepted: 03/22/2021] [Indexed: 02/06/2023] Open
Abstract
Background Pneumonia is the most frequently encountered postoperative pulmonary complications (PPC) after orthotopic liver transplantation (OLT), which cause high morbidity and mortality rates. We aimed to develop a model to predict postoperative pneumonia in OLT patients using machine learning (ML) methods. Methods Data of 786 adult patients underwent OLT at the Third Affiliated Hospital of Sun Yat-sen University from January 2015 to September 2019 was retrospectively extracted from electronic medical records and randomly subdivided into a training set and a testing set. With the training set, six ML models including logistic regression (LR), support vector machine (SVM), random forest (RF), adaptive boosting (AdaBoost), extreme gradient boosting (XGBoost) and gradient boosting machine (GBM) were developed. These models were assessed by the area under curve (AUC) of receiver operating characteristic on the testing set. The related risk factors and outcomes of pneumonia were also probed based on the chosen model. Results 591 OLT patients were eventually included and 253 (42.81%) were diagnosed with postoperative pneumonia, which was associated with increased postoperative hospitalization and mortality (P < 0.05). Among the six ML models, XGBoost model performed best. The AUC of XGBoost model on the testing set was 0.734 (sensitivity: 52.6%; specificity: 77.5%). Pneumonia was notably associated with 14 items features: INR, HCT, PLT, ALB, ALT, FIB, WBC, PT, serum Na+, TBIL, anesthesia time, preoperative length of stay, total fluid transfusion and operation time. Conclusion Our study firstly demonstrated that the XGBoost model with 14 common variables might predict postoperative pneumonia in OLT patients. Supplementary Information The online version contains supplementary material available at 10.1186/s12931-021-01690-3.
Collapse
Affiliation(s)
- Chaojin Chen
- Department of Anesthesiology, The Third Affiliated Hospital of Sun Yat-Sen University, No. 600 Tianhe Road, Guangzhou, 10630, Guangdong, People's Republic of China
| | - Dong Yang
- Guangzhou AID Cloud Technology Co., LTD, Guangzhou, People's Republic of China
| | - Shilong Gao
- Department of Information, The Third Affiliated Hospital of Sun Yat-Sen University, Guangzhou, People's Republic of China
| | - Yihan Zhang
- Department of Anesthesiology, The Third Affiliated Hospital of Sun Yat-Sen University, No. 600 Tianhe Road, Guangzhou, 10630, Guangdong, People's Republic of China
| | - Liubing Chen
- Department of Anesthesiology, The Third Affiliated Hospital of Sun Yat-Sen University, No. 600 Tianhe Road, Guangzhou, 10630, Guangdong, People's Republic of China
| | - Bohan Wang
- Guangzhou AID Cloud Technology Co., LTD, Guangzhou, People's Republic of China
| | - Zihan Mo
- Guangzhou AID Cloud Technology Co., LTD, Guangzhou, People's Republic of China
| | - Yang Yang
- Department of Hepatic Surgery and Liver Transplantation Center, The Third Affiliated Hospital of Sun Yat-Sen University, No. 600 Tianhe Road, Guangzhou, 510630, Guangdong, People's Republic of China.
| | - Ziqing Hei
- Department of Anesthesiology, The Third Affiliated Hospital of Sun Yat-Sen University, No. 600 Tianhe Road, Guangzhou, 10630, Guangdong, People's Republic of China.
| | - Shaoli Zhou
- Department of Anesthesiology, The Third Affiliated Hospital of Sun Yat-Sen University, No. 600 Tianhe Road, Guangzhou, 10630, Guangdong, People's Republic of China.
| |
Collapse
|
28
|
Avershina E, Sharma P, Taxt AM, Singh H, Frye SA, Paul K, Kapil A, Naseer U, Kaur P, Ahmad R. AMR-Diag: Neural network based genotype-to-phenotype prediction of resistance towards β-lactams in Escherichia coli and Klebsiella pneumoniae. Comput Struct Biotechnol J 2021; 19:1896-1906. [PMID: 33897984 PMCID: PMC8060595 DOI: 10.1016/j.csbj.2021.03.027] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2021] [Revised: 03/15/2021] [Accepted: 03/23/2021] [Indexed: 12/11/2022] Open
Abstract
Antibiotic resistance poses a major threat to public health. More effective ways of the antibiotic prescription are needed to delay the spread of antibiotic resistance. Employment of sequencing technologies coupled with the use of trained neural network algorithms for genotype-to-phenotype prediction will reduce the time needed for antibiotic susceptibility profile identification from days to hours. In this work, we have sequenced and phenotypically characterized 171 clinical isolates of Escherichia coli and Klebsiella pneumoniae from Norway and India. Based on the data, we have created neural networks to predict susceptibility for ampicillin, 3rd generation cephalosporins and carbapenems. All networks were trained on unassembled data, enabling prediction within minutes after the sequencing information becomes available. Moreover, they can be used both on Illumina and MinION generated data and do not require high genome coverage for phenotype prediction. We cross-checked our networks with previously published algorithms for genotype-to-phenotype prediction and their corresponding datasets. Besides, we also created an ensemble of networks trained on different datasets, which improved the cross-dataset prediction compared to a single network. Additionally, we have used data from direct sequencing of spiked blood cultures and found that AMR-Diag networks, coupled with MinION sequencing, can predict bacterial species, resistome, and phenotype as fast as 1–8 h from the sequencing start. To our knowledge, this is the first study for genotype-to-phenotype prediction: (1) employing a neural network method; (2) using data from more than one sequencing platform; and (3) utilizing sequence data from spiked blood cultures.
Collapse
Affiliation(s)
- Ekaterina Avershina
- Department of Biotechnology, Inland Norway University of Applied Sciences, Holsetgata 22, 2317 Hamar, Norway
| | - Priyanka Sharma
- Department of Biophysics, All India Institute of Medical Sciences, New Delhi, India
| | - Arne M Taxt
- Department of Biotechnology, Inland Norway University of Applied Sciences, Holsetgata 22, 2317 Hamar, Norway.,Department of Microbiology, Division of Laboratory Medicine, Oslo University Hospital, PB 4956, Nydalen, 0424 Oslo, Norway
| | - Harpreet Singh
- Informatics, System and Research Management, Indian Council of Medical Research, New Delhi, India
| | - Stephan A Frye
- Department of Microbiology, Division of Laboratory Medicine, Oslo University Hospital, PB 4956, Nydalen, 0424 Oslo, Norway
| | - Kolin Paul
- Department of Computer Science & Engineering, IIT Delhi, New Delhi, India
| | - Arti Kapil
- Department of Microbiology, All India Institute of Medical Sciences, New Delhi, India
| | - Umaer Naseer
- Department of Zoonotic, Food- and Waterborne Infections, 0213 Oslo, Norwegian Institute of Public Health, Oslo, Norway
| | - Punit Kaur
- Department of Biophysics, All India Institute of Medical Sciences, New Delhi, India
| | - Rafi Ahmad
- Department of Biotechnology, Inland Norway University of Applied Sciences, Holsetgata 22, 2317 Hamar, Norway.,Institute of Clinical Medicine, Faculty of Health Sciences, UiT - The Arctic University of Norway, Hansine Hansens veg 18, 9019 Tromsø, Norway
| |
Collapse
|
29
|
Feng Y, Zou S, Chen H, Yu Y, Ruan Z. BacWGSTdb 2.0: a one-stop repository for bacterial whole-genome sequence typing and source tracking. Nucleic Acids Res 2021; 49:D644-D650. [PMID: 33010178 PMCID: PMC7778894 DOI: 10.1093/nar/gkaa821] [Citation(s) in RCA: 125] [Impact Index Per Article: 41.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2020] [Revised: 09/15/2020] [Accepted: 09/17/2020] [Indexed: 11/13/2022] Open
Abstract
An increasing prevalence of hospital acquired infections and foodborne illnesses caused by pathogenic and multidrug-resistant bacteria has stimulated a pressing need for benchtop computational techniques to rapidly and accurately classify bacteria from genomic sequence data, and based on that, to trace the source of infection. BacWGSTdb (http://bacdb.org/BacWGSTdb) is a free publicly accessible database we have developed for bacterial whole-genome sequence typing and source tracking. This database incorporates extensive resources for bacterial genome sequencing data and the corresponding metadata, combined with specialized bioinformatics tools that enable the systematic characterization of the bacterial isolates recovered from infections. Here, we present BacWGSTdb 2.0, which encompasses several major updates, including (i) the integration of the core genome multi-locus sequence typing (cgMLST) approach, which is highly scalable and appropriate for typing isolates belonging to different lineages; (ii) the addition of a multiple genome analysis module that can process dozens of user uploaded sequences in a batch mode; (iii) a new source tracking module for comparing user uploaded plasmid sequences to those deposited in the public databases; (iv) the number of species encompassed in BacWGSTdb 2.0 has increased from 9 to 20, which represents bacterial pathogens of medical importance; (v) a newly designed, user-friendly interface and a set of visualization tools for providing a convenient platform for users are also included. Overall, the updated BacWGSTdb 2.0 bears great utility in continuing to provide users, including epidemiologists, clinicians and bench scientists, with a one-stop solution to bacterial genome sequence analysis.
Collapse
Affiliation(s)
- Ye Feng
- Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, Hangzhou 310016, China.,Institute of Translational Medicine, Zhejiang University School of Medicine, Hangzhou 310029, China
| | - Shengmei Zou
- Institute of Translational Medicine, Zhejiang University School of Medicine, Hangzhou 310029, China
| | - Hangfei Chen
- Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, Hangzhou 310016, China
| | - Yunsong Yu
- Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, Hangzhou 310016, China
| | - Zhi Ruan
- Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, Hangzhou 310016, China
| |
Collapse
|
30
|
Jaillard M, Palmieri M, van Belkum A, Mahé P. Interpreting k-mer-based signatures for antibiotic resistance prediction. Gigascience 2020; 9:giaa110. [PMID: 33068113 PMCID: PMC7568433 DOI: 10.1093/gigascience/giaa110] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2020] [Revised: 07/23/2020] [Accepted: 09/16/2020] [Indexed: 11/22/2022] Open
Abstract
BACKGROUND Recent years have witnessed the development of several k-mer-based approaches aiming to predict phenotypic traits of bacteria on the basis of their whole-genome sequences. While often convincing in terms of predictive performance, the underlying models are in general not straightforward to interpret, the interplay between the actual genetic determinant and its translation as k-mers being generally hard to decipher. RESULTS We propose a simple and computationally efficient strategy allowing one to cope with the high correlation inherent to k-mer-based representations in supervised machine learning models, leading to concise and easily interpretable signatures. We demonstrate the benefit of this approach on the task of predicting the antibiotic resistance profile of a Klebsiella pneumoniae strain from its genome, where our method leads to signatures defined as weighted linear combinations of genetic elements that can easily be identified as genuine antibiotic resistance determinants, with state-of-the-art predictive performance. CONCLUSIONS By enhancing the interpretability of genomic k-mer-based antibiotic resistance prediction models, our approach improves their clinical utility and hence will facilitate their adoption in routine diagnostics by clinicians and microbiologists. While antibiotic resistance was the motivating application, the method is generic and can be transposed to any other bacterial trait. An R package implementing our method is available at https://gitlab.com/biomerieux-data-science/clustlasso.
Collapse
Affiliation(s)
| | | | | | - Pierre Mahé
- bioMérieux, Chemin de l'Orme, 69280 Marcy l'Etoile, France
| |
Collapse
|