1
|
Jin C, Jia C, Hu W, Xu H, Shen Y, Yue M. Predicting antimicrobial resistance in E. coli with discriminative position fused deep learning classifier. Comput Struct Biotechnol J 2024; 23:559-565. [PMID: 38274998 PMCID: PMC10809114 DOI: 10.1016/j.csbj.2023.12.041] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2023] [Revised: 12/26/2023] [Accepted: 12/26/2023] [Indexed: 01/27/2024] Open
Abstract
Escherichia coli (E. coli) has become a particular concern due to the increasing incidence of antimicrobial resistance (AMR) observed worldwide. Using machine learning (ML) to predict E. coli AMR is a more efficient method than traditional laboratory testing. However, further improvement in the predictive performance of existing models remains challenging. In this study, we collected 1937 high-quality whole genome sequencing (WGS) data from public databases with an antimicrobial resistance phenotype and modified the existing workflow by adding an attention mechanism to enable the modified workflow to focus more on core single nucleotide polymorphisms (SNPs) that may significantly lead to the development of AMR in E. coli. While comparing the model performance before and after adding the attention mechanism, we also performed a cross-comparison among the published models using random forest (RF), support vector machine (SVM), logistic regression (LR), and convolutional neural network (CNN). Our study demonstrates that the discriminative positional colors of Chaos Game Representation (CGR) images can selectively influence and highlight genome regions without prior knowledge, enhancing prediction accuracy. Furthermore, we developed an online tool (https://github.com/tjiaa/E.coli-ML/tree/main) for assisting clinicians in the rapid prediction of the AMR phenotype of E. coli and accelerating clinical decision-making.
Collapse
Affiliation(s)
- Canghong Jin
- School of Computer and Computing Science, Hangzhou City University, Hangzhou 310015, China
| | - Chenghao Jia
- Institute of Preventive Veterinary Sciences and Department of Veterinary Medicine, Zhejiang University College of Animal Sciences, Hangzhou 310058, China
| | - Wenkang Hu
- School of Computer and Computing Science, Hangzhou City University, Hangzhou 310015, China
- College of Computer Science and Technology, Zhejiang University, Hangzhou 310027, China
| | - Haidong Xu
- School of Computer and Computing Science, Hangzhou City University, Hangzhou 310015, China
| | - Yanyi Shen
- School of Computer and Computing Science, Hangzhou City University, Hangzhou 310015, China
| | - Min Yue
- Institute of Preventive Veterinary Sciences and Department of Veterinary Medicine, Zhejiang University College of Animal Sciences, Hangzhou 310058, China
- Hainan Institute of Zhejiang University, Sanya 572000, China
- Zhejiang Provincial Key Laboratory of Preventive Veterinary Medicine, Hangzhou 310058, China
- State Key Laboratory for Diagnosis and Treatment of Infectious Diseases, National Clinical Research Center for Infectious Diseases, National Medical Center for Infectious Diseases, The First Affiliated Hospital, College of Medicine, Zhejiang University, Hangzhou 310003, China
| |
Collapse
|
2
|
Farhat M, Cox H, Ghanem M, Denkinger CM, Rodrigues C, Abd El Aziz MS, Enkh-Amgalan H, Vambe D, Ugarte-Gil C, Furin J, Pai M. Drug-resistant tuberculosis: a persistent global health concern. Nat Rev Microbiol 2024; 22:617-635. [PMID: 38519618 DOI: 10.1038/s41579-024-01025-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/12/2024] [Indexed: 03/25/2024]
Abstract
Drug-resistant tuberculosis (TB) is estimated to cause 13% of all antimicrobial resistance-attributable deaths worldwide and is driven by both ongoing resistance acquisition and person-to-person transmission. Poor outcomes are exacerbated by late diagnosis and inadequate access to effective treatment. Advances in rapid molecular testing have recently improved the diagnosis of TB and drug resistance. Next-generation sequencing of Mycobacterium tuberculosis has increased our understanding of genetic resistance mechanisms and can now detect mutations associated with resistance phenotypes. All-oral, shorter drug regimens that can achieve high cure rates of drug-resistant TB within 6-9 months are now available and recommended but have yet to be scaled to global clinical use. Promising regimens for the prevention of drug-resistant TB among high-risk contacts are supported by early clinical trial data but final results are pending. A person-centred approach is crucial in managing drug-resistant TB to reduce the risk of poor treatment outcomes, side effects, stigma and mental health burden associated with the diagnosis. In this Review, we describe current surveillance of drug-resistant TB and the causes, risk factors and determinants of drug resistance as well as the stigma and mental health considerations associated with it. We discuss recent advances in diagnostics and drug-susceptibility testing and outline the progress in developing better treatment and preventive therapies.
Collapse
Affiliation(s)
- Maha Farhat
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
- Division of Pulmonary and Critical Care Medicine, Massachusetts General Hospital, Boston, MA, USA
| | - Helen Cox
- Institute of Infectious Disease and Molecular Medicine, Wellcome Centre for Infectious Disease Research and Division of Medical Microbiology, University of Cape Town, Cape Town, South Africa
| | - Marwan Ghanem
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Claudia M Denkinger
- Division of Infectious Disease and Tropical Medicine, Heidelberg University Hospital, Heidelberg, Germany
- German Center for Infection Research (DZIF), partner site Heidelberg University Hospital, Heidelberg, Germany
| | | | - Mirna S Abd El Aziz
- Division of Infectious Disease and Tropical Medicine, Heidelberg University Hospital, Heidelberg, Germany
| | | | - Debrah Vambe
- National TB Control Programme, Manzini, Eswatini
| | - Cesar Ugarte-Gil
- School of Public and Population Health, University of Texas Medical Branch, Galveston, TX, USA
| | - Jennifer Furin
- Department of Global Health and Social Medicine, Harvard Medical School, Boston, MA, USA
| | - Madhukar Pai
- McGill International TB Centre, McGill University, Montreal, Quebec, Canada.
| |
Collapse
|
3
|
Sufian MA, Hamzi W, Sharifi T, Zaman S, Alsadder L, Lee E, Hakim A, Hamzi B. AI-Driven Thoracic X-ray Diagnostics: Transformative Transfer Learning for Clinical Validation in Pulmonary Radiography. J Pers Med 2024; 14:856. [PMID: 39202047 PMCID: PMC11355475 DOI: 10.3390/jpm14080856] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2024] [Revised: 07/23/2024] [Accepted: 08/01/2024] [Indexed: 09/03/2024] Open
Abstract
Our research evaluates advanced artificial (AI) methodologies to enhance diagnostic accuracy in pulmonary radiography. Utilizing DenseNet121 and ResNet50, we analyzed 108,948 chest X-ray images from 32,717 patients and DenseNet121 achieved an area under the curve (AUC) of 94% in identifying the conditions of pneumothorax and oedema. The model's performance surpassed that of expert radiologists, though further improvements are necessary for diagnosing complex conditions such as emphysema, effusion, and hernia. Clinical validation integrating Latent Dirichlet Allocation (LDA) and Named Entity Recognition (NER) demonstrated the potential of natural language processing (NLP) in clinical workflows. The NER system achieved a precision of 92% and a recall of 88%. Sentiment analysis using DistilBERT provided a nuanced understanding of clinical notes, which is essential for refining diagnostic decisions. XGBoost and SHapley Additive exPlanations (SHAP) enhanced feature extraction and model interpretability. Local Interpretable Model-agnostic Explanations (LIME) and occlusion sensitivity analysis further enriched transparency, enabling healthcare providers to trust AI predictions. These AI techniques reduced processing times by 60% and annotation errors by 75%, setting a new benchmark for efficiency in thoracic diagnostics. The research explored the transformative potential of AI in medical imaging, advancing traditional diagnostics and accelerating medical evaluations in clinical settings.
Collapse
Affiliation(s)
- Md Abu Sufian
- IVR Low-Carbon Research Institute, Chang’an University, Xi’an 710018, China;
- School of Computing and Mathematical Sciences, University of Leicester, Leicester LE1 7RH, UK
| | - Wahiba Hamzi
- Laboratoire de Biotechnologie Santé et Environnement, Department of Biology, University of Blida, Blida 09000, Algeria
| | - Tazkera Sharifi
- Data Science Architect-Lead Technologist, Booz Allen Hamilton, Texas City, TX 78226, USA
| | - Sadia Zaman
- Department of Physiology, Queen Mary University, London E1 4NS, UK
| | - Lujain Alsadder
- Department of Physiology, Queen Mary University, London E1 4NS, UK
| | - Esther Lee
- Department of Physiology, Queen Mary University, London E1 4NS, UK
| | - Amir Hakim
- Department of Physiology, Queen Mary University, London E1 4NS, UK
| | - Boumediene Hamzi
- Department of Computing and Mathematical Sciences, California Institute of Technology, Caltech, CA 91125, USA
- The Alan Turing Institute, London NW1 2DB, UK
- Department of Mathematics, Imperial College London, London SW7 2AZ, UK
- Department of Mathematics, Gulf University for Science and Technology (GUST), Mubarak Al-Abdullah 32093, Kuwait
| |
Collapse
|
4
|
Yu D, Andersson-Li M, Maes S, Andersson-Li L, Neumann NF, Odlare M, Jonsson A. Development of a logic regression-based approach for the discovery of host- and niche-informative biomarkers in Escherichia coli and their application for microbial source tracking. Appl Environ Microbiol 2024; 90:e0022724. [PMID: 38940567 PMCID: PMC11267920 DOI: 10.1128/aem.00227-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2024] [Accepted: 06/07/2024] [Indexed: 06/29/2024] Open
Abstract
Microbial source tracking leverages a wide range of approaches designed to trace the origins of fecal contamination in aquatic environments. Although source tracking methods are typically employed within the laboratory setting, computational techniques can be leveraged to advance microbial source tracking methodology. Herein, we present a logic regression-based supervised learning approach for the discovery of source-informative genetic markers within intergenic regions across the Escherichia coli genome that can be used for source tracking. With just single intergenic loci, logic regression was able to identify highly source-specific (i.e., exceeding 97.00%) biomarkers for a wide range of host and niche sources, with sensitivities reaching as high as 30.00%-50.00% for certain source categories, including pig, sheep, mouse, and wastewater, depending on the specific intergenic locus analyzed. Restricting the source range to reflect the most prominent zoonotic sources of E. coli transmission (i.e., bovine, chicken, human, and pig) allowed for the generation of informative biomarkers for all host categories, with specificities of at least 90.00% and sensitivities between 12.50% and 70.00%, using the sequence data from key intergenic regions, including emrKY-evgAS, ibsB-(mdtABCD-baeSR), ompC-rcsDB, and yedS-yedR, that appear to be involved in antibiotic resistance. Remarkably, we were able to use this approach to classify 48 out of 113 river water E. coli isolates collected in Northwestern Sweden as either beaver, human, or reindeer in origin with a high degree of consensus-thus highlighting the potential of logic regression modeling as a novel approach for augmenting current source tracking efforts.IMPORTANCEThe presence of microbial contaminants, particularly from fecal sources, within water poses a serious risk to public health. The health and economic burden of waterborne pathogens can be substantial-as such, the ability to detect and identify the sources of fecal contamination in environmental waters is crucial for the control of waterborne diseases. This can be accomplished through microbial source tracking, which involves the use of various laboratory techniques to trace the origins of microbial pollution in the environment. Building on current source tracking methodology, we describe a novel workflow that uses logic regression, a supervised machine learning method, to discover genetic markers in Escherichia coli, a common fecal indicator bacterium, that can be used for source tracking efforts. Importantly, our research provides an example of how the rise in prominence of machine learning algorithms can be applied to improve upon current microbial source tracking methodology.
Collapse
Affiliation(s)
- Daniel Yu
- School of Public Health, University of Alberta, Edmonton, Alberta, Canada
| | | | - Sharon Maes
- Department of Natural Sciences, Design and Sustainable Development, Mid Sweden University, Östersund, Sweden
| | - Lili Andersson-Li
- Department of Microbiology, Tumor and Cell Biology, Karolinska Institutet, Solna, Sweden
| | - Norman F. Neumann
- School of Public Health, University of Alberta, Edmonton, Alberta, Canada
| | - Monica Odlare
- Department of Natural Sciences, Design and Sustainable Development, Mid Sweden University, Östersund, Sweden
| | - Anders Jonsson
- Department of Natural Sciences, Design and Sustainable Development, Mid Sweden University, Östersund, Sweden
| |
Collapse
|
5
|
Al-Otaibi NM, Alsulaiman B, Alreshoodi FM, Mukhtar LE, Alajel SM, Binsaeedan NM, Alshabrmi FM. Screening for Antibiotic Resistance Genes in Bacteria and the Presence of Heavy Metals in the Upstream and Downstream Areas of the Wadi Hanifah Valley in Riyadh, Saudi Arabia. Antibiotics (Basel) 2024; 13:426. [PMID: 38786154 PMCID: PMC11117234 DOI: 10.3390/antibiotics13050426] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2024] [Revised: 04/29/2024] [Accepted: 05/02/2024] [Indexed: 05/25/2024] Open
Abstract
Valley surface water is considered a focal public health concern owing to the presence of multi-drug-resistant bacteria. The distribution of antimicrobial resistance (AMR) bacteria in the surface water is affected by the presence of multiple factors, including antibiotics coming from wastewater discharge or other contaminant sources such as pharmaceuticals, biocides, and heavy metals. Furthermore, there is evidence suggesting that high levels of antibiotic resistance genes (ARGs) can be transferred within bacterial communities under the influence of heavy metal stress. Hence, the primary aim of this study is to investigate the presence of heavy metals and bacterial ARGs in upstream as well as downstream locations of Wadi Hanifah Valley in Riyadh, Saudi Arabia. Sample collection was conducted at eighteen surface water sites within the valley in total. The selection of ARGs was associated with the most common antibiotics, including β-lactam, tetracycline, erythromycin, gentamicin, sulphonamide, chloramphenicol, vancomycin, trimethoprim, and colistin antibiotics, which were detected qualitatively using polymerase chain reaction (PCR) technology. The tested antibiotic resistance genes (ARGs) included (blaNDM-1 (for the antibiotic class Beta-lactamases), mecA (methicillin-resistant Staphylococcus aureus), tet(M) and tet(B) (for the antibiotic class Tetracycline), ampC (for the antibiotic class Beta-lactamases), vanA (for the antibiotic class vancomycin), mcr-1 (for the antibiotic class colistin), erm(B) (for the antibiotic class erythromycin), aac6'-Ie-aph2-Ia (for the antibiotic class Gentamicin), sulII (for the antibiotic class sulphonamide), catII (for the antibiotic class Chlorophincol), and dfrA1 (for the antibiotic class trimethoprim). Moreover, an assessment of the levels of heavy metals such as lithium (Li), beryllium (Be), chromium (Cr), cobalt (Co), arsenic (As), cadmium (Cd), tin (Sn), mercury (Hg), and lead (Pb) was conducted by using inductively coupled plasma mass spectrometry (ICPMS). According to our findings, the concentrations of sulphonamide, erythromycin, and chloramphenicol ARGs (erm(B), sulII, and catII) were observed to be the most elevated. Conversely, two ARGs, namely mecA and mcr-1, were not detected in the samples. Moreover, our data illustrated a significant rise in ARGs in the bacteria of water samples from the upstream sites as compared with the water samples from the downstream sites of Wadi Hanifah Valley. The mean concentration of Li, Be, Cr, Co, As, Cd, Sn, Hg, and Pb in the water samples was estimated to be 37.25 µg/L, 0.02 µg/L, 0.56 µg/L,0.32 µg/L, 0.93 µg/L, 0.01 µg/L, 200.4 µg/L, 0.027 µg/L, and 0.26 µg/L, respectively, for the selected 18 sites. Furthermore, it was revealed that the concentrations of the screened heavy metals in the water samples collected from various sites did not surpass the maximum limits set by the World Health Organization (WHO). In conclusion, this study offers a concise overview of the presence of heavy metals and ARGs in water samples obtained from the Wadi Hanifah Valley in Riyadh, KSA. Such findings will contribute to the ongoing monitoring and future risk assessment of ARGs spread in surface water.
Collapse
Affiliation(s)
- Norah M. Al-Otaibi
- Executive Department of Reference Laboratories, Research and Laboratories, Saudi Food and Drug Authority (SFDA), Riyadh 13513, Saudi Arabia; (N.M.A.-O.); (B.A.); (S.M.A.); (N.M.B.)
| | - Bassam Alsulaiman
- Executive Department of Reference Laboratories, Research and Laboratories, Saudi Food and Drug Authority (SFDA), Riyadh 13513, Saudi Arabia; (N.M.A.-O.); (B.A.); (S.M.A.); (N.M.B.)
| | - Fahad M. Alreshoodi
- Executive Department of Reference Laboratories, Research and Laboratories, Saudi Food and Drug Authority (SFDA), Riyadh 13513, Saudi Arabia; (N.M.A.-O.); (B.A.); (S.M.A.); (N.M.B.)
| | - Lenah E. Mukhtar
- Executive Department of Reference Laboratories, Research and Laboratories, Saudi Food and Drug Authority (SFDA), Riyadh 13513, Saudi Arabia; (N.M.A.-O.); (B.A.); (S.M.A.); (N.M.B.)
| | - Sulaiman M. Alajel
- Executive Department of Reference Laboratories, Research and Laboratories, Saudi Food and Drug Authority (SFDA), Riyadh 13513, Saudi Arabia; (N.M.A.-O.); (B.A.); (S.M.A.); (N.M.B.)
| | - Norah M. Binsaeedan
- Executive Department of Reference Laboratories, Research and Laboratories, Saudi Food and Drug Authority (SFDA), Riyadh 13513, Saudi Arabia; (N.M.A.-O.); (B.A.); (S.M.A.); (N.M.B.)
| | - Fahad M. Alshabrmi
- Department of Medical Laboratories, College of Applied Medical Sciences, Qassim University, Buraydah 51452, Saudi Arabia
| |
Collapse
|
6
|
Rusic D, Kumric M, Seselja Perisin A, Leskur D, Bukic J, Modun D, Vilovic M, Vrdoljak J, Martinovic D, Grahovac M, Bozic J. Tackling the Antimicrobial Resistance "Pandemic" with Machine Learning Tools: A Summary of Available Evidence. Microorganisms 2024; 12:842. [PMID: 38792673 PMCID: PMC11123121 DOI: 10.3390/microorganisms12050842] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2024] [Revised: 04/16/2024] [Accepted: 04/19/2024] [Indexed: 05/26/2024] Open
Abstract
Antimicrobial resistance is recognised as one of the top threats healthcare is bound to face in the future. There have been various attempts to preserve the efficacy of existing antimicrobials, develop new and efficient antimicrobials, manage infections with multi-drug resistant strains, and improve patient outcomes, resulting in a growing mass of routinely available data, including electronic health records and microbiological information that can be employed to develop individualised antimicrobial stewardship. Machine learning methods have been developed to predict antimicrobial resistance from whole-genome sequencing data, forecast medication susceptibility, recognise epidemic patterns for surveillance purposes, or propose new antibacterial treatments and accelerate scientific discovery. Unfortunately, there is an evident gap between the number of machine learning applications in science and the effective implementation of these systems. This narrative review highlights some of the outstanding opportunities that machine learning offers when applied in research related to antimicrobial resistance. In the future, machine learning tools may prove to be superbugs' kryptonite. This review aims to provide an overview of available publications to aid researchers that are looking to expand their work with new approaches and to acquaint them with the current application of machine learning techniques in this field.
Collapse
Affiliation(s)
- Doris Rusic
- Department of Pharmacy, University of Split School of Medicine, Soltanska 2A, 21000 Split, Croatia; (D.R.); (A.S.P.); (D.L.); (J.B.); (D.M.)
| | - Marko Kumric
- Department of Pathophysiology, University of Split School of Medicine, Soltanska 2A, 21000 Split, Croatia; (M.K.); (M.V.); (J.V.); (D.M.)
- Laboratory for Cardiometabolic Research, University of Split School of Medicine, Soltanska 2A, 21000 Split, Croatia
| | - Ana Seselja Perisin
- Department of Pharmacy, University of Split School of Medicine, Soltanska 2A, 21000 Split, Croatia; (D.R.); (A.S.P.); (D.L.); (J.B.); (D.M.)
| | - Dario Leskur
- Department of Pharmacy, University of Split School of Medicine, Soltanska 2A, 21000 Split, Croatia; (D.R.); (A.S.P.); (D.L.); (J.B.); (D.M.)
| | - Josipa Bukic
- Department of Pharmacy, University of Split School of Medicine, Soltanska 2A, 21000 Split, Croatia; (D.R.); (A.S.P.); (D.L.); (J.B.); (D.M.)
| | - Darko Modun
- Department of Pharmacy, University of Split School of Medicine, Soltanska 2A, 21000 Split, Croatia; (D.R.); (A.S.P.); (D.L.); (J.B.); (D.M.)
| | - Marino Vilovic
- Department of Pathophysiology, University of Split School of Medicine, Soltanska 2A, 21000 Split, Croatia; (M.K.); (M.V.); (J.V.); (D.M.)
- Laboratory for Cardiometabolic Research, University of Split School of Medicine, Soltanska 2A, 21000 Split, Croatia
| | - Josip Vrdoljak
- Department of Pathophysiology, University of Split School of Medicine, Soltanska 2A, 21000 Split, Croatia; (M.K.); (M.V.); (J.V.); (D.M.)
- Laboratory for Cardiometabolic Research, University of Split School of Medicine, Soltanska 2A, 21000 Split, Croatia
| | - Dinko Martinovic
- Department of Pathophysiology, University of Split School of Medicine, Soltanska 2A, 21000 Split, Croatia; (M.K.); (M.V.); (J.V.); (D.M.)
- Department of Maxillofacial Surgery, University Hospital of Split, Spinciceva 1, 21000 Split, Croatia
| | - Marko Grahovac
- Department of Pharmacology, University of Split School of Medicine, Soltanska 2A, 21000 Split, Croatia;
| | - Josko Bozic
- Department of Pathophysiology, University of Split School of Medicine, Soltanska 2A, 21000 Split, Croatia; (M.K.); (M.V.); (J.V.); (D.M.)
- Laboratory for Cardiometabolic Research, University of Split School of Medicine, Soltanska 2A, 21000 Split, Croatia
| |
Collapse
|
7
|
Sawant PA, Hiralkar SS, Hulsurkar YP, Phutane MS, Mahajan US, Kudale AM. Predicting over-the-counter antibiotic use in rural Pune, India, using machine learning methods. Epidemiol Health 2024; 46:e2024044. [PMID: 38637971 PMCID: PMC11417445 DOI: 10.4178/epih.e2024044] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2023] [Accepted: 03/25/2024] [Indexed: 04/20/2024] Open
Abstract
OBJECTIVES Over-the-counter (OTC) antibiotic use can cause antibiotic resistance, threatening global public health gains. To counter OTC use, this study used machine learning (ML) methods to identify predictors of OTC antibiotic use in rural Pune, India. METHODS The features of OTC antibiotic use were selected using stepwise logistic, lasso, random forest, XGBoost, and Boruta algorithms. Regression and tree-based models with all confirmed and tentatively important features were built to predict the use of OTC antibiotics. Five-fold cross-validation was used to tune the models' hyperparameters. The final model was selected based on the highest area under the curve (AUROC) with a 95% confidence interval (CI) and the lowest log-loss. RESULTS In rural Pune, the prevalence of OTC antibiotic use was 35.9% (95% CI, 31.6 to 40.5). The perception that buying medicines directly from a medicine shop/pharmacy is useful, using antibiotics for eye-related complaints, more household members consuming antibiotics, and longer duration and higher doses of antibiotic consumption in rural blocks and other social groups were confirmed as important features by the Boruta algorithm. The final model was the XGBoost+Boruta model with 7 predictors (AUROC, 0.934; 95% CI, 0.891 to 0.978; log-loss, 0.279) log-loss. CONCLUSIONS XGBoost+Boruta, with 7 predictors, was the most accurate model for predicting OTC antibiotic use in rural Pune. Using OTC antibiotics for eye-related complaints, higher consumption of antibiotics and the perception that buying antibiotics directly from a medicine shop/pharmacy is useful were identified as key factors for planning interventions to improve awareness about proper antibiotic use.
Collapse
Affiliation(s)
- Pravin Arun Sawant
- Department of Health Sciences, School of Health Sciences, Savitribai Phule Pune University, Pune, India
| | - Sakshi Shantanu Hiralkar
- Department of Health Sciences, School of Health Sciences, Savitribai Phule Pune University, Pune, India
| | | | - Mugdha Sharad Phutane
- Department of Health Sciences, School of Health Sciences, Savitribai Phule Pune University, Pune, India
| | - Uma Satish Mahajan
- Department of Health Sciences, School of Health Sciences, Savitribai Phule Pune University, Pune, India
| | - Abhay Machindra Kudale
- Department of Health Sciences, School of Health Sciences, Savitribai Phule Pune University, Pune, India
| |
Collapse
|
8
|
Dixit A, Freschi L, Vargas R, Gröschel MI, Nakhoul M, Tahseen S, Alam SMM, Kamal SMM, Skrahina A, Basilio RP, Lim DR, Ismail N, Farhat MR. Estimation of country-specific tuberculosis resistance antibiograms using pathogen genomics and machine learning. BMJ Glob Health 2024; 9:e013532. [PMID: 38548342 PMCID: PMC10982777 DOI: 10.1136/bmjgh-2023-013532] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2023] [Accepted: 02/26/2024] [Indexed: 04/02/2024] Open
Abstract
BACKGROUND Global tuberculosis (TB) drug resistance (DR) surveillance focuses on rifampicin. We examined the potential of public and surveillance Mycobacterium tuberculosis (Mtb) whole-genome sequencing (WGS) data, to generate expanded country-level resistance prevalence estimates (antibiograms) using in silico resistance prediction. METHODS We curated and quality-controlled Mtb WGS data. We used a validated random forest model to predict phenotypic resistance to 12 drugs and bias-corrected for model performance, outbreak sampling and rifampicin resistance oversampling. Validation leveraged a national DR survey conducted in South Africa. RESULTS Mtb isolates from 29 countries (n=19 149) met sequence quality criteria. Global marginal genotypic resistance among mono-resistant TB estimates overlapped with the South African DR survey, except for isoniazid, ethionamide and second-line injectables, which were underestimated (n=3134). Among multidrug resistant (MDR) TB (n=268), estimates overlapped for the fluoroquinolones but overestimated other drugs. Globally pooled mono-resistance to isoniazid was 10.9% (95% CI: 10.2-11.7%, n=14 012). Mono-levofloxacin resistance rates were highest in South Asia (Pakistan 3.4% (0.1-11%), n=111 and India 2.8% (0.08-9.4%), n=114). Given the recent interest in drugs enhancing ethionamide activity and their expected activity against isolates with resistance discordance between isoniazid and ethionamide, we measured this rate and found it to be high at 74.4% (IQR: 64.5-79.7%) of isoniazid-resistant isolates predicted to be ethionamide susceptible. The global susceptibility rate to pyrazinamide and levofloxacin among MDR was 15.1% (95% CI: 10.2-19.9%, n=3964). CONCLUSIONS This is the first attempt at global Mtb antibiogram estimation. DR prevalence in Mtb can be reliably estimated using public WGS and phenotypic resistance prediction for key antibiotics, but public WGS data demonstrates oversampling of isolates with higher resistance levels than MDR. Nevertheless, our results raise concerns about the empiric use of short-course fluoroquinolone regimens for drug-susceptible TB in South Asia and indicate underutilisation of ethionamide in MDR treatment.
Collapse
Affiliation(s)
- Avika Dixit
- Division of Infectious Diseases, Department of Pediatrics, Boston Children's Hospital, Boston, Massachusetts, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, USA
| | - Luca Freschi
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, USA
| | - Roger Vargas
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, USA
- Center for Computational Biomedicine, Harvard Medical School, Boston, Massachusetts, USA
| | - Matthias I Gröschel
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, USA
| | - Maria Nakhoul
- Informatics and Analytics Department, Dana-Farber Cancer Institute, Boston, Massachusetts, USA
| | - Sabira Tahseen
- National Tuberculosis Control Programme, Islamabad, Pakistan
| | - S M Masud Alam
- Ministry of Health and Family Welfare, Kolkata, West Bengal, India
| | - S M Mostofa Kamal
- National Institute of Diseases of the Chest and Hospital, Dhaka, Bangladesh
| | - Alena Skrahina
- Republican Scientific and Practical Center for Pulmonology and Tuberculosis, Minsk, Belarus
| | - Ramon P Basilio
- Research Institute for Tropical Medicine, Muntinlupa City, Philippines
| | - Dodge R Lim
- Research Institute for Tropical Medicine, Muntinlupa City, Philippines
| | - Nazir Ismail
- Clinical Microbiology and Infectious Diseases, University of the Witwatersrand Johannesburg Faculty of Health Sciences, Johannesburg, South Africa
| | - Maha R Farhat
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, USA
- Pulmonary and Critical Care Medicine, Massachusetts General Hospital, Boston, Massachusetts, USA
| |
Collapse
|
9
|
Hu K, Meyer F, Deng ZL, Asgari E, Kuo TH, Münch PC, McHardy AC. Assessing computational predictions of antimicrobial resistance phenotypes from microbial genomes. Brief Bioinform 2024; 25:bbae206. [PMID: 38706320 PMCID: PMC11070729 DOI: 10.1093/bib/bbae206] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2023] [Revised: 04/08/2024] [Accepted: 04/11/2024] [Indexed: 05/07/2024] Open
Abstract
The advent of rapid whole-genome sequencing has created new opportunities for computational prediction of antimicrobial resistance (AMR) phenotypes from genomic data. Both rule-based and machine learning (ML) approaches have been explored for this task, but systematic benchmarking is still needed. Here, we evaluated four state-of-the-art ML methods (Kover, PhenotypeSeeker, Seq2Geno2Pheno and Aytan-Aktug), an ML baseline and the rule-based ResFinder by training and testing each of them across 78 species-antibiotic datasets, using a rigorous benchmarking workflow that integrates three evaluation approaches, each paired with three distinct sample splitting methods. Our analysis revealed considerable variation in the performance across techniques and datasets. Whereas ML methods generally excelled for closely related strains, ResFinder excelled for handling divergent genomes. Overall, Kover most frequently ranked top among the ML approaches, followed by PhenotypeSeeker and Seq2Geno2Pheno. AMR phenotypes for antibiotic classes such as macrolides and sulfonamides were predicted with the highest accuracies. The quality of predictions varied substantially across species-antibiotic combinations, particularly for beta-lactams; across species, resistance phenotyping of the beta-lactams compound, aztreonam, amoxicillin/clavulanic acid, cefoxitin, ceftazidime and piperacillin/tazobactam, alongside tetracyclines demonstrated more variable performance than the other benchmarked antibiotics. By organism, Campylobacter jejuni and Enterococcus faecium phenotypes were more robustly predicted than those of Escherichia coli, Staphylococcus aureus, Salmonella enterica, Neisseria gonorrhoeae, Klebsiella pneumoniae, Pseudomonas aeruginosa, Acinetobacter baumannii, Streptococcus pneumoniae and Mycobacterium tuberculosis. In addition, our study provides software recommendations for each species-antibiotic combination. It furthermore highlights the need for optimization for robust clinical applications, particularly for strains that diverge substantially from those used for training.
Collapse
Affiliation(s)
- Kaixin Hu
- Computational Biology of Infection Research, Helmholtz Center for Infection Research, Braunschweig, Germany
- Braunschweig Integrated Centre of Systems Biology (BRICS), Technische Universität Braunschweig, Braunschweig, Germany
| | - Fernando Meyer
- Computational Biology of Infection Research, Helmholtz Center for Infection Research, Braunschweig, Germany
- Braunschweig Integrated Centre of Systems Biology (BRICS), Technische Universität Braunschweig, Braunschweig, Germany
| | - Zhi-Luo Deng
- Computational Biology of Infection Research, Helmholtz Center for Infection Research, Braunschweig, Germany
- Braunschweig Integrated Centre of Systems Biology (BRICS), Technische Universität Braunschweig, Braunschweig, Germany
| | - Ehsaneddin Asgari
- Computational Biology of Infection Research, Helmholtz Center for Infection Research, Braunschweig, Germany
- Molecular Cell Biomechanics Laboratory, Department of Bioengineering and Mechanical Engineering, University of California, Berkeley, USA
| | - Tzu-Hao Kuo
- Computational Biology of Infection Research, Helmholtz Center for Infection Research, Braunschweig, Germany
- Braunschweig Integrated Centre of Systems Biology (BRICS), Technische Universität Braunschweig, Braunschweig, Germany
| | - Philipp C Münch
- Computational Biology of Infection Research, Helmholtz Center for Infection Research, Braunschweig, Germany
- Braunschweig Integrated Centre of Systems Biology (BRICS), Technische Universität Braunschweig, Braunschweig, Germany
- Cluster of Excellence RESIST (EXC 2155), Hannover Medical School, Hannover, Germany
- German Center for Infection Research (DZIF), partner site Hannover Braunschweig, Braunschweig, Germany
- Department of Biostatistics, Harvard School of Public Health, Boston, MA, USA
| | - Alice C McHardy
- Computational Biology of Infection Research, Helmholtz Center for Infection Research, Braunschweig, Germany
- Braunschweig Integrated Centre of Systems Biology (BRICS), Technische Universität Braunschweig, Braunschweig, Germany
| |
Collapse
|
10
|
Wang Y, Jiang Z, Liang P, Liu Z, Cai H, Sun Q. TB-DROP: deep learning-based drug resistance prediction of Mycobacterium tuberculosis utilizing whole genome mutations. BMC Genomics 2024; 25:167. [PMID: 38347478 PMCID: PMC10860279 DOI: 10.1186/s12864-024-10066-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2023] [Accepted: 01/30/2024] [Indexed: 02/15/2024] Open
Abstract
The most widely practiced strategy for constructing the deep learning (DL) prediction model for drug resistance of Mycobacterium tuberculosis (MTB) involves the adoption of ready-made and state-of-the-art architectures usually proposed for non-biological problems. However, the ultimate goal is to construct a customized model for predicting the drug resistance of MTB and eventually for the biological phenotypes based on genotypes. Here, we constructed a DL training framework to standardize and modularize each step during the training process using the latest tensorflow 2 API. A systematic and comprehensive evaluation of each module in the three currently representative models, including Convolutional Neural Network, Denoising Autoencoder, and Wide & Deep, which were adopted by CNNGWP, DeepAMR, and WDNN, respectively, was performed in this framework regarding module contributions in order to assemble a novel model with proper dedicated modules. Based on the whole-genome level mutations, a de novo learning method was developed to overcome the intrinsic limitations of previous models that rely on known drug resistance-associated loci. A customized DL model with the multilayer perceptron architecture was constructed and achieved a competitive performance (the mean sensitivity and specificity were 0.90 and 0.87, respectively) compared to previous ones. The new model developed was applied in an end-to-end user-friendly graphical tool named TB-DROP (TuBerculosis Drug Resistance Optimal Prediction: https://github.com/nottwy/TB-DROP ), in which users only provide sequencing data and TB-DROP will complete analysis within several minutes for one sample. Our study contributes to both a new strategy of model construction and clinical application of deep learning-based drug-resistance prediction methods.
Collapse
Affiliation(s)
- Yu Wang
- Center of Growth, Metabolism and Aging, Key Laboratory of Bio-Resources and Eco-Environment of the Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, 610064, China
| | - Zhonghua Jiang
- Key Laboratory of Bio-Resources and Eco-Environment of the Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, 610064, China
| | - Pengkuan Liang
- Key Laboratory of Bio-Resources and Eco-Environment of the Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, 610064, China
- Zhejiang Yangshengtang Institute of Natural Medication Co., Ltd, Hangzhou, China
| | - Zhuochong Liu
- Key Laboratory of Bio-Resources and Eco-Environment of the Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, 610064, China
| | - Haoyang Cai
- Center of Growth, Metabolism and Aging, Key Laboratory of Bio-Resources and Eco-Environment of the Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, 610064, China.
| | - Qun Sun
- Key Laboratory of Bio-Resources and Eco-Environment of the Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, 610064, China.
| |
Collapse
|
11
|
Sanz-García F, Laborda P, Ochoa-Sánchez LE, Martínez JL, Hernando-Amado S. The Pseudomonas aeruginosa Resistome: Permanent and Transient Antibiotic Resistance, an Overview. Methods Mol Biol 2024; 2721:85-102. [PMID: 37819517 DOI: 10.1007/978-1-0716-3473-8_7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/13/2023]
Abstract
One of the most concerning characteristics of Pseudomonas aeruginosa is its low susceptibility to several antibiotics of common use in clinics, as well as its facility to acquire increased resistance levels. Consequently, the study of the antibiotic resistance mechanisms of this bacterium is of relevance for human health. For such a study, different types of resistance should be distinguished. The intrinsic resistome is composed of a set of genes, present in the core genome of P. aeruginosa, which contributes to its characteristic, species-specific, phenotype of susceptibility to antibiotics. Acquired resistance refers to those genetic events, such as the acquisition of mutations or antibiotic resistance genes that reduce antibiotic susceptibility. Finally, antibiotic resistance can be transiently acquired in the presence of specific compounds or under some growing conditions. The current article provides information on methods currently used to analyze intrinsic, mutation-driven, and transient antibiotic resistance in P. aeruginosa.
Collapse
Affiliation(s)
| | - Pablo Laborda
- Centro Nacional de Biotecnología, CSIC, Madrid, Spain
| | | | | | | |
Collapse
|
12
|
Yurtseven A, Buyanova S, Agrawal AA, Bochkareva OO, Kalinina OV. Machine learning and phylogenetic analysis allow for predicting antibiotic resistance in M. tuberculosis. BMC Microbiol 2023; 23:404. [PMID: 38124060 PMCID: PMC10731705 DOI: 10.1186/s12866-023-03147-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2023] [Accepted: 12/07/2023] [Indexed: 12/23/2023] Open
Abstract
BACKGROUND Antimicrobial resistance (AMR) poses a significant global health threat, and an accurate prediction of bacterial resistance patterns is critical for effective treatment and control strategies. In recent years, machine learning (ML) approaches have emerged as powerful tools for analyzing large-scale bacterial AMR data. However, ML methods often ignore evolutionary relationships among bacterial strains, which can greatly impact performance of the ML methods, especially if resistance-associated features are attempted to be detected. Genome-wide association studies (GWAS) methods like linear mixed models accounts for the evolutionary relationships in bacteria, but they uncover only highly significant variants which have already been reported in literature. RESULTS In this work, we introduce a novel phylogeny-related parallelism score (PRPS), which measures whether a certain feature is correlated with the population structure of a set of samples. We demonstrate that PRPS can be used, in combination with SVM- and random forest-based models, to reduce the number of features in the analysis, while simultaneously increasing models' performance. We applied our pipeline to publicly available AMR data from PATRIC database for Mycobacterium tuberculosis against six common antibiotics. CONCLUSIONS Using our pipeline, we re-discovered known resistance-associated mutations as well as new candidate mutations which can be related to resistance and not previously reported in the literature. We demonstrated that taking into account phylogenetic relationships not only improves the model performance, but also yields more biologically relevant predicted most contributing resistance markers.
Collapse
Affiliation(s)
- Alper Yurtseven
- Department of Drug Bioinformatics, Helmholtz Institute for Pharmaceutical Research Saarland (HIPS), Helmholtz Centre for Infection Research (HZI), Campus E8.1, Saarbrücken, 66123, Saarland, Germany.
- Graduate School of Computer Science, Saarland University, Saarbrücken, 66123, Saarland, Germany.
| | - Sofia Buyanova
- Institute of Science and Technology Austria (ISTA), Am Campus 1, Klosterneuburg, 3400, Austria
| | - Amay Ajaykumar Agrawal
- Department of Drug Bioinformatics, Helmholtz Institute for Pharmaceutical Research Saarland (HIPS), Helmholtz Centre for Infection Research (HZI), Campus E8.1, Saarbrücken, 66123, Saarland, Germany
- Graduate School of Computer Science, Saarland University, Saarbrücken, 66123, Saarland, Germany
| | - Olga O Bochkareva
- Institute of Science and Technology Austria (ISTA), Am Campus 1, Klosterneuburg, 3400, Austria
- Centre for Microbiology and Environmental Systems Science, Division of Computational System Biology, University of Vienna, Djerassiplatz 1 A, Wien, 1030, Austria
| | - Olga V Kalinina
- Department of Drug Bioinformatics, Helmholtz Institute for Pharmaceutical Research Saarland (HIPS), Helmholtz Centre for Infection Research (HZI), Campus E8.1, Saarbrücken, 66123, Saarland, Germany
- Graduate School of Computer Science, Saarland University, Saarbrücken, 66123, Saarland, Germany
- Faculty of Medicine, Saarland University, Homburg, 66421, Saarland, Germany
| |
Collapse
|
13
|
Chung HC, Foxx CL, Hicks JA, Stuber TP, Friedberg I, Dorman KS, Harris B. An accurate and interpretable model for antimicrobial resistance in pathogenic Escherichia coli from livestock and companion animal species. PLoS One 2023; 18:e0290473. [PMID: 37616210 PMCID: PMC10449230 DOI: 10.1371/journal.pone.0290473] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2023] [Accepted: 08/09/2023] [Indexed: 08/26/2023] Open
Abstract
Understanding the microbial genomic contributors to antimicrobial resistance (AMR) is essential for early detection of emerging AMR infections, a pressing global health threat in human and veterinary medicine. Here we used whole genome sequencing and antibiotic susceptibility test data from 980 disease causing Escherichia coli isolated from companion and farm animals to model AMR genotypes and phenotypes for 24 antibiotics. We determined the strength of genotype-to-phenotype relationships for 197 AMR genes with elastic net logistic regression. Model predictors were designed to evaluate different potential modes of AMR genotype translation into resistance phenotypes. Our results show a model that considers the presence of individual AMR genes and total number of AMR genes present from a set of genes known to confer resistance was able to accurately predict isolate resistance on average (mean F1 score = 98.0%, SD = 2.3%, mean accuracy = 98.2%, SD = 2.7%). However, fitted models sometimes varied for antibiotics in the same class and for the same antibiotic across animal hosts, suggesting heterogeneity in the genetic determinants of AMR resistance. We conclude that an interpretable AMR prediction model can be used to accurately predict resistance phenotypes across multiple host species and reveal testable hypotheses about how the mechanism of resistance may vary across antibiotics within the same class and across animal hosts for the same antibiotic.
Collapse
Affiliation(s)
- Henri C. Chung
- Department of Veterinary Microbiology and Preventive Medicine, Iowa State University, Ames, IA, United States of America
| | - Christine L. Foxx
- Research Participation Program, Oak Ridge Institute for Science and Education, Oak Ridge, TN, United States of America
| | - Jessica A. Hicks
- National Veterinary Services Laboratories, Animal and Plant Health Inspection Service, U.S. Department of Agriculture, Ames, IA, United States of America
| | - Tod P. Stuber
- National Veterinary Services Laboratories, Animal and Plant Health Inspection Service, U.S. Department of Agriculture, Ames, IA, United States of America
| | - Iddo Friedberg
- Department of Veterinary Microbiology and Preventive Medicine, Iowa State University, Ames, IA, United States of America
| | - Karin S. Dorman
- Department of Genetics, Development and Cell Biology, Iowa State University, Ames, IA, United States of America
- Department of Statistics, Iowa State University, Ames, IA, United States of America
| | - Beth Harris
- National Animal Health Laboratory Network, National Veterinary Services Laboratories, Animal and Plant Health Inspection Service, U.S. Department of Agriculture, Ames, IA, United States of America
| |
Collapse
|
14
|
Karlsen ST, Rau MH, Sánchez BJ, Jensen K, Zeidan AA. From genotype to phenotype: computational approaches for inferring microbial traits relevant to the food industry. FEMS Microbiol Rev 2023; 47:fuad030. [PMID: 37286882 PMCID: PMC10337747 DOI: 10.1093/femsre/fuad030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2023] [Revised: 05/31/2023] [Accepted: 06/06/2023] [Indexed: 06/09/2023] Open
Abstract
When selecting microbial strains for the production of fermented foods, various microbial phenotypes need to be taken into account to achieve target product characteristics, such as biosafety, flavor, texture, and health-promoting effects. Through continuous advances in sequencing technologies, microbial whole-genome sequences of increasing quality can now be obtained both cheaper and faster, which increases the relevance of genome-based characterization of microbial phenotypes. Prediction of microbial phenotypes from genome sequences makes it possible to quickly screen large strain collections in silico to identify candidates with desirable traits. Several microbial phenotypes relevant to the production of fermented foods can be predicted using knowledge-based approaches, leveraging our existing understanding of the genetic and molecular mechanisms underlying those phenotypes. In the absence of this knowledge, data-driven approaches can be applied to estimate genotype-phenotype relationships based on large experimental datasets. Here, we review computational methods that implement knowledge- and data-driven approaches for phenotype prediction, as well as methods that combine elements from both approaches. Furthermore, we provide examples of how these methods have been applied in industrial biotechnology, with special focus on the fermented food industry.
Collapse
Affiliation(s)
- Signe T Karlsen
- Bioinformatics & Modeling, R&D Digital Innovation, Chr. Hansen A/S, Bøge Allé 10-12, 2970 Hørsholm, Denmark
| | - Martin H Rau
- Bioinformatics & Modeling, R&D Digital Innovation, Chr. Hansen A/S, Bøge Allé 10-12, 2970 Hørsholm, Denmark
| | - Benjamín J Sánchez
- Bioinformatics & Modeling, R&D Digital Innovation, Chr. Hansen A/S, Bøge Allé 10-12, 2970 Hørsholm, Denmark
| | - Kristian Jensen
- Bioinformatics & Modeling, R&D Digital Innovation, Chr. Hansen A/S, Bøge Allé 10-12, 2970 Hørsholm, Denmark
| | - Ahmad A Zeidan
- Bioinformatics & Modeling, R&D Digital Innovation, Chr. Hansen A/S, Bøge Allé 10-12, 2970 Hørsholm, Denmark
| |
Collapse
|
15
|
Farhat F, Athar MT, Ahmad S, Madsen DØ, Sohail SS. Antimicrobial resistance and machine learning: past, present, and future. Front Microbiol 2023; 14:1179312. [PMID: 37303800 PMCID: PMC10250749 DOI: 10.3389/fmicb.2023.1179312] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2023] [Accepted: 05/15/2023] [Indexed: 06/13/2023] Open
Abstract
Machine learning has become ubiquitous across all industries, including the relatively new application of predicting antimicrobial resistance. As the first bibliometric review in this field, we expect it to inspire further research in this area. The review employs standard bibliometric indicators such as article count, citation count, and the Hirsch index (H-index) to evaluate the relevance and impact of the leading countries, organizations, journals, and authors in this field. VOSviewer and Biblioshiny programs are utilized to analyze citation and co-citation networks, collaboration networks, keyword co-occurrence, and trend analysis. The United States has the highest contribution with 254 articles, accounting for over 37.57% of the total corpus, followed by China (103) and the United Kingdom (78). Among 58 publishers, the top four publishers account for 45% of the publications, with Elsevier leading with 15% of the publications, followed by Springer Nature (12%), MDPI, and Frontiers Media SA with 9% each. Frontiers in Microbiology is the most frequent publication source (33 articles), followed by Scientific Reports (29 articles), PLoS One (17 articles), and Antibiotics (16 articles). The study reveals a substantial increase in research and publications on the use of machine learning to predict antibiotic resistance. Recent research has focused on developing advanced machine learning algorithms that can accurately forecast antibiotic resistance, and a range of algorithms are now being used to address this issue.
Collapse
Affiliation(s)
- Faiza Farhat
- Department of Zoology, Aligarh Muslim University, Aligarh, India
| | - Md Tanwir Athar
- Department of Pharmacognosy and Pharmaceutical Chemistry, College of Dentistry and Pharmacy, Buraydah Colleges, Buraydah, Al-Qassim, Saudi Arabia
| | - Sultan Ahmad
- Department of Computer Science, College of Computer Engineering and Sciences, Prince Sattam bin Abdulaziz University, Al-Kharj, Saudi Arabia
- Department of Computer Science and Engineering, University Center for Research and Development (UCRD), Chandigarh University, Mohali, Punjab, India
| | - Dag Øivind Madsen
- School of Business, University of South-Eastern Norway, Hønefoss, Norway
| | - Shahab Saquib Sohail
- Department of Computer Science and Engineering, Jamia Hamdard University, New Delhi, India
| |
Collapse
|
16
|
Álvarez VE, Quiroga MP, Centrón D. Identification of a Specific Biomarker of Acinetobacter baumannii Global Clone 1 by Machine Learning and PCR Related to Metabolic Fitness of ESKAPE Pathogens. mSystems 2023:e0073422. [PMID: 37184409 DOI: 10.1128/msystems.00734-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/16/2023] Open
Abstract
Since the emergence of high-risk clones worldwide, constant investigations have been undertaken to comprehend the molecular basis that led to their prevalent dissemination in nosocomial settings over time. So far, the complex and multifactorial genetic traits of this type of epidemic clones have allowed only the identification of biomarkers with low specificity. A machine learning algorithm was able to recognize unequivocally a biomarker for early and accurate detection of Acinetobacter baumannii global clone 1 (GC1), one of the most disseminated high-risk clones. A support vector machine model identified the U1 sequence with a length of 367 nucleotides that matched a fragment of the moaCB gene, which encodes the molybdenum cofactor biosynthesis C and B proteins. U1 differentiates specifically between A. baumannii GC1 and non-GC1 strains, becoming a suitable biomarker capable of being translated into clinical settings as a molecular typing method for early diagnosis based on PCR as shown here. Since the metabolic pathways of Mo enzymes have been recognized as putative therapeutic targets for ESKAPE (Enterococcus faecium, Staphylococcus aureus, Klebsiella pneumoniae, Acinetobacter baumannii, Pseudomonas aeruginosa, and Enterobacter species) pathogens, our findings highlight that machine learning can also be useful in knowledge gaps of high-risk clones and provides noteworthy support to the literature to identify relevant nosocomial biomarkers for other multidrug-resistant high-risk clones. IMPORTANCE A. baumannii GC1 is an important high-risk clone that rapidly develops extreme drug resistance in the nosocomial niche. Furthermore, several strains have been identified worldwide in environmental samples, exacerbating the risk of human interactions. Early diagnosis is mandatory to limit its dissemination and to outline appropriate antibiotic stewardship schedules. A region with a length of 367 bp (U1) within the moaCB gene that is not subjected to lateral genetic transfer or to antibiotic pressures was successfully found by a support vector machine model that predicts A. baumannii GC1 strains. At the same time, research on the group of Mo enzymes proposed this metabolic pathway related to the superbug's metabolism as a potential future drug target site for ESKAPE pathogens due to its central role in bacterial fitness during infection. These findings confirm that machine learning used for the identification of biomarkers of high-risk lineages can also serve to identify putative novel therapeutic target sites.
Collapse
Affiliation(s)
- Verónica Elizabeth Álvarez
- Laboratorio de Investigaciones en Mecanismos de Resistencia a Antibióticos (LIMRA), Instituto de Investigaciones en Microbiología y Parasitología Médica, Facultad de Medicina, Universidad de Buenos Aires-Consejo Nacional de Investigaciones Científicas y Tecnológicas (IMPaM, UBA-CONICET), Ciudad Autónoma de Buenos Aires, Argentina
| | - María Paula Quiroga
- Laboratorio de Investigaciones en Mecanismos de Resistencia a Antibióticos (LIMRA), Instituto de Investigaciones en Microbiología y Parasitología Médica, Facultad de Medicina, Universidad de Buenos Aires-Consejo Nacional de Investigaciones Científicas y Tecnológicas (IMPaM, UBA-CONICET), Ciudad Autónoma de Buenos Aires, Argentina
- Nodo de Bioinformática. Instituto de Investigaciones en Microbiología y Parasitología Médica, Facultad de Medicina, Universidad de Buenos Aires-Consejo Nacional de Investigaciones Científicas y Técnicas (IMPaM, UBA-CONICET), Ciudad Autónoma de Buenos Aires, Argentina
| | - Daniela Centrón
- Laboratorio de Investigaciones en Mecanismos de Resistencia a Antibióticos (LIMRA), Instituto de Investigaciones en Microbiología y Parasitología Médica, Facultad de Medicina, Universidad de Buenos Aires-Consejo Nacional de Investigaciones Científicas y Tecnológicas (IMPaM, UBA-CONICET), Ciudad Autónoma de Buenos Aires, Argentina
| |
Collapse
|
17
|
Naidu A, Nayak SS, Lulu S S, Sundararajan V. Advances in computational frameworks in the fight against TB: The way forward. Front Pharmacol 2023; 14:1152915. [PMID: 37077815 PMCID: PMC10106641 DOI: 10.3389/fphar.2023.1152915] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2023] [Accepted: 03/20/2023] [Indexed: 04/05/2023] Open
Abstract
Around 1.6 million people lost their life to Tuberculosis in 2021 according to WHO estimates. Although an intensive treatment plan exists against the causal agent, Mycobacterium Tuberculosis, evolution of multi-drug resistant strains of the pathogen puts a large number of global populations at risk. Vaccine which can induce long-term protection is still in the making with many candidates currently in different phases of clinical trials. The COVID-19 pandemic has further aggravated the adversities by affecting early TB diagnosis and treatment. Yet, WHO remains adamant on its "End TB" strategy and aims to substantially reduce TB incidence and deaths by the year 2035. Such an ambitious goal would require a multi-sectoral approach which would greatly benefit from the latest computational advancements. To highlight the progress of these tools against TB, through this review, we summarize recent studies which have used advanced computational tools and algorithms for-early TB diagnosis, anti-mycobacterium drug discovery and in the designing of the next-generation of TB vaccines. At the end, we give an insight on other computational tools and Machine Learning approaches which have successfully been applied in biomedical research and discuss their prospects and applications against TB.
Collapse
Affiliation(s)
| | | | | | - Vino Sundararajan
- Department of Biotechnology, School of Bio Sciences and Technology, VIT University, Vellore, India
| |
Collapse
|
18
|
Kumar R, Yadav G, Kuddus M, Ashraf GM, Singh R. Unlocking the microbial studies through computational approaches: how far have we reached? ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2023; 30:48929-48947. [PMID: 36920617 PMCID: PMC10016191 DOI: 10.1007/s11356-023-26220-0] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/23/2022] [Accepted: 02/24/2023] [Indexed: 04/16/2023]
Abstract
The metagenomics approach accelerated the study of genetic information from uncultured microbes and complex microbial communities. In silico research also facilitated an understanding of protein-DNA interactions, protein-protein interactions, docking between proteins and phyto/biochemicals for drug design, and modeling of the 3D structure of proteins. These in silico approaches provided insight into analyzing pathogenic and nonpathogenic strains that helped in the identification of probable genes for vaccines and antimicrobial agents and comparing whole-genome sequences to microbial evolution. Artificial intelligence, more precisely machine learning (ML) and deep learning (DL), has proven to be a promising approach in the field of microbiology to handle, analyze, and utilize large data that are generated through nucleic acid sequencing and proteomics. This enabled the understanding of the functional and taxonomic diversity of microorganisms. ML and DL have been used in the prediction and forecasting of diseases and applied to trace environmental contaminants and environmental quality. This review presents an in-depth analysis of the recent application of silico approaches in microbial genomics, proteomics, functional diversity, vaccine development, and drug design.
Collapse
Affiliation(s)
- Rajnish Kumar
- Amity Institute of Biotechnology, Amity University Uttar Pradesh Lucknow Campus, Lucknow, Uttar Pradesh, India
- Department of Veterinary Medicine and Surgery, College of Veterinary Medicine, University of Missouri, Columbia, MO, USA
| | - Garima Yadav
- Amity Institute of Biotechnology, Amity University Uttar Pradesh Lucknow Campus, Lucknow, Uttar Pradesh, India
| | - Mohammed Kuddus
- Department of Biochemistry, College of Medicine, University of Hail, Hail, Saudi Arabia
| | - Ghulam Md Ashraf
- Department of Medical Laboratory Sciences, College of Health Sciences, and Sharjah Institute for Medical Research, University of Sharjah, Sharjah , 27272, United Arab Emirates
| | - Rachana Singh
- Amity Institute of Biotechnology, Amity University Uttar Pradesh Lucknow Campus, Lucknow, Uttar Pradesh, India.
| |
Collapse
|
19
|
Shariati A, Arshadi M, Khosrojerdi MA, Abedinzadeh M, Ganjalishahi M, Maleki A, Heidary M, Khoshnood S. The resistance mechanisms of bacteria against ciprofloxacin and new approaches for enhancing the efficacy of this antibiotic. Front Public Health 2022; 10:1025633. [PMID: 36620240 PMCID: PMC9815622 DOI: 10.3389/fpubh.2022.1025633] [Citation(s) in RCA: 51] [Impact Index Per Article: 25.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2022] [Accepted: 11/22/2022] [Indexed: 12/24/2022] Open
Abstract
For around three decades, the fluoroquinolone (FQ) antibiotic ciprofloxacin has been used to treat a range of diseases, including chronic otorrhea, endocarditis, lower respiratory tract, gastrointestinal, skin and soft tissue, and urinary tract infections. Ciprofloxacin's main mode of action is to stop DNA replication by blocking the A subunit of DNA gyrase and having an extra impact on the substances in cell walls. Available in intravenous and oral formulations, ciprofloxacin reaches therapeutic concentrations in the majority of tissues and bodily fluids with a low possibility for side effects. Despite the outstanding qualities of this antibiotic, Salmonella typhi, Staphylococcus aureus, Escherichia coli, and Pseudomonas aeruginosa have all shown an increase in ciprofloxacin resistance over time. The rise of infections that are resistant to ciprofloxacin shows that new pharmacological synergisms and derivatives are required. To this end, ciprofloxacin may be more effective against the biofilm community of microorganisms and multi-drug resistant isolates when combined with a variety of antibacterial agents, such as antibiotics from various classes, nanoparticles, natural products, bacteriophages, and photodynamic therapy. This review focuses on the resistance mechanisms of bacteria against ciprofloxacin and new approaches for enhancing its efficacy.
Collapse
Affiliation(s)
- Aref Shariati
- Molecular and Medicine Research Center, Khomein University of Medical Sciences, Khomein, Iran
| | - Maniya Arshadi
- Infectious and Tropical Diseases Research Center, Health Research Institute, Ahvaz Jundishapur University of Medical Sciences, Ahvaz, Iran,Department of Microbiology, School of Medicine, Ahvaz Jundishapur University of Medical Sciences, Ahvaz, Iran
| | | | - Mostafa Abedinzadeh
- Student Research Committee, Sabzevar University of Medical Sciences, Sabzevar, Iran
| | - Mahsa Ganjalishahi
- Student Research Committee, Sabzevar University of Medical Sciences, Sabzevar, Iran
| | - Abbas Maleki
- Clinical Microbiology Research Center, Ilam University of Medical Sciences, Ilam, Iran
| | - Mohsen Heidary
- Department of Laboratory Sciences, School of Paramedical Sciences, Sabzevar University of Medical Sciences, Sabzevar, Iran,Cellular and Molecular Research Center, Sabzevar University of Medical Sciences, Sabzevar, Iran,*Correspondence: Mohsen Heidary
| | - Saeed Khoshnood
- Clinical Microbiology Research Center, Ilam University of Medical Sciences, Ilam, Iran,Student Research Committee, Ilam University of Medical Sciences, Ilam, Iran,Saeed Khoshnood
| |
Collapse
|
20
|
Khatamifard SK, Chowdhury Z, Pande N, Razaviyayn M, Kim C, Karpuzcu UR. GeNVoM: Read Mapping Near Non-Volatile Memory. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:3482-3496. [PMID: 34613917 DOI: 10.1109/tcbb.2021.3118018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
DNA sequencing is the physical/biochemical process of identifying the location of the four bases (Adenine, Guanine, Cytosine, Thymine) in a DNA strand. As semiconductor technology revolutionized computing, modern DNA sequencing technology (termed Next Generation Sequencing, NGS) revolutionized genomic research. As a result, modern NGS platforms can sequence hundreds of millions of short DNA fragments in parallel. The sequenced DNA fragments, representing the output of NGS platforms, are termed reads. Besides genomic variations, NGS imperfections induce noise in reads. Mapping each read to (the most similar portion of) a reference genome of the same species, i.e., read mapping, is a common critical first step in a diverse set of emerging bioinformatics applications. Mapping represents a search-heavy memory-intensive similarity matching problem, therefore, can greatly benefit from near-memory processing. Intuition suggests using fast associative search enabled by Ternary Content Addressable Memory (TCAM) by construction. However, the excessive energy consumption and lack of support for similarity matching (under NGS and genomic variation induced noise) renders direct application of TCAM infeasible, irrespective of volatility, where only non-volatile TCAM can accommodate the large memory footprint in an area-efficient way. This paper introduces GeNVoM, a scalable, energy-efficient and high-throughput solution. Instead of optimizing an algorithm developed for general-purpose computers or GPUs, GeNVoM rethinks the algorithm and non-volatile TCAM-based accelerator design together from the ground up. Thereby GeNVoM can improve the throughput by up to 3.67×; the energy consumption, by up to 1.36×, when compared to an ASIC baseline, which represents one of the highest-throughput implementations known.
Collapse
|
21
|
Kim JI, Maguire F, Tsang KK, Gouliouris T, Peacock SJ, McAllister TA, McArthur AG, Beiko RG. Machine Learning for Antimicrobial Resistance Prediction: Current Practice, Limitations, and Clinical Perspective. Clin Microbiol Rev 2022; 35:e0017921. [PMID: 35612324 PMCID: PMC9491192 DOI: 10.1128/cmr.00179-21] [Citation(s) in RCA: 38] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
Antimicrobial resistance (AMR) is a global health crisis that poses a great threat to modern medicine. Effective prevention strategies are urgently required to slow the emergence and further dissemination of AMR. Given the availability of data sets encompassing hundreds or thousands of pathogen genomes, machine learning (ML) is increasingly being used to predict resistance to different antibiotics in pathogens based on gene content and genome composition. A key objective of this work is to advocate for the incorporation of ML into front-line settings but also highlight the further refinements that are necessary to safely and confidently incorporate these methods. The question of what to predict is not trivial given the existence of different quantitative and qualitative laboratory measures of AMR. ML models typically treat genes as independent predictors, with no consideration of structural and functional linkages; they also may not be accurate when new mutational variants of known AMR genes emerge. Finally, to have the technology trusted by end users in public health settings, ML models need to be transparent and explainable to ensure that the basis for prediction is clear. We strongly advocate that the next set of AMR-ML studies should focus on the refinement of these limitations to be able to bridge the gap to diagnostic implementation.
Collapse
Affiliation(s)
- Jee In Kim
- Faculty of Computer Science, Dalhousie University, Halifax, Canada
- Institute for Comparative Genomics, Dalhousie University, Halifax, Canada
- Lethbridge Research and Development Centre, Agriculture and Agri-Food Canada, Lethbridge, Canada
| | - Finlay Maguire
- Faculty of Computer Science, Dalhousie University, Halifax, Canada
- Institute for Comparative Genomics, Dalhousie University, Halifax, Canada
- Department of Community Health and Epidemiology, Faculty of Medicine, Dalhousie University, Halifax, Canada
- Shared Hospital Laboratory, Toronto, Canada
- Sunnybrook Research Institute, Sunnybrook Health Sciences Centre, Toronto, Canada
| | - Kara K. Tsang
- London School of Hygiene & Tropical Medicine, London, United Kingdom
| | - Theodore Gouliouris
- Department of Medicine, University of Cambridge, Cambridge, United Kingdom
- Clinical Microbiology and Public Health Laboratory, Public Health England, Cambridge, United Kingdom
- Cambridge University Hospitals NHS Foundation Trust, Cambridge, United Kingdom
| | - Sharon J. Peacock
- Department of Medicine, University of Cambridge, Cambridge, United Kingdom
| | - Tim A. McAllister
- Lethbridge Research and Development Centre, Agriculture and Agri-Food Canada, Lethbridge, Canada
| | - Andrew G. McArthur
- David Braley Centre for Antibiotic Discovery, McMaster University, Hamilton, Canada
- M.G. DeGroote Institute for Infectious Disease Research, McMaster University, Hamilton, Canada
- Department of Biochemistry and Biomedical Sciences, McMaster University, Hamilton, Canada
| | - Robert G. Beiko
- Faculty of Computer Science, Dalhousie University, Halifax, Canada
- Institute for Comparative Genomics, Dalhousie University, Halifax, Canada
| |
Collapse
|
22
|
Liang S, Ma J, Wang G, Shao J, Li J, Deng H, Wang C, Li W. The Application of Artificial Intelligence in the Diagnosis and Drug Resistance Prediction of Pulmonary Tuberculosis. Front Med (Lausanne) 2022; 9:935080. [PMID: 35966878 PMCID: PMC9366014 DOI: 10.3389/fmed.2022.935080] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2022] [Accepted: 06/13/2022] [Indexed: 11/30/2022] Open
Abstract
With the increasing incidence and mortality of pulmonary tuberculosis, in addition to tough and controversial disease management, time-wasting and resource-limited conventional approaches to the diagnosis and differential diagnosis of tuberculosis are still awkward issues, especially in countries with high tuberculosis burden and backwardness. In the meantime, the climbing proportion of drug-resistant tuberculosis poses a significant hazard to public health. Thus, auxiliary diagnostic tools with higher efficiency and accuracy are urgently required. Artificial intelligence (AI), which is not new but has recently grown in popularity, provides researchers with opportunities and technical underpinnings to develop novel, precise, rapid, and automated implements for pulmonary tuberculosis care, including but not limited to tuberculosis detection. In this review, we aimed to introduce representative AI methods, focusing on deep learning and radiomics, followed by definite descriptions of the state-of-the-art AI models developed using medical images and genetic data to detect pulmonary tuberculosis, distinguish the infection from other pulmonary diseases, and identify drug resistance of tuberculosis, with the purpose of assisting physicians in deciding the appropriate therapeutic schedule in the early stage of the disease. We also enumerated the challenges in maximizing the impact of AI in this field such as generalization and clinical utility of the deep learning models.
Collapse
Affiliation(s)
- Shufan Liang
- Department of Respiratory and Critical Care Medicine, Med-X Center for Manufacturing, Frontiers Science Center for Disease-Related Molecular Network, West China School of Medicine, West China Hospital, Sichuan University, Chengdu, China
- Precision Medicine Key Laboratory of Sichuan Province, Precision Medicine Research Center, West China Hospital, Sichuan University, Chengdu, China
| | - Jiechao Ma
- AI Lab, Deepwise Healthcare, Beijing, China
| | - Gang Wang
- Precision Medicine Key Laboratory of Sichuan Province, Precision Medicine Research Center, West China Hospital, Sichuan University, Chengdu, China
| | - Jun Shao
- Department of Respiratory and Critical Care Medicine, Med-X Center for Manufacturing, Frontiers Science Center for Disease-Related Molecular Network, West China School of Medicine, West China Hospital, Sichuan University, Chengdu, China
| | - Jingwei Li
- Department of Respiratory and Critical Care Medicine, Med-X Center for Manufacturing, Frontiers Science Center for Disease-Related Molecular Network, West China School of Medicine, West China Hospital, Sichuan University, Chengdu, China
| | - Hui Deng
- Department of Respiratory and Critical Care Medicine, Med-X Center for Manufacturing, Frontiers Science Center for Disease-Related Molecular Network, West China School of Medicine, West China Hospital, Sichuan University, Chengdu, China
- Precision Medicine Key Laboratory of Sichuan Province, Precision Medicine Research Center, West China Hospital, Sichuan University, Chengdu, China
- *Correspondence: Hui Deng,
| | - Chengdi Wang
- Department of Respiratory and Critical Care Medicine, Med-X Center for Manufacturing, Frontiers Science Center for Disease-Related Molecular Network, West China School of Medicine, West China Hospital, Sichuan University, Chengdu, China
- Chengdi Wang,
| | - Weimin Li
- Department of Respiratory and Critical Care Medicine, Med-X Center for Manufacturing, Frontiers Science Center for Disease-Related Molecular Network, West China School of Medicine, West China Hospital, Sichuan University, Chengdu, China
- Weimin Li,
| |
Collapse
|
23
|
Green AG, Yoon CH, Chen ML, Ektefaie Y, Fina M, Freschi L, Gröschel MI, Kohane I, Beam A, Farhat M. A convolutional neural network highlights mutations relevant to antimicrobial resistance in Mycobacterium tuberculosis. Nat Commun 2022; 13:3817. [PMID: 35780211 PMCID: PMC9250494 DOI: 10.1038/s41467-022-31236-0] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2021] [Accepted: 06/10/2022] [Indexed: 11/30/2022] Open
Abstract
Long diagnostic wait times hinder international efforts to address antibiotic resistance in M. tuberculosis. Pathogen whole genome sequencing, coupled with statistical and machine learning models, offers a promising solution. However, generalizability and clinical adoption have been limited by a lack of interpretability, especially in deep learning methods. Here, we present two deep convolutional neural networks that predict antibiotic resistance phenotypes of M. tuberculosis isolates: a multi-drug CNN (MD-CNN), that predicts resistance to 13 antibiotics based on 18 genomic loci, with AUCs 82.6-99.5% and higher sensitivity than state-of-the-art methods; and a set of 13 single-drug CNNs (SD-CNN) with AUCs 80.1-97.1% and higher specificity than the previous state-of-the-art. Using saliency methods to evaluate the contribution of input sequence features to the SD-CNN predictions, we identify 18 sites in the genome not previously associated with resistance. The CNN models permit functional variant discovery, biologically meaningful interpretation, and clinical applicability.
Collapse
Affiliation(s)
- Anna G Green
- Department of Biomedical Informatics, Harvard Medical School, 25 Shattuck St, Boston, MA, 02115, USA
| | - Chang Ho Yoon
- Department of Biomedical Informatics, Harvard Medical School, 25 Shattuck St, Boston, MA, 02115, USA
- Big Data Institute, Nuffield Department of Population Health, University of Oxford, Oxford, OX37LF, UK
| | - Michael L Chen
- Department of Biomedical Informatics, Harvard Medical School, 25 Shattuck St, Boston, MA, 02115, USA
- Stanford University School of Medicine, 291 Campus Dr, Stanford, CA, 94305, USA
| | - Yasha Ektefaie
- Department of Biomedical Informatics, Harvard Medical School, 25 Shattuck St, Boston, MA, 02115, USA
| | - Mack Fina
- Harvard College, Cambridge, MA, 02138, USA
| | - Luca Freschi
- Department of Biomedical Informatics, Harvard Medical School, 25 Shattuck St, Boston, MA, 02115, USA
| | - Matthias I Gröschel
- Department of Biomedical Informatics, Harvard Medical School, 25 Shattuck St, Boston, MA, 02115, USA
| | - Isaac Kohane
- Department of Biomedical Informatics, Harvard Medical School, 25 Shattuck St, Boston, MA, 02115, USA
| | - Andrew Beam
- Department of Biomedical Informatics, Harvard Medical School, 25 Shattuck St, Boston, MA, 02115, USA.
- Department of Epidemiology, Harvard T.H. Chan School of Public Health, 677 Huntington Ave, Boston, MA, 02115, USA.
| | - Maha Farhat
- Department of Biomedical Informatics, Harvard Medical School, 25 Shattuck St, Boston, MA, 02115, USA.
- Division of Pulmonary & Critical Care, Massachusetts General Hospital, 55 Fruit St, Boston, MA, 02114, USA.
| |
Collapse
|
24
|
Tzelves L, Lazarou L, Feretzakis G, Kalles D, Mourmouris P, Loupelis E, Basourakos S, Berdempes M, Manolitsis I, Mitsogiannis I, Skolarikos A, Varkarakis I. Using machine learning techniques to predict antimicrobial resistance in stone disease patients. World J Urol 2022; 40:1731-1736. [PMID: 35616713 DOI: 10.1007/s00345-022-04043-x] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2022] [Accepted: 05/02/2022] [Indexed: 10/18/2022] Open
Abstract
PURPOSE Artificial intelligence is part of our daily life and machine learning techniques offer possibilities unknown until now in medicine. This study aims to offer an evaluation of the performance of machine learning (ML) techniques, for predicting bacterial resistance in a urology department. METHODS Data were retrieved from laboratory information system (LIS) concerning 239 patients with urolithiasis hospitalized in the urology department of a tertiary hospital over a 1-year period (2019): age, gender, Gram stain (positive, negative), bacterial species, sample type, antibiotics and antimicrobial susceptibility. In our experiments, we compared several classifiers following a tenfold cross-validation approach on 2 different versions of our dataset; the first contained only information of Gram stain, while the second had knowledge of bacterial species. RESULTS The best results in the balanced dataset containing Gram stain, achieve a weighted average receiver operator curve (ROC) area of 0.768 and F-measure of 0.708, using a multinomial logistic regression model with a ridge estimator. The corresponding results of the balanced dataset, that contained bacterial species, achieve a weighted average ROC area of 0.874 and F-measure of 0.783, with a bagging classifier. CONCLUSIONS Artificial intelligence technology can be used for making predictions on antibiotic resistance patterns when knowing Gram staining with an accuracy of 77% and nearly 87% when identifying specific microorganisms. This knowledge can aid urologists prescribing the appropriate antibiotic 24-48 h before test results are known.
Collapse
Affiliation(s)
- Lazaros Tzelves
- 2nd Department of Urology, Sismanogleio General Hospital, National and Kapodistrian University of Athens, Sismanogleiou 37, Marousi, 15126, Athens, Greece
| | - Lazaros Lazarou
- 2nd Department of Urology, Sismanogleio General Hospital, National and Kapodistrian University of Athens, Sismanogleiou 37, Marousi, 15126, Athens, Greece
| | - Georgios Feretzakis
- School of Science and Technology, Hellenic Open University, Patras, Greece.,Department of Quality Control, Research and Continuing Education, Sismanogleio General Hospital, Marousi, Greece.,Information Technologies Department, Sismanogleio General Hospital, Marousi, Greece
| | - Dimitris Kalles
- School of Science and Technology, Hellenic Open University, Patras, Greece
| | - Panagiotis Mourmouris
- 2nd Department of Urology, Sismanogleio General Hospital, National and Kapodistrian University of Athens, Sismanogleiou 37, Marousi, 15126, Athens, Greece
| | - Evangelos Loupelis
- Information Technologies Department, Sismanogleio General Hospital, Marousi, Greece
| | - Spyridon Basourakos
- Department of Urology, New York Presbyterian Hospital/Weill Cornell Medicine, New York, NY, USA
| | - Marinos Berdempes
- 2nd Department of Urology, Sismanogleio General Hospital, National and Kapodistrian University of Athens, Sismanogleiou 37, Marousi, 15126, Athens, Greece
| | - Ioannis Manolitsis
- 2nd Department of Urology, Sismanogleio General Hospital, National and Kapodistrian University of Athens, Sismanogleiou 37, Marousi, 15126, Athens, Greece.
| | - Iraklis Mitsogiannis
- 2nd Department of Urology, Sismanogleio General Hospital, National and Kapodistrian University of Athens, Sismanogleiou 37, Marousi, 15126, Athens, Greece
| | - Andreas Skolarikos
- 2nd Department of Urology, Sismanogleio General Hospital, National and Kapodistrian University of Athens, Sismanogleiou 37, Marousi, 15126, Athens, Greece
| | - Ioannis Varkarakis
- 2nd Department of Urology, Sismanogleio General Hospital, National and Kapodistrian University of Athens, Sismanogleiou 37, Marousi, 15126, Athens, Greece
| |
Collapse
|
25
|
Jiang Z, Lu Y, Liu Z, Wu W, Xu X, Dinnyés A, Yu Z, Chen L, Sun Q. Drug resistance prediction and resistance genes identification in Mycobacterium tuberculosis based on a hierarchical attentive neural network utilizing genome-wide variants. Brief Bioinform 2022; 23:6553603. [PMID: 35325021 DOI: 10.1093/bib/bbac041] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2021] [Revised: 01/18/2022] [Accepted: 01/27/2022] [Indexed: 01/25/2023] Open
Abstract
Prediction of antimicrobial resistance based on whole-genome sequencing data has attracted greater attention due to its rapidity and convenience. Numerous machine learning-based studies have used genetic variants to predict drug resistance in Mycobacterium tuberculosis (MTB), assuming that variants are homogeneous, and most of these studies, however, have ignored the essential correlation between variants and corresponding genes when encoding variants, and used a limited number of variants as prediction input. In this study, taking advantage of genome-wide variants for drug-resistance prediction and inspired by natural language processing, we summarize drug resistance prediction into document classification, in which variants are considered as words, mutated genes in an isolate as sentences, and an isolate as a document. We propose a novel hierarchical attentive neural network model (HANN) that helps discover drug resistance-related genes and variants and acquire more interpretable biological results. It captures the interaction among variants in a mutated gene as well as among mutated genes in an isolate. Our results show that for the four first-line drugs of isoniazid (INH), rifampicin (RIF), ethambutol (EMB) and pyrazinamide (PZA), the HANN achieves the optimal area under the ROC curve of 97.90, 99.05, 96.44 and 95.14% and the optimal sensitivity of 94.63, 96.31, 92.56 and 87.05%, respectively. In addition, without any domain knowledge, the model identifies drug resistance-related genes and variants consistent with those confirmed by previous studies, and more importantly, it discovers one more potential drug-resistance-related gene.
Collapse
Affiliation(s)
- Zhonghua Jiang
- Key Laboratory of Bio-resources and Eco-environment of the Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, Sichuan 610064, China
| | - Yongmei Lu
- College of Computer Science, Sichuan University, Chengdu, Sichuan 610065, China
| | - Zhuochong Liu
- Key Laboratory of Bio-resources and Eco-environment of the Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, Sichuan 610064, China
| | - Wei Wu
- Key Laboratory of Bio-resources and Eco-environment of the Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, Sichuan 610064, China
| | - Xinyi Xu
- Key Laboratory of Bio-resources and Eco-environment of the Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, Sichuan 610064, China
| | - András Dinnyés
- BioTalentum Ltd. Aulich Lajos str. 26. 2100 Gödöllõ, Hungary
| | - Zhonghua Yu
- College of Computer Science, Sichuan University, Chengdu, Sichuan 610065, China
| | - Li Chen
- College of Computer Science, Sichuan University, Chengdu, Sichuan 610065, China
| | - Qun Sun
- Key Laboratory of Bio-resources and Eco-environment of the Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, Sichuan 610064, China
| |
Collapse
|
26
|
Yasir M, Mustafa Karim A, Kausar Malik S, Bajaffer AA, Azhar EI. Prediction of Antimicrobial Minimal Inhibitory Concentrations for Neisseria gonorrhoeae using Machine Learning Models. Saudi J Biol Sci 2022; 29:3687-3693. [PMID: 35844400 PMCID: PMC9280306 DOI: 10.1016/j.sjbs.2022.02.047] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2022] [Revised: 02/12/2022] [Accepted: 02/27/2022] [Indexed: 11/26/2022] Open
Abstract
The lowest concentration of an antimicrobial agent that can inhibit the visible growth of a microorganism after overnight incubation is called as minimum inhibitory concentration (MIC) and the drug prescriptions are made on the basis of MIC data to ensure successful treatment outcomes. Therefore, reliable antimicrobial susceptibility data is crucial, and it will help clinicians about which drug to prescribe. Although few prediction studies based on strategies have been conducted, however, no single machine learning (ML) modelling has been carried out to predict MICs in N. gonorrhoeae. In this study, we propose a ML based approach that can predict MICs of a specific antibiotic using unitigs sequences data. We retrieved N. gonorrhoeae genomes from European Nucleotide Archive and NCBI and analysed them combined with their respective MIC data for cefixime, ciprofloxacin, and azithromycin and then we constructed unitigs by using de Brujin graphs. We built and compared 35 different ML regression models to predict MICs. Our results demonstrate that RandomForest and CATBoost models showed best performance in predicting MICs of the three antibiotics. The coefficient of determination, R2, (a statistical measure of how well the regression predictions approximate the real data points) for cefixime, ciprofloxacin, and azithromycin was 0.75787, 0.77241, and 0.79009 respectively using RandomForest. For CATBoost model, the R2 value was 0.74570, 0.77393, and 0.79317 for cefixime, ciprofloxacin, and azithromycin respectively. Lastly, using feature importance, we explore the important genomic regions identified by the models for predicting MICs. The major mutations which are responsible for resistance against these three antibiotics were chosen by ML models as a top feature in case of each antibiotics. CATBoost, DecisionTree, GradientBoosting, and RandomForest regression models chose the same unitigs which are responsible for resistance. This unitigs-based strategy for developing models for MIC prediction, clinical diagnostics, and surveillance can be applicable for other critical bacterial pathogens.
Collapse
|
27
|
Müller SJ, Meraba RL, Dlamini GS, Mapiye DS. First-line drug resistance profiling of Mycobacterium tuberculosis: a machine learning approach. AMIA ... ANNUAL SYMPOSIUM PROCEEDINGS. AMIA SYMPOSIUM 2022; 2021:891-899. [PMID: 35309001 PMCID: PMC8861754] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
The persistence and emergence of new multi-drug resistant Mycobacterium tuberculosis (M. tb) strains continues to advance the devastating tuberculosis (TB) epidemic. Robust systems are needed to accurately and rapidly perform drug-resistance profiling, and machine learning (ML) methods combined with genomic sequence data may provide novel insights into drug-resistance mechanisms. Using 372 M. tb isolates, the combined utility of ML and bioinformatics to perform drug-resistance profiling is demonstrated. SNPs, InDels, and dinucleotide frequencies are explored as input features for three ML models, namely Decision Trees, Random Forest, and the eXtreme Gradient Boosted model. Using SNPs and InDels, all three models performed equally well yielding a 99% accuracy, 97% recall, and 99% F1-score. Using dinucleotide frequencies, the XGBoost algorithm was superior with a 97% accuracy, 94% recall and 97% F1-score. This study validates the use of variants and presents dinucleotide features as another effective feature encoding method for ML-based phenotype classification.
Collapse
|
28
|
Kuang X, Wang F, Hernandez KM, Zhang Z, Grossman RL. Accurate and rapid prediction of tuberculosis drug resistance from genome sequence data using traditional machine learning algorithms and CNN. Sci Rep 2022; 12:2427. [PMID: 35165358 PMCID: PMC8844416 DOI: 10.1038/s41598-022-06449-4] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2021] [Accepted: 01/31/2022] [Indexed: 12/04/2022] Open
Abstract
Effective and timely antibiotic treatment depends on accurate and rapid in silico antimicrobial-resistant (AMR) predictions. Existing statistical rule-based Mycobacterium tuberculosis (MTB) drug resistance prediction methods using bacterial genomic sequencing data often achieve varying results: high accuracy on some antibiotics but relatively low accuracy on others. Traditional machine learning (ML) approaches have been applied to classify drug resistance for MTB and have shown more stable performance. However, there is no study that uses deep learning architecture like Convolutional Neural Network (CNN) on a large and diverse cohort of MTB samples for AMR prediction. We developed 24 binary classifiers of MTB drug resistance status across eight anti-MTB drugs and three different ML algorithms: logistic regression, random forest and 1D CNN using a training dataset of 10,575 MTB isolates collected from 16 countries across six continents, where an extended pan-genome reference was used for detecting genetic features. Our 1D CNN architecture was designed to integrate both sequential and non-sequential features. In terms of F1-scores, 1D CNN models are our best classifiers that are also more accurate and stable than the state-of-the-art rule-based tool Mykrobe predictor (81.1 to 93.8%, 93.7 to 96.2%, 93.1 to 94.8%, 95.9 to 97.2% and 97.1 to 98.2% for ethambutol, rifampicin, pyrazinamide, isoniazid and ofloxacin respectively). We applied filter-based feature selection to find AMR relevant features. All selected variant features are AMR-related ones in CARD database. 78.8% of them are also in the catalogue of MTB mutations that were recently identified as drug resistance-associated ones by WHO. To facilitate ML model development for AMR prediction, we packaged every step into an automated pipeline and shared the source code at https://github.com/KuangXY3/MTB-AMR-classification-CNN.
Collapse
|
29
|
Deelder W, Napier G, Campino S, Palla L, Phelan J, Clark TG. A modified decision tree approach to improve the prediction and mutation discovery for drug resistance in Mycobacterium tuberculosis. BMC Genomics 2022; 23:46. [PMID: 35016609 PMCID: PMC8753810 DOI: 10.1186/s12864-022-08291-4] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2021] [Accepted: 01/03/2022] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Drug resistant Mycobacterium tuberculosis is complicating the effective treatment and control of tuberculosis disease (TB). With the adoption of whole genome sequencing as a diagnostic tool, machine learning approaches are being employed to predict M. tuberculosis resistance and identify underlying genetic mutations. However, machine learning approaches can overfit and fail to identify causal mutations if they are applied out of the box and not adapted to the disease-specific context. We introduce a machine learning approach that is customized to the TB setting, which extracts a library of genomic variants re-occurring across individual studies to improve genotypic profiling. RESULTS We developed a customized decision tree approach, called Treesist-TB, that performs TB drug resistance prediction by extracting and evaluating genomic variants across multiple studies. The application of Treesist-TB to rifampicin (RIF), isoniazid (INH) and ethambutol (EMB) drugs, for which resistance mutations are known, demonstrated a level of predictive accuracy similar to the widely used TB-Profiler tool (Treesist-TB vs. TB-Profiler tool: RIF 97.5% vs. 97.6%; INH 96.8% vs. 96.5%; EMB 96.8% vs. 95.8%). Application of Treesist-TB to less understood second-line drugs of interest, ethionamide (ETH), cycloserine (CYS) and para-aminosalisylic acid (PAS), led to the identification of new variants (52, 6 and 11, respectively), with a high number absent from the TB-Profiler library (45, 4, and 6, respectively). Thereby, Treesist-TB had improved predictive sensitivity (Treesist-TB vs. TB-Profiler tool: PAS 64.3% vs. 38.8%; CYS 45.3% vs. 30.7%; ETH 72.1% vs. 71.1%). CONCLUSION Our work reinforces the utility of machine learning for drug resistance prediction, while highlighting the need to customize approaches to the disease-specific context. Through applying a modified decision learning approach (Treesist-TB) across a range of anti-TB drugs, we identified plausible resistance-encoding genomic variants with high predictive ability, whilst potentially overcoming the overfitting challenges that can affect standard machine learning applications.
Collapse
Affiliation(s)
- Wouter Deelder
- London School of Hygiene & Tropical Medicine, Keppel Street, London, WC1E 7HT, UK
- Dalberg Advisors, 7 Rue de Chantepoulet, CH-1201, Geneva, Switzerland
| | - Gary Napier
- London School of Hygiene & Tropical Medicine, Keppel Street, London, WC1E 7HT, UK
| | - Susana Campino
- London School of Hygiene & Tropical Medicine, Keppel Street, London, WC1E 7HT, UK
| | - Luigi Palla
- London School of Hygiene & Tropical Medicine, Keppel Street, London, WC1E 7HT, UK
- Department of Public Health and Infectious Diseases, University of Rome La Sapienza, Rome, Italy
| | - Jody Phelan
- London School of Hygiene & Tropical Medicine, Keppel Street, London, WC1E 7HT, UK
| | - Taane G Clark
- London School of Hygiene & Tropical Medicine, Keppel Street, London, WC1E 7HT, UK.
- Department of Infection Biology, Faculty of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine, London, UK.
| |
Collapse
|
30
|
Sharma A, Machado E, Lima KVB, Suffys PN, Conceição EC. Tuberculosis drug resistance profiling based on machine learning: A literature review. Braz J Infect Dis 2022; 26:102332. [PMID: 35176257 PMCID: PMC9387475 DOI: 10.1016/j.bjid.2022.102332] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2021] [Revised: 12/18/2021] [Accepted: 01/01/2022] [Indexed: 11/30/2022] Open
Abstract
Tuberculosis (TB), caused by Mycobacterium tuberculosis (MTB), is one of the top 10 causes of death worldwide. Drug-resistant tuberculosis (DR-TB) poses a major threat to the World Health Organization's "End TB" strategy which has defined its target as the year 2035. In 2019, there were close to 0.5 million cases of DRTB, of which 78% were resistant to multiple TB drugs. The traditional culture-based drug susceptibility test (DST - the current gold standard) often takes multiple weeks and the necessary laboratory facilities are not readily available in low-income countries. Whole genome sequencing (WGS) technology is rapidly becoming an important tool in clinical and research applications including transmission detection or prediction of DR-TB. For the latter, many tools have recently been developed using curated database(s) of known resistance conferring mutations. However, documenting all the mutations and their effect is a time-taking and a continuous process and therefore Machine Learning (ML) techniques can be useful for predicting the presence of DR-TB based on WGS data. This can pave the way to an earlier detection of drug resistance and consequently more efficient treatment when compared to the traditional DST.
Collapse
Affiliation(s)
- Abhinav Sharma
- Faculty of Engineering and Technology, Liverpool John Moores University (LJMU), Liverpool, United Kingdom
| | - Edson Machado
- Fundação Oswaldo Cruz-Fiocruz, Instituto Oswaldo Cruz, Laboratório de Biologia Molecular Aplicada a Micobactérias, Rio de Janeiro, RJ, Brazil
| | - Karla Valeria Batista Lima
- Instituto Evandro Chagas, Seção de Bacteriologia e Micologia, Ananindeua, PA, Brazil
- Universidade do Estado do Pará, Instituto de Ciências Biológicas e da Saúde, Pós-Graduação em Biologia Parasitária na Amazônia, Belém, PA, Brazil
| | - Philip Noel Suffys
- Fundação Oswaldo Cruz-Fiocruz, Instituto Oswaldo Cruz, Laboratório de Biologia Molecular Aplicada a Micobactérias, Rio de Janeiro, RJ, Brazil
| | - Emilyn Costa Conceição
- Programa de Pós-graduação em Pesquisa Clínica e Doenças Infecciosas, Instituto Nacional de Infectologia Evandro Chagas, Fundação Oswaldo Cruz, Rio de Janeiro, RJ, Brazil
- Department of Science and Innovation - National Research Foundation Centre of Excellence for Biomedical Tuberculosis Research, South African Medical Research Council Centre for Tuberculosis Research, Division of Molecular Biology and Human Genetics, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| |
Collapse
|
31
|
Ren Y, Chakraborty T, Doijad S, Falgenhauer L, Falgenhauer J, Goesmann A, Schwengers O, Heider D. Multi-label classification for multi-drug resistance prediction of Escherichia coli. Comput Struct Biotechnol J 2022; 20:1264-1270. [PMID: 35317240 PMCID: PMC8918850 DOI: 10.1016/j.csbj.2022.03.007] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2022] [Revised: 03/08/2022] [Accepted: 03/08/2022] [Indexed: 11/03/2022] Open
|
32
|
Yang Y, Walker TM, Kouchaki S, Wang C, Peto TEA, Crook DW, Clifton DA. An end-to-end heterogeneous graph attention network for Mycobacterium tuberculosis drug-resistance prediction. Brief Bioinform 2021; 22:6355133. [PMID: 34414415 PMCID: PMC8575050 DOI: 10.1093/bib/bbab299] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2021] [Revised: 06/28/2021] [Accepted: 07/16/2021] [Indexed: 11/23/2022] Open
Abstract
Antimicrobial resistance (AMR) poses a threat to global public health. To mitigate the impacts of AMR, it is important to identify the molecular mechanisms of AMR and thereby determine optimal therapy as early as possible. Conventional machine learning-based drug-resistance analyses assume genetic variations to be homogeneous, thus not distinguishing between coding and intergenic sequences. In this study, we represent genetic data from Mycobacterium tuberculosis as a graph, and then adopt a deep graph learning method—heterogeneous graph attention network (‘HGAT–AMR’)—to predict anti-tuberculosis (TB) drug resistance. The HGAT–AMR model is able to accommodate incomplete phenotypic profiles, as well as provide ‘attention scores’ of genes and single nucleotide polymorphisms (SNPs) both at a population level and for individual samples. These scores encode the inputs, which the model is ‘paying attention to’ in making its drug resistance predictions. The results show that the proposed model generated the best area under the receiver operating characteristic (AUROC) for isoniazid and rifampicin (98.53 and 99.10%), the best sensitivity for three first-line drugs (94.91% for isoniazid, 96.60% for ethambutol and 90.63% for pyrazinamide), and maintained performance when the data were associated with incomplete phenotypes (i.e. for those isolates for which phenotypic data for some drugs were missing). We also demonstrate that the model successfully identifies genes and SNPs associated with drug resistance, mitigating the impact of resistance profile while considering particular drug resistance, which is consistent with domain knowledge.
Collapse
Affiliation(s)
- Yang Yang
- Institute of Biomedical Engineering, Department of Engineering Science, University of Oxford, Oxford, OX3 7DQ, UK.,Oxford-Suzhou Centre for Advanced Research, Suzhou, 215123, China
| | - Timothy M Walker
- Oxford University Clinical Research Unit, Ho Chi Minh City, Vietnam
| | - Samaneh Kouchaki
- Centre for vision, Speech, and Signal processing, University of Surrey, Guildford, UK
| | - Chenyang Wang
- Institute of Biomedical Engineering, Department of Engineering Science, University of Oxford, Oxford, OX3 7DQ, UK
| | - Timothy E A Peto
- Nuffield Department of Medicine, University of Oxford, John Radcliffe Hospital Headley Way, OX3 9DU, Oxford, UK
| | - Derrick W Crook
- Nuffield Department of Medicine, University of Oxford, John Radcliffe Hospital Headley Way, OX3 9DU, Oxford, UK.,NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Headley Way Headington, OX3 9DU, Oxford, UK
| | | | - David A Clifton
- Institute of Biomedical Engineering, Department of Engineering Science, University of Oxford, Oxford, OX3 7DQ, UK.,Oxford-Suzhou Centre for Advanced Research, Suzhou, 215123, China
| |
Collapse
|
33
|
Li D, Wang Y, Hu W, Chen F, Zhao J, Chen X, Han L. Application of Machine Learning Classifier to Candida auris Drug Resistance Analysis. Front Cell Infect Microbiol 2021; 11:742062. [PMID: 34722336 PMCID: PMC8554202 DOI: 10.3389/fcimb.2021.742062] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2021] [Accepted: 09/22/2021] [Indexed: 12/30/2022] Open
Abstract
Candida auris (C. auris) is an emerging fungus associated with high morbidity. It has a unique transmission ability and is often resistant to multiple drugs. In this study, we evaluated the ability of different machine learning models to classify the drug resistance and predicted and ranked the drug resistance mutations of C. auris. Two C. auris strains were obtained. Combined with other 356 strains collected from the European Bioinformatics Institute (EBI) databases, the whole genome sequencing (WGS) data were analyzed by bioinformatics. Machine learning classifiers were used to build drug resistance models, which were evaluated and compared by various evaluation methods based on AUC value. Briefly, two strains were assigned to Clade III in the phylogenetic tree, which was consistent with previous studies; nevertheless, the phylogenetic tree was not completely consistent with the conclusion of clustering according to the geographical location discovered earlier. The clustering results of C. auris were related to its drug resistance. The resistance genes of C. auris were not under additional strong selection pressure, and the performance of different models varied greatly for different drugs. For drugs such as azoles and echinocandins, the models performed relatively well. In addition, two machine learning algorithms, based on the balanced test and imbalanced test, were designed and evaluated; for most drugs, the evaluation results on the balanced test set were better than on the imbalanced test set. The mutations strongly be associated with drug resistance of C. auris were predicted and ranked by Recursive Feature Elimination with Cross-Validation (RFECV) combined with a machine learning classifier. In addition to known drug resistance mutations, some new resistance mutations were predicted, such as Y501H and I466M mutation in the ERG11 gene and R278H mutation in the ERG10 gene, which may be associated with fluconazole (FCZ), micafungin (MCF), and amphotericin B (AmB) resistance, respectively; these mutations were in the “hot spot” regions of the ergosterol pathway. To sum up, this study suggested that machine learning classifiers are a useful and cost-effective method to identify fungal drug resistance-related mutations, which is of great significance for the research on the resistance mechanism of C. auris.
Collapse
Affiliation(s)
- Dingchen Li
- Department of Disinfection and Infection Control, Chinese People's Liberation Army (PLA) Center for Disease Control and Prevention, Beijing, China
| | - Yaru Wang
- Department of Disinfection and Infection Control, Chinese People's Liberation Army (PLA) Center for Disease Control and Prevention, Beijing, China.,School of Mathematics and Statistics, Shaanxi Normal University, Xi'an, China
| | - Wenjuan Hu
- Department of Disinfection and Infection Control, Chinese People's Liberation Army (PLA) Center for Disease Control and Prevention, Beijing, China.,School of Mathematics and Statistics, Shaanxi Normal University, Xi'an, China
| | - Fangyan Chen
- Department of Disinfection and Infection Control, Chinese People's Liberation Army (PLA) Center for Disease Control and Prevention, Beijing, China
| | - Jingya Zhao
- Department of Disinfection and Infection Control, Chinese People's Liberation Army (PLA) Center for Disease Control and Prevention, Beijing, China
| | - Xia Chen
- School of Mathematics and Statistics, Shaanxi Normal University, Xi'an, China
| | - Li Han
- Department of Disinfection and Infection Control, Chinese People's Liberation Army (PLA) Center for Disease Control and Prevention, Beijing, China
| |
Collapse
|
34
|
He S, Leanse LG, Feng Y. Artificial intelligence and machine learning assisted drug delivery for effective treatment of infectious diseases. Adv Drug Deliv Rev 2021; 178:113922. [PMID: 34461198 DOI: 10.1016/j.addr.2021.113922] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2021] [Revised: 07/14/2021] [Accepted: 08/09/2021] [Indexed: 12/23/2022]
Abstract
In the era of antimicrobial resistance, the prevalence of multidrug-resistant microorganisms that resist conventional antibiotic treatment has steadily increased. Thus, it is now unquestionable that infectious diseases are significant global burdens that urgently require innovative treatment strategies. Emerging studies have demonstrated that artificial intelligence (AI) can transform drug delivery to promote effective treatment of infectious diseases. In this review, we propose to evaluate the significance, essential principles, and popular tools of AI in drug delivery for infectious disease treatment. Specifically, we will focus on the achievements and key findings of current research, as well as the applications of AI on drug delivery throughout the whole antimicrobial treatment process, with an emphasis on drug development, treatment regimen optimization, drug delivery system and administration route design, and drug delivery outcome prediction. To that end, the challenges of AI in drug delivery for infectious disease treatments and their current solutions and future perspective will be presented and discussed.
Collapse
Affiliation(s)
- Sheng He
- Boston Children's Hospital, Harvard Medical School, Harvard University, Boston, MA, USA.
| | - Leon G Leanse
- Massachusetts General Hospital, Harvard Medical School, Harvard University, Boston, MA, USA
| | - Yanfang Feng
- Massachusetts General Hospital, Harvard Medical School, Harvard University, Boston, MA, USA.
| |
Collapse
|
35
|
Ren Y, Chakraborty T, Doijad S, Falgenhauer L, Falgenhauer J, Goesmann A, Hauschild AC, Schwengers O, Heider D. Prediction of antimicrobial resistance based on whole-genome sequencing and machine learning. Bioinformatics 2021; 38:325-334. [PMID: 34613360 PMCID: PMC8722762 DOI: 10.1093/bioinformatics/btab681] [Citation(s) in RCA: 44] [Impact Index Per Article: 14.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2021] [Revised: 08/27/2021] [Accepted: 09/24/2021] [Indexed: 02/03/2023] Open
Abstract
MOTIVATION Antimicrobial resistance (AMR) is one of the biggest global problems threatening human and animal health. Rapid and accurate AMR diagnostic methods are thus very urgently needed. However, traditional antimicrobial susceptibility testing (AST) is time-consuming, low throughput and viable only for cultivable bacteria. Machine learning methods may pave the way for automated AMR prediction based on genomic data of the bacteria. However, comparing different machine learning methods for the prediction of AMR based on different encodings and whole-genome sequencing data without previously known knowledge remains to be done. RESULTS In this study, we evaluated logistic regression (LR), support vector machine (SVM), random forest (RF) and convolutional neural network (CNN) for the prediction of AMR for the antibiotics ciprofloxacin, cefotaxime, ceftazidime and gentamicin. We could demonstrate that these models can effectively predict AMR with label encoding, one-hot encoding and frequency matrix chaos game representation (FCGR encoding) on whole-genome sequencing data. We trained these models on a large AMR dataset and evaluated them on an independent public dataset. Generally, RFs and CNNs perform better than LR and SVM with AUCs up to 0.96. Furthermore, we were able to identify mutations that are associated with AMR for each antibiotic. AVAILABILITY AND IMPLEMENTATION Source code in data preparation and model training are provided at GitHub website (https://github.com/YunxiaoRen/ML-iAMR). SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Yunxiao Ren
- Department of Data Science in Biomedicine, Faculty of Mathematics and Computer Science, Philipps-University of Marburg, Marburg 35032, Germany
| | - Trinad Chakraborty
- Institute of Medical Microbiology, Justus Liebig University Giessen, Giessen 35392, Germany,German Center for Infection Research, Partner site Giessen-Marburg-Langen, Giessen 35392, Germany
| | - Swapnil Doijad
- Institute of Medical Microbiology, Justus Liebig University Giessen, Giessen 35392, Germany,German Center for Infection Research, Partner site Giessen-Marburg-Langen, Giessen 35392, Germany
| | - Linda Falgenhauer
- German Center for Infection Research, Partner site Giessen-Marburg-Langen, Giessen 35392, Germany,Institute of Hygiene and Environmental Medicine, Justus Liebig University Giessen, Giessen 35392, Germany,Hessisches universitäres Kompetenzzentrum Krankenhaushygiene, Giessen 35392, Germany
| | - Jane Falgenhauer
- Institute of Medical Microbiology, Justus Liebig University Giessen, Giessen 35392, Germany,German Center for Infection Research, Partner site Giessen-Marburg-Langen, Giessen 35392, Germany
| | - Alexander Goesmann
- German Center for Infection Research, Partner site Giessen-Marburg-Langen, Giessen 35392, Germany,Department of Bioinformatics and Systems Biology, Justus Liebig University Giessen, Giessen 35392, Germany
| | - Anne-Christin Hauschild
- Department of Data Science in Biomedicine, Faculty of Mathematics and Computer Science, Philipps-University of Marburg, Marburg 35032, Germany
| | - Oliver Schwengers
- German Center for Infection Research, Partner site Giessen-Marburg-Langen, Giessen 35392, Germany,Department of Bioinformatics and Systems Biology, Justus Liebig University Giessen, Giessen 35392, Germany
| | | |
Collapse
|
36
|
Tan R, Yu A, Liu Z, Liu Z, Jiang R, Wang X, Liu J, Gao J, Wang X. Prediction of Minimal Inhibitory Concentration of Meropenem Against Klebsiella pneumoniae Using Metagenomic Data. Front Microbiol 2021; 12:712886. [PMID: 34497594 PMCID: PMC8421019 DOI: 10.3389/fmicb.2021.712886] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2021] [Accepted: 07/26/2021] [Indexed: 11/29/2022] Open
Abstract
Minimal inhibitory concentration (MIC) is defined as the lowest concentration of an antimicrobial agent that can inhibit the visible growth of a particular microorganism after overnight incubation. Clinically, antibiotic doses for specific infections are determined according to the fraction of MIC. Therefore, credible assessment of MICs will provide a physician valuable information on the choice of therapeutic strategy. Early and precise usage of antibiotics is the key to an infection therapy. Compared with the traditional culture-based method, the approach of whole genome sequencing to identify MICs can shorten the experimental time, thereby improving clinical efficacy. Klebsiella pneumoniae is one of the most significant members of the genus Klebsiella in the Enterobacteriaceae family and also a common non-social pathogen. Meropenem is a broad-spectrum antibacterial agent of the carbapenem family, which can produce antibacterial effects of most Gram-positive and -negative bacteria. In this study, we used single-nucleotide polymorphism (SNP) information and nucleotide k-mers count based on metagenomic data to predict MICs of meropenem against K. pneumoniae. Then, features of 110 sequenced K. pneumoniae genome data were combined and modeled with XGBoost algorithm and deep neural network (DNN) algorithm to predict MICs. We first use the XGBoost classification model and the XGBoost regression model. After five runs, the average accuracy of the test set was calculated. The accuracy of using nucleotide k-mers to predict MICs of the XGBoost classification model and XGBoost regression model was 84.5 and 89.1%. The accuracy of SNP in predicting MIC was 80 and 81.8%, respectively. The results show that XGBoost regression is better than XGBoost classification in both nucleotide k-mers and SNPs to predict MICs. We further selected 40 nucleotide k-mers and 40 SNPs with the highest correlation with MIC values as features to retrain the XGBoost regression model and DNN regression model. After 100 and 1,000 runs, the results show that the accuracy of the two models was improved. The accuracy of the XGBoost regression model for k-mers, SNPs, and k-mers & SNPs was 91.1, 85.2, and 91.3%, respectively. The accuracy of the DNN regression model was 91.9, 87.1, and 91.8%, respectively. Through external verification, some of the selected features were found to be related to drug resistance.
Collapse
Affiliation(s)
- Rundong Tan
- Shanghai Biotecan Pharmaceuticals Co., Ltd., Shanghai, China.,Shanghai Zhangjiang Institute of Medical Innovation, Shanghai, China
| | - Anqi Yu
- Shanghai Biotecan Pharmaceuticals Co., Ltd., Shanghai, China.,Shanghai Zhangjiang Institute of Medical Innovation, Shanghai, China
| | - Ziming Liu
- Medical Information Engineering, Department of Medical Information, Harbin Medical University, Harbin, China
| | - Ziqi Liu
- Department of Biostatistics, School of Global Public Health, New York University, New York, NY, United States
| | - Rongfeng Jiang
- Shanghai Biotecan Pharmaceuticals Co., Ltd., Shanghai, China.,Shanghai Zhangjiang Institute of Medical Innovation, Shanghai, China
| | - Xiaoli Wang
- Department of Critical Care Medicine, Ruijin Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai, China
| | - Jialin Liu
- Department of Critical Care Medicine, Ruijin Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai, China
| | - Junhui Gao
- Shanghai Biotecan Pharmaceuticals Co., Ltd., Shanghai, China.,Shanghai Zhangjiang Institute of Medical Innovation, Shanghai, China
| | - Xinjun Wang
- Translational Medical Center for Stem Cell Therapy, Shanghai East Hospital, School of Medicine, Tongji University, Shanghai, China
| |
Collapse
|
37
|
Melo MCR, Maasch JRMA, de la Fuente-Nunez C. Accelerating antibiotic discovery through artificial intelligence. Commun Biol 2021; 4:1050. [PMID: 34504303 PMCID: PMC8429579 DOI: 10.1038/s42003-021-02586-0] [Citation(s) in RCA: 68] [Impact Index Per Article: 22.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2021] [Accepted: 07/16/2021] [Indexed: 02/07/2023] Open
Abstract
By targeting invasive organisms, antibiotics insert themselves into the ancient struggle of the host-pathogen evolutionary arms race. As pathogens evolve tactics for evading antibiotics, therapies decline in efficacy and must be replaced, distinguishing antibiotics from most other forms of drug development. Together with a slow and expensive antibiotic development pipeline, the proliferation of drug-resistant pathogens drives urgent interest in computational methods that promise to expedite candidate discovery. Strides in artificial intelligence (AI) have encouraged its application to multiple dimensions of computer-aided drug design, with increasing application to antibiotic discovery. This review describes AI-facilitated advances in the discovery of both small molecule antibiotics and antimicrobial peptides. Beyond the essential prediction of antimicrobial activity, emphasis is also given to antimicrobial compound representation, determination of drug-likeness traits, antimicrobial resistance, and de novo molecular design. Given the urgency of the antimicrobial resistance crisis, we analyze uptake of open science best practices in AI-driven antibiotic discovery and argue for openness and reproducibility as a means of accelerating preclinical research. Finally, trends in the literature and areas for future inquiry are discussed, as artificially intelligent enhancements to drug discovery at large offer many opportunities for future applications in antibiotic development.
Collapse
Affiliation(s)
- Marcelo C R Melo
- Machine Biology Group, Departments of Psychiatry and Microbiology, Institute for Biomedical Informatics, Institute for Translational Medicine and Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
- Departments of Bioengineering and Chemical and Biomolecular Engineering, School of Engineering and Applied Science, University of Pennsylvania, Philadelphia, PA, USA
- Penn Institute for Computational Science, University of Pennsylvania, Philadelphia, PA, USA
| | - Jacqueline R M A Maasch
- Machine Biology Group, Departments of Psychiatry and Microbiology, Institute for Biomedical Informatics, Institute for Translational Medicine and Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
- Departments of Bioengineering and Chemical and Biomolecular Engineering, School of Engineering and Applied Science, University of Pennsylvania, Philadelphia, PA, USA
- Penn Institute for Computational Science, University of Pennsylvania, Philadelphia, PA, USA
- Department of Computer and Information Science, School of Engineering and Applied Science, University of Pennsylvania, Philadelphia, PA, USA
| | - Cesar de la Fuente-Nunez
- Machine Biology Group, Departments of Psychiatry and Microbiology, Institute for Biomedical Informatics, Institute for Translational Medicine and Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA.
- Departments of Bioengineering and Chemical and Biomolecular Engineering, School of Engineering and Applied Science, University of Pennsylvania, Philadelphia, PA, USA.
- Penn Institute for Computational Science, University of Pennsylvania, Philadelphia, PA, USA.
| |
Collapse
|
38
|
Zabeti H, Dexter N, Safari AH, Sedaghat N, Libbrecht M, Chindelevitch L. INGOT-DR: an interpretable classifier for predicting drug resistance in M. tuberculosis. Algorithms Mol Biol 2021; 16:17. [PMID: 34376217 PMCID: PMC8353837 DOI: 10.1186/s13015-021-00198-1] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2021] [Accepted: 07/23/2021] [Indexed: 12/13/2022] Open
Abstract
Motivation Prediction of drug resistance and identification of its mechanisms in bacteria such as Mycobacterium tuberculosis, the etiological agent of tuberculosis, is a challenging problem. Solving this problem requires a transparent, accurate, and flexible predictive model. The methods currently used for this purpose rarely satisfy all of these criteria. On the one hand, approaches based on testing strains against a catalogue of previously identified mutations often yield poor predictive performance; on the other hand, machine learning techniques typically have higher predictive accuracy, but often lack interpretability and may learn patterns that produce accurate predictions for the wrong reasons. Current interpretable methods may either exhibit a lower accuracy or lack the flexibility needed to generalize them to previously unseen data. Contribution In this paper we propose a novel technique, inspired by group testing and Boolean compressed sensing, which yields highly accurate predictions, interpretable results, and is flexible enough to be optimized for various evaluation metrics at the same time. Results We test the predictive accuracy of our approach on five first-line and seven second-line antibiotics used for treating tuberculosis. We find that it has a higher or comparable accuracy to that of commonly used machine learning models, and is able to identify variants in genes with previously reported association to drug resistance. Our method is intrinsically interpretable, and can be customized for different evaluation metrics. Our implementation is available at github.com/hoomanzabeti/INGOT_DR and can be installed via The Python Package Index (Pypi) under ingotdr. This package is also compatible with most of the tools in the Scikit-learn machine learning library.
Collapse
|
39
|
Mirzaei M, Furxhi I, Murphy F, Mullins M. A Machine Learning Tool to Predict the Antibacterial Capacity of Nanoparticles. NANOMATERIALS (BASEL, SWITZERLAND) 2021; 11:1774. [PMID: 34361160 PMCID: PMC8308172 DOI: 10.3390/nano11071774] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/20/2021] [Revised: 06/13/2021] [Accepted: 07/06/2021] [Indexed: 12/22/2022]
Abstract
The emergence and rapid spread of multidrug-resistant bacteria strains are a public health concern. This emergence is caused by the overuse and misuse of antibiotics leading to the evolution of antibiotic-resistant strains. Nanoparticles (NPs) are objects with all three external dimensions in the nanoscale that varies from 1 to 100 nm. Research on NPs with enhanced antimicrobial activity as alternatives to antibiotics has grown due to the increased incidence of nosocomial and community acquired infections caused by pathogens. Machine learning (ML) tools have been used in the field of nanoinformatics with promising results. As a consequence of evident achievements on a wide range of predictive tasks, ML techniques are attracting significant interest across a variety of stakeholders. In this article, we present an ML tool that successfully predicts the antibacterial capacity of NPs while the model's validation demonstrates encouraging results (R2 = 0.78). The data were compiled after a literature review of 60 articles and consist of key physico-chemical (p-chem) properties and experimental conditions (exposure variables and bacterial clustering) from in vitro studies. Following data homogenization and pre-processing, we trained various regression algorithms and we validated them using diverse performance metrics. Finally, an important attribute evaluation, which ranks the attributes that are most important in predicting the outcome, was performed. The attribute importance revealed that NP core size, the exposure dose, and the species of bacterium are key variables in predicting the antibacterial effect of NPs. This tool assists various stakeholders and scientists in predicting the antibacterial effects of NPs based on their p-chem properties and diverse exposure settings. This concept also aids the safe-by-design paradigm by incorporating functionality tools.
Collapse
Affiliation(s)
- Mahsa Mirzaei
- Department of Accounting and Finance, Kemmy Business School, University of Limerick, V94PH93 Limerick, Ireland; (M.M.); (F.M.); (M.M.)
| | - Irini Furxhi
- Department of Accounting and Finance, Kemmy Business School, University of Limerick, V94PH93 Limerick, Ireland; (M.M.); (F.M.); (M.M.)
- Transgero Limited, Cullinagh, Newcastle West, V42V384 Limerick, Ireland
| | - Finbarr Murphy
- Department of Accounting and Finance, Kemmy Business School, University of Limerick, V94PH93 Limerick, Ireland; (M.M.); (F.M.); (M.M.)
- Transgero Limited, Cullinagh, Newcastle West, V42V384 Limerick, Ireland
| | - Martin Mullins
- Department of Accounting and Finance, Kemmy Business School, University of Limerick, V94PH93 Limerick, Ireland; (M.M.); (F.M.); (M.M.)
| |
Collapse
|
40
|
|
41
|
Goodswen SJ, Barratt JLN, Kennedy PJ, Kaufer A, Calarco L, Ellis JT. Machine learning and applications in microbiology. FEMS Microbiol Rev 2021; 45:6174022. [PMID: 33724378 PMCID: PMC8498514 DOI: 10.1093/femsre/fuab015] [Citation(s) in RCA: 56] [Impact Index Per Article: 18.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2021] [Accepted: 02/28/2021] [Indexed: 12/15/2022] Open
Abstract
To understand the intricacies of microorganisms at the molecular level requires making sense of copious volumes of data such that it may now be humanly impossible to detect insightful data patterns without an artificial intelligence application called machine learning. Applying machine learning to address biological problems is expected to grow at an unprecedented rate, yet it is perceived by the uninitiated as a mysterious and daunting entity entrusted to the domain of mathematicians and computer scientists. The aim of this review is to identify key points required to start the journey of becoming an effective machine learning practitioner. These key points are further reinforced with an evaluation of how machine learning has been applied so far in a broad scope of real-life microbiology examples. This includes predicting drug targets or vaccine candidates, diagnosing microorganisms causing infectious diseases, classifying drug resistance against antimicrobial medicines, predicting disease outbreaks and exploring microbial interactions. Our hope is to inspire microbiologists and other related researchers to join the emerging machine learning revolution.
Collapse
Affiliation(s)
- Stephen J Goodswen
- School of Life Sciences, University of Technology Sydney (UTS), Ultimo, NSW, Australia
| | - Joel L N Barratt
- Parasitic Diseases Branch, Division of Parasitic Diseases and Malaria, Center for Global Health, Centers for Disease Control and Prevention, Atlanta, GA, USA
| | - Paul J Kennedy
- School of Computer Science, Faculty of Engineering and Information Technology and the Australian Artificial Intelligence Institute, University of Technology Sydney (UTS), Ultimo, NSW, Australia
| | - Alexa Kaufer
- School of Life Sciences, University of Technology Sydney (UTS), Ultimo, NSW, Australia
| | - Larissa Calarco
- School of Life Sciences, University of Technology Sydney (UTS), Ultimo, NSW, Australia
| | - John T Ellis
- School of Life Sciences, University of Technology Sydney (UTS), Ultimo, NSW, Australia
| |
Collapse
|
42
|
Tiwari P, Khare T, Shriram V, Bae H, Kumar V. Plant synthetic biology for producing potent phyto-antimicrobials to combat antimicrobial resistance. Biotechnol Adv 2021; 48:107729. [PMID: 33705914 DOI: 10.1016/j.biotechadv.2021.107729] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2020] [Revised: 01/22/2021] [Accepted: 03/04/2021] [Indexed: 12/14/2022]
Abstract
Inappropriate and injudicious use of antimicrobial drugs in human health, hygiene, agriculture, animal husbandry and food industries has contributed significantly to rapid emergence and persistence of antimicrobial resistance (AMR), one of the serious global public health threats. The crisis of AMR versus slower discovery of newer antibiotics put forth a daunting task to control these drug-resistant superbugs. Several phyto-antimicrobials have been identified in recent years with direct-killing (bactericidal) and/or drug-resistance reversal (re-sensitization of AMR phenotypes) potencies. Phyto-antimicrobials may hold the key in combating AMR owing to their abilities to target major microbial drug-resistance determinants including cell membrane, drug-efflux pumps, cell communication and biofilms. However, limited distribution, low intracellular concentrations, eco-geographical variations, beside other considerations like dynamic environments, climate change and over-exploitation of plant-resources are major blockades in full potential exploration phyto-antimicrobials. Synthetic biology (SynBio) strategies integrating metabolic engineering, RNA-interference, genome editing/engineering and/or systems biology approaches using plant chassis (as engineerable platforms) offer prospective tools for production of phyto-antimicrobials. With expanding SynBio toolkit, successful attempts towards introduction of entire gene cluster, reconstituting the metabolic pathway or transferring an entire metabolic (or synthetic) pathway into heterologous plant systems highlight the potential of this field. Through this perspective review, we are presenting herein the current situation and options for addressing AMR, emphasizing on the significance of phyto-antimicrobials in this apparently post-antibiotic era, and effective use of plant chassis for phyto-antimicrobial production at industrial scales along with major SynBio tools and useful databases. Current knowledge, recent success stories, associated challenges and prospects of translational success are also discussed.
Collapse
Affiliation(s)
- Pragya Tiwari
- Molecular Metabolic Engineering Lab, Department of Biotechnology, Yeungnam University, Gyeongsan, Gyeongbuk 38541, Republic of Korea
| | - Tushar Khare
- Department of Biotechnology, Modern College of Arts, Science and Commerce, Savitribai Phule Pune University, Ganeshkhind, Pune 411016, India; Department of Environmental Science, Savitribai Phule Pune University, Pune 411007, India
| | - Varsha Shriram
- Department of Botany, Prof. Ramkrishna More Arts, Commerce and Science College, Savitribai Phule Pune University, Akurdi, Pune 411044, India
| | - Hanhong Bae
- Molecular Metabolic Engineering Lab, Department of Biotechnology, Yeungnam University, Gyeongsan, Gyeongbuk 38541, Republic of Korea.
| | - Vinay Kumar
- Department of Biotechnology, Modern College of Arts, Science and Commerce, Savitribai Phule Pune University, Ganeshkhind, Pune 411016, India; Department of Environmental Science, Savitribai Phule Pune University, Pune 411007, India.
| |
Collapse
|
43
|
Minias A, Żukowska L, Lechowicz E, Gąsior F, Knast A, Podlewska S, Zygała D, Dziadek J. Early Drug Development and Evaluation of Putative Antitubercular Compounds in the -Omics Era. Front Microbiol 2021; 11:618168. [PMID: 33603720 PMCID: PMC7884339 DOI: 10.3389/fmicb.2020.618168] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2020] [Accepted: 12/30/2020] [Indexed: 12/14/2022] Open
Abstract
Tuberculosis (TB) is an infectious disease caused by the bacterium Mycobacterium tuberculosis. According to the WHO, the disease is one of the top 10 causes of death of people worldwide. Mycobacterium tuberculosis is an intracellular pathogen with an unusually thick, waxy cell wall and a complex life cycle. These factors, combined with M. tuberculosis ability to enter prolonged periods of latency, make the bacterium very difficult to eradicate. The standard treatment of TB requires 6-20months, depending on the drug susceptibility of the infecting strain. The need to take cocktails of antibiotics to treat tuberculosis effectively and the emergence of drug-resistant strains prompts the need to search for new antitubercular compounds. This review provides a perspective on how modern -omic technologies facilitate the drug discovery process for tuberculosis treatment. We discuss how methods of DNA and RNA sequencing, proteomics, and genetic manipulation of organisms increase our understanding of mechanisms of action of antibiotics and allow the evaluation of drugs. We explore the utility of mathematical modeling and modern computational analysis for the drug discovery process. Finally, we summarize how -omic technologies contribute to our understanding of the emergence of drug resistance.
Collapse
Affiliation(s)
- Alina Minias
- Laboratory of Genetics and Physiology of Mycobacterium, Institute of Medical Biology, Polish Academy of Sciences, Lodz, Poland
| | - Lidia Żukowska
- Laboratory of Genetics and Physiology of Mycobacterium, Institute of Medical Biology, Polish Academy of Sciences, Lodz, Poland
- BioMedChem Doctoral School of the University of Lodz and the Institutes of the Polish Academy of Sciences in Lodz, Lodz, Poland
| | - Ewelina Lechowicz
- Laboratory of Genetics and Physiology of Mycobacterium, Institute of Medical Biology, Polish Academy of Sciences, Lodz, Poland
- Institute of Microbiology, Biotechnology and Immunology, Faculty of Biology and Environmental Protection, University of Lodz, Lodz, Poland
| | - Filip Gąsior
- Laboratory of Genetics and Physiology of Mycobacterium, Institute of Medical Biology, Polish Academy of Sciences, Lodz, Poland
- BioMedChem Doctoral School of the University of Lodz and the Institutes of the Polish Academy of Sciences in Lodz, Lodz, Poland
| | - Agnieszka Knast
- Laboratory of Genetics and Physiology of Mycobacterium, Institute of Medical Biology, Polish Academy of Sciences, Lodz, Poland
- Institute of Molecular and Industrial Biotechnology, Faculty of Biotechnology and Food Sciences, Lodz University of Technology, Lodz, Poland
| | - Sabina Podlewska
- Department of Technology and Biotechnology of Drugs, Jagiellonian University Medical College, Krakow, Poland
- Maj Institute of Pharmacology, Polish Academy of Sciences, Krakow, Poland
| | - Daria Zygała
- Laboratory of Genetics and Physiology of Mycobacterium, Institute of Medical Biology, Polish Academy of Sciences, Lodz, Poland
- Institute of Microbiology, Biotechnology and Immunology, Faculty of Biology and Environmental Protection, University of Lodz, Lodz, Poland
| | - Jarosław Dziadek
- Laboratory of Genetics and Physiology of Mycobacterium, Institute of Medical Biology, Polish Academy of Sciences, Lodz, Poland
| |
Collapse
|
44
|
Robust detection of point mutations involved in multidrug-resistant Mycobacterium tuberculosis in the presence of co-occurrent resistance markers. PLoS Comput Biol 2020; 16:e1008518. [PMID: 33347430 PMCID: PMC7785249 DOI: 10.1371/journal.pcbi.1008518] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2020] [Revised: 01/05/2021] [Accepted: 11/11/2020] [Indexed: 11/23/2022] Open
Abstract
Tuberculosis disease is a major global public health concern and the growing prevalence of drug-resistant Mycobacterium tuberculosis is making disease control more difficult. However, the increasing application of whole-genome sequencing as a diagnostic tool is leading to the profiling of drug resistance to inform clinical practice and treatment decision making. Computational approaches for identifying established and novel resistance-conferring mutations in genomic data include genome-wide association study (GWAS) methodologies, tests for convergent evolution and machine learning techniques. These methods may be confounded by extensive co-occurrent resistance, where statistical models for a drug include unrelated mutations known to be causing resistance to other drugs. Here, we introduce a novel ‘cannibalistic’ elimination algorithm (“Hungry, Hungry SNPos”) that attempts to remove these co-occurrent resistant variants. Using an M. tuberculosis genomic dataset for the virulent Beijing strain-type (n = 3,574) with phenotypic resistance data across five drugs (isoniazid, rifampicin, ethambutol, pyrazinamide, and streptomycin), we demonstrate that this new approach is considerably more robust than traditional methods and detects resistance-associated variants too rare to be likely picked up by correlation-based techniques like GWAS. Tuberculosis is one of the deadliest infectious diseases, being responsible for more than one million deaths per year. The causing bacteria are becoming increasingly drug-resistant, which is hampering disease control. At the same time, an unprecedented amount of bacterial whole-genome sequencing is increasingly informing clinical practice. In order to detect the genetic alterations responsible for developing drug resistance and predict resistance status from genomic data, bio-statistical methods and machine learning models have been employed. However, due to strongly overlapping drug resistance phenotypes and genotypes in multidrug-resistant datasets, the results of these correlation-based approaches frequently also contain mutations related to resistance against other drugs. In the past, this issue has often been ignored or partially resolved by either restricting the input data or in post-analysis screening—with both strategies relying on prior information. Here we present a heuristic algorithm for finding resistance-associated variants and demonstrate that it is considerably more robust towards co-occurrent resistance compared to traditional techniques. The software is available at https://github.com/julibeg/HHS.
Collapse
|
45
|
Li X, Lin J, Hu Y, Zhou J. PARMAP: A Pan-Genome-Based Computational Framework for Predicting Antimicrobial Resistance. Front Microbiol 2020; 11:578795. [PMID: 33193203 PMCID: PMC7642336 DOI: 10.3389/fmicb.2020.578795] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2020] [Accepted: 09/24/2020] [Indexed: 11/17/2022] Open
Abstract
Antimicrobial resistance (AMR) has emerged as one of the most urgent global threats to public health. Accurate detection of AMR phenotypes is critical for reducing the spread of AMR strains. Here, we developed PARMAP (Prediction of Antimicrobial Resistance by MAPping genetic alterations in pan-genome) to predict AMR phenotypes and to identify AMR-associated genetic alterations based on the pan-genome of bacteria by utilizing machine learning algorithms. When we applied PARMAP to 1,597 Neisseria gonorrhoeae strains, it successfully predicted their AMR phenotypes based on a pan-genome analysis. Furthermore, it identified 328 genetic alterations in 23 known AMR genes and discovered many new AMR-associated genetic alterations in ciprofloxacin-resistant N. gonorrhoeae, and it clearly indicated the genetic heterogeneity of AMR genes in different subtypes of resistant N. gonorrhoeae. Additionally, PARMAP performed well in predicting the AMR phenotypes of Mycobacterium tuberculosis and Escherichia coli, indicating the robustness of the PARMAP framework. In conclusion, PARMAP not only precisely predicts the AMR of a population of strains of a given species but also uses whole-genome sequencing data to prioritize candidate AMR-associated genetic alterations based on their likelihood of contributing to AMR. Thus, we believe that PARMAP will accelerate investigations into AMR mechanisms in other human pathogens.
Collapse
Affiliation(s)
- Xuefei Li
- Dermatology Hospital, Southern Medical University, Guangzhou, China
| | - Jingxia Lin
- Dermatology Hospital, Southern Medical University, Guangzhou, China
| | - Yongfei Hu
- Dermatology Hospital, Southern Medical University, Guangzhou, China
| | - Jiajian Zhou
- Dermatology Hospital, Southern Medical University, Guangzhou, China
| |
Collapse
|
46
|
Hicks ND, Giffen SR, Culviner PH, Chao MC, Dulberger CL, Liu Q, Stanley S, Brown J, Sixsmith J, Wolf ID, Fortune SM. Mutations in dnaA and a cryptic interaction site increase drug resistance in Mycobacterium tuberculosis. PLoS Pathog 2020; 16:e1009063. [PMID: 33253310 PMCID: PMC7738170 DOI: 10.1371/journal.ppat.1009063] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2020] [Revised: 12/15/2020] [Accepted: 10/09/2020] [Indexed: 01/26/2023] Open
Abstract
Genomic dissection of antibiotic resistance in bacterial pathogens has largely focused on genetic changes conferring growth above a single critical concentration of drug. However, reduced susceptibility to antibiotics-even below this breakpoint-is associated with poor treatment outcomes in the clinic, including in tuberculosis. Clinical strains of Mycobacterium tuberculosis exhibit extensive quantitative variation in antibiotic susceptibility but the genetic basis behind this spectrum of drug susceptibility remains ill-defined. Through a genome wide association study, we show that non-synonymous mutations in dnaA, which encodes an essential and highly conserved regulator of DNA replication, are associated with drug resistance in clinical M. tuberculosis strains. We demonstrate that these dnaA mutations specifically enhance M. tuberculosis survival during isoniazid treatment via reduced expression of katG, the activator of isoniazid. To identify DnaA interactors relevant to this phenotype, we perform the first genome-wide biochemical mapping of DnaA binding sites in mycobacteria which reveals a DnaA interaction site that is the target of recurrent mutation in clinical strains. Reconstructing clinically prevalent mutations in this DnaA interaction site reproduces the phenotypes of dnaA mutants, suggesting that clinical strains of M. tuberculosis have evolved mutations in a previously uncharacterized DnaA pathway that quantitatively increases resistance to the key first-line antibiotic isoniazid. Discovering genetic mechanisms that reduce drug susceptibility and support the evolution of high-level drug resistance will guide development of biomarkers capable of prospectively identifying patients at risk of treatment failure in the clinic.
Collapse
Affiliation(s)
- Nathan D. Hicks
- Department of Immunology and Infectious Diseases, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America
| | - Samantha R. Giffen
- Department of Immunology and Infectious Diseases, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America
| | - Peter H. Culviner
- Department of Immunology and Infectious Diseases, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America
| | - Michael C. Chao
- Department of Immunology and Infectious Diseases, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America
| | - Charles L. Dulberger
- Department of Immunology and Infectious Diseases, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America
| | - Qingyun Liu
- Department of Immunology and Infectious Diseases, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America
| | - Sydney Stanley
- Department of Immunology and Infectious Diseases, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America
| | - Jessica Brown
- Department of Immunology and Infectious Diseases, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America
| | - Jaimie Sixsmith
- Department of Immunology and Infectious Diseases, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America
| | - Ian D. Wolf
- Department of Immunology and Infectious Diseases, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America
| | - Sarah M. Fortune
- Department of Immunology and Infectious Diseases, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America
- Ragon Institute of MGH, MIT, and Harvard, Cambridge, Massachusetts, United States of America
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| |
Collapse
|
47
|
Amino Acid k-mer Feature Extraction for Quantitative Antimicrobial Resistance (AMR) Prediction by Machine Learning and Model Interpretation for Biological Insights. BIOLOGY 2020; 9:biology9110365. [PMID: 33126516 PMCID: PMC7694136 DOI: 10.3390/biology9110365] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/21/2020] [Revised: 10/17/2020] [Accepted: 10/19/2020] [Indexed: 12/31/2022]
Abstract
Machine learning algorithms can learn mechanisms of antimicrobial resistance from the data of DNA sequence without any a priori information. Interpreting a trained machine learning algorithm can be exploited for validating the model and obtaining new information about resistance mechanisms. Different feature extraction methods, such as SNP calling and counting nucleotide k-mers have been proposed for presenting DNA sequences to the model. However, there are trade-offs between interpretability, computational complexity and accuracy for different feature extraction methods. In this study, we have proposed a new feature extraction method, counting amino acid k-mers or oligopeptides, which provides easier model interpretation compared to counting nucleotide k-mers and reaches the same or even better accuracy in comparison with different methods. Additionally, we have trained machine learning algorithms using different feature extraction methods and compared the results in terms of accuracy, model interpretability and computational complexity. We have built a new feature selection pipeline for extraction of important features so that new AMR determinants can be discovered by analyzing these features. This pipeline allows the construction of models that only use a small number of features and can predict resistance accurately.
Collapse
|
48
|
Jaillard M, Palmieri M, van Belkum A, Mahé P. Interpreting k-mer-based signatures for antibiotic resistance prediction. Gigascience 2020; 9:giaa110. [PMID: 33068113 PMCID: PMC7568433 DOI: 10.1093/gigascience/giaa110] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2020] [Revised: 07/23/2020] [Accepted: 09/16/2020] [Indexed: 11/22/2022] Open
Abstract
BACKGROUND Recent years have witnessed the development of several k-mer-based approaches aiming to predict phenotypic traits of bacteria on the basis of their whole-genome sequences. While often convincing in terms of predictive performance, the underlying models are in general not straightforward to interpret, the interplay between the actual genetic determinant and its translation as k-mers being generally hard to decipher. RESULTS We propose a simple and computationally efficient strategy allowing one to cope with the high correlation inherent to k-mer-based representations in supervised machine learning models, leading to concise and easily interpretable signatures. We demonstrate the benefit of this approach on the task of predicting the antibiotic resistance profile of a Klebsiella pneumoniae strain from its genome, where our method leads to signatures defined as weighted linear combinations of genetic elements that can easily be identified as genuine antibiotic resistance determinants, with state-of-the-art predictive performance. CONCLUSIONS By enhancing the interpretability of genomic k-mer-based antibiotic resistance prediction models, our approach improves their clinical utility and hence will facilitate their adoption in routine diagnostics by clinicians and microbiologists. While antibiotic resistance was the motivating application, the method is generic and can be transposed to any other bacterial trait. An R package implementing our method is available at https://gitlab.com/biomerieux-data-science/clustlasso.
Collapse
Affiliation(s)
| | | | | | - Pierre Mahé
- bioMérieux, Chemin de l'Orme, 69280 Marcy l'Etoile, France
| |
Collapse
|
49
|
McDermott PF, Davis JJ. Predicting antimicrobial susceptibility from the bacterial genome: A new paradigm for one health resistance monitoring. J Vet Pharmacol Ther 2020; 44:223-237. [PMID: 33010049 DOI: 10.1111/jvp.12913] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2020] [Revised: 08/25/2020] [Accepted: 09/09/2020] [Indexed: 12/11/2022]
Abstract
The laboratory identification of antibacterial resistance is a cornerstone of infectious disease medicine. In vitro antimicrobial susceptibility testing has long been based on the growth response of organisms in pure culture to a defined concentration of antimicrobial agents. By comparing individual isolates to wild-type susceptibility patterns, strains with acquired resistance can be identified. Acquired resistance can also be detected genetically. After many decades of research, the inventory of genes underlying antimicrobial resistance is well known for several pathogenic genera including zoonotic enteric organisms such as Salmonella and Campylobacter and continues to grow substantially for others. With the decline in costs for large scale DNA sequencing, it is now practicable to characterize bacteria using whole genome sequencing, including the carriage of resistance genes in individual microorganisms and those present in complex biological samples. With genomics, we can generate comprehensive, detailed information on the bacterium, the mechanisms of antibiotic resistance, clues to its source, and the nature of mobile DNA elements by which resistance spreads. These developments point to a new paradigm for antimicrobial resistance detection and tracking for both clinical and public health purposes.
Collapse
Affiliation(s)
- Patrick F McDermott
- Office of Research, Center for Veterinary Medicine, U.S. Food and Drug Administration, Laurel, MD, USA
| | - James J Davis
- Division of Data Science and Learning, Argonne National Laboratory, Argonne, IL, USA.,University of Chicago Consortium for Advanced Science and Engineering, University of Chicago, Chicago, IL, USA
| |
Collapse
|
50
|
Pataki BÁ, Matamoros S, van der Putten BCL, Remondini D, Giampieri E, Aytan-Aktug D, Hendriksen RS, Lund O, Csabai I, Schultsz C. Understanding and predicting ciprofloxacin minimum inhibitory concentration in Escherichia coli with machine learning. Sci Rep 2020; 10:15026. [PMID: 32929164 PMCID: PMC7490380 DOI: 10.1038/s41598-020-71693-5] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2020] [Accepted: 08/18/2020] [Indexed: 11/13/2022] Open
Abstract
It is important that antibiotics prescriptions are based on antimicrobial susceptibility data to ensure effective treatment outcomes. The increasing availability of next-generation sequencing, bacterial whole genome sequencing (WGS) can facilitate a more reliable and faster alternative to traditional phenotyping for the detection and surveillance of AMR. This work proposes a machine learning approach that can predict the minimum inhibitory concentration (MIC) for a given antibiotic, here ciprofloxacin, on the basis of both genome-wide mutation profiles and profiles of acquired antimicrobial resistance genes. We analysed 704 Escherichia coli genomes combined with their respective MIC measurements for ciprofloxacin originating from different countries. The four most important predictors found by the model, mutations in gyrA residues Ser83 and Asp87, a mutation in parC residue Ser80 and presence of the qnrS1 gene, have been experimentally validated before. Using only these four predictors in a linear regression model, 65% and 93% of the test samples’ MIC were correctly predicted within a two- and a four-fold dilution range, respectively. The presented work does not treat machine learning as a black box model concept, but also identifies the genomic features that determine susceptibility. The recent progress in WGS technology in combination with machine learning analysis approaches indicates that in the near future WGS of bacteria might become cheaper and faster than a MIC measurement.
Collapse
Affiliation(s)
- Bálint Ármin Pataki
- Department of Physics of Complex Systems, ELTE Eötvös Loránd University, Budapest, Hungary. .,Department of Computational Sciences, Wigner Research Centre for Physics of the HAS, Budapest, Hungary.
| | - Sébastien Matamoros
- Department of Medical Microbiology, Amsterdam UMC, University of Amsterdam, Amsterdam, The Netherlands
| | - Boas C L van der Putten
- Department of Medical Microbiology, Amsterdam UMC, University of Amsterdam, Amsterdam, The Netherlands.,Department of Global Health, Amsterdam Institute for Global Health and Development, Amsterdam UMC, University of Amsterdam, Amsterdam, The Netherlands
| | - Daniel Remondini
- Department of Physics and Astronomy (DIFA), University of Bologna, Bologna, Italy
| | - Enrico Giampieri
- Department of Experimental, Diagnostic and Specialty Medicine (DIMES), University of Bologna, Bologna, Italy
| | - Derya Aytan-Aktug
- National Food Institute, Technical University of Denmark, Lyngby, Denmark
| | - Rene S Hendriksen
- National Food Institute, Technical University of Denmark, Lyngby, Denmark
| | - Ole Lund
- Department of Bioinformatics, Technical University of Denmark, Lyngby, Denmark
| | - István Csabai
- Department of Physics of Complex Systems, ELTE Eötvös Loránd University, Budapest, Hungary.,Department of Computational Sciences, Wigner Research Centre for Physics of the HAS, Budapest, Hungary
| | - Constance Schultsz
- Department of Medical Microbiology, Amsterdam UMC, University of Amsterdam, Amsterdam, The Netherlands.,Department of Global Health, Amsterdam Institute for Global Health and Development, Amsterdam UMC, University of Amsterdam, Amsterdam, The Netherlands
| | | |
Collapse
|