1
|
Boeckaerts D, Stock M, Ferriol-González C, Oteo-Iglesias J, Sanjuán R, Domingo-Calap P, De Baets B, Briers Y. Prediction of Klebsiella phage-host specificity at the strain level. Nat Commun 2024; 15:4355. [PMID: 38778023 PMCID: PMC11111740 DOI: 10.1038/s41467-024-48675-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2023] [Accepted: 05/08/2024] [Indexed: 05/25/2024] Open
Abstract
Phages are increasingly considered promising alternatives to target drug-resistant bacterial pathogens. However, their often-narrow host range can make it challenging to find matching phages against bacteria of interest. Current computational tools do not accurately predict interactions at the strain level in a way that is relevant and properly evaluated for practical use. We present PhageHostLearn, a machine learning system that predicts strain-level interactions between receptor-binding proteins and bacterial receptors for Klebsiella phage-bacteria pairs. We evaluate this system both in silico and in the laboratory, in the clinically relevant setting of finding matching phages against bacterial strains. PhageHostLearn reaches a cross-validated ROC AUC of up to 81.8% in silico and maintains this performance in laboratory validation. Our approach provides a framework for developing and evaluating phage-host prediction methods that are useful in practice, which we believe to be a meaningful contribution to the machine-learning-guided development of phage therapeutics and diagnostics.
Collapse
Affiliation(s)
- Dimitri Boeckaerts
- Laboratory of Applied Biotechnology, Department of Biotechnology, Ghent University, Ghent, Belgium
- KERMIT, Department of Data Analysis and Mathematical Modelling, Ghent University, Ghent, Belgium
| | - Michiel Stock
- KERMIT, Department of Data Analysis and Mathematical Modelling, Ghent University, Ghent, Belgium
| | - Celia Ferriol-González
- Institute for Integrative Systems Biology (I2SysBio), Universitat de Valencia-CSIC, Paterna, Spain
| | - Jesús Oteo-Iglesias
- Laboratorio de Referencia e Investigación en Resistencia a Antibióticos e Infecciones Relacionadas con la Asistencia Sanitaria, Centro Nacional de Microbiología, Instituto de Salud Carlos III, Madrid, Spain
- CIBER de Enfermedades Infecciosas (CIBERINFEC), Instituto de Salud Carlos III, Madrid, Spain
| | - Rafael Sanjuán
- Institute for Integrative Systems Biology (I2SysBio), Universitat de Valencia-CSIC, Paterna, Spain
| | - Pilar Domingo-Calap
- Institute for Integrative Systems Biology (I2SysBio), Universitat de Valencia-CSIC, Paterna, Spain
| | - Bernard De Baets
- KERMIT, Department of Data Analysis and Mathematical Modelling, Ghent University, Ghent, Belgium
| | - Yves Briers
- Laboratory of Applied Biotechnology, Department of Biotechnology, Ghent University, Ghent, Belgium.
| |
Collapse
|
2
|
Nie W, Qiu T, Wei Y, Ding H, Guo Z, Qiu J. Advances in phage-host interaction prediction: in silico method enhances the development of phage therapies. Brief Bioinform 2024; 25:bbae117. [PMID: 38555471 PMCID: PMC10981677 DOI: 10.1093/bib/bbae117] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2023] [Revised: 01/15/2024] [Accepted: 03/02/2024] [Indexed: 04/02/2024] Open
Abstract
Phages can specifically recognize and kill bacteria, which lead to important application value of bacteriophage in bacterial identification and typing, livestock aquaculture and treatment of human bacterial infection. Considering the variety of human-infected bacteria and the continuous discovery of numerous pathogenic bacteria, screening suitable therapeutic phages that are capable of infecting pathogens from massive phage databases has been a principal step in phage therapy design. Experimental methods to identify phage-host interaction (PHI) are time-consuming and expensive; high-throughput computational method to predict PHI is therefore a potential substitute. Here, we systemically review bioinformatic methods for predicting PHI, introduce reference databases and in silico models applied in these methods and highlight the strengths and challenges of current tools. Finally, we discuss the application scope and future research direction of computational prediction methods, which contribute to the performance improvement of prediction models and the development of personalized phage therapy.
Collapse
Affiliation(s)
- Wanchun Nie
- School of Health Science and Engineering, University of Shanghai for Science and Technology, Shanghai, 200093, China
| | - Tianyi Qiu
- Institute of Clinical Science, Zhongshan Hospital; Intelligent Medicine Institute, Fudan University, Shanghai, 200032, China
- Shanghai Institute of Infectious Disease and Biosecurity, Fudan University, Shanghai, 200032, China
| | - Yiwen Wei
- School of Health Science and Engineering, University of Shanghai for Science and Technology, Shanghai, 200093, China
| | - Hao Ding
- Institute of Clinical Science, Zhongshan Hospital; Intelligent Medicine Institute, Fudan University, Shanghai, 200032, China
| | - Zhixiang Guo
- School of Health Science and Engineering, University of Shanghai for Science and Technology, Shanghai, 200093, China
| | - Jingxuan Qiu
- School of Health Science and Engineering, University of Shanghai for Science and Technology, Shanghai, 200093, China
| |
Collapse
|
3
|
Liu GY, Yu D, Fan MM, Zhang X, Jin ZY, Tang C, Liu XF. Antimicrobial resistance crisis: could artificial intelligence be the solution? Mil Med Res 2024; 11:7. [PMID: 38254241 PMCID: PMC10804841 DOI: 10.1186/s40779-024-00510-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/18/2023] [Accepted: 01/08/2024] [Indexed: 01/24/2024] Open
Abstract
Antimicrobial resistance is a global public health threat, and the World Health Organization (WHO) has announced a priority list of the most threatening pathogens against which novel antibiotics need to be developed. The discovery and introduction of novel antibiotics are time-consuming and expensive. According to WHO's report of antibacterial agents in clinical development, only 18 novel antibiotics have been approved since 2014. Therefore, novel antibiotics are critically needed. Artificial intelligence (AI) has been rapidly applied to drug development since its recent technical breakthrough and has dramatically improved the efficiency of the discovery of novel antibiotics. Here, we first summarized recently marketed novel antibiotics, and antibiotic candidates in clinical development. In addition, we systematically reviewed the involvement of AI in antibacterial drug development and utilization, including small molecules, antimicrobial peptides, phage therapy, essential oils, as well as resistance mechanism prediction, and antibiotic stewardship.
Collapse
Affiliation(s)
- Guang-Yu Liu
- Department of Immunology and Pathogen Biology, School of Basic Medical Sciences, Hangzhou Normal University, Key Laboratory of Aging and Cancer Biology of Zhejiang Province, Key Laboratory of Inflammation and Immunoregulation of Hangzhou, Hangzhou Normal University, Hangzhou, 311121, China
| | - Dan Yu
- National Key Discipline of Pediatrics Key Laboratory of Major Diseases in Children Ministry of Education, Laboratory of Dermatology, Beijing Pediatric Research Institute, Beijing Children's Hospital, Capital Medical University, National Center for Children's Health, Beijing, 100045, China
| | - Mei-Mei Fan
- Department of Immunology and Pathogen Biology, School of Basic Medical Sciences, Hangzhou Normal University, Key Laboratory of Aging and Cancer Biology of Zhejiang Province, Key Laboratory of Inflammation and Immunoregulation of Hangzhou, Hangzhou Normal University, Hangzhou, 311121, China
| | - Xu Zhang
- Robert and Arlene Kogod Center on Aging, Mayo Clinic, Rochester, MN, 55905, USA
- Department of Biochemistry and Molecular Biology, Mayo Clinic, Rochester, MN, 55905, USA
| | - Ze-Yu Jin
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Christoph Tang
- Sir William Dunn School of Pathology, University of Oxford, Oxford, OX1 3RE, UK.
| | - Xiao-Fen Liu
- Institute of Antibiotics, Huashan Hospital, Fudan University, Key Laboratory of Clinical Pharmacology of Antibiotics, National Health Commission of the People's Republic of China, National Clinical Research Centre for Aging and Medicine, Huashan Hospital, Fudan University, Shanghai, 200040, China.
| |
Collapse
|
4
|
Nolin SJ, Siegel PB, Ashwell CM. Differences in the microbiome of the small intestine of Leghorn lines divergently selected for antibody titer to sheep erythrocytes suggest roles for commensals in host humoral response. Front Physiol 2024; 14:1304051. [PMID: 38260103 PMCID: PMC10800846 DOI: 10.3389/fphys.2023.1304051] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2023] [Accepted: 12/18/2023] [Indexed: 01/24/2024] Open
Abstract
For forty generations, two lines of White Leghorn chickens have been selected for high (HAS) or low (LAS) antibody response to a low dose injection of sheep red blood cells (SRBCs). Their gut is home to billons of microorganisms and the largest number of immune cells in the body; therefore, the objective of this experiment was to gain understanding of the ways the microbiome may influence the differential antibody response observed in these lines. We achieved this by characterizing the small intestinal microbiome of HAS and LAS chickens, determining their functional microbiome profiles, and by using machine learning to identify microbes which best differentiate HAS from LAS and associating the abundance of those microbes with host gene expression. Microbiome sequencing revealed greater diversity in LAS but statistically higher abundance of several strains, particularly those of Lactobacillus, in HAS. Enrichment of microbial metabolites implicated in immune response such as lactic acid, short chain fatty acids, amino acids, and vitamins were different between HAS and LAS. The abundance of several microbial strains corresponds to enriched host gene expression pathways related to immune response. These data provide a compelling argument that the microbiome is both likely affected by host divergent genetic selection and that it exerts influence on host antibody response by various mechanisms.
Collapse
Affiliation(s)
- Shelly J. Nolin
- Prestage Department of Poultry Science, North Carolina State University, Raleigh, NC, United States
| | - Paul B. Siegel
- School of Animal Science, Virginia Polytechnic Institute and State University, Blacksburg, VA, United States
| | - Christopher M. Ashwell
- Davis College of Agriculture, Natural Resources, and Design, West Virginia University, Morgantown, WV, United States
| |
Collapse
|
5
|
Pan J, You Z, You W, Zhao T, Feng C, Zhang X, Ren F, Ma S, Wu F, Wang S, Sun Y. PTBGRP: predicting phage-bacteria interactions with graph representation learning on microbial heterogeneous information network. Brief Bioinform 2023; 24:bbad328. [PMID: 37742053 DOI: 10.1093/bib/bbad328] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2023] [Revised: 08/14/2023] [Accepted: 08/30/2023] [Indexed: 09/25/2023] Open
Abstract
Identifying the potential bacteriophages (phage) candidate to treat bacterial infections plays an essential role in the research of human pathogens. Computational approaches are recognized as a valid way to predict bacteria and target phages. However, most of the current methods only utilize lower-order biological information without considering the higher-order connectivity patterns, which helps to improve the predictive accuracy. Therefore, we developed a novel microbial heterogeneous interaction network (MHIN)-based model called PTBGRP to predict new phages for bacterial hosts. Specifically, PTBGRP first constructs an MHIN by integrating phage-bacteria interaction (PBI) and six bacteria-bacteria interaction networks with their biological attributes. Then, different representation learning methods are deployed to extract higher-level biological features and lower-level topological features from MHIN. Finally, PTBGRP employs a deep neural network as the classifier to predict unknown PBI pairs based on the fused biological information. Experiment results demonstrated that PTBGRP achieves the best performance on the corresponding ESKAPE pathogens and PBI dataset when compared with state-of-art methods. In addition, case studies of Klebsiella pneumoniae and Staphylococcus aureus further indicate that the consideration of rich heterogeneous information enables PTBGRP to accurately predict PBI from a more comprehensive perspective. The webserver of the PTBGRP predictor is freely available at http://120.77.11.78/PTBGRP/.
Collapse
Affiliation(s)
- Jie Pan
- Key Laboratory of Resources Biology and Biotechnology in Western China, Ministry of Education, Provincial Key Laboratory of Biotechnology of Shaanxi Province, the College of Life Sciences, Northwest University, Xi'an 710069, China
| | - Zhuhong You
- School of Computer Science, Northwestern Polytechnical University, Xi'an 710129, China
| | - Wencai You
- Key Laboratory of Resources Biology and Biotechnology in Western China, Ministry of Education, Provincial Key Laboratory of Biotechnology of Shaanxi Province, the College of Life Sciences, Northwest University, Xi'an 710069, China
| | - Tian Zhao
- Key Laboratory of Resources Biology and Biotechnology in Western China, Ministry of Education, Provincial Key Laboratory of Biotechnology of Shaanxi Province, the College of Life Sciences, Northwest University, Xi'an 710069, China
| | - Chenlu Feng
- Key Laboratory of Resources Biology and Biotechnology in Western China, Ministry of Education, Provincial Key Laboratory of Biotechnology of Shaanxi Province, the College of Life Sciences, Northwest University, Xi'an 710069, China
| | - Xuexia Zhang
- North China Pharmaceutical Group, Shijiazhuang 050015, Hebei, China
- National Microbial Medicine Engineering & Research Center, Shijiazhuang 050015, Hebei, China
| | - Fengzhi Ren
- North China Pharmaceutical Group, Shijiazhuang 050015, Hebei, China
- National Microbial Medicine Engineering & Research Center, Shijiazhuang 050015, Hebei, China
| | - Sanxing Ma
- Key Laboratory of Resources Biology and Biotechnology in Western China, Ministry of Education, Provincial Key Laboratory of Biotechnology of Shaanxi Province, the College of Life Sciences, Northwest University, Xi'an 710069, China
| | - Fan Wu
- Key Laboratory of Resources Biology and Biotechnology in Western China, Ministry of Education, Provincial Key Laboratory of Biotechnology of Shaanxi Province, the College of Life Sciences, Northwest University, Xi'an 710069, China
| | - Shiwei Wang
- Key Laboratory of Resources Biology and Biotechnology in Western China, Ministry of Education, Provincial Key Laboratory of Biotechnology of Shaanxi Province, the College of Life Sciences, Northwest University, Xi'an 710069, China
| | - Yanmei Sun
- Key Laboratory of Resources Biology and Biotechnology in Western China, Ministry of Education, Provincial Key Laboratory of Biotechnology of Shaanxi Province, the College of Life Sciences, Northwest University, Xi'an 710069, China
| |
Collapse
|
6
|
Arora P, Jain A, Kumar A. Phage design and directed evolution to evolve phage for therapy. PROGRESS IN MOLECULAR BIOLOGY AND TRANSLATIONAL SCIENCE 2023; 200:103-126. [PMID: 37739551 DOI: 10.1016/bs.pmbts.2023.03.019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/24/2023]
Abstract
Phage therapy or Phage treatment is the use of bacteriolysing phage in treating bacterial infections by using the viruses that infects and kills bacteria. This technique has been studied and practiced very long ago, but with the advent of antibiotics, it has been neglected. This foregone technique is now witnessing a revival due to development of bacterial resistance. Nowadays, with the awareness of genetic sequence of organisms, it is required that informed choices of phages have to be made for the most efficacious results. Furthermore, phages with the evolving genes are taken into consideration for the subsequent improvement in treating the patients for bacterial diseases. In addition, direct evolution methods are increasingly developing, since these are capable of creating new biological molecules having changed or unique activities, such as, improved target specificity, evolution of novel proteins with new catalytic properties or creation of nucleic acids that are capable of recognizing required pathogenic bacteria. This system is incorporates continuous evolution such as protein or genes are put under continuous evolution by providing continuous mutagenesis with least human intervention. Although, this system providing continuous directed evolution is very effective, it imposes some challenges due to requirement of heavy investment of time and resources. This chapter focuses on development of phage as a therapeutic agent against various bacteria causing diseases and it improvement using direct evolution of proteins and nucleic acids such that they target specific organisms.
Collapse
Affiliation(s)
- Priyancka Arora
- Department of Biotechnology, Faculty of Engineering and Technology, Rama University, Kanpur, Uttar Pradesh, India
| | - Avni Jain
- Department of Biotechnology, Faculty of Engineering and Technology, Rama University, Kanpur, Uttar Pradesh, India
| | - Ajay Kumar
- Department of Biotechnology, Faculty of Engineering and Technology, Rama University, Kanpur, Uttar Pradesh, India.
| |
Collapse
|
7
|
Iuchi H, Kawasaki J, Kubo K, Fukunaga T, Hokao K, Yokoyama G, Ichinose A, Suga K, Hamada M. Bioinformatics approaches for unveiling virus-host interactions. Comput Struct Biotechnol J 2023; 21:1774-1784. [PMID: 36874163 PMCID: PMC9969756 DOI: 10.1016/j.csbj.2023.02.044] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2022] [Revised: 02/22/2023] [Accepted: 02/22/2023] [Indexed: 03/03/2023] Open
Abstract
The coronavirus disease-2019 (COVID-19) pandemic has elucidated major limitations in the capacity of medical and research institutions to appropriately manage emerging infectious diseases. We can improve our understanding of infectious diseases by unveiling virus-host interactions through host range prediction and protein-protein interaction prediction. Although many algorithms have been developed to predict virus-host interactions, numerous issues remain to be solved, and the entire network remains veiled. In this review, we comprehensively surveyed algorithms used to predict virus-host interactions. We also discuss the current challenges, such as dataset biases toward highly pathogenic viruses, and the potential solutions. The complete prediction of virus-host interactions remains difficult; however, bioinformatics can contribute to progress in research on infectious diseases and human health.
Collapse
Affiliation(s)
- Hitoshi Iuchi
- Waseda Research Institute for Science and Engineering, Waseda University, Tokyo 169-8555, Japan.,Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), National Institute of Advanced Industrial Science and Technology (AIST), Tokyo 169-8555, Japan
| | - Junna Kawasaki
- Faculty of Science and Engineering, Waseda University, Okubo Shinjuku-ku, Tokyo 169-8555, Japan
| | - Kento Kubo
- Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), National Institute of Advanced Industrial Science and Technology (AIST), Tokyo 169-8555, Japan.,School of Advanced Science and Engineering, Waseda University, Okubo Shinjuku-ku, Tokyo 169-8555, Japan
| | - Tsukasa Fukunaga
- Waseda Institute for Advanced Study, Waseda University, Nishi Waseda, Shinjuku-ku, Tokyo 169-0051, Japan
| | - Koki Hokao
- School of Advanced Science and Engineering, Waseda University, Okubo Shinjuku-ku, Tokyo 169-8555, Japan
| | - Gentaro Yokoyama
- Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), National Institute of Advanced Industrial Science and Technology (AIST), Tokyo 169-8555, Japan.,School of Advanced Science and Engineering, Waseda University, Okubo Shinjuku-ku, Tokyo 169-8555, Japan
| | - Akiko Ichinose
- Waseda Research Institute for Science and Engineering, Waseda University, Tokyo 169-8555, Japan
| | - Kanta Suga
- School of Advanced Science and Engineering, Waseda University, Okubo Shinjuku-ku, Tokyo 169-8555, Japan
| | - Michiaki Hamada
- Waseda Research Institute for Science and Engineering, Waseda University, Tokyo 169-8555, Japan.,Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), National Institute of Advanced Industrial Science and Technology (AIST), Tokyo 169-8555, Japan.,School of Advanced Science and Engineering, Waseda University, Okubo Shinjuku-ku, Tokyo 169-8555, Japan.,Graduate School of Medicine, Nippon Medical School, Tokyo 113-8602, Japan
| |
Collapse
|
8
|
Phage Therapy for Crops: Concepts, Experimental and Bioinformatics Approaches to Direct Its Application. Int J Mol Sci 2022; 24:ijms24010325. [PMID: 36613768 PMCID: PMC9820149 DOI: 10.3390/ijms24010325] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2022] [Revised: 12/14/2022] [Accepted: 12/16/2022] [Indexed: 12/28/2022] Open
Abstract
Phage therapy consists of applying bacteriophages, whose natural function is to kill specific bacteria. Bacteriophages are safe, evolve together with their host, and are environmentally friendly. At present, the indiscriminate use of antibiotics and salt minerals (Zn2+ or Cu2+) has caused the emergence of resistant strains that infect crops, causing difficulties and loss of food production. Phage therapy is an alternative that has shown positive results and can improve the treatments available for agriculture. However, the success of phage therapy depends on finding effective bacteriophages. This review focused on describing the potential, up to now, of applying phage therapy as an alternative treatment against bacterial diseases, with sustainable improvement in food production. We described the current isolation techniques, characterization, detection, and selection of lytic phages, highlighting the importance of complementary studies using genome analysis of the phage and its host. Finally, among these studies, we concentrated on the most relevant bacteriophages used for biocontrol of Pseudomonas spp., Xanthomonas spp., Pectobacterium spp., Ralstonia spp., Burkholderia spp., Dickeya spp., Clavibacter michiganensis, and Agrobacterium tumefaciens as agents that cause damage to crops, and affect food production around the world.
Collapse
|
9
|
Ataee S, Brochet X, Peña-Reyes CA. Bacteriophage Genetic Edition Using LSTM. FRONTIERS IN BIOINFORMATICS 2022; 2:932319. [PMID: 36353213 PMCID: PMC9639385 DOI: 10.3389/fbinf.2022.932319] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2022] [Accepted: 06/06/2022] [Indexed: 09/16/2023] Open
Abstract
Bacteriophages are gaining increasing interest as antimicrobial tools, largely due to the emergence of multi-antibiotic-resistant bacteria. Although their huge diversity and virulence make them particularly attractive for targeting a wide range of bacterial pathogens, it is difficult to select suitable phages due to their high specificity which limits their host range. In addition, other challenges remain such as structural fragility under certain environmental conditions, immunogenicity of phage therapy, or development of bacterial resistance. The use of genetically engineered phages may reduce characteristics that hinder prophylactic and therapeutic applications of phages. Nowadays, there is no systematic method to modify a given phage genome conferring its sought characteristics. We explore the use of artificial intelligence for this purpose as it has the potential to both guide and accelerate genome modification to generate phage variants with unique properties that overcome the limitations of natural phages. We propose an original architecture composed of two deep learning-driven components: a phage-bacterium interaction predictor and a phage genome-sequence generator. The former is a multi-branch 1-D convolutional neural network (1D-CNN) that analyses phage and bacterial genomes to predict interactions. The latter is a recurrent neural network, more particularly a long short-term memory (LSTM), that performs genomic modifications to a phage to offer substantial host range improvement. For this component, we developed two different architectures composed of one or two stacked LSTM layers with 256 neurons each. These generators are used to modify, more precisely to rewrite, the genome sequence of 42 selected phages, while the predictor is used to estimate the host range of the modified bacteriophages across 46 strains of Pseudomonas aeruginosa. The proposed generators, trained with an average accuracy of 96.1%, are able to improve the host range for an average of 18 phages among the 42 under study, increasing both their average host range, by 73.0 and 103.7%, and the maximum host ranges from 21 to 24 and 29, respectively. These promising results showed that the use of deep learning methodologies allows genetic modification of phages to extend, for instance, their host range, confirming the potential of these approaches to guide bacteriophage engineering.
Collapse
Affiliation(s)
- Shabnam Ataee
- Institute of Information and Communication Technology (IICT), School of Management and Engineering Vaud (HEIG-VD), Yverdon-les-Bains, Switzerland
- HES-SO University of Applied Sciences and Arts Western Switzerland, Delémont, Switzerland
- CI4CB—Computational Intelligence for Computational Biology, SIB—Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Xavier Brochet
- Institute of Information and Communication Technology (IICT), School of Management and Engineering Vaud (HEIG-VD), Yverdon-les-Bains, Switzerland
- HES-SO University of Applied Sciences and Arts Western Switzerland, Delémont, Switzerland
- CI4CB—Computational Intelligence for Computational Biology, SIB—Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Carlos Andrés Peña-Reyes
- Institute of Information and Communication Technology (IICT), School of Management and Engineering Vaud (HEIG-VD), Yverdon-les-Bains, Switzerland
- HES-SO University of Applied Sciences and Arts Western Switzerland, Delémont, Switzerland
- CI4CB—Computational Intelligence for Computational Biology, SIB—Swiss Institute of Bioinformatics, Lausanne, Switzerland
| |
Collapse
|
10
|
Li J, Yang F, Xiao M, Li A. Advances and challenges in cataloging the human gut virome. Cell Host Microbe 2022; 30:908-916. [PMID: 35834962 DOI: 10.1016/j.chom.2022.06.003] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2022] [Revised: 06/02/2022] [Accepted: 06/07/2022] [Indexed: 11/17/2022]
Abstract
The human gut virome, which is often referred to as the "dark matter" of the gut microbiome, remains understudied. A better understanding of the composition and variations of the gut virome across populations is critical for exploring its impact on diseases and health. A series of advances in the characterization of human gut virome have unveiled high genetic diversity and various functional potentials of gut viruses. Here, we summarize the recently available human gut virome databases and discuss their features, procedures, and challenges with the intention to provide a reference to researchers to use while choosing a profiling database. We also propose a "best practice" for cataloging the viral population.
Collapse
Affiliation(s)
- Junhua Li
- BGI-Shenzhen, Shenzhen 518083, China; Shenzhen Key Laboratory of Unknown Pathogen Identification, BGI-Shenzhen, Shenzhen 518083, China.
| | | | - Minfeng Xiao
- BGI-Shenzhen, Shenzhen 518083, China; Shenzhen Key Laboratory of Unknown Pathogen Identification, BGI-Shenzhen, Shenzhen 518083, China.
| | - Aixin Li
- BGI-Shenzhen, Shenzhen 518083, China; Shenzhen Key Laboratory of Unknown Pathogen Identification, BGI-Shenzhen, Shenzhen 518083, China
| |
Collapse
|
11
|
Zhou F, Gan R, Zhang F, Ren C, Yu L, Si Y, Huang Z. PHISDetector: A Tool to Detect Diverse In Silico Phage-host Interaction Signals for Virome Studies. GENOMICS, PROTEOMICS & BIOINFORMATICS 2022; 20:508-523. [PMID: 35272051 PMCID: PMC9801046 DOI: 10.1016/j.gpb.2022.02.003] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/30/2021] [Revised: 12/22/2021] [Accepted: 02/28/2022] [Indexed: 01/26/2023]
Abstract
Phage-microbe interactions are appealing systems to study coevolution, and have also been increasingly emphasized due to their roles in human health, disease, and the development of novel therapeutics. Phage-microbe interactions leave diverse signals in bacterial and phage genomic sequences, defined as phage-host interaction signals (PHISs), which include clustered regularly interspaced short palindromic repeats (CRISPR) targeting, prophage, and protein-protein interaction signals. In the present study, we developed a novel tool phage-host interaction signal detector (PHISDetector) to predict phage-host interactions by detecting and integrating diverse in silico PHISs, and scoring the probability of phage-host interactions using machine learning models based on PHIS features. We evaluated the performance of PHISDetector on multiple benchmark datasets and application cases. When tested on a dataset of 758 annotated phage-host pairs, PHISDetector yields the prediction accuracies of 0.51 and 0.73 at the species and genus levels, respectively, outperforming other phage-host prediction tools. When applied to on 125,842 metagenomic viral contigs (mVCs) derived from 3042 geographically diverse samples, a detection rate of 54.54% could be achieved. Furthermore, PHISDetector could predict infecting phages for 85.6% of 368 multidrug-resistant (MDR) bacteria and 30% of 454 human gut bacteria obtained from the National Institutes of Health (NIH) Human Microbiome Project (HMP). The PHISDetector can be run either as a web server (http://www.microbiome-bigdata.com/PHISDetector/) for general users to study individual inputs or as a stand-alone version (https://github.com/HIT-ImmunologyLab/PHISDetector) to process massive phage contigs from virome studies.
Collapse
Affiliation(s)
- Fengxia Zhou
- HIT Center for Life Sciences, School of Life Science and Technology, Harbin Institute of Technology, Harbin 150080, China
| | - Rui Gan
- HIT Center for Life Sciences, School of Life Science and Technology, Harbin Institute of Technology, Harbin 150080, China
| | - Fan Zhang
- HIT Center for Life Sciences, School of Life Science and Technology, Harbin Institute of Technology, Harbin 150080, China
| | - Chunyan Ren
- Department of Hematology/oncology, Boston Children’s Hospital, Harvard Medical School, Boston, MA 02115, USA
| | - Ling Yu
- HIT Center for Life Sciences, School of Life Science and Technology, Harbin Institute of Technology, Harbin 150080, China
| | - Yu Si
- HIT Center for Life Sciences, School of Life Science and Technology, Harbin Institute of Technology, Harbin 150080, China
| | - Zhiwei Huang
- HIT Center for Life Sciences, School of Life Science and Technology, Harbin Institute of Technology, Harbin 150080, China,Corresponding author.
| |
Collapse
|
12
|
Jiang Y, Luo J, Huang D, Liu Y, Li DD. Machine Learning Advances in Microbiology: A Review of Methods and Applications. Front Microbiol 2022; 13:925454. [PMID: 35711777 PMCID: PMC9196628 DOI: 10.3389/fmicb.2022.925454] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2022] [Accepted: 05/09/2022] [Indexed: 12/18/2022] Open
Abstract
Microorganisms play an important role in natural material and elemental cycles. Many common and general biology research techniques rely on microorganisms. Machine learning has been gradually integrated with multiple fields of study. Machine learning, including deep learning, aims to use mathematical insights to optimize variational functions to aid microbiology using various types of available data to help humans organize and apply collective knowledge of various research objects in a systematic and scaled manner. Classification and prediction have become the main achievements in the development of microbial community research in the direction of computational biology. This review summarizes the application and development of machine learning and deep learning in the field of microbiology and shows and compares the advantages and disadvantages of different algorithm tools in four fields: microbiome and taxonomy, microbial ecology, pathogen and epidemiology, and drug discovery.
Collapse
|
13
|
Abstract
Since the large-scale experimental characterization of protein–protein interactions (PPIs) is not possible for all species, several computational PPI prediction methods have been developed that harness existing data from other species. While PPI network prediction has been extensively used in eukaryotes, microbial network inference has lagged behind. However, bacterial interactomes can be built using the same principles and techniques; in fact, several methods are better suited to bacterial genomes. These predicted networks allow systems-level analyses in species that lack experimental interaction data. This review describes the current network inference and analysis techniques and summarizes the use of computationally-predicted microbial interactomes to date.
Collapse
|
14
|
Sørensen AN, Woudstra C, Sørensen MCH, Brøndsted L. Subtypes of tail spike proteins predicts the host range of Ackermannviridae phages. Comput Struct Biotechnol J 2021; 19:4854-4867. [PMID: 34527194 PMCID: PMC8432352 DOI: 10.1016/j.csbj.2021.08.030] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2021] [Revised: 08/19/2021] [Accepted: 08/19/2021] [Indexed: 12/01/2022] Open
Abstract
Phages belonging to the Ackermannviridae family encode up to four tail spike proteins (TSPs), each recognizing a specific receptor of their bacterial hosts. Here, we determined the TSPs diversity of 99 Ackermannviridae phages by performing a comprehensive in silico analysis. Based on sequence diversity, we assigned all TSPs into distinctive subtypes of TSP1, TSP2, TSP3 and TSP4, and found each TSP subtype to be specifically associated with the genera (Kuttervirus, Agtrevirus, Limestonevirus, Taipeivirus) of the Ackermannviridae family. Further analysis showed that the N-terminal XD1 and XD2 domains in TSP2 and TSP4, hinging the four TSPs together, are preserved. In contrast, the C-terminal receptor binding modules were only conserved within TSP subtypes, except for some Kuttervirus TSP1s and TSP3s that were similar to specific TSP4s. A conserved motif in TSP1, TSP3 and TSP4 of Kuttervirus phages may allow recombination between receptor binding modules, thus altering host recognition. The receptors for numerous uncharacterized phages expressing TSPs in the same subtypes were predicted using previous host range data. To validate our predictions, we experimentally determined the host recognition of three of the four TSPs expressed by kuttervirus S117. We confirmed that S117 TSP1 and TSP2 bind to their predicted host receptors, and identified the receptor for TSP3, which is shared by 51 other Kuttervirus phages. Kuttervirus phages were thus shown encode a vast genetic diversity of potentially exchangeable TSPs influencing host recognition. Overall, our study demonstrates that comprehensive in silico and host range analysis of TSPs can predict host recognition of Ackermannviridae phages.
Collapse
Key Words
- ANI, Average nucleotide identity
- Ackermannviridae family
- Bacteriophage
- CPS, Capsular polysaccharide
- EOP, Efficiency of plating
- Escherichia coli O:157
- Host range
- LB, Luria-Bertani
- LPS, Lipopolysaccharide
- NCBI, National Center for Biotechnology Information
- O-antigen
- ORF, Open reading frame
- PFU, Plaque formation unit
- RBP, Receptor binding protein
- Receptor-binding proteins
- Salmonella
- TSP, Tail spike protein
- Tail spike proteins
- VriC, Virulence-associated protein
Collapse
Affiliation(s)
- Anders Nørgaard Sørensen
- Department of Veterinary and Animal Sciences, University of Copenhagen, Stigbøjlen 4, 1870 Frederiksberg C, Denmark
| | - Cedric Woudstra
- Department of Veterinary and Animal Sciences, University of Copenhagen, Stigbøjlen 4, 1870 Frederiksberg C, Denmark
| | - Martine C Holst Sørensen
- Department of Veterinary and Animal Sciences, University of Copenhagen, Stigbøjlen 4, 1870 Frederiksberg C, Denmark
| | - Lone Brøndsted
- Department of Veterinary and Animal Sciences, University of Copenhagen, Stigbøjlen 4, 1870 Frederiksberg C, Denmark
| |
Collapse
|
15
|
Li M, Wang Y, Li F, Zhao Y, Liu M, Zhang S, Bin Y, Smith AI, Webb GI, Li J, Song J, Xia J. A Deep Learning-Based Method for Identification of Bacteriophage-Host Interaction. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2021; 18:1801-1810. [PMID: 32813660 PMCID: PMC8703204 DOI: 10.1109/tcbb.2020.3017386] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Multi-drug resistance (MDR) has become one of the greatest threats to human health worldwide, and novel treatment methods of infections caused by MDR bacteria are urgently needed. Phage therapy is a promising alternative to solve this problem, to which the key is correctly matching target pathogenic bacteria with the corresponding therapeutic phage. Deep learning is powerful for mining complex patterns to generate accurate predictions. In this study, we develop PredPHI (Predicting Phage-Host Interactions), a deep learning-based tool capable of predicting the host of phages from sequence data. We collect >3000 phage-host pairs along with their protein sequences from PhagesDB and GenBank databases and extract a set of features. Then we select high-quality negative samples based on the K-Means clustering method and construct a balanced training set. Finally, we employ a deep convolutional neural network to build the predictive model. The results indicate that PredPHI can achieve a predictive performance of 81 percent in terms of the area under the receiver operating characteristic curve on the test set, and the clustering-based method is significantly more robust than that based on randomly selecting negative samples. These results highlight that PredPHI is a useful and accurate tool for identifying phage-host interactions from sequence data.
Collapse
|
16
|
Li M, Zhang W. PHIAF: prediction of phage-host interactions with GAN-based data augmentation and sequence-based feature fusion. Brief Bioinform 2021; 23:6362109. [PMID: 34472593 DOI: 10.1093/bib/bbab348] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2021] [Revised: 07/05/2021] [Accepted: 07/18/2021] [Indexed: 01/01/2023] Open
Abstract
Phage therapy has become one of the most promising alternatives to antibiotics in the treatment of bacterial diseases, and identifying phage-host interactions (PHIs) helps to understand the possible mechanism through which a phage infects bacteria to guide the development of phage therapy. Compared with wet experiments, computational methods of identifying PHIs can reduce costs and save time and are more effective and economic. In this paper, we propose a PHI prediction method with a generative adversarial network (GAN)-based data augmentation and sequence-based feature fusion (PHIAF). First, PHIAF applies a GAN-based data augmentation module, which generates pseudo PHIs to alleviate the data scarcity. Second, PHIAF fuses the features originated from DNA and protein sequences for better performance. Third, PHIAF utilizes an attention mechanism to consider different contributions of DNA/protein sequence-derived features, which also provides interpretability of the prediction model. In computational experiments, PHIAF outperforms other state-of-the-art PHI prediction methods when evaluated via 5-fold cross-validation (AUC and AUPR are 0.88 and 0.86, respectively). An ablation study shows that data augmentation, feature fusion and an attention mechanism are all beneficial to improve the prediction performance of PHIAF. Additionally, four new PHIs with the highest PHIAF score in the case study were verified by recent literature. In conclusion, PHIAF is a promising tool to accelerate the exploration of phage therapy.
Collapse
Affiliation(s)
- Menglu Li
- College of Informatics, Huazhong Agricultural University, Wuhan, 430070, China
| | - Wen Zhang
- College of Informatics, Huazhong Agricultural University, Wuhan, 430070, China
| |
Collapse
|
17
|
Coutinho FH, Zaragoza-Solas A, López-Pérez M, Barylski J, Zielezinski A, Dutilh BE, Edwards R, Rodriguez-Valera F. RaFAH: Host prediction for viruses of Bacteria and Archaea based on protein content. PATTERNS 2021; 2:100274. [PMID: 34286299 PMCID: PMC8276007 DOI: 10.1016/j.patter.2021.100274] [Citation(s) in RCA: 35] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/28/2020] [Revised: 11/23/2020] [Accepted: 05/07/2021] [Indexed: 02/06/2023]
Abstract
Culture-independent approaches have recently shed light on the genomic diversity of viruses of prokaryotes. One fundamental question when trying to understand their ecological roles is: which host do they infect? To tackle this issue we developed a machine-learning approach named Random Forest Assignment of Hosts (RaFAH), that uses scores to 43,644 protein clusters to assign hosts to complete or fragmented genomes of viruses of Archaea and Bacteria. RaFAH displayed performance comparable with that of other methods for virus-host prediction in three different benchmarks encompassing viruses from RefSeq, single amplified genomes, and metagenomes. RaFAH was applied to assembled metagenomic datasets of uncultured viruses from eight different biomes of medical, biotechnological, and environmental relevance. Our analyses led to the identification of 537 sequences of archaeal viruses representing unknown lineages, whose genomes encode novel auxiliary metabolic genes, shedding light on how these viruses interfere with the host molecular machinery. RaFAH is available at https://sourceforge.net/projects/rafah/. RaFAH was developed to predict the hosts of viruses of Bacteria and Archaea RaFAH displayed comparable or superior performance to other host-prediction tools RaFAH performed well across viromes from eight different ecosystems RaFAH identified hundreds of genomic sequences as derived from viruses of Archaea
Viruses that infect Bacteria and Archaea are ubiquitous and extremely abundant. Recent advances have led to the discovery of many thousands of complete and partial genomes of these biological entities. Understanding the biology of these viruses and how they influence their ecosystems depends on knowing which hosts they infect. We developed a tool that uses data from complete or fragmented genomes to predict the hosts of viruses using a machine-learning approach. Our tool, RaFAH, displayed performance comparable with or superior to that of other host-prediction tools. In addition, it identified hundreds of sequences as derived from the genomes of viruses of Archaea, which are one of the least characterized fractions of the global virosphere.
Collapse
Affiliation(s)
- Felipe Hernandes Coutinho
- Evolutionary Genomics Group, Departamento de Producción Vegetal y Microbiología, Universidad Miguel Hernández, Aptdo. 18., Ctra. Alicante-Valencia N-332, s/n, San Juan de Alicante, 03550 Alicante, Spain
| | - Asier Zaragoza-Solas
- Evolutionary Genomics Group, Departamento de Producción Vegetal y Microbiología, Universidad Miguel Hernández, Aptdo. 18., Ctra. Alicante-Valencia N-332, s/n, San Juan de Alicante, 03550 Alicante, Spain
| | - Mario López-Pérez
- Evolutionary Genomics Group, Departamento de Producción Vegetal y Microbiología, Universidad Miguel Hernández, Aptdo. 18., Ctra. Alicante-Valencia N-332, s/n, San Juan de Alicante, 03550 Alicante, Spain
| | - Jakub Barylski
- Molecular Virology Research Unit, Faculty of Biology, Adam Mickiewicz University Poznan, 61-614 Poznan, Poland
| | - Andrzej Zielezinski
- Department of Computational Biology, Faculty of Biology, Adam Mickiewicz University Poznan, 61-614 Poznan, Poland
| | - Bas E Dutilh
- Centre for Molecular and Biomolecular Informatics (CMBI), Radboud University Medical Centre/Radboud Institute for Molecular Life Sciences, 6525 GA Nijmegen, the Netherlands.,Theoretical Biology and Bioinformatics, Science for Life, Utrecht University (UU), 3584 CH Utrecht, the Netherlands
| | - Robert Edwards
- College of Science and Engineering, Flinders University, Bedford Park, SA 5042, Australia
| | - Francisco Rodriguez-Valera
- Evolutionary Genomics Group, Departamento de Producción Vegetal y Microbiología, Universidad Miguel Hernández, Aptdo. 18., Ctra. Alicante-Valencia N-332, s/n, San Juan de Alicante, 03550 Alicante, Spain.,Moscow Institute of Physics and Technology, Dolgoprudny 141701, Russia
| |
Collapse
|
18
|
Global overview and major challenges of host prediction methods for uncultivated phages. Curr Opin Virol 2021; 49:117-126. [PMID: 34126465 DOI: 10.1016/j.coviro.2021.05.003] [Citation(s) in RCA: 43] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2021] [Revised: 05/20/2021] [Accepted: 05/22/2021] [Indexed: 12/14/2022]
Abstract
Bacterial communities play critical roles across all of Earth's biomes, affecting human health and global ecosystem functioning. They do so under strong constraints exerted by viruses, that is, bacteriophages or 'phages'. Phages can reshape bacterial communities' structure, influence long-term evolution of bacterial populations, and alter host cell metabolism during infection. Metagenomics approaches, that is, shotgun sequencing of environmental DNA or RNA, recently enabled large-scale exploration of phage genomic diversity, yielding several millions of phage genomes now to be further analyzed and characterized. One major challenge however is the lack of direct host information for these phages. Several methods and tools have been proposed to bioinformatically predict the potential host(s) of uncultivated phages based only on genome sequence information. Here we review these different approaches and highlight their distinct strengths and limitations. We also outline complementary experimental assays which are being proposed to validate and refine these bioinformatic predictions.
Collapse
|
19
|
Sudhakar P, Machiels K, Verstockt B, Korcsmaros T, Vermeire S. Computational Biology and Machine Learning Approaches to Understand Mechanistic Microbiome-Host Interactions. Front Microbiol 2021; 12:618856. [PMID: 34046017 PMCID: PMC8148342 DOI: 10.3389/fmicb.2021.618856] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2020] [Accepted: 03/19/2021] [Indexed: 12/11/2022] Open
Abstract
The microbiome, by virtue of its interactions with the host, is implicated in various host functions including its influence on nutrition and homeostasis. Many chronic diseases such as diabetes, cancer, inflammatory bowel diseases are characterized by a disruption of microbial communities in at least one biological niche/organ system. Various molecular mechanisms between microbial and host components such as proteins, RNAs, metabolites have recently been identified, thus filling many gaps in our understanding of how the microbiome modulates host processes. Concurrently, high-throughput technologies have enabled the profiling of heterogeneous datasets capturing community level changes in the microbiome as well as the host responses. However, due to limitations in parallel sampling and analytical procedures, big gaps still exist in terms of how the microbiome mechanistically influences host functions at a system and community level. In the past decade, computational biology and machine learning methodologies have been developed with the aim of filling the existing gaps. Due to the agnostic nature of the tools, they have been applied in diverse disease contexts to analyze and infer the interactions between the microbiome and host molecular components. Some of these approaches allow the identification and analysis of affected downstream host processes. Most of the tools statistically or mechanistically integrate different types of -omic and meta -omic datasets followed by functional/biological interpretation. In this review, we provide an overview of the landscape of computational approaches for investigating mechanistic interactions between individual microbes/microbiome and the host and the opportunities for basic and clinical research. These could include but are not limited to the development of activity- and mechanism-based biomarkers, uncovering mechanisms for therapeutic interventions and generating integrated signatures to stratify patients.
Collapse
Affiliation(s)
- Padhmanand Sudhakar
- Department of Chronic Diseases, Metabolism and Ageing, Translational Research Center for Gastrointestinal Disorders (TARGID), KU Leuven, Leuven, Belgium
- Earlham Institute, Norwich, United Kingdom
- Quadram Institute Bioscience, Norwich, United Kingdom
| | - Kathleen Machiels
- Department of Chronic Diseases, Metabolism and Ageing, Translational Research Center for Gastrointestinal Disorders (TARGID), KU Leuven, Leuven, Belgium
| | - Bram Verstockt
- Department of Chronic Diseases, Metabolism and Ageing, Translational Research Center for Gastrointestinal Disorders (TARGID), KU Leuven, Leuven, Belgium
- Department of Gastroenterology and Hepatology, University Hospitals Leuven, KU Leuven, Leuven, Belgium
| | - Tamas Korcsmaros
- Earlham Institute, Norwich, United Kingdom
- Quadram Institute Bioscience, Norwich, United Kingdom
| | - Séverine Vermeire
- Department of Chronic Diseases, Metabolism and Ageing, Translational Research Center for Gastrointestinal Disorders (TARGID), KU Leuven, Leuven, Belgium
- Department of Gastroenterology and Hepatology, University Hospitals Leuven, KU Leuven, Leuven, Belgium
| |
Collapse
|
20
|
Hastings J, Glauer M, Memariani A, Neuhaus F, Mossakowski T. Learning chemistry: exploring the suitability of machine learning for the task of structure-based chemical ontology classification. J Cheminform 2021; 13:23. [PMID: 33726837 PMCID: PMC7962259 DOI: 10.1186/s13321-021-00500-8] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2020] [Accepted: 02/26/2021] [Indexed: 12/22/2022] Open
Abstract
Chemical data is increasingly openly available in databases such as PubChem, which contains approximately 110 million compound entries as of February 2021. With the availability of data at such scale, the burden has shifted to organisation, analysis and interpretation. Chemical ontologies provide structured classifications of chemical entities that can be used for navigation and filtering of the large chemical space. ChEBI is a prominent example of a chemical ontology, widely used in life science contexts. However, ChEBI is manually maintained and as such cannot easily scale to the full scope of public chemical data. There is a need for tools that are able to automatically classify chemical data into chemical ontologies, which can be framed as a hierarchical multi-class classification problem. In this paper we evaluate machine learning approaches for this task, comparing different learning frameworks including logistic regression, decision trees and long short-term memory artificial neural networks, and different encoding approaches for the chemical structures, including cheminformatics fingerprints and character-based encoding from chemical line notation representations. We find that classical learning approaches such as logistic regression perform well with sets of relatively specific, disjoint chemical classes, while the neural network is able to handle larger sets of overlapping classes but needs more examples per class to learn from, and is not able to make a class prediction for every molecule. Future work will explore hybrid and ensemble approaches, as well as alternative network architectures including neuro-symbolic approaches.
Collapse
Affiliation(s)
- Janna Hastings
- Department of Computer Science, Otto-von-Guericke University of Magdeburg, Magdeburg, Germany
| | - Martin Glauer
- Department of Computer Science, Otto-von-Guericke University of Magdeburg, Magdeburg, Germany
| | - Adel Memariani
- Department of Computer Science, Otto-von-Guericke University of Magdeburg, Magdeburg, Germany
| | - Fabian Neuhaus
- Department of Computer Science, Otto-von-Guericke University of Magdeburg, Magdeburg, Germany
| | - Till Mossakowski
- Department of Computer Science, Otto-von-Guericke University of Magdeburg, Magdeburg, Germany
| |
Collapse
|
21
|
Abstract
Supplemental Digital Content is available in the text. Objective: Bacterial infections caused by antibiotic-resistant pathogens are a major problem for patients requiring critical care. An approach to combat resistance is the use of bacterial viruses known as “phage therapy.” This review provides a brief “clinicians guide” to phage biology and discusses recent applications in the context of common infections encountered in ICUs. Data Sources: Research articles were sourced from PubMed using search term combinations of “bacteriophages” or “phage therapy” with either “lung,” “pneumonia,” “bloodstream,” “abdominal,” “urinary tract,” or “burn wound.” Study Selection: Preclinical trials using animal models, case studies detailing compassionate use of phage therapy in humans, and randomized controlled trials were included. Data Extraction: We systematically extracted: 1) the infection setting, 2) the causative bacterial pathogen and its antibiotic resistance profile, 3) the nature of the phage therapeutic and how it was administered, 4) outcomes of the therapy, and 5) adverse events. Data Synthesis: Phage therapy for the treatment of experimental infections in animal models and in cases of compassionate use in humans has been associated with largely positive outcomes. These findings, however, have failed to translate into positive patient outcomes in the limited number of randomized controlled trails that have been performed to date. Conclusions: Widespread clinical implementation of phage therapy depends on success in randomized controlled trials. Additional translational and reverse translational studies aimed at overcoming phage resistance, exploiting phage-antibiotic synergies, and optimizing phage administration will likely improve the design and outcome of future trials.
Collapse
|
22
|
Lamy-Besnier Q, Brancotte B, Ménager H, Debarbieux L. Viral Host Range database, an online tool for recording, analyzing and disseminating virus-host interactions. Bioinformatics 2021; 37:2798-2801. [PMID: 33594411 PMCID: PMC8428608 DOI: 10.1093/bioinformatics/btab070] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2020] [Revised: 01/11/2021] [Accepted: 02/15/2021] [Indexed: 11/13/2022] Open
Abstract
Motivation Viruses are ubiquitous in the living world, and their ability to infect more than one host defines their host range. However, information about which virus infects which host, and about which host is infected by which virus, is not readily available. Results We developed a web-based tool called the Viral Host Range database to record, analyze and disseminate experimental host range data for viruses infecting archaea, bacteria and eukaryotes. Availability and implementation The ViralHostRangeDB application is available from https://viralhostrangedb.pasteur.cloud. Its source code is freely available from the Gitlab instance of Institut Pasteur (https://gitlab.pasteur.fr/hub/viralhostrangedb).
Collapse
Affiliation(s)
- Quentin Lamy-Besnier
- Bacteriophage, Bacterium, Host Laboratory, Department of Microbiology, Institut Pasteur, Paris, F-75015, France.,Université de Paris, Paris, France
| | - Bryan Brancotte
- Bioinformatics and Biostatistics, Institut Pasteur, Paris, F-75015, France
| | - Hervé Ménager
- Bioinformatics and Biostatistics, Institut Pasteur, Paris, F-75015, France
| | - Laurent Debarbieux
- Bacteriophage, Bacterium, Host Laboratory, Department of Microbiology, Institut Pasteur, Paris, F-75015, France
| |
Collapse
|
23
|
Boeckaerts D, Stock M, Criel B, Gerstmans H, De Baets B, Briers Y. Predicting bacteriophage hosts based on sequences of annotated receptor-binding proteins. Sci Rep 2021; 11:1467. [PMID: 33446856 PMCID: PMC7809048 DOI: 10.1038/s41598-021-81063-4] [Citation(s) in RCA: 43] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2020] [Accepted: 12/30/2020] [Indexed: 12/04/2022] Open
Abstract
Nowadays, bacteriophages are increasingly considered as an alternative treatment for a variety of bacterial infections in cases where classical antibiotics have become ineffective. However, characterizing the host specificity of phages remains a labor- and time-intensive process. In order to alleviate this burden, we have developed a new machine-learning-based pipeline to predict bacteriophage hosts based on annotated receptor-binding protein (RBP) sequence data. We focus on predicting bacterial hosts from the ESKAPE group, Escherichia coli, Salmonella enterica and Clostridium difficile. We compare the performance of our predictive model with that of the widely used Basic Local Alignment Search Tool (BLAST). Our best-performing predictive model reaches Precision-Recall Area Under the Curve (PR-AUC) scores between 73.6 and 93.8% for different levels of sequence similarity in the collected data. Our model reaches a performance comparable to that of BLASTp when sequence similarity in the data is high and starts outperforming BLASTp when sequence similarity drops below 75%. Therefore, our machine learning methods can be especially useful in settings in which sequence similarity to other known sequences is low. Predicting the hosts of novel metagenomic RBP sequences could extend our toolbox to tune the host spectrum of phages or phage tail-like bacteriocins by swapping RBPs.
Collapse
Affiliation(s)
- Dimitri Boeckaerts
- KERMIT, Department of Data Analysis and Mathematical Modelling, Ghent University, Ghent, Belgium
- Laboratory of Applied Biotechnology, Department of Biotechnology, Ghent University, Ghent, Belgium
| | - Michiel Stock
- KERMIT, Department of Data Analysis and Mathematical Modelling, Ghent University, Ghent, Belgium
| | - Bjorn Criel
- Laboratory of Applied Biotechnology, Department of Biotechnology, Ghent University, Ghent, Belgium
| | - Hans Gerstmans
- Laboratory of Applied Biotechnology, Department of Biotechnology, Ghent University, Ghent, Belgium
- Laboratory of Gene Technology, Department of Biosystems, KU Leuven, Leuven, Belgium
- MeBioS-Biosensors group, Department of BioSystems, KU Leuven, Leuven, Belgium
| | - Bernard De Baets
- KERMIT, Department of Data Analysis and Mathematical Modelling, Ghent University, Ghent, Belgium
| | - Yves Briers
- Laboratory of Applied Biotechnology, Department of Biotechnology, Ghent University, Ghent, Belgium.
| |
Collapse
|
24
|
Van Belleghem JD, Manasherob R, Miȩdzybrodzki R, Rogóż P, Górski A, Suh GA, Bollyky PL, Amanatullah DF. The Rationale for Using Bacteriophage to Treat and Prevent Periprosthetic Joint Infections. Front Microbiol 2020; 11:591021. [PMID: 33408703 PMCID: PMC7779626 DOI: 10.3389/fmicb.2020.591021] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2020] [Accepted: 11/24/2020] [Indexed: 12/19/2022] Open
Abstract
Prosthetic joint infection (PJI) is a devastating complication after a joint replacement. PJI and its treatment have a high monetary cost, morbidity, and mortality. The lack of success treating PJI with conventional antibiotics alone is related to the presence of bacterial biofilm on medical implants. Consequently, surgical removal of the implant and prolonged intravenous antibiotics to eradicate the infection are necessary prior to re-implanting a new prosthetic joint. Growing clinical data shows that bacterial predators, called bacteriophages (phages), could be an alternative treatment strategy or prophylactic approach for PJI. Phages could further be exploited to degrade biofilms, making bacteria more susceptible to antibiotics and enabling potential combinatorial therapies. Emerging research suggests that phages may also directly interact with the innate immune response. Phage therapy may play an important, and currently understudied, role in the clearance of PJI, and has the potential to treat thousands of patients who would either have to undergo revision surgery to attempt to clear an infections, take antibiotics for a prolonged period to try and suppress the re-emerging infection, or potentially risk losing a limb.
Collapse
Affiliation(s)
- Jonas D. Van Belleghem
- Division of Infectious Diseases and Geographic Medicine, Department of Medicine, Stanford University, Stanford, CA, United States
| | - Robert Manasherob
- Department of Orthopaedic Surgery, Stanford University, Stanford, CA, United States
| | - Ryszard Miȩdzybrodzki
- Ludwik Hirszfeld Institute of Immunology and Experimental Therapy, Polish Academy of Sciences, Wrocław, Poland
| | - Paweł Rogóż
- Ludwik Hirszfeld Institute of Immunology and Experimental Therapy, Polish Academy of Sciences, Wrocław, Poland
| | - Andrzej Górski
- Ludwik Hirszfeld Institute of Immunology and Experimental Therapy, Polish Academy of Sciences, Wrocław, Poland
| | | | - Paul L. Bollyky
- Division of Infectious Diseases and Geographic Medicine, Department of Medicine, Stanford University, Stanford, CA, United States
| | - Derek F. Amanatullah
- Department of Orthopaedic Surgery, Stanford University, Stanford, CA, United States
| |
Collapse
|
25
|
Lenneman BR, Fernbach J, Loessner MJ, Lu TK, Kilcher S. Enhancing phage therapy through synthetic biology and genome engineering. Curr Opin Biotechnol 2020; 68:151-159. [PMID: 33310655 DOI: 10.1016/j.copbio.2020.11.003] [Citation(s) in RCA: 56] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2020] [Revised: 10/26/2020] [Accepted: 11/09/2020] [Indexed: 12/20/2022]
Abstract
The antimicrobial and therapeutic efficacy of bacteriophages is currently limited, mostly due to rapid emergence of phage-resistance and the inability of most phage isolates to bind and infect a broad range of clinical strains. Here, we discuss how phage therapy can be improved through recent advances in genetic engineering. First, we outline how receptor-binding proteins and their relevant structural domains are engineered to redirect phage specificity and to avoid resistance. Next, we summarize how phages are reprogrammed as prokaryotic gene therapy vectors that deliver antimicrobial 'payload' proteins, such as sequence-specific nucleases, to target defined cells within complex microbiomes. Finally, we delineate big data- and novel artificial intelligence-driven approaches that may guide the design of improved synthetic phage in the future.
Collapse
Affiliation(s)
- Bryan R Lenneman
- Research Laboratory of Electronics, Massachusetts Institute of Technology (MIT), Cambridge, MA 02139, USA; Synthetic Biology Center, MIT, Cambridge, MA 02139, USA
| | - Jonas Fernbach
- Institute of Food, Nutrition, and Health, ETH Zürich, Schmelzbergstrasse 7, 8092 Zürich, Switzerland
| | - Martin J Loessner
- Institute of Food, Nutrition, and Health, ETH Zürich, Schmelzbergstrasse 7, 8092 Zürich, Switzerland
| | - Timothy K Lu
- Research Laboratory of Electronics, Massachusetts Institute of Technology (MIT), Cambridge, MA 02139, USA; Synthetic Biology Center, MIT, Cambridge, MA 02139, USA; Department of Electrical Engineering and Computer Science, MIT, Cambridge, MA 02139, USA; Department of Biological Engineering, MIT, Cambridge, MA 02139, USA
| | - Samuel Kilcher
- Institute of Food, Nutrition, and Health, ETH Zürich, Schmelzbergstrasse 7, 8092 Zürich, Switzerland.
| |
Collapse
|
26
|
Pollock J, Low AS, McHugh RE, Muwonge A, Stevens MP, Corbishley A, Gally DL. Alternatives to antibiotics in a One Health context and the role genomics can play in reducing antimicrobial use. Clin Microbiol Infect 2020; 26:1617-1621. [PMID: 32220638 DOI: 10.1016/j.cmi.2020.02.028] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2019] [Revised: 02/19/2020] [Accepted: 02/22/2020] [Indexed: 01/16/2023]
Abstract
BACKGROUND This review follows on from the International Conference on One Health Antimicrobial Resistance (ICOHAR 2019), where strategies to improve the fundamental understanding and management of antimicrobial resistance at the interface between humans, animals and the environment were discussed. OBJECTIVE This review identifies alternatives to antimicrobials in a One Health context, noting how advances in genomic technologies are assisting their development and enabling more targeted use of antimicrobials. SOURCES Key articles on the use of microbiota modulation, livestock breeding and gene editing, vaccination, antivirulence strategies and bacteriophage therapy are discussed. CONTENT Antimicrobials are central for disease control, but reducing their use is paramount as a result of the rise of transmissible antimicrobial resistance. This review discusses antimicrobial alternatives in the context of improved understanding of fundamental host-pathogen and microbiota interactions using genomic tools. IMPLICATIONS Host and microbial genomics and other novel technologies play an important role in devising disease control strategies for healthier animals and humans that in turn reduce our reliance on antimicrobials.
Collapse
Affiliation(s)
- J Pollock
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, Edinburgh, UK
| | - A S Low
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, Edinburgh, UK
| | - R E McHugh
- Institute of Infection, Immunity & Inflammation, University of Glasgow, Glasgow, Scotland, UK; Strathclyde Institute of Pharmacy and Biomedical Sciences, University of Strathclyde, Glasgow, Scotland, UK
| | - A Muwonge
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, Edinburgh, UK
| | - M P Stevens
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, Edinburgh, UK
| | - A Corbishley
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, Edinburgh, UK
| | - D L Gally
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, Edinburgh, UK.
| |
Collapse
|
27
|
Unlocking the next generation of phage therapy: the key is in the receptors. Curr Opin Biotechnol 2020; 68:115-123. [PMID: 33202354 DOI: 10.1016/j.copbio.2020.10.002] [Citation(s) in RCA: 67] [Impact Index Per Article: 16.8] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2020] [Revised: 09/21/2020] [Accepted: 10/09/2020] [Indexed: 12/18/2022]
Abstract
Phage therapy, the clinical use of viruses that kill bacteria, is a promising strategy in the fight against antimicrobial resistance. Before administration, phages undergo a careful examination of their safety and interactions with target bacteria. This characterization seldom includes identifying the receptor on the bacterial surface involved in phage adsorption. In this perspective article, we propose that understanding the function and location of these phage receptors can open the door to improved and innovative ways to use phage therapy. With knowledge of phage receptors, we can design intelligent phage cocktails, discover new phage-derived antimicrobials, and steer the evolution of phage-resistance towards clinically exploitable phenotypes. In an effort to jump-start this initiative, we recommend priority groups of hosts and phages. Finally, we review modern approaches for the identification of phage receptors, including molecular platforms for high-throughput mutagenesis, synthetic biology, and machine learning.
Collapse
|
28
|
Application of artificial intelligence to the in silico assessment of antimicrobial resistance and risks to human and animal health presented by priority enteric bacterial pathogens. ACTA ACUST UNITED AC 2020; 46:180-185. [PMID: 32673383 DOI: 10.14745/ccdr.v46i06a05] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Each year, approximately one in eight Canadians are affected by foodborne illness, either through outbreaks or sporadic illness, with animals being the major reservoir for the pathogens. Whole genome sequence analyses are now routinely implemented by public and animal health laboratories to define epidemiological disease clusters and to identify potential sources of infection. Similarly, a number of bioinformatics tools can be used to identify virulence and antimicrobial resistance (AMR) determinants in the genomes of pathogenic strains. Many important clinical and phenotypic characteristics of these pathogens can now be predicted using machine learning algorithms applied to whole genome sequence data. In this overview, we compare the ability of support vector machines, gradient-boosted decision trees and artificial neural networks to predict the levels of AMR within Salmonella enterica and extended-spectrum β-lactamase (ESBL) producing Escherichia coli. We show that minimum inhibitory concentrations (MIC) for each of 13 antimicrobials for S. enterica strains can be accurately determined, and that ESBL-producing E. coli strains can be accurately classified as susceptible, intermediate or resistant for each of seven antimicrobials. In addition to AMR and bacterial populations of greatest risk to human health, artificial intelligence algorithms hold promise as tools to predict other clinically and epidemiologically important phenotypes of enteric pathogens.
Collapse
|
29
|
Young F, Rogers S, Robertson DL. Predicting host taxonomic information from viral genomes: A comparison of feature representations. PLoS Comput Biol 2020; 16:e1007894. [PMID: 32453718 PMCID: PMC7307784 DOI: 10.1371/journal.pcbi.1007894] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2019] [Revised: 06/22/2020] [Accepted: 04/21/2020] [Indexed: 12/13/2022] Open
Abstract
The rise in metagenomics has led to an exponential growth in virus discovery. However, the majority of these new virus sequences have no assigned host. Current machine learning approaches to predicting virus host interactions have a tendency to focus on nucleotide features, ignoring other representations of genomic information. Here we investigate the predictive potential of features generated from four different ‘levels’ of viral genome representation: nucleotide, amino acid, amino acid properties and protein domains. This more fully exploits the biological information present in the virus genomes. Over a hundred and eighty binary datasets for infecting versus non-infecting viruses at all taxonomic ranks of both eukaryote and prokaryote hosts were compiled. The viral genomes were converted into the four different levels of genome representation and twenty feature sets were generated by extracting k-mer compositions and predicted protein domains. We trained and tested Support Vector Machine, SVM, classifiers to compare the predictive capacity of each of these feature sets for each dataset. Our results show that all levels of genome representation are consistently predictive of host taxonomy and that prediction k-mer composition improves with increasing k-mer length for all k-mer based features. Using a phylogenetically aware holdout method, we demonstrate that the predictive feature sets contain signals reflecting both the evolutionary relationship between the viruses infecting related hosts, and host-mimicry. Our results demonstrate that incorporating a range of complementary features, generated purely from virus genome sequences, leads to improved accuracy for a range of virus host prediction tasks enabling computational assignment of host taxonomic information. Elucidating the host of a newly identified virus species is an important challenge, with applications from knowing the source species of a newly emerged pathogen to understanding the bacteriophage-host relationships within the microbiome of any of earth’s ecosystems. Current high throughput methods used to identify viruses within biological or environmental samples have resulted in an unprecedented increase in virus discovery. However, for the majority of these virus genomes the host species/taxonomic classification remains unknown. To address this gap in our knowledge there is a need for fast, accurate computational methods for the assignment of putative host taxonomic information. Machine learning is an ideal approach but to maximise predictive accuracy the viral genomes need to be represented in a format (sets of features) that makes the discriminative information available to the machine learning algorithm. Here, we compare different types of features derived from the same viral genomes for their ability to predict host information. Our results demonstrate that all these feature sets are predictive of host taxonomy and when combined have the potential to improve accuracy over the use of individual feature sets across many virus host prediction applications.
Collapse
Affiliation(s)
- Francesca Young
- MRC-University of Glasgow Centre For Virus Research, Glasgow, United Kingdom
| | - Simon Rogers
- School of Computing Science, University of Glasgow, Glasgow, United Kingdom
| | - David L. Robertson
- MRC-University of Glasgow Centre For Virus Research, Glasgow, United Kingdom
- * E-mail:
| |
Collapse
|
30
|
Caflisch KM, Suh GA, Patel R. Biological challenges of phage therapy and proposed solutions: a literature review. Expert Rev Anti Infect Ther 2019; 17:1011-1041. [PMID: 31735090 DOI: 10.1080/14787210.2019.1694905] [Citation(s) in RCA: 49] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Introduction: In light of the emergence of antibiotic-resistant bacteria, phage (bacteriophage) therapy has been recognized as a potential alternative or addition to antibiotics in Western medicine for use in humans.Areas covered: This review assessed the scientific literature on phage therapy published between 1 January 2007 and 21 October 2019, with a focus on the successes and challenges of this prospective therapeutic.Expert opinion: Efficacy has been shown in animal models and experimental findings suggest promise for the safety of human phagotherapy. Significant challenges remain to be addressed prior to the standardization of phage therapy in the West, including the development of phage-resistant bacteria; the pharmacokinetic complexities of phage; and any potential human immune response incited by phagotherapy.
Collapse
Affiliation(s)
- Katherine M Caflisch
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic, Rochester, MN, USA
| | - Gina A Suh
- Division of Infectious Diseases, Department of Medicine, Mayo Clinic, Rochester, MN, USA
| | - Robin Patel
- Division of Infectious Diseases, Department of Medicine, Mayo Clinic, Rochester, MN, USA.,Division of Clinical Microbiology, Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN, USA
| |
Collapse
|
31
|
Computational Health Engineering Applied to Model Infectious Diseases and Antimicrobial Resistance Spread. APPLIED SCIENCES-BASEL 2019. [DOI: 10.3390/app9122486] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
Infectious diseases are the primary cause of mortality worldwide. The dangers of infectious disease are compounded with antimicrobial resistance, which remains the greatest concern for human health. Although novel approaches are under investigation, the World Health Organization predicts that by 2050, septicaemia caused by antimicrobial resistant bacteria could result in 10 million deaths per year. One of the main challenges in medical microbiology is to develop novel experimental approaches, which enable a better understanding of bacterial infections and antimicrobial resistance. After the introduction of whole genome sequencing, there was a great improvement in bacterial detection and identification, which also enabled the characterization of virulence factors and antimicrobial resistance genes. Today, the use of in silico experiments jointly with computational and machine learning offer an in depth understanding of systems biology, allowing us to use this knowledge for the prevention, prediction, and control of infectious disease. Herein, the aim of this review is to discuss the latest advances in human health engineering and their applicability in the control of infectious diseases. An in-depth knowledge of host–pathogen–protein interactions, combined with a better understanding of a host’s immune response and bacterial fitness, are key determinants for halting infectious diseases and antimicrobial resistance dissemination.
Collapse
|
32
|
Tkachev V, Sorokin M, Mescheryakov A, Simonov A, Garazha A, Buzdin A, Muchnik I, Borisov N. FLOating-Window Projective Separator (FloWPS): A Data Trimming Tool for Support Vector Machines (SVM) to Improve Robustness of the Classifier. Front Genet 2019; 9:717. [PMID: 30697229 PMCID: PMC6341065 DOI: 10.3389/fgene.2018.00717] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2018] [Accepted: 12/21/2018] [Indexed: 01/31/2023] Open
Abstract
Here, we propose a heuristic technique of data trimming for SVM termed FLOating Window Projective Separator (FloWPS), tailored for personalized predictions based on molecular data. This procedure can operate with high throughput genetic datasets like gene expression or mutation profiles. Its application prevents SVM from extrapolation by excluding non-informative features. FloWPS requires training on the data for the individuals with known clinical outcomes to create a clinically relevant classifier. The genetic profiles linked with the outcomes are broken as usual into the training and validation datasets. The unique property of FloWPS is that irrelevant features in validation dataset that don’t have significant number of neighboring hits in the training dataset are removed from further analyses. Next, similarly to the k nearest neighbors (kNN) method, for each point of a validation dataset, FloWPS takes into account only the proximal points of the training dataset. Thus, for every point of a validation dataset, the training dataset is adjusted to form a floating window. FloWPS performance was tested on ten gene expression datasets for 992 cancer patients either responding or not on the different types of chemotherapy. We experimentally confirmed by leave-one-out cross-validation that FloWPS enables to significantly increase quality of a classifier built based on the classical SVM in most of the applications, particularly for polynomial kernels.
Collapse
Affiliation(s)
- Victor Tkachev
- Department of Bioinformatics and Molecular Networks, OmicsWay Corporation, Walnut, CA, United States
| | - Maxim Sorokin
- Department of Bioinformatics and Molecular Networks, OmicsWay Corporation, Walnut, CA, United States.,Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Moscow, Russia
| | | | - Alexander Simonov
- Department of Bioinformatics and Molecular Networks, OmicsWay Corporation, Walnut, CA, United States
| | - Andrew Garazha
- Department of Bioinformatics and Molecular Networks, OmicsWay Corporation, Walnut, CA, United States
| | - Anton Buzdin
- Department of Bioinformatics and Molecular Networks, OmicsWay Corporation, Walnut, CA, United States.,Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Moscow, Russia.,I.M. Sechenov First Moscow State Medical University (Sechenov University), Moscow, Russia
| | - Ilya Muchnik
- Hill Center, Rutgers University, Piscataway, NJ, United States
| | - Nicolas Borisov
- Department of Bioinformatics and Molecular Networks, OmicsWay Corporation, Walnut, CA, United States.,I.M. Sechenov First Moscow State Medical University (Sechenov University), Moscow, Russia
| |
Collapse
|