1
|
Jung JM, Rahman A, Schiffer AM, Weisberg AJ. Beav: a bacterial genome and mobile element annotation pipeline. mSphere 2024:e0020924. [PMID: 39037262 DOI: 10.1128/msphere.00209-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2024] [Accepted: 06/28/2024] [Indexed: 07/23/2024] Open
Abstract
Comprehensive and accurate genome annotation is crucial for inferring the predicted functions of an organism. Numerous tools exist to annotate genes, gene clusters, mobile genetic elements, and other diverse features. However, these tools and pipelines can be difficult to install and run, be specialized for a particular element or feature, or lack annotations for larger elements that provide important genomic context. Integrating results across analyses is also important for understanding gene function. To address these challenges, we present the Beav annotation pipeline. Beav is a command-line tool that automates the annotation of bacterial genome sequences, mobile genetic elements, molecular systems and gene clusters, key regulatory features, and other elements. Beav uses existing tools in addition to custom models, scripts, and databases to annotate diverse elements, systems, and sequence features. Custom databases for plant-associated microbes are incorporated to improve annotation of key virulence and symbiosis genes in agriculturally important pathogens and mutualists. Beav includes an optional Agrobacterium-specific pipeline that identifies and classifies oncogenic plasmids and annotates plasmid-specific features. Following the completion of all analyses, annotations are consolidated to produce a single comprehensive output. Finally, Beav generates publication-quality genome and plasmid maps. Beav is on Bioconda and is available for download at https://github.com/weisberglab/beav. IMPORTANCE Annotation of genome features, such as the presence of genes and their predicted function, or larger loci encoding secretion systems or biosynthetic gene clusters, is necessary for understanding the functions encoded by an organism. Genomes can also host diverse mobile genetic elements, such as integrative and conjugative elements and/or phages, that are often not annotated by existing pipelines. These elements can horizontally mobilize genes encoding for virulence, antimicrobial resistance, or other adaptive functions and alter the phenotype of an organism. We developed a software pipeline, called Beav, that combines new and existing tools for the comprehensive annotation of these and other major features. Existing pipelines often misannotate loci important for virulence or mutualism in plant-associated bacteria. Beav includes custom databases and optional workflows for the improved annotation of plant-associated bacteria. Beav is designed to be easy to install and run, making comprehensive genome annotation broadly available to the research community.
Collapse
Affiliation(s)
- Jewell M Jung
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, Oregon, USA
| | - Arafat Rahman
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, Oregon, USA
| | - Andrea M Schiffer
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, Oregon, USA
| | - Alexandra J Weisberg
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, Oregon, USA
| |
Collapse
|
2
|
Jensen RO, Schulz F, Roux S, Klingeman DM, Mitchell WP, Udwary D, Moraïs S, Reynoso V, Winkler J, Nagaraju S, De Tissera S, Shapiro N, Ivanova N, Reddy TBK, Mizrahi I, Utturkar SM, Bayer EA, Woyke T, Mouncey NJ, Jewett MC, Simpson SD, Köpke M, Jones DT, Brown SD. Phylogenomics and genetic analysis of solvent-producing Clostridium species. Sci Data 2024; 11:432. [PMID: 38693191 PMCID: PMC11063209 DOI: 10.1038/s41597-024-03210-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2023] [Accepted: 04/02/2024] [Indexed: 05/03/2024] Open
Abstract
The genus Clostridium is a large and diverse group within the Bacillota (formerly Firmicutes), whose members can encode useful complex traits such as solvent production, gas-fermentation, and lignocellulose breakdown. We describe 270 genome sequences of solventogenic clostridia from a comprehensive industrial strain collection assembled by Professor David Jones that includes 194 C. beijerinckii, 57 C. saccharobutylicum, 4 C. saccharoperbutylacetonicum, 5 C. butyricum, 7 C. acetobutylicum, and 3 C. tetanomorphum genomes. We report methods, analyses and characterization for phylogeny, key attributes, core biosynthetic genes, secondary metabolites, plasmids, prophage/CRISPR diversity, cellulosomes and quorum sensing for the 6 species. The expanded genomic data described here will facilitate engineering of solvent-producing clostridia as well as non-model microorganisms with innately desirable traits. Sequences could be applied in conventional platform biocatalysts such as yeast or Escherichia coli for enhanced chemical production. Recently, gene sequences from this collection were used to engineer Clostridium autoethanogenum, a gas-fermenting autotrophic acetogen, for continuous acetone or isopropanol production, as well as butanol, butanoic acid, hexanol and hexanoic acid production.
Collapse
Affiliation(s)
| | - Frederik Schulz
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Simon Roux
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | | | | | - Daniel Udwary
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Sarah Moraïs
- Department of Life Sciences, Ben-Gurion University of the Negev, Beer-Sheva, 84105, Israel
| | | | | | | | | | - Nicole Shapiro
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Natalia Ivanova
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - T B K Reddy
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Itzhak Mizrahi
- Department of Life Sciences, Ben-Gurion University of the Negev, Beer-Sheva, 84105, Israel
| | - Sagar M Utturkar
- Institute for Cancer Research, Purdue University, West Lafayette, IN, USA
| | - Edward A Bayer
- Department of Life Sciences, Ben-Gurion University of the Negev, Beer-Sheva, 84105, Israel
- Department of Biomolecular Sciences, The Weizmann Institute of Science, Rehovot, 7610001, Israel
| | - Tanja Woyke
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
- University of California Merced, Life and Environmental Sciences, Merced, CA, USA
| | - Nigel J Mouncey
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Michael C Jewett
- Department of Bioengineering, Stanford University, Stanford, CA, USA
| | | | | | - David T Jones
- Department of Microbiology, University of Otago, Dunedin, New Zealand.
| | | |
Collapse
|
3
|
Camargo AP, Roux S, Schulz F, Babinski M, Xu Y, Hu B, Chain PSG, Nayfach S, Kyrpides NC. Identification of mobile genetic elements with geNomad. Nat Biotechnol 2023:10.1038/s41587-023-01953-y. [PMID: 37735266 DOI: 10.1038/s41587-023-01953-y] [Citation(s) in RCA: 61] [Impact Index Per Article: 61.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Accepted: 08/17/2023] [Indexed: 09/23/2023]
Abstract
Identifying and characterizing mobile genetic elements in sequencing data is essential for understanding their diversity, ecology, biotechnological applications and impact on public health. Here we introduce geNomad, a classification and annotation framework that combines information from gene content and a deep neural network to identify sequences of plasmids and viruses. geNomad uses a dataset of more than 200,000 marker protein profiles to provide functional gene annotation and taxonomic assignment of viral genomes. Using a conditional random field model, geNomad also detects proviruses integrated into host genomes with high precision. In benchmarks, geNomad achieved high classification performance for diverse plasmids and viruses (Matthews correlation coefficient of 77.8% and 95.3%, respectively), substantially outperforming other tools. Leveraging geNomad's speed and scalability, we processed over 2.7 trillion base pairs of sequencing data, leading to the discovery of millions of viruses and plasmids that are available through the IMG/VR and IMG/PR databases. geNomad is available at https://portal.nersc.gov/genomad .
Collapse
Affiliation(s)
- Antonio Pedro Camargo
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA.
| | - Simon Roux
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Frederik Schulz
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Michal Babinski
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, USA
| | - Yan Xu
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, USA
| | - Bin Hu
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, USA
| | - Patrick S G Chain
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, USA
| | - Stephen Nayfach
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Nikos C Kyrpides
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA.
| |
Collapse
|
4
|
Jurdzinski KT, Mehrshad M, Delgado LF, Deng Z, Bertilsson S, Andersson AF. Large-scale phylogenomics of aquatic bacteria reveal molecular mechanisms for adaptation to salinity. SCIENCE ADVANCES 2023; 9:eadg2059. [PMID: 37235649 PMCID: PMC10219603 DOI: 10.1126/sciadv.adg2059] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/10/2022] [Accepted: 04/21/2023] [Indexed: 05/28/2023]
Abstract
The crossing of environmental barriers poses major adaptive challenges. Rareness of freshwater-marine transitions separates the bacterial communities, but how these are related to brackish counterparts remains elusive, as do the molecular adaptations facilitating cross-biome transitions. We conducted large-scale phylogenomic analysis of freshwater, brackish, and marine quality-filtered metagenome-assembled genomes (11,248). Average nucleotide identity analyses showed that bacterial species rarely existed in multiple biomes. In contrast, distinct brackish basins cohosted numerous species, but their intraspecific population structures displayed clear signs of geographic separation. We further identified the most recent cross-biome transitions, which were rare, ancient, and most commonly directed toward the brackish biome. Transitions were accompanied by systematic changes in amino acid composition and isoelectric point distributions of inferred proteomes, which evolved over millions of years, as well as convergent gains or losses of specific gene functions. Therefore, adaptive challenges entailing proteome reorganization and specific changes in gene content constrains the cross-biome transitions, resulting in species-level separation between aquatic biomes.
Collapse
Affiliation(s)
- Krzysztof T. Jurdzinski
- Department of Gene Technology, KTH Royal Institute of Technology, Science for Life Laboratory, Stockholm, Sweden
| | - Maliheh Mehrshad
- Department of Aquatic Sciences and Assessment, Swedish University of Agricultural Sciences, Uppsala, Sweden
| | - Luis Fernando Delgado
- Department of Gene Technology, KTH Royal Institute of Technology, Science for Life Laboratory, Stockholm, Sweden
| | - Ziling Deng
- Department of Gene Technology, KTH Royal Institute of Technology, Science for Life Laboratory, Stockholm, Sweden
| | - Stefan Bertilsson
- Department of Aquatic Sciences and Assessment, Swedish University of Agricultural Sciences, Uppsala, Sweden
| | - Anders F. Andersson
- Department of Gene Technology, KTH Royal Institute of Technology, Science for Life Laboratory, Stockholm, Sweden
| |
Collapse
|
5
|
van der Gulik PT, Egas M, Kraaijeveld K, Dombrowski N, Groot AT, Spang A, Hoff WD, Gallie J. On distinguishing between canonical tRNA genes and tRNA gene fragments in prokaryotes. RNA Biol 2023; 20:48-58. [PMID: 36727270 PMCID: PMC9897764 DOI: 10.1080/15476286.2023.2172370] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023] Open
Abstract
Automated genome annotation is essential for extracting biological information from sequence data. The identification and annotation of tRNA genes is frequently performed by the software package tRNAscan-SE, the output of which is listed for selected genomes in the Genomic tRNA database (GtRNAdb). Here, we highlight a pervasive error in prokaryotic tRNA gene sets on GtRNAdb: the mis-categorization of partial, non-canonical tRNA genes as standard, canonical tRNA genes. Firstly, we demonstrate the issue using the tRNA gene sets of 20 organisms from the archaeal taxon Thermococcaceae. According to GtRNAdb, these organisms collectively deviate from the expected set of tRNA genes in 15 instances, including the listing of eleven putative canonical tRNA genes. However, after detailed manual annotation, only one of these eleven remains; the others are either partial, non-canonical tRNA genes resulting from the integration of genetic elements or CRISPR-Cas activity (seven instances), or attributable to ambiguities in input sequences (three instances). Secondly, we show that similar examples of the mis-categorization of predicted tRNA sequences occur throughout the prokaryotic sections of GtRNAdb. While both canonical and non-canonical prokaryotic tRNA gene sequences identified by tRNAscan-SE are biologically interesting, the challenge of reliably distinguishing between them remains. We recommend employing a combination of (i) screening input sequences for the genetic elements typically associated with non-canonical tRNA genes, and ambiguities, (ii) activating the tRNAscan-SE automated pseudogene detection function, and (iii) scrutinizing predicted tRNA genes with low isotype scores. These measures greatly reduce manual annotation efforts, and lead to improved prokaryotic tRNA gene set predictions.
Collapse
Affiliation(s)
- Peter T.S. van der Gulik
- Department of Algorithms and Complexity, Centrum Wiskunde & Informatica, Amsterdam, The Netherlands,CONTACT Peter T.S. van der Gulik Centrum Wiskunde & Informatica, Amsterdam, The Netherlands
| | - Martijn Egas
- Department of Evolutionary and Population Biology, Institute for Biodiversity and Ecosystem Dynamics, University of Amsterdam, Amsterdam, The Netherlands
| | - Ken Kraaijeveld
- Leiden Centre for Applied Bioscience, University of Applied Sciences Leiden, Leiden, The Netherlands
| | - Nina Dombrowski
- Department of Marine Microbiology and Biogeochemistry, NIOZ, Royal Netherlands Institute for Sea Research, Den Burg, The Netherlands
| | - Astrid T. Groot
- Department of Evolutionary and Population Biology, Institute for Biodiversity and Ecosystem Dynamics, University of Amsterdam, Amsterdam, The Netherlands
| | - Anja Spang
- Department of Evolutionary and Population Biology, Institute for Biodiversity and Ecosystem Dynamics, University of Amsterdam, Amsterdam, The Netherlands,Department of Marine Microbiology and Biogeochemistry, NIOZ, Royal Netherlands Institute for Sea Research, Den Burg, The Netherlands
| | - Wouter D. Hoff
- Department of Microbiology and Molecular Genetics, Oklahoma State University, Stillwater, Oklahoma, USA,Wouter Hoff
| | - Jenna Gallie
- Department of Evolutionary Theory, Max Planck Institute for Evolutionary Biology, Plön, Germany,Jenna Gallie
| |
Collapse
|
6
|
Vuong I, Mageeney CM, Williams KP. BigDNA: Primer Design Software for Overlap-Based Assembly of Phage Genomes and Larger DNAs. PHAGE (NEW ROCHELLE, N.Y.) 2022; 3:213-220. [PMID: 36793884 PMCID: PMC9917320 DOI: 10.1089/phage.2022.0033] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]
Abstract
Background Gibson assembly and assembly-in-yeast are strategies to create long synthetic DNAs from diverse fragments, for example, when engineering bacteriophage genomes. Design for these methods requires terminal sequence overlaps in the fragments, determining the order of assembly. Design to rebuild a genomic fragment that is too long for a single PCR presents a puzzle since some candidate joint regions cannot yield satisfactory primers for the overlap. No existing overlap assembly design software is open-source, and none explicitly supports rebuilding. Methods We describe here bigDNA software that solves the rebuilding puzzle by recursive backtracking, with options to remove or introduce genes; it also tests for mispriming on the template DNA. BigDNA was tested with 3082 prophages and other genomic islands (GIs), from 20 to 100 kb, and the synthetic Mycoplasma genitalium genome. Results Rebuilding assembly design succeeded for all but ∼1% of GIs. Conclusion BigDNA will speed and standardize assembly design.
Collapse
Affiliation(s)
- Ivan Vuong
- Systems Biology, Sandia National Laboratories, Livermore, California, USA
| | | | - Kelly P. Williams
- Systems Biology, Sandia National Laboratories, Livermore, California, USA
| |
Collapse
|
7
|
When bacteria are phage playgrounds: interactions between viruses, cells, and mobile genetic elements. Curr Opin Microbiol 2022; 70:102230. [PMID: 36335712 DOI: 10.1016/j.mib.2022.102230] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2022] [Revised: 09/23/2022] [Accepted: 10/06/2022] [Indexed: 11/06/2022]
Abstract
Studies of viral adaptation have focused on the selective pressures imposed by hosts. However, there is increasing evidence that interactions between viruses, cells, and other mobile genetic elements are determinant to the success of infections. These interactions are often associated with antagonism and competition, but sometimes involve cooperation or parasitism. We describe two key types of interactions - defense systems and genetic regulation - that allow the partners of the interaction to destroy or control the others. These interactions evolve rapidly by genetic exchanges, including among competing partners. They are sometimes followed by functional diversification. Gene exchanges also facilitate the emergence of cross-talk between elements in the same bacterium. In the end, these processes produce multilayered networks of interactions that shape the outcome of viral infections.
Collapse
|
8
|
Mageeney CM, Trubl G, Williams KP. Improved Mobilome Delineation in Fragmented Genomes. FRONTIERS IN BIOINFORMATICS 2022; 2:866850. [PMID: 36304297 PMCID: PMC9580842 DOI: 10.3389/fbinf.2022.866850] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2022] [Accepted: 03/17/2022] [Indexed: 11/26/2022] Open
Abstract
The mobilome of a microbe, i.e., its set of mobile elements, has major effects on its ecology, and is important to delineate properly in each genome. This becomes more challenging for incomplete genomes, and even more so for metagenome-assembled genomes (MAGs), where misbinning of scaffolds and other losses can occur. Genomic islands (GIs), which integrate into the host chromosome, are a major component of the mobilome. Our GI-detection software TIGER, unique in its precise mapping of GI termini, was applied to 74,561 genomes from 2,473 microbial species, each species containing at least one MAG and one isolate genome. A species-normalized deficit of ∼1.6 GIs/genome was measured for MAGs relative to isolates. To test whether this undercount was due to the higher fragmentation of MAG genomes, TIGER was updated to enable detection of split GIs whose termini are on separate scaffolds or that wrap around the origin of a circular replicon. This doubled GI yields, and the new split GIs matched the quality of single-scaffold GIs, except that highly fragmented GIs may lack central portions. Cross-scaffold search is an important upgrade to GI detection as fragmented genomes increasingly dominate public databases. TIGER2 better captures MAG microdiversity, recovering niche-defining GIs and supporting microbiome research aims such as virus-host linking and ecological assessment.
Collapse
Affiliation(s)
- Catherine M. Mageeney
- Systems Biology Department, Sandia National Laboratories, Livermore, CA, United States
| | - Gareth Trubl
- Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore, CA, United States
| | - Kelly P. Williams
- Systems Biology Department, Sandia National Laboratories, Livermore, CA, United States
- *Correspondence: Kelly P. Williams,
| |
Collapse
|
9
|
Deploying Viruses against Phytobacteria: Potential Use of Phage Cocktails as a Multifaceted Approach to Combat Resistant Bacterial Plant Pathogens. Viruses 2022; 14:v14020171. [PMID: 35215763 PMCID: PMC8879233 DOI: 10.3390/v14020171] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2021] [Revised: 01/13/2022] [Accepted: 01/14/2022] [Indexed: 02/05/2023] Open
Abstract
Plants in nature are under the persistent intimidation of severe microbial diseases, threatening a sustainable food production system. Plant-bacterial pathogens are a major concern in the contemporary era, resulting in reduced plant growth and productivity. Plant antibiotics and chemical-based bactericides have been extensively used to evade plant bacterial diseases. To counteract this pressure, bacteria have evolved an array of resistance mechanisms, including innate and adaptive immune systems. The emergence of resistant bacteria and detrimental consequences of antimicrobial compounds on the environment and human health, accentuates the development of an alternative disease evacuation strategy. The phage cocktail therapy is a multidimensional approach effectively employed for the biocontrol of diverse resistant bacterial infections without affecting the fauna and flora. Phages engage a diverse set of counter defense strategies to undermine wide-ranging anti-phage defense mechanisms of bacterial pathogens. Microbial ecology, evolution, and dynamics of the interactions between phage and plant-bacterial pathogens lead to the engineering of robust phage cocktail therapeutics for the mitigation of devastating phytobacterial diseases. In this review, we highlight the concrete and fundamental determinants in the development and application of phage cocktails and their underlying mechanism, combating resistant plant-bacterial pathogens. Additionally, we provide recent advances in the use of phage cocktail therapy against phytobacteria for the biocontrol of devastating plant diseases.
Collapse
|
10
|
Smyshlyaev G, Bateman A, Barabas O. Sequence analysis of tyrosine recombinases allows annotation of mobile genetic elements in prokaryotic genomes. Mol Syst Biol 2021; 17:e9880. [PMID: 34018328 PMCID: PMC8138268 DOI: 10.15252/msb.20209880] [Citation(s) in RCA: 31] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2020] [Revised: 04/18/2021] [Accepted: 04/20/2021] [Indexed: 11/16/2022] Open
Abstract
Mobile genetic elements (MGEs) sequester and mobilize antibiotic resistance genes across bacterial genomes. Efficient and reliable identification of such elements is necessary to follow resistance spreading. However, automated tools for MGE identification are missing. Tyrosine recombinase (YR) proteins drive MGE mobilization and could provide markers for MGE detection, but they constitute a diverse family also involved in housekeeping functions. Here, we conducted a comprehensive survey of YRs from bacterial, archaeal, and phage genomes and developed a sequence‐based classification system that dissects the characteristics of MGE‐borne YRs. We revealed that MGE‐related YRs evolved from non‐mobile YRs by acquisition of a regulatory arm‐binding domain that is essential for their mobility function. Based on these results, we further identified numerous unknown MGEs. This work provides a resource for comparative analysis and functional annotation of YRs and aids the development of computational tools for MGE annotation. Additionally, we reveal how YRs adapted to drive gene transfer across species and provide a tool to better characterize antibiotic resistance dissemination.
Collapse
Affiliation(s)
- Georgy Smyshlyaev
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, UK.,European Molecular Biology Laboratory (EMBL), Structural and Computational Biology Unit, Heidelberg, Germany.,Department of Molecular Biology, University of Geneva, Geneva, Switzerland
| | - Alex Bateman
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, UK
| | - Orsolya Barabas
- European Molecular Biology Laboratory (EMBL), Structural and Computational Biology Unit, Heidelberg, Germany.,Department of Molecular Biology, University of Geneva, Geneva, Switzerland
| |
Collapse
|
11
|
Abstract
The antibiotic resistance crisis has led to renewed interest in phage therapy as an alternative means of treating infection. However, conventional methods for isolating pathogen-specific phage are slow, labor-intensive, and frequently unsuccessful. We have demonstrated that computationally identified prophages carried by near-neighbor bacteria can serve as starting material for production of engineered phages that kill the target pathogen. Our approach and technology platform offer new opportunity for rapid development of phage therapies against most, if not all, bacterial pathogens, a foundational advance for use of phage in treating infectious disease. New therapies are necessary to combat increasingly antibiotic-resistant bacterial pathogens. We have developed a technology platform of computational, molecular biology, and microbiology tools which together enable on-demand production of phages that target virtually any given bacterial isolate. Two complementary computational tools that identify and precisely map prophages and other integrative genetic elements in bacterial genomes are used to identify prophage-laden bacteria that are close relatives of the target strain. Phage genomes are engineered to disable lysogeny, through use of long amplicon PCR and Gibson assembly. Finally, the engineered phage genomes are introduced into host bacteria for phage production. As an initial demonstration, we used this approach to produce a phage cocktail against the opportunistic pathogen Pseudomonas aeruginosa PAO1. Two prophage-laden P. aeruginosa strains closely related to PAO1 were identified, ATCC 39324 and ATCC 27853. Deep sequencing revealed that mitomycin C treatment of these strains induced seven phages that grow on P. aeruginosa PAO1. The most diverse five phages were engineered for nonlysogeny by deleting the integrase gene (int), which is readily identifiable and typically conveniently located at one end of the prophage. The Δint phages, individually and in cocktails, killed P. aeruginosa PAO1 in liquid culture as well as in a waxworm (Galleria mellonella) model of infection. IMPORTANCE The antibiotic resistance crisis has led to renewed interest in phage therapy as an alternative means of treating infection. However, conventional methods for isolating pathogen-specific phage are slow, labor-intensive, and frequently unsuccessful. We have demonstrated that computationally identified prophages carried by near-neighbor bacteria can serve as starting material for production of engineered phages that kill the target pathogen. Our approach and technology platform offer new opportunity for rapid development of phage therapies against most, if not all, bacterial pathogens, a foundational advance for use of phage in treating infectious disease.
Collapse
|
12
|
Genome Sequences of Burkholderia thailandensis Strains E421, E426, and DW503. Microbiol Resour Announc 2020; 9:9/21/e00312-20. [PMID: 32439666 PMCID: PMC7242668 DOI: 10.1128/mra.00312-20] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
We present the draft genome sequences of three Burkholderia thailandensis strains, E421, E426, and DW503. E421 consists of 90 contigs of 6,639,935 bp and 67.73% GC content. E426 consists of 106 contigs of 6,587,853 bp and 67.73% GC content. DW503 consists of 102 contigs of 6,458,767 bp and 67.64% GC content. We present the draft genome sequences of three Burkholderia thailandensis strains, E421, E426, and DW503. E421 consists of 90 contigs of 6,639,935 bp and 67.73% GC content. E426 consists of 106 contigs of 6,587,853 bp and 67.73% GC content. DW503 consists of 102 contigs of 6,458,767 bp and 67.64% GC content.
Collapse
|