1
|
Wang S, Jiang Y, Che L, Wang RH, Li SC. Enhancing insights into diseases through horizontal gene transfer event detection from gut microbiome. Nucleic Acids Res 2024:gkae515. [PMID: 38884260 DOI: 10.1093/nar/gkae515] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2023] [Revised: 04/23/2024] [Accepted: 06/04/2024] [Indexed: 06/18/2024] Open
Abstract
Horizontal gene transfer (HGT) phenomena pervade the gut microbiome and significantly impact human health. Yet, no current method can accurately identify complete HGT events, including the transferred sequence and the associated deletion and insertion breakpoints from shotgun metagenomic data. Here, we develop LocalHGT, which facilitates the reliable and swift detection of complete HGT events from shotgun metagenomic data, delivering an accuracy of 99.4%-verified by Nanopore data-across 200 gut microbiome samples, and achieving an average F1 score of 0.99 on 100 simulated data. LocalHGT enables a systematic characterization of HGT events within the human gut microbiome across 2098 samples, revealing that multiple recipient genome sites can become targets of a transferred sequence, microhomology is enriched in HGT breakpoint junctions (P-value = 3.3e-58), and HGTs can function as host-specific fingerprints indicated by the significantly higher HGT similarity of intra-personal temporal samples than inter-personal samples (P-value = 4.3e-303). Crucially, HGTs showed potential contributions to colorectal cancer (CRC) and acute diarrhoea, as evidenced by the enrichment of the butyrate metabolism pathway (P-value = 3.8e-17) and the shigellosis pathway (P-value = 5.9e-13) in the respective associated HGTs. Furthermore, differential HGTs demonstrated promise as biomarkers for predicting various diseases. Integrating HGTs into a CRC prediction model achieved an AUC of 0.87.
Collapse
Affiliation(s)
- Shuai Wang
- City University of Hong Kong Shenzhen Research Institute, Shenzhen, China
- Department of Computer Science, City University of Hong Kong, Kowloon, Hong Kong
| | - Yiqi Jiang
- City University of Hong Kong Shenzhen Research Institute, Shenzhen, China
- Department of Computer Science, City University of Hong Kong, Kowloon, Hong Kong
| | - Lijia Che
- City University of Hong Kong Shenzhen Research Institute, Shenzhen, China
- Department of Computer Science, City University of Hong Kong, Kowloon, Hong Kong
| | - Ruo Han Wang
- City University of Hong Kong Shenzhen Research Institute, Shenzhen, China
- Department of Computer Science, City University of Hong Kong, Kowloon, Hong Kong
| | - Shuai Cheng Li
- City University of Hong Kong Shenzhen Research Institute, Shenzhen, China
- Department of Computer Science, City University of Hong Kong, Kowloon, Hong Kong
| |
Collapse
|
2
|
van der Gulik PTS, Hoff WD, Speijer D. The contours of evolution: In defence of Darwin's tree of life paradigm. Bioessays 2024; 46:e2400012. [PMID: 38436469 DOI: 10.1002/bies.202400012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2024] [Revised: 02/12/2024] [Accepted: 02/15/2024] [Indexed: 03/05/2024]
Abstract
Both the concept of a Darwinian tree of life (TOL) and the possibility of its accurate reconstruction have been much criticized. Criticisms mostly revolve around the extensive occurrence of lateral gene transfer (LGT), instances of uptake of complete organisms to become organelles (with the associated subsequent gene transfer to the nucleus), as well as the implications of more subtle aspects of the biological species concept. Here we argue that none of these criticisms are sufficient to abandon the valuable TOL concept and the biological realities it captures. Especially important is the need to conceptually distinguish between organismal trees and gene trees, which necessitates incorporating insights of widely occurring LGT into modern evolutionary theory. We demonstrate that all criticisms, while based on important new findings, do not invalidate the TOL. After considering the implications of these new insights, we find that the contours of evolution are best represented by a TOL.
Collapse
Affiliation(s)
| | - Wouter D Hoff
- Department of Microbiology and Molecular Genetics, Oklahoma State University, Stillwater, Oklahoma, USA
| | - Dave Speijer
- Department of Medical Biochemistry, Amsterdam UMC, University of Amsterdam, Amsterdam, The Netherlands
| |
Collapse
|
3
|
Wu Y, Garushyants SK, van den Hurk A, Aparicio-Maldonado C, Kushwaha SK, King CM, Ou Y, Todeschini TC, Clokie MRJ, Millard AD, Gençay YE, Koonin EV, Nobrega FL. Bacterial defense systems exhibit synergistic anti-phage activity. Cell Host Microbe 2024; 32:557-572.e6. [PMID: 38402614 PMCID: PMC11009048 DOI: 10.1016/j.chom.2024.01.015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2023] [Revised: 01/23/2024] [Accepted: 01/30/2024] [Indexed: 02/27/2024]
Abstract
Bacterial defense against phage predation involves diverse defense systems acting individually and concurrently, yet their interactions remain poorly understood. We investigated >100 defense systems in 42,925 bacterial genomes and identified numerous instances of their non-random co-occurrence and negative association. For several pairs of defense systems significantly co-occurring in Escherichia coli strains, we demonstrate synergistic anti-phage activity. Notably, Zorya II synergizes with Druantia III and ietAS defense systems, while tmn exhibits synergy with co-occurring systems Gabija, Septu I, and PrrC. For Gabija, tmn co-opts the sensory switch ATPase domain, enhancing anti-phage activity. Some defense system pairs that are negatively associated in E. coli show synergy and significantly co-occur in other taxa, demonstrating that bacterial immune repertoires are largely shaped by selection for resistance against host-specific phages rather than negative epistasis. Collectively, these findings demonstrate compatibility and synergy between defense systems, allowing bacteria to adopt flexible strategies for phage defense.
Collapse
Affiliation(s)
- Yi Wu
- School of Biological Sciences, University of Southampton, Southampton SO17 1BJ, UK
| | - Sofya K Garushyants
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Anne van den Hurk
- School of Biological Sciences, University of Southampton, Southampton SO17 1BJ, UK
| | | | - Simran Krishnakant Kushwaha
- School of Biological Sciences, University of Southampton, Southampton SO17 1BJ, UK; Department of Biological Sciences, Birla Institute of Technology and Science (BITS), Pilani, Rajasthan, India
| | - Claire M King
- School of Biological Sciences, University of Southampton, Southampton SO17 1BJ, UK
| | - Yaqing Ou
- Wellcome Centre for Cell-Matrix Research, Faculty of Biology, Medicine and Health, University of Manchester, Manchester, UK
| | - Thomas C Todeschini
- School of Biological Sciences, University of Southampton, Southampton SO17 1BJ, UK
| | - Martha R J Clokie
- Department of Genetics and Genome Biology, University of Leicester, Leicester, UK
| | - Andrew D Millard
- Department of Genetics and Genome Biology, University of Leicester, Leicester, UK
| | | | - Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Franklin L Nobrega
- School of Biological Sciences, University of Southampton, Southampton SO17 1BJ, UK.
| |
Collapse
|
4
|
Miliotis G, Sengupta P, Hameed A, Chuvochina M, McDonagh F, Simpson AC, Parker CW, Singh NK, Rekha PD, Morris D, Raman K, Kyrpides NC, Hugenholtz P, Venkateswaran K. Novel spore-forming species exhibiting intrinsic resistance to third- and fourth-generation cephalosporins and description of Tigheibacillus jepli gen. nov., sp. nov. mBio 2024; 15:e0018124. [PMID: 38477597 PMCID: PMC11005411 DOI: 10.1128/mbio.00181-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2024] [Accepted: 01/29/2024] [Indexed: 03/14/2024] Open
Abstract
A comprehensive microbial surveillance was conducted at NASA's Mars 2020 spacecraft assembly facility (SAF), where whole-genome sequencing (WGS) of 110 bacterial strains was performed. One isolate, designated 179-BFC-A-HST, exhibited less than 80% average nucleotide identity (ANI) to known species, suggesting a novel organism. This strain demonstrated high-level resistance [minimum inhibitory concentration (MIC) >256 mg/L] to third-generation cephalosporins, including ceftazidime, cefpodoxime, combination ceftazidime/avibactam, and the fourth-generation cephalosporin cefepime. The results of a comparative genomic analysis revealed that 179-BFC-A-HST is most closely related to Virgibacillus halophilus 5B73CT, sharing an ANI of 78.7% and a digital DNA-DNA hybridization (dDDH) value of 23.5%, while their 16S rRNA gene sequences shared 97.7% nucleotide identity. Based on these results and the recent recognition that the genus Virgibacillus is polyphyletic, strain 179-BFC-A-HST is proposed as a novel species of a novel genus, Tigheibacillus jepli gen. nov., sp. nov (type strain 179-BFC-A-HST = DSM 115946T = NRRL B-65666T), and its closest neighbor, V. halophilus, is proposed to be reassigned to this genus as Tigheibacillus halophilus comb. nov. (type strain 5B73CT = DSM 21623T = JCM 21758T = KCTC 13935T). It was also necessary to reclassify its second closest neighbor Virgibacillus soli, as a member of a novel genus Paracerasibacillus, reflecting its phylogenetic position relative to the genus Cerasibacillus, for which we propose Paracerasibacillus soli comb. nov. (type strain CC-YMP-6T = DSM 22952T = CCM 7714T). Within Amphibacillaceae (n = 64), P. soli exhibited 11 antibiotic resistance genes (ARG), while T. jepli encoded for 3, lacking any known β-lactamases, suggesting resistance from variant penicillin-binding proteins, disrupting cephalosporin efficacy. P. soli was highly resistant to azithromycin (MIC >64 mg/L) yet susceptible to cephalosporins and penicillins. IMPORTANCE The significance of this research extends to understanding microbial survival and adaptation in oligotrophic environments, such as those found in SAF. Whole-genome sequencing of several strains isolated from Mars 2020 mission assembly cleanroom facilities, including the discovery of the novel species Tigheibacillus jepli, highlights the resilience and antimicrobial resistance (AMR) in clinically relevant antibiotic classes of microbes in nutrient-scarce settings. The study also redefines the taxonomic classifications within the Amphibacillaceae family, aligning genetic identities with phylogenetic data. Investigating ARG and virulence factors (VF) across these strains illuminates the microbial capability for resistance under resource-limited conditions while emphasizing the role of human-associated VF in microbial survival, informing sterilization practices and microbial management in similar oligotrophic settings beyond spacecraft assembly cleanrooms such as pharmaceutical and medical industry cleanrooms.
Collapse
Affiliation(s)
- Georgios Miliotis
- Antimicrobial Resistance and Microbial Ecology Group, School of Medicine, University of Galway, Galway, Ireland
- Centre for One Health, Ryan Institute, University of Galway, Galway, Ireland
| | - Pratyay Sengupta
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai, Tamil Nadu, India
- Center for Integrative Biology and Systems mEdicine (IBSE), Indian Institute of Technology Madras, Chennai, Tamil Nadu, India
- Robert Bosch Centre for Data Science and Artificial Intelligence (RBCDSAI), Indian Institute of Technology Madras, Chennai, Tamil Nadu, India
| | - Asif Hameed
- Division of Microbiology and Biotechnology, Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore, Karnataka, India
| | - Maria Chuvochina
- The University of Queensland, School of Chemistry and Molecular Biosciences, Australian Centre for Ecogenomics, Brisbane, Australia
| | - Francesca McDonagh
- Antimicrobial Resistance and Microbial Ecology Group, School of Medicine, University of Galway, Galway, Ireland
| | - Anna C. Simpson
- Biotechnology and Planetary Protection Group, Jet Propulsion Laboratory, California Institute of Technology, Pasadena, California, USA
| | - Ceth W. Parker
- Biotechnology and Planetary Protection Group, Jet Propulsion Laboratory, California Institute of Technology, Pasadena, California, USA
| | - Nitin K. Singh
- Biotechnology and Planetary Protection Group, Jet Propulsion Laboratory, California Institute of Technology, Pasadena, California, USA
| | - Punchappady D. Rekha
- Division of Microbiology and Biotechnology, Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore, Karnataka, India
| | - Dearbháile Morris
- Antimicrobial Resistance and Microbial Ecology Group, School of Medicine, University of Galway, Galway, Ireland
- Centre for One Health, Ryan Institute, University of Galway, Galway, Ireland
| | - Karthik Raman
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai, Tamil Nadu, India
- Center for Integrative Biology and Systems mEdicine (IBSE), Indian Institute of Technology Madras, Chennai, Tamil Nadu, India
- Robert Bosch Centre for Data Science and Artificial Intelligence (RBCDSAI), Indian Institute of Technology Madras, Chennai, Tamil Nadu, India
| | - Nikos C. Kyrpides
- US Department of Energy Joint Genome Institute, Berkeley, California, USA
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Philip Hugenholtz
- The University of Queensland, School of Chemistry and Molecular Biosciences, Australian Centre for Ecogenomics, Brisbane, Australia
| | - Kasthuri Venkateswaran
- Biotechnology and Planetary Protection Group, Jet Propulsion Laboratory, California Institute of Technology, Pasadena, California, USA
| |
Collapse
|
5
|
Teichman S, Lee MD, Willis AD. Analyzing microbial evolution through gene and genome phylogenies. Biostatistics 2023:kxad025. [PMID: 37897441 DOI: 10.1093/biostatistics/kxad025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2023] [Revised: 08/15/2023] [Accepted: 08/27/2023] [Indexed: 10/30/2023] Open
Abstract
Microbiome scientists critically need modern tools to explore and analyze microbial evolution. Often this involves studying the evolution of microbial genomes as a whole. However, different genes in a single genome can be subject to different evolutionary pressures, which can result in distinct gene-level evolutionary histories. To address this challenge, we propose to treat estimated gene-level phylogenies as data objects, and present an interactive method for the analysis of a collection of gene phylogenies. We use a local linear approximation of phylogenetic tree space to visualize estimated gene trees as points in low-dimensional Euclidean space, and address important practical limitations of existing related approaches, allowing an intuitive visualization of complex data objects. We demonstrate the utility of our proposed approach through microbial data analyses, including by identifying outlying gene histories in strains of Prevotella, and by contrasting Streptococcus phylogenies estimated using different gene sets. Our method is available as an open-source R package, and assists with estimating, visualizing, and interacting with a collection of bacterial gene phylogenies.
Collapse
Affiliation(s)
- Sarah Teichman
- University of Washington Department of Statistics, Box 354322, Seattle, WA 98195-4322, USA
| | - Michael D Lee
- KBR NASA Ames Research Center, PO Box 1, Moffett Field, CA 94035-1000
- Blue Marble Space Institute of Science, 600 1st Avenue, 1st Floor, Seattle, WA 98104, USA
| | - Amy D Willis
- University of Washington Department of Biostatistics, Hans Rosling Center for Population Health, Box 351617, Seattle, WA 98195-1617, USA
| |
Collapse
|
6
|
Teichman S, Lee MD, Willis AD. Analyzing microbial evolution through gene and genome phylogenies. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.08.15.553440. [PMID: 37645842 PMCID: PMC10462103 DOI: 10.1101/2023.08.15.553440] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/31/2023]
Abstract
Microbiome scientists critically need modern tools to explore and analyze microbial evolution. Often this involves studying the evolution of microbial genomes as a whole. However, different genes in a single genome can be subject to different evolutionary pressures, which can result in distinct gene-level evolutionary histories. To address this challenge, we propose to treat estimated gene-level phylogenies as data objects, and present an interactive method for the analysis of a collection of gene phylogenies. We use a local linear approximation of phylogenetic tree space to visualize estimated gene trees as points in low-dimensional Euclidean space, and address important practical limitations of existing related approaches, allowing an intuitive visualization of complex data objects. We demonstrate the utility of our proposed approach through microbial data analyses, including by identifying outlying gene histories in strains of Prevotella, and by contrasting Streptococcus phylogenies estimated using different gene sets. Our method is available as an open-source R package, and assists with estimating, visualizing and interacting with a collection of bacterial gene phylogenies. dimension reduction, microbiome, non-Euclidean, statistical genetics, visualization.
Collapse
Affiliation(s)
| | - Michael D Lee
- NASA Ames Research Center and Blue Marble Space Institute of Science
| | - Amy D Willis
- Department of Biostatistics, University of Washington
| |
Collapse
|
7
|
van der Gulik PTS, Hoff WD, Speijer D. Renewing Linnaean taxonomy: a proposal to restructure the highest levels of the Natural System. Biol Rev Camb Philos Soc 2023; 98:584-602. [PMID: 36366773 DOI: 10.1111/brv.12920] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2022] [Revised: 10/31/2022] [Accepted: 11/02/2022] [Indexed: 11/13/2022]
Abstract
During the last century enormous progress has been made in the understanding of biological diversity, involving a dramatic shift from macroscopic to microscopic organisms. The question now arises as to whether the Natural System introduced by Carl Linnaeus, which has served as the central system for organizing biological diversity, can accommodate the great expansion of diversity that has been discovered. Important discoveries regarding biological diversity have not been fully integrated into a formal, coherent taxonomic system. In addition, because of taxonomic challenges and conflicts, various proposals have been made to abandon key aspects of the Linnaean system. We review the current status of taxonomy of the living world, focussing on groups at the taxonomic level of phylum and above. We summarize the main arguments against and in favour of abandoning aspects of the Linnaean system. Based on these considerations, we conclude that retaining the Linnaean Natural System provides important advantages. We propose a relatively small number of amendments for extending this system, particularly to include the named rank of world (Latin alternative mundis) formally to include non-cellular entities (viruses), and the named rank of empire (Latin alternative imperium) to accommodate the depth of diversity in (unicellular) eukaryotes that has been uncovered. We argue that in the case of both the eukaryotic domain and the viruses the cladistic approach intrinsically fails. However, the resulting semi-cladistic system provides a productive way forward that can help resolve taxonomic challenges. The amendments proposed allow us to: (i) retain named taxonomic levels and the three-domain system, (ii) improve understanding of the main eukaryotic lineages, and (iii) incorporate viruses into the Natural System. Of note, the proposal described herein is intended to serve as the starting point for a broad scientific discussion regarding the modernization of the Linnaean system.
Collapse
Affiliation(s)
| | - Wouter D Hoff
- Department of Microbiology and Molecular Genetics and Department of Chemistry, Oklahoma State University, Stillwater, OK, 74078, USA
| | - David Speijer
- Department of Medical Biochemistry, AmsterdamUMC, University of Amsterdam, Meibergdreef 15, 1105 AZ, Amsterdam, The Netherlands
| |
Collapse
|
8
|
Downing T, Rahm A. Bacterial plasmid-associated and chromosomal proteins have fundamentally different properties in protein interaction networks. Sci Rep 2022; 12:19203. [PMID: 36357451 PMCID: PMC9649638 DOI: 10.1038/s41598-022-20809-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2022] [Accepted: 09/19/2022] [Indexed: 11/12/2022] Open
Abstract
Plasmids facilitate horizontal gene transfer, which enables the diversification of pathogens into new anatomical and environmental niches, implying that plasmid-encoded genes can cooperate well with chromosomal genes. We hypothesise that such mobile genes are functionally different to chromosomal ones due to this ability to encode proteins performing non-essential functions like antimicrobial resistance and traverse distinct host cells. The effect of plasmid-driven gene gain on protein-protein interaction network topology is an important question in this area. Moreover, the extent to which these chromosomally- and plasmid-encoded proteins interact with proteins from their own groups compared to the levels with the other group remains unclear. Here, we examined the incidence and protein-protein interactions of all known plasmid-encoded proteins across representative specimens from most bacteria using all available plasmids. We found that plasmid-encoded genes constitute ~ 0.65% of the total number of genes per bacterial sample, and that plasmid genes are preferentially associated with different species but had limited taxonomical power beyond this. Surprisingly, plasmid-encoded proteins had both more protein-protein interactions compared to chromosomal proteins, countering the hypothesis that genes with higher mobility rates should have fewer protein-level interactions. Nonetheless, topological analysis and investigation of the protein-protein interaction networks' connectivity and change in the number of independent components demonstrated that the plasmid-encoded proteins had limited overall impact in > 96% of samples. This paper assembled extensive data on plasmid-encoded proteins, their interactions and associations with diverse bacterial specimens that is available for the community to investigate in more detail.
Collapse
Affiliation(s)
- Tim Downing
- grid.15596.3e0000000102380260School of Biotechnology, Dublin City University, Dublin, Ireland ,grid.63622.330000 0004 0388 7540Present Address: The Pirbright Institute, Pirbright, UK
| | - Alexander Rahm
- grid.449688.f0000 0004 0647 1487GAATI Lab, University of French Polynesia, Tahiti, French Polynesia
| |
Collapse
|
9
|
Cao S, Brandis G, Huseby DL, Hughes D. Positive selection during niche adaptation results in large-scale and irreversible rearrangement of chromosomal gene order in bacteria. Mol Biol Evol 2022; 39:6554941. [PMID: 35348727 PMCID: PMC9016547 DOI: 10.1093/molbev/msac069] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Analysis of bacterial genomes shows that, whereas diverse species share many genes in common, their linear order on the chromosome is often not conserved. Whereas rearrangements in gene order could occur by genetic drift, an alternative hypothesis is rearrangement driven by positive selection during niche adaptation (SNAP). Here, we provide the first experimental support for the SNAP hypothesis. We evolved Salmonella to adapt to growth on malate as the sole carbon source and followed the evolutionary trajectories. The initial adaptation to growth in the new environment involved the duplication of 1.66 Mb, corresponding to one-third of the Salmonella chromosome. This duplication is selected to increase the copy number of a single gene, dctA, involved in the uptake of malate. Continuing selection led to the rapid loss or mutation of duplicate genes from either copy of the duplicated region. After 2000 generations, only 31% of the originally duplicated genes remained intact and the gene order within the Salmonella chromosome has been significantly and irreversibly altered. These results experientially validate predictions made by the SNAP hypothesis and show that SNAP can be a strong driving force for rearrangements in chromosomal gene order.
Collapse
Affiliation(s)
- Sha Cao
- Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala, Sweden.,These authors contributed equally: Sha Cao, Gerrit Brandis
| | - Gerrit Brandis
- Department of Cell and Molecular Biology, Uppsala University, Uppsala, Sweden.,These authors contributed equally: Sha Cao, Gerrit Brandis
| | - Douglas L Huseby
- Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala, Sweden
| | - Diarmaid Hughes
- Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala, Sweden
| |
Collapse
|
10
|
Pathogenicity and Its Implications in Taxonomy: The Brucella and Ochrobactrum Case. Pathogens 2022; 11:pathogens11030377. [PMID: 35335701 PMCID: PMC8954888 DOI: 10.3390/pathogens11030377] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2022] [Revised: 03/09/2022] [Accepted: 03/16/2022] [Indexed: 11/21/2022] Open
Abstract
The intracellular pathogens of the genus Brucella are phylogenetically close to Ochrobactrum, a diverse group of free-living bacteria with a few species occasionally infecting medically compromised patients. A group of taxonomists recently included all Ochrobactrum organisms in the genus Brucella based on global genome analyses and alleged equivalences with genera such as Mycobacterium. Here, we demonstrate that such equivalencies are incorrect because they overlook the complexities of pathogenicity. By summarizing Brucella and Ochrobactrum divergences in lifestyle, structure, physiology, population, closed versus open pangenomes, genomic traits, and pathogenicity, we show that when they are adequately understood, they are highly relevant in taxonomy and not unidimensional quantitative characters. Thus, the Ochrobactrum and Brucella differences are not limited to their assignments to different “risk-groups”, a biologically (and hence, taxonomically) oversimplified description that, moreover, does not support ignoring the nomen periculosum rule, as proposed. Since the epidemiology, prophylaxis, diagnosis, and treatment are thoroughly unrelated, merging free-living Ochrobactrum organisms with highly pathogenic Brucella organisms brings evident risks for veterinarians, medical doctors, and public health authorities who confront brucellosis, a significant zoonosis worldwide. Therefore, from taxonomical and practical standpoints, the Brucella and Ochrobactrum genera must be maintained apart. Consequently, we urge researchers, culture collections, and databases to keep their canonical nomenclature.
Collapse
|
11
|
Chin AF, Wrabl JO, Hilser VJ. A thermodynamic atlas of proteomes reveals energetic innovation across the tree of life. Mol Biol Evol 2022; 39:6509521. [PMID: 35038744 PMCID: PMC8896757 DOI: 10.1093/molbev/msac010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Protein stability is a fundamental molecular property enabling organisms to adapt to their biological niches. How this is facilitated and whether there are kingdom specific or more general universal strategies is not known. A principal obstacle to addressing this issue is that the vast majority of proteins lack annotation, specifically thermodynamic annotation, beyond the amino acid and chromosome information derived from genome sequencing. To address this gap and facilitate future investigation into large-scale patterns of protein stability and dynamics within and between organisms, we applied a unique ensemble-based thermodynamic characterization of protein folds to a substantial portion of extant sequenced genomes. Using this approach, we compiled a database resource focused on the position-specific variation in protein stability. Interrogation of the database reveals; 1) domains of life exhibit distinguishing thermodynamic features, with eukaryotes particularly different from both archaea and bacteria, 2) the optimal growth temperature of an organism is proportional to the average apolar enthalpy of its proteome, 3) intrinsic disorder content is also proportional to the apolar enthalpy (but unexpectedly not the predicted stability at 25 °C), and 4) secondary structure and global stability information of individual proteins is extractable. We hypothesize that wider access to residue-specific thermodynamic information of proteomes will result in deeper understanding of mechanisms driving functional adaptation and protein evolution. Our database is free for download at https://afc-science.github.io/thermo-env-atlas/.
Collapse
Affiliation(s)
- Alexander F Chin
- Department of Biology, Johns Hopkins University, 3400 North Charles Street, Baltimore, MD, 21218, USA
| | - James O Wrabl
- Department of Biology, Johns Hopkins University, 3400 North Charles Street, Baltimore, MD, 21218, USA
| | - Vincent J Hilser
- Department of Biology, Johns Hopkins University, 3400 North Charles Street, Baltimore, MD, 21218, USA.,T.C. Jenkins Department of Biophysics, Johns Hopkins University, 3400 North Charles Street, Baltimore, MD, 21218, USA
| |
Collapse
|
12
|
Borodovich T, Shkoporov AN, Ross RP, Hill C. OUP accepted manuscript. Gastroenterol Rep (Oxf) 2022; 10:goac012. [PMID: 35425613 PMCID: PMC9006064 DOI: 10.1093/gastro/goac012] [Citation(s) in RCA: 46] [Impact Index Per Article: 23.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/25/2021] [Revised: 02/08/2022] [Accepted: 03/04/2022] [Indexed: 11/26/2022] Open
Abstract
Horizontal gene transfer (HGT) in the microbiome has profound consequences for human health and disease. The spread of antibiotic resistance genes, virulence, and pathogenicity determinants predominantly occurs by way of HGT. Evidence exists of extensive horizontal transfer in the human gut microbiome. Phage transduction is a type of HGT event in which a bacteriophage transfers non-viral DNA from one bacterial host cell to another. The abundance of tailed bacteriophages in the human gut suggests that transduction could act as a significant mode of HGT in the gut microbiome. Here we review in detail the known mechanisms of phage-mediated HGT, namely specialized and generalized transduction, lateral transduction, gene-transfer agents, and molecular piracy, as well as methods used to detect phage-mediated HGT, and discuss its potential implications for the human gut microbiome.
Collapse
Affiliation(s)
- Tatiana Borodovich
- APC Microbiome Ireland, University College Cork, Cork, Ireland
- Corresponding author. APC Microbiome Ireland, Biosciences Institute, University College Cork, Room 3.63, College Road, Cork, T12 YT20, Ireland.
| | - Andrey N Shkoporov
- APC Microbiome Ireland, University College Cork, Cork, Ireland
- School of Microbiology, University College Cork, Cork, Ireland
| | - R Paul Ross
- APC Microbiome Ireland, University College Cork, Cork, Ireland
- School of Microbiology, University College Cork, Cork, Ireland
| | - Colin Hill
- APC Microbiome Ireland, University College Cork, Cork, Ireland
- School of Microbiology, University College Cork, Cork, Ireland
| |
Collapse
|
13
|
Bansal MS. Deciphering Microbial Gene Family Evolution Using Duplication-Transfer-Loss Reconciliation and RANGER-DTL. Methods Mol Biol 2022; 2569:233-252. [PMID: 36083451 DOI: 10.1007/978-1-0716-2691-7_11] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
Phylogenetic reconciliation has emerged as a principled, highly effective technique for investigating the origin, spread, and evolutionary history of microbial gene families. Proper application of phylogenetic reconciliation requires a clear understanding of potential pitfalls and sources of error, and knowledge of the most effective reconciliation-based tools and protocols to use to maximize accuracy. In this book chapter, we provide a brief overview of Duplication-Transfer-Loss (DTL) reconciliation, the standard reconciliation model used to study microbial gene families and provide a step-by-step computational protocol to maximize the accuracy of DTL reconciliation and minimize false-positive evolutionary inferences.
Collapse
Affiliation(s)
- Mukul S Bansal
- Department of Computer Science & Engineering, University of Connecticut, Storrs, CT, USA.
| |
Collapse
|
14
|
Takenaka S, Kawashima T, Arita M. A sugar utilization phenotype contributes to the formation of genetic exchange communities in lactic acid bacteria. FEMS Microbiol Lett 2021; 368:6360976. [PMID: 34468734 PMCID: PMC8440127 DOI: 10.1093/femsle/fnab117] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2021] [Accepted: 08/30/2021] [Indexed: 11/13/2022] Open
Abstract
In prokaryotes, a major contributor to genomic evolution is the exchange of genes via horizontal gene transfer (HGT). Areas with a high density of HGT networks are defined as genetic exchange communities (GECs). Although some phenotypes associated with specific ecological niches are linked to GECs, little is known about the phenotypic influences on HGT in bacterial groups within a taxonomic family. Thanks to the published genome sequences and phenotype data of lactic acid bacteria (LAB), it is now possible to obtain more detailed information about the phenotypes that affect GECs. Here, we have investigated the relationship between HGT and internal and external environmental factors for 178 strains from 24 genera in the Lactobacillaceae family. We found a significant correlation between strains with high utilization of sugars and HGT bias. The result suggests that the phenotype of the utilization of a variety of sugars is key to the construction of GECs in this family. This feature is consistent with the fact that the Lactobacillaceae family contributes to the production of a wide variety of fermented foods by sharing niches such as those in vegetables, dairy products and brewing-related environments. This result provides the first evidence that phenotypes associated with ecological niches contribute to form GECs in the LAB family.
Collapse
Affiliation(s)
- Shinkuro Takenaka
- Department of Genetics, The Graduate University for Advanced Studies, SOKENDAI, Mishima, Shizuoka 411-8540, Japan
| | - Takeshi Kawashima
- Department of Genetics, The Graduate University for Advanced Studies, SOKENDAI, Mishima, Shizuoka 411-8540, Japan.,National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan
| | - Masanori Arita
- Department of Genetics, The Graduate University for Advanced Studies, SOKENDAI, Mishima, Shizuoka 411-8540, Japan.,National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan
| |
Collapse
|
15
|
Coleman GA, Davín AA, Mahendrarajah TA, Szánthó LL, Spang A, Hugenholtz P, Szöllősi GJ, Williams TA. A rooted phylogeny resolves early bacterial evolution. Science 2021; 372:372/6542/eabe0511. [PMID: 33958449 DOI: 10.1126/science.abe0511] [Citation(s) in RCA: 89] [Impact Index Per Article: 29.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2020] [Revised: 11/05/2020] [Accepted: 04/01/2021] [Indexed: 12/17/2022]
Abstract
A rooted bacterial tree is necessary to understand early evolution, but the position of the root is contested. Here, we model the evolution of 11,272 gene families to identify the root, extent of horizontal gene transfer (HGT), and the nature of the last bacterial common ancestor (LBCA). Our analyses root the tree between the major clades Terrabacteria and Gracilicutes and suggest that LBCA was a free-living flagellated, rod-shaped double-membraned organism. Contrary to recent proposals, our analyses reject a basal placement of the Candidate Phyla Radiation, which instead branches sister to Chloroflexota within Terrabacteria. While most gene families (92%) have evidence of HGT, overall, two-thirds of gene transmissions have been vertical, suggesting that a rooted tree provides a meaningful frame of reference for interpreting bacterial evolution.
Collapse
Affiliation(s)
- Gareth A Coleman
- School of Biological Sciences, University of Bristol, Bristol BS8 1TQ, UK
| | - Adrián A Davín
- Australian Centre for Ecogenomics, School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Queensland 4072, Australia
| | - Tara A Mahendrarajah
- Department of Marine Microbiology and Biogeochemistry, NIOZ, Royal Netherlands Institute for Sea Research, 1790 AB Den Burg, Netherlands
| | - Lénárd L Szánthó
- Department of Biological Physics, Eötvös Loránd University, 1117 Budapest, Hungary.,MTA-ELTE "Lendület" Evolutionary Genomics Research Group, 1117 Budapest, Hungary
| | - Anja Spang
- Department of Marine Microbiology and Biogeochemistry, NIOZ, Royal Netherlands Institute for Sea Research, 1790 AB Den Burg, Netherlands.,Department of Cell- and Molecular Biology, Uppsala University, SE-75123 Uppsala, Sweden
| | - Philip Hugenholtz
- Australian Centre for Ecogenomics, School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Queensland 4072, Australia.
| | - Gergely J Szöllősi
- Department of Biological Physics, Eötvös Loránd University, 1117 Budapest, Hungary. .,MTA-ELTE "Lendület" Evolutionary Genomics Research Group, 1117 Budapest, Hungary.,Institute of Evolution, Centre for Ecological Research, 1121 Budapest, Hungary
| | - Tom A Williams
- School of Biological Sciences, University of Bristol, Bristol BS8 1TQ, UK.
| |
Collapse
|
16
|
Berkemer SJ, McGlynn SE. A New Analysis of Archaea-Bacteria Domain Separation: Variable Phylogenetic Distance and the Tempo of Early Evolution. Mol Biol Evol 2021; 37:2332-2340. [PMID: 32316034 PMCID: PMC7403611 DOI: 10.1093/molbev/msaa089] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
Comparative genomics and molecular phylogenetics are foundational for understanding biological evolution. Although many studies have been made with the aim of understanding the genomic contents of early life, uncertainty remains. A study by Weiss et al. (Weiss MC, Sousa FL, Mrnjavac N, Neukirchen S, Roettger M, Nelson-Sathi S, Martin WF. 2016. The physiology and habitat of the last universal common ancestor. Nat Microbiol. 1(9):16116.) identified a number of protein families in the last universal common ancestor of archaea and bacteria (LUCA) which were not found in previous works. Here, we report new research that suggests the clustering approaches used in this previous study undersampled protein families, resulting in incomplete phylogenetic trees which do not reflect protein family evolution. Phylogenetic analysis of protein families which include more sequence homologs rejects a simple LUCA hypothesis based on phylogenetic separation of the bacterial and archaeal domains for a majority of the previously identified LUCA proteins (∼82%). To supplement limitations of phylogenetic inference derived from incompletely populated orthologous groups and to test the hypothesis of a period of rapid evolution preceding the separation of the domains, we compared phylogenetic distances both within and between domains, for thousands of orthologous groups. We find a substantial diversity of interdomain versus intradomain branch lengths, even among protein families which exhibit a single domain separating branch and are thought to be associated with the LUCA. Additionally, phylogenetic trees with long interdomain branches relative to intradomain branches are enriched in information categories of protein families in comparison to those associated with metabolic functions. These results provide a new view of protein family evolution and temper claims about the phenotype and habitat of the LUCA.
Collapse
Affiliation(s)
- Sarah J Berkemer
- Max Planck Institute for Mathematics in the Sciences, Leipzig, Germany.,Bioinformatics Group, Department of Computer Science, University Leipzig, Leipzig, Germany.,Competence Center for Scalable Data Services and Solutions, Dresden/Leipzig, Germany
| | - Shawn E McGlynn
- Earth-Life Science Institute, Tokyo Institute of Technology, Meguro, Tokyo, Japan.,Blue Marble Space Institute of Science, Seattle, WA.,RIKEN Center for Sustainable Resource Science (CSRS), Saitama, Japan
| |
Collapse
|
17
|
Abstract
The advent of comparative genomics in the late 1990s led to the discovery of extensive lateral gene transfer in prokaryotes. The resulting debate over whether life as a whole is best represented as a tree or a network has since given way to a general consensus in which trees and networks co-exist rather than stand in opposition. Embracing this consensus allows us to move beyond the question of which is true or false. The future of the tree of life debate lies in asking what trees and networks can, and should, do for science.
Collapse
Affiliation(s)
- Cédric Blais
- Centre for Comparative Genomics and Evolutionary Bioinformatics, Dalhousie University, Halifax, NS, Canada; Department of Biochemistry and Molecular Biology, Dalhousie University, Halifax, NS, Canada.
| | - John M Archibald
- Centre for Comparative Genomics and Evolutionary Bioinformatics, Dalhousie University, Halifax, NS, Canada; Department of Biochemistry and Molecular Biology, Dalhousie University, Halifax, NS, Canada.
| |
Collapse
|
18
|
França A, Gaio V, Lopes N, Melo LDR. Virulence Factors in Coagulase-Negative Staphylococci. Pathogens 2021; 10:170. [PMID: 33557202 PMCID: PMC7913919 DOI: 10.3390/pathogens10020170] [Citation(s) in RCA: 59] [Impact Index Per Article: 19.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2020] [Revised: 01/29/2021] [Accepted: 01/29/2021] [Indexed: 12/13/2022] Open
Abstract
Coagulase-negative staphylococci (CoNS) have emerged as major pathogens in healthcare-associated facilities, being S. epidermidis, S. haemolyticus and, more recently, S. lugdunensis, the most clinically relevant species. Despite being less virulent than the well-studied pathogen S. aureus, the number of CoNS strains sequenced is constantly increasing and, with that, the number of virulence factors identified in those strains. In this regard, biofilm formation is considered the most important. Besides virulence factors, the presence of several antibiotic-resistance genes identified in CoNS is worrisome and makes treatment very challenging. In this review, we analyzed the different aspects involved in CoNS virulence and their impact on health and food.
Collapse
Affiliation(s)
- Angela França
- Laboratory of Research in Biofilms Rosário Oliveira, Centre of Biological Engineering, University of Minho, 4710-057 Braga, Portugal; (V.G.); (N.L.)
| | | | | | - Luís D. R. Melo
- Laboratory of Research in Biofilms Rosário Oliveira, Centre of Biological Engineering, University of Minho, 4710-057 Braga, Portugal; (V.G.); (N.L.)
| |
Collapse
|
19
|
Koonin EV, Makarova KS, Wolf YI. Evolution of Microbial Genomics: Conceptual Shifts over a Quarter Century. Trends Microbiol 2021; 29:582-592. [PMID: 33541841 DOI: 10.1016/j.tim.2021.01.005] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2020] [Revised: 01/07/2021] [Accepted: 01/08/2021] [Indexed: 12/20/2022]
Abstract
Prokaryote genomics started in earnest in 1995, with the complete sequences of two small bacterial genomes, those of Haemophilus influenzae and Mycoplasma genitalium. During the next quarter century, the prokaryote genome database has been growing exponentially, with no saturation in sight. For most of these 25 years, genome sequencing remained limited to cultivable microbes. Together with next-generation sequencing methods, advances in metagenomics and single-cell genomics have lifted this limitation, providing for an increasingly unbiased characterization of the global prokaryote diversity. Advances in computational genomics followed the progress of genome sequencing, even if occasionally lagging behind. Several major new branches of bacteria and archaea were discovered, including Asgard archaea, the apparent closest relatives of eukaryotes and expansive groups of bacteria and archaea with small genomes thought to be symbionts of other prokaryotes. Comparative analysis of numerous prokaryote genomes spanning a wide range of evolutionary distances changed the conceptual foundations of microbiology, supplanting the notion of species genomes with fixed gene sets with that of dynamic pangenomes and the notion of a single Tree of Life (ToL) with a statistical tree-like trend among individual gene trees. Strides were also made towards a theory and quantitative laws of prokaryote genome evolution.
Collapse
Affiliation(s)
- Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, MD 20894, USA.
| | - Kira S Makarova
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, MD 20894, USA
| | - Yuri I Wolf
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, MD 20894, USA
| |
Collapse
|
20
|
Tovo A, Menzel P, Krogh A, Cosentino Lagomarsino M, Suweis S. Taxonomic classification method for metagenomics based on core protein families with Core-Kaiju. Nucleic Acids Res 2020; 48:e93. [PMID: 32633756 PMCID: PMC7498351 DOI: 10.1093/nar/gkaa568] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2020] [Revised: 06/12/2020] [Accepted: 06/24/2020] [Indexed: 12/19/2022] Open
Abstract
Characterizing species diversity and composition of bacteria hosted by biota is revolutionizing our understanding of the role of symbiotic interactions in ecosystems. Determining microbiomes diversity implies the assignment of individual reads to taxa by comparison to reference databases. Although computational methods aimed at identifying the microbe(s) taxa are available, it is well known that inferences using different methods can vary widely depending on various biases. In this study, we first apply and compare different bioinformatics methods based on 16S ribosomal RNA gene and shotgun sequencing to three mock communities of bacteria, of which the compositions are known. We show that none of these methods can infer both the true number of taxa and their abundances. We thus propose a novel approach, named Core-Kaiju, which combines the power of shotgun metagenomics data with a more focused marker gene classification method similar to 16S, but based on emergent statistics of core protein domain families. We thus test the proposed method on various mock communities and we show that Core-Kaiju reliably predicts both number of taxa and abundances. Finally, we apply our method on human gut samples, showing how Core-Kaiju may give more accurate ecological characterization and a fresh view on real microbiomes.
Collapse
Affiliation(s)
- Anna Tovo
- Physics and Astronomy Department, LIPh Lab, University of Padova, Via Marzolo 8, 35131 Padova, Italy.,Mathematics Department, University of Padova, via Trieste 63, 35121 Padova, Italy
| | - Peter Menzel
- Labor Berlin Charité Vivantes GmbH, Sylter Str. 2, 13353 Berlin, Germany
| | - Anders Krogh
- Department of Computer Science, University of Copenhagen, Universitetsparken 1, DK-2100 Copenhagen, Denmark
| | - Marco Cosentino Lagomarsino
- IFOM, FIRC Institute of Molecular Oncology, Via Adamello 16, 20143 Milan, Italy.,Physics Department, University of Milan, and I.N.F.N., Via Celoria 16, 20133 Milan, Italy
| | - Samir Suweis
- Physics and Astronomy Department, LIPh Lab, University of Padova, Via Marzolo 8, 35131 Padova, Italy.,Padova Neuroscience Center, University of Padova, Via Orus 2/B, 35131 Padova, Italy
| |
Collapse
|
21
|
Avni E, Snir S. A New Phylogenomic Approach For Quantifying Horizontal Gene Transfer Trends in Prokaryotes. Sci Rep 2020; 10:12425. [PMID: 32709941 PMCID: PMC7381616 DOI: 10.1038/s41598-020-62446-5] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2018] [Accepted: 01/27/2020] [Indexed: 11/09/2022] Open
Abstract
It is well established nowadays that among prokaryotes, various families of orthologous genes exhibit conflicting evolutionary history. A prime factor for this conflict is horizontal gene transfer (HGT) - the transfer of genetic material not via vertical descent. Thus, the prevalence of HGT is challenging the meaningfulness of the classical Tree of Life concept. Here we present a comprehensive study of HGT representing the entire prokaryotic world. We mainly rely on a novel analytic approach for analyzing an aggregate of gene histories, by means of the quartet plurality distribution (QPD) that we develop. Through the analysis of real and simulated data, QPD is used to reveal evidence of a barrier against HGT, separating the archaea from the bacteria and making HGT between the two domains, in general, quite rare. In contrast, bacteria's confined HGT is substantially more frequent than archaea's. Our approach also reveals that despite intensive HGT, a strong tree-like signal can be extracted, corroborating several previous works. Thus, QPD, which enables one to analytically combine information from an aggregate of gene trees, can be used for understanding patterns and rates of HGT in prokaryotes, as well as for validating or refuting models of horizontal genetic transfers and evolution in general.
Collapse
Affiliation(s)
- Eliran Avni
- Department of Evolutionary Biology, University of Haifa, Haifa, 31905, Israel.
| | - Sagi Snir
- Department of Evolutionary Biology, University of Haifa, Haifa, 31905, Israel.
| |
Collapse
|
22
|
Redondo-Salvo S, Fernández-López R, Ruiz R, Vielva L, de Toro M, Rocha EPC, Garcillán-Barcia MP, de la Cruz F. Pathways for horizontal gene transfer in bacteria revealed by a global map of their plasmids. Nat Commun 2020; 11:3602. [PMID: 32681114 PMCID: PMC7367871 DOI: 10.1038/s41467-020-17278-2] [Citation(s) in RCA: 148] [Impact Index Per Article: 37.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2019] [Accepted: 06/19/2020] [Indexed: 01/04/2023] Open
Abstract
Plasmids can mediate horizontal gene transfer of antibiotic resistance, virulence genes, and other adaptive factors across bacterial populations. Here, we analyze genomic composition and pairwise sequence identity for over 10,000 reference plasmids to obtain a global map of the prokaryotic plasmidome. Plasmids in this map organize into discrete clusters, which we call plasmid taxonomic units (PTUs), with high average nucleotide identity between its members. We identify 83 PTUs in the order Enterobacterales, 28 of them corresponding to previously described archetypes. Furthermore, we develop an automated algorithm for PTU identification, and validate its performance using stochastic blockmodeling. The algorithm reveals a total of 276 PTUs in the bacterial domain. Each PTU exhibits a characteristic host distribution, organized into a six-grade scale (I–VI), ranging from plasmids restricted to a single host species (grade I) to plasmids able to colonize species from different phyla (grade VI). More than 60% of the plasmids in the global map are in groups with host ranges beyond the species barrier. Plasmids can mediate gene transfer across bacterial populations. Here, the authors describe a global map of the prokaryotic plasmidome, where plasmids organize into discrete ‘plasmid taxonomic units’ based on their genomic composition and pairwise sequence identity.
Collapse
Affiliation(s)
- Santiago Redondo-Salvo
- Instituto de Biomedicina y Biotecnología de Cantabria (IBBTEC), Universidad de Cantabria-CSIC, C/Albert Einstein 22, 39011, Santander, Spain
| | - Raúl Fernández-López
- Instituto de Biomedicina y Biotecnología de Cantabria (IBBTEC), Universidad de Cantabria-CSIC, C/Albert Einstein 22, 39011, Santander, Spain
| | - Raúl Ruiz
- Instituto de Biomedicina y Biotecnología de Cantabria (IBBTEC), Universidad de Cantabria-CSIC, C/Albert Einstein 22, 39011, Santander, Spain
| | - Luis Vielva
- Departamento de Ingeniería de las Comunicaciones, Universidad de Cantabria, Santander, Spain
| | - María de Toro
- CIBIR, Centro de Investigación Biomédica de La Rioja, Logroño, Spain
| | - Eduardo P C Rocha
- Microbial Evolutionary Genomics, Institut Pasteur, CNRS, UMR3525, Paris, France
| | - M Pilar Garcillán-Barcia
- Instituto de Biomedicina y Biotecnología de Cantabria (IBBTEC), Universidad de Cantabria-CSIC, C/Albert Einstein 22, 39011, Santander, Spain
| | - Fernando de la Cruz
- Instituto de Biomedicina y Biotecnología de Cantabria (IBBTEC), Universidad de Cantabria-CSIC, C/Albert Einstein 22, 39011, Santander, Spain.
| |
Collapse
|
23
|
Sevillya G, Doerr D, Lerner Y, Stoye J, Steel M, Snir S. Horizontal Gene Transfer Phylogenetics: A Random Walk Approach. Mol Biol Evol 2020; 37:1470-1479. [PMID: 31845962 DOI: 10.1093/molbev/msz302] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
The dramatic decrease in time and cost for generating genetic sequence data has opened up vast opportunities in molecular systematics, one of which is the ability to decipher the evolutionary history of strains of a species. Under this fine systematic resolution, the standard markers are too crude to provide a phylogenetic signal. Nevertheless, among prokaryotes, genome dynamics in the form of horizontal gene transfer (HGT) between organisms and gene loss seem to provide far richer information by affecting both gene order and gene content. The "synteny index" (SI) between a pair of genomes combines these latter two factors, allowing comparison of genomes with unequal gene content, together with order considerations of their common genes. Although this approach is useful for classifying close relatives, no rigorous statistical modeling for it has been suggested. Such modeling is valuable, as it allows observed measures to be transformed into estimates of time periods during evolution, yielding the "additivity" of the measure. To the best of our knowledge, there is no other additivity proof for other gene order/content measures under HGT. Here, we provide a first statistical model and analysis for the SI measure. We model the "gene neighborhood" as a "birth-death-immigration" process affected by the HGT activity over the genome, and analytically relate the HGT rate and time to the expected SI. This model is asymptotic and thus provides accurate results, assuming infinite size genomes. Therefore, we also developed a heuristic model following an "exponential decay" function, accounting for biologically realistic values, which performed well in simulations. Applying this model to 1,133 prokaryotes partitioned to 39 clusters by the rank of genus yields that the average number of genome dynamics events per gene in the phylogenetic depth of genus is around half with significant variability between genera. This result extends and confirms similar results obtained for individual genera in different manners.
Collapse
Affiliation(s)
- Gur Sevillya
- Department of Evolutionary Biology, University of Haifa, Haifa, Israel
| | - Daniel Doerr
- Faculty of Technology, Bielefeld University, Bielefeld, Germany
| | - Yael Lerner
- Department of Evolutionary Biology, University of Haifa, Haifa, Israel
| | - Jens Stoye
- Faculty of Technology, Bielefeld University, Bielefeld, Germany
| | - Mike Steel
- School of Mathematics and Statistics, University of Canterbury, Christchurch, New Zealand
| | - Sagi Snir
- Department of Evolutionary Biology, University of Haifa, Haifa, Israel
| |
Collapse
|
24
|
John J, George S, Nori SRC, Nelson-Sathi S. Phylogenomic Analysis Reveals the Evolutionary Route of Resistant Genes in Staphylococcus aureus. Genome Biol Evol 2020; 11:2917-2926. [PMID: 31589296 PMCID: PMC6808081 DOI: 10.1093/gbe/evz213] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/26/2019] [Indexed: 01/02/2023] Open
Abstract
Multidrug-resistant Staphylococcus aureus is a leading concern worldwide. Coagulase-Negative Staphylococci are claimed to be the reservoir and source of important resistant elements in S. aureus. However, the origin and evolutionary route of resistant genes in S. aureus are still remaining unknown. Here, we performed a detailed phylogenomic analysis of 152 completely sequenced S. aureus strains in comparison with 7,529 non-Staphylococcus aureus reference bacterial genomes. Our results reveal that S. aureus has a large open pan-genome where 97 (55%) of its known resistant-related genes belonging to its accessory genome. Among these genes, 47 (27%) were located within the Staphylococcal Cassette Chromosome mec (SCCmec), a transposable element responsible for resistance against major classes of antibiotics including beta-lactams, macrolides, and aminoglycosides. However, the physically linked mec-box genes (MecA–MecR–MecI) that are responsible for the maintenance of SCCmec elements is not unique to S. aureus, instead it is widely distributed within Staphylococcaceae family. The phyletic patterns of SCCmec-encoded resistant genes in Staphylococcus species are significantly different from that of its core genes indicating frequent exchange of these genes between Staphylococcus species. Our in-depth analysis of SCCmec-resistant gene phylogenies reveals that genes such as blaZ, ble, kmA, and tetK that are responsible for beta-lactam, bleomycin, kanamycin, and tetracycline resistance in S. aureus were laterally transferred from non-Staphylococcus sources. In addition, at least 11 non-SCCmec-encoded resistant genes in S. aureus, were laterally acquired from distantly related species. Our study evidently shows that gene transfers played a crucial role in shaping the evolution of antibiotic resistance in S. aureus.
Collapse
Affiliation(s)
- Jiffy John
- Computational Biology Laboratory, Interdisciplinary Biology, Rajiv Gandhi Centre for Biotechnology (RGCB), Thiruvananthapuram, India.,Manipal Academy of Higher Education (MAHE), Manipal, India
| | - Sinumol George
- Computational Biology Laboratory, Interdisciplinary Biology, Rajiv Gandhi Centre for Biotechnology (RGCB), Thiruvananthapuram, India
| | - Sai Ravi Chandra Nori
- Computational Biology Laboratory, Interdisciplinary Biology, Rajiv Gandhi Centre for Biotechnology (RGCB), Thiruvananthapuram, India
| | - Shijulal Nelson-Sathi
- Computational Biology Laboratory, Interdisciplinary Biology, Rajiv Gandhi Centre for Biotechnology (RGCB), Thiruvananthapuram, India
| |
Collapse
|
25
|
Bernard G, Chan CX, Chan YB, Chua XY, Cong Y, Hogan JM, Maetschke SR, Ragan MA. Alignment-free inference of hierarchical and reticulate phylogenomic relationships. Brief Bioinform 2019; 20:426-435. [PMID: 28673025 PMCID: PMC6433738 DOI: 10.1093/bib/bbx067] [Citation(s) in RCA: 53] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2017] [Revised: 05/04/2017] [Indexed: 11/22/2022] Open
Abstract
We are amidst an ongoing flood of sequence data arising from the application of high-throughput technologies, and a concomitant fundamental revision in our understanding of how genomes evolve individually and within the biosphere. Workflows for phylogenomic inference must accommodate data that are not only much larger than before, but often more error prone and perhaps misassembled, or not assembled in the first place. Moreover, genomes of microbes, viruses and plasmids evolve not only by tree-like descent with modification but also by incorporating stretches of exogenous DNA. Thus, next-generation phylogenomics must address computational scalability while rethinking the nature of orthogroups, the alignment of multiple sequences and the inference and comparison of trees. New phylogenomic workflows have begun to take shape based on so-called alignment-free (AF) approaches. Here, we review the conceptual foundations of AF phylogenetics for the hierarchical (vertical) and reticulate (lateral) components of genome evolution, focusing on methods based on k-mers. We reflect on what seems to be successful, and on where further development is needed.
Collapse
|
26
|
Abstract
Analysis of sequence read pairs can be essential for characterizing structural variation, including junction-spanning pairs of reads (JSPRs) suggesting recent lateral/horizontal gene transfer. TwinBLAST can be used to facilitate this analysis of JSPRs by enabling the visualization and curation of two BLAST reports side by side in a single interface. Analysis of sequence read pairs can be essential for characterizing structural variation, including junction-spanning pairs of reads (JSPRs) suggesting recent lateral/horizontal gene transfer. TwinBLAST can be used to facilitate this analysis of JSPRs by enabling the visualization and curation of two BLAST reports side by side in a single interface.
Collapse
|
27
|
Sevillya G, Snir S. Synteny footprints provide clearer phylogenetic signal than sequence data for prokaryotic classification. Mol Phylogenet Evol 2019; 136:128-137. [DOI: 10.1016/j.ympev.2019.03.010] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2018] [Revised: 03/07/2019] [Accepted: 03/17/2019] [Indexed: 01/22/2023]
|
28
|
Corel E, Méheust R, Watson AK, McInerney JO, Lopez P, Bapteste E. Bipartite Network Analysis of Gene Sharings in the Microbial World. Mol Biol Evol 2019; 35:899-913. [PMID: 29346651 PMCID: PMC5888944 DOI: 10.1093/molbev/msy001] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
Extensive microbial gene flows affect how we understand virology, microbiology, medical sciences, genetic modification, and evolutionary biology. Phylogenies only provide a narrow view of these gene flows: plasmids and viruses, lacking core genes, cannot be attached to cellular life on phylogenetic trees. Yet viruses and plasmids have a major impact on cellular evolution, affecting both the gene content and the dynamics of microbial communities. Using bipartite graphs that connect up to 149,000 clusters of homologous genes with 8,217 related and unrelated genomes, we can in particular show patterns of gene sharing that do not map neatly with the organismal phylogeny. Homologous genes are recycled by lateral gene transfer, and multiple copies of homologous genes are carried by otherwise completely unrelated (and possibly nested) genomes, that is, viruses, plasmids and prokaryotes. When a homologous gene is present on at least one plasmid or virus and at least one chromosome, a process of "gene externalization," affected by a postprocessed selected functional bias, takes place, especially in Bacteria. Bipartite graphs give us a view of vertical and horizontal gene flow beyond classic taxonomy on a single very large, analytically tractable, graph that goes beyond the cellular Web of Life.
Collapse
Affiliation(s)
- Eduardo Corel
- Unité Mixte de Recherche 7138 Evolution Paris-Seine, Centre National de la Recherche Scientifique, Institut de Biologie Paris-Seine, Sorbonne Université, Université Pierre et Marie Curie, Paris, France
| | - Raphaël Méheust
- Unité Mixte de Recherche 7138 Evolution Paris-Seine, Centre National de la Recherche Scientifique, Institut de Biologie Paris-Seine, Sorbonne Université, Université Pierre et Marie Curie, Paris, France
| | - Andrew K Watson
- Unité Mixte de Recherche 7138 Evolution Paris-Seine, Centre National de la Recherche Scientifique, Institut de Biologie Paris-Seine, Sorbonne Université, Université Pierre et Marie Curie, Paris, France
| | - James O McInerney
- Chair in Evolutionary Biology, The University of Manchester, United Kingdom
| | - Philippe Lopez
- Unité Mixte de Recherche 7138 Evolution Paris-Seine, Centre National de la Recherche Scientifique, Institut de Biologie Paris-Seine, Sorbonne Université, Université Pierre et Marie Curie, Paris, France
| | - Eric Bapteste
- Unité Mixte de Recherche 7138 Evolution Paris-Seine, Centre National de la Recherche Scientifique, Institut de Biologie Paris-Seine, Sorbonne Université, Université Pierre et Marie Curie, Paris, France
| |
Collapse
|
29
|
Puigbò P, Wolf YI, Koonin EV. Genome-Wide Comparative Analysis of Phylogenetic Trees: The Prokaryotic Forest of Life. Methods Mol Biol 2019; 1910:241-269. [PMID: 31278667 DOI: 10.1007/978-1-4939-9074-0_8] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Genome-wide comparison of phylogenetic trees is becoming an increasingly common approach in evolutionary genomics, and a variety of approaches for such comparison have been developed. In this article we present several methods for comparative analysis of large numbers of phylogenetic trees. To compare phylogenetic trees taking into account the bootstrap support for each internal branch, the boot-split distance (BSD) method is introduced as an extension of the previously developed split distance (SD) method for tree comparison. The BSD method implements the straightforward idea that comparison of phylogenetic trees can be made more robust by treating tree splits differentially depending on the bootstrap support. Approaches are also introduced for detecting treelike and netlike evolutionary trends in the phylogenetic Forest of Life (FOL), i.e., the entirety of the phylogenetic trees for conserved genes of prokaryotes. The principal method employed for this purpose includes mapping quartets of species onto trees to calculate the support of each quartet topology and so to quantify the tree and net contributions to the distances between species. We describe the applications methods used to analyze the FOL and the results obtained with these methods. These results support the concept of the Tree of Life (TOL) as a central evolutionary trend in the FOL as opposed to the traditional view of the TOL as a "species tree."
Collapse
Affiliation(s)
- Pere Puigbò
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA.,Division of Genetics and Physiology, Department of Biology, University of Turku, Turku, Finland
| | - Yuri I Wolf
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA.
| |
Collapse
|
30
|
Avni E, Snir S. A New Quartet-Based Statistical Method for Comparing Sets of Gene Trees Is Developed Using a Generalized Hoeffding Inequality. J Comput Biol 2018; 26:27-37. [PMID: 30422680 DOI: 10.1089/cmb.2018.0129] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Extracting the strength of the tree signal that is encompassed by a collection of gene trees is an exceptionally challenging problem in phylogenomics. Often, this problem not only involves the construction of individual phylogenies based on different genes, which may be a difficult endeavor on its own, but is also exacerbated by many factors that create conflicts between the evolutionary histories of different gene families, such as duplications or losses of genes; hybridization events; incomplete lineage sorting; and horizontal gene transfer, the latter two play central roles in the evolution of eukaryotes and prokaryotes, respectively. In this work, we tackle the aforementioned problem by focusing on quartet trees, which are the most basic unit of information in the context of unrooted phylogenies. In the first part, we show how a theorem of Janson that generalizes the classical Hoeffding inequality can be used to develop a statistical test involving quartets. In the second part, we study real and simulated data using this theoretical advancement, thus demonstrating how the significance of the differences between sets of quartets can be assessed. Our results are particularly intriguing since they nonstandardly require the analysis of dependent random variables.
Collapse
Affiliation(s)
- Eliran Avni
- Department of Evolutionary Biology, University of Haifa, Haifa, Israel
| | - Sagi Snir
- Department of Evolutionary Biology, University of Haifa, Haifa, Israel
| |
Collapse
|
31
|
Why Prokaryotes Genomes Lack Genes with Introns Processed by Spliceosomes? J Mol Evol 2018; 86:611-612. [PMID: 30382299 DOI: 10.1007/s00239-018-9874-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2018] [Accepted: 10/26/2018] [Indexed: 10/28/2022]
|
32
|
Gallagher AL, Miller SR. Expression of Novel Gene Content Drives Adaptation to Low Iron in the Cyanobacterium Acaryochloris. Genome Biol Evol 2018; 10:1484-1492. [PMID: 29850825 PMCID: PMC6007379 DOI: 10.1093/gbe/evy099] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/18/2018] [Indexed: 12/24/2022] Open
Abstract
Variation in genome content is a potent mechanism of microbial adaptation. The genomes of members of the cyanobacterial genus Acaryochloris vary greatly in gene content as a consequence of the idiosyncratic retention of both recent gene duplicates and plasmid-encoded genes acquired by horizontal transfer. For example, the genome of Acaryochloris strain MBIC11017, which was isolated from an iron-limited environment, is enriched in duplicated and novel genes involved in iron assimilation. Here, we took an integrative approach to characterize the adaptation of Acaryochloris MBIC11017 to low environmental iron availability and the relative contributions of the expression of duplicated versus novel genes. We observed that Acaryochloris MBIC11017 grew faster and to a higher yield in the presence of nanomolar concentrations of iron than did a closely related strain. These differences were associated with both a higher rate of iron assimilation and a greater abundance of iron assimilation transcripts. However, recently duplicated genes contributed little to increased transcript dosage; rather, the maintenance of these duplicates in the MBIC11017 genome is likely due to the sharing of ancestral dosage by expression reduction. Instead, novel, horizontally transferred genes are responsible for the differences in transcript abundance. The study provides insights on the mechanisms of adaptive genome evolution and gene expression in Acaryochloris.
Collapse
Affiliation(s)
| | - Scott R Miller
- Division of Biological Sciences, The University of Montana
| |
Collapse
|
33
|
Abstract
BACKGROUND Deciphering the history of life on Earth has long been regarded as one of the most central tasks in biology. In past years, widespread discordance between the evolutionary histories of different groups of orthologous genes of prokaryotes have been revealed, primarily due to horizontal gene transfers (HGTs). Nonetheless, evidence that support a strong tree-like signal of evolution have been uncovered, despite the presence of HGT events. Therefore, a challenging task is to distill this tree-like signal from the noise induced by all sources of non-tree-like events. RESULTS In this work we tackle this question, using real and simulated data. We first tighten a recent related theoretical result in this field. In a simulation study, we infer individual quartet topologies, and then use the inferred quartets to reconstruct simulated species trees. We demonstrate that accurate tree reconstruction is feasible despite surprisingly high rates of HGT. In a real data study, we construct phylogenies of two sets of prokaryotes, and show that our tree reconstruction scheme is comparable with (and complementary better than) other commonly used methods. CONCLUSIONS Using a blend of theoretical and empirical investigations, our study proves the feasibility of accurate quartet-based phylogenetic reconstruction, the vast impact of HGT events notwithstanding.
Collapse
Affiliation(s)
- Eliran Avni
- Department of Evolutionary Biology, University of Haifa, 199 Aba Khoushy Ave. Mount Carmel, Haifa, 3498838, Israel
| | - Sagi Snir
- Department of Evolutionary Biology, University of Haifa, 199 Aba Khoushy Ave. Mount Carmel, Haifa, 3498838, Israel.
| |
Collapse
|
34
|
Porse A, Schou TS, Munck C, Ellabaan MMH, Sommer MOA. Biochemical mechanisms determine the functional compatibility of heterologous genes. Nat Commun 2018; 9:522. [PMID: 29410400 PMCID: PMC5802803 DOI: 10.1038/s41467-018-02944-3] [Citation(s) in RCA: 38] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2017] [Accepted: 01/09/2018] [Indexed: 11/28/2022] Open
Abstract
Elucidating the factors governing the functional compatibility of horizontally transferred genes is important to understand bacterial evolution, including the emergence and spread of antibiotic resistance, and to successfully engineer biological systems. In silico efforts and work using single-gene libraries have suggested that sequence composition is a strong barrier for the successful integration of heterologous genes. Here we sample 200 diverse genes, representing >80% of sequenced antibiotic resistance genes, to interrogate the factors governing genetic compatibility in new hosts. In contrast to previous work, we find that GC content, codon usage, and mRNA-folding energy are of minor importance for the compatibility of mechanistically diverse gene products at moderate expression. Instead, we identify the phylogenetic origin, and the dependence of a resistance mechanism on host physiology, as major factors governing the functionality and fitness of antibiotic resistance genes. These findings emphasize the importance of biochemical mechanism for heterologous gene compatibility, and suggest physiological constraints as a pivotal feature orienting the evolution of antibiotic resistance. Sequence composition is thought to be a major factor governing the functionality of horizontally transferred genes. In contrast, Porse et al. show that phylogenetic origin, and the type of resistance mechanism, are major factors affecting the functionality of horizontally transferred antibiotic resistance genes.
Collapse
Affiliation(s)
- Andreas Porse
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kgs. Lyngby, DK-2800, Denmark
| | - Thea S Schou
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kgs. Lyngby, DK-2800, Denmark
| | - Christian Munck
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kgs. Lyngby, DK-2800, Denmark
| | - Mostafa M H Ellabaan
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kgs. Lyngby, DK-2800, Denmark
| | - Morten O A Sommer
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kgs. Lyngby, DK-2800, Denmark.
| |
Collapse
|
35
|
Updating the genomic taxonomy and epidemiology of Campylobacter hyointestinalis. Sci Rep 2018; 8:2393. [PMID: 29403020 PMCID: PMC5799301 DOI: 10.1038/s41598-018-20889-x] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2017] [Accepted: 01/25/2018] [Indexed: 12/24/2022] Open
Abstract
Campylobacter hyointestinalis is a member of an emerging group of zoonotic Campylobacter spp. that are increasingly identified in both gastric and non-gastric disease in humans. Here, we discovered C. hyointestinalis in three separate classes of New Zealand ruminant livestock; cattle, sheep and deer. To investigate the relevance of these findings we performed a systematic literature review on global C. hyointestinalis epidemiology and used comparative genomics to better understand and classify members of the species. We found that C. hyointestinalis subspecies hyointestinalis has an open pangenome, with accessory gene contents involved in many essential processes such as metabolism, virulence and defence. We observed that horizontal gene transfer is likely to have played an overwhelming role in species diversification, favouring a public-goods-like mechanism of gene ‘acquisition and resampling’ over a tree-of-life-like vertical inheritance model of evolution. As a result, simplistic gene-based inferences of taxonomy by similarity are likely to be misleading. Such genomic plasticity will also mean that local evolutionary histories likely influence key species characteristics, such as host-association and virulence. This may help explain geographical differences in reported C. hyointestinalis epidemiology and limits what characteristics may be generalised, requiring further genomic studies of C. hyointestinalis in areas where it causes disease.
Collapse
|
36
|
Affiliation(s)
- Eugene V. Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA
| | - Artem S. Novozhilov
- Department of Mathematics, North Dakota State University, Fargo, North Dakota 58108, USA
| |
Collapse
|
37
|
Motlagh AM, Bhattacharjee AS, Coutinho FH, Dutilh BE, Casjens SR, Goel RK. Insights of Phage-Host Interaction in Hypersaline Ecosystem through Metagenomics Analyses. Front Microbiol 2017; 8:352. [PMID: 28316597 PMCID: PMC5334351 DOI: 10.3389/fmicb.2017.00352] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2016] [Accepted: 02/20/2017] [Indexed: 01/21/2023] Open
Abstract
Bacteriophages, as the most abundant biological entities on Earth, place significant predation pressure on their hosts. This pressure plays a critical role in the evolution, diversity, and abundance of bacteria. In addition, phages modulate the genetic diversity of prokaryotic communities through the transfer of auxiliary metabolic genes. Various studies have been conducted in diverse ecosystems to understand phage-host interactions and their effects on prokaryote metabolism and community composition. However, hypersaline environments remain among the least studied ecosystems and the interaction between the phages and prokaryotes in these habitats is poorly understood. This study begins to fill this knowledge gap by analyzing bacteriophage-host interactions in the Great Salt Lake, the largest prehistoric hypersaline lake in the Western Hemisphere. Our metagenomics analyses allowed us to comprehensively identify the bacterial and phage communities with Proteobacteria, Firmicutes, and Bacteroidetes as the most dominant bacterial species and Siphoviridae, Myoviridae, and Podoviridae as the most dominant viral families found in the metagenomic sequences. We also characterized interactions between the phage and prokaryotic communities of Great Salt Lake and determined how these interactions possibly influence the community diversity, structure, and biogeochemical cycles. In addition, presence of prophages and their interaction with the prokaryotic host was studied and showed the possibility of prophage induction and subsequent infection of prokaryotic community present in the Great Salt Lake environment under different environmental stress factors. We found that carbon cycle was the most susceptible nutrient cycling pathways to prophage induction in the presence of environmental stresses. This study gives an enhanced snapshot of phage and prokaryote abundance and diversity as well as their interactions in a hypersaline complex ecosystem, which can pave the way for further research studies.
Collapse
Affiliation(s)
- Amir Mohaghegh Motlagh
- Department of Civil and Environmental Engineering, University of Utah Salt Lake, UT, USA
| | - Ananda S Bhattacharjee
- Department of Civil and Environmental Engineering, University of Utah Salt Lake, UT, USA
| | - Felipe H Coutinho
- Instituto de Biologia, Universidade Federal do Rio de JaneiroRio de Janeiro, Brazil; Radboud Institute for Molecular Life Sciences, Centre for Molecular and Biomolecular Informatics, Radboud University Medical CentreNijmegen, Netherlands
| | - Bas E Dutilh
- Instituto de Biologia, Universidade Federal do Rio de JaneiroRio de Janeiro, Brazil; Radboud Institute for Molecular Life Sciences, Centre for Molecular and Biomolecular Informatics, Radboud University Medical CentreNijmegen, Netherlands; Theoretical Biology and Bioinformatics, Utrecht UniversityUtrecht, Netherlands
| | | | - Ramesh K Goel
- Department of Civil and Environmental Engineering, University of Utah Salt Lake, UT, USA
| |
Collapse
|
38
|
Cong Y, Chan YB, Phillips CA, Langston MA, Ragan MA. Robust Inference of Genetic Exchange Communities from Microbial Genomes Using TF-IDF. Front Microbiol 2017; 8:21. [PMID: 28154557 PMCID: PMC5243798 DOI: 10.3389/fmicb.2017.00021] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2016] [Accepted: 01/04/2017] [Indexed: 11/13/2022] Open
Abstract
Bacteria and archaea can exchange genetic material across lineages through processes of lateral genetic transfer (LGT). Collectively, these exchange relationships can be modeled as a network and analyzed using concepts from graph theory. In particular, densely connected regions within an LGT network have been defined as genetic exchange communities (GECs). However, it has been problematic to construct networks in which edges solely represent LGT. Here we apply term frequency-inverse document frequency (TF-IDF), an alignment-free method originating from document analysis, to infer regions of lateral origin in bacterial genomes. We examine four empirical datasets of different size (number of genomes) and phyletic breadth, varying a key parameter (word length k) within bounds established in previous work. We map the inferred lateral regions to genes in recipient genomes, and construct networks in which the nodes are groups of genomes, and the edges natively represent LGT. We then extract maximum and maximal cliques (i.e., GECs) from these graphs, and identify nodes that belong to GECs across a wide range of k. Most surviving lateral transfer has happened within these GECs. Using Gene Ontology enrichment tests we demonstrate that biological processes associated with metabolism, regulation and transport are often over-represented among the genes affected by LGT within these communities. These enrichments are largely robust to change of k.
Collapse
Affiliation(s)
- Yingnan Cong
- Institute for Molecular Bioscience and ARC Centre of Excellence in Bioinformatics, University of Queensland, St Lucia QLD, Australia
| | - Yao-Ban Chan
- School of Mathematics and Statistics, University of Melbourne, Parkville VIC, Australia
| | - Charles A Phillips
- Department of Electrical Engineering and Computer Science, University of Tennessee, Knoxville TN, USA
| | - Michael A Langston
- Department of Electrical Engineering and Computer Science, University of Tennessee, Knoxville TN, USA
| | - Mark A Ragan
- Institute for Molecular Bioscience and ARC Centre of Excellence in Bioinformatics, University of Queensland, St Lucia QLD, Australia
| |
Collapse
|
39
|
Snir S. Ordered orthology as a tool in prokaryotic evolutionary inference. Mob Genet Elements 2017; 6:e1120576. [PMID: 28090377 DOI: 10.1080/2159256x.2015.1120576] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2015] [Revised: 10/27/2015] [Accepted: 11/10/2015] [Indexed: 10/22/2022] Open
Abstract
Molecular data is accumulated at exponentially increasing pace. This deluge of information should have brought us closer to resolving one of the most fundamental issues in biology - deciphering the history of life on Earth. So far, however, this abundance of data only seems to blur our understanding of the problem. This is largely due to horizontal gene transfer (HGT), the transfer of genetic material between evolutionarily unrelated organisms that transforms the prokaryotic tree into a network of relationships. Recently, we developed a method to infer evolutionary relationships among closely related species where the conventional evolutionary markers do not provide a strong enough signal. The method relies on the loss of synteny, gene order conservation among species that provides a stronger signal, sufficient to classify even strains of a given species. Here we elaborate on this method and suggest further uses of it in the context of detecting HGT events and genome architecture.
Collapse
Affiliation(s)
- Sagi Snir
- Department of Evolutionary Biology, University of Haifa , Haifa, Israel
| |
Collapse
|
40
|
Chan CX, Beiko RG, Ragan MA. Scaling Up the Phylogenetic Detection of Lateral Gene Transfer Events. Methods Mol Biol 2017; 1525:421-432. [PMID: 27896730 DOI: 10.1007/978-1-4939-6622-6_16] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
Abstract
Lateral genetic transfer (LGT) is the process by which genetic material moves between organisms (and viruses) in the biosphere. Among the many approaches developed for the inference of LGT events from DNA sequence data, methods based on the comparison of phylogenetic trees remain the gold standard for many types of problem. Identifying LGT events from sequenced genomes typically involves a series of steps in which homologous sequences are identified and aligned, phylogenetic trees are inferred, and their topologies are compared to identify unexpected or conflicting relationships. These types of approach have been used to elucidate the nature and extent of LGT and its physiological and ecological consequences throughout the Tree of Life. Advances in DNA sequencing technology have led to enormous increases in the number of sequenced genomes, including ultra-deep sampling of specific taxonomic groups and single cell-based sequencing of unculturable "microbial dark matter." Environmental shotgun sequencing enables the study of LGT among organisms that share the same habitat.This abundance of genomic data offers new opportunities for scientific discovery, but poses two key problems. As ever more genomes are generated, the assembly and annotation of each individual genome receives less scrutiny; and with so many genomes available it is tempting to include them all in a single analysis, but thousands of genomes and millions of genes can overwhelm key algorithms in the analysis pipeline. Identifying LGT events of interest therefore depends on choosing the right dataset, and on algorithms that appropriately balance speed and accuracy given the size and composition of the chosen set of genomes.
Collapse
Affiliation(s)
- Cheong Xin Chan
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, 4072, Australia
| | - Robert G Beiko
- Faculty of Computer Science, Dalhousie University, Halifax, NS, B3H 4R2, Canada
| | - Mark A Ragan
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, 4072, Australia.
| |
Collapse
|
41
|
Naushad S, Barkema HW, Luby C, Condas LAZ, Nobrega DB, Carson DA, De Buck J. Comprehensive Phylogenetic Analysis of Bovine Non- aureus Staphylococci Species Based on Whole-Genome Sequencing. Front Microbiol 2016; 7:1990. [PMID: 28066335 PMCID: PMC5168469 DOI: 10.3389/fmicb.2016.01990] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2016] [Accepted: 11/28/2016] [Indexed: 11/19/2022] Open
Abstract
Non-aureus staphylococci (NAS), a heterogeneous group of a large number of species and subspecies, are the most frequently isolated pathogens from intramammary infections in dairy cattle. Phylogenetic relationships among bovine NAS species are controversial and have mostly been determined based on single-gene trees. Herein, we analyzed phylogeny of bovine NAS species using whole-genome sequencing (WGS) of 441 distinct isolates. In addition, evolutionary relationships among bovine NAS were estimated from multilocus data of 16S rRNA, hsp60, rpoB, sodA, and tuf genes and sequences from these and numerous other single genes/proteins. All phylogenies were created with FastTree, Maximum-Likelihood, Maximum-Parsimony, and Neighbor-Joining methods. Regardless of methodology, WGS-trees clearly separated bovine NAS species into five monophyletic coherent clades. Furthermore, there were consistent interspecies relationships within clades in all WGS phylogenetic reconstructions. Except for the Maximum-Parsimony tree, multilocus data analysis similarly produced five clades. There were large variations in determining clades and interspecies relationships in single gene/protein trees, under different methods of tree constructions, highlighting limitations of using single genes for determining bovine NAS phylogeny. However, based on WGS data, we established a robust phylogeny of bovine NAS species, unaffected by method or model of evolutionary reconstructions. Therefore, it is now possible to determine associations between phylogeny and many biological traits, such as virulence, antimicrobial resistance, environmental niche, geographical distribution, and host specificity.
Collapse
Affiliation(s)
- Sohail Naushad
- Department of Production Animal Health, Faculty of Veterinary Medicine, University of CalgaryCalgary, AB, Canada; Canadian Bovine Mastitis and Milk Quality Research NetworkSt-Hyacinthe, QC, Canada
| | - Herman W Barkema
- Department of Production Animal Health, Faculty of Veterinary Medicine, University of CalgaryCalgary, AB, Canada; Canadian Bovine Mastitis and Milk Quality Research NetworkSt-Hyacinthe, QC, Canada
| | - Christopher Luby
- Canadian Bovine Mastitis and Milk Quality Research NetworkSt-Hyacinthe, QC, Canada; Department of Large Animal Clinical Sciences, Western College of Veterinary Medicine, University of SaskatchewanSaskatoon, SK, Canada
| | - Larissa A Z Condas
- Department of Production Animal Health, Faculty of Veterinary Medicine, University of CalgaryCalgary, AB, Canada; Canadian Bovine Mastitis and Milk Quality Research NetworkSt-Hyacinthe, QC, Canada
| | - Diego B Nobrega
- Department of Production Animal Health, Faculty of Veterinary Medicine, University of CalgaryCalgary, AB, Canada; Canadian Bovine Mastitis and Milk Quality Research NetworkSt-Hyacinthe, QC, Canada
| | - Domonique A Carson
- Department of Production Animal Health, Faculty of Veterinary Medicine, University of CalgaryCalgary, AB, Canada; Canadian Bovine Mastitis and Milk Quality Research NetworkSt-Hyacinthe, QC, Canada
| | - Jeroen De Buck
- Department of Production Animal Health, Faculty of Veterinary Medicine, University of CalgaryCalgary, AB, Canada; Canadian Bovine Mastitis and Milk Quality Research NetworkSt-Hyacinthe, QC, Canada
| |
Collapse
|
42
|
Frenkel Z, Kiat Y, Izhaki I, Snir S. Convex recoloring as an evolutionary marker. Mol Phylogenet Evol 2016; 107:209-220. [PMID: 27818264 DOI: 10.1016/j.ympev.2016.10.018] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2016] [Revised: 10/16/2016] [Accepted: 10/25/2016] [Indexed: 11/27/2022]
Abstract
With the availability of enormous quantities of genetic data it has become common to construct very accurate trees describing the evolutionary history of the species under study, as well as every single gene of these species. These trees allow us to examine the evolutionary compliance of given markers (characters). A marker compliant with the history of the species investigated, has undergone mutations along the species tree branches, such that every subtree of that tree exhibits a different state. Convex recoloring (CR) uses combinatorial representation to measure the adequacy of a taxonomic classifier to a given tree. Despite its biological origins, research on CR has been almost exclusively dedicated to mathematical properties of the problem, or variants of it with little, if any, relationship to taxonomy. In this work we return to the origins of CR. We put CR in a statistical framework and introduce and learn the notion of the statistical significance of a character. We apply this measure to two data sets - Passerine birds and prokaryotes, and four examples. These examples demonstrate various applications of CR, from evolutionary relatedness, through lateral evolution, to supertree construction. The above study was done with a new software that we provide, containing algorithmic improvement with a graphical output of a (optimally) recolored tree. AVAILABILITY A code implementing the features and a README is available at http://research.haifa.ac.il/ssagi/software/convexrecoloring.zip.
Collapse
Affiliation(s)
- Zeev Frenkel
- Department of Ecology and Evolutionary Biology, University of Haifa, Israel
| | - Yosef Kiat
- Israeli Bird Ringing Center, Society for the Protection of Nature in Israel, Israel
| | - Ido Izhaki
- Department of Ecology and Evolutionary Biology, University of Haifa, Israel
| | - Sagi Snir
- Department of Ecology and Evolutionary Biology, University of Haifa, Israel
| |
Collapse
|
43
|
Koonin EV. Horizontal gene transfer: essentiality and evolvability in prokaryotes, and roles in evolutionary transitions. F1000Res 2016; 5. [PMID: 27508073 PMCID: PMC4962295 DOI: 10.12688/f1000research.8737.1] [Citation(s) in RCA: 93] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 07/18/2016] [Indexed: 01/01/2023] Open
Abstract
The wide spread of gene exchange and loss in the prokaryotic world has prompted the concept of ‘lateral genomics’ to the point of an outright denial of the relevance of phylogenetic trees for evolution. However, the pronounced coherence congruence of the topologies of numerous gene trees, particularly those for (nearly) universal genes, translates into the notion of a statistical tree of life (STOL), which reflects a central trend of vertical evolution. The STOL can be employed as a framework for reconstruction of the evolutionary processes in the prokaryotic world. Quantitatively, however, horizontal gene transfer (HGT) dominates microbial evolution, with the rate of gene gain and loss being comparable to the rate of point mutations and much greater than the duplication rate. Theoretical models of evolution suggest that HGT is essential for the survival of microbial populations that otherwise deteriorate due to the Muller’s ratchet effect. Apparently, at least some bacteria and archaea evolved dedicated vehicles for gene transfer that evolved from selfish elements such as plasmids and viruses. Recent phylogenomic analyses suggest that episodes of massive HGT were pivotal for the emergence of major groups of organisms such as multiple archaeal phyla as well as eukaryotes. Similar analyses appear to indicate that, in addition to donating hundreds of genes to the emerging eukaryotic lineage, mitochondrial endosymbiosis severely curtailed HGT. These results shed new light on the routes of evolutionary transitions, but caution is due given the inherent uncertainty of deep phylogenies.
Collapse
Affiliation(s)
- Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| |
Collapse
|
44
|
Bernard G, Chan CX, Ragan MA. Alignment-free microbial phylogenomics under scenarios of sequence divergence, genome rearrangement and lateral genetic transfer. Sci Rep 2016; 6:28970. [PMID: 27363362 PMCID: PMC4929450 DOI: 10.1038/srep28970] [Citation(s) in RCA: 42] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2016] [Accepted: 06/13/2016] [Indexed: 12/22/2022] Open
Abstract
Alignment-free (AF) approaches have recently been highlighted as alternatives to methods based on multiple sequence alignment in phylogenetic inference. However, the sensitivity of AF methods to genome-scale evolutionary scenarios is little known. Here, using simulated microbial genome data we systematically assess the sensitivity of nine AF methods to three important evolutionary scenarios: sequence divergence, lateral genetic transfer (LGT) and genome rearrangement. Among these, AF methods are most sensitive to the extent of sequence divergence, less sensitive to low and moderate frequencies of LGT, and most robust against genome rearrangement. We describe the application of AF methods to three well-studied empirical genome datasets, and introduce a new application of the jackknife to assess node support. Our results demonstrate that AF phylogenomics is computationally scalable to multi-genome data and can generate biologically meaningful phylogenies and insights into microbial evolution.
Collapse
Affiliation(s)
- Guillaume Bernard
- Institute for Molecular Bioscience, and ARC Centre of Excellence in Bioinformatics, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Cheong Xin Chan
- Institute for Molecular Bioscience, and ARC Centre of Excellence in Bioinformatics, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Mark A. Ragan
- Institute for Molecular Bioscience, and ARC Centre of Excellence in Bioinformatics, The University of Queensland, Brisbane, QLD 4072, Australia
| |
Collapse
|
45
|
Bromberg R, Grishin NV, Otwinowski Z. Phylogeny Reconstruction with Alignment-Free Method That Corrects for Horizontal Gene Transfer. PLoS Comput Biol 2016; 12:e1004985. [PMID: 27336403 PMCID: PMC4918981 DOI: 10.1371/journal.pcbi.1004985] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2015] [Accepted: 05/10/2016] [Indexed: 01/20/2023] Open
Abstract
Advances in sequencing have generated a large number of complete genomes. Traditionally, phylogenetic analysis relies on alignments of orthologs, but defining orthologs and separating them from paralogs is a complex task that may not always be suited to the large datasets of the future. An alternative to traditional, alignment-based approaches are whole-genome, alignment-free methods. These methods are scalable and require minimal manual intervention. We developed SlopeTree, a new alignment-free method that estimates evolutionary distances by measuring the decay of exact substring matches as a function of match length. SlopeTree corrects for horizontal gene transfer, for composition variation and low complexity sequences, and for branch-length nonlinearity caused by multiple mutations at the same site. We tested SlopeTree on 495 bacteria, 73 archaea, and 72 strains of Escherichia coli and Shigella. We compared our trees to the NCBI taxonomy, to trees based on concatenated alignments, and to trees produced by other alignment-free methods. The results were consistent with current knowledge about prokaryotic evolution. We assessed differences in tree topology over different methods and settings and found that the majority of bacteria and archaea have a core set of proteins that evolves by descent. In trees built from complete genomes rather than sets of core genes, we observed some grouping by phenotype rather than phylogeny, for instance with a cluster of sulfur-reducing thermophilic bacteria coming together irrespective of their phyla. The source-code for SlopeTree is available at: http://prodata.swmed.edu/download/pub/slopetree_v1/slopetree.tar.gz. Due to their lack of distinct morphological features, bacteria and archaea were extremely difficult to classify until technology was developed to obtain their DNA sequences; these sequences could then be compared to estimate evolutionary relationships. Now, due to technological advances, there is a flood of available sequences from a wide variety of organisms. These advances have spurred the development of algorithms which can estimate evolutionary relationships using whole genomes, in contrast to the more traditional methods which used single genes earlier and now typically use groups of conserved genes. However, there are many challenges when attempting to infer evolutionary relationships, in particular horizontal gene transfer, where DNA is transferred from one organism to another, resulting in an organism’s genome containing DNA that does not reflect its evolution by descent. We developed a new whole-genome method for estimating evolutionary distances which identifies and corrects for horizontal transfer. We found that for SlopeTree and all other whole-genome methods we applied, horizontal transfer causes some evolutionary distances to be grossly underestimated, and that our correction corrects for this.
Collapse
Affiliation(s)
- Raquel Bromberg
- Department of Biophysics and Department of Biochemistry, University of Texas Southwestern Medical Center at Dallas, Dallas, Texas, United States of America
| | - Nick V. Grishin
- Department of Biophysics and Department of Biochemistry, University of Texas Southwestern Medical Center at Dallas, Dallas, Texas, United States of America
- Howard Hughes Medical Institute, University of Texas Southwestern Medical Center at Dallas, Dallas, Texas, United States of America
| | - Zbyszek Otwinowski
- Department of Biophysics and Department of Biochemistry, University of Texas Southwestern Medical Center at Dallas, Dallas, Texas, United States of America
- * E-mail:
| |
Collapse
|
46
|
Gupta RS. Impact of genomics on the understanding of microbial evolution and classification: the importance of Darwin's views on classification. FEMS Microbiol Rev 2016; 40:520-53. [PMID: 27279642 DOI: 10.1093/femsre/fuw011] [Citation(s) in RCA: 55] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/14/2016] [Indexed: 12/24/2022] Open
Abstract
Analyses of genome sequences, by some approaches, suggest that the widespread occurrence of horizontal gene transfers (HGTs) in prokaryotes disguises their evolutionary relationships and have led to questioning of the Darwinian model of evolution for prokaryotes. These inferences are critically examined in the light of comparative genome analysis, characteristic synapomorphies, phylogenetic trees and Darwin's views on examining evolutionary relationships. Genome sequences are enabling discovery of numerous molecular markers (synapomorphies) such as conserved signature indels (CSIs) and conserved signature proteins (CSPs), which are distinctive characteristics of different prokaryotic taxa. Based on these molecular markers, exhibiting high degree of specificity and predictive ability, numerous prokaryotic taxa of different ranks, currently identified based on the 16S rRNA gene trees, can now be reliably demarcated in molecular terms. Within all studied groups, multiple CSIs and CSPs have been identified for successive nested clades providing reliable information regarding their hierarchical relationships and these inferences are not affected by HGTs. These results strongly support Darwin's views on evolution and classification and supplement the current phylogenetic framework based on 16S rRNA in important respects. The identified molecular markers provide important means for developing novel diagnostics, therapeutics and for functional studies providing important insights regarding prokaryotic taxa.
Collapse
Affiliation(s)
- Radhey S Gupta
- Department of Biochemistry and Biomedical Sciences, McMaster University, Hamilton, ON, Canada
| |
Collapse
|
47
|
Akanni WA, Siu-Ting K, Creevey CJ, McInerney JO, Wilkinson M, Foster PG, Pisani D. Horizontal gene flow from Eubacteria to Archaebacteria and what it means for our understanding of eukaryogenesis. Philos Trans R Soc Lond B Biol Sci 2016; 370:20140337. [PMID: 26323767 DOI: 10.1098/rstb.2014.0337] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open
Abstract
The origin of the eukaryotic cell is considered one of the major evolutionary transitions in the history of life. Current evidence strongly supports a scenario of eukaryotic origin in which two prokaryotes, an archaebacterial host and an α-proteobacterium (the free-living ancestor of the mitochondrion), entered a stable symbiotic relationship. The establishment of this relationship was associated with a process of chimerization, whereby a large number of genes from the α-proteobacterial symbiont were transferred to the host nucleus. A general framework allowing the conceptualization of eukaryogenesis from a genomic perspective has long been lacking. Recent studies suggest that the origins of several archaebacterial phyla were coincident with massive imports of eubacterial genes. Although this does not indicate that these phyla originated through the same process that led to the origin of Eukaryota, it suggests that Archaebacteria might have had a general propensity to integrate into their genomes large amounts of eubacterial DNA. We suggest that this propensity provides a framework in which eukaryogenesis can be understood and studied in the light of archaebacterial ecology. We applied a recently developed supertree method to a genomic dataset composed of 392 eubacterial and 51 archaebacterial genera to test whether large numbers of genes flowing from Eubacteria are indeed coincident with the origin of major archaebacterial clades. In addition, we identified two potential large-scale transfers of uncertain directionality at the base of the archaebacterial tree. Our results are consistent with previous findings and seem to indicate that eubacterial gene imports (particularly from δ-Proteobacteria, Clostridia and Actinobacteria) were an important factor in archaebacterial history. Archaebacteria seem to have long relied on Eubacteria as a source of genetic diversity, and while the precise mechanism that allowed these imports is unknown, we suggest that our results support the view that processes comparable to those through which eukaryotes emerged might have been common in archaebacterial history.
Collapse
Affiliation(s)
- Wasiu A Akanni
- School of Biological Sciences and School of Earth Sciences, University of Bristol, Life Sciences Building, Bristol BS8 1TG, UK Department of Biology, National University of Ireland, Maynooth, Co. Kildare, Ireland Department of Life Science, The Natural History Museum, London SW7 5BD, UK
| | - Karen Siu-Ting
- School of Biological Sciences and School of Earth Sciences, University of Bristol, Life Sciences Building, Bristol BS8 1TG, UK Department of Biology, National University of Ireland, Maynooth, Co. Kildare, Ireland Department of Life Science, The Natural History Museum, London SW7 5BD, UK Institute of Biological, Environmental and Rural Sciences (IBERS), Aberystwyth University, Aberystwyth, Ceredigion SY23 3FG, UK
| | - Christopher J Creevey
- Institute of Biological, Environmental and Rural Sciences (IBERS), Aberystwyth University, Aberystwyth, Ceredigion SY23 3FG, UK
| | - James O McInerney
- Department of Biology, National University of Ireland, Maynooth, Co. Kildare, Ireland Faculty of Life Sciences, University of Manchester, Oxford Road, Manchester M13 9PL, UK
| | - Mark Wilkinson
- Department of Life Science, The Natural History Museum, London SW7 5BD, UK
| | - Peter G Foster
- Department of Life Science, The Natural History Museum, London SW7 5BD, UK
| | - Davide Pisani
- School of Biological Sciences and School of Earth Sciences, University of Bristol, Life Sciences Building, Bristol BS8 1TG, UK
| |
Collapse
|
48
|
McInerney J, Pisani D, O'Connell MJ. The ring of life hypothesis for eukaryote origins is supported by multiple kinds of data. Philos Trans R Soc Lond B Biol Sci 2016; 370:20140323. [PMID: 26323755 DOI: 10.1098/rstb.2014.0323] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
The literature is replete with manuscripts describing the origin of eukaryotic cells. Most of the models for eukaryogenesis are either autogenous (sometimes called slow-drip), or symbiogenic (sometimes called big-bang). In this article, we use large and diverse suites of 'Omics' and other data to make the inference that autogeneous hypotheses are a very poor fit to the data and the origin of eukaryotic cells occurred in a single symbiosis.
Collapse
Affiliation(s)
- James McInerney
- Department of Biology, National University of Ireland Maynooth, Co. Kildare, Republic of Ireland Faculty of Life Sciences, University of Manchester, Oxford Road, Manchester M13 9PL, UK
| | - Davide Pisani
- School of Biological Sciences and School of Earth Sciences, University of Bristol, Life Sciences Building, 24 Tyndall Avenue, Bristol BS8 1TG, UK
| | - Mary J O'Connell
- School of Biotechnology, Dublin City University, Glasnevin, Dublin 9, Republic of Ireland
| |
Collapse
|
49
|
Boto L. Evolutionary change and phylogenetic relationships in light of horizontal gene transfer. J Biosci 2016; 40:465-72. [PMID: 25963270 DOI: 10.1007/s12038-015-9514-8] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
Abstract
Horizontal gene transfer has, over the past 25 years, become a part of evolutionary thinking. In the present paper I discuss horizontal gene transfer (HGT) in relation to contingency, natural selection, evolutionary change speed and the Tree-of-Life endeavour, with the aim of contributing to the understanding of the role of HGT in evolutionary processes. In addition, the challenges that HGT imposes on the current view of evolution are emphasized.
Collapse
Affiliation(s)
- Luis Boto
- Departamento de Biodiversidad y Biologia Evolutiva, Museo Nacional Ciencias Naturales, CSIC, C/ Jose Gutierrez Abascal 2, 28006, Madrid, Spain,
| |
Collapse
|
50
|
Daubin V, Szöllősi GJ. Horizontal Gene Transfer and the History of Life. Cold Spring Harb Perspect Biol 2016; 8:a018036. [PMID: 26801681 DOI: 10.1101/cshperspect.a018036] [Citation(s) in RCA: 52] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Microbes acquire DNA from a variety of sources. The last decades, which have seen the development of genome sequencing, have revealed that horizontal gene transfer has been a major evolutionary force that has constantly reshaped genomes throughout evolution. However, because the history of life must ultimately be deduced from gene phylogenies, the lack of methods to account for horizontal gene transfer has thrown into confusion the very concept of the tree of life. As a result, many questions remain open, but emerging methodological developments promise to use information conveyed by horizontal gene transfer that remains unexploited today.
Collapse
Affiliation(s)
- Vincent Daubin
- Laboratoire de Biométrie et Biologie Evolutive, Université de Lyon, 69000 Lyon, France Centre National de la Recherche Scientifique, Unité Mixte de Recherche 5558, Université Lyon 1, 69622 Villeurbanne, France
| | | |
Collapse
|