1
Tao Y, Zeng Z, Deng Y, Zhang M, Wang F, Wang Y. Phylogeny and evolution of dissimilatory sulfite reduction in prokaryotes. Mol Phylogenet Evol 2024; 201:108208. [PMID: 39343112 DOI: 10.1016/j.ympev.2024.108208] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/07/2024] [Revised: 09/22/2024] [Accepted: 09/26/2024] [Indexed: 10/01/2024]
Abstract
Sulfate is the second most common nonmetallic ion in modern oceans, as its concentration dramatically increased alongside tectonic activity and atmospheric oxidation in the Proterozoic. Microbial sulfate/sulfite metabolism, involving organic carbon or hydrogen oxidation, is linked to sulfur and carbon biogeochemical cycles. However, the coevolution of microbial sulfate/sulfite metabolism and Earth's history remains unclear. Here, we conducted a comprehensive phylogenetic analysis to explore the evolutionary history of the dissimilatory sulfite reduction (Dsr) pathway. The phylogenies of the Dsr-related genes presented similar branching patterns but also some incongruencies, indicating the complex origin and evolution of Dsr. Among these genes, dsrAB is the hallmark of sulfur-metabolizing prokaryotes. Our detailed analyses suggested that the evolution of dsrAB was shaped by vertical inheritance and multiple horizontal gene transfer events and that selection pressure varied across distinct lineages. Dated phylogenetic trees indicated that key evolutionary events of dissimilatory sulfur-metabolizing prokaryotes were related to the Great Oxygenation Event (2.4-2.0 Ga) and several geological events in the "Boring Billion" (1.8-0.8 Ga), including the fragmentation of the Columbia supercontinent (approximately 1.6 Ga), the rapid increase in marine sulfate (1.3-1.2 Ga), and the Neoproterozoic glaciation event (approximately 1.0 Ga). We also proposed that the voluminous iron formations (approximately 1.88 Ga) might have induced the metabolic innovation of iron reduction. In summary, our study provides new insights into Dsr evolution and a systematic view of the coevolution of dissimilatory sulfur-metabolizing prokaryotes and the Earth's environment.
Affiliation(s)
- Yuxin Tao
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China; State Key Laboratory of Genetic Engineering, Center for Evolutionary Biology, Human Phenome Institute, Zhangjiang Fudan International Innovation Center, School of Life Science, Fudan University, Shanghai 200438, China
- Zichao Zeng
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
- Yuhui Deng
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
- Menghan Zhang
- Research Institute of Intelligent Complex Systems, Fudan University, Shanghai 200438, China
- Fengping Wang
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China; School of Oceanography, Shanghai Jiao Tong University, Shanghai 200030, China
- Yinzhao Wang
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China.
2
Li Y, Guo T, Sun L, Wang ET, Young JPW, Tian CF. Phylogenomic analyses and reclassification of the Mesorhizobium complex: proposal for 9 novel genera and reclassification of 15 species. BMC Genomics 2024; 25:419. [PMID: 38684951 PMCID: PMC11057113 DOI: 10.1186/s12864-024-10333-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/31/2024] [Accepted: 04/22/2024] [Indexed: 05/02/2024] Open
Abstract
BACKGROUND: The genus Mesorhizobium is shown by phylogenomics to be paraphyletic and forms part of a complex that includes the genera Aminobacter, Aquamicrobium, Pseudaminobacter and Tianweitania. The relationships of type strains belonging to these genera need to be carefully re-evaluated. RESULTS: The relationships within the Mesorhizobium complex are evaluated based on phylogenomic analyses and overall genome relatedness indices (OGRIs) of 61 type strains. According to the maximum likelihood phylogenetic tree based on concatenated sequences of 539 core proteins and the tree constructed using the bac120 bacterial marker set from Genome Taxonomy Database, 65 type strains were grouped into 9 clusters. Moreover, 10 subclusters were identified based on the OGRIs including average nucleotide identity (ANI), average amino acid identity (AAI) and core-proteome average amino acid identity (cAAI), with AAI and cAAI showing clear intra- and inter-(sub)cluster gaps of 77.40-80.91% and 83.98-86.16%, respectively. Combined with the phylogenetic trees and OGRIs, the type strains were reclassified into 15 genera. This list includes five defined genera Mesorhizobium, Aquamicrobium, Pseudaminobacter, Aminobacter and Tianweitania, among which 40/41 Mesorhizobium species and one Aminobacter species are canonical legume microsymbionts. The other nine (sub)clusters are classified as novel genera. Cluster III, comprising symbiotic M. alhagi and M. camelthorni, is classified as Allomesorhizobium gen. nov. Cluster VI harbored a single symbiotic species M. albiziae and is classified as Neomesorhizobium gen. nov. The remaining seven non-symbiotic members were proposed as: Neoaquamicrobium gen. nov., Manganibacter gen. nov., Ollibium gen. nov., Terribium gen. nov., Kumtagia gen. nov., Borborobacter gen. nov., Aerobium gen. nov. Furthermore, the genus Corticibacterium is restored and two species in Subcluster IX-1 are reclassified as members of this genus.
CONCLUSION: The Mesorhizobium complex is classified into 15 genera based on phylogenomic analyses and OGRIs of 65 type strains. This study resolved previously non-monophyletic genera in the Mesorhizobium complex.
Affiliation(s)
- Yan Li
- Yantai Key Laboratory of Characteristic Agricultural Biological Resources Conservation and Germplasm Innovation Utilization, Jiaodong Microbial Resource Center of Yantai University, College of Life Sciences, Yantai University, Yantai, 264005, Shandong, China.
- Tingyan Guo
- Yantai Key Laboratory of Characteristic Agricultural Biological Resources Conservation and Germplasm Innovation Utilization, Jiaodong Microbial Resource Center of Yantai University, College of Life Sciences, Yantai University, Yantai, 264005, Shandong, China
- Liqin Sun
- Yantai Key Laboratory of Characteristic Agricultural Biological Resources Conservation and Germplasm Innovation Utilization, Jiaodong Microbial Resource Center of Yantai University, College of Life Sciences, Yantai University, Yantai, 264005, Shandong, China
- En-Tao Wang
- Departamento de Microbiología, Escuela Nacional de Ciencias Biológicas, Instituto Politécnico Nacional, Ciudad de México, 11340, México
- J Peter W Young
- Department of Biology, University of York, York, YO10 5DD, UK
- Chang-Fu Tian
- State Key Laboratory of Plant Environmental Resilience, MOA Key Laboratory of Soil Microbiology, Rhizobium Research Center, College of Biological Sciences, China Agricultural University, Beijing, 100193, China.
3
Sheinman M, Arndt PF, Massip F. Modeling the mosaic structure of bacterial genomes to infer their evolutionary history. Proc Natl Acad Sci U S A 2024; 121:e2313367121. [PMID: 38517978 PMCID: PMC10990148 DOI: 10.1073/pnas.2313367121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/03/2023] [Accepted: 01/30/2024] [Indexed: 03/24/2024] Open
Abstract
The chronology and phylogeny of bacterial evolution are difficult to reconstruct due to a scarce fossil record. The analysis of bacterial genomes remains challenging because of large sequence divergence, the plasticity of bacterial genomes due to frequent gene loss, horizontal gene transfer, and differences in selective pressure from one locus to another. Therefore, taking advantage of the rich and rapidly accumulating genomic data requires accurate modeling of genome evolution. An important technical consideration is that loci with high effective mutation rates may diverge beyond the detection limit of the alignment algorithms used, biasing the genome-wide divergence estimates toward smaller divergences. In this article, we propose a novel method to gain insight into bacterial evolution based on statistical properties of genome comparisons. We find that the length distribution of sequence matches is shaped by the effective mutation rates of different loci, by the horizontal transfers, and by the aligner sensitivity. Based on these inputs, we build a model and show that it accounts for the empirically observed distributions, taking the Enterobacteriaceae family as an example. Our method makes it possible to distinguish segments of vertical and horizontal origins and to estimate the time divergence and exchange rate between any pair of taxa from genome-wide alignments. Based on the estimated time divergences, we construct a time-calibrated phylogenetic tree to demonstrate the accuracy of the method.
Affiliation(s)
- Michael Sheinman
- Institute for Advanced Studies, Sevastopol State University, Sevastopol 299053, Crimea
- Peter F. Arndt
- Department of Computational Molecular Biology, Max Planck Institute for Molecular Genetics, Berlin 12163, Germany
- Florian Massip
- Department U900, Centre for Computational Biology, Mines Paris, PSL University, Paris 75006, France
- Department U900, Institut Curie, Université Paris Sciences et Lettres, Paris 75005, France
- INSERM, U900, Paris 75005, France
4
Hellmuth M, Stadler PF. The Theory of Gene Family Histories. Methods Mol Biol 2024; 2802:1-32. [PMID: 38819554 DOI: 10.1007/978-1-0716-3838-5_1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/01/2024]
Abstract
Most genes are part of larger families of evolutionarily related genes. The history of gene families typically involves duplications and losses of genes as well as horizontal transfers into other organisms. The reconstruction of detailed gene family histories, i.e., the precise dating of evolutionary events relative to the phylogenetic tree of the underlying species, has remained a challenging topic despite their importance as a basis for detailed investigations into adaptation and functional evolution of individual members of the gene family. The identification of orthologs, moreover, is a particularly important subproblem of the more general setting considered here. In the last few years, an extensive body of mathematical results has appeared that tightly links orthology, a formal notion of best matches among genes, and horizontal gene transfer. The purpose of this chapter is to broadly outline some of the key mathematical insights and to discuss their implication for practical applications. In particular, we focus on tree-free methods, i.e., methods to infer orthology or horizontal gene transfer as well as gene trees, species trees, and reconciliations between them without using a priori knowledge of the underlying trees or statistical models for the inference of phylogenetic trees. Instead, the initial step aims to extract binary relations among genes.
Affiliation(s)
- Marc Hellmuth
- Department of Mathematics, Faculty of Science, Stockholm University, Stockholm, Sweden
- Peter F Stadler
- Bioinformatics Group, Department of Computer Science, Leipzig University, Leipzig, Germany.
- Interdisciplinary Center for Bioinformatics, Leipzig University, Leipzig, Germany.
- Max Planck Institute for Mathematics in the Sciences, Leipzig, Germany.
- Universidad Nacional de Colombia, Bogotá, Colombia.
- Institute for Theoretical Chemistry, University of Vienna, Wien, Austria.
- Center for non-coding RNA in Technology and Health, University of Copenhagen, Frederiksberg, Denmark.
- Santa Fe Institute, Santa Fe, NM, USA.
5
de la Haba RR, Arahal DR, Sánchez-Porro C, Chuvochina M, Wittouck S, Hugenholtz P, Ventosa A. A long-awaited taxogenomic investigation of the family Halomonadaceae. Front Microbiol 2023; 14:1293707. [PMID: 38045027 PMCID: PMC10690426 DOI: 10.3389/fmicb.2023.1293707] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/13/2023] [Accepted: 10/05/2023] [Indexed: 12/05/2023] Open
Abstract
The family Halomonadaceae is the largest family composed of halophilic bacteria, with more than 160 species with validly published names as of July 2023. Several classifications to circumscribe this family are available in major resources, such as those provided by the List of Prokaryotic names with Standing in Nomenclature (LPSN), NCBI Taxonomy, Genome Taxonomy Database (GTDB), and Bergey's Manual of Systematics of Archaea and Bacteria (BMSAB), with some degree of disagreement between them. Moreover, regardless of the classification adopted, the genus Halomonas is not phylogenetically consistent, likely because it has been used as a catch-all for newly described species within the family Halomonadaceae that could not be clearly accommodated in other Halomonadaceae genera. In the past decade, some taxonomic rearrangements have been conducted on the Halomonadaceae based on ribosomal and alternative single-copy housekeeping gene sequence analysis. High-throughput technologies have enabled access to the genome sequences of many type strains belonging to the family Halomonadaceae; however, genome-based studies specifically addressing its taxonomic status have not been performed to date. In this study, we accomplished the genome sequencing of 17 missing type strains of Halomonadaceae species that, together with other publicly available genome sequences, allowed us to re-evaluate the genetic relationship, phylogeny, and taxonomy of the species and genera within this family. The approach followed included the estimate of the Overall Genome Relatedness Indexes (OGRIs) such as the average amino acid identity (AAI), phylogenomic reconstructions using amino acid substitution matrices customized for the family Halomonadaceae, and the analysis of clade-specific signature genes. 
Based on our results, we conclude that the genus Halovibrio is obviously out of place within the family Halomonadaceae, and, on the other hand, we propose a division of the genus Halomonas into seven separate genera and the transfer of seven species from Halomonas to the genus Modicisalibacter, together with the emendation of the latter. Additionally, data from this study demonstrate the existence of various synonym species names in this family.
Affiliation(s)
- Rafael R. de la Haba
- Department of Microbiology and Parasitology, Faculty of Pharmacy, University of Sevilla, Sevilla, Spain
- David R. Arahal
- Departament of Microbiology and Ecology, University of Valencia, Valencia, Spain
- Cristina Sánchez-Porro
- Department of Microbiology and Parasitology, Faculty of Pharmacy, University of Sevilla, Sevilla, Spain
- Maria Chuvochina
- The University of Queensland, School of Chemistry and Molecular Biosciences, Australian Centre for Ecogenomics, St Lucia, QLD, Australia
- Stijn Wittouck
- Research Group Environmental Ecology and Applied Microbiology, Department of Bioscience Engineering, University of Antwerp, Antwerp, Belgium
- Philip Hugenholtz
- The University of Queensland, School of Chemistry and Molecular Biosciences, Australian Centre for Ecogenomics, St Lucia, QLD, Australia
- Antonio Ventosa
- Department of Microbiology and Parasitology, Faculty of Pharmacy, University of Sevilla, Sevilla, Spain
6
Kronmiller BA, Feau N, Shen D, Tabima JF, Ali SS, Armitage AD, Arredondo F, Bailey BA, Bollmann SR, Dale A, Harrison RJ, Hrywkiw K, Kasuga T, McDougal R, Nellist CF, Panda P, Tripathy S, Williams NM, Ye W, Wang Y, Hamelin RC, Grünwald NJ. Comparative Genomic Analysis of 31 Phytophthora Genomes Reveals Genome Plasticity and Horizontal Gene Transfer. Molecular Plant-Microbe Interactions: MPMI 2023; 36:26-46. [PMID: 36306437 DOI: 10.1094/mpmi-06-22-0133-r] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Indexed: 06/16/2023]
Abstract
Phytophthora species are oomycete plant pathogens that cause great economic and ecological impacts. The Phytophthora genus includes over 180 known species, infecting a wide range of plant hosts, including crops, trees, and ornamentals. We sequenced the genomes of 31 individual Phytophthora species and 24 individual transcriptomes to study genetic relationships across the genus. De novo genome assemblies revealed variation in genome sizes, numbers of predicted genes, and in repetitive element content across the Phytophthora genus. A genus-wide comparison evaluated orthologous groups of genes. Predicted effector gene counts varied across Phytophthora species by effector family, genome size, and plant host range. Predicted numbers of apoplastic effectors increased as the host range of Phytophthora species increased. Predicted numbers of cytoplasmic effectors also increased with host range but leveled off or decreased in Phytophthora species that have enormous host ranges. With extensive sequencing across the Phytophthora genus, we now have the genomic resources to evaluate horizontal gene transfer events across the oomycetes. Using a machine-learning approach to identify horizontally transferred genes with bacterial or fungal origin, we identified 44 candidates over 36 Phytophthora species genomes. Phylogenetic reconstruction indicates that the transfers of most of these 44 candidates happened in parallel to major advances in the evolution of the oomycetes and Phytophthora spp. We conclude that the 31 genomes presented here are essential for investigating genus-wide genomic associations in genus Phytophthora. Copyright © 2023 The Author(s). This is an open access article distributed under the CC BY-NC-ND 4.0 International license.
Affiliation(s)
- Brent A Kronmiller
- Center for Quantitative Life Sciences, Oregon State University, Corvallis, OR, U.S.A
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR, U.S.A
- Nicolas Feau
- Department of Forest and Conservation Sciences, The University of British Columbia, Vancouver, Canada
- Danyu Shen
- Department of Plant Pathology, Nanjing Agricultural University, Nanjing, China
- Javier F Tabima
- Department of Biology, Clark University, Worcester, MA, U.S.A
- Shahin S Ali
- Sustainable Perennial Crops Laboratory, Northeast Area, USDA/ARS, Beltsville Agricultural Research Center-West, Beltsville, MD, U.S.A
- Andrew D Armitage
- Natural Resources Institute, University of Greenwich, Chatham Maritime, U.K
- Felipe Arredondo
- Center for Quantitative Life Sciences, Oregon State University, Corvallis, OR, U.S.A
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR, U.S.A
- Bryan A Bailey
- Sustainable Perennial Crops Laboratory, Northeast Area, USDA/ARS, Beltsville Agricultural Research Center-West, Beltsville, MD, U.S.A
- Stephanie R Bollmann
- Department of Integrative Biology, Oregon State University, Corvallis, OR, U.S.A
- Angela Dale
- Department of Forest and Conservation Sciences, The University of British Columbia, Vancouver, Canada
- SC-New Construction Materials, FPInnovations, Vancouver, V6T 1Z4, Canada
- Kelly Hrywkiw
- Department of Forest and Conservation Sciences, The University of British Columbia, Vancouver, Canada
- Takao Kasuga
- Crops Pathology and Genetics Research Unit, Agricultural Research Service, United States Department of Agriculture, Davis, CA, U.S.A
- Rebecca McDougal
- Scion (New Zealand Forest Research Institute), 49 Sala Street, Te Papa Tipu Innovation Park, Private Bag 3020, Rotorua, New Zealand
- Preeti Panda
- The New Zealand Institute for Plant and Food Research Ltd, 74 Gerald Street, Lincoln, 7608, New Zealand
- Nari M Williams
- Scion (New Zealand Forest Research Institute), 49 Sala Street, Te Papa Tipu Innovation Park, Private Bag 3020, Rotorua, New Zealand
- Department of Pathogen Ecology and Control, Plant and Food Research, Private Bag 1401, Havelock North, New Zealand
- Wenwu Ye
- Department of Plant Pathology, Nanjing Agricultural University, Nanjing, China
- Yuanchao Wang
- Department of Plant Pathology, Nanjing Agricultural University, Nanjing, China
- Richard C Hamelin
- Department of Forest and Conservation Sciences, The University of British Columbia, Vancouver, Canada
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, Canada
- Département des sciences du bois et de la forêt, Faculté de Foresterie et Géographie, Université Laval, Québec, Canada
- Niklaus J Grünwald
- Horticultural Crop Research Unit, United States Department of Agriculture, Agricultural Research Service, Corvallis, OR, U.S.A
7
Karamycheva S, Wolf YI, Persi E, Koonin EV, Makarova KS. Analysis of lineage-specific protein family variability in prokaryotes combined with evolutionary reconstructions. Biol Direct 2022; 17:22. [PMID: 36042479 PMCID: PMC9425974 DOI: 10.1186/s13062-022-00337-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Received: 07/18/2022] [Accepted: 08/13/2022] [Indexed: 12/24/2022] Open
Abstract
Background: Evolutionary rate is a key characteristic of gene families that is linked to the functional importance of the respective genes as well as specific biological functions of the proteins they encode. Accurate estimation of evolutionary rates is a challenging task that requires precise phylogenetic analysis. Here we present an easy to estimate protein family level measure of sequence variability based on alignment column homogeneity in multiple alignments of protein sequences from Clade-Specific Clusters of Orthologous Genes (csCOGs). Results: We report genome-wide estimates of variability for 8 diverse groups of bacteria and archaea and investigate the connection between variability and various genomic and biological features. The variability estimates are based on homogeneity distributions across amino acid sequence alignments and can be obtained for multiple groups of genomes at minimal computational expense. About half of the variance in variability values can be explained by the analyzed features, with the greatest contribution coming from the extent of gene paralogy in the given csCOG. The correlation between variability and paralogy appears to originate, primarily, not from gene duplication, but from acquisition of distant paralogs and xenologs, introducing sequence variants that are more divergent than those that could have evolved in situ during the lifetime of the given group of organisms. Both high-variability and low-variability csCOGs were identified in all functional categories, but as expected, proteins encoded by integrated mobile elements as well as proteins involved in defense functions and cell motility are, on average, more variable than proteins with housekeeping functions.
Additionally, using linear discriminant analysis, we found that variability and fraction of genomes carrying a given gene are the two variables that provide the best prediction of gene essentiality as compared to the results of transposon mutagenesis in Sulfolobus islandicus. Conclusions: Variability, a measure of sequence diversity within an alignment relative to the overall diversity within a group of organisms, offers a convenient proxy for evolutionary rate estimates and is informative with respect to prediction of functional properties of proteins. In particular, variability is a strong predictor of gene essentiality for the respective organisms and indicative of sub- or neofunctionalization of paralogs. Supplementary Information: The online version contains supplementary material available at 10.1186/s13062-022-00337-7.
Affiliation(s)
- Svetlana Karamycheva
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, MD, 20894, USA
- Yuri I Wolf
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, MD, 20894, USA
- Erez Persi
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, MD, 20894, USA
- Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, MD, 20894, USA
- Kira S Makarova
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, MD, 20894, USA.
8
Comparative genomic analysis of hyper-ammonia producing Acetoanaerobium sticklandii DSM 519 with purinolytic Gottschalkia acidurici 9a and pathogenic Peptoclostridium difficile 630. Genomics 2021; 113:4196-4205. [PMID: 34780936 DOI: 10.1016/j.ygeno.2021.11.010] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 01/11/2019] [Revised: 02/18/2021] [Accepted: 11/10/2021] [Indexed: 10/19/2022]
Abstract
Acetoanaerobium sticklandii DSM 519 (CST) is a hyper-ammonia producing non-pathogenic anaerobe that can use amino acids as important carbon and energy sources through the Stickland reactions. Biochemical aspects of this organism have been extensively studied, but systematic studies addressing its metabolic discrepancy remain scant. In this perspective, we have intensively analyzed its genomic and metabolic characteristics to comprehend the evolutionary conservation of amino acid catabolism by a comparative genomic approach. The whole-genome data indicated that CST shows phylogenomic similarity to hyper-ammonia producing, purinolytic, and proteolytic pathogenic Clostridia. CST shares a common genomic context with the purinolytic Gottschalkia acidurici 9a and pathogenic Peptoclostridium difficile 630. Genome syntenic analysis indicated that syntenic orthologs might have originated from a recent ancestor at a slow evolutionary rate and that syntenic out-paralogs evolved from either CDF or CAC via α-event and β-event. Collinearity of either gene orders or gene families was adjusted with syntenic out-paralogs across these genomes. The genome-wide metabolic analysis predicted 11 unique putative metabolic subsystems from the CST genome for amino acid catabolism and hydrogen production. The in silico analysis of our study revealed that a characteristic system for amino acid catabolism-directed biofuel synthesis might have slowly evolved and become established as a core genomic content of CST.
9
Rangel LT, Soucy SM, Setubal JC, Gogarten JP, Fournier GP. An efficient, non-phylogenetic method for detecting genes sharing evolutionary signals in phylogenomic datasets. Genome Biol Evol 2021; 13:6352501. [PMID: 34390574 PMCID: PMC8483891 DOI: 10.1093/gbe/evab187] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 08/11/2021] [Indexed: 11/25/2022] Open
Abstract
Assessing the compatibility between gene family phylogenies is a crucial and often computationally demanding step in many phylogenomic analyses. Here, we describe the Evolutionary Similarity Index (IES), a means to assess shared evolution between gene families using a weighted orthogonal distance regression model applied to sequence distances. The utilization of pairwise distance matrices circumvents comparisons between gene tree topologies, which are inherently uncertain and sensitive to evolutionary model choice, phylogenetic reconstruction artifacts, and other sources of error. Furthermore, IES enables the many-to-many pairing of multiple copies between similarly evolving gene families. This is done by selecting non-overlapping pairs of copies, one from each assessed family, and yielding the least sum of squared residuals. Analyses of simulated gene family data sets show that IES’s accuracy is on par with popular tree-based methods while also less susceptible to noise introduced by sequence alignment and evolutionary model fitting. Applying IES to an empirical data set of 1,322 genes from 42 archaeal genomes identified eight major clusters of gene families with compatible evolutionary trends. The most cohesive cluster consisted of 62 genes with compatible evolutionary signal, which occur as both single-copy and multiple homologs per genome; phylogenetic analysis of concatenated alignments from this cluster produced a tree closely matching previously published species trees for Archaea. Four other clusters are mainly composed of accessory genes with limited distribution among Archaea and enriched toward specific metabolic functions. Pairwise evolutionary distances obtained from these accessory gene clusters suggest patterns of interphyla horizontal gene transfer. An IES implementation is available at https://github.com/lthiberiol/evolSimIndex.
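The idea of comparing two gene families through their pairwise sequence distances, rather than through tree topologies, can be illustrated with a small sketch. This is not the IES implementation from the linked repository: it fits an unweighted orthogonal (total least squares) line to matched pairwise distances, whereas the abstract describes a weighted orthogonal distance regression. The function names and the toy distance values are hypothetical, and the copy-pairing step for multi-copy families is omitted.

```python
import math


def orthogonal_regression(x, y):
    """Fit y = slope * x + intercept by minimizing perpendicular distances.

    x and y hold pairwise distances for the same genome pairs, measured in
    two different gene families (an assumed input layout).
    """
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    if sxy == 0:
        raise ValueError("distances are uncorrelated; slope is undefined")
    # Closed-form orthogonal regression slope (equal error variance on both axes).
    slope = (syy - sxx + math.sqrt((syy - sxx) ** 2 + 4 * sxy ** 2)) / (2 * sxy)
    intercept = mean_y - slope * mean_x
    return slope, intercept


def sum_squared_orthogonal_residuals(x, y, slope, intercept):
    """Sum of squared perpendicular distances from the points to the line."""
    return sum((b - slope * a - intercept) ** 2 for a, b in zip(x, y)) / (1 + slope ** 2)


# Toy example: family B evolves about twice as fast as family A.
family_a = [0.10, 0.20, 0.30, 0.40]
family_b = [0.25, 0.45, 0.65, 0.85]
slope, intercept = orthogonal_regression(family_a, family_b)
residual = sum_squared_orthogonal_residuals(family_a, family_b, slope, intercept)
```

In this simplified view, a small residual sum suggests that the two families share an evolutionary signal even when their absolute rates differ, and a large one suggests incompatible histories.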
Affiliation(s)
- Luiz Thibério Rangel
- Department of Earth, Atmospheric and Planetary Sciences, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
- Shannon M Soucy
- Department of Biomedical Data Science, Geisel School of Medicine, Dartmouth College, Hanover, New Hampshire, USA
- João C Setubal
- Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, Brasil
- Johann Peter Gogarten
- Department of Molecular and Cell Biology, University of Connecticut, USA
- Institute for Systems Genomics, University of Connecticut, USA
- Gregory P Fournier
- Department of Earth, Atmospheric and Planetary Sciences, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
10
Abstract
The ancestors of cyanobacteria generated Earth's first biogenic molecular oxygen, but how they dealt with oxidative stress remains unconstrained. Here we investigate when superoxide dismutase enzymes (SODs) capable of removing superoxide free radicals evolved and estimate when Cyanobacteria originated. Our Bayesian molecular clocks, calibrated with microfossils, predict that stem Cyanobacteria arose 3300-3600 million years ago. Shortly afterwards, we find phylogenetic evidence that ancestral cyanobacteria used SODs with copper and zinc cofactors (CuZnSOD) during the Archaean. By the Paleoproterozoic, they became genetically capable of using iron, nickel, and manganese as cofactors (FeSOD, NiSOD, and MnSOD respectively). The evolution of NiSOD is particularly intriguing because it corresponds with cyanobacteria's invasion of the open ocean. Our analyses of metalloenzymes dealing with reactive oxygen species (ROS) now demonstrate that marine geochemical records alone may not predict patterns of metal usage by phototrophs from freshwater and terrestrial habitats.
|
11
|
Indirect identification of horizontal gene transfer. J Math Biol 2021; 83:10. [PMID: 34218334 PMCID: PMC8254804 DOI: 10.1007/s00285-021-01631-0] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Received: 12/16/2020] [Revised: 04/06/2021] [Accepted: 06/13/2021] [Indexed: 12/04/2022]
Abstract
Several implicit methods to infer horizontal gene transfer (HGT) focus on pairs of genes that have diverged only after the divergence of the two species in which the genes reside. This situation defines the edge set of a graph, the later-divergence-time (LDT) graph, whose vertices correspond to genes colored by their species. We investigate these graphs in the setting of relaxed scenarios, i.e., evolutionary scenarios that encompass all commonly used variants of duplication-transfer-loss scenarios in the literature. We characterize LDT graphs as a subclass of properly vertex-colored cographs, and provide a polynomial-time recognition algorithm as well as an algorithm to construct a relaxed scenario that explains a given LDT. An edge in an LDT graph implies that the two corresponding genes are separated by at least one HGT event. The converse is not true, however. We show that the complete xenology relation is described by an rs-Fitch graph, i.e., a complete multipartite graph satisfying constraints on the vertex coloring. This class of vertex-colored graphs is also recognizable in polynomial time. We finally address the question “how much information about all HGT events is contained in LDT graphs” with the help of simulations of evolutionary scenarios with a wide range of duplication, loss, and HGT events. In particular, we show that a simple greedy graph editing scheme can be used to efficiently detect HGT events that are implicitly contained in LDT graphs.
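The abstract states that LDT graphs form a subclass of properly vertex-colored cographs, so both properties are necessary conditions that a candidate graph must pass. The following is a minimal brute-force sketch of those two checks only, not the paper's polynomial-time recognition algorithm; it relies on the standard fact that cographs are exactly the graphs with no induced path on four vertices. Gene and species names are hypothetical.

```python
from itertools import combinations


def is_properly_colored(edges, color):
    """True if no edge joins two genes of the same species (color)."""
    return all(color[u] != color[v] for u, v in edges)


def is_cograph(vertices, edges):
    """True if the graph contains no induced path on four vertices (P4)."""
    edge_set = {frozenset(edge) for edge in edges}
    for quad in combinations(vertices, 4):
        degree = {v: 0 for v in quad}
        edge_count = 0
        for u, v in combinations(quad, 2):
            if frozenset((u, v)) in edge_set:
                degree[u] += 1
                degree[v] += 1
                edge_count += 1
        # Among 4-vertex graphs with 3 edges, only P4 has degrees 1, 1, 2, 2.
        if edge_count == 3 and sorted(degree.values()) == [1, 1, 2, 2]:
            return False
    return True


genes = ["a1", "b1", "c1", "d1"]
species = {"a1": "A", "b1": "B", "c1": "C", "d1": "D"}
path_edges = [("a1", "b1"), ("b1", "c1"), ("c1", "d1")]  # an induced P4
star_edges = [("a1", "b1"), ("a1", "c1"), ("a1", "d1")]  # P4-free

path_is_candidate = is_properly_colored(path_edges, species) and is_cograph(genes, path_edges)
star_is_candidate = is_properly_colored(star_edges, species) and is_cograph(genes, star_edges)
```

Passing both checks does not establish that a graph is an LDT graph, since the abstract describes LDT graphs as a subclass; the brute-force scan is also only practical for small graphs.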
|
12
|
Sheinman M, Arkhipova K, Arndt PF, Dutilh BE, Hermsen R, Massip F. Identical sequences found in distant genomes reveal frequent horizontal transfer across the bacterial domain. eLife 2021; 10:62719. [PMID: 34121661 PMCID: PMC8270642 DOI: 10.7554/elife.62719] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Received: 09/03/2020] [Accepted: 06/13/2021] [Indexed: 12/19/2022]
Abstract
Horizontal gene transfer (HGT) is an essential force in microbial evolution. Despite detailed studies on a variety of systems, a global picture of HGT in the microbial world is still missing. Here, we exploit that HGT creates long identical DNA sequences in the genomes of distant species, which can be found efficiently using alignment-free methods. Our pairwise analysis of 93,481 bacterial genomes identified 138,273 HGT events. We developed a model to explain their statistical properties as well as estimate the transfer rate between pairs of taxa. This reveals that long-distance HGT is frequent: our results indicate that HGT between species from different phyla has occurred in at least 8% of the species. Finally, our results confirm that the function of sequences strongly impacts their transfer rate, which varies by more than three orders of magnitude between different functional categories. Overall, we provide a comprehensive view of HGT, illuminating a fundamental process driving bacterial evolution.
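The signal exploited in this abstract, a long stretch of identical DNA shared by two otherwise distant genomes, can be illustrated with a naive longest-exact-match search. This is a sketch under stated assumptions: it uses simple dynamic programming rather than the efficient alignment-free method the authors refer to, and the function name and toy sequences are hypothetical.

```python
def longest_exact_match(seq_a, seq_b):
    """Return the longest substring that occurs identically in both sequences.

    Naive O(len(seq_a) * len(seq_b)) dynamic programming; genome-scale
    data would need an indexed approach instead.
    """
    best_len = 0
    best_end = 0
    prev = [0] * (len(seq_b) + 1)
    for i in range(1, len(seq_a) + 1):
        curr = [0] * (len(seq_b) + 1)
        for j in range(1, len(seq_b) + 1):
            if seq_a[i - 1] == seq_b[j - 1]:
                # Extend the exact match that ended at the previous positions.
                curr[j] = prev[j - 1] + 1
                if curr[j] > best_len:
                    best_len = curr[j]
                    best_end = i
        prev = curr
    return seq_a[best_end - best_len:best_end]


genome_a = "TTGACCGTAGGCTAACG"
genome_b = "CCATGACCGTAGGCAT"
shared = longest_exact_match(genome_a, genome_b)
```

Between distant species, an exact match much longer than expected by chance is the kind of observation the abstract interprets as evidence of transfer; what counts as long enough depends on a statistical model that this sketch does not include.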
Affiliation(s)
- Michael Sheinman
  - Theoretical Biology and Bioinformatics, Biology Department, Utrecht University, Utrecht, Netherlands
  - Division of Molecular Carcinogenesis, the Netherlands Cancer Institute, Amsterdam, Netherlands
- Ksenia Arkhipova
  - Theoretical Biology and Bioinformatics, Biology Department, Utrecht University, Utrecht, Netherlands
- Peter F Arndt
  - Max Planck Institute for Molecular Genetics, Berlin, Germany
- Bas E Dutilh
  - Theoretical Biology and Bioinformatics, Biology Department, Utrecht University, Utrecht, Netherlands
- Rutger Hermsen
  - Theoretical Biology and Bioinformatics, Biology Department, Utrecht University, Utrecht, Netherlands
- Florian Massip
  - Berlin Institute for Medical Systems Biology, Max Delbrück Center, Berlin, Germany
  - Université de Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, Villeurbanne, France
|
13
|
Outer membrane vesicles mediated horizontal transfer of an aerobic denitrification gene between Escherichia coli. Biodegradation 2021; 32:435-448. [PMID: 33886019 DOI: 10.1007/s10532-021-09945-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 12/25/2020] [Accepted: 04/12/2021] [Indexed: 10/21/2022]
Abstract
Bacterial genetic material can be horizontally transferred between microorganisms via outer membrane vesicles (OMVs) released by bacteria. Until now, the application of vesicle-mediated horizontal transfer of "degrading genes" in environmental remediation has not been reported. In this study, the nirS gene from an aerobic denitrifying bacterium, Pseudomonas stutzeri, was inserted into a pET28a plasmid, transformed into Escherichia coli (E. coli) DH5α and expressed in E. coli BL21. The E. coli DH5α released OMVs containing the recombinant plasmid pET28a-nirS-EGFP. Whereas the free pET28a-nirS-EGFP plasmid was unable to transform recipient cells, nirS in OMVs could be transferred into E. coli BL21 with a transformation frequency of 2.76 × 10⁶ CFU/g at an OMV dosage of 200 µg under natural conditions, and nirS was expressed successfully in the recipient bacteria. Furthermore, the recipient bacteria that received OMVs containing pET28a-nirS-EGFP produced a nitrite reductase activity of 18.16 U/mL.
|
14
|
Brankovics B, van Diepeningen AD, de Hoog GS, van der Lee TAJ, Waalwijk C. Detecting Introgression Between Members of the Fusarium fujikuroi and F. oxysporum Species Complexes by Comparative Mitogenomics. Front Microbiol 2020; 11:1092. [PMID: 32582074 PMCID: PMC7285627 DOI: 10.3389/fmicb.2020.01092] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Received: 01/30/2020] [Accepted: 04/30/2020] [Indexed: 11/18/2022]
Abstract
The Fusarium fujikuroi species complex (FFSC) and F. oxysporum species complex (FOSC) are two related groups of plant pathogens causing a wide diversity of diseases in agricultural crops worldwide. The aims of this study are (1) to clarify the phylogeny of the FFSC, (2) to identify potential deviation from tree-like evolution, (3) to explore the value of using mitogenomes for these kinds of analyses, and (4) to better understand mitogenome evolution. In total, we sequenced 24 species from the FFSC and chose a representative set of recently analyzed FOSC strains, while F. redolens was used as the outgroup for the two species complexes. A species tree was constructed based on the concatenated alignment of seven nuclear genes and the mitogenome, and was contrasted with individual gene trees to identify potential conflicts. These comparisons indicated conflicts especially within the previously described African clade of the FFSC. Furthermore, the analysis of the mitogenomes revealed the presence of a variant of the large variable (LV) region in FFSC which was previously only reported for FOSC. The distribution of this variant and the results of sequence comparisons indicate horizontal genetic transfer between members of the two species complexes, most probably through introgression. In addition, a duplication of atp9 was found inside an intron of cob, which suggests that even highly conserved mitochondrial genes can have paralogs. Paralogization in turn may lead to inaccurate single gene phylogenies. In conclusion, mitochondrial genomes provide a robust basis for phylogeny. Comparative phylogenetic analysis indicated that gene flow among and between members of FFSC and FOSC has played an important role in the evolutionary history of these two groups. Since mitogenomes show greater levels of conservation and synteny than nuclear regions, they are more likely to be compatible for recombination than nuclear regions. Therefore, mitogenomes can be used as indicators to detect interspecies gene flow.
Affiliation(s)
- Balázs Brankovics
  - B.U. Biointeractions and Plant Health, Wageningen Plant Research, Wageningen University & Research, Wageningen, Netherlands
- Anne D van Diepeningen
  - B.U. Biointeractions and Plant Health, Wageningen Plant Research, Wageningen University & Research, Wageningen, Netherlands
- G Sybren de Hoog
  - Westerdijk Fungal Biodiversity Institute, KNAW, Utrecht, Netherlands
  - Center of Expertise in Mycology, Radboud University Medical Center, Nijmegen, Netherlands
- Theo A J van der Lee
  - B.U. Biointeractions and Plant Health, Wageningen Plant Research, Wageningen University & Research, Wageningen, Netherlands
- Cees Waalwijk
  - B.U. Biointeractions and Plant Health, Wageningen Plant Research, Wageningen University & Research, Wageningen, Netherlands
|
15
|
Zheng J, Wittouck S, Salvetti E, Franz CMAP, Harris HMB, Mattarelli P, O'Toole PW, Pot B, Vandamme P, Walter J, Watanabe K, Wuyts S, Felis GE, Gänzle MG, Lebeer S. A taxonomic note on the genus Lactobacillus: Description of 23 novel genera, emended description of the genus Lactobacillus Beijerinck 1901, and union of Lactobacillaceae and Leuconostocaceae. Int J Syst Evol Microbiol 2020; 70:2782-2858. [PMID: 32293557 DOI: 10.1099/ijsem.0.004107] [Citation(s) in RCA: 1566] [Impact Index Per Article: 391.5] [Indexed: 01/10/2023]
Abstract
The genus Lactobacillus comprises 261 species (at March 2020) that are extremely diverse at phenotypic, ecological and genotypic levels. This study evaluated the taxonomy of Lactobacillaceae and Leuconostocaceae on the basis of whole genome sequences. Parameters that were evaluated included core genome phylogeny, (conserved) pairwise average amino acid identity, clade-specific signature genes, physiological criteria and the ecology of the organisms. Based on this polyphasic approach, we propose reclassification of the genus Lactobacillus into 25 genera including the emended genus Lactobacillus, which includes host-adapted organisms that have been referred to as the Lactobacillus delbrueckii group, Paralactobacillus and 23 novel genera for which the names Holzapfelia, Amylolactobacillus, Bombilactobacillus, Companilactobacillus, Lapidilactobacillus, Agrilactobacillus, Schleiferilactobacillus, Loigolactobacillus, Lacticaseibacillus, Latilactobacillus, Dellaglioa, Liquorilactobacillus, Ligilactobacillus, Lactiplantibacillus, Furfurilactobacillus, Paucilactobacillus, Limosilactobacillus, Fructilactobacillus, Acetilactobacillus, Apilactobacillus, Levilactobacillus, Secundilactobacillus and Lentilactobacillus are proposed. We also propose to emend the description of the family Lactobacillaceae to include all genera that were previously included in families Lactobacillaceae and Leuconostocaceae. The generic term 'lactobacilli' will remain useful to designate all organisms that were classified as Lactobacillaceae until 2020. This reclassification reflects the phylogenetic position of the micro-organisms, and groups lactobacilli into robust clades with shared ecological and metabolic properties, as exemplified for the emended genus Lactobacillus encompassing species adapted to vertebrates (such as Lactobacillus delbrueckii, Lactobacillus iners, Lactobacillus crispatus, Lactobacillus jensensii, Lactobacillus johnsonii and Lactobacillus acidophilus) or invertebrates (such as Lactobacillus apis and Lactobacillus bombicola).
Affiliation(s)
- Jinshui Zheng
  - Huazhong Agricultural University, State Key Laboratory of Agricultural Microbiology, Hubei Key Laboratory of Agricultural Bioinformatics, Wuhan, Hubei, PR China
- Stijn Wittouck
  - Research Group Environmental Ecology and Applied Microbiology, Department of Bioscience Engineering, University of Antwerp, Antwerp, Belgium
- Elisa Salvetti
  - Dept. of Biotechnology, University of Verona, Verona, Italy
- Charles M A P Franz
  - Max Rubner-Institut, Department of Microbiology and Biotechnology, Kiel, Germany
- Hugh M B Harris
  - School of Microbiology & APC Microbiome Ireland, University College Cork, Co. Cork, Ireland
- Paola Mattarelli
  - University of Bologna, Dept. of Agricultural and Food Sciences, Bologna, Italy
- Paul W O'Toole
  - School of Microbiology & APC Microbiome Ireland, University College Cork, Co. Cork, Ireland
- Bruno Pot
  - Research Group of Industrial Microbiology and Food Biotechnology (IMDO), Vrije Universiteit Brussel, Brussels, Belgium
- Peter Vandamme
  - Laboratory of Microbiology, Department of Biochemistry and Microbiology, Ghent University, Ghent, Belgium
- Jens Walter
  - Department of Biological Sciences, University of Alberta, Edmonton, Canada
  - Department of Agricultural, Food & Nutritional Science, University of Alberta, Edmonton, Canada
- Koichi Watanabe
  - Food Industry Research and Development Institute, Bioresource Collection and Research Center, Hsinchu, Taiwan, ROC
  - National Taiwan University, Dept. of Animal Science and Technology, Taipei, Taiwan, ROC
- Sander Wuyts
  - Research Group Environmental Ecology and Applied Microbiology, Department of Bioscience Engineering, University of Antwerp, Antwerp, Belgium
- Michael G Gänzle
  - Hubei University of Technology, College of Bioengineering and Food Science, Wuhan, Hubei, PR China
  - Department of Agricultural, Food & Nutritional Science, University of Alberta, Edmonton, Canada
- Sarah Lebeer
  - Research Group Environmental Ecology and Applied Microbiology, Department of Bioscience Engineering, University of Antwerp, Antwerp, Belgium
|
16
|
Evolution of Predicted Acid Resistance Mechanisms in the Extremely Acidophilic Leptospirillum Genus. Genes (Basel) 2020; 11:genes11040389. [PMID: 32260256 PMCID: PMC7231039 DOI: 10.3390/genes11040389] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Received: 12/31/2019] [Revised: 03/02/2020] [Accepted: 03/04/2020] [Indexed: 02/01/2023]
Abstract
Organisms that thrive in extremely acidic environments (≤pH 3.5) are of widespread importance in industrial applications, environmental issues, and evolutionary studies. Leptospirillum spp. constitute the only extremely acidophilic microbes in the phylogenetically deep-rooted bacterial phylum Nitrospirae. Leptospirilli are Gram-negative, obligately chemolithoautotrophic, aerobic, ferrous iron oxidizers. This paper predicts genes that Leptospirilli use to survive at low pH and infers their evolutionary trajectory. Phylogenetic and other bioinformatic approaches suggest that these genes can be classified into (i) a "first line of defense", involved in preventing the entry of protons into the cell, and (ii) a "second line of defense", involved in the neutralization or expulsion of protons that enter the cell. The first line of defense includes potassium transporters, predicted to form an inside-positive membrane potential, spermidines, hopanoids, and Slps (starvation-inducible outer membrane proteins). The second line of defense includes proton pumps and enzymes that consume protons. Maximum parsimony, clustering methods, and gene alignments are used to infer the evolutionary trajectory that potentially enabled the ancestral Leptospirillum to transition from a postulated circum-neutral pH environment to an extremely acidic one. The hypothesized trajectory includes gene gain/loss events driven extensively by horizontal gene transfer, gene duplications, gene mutations, and genomic rearrangements.
|
17
|
Stimson J, Gardy J, Mathema B, Crudu V, Cohen T, Colijn C. Beyond the SNP Threshold: Identifying Outbreak Clusters Using Inferred Transmissions. Mol Biol Evol 2019; 36:587-603. [PMID: 30690464 DOI: 10.1093/molbev/msy242] [Citation(s) in RCA: 83] [Impact Index Per Article: 16.6] [Indexed: 01/12/2023]
Abstract
Whole-genome sequencing (WGS) is increasingly used to aid the understanding of pathogen transmission. A first step in analyzing WGS data is usually to define "transmission clusters," sets of cases that are potentially linked by direct transmission. This is often done by including two cases in the same cluster if they are separated by fewer single-nucleotide polymorphisms (SNPs) than a specified threshold. However, there is little agreement as to what an appropriate threshold should be. We propose a probabilistic alternative, suggesting that the key inferential target for transmission clusters is the number of transmissions separating cases. We characterize this by combining the number of SNP differences and the length of time over which those differences have accumulated, using information about case timing, molecular clock, and transmission processes. Our framework has the advantage of allowing for variable mutation rates across the genome and can incorporate other epidemiological data. We use two tuberculosis studies to illustrate the impact of our approach: with British Columbia data by using spatial divisions; with Republic of Moldova data by incorporating antibiotic resistance. Simulation results indicate that our transmission-based method is better in identifying direct transmissions than a SNP threshold, with dissimilarity between clusterings of on average 0.27 bits compared with 0.37 bits for the SNP-threshold method and 0.84 bits for randomly permuted data. These results show that it is likely to outperform the SNP-threshold method where clock rates are variable and sample collection times are spread out. We implement the method in the R package transcluster.
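The baseline that this abstract argues against, placing two cases in the same cluster when fewer SNPs than a threshold separate them, can be sketched as single-linkage clustering over a pairwise SNP distance matrix. This illustrates the SNP-threshold rule only, not the probabilistic method implemented in transcluster; treating the rule as transitive (single linkage), the function name, and the toy matrix are assumptions.

```python
def snp_threshold_clusters(snp_distance, threshold):
    """Group cases linked by chains of pairs with fewer than `threshold` SNPs."""
    n = len(snp_distance)
    parent = list(range(n))

    def find(i):
        # Union-find root lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i in range(n):
        for j in range(i + 1, n):
            if snp_distance[i][j] < threshold:
                parent[find(i)] = find(j)

    clusters = {}
    for i in range(n):
        clusters.setdefault(find(i), []).append(i)
    return sorted(clusters.values())


# Toy pairwise SNP distances between four cases.
snp_distance = [
    [0, 2, 9, 30],
    [2, 0, 4, 28],
    [9, 4, 0, 31],
    [30, 28, 31, 0],
]
clusters_at_5 = snp_threshold_clusters(snp_distance, threshold=5)
clusters_at_3 = snp_threshold_clusters(snp_distance, threshold=3)
```

Cases 0 and 2 differ by 9 SNPs yet share a cluster at a threshold of 5 because both are close to case 1, and the grouping changes when the threshold moves to 3, which illustrates why the choice of threshold is contentious.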
Affiliation(s)
- James Stimson
  - Department of Mathematics, Imperial College London, London, UK
- Jennifer Gardy
  - British Columbia Centre for Disease Control, Communicable Disease Prevention and Control Services, Vancouver, Canada
  - School of Population and Public Health, University of British Columbia, Vancouver, Canada
- Barun Mathema
  - Department of Epidemiology, Columbia University Mailman School of Public Health, New York, USA
- Valeriu Crudu
  - Phthisiopneumology Institute, Chisinau, Republic of Moldova
- Ted Cohen
  - Yale University School of Public Health, New Haven
- Caroline Colijn
  - Department of Mathematics, Imperial College London, London, UK
  - Department of Mathematics, Simon Fraser University, Vancouver, Canada
|
18
|
Petitjean C, Makarova KS, Wolf YI, Koonin EV. Extreme Deviations from Expected Evolutionary Rates in Archaeal Protein Families. Genome Biol Evol 2018; 9:2791-2811. [PMID: 28985292 PMCID: PMC5737733 DOI: 10.1093/gbe/evx189] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Accepted: 09/12/2017] [Indexed: 02/07/2023]
Abstract
Origin of new biological functions is a complex phenomenon ranging from single-nucleotide substitutions to the gain of new genes via horizontal gene transfer or duplication. Neofunctionalization and subfunctionalization of proteins is often attributed to the emergence of paralogs that are subject to relaxed purifying selection or positive selection and thus evolve at accelerated rates. Such phenomena potentially could be detected as anomalies in the phylogenies of the respective gene families. We developed a computational pipeline to search for such anomalies in 1,834 orthologous clusters of archaeal genes, focusing on lineage-specific subfamilies that significantly deviate from the expected rate of evolution. Multiple potential cases of neofunctionalization and subfunctionalization were identified, including some ancient, house-keeping gene families, such as ribosomal protein S10, general transcription factor TFIIB and chaperone Hsp20. As expected, many cases of apparent acceleration of evolution are associated with lineage-specific gene duplication. On other occasions, long branches in phylogenetic trees correspond to horizontal gene transfer across long evolutionary distances. Significant deceleration of evolution is less common than acceleration, and the underlying causes are not well understood; functional shifts accompanied by increased constraints could be involved. Many gene families appear to be “highly evolvable,” that is, include both long and short branches. Even in the absence of precise functional predictions, this approach allows one to select targets for experimentation in search of new biology.
Affiliation(s)
- Celine Petitjean
  - National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland
- Kira S Makarova
  - National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland
- Yuri I Wolf
  - National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland
- Eugene V Koonin
  - National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland
|
19
|
Reconstructing gene trees from Fitch's xenology relation. J Math Biol 2018; 77:1459-1491. [PMID: 29951855 DOI: 10.1007/s00285-018-1260-8] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Received: 11/06/2017] [Revised: 06/20/2018] [Indexed: 10/28/2022]
Abstract
Two genes are xenologs in the sense of Fitch if they are separated by at least one horizontal gene transfer event. Horizontal gene transfer is asymmetric in the sense that the transferred copy is distinguished from the one that remains within the ancestral lineage. Hence xenology is more precisely thought of as a non-symmetric relation: y is xenologous to x if y has been horizontally transferred at least once since it diverged from the least common ancestor of x and y. We show that xenology relations are characterized by a small set of forbidden induced subgraphs on three vertices. Furthermore, each xenology relation can be derived from a unique least-resolved edge-labeled phylogenetic tree. We provide a linear-time algorithm for the recognition of xenology relations and for the construction of the corresponding least-resolved edge-labeled phylogenetic tree. Finally, the fact that being a xenology relation is a heritable graph property has far-reaching consequences for approximation problems associated with xenology relations.
|
20
|
Kurokawa M, Seno S, Matsuda H, Ying BW. Correlation between genome reduction and bacterial growth. DNA Res 2016; 23:517-525. [PMID: 27374613 PMCID: PMC5144675 DOI: 10.1093/dnares/dsw035] [Citation(s) in RCA: 45] [Impact Index Per Article: 5.6] [Received: 04/19/2016] [Accepted: 06/08/2016] [Indexed: 11/13/2022]
Abstract
Genome reduction by removing dispensable genomic sequences in bacteria is commonly used in both fundamental and applied studies to determine the minimal genetic requirements for a living system or to develop highly efficient bioreactors. Nevertheless, whether and how the accumulative loss of dispensable genomic sequences disturbs bacterial growth remains unclear. To investigate the relationship between genome reduction and growth, a series of Escherichia coli strains carrying genomes reduced in a stepwise manner were used. Intensive growth analyses revealed that the accumulation of multiple genomic deletions caused decreases in the exponential growth rate and the saturated cell density in a deletion-length-dependent manner as well as gradual changes in the patterns of growth dynamics, regardless of the growth media. Accordingly, a perspective growth model linking genome evolution to genome engineering was proposed. This study provides the first demonstration of a quantitative connection between genomic sequence and bacterial growth, indicating that growth rate is potentially associated with dispensable genomic sequences.
Affiliation(s)
- Masaomi Kurokawa
  - Graduate School of Life and Environmental Sciences, University of Tsukuba, Ibaraki 305-8572, Japan
- Shigeto Seno
  - Graduate School of Information Science and Technology, Osaka University, Osaka 565-0871, Japan
- Hideo Matsuda
  - Graduate School of Information Science and Technology, Osaka University, Osaka 565-0871, Japan
- Bei-Wen Ying
  - Graduate School of Life and Environmental Sciences, University of Tsukuba, Ibaraki 305-8572, Japan
|
21
|
Abstract
Campylobacter jejuni is among the most frequent agents of foodborne gastroenteritis in the world, but its physiology and pathogenesis are less well understood than those of other bacterial enteric pathogens. This is due in part to the incompatibility of the molecular tools that have enabled advances in the characterization of other bacterial species. Most notably, the dearth of plasmid-based complementation, reporter assays, and plasmid-based unmarked mutagenesis procedures in many of the type strains has hindered research progress. The obstacle is not that the techniques themselves are inadequate in Campylobacter species, but rather the barrier to transferring these genetic constructs from non-Campylobacter cloning strains such as Escherichia coli. Here, we review the modes of genetic transfer in C. jejuni and the current state of research into the mechanism of each. Also reviewed are two systems (CRISPR-Cas and restriction modification) that are common to many strains of C. jejuni and are at least partly responsible for these barriers.
|
22
|
Bevilacqua A, Mischitelli M, Pietropaolo V, Ciuffreda E, Sinigaglia M, Corbo MR. Genotypic and Phenotypic Heterogeneity in Alicyclobacillus acidoterrestris: A Contribution to Species Characterization. PLoS One 2015; 10:e0141228. [PMID: 26484547 PMCID: PMC4618475 DOI: 10.1371/journal.pone.0141228] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Received: 07/18/2015] [Accepted: 10/05/2015] [Indexed: 01/02/2023]
Abstract
Alicyclobacillus acidoterrestris is the main cause of most spoilage problems in fruit juices and acidic products. Since soil-borne species often contaminate fruit juices and do not have strict requirements for survival, it is important to investigate whether and how soil species could move from their ecological niches in the microbial community to new environments such as fruit juices. In this study, 23 isolates of thermo-acidophilic, spore-forming bacteria from soil were characterized by cultural and molecular methods. In addition, 2 strains isolated from a spoilage incident in pear juice were typed. Phenotyping showed that the strains could be grouped into 3 different clusters, and some isolates showed identical or very similar patterns. With regard to the pH and temperature ranges for growth, the majority of strains were able to grow at values described for many species of Alicyclobacillus. Qualitative tests of lysine and arginine utilization and of indole production from tryptophan revealed, for the first time, deamination of lysine and decarboxylation of arginine. Resistance to 5% NaCl, the ability to hydrolyze starch and gelatin, nitrate reduction, and catalase and oxidase activities were consistent with evidence from the literature. Examination of 16S rRNA showed that the isolates were divided into three blocks, represented by true soil species and by strains that are moving from soil to other possible growth environments characterized by parameters that could strongly influence bacterial survival. The RAPD PCR technique revealed great variability in banding patterns and, although it was not possible to obtain genotypically well-distinguished groups, genetic similarity between some strains could be recognized. In conclusion, the investigation of a microbial community entails a combination of metagenomic and classic culture-dependent approaches to expand our knowledge about Alicyclobacillus and to look for new subspecies.
Affiliation(s)
- Antonio Bevilacqua
  - Department of the Science of Agriculture, Food and Environment (SAFE), University of Foggia, Via Napoli, 25, 71122, Foggia, Italy
- Monica Mischitelli
  - Department of Public Health and Infectious Diseases, Sapienza University of Rome, P.le Aldo Moro, 5, 00185, Rome, Italy
- Valeria Pietropaolo
  - Department of Public Health and Infectious Diseases, Sapienza University of Rome, P.le Aldo Moro, 5, 00185, Rome, Italy
- Emanuela Ciuffreda
  - Department of the Science of Agriculture, Food and Environment (SAFE), University of Foggia, Via Napoli, 25, 71122, Foggia, Italy
- Milena Sinigaglia
  - Department of the Science of Agriculture, Food and Environment (SAFE), University of Foggia, Via Napoli, 25, 71122, Foggia, Italy
- Maria Rosaria Corbo
  - Department of the Science of Agriculture, Food and Environment (SAFE), University of Foggia, Via Napoli, 25, 71122, Foggia, Italy
|
23
|
Garushyants SK, Kazanov MD, Gelfand MS. Horizontal gene transfer and genome evolution in Methanosarcina. BMC Evol Biol 2015; 15:102. [PMID: 26044078 PMCID: PMC4455057 DOI: 10.1186/s12862-015-0393-2] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Received: 03/28/2014] [Accepted: 05/29/2015] [Indexed: 12/29/2022]
Abstract
Background: Genomes of Methanosarcina spp. are among the largest archaeal genomes. One suggested reason for that is massive horizontal gene transfer (HGT) from bacteria. Genes of bacterial origin may be involved in the central metabolism and solute transport, in particular sugar synthesis, sulfur metabolism, phosphate metabolism, DNA repair, transport of small molecules, etc. Horizontally transferred (HT) genes are considered to play the key role in the ability of Methanosarcina spp. to inhabit diverse environments. At the moment, genomes of three Methanosarcina spp. have been sequenced, and while these genomes vary in length and number of protein-coding genes, they all have been shown to accumulate HT genes. However, previous estimates had been made when fewer archaeal genomes were known. Moreover, several Methanosarcinaceae genomes from other genera have been sequenced recently. Here, we revise the census of genes of bacterial origin in Methanosarcinaceae.
Results: About 5% of Methanosarcina genes have been shown to be horizontally transferred from various bacterial groups to the last common ancestor either of Methanosarcinaceae, or Methanosarcina, or later in the evolution. Simulation of the composition of the NCBI protein non-redundant database for different years demonstrates that the estimates of the HGT rate have decreased drastically since 2002, the year of publication of the first Methanosarcina genome. The phylogenetic distribution of HT gene donors is non-uniform. Most HT genes were transferred from Firmicutes and Proteobacteria, while no HGT events from Actinobacteria to the common ancestor of Methanosarcinaceae were found. About 50% of HT genes are involved in metabolism. Horizontal transfer of transcription factors is not common, while 46% of horizontally transferred genes have demonstrated differential expression in a variety of conditions. HGT of complete operons is relatively infrequent and half of HT genes do not belong to operons.
Conclusions: While genes of bacterial origin are still more frequent in Methanosarcinaceae than in other Archaea, most HGT events described earlier as Methanosarcina-specific seem to have occurred before the divergence of Methanosarcinaceae. Genes horizontally transferred from bacteria to archaea neither tend to be transferred with their regulators, nor in long operons.
Affiliation(s)
- Sofya K Garushyants
- A.A. Kharkevich Institute for Information Transmission Problems, RAS, Bolshoi Karetny per. 19, build.1, Moscow, 127051, Russia.
- Marat D Kazanov
- A.A. Kharkevich Institute for Information Transmission Problems, RAS, Bolshoi Karetny per. 19, build.1, Moscow, 127051, Russia.
- Mikhail S Gelfand
- A.A. Kharkevich Institute for Information Transmission Problems, RAS, Bolshoi Karetny per. 19, build.1, Moscow, 127051, Russia; Faculty of Bioengineering and Bioinformatics, M.V. Lomonosov Moscow State University, Vorobievy Gory 1-73, Moscow, 119991, Russia.
|
24
|
Abstract
Horizontal or Lateral Gene Transfer (HGT or LGT) is the transmission of portions of genomic DNA between organisms through a process decoupled from vertical inheritance. In the presence of HGT events, different fragments of the genome are the result of different evolutionary histories. This can therefore complicate the investigations of evolutionary relatedness of lineages and species. Also, as HGT can bring into genomes radically different genotypes from distant lineages, or even new genes bearing new functions, it is a major source of phenotypic innovation and a mechanism of niche adaptation. For example, of particular relevance to human health is the lateral transfer of antibiotic resistance and pathogenicity determinants, leading to the emergence of pathogenic lineages. Computational identification of HGT events relies upon the investigation of sequence composition or evolutionary history of genes. Sequence composition-based ("parametric") methods search for deviations from the genomic average, whereas evolutionary history-based ("phylogenetic") approaches identify genes whose evolutionary history significantly differs from that of the host species. The evaluation and benchmarking of HGT inference methods typically rely upon simulated genomes, for which the true history is known. On real data, different methods tend to infer different HGT events, and as a result it can be difficult to ascertain all but simple and clear-cut HGT events.
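The sequence composition-based ("parametric") idea described in this abstract can be illustrated with a minimal sketch. It is not any published tool: it only flags sliding windows whose GC content deviates from the genome-wide window average by more than a z-score cutoff. The function names, window size, step and cutoff are assumptions chosen for illustration.

```python
from statistics import mean, pstdev


def gc_fraction(seq):
    """Fraction of G and C bases in a sequence (0.0 for an empty string)."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq) if seq else 0.0


def atypical_windows(genome, window=1000, step=500, z_cutoff=2.0):
    """Return (start, gc, z) for windows whose GC content is atypical.

    Hypothetical parameters: window, step and z_cutoff are illustrative
    defaults, not values taken from the cited work.
    """
    starts = list(range(0, len(genome) - window + 1, step))
    gcs = [gc_fraction(genome[s:s + window]) for s in starts]
    if not gcs:
        return []  # genome shorter than one window
    mu, sigma = mean(gcs), pstdev(gcs)
    if sigma == 0:
        return []  # perfectly uniform composition, nothing deviates
    hits = []
    for s, gc in zip(starts, gcs):
        z = (gc - mu) / sigma
        if abs(z) >= z_cutoff:
            hits.append((s, gc, z))
    return hits


# Toy genome: AT-rich background with a 1 kb GC-rich island at position 4000.
toy = "AT" * 2000 + "GC" * 500 + "AT" * 2500
flagged = atypical_windows(toy)
```

In this toy case only the window that lies entirely inside the island exceeds the cutoff. Real genomes would need richer signatures (codon usage, oligonucleotide frequencies) and, as the abstract notes, different methods often disagree on real data.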
Affiliation(s)
- Nives Škunca
- ETH Zurich, Zurich, Switzerland
- Swiss Institute of Bioinformatics, Zurich, Switzerland
- Christophe Dessimoz
- University College London, London, United Kingdom
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom
|
25
|
A functional and phylogenetic comparison of quorum sensing related genes in Brucella melitensis 16M. J Microbiol 2014; 52:709-15. [PMID: 24994008 DOI: 10.1007/s12275-014-3570-x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Received: 10/30/2013] [Revised: 03/04/2014] [Accepted: 04/29/2014] [Indexed: 10/25/2022]
Abstract
A quorum-sensing (QS) system is involved in Brucella melitensis survival inside the host cell. Two transcriptional regulators identified in B. melitensis, BlxR and VjbR, regulate the expression of virB, an operon required for bacterial intracellular persistence. In this work, 628 genes affected by VjbR and 124 by BlxR were analyzed to gain insights into their functional and taxonomical distributions among the Bacteria and Archaea cellular domains. In this regard, the Cluster of Orthologous Groups (COG) genes and orthologous genes in 789 nonredundant bacterial and archaeal genomes were obtained and compared against a group of randomly selected genes. From these analyses, we found 71 genes coaffected by VjbR and BlxR. In the COG comparison, VjbR activated genes associated with intracellular trafficking, secretion and vesicular transport and defense mechanisms, while BlxR affected genes related to energy production and conversion (with an equal effect) and translation, ribosomal structure and biogenesis, posttranslational modifications and carbohydrate and amino acid metabolism (with a negative effect). When the taxonomical distribution of orthologous genes was evaluated, the VjbR- and BlxR-related genes presented more orthologous genes in Crenarchaeota (Archaea), Firmicutes, and Tenericutes and fewer genes in Proteobacteria than expected by chance. These findings suggest that the QS system exerts a fine-tuning modulation of gene expression, by which VjbR activates genes related to infection persistence and defense, while BlxR represses general bacterial metabolism for intracellular adaptations. Finally, these affected genes present a degree of presence among Bacteria and Archaea genomes that is different from that expected by chance.
|
26
|
Brambila-Tapia AJL, Armenta-Medina D, Rivera-Gomez N, Perez-Rueda E. Main functions and taxonomic distribution of virulence genes in Brucella melitensis 16 M. PLoS One 2014; 9:e100349. [PMID: 24964015 PMCID: PMC4070974 DOI: 10.1371/journal.pone.0100349] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Received: 02/05/2014] [Accepted: 05/24/2014] [Indexed: 11/19/2022] Open
Abstract
Many virulence genes have been detected in attenuated mutants of Brucella melitensis 16 M; nevertheless, a complete report of these genes, including the main Cluster of Orthologous Groups (COG) represented as well as the taxonomical distribution among all complete bacterial and archaeal genomes, has not yet been produced. In this work a total of 160 virulence genes that have been reported in attenuated mutants in B. melitensis were included and analyzed. Additionally, we obtained 250 B. melitensis randomly selected genes as a reference group for the taxonomical comparisons. The COGs and the taxonomical distribution profile for 789 nonredundant bacterial and archaeal genomes were obtained and compared with the whole-genome COG distribution and with the 250 randomly selected genes, respectively. The main COGs associated with virulence genes corresponded to the following: intracellular trafficking, secretion and vesicular transport (U); cell motility (N); nucleotide transport and metabolism (F); transcription (K); and cell wall/membrane/envelope biogenesis (M). In addition, we found that virulence genes presented a higher proportion of orthologs in the Euryarchaeota and Proteobacteria phyla, with a significant decrease in Chlamydiae, Bacteroidetes, Tenericutes, Firmicutes and Thermotogae. In conclusion, we found that genes related to specific functions are more relevant to B. melitensis virulence, with COG U being the most significant. Additionally, the taxonomical distribution of virulence genes highlights the importance of these genes in the related Proteobacteria, being less relevant in distant groups of organisms with the exception of Euryarchaeota.
Affiliation(s)
- Aniel Jessica Leticia Brambila-Tapia
- Departamento de Ingeniería Celular y Biocatálisis, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México
- Dagoberto Armenta-Medina
- Departamento de Ingeniería Celular y Biocatálisis, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México
- Nancy Rivera-Gomez
- Departamento de Ingeniería Celular y Biocatálisis, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México
- Ernesto Perez-Rueda
- Departamento de Ingeniería Celular y Biocatálisis, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México
|
27
|
Sjostrand J, Tofigh A, Daubin V, Arvestad L, Sennblad B, Lagergren J. A Bayesian Method for Analyzing Lateral Gene Transfer. Syst Biol 2014; 63:409-20. [DOI: 10.1093/sysbio/syu007] [Citation(s) in RCA: 66] [Impact Index Per Article: 6.6] [Indexed: 12/18/2022] Open
|
28
|
Perez Di Giorgio J, Soto G, Alleva K, Jozefkowicz C, Amodeo G, Muschietti JP, Ayub ND. Prediction of Aquaporin Function by Integrating Evolutionary and Functional Analyses. J Membr Biol 2013; 247:107-25. [DOI: 10.1007/s00232-013-9618-8] [Citation(s) in RCA: 53] [Impact Index Per Article: 4.8] [Received: 09/10/2013] [Accepted: 11/09/2013] [Indexed: 01/08/2023]
|
29
|
Bobay LM, Touchon M, Rocha EPC. Manipulating or superseding host recombination functions: a dilemma that shapes phage evolvability. PLoS Genet 2013; 9:e1003825. [PMID: 24086157 PMCID: PMC3784561 DOI: 10.1371/journal.pgen.1003825] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.2] [Received: 05/21/2013] [Accepted: 08/08/2013] [Indexed: 11/18/2022] Open
Abstract
Phages, like many parasites, tend to have small genomes and may encode autonomous functions or manipulate those of their hosts. Recombination functions are essential for phage replication and diversification. They are also nearly ubiquitous in bacteria. The E. coli genome encodes many copies of an octamer (Chi) motif that upon recognition by RecBCD favors repair of double strand breaks by homologous recombination. This might allow discrimination of self from non-self because RecBCD degrades DNA lacking Chi. Bacteriophage Lambda, an E. coli parasite, lacks Chi motifs, but escapes degradation by inhibiting RecBCD and encoding its own autonomous recombination machinery. We found that only half of 275 lambdoid genomes encode recombinases, the remainder relying on the host's machinery. Unexpectedly, we found that some lambdoid phages contain extremely high numbers of Chi motifs concentrated between the phage origin of replication and the packaging site. This suggests a tight association between replication, packaging and RecBCD-mediated recombination in these phages. Indeed, phages lacking recombinases strongly over-represent Chi motifs. Conversely, phages encoding recombinases and inhibiting host recombination machinery select for the absence of Chi motifs. Host and phage recombinases use different mechanisms and the latter are more tolerant to sequence divergence. Accordingly, we show that phages encoding their own recombination machinery have more mosaic genomes resulting from recent recombination events and have more diverse gene repertoires, i.e. larger pan genomes. We discuss the costs and benefits of superseding or manipulating host recombination functions and how this decision shapes phage genome structure and evolvability.
Author summary: Bacterial viruses, called bacteriophages, are extremely abundant in the biosphere. They have key roles in the regulation of bacterial populations and in the diversification of bacterial genomes. Among these viruses, lambdoid phages are very abundant in enterobacteria and exchange genetic material very frequently. This latter process is thought to increase phage diversity and therefore facilitate adaptation to hosts. Recombination is also essential for the replication of many lambdoid phages. Lambdoids have been described to encode their own recombination genes and inhibit their hosts'. In this study, we show that lambdoids are split regarding their capacity to encode autonomous recombination functions and that this affects the abundance of recombination-related sequence motifs. Half of the phages encode an autonomous system and inhibit their hosts'. The trade-off between superseding and manipulating the hosts' recombination functions has important consequences. The phages encoding autonomous recombination functions have more diverse gene repertoires and recombine more frequently. Viruses, like many other parasites, have small genomes and depend on their hosts for several housekeeping functions. Hence, they often face trade-offs between supersession and manipulation of molecular machineries. Our results suggest these trade-offs may shape viral gene repertoires, their sequence composition and even influence their evolvability.
Affiliation(s)
- Louis-Marie Bobay
- Microbial Evolutionary Genomics, Institut Pasteur, Paris, France
- CNRS, UMR3525, Paris, France
- Université Pierre et Marie Curie, Cellule Pasteur UPMC, Paris, France
- Marie Touchon
- Microbial Evolutionary Genomics, Institut Pasteur, Paris, France
- CNRS, UMR3525, Paris, France
- Eduardo P. C. Rocha
- Microbial Evolutionary Genomics, Institut Pasteur, Paris, France
- CNRS, UMR3525, Paris, France
|
30
|
Swithers KS, Soucy SM, Lasek-Nesselquist E, Lapierre P, Gogarten JP. Distribution and Evolution of the Mobile vma-1b Intein. Mol Biol Evol 2013; 30:2676-87. [DOI: 10.1093/molbev/mst164] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Indexed: 11/13/2022] Open
|
31
|
Wolf YI, Koonin EV. Genome reduction as the dominant mode of evolution. Bioessays 2013; 35:829-37. [PMID: 23801028 PMCID: PMC3840695 DOI: 10.1002/bies.201300037] [Citation(s) in RCA: 192] [Impact Index Per Article: 17.5] [Received: 03/05/2013] [Accepted: 05/21/2013] [Indexed: 11/09/2022]
Abstract
A common belief is that evolution generally proceeds towards greater complexity at both the organismal and the genomic level, numerous examples of reductive evolution of parasites and symbionts notwithstanding. However, recent evolutionary reconstructions challenge this notion. Two notable examples are the reconstruction of the complex archaeal ancestor and the intron-rich ancestor of eukaryotes. In both cases, evolution in most of the lineages was apparently dominated by extensive loss of genes and introns, respectively. These and many other cases of reductive evolution are consistent with a general model composed of two distinct evolutionary phases: the short, explosive, innovation phase that leads to an abrupt increase in genome complexity, followed by a much longer reductive phase, which encompasses either a neutral ratchet of genetic material loss or adaptive genome streamlining. Quantitatively, the evolution of genomes appears to be dominated by reduction and simplification, punctuated by episodes of complexification.
Affiliation(s)
- Yuri I Wolf
- National Center for Biotechnology Information, NLM, National Institutes of Health, Bethesda, MD, USA.
|
32
|
Snir S, Wolf YI, Koonin EV. Universal pacemaker of genome evolution. PLoS Comput Biol 2012; 8:e1002785. [PMID: 23209393 PMCID: PMC3510094 DOI: 10.1371/journal.pcbi.1002785] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.0] [Received: 06/12/2012] [Accepted: 10/02/2012] [Indexed: 11/18/2022] Open
Abstract
A fundamental observation of comparative genomics is that the distribution of evolution rates across the complete sets of orthologous genes in pairs of related genomes remains virtually unchanged throughout the evolution of life, from bacteria to mammals. The most straightforward explanation for the conservation of this distribution appears to be that the relative evolution rates of all genes remain nearly constant, or in other words, that evolutionary rates of different genes are strongly correlated within each evolving genome. This correlation could be explained by a model that we denoted Universal PaceMaker (UPM) of genome evolution. The UPM model posits that the rate of evolution changes synchronously across genome-wide sets of genes in all evolving lineages. Alternatively, however, the correlation between the evolutionary rates of genes could be a simple consequence of molecular clock (MC). We sought to differentiate between the MC and UPM models by fitting thousands of phylogenetic trees for bacterial and archaeal genes to supertrees that reflect the dominant trend of vertical descent in the evolution of archaea and bacteria and that were constrained according to the two models. The goodness of fit for the UPM model was better than the fit for the MC model, with overwhelming statistical significance, although similarly to the MC, the UPM is strongly overdispersed. Thus, the results of this analysis reveal a universal, genome-wide pacemaker of evolution that could have been in operation throughout the history of life.
Affiliation(s)
- Sagi Snir
- Department of Evolutionary and Environmental Biology and The Institute of Evolution, University of Haifa Mount Carmel, Haifa, Israel
|
33
|
Koonin EV, Wolf YI. Evolution of microbes and viruses: a paradigm shift in evolutionary biology? Front Cell Infect Microbiol 2012; 2:119. [PMID: 22993722 PMCID: PMC3440604 DOI: 10.3389/fcimb.2012.00119] [Citation(s) in RCA: 83] [Impact Index Per Article: 6.9] [Received: 08/08/2012] [Accepted: 08/27/2012] [Indexed: 01/21/2023] Open
Abstract
When Charles Darwin formulated the central principles of evolutionary biology in the Origin of Species in 1859 and the architects of the Modern Synthesis integrated these principles with population genetics almost a century later, the principal if not the sole objects of evolutionary biology were multicellular eukaryotes, primarily animals and plants. Before the advent of efficient gene sequencing, all attempts to extend evolutionary studies to bacteria were futile. Sequencing of the rRNA genes in thousands of microbes allowed the construction of the three-domain “ribosomal Tree of Life” that was widely thought to have resolved the evolutionary relationships between the cellular life forms. However, subsequent massive sequencing of numerous, complete microbial genomes revealed novel evolutionary phenomena, the most fundamental of these being: (1) pervasive horizontal gene transfer (HGT), in large part mediated by viruses and plasmids, that shapes the genomes of archaea and bacteria and calls for a radical revision (if not abandonment) of the Tree of Life concept, (2) Lamarckian-type inheritance that appears to be critical for antivirus defense and other forms of adaptation in prokaryotes, and (3) evolution of evolvability, i.e., dedicated mechanisms for evolution such as vehicles for HGT and stress-induced mutagenesis systems. In the non-cellular part of the microbial world, phylogenomics and metagenomics of viruses and related selfish genetic elements revealed enormous genetic and molecular diversity and extremely high abundance of viruses that come across as the dominant biological entities on earth. Furthermore, the perennial arms race between viruses and their hosts is one of the defining factors of evolution. Thus, microbial phylogenomics adds new dimensions to the fundamental picture of evolution even as the principle of descent with modification discovered by Darwin and the laws of population genetics remain at the core of evolutionary biology.
Affiliation(s)
- Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health Bethesda, MD, USA.
|
34
|
The ecology of bacterial genes and the survival of the new. International Journal of Evolutionary Biology 2012; 2012:394026. [PMID: 22900231 PMCID: PMC3415099 DOI: 10.1155/2012/394026] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.4] [Received: 04/21/2012] [Accepted: 06/26/2012] [Indexed: 11/18/2022]
Abstract
Much of the observed variation among closely related bacterial genomes is attributable to gains and losses of genes that are acquired horizontally as well as to gene duplications and larger amplifications. The genomic flexibility that results from these mechanisms certainly contributes to the ability of bacteria to survive and adapt in varying environmental challenges. However, the duplicability and transferability of individual genes imply that natural selection should operate, not only at the organismal level, but also at the level of the gene. Genes can be considered semiautonomous entities that possess specific functional niches and evolutionary dynamics. The evolution of bacterial genes should respond both to selective pressures that favor competition, mostly among orthologs or paralogs that may occupy the same functional niches, and cooperation, with the majority of other genes coexisting in a given genome. The relative importance of either type of selection is likely to vary among different types of genes, based on the functional niches they cover and on the tightness of their association with specific organismal lineages. The frequent availability of new functional niches caused by environmental changes and biotic evolution should enable the constant diversification of gene families and the survival of new lineages of genes.
|
35
|
Jiao YL, Wang SJ, Lv MS, Fang YW, Liu S. An evolutionary analysis of the GH57 amylopullulanases based on the DOMON_glucodextranase_like domains. J Basic Microbiol 2012; 53:231-9. [DOI: 10.1002/jobm.201100530] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Received: 10/26/2011] [Accepted: 01/18/2012] [Indexed: 11/11/2022]
Affiliation(s)
- Yu-Liang Jiao
- College of Marine Sciences; HuaiHai Institute of Technology; Lianyungang, People's Republic of China
- Shu-Jun Wang
- College of Marine Sciences; HuaiHai Institute of Technology; Lianyungang, People's Republic of China
- Ming-Sheng Lv
- College of Marine Sciences; HuaiHai Institute of Technology; Lianyungang, People's Republic of China
- Yao-Wei Fang
- College of Marine Sciences; HuaiHai Institute of Technology; Lianyungang, People's Republic of China
- Shu Liu
- College of Marine Sciences; HuaiHai Institute of Technology; Lianyungang, People's Republic of China
|
36
|
Microbial systematics in the post-genomics era. Antonie van Leeuwenhoek 2011; 101:45-54. [DOI: 10.1007/s10482-011-9663-1] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.0] [Received: 09/01/2011] [Accepted: 10/15/2011] [Indexed: 10/16/2022]
|
37
|
Andam CP, Gogarten JP. Biased gene transfer and its implications for the concept of lineage. Biol Direct 2011; 6:47. [PMID: 21943000 PMCID: PMC3191353 DOI: 10.1186/1745-6150-6-47] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.5] [Received: 02/03/2011] [Accepted: 09/23/2011] [Indexed: 11/10/2022] Open
Abstract
Background: In the presence of horizontal gene transfer (HGT), the concepts of lineage and genealogy in the microbial world become more ambiguous because chimeric genomes trace their ancestry from a myriad of sources, both living and extinct.
Results: We present the evolutionary histories of three aminoacyl-tRNA synthetases (aaRS) to illustrate that the concept of organismal lineage in the prokaryotic world is defined by both vertical inheritance and reticulations due to HGT. The acquisition of a novel gene from a distantly related taxon can be considered as a shared derived character that demarcates a group of organisms, as in the case of the spirochaete Phenylalanyl-tRNA synthetase (PheRS). On the other hand, when organisms transfer genetic material with their close kin, the similarity and therefore relatedness observed among them is essentially shaped by gene transfer. Studying the distribution patterns of divergent genes with identical functions, referred to as homeoalleles, can reveal preferences for transfer partners. We describe the very ancient origin and the distribution of the archaeal homeoalleles for Threonyl-tRNA synthetases (ThrRS) and Seryl-tRNA synthetases (SerRS).
Conclusions: Patterns created through biased HGT can be indistinguishable from those created through shared organismal ancestry. A re-evaluation of the definition of lineage is necessary to reflect genetic relatedness due to both HGT and vertical inheritance. In most instances, HGT bias will maintain and strengthen similarity within groups. Only in cases where HGT bias is due to other factors, such as shared ecological niche, do patterns emerge from gene phylogenies that are in conflict with those reflecting shared organismal ancestry.
Reviewers: This article was reviewed by W. Ford Doolittle, François-Joseph Lapointe, and Frederic Bouchard.
Affiliation(s)
- Cheryl P Andam
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT 06269-3125, USA.
|
38
|
A close relationship between primary nucleotides sequence structure and the composition of functional genes in the genome of prokaryotes. Mol Phylogenet Evol 2011; 61:650-8. [PMID: 21864693 DOI: 10.1016/j.ympev.2011.08.011] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.6] [Received: 09/29/2010] [Revised: 05/31/2011] [Accepted: 08/05/2011] [Indexed: 11/21/2022]
Abstract
Comparative genomics is an essential tool to unravel how genomes change over evolutionary time and to gain clues on the links between functional genomics and evolution. In prokaryotes, the large number of good-quality genome sequences available in public databases and the recently developed large-scale computational methods offer an unprecedented view on the ecology and evolution of microorganisms through comparative genomics. In this work, we examined the links among genome structure (i.e., the sequential distribution of nucleotides itself by detrended fluctuation analysis, DFA) and genomic diversity (i.e., gene functionality by Clusters of Orthologous Genes, COGs) in 828 fully sequenced prokaryotic genomes from 548 different bacterial and archaeal species. The DFA scaling exponent α indicated persistent long-range correlations (fractality) in each genome analyzed. Higher resolution power was found when considering the sequential succession of purine (AG) vs. pyrimidine (CT) bases than either keto (GT) vs. amino (AC) forms or strongly (GC) vs. weakly (AT) bonded nucleotides. Interestingly, the phyla Aquificae, Fusobacteria, Dictyoglomi, Nitrospirae, and Thermotogae were closer to archaea than to their bacterial counterparts. A strong significant correlation was found between the scaling exponent α and the COGs distribution, and we consistently observed that the larger α was, the more heterogeneous the gene distribution within each functional category, suggesting a close relationship between primary nucleotides sequence structure and functional genes composition.
|
39
|
Yeasty clocks: dating genomic changes in yeasts. C R Biol 2011; 334:620-8. [PMID: 21819943 DOI: 10.1016/j.crvi.2011.05.010] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.7] [Received: 11/12/2010] [Accepted: 03/17/2011] [Indexed: 02/04/2023]
Abstract
Calibration of clocks to date evolutionary changes is of primary importance for comparative genomics. In the absence of fossil records, the dating of changes during yeast genome evolution can only rely on the properties of the genomes themselves, given the uncertainty of extrapolations using clocks from other organisms. In this work, we use the experimentally determined mutational rate of Saccharomyces cerevisiae to calculate the numbers of successive generations corresponding to observed sequence polymorphism between strains or species of other yeasts. We then examine synteny conservation across the entire subphylum of Saccharomycotina yeasts, and compare this second clock based on chromosomal rearrangements with the first one based on sequence divergence. A non-linear relationship is observed, that interestingly also applies to insects although, for equivalent sequence divergence, their rate of chromosomal rearrangements is higher than that of yeasts.
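The first clock described in this abstract (converting sequence divergence into a number of generations via a per-generation mutation rate) can be sketched as follows. This is a generic neutral-clock calculation, not the authors' procedure: it assumes a Jukes-Cantor correction for multiple hits and the textbook relation d ≈ 2μt for two lineages diverging from a common ancestor. The function names and the mutation rate used in the example are hypothetical placeholders.

```python
import math


def jukes_cantor_distance(p):
    """Substitutions per site implied by an observed proportion p of
    differing sites, under the Jukes-Cantor model (an assumption here)."""
    if not 0.0 <= p < 0.75:
        raise ValueError("p must lie in [0, 0.75) for Jukes-Cantor")
    return -0.75 * math.log(1.0 - 4.0 * p / 3.0)


def generations_since_divergence(p, mu):
    """Generations separating each lineage from the common ancestor.

    mu is the mutation rate per site per generation. The factor 2
    accounts for changes accumulating along both lineages.
    """
    if mu <= 0:
        raise ValueError("mu must be positive")
    return jukes_cantor_distance(p) / (2.0 * mu)


# Hypothetical inputs: 1% observed divergence, placeholder rate of 1e-10.
example = generations_since_divergence(0.01, 1e-10)
```

With these placeholder inputs the estimate is on the order of 5 × 10^7 generations. Any real estimate depends on the measured rate, and the abstract itself stresses the uncertainty of extrapolating clocks across organisms.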
|
40
|
Core gene set as the basis of multilocus sequence analysis of the subclass Actinobacteridae. PLoS One 2011; 6:e14792. [PMID: 21483493 PMCID: PMC3069002 DOI: 10.1371/journal.pone.0014792] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.5] [Received: 03/12/2009] [Accepted: 12/13/2010] [Indexed: 11/20/2022] Open
Abstract
Comparative genomic sequencing is shedding new light on bacterial identification, taxonomy and phylogeny. An in silico assessment of a core gene set necessary for cellular functioning was made to determine a consensus set of genes that would be useful for the identification, taxonomy and phylogeny of the species belonging to the subclass Actinobacteridae, which contains two orders, Actinomycetales and Bifidobacteriales. The subclass Actinobacteridae comprises about 85% of the actinobacteria families. The following recommended criteria were used to establish a comprehensive gene set: the gene should (i) be long enough to contain phylogenetically useful information, (ii) not be subject to horizontal gene transfer, (iii) be a single copy, (iv) have at least two regions sufficiently conserved to allow the design of amplification and sequencing primers, and (v) predict whole-genome relationships. We applied these constraints to 50 different Actinobacteridae genomes and made 1,224 pairwise comparisons of the genome conserved regions and gene fragments obtained by using the Sequence VARiability Analysis Program (SVARAP), which allows primer design. Following a comparative statistical modeling phase, 3 gene fragments were selected, ychF, rpoB, and secY, with R² > 0.85. Selected sets of broad-range primers were tested from the 3 gene fragments and were demonstrated to be useful for amplification and sequencing of 25 species belonging to 9 genera of Actinobacteridae. The intraspecies similarities were 96.3–100% for ychF, 97.8–100% for rpoB and 96.9–100% for secY among 73 strains belonging to 15 species of the subclass Actinobacteridae, compared to 99.4–100% for 16S rRNA. The phylogenetic topology obtained from the combined datasets ychF+rpoB+secY was globally similar to that inferred from the 16S rRNA but with higher confidence. It was concluded that multi-locus sequence analysis using a core gene set might represent the first consensus and valid approach for investigating bacterial identification, phylogeny and taxonomy.
|
41
|
Dikow RB. Systematic relationships within the Vibrionaceae (Bacteria: Gammaproteobacteria): steps toward a phylogenetic taxonomy. Cladistics 2011; 27:9-28. [DOI: 10.1111/j.1096-0031.2010.00312.x] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Indexed: 01/22/2023] Open
|
42
|
Ogura A, Yoshida MA, Fukuzaki M, Sese J. In vitro homology search array comprehensively reveals highly conserved genes and their functional characteristics in non-sequenced species. BMC Genomics 2010; 11 Suppl 4:S9. [PMID: 21143818 PMCID: PMC3005928 DOI: 10.1186/1471-2164-11-s4-s9] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Indexed: 12/25/2022]
Abstract
BACKGROUND With the increase in genomic and transcriptomic data produced by the recent advancements in next generation sequencers and microarrays, it is now easier than ever to conduct large-scale comparative genomic studies for familiar species. However, there are more than ten million species on earth, and the study of all remaining species is not realistic in terms of cost and time. There have been a number of attempts at using microarrays for cross-species hybridization; however, those approaches only utilized the same probes for each species or different probes designed from orthologous genes. To establish easier and cheaper methods for the large-scale comparative genomic study of non-sequenced species, we developed an in vitro homology search array with the aid of a bioinformatic approach to probe design. RESULTS To perform large-scale genomic comparisons of non-sequenced species, we chose squid, one of the most intelligent species among Protostomes, for comparison with human genes. We designed a microarray using human single copy genes and conducted microarray experiments with mRNAs extracted from the squid. Multi-copy genes could not be detected using the microarray in this study because their sequence similarity caused cross-hybridization. A search for squid homologous genes among human genes revealed that 68% of the human probes tested showed the expression of squid homolog genes and 95 genes were confirmed to be expressed highly in squid. Functional classification analysis showed that these highly expressed genes comprise DNA binding proteins, which are under pressure of DNA level mutation and, consequently, show high similarity at the nucleotide level. CONCLUSIONS Our array could detect homologous genes in squids and humans in spite of the distant phylogenic relationships between the species. 
This experimental method will be useful for identifying homologs in non-sequenced species, for the development of genetic resources and for the collection of information on biodiversity, particularly when using the genome of sibling or closely related species.
Affiliation(s)
- Atsushi Ogura
- Ochadai Academic Production, Ochanomizu University, Bunkyo, Tokyo, Japan.
|
43
|
Kloesges T, Popa O, Martin W, Dagan T. Networks of gene sharing among 329 proteobacterial genomes reveal differences in lateral gene transfer frequency at different phylogenetic depths. Mol Biol Evol 2010; 28:1057-74. [PMID: 21059789 PMCID: PMC3021791 DOI: 10.1093/molbev/msq297] [Citation(s) in RCA: 111] [Impact Index Per Article: 7.9] [Indexed: 12/31/2022]
Abstract
Lateral gene transfer (LGT) is an important mechanism of natural variation among prokaryotes. Over the full course of evolution, most or all of the genes resident in a given prokaryotic genome have been affected by LGT, yet the frequency of LGT can vary greatly across genes and across prokaryotic groups. The proteobacteria are among the most diverse of prokaryotic taxa. The prevalence of LGT in their genome evolution calls for the application of network-based methods instead of tree-based methods to investigate the relationships among these species. Here, we report networks that capture both vertical and horizontal components of evolutionary history among 1,207,272 proteins distributed across 329 sequenced proteobacterial genomes. The network of shared proteins reveals modularity structure that does not correspond to current classification schemes. On the basis of shared protein-coding genes, the five classes of proteobacteria fall into two main modules, one including the alpha-, delta-, and epsilonproteobacteria and the other including beta- and gammaproteobacteria. The first module is stable over different protein identity thresholds. The second shows more plasticity with regard to the sequence conservation of proteins sampled, with the gammaproteobacteria showing the most chameleon-like evolutionary characteristics within the present sample. Using a minimal lateral network approach, we compared LGT rates at different phylogenetic depths. In general, gene evolution by LGT within proteobacteria is very common. At least one LGT event was inferred to have occurred in at least 75% of the protein families. The average LGT rate at the species and class depth is about one LGT event per protein family, the rate doubling at the phylum level to an average of two LGT events per protein family. Hence, our results indicate that the rate of gene acquisition per protein family is similar at the level of species (by recombination) and at the level of classes (by LGT). 
The frequency of LGT per genome strongly depends on the species lifestyle, with endosymbionts showing far lower LGT frequencies than free-living species. Moreover, the nature of the transferred genes suggests that gene transfer in proteobacteria is frequently mediated by conjugation.
Affiliation(s)
- Thorsten Kloesges
- Institute of Botany III, Heinrich-Heine University Düsseldorf, Düsseldorf, Germany
|
44
|
Bohlin J, Snipen L, Cloeckaert A, Lagesen K, Ussery D, Kristoffersen AB, Godfroid J. Genomic comparisons of Brucella spp. and closely related bacteria using base compositional and proteome based methods. BMC Evol Biol 2010; 10:249. [PMID: 20707916 PMCID: PMC2928237 DOI: 10.1186/1471-2148-10-249] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.6] [Received: 03/16/2010] [Accepted: 08/13/2010] [Indexed: 11/30/2022]
Abstract
Background: Classification of bacteria within the genus Brucella has been difficult due in part to considerable genomic homogeneity between the different species and biovars, in spite of clear differences in phenotypes. Therefore, many different methods have been used to assess Brucella taxonomy. In the current work, we examine 32 sequenced genomes from genus Brucella representing the six classical species, as well as more recently described species, using bioinformatical methods. Comparisons were made at the level of genomic DNA using oligonucleotide based methods (Markov chain based genomic signatures, genomic codon and amino acid frequencies based comparisons) and proteomes (all-against-all BLAST protein comparisons and pan-genomic analyses). Results: We found that the oligonucleotide based methods gave different results compared to those of the proteome based methods. Differences were also found between the oligonucleotide based methods used. Whilst the Markov chain based genomic signatures grouped the different species in genus Brucella according to host preference, the codon and amino acid frequencies based methods reflected small differences between the Brucella species. Only minor differences could be detected between all genera included in this study using the codon and amino acid frequencies based methods. Proteome comparisons were found to be in strong accordance with current Brucella taxonomy, indicating a remarkable association between gene gain or loss on one hand and mutations in marker genes on the other. The proteome based methods found greater similarity between Brucella species and Ochrobactrum species than between species within genus Agrobacterium compared to each other. In other words, proteome comparisons of species within genus Agrobacterium were found to be more diverse than proteome comparisons between species in genus Brucella and genus Ochrobactrum. Pan-genomic analyses indicated that uptake of DNA from outside genus Brucella appears to be limited. Conclusions: While both the proteome based methods and the Markov chain based genomic signatures were able to reflect environmental diversity between the different species and strains of genus Brucella, the genomic codon and amino acid frequencies based comparisons were not found adequate for such comparisons. The proteome comparison based phylogenies of the species in genus Brucella showed a surprising consistency with current Brucella taxonomy.
Affiliation(s)
- Jon Bohlin
- Norwegian School of Veterinary Science, Department of Food Safety and Infection Biology, Epicenter, Ullevålsveien 72, PO Box 8146 Dep, NO-0033 Oslo, Norway.
|
45
|
Dagan T, Roettger M, Bryant D, Martin W. Genome networks root the tree of life between prokaryotic domains. Genome Biol Evol 2010; 2:379-92. [PMID: 20624742 PMCID: PMC2997548 DOI: 10.1093/gbe/evq025] [Citation(s) in RCA: 71] [Impact Index Per Article: 5.1] [Indexed: 11/16/2022]
Abstract
Eukaryotes arose from prokaryotes, hence the root in the tree of life resides among the prokaryotic domains. The position of the root is still debated, although pinpointing it would aid our understanding of the early evolution of life. Because prokaryote evolution was long viewed as a tree-like process of lineage bifurcations, efforts to identify the most ancient microbial lineage split have traditionally focused on positioning a root on a phylogenetic tree constructed from one or several genes. Such studies have delivered widely conflicting results on the position of the root, this being mainly due to methodological problems inherent to deep gene phylogeny and the workings of lateral gene transfer among prokaryotes over evolutionary time. Here, we report the position of the root determined with whole genome data using network-based procedures that take into account both gene presence or absence and the level of sequence similarity among all individual gene families that are shared across genomes. On the basis of 562,321 protein-coding gene families distributed across 191 genomes, we find that the deepest divide in the prokaryotic world is interdomain, that is, separating the archaebacteria from the eubacteria. This result resonates with some older views but conflicts with the results of most studies over the last decade that have addressed the issue. In particular, several studies have suggested that the molecular distinctness of archaebacteria is not evidence for their antiquity relative to eubacteria but instead stems from some kind of inherently elevated rate of archaebacterial sequence change. Here, we specifically test for such a rate elevation across all prokaryotic lineages through the analysis of all possible quartets among eight genes duplicated in all prokaryotes, hence the last common ancestor thereof. 
The results show that neither the archaebacteria as a group nor the eubacteria as a group harbor evidence for elevated evolutionary rates in the sampled genes, either in the recent evolutionary past or in their common ancestor. The interdomain prokaryotic position of the root is thus not attributable to lineage-specific rate variation.
Affiliation(s)
- Tal Dagan
- Institute of Botany III, Heinrich-Heine University of Düsseldorf, Düsseldorf, Germany.
|
46
|
Biased gene transfer mimics patterns created through shared ancestry. Proc Natl Acad Sci U S A 2010; 107:10679-84. [PMID: 20495090 DOI: 10.1073/pnas.1001418107] [Citation(s) in RCA: 44] [Impact Index Per Article: 3.1] [Indexed: 11/18/2022]
Abstract
In phylogenetic reconstruction, two types of bacterial tyrosyl-tRNA synthetases (TyrRS) form distinct clades with many bacterial phyla represented in both clades. Very few taxa possess both forms, and maximum likelihood analysis of the distribution of TyrRS types suggests horizontal gene transfer (HGT), rather than an ancient duplication followed by differential gene loss, as the contributor to the evolutionary history of TyrRS in bacteria. However, for each TyrRS type, phylogenetic reconstruction yields phylogenies similar to the ribosomal phylogeny, revealing that frequent gene transfer has not destroyed the expected phylogeny; rather, the expected phylogenetic signal was reinforced or even created by HGT. We show that biased HGT can mimic patterns created through shared ancestry by in silico simulation. Furthermore, in cases where genomic synteny is sufficient to allow comparisons of relative gene positions, both tyrRS types occupy equivalent positions in closely related genomes, rejecting the loss hypothesis. Although the two types of bacterial TyrRS are only distantly related and only rarely coexist in a single genome, they have many features in common with alleles that are swapped between related lineages. We propose to label these functionally similar homologs as homeoalleles. We conclude that the observed phylogenetic pattern reflects both vertical inheritance and biased HGT and that the signal caused by common organismal descent is difficult to distinguish from the signal due to biased gene transfer.
|
47
|
Kuo CH, Ochman H. Inferring clocks when lacking rocks: the variable rates of molecular evolution in bacteria. Biol Direct 2009; 4:35. [PMID: 19788732 PMCID: PMC2760517 DOI: 10.1186/1745-6150-4-35] [Citation(s) in RCA: 102] [Impact Index Per Article: 6.8] [Received: 08/07/2009] [Accepted: 09/29/2009] [Indexed: 11/10/2022]
Abstract
Background: Because bacteria do not have a robust fossil record, attempts to infer the timing of events in their evolutionary history require comparisons of molecular sequences. This use of molecular clocks is based on the assumptions that substitution rates for homologous genes or sites are fairly constant through time and across taxa. Violation of these conditions can lead to erroneous inferences and result in estimates that are off by orders of magnitude. In this study, we examine the consistency of substitution rates among a set of conserved genes in diverse bacterial lineages, and address the questions regarding the validity of molecular dating. Results: By examining the evolution of the 16S rRNA gene in obligate endosymbionts, which can be calibrated by the fossil record of their hosts, we found that the rates are consistent within a clade but varied widely across different bacterial lineages. Genome-wide estimates of nonsynonymous and synonymous substitutions suggest that these two measures are highly variable in their rates across bacterial taxa. Genetic drift plays a fundamental role in determining the accumulation of substitutions in 16S rRNA genes and at nonsynonymous sites. Moreover, divergence estimates based on a set of universally conserved protein-coding genes also exhibit low correspondence to those based on 16S rRNA genes. Conclusion: Our results document a wide range of substitution rates across genes and bacterial taxa. This high level of variation cautions against the assumption of a universal molecular clock for inferring divergence times in bacteria. However, by applying relative-rate tests to homologous genes, it is possible to derive reliable local clocks that can be used to calibrate bacterial evolution. Reviewers: This article was reviewed by Adam Eyre-Walker, Simonetta Gribaldo and Tal Pupko (nominated by Dan Graur).
Affiliation(s)
- Chih-Horng Kuo
- Department of Ecology & Evolutionary Biology, University of Arizona, Tucson, Arizona 85721, USA.
|
48
|
Abstract
This chapter discusses the pros and cons of the existing computational methods for the detection of horizontal (or lateral) gene transfer and highlights the genome-wide studies utilizing these methods. The impact of horizontal gene transfer (HGT) on prokaryote genome evolution is discussed.
|
49
|
Bon E, Delaherche A, Bilhère E, De Daruvar A, Lonvaud-Funel A, Le Marrec C. Oenococcus oeni genome plasticity is associated with fitness. Appl Environ Microbiol 2009; 75:2079-90. [PMID: 19218413 PMCID: PMC2663225 DOI: 10.1128/aem.02194-08] [Citation(s) in RCA: 53] [Impact Index Per Article: 3.5] [Received: 09/22/2008] [Accepted: 02/02/2009] [Indexed: 12/19/2022]
Abstract
Oenococcus oeni strains are well-known for their considerable phenotypic variations in terms of tolerance to harsh wine conditions and malolactic activity. Genomic subtractive hybridization (SH) between two isolates with differing enological potentials was used to elucidate the genetic bases of this intraspecies diversity and identify novel genes involved in adaptation to wine. SH revealed 182 tester-specific fragments corresponding to 126 open reading frames (ORFs). A large proportion of the chromosome-related ORFs resembled genes involved in carbohydrate transport and metabolism, cell wall/membrane/envelope biogenesis, and replication, recombination, and repair. Six regions of genomic plasticity were identified, and their analysis suggested that both limited recombination and insertion/deletion events contributed to the vast genomic diversity observed in O. oeni. The association of selected sequences with adaptation to wine was further assessed by screening a large collection of strains using PCR. No sequences were found to be specific to highly performing (HP) strains alone. However, there was a statistically significant positive association between HP strains and the presence of eight gene sequences located on regions 2, 4, and 5. Gene expression patterns were significantly modified in HP strains, following exposure to one or more of the common stresses in wines. Regions 2 and 5 showed no traces of mobile elements and had normal GC content. In contrast, region 4 had the typical hallmarks of horizontal transfer, suggesting that the strategy of acquiring genes from other bacteria enhances the fitness of O. oeni strains.
|
50
|
Horizontal Gene Transfers in prokaryotes show differential preferences for metabolic and translational genes. BMC Evol Biol 2009; 9:9. [PMID: 19134215 PMCID: PMC2651853 DOI: 10.1186/1471-2148-9-9] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.6] [Received: 07/07/2008] [Accepted: 01/10/2009] [Indexed: 11/22/2022]
Abstract
Background: Horizontal gene transfer (HGT) is an important process, which contributes to bacterial pathogenesis and drug resistance. A number of methods have been proposed for detection of horizontal gene transfer. One successful approach to the detection of HGT events is due to Novichkov et al. (J. Bacteriology 186, 6575–85), who rely on comparing phylogenetic distances within a gene family with genomic distances of the source organisms. Building on their approach, we introduce outlier detection in the correlation between those two sets of distances. This approach is designed to detect horizontal transfers of the core set of genes present in many bacteria. The principle behind the method allows detection of xenologous gene displacements as well as acquisition of novel genes. Results: Simulations indicated that our method performs better than Novichkov et al.'s original approach. The approach very efficiently identified HGT between distantly related bacteria and also a limited number of gene transfers between closely related bacteria. In combination with sequence similarity and likelihood tests, it yields a measure robust enough to derive a set of 171 genes deemed likely to have been horizontally transferred. Further analysis of these 171 established horizontal transfer events gave interesting insights into the direction of transfer. Conclusion: The majority of transfers between archaea and bacteria have occurred in the direction from bacteria to archaea rather than the other way round. Genes transferred between the archaea and bacteria are mostly metabolic genes. On the other hand, genes transferred within the bacterial phyla are mainly involved in translation.
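The idea of outlier detection in the correlation between gene-family distances and genomic distances can be sketched as follows. This is only a minimal illustration under stated assumptions: the abstract does not specify the statistic used, so the ordinary least-squares fit, the residual z-score, the cutoff of 2.0, and the name `flag_outlier_pairs` are all hypothetical choices made for this example.

```python
from statistics import mean, pstdev


def flag_outlier_pairs(genome_dist, gene_dist, z_cutoff=2.0):
    """Flag organism pairs whose gene distance deviates from the genome-wide trend.

    genome_dist, gene_dist: dicts keyed by (organism_a, organism_b) tuples.
    Assumption: a straight-line fit of gene distance on genome distance,
    with outliers defined by residual z-score (not the published method).
    """
    pairs = sorted(genome_dist)
    xs = [genome_dist[p] for p in pairs]
    ys = [gene_dist[p] for p in pairs]
    mx, my = mean(xs), mean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("genome distances must not all be equal")
    # Ordinary least-squares slope and intercept.
    slope = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sxx
    intercept = my - slope * mx
    residuals = [y - (slope * x + intercept) for x, y in zip(xs, ys)]
    spread = pstdev(residuals)
    if spread < 1e-12:
        return []  # all points lie on the fitted line; nothing to flag
    return [p for p, r in zip(pairs, residuals) if abs(r) / spread > z_cutoff]
```

A pair flagged because its gene distance is much smaller than its genome distance predicts would be a candidate transfer in this toy setting; any real analysis would need the additional similarity and likelihood tests the abstract mentions.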
|