151
|
Yue J, Sun G, Hu X, Huang J. The scale and evolutionary significance of horizontal gene transfer in the choanoflagellate Monosiga brevicollis. BMC Genomics 2013; 14:729. [PMID: 24156600 PMCID: PMC4046809 DOI: 10.1186/1471-2164-14-729] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2012] [Accepted: 10/17/2013] [Indexed: 12/29/2022] Open
Abstract
Background It is generally agreed that horizontal gene transfer (HGT) is common in phagotrophic protists. However, the overall scale of HGT and the cumulative impact of acquired genes on the evolution of these organisms remain largely unknown. Results Choanoflagellates are phagotrophs and the closest living relatives of animals. In this study, we performed phylogenomic analyses to investigate the scale of HGT and the evolutionary importance of horizontally acquired genes in the choanoflagellate Monosiga brevicollis. Our analyses identified 405 genes that are likely derived from algae and prokaryotes, accounting for approximately 4.4% of the Monosiga nuclear genome. Many of the horizontally acquired genes identified in Monosiga were probably acquired from food sources, rather than by endosymbiotic gene transfer (EGT) from obsolete endosymbionts or plastids. Of 193 genes identified in our analyses with functional information, 84 (43.5%) are involved in carbohydrate or amino acid metabolism, and 45 (23.3%) are transporters and/or involved in response to oxidative, osmotic, antibiotic, or heavy metal stresses. Some identified genes may also participate in biosynthesis of important metabolites such as vitamins C and K12, porphyrins and phospholipids. Conclusions Our results suggest that HGT is frequent in Monosiga brevicollis and might have contributed substantially to its adaptation and evolution. This finding also highlights the importance of HGT in the genome and organismal evolution of phagotrophic eukaryotes. Electronic supplementary material The online version of this article (doi:10.1186/1471-2164-14-729) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
| | | | | | - Jinling Huang
- Department of Biology, East Carolina University, Greenville, NC 27858, USA.
| |
Collapse
|
152
|
Larremore DB, Clauset A, Buckee CO. A network approach to analyzing highly recombinant malaria parasite genes. PLoS Comput Biol 2013; 9:e1003268. [PMID: 24130474 PMCID: PMC3794903 DOI: 10.1371/journal.pcbi.1003268] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2013] [Accepted: 08/23/2013] [Indexed: 11/18/2022] Open
Abstract
The var genes of the human malaria parasite Plasmodium falciparum present a challenge to population geneticists due to their extreme diversity, which is generated by high rates of recombination. These genes encode a primary antigen protein called PfEMP1, which is expressed on the surface of infected red blood cells and elicits protective immune responses. Var gene sequences are characterized by pronounced mosaicism, precluding the use of traditional phylogenetic tools that require bifurcating tree-like evolutionary relationships. We present a new method that identifies highly variable regions (HVRs), and then maps each HVR to a complex network in which each sequence is a node and two nodes are linked if they share an exact match of significant length. Here, networks of var genes that recombine freely are expected to have a uniformly random structure, but constraints on recombination will produce network communities that we identify using a stochastic block model. We validate this method on synthetic data, showing that it correctly recovers populations of constrained recombination, before applying it to the Duffy Binding Like-α (DBLα) domain of var genes. We find nine HVRs whose network communities map in distinctive ways to known DBLα classifications and clinical phenotypes. We show that the recombinational constraints of some HVRs are correlated, while others are independent. These findings suggest that this micromodular structuring facilitates independent evolutionary trajectories of neighboring mosaic regions, allowing the parasite to retain protein function while generating enormous sequence diversity. Our approach therefore offers a rigorous method for analyzing evolutionary constraints in var genes, and is also flexible enough to be easily applied more generally to any highly recombinant sequences. The human malaria parasite kills nearly 1 million people each year globally. Frequent genetic exchange between malaria parasites creates enormous genetic diversity that largely explains the lack of an effective vaccine for the disease. Traditional phylogenetic tools cannot accommodate this type of diversity, however, and rigorous analytical tools capable of making sense of gene sequences that recombine frequently are still lacking. Here, we use network techniques that have been developed by the physics and network science communities to analyze malaria parasite gene sequences, allowing us to automatically identify highly variable mosaic regions in sequence data and to derive the network of recombination events. We apply our method to seven fully-sequenced parasite genomes, and show that our method provides new insights into the complex evolutionary patterns of the parasite. Our results suggest that the structure of these sequences allows the parasite to rapidly diversify to evade immune responses while maintaining antigen structure and function.
Collapse
Affiliation(s)
- Daniel B. Larremore
- Department of Epidemiology, Harvard School of Public Health, Boston, Massachusetts, United States of America
- Center for Communicable Disease Dynamics, Harvard School of Public Health, Boston, Massachusetts, United States of America
- * E-mail:
| | - Aaron Clauset
- Department of Computer Science, University of Colorado, Boulder, Colorado, United States of America
- BioFrontiers Institute, University of Colorado, Boulder, Colorado, United States of America
- Santa Fe Institute, Santa Fe, New Mexico, United States of America
| | - Caroline O. Buckee
- Department of Epidemiology, Harvard School of Public Health, Boston, Massachusetts, United States of America
- Center for Communicable Disease Dynamics, Harvard School of Public Health, Boston, Massachusetts, United States of America
| |
Collapse
|
153
|
Fish JA, Chai B, Wang Q, Sun Y, Brown CT, Tiedje JM, Cole JR. FunGene: the functional gene pipeline and repository. Front Microbiol 2013; 4:291. [PMID: 24101916 PMCID: PMC3787254 DOI: 10.3389/fmicb.2013.00291] [Citation(s) in RCA: 332] [Impact Index Per Article: 30.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2013] [Accepted: 09/10/2013] [Indexed: 11/29/2022] Open
Abstract
Ribosomal RNA genes have become the standard molecular markers for microbial community analysis for good reasons, including universal occurrence in cellular organisms, availability of large databases, and ease of rRNA gene region amplification and analysis. As markers, however, rRNA genes have some significant limitations. The rRNA genes are often present in multiple copies, unlike most protein-coding genes. The slow rate of change in rRNA genes means that multiple species sometimes share identical 16S rRNA gene sequences, while many more species share identical sequences in the short 16S rRNA regions commonly analyzed. In addition, the genes involved in many important processes are not distributed in a phylogenetically coherent manner, potentially due to gene loss or horizontal gene transfer. While rRNA genes remain the most commonly used markers, key genes in ecologically important pathways, e.g., those involved in carbon and nitrogen cycling, can provide important insights into community composition and function not obtainable through rRNA analysis. However, working with ecofunctional gene data requires some tools beyond those required for rRNA analysis. To address this, our Functional Gene Pipeline and Repository (FunGene; http://fungene.cme.msu.edu/) offers databases of many common ecofunctional genes and proteins, as well as integrated tools that allow researchers to browse these collections and choose subsets for further analysis, build phylogenetic trees, test primers and probes for coverage, and download aligned sequences. Additional FunGene tools are specialized to process coding gene amplicon data. For example, FrameBot produces frameshift-corrected protein and DNA sequences from raw reads while finding the most closely related protein reference sequence. These tools can help provide better insight into microbial communities by directly studying key genes involved in important ecological processes.
Collapse
Affiliation(s)
- Jordan A Fish
- Center for Microbial Ecology, Michigan State University East Lansing, MI, USA ; Department of Computer Science and Engineering, Michigan State University East Lansing, MI, USA
| | | | | | | | | | | | | |
Collapse
|
154
|
Ecological patterns of nifH genes in four terrestrial climatic zones explored with targeted metagenomics using FrameBot, a new informatics tool. mBio 2013; 4:e00592-13. [PMID: 24045641 PMCID: PMC3781835 DOI: 10.1128/mbio.00592-13] [Citation(s) in RCA: 175] [Impact Index Per Article: 15.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022] Open
Abstract
Biological nitrogen fixation is an important component of sustainable soil fertility and a key component of the nitrogen cycle. We used targeted metagenomics to study the nitrogen fixation-capable terrestrial bacterial community by targeting the gene for nitrogenase reductase (nifH). We obtained 1.1 million nifH 454 amplicon sequences from 222 soil samples collected from 4 National Ecological Observatory Network (NEON) sites in Alaska, Hawaii, Utah, and Florida. To accurately detect and correct frameshifts caused by indel sequencing errors, we developed FrameBot, a tool for frameshift correction and nearest-neighbor classification, and compared its accuracy to that of two other rapid frameshift correction tools. We found FrameBot was, in general, more accurate as long as a reference protein sequence with 80% or greater identity to a query was available, as was the case for virtually all nifH reads for the 4 NEON sites. Frameshifts were present in 12.7% of the reads. Those nifH sequences related to the Proteobacteria phylum were most abundant, followed by those for Cyanobacteria in the Alaska and Utah sites. Predominant genera with nifH sequences similar to reads included Azospirillum, Bradyrhizobium, and Rhizobium, the latter two without obvious plant hosts at the sites. Surprisingly, 80% of the sequences had greater than 95% amino acid identity to known nifH gene sequences. These samples were grouped by site and correlated with soil environmental factors, especially drainage, light intensity, mean annual temperature, and mean annual precipitation. FrameBot was tested successfully on three ecofunctional genes but should be applicable to any. High-throughput phylogenetic analysis of microbial communities using rRNA-targeted sequencing is now commonplace; however, such data often allow little inference with respect to either the presence or the diversity of genes involved in most important ecological processes. To study the gene pool for these processes, it is more straightforward to assess the genes directly responsible for the ecological function (ecofunctional genes). However, analyzing these genes involves technical challenges beyond those seen for rRNA. In particular, frameshift errors cause garbled downstream protein translations. Our FrameBot tool described here both corrects frameshift errors in query reads and determines their closest matching protein sequences in a set of reference sequences. We validated this new tool with sequences from defined communities and demonstrated the tool’s utility on nifH gene fragments sequenced from soils in well-characterized and major terrestrial ecosystem types.
Collapse
|
155
|
EGN: a wizard for construction of gene and genome similarity networks. BMC Evol Biol 2013; 13:146. [PMID: 23841456 PMCID: PMC3727994 DOI: 10.1186/1471-2148-13-146] [Citation(s) in RCA: 45] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2013] [Accepted: 07/05/2013] [Indexed: 01/11/2023] Open
Abstract
Background Increasingly, similarity networks are being used for evolutionary analyses of molecular datasets. These networks are very useful, in particular for the analysis of gene sharing, lateral gene transfer and for the detection of distant homologs. Currently, such analyses require some computer programming skills due to the limited availability of user-friendly freely distributed software. Consequently, although appealing, the construction and analyses of these networks remain less familiar to biologists than do phylogenetic approaches. Results In order to ease the use of similarity networks in the community of evolutionary biologists, we introduce a software program, EGN, that runs under Linux or MacOSX. EGN automates the reconstruction of gene and genome networks from nucleic and proteic sequences. EGN also implements statistics describing genetic diversity in these samples, for various user-defined thresholds of similarities. In the interest of studying the complexity of evolutionary processes affecting microbial evolution, we applied EGN to a dataset of 571,044 proteic sequences from the three domains of life and from mobile elements. We observed that, in Borrelia, plasmids play a different role than in most other eubacteria. Rather than being genetic couriers involved in lateral gene transfer, Borrelia’s plasmids and their genes act as private genetic goods, that contribute to the creation of genetic diversity within their parasitic hosts. Conclusion EGN can be used for constructing, analyzing, and mining molecular datasets in evolutionary studies. The program can help increase our knowledge of the processes through which genes from distinct sources and/or from multiple genomes co-evolve in lineages of cellular organisms.
Collapse
|
156
|
Lobkovsky AE, Wolf YI, Koonin EV. Gene frequency distributions reject a neutral model of genome evolution. Genome Biol Evol 2013; 5:233-42. [PMID: 23315380 PMCID: PMC3595032 DOI: 10.1093/gbe/evt002] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
Evolution of prokaryotes involves extensive loss and gain of genes, which lead to substantial differences in the gene repertoires even among closely related organisms. Through a wide range of phylogenetic depths, gene frequency distributions in prokaryotic pangenomes bear a characteristic, asymmetrical U-shape, with a core of (nearly) universal genes, a “shell” of moderately common genes, and a “cloud” of rare genes. We employ mathematical modeling to investigate evolutionary processes that might underlie this universal pattern. Gene frequency distributions for almost 400 groups of 10 bacterial or archaeal species each over a broad range of evolutionary distances were fit to steady-state, infinite allele models based on the distribution of gene replacement rates and the phylogenetic tree relating the species in each group. The fits of the theoretical frequency distributions to the empirical ones yield model parameters and estimates of the goodness of fit. Using the Akaike Information Criterion, we show that the neutral model of genome evolution, with the same replacement rate for all genes, can be confidently rejected. Of the three tested models with purifying selection, the one in which the distribution of replacement rates is derived from a stochastic population model with additive per-gene fitness yields the best fits to the data. The selection strength estimated from the fits declines with evolutionary divergence while staying well outside the neutral regime. These findings indicate that, unlike some other universal distributions of genomic variables, for example, the distribution of paralogous gene family membership, the gene frequency distribution is substantially affected by selection.
Collapse
Affiliation(s)
- Alexander E Lobkovsky
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, USA
| | | | | |
Collapse
|
157
|
Networks: expanding evolutionary thinking. Trends Genet 2013; 29:439-41. [PMID: 23764187 DOI: 10.1016/j.tig.2013.05.007] [Citation(s) in RCA: 98] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2013] [Accepted: 05/14/2013] [Indexed: 11/21/2022]
Abstract
Networks allow the investigation of evolutionary relationships that do not fit a tree model. They are becoming a leading tool for describing the evolutionary relationships between organisms, given the comparative complexities among genomes.
Collapse
|
158
|
Trigui H, Dudyk P, Sum J, Shuman HA, Faucher SP. Analysis of the transcriptome of Legionella pneumophila hfq mutant reveals a new mobile genetic element. MICROBIOLOGY-SGM 2013; 159:1649-1660. [PMID: 23728622 DOI: 10.1099/mic.0.067983-0] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Hfq is a small RNA-binding protein involved in the post-transcriptional regulation of gene expression by affecting the stability of the mRNA and by mediating efficient pairing between small regulatory RNAs and their target mRNAs. In Legionella pneumophila, the aetiological agent of Legionnaires' disease, mutation of hfq results in increased duration of the lag phase and reduced growth in low-iron medium. In an effort to uncover genes potentially regulated by Hfq, the transcriptome of an hfq mutant strain was compared to that of the wild-type. Unexpectedly, many genes located within a 100 kb genomic island, including a section of the previously identified efflux island, were overexpressed in the hfq mutant strain. Since this island contains a putative conjugative system and an integrase, it was postulated that it could be a new integrated mobile genetic element. PCR analysis revealed that this region exists both as an integrated and as an episomal form in the cell population and that it undergoes differential excision in the hfq mutant background, which was further confirmed by trans-complementation of the hfq mutation. This new plasmid-like element was named pLP100. Differential excision did not affect the copy number of pLP100 at the population level. This region contains a copper efflux pump encoded by copA, and increased resistance to copper was observed for the hfq mutant strain that was abrogated in the complemented strain. A strain carrying a mutation of hfq and a deletion of the right side recombination site, attR, showed that overexpression of pLP100 genes and increased copper resistance in the hfq mutant strain were dependent upon excision of pLP100.
Collapse
Affiliation(s)
- Hana Trigui
- Department of Natural Resource Sciences, Faculty of Agricultural and Environmental Sciences, McGill University, Ste-Anne-de-Bellevue, QC, Canada
| | - Paulina Dudyk
- Department of Natural Resource Sciences, Faculty of Agricultural and Environmental Sciences, McGill University, Ste-Anne-de-Bellevue, QC, Canada
| | - Janet Sum
- Department of Microbiology and Immunology, Columbia University Medical Center, New York, NY, USA
| | - Howard A Shuman
- Department of Microbiology and Immunology, Columbia University Medical Center, New York, NY, USA
| | - Sebastien P Faucher
- Department of Natural Resource Sciences, Faculty of Agricultural and Environmental Sciences, McGill University, Ste-Anne-de-Bellevue, QC, Canada
| |
Collapse
|
159
|
Lasek-Nesselquist E, Gogarten JP. The effects of model choice and mitigating bias on the ribosomal tree of life. Mol Phylogenet Evol 2013; 69:17-38. [PMID: 23707703 DOI: 10.1016/j.ympev.2013.05.006] [Citation(s) in RCA: 51] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2013] [Revised: 04/26/2013] [Accepted: 05/08/2013] [Indexed: 01/03/2023]
Abstract
Deep-level relationships within Bacteria, Archaea, and Eukarya as well as the relationships of these three domains to each other require resolution. The ribosomal machinery, universal to all cellular life, represents a protein repertoire resistant to horizontal gene transfer, which provides a largely congruent signal necessary for reconstructing a tree suitable as a backbone for life's reticulate history. Here, we generate a ribosomal tree of life from a robust taxonomic sampling of Bacteria, Archaea, and Eukarya to elucidate deep-level intra-domain and inter-domain relationships. Lack of phylogenetic information and systematic errors caused by inadequate models (that cannot account for substitution rate or compositional heterogeneities) or improper model selection compound conflicting phylogenetic signals from HGT and/or paralogy. Thus, we tested several models of varying sophistication on three different datasets, performed removal of fast-evolving or long-branched Archaea and Eukarya, and employed three different strategies to remove compositional heterogeneity to examine their effects on the topological outcome. Our results support a two-domain topology for the tree of life, where Eukarya emerges from within Archaea as sister to a Korarchaeota/Thaumarchaeota (KT) or Crenarchaeota/KT clade for all models under all or at least one of the strategies employed. Taxonomic manipulation allows single-matrix and certain mixture models to vacillate between two-domain and three-domain phylogenies. We find that models vary in their ability to resolve different areas of the tree of life, which does not necessarily correlate with model complexity. For example, both single-matrix and some mixture models recover monophyletic Crenarchaeota and Euryarchaeota archaeal phyla. In contrast, the most sophisticated model recovers a paraphyletic Euryarchaeota but detects two large clades that comprise the Bacteria, which were recovered separately but never together in the other models. Overall, models recovered consistent topologies despite dataset modifications due to the removal of compositional bias, which reflects either ineffective bias reduction or robust datasets that allow models to overcome reconstruction artifacts. We recommend a comparative approach for evolutionary models to identify model weaknesses as well as consensus relationships.
Collapse
|
160
|
Lang JM, Darling AE, Eisen JA. Phylogeny of bacterial and archaeal genomes using conserved genes: supertrees and supermatrices. PLoS One 2013; 8:e62510. [PMID: 23638103 PMCID: PMC3636077 DOI: 10.1371/journal.pone.0062510] [Citation(s) in RCA: 92] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2012] [Accepted: 03/26/2013] [Indexed: 11/29/2022] Open
Abstract
Over 3000 microbial (bacterial and archaeal) genomes have been made publically available to date, providing an unprecedented opportunity to examine evolutionary genomic trends and offering valuable reference data for a variety of other studies such as metagenomics. The utility of these genome sequences is greatly enhanced when we have an understanding of how they are phylogenetically related to each other. Therefore, we here describe our efforts to reconstruct the phylogeny of all available bacterial and archaeal genomes. We identified 24, single-copy, ubiquitous genes suitable for this phylogenetic analysis. We used two approaches to combine the data for the 24 genes. First, we concatenated alignments of all genes into a single alignment from which a Maximum Likelihood (ML) tree was inferred using RAxML. Second, we used a relatively new approach to combining gene data, Bayesian Concordance Analysis (BCA), as implemented in the BUCKy software, in which the results of 24 single-gene phylogenetic analyses are used to generate a "primary concordance" tree. A comparison of the concatenated ML tree and the primary concordance (BUCKy) tree reveals that the two approaches give similar results, relative to a phylogenetic tree inferred from the 16S rRNA gene. After comparing the results and the methods used, we conclude that the current best approach for generating a single phylogenetic tree, suitable for use as a reference phylogeny for comparative analyses, is to perform a maximum likelihood analysis of a concatenated alignment of conserved, single-copy genes.
Collapse
Affiliation(s)
- Jenna Morgan Lang
- Department of Medical Microbiology and Immunology and Department of Evolution and Ecology, University of California Davis, Davis, California, United States of America
- Department of Energy Joint Genome Institute, Walnut Creek, California, United States of America
| | - Aaron E. Darling
- Department of Medical Microbiology and Immunology and Department of Evolution and Ecology, University of California Davis, Davis, California, United States of America
| | - Jonathan A. Eisen
- Department of Medical Microbiology and Immunology and Department of Evolution and Ecology, University of California Davis, Davis, California, United States of America
- Department of Energy Joint Genome Institute, Walnut Creek, California, United States of America
| |
Collapse
|
161
|
Hsia CCW, Schmitz A, Lambertz M, Perry SF, Maina JN. Evolution of air breathing: oxygen homeostasis and the transitions from water to land and sky. Compr Physiol 2013; 3:849-915. [PMID: 23720333 PMCID: PMC3926130 DOI: 10.1002/cphy.c120003] [Citation(s) in RCA: 103] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Life originated in anoxia, but many organisms came to depend upon oxygen for survival, independently evolving diverse respiratory systems for acquiring oxygen from the environment. Ambient oxygen tension (PO2) fluctuated through the ages in correlation with biodiversity and body size, enabling organisms to migrate from water to land and air and sometimes in the opposite direction. Habitat expansion compels the use of different gas exchangers, for example, skin, gills, tracheae, lungs, and their intermediate stages, that may coexist within the same species; coexistence may be temporally disjunct (e.g., larval gills vs. adult lungs) or simultaneous (e.g., skin, gills, and lungs in some salamanders). Disparate systems exhibit similar directions of adaptation: toward larger diffusion interfaces, thinner barriers, finer dynamic regulation, and reduced cost of breathing. Efficient respiratory gas exchange, coupled to downstream convective and diffusive resistances, comprise the "oxygen cascade"-step-down of PO2 that balances supply against toxicity. Here, we review the origin of oxygen homeostasis, a primal selection factor for all respiratory systems, which in turn function as gatekeepers of the cascade. Within an organism's lifespan, the respiratory apparatus adapts in various ways to upregulate oxygen uptake in hypoxia and restrict uptake in hyperoxia. In an evolutionary context, certain species also become adapted to environmental conditions or habitual organismic demands. We, therefore, survey the comparative anatomy and physiology of respiratory systems from invertebrates to vertebrates, water to air breathers, and terrestrial to aerial inhabitants. Through the evolutionary directions and variety of gas exchangers, their shared features and individual compromises may be appreciated.
Collapse
Affiliation(s)
- Connie C W Hsia
- Department of Internal Medicine, University of Texas Southwestern Medical Center, Dallas, Texas, USA.
| | | | | | | | | |
Collapse
|
162
|
Sun BF, Xiao JH, He S, Liu L, Murphy RW, Huang DW. Multiple interkingdom horizontal gene transfers in Pyrenophora and closely related species and their contributions to phytopathogenic lifestyles. PLoS One 2013; 8:e60029. [PMID: 23555871 PMCID: PMC3612039 DOI: 10.1371/journal.pone.0060029] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2012] [Accepted: 02/20/2013] [Indexed: 12/13/2022] Open
Abstract
Many studies have reported horizontal gene transfer (HGT) events from eukaryotes, especially fungi. However, only a few investigations summarized multiple interkingdom HGTs involving important phytopathogenic species of Pyrenophora and few have investigated the genetic contributions of HGTs to fungi. We investigated HGT events in P. teres and P. tritici-repentis and discovered that both species harbored 14 HGT genes derived from bacteria and plants, including 12 HGT genes that occurred in both species. One gene coding a leucine-rich repeat protein was present in both species of Pyrenophora and it may have been transferred from a host plant. The transfer of genes from a host plant to pathogenic fungi has been reported rarely and we discovered the first evidence for this transfer in phytopathogenic Pyrenophora. Two HGTs in Pyrenophora underwent subsequent duplications. Some HGT genes had homologs in a few other fungi, indicating relatively ancient transfer events. Functional analyses indicated that half of the HGT genes encoded extracellular proteins and these may have facilitated the infection of plants by Pyrenophora via interference with plant defense-response and the degradation of plant cell walls. Some other HGT genes appeared to participate in carbohydrate metabolism. Together, these functions implied that HGTs may have led to highly efficient mechanisms of infection as well as the utilization of host carbohydrates. Evolutionary analyses indicated that HGT genes experienced amelioration, purifying selection, and accelerated evolution. These appeared to constitute adaptations to the background genome of the recipient. The discovery of multiple interkingdom HGTs in Pyrenophora, their significance to infection, and their adaptive evolution, provided valuable insights into the evolutionary significance of interkingdom HGTs from multiple donors.
Collapse
Affiliation(s)
- Bao-Fa Sun
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- University of the Chinese Academy of Sciences, Beijing, China
| | - Jin-Hua Xiao
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
| | - Shunmin He
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
| | - Li Liu
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
| | - Robert W. Murphy
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China
- Centre for Biodiversity and Conservation Biology, Royal Ontario Museum, Toronto, Canada
| | - Da-Wei Huang
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- College of Plant Protection, Shandong Agricultural University, Tai'an, Shandong, China
- * E-mail:
| |
Collapse
|
163
|
Stecher B, Maier L, Hardt WD. 'Blooming' in the gut: how dysbiosis might contribute to pathogen evolution. Nat Rev Microbiol 2013; 11:277-84. [DOI: 10.1038/nrmicro2989] [Citation(s) in RCA: 236] [Impact Index Per Article: 21.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
|
164
|
Bapteste E, Dupré J. Towards a processual microbial ontology. BIOLOGY & PHILOSOPHY 2013; 28:379-404. [PMID: 23487350 PMCID: PMC3591535 DOI: 10.1007/s10539-012-9350-2] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/16/2012] [Accepted: 10/17/2012] [Indexed: 05/26/2023]
Abstract
Standard microbial evolutionary ontology is organized according to a nested hierarchy of entities at various levels of biological organization. It typically detects and defines these entities in relation to the most stable aspects of evolutionary processes, by identifying lineages evolving by a process of vertical inheritance from an ancestral entity. However, recent advances in microbiology indicate that such an ontology has important limitations. The various dynamics detected within microbiological systems reveal that a focus on the most stable entities (or features of entities) over time inevitably underestimates the extent and nature of microbial diversity. These dynamics are not the outcome of the process of vertical descent alone. Other processes, often involving causal interactions between entities from distinct levels of biological organisation, or operating at different time scales, are responsible not only for the destabilisation of pre-existing entities, but also for the emergence and stabilisation of novel entities in the microbial world. In this article we consider microbial entities as more or less stabilised functional wholes, and sketch a network-based ontology that can represent a diverse set of processes including, for example, as well as phylogenetic relations, interactions that stabilise or destabilise the interacting entities, spatial relations, ecological connections, and genetic exchanges. We use this pluralistic framework for evaluating (i) the existing ontological assumptions in evolution (e.g. whether currently recognized entities are adequate for understanding the causes of change and stabilisation in the microbial world), and (ii) for identifying hidden ontological kinds, essentially invisible from within a more limited perspective. We propose to recognize additional classes of entities that provide new insights into the structure of the microbial world, namely "processually equivalent" entities, "processually versatile" entities, and "stabilized" entities.
Collapse
Affiliation(s)
- Eric Bapteste
- />UMR CNRS 7138, Université Pierre et Marie Curie, 75005 Paris, France
| | - John Dupré
- />ESRC Centre for Genomics in Society (Egenis), University of Exeter, Exeter, UK
| |
Collapse
|
165
|
Dalquen DA, Altenhoff AM, Gonnet GH, Dessimoz C. The impact of gene duplication, insertion, deletion, lateral gene transfer and sequencing error on orthology inference: a simulation study. PLoS One 2013; 8:e56925. [PMID: 23451112 PMCID: PMC3581572 DOI: 10.1371/journal.pone.0056925] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2012] [Accepted: 01/16/2013] [Indexed: 11/19/2022] Open
Abstract
The identification of orthologous genes, a prerequisite for numerous analyses in comparative and functional genomics, is commonly performed computationally from protein sequences. Several previous studies have compared the accuracy of orthology inference methods, but simulated data has not typically been considered in cross-method assessment studies. Yet, while dependent on model assumptions, simulation-based benchmarking offers unique advantages: contrary to empirical data, all aspects of simulated data are known with certainty. Furthermore, the flexibility of simulation makes it possible to investigate performance factors in isolation of one another.Here, we use simulated data to dissect the performance of six methods for orthology inference available as standalone software packages (Inparanoid, OMA, OrthoInspector, OrthoMCL, QuartetS, SPIMAP) as well as two generic approaches (bidirectional best hit and reciprocal smallest distance). We investigate the impact of various evolutionary forces (gene duplication, insertion, deletion, and lateral gene transfer) and technological artefacts (ambiguous sequences) on orthology inference. We show that while gene duplication/loss and insertion/deletion are well handled by most methods (albeit for different trade-offs of precision and recall), lateral gene transfer disrupts all methods. As for ambiguous sequences, which might result from poor sequencing, assembly, or genome annotation, we show that they affect alignment score-based orthology methods more strongly than their distance-based counterparts.
Collapse
Affiliation(s)
- Daniel A. Dalquen
- Eldgenössische Technische Hochschule Zurich, Department of Computer Science, Zürich, Switzerland
- Swiss Institute of Bioinformatics, Zürich, Switzerland
| | - Adrian M. Altenhoff
- Eldgenössische Technische Hochschule Zurich, Department of Computer Science, Zürich, Switzerland
- Swiss Institute of Bioinformatics, Zürich, Switzerland
| | - Gaston H. Gonnet
- Eldgenössische Technische Hochschule Zurich, Department of Computer Science, Zürich, Switzerland
- Swiss Institute of Bioinformatics, Zürich, Switzerland
| | - Christophe Dessimoz
- Eldgenössische Technische Hochschule Zurich, Department of Computer Science, Zürich, Switzerland
- Swiss Institute of Bioinformatics, Zürich, Switzerland
- European Bioinformatics Institute, Hinxton, Cambridge, United Kingdom
| |
Collapse
|
166
|
Wiles TJ, Norton JP, Smith SN, Lewis AJ, Mobley HLT, Casjens SR, Mulvey MA. A phyletically rare gene promotes the niche-specific fitness of an E. coli pathogen during bacteremia. PLoS Pathog 2013; 9:e1003175. [PMID: 23459509 PMCID: PMC3573123 DOI: 10.1371/journal.ppat.1003175] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2012] [Accepted: 12/19/2012] [Indexed: 12/17/2022] Open
Abstract
In bacteria, laterally acquired genes are often concentrated within chromosomal regions known as genomic islands. Using a recently developed zebrafish infection model, we set out to identify unique factors encoded within genomic islands that contribute to the fitness and virulence of a reference urosepsis isolate—extraintestinal pathogenic Escherichia coli strain CFT073. By screening a series of deletion mutants, we discovered a previously uncharacterized gene, neaT, that is conditionally required by the pathogen during systemic infections. In vitro assays indicate that neaT can limit bacterial interactions with host phagocytes and alter the aggregative properties of CFT073. The neaT gene is localized within an integrated P2-like bacteriophage in CFT073, but was rarely found within other proteobacterial genomes. Sequence-based analyses revealed that neaT homologues are present, but discordantly conserved, within a phyletically diverse set of bacterial species. In CFT073, neaT appears to be unameliorated, having an exceptionally A+T-rich composition along with a notably altered codon bias. These data suggest that neaT was recently brought into the proteobacterial pan-genome from an extra-phyletic source. Interestingly, even in G+C-poor genomes, as found within the Firmicutes lineage, neaT-like genes are often unameliorated. Sequence-level features of neaT homologues challenge the common supposition that the A+T-rich nature of many recently acquired genes reflects the nucleotide composition of their genomes of origin. In total, these findings highlight the complexity of the evolutionary forces that can affect the acquisition, utilization, and assimilation of rare genes that promote the niche-dependent fitness and virulence of a bacterial pathogen. Bacterial pathogens, even those belonging to the same species, can be incredibly diverse with regard to the genes they carry. However, the design of vaccines and antibiotics typically relies upon identification of general molecular features shared by the targeted organisms. Thus, we have traditionally focused on broadly conserved characteristics of pathogenic bacteria, often ignoring the genes that account for their individuality. In this article we report the discovery of a unique gene, neaT, that promotes the fitness of a pathogenic Escherichia coli isolate in zebrafish and mouse models of systemic blood infections. Surprisingly, neaT is rarely found in other related strains of E. coli and appears to have been recently acquired from distant lineages of bacteria via a process known as ‘lateral gene transfer’ that is used by microbes to swap genetic material. Expression of the neaT gene appears to help pathogens avoid interactions with host immune cells, possibly by altering bacterial surface structures. This work provides an interesting example of how the lateral acquisition of a rare gene can impact the niche-specific virulence properties of a pathogen, shedding light on the mechanisms that drive pathogen evolution and diversity.
Collapse
Affiliation(s)
- Travis J. Wiles
- Division of Microbiology and Immunology, Pathology Department, University of Utah School of Medicine, Salt Lake City, Utah, United States of America
| | - J. Paul Norton
- Division of Microbiology and Immunology, Pathology Department, University of Utah School of Medicine, Salt Lake City, Utah, United States of America
| | - Sara N. Smith
- Department of Microbiology and Immunology, University of Michigan Medical School, Ann Arbor, Michigan, United States of America
| | - Adam J. Lewis
- Division of Microbiology and Immunology, Pathology Department, University of Utah School of Medicine, Salt Lake City, Utah, United States of America
| | - Harry L. T. Mobley
- Department of Microbiology and Immunology, University of Michigan Medical School, Ann Arbor, Michigan, United States of America
| | - Sherwood R. Casjens
- Division of Microbiology and Immunology, Pathology Department, University of Utah School of Medicine, Salt Lake City, Utah, United States of America
| | - Matthew A. Mulvey
- Division of Microbiology and Immunology, Pathology Department, University of Utah School of Medicine, Salt Lake City, Utah, United States of America
- * E-mail:
| |
Collapse
|
167
|
Affiliation(s)
- David P. Mindell
- Department of Biochemistry & Biophysics, University of California, San Francisco, CA 94158, USA
| |
Collapse
|
168
|
Sun BF, Xiao JH, He SM, Liu L, Murphy RW, Huang DW. Multiple ancient horizontal gene transfers and duplications in lepidopteran species. INSECT MOLECULAR BIOLOGY 2013; 22:72-87. [PMID: 23211014 DOI: 10.1111/imb.12004] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/01/2023]
Abstract
Eukaryotic horizontal gene transfer (HGT) events are increasingly being discovered yet few reports have summarized multiple occurrences in a wide range of species. We systematically investigated HGT events in the order Lepidoptera by employing a series of filters. Bombyx mori, Danaus plexippus and Heliconius melpomene had 13, 12 and 12 HGTs, respectively, from bacteria and fungi. These HGTs contributed a total of 64 predicted genes: 22 to B. mori, 22 to D. plexippus and 20 to H. melpomene. Several new genes were generated by post-transfer duplications. Post-transfer duplication of a suite of functional HGTs has rarely been reported in higher organisms. The distributional patterns of paralogues for certain genes differed in the three species, indicating potential independent duplication or loss events. All of these HGTs had homologues expressed in some other lepidopterans, indicating ancient transfer events. Most HGTs were involved in the metabolism of sugar and amino acids. These HGTs appeared to have experienced amelioration, purifying selection and accelerated evolution to adapt to the background genome of the recipient. The discovery of ancient, massive HGTs and duplications in lepidopterans and their adaptive evolution provides further insights into the evolutionary significance of the events from donors to multicellular host recipients.
Collapse
Affiliation(s)
- B F Sun
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
| | | | | | | | | | | |
Collapse
|
169
|
Deryusheva EI, Selivanova OM, Serdyuk IN. Loops and repeats in proteins as footprints of molecular evolution. BIOCHEMISTRY (MOSCOW) 2013; 77:1487-99. [DOI: 10.1134/s000629791213007x] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
|
170
|
Dagan T, Roettger M, Stucken K, Landan G, Koch R, Major P, Gould SB, Goremykin VV, Rippka R, Tandeau de Marsac N, Gugger M, Lockhart PJ, Allen JF, Brune I, Maus I, Pühler A, Martin WF. Genomes of Stigonematalean cyanobacteria (subsection V) and the evolution of oxygenic photosynthesis from prokaryotes to plastids. Genome Biol Evol 2013; 5:31-44. [PMID: 23221676 PMCID: PMC3595030 DOI: 10.1093/gbe/evs117] [Citation(s) in RCA: 150] [Impact Index Per Article: 13.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/04/2012] [Indexed: 01/12/2023] Open
Abstract
Cyanobacteria forged two major evolutionary transitions with the invention of oxygenic photosynthesis and the bestowal of photosynthetic lifestyle upon eukaryotes through endosymbiosis. Information germane to understanding those transitions is imprinted in cyanobacterial genomes, but deciphering it is complicated by lateral gene transfer (LGT). Here, we report genome sequences for the morphologically most complex true-branching cyanobacteria, and for Scytonema hofmanni PCC 7110, which with 12,356 proteins is the most gene-rich prokaryote currently known. We investigated components of cyanobacterial evolution that have been vertically inherited, horizontally transferred, and donated to eukaryotes at plastid origin. The vertical component indicates a freshwater origin for water-splitting photosynthesis. Networks of the horizontal component reveal that 60% of cyanobacterial gene families have been affected by LGT. Plant nuclear genes acquired from cyanobacteria define a lower bound frequency of 611 multigene families that, in turn, specify diazotrophic cyanobacterial lineages as having a gene collection most similar to that possessed by the plastid ancestor.
Collapse
Affiliation(s)
- Tal Dagan
- Institute of Genomic Microbiology, Heinrich-Heine-University Düsseldorf, Düsseldorf, Germany.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
171
|
Rogers EE, Stenger DC. A conjugative 38 kB plasmid is present in multiple subspecies of Xylella fastidiosa. PLoS One 2012; 7:e52131. [PMID: 23251694 PMCID: PMC3522642 DOI: 10.1371/journal.pone.0052131] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2012] [Accepted: 11/13/2012] [Indexed: 11/18/2022] Open
Abstract
A ≈ 38kB plasmid (pXF-RIV5) was present in the Riv5 strain of Xylella fastidiosa subsp. multiplex isolated from ornamental plum in southern California. The complete nucleotide sequence of pXF-RIV5 is almost identical to that of pXFAS01 from X. fastidiosa subsp. fastidiosa strain M23; the two plasmids vary at only 6 nucleotide positions. BLAST searches and phylogenetic analyses indicate pXF-RIV5 and pXFAS01 share some similarity to chromosomal and plasmid (pXF51) sequences of X. fastidiosa subsp. pauca strain 9a5c and more distant similarity to plasmids from a wide variety of bacteria. Both pXF-RIV5 and pXFAS01 encode homologues of a complete Type IV secretion system involved in conjugation and DNA transfer among bacteria. Mating pair formation proteins (Trb) from Yersinia pseudotuberculosis IP31758 are the mostly closely related non-X. fastidiosa proteins to most of the Trb proteins encoded by pXF-RIV5 and pXFAS01. Unlike many bacterial conjugative plasmids, pXF-RIV5 and pXFAS01 do not carry homologues of known accessory modules that confer selective advantage on host bacteria. However, both plasmids encode seven hypothetical proteins of unknown function and possess a small transposon-associated region encoding a putative transposase and associated factor. Vegetative replication of pXF-RIV5 and pXFAS01 appears to be under control of RepA protein and both plasmids have an origin of DNA replication (oriV) similar to that of pRP4 and pR751 from Escherichia coli. In contrast, conjugative plasmids commonly encode TrfA and have an oriV similar to those found in IncP-1 incompatibility group plasmids. The presence of nearly identical plasmids in single strains from two distinct subspecies of X. fastidiosa is indicative of recent horizontal transfer, probably subsequent to the introduction of subspecies fastidiosa to the United States in the late 19(th) century.
Collapse
Affiliation(s)
- Elizabeth E Rogers
- United States Department of Agriculture, Agricultural Research Service, Parlier, California, USA.
| | | |
Collapse
|
172
|
Rudi K, Sekelja M. High or low correlation between co-occuring gene clusters and 16S rRNA gene phylogeny. FEMS Microbiol Lett 2012; 339:23-9. [DOI: 10.1111/1574-6968.12042] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2012] [Revised: 11/01/2012] [Accepted: 11/01/2012] [Indexed: 11/26/2022] Open
Affiliation(s)
- Knut Rudi
- Department of Chemistry, Biotechnology and Food Science; Norwegian University for Life Sciences; Ås; Norway
| | | |
Collapse
|
173
|
Acquisition of 1,000 eubacterial genes physiologically transformed a methanogen at the origin of Haloarchaea. Proc Natl Acad Sci U S A 2012. [PMID: 23184964 DOI: 10.1073/pnas.1209119109] [Citation(s) in RCA: 167] [Impact Index Per Article: 13.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Archaebacterial halophiles (Haloarchaea) are oxygen-respiring heterotrophs that derive from methanogens--strictly anaerobic, hydrogen-dependent autotrophs. Haloarchaeal genomes are known to have acquired, via lateral gene transfer (LGT), several genes from eubacteria, but it is yet unknown how many genes the Haloarchaea acquired in total and, more importantly, whether independent haloarchaeal lineages acquired their genes in parallel, or as a single acquisition at the origin of the group. Here we have studied 10 haloarchaeal and 1,143 reference genomes and have identified 1,089 haloarchaeal gene families that were acquired by a methanogenic recipient from eubacteria. The data suggest that these genes were acquired in the haloarchaeal common ancestor, not in parallel in independent haloarchaeal lineages, nor in the common ancestor of haloarchaeans and methanosarcinales. The 1,089 acquisitions include genes for catabolic carbon metabolism, membrane transporters, menaquinone biosynthesis, and complexes I-IV of the eubacterial respiratory chain that functions in the haloarchaeal membrane consisting of diphytanyl isoprene ether lipids. LGT on a massive scale transformed a strictly anaerobic, chemolithoautotrophic methanogen into the heterotrophic, oxygen-respiring, and bacteriorhodopsin-photosynthetic haloarchaeal common ancestor.
Collapse
|
174
|
Low rates of lateral gene transfer among metabolic genes define the evolving biogeochemical niches of archaea through deep time. ARCHAEA-AN INTERNATIONAL MICROBIOLOGICAL JOURNAL 2012; 2012:843539. [PMID: 23226971 PMCID: PMC3512248 DOI: 10.1155/2012/843539] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/31/2012] [Revised: 09/02/2012] [Accepted: 10/02/2012] [Indexed: 01/26/2023]
Abstract
Phylogenomic analyses of archaeal genome sequences are providing windows into the group's evolutionary past, even though most archaeal taxa lack a conventional fossil record. Here, phylogenetic analyses were performed using key metabolic genes that define the metabolic niche of microorganisms. Such genes are generally considered to have undergone high rates of lateral gene transfer. Many gene sequences formed clades that were identical, or similar, to the tree constructed using large numbers of genes from the stable core of the genome. Surprisingly, such lateral transfer events were readily identified and quantifiable, occurring only a relatively small number of times in the archaeal domain of life. By placing gene acquisition events into a temporal framework, the rates by which new metabolic genes were acquired can be quantified. The highest lateral transfer rates were among cytochrome oxidase genes that use oxygen as a terminal electron acceptor (with a total of 12–14 lateral transfer events, or 3.4–4.0 events per billion years, across the entire archaeal domain). Genes involved in sulfur or nitrogen metabolism had much lower rates, on the order of one lateral transfer event per billion years. This suggests that lateral transfer rates of key metabolic proteins are rare and not rampant.
Collapse
|
175
|
Evolutionary analyses of non-genealogical bonds produced by introgressive descent. Proc Natl Acad Sci U S A 2012; 109:18266-72. [PMID: 23090996 DOI: 10.1073/pnas.1206541109] [Citation(s) in RCA: 56] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
All evolutionary biologists are familiar with evolutionary units that evolve by vertical descent in a tree-like fashion in single lineages. However, many other kinds of processes contribute to evolutionary diversity. In vertical descent, the genetic material of a particular evolutionary unit is propagated by replication inside its own lineage. In what we call introgressive descent, the genetic material of a particular evolutionary unit propagates into different host structures and is replicated within these host structures. Thus, introgressive descent generates a variety of evolutionary units and leaves recognizable patterns in resemblance networks. We characterize six kinds of evolutionary units, of which five involve mosaic lineages generated by introgressive descent. To facilitate detection of these units in resemblance networks, we introduce terminology based on two notions, P3s (subgraphs of three nodes: A, B, and C) and mosaic P3s, and suggest an apparatus for systematic detection of introgressive descent. Mosaic P3s correspond to a distinct type of evolutionary bond that is orthogonal to the bonds of kinship and genealogy usually examined by evolutionary biologists. We argue that recognition of these evolutionary bonds stimulates radical rethinking of key questions in evolutionary biology (e.g., the relations among evolutionary players in very early phases of evolutionary history, the origin and emergence of novelties, and the production of new lineages). This line of research will expand the study of biological complexity beyond the usual genealogical bonds, revealing additional sources of biodiversity. It provides an important step to a more realistic pluralist treatment of evolutionary complexity.
Collapse
|
176
|
Zhou W, Nakhleh L. Convergent evolution of modularity in metabolic networks through different community structures. BMC Evol Biol 2012; 12:181. [PMID: 22974099 PMCID: PMC3534581 DOI: 10.1186/1471-2148-12-181] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2012] [Accepted: 08/09/2012] [Indexed: 01/01/2023] Open
Abstract
Background It has been reported that the modularity of metabolic networks of bacteria is closely related to the variability of their living habitats. However, given the dependency of the modularity score on the community structure, it remains unknown whether organisms achieve certain modularity via similar or different community structures. Results In this work, we studied the relationship between similarities in modularity scores and similarities in community structures of the metabolic networks of 1021 species. Both similarities are then compared against the genetic distances. We revisited the association between modularity and variability of the microbial living environments and extended the analysis to other aspects of their life style such as temperature and oxygen requirements. We also tested both topological and biological intuition of the community structures identified and investigated the extent of their conservation with respect to the taxomony. Conclusions We find that similar modularities are realized by different community structures. We find that such convergent evolution of modularity is closely associated with the number of (distinct) enzymes in the organism’s metabolome, a consequence of different life styles of the species. We find that the order of modularity is the same as the order of the number of the enzymes under the classification based on the temperature preference but not on the oxygen requirement. Besides, inspection of modularity-based communities reveals that these communities are graph-theoretically meaningful yet not reflective of specific biological functions. From an evolutionary perspective, we find that the community structures are conserved only at the level of kingdoms. Our results call for more investigation into the interplay between evolution and modularity: how evolution shapes modularity, and how modularity affects evolution (mainly in terms of fitness and evolvability). Further, our results call for exploring new measures of modularity and network communities that better correspond to functional categorizations.
Collapse
Affiliation(s)
- Wanding Zhou
- Department of Bioengineering, Rice University, Houston, TX, USA.
| | | |
Collapse
|
177
|
Koonin EV, Wolf YI. Evolution of microbes and viruses: a paradigm shift in evolutionary biology? Front Cell Infect Microbiol 2012; 2:119. [PMID: 22993722 PMCID: PMC3440604 DOI: 10.3389/fcimb.2012.00119] [Citation(s) in RCA: 83] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2012] [Accepted: 08/27/2012] [Indexed: 01/21/2023] Open
Abstract
When Charles Darwin formulated the central principles of evolutionary biology in the Origin of Species in 1859 and the architects of the Modern Synthesis integrated these principles with population genetics almost a century later, the principal if not the sole objects of evolutionary biology were multicellular eukaryotes, primarily animals and plants. Before the advent of efficient gene sequencing, all attempts to extend evolutionary studies to bacteria have been futile. Sequencing of the rRNA genes in thousands of microbes allowed the construction of the three- domain “ribosomal Tree of Life” that was widely thought to have resolved the evolutionary relationships between the cellular life forms. However, subsequent massive sequencing of numerous, complete microbial genomes revealed novel evolutionary phenomena, the most fundamental of these being: (1) pervasive horizontal gene transfer (HGT), in large part mediated by viruses and plasmids, that shapes the genomes of archaea and bacteria and call for a radical revision (if not abandonment) of the Tree of Life concept, (2) Lamarckian-type inheritance that appears to be critical for antivirus defense and other forms of adaptation in prokaryotes, and (3) evolution of evolvability, i.e., dedicated mechanisms for evolution such as vehicles for HGT and stress-induced mutagenesis systems. In the non-cellular part of the microbial world, phylogenomics and metagenomics of viruses and related selfish genetic elements revealed enormous genetic and molecular diversity and extremely high abundance of viruses that come across as the dominant biological entities on earth. Furthermore, the perennial arms race between viruses and their hosts is one of the defining factors of evolution. Thus, microbial phylogenomics adds new dimensions to the fundamental picture of evolution even as the principle of descent with modification discovered by Darwin and the laws of population genetics remain at the core of evolutionary biology.
Collapse
Affiliation(s)
- Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health Bethesda, MD, USA.
| | | |
Collapse
|
178
|
Friedman R, Ely B. Codon usage methods for horizontal gene transfer detection generate an abundance of false positive and false negative results. Curr Microbiol 2012; 65:639-42. [PMID: 23010940 DOI: 10.1007/s00284-012-0205-5] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2012] [Accepted: 07/07/2012] [Indexed: 11/24/2022]
Abstract
Bacteria acquire new DNA in a process known as horizontal gene transfer (HGT). To investigate the evolutionary impact of this transfer of DNA, various methods have been developed to detect past HGT events. For example, codon usage-based methods detect the presence of transferred genes by identifying atypical patterns of codon usage. However, some inherited genes exhibit atypical codon usage and some transferred genes have codon usage patterns similar to those of the inherited genes. In this study, we used a comparative phylogenetic approach with Methylobacterium and Caulobacter species to demonstrate that even well-designed codon usage methods fail to detect many HGT events and generate a high rate of false positives (60-75 %) and false negatives (23-61 %). Therefore, we recommend caution when employing codon usage methods to identify transferred genes and suggest that the rapidly increasing availability of bacterial genome sequences makes the phylogenetic approach the method of choice.
Collapse
Affiliation(s)
- Robert Friedman
- Department of Biological Sciences, University of South Carolina, Columbia, SC 29208, USA.
| | | |
Collapse
|
179
|
Bhandari V, Naushad HS, Gupta RS. Protein based molecular markers provide reliable means to understand prokaryotic phylogeny and support Darwinian mode of evolution. Front Cell Infect Microbiol 2012; 2:98. [PMID: 22919687 PMCID: PMC3417386 DOI: 10.3389/fcimb.2012.00098] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2012] [Accepted: 06/27/2012] [Indexed: 11/20/2022] Open
Abstract
The analyses of genome sequences have led to the proposal that lateral gene transfers (LGTs) among prokaryotes are so widespread that they disguise the interrelationships among these organisms. This has led to questioning of whether the Darwinian model of evolution is applicable to prokaryotic organisms. In this review, we discuss the usefulness of taxon-specific molecular markers such as conserved signature indels (CSIs) and conserved signature proteins (CSPs) for understanding the evolutionary relationships among prokaryotes and to assess the influence of LGTs on prokaryotic evolution. The analyses of genomic sequences have identified large numbers of CSIs and CSPs that are unique properties of different groups of prokaryotes ranging from phylum to genus levels. The species distribution patterns of these molecular signatures strongly support a tree-like vertical inheritance of the genes containing these molecular signatures that is consistent with phylogenetic trees. Recent detailed studies in this regard on the Thermotogae and Archaea, which are reviewed here, have identified large numbers of CSIs and CSPs that are specific for the species from these two taxa and a number of their major clades. The genetic changes responsible for these CSIs (and CSPs) initially likely occurred in the common ancestors of these taxa and then vertically transferred to various descendants. Although some CSIs and CSPs in unrelated groups of prokaryotes were identified, their small numbers and random occurrence has no apparent influence on the consistent tree-like branching pattern emerging from other markers. These results provide evidence that although LGT is an important evolutionary force, it does not mask the tree-like branching pattern of prokaryotes or understanding of their evolutionary relationships. The identified CSIs and CSPs also provide novel and highly specific means for identification of different groups of microbes and for taxonomical and biochemical studies.
Collapse
Affiliation(s)
- Vaibhav Bhandari
- Department of Biochemistry and Biomedical Sciences, McMaster University Hamilton, ON, Canada
| | | | | |
Collapse
|
180
|
The role of reticulate evolution in creating innovation and complexity. INTERNATIONAL JOURNAL OF EVOLUTIONARY BIOLOGY 2012; 2012:418964. [PMID: 22844638 PMCID: PMC3403396 DOI: 10.1155/2012/418964] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/03/2012] [Revised: 05/08/2012] [Accepted: 05/10/2012] [Indexed: 12/31/2022]
Abstract
Reticulate evolution encompasses processes that conflict with traditional Tree of Life efforts. These processes, horizontal gene transfer (HGT), gene and whole-genome duplications through allopolyploidization, are some of the main driving forces for generating innovation and complexity. HGT has a profound impact on prokaryotic and eukaryotic evolution. HGTs can lead to the invention of new metabolic pathways and the expansion and enhancement of previously existing pathways. It allows for organismal adaptation into new ecological niches and new host ranges. Although many HGTs appear to be selected for because they provide some benefit to their recipient lineage, other HGTs may be maintained by chance through random genetic drift. Moreover, some HGTs that may initially seem parasitic in nature can cause complexity to arise through pathways of neutral evolution. Another mechanism for generating innovation and complexity, occurring more frequently in eukaryotes than in prokaryotes, is gene and genome duplications, which often occur through allopolyploidizations. We discuss how these different evolutionary processes contribute to generating innovation and complexity.
Collapse
|
181
|
Willson SJ. CSD homomorphisms between phylogenetic networks. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2012; 9:1128-1138. [PMID: 22487988 DOI: 10.1109/tcbb.2012.52] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
Since Darwin, species trees have been used as a simplified description of the relationships which summarize the complicated network N of reality. Recent evidence of hybridization and lateral gene transfer, however, suggest that there are situations where trees are inadequate. Consequently it is important to determine properties that characterize networks closely related to N and possibly more complicated than trees but lacking the full complexity of N. A connected surjective digraph map (CSD) is a map f from one network N to another network M such that every arc is either collapsed to a single vertex or is taken to an arc, such that f is surjective, and such that the inverse image of a vertex is always connected. CSD maps are shown to behave well under composition. It is proved that if there is a CSD map from N to M, then there is a way to lift an undirected version of M into N, often with added resolution. A CSD map from N to M puts strong constraints on N. In general, it may be useful to study classes of networks such that, for any N, there exists a CSD map from N to some standard member of that class.
Collapse
Affiliation(s)
- Stephen J Willson
- Department of Mathematics, Iowa State University, Ames, IA 50011, USA.
| |
Collapse
|
182
|
Georgiades K, Raoult D. How microbiology helps define the rhizome of life. Front Cell Infect Microbiol 2012; 2:60. [PMID: 22919651 PMCID: PMC3417629 DOI: 10.3389/fcimb.2012.00060] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2012] [Accepted: 04/16/2012] [Indexed: 01/24/2023] Open
Abstract
In contrast to the tree of life (TOF) theory, species are mosaics of gene sequences with different origins. Observations of the extensive lateral sequence transfers in all organisms have demonstrated that the genomes of all life forms are collections of genes with different evolutionary histories that cannot be represented by a single TOF. Moreover, genes themselves commonly have several origins due to recombination. The human genome is not free from recombination events, so it is a mosaic like other organisms' genomes. Recent studies have demonstrated evidence for the integration of parasitic DNA into the human genome. Lateral transfer events have been accepted as major contributors of genome evolution in free-living bacteria. Furthermore, the accumulation of genomic sequence data provides evidence for extended genetic exchanges in intracellular bacteria and suggests that such events constitute an agent that promotes and maintains all bacterial species. Archaea and viruses also form chimeras containing primarily bacterial but also eukaryotic sequences. In addition to lateral transfers, orphan genes are indicative of the fact that gene creation is a permanent and unsettled phenomenon. Currently, a rhizome may more adequately represent the multiplicity and de novo creation of a genome. We wanted to confirm that the term “rhizome” in evolutionary biology applies to the entire cellular life history. This view of evolution should resemble a clump of roots representing the multiple origins of the repertoires of the genes of each species.
Collapse
Affiliation(s)
- Kalliopi Georgiades
- Faculté de Médecine La Timone, Unité de Recherche en Maladies Infectieuses Tropical Emergentes (URMITE), CNRS-IRD UMR 6236-198, Université de la Méditerranée Marseille, France
| | | |
Collapse
|
183
|
Abstract
Horizontal gene transfer (HGT), the movement of genetic material from one species to another, is a common phenomenon in prokaryotic evolution. Although the rate of HGT is known to vary among genes, our understanding of the cause of this variation, currently summarized by two rules, is far from complete. The first rule states that informational genes, which are involved in DNA replication, transcription, and translation, have lower transferabilities than operational genes. The second rule asserts that protein interactivity negatively impacts gene transferability. Here, we hypothesize that high expression hampers HGT, because the fitness cost of an HGT to the recipient, arising from the 1) energy expenditure in transcription and translation, 2) cytotoxic protein misfolding, 3) reduction in cellular translational efficiency, 4) detrimental protein misinteraction, and 5) disturbance of the optimal protein concentration or cell physiology, increases with the expression level of the transferred gene. To test this hypothesis, we examined laboratory and natural HGTs to Escherichia coli. We observed lower transferabilities of more highly expressed genes, even after controlling the confounding factors from the two established rules and the genic GC content. Furthermore, expression level predicts gene transferability better than all other factors examined. We also confirmed the significant negative impact of gene expression on the rate of HGTs to 127 of 133 genomes of eubacteria and archaebacteria. Together, these findings establish the gene expression level as a major determinant of horizontal gene transferability. They also suggest that most successful HGTs are initially slightly deleterious, fixed because of their negligibly low costs rather than high benefits to the recipient.
Collapse
Affiliation(s)
- Chungoo Park
- Department of Ecology and Evolutionary Biology, University of Michigan, MI, USA
| | | |
Collapse
|
184
|
Thiergart T, Landan G, Schenk M, Dagan T, Martin WF. An evolutionary network of genes present in the eukaryote common ancestor polls genomes on eukaryotic and mitochondrial origin. Genome Biol Evol 2012; 4:466-85. [PMID: 22355196 PMCID: PMC3342870 DOI: 10.1093/gbe/evs018] [Citation(s) in RCA: 102] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
To test the predictions of competing and mutually exclusive hypotheses for the origin of eukaryotes, we identified from a sample of 27 sequenced eukaryotic and 994 sequenced prokaryotic genomes 571 genes that were present in the eukaryote common ancestor and that have homologues among eubacterial and archaebacterial genomes. Maximum-likelihood trees identified the prokaryotic genomes that most frequently contained genes branching as the sister to the eukaryotic nuclear homologues. Among the archaebacteria, euryarchaeote genomes most frequently harbored the sister to the eukaryotic nuclear gene, whereas among eubacteria, the α-proteobacteria were most frequently represented within the sister group. Only 3 genes out of 571 gave a 3-domain tree. Homologues from α-proteobacterial genomes that branched as the sister to nuclear genes were found more frequently in genomes of facultatively anaerobic members of the rhiozobiales and rhodospirilliales than in obligate intracellular ricketttsial parasites. Following α-proteobacteria, the most frequent eubacterial sister lineages were γ-proteobacteria, δ-proteobacteria, and firmicutes, which were also the prokaryote genomes least frequently found as monophyletic groups in our trees. Although all 22 higher prokaryotic taxa sampled (crenarchaeotes, γ-proteobacteria, spirochaetes, chlamydias, etc.) harbor genes that branch as the sister to homologues present in the eukaryotic common ancestor, that is not evidence of 22 different prokaryotic cells participating at eukaryote origins because prokaryotic “lineages” have laterally acquired genes for more than 1.5 billion years since eukaryote origins. The data underscore the archaebacterial (host) nature of the eukaryotic informational genes and the eubacterial (mitochondrial) nature of eukaryotic energy metabolism. The network linking genes of the eukaryote ancestor to contemporary homologues distributed across prokaryotic genomes elucidates eukaryote gene origins in a dialect cognizant of gene transfer in nature.
Collapse
Affiliation(s)
- Thorsten Thiergart
- Institute of Molecular Evolution, Heinrich-Heine University Düsseldorf, Germany
| | | | | | | | | |
Collapse
|
185
|
van Passel MW. Tracing common origins of Genomic Islands in prokaryotes based on genome signature analyses. Mob Genet Elements 2012; 1:247-249. [PMID: 22312594 DOI: 10.4161/mge1.3.18230] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2011] [Revised: 09/23/2011] [Accepted: 09/27/2011] [Indexed: 11/19/2022] Open
Abstract
Horizontal gene transfer constitutes a powerful and innovative force in evolution, but often little is known about the actual origins of transferred genes. Sequence alignments are generally of limited use in tracking the original donor, since still only a small fraction of the total genetic diversity is thought to be uncovered. Alternatively, approaches based on similarities in the genome specific relative oligonucleotide frequencies do not require alignments. Even though the exact origins of horizontally transferred genes may still not be established using these compositional analyses, it does suggest that compositionally very similar regions are likely to have had a common origin. These analyses have shown that up to a third of large acquired gene clusters that reside in the same genome are compositionally very similar, indicative of a shared origin. This brings us closer to uncovering the original donors of horizontally transferred genes, and could help in elucidating possible regulatory interactions between previously unlinked sequences.
Collapse
Affiliation(s)
- Mark Wj van Passel
- Systems and Synthetic Biology; Wageningen University; Wageningen, The Netherlands
| |
Collapse
|
186
|
Zhu B, Zhou Q, Xie G, Zhang G, Zhang X, Wang Y, Sun G, Li B, Jin G. Interkingdom gene transfer may contribute to the evolution of phytopathogenicity in botrytis cinerea. Evol Bioinform Online 2012; 8:105-17. [PMID: 22346340 PMCID: PMC3273930 DOI: 10.4137/ebo.s8486] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
The ascomycete Botrytis cinerea is a phytopathogenic fungus infecting and causing significant yield losses in a number of crops. The genome of B. cinerea has been fully sequenced while the importance of horizontal gene transfer (HGT) to extend the host range in plant pathogenic fungi has been recently appreciated. However, recent data confirm that the B. cinerea fungus shares conserved virulence factors with other fungal plant pathogens with narrow host range. Therefore, interkingdom HGT may contribute to the evolution of phytopathogenicity in B. cinerea. In this study, a stringent genome comparison pipeline was used to identify potential genes that have been obtained by B. cinerea but not by other fungi through interkingdom HGT. This search led to the identification of four genes: a UDP-glucosyltransferase (UGT), a lipoprotein and two alpha/beta hydrolase fold proteins. Phylogenetic analysis of the four genes suggests that B. cinerea acquired UGT from plants and the other 3 genes from bacteria. Based on the known gene functions and literature searching, a correlation between gene acquision and the evolution of pathogenicity in B. cinerea can be postulated.
Collapse
Affiliation(s)
- Bo Zhu
- State Key Laboratory of Rice Biology and Key Laboratory of Molecular Biology of Crop Pathogens and Insects, Ministry of Agriculture, Institute of Biotechnology, Zhejiang University, Hangzhou 310029, China
| | | | | | | | | | | | | | | | | |
Collapse
|
187
|
Bapteste E, Bouchard F, Burian RM. Philosophy and evolution: minding the gap between evolutionary patterns and tree-like patterns. Methods Mol Biol 2012; 856:81-110. [PMID: 22399456 DOI: 10.1007/978-1-61779-585-5_4] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
Ever since Darwin, the familiar genealogical pattern known as the Tree of Life (TOL) has been prominent in evolutionary thinking and has dominated not only systematics, but also the analysis of the units of evolution. However, recent findings indicate that the evolution of DNA, especially in prokaryotes and such DNA vehicles as viruses and plasmids, does not follow a unique tree-like pattern. Because evolutionary patterns track a greater range of processes than those captured in genealogies, genealogical patterns are in fact only a subset of a broader set of evolutionary patterns. This fact suggests that evolutionists who focus exclusively on genealogical patterns are blocked from providing a significant range of genuine evolutionary explanations. Consequently, we highlight challenges to tree-based approaches, and point the way toward more appropriate methods to study evolution (although we do not present them in technical detail). We argue that there is significant benefit in adopting wider range of models, evolutionary representations, and evolutionary explanations, based on an analysis of the full range of evolutionary processes. We introduce an ecosystem orientation into evolutionary thinking that highlights the importance of "type 1 coalitions" (functionally related units with genetic exchanges, aka "friends with genetic benefits"), "type 2 coalitions" (functionally related units without genetic exchanges), "communal interactions," and "emergent evolutionary properties." On this basis, we seek to promote the study of (especially prokaryotic) evolution with dynamic evolutionary networks, which are less constrained than the TOL, and to provide new ways to analyze an expanded range of evolutionary units (genetic modules, recombined genes, plasmids, phages and prokaryotic genomes, pangenomes, microbial communities) and evolutionary processes. Finally, we discuss some of the conceptual and practical questions raised by such network-based representation.
Collapse
|
188
|
Lima-Mendez G. Reticulate classification of mosaic microbial genomes using NeAT website. Methods Mol Biol 2012; 804:81-91. [PMID: 22144149 DOI: 10.1007/978-1-61779-361-5_5] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]
Abstract
The tree of life is the classical representation of the evolutionary relationships between existent species. A tree is appropriate to display the divergence of species through mutation, i.e., by vertical descent. However, lateral gene transfer (LGT) is excluded from such representations. When LGT contribution to genome evolution cannot be neglected (e.g., for prokaryotes and mobile genetic elements), the tree becomes misleading. Networks appear as an intuitive way to represent both vertical and horizontal relationships, while overlapping groups within such graphs are more suitable for their classification. Here, we describe a method to represent both vertical and horizontal relationships. We start with a set of genomes whose coded proteins have been grouped into families based on sequence similarity. Next, all pairs of genomes are compared, counting the number of proteins classified into the same family. From this comparison, we derive a weighted graph where genomes with a significant number of similar proteins are linked. Finally, we apply a two-step clustering of this graph to produce a classification where nodes can be assigned to multiple clusters. The procedure can be performed using the Network Analysis Tools (NeAT) website.
Collapse
Affiliation(s)
- Gipsi Lima-Mendez
- Laboratoire de Bioinformatique des Génomes et des Réseaux, Université Libre de Bruxelles, Bruxelles, Belgium.
| |
Collapse
|
189
|
Tamminen M, Virta M, Fani R, Fondi M. Large-scale analysis of plasmid relationships through gene-sharing networks. Mol Biol Evol 2011; 29:1225-40. [PMID: 22130968 DOI: 10.1093/molbev/msr292] [Citation(s) in RCA: 75] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
Plasmids are vessels of genetic exchange in microbial communities. They are known to transfer between different host organisms and acquire diverse genetic elements from chromosomes and/or other plasmids. Therefore, they constitute an important element in microbial evolution by rapidly disseminating various genetic properties among different communities. A paradigmatic example of this is the dissemination of antibiotic resistance (AR) genes that has resulted in the emergence of multiresistant pathogenic bacterial strains. To globally analyze the evolutionary dynamics of plasmids, we built a large graph in which 2,343 plasmids (nodes) are connected according to the proteins shared by each other. The analysis of this gene-sharing network revealed an overall coherence between network clustering and the phylogenetic classes of the corresponding microorganisms, likely resulting from genetic barriers to horizontal gene transfer between distant phylogenetic groups. Habitat was not a crucial factor in clustering as plasmids from organisms inhabiting different environments were often found embedded in the same cluster. Analyses of network metrics revealed a statistically significant correlation between plasmid mobility and their centrality within the network, providing support to the observation that mobile plasmids are particularly important in spreading genes in microbial communities. Finally, our study reveals an extensive (and previously undescribed) sharing of AR genes between Actinobacteria and Gammaproteobacteria, suggesting that the former might represent an important reservoir of AR genes for the latter.
Collapse
Affiliation(s)
- Manu Tamminen
- Department of Food and Environmental Sciences, University of Helsinki, Helsinki, Finland
| | | | | | | |
Collapse
|
190
|
Kämpfer P. Systematics of prokaryotes: the state of the art. Antonie van Leeuwenhoek 2011; 101:3-11. [PMID: 22041978 DOI: 10.1007/s10482-011-9660-4] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/30/2011] [Accepted: 10/11/2011] [Indexed: 11/30/2022]
Abstract
The term taxonomy is often used synonymously with systematics but it should be regarded more as a specific part of the latter and comprises the orderly arrangements of (defined) units in addition to the nomenclature, i.e. labelling of these units defined by classification, and also identification of these units defined by classification and labeled by nomenclature. Similar to all biological disciplines, taxonomic approaches in microbiology aim at the establishment of a system that mirrors the "order in nature" as closely as possible with the ultimate goal to describe the whole evolutionary order back to the origin of life. With the recognition of molecular markers present in all organisms (here in particular the small subunit rRNAs, ssRNSs), the achievement of this goal has become more and more feasible and the generation of gene and increasing numbers of genome sequences allow nowadays the generation of large amounts of data and often a very detailed insight into the genetic potential of prokaryotes. The possibility to generate whole genome sequences in a very short period of time leads to a strong tendency to base the taxonomic system more and more on sequence data. However, a comprehensive understanding of all the information behind sequence data is lagging far behind their accumulation. Genes and genomes may (or may not) function only in a given "environment", with the cell as basic entity for the display of this potential. Prokaryotic taxonomy still has its focus on the whole organism. In this context, natural selection drives evolution selecting the existing phenotypes and it is the phenotype that "exhibits" this process both in a given cellular and also environmental context. The term polyphasic taxonomy, which was coined almost 40 years ago and aimed at the integration of many levels of information (from molecular to ecological data) thereby allowing a more holistic view, should be revisited in the light of the enormous potential of the novel information associated with large data sets.
Collapse
Affiliation(s)
- Peter Kämpfer
- Institut für Angewandte Mikrobiologie, Justus-Liebig-Universität Giessen, Heinrich-Buff-Ring 26, Giessen,
| |
Collapse
|
191
|
Kämpfer P, Glaeser SP. Prokaryotic taxonomy in the sequencing era - the polyphasic approach revisited. Environ Microbiol 2011; 14:291-317. [DOI: 10.1111/j.1462-2920.2011.02615.x] [Citation(s) in RCA: 106] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
192
|
Williams D, Fournier GP, Lapierre P, Swithers KS, Green AG, Andam CP, Gogarten JP. A rooted net of life. Biol Direct 2011; 6:45. [PMID: 21936906 PMCID: PMC3189188 DOI: 10.1186/1745-6150-6-45] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2011] [Accepted: 09/21/2011] [Indexed: 01/29/2023] Open
Abstract
Abstract Phylogenetic reconstruction using DNA and protein sequences has allowed the reconstruction of evolutionary histories encompassing all life. We present and discuss a means to incorporate much of this rich narrative into a single model that acknowledges the discrete evolutionary units that constitute the organism. Briefly, this Rooted Net of Life genome phylogeny is constructed around an initial, well resolved and rooted tree scaffold inferred from a supermatrix of combined ribosomal genes. Extant sampled ribosomes form the leaves of the tree scaffold. These leaves, but not necessarily the deeper parts of the scaffold, can be considered to represent a genome or pan-genome, and to be associated with members of other gene families within that sequenced (pan)genome. Unrooted phylogenies of gene families containing four or more members are reconstructed and superimposed over the scaffold. Initially, reticulations are formed where incongruities between topologies exist. Given sufficient evidence, edges may then be differentiated as those representing vertical lines of inheritance within lineages and those representing horizontal genetic transfers or endosymbioses between lineages. Reviewers W. Ford Doolittle, Eric Bapteste and Robert Beiko.
Collapse
Affiliation(s)
- David Williams
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT 06269-3125, USA.
| | | | | | | | | | | | | |
Collapse
|
193
|
van Passel MW. Tracing common origins of Genomic Islands in prokaryotes based on genome signature analyses. Mob Genet Elements 2011. [DOI: 10.4161/mge.1.3.18230] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
|
194
|
Andam CP, Fournier GP, Gogarten JP. Multilevel populations and the evolution of antibiotic resistance through horizontal gene transfer. FEMS Microbiol Rev 2011; 35:756-67. [DOI: 10.1111/j.1574-6976.2011.00274.x] [Citation(s) in RCA: 63] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022] Open
|
195
|
Skippington E, Ragan MA. Lateral genetic transfer and the construction of genetic exchange communities. FEMS Microbiol Rev 2011; 35:707-35. [DOI: 10.1111/j.1574-6976.2010.00261.x] [Citation(s) in RCA: 123] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023] Open
|
196
|
Popa O, Dagan T. Trends and barriers to lateral gene transfer in prokaryotes. Curr Opin Microbiol 2011; 14:615-23. [PMID: 21856213 DOI: 10.1016/j.mib.2011.07.027] [Citation(s) in RCA: 153] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2011] [Revised: 07/19/2011] [Accepted: 07/25/2011] [Indexed: 11/19/2022]
Abstract
Gene acquisition by lateral gene transfer (LGT) is an important mechanism for natural variation among prokaryotes. Laboratory experiments show that protein-coding genes can be laterally transferred extremely fast among microbial cells, inherited to most of their descendants, and adapt to a new regulatory regime within a short time. Recent advance in the phylogenetic analysis of microbial genomes using networks approach reveals a substantial impact of LGT during microbial genome evolution. Phylogenomic networks of LGT among prokaryotes reconstructed from completely sequenced genomes uncover barriers to LGT in multiple levels. Here we discuss the kinds of barriers to gene acquisition in nature including physical barriers for gene transfer between cells, genomic barriers for the integration of acquired DNA, and functional barriers for the acquisition of new genes.
Collapse
Affiliation(s)
- Ovidiu Popa
- Institute of Molecular Evolution, Heinrich-Heine University of Düsseldorf, Universitätstr. 1 40225, Düsseldorf, Germany
| | | |
Collapse
|
197
|
Creevey CJ, Doerks T, Fitzpatrick DA, Raes J, Bork P. Universally distributed single-copy genes indicate a constant rate of horizontal transfer. PLoS One 2011; 6:e22099. [PMID: 21850220 PMCID: PMC3151239 DOI: 10.1371/journal.pone.0022099] [Citation(s) in RCA: 70] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2010] [Accepted: 06/17/2011] [Indexed: 11/19/2022] Open
Abstract
Single copy genes, universally distributed across the three domains of life and encoding mostly ancient parts of the translation machinery, are thought to be only rarely subjected to horizontal gene transfer (HGT). Indeed it has been proposed to have occurred in only a few genes and implies a rare, probably not advantageous event in which an ortholog displaces the original gene and has to function in a foreign context (orthologous gene displacement, OGD). Here, we have utilised an automatic method to identify HGT based on a conservative statistical approach capable of robustly assigning both donors and acceptors. Applied to 40 universally single copy genes we found that as many as 68 HGTs (implying OGDs) have occurred in these genes with a rate of 1.7 per family since the last universal common ancestor (LUCA). We examined a number of factors that have been claimed to be fundamental to HGT in general and tested their validity in the subset of universally distributed single copy genes. We found that differing functional constraints impact rates of OGD and the more evolutionarily distant the donor and acceptor, the less likely an OGD is to occur. Furthermore, species with larger genomes are more likely to be subjected to OGD. Most importantly, regardless of the trends above, the number of OGDs increases linearly with time, indicating a neutral, constant rate. This suggests that levels of HGT above this rate may be indicative of positively selected transfers that may allow niche adaptation or bestow other benefits to the recipient organism.
Collapse
Affiliation(s)
| | - Tobias Doerks
- European Molecular Biology Laboratory, Heidelberg, Germany
| | - David A. Fitzpatrick
- Department of Biology, National University of Ireland Maynooth, Maynooth, Ireland
| | - Jeroen Raes
- VIB Department of Molecular and Cellular Interactions, Vrije Universiteit Brussels, Brussels, Belgium
| | - Peer Bork
- European Molecular Biology Laboratory, Heidelberg, Germany
- * E-mail:
| |
Collapse
|
198
|
Phylogenomic networks. Trends Microbiol 2011; 19:483-91. [PMID: 21820313 DOI: 10.1016/j.tim.2011.07.001] [Citation(s) in RCA: 56] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2011] [Revised: 07/04/2011] [Accepted: 07/08/2011] [Indexed: 01/15/2023]
Abstract
Phylogenomics is aimed at studying functional and evolutionary aspects of genome biology using phylogenetic analysis of whole genomes. Current approaches to genome phylogenies are commonly founded in terms of phylogenetic trees. However, several evolutionary processes are non tree-like in nature, including recombination and lateral gene transfer (LGT). Phylogenomic networks are a special type of phylogenetic network reconstructed from fully sequenced genomes. The network model, comprising genomes connected by pairwise evolutionary relations, enables the reconstruction of both vertical and LGT events. Modeling genome evolution in the form of a network enables the use of an extensive toolbox developed for network research. The structural properties of phylogenomic networks open up fundamentally new insights into genome evolution.
Collapse
|
199
|
Beauregard-Racine J, Bicep C, Schliep K, Lopez P, Lapointe FJ, Bapteste E. Of woods and webs: possible alternatives to the tree of life for studying genomic fluidity in E. coli. Biol Direct 2011; 6:39; discussion 39. [PMID: 21774799 PMCID: PMC3160433 DOI: 10.1186/1745-6150-6-39] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2011] [Accepted: 07/20/2011] [Indexed: 12/26/2022] Open
Abstract
Background We introduce several forest-based and network-based methods for exploring microbial evolution, and apply them to the study of thousands of genes from 30 strains of E. coli. This case study illustrates how additional analyses could offer fast heuristic alternatives to standard tree of life (TOL) approaches. Results We use gene networks to identify genes with atypical modes of evolution, and genome networks to characterize the evolution of genetic partnerships between E. coli and mobile genetic elements. We develop a novel polychromatic quartet method to capture patterns of recombination within E. coli, to update the clanistic toolkit, and to search for the impact of lateral gene transfer and of pathogenicity on gene evolution in two large forests of trees bearing E. coli. We unravel high rates of lateral gene transfer involving E. coli (about 40% of the trees under study), and show that both core genes and shell genes of E. coli are affected by non-tree-like evolutionary processes. We show that pathogenic lifestyle impacted the structure of 30% of the gene trees, and that pathogenic strains are more likely to transfer genes with one another than with non-pathogenic strains. In addition, we propose five groups of genes as candidate mobile modules of pathogenicity. We also present strong evidence for recent lateral gene transfer between E. coli and mobile genetic elements. Conclusions Depending on which evolutionary questions biologists want to address (i.e. the identification of modules, genetic partnerships, recombination, lateral gene transfer, or genes with atypical evolutionary modes, etc.), forest-based and network-based methods are preferable to the reconstruction of a single tree, because they provide insights and produce hypotheses about the dynamics of genome evolution, rather than the relative branching order of species and lineages. Such a methodological pluralism - the use of woods and webs - is to be encouraged to analyse the evolutionary processes at play in microbial evolution. This manuscript was reviewed by: Ford Doolittle, Tal Pupko, Richard Burian, James McInerney, Didier Raoult, and Yan Boucher
Collapse
|
200
|
Affiliation(s)
- David Penny
- Institute for Molecular BioSciences, Massey University, Palmerston North, New Zealand
- * E-mail:
| |
Collapse
|