Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Jun SR, Sims GE, Wu GA, Kim SH. Whole-proteome phylogeny of prokaryotes by feature frequency profiles: An alignment-free method with optimal feature resolution. Proc Natl Acad Sci U S A 2010;107:133-8. [PMID: 20018669 PMCID: PMC2806744 DOI: 10.1073/pnas.0913033107] [Citation(s) in RCA: 127] [Impact Index Per Article: 9.1] [Indexed: 11/18/2022] Open

Cited by Other Article(s)

Yu H, Yau SST. The optimal metric for viral genome space. Comput Struct Biotechnol J 2024;23:2083-2096. [PMID: 38803517 PMCID: PMC11128839 DOI: 10.1016/j.csbj.2024.05.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/28/2023] [Revised: 04/22/2024] [Accepted: 05/04/2024] [Indexed: 05/29/2024] Open

Wang T, Yu ZG, Li J. CGRWDL: alignment-free phylogeny reconstruction method for viruses based on chaos game representation weighted by dynamical language model. Front Microbiol 2024;15:1339156. [PMID: 38572227 PMCID: PMC10987876 DOI: 10.3389/fmicb.2024.1339156] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/15/2023] [Accepted: 02/23/2024] [Indexed: 04/05/2024] Open

Yu H, Yau SST. Automated recognition of chromosome fusion using an alignment-free natural vector method. Front Genet 2024;15:1364951. [PMID: 38572414 PMCID: PMC10987741 DOI: 10.3389/fgene.2024.1364951] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/03/2024] [Accepted: 03/06/2024] [Indexed: 04/05/2024] Open

Rachtman E, Sarmashghi S, Bafna V, Mirarab S. Quantifying the uncertainty of assembly-free genome-wide distance estimates and phylogenetic relationships using subsampling. Cell Syst 2022;13:817-829.e3. [PMID: 36265468 PMCID: PMC9589918 DOI: 10.1016/j.cels.2022.06.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/24/2021] [Revised: 03/14/2022] [Accepted: 06/28/2022] [Indexed: 01/26/2023]

Balaban M, Bristy NA, Faisal A, Bayzid MS, Mirarab S. Genome-wide alignment-free phylogenetic distance estimation under a no strand-bias model. Bioinformatics Advances 2022;2:vbac055. [PMID: 35992043 PMCID: PMC9383262 DOI: 10.1093/bioadv/vbac055] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 06/21/2022] [Accepted: 08/09/2022] [Indexed: 01/27/2023]

Affiliation(s)
Metin Balaban
Nishat Anjum Bristy
Ahnaf Faisal
Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka 1205, Bangladesh
Md Shamsuzzoha Bayzid
Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka 1205, Bangladesh
Siavash Mirarab
To whom correspondence should be addressed.

Wang Z, Wen Z, Jiang M, Xia F, Wang M, Zhuge X, Dai J. Dissemination of virulence and resistance genes among Klebsiella pneumoniae via outer membrane vesicle: An important plasmid transfer mechanism to promote the emergence of carbapenem-resistant hypervirulent Klebsiella pneumoniae. Transbound Emerg Dis 2022;69:e2661-e2676. [PMID: 35679514 DOI: 10.1111/tbed.14615] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Received: 01/22/2022] [Revised: 05/15/2022] [Accepted: 06/07/2022] [Indexed: 12/01/2022]

Abstract

Klebsiella pneumoniae is a well-known opportunistic enterobacterium involved in complex clinical infections in humans and animals. Domestic animals might be a source of multidrug-resistant virulent K. pneumoniae for humans, and K. pneumoniae infections in domestic animals are considered an emerging global concern. Horizontal gene transfer plays an essential role in bacterial genome evolution by spreading virulence and resistance determinants. However, whether virulence genes can be transferred horizontally via K. pneumoniae-derived outer membrane vesicles (OMVs) has not been reported. In this study, we performed complete genome sequencing of two K. pneumoniae strains, HvK2115 and CRK3022, with hypervirulent or carbapenem-resistant traits. OMVs from K. pneumoniae HvK2115 and CRK3022 were purified and observed. The carriage of virulence or resistance genes in K. pneumoniae OMVs was identified. The influence of OMVs on the horizontal transfer of virulence-related or drug-resistant plasmids among K. pneumoniae strains was evaluated thoroughly. Plasmid transfer to recipient bacteria through OMVs was identified by polymerase chain reaction, pulsed-field gel electrophoresis and Southern blot. This study revealed that OMVs could mediate the intraspecific and interspecific horizontal transfer of the virulence plasmid phvK2115. OMVs could simultaneously transfer two resistance plasmids into K. pneumoniae and Escherichia coli recipient strains. OMV-mediated horizontal transfer of the virulence plasmid phvK2115 could significantly enhance the pathogenicity of human carbapenem-resistant K. pneumoniae CRK3022. CRK3022 that acquired the virulence plasmid phvK2115 could become a CR-hvKp strain. Critically, OMV-mediated horizontal transfer of phvK2115 led to the coexistence of virulence and carbapenem-resistance genes in K. pneumoniae, resulting in the emergence of carbapenem-resistant hypervirulent K. pneumoniae.

Liyanapathiranage P, Wagner N, Avram O, Pupko T, Potnis N. Phylogenetic Distribution and Evolution of Type VI Secretion System in the Genus Xanthomonas. Front Microbiol 2022;13:840308. [PMID: 35495725 PMCID: PMC9048695 DOI: 10.3389/fmicb.2022.840308] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 12/21/2021] [Accepted: 02/10/2022] [Indexed: 11/13/2022] Open

Abstract

The type VI secretion system (T6SS) present in many Gram-negative bacteria is a contact-dependent apparatus that can directly deliver secreted effectors or toxins into diverse neighboring cellular targets including both prokaryotic and eukaryotic organisms. Recent reverse genetics studies with T6 core gene loci have indicated the importance of functional T6SS toward overall competitive fitness in various pathogenic Xanthomonas spp. To understand the contribution of T6SS toward ecology and evolution of Xanthomonas spp., we explored the distribution of the three distinguishable T6SS clusters, i3*, i3***, and i4, in approximately 1,740 Xanthomonas genomes, along with their conservation, genetic organization, and their evolutionary patterns in this genus. Screening genomes for core genes of each T6 cluster indicated that 40% of the sequenced strains possess two T6 clusters, with combinations of i3*** and i3* or i3*** and i4. A few strains of Xanthomonas citri, Xanthomonas phaseoli, and Xanthomonas cissicola were the exception, possessing a unique combination of i3* and i4. The findings also indicated clade-specific distribution of T6SS clusters. Phylogenetic analysis demonstrated that T6SS clusters i3* and i3*** were probably acquired by the ancestor of the genus Xanthomonas, followed by gain or loss of individual clusters upon diversification into subsequent clades. T6 i4 cluster has been acquired in recent independent events by group 2 xanthomonads followed by its spread via horizontal dissemination across distinct clades across groups 1 and 2 xanthomonads. We also noted reshuffling of the entire core T6 loci, as well as T6SS spike complex components, hcp and vgrG, among different species. Our findings indicate that gain or loss events of specific T6SS clusters across Xanthomonas phylogeny have not been random.

Microbial storage and its implications for soil ecology. The ISME Journal 2022;16:617-629. [PMID: 34593996 PMCID: PMC8857262 DOI: 10.1038/s41396-021-01110-w] [Citation(s) in RCA: 27] [Impact Index Per Article: 13.5] [Received: 12/09/2020] [Revised: 08/31/2021] [Accepted: 09/07/2021] [Indexed: 02/08/2023]

Dong R, Pei S, Guan M, Yau SC, Yin C, He RL, Yau SST. Full Chromosomal Relationships Between Populations and the Origin of Humans. Front Genet 2022;12:828805. [PMID: 35186019 PMCID: PMC8847220 DOI: 10.3389/fgene.2021.828805] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/04/2021] [Accepted: 12/22/2021] [Indexed: 11/23/2022] Open

Giannakara M, Koumandou VL. Evolution of two-component quorum sensing systems. Access Microbiol 2022;4:000303. [PMID: 35252749 PMCID: PMC8895600 DOI: 10.1099/acmi.0.000303] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Received: 05/17/2021] [Accepted: 11/15/2021] [Indexed: 12/16/2022] Open

Zhong H, Loukides G, Pissis SP. Clustering sequence graphs. Data Knowl Eng 2022. [DOI: 10.1016/j.datak.2022.101981] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/03/2022]

Nakayinga R, Makumi A, Tumuhaise V, Tinzaara W. Xanthomonas bacteriophages: a review of their biology and biocontrol applications in agriculture. BMC Microbiol 2021;21:291. [PMID: 34696726 PMCID: PMC8543423 DOI: 10.1186/s12866-021-02351-7] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Received: 04/19/2021] [Accepted: 10/12/2021] [Indexed: 11/10/2022] Open

Kořený L, Oborník M, Horáková E, Waller RF, Lukeš J. The convoluted history of haem biosynthesis. Biol Rev Camb Philos Soc 2021;97:141-162. [PMID: 34472688 DOI: 10.1111/brv.12794] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Received: 12/16/2020] [Revised: 08/12/2021] [Accepted: 08/19/2021] [Indexed: 01/14/2023]

Cofactor Specificity of Glucose-6-Phosphate Dehydrogenase Isozymes in Pseudomonas putida Reveals a General Principle Underlying Glycolytic Strategies in Bacteria. mSystems 2021;6:e00014-21. [PMID: 33727391 PMCID: PMC8546961 DOI: 10.1128/msystems.00014-21] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Indexed: 11/20/2022] Open

Abstract

Glucose-6-phosphate dehydrogenase (G6PDH) is widely distributed in nature and catalyzes the first committing step in the oxidative branch of the pentose phosphate (PP) pathway, feeding either the reductive PP or the Entner-Doudoroff pathway. Besides its role in central carbon metabolism, this dehydrogenase provides reduced cofactors, thereby affecting redox balance. Although G6PDH is typically considered to display specificity toward NADP⁺, some variants accept NAD⁺ similarly or even preferentially. Furthermore, the number of G6PDH isozymes encoded in bacterial genomes varies from none to more than four orthologues. On this background, we systematically analyzed the interplay of the three G6PDH isoforms of the soil bacterium Pseudomonas putida KT2440 from genomic, genetic, and biochemical perspectives. P. putida represents an ideal model to tackle this endeavor, as its genome harbors gene orthologues for most dehydrogenases in central carbon metabolism. We show that the three G6PDHs of strain KT2440 have different cofactor specificities and that the isoforms encoded by zwfA and zwfB carry most of the activity, acting as metabolic “gatekeepers” for carbon sources that enter at different nodes of the biochemical network. Moreover, we demonstrate how multiplication of G6PDH isoforms is a widespread strategy in bacteria, correlating with the presence of an incomplete Embden-Meyerhof-Parnas pathway. The abundance of G6PDH isoforms in these species goes hand in hand with low NADP⁺ affinity, at least in one isozyme. We propose that gene duplication and relaxation in cofactor specificity is an evolutionary strategy toward balancing the relative production of NADPH and NADH.

IMPORTANCE Protein families have likely arisen during evolution by gene duplication and divergence followed by neofunctionalization. While this phenomenon is well documented for catabolic activities (typical of environmental bacteria that colonize highly polluted niches), the coexistence of multiple isozymes in central carbon catabolism remains relatively unexplored. We have adopted the metabolically versatile soil bacterium Pseudomonas putida KT2440 as a model to interrogate the physiological and evolutionary significance of coexisting glucose-6-phosphate dehydrogenase (G6PDH) isozymes. Our results show that each of the three G6PDHs in this bacterium display distinct biochemical properties, especially at the level of cofactor preference, impacting bacterial physiology in a carbon source-dependent fashion. Furthermore, the presence of multiple G6PDHs differing in NAD⁺ or NADP⁺ specificity in bacterial species strongly correlates with their predominant metabolic lifestyle. Our findings support the notion that multiplication of genes encoding cofactor-dependent dehydrogenases is a general evolutionary strategy toward achieving redox balance according to the growth conditions.

McKinnon LM, Miller JB, Whiting MF, Kauwe JSK, Ridge PG. A comprehensive analysis of the phylogenetic signal in ramp sequences in 211 vertebrates. Sci Rep 2021;11:622. [PMID: 33436653 PMCID: PMC7803996 DOI: 10.1038/s41598-020-78803-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 06/16/2020] [Accepted: 11/23/2020] [Indexed: 01/24/2023] Open

An SQ, Potnis N, Dow M, Vorhölter FJ, He YQ, Becker A, Teper D, Li Y, Wang N, Bleris L, Tang JL. Mechanistic insights into host adaptation, virulence and epidemiology of the phytopathogen Xanthomonas. FEMS Microbiol Rev 2020;44:1-32. [PMID: 31578554 PMCID: PMC8042644 DOI: 10.1093/femsre/fuz024] [Citation(s) in RCA: 117] [Impact Index Per Article: 29.3] [Received: 05/07/2019] [Accepted: 09/29/2019] [Indexed: 01/15/2023] Open

Affiliation(s)

Shi-Qi An National Biofilms Innovation Centre (NBIC), Biological Sciences, University of Southampton, University Road, Southampton SO17 1BJ, UK
Neha Potnis Department of Entomology and Plant Pathology, Rouse Life Science Building, Auburn University, Auburn, AL 36849, USA
Max Dow School of Microbiology, Food Science & Technology Building, University College Cork, Cork T12 K8AF, Ireland
Frank-Jörg Vorhölter MVZ Dr. Eberhard & Partner Dortmund, Brauhausstraße 4, Dortmund 44137, Germany
Yong-Qiang He State Key Laboratory for Conservation and Utilization of Subtropical Agro-bioresources, College of Life Science and Technology, Guangxi University, 100 Daxue Road, Nanning 530004, Guangxi, China
Anke Becker Loewe Center for Synthetic Microbiology and Department of Biology, Philipps-Universität Marburg, Hans-Meerwein-Straße 6, Marburg 35032, Germany
Doron Teper Citrus Research and Education Center, Department of Microbiology and Cell Science, Institute of Food and Agricultural Sciences, University of Florida, 700 Experiment Station Road, Lake Alfred 33850, USA
Yi Li Bioengineering Department, University of Texas at Dallas, 2851 Rutford Ave, Richardson, TX 75080, USA; Center for Systems Biology, University of Texas at Dallas, 800 W Campbell Road, Richardson, TX 75080, USA
Nian Wang Citrus Research and Education Center, Department of Microbiology and Cell Science, Institute of Food and Agricultural Sciences, University of Florida, 700 Experiment Station Road, Lake Alfred 33850, USA
Leonidas Bleris Bioengineering Department, University of Texas at Dallas, 2851 Rutford Ave, Richardson, TX 75080, USA; Center for Systems Biology, University of Texas at Dallas, 800 W Campbell Road, Richardson, TX 75080, USA; Department of Biological Sciences, University of Texas at Dallas, 800 W Campbell Road, Richardson, TX 75080, USA
Ji-Liang Tang State Key Laboratory for Conservation and Utilization of Subtropical Agro-bioresources, College of Life Science and Technology, Guangxi University, 100 Daxue Road, Nanning 530004, Guangxi, China

Delibaş E, Arslan A, Şeker A, Diri B. A novel alignment-free DNA sequence similarity analysis approach based on top-k n-gram match-up. J Mol Graph Model 2020;100:107693. [PMID: 32805559 DOI: 10.1016/j.jmgm.2020.107693] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Received: 04/12/2020] [Revised: 06/15/2020] [Accepted: 07/06/2020] [Indexed: 11/17/2022]

Mughal F, Nasir A, Caetano-Anollés G. The origin and evolution of viruses inferred from fold family structure. Arch Virol 2020;165:2177-2191. [PMID: 32748179 PMCID: PMC7398281 DOI: 10.1007/s00705-020-04724-1] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Received: 04/02/2020] [Accepted: 05/30/2020] [Indexed: 12/16/2022]

Dong R, Pei S, Yin C, He RL, Yau SST. Analysis of the Hosts and Transmission Paths of SARS-CoV-2 in the COVID-19 Outbreak. Genes (Basel) 2020;11:E637. [PMID: 32526937 PMCID: PMC7349679 DOI: 10.3390/genes11060637] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Received: 03/29/2020] [Revised: 05/30/2020] [Accepted: 06/03/2020] [Indexed: 12/11/2022] Open

Miller JB, McKinnon LM, Whiting MF, Kauwe JSK, Ridge PG. Codon Pairs are Phylogenetically Conserved: A comprehensive analysis of codon pairing conservation across the Tree of Life. PLoS One 2020;15:e0232260. [PMID: 32401752 PMCID: PMC7219770 DOI: 10.1371/journal.pone.0232260] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Received: 08/28/2019] [Accepted: 04/10/2020] [Indexed: 11/27/2022] Open

Whole-proteome tree of life suggests a deep burst of organism diversity. Proc Natl Acad Sci U S A 2020;117:3678-3686. [PMID: 32019884 PMCID: PMC7035600 DOI: 10.1073/pnas.1915766117] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Indexed: 01/01/2023] Open

Abstract

Tree of life (ToL) is a metaphorical tree that captures a simplified narrative of the evolutionary course and kinship among all living organisms of today. We have reconstructed a whole-proteome ToL for over 4,000 different extant species for which complete or near-complete genome sequences are available in public databases. The ToL suggests that 1) all extant organisms of this study can be grouped into 2 “Supergroups,” 6 “Major Groups,” or 35+ “Groups”; 2) the order of emergence of the “founders” of all the groups may be assigned on an evolutionary progression scale; and 3) all of the founders of the groups have emerged in a “deep burst” near the root of the ToL—an explosive birth of life’s diversity.

An organism tree of life (organism ToL) is a conceptual and metaphorical tree to capture a simplified narrative of the evolutionary course and kinship among the extant organisms. Such a tree cannot be experimentally validated but may be reconstructed based on characteristics associated with the organisms. Since the whole-genome sequence of an organism is, at present, the most comprehensive descriptor of the organism, a whole-genome sequence-based ToL can be an empirically derivable surrogate for the organism ToL. However, experimentally determining the whole-genome sequences of many diverse organisms was practically impossible until recently. We have constructed three types of ToLs for diversely sampled organisms using the sequences of whole genome, of whole transcriptome, and of whole proteome. Of the three, whole-proteome sequence-based ToL (whole-proteome ToL), constructed by applying information theory-based feature frequency profile method, an “alignment-free” method, gave the most topologically stable ToL. Here, we describe the main features of a whole-proteome ToL for 4,023 species with known complete or almost complete genome sequences on grouping and kinship among the groups at deep evolutionary levels. The ToL reveals 1) all extant organisms of this study can be grouped into 2 “Supergroups,” 6 “Major Groups,” or 35+ “Groups”; 2) the order of emergence of the “founders” of all of the groups may be assigned on an evolutionary progression scale; 3) all of the founders of the groups have emerged in a “deep burst” at the very beginning period near the root of the ToL—an explosive birth of life’s diversity.

De Pierri CR, Voyceik R, Santos de Mattos LGC, Kulik MG, Camargo JO, Repula de Oliveira AM, de Lima Nichio BT, Marchaukoski JN, da Silva Filho AC, Guizelini D, Ortega JM, Pedrosa FO, Raittz RT. SWeeP: representing large biological sequences datasets in compact vectors. Sci Rep 2020;10:91. [PMID: 31919449 PMCID: PMC6952362 DOI: 10.1038/s41598-019-55627-4] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Received: 03/21/2019] [Accepted: 12/02/2019] [Indexed: 12/25/2022] Open

Affiliation(s)

Camilla Reginatto De Pierri Federal University of Paraná - SEPT, Graduate Program in Bioinformatics, Curitiba, Paraná, Brazil; Federal University of Paraná, Department of Biochemistry and Molecular Biology, Curitiba, Paraná, Brazil
Ricardo Voyceik Federal University of Minas Gerais, Institute of Biological Sciences (ICB), Belo Horizonte, Minas Gerais, Brazil
Letícia Graziela Costa Santos de Mattos Federal University of Paraná - SEPT, Graduate Program in Bioinformatics, Curitiba, Paraná, Brazil
Mariane Gonçalves Kulik Federal University of Paraná - SEPT, Graduate Program in Bioinformatics, Curitiba, Paraná, Brazil
Josué Oliveira Camargo Federal University of Paraná - SEPT, Graduate Program in Bioinformatics, Curitiba, Paraná, Brazil; Federal University of Paraná, Department of Biochemistry and Molecular Biology, Curitiba, Paraná, Brazil
Aryel Marlus Repula de Oliveira Federal University of Paraná - SEPT, Graduate Program in Bioinformatics, Curitiba, Paraná, Brazil; Federal University of Paraná, Department of Genetics, Curitiba, Paraná, Brazil
Bruno Thiago de Lima Nichio Federal University of Paraná - SEPT, Graduate Program in Bioinformatics, Curitiba, Paraná, Brazil; Federal University of Paraná, Department of Biochemistry and Molecular Biology, Curitiba, Paraná, Brazil
Jeroniza Nunes Marchaukoski Federal University of Paraná - SEPT, Graduate Program in Bioinformatics, Curitiba, Paraná, Brazil
Antonio Camilo da Silva Filho Federal University of Paraná - SEPT, Graduate Program in Bioinformatics, Curitiba, Paraná, Brazil; Federal University of Paraná, Department of Pharmaceutical Sciences, Curitiba, Paraná, Brazil
Dieval Guizelini Federal University of Paraná - SEPT, Graduate Program in Bioinformatics, Curitiba, Paraná, Brazil
J Miguel Ortega Federal University of Minas Gerais, Institute of Biological Sciences (ICB), Belo Horizonte, Minas Gerais, Brazil
Fabio O Pedrosa Federal University of Paraná - SEPT, Graduate Program in Bioinformatics, Curitiba, Paraná, Brazil; Federal University of Paraná, Department of Biochemistry and Molecular Biology, Curitiba, Paraná, Brazil
Roberto Tadeu Raittz Federal University of Paraná - SEPT, Graduate Program in Bioinformatics, Curitiba, Paraná, Brazil; Federal University of Minas Gerais, Institute of Biological Sciences (ICB), Belo Horizonte, Minas Gerais, Brazil; Federal University of Paraná, Department of Genetics, Curitiba, Paraná, Brazil

Bernard G, Chan CX, Chan YB, Chua XY, Cong Y, Hogan JM, Maetschke SR, Ragan MA. Alignment-free inference of hierarchical and reticulate phylogenomic relationships. Brief Bioinform 2019;20:426-435. [PMID: 28673025 PMCID: PMC6433738 DOI: 10.1093/bib/bbx067] [Citation(s) in RCA: 53] [Impact Index Per Article: 10.6] [Received: 02/08/2017] [Revised: 05/04/2017] [Indexed: 11/22/2022] Open

Zielezinski A, Girgis HZ, Bernard G, Leimeister CA, Tang K, Dencker T, Lau AK, Röhling S, Choi JJ, Waterman MS, Comin M, Kim SH, Vinga S, Almeida JS, Chan CX, James BT, Sun F, Morgenstern B, Karlowski WM. Benchmarking of alignment-free sequence comparison methods. Genome Biol 2019;20:144. [PMID: 31345254 PMCID: PMC6659240 DOI: 10.1186/s13059-019-1755-7] [Citation(s) in RCA: 101] [Impact Index Per Article: 20.2] [Received: 05/16/2019] [Accepted: 07/03/2019] [Indexed: 11/22/2022] Open

Affiliation(s)

Andrzej Zielezinski Department of Computational Biology, Faculty of Biology, Adam Mickiewicz University Poznan, Uniwersytetu Poznańskiego 6, 61-614, Poznan, Poland
Hani Z Girgis Tandy School of Computer Science, The University of Tulsa, 800 South Tucker Drive, Tulsa, OK, 74104, USA
Guillaume Bernard UMR 7205 ISYEB, Sorbonne Université, 75005, Paris, France
Chris-Andre Leimeister Department of Bioinformatics, Institute of Microbiology and Genetics, University of Göttingen, Goldschmidtstr. 1, 37077, Göttingen, Germany
Kujin Tang Department of Biological Sciences, Quantitative and Computational Biology Program, University of Southern California, Los Angeles, CA, 90089, USA
Thomas Dencker Department of Bioinformatics, Institute of Microbiology and Genetics, University of Göttingen, Goldschmidtstr. 1, 37077, Göttingen, Germany
Anna Katharina Lau Department of Bioinformatics, Institute of Microbiology and Genetics, University of Göttingen, Goldschmidtstr. 1, 37077, Göttingen, Germany
Sophie Röhling Department of Bioinformatics, Institute of Microbiology and Genetics, University of Göttingen, Goldschmidtstr. 1, 37077, Göttingen, Germany
Jae Jin Choi Department of Chemistry, University of California, Berkeley, CA, 94720, USA; Molecular Biophysics & Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
Michael S Waterman Department of Biological Sciences, Quantitative and Computational Biology Program, University of Southern California, Los Angeles, CA, 90089, USA; Centre for Computational Systems Biology, School of Mathematical Sciences, Fudan University, Shanghai, 200433, China
Matteo Comin Department of Information Engineering, University of Padova, Padova, Italy
Sung-Hou Kim Department of Chemistry, University of California, Berkeley, CA, 94720, USA; Molecular Biophysics & Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
Susana Vinga INESC-ID, Instituto Superior Técnico, Universidade de Lisboa, Av. Rovisco Pais 1, 1049-001, Lisbon, Portugal; IDMEC, Instituto Superior Técnico, Universidade de Lisboa, Av. Rovisco Pais 1, 1049-001, Lisbon, Portugal
Jonas S Almeida Division of Cancer Epidemiology and Genetics (DCEG), National Cancer Institute (NIH/NCI), Bethesda, USA
Cheong Xin Chan Institute for Molecular Bioscience, and School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, QLD, 4072, Australia
Benjamin T James Tandy School of Computer Science, The University of Tulsa, 800 South Tucker Drive, Tulsa, OK, 74104, USA
Fengzhu Sun Department of Biological Sciences, Quantitative and Computational Biology Program, University of Southern California, Los Angeles, CA, 90089, USA; Centre for Computational Systems Biology, School of Mathematical Sciences, Fudan University, Shanghai, 200433, China
Burkhard Morgenstern Department of Bioinformatics, Institute of Microbiology and Genetics, University of Göttingen, Goldschmidtstr. 1, 37077, Göttingen, Germany
Wojciech M Karlowski Department of Computational Biology, Faculty of Biology, Adam Mickiewicz University Poznan, Uniwersytetu Poznańskiego 6, 61-614, Poznan, Poland.

Lu YY, Tang K, Ren J, Fuhrman JA, Waterman MS, Sun F. CAFE: aCcelerated Alignment-FrEe sequence analysis. Nucleic Acids Res 2017;45:W554-W559. [PMID: 28472388 PMCID: PMC5793812 DOI: 10.1093/nar/gkx351] [Citation(s) in RCA: 40] [Impact Index Per Article: 8.0] [Received: 02/27/2017] [Accepted: 04/20/2017] [Indexed: 12/13/2022] Open

Miller JB, McKinnon LM, Whiting MF, Ridge PG. CAM: an alignment-free method to recover phylogenies using codon aversion motifs. PeerJ 2019;7:e6984. [PMID: 31198636 PMCID: PMC6555396 DOI: 10.7717/peerj.6984] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Received: 10/09/2018] [Accepted: 04/17/2019] [Indexed: 12/20/2022] Open

Abstract

BACKGROUND

Common phylogenomic approaches for recovering phylogenies are often time-consuming and require annotations for orthologous gene relationships that are not always available. In contrast, alignment-free phylogenomic approaches typically use structure and oligomer frequencies to calculate pairwise distances between species. We have developed an approach to quickly calculate distances between species based on codon aversion.

METHODS

Utilizing a novel alignment-free character state, we present CAM, an alignment-free approach to recover phylogenies by comparing differences in codon aversion motifs (i.e., the set of unused codons within each gene) across all genes within a species. Synonymous codon usage is non-random and differs between organisms, between genes, and even within a single gene, and many genes do not use all possible codons. We report a comprehensive analysis of codon aversion within 229,742,339 genes from 23,428 species across all kingdoms of life, and we provide an alignment-free framework for its use in a phylogenetic construct. For each species, we first construct a set of codon aversion motifs spanning all genes within that species. We define the pairwise distance between two species, A and B, as one minus the number of shared codon aversion motifs divided by the total codon aversion motifs of the species, A or B, containing the fewest motifs. This approach allows us to calculate pairwise distances even when substantial differences in the number of genes or a high rate of divergence between species exists. Finally, we use neighbor-joining to recover phylogenies.
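The pairwise distance defined in the METHODS paragraph above can be sketched in a few lines of Python. This is a minimal illustration, not the CAM authors' implementation: the function name `cam_distance` and the example motifs are hypothetical, and each motif is assumed to be represented as a frozenset of the codons that one gene never uses.

```python
def cam_distance(motifs_a, motifs_b):
    """Return 1 - |shared motifs| / |motifs of the species with fewer motifs|."""
    a, b = set(motifs_a), set(motifs_b)
    smaller = min(len(a), len(b))
    if smaller == 0:
        raise ValueError("each species needs at least one codon aversion motif")
    return 1.0 - len(a & b) / smaller


# Hypothetical data: one frozenset of unused codons per gene.
species_a = {
    frozenset({"TTT", "CGA"}),
    frozenset({"AGG"}),
    frozenset({"CGA", "CGG", "TAA"}),
}
species_b = {
    frozenset({"TTT", "CGA"}),
    frozenset({"AGG"}),
    frozenset({"GGG"}),
    frozenset({"ATA", "CTA"}),
}

# Two motifs are shared and the smaller set has three motifs.
d = cam_distance(species_a, species_b)
print(round(d, 4))
```

Dividing by the smaller motif set is what lets the distance stay meaningful when two species differ greatly in gene count: a small genome whose motifs are all found in a larger one still scores a distance of zero.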

RESULTS

Using the Open Tree of Life and NCBI Taxonomy Database as expected phylogenies, our approach compares well, recovering phylogenies that largely match expected trees and are comparable to trees recovered using maximum likelihood and other alignment-free approaches. Our technique is much faster than maximum likelihood and similar in accuracy to other alignment-free approaches. Therefore, we propose that codon aversion be considered a phylogenetically conserved character that may be used in future phylogenomic studies.

AVAILABILITY

CAM, documentation, and test files are freely available on GitHub at https://github.com/ridgelab/cam.

Cardona T. Thinking twice about the evolution of photosynthesis. Open Biol 2019;9:180246. [PMID: 30890026 PMCID: PMC6451369 DOI: 10.1098/rsob.180246] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2018] [Accepted: 02/25/2019] [Indexed: 02/07/2023] Open

Leimeister CA, Schellhorn J, Dörrer S, Gerth M, Bleidorn C, Morgenstern B. Prot-SpaM: fast alignment-free phylogeny reconstruction based on whole-proteome sequences. Gigascience 2019;8:giy148. [PMID: 30535314 PMCID: PMC6436989 DOI: 10.1093/gigascience/giy148] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2018] [Revised: 09/10/2018] [Accepted: 11/20/2018] [Indexed: 11/20/2022] Open

Pornsukarom S, van Vliet AHM, Thakur S. Whole genome sequencing analysis of multiple Salmonella serovars provides insights into phylogenetic relatedness, antimicrobial resistance, and virulence markers across humans, food animals and agriculture environmental sources. BMC Genomics 2018;19:801. [PMID: 30400810 PMCID: PMC6218967 DOI: 10.1186/s12864-018-5137-4] [Citation(s) in RCA: 82] [Impact Index Per Article: 13.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2018] [Accepted: 10/02/2018] [Indexed: 11/13/2022] Open

Abstract

Background

Salmonella enterica is a significant foodborne pathogen, which can be transmitted via several distinct routes, and reports on acquisition of antimicrobial resistance (AMR) are increasing. To better understand the association between human Salmonella clinical isolates and the potential environmental/animal reservoirs, whole genome sequencing (WGS) was used to investigate the epidemiology and AMR patterns within Salmonella isolates from two adjacent US states.

Results

WGS data of 200 S. enterica isolates recovered from human (n = 44), swine (n = 32), poultry (n = 22), and farm environment (n = 102) sources were used for in silico prediction of serovar and of the distribution of virulence genes, and the isolates were phylogenetically clustered using core genome single nucleotide polymorphism (SNP) and feature frequency profiling (FFP). Furthermore, AMR was predicted genotypically using five curated AMR databases and compared with phenotypic AMR determined by broth microdilution. Core genome SNP-based and FFP-based phylogenetic trees showed consistent clustering of isolates into the respective serovars, and suggested clustering of isolates based on the source of isolation. The overall correlation of phenotypic and genotypic AMR was 87.61% and 97.13% for sensitivity and specificity, respectively. AMR and virulence genes clustered with the Salmonella serovars, while there were also associations between the presence of virulence genes in both animal/environmental isolates and human clinical samples.

Conclusions

WGS is a helpful tool for Salmonella phylogenetic analysis, AMR and virulence gene predictions. The clinical isolates clustered closely with animal and environmental isolates, suggesting that animals and environment are potential sources for dissemination of AMR and virulence genes between Salmonella serovars.

Electronic supplementary material

The online version of this article (10.1186/s12864-018-5137-4) contains supplementary material, which is available to authorized users.

Caetano-Anollés G, Nasir A, Kim KM, Caetano-Anollés D. Rooting Phylogenies and the Tree of Life While Minimizing Ad Hoc and Auxiliary Assumptions. Evol Bioinform Online 2018;14:1176934318805101. [PMID: 30364468 PMCID: PMC6196624 DOI: 10.1177/1176934318805101] [Citation(s) in RCA: 38] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2018] [Accepted: 09/05/2018] [Indexed: 12/25/2022] Open

Wiegand S, Jogler M, Jogler C. On the maverick Planctomycetes. FEMS Microbiol Rev 2018;42:739-760. [DOI: 10.1093/femsre/fuy029] [Citation(s) in RCA: 134] [Impact Index Per Article: 22.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2018] [Accepted: 07/22/2018] [Indexed: 01/01/2023] Open

Ren J, Bai X, Lu YY, Tang K, Wang Y, Reinert G, Sun F. Alignment-Free Sequence Analysis and Applications. Annu Rev Biomed Data Sci 2018;1:93-114. [PMID: 31828235 PMCID: PMC6905628 DOI: 10.1146/annurev-biodatasci-080917-013431] [Citation(s) in RCA: 58] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]

Staley JT, Caetano-Anollés G. Archaea-First and the Co-Evolutionary Diversification of Domains of Life. Bioessays 2018;40:e1800036. [PMID: 29944192 DOI: 10.1002/bies.201800036] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2018] [Revised: 05/12/2018] [Indexed: 12/13/2022]

Barbieri M. What is code biology? Biosystems 2018;164:1-10. [DOI: 10.1016/j.biosystems.2017.10.005] [Citation(s) in RCA: 44] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2017] [Revised: 10/04/2017] [Accepted: 10/05/2017] [Indexed: 01/29/2023]

Jun SR, Wassenaar TM, Wanchai V, Patumcharoenpol P, Nookaew I, Ussery DW. Suggested mechanisms for Zika virus causing microcephaly: what do the genomes tell us? BMC Bioinformatics 2017;18:471. [PMID: 29297281 PMCID: PMC5751795 DOI: 10.1186/s12859-017-1894-3] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open

Abstract

Background

Zika virus (ZIKV) is an emerging human pathogen. Since its arrival in the Western hemisphere, from Africa via Asia, it has become a serious threat to pregnant women, causing microcephaly and other neuropathies in developing fetuses. The mechanisms behind these teratogenic effects are unknown, although epidemiological evidence suggests that microcephaly is not associated with the original, African lineage of ZIKV. The sequences of 196 published ZIKV genomes were used to assess whether recently proposed mechanistic explanations for microcephaly are supported by molecular level changes that may have increased its virulence since the virus left Africa. For this we performed phylogenetic, recombination, adaptive evolution and tetramer frequency analyses, and compared protein sequences for the presence of protease cleavage sites, Pfam domains, glycosylation sites, signal peptides, trans-membrane protein domains, and phosphorylation sites.

Results

Recombination events within or between Asian and Brazilian lineages were not observed, and likewise there were no differences in protease cleavage, glycosylation sites, signal peptides or trans-membrane domains between African and Brazilian strains. The frequency of Retinoic Acid Response Element (RARE) sequences was increased in Brazilian strains. Genetic adaptation was also apparent from tetramer signatures that had undergone major changes in the past but have stabilized in the Brazilian lineage despite subsequent geographic spread, suggesting the viral population presently propagates in the same host species in various regions. Evidence for selection pressure was recognized for several amino acid sites in the Brazilian lineage compared to the African lineage, mainly in nonstructural proteins, especially protein NS4B. A number of these positively selected mutations resulted in an increased potential to be phosphorylated in the Brazilian lineage compared to the African lineage, which may have increased their potential to interfere with neural fetal development.

Conclusions

ZIKV seems to have adapted to a limited number of hosts, including humans, during which its virulence increased. Its protein NS4B, together with NS4A, has recently been shown to inhibit Akt-mTOR signaling in human fetal neural stem cells, a key pathway for brain development. We hypothesize that positive selection of novel phosphorylation sites in the protein NS4B of the Brazilian lineage could interfere with phosphorylation of Akt and mTOR, impairing Akt-mTOR signaling and this may result in an increased risk for developmental neuropathies.

Electronic supplementary material

The online version of this article (10.1186/s12859-017-1894-3) contains supplementary material, which is available to authorized users.

Camiolo S, Porru C, Benítez-Cabello A, Rodríguez-Gómez F, Calero-Delgado B, Porceddu A, Budroni M, Mannazzu I, Jiménez-Díaz R, Arroyo-López FN. Genome overview of eight Candida boidinii strains isolated from human activities and wild environments. Stand Genomic Sci 2017;12:70. [PMID: 29213357 PMCID: PMC5712119 DOI: 10.1186/s40793-017-0281-z] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2017] [Accepted: 11/21/2017] [Indexed: 11/10/2022] Open

Zielezinski A, Vinga S, Almeida J, Karlowski WM. Alignment-free sequence comparison: benefits, applications, and tools. Genome Biol 2017;18:186. [PMID: 28974235 PMCID: PMC5627421 DOI: 10.1186/s13059-017-1319-7] [Citation(s) in RCA: 244] [Impact Index Per Article: 34.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023] Open

Choi J, Kim SH. A genome Tree of Life for the Fungi kingdom. Proc Natl Acad Sci U S A 2017;114:9391-9396. [PMID: 28808018 PMCID: PMC5584464 DOI: 10.1073/pnas.1711939114] [Citation(s) in RCA: 90] [Impact Index Per Article: 12.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

He L, Li Y, He RL, Yau SST. A novel alignment-free vector method to cluster protein sequences. J Theor Biol 2017;427:41-52. [DOI: 10.1016/j.jtbi.2017.06.002] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2017] [Revised: 05/04/2017] [Accepted: 06/02/2017] [Indexed: 11/29/2022]

Seo H, Cho DH. A new alignment free genome comparison algorithm based on statistically estimated feature frequency profile. Annu Int Conf IEEE Eng Med Biol Soc 2017;2017:4265-4268. [PMID: 29060839 DOI: 10.1109/embc.2017.8037798] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]

Marin J, Battistuzzi FU, Brown AC, Hedges SB. The Timetree of Prokaryotes: New Insights into Their Evolution and Speciation. Mol Biol Evol 2017;34:437-446. [PMID: 27965376 DOI: 10.1093/molbev/msw245] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

Staley JT. Domain Cell Theory supports the independent evolution of the Eukarya, Bacteria and Archaea and the Nuclear Compartment Commonality hypothesis. Open Biol 2017;7:170041. [PMID: 28659382 PMCID: PMC5493775 DOI: 10.1098/rsob.170041] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2017] [Accepted: 05/26/2017] [Indexed: 01/15/2023] Open

Abstract

In 2015, the Royal Society of London held a meeting to discuss the various hypotheses regarding the origin of the Eukarya. Although not all participants supported a hypothesis, the proposals that did fit into two broad categories: one group favoured 'Prokaryotes First' hypotheses and another addressed 'Eukaryotes First' hypotheses. Those who proposed Prokaryotes First hypotheses advocated either a fusion event between a bacterium and an archaeon that produced the first eukaryote or the direct evolution of the Eukarya from the Archaea. The Eukaryotes First proponents posit that the eukaryotes evolved initially and then, by reductive evolution, produced the Bacteria and Archaea. No mention was made of another previously published hypothesis termed the Nuclear Compartment Commonality (NuCom) hypothesis, which proposed the evolution of the Eukarya and Bacteria from nucleated ancestors (Staley 2013 Astrobiol Outreach 1, 105 (doi:10.4172/2332-2519.1000105)). Evidence from two studies indicates that the nucleated Planctomycetes-Verrucomicrobia-Chlamydia superphylum members are the most ancient Bacteria known (Brochier & Philippe 2002 Nature 417, 244 (doi:10.1038/417244a); Jun et al. 2010 Proc. Natl Acad. Sci. USA 107, 133-138 (doi:10.1073/pnas.0913033107)). This review summarizes the evidence for the NuCom hypothesis and discusses how simple the NuCom hypothesis is in explaining eukaryote evolution relative to the other hypotheses. The philosophical importance of simplicity and its relationship to truth in hypotheses such as NuCom and Domain Cell Theory is presented. Domain Cell Theory is also proposed herein, which contends that each of the three cellular lineages of life, the Archaea, Bacteria and Eukarya domains, evolved independently, in support of the NuCom hypothesis. All other proposed hypotheses violate Domain Cell Theory because they posit the evolution of different cellular descendants from ancestral cellular types.

Sagulenko E, Nouwens A, Webb RI, Green K, Yee B, Morgan G, Leis A, Lee KC, Butler MK, Chia N, Pham UTP, Lindgreen S, Catchpole R, Poole AM, Fuerst JA. Nuclear Pore-Like Structures in a Compartmentalized Bacterium. PLoS One 2017;12:e0169432. [PMID: 28146565 PMCID: PMC5287468 DOI: 10.1371/journal.pone.0169432] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2016] [Accepted: 12/02/2016] [Indexed: 01/02/2023] Open

Affiliation(s)

Evgeny Sagulenko School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Queensland, Australia
Amanda Nouwens School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Queensland, Australia
Richard I. Webb Centre for Microscopy and Microanalysis, The University of Queensland, Brisbane, Queensland, Australia
Kathryn Green Centre for Microscopy and Microanalysis, The University of Queensland, Brisbane, Queensland, Australia
Benjamin Yee School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Queensland, Australia
Garry Morgan Centre for Microscopy and Microanalysis, The University of Queensland, Brisbane, Queensland, Australia
Andrew Leis CSIRO - Livestock Industries, Australian Animal Health Laboratory, Biosecurity Microscopy Facility (ABMF), Geelong, Victoria, Australia
Kuo-Chang Lee School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Queensland, Australia
Margaret K. Butler School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Queensland, Australia
Nicholas Chia Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
Uyen Thi Phuong Pham School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Queensland, Australia
Stinus Lindgreen School of Biological Sciences, University of Canterbury, Christchurch, New Zealand
Ryan Catchpole School of Biological Sciences, University of Canterbury, Christchurch, New Zealand Biomolecular Interaction Centre, University of Canterbury, Christchurch, New Zealand
Anthony M. Poole School of Biological Sciences, University of Canterbury, Christchurch, New Zealand Biomolecular Interaction Centre, University of Canterbury, Christchurch, New Zealand Allan Wilson Centre, University of Canterbury, Christchurch, New Zealand Bioinformatics Institute, School of Biological Sciences, University of Auckland, Auckland, New Zealand
John A. Fuerst School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Queensland, Australia

Staley JT, Fuerst JA. Ancient, highly conserved proteins from a LUCA with complex cell biology provide evidence in support of the nuclear compartment commonality (NuCom) hypothesis. Res Microbiol 2017;168:395-412. [PMID: 28111289 DOI: 10.1016/j.resmic.2017.01.001] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2016] [Revised: 01/08/2017] [Accepted: 01/09/2017] [Indexed: 12/23/2022]

Viral Phylogenomics Using an Alignment-Free Method: A Three-Step Approach to Determine Optimal Length of k-mer. Sci Rep 2017;7:40712. [PMID: 28102365 PMCID: PMC5244389 DOI: 10.1038/srep40712] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2016] [Accepted: 12/08/2016] [Indexed: 11/25/2022] Open

How Did the Eukaryotes Evolve? ACTA ACUST UNITED AC 2016. [DOI: 10.1007/s13752-016-0253-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]

Chen S, Deng LY, Bowman D, Shiau JJH, Wong TY, Madahian B, Lu HHS. Phylogenetic tree construction using trinucleotide usage profile (TUP). BMC Bioinformatics 2016;17:381. [PMID: 27766939 PMCID: PMC5073869 DOI: 10.1186/s12859-016-1222-3] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

It has been a challenging task to build a genome-wide phylogenetic tree for a large group of species containing a large number of genes with long nucleotide sequences. The most popular method, called feature frequency profile (FFP-k), finds the frequency distribution for all words of a certain length k over the whole genome sequence using (overlapping) windows of the same length. For a satisfactory result, the recommended word length (k) ranges from 6 to 15 and it may not be a multiple of 3 (codon length). The total number of possible words needed for FFP-k can range from 4⁶=4096 to 4¹⁵.
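The overlapping-window word count described above can be sketched as follows. This is only an illustration of the counting step, under simplifying assumptions: the function name `feature_frequency_profile` and the toy sequence are hypothetical, and the sketch does not filter ambiguous bases, handle reverse complements, or compute distances between profiles as a full FFP pipeline would.

```python
from collections import Counter


def feature_frequency_profile(seq, k):
    """Relative frequency of every overlapping word of length k in seq."""
    seq = seq.upper()
    if k <= 0 or len(seq) < k:
        raise ValueError("k must be positive and no longer than the sequence")
    # Slide a window of length k one position at a time.
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    total = sum(counts.values())
    return {word: n / total for word, n in counts.items()}


# Toy sequence: 7 overlapping 2-mers (AC, CG, GT, TA, AC, CG, GT).
profile = feature_frequency_profile("ACGTACGT", 2)

# Size of the word space for the shortest recommended k.
possible_words_k6 = 4 ** 6
```

The dictionary stores only observed words, which matters in practice because the full word space grows as 4 to the power k and quickly becomes far larger than the number of words a genome actually contains.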

RESULTS

We propose a simple improvement over the popular FFP method using only a typical word length of 3. A new method, called Trinucleotide Usage Profile (TUP), is proposed based only on the (relative) frequency distribution using non-overlapping windows of length 3. The total number of possible words needed for TUP is 4³=64, which is much less than the total count for the recommended optimal "resolution" for FFP. To build a phylogenetic tree, we propose first representing each of the species by a TUP vector and then using an appropriate distance measure between pairs of the TUP vectors for the tree construction. In particular, we propose summarizing a DNA sequence by a matrix of three rows corresponding to three reading frames, recording the frequency distribution of the non-overlapping words of length 3 in each reading frame. We also provide a numerical measure for comparing trees constructed with various methods.
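The three-row summary described above can be sketched as a 3 x 64 matrix of relative trinucleotide frequencies, one row per reading frame. This is a hedged illustration rather than the authors' code: the names `tup_matrix`, `CODONS` and `INDEX` are hypothetical, the column order (lexicographic over A, C, G, T) is an arbitrary choice, and words containing characters other than A, C, G, T are simply skipped.

```python
from itertools import product

# All 64 trinucleotides in a fixed (assumed) lexicographic order.
CODONS = ["".join(p) for p in product("ACGT", repeat=3)]
INDEX = {codon: i for i, codon in enumerate(CODONS)}


def tup_matrix(seq):
    """Return 3 rows (reading frames 0, 1, 2) of 64 relative frequencies."""
    seq = seq.upper()
    matrix = []
    for frame in range(3):
        counts = [0] * 64
        # Non-overlapping windows of length 3, starting at the frame offset.
        for i in range(frame, len(seq) - 2, 3):
            word = seq[i:i + 3]
            if word in INDEX:
                counts[INDEX[word]] += 1
        total = sum(counts)
        matrix.append([c / total if total else 0.0 for c in counts])
    return matrix


# Toy sequence: frame 0 reads ATG x3, frame 1 reads TGA x3, frame 2 reads GAT x2.
m = tup_matrix("ATGATGATGA")
```

Keeping the frames as separate rows preserves the codon-position signal that a single pooled profile would blend together.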

CONCLUSIONS

Our empirical study showed that, compared with the FFP method, the proposed TUP method builds phylogenetic trees with stronger biological support. We further provide some justification for this from an information-theoretic viewpoint. Unlike the FFP method, the TUP method takes advantage of the fact that the start of the first reading frame is (usually) known. Without this information, the FFP method can rely only on the frequency distribution of overlapping words, which is the average (or mixture) of the frequency distributions of the three possible reading frames. Consequently, we show (from the entropy viewpoint) that the FFP procedure could dilute important gene information and therefore provide less accurate classification.
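The mixture statement above can be checked numerically for words of length 3: every overlapping window belongs to exactly one reading frame, so the overlapping counts equal the pooled counts of the three frames. The sketch below also compares Shannon entropies. It is an illustration on a hypothetical toy sequence, not the authors' entropy argument, and a higher entropy for the pooled profile does not by itself demonstrate worse classification.

```python
import math
from collections import Counter


def frame_counts(seq, frame):
    """Counts of non-overlapping 3-mers starting at the given frame offset."""
    return Counter(seq[i:i + 3] for i in range(frame, len(seq) - 2, 3))


def overlapping_counts(seq):
    """Counts of all overlapping 3-mers."""
    return Counter(seq[i:i + 3] for i in range(len(seq) - 2))


def entropy_bits(counts):
    """Shannon entropy (in bits) of the distribution given by the counts."""
    total = sum(counts.values())
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


seq = "ATGGCCATGGCCATGGCCATGGCC"  # hypothetical periodic toy sequence
frames = [frame_counts(seq, f) for f in range(3)]
pooled = frames[0] + frames[1] + frames[2]
overlap = overlapping_counts(seq)

# Weighted mean of the per-frame entropies (weights = windows per frame).
n_windows = sum(overlap.values())
mean_frame_entropy = sum(
    sum(fc.values()) / n_windows * entropy_bits(fc) for fc in frames
)
pooled_entropy = entropy_bits(overlap)
```

Because entropy is concave, the pooled profile can never have lower entropy than the weighted mean of the per-frame profiles, which is one way to read the abstract's remark that pooling frames may dilute frame-specific information.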

Hahn L, Leimeister CA, Ounit R, Lonardi S, Morgenstern B. rasbhari: Optimizing Spaced Seeds for Database Searching, Read Mapping and Alignment-Free Sequence Comparison. PLoS Comput Biol 2016;12:e1005107. [PMID: 27760124 PMCID: PMC5070788 DOI: 10.1371/journal.pcbi.1005107] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2016] [Accepted: 08/11/2016] [Indexed: 12/05/2022] Open

Pinos S, Pontarotti P, Raoult D, Baudoin JP, Pagnier I. Compartmentalization in PVC super-phylum: evolution and impact. Biol Direct 2016;11:38. [PMID: 27507008 PMCID: PMC4977879 DOI: 10.1186/s13062-016-0144-3] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2016] [Accepted: 08/02/2016] [Indexed: 11/27/2022] Open

Abstract

BACKGROUND

The PVC super-phylum gathers bacteria from seven phyla (Planctomycetes, Verrucomicrobiae, Chlamydiae, Lentisphaera, Poribacteria, OP3, WWE2) presenting different lifestyles, cell plans and environments. Planctomyces and several Verrucomicrobiae exhibit a complex cell plan, with an intracytoplasmic membrane that divides the cytoplasm into two regions (pirellulosome and paryphoplasm). The evolution and function of this cell plan are still subject to debate. In this work, we hypothesized that it could play a role in protecting bacterial DNA, especially against horizontal gene transfer (HGT). Therefore, 64 bacterial genomes belonging to seven different phyla (including four PVC phyla) were studied. We reconstructed the evolution of the cell plan as precisely as possible, using information obtained from a literature survey and electron microscopy. We used a comparative phylogenomics strategy to determine the share of horizontal transfers in each genome studied.

RESULTS

Our results show that the bacteria Simkania negevensis (Chlamydiae) and Coraliomargarita akajimensis (Verrucomicrobiae), whose cell plans were previously unknown, are compartmentalized, as can be seen in the micrographs. This is one of the first indications of the presence of an intracytoplasmic membrane in a member of the Chlamydiae. The proportion of HGT does not seem to be related to the cell plan of bacteria, suggesting that compartmentalization does not protect bacterial DNA against HGT. Conversely, the lifestyle of bacteria seems to affect their ability to exchange genes.

CONCLUSIONS

Our study allows a better reconstruction of the evolution of the intracytoplasmic membrane, but this structure seems to have no impact on HGT occurrence.

REVIEWERS

This article was reviewed by Mircea Podar and Olivier Tenaillon.

Bernard G, Chan CX, Ragan MA. Alignment-free microbial phylogenomics under scenarios of sequence divergence, genome rearrangement and lateral genetic transfer. Sci Rep 2016;6:28970. [PMID: 27363362 PMCID: PMC4929450 DOI: 10.1038/srep28970] [Citation(s) in RCA: 42] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2016] [Accepted: 06/13/2016] [Indexed: 12/22/2022] Open