1
Tiemann JKS, Zschach H, Lindorff-Larsen K, Stein A. Interpreting the molecular mechanisms of disease variants in human transmembrane proteins. Biophys J 2023:S0006-3495(22)03941-8. [PMID: 36600598 DOI: 10.1016/j.bpj.2022.12.031] [Received: 08/31/2022] [Revised: 11/19/2022] [Accepted: 12/21/2022]
Abstract
Next-generation sequencing of human genomes reveals millions of missense variants, some of which may lead to loss of protein function and ultimately disease. Here, we investigate missense variants in membrane proteins, key drivers in cell signaling and recognition. We find enrichment of pathogenic variants in the transmembrane region across 19,000 functionally classified variants in human membrane proteins. To accurately predict variant consequences, one fundamentally needs to understand the underlying molecular processes. A key mechanism underlying pathogenicity in missense variants of soluble proteins has been shown to be loss of stability. Membrane proteins, however, are widely understudied. Here, we interpret variant effects on a larger scale by performing structure-based estimations of changes in thermodynamic stability using a membrane-specific energy function and analyses of sequence conservation during evolution of 15 transmembrane proteins. We find evidence for loss of stability being the cause of pathogenicity in more than half of the pathogenic variants, indicating that this is a driving factor also in membrane-protein-associated diseases. Our findings show how computational tools aid in gaining mechanistic insights into variant consequences for membrane proteins. To enable broader analyses of disease-related and population variants, we include variant mappings for the entire human proteome.
Affiliation(s)
- Johanna Katarina Sofie Tiemann
  - Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark; Section for Computational and RNA Biology, Department of Biology, University of Copenhagen, Copenhagen, Denmark
- Henrike Zschach
  - Section for Computational and RNA Biology, Department of Biology, University of Copenhagen, Copenhagen, Denmark
- Kresten Lindorff-Larsen
  - Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
- Amelie Stein
  - Section for Computational and RNA Biology, Department of Biology, University of Copenhagen, Copenhagen, Denmark
2
Valanciute A, Nygaard L, Zschach H, Maglegaard Jepsen M, Lindorff-Larsen K, Stein A. Accurate protein stability predictions from homology models. Comput Struct Biotechnol J 2022; 21:66-73. [PMID: 36514339 PMCID: PMC9729920 DOI: 10.1016/j.csbj.2022.11.048] [Received: 08/15/2022] [Revised: 11/22/2022] [Accepted: 11/23/2022]
Abstract
Calculating changes in protein stability (ΔΔG) has been shown to be central for predicting the consequences of single amino acid substitutions in protein engineering as well as interpretation of genomic variants for disease risk. Structure-based calculations are considered most accurate, however the tools used to calculate ΔΔGs have been developed on experimentally resolved structures. Extending those calculations to homology models based on related proteins would greatly extend their applicability as large parts of e.g. the human proteome are not structurally resolved. In this study we aim to investigate the accuracy of ΔΔG values predicted on homology models compared to crystal structures. Specifically, we identified four proteins with a large number of experimentally tested ΔΔGs and templates for homology modeling across a broad range of sequence identities, and selected three methods for ΔΔG calculations to test. We find that ΔΔG-values predicted from homology models compare equally well to experimental ΔΔGs as those predicted on experimentally established crystal structures, as long as the sequence identity of the model template to the target protein is at least 40%. In particular, the Rosetta cartesian_ddg protocol is robust against the small perturbations in the structure which homology modeling introduces. In an independent assessment, we observe a similar trend when using ΔΔGs to categorize variants as low or wild-type-like abundance. Overall, our results show that stability calculations performed on homology models can substitute for those on crystal structures with acceptable accuracy as long as the model is built on a template with sequence identity of at least 40% to the target protein.
Affiliation(s)
- Audrone Valanciute
  - Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
- Lasse Nygaard
  - Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
- Henrike Zschach
  - Section for Computational and RNA Biology, Department of Biology, University of Copenhagen, Copenhagen, Denmark
- Michael Maglegaard Jepsen
  - Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
- Kresten Lindorff-Larsen
  - Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark (corresponding author)
- Amelie Stein
  - Section for Computational and RNA Biology, Department of Biology, University of Copenhagen, Copenhagen, Denmark (corresponding author)
3
Rossi A, Treu L, Toppo S, Zschach H, Campanaro S, Dutilh BE. Evolutionary Study of the Crassphage Virus at Gene Level. Viruses 2020; 12:v12091035. [PMID: 32957679 PMCID: PMC7551546 DOI: 10.3390/v12091035] [Received: 08/07/2020] [Revised: 09/03/2020] [Accepted: 09/14/2020]
Abstract
crAss-like viruses are a putative family of bacteriophages recently discovered. The eponym of the clade, crAssphage, is an enteric bacteriophage estimated to be present in at least half of the human population and it constitutes up to 90% of the sequences in some human fecal viral metagenomic datasets. We focused on the evolutionary dynamics of the genes encoded on the crAssphage genome. By investigating the conservation of the genes, a consistent variation in the evolutionary rates across the different functional groups was found. Gene duplications in crAss-like genomes were detected. By exploring the differences among the functional categories of the genes, we confirmed that the genes encoding capsid proteins were the most ubiquitous, despite their overall low sequence conservation. It was possible to identify a core of proteins whose evolutionary trees strongly correlate with each other, suggesting their genetic interaction. This group includes the capsid proteins, which are thus established as extremely suitable for rebuilding the phylogenetic tree of this viral clade. A negative correlation between the ubiquity and the conservation of viral protein sequences was shown. Together, this study provides an in-depth picture of the evolution of different genes in crAss-like viruses.
Affiliation(s)
- Alessandro Rossi
  - Department of Biology, University of Padova, 35131 Padova, Italy
- Laura Treu
  - Department of Biology, University of Padova, 35131 Padova, Italy
  - Correspondence: Tel.: +39-049-827-6165
- Stefano Toppo
  - Department of Molecular Medicine, University of Padova, 35131 Padova, Italy
- Henrike Zschach
  - Department of Biology, University of Copenhagen, 1017 Copenhagen, Denmark
- Stefano Campanaro
  - Department of Biology, University of Padova, 35131 Padova, Italy
  - CRIBI Biotechnology Center, University of Padua, 35131 Padova, Italy
- Bas E. Dutilh
  - Institute of Biodynamics and Biocomplexity, University of Utrecht, 3508 Utrecht, The Netherlands
4
Edwards RA, Vega AA, Norman HM, Ohaeri M, Levi K, Dinsdale EA, Cinek O, Aziz RK, McNair K, Barr JJ, Bibby K, Brouns SJJ, Cazares A, de Jonge PA, Desnues C, Díaz Muñoz SL, Fineran PC, Kurilshikov A, Lavigne R, Mazankova K, McCarthy DT, Nobrega FL, Reyes Muñoz A, Tapia G, Trefault N, Tyakht AV, Vinuesa P, Wagemans J, Zhernakova A, Aarestrup FM, Ahmadov G, Alassaf A, Anton J, Asangba A, Billings EK, Cantu VA, Carlton JM, Cazares D, Cho GS, Condeff T, Cortés P, Cranfield M, Cuevas DA, De la Iglesia R, Decewicz P, Doane MP, Dominy NJ, Dziewit L, Elwasila BM, Eren AM, Franz C, Fu J, Garcia-Aljaro C, Ghedin E, Gulino KM, Haggerty JM, Head SR, Hendriksen RS, Hill C, Hyöty H, Ilina EN, Irwin MT, Jeffries TC, Jofre J, Junge RE, Kelley ST, Khan Mirzaei M, Kowalewski M, Kumaresan D, Leigh SR, Lipson D, Lisitsyna ES, Llagostera M, Maritz JM, Marr LC, McCann A, Molshanski-Mor S, Monteiro S, Moreira-Grez B, Morris M, Mugisha L, Muniesa M, Neve H, Nguyen NP, Nigro OD, Nilsson AS, O'Connell T, Odeh R, Oliver A, Piuri M, Prussin Ii AJ, Qimron U, Quan ZX, Rainetova P, Ramírez-Rojas A, Raya R, Reasor K, Rice GAO, Rossi A, Santos R, Shimashita J, Stachler EN, Stene LC, Strain R, Stumpf R, Torres PJ, Twaddle A, Ugochi Ibekwe M, Villagra N, Wandro S, White B, Whiteley A, Whiteson KL, Wijmenga C, Zambrano MM, Zschach H, Dutilh BE. Global phylogeography and ancient evolution of the widespread human gut virus crAssphage. Nat Microbiol 2019. [PMID: 31285584 DOI: 10.1038/s41564-019-04904-6]
Abstract
Microbiomes are vast communities of microorganisms and viruses that populate all natural ecosystems. Viruses have been considered to be the most variable component of microbiomes, as supported by virome surveys and examples of high genomic mosaicism. However, recent evidence suggests that the human gut virome is remarkably stable compared with that of other environments. Here, we investigate the origin, evolution and epidemiology of crAssphage, a widespread human gut virus. Through a global collaboration, we obtained DNA sequences of crAssphage from more than one-third of the world's countries and showed that the phylogeography of crAssphage is locally clustered within countries, cities and individuals. We also found fully colinear crAssphage-like genomes in both Old-World and New-World primates, suggesting that the association of crAssphage with primates may be millions of years old. Finally, by exploiting a large cohort of more than 1,000 individuals, we tested whether crAssphage is associated with bacterial taxonomic groups of the gut microbiome, diverse human health parameters and a wide range of dietary factors. We identified strong correlations with different clades of bacteria that are related to Bacteroidetes and weak associations with several diet categories, but no significant association with health or disease. We conclude that crAssphage is a benign cosmopolitan virus that may have coevolved with the human lineage and is an integral part of the normal human gut virome.
Affiliation(s)
- Robert A Edwards
  - Department of Biology, San Diego State University, San Diego, CA, USA
  - The Viral Information Institute, San Diego State University, San Diego, CA, USA
- Alejandro A Vega
  - Department of Biology, San Diego State University, San Diego, CA, USA
- Holly M Norman
  - Department of Biology, San Diego State University, San Diego, CA, USA
- Maria Ohaeri
  - Department of Biology, San Diego State University, San Diego, CA, USA
- Kyle Levi
  - Department of Computer Science, San Diego State University, San Diego, CA, USA
- Ondrej Cinek
  - Department of Pediatrics, 2nd Faculty of Medicine, Charles University in Prague, Prague, Czech Republic
- Ramy K Aziz
  - Department of Microbiology and Immunology, Faculty of Pharmacy, Cairo University, Cairo, Egypt
- Katelyn McNair
  - Computational Sciences Research Center, San Diego State University, San Diego, CA, USA
- Jeremy J Barr
  - School of Biological Sciences, Monash University, Clayton, Victoria, Australia
- Kyle Bibby
  - Civil and Environmental Engineering and Earth Sciences, University of Notre Dame, Notre Dame, IN, USA
- Stan J J Brouns
  - Department of Bionanoscience, Kavli Institute of Nanoscience, Delft University of Technology, Delft, The Netherlands
- Adrian Cazares
  - Institute of Infection and Global Health, University of Liverpool, Liverpool, UK
- Patrick A de Jonge
  - Department of Bionanoscience, Kavli Institute of Nanoscience, Delft University of Technology, Delft, The Netherlands
  - Theoretical Biology and Bioinformatics, Science4Life, Utrecht University, Utrecht, The Netherlands
- Christelle Desnues
  - MEPHI, Aix-Marseille Université, IRD, AP-HM, CNRS, IHU Méditerranée Infection, Marseille, France
  - Mediterranean Institute of Oceanography, Aix-Marseille Université, Université de Toulon, CNRS, IRD, UM 110, Marseille, France
- Samuel L Díaz Muñoz
  - Center for Genomics and Systems Biology & Department of Biology, New York University, New York, NY, USA
  - Department of Microbiology and Molecular Genetics, University of California, Davis, Davis, CA, USA
- Peter C Fineran
  - Department of Microbiology and Immunology, University of Otago, Dunedin, New Zealand
- Alexander Kurilshikov
  - Department of Genetics, University Medical Center Groningen, Groningen, The Netherlands
- Rob Lavigne
  - Department of Biosystems, KU Leuven, Leuven, Belgium
- Karla Mazankova
  - Department of Pediatrics, 2nd Faculty of Medicine, Charles University in Prague, Prague, Czech Republic
- David T McCarthy
  - EPHM Lab, Civil Engineering Department, Monash University, Clayton, Victoria, Australia
- Franklin L Nobrega
  - Department of Bionanoscience, Kavli Institute of Nanoscience, Delft University of Technology, Delft, The Netherlands
- Alejandro Reyes Muñoz
  - Max Planck Tandem Group in Computational Biology, Departamento de Ciencias Biológicas, Universidad de los Andes, Bogotá, Colombia
- German Tapia
  - Department of Child Health, Norwegian Institute of Public Health, Oslo, Norway
- Nicole Trefault
  - GEMA Center for Genomics, Ecology & Environment, Universidad Mayor, Huechuraba, Chile
- Alexander V Tyakht
  - Laboratory of Bioinformatics, Federal Research and Clinical Center of Physical-Chemical Medicine, Moscow, Russia
  - Department of Informational Technologies, ITMO University, Saint Petersburg, Russia
- Pablo Vinuesa
  - Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Mexico
- Alexandra Zhernakova
  - Department of Genetics, University Medical Center Groningen, Groningen, The Netherlands
- Frank M Aarestrup
  - National Food Institute, Research Group for Genomic Epidemiology, Technical University of Denmark, Kongens Lyngby, Denmark
- Abeer Alassaf
  - Department of Pediatrics, School of Medicine, University of Jordan, Amman, Jordan
- Josefa Anton
  - Department of Physiology, Genetics and Microbiology, University of Alicante, Alicante, Spain
- Abigail Asangba
  - Carl R. Woese Institute of Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA
- Emma K Billings
  - Department of Biology, San Diego State University, San Diego, CA, USA
- Vito Adrian Cantu
  - Computational Sciences Research Center, San Diego State University, San Diego, CA, USA
- Jane M Carlton
  - Center for Genomics and Systems Biology & Department of Biology, New York University, New York, NY, USA
- Daniel Cazares
  - Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Mexico
- Gyu-Sung Cho
  - Department of Microbiology and Biotechnology, Max Rubner-Institut, Federal Research Institute of Nutrition and Food, Kiel, Germany
- Tess Condeff
  - Department of Biology, San Diego State University, San Diego, CA, USA
- Pilar Cortés
  - Departament de Genètica i de Microbiologia, Universitat Autònoma De Barcelona, Barcelona, Spain
- Mike Cranfield
  - Wildlife Health Center, University of California, Davis, Davis, CA, USA
- Daniel A Cuevas
  - Computational Sciences Research Center, San Diego State University, San Diego, CA, USA
- Rodrigo De la Iglesia
  - Departamento de Genética Molecular y Microbiología, Pontificia Universidad Católica de Chile, Santiago, Chile
- Przemyslaw Decewicz
  - Department of Bacterial Genetics, Institute of Microbiology, Faculty of Biology, University of Warsaw, Warsaw, Poland
- Michael P Doane
  - Department of Biology, San Diego State University, San Diego, CA, USA
- Lukasz Dziewit
  - Department of Bacterial Genetics, Institute of Microbiology, Faculty of Biology, University of Warsaw, Warsaw, Poland
- Bashir Mukhtar Elwasila
  - Department of Pediatrics and Child Health, Faculty of Medicine, University of Khartoum, Khartoum, Sudan
- A Murat Eren
  - Department of Medicine, University of Chicago, Chicago, IL, USA
- Charles Franz
  - Department of Microbiology and Biotechnology, Max Rubner-Institut, Federal Research Institute of Nutrition and Food, Kiel, Germany
- Jingyuan Fu
  - Department of Pediatrics, University Medical Center Groningen, Groningen, The Netherlands
- Cristina Garcia-Aljaro
  - Department of Genetics, Microbiology and Statistics, Universitat de Barcelona, Barcelona, Spain
- Elodie Ghedin
  - Center for Genomics and Systems Biology & Department of Biology, New York University, New York, NY, USA
- Kristen M Gulino
  - Center for Genomics and Systems Biology & Department of Biology, New York University, New York, NY, USA
- John M Haggerty
  - Department of Biology, San Diego State University, San Diego, CA, USA
- Steven R Head
  - Next Generation Sequencing and Microarray Core Facility, The Scripps Research Institute, La Jolla, CA, USA
- Rene S Hendriksen
  - National Food Institute, Research Group for Genomic Epidemiology, Technical University of Denmark, Kongens Lyngby, Denmark
- Colin Hill
  - School of Microbiology, University College Cork, Cork, Ireland
- Heikki Hyöty
  - Department of Virology, School of Medicine, University of Tampere, Tampere, Finland
- Elena N Ilina
  - Department of Molecular Biology and Genetics, Federal Research and Clinical Center of Physical-Chemical Medicine, Moscow, Russia
- Mitchell T Irwin
  - Department of Anthropology, Northern Illinois University, DeKalb, IL, USA
- Thomas C Jeffries
  - School of Science and Health, Western Sydney University, Penrith, New South Wales, Australia
- Juan Jofre
  - Department of Genetics, Microbiology and Statistics, Universitat de Barcelona, Barcelona, Spain
- Randall E Junge
  - Department of Animal Health, Columbus Zoo and Aquarium, Powell, OH, USA
- Scott T Kelley
  - Department of Biology, San Diego State University, San Diego, CA, USA
- Martin Kowalewski
  - Department Estacion Biologica Corrientes, Institution Museo Arg. Cs. Naturales-CONICET, Corrientes, Argentina
- Deepak Kumaresan
  - UWA School of Agriculture and Environment, University of Western Australia, Perth, Western Australia, Australia
- Steven R Leigh
  - Department of Anthropology, University of Colorado, Boulder, CO, USA
- David Lipson
  - Department of Biology, San Diego State University, San Diego, CA, USA
- Montserrat Llagostera
  - Departament de Genètica i de Microbiologia, Universitat Autònoma De Barcelona, Barcelona, Spain
- Julia M Maritz
  - Center for Genomics and Systems Biology & Department of Biology, New York University, New York, NY, USA
- Linsey C Marr
  - Department of Civil and Environmental Engineering, Virginia Tech, Blacksburg, VA, USA
- Angela McCann
  - APC Microbiome Institute, University College Cork, Cork, Ireland
- Shahar Molshanski-Mor
  - Clinical Microbiology & Immunology, Sackler School of Medicine, Tel Aviv University, Tel Aviv, Israel
- Silvia Monteiro
  - Laboratorio de Analises, Instituto Superior Tecnico, Universidade Lisboa, Lisboa, Portugal
- Benjamin Moreira-Grez
  - UWA School of Agriculture and Environment, University of Western Australia, Perth, Western Australia, Australia
- Megan Morris
  - Department of Biology, San Diego State University, San Diego, CA, USA
- Lawrence Mugisha
  - CEHA, Kampala, Uganda
  - COVAB, Makerere University, Kampala, Uganda
- Maite Muniesa
  - Department of Genetics, Microbiology and Statistics, Universitat de Barcelona, Barcelona, Spain
- Horst Neve
  - Department of Microbiology and Biotechnology, Max Rubner-Institut, Federal Research Institute of Nutrition and Food, Kiel, Germany
- Nam-Phuong Nguyen
  - Computer Science and Engineering, University of California, San Diego, La Jolla, CA, USA
- Olivia D Nigro
  - College of Natural and Computational Sciences, Hawai'i Pacific University, Kaneohe, HI, USA
- Anders S Nilsson
  - Department of Molecular Biosciences, Stockholm University, Stockholm, Sweden
- Taylor O'Connell
  - Biological and Medical Informatics Program, San Diego State University, San Diego, CA, USA
- Rasha Odeh
  - Department of Pediatrics, School of Medicine, University of Jordan, Amman, Jordan
- Andrew Oliver
  - Department of Molecular Biology & Biochemistry, University of California, Irvine, Irvine, CA, USA
- Mariana Piuri
  - Departamento de Química Biológica, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Buenos Aires, Argentina
- Aaron J Prussin II
  - Department of Civil and Environmental Engineering, Virginia Tech, Blacksburg, VA, USA
- Udi Qimron
  - Department of Clinical Microbiology and Immunology, Sackler School of Medicine, Tel Aviv University, Tel Aviv, Israel
- Zhe-Xue Quan
  - Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, Fudan University, Shanghai, China
- Petra Rainetova
  - Centre of Epidemiology and Microbiology, National Institute of Public Health, Prague, Czech Republic
- Kim Reasor
  - Department of Biology, San Diego State University, San Diego, CA, USA
- Alessandro Rossi
  - Theoretical Biology and Bioinformatics, Science4Life, Utrecht University, Utrecht, The Netherlands
  - Department of Biology, University of Padova, Padova, Italy
- Ricardo Santos
  - Laboratorio de Analises, Instituto Superior Tecnico, Universidade Lisboa, Lisboa, Portugal
- John Shimashita
  - Department of Civil and Environmental Engineering, Virginia Tech, Blacksburg, VA, USA
- Elyse N Stachler
  - Swanson School of Engineering, University of Pittsburgh, Pittsburgh, PA, USA
- Lars C Stene
  - Department of Child Health, Norwegian Institute of Public Health, Oslo, Norway
- Ronan Strain
  - APC Microbiome Institute, University College Cork, Cork, Ireland
- Rebecca Stumpf
  - Carl R. Woese Institute of Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA
- Pedro J Torres
  - Department of Biology, San Diego State University, San Diego, CA, USA
- Alan Twaddle
  - Center for Genomics and Systems Biology & Department of Biology, New York University, New York, NY, USA
- MaryAnn Ugochi Ibekwe
  - Department of Pediatrics, Federal Teaching Hospital Abakaliki, Ebonyi State University, Abakaliki, Nigeria
- Nicolás Villagra
  - Escuela de Tecnología Médica, Universidad Andres Bello, Santiago, Chile
- Stephen Wandro
  - Department of Molecular Biology & Biochemistry, University of California, Irvine, Irvine, CA, USA
- Bryan White
  - Carl R. Woese Institute of Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA
- Andy Whiteley
  - UWA School of Agriculture and Environment, University of Western Australia, Perth, Western Australia, Australia
- Katrine L Whiteson
  - Department of Molecular Biology & Biochemistry, University of California, Irvine, Irvine, CA, USA
- Cisca Wijmenga
  - Department of Genetics, University Medical Center Groningen, Groningen, The Netherlands
- Henrike Zschach
  - The Bioinformatics Centre, Department of Biology, University of Copenhagen, Copenhagen, Denmark
- Bas E Dutilh
  - Theoretical Biology and Bioinformatics, Science4Life, Utrecht University, Utrecht, The Netherlands
  - Centre for Molecular and Biomolecular Informatics, Radboud Institute for Molecular Life Sciences, Radboud University Medical Centre, Nijmegen, The Netherlands
5
Edwards RA, Vega AA, Norman HM, Ohaeri M, Levi K, Dinsdale EA, Cinek O, Aziz RK, McNair K, Barr JJ, Bibby K, Brouns SJJ, Cazares A, de Jonge PA, Desnues C, Díaz Muñoz SL, Fineran PC, Kurilshikov A, Lavigne R, Mazankova K, McCarthy DT, Nobrega FL, Reyes Muñoz A, Tapia G, Trefault N, Tyakht AV, Vinuesa P, Wagemans J, Zhernakova A, Aarestrup FM, Ahmadov G, Alassaf A, Anton J, Asangba A, Billings EK, Cantu VA, Carlton JM, Cazares D, Cho GS, Condeff T, Cortés P, Cranfield M, Cuevas DA, De la Iglesia R, Decewicz P, Doane MP, Dominy NJ, Dziewit L, Elwasila BM, Eren AM, Franz C, Fu J, Garcia-Aljaro C, Ghedin E, Gulino KM, Haggerty JM, Head SR, Hendriksen RS, Hill C, Hyöty H, Ilina EN, Irwin MT, Jeffries TC, Jofre J, Junge RE, Kelley ST, Khan Mirzaei M, Kowalewski M, Kumaresan D, Leigh SR, Lipson D, Lisitsyna ES, Llagostera M, Maritz JM, Marr LC, McCann A, Molshanski-Mor S, Monteiro S, Moreira-Grez B, Morris M, Mugisha L, Muniesa M, Neve H, Nguyen NP, Nigro OD, Nilsson AS, O'Connell T, Odeh R, Oliver A, Piuri M, Prussin Ii AJ, Qimron U, Quan ZX, Rainetova P, Ramírez-Rojas A, Raya R, Reasor K, Rice GAO, Rossi A, Santos R, Shimashita J, Stachler EN, Stene LC, Strain R, Stumpf R, Torres PJ, Twaddle A, Ugochi Ibekwe M, Villagra N, Wandro S, White B, Whiteley A, Whiteson KL, Wijmenga C, Zambrano MM, Zschach H, Dutilh BE. Global phylogeography and ancient evolution of the widespread human gut virus crAssphage. Nat Microbiol 2019; 4:1727-1736. [PMID: 31285584 DOI: 10.1101/527796] [Received: 10/14/2018] [Accepted: 05/22/2019]
Abstract
Microbiomes are vast communities of microorganisms and viruses that populate all natural ecosystems. Viruses have been considered to be the most variable component of microbiomes, as supported by virome surveys and examples of high genomic mosaicism. However, recent evidence suggests that the human gut virome is remarkably stable compared with that of other environments. Here, we investigate the origin, evolution and epidemiology of crAssphage, a widespread human gut virus. Through a global collaboration, we obtained DNA sequences of crAssphage from more than one-third of the world's countries and showed that the phylogeography of crAssphage is locally clustered within countries, cities and individuals. We also found fully colinear crAssphage-like genomes in both Old-World and New-World primates, suggesting that the association of crAssphage with primates may be millions of years old. Finally, by exploiting a large cohort of more than 1,000 individuals, we tested whether crAssphage is associated with bacterial taxonomic groups of the gut microbiome, diverse human health parameters and a wide range of dietary factors. We identified strong correlations with different clades of bacteria that are related to Bacteroidetes and weak associations with several diet categories, but no significant association with health or disease. We conclude that crAssphage is a benign cosmopolitan virus that may have coevolved with the human lineage and is an integral part of the normal human gut virome.
Collapse
Affiliation(s)
- Robert A Edwards
- Department of Biology, San Diego State University, San Diego, CA, USA.
- The Viral Information Institute, San Diego State University, San Diego, CA, USA.
| | - Alejandro A Vega
- Department of Biology, San Diego State University, San Diego, CA, USA
| | - Holly M Norman
- Department of Biology, San Diego State University, San Diego, CA, USA
| | - Maria Ohaeri
- Department of Biology, San Diego State University, San Diego, CA, USA
| | - Kyle Levi
- Department of Computer Science, San Diego State University, San Diego, CA, USA
| | | | - Ondrej Cinek
- Department of Pediatrics, 2nd Faculty of Medicine, Charles University in Prague, Prague, Czech Republic
| | - Ramy K Aziz
- Department of Microbiology and Immunology, Faculty of Pharmacy, Cairo University, Cairo, Egypt
| | - Katelyn McNair
- Computational Sciences Research Center, San Diego State University, San Diego, CA, USA
| | - Jeremy J Barr
- School of Biological Sciences, Monash University, Clayton, Victoria, Australia
| | - Kyle Bibby
- Civil and Environmental Engineering and Earth Sciences, University of Notre Dame, Notre Dame, IN, USA
| | - Stan J J Brouns
- Department of Bionanoscience, Kavli Institute of Nanoscience, Delft University of Technology, Delft, The Netherlands
| | - Adrian Cazares
- Institute of Infection and Global Health, University of Liverpool, Liverpool, UK
| | - Patrick A de Jonge
- Department of Bionanoscience, Kavli Institute of Nanoscience, Delft University of Technology, Delft, The Netherlands
- Theoretical Biology and Bioinformatics, Science4Life, Utrecht University, Utrecht, The Netherlands
| | - Christelle Desnues
- MEPHI, Aix-Marseille Université, IRD, AP-HM, CNRS, IHU Méditerranée Infection, Marseille, France
- Mediterranean Institute of Oceanography, Aix-Marseille Université, Université de Toulon, CNRS, IRD, UM 110, Marseille, France
| | - Samuel L Díaz Muñoz
- Center for Genomics and Systems Biology & Department of Biology, New York University, New York, NY, USA
- Department of Microbiology and Molecular Genetics, University of California, Davis, Davis, CA, USA
| | - Peter C Fineran
- Department of Microbiology and Immunology, University of Otago, Dunedin, New Zealand
| | - Alexander Kurilshikov
- Department of Genetics, University Medical Center Groningen, Groningen, The Netherlands
| | - Rob Lavigne
- Department of Biosystems, KU Leuven, Leuven, Belgium
| | - Karla Mazankova
- Department of Pediatrics, 2nd Faculty of Medicine, Charles University in Prague, Prague, Czech Republic
| | - David T McCarthy
- EPHM Lab, Civil Engineering Department, Monash University, Clayton, Victoria, Australia
| | - Franklin L Nobrega
- Department of Bionanoscience, Kavli Institute of Nanoscience, Delft University of Technology, Delft, The Netherlands
| | - Alejandro Reyes Muñoz
- Max Planck Tandem Group in Computational Biology, Departamento de Ciencias Biológicas, Universidad de los Andes, Bogotá, Colombia
| | - German Tapia
- Department of Child Health, Norwegian Institute of Public Health, Oslo, Norway
| | - Nicole Trefault
- GEMA Center for Genomics, Ecology & Environment, Universidad Mayor, Huechuraba, Chile
| | - Alexander V Tyakht
- Laboratory of Bioinformatics, Federal Research and Clinical Center of Physical-Chemical Medicine, Moscow, Russia
- Department of Informational Technologies, ITMO University, Saint Petersburg, Russia
| | - Pablo Vinuesa
- Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Mexico
| | - Alexandra Zhernakova
- Department of Genetics, University Medical Center Groningen, Groningen, The Netherlands
| | - Frank M Aarestrup
- National Food Institute, Research Group for Genomic Epidemiology, Technical University of Denmark, Kongens Lyngby, Denmark
| | - Abeer Alassaf
- Department of Pediatrics, School of Medicine, University of Jordan, Amman, Jordan
| | - Josefa Anton
- Department of Physiology, Genetics and Microbiology, University of Alicante, Alicante, Spain
| | - Abigail Asangba
- Carl R. Woese Institute of Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Emma K Billings
- Department of Biology, San Diego State University, San Diego, CA, USA
| | - Vito Adrian Cantu
- Computational Sciences Research Center, San Diego State University, San Diego, CA, USA
| | - Jane M Carlton
- Center for Genomics and Systems Biology & Department of Biology, New York University, New York, NY, USA
| | - Daniel Cazares
- Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Mexico
| | - Gyu-Sung Cho
- Department of Microbiology and Biotechnology, Max Rubner-Institut, Federal Research Institute of Nutrition and Food, Kiel, Germany
| | - Tess Condeff
- Department of Biology, San Diego State University, San Diego, CA, USA
| | - Pilar Cortés
- Departament de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Mike Cranfield
- Wildlife Health Center, University of California, Davis, Davis, CA, USA
| | - Daniel A Cuevas
- Computational Sciences Research Center, San Diego State University, San Diego, CA, USA
| | - Rodrigo De la Iglesia
- Departamento de Genética Molecular y Microbiología, Pontificia Universidad Católica de Chile, Santiago, Chile
| | - Przemyslaw Decewicz
- Department of Bacterial Genetics, Institute of Microbiology, Faculty of Biology, University of Warsaw, Warsaw, Poland
| | - Michael P Doane
- Department of Biology, San Diego State University, San Diego, CA, USA
| | - Lukasz Dziewit
- Department of Bacterial Genetics, Institute of Microbiology, Faculty of Biology, University of Warsaw, Warsaw, Poland
| | - Bashir Mukhtar Elwasila
- Department of Pediatrics and Child Health, Faculty of Medicine, University of Khartoum, Khartoum, Sudan
| | - A Murat Eren
- Department of Medicine, University of Chicago, Chicago, IL, USA
| | - Charles Franz
- Department of Microbiology and Biotechnology, Max Rubner-Institut, Federal Research Institute of Nutrition and Food, Kiel, Germany
| | - Jingyuan Fu
- Department of Pediatrics, University Medical Center Groningen, Groningen, The Netherlands
| | - Cristina Garcia-Aljaro
- Department of Genetics, Microbiology and Statistics, Universitat de Barcelona, Barcelona, Spain
| | - Elodie Ghedin
- Center for Genomics and Systems Biology & Department of Biology, New York University, New York, NY, USA
| | - Kristen M Gulino
- Center for Genomics and Systems Biology & Department of Biology, New York University, New York, NY, USA
| | - John M Haggerty
- Department of Biology, San Diego State University, San Diego, CA, USA
| | - Steven R Head
- Next Generation Sequencing and Microarray Core Facility, The Scripps Research Institute, La Jolla, CA, USA
| | - Rene S Hendriksen
- National Food Institute, Research Group for Genomic Epidemiology, Technical University of Denmark, Kongens Lyngby, Denmark
| | - Colin Hill
- School of Microbiology, University College Cork, Cork, Ireland
| | - Heikki Hyöty
- Department of Virology, School of Medicine, University of Tampere, Tampere, Finland
| | - Elena N Ilina
- Department of Molecular Biology and Genetics, Federal Research and Clinical Center of Physical-Chemical Medicine, Moscow, Russia
| | - Mitchell T Irwin
- Department of Anthropology, Northern Illinois University, DeKalb, IL, USA
| | - Thomas C Jeffries
- School of Science and Health, Western Sydney University, Penrith, New South Wales, Australia
| | - Juan Jofre
- Department of Genetics, Microbiology and Statistics, Universitat de Barcelona, Barcelona, Spain
| | - Randall E Junge
- Department of Animal Health, Columbus Zoo and Aquarium, Powell, OH, USA
| | - Scott T Kelley
- Department of Biology, San Diego State University, San Diego, CA, USA
| | - Martin Kowalewski
- Department Estacion Biologica Corrientes, Institution Museo Arg. Cs. Naturales-CONICET, Corrientes, Argentina
| | - Deepak Kumaresan
- UWA School of Agriculture and Environment, University of Western Australia, Perth, Western Australia, Australia
| | - Steven R Leigh
- Department of Anthropology, University of Colorado, Boulder, CO, USA
| | - David Lipson
- Department of Biology, San Diego State University, San Diego, CA, USA
| | - Montserrat Llagostera
- Departament de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Julia M Maritz
- Center for Genomics and Systems Biology & Department of Biology, New York University, New York, NY, USA
| | - Linsey C Marr
- Department of Civil and Environmental Engineering, Virginia Tech, Blacksburg, VA, USA
| | - Angela McCann
- APC Microbiome Institute, University College Cork, Cork, Ireland
| | - Shahar Molshanski-Mor
- Clinical Microbiology & Immunology, Sackler School of Medicine, Tel Aviv University, Tel Aviv, Israel
| | - Silvia Monteiro
- Laboratorio de Analises, Instituto Superior Tecnico, Universidade Lisboa, Lisboa, Portugal
| | - Benjamin Moreira-Grez
- UWA School of Agriculture and Environment, University of Western Australia, Perth, Western Australia, Australia
| | - Megan Morris
- Department of Biology, San Diego State University, San Diego, CA, USA
| | - Lawrence Mugisha
- CEHA, Kampala, Uganda
- COVAB, Makerere University, Kampala, Uganda
| | - Maite Muniesa
- Department of Genetics, Microbiology and Statistics, Universitat de Barcelona, Barcelona, Spain
| | - Horst Neve
- Department of Microbiology and Biotechnology, Max Rubner-Institut, Federal Research Institute of Nutrition and Food, Kiel, Germany
| | - Nam-Phuong Nguyen
- Computer Science and Engineering, University of California, San Diego, La Jolla, CA, USA
| | - Olivia D Nigro
- College of Natural and Computational Sciences, Hawai'i Pacific University, Kaneohe, HI, USA
| | - Anders S Nilsson
- Department of Molecular Biosciences, Stockholm University, Stockholm, Sweden
| | - Taylor O'Connell
- Biological and Medical Informatics Program, San Diego State University, San Diego, CA, USA
| | - Rasha Odeh
- Department of Pediatrics, School of Medicine, University of Jordan, Amman, Jordan
| | - Andrew Oliver
- Department of Molecular Biology & Biochemistry, University of California, Irvine, Irvine, CA, USA
| | - Mariana Piuri
- Departamento de Química Biológica, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Buenos Aires, Argentina
| | - Aaron J Prussin II
- Department of Civil and Environmental Engineering, Virginia Tech, Blacksburg, VA, USA
| | - Udi Qimron
- Department of Clinical Microbiology and Immunology, Sackler School of Medicine, Tel Aviv University, Tel Aviv, Israel
| | - Zhe-Xue Quan
- Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, Fudan University, Shanghai, China
| | - Petra Rainetova
- Centre of Epidemiology and Microbiology, National Institute of Public Health, Prague, Czech Republic
| | - Kim Reasor
- Department of Biology, San Diego State University, San Diego, CA, USA
| | - Alessandro Rossi
- Theoretical Biology and Bioinformatics, Science4Life, Utrecht University, Utrecht, The Netherlands
- Department of Biology, University of Padova, Padova, Italy
| | - Ricardo Santos
- Laboratorio de Analises, Instituto Superior Tecnico, Universidade Lisboa, Lisboa, Portugal
| | - John Shimashita
- Department of Civil and Environmental Engineering, Virginia Tech, Blacksburg, VA, USA
| | - Elyse N Stachler
- Swanson School of Engineering, University of Pittsburgh, Pittsburgh, PA, USA
| | - Lars C Stene
- Department of Child Health, Norwegian Institute of Public Health, Oslo, Norway
| | - Ronan Strain
- APC Microbiome Institute, University College Cork, Cork, Ireland
| | - Rebecca Stumpf
- Carl R. Woese Institute of Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Pedro J Torres
- Department of Biology, San Diego State University, San Diego, CA, USA
| | - Alan Twaddle
- Center for Genomics and Systems Biology & Department of Biology, New York University, New York, NY, USA
| | - MaryAnn Ugochi Ibekwe
- Department of Pediatrics, Federal Teaching Hospital Abakaliki, Ebonyi State University, Abakaliki, Nigeria
| | - Nicolás Villagra
- Escuela de Tecnología Médica, Universidad Andres Bello, Santiago, Chile
| | - Stephen Wandro
- Department of Molecular Biology & Biochemistry, University of California, Irvine, Irvine, CA, USA
| | - Bryan White
- Carl R. Woese Institute of Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Andy Whiteley
- UWA School of Agriculture and Environment, University of Western Australia, Perth, Western Australia, Australia
| | - Katrine L Whiteson
- Department of Molecular Biology & Biochemistry, University of California, Irvine, Irvine, CA, USA
| | - Cisca Wijmenga
- Department of Genetics, University Medical Center Groningen, Groningen, The Netherlands
| | - Henrike Zschach
- The Bioinformatics Centre, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Bas E Dutilh
- Theoretical Biology and Bioinformatics, Science4Life, Utrecht University, Utrecht, The Netherlands
- Centre for Molecular and Biomolecular Informatics, Radboud Institute for Molecular Life Sciences, Radboud University Medical Centre, Nijmegen, The Netherlands
|
6
|
Villarroel J, Kleinheinz KA, Jurtz VI, Zschach H, Lund O, Nielsen M, Larsen MV. HostPhinder: A Phage Host Prediction Tool. Viruses 2016; 8:E116. [PMID: 27153081 PMCID: PMC4885074 DOI: 10.3390/v8050116] [Citation(s) in RCA: 78] [Impact Index Per Article: 9.8] [Received: 12/23/2015] [Revised: 04/14/2016] [Accepted: 04/19/2016] [Indexed: 01/11/2023]
Abstract
The current dramatic increase of antibiotic-resistant bacteria has revitalised interest in bacteriophages as an alternative antibacterial treatment. Meanwhile, the development of bioinformatics methods for analysing genomic data places high-throughput approaches for phage characterization within reach. Here, we present HostPhinder, a tool aimed at predicting the bacterial host of phages by examining the phage genome sequence. Using a reference database of 2196 phages with known hosts, HostPhinder predicts the host species of a query phage as the host of the most genomically similar reference phages. Genomic similarity is measured as the number of co-occurring k-mers (DNA sequences of length k). On an independent evaluation set, HostPhinder correctly predicted host genus and species for 81% and 74% of the phages, respectively, giving predictions for more phages than BLAST and significantly outperforming BLAST on phages for which both had predictions. HostPhinder predictions on phage draft genomes from the INTESTI phage cocktail corresponded well with the advertised targets of the cocktail. Our study indicates that, for most phages, genomic similarity correlates well with related bacterial hosts. HostPhinder is available as an interactive web service [1] and as a standalone download from the Docker registry [2].
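The nearest-reference idea in the abstract can be illustrated with a minimal Python sketch. This is not HostPhinder's actual implementation: the function names (`kmers`, `predict_host`), the toy sequences, and the choice of k = 4 are assumptions made for illustration only. The abstract does not state the value of k or how several similar reference phages are combined, so the sketch simply takes the single reference that shares the most distinct k-mers with the query.

```python
def kmers(seq, k):
    """Return the set of distinct k-mers (substrings of length k) in seq."""
    seq = seq.upper()
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def predict_host(query, references, k=4):
    """Predict the host of `query` as the host of the reference phage
    that shares the most distinct k-mers with it.

    `references` is a list of (genome_sequence, host_name) pairs.
    Returns (host_name, shared_kmer_count), or (None, 0) when no
    reference shares any k-mer with the query.
    """
    query_kmers = kmers(query, k)
    best_host, best_shared = None, 0
    for genome, host in references:
        # Similarity = number of k-mers present in both genomes.
        shared = len(query_kmers & kmers(genome, k))
        if shared > best_shared:
            best_host, best_shared = host, shared
    return best_host, best_shared


# Toy data invented for illustration; not real phage genomes.
toy_references = [
    ("ATGCGTACGTTAGC", "Escherichia coli"),
    ("GGGCCCTTTAAAGG", "Staphylococcus aureus"),
]
toy_query = "TTATGCGTACGTTA"

host, shared = predict_host(toy_query, toy_references)
print(host, shared)
```

A real tool would use a much larger k and an indexed database rather than recomputing k-mer sets per comparison; the sketch only shows the shape of the similarity measure.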
Affiliation(s)
- Julia Villarroel
- Center for Biological Sequence Analysis, Department of Systems Biology, Technical University of Denmark, 2800 Kgs. Lyngby, Denmark.
| | - Kortine Annina Kleinheinz
- Center for Biological Sequence Analysis, Department of Systems Biology, Technical University of Denmark, 2800 Kgs. Lyngby, Denmark.
| | - Vanessa Isabell Jurtz
- Center for Biological Sequence Analysis, Department of Systems Biology, Technical University of Denmark, 2800 Kgs. Lyngby, Denmark.
| | - Henrike Zschach
- Center for Biological Sequence Analysis, Department of Systems Biology, Technical University of Denmark, 2800 Kgs. Lyngby, Denmark.
| | - Ole Lund
- Center for Biological Sequence Analysis, Department of Systems Biology, Technical University of Denmark, 2800 Kgs. Lyngby, Denmark.
| | - Morten Nielsen
- Center for Biological Sequence Analysis, Department of Systems Biology, Technical University of Denmark, 2800 Kgs. Lyngby, Denmark.
- Instituto de Investigaciones Biotecnológicas, Universidad de San Martín, CP(1650) San Martín, Prov. de Buenos Aires, Argentina.
| | - Mette Voldby Larsen
- Center for Biological Sequence Analysis, Department of Systems Biology, Technical University of Denmark, 2800 Kgs. Lyngby, Denmark.
|