1
|
Magee AF, Holbrook AJ, Pekar JE, Caviedes-Solis IW, Matsen Iv FA, Baele G, Wertheim JO, Ji X, Lemey P, Suchard MA. Random-effects substitution models for phylogenetics via scalable gradient approximations. Syst Biol 2024:syae019. [PMID: 38712512 DOI: 10.1093/sysbio/syae019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2023] [Indexed: 05/08/2024] Open
Abstract
Phylogenetic and discrete-trait evolutionary inference depend heavily on an appropriate characterization of the underlying character substitution process. In this paper, we present random-effects substitution models that extend common continuous-time Markov chain models into a richer class of processes capable of capturing a wider variety of substitution dynamics. As these random-effects substitution models often require many more parameters than their usual counterparts, inference can be both statistically and computationally challenging. Thus, we also propose an efficient approach to compute an approximation to the gradient of the data likelihood with respect to all unknown substitution model parameters. We demonstrate that this approximate gradient enables scaling of sampling-based inference, namely Bayesian inference via Hamiltonian Monte Carlo, under random-effects substitution models across large trees and state-spaces. Applied to a dataset of 583 SARS-CoV-2 sequences, an HKY model with random-effects shows strong signals of nonreversibility in the substitution process, and posterior predictive model checks clearly show that it is a more adequate model than a reversible model. When analyzing the pattern of phylogeographic spread of 1441 influenza A virus (H3N2) sequences between 14 regions, a random-effects phylogeographic substitution model infers that air travel volume adequately predicts almost all dispersal rates. A random-effects state-dependent substitution model reveals no evidence for an effect of arboreality on the swimming mode in the tree frog subfamily Hylinae. Simulations reveal that random-effects substitution models can accommodate both negligible and radical departures from the underlying base substitution model. We show that our gradient-based inference approach is over an order of magnitude more time efficient than conventional approaches.
Collapse
Affiliation(s)
- Andrew F Magee
- Department of Biostatistics, Jonathan and Karin Fielding School of Public Health, University of California Los Angeles, Los Angeles, CA, USA
| | - Andrew J Holbrook
- Department of Biostatistics, Jonathan and Karin Fielding School of Public Health, University of California Los Angeles, Los Angeles, CA, USA
| | - Jonathan E Pekar
- Bioinformatics and Systems Biology Graduate Program, University of California San Diego, La Jolla, CA, USA
- Department of Biomedical Informatics, University of California San Diega, La Jolla, CA, USA
| | | | - Fredrick A Matsen Iv
- Howard Hughes Medical Institute, Seattle, Washington, USA
- Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
- Department of Genome Sciences, University of Washington, Seattle, Washington, USA
- Department of Statistics, University of Washington, Seattle, Washington, USA
| | - Guy Baele
- Department of Microbiology, Immunology and Transplantation, Rega Institute, KU Leuven, Leuven, Belgium
| | - Joel O Wertheim
- Department of Medicine, University of California San Diego, La Jolla, CA, USA
| | - Xiang Ji
- Department of Mathematics, Tulane University, New Orleans, LA, USA
| | - Philippe Lemey
- Department of Microbiology, Immunology and Transplantation, Rega Institute, KU Leuven, Leuven, Belgium
| | - Marc A Suchard
- Department of Biostatistics, Jonathan and Karin Fielding School of Public Health, University of California Los Angeles, Los Angeles, CA, USA
- Department of Biomathematics, David Geffen School of Medicine at UCLA, University of California Los Angeles, Los Angeles, CA, USA
- Department of Human Genetics, David Geffen School of Medicine at UCLA, Universtiy of California Los Angeles, Los Angeles, CA, USA
| |
Collapse
|
2
|
Magee AF, Holbrook AJ, Pekar JE, Caviedes-Solis IW, Iv FAM, Baele G, Wertheim JO, Ji X, Lemey P, Suchard MA. Random-effects substitution models for phylogenetics via scalable gradient approximations. ArXiv 2023:arXiv:2303.13642v2. [PMID: 36994154 PMCID: PMC10055484] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Subscribe] [Scholar Register] [Indexed: 03/31/2023]
Abstract
Phylogenetic and discrete-trait evolutionary inference depend heavily on an appropriate characterization of the underlying character substitution process. In this paper, we present random-effects substitution models that extend common continuous-time Markov chain models into a richer class of processes capable of capturing a wider variety of substitution dynamics. As these random-effects substitution models often require many more parameters than their usual counterparts, inference can be both statistically and computationally challenging. Thus, we also propose an efficient approach to compute an approximation to the gradient of the data likelihood with respect to all unknown substitution model parameters. We demonstrate that this approximate gradient enables scaling of sampling-based inference, namely Bayesian inference via Hamiltonian Monte Carlo, under random-effects substitution models across large trees and state-spaces. Applied to a dataset of 583 SARS-CoV-2 sequences, an HKY model with random-effects shows strong signals of nonreversibility in the substitution process, and posterior predictive model checks clearly show that it is a more adequate model than a reversible model. When analyzing the pattern of phylogeographic spread of 1441 influenza A virus (H3N2) sequences between 14 regions, a random-effects phylogeographic substitution model infers that air travel volume adequately predicts almost all dispersal rates. A random-effects state-dependent substitution model reveals no evidence for an effect of arboreality on the swimming mode in the tree frog subfamily Hylinae. Simulations reveal that random-effects substitution models can accommodate both negligible and radical departures from the underlying base substitution model. We show that our gradient-based inference approach is over an order of magnitude more time efficient than conventional approaches.
Collapse
|
3
|
Crits-Christoph A, Levy JI, Pekar JE, Goldstein SA, Singh R, Hensel Z, Gangavarapu K, Rogers MB, Moshiri N, Garry RF, Holmes EC, Koopmans MPG, Lemey P, Popescu S, Rambaut A, Robertson DL, Suchard MA, Wertheim JO, Rasmussen AL, Andersen KG, Worobey M, Débarre F. Genetic tracing of market wildlife and viruses at the epicenter of the COVID-19 pandemic. bioRxiv 2023:2023.09.13.557637. [PMID: 37745602 PMCID: PMC10515900 DOI: 10.1101/2023.09.13.557637] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/26/2023]
Abstract
Zoonotic spillovers of viruses have occurred through the animal trade worldwide. The start of the COVID-19 pandemic was traced epidemiologically to the Huanan Wholesale Seafood Market, the site with the most reported wildlife vendors in the city of Wuhan, China. Here, we analyze publicly available qPCR and sequencing data from environmental samples collected in the Huanan market in early 2020. We demonstrate that the SARS-CoV-2 genetic diversity linked to this market is consistent with market emergence, and find increased SARS-CoV-2 positivity near and within a particular wildlife stall. We identify wildlife DNA in all SARS-CoV-2 positive samples from this stall. This includes species such as civets, bamboo rats, porcupines, hedgehogs, and one species, raccoon dogs, known to be capable of SARS-CoV-2 transmission. We also detect other animal viruses that infect raccoon dogs, civets, and bamboo rats. Combining metagenomic and phylogenetic approaches, we recover genotypes of market animals and compare them to those from other markets. This analysis provides the genetic basis for a short list of potential intermediate hosts of SARS-CoV-2 to prioritize for retrospective serological testing and viral sampling.
Collapse
Affiliation(s)
| | - Joshua I. Levy
- Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA 92037, USA
| | - Jonathan E. Pekar
- Department of Biomedical Informatics, University of California San Diego, La Jolla, CA, USA
| | - Stephen A. Goldstein
- Department of Human Genetics, University of Utah School of Medicine, Salt Lake City, UT, USA
| | - Reema Singh
- Vaccine and Infectious Disease Organization, University of Saskatchewan, Saskatoon, SK, Canada
| | - Zach Hensel
- ITQB NOVA, Universidade NOVA de Lisboa, Lisbon, Av. da Republica, 2780-157, Oeiras, Portugal
| | - Karthik Gangavarapu
- Department of Human Genetics, University of California, Los Angeles, Los Angeles, CA 90024, USA
| | - Matthew B. Rogers
- Vaccine and Infectious Disease Organization, University of Saskatchewan, Saskatoon, SK, Canada
| | - Niema Moshiri
- Department of Computer Science & Engineering, University of California San Diego, La Jolla, CA, USA
| | - Robert F. Garry
- Tulane University, School of Medicine, Department of Microbiology and Immunology, New Orleans, LA 70112, USA; Zalgen Labs, Frederick, MD 21703, USA; Global Virus Network (GVN), Baltimore, MD 21201, USA
| | - Edward C. Holmes
- Sydney Institute for Infectious Diseases, School of Medical Sciences, The University of Sydney, Sydney, NSW 2006, Australia
| | - Marion P. G. Koopmans
- Department of Viroscience, and Pandemic and Disaster Preparedness Centre., Erasmus Medical Centre, Rotterdam, The Netherlands
| | - Philippe Lemey
- Department of Microbiology, Immunology and Transplantation, Rega Institute, KU Leuven, Leuven, Belgium
| | - Saskia Popescu
- University of Maryland, School of Medicine, Department of Epidemiology & Public Health, Baltimore, MD 21201, USA
| | - Andrew Rambaut
- Institute of Ecology and Evolution, University of Edinburgh, Edinburgh, UK
| | - David L. Robertson
- MRC-University of Glasgow Center for Virus Research, Glasgow, G61 1QH, UK
| | - Marc A. Suchard
- Department of Biostatistics, University of California, Los Angeles, Los Angeles, CA 90024, USA
| | - Joel O. Wertheim
- Department of Medicine, University of California San Diego, La Jolla, CA, USA
| | - Angela L. Rasmussen
- Vaccine and Infectious Disease Organization, University of Saskatchewan, Saskatoon, SK, Canada
| | - Kristian G. Andersen
- Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA 92037, USA
| | - Michael Worobey
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ, USA
| | - Florence Débarre
- Institut d’Écologie et des Sciences de l’Environnement (IEES-Paris, UMR 7618), CNRS, Sorbonne Université, UPEC, IRD, INRAE, Paris, France
| |
Collapse
|
4
|
Pekar JE, Lytras S, Ghafari M, Magee AF, Parker E, Havens JL, Katzourakis A, Vasylyeva TI, Suchard MA, Hughes AC, Hughes J, Robertson DL, Dellicour S, Worobey M, Wertheim JO, Lemey P. The recency and geographical origins of the bat viruses ancestral to SARS-CoV and SARS-CoV-2. bioRxiv 2023:2023.07.12.548617. [PMID: 37502985 PMCID: PMC10369958 DOI: 10.1101/2023.07.12.548617] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/29/2023]
Abstract
The emergence of SARS-CoV in 2002 and SARS-CoV-2 in 2019 has led to increased sampling of related sarbecoviruses circulating primarily in horseshoe bats. These viruses undergo frequent recombination and exhibit spatial structuring across Asia. Employing recombination-aware phylogenetic inference on bat sarbecoviruses, we find that the closest-inferred bat virus ancestors of SARS-CoV and SARS-CoV-2 existed just ~1-3 years prior to their emergence in humans. Phylogeographic analyses examining the movement of related sarbecoviruses demonstrate that they traveled at similar rates to their horseshoe bat hosts and have been circulating for thousands of years in Asia. The closest-inferred bat virus ancestor of SARS-CoV likely circulated in western China, and that of SARS-CoV-2 likely circulated in a region comprising southwest China and northern Laos, both a substantial distance from where they emerged. This distance and recency indicate that the direct ancestors of SARS-CoV and SARS-CoV-2 could not have reached their respective sites of emergence via the bat reservoir alone. Our recombination-aware dating and phylogeographic analyses reveal a more accurate inference of evolutionary history than performing only whole-genome or single gene analyses. These results can guide future sampling efforts and demonstrate that viral genomic fragments extremely closely related to SARS-CoV and SARS-CoV-2 were circulating in horseshoe bats, confirming their importance as the reservoir species for SARS viruses.
Collapse
Affiliation(s)
- Jonathan E Pekar
- Bioinformatics and Systems Biology Graduate Program, University of California San Diego, La Jolla, CA 92093, USA
- Department of Biomedical Informatics, University of California San Diego, La Jolla, CA 92093, USA
- These authors contributed equally
| | - Spyros Lytras
- Medical Research Council-University of Glasgow Centre for Virus Research, Glasgow, UK
- These authors contributed equally
| | - Mahan Ghafari
- Department of Biology, University of Oxford, Oxford, UK
| | - Andrew F Magee
- Department of Human Genetics, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA 90095, USA
| | - Edyth Parker
- Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA 92037, USA
| | - Jennifer L Havens
- Bioinformatics and Systems Biology Graduate Program, University of California San Diego, La Jolla, CA 92093, USA
| | | | - Tetyana I Vasylyeva
- Department of Medicine, University of California San Diego, La Jolla, CA 92093, USA
| | - Marc A Suchard
- Department of Human Genetics, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA 90095, USA
- Department of Biostatistics, Fielding School of Public Health, University of California Los Angeles, Los Angeles, CA 90095, USA
- Department of Computational Medicine, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA 90095, USA
| | - Alice C Hughes
- School of Biological Sciences, University of Hong Kong, Hong Kong
- China Biodiversity Green Development Foundation, Beijing, China
| | - Joseph Hughes
- Medical Research Council-University of Glasgow Centre for Virus Research, Glasgow, UK
| | - David L Robertson
- Medical Research Council-University of Glasgow Centre for Virus Research, Glasgow, UK
- These authors jointly supervised the work
| | - Simon Dellicour
- Spatial Epidemiology Lab (SpELL), Université Libre de Bruxelles, CP160/12, 50 av. FD Roosevelt, 1050, Bruxelles, Belgium
- Department of Microbiology, Immunology and Transplantation, Rega Institute, Laboratory for Clinical and Epidemiological Virology, KU Leuven, Leuven, Belgium
- These authors jointly supervised the work
| | - Michael Worobey
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85721, USA
- These authors jointly supervised the work
| | - Joel O Wertheim
- Department of Medicine, University of California San Diego, La Jolla, CA 92093, USA
- These authors jointly supervised the work
| | - Philippe Lemey
- Department of Microbiology, Immunology and Transplantation, Rega Institute, Laboratory for Clinical and Epidemiological Virology, KU Leuven, Leuven, Belgium
- These authors jointly supervised the work
| |
Collapse
|
5
|
Worobey M, Levy JI, Serrano LM, Crits-Christoph A, Pekar JE, Goldstein SA, Rasmussen AL, Kraemer MUG, Newman C, Koopmans MPG, Suchard MA, Wertheim JO, Lemey P, Robertson DL, Garry RF, Holmes EC, Rambaut A, Andersen KG. The Huanan Seafood Wholesale Market in Wuhan was the early epicenter of the COVID-19 pandemic. Science 2022; 377:951-959. [PMID: 35881010 PMCID: PMC9348750 DOI: 10.1126/science.abp8715] [Citation(s) in RCA: 125] [Impact Index Per Article: 62.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2022] [Accepted: 07/18/2022] [Indexed: 12/25/2022]
Abstract
Understanding how severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) emerged in 2019 is critical to preventing future zoonotic outbreaks before they become the next pandemic. The Huanan Seafood Wholesale Market in Wuhan, China, was identified as a likely source of cases in early reports, but later this conclusion became controversial. We show here that the earliest known COVID-19 cases from December 2019, including those without reported direct links, were geographically centered on this market. We report that live SARS-CoV-2-susceptible mammals were sold at the market in late 2019 and that within the market, SARS-CoV-2-positive environmental samples were spatially associated with vendors selling live mammals. Although there is insufficient evidence to define upstream events, and exact circumstances remain obscure, our analyses indicate that the emergence of SARS-CoV-2 occurred through the live wildlife trade in China and show that the Huanan market was the epicenter of the COVID-19 pandemic.
Collapse
Affiliation(s)
- Michael Worobey
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85721, USA
| | - Joshua I. Levy
- Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA 92037, USA
| | - Lorena Malpica Serrano
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85721, USA
| | - Alexander Crits-Christoph
- W. Harry Feinstone Department of Molecular Microbiology and Immunology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD 21205, USA
| | - Jonathan E. Pekar
- Bioinformatics and Systems Biology Graduate Program, University of California San Diego, La Jolla, CA 92093, USA
- Department of Biomedical Informatics, University of California San Diego, La Jolla, CA 92093, USA
| | - Stephen A. Goldstein
- Department of Human Genetics, University of Utah School of Medicine, Salt Lake City, UT 84112, USA
| | - Angela L. Rasmussen
- Vaccine and Infectious Disease Organization, University of Saskatchewan, Saskatoon SK S7N 5E3, Canada
- Center for Global Health Science and Security, Georgetown University, Washington, DC 20057, USA
| | | | - Chris Newman
- Wildlife Conservation Research Unit, Department of Zoology, The Recanati-Kaplan Centre, University of Oxford, Oxford OX13 5QL, UK
| | - Marion P. G. Koopmans
- Pandemic and Disaster Preparedness Centre, Erasmus University Medical Center, 3015 CE Rotterdam, Netherlands
- Department of Viroscience, Erasmus University Medical Center, 3015 CE Rotterdam, Netherlands
| | - Marc A. Suchard
- Department of Biostatistics, Fielding School of Public Health, University of California Los Angeles, Los Angeles, CA 90095, USA
- Department of Human Genetics, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA 90095, USA
- Department of Computational Medicine, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA 90095, USA
| | - Joel O. Wertheim
- Department of Medicine, University of California San Diego, La Jolla, CA 92093, USA
| | - Philippe Lemey
- Department of Microbiology, Immunology and Transplantation, Rega Institute for Medical Research, KU Leuven, 3000 Leuven, Belgium
- Global Virus Network (GVN), Baltimore, MD 21201, USA
| | - David L. Robertson
- MRC-University of Glasgow Center for Virus Research, Glasgow G61 1QH, UK
| | - Robert F. Garry
- Global Virus Network (GVN), Baltimore, MD 21201, USA
- Tulane University, School of Medicine, Department of Microbiology and Immunology, New Orleans, LA 70112, USA
- Zalgen Labs, Frederick, MD 21703, USA
| | - Edward C. Holmes
- Sydney Institute for Infectious Diseases, School of Life and Environmental Sciences and School of Medical Sciences, The University of Sydney, Sydney, New South Wales 2006, Australia
| | - Andrew Rambaut
- Institute of Evolutionary Biology, University of Edinburgh, Edinburgh EH9 3FL, UK
| | - Kristian G. Andersen
- Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA 92037, USA
- Scripps Research Translational Institute, La Jolla, CA 92037, USA
| |
Collapse
|
6
|
Pekar JE, Magee A, Parker E, Moshiri N, Izhikevich K, Havens JL, Gangavarapu K, Malpica Serrano LM, Crits-Christoph A, Matteson NL, Zeller M, Levy JI, Wang JC, Hughes S, Lee J, Park H, Park MS, Ching KZY, Lin RTP, Mat Isa MN, Noor YM, Vasylyeva TI, Garry RF, Holmes EC, Rambaut A, Suchard MA, Andersen KG, Worobey M, Wertheim JO. The molecular epidemiology of multiple zoonotic origins of SARS-CoV-2. Science 2022; 377:960-966. [PMID: 35881005 PMCID: PMC9348752 DOI: 10.1126/science.abp8337] [Citation(s) in RCA: 73] [Impact Index Per Article: 36.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2022] [Accepted: 07/18/2022] [Indexed: 01/08/2023]
Abstract
Understanding the circumstances that lead to pandemics is important for their prevention. We analyzed the genomic diversity of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) early in the coronavirus disease 2019 (COVID-19) pandemic. We show that SARS-CoV-2 genomic diversity before February 2020 likely comprised only two distinct viral lineages, denoted "A" and "B." Phylodynamic rooting methods, coupled with epidemic simulations, reveal that these lineages were the result of at least two separate cross-species transmission events into humans. The first zoonotic transmission likely involved lineage B viruses around 18 November 2019 (23 October to 8 December), and the separate introduction of lineage A likely occurred within weeks of this event. These findings indicate that it is unlikely that SARS-CoV-2 circulated widely in humans before November 2019 and define the narrow window between when SARS-CoV-2 first jumped into humans and when the first cases of COVID-19 were reported. As with other coronaviruses, SARS-CoV-2 emergence likely resulted from multiple zoonotic events.
Collapse
Affiliation(s)
- Jonathan E. Pekar
- Bioinformatics and Systems Biology Graduate Program, University of California San Diego, La Jolla, CA 92093, USA
- Department of Biomedical Informatics, University of California San Diego, La Jolla, CA 92093, USA
| | - Andrew Magee
- Department of Human Genetics, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA 90095, USA
| | - Edyth Parker
- Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA 92037, USA
| | - Niema Moshiri
- Department of Computer Science and Engineering, University of California San Diego, La Jolla, CA 92093, USA
| | - Katherine Izhikevich
- Department of Computer Science and Engineering, University of California San Diego, La Jolla, CA 92093, USA
- Department of Mathematics, University of California San Diego, La Jolla, CA 92093, USA
| | - Jennifer L. Havens
- Bioinformatics and Systems Biology Graduate Program, University of California San Diego, La Jolla, CA 92093, USA
| | - Karthik Gangavarapu
- Department of Human Genetics, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA 90095, USA
| | | | - Alexander Crits-Christoph
- W. Harry Feinstone Department of Molecular Microbiology and Immunology, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland 21205, USA
| | - Nathaniel L. Matteson
- Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA 92037, USA
| | - Mark Zeller
- Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA 92037, USA
| | - Joshua I. Levy
- Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA 92037, USA
| | - Jade C. Wang
- New York City Public Health Laboratory, New York City Department of Health and Mental Hygiene, New York, NY 11101, USA
| | - Scott Hughes
- New York City Public Health Laboratory, New York City Department of Health and Mental Hygiene, New York, NY 11101, USA
| | - Jungmin Lee
- Department of Microbiology, Institute for Viral Diseases, Biosafety Center, College of Medicine, Korea University, Seoul, South Korea
| | - Heedo Park
- Department of Microbiology, Institute for Viral Diseases, Biosafety Center, College of Medicine, Korea University, Seoul, South Korea
- BK21 Graduate Program, Department of Biomedical Sciences, Korea University College of Medicine, Seoul, 02841, Republic of Korea
| | - Man-Seong Park
- Department of Microbiology, Institute for Viral Diseases, Biosafety Center, College of Medicine, Korea University, Seoul, South Korea
- BK21 Graduate Program, Department of Biomedical Sciences, Korea University College of Medicine, Seoul, 02841, Republic of Korea
| | | | - Raymond Tzer Pin Lin
- National Public Health Laboratory, National Centre for Infectious Diseases, Singapore
| | - Mohd Noor Mat Isa
- Malaysia Genome and Vaccine Institute, Jalan Bangi, 43000 Kajang, Selangor, Malaysia
| | - Yusuf Muhammad Noor
- Malaysia Genome and Vaccine Institute, Jalan Bangi, 43000 Kajang, Selangor, Malaysia
| | - Tetyana I. Vasylyeva
- Department of Medicine, University of California San Diego, La Jolla, CA 92093, USA
| | - Robert F. Garry
- Tulane University, School of Medicine, Department of Microbiology and Immunology, New Orleans, LA 70112, USA
- Zalgen Labs, LCC, Frederick, MD 21703 USA
- Global Virus Network (GVN), Baltimore, MD 21201, USA
| | - Edward C. Holmes
- Sydney Institute for Infectious Diseases, School of Life and Environmental Sciences and School of Medical Sciences, The University of Sydney, Sydney, NSW 2006, Australia
| | - Andrew Rambaut
- Institute of Evolutionary Biology, University of Edinburgh, King's Buildings, Edinburgh, EH9 3FL, UK
| | - Marc A. Suchard
- Department of Human Genetics, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA 90095, USA
- Department of Biomathematics, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA 90095, USA
- Department of Biostatistics, Fielding School of Public Health, University of California Los Angeles, Los Angeles, CA 90095, USA
| | - Kristian G. Andersen
- Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA 92037, USA
- Scripps Research Translational Institute, La Jolla, CA 92037, USA
| | - Michael Worobey
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85721, USA
| | - Joel O. Wertheim
- Department of Medicine, University of California San Diego, La Jolla, CA 92093, USA
| |
Collapse
|
7
|
Pekar JE, Phaneuf P, Szubin R, Palsson B, Feist A, Monk JM. Gapless, Unambiguous Genome Sequence for Escherichia coli C, a Workhorse of Industrial Biology. Microbiol Resour Announc 2018; 7:e00890-18. [PMID: 30533692 PMCID: PMC6256561 DOI: 10.1128/mra.00890-18] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2018] [Accepted: 09/06/2018] [Indexed: 11/20/2022] Open
Abstract
Escherichia coli C is a commonly used strain in the bioprocessing industry, but despite its utility, the publicly available sequence of the E. coli C genome has gaps and 4,180 ambiguous base calls. Here, we present an updated, high-quality, unambiguous genome sequence with no assembly gaps.
Collapse
Affiliation(s)
- Jonathan E. Pekar
- Bioengineering Department, University of California San Diego, La Jolla, California, USA
| | - Patrick Phaneuf
- Bioengineering Department, University of California San Diego, La Jolla, California, USA
| | - Richard Szubin
- Bioengineering Department, University of California San Diego, La Jolla, California, USA
| | - Bernhard Palsson
- Bioengineering Department, University of California San Diego, La Jolla, California, USA
| | - Adam Feist
- Bioengineering Department, University of California San Diego, La Jolla, California, USA
| | - Jonathan M. Monk
- Bioengineering Department, University of California San Diego, La Jolla, California, USA
| |
Collapse
|