1
|
Kalantar KL, Carvalho T, de Bourcy CFA, Dimitrov B, Dingle G, Egger R, Han J, Holmes OB, Juan YF, King R, Kislyuk A, Lin MF, Mariano M, Morse T, Reynoso LV, Cruz DR, Sheu J, Tang J, Wang J, Zhang MA, Zhong E, Ahyong V, Lay S, Chea S, Bohl JA, Manning JE, Tato CM, DeRisi JL. IDseq-An open source cloud-based pipeline and analysis service for metagenomic pathogen detection and monitoring. Gigascience 2020; 9:giaa111. [PMID: 33057676 PMCID: PMC7566497 DOI: 10.1093/gigascience/giaa111] [Citation(s) in RCA: 152] [Impact Index Per Article: 38.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2020] [Revised: 08/28/2020] [Accepted: 09/22/2020] [Indexed: 12/15/2022] Open
Abstract
BACKGROUND Metagenomic next-generation sequencing (mNGS) has enabled the rapid, unbiased detection and identification of microbes without pathogen-specific reagents, culturing, or a priori knowledge of the microbial landscape. mNGS data analysis requires a series of computationally intensive processing steps to accurately determine the microbial composition of a sample. Existing mNGS data analysis tools typically require bioinformatics expertise and access to local server-class hardware resources. For many research laboratories, this presents an obstacle, especially in resource-limited environments. FINDINGS We present IDseq, an open source cloud-based metagenomics pipeline and service for global pathogen detection and monitoring (https://idseq.net). The IDseq Portal accepts raw mNGS data, performs host and quality filtration steps, then executes an assembly-based alignment pipeline, which results in the assignment of reads and contigs to taxonomic categories. The taxonomic relative abundances are reported and visualized in an easy-to-use web application to facilitate data interpretation and hypothesis generation. Furthermore, IDseq supports environmental background model generation and automatic internal spike-in control recognition, providing statistics that are critical for data interpretation. IDseq was designed with the specific intent of detecting novel pathogens. Here, we benchmark novel virus detection capability using both synthetically evolved viral sequences and real-world samples, including IDseq analysis of a nasopharyngeal swab sample acquired and processed locally in Cambodia from a tourist from Wuhan, China, infected with the recently emergent SARS-CoV-2. CONCLUSION The IDseq Portal reduces the barrier to entry for mNGS data analysis and enables bench scientists, clinicians, and bioinformaticians to gain insight from mNGS datasets for both known and novel pathogens.
Collapse
Affiliation(s)
- Katrina L Kalantar
- Chan Zuckerberg Initiative, Science, PO Box 8040 Redwood City, CA 94063, USA
| | - Tiago Carvalho
- Chan Zuckerberg Initiative, Science, PO Box 8040 Redwood City, CA 94063, USA
| | | | - Boris Dimitrov
- Chan Zuckerberg Initiative, Science, PO Box 8040 Redwood City, CA 94063, USA
| | - Greg Dingle
- Chan Zuckerberg Initiative, Science, PO Box 8040 Redwood City, CA 94063, USA
| | - Rebecca Egger
- Chan Zuckerberg Initiative, Science, PO Box 8040 Redwood City, CA 94063, USA
| | - Julie Han
- Chan Zuckerberg Initiative, Science, PO Box 8040 Redwood City, CA 94063, USA
| | - Olivia B Holmes
- Chan Zuckerberg Initiative, Science, PO Box 8040 Redwood City, CA 94063, USA
| | - Yun-Fang Juan
- Chan Zuckerberg Initiative, Science, PO Box 8040 Redwood City, CA 94063, USA
| | - Ryan King
- Chan Zuckerberg Initiative, Science, PO Box 8040 Redwood City, CA 94063, USA
| | - Andrey Kislyuk
- Chan Zuckerberg Initiative, Science, PO Box 8040 Redwood City, CA 94063, USA
| | - Michael F Lin
- Chan Zuckerberg Initiative, Science, PO Box 8040 Redwood City, CA 94063, USA
| | - Maria Mariano
- Chan Zuckerberg Initiative, Science, PO Box 8040 Redwood City, CA 94063, USA
| | - Todd Morse
- Chan Zuckerberg Initiative, Science, PO Box 8040 Redwood City, CA 94063, USA
| | - Lucia V Reynoso
- Chan Zuckerberg Initiative, Science, PO Box 8040 Redwood City, CA 94063, USA
| | - David Rissato Cruz
- Chan Zuckerberg Initiative, Science, PO Box 8040 Redwood City, CA 94063, USA
| | - Jonathan Sheu
- Chan Zuckerberg Initiative, Science, PO Box 8040 Redwood City, CA 94063, USA
| | - Jennifer Tang
- Chan Zuckerberg Initiative, Science, PO Box 8040 Redwood City, CA 94063, USA
| | - James Wang
- Chan Zuckerberg Initiative, Science, PO Box 8040 Redwood City, CA 94063, USA
| | - Mark A Zhang
- Chan Zuckerberg Initiative, Science, PO Box 8040 Redwood City, CA 94063, USA
| | - Emily Zhong
- Chan Zuckerberg Initiative, Science, PO Box 8040 Redwood City, CA 94063, USA
| | - Vida Ahyong
- Chan Zuckerberg Biohub, 499 Illinois St, San Francisco, CA 94158, USA
| | - Sreyngim Lay
- Malaria and Vector Research Laboratory, National Institute of Allergy and Infectious Diseases, Phnom Penh, Cambodia
| | - Sophana Chea
- Malaria and Vector Research Laboratory, National Institute of Allergy and Infectious Diseases, Phnom Penh, Cambodia
| | - Jennifer A Bohl
- Malaria and Vector Research Laboratory, National Institute of Allergy and Infectious Diseases, Phnom Penh, Cambodia
| | - Jessica E Manning
- Malaria and Vector Research Laboratory, National Institute of Allergy and Infectious Diseases, Phnom Penh, Cambodia
| | - Cristina M Tato
- Chan Zuckerberg Biohub, 499 Illinois St, San Francisco, CA 94158, USA
| | - Joseph L DeRisi
- Chan Zuckerberg Biohub, 499 Illinois St, San Francisco, CA 94158, USA
| |
Collapse
|
2
|
Petitjean M, Badel A, Veitia RA, Vanet A. Synthetic lethals in HIV: ways to avoid drug resistance : Running title: Preventing HIV resistance. Biol Direct 2015; 10:17. [PMID: 25888435 PMCID: PMC4399722 DOI: 10.1186/s13062-015-0044-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2014] [Accepted: 02/23/2015] [Indexed: 12/19/2022] Open
Abstract
Background RNA viruses rapidly accumulate genetic variation, which can give rise to synthetic lethal (SL) and deleterious (SD) mutations. Synthetic lethal mutations (non-lethal when alone but lethal when combined in one genome) have been studied to develop cancer therapies. This principle can also be used against fast-evolving RNA-viruses. Indeed, targeting protein sites involved in SD + SL interactions with a drug would render any mutation of such sites, lethal. Results Here, we set up a strategy to detect intragenic pairs of SL and SD at the surface of the protein to predict less escapable drug target sites. For this, we detected SD + SL, studying HIV protease (PR) and reverse transcriptase (RT) sequence alignments from two groups of VIH+ individuals: treated with drugs (T) or not (NT). Using a series of statistical approaches, we were able to propose bona fide SD + SL couples. When focusing on spatially close co-variant SD + SL couples at the surface of the protein, we found 5 SD + SL groups (2 in the protease and 3 in the reverse transcriptase), which could be good candidates to form pockets to accommodate potential drugs. Conclusions Thus, designing drugs targeting these specific SD + SL groups would not allow the virus to mutate any residue involved in such groups without losing an essential function. Moreover, we also show that the selection pressure induced by the treatment leads to the appearance of new mutations, which change the mutational landscape of the protein. This drives the existence of differential SD + SL couples between the drug-treated and non-treated groups. Thus, new anti-viral drugs should be designed differently to target such groups. Reviewers This article was reviewed by Neil Greenspan Csaba Pal and István Simon. Electronic supplementary material The online version of this article (doi:10.1186/s13062-015-0044-y) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Michel Petitjean
- Univ Paris Diderot, Sorbonne Paris Cité, F-75013, Paris, France. .,MTI, INSERM UMR-S 973, F-75013, Paris, France.
| | - Anne Badel
- Univ Paris Diderot, Sorbonne Paris Cité, F-75013, Paris, France. .,MTI, INSERM UMR-S 973, F-75013, Paris, France.
| | - Reiner A Veitia
- Univ Paris Diderot, Sorbonne Paris Cité, F-75013, Paris, France. .,CNRS, UMR7592, Institut Jacques Monod, F-75013, Paris, France.
| | - Anne Vanet
- Univ Paris Diderot, Sorbonne Paris Cité, F-75013, Paris, France. .,CNRS, UMR7592, Institut Jacques Monod, F-75013, Paris, France. .,Atelier de Bio Informatique, F-75005, Paris, France.
| |
Collapse
|