1
Liu J, Sun X, Zuo Y, Hu Q, He X. Plant species shape the bacterial communities on the phyllosphere in a hyper-arid desert. Microbiol Res 2023; 269:127314. [PMID: 36724560 DOI: 10.1016/j.micres.2023.127314] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Received: 06/18/2022] [Revised: 01/14/2023] [Accepted: 01/23/2023] [Indexed: 01/28/2023]
Abstract
Microorganisms are an important component of global biodiversity. However, they are vulnerable to hyper-arid climates in desert regions. Xerophytes are desert vegetation with unique biodiversity. However, little is known about the identities and communities of phyllosphere epiphytic microorganisms inhabiting the xerophyte leaf surface in the hot and dry environment. The diversity and community composition of phyllosphere epiphytes on different desert plants in Gansu, China, was investigated using the next-generation sequencing technique, revealing the diversity and community composition of the phyllosphere epiphytic bacteria associated with desert xerophytes. In addition, the ecological functions of the bacterial communities were investigated by combining the sequence classification information and prokaryotic taxonomic function annotation (FAPROTAX). This study determined the phyllosphere bacterial community composition, microbial interactions, and their functions. Despite harsh environments in the arid desert, we found that there are still diverse epiphytic bacteria on the leaves of desert plants. The bacterial communities mainly included Actinobacteria (52.79%), Firmicutes (31.62%), and Proteobacteria (12.20%). Further comparisons revealed different microbial communities, including Firmicutes at the phylum and Paenibacillaceae at the family level, in the phyllosphere among different plants, suggesting that the host plants had strong filter effects on bacteria. Co-occurrence network analysis revealed positive relationships were dominant among different bacterial taxa. The abundance of Actinobacteria and Proteobacteria was positively correlated, demonstrating their mutual relationship. On the other hand, the abundance of Firmicutes was negatively correlated, which suggested that they inhibit the growth of other bacterial taxa. 
FAPROTAX prediction revealed that chemoheterotrophy (accounting for 39.02% of the community) and aerobic chemoheterotrophy (37.01%) were the main functions of the leaf epiphytic bacteria on desert plants. This study improves our understanding of the community composition and ecological functions of plant-associated microbial communities inhabiting scattered niches in the desert ecosystem. In addition, the study provides insight into the biodiversity assessment in the desert region.
Affiliation(s)
- Jiaqiang Liu
- School of Life Sciences, Hebei University, Baoding 071002, China.
- Xiang Sun
- School of Life Sciences, Hebei University, Baoding 071002, China.
- Yiling Zuo
- School of Life Sciences, Hebei University, Baoding 071002, China.
- Qiannan Hu
- School of Life Sciences, Hebei University, Baoding 071002, China.
- Xueli He
- School of Life Sciences, Hebei University, Baoding 071002, China.
2
High Prevalence and Diversity of Caliciviruses in a Community Setting Determined by a Metagenomic Approach. Microbiol Spectr 2022; 10:e0185321. [PMID: 35196791 PMCID: PMC8865552 DOI: 10.1128/spectrum.01853-21] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/20/2022]
Abstract
We recently carried out a metagenomic study to determine the fecal virome of infants during their first year of life in a semirural community in Mexico. A total of 97 stool samples from nine children were collected starting 2 weeks after birth and monthly thereafter until 12 months of age. In this work, we describe the prevalence and incidence of caliciviruses in this birth cohort. We found that 54 (56%) and 24 (25%) of the samples were positive for norovirus and sapovirus sequence reads detected by next-generation sequencing, respectively. Potential infections were arbitrarily considered when at least 20% of the complete virus genome was determined. Considering only these samples, there were 3 cases per child/year for norovirus and 0.33 cases per child/year for sapovirus. All nine children had sequence reads related to norovirus in at least 2 and up to 10 samples, and 8 children excreted sapovirus sequence reads in 1 and up to 5 samples during the study. The virus in 35 samples could be genotyped. The results showed a high diversity of both norovirus (GI.3[P13], GI.5, GII.4, GII.4[P16], GII.7[P7], and GII.17[P17]) and sapovirus (GI.1, GI.7, and GII.4) in the community. Of interest, despite the frequent detection of caliciviruses in the stools, all children remained asymptomatic during the study. Our results clearly show that metagenomic studies in stools may reveal a detailed picture of the prevalence and diversity of gastrointestinal viruses in the human gut during the first year of life. IMPORTANCE Human caliciviruses are important etiological agents of acute gastroenteritis in children under 5 years of age. Several studies have characterized their association with childhood diarrhea and their presence in nondiarrheal stool samples. In this work, we used a next-generation sequencing approach to determine, in a longitudinal study, the fecal virome of infants during their first year of life. 
Using this method, we found that caliciviruses can be detected significantly more frequently than previously reported, providing a more detailed picture of the prevalence and genetic diversity of these viruses in the human gut during early life.
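The 20% genome threshold described in this abstract can be illustrated with a breadth-of-coverage calculation. The following is a minimal sketch, not the authors' pipeline: the function names, the half-open interval convention, and the 7,500-nt genome length in the usage note are assumptions made for illustration only.

```python
def covered_fraction(intervals, genome_length):
    """Fraction of genome positions covered by at least one aligned read.

    intervals: iterable of (start, end) pairs, 0-based and half-open
    (an assumed convention; the abstract does not specify one).
    """
    if genome_length <= 0:
        raise ValueError("genome_length must be positive")
    covered = 0
    current_end = 0  # rightmost position already counted
    for start, end in sorted(intervals):
        # Clip to the genome and skip the part already counted.
        start = max(start, current_end, 0)
        end = min(end, genome_length)
        if end > start:
            covered += end - start
            current_end = end
    return covered / genome_length


def is_potential_infection(intervals, genome_length, threshold=0.20):
    """Apply the abstract's arbitrary cutoff: at least 20% of the genome recovered."""
    return covered_fraction(intervals, genome_length) >= threshold
```

For example, reads covering positions 0 to 1,000 and 800 to 2,000 of a hypothetical 7,500-nt genome span 2,000 distinct positions (about 27%), so the sample would count as a potential infection under this sketch.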
3
Miller RR, Uyaguari-Diaz M, McCabe MN, Montoya V, Gardy JL, Parker S, Steiner T, Hsiao W, Nesbitt MJ, Tang P, Patrick DM. Metagenomic Investigation of Plasma in Individuals with ME/CFS Highlights the Importance of Technical Controls to Elucidate Contamination and Batch Effects. PLoS One 2016; 11:e0165691. [PMID: 27806082 PMCID: PMC5091812 DOI: 10.1371/journal.pone.0165691] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Received: 06/13/2016] [Accepted: 10/17/2016] [Indexed: 12/24/2022]
Abstract
Myalgic Encephalomyelitis/Chronic Fatigue Syndrome (ME/CFS) is a debilitating disease causing indefinite fatigue. ME/CFS has long been hypothesised to have an infectious cause; however, no specific infectious agent has been identified. We used metagenomics to analyse the RNA from plasma samples from 25 individuals with ME/CFS and compare their microbial content to technical controls as well as three control groups: individuals with alternatively diagnosed chronic Lyme syndrome (N = 13), systemic lupus erythematosus (N = 11), and healthy controls (N = 25). We found that the majority of sequencing reads were removed during host subtraction, thus there was very low microbial RNA content in the plasma. The effects of sample batching and contamination during sample processing proved to outweigh the effects of study group on microbial RNA content, as the few differences in bacterial or viral RNA abundance we did observe between study groups were most likely caused by contamination and batch effects. Our results highlight the importance of including negative controls in all metagenomic analyses, since there was considerable overlap between bacterial content identified in study samples and control samples. For example, Proteobacteria, Firmicutes, Actinobacteria, and Bacteriodes were found in both study samples and plasma-free negative controls. Many of the taxonomic groups we saw in our plasma-free negative control samples have previously been associated with diseases, including ME/CFS, demonstrating how incorrect conclusions may arise if controls are not used and batch effects not accounted for.
Affiliation(s)
- Ruth R. Miller
- School of Population and Public Health, University of British Columbia, Vancouver, British Columbia, Canada
- Miguel Uyaguari-Diaz
- British Columbia Centre for Disease Control, Vancouver, British Columbia, Canada
- Mark N. McCabe
- British Columbia Centre for Disease Control, Vancouver, British Columbia, Canada
- Vincent Montoya
- British Columbia Centre for Disease Control, Vancouver, British Columbia, Canada
- Jennifer L. Gardy
- School of Population and Public Health, University of British Columbia, Vancouver, British Columbia, Canada
- British Columbia Centre for Disease Control, Vancouver, British Columbia, Canada
- Shoshana Parker
- Centre for Health Evaluation and Outcome Sciences, Vancouver, British Columbia, Canada
- Theodore Steiner
- Department of Medicine, University of British Columbia, Vancouver, British Columbia, Canada
- William Hsiao
- British Columbia Public Health Microbiology and Reference Laboratory, Vancouver, British Columbia, Canada
- Department of Pathology and Laboratory Medicine, University of British Columbia, Vancouver, British Columbia, Canada
- Patrick Tang
- Department of Pathology, Sidra Medical and Research Center, Doha, Qatar
- David M. Patrick
- School of Population and Public Health, University of British Columbia, Vancouver, British Columbia, Canada
- British Columbia Centre for Disease Control, Vancouver, British Columbia, Canada
4
A Pan-HIV Strategy for Complete Genome Sequencing. J Clin Microbiol 2015; 54:868-82. [PMID: 26699702 DOI: 10.1128/jcm.02479-15] [Citation(s) in RCA: 38] [Impact Index Per Article: 4.2] [Received: 09/11/2015] [Accepted: 12/16/2015] [Indexed: 01/23/2023]
Abstract
Molecular surveillance is essential to monitor HIV diversity and track emerging strains. We have developed a universal library preparation method (HIV-SMART [i.e., switching mechanism at 5' end of RNA transcript]) for next-generation sequencing that harnesses the specificity of HIV-directed priming to enable full genome characterization of all HIV-1 groups (M, N, O, and P) and HIV-2. Broad application of the HIV-SMART approach was demonstrated using a panel of diverse cell-cultured virus isolates. HIV-1 non-subtype B-infected clinical specimens from Cameroon were then used to optimize the protocol to sequence directly from plasma. When multiplexing 8 or more libraries per MiSeq run, full genome coverage at a median ∼2,000× depth was routinely obtained for either sample type. The method reproducibly generated the same consensus sequence, consistently identified viral sequence heterogeneity present in specimens, and at viral loads of ≤4.5 log copies/ml yielded sufficient coverage to permit strain classification. HIV-SMART provides an unparalleled opportunity to identify diverse HIV strains in patient specimens and to determine phylogenetic classification based on the entire viral genome. Easily adapted to sequence any RNA virus, this technology illustrates the utility of next-generation sequencing (NGS) for viral characterization and surveillance.
5
Luk KC, Berg MG, Naccache SN, Kabre B, Federman S, Mbanya D, Kaptué L, Chiu CY, Brennan CA, Hackett J. Utility of Metagenomic Next-Generation Sequencing for Characterization of HIV and Human Pegivirus Diversity. PLoS One 2015; 10:e0141723. [PMID: 26599538 PMCID: PMC4658132 DOI: 10.1371/journal.pone.0141723] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.6] [Received: 08/19/2015] [Accepted: 10/12/2015] [Indexed: 02/06/2023]
Abstract
Given the dynamic changes in HIV-1 complexity and diversity, next-generation sequencing (NGS) has the potential to revolutionize strategies for effective HIV global surveillance. In this study, we explore the utility of metagenomic NGS to characterize divergent strains of HIV-1 and to simultaneously screen for other co-infecting viruses. Thirty-five HIV-1-infected Cameroonian blood donor specimens with viral loads of >4.4 log10 copies/ml were selected to include a diverse representation of group M strains. Random-primed NGS libraries, prepared from plasma specimens, resulted in greater than 90% genome coverage for 88% of specimens. Correct subtype designations based on NGS were concordant with sub-region PCR data in 31 of 35 (89%) cases. Complete genomes were assembled for 25 strains, including circulating recombinant forms with relatively limited data available (7 CRF11_cpx, 2 CRF13_cpx, 1 CRF18_cpx, and 1 CRF37_cpx), as well as 9 unique recombinant forms. HPgV (formerly designated GBV-C) co-infection was detected in 9 of 35 (25%) specimens, of which eight specimens yielded complete genomes. The recovered HPgV genomes formed a diverse cluster with genotype 1 sequences previously reported from Ghana, Uganda, and Japan. The extensive genome coverage obtained by NGS improved accuracy and confidence in phylogenetic classification of the HIV-1 strains present in the study population relative to conventional sub-region PCR. In addition, these data demonstrate the potential for metagenomic analysis to be used for routine characterization of HIV-1 and identification of other viral co-infections.
Affiliation(s)
- Ka-Cheung Luk
- Abbott Diagnostics, Infectious Disease Research, Abbott Park, Illinois, United States of America
- Michael G Berg
- Abbott Diagnostics, Infectious Disease Research, Abbott Park, Illinois, United States of America
- Samia N Naccache
- Department of Laboratory Medicine, University of California San Francisco, San Francisco, California, United States of America; UCSF-Abbott Viral Diagnostics and Discovery Center, San Francisco, California, United States of America
- Beniwende Kabre
- Department of Laboratory Medicine, University of California San Francisco, San Francisco, California, United States of America; UCSF-Abbott Viral Diagnostics and Discovery Center, San Francisco, California, United States of America
- Scot Federman
- Department of Laboratory Medicine, University of California San Francisco, San Francisco, California, United States of America; UCSF-Abbott Viral Diagnostics and Discovery Center, San Francisco, California, United States of America
- Charles Y Chiu
- Department of Laboratory Medicine, University of California San Francisco, San Francisco, California, United States of America; UCSF-Abbott Viral Diagnostics and Discovery Center, San Francisco, California, United States of America; Department of Medicine, Division of Infectious Diseases, University of California San Francisco, San Francisco, California, United States of America
- Catherine A Brennan
- Abbott Diagnostics, Infectious Disease Research, Abbott Park, Illinois, United States of America
- John Hackett
- Abbott Diagnostics, Infectious Disease Research, Abbott Park, Illinois, United States of America
6
Lundin S, Jemt A, Terje-Hegge F, Foam N, Pettersson E, Käller M, Wirta V, Lexow P, Lundeberg J. Endonuclease specificity and sequence dependence of type IIS restriction enzymes. PLoS One 2015; 10:e0117059. [PMID: 25629514 PMCID: PMC4309577 DOI: 10.1371/journal.pone.0117059] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Received: 04/08/2014] [Accepted: 12/17/2014] [Indexed: 11/23/2022]
Abstract
Restriction enzymes that recognize specific sequences but cleave unknown sequence outside the recognition site are extensively utilized tools in molecular biology. Despite this, systematic functional categorization of cleavage performance has largely been lacking. We established a simple and automatable model system to assay cleavage distance variation (termed slippage) and the sequence dependence thereof. We coupled this to massively parallel sequencing in order to provide sensitive and accurate measurement. With this system 14 enzymes were assayed (AcuI, BbvI, BpmI, BpuEI, BseRI, BsgI, Eco57I, Eco57MI, EcoP15I, FauI, FokI, GsuI, MmeI and SmuI). We report significant variation of slippage ranging from 1–54%, variations in sequence context dependence, as well as variation between isoschizomers. We believe this largely overlooked property of enzymes with shifted cleavage would benefit from further large scale classification and engineering efforts seeking to improve performance. The gained insights of in-vitro performance may also aid the in-vivo understanding of these enzymes.
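One way to picture a slippage percentage of the kind reported here is as the share of sequenced cleavage events that fall at any distance other than the most common one. The sketch below rests on that assumption; the abstract does not give the authors' exact definition, and the function name and input format are hypothetical.

```python
from collections import Counter


def slippage_fraction(cut_distances):
    """Return (modal_distance, fraction of reads cut at any other distance).

    cut_distances: one observed cleavage distance per read, measured in
    bases from the recognition site (an assumed input format).
    """
    counts = Counter(cut_distances)
    if not counts:
        raise ValueError("no cleavage events supplied")
    modal_distance, modal_count = counts.most_common(1)[0]
    total = sum(counts.values())
    # Everything that is not at the modal distance is counted as slippage.
    return modal_distance, (total - modal_count) / total
```

With nine reads cut at distance 16 and one at distance 15, this sketch reports a modal distance of 16 and 10% slippage.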
Affiliation(s)
- Sverker Lundin
- Science for Life Laboratory, KTH, Gene Technology, Solna, 171 65, Sweden
- Anders Jemt
- Science for Life Laboratory, KTH, Gene Technology, Solna, 171 65, Sweden
- Joakim Lundeberg
- Science for Life Laboratory, KTH, Gene Technology, Solna, 171 65, Sweden
7
Taboada B, Espinoza MA, Isa P, Aponte FE, Arias-Ortiz MA, Monge-Martínez J, Rodríguez-Vázquez R, Díaz-Hernández F, Zárate-Vidal F, Wong-Chew RM, Firo-Reyes V, del Río-Almendárez CN, Gaitán-Meza J, Villaseñor-Sierra A, Martínez-Aguilar G, Salas-Mier MDC, Noyola DE, Pérez-Gónzalez LF, López S, Santos-Preciado JI, Arias CF. Is there still room for novel viral pathogens in pediatric respiratory tract infections? PLoS One 2014; 9:e113570. [PMID: 25412469 PMCID: PMC4239085 DOI: 10.1371/journal.pone.0113570] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Received: 07/20/2014] [Accepted: 10/24/2014] [Indexed: 11/30/2022]
Abstract
Viruses are the most frequent cause of respiratory disease in children. However, despite the advanced diagnostic methods currently in use, in 20 to 50% of respiratory samples a specific pathogen cannot be detected. In this work, we used a metagenomic approach and deep sequencing to examine respiratory samples from children with lower and upper respiratory tract infections that had been previously found negative for 6 bacteria and 15 respiratory viruses by PCR. Nasal washings from 25 children (out of 250) hospitalized with a diagnosis of pneumonia and nasopharyngeal swabs from 46 outpatient children (out of 526) were studied. DNA reads for at least one virus commonly associated with respiratory infections were found in 20 of 25 hospitalized patients, while reads for pathogenic respiratory bacteria were detected in the remaining 5 children. For outpatients, all the samples were pooled into 25 DNA libraries for sequencing. In this case, at least one respiratory virus was identified in 22 of the 25 sequenced libraries, while pathogenic bacteria were detected in all others but one. In both patient groups reads for respiratory syncytial virus, coronavirus-OC43, and rhinovirus were identified. In addition, viruses less frequently associated with respiratory infections were also found. Saffold virus was detected in outpatient but not in hospitalized children. Anellovirus, rotavirus, and astrovirus, as well as several animal and plant viruses, were detected in both groups. No novel viruses were identified. Adding the deep sequencing results to the PCR data, 79.2% of 250 hospitalized and 76.6% of 526 ambulatory patients were positive for viruses, and all other children but one had pathogenic respiratory bacteria identified. These results suggest that, at least in the type of populations studied and with the sampling methods used, the odds of finding novel, clinically relevant viruses in pediatric respiratory infections are low.
Affiliation(s)
- Blanca Taboada
- Instituto de Biotecnología, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, Mexico
- Marco A. Espinoza
- Instituto de Biotecnología, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, Mexico
- Pavel Isa
- Instituto de Biotecnología, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, Mexico
- Fernando E. Aponte
- Instituto de Biotecnología, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, Mexico
- Rosa María Wong-Chew
- Facultad de Medicina, Universidad Nacional Autónoma de México, México D.F., Mexico
- Jesús Gaitán-Meza
- Nuevo Hospital Civil de Guadalajara "Dr. Juan I. Menchaca", Guadalajara, Jalisco, Mexico
- Daniel E. Noyola
- Universidad Autónoma de San Luis Potosí, San Luis Potosí, Mexico
- Susana López
- Instituto de Biotecnología, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, Mexico
- Carlos F. Arias
- Instituto de Biotecnología, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, Mexico
8
Optimization of virus detection in cells using massively parallel sequencing. Biologicals 2013; 42:34-41. [PMID: 24309095 DOI: 10.1016/j.biologicals.2013.11.002] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Received: 04/25/2013] [Revised: 10/28/2013] [Accepted: 11/08/2013] [Indexed: 11/22/2022]
Abstract
Massively parallel sequencing (MPS)-based virus detection has potential regulatory applications. We studied the ability of one of these approaches, based on degenerate oligonucleotide primer (DOP)-polymerase chain reaction (PCR), to detect viral sequences in cell lines known to express viral genes or particles. DOP-PCR was highly sensitive for the detection of small quantities of isolated viral sequences. Detected viral sequences included nodavirus, bracovirus, and endogenous retroviruses in High Five cells, porcine circovirus type 1 and porcine endogenous retrovirus in PK15 cells, human T-cell leukemia virus 1 in MJ cells, human papillomavirus 18 in HeLa cells, human herpesvirus 8 in BCBL-1 cells, and Epstein-Barr Virus in Raji cells. Illumina sequencing (for which primers were most efficiently added using PCR) provided greater sensitivity for virus detection than Roche 454 sequencing. Analyzing nucleic acids extracted both directly from samples and from capsid-enriched preparations provided useful information. Although there are limitations of these methods, these results indicate significant promise for the combination of nonspecific PCR and MPS in identifying contaminants in clinical and biological samples, including cell lines and reagents used to produce vaccines and therapeutic products.
9
Ruan J, Jiang L, Chong Z, Gong Q, Li H, Li C, Tao Y, Zheng C, Zhai W, Turissini D, Cannon CH, Lu X, Wu CI. Pseudo-Sanger sequencing: massively parallel production of long and near error-free reads using NGS technology. BMC Genomics 2013; 14:711. [PMID: 24134808 PMCID: PMC4046676 DOI: 10.1186/1471-2164-14-711] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Received: 06/28/2013] [Accepted: 10/07/2013] [Indexed: 01/15/2023]
Abstract
Background: Next-generation sequencing (NGS) technology usually offers ultra-high throughput, but its read length is remarkably short compared to conventional Sanger sequencing. Paired-end NGS could computationally extend the read length, but with considerable practical inconvenience because of the inherent gaps. Now that Illumina paired-end sequencing is able to read both ends of 600 bp or even 800 bp DNA fragments, how to fill in the gaps between paired ends to produce accurate long reads is intriguing but challenging.
Results: We have developed a new technology, referred to as pseudo-Sanger (PS) sequencing. It tries to fill in the gaps between paired ends and could generate near error-free sequences equivalent to conventional Sanger reads in length but with the high throughput of NGS. The major novelty of the PS method is that the gap filling is based on local assembly of paired-end reads that overlap at either end. Thus, we are able to fill in the gaps in repetitive genomic regions correctly. PS sequencing starts with short reads from NGS platforms, using a series of paired-end libraries of stepwise decreasing insert sizes. A computational method is introduced to transform these special paired-end reads into long and near error-free PS sequences, which correspond in length to those with the largest insert sizes. The PS construction has 3 advantages over untransformed reads: gap filling, error correction and heterozygote tolerance. Among the many applications of the PS construction is de novo genome assembly, which we tested in this study. Assembly of PS reads from a non-isogenic strain of Drosophila melanogaster yields an N50 contig of 190 kb, a 5-fold improvement over the existing de novo assembly methods and a 3-fold advantage over the assembly of long reads from 454 sequencing.
Conclusions: Our method generated near error-free long reads from NGS paired-end sequencing. We demonstrated that de novo assembly could benefit substantially from these Sanger-like reads. In addition, the long reads could be applied to tasks such as structural variation detection and metagenomics.
Electronic supplementary material: The online version of this article (doi:10.1186/1471-2164-14-711) contains supplementary material, which is available to authorized users.
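The N50 contig statistic cited in this abstract is a standard assembly metric: the length of the contig at which, going from longest to shortest, the running total first reaches half of the assembly's total length. A minimal sketch follows; the function name and the example lengths are illustrative and are not taken from the paper.

```python
def n50(contig_lengths):
    """Return the N50 of a collection of contig lengths.

    Contigs are visited from longest to shortest; the N50 is the length
    of the contig at which the cumulative sum first reaches at least
    half of the total assembly length.
    """
    lengths = sorted(contig_lengths, reverse=True)
    total = sum(lengths)
    if total <= 0:
        raise ValueError("contig_lengths must contain positive lengths")
    running = 0
    for length in lengths:
        running += length
        if 2 * running >= total:  # integer-safe test for running >= total / 2
            return length
```

A larger N50 means that more of the assembly sits in long contigs, which is why the abstract reports its improvement as a fold change in this value.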
Affiliation(s)
- Xuemei Lu
- Laboratory of Disease Genomics and Individualized Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, People's Republic of China.
10
Le Roch KG, Chung DWD, Ponts N. Genomics and integrated systems biology in Plasmodium falciparum: a path to malaria control and eradication. Parasite Immunol 2012; 34:50-60. [PMID: 21995286 DOI: 10.1111/j.1365-3024.2011.01340.x] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.8] [Indexed: 11/28/2022]
Abstract
The first draft of the human malaria parasite's genome was released in 2002. Since then, the malaria scientific community has witnessed a steady embrace of new and powerful functional genomic studies. Over the years, these approaches have slowly revolutionized malaria research and enabled the comprehensive, unbiased investigation of various aspects of the parasite's biology. These genome-wide analyses delivered a refined annotation of the parasite's genome, delivered a better knowledge of its RNA, proteins and metabolite derivatives, and fostered the discovery of new vaccine and drug targets. Despite the positive impacts of these genomic studies, most research and investment still focus on protein targets, drugs and vaccine candidates that were known before the publication of the parasite genome sequence. However, recent access to next-generation sequencing technologies, along with an increased number of genome-wide applications, is expanding the impact of the parasite genome on biomedical research, contributing to a paradigm shift in research activities that may possibly lead to new optimized diagnosis and treatments. This review provides an update of Plasmodium falciparum genome sequences and an overview of the rapid development of genomics and system biology applications that have an immense potential of creating powerful tools for a successful malaria eradication campaign.
Affiliation(s)
- K G Le Roch
- Department of Cell Biology and Neuroscience, University of California Riverside, Institute for Integrative Genome Biology, and Center for Disease Vector Research, Riverside, CA 92521, USA.
11
Translational research in infectious disease: current paradigms and challenges ahead. Transl Res 2012; 159:430-53. [PMID: 22633095 PMCID: PMC3361696 DOI: 10.1016/j.trsl.2011.12.009] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.4] [Received: 10/05/2011] [Revised: 12/23/2011] [Accepted: 12/24/2012] [Indexed: 12/25/2022]
Abstract
In recent years, the biomedical community has witnessed a rapid scientific and technologic evolution after the development and refinement of high-throughput methodologies. Concurrently and consequentially, the scientific perspective has changed from the reductionist approach of meticulously analyzing the fine details of a single component of biology to the "holistic" approach of broadmindedly examining the globally interacting elements of biological systems. The emergence of this new way of thinking has brought about a scientific revolution in which genomics, proteomics, metabolomics, and other "omics" have become the predominant tools by which large amounts of data are amassed, analyzed, and applied to complex questions of biology that were previously unsolvable. This enormous transformation of basic science research and the ensuing plethora of promising data, especially in the realm of human health and disease, have unfortunately not been followed by a parallel increase in the clinical application of this information. On the contrary, the number of new potential drugs in development has been decreasing steadily, suggesting the existence of roadblocks that prevent the translation of promising research into medically relevant therapeutic or diagnostic application. In this article, we will review, in a noninclusive fashion, several recent scientific advancements in the field of translational research, with a specific focus on how they relate to infectious disease. We will also present a current picture of the limitations and challenges that exist for translational research, as well as ways that have been proposed by the National Institutes of Health to improve the state of this field.
Key Words
- 2-DE, 2-dimensional electrophoresis
- 2-D DIGE, 2-dimensional differential in-gel electrophoresis
- CF, cystic fibrosis
- CTSA, Clinical and Translational Science Awards program
- EBV, Epstein-Barr virus
- FDA, U.S. Food and Drug Administration
- GWAS, genome-wide association studies
- HCV, hepatitis C virus
- HMP, Human Microbiome Project
- HPLC, high-pressure liquid chromatography
- LC, liquid chromatography
- LSB, Laboratory of Systems Biology
- mAb, monoclonal antibody
- MRM/SRM, multiple reaction monitoring/selective reaction monitoring
- MS, mass spectrometry
- MS/MS, tandem mass spectrometry
- NCATS, National Center for Advancing Translational Sciences
- NCRR, National Center of Research Resources
- NIAID, National Institute of Allergy and Infectious Disease
- NIH, National Institutes of Health
- NME, new molecular entity
- NMR, nuclear magnetic resonance
- PBMC, peripheral blood mononuclear cell
- PCR, polymerase chain reaction
- PRR, pathogen recognition receptor
- QqQ, triple quadrupole mass spectrometry
- SARS-CoV, coronavirus associated with severe acute respiratory syndrome
- SNP, single nucleotide polymorphism
- TB, tuberculosis
- UTI, urinary tract infection
- YFV, yellow fever virus
12
Anderson CM, Chen SY, Dimon MT, Oke A, DeRisi JL, Fung JC. ReCombine: a suite of programs for detection and analysis of meiotic recombination in whole-genome datasets. PLoS One 2011; 6:e25509. [PMID: 22046241 PMCID: PMC3201961 DOI: 10.1371/journal.pone.0025509] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.5] [Received: 05/25/2011] [Accepted: 08/25/2011] [Indexed: 11/18/2022]
Abstract
In meiosis, the exchange of DNA between chromosomes by homologous recombination is a critical step that ensures proper chromosome segregation and increases genetic diversity. Products of recombination include reciprocal exchanges, known as crossovers, and non-reciprocal gene conversions or non-crossovers. The mechanisms underlying meiotic recombination remain elusive, largely because of the difficulty of analyzing large numbers of recombination events by traditional genetic methods. These traditional methods are increasingly being superseded by high-throughput techniques capable of surveying meiotic recombination on a genome-wide basis. Next-generation sequencing or microarray hybridization is used to genotype thousands of polymorphic markers in the progeny of hybrid yeast strains. New computational tools are needed to perform this genotyping and to find and analyze recombination events. We have developed a suite of programs, ReCombine, for using short sequence reads from next-generation sequencing experiments to genotype yeast meiotic progeny. Upon genotyping, the program CrossOver, a component of ReCombine, then detects recombination products and classifies them into categories based on the features found at each location and their distribution among the various chromatids. CrossOver is also capable of analyzing segregation data from microarray experiments or other sources. This package of programs is designed to allow even researchers without computational expertise to use high-throughput, whole-genome methods to study the molecular mechanisms of meiotic recombination.
Affiliation(s)
- Carol M. Anderson
  - Department of Obstetrics, Gynecology, and Reproductive Sciences and Center for Reproductive Sciences, University of California San Francisco, San Francisco, California, United States of America
- Stacy Y. Chen
  - Department of Obstetrics, Gynecology, and Reproductive Sciences and Center for Reproductive Sciences, University of California San Francisco, San Francisco, California, United States of America
- Michelle T. Dimon
  - Department of Biochemistry and Biophysics, University of California San Francisco, San Francisco, California, United States of America
  - Biological and Medical Informatics Program, University of California San Francisco, San Francisco, California, United States of America
- Ashwini Oke
  - Department of Obstetrics, Gynecology, and Reproductive Sciences and Center for Reproductive Sciences, University of California San Francisco, San Francisco, California, United States of America
- Joseph L. DeRisi
  - Department of Biochemistry and Biophysics, University of California San Francisco, San Francisco, California, United States of America
  - Howard Hughes Medical Institute, Bethesda, Maryland, United States of America
- Jennifer C. Fung
  - Department of Obstetrics, Gynecology, and Reproductive Sciences and Center for Reproductive Sciences, University of California San Francisco, San Francisco, California, United States of America
13
Chen EC, Yagi S, Kelly KR, Mendoza SP, Maninger N, Rosenthal A, Spinner A, Bales KL, Schnurr DP, Lerche NW, Chiu CY. Cross-species transmission of a novel adenovirus associated with a fulminant pneumonia outbreak in a new world monkey colony. PLoS Pathog 2011; 7:e1002155. [PMID: 21779173 PMCID: PMC3136464 DOI: 10.1371/journal.ppat.1002155] [Citation(s) in RCA: 114] [Impact Index Per Article: 8.8] [Received: 01/20/2011] [Accepted: 05/23/2011] [Indexed: 12/21/2022]
Abstract
Adenoviruses are DNA viruses that naturally infect many vertebrates, including humans and monkeys, and cause a wide range of clinical illnesses in humans. Infection from individual strains has conventionally been thought to be species-specific. Here we applied the Virochip, a pan-viral microarray, to identify a novel adenovirus (TMAdV, titi monkey adenovirus) as the cause of a deadly outbreak in a closed colony of New World monkeys (titi monkeys; Callicebus cupreus) at the California National Primate Research Center (CNPRC). Among 65 titi monkeys housed in a building, 23 (34%) developed upper respiratory symptoms that progressed to fulminant pneumonia and hepatitis, and 19 of 23 monkeys, or 83% of those infected, died or were humanely euthanized. Whole-genome sequencing of TMAdV revealed that this adenovirus is a new species and highly divergent, sharing <57% pairwise nucleotide identity with other adenoviruses. Cultivation of TMAdV was successful in a human A549 lung adenocarcinoma cell line, but not in primary or established monkey kidney cells. At the onset of the outbreak, the researcher in closest contact with the monkeys developed an acute respiratory illness, with symptoms persisting for 4 weeks, and had a convalescent serum sample seropositive for TMAdV. A clinically ill family member, despite having no contact with the CNPRC, also tested positive, and screening of a set of 81 random adult blood donors from the Western United States detected TMAdV-specific neutralizing antibodies in 2 individuals (2/81, or 2.5%). These findings raise the possibility of zoonotic infection by TMAdV and human-to-human transmission of the virus in the population. Given the unusually high case fatality rate from the outbreak (83%), it is unlikely that titi monkeys are the native host species for TMAdV, and the natural reservoir of the virus is still unknown. 
The discovery of TMAdV, a novel adenovirus with the capacity to infect both monkeys and humans, suggests that adenoviruses should be monitored closely as potential causes of cross-species outbreaks. Infection from adenoviruses, viruses that cause a variety of illnesses in humans, monkeys, and other animals, has conventionally been thought to be species-specific. We used the Virochip, a microarray designed to detect all viruses, to identify a new species of adenovirus (TMAdV, or titi monkey adenovirus) that caused a deadly outbreak in a colony of New World titi monkeys at the California National Primate Research Center (CNPRC), and also infected a human researcher. One-third of the monkeys developed pneumonia and liver inflammation, and 19 of 23 monkeys died or were humanely euthanized. The unusually high death rate (83%) makes titi monkeys unlikely to be natural hosts for TMAdV, and the genomic sequence of TMAdV revealed that it is very different from any other known adenovirus. The researcher developed an acute respiratory illness at the onset of the outbreak, and was found to be infected by TMAdV by subsequent antibody testing. A clinically ill family member with no prior contact with the CNPRC also tested positive. Further investigation is needed to identify whether TMAdV originated from humans, monkeys, or another animal. The discovery of TMAdV suggests that adenoviruses should be monitored closely as potential causes of cross-species outbreaks.
Affiliation(s)
- Eunice C. Chen
  - Department of Laboratory Medicine, University of California San Francisco, San Francisco, California, United States of America
  - UCSF-Abbott Viral Diagnostics and Discovery Center, University of California San Francisco, San Francisco, California, United States of America
- Shigeo Yagi
  - Viral and Rickettsial Disease Laboratory, California Department of Public Health, Richmond, California, United States of America
- Kristi R. Kelly
  - California National Primate Research Center, University of California Davis, Davis, California, United States of America
- Sally P. Mendoza
  - California National Primate Research Center, University of California Davis, Davis, California, United States of America
- Nicole Maninger
  - California National Primate Research Center, University of California Davis, Davis, California, United States of America
- Ann Rosenthal
  - California National Primate Research Center, University of California Davis, Davis, California, United States of America
- Abigail Spinner
  - California National Primate Research Center, University of California Davis, Davis, California, United States of America
- Karen L. Bales
  - California National Primate Research Center, University of California Davis, Davis, California, United States of America
  - Department of Psychology, University of California Davis, Davis, California, United States of America
- David P. Schnurr
  - Viral and Rickettsial Disease Laboratory, California Department of Public Health, Richmond, California, United States of America
- Nicholas W. Lerche
  - California National Primate Research Center, University of California Davis, Davis, California, United States of America
- Charles Y. Chiu
  - Department of Laboratory Medicine, University of California San Francisco, San Francisco, California, United States of America
  - UCSF-Abbott Viral Diagnostics and Discovery Center, University of California San Francisco, San Francisco, California, United States of America
  - Department of Medicine, Division of Infectious Diseases, University of California San Francisco, San Francisco, California, United States of America
14
Vignali M, Armour CD, Chen J, Morrison R, Castle JC, Biery MC, Bouzek H, Moon W, Babak T, Fried M, Raymond CK, Duffy PE. NSR-seq transcriptional profiling enables identification of a gene signature of Plasmodium falciparum parasites infecting children. J Clin Invest 2011; 121:1119-29. [PMID: 21317536 DOI: 10.1172/jci43457] [Citation(s) in RCA: 55] [Impact Index Per Article: 4.2] [Received: 04/23/2010] [Accepted: 12/15/2010] [Indexed: 11/17/2022]
Abstract
Malaria caused by Plasmodium falciparum results in approximately 1 million annual deaths worldwide, with young children and pregnant mothers at highest risk. Disease severity might be related to parasite virulence factors, but expression profiling studies of parasites to test this hypothesis have been hindered by extensive sequence variation in putative virulence genes and a preponderance of host RNA in clinical samples. We report here the application of RNA sequencing to clinical isolates of P. falciparum, using not-so-random (NSR) primers to successfully exclude human ribosomal RNA and globin transcripts and enrich for parasite transcripts. Using NSR-seq, we confirmed earlier microarray studies showing upregulation of a distinct subset of genes in parasites infecting pregnant women, including that encoding the well-established pregnancy malaria vaccine candidate var2csa. We also describe a subset of parasite transcripts that distinguished parasites infecting children from those infecting pregnant women and confirmed this observation using quantitative real-time PCR and mass spectrometry proteomic analyses. Based on their putative functional properties, we propose that these proteins could have a role in childhood malaria pathogenesis. Our study provides proof of principle that NSR-seq represents an approach that can be used to study clinical isolates of parasites causing severe malaria syndromes as well as other blood-borne pathogens and blood-related diseases.
Affiliation(s)
- Marissa Vignali
  - Seattle Biomedical Research Institute, Seattle, Washington, USA
15
Sorber K, Dimon MT, DeRisi JL. RNA-Seq analysis of splicing in Plasmodium falciparum uncovers new splice junctions, alternative splicing and splicing of antisense transcripts. Nucleic Acids Res 2011; 39:3820-35. [PMID: 21245033 PMCID: PMC3089446 DOI: 10.1093/nar/gkq1223] [Citation(s) in RCA: 106] [Impact Index Per Article: 8.2] [Indexed: 12/24/2022]
Abstract
Over 50% of genes in Plasmodium falciparum, the deadliest human malaria parasite, contain predicted introns, yet experimental characterization of splicing in this organism remains incomplete. We present here a transcriptome-wide characterization of intraerythrocytic splicing events, as captured by RNA-Seq data from four timepoints of a single highly synchronous culture. Gene model-independent analysis of these data in conjunction with publicly available RNA-Seq data with HMMSplicer, an in-house developed splice site detection algorithm, revealed a total of 977 new 5' GU-AG 3' and 5 new 5' GC-AG 3' junctions absent from gene models and ESTs (11% increase to the current annotation). In addition, 310 alternative splicing events were detected in 254 (4.5%) genes, most of which truncate open reading frames. Splicing events antisense to gene models were also detected, revealing complex transcriptional arrangements within the parasite's transcriptome. Interestingly, antisense introns overlap sense introns more than would be expected by chance, perhaps indicating a functional relationship between overlapping transcripts or an inherent organizational property of the transcriptome. Independent experimental validation confirmed over 30 new antisense and alternative junctions. Thus, this largest assemblage of new and alternative splicing events to date in Plasmodium falciparum provides a more precise, dynamic view of the parasite's transcriptome.
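The junction classes named in this abstract (5' GU-AG 3' and 5' GC-AG 3') refer to the first and last two bases of an intron. As a minimal illustration, and not the method used in the paper, the sketch below labels an intron by its boundary dinucleotides. It assumes a forward-strand DNA sequence (so GU appears as GT) and 0-based, half-open intron coordinates; the function name and the toy sequence are hypothetical.

```python
def classify_junction(genome: str, intron_start: int, intron_end: int) -> str:
    """Label an intron by its donor/acceptor dinucleotides.

    Assumes forward-strand DNA and 0-based, half-open coordinates
    [intron_start, intron_end). Hypothetical helper for illustration only.
    """
    intron = genome[intron_start:intron_end].upper()
    if len(intron) < 4:
        return "too short"
    donor, acceptor = intron[:2], intron[-2:]
    if donor == "GT" and acceptor == "AG":
        return "GT-AG"  # written GU-AG in RNA terms
    if donor == "GC" and acceptor == "AG":
        return "GC-AG"
    return "non-canonical"


# Toy example: exon + intron + exon (sequence invented for illustration).
exon1, intron, exon2 = "ATGGCC", "GTAAGTTTTTCTAG", "GGATAA"
toy_genome = exon1 + intron + exon2
label = classify_junction(toy_genome, len(exon1), len(exon1) + len(intron))
```

Introns on the reverse strand would need to be reverse-complemented before this check is applied.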
Affiliation(s)
- Katherine Sorber
  - Department of Biochemistry and Biophysics, University of California San Francisco, San Francisco, CA, USA
16
Dimon MT, Sorber K, DeRisi JL. HMMSplicer: a tool for efficient and sensitive discovery of known and novel splice junctions in RNA-Seq data. PLoS One 2010; 5:e13875. [PMID: 21079731 PMCID: PMC2975632 DOI: 10.1371/journal.pone.0013875] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.4] [Received: 06/10/2010] [Accepted: 09/16/2010] [Indexed: 02/01/2023]
Abstract
Background: High-throughput sequencing of an organism's transcriptome, or RNA-Seq, is a valuable and versatile new strategy for capturing snapshots of gene expression. However, transcriptome sequencing creates a new class of alignment problem: mapping short reads that span exon-exon junctions back to the reference genome, especially in the case where a splice junction is previously unknown.
Methodology/Principal Findings: Here we introduce HMMSplicer, an accurate and efficient algorithm for discovering canonical and non-canonical splice junctions in short read datasets. HMMSplicer identifies more splice junctions than currently available algorithms when tested on publicly available A. thaliana, P. falciparum, and H. sapiens datasets without a reduction in specificity.
Conclusions/Significance: HMMSplicer was found to perform especially well in compact genomes and on genes with low expression levels, alternative splice isoforms, or non-canonical splice junctions. Because HMMSplicer does not rely on pre-built gene models, the products of inexact splicing are also detected. For H. sapiens, we find 3.6% of 3′ splice sites and 1.4% of 5′ splice sites are inexact, typically differing by 3 bases in either direction. In addition, HMMSplicer provides a score for every predicted junction, allowing the user to set a threshold to tune false positive rates depending on the needs of the experiment. HMMSplicer is implemented in Python. Code and documentation are freely available at http://derisilab.ucsf.edu/software/hmmsplicer.
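The abstract notes that every predicted junction carries a score and that the user picks a threshold to trade sensitivity against false positives. The sketch below shows that idea in its simplest form. The `Junction` record, its field names, and the score values are assumptions made for illustration; HMMSplicer's actual output format and score scale are not described here.

```python
from typing import List, NamedTuple


class Junction(NamedTuple):
    """Hypothetical record for one predicted splice junction."""
    chrom: str
    intron_start: int
    intron_end: int
    score: float  # assumed: higher means more confident


def filter_by_score(junctions: List[Junction], min_score: float) -> List[Junction]:
    """Keep only junctions whose score meets the chosen threshold."""
    return [j for j in junctions if j.score >= min_score]


# Invented example predictions.
predicted = [
    Junction("chr1", 1200, 1350, 0.95),
    Junction("chr1", 4000, 4100, 0.40),
    Junction("chr2", 150, 900, 0.72),
]

strict = filter_by_score(predicted, 0.9)   # fewer junctions, fewer false positives
lenient = filter_by_score(predicted, 0.5)  # more junctions, more false positives
```

Raising the threshold shrinks the result set, which is the tuning knob the abstract describes.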
Affiliation(s)
- Michelle T. Dimon
  - Department of Biochemistry and Biophysics, University of California San Francisco, San Francisco, California, United States of America
  - Biological and Medical Informatics Program, University of California San Francisco, San Francisco, California, United States of America
- Katherine Sorber
  - Department of Biochemistry and Biophysics, University of California San Francisco, San Francisco, California, United States of America
- Joseph L. DeRisi
  - Department of Biochemistry and Biophysics, University of California San Francisco, San Francisco, California, United States of America
  - Howard Hughes Medical Institute, Bethesda, Maryland, United States of America
17
A metagenomic analysis of pandemic influenza A (2009 H1N1) infection in patients from North America. PLoS One 2010; 5:e13381. [PMID: 20976137 PMCID: PMC2956640 DOI: 10.1371/journal.pone.0013381] [Citation(s) in RCA: 134] [Impact Index Per Article: 9.6] [Received: 07/29/2010] [Accepted: 09/21/2010] [Indexed: 12/13/2022]
Abstract
Although metagenomics has been previously employed for pathogen discovery, its cost and complexity have prevented its use as a practical front-line diagnostic for unknown infectious diseases. Here we demonstrate the utility of two metagenomics-based strategies, a pan-viral microarray (Virochip) and deep sequencing, for the identification and characterization of 2009 pandemic H1N1 influenza A virus. Using nasopharyngeal swabs collected during the earliest stages of the pandemic in Mexico, Canada, and the United States (n = 17), the Virochip was able to detect a novel virus most closely related to swine influenza viruses without a priori information. Deep sequencing yielded reads corresponding to 2009 H1N1 influenza in each sample (percentage of aligned sequences corresponding to 2009 H1N1 ranging from 0.0011% to 10.9%), with up to 97% coverage of the influenza genome in one sample. Detection of 2009 H1N1 by deep sequencing was possible even at titers near the limits of detection for specific RT-PCR, and the percentage of sequence reads was linearly correlated with virus titer. Deep sequencing also provided insights into the upper respiratory microbiota and host gene expression in response to 2009 H1N1 infection. An unbiased analysis combining sequence data from all 17 outbreak samples revealed that 90% of the 2009 H1N1 genome could be assembled de novo without the use of any reference sequence, including assembly of several near full-length genomic segments. These results indicate that a streamlined metagenomics detection strategy can potentially replace the multiple conventional diagnostic tests required to investigate an outbreak of a novel pathogen, and provide a blueprint for comprehensive diagnosis of unexplained acute illnesses or outbreaks in clinical and public health settings.
18
Chan SH, Stoddard BL, Xu SY. Natural and engineered nicking endonucleases--from cleavage mechanism to engineering of strand-specificity. Nucleic Acids Res 2010; 39:1-18. [PMID: 20805246 PMCID: PMC3017599 DOI: 10.1093/nar/gkq742] [Citation(s) in RCA: 98] [Impact Index Per Article: 7.0] [Indexed: 01/11/2023]
Abstract
Restriction endonucleases (REases) are highly specific DNA scissors that have facilitated the development of modern molecular biology. Intensive studies of double strand (ds) cleavage activity of Type IIP REases, which recognize 4–8 bp palindromic sequences, have revealed a variety of mechanisms of molecular recognition and catalysis. Less well-studied are REases which cleave only one of the strands of dsDNA, creating a nick instead of a ds break. Naturally occurring nicking endonucleases (NEases) range from frequent cutters such as Nt.CviPII (^CCD; ^ denotes the cleavage site) to rare-cutting homing endonucleases (HEases) such as I-HmuI. In addition to these bona fide NEases, individual subunits of some heterodimeric Type IIS REases have recently been shown to be natural NEases. The discovery and characterization of more REases that recognize asymmetric sequences, particularly Types IIS and IIA REases, has revealed recognition and cleavage mechanisms drastically different from the canonical Type IIP mechanisms, and has allowed researchers to engineer highly strand-specific NEases. Monomeric LAGLIDADG HEases use two separate catalytic sites for cleavage. Exploitation of this characteristic has also resulted in useful nicking HEases. This review aims at providing an overview of the cleavage mechanisms of Types IIS and IIA REases and LAGLIDADG HEases, the engineering of their nicking variants, and the applications of NEases and nicking HEases.
19
Paszkiewicz K, Studholme DJ. De novo assembly of short sequence reads. Brief Bioinform 2010; 11:457-72. [DOI: 10.1093/bib/bbq020] [Citation(s) in RCA: 134] [Impact Index Per Article: 9.6] [Indexed: 11/14/2022]
20
Rodrigue S, Materna AC, Timberlake SC, Blackburn MC, Malmstrom RR, Alm EJ, Chisholm SW. Unlocking short read sequencing for metagenomics. PLoS One 2010; 5:e11840. [PMID: 20676378 PMCID: PMC2911387 DOI: 10.1371/journal.pone.0011840] [Citation(s) in RCA: 125] [Impact Index Per Article: 8.9] [Received: 05/26/2010] [Accepted: 07/06/2010] [Indexed: 11/20/2022]
Abstract
Background: Different high-throughput nucleic acid sequencing platforms are currently available but a trade-off currently exists between the cost and number of reads that can be generated versus the read length that can be achieved.
Methodology/Principal Findings: We describe an experimental and computational pipeline yielding millions of reads that can exceed 200 bp with quality scores approaching that of traditional Sanger sequencing. The method combines an automatable gel-less library construction step with paired-end sequencing on a short-read instrument. With appropriately sized library inserts, mate-pair sequences can overlap, and we describe the SHERA software package that joins them to form a longer composite read.
Conclusions/Significance: This strategy is broadly applicable to sequencing applications that benefit from low-cost high-throughput sequencing, but require longer read lengths. We demonstrate that our approach enables metagenomic analyses using the Illumina Genome Analyzer, with low error rates, and at a fraction of the cost of pyrosequencing.
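The idea of joining overlapping mates into one composite read can be sketched as follows. This is a deliberately simple illustration and not SHERA's algorithm, which the abstract does not detail: it reverse-complements the second read, tries overlaps from longest to shortest, and accepts the first one whose mismatch fraction is within a tolerance. The function names, the `min_overlap` and `max_mismatch_frac` parameters, and the toy fragment are all assumptions; base quality scores are ignored.

```python
from typing import Optional

_COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A", "N": "N"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return "".join(_COMPLEMENT[base] for base in reversed(seq))


def merge_pair(read1: str, read2: str, min_overlap: int = 10,
               max_mismatch_frac: float = 0.1) -> Optional[str]:
    """Join a read pair at the longest acceptable suffix/prefix overlap.

    read2 is assumed to come from the opposite strand, so it is
    reverse-complemented first. Returns None if no overlap qualifies.
    """
    mate = reverse_complement(read2)
    for overlap in range(min(len(read1), len(mate)), min_overlap - 1, -1):
        tail, head = read1[-overlap:], mate[:overlap]
        mismatches = sum(1 for a, b in zip(tail, head) if a != b)
        if mismatches <= max_mismatch_frac * overlap:
            return read1 + mate[overlap:]
    return None


# Toy 28-bp fragment sequenced as two 20-bp reads that overlap by 12 bp.
fragment = "ACGTTGCAAGGCTTACGATCGGATCCAT"
forward_read = fragment[:20]
reverse_read = reverse_complement(fragment[8:])
composite = merge_pair(forward_read, reverse_read)
```

A production tool would also weigh base qualities when resolving disagreements inside the overlap; this sketch simply keeps the bases of the first read.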
Affiliation(s)
- Sébastien Rodrigue
  - Department of Civil and Environmental Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
- Arne C. Materna
  - Department of Civil and Environmental Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
- Sonia C. Timberlake
  - Department of Civil and Environmental Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
- Matthew C. Blackburn
  - Department of Civil and Environmental Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
- Rex R. Malmstrom
  - Department of Civil and Environmental Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
- Eric J. Alm
  - Department of Civil and Environmental Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
- Sallie W. Chisholm
  - Department of Civil and Environmental Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
21
Tang P, Chiu C. Metagenomics for the discovery of novel human viruses. Future Microbiol 2010; 5:177-89. [PMID: 20143943 DOI: 10.2217/fmb.09.120] [Citation(s) in RCA: 99] [Impact Index Per Article: 7.1] [Indexed: 12/11/2022]
Abstract
Modern laboratory techniques for the detection of novel human viruses are greatly needed as physicians and epidemiologists increasingly deal with infectious diseases caused by new or previously unrecognized pathogens. There are many clinical syndromes in which viruses are suspected to play a role, but for which traditional microbiology techniques routinely fail to uncover the etiologic agent. In addition, new viruses continue to challenge the human population owing to the encroachment of human settlements into animal and livestock habitats, globalization, climate change, growing numbers of immunocompromised people and bioterrorism. Metagenomics-based tools, such as microarrays and high-throughput sequencing, are ideal for responding to these challenges. Pan-viral microarrays, containing representative sequences from all known viruses, have been used to detect novel and distantly related variants of known viruses. Sequencing-based methods have also been successfully employed to detect novel viruses and have the potential to detect the full spectrum of viruses, including those present in low numbers.
Affiliation(s)
- Patrick Tang
  - British Columbia Centre for Disease Control, Department of Pathology & Laboratory Medicine, University of British Columbia, 655 West 12th Avenue, Vancouver, BC, V5Z 4R4, Canada.
22
Cultivation and serological characterization of a human Theiler's-like cardiovirus associated with diarrheal disease. J Virol 2010; 84:4407-14. [PMID: 20164225 DOI: 10.1128/jvi.02536-09] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.9] [Indexed: 11/20/2022]
Abstract
Cardioviruses (e.g., Theiler's murine encephalomyelitis virus [TMEV]) are members of the Picornaviridae family that cause myocarditis and encephalitis in rodents. Recently, several studies have identified human cardioviruses, including Saffold virus (SAFV) and a related virus named human TMEV-like cardiovirus (HTCV). At least eight cardiovirus genotypes are now recognized, with SAFV and most strains of HTCV belonging to genotypes 1 and 2, respectively; genotype 2 strains are the most common in the population. Although a genotype 3 cardiovirus has recently been cultured (SAFV-3), the genotype 1 and 2 cardioviruses have been difficult to propagate in vitro, hindering efforts to understand their seroprevalence and pathogenicity. Here we present the isolation and characterization of a genotype 2 human cardiovirus (HTCV-UC6). Notably, successful cultivation of HTCV-UC6 from stool required the addition of cytokine-blocking antibodies to interrupt downstream antiviral pathways. Unlike SAFV-3, HTCV-UC6 exhibited slow replication kinetics and demonstrated only a moderate cytopathic effect. Serologic assays revealed that 91% of U.S. adults carry antibodies to the genotype 2 cardioviruses, of which 80% generate neutralizing antibodies, in agreement with previous data showing that cardiovirus infection is widespread in humans. We also demonstrate an acute cardiovirus seroconversion event in a child with diarrhea and vomiting, thus reporting for the first time evidence linking cardiovirus infection to diarrheal disease in humans.
23
Otto TD, Wilinski D, Assefa S, Keane TM, Sarry LR, Böhme U, Lemieux J, Barrell B, Pain A, Berriman M, Newbold C, Llinás M. New insights into the blood-stage transcriptome of Plasmodium falciparum using RNA-Seq. Mol Microbiol 2010; 76:12-24. [PMID: 20141604 PMCID: PMC2859250 DOI: 10.1111/j.1365-2958.2009.07026.x] [Citation(s) in RCA: 289] [Impact Index Per Article: 20.6] [Indexed: 12/25/2022]
Abstract
Recent advances in high-throughput sequencing present a new opportunity to deeply probe an organism's transcriptome. In this study, we used Illumina-based massively parallel sequencing to gain new insight into the transcriptome (RNA-Seq) of the human malaria parasite, Plasmodium falciparum. Using data collected at seven time points during the intraerythrocytic developmental cycle, we (i) detect novel gene transcripts; (ii) correct hundreds of gene models; (iii) propose alternative splicing events; and (iv) predict 5' and 3' untranslated regions. Approximately 70% of the unique sequencing reads map to previously annotated protein-coding genes. The RNA-Seq results greatly improve existing annotation of the P. falciparum genome with over 10% of gene models modified. Our data confirm 75% of predicted splice sites and identify 202 new splice sites, including 84 previously uncharacterized alternative splicing events. We also discovered 107 novel transcripts and expression of 38 pseudogenes, with many demonstrating differential expression across the developmental time series. Our RNA-Seq results correlate well with DNA microarray analysis performed in parallel on the same samples, and provide improved resolution over the microarray-based method. These data reveal new features of the P. falciparum transcriptional landscape and significantly advance our understanding of the parasite's red blood cell-stage transcriptome.
Affiliation(s)
- Thomas D Otto
  - Parasite Genomics, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, UK