51
|
A general method to eliminate laboratory induced recombinants during massive, parallel sequencing of cDNA library. Virol J 2015; 12:55. [PMID: 25879746 PMCID: PMC4403950 DOI: 10.1186/s12985-015-0280-x] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2014] [Accepted: 03/16/2015] [Indexed: 01/09/2023] Open
Abstract
BACKGROUND Massive, parallel sequencing is a potent tool for dissecting the regulation of biological processes by revealing the dynamics of the cellular RNA profile under different conditions. Similarly, massive, parallel sequencing can be used to reveal the complexity of viral quasispecies that are often found in the RNA virus infected host. However, the production of cDNA libraries for next-generation sequencing (NGS) necessitates the reverse transcription of RNA into cDNA and the amplification of the cDNA template using PCR, which may introduce artefact in the form of phantom nucleic acids species that can bias the composition and interpretation of original RNA profiles. METHOD Using HIV as a model we have characterised the major sources of error during the conversion of viral RNA to cDNA, namely excess RNA template and the RNaseH activity of the polymerase enzyme, reverse transcriptase. In addition we have analysed the effect of PCR cycle on detection of recombinants and assessed the contribution of transfection of highly similar plasmid DNA to the formation of recombinant species during the production of our control viruses. RESULTS We have identified RNA template concentrations, RNaseH activity of reverse transcriptase, and PCR conditions as key parameters that must be carefully optimised to minimise chimeric artefacts. CONCLUSIONS Using our optimised RT-PCR conditions, in combination with our modified PCR amplification procedure, we have developed a reliable technique for accurate determination of RNA species using NGS technology.
Collapse
|
52
|
Schnell IB, Bohmann K, Gilbert MTP. Tag jumps illuminated--reducing sequence-to-sample misidentifications in metabarcoding studies. Mol Ecol Resour 2015; 15:1289-303. [PMID: 25740652 DOI: 10.1111/1755-0998.12402] [Citation(s) in RCA: 255] [Impact Index Per Article: 28.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2014] [Revised: 03/02/2015] [Accepted: 03/02/2015] [Indexed: 11/29/2022]
Abstract
Metabarcoding of environmental samples on second-generation sequencing platforms has rapidly become a valuable tool for ecological studies. A fundamental assumption of this approach is the reliance on being able to track tagged amplicons back to the samples from which they originated. In this study, we address the problem of sequences in metabarcoding sequencing outputs with false combinations of used tags (tag jumps). Unless these sequences can be identified and excluded from downstream analyses, tag jumps creating sequences with false, but already used tag combinations, can cause incorrect assignment of sequences to samples and artificially inflate diversity. In this study, we document and investigate tag jumping in metabarcoding studies on Illumina sequencing platforms by amplifying mixed-template extracts obtained from bat droppings and leech gut contents with tagged generic arthropod and mammal primers, respectively. We found that an average of 2.6% and 2.1% of sequences had tag combinations, which could be explained by tag jumping in the leech and bat diet study, respectively. We suggest that tag jumping can happen during blunt-ending of pools of tagged amplicons during library build and as a consequence of chimera formation during bulk amplification of tagged amplicons during library index PCR. We argue that tag jumping and contamination between libraries represents a considerable challenge for Illumina-based metabarcoding studies, and suggest measures to avoid false assignment of tag jumping-derived sequences to samples.
Collapse
Affiliation(s)
- Ida Baerholm Schnell
- Centre for GeoGenetics, Natural History Museum of Denmark, University of Copenhagen, 1350, Copenhagen K, Denmark.,Center for Zoo and Wild Animal Health, Copenhagen Zoo, 2000, Frederiksberg, Denmark
| | - Kristine Bohmann
- Centre for GeoGenetics, Natural History Museum of Denmark, University of Copenhagen, 1350, Copenhagen K, Denmark.,School of Biological Sciences, University of Bristol, Bristol, BS8 1UG, UK
| | - M Thomas P Gilbert
- Centre for GeoGenetics, Natural History Museum of Denmark, University of Copenhagen, 1350, Copenhagen K, Denmark.,Trace and Environmental DNA Laboratory, Department of Environment and Agriculture, Curtin University, Perth, Western Australia, 6102, Australia
| |
Collapse
|
53
|
Lührig K, Canbäck B, Paul CJ, Johansson T, Persson KM, Rådström P. Bacterial community analysis of drinking water biofilms in southern Sweden. Microbes Environ 2015; 30:99-107. [PMID: 25739379 PMCID: PMC4356470 DOI: 10.1264/jsme2.me14123] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
Next-generation sequencing of the V1-V2 and V3 variable regions of the 16S rRNA gene generated a total of 674,116 reads that described six distinct bacterial biofilm communities from both water meters and pipes. A high degree of reproducibility was demonstrated for the experimental and analytical work-flow by analyzing the communities present in parallel water meters, the rare occurrence of biological replicates within a working drinking water distribution system. The communities observed in water meters from households that did not complain about their drinking water were defined by sequences representing Proteobacteria (82-87%), with 22-40% of all sequences being classified as Sphingomonadaceae. However, a water meter biofilm community from a household with consumer reports of red water and flowing water containing elevated levels of iron and manganese had fewer sequences representing Proteobacteria (44%); only 0.6% of all sequences were classified as Sphingomonadaceae; and, in contrast to the other water meter communities, markedly more sequences represented Nitrospira and Pedomicrobium. The biofilm communities in pipes were distinct from those in water meters, and contained sequences that were identified as Mycobacterium, Nocardia, Desulfovibrio, and Sulfuricurvum. The approach employed in the present study resolved the bacterial diversity present in these biofilm communities as well as the differences that occurred in biofilms within a single distribution system, and suggests that next-generation sequencing of 16S rRNA amplicons can show changes in bacterial biofilm communities associated with different water qualities.
Collapse
Affiliation(s)
- Katharina Lührig
- Applied Microbiology, Department of Chemistry, Lund University, P.O. Box 124, SE-221 00 Lund, Sweden; Sydvatten AB, Hyllie Stationstorg 21, SE-215 32 Malmö, Sweden
| | | | | | | | | | | |
Collapse
|
54
|
CATCh, an ensemble classifier for chimera detection in 16S rRNA sequencing studies. Appl Environ Microbiol 2014; 81:1573-84. [PMID: 25527546 DOI: 10.1128/aem.02896-14] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023] Open
Abstract
In ecological studies, microbial diversity is nowadays mostly assessed via the detection of phylogenetic marker genes, such as 16S rRNA. However, PCR amplification of these marker genes produces a significant amount of artificial sequences, often referred to as chimeras. Different algorithms have been developed to remove these chimeras, but efforts to combine different methodologies are limited. Therefore, two machine learning classifiers (reference-based and de novo CATCh) were developed by integrating the output of existing chimera detection tools into a new, more powerful method. When comparing our classifiers with existing tools in either the reference-based or de novo mode, a higher performance of our ensemble method was observed on a wide range of sequencing data, including simulated, 454 pyrosequencing, and Illumina MiSeq data sets. Since our algorithm combines the advantages of different individual chimera detection tools, our approach produces more robust results when challenged with chimeric sequences having a low parent divergence, short length of the chimeric range, and various numbers of parents. Additionally, it could be shown that integrating CATCh in the preprocessing pipeline has a beneficial effect on the quality of the clustering in operational taxonomic units.
Collapse
|
55
|
Rawson JMO, Mansky LM. Retroviral vectors for analysis of viral mutagenesis and recombination. Viruses 2014; 6:3612-42. [PMID: 25254386 PMCID: PMC4189041 DOI: 10.3390/v6093612] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2014] [Revised: 09/15/2014] [Accepted: 09/17/2014] [Indexed: 12/29/2022] Open
Abstract
Retrovirus population diversity within infected hosts is commonly high due in part to elevated rates of replication, mutation, and recombination. This high genetic diversity often complicates the development of effective diagnostics, vaccines, and antiviral drugs. This review highlights the diverse vectors and approaches that have been used to examine mutation and recombination in retroviruses. Retroviral vectors for these purposes can broadly be divided into two categories: those that utilize reporter genes as mutation or recombination targets and those that utilize viral genes as targets of mutation or recombination. Reporter gene vectors greatly facilitate the detection, quantification, and characterization of mutants and/or recombinants, but may not fully recapitulate the patterns of mutagenesis or recombination observed in native viral gene sequences. In contrast, the detection of mutations or recombination events directly in viral genes is more biologically relevant but also typically more challenging and inefficient. We will highlight the advantages and disadvantages of the various vectors and approaches used as well as propose ways in which they could be improved.
Collapse
Affiliation(s)
- Jonathan M O Rawson
- Institute for Molecular Virology, University of Minnesota, Moos Tower 18-242, 515 Delaware St SE, Minneapolis, MN 55455, USA.
| | - Louis M Mansky
- Institute for Molecular Virology, University of Minnesota, Moos Tower 18-242, 515 Delaware St SE, Minneapolis, MN 55455, USA.
| |
Collapse
|
56
|
Fifteen to twenty percent of HIV substitution mutations are associated with recombination. J Virol 2014; 88:3837-49. [PMID: 24453357 DOI: 10.1128/jvi.03136-13] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
HIV undergoes high rates of mutation and recombination during reverse transcription, but it is not known whether these events occur independently or are linked mechanistically. Here we used a system of silent marker mutations in HIV and a single round of infection in primary T lymphocytes combined with a high-throughput sequencing and mathematical modeling approach to directly estimate the viral recombination and mutation rates. From >7 million nucleotides (nt) of sequences from HIV infection, we observed 4,801 recombination events and 859 substitution mutations (≈1.51 and 0.12 events per 1,000 nt, respectively). We used experimental controls to account for PCR-induced and transfection-induced recombination and sequencing error. We found that the single-cycle virus-induced mutation rate is 4.6 × 10(-5) mutations per nt after correction. By sorting of our data into recombined and nonrecombined sequences, we found a significantly higher mutation rate in recombined regions (P = 0.003 by Fisher's exact test). We used a permutation approach to eliminate a number of potential confounding factors and confirm that mutation occurs around the site of recombination and is not simply colocated in the genome. By comparing mutation rates in recombined and nonrecombined regions, we found that recombination-associated mutations account for 15 to 20% of all mutations occurring during reverse transcription.
Collapse
|
57
|
Abstract
UNLABELLED HIV-1 infection is characterized by the rapid generation of genetic diversity that facilitates viral escape from immune selection and antiretroviral therapy. Despite recombination's crucial role in viral diversity and evolution, little is known about the genomic factors that influence recombination between highly similar genomes. In this study, we use a minimally modified full-length HIV-1 genome and high-throughput sequence analysis to study recombination in gag and pol in T cells. We find that recombination is favored at a number of recombination hot spots, where recombination occurs six times more frequently than at corresponding cold spots. Interestingly, these hot spots occur near important features of the HIV-1 genome but do not occur at sites immediately around protease inhibitor or reverse transcriptase inhibitor drug resistance mutations. We show that the recombination hot and cold spots are consistent across five blood donors and are independent of coreceptor-mediated entry. Finally, we check common experimental confounders and find that these are not driving the location of recombination hot spots. This is the first study to identify the location of recombination hot spots between two similar viral genomes with great statistical power and under conditions that closely reflect natural recombination events among HIV-1 quasispecies. IMPORTANCE The ability of HIV-1 to evade the immune system and antiretroviral therapy depends on genetic diversity within the viral quasispecies. Retroviral recombination is an important mechanism that helps to generate and maintain this genetic diversity, but little is known about how recombination rates vary within the HIV-1 genome. We measured recombination rates in gag and pol and identified recombination hot and cold spots, demonstrating that recombination is not random but depends on the underlying gene sequence. The strength and location of these recombination hot and cold spots can be used to improve models of viral dynamics and evolution, which will be useful for the design of robust antiretroviral therapies.
Collapse
|
58
|
Di Giallonardo F, Zagordi O, Duport Y, Leemann C, Joos B, Künzli-Gontarczyk M, Bruggmann R, Beerenwinkel N, Günthard HF, Metzner KJ. Next-generation sequencing of HIV-1 RNA genomes: determination of error rates and minimizing artificial recombination. PLoS One 2013; 8:e74249. [PMID: 24058534 PMCID: PMC3776835 DOI: 10.1371/journal.pone.0074249] [Citation(s) in RCA: 50] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2013] [Accepted: 07/31/2013] [Indexed: 12/28/2022] Open
Abstract
Next-generation sequencing (NGS) is a valuable tool for the detection and quantification of HIV-1 variants in vivo. However, these technologies require detailed characterization and control of artificially induced errors to be applicable for accurate haplotype reconstruction. To investigate the occurrence of substitutions, insertions, and deletions at the individual steps of RT-PCR and NGS, 454 pyrosequencing was performed on amplified and non-amplified HIV-1 genomes. Artificial recombination was explored by mixing five different HIV-1 clonal strains (5-virus-mix) and applying different RT-PCR conditions followed by 454 pyrosequencing. Error rates ranged from 0.04-0.66% and were similar in amplified and non-amplified samples. Discrepancies were observed between forward and reverse reads, indicating that most errors were introduced during the pyrosequencing step. Using the 5-virus-mix, non-optimized, standard RT-PCR conditions introduced artificial recombinants in a fraction of at least 30% of the reads that subsequently led to an underestimation of true haplotype frequencies. We minimized the fraction of recombinants down to 0.9-2.6% by optimized, artifact-reducing RT-PCR conditions. This approach enabled correct haplotype reconstruction and frequency estimations consistent with reference data obtained by single genome amplification. RT-PCR conditions are crucial for correct frequency estimation and analysis of haplotypes in heterogeneous virus populations. We developed an RT-PCR procedure to generate NGS data useful for reliable haplotype reconstruction and quantification.
Collapse
Affiliation(s)
- Francesca Di Giallonardo
- Division of Infectious Diseases and Hospital Epidemiology, University Hospital Zurich, University of Zurich, Zurich, Switzerland
| | - Osvaldo Zagordi
- Department of Biosystems Sciences and Engineering, ETH Zurich, Basel, Switzerland
- Institute of Medical Virology, University of Zurich, Zurich, Switzerland
| | - Yannick Duport
- Division of Infectious Diseases and Hospital Epidemiology, University Hospital Zurich, University of Zurich, Zurich, Switzerland
| | - Christine Leemann
- Division of Infectious Diseases and Hospital Epidemiology, University Hospital Zurich, University of Zurich, Zurich, Switzerland
| | - Beda Joos
- Division of Infectious Diseases and Hospital Epidemiology, University Hospital Zurich, University of Zurich, Zurich, Switzerland
| | | | - Rémy Bruggmann
- Functional Genomics Center Zurich (FGCZ), University of Zurich, ETH Zurich, Zurich, Switzerland
| | - Niko Beerenwinkel
- Department of Biosystems Sciences and Engineering, ETH Zurich, Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Huldrych F. Günthard
- Division of Infectious Diseases and Hospital Epidemiology, University Hospital Zurich, University of Zurich, Zurich, Switzerland
| | - Karin J. Metzner
- Division of Infectious Diseases and Hospital Epidemiology, University Hospital Zurich, University of Zurich, Zurich, Switzerland
- * E-mail:
| |
Collapse
|
59
|
Bourret V, Croville G, Mariette J, Klopp C, Bouchez O, Tiley L, Guérin JL. Whole-genome, deep pyrosequencing analysis of a duck influenza A virus evolution in swine cells. INFECTION GENETICS AND EVOLUTION 2013; 18:31-41. [PMID: 23660486 DOI: 10.1016/j.meegid.2013.04.034] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/30/2012] [Revised: 03/14/2013] [Accepted: 04/29/2013] [Indexed: 01/21/2023]
Abstract
We studied the sub-population level evolution of a duck influenza A virus isolate during passage in swine tracheal cells. The complete genomes of the A/mallard/Netherlands/10-Nmkt/1999 strain and its swine cell-passaged descendent were analysed by 454 pyrosequencing with coverage depth ranging from several hundred to several thousand reads at any point. This allowed characterization of defined minority sub-populations of gene segments 2, 3, 4, 5, 7, and 8 present in the original isolate. These minority sub-populations ranged between 9.5% (for segment 2) and 46% (for segment 4) of their respective gene segments in the parental stock. They were likely contributed by one or more viruses circulating within the same area, at the same period and in the same or a sympatric host species. The minority sub-populations of segments 3, 4, and 5 became extinct upon viral passage in swine cells, whereas the minority sub-populations of segments 2, 7 and 8 completely replaced their majority counterparts. The swine cell-passaged virus was therefore a three-segment reassortant and also harboured point mutations in segments 3 and 4. The passaged virus was more homogenous than the parental stock, with only 17 minority single nucleotide polymorphisms present above 5% frequency across the whole genome. Though limited here to one sample, this deep sequencing approach highlights the evolutionary versatility of influenza viruses whereby they exploit their genetic diversity, predilection for mixed infection and reassortment to adapt to a new host environmental niche.
Collapse
Affiliation(s)
- Vincent Bourret
- Department of Veterinary Medicine, University of Cambridge, Madingley Road, Cambridge CB3 0ES, UK.
| | | | | | | | | | | | | |
Collapse
|
60
|
Lewis ZT, Bokulich NA, Kalanetra KM, Ruiz-Moyano S, Underwood MA, Mills DA. Use of bifidobacterial specific terminal restriction fragment length polymorphisms to complement next generation sequence profiling of infant gut communities. Anaerobe 2013; 19:62-9. [PMID: 23261904 PMCID: PMC4764513 DOI: 10.1016/j.anaerobe.2012.12.005] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2012] [Accepted: 12/05/2012] [Indexed: 02/06/2023]
Abstract
Bifidobacteria are intestinal anaerobes often associated with gut health. Specific bifidobacterial species are particularly common in the gastrointestinal tract of breast-fed infants. Current short read next-generation sequencing approaches to profile fecal microbial ecologies do not discriminate bifidobacteria to the species level. Here we describe a low-cost terminal restriction fragment length polymorphism (TRFLP) procedure to distinguish between the common infant-associated bifidobacterial species. An empirical database of TRF sizes was created from both common reference strains and well-identified isolates from infant feces. Species-specific quantitative PCR validated bifidobacterial-specific TRFLP profiles from infant feces. These results indicate that bifidobacterial-specific TRFLP is a useful method to monitor intestinal bifidobacterial populations from infant fecal samples. When used alongside next generation sequencing methods that detect broader population levels at lower resolution, this high-throughput, low-cost tool can help clarify the role of bifidobacteria in health and disease.
Collapse
Affiliation(s)
- Zachery T Lewis
- Department of Viticulture and Enology, University of California, Davis, CA 95616, USA
| | | | | | | | | | | |
Collapse
|
61
|
Grueber CE, Wallis GP, King TM, Jamieson IG. Variation at innate immunity Toll-like receptor genes in a bottlenecked population of a New Zealand robin. PLoS One 2012; 7:e45011. [PMID: 23024782 PMCID: PMC3443209 DOI: 10.1371/journal.pone.0045011] [Citation(s) in RCA: 57] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2012] [Accepted: 08/16/2012] [Indexed: 11/22/2022] Open
Abstract
Toll-like receptors (TLRs) are an ancient family of genes encoding transmembrane proteins that bind pathogen-specific molecules and initiate both innate and adaptive aspects of the immune response. Our goal was to determine whether these genes show sufficient genetic diversity in a bottlenecked population to be a useful addition or alternative to the more commonly employed major histocompatibility complex (MHC) genotyping in a conservation genetics context. We amplified all known avian TLR genes in a severely bottlenecked population of New Zealand's Stewart Island robin (Petroica australis rakiura), for which reduced microsatellite diversity was previously observed. We genotyped 17-24 birds from a reintroduced island population (including the 12 founders) for nine genes, seven of which were polymorphic. We observed a total of 24 single-nucleotide polymorphisms overall, 15 of which were non-synonymous, representing up to five amino-acid variants at a locus. One locus (TLR1LB) showed evidence of past directional selection. Results also confirmed a passerine duplication of TLR7. The levels of TLR diversity that we observe are sufficient to justify their further use in addressing conservation genetic questions, even in bottlenecked populations.
Collapse
|
62
|
Application of targeted metagenomics to explore abundance and diversity of CO₂-fixing bacterial community using cbbL gene from the rhizosphere of Arachis hypogaea. Gene 2012; 506:18-24. [PMID: 22766402 DOI: 10.1016/j.gene.2012.06.083] [Citation(s) in RCA: 68] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2012] [Accepted: 06/25/2012] [Indexed: 11/23/2022]
Abstract
Sequestration of CO(2) by autotrophic bacteria is a key process of biogeochemical carbon cycling in soil ecosystem. Rhizosphere is a rich niche of microbial activity and diversity, influenced by change in atmospheric CO(2). Structural changes in rhizosphere composition influence microbial communities and the nutrient cycling. In the present study, the bacterial diversity and population dynamics were established using cbbL and 16S rRNA gene targeted metagenomics approach from the rhizosphere of Arachis hypogaea. A total of 108 cbbL clones were obtained from the rhizospheric soil which revealed predominance of cbbL sequences affiliated to Rhizobium leguminosarum, Bradyrhizobium sp., Sinorhizobium meliloti, Ochrobactrum anthropi and a variety of uncultured cbbL harboring bacteria. The 16S rRNA gene clone library exhibited the dominance of Firmicutes (34.4%), Proteobacteria (18.3%), Actinobacteria (17.2%) and Bacteroidetes (16.1%). About 43% nucleotide sequences of 16S rRNA gene clone library were novel genera which showed <95% homology with published sequences. Gene copy number of cbbL and 16S rRNA genes, determined by quantitative real-time PCR (qRT PCR), was 9.38 ± 0.75 × 10(7) and 5.43 ± 0.79 × 10(8) (per g dry soil), respectively. The results exhibited bacterial community structure with high bacterial diversity and abundance of CO(2)-fixing bacteria, which can be explored further for their role in carbon cycling, sustainable agriculture and environment management.
Collapse
|
63
|
Smyth RP, Davenport MP, Mak J. The origin of genetic diversity in HIV-1. Virus Res 2012; 169:415-29. [PMID: 22728444 DOI: 10.1016/j.virusres.2012.06.015] [Citation(s) in RCA: 71] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2012] [Revised: 06/10/2012] [Accepted: 06/12/2012] [Indexed: 10/28/2022]
Abstract
One of the hallmarks of HIV infection is the rapid development of a genetically complex population (quasispecies) from an initially limited number of infectious particles. Genetic diversity remains one of the major obstacles to eradication of HIV. The viral quasispecies can respond rapidly to selective pressures, such as that imposed by the immune system and antiretroviral therapy, and frustrates vaccine design efforts. Two unique features of retroviral replication are responsible for the unprecedented variation generated during infection. First, mutations are frequently introduced into the viral genome by the error prone viral reverse transcriptase and through the actions of host cellular factors, such as the APOBEC family of nucleic acid editing enzymes. Second, the HIV reverse transcriptase can utilize both copies of the co-packaged viral genome in a process termed retroviral recombination. When the co-packaged viral genomes are genetically different, retroviral recombination can lead to the shuffling of mutations between viral genomes in the quasispecies. This review outlines the stages of the retroviral life cycle where genetic variation is introduced, focusing on the principal mechanisms of mutation and recombination. Understanding the mechanistic origin of genetic diversity is essential to combating HIV.
Collapse
Affiliation(s)
- Redmond P Smyth
- Centre for Virology, Burnet Institute, 85 Commercial Road, Melbourne, Victoria 3004, Australia
| | | | | |
Collapse
|
64
|
Henn MR, Boutwell CL, Charlebois P, Lennon NJ, Power KA, Macalalad AR, Berlin AM, Malboeuf CM, Ryan EM, Gnerre S, Zody MC, Erlich RL, Green LM, Berical A, Wang Y, Casali M, Streeck H, Bloom AK, Dudek T, Tully D, Newman R, Axten KL, Gladden AD, Battis L, Kemper M, Zeng Q, Shea TP, Gujja S, Zedlack C, Gasser O, Brander C, Hess C, Günthard HF, Brumme ZL, Brumme CJ, Bazner S, Rychert J, Tinsley JP, Mayer KH, Rosenberg E, Pereyra F, Levin JZ, Young SK, Jessen H, Altfeld M, Birren BW, Walker BD, Allen TM. Whole genome deep sequencing of HIV-1 reveals the impact of early minor variants upon immune recognition during acute infection. PLoS Pathog 2012; 8:e1002529. [PMID: 22412369 PMCID: PMC3297584 DOI: 10.1371/journal.ppat.1002529] [Citation(s) in RCA: 287] [Impact Index Per Article: 23.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2011] [Accepted: 12/27/2011] [Indexed: 12/20/2022] Open
Abstract
Deep sequencing technologies have the potential to transform the study of highly variable viral pathogens by providing a rapid and cost-effective approach to sensitively characterize rapidly evolving viral quasispecies. Here, we report on a high-throughput whole HIV-1 genome deep sequencing platform that combines 454 pyrosequencing with novel assembly and variant detection algorithms. In one subject we combined these genetic data with detailed immunological analyses to comprehensively evaluate viral evolution and immune escape during the acute phase of HIV-1 infection. The majority of early, low frequency mutations represented viral adaptation to host CD8+ T cell responses, evidence of strong immune selection pressure occurring during the early decline from peak viremia. CD8+ T cell responses capable of recognizing these low frequency escape variants coincided with the selection and evolution of more effective secondary HLA-anchor escape mutations. Frequent, and in some cases rapid, reversion of transmitted mutations was also observed across the viral genome. When located within restricted CD8 epitopes these low frequency reverting mutations were sufficient to prime de novo responses to these epitopes, again illustrating the capacity of the immune response to recognize and respond to low frequency variants. More importantly, rapid viral escape from the most immunodominant CD8+ T cell responses coincided with plateauing of the initial viral load decline in this subject, suggestive of a potential link between maintenance of effective, dominant CD8 responses and the degree of early viremia reduction. We conclude that the early control of HIV-1 replication by immunodominant CD8+ T cell responses may be substantially influenced by rapid, low frequency viral adaptations not detected by conventional sequencing approaches, which warrants further investigation. These data support the critical need for vaccine-induced CD8+ T cell responses to target more highly constrained regions of the virus in order to ensure the maintenance of immunodominant CD8 responses and the sustained decline of early viremia. The ability of HIV-1 and other highly variable pathogens to rapidly mutate to escape vaccine-induced immune responses represents a major hurdle to the development of effective vaccines to these highly persistent pathogens. Application of next-generation or deep sequencing technologies to the study of host pathogens could significantly improve our understanding of the mechanisms by which these pathogens subvert host immunity, and aid in the development of novel vaccines and therapeutics. Here, we developed a 454 deep sequencing approach to enable the sensitive detection of low-frequency viral variants across the entire HIV-1 genome. When applied to the acute phase of HIV-1 infection we observed that the majority of early, low frequency mutations represented viral adaptations to host cellular immune responses, evidence of strong host immunity developing during the early decline of peak viral load. Rapid viral escape from the most dominant immune responses however correlated with loss of this initial viral control, suggestive of the importance of mounting immune responses against more conserved regions of the virus. These data provide a greater understanding of the early evolutionary events subverting the ability of host immune responses to control early HIV-1 replication, yielding important insight into the design of more effective vaccine strategies.
Collapse
Affiliation(s)
- Matthew R. Henn
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | - Christian L. Boutwell
- Ragon Institute of MGH, MIT and Harvard, Boston, Massachusetts, United States of America
| | - Patrick Charlebois
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | - Niall J. Lennon
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | - Karen A. Power
- Ragon Institute of MGH, MIT and Harvard, Boston, Massachusetts, United States of America
| | | | - Aaron M. Berlin
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | - Christine M. Malboeuf
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | - Elizabeth M. Ryan
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | - Sante Gnerre
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | - Michael C. Zody
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | - Rachel L. Erlich
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | - Lisa M. Green
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | - Andrew Berical
- Ragon Institute of MGH, MIT and Harvard, Boston, Massachusetts, United States of America
| | - Yaoyu Wang
- Ragon Institute of MGH, MIT and Harvard, Boston, Massachusetts, United States of America
| | - Monica Casali
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | - Hendrik Streeck
- Ragon Institute of MGH, MIT and Harvard, Boston, Massachusetts, United States of America
| | - Allyson K. Bloom
- Ragon Institute of MGH, MIT and Harvard, Boston, Massachusetts, United States of America
| | - Tim Dudek
- Ragon Institute of MGH, MIT and Harvard, Boston, Massachusetts, United States of America
| | - Damien Tully
- Ragon Institute of MGH, MIT and Harvard, Boston, Massachusetts, United States of America
| | - Ruchi Newman
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | - Karen L. Axten
- Ragon Institute of MGH, MIT and Harvard, Boston, Massachusetts, United States of America
| | - Adrianne D. Gladden
- Ragon Institute of MGH, MIT and Harvard, Boston, Massachusetts, United States of America
| | - Laura Battis
- Ragon Institute of MGH, MIT and Harvard, Boston, Massachusetts, United States of America
| | - Michael Kemper
- Ragon Institute of MGH, MIT and Harvard, Boston, Massachusetts, United States of America
| | - Qiandong Zeng
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | - Terrance P. Shea
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | - Sharvari Gujja
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | | | - Olivier Gasser
- Immunobiology Lab, Department of Biomedicine, University Hospital Basel, Basel, Switzerland
| | - Christian Brander
- Institució Catalana de Recerca i Estudis Avançats (ICREA), Barcelona, Spain
- Irsicaixa AIDS Research Institute-HIVACAT, Hospital University Germans Trias I Pujol, Badalona, Spain
| | - Christoph Hess
- Immunobiology Lab, Department of Biomedicine, University Hospital Basel, Basel, Switzerland
| | - Huldrych F. Günthard
- Division of Infectious Diseases and Hospital Epidemiology, University Hospital Zurich, University of Zurich, Switzerland
| | - Zabrina L. Brumme
- Ragon Institute of MGH, MIT and Harvard, Boston, Massachusetts, United States of America
| | - Chanson J. Brumme
- Ragon Institute of MGH, MIT and Harvard, Boston, Massachusetts, United States of America
| | - Suzane Bazner
- Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts, United States of America
| | - Jenna Rychert
- Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts, United States of America
| | - Jake P. Tinsley
- The Fenway Institute, Fenway Health, Boston, Massachusetts, United States of America
| | - Ken H. Mayer
- The Fenway Institute, Fenway Health, Boston, Massachusetts, United States of America
| | - Eric Rosenberg
- Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts, United States of America
| | - Florencia Pereyra
- Ragon Institute of MGH, MIT and Harvard, Boston, Massachusetts, United States of America
| | - Joshua Z. Levin
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | - Sarah K. Young
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | | | - Marcus Altfeld
- Ragon Institute of MGH, MIT and Harvard, Boston, Massachusetts, United States of America
| | - Bruce W. Birren
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | - Bruce D. Walker
- Ragon Institute of MGH, MIT and Harvard, Boston, Massachusetts, United States of America
- Howard Hughes Medical Institute, Chevy Chase, Maryland, United States of America
| | - Todd M. Allen
- Ragon Institute of MGH, MIT and Harvard, Boston, Massachusetts, United States of America
- * E-mail:
| |
Collapse
|
65
|
Fonseca VG, Nichols B, Lallias D, Quince C, Carvalho GR, Power DM, Creer S. Sample richness and genetic diversity as drivers of chimera formation in nSSU metagenetic analyses. Nucleic Acids Res 2012; 40:e66. [PMID: 22278883 PMCID: PMC3351157 DOI: 10.1093/nar/gks002] [Citation(s) in RCA: 66] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Eukaryotic diversity in environmental samples is often assessed via PCR-based amplification of nSSU genes. However, estimates of diversity derived from pyrosequencing environmental data sets are often inflated, mainly because of the formation of chimeric sequences during PCR amplification. Chimeras are hybrid products composed of distinct parental sequences that can lead to the misinterpretation of diversity estimates. We have analyzed the effect of sample richness, evenness and phylogenetic diversity on the formation of chimeras using a nSSU data set derived from 454 Roche pyrosequencing of replicated, large control pools of closely and distantly related nematode mock communities, of known intragenomic identity and richness. To further investigate how chimeric molecules are formed, the nSSU gene secondary structure was analyzed in several individuals. For the first time in eukaryotes, chimera formation proved to be higher in both richer and more genetically diverse samples, thus providing a novel perspective of chimera formation in pyrosequenced environmental data sets. Findings contribute to a better understanding of the nature and mechanisms involved in chimera formation during PCR amplification of environmentally derived DNA. Moreover, given the similarities between biodiversity analyses using amplicon sequencing and those used to assess genomic variation, our findings have potential broad application for identifying genetic variation in homologous loci or multigene families in general.
Collapse
Affiliation(s)
- V G Fonseca
- Molecular Ecology and Fisheries Genetics Laboratory, School of Biological Sciences, Environment Centre Wales, Bangor University, Deiniol Road, Gwynedd LL57 2UW, UK
| | | | | | | | | | | | | |
Collapse
|
66
|
Sequencing our way towards understanding global eukaryotic biodiversity. Trends Ecol Evol 2012; 27:233-43. [PMID: 22244672 DOI: 10.1016/j.tree.2011.11.010] [Citation(s) in RCA: 236] [Impact Index Per Article: 19.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2011] [Revised: 11/23/2011] [Accepted: 11/24/2011] [Indexed: 02/01/2023]
Abstract
Microscopic eukaryotes are abundant, diverse and fill critical ecological roles across every ecosystem on Earth, yet there is a well-recognized gap in understanding of their global biodiversity. Fundamental advances in DNA sequencing and bioinformatics now allow accurate en masse biodiversity assessments of microscopic eukaryotes from environmental samples. Despite a promising outlook, the field of eukaryotic marker gene surveys faces significant challenges: how to generate data that are most useful to the community, especially in the face of evolving sequencing technologies and bioinformatics pipelines, and how to incorporate an expanding number of target genes.
Collapse
|
67
|
Gupta AK, Harish, Rai MK, Phulwaria M, Shekhawat NS. Isolation of genomic DNA suitable for community analysis from mature trees adapted to arid environment. Gene 2011; 487:156-9. [PMID: 21827837 DOI: 10.1016/j.gene.2011.06.029] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2011] [Revised: 06/20/2011] [Accepted: 06/24/2011] [Indexed: 10/18/2022]
Abstract
Isolation of intact and pure genomic DNA (gDNA) is essential for many molecular biology applications. It is difficult to isolate pure DNA from mature trees of hot and dry desert regions because of the accumulation of high level of polysaccharides, phenolic compounds, tannins etc. We hereby report the standardized protocol for the isolation and purification of gDNA from seven ecologically and medically important tree species of Combretaceae viz. Anogeissus (Anogeissus sericea var. nummularia, Anogeissus pendula, and Anogeissus latifolia) and Terminalia (Terminalia arjuna, Terminalia bellirica, Terminalia catappa and Terminalia chebula). This method involves (i) washing the sample twice with Triton buffer (2%) then (ii) isolation of gDNA by modified-CTAB (cetyl trimethyl ammonium bromide) method employing a high concentration (4%) of PVP (Polyvinylpyrrolidone) and 50mM ascorbic acid, and (iii) purification of this CTAB-isolated gDNA by spin-column. gDNA isolated by modified CTAB or spin-column alone were not found suitable for PCR amplification. The Triton washing step is also critical. The quality of DNA was determined by the A(260)/A(280) absorbance ratio. gDNA was also observed for its intactness by running on 0.8% agarose gel. The suitability of extracted DNA for PCR was tested by amplification with RAPD primers, which was successful. Further, rbcLa (barcoding gene) was amplified and sequenced to check the quality of extracted gDNA for its downstream applications.
Collapse
Affiliation(s)
- Amit Kumar Gupta
- Biotechnology Unit, Department of Botany, Jai Narain Vyas University, Jodhpur, India
| | | | | | | | | |
Collapse
|