1
|
Abstract
One of the challenges of genetic data analysis is to combine information from sources that are distributed around the world and accessible through a wide array of different methods and interfaces. The HIV database and its footsteps, the hepatitis C virus (HCV) and hemorrhagic fever virus (HFV) databases, have made it their mission to make different data types easily available to their users. This involves a large amount of behind-the-scenes processing, including quality control and analysis of the sequences and their annotation. Gene and protein sequences are distilled from the sequences that are stored in GenBank; to this end, both submitter annotation and script-generated sequences are used. Alignments of both nucleotide and amino acid sequences are generated, manually curated, distilled into an alignment model, and regenerated in an iterative cycle that results in ever better new alignments. Annotation of epidemiological and clinical information is parsed, checked, and added to the database. User interfaces are updated, and new interfaces are added based upon user requests. Vital for its success, the database staff are heavy users of the system, which enables them to fix bugs and find opportunities for improvement. In this chapter we describe some of the infrastructure that keeps these heavily used analysis platforms alive and vital after nearly 25 years of use. The database/analysis platforms described in this chapter can be accessed at http://hiv.lanl.gov http://hcv.lanl.gov http://hfv.lanl.gov.
Collapse
Affiliation(s)
- Carla Kuiken
- Los Alamos National Laboratory, Theoretical Biology and Biophysics (MS K710), Los Alamos, NM, USA.
| | | | | | | | | | | |
Collapse
|
2
|
Lee HY, Giorgi EE, Keele BF, Gaschen B, Athreya GS, Salazar-Gonzalez JF, Pham KT, Goepfert PA, Kilby JM, Saag MS, Delwart EL, Busch MP, Hahn BH, Shaw GM, Korber BT, Bhattacharya T, Perelson AS. Corrigendum to “Modeling sequence evolution in acute HIV-1 infection” [J. Theor. Biol. 261 (2009) 341–360]. J Theor Biol 2012. [DOI: 10.1016/j.jtbi.2011.12.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/14/2022]
|
3
|
Gnanakaran S, Bhattacharya T, Daniels M, Keele BF, Hraber PT, Lapedes AS, Shen T, Gaschen B, Krishnamoorthy M, Li H, Decker JM, Salazar-Gonzalez JF, Wang S, Jiang C, Gao F, Swanstrom R, Anderson JA, Ping LH, Cohen MS, Markowitz M, Goepfert PA, Saag MS, Eron JJ, Hicks CB, Blattner WA, Tomaras GD, Asmal M, Letvin NL, Gilbert PB, DeCamp AC, Magaret CA, Schief WR, Ban YEA, Zhang M, Soderberg KA, Sodroski JG, Haynes BF, Shaw GM, Hahn BH, Korber B. Recurrent signature patterns in HIV-1 B clade envelope glycoproteins associated with either early or chronic infections. PLoS Pathog 2011; 7:e1002209. [PMID: 21980282 PMCID: PMC3182927 DOI: 10.1371/journal.ppat.1002209] [Citation(s) in RCA: 113] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2010] [Accepted: 06/26/2011] [Indexed: 12/15/2022] Open
Abstract
Here we have identified HIV-1 B clade Envelope (Env) amino acid signatures from early in infection that may be favored at transmission, as well as patterns of recurrent mutation in chronic infection that may reflect common pathways of immune evasion. To accomplish this, we compared thousands of sequences derived by single genome amplification from several hundred individuals that were sampled either early in infection or were chronically infected. Samples were divided at the outset into hypothesis-forming and validation sets, and we used phylogenetically corrected statistical strategies to identify signatures, systematically scanning all of Env. Signatures included single amino acids, glycosylation motifs, and multi-site patterns based on functional or structural groupings of amino acids. We identified signatures near the CCR5 co-receptor-binding region, near the CD4 binding site, and in the signal peptide and cytoplasmic domain, which may influence Env expression and processing. Two signatures patterns associated with transmission were particularly interesting. The first was the most statistically robust signature, located in position 12 in the signal peptide. The second was the loss of an N-linked glycosylation site at positions 413-415; the presence of this site has been recently found to be associated with escape from potent and broad neutralizing antibodies, consistent with enabling a common pathway for immune escape during chronic infection. Its recurrent loss in early infection suggests it may impact fitness at the time of transmission or during early viral expansion. The signature patterns we identified implicate Env expression levels in selection at viral transmission or in early expansion, and suggest that immune evasion patterns that recur in many individuals during chronic infection when antibodies are present can be selected against when the infection is being established prior to the adaptive immune response.
Collapse
Affiliation(s)
- S. Gnanakaran
- Theoretical Biology, Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
| | - Tanmoy Bhattacharya
- Theoretical Biology, Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
- Santa Fe Institute, Santa Fe, New Mexico, United States of America
| | - Marcus Daniels
- Theoretical Biology, Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
| | - Brandon F. Keele
- SAIC-Frederick, National Cancer Institute, Frederick, Maryland, United States of America
- Departments of Medicine and Microbiology, University of Alabama at Birmingham, Birmingham, Alabama, United States of America
| | - Peter T. Hraber
- Theoretical Biology, Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
| | - Alan S. Lapedes
- Theoretical Biology, Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
| | - Tongye Shen
- Theoretical Biology, Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
- Center for Molecular Biophysics and Department of Biochemistry, Cellular & Molecular Biology, University of Tennessee, Knoxville, Tennessee, United States of America
| | - Brian Gaschen
- Theoretical Biology, Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
| | - Mohan Krishnamoorthy
- Theoretical Biology, Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
| | - Hui Li
- Departments of Medicine and Microbiology, University of Alabama at Birmingham, Birmingham, Alabama, United States of America
| | - Julie M. Decker
- Departments of Medicine and Microbiology, University of Alabama at Birmingham, Birmingham, Alabama, United States of America
| | - Jesus F. Salazar-Gonzalez
- Departments of Medicine and Microbiology, University of Alabama at Birmingham, Birmingham, Alabama, United States of America
| | - Shuyi Wang
- Departments of Medicine and Microbiology, University of Alabama at Birmingham, Birmingham, Alabama, United States of America
| | - Chunlai Jiang
- National Engineering Laboratory of AIDS Vaccine School of Life Science, Jilin University, Changchun, China
- Duke University Medical Center, the Departments of Medicine and Surgery, and the Duke Human Vaccine Institute, Duke University, Durham, North Carolina, United States of America
| | - Feng Gao
- Duke University Medical Center, the Departments of Medicine and Surgery, and the Duke Human Vaccine Institute, Duke University, Durham, North Carolina, United States of America
| | - Ronald Swanstrom
- Department of Biochemistry and Biophysics and the Division of Infectious Diseases Center for AIDS Research, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - Jeffrey A. Anderson
- Department of Biochemistry and Biophysics and the Division of Infectious Diseases Center for AIDS Research, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - Li-Hua Ping
- Department of Biochemistry and Biophysics and the Division of Infectious Diseases Center for AIDS Research, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - Myron S. Cohen
- Department of Biochemistry and Biophysics and the Division of Infectious Diseases Center for AIDS Research, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - Martin Markowitz
- Aaron Diamond AIDS Research Center, an affiliate of the Rockefeller University, New York, New York, United States of America
| | - Paul A. Goepfert
- Departments of Medicine and Microbiology, University of Alabama at Birmingham, Birmingham, Alabama, United States of America
| | - Michael S. Saag
- Departments of Medicine and Microbiology, University of Alabama at Birmingham, Birmingham, Alabama, United States of America
| | - Joseph J. Eron
- Department of Biochemistry and Biophysics and the Division of Infectious Diseases Center for AIDS Research, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - Charles B. Hicks
- Duke University Medical Center, the Departments of Medicine and Surgery, and the Duke Human Vaccine Institute, Duke University, Durham, North Carolina, United States of America
| | - William A. Blattner
- Institute of Human Virology, University of Maryland, School of Medicine, Baltimore, Maryland, United States of America
| | - Georgia D. Tomaras
- Duke University Medical Center, the Departments of Medicine and Surgery, and the Duke Human Vaccine Institute, Duke University, Durham, North Carolina, United States of America
| | - Mohammed Asmal
- Beth Israel Deaconess Medical Center, Boston, Massachusetts, United States of America
| | - Norman L. Letvin
- Beth Israel Deaconess Medical Center, Boston, Massachusetts, United States of America
- Division of Viral Pathogenesis, Department of Medicine, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Peter B. Gilbert
- Vaccine Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, United State of America
| | - Allan C. DeCamp
- Vaccine Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, United State of America
| | - Craig A. Magaret
- Vaccine Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, United State of America
| | - William R. Schief
- Department of Biochemistry, University of Washington, Seattle, Washington, United States of America
| | - Yih-En Andrew Ban
- Department of Biochemistry, University of Washington, Seattle, Washington, United States of America
- Arzeda Corporation, Seattle, Washington, United States of America
| | - Ming Zhang
- Theoretical Biology, Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
- Department of Epidemiology and Biostatistics, College of Public Health, University of Georgia, Athens, Georgia, United States of America
| | - Kelly A. Soderberg
- Duke University Medical Center, the Departments of Medicine and Surgery, and the Duke Human Vaccine Institute, Duke University, Durham, North Carolina, United States of America
| | - Joseph G. Sodroski
- Dana-Farber Cancer Institute, Department of Cancer Immunology and AIDS, Boston, Massachusetts, United States of America
| | - Barton F. Haynes
- Duke University Medical Center, the Departments of Medicine and Surgery, and the Duke Human Vaccine Institute, Duke University, Durham, North Carolina, United States of America
| | - George M. Shaw
- Departments of Medicine and Microbiology, University of Alabama at Birmingham, Birmingham, Alabama, United States of America
| | - Beatrice H. Hahn
- Departments of Medicine and Microbiology, University of Alabama at Birmingham, Birmingham, Alabama, United States of America
| | - Bette Korber
- Theoretical Biology, Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
- Santa Fe Institute, Santa Fe, New Mexico, United States of America
- * E-mail:
| |
Collapse
|
4
|
Lee HY, Giorgi EE, Keele BF, Gaschen B, Athreya GS, Salazar-Gonzalez JF, Pham KT, Goepfert PA, Kilby JM, Saag MS, Delwart EL, Busch MP, Hahn BH, Shaw GM, Korber BT, Bhattacharya T, Perelson AS. Modeling sequence evolution in acute HIV-1 infection. J Theor Biol 2009; 261:341-60. [PMID: 19660475 DOI: 10.1016/j.jtbi.2009.07.038] [Citation(s) in RCA: 138] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2009] [Revised: 07/20/2009] [Accepted: 07/29/2009] [Indexed: 11/26/2022]
Abstract
We describe a mathematical model and Monte Carlo (MC) simulation of viral evolution during acute infection. We consider both synchronous and asynchronous processes of viral infection of new target cells. The model enables an assessment of the expected sequence diversity in new HIV-1 infections originating from a single transmitted viral strain, estimation of the most recent common ancestor (MRCA) of the transmitted viral lineage, and estimation of the time to coalesce back to the MRCA. We also calculate the probability of the MRCA being the transmitted virus or an evolved variant. Excluding insertions and deletions, we assume HIV-1 evolves by base substitution without selection pressure during the earliest phase of HIV-1 infection prior to the immune response. Unlike phylogenetic methods that follow a lineage backwards to coalescence, we compare the observed data to a model of the diversification of a viral population forward in time. To illustrate the application of these methods, we provide detailed comparisons of the model and simulations results to 306 envelope sequences obtained from eight newly infected subjects at a single time point. The data from 68 patients were in good agreement with model predictions, and hence compatible with a single-strain infection evolving under no selection pressure. The diversity of the samples from the other two patients was too great to be explained by the model, suggesting multiple HIV-1-strains were transmitted. The model can also be applied to longitudinal patient data to estimate within-host viral evolutionary parameters.
Collapse
Affiliation(s)
- Ha Youn Lee
- Theoretical Biology and Biophysics, Los Alamos National Laboratory, Los Alamos, NM 87545, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
5
|
Tsibris AMN, Korber B, Arnaout R, Russ C, Lo CC, Leitner T, Gaschen B, Theiler J, Paredes R, Su Z, Hughes MD, Gulick RM, Greaves W, Coakley E, Flexner C, Nusbaum C, Kuritzkes DR. Quantitative deep sequencing reveals dynamic HIV-1 escape and large population shifts during CCR5 antagonist therapy in vivo. PLoS One 2009; 4:e5683. [PMID: 19479085 PMCID: PMC2682648 DOI: 10.1371/journal.pone.0005683] [Citation(s) in RCA: 197] [Impact Index Per Article: 13.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2008] [Accepted: 04/24/2009] [Indexed: 11/24/2022] Open
Abstract
High-throughput sequencing platforms provide an approach for detecting rare HIV-1 variants and documenting more fully quasispecies diversity. We applied this technology to the V3 loop-coding region of env in samples collected from 4 chronically HIV-infected subjects in whom CCR5 antagonist (vicriviroc [VVC]) therapy failed. Between 25,000–140,000 amplified sequences were obtained per sample. Profound baseline V3 loop sequence heterogeneity existed; predicted CXCR4-using populations were identified in a largely CCR5-using population. The V3 loop forms associated with subsequent virologic failure, either through CXCR4 use or the emergence of high-level VVC resistance, were present as minor variants at 0.8–2.8% of baseline samples. Extreme, rapid shifts in population frequencies toward these forms occurred, and deep sequencing provided a detailed view of the rapid evolutionary impact of VVC selection. Greater V3 diversity was observed post-selection. This previously unreported degree of V3 loop sequence diversity has implications for viral pathogenesis, vaccine design, and the optimal use of HIV-1 CCR5 antagonists.
Collapse
Affiliation(s)
- Athe M. N. Tsibris
- Massachusetts General Hospital, Boston, Massachusetts, United States of America
- Harvard Medical School, Boston, Massachusetts, United States of America
| | - Bette Korber
- Los Alamos National Laboratories, Los Alamos, New Mexico, United States of America
- Santa Fe Institute, Santa Fe, New Mexico, United States of America
| | - Ramy Arnaout
- Brigham and Women's Hospital, Boston, Massachusetts, United States of America
- Program for Evolutionary Dynamics, Harvard University, Cambridge, Massachusetts, United States of America
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | - Carsten Russ
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | - Chien-Chi Lo
- Los Alamos National Laboratories, Los Alamos, New Mexico, United States of America
| | - Thomas Leitner
- Los Alamos National Laboratories, Los Alamos, New Mexico, United States of America
| | - Brian Gaschen
- Los Alamos National Laboratories, Los Alamos, New Mexico, United States of America
| | - James Theiler
- Los Alamos National Laboratories, Los Alamos, New Mexico, United States of America
| | - Roger Paredes
- Harvard Medical School, Boston, Massachusetts, United States of America
- Brigham and Women's Hospital, Boston, Massachusetts, United States of America
- Fundacions irsiCaixa i Lluita contra la SIDA, Hospital Universitari Germans Trias i Pujol, Universitat Autònoma de Barcelona, Badalona, Catalonia, Spain
| | - Zhaohui Su
- Harvard School of Public Health, Boston, Massachusetts, United States of America
| | - Michael D. Hughes
- Harvard School of Public Health, Boston, Massachusetts, United States of America
| | - Roy M. Gulick
- Weill Medical College, Cornell University, New York, New York, United States of America
| | - Wayne Greaves
- Schering-Plough Research Institute, Kennilworth, New Jersey, United States of America
| | - Eoin Coakley
- Monogram Biosciences, South San Francisco, California, United States of America
| | - Charles Flexner
- Johns Hopkins University, Baltimore, Maryland, United States of America
| | - Chad Nusbaum
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | - Daniel R. Kuritzkes
- Harvard Medical School, Boston, Massachusetts, United States of America
- Brigham and Women's Hospital, Boston, Massachusetts, United States of America
- * E-mail:
| |
Collapse
|
6
|
Wood N, Bhattacharya T, Keele BF, Giorgi E, Liu M, Gaschen B, Daniels M, Ferrari G, Haynes BF, McMichael A, Shaw GM, Hahn BH, Korber B, Seoighe C. HIV evolution in early infection: selection pressures, patterns of insertion and deletion, and the impact of APOBEC. PLoS Pathog 2009; 5:e1000414. [PMID: 19424423 PMCID: PMC2671846 DOI: 10.1371/journal.ppat.1000414] [Citation(s) in RCA: 143] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2008] [Accepted: 04/06/2009] [Indexed: 11/24/2022] Open
Abstract
The pattern of viral diversification in newly infected individuals provides information about the host environment and immune responses typically experienced by the newly transmitted virus. For example, sites that tend to evolve rapidly across multiple early-infection patients could be involved in enabling escape from common early immune responses, could represent adaptation for rapid growth in a newly infected host, or could represent reversion from less fit forms of the virus that were selected for immune escape in previous hosts. Here we investigated the diversification of HIV-1 env coding sequences in 81 very early B subtype infections previously shown to have resulted from transmission or expansion of single viruses (n = 78) or two closely related viruses (n = 3). In these cases, the sequence of the infecting virus can be estimated accurately, enabling inference of both the direction of substitutions as well as distinction between insertion and deletion events. By integrating information across multiple acutely infected hosts, we find evidence of adaptive evolution of HIV-1 env and identify a subset of codon sites that diversified more rapidly than can be explained by a model of neutral evolution. Of 24 such rapidly diversifying sites, 14 were either i) clustered and embedded in CTL epitopes that were verified experimentally or predicted based on the individual's HLA or ii) in a nucleotide context indicative of APOBEC-mediated G-to-A substitutions, despite having excluded heavily hypermutated sequences prior to the analysis. In several cases, a rapidly evolving site was embedded both in an APOBEC motif and in a CTL epitope, suggesting that APOBEC may facilitate early immune escape. Ten rapidly diversifying sites could not be explained by CTL escape or APOBEC hypermutation, including the most frequently mutated site, in the fusion peptide of gp41. We also examined the distribution, extent, and sequence context of insertions and deletions, and we provide evidence that the length variation seen in hypervariable loop regions of the envelope glycoprotein is a consequence of selection and not of mutational hotspots. Our results provide a detailed view of the process of diversification of HIV-1 following transmission, highlighting the role of CTL escape and hypermutation in shaping viral evolution during the establishment of new infections. HIV is a rapidly evolving virus, displaying enormous genetic diversity between and even within infected individuals, with implications for vaccine design and drug treatment. Yet, recent research has shown that most new infections result from transmission of a single virus resulting in a homogeneous viral population in early infection. The process of diversification from the transmitted virus provides information about the selection pressures experienced by the virus during the establishment of a new infection. In this paper, we studied early diversification of the envelope gene in a cohort of 81 subjects acutely infected with HIV-1 subtype B and found evidence of adaptive evolution, with a proportion of sites that tended to diversify more rapidly than expected under a model of neutral evolution. Several of these rapidly diversifying sites facilitate escape from early cytotoxic immune responses. Interestingly, hypermutation of the virus, brought about by host proteins as a strategy to restrict infection, appeared to be associated with early immune escape. In addition to single base substitutions, insertions and deletions are an important aspect of HIV evolution. We show that insertion and deletion mutations occur evenly across the gene, but are preferentially fixed in the variable loop regions.
Collapse
Affiliation(s)
- Natasha Wood
- Institute of Infectious Disease and Molecular Medicine, University of Cape Town, Observatory, Cape Town, South Africa
- Centre for High-Performance Computing, CSIR Campus, Rosebank, Cape Town, South Africa
| | - Tanmoy Bhattacharya
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
- Santa Fe Institute, Santa Fe, New Mexico, United States of America
| | - Brandon F. Keele
- University of Alabama at Birmingham, Birmingham, Alabama, United States of America
| | - Elena Giorgi
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
- Department of Mathematics and Statistics, University of Massachusetts, Amherst, Massachusetts, United States of America
| | - Michael Liu
- Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, United Kingdom
| | - Brian Gaschen
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
| | - Marcus Daniels
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
| | - Guido Ferrari
- Duke University, Durham, North Carolina, United States of America
| | - Barton F. Haynes
- Duke University, Durham, North Carolina, United States of America
| | - Andrew McMichael
- Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, United Kingdom
| | - George M. Shaw
- University of Alabama at Birmingham, Birmingham, Alabama, United States of America
| | - Beatrice H. Hahn
- University of Alabama at Birmingham, Birmingham, Alabama, United States of America
| | - Bette Korber
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
- Santa Fe Institute, Santa Fe, New Mexico, United States of America
| | - Cathal Seoighe
- Institute of Infectious Disease and Molecular Medicine, University of Cape Town, Observatory, Cape Town, South Africa
- Centre for High-Performance Computing, CSIR Campus, Rosebank, Cape Town, South Africa
- School of Mathematics, Statistics and Applied Mathematics, National University of Ireland Galway, Galway, Ireland
- * E-mail:
| |
Collapse
|
7
|
Bhattacharya T, Daniels M, Heckerman D, Foley B, Frahm N, Kadie C, Carlson J, Yusim K, McMahon B, Gaschen B, Mallal S, Mullins JI, Nickle DC, Herbeck J, Rousseau C, Learn GH, Miura T, Brander C, Walker B, Korber B. Founder effects in the assessment of HIV polymorphisms and HLA allele associations. Science 2007; 315:1583-6. [PMID: 17363674 DOI: 10.1126/science.1131528] [Citation(s) in RCA: 210] [Impact Index Per Article: 12.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Escape from T cell-mediated immune responses affects the ongoing evolution of rapidly evolving viruses such as HIV. By applying statistical approaches that account for phylogenetic relationships among viral sequences, we show that viral lineage effects rather than immune escape often explain apparent human leukocyte antigen (HLA)-mediated immune-escape mutations defined by older analysis methods. Phylogenetically informed methods identified immune-susceptible locations with greatly improved accuracy, and the associations we identified with these methods were experimentally validated. This approach has practical implications for understanding the impact of host immunity on pathogen evolution and for defining relevant variants for inclusion in vaccine antigens.
Collapse
|
8
|
Zhang M, Wilbe K, Wolfe ND, Gaschen B, Carr JK, Leitner T. HIV type 1 CRF13_cpx revisited: identification of a new sequence from Cameroon and signal for subsubtype J2. AIDS Res Hum Retroviruses 2005; 21:955-60. [PMID: 16386113 DOI: 10.1089/aid.2005.21.955] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
A nearly full-length genome sequence of an HIV-1 isolate originating from Cameroon, 02CM.3226MN, was found to cluster together with previously reported CRF13 sequences 96CM-4164 and 96CM-1849. Similarity plotting, bootscanning, breakpoint analysis, and phylogenetic trees confirmed similar genomic structures with almost identical breakpoint positions among these three isolates. Thus, CRF13 now fulfills the HIV-1 nomenclature requirements. A X2 analysis across all three genomes simultaneously was applied to more accurately determine breakpoints and address the uncertainty in such estimates. Some fragments were found to be difficult to classify, as indicated by a low branching index (BI), due to limited knowledge about parental and reference subtype sequences. One fragment with low BI association to reference subtype J sequences (BI = 0.27, cut-off for subtype classification >0.55) was found to be closer to J fragments of CRF11 similar to the way that A1-A2 and F1-F2 subsubtypes associate. This suggests that subtype J may need to be reclassified into subsubtypes J1 and J2. The CRF13 genome consists of fragments from subtypes A1, G, and both J1 and J2 as well as CRF01 and one region that was left unclassified.
Collapse
Affiliation(s)
- Ming Zhang
- Theoretical Biology and Biophysics Group, Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM, 87545, USA
| | | | | | | | | | | |
Collapse
|
9
|
Fang G, Kuiken C, Weiser B, Rowland-Jones S, Plummer F, Chen CH, Kaul R, Anzala AO, Bwayo J, Kimani J, Philpott SM, Kitchen C, Sinsheimer JS, Gaschen B, Lang D, Shi B, Kemal KS, Rostron T, Brunner C, Beddows S, Sattenau Q, Paxinos E, Oyugi J, Burger H. Long-term survivors in Nairobi: complete HIV-1 RNA sequences and immunogenetic associations. J Infect Dis 2004; 190:697-701. [PMID: 15272396 DOI: 10.1086/421504] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2003] [Accepted: 01/07/2004] [Indexed: 11/03/2022] Open
Abstract
To investigate African long-term survivors (LTSs) infected with non-subtype B human immunodeficiency virus type 1 (HIV-1), we obtained full-length HIV-1 RNA sequences and immunogenetic profiles from 6 untreated women enrolled in the Pumwani Sex Worker Cohort in Nairobi, Kenya. There were no discernible sequence changes likely to cause attenuation. CCR2-V64I, an immunogenetic polymorphism linked to LTSs, was detected in 4 women, all of whom carried the HLA B58 allele. Further investigation of 99 HIV-1-infected Nairobi women found an association between CCR2-V64I and HLA B58 (P=.0048). Studying the interaction among immunogenetics, immune responses, and viral sequences from all HIV-1 subtypes may increase our understanding of slow HIV-1 disease progression.
Collapse
Affiliation(s)
- Guowei Fang
- Wadsworth Center, NY State Department of Health, Albany, New York 12208, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
10
|
Zhang M, Gaschen B, Blay W, Foley B, Haigwood N, Kuiken C, Korber B. Tracking global patterns of N-linked glycosylation site variation in highly variable viral glycoproteins: HIV, SIV, and HCV envelopes and influenza hemagglutinin. Glycobiology 2004; 14:1229-46. [PMID: 15175256 DOI: 10.1093/glycob/cwh106] [Citation(s) in RCA: 376] [Impact Index Per Article: 18.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
Human and simian immunodeficiency viruses (HIV and SIV), influenza virus, and hepatitis C virus (HCV) have heavily glycosylated, highly variable surface proteins. Here we explore N-linked glycosylation site (sequon) variation at the population level in these viruses, using a new Web-based program developed to facilitate the sequon tracking and to define patterns (www.hiv.lanl.gov). This tool allowed rapid visualization of the two distinctive patterns of sequon variation found in HIV-1, HIV-2, and SIV CPZ. The first pattern (fixed) describes readily aligned sites that are either simply present or absent. These sites tend to be occupied by high-mannose glycans. The second pattern (shifting) refers to sites embedded in regions of extreme local length variation and is characterized by shifts in terms of the relative position and local density of sequons; these sites tend to be populated by complex carbohydrates. HIV, with its extreme variation in number and precise location of sequons, does not have a net increase in the number of sites over time at the population level. Primate lentiviral lineages have host species-dependent levels of sequon shifting, with HIV-1 in humans the most extreme. HCV E1 and E2 proteins, despite evolving extremely rapidly through point mutation, show limited sequon variation, although two shifting sites were identified. Human influenza A hemagglutinin H3 HA1 is accumulating sequons over time, but this trend is not evident in any other avian or human influenza A serotypes.
Collapse
Affiliation(s)
- Ming Zhang
- Theoretical Biology Group, Los Alamos National Laboratory, Los Alamos, NM 87544, USA
| | | | | | | | | | | | | |
Collapse
|
11
|
Yusim K, Kesmir C, Gaschen B, Addo MM, Altfeld M, Brunak S, Chigaev A, Detours V, Korber BT. Clustering patterns of cytotoxic T-lymphocyte epitopes in human immunodeficiency virus type 1 (HIV-1) proteins reveal imprints of immune evasion on HIV-1 global variation. J Virol 2002; 76:8757-68. [PMID: 12163596 PMCID: PMC136996 DOI: 10.1128/jvi.76.17.8757-8768.2002] [Citation(s) in RCA: 211] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The human cytotoxic T-lymphocyte (CTL) response to human immunodeficiency virus type 1 (HIV-1) has been intensely studied, and hundreds of CTL epitopes have been experimentally defined, published, and compiled in the HIV Molecular Immunology Database. Maps of CTL epitopes on HIV-1 protein sequences reveal that defined epitopes tend to cluster. Here we integrate the global sequence and immunology databases to systematically explore the relationship between HIV-1 amino acid sequences and CTL epitope distributions. CTL responses to five HIV-1 proteins, Gag p17, Gag p24, reverse transcriptase (RT), Env, and Nef, have been particularly well characterized in the literature to date. Through comparing CTL epitope distributions in these five proteins to global protein sequence alignments, we identified distinct characteristics of HIV amino acid sequences that correlate with CTL epitope localization. First, experimentally defined HIV CTL epitopes are concentrated in relatively conserved regions. Second, the highly variable regions that lack epitopes bear cumulative evidence of past immune escape that may make them relatively refractive to CTLs: a paucity of predicted proteasome processing sites and an enrichment for amino acids that do not serve as C-terminal anchor residues. Finally, CTL epitopes are more highly concentrated in alpha-helical regions of proteins. Based on amino acid sequence characteristics, in a blinded fashion, we predicted regions in HIV regulatory and accessory proteins that would be likely to contain CTL epitopes; these predictions were then validated by comparison to new sets of experimentally defined epitopes in HIV-1 Rev, Tat, Vif, and Vpr.
Collapse
Affiliation(s)
- Karina Yusim
- Los Alamos National Laboratory, Los Alamos, New Mexico 87545. Santa Fe Institute, Santa Fe, New Mexico 87501, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
12
|
Gaschen B, Taylor J, Yusim K, Foley B, Gao F, Lang D, Novitsky V, Haynes B, Hahn BH, Bhattacharya T, Korber B. Diversity considerations in HIV-1 vaccine selection. Science 2002; 296:2354-60. [PMID: 12089434 DOI: 10.1126/science.1070441] [Citation(s) in RCA: 600] [Impact Index Per Article: 27.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Abstract
Globally, human immunodeficiency virus-type 1 (HIV-1) is extraordinarily variable, and this diversity poses a major obstacle to AIDS vaccine development. Currently, candidate vaccines are derived from isolates, with the hope that they will be sufficiently cross-reactive to protect against circulating viruses. This may be overly optimistic, however, given that HIV-1 envelope proteins can differ in more than 30% of their amino acids. To contend with the diversity, country-specific vaccines are being considered, but evolutionary relationships may be more useful than regional considerations. Consensus or ancestor sequences could be used in vaccine design to minimize the genetic differences between vaccine strains and contemporary isolates, effectively reducing the extent of diversity by half.
Collapse
Affiliation(s)
- Brian Gaschen
- Los Alamos National Laboratory, Los Alamos, NM 87545, USA
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
13
|
Abstract
MOTIVATION The amount of HIV-1 sequence data generated (presently around 42000 sequences, of which more than 22000 are from the V3 region of the viral envelope) presents a challenge for anyone working on the analysis of these data. A major problem is obtaining the region of interest from the stored sequences, which often contain but are not limited to that region. In addition, multiple alignment programs generally cannot deal with the large numbers of sequences that are available for many HIV-1 regions. We set out to provide our users with a tool that will retrieve and create an initial alignment of the HIV sequences that are available for a given genomic region. RESULTS The MPAlign (Multiple Pairwise Alignment) web interface is a collection of Perl scripts that retrieves sequences from the Los Alamos HIV sequence database based on a number of search parameters. All sequences were pairwise-aligned to a model sequence using the Hidden Markov Model-based program HMMER. The HMMER model is general enough to accommodate virtually all HIV-1 sequences stored in the database. To create a multiple sequence alignment, gaps were inserted into the sequences during retrieval, so that they are aligned to one another. Retrieving and aligning the almost 560 gp120 sequences (approximately>1500 nt) stored in the database is at least 1500 times faster than a similar Clustal alignment.
Collapse
Affiliation(s)
- B Gaschen
- HIV Database and Analysis Group (T10), Los Alamos National Laboratory, Mail stop K710, Los Alamos, NM 87545, USA.
| | | | | | | |
Collapse
|
14
|
Abstract
Evolutionary modelling studies indicate less than a century has passed since the most recent common ancestor of the HIV-1 pandemic strains and, in that time frame, an extraordinarily diverse viral population has developed. HIV-1 employs a multitude of schemes to generate variants: accumulation of base substitutions, insertions and deletions, addition and loss of glycosylation sites in the envelope protein, and recombination. A comparison between HIV and influenza virus illustrates the extraordinary scale of HIV variation, and underscores the importance of exploring innovative HIV vaccine strategies. Deeper understanding of the implications of variation for both antibody and T-cell responses may help in the effort to rationally design vaccines that stimulate broad cross-reactivity. The impact of HIV-1 variation on host immune response is reviewed in this context.
Collapse
Affiliation(s)
- B Korber
- Division of Theoretical Biology and Biophysics, Los Alamos National Laboratory, Los Alamos, New Mexico 98545, USA
| | | | | | | | | | | |
Collapse
|
15
|
Robertson B, Myers G, Howard C, Brettin T, Bukh J, Gaschen B, Gojobori T, Maertens G, Mizokami M, Nainan O, Netesov S, Nishioka K, Shin i T, Simmonds P, Smith D, Stuyver L, Weiner A. Classification, nomenclature, and database development for hepatitis C virus (HCV) and related viruses: proposals for standardization. International Committee on Virus Taxonomy. Arch Virol 1999; 143:2493-503. [PMID: 9930205 DOI: 10.1007/s007050050479] [Citation(s) in RCA: 323] [Impact Index Per Article: 12.9] [Reference Citation Analysis] [What about the content of this article? (0)] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
|