1
|
Icer Baykal PB, Lara J, Khudyakov Y, Zelikovsky A, Skums P. Quantitative differences between intra-host HCV populations from persons with recently established and persistent infections. Virus Evol 2020; 7:veaa103. [PMID: 33505710 PMCID: PMC7816669 DOI: 10.1093/ve/veaa103] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Detection of incident hepatitis C virus (HCV) infections is crucial for identification of outbreaks and development of public health interventions. However, there is no single diagnostic assay for distinguishing recent and persistent HCV infections. HCV exists in each infected host as a heterogeneous population of genomic variants, whose evolutionary dynamics remain incompletely understood. Genetic analysis of such viral populations can be applied to the detection of incident HCV infections and used to understand intra-host viral evolution. We studied intra-host HCV populations sampled using next-generation sequencing from 98 recently and 256 persistently infected individuals. Genetic structure of the populations was evaluated using 245,878 viral sequences from these individuals and a set of selected features measuring their diversity, topological structure, complexity, strength of selection, epistasis, evolutionary dynamics, and physico-chemical properties. Distributions of the viral population features differ significantly between recent and persistent infections. A general increase in viral genetic diversity from recent to persistent infections is frequently accompanied by decline in genomic complexity and increase in structuredness of the HCV population, likely reflecting a high level of intra-host adaptation at later stages of infection. Using these findings, we developed a machine learning classifier for the infection staging, which yielded a detection accuracy of 95.22 per cent, thus providing a higher accuracy than other genomic-based models. The detection of a strong association between several HCV genetic factors and stages of infection suggests that intra-host HCV population develops in a complex but regular and predictable manner in the course of infection. The proposed models may serve as a foundation of cyber-molecular assays for staging infection, which could potentially complement and/or substitute standard laboratory assays.
Collapse
Affiliation(s)
- Pelin B Icer Baykal
- Department of Computer Science, Georgia State University, 25 Park Place, Atlanta, GA 30302, USA
| | - James Lara
- Division of Viral Hepatitis, Centers for Disease Control and Prevention, 1600 Clifton Rd., Atlanta, GA 30329, USA
| | - Yury Khudyakov
- Division of Viral Hepatitis, Centers for Disease Control and Prevention, 1600 Clifton Rd., Atlanta, GA 30329, USA
| | - Alex Zelikovsky
- Department of Computer Science, Georgia State University, 25 Park Place, Atlanta, GA 30302, USA
| | - Pavel Skums
- Department of Computer Science, Georgia State University, 25 Park Place, Atlanta, GA 30302, USA
| |
Collapse
|
2
|
Lara J, Teka MA, Sims S, Xia GL, Ramachandran S, Khudyakov Y. HCV adaptation to HIV coinfection. INFECTION GENETICS AND EVOLUTION 2018; 65:216-225. [PMID: 30075255 DOI: 10.1016/j.meegid.2018.07.039] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/24/2018] [Revised: 07/25/2018] [Accepted: 07/30/2018] [Indexed: 02/07/2023]
Abstract
Human immunodeficiency virus (HIV) infection is rising as a leading cause of morbidity and mortality among hepatitis C virus (HCV)-infected patients. Both viruses interact in co-infected hosts, which may affect their intra-host evolution, potentially leading to differing genetic composition of viral populations in co-infected (CIP) and mono-infected (MIP) patients. Here, we investigate genetic differences between intra-host variants of the HCV hypervariable region 1 (HVR1) sampled from CIP and MIP. Nucleotide (nt) sequences of intra-host HCV HVR1 variants (N = 28,622) obtained from CIP (N = 112) and MIP (n = 176) were represented using 148 physical-chemical (PhyChem) indexes of DNA nt dimers. Significant (p < .0001) differences in the means and frequency distributions of 7 PhyChem properties were found between HVR1 variants from both groups. Linear projection analysis of 29 PhyChem features extracted from such PhyChem properties showed that the CIP and MIP HVR1 variants have a distinct distribution in the modeled 2D-space, with only ~1.3% of PhyChem profiles (N = 6782), shared by all HVR1 variants, being found in both groups. Probabilistic neural network (PNN) and naïve Bayesian (NB) classifiers trained on the PhyChem features accurately classified HVR1 variants by the group in cross-validation experiments (AUROC ≥ 0.96). Similarly, both models showed a high accuracy (AUROC ≥ 0.95) when evaluated on a test dataset of HVR1 sequences obtained from 10 patients, data from whom were not used for model building. Both models performed at the expected lower accuracy on randomly labeled datasets in cross-validation experiments (AUROC = 0.50). The random-label trained PNN showed a similar drop in accuracy on the test dataset (AUROC = 0.48), indicating that the detected associations were unlikely due to random correlations. Marked differences in genetic composition of HCV HVR1 variants sampled from CIP and MIP suggest differing intra-host HCV evolution in the presence of HIV infection. PhyChem features identified here may be used for detection of HIV infection from intra-host HCV variants alone in co-infected patients, thus facilitating monitoring for HIV introduction to high-risk populations with high HCV prevalence.
Collapse
Affiliation(s)
- James Lara
- Centers for Disease Control, 1600 Clifton Road, Atlanta, GA 30333, United States.
| | - Mahder A Teka
- Centers for Disease Control, 1600 Clifton Road, Atlanta, GA 30333, United States
| | - Seth Sims
- Centers for Disease Control, 1600 Clifton Road, Atlanta, GA 30333, United States
| | - Guo-Liang Xia
- Centers for Disease Control, 1600 Clifton Road, Atlanta, GA 30333, United States
| | - Sumathi Ramachandran
- Centers for Disease Control, 1600 Clifton Road, Atlanta, GA 30333, United States
| | - Yury Khudyakov
- Centers for Disease Control, 1600 Clifton Road, Atlanta, GA 30333, United States
| |
Collapse
|
3
|
Lara J, Teka M, Khudyakov Y. Identification of recent cases of hepatitis C virus infection using physical-chemical properties of hypervariable region 1 and a radial basis function neural network classifier. BMC Genomics 2017; 18:880. [PMID: 29244000 PMCID: PMC5731502 DOI: 10.1186/s12864-017-4269-2] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
Background Identification of acute or recent hepatitis C virus (HCV) infections is important for detecting outbreaks and devising timely public health interventions for interruption of transmission. Epidemiological investigations and chemistry-based laboratory tests are 2 main approaches that are available for identification of acute HCV infection. However, owing to complexity, both approaches are not efficient. Here, we describe a new sequence alignment-free method to discriminate between recent (R) and chronic (C) HCV infection using next-generation sequencing (NGS) data derived from the HCV hypervariable region 1 (HVR1). Results Using dinucleotide auto correlation (DAC), we identified physical-chemical (PhyChem) features of HVR1 variants. Significant (p < 9.58 × 10−4) differences in the means and frequency distributions of PhyChem features were found between HVR1 variants sampled from patients with recent vs chronic (R/C) infection. Moreover, the R-associated variants were found to occupy distinct and discrete PhyChem spaces. A radial basis function neural network classifier trained on the PhyChem features of intra-host HVR1 variants accurately classified R/C-HVR1 variants (classification accuracy (CA) = 94.85%; area under the ROC curve, AUROC = 0.979), in 10-fold cross-validation). The classifier was accurate in assigning individual HVR1 variants to R/C-classes in the testing set (CA = 84.15%; AUROC = 0.912) and in detection of infection duration (R/C-class) in patients (CA = 88.45%). Statistical tests and evaluation of the classifier on randomly-labeled datasets indicate that classifiers’ CA is robust (p < 0.001) and unlikely due to random correlations (CA = 59.04% and AUROC = 0.50). Conclusions The PhyChem features of intra-host HVR1 variants are strongly associated with the duration of HCV infection. Application of the PhyChem biomarkers to models for detection of the R/C-state of HCV infection in patients offers a new opportunity for detection of outbreaks and for molecular surveillance. The method will be available at https://webappx.cdc.gov/GHOST/ to the authenticated users of Global Hepatitis Outbreak and Surveillance Technology (GHOST) for further testing and validation.
Collapse
Affiliation(s)
- James Lara
- Division of Viral Hepatitis, National Center for HIV, Hepatitis, TB and STD Prevention, Centers for Disease Control and Prevention, Atlanta, GA, 30333, USA.
| | - Mahder Teka
- Division of Viral Hepatitis, National Center for HIV, Hepatitis, TB and STD Prevention, Centers for Disease Control and Prevention, Atlanta, GA, 30333, USA
| | - Yury Khudyakov
- Division of Viral Hepatitis, National Center for HIV, Hepatitis, TB and STD Prevention, Centers for Disease Control and Prevention, Atlanta, GA, 30333, USA
| |
Collapse
|
4
|
Genetic Variability of Hepatitis C Virus (HCV) 5' Untranslated Region in HIV/HCV Coinfected Patients Treated with Pegylated Interferon and Ribavirin. PLoS One 2015; 10:e0125604. [PMID: 25932941 PMCID: PMC4416933 DOI: 10.1371/journal.pone.0125604] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2014] [Accepted: 03/24/2015] [Indexed: 01/03/2023] Open
Abstract
Association between hepatitis C virus (HCV) quasispecies and treatment outcome among patients with chronic hepatitis C has been the subject of many studies. However, these studies focused mainly on viral variable regions (E1 and E2) and usually did not include human immunodeficiency virus (HIV)-positive patients. The aim of the present study was to analyze heterogeneity of the 5' untranslated region (5'UTR) in HCV/HIV coinfected patients treated with interferon and ribavirin. The HCV 5'UTR was amplified from serum and peripheral blood mononuclear cells (PBMC) samples in 37 HCV/HIV coinfected patients treated for chronic hepatitis C. Samples were collected right before treatment, and at 2, 4, 6, 8, 12, 20, 24, 36, 44, 48, 60, and 72 weeks. Heterogeneity of the 5'UTR was analyzed by single strand conformational polymorphism (SSCP), cloning and sequencing. Sustained virological response (SVR) was achieved in 46% of analyzed HCV/HIV co-infected patients. Stable SSCP band pattern was observed in 22 patients (62.9%) and SVR rate among these patients was 23%. Decline in the number of bands and/or shift in band positions were found in 6 patients (17.1%), 5 (83%) of whom achieved SVR (p=0.009). A novel viral genotype was identified in all but one of these patients. In 5 of these 6 patients a new genotype was dominant. 5'UTR heterogeneity may correlate with interferon and ribavirin treatment outcome. In the analyzed group of HCV/HIV coinfected patients, viral quasispecies stability during treatment favored viral persistence, whereas decrease in the number of variants and/or emergence of new variants was associated with SVR. Among injection drug users (IDU) patients, a new genotype may become dominant during treatment, probably due to the presence of mixed infections with various strains, which have different susceptibility to treatment.
Collapse
|
5
|
Lara J, Purdy MA, Khudyakov YE. Genetic host specificity of hepatitis E virus. INFECTION, GENETICS AND EVOLUTION : JOURNAL OF MOLECULAR EPIDEMIOLOGY AND EVOLUTIONARY GENETICS IN INFECTIOUS DISEASES 2014; 24:127-39. [PMID: 24667049 PMCID: PMC5745802 DOI: 10.1016/j.meegid.2014.03.011] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/20/2013] [Revised: 02/24/2014] [Accepted: 03/16/2014] [Indexed: 01/06/2023]
Abstract
Hepatitis E virus (HEV) causes epidemic and sporadic cases of hepatitis worldwide. HEV genotypes 3 (HEV3) and 4 (HEV4) infect humans and animals, with swine being the primary reservoir. The relevance of HEV genetic diversity to host adaptation is poorly understood. We employed a Bayesian network (BN) analysis of HEV3 and HEV4 to detect epistatic connectivity among protein sites and its association with the host specificity in each genotype. The data imply coevolution among ∼70% of polymorphic sites from all HEV proteins and association of numerous coevolving sites with adaptation to swine or humans. BN models for individual proteins and domains of the nonstructural polyprotein detected the host origin of HEV strains with accuracy of 74-93% and 63-87%, respectively. These findings, taken together with lack of phylogenetic association to host, suggest that the HEV host specificity is a heritable and convergent phenotypic trait achievable through variety of genetic pathways (abundance), and explain a broad host range for HEV3 and HEV4.
Collapse
Affiliation(s)
- James Lara
- Division of Viral Hepatitis, Centers for Disease Control and Prevention, Atlanta, GA, USA.
| | - Michael A Purdy
- Division of Viral Hepatitis, Centers for Disease Control and Prevention, Atlanta, GA, USA
| | - Yury E Khudyakov
- Division of Viral Hepatitis, Centers for Disease Control and Prevention, Atlanta, GA, USA
| |
Collapse
|
6
|
Ganova-Raeva LM, Khudyakov YE. Application of mass spectrometry to molecular diagnostics of viral infections. Expert Rev Mol Diagn 2013; 13:377-88. [PMID: 23638820 DOI: 10.1586/erm.13.24] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023]
Abstract
Mass spectrometry (MS) has found numerous applications in life sciences. It has high accuracy, sensitivity and wide dynamic range in addition to medium- to high-throughput capabilities. These features make MS a superior platform for analysis of various biomolecules including proteins, lipids, nucleic acids and carbohydrates. Until recently, MS was applied for protein detection and characterization. During the last decade, however, MS has successfully been used for molecular diagnostics of microbial and viral infections with the most notable applications being identification of pathogens, genomic sequencing, mutation detection, DNA methylation analysis, tracking of transmissions, and characterization of genetic heterogeneity. These new developments vastly expand the MS application from experimental research to public health and clinical fields. Matching of molecular techniques with specific requirements of the major MS platforms has produced powerful technologies for molecular diagnostics, which will further benefit from coupling with computational tools for extracting clinical information from MS-derived data.
Collapse
Affiliation(s)
- Lilia M Ganova-Raeva
- Centers for Disease Control and Prevention, Division of Viral Hepatitis, 1600 Clifton Rd. NE, MS A-33, Atlanta, GA 30329, USA.
| | | |
Collapse
|