1
|
Lucaci AG, Zehr JD, Enard D, Thornton JW, Kosakovsky Pond SL. Evolutionary Shortcuts via Multinucleotide Substitutions and Their Impact on Natural Selection Analyses. Mol Biol Evol 2023; 40:msad150. [PMID: 37395787 PMCID: PMC10336034 DOI: 10.1093/molbev/msad150] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2023] [Revised: 06/15/2023] [Accepted: 06/26/2023] [Indexed: 07/04/2023] Open
Abstract
Inference and interpretation of evolutionary processes, in particular of the types and targets of natural selection affecting coding sequences, are critically influenced by the assumptions built into statistical models and tests. If certain aspects of the substitution process (even when they are not of direct interest) are presumed absent or are modeled with too crude of a simplification, estimates of key model parameters can become biased, often systematically, and lead to poor statistical performance. Previous work established that failing to accommodate multinucleotide (or multihit, MH) substitutions strongly biases dN/dS-based inference towards false-positive inferences of diversifying episodic selection, as does failing to model variation in the rate of synonymous substitution (SRV) among sites. Here, we develop an integrated analytical framework and software tools to simultaneously incorporate these sources of evolutionary complexity into selection analyses. We found that both MH and SRV are ubiquitous in empirical alignments, and incorporating them has a strong effect on whether or not positive selection is detected (1.4-fold reduction) and on the distributions of inferred evolutionary rates. With simulation studies, we show that this effect is not attributable to reduced statistical power caused by using a more complex model. After a detailed examination of 21 benchmark alignments and a new high-resolution analysis showing which parts of the alignment provide support for positive selection, we show that MH substitutions occurring along shorter branches in the tree explain a significant fraction of discrepant results in selection detection. Our results add to the growing body of literature which examines decades-old modeling assumptions (including MH) and finds them to be problematic for comparative genomic data analysis. Because multinucleotide substitutions have a significant impact on natural selection detection even at the level of an entire gene, we recommend that selection analyses of this type consider their inclusion as a matter of routine. To facilitate this procedure, we developed, implemented, and benchmarked a simple and well-performing model testing selection detection framework able to screen an alignment for positive selection with two biologically important confounding processes: site-to-site synonymous rate variation, and multinucleotide instantaneous substitutions.
Collapse
Affiliation(s)
- Alexander G Lucaci
- Institute for Genomics and Evolutionary Medicine, Temple University, Philadelphia, PA, USA
| | - Jordan D Zehr
- Institute for Genomics and Evolutionary Medicine, Temple University, Philadelphia, PA, USA
| | - David Enard
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona
| | - Joseph W Thornton
- Department of Human Genetics, University of Chicago, Chicago, Illinois
- Department of Ecology & Evolution, University of Chicago, Chicago, Illinois
| | | |
Collapse
|
2
|
Nagata S, Kiyohara R, Toh H. Constraint of Base Pairing on HDV Genome Evolution. Viruses 2021; 13:v13122350. [PMID: 34960619 PMCID: PMC8708965 DOI: 10.3390/v13122350] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2021] [Revised: 10/11/2021] [Accepted: 11/02/2021] [Indexed: 11/17/2022] Open
Abstract
The hepatitis delta virus is a single-stranded circular RNA virus, which is characterized by high self-complementarity. About 70% of the genome sequences can form base-pairs with internal nucleotides. There are many studies on the evolution of the hepatitis delta virus. However, the secondary structure has not been taken into account in these studies. In this study, we developed a method to examine the effect of base pairing as a constraint on the nucleotide substitutions during the evolution of the hepatitis delta virus. The method revealed that the base pairing can reduce the evolutionary rate in the non-coding region of the virus. In addition, it is suggested that the non-coding nucleotides without base pairing may be under some constraint, and that the intensity of the constraint is weaker than that by the base pairing but stronger than that on the synonymous site.
Collapse
|
3
|
Chang WS, Pettersson JHO, Le Lay C, Shi M, Lo N, Wille M, Eden JS, Holmes EC. Novel hepatitis D-like agents in vertebrates and invertebrates. Virus Evol 2019; 5:vez021. [PMID: 31321078 PMCID: PMC6628682 DOI: 10.1093/ve/vez021] [Citation(s) in RCA: 51] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Hepatitis delta virus (HDV) is the smallest known RNA virus, encoding a single protein. Until recently, HDV had only been identified in humans, where it is strongly associated with co-infection with hepatitis B virus (HBV). However, the recent discovery of HDV-like viruses in metagenomic samples from birds and snakes suggests that this virus has a far longer evolutionary history. Herein, using additional meta-transcriptomic data, we show that highly divergent HDV-like viruses are also present in fish, amphibians, and invertebrates, with PCR and Sanger sequencing confirming the presence of the invertebrate HDV-like viruses. Notably, the novel viruses identified here share genomic features characteristic of HDV, such as a circular genome of only approximately 1.7 kb in length, and self-complementary, unbranched rod-like structures. Coiled-coil domains, leucine zippers, conserved residues with essential biological functions, and isoelectronic points similar to those in the human hepatitis delta virus antigens (HDAgs) were also identified in the putative non-human viruses. Importantly, none of these novel HDV-like viruses were associated with hepadnavirus infection, supporting the idea that the HDV–HBV association may be specific to humans. Collectively, these data not only broaden our understanding of the diversity and host range of HDV, but also shed light on its origin and evolutionary history.
Collapse
Affiliation(s)
- Wei-Shan Chang
- School of Life and Environmental Sciences and Sydney Medical School, Marie Bashir Institute for Infectious Diseases and Biosecurity, Charles Perkins Centre, The University of Sydney, Sydney, NSW, Australia
| | - John H-O Pettersson
- School of Life and Environmental Sciences and Sydney Medical School, Marie Bashir Institute for Infectious Diseases and Biosecurity, Charles Perkins Centre, The University of Sydney, Sydney, NSW, Australia.,Department of Medical Biochemistry and Microbiology, Zoonosis Science Center, Uppsala University, Uppsala, Sweden
| | - Callum Le Lay
- School of Life and Environmental Sciences and Sydney Medical School, Marie Bashir Institute for Infectious Diseases and Biosecurity, Charles Perkins Centre, The University of Sydney, Sydney, NSW, Australia
| | - Mang Shi
- School of Life and Environmental Sciences and Sydney Medical School, Marie Bashir Institute for Infectious Diseases and Biosecurity, Charles Perkins Centre, The University of Sydney, Sydney, NSW, Australia
| | - Nathan Lo
- School of Life and Environmental Sciences and Sydney Medical School, Marie Bashir Institute for Infectious Diseases and Biosecurity, Charles Perkins Centre, The University of Sydney, Sydney, NSW, Australia
| | - Michelle Wille
- The Peter Doherty Institute for Infection and Immunity, WHO Collaborating Centre for Reference and Research on Influenza, Melbourne, VIC, Australia
| | - John-Sebastian Eden
- School of Life and Environmental Sciences and Sydney Medical School, Marie Bashir Institute for Infectious Diseases and Biosecurity, Charles Perkins Centre, The University of Sydney, Sydney, NSW, Australia.,Westmead Institute for Medical Research, Centre for Virus Research, Westmead, NSW, Australia
| | - Edward C Holmes
- School of Life and Environmental Sciences and Sydney Medical School, Marie Bashir Institute for Infectious Diseases and Biosecurity, Charles Perkins Centre, The University of Sydney, Sydney, NSW, Australia
| |
Collapse
|
4
|
Abstract
In this chapter, we give a not-so-long and self-contained introduction to computational molecular evolution. In particular, we present the emergence of the use of likelihood-based methods, review the standard DNA substitution models, and introduce how model choice operates. We also present recent developments in inferring absolute divergence times and rates on a phylogeny, before showing how state-of-the-art models take inspiration from diffusion theory to link population genetics, which traditionally focuses at a taxonomic level below that of the species, and molecular evolution. Although this is not a cookbook chapter, we try and point to popular programs and implementations along the way.
Collapse
|
5
|
Le Gal F, Brichler S, Drugan T, Alloui C, Roulot D, Pawlotsky JM, Dény P, Gordien E. Genetic diversity and worldwide distribution of the deltavirus genus: A study of 2,152 clinical strains. Hepatology 2017; 66:1826-1841. [PMID: 28992360 DOI: 10.1002/hep.29574] [Citation(s) in RCA: 84] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/06/2017] [Accepted: 09/29/2017] [Indexed: 12/12/2022]
Abstract
UNLABELLED Hepatitis delta virus (HDV) is responsible for the most severe form of acute and chronic viral hepatitis. We previously proposed that the Deltavirus genus is composed of eight major clades. However, few sequences were available to confirm this classification. Moreover, little is known about the structural and functional consequences of HDV variability. One practical consequence is the failure of most quantification assays to properly detect or quantify plasmatic HDV RNA. Between 2001 and 2014, 2,152 HDV strains were prospectively collected and genotyped in our reference laboratory by means of nucleotide sequencing and extensive phylogenetic analyses of a 400-nucleotide region of the genome (R0) from nucleotides 889 to 1289 encompassing the 3' end of the delta protein-coding gene. In addition, the full-length genome sequence was generated for 116 strains selected from the different clusters, allowing for in-depth characterization of the HDV genotypes and subgenotypes. This study confirms that the HDV genus is composed of eight genotypes (HDV-1 to HDV-8) defined by an intergenotype similarity >85% or >80%, according to the partial or full-length genome sequence, respectively. Furthermore, genotypes can be segregated into two to four subgenotypes, characterized by an intersubgenotype similarity >90% (>84% for HDV-1) over the whole genome sequence. Systematic analysis of genome and protein sequences revealed highly conserved functional nucleotide and amino acid motifs and positions across all (sub)genotypes, indicating strong conservatory constraints on the structure and function of the genome and the protein. CONCLUSION This study provides insight into the genetic diversity of HDV and a clear view of its geographical localization and allows speculation as to the worldwide spread of the virus, very likely from an initial African origin. (Hepatology 2017;66:1826-1841).
Collapse
Affiliation(s)
- Frédéric Le Gal
- Laboratoire de Microbiologie Clinique, Hôpitaux Universitaires de Paris Seine Saint-Denis, Site Avicenne, Université Sorbonne Paris Cité, Bobigny, France.,Centre national de référence des virus des hépatites B, C et Delta, Laboratoire de Virologie, Bobigny, France
| | - Ségolène Brichler
- Laboratoire de Microbiologie Clinique, Hôpitaux Universitaires de Paris Seine Saint-Denis, Site Avicenne, Université Sorbonne Paris Cité, Bobigny, France.,Centre national de référence des virus des hépatites B, C et Delta, Laboratoire de Virologie, Bobigny, France.,Unité INSERM U955, Equipe 18, Créteil, France
| | - Tudor Drugan
- Department of Medical Informatics and Biostatistics, Iuliu Hatieganu University of Medicine and Pharmacy, Cluj-Napoca, Romania
| | - Chakib Alloui
- Laboratoire de Microbiologie Clinique, Hôpitaux Universitaires de Paris Seine Saint-Denis, Site Avicenne, Université Sorbonne Paris Cité, Bobigny, France.,Centre national de référence des virus des hépatites B, C et Delta, Laboratoire de Virologie, Bobigny, France
| | - Dominique Roulot
- Centre national de référence des virus des hépatites B, C et Delta, Laboratoire de Virologie, Bobigny, France.,Unité d'Hépatologie, Hôpitaux Universitaires de Paris Seine Saint-Denis, Site Avicenne, Université Sorbonne Paris Cité, Bobigny, France
| | - Jean-Michel Pawlotsky
- Unité INSERM U955, Equipe 18, Créteil, France.,Centre national de référence des virus des hépatites B, C et Delta, Département de Virologie, Hôpital Henri Mondor, Université Paris-Est, Créteil, France
| | - Paul Dény
- Laboratoire de Microbiologie Clinique, Hôpitaux Universitaires de Paris Seine Saint-Denis, Site Avicenne, Université Sorbonne Paris Cité, Bobigny, France.,Centre de Recherches en Cancérologie de Lyon, INSERM U1052, UMR CNRS 5286, Team Hepatocarcinogenesis and Viral Infection, Lyon, France
| | - Emmanuel Gordien
- Laboratoire de Microbiologie Clinique, Hôpitaux Universitaires de Paris Seine Saint-Denis, Site Avicenne, Université Sorbonne Paris Cité, Bobigny, France.,Centre national de référence des virus des hépatites B, C et Delta, Laboratoire de Virologie, Bobigny, France.,Unité INSERM U955, Equipe 18, Créteil, France
| |
Collapse
|
6
|
Shirvani-Dastgerdi E, Amini-Bavil-Olyaee S, Alavian SM, Trautwein C, Tacke F. Comprehensive analysis of mutations in the hepatitis delta virus genome based on full-length sequencing in a nationwide cohort study and evolutionary pattern during disease progression. Clin Microbiol Infect 2014; 21:510.e11-23. [PMID: 25656625 DOI: 10.1016/j.cmi.2014.12.008] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2014] [Revised: 11/21/2014] [Accepted: 12/18/2014] [Indexed: 02/06/2023]
Abstract
Delta hepatitis, caused by co-infection or super-infection of hepatitis D virus (HDV) in hepatitis B virus (HBV) -infected patients, is the most severe form of chronic hepatitis, often progressing to liver cirrhosis and liver failure. Although 15 million individuals are affected worldwide, molecular data on the HDV genome and its proteins, small and large delta antigen (S-/L-HDAg), are limited. We therefore conducted a nationwide study in HBV-HDV-infected patients from Iran and successfully amplified 38 HDV full genomes and 44 L-HDAg sequences from 34 individuals. Phylogenetic analyses of full-length HDV and L-HDAg isolates revealed that all strains clustered with genotype 1 and showed high genotypic distances to HDV genotypes 2 to 8, with a maximal distance to genotype 3. Longitudinal analyses in individual patients indicated a reverse evolutionary trend, especially in L-HDAg amino acid composition, over time. Besides multiple sequence variations in the hypervariable region of HDV, nucleotide substitutions preferentially occurred in the stabilizing P4 domain of the HDV ribozyme. A high rate of single amino acid changes was detected in structural parts of L-HDAg, whereas its post-translational modification sites were highly conserved. Interestingly, several non-synonymous mutations were positively selected that affected immunogenic epitopes of L-HDAg towards CD8 T-cell- and B-cell-driven immune responses. Hence, our comprehensive molecular analysis comprising a nationwide cohort revealed phylogenetic relationships and provided insight into viral evolution within individual hosts. Moreover, preferential areas of frequent mutations in the HDV ribozyme and antigen protein were determined in this study.
Collapse
Affiliation(s)
| | - S Amini-Bavil-Olyaee
- Department of Molecular Microbiology and Immunology, Keck School of Medicine, University of Southern California, Harlyne J. Norris Cancer Research Tower, Los Angeles, CA, USA
| | - S Moayed Alavian
- Baqiyatallah Research Centre for Gastroenterology and Liver Diseases, Baqiyatallah University of Medical Sciences, Tehran, Iran.
| | - C Trautwein
- Department of Medicine III, RWTH-University Hospital Aachen, Aachen, Germany
| | - F Tacke
- Department of Medicine III, RWTH-University Hospital Aachen, Aachen, Germany
| |
Collapse
|
7
|
Taylor JM. Host RNA circles and the origin of hepatitis delta virus. World J Gastroenterol 2014; 20:2971-2978. [PMID: 24659888 PMCID: PMC3961984 DOI: 10.3748/wjg.v20.i11.2971] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/12/2013] [Revised: 12/20/2013] [Accepted: 02/20/2014] [Indexed: 02/06/2023] Open
Abstract
Recent reports show that many cellular RNAs are processed to form circular species that are relatively abundant and resistant to host nucleases. In some cases, such circles actually bind host microRNAs. Such depletion of available microRNAs appears to have biological roles; for instance, in homeostasis and disease. These findings regarding host RNA circles support a speculative reappraisal of the origin and mode of replication of hepatitis delta virus, hepatitis delta virus (HDV), an agent with a small circular RNA genome; specifically, it is proposed that in hepatocytes infected with hepatitis B virus (HBV), some viral RNA species are processed to circular forms, which by a series of chance events lead to an RNA that can be both replicated by host enzymes and assembled, using HBV envelope proteins, to form particles some of which are infectious. Such a model also may provide some new insights into the potential pathogenic potential of HDV infections. In return, new insights into HDV might provide information leading to a better understanding of the roles of the host RNA circles.
Collapse
|
8
|
Gjini E, Haydon DT, David Barry J, Cobbold CA. Revisiting the diffusion approximation to estimate evolutionary rates of gene family diversification. J Theor Biol 2014; 341:111-22. [PMID: 24120993 DOI: 10.1016/j.jtbi.2013.10.001] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2012] [Revised: 06/21/2013] [Accepted: 10/02/2013] [Indexed: 11/18/2022]
Abstract
Genetic diversity in multigene families is shaped by multiple processes, including gene conversion and point mutation. Because multi-gene families are involved in crucial traits of organisms, quantifying the rates of their genetic diversification is important. With increasing availability of genomic data, there is a growing need for quantitative approaches that integrate the molecular evolution of gene families with their higher-scale function. In this study, we integrate a stochastic simulation framework with population genetics theory, namely the diffusion approximation, to investigate the dynamics of genetic diversification in a gene family. Duplicated genes can diverge and encode new functions as a result of point mutation, and become more similar through gene conversion. To model the evolution of pairwise identity in a multigene family, we first consider all conversion and mutation events in a discrete manner, keeping track of their details and times of occurrence; second we consider only the infinitesimal effect of these processes on pairwise identity accounting for random sampling of genes and positions. The purely stochastic approach is closer to biological reality and is based on many explicit parameters, such as conversion tract length and family size, but is more challenging analytically. The population genetics approach is an approximation accounting implicitly for point mutation and gene conversion, only in terms of per-site average probabilities. Comparison of these two approaches across a range of parameter combinations reveals that they are not entirely equivalent, but that for certain relevant regimes they do match. As an application of this modelling framework, we consider the distribution of nucleotide identity among VSG genes of African trypanosomes, representing the most prominent example of a multi-gene family mediating parasite antigenic variation and within-host immune evasion.
Collapse
Affiliation(s)
- Erida Gjini
- Instituto Gulbenkian de Ciência Oeiras, Portugal.
| | - Daniel T Haydon
- Institute of Biodiversity, Animal Health and Comparative Medicine, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, United Kingdom; The Boyd Orr Centre for Population and Ecosystem Health, University of Glasgow, Glasgow, United Kingdom; Wellcome Trust Centre for Molecular Parasitology, Institute of Infection, Immunity and Inflammation, University of Glasgow, Glasgow, United Kingdom
| | - J David Barry
- Wellcome Trust Centre for Molecular Parasitology, Institute of Infection, Immunity and Inflammation, University of Glasgow, Glasgow, United Kingdom
| | - Christina A Cobbold
- School of Mathematics and Statistics, College of Science and Engineering, University of Glasgow, Glasgow, United Kingdom; The Boyd Orr Centre for Population and Ecosystem Health, University of Glasgow, Glasgow, United Kingdom
| |
Collapse
|
9
|
Ghamari S, Alavian SM, Rizzetto M, Olivero A, Smedile A, Khedive A, Alavian SE, Zolfaghari MR, Jazayeri SM. Prevalence of hepatitis delta virus (HDV) infection in chronic hepatitis B patients with unusual clinical pictures. HEPATITIS MONTHLY 2013; 13:e6731. [PMID: 24098308 PMCID: PMC3787685 DOI: 10.5812/hepatmon.6731] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/10/2012] [Revised: 06/29/2013] [Accepted: 07/15/2013] [Indexed: 12/11/2022]
Abstract
BACKGROUND Probably 5% of the HBV carriers have HDV super infection. The risk of fulminant hepatitis, cirrhosis and hepatocellular carcinoma is higher in superinfection than the settings when HBV is alone. OBJECTIVES The aim of this study was to evaluate the prevalence of HDV in Iranian HBV isolates and to compare their clinical and virological pictures as well as their HDV genetic variations with other worldwide isolates. PATIENTS AND METHODS 81 carriers with positive results for HBsAg with upper limit ranges of ALT and low or undetectable levels of HBV viral load who did not respond to HBV therapy were selected. After RT amplification of HDV Delta antigen, direct sequencing and phylogenetic study were performed to explore the genotype(s) and nucleotide/amino acid variations. RESULTS 12 (14.8%) patients had positive results for both HDV RNA and anti-HDV. The mean ALT level was higher in HDV positive patients (75.9 U/ML) than HBV-mono-infected individuals; however, the mean HBV viral load was lower in coinfected patients than HBV-mono-infected patients. Phylogenetically, genotype I was the only detected genotype, and the most closely related isolates were of Turkish, Italian and Mongolian origin. Within the delta Ag, there were 326 nucleotide mutations, of which 111 and 215 were silent and missense, respectively. The total number of amino acid substitution was 148; most were located in known functional/epitopic domains. There was no correlation between the numbers of amino acid mutations, with clinical, virological status of the patients. CONCLUSIONS HDV should be suspected in HBV carriers with unusual clinical and virological pictures. Relatedness of Iranian HDV isolates to Italian and Turkish sequences proposed a common Caucasian origin for the distribution of HDV genotype I in this ethnic group.
Collapse
Affiliation(s)
- Shiva Ghamari
- Hepatitis B Molecular Laboratory, Department of Virology, School of Public Health, Tehran University of Medical Sciences, Tehran, IR Iran
| | - Seyed Moayed Alavian
- Baqiyatallah University of Medical Sciences, Baqiyatallah Research Centre for Gastroenterology and Liver Disease, Tehran, IR Iran
- Middle East Liver Diseases Center, Tehran, IR Iran
| | - Mario Rizzetto
- Department of Gastroenterology and Hepatology, San Giovanni Battista University Hospital (Molinette), Turin, Italy
| | - Antonella Olivero
- Department of Gastroenterology and Hepatology, San Giovanni Battista University Hospital (Molinette), Turin, Italy
| | - Antonina Smedile
- Department of Gastroenterology and Hepatology, San Giovanni Battista University Hospital (Molinette), Turin, Italy
| | - Abulfazl Khedive
- Hepatitis B Molecular Laboratory, Department of Virology, School of Public Health, Tehran University of Medical Sciences, Tehran, IR Iran
| | - Seyed Ehsan Alavian
- Baqiyatallah University of Medical Sciences, Baqiyatallah Research Centre for Gastroenterology and Liver Disease, Tehran, IR Iran
- Middle East Liver Diseases Center, Tehran, IR Iran
| | | | - Seyed Mohammad Jazayeri
- Hepatitis B Molecular Laboratory, Department of Virology, School of Public Health, Tehran University of Medical Sciences, Tehran, IR Iran
- Corresponding author: Seyed Mohammad Jazayeri, Hepatitis B Molecular Laboratory, Department of Virology, School of Public Health, Tehran University of Medical Sciences, Tehran, IR Iran. Tel/Fax: +98-2188992660, E-mail:
| |
Collapse
|
10
|
Chen X, Zhang Q, He C, Zhang L, Li J, Zhang W, Cao W, Lv YG, Liu Z, Zhang JX, Shao ZJ. Recombination and natural selection in hepatitis E virus genotypes. J Med Virol 2012; 84:1396-407. [PMID: 22825818 DOI: 10.1002/jmv.23237] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Abstract
To gain new insights into the evolutionary processes that created the genetic diversity of the hepatitis E virus (HEV), the Recombination Detection Program (RDP) and SimPlot program were employed to detect recombination events in the genome, then the fixed-effects likelihood (FEL) method was used to detect natural selection effects on viral proteins. Recombination analysis provided strong evidence for both intergenotype and intragenotype recombination events in the sequences analyzed. Recombination events were found to be distributed non-randomly, with the highest frequency in the X domain and the helicase. Strain DQ450072 was identified as intergenotype-recombinant. Natural selection analysis revealed that codons under both negative selection and positive selection were distributed non-randomly. ORF1 and ORF2 have experienced strong purifying selection across genotypes. Furthermore, potentially important sites were also found under positive selection in the N-terminal end of ORF2 and the C-terminal end of ORF3. No significant difference was found among the selective pressures on different genotypes.
Collapse
Affiliation(s)
- Xiaoming Chen
- Department of Epidemiology, School of Public Health, Fourth Military Medical University, Xi'an, China
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
11
|
Abstract
In this chapter, we give a brief yet self-contained introduction to computational molecular evolution. In particular, we present the emergence of the use of likelihood-based methods, review the standard DNA substitution models, and introduce how model choice operates. We also present recent developments in inferring absolute dates and rates on a phylogeny and show how state-of-the-art models take inspiration from diffusion theory to link population genetics, which traditionally focuses at a taxonomic level under that of species, and molecular evolution.
Collapse
|
12
|
Evolution and diversity of the human hepatitis d virus genome. Adv Bioinformatics 2010:323654. [PMID: 20204073 PMCID: PMC2829689 DOI: 10.1155/2010/323654] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2009] [Accepted: 12/11/2009] [Indexed: 12/17/2022] Open
Abstract
Human hepatitis delta virus (HDV) is the smallest RNA virus in genome. HDV genome is divided into a viroid-like sequence and a protein-coding sequence which could have originated from different resources and the HDV genome was eventually constituted through RNA recombination. The genome subsequently diversified through accumulation of mutations selected by interactions between the mutated RNA and proteins with host factors to successfully form the infectious virions. Therefore, we propose that the conservation of HDV nucleotide sequence is highly related with its functionality. Genome analysis of known HDV isolates shows that the C-terminal coding sequences of large delta antigen (LDAg) are the highest diversity than other regions of protein-coding sequences but they still retain biological functionality to interact with the heavy chain of clathrin can be selected and maintained. Since viruses interact with many host factors, including escaping the host immune response, how to design a program to predict RNA genome evolution is a great challenging work.
Collapse
|
13
|
Mazumder R, Hu ZZ, Vinayaka CR, Sagripanti JL, Frost SDW, Kosakovsky Pond SL, Wu CH. Computational analysis and identification of amino acid sites in dengue E proteins relevant to development of diagnostics and vaccines. Virus Genes 2007; 35:175-86. [PMID: 17508277 DOI: 10.1007/s11262-007-0103-2] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2007] [Accepted: 04/11/2007] [Indexed: 10/23/2022]
Abstract
We have identified 72 completely conserved amino acid residues in the E protein of major groups of the Flavivirus genus by computational analyses. In the dengue species we have identified 12 highly conserved sequence regions, 186 negatively selected sites, and many dengue serotype-specific negatively selected sites. The flavivirus-conserved sites included residues involved in forming six disulfide bonds crucial for the structural integrity of the protein, the fusion motif involved in viral infectivity, and the interface residues of the oligomers. The structural analysis of the E protein showed 19 surface-exposed non-conserved residues, 128 dimer or trimer interface residues, and regions, which undergo major conformational change during trimerization. Eleven consensus T(h)-cell epitopes common to all four dengue serotypes were predicted. Most of these corresponded to dengue-conserved regions or negatively selected sites. Of special interest are six singular sites (N(37), Q(211), D(215), P(217), H(244), K(246)) in dengue E protein that are conserved, are part of the predicted consensus T(h)-cell epitopes and are exposed in the dimer or trimer. We propose these sites and corresponding epitopic regions as potential candidates for prioritization by experimental biologists for development of diagnostics and vaccines that may be difficult to circumvent by natural or man-made alteration of dengue virus.
Collapse
Affiliation(s)
- Raja Mazumder
- Department of Biochemistry and Molecular & Cellular Biology, Georgetown University Medical Center, Washington, DC 20007, USA
| | | | | | | | | | | | | |
Collapse
|
14
|
Mes THM, van Putten JPM. Positively selected codons in immune-exposed loops of the vaccine candidate OMP-P1 of Haemophilus influenzae. J Mol Evol 2007; 64:411-22. [PMID: 17479342 PMCID: PMC1915622 DOI: 10.1007/s00239-006-0021-2] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2006] [Accepted: 01/11/2007] [Indexed: 11/28/2022]
Abstract
The high levels of variation in surface epitopes can be considered as an evolutionary hallmark of immune selection. New computational tools enable analysis of this variation by identifying codons that exhibit high rates of amino acid changes relative to the synonymous substitution rate. In the outer membrane protein P1 of Haemophilus influenzae, a vaccine candidate for nontypeable strains, we identified four codons with this attribute in domains that did not correspond to known or assumed B- and T-cell epitopes of OMP-P1. These codons flank hypervariable domains and do not appear to be false positives as judged from parsimony and maximum likelihood analyses. Some closely spaced positively selected codons have been previously considered part of a transmembrane domain, which would render this region unsuited for inclusion in a vaccine. Secondary structure analysis, three-dimensional structural database searches, and homology modeling using FadL of E. coli as a structural homologue, however, revealed that all positively selected codons are located in or near extracellular looping domains. The spacing and level of diversity of these positively selected and exposed codons in OMP-P1 suggest that vaccine targets based on these and conserved flanking residues may provide broad coverage in H. influenzae.
Collapse
Affiliation(s)
- Ted H M Mes
- Netherlands Institute of Ecology NIOO-KNAW, Centre for Estuarine and Marine Ecology, AC, Yerseke, The Netherlands.
| | | |
Collapse
|
15
|
Wang SY, Wu JC, Chiang TY, Huang YH, Su CW, Sheen IJ. Positive selection of hepatitis delta antigen in chronic hepatitis D patients. J Virol 2007; 81:4438-44. [PMID: 17301143 PMCID: PMC1900184 DOI: 10.1128/jvi.02847-06] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Liver disease may become ameliorated in some patients with chronic hepatitis D virus (HDV) infection. We present here a study based on longitudinal sampling to investigate the viral dynamics in chronic HDV infection. We examined the HDV variants from different time points, especially those before and after the elevation of serum aminotransferase levels. The datasets from each patient were tested for positive selection by using maximum-likelihood methods with heterogeneous selective pressures along the nucleotide sequence. An average of 4.9%, ranging from 3.1 to 6.8%, of the entire delta antigen sites was regulated by a diversifying selection. Most of the positively selected sites were associated with immunogenic domains. Likelihood ratio tests revealed a significant fitness of positive selection over neutrality of the hepatitis delta antigen gene in all patients. We further adapted a neural network method to predict potential cytotoxic T ligand epitopes. Among the HLA-A*0201 cytotoxic T ligand epitopes, three consistent epitopes across all three genotypes were identified: amino acids (aa) 43 to 51, 50 to 58, and 114 to 122. Three patients (60%) had sites evolving under positive selection in the epitope from aa 43 to 51, and four patients (80%) had sites evolving under positive selection in the epitope from aa 114 to 122. The discovery of immunogenic epitopes, especially cytotoxic-T-lymphocyte ligands, associated with chronic HDV infection may be crucial for further development of novel treatments or designs in vaccine for HDV superinfection.
Collapse
Affiliation(s)
- Shen-Yung Wang
- Department of Medical Research and Education, Taipei Veterans General Hospital, 201 Shih-Pai Road, Sec. 2, Taipei 11217, Taiwan
| | | | | | | | | | | |
Collapse
|
16
|
Mes THM, Doeleman M. Positive selection on transposase genes of insertion sequences in the Crocosphaera watsonii genome. J Bacteriol 2006; 188:7176-85. [PMID: 17015656 PMCID: PMC1636226 DOI: 10.1128/jb.01021-06] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Insertion sequences (ISs) are mobile elements that are commonly found in bacterial genomes. Here, the structural and functional diversity of these mobile elements in the genome of the cyanobacterium Crocosphaera watsonii WH8501 is analyzed. The number, distribution, and diversity of nucleotide and amino acid stretches with similarity to the transposase gene of this IS family suggested that this genome harbors many functional as well as truncated IS fragments. The selection pressure acting on full-length transposase open reading frames of these ISs suggested (i) the occurrence of positive selection and (ii) the presence of one or more positively selected codons. These results were obtained using three data sets of transposase genes from the same IS family that were collected based on the level of amino acid similarity, the presence of an inverted repeat, and the number of sequences in the data sets. Neither recombination nor ribosomal frameshifting, which may interfere with the selection analyses, appeared to be important forces in the transposase gene family. Some positively selected codons were located in a conserved domain, suggesting that these residues are functionally important. The finding that this type of selection acts on IS-carried genes is intriguing, because although ISs have been associated with the adaptation of the bacterial host to new environments, this has typically been attributed to transposition or transformation, thus involving different genomic locations. Intragenic adaptation of IS-carried genes identified here may constitute a novel mechanism associated with bacterial diversification and adaptation.
Collapse
MESH Headings
- Adaptation, Biological
- Amino Acid Sequence
- Base Sequence
- Cluster Analysis
- Codon/genetics
- Conserved Sequence
- Cyanobacteria/enzymology
- Cyanobacteria/genetics
- DNA Transposable Elements/genetics
- DNA, Bacterial/chemistry
- DNA, Bacterial/genetics
- Frameshifting, Ribosomal
- Genetic Variation
- Genome, Bacterial
- Molecular Sequence Data
- Phylogeny
- Recombination, Genetic
- Selection, Genetic
- Sequence Analysis, DNA
- Sequence Homology, Amino Acid
- Sequence Homology, Nucleic Acid
- Transposases/genetics
Collapse
Affiliation(s)
- Ted H M Mes
- Netherlands Institute of Ecology (NIOO-KNAW), Centre for Estuarine and Marine Ecology, POB 140, 4400 AC Yerseke, The Netherlands.
| | | |
Collapse
|
17
|
Balakirev ES, Anisimova M, Ayala FJ. Positive and negative selection in the beta-esterase gene cluster of the Drosophila melanogaster subgroup. J Mol Evol 2006; 62:496-510. [PMID: 16547641 DOI: 10.1007/s00239-005-0140-1] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2005] [Accepted: 12/20/2005] [Indexed: 11/25/2022]
Abstract
We examine the pattern of molecular evolution of the beta-esterase gene cluster, including the Est-6 and psiEst-6 genes, in eight species of the Drosophila melanogaster subgroup. Using maximum likelihood estimates of nonsynonymous/synonymous rate ratios, we show that the majority of Est-6 sites evolves under strong (48% of sites) or moderate (50% of sites) negative selection and a minority of sites (1.5%) is under significant positive selection. Est-6 sites likely to be under positive selection are associated with increased intraspecific variability. One positively selected site is responsible for the EST-6 F/S allozyme polymorphism; the same site is responsible for the EST-6 functional divergence between species of the melanogaster subgroup. For psiEst-6 83.7% sites evolve under negative selection, 16% sites evolve neutrally, and 0.3% sites are under positive selection. The positively selected sites of psiEst-6 are located at the beginning and at the end of the gene, where there is reduced divergence between D. melanogaster and D. simulans; these regions of psiEst-6 could be involved in regulation or some other function. Branch-site-specific analysis shows that the evolution of the melanogaster subgroup underwent episodic positive selection. Collating the present data with previous results for the beta-esterase genes, we propose that positive and negative selection are involved in a complex relationship that may be typical of the divergence of duplicate genes as one or both duplicates evolve a new function.
Collapse
Affiliation(s)
- Evgeniy S Balakirev
- Department of Ecology and Evolutionary Biology, University of California, 321 Steinhaus Hall, Irvine, CA 92697-2525, USA.
| | | | | |
Collapse
|
18
|
Abstract
We develop a new model for studying the molecular evolution of protein-coding DNA sequences. In contrast to existing models, we incorporate the potential for site-to-site heterogeneity of both synonymous and nonsynonymous substitution rates. We demonstrate that within-gene heterogeneity of synonymous substitution rates appears to be common. Using the new family of models, we investigate the utility of a variety of new statistical inference procedures, and we pay particular attention to issues surrounding the detection of sites undergoing positive selection. We discuss how failure to model synonymous rate variation in the model can lead to misidentification of sites as positively selected.
Collapse
|
19
|
Fry AJ, Wernegreen JJ. The roles of positive and negative selection in the molecular evolution of insect endosymbionts. Gene 2005; 355:1-10. [PMID: 16039807 DOI: 10.1016/j.gene.2005.05.021] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2005] [Revised: 03/29/2005] [Accepted: 05/17/2005] [Indexed: 11/19/2022]
Abstract
The evolutionary rate acceleration observed in most endosymbiotic bacteria may be explained by higher mutation rates, changes in selective pressure, and increased fixation of deleterious mutations by genetic drift. Here, we explore the forces influencing molecular evolution in Blochmannia, an obligate endosymbiont of Camponotus and related ant genera. Our goals were to compare rates of sequence evolution in Blochmannia with related bacteria, to explore variation in the strength and efficacy of negative (purifying) selection, and to evaluate the effect of positive selection. For six Blochmannia pairs, plus Buchnera and related enterobacteria, estimates of sequence divergence at four genes confirm faster rates of synonymous evolution in the ant mutualist. This conclusion is based on higher dS between Blochmannia lineages despite their more recent divergence. Likewise, generally higher dN in Blochmannia indicates faster rates of nonsynonymous substitution in this group. One exception is the groEL gene, for which lower dN and dN/dS compared to Buchnera indicate exceptionally strong negative selection in Blochmannia. In addition, we explored evidence for positive selection in Blochmannia using both site-and lineage-based maximum likelihood models. These approaches confirmed heterogeneity of dN/dS among codon sites and revealed significant variation in dN/dS across Blochmannia lineages for three genes. Lineage variation affected genes independently, with no evidence of parallel changes in dN/dS across genes along a given branch. Our data also reveal instances of dN/dS greater than one; however, we do not interpret these large dN/dS ratios as evidence for positive selection. In sum, while drift may contribute to an overall rate acceleration at nonsynonymous sites in Blochmannia, variable selective pressures best explain the apparent gene-specific changes in dN/dS across lineages of this ant mutualist. In the course of this study, we reanalyzed variation at Buchnera groEL and found no evidence of positive selection that was previously reported.
Collapse
Affiliation(s)
- Adam J Fry
- Josephine Bay Paul Center for Comparative Molecular Biology and Evolution, Marine Biological Laboratory, Woods Hole, MA 02543, USA
| | | |
Collapse
|