1
|
Nelson CW, Mirabello L. Human papillomavirus genomics: Understanding carcinogenicity. Tumour Virus Res 2023; 15:200258. [PMID: 36812987 PMCID: PMC10063409 DOI: 10.1016/j.tvr.2023.200258] [Citation(s) in RCA: 21] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2022] [Revised: 02/01/2023] [Accepted: 02/17/2023] [Indexed: 02/22/2023] Open
Abstract
Human papillomavirus (HPV) causes virtually all cervical cancers and many cancers at other anatomical sites in both men and women. However, only 12 of 448 known HPV types are currently classified as carcinogens, and even the most carcinogenic type - HPV16 - only rarely leads to cancer. HPV is therefore necessary but insufficient for cervical cancer, with other contributing factors including host and viral genetics. Over the last decade, HPV whole genome sequencing has established that even fine-scale within-type HPV variation influences precancer/cancer risks, and that these risks vary by histology and host race/ethnicity. In this review, we place these findings in the context of the HPV life cycle and evolution at various levels of viral diversity: between-type, within-type, and within-host. We also discuss key concepts necessary for interpreting HPV genomic data, including features of the viral genome; events leading to carcinogenesis; the role of APOBEC3 in HPV infection and evolution; and methodologies that use deep (high-coverage) sequencing to characterize within-host variation, as opposed to relying on a single representative (consensus) sequence. Given the continued high burden of HPV-associated cancers, understanding HPV carcinogenicity remains important for better understanding, preventing, and treating cancers attributable to infection.
Collapse
Affiliation(s)
- Chase W Nelson
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Rockville, MD, 20850, USA; Institute for Comparative Genomics, American Museum of Natural History, New York, NY, 10024, USA.
| | - Lisa Mirabello
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Rockville, MD, 20850, USA.
| |
Collapse
|
2
|
Pley C, Lourenço J, McNaughton AL, Matthews PC. Spacer Domain in Hepatitis B Virus Polymerase: Plugging a Hole or Performing a Role? J Virol 2022; 96:e0005122. [PMID: 35412348 PMCID: PMC9093120 DOI: 10.1128/jvi.00051-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2022] [Accepted: 03/14/2022] [Indexed: 11/25/2022] Open
Abstract
Hepatitis B virus (HBV) polymerase is divided into terminal protein, spacer, reverse transcriptase, and RNase domains. Spacer has previously been considered dispensable, merely acting as a tether between other domains or providing plasticity to accommodate deletions and mutations. We explore evidence for the role of spacer sequence, structure, and function in HBV evolution and lineage, consider its associations with escape from drugs, vaccines, and immune responses, and review its potential impacts on disease outcomes.
Collapse
Affiliation(s)
- Caitlin Pley
- School of Clinical Medicine, University of Cambridge, Cambridge, United Kingdom
- Guy’s and St Thomas’ NHS Foundation Trust, London, United Kingdom
| | - José Lourenço
- Department of Zoology, University of Oxford, Oxford, United Kingdom
- Biosystems and Integrative Sciences Institute, University of Lisbon, Lisbon, Portugal
| | - Anna L. McNaughton
- Population Health Science, Bristol Medical School, University of Bristol, Bristol, United Kingdom
- Nuffield Department of Medicine, University of Oxford Medawar Building, Oxford, United Kingdom
| | - Philippa C. Matthews
- Nuffield Department of Medicine, University of Oxford Medawar Building, Oxford, United Kingdom
- The Francis Crick Institute, London, United Kingdom
- Division of Infection and Immunity, University College London, London, United Kingdom
| |
Collapse
|
3
|
Genetic and phylogenetic characterization of polycistronic dsRNA segment-10 of bluetongue virus isolates from India between 1985 and 2011. Virus Genes 2021; 57:369-379. [PMID: 34120252 DOI: 10.1007/s11262-021-01855-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2021] [Accepted: 06/08/2021] [Indexed: 01/07/2023]
Abstract
The smallest polycistronic dsRNA segment-10 (S10) of bluetongue virus (BTV) encodes NS3/3A and putative NS5. The S10 sequence data of 46 Indian BTV field isolates obtained between 1985 and 2011 were determined and compared with the cognate sequences of global BTV strains. The largest ORF on S10 encodes NS3 (229 aa) and an amino-terminal truncated form of the protein (NS3A) and a putative NS5 (50-59 aa) due to alternate translation initiation site. The overall mean distance of the global NS3 was 0.1106 and 0.0269 at nt and deduced aa sequence, respectively. The global BTV strains formed four major clusters. The major cluster of Indian BTV strains was closely related to the viruses reported from Australia and China. A minor sub-cluster of Indian BTV strains were closely related to the USA strains and a few of the Indian strains were similar to the South African reference and vaccine strains. The global trait association of phylogenetic structure indicates the evolution of the global BTV S10 was not homogenous but rather represents a moderate level of geographical divergence. There was no evidence of an association between the virus and the host species, suggesting a random spread of the viruses. Conflicting selection pressure on the alternate coding sequences of the S10 was evident where NS3/3A might have evolved through strong purifying (negative) selection and NS5 through a positive selection. The presence of multiple positively selected codons on the putative NS5 may be advantageous for adaptation of the virus though their precise role is unknown.
Collapse
|
4
|
Pavesi A. Origin, Evolution and Stability of Overlapping Genes in Viruses: A Systematic Review. Genes (Basel) 2021; 12:genes12060809. [PMID: 34073395 PMCID: PMC8227390 DOI: 10.3390/genes12060809] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2021] [Revised: 05/22/2021] [Accepted: 05/24/2021] [Indexed: 12/11/2022] Open
Abstract
During their long evolutionary history viruses generated many proteins de novo by a mechanism called “overprinting”. Overprinting is a process in which critical nucleotide substitutions in a pre-existing gene can induce the expression of a novel protein by translation of an alternative open reading frame (ORF). Overlapping genes represent an intriguing example of adaptive conflict, because they simultaneously encode two proteins whose freedom to change is constrained by each other. However, overlapping genes are also a source of genetic novelties, as the constraints under which alternative ORFs evolve can give rise to proteins with unusual sequence properties, most importantly the potential for novel functions. Starting with the discovery of overlapping genes in phages infecting Escherichia coli, this review covers a range of studies dealing with detection of overlapping genes in small eukaryotic viruses (genomic length below 30 kb) and recognition of their critical role in the evolution of pathogenicity. Origin of overlapping genes, what factors favor their birth and retention, and how they manage their inherent adaptive conflict are extensively reviewed. Special attention is paid to the assembly of overlapping genes into ad hoc databases, suitable for future studies, and to the development of statistical methods for exploring viral genome sequences in search of undiscovered overlaps.
Collapse
Affiliation(s)
- Angelo Pavesi
- Department of Chemistry, Life Sciences and Environmental Sustainability, University of Parma, Parco Area delle Scienze 23/A, I-43124 Parma, Italy
| |
Collapse
|
5
|
Increased hepatitis B virus quasispecies diversity is correlated with liver fibrosis progression. INFECTION GENETICS AND EVOLUTION 2021; 93:104938. [PMID: 34029727 DOI: 10.1016/j.meegid.2021.104938] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/05/2021] [Revised: 05/08/2021] [Accepted: 05/12/2021] [Indexed: 12/30/2022]
Abstract
Host immune response and viral factors are involved in disease progression in patients with chronic hepatitis B virus (HBV) infection. However, the relationship between HBV quasispecies and liver fibrosis progression remains unclear. In this study, 447 patients with chronic HBV infection, including 239 with chronic hepatitis B (CHB), 104 with liver cirrhosis (LC) and 104 with hepatocellular carcinoma (HCC) were enrolled. The 239 CHB patients were divided into groups F1, F2, and F3 according to liver fibrosis score. Four fragments of the HBV genome were determined and analyzed using next-generation sequencing. Specific mutations, such as A1762T, G1764A and G1896A, in the BCP/PC region were more common in patients with advanced liver disease and formed the majority of the viral quasispecies pool in patients with LC and HCC. The viral complexity and diversity increased as the fibrosis progressed, especially in patients with CHB who were comparable in age but at different stages of fibrosis. Patients with early-stage fibrosis experienced higher purifying selection pressure in the four sequenced regions, whereas different protein-coding region experienced different negative selection with disease progression. HBV quasispecies diversity may increase fibrosis progression in CHB patients with aging under immune selection.
Collapse
|
6
|
Nelson CW, Ardern Z, Wei X. OLGenie: Estimating Natural Selection to Predict Functional Overlapping Genes. Mol Biol Evol 2021; 37:2440-2449. [PMID: 32243542 PMCID: PMC7531306 DOI: 10.1093/molbev/msaa087] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Purifying (negative) natural selection is a hallmark of functional biological sequences, and can be detected in protein-coding genes using the ratio of nonsynonymous to synonymous substitutions per site (dN/dS). However, when two genes overlap the same nucleotide sites in different frames, synonymous changes in one gene may be nonsynonymous in the other, perturbing dN/dS. Thus, scalable methods are needed to estimate functional constraint specifically for overlapping genes (OLGs). We propose OLGenie, which implements a modification of the Wei–Zhang method. Assessment with simulations and controls from viral genomes (58 OLGs and 176 non-OLGs) demonstrates low false-positive rates and good discriminatory ability in differentiating true OLGs from non-OLGs. We also apply OLGenie to the unresolved case of HIV-1’s putative antisense protein gene, showing significant purifying selection. OLGenie can be used to study known OLGs and to predict new OLGs in genome annotation. Software and example data are freely available at https://github.com/chasewnelson/OLGenie (last accessed April 10, 2020).
Collapse
Affiliation(s)
- Chase W Nelson
- Sackler Institute for Comparative Genomics, American Museum of Natural History, New York, NY.,Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
| | - Zachary Ardern
- Microbial Ecology, ZIEL-Institute for Food & Health, Technische Universität München, Freising, Germany
| | - Xinzhu Wei
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI.,Department of Integrative Biology and Statistics, University of California, Berkeley, CA
| |
Collapse
|
7
|
Willemsen A, Félez-Sánchez M, Bravo IG. Genome Plasticity in Papillomaviruses and De Novo Emergence of E5 Oncogenes. Genome Biol Evol 2019; 11:1602-1617. [PMID: 31076746 PMCID: PMC6557308 DOI: 10.1093/gbe/evz095] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/29/2019] [Indexed: 02/06/2023] Open
Abstract
The clinical presentations of papillomavirus (PV) infections come in many different flavors. While most PVs are part of a healthy skin microbiota and are not associated to physical lesions, other PVs cause benign lesions, and only a handful of PVs are associated to malignant transformations linked to the specific activities of the E5, E6, and E7 oncogenes. The functions and origin of E5 remain to be elucidated. These E5 open reading frames (ORFs) are present in the genomes of a few polyphyletic PV lineages, located between the early and the late viral gene cassettes. We have computationally assessed whether these E5 ORFs have a common origin and whether they display the properties of a genuine gene. Our results suggest that during the evolution of Papillomaviridae, at least four events lead to the presence of a long noncoding DNA stretch between the E2 and the L2 genes. In three of these events, the novel regions evolved coding capacity, becoming the extant E5 ORFs. We then focused on the evolution of the E5 genes in AlphaPVs infecting primates. The sharp match between the type of E5 protein encoded in AlphaPVs and the infection phenotype (cutaneous warts, genital warts, or anogenital cancers) supports the role of E5 in the differential oncogenic potential of these PVs. In our analyses, the best-supported scenario is that the five types of extant E5 proteins within the AlphaPV genomes may not have a common ancestor. However, the chemical similarities between E5s regarding amino acid composition prevent us from confidently rejecting the model of a common origin. Our evolutionary interpretation is that an originally noncoding region entered the genome of the ancestral AlphaPVs. This genetic novelty allowed to explore novel transcription potential, triggering an adaptive radiation that yielded three main viral lineages encoding for different E5 proteins, displaying distinct infection phenotypes. Overall, our results provide an evolutionary scenario for the de novo emergence of viral genes and illustrate the impact of such genotypic novelty in the phenotypic diversity of the viral infections.
Collapse
Affiliation(s)
- Anouk Willemsen
- Laboratory MIVEGEC (UMR CNRS IRD Uni Montpellier), Centre National de la Recherche Scientique (CNRS), Montpellier, France
| | - Marta Félez-Sánchez
- Infections and Cancer Laboratory, Catalan Institute of Oncology (ICO), Barcelona, Spain
| | - Ignacio G Bravo
- Laboratory MIVEGEC (UMR CNRS IRD Uni Montpellier), Centre National de la Recherche Scientique (CNRS), Montpellier, France
| |
Collapse
|
8
|
Phylogenetic analysis and signature of recombination hotspots in sugarcane mosaic virus infecting sugarcane in India. PHYTOPARASITICA 2019. [DOI: 10.1007/s12600-019-00726-1] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
|
9
|
Pavesi A, Vianelli A, Chirico N, Bao Y, Blinkova O, Belshaw R, Firth A, Karlin D. Overlapping genes and the proteins they encode differ significantly in their sequence composition from non-overlapping genes. PLoS One 2018; 13:e0202513. [PMID: 30339683 PMCID: PMC6195259 DOI: 10.1371/journal.pone.0202513] [Citation(s) in RCA: 32] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2018] [Accepted: 08/03/2018] [Indexed: 11/19/2022] Open
Abstract
Overlapping genes represent a fascinating evolutionary puzzle, since they encode two functionally unrelated proteins from the same DNA sequence. They originate by a mechanism of overprinting, in which point mutations in an existing frame allow the expression (the "birth") of a completely new protein from a second frame. In viruses, in which overlapping genes are abundant, these new proteins often play a critical role in infection, yet they are frequently overlooked during genome annotation. This results in erroneous interpretation of mutational studies and in a significant waste of resources. Therefore, overlapping genes need to be correctly detected, especially since they are now thought to be abundant also in eukaryotes. Developing better detection methods and conducting systematic evolutionary studies require a large, reliable benchmark dataset of known cases. We thus assembled a high-quality dataset of 80 viral overlapping genes whose expression is experimentally proven. Many of them were not present in databases. We found that overall, overlapping genes differ significantly from non-overlapping genes in their nucleotide and amino acid composition. In particular, the proteins they encode are enriched in high-degeneracy amino acids and depleted in low-degeneracy ones, which may alleviate the evolutionary constraints acting on overlapping genes. Principal component analysis revealed that the vast majority of overlapping genes follow a similar composition bias, despite their heterogeneity in length and function. Six proven mammalian overlapping genes also followed this bias. We propose that this apparently near-universal composition bias may either favour the birth of overlapping genes, or/and result from selection pressure acting on them.
Collapse
Affiliation(s)
- Angelo Pavesi
- Department of Chemistry, Life Sciences and Environmental Sustainability, University of Parma, Parma, Italy
| | - Alberto Vianelli
- Department of Theoretical and Applied Sciences, University of Insubria, Varese, Italy
| | - Nicola Chirico
- Department of Theoretical and Applied Sciences, University of Insubria, Varese, Italy
| | - Yiming Bao
- BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China
| | - Olga Blinkova
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, United States of America
| | - Robert Belshaw
- School of Biomedical & Healthcare Sciences, Plymouth University Peninsula Schools of Medicine and Dentistry (PUPSMD), Plymouth, United Kingdom
| | - Andrew Firth
- Department of Pathology, Division of Virology, University of Cambridge, Cambridge, United Kingdom
| | - David Karlin
- Department of Zoology, University of Oxford, Oxford, United Kingdom
- Division of Structural Biology, University of Oxford, Oxford, United Kingdom
| |
Collapse
|
10
|
van der Meijden E, Feltkamp M. The Human Polyomavirus Middle and Alternative T-Antigens; Thoughts on Roles and Relevance to Cancer. Front Microbiol 2018; 9:398. [PMID: 29568287 PMCID: PMC5852106 DOI: 10.3389/fmicb.2018.00398] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2018] [Accepted: 02/21/2018] [Indexed: 01/08/2023] Open
Abstract
Approximately 15–20% of human cancer is related to infection, which renders them potentially preventable by antimicrobial or antiviral therapy. Human polyomaviruses (PyVs) are relevant in this regard, as illustrated by the involvement of Merkel cell polyomavirus (MCPyV) in the development of Merkel cell carcinoma. The polyomavirus Small and Large tumor antigen (ST and LT) have been extensively studied with respect to their role in oncogenesis. Recently it was shown that a number of human PyVs, including MCPyV and the trichodysplasia spinulosa polyomavirus (TSPyV), express additional T-antigens called Middle T (MT) and alternative T (ALT). ALT is encoded by ORF5, also known as the alternative T open reading frame (ALTO), which also encodes the second exon of MT, and overlaps out-of-frame with the second exon of LT. Previously, MT was considered unique for oncogenic rodent polyomaviruses, and ALT was still unknown. In this mini-review, we want to point out there are important reasons to explore the involvement of MT and ALT in human cellular transformation. First, just like their rodent equivalents, MT and ALT probably disrupt cellular pathways that control signaling and proliferation. Second, expression of the MT and ALT-encoding ORF5/ALTO characterizes a monophyletic polyomavirus clade that includes human and animal PyVs with known oncogenic potential. And third, ORF5/ALTO is subject to strong positive selection aimed specifically at a short linear motif within MT and ALT that overlaps completely with the RB-binding motif in LT. The latter suggests tight interplay between these T-antigens with possible consequences for cell transformation.
Collapse
Affiliation(s)
- Els van der Meijden
- Department of Medical Microbiology, Leiden University Medical Center, Leiden, Netherlands
| | - Mariet Feltkamp
- Department of Medical Microbiology, Leiden University Medical Center, Leiden, Netherlands
| |
Collapse
|
11
|
Mirabello L, Yeager M, Yu K, Clifford GM, Xiao Y, Zhu B, Cullen M, Boland JF, Wentzensen N, Nelson CW, Raine-Bennett T, Chen Z, Bass S, Song L, Yang Q, Steinberg M, Burdett L, Dean M, Roberson D, Mitchell J, Lorey T, Franceschi S, Castle PE, Walker J, Zuna R, Kreimer AR, Beachler DC, Hildesheim A, Gonzalez P, Porras C, Burk RD, Schiffman M. HPV16 E7 Genetic Conservation Is Critical to Carcinogenesis. Cell 2017; 170:1164-1174.e6. [PMID: 28886384 PMCID: PMC5674785 DOI: 10.1016/j.cell.2017.08.001] [Citation(s) in RCA: 183] [Impact Index Per Article: 26.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2017] [Revised: 06/05/2017] [Accepted: 08/01/2017] [Indexed: 12/29/2022]
Abstract
Although most cervical human papillomavirus type 16 (HPV16) infections become undetectable within 1-2 years, persistent HPV16 causes half of all cervical cancers. We used a novel HPV whole-genome sequencing technique to evaluate an exceptionally large collection of 5,570 HPV16-infected case-control samples to determine whether viral genetic variation influences risk of cervical precancer and cancer. We observed thousands of unique HPV16 genomes; very few women shared the identical HPV16 sequence, which should stimulate a careful re-evaluation of the clinical implications of HPV mutation rates, transmission, clearance, and persistence. In case-control analyses, HPV16 in the controls had significantly more amino acid changing variants throughout the genome. Strikingly, E7 was devoid of variants in precancers/cancers compared to higher levels in the controls; we confirmed this in cancers from around the world. Strict conservation of the 98 amino acids of E7, which disrupts Rb function, is critical for HPV16 carcinogenesis, presenting a highly specific target for etiologic and therapeutic research.
Collapse
Affiliation(s)
- Lisa Mirabello
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Rockville, MD, USA.
| | - Meredith Yeager
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Rockville, MD, USA; Cancer Genomics Research Laboratory, Leidos Biomedical Research, Inc., Frederick, MD, USA
| | - Kai Yu
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Rockville, MD, USA
| | - Gary M Clifford
- Infections and Cancer Epidemiology Group, International Agency for Research on Cancer 150, Cours Albert Thomas, 69372 Lyon Cedex 08, France
| | - Yanzi Xiao
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Rockville, MD, USA
| | - Bin Zhu
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Rockville, MD, USA
| | - Michael Cullen
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Rockville, MD, USA; Cancer Genomics Research Laboratory, Leidos Biomedical Research, Inc., Frederick, MD, USA
| | - Joseph F Boland
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Rockville, MD, USA; Cancer Genomics Research Laboratory, Leidos Biomedical Research, Inc., Frederick, MD, USA
| | - Nicolas Wentzensen
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Rockville, MD, USA
| | - Chase W Nelson
- Sackler Institute for Comparative Genomics, American Museum of Natural History, New York City, NY, USA
| | - Tina Raine-Bennett
- Women's Health Research Institute, Division of Research, Kaiser Permanente Northern California, Oakland, CA, USA
| | - Zigui Chen
- Department of Microbiology, The Chinese University of Hong Kong, Shatin, Hong Kong
| | - Sara Bass
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Rockville, MD, USA; Cancer Genomics Research Laboratory, Leidos Biomedical Research, Inc., Frederick, MD, USA
| | - Lei Song
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Rockville, MD, USA; Cancer Genomics Research Laboratory, Leidos Biomedical Research, Inc., Frederick, MD, USA
| | - Qi Yang
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Rockville, MD, USA; Cancer Genomics Research Laboratory, Leidos Biomedical Research, Inc., Frederick, MD, USA
| | - Mia Steinberg
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Rockville, MD, USA; Cancer Genomics Research Laboratory, Leidos Biomedical Research, Inc., Frederick, MD, USA
| | - Laurie Burdett
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Rockville, MD, USA; Cancer Genomics Research Laboratory, Leidos Biomedical Research, Inc., Frederick, MD, USA
| | - Michael Dean
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Rockville, MD, USA
| | - David Roberson
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Rockville, MD, USA; Cancer Genomics Research Laboratory, Leidos Biomedical Research, Inc., Frederick, MD, USA
| | - Jason Mitchell
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Rockville, MD, USA; Cancer Genomics Research Laboratory, Leidos Biomedical Research, Inc., Frederick, MD, USA
| | - Thomas Lorey
- Regional Laboratory, Kaiser Permanente Northern California, Oakland, CA, USA
| | - Silvia Franceschi
- Infections and Cancer Epidemiology Group, International Agency for Research on Cancer 150, Cours Albert Thomas, 69372 Lyon Cedex 08, France
| | - Philip E Castle
- Department of Epidemiology and Population Health, Albert Einstein College of Medicine, Bronx, NY, USA
| | - Joan Walker
- University of Oklahoma Health Sciences Center, Oklahoma City, OK, USA
| | - Rosemary Zuna
- University of Oklahoma Health Sciences Center, Oklahoma City, OK, USA
| | - Aimée R Kreimer
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Rockville, MD, USA
| | - Daniel C Beachler
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Rockville, MD, USA; HealthCore Inc., Safety and Epidemiology, Wilmington, DE, USA
| | - Allan Hildesheim
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Rockville, MD, USA
| | - Paula Gonzalez
- Agencia Costarricense de Investigaciones Biomédicas (ACIB), former Proyecto Epidemiológico Guanacaste, Fundación INCIENSA, Guanacaste, Costa Rica
| | - Carolina Porras
- Agencia Costarricense de Investigaciones Biomédicas (ACIB), former Proyecto Epidemiológico Guanacaste, Fundación INCIENSA, Guanacaste, Costa Rica
| | - Robert D Burk
- Department of Epidemiology and Population Health, Albert Einstein College of Medicine, Bronx, NY, USA; Departments of Pediatrics, Microbiology and Immunology, and Obstetrics & Gynecology and Women's Health, Albert Einstein College of Medicine, Bronx, NY, USA
| | - Mark Schiffman
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Rockville, MD, USA
| |
Collapse
|
12
|
Li S, Wang Z, Li Y, Ding G. Adaptive evolution of proteins in hepatitis B virus during divergence of genotypes. Sci Rep 2017; 7:1990. [PMID: 28512348 PMCID: PMC5434055 DOI: 10.1038/s41598-017-02012-8] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2017] [Accepted: 04/03/2017] [Indexed: 12/12/2022] Open
Abstract
Hepatitis B virus (HBV) is classified into several genotypes, correlated with different geographic distributions, clinical outcomes and susceptible human populations. It is crucial to investigate the evolutionary significance behind the diversification of HBV genotypes, because it improves our understanding of their pathological differences and pathogen-host interactions. Here, we performed comprehensive analysis of HBV genome sequences collected from public database. With a stringent criteria, we generated a dataset of 2992 HBV genomes from eight major genotypes. In particular, we applied a specified classification of non-synonymous and synonymous variants in overlapping regions, to distinguish joint and independent gene evolutions. We confirmed the presence of selective constraints over non-synonymous variants in consideration of overlapping regions. We then performed the McDonald-Kreitman test and revealed adaptive evolutions of non-synonymous variants during genotypic differentiation. Remarkably, we identified strong positive selection that drove the differentiation of PreS1 domain, which is an essential regulator involved in viral transmission. Our study presents novel evidences for the adaptive evolution of HBV genotypes, which suggests that these viruses evolve directionally for maintenance or improvement of successful infections.
Collapse
Affiliation(s)
- Shengdi Li
- Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, 320 Yue Yang Road, 200031, Shanghai, China
- University of Chinese Academy of Sciences, 19 Yuquan Road, Beijing, 100049, China
| | - Zhen Wang
- Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, 320 Yue Yang Road, 200031, Shanghai, China
| | - Yixue Li
- Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, 320 Yue Yang Road, 200031, Shanghai, China.
- Shanghai Center for Bioinformation Technology, Shanghai Industrial Technology Institute, 1278 Keyuan Road, 201203, Shanghai, China.
| | - Guohui Ding
- Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, 320 Yue Yang Road, 200031, Shanghai, China.
- Shanghai Center for Bioinformation Technology, Shanghai Industrial Technology Institute, 1278 Keyuan Road, 201203, Shanghai, China.
| |
Collapse
|
13
|
Mengual-Chuliá B, Bedhomme S, Lafforgue G, Elena SF, Bravo IG. Assessing parallel gene histories in viral genomes. BMC Evol Biol 2016; 16:32. [PMID: 26847371 PMCID: PMC4743424 DOI: 10.1186/s12862-016-0605-4] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2015] [Accepted: 01/29/2016] [Indexed: 01/08/2023] Open
Abstract
BACKGROUND The increasing abundance of sequence data has exacerbated a long known problem: gene trees and species trees for the same terminal taxa are often incongruent. Indeed, genes within a genome have not all followed the same evolutionary path due to events such as incomplete lineage sorting, horizontal gene transfer, gene duplication and deletion, or recombination. Considering conflicts between gene trees as an obstacle, numerous methods have been developed to deal with these incongruences and to reconstruct consensus evolutionary histories of species despite the heterogeneity in the history of their genes. However, inconsistencies can also be seen as a source of information about the specific evolutionary processes that have shaped genomes. RESULTS The goal of the approach here proposed is to exploit this conflicting information: we have compiled eleven variables describing phylogenetic relationships and evolutionary pressures and submitted them to dimensionality reduction techniques to identify genes with similar evolutionary histories. To illustrate the applicability of the method, we have chosen two viral datasets, namely papillomaviruses and Turnip mosaic virus (TuMV) isolates, largely dissimilar in genome, evolutionary distance and biology. Our method pinpoints viral genes with common evolutionary patterns. In the case of papillomaviruses, gene clusters match well our knowledge on viral biology and life cycle, illustrating the potential of our approach. For the less known TuMV, our results trigger new hypotheses about viral evolution and gene interaction. CONCLUSIONS The approach here presented allows turning phylogenetic inconsistencies into evolutionary information, detecting gene assemblies with similar histories, and could be a powerful tool for comparative pathogenomics.
Collapse
Affiliation(s)
- Beatriz Mengual-Chuliá
- Infections and Cancer Laboratory, Catalan Institute of Oncology (ICO), Barcelona, Spain.,Bellvitge Institute of Biomedical Research (IDIBELL), Barcelona, Spain
| | - Stéphanie Bedhomme
- Infections and Cancer Laboratory, Catalan Institute of Oncology (ICO), Barcelona, Spain.,Bellvitge Institute of Biomedical Research (IDIBELL), Barcelona, Spain.,Centre d'Ecologie Fonctionnelle et Evolutive, UMR CNRS 5175, Montpellier, France
| | - Guillaume Lafforgue
- Centre d'Ecologie Fonctionnelle et Evolutive, UMR CNRS 5175, Montpellier, France.,Instituto de Biología Molecular y Celular de Plantas, Consejo Superior de Investigaciones Científicas-Universidad Politécnica de Valencia, València, Spain
| | - Santiago F Elena
- Instituto de Biología Molecular y Celular de Plantas, Consejo Superior de Investigaciones Científicas-Universidad Politécnica de Valencia, València, Spain.,I2SysBio, Consejo Superior de Investigaciones Científicas-Universitat de València, València, Spain.,The Santa Fe Institute, Santa Fe, NM, USA
| | - Ignacio G Bravo
- Infections and Cancer Laboratory, Catalan Institute of Oncology (ICO), Barcelona, Spain. .,MIVEGEC (UMR CNRS 5290, IRD 224, UM), National Center for Scientific Research (CNRS), Montpellier, France. .,National Center for Scientific Research (CNRS), Maladies Infectieuses et Vecteurs: Ecologie, Génétique, Evolution et Contrôle (MIVEGEC), UMR CNRS 5290, IRD 224, UM, 911 Avenue Agropolis, BP 64501, 34394, Montpellier, Cedex 5, France.
| |
Collapse
|
14
|
Stewart M, Hardy A, Barry G, Pinto RM, Caporale M, Melzi E, Hughes J, Taggart A, Janowicz A, Varela M, Ratinier M, Palmarini M. Characterization of a second open reading frame in genome segment 10 of bluetongue virus. J Gen Virol 2015; 96:3280-3293. [PMID: 26290332 PMCID: PMC4806581 DOI: 10.1099/jgv.0.000267] [Citation(s) in RCA: 78] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
Viruses have often evolved overlapping reading frames in order to maximize their coding capacity. Until recently, the segmented dsRNA genome of viruses of the Orbivirus genus was thought to be monocistronic, but the identification of the bluetongue virus (BTV) NS4 protein changed this assumption. A small ORF in segment 10, overlapping the NS3 ORF in the +1 position, is maintained in more than 300 strains of the 27 different BTV serotypes and in more than 200 strains of the phylogenetically related African horse sickness virus (AHSV). In BTV, this ORF (named S10-ORF2 in this study) encodes a putative protein 50–59 residues in length and appears to be under strong positive selection. HA- or GFP-tagged versions of S10-ORF2 expressed from transfected plasmids localized within the nucleoli of transfected cells, unless a putative nucleolar localization signal was mutated. S10-ORF2 inhibited gene expression, but not RNA translation, in transient transfection reporter assays. In both mammalian and insect cells, BTV S10-ORF2 deletion mutants (BTV8ΔS10-ORF2) displayed similar replication kinetics to wt virus. In vivo, S10-ORF2 deletion mutants were pathogenic in mouse models of disease. Although further evidence is required for S10-ORF2 expression during infection, the data presented provide an initial characterization of this ORF.
Collapse
Affiliation(s)
- Meredith Stewart
- MRC-University of Glasgow Centre for Virus Research, Glasgow, UK
| | - Alexandra Hardy
- MRC-University of Glasgow Centre for Virus Research, Glasgow, UK
| | - Gerald Barry
- MRC-University of Glasgow Centre for Virus Research, Glasgow, UK
| | - Rute Maria Pinto
- MRC-University of Glasgow Centre for Virus Research, Glasgow, UK
| | - Marco Caporale
- MRC-University of Glasgow Centre for Virus Research, Glasgow, UK.,Istituto Zooprofilattico Sperimentale dell'Abruzzo e Molise 'G. Caporale', Teramo, Italy
| | - Eleonora Melzi
- MRC-University of Glasgow Centre for Virus Research, Glasgow, UK
| | - Joseph Hughes
- MRC-University of Glasgow Centre for Virus Research, Glasgow, UK
| | - Aislynn Taggart
- MRC-University of Glasgow Centre for Virus Research, Glasgow, UK
| | - Anna Janowicz
- MRC-University of Glasgow Centre for Virus Research, Glasgow, UK
| | - Mariana Varela
- MRC-University of Glasgow Centre for Virus Research, Glasgow, UK
| | - Maxime Ratinier
- MRC-University of Glasgow Centre for Virus Research, Glasgow, UK
| | | |
Collapse
|
15
|
Félez-Sánchez M, Trösemeier JH, Bedhomme S, González-Bravo MI, Kamp C, Bravo IG. Cancer, Warts, or Asymptomatic Infections: Clinical Presentation Matches Codon Usage Preferences in Human Papillomaviruses. Genome Biol Evol 2015; 7:2117-35. [PMID: 26139833 PMCID: PMC4558848 DOI: 10.1093/gbe/evv129] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open
Abstract
Viruses rely completely on the hosts’ machinery for translation of viral transcripts. However, for most viruses infecting humans, codon usage preferences (CUPrefs) do not match those of the host. Human papillomaviruses (HPVs) are a showcase to tackle this paradox: they present a large genotypic diversity and a broad range of phenotypic presentations, from asymptomatic infections to productive lesions and cancer. By applying phylogenetic inference and dimensionality reduction methods, we demonstrate first that genes in HPVs are poorly adapted to the average human CUPrefs, the only exception being capsid genes in viruses causing productive lesions. Phylogenetic relationships between HPVs explained only a small proportion of CUPrefs variation. Instead, the most important explanatory factor for viral CUPrefs was infection phenotype, as orthologous genes in viruses with similar clinical presentation displayed similar CUPrefs. Moreover, viral genes with similar spatiotemporal expression patterns also showed similar CUPrefs. Our results suggest that CUPrefs in HPVs reflect either variations in the mutation bias or differential selection pressures depending on the clinical presentation and expression timing. We propose that poor viral CUPrefs may be central to a trade-off between strong viral gene expression and the potential for eliciting protective immune response.
Collapse
Affiliation(s)
- Marta Félez-Sánchez
- Infections and Cancer Laboratory, Catalan Institute of Oncology, L'Hospitalet de Llobregat, Barcelona, Spain Virus and Cancer Laboratory. Bellvitge Institute of Biomedical Research (IDIBELL), L'Hospitalet de Llobregat, Barcelona, Spain
| | - Jan-Hendrik Trösemeier
- Molecular Bioinformatics, Institute of Computer Science, Johann Wolfgang Goethe University, Frankfurt am Main, Germany Paul-Ehrlich-Institut, Federal Institute for Vaccines and Biomedicines, Langen, Germany
| | - Stéphanie Bedhomme
- Infections and Cancer Laboratory, Catalan Institute of Oncology, L'Hospitalet de Llobregat, Barcelona, Spain Virus and Cancer Laboratory. Bellvitge Institute of Biomedical Research (IDIBELL), L'Hospitalet de Llobregat, Barcelona, Spain Département d'Ecologie Evolutive Centre d'Ecologie Fonctionnelle et Evolutive, CNRS - UMR 5175, Montpellier, France
| | | | - Christel Kamp
- Paul-Ehrlich-Institut, Federal Institute for Vaccines and Biomedicines, Langen, Germany
| | - Ignacio G Bravo
- Infections and Cancer Laboratory, Catalan Institute of Oncology, L'Hospitalet de Llobregat, Barcelona, Spain Virus and Cancer Laboratory. Bellvitge Institute of Biomedical Research (IDIBELL), L'Hospitalet de Llobregat, Barcelona, Spain
| |
Collapse
|
16
|
Lauber C, Kazem S, Kravchenko AA, Feltkamp MCW, Gorbalenya AE. Interspecific adaptation by binary choice at de novo polyomavirus T antigen site through accelerated codon-constrained Val-Ala toggling within an intrinsically disordered region. Nucleic Acids Res 2015; 43:4800-13. [PMID: 25904630 PMCID: PMC4446436 DOI: 10.1093/nar/gkv378] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2014] [Revised: 03/22/2015] [Accepted: 04/09/2015] [Indexed: 11/30/2022] Open
Abstract
It is common knowledge that conserved residues evolve slowly. We challenge generality of this central tenet of molecular biology by describing the fast evolution of a conserved nucleotide position that is located in the overlap of two open reading frames (ORFs) of polyomaviruses. The de novo ORF is expressed through either the ALTO protein or the Middle T antigen (MT/ALTO), while the ancestral ORF encodes the N-terminal domain of helicase-containing Large T (LT) antigen. In the latter domain the conserved Cys codon of the LXCXE pRB-binding motif constrains codon evolution in the overlapping MT/ALTO ORF to a binary choice between Val and Ala codons, termed here as codon-constrained Val-Ala (COCO-VA) toggling. We found the rate of COCO-VA toggling to approach the speciation rate and to be significantly accelerated compared to the baseline rate of chance substitution in a large monophyletic lineage including all viruses encoding MT/ALTO and three others. Importantly, the COCO-VA site is located in a short linear motif (SLiM) of an intrinsically disordered region, a typical characteristic of adaptive responders. These findings provide evidence that the COCO-VA toggling is under positive selection in many polyomaviruses, implying its critical role in interspecific adaptation, which is unprecedented for conserved residues.
Collapse
Affiliation(s)
- Chris Lauber
- Department of Medical Microbiology, Leiden University Medical Center, 2300-RC Leiden, The Netherlands Institute for Medical Informatics and Biometry, Technische Universität Dresden, 01307 Dresden, Germany
| | - Siamaque Kazem
- Department of Medical Microbiology, Leiden University Medical Center, 2300-RC Leiden, The Netherlands
| | - Alexander A Kravchenko
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, 119899 Moscow, Russia
| | - Mariet C W Feltkamp
- Department of Medical Microbiology, Leiden University Medical Center, 2300-RC Leiden, The Netherlands
| | - Alexander E Gorbalenya
- Department of Medical Microbiology, Leiden University Medical Center, 2300-RC Leiden, The Netherlands Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, 119899 Moscow, Russia Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, 119899 Moscow, Russia
| |
Collapse
|
17
|
Nelson CW, Hughes AL. Within-host nucleotide diversity of virus populations: insights from next-generation sequencing. INFECTION, GENETICS AND EVOLUTION : JOURNAL OF MOLECULAR EPIDEMIOLOGY AND EVOLUTIONARY GENETICS IN INFECTIOUS DISEASES 2015; 30:1-7. [PMID: 25481279 PMCID: PMC4316684 DOI: 10.1016/j.meegid.2014.11.026] [Citation(s) in RCA: 61] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/02/2014] [Revised: 11/26/2014] [Accepted: 11/27/2014] [Indexed: 01/03/2023]
Abstract
Next-generation sequencing (NGS) technology offers new opportunities for understanding the evolution and dynamics of viral populations within individual hosts over the course of infection. We review simple methods for estimating synonymous and nonsynonymous nucleotide diversity in viral genes from NGS data without the need for inferring linkage. We discuss the potential usefulness of these data for addressing questions of both practical and theoretical interest, including fundamental questions regarding the effective population sizes of within-host viral populations and the modes of natural selection acting on them.
Collapse
Affiliation(s)
- Chase W Nelson
- Department of Biological Sciences, University of South Carolina, Columbia, SC 29208, USA
| | - Austin L Hughes
- Department of Biological Sciences, University of South Carolina, Columbia, SC 29208, USA.
| |
Collapse
|
18
|
Abstract
Overlapping genes are two protein-coding sequences sharing a significant part of the same DNA locus in different reading frames. Although in recent times an increasing number of examples have been found in bacteria the underlying mechanisms of their evolution are unknown. In this work we explore how selective pressure in a protein-coding sequence influences its overlapping genes in alternative reading frames. We model evolution using a time-continuous Markov process and derive the corresponding model for the remaining frames to quantify selection pressure and genetic noise. Our findings lead to the presumption that, once information is embedded in the reverse reading frame −2 (relative to the mother gene in +1) purifying selection in the protein-coding reading frame automatically protects the sequences in both frames. We also found that this coincides with the fact that the genetic noise measured using the conditional entropy is minimal in frame −2 under selection in the coding frame.
Collapse
Affiliation(s)
- Katharina Mir
- Institute of Communications Engineering, Ulm University, Ulm, Germany
- * E-mail:
| | - Steffen Schober
- Institute of Communications Engineering, Ulm University, Ulm, Germany
| |
Collapse
|
19
|
Analysis of Sequence Diversity of Human Metapneumovirus Collected from Young Children with Acute Respiratory Tract Infections in South India. Mol Diagn Ther 2013; 17:247-55. [DOI: 10.1007/s40291-013-0032-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
|
20
|
Torres C, Fernández MDB, Flichman DM, Campos RH, Mbayed VA. Influence of overlapping genes on the evolution of human hepatitis B virus. Virology 2013; 441:40-8. [PMID: 23541083 DOI: 10.1016/j.virol.2013.02.027] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2013] [Revised: 02/05/2013] [Accepted: 02/28/2013] [Indexed: 12/23/2022]
Abstract
The aim of this work was to analyse the influence of overlapping genes on the evolution of hepatitis B virus (HBV). A differential evolutionary behaviour among genetic regions and clinical status was found. Dissimilar levels of conservation of the different protein regions could derive from alternative mechanisms to maintain functionality. We propose that, in overlapping regions, selective constraints on one of the genes could drive the substitution process. This would allow protein conservation in one gene by synonymous substitutions while mechanisms of tolerance to the change operate in the overlapping gene (e.g. usage of amino acids with high-degeneracy codons, differential codon usage and replacement by physicochemically similar amino acids). In addition, differential selection pressure according to the HBeAg status was found in all genes, suggesting that the immune response could be one of the factors that would constrain viral replication by interacting with different HBV proteins during the HBeAg(-) stage.
Collapse
Affiliation(s)
- Carolina Torres
- Cátedra de Virología, Facultad de Farmacia y Bioquímica, Universidad de Buenos Aires, Ciudad Autónoma de Buenos Aires, Argentina; CONICET, Argentina
| | | | | | | | | |
Collapse
|
21
|
Zarghani SN, Shams-Bakhsh M, Zand N, Sokhandan-Bashir N, Pazhouhandeh M. Genetic analysis of Iranian population of Potato leafroll virus based on ORF0. Virus Genes 2012; 45:567-74. [PMID: 22903753 DOI: 10.1007/s11262-012-0804-z] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2012] [Accepted: 08/06/2012] [Indexed: 11/25/2022]
Abstract
Potato leafroll virus (PLRV) is a destructive virus of potatoes and responsible for high yield losses wherever potatoes are grown. In this study, DNA fragments containing ORF0 from each of nine PLRV isolates was sequenced. Sequence analysis data using 36 isolates from 12 different countries including 14 Iranian isolates showed that the identities of ORF0 at both nucleotide and amino acid levels between the Iranian isolates were 96-100 % and these isolates were more similar to the European PLRV isolates than to the other isolates. Furthermore, phylogenetic and population genetic analysis were carried out on the basis of full-length ORF0 and overlapping and non-overlapping regions of ORF0 and ORF1 (ORF0/1) which revealed that PLRV isolates were not geographically resolved. Also, we identified negative selection with different ratios for each of the mentioned genomic regions suggesting effects of F-box motif and -1 frameshift on ORF0 non-overlapping region and ORF0/1 in the selection pressure, respectively. Five recombination events were detected in the Iranian, Australian, and European isolates suggesting an important role for this phenomenon in influencing genetic diversity within this virus population.
Collapse
|
22
|
Sabath N, Wagner A, Karlin D. Evolution of viral proteins originated de novo by overprinting. Mol Biol Evol 2012; 29:3767-80. [PMID: 22821011 PMCID: PMC3494269 DOI: 10.1093/molbev/mss179] [Citation(s) in RCA: 104] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open
Abstract
New protein-coding genes can originate either through modification of existing genes or de novo. Recently, the importance of de novo origination has been recognized in eukaryotes, although eukaryotic genes originated de novo are relatively rare and difficult to identify. In contrast, viruses contain many de novo genes, namely those in which an existing gene has been “overprinted” by a new open reading frame, a process that generates a new protein-coding gene overlapping the ancestral gene. We analyzed the evolution of 12 experimentally validated viral genes that originated de novo and estimated their relative ages. We found that young de novo genes have a different codon usage from the rest of the genome. They evolve rapidly and are under positive or weak purifying selection. Thus, young de novo genes might have strain-specific functions, or no function, and would be difficult to detect using current genome annotation methods that rely on the sequence signature of purifying selection. In contrast to young de novo genes, older de novo genes have a codon usage that is similar to the rest of the genome. They evolve slowly and are under stronger purifying selection. Some of the oldest de novo genes evolve under stronger selection pressure than the ancestral gene they overlap, suggesting an evolutionary tug of war between the ancestral and the de novo gene.
Collapse
Affiliation(s)
- Niv Sabath
- Institute of Evolutionary Biology and Environmental Studies, University of Zurich, Zurich, Switzerland.
| | | | | |
Collapse
|
23
|
Tsakogiannis D, Ruether IGA, Kyriakopoulou Z, Pliaka V, Skordas V, Gartzonika C, Levidiotou-Stefanou S, Markoulatos P. Molecular and phylogenetic analysis of the HPV 16 E4 gene in cervical lesions from women in Greece. Arch Virol 2012; 157:1729-39. [DOI: 10.1007/s00705-012-1356-1] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2012] [Accepted: 04/23/2012] [Indexed: 01/23/2023]
|
24
|
Robles-Sikisaka R, Rivera R, Nollens HH, St Leger J, Durden WN, Stolen M, Burchell J, Wellehan JFX. Evidence of recombination and positive selection in cetacean papillomaviruses. Virology 2012; 427:189-97. [PMID: 22386054 DOI: 10.1016/j.virol.2012.01.039] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2011] [Revised: 11/21/2011] [Accepted: 01/26/2012] [Indexed: 01/31/2023]
Abstract
Papillomaviruses (PVs) are small DNA viruses that have been associated with increased epithelial proliferation. Over one hundred PV types have been identified in humans; however, only three have been identified in bottlenose dolphins (Tursiops truncatus) to date. Using rolling circle amplification and degenerate PCR, we identified four novel PV genomes of bottlenose dolphins. TtPV4, TtPV5 and TtPV6 were identified in genital lesions while TtPV7 was identified in normal genital mucosa. Bayesian analysis of the full-length L1 genes found that TtPV4 and TtPV7 group within the Upsilonpapillomavirus genus while TtPV5 and TtPV6 group with Omikronpapillomavirus. However, analysis of the E1 gene did not distinguish these genera, implying that these genes may not share a common history, consistent with recombination. Recombination analyses identified several probable events. Signals of positive selection were found mostly in the E1 and E2 genes. Recombination and diversifying selection pressures constitute important driving forces of cetacean PV evolution.
Collapse
Affiliation(s)
- Refugio Robles-Sikisaka
- Hubbs-SeaWorld Research Institute, Center for Marine Veterinary Virology, San Diego, CA 92109, USA.
| | | | | | | | | | | | | | | |
Collapse
|
25
|
Population dynamics and ORF3 gene evolution of porcine circovirus type 2 circulating in Korea. Arch Virol 2012; 157:799-810. [DOI: 10.1007/s00705-012-1234-x] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2011] [Accepted: 12/14/2011] [Indexed: 11/26/2022]
|
26
|
Recombinational histories of avian infectious bronchitis virus and turkey coronavirus. Arch Virol 2011; 156:1823-9. [PMID: 21744259 PMCID: PMC7086623 DOI: 10.1007/s00705-011-1061-5] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2011] [Accepted: 06/25/2011] [Indexed: 11/29/2022]
Abstract
Phylogenetic analysis of complete genomes of the avian coronaviruses avian infectious bronchitis (AIBV) and turkey coronavirus (TCoV) supported the hypothesis that numerous recombination events have occurred between these viruses. Although the two groups of viruses differed markedly in the sequence of the spike protein, the gene (S) encoding this protein showed no evidence of positive selection or of an elevated mutation rate. Rather, the data suggested that recombination events have homogenized the portions of the genome other than the S gene between the two groups of viruses, while continuing to maintain the two distinct, anciently diverged versions of the S gene. The latter hypothesis was supported by a phylogeny of S proteins from representative coronaviruses, in which S proteins of AIBV and TCoV fell in the same clade.
Collapse
|
27
|
Pagán I, Holmes EC. Long-term evolution of the Luteoviridae: time scale and mode of virus speciation. J Virol 2010; 84:6177-87. [PMID: 20375155 PMCID: PMC2876656 DOI: 10.1128/jvi.02160-09] [Citation(s) in RCA: 87] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2009] [Accepted: 03/31/2010] [Indexed: 12/20/2022] Open
Abstract
Despite their importance as agents of emerging disease, the time scale and evolutionary processes that shape the appearance of new viral species are largely unknown. To address these issues, we analyzed intra- and interspecific evolutionary processes in the Luteoviridae family of plant RNA viruses. Using the coat protein gene of 12 members of the family, we determined their phylogenetic relationships, rates of nucleotide substitution, times to common ancestry, and patterns of speciation. An associated multigene analysis enabled us to infer the nature of selection pressures and the genomic distribution of recombination events. Although rates of evolutionary change and selection pressures varied among genes and species and were lower in some overlapping gene regions, all fell within the range of those seen in animal RNA viruses. Recombination breakpoints were commonly observed at gene boundaries but less so within genes. Our molecular clock analysis suggested that the origin of the currently circulating Luteoviridae species occurred within the last 4 millennia, with intraspecific genetic diversity arising within the last few hundred years. Speciation within the Luteoviridae may therefore be associated with the expansion of agricultural systems. Finally, our phylogenetic analysis suggested that viral speciation events tended to occur within the same plant host species and country of origin, as expected if speciation is largely sympatric, rather than allopatric, in nature.
Collapse
Affiliation(s)
- Israel Pagán
- Center for Infectious Disease Dynamics, Department of Biology, The Pennsylvania State University, University Park, PA 16802, USA.
| | | |
Collapse
|
28
|
Cladel NM, Bertotto A, Christensen ND. Human alpha and beta papillomaviruses use different synonymous codon profiles. Virus Genes 2010; 40:329-40. [PMID: 20157772 PMCID: PMC3752370 DOI: 10.1007/s11262-010-0451-1] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2009] [Accepted: 01/22/2010] [Indexed: 01/11/2023]
Abstract
Human papillomaviruses use rare codons relative to their hosts. It has been theorized that this is a mechanism to allow the virus to escape immune surveillance. In the present study, we examined the codings of four major genes of 21 human alpha (mucosatropic) viruses and 16 human beta (cutaneous-tropic) viruses. We compared the codon usage of different genes from a given papillomavirus and also the same genes from different papillomaviruses. Our data showed that codon usage was not always uniform between two genes of a given papillomavirus or between the same genes of papillomaviruses from different genera. We speculate as to why this might be and conclude that codon usage in the papillomaviruses may not only play a role in facilitating escape from immune surveillance but may also underlie some of the unanswered questions in the papillomavirus field.
Collapse
Affiliation(s)
- Nancy M Cladel
- Jake Gittlen Cancer Research Foundation, Pennsylvania State University College of Medicine, Hershey, PA 17033, USA.
| | | | | |
Collapse
|
29
|
Sabath N, Landan G, Graur D. A method for the simultaneous estimation of selection intensities in overlapping genes. PLoS One 2008; 3:e3996. [PMID: 19098983 PMCID: PMC2601044 DOI: 10.1371/journal.pone.0003996] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2008] [Accepted: 11/21/2008] [Indexed: 11/18/2022] Open
Abstract
Inferring the intensity of positive selection in protein-coding genes is important since it is used to shed light on the process of adaptation. Recently, it has been reported that overlapping genes, which are ubiquitous in all domains of life, seem to exhibit inordinate degrees of positive selection. Here, we present a new method for the simultaneous estimation of selection intensities in overlapping genes. We show that the appearance of positive selection is caused by assuming that selection operates independently on each gene in an overlapping pair, thereby ignoring the unique evolutionary constraints on overlapping coding regions. Our method uses an exact evolutionary model, thereby voiding the need for approximation or intensive computation. We test the method by simulating the evolution of overlapping genes of different types as well as under diverse evolutionary scenarios. Our results indicate that the independent estimation approach leads to the false appearance of positive selection even though the gene is in reality subject to negative selection. Finally, we use our method to estimate selection in two influenza A genes for which positive selection was previously inferred. We find no evidence for positive selection in both cases.
Collapse
Affiliation(s)
- Niv Sabath
- Department of Biology and Biochemistry, University of Houston, Houston, Texas, United States of America.
| | | | | |
Collapse
|
30
|
Cooke JN, Westover KM. Serotype-specific differences in antigenic regions of foot-and-mouth disease virus (FMDV): A comprehensive statistical analysis. INFECTION GENETICS AND EVOLUTION 2008; 8:855-63. [DOI: 10.1016/j.meegid.2008.08.004] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/23/2008] [Revised: 08/11/2008] [Accepted: 08/15/2008] [Indexed: 10/21/2022]
|
31
|
Puigbò P, Bravo IG, Garcia-Vallve S. CAIcal: a combined set of tools to assess codon usage adaptation. Biol Direct 2008; 3:38. [PMID: 18796141 PMCID: PMC2553769 DOI: 10.1186/1745-6150-3-38] [Citation(s) in RCA: 353] [Impact Index Per Article: 22.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2008] [Accepted: 09/16/2008] [Indexed: 12/23/2022] Open
Abstract
BACKGROUND The Codon Adaptation Index (CAI) was first developed to measure the synonymous codon usage bias for a DNA or RNA sequence. The CAI quantifies the similarity between the synonymous codon usage of a gene and the synonymous codon frequency of a reference set. RESULTS We describe here CAIcal, a web-server available at http://genomes.urv.es/CAIcal that includes a complete set of utilities related with the CAI. The server provides useful important features, such as the calculation and graphical representation of the CAI along either an individual sequence or a protein multiple sequence alignment translated to DNA. The automated calculation of CAI and its expected value is also included as one of the CAIcal tools. The software is also free to be downloaded as a stand alone application for local use. CONCLUSION The CAIcal server provides a complete set of tools to assess codon usage adaptation and to help in genome annotation.
Collapse
Affiliation(s)
- Pere Puigbò
- Department of Biochemistry and Biotechnology, Rovira i Virgili University (URV), Campus Sescelades, c/Marcelli Domingo s/n, 43007 Tarragona, Spain.
| | | | | |
Collapse
|
32
|
de Groot S, Mailund T, Lunter G, Hein J. Investigating selection on viruses: a statistical alignment approach. BMC Bioinformatics 2008; 9:304. [PMID: 18616801 PMCID: PMC2478691 DOI: 10.1186/1471-2105-9-304] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2007] [Accepted: 07/10/2008] [Indexed: 01/14/2023] Open
Abstract
Background Two problems complicate the study of selection in viral genomes: Firstly, the presence of genes in overlapping reading frames implies that selection in one reading frame can bias our estimates of neutral mutation rates in another reading frame. Secondly, the high mutation rates we are likely to encounter complicate the inference of a reliable alignment of genomes. To address these issues, we develop a model that explicitly models selection in overlapping reading frames. We then integrate this model into a statistical alignment framework, enabling us to estimate selection while explicitly dealing with the uncertainty of individual alignments. We show that in this way we obtain un-biased selection parameters for different genomic regions of interest, and can improve in accuracy compared to using a fixed alignment. Results We run a series of simulation studies to gauge how well we do in selection estimation, especially in comparison to the use of a fixed alignment. We show that the standard practice of using a ClustalW alignment can lead to considerable biases and that estimation accuracy increases substantially when explicitly integrating over the uncertainty in inferred alignments. We even manage to compete favourably for general evolutionary distances with an alignment produced by GenAl. We subsequently run our method on HIV2 and Hepatitis B sequences. Conclusion We propose that marginalizing over all alignments, as opposed to using a fixed one, should be considered in any parametric inference from divergent sequence data for which the alignments are not known with certainty. Moreover, we discover in HIV2 that double coding regions appear to be under less stringent selection than single coding ones. Additionally, there appears to be evidence for differential selection, where one overlapping reading frame is under positive and the other under negative selection.
Collapse
Affiliation(s)
- Saskia de Groot
- Department of Statistics, University of Oxford, 1 South Parks Road, OX1 3TG, UK.
| | | | | | | |
Collapse
|
33
|
Hughes AL, Piontkivska H. Nucleotide sequence polymorphism in circoviruses. INFECTION GENETICS AND EVOLUTION 2007; 8:130-8. [PMID: 18093882 DOI: 10.1016/j.meegid.2007.11.001] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/13/2007] [Revised: 11/02/2007] [Accepted: 11/07/2007] [Indexed: 12/24/2022]
Abstract
Analysis of nucleotide diversity within six species of circovirus showed consistently stronger purifying selection at nonsynonymous sites in the rep gene than on those in the cap gene. In addition, synonymous nucleotide diversity in the rep gene was significantly lower than that in the cap gene, suggesting functional constraint even at synonymous sites in rep, which was associated in all six species with strongly negative AT-skew. Of the six virus species examined, four species showed evidence of ongoing purifying selection at nonsynonymous polymorphic sites in the rep gene, indicating the presence of slightly deterious nonsynonymous variants in these populations. The rep gene of porcine circovirus 2 (PCV2) was unique, however, in showing a strong excess of rare nonsynonymous polymorphisms. The excess of rare nonsynonymous polymorphisms suggests a prolonged population bottleneck in PCV2, allowing slightly deleterious mutations to accumulate, followed by a population expansion during which selection to remove these variants has increased in effectiveness. Such a population history is consistent with the epidemiological evidence of a recent worldwide spread of PCV2.
Collapse
Affiliation(s)
- Austin L Hughes
- Department of Biological Sciences, University of South Carolina, Columbia, SC 29208, United States.
| | | |
Collapse
|
34
|
McCauley S, de Groot S, Mailund T, Hein J. Annotation of selection strengths in viral genomes. ACTA ACUST UNITED AC 2007; 23:2978-86. [PMID: 17921171 DOI: 10.1093/bioinformatics/btm472] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
MOTIVATION Viral genomes tend to code in overlapping reading frames to maximize informational content. This may result in atypical codon bias and particular evolutionary constraints. Due to the fast mutation rate of viruses, there is additional strong evidence for varying selection between intra- and intergenomic regions. The presence of multiple coding regions complicates the concept of K(a)/K(s) ratio, and thus begs for an alternative approach when investigating selection strengths. Building on the paper by McCauley and Hein, we develop a method for annotating a viral genome coding in overlapping reading frames. We introduce an evolutionary model capable of accounting for varying levels of selection along the genome, and incorporate it into our prior single sequence HMM methodology, extending it now to a phylogenetic HMM. Given an alignment of several homologous viruses to a reference sequence, we may thus achieve an annotation both of coding regions as well as selection strengths, allowing us to investigate different selection patterns and hypotheses. RESULTS We illustrate our method by applying it to a multiple alignment of four HIV2 sequences, as well as of three Hepatitis B sequences. We obtain an annotation of the coding regions, as well as a posterior probability for each site of the strength of selection acting on it. From this we may deduce the average posterior selection acting on the different genes. Whilst we are encouraged to see in HIV2, that the known to be conserved genes gag and pol are indeed annotated as such, we also discover several sites of less stringent negative selection within the env gene. To the best of our knowledge, we are the first to subsequently provide a full selection annotation of the Hepatitis B genome by explicitly modelling the evolution within overlapping reading frames, and not relying on simple K(a)/K(s) ratios.
Collapse
Affiliation(s)
- Stephen McCauley
- Department of Statistics, University of Oxford, 1 South Parks Road, OX1 3TG, UK
| | | | | | | |
Collapse
|
35
|
Hughes AL, Hughes MAK. More effective purifying selection on RNA viruses than in DNA viruses. Gene 2007; 404:117-25. [PMID: 17928171 DOI: 10.1016/j.gene.2007.09.013] [Citation(s) in RCA: 57] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2007] [Revised: 09/11/2007] [Accepted: 09/13/2007] [Indexed: 01/11/2023]
Abstract
Analysis of the pattern of nucleotide diversity in 222 independent viral sequence datasets showed the prevalence of purifying selection. In spite of the higher mutation rate of RNA viruses, our analyses revealed stronger evidence of the action of purifying selection in RNA viruses than in DNA viruses. The ratio of nonsynonymous to synonymous nucleotide diversity was significantly lower in RNA viruses than in DNA viruses, indicating that nonsynonymous mutations have been removed at a greater rate (relative to the mutation rate) in the former than in the latter. Moreover, statistics that measure the occurrence of rare polymorphisms revealed significantly a greater excess of rare nonsynonymous polymorphisms in RNA viruses than in DNA viruses but no difference with respect to synonymous polymorphisms. Since rare nonsynonymous polymorphisms are likely to be undergoing the effects of purifying selection acting to eliminate them, this result implies a stronger signature of ongoing purifying selection in RNA viruses than in DNA viruses. Across datasets from both DNA viruses and RNA viruses, we found a negatively allometric relationship between nonsynonymous and synonymous nucleotide diversity; in other words, nonsynonymous nucleotide diversity increased with synonymous nucleotide diversity at a less than linear rate. These findings are most easily explained by the occurrence of slightly deleterious mutations. The fact that the negative allometry was more pronounced in RNA viruses than in DNA viruses provided additional evidence that purifying selection is more effective in the former than in the latter.
Collapse
Affiliation(s)
- Austin L Hughes
- Department of Biological Sciences, University of South Carolina, Columbia, SC 29205, USA.
| | | |
Collapse
|
36
|
Leykauf K, Kabsch K, Gassler N, Gissmann L, Alonso A, Schenkel J. Expression of the HPV11 E2 gene in transgenic mice does not result in alterations of the phenotypic pattern. Transgenic Res 2007; 17:1-8. [PMID: 17701441 DOI: 10.1007/s11248-007-9130-y] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2007] [Accepted: 07/24/2007] [Indexed: 12/31/2022]
Abstract
The E2 early protein of human papillomaviruses (HPV) has been found associated with the mitotic spindle therefore being implicated in the partition of the replicated viral DNA to daughter cells. In addition, E2 proteins bind to the upstream regulatory region of the virus and to cellular promoters modulating thereby cellular transcription and differentiation. In many cervical cancers, the E2 reading frame is interrupted upon incorporation of the viral genome into the host DNA. This results in the loss of the E2 mediated transcriptional repression and uncontrolled expression of the viral oncogenes. All these results have been obtained in transfected cells but no information is available on the E2 effects in the context of the entire organism. Transgenic mice were generated expressing the E2 protein of HPV11 under the control of the Ubiquitin C promoter. E2 mRNA is present in all mice tissues analysed and the E2 protein expressed in the skin (the target tissue of HPV11) was shown by Western blotting, albeit at a very low level. Analysis of the transgenic mice shows no major histological changes in the skin or all other tissues investigated. These data indicate that in transgenic mice the human papillomavirus type 11 E2 does not grossly modulate cellular proliferation or differentiation events.
Collapse
Affiliation(s)
- Kerstin Leykauf
- German Cancer Research Centre, Heidelberg, F050, Im Neuenheimer Feld 280, Heidelberg, 69120, Germany
| | | | | | | | | | | |
Collapse
|
37
|
Pavesi A. Pattern of nucleotide substitution in the overlapping nonstructural genes of influenza A virus and implication for the genetic diversity of the H5N1 subtype. Gene 2007; 402:28-34. [PMID: 17825505 DOI: 10.1016/j.gene.2007.07.013] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2007] [Revised: 07/12/2007] [Accepted: 07/12/2007] [Indexed: 11/24/2022]
Abstract
In viruses under strong pressure to minimize genome size, overlapping genes represent a fine strategy to condense a maximum amount of information into short nucleotide sequences. Here, we investigated the evolution of the genes encoding the nonstructural proteins NS1 and NS2 of influenza A virus (IAV), which are one of the best characterized cases of gene overlap. By a detailed analysis of about four hundred sequences grouped into 11 IAV subtypes, we found that the overlapping coding region of the NS1 gene shows a significant increase of the rate of nonsynonymous change, with respect to its nonoverlapping counterpart. The same feature was observed in the overlapping coding region of the NS2 gene. Such a variation pattern, which implies the occurrence of several amino acid substitutions in the protein regions encoded by overlapping frames, is different from the pattern of constrained evolution typical of other viral overlapping-gene systems. Amino acid sequence analysis of the NS1 and NS2 proteins revealed that some nonsynonymous substitutions, located in the region of gene overlap, play a critical role in shaping the genetic diversity of the highly pathogenic subtype H5N1. Since both proteins contribute to disease pathogenesis by affecting many virus and host-cell processes, information provided by this study should be useful to highlight the impact of nonstructural gene variation on the pathogenicity of H5N1 viruses.
Collapse
Affiliation(s)
- Angelo Pavesi
- Department of Genetics, Biology of Microorganisms, Anthropology, Evolution, University of Parma, V. le G. P. Usberti 11/A, I-43100 Parma, Italy.
| |
Collapse
|
38
|
Szklarczyk R, Heringa J, Pond SK, Nekrutenko A. Rapid asymmetric evolution of a dual-coding tumor suppressor INK4a/ARF locus contradicts its function. Proc Natl Acad Sci U S A 2007; 104:12807-12. [PMID: 17652172 PMCID: PMC1937548 DOI: 10.1073/pnas.0703238104] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
INK4a/ARF tumor suppressor locus encodes two protein products, INK4a and ARF, essential for controlling tumorigenesis and mutated in more than half of human cancers. There is no resemblance between the two proteins: their coding regions are assembled by alternative splicing of two mutually exclusive 5' exons into a constitutive one containing overlapping out-of-phase reading frames. We show that the dual-coding arrangement conflicts with the high cost of mutations within INK4a/ARF. Unexpectedly, the locus evolves rapidly and asymmetrically, with ARF accumulating the majority of amino acid replacements. Rapid evolution drives both INK4a and ARF proteins out of sync with other members of the RB and p53 tumor suppressor pathways, both of which are controlled by the locus. Yet, the asymmetric behavior may be an intrinsic property of dual-coding exons: INK4a/ARF closely mimics the evolution of 90 newly identified genes with similar dual-coding structure. Thus, the strong link between mutations in INK4a/ARF and cancer may be a direct consequence of the architecture of the locus.
Collapse
Affiliation(s)
- Radek Szklarczyk
- *Centre for Integrative Bioinformatics, Vrije University, De Boelelaan 1081a, 1081HV, Amsterdam, The Netherlands
| | - Jaap Heringa
- *Centre for Integrative Bioinformatics, Vrije University, De Boelelaan 1081a, 1081HV, Amsterdam, The Netherlands
| | | | - Anton Nekrutenko
- Center for Comparative Genomics and Bioinformatics, Pennsylvania State University, University Park, PA 16803
- To whom correspondence should be addressed at:
505 Wartik Laboratory, Center for Comparative Genomics and Bioinformatics, Pennsylvania State University, University Park, PA 16802. E-mail:
| |
Collapse
|
39
|
Bravo IG, Alonso A. Phylogeny and evolution of papillomaviruses based on the E1 and E2 proteins. Virus Genes 2007; 34:249-62. [PMID: 16927128 DOI: 10.1007/s11262-006-0017-4] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2006] [Accepted: 06/09/2006] [Indexed: 12/26/2022]
Abstract
Papillomaviridae are a family of small double-stranded DNA viruses that infect stratified squamous epithelia in vertebrates. Members of this family are causative agents of malignant tumours, such as cervical cancer while others are associated with benign proliferative lesions. So far, Papillomaviruses (PVs) are classified according to the sequence identity in the capsid gene L1. However, evidence has accumulated indicating a discontinuity in the evolutionary history of the L1 and L2 genes of many PVs, giving rise to differences in the phylogenetic reconstructions of the early and of the late genes. Neither the oncogenes E5, E6 and E7 nor the upstream regulatory region are suitable for phylogenetic inference due to the poor conservation along the Papillomaviridae family. We have analysed here the evolutionary relationships of the PVs with respect to the E1 and E2 proteins, and the results provide both phylogeny and biologic behaviour of the viruses. The hierarchical taxonomic relationships can be structured as an alternative classification system in which mucosal high-risk viruses, mucosal low-risk viruses and viruses associated with cutaneous lesions are grouped separately and do not appear intermingled. Some important trends are also observed: first, evolution of the PVs has not been homogeneous, even in viruses that infect the same host, and second mucosal human PVs have evolved faster than their cutaneous counterparts. The evolutionary analysis based on the E1 and E2 proteins will allow us to better understand the generation of the diversity of the PVs and the development of malignancy associated with these viruses.
Collapse
Affiliation(s)
- Ignacio G Bravo
- Deutsches Krebsforschungszentrum (F050), Im Neuenheimer Feld-242, 69120 Heidelberg, Germany.
| | | |
Collapse
|
40
|
Hughes AL. Micro-scale signature of purifying selection in Marburg virus genomes. Gene 2007; 392:266-72. [PMID: 17306473 DOI: 10.1016/j.gene.2006.12.038] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2006] [Revised: 12/15/2006] [Accepted: 12/18/2006] [Indexed: 11/27/2022]
Abstract
In the seven protein-coding genes in the Marburg virus (MARV) genome, the synonymous nucleotide diversity substantially exceeded the nonsynonymous nucleotide diversity, indicating strong purifying selection. Likewise, there was evidence of purifying selection on 5'UTR and 3'UTR, where nucleotide diversity (pi) was significantly less than piS in the coding regions. Nonsynonymous polymorphic sites showed significantly reduced mean gene diversity in comparison to other polymorphic sites, indicating that purifying selection at certain slightly deleterious nonsynonymous polymorphisms is ongoing. Moreover, nonsynonymous polymorphic sites showed significantly reduced gene diversity in comparison to adjacent synonymous sites, even though the vast majority of such adjacent synonymous sites were in the same codon or an adjacent codon. Thus purifying selection, in conjunction with recombination and/or backward mutation, can act to break up linkage relationships at a micro-scale in the MARV genome. The ability of purifying selection to break up linkage between synonymous and nonsynonymous polymorphisms on such a fine scale has not been reported in any other genome.
Collapse
Affiliation(s)
- Austin L Hughes
- Department of Biological Sciences, University of South Carolina, Coker Life Sciences Bldg., 700 Sumter St., Columbia, SC 29208, USA.
| |
Collapse
|
41
|
de Groot S, Mailund T, Hein J. Comparative annotation of viral genomes with non-conserved gene structure. Bioinformatics 2007; 23:1080-9. [PMID: 17341494 DOI: 10.1093/bioinformatics/btm078] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION Detecting genes in viral genomes is a complex task. Due to the biological necessity of them being constrained in length, RNA viruses in particular tend to code in overlapping reading frames. Since one amino acid is encoded by a triplet of nucleic acids, up to three genes may be coded for simultaneously in one direction. Conventional hidden Markov model (HMM)-based gene-finding algorithms may typically find it difficult to identify multiple coding regions, since in general their topologies do not allow for the presence of overlapping or nested genes. Comparative methods have therefore been restricted to likelihood ratio tests on potential regions as to being double or single coding, using the fact that the constrictions forced upon multiple-coding nucleotides will result in atypical sequence evolution. Exploiting these same constraints, we present an HMM based gene-finding program, which allows for coding in unidirectional nested and overlapping reading frames, to annotate two homologous aligned viral genomes. Our method does not insist on conserved gene structure between the two sequences, thus making it applicable for the pairwise comparison of more distantly related sequences. RESULTS We apply our method to 15 pairwise alignments of six different HIV2 genomes. Given sufficient evolutionary distance between the two sequences, we achieve sensitivity of approximately 84-89% and specificity of approximately 97-99.9%. We additionally annotate three pairwise alignments of the more distantly related HIV1 and HIV2, as well as of two different hepatitis viruses, attaining results of approximately 87% sensitivity and approximately 98.5% specificity. We subsequently incorporate prior knowledge by 'knowing' the gene structure of one sequence and annotating the other conditional on it. Boosting accuracy close to perfect we demonstrate that conservation of gene structure on top of nucleotide sequence is a valuable source of information, especially in distantly related genomes. AVAILABILITY The Java code is available from the authors.
Collapse
|
42
|
Hu JM, Fu HC, Lin CH, Su HJ, Yeh HH. Reassortment and concerted evolution in banana bunchy top virus genomes. J Virol 2007; 81:1746-61. [PMID: 17135318 PMCID: PMC1797577 DOI: 10.1128/jvi.01390-06] [Citation(s) in RCA: 53] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2006] [Accepted: 11/16/2006] [Indexed: 12/12/2022] Open
Abstract
The nanovirus Banana bunchy top virus (BBTV) has six standard components in its genome and occasionally contains components encoding additional Rep (replication initiation protein) genes. Phylogenetic network analysis of coding sequences of DNA 1 and 3 confirmed the two major groups of BBTV, a Pacific and an Asian group, but show evidence of web-like phylogenies for some genes. Phylogenetic analysis of 102 major common regions (CR-Ms) from all six components showed a possible concerted evolution within the Pacific group, which is likely due to recombination in this region. The CR-M of additional Rep genes is close to that of DNA 1 and 2. Comparison of tree topologies constructed with DNA 1 and DNA 3 coding sequences of 14 BBTV isolates showed distinct phylogenetic histories based on Kishino-Hasegawa and Shimodaira-Hasegawa tests. The results of principal component analysis of amino acid and codon usages indicate that DNA 1 and 3 have a codon bias different from that of all other genes of nanoviruses, including all currently known additional Rep genes of BBTV, which suggests a possible ancient genome reassortment event between distinctive nanoviruses.
Collapse
Affiliation(s)
- Jer-Ming Hu
- Institute of Ecology and Evolutionary Biology, National Taiwan University, 1, Sec. 4, Roosevelt Road, Taipei 106, Taiwan.
| | | | | | | | | |
Collapse
|
43
|
Zhao X, McGirr KM, Buehring GC. Potential evolutionary influences on overlapping reading frames in the bovine leukemia virus pXBL region. Genomics 2007; 89:502-11. [PMID: 17239558 DOI: 10.1016/j.ygeno.2006.12.007] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2006] [Revised: 11/27/2006] [Accepted: 12/14/2006] [Indexed: 01/25/2023]
Abstract
Bovine leukemia virus contains a pXBL region encoding the 3' parts of four regulatory proteins (Tax, Rex, G4, R3) in overlapping reading frames. Here we report the pXBL polymorphisms of 30 isolates from four countries. Rates of overall and synonymous substitutions were consistently lower, and nucleotide/amino acid composition bias and codon bias higher, in more-overlapped than in less-overlapped regions. Ratios of nonsynonymous/synonymous substitutions were lowest in the tax gene and its subregions. The 5' parts of the four genes showed selection patterns corresponding to their genomic context outside of the pXBL region. Longer G4 variants due to a natural stop codon mutation had additional triple overlap with reduced sequence variability. These data support the concept that a higher level of overlapping in coding regions correlates with greater evolutionary constraint. Tax, the most conserved among the four regulatory proteins, showed purifying selection consistent with its importance in the viral life cycle.
Collapse
Affiliation(s)
- Xiangrong Zhao
- Graduate Program in Endocrinology, University of California at Berkeley, 3060 Valley Life Science Building, Berkeley, CA 94720-3140, USA.
| | | | | |
Collapse
|
44
|
García-Vallvé S, Alonso A, Bravo IG. Papillomaviruses: different genes have different histories. Trends Microbiol 2005; 13:514-21. [PMID: 16181783 DOI: 10.1016/j.tim.2005.09.003] [Citation(s) in RCA: 108] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2005] [Revised: 08/25/2005] [Accepted: 09/13/2005] [Indexed: 11/17/2022]
Abstract
Papillomaviruses (PVs) infect stratified squamous epithelia in vertebrates. Some PVs are associated with different types of cancer and with certain benign lesions. It has been assumed that PVs coevolved with their hosts. However, recently it has been shown that different regions of the genome have different evolutionary histories. The PV genome has a modular nature and appeared after the addition of pre-existent blocks. This order of appearance in the PV genome is evident today in the different evolutionary rates of the different genes, with new genes--E5, E6 and E7--diverging faster than old genes--E1, E2, L2 and L1. Here, we propose an evolutionary framework aiming to integrate genome evolution, PV biology and epidemiology of PV infections.
Collapse
Affiliation(s)
- Santiago García-Vallvé
- Evolutionary Genomics Group, Biochemistry and Biotechnology Department, Rovira i Virgili University (URV), c/ Marcel-li Domingo, s/n. Campus Sescelades, 43007 Tarragona, Spain
| | | | | |
Collapse
|