51
|
Walker PJ, Firth C, Widen SG, Blasdell KR, Guzman H, Wood TG, Paradkar PN, Holmes EC, Tesh RB, Vasilakis N. Evolution of genome size and complexity in the rhabdoviridae. PLoS Pathog 2015; 11:e1004664. [PMID: 25679389 PMCID: PMC4334499 DOI: 10.1371/journal.ppat.1004664] [Citation(s) in RCA: 122] [Impact Index Per Article: 13.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2014] [Accepted: 01/06/2015] [Indexed: 12/14/2022] Open
Abstract
RNA viruses exhibit substantial structural, ecological and genomic diversity. However, genome size in RNA viruses is likely limited by a high mutation rate, resulting in the evolution of various mechanisms to increase complexity while minimising genome expansion. Here we conduct a large-scale analysis of the genome sequences of 99 animal rhabdoviruses, including 45 genomes which we determined de novo, to identify patterns of genome expansion and the evolution of genome complexity. All but seven of the rhabdoviruses clustered into 17 well-supported monophyletic groups, of which eight corresponded to established genera, seven were assigned as new genera, and two were taxonomically ambiguous. We show that the acquisition and loss of new genes appears to have been a central theme of rhabdovirus evolution, and has been associated with the appearance of alternative, overlapping and consecutive ORFs within the major structural protein genes, and the insertion and loss of additional ORFs in each gene junction in a clade-specific manner. Changes in the lengths of gene junctions accounted for as much as 48.5% of the variation in genome size from the smallest to the largest genome, and the frequency with which new ORFs were observed increased in the 3' to 5' direction along the genome. We also identify several new families of accessory genes encoded in these regions, and show that non-canonical expression strategies involving TURBS-like termination-reinitiation, ribosomal frame-shifts and leaky ribosomal scanning appear to be common. We conclude that rhabdoviruses have an unusual capacity for genomic plasticity that may be linked to their discontinuous transcription strategy from the negative-sense single-stranded RNA genome, and propose a model that accounts for the regular occurrence of genome expansion and contraction throughout the evolution of the Rhabdoviridae.
Collapse
Affiliation(s)
- Peter J. Walker
- CSIRO Biosecurity, Australian Animal Health Laboratory, Geelong, Victoria, Australia
- * E-mail:
| | - Cadhla Firth
- CSIRO Biosecurity, Australian Animal Health Laboratory, Geelong, Victoria, Australia
| | - Steven G. Widen
- Department of Biochemistry and Molecular Biology, The University of Texas Medical Branch, Galveston, Texas, United States of America
| | - Kim R. Blasdell
- CSIRO Biosecurity, Australian Animal Health Laboratory, Geelong, Victoria, Australia
| | - Hilda Guzman
- Center for Biodefense and Emerging Infectious Diseases and Department of Pathology, Center for Tropical Diseases, and Institute for Human Infections and Immunity, The University of Texas Medical Branch, Galveston, Texas, United States of America
| | - Thomas G. Wood
- Department of Biochemistry and Molecular Biology, The University of Texas Medical Branch, Galveston, Texas, United States of America
| | - Prasad N. Paradkar
- CSIRO Biosecurity, Australian Animal Health Laboratory, Geelong, Victoria, Australia
| | - Edward C. Holmes
- Marie Bashir Institute for Infectious Diseases and Biosecurity, Charles Perkins Centre, School of Biological Sciences and Sydney Medical School, The University of Sydney, Sydney, New South Wales, Australia
| | - Robert B. Tesh
- Center for Biodefense and Emerging Infectious Diseases and Department of Pathology, Center for Tropical Diseases, and Institute for Human Infections and Immunity, The University of Texas Medical Branch, Galveston, Texas, United States of America
| | - Nikos Vasilakis
- Center for Biodefense and Emerging Infectious Diseases and Department of Pathology, Center for Tropical Diseases, and Institute for Human Infections and Immunity, The University of Texas Medical Branch, Galveston, Texas, United States of America
| |
Collapse
|
52
|
Bao Y, Chetvernin V, Tatusova T. Improvements to pairwise sequence comparison (PASC): a genome-based web tool for virus classification. Arch Virol 2014; 159:3293-304. [PMID: 25119676 PMCID: PMC4221606 DOI: 10.1007/s00705-014-2197-x] [Citation(s) in RCA: 126] [Impact Index Per Article: 12.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2014] [Accepted: 07/29/2014] [Indexed: 11/28/2022]
Abstract
The number of viral genome sequences in the public databases is increasing dramatically, and these sequences are playing an important role in virus classification. Pairwise sequence comparison is a sequence-based virus classification method. A program using this method calculates the pairwise identities of virus sequences within a virus family and displays their distribution, and visual analysis helps to determine demarcations at different taxonomic levels such as strain, species, genus and subfamily. Subsequent comparison of new sequences against existing ones allows viruses from which the new sequences were derived to be classified. Although this method cannot be used as the only criterion for virus classification in some cases, it is a quantitative method and has many advantages over conventional virus classification methods. It has been applied to several virus families, and there is an increasing interest in using this method for other virus families/groups. The Pairwise Sequence Comparison (PASC) classification tool was created at the National Center for Biotechnology Information. The tool's database stores pairwise identities for complete genomes/segments of 56 virus families/groups. Data in the system are updated every day to reflect changes in virus taxonomy and additions of new virus sequences to the public database. The web interface of the tool ( http://www.ncbi.nlm.nih.gov/sutils/pasc/ ) makes it easy to navigate and perform analyses. Multiple new viral genome sequences can be tested simultaneously with this system to suggest the taxonomic position of virus isolates in a specific family. PASC eliminates potential discrepancies in the results caused by different algorithms and/or different data used by researchers.
Collapse
Affiliation(s)
- Yiming Bao
- National Center for Biotechnology Information, National Institutes of Health, Bethesda, MD, 20894, USA,
| | | | | |
Collapse
|
53
|
Muhire BM, Varsani A, Martin DP. SDT: a virus classification tool based on pairwise sequence alignment and identity calculation. PLoS One 2014; 9:e108277. [PMID: 25259891 PMCID: PMC4178126 DOI: 10.1371/journal.pone.0108277] [Citation(s) in RCA: 860] [Impact Index Per Article: 86.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2014] [Accepted: 08/26/2014] [Indexed: 01/16/2023] Open
Abstract
The perpetually increasing rate at which viral full-genome sequences are being determined is creating a pressing demand for computational tools that will aid the objective classification of these genome sequences. Taxonomic classification approaches that are based on pairwise genetic identity measures are potentially highly automatable and are progressively gaining favour with the International Committee on Taxonomy of Viruses (ICTV). There are, however, various issues with the calculation of such measures that could potentially undermine the accuracy and consistency with which they can be applied to virus classification. Firstly, pairwise sequence identities computed based on multiple sequence alignments rather than on multiple independent pairwise alignments can lead to the deflation of identity scores with increasing dataset sizes. Also, when gap-characters need to be introduced during sequence alignments to account for insertions and deletions, methodological variations in the way that these characters are introduced and handled during pairwise genetic identity calculations can cause high degrees of inconsistency in the way that different methods classify the same sets of sequences. Here we present Sequence Demarcation Tool (SDT), a free user-friendly computer program that aims to provide a robust and highly reproducible means of objectively using pairwise genetic identity calculations to classify any set of nucleotide or amino acid sequences. SDT can produce publication quality pairwise identity plots and colour-coded distance matrices to further aid the classification of sequences according to ICTV approved taxonomic demarcation criteria. Besides a graphical interface version of the program for Windows computers, command-line versions of the program are available for a variety of different operating systems (including a parallel version for cluster computing platforms).
Collapse
Affiliation(s)
- Brejnev Muhizi Muhire
- Department of Clinical Laboratory Sciences, University of Cape Town, Cape Town, South Africa
| | - Arvind Varsani
- Department of Clinical Laboratory Sciences, University of Cape Town, Cape Town, South Africa
- School of Biological Sciences and Biomolecular Interaction Centre, University of Canterbury, Christchurch, New Zealand
- Department of Plant Pathology and Emerging Pathogens Institute, University of Florida, Gainesville, Florida, United States of America
| | - Darren Patrick Martin
- Department of Clinical Laboratory Sciences, University of Cape Town, Cape Town, South Africa
| |
Collapse
|
54
|
Rooting the phylogenetic tree of middle East respiratory syndrome coronavirus by characterization of a conspecific virus from an African bat. J Virol 2014; 88:11297-303. [PMID: 25031349 DOI: 10.1128/jvi.01498-14] [Citation(s) in RCA: 290] [Impact Index Per Article: 29.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
UNLABELLED The emerging Middle East respiratory syndrome coronavirus (MERS-CoV) causes lethal respiratory infections mainly on the Arabian Peninsula. The evolutionary origins of MERS-CoV are unknown. We determined the full genome sequence of a CoV directly from fecal material obtained from a South African Neoromicia capensis bat (NeoCoV). NeoCoV shared essential details of genome architecture with MERS-CoV. Eighty-five percent of the NeoCoV genome was identical to MERS-CoV at the nucleotide level. Based on taxonomic criteria, NeoCoV and MERS-CoV belonged to one viral species. The presence of a genetically divergent S1 subunit within the NeoCoV spike gene indicated that intraspike recombination events may have been involved in the emergence of MERS-CoV. NeoCoV constitutes a sister taxon of MERS-CoV, placing the MERS-CoV root between a recently described virus from African camels and all other viruses. This suggests a higher level of viral diversity in camels than in humans. Together with serologic evidence for widespread MERS-CoV infection in camelids sampled up to 20 years ago in Africa and the Arabian Peninsula, the genetic data indicate that camels act as sources of virus for humans rather than vice versa. The majority of camels on the Arabian Peninsula is imported from the Greater Horn of Africa, where several Neoromicia species occur. The acquisition of MERS-CoV by camels from bats might have taken place in sub-Saharan Africa. Camelids may represent mixing vessels for MERS-CoV and other mammalian CoVs. IMPORTANCE It is unclear how, when, and where the highly pathogenic MERS-CoV emerged. We characterized the full genome of an African bat virus closely related to MERS-CoV and show that human, camel, and bat viruses belong to the same viral species. The bat virus roots the phylogenetic tree of MERS-CoV, providing evidence for an evolution of MERS-CoV in camels that preceded that in humans. The revised tree suggests that humans are infected by camels rather than vice versa. Although MERS-CoV cases occur mainly on the Arabian Peninsula, the data from this study together with serologic and molecular investigations of African camels indicate that the initial host switch from bats may have taken place in Africa. The emergence of MERS-CoV likely involved exchanges of genetic elements between different viral ancestors. These exchanges may have taken place either in bat ancestors or in camels acting as mixing vessels for viruses from different hosts.
Collapse
|
55
|
Naeem A, Hosomi T, Nishimura Y, Alam MM, Oka T, Zaidi SSZ, Shimizu H. Genetic diversity of circulating Saffold viruses in Pakistan and Afghanistan. J Gen Virol 2014; 95:1945-1957. [PMID: 24899154 DOI: 10.1099/vir.0.066498-0] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Human cardioviruses or Saffold viruses (SAFVs) of the family Picornaviridae are newly emerging viruses whose genetic and phenotypic diversity are poorly understood. We report here the full genome sequence of 11 SAFV genotypes from Pakistan and Afghanistan, along with a re-evaluation of their genetic diversity and recombination. We detected 88 SAFV from stool samples of 943 acute flaccid paralysis cases using reverse transcriptase-PCR targeting the 5' untranslated region (UTR). Further characterization based on complete VP1 analysis revealed 71 SAFVs belonging to 11 genotypes, including three previously unidentified genotypes. SAFV showed high genetic diversity and recombination based on phylogenetic, pairwise distance distributions and recombination mapping analyses performed herein. Phylogenies based on non-structural and UTRs were highly incongruent indicating frequent recombination events among SAFVs. We improved the SAFV genotyping classification criteria by determining new VP1 thresholds based on the principles used for the classification of enteroviruses. For genotype assignment, we propose a threshold of 23 and 10 % divergence for VP1 nucleotide and amino acid sequences, respectively. Other members of the species Theilovirus, such as Thera virus and Theiler's murine encephalomyelitis virus, are difficult to classify in the same species as SAFV, because they are genetically distinct from SAFV, with 41-56 % aa pairwise distances. The new genetic information obtained in this study will improve our understanding of the evolution and classification of SAFV.
Collapse
Affiliation(s)
- Asif Naeem
- Department of Virology II, National Institute of Infectious Diseases, Tokyo, Japan
| | - Takushi Hosomi
- The Meat Inspection Center of Kochi Prefecture, Kochi, Japan
| | - Yorihiro Nishimura
- Department of Virology II, National Institute of Infectious Diseases, Tokyo, Japan
| | | | - Tomoichiro Oka
- Department of Virology II, National Institute of Infectious Diseases, Tokyo, Japan
| | | | - Hiroyuki Shimizu
- Department of Virology II, National Institute of Infectious Diseases, Tokyo, Japan
| |
Collapse
|
56
|
Taxonomy. VIRUSES AND THE LUNG 2014. [PMCID: PMC7123310 DOI: 10.1007/978-3-642-40605-8_3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
This chapter addresses the classification and taxonomy of viruses with special attention to viruses that show pneumotropic properties. Information provided in this chapter supplements that provided in other chapters in Parts II–V of this volume that discuss individual viral pathogens.
Collapse
|
57
|
Muhire BM, Varsani A, Martin DP. SDT: a virus classification tool based on pairwise sequence alignment and identity calculation. PLoS One 2014. [PMID: 25259891 DOI: 10.1371/journal/pone.0108277] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/16/2023] Open
Abstract
The perpetually increasing rate at which viral full-genome sequences are being determined is creating a pressing demand for computational tools that will aid the objective classification of these genome sequences. Taxonomic classification approaches that are based on pairwise genetic identity measures are potentially highly automatable and are progressively gaining favour with the International Committee on Taxonomy of Viruses (ICTV). There are, however, various issues with the calculation of such measures that could potentially undermine the accuracy and consistency with which they can be applied to virus classification. Firstly, pairwise sequence identities computed based on multiple sequence alignments rather than on multiple independent pairwise alignments can lead to the deflation of identity scores with increasing dataset sizes. Also, when gap-characters need to be introduced during sequence alignments to account for insertions and deletions, methodological variations in the way that these characters are introduced and handled during pairwise genetic identity calculations can cause high degrees of inconsistency in the way that different methods classify the same sets of sequences. Here we present Sequence Demarcation Tool (SDT), a free user-friendly computer program that aims to provide a robust and highly reproducible means of objectively using pairwise genetic identity calculations to classify any set of nucleotide or amino acid sequences. SDT can produce publication quality pairwise identity plots and colour-coded distance matrices to further aid the classification of sequences according to ICTV approved taxonomic demarcation criteria. Besides a graphical interface version of the program for Windows computers, command-line versions of the program are available for a variety of different operating systems (including a parallel version for cluster computing platforms).
Collapse
Affiliation(s)
- Brejnev Muhizi Muhire
- Department of Clinical Laboratory Sciences, University of Cape Town, Cape Town, South Africa
| | - Arvind Varsani
- Department of Clinical Laboratory Sciences, University of Cape Town, Cape Town, South Africa; School of Biological Sciences and Biomolecular Interaction Centre, University of Canterbury, Christchurch, New Zealand; Department of Plant Pathology and Emerging Pathogens Institute, University of Florida, Gainesville, Florida, United States of America
| | - Darren Patrick Martin
- Department of Clinical Laboratory Sciences, University of Cape Town, Cape Town, South Africa
| |
Collapse
|
58
|
Powerful sequence similarity search methods and in-depth manual analyses can identify remote homologs in many apparently "orphan" viral proteins. J Virol 2013; 88:10-20. [PMID: 24155369 PMCID: PMC3911697 DOI: 10.1128/jvi.02595-13] [Citation(s) in RCA: 68] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
The genome sequences of new viruses often contain many "orphan" or "taxon-specific" proteins apparently lacking homologs. However, because viral proteins evolve very fast, commonly used sequence similarity detection methods such as BLAST may overlook homologs. We analyzed a data set of proteins from RNA viruses characterized as "genus specific" by BLAST. More powerful methods developed recently, such as HHblits or HHpred (available through web-based, user-friendly interfaces), could detect distant homologs of a quarter of these proteins, suggesting that these methods should be used to annotate viral genomes. In-depth manual analyses of a subset of the remaining sequences, guided by contextual information such as taxonomy, gene order, or domain cooccurrence, identified distant homologs of another third. Thus, a combination of powerful automated methods and manual analyses can uncover distant homologs of many proteins thought to be orphans. We expect these methodological results to be also applicable to cellular organisms, since they generally evolve much more slowly than RNA viruses. As an application, we reanalyzed the genome of a bee pathogen, Chronic bee paralysis virus (CBPV). We could identify homologs of most of its proteins thought to be orphans; in each case, identifying homologs provided functional clues. We discovered that CBPV encodes a domain homologous to the Alphavirus methyltransferase-guanylyltransferase; a putative membrane protein, SP24, with homologs in unrelated insect viruses and insect-transmitted plant viruses having different morphologies (cileviruses, higreviruses, blunerviruses, negeviruses); and a putative virion glycoprotein, ORF2, also found in negeviruses. SP24 and ORF2 are probably major structural components of the virions.
Collapse
|
59
|
Phan TG, Vo NP, Simmonds P, Samayoa E, Naccache S, Chiu CY, Delwart E. Rosavirus: the prototype of a proposed new genus of the Picornaviridae family. Virus Genes 2013; 47:556-8. [PMID: 23943414 DOI: 10.1007/s11262-013-0968-1] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2013] [Accepted: 07/31/2013] [Indexed: 10/26/2022]
Abstract
We describe a 8,724-nucleotide-long picornavirus genome encoding a single 2,470-aa polyprotein obtained from the feces of a wild mouse. Rosavirus is genetically closest to the double ORF Dicipivirus found in canine feces that is currently the only picornavirus with a second internal ribosome entry site (IRES). Of note, a section of rosavirus' 5'UTR showed strong sequence and structural conservation with the type II IRES from the Parechovirus and Hungarovirus genera possibly reflecting exchange of genetic modules between genera. Based on genetic distance criteria rosavirus qualifies as prototype of a new genus of the Picornaviridae family.
Collapse
Affiliation(s)
- Tung Gia Phan
- Blood Systems Research Institute, San Francisco, CA, 94118, USA
| | | | | | | | | | | | | |
Collapse
|
60
|
A genome-wide pairwise-identity-based proposal for the classification of viruses in the genus Mastrevirus (family Geminiviridae). Arch Virol 2013; 158:1411-24. [DOI: 10.1007/s00705-012-1601-7] [Citation(s) in RCA: 138] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2012] [Accepted: 12/01/2012] [Indexed: 10/27/2022]
|
61
|
Tapparel C, Siegrist F, Petty TJ, Kaiser L. Picornavirus and enterovirus diversity with associated human diseases. INFECTION GENETICS AND EVOLUTION 2012. [PMID: 23201849 DOI: 10.1016/j.meegid.2012.10.016] [Citation(s) in RCA: 313] [Impact Index Per Article: 26.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
Members of the Picornaviridae family are non-enveloped, positive-stranded RNA viruses with a 30nm icosahedral capsid. This virus family exhibits a considerable amount of genetic variability driven both by mutation and recombination. Recently, three previously unknown human picornaviruses, namely the human Saffold cardiovirus, cosavirus and salivirus, have been identified in stools or respiratory samples from subjects presenting symptoms ranging from gastroenteritis to acute flaccid paralysis. However, these viruses were also frequently detected in asymptomatic subjects and their clinical relevance remains to be elucidated. The Enterovirus genus is a prototype example of the Picornaviridae heterogeneity at both genetic and phenotypic levels. This genus is divided into 10 species, seven of which contain human viruses, including three Rhinovirus species. Both human rhino- and enteroviruses are also characterized by high levels of genetic variability, as exemplified by the existence of over 250 different serotypes and the recent discovery of new enterovirus genotypes and the Rhinovirus C species. Despite their common genomic features, rhinoviruses are restricted to the respiratory tract, whereas the vast majority of enteroviruses infect the gastrointestinal tract and can spread to other organs, such as the heart or the central nervous system. Understanding the genetic determinants of such phenotypic diversity is an important challenge and a field for future investigation. Better characterization of these ubiquitous human pathogens may help to develop vaccines or antiviral treatments and to monitor the emergence of new strains.
Collapse
Affiliation(s)
- Caroline Tapparel
- Laboratory of Virology, Division of Infectious Diseases and Division of Laboratory Medicine, University of Geneva Hospitals, 4 Rue Gabrielle-Perret-Gentil, 1211 Geneva 14, Switzerland.
| | | | | | | |
Collapse
|
62
|
Genomic characterization of a newly discovered coronavirus associated with acute respiratory distress syndrome in humans. mBio 2012. [PMID: 23170002 DOI: 10.1128/mbio.00473-12.] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
UNLABELLED A novel human coronavirus (HCoV-EMC/2012) was isolated from a man with acute pneumonia and renal failure in June 2012. This report describes the complete genome sequence, genome organization, and expression strategy of HCoV-EMC/2012 and its relation with known coronaviruses. The genome contains 30,119 nucleotides and contains at least 10 predicted open reading frames, 9 of which are predicted to be expressed from a nested set of seven subgenomic mRNAs. Phylogenetic analysis of the replicase gene of coronaviruses with completely sequenced genomes showed that HCoV-EMC/2012 is most closely related to Tylonycteris bat coronavirus HKU4 (BtCoV-HKU4) and Pipistrellus bat coronavirus HKU5 (BtCoV-HKU5), which prototype two species in lineage C of the genus Betacoronavirus. In accordance with the guidelines of the International Committee on Taxonomy of Viruses, and in view of the 75% and 77% amino acid sequence identity in 7 conserved replicase domains with BtCoV-HKU4 and BtCoV-HKU5, respectively, we propose that HCoV-EMC/2012 prototypes a novel species in the genus Betacoronavirus. HCoV-EMC/2012 may be most closely related to a coronavirus detected in Pipistrellus pipistrellus in The Netherlands, but because only a short sequence from the most conserved part of the RNA-dependent RNA polymerase-encoding region of the genome was reported for this bat virus, its genetic distance from HCoV-EMC remains uncertain. HCoV-EMC/2012 is the sixth coronavirus known to infect humans and the first human virus within betacoronavirus lineage C. IMPORTANCE Coronaviruses are capable of infecting humans and many animal species. Most infections caused by human coronaviruses are relatively mild. However, the outbreak of severe acute respiratory syndrome (SARS) caused by SARS-CoV in 2002 to 2003 and the fatal infection of a human by HCoV-EMC/2012 in 2012 show that coronaviruses are able to cause severe, sometimes fatal disease in humans. We have determined the complete genome of HCoV-EMC/2012 using an unbiased virus discovery approach involving next-generation sequencing techniques, which enabled subsequent state-of-the-art bioinformatics, phylogenetics, and taxonomic analyses. By establishing its complete genome sequence, HCoV-EMC/2012 was characterized as a new genotype which is closely related to bat coronaviruses that are distant from SARS-CoV. We expect that this information will be vital to rapid advancement of both clinical and vital research on this emerging pathogen.
Collapse
|
63
|
Genomic characterization of a newly discovered coronavirus associated with acute respiratory distress syndrome in humans. mBio 2012; 3:mBio.00473-12. [PMID: 23170002 PMCID: PMC3509437 DOI: 10.1128/mbio.00473-12] [Citation(s) in RCA: 653] [Impact Index Per Article: 54.4] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
A novel human coronavirus (HCoV-EMC/2012) was isolated from a man with acute pneumonia and renal failure in June 2012. This report describes the complete genome sequence, genome organization, and expression strategy of HCoV-EMC/2012 and its relation with known coronaviruses. The genome contains 30,119 nucleotides and contains at least 10 predicted open reading frames, 9 of which are predicted to be expressed from a nested set of seven subgenomic mRNAs. Phylogenetic analysis of the replicase gene of coronaviruses with completely sequenced genomes showed that HCoV-EMC/2012 is most closely related to Tylonycteris bat coronavirus HKU4 (BtCoV-HKU4) and Pipistrellus bat coronavirus HKU5 (BtCoV-HKU5), which prototype two species in lineage C of the genus Betacoronavirus. In accordance with the guidelines of the International Committee on Taxonomy of Viruses, and in view of the 75% and 77% amino acid sequence identity in 7 conserved replicase domains with BtCoV-HKU4 and BtCoV-HKU5, respectively, we propose that HCoV-EMC/2012 prototypes a novel species in the genus Betacoronavirus. HCoV-EMC/2012 may be most closely related to a coronavirus detected in Pipistrellus pipistrellus in The Netherlands, but because only a short sequence from the most conserved part of the RNA-dependent RNA polymerase-encoding region of the genome was reported for this bat virus, its genetic distance from HCoV-EMC remains uncertain. HCoV-EMC/2012 is the sixth coronavirus known to infect humans and the first human virus within betacoronavirus lineage C. Coronaviruses are capable of infecting humans and many animal species. Most infections caused by human coronaviruses are relatively mild. However, the outbreak of severe acute respiratory syndrome (SARS) caused by SARS-CoV in 2002 to 2003 and the fatal infection of a human by HCoV-EMC/2012 in 2012 show that coronaviruses are able to cause severe, sometimes fatal disease in humans. We have determined the complete genome of HCoV-EMC/2012 using an unbiased virus discovery approach involving next-generation sequencing techniques, which enabled subsequent state-of-the-art bioinformatics, phylogenetics, and taxonomic analyses. By establishing its complete genome sequence, HCoV-EMC/2012 was characterized as a new genotype which is closely related to bat coronaviruses that are distant from SARS-CoV. We expect that this information will be vital to rapid advancement of both clinical and vital research on this emerging pathogen.
Collapse
|
64
|
Lauber C, Gorbalenya AE. Genetics-based classification of filoviruses calls for expanded sampling of genomic sequences. Viruses 2012; 4:1425-37. [PMID: 23170166 PMCID: PMC3499813 DOI: 10.3390/v4091425] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2012] [Revised: 08/23/2012] [Accepted: 08/24/2012] [Indexed: 12/14/2022] Open
Abstract
We have recently developed a computational approach for hierarchical, genome-based classification of viruses of a family (DEmARC). In DEmARC, virus clusters are delimited objectively by devising a universal family-wide threshold on intra-cluster genetic divergence of viruses that is specific for each level of the classification. Here, we apply DEmARC to a set of 56 filoviruses with complete genome sequences and compare the resulting classification to the ICTV taxonomy of the family Filoviridae. We find in total six candidate taxon levels two of which correspond to the species and genus ranks of the family. At these two levels, the six filovirus species and two genera officially recognized by ICTV, as well as a seventh tentative species for Lloviu virus and prototyping a third genus, are reproduced. DEmARC lends the highest possible support for these two as well as the four other levels, implying that the actual number of valid taxon levels remains uncertain and the choice of levels for filovirus species and genera is arbitrary. Based on our experience with other virus families, we conclude that the current sampling of filovirus genomic sequences needs to be considerably expanded in order to resolve these uncertainties in the framework of genetics-based classification.
Collapse
Affiliation(s)
- Chris Lauber
- Molecular Virology Laboratory, Department of Medical Microbiology, Leiden University Medical Center, 2333 ZA Leiden, The Netherlands;
| | - Alexander E. Gorbalenya
- Molecular Virology Laboratory, Department of Medical Microbiology, Leiden University Medical Center, 2333 ZA Leiden, The Netherlands;
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, 119899 Moscow, Russia
- Author to whom correspondence should be addressed; ; Tel.: +31-71-526-1436; Fax: +31-71-526-6761
| |
Collapse
|
65
|
Abstract
Evolution of RNA viruses occurs through disequilibria of collections of closely related mutant spectra or mutant clouds termed viral quasispecies. Here we review the origin of the quasispecies concept and some biological implications of quasispecies dynamics. Two main aspects are addressed: (i) mutant clouds as reservoirs of phenotypic variants for virus adaptability and (ii) the internal interactions that are established within mutant spectra that render a virus ensemble the unit of selection. The understanding of viruses as quasispecies has led to new antiviral designs, such as lethal mutagenesis, whose aim is to drive viruses toward low fitness values with limited chances of fitness recovery. The impact of quasispecies for three salient human pathogens, human immunodeficiency virus and the hepatitis B and C viruses, is reviewed, with emphasis on antiviral treatment strategies. Finally, extensions of quasispecies to nonviral systems are briefly mentioned to emphasize the broad applicability of quasispecies theory.
Collapse
Affiliation(s)
- Esteban Domingo
- Centro de Biología Molecular Severo Ochoa (CSIC-UAM), C/ Nicolás Cabrera, Universidad Autónoma de Madrid, Cantoblanco, Madrid, Spain.
| | | | | |
Collapse
|
66
|
Lauber C, Ziebuhr J, Junglen S, Drosten C, Zirkel F, Nga PT, Morita K, Snijder EJ, Gorbalenya AE. Mesoniviridae: a proposed new family in the order Nidovirales formed by a single species of mosquito-borne viruses. Arch Virol 2012; 157:1623-8. [PMID: 22527862 PMCID: PMC3407358 DOI: 10.1007/s00705-012-1295-x] [Citation(s) in RCA: 93] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2012] [Accepted: 02/27/2012] [Indexed: 11/26/2022]
Abstract
Recently, two independent surveillance studies in Côte d’Ivoire and Vietnam, respectively, led to the discovery of two mosquito-borne viruses, Cavally virus and Nam Dinh virus, with genome and proteome properties typical for viruses of the order Nidovirales. Using a state-of-the-art approach, we show that the two insect nidoviruses are (i) sufficiently different from other nidoviruses to represent a new virus family, and (ii) related to each other closely enough to be placed in the same virus species. We propose to name this new family Mesoniviridae. Meso is derived from the Greek word “mesos” (in English “in the middle”) and refers to the distinctive genome size of these insect nidoviruses, which is intermediate between that of the families Arteriviridae and Coronaviridae, while ni is an abbreviation for “nido”. A taxonomic proposal to establish the new family Mesoniviridae, genus Alphamesonivirus, and species Alphamesonivirus 1 has been approved for consideration by the Executive Committee of the ICTV.
Collapse
Affiliation(s)
- Chris Lauber
- Molecular Virology Laboratory, Department of Medical Microbiology, Leiden University Medical Center, Albinusdreef 2, E4-P, P.O. Box 9600, 2300 RC Leiden, The Netherlands
| | - John Ziebuhr
- Institute of Medical Virology, Justus Liebig University Giessen, Giessen, Germany
| | - Sandra Junglen
- Institute of Virology, University of Bonn Medical Center, Bonn, Germany
| | - Christian Drosten
- Institute of Virology, University of Bonn Medical Center, Bonn, Germany
| | - Florian Zirkel
- Institute of Virology, University of Bonn Medical Center, Bonn, Germany
| | - Phan Thi Nga
- Department of Virology, National Institute of Hygiene and Epidemiology, Hanoi, 100-000 Vietnam
| | - Kouichi Morita
- Department of Virology, Institute of Tropical Medicine, Global COE Program, Nagasaki University, Nagasaki, 852-8523 Japan
| | - Eric J. Snijder
- Molecular Virology Laboratory, Department of Medical Microbiology, Leiden University Medical Center, Albinusdreef 2, E4-P, P.O. Box 9600, 2300 RC Leiden, The Netherlands
| | - Alexander E. Gorbalenya
- Molecular Virology Laboratory, Department of Medical Microbiology, Leiden University Medical Center, Albinusdreef 2, E4-P, P.O. Box 9600, 2300 RC Leiden, The Netherlands
| |
Collapse
|
67
|
Toward genetics-based virus taxonomy: comparative analysis of a genetics-based classification and the taxonomy of picornaviruses. J Virol 2012; 86:3905-15. [PMID: 22278238 DOI: 10.1128/jvi.07174-11] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open
Abstract
Virus taxonomy has received little attention from the research community despite its broad relevance. In an accompanying paper (C. Lauber and A. E. Gorbalenya, J. Virol. 86:3890-3904, 2012), we have introduced a quantitative approach to hierarchically classify viruses of a family using pairwise evolutionary distances (PEDs) as a measure of genetic divergence. When applied to the six most conserved proteins of the Picornaviridae, it clustered 1,234 genome sequences in groups at three hierarchical levels (to which we refer as the "GENETIC classification"). In this study, we compare the GENETIC classification with the expert-based picornavirus taxonomy and outline differences in the underlying frameworks regarding the relation of virus groups and genetic diversity that represent, respectively, the structure and content of a classification. To facilitate the analysis, we introduce two novel diagrams. The first connects the genetic diversity of taxa to both the PED distribution and the phylogeny of picornaviruses. The second depicts a classification and the accommodated genetic diversity in a standardized manner. Generally, we found striking agreement between the two classifications on species and genus taxa. A few disagreements concern the species Human rhinovirus A and Human rhinovirus C and the genus Aphthovirus, which were split in the GENETIC classification. Furthermore, we propose a new supergenus level and universal, level-specific PED thresholds, not reached yet by many taxa. Since the species threshold is approached mostly by taxa with large sampling sizes and those infecting multiple hosts, it may represent an upper limit on divergence, beyond which homologous recombination in the six most conserved genes between two picornaviruses might not give viable progeny.
Collapse
|