1
|
Gupta S, Gupta D, Bhatnagar S. Analysis of SARS-CoV-2 genome evolutionary patterns. Microbiol Spectr 2024; 12:e0265423. [PMID: 38197644 PMCID: PMC10846092 DOI: 10.1128/spectrum.02654-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2023] [Accepted: 11/20/2023] [Indexed: 01/11/2024] Open
Abstract
The spread of SARS-CoV-2 virus accompanied by public availability of abundant sequence data provides a window for the determination of viral evolutionary patterns. In this study, SARS-CoV-2 genome sequences were collected from seven countries in the period January 2020-December 2022. The sequences were classified into three phases, namely, pre-vaccination, post-vaccination, and recent period. Comparison was performed between these phases based on parameters like mutation rates, selection pressure (dN/dS ratio), and transition to transversion ratios (Ti/Tv). Similar comparisons were performed among SARS-CoV-2 variants. Statistical significance was tested using Graphpad unpaired t-test. The analysis showed an increase in the percent genomic mutation rates post-vaccination and in recent periods across all countries from the pre-vaccination sequences. Mutation rates were highest in NSP3, S, N, and NSP12b before and increased further after vaccination. NSP4 showed the largest change in mutation rates after vaccination. The dN/dS ratios showed purifying selection that shifted toward neutral selection after vaccination. N, ORF8, ORF3a, and ORF10 were under highest positive selection before vaccination. Shift toward neutral selection was driven by E, NSP3, and ORF7a in the after vaccination set. In recent sequences, the largest dN/dS change was observed in E, NSP1, and NSP13. The Ti/Tv ratios decreased with time. C→U and G→U were the most frequent transitions and transversions. However, U→G was the most frequent transversion in recent period. The Omicron variant had the highest genomic mutation rates, while Delta showed the highest dN/dS ratio. Protein-wise dN/dS ratio was also seen to vary across the different variants.IMPORTANCETo the best of our knowledge, there exists no other large-scale study of the genomic and protein-wise mutation patterns during the time course of evolution in different countries. Analyzing the SARS-CoV-2 evolutionary patterns in view of the varying spatial, temporal, and biological signals is important for diagnostics, therapeutics, and pharmacovigilance of SARS-CoV-2.
Collapse
Affiliation(s)
- Shubhangi Gupta
- Department of Biological Sciences and Engineering, Computational and Structural Biology Laboratory, Netaji Subhas University of Technology, Dwarka, New Delhi, India
| | - Deepanshu Gupta
- Division of Biotechnology, Computational and Structural Biology Laboratory, Netaji Subhas Institute of Technology, Dwarka, New Delhi, India
| | - Sonika Bhatnagar
- Department of Biological Sciences and Engineering, Computational and Structural Biology Laboratory, Netaji Subhas University of Technology, Dwarka, New Delhi, India
- Division of Biotechnology, Computational and Structural Biology Laboratory, Netaji Subhas Institute of Technology, Dwarka, New Delhi, India
| |
Collapse
|
2
|
Gowane GR, Sharma P, Kumar R, Misra SS, Alex R, Vohra V, Chhotaray S, Sharma N, Chopra A, Kandalkar Y, Choudhary A, Magotra A. Population-wide genetic analysis of Ovar-DQA1 and DQA2 loci across sheep breeds in India revealed their evolutionary importance and fitness of sheep in a tropical climate. Anim Biotechnol 2023; 34:4645-4657. [PMID: 36847639 DOI: 10.1080/10495398.2023.2180010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/01/2023]
Abstract
Genetic variability at the major histocompatibility complex (MHC) is important in any species due to significant role played by MHC for antigen presentation. DQA locus has not been studied for its genetic variability across sheep population in India. In the present study, MHC of sheep at DQA1 and DQA2 loci were evaluated across 17 Indian sheep breeds. Results revealed high degree of heterozygosity (10.34% to 100% for DQA1 and 37.39 to 100% for DQA2). 18 DQA1 alleles and 22 DQA2 alleles were isolated in different breeds. Nucleotide content for DQA region revealed richness of AT content (54.85% for DQA1 and 53.89% for DQA2). DQA1 and DQA2 sequences clustered independently. We could see evidence of divergence of DQA as DQA1 and DQA2 across sheep breeds. Wu-Kabat variability index revealed vast genetic variation across DQA1 and DQA2, specifically at peptide binding sites (PBS) that consisted 21 residues for DQA1 and 17 residues for DQA2. Evolutionary analysis revealed the presence of positive and balancing selection for DQA1 locus, however DQA2 was under purifying selection across sheep breeds. Higher heterozygosity and large diversity at both loci especially at PBS indicated the fitness of the sheep population for evading pathogens and adapt to the harsh tropical climate.
Collapse
Affiliation(s)
- G R Gowane
- Animal Genetics and Breeding Division, ICAR-National Dairy Research Institute, Karnal, India
| | - Priya Sharma
- Animal Genetics and Breeding Division, ICAR-National Dairy Research Institute, Karnal, India
| | - Rajiv Kumar
- Animal Genetics and Breeding Division, ICAR-Central Sheep and Wool Research Institute, Avikanagar
| | - S S Misra
- Animal Genetics and Breeding Division, ICAR-Central Sheep and Wool Research Institute, Avikanagar
| | - Rani Alex
- Animal Genetics and Breeding Division, ICAR-National Dairy Research Institute, Karnal, India
| | - V Vohra
- Animal Genetics and Breeding Division, ICAR-National Dairy Research Institute, Karnal, India
| | - S Chhotaray
- Animal Genetics and Breeding Division, ICAR-National Dairy Research Institute, Karnal, India
| | - Nikita Sharma
- Animal Health Section, ICAR-Central Institute for Research on Goats, Makhdoom, India
| | - Ashish Chopra
- Animal Genetics and Breeding Division, ICAR-Arid Region Campus, Central Sheep and Wool Research Institute, Bikaner, India
| | - Yogesh Kandalkar
- Deccani Sheep Breeding Unit, NWPSI at Mahatma Phule Krishi Vidyapith, Rahuri, India
| | | | - Ankit Magotra
- Animal Genetics and Breeding Division, Chaudhary Charan Singh Haryana Agricultural University, Hisar, India
| |
Collapse
|
3
|
Gowane GR, Sharma P, Kumar R, Misra SS, Alex R, Vohra V, Chhotaray S, Dass G, Chopra A, Kandalkar Y, Vijay V, Choudhary A, Magotra A, Rajendran R. Cross-population genetic analysis revealed genetic variation and selection in the Ovar-DRB1 gene of Indian sheep breeds. Anim Biotechnol 2023; 34:2928-2939. [PMID: 36153754 DOI: 10.1080/10495398.2022.2125404] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/01/2022]
Abstract
In sheep, MHC variability is studied widely to explore disease association. The aim of the current study was to explore the genetic diversity of Ovar-DRB diversity across sheep breeds of India. Here, Ovar-DRB1 locus was studied across 20 sheep breeds. DRB1 was amplified (301 bp) and sequenced using a PCR-sequence-based typing approach. Results revealed a high degree of heterozygosity across breeds (mean: 73.99%). Overall mean distance for DRB1 was highest in Sangamneri (0.18) and lowest in Madgyal sheep (0.10). There was a higher rate of transition, across breeds. Further, 39 alleles were isolated in different breeds, out of which 10 were new. To allow easy access and use of the immune-polymorphic database, an online database management system was launched (http://www.mhcdbms.in/). Nucleotide content across breeds for the DRB1 region revealed the richness of GC content (59.26%). Wu-Kabat index revealed vast genetic variation across peptide binding sites (PBS) of DRB1. Residues 6, 66, 69, 52, and 81, were polymorphic showing utility for antigen presentation. All breeds were under positive selection for DRB1 locus (dN > dS). Study revealed the importance of DRB locus diversity for beta chain specifically at PBS across sheep breeds of the Indian subcontinent and presented evidence of positive selection for DRB owing to its evolutionary significance.
Collapse
Affiliation(s)
- G R Gowane
- Animal Genetics & Breeding Division, ICAR-National Dairy Research Institute, Karnal, India
| | - Priya Sharma
- Animal Genetics & Breeding Division, ICAR-National Dairy Research Institute, Karnal, India
| | - Rajiv Kumar
- Animal Genetics & Breeding Division, ICAR-Central Sheep & Wool Research Institute, Avikanagar, India
| | - S S Misra
- Animal Genetics & Breeding Division, ICAR-Central Sheep & Wool Research Institute, Avikanagar, India
| | - Rani Alex
- Animal Genetics & Breeding Division, ICAR-National Dairy Research Institute, Karnal, India
| | - V Vohra
- Animal Genetics & Breeding Division, ICAR-National Dairy Research Institute, Karnal, India
| | - S Chhotaray
- Animal Genetics & Breeding Division, ICAR-National Dairy Research Institute, Karnal, India
| | - Gopal Dass
- Animal Genetics & Breeding Division, ICAR-Central Institute for Research on Goats, Makhdoom, India
| | - Ashish Chopra
- Animal Genetics & Breeding Division, ICAR-Arid Region Campus, Central Sheep & Wool Research Institute Bikaner, Avikanagar, India
| | - Yogesh Kandalkar
- Deccani Sheep Breeding Unit, NWPSI at Mahatma Phule Krishi Vidyapith, Rahuri, India
| | - V Vijay
- Sonadi Seep Breeding Unit, NWPSI at Navaniya Maharana Pratap University of Agriculture and Technology, Udaipur, India
| | | | - Ankit Magotra
- Animal Genetics & Breeding Division, Chaudhary Charan Singh Haryana Agricultural University, Hisar, India
| | - R Rajendran
- Veterinary College and Research Institute, Tamil Nadu Veterinary and Animal Sciences University, Theni, India
| |
Collapse
|
4
|
Shishir TA, Jannat T, Naser IB. Genomic surveillance unfolds the SARS-CoV-2 transmission and divergence dynamics in Bangladesh. Front Genet 2022; 13:966939. [PMID: 36226176 PMCID: PMC9548531 DOI: 10.3389/fgene.2022.966939] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2022] [Accepted: 08/23/2022] [Indexed: 11/13/2022] Open
Abstract
The highly pathogenic virus SARS-CoV-2 has shattered the healthcare system of the world causing the COVID-19 pandemic since first detected in Wuhan, China. Therefore, scrutinizing the genome structure and tracing the transmission of the virus has gained enormous interest in designing appropriate intervention strategies to control the pandemic. In this report, we examined 4,622 sequences from Bangladesh and found that they belonged to thirty-five major PANGO lineages, while Delta alone accounted for 39%, and 78% were from just four primary lineages. Our research has also shown Dhaka to be the hub of viral transmission and observed the virus spreading back and forth across the country at different times by building a transmission network. The analysis resulted in 7,659 unique mutations, with an average of 24.61 missense mutations per sequence. Moreover, our analysis of genetic diversity and mutation patterns revealed that eight genes were under negative selection pressure to purify deleterious mutations, while three genes were under positive selection pressure. Together with an ongoing genomic surveillance program, these data will contribute to a better understanding of SARS-CoV-2, as well as its evolution pattern and pandemic characteristics in Bangladesh.
Collapse
Affiliation(s)
- Tushar Ahmed Shishir
- Department of Mathematics and Natural Sciences, BRAC University, Dhaka, Bangladesh
- Rangamati General Hospital, Chattogram, Bangladesh
| | - Taslimun Jannat
- Department of Mathematics and Natural Sciences, BRAC University, Dhaka, Bangladesh
| | - Iftekhar Bin Naser
- Department of Mathematics and Natural Sciences, BRAC University, Dhaka, Bangladesh
- *Correspondence: Iftekhar Bin Naser,
| |
Collapse
|
5
|
Pereira AG, Grizante MB, Kohlsdorf T. What snakes and caecilians have in common? Molecular interaction units and the independent origins of similar morphotypes in Tetrapoda. Proc Biol Sci 2022; 289:20220841. [PMID: 35975445 PMCID: PMC9382212 DOI: 10.1098/rspb.2022.0841] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2022] [Accepted: 07/25/2022] [Indexed: 12/14/2022] Open
Abstract
Developmental pathways encompass transcription factors and cis-regulatory elements that interact as transcription factor-regulatory element (TF-RE) units. Independent origins of similar phenotypes likely involve changes in different parts of these units, a hypothesis promisingly tested addressing the evolution of the rib-associated lumbar (RAL) morphotype that characterizes emblematic animals such as snakes and elephants. Previous investigation in these lineages identified a polymorphism in the Homology region 1 [H1] enhancer of the Myogenic factor-5 [Myf5], which interacts with HOX10 proteins to modulate rib development. Here we address the evolution of TF-RE units focusing on independent origins of RAL morphotypes. We compiled an extensive database for H1-Myf5 and HOX10 sequences with two goals: (i) evaluate if the enhancer polymorphism is present in amphibians exhibiting the RAL morphotype and (ii) test a hypothesis of enhanced evolutionary flexibility mediated by TF-RE units, according to which independent origins of the RAL morphotype might involve changes in either component of the interaction unit. We identified the H1-Myf5 polymorphism in lineages that diverged around 340 Ma, including Lissamphibia. Independent origins of the RAL morphotype in Tetrapoda involved sequence variation in either component of the TF-RE unit, confirming that different changes may similarly affect the phenotypic outcome of a given developmental pathway.
Collapse
Affiliation(s)
- Anieli G. Pereira
- Department of Biology, FFCLRP, University of São Paulo, Ribeirão Preto, São Paulo, Brazil
| | - Mariana B. Grizante
- Department of Biology, FFCLRP, University of São Paulo, Ribeirão Preto, São Paulo, Brazil
- Dante Pazzanese Institute of Cardiology, Sao Paulo, Brazil
| | - Tiana Kohlsdorf
- Department of Biology, FFCLRP, University of São Paulo, Ribeirão Preto, São Paulo, Brazil
| |
Collapse
|
6
|
Srivastava A, Pandey V, Sahu AK, Yadav D, Al-Sadi AM, Shahid MS, Gaur RK. Evolutionary Dynamics of Begomoviruses and Its Satellites Infecting Papaya in India. Front Microbiol 2022; 13:879413. [PMID: 35685936 PMCID: PMC9171567 DOI: 10.3389/fmicb.2022.879413] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2022] [Accepted: 04/06/2022] [Indexed: 11/13/2022] Open
Abstract
The genus Begomovirus represents a group of multipartite viruses that significantly damage many agricultural crops, including papaya, and influence overall production. Papaya leaf curl disease (PaLCD) caused by the complex begomovirus species has several important implications and substantial losses in papaya production in many developing countries, including India. The increase in the number of begomovirus species poses a continuous threat to the overall production of papaya. Here, we attempted to map the genomic variation, mutation, evolution rate, and recombination to know the disease complexity and successful adaptation of PaLCD in India. For this, we retrieved 44 DNA-A and 26 betasatellite sequences from GenBank reported from India. An uneven distribution of evolutionary divergence has been observed using the maximum-likelihood algorithm across the branch length. Although there were phylogenetic differences, we found high rates of nucleotide substitution mutation in both viral and sub-viral genome datasets. We demonstrated frequent recombination of begomovirus species, with a maximum in intra-species recombinants. Furthermore, our results showed a high degree of genetic variability, demographic selection, and mean substitution rate acting on the population, supporting the emergence of a diverse and purifying selection of viruses and associated betasatellites. Moreover, variation in the genetic composition of all begomovirus datasets revealed a predominance of nucleotide diversity principally driven by mutation, which might further accelerate the advent of new strains and species and their adaption to various hosts with unique pathogenicity. Therefore, the finding of genetic variation and selection emphases on factors that contribute to the universal spread and evolution of Begomovirus and this unanticipated diversity may also provide guidelines toward future evolutionary trend analyses and the development of wide-ranging disease control strategies for begomoviruses associated with PaLCD.
Collapse
Affiliation(s)
- Aarshi Srivastava
- Department of Biotechnology, D.D.U. Gorakhpur University, Gorakhpur, India
| | - Vineeta Pandey
- Department of Biotechnology, D.D.U. Gorakhpur University, Gorakhpur, India
| | - Anurag Kumar Sahu
- International Center for Genetic Engineering and Biotechnology, New Delhi, India
| | - Dinesh Yadav
- Department of Biotechnology, D.D.U. Gorakhpur University, Gorakhpur, India
| | - Abdullah M. Al-Sadi
- Department of Plant Sciences, College of Agricultural and Marine Sciences, Sultan Qaboos University, Al-Khod, Oman
| | - Muhammad Shafiq Shahid
- Department of Plant Sciences, College of Agricultural and Marine Sciences, Sultan Qaboos University, Al-Khod, Oman
- *Correspondence: Muhammad Shafiq Shahid,
| | - R. K. Gaur
- Department of Biotechnology, D.D.U. Gorakhpur University, Gorakhpur, India
- R. K. Gaur,
| |
Collapse
|
7
|
Sonnleitner ST, Prelog M, Sonnleitner S, Hinterbichler E, Halbfurter H, Kopecky DBC, Almanzar G, Koblmüller S, Sturmbauer C, Feist L, Horres R, Posch W, Walder G. Cumulative SARS-CoV-2 mutations and corresponding changes in immunity in an immunocompromised patient indicate viral evolution within the host. Nat Commun 2022; 13:2560. [PMID: 35538074 PMCID: PMC9090742 DOI: 10.1038/s41467-022-30163-4] [Citation(s) in RCA: 58] [Impact Index Per Article: 29.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2021] [Accepted: 04/19/2022] [Indexed: 01/07/2023] Open
Abstract
Different scenarios explaining the emergence of novel variants of concern (VOC) of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) have been reported, including their evolution in scarcely monitored populations, in animals as alternative hosts, or in immunocompromised individuals. Here we report SARS-CoV-2 immune escape mutations over a period of seven months in an immunocompromised patient with prolonged viral shedding. Signs of infection, viral shedding and mutation events are periodically analyzed using RT-PCR and next-generation sequencing based on naso-pharyngeal swabs, with the results complemented by immunological diagnostics to determine humoral and T cell immune responses. Throughout the infection course, 17 non-synonymous intra-host mutations are noted, with 15 (88.2%) having been previously described as prominent immune escape mutations (S:E484K, S:D950N, S:P681H, S:N501Y, S:del(9), N:S235F and S:H655Y) in VOCs. The high frequency of these non-synonymous mutations is consistent with multiple events of convergent evolution. Thus, our results suggest that specific mutations in the SARS-CoV-2 genome may represent positions with a fitness advantage, and may serve as targets in future vaccine and therapeutics development for COVID-19.
Collapse
Affiliation(s)
- Sissy Therese Sonnleitner
- Infektiologie Tirol, Department of Virology, 9931, Unterwalden 30, Außervillgraten, Austria.
- Institute of Hygiene and Medical Microbiology, Medical University of Innsbruck, 6020, Innsbruck, Austria.
| | - Martina Prelog
- Pediatric Rheumatology/Special Immunology, Department of Pediatrics, University Hospital Wuerzburg, Josef-Schneider-Str. 2, Wuerzburg, Germany
| | - Stefanie Sonnleitner
- Infektiologie Tirol, Department of Virology, 9931, Unterwalden 30, Außervillgraten, Austria
| | - Eva Hinterbichler
- Infektiologie Tirol, Department of Virology, 9931, Unterwalden 30, Außervillgraten, Austria
| | - Hannah Halbfurter
- Infektiologie Tirol, Department of Virology, 9931, Unterwalden 30, Außervillgraten, Austria
| | - Dominik B C Kopecky
- Infektiologie Tirol, Department of Virology, 9931, Unterwalden 30, Außervillgraten, Austria
| | - Giovanni Almanzar
- Pediatric Rheumatology/Special Immunology, Department of Pediatrics, University Hospital Wuerzburg, Josef-Schneider-Str. 2, Wuerzburg, Germany
| | - Stephan Koblmüller
- Institute of Biology, University of Graz, Universitätsplatz 2, 8010, Graz, Austria
| | - Christian Sturmbauer
- Institute of Biology, University of Graz, Universitätsplatz 2, 8010, Graz, Austria
| | - Leonard Feist
- GenXPro GmbH, Altenhoeferallee 3, 60438, Frankfurt am Main, Germany
| | - Ralf Horres
- GenXPro GmbH, Altenhoeferallee 3, 60438, Frankfurt am Main, Germany
| | - Wilfried Posch
- Institute of Hygiene and Medical Microbiology, Medical University of Innsbruck, 6020, Innsbruck, Austria
| | - Gernot Walder
- Infektiologie Tirol, Department of Virology, 9931, Unterwalden 30, Außervillgraten, Austria
| |
Collapse
|
8
|
Selection signatures in melanocortin-1 receptor gene of turkeys (Meleagris gallopavo) raised in hot humid tropics. Trop Anim Health Prod 2022; 54:183. [PMID: 35525911 DOI: 10.1007/s11250-022-03185-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2021] [Accepted: 04/28/2022] [Indexed: 10/18/2022]
Abstract
Feather colours are used by avian species for defense, adaptation and signaling. Melanocortin-1 receptor (MC1R) gene is one of the genes responsible for feather colour. This study identified selection signatures in MC1R gene of Nigerian indigenous turkeys (NIT) using British United turkeys (BUT) as control breed to investigate the evolutionary processes that have shaped NIT with various feather colours. Complete MC1R gene of 146 NIT (76 males and 70 females) and 32 BUT (18 males and 14 females) were sequenced. Transition/transversion and codon usage biases were predicted using MEGA v6 software. The selective force acting on the gene was predicted using HyPhy software. The FST values were estimated using Arlequin v3.5. The highest transition/transversion bias was predicted for white BUT (1.00) while the lowest was predicted for black NIT (0.50). Negative dN-dS values, indicative of purifying selection, were observed in MC1R gene of all the turkeys. The highest pairwise FST was observed between the MC1R gene of white BUT and black NIT while the least was observed between lavender NIT and white NIT. No recombination event was observed in black NIT and white BUT. The relative synonymous codon usage was the same among different colours for some codons. Presence of purifying selection in MC1R gene of all the turkeys with different feather colours confirms that the gene plays role in many biological processes such as feather colouration, behaviour, pain perception, immunity, growth and adaptation. The results also suggested that the genetic mechanisms generating different feather colours in turkeys are conserved.
Collapse
|
9
|
Sen P, Aziz R, Deka RC, Feil EJ, Ray SK, Satapathy SS. Stem Region of tRNA Genes Favors Transition Substitution Towards Keto Bases in Bacteria. J Mol Evol 2022; 90:114-123. [PMID: 35084523 DOI: 10.1007/s00239-021-10045-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2021] [Accepted: 12/21/2021] [Indexed: 11/25/2022]
Abstract
Transversion and transition mutations have variable effects on the stability of RNA secondary structure considering that the former destabilizes the double helix geometry to a greater extent by introducing purine:purine (R:R) or pyrimidine:pyrimidine (Y:Y) base pairs. Therefore, transversion frequency is likely to be lower than that of transition in the secondary structure regions of RNA genes. Here, we performed an analysis of transition and transversion frequencies in tRNA genes defined well with secondary structure and compared with the intergenic regions in five bacterial species namely Escherichia coli, Klebsiella pneumoniae, Salmonella enterica, Staphylococcus aureus and Streptococcus pneumoniae using a large genome sequence data set. In general, the transversion frequency was observed to be lower than that of transition in both tRNA genes and intergenic regions. The transition to transversion ratio was observed to be greater in tRNA genes than that in the intergenic regions in all the five bacteria that we studied. Interestingly, the intraspecies base substitution analysis in tRNA genes revealed that non-compensatory substitutions were more frequent than compensatory substitutions in the stem region. Further, transition to transversion ratio in the loop region was observed to be significantly lesser than that among the non-compensatory substitutions in the stem region. This indicated that the transversion is more deleterious than transition in the stem regions. In addition, substitutions from amino bases (A/C) to keto bases (G/T) were also observed to be more than the reverse substitutions in the stem region. Substitution from amino bases to keto bases are likely to facilitate the stable G:U pairing unlike the reverse substitution that facilitates the unstable A:C pairing in the stem region of tRNA. This work provides additional support that the secondary structure of tRNA molecule is what drives the different substitutions in its gene sequence.
Collapse
Affiliation(s)
- Piyali Sen
- Department of Computer Science and Engineering, Tezpur University, Napaam, Tezpur, Assam, 784028, India
| | - Ruksana Aziz
- Molecular Biology and Biotechnology, Tezpur University, Napaam, Tezpur, Assam, 784028, India
| | - Ramesh C Deka
- Chemical Sciences, Tezpur University, Napaam, Tezpur, Assam, 784028, India
- Center for Multidisciplinary Research, Tezpur University, Napaam, Tezpur, Assam, 784028, India
| | - Edward J Feil
- Department of Biology and Biochemistry, The Milner Centre for Evolution, University of Bath, Bath, BA2 7AY, UK.
| | - Suvendra Kumar Ray
- Molecular Biology and Biotechnology, Tezpur University, Napaam, Tezpur, Assam, 784028, India.
- Center for Multidisciplinary Research, Tezpur University, Napaam, Tezpur, Assam, 784028, India.
| | - Siddhartha Sankar Satapathy
- Department of Computer Science and Engineering, Tezpur University, Napaam, Tezpur, Assam, 784028, India.
- Center for Multidisciplinary Research, Tezpur University, Napaam, Tezpur, Assam, 784028, India.
| |
Collapse
|
10
|
Sonnleitner ST, Sonnleitner S, Hinterbichler E, Halbfurter H, Kopecky DB, Koblmüller S, Sturmbauer C, Posch W, Walder G. The mutational dynamics of the SARS-CoV-2 virus in serial passages in vitro. Virol Sin 2022; 37:198-207. [PMID: 35277373 PMCID: PMC8800542 DOI: 10.1016/j.virs.2022.01.029] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2021] [Accepted: 01/21/2022] [Indexed: 11/04/2022] Open
Abstract
Since its outbreak in 2019, Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) keeps surprising the medical community by evolving diverse immune escape mutations in a rapid and effective manner. To gain deeper insight into mutation frequency and dynamics, we isolated ten ancestral strains of SARS-CoV-2 and performed consecutive serial incubation in ten replications in a suitable and common cell line and subsequently analysed them using RT-qPCR and whole genome sequencing. Along those lines we hoped to gain fundamental insights into the evolutionary capacity of SARS-CoV-2 in vitro. Our results identified a series of adaptive genetic changes, ranging from unique convergent substitutional mutations and hitherto undescribed insertions. The region coding for spike proved to be a mutational hotspot, evolving a number of mutational changes including the already known substitutions at positions S:484 and S:501. We discussed the evolution of all specific adaptations as well as possible reasons for the seemingly inhomogeneous potential of SARS-CoV-2 in the adaptation to cell culture. The combination of serial passage in vitro with whole genome sequencing uncovers the immense mutational potential of some SARS-CoV-2 strains. The observed genetic changes of SARS-CoV-2 in vitro could not be explained solely by selectively neutral mutations but possibly resulted from the action of directional selection accumulating favourable genetic changes in the evolving variants, along the path of increasing potency of the strain. Competition among a high number of quasi-species in the SARS-CoV-2 in vitro population gene pool may reinforce directional selection and boost the speed of evolutionary change. Ten genetically similar strains evolved very differently in serial passage in vitro. Observed mutations included substitutions at important spike positions. The three strains with the highest replication rates developed two convergent mutations. Via directional selection favourable genetic changes are accumulated. Competition among many quasi-species boosts the speed of evolutionary change.
Collapse
|
11
|
Intraspecific DNA Barcoding and Variation Analysis for Citri Reticulatae Pericarpium of Citrus reticulata "Chachi". EVIDENCE-BASED COMPLEMENTARY AND ALTERNATIVE MEDICINE 2021; 2021:2609935. [PMID: 34925527 PMCID: PMC8677393 DOI: 10.1155/2021/2609935] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/26/2021] [Revised: 11/22/2021] [Accepted: 11/24/2021] [Indexed: 11/18/2022]
Abstract
Citri Reticulatae Pericarpium, the desiccative mature peel of Citrus reticulata Blanco or its cultivated varieties, is a national geographical indicated product that has the concomitant function of both medicine and foodstuff. The primary source of Citri Reticulatae Pericarpium is Citrus reticulata “Chachi,” called “Guang chenpi,” while it differs in variety, propagation, grafting rootstock, and tree age, and the hereditary stability of its biological information between intraspecific plants is worthy of our attention. Homologous analysis result of 4 DNA barcodings in the ribosome or the chloroplast showed that the homology of them (ITS2, rbcl, matK, and psbA-trnH) of 22 samples was 100.00%, 99.97%, 99.99%, and 99.81%, respectively, which indicated that 4 DNA barcodes maintained a high degree of genetic stability in Citrus reticulata “Chachi.” Also, ITS2 was considered to identify Citrus reticulata “Chachi” from other varieties because it presented not only low variability within a certain taxon but also a high level of interspecies variability. Simultaneously, variant site detection of Citrus reticulata “Chachi” was analyzed by comparing with the reference Citrus reticulata genome, and 2652697 SNP sites and 533906 InDel sites were detected from whole-genome resequencing data of 22 samples, providing the data resources and theoretical foundation for the future study about the relevant molecular makers of “Guang chenpi.”
Collapse
|
12
|
Louca S, McLaughlin A, MacPherson A, Joy JB, Pennell MW. Fundamental Identifiability Limits in Molecular Epidemiology. Mol Biol Evol 2021; 38:4010-4024. [PMID: 34009339 PMCID: PMC8382926 DOI: 10.1093/molbev/msab149] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
Viral phylogenies provide crucial information on the spread of infectious diseases, and many studies fit mathematical models to phylogenetic data to estimate epidemiological parameters such as the effective reproduction ratio (Re) over time. Such phylodynamic inferences often complement or even substitute for conventional surveillance data, particularly when sampling is poor or delayed. It remains generally unknown, however, how robust phylodynamic epidemiological inferences are, especially when there is uncertainty regarding pathogen prevalence and sampling intensity. Here, we use recently developed mathematical techniques to fully characterize the information that can possibly be extracted from serially collected viral phylogenetic data, in the context of the commonly used birth-death-sampling model. We show that for any candidate epidemiological scenario, there exists a myriad of alternative, markedly different, and yet plausible "congruent" scenarios that cannot be distinguished using phylogenetic data alone, no matter how large the data set. In the absence of strong constraints or rate priors across the entire study period, neither maximum-likelihood fitting nor Bayesian inference can reliably reconstruct the true epidemiological dynamics from phylogenetic data alone; rather, estimators can only converge to the "congruence class" of the true dynamics. We propose concrete and feasible strategies for making more robust epidemiological inferences from viral phylogenetic data.
Collapse
Affiliation(s)
- Stilianos Louca
- Department of Biology, University of Oregon, Eugene, OR, USA
- Institute of Ecology and Evolution, University of Oregon, Eugene, OR, USA
| | - Angela McLaughlin
- British Columbia Centre for Excellence in HIV/AIDS, Vancouver, BC, Canada
- Bioinformatics, University of British Columbia, Vancouver, BC, Canada
| | - Ailene MacPherson
- Biodiversity Research Centre, University of British Columbia, Vancouver, BC, Canada
- Department of Zoology, University of British Columbia, Vancouver, BC, Canada
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, ON, Canada
| | - Jeffrey B Joy
- British Columbia Centre for Excellence in HIV/AIDS, Vancouver, BC, Canada
- Bioinformatics, University of British Columbia, Vancouver, BC, Canada
- Department of Medicine, University of British Columbia, Vancouver, BC, Canada
| | - Matthew W Pennell
- Biodiversity Research Centre, University of British Columbia, Vancouver, BC, Canada
- Department of Zoology, University of British Columbia, Vancouver, BC, Canada
| |
Collapse
|
13
|
Relevant SARS-CoV-2 Genome Variation through Six Months of Worldwide Monitoring. BIOMED RESEARCH INTERNATIONAL 2021; 2021:5553173. [PMID: 34258267 PMCID: PMC8241501 DOI: 10.1155/2021/5553173] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/06/2021] [Revised: 05/31/2021] [Accepted: 06/15/2021] [Indexed: 01/11/2023]
Abstract
Real-time genome monitoring of the SARS-CoV-2 pandemic outbreak is of utmost importance for designing diagnostic tools, guiding antiviral treatment and vaccination strategies. In this study, we present an accurate method for temporal and geographical comparison of mutational events based on GISAID database genome sequencing. Among 42523 SARS-CoV-2 genomes analyzed, we found 23202 variants compared to the reference genome. The Ti/Tv (transition/transversion) ratio was used to filter out possible false-positive errors. Transition mutations generally occurred more frequently than transversions. Our clustering analysis revealed remarkable hotspot mutation patterns for SARS-CoV-2. Mutations were clustered based on how their frequencies changed over time according to each geographical location. We observed some clusters showing a clear variation in mutation frequency and continuously evolving in the world. However, many mutations appeared in specific periods without a clear pattern over time. Various important nonsynonymous mutations were observed, mainly in Oceania and Asia. More than half of these mutations were observed only once. Four hotspot mutations were found in all geographical locations at least once: T265I (NSP2), P314L (NSP12), D614G (S), and Q57H (ORF3a). The current analysis of SARS-CoV-2 genomes provides valuable information on the geographical and temporal mutational evolution of SARS-CoV-2.
Collapse
|
14
|
Frutos R, Serra-Cobo J, Pinault L, Lopez Roig M, Devaux CA. Emergence of Bat-Related Betacoronaviruses: Hazard and Risks. Front Microbiol 2021; 12:591535. [PMID: 33790874 PMCID: PMC8005542 DOI: 10.3389/fmicb.2021.591535] [Citation(s) in RCA: 28] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2020] [Accepted: 02/15/2021] [Indexed: 01/08/2023] Open
Abstract
The current Coronavirus Disease 2019 (COVID-19) pandemic, with more than 111 million reported cases and 2,500,000 deaths worldwide (mortality rate currently estimated at 2.2%), is a stark reminder that coronaviruses (CoV)-induced diseases remain a major threat to humanity. COVID-19 is only the latest case of betacoronavirus (β-CoV) epidemics/pandemics. In the last 20 years, two deadly CoV epidemics, Severe Acute Respiratory Syndrome (SARS; fatality rate 9.6%) and Middle East Respiratory Syndrome (MERS; fatality rate 34.7%), plus the emergence of HCoV-HKU1 which causes the winter common cold (fatality rate 0.5%), were already a source of public health concern. Betacoronaviruses can also be a threat for livestock, as evidenced by the Swine Acute Diarrhea Syndrome (SADS) epizootic in pigs. These repeated outbreaks of β-CoV-induced diseases raise the question of the dynamic of propagation of this group of viruses in wildlife and human ecosystems. SARS-CoV, SARS-CoV-2, and HCoV-HKU1 emerged in Asia, strongly suggesting the existence of a regional hot spot for emergence. However, there might be other regional hot spots, as seen with MERS-CoV, which emerged in the Arabian Peninsula. β-CoVs responsible for human respiratory infections are closely related to bat-borne viruses. Bats are present worldwide and their level of infection with CoVs is very high on all continents. However, there is as yet no evidence of direct bat-to-human coronavirus infection. Transmission of β-CoV to humans is considered to occur accidentally through contact with susceptible intermediate animal species. This zoonotic emergence is a complex process involving not only bats, wildlife and natural ecosystems, but also many anthropogenic and societal aspects. Here, we try to understand why only few hot spots of β-CoV emergence have been identified despite worldwide bats and bat-borne β-CoV distribution. In this work, we analyze and compare the natural and anthropogenic environments associated with the emergence of β-CoV and outline conserved features likely to create favorable conditions for a new epidemic. We suggest monitoring South and East Africa as well as South America as these regions bring together many of the conditions that could make them future hot spots.
Collapse
Affiliation(s)
- Roger Frutos
- Centre de coopération Internationale en Recherche Agronomique pour le Développement, UMR 17, Intertryp, Montpellier, France.,Institut d'Électronique et des Systèmes, UMR 5214, Université de Montpellier-CNRS, Montpellier, France
| | - Jordi Serra-Cobo
- Department of Evolutionary Biology, Ecology and Environmental Sciences, University of Barcelona, Biodiversity Research Institute, Barcelona, Spain
| | - Lucile Pinault
- Aix Marseille University, IRD, APHM, MEPHI, IHU-Méditerranée Infection, Marseille, France
| | - Marc Lopez Roig
- Department of Evolutionary Biology, Ecology and Environmental Sciences, University of Barcelona, Biodiversity Research Institute, Barcelona, Spain
| | - Christian A Devaux
- Aix Marseille University, IRD, APHM, MEPHI, IHU-Méditerranée Infection, Marseille, France.,Centre National de la Recherche Scientifique, Marseille, France
| |
Collapse
|
15
|
Abraham JS, Somasundaram S, Maurya S, Gupta R, Makhija S, Toteja R. Characterization of Euplotes lynni nov. spec., E. indica nov. spec. and description of E. aediculatus and E. woodruffi (Ciliophora, Euplotidae) using an integrative approach. Eur J Protistol 2021; 79:125779. [PMID: 33706203 DOI: 10.1016/j.ejop.2021.125779] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2020] [Revised: 01/14/2021] [Accepted: 01/23/2021] [Indexed: 11/26/2022]
Abstract
Four species belonging to the genus Euplotes have been investigated, namely: E. lynni nov. spec., E. indica nov. spec., E. aediculatus, and E. woodruffi. All populations are from India and were investigated using morphological and molecular markers. The phylogenetic relationships were inferred from small subunit ribosomal rRNA gene (SSU rRNA), internal transcribed spacer (ITS) region, and mitochondrial cytochrome c oxidase subunit I (COI) gene. Predicted secondary structure models for two new species using the hypervariable region of the SSU rRNA gene and ITS2 region support the distinctness of both species. Morphological characters were subjected to principal component analysis (PCA) and genetic variations were studied in-depth to analyze the relatedness of the two new species with their congeners. An integrative approach combining morphological features, molecular analysis, and ecological characteristics was carried out to understand the phylogenetic position of the reported species within the different clades of the genus Euplotes.
Collapse
Affiliation(s)
- Jeeva Susan Abraham
- Ciliate Biology Laboratory, Acharya Narendra Dev College, University of Delhi, Govindpuri, Kalkaji, New Delhi 110019, India
| | - Sripoorna Somasundaram
- Ciliate Biology Laboratory, Acharya Narendra Dev College, University of Delhi, Govindpuri, Kalkaji, New Delhi 110019, India
| | - Swati Maurya
- Ciliate Biology Laboratory, Acharya Narendra Dev College, University of Delhi, Govindpuri, Kalkaji, New Delhi 110019, India
| | - Renu Gupta
- Maitreyi College, University of Delhi, Bapu Dham, Chanakyapuri, New Delhi 110021, India.
| | - Seema Makhija
- Ciliate Biology Laboratory, Acharya Narendra Dev College, University of Delhi, Govindpuri, Kalkaji, New Delhi 110019, India
| | - Ravi Toteja
- Ciliate Biology Laboratory, Acharya Narendra Dev College, University of Delhi, Govindpuri, Kalkaji, New Delhi 110019, India.
| |
Collapse
|
16
|
Icer Baykal PB, Lara J, Khudyakov Y, Zelikovsky A, Skums P. Quantitative differences between intra-host HCV populations from persons with recently established and persistent infections. Virus Evol 2020; 7:veaa103. [PMID: 33505710 PMCID: PMC7816669 DOI: 10.1093/ve/veaa103] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Detection of incident hepatitis C virus (HCV) infections is crucial for identification of outbreaks and development of public health interventions. However, there is no single diagnostic assay for distinguishing recent and persistent HCV infections. HCV exists in each infected host as a heterogeneous population of genomic variants, whose evolutionary dynamics remain incompletely understood. Genetic analysis of such viral populations can be applied to the detection of incident HCV infections and used to understand intra-host viral evolution. We studied intra-host HCV populations sampled using next-generation sequencing from 98 recently and 256 persistently infected individuals. Genetic structure of the populations was evaluated using 245,878 viral sequences from these individuals and a set of selected features measuring their diversity, topological structure, complexity, strength of selection, epistasis, evolutionary dynamics, and physico-chemical properties. Distributions of the viral population features differ significantly between recent and persistent infections. A general increase in viral genetic diversity from recent to persistent infections is frequently accompanied by decline in genomic complexity and increase in structuredness of the HCV population, likely reflecting a high level of intra-host adaptation at later stages of infection. Using these findings, we developed a machine learning classifier for the infection staging, which yielded a detection accuracy of 95.22 per cent, thus providing a higher accuracy than other genomic-based models. The detection of a strong association between several HCV genetic factors and stages of infection suggests that intra-host HCV population develops in a complex but regular and predictable manner in the course of infection. The proposed models may serve as a foundation of cyber-molecular assays for staging infection, which could potentially complement and/or substitute standard laboratory assays.
Collapse
Affiliation(s)
- Pelin B Icer Baykal
- Department of Computer Science, Georgia State University, 25 Park Place, Atlanta, GA 30302, USA
| | - James Lara
- Division of Viral Hepatitis, Centers for Disease Control and Prevention, 1600 Clifton Rd., Atlanta, GA 30329, USA
| | - Yury Khudyakov
- Division of Viral Hepatitis, Centers for Disease Control and Prevention, 1600 Clifton Rd., Atlanta, GA 30329, USA
| | - Alex Zelikovsky
- Department of Computer Science, Georgia State University, 25 Park Place, Atlanta, GA 30302, USA
| | - Pavel Skums
- Department of Computer Science, Georgia State University, 25 Park Place, Atlanta, GA 30302, USA
| |
Collapse
|
17
|
Alexandrov OS, Razumova OV, Karlov GI. A Comparative Study of 5S rDNA Non-Transcribed Spacers in Elaeagnaceae Species. PLANTS 2020; 10:plants10010004. [PMID: 33374528 PMCID: PMC7822202 DOI: 10.3390/plants10010004] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/26/2020] [Revised: 12/21/2020] [Accepted: 12/21/2020] [Indexed: 11/16/2022]
Abstract
5S rDNA is organized as a cluster of tandemly repeated monomers that consist of the conservative 120 bp coding part and non-transcribed spacers (NTSs) with different lengths and sequences among different species. The polymorphism in the 5S rDNA NTSs of closely related species is interesting for phylogenetic and evolutional investigations, as well as for the development of molecular markers. In this study, the 5S rDNA NTSs were amplified with universal 5S1/5S2 primers in some species of the Elaeagnaceae Adans. family. The polymerase chain reaction (PCR) products of five Elaeagnus species had similar lengths near 310 bp and were different from Shepherdia canadensis (L.) Nutt. and Sh. argentea (Pusch.) Nutt. samples (260 bp and 215 bp, respectively). The PCR products were cloned and sequenced. An analysis of the sequences revealed that intraspecific levels of NTS identity are high (approximately 95–96%) and similar in the Elaeagnus L. species. In Sh. argentea, this level was slightly lower due to the differences in the poly-T region. Moreover, the intergeneric and intervarietal NTS identity levels were studied and compared. Significant differences between species (except E. multiflora Thunb. and E. umbellata Thunb.) and genera were found. Herein, a range of the NTS features is discussed. This study is another step in the investigation of the molecular evolution of Elaeagnaceae and may be useful for the development of species-specific DNA markers in this family.
Collapse
|
18
|
Heo JI, Yu J, Choi H, Kim KH. The Signatures of Natural Selection and Molecular Evolution in Fusarium graminearum Virus 1. Front Microbiol 2020; 11:600775. [PMID: 33281800 PMCID: PMC7688778 DOI: 10.3389/fmicb.2020.600775] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2020] [Accepted: 10/27/2020] [Indexed: 11/21/2022] Open
Abstract
Fusarium graminearum virus 1 (FgV1) is a positive-sense ssRNA virus that confers hypovirulence in its fungal host, Fusarium graminearum. Like most mycoviruses, FgV1 exists in fungal cells, lacks an extracellular life cycle, and is therefore transmitted during sporulation or hyphal anastomosis. To understand FgV1 evolution and/or adaptation, we conducted mutation accumulation (MA) experiments by serial passage of FgV1 alone or with FgV2, 3, or 4 in F. graminearum. We expected that the effects of positive selection would be highly limited because of repeated bottleneck events. To determine whether selection on the virus was positive, negative, or neutral, we assessed both the phenotypic traits of the host fungus and the RNA sequences of FgV1. We inferred that there was positive selection on beneficial mutations in FgV1 based on the ratio of non-synonymous to synonymous substitutions (dN/dS), on the ratio of radical to conservation amino acid replacements (pNR/pNC), and by changes in the predicted protein structures. In support of this inference, we found evidence of positive selection only in the open reading frame 4 (ORF4) protein of DK21/FgV1 (MA line 1); mutations at amino acids 163A and 289H in the ORF4 of MA line 1 affected the entire structure of the protein predicted to be under positive selection. We also found, however, that deleterious mutations were a major driving force in viral evolution during serial passages. Linear relationships between changes in viral fitness and the number of mutations in each MA line demonstrated that some deleterious mutations resulted in fitness decline. Several mutations in MA line 1 were not shared with any of the other four MA lines (PH-1/FgV1, PH-1/FgV1 + 2, PH-1/FgV1 + 3, and PH-1/FgV1 + 4). This suggests that evolutionary pathways of the virus could differ with respect to hosts and also with respect to co-infecting viruses. The data also suggested that the differences among MA lines might also be explained by mutational robustness and other unidentified factors. Additional research is needed to clarify the effects of virus co-infection on the adaptation or evolution of FgV1 to its environments.
Collapse
Affiliation(s)
- Jeong-In Heo
- Department of Agricultural Biotechnology, College of Agriculture and Life Sciences, Seoul National University, Seoul, South Korea
| | - Jisuk Yu
- Plant Genomics and Breeding Institute, Seoul National University, Seoul, South Korea
| | - Hoseong Choi
- Research Institute of Agriculture and Life Sciences, Seoul National University, Seoul, South Korea
| | - Kook-Hyung Kim
- Department of Agricultural Biotechnology, College of Agriculture and Life Sciences, Seoul National University, Seoul, South Korea.,Plant Genomics and Breeding Institute, Seoul National University, Seoul, South Korea.,Research Institute of Agriculture and Life Sciences, Seoul National University, Seoul, South Korea
| |
Collapse
|
19
|
Le NH, Nahrung HF, Morgan JAT, Ogbourne S, Lawson SA. Molecular markers reveal diversity in composition of Megastigmus (Hymenoptera: Megastigmidae) from eucalypt galls. Ecol Evol 2020; 10:11565-11578. [PMID: 33144984 PMCID: PMC7593149 DOI: 10.1002/ece3.6791] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2020] [Revised: 08/19/2020] [Accepted: 08/20/2020] [Indexed: 11/25/2022] Open
Abstract
Since outbreaks of the invasive blue gum chalcids Leptocybe spp. began, the genus Megastigmus (Hymenoptera: Megastigmidae) has been increasingly studied as containing potential biocontrol agents against these pests. Megastigmus species have been collected and described from Australia, the presumed origin of Leptocybe spp., with M. zvimendeli and M. lawsoni reported as Leptocybe spp. parasitoids established outside of Australia. Parasitic Megastigmus have been reported to occur locally in the Neotropics, Afrotropic, Palearctic, and Indomalaya biogeographic realms, and in many cases described as new to science. However, molecular tools have not been used in studying parasitic Megastigmus, and difficulties in morphological taxonomy have compromised further understanding of eucalypt-associated Megastigmus as well as the Megastigmus-Leptocybe association. In this study, we used molecular markers to study the species composition and phylogeny of Megastigmus collected from eucalypt galls in Australia and from Leptocybe spp. galls from South Africa, Kenya, Israel, China, and Vietnam. We record thirteen discrete species and a species complex associated with eucalypt galls. A summary of morphological characters is provided to assist morphological delimitation of the studied group. A phylogeny based on 28S rDNA identified species groups of importance to Leptocybe spp. biocontrol agents from four clades with nine species. Relationships between Megastigmus from eucalypt galls and their phytophagous congeners were unresolved. Further molecular work is needed to clarify the identity of many species.
Collapse
Affiliation(s)
- Ngoc Hoan Le
- Forest Industries Research CentreUniversity of the Sunshine CoastSippy DownsQLDAustralia
| | - Helen F. Nahrung
- Forest Industries Research CentreUniversity of the Sunshine CoastSippy DownsQLDAustralia
| | - Jess A. T. Morgan
- Department of Agriculture and FisheriesEcoSciences PrecinctBrisbaneQLDAustralia
| | - Steven Ogbourne
- GeneCology Research CentreSchool of Science and EngineeringUniversity of the Sunshine CoastSippy DownsQLDAustralia
| | - Simon A. Lawson
- Forest Industries Research CentreUniversity of the Sunshine CoastSippy DownsQLDAustralia
| |
Collapse
|
20
|
Normandin E, Solomon IH, Zamirpour S, Lemieux J, Freije CA, Mukerji SS, Tomkins-Tinch C, Park D, Sabeti PC, Piantadosi A. Powassan Virus Neuropathology and Genomic Diversity in Patients With Fatal Encephalitis. Open Forum Infect Dis 2020; 7:ofaa392. [PMID: 33094116 PMCID: PMC7566439 DOI: 10.1093/ofid/ofaa392] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2020] [Accepted: 08/27/2020] [Indexed: 01/09/2023] Open
Abstract
Background Powassan virus (POWV) is an emerging cause of severe encephalitis; very little is known about human pathogenicity due to challenges in diagnosis and viral RNA recovery. We present 3 patients with fatal encephalitis due to POWV lineage II (deer tick virus). Methods We obtained 27 unique samples, including from brain biopsy and autopsy, and used metagenomic sequencing, quantitative reverse transcriptase polymerase chain reaction, and a newly developed CRISPR-based diagnostic assay to perform the first detailed characterization of POWV compartmentalization and genomics between and within human subjects. Results In all 3 patients, imaging and histopathology findings were notable for profound cerebellar involvement. All patients were initially diagnosed with POWV by metagenomic sequencing, and 2 of the 3 had negative clinical testing by serology. We detected POWV RNA in 13 clinical samples; levels were highest in the cerebellum, and there was very little involvement of peripheral tissue. We assembled complete POWV genomes from 8 samples, providing unique information about the strains of POWV lineage II (deer tick virus) that infect humans. Conclusions We demonstrate the utility of molecular assays for detecting POWV infection, including in seronegative patients, and nominate viral genomic features that may relate to human infection and neuropathogenicity. The cerebellum was identified as a key target POWV in fatal infection, by radiological and histopathological findings as well as molecular testing.
Collapse
Affiliation(s)
- Erica Normandin
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA.,Systems Biology, Harvard Medical School, Boston, Massachusetts, USA
| | - Isaac H Solomon
- Department of Pathology, Brigham and Women's Hospital, Boston, Massachusetts, USA
| | - Siavash Zamirpour
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA.,Harvard College, Cambridge, Massachusetts, USA
| | - Jacob Lemieux
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA.,Division of Infectious Diseases, Massachusetts General Hospital, Boston, Massachusetts, USA
| | - Catherine A Freije
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA.,PhD Program in Virology, Division of Medical Sciences, Harvard University, Boston, Massachusetts, USA
| | - Shibani S Mukerji
- Department of Neurology, Massachusetts General Hospital, Boston, Massachusetts, USA
| | - Christopher Tomkins-Tinch
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA.,Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts, USA
| | - Daniel Park
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
| | - Pardis C Sabeti
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA.,Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts, USA.,Department of Immunology and Infectious Disease, Harvard School of Public Health, Boston, Massachusetts, USA.,Howard Hughes Medical Institute, Chevy Chase, Maryland, USA
| | - Anne Piantadosi
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA.,Division of Infectious Diseases, Massachusetts General Hospital, Boston, Massachusetts, USA.,Emory University School of Medicine, Atlanta, Georgia, USA
| |
Collapse
|
21
|
Matyášek R, Kovařík A. Mutation Patterns of Human SARS-CoV-2 and Bat RaTG13 Coronavirus Genomes Are Strongly Biased Towards C>U Transitions, Indicating Rapid Evolution in Their Hosts. Genes (Basel) 2020; 11:E761. [PMID: 32646049 PMCID: PMC7397057 DOI: 10.3390/genes11070761] [Citation(s) in RCA: 64] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2020] [Revised: 06/22/2020] [Accepted: 06/29/2020] [Indexed: 12/17/2022] Open
Abstract
The pandemic caused by the spread of SARS-CoV-2 has led to considerable interest in its evolutionary origin and genome structure. Here, we analyzed mutation patterns in 34 human SARS-CoV-2 isolates and a closely related RaTG13 isolated from Rhinolophus affinis (a horseshoe bat). We also evaluated the CpG dinucleotide contents in SARS-CoV-2 and other human and animal coronavirus genomes. Out of 1136 single nucleotide variations (~4% divergence) between human SARS-CoV-2 and bat RaTG13, 682 (60%) can be attributed to C>U and U>C substitutions, far exceeding other types of substitutions. An accumulation of C>U mutations was also observed in SARS-CoV2 variants that arose within the human population. Globally, the C>U substitutions increased the frequency of codons for hydrophobic amino acids in SARS-CoV-2 peptides, while U>C substitutions decreased it. In contrast to most other coronaviruses, both SARS-CoV-2 and RaTG13 exhibited CpG depletion in their genomes. The data suggest that C-to-U conversion mediated by C deamination played a significant role in the evolution of the SARS-CoV-2 coronavirus. We hypothesize that the high frequency C>U transitions reflect virus adaptation processes in their hosts, and that SARS-CoV-2 could have been evolving for a relatively long period in humans following the transfer from animals before spreading worldwide.
Collapse
Affiliation(s)
| | - Aleš Kovařík
- Laboratory of Molecular Epigenetics, Institute of Biophysics, Academy of Sciences of the Czech Republic, Královopolská 135, 61265 Brno, Czech Republic;
| |
Collapse
|
22
|
Abstract
Influenza viruses rapidly diversify within individual human infections. Several recent studies have deep-sequenced clinical influenza infections to identify viral variation within hosts, but it remains unclear how within-host mutations fare at the between-host scale. Here, we compare the genetic variation of H3N2 influenza within and between hosts to link viral evolutionary dynamics across scales. Synonymous sites evolve at similar rates at both scales, indicating that global evolution at these putatively neutral sites results from the accumulation of within-host variation. However, nonsynonymous mutations are depleted between hosts compared to within hosts, suggesting that selection purges many of the protein-altering changes that arise within hosts. The exception is at antigenic sites, where selection detectably favors nonsynonymous mutations at the global scale, but not within hosts. These results suggest that selection against deleterious mutations and selection for antigenic change are the main forces that act on within-host variants of influenza virus as they transmit and circulate between hosts.
Collapse
Affiliation(s)
- Katherine S Xue
- Department of Genome Sciences, University of Washington, Foege Building S-250, Box 3550653720 15th Ave NE, Seattle WA 98195-5065, USA.,Basic Sciences Division, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave N, Seattle, WA 98109-1024, USA.,Department of Biology, Stanford University, Stanford, CA, USA
| | - Jesse D Bloom
- Department of Genome Sciences, University of Washington, Foege Building S-250, Box 3550653720 15th Ave NE, Seattle WA 98195-5065, USA.,Basic Sciences Division, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave N, Seattle, WA 98109-1024, USA.,Howard Hughes Medical Institute, 1100 Fairview Ave N, Seattle, WA 98109-1024, USA
| |
Collapse
|
23
|
Su Q, Chen Y, Li M, Ma J, Wang B, Luo J, He H. Genetic Characterization and Molecular Evolution of Urban Seoul Virus in Southern China. Viruses 2019; 11:v11121137. [PMID: 31835357 PMCID: PMC6950471 DOI: 10.3390/v11121137] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2019] [Revised: 12/06/2019] [Accepted: 12/07/2019] [Indexed: 01/03/2023] Open
Abstract
Seoul virus (SEOV), which causes hemorrhagic fever with renal syndrome (HFRS) in humans, has spread all over the world, especially in mainland China. Understanding basic mechanisms of SEOV evolution is essential to better combat and prevent viral diseases. Here, we examined SEOV prevalence and evolution in the residential area of four districts in Guangzhou city, China. The carriage of SEOV was observed in 33.33% of the sampled rodents, with 35.96% of the sampled Rattus norvegicus and 13.33% of R. tanezumi. Based on the comprehensive analyses of large (L), medium (M), and small (S) segments, our study first demonstrated that the genetic characterization of urban SEOV was shaped by high nucleotide substitution rates, purifying selection, and recombination. Additionally, we detected mutational saturation in the S segment of SEOV, which may lead to the biases of genetic divergence and substitution rates in our study. Importantly, we have filled the gap of SEOV evolution in the urban area. The genetic variation of SEOV may highlight the risk of HFRS, which merits further investigation.
Collapse
Affiliation(s)
- Qianqian Su
- National Research Center for Wildlife-Borne Diseases, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China; (Q.S.); (M.L.); (J.M.); (B.W.); (J.L.)
- University of Chinese Academy of Sciences, Beijing 100101, China;
| | - Yi Chen
- University of Chinese Academy of Sciences, Beijing 100101, China;
- State Key Laboratory of Integrated Management of Pest Insects and Rodents in Agriculture, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
| | - Meng Li
- National Research Center for Wildlife-Borne Diseases, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China; (Q.S.); (M.L.); (J.M.); (B.W.); (J.L.)
| | - Jiajun Ma
- National Research Center for Wildlife-Borne Diseases, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China; (Q.S.); (M.L.); (J.M.); (B.W.); (J.L.)
| | - Bo Wang
- National Research Center for Wildlife-Borne Diseases, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China; (Q.S.); (M.L.); (J.M.); (B.W.); (J.L.)
- University of Chinese Academy of Sciences, Beijing 100101, China;
| | - Jing Luo
- National Research Center for Wildlife-Borne Diseases, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China; (Q.S.); (M.L.); (J.M.); (B.W.); (J.L.)
| | - Hongxuan He
- National Research Center for Wildlife-Borne Diseases, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China; (Q.S.); (M.L.); (J.M.); (B.W.); (J.L.)
- Correspondence:
| |
Collapse
|
24
|
The Effect of Sample Bias and Experimental Artefacts on the Statistical Phylogenetic Analysis of Picornaviruses. Viruses 2019; 11:v11111032. [PMID: 31698764 PMCID: PMC6893659 DOI: 10.3390/v11111032] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2019] [Revised: 11/04/2019] [Accepted: 11/04/2019] [Indexed: 12/05/2022] Open
Abstract
Statistical phylogenetic methods are a powerful tool for inferring the evolutionary history of viruses through time and space. The selection of mathematical models and analysis parameters has a major impact on the outcome, and has been relatively well-described in the literature. The preparation of a sequence dataset is less formalized, but its impact can be even more profound. This article used simulated datasets of enterovirus sequences to evaluate the effect of sample bias on picornavirus phylogenetic studies. Possible approaches to the reduction of large datasets and their potential for introducing additional artefacts were demonstrated. The most consistent results were obtained using “smart sampling”, which reduced sequence subsets from large studies more than those from smaller ones in order to preserve the rare sequences in a dataset. The effect of sequences with technical or annotation errors in the Bayesian framework was also analyzed. Sequences with about 0.5% sequencing errors or incorrect isolation dates altered by just 5 years could be detected by various approaches, but the efficiency of identification depended upon sequence position in a phylogenetic tree. Even a single erroneous sequence could profoundly destabilize the whole analysis by increasing the variance of the inferred evolutionary parameters.
Collapse
|
25
|
Empirical design of a variant quality control pipeline for whole genome sequencing data using replicate discordance. Sci Rep 2019; 9:16156. [PMID: 31695094 PMCID: PMC6834861 DOI: 10.1038/s41598-019-52614-7] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2019] [Accepted: 10/18/2019] [Indexed: 12/29/2022] Open
Abstract
The success of next-generation sequencing depends on the accuracy of variant calls. Few objective protocols exist for QC following variant calling from whole genome sequencing (WGS) data. After applying QC filtering based on Genome Analysis Tool Kit (GATK) best practices, we used genotype discordance of eight samples that were sequenced twice each to evaluate the proportion of potentially inaccurate variant calls. We designed a QC pipeline involving hard filters to improve replicate genotype concordance, which indicates improved accuracy of genotype calls. Our pipeline analyzes the efficacy of each filtering step. We initially applied this strategy to well-characterized variants from the ClinVar database, and subsequently to the full WGS dataset. The genome-wide biallelic pipeline removed 82.11% of discordant and 14.89% of concordant genotypes, and improved the concordance rate from 98.53% to 99.69%. The variant-level read depth filter most improved the genome-wide biallelic concordance rate. We also adapted this pipeline for triallelic sites, given the increasing proportion of multiallelic sites as sample sizes increase. For triallelic sites containing only SNVs, the concordance rate improved from 97.68% to 99.80%. Our QC pipeline removes many potentially false positive calls that pass in GATK, and may inform future WGS studies prior to variant effect analysis.
Collapse
|
26
|
Lewitus E, Rolland M. A non-parametric analytic framework for within-host viral phylogenies and a test for HIV-1 founder multiplicity. Virus Evol 2019; 5:vez044. [PMID: 31700680 PMCID: PMC6826062 DOI: 10.1093/ve/vez044] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023] Open
Abstract
Phylogenetics is a powerful tool for understanding the diversification dynamics of viral pathogens. Here we present an extension of the spectral density profile of the modified graph Laplacian, which facilitates the characterization of within-host molecular evolution of viruses and the direct comparison of diversification dynamics between hosts. This approach is non-parametric and therefore fast and model-free. We used simulations of within-host evolutionary scenarios to evaluate the efficiency of our approach and to demonstrate the significance of interpreting a viral phylogeny by its spectral density profile in terms of diversification dynamics. The key features that are captured by the profile are positive selection on the viral gene (or genome), temporal changes in substitution rates, mutational fitness, and time between sampling. Using sequences from individuals infected with HIV-1, we showed the utility of this approach for characterizing within-host diversification dynamics, for comparing dynamics between hosts, and for charting disease progression in infected individuals sampled over multiple years. We furthermore propose a heuristic test for assessing founder heterogeneity, which allows us to classify infections with single and multiple HIV-1 founder viruses. This non-parametric approach can be a valuable complement to existing parametric approaches.
Collapse
Affiliation(s)
- Eric Lewitus
- U.S. Military HIV Research Program (MHRP), WRAIR, 503 Robert Grant Avenue, Silver Spring, MD, USA.,Henry M. Jackson Foundation for the Advancement of Military Medicine, Inc., 6720A Rockledge Dr, Bethesda, MD, USA
| | - Morgane Rolland
- U.S. Military HIV Research Program (MHRP), WRAIR, 503 Robert Grant Avenue, Silver Spring, MD, USA.,Henry M. Jackson Foundation for the Advancement of Military Medicine, Inc., 6720A Rockledge Dr, Bethesda, MD, USA
| |
Collapse
|
27
|
Talla V, Soler L, Kawakami T, Dincă V, Vila R, Friberg M, Wiklund C, Backström N. Dissecting the Effects of Selection and Mutation on Genetic Diversity in Three Wood White (Leptidea) Butterfly Species. Genome Biol Evol 2019; 11:2875-2886. [PMID: 31580421 PMCID: PMC6795238 DOI: 10.1093/gbe/evz212] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/26/2019] [Indexed: 12/12/2022] Open
Abstract
The relative role of natural selection and genetic drift in evolution is a major topic of debate in evolutionary biology. Most knowledge spring from a small group of organisms and originate from before it was possible to generate genome-wide data on genetic variation. Hence, it is necessary to extend to a larger number of taxonomic groups, descriptive and hypothesis-based research aiming at understanding the proximate and ultimate mechanisms underlying both levels of genetic polymorphism and the efficiency of natural selection. In this study, we used data from 60 whole-genome resequenced individuals of three cryptic butterfly species (Leptidea sp.), together with novel gene annotation information and population recombination data. We characterized the overall prevalence of natural selection and investigated the effects of mutation and linked selection on regional variation in nucleotide diversity. Our analyses showed that genome-wide diversity and rate of adaptive substitutions were comparatively low, whereas nonsynonymous to synonymous polymorphism and substitution levels were comparatively high in Leptidea, suggesting small long-term effective population sizes. Still, negative selection on linked sites (background selection) has resulted in reduced nucleotide diversity in regions with relatively high gene density and low recombination rate. We also found a significant effect of mutation rate variation on levels of polymorphism. Finally, there were considerable population differences in levels of genetic diversity and pervasiveness of selection against slightly deleterious alleles, in line with expectations from differences in estimated effective population sizes.
Collapse
Affiliation(s)
- Venkat Talla
- Department of Evolutionary Biology, Evolutionary Biology Centre (EBC), Uppsala University, Sweden
| | - Lucile Soler
- Department of Medical Biochemistry and Microbiology, National Bioinformatics Infrastructure Sweden (NBIS), Science for Life Laboratory, Uppsala, Sweden
| | - Takeshi Kawakami
- Department of Evolutionary Biology, Evolutionary Biology Centre (EBC), Uppsala University, Sweden
| | - Vlad Dincă
- Department of Ecology and Genetics, University of Oulu, Finland
| | - Roger Vila
- Institut de Biologia Evolutiva (CSIC-UPF), Barcelona, Spain
| | - Magne Friberg
- Department of Biology, Biodiversity Unit, Lund University, Sweden
| | - Christer Wiklund
- Department of Zoology, Division of Ecology, Stockholm University, Sweden
| | - Niclas Backström
- Department of Evolutionary Biology, Evolutionary Biology Centre (EBC), Uppsala University, Sweden
| |
Collapse
|
28
|
|
29
|
Olabode AS, Avino M, Ng GT, Abu-Sardanah F, Dick DW, Poon AFY. Evidence for a recombinant origin of HIV-1 Group M from genomic variation. Virus Evol 2019; 5:vey039. [PMID: 30687518 PMCID: PMC6342232 DOI: 10.1093/ve/vey039] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
Reconstructing the early dynamics of the HIV-1 pandemic can provide crucial insights into the socioeconomic drivers of emerging infectious diseases in human populations, including the roles of urbanization and transportation networks. Current evidence indicates that the global pandemic comprising almost entirely of HIV-1/M originated around the 1920s in central Africa. However, these estimates are based on molecular clock estimates that are assumed to apply uniformly across the virus genome. There is growing evidence that recombination has played a significant role in the early history of the HIV-1 pandemic, such that different regions of the HIV-1 genome have different evolutionary histories. In this study, we have conducted a dated-tip analysis of all near full-length HIV-1/M genome sequences that were published in the GenBank database. We used a sliding window approach similar to the 'bootscanning' method for detecting breakpoints in inter-subtype recombinant sequences. We found evidence of substantial variation in estimated root dates among windows, with an estimated mean time to the most recent common ancestor of 1922. Estimates were significantly autocorrelated, which was more consistent with an early recombination event than with stochastic error variation in phylogenetic reconstruction and dating analyses. A piecewise regression analysis supported the existence of at least one recombination breakpoint in the HIV-1/M genome with interval-specific means around 1929 and 1913, respectively. This analysis demonstrates that a sliding window approach can accommodate early recombination events outside the established nomenclature of HIV-1/M subtypes, although it is difficult to incorporate the earliest available samples due to their limited genome coverage.
Collapse
Affiliation(s)
- Abayomi S Olabode
- Department of Pathology & Laboratory Medicine, Western University, London, Ontario, Canada
| | - Mariano Avino
- Department of Pathology & Laboratory Medicine, Western University, London, Ontario, Canada
| | - Garway T Ng
- Department of Pathology & Laboratory Medicine, Western University, London, Ontario, Canada
| | - Faisal Abu-Sardanah
- Department of Pathology & Laboratory Medicine, Western University, London, Ontario, Canada
| | - David W Dick
- Department of Applied Mathematics, Western University, London, Ontario, Canada
| | - Art F Y Poon
- Department of Pathology & Laboratory Medicine, Western University, London, Ontario, Canada.,Department of Applied Mathematics, Western University, London, Ontario, Canada.,Department of Microbiology & Immunology, Western University, London, Ontario, Canada
| |
Collapse
|
30
|
Osorio-Guarín JA, Quackenbush CR, Cornejo OE. Ancestry informative alleles captured with reduced representation library sequencing in Theobroma cacao. PLoS One 2018; 13:e0203973. [PMID: 30332408 PMCID: PMC6192562 DOI: 10.1371/journal.pone.0203973] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2018] [Accepted: 10/04/2018] [Indexed: 12/18/2022] Open
Abstract
As the source of chocolate, cacao has become one of the most important crops in the world. The identification of molecular markers to understand the demographic history, genetic diversity and population structure plays a pivotal role in cacao breeding programs. Here, we report the use of a modified genotyping-by-sequencing (GBS) approach for large-scale single nucleotide polymorphism (SNP) discovery and allele ancestry mapping. We identified 12,357 bi-allelic SNPs after filtering, of which, 7,009 variants were ancestry informative. The GBS approach proved to be rapid, cost-effective, and highly informative for ancestry assignment in this species.
Collapse
Affiliation(s)
- Jaime A. Osorio-Guarín
- Centro de Investigación Tibaitatá, Corporación Colombiana de Investigación Agropecuaria–Agrosavia, Mosquera, Cundinamarca, Colombia
| | - Corey R. Quackenbush
- School of Biological Sciences, Washington State University, Pullman, WA, United States of America
| | - Omar E. Cornejo
- School of Biological Sciences, Washington State University, Pullman, WA, United States of America
| |
Collapse
|
31
|
Iwanowicz LR, Iwanowicz DD, Adams CR, Lewis TD, Brandt TM, Sanders LR, Cornman RS. Isolation, characterization and molecular identification of a novel aquareovirus that infects endangered fountain darter Etheostoma fonticola. DISEASES OF AQUATIC ORGANISMS 2018; 130:95-108. [PMID: 30198485 DOI: 10.3354/dao03261] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
The fountain darter Etheostoma fonticola (FOD) is a federally endangered fish listed under the US Endangered Species Act. Here, we identified and characterized a novel aquareovirus isolated from wild fountain darters inhabiting the San Marcos River. This virus was propagated in Chinook salmon embryo (CHSE)-214, rainbow trout gonad-2 and fathead minnow cells at 15°C. The epithelioma papulosum cyprini cell line was refractory at all temperatures evaluated. High throughput sequencing technologies facilitated the complete genome sequencing of this virus utilizing ribosomal RNA-depleted RNA extracted from infected CHSE-214 cells. Conventional PCR primer sets were developed for the detection and confirmation of this virus to assist diagnostic screening methods. Phylogenetic analysis suggests this virus belongs to the Aquareovirus A genus. This research provides requisite initial data critical to support hatchery and refugia biosecurity measures for this endangered species.
Collapse
Affiliation(s)
- L R Iwanowicz
- US Geological Survey, Leetown Science Center, 11649 Leetown Road, Kearneysville, WV 25430, USA
| | | | | | | | | | | | | |
Collapse
|
32
|
Evolution of Human Respiratory Syncytial Virus (RSV) over Multiple Seasons in New South Wales, Australia. Viruses 2018; 10:v10090476. [PMID: 30200580 PMCID: PMC6164696 DOI: 10.3390/v10090476] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2018] [Revised: 09/01/2018] [Accepted: 09/05/2018] [Indexed: 01/06/2023] Open
Abstract
There is an ongoing global pandemic of human respiratory syncytial virus (RSV) infection that results in substantial annual morbidity and mortality. In Australia, RSV is a major cause of acute lower respiratory tract infections (ALRI). Nevertheless, little is known about the extent and origins of the genetic diversity of RSV in Australia, nor the factors that shape this diversity. We have conducted a genome-scale analysis of RSV infections in New South Wales (NSW). RSV genomes were successfully sequenced for 144 specimens collected between 2010⁻2016. Of these, 64 belonged to the RSVA and 80 to the RSVB subtype. Phylogenetic analysis revealed a wide diversity of RSV lineages within NSW and that both subtypes evolved rapidly in a strongly clock-like manner, with mean rates of approximately 6⁻8 × 10-4 nucleotide substitutions per site per year. There was only weak evidence for geographic clustering of sequences, indicative of fluid patterns of transmission within the infected population and no evidence of any clustering by patient age such that viruses in the same lineages circulate through the entire host population. Importantly, we show that both subtypes circulated concurrently in NSW with multiple introductions into the Australian population in each year and only limited evidence for multi-year persistence.
Collapse
|
33
|
Lyons DM, Lauring AS. Mutation and Epistasis in Influenza Virus Evolution. Viruses 2018; 10:E407. [PMID: 30081492 PMCID: PMC6115771 DOI: 10.3390/v10080407] [Citation(s) in RCA: 64] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2018] [Revised: 07/30/2018] [Accepted: 07/30/2018] [Indexed: 12/25/2022] Open
Abstract
Influenza remains a persistent public health challenge, because the rapid evolution of influenza viruses has led to marginal vaccine efficacy, antiviral resistance, and the annual emergence of novel strains. This evolvability is driven, in part, by the virus's capacity to generate diversity through mutation and reassortment. Because many new traits require multiple mutations and mutations are frequently combined by reassortment, epistatic interactions between mutations play an important role in influenza virus evolution. While mutation and epistasis are fundamental to the adaptability of influenza viruses, they also constrain the evolutionary process in important ways. Here, we review recent work on mutational effects and epistasis in influenza viruses.
Collapse
Affiliation(s)
- Daniel M Lyons
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI 48109, USA.
| | - Adam S Lauring
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI 48109, USA.
- Division of Infectious Diseases, Department of Internal Medicine, University of Michigan, Ann Arbor, MI 48109, USA.
- Department of Microbiology and Immunology, University of Michigan, Ann Arbor, MI 48109, USA.
| |
Collapse
|
34
|
Lyons DM, Lauring AS. Evidence for the Selective Basis of Transition-to-Transversion Substitution Bias in Two RNA Viruses. Mol Biol Evol 2018; 34:3205-3215. [PMID: 29029187 PMCID: PMC5850290 DOI: 10.1093/molbev/msx251] [Citation(s) in RCA: 72] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
The substitution rates of transitions are higher than expected by chance relative to those of transversions. Many have argued that selection disfavors transversions, as nonsynonymous transversions are less likely to conserve biochemical properties of the original amino acid. Only recently has it become feasible to directly test this selective hypothesis by comparing the fitness effects of a large number of transition and transversion mutations. For example, a recent study of six viruses and one beta-lactamase gene did not find evidence supporting the selective hypothesis. Here, we analyze the relative fitness effects of transition and transversion mutations from our recently published genome-wide study of mutational fitness effects in influenza virus. In contrast to prior work, we find that transversions are significantly more detrimental than transitions. Using what we believe to be an improved statistical framework, we also identify a similar trend in two HIV data sets. We further demonstrate a fitness difference in transition and transversion mutations using four deep mutational scanning data sets of influenza virus and HIV, which provided adequate statistical power. We find that three of the most commonly cited radical/conservative amino acid categories are predictive of fitness, supporting their utility in studies of positive selection and codon usage bias. We conclude that selection is a major contributor to the transition:transversion substitution bias in viruses and that this effect is only partially explained by the greater likelihood of transversion mutations to cause radical as opposed to conservative amino acid changes.
Collapse
Affiliation(s)
- Daniel M Lyons
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI
| | - Adam S Lauring
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI.,Department of Microbiology and Immunology, University of Michigan, Ann Arbor, MI.,Division of Infectious Diseases, Department of Internal Medicine, University of Michigan, Ann Arbor, MI
| |
Collapse
|
35
|
Sánchez-Campos S, Domínguez-Huerta G, Díaz-Martínez L, Tomás DM, Navas-Castillo J, Moriones E, Grande-Pérez A. Differential Shape of Geminivirus Mutant Spectra Across Cultivated and Wild Hosts With Invariant Viral Consensus Sequences. FRONTIERS IN PLANT SCIENCE 2018; 9:932. [PMID: 30013589 PMCID: PMC6036239 DOI: 10.3389/fpls.2018.00932] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/07/2018] [Accepted: 06/11/2018] [Indexed: 05/12/2023]
Abstract
Geminiviruses (family Geminiviridae) possess single-stranded circular DNA genomes that are replicated by cellular polymerases in plant host cell nuclei. In their hosts, geminivirus populations behave as ensembles of mutant and recombinant genomes, known as viral quasispecies. This favors the emergence of new geminiviruses with altered host range, facilitating new or more severe diseases or overcoming resistance traits. In warm and temperate areas several whitefly-transmitted geminiviruses of the genus Begomovirus cause the tomato yellow leaf curl disease (TYLCD) with significant economic consequences. TYLCD is frequently controlled in commercial tomatoes by using the dominant Ty-1 resistance gene. Over a 45 day period we have studied the diversification of three begomoviruses causing TYLCD: tomato yellow leaf curl virus (TYLCV), tomato yellow leaf curl Sardinia virus (TYLCSV) and tomato yellow leaf curl Malaga virus (TYLCMaV, a natural recombinant between TYLCV and TYLCSV). Viral quasispecies resulting from inoculation of geminivirus infectious clones were examined in plants of susceptible tomato (ty-1/ty-1), heterozygous resistant tomato (Ty-1/ty-1), common bean, and the wild reservoir Solanum nigrum. Differences in virus fitness across hosts were observed while viral consensus sequences remained invariant. However, the complexity and heterogeneity of the quasispecies were high, especially in common bean and the wild host. Interestingly, the presence or absence of the Ty-1 allele in tomato did not lead to differences in begomovirus mutant spectra. However, the fitness decrease of TYLCSV and TYLCV in tomato at 45 dpi might be related to an increase in CP (Coat protein) mutation frequency. In Solanum nigrum the recombinant TYLCMaV, which showed lower fitness than TYLCSV, at 45 dpi actively explored Rep (Replication associated protein) ORF but not the overlapping C4. Our results underline the importance of begomovirus mutant spectra during infections. This is especially relevant in the wild reservoir of the viruses, which has the potential to maintain highly diverse mutant spectra without modifying their consensus sequences.
Collapse
Affiliation(s)
- Sonia Sánchez-Campos
- Instituto de Hortofruticultura Subtropical y Mediterránea “La Mayora,” Consejo Superior de Investigaciones Científicas-Universidad de Málaga, Estación Experimental “La Mayora,” Algarrobo-Costa, Málaga, Spain
| | - Guillermo Domínguez-Huerta
- Instituto de Hortofruticultura Subtropical y Mediterránea “La Mayora,” Consejo Superior de Investigaciones Científicas-Universidad de Málaga, Estación Experimental “La Mayora,” Algarrobo-Costa, Málaga, Spain
- Instituto de Hortofruticultura Subtropical y Mediterránea “La Mayora,” Consejo Superior de Investigaciones Científicas-Universidad de Málaga, Área de Genética, Facultad de Ciencias, Campus de Teatinos, Málaga, Spain
| | - Luis Díaz-Martínez
- Instituto de Hortofruticultura Subtropical y Mediterránea “La Mayora,” Consejo Superior de Investigaciones Científicas-Universidad de Málaga, Área de Genética, Facultad de Ciencias, Campus de Teatinos, Málaga, Spain
| | - Diego M. Tomás
- Instituto de Hortofruticultura Subtropical y Mediterránea “La Mayora,” Consejo Superior de Investigaciones Científicas-Universidad de Málaga, Estación Experimental “La Mayora,” Algarrobo-Costa, Málaga, Spain
| | - Jesús Navas-Castillo
- Instituto de Hortofruticultura Subtropical y Mediterránea “La Mayora,” Consejo Superior de Investigaciones Científicas-Universidad de Málaga, Estación Experimental “La Mayora,” Algarrobo-Costa, Málaga, Spain
| | - Enrique Moriones
- Instituto de Hortofruticultura Subtropical y Mediterránea “La Mayora,” Consejo Superior de Investigaciones Científicas-Universidad de Málaga, Estación Experimental “La Mayora,” Algarrobo-Costa, Málaga, Spain
| | - Ana Grande-Pérez
- Instituto de Hortofruticultura Subtropical y Mediterránea “La Mayora,” Consejo Superior de Investigaciones Científicas-Universidad de Málaga, Área de Genética, Facultad de Ciencias, Campus de Teatinos, Málaga, Spain
| |
Collapse
|
36
|
Raghwani J, Redd AD, Longosz AF, Wu CH, Serwadda D, Martens C, Kagaayi J, Sewankambo N, Porcella SF, Grabowski MK, Quinn TC, Eller MA, Eller LA, Wabwire-Mangen F, Robb ML, Fraser C, Lythgoe KA. Evolution of HIV-1 within untreated individuals and at the population scale in Uganda. PLoS Pathog 2018; 14:e1007167. [PMID: 30052678 PMCID: PMC6082572 DOI: 10.1371/journal.ppat.1007167] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2017] [Revised: 08/08/2018] [Accepted: 06/20/2018] [Indexed: 12/15/2022] Open
Abstract
HIV-1 undergoes multiple rounds of error-prone replication between transmission events, resulting in diverse viral populations within and among individuals. In addition, the virus experiences different selective pressures at multiple levels: during the course of infection, at transmission, and among individuals. Disentangling how these evolutionary forces shape the evolution of the virus at the population scale is important for understanding pathogenesis, how drug- and immune-escape variants are likely to spread in populations, and the development of preventive vaccines. To address this, we deep-sequenced two regions of the HIV-1 genome (p24 and gp41) from 34 longitudinally-sampled untreated individuals from Rakai District in Uganda, infected with subtypes A, D, and inter-subtype recombinants. This dataset substantially increases the availability of HIV-1 sequence data that spans multiple years of untreated infection, in particular for different geographical regions and viral subtypes. In line with previous studies, we estimated an approximately five-fold faster rate of evolution at the within-host compared to the population scale for both synonymous and nonsynonymous substitutions, and for all subtypes. We determined the extent to which this mismatch in evolutionary rates can be explained by the evolution of the virus towards population-level consensus, or the transmission of viruses similar to those that establish infection within individuals. Our findings indicate that both processes are likely to be important.
Collapse
Affiliation(s)
- Jayna Raghwani
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, Nuffield Department of Medicine, University of Oxford, Oxford, United Kingdom
- Department of Zoology, Peter Medawar Building, University of Oxford, Oxford, United Kingdom
| | - Andrew D. Redd
- Laboratory of Immunoregulation, Division of Intramural Research, NIAID, NIH, Baltimore MD, United States of America
- Department of Medicine, Johns Hopkins Medical Institute, Johns Hopkins University, Baltimore MD, United States of America
| | - Andrew F. Longosz
- Laboratory of Immunoregulation, Division of Intramural Research, NIAID, NIH, Baltimore MD, United States of America
| | - Chieh-Hsi Wu
- Department of Statistics, University of Oxford, Oxford, United Kingdom
| | - David Serwadda
- Rakai Health Sciences Program, Kalisizo, Uganda
- School of Public Health, Makerere University, Kampala, Uganda
| | - Craig Martens
- Genomics Unit, RTS, RTB, Rocky Mountain Laboratories, Division of Intramural Research, NIAID, NIH, Hamilton MT, United States of America
| | | | - Nelson Sewankambo
- Rakai Health Sciences Program, Kalisizo, Uganda
- School of Medicine, Makerere University, Kampala, Uganda
| | - Stephen F. Porcella
- Genomics Unit, RTS, RTB, Rocky Mountain Laboratories, Division of Intramural Research, NIAID, NIH, Hamilton MT, United States of America
| | - Mary K. Grabowski
- Department of Pathology, Johns Hopkins Medical Institute, Johns Hopkins University, Baltimore, MD, United States of America
| | - Thomas C. Quinn
- Laboratory of Immunoregulation, Division of Intramural Research, NIAID, NIH, Baltimore MD, United States of America
- Department of Medicine, Johns Hopkins Medical Institute, Johns Hopkins University, Baltimore MD, United States of America
| | - Michael A. Eller
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, United States of America
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, United States of America
| | - Leigh Anne Eller
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, United States of America
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, United States of America
| | - Fred Wabwire-Mangen
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, United States of America
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, United States of America
| | - Merlin L. Robb
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, United States of America
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, United States of America
| | - Christophe Fraser
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, Nuffield Department of Medicine, University of Oxford, Oxford, United Kingdom
| | - Katrina A. Lythgoe
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, Nuffield Department of Medicine, University of Oxford, Oxford, United Kingdom
- Department of Zoology, Peter Medawar Building, University of Oxford, Oxford, United Kingdom
| |
Collapse
|
37
|
Abstract
Pithovirus sibericum is a giant (610 Kpb) double-stranded DNA virus discovered in a purportedly 30,000-year-old permafrost sample. A closely related virus, Pithovirus massiliensis, was recently isolated from a sewer in southern France. An initial comparison of these two virus genomes assumed that P. sibericum was directly ancestral to P. massiliensis and gave a maximum evolutionary rate of 2.60 × 10-5 nucleotide substitutions per site per year (subs/site/year). If correct, this would make pithoviruses among the fastest-evolving DNA viruses, with rates close to those seen in some RNA viruses. To help determine whether this unusually high rate is accurate we utilized the well-known negative association between evolutionary rate and genome size in DNA microbes. This revealed that a more plausible rate estimate for Pithovirus evolution is ∼2.23 × 10-6 subs/site/year, with even lower estimates obtained if evolutionary rates are assumed to be time-dependent. Hence, we estimate that Pithovirus has evolved at least an order of magnitude more slowly than previously suggested. We then used our new rate estimates to infer a time-scale for Pithovirus evolution. Strikingly, this suggests that these viruses could have diverged at least hundreds of thousands of years ago, and hence have evolved over longer time-scales than previously suggested. We propose that the evolutionary rate and time-scale of pithovirus evolution should be reconsidered in the light of these observations and that future estimates of the rate of giant virus evolution should be carefully examined in the context of their biological plausibility.
Collapse
Affiliation(s)
- Sebastián Duchêne
- Department of Biochemistry and Molecular Biology, Bio21 Molecular Science and Biotechnology Institute, University of Melbourne, Parkville, VIC 3020, Australia
| | - Edward C Holmes
- Marie Bashir Institute of Infectious Diseases and Biosecurity, Charles Perkins Centre, School of Life and Environmental Sciences and Sydney Medical School, University of Sydney, Sydney, NSW 2006, Australia
| |
Collapse
|
38
|
Liu C, Fetterman JL, Liu P, Luo Y, Larson MG, Vasan RS, Zhu J, Levy D. Deep sequencing of the mitochondrial genome reveals common heteroplasmic sites in NADH dehydrogenase genes. Hum Genet 2018; 137:203-213. [PMID: 29423652 DOI: 10.1007/s00439-018-1873-4] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2017] [Accepted: 01/30/2018] [Indexed: 12/17/2022]
Abstract
Increasing evidence implicates mitochondrial dysfunction in aging and age-related conditions. But little is known about the molecular basis for this connection. A possible cause may be mutations in the mitochondrial DNA (mtDNA), which are often heteroplasmic-the joint presence of different alleles at a single locus in the same individual. However, the involvement of mtDNA heteroplasmy in aging and age-related conditions has not been investigated thoroughly. We deep-sequenced the complete mtDNA genomes of 356 Framingham Heart Study participants (52% women, mean age 43, mean coverage 4570-fold), identified 2880 unique mutations and comprehensively annotated them by MITOMAP and PolyPhen-2. We discovered 11 heteroplasmic "hot" spots [NADH dehydrogenase (ND) subunit 1, 4, 5 and 6 genes, n = 7; cytochrome c oxidase I (COI), n = 2; 16S rRNA, n = 1; D-loop, n = 1] for which the alternative-to-reference allele ratios significantly increased with advancing age (Bonferroni correction p < 0.001). Four of these heteroplasmic mutations in ND and COI genes were predicted to be deleterious nonsynonymous mutations which may have direct impact on ATP production. We confirmed previous findings that healthy individuals carry many low-frequency heteroplasmy mutations with potentially deleterious effects. We hypothesize that the effect of a single deleterious heteroplasmy may be minimal due to a low mutant-to-wildtype allele ratio, whereas the aggregate effects of many deleterious mutations may cause changes in mitochondrial function and contribute to age-related diseases. The identification of age-related mtDNA mutations is an important step to understand the genetic architecture of age-related diseases and may uncover novel therapeutic targets for such diseases.
Collapse
Affiliation(s)
- Chunyu Liu
- Population Sciences Branch, NHLBI/NHI, Bethesda, MD, USA. .,Framingham Heart Study, Framingham, MA, USA. .,Department of Biostatistics, Boston University, Boston, MA, USA.
| | | | - Poching Liu
- DNA Sequencing and Genomics Core, NHLBI/NIH, Bethesda, MD, USA
| | - Yan Luo
- DNA Sequencing and Genomics Core, NHLBI/NIH, Bethesda, MD, USA
| | - Martin G Larson
- Framingham Heart Study, Framingham, MA, USA.,Department of Biostatistics, Boston University, Boston, MA, USA
| | - Ramachandran S Vasan
- Framingham Heart Study, Framingham, MA, USA.,School of Medicine, Boston University, Boston, MA, USA
| | - Jun Zhu
- System Biology Center, NHLBI/NHI, Bethesda, MD, USA
| | - Daniel Levy
- Population Sciences Branch, NHLBI/NHI, Bethesda, MD, USA. .,Framingham Heart Study, Framingham, MA, USA.
| |
Collapse
|
39
|
Saxenhofer M, Weber de Melo V, Ulrich RG, Heckel G. Revised time scales of RNA virus evolution based on spatial information. Proc Biol Sci 2018; 284:rspb.2017.0857. [PMID: 28794221 DOI: 10.1098/rspb.2017.0857] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2017] [Accepted: 07/06/2017] [Indexed: 12/18/2022] Open
Abstract
The time scales of pathogen evolution are of major concern in the context of public and veterinary health, epidemiology and evolutionary biology. Dating the emergence of a pathogen often relies on estimates of evolutionary rates derived from nucleotide sequence data. For many viruses, this has yielded estimates of evolutionary origins only a few hundred years in the past. Here we demonstrate through the incorporation of geographical information from virus sampling that evolutionary age estimates of two European hantaviruses are severely underestimated because of pervasive mutational saturation of nucleotide sequences. We detected very strong relationships between spatial distance and genetic divergence for both Puumala and Tula hantavirus-irrespective of whether nucleotide or derived amino acid sequences were analysed. Extrapolations from these relationships dated the emergence of these viruses most conservatively to at least 3700 and 2500 years ago, respectively. Our minimum estimates for the age of these hantaviruses are ten to a hundred times older than results from current non-spatial methods, and in much better accordance with the biogeography of these viruses and their respective hosts. Spatial information can thus provide valuable insights on the deeper time scales of pathogen evolution and improve our understanding of disease emergence.
Collapse
Affiliation(s)
- Moritz Saxenhofer
- Computational and Molecular Population Genetics, Institute of Ecology and Evolution, University of Bern, Bern, Switzerland.,Swiss Institute of Bioinformatics, Quartier Sorge-Bâtiment Génopode, Lausanne, Switzerland
| | - Vanessa Weber de Melo
- Computational and Molecular Population Genetics, Institute of Ecology and Evolution, University of Bern, Bern, Switzerland.,Department of Evolutionary Biology and Environmental Studies, University of Zurich, Zurich, Switzerland
| | - Rainer G Ulrich
- Friedrich-Loeffler-Institut, Federal Research Institute for Animal Health, Institute of Novel and Emerging Infectious Diseases, Greifswald-Insel Riems, Germany.,German Center for Infection Research (DZIF), partner site Hamburg-Luebeck-Borstel-Insel Riems, Germany
| | - Gerald Heckel
- Computational and Molecular Population Genetics, Institute of Ecology and Evolution, University of Bern, Bern, Switzerland .,Swiss Institute of Bioinformatics, Quartier Sorge-Bâtiment Génopode, Lausanne, Switzerland
| |
Collapse
|
40
|
Metsky HC, Matranga CB, Wohl S, Schaffner SF, Freije CA, Winnicki SM, West K, Qu J, Baniecki ML, Gladden-Young A, Lin AE, Tomkins-Tinch CH, Ye SH, Park DJ, Luo CY, Barnes KG, Shah RR, Chak B, Barbosa-Lima G, Delatorre E, Vieira YR, Paul LM, Tan AL, Barcellona CM, Porcelli MC, Vasquez C, Cannons AC, Cone MR, Hogan KN, Kopp EW, Anzinger JJ, Garcia KF, Parham LA, Ramírez RMG, Montoya MCM, Rojas DP, Brown CM, Hennigan S, Sabina B, Scotland S, Gangavarapu K, Grubaugh ND, Oliveira G, Robles-Sikisaka R, Rambaut A, Gehrke L, Smole S, Halloran ME, Villar L, Mattar S, Lorenzana I, Cerbino-Neto J, Valim C, Degrave W, Bozza PT, Gnirke A, Andersen KG, Isern S, Michael SF, Bozza FA, Souza TML, Bosch I, Yozwiak NL, MacInnis BL, Sabeti PC. Zika virus evolution and spread in the Americas. Nature 2017; 546:411-415. [PMID: 28538734 PMCID: PMC5563848 DOI: 10.1038/nature22402] [Citation(s) in RCA: 264] [Impact Index Per Article: 37.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2017] [Accepted: 05/02/2017] [Indexed: 12/22/2022]
Abstract
Although the recent Zika virus (ZIKV) epidemic in the Americas and its link to birth defects have attracted a great deal of attention, much remains unknown about ZIKV disease epidemiology and ZIKV evolution, in part owing to a lack of genomic data. Here we address this gap in knowledge by using multiple sequencing approaches to generate 110 ZIKV genomes from clinical and mosquito samples from 10 countries and territories, greatly expanding the observed viral genetic diversity from this outbreak. We analysed the timing and patterns of introductions into distinct geographic regions; our phylogenetic evidence suggests rapid expansion of the outbreak in Brazil and multiple introductions of outbreak strains into Puerto Rico, Honduras, Colombia, other Caribbean islands, and the continental United States. We find that ZIKV circulated undetected in multiple regions for many months before the first locally transmitted cases were confirmed, highlighting the importance of surveillance of viral infections. We identify mutations with possible functional implications for ZIKV biology and pathogenesis, as well as those that might be relevant to the effectiveness of diagnostic tests.
Collapse
Affiliation(s)
- Hayden C Metsky
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
- Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
| | | | - Shirlee Wohl
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
- Center for Systems Biology, Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts, USA
| | - Stephen F Schaffner
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
- Center for Systems Biology, Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts, USA
- Department of Immunology and Infectious Diseases, Harvard T.H. Chan School of Public Health, Harvard University, Boston, Massachusetts, USA
| | - Catherine A Freije
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
- Center for Systems Biology, Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts, USA
| | - Sarah M Winnicki
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
| | - Kendra West
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
| | - James Qu
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
| | | | | | - Aaron E Lin
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
- Center for Systems Biology, Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts, USA
| | | | - Simon H Ye
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
- Harvard-MIT Health Sciences and Technology, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
| | - Daniel J Park
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
| | - Cynthia Y Luo
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
- Center for Systems Biology, Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts, USA
| | - Kayla G Barnes
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
- Center for Systems Biology, Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts, USA
- Department of Immunology and Infectious Diseases, Harvard T.H. Chan School of Public Health, Harvard University, Boston, Massachusetts, USA
| | - Rickey R Shah
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
- Harvard University Extension School, Cambridge, Massachusetts, USA
| | - Bridget Chak
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
- Center for Systems Biology, Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts, USA
| | - Giselle Barbosa-Lima
- National Institute of Infectious Diseases Evandro Chagas, Fundação Oswaldo Cruz (FIOCRUZ), Rio de Janeiro, Rio de Janeiro, Brazil
| | - Edson Delatorre
- Laboratório de AIDS e Imunologia Molecular, Instituto Oswaldo Cruz, FIOCRUZ, Rio de Janeiro, Rio de Janeiro, Brazil
| | - Yasmine R Vieira
- National Institute of Infectious Diseases Evandro Chagas, Fundação Oswaldo Cruz (FIOCRUZ), Rio de Janeiro, Rio de Janeiro, Brazil
| | - Lauren M Paul
- Department of Biological Sciences, College of Arts and Sciences, Florida Gulf Coast University, Fort Myers, Florida, USA
| | - Amanda L Tan
- Department of Biological Sciences, College of Arts and Sciences, Florida Gulf Coast University, Fort Myers, Florida, USA
| | - Carolyn M Barcellona
- Department of Biological Sciences, College of Arts and Sciences, Florida Gulf Coast University, Fort Myers, Florida, USA
| | | | | | - Andrew C Cannons
- Bureau of Public Health Laboratories, Division of Disease Control and Health Protection, Florida Department of Health, Tampa, Florida, USA
| | - Marshall R Cone
- Bureau of Public Health Laboratories, Division of Disease Control and Health Protection, Florida Department of Health, Tampa, Florida, USA
| | - Kelly N Hogan
- Bureau of Public Health Laboratories, Division of Disease Control and Health Protection, Florida Department of Health, Tampa, Florida, USA
| | - Edgar W Kopp
- Bureau of Public Health Laboratories, Division of Disease Control and Health Protection, Florida Department of Health, Tampa, Florida, USA
| | - Joshua J Anzinger
- Department of Microbiology, The University of the West Indies, Mona, Kingston, Jamaica
| | - Kimberly F Garcia
- Instituto de Investigacion en Microbiologia, Universidad Nacional Autónoma de Honduras, Tegucigalpa, Honduras
| | - Leda A Parham
- Instituto de Investigacion en Microbiologia, Universidad Nacional Autónoma de Honduras, Tegucigalpa, Honduras
| | - Rosa M Gélvez Ramírez
- Grupo de Epidemiología Clínica, Universidad Industrial de Santander, Bucaramanga, Colombia
| | | | - Diana P Rojas
- Department of Epidemiology, College of Public Health and Health Professions, University of Florida, Gainesville, Florida, USA
| | - Catherine M Brown
- Massachusetts Department of Public Health, Jamaica Plain, Massachusetts, USA
| | - Scott Hennigan
- Massachusetts Department of Public Health, Jamaica Plain, Massachusetts, USA
| | - Brandon Sabina
- Massachusetts Department of Public Health, Jamaica Plain, Massachusetts, USA
| | - Sarah Scotland
- Massachusetts Department of Public Health, Jamaica Plain, Massachusetts, USA
| | - Karthik Gangavarapu
- Department of Immunology and Microbial Science, The Scripps Research Institute, La Jolla, California, USA
| | - Nathan D Grubaugh
- Department of Immunology and Microbial Science, The Scripps Research Institute, La Jolla, California, USA
| | - Glenn Oliveira
- Scripps Translational Science Institute, La Jolla, California, USA
| | - Refugio Robles-Sikisaka
- Department of Immunology and Microbial Science, The Scripps Research Institute, La Jolla, California, USA
| | - Andrew Rambaut
- Institute of Evolutionary Biology, University of Edinburgh, Edinburgh EH9 3FL, UK
- Fogarty International Center, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Lee Gehrke
- Institute for Medical Engineering and Science, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
- Department of Microbiology and Immunobiology, Harvard Medical School, Boston, Massachusetts, USA
| | - Sandra Smole
- Massachusetts Department of Public Health, Jamaica Plain, Massachusetts, USA
| | - M Elizabeth Halloran
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
- Department of Biostatistics, University of Washington, Seattle, Washington, USA
| | - Luis Villar
- Grupo de Epidemiología Clínica, Universidad Industrial de Santander, Bucaramanga, Colombia
| | - Salim Mattar
- Institute for Tropical Biology Research, Universidad de Córdoba, Montería, Córdoba, Colombia
| | - Ivette Lorenzana
- Instituto de Investigacion en Microbiologia, Universidad Nacional Autónoma de Honduras, Tegucigalpa, Honduras
| | - Jose Cerbino-Neto
- National Institute of Infectious Diseases Evandro Chagas, Fundação Oswaldo Cruz (FIOCRUZ), Rio de Janeiro, Rio de Janeiro, Brazil
| | - Clarissa Valim
- Department of Immunology and Infectious Diseases, Harvard T.H. Chan School of Public Health, Harvard University, Boston, Massachusetts, USA
- Department of Osteopathic Medical Specialties, Michigan State University, East Lansing, Michegan, USA
| | - Wim Degrave
- FIOCRUZ, Instituto Oswaldo Cruz, Laboratório de Genômica Funcional e Bioinformática, Rio de Janeiro, Rio de Janeiro, Brazil
| | - Patricia T Bozza
- Laboratório de Imunofarmacologia, Instituto Oswaldo Cruz, Fundação Oswaldo Cruz, Rio de Janeiro, Rio de Janeiro, Brazil
| | - Andreas Gnirke
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
| | - Kristian G Andersen
- Department of Immunology and Microbial Science, The Scripps Research Institute, La Jolla, California, USA
- Scripps Translational Science Institute, La Jolla, California, USA
- Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, California, USA
| | - Sharon Isern
- Department of Biological Sciences, College of Arts and Sciences, Florida Gulf Coast University, Fort Myers, Florida, USA
| | - Scott F Michael
- Department of Biological Sciences, College of Arts and Sciences, Florida Gulf Coast University, Fort Myers, Florida, USA
| | - Fernando A Bozza
- National Institute of Infectious Diseases Evandro Chagas, Fundação Oswaldo Cruz (FIOCRUZ), Rio de Janeiro, Rio de Janeiro, Brazil
- D'Or Institute for Research and Education, Rio de Janeiro, Brazil
| | - Thiago M L Souza
- National Institute for Science and Technology on Innovation on Neglected Diseases, FIOCRUZ, Rio de Janeiro, Rio de Janeiro, Brazil
- Center for Technological Development in Health, FIOCRUZ, Rio de Janeiro, Rio de Janeiro, Brazil
| | - Irene Bosch
- Institute for Medical Engineering and Science, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
| | - Nathan L Yozwiak
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
- Center for Systems Biology, Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts, USA
| | - Bronwyn L MacInnis
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
- Department of Immunology and Infectious Diseases, Harvard T.H. Chan School of Public Health, Harvard University, Boston, Massachusetts, USA
| | - Pardis C Sabeti
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
- Center for Systems Biology, Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts, USA
- Department of Immunology and Infectious Diseases, Harvard T.H. Chan School of Public Health, Harvard University, Boston, Massachusetts, USA
- Howard Hughes Medical Institute, Chevy Chase, Maryland, USA
| |
Collapse
|
41
|
Foley BT, Leitner T, Paraskevis D, Peeters M. Primate immunodeficiency virus classification and nomenclature: Review. INFECTION, GENETICS AND EVOLUTION : JOURNAL OF MOLECULAR EPIDEMIOLOGY AND EVOLUTIONARY GENETICS IN INFECTIOUS DISEASES 2016; 46:150-158. [PMID: 27789390 PMCID: PMC5136504 DOI: 10.1016/j.meegid.2016.10.018] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/28/2016] [Revised: 10/19/2016] [Accepted: 10/21/2016] [Indexed: 12/25/2022]
Abstract
The International Committee for the Taxonomy and Nomenclature of Viruses does not rule on virus classifications below the species level. The definition of species for viruses cannot be clearly defined for all types of viruses. The complex and interesting epidemiology of Human Immunodeficiency Viruses demands a detailed and informative nomenclature system, while at the same time it presents challenges such that many of the rules need to be flexibly applied or modified over time. This review outlines the nomenclature system for primate lentiviruses and provides an update on new findings since the last review was written in 2000.
Collapse
Affiliation(s)
- Brian T Foley
- Theoretical Biology and Biophysics Group, T-6 Mail Stop K710, Los Alamos National Laboratory, Los Alamos, NM 87545, USA.
| | - Thomas Leitner
- Theoretical Biology and Biophysics Group, T-6 Mail Stop K710, Los Alamos National Laboratory, Los Alamos, NM 87545, USA
| | - Dimitrios Paraskevis
- National and Kapodistrian University of Athens, Department of Hygiene, Epidemiology and Medical Statistics, Medical School, Athens, Greece
| | - Martine Peeters
- UMI233-TransVIHMI, Institut de Recherche pour le Développement (IRD), INSERM U1175, University of Montpellier, Montpellier, France; IBC, Computational Biology Institute, 34095 Montpellier, France
| |
Collapse
|
42
|
Duchêne S, Holt KE, Weill FX, Le Hello S, Hawkey J, Edwards DJ, Fourment M, Holmes EC. Genome-scale rates of evolutionary change in bacteria. Microb Genom 2016; 2:e000094. [PMID: 28348834 PMCID: PMC5320706 DOI: 10.1099/mgen.0.000094] [Citation(s) in RCA: 143] [Impact Index Per Article: 17.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2016] [Accepted: 10/24/2016] [Indexed: 01/26/2023] Open
Abstract
Estimating the rates at which bacterial genomes evolve is critical to understanding major evolutionary and ecological processes such as disease emergence, long-term host–pathogen associations and short-term transmission patterns. The surge in bacterial genomic data sets provides a new opportunity to estimate these rates and reveal the factors that shape bacterial evolutionary dynamics. For many organisms estimates of evolutionary rate display an inverse association with the time-scale over which the data are sampled. However, this relationship remains unexplored in bacteria due to the difficulty in estimating genome-wide evolutionary rates, which are impacted by the extent of temporal structure in the data and the prevalence of recombination. We collected 36 whole genome sequence data sets from 16 species of bacterial pathogens to systematically estimate and compare their evolutionary rates and assess the extent of temporal structure in the absence of recombination. The majority (28/36) of data sets possessed sufficient clock-like structure to robustly estimate evolutionary rates. However, in some species reliable estimates were not possible even with ‘ancient DNA’ data sampled over many centuries, suggesting that they evolve very slowly or that they display extensive rate variation among lineages. The robustly estimated evolutionary rates spanned several orders of magnitude, from approximately 10−5 to 10−8 nucleotide substitutions per site year−1. This variation was negatively associated with sampling time, with this relationship best described by an exponential decay curve. To avoid potential estimation biases, such time-dependency should be considered when inferring evolutionary time-scales in bacteria.
Collapse
Affiliation(s)
- Sebastian Duchêne
- 1Marie Bashir Institute of Infectious Diseases and Biosecurity, Charles Perkins Centre, School of Life and Environmental Sciences and Sydney Medical School, The University of Sydney, Sydney, NSW 2006, Australia.,2Centre for Systems Genomics, The University of Melbourne, Melbourne, VIC 3010, Australia.,3Department of Biochemistry and Molecular Biology, Bio21 Molecular Science and Biotechnology Institute, The University of Melbourne, Melbourne, VIC 3010, Australia
| | - Kathryn E Holt
- 2Centre for Systems Genomics, The University of Melbourne, Melbourne, VIC 3010, Australia.,3Department of Biochemistry and Molecular Biology, Bio21 Molecular Science and Biotechnology Institute, The University of Melbourne, Melbourne, VIC 3010, Australia
| | | | - Simon Le Hello
- 4Institut Pasteur, Unité des Bactéries Pathogènes Entériques, Paris 75015, France
| | - Jane Hawkey
- 2Centre for Systems Genomics, The University of Melbourne, Melbourne, VIC 3010, Australia.,3Department of Biochemistry and Molecular Biology, Bio21 Molecular Science and Biotechnology Institute, The University of Melbourne, Melbourne, VIC 3010, Australia
| | - David J Edwards
- 2Centre for Systems Genomics, The University of Melbourne, Melbourne, VIC 3010, Australia.,3Department of Biochemistry and Molecular Biology, Bio21 Molecular Science and Biotechnology Institute, The University of Melbourne, Melbourne, VIC 3010, Australia
| | - Mathieu Fourment
- 1Marie Bashir Institute of Infectious Diseases and Biosecurity, Charles Perkins Centre, School of Life and Environmental Sciences and Sydney Medical School, The University of Sydney, Sydney, NSW 2006, Australia
| | - Edward C Holmes
- 1Marie Bashir Institute of Infectious Diseases and Biosecurity, Charles Perkins Centre, School of Life and Environmental Sciences and Sydney Medical School, The University of Sydney, Sydney, NSW 2006, Australia
| |
Collapse
|
43
|
Ho SYW, Duchêne S, Molak M, Shapiro B. Time-dependent estimates of molecular evolutionary rates: evidence and causes. Mol Ecol 2016; 24:6007-12. [PMID: 26769402 DOI: 10.1111/mec.13450] [Citation(s) in RCA: 44] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2015] [Revised: 06/30/2015] [Accepted: 07/17/2015] [Indexed: 12/28/2022]
Abstract
We are writing in response to a recent critique by Emerson & Hickerson (2015), who challenge the evidence of a time-dependent bias in molecular rate estimates. This bias takes the form of a negative relationship between inferred evolutionary rates and the ages of the calibrations on which these estimates are based. Here, we present a summary of the evidence obtained from a broad range of taxa that supports a time-dependent bias in rate estimates, with a consideration of the potential causes of these observed trends. We also describe recent progress in improving the reliability of evolutionary rate estimation and respond to the concerns raised by Emerson & Hickerson (2015) about the validity of rates estimated from time-structured sequence data. In doing so, we hope to dispel some misconceptions and to highlight several research directions that will improve our understanding of time-dependent biases in rate estimates.
Collapse
Affiliation(s)
- Simon Y W Ho
- School of Biological Sciences, University of Sydney, Sydney, NSW, Australia
| | - Sebastián Duchêne
- School of Biological Sciences, University of Sydney, Sydney, NSW, Australia
| | - Martyna Molak
- Museum and Institute of Zoology, Polish Academy of Sciences, Warsaw, Poland
| | - Beth Shapiro
- Department of Ecology and Evolutionary Biology, University of California, Santa Cruz, California, USA.,UCSC Genomics Institute, University of California, Santa Cruz, California, USA
| |
Collapse
|
44
|
Rieux A, Balloux F. Inferences from tip-calibrated phylogenies: a review and a practical guide. Mol Ecol 2016; 25:1911-24. [PMID: 26880113 PMCID: PMC4949988 DOI: 10.1111/mec.13586] [Citation(s) in RCA: 87] [Impact Index Per Article: 10.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2015] [Revised: 02/01/2016] [Accepted: 02/04/2016] [Indexed: 12/25/2022]
Abstract
Molecular dating of phylogenetic trees is a growing discipline using sequence data to co‐estimate the timing of evolutionary events and rates of molecular evolution. All molecular‐dating methods require converting genetic divergence between sequences into absolute time. Historically, this could only be achieved by associating externally derived dates obtained from fossil or biogeographical evidence to internal nodes of the tree. In some cases, notably for fast‐evolving genomes such as viruses and some bacteria, the time span over which samples were collected may cover a significant proportion of the time since they last shared a common ancestor. This situation allows phylogenetic trees to be calibrated by associating sampling dates directly to the sequences representing the tips (terminal nodes) of the tree. The increasing availability of genomic data from ancient DNA extends the applicability of such tip‐based calibration to a variety of taxa including humans, extinct megafauna and various microorganisms which typically have a scarce fossil record. The development of statistical models accounting for heterogeneity in different aspects of the evolutionary process while accommodating very large data sets (e.g. whole genomes) has allowed using tip‐dating methods to reach inferences on divergence times, substitution rates, past demography or the age of specific mutations on a variety of spatiotemporal scales. In this review, we summarize the current state of the art of tip dating, discuss some recent applications, highlight common pitfalls and provide a ‘how to’ guide to thoroughly perform such analyses.
Collapse
Affiliation(s)
- Adrien Rieux
- Department of Genetics, Evolution and Environment, UCL Genetics Institute, University College London, Gower Street, London, WC1E 6BT, UK
| | - François Balloux
- Department of Genetics, Evolution and Environment, UCL Genetics Institute, University College London, Gower Street, London, WC1E 6BT, UK
| |
Collapse
|
45
|
Geller R, Estada Ú, Peris JB, Andreu I, Bou JV, Garijo R, Cuevas JM, Sabariegos R, Mas A, Sanjuán R. Highly heterogeneous mutation rates in the hepatitis C virus genome. Nat Microbiol 2016; 1:16045. [PMID: 27572964 DOI: 10.1038/nmicrobiol.2016.45] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2016] [Accepted: 03/09/2016] [Indexed: 01/10/2023]
Abstract
Spontaneous mutations are the ultimate source of genetic variation and have a prominent role in evolution. RNA viruses such as hepatitis C virus (HCV) have extremely high mutation rates, but these rates have been inferred from a minute fraction of genome sites, limiting our view of how RNA viruses create diversity. Here, by applying high-fidelity ultradeep sequencing to a modified replicon system, we scored >15,000 spontaneous mutations, encompassing more than 90% of the HCV genome. This revealed >1,000-fold differences in mutability across genome sites, with extreme variations even between adjacent nucleotides. We identify base composition, the presence of high- and low-mutation clusters and transition/transversion biases as the main factors driving this heterogeneity. Furthermore, we find that mutability correlates with the ability of HCV to diversify in patients. These data provide a site-wise baseline for interrogating natural selection, genetic load and evolvability in HCV, as well as for evaluating drug resistance and immune evasion risks.
Collapse
Affiliation(s)
- Ron Geller
- Instituto Cavanilles de Biodiversidad y Biología Evolutiva and Institute for Integrative Systems Biology (I2SysBio), Universitat de València, 46980 Paterna, València, Spain
| | - Úrsula Estada
- Unitat de Genómica, Servei Central de Suport a la Investigació Experimental, Universitat de València, 46100 Burjassot, València, Spain
| | - Joan B Peris
- Instituto Cavanilles de Biodiversidad y Biología Evolutiva and Institute for Integrative Systems Biology (I2SysBio), Universitat de València, 46980 Paterna, València, Spain
| | - Iván Andreu
- Instituto Cavanilles de Biodiversidad y Biología Evolutiva and Institute for Integrative Systems Biology (I2SysBio), Universitat de València, 46980 Paterna, València, Spain
| | - Juan-Vicente Bou
- Instituto Cavanilles de Biodiversidad y Biología Evolutiva and Institute for Integrative Systems Biology (I2SysBio), Universitat de València, 46980 Paterna, València, Spain
| | - Raquel Garijo
- Instituto Cavanilles de Biodiversidad y Biología Evolutiva and Institute for Integrative Systems Biology (I2SysBio), Universitat de València, 46980 Paterna, València, Spain
| | - José M Cuevas
- Instituto Cavanilles de Biodiversidad y Biología Evolutiva and Institute for Integrative Systems Biology (I2SysBio), Universitat de València, 46980 Paterna, València, Spain
| | - Rosario Sabariegos
- Regional Center for Biomedical Research, Universidad de Castilla-La Mancha, 02006 Albacete, Spain
| | - Antonio Mas
- Regional Center for Biomedical Research, Universidad de Castilla-La Mancha, 02006 Albacete, Spain
| | - Rafael Sanjuán
- Instituto Cavanilles de Biodiversidad y Biología Evolutiva and Institute for Integrative Systems Biology (I2SysBio), Universitat de València, 46980 Paterna, València, Spain.,Departament de Genètica, Universitat de València, 46100 Burjassot, València, Spain
| |
Collapse
|
46
|
Ni S, Stoneking M. Improvement in detection of minor alleles in next generation sequencing by base quality recalibration. BMC Genomics 2016; 17:139. [PMID: 26920804 PMCID: PMC4769523 DOI: 10.1186/s12864-016-2463-2] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2015] [Accepted: 02/11/2016] [Indexed: 12/22/2022] Open
Abstract
BACKGROUND Minor allele detection in very high coverage sequence data (>1000X) has many applications such as detecting mtDNA heteroplasmy, somatic mutations in cancer or tumors, SNP calling in pool sequencing, etc., where reads with low frequency are not necessarily sequence error but may instead convey biological information. However, the suitability of common base quality recalibration tools for such applications has not been investigated in detail. RESULTS We show that the widely used tool GATK BaseRecalibration has several limitations in minor allele detection. First, GATK IndelRealignment fails to work if the sequence coverage is above a certain level since it then becomes computationally infeasible. Second, the accuracy of the base quality largely depends on the database of known SNPs as the control, which limits the ability of de novo minor allele detection. Third, GATK reduces the base quality of sequence errors at the cost of reducing scores for true minor alleles. To overcome these limitations, we present a novel approach called SEGREG, which applies segmented regression to control sequences (e.g. phiX174 DNA) spiked into a sequencing run. Based on simulations SEGREG improves both the accuracy of base quality scores and the detection of minor alleles. We further investigate sequence error and recalibration parameters by applying a Logarithm Likelihood Ratio (LLR) approach to SEGREG recalibrated base quality scores for phiX174 DNA sequenced to very high coverage, and for mtDNA genome sequences previously analyzed for heteroplasmic variants. CONCLUSIONS Our results suggest that SEGREG improves base recalibration without suffering the limitations discussed above, and the LLR approach benefits from SEGREG in identifying more true minor alleles, while avoiding false positives from sequencing error.
Collapse
Affiliation(s)
- Shengyu Ni
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, D04103, Germany.
| | - Mark Stoneking
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, D04103, Germany.
| |
Collapse
|
47
|
Duchêne S, Di Giallonardo F, Holmes EC. Substitution Model Adequacy and Assessing the Reliability of Estimates of Virus Evolutionary Rates and Time Scales. Mol Biol Evol 2015; 33:255-67. [PMID: 26416981 DOI: 10.1093/molbev/msv207] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Determining the time scale of virus evolution is central to understanding their origins and emergence. The phylogenetic methods commonly used for this purpose can be misleading if the substitution model makes incorrect assumptions about the data. Empirical studies consider a pool of models and select that with the highest statistical fit. However, this does not allow the rejection of all models, even if they poorly describe the data. An alternative is to use model adequacy methods that evaluate the ability of a model to predict hypothetical future observations. This can be done by comparing the empirical data with data generated under the model in question. We conducted simulations to evaluate the sensitivity of such methods with nucleotide, amino acid, and codon data. These effectively detected underparameterized models, but failed to detect mutational saturation and some instances of nonstationary base composition, which can lead to biases in estimates of tree topology and length. To test the applicability of these methods with real data, we analyzed nucleotide and amino acid data sets from the genus Flavivirus of RNA viruses. In most cases these models were inadequate, with the exception of a data set of relatively closely related sequences of Dengue virus, for which the GTR+Γ nucleotide and LG+Γ amino acid substitution models were adequate. Our results partly explain the lack of consensus over estimates of the long-term evolutionary time scale of these viruses, and indicate that assessing the adequacy of substitution models should be routinely used to determine whether estimates are reliable.
Collapse
Affiliation(s)
- Sebastián Duchêne
- Marie Bashir Institute for Infectious Diseases and Biosecurity, Charles Perkins Centre, School of Biological Sciences and Sydney Medical School, The University of Sydney, Sydney, NSW, Australia
| | - Francesca Di Giallonardo
- Marie Bashir Institute for Infectious Diseases and Biosecurity, Charles Perkins Centre, School of Biological Sciences and Sydney Medical School, The University of Sydney, Sydney, NSW, Australia
| | - Edward C Holmes
- Marie Bashir Institute for Infectious Diseases and Biosecurity, Charles Perkins Centre, School of Biological Sciences and Sydney Medical School, The University of Sydney, Sydney, NSW, Australia
| |
Collapse
|
48
|
Molak M, Ho SYW. Prolonged decay of molecular rate estimates for metazoan mitochondrial DNA. PeerJ 2015; 3:e821. [PMID: 25780773 PMCID: PMC4358697 DOI: 10.7717/peerj.821] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2014] [Accepted: 02/16/2015] [Indexed: 01/07/2023] Open
Abstract
Evolutionary timescales can be estimated from genetic data using the molecular clock, often calibrated by fossil or geological evidence. However, estimates of molecular rates in mitochondrial DNA appear to scale negatively with the age of the clock calibration. Although such a pattern has been observed in a limited range of data sets, it has not been studied on a large scale in metazoans. In addition, there is uncertainty over the temporal extent of the time-dependent pattern in rate estimates. Here we present a meta-analysis of 239 rate estimates from metazoans, representing a range of timescales and taxonomic groups. We found evidence of time-dependent rates in both coding and non-coding mitochondrial markers, in every group of animals that we studied. The negative relationship between the estimated rate and time persisted across a much wider range of calibration times than previously suggested. This indicates that, over long time frames, purifying selection gives way to mutational saturation as the main driver of time-dependent biases in rate estimates. The results of our study stress the importance of accounting for time-dependent biases in estimating mitochondrial rates regardless of the timescale over which they are inferred.
Collapse
Affiliation(s)
- Martyna Molak
- School of Biological Sciences, University of Sydney , Sydney , Australia ; Museum and Institute of Zoology, Polish Academy of Sciences , Warsaw , Poland
| | - Simon Y W Ho
- School of Biological Sciences, University of Sydney , Sydney , Australia
| |
Collapse
|