1
|
Richard H, Martinetti D, Lercier D, Fouillat Y, Hadi B, Elkahky M, Ding J, Michel L, Morris CE, Berthier K, Maupas F, Soubeyrand S. Computing Geographical Networks Generated by Air-Mass Movement. GEOHEALTH 2023; 7:e2023GH000885. [PMID: 37859755 PMCID: PMC10584379 DOI: 10.1029/2023gh000885] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/13/2023] [Revised: 09/18/2023] [Accepted: 09/22/2023] [Indexed: 10/21/2023]
Abstract
As air masses move within the troposphere, they transport a multitude of components including gases and particles such as pollen and microorganisms. These movements generate atmospheric highways that connect geographic areas at distant, local, and global scales that particles can ride depending on their aerodynamic properties and their reaction to environmental conditions. In this article we present an approach and an accompanying web application called tropolink for measuring the extent to which distant locations are potentially connected by air-mass movement. This approach is based on the computation of trajectories of air masses with the HYSPLIT atmospheric transport and dispersion model, and on the computation of connection frequencies, called connectivities, in the purpose of building trajectory-based geographical networks. It is illustrated for different spatial and temporal scales with three case studies related to plant epidemiology. The web application that we designed allows the user to easily perform intensive computation and mobilize massive archived gridded meteorological data to build weighted directed networks. The analysis of such networks allowed us for example, to describe the potential of invasion of a migratory pest beyond its actual distribution. Our approach could also be used to compute geographical networks generated by air-mass movement for diverse application domains, for example, to assess long-term risk of spread from persistent or recurrent sources of pollutants, including wildfire smoke.
Collapse
Affiliation(s)
| | | | | | | | - B. Hadi
- Plant Production and Protection Division (NSP)Food and Agriculture Organization of the United Nations (FAO)RomeItaly
| | - M. Elkahky
- Plant Production and Protection Division (NSP)Food and Agriculture Organization of the United Nations (FAO)RomeItaly
| | - J. Ding
- Plant Production and Protection Division (NSP)Food and Agriculture Organization of the United Nations (FAO)RomeItaly
| | - L. Michel
- Plateforme ESVINRAEBioSPAvignonFrance
| | | | | | | | | |
Collapse
|
2
|
Ribaud M, Gabriel E, Hughes J, Soubeyrand S. Identifying potential significant factors impacting zero-inflated proportion data. Stat Med 2023; 42:3467-3486. [PMID: 37290435 DOI: 10.1002/sim.9814] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2022] [Revised: 04/03/2023] [Accepted: 05/19/2023] [Indexed: 06/10/2023]
Abstract
Classical supervised methods like linear regression and decision trees are not completely adapted for identifying impacting factors on a response variable corresponding to zero-inflated proportion data (ZIPD) that are dependent, continuous and bounded. In this article we propose a within-block permutation-based methodology to identify factors (discrete or continuous) that are significantly correlated with ZIPD, we propose a performance indicator quantifying the percentage of correlation explained by the subset of significant factors, and we show how to predict the ranks of the response variables conditionally on the observation of these factors. The methodology is illustrated on simulated data and on two real data sets dealing with epidemiology. In the first data set, ZIPD correspond to probabilities of transmission of Influenza between horses. In the second data set, ZIPD correspond to probabilities that geographic entities (eg, states and countries) have the same COVID-19 mortality dynamics.
Collapse
Affiliation(s)
| | | | - Joseph Hughes
- Centre for Virus Research, MRC-University of Glasgow, Glasgow, UK
| | | |
Collapse
|
3
|
Ke Z, Vikalo H. Graph-Based Reconstruction and Analysis of Disease Transmission Networks Using Viral Genomic Data. J Comput Biol 2023. [PMID: 37347892 DOI: 10.1089/cmb.2022.0373] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/24/2023] Open
Abstract
Understanding the patterns of viral disease transmissions helps establish public health policies and aids in controlling and ending a disease outbreak. Classical methods for studying disease transmission dynamics that rely on epidemiological data, such as times of sample collection and duration of exposure intervals, struggle to provide desired insight due to limited informativeness of such data. A more precise characterization of disease transmissions may be acquired from sequencing data that reveal genetic distance between viral genomes in patient samples. Indeed, genetic distance between viral strains present in hosts contains valuable information about transmission history, thus motivating the design of methods that rely on genomic data to reconstruct a directed disease transmission network, detect transmission clusters, and identify significant network nodes (e.g., super-spreaders). In this article, we present a novel end-to-end framework for the analysis of viral transmissions utilizing viral genomic (sequencing) data. The proposed framework groups infected hosts into transmission clusters based on the reconstructed viral strains infecting them; the genetic distance between a pair of hosts is calculated using Earth Mover's Distance, and further used to infer transmission direction between the hosts. To quantify the significance of a host in the transmission network, the importance score is calculated by a graph convolutional autoencoder. The viral transmission network is represented by a directed minimum spanning tree utilizing the Edmond's algorithm modified to incorporate constraints on the importance scores of the hosts. The proposed framework outperforms state-of-the-art techniques for the analysis of viral transmission dynamics in several experiments on semiexperimental as well as experimental data.
Collapse
Affiliation(s)
- Ziqi Ke
- Department of Electrical and Computer Engineering, The University of Texas at Austin, Austin, Texas, USA
| | - Haris Vikalo
- Department of Electrical and Computer Engineering, The University of Texas at Austin, Austin, Texas, USA
| |
Collapse
|
4
|
Johnson PCD, Hägglund S, Näslund K, Meyer G, Taylor G, Orton RJ, Zohari S, Haydon DT, Valarcher JF. Evaluating the potential of whole-genome sequencing for tracing transmission routes in experimental infections and natural outbreaks of bovine respiratory syncytial virus. Vet Res 2022; 53:107. [PMID: 36510312 PMCID: PMC9746130 DOI: 10.1186/s13567-022-01127-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2022] [Accepted: 09/09/2022] [Indexed: 12/14/2022] Open
Abstract
Bovine respiratory syncytial virus (BRSV) is a major cause of respiratory disease in cattle. Genomic sequencing can resolve phylogenetic relationships between virus populations, which can be used to infer transmission routes and potentially inform the design of biosecurity measures. Sequencing of short (<2000 nt) segments of the 15 000-nt BRSV genome has revealed geographic and temporal clustering of BRSV populations, but insufficient variation to distinguish viruses collected from herds infected close together in space and time. This study investigated the potential for whole-genome sequencing to reveal sufficient genomic variation for inferring transmission routes between herds. Next-generation sequencing (NGS) data were generated from experimental infections and from natural outbreaks in Jämtland and Uppsala counties in Sweden. Sufficient depth of coverage for analysis of consensus and sub-consensus sequence diversity was obtained from 47 to 20 samples respectively. Few (range: 0-6 polymorphisms across the six experiments) consensus-level polymorphisms were observed along experimental transmissions. A much higher level of diversity (146 polymorphic sites) was found among the consensus sequences from the outbreak samples. The majority (144/146) of polymorphisms were between rather than within counties, suggesting that consensus whole-genome sequences show insufficient spatial resolution for inferring direct transmission routes, but might allow identification of outbreak sources at the regional scale. By contrast, within-sample diversity was generally higher in the experimental than the outbreak samples. Analyses to infer known (experimental) and suspected (outbreak) transmission links from within-sample diversity data were uninformative. In conclusion, analysis of the whole-genome sequence of BRSV from experimental samples discriminated between circulating isolates from distant areas, but insufficient diversity was observed between closely related isolates to aid local transmission route inference.
Collapse
Affiliation(s)
- Paul C D Johnson
- School of Biodiversity, One Health and Veterinary Medicine, University of Glasgow, Glasgow, UK.
| | - Sara Hägglund
- HPIG. Unit of Ruminant Medicine. Department of Clinical Sciences, Swedish University of Agricultural Sciences (SLU), Uppsala, Sweden
| | - Katarina Näslund
- Department of Microbiology, National Veterinary Institute, SVA, Uppsala, Sweden
| | - Gilles Meyer
- IHAP, Université de Toulouse, INRAE, ENVT, Toulouse, France
| | | | - Richard J Orton
- MRC-University of Glasgow Centre for Virus Research, Glasgow, UK
| | - Siamak Zohari
- Department of Microbiology, National Veterinary Institute, SVA, Uppsala, Sweden
| | - Daniel T Haydon
- School of Biodiversity, One Health and Veterinary Medicine, University of Glasgow, Glasgow, UK
| | - Jean François Valarcher
- HPIG. Unit of Ruminant Medicine. Department of Clinical Sciences, Swedish University of Agricultural Sciences (SLU), Uppsala, Sweden
| |
Collapse
|
5
|
Alamil M, Thébaud G, Berthier K, Soubeyrand S. Characterizing viral within-host diversity in fast and non-equilibrium demo-genetic dynamics. Front Microbiol 2022; 13:983938. [PMID: 36274731 PMCID: PMC9581327 DOI: 10.3389/fmicb.2022.983938] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2022] [Accepted: 09/08/2022] [Indexed: 11/13/2022] Open
Abstract
High-throughput sequencing has opened the route for a deep assessment of within-host genetic diversity that can be used, e.g., to characterize microbial communities and to infer transmission links in infectious disease outbreaks. The performance of such characterizations and inferences cannot be analytically assessed in general and are often grounded on computer-intensive evaluations. Then, being able to simulate within-host genetic diversity across time under various demo-genetic assumptions is paramount to assess the performance of the approaches of interest. In this context, we built an original model that can be simulated to investigate the temporal evolution of genotypes and their frequencies under various demo-genetic assumptions. The model describes the growth and the mutation of genotypes at the nucleotide resolution conditional on an overall within-host viral kinetics, and can be tuned to generate fast non-equilibrium demo-genetic dynamics. We ran simulations of this model and computed classic diversity indices to characterize the temporal variation of within-host genetic diversity (from high-throughput amplicon sequences) of virus populations under three demographic kinetic models of viral infection. Our results highlight how demographic (viral load) and genetic (mutation, selection, or drift) factors drive variations in within-host diversity during the course of an infection. In particular, we observed a non-monotonic relationship between pathogen population size and genetic diversity, and a reduction of the impact of mutation on diversity when a non-specific host immune response is activated. The large variation in the diversity patterns generated in our simulations suggests that the underlying model provides a flexible basis to produce very diverse demo-genetic scenarios and test, for instance, methods for the inference of transmission links during outbreaks.
Collapse
Affiliation(s)
- Maryam Alamil
- INRAE, BioSP, Avignon, France
- Department of Mathematics and Computer Science, Alfaisal University, Riyadh, Saudi Arabia
- *Correspondence: Maryam Alamil ;
| | - Gaël Thébaud
- PHIM Plant Health Institute, INRAE, Univ Montpellier, CIRAD, Institut Agro, IRD, Montpellier, France
| | | | | |
Collapse
|
6
|
Methods Combining Genomic and Epidemiological Data in the Reconstruction of Transmission Trees: A Systematic Review. Pathogens 2022; 11:pathogens11020252. [PMID: 35215195 PMCID: PMC8875843 DOI: 10.3390/pathogens11020252] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2021] [Revised: 02/08/2022] [Accepted: 02/11/2022] [Indexed: 11/17/2022] Open
Abstract
In order to better understand transmission dynamics and appropriately target control and preventive measures, studies have aimed to identify who-infected-whom in actual outbreaks. Numerous reconstruction methods exist, each with their own assumptions, types of data, and inference strategy. Thus, selecting a method can be difficult. Following PRISMA guidelines, we systematically reviewed the literature for methods combing epidemiological and genomic data in transmission tree reconstruction. We identified 22 methods from the 41 selected articles. We defined three families according to how genomic data was handled: a non-phylogenetic family, a sequential phylogenetic family, and a simultaneous phylogenetic family. We discussed methods according to the data needed as well as the underlying sequence mutation, within-host evolution, transmission, and case observation. In the non-phylogenetic family consisting of eight methods, pairwise genetic distances were estimated. In the phylogenetic families, transmission trees were inferred from phylogenetic trees either simultaneously (nine methods) or sequentially (five methods). While a majority of methods (17/22) modeled the transmission process, few (8/22) took into account imperfect case detection. Within-host evolution was generally (7/8) modeled as a coalescent process. These practical and theoretical considerations were highlighted in order to help select the appropriate method for an outbreak.
Collapse
|
7
|
Forde TL, Dennis TPW, Aminu OR, Harvey WT, Hassim A, Kiwelu I, Medvecky M, Mshanga D, Van Heerden H, Vogel A, Zadoks RN, Mmbaga BT, Lembo T, Biek R. Population genomics of Bacillus anthracis from an anthrax hyperendemic area reveals transmission processes across spatial scales and unexpected within-host diversity. Microb Genom 2022; 8:000759. [PMID: 35188453 PMCID: PMC8942019 DOI: 10.1099/mgen.0.000759] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2021] [Accepted: 12/10/2021] [Indexed: 11/18/2022] Open
Abstract
Genomic sequencing has revolutionized our understanding of bacterial disease epidemiology, but remains underutilized for zoonotic pathogens in remote endemic settings. Anthrax, caused by the spore-forming bacterium Bacillus anthracis, remains a threat to human and animal health and rural livelihoods in low- and middle-income countries. While the global genomic diversity of B. anthracis has been well-characterized, there is limited information on how its populations are genetically structured at the scale at which transmission occurs, critical for understanding the pathogen's evolution and transmission dynamics. Using a uniquely rich dataset, we quantified genome-wide SNPs among 73 B. anthracis isolates derived from 33 livestock carcasses sampled over 1 year throughout the Ngorongoro Conservation Area, Tanzania, a region hyperendemic for anthrax. Genome-wide SNPs distinguished 22 unique B. anthracis genotypes (i.e. SNP profiles) within the study area. However, phylogeographical structure was lacking, as identical SNP profiles were found throughout the study area, likely the result of the long and variable periods of spore dormancy and long-distance livestock movements. Significantly, divergent genotypes were obtained from spatio-temporally linked cases and even individual carcasses. The high number of SNPs distinguishing isolates from the same host is unlikely to have arisen during infection, as supported by our simulation models. This points to an unexpectedly wide transmission bottleneck for B. anthracis, with an inoculum comprising multiple variants being the norm. Our work highlights that inferring transmission patterns of B. anthracis from genomic data will require analytical approaches that account for extended and variable environmental persistence, as well as co-infection.
Collapse
Affiliation(s)
- Taya L. Forde
- Institute of Biodiversity, Animal Health and Comparative Medicine, University of Glasgow, Glasgow, UK
| | - Tristan P. W. Dennis
- Institute of Biodiversity, Animal Health and Comparative Medicine, University of Glasgow, Glasgow, UK
| | - O. Rhoda Aminu
- Institute of Biodiversity, Animal Health and Comparative Medicine, University of Glasgow, Glasgow, UK
| | - William T. Harvey
- Institute of Biodiversity, Animal Health and Comparative Medicine, University of Glasgow, Glasgow, UK
| | - Ayesha Hassim
- Department of Veterinary Tropical Diseases, Faculty of Veterinary Science, University of Pretoria, Onderstepoort, South Africa
| | - Ireen Kiwelu
- Kilimanjaro Clinical Research Institute, Kilimanjaro Christian Medical Centre, Moshi, Tanzania
| | - Matej Medvecky
- Institute of Biodiversity, Animal Health and Comparative Medicine, University of Glasgow, Glasgow, UK
| | | | - Henriette Van Heerden
- Department of Veterinary Tropical Diseases, Faculty of Veterinary Science, University of Pretoria, Onderstepoort, South Africa
| | - Adeline Vogel
- Institute of Biodiversity, Animal Health and Comparative Medicine, University of Glasgow, Glasgow, UK
| | - Ruth N. Zadoks
- Institute of Biodiversity, Animal Health and Comparative Medicine, University of Glasgow, Glasgow, UK
- Present address: Sydney School of Veterinary Science, University of Sydney, Sydney, Australia
| | - Blandina T. Mmbaga
- Kilimanjaro Clinical Research Institute, Kilimanjaro Christian Medical Centre, Moshi, Tanzania
- Kilimanjaro Christian Medical University College, Moshi, Tanzania
| | - Tiziana Lembo
- Institute of Biodiversity, Animal Health and Comparative Medicine, University of Glasgow, Glasgow, UK
| | - Roman Biek
- Institute of Biodiversity, Animal Health and Comparative Medicine, University of Glasgow, Glasgow, UK
| |
Collapse
|
8
|
Berry IM, Melendrez MC, Pollett S, Figueroa K, Buddhari D, Klungthong C, Nisalak A, Panciera M, Thaisomboonsuk B, Li T, Vallard TG, Macareo L, Yoon IK, Thomas SJ, Endy T, Jarman RG. Precision Tracing of Household Dengue Spread Using Inter- and Intra-Host Viral Variation Data, Kamphaeng Phet, Thailand. Emerg Infect Dis 2021; 27:1637-1644. [PMID: 34013878 PMCID: PMC8153871 DOI: 10.3201/eid2706.204323] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
Dengue control approaches are best informed by granular spatial epidemiology of these viruses, yet reconstruction of inter- and intra-household transmissions is limited when analyzing case count, serologic, or genomic consensus sequence data. To determine viral spread on a finer spatial scale, we extended phylogenomic discrete trait analyses to reconstructions of house-to-house transmissions within a prospective cluster study in Kamphaeng Phet, Thailand. For additional resolution and transmission confirmation, we mapped dengue intra-host single nucleotide variants on the taxa of these time-scaled phylogenies. This approach confirmed 19 household transmissions and revealed that dengue disperses an average of 70 m per day between households in these communities. We describe an evolutionary biology framework for the resolution of dengue transmissions that cannot be differentiated based on epidemiologic and consensus genome data alone. This framework can be used as a public health tool to inform control approaches and enable precise tracing of dengue transmissions.
Collapse
|
9
|
Ezanno P, Andraud M, Beaunée G, Hoch T, Krebs S, Rault A, Touzeau S, Vergu E, Widgren S. How mechanistic modelling supports decision making for the control of enzootic infectious diseases. Epidemics 2020; 32:100398. [PMID: 32622313 DOI: 10.1016/j.epidem.2020.100398] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2019] [Revised: 05/07/2020] [Accepted: 05/29/2020] [Indexed: 12/28/2022] Open
Abstract
Controlling enzootic diseases, which generate a large cumulative burden and are often unregulated, is needed for sustainable farming, competitive agri-food chains, and veterinary public health. We discuss the benefits and challenges of mechanistic epidemiological modelling for livestock enzootics, with particular emphasis on the need for interdisciplinary approaches. We focus on issues arising when modelling pathogen spread at various scales (from farm to the region) to better assess disease control and propose targeted options. We discuss in particular the inclusion of farmers' strategic decision-making, the integration of within-host scale to refine intervention targeting, and the need to ground models on data.
Collapse
Affiliation(s)
- P Ezanno
- INRAE, Oniris, BIOEPAR, Site de la Chantrerie, CS40706, 44307 Nantes, France.
| | - M Andraud
- Unité épidémiologie et bien-être du porc, Anses Laboratoire de Ploufragan-Plouzané, Ploufragan, France.
| | - G Beaunée
- INRAE, Oniris, BIOEPAR, Site de la Chantrerie, CS40706, 44307 Nantes, France.
| | - T Hoch
- INRAE, Oniris, BIOEPAR, Site de la Chantrerie, CS40706, 44307 Nantes, France.
| | - S Krebs
- INRAE, Oniris, BIOEPAR, Site de la Chantrerie, CS40706, 44307 Nantes, France.
| | - A Rault
- INRAE, Oniris, BIOEPAR, Site de la Chantrerie, CS40706, 44307 Nantes, France.
| | - S Touzeau
- INRAE, CNRS, Université Côte d'Azur, ISA, France; Inria, INRAE, CNRS, Université Paris Sorbonne, Université Côte d'Azur, BIOCORE, France.
| | - E Vergu
- INRAE, Université Paris-Saclay, MaIAGE, 78350 Jouy-en-Josas, France.
| | - S Widgren
- Department of Disease Control and Epidemiology, National Veterinary Institute, 751 89 Uppsala, Sweden.
| |
Collapse
|
10
|
Thompson RN, Brooks-Pollock E. Detection, forecasting and control of infectious disease epidemics: modelling outbreaks in humans, animals and plants. Philos Trans R Soc Lond B Biol Sci 2020; 374:20190038. [PMID: 31056051 DOI: 10.1098/rstb.2019.0038] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
The 1918 influenza pandemic is one of the most devastating infectious disease epidemics on record, having caused approximately 50 million deaths worldwide. Control measures, including prohibiting non-essential gatherings as well as closing cinemas and music halls, were applied with varying success and limited knowledge of transmission dynamics. One hundred years later, following developments in the field of mathematical epidemiology, models are increasingly used to guide decision-making and devise appropriate interventions that mitigate the impacts of epidemics. Epidemiological models have been used as decision-making tools during outbreaks in human, animal and plant populations. However, as the subject has developed, human, animal and plant disease modelling have diverged. Approaches have been developed independently for pathogens of each host type, often despite similarities between the models used in these complementary fields. With the increased importance of a One Health approach that unifies human, animal and plant health, we argue that more inter-disciplinary collaboration would enhance each of the related disciplines. This pair of theme issues presents research articles written by human, animal and plant disease modellers. In this introductory article, we compare the questions pertinent to, and approaches used by, epidemiological modellers of human, animal and plant pathogens, and summarize the articles in these theme issues. We encourage future collaboration that transcends disciplinary boundaries and links the closely related areas of human, animal and plant disease epidemic modelling. This article is part of the theme issue 'Modelling infectious disease outbreaks in humans, animals and plants: approaches and important themes'. This issue is linked with the subsequent theme issue 'Modelling infectious disease outbreaks in humans, animals and plants: epidemic forecasting and control'.
Collapse
Affiliation(s)
- Robin N Thompson
- 1 Mathematical Institute, University of Oxford , Andrew Wiles Building, Radcliffe Observatory Quarter, Woodstock Road, Oxford OX2 6GG , UK.,2 Department of Zoology, University of Oxford , Peter Medawar Building, South Parks Road, Oxford OX1 3SY , UK.,3 Christ Church, University of Oxford , St Aldates, Oxford OX1 1DP , UK
| | - Ellen Brooks-Pollock
- 4 Bristol Veterinary School, University of Bristol , Langford BS40 5DU , UK.,5 National Institute for Health Research, Health Protection Research Unit in Evaluation of Interventions, Bristol Medical School , Bristol BS8 2BN , UK
| |
Collapse
|
11
|
Pérez-Losada M, Arenas M, Galán JC, Bracho MA, Hillung J, García-González N, González-Candelas F. High-throughput sequencing (HTS) for the analysis of viral populations. INFECTION GENETICS AND EVOLUTION 2020; 80:104208. [PMID: 32001386 DOI: 10.1016/j.meegid.2020.104208] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/30/2019] [Revised: 01/21/2020] [Accepted: 01/24/2020] [Indexed: 12/12/2022]
Abstract
The development of High-Throughput Sequencing (HTS) technologies is having a major impact on the genomic analysis of viral populations. Current HTS platforms can capture nucleic acid variation across millions of genes for both selected amplicons and full viral genomes. HTS has already facilitated the discovery of new viruses, hinted new taxonomic classifications and provided a deeper and broader understanding of their diversity, population and genetic structure. Hence, HTS has already replaced standard Sanger sequencing in basic and applied research fields, but the next step is its implementation as a routine technology for the analysis of viruses in clinical settings. The most likely application of this implementation will be the analysis of viral genomics, because the huge population sizes, high mutation rates and very fast replacement of viral populations have demonstrated the limited information obtained with Sanger technology. In this review, we describe new technologies and provide guidelines for the high-throughput sequencing and genetic and evolutionary analyses of viral populations and metaviromes, including software applications. With the development of new HTS technologies, new and refurbished molecular and bioinformatic tools are also constantly being developed to process and integrate HTS data. These allow assembling viral genomes and inferring viral population diversity and dynamics. Finally, we also present several applications of these approaches to the analysis of viral clinical samples including transmission clusters and outbreak characterization.
Collapse
Affiliation(s)
- Marcos Pérez-Losada
- Computational Biology Institute, Milken Institute School of Public Health, George Washington University, Washington, DC, USA; CIBIO-InBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, Universidade do Porto, Campus Agrário de Vairão, Vairão 4485-661, Portugal
| | - Miguel Arenas
- Department of Biochemistry, Genetics and Immunology, University of Vigo, 36310 Vigo, Spain; Biomedical Research Center (CINBIO), University of Vigo, 36310 Vigo, Spain.
| | - Juan Carlos Galán
- Microbiology Service, Hospital Ramón y Cajal, Madrid, Spain; CIBER in Epidemiology and Public Health, Spain.
| | - Mª Alma Bracho
- CIBER in Epidemiology and Public Health, Spain; Joint Research Unit "Infection and Public Health" FISABIO-University of Valencia, Valencia, Spain.
| | - Julia Hillung
- Joint Research Unit "Infection and Public Health" FISABIO-University of Valencia, Valencia, Spain; Institute for Integrative Systems Biology (I2SysBio), CSIC-University of Valencia, Valencia, Spain.
| | - Neris García-González
- Joint Research Unit "Infection and Public Health" FISABIO-University of Valencia, Valencia, Spain; Institute for Integrative Systems Biology (I2SysBio), CSIC-University of Valencia, Valencia, Spain.
| | - Fernando González-Candelas
- CIBER in Epidemiology and Public Health, Spain; Joint Research Unit "Infection and Public Health" FISABIO-University of Valencia, Valencia, Spain; Institute for Integrative Systems Biology (I2SysBio), CSIC-University of Valencia, Valencia, Spain.
| |
Collapse
|