Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Skums P, Zelikovsky A, Singh R, Gussler W, Dimitrova Z, Knyazev S, Mandric I, Ramachandran S, Campo D, Jha D, Bunimovich L, Costenbader E, Sexton C, O'Connor S, Xia GL, Khudyakov Y. QUENTIN: reconstruction of disease transmissions from viral quasispecies genomic data. Bioinformatics 2018;34:163-170. [PMID: 29304222 DOI: 10.1093/bioinformatics/btx402] [Citation(s) in RCA: 42] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2017] [Accepted: 06/15/2017] [Indexed: 01/08/2023] Open

For:	Skums P, Zelikovsky A, Singh R, Gussler W, Dimitrova Z, Knyazev S, Mandric I, Ramachandran S, Campo D, Jha D, Bunimovich L, Costenbader E, Sexton C, O'Connor S, Xia GL, Khudyakov Y. QUENTIN: reconstruction of disease transmissions from viral quasispecies genomic data. Bioinformatics 2018;34:163-170. [PMID: 29304222 DOI: 10.1093/bioinformatics/btx402] [Citation(s) in RCA: 42] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2017] [Accepted: 06/15/2017] [Indexed: 01/08/2023] Open

Number

Cited by Other Article(s)

Deb S, Basu J, Choudhary M. An overview of next generation sequencing strategies and genomics tools used for tuberculosis research. J Appl Microbiol 2024;135:lxae174. [PMID: 39003248 DOI: 10.1093/jambio/lxae174] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2024] [Revised: 06/07/2024] [Accepted: 07/10/2024] [Indexed: 07/15/2024]

Delgado S, Somovilla P, Ferrer-Orta C, Martínez-González B, Vázquez-Monteagudo S, Muñoz-Flores J, Soria ME, García-Crespo C, de Ávila AI, Durán-Pastor A, Gadea I, López-Galíndez C, Moran F, Lorenzo-Redondo R, Verdaguer N, Perales C, Domingo E. Incipient functional SARS-CoV-2 diversification identified through neural network haplotype maps. Proc Natl Acad Sci U S A 2024;121:e2317851121. [PMID: 38416684 DOI: 10.1073/pnas.2317851121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2023] [Accepted: 01/08/2024] [Indexed: 03/01/2024] Open

Affiliation(s)

Soledad Delgado Departamento de Sistemas Informáticos, Escuela Técnica Superior de Ingeniería de Sistemas Informáticos, Universidad Politécnica de Madrid, Madrid 28031, Spain
Pilar Somovilla Microbes in Health and Welfare Program, Centro de Biología Molecular "Severo Ochoa" (CSIC-UAM), Consejo Superior de Investigaciones Científicas, Madrid 28049, Spain Departamento de Biología Molecular, Universidad Autónoma de Madrid, Madrid 28049, Spain
Cristina Ferrer-Orta Structural and Molecular Biology Department, Institut de Biología Molecular de Barcelona, Consejo Superior de Investigaciones Científicas, Barcelona 08028, Spain
Brenda Martínez-González Department of Molecular and Cell Biology, Centro Nacional de Biotecnología, Consejo Superior de Investigaciones Científicas, Madrid 28049, Spain Department of Clinical Microbiology, Instituto de Investigación Sanitaria-Fundación Jiménez Díaz University Hospital, Universidad Autónoma de Madrid, Madrid 28040, Spain
Sergi Vázquez-Monteagudo Structural and Molecular Biology Department, Institut de Biología Molecular de Barcelona, Consejo Superior de Investigaciones Científicas, Barcelona 08028, Spain
Javier Muñoz-Flores Global Management Solutions S.L., Torre Picasso, Madrid 28020, Spain
María Eugenia Soria Microbes in Health and Welfare Program, Centro de Biología Molecular "Severo Ochoa" (CSIC-UAM), Consejo Superior de Investigaciones Científicas, Madrid 28049, Spain Department of Clinical Microbiology, Instituto de Investigación Sanitaria-Fundación Jiménez Díaz University Hospital, Universidad Autónoma de Madrid, Madrid 28040, Spain
Carlos García-Crespo Microbes in Health and Welfare Program, Centro de Biología Molecular "Severo Ochoa" (CSIC-UAM), Consejo Superior de Investigaciones Científicas, Madrid 28049, Spain
Ana Isabel de Ávila Microbes in Health and Welfare Program, Centro de Biología Molecular "Severo Ochoa" (CSIC-UAM), Consejo Superior de Investigaciones Científicas, Madrid 28049, Spain
Antoni Durán-Pastor Department of Molecular and Cell Biology, Centro Nacional de Biotecnología, Consejo Superior de Investigaciones Científicas, Madrid 28049, Spain
Ignacio Gadea Department of Clinical Microbiology, Instituto de Investigación Sanitaria-Fundación Jiménez Díaz University Hospital, Universidad Autónoma de Madrid, Madrid 28040, Spain
Cecilio López-Galíndez Unidad de Virología Molecular, Laboratorio de Referencia e Investigación en retrovirus, Centro Nacional de Microbiología, Instituto de salud Carlos III, Majadahonda 28222, Spain
Federico Moran Departamento de Bioquímica y Biología Molecular, Universidad Complutense de Madrid, Madrid 28040, Spain
Ramon Lorenzo-Redondo Department of Medicine, Division of Infectious Diseases, Northwestern University Feinberg School of Medicine, Center for Pathogen Genomics and Microbial Evolution, Northwestern University Havey Institute for Global Health, Chicago, IL 60611
Nuria Verdaguer Structural and Molecular Biology Department, Institut de Biología Molecular de Barcelona, Consejo Superior de Investigaciones Científicas, Barcelona 08028, Spain
Celia Perales Department of Molecular and Cell Biology, Centro Nacional de Biotecnología, Consejo Superior de Investigaciones Científicas, Madrid 28049, Spain Department of Clinical Microbiology, Instituto de Investigación Sanitaria-Fundación Jiménez Díaz University Hospital, Universidad Autónoma de Madrid, Madrid 28040, Spain
Esteban Domingo Microbes in Health and Welfare Program, Centro de Biología Molecular "Severo Ochoa" (CSIC-UAM), Consejo Superior de Investigaciones Científicas, Madrid 28049, Spain

Collapse

Senghore M, Read H, Oza P, Johnson S, Passarelli-Araujo H, Taylor BP, Ashley S, Grey A, Callendrello A, Lee R, Goddard MR, Lumley T, Hanage WP, Wiles S. Inferring bacterial transmission dynamics using deep sequencing genomic surveillance data. Nat Commun 2023;14:6397. [PMID: 37907520 PMCID: PMC10618251 DOI: 10.1038/s41467-023-42211-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2022] [Accepted: 09/27/2023] [Indexed: 11/02/2023] Open

Affiliation(s)

Madikay Senghore Center for Communicable Disease Dynamics, Department of Epidemiology, Harvard TH Chan School of Public Health, Boston, MA, USA.
Hannah Read Bioluminescent Superbugs Lab, Department of Molecular Medicine and Pathology, University of Auckland, Auckland, New Zealand
Priyali Oza Bioluminescent Superbugs Lab, Department of Molecular Medicine and Pathology, University of Auckland, Auckland, New Zealand
Sarah Johnson Bioluminescent Superbugs Lab, Department of Molecular Medicine and Pathology, University of Auckland, Auckland, New Zealand
Hemanoel Passarelli-Araujo Center for Communicable Disease Dynamics, Department of Epidemiology, Harvard TH Chan School of Public Health, Boston, MA, USA Department of Biochemistry and Immunology, Federal University of Minas Gerais, Minas Gerais, Brazil
Bradford P Taylor Center for Communicable Disease Dynamics, Department of Epidemiology, Harvard TH Chan School of Public Health, Boston, MA, USA
Stephen Ashley Bioluminescent Superbugs Lab, Department of Molecular Medicine and Pathology, University of Auckland, Auckland, New Zealand
Alex Grey Bioluminescent Superbugs Lab, Department of Molecular Medicine and Pathology, University of Auckland, Auckland, New Zealand
Alanna Callendrello Center for Communicable Disease Dynamics, Department of Epidemiology, Harvard TH Chan School of Public Health, Boston, MA, USA
Robyn Lee Center for Communicable Disease Dynamics, Department of Epidemiology, Harvard TH Chan School of Public Health, Boston, MA, USA University of Toronto Dalla Lana School of Public Health, Toronto, ON, Canada
Matthew R Goddard School of Biological Sciences, University of Auckland, Auckland, New Zealand School of Life and Environmental Sciences, University of Lincoln, Lincoln, UK
Thomas Lumley Department of Statistics, University of Auckland, Auckland, New Zealand
William P Hanage Center for Communicable Disease Dynamics, Department of Epidemiology, Harvard TH Chan School of Public Health, Boston, MA, USA
Siouxsie Wiles Bioluminescent Superbugs Lab, Department of Molecular Medicine and Pathology, University of Auckland, Auckland, New Zealand. Te Pūnaha Matatini, Centre of Research Excellence in Complex Systems, Auckland, New Zealand.

Collapse

Juyal A, Hosseini R, Novikov D, Grinshpon M, Zelikovsky A. Reconstruction of Viral Variants via Monte Carlo Clustering. J Comput Biol 2023;30:1009-1018. [PMID: 37695837 PMCID: PMC10518690 DOI: 10.1089/cmb.2023.0154] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/13/2023] Open

Ke Z, Vikalo H. Graph-Based Reconstruction and Analysis of Disease Transmission Networks Using Viral Genomic Data. J Comput Biol 2023. [PMID: 37347892 DOI: 10.1089/cmb.2022.0373] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/24/2023] Open

Abstract

Understanding the patterns of viral disease transmissions helps establish public health policies and aids in controlling and ending a disease outbreak. Classical methods for studying disease transmission dynamics that rely on epidemiological data, such as times of sample collection and duration of exposure intervals, struggle to provide desired insight due to limited informativeness of such data. A more precise characterization of disease transmissions may be acquired from sequencing data that reveal genetic distance between viral genomes in patient samples. Indeed, genetic distance between viral strains present in hosts contains valuable information about transmission history, thus motivating the design of methods that rely on genomic data to reconstruct a directed disease transmission network, detect transmission clusters, and identify significant network nodes (e.g., super-spreaders). In this article, we present a novel end-to-end framework for the analysis of viral transmissions utilizing viral genomic (sequencing) data. The proposed framework groups infected hosts into transmission clusters based on the reconstructed viral strains infecting them; the genetic distance between a pair of hosts is calculated using Earth Mover's Distance, and further used to infer transmission direction between the hosts. To quantify the significance of a host in the transmission network, the importance score is calculated by a graph convolutional autoencoder. The viral transmission network is represented by a directed minimum spanning tree utilizing the Edmond's algorithm modified to incorporate constraints on the importance scores of the hosts. The proposed framework outperforms state-of-the-art techniques for the analysis of viral transmission dynamics in several experiments on semiexperimental as well as experimental data.

Collapse

Johnson PCD, Hägglund S, Näslund K, Meyer G, Taylor G, Orton RJ, Zohari S, Haydon DT, Valarcher JF. Evaluating the potential of whole-genome sequencing for tracing transmission routes in experimental infections and natural outbreaks of bovine respiratory syncytial virus. Vet Res 2022;53:107. [PMID: 36510312 PMCID: PMC9746130 DOI: 10.1186/s13567-022-01127-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2022] [Accepted: 09/09/2022] [Indexed: 12/14/2022] Open

Abstract

Bovine respiratory syncytial virus (BRSV) is a major cause of respiratory disease in cattle. Genomic sequencing can resolve phylogenetic relationships between virus populations, which can be used to infer transmission routes and potentially inform the design of biosecurity measures. Sequencing of short (<2000 nt) segments of the 15 000-nt BRSV genome has revealed geographic and temporal clustering of BRSV populations, but insufficient variation to distinguish viruses collected from herds infected close together in space and time. This study investigated the potential for whole-genome sequencing to reveal sufficient genomic variation for inferring transmission routes between herds. Next-generation sequencing (NGS) data were generated from experimental infections and from natural outbreaks in Jämtland and Uppsala counties in Sweden. Sufficient depth of coverage for analysis of consensus and sub-consensus sequence diversity was obtained from 47 to 20 samples respectively. Few (range: 0-6 polymorphisms across the six experiments) consensus-level polymorphisms were observed along experimental transmissions. A much higher level of diversity (146 polymorphic sites) was found among the consensus sequences from the outbreak samples. The majority (144/146) of polymorphisms were between rather than within counties, suggesting that consensus whole-genome sequences show insufficient spatial resolution for inferring direct transmission routes, but might allow identification of outbreak sources at the regional scale. By contrast, within-sample diversity was generally higher in the experimental than the outbreak samples. Analyses to infer known (experimental) and suspected (outbreak) transmission links from within-sample diversity data were uninformative. In conclusion, analysis of the whole-genome sequence of BRSV from experimental samples discriminated between circulating isolates from distant areas, but insufficient diversity was observed between closely related isolates to aid local transmission route inference.

Collapse

Quasispecies Fitness Partition to Characterize the Molecular Status of a Viral Population. Negative Effect of Early Ribavirin Discontinuation in a Chronically Infected HEV Patient. Int J Mol Sci 2022;23:ijms232314654. [PMID: 36498981 PMCID: PMC9739305 DOI: 10.3390/ijms232314654] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2022] [Revised: 11/11/2022] [Accepted: 11/17/2022] [Indexed: 11/25/2022] Open

Chao E, Chato C, Vender R, Olabode AS, Ferreira RC, Poon AFY. Molecular source attribution. PLoS Comput Biol 2022;18:e1010649. [PMID: 36395093 PMCID: PMC9671344 DOI: 10.1371/journal.pcbi.1010649] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open

Skums P, Mohebbi F, Tsyvina V, Baykal PI, Nemira A, Ramachandran S, Khudyakov Y. SOPHIE: Viral outbreak investigation and transmission history reconstruction in a joint phylogenetic and network theory framework. Cell Syst 2022;13:844-856.e4. [PMID: 36265470 PMCID: PMC9590096 DOI: 10.1016/j.cels.2022.07.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2022] [Revised: 07/05/2022] [Accepted: 07/19/2022] [Indexed: 01/26/2023]

Lundgren E, Romero-Severson E, Albert J, Leitner T. Combining biomarker and virus phylogenetic models improves HIV-1 epidemiological source identification. PLoS Comput Biol 2022;18:e1009741. [PMID: 36026480 PMCID: PMC9455879 DOI: 10.1371/journal.pcbi.1009741] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2021] [Revised: 09/08/2022] [Accepted: 08/02/2022] [Indexed: 01/07/2023] Open

Abstract

To identify and stop active HIV transmission chains new epidemiological techniques are needed. Here, we describe the development of a multi-biomarker augmentation to phylogenetic inference of the underlying transmission history in a local population. HIV biomarkers are measurable biological quantities that have some relationship to the amount of time someone has been infected with HIV. To train our model, we used five biomarkers based on real data from serological assays, HIV sequence data, and target cell counts in longitudinally followed, untreated patients with known infection times. The biomarkers were modeled with a mixed effects framework to allow for patient specific variation and general trends, and fit to patient data using Markov Chain Monte Carlo (MCMC) methods. Subsequently, the density of the unobserved infection time conditional on observed biomarkers were obtained by integrating out the random effects from the model fit. This probabilistic information about infection times was incorporated into the likelihood function for the transmission history and phylogenetic tree reconstruction, informed by the HIV sequence data. To critically test our methodology, we developed a coalescent-based simulation framework that generates phylogenies and biomarkers given a specific or general transmission history. Testing on many epidemiological scenarios showed that biomarker augmented phylogenetics can reach 90% accuracy under idealized situations. Under realistic within-host HIV-1 evolution, involving substantial within-host diversification and frequent transmission of multiple lineages, the average accuracy was at about 50% in transmission clusters involving 5-50 hosts. Realistic biomarker data added on average 16 percentage points over using the phylogeny alone. Using more biomarkers improved the performance. Shorter temporal spacing between transmission events and increased transmission heterogeneity reduced reconstruction accuracy, but larger clusters were not harder to get right. More sequence data per infected host also improved accuracy. We show that the method is robust to incomplete sampling and that adding biomarkers improves reconstructions of real HIV-1 transmission histories. The technology presented here could allow for better prevention programs by providing data for locally informed and tailored strategies.

Collapse

Xi X, Spencer SEF, Hall M, Grabowski MK, Kagaayi J, Ratmann O. Inferring the sources of HIV infection in Africa from deep‐sequence data with semi‐parametric Bayesian Poisson flow models. J R Stat Soc Ser C Appl Stat 2022. [DOI: 10.1111/rssc.12544] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

Guang A, Howison M, Ledingham L, D’Antuono M, Chan PA, Lawrence C, Dunn CW, Kantor R. Incorporating Within-Host Diversity in Phylogenetic Analyses for Detecting Clusters of New HIV Diagnoses. Front Microbiol 2022;12:803190. [PMID: 35250908 PMCID: PMC8891961 DOI: 10.3389/fmicb.2021.803190] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2021] [Accepted: 12/22/2021] [Indexed: 11/29/2022] Open

Xu R, Aranday-Cortes E, Leitch ECM, Hughes J, Singer JB, Sreenu V, Tong L, da Silva Filipe A, Bamford CGG, Rong X, Huang J, Wang M, Fu Y, McLauchlan J. The evolutionary dynamics and epidemiological history of hepatitis C virus genotype 6, including unique strains from the Li community of Hainan Island, China. Virus Evol 2022;8:veac012. [PMID: 35600095 PMCID: PMC9115904 DOI: 10.1093/ve/veac012] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2021] [Revised: 01/17/2022] [Accepted: 02/15/2022] [Indexed: 12/09/2022] Open

Abstract

Hepatitis C virus (HCV) is a highly diverse pathogen that frequently establishes a chronic long-term infection, but the origins and drivers of HCV diversity in the human population remain unclear. Previously unidentified strains of HCV genotype 6 (gt6) were recently discovered in chronically infected individuals of the Li ethnic group living in Baisha County, Hainan Island, China. The Li community, who were early settlers on Hainan Island, has a distinct host genetic background and cultural identity compared to other ethnic groups on the island and mainland China. In this report, we generated 33 whole virus genome sequences to conduct a comprehensive molecular epidemiological analysis of these novel gt6 strains in the context of gt6 isolates present in Southeast Asia. With the exception of one gt6a isolate, the Li gt6 sequences formed three novel clades from two lineages which constituted 3 newly assigned gt6 subtypes and 30 unassigned strains. Using Bayesian inference methods, we dated the most recent common ancestor for all available gt6 whole virus genome sequences to approximately 2767 bce (95 per cent highest posterior density (HPD) intervals, 3670-1397 bce), which is far earlier than previous estimates. The substitution rate was 1.20 × 10-4 substitutions/site/year (s/s/y), and this rate varied across the genome regions, from 1.02 × 10-5 s/s/y in the 5'untranslated region (UTR) region to 3.07 × 10-4 s/s/y in E2. Thus, our study on an isolated ethnic minority group within a small geographical area of Hainan Island has substantially increased the known diversity of HCV gt6, already acknowledged as the most diverse HCV genotype. The extant HCV gt6 sequences from this study were probably transmitted to the Li through at least three independent events dating perhaps from around 4,000 years ago. This analysis describes deeper insight into basic aspects of HCV gt6 molecular evolution including the extensive diversity of gt6 sequences in the isolated Li ethnic group.

Collapse

Affiliation(s)

Ru Xu
Elihu Aranday-Cortes MRC-University of Glasgow Centre for Virus Research, Sir Michael Stoker Building, Garscube Campus, 464 Bearsden Road, Glasgow G61 1QH, UK
E Carol McWilliam Leitch MRC-University of Glasgow Centre for Virus Research, Sir Michael Stoker Building, Garscube Campus, 464 Bearsden Road, Glasgow G61 1QH, UK
Joseph Hughes MRC-University of Glasgow Centre for Virus Research, Sir Michael Stoker Building, Garscube Campus, 464 Bearsden Road, Glasgow G61 1QH, UK
Joshua B Singer MRC-University of Glasgow Centre for Virus Research, Sir Michael Stoker Building, Garscube Campus, 464 Bearsden Road, Glasgow G61 1QH, UK
Vattipally Sreenu MRC-University of Glasgow Centre for Virus Research, Sir Michael Stoker Building, Garscube Campus, 464 Bearsden Road, Glasgow G61 1QH, UK
Lily Tong MRC-University of Glasgow Centre for Virus Research, Sir Michael Stoker Building, Garscube Campus, 464 Bearsden Road, Glasgow G61 1QH, UK
Ana da Silva Filipe MRC-University of Glasgow Centre for Virus Research, Sir Michael Stoker Building, Garscube Campus, 464 Bearsden Road, Glasgow G61 1QH, UK
Connor G G Bamford MRC-University of Glasgow Centre for Virus Research, Sir Michael Stoker Building, Garscube Campus, 464 Bearsden Road, Glasgow G61 1QH, UK
Xia Rong Guangzhou Blood Center, Institute of Clinical Blood Transfusion, Guangzhou Blood Center, 31 LuYuan Road, Guangzhou, Guangdong 510095, P.R. China
Jieting Huang Guangzhou Blood Center, Institute of Clinical Blood Transfusion, Guangzhou Blood Center, 31 LuYuan Road, Guangzhou, Guangdong 510095, P.R. China
Min Wang Guangzhou Blood Center, Institute of Clinical Blood Transfusion, Guangzhou Blood Center, 31 LuYuan Road, Guangzhou, Guangdong 510095, P.R. China
Yongshui Fu Guangzhou Blood Center, Institute of Clinical Blood Transfusion, Guangzhou Blood Center, 31 LuYuan Road, Guangzhou, Guangdong 510095, P.R. China
John McLauchlan MRC-University of Glasgow Centre for Virus Research, Sir Michael Stoker Building, Garscube Campus, 464 Bearsden Road, Glasgow G61 1QH, UK Guangzhou Blood Center, Institute of Clinical Blood Transfusion, Guangzhou Blood Center, 31 LuYuan Road, Guangzhou, Guangdong 510095, P.R. China

Collapse

Gussler JW, Campo DS, Dimitrova Z, Skums P, Khudyakov Y. Primary case inference in viral outbreaks through analysis of intra-host variant population. BMC Bioinformatics 2022;23:62. [PMID: 35135469 PMCID: PMC8822801 DOI: 10.1186/s12859-022-04585-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2020] [Accepted: 01/25/2022] [Indexed: 11/21/2022] Open

Abstract

Background

Investigation of outbreaks to identify the primary case is crucial for the interruption and prevention of transmission of infectious diseases. These individuals may have a higher risk of participating in near future transmission events when compared to the other patients in the outbreak, so directing more transmission prevention resources towards these individuals is a priority. Although the genetic characterization of intra-host viral populations can aid the identification of transmission clusters, it is not trivial to determine the directionality of transmissions during outbreaks, owing to complexity of viral evolution. Here, we present a new computational framework, PYCIVO: primary case inference in viral outbreaks. This framework expands upon our earlier work in development of QUENTIN, which builds a probabilistic disease transmission tree based on simulation of evolution of intra-host hepatitis C virus (HCV) variants between cases involved in direct transmission during an outbreak. PYCIVO improves upon QUENTIN by also adding a custom heterogeneity index and identifying the scenario when the primary case may have not been sampled.

Results

These approaches were validated using a set of 105 sequence samples from 11 distinct HCV transmission clusters identified during outbreak investigations, in which the primary case was epidemiologically verified. Both models can detect the correct primary case in 9 out of 11 transmission clusters (81.8%). However, while QUENTIN issues erroneous predictions on the remaining 2 transmission clusters, PYCIVO issues a null output for these clusters, giving it an effective prediction accuracy of 100%. To further evaluate accuracy of the inference, we created 10 modified transmission clusters in which the primary case had been removed. In this scenario, PYCIVO was able to correctly identify that there was no primary case in 8/10 (80%) of these modified clusters. This model was validated with HCV; however, this approach may be applicable to other microbial pathogens.

Conclusions

PYCIVO improves upon QUENTIN by also implementing a custom heterogeneity index which empowers PYCIVO to make the important ‘No primary case’ prediction. One or more samples, possibly including the primary case, may have not been sampled, and this designation is meant to account for these scenarios.

Collapse

Dhar S, Zhang C, Măndoiu II, Bansal MS. TNet: Transmission Network Inference Using Within-Host Strain Diversity and its Application to Geographical Tracking of COVID-19 Spread. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022;19:230-242. [PMID: 34255632 PMCID: PMC8956368 DOI: 10.1109/tcbb.2021.3096455] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/06/2020] [Revised: 07/03/2021] [Accepted: 07/08/2021] [Indexed: 06/13/2023]

Mäklin T, Kallonen T, Alanko J, Samuelsen Ø, Hegstad K, Mäkinen V, Corander J, Heinz E, Honkela A. Bacterial genomic epidemiology with mixed samples. Microb Genom 2021;7:000691. [PMID: 34779765 PMCID: PMC8743562 DOI: 10.1099/mgen.0.000691] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2021] [Accepted: 09/13/2021] [Indexed: 11/18/2022] Open

Domingo E, García-Crespo C, Perales C. Historical Perspective on the Discovery of the Quasispecies Concept. Annu Rev Virol 2021;8:51-72. [PMID: 34586874 DOI: 10.1146/annurev-virology-091919-105900] [Citation(s) in RCA: 33] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Orlovich Y, Kukharenko K, Kaibel V, Skums P. Scale-Free Spanning Trees and Their Application in Genomic Epidemiology. J Comput Biol 2021;28:945-960. [PMID: 34491104 PMCID: PMC8670573 DOI: 10.1089/cmb.2020.0500] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022] Open

Berry IM, Melendrez MC, Pollett S, Figueroa K, Buddhari D, Klungthong C, Nisalak A, Panciera M, Thaisomboonsuk B, Li T, Vallard TG, Macareo L, Yoon IK, Thomas SJ, Endy T, Jarman RG. Precision Tracing of Household Dengue Spread Using Inter- and Intra-Host Viral Variation Data, Kamphaeng Phet, Thailand. Emerg Infect Dis 2021;27:1637-1644. [PMID: 34013878 PMCID: PMC8153871 DOI: 10.3201/eid2706.204323] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open

Knyazev S, Tsyvina V, Shankar A, Melnyk A, Artyomenko A, Malygina T, Porozov YB, Campbell EM, Switzer WM, Skums P, Mangul S, Zelikovsky A. Accurate assembly of minority viral haplotypes from next-generation sequencing through efficient noise reduction. Nucleic Acids Res 2021;49:e102. [PMID: 34214168 PMCID: PMC8464054 DOI: 10.1093/nar/gkab576] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2020] [Revised: 05/25/2021] [Accepted: 06/18/2021] [Indexed: 12/21/2022] Open

Valesano AL, Rumfelt KE, Dimcheff DE, Blair CN, Fitzsimmons WJ, Petrie JG, Martin ET, Lauring AS. Temporal dynamics of SARS-CoV-2 mutation accumulation within and across infected hosts. PLoS Pathog 2021;17:e1009499. [PMID: 33826681 PMCID: PMC8055005 DOI: 10.1371/journal.ppat.1009499] [Citation(s) in RCA: 72] [Impact Index Per Article: 24.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2021] [Revised: 04/19/2021] [Accepted: 03/24/2021] [Indexed: 01/12/2023] Open

Ramazzotti D, Angaroni F, Maspero D, Gambacorti-Passerini C, Antoniotti M, Graudenzi A, Piazza R. VERSO: A comprehensive framework for the inference of robust phylogenies and the quantification of intra-host genomic diversity of viral samples. PATTERNS (NEW YORK, N.Y.) 2021;2:100212. [PMID: 33728416 PMCID: PMC7953447 DOI: 10.1016/j.patter.2021.100212] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/12/2020] [Revised: 11/30/2020] [Accepted: 01/22/2021] [Indexed: 12/22/2022]

Maljkovic Berry I, Melendrez MC, Bishop-Lilly KA, Rutvisuttinunt W, Pollett S, Talundzic E, Morton L, Jarman RG. Next Generation Sequencing and Bioinformatics Methodologies for Infectious Disease Research and Public Health: Approaches, Applications, and Considerations for Development of Laboratory Capacity. J Infect Dis 2021;221:S292-S307. [PMID: 31612214 DOI: 10.1093/infdis/jiz286] [Citation(s) in RCA: 42] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open

Abstract

Next generation sequencing (NGS) combined with bioinformatics has successfully been used in a vast array of analyses for infectious disease research of public health relevance. For instance, NGS and bioinformatics approaches have been used to identify outbreak origins, track transmissions, investigate epidemic dynamics, determine etiological agents of a disease, and discover novel human pathogens. However, implementation of high-quality NGS and bioinformatics in research and public health laboratories can be challenging. These challenges mainly include the choice of the sequencing platform and the sequencing approach, the choice of bioinformatics methodologies, access to the appropriate computation and information technology infrastructure, and recruiting and retaining personnel with the specialized skills and experience in this field. In this review, we summarize the most common NGS and bioinformatics workflows in the context of infectious disease genomic surveillance and pathogen discovery, and highlight the main challenges and considerations for setting up an NGS and bioinformatics-focused infectious disease research public health laboratory. We describe the most commonly used sequencing platforms and review their strengths and weaknesses. We review sequencing approaches that have been used for various pathogens and study questions, as well as the most common difficulties associated with these approaches that should be considered when implementing in a public health or research setting. In addition, we provide a review of some common bioinformatics tools and procedures used for pathogen discovery and genome assembly, along with the most common challenges and solutions. Finally, we summarize the bioinformatics of advanced viral, bacterial, and parasite pathogen characterization, including types of study questions that can be answered when utilizing NGS and bioinformatics.

Collapse

Valesano AL, Rumfelt KE, Dimcheff DE, Blair CN, Fitzsimmons WJ, Petrie JG, Martin ET, Lauring AS. Temporal dynamics of SARS-CoV-2 mutation accumulation within and across infected hosts. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2021:2021.01.19.427330. [PMID: 33501443 PMCID: PMC7836113 DOI: 10.1101/2021.01.19.427330] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]

Knyazev S, Hughes L, Skums P, Zelikovsky A. Epidemiological data analysis of viral quasispecies in the next-generation sequencing era. Brief Bioinform 2021;22:96-108. [PMID: 32568371 PMCID: PMC8485218 DOI: 10.1093/bib/bbaa101] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2019] [Revised: 04/24/2020] [Accepted: 05/04/2020] [Indexed: 01/04/2023] Open

Basodi S, Baykal PI, Zelikovsky A, Skums P, Pan Y. Analysis of heterogeneous genomic samples using image normalization and machine learning. BMC Genomics 2020;21:405. [PMID: 33349236 PMCID: PMC7751093 DOI: 10.1186/s12864-020-6661-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2020] [Accepted: 03/09/2020] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

Analysis of heterogeneous populations such as viral quasispecies is one of the most challenging bioinformatics problems. Although machine learning models are becoming to be widely employed for analysis of sequence data from such populations, their straightforward application is impeded by multiple challenges associated with technological limitations and biases, difficulty of selection of relevant features and need to compare genomic datasets of different sizes and structures.

RESULTS

We propose a novel preprocessing approach to transform irregular genomic data into normalized image data. Such representation allows to restate the problems of classification and comparison of heterogeneous populations as image classification problems which can be solved using variety of available machine learning tools. We then apply the proposed approach to two important problems in molecular epidemiology: inference of viral infection stage and detection of viral transmission clusters using next-generation sequencing data. The infection staging method has been applied to HCV HVR1 samples collected from 108 recently and 257 chronically infected individuals. The SVM-based image classification approach achieved more than 95% accuracy for both recently and chronically HCV-infected individuals. Clustering has been performed on the data collected from 33 epidemiologically curated outbreaks, yielding more than 97% accuracy.

CONCLUSIONS

Sequence image normalization method allows for a robust conversion of genomic data into numerical data and overcomes several issues associated with employing machine learning methods to viral populations. Image data also help in the visualization of genomic data. Experimental results demonstrate that the proposed method can be successfully applied to different problems in molecular epidemiology and surveillance of viral diseases. Simple binary classifiers and clustering techniques applied to the image data are equally or more accurate than other models.

Collapse

García-Crespo C, Soria ME, Gallego I, de Ávila AI, Martínez-González B, Vázquez-Sirvent L, Gómez J, Briones C, Gregori J, Quer J, Perales C, Domingo E. Dissimilar Conservation Pattern in Hepatitis C Virus Mutant Spectra, Consensus Sequences, and Data Banks. J Clin Med 2020;9:jcm9113450. [PMID: 33121037 PMCID: PMC7692060 DOI: 10.3390/jcm9113450] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2020] [Revised: 10/15/2020] [Accepted: 10/20/2020] [Indexed: 02/07/2023] Open

Affiliation(s)

Carlos García-Crespo Department of Interactions with the environment, Centro de Biología Molecular “Severo Ochoa” (CSIC-UAM), Consejo Superior de Investigaciones Científicas (CSIC), Campus de Cantoblanco, 28049 Madrid, Spain; (C.G.-C.); (M.E.S.); (I.G.); (A.I.d.Á.); (B.M.-G.); (L.V.-S.)
María Eugenia Soria Department of Interactions with the environment, Centro de Biología Molecular “Severo Ochoa” (CSIC-UAM), Consejo Superior de Investigaciones Científicas (CSIC), Campus de Cantoblanco, 28049 Madrid, Spain; (C.G.-C.); (M.E.S.); (I.G.); (A.I.d.Á.); (B.M.-G.); (L.V.-S.) Department of Clinical Microbiology, IIS-Fundación Jiménez Díaz, UAM. Av. Reyes Católicos 2, 28040 Madrid, Spain
Isabel Gallego Department of Interactions with the environment, Centro de Biología Molecular “Severo Ochoa” (CSIC-UAM), Consejo Superior de Investigaciones Científicas (CSIC), Campus de Cantoblanco, 28049 Madrid, Spain; (C.G.-C.); (M.E.S.); (I.G.); (A.I.d.Á.); (B.M.-G.); (L.V.-S.) Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas (CIBERehd) del Instituto de Salud Carlos III, 28029 Madrid, Spain; (J.G.); (C.B.); (J.G.); (J.Q.)
Ana Isabel de Ávila Department of Interactions with the environment, Centro de Biología Molecular “Severo Ochoa” (CSIC-UAM), Consejo Superior de Investigaciones Científicas (CSIC), Campus de Cantoblanco, 28049 Madrid, Spain; (C.G.-C.); (M.E.S.); (I.G.); (A.I.d.Á.); (B.M.-G.); (L.V.-S.)
Brenda Martínez-González Department of Interactions with the environment, Centro de Biología Molecular “Severo Ochoa” (CSIC-UAM), Consejo Superior de Investigaciones Científicas (CSIC), Campus de Cantoblanco, 28049 Madrid, Spain; (C.G.-C.); (M.E.S.); (I.G.); (A.I.d.Á.); (B.M.-G.); (L.V.-S.) Department of Clinical Microbiology, IIS-Fundación Jiménez Díaz, UAM. Av. Reyes Católicos 2, 28040 Madrid, Spain
Lucía Vázquez-Sirvent Department of Interactions with the environment, Centro de Biología Molecular “Severo Ochoa” (CSIC-UAM), Consejo Superior de Investigaciones Científicas (CSIC), Campus de Cantoblanco, 28049 Madrid, Spain; (C.G.-C.); (M.E.S.); (I.G.); (A.I.d.Á.); (B.M.-G.); (L.V.-S.)
Jordi Gómez Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas (CIBERehd) del Instituto de Salud Carlos III, 28029 Madrid, Spain; (J.G.); (C.B.); (J.G.); (J.Q.) Department of Molecular Biology, Instituto de Parasitología y Biomedicina ‘López-Neyra’ (CSIC), Parque Tecnológico Ciencias de la Salud, Armilla, 18016 Granada, Spain
Carlos Briones Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas (CIBERehd) del Instituto de Salud Carlos III, 28029 Madrid, Spain; (J.G.); (C.B.); (J.G.); (J.Q.) Department of Molecular Evolution, Centro de Astrobiología (CAB, CSIC-INTA), Torrejón de Ardoz, 28850 Madrid, Spain
Josep Gregori Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas (CIBERehd) del Instituto de Salud Carlos III, 28029 Madrid, Spain; (J.G.); (C.B.); (J.G.); (J.Q.) Liver Unit, Liver Diseases—Viral Hepatitis, Vall d’Hebron Institut de Recerca (VHIR), Vall d’Hebron Hospital Universitari, Vall d’Hebron Barcelona Hospital Campus, Passeig Vall d’Hebron 119-129, 08035 Barcelona, Spain Roche Diagnostics, S.L., Sant Cugat del Vallés, 08174 Barcelona, Spain
Josep Quer Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas (CIBERehd) del Instituto de Salud Carlos III, 28029 Madrid, Spain; (J.G.); (C.B.); (J.G.); (J.Q.) Liver Unit, Liver Diseases—Viral Hepatitis, Vall d’Hebron Institut de Recerca (VHIR), Vall d’Hebron Hospital Universitari, Vall d’Hebron Barcelona Hospital Campus, Passeig Vall d’Hebron 119-129, 08035 Barcelona, Spain
Celia Perales Department of Interactions with the environment, Centro de Biología Molecular “Severo Ochoa” (CSIC-UAM), Consejo Superior de Investigaciones Científicas (CSIC), Campus de Cantoblanco, 28049 Madrid, Spain; (C.G.-C.); (M.E.S.); (I.G.); (A.I.d.Á.); (B.M.-G.); (L.V.-S.) Department of Clinical Microbiology, IIS-Fundación Jiménez Díaz, UAM. Av. Reyes Católicos 2, 28040 Madrid, Spain Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas (CIBERehd) del Instituto de Salud Carlos III, 28029 Madrid, Spain; (J.G.); (C.B.); (J.G.); (J.Q.) Correspondence: or (C.P.); (E.D.)
Esteban Domingo Department of Interactions with the environment, Centro de Biología Molecular “Severo Ochoa” (CSIC-UAM), Consejo Superior de Investigaciones Científicas (CSIC), Campus de Cantoblanco, 28049 Madrid, Spain; (C.G.-C.); (M.E.S.); (I.G.); (A.I.d.Á.); (B.M.-G.); (L.V.-S.) Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas (CIBERehd) del Instituto de Salud Carlos III, 28029 Madrid, Spain; (J.G.); (C.B.); (J.G.); (J.Q.) Correspondence: or (C.P.); (E.D.)

Collapse

Alamil M, Hughes J, Berthier K, Desbiez C, Thébaud G, Soubeyrand S. Inferring epidemiological links from deep sequencing data: a statistical learning approach for human, animal and plant diseases. Philos Trans R Soc Lond B Biol Sci 2020;374:20180258. [PMID: 31056055 PMCID: PMC6553606 DOI: 10.1098/rstb.2018.0258] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open

Phylogenetics in HIV transmission: taking within-host diversity into account. Curr Opin HIV AIDS 2020;14:181-187. [PMID: 30920395 DOI: 10.1097/coh.0000000000000536] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]

Phylogenetic and Demographic Characterization of Directed HIV-1 Transmission Using Deep Sequences from High-Risk and General Population Cohorts/Groups in Uganda. Viruses 2020;12:v12030331. [PMID: 32197553 PMCID: PMC7150763 DOI: 10.3390/v12030331] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2020] [Revised: 03/13/2020] [Accepted: 03/16/2020] [Indexed: 12/12/2022] Open

de Bernardi Schneider A, Ford CT, Hostager R, Williams J, Cioce M, Çatalyürek ÜV, Wertheim JO, Janies D. StrainHub: a phylogenetic tool to construct pathogen transmission networks. Bioinformatics 2020;36:945-947. [PMID: 31418766 PMCID: PMC8215912 DOI: 10.1093/bioinformatics/btz646] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2019] [Revised: 08/06/2019] [Accepted: 08/14/2019] [Indexed: 01/30/2023] Open

Pérez-Losada M, Arenas M, Galán JC, Bracho MA, Hillung J, García-González N, González-Candelas F. High-throughput sequencing (HTS) for the analysis of viral populations. INFECTION GENETICS AND EVOLUTION 2020;80:104208. [PMID: 32001386 DOI: 10.1016/j.meegid.2020.104208] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/30/2019] [Revised: 01/21/2020] [Accepted: 01/24/2020] [Indexed: 12/12/2022]

Tan MP, Wong LL, Razali SA, Afiqah-Aleng N, Mohd Nor SA, Sung YY, Van de Peer Y, Sorgeloos P, Danish-Daniel M. Applications of Next-Generation Sequencing Technologies and Computational Tools in Molecular Evolution and Aquatic Animals Conservation Studies: A Short Review. Evol Bioinform Online 2019;15:1176934319892284. [PMID: 31839703 PMCID: PMC6896124 DOI: 10.1177/1176934319892284] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2019] [Accepted: 11/12/2019] [Indexed: 12/21/2022] Open

Domingo E, Perales C. Viral quasispecies. PLoS Genet 2019;15:e1008271. [PMID: 31622336 PMCID: PMC6797082 DOI: 10.1371/journal.pgen.1008271] [Citation(s) in RCA: 179] [Impact Index Per Article: 35.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open

Hall MD, Colijn C. Transmission Trees on a Known Pathogen Phylogeny: Enumeration and Sampling. Mol Biol Evol 2019;36:1333-1343. [PMID: 30873529 PMCID: PMC6526902 DOI: 10.1093/molbev/msz058] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open

Ratmann O, Grabowski MK, Hall M, Golubchik T, Wymant C, Abeler-Dörner L, Bonsall D, Hoppe A, Brown AL, de Oliveira T, Gall A, Kellam P, Pillay D, Kagaayi J, Kigozi G, Quinn TC, Wawer MJ, Laeyendecker O, Serwadda D, Gray RH, Fraser C. Inferring HIV-1 transmission networks and sources of epidemic spread in Africa with deep-sequence phylogenetic analysis. Nat Commun 2019;10:1411. [PMID: 30926780 PMCID: PMC6441045 DOI: 10.1038/s41467-019-09139-4] [Citation(s) in RCA: 37] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2018] [Accepted: 02/22/2019] [Indexed: 11/09/2022] Open

Affiliation(s)

Oliver Ratmann Department of Mathematics, Imperial College London, London, SW72AZ, UK. Department of Infectious Disease, Epidemiology School of Public Health, Imperial College London, London, W21PG, UK.
M Kate Grabowski Department of Medicine, Johns Hopkins School of Medicine, Baltimore, MD, 21205-2196, USA Rakai Health Sciences Program, Entebbe, P.O.Box 49, Uganda
Matthew Hall Oxford Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, Nuffield Department of Medicine, Old Road Campus, University of Oxford, Oxford, OX3 7BN, UK
Tanya Golubchik Oxford Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, Nuffield Department of Medicine, Old Road Campus, University of Oxford, Oxford, OX3 7BN, UK
Chris Wymant Department of Infectious Disease, Epidemiology School of Public Health, Imperial College London, London, W21PG, UK Oxford Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, Nuffield Department of Medicine, Old Road Campus, University of Oxford, Oxford, OX3 7BN, UK
Lucie Abeler-Dörner Oxford Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, Nuffield Department of Medicine, Old Road Campus, University of Oxford, Oxford, OX3 7BN, UK
David Bonsall Oxford Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, Nuffield Department of Medicine, Old Road Campus, University of Oxford, Oxford, OX3 7BN, UK
Anne Hoppe Division of Infection and Immunity, University College London, London, WC1E 6BT, UK
Andrew Leigh Brown School of Biological Sciences, University of Edinburgh, Edinburgh, EH9 3FF, UK
Tulio de Oliveira College of Health Sciences, University of KwaZulu-Natal, Durban, 4041, South Africa
Astrid Gall European Molecular Biology Laboratory-European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridgeshire, CB10 1SD, UK
Paul Kellam Department of Medicine, Imperial College London, London, W12 0HS, UK
Deenan Pillay Division of Infection and Immunity, University College London, London, WC1E 6BT, UK Africa Health Research Institute, Private Bag X7, Durban, 4013, South Africa
Joseph Kagaayi Rakai Health Sciences Program, Entebbe, P.O.Box 49, Uganda
Godfrey Kigozi Rakai Health Sciences Program, Entebbe, P.O.Box 49, Uganda
Thomas C Quinn Department of Medicine, Johns Hopkins School of Medicine, Baltimore, MD, 21205-2196, USA Division of Intramural Research, National Institute of Allergy and Infectious Diseases, NIH, Bethesda, MD, 20892-9806, USA
Maria J Wawer Rakai Health Sciences Program, Entebbe, P.O.Box 49, Uganda Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, 21205, USA
Oliver Laeyendecker Department of Medicine, Johns Hopkins School of Medicine, Baltimore, MD, 21205-2196, USA Division of Intramural Research, National Institute of Allergy and Infectious Diseases, NIH, Bethesda, MD, 20892-9806, USA
David Serwadda Rakai Health Sciences Program, Entebbe, P.O.Box 49, Uganda Makerere University School of Public Health, Kampala, 8HQG+3V, Uganda
Ronald H Gray Department of Medicine, Johns Hopkins School of Medicine, Baltimore, MD, 21205-2196, USA Rakai Health Sciences Program, Entebbe, P.O.Box 49, Uganda Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, 21205, USA
Christophe Fraser Oxford Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, Nuffield Department of Medicine, Old Road Campus, University of Oxford, Oxford, OX3 7BN, UK

Collapse

Tsyvina V, Campo DS, Sims S, Zelikovsky A, Khudyakov Y, Skums P. Fast estimation of genetic relatedness between members of heterogeneous populations of closely related genomic variants. BMC Bioinformatics 2018;19:360. [PMID: 30343669 PMCID: PMC6196405 DOI: 10.1186/s12859-018-2333-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023] Open

De Maio N, Worby CJ, Wilson DJ, Stoesser N. Bayesian reconstruction of transmission within outbreaks using genomic variants. PLoS Comput Biol 2018;14:e1006117. [PMID: 29668677 PMCID: PMC5927459 DOI: 10.1371/journal.pcbi.1006117] [Citation(s) in RCA: 45] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2017] [Revised: 04/30/2018] [Accepted: 04/03/2018] [Indexed: 01/19/2023] Open

Abstract

Pathogen genome sequencing can reveal details of transmission histories and is a powerful tool in the fight against infectious disease. In particular, within-host pathogen genomic variants identified through heterozygous nucleotide base calls are a potential source of information to identify linked cases and infer direction and time of transmission. However, using such data effectively to model disease transmission presents a number of challenges, including differentiating genuine variants from those observed due to sequencing error, as well as the specification of a realistic model for within-host pathogen population dynamics. Here we propose a new Bayesian approach to transmission inference, BadTrIP (BAyesian epiDemiological TRansmission Inference from Polymorphisms), that explicitly models evolution of pathogen populations in an outbreak, transmission (including transmission bottlenecks), and sequencing error. BadTrIP enables the inference of host-to-host transmission from pathogen sequencing data and epidemiological data. By assuming that genomic variants are unlinked, our method does not require the computationally intensive and unreliable reconstruction of individual haplotypes. Using simulations we show that BadTrIP is robust in most scenarios and can accurately infer transmission events by efficiently combining information from genetic and epidemiological sources; thanks to its realistic model of pathogen evolution and the inclusion of epidemiological data, BadTrIP is also more accurate than existing approaches. BadTrIP is distributed as an open source package (https://bitbucket.org/nicofmay/badtrip) for the phylogenetic software BEAST2. We apply our method to reconstruct transmission history at the early stages of the 2014 Ebola outbreak, showcasing the power of within-host genomic variants to reconstruct transmission events.

We present a new tool to reconstruct transmission events within outbreaks. Our approach makes use of pathogen genetic information, notably genetic variants at low frequency within host that are usually discarded, and combines it with epidemiological information of host exposure to infection. This leads to accurate reconstruction of transmission even in cases where abundant within-host pathogen genetic variation and weak transmission bottlenecks (multiple pathogen units colonising a new host at transmission) would otherwise make inference difficult due to the transmission history differing from the pathogen evolution history inferred from pathogen isolets. Also, the use of within-host pathogen genomic variants increases the resolution of the reconstruction of the transmission tree even in scenarios with limited within-outbreak pathogen genetic diversity: within-host pathogen populations that appear identical at the level of consensus sequences can be discriminated using within-host variants. Our Bayesian approach provides a measure of the confidence in different possible transmission histories, and is published as open source software. We show with simulations and with an analysis of the beginning of the 2014 Ebola outbreak that our approach is applicable in many scenarios, improves our understanding of transmission dynamics, and will contribute to finding and limiting sources and routes of transmission, and therefore preventing the spread of infectious disease.

Collapse