Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Parag KV, Pybus OG. Robust Design for Coalescent Model Inference. Syst Biol 2019;68:730-743. [PMID: 30726979 DOI: 10.1093/sysbio/syz008] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2018] [Revised: 01/28/2019] [Accepted: 02/04/2019] [Indexed: 11/08/2023] Open

For:	Parag KV, Pybus OG. Robust Design for Coalescent Model Inference. Syst Biol 2019;68:730-743. [PMID: 30726979 DOI: 10.1093/sysbio/syz008] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2018] [Revised: 01/28/2019] [Accepted: 02/04/2019] [Indexed: 11/08/2023] Open

Number

Cited by Other Article(s)

Stammnitz MR, Gori K, Murchison EP. No evidence that a transmissible cancer has shifted from emergence to endemism in Tasmanian devils. ROYAL SOCIETY OPEN SCIENCE 2024;11:231875. [PMID: 38633353 PMCID: PMC11022658 DOI: 10.1098/rsos.231875] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/05/2023] [Revised: 03/01/2024] [Accepted: 03/04/2024] [Indexed: 04/19/2024]

Parag KV, Obolski U. Risk averse reproduction numbers improve resurgence detection. PLoS Comput Biol 2023;19:e1011332. [PMID: 37471464 PMCID: PMC10393178 DOI: 10.1371/journal.pcbi.1011332] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2022] [Accepted: 07/06/2023] [Indexed: 07/22/2023] Open

Cappello L, Kim J, Palacios JA. adaPop: Bayesian inference of dependent population dynamics in coalescent models. PLoS Comput Biol 2023;19:e1010897. [PMID: 36940209 PMCID: PMC10063170 DOI: 10.1371/journal.pcbi.1010897] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2022] [Revised: 03/30/2023] [Accepted: 01/25/2023] [Indexed: 03/21/2023] Open

Upadhya G, Steinrücken M. Robust inference of population size histories from genomic sequencing data. PLoS Comput Biol 2022;18:e1010419. [PMID: 36112715 PMCID: PMC9518926 DOI: 10.1371/journal.pcbi.1010419] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2021] [Revised: 09/28/2022] [Accepted: 07/21/2022] [Indexed: 02/08/2023] Open

Abstract

Unraveling the complex demographic histories of natural populations is a central problem in population genetics. Understanding past demographic events is of general anthropological interest, but is also an important step in establishing accurate null models when identifying adaptive or disease-associated genetic variation. An important class of tools for inferring past population size changes from genomic sequence data are Coalescent Hidden Markov Models (CHMMs). These models make efficient use of the linkage information in population genomic datasets by using the local genealogies relating sampled individuals as latent states that evolve along the chromosome in an HMM framework. Extending these models to large sample sizes is challenging, since the number of possible latent states increases rapidly. Here, we present our method CHIMP (CHMM History-Inference Maximum-Likelihood Procedure), a novel CHMM method for inferring the size history of a population. It can be applied to large samples (hundreds of haplotypes) and only requires unphased genomes as input. The two implementations of CHIMP that we present here use either the height of the genealogical tree (TMRCA) or the total branch length, respectively, as the latent variable at each position in the genome. The requisite transition and emission probabilities are obtained by numerically solving certain systems of differential equations derived from the ancestral process with recombination. The parameters of the population size history are subsequently inferred using an Expectation-Maximization algorithm. In addition, we implement a composite likelihood scheme to allow the method to scale to large sample sizes. We demonstrate the efficiency and accuracy of our method in a variety of benchmark tests using simulated data and present comparisons to other state-of-the-art methods. Specifically, our implementation using TMRCA as the latent variable shows comparable performance and provides accurate estimates of effective population sizes in intermediate and ancient times. Our method is agnostic to the phasing of the data, which makes it a promising alternative in scenarios where high quality data is not available, and has potential applications for pseudo-haploid data.

Collapse

Parag KV, Donnelly CA, Zarebski AE. Quantifying the information in noisy epidemic curves. NATURE COMPUTATIONAL SCIENCE 2022;2:584-594. [PMID: 38177483 DOI: 10.1038/s43588-022-00313-1] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/01/2022] [Accepted: 08/08/2022] [Indexed: 01/06/2024]

Bouckaert RR. An Efficient Coalescent Epoch Model for Bayesian Phylogenetic Inference. Syst Biol 2022;71:1549-1560. [PMID: 35212733 PMCID: PMC9773037 DOI: 10.1093/sysbio/syac015] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2021] [Revised: 01/24/2022] [Accepted: 02/22/2022] [Indexed: 12/25/2022] Open

Cappello L, Palacios JA. Adaptive Preferential Sampling in Phylodynamics With an Application to SARS-CoV-2. J Comput Graph Stat 2021;31:541-552. [PMID: 36035966 PMCID: PMC9409340 DOI: 10.1080/10618600.2021.1987256] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

Louca S, McLaughlin A, MacPherson A, Joy JB, Pennell MW. Fundamental Identifiability Limits in Molecular Epidemiology. Mol Biol Evol 2021;38:4010-4024. [PMID: 34009339 PMCID: PMC8382926 DOI: 10.1093/molbev/msab149] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open

Parag KV, Pybus OG, Wu CH. Are Skyline Plot-Based Demographic Estimates Overly Dependent on Smoothing Prior Assumptions? Syst Biol 2021;71:121-138. [PMID: 33989428 PMCID: PMC8677568 DOI: 10.1093/sysbio/syab037] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2020] [Revised: 05/07/2021] [Accepted: 05/08/2021] [Indexed: 11/13/2022] Open

Parag KV, Donnelly CA. Adaptive Estimation for Epidemic Renewal and Phylogenetic Skyline Models. Syst Biol 2020;69:1163-1179. [PMID: 32333789 PMCID: PMC7584150 DOI: 10.1093/sysbio/syaa035] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2019] [Revised: 04/14/2020] [Accepted: 04/16/2020] [Indexed: 11/12/2022] Open

Abstract

Estimating temporal changes in a target population from phylogenetic or count data is an important problem in ecology and epidemiology. Reliable estimates can provide key insights into the climatic and biological drivers influencing the diversity or structure of that population and evidence hypotheses concerning its future growth or decline. In infectious disease applications, the individuals infected across an epidemic form the target population. The renewal model estimates the effective reproduction number, R, of the epidemic from counts of observed incident cases. The skyline model infers the effective population size, N, underlying a phylogeny of sequences sampled from that epidemic. Practically, R measures ongoing epidemic growth while N informs on historical caseload. While both models solve distinct problems, the reliability of their estimates depends on p-dimensional piecewise-constant functions. If p is misspecified, the model might underfit significant changes or overfit noise and promote a spurious understanding of the epidemic, which might misguide intervention policies or misinform forecasts. Surprisingly, no transparent yet principled approach for optimizing p exists. Usually, p is heuristically set, or obscurely controlled via complex algorithms. We present a computable and interpretable p-selection method based on the minimum description length (MDL) formalism of information theory. Unlike many standard model selection techniques, MDL accounts for the additional statistical complexity induced by how parameters interact. As a result, our method optimizes p so that R and N estimates properly and meaningfully adapt to available data. It also outperforms comparable Akaike and Bayesian information criteria on several classification problems, given minimal knowledge of the parameter space, and exposes statistical similarities among renewal, skyline, and other models in biology. Rigorous and interpretable model selection is necessary if trustworthy and justifiable conclusions are to be drawn from piecewise models. [Coalescent processes; epidemiology; information theory; model selection; phylodynamics; renewal models; skyline plots].

Collapse

Parag KV, du Plessis L, Pybus OG. Jointly Inferring the Dynamics of Population Size and Sampling Intensity from Molecular Sequences. Mol Biol Evol 2020;37:2414-2429. [PMID: 32003829 PMCID: PMC7403618 DOI: 10.1093/molbev/msaa016] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023] Open

Huang J, Flouri T, Yang Z. A Simulation Study to Examine the Information Content in Phylogenomic Data Sets under the Multispecies Coalescent Model. Mol Biol Evol 2020;37:3211-3224. [DOI: 10.1093/molbev/msaa166] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open

Parag KV, Donnelly CA. Using information theory to optimise epidemic models for real-time prediction and estimation. PLoS Comput Biol 2020;16:e1007990. [PMID: 32609732 PMCID: PMC7360089 DOI: 10.1371/journal.pcbi.1007990] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2020] [Revised: 07/14/2020] [Accepted: 05/27/2020] [Indexed: 01/31/2023] Open

Abstract

The effective reproduction number, R_t, is a key time-varying prognostic for the growth rate of any infectious disease epidemic. Significant changes in R_t can forewarn about new transmissions within a population or predict the efficacy of interventions. Inferring R_t reliably and in real-time from observed time-series of infected (demographic) data is an important problem in population dynamics. The renewal or branching process model is a popular solution that has been applied to Ebola and Zika virus disease outbreaks, among others, and is currently being used to investigate the ongoing COVID-19 pandemic. This model estimates R_t using a heuristically chosen piecewise function. While this facilitates real-time detection of statistically significant R_t changes, inference is highly sensitive to the function choice. Improperly chosen piecewise models might ignore meaningful changes or over-interpret noise-induced ones, yet produce visually reasonable estimates. No principled piecewise selection scheme exists. We develop a practical yet rigorous scheme using the accumulated prediction error (APE) metric from information theory, which deems the model capable of describing the observed data using the fewest bits as most justified. We derive exact posterior prediction distributions for infected population size and integrate these within an APE framework to obtain an exact and reliable method for identifying the piecewise function best supported by available epidemic data. We find that this choice optimises short-term prediction accuracy and can rapidly detect salient fluctuations in R_t, and hence the infected population growth rate, in real-time over the course of an unfolding epidemic. Moreover, we emphasise the need for formal selection by exposing how common heuristic choices, which seem sensible, can be misleading. Our APE-based method is easily computed and broadly applicable to statistically similar models found in phylogenetics and macroevolution, for example. Our results explore the relationships among estimate precision, forecast reliability and model complexity.

Understanding how the population of infected individuals (which may be humans, animals or plants) fluctuates in size over the course of an epidemic is an important problem in epidemiology and ecology. The effective reproduction number, R, provides an intuitive and useful way of describing these fluctuations by characterising the growth rate of the infected population. An R > 1 signifies a burgeoning epidemic whereas R < 1 indicates a declining one. Public health agencies often use R to inform or corroborate vaccination and quarantine policies. However, popular approaches to inferring R from epidemic data make heuristic choices, which may lead to visually reasonable estimates that are deceptive or unreliable. By adapting mathematical tools from information theory, we develop a general and principled scheme for estimating R in a data-justified way. Our method exposes the pitfalls of heuristic estimates and provides an easily computable correction that also maximises our ability to predict upcoming population fluctuations. Our work is widely applicable to similar inference problems found in evolution and genetics, demonstrably useful for reliably analysing emerging epidemics in real time and highlights how abstract mathematical concepts can inspire novel and practical biological solutions, showcasing the importance of multidisciplinary research.

Collapse

Sellinger TPP, Abu Awad D, Moest M, Tellier A. Inference of past demography, dormancy and self-fertilization rates from whole genome sequence data. PLoS Genet 2020;16:e1008698. [PMID: 32251472 PMCID: PMC7173940 DOI: 10.1371/journal.pgen.1008698] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2019] [Revised: 04/21/2020] [Accepted: 02/24/2020] [Indexed: 02/04/2023] Open

Wang X, Maher KH, Zhang N, Que P, Zheng C, Liu S, Wang B, Huang Q, Chen D, Yang X, Zhang Z, Székely T, Urrutia AO, Liu Y. Demographic Histories and Genome-Wide Patterns of Divergence in Incipient Species of Shorebirds. Front Genet 2019;10:919. [PMID: 31781152 PMCID: PMC6857203 DOI: 10.3389/fgene.2019.00919] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2019] [Accepted: 08/30/2019] [Indexed: 12/30/2022] Open

Affiliation(s)

Xuejing Wang State Key Laboratory of Biocontrol, Department of Ecology, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
Kathryn H. Maher Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, United Kingdom Department of Animal and Plant Sciences, University of Sheffield, Sheffield, United Kingdom
Nan Zhang State Key Laboratory of Biocontrol, Department of Ecology, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
Pinjia Que Ministry of Education Key Laboratory for Biodiversity and Ecological Engineering, College of Life Sciences, Beijing Normal University, Beijing, China
Chenqing Zheng State Key Laboratory of Biocontrol, Department of Ecology, School of Life Sciences, Sun Yat-sen University, Guangzhou, China Department of Bioinformatics, Shenzhen Realomics Biological Technology Ltd, Shenzhen, China
Simin Liu State Key Laboratory of Biocontrol, Department of Ecology, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
Biao Wang School of Biosciences, University of Melbourne, Parkville, VIC, Australia
Qin Huang State Key Laboratory of Biocontrol, Department of Ecology, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
De Chen Ministry of Education Key Laboratory for Biodiversity and Ecological Engineering, College of Life Sciences, Beijing Normal University, Beijing, China
Xu Yang Department of Bioinformatics, Shenzhen Realomics Biological Technology Ltd, Shenzhen, China
Zhengwang Zhang Ministry of Education Key Laboratory for Biodiversity and Ecological Engineering, College of Life Sciences, Beijing Normal University, Beijing, China
Tamás Székely State Key Laboratory of Biocontrol, Department of Ecology, School of Life Sciences, Sun Yat-sen University, Guangzhou, China Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, United Kingdom Ministry of Education Key Laboratory for Biodiversity and Ecological Engineering, College of Life Sciences, Beijing Normal University, Beijing, China
Araxi O. Urrutia Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, United Kingdom Instituto de Ecología, Universidad Nacional Autónoma de México, Ciudad de México, Mexico
Yang Liu State Key Laboratory of Biocontrol, Department of Ecology, School of Life Sciences, Sun Yat-sen University, Guangzhou, China

Collapse