Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Lintusaari J, Gutmann MU, Dutta R, Kaski S, Corander J. Fundamentals and Recent Developments in Approximate Bayesian Computation. Syst Biol 2018;66:e66-e82. [PMID: 28175922 PMCID: PMC5837704 DOI: 10.1093/sysbio/syw077] [Citation(s) in RCA: 45] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2015] [Revised: 08/09/2016] [Accepted: 08/09/2016] [Indexed: 12/16/2022] Open

For:	Lintusaari J, Gutmann MU, Dutta R, Kaski S, Corander J. Fundamentals and Recent Developments in Approximate Bayesian Computation. Syst Biol 2018;66:e66-e82. [PMID: 28175922 PMCID: PMC5837704 DOI: 10.1093/sysbio/syw077] [Citation(s) in RCA: 45] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2015] [Revised: 08/09/2016] [Accepted: 08/09/2016] [Indexed: 12/16/2022] Open

Number

Cited by Other Article(s)

Rmus M, Pan TF, Xia L, Collins AGE. Artificial neural networks for model identification and parameter estimation in computational cognitive models. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.09.14.557793. [PMID: 37767088 PMCID: PMC10521012 DOI: 10.1101/2023.09.14.557793] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 09/29/2023]

Krüger M, Mishra A, Spichtinger P, Pöschl U, Berkemeier T. A numerical compass for experiment design in chemical kinetics and molecular property estimation. J Cheminform 2024;16:34. [PMID: 38520014 PMCID: PMC10960421 DOI: 10.1186/s13321-024-00825-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2023] [Accepted: 03/10/2024] [Indexed: 03/25/2024] Open

Luo M, Zhu J, Jia J, Zhang H, Zhao J. Progress on network modeling and analysis of gut microecology: a review. Appl Environ Microbiol 2024;90:e0009224. [PMID: 38415584 PMCID: PMC11207142 DOI: 10.1128/aem.00092-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/29/2024] Open

Valentin S, Kleinegesse S, Bramley NR, Seriès P, Gutmann MU, Lucas CG. Designing optimal behavioral experiments using machine learning. eLife 2024;13:e86224. [PMID: 38261382 PMCID: PMC10805374 DOI: 10.7554/elife.86224] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2023] [Accepted: 11/19/2023] [Indexed: 01/24/2024] Open

Abstract

Computational models are powerful tools for understanding human cognition and behavior. They let us express our theories clearly and precisely and offer predictions that can be subtle and often counter-intuitive. However, this same richness and ability to surprise means our scientific intuitions and traditional tools are ill-suited to designing experiments to test and compare these models. To avoid these pitfalls and realize the full potential of computational modeling, we require tools to design experiments that provide clear answers about what models explain human behavior and the auxiliary assumptions those models must make. Bayesian optimal experimental design (BOED) formalizes the search for optimal experimental designs by identifying experiments that are expected to yield informative data. In this work, we provide a tutorial on leveraging recent advances in BOED and machine learning to find optimal experiments for any kind of model that we can simulate data from, and show how by-products of this procedure allow for quick and straightforward evaluation of models and their parameters against real experimental data. As a case study, we consider theories of how people balance exploration and exploitation in multi-armed bandit decision-making tasks. We validate the presented approach using simulations and a real-world experiment. As compared to experimental designs commonly used in the literature, we show that our optimal designs more efficiently determine which of a set of models best account for individual human behavior, and more efficiently characterize behavior given a preferred model. At the same time, formalizing a scientific question such that it can be adequately addressed with BOED can be challenging and we discuss several potential caveats and pitfalls that practitioners should be aware of. We provide code to replicate all analyses as well as tutorial notebooks and pointers to adapt the methodology to different experimental settings.

Collapse

Hung KL, Jones MG, Wong ITL, Lange JT, Luebeck J, Scanu E, He BJ, Brückner L, Li R, González RC, Schmargon R, Dörr JR, Belk JA, Bafna V, Werner B, Huang W, Henssen AG, Mischel PS, Chang HY. Coordinated inheritance of extrachromosomal DNA species in human cancer cells. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.07.18.549597. [PMID: 37503111 PMCID: PMC10371175 DOI: 10.1101/2023.07.18.549597] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/29/2023]

Affiliation(s)

King L. Hung Center for Personal Dynamic Regulomes, Stanford University, Stanford, CA 94305, USA
Matthew G. Jones Center for Personal Dynamic Regulomes, Stanford University, Stanford, CA 94305, USA
Ivy Tsz-Lo Wong Sarafan ChEM-H, Stanford University, Stanford, CA, USA Department of Pathology, Stanford University, Stanford, CA, USA
Joshua T. Lange Sarafan ChEM-H, Stanford University, Stanford, CA, USA Department of Pathology, Stanford University, Stanford, CA, USA
Jens Luebeck Department of Computer Science and Engineering, University of California at San Diego, La Jolla, CA, 92093, USA
Elisa Scanu Department of Mathematics, Queen Mary University of London, London, UK
Britney Jiayu He Center for Personal Dynamic Regulomes, Stanford University, Stanford, CA 94305, USA
Lotte Brückner Max-Delbrück-Centrum für Molekulare Medizin (BIMSB/BIH), Berlin, Germany Experimental and Clinical Research Center (ECRC), Max Delbrück Center for Molecular Medicine and Charité—Universitätsmedizin Berlin, Lindenberger Weg 80, 13125, Berlin, Germany
Rui Li Center for Personal Dynamic Regulomes, Stanford University, Stanford, CA 94305, USA
Rocío Chamorro González Experimental and Clinical Research Center (ECRC), Max Delbrück Center for Molecular Medicine and Charité—Universitätsmedizin Berlin, Lindenberger Weg 80, 13125, Berlin, Germany Department of Pediatric Oncology/Hematology, Charité—Universitätsmedizin Berlin, Augustenburger Platz 1, 13353, Berlin, Germany
Rachel Schmargon Experimental and Clinical Research Center (ECRC), Max Delbrück Center for Molecular Medicine and Charité—Universitätsmedizin Berlin, Lindenberger Weg 80, 13125, Berlin, Germany Department of Pediatric Oncology/Hematology, Charité—Universitätsmedizin Berlin, Augustenburger Platz 1, 13353, Berlin, Germany
Jan R. Dörr Experimental and Clinical Research Center (ECRC), Max Delbrück Center for Molecular Medicine and Charité—Universitätsmedizin Berlin, Lindenberger Weg 80, 13125, Berlin, Germany Department of Pediatric Oncology/Hematology, Charité—Universitätsmedizin Berlin, Augustenburger Platz 1, 13353, Berlin, Germany
Julia A. Belk Center for Personal Dynamic Regulomes, Stanford University, Stanford, CA 94305, USA
Vineet Bafna Department of Computer Science and Engineering, University of California at San Diego, La Jolla, CA, 92093, USA
Benjamin Werner Evolutionary Dynamics Group, Centre for Cancer Genomics and Computational Biology, Barts Cancer Institute, Queen Mary University of London, London, UK
Weini Huang Department of Mathematics, Queen Mary University of London, London, UK Group of Theoretical Biology, The State Key Laboratory of Biocontrol, School of Life Science, Sun Yat-sen University, Guangzhou, China
Anton G. Henssen Experimental and Clinical Research Center (ECRC), Max Delbrück Center for Molecular Medicine and Charité—Universitätsmedizin Berlin, Lindenberger Weg 80, 13125, Berlin, Germany Department of Pediatric Oncology/Hematology, Charité—Universitätsmedizin Berlin, Augustenburger Platz 1, 13353, Berlin, Germany German Cancer Consortium (DKTK), partner site Berlin, and German Cancer Research Center DKFZ, Im Neuenheimer Feld 280, 69120, Heidelberg, Germany Berlin Institute of Health, Anna-Louisa-Karsch-Str. 2, 10178, Berlin, Germany
Paul S. Mischel Sarafan ChEM-H, Stanford University, Stanford, CA, USA Department of Pathology, Stanford University, Stanford, CA, USA
Howard Y. Chang Center for Personal Dynamic Regulomes, Stanford University, Stanford, CA 94305, USA Department of Genetics, Stanford University, Stanford, CA, USA Howard Hughes Medical Institute, Stanford University School of Medicine, Stanford, CA 94305, USA

Collapse

Gilabert A, Rieux A, Robert S, Vitalis R, Zapater M, Abadie C, Carlier J, Ravigné V. Revisiting the historical scenario of a disease dissemination using genetic data and Approximate Bayesian Computation methodology: The case of Pseudocercospora fijiensis invasion in Africa. Ecol Evol 2023;13:e10013. [PMID: 37091563 PMCID: PMC10116021 DOI: 10.1002/ece3.10013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2021] [Revised: 03/17/2023] [Accepted: 03/29/2023] [Indexed: 04/25/2023] Open

Järvenpää M, Corander J. On predictive inference for intractable models via approximate Bayesian computation. STATISTICS AND COMPUTING 2023;33:42. [PMID: 36785730 PMCID: PMC9911513 DOI: 10.1007/s11222-022-10163-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/23/2022] [Accepted: 10/02/2022] [Indexed: 06/18/2023]

Martin GM, Frazier DT, Robert CP. Approximating Bayes in the 21st Century. Stat Sci 2023. [DOI: 10.1214/22-sts875] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/25/2023]

Dekermanjian JP, Shaddox E, Nandy D, Ghosh D, Kechris K. Mechanism-aware imputation: a two-step approach in handling missing values in metabolomics. BMC Bioinformatics 2022;23:179. [PMID: 35578165 PMCID: PMC9109373 DOI: 10.1186/s12859-022-04659-1] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2021] [Accepted: 03/23/2022] [Indexed: 11/19/2022] Open

Abstract

When analyzing large datasets from high-throughput technologies, researchers often encounter missing quantitative measurements, which are particularly frequent in metabolomics datasets. Metabolomics, the comprehensive profiling of metabolite abundances, are typically measured using mass spectrometry technologies that often introduce missingness via multiple mechanisms: (1) the metabolite signal may be smaller than the instrument limit of detection; (2) the conditions under which the data are collected and processed may lead to missing values; (3) missing values can be introduced randomly. Missingness resulting from mechanism (1) would be classified as Missing Not At Random (MNAR), that from mechanism (2) would be Missing At Random (MAR), and that from mechanism (3) would be classified as Missing Completely At Random (MCAR). Two common approaches for handling missing data are the following: (1) omit missing data from the analysis; (2) impute the missing values. Both approaches may introduce bias and reduce statistical power in downstream analyses such as testing metabolite associations with clinical variables. Further, standard imputation methods in metabolomics often ignore the mechanisms causing missingness and inaccurately estimate missing values within a data set. We propose a mechanism-aware imputation algorithm that leverages a two-step approach in imputing missing values. First, we use a random forest classifier to classify the missing mechanism for each missing value in the data set. Second, we impute each missing value using imputation algorithms that are specific to the predicted missingness mechanism (i.e., MAR/MCAR or MNAR). Using complete data, we conducted simulations, where we imposed different missingness patterns within the data and tested the performance of combinations of imputation algorithms. Our proposed algorithm provided imputations closer to the original data than those using only one imputation algorithm for all the missing values. Consequently, our two-step approach was able to reduce bias for improved downstream analyses.

Collapse

Cappello L, Kim J, Liu S, Palacios JA. Statistical Challenges in Tracking the Evolution of SARS-CoV-2. Stat Sci 2022;37:162-182. [PMID: 36034090 PMCID: PMC9409356 DOI: 10.1214/22-sts853] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

Approximate Bayesian computation using asymptotically normal point estimates. Comput Stat 2022. [DOI: 10.1007/s00180-022-01226-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]

Raynal L, Chen S, Mira A, Onnela JP. Scalable Approximate Bayesian Computation for Growing Network Models via Extrapolated and Sampled Summaries. BAYESIAN ANALYSIS 2022;17:165-192. [PMID: 36213769 PMCID: PMC9541316 DOI: 10.1214/20-ba1248] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]

Abstract

Approximate Bayesian computation (ABC) is a simulation-based likelihood-free method applicable to both model selection and parameter estimation. ABC parameter estimation requires the ability to forward simulate datasets from a candidate model, but because the sizes of the observed and simulated datasets usually need to match, this can be computationally expensive. Additionally, since ABC inference is based on comparisons of summary statistics computed on the observed and simulated data, using computationally expensive summary statistics can lead to further losses in efficiency. ABC has recently been applied to the family of mechanistic network models, an area that has traditionally lacked tools for inference and model choice. Mechanistic models of network growth repeatedly add nodes to a network until it reaches the size of the observed network, which may be of the order of millions of nodes. With ABC, this process can quickly become computationally prohibitive due to the resource intensive nature of network simulations and evaluation of summary statistics. We propose two methodological developments to enable the use of ABC for inference in models for large growing networks. First, to save time needed for forward simulating model realizations, we propose a procedure to extrapolate (via both least squares and Gaussian processes) summary statistics from small to large networks. Second, to reduce computation time for evaluating summary statistics, we use sample-based rather than census-based summary statistics. We show that the ABC posterior obtained through this approach, which adds two additional layers of approximation to the standard ABC, is similar to a classic ABC posterior. Although we deal with growing network models, both extrapolated summaries and sampled summaries are expected to be relevant in other ABC settings where the data are generated incrementally.

Collapse

Dutta R, Zouaoui Boudjeltia K, Kotsalos C, Rousseau A, Ribeiro de Sousa D, Desmet JM, Van Meerhaeghe A, Mira A, Chopard B. Personalized pathology test for Cardio-vascular disease: Approximate Bayesian computation with discriminative summary statistics learning. PLoS Comput Biol 2022;18:e1009910. [PMID: 35271585 PMCID: PMC8939803 DOI: 10.1371/journal.pcbi.1009910] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2021] [Revised: 03/22/2022] [Accepted: 02/09/2022] [Indexed: 11/19/2022] Open

Pray IW, Pizzitutti F, Bonnet G, Gonzales-Gustavson E, Wakeland W, Pan WK, Lambert WE, Gonzalez AE, Garcia HH, O’Neal SE. Validation of a spatial agent-based model for Taenia solium transmission ("CystiAgent") against a large prospective trial of control strategies in northern Peru. PLoS Negl Trop Dis 2021;15:e0009885. [PMID: 34705827 PMCID: PMC8575314 DOI: 10.1371/journal.pntd.0009885] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2021] [Revised: 11/08/2021] [Accepted: 10/08/2021] [Indexed: 11/19/2022] Open

Abstract

BACKGROUND

The pork tapeworm (Taenia solium) is a parasitic helminth that imposes a major health and economic burden on poor rural populations around the world. As recognized by the World Health Organization, a key barrier for achieving control of T. solium is the lack of an accurate and validated simulation model with which to study transmission and evaluate available control and elimination strategies. CystiAgent is a spatially-explicit agent based model for T. solium that is unique among T. solium models in its ability to represent key spatial and environmental features of transmission and simulate spatially targeted interventions, such as ring strategy.

METHODS/PRINCIPAL FINDINGS

We validated CystiAgent against results from the Ring Strategy Trial (RST)-a large cluster-randomized trial conducted in northern Peru that evaluated six unique interventions for T. solium control in 23 villages. For the validation, each intervention strategy was replicated in CystiAgent, and the simulated prevalences of human taeniasis, porcine cysticercosis, and porcine seroincidence were compared against prevalence estimates from the trial. Results showed that CystiAgent produced declines in transmission in response to each of the six intervention strategies, but overestimated the effect of interventions in the majority of villages; simulated prevalences for human taenasis and porcine cysticercosis at the end of the trial were a median of 0.53 and 5.0 percentages points less than prevalence observed at the end of the trial, respectively.

CONCLUSIONS/SIGNIFICANCE

The validation of CystiAgent represented an important step towards developing an accurate and reliable T. solium transmission model that can be deployed to fill critical gaps in our understanding of T. solium transmission and control. To improve model accuracy, future versions would benefit from improved data on pig immunity and resistance, field effectiveness of anti-helminthic treatment, and factors driving spatial clustering of T. solium infections including dispersion and contact with T. solium eggs in the environment.

Collapse

Zhu B, Pei Y, Li C. An improved approximate Bayesian computation scheme for parameter inference based on a recalibration post-processing method. COMMUN STAT-THEOR M 2021. [DOI: 10.1080/03610926.2021.1963456] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]

Dutta R, Gomes SN, Kalise D, Pacchiardi L. Using mobility data in the design of optimal lockdown strategies for the COVID-19 pandemic. PLoS Comput Biol 2021;17:e1009236. [PMID: 34383756 PMCID: PMC8360388 DOI: 10.1371/journal.pcbi.1009236] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2020] [Accepted: 07/02/2021] [Indexed: 01/29/2023] Open

Abstract

A mathematical model for the COVID-19 pandemic spread, which integrates age-structured Susceptible-Exposed-Infected-Recovered-Deceased dynamics with real mobile phone data accounting for the population mobility, is presented. The dynamical model adjustment is performed via Approximate Bayesian Computation. Optimal lockdown and exit strategies are determined based on nonlinear model predictive control, constrained to public-health and socio-economic factors. Through an extensive computational validation of the methodology, it is shown that it is possible to compute robust exit strategies with realistic reduced mobility values to inform public policy making, and we exemplify the applicability of the methodology using datasets from England and France.

In many countries, the COVID-19 pandemic has revealed a gap between public policy making and the use of advanced technological tools to inform such a process. In the big data era, decisions concerning the implementation of quarantines and travel restrictions are still being taken based on incomplete public health data, despite the myriad of information our society provides in real time, such as mobility data, commuting network structures, and financial patterns, to name a few. To advance towards an effective data-driven, quantitative policy making, we propose a computational framework where a predictive epidemiological model is fitted by feeding both public health and Google mobility data. The resulting model is then used as a basis for designing mobility reduction strategies which are optimised taking into account both the healthcare system capacity, and the economic impact of an extended lockdown. For the COVID-19 pandemic in England and France, we show that it is possible to design lockdown policies allowing a partial return to workplaces and schools, while maintaining the epidemic under control.

Collapse

West TO, Berthouze L, Farmer SF, Cagnan H, Litvak V. Inference of brain networks with approximate Bayesian computation - assessing face validity with an example application in Parkinsonism. Neuroimage 2021;236:118020. [PMID: 33839264 PMCID: PMC8270890 DOI: 10.1016/j.neuroimage.2021.118020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2020] [Revised: 03/16/2021] [Accepted: 03/21/2021] [Indexed: 11/21/2022] Open

Hendry JA, Kwiatkowski D, McVean G. Elucidating relationships between P.falciparum prevalence and measures of genetic diversity with a combined genetic-epidemiological model of malaria. PLoS Comput Biol 2021;17:e1009287. [PMID: 34411093 PMCID: PMC8407561 DOI: 10.1371/journal.pcbi.1009287] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2020] [Revised: 08/31/2021] [Accepted: 07/19/2021] [Indexed: 12/05/2022] Open

Abstract

There is an abundance of malaria genetic data being collected from the field, yet using these data to understand the drivers of regional epidemiology remains a challenge. A key issue is the lack of models that relate parasite genetic diversity to epidemiological parameters. Classical models in population genetics characterize changes in genetic diversity in relation to demographic parameters, but fail to account for the unique features of the malaria life cycle. In contrast, epidemiological models, such as the Ross-Macdonald model, capture malaria transmission dynamics but do not consider genetics. Here, we have developed an integrated model encompassing both parasite evolution and regional epidemiology. We achieve this by combining the Ross-Macdonald model with an intra-host continuous-time Moran model, thus explicitly representing the evolution of individual parasite genomes in a traditional epidemiological framework. Implemented as a stochastic simulation, we use the model to explore relationships between measures of parasite genetic diversity and parasite prevalence, a widely-used metric of transmission intensity. First, we explore how varying parasite prevalence influences genetic diversity at equilibrium. We find that multiple genetic diversity statistics are correlated with prevalence, but the strength of the relationships depends on whether variation in prevalence is driven by host- or vector-related factors. Next, we assess the responsiveness of a variety of statistics to malaria control interventions, finding that those related to mixed infections respond quickly (∼months) whereas other statistics, such as nucleotide diversity, may take decades to respond. These findings provide insights into the opportunities and challenges associated with using genetic data to monitor malaria epidemiology.

Collapse

Auzina IA, Tomczak JM. Approximate Bayesian Computation for Discrete Spaces. ENTROPY 2021;23:e23030312. [PMID: 33800743 PMCID: PMC7998962 DOI: 10.3390/e23030312] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/18/2021] [Revised: 02/27/2021] [Accepted: 03/02/2021] [Indexed: 11/23/2022]

Suzuki Y, Nakamura A, Milosevic M, Nomura K, Tanahashi T, Endo T, Sakoda S, Morasso P, Nomura T. Postural instability via a loss of intermittent control in elderly and patients with Parkinson's disease: A model-based and data-driven approach. CHAOS (WOODBURY, N.Y.) 2020;30:113140. [PMID: 33261318 DOI: 10.1063/5.0022319] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/21/2020] [Accepted: 10/28/2020] [Indexed: 06/12/2023]

Oesterle J, Behrens C, Schröder C, Hermann T, Euler T, Franke K, Smith RG, Zeck G, Berens P. Bayesian inference for biophysical neuron models enables stimulus optimization for retinal neuroprosthetics. eLife 2020;9:e54997. [PMID: 33107821 PMCID: PMC7673784 DOI: 10.7554/elife.54997] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2020] [Accepted: 10/26/2020] [Indexed: 01/02/2023] Open

Memory Alone Does Not Account for the Way Rats Learn a Simple Spatial Alternation Task. J Neurosci 2020;40:7311-7317. [PMID: 32753514 PMCID: PMC7534917 DOI: 10.1523/jneurosci.0972-20.2020] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2020] [Revised: 07/01/2020] [Accepted: 07/08/2020] [Indexed: 01/21/2023] Open

Pray IW, Wakeland W, Pan W, Lambert WE, Garcia HH, Gonzalez AE, O'Neal SE. Understanding transmission and control of the pork tapeworm with CystiAgent: a spatially explicit agent-based model. Parasit Vectors 2020;13:372. [PMID: 32709250 PMCID: PMC7379812 DOI: 10.1186/s13071-020-04226-8] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2020] [Accepted: 07/14/2020] [Indexed: 02/04/2023] Open

Abstract

BACKGROUND

The pork tapeworm, Taenia solium, is a serious public health problem in rural low-resource areas of Latin America, Africa and Asia, where the associated conditions of nuerocysticercosis (NCC) and porcine cysticercosis cause substantial health and economic harms. An accurate and validated transmission model for T. solium would serve as an important new tool for control and elimination, as it would allow for comparison of available intervention strategies, and prioritization of the most effective strategies for control and elimination efforts.

METHODS

We developed a spatially-explicit agent-based model (ABM) for T. solium ("CystiAgent") that differs from prior T. solium models by including a spatial framework and behavioral parameters such as pig roaming, open human defecation, and human travel. In this article, we introduce the structure and function of the model, describe the data sources used to parameterize the model, and apply sensitivity analyses (Latin hypercube sampling-partial rank correlation coefficient (LHS-PRCC)) to evaluate model parameters.

RESULTS

LHS-PRCC analysis of CystiAgent found that the parameters with the greatest impact on model uncertainty were the roaming range of pigs, the infectious duration of human taeniasis, use of latrines, and the set of "tuning" parameters defining the probabilities of infection in humans and pigs given exposure to T. solium.

CONCLUSIONS

CystiAgent is a novel ABM that has the ability to model spatial and behavioral features of T. solium transmission not available in other models. There is a small set of impactful model parameters that contribute uncertainty to the model and may impact the accuracy of model projections. Field and laboratory studies to better understand these key components of transmission may help reduce uncertainty, while current applications of CystiAgent may consider calibration of these parameters to improve model performance. These results will ultimately allow for improved interpretation of model validation results, and usage of the model to compare available control and elimination strategies for T. solium.

Collapse

Hazelbag CM, Dushoff J, Dominic EM, Mthombothi ZE, Delva W. Calibration of individual-based models to epidemiological data: A systematic review. PLoS Comput Biol 2020;16:e1007893. [PMID: 32392252 PMCID: PMC7241852 DOI: 10.1371/journal.pcbi.1007893] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2019] [Revised: 05/21/2020] [Accepted: 04/21/2020] [Indexed: 01/24/2023] Open

Abstract

Individual-based models (IBMs) informing public health policy should be calibrated to data and provide estimates of uncertainty. Two main components of model-calibration methods are the parameter-search strategy and the goodness-of-fit (GOF) measure; many options exist for each of these. This review provides an overview of calibration methods used in IBMs modelling infectious disease spread. We identified articles on PubMed employing simulation-based methods to calibrate IBMs informing public health policy in HIV, tuberculosis, and malaria epidemiology published between 1 January 2013 and 31 December 2018. Articles were included if models stored individual-specific information, and calibration involved comparing model output to population-level targets. We extracted information on parameter-search strategies, GOF measures, and model validation. The PubMed search identified 653 candidate articles, of which 84 met the review criteria. Of the included articles, 40 (48%) combined a quantitative GOF measure with an algorithmic parameter-search strategy–either an optimisation algorithm (14/40) or a sampling algorithm (26/40). These 40 articles varied widely in their choices of parameter-search strategies and GOF measures. For the remaining 44 (52%) articles, the parameter-search strategy could either not be identified (32/44) or was described as an informal, non-reproducible method (12/44). Of these 44 articles, the majority (25/44) were unclear about the GOF measure used; of the rest, only five quantitatively evaluated GOF. Only a minority of the included articles, 14 (17%) provided a rationale for their choice of model-calibration method. Model validation was reported in 31 (37%) articles. Reporting on calibration methods is far from optimal in epidemiological modelling studies of HIV, malaria and TB transmission dynamics. The adoption of better documented, algorithmic calibration methods could improve both reproducibility and the quality of inference in model-based epidemiology. There is a need for research comparing the performance of calibration methods to inform decisions about the parameter-search strategies and GOF measures.

Calibration—that is, “fitting” the model to data—is a crucial part of using mathematical models to better forecast and control the population-level spread of infectious diseases. Evidence that the mathematical model is well-calibrated improves confidence that the model provides a realistic picture of the consequences of health policy decisions. To make informed decisions, Policymakers need information about uncertainty: i.e., what is the range of likely outcomes (rather than just a single prediction). Thus, modellers should also strive to provide accurate measurements of uncertainty, both for their model parameters and for their predictions. This systematic review provides an overview of the methods used to calibrate individual-based models (IBMs) of the spread of HIV, malaria, and tuberculosis. We found that less than half of the reviewed articles used reproducible, non-subjective calibration methods. For the remaining articles, the method could either not be identified or was described as an informal, non-reproducible method. Only one-third of the articles obtained estimates of parameter uncertainty. We conclude that the adoption of better-documented, algorithmic calibration methods could improve both reproducibility and the quality of inference in model-based epidemiology.

Collapse

Chen S, Mira A, Onnela JP. Flexible model selection for mechanistic network models. JOURNAL OF COMPLEX NETWORKS 2020;8:cnz024. [PMID: 32765880 PMCID: PMC7391990 DOI: 10.1093/comnet/cnz024] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/25/2019] [Accepted: 06/24/2019] [Indexed: 05/25/2023]

Distance-learning For Approximate Bayesian Computation To Model a Volcanic Eruption. SANKHYA B 2020. [DOI: 10.1007/s13571-019-00208-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]

Buckwar E, Tamborrino M, Tubikanec I. Spectral density-based and measure-preserving ABC for partially observed diffusion processes. An illustration on Hamiltonian SDEs. STATISTICS AND COMPUTING 2020;30:627-648. [PMID: 32132771 PMCID: PMC7026277 DOI: 10.1007/s11222-019-09909-6] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/03/2019] [Accepted: 10/17/2019] [Indexed: 05/15/2023]

Abstract

Approximate Bayesian computation (ABC) has become one of the major tools of likelihood-free statistical inference in complex mathematical models. Simultaneously, stochastic differential equations (SDEs) have developed to an established tool for modelling time-dependent, real-world phenomena with underlying random effects. When applying ABC to stochastic models, two major difficulties arise: First, the derivation of effective summary statistics and proper distances is particularly challenging, since simulations from the stochastic process under the same parameter configuration result in different trajectories. Second, exact simulation schemes to generate trajectories from the stochastic model are rarely available, requiring the derivation of suitable numerical methods for the synthetic data generation. To obtain summaries that are less sensitive to the intrinsic stochasticity of the model, we propose to build up the statistical method (e.g. the choice of the summary statistics) on the underlying structural properties of the model. Here, we focus on the existence of an invariant measure and we map the data to their estimated invariant density and invariant spectral density. Then, to ensure that these model properties are kept in the synthetic data generation, we adopt measure-preserving numerical splitting schemes. The derived property-based and measure-preserving ABC method is illustrated on the broad class of partially observed Hamiltonian type SDEs, both with simulated data and with real electroencephalography data. The derived summaries are particularly robust to the model simulation, and this fact, combined with the proposed reliable numerical scheme, yields accurate ABC inference. In contrast, the inference returned using standard numerical methods (Euler-Maruyama discretisation) fails. The proposed ingredients can be incorporated into any type of ABC algorithm and directly applied to all SDEs that are characterised by an invariant distribution and for which a measure-preserving numerical method can be derived.

Collapse

Kokko J, Remes U, Thomas O, Pesonen H, Corander J. PYLFIRE: Python implementation of likelihood-free inference by ratio estimation. Wellcome Open Res 2019;4:197. [PMID: 32133422 PMCID: PMC7041362 DOI: 10.12688/wellcomeopenres.15583.1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/13/2019] [Indexed: 11/21/2022] Open

Filipe JA, Kyriazakis I. Bayesian, Likelihood-Free Modelling of Phenotypic Plasticity and Variability in Individuals and Populations. Front Genet 2019;10:727. [PMID: 31616460 PMCID: PMC6764410 DOI: 10.3389/fgene.2019.00727] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2018] [Accepted: 07/11/2019] [Indexed: 12/17/2022] Open

Abstract

There is a paradigm shift from the traditional focus on the "average" individual towards the definition and analysis of trait variation within individual life-history and among individuals in populations. This is a result of increasing availability of individual phenotypic data. The shift allows the use of genetic and environment-driven variations to assess robustness to challenge, gain greater understanding of organismal biological processes, or deliver individual-targeted treatments or genetic selection. These consequences apply, in particular, to variation in ontogenetic growth. We propose an approach to parameterise mathematical models of individual traits (e.g., reaction norms, growth curves) that address two challenges: 1) Estimation of individual traits while making minimal assumptions about data distribution and correlation, addressed via Approximate Bayesian Computation (a form of nonparametric inference). We are motivated by the fact that available information on distribution of biological data is often less precise than assumed by conventional likelihood functions. 2) Scaling-up to population phenotype distributions while facilitating unbiased use of individual data; this is addressed via a probabilistic framework where population distributions build on separately-inferred individual distributions and individual-trait interpretability is preserved. The approach is tested against Bayesian likelihood-based inference, by fitting weight and energy intake growth models to animal data and normal- and skewed-distributed simulated data. i) Individual inferences were accurate and robust to changes in data distribution and sample size; in particular, median-based predictions were more robust than maximum- likelihood-based curves. These results suggest that the approach gives reliable inferences using few observations and monitoring resources. ii) At the population level, each individual contributed via a specific data distribution, and population phenotype estimates were not disproportionally influenced by outlier individuals. Indices measuring population phenotype variation can be derived for study comparisons. The approach offers an alternative for estimating trait variability in biological systems that may be reliable for various applications, for example, in genetics, health, and individualised nutrition, while using fewer assumptions and fewer empirical observations. In livestock breeding, the potentially greater accuracy of trait estimation (without specification of multitrait variance-covariance parameters) could lead to improved selection and to more decisive estimates of trait heritability.

Collapse

Lintusaari J, Blomstedt P, Rose B, Sivula T, Gutmann MU, Kaski S, Corander J. Resolving outbreak dynamics using approximate Bayesian computation for stochastic birth-death models. Wellcome Open Res 2019;4:14. [PMID: 37744419 PMCID: PMC10514576 DOI: 10.12688/wellcomeopenres.15048.2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/16/2019] [Indexed: 09/26/2023] Open

Kangasrääsiö A, Jokinen JPP, Oulasvirta A, Howes A, Kaski S. Parameter Inference for Computational Cognitive Models with Approximate Bayesian Computation. Cogn Sci 2019;43:e12738. [PMID: 31204797 PMCID: PMC6593436 DOI: 10.1111/cogs.12738] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2017] [Revised: 04/09/2019] [Accepted: 04/11/2019] [Indexed: 11/28/2022]

Sard N, Robinson J, Kanefsky J, Herbst S, Scribner K. Coalescent models characterize sources and demographic history of recent round goby colonization of Great Lakes and inland waters. Evol Appl 2019;12:1034-1049. [PMID: 31080513 PMCID: PMC6503821 DOI: 10.1111/eva.12779] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2018] [Accepted: 01/15/2019] [Indexed: 12/25/2022] Open

Abstract

The establishment and spread of aquatic invasive species are ecologically and economically harmful and a source of conservation concern internationally. Processes of species invasion have traditionally been inferred from observational data of species presence/absence and relative abundance. However, genetic-based approaches can provide valuable sources of inference. Restriction site-associated DNA sequencing was used to identify and genotype single nucleotide polymorphism (SNP) loci for Round Gobies (Neogobius melanostomus) (N = 440) from 18 sampling locations in the Great Lakes and in three Michigan, USA, drainages (Flint, Au Sable, and Cheboygan River basins). Sampled rivers differed in size, accessibility, and physical characteristics including man-made dispersal barriers. Population levels of genetic diversity and interpopulation variance in SNP allele frequency were used in coalescence-based approximate Bayesian computation (ABC) to statistically compare models representing competing hypotheses regarding source population, postcolonization dispersal, and demographic history in the Great Lakes and inland waters. Results indicate different patterns of colonization across the three drainages. In the Flint River, models indicate a strong population bottleneck (<3% of contemporary effective population size) and a single founding event from Saginaw Bay led to the colonization of inland river segments. In the Au Sable River, analyses could not distinguish potential source populations, but supported models indicated multiple introductions from one source population. In the Cheboygan River, supported models indicated that colonization likely proceeded from east (Lake Huron source) to west among inland locales sampled in the system. Despite the recent occupancy of Great Lakes and inland habitats, large numbers of loci analyzed in an ABC framework enable statistically supported identification of source populations and reconstruction of the direction of inland spread and demographic history following establishment. Information from analyses can direct management actions to limit the spread of invasive species from identified sources and most probable vectors into additional inland aquatic habitats.

Collapse

Järvenpää M, Sater MRA, Lagoudas GK, Blainey PC, Miller LG, McKinnell JA, Huang SS, Grad YH, Marttinen P. A Bayesian model of acquisition and clearance of bacterial colonization incorporating within-host variation. PLoS Comput Biol 2019;15:e1006534. [PMID: 31009452 PMCID: PMC6497309 DOI: 10.1371/journal.pcbi.1006534] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2018] [Revised: 05/02/2019] [Accepted: 02/22/2019] [Indexed: 11/19/2022] Open

Moens V, Zénon A. Learning and forgetting using reinforced Bayesian change detection. PLoS Comput Biol 2019;15:e1006713. [PMID: 30995214 PMCID: PMC6488101 DOI: 10.1371/journal.pcbi.1006713] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2018] [Revised: 04/29/2019] [Accepted: 12/09/2018] [Indexed: 12/17/2022] Open

Lintusaari J, Blomstedt P, Sivula T, Gutmann MU, Kaski S, Corander J. Resolving outbreak dynamics using approximate Bayesian computation for stochastic birth-death models. Wellcome Open Res 2019. [DOI: 10.12688/wellcomeopenres.15048.1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open

Shen P, Lees JA, Bee GCW, Brown SP, Weiser JN. Pneumococcal quorum sensing drives an asymmetric owner-intruder competitive strategy during carriage via the competence regulon. Nat Microbiol 2018;4:198-208. [PMID: 30546100 DOI: 10.1038/s41564-018-0314-4] [Citation(s) in RCA: 31] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2018] [Accepted: 10/30/2018] [Indexed: 11/09/2022]

Järvenpää M, Gutmann MU, Vehtari A, Marttinen P. Gaussian process modelling in approximate Bayesian computation to estimate horizontal gene transfer in bacteria. Ann Appl Stat 2018. [DOI: 10.1214/18-aoas1150] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

Dutta R, Brotzakis ZF, Mira A. Bayesian calibration of force-fields from experimental data: TIP4P water. J Chem Phys 2018;149:154110. [DOI: 10.1063/1.5030950] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023] Open

Xu Y, Puranen S, Corander J, Kabashima Y. Inverse finite-size scaling for high-dimensional significance analysis. Phys Rev E 2018;97:062112. [PMID: 30011500 DOI: 10.1103/physreve.97.062112] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2017] [Indexed: 11/07/2022]

Dutta R, Mira A, Onnela JP. Bayesian inference of spreading processes on networks. Proc Math Phys Eng Sci 2018;474:20180129. [PMID: 30100809 PMCID: PMC6083242 DOI: 10.1098/rspa.2018.0129] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2018] [Accepted: 06/19/2018] [Indexed: 01/18/2023] Open

Karabatsos G, Leisen F. An approximate likelihood perspective on ABC methods. STATISTICS SURVEYS 2018. [DOI: 10.1214/18-ss120] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

Corander J, Fraser C, Gutmann MU, Arnold B, Hanage WP, Bentley SD, Lipsitch M, Croucher NJ. Frequency-dependent selection in vaccine-associated pneumococcal population dynamics. Nat Ecol Evol 2017;1:1950-1960. [PMID: 29038424 PMCID: PMC5708525 DOI: 10.1038/s41559-017-0337-x] [Citation(s) in RCA: 81] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2017] [Accepted: 09/01/2017] [Indexed: 12/21/2022]

Overcast I, Bagley JC, Hickerson MJ. Strategies for improving approximate Bayesian computation tests for synchronous diversification. BMC Evol Biol 2017;17:203. [PMID: 28836959 PMCID: PMC5571621 DOI: 10.1186/s12862-017-1052-6] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2017] [Accepted: 08/14/2017] [Indexed: 11/22/2022] Open

Abstract

Background

Estimating the variability in isolation times across co-distributed taxon pairs that may have experienced the same allopatric isolating mechanism is a core goal of comparative phylogeography. The use of hierarchical Approximate Bayesian Computation (ABC) and coalescent models to infer temporal dynamics of lineage co-diversification has been a contentious topic in recent years. Key issues that remain unresolved include the choice of an appropriate prior on the number of co-divergence events (Ψ), as well as the optimal strategies for data summarization.

Methods

Through simulation-based cross validation we explore the impact of the strategy for sorting summary statistics and the choice of prior on Ψ on the estimation of co-divergence variability. We also introduce a new setting (β) that can potentially improve estimation of Ψ by enforcing a minimal temporal difference between pulses of co-divergence. We apply this new method to three empirical datasets: one dataset each of co-distributed taxon pairs of Panamanian frogs and freshwater fishes, and a large set of Neotropical butterfly sister-taxon pairs.

Results

We demonstrate that the choice of prior on Ψ has little impact on inference, but that sorting summary statistics yields substantially more reliable estimates of co-divergence variability despite violations of assumptions about exchangeability. We find the implementation of β improves estimation of Ψ, with improvement being most dramatic given larger numbers of taxon pairs. We find equivocal support for synchronous co-divergence for both of the Panamanian groups, but we find considerable support for asynchronous divergence among the Neotropical butterflies.

Conclusions

Our simulation experiments demonstrate that using sorted summary statistics results in improved estimates of the variability in divergence times, whereas the choice of hyperprior on Ψ has negligible effect. Additionally, we demonstrate that estimating the number of pulses of co-divergence across co-distributed taxon-pairs is improved by applying a flexible buffering regime over divergence times. This improves the correlation between Ψ and the true variability in isolation times and allows for more meaningful interpretation of this hyperparameter. This will allow for more accurate identification of the number of temporally distinct pulses of co-divergence that generated the diversification pattern of a given regional assemblage of sister-taxon-pairs.

Electronic supplementary material

The online version of this article (doi:10.1186/s12862-017-1052-6) contains supplementary material, which is available to authorized users.

Collapse

Tietäväinen A, Gutmann MU, Keski-Vakkuri E, Corander J, Hæggström E. Bayesian inference of physiologically meaningful parameters from body sway measurements. Sci Rep 2017. [PMID: 28630413 PMCID: PMC5476665 DOI: 10.1038/s41598-017-02372-1] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023] Open

Gutmann MU, Dutta R, Kaski S, Corander J. Likelihood-free inference via classification. STATISTICS AND COMPUTING 2017;28:411-425. [PMID: 31997856 PMCID: PMC6956883 DOI: 10.1007/s11222-017-9738-6] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/27/2016] [Accepted: 02/28/2017] [Indexed: 06/10/2023]