101
|
Neuronal circuits overcome imbalance in excitation and inhibition by adjusting connection numbers. Proc Natl Acad Sci U S A 2021; 118:2018459118. [PMID: 33723048 DOI: 10.1073/pnas.2018459118] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023] Open
Abstract
The interplay between excitation and inhibition is crucial for neuronal circuitry in the brain. Inhibitory cell fractions in the neocortex and hippocampus are typically maintained at 15 to 30%, which is assumed to be important for stable dynamics. We have studied systematically the role of precisely controlled excitatory/inhibitory (E/I) cellular ratios on network activity using mice hippocampal cultures. Surprisingly, networks with varying E/I ratios maintain stable bursting dynamics. Interburst intervals remain constant for most ratios, except in the extremes of 0 to 10% and 90 to 100% inhibitory cells. Single-cell recordings and modeling suggest that networks adapt to chronic alterations of E/I compositions by balancing E/I connectivity. Gradual blockade of inhibition substantiates the agreement between the model and experiment and defines its limits. Combining measurements of population and single-cell activity with theoretical modeling, we provide a clearer picture of how E/I balance is preserved and where it fails in living neuronal networks.
Collapse
|
102
|
Gonzalez M, Gutierrez C, Martinez R. Bayesian Inference in Y-Linked Two-Sex Branching Processes with Mutations: ABC Approach. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2021; 18:525-538. [PMID: 31180867 DOI: 10.1109/tcbb.2019.2921308] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
A Y-linked two-sex branching process with mutations and blind choice of males is a suitable model for analyzing the evolution of the number of carriers of a Y-linked allele and its mutations. Such a model considers a two-sex monogamous population in which each female chooses her partner from among the male population without caring about his type (i.e., the allele he carries). In this work, we deal with the problem of estimating the main parameters of these models by developing Bayesian inference in a parametric framework. First, we consider as a sample scheme the observation of the total number of females and males up to some generation as well as the number of males of each genotype in the last generation. Subsequently, we introduce the information on the mutated males in only the last generation, obtaining in this way a second sample scheme. For both samples, we apply the Approximate Bayesian Computation (ABC) method to approximate the posterior distributions of the main parameters of the model. The accuracy of the procedure based on these samples is illustrated and discussed by way of simulated examples.
Collapse
|
103
|
Lofgren ET, Mietchen M, Dicks KV, Moehring R, Anderson D. Estimated Methicillin-Resistant Staphylococcus aureus Decolonization in Intensive Care Units Associated With Single-Application Chlorhexidine Gluconate or Mupirocin. JAMA Netw Open 2021; 4:e210652. [PMID: 33662133 PMCID: PMC7933999 DOI: 10.1001/jamanetworkopen.2021.0652] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/28/2023] Open
Abstract
IMPORTANCE Chlorhexidine gluconate (CHG) and mupirocin are widely used to decolonize patients with methicillin-resistant Staphylococcus aureus (MRSA) and reduce risks associated with infection in hospitalized populations. Quantifying the association of an application of CHG alone or in combination with mupirocin with risk of MRSA infection is important for studies evaluating alternative decolonization strategies or schedules and for identifying whether there is room for improved decolonizing agents. OBJECTIVE To estimate the proportion of patients with MRSA decolonized per application of CHG and mupirocin from existing population-level studies. DESIGN, SETTING, AND PARTICIPANTS A stochastic mathematical model of an 18-bed intensive care unit (ICU) in an academic medical center operating over 1 year was used to estimate parameters for the proportion of simulated patients with MRSA decolonized per application of CHG and mupirocin. The model was conducted using approximate bayesian computation with data from an existing meta-analysis of studies conducted from February 2005 through January 2015. Data were analyzed from January 2018 through November 2019. EXPOSURE A universal decolonization protocol for colonized patients in the ICU using CHG or CHG and mupirocin in combination was simulated. MAIN OUTCOMES AND MEASURES The proportion of patients with MRSA decolonized per application of CHG and mupirocin was estimated. RESULTS The estimated proportion of patients with MRSA decolonized per application of CHG was 0.15 (95% credible interval, 0.01-0.42), and the estimated proportion per application of mupirocin in conjunction with CHG was 0.15 (95% credible interval, 0.01-0.54). A lag in colonization detection was associated with decreases in the CHG estimate (0.11; 95% credible interval, 0.01-0.30) and mupirocin estimate (0.10; 95% credible interval, 0.00-0.34), which were sensitive to the value of the modeled contact rate between nurses and patients. A 1% increase in the value of this parameter was associated with a 0.73% increase in the estimated combined outcomes associated with CHG and mupirocin (95% CI: 0.71, 0.75). Gaps longer than 24 hours in the administration of decolonizing agents were associated with a decrease of within-ICU MRSA transmission. Compared with a mean (SD) of 1.23 (0.27) acquisitions per 1000 patient-days in scenarios with no decolonizing bathing, a bathing protocol administering CHG and mupirocin every 120 hours was associated with a mean (SD) acquisition rate of 1.03 (0.24) acquisitions per 1000 patient days, a 16.3% decrease (95% CI, 14.7%-18.0%; P > .001). CONCLUSIONS AND RELEVANCE These findings suggest that there may be room for significant improvement in anti-MRSA disinfectants, including the compounds themselves and their delivery mechanisms. Despite the decolonization estimates found in this study, these agents are associated with robust outcomes after delays in administration, which may help in alleviating concerns over patient comfort and toxic effects.
Collapse
Affiliation(s)
- Eric T. Lofgren
- Paul G. Allen School for Global Animal Health, Washington State University, Pullman, Washington
| | - Matthew Mietchen
- Paul G. Allen School for Global Animal Health, Washington State University, Pullman, Washington
| | - Kristen V. Dicks
- Duke Center for Antimicrobial Stewardship and Infection Prevention, Durham, North Carolina
| | - Rebekah Moehring
- Duke Center for Antimicrobial Stewardship and Infection Prevention, Durham, North Carolina
| | - Deverick Anderson
- Duke Center for Antimicrobial Stewardship and Infection Prevention, Durham, North Carolina
| |
Collapse
|
104
|
Callaway F, Rangel A, Griffiths TL. Fixation patterns in simple choice reflect optimal information sampling. PLoS Comput Biol 2021; 17:e1008863. [PMID: 33770069 PMCID: PMC8026028 DOI: 10.1371/journal.pcbi.1008863] [Citation(s) in RCA: 31] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2020] [Revised: 04/07/2021] [Accepted: 03/10/2021] [Indexed: 11/24/2022] Open
Abstract
Simple choices (e.g., eating an apple vs. an orange) are made by integrating noisy evidence that is sampled over time and influenced by visual attention; as a result, fluctuations in visual attention can affect choices. But what determines what is fixated and when? To address this question, we model the decision process for simple choice as an information sampling problem, and approximate the optimal sampling policy. We find that it is optimal to sample from options whose value estimates are both high and uncertain. Furthermore, the optimal policy provides a reasonable account of fixations and choices in binary and trinary simple choice, as well as the differences between the two cases. Overall, the results show that the fixation process during simple choice is influenced dynamically by the value estimates computed during the decision process, in a manner consistent with optimal information sampling.
Collapse
Affiliation(s)
- Frederick Callaway
- Department of Psychology, Princeton University, Princeton, New Jersey, United States of America
| | - Antonio Rangel
- Departments of Humanities and Social Sciences and Computation and Neural Systems, California Institute of Technology, Pasadena, California, United States of America
| | - Thomas L. Griffiths
- Department of Psychology, Princeton University, Princeton, New Jersey, United States of America
- Department of Computer Science, Princeton University, Princeton, New Jersey, United States of America
| |
Collapse
|
105
|
Weerasuriya CK, Harris RC, McQuaid CF, Bozzani F, Ruan Y, Li R, Li T, Rade K, Rao R, Ginsberg AM, Gomez GB, White RG. The epidemiologic impact and cost-effectiveness of new tuberculosis vaccines on multidrug-resistant tuberculosis in India and China. BMC Med 2021; 19:60. [PMID: 33632218 PMCID: PMC7908776 DOI: 10.1186/s12916-021-01932-7] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/12/2020] [Accepted: 01/29/2021] [Indexed: 02/08/2023] Open
Abstract
BACKGROUND Despite recent advances through the development pipeline, how novel tuberculosis (TB) vaccines might affect rifampicin-resistant and multidrug-resistant tuberculosis (RR/MDR-TB) is unknown. We investigated the epidemiologic impact, cost-effectiveness, and budget impact of hypothetical novel prophylactic prevention of disease TB vaccines on RR/MDR-TB in China and India. METHODS We constructed a deterministic, compartmental, age-, drug-resistance- and treatment history-stratified dynamic transmission model of tuberculosis. We introduced novel vaccines from 2027, with post- (PSI) or both pre- and post-infection (P&PI) efficacy, conferring 10 years of protection, with 50% efficacy. We measured vaccine cost-effectiveness over 2027-2050 as USD/DALY averted-against 1-times GDP/capita, and two healthcare opportunity cost-based (HCOC), thresholds. We carried out scenario analyses. RESULTS By 2050, the P&PI vaccine reduced RR/MDR-TB incidence rate by 71% (UI: 69-72) and 72% (UI: 70-74), and the PSI vaccine by 31% (UI: 30-32) and 44% (UI: 42-47) in China and India, respectively. In India, we found both USD 10 P&PI and PSI vaccines cost-effective at the 1-times GDP and upper HCOC thresholds and P&PI vaccines cost-effective at the lower HCOC threshold. In China, both vaccines were cost-effective at the 1-times GDP threshold. P&PI vaccine remained cost-effective at the lower HCOC threshold with 49% probability and PSI vaccines at the upper HCOC threshold with 21% probability. The P&PI vaccine was predicted to avert 0.9 million (UI: 0.8-1.1) and 1.1 million (UI: 0.9-1.4) second-line therapy regimens in China and India between 2027 and 2050, respectively. CONCLUSIONS Novel TB vaccination is likely to substantially reduce the future burden of RR/MDR-TB, while averting the need for second-line therapy. Vaccination may be cost-effective depending on vaccine characteristics and setting.
Collapse
Affiliation(s)
- Chathika K Weerasuriya
- TB Modelling Group, TB Centre and Centre for the Mathematical Modelling of Infectious Diseases, Department of Infectious Disease Epidemiology, Faculty of Epidemiology & Population Health, London School of Hygiene and Tropical Medicine, London, UK.
| | - Rebecca C Harris
- TB Modelling Group, TB Centre and Centre for the Mathematical Modelling of Infectious Diseases, Department of Infectious Disease Epidemiology, Faculty of Epidemiology & Population Health, London School of Hygiene and Tropical Medicine, London, UK.,Currently employed at Sanofi Pasteur, Singapore, Singapore
| | - C Finn McQuaid
- TB Modelling Group, TB Centre and Centre for the Mathematical Modelling of Infectious Diseases, Department of Infectious Disease Epidemiology, Faculty of Epidemiology & Population Health, London School of Hygiene and Tropical Medicine, London, UK
| | - Fiammetta Bozzani
- Department of Global Health and Development, Faculty of Public Health & Policy, London School of Hygiene and Tropical Medicine, London, UK
| | - Yunzhou Ruan
- Chinese Centre for Disease Control and Prevention, Beijing, China
| | - Renzhong Li
- Chinese Centre for Disease Control and Prevention, Beijing, China
| | - Tao Li
- Chinese Centre for Disease Control and Prevention, Beijing, China
| | | | - Raghuram Rao
- National Tuberculosis Elimination Programme, New Delhi, India
| | - Ann M Ginsberg
- International AIDS Vaccine Initiative, New York, USA.,Current Affiliation: Bill and Melinda Gates Foundation, Washington DC, USA
| | - Gabriela B Gomez
- Department of Global Health and Development, Faculty of Public Health & Policy, London School of Hygiene and Tropical Medicine, London, UK.,Currently employed at Sanofi Pasteur, Lyon, France
| | - Richard G White
- TB Modelling Group, TB Centre and Centre for the Mathematical Modelling of Infectious Diseases, Department of Infectious Disease Epidemiology, Faculty of Epidemiology & Population Health, London School of Hygiene and Tropical Medicine, London, UK
| |
Collapse
|
106
|
A mesoscopic simulator to uncover heterogeneity and evolutionary dynamics in tumors. PLoS Comput Biol 2021; 17:e1008266. [PMID: 33566821 PMCID: PMC7901744 DOI: 10.1371/journal.pcbi.1008266] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2020] [Revised: 02/23/2021] [Accepted: 01/16/2021] [Indexed: 12/12/2022] Open
Abstract
Increasingly complex in silico modeling approaches offer a way to simultaneously access cancerous processes at different spatio-temporal scales. High-level models, such as those based on partial differential equations, are computationally affordable and allow large tumor sizes and long temporal windows to be studied, but miss the discrete nature of many key underlying cellular processes. Individual-based approaches provide a much more detailed description of tumors, but have difficulties when trying to handle full-sized real cancers. Thus, there exists a trade-off between the integration of macroscopic and microscopic information, now widely available, and the ability to attain clinical tumor sizes. In this paper we put forward a stochastic mesoscopic simulation framework that incorporates key cellular processes during tumor progression while keeping computational costs to a minimum. Our framework captures a physical scale that allows both the incorporation of microscopic information, tracking the spatio-temporal emergence of tumor heterogeneity and the underlying evolutionary dynamics, and the reconstruction of clinically sized tumors from high-resolution medical imaging data, with the additional benefit of low computational cost. We illustrate the functionality of our modeling approach for the case of glioblastoma, a paradigm of tumor heterogeneity that remains extremely challenging in the clinical setting.
Collapse
|
107
|
Deng H, Exel KE, Swart A, Bonačić Marinović AA, Dam-Deisz C, van der Giessen JWB, Opsteegh M. Digging into Toxoplasma gondii infections via soil: A quantitative microbial risk assessment approach. THE SCIENCE OF THE TOTAL ENVIRONMENT 2021; 755:143232. [PMID: 33160663 DOI: 10.1016/j.scitotenv.2020.143232] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/20/2020] [Revised: 10/05/2020] [Accepted: 10/16/2020] [Indexed: 06/11/2023]
Abstract
Soil has been identified as an important source of exposure to a variety of chemical and biological contaminants. Toxoplasma gondii is one of those potential biological contaminants associated with serious health effects in pregnant women and immunocompromised patients. Gardening or consumption of homegrown vegetables may present an important route of T. gondii infection via accidental ingestion of soil. In the Netherlands, there is quantitative information on the risk of T. gondii infection via meat products, but not on the risk of infection through soil. The objective of this study was to develop a quantitative microbial risk assessment (QMRA) model for estimating the risk associated with T. gondii exposure via accidental soil ingestion in the Netherlands. In order to obtain the needed information, a magnetic capture method for detection of T. gondii oocysts in soil samples was developed, and T. gondii DNA was detected using qPCR targeting the 529 bp repeat element. The method was shown to provide 95% probability of detection (95% CI: 88-100%) when at least 34 oocysts are present in 25 g of soil. T. gondii DNA was detected in 5 of 148 soil samples with interpretable results (3%, 95% CI: 1.5-7.7%). Results for 18 samples were not interpretable due to PCR inhibition. The estimated amount of oocysts presented in qPCR positive samples was quantified by a linear model, and the amount varied from 8 to 478 in 25 g of soil. The estimated incidence rate of T. gondii infection from the QMRA model via soil varied from 0.3 to 1.8 per 1000 individuals per day. Several data gaps (e.g., soil contamination/ingestion and oocysts viability) have been identified in this study, the structure of the model can be applied to obtain more accurate estimates of the risk of T. gondii infection via soil when data become available.
Collapse
Affiliation(s)
- Huifang Deng
- National Institute for Public Health and the Environment (RIVM), P.O. Box 1, 3720 BA Bilthoven, the Netherlands.
| | - Kitty E Exel
- National Institute for Public Health and the Environment (RIVM), P.O. Box 1, 3720 BA Bilthoven, the Netherlands; Faculty of Veterinary Medicine, Utrecht University, Yalelaan 7, 3584 CL Utrecht, the Netherlands.
| | - Arno Swart
- National Institute for Public Health and the Environment (RIVM), P.O. Box 1, 3720 BA Bilthoven, the Netherlands.
| | - Axel A Bonačić Marinović
- National Institute for Public Health and the Environment (RIVM), P.O. Box 1, 3720 BA Bilthoven, the Netherlands.
| | - Cecile Dam-Deisz
- National Institute for Public Health and the Environment (RIVM), P.O. Box 1, 3720 BA Bilthoven, the Netherlands.
| | - Johanna W B van der Giessen
- National Institute for Public Health and the Environment (RIVM), P.O. Box 1, 3720 BA Bilthoven, the Netherlands.
| | - Marieke Opsteegh
- National Institute for Public Health and the Environment (RIVM), P.O. Box 1, 3720 BA Bilthoven, the Netherlands.
| |
Collapse
|
108
|
Paun LM, Husmeier D. Markov chain Monte Carlo with Gaussian processes for fast parameter estimation and uncertainty quantification in a 1D fluid-dynamics model of the pulmonary circulation. INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN BIOMEDICAL ENGINEERING 2021; 37:e3421. [PMID: 33249755 PMCID: PMC7901000 DOI: 10.1002/cnm.3421] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/19/2020] [Revised: 11/07/2020] [Accepted: 11/18/2020] [Indexed: 06/12/2023]
Abstract
The past few decades have witnessed an explosive synergy between physics and the life sciences. In particular, physical modelling in medicine and physiology is a topical research area. The present work focuses on parameter inference and uncertainty quantification in a 1D fluid-dynamics model for quantitative physiology: the pulmonary blood circulation. The practical challenge is the estimation of the patient-specific biophysical model parameters, which cannot be measured directly. In principle this can be achieved based on a comparison between measured and predicted data. However, predicting data requires solving a system of partial differential equations (PDEs), which usually have no closed-form solution, and repeated numerical integrations as part of an adaptive estimation procedure are computationally expensive. In the present article, we demonstrate how fast parameter estimation combined with sound uncertainty quantification can be achieved by a combination of statistical emulation and Markov chain Monte Carlo (MCMC) sampling. We compare a range of state-of-the-art MCMC algorithms and emulation strategies, and assess their performance in terms of their accuracy and computational efficiency. The long-term goal is to develop a method for reliable disease prognostication in real time, and our work is an important step towards an automatic clinical decision support system.
Collapse
Affiliation(s)
- L. Mihaela Paun
- School of Mathematics and StatisticsUniversity of GlasgowGlasgowUK
| | - Dirk Husmeier
- School of Mathematics and StatisticsUniversity of GlasgowGlasgowUK
| |
Collapse
|
109
|
Kuronen M, Myllymäki M, Loavenbruck A, Särkkä A. Point process models for sweat gland activation observed with noise. Stat Med 2021; 40:2055-2072. [PMID: 33517587 DOI: 10.1002/sim.8891] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2020] [Revised: 11/07/2020] [Accepted: 01/06/2021] [Indexed: 11/11/2022]
Abstract
The aim of this article is to construct spatial models for the activation of sweat glands for healthy subjects and subjects suffering from peripheral neuropathy by using videos of sweating recorded from the subjects. The sweat patterns are regarded as realizations of spatial point processes and two point process models for the sweat gland activation and two methods for inference are proposed. Several image analysis steps are needed to extract the point patterns from the videos and some incorrectly identified sweat gland locations may be present in the data. To take into account the errors, we either include an error term in the point process model or use an estimation procedure that is robust with respect to the errors.
Collapse
Affiliation(s)
- Mikko Kuronen
- Natural Resources Institute Finland (LUKE), Helsinki, Finland
| | - Mari Myllymäki
- Natural Resources Institute Finland (LUKE), Helsinki, Finland
| | - Adam Loavenbruck
- Department of Neurology, Kennedy Laboratory, University of Minnesota, Minneapolis, Minnesota, USA
| | - Aila Särkkä
- Department of Mathematical Sciences, Chalmers University of Technology and the University of Gothenburg, Gothenburg, Sweden
| |
Collapse
|
110
|
Li G, Hu Y, Jan Zrimec, Luo H, Wang H, Zelezniak A, Ji B, Nielsen J. Bayesian genome scale modelling identifies thermal determinants of yeast metabolism. Nat Commun 2021; 12:190. [PMID: 33420025 PMCID: PMC7794507 DOI: 10.1038/s41467-020-20338-2] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2020] [Accepted: 11/25/2020] [Indexed: 12/05/2022] Open
Abstract
The molecular basis of how temperature affects cell metabolism has been a long-standing question in biology, where the main obstacles are the lack of high-quality data and methods to associate temperature effects on the function of individual proteins as well as to combine them at a systems level. Here we develop and apply a Bayesian modeling approach to resolve the temperature effects in genome scale metabolic models (GEM). The approach minimizes uncertainties in enzymatic thermal parameters and greatly improves the predictive strength of the GEMs. The resulting temperature constrained yeast GEM uncovers enzymes that limit growth at superoptimal temperatures, and squalene epoxidase (ERG1) is predicted to be the most rate limiting. By replacing this single key enzyme with an ortholog from a thermotolerant yeast strain, we obtain a thermotolerant strain that outgrows the wild type, demonstrating the critical role of sterol metabolism in yeast thermosensitivity. Therefore, apart from identifying thermal determinants of cell metabolism and enabling the design of thermotolerant strains, our Bayesian GEM approach facilitates modelling of complex biological systems in the absence of high-quality data and therefore shows promise for becoming a standard tool for genome scale modeling. While temperature impacts the function of all cellular components, it’s hard to rule out how the temperature dependence of cell phenotypes emerged from the dependence of individual components. Here, the authors develop a Bayesian genome scale modelling approach to identify thermal determinants of yeast metabolism.
Collapse
Affiliation(s)
- Gang Li
- Department of Biology and Biological Engineering, Chalmers University of Technology, SE-412 96, Gothenburg, Sweden
| | - Yating Hu
- Department of Biology and Biological Engineering, Chalmers University of Technology, SE-412 96, Gothenburg, Sweden
| | - Jan Zrimec
- Department of Biology and Biological Engineering, Chalmers University of Technology, SE-412 96, Gothenburg, Sweden
| | - Hao Luo
- Department of Biology and Biological Engineering, Chalmers University of Technology, SE-412 96, Gothenburg, Sweden
| | - Hao Wang
- Department of Biology and Biological Engineering, Chalmers University of Technology, SE-412 96, Gothenburg, Sweden.,National Bioinformatics Infrastructure Sweden, Science for Life Laboratory, Chalmers University of Technology, SE-41258, Gothenburg, Sweden.,Wallenberg Center for Molecular and Translational Medicine, University of Gothenburg, SE-41258, Gothenburg, Sweden
| | - Aleksej Zelezniak
- Department of Biology and Biological Engineering, Chalmers University of Technology, SE-412 96, Gothenburg, Sweden.,Science for Life Laboratory, Tomtebodavägen 23a, SE-171 65, Stockholm, Sweden
| | - Boyang Ji
- Department of Biology and Biological Engineering, Chalmers University of Technology, SE-412 96, Gothenburg, Sweden.,Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, DK-2800, Kgs. Lyngby, Denmark
| | - Jens Nielsen
- Department of Biology and Biological Engineering, Chalmers University of Technology, SE-412 96, Gothenburg, Sweden. .,Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, DK-2800, Kgs. Lyngby, Denmark. .,BioInnovation Institute, Ole Måløes Vej 3, DK2200, Copenhagen N, Denmark.
| |
Collapse
|
111
|
Warne DJ, Ebert A, Drovandi C, Hu W, Mira A, Mengersen K. Hindsight is 2020 vision: a characterisation of the global response to the COVID-19 pandemic. BMC Public Health 2020; 20:1868. [PMID: 33287789 PMCID: PMC7719727 DOI: 10.1186/s12889-020-09972-z] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2020] [Accepted: 11/25/2020] [Indexed: 11/30/2022] Open
Abstract
BACKGROUND The global impact of COVID-19 and the country-specific responses to the pandemic provide an unparalleled opportunity to learn about different patterns of the outbreak and interventions. We model the global pattern of reported COVID-19 cases during the primary response period, with the aim of learning from the past to prepare for the future. METHODS Using Bayesian methods, we analyse the response to the COVID-19 outbreak for 158 countries for the period 22 January to 9 June 2020. This encompasses the period in which many countries imposed a variety of response measures and initial relaxation strategies. Instead of modelling specific intervention types and timings for each country explicitly, we adopt a stochastic epidemiological model including a feedback mechanism on virus transmission to capture complex nonlinear dynamics arising from continuous changes in community behaviour in response to rising case numbers. We analyse the overall effect of interventions and community responses across diverse regions. This approach mitigates explicit consideration of issues such as period of infectivity and public adherence to government restrictions. RESULTS Countries with the largest cumulative case tallies are characterised by a delayed response, whereas countries that avoid substantial community transmission during the period of study responded quickly. Countries that recovered rapidly also have a higher case identification rate and small numbers of undocumented community transmission at the early stages of the outbreak. We also demonstrate that uncertainty in numbers of undocumented infections dramatically impacts the risk of multiple waves. Our approach is also effective at pre-empting potential flare-ups. CONCLUSIONS We demonstrate the utility of modelling to interpret community behaviour in the early epidemic stages. Two lessons learnt that are important for the future are: i) countries that imposed strict containment measures early in the epidemic fared better with respect to numbers of reported cases; and ii) broader testing is required early in the epidemic to understand the magnitude of undocumented infections and recover rapidly. We conclude that clear patterns of containment are essential prior to relaxation of restrictions and show that modelling can provide insights to this end.
Collapse
Affiliation(s)
- David J Warne
- School of Mathematical Sciences, Science and Engineering Faculty, Queensland University of Technology, Brisbane, Australia.
- Centre for Data Science, Queensland University of Technology, Brisbane, Australia.
- Australian Research Council Centre of Excellence for Mathematical and Statistical Frontiers, Brisbane, Australia.
| | - Anthony Ebert
- Institute of Computational Science, Università della Svizzera italiana, Lugano, Switzerland
| | - Christopher Drovandi
- School of Mathematical Sciences, Science and Engineering Faculty, Queensland University of Technology, Brisbane, Australia
- Centre for Data Science, Queensland University of Technology, Brisbane, Australia
- Australian Research Council Centre of Excellence for Mathematical and Statistical Frontiers, Brisbane, Australia
| | - Wenbiao Hu
- School of Public Health and Social Work, Institute of Health and Biomedical Innovation, Queensland University of Technology, Brisbane, Australia
| | - Antonietta Mira
- Institute of Computational Science, Università della Svizzera italiana, Lugano, Switzerland
- Dipartimento di Scienza e Alta Tecnologia, Università dell´Insubria, Varese, Italy
| | - Kerrie Mengersen
- School of Mathematical Sciences, Science and Engineering Faculty, Queensland University of Technology, Brisbane, Australia
- Centre for Data Science, Queensland University of Technology, Brisbane, Australia
- Australian Research Council Centre of Excellence for Mathematical and Statistical Frontiers, Brisbane, Australia
| |
Collapse
|
112
|
Browning AP, Warne DJ, Burrage K, Baker RE, Simpson MJ. Identifiability analysis for stochastic differential equation models in systems biology. J R Soc Interface 2020; 17:20200652. [PMID: 33323054 PMCID: PMC7811582 DOI: 10.1098/rsif.2020.0652] [Citation(s) in RCA: 30] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2020] [Accepted: 11/24/2020] [Indexed: 12/26/2022] Open
Abstract
Mathematical models are routinely calibrated to experimental data, with goals ranging from building predictive models to quantifying parameters that cannot be measured. Whether or not reliable parameter estimates are obtainable from the available data can easily be overlooked. Such issues of parameter identifiability have important ramifications for both the predictive power of a model, and the mechanistic insight that can be obtained. Identifiability analysis is well-established for deterministic, ordinary differential equation (ODE) models, but there are no commonly adopted methods for analysing identifiability in stochastic models. We provide an accessible introduction to identifiability analysis and demonstrate how existing ideas for analysis of ODE models can be applied to stochastic differential equation (SDE) models through four practical case studies. To assess structural identifiability, we study ODEs that describe the statistical moments of the stochastic process using open-source software tools. Using practically motivated synthetic data and Markov chain Monte Carlo methods, we assess parameter identifiability in the context of available data. Our analysis shows that SDE models can often extract more information about parameters than deterministic descriptions. All code used to perform the analysis is available on Github.
Collapse
Affiliation(s)
- Alexander P. Browning
- School of Mathematical Sciences, Queensland University of Technology, Brisbane, Australia
- ARC Centre of Excellence for Mathematical and Statistical Frontiers, Queensland University of Technology, Brisbane, Australia
| | - David J. Warne
- School of Mathematical Sciences, Queensland University of Technology, Brisbane, Australia
- ARC Centre of Excellence for Mathematical and Statistical Frontiers, Queensland University of Technology, Brisbane, Australia
| | - Kevin Burrage
- School of Mathematical Sciences, Queensland University of Technology, Brisbane, Australia
- ARC Centre of Excellence for Mathematical and Statistical Frontiers, Queensland University of Technology, Brisbane, Australia
- ARC Centre of Excellence for Plant Success in Nature and Agriculture, Queensland University of Technology, Brisbane, Australia
- Department of Computer Science, University of Oxford, Oxford, UK
| | - Ruth E. Baker
- Mathematical Institute, University of Oxford, Oxford, UK
| | - Matthew J. Simpson
- School of Mathematical Sciences, Queensland University of Technology, Brisbane, Australia
- ARC Centre of Excellence for Mathematical and Statistical Frontiers, Queensland University of Technology, Brisbane, Australia
| |
Collapse
|
113
|
Loog L. Sometimes hidden but always there: the assumptions underlying genetic inference of demographic histories. Philos Trans R Soc Lond B Biol Sci 2020; 376:20190719. [PMID: 33250022 DOI: 10.1098/rstb.2019.0719] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Demographic processes directly affect patterns of genetic variation within contemporary populations as well as future generations, allowing for demographic inference from patterns of both present-day and past genetic variation. Advances in laboratory procedures, sequencing and genotyping technologies in the past decades have resulted in massive increases in high-quality genome-wide genetic data from present-day populations and allowed retrieval of genetic data from archaeological material, also known as ancient DNA. This has resulted in an explosion of work exploring past changes in population size, structure, continuity and movement. However, as genetic processes are highly stochastic, patterns of genetic variation only indirectly reflect demographic histories. As a result, past demographic processes need to be reconstructed using an inferential approach. This usually involves comparing observed patterns of variation with model expectations from theoretical population genetics. A large number of approaches have been developed based on different population genetic models that each come with assumptions about the data and underlying demography. In this article I review some of the key models and assumptions underlying the most commonly used approaches for past demographic inference and their consequences for our ability to link the inferred demographic processes to the archaeological and climate records. This article is part of the theme issue 'Cross-disciplinary approaches to prehistoric demography'.
Collapse
Affiliation(s)
- Liisa Loog
- Department of Genetics, University of Cambridge, Downing Street, Cambridge CB2 3EH, UK
| |
Collapse
|
114
|
Eckstein MK, Collins AGE. Computational evidence for hierarchically structured reinforcement learning in humans. Proc Natl Acad Sci U S A 2020; 117:29381-29389. [PMID: 33229518 PMCID: PMC7703642 DOI: 10.1073/pnas.1912330117] [Citation(s) in RCA: 34] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
Humans have the fascinating ability to achieve goals in a complex and constantly changing world, still surpassing modern machine-learning algorithms in terms of flexibility and learning speed. It is generally accepted that a crucial factor for this ability is the use of abstract, hierarchical representations, which employ structure in the environment to guide learning and decision making. Nevertheless, how we create and use these hierarchical representations is poorly understood. This study presents evidence that human behavior can be characterized as hierarchical reinforcement learning (RL). We designed an experiment to test specific predictions of hierarchical RL using a series of subtasks in the realm of context-based learning and observed several behavioral markers of hierarchical RL, such as asymmetric switch costs between changes in higher-level versus lower-level features, faster learning in higher-valued compared to lower-valued contexts, and preference for higher-valued compared to lower-valued contexts. We replicated these results across three independent samples. We simulated three models-a classic RL, a hierarchical RL, and a hierarchical Bayesian model-and compared their behavior to human results. While the flat RL model captured some aspects of participants' sensitivity to outcome values, and the hierarchical Bayesian model captured some markers of transfer, only hierarchical RL accounted for all patterns observed in human behavior. This work shows that hierarchical RL, a biologically inspired and computationally simple algorithm, can capture human behavior in complex, hierarchical environments and opens the avenue for future research in this field.
Collapse
Affiliation(s)
- Maria K Eckstein
- Department of Psychology, University of California, Berkeley, CA 94704
| | - Anne G E Collins
- Department of Psychology, University of California, Berkeley, CA 94704
| |
Collapse
|
115
|
White DM, Huang JP, Jara-Muñoz OA, MadriñáN S, Ree RH, Mason-Gamer RJ. The Origins of Coca: Museum Genomics Reveals Multiple Independent Domestications from Progenitor Erythroxylum gracilipes. Syst Biol 2020; 70:1-13. [PMID: 32979264 PMCID: PMC7744036 DOI: 10.1093/sysbio/syaa074] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2020] [Revised: 09/08/2020] [Accepted: 09/13/2020] [Indexed: 11/21/2022] Open
Abstract
Coca is the natural source of cocaine as well as a sacred and medicinal plant farmed by South American Amerindians and mestizos. The coca crop comprises four closely related varieties classified into two species (Amazonian and Huánuco varieties within Erythroxylum coca Lam., and Colombian and Trujillo varieties within Erythroxylum novogranatense (D. Morris) Hieron.) but our understanding of the domestication and evolutionary history of these taxa is nominal. In this study, we use genomic data from natural history collections to estimate the geographic origins and genetic diversity of this economically and culturally important crop in the context of its wild relatives. Our phylogeographic analyses clearly demonstrate the four varieties of coca comprise two or three exclusive groups nested within the diverse lineages of the widespread, wild species Erythroxylum gracilipes; establishing a new and robust hypothesis of domestication wherein coca originated two or three times from this wild progenitor. The Colombian and Trujillo coca varieties are descended from a single, ancient domestication event in northwestern South America. Huánuco coca was domesticated more recently, possibly in southeastern Peru. Amazonian coca either shares a common domesticated ancestor with Huánuco coca, or it was the product of a third and most recent independent domestication event in the western Amazon basin. This chronology of coca domestication reveals different Holocene peoples in South America were able to independently transform the same natural resource to serve their needs; in this case, a workaday stimulant. [Erythroxylum; Erythroxylaceae; Holocene; Museomics; Neotropics; phylogeography; plant domestication; target-sequence capture.]
Collapse
Affiliation(s)
- Dawson M White
- Department of Biological Sciences, University of Illinois at Chicago, Chicago, IL 60607, USA.,Grainger Bioinformatics Center, The Field Museum, Chicago, IL 60605, USA
| | - Jen-Pan Huang
- Biodiversity Research Center, Academia Sinica, Taipei 11529, Taiwan
| | | | - Santiago MadriñáN
- Laboratorio de Botánica y Sistemática, Departamento de Ciencias Biológicas, Universidad de los Andes, Bogotá D.C., Colombia.,Jardín Botánico de Cartagena "Guillermo Piñeres", Turbaco, Bolívar, Colombia
| | - Richard H Ree
- Grainger Bioinformatics Center, The Field Museum, Chicago, IL 60605, USA
| | - Roberta J Mason-Gamer
- Department of Biological Sciences, University of Illinois at Chicago, Chicago, IL 60607, USA
| |
Collapse
|
116
|
Nuñez JJ, Suárez-Villota EY, Quercia CA, Olivares AP, Sites JW. Phylogeographic analysis and species distribution modelling of the wood frog Batrachyla leptopus (Batrachylidae) reveal interglacial diversification in south western Patagonia. PeerJ 2020; 8:e9980. [PMID: 33083116 PMCID: PMC7546244 DOI: 10.7717/peerj.9980] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2020] [Accepted: 08/27/2020] [Indexed: 01/07/2023] Open
Abstract
Background The evolutionary history of southern South American organisms has been strongly influenced by Pleistocene climate oscillations. Amphibians are good models to evaluate hypotheses about the influence of these climate cycles on population structure and diversification of the biota, because they are sensitive to environmental changes and have restricted dispersal capabilities. We test hypotheses regarding putative forest refugia and expansion events associated with past climatic changes in the wood frog Batrachyla leptopus distributed along ∼1,000 km of length including glaciated and non-glaciated areas in southwestern Patagonia. Methods Using three mitochondrial regions (D-loop, cyt b, and coI) and two nuclear loci (pomc and crybA1), we conducted multilocus phylogeographic analyses and species distribution modelling to gain insights of the evolutionary history of this species. Intraspecific genealogy was explored with maximum likelihood, Bayesian, and phylogenetic network approaches. Diversification time was assessed using molecular clock models in a Bayesian framework, and demographic scenarios were evaluated using approximate Bayesian computation (ABC) and extended Bayesian skyline plot (EBSP). Species distribution models (SDM) were reconstructed using climatic and geographic data. Results Population structure and genealogical analyses support the existence of four lineages distributed north to south, with moderate to high phylogenetic support (Bootstrap > 70%; BPP > 0.92). The diversification time of B. leptopus’ populations began at ∼0.107 mya. The divergence between A and B lineages would have occurred by the late Pleistocene, approximately 0.068 mya, and divergence between C and D lineages was approximately 0.065 mya. The ABC simulations indicate that lineages coalesced at two different time periods, suggesting the presence of at least two glacial refugia and a postglacial colonization route that may have generated two southern lineages (p = 0.93, type I error: <0.094, type II error: 0.134). EBSP, mismatch distribution and neutrality indexes suggest sudden population expansion at ∼0.02 mya for all lineages. SDM infers fragmented distributions of B. leptopus associated with Pleistocene glaciations. Although the present populations of B. leptopus are found in zones affected by the last glacial maximum (∼0.023 mya), our analyses recover an older history of interglacial diversification (0.107–0.019 mya). In addition, we hypothesize two glacial refugia and three interglacial colonization routes, one of which gave rise to two expanding lineages in the south.
Collapse
Affiliation(s)
- José J Nuñez
- Instituto de Ciencias Marinas y Limnológicas, Facultad de Ciencias, Universidad Austral de Chile, Valdivia, Región de Los Ríos, Chile
| | - Elkin Y Suárez-Villota
- Instituto de Ciencias Naturales, Facultad de Medicina Veterinaria y Agronomía, Universidad de Las Américas, Concepción, Región del Bio-Bío, Chile
| | - Camila A Quercia
- Instituto de Ciencias Marinas y Limnológicas, Facultad de Ciencias, Universidad Austral de Chile, Valdivia, Región de Los Ríos, Chile
| | - Angel P Olivares
- Instituto de Ciencias Marinas y Limnológicas, Facultad de Ciencias, Universidad Austral de Chile, Valdivia, Región de Los Ríos, Chile
| | - Jack W Sites
- Department of Biology and M.L. Bean Life Science Museum, Brigham Young University, Provo, UT, United States of America.,Current affiliation: Department of Biology, Austin Peay St University, Clarksville, TN, United States of America
| |
Collapse
|
117
|
Xu L, Van Doorn S, Hildenbrandt H, Etienne RS. Inferring the Effect of Species Interactions on Trait Evolution. Syst Biol 2020; 70:463-479. [PMID: 32960972 PMCID: PMC8048392 DOI: 10.1093/sysbio/syaa072] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2020] [Revised: 09/01/2020] [Accepted: 09/08/2020] [Indexed: 11/18/2022] Open
Abstract
Models of trait evolution form an important part of macroevolutionary biology. The Brownian motion model and Ornstein–Uhlenbeck models have become classic (null) models of character evolution, in which species evolve independently. Recently, models incorporating species interactions have been developed, particularly involving competition where abiotic factors pull species toward an optimal trait value and competitive interactions drive the trait values apart. However, these models assume a fitness function rather than derive it from population dynamics and they do not consider dynamics of the trait variance. Here, we develop a general coherent trait evolution framework where the fitness function is based on a model of population dynamics, and therefore it can, in principle, accommodate any type of species interaction. We illustrate our framework with a model of abundance-dependent competitive interactions against a macroevolutionary background encoded in a phylogenetic tree. We develop an inference tool based on Approximate Bayesian Computation and test it on simulated data (of traits at the tips). We find that inference performs well when the diversity predicted by the parameters equals the number of species in the phylogeny. We then fit the model to empirical data of baleen whale body lengths, using three different summary statistics, and compare it to a model without population dynamics and a model where competition depends on the total metabolic rate of the competitors. We show that the unweighted model performs best for the least informative summary statistic, while the model with competition weighted by the total metabolic rate fits the data slightly better than the other two models for the two more informative summary statistics. Regardless of the summary statistic used, the three models substantially differ in their predictions of the abundance distribution. Therefore, data on abundance distributions will allow us to better distinguish the models from one another, and infer the nature of species interactions. Thus, our framework provides a conceptual approach to reveal species interactions underlying trait evolution and identifies the data needed to do so in practice. [Approximate Bayesian computation; competition; phylogeny; population dynamics; simulations; species interaction; trait evolution.]
Collapse
Affiliation(s)
- Liang Xu
- Faculty of Science and Engineering, Groningen Institute for Evolutionary Life Sciences, University of Groningen, PO Box 11103, Groningen 9700 CC, The Netherlands
| | - Sander Van Doorn
- Faculty of Science and Engineering, Groningen Institute for Evolutionary Life Sciences, University of Groningen, PO Box 11103, Groningen 9700 CC, The Netherlands
| | - Hanno Hildenbrandt
- Faculty of Science and Engineering, Groningen Institute for Evolutionary Life Sciences, University of Groningen, PO Box 11103, Groningen 9700 CC, The Netherlands
| | - Rampal S Etienne
- Faculty of Science and Engineering, Groningen Institute for Evolutionary Life Sciences, University of Groningen, PO Box 11103, Groningen 9700 CC, The Netherlands
| |
Collapse
|
118
|
Joslyn LR, Kirschner DE, Linderman JJ. CaliPro: A Calibration Protocol That Utilizes Parameter Density Estimation to Explore Parameter Space and Calibrate Complex Biological Models. Cell Mol Bioeng 2020; 14:31-47. [PMID: 33643465 DOI: 10.1007/s12195-020-00650-z] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2020] [Accepted: 09/02/2020] [Indexed: 12/15/2022] Open
Abstract
Introduction Mathematical and computational modeling have a long history of uncovering mechanisms and making predictions for biological systems. However, to create a model that can provide relevant quantitative predictions, models must first be calibrated by recapitulating existing biological datasets from that system. Current calibration approaches may not be appropriate for complex biological models because: 1) many attempt to recapitulate only a single aspect of the experimental data (such as a median trend) or 2) Bayesian techniques require specification of parameter priors and likelihoods to experimental data that cannot always be confidently assigned. A new calibration protocol is needed to calibrate complex models when current approaches fall short. Methods Herein, we develop CaliPro, an iterative, model-agnostic calibration protocol that utilizes parameter density estimation to refine parameter space and calibrate to temporal biological datasets. An important aspect of CaliPro is the user-defined pass set definition, which specifies how the model might successfully recapitulate experimental data. We define the appropriate settings to use CaliPro. Results We illustrate the usefulness of CaliPro through four examples including predator-prey, infectious disease transmission, and immune response models. We show that CaliPro works well for both deterministic, continuous model structures as well as stochastic, discrete models and illustrate that CaliPro can work across diverse calibration goals. Conclusions We present CaliPro, a new method for calibrating complex biological models to a range of experimental outcomes. In addition to expediting calibration, CaliPro may be useful in already calibrated parameter spaces to target and isolate specific model behavior for further analysis.
Collapse
Affiliation(s)
- Louis R Joslyn
- Department of Chemical Engineering, University of Michigan, G045W NCRC B28, 2800 Plymouth Rd, Ann Arbor, MI 48109-2136 USA.,Department of Microbiology and Immunology, University of Michigan Medical School, 1150 W Medical Center Drive, 5641 Medical Science II, Ann Arbor, MI 48109-5620 USA
| | - Denise E Kirschner
- Department of Microbiology and Immunology, University of Michigan Medical School, 1150 W Medical Center Drive, 5641 Medical Science II, Ann Arbor, MI 48109-5620 USA
| | - Jennifer J Linderman
- Department of Chemical Engineering, University of Michigan, G045W NCRC B28, 2800 Plymouth Rd, Ann Arbor, MI 48109-2136 USA
| |
Collapse
|
119
|
Davis JT, Chinazzi M, Perra N, Mu K, Piontti APY, Ajelli M, Dean NE, Gioannini C, Litvinova M, Merler S, Rossi L, Sun K, Xiong X, Halloran ME, Longini IM, Viboud C, Vespignani A. Estimating the establishment of local transmission and the cryptic phase of the COVID-19 pandemic in the USA. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2020:2020.07.06.20140285. [PMID: 32676609 PMCID: PMC7359534 DOI: 10.1101/2020.07.06.20140285] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
We use a global metapopulation transmission model to study the establishment of sustained and undetected community transmission of the COVID-19 pandemic in the United States. The model is calibrated on international case importations from mainland China and takes into account travel restrictions to and from international destinations. We estimate widespread community transmission of SARS-CoV-2 in February, 2020. Modeling results indicate international travel as the key driver of the introduction of SARS-CoV-2 in the West and East Coast metropolitan areas that could have been seeded as early as late-December, 2019. For most of the continental states the largest contribution of imported infections arrived through domestic travel flows.
Collapse
Affiliation(s)
- Jessica T. Davis
- laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, MA USA
| | - Matteo Chinazzi
- laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, MA USA
| | - Nicola Perra
- laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, MA USA
- Networks and Urban Systems Centre, University of Greenwich, London, UK
| | - Kunpeng Mu
- laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, MA USA
| | - Ana Pastore y Piontti
- laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, MA USA
| | - Marco Ajelli
- Bruno Kessler Foundation, Trento Italy
- Department of Epidemiology and Biostatistics, Indiana University School of Public Health, Bloomington, IN, USA
| | - Natalie E. Dean
- Department of Biostatistics, College of Public Health and Health Professions, University of Florida, Gainesville, USA
| | | | | | | | | | | | - Xinyue Xiong
- laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, MA USA
| | - M. Elizabeth Halloran
- Fred Hutchinson Cancer Research Center, Seattle, WA, USA
- Department of Biostatistics, University of Washington, Seattle, WA. USA
| | - Ira M. Longini
- Department of Biostatistics, College of Public Health and Health Professions, University of Florida, Gainesville, USA
| | | | - Alessandro Vespignani
- laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, MA USA
- ISI Foundation, Turin, Italy
| |
Collapse
|
120
|
Liu D, Clemente L, Poirier C, Ding X, Chinazzi M, Davis J, Vespignani A, Santillana M. Real-Time Forecasting of the COVID-19 Outbreak in Chinese Provinces: Machine Learning Approach Using Novel Digital Data and Estimates From Mechanistic Models. J Med Internet Res 2020; 22:e20285. [PMID: 32730217 PMCID: PMC7459435 DOI: 10.2196/20285] [Citation(s) in RCA: 29] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2020] [Revised: 07/24/2020] [Accepted: 07/24/2020] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND The inherent difficulty of identifying and monitoring emerging outbreaks caused by novel pathogens can lead to their rapid spread; and if left unchecked, they may become major public health threats to the planet. The ongoing coronavirus disease (COVID-19) outbreak, which has infected over 2,300,000 individuals and caused over 150,000 deaths, is an example of one of these catastrophic events. OBJECTIVE We present a timely and novel methodology that combines disease estimates from mechanistic models and digital traces, via interpretable machine learning methodologies, to reliably forecast COVID-19 activity in Chinese provinces in real time. METHODS Our method uses the following as inputs: (a) official health reports, (b) COVID-19-related internet search activity, (c) news media activity, and (d) daily forecasts of COVID-19 activity from a metapopulation mechanistic model. Our machine learning methodology uses a clustering technique that enables the exploitation of geospatial synchronicities of COVID-19 activity across Chinese provinces and a data augmentation technique to deal with the small number of historical disease observations characteristic of emerging outbreaks. RESULTS Our model is able to produce stable and accurate forecasts 2 days ahead of the current time and outperforms a collection of baseline models in 27 out of 32 Chinese provinces. CONCLUSIONS Our methodology could be easily extended to other geographies currently affected by COVID-19 to aid decision makers with monitoring and possibly prevention.
Collapse
Affiliation(s)
- Dianbo Liu
- Computational Health Informatics Program, Boston Children's Hospital, Boston, MA, United States
- Department of Pediatrics, Harvard Medical School, Boston, MA, United States
| | - Leonardo Clemente
- Computational Health Informatics Program, Boston Children's Hospital, Boston, MA, United States
- Department of Pediatrics, Harvard Medical School, Boston, MA, United States
- Tecnologico de Monterrey, Monterrey, Mexico
| | - Canelle Poirier
- Computational Health Informatics Program, Boston Children's Hospital, Boston, MA, United States
- Department of Pediatrics, Harvard Medical School, Boston, MA, United States
| | - Xiyu Ding
- Computational Health Informatics Program, Boston Children's Hospital, Boston, MA, United States
- Harvard TH Chan School of Public Health, Boston, MA, United States
| | - Matteo Chinazzi
- Laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, MA, United States
| | - Jessica Davis
- Laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, MA, United States
| | - Alessandro Vespignani
- Laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, MA, United States
- ISI Foundation, Turin, Italy
| | - Mauricio Santillana
- Computational Health Informatics Program, Boston Children's Hospital, Boston, MA, United States
- Department of Pediatrics, Harvard Medical School, Boston, MA, United States
- Harvard TH Chan School of Public Health, Boston, MA, United States
| |
Collapse
|
121
|
Harrison JU, Baker RE. An automatic adaptive method to combine summary statistics in approximate Bayesian computation. PLoS One 2020; 15:e0236954. [PMID: 32760106 PMCID: PMC7410215 DOI: 10.1371/journal.pone.0236954] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2020] [Accepted: 07/16/2020] [Indexed: 11/18/2022] Open
Abstract
To infer the parameters of mechanistic models with intractable likelihoods, techniques such as approximate Bayesian computation (ABC) are increasingly being adopted. One of the main disadvantages of ABC in practical situations, however, is that parameter inference must generally rely on summary statistics of the data. This is particularly the case for problems involving high-dimensional data, such as biological imaging experiments. However, some summary statistics contain more information about parameters of interest than others, and it is not always clear how to weight their contributions within the ABC framework. We address this problem by developing an automatic, adaptive algorithm that chooses weights for each summary statistic. Our algorithm aims to maximize the distance between the prior and the approximate posterior by automatically adapting the weights within the ABC distance function. Computationally, we use a nearest neighbour estimator of the distance between distributions. We justify the algorithm theoretically based on properties of the nearest neighbour distance estimator. To demonstrate the effectiveness of our algorithm, we apply it to a variety of test problems, including several stochastic models of biochemical reaction networks, and a spatial model of diffusion, and compare our results with existing algorithms.
Collapse
Affiliation(s)
- Jonathan U. Harrison
- Mathematical Institute, Mathematical Sciences Building, University of Warwick, Coventry, United Kingdom
- * E-mail:
| | - Ruth E. Baker
- Mathematical Institute, Andrew Wiles Building, University of Oxford, Oxford, United Kingdom
| |
Collapse
|
122
|
Lynn MB, Lee KFH, Soares C, Naud R, Béïque JC. A Synthetic Likelihood Solution to the Silent Synapse Estimation Problem. Cell Rep 2020; 32:107916. [PMID: 32697998 DOI: 10.1016/j.celrep.2020.107916] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2020] [Revised: 05/04/2020] [Accepted: 06/25/2020] [Indexed: 11/19/2022] Open
Abstract
Functional features of synaptic populations are typically inferred from random electrophysiological sampling of small subsets of synapses. Are these samples unbiased? Here, we develop a biophysically constrained statistical framework to address this question and apply it to assess the performance of a widely used method based on a failure-rate analysis to quantify the occurrence of silent (AMPAR-lacking) synapses. We simulate this method in silico and find that it is characterized by strong and systematic biases, poor reliability, and weak statistical power. Key conclusions are validated by whole-cell recordings from hippocampal neurons. To address these shortcomings, we develop a simulator of the experimental protocol and use it to compute a synthetic likelihood. By maximizing the likelihood, we infer silent synapse fraction with no bias, low variance, and superior statistical power over alternatives. Together, this generalizable approach highlights how a simulator of experimental methodologies can substantially improve the estimation of physiological properties.
Collapse
Affiliation(s)
- Michael B Lynn
- Department of Cellular and Molecular Medicine, University of Ottawa, Ottawa, ON K1H 8M5, Canada
| | - Kevin F H Lee
- Department of Cellular and Molecular Medicine, University of Ottawa, Ottawa, ON K1H 8M5, Canada
| | - Cary Soares
- Department of Cellular and Molecular Medicine, University of Ottawa, Ottawa, ON K1H 8M5, Canada
| | - Richard Naud
- Department of Cellular and Molecular Medicine, University of Ottawa, Ottawa, ON K1H 8M5, Canada; Centre for Neural Dynamics, University of Ottawa, Ottawa, ON K1H 8M5, Canada; University of Ottawa's Brain and Mind Research Institute, Ottawa, ON K1H 8M5, Canada; Department of Physics, STEM Complex, Room 336, 150 Louis Pasteur Private, University of Ottawa, Ottawa, ON K1N 6N5, Canada.
| | - Jean-Claude Béïque
- Department of Cellular and Molecular Medicine, University of Ottawa, Ottawa, ON K1H 8M5, Canada; Canadian Partnership for Stroke Recovery, University of Ottawa, Ottawa, ON K1H 8M5, Canada; Centre for Neural Dynamics, University of Ottawa, Ottawa, ON K1H 8M5, Canada; University of Ottawa's Brain and Mind Research Institute, Ottawa, ON K1H 8M5, Canada.
| |
Collapse
|
123
|
Lasri A, Juric V, Verreault M, Bielle F, Idbaih A, Kel A, Murphy B, Sturrock M. Phenotypic selection through cell death: stochastic modelling of O-6-methylguanine-DNA methyltransferase dynamics. ROYAL SOCIETY OPEN SCIENCE 2020; 7:191243. [PMID: 32874597 PMCID: PMC7428254 DOI: 10.1098/rsos.191243] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/17/2019] [Accepted: 06/17/2020] [Indexed: 05/11/2023]
Abstract
Glioblastoma (GBM) is the most aggressive malignant primary brain tumour with a median overall survival of 15 months. To treat GBM, patients currently undergo a surgical resection followed by exposure to radiotherapy and concurrent and adjuvant temozolomide (TMZ) chemotherapy. However, this protocol often leads to treatment failure, with drug resistance being the main reason behind this. To date, many studies highlight the role of O-6-methylguanine-DNA methyltransferase (MGMT) in conferring drug resistance. The mechanism through which MGMT confers resistance is not well studied-particularly in terms of computational models. With only a few reasonable biological assumptions, we were able to show that even a minimal model of MGMT expression could robustly explain TMZ-mediated drug resistance. In particular, we showed that for a wide range of parameter values constrained by novel cell growth and viability assays, a model accounting for only stochastic gene expression of MGMT coupled with cell growth, division, partitioning and death was able to exhibit phenotypic selection of GBM cells expressing MGMT in response to TMZ. Furthermore, we found this selection allowed the cells to pass their acquired phenotypic resistance onto daughter cells in a stable manner (as long as TMZ is provided). This suggests that stochastic gene expression alone is enough to explain the development of chemotherapeutic resistance.
Collapse
Affiliation(s)
- Ayoub Lasri
- Department of Physiology and Medical Physics, Royal College of Surgeons in Ireland, York House, Dublin, Ireland
| | - Viktorija Juric
- Department of Physiology and Medical Physics, Royal College of Surgeons in Ireland, York House, Dublin, Ireland
| | - Maité Verreault
- Inserm U 1127, CNRS UMR 7225, Sorbonne Université, Institut du Cerveau et de la Moelle épinière, ICM, 75013 Paris, France
| | - Franck Bielle
- Sorbonne Université, Inserm, CNRS, UMR S 1127, Institut du Cerveau et de la Moelle épinière, ICM, AP-HP, Hôpitaux Universitaires La Pitié Salpêtrière – Charles Foix, Service de Neurologie 2-Mazarin, 75013 Paris, France
| | - Ahmed Idbaih
- Sorbonne Université, Inserm, CNRS, UMR S 1127, Institut du Cerveau et de la Moelle épinière, ICM, AP-HP, Hôpitaux Universitaires La Pitié Salpêtrière – Charles Foix, Service de Neurologie 2-Mazarin, 75013 Paris, France
| | - Alexander Kel
- Department of Research and Development, geneXplain GmbH, Wolfenbüttel 38302, Germany
- Laboratory of Pharmacogenomics, Institute of Chemical Biology and Fundamental Medicine, Novosibirsk 630090, Russia
| | - Brona Murphy
- Department of Physiology and Medical Physics, Royal College of Surgeons in Ireland, York House, Dublin, Ireland
| | - Marc Sturrock
- Department of Physiology and Medical Physics, Royal College of Surgeons in Ireland, York House, Dublin, Ireland
| |
Collapse
|
124
|
Houston C, Marchand B, Engelbert L, Cantwell CD. Reducing complexity and unidentifiability when modelling human atrial cells. PHILOSOPHICAL TRANSACTIONS. SERIES A, MATHEMATICAL, PHYSICAL, AND ENGINEERING SCIENCES 2020. [PMID: 32448063 DOI: 10.5061/dryad.p2ngf1vmc] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Subscribe] [Scholar Register] [Indexed: 05/10/2023]
Abstract
Mathematical models of a cellular action potential (AP) in cardiac modelling have become increasingly complex, particularly in gating kinetics, which control the opening and closing of individual ion channel currents. As cardiac models advance towards use in personalized medicine to inform clinical decision-making, it is critical to understand the uncertainty hidden in parameter estimates from their calibration to experimental data. This study applies approximate Bayesian computation to re-calibrate the gating kinetics of four ion channels in two existing human atrial cell models to their original datasets, providing a measure of uncertainty and indication of potential issues with selecting a single unique value given the available experimental data. Two approaches are investigated to reduce the uncertainty present: re-calibrating the models to a more complete dataset and using a less complex formulation with fewer parameters to constrain. The re-calibrated models are inserted back into the full cell model to study the overall effect on the AP. The use of more complete datasets does not eliminate uncertainty present in parameter estimates. The less complex model, particularly for the fast sodium current, gave a better fit to experimental data alongside lower parameter uncertainty and improved computational speed. This article is part of the theme issue 'Uncertainty quantification in cardiac and cardiovascular modelling and simulation'.
Collapse
Affiliation(s)
- C Houston
- ElectroCardioMaths Programme, Centre for Cardiac Engineering, Imperial College, London, UK
- Department of Aeronautics, Imperial College, London, UK
| | - B Marchand
- Department of Aeronautics, Imperial College, London, UK
| | - L Engelbert
- Department of Aeronautics, Imperial College, London, UK
| | - C D Cantwell
- ElectroCardioMaths Programme, Centre for Cardiac Engineering, Imperial College, London, UK
- Department of Aeronautics, Imperial College, London, UK
| |
Collapse
|
125
|
Houston C, Marchand B, Engelbert L, Cantwell CD. Reducing complexity and unidentifiability when modelling human atrial cells. PHILOSOPHICAL TRANSACTIONS. SERIES A, MATHEMATICAL, PHYSICAL, AND ENGINEERING SCIENCES 2020; 378:20190339. [PMID: 32448063 PMCID: PMC7287336 DOI: 10.1098/rsta.2019.0339] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/07/2023]
Abstract
Mathematical models of a cellular action potential (AP) in cardiac modelling have become increasingly complex, particularly in gating kinetics, which control the opening and closing of individual ion channel currents. As cardiac models advance towards use in personalized medicine to inform clinical decision-making, it is critical to understand the uncertainty hidden in parameter estimates from their calibration to experimental data. This study applies approximate Bayesian computation to re-calibrate the gating kinetics of four ion channels in two existing human atrial cell models to their original datasets, providing a measure of uncertainty and indication of potential issues with selecting a single unique value given the available experimental data. Two approaches are investigated to reduce the uncertainty present: re-calibrating the models to a more complete dataset and using a less complex formulation with fewer parameters to constrain. The re-calibrated models are inserted back into the full cell model to study the overall effect on the AP. The use of more complete datasets does not eliminate uncertainty present in parameter estimates. The less complex model, particularly for the fast sodium current, gave a better fit to experimental data alongside lower parameter uncertainty and improved computational speed. This article is part of the theme issue 'Uncertainty quantification in cardiac and cardiovascular modelling and simulation'.
Collapse
Affiliation(s)
- C. Houston
- ElectroCardioMaths Programme, Centre for Cardiac Engineering, Imperial College, London, UK
- Department of Aeronautics, Imperial College, London, UK
- e-mail:
| | - B. Marchand
- Department of Aeronautics, Imperial College, London, UK
| | - L. Engelbert
- Department of Aeronautics, Imperial College, London, UK
| | - C. D. Cantwell
- ElectroCardioMaths Programme, Centre for Cardiac Engineering, Imperial College, London, UK
- Department of Aeronautics, Imperial College, London, UK
| |
Collapse
|
126
|
Probabilistic Updating of Structural Models for Damage Assessment Using Approximate Bayesian Computation. SENSORS 2020; 20:s20113197. [PMID: 32512897 PMCID: PMC7308976 DOI: 10.3390/s20113197] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/11/2020] [Revised: 06/01/2020] [Accepted: 06/03/2020] [Indexed: 11/16/2022]
Abstract
A novel probabilistic approach for model updating based on approximate Bayesian computation with subset simulation (ABC-SubSim) is proposed for damage assessment of structures using modal data. The ABC-SubSim is a likelihood-free Bayesian approach in which the explicit expression of likelihood function is avoided and the posterior samples of model parameters are obtained using the technique of subset simulation. The novel contributions of this paper are on three fronts: one is the introduction of some new stopping criteria to find an appropriate tolerance level for the metric used in the ABC-SubSim; the second one is the employment of a hybrid optimization scheme to find finer optimal values for the model parameters; and the last one is the adoption of an iterative approach to determine the optimal weighting factors related to the residuals of modal frequency and mode shape in the metric. The effectiveness of this approach is demonstrated using three illustrative examples.
Collapse
|
127
|
Garre A, Zwietering MH, den Besten HMW. Multilevel modelling as a tool to include variability and uncertainty in quantitative microbiology and risk assessment. Thermal inactivation of Listeria monocytogenes as proof of concept. Food Res Int 2020; 137:109374. [PMID: 33233076 DOI: 10.1016/j.foodres.2020.109374] [Citation(s) in RCA: 32] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2020] [Revised: 05/27/2020] [Accepted: 05/31/2020] [Indexed: 12/13/2022]
Abstract
Variability is inherent in biology and also substantial for microbial populations. In the context of food safety risk assessment, it refers to differences in the response of different bacterial strains (between-strain variability) and different cells (within-strain variability) to the same condition (e.g. inactivation treatment). However, its quantification based on empirical observations and its incorporation in predictive models is a challenge for both experimental design and (statistical) analysis. In this article we propose the use of multilevel models to quantify (different levels of) variability and uncertainty and include them in the predictions. As proof of concept, we analyse the microbial inactivation of Listeria monocytogenes to thermal treatments including different levels of variability (between-strain and within-strain) and uncertainty. The relationship between the microbial count and time was expressed using a (non-linear) Weibullian model. Moreover, we defined stochastic hypotheses to describe the different types of variation at the level of the kinetic parameters, as well as in the observations (microbial counts). The model parameters (kinetic parameters and variances) are estimated using Bayesian statistics. The multilevel approach was compared against an analogous, single-level model. The multilevel methodology shrinks extreme parameter estimates towards the mean according to uncertainty, thus mitigating overfitting. In addition, this approach enables to easily incorporate different levels of variation (between-strain and/or within-strain variability and/or uncertainty) in the predictions. On the other hand, multilevel (Bayesian) models are more complex to define, implement, analyse and communicate than single-level models. Nevertheless, their ability to incorporate different sources of variability in predictions make them very suitable for Quantitative Microbial Risk Assessment.
Collapse
Affiliation(s)
- Alberto Garre
- Food Microbiology, Wageningen University & Research, P.O. Box 17, 6700 AA Wageningen, the Netherlands
| | - Marcel H Zwietering
- Food Microbiology, Wageningen University & Research, P.O. Box 17, 6700 AA Wageningen, the Netherlands
| | - Heidy M W den Besten
- Food Microbiology, Wageningen University & Research, P.O. Box 17, 6700 AA Wageningen, the Netherlands.
| |
Collapse
|
128
|
Esteller-Cucala P, Maceda I, Børglum AD, Demontis D, Faraone SV, Cormand B, Lao O. Genomic analysis of the natural history of attention-deficit/hyperactivity disorder using Neanderthal and ancient Homo sapiens samples. Sci Rep 2020; 10:8622. [PMID: 32451437 PMCID: PMC7248073 DOI: 10.1038/s41598-020-65322-4] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2019] [Accepted: 04/24/2020] [Indexed: 11/18/2022] Open
Abstract
Attention-deficit/hyperactivity disorder (ADHD) is an impairing neurodevelopmental condition highly prevalent in current populations. Several hypotheses have been proposed to explain this paradox, mainly in the context of the Paleolithic versus Neolithic cultural shift but especially within the framework of the mismatch theory. This theory elaborates on how a particular trait once favoured in an ancient environment might become maladaptive upon environmental changes. However, given the lack of genomic data available for ADHD, these theories have not been empirically tested. We took advantage of the largest GWAS meta-analysis available for this disorder consisting of over 20,000 individuals diagnosed with ADHD and 35,000 controls, to assess the evolution of ADHD-associated alleles in European populations using archaic, ancient and modern human samples. We also included Approximate Bayesian computation coupled with deep learning analyses and singleton density scores to detect human adaptation. Our analyses indicate that ADHD-associated alleles are enriched in loss of function intolerant genes, supporting the role of selective pressures in this early-onset phenotype. Furthermore, we observed that the frequency of variants associated with ADHD has steadily decreased since Paleolithic times, particularly in Paleolithic European populations compared to samples from the Neolithic Fertile Crescent. We demonstrate this trend cannot be explained by African admixture nor Neanderthal introgression, since introgressed Neanderthal alleles are enriched in ADHD risk variants. All analyses performed support the presence of long-standing selective pressures acting against ADHD-associated alleles until recent times. Overall, our results are compatible with the mismatch theory for ADHD but suggest a much older time frame for the evolution of ADHD-associated alleles compared to previous hypotheses.
Collapse
Affiliation(s)
- Paula Esteller-Cucala
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
- Institut de Biologia Evolutiva (UPF-CSIC), Barcelona, Spain
| | - Iago Maceda
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
| | - Anders D Børglum
- The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, Aarhus, Denmark
- Centre for Integrative Sequencing, iSEQ, and Aarhus Genome Centre, Aarhus, Denmark
- Department of Biomedicine - Human Genetics, Aarhus University, Aarhus, Denmark
| | - Ditte Demontis
- The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, Aarhus, Denmark
- Centre for Integrative Sequencing, iSEQ, and Aarhus Genome Centre, Aarhus, Denmark
- Department of Biomedicine - Human Genetics, Aarhus University, Aarhus, Denmark
| | - Stephen V Faraone
- Departments of Psychiatry and of Neuroscience and Physiology, SUNY Upstate Medical University, Syracuse, NY, USA
| | - Bru Cormand
- Departament de Genètica, Microbiologia i Estadística, Facultat de Biologia, Universitat de Barcelona, Barcelona, Spain.
- Centro de Investigación Biomédica en Red de Enfermedades Raras (CIBERER), Instituto de Salud Carlos III, Madrid, Spain.
- Institut de Biomedicina de la Universitat de Barcelona (IBUB), Barcelona, Spain.
- Institut de Recerca Sant Joan de Déu (IR-SJD), Esplugues de Llobregat, Spain.
| | - Oscar Lao
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain.
- Universitat Pompeu Fabra (UPF), Barcelona, Spain.
| |
Collapse
|
129
|
Kacprzak T, Herbel J, Nicola A, Sgier R, Tarsitano F, Bruderer C, Amara A, Refregier A, Bridle S, Drlica-Wagner A, Gruen D, Hartley W, Hoyle B, Secco L, Zuntz J, Annis J, Avila S, Bertin E, Brooks D, Buckley-Geer E, Carnero Rosell A, Carrasco Kind M, Carretero J, da Costa L, De Vicente J, Desai S, Diehl H, Doel P, García-Bellido J, Gaztanaga E, Gruendl R, Gschwend J, Gutierrez G, Hollowood D, Honscheid K, James D, Jarvis M, Lima M, Maia M, Marshall J, Melchior P, Menanteau F, Miquel R, Paz-Chinchón F, Plazas A, Sanchez E, Scarpine V, Serrano S, Sevilla-Noarbe I, Smith M, Suchyta E, Swanson M, Tarle G, Vikram V, Weller J. Monte Carlo control loops for cosmic shear cosmology with DES Year 1 data. Int J Clin Exp Med 2020. [DOI: 10.1103/physrevd.101.082003] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
130
|
Chinazzi M, Davis JT, Ajelli M, Gioannini C, Litvinova M, Merler S, Pastore Y Piontti A, Mu K, Rossi L, Sun K, Viboud C, Xiong X, Yu H, Halloran ME, Longini IM, Vespignani A. The effect of travel restrictions on the spread of the 2019 novel coronavirus (COVID-19) outbreak. Science 2020; 368:395-400. [PMID: 32144116 PMCID: PMC7164386 DOI: 10.1126/science.aba9757] [Citation(s) in RCA: 1777] [Impact Index Per Article: 444.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2020] [Revised: 02/07/2020] [Accepted: 03/05/2020] [Indexed: 12/14/2022]
Abstract
Motivated by the rapid spread of coronavirus disease 2019 (COVID-19) in mainland China, we use a global metapopulation disease transmission model to project the impact of travel limitations on the national and international spread of the epidemic. The model is calibrated on the basis of internationally reported cases and shows that, at the start of the travel ban from Wuhan on 23 January 2020, most Chinese cities had already received many infected travelers. The travel quarantine of Wuhan delayed the overall epidemic progression by only 3 to 5 days in mainland China but had a more marked effect on the international scale, where case importations were reduced by nearly 80% until mid-February. Modeling results also indicate that sustained 90% travel restrictions to and from mainland China only modestly affect the epidemic trajectory unless combined with a 50% or higher reduction of transmission in the community.
Collapse
Affiliation(s)
- Matteo Chinazzi
- Laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, MA, USA
| | - Jessica T Davis
- Laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, MA, USA
| | | | | | | | | | - Ana Pastore Y Piontti
- Laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, MA, USA
| | - Kunpeng Mu
- Laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, MA, USA
| | | | - Kaiyuan Sun
- Fogarty International Center, NIH, Bethesda, MD, USA
| | - Cécile Viboud
- Fogarty International Center, NIH, Bethesda, MD, USA
| | - Xinyue Xiong
- Laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, MA, USA
| | - Hongjie Yu
- School of Public Health, Fudan University, Key Laboratory of Public Health Safety, Ministry of Education, Shanghai, China
| | - M Elizabeth Halloran
- Fred Hutchinson Cancer Research Center, Seattle, WA, USA
- Department of Biostatistics, University of Washington, Seattle, WA, USA
| | - Ira M Longini
- Department of Biostatistics, College of Public Health and Health Professions, University of Florida, Gainesville, FL, USA.
| | - Alessandro Vespignani
- Laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, MA, USA.
- ISI Foundation, Turin, Italy
| |
Collapse
|
131
|
Chen S, Mira A, Onnela JP. Flexible model selection for mechanistic network models. JOURNAL OF COMPLEX NETWORKS 2020; 8:cnz024. [PMID: 32765880 PMCID: PMC7391990 DOI: 10.1093/comnet/cnz024] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/25/2019] [Accepted: 06/24/2019] [Indexed: 05/25/2023]
Abstract
Network models are applied across many domains where data can be represented as a network. Two prominent paradigms for modelling networks are statistical models (probabilistic models for the observed network) and mechanistic models (models for network growth and/or evolution). Mechanistic models are better suited for incorporating domain knowledge, to study effects of interventions (such as changes to specific mechanisms) and to forward simulate, but they typically have intractable likelihoods. As such, and in a stark contrast to statistical models, there is a relative dearth of research on model selection for such models despite the otherwise large body of extant work. In this article, we propose a simulator-based procedure for mechanistic network model selection that borrows aspects from Approximate Bayesian Computation along with a means to quantify the uncertainty in the selected model. To select the most suitable network model, we consider and assess the performance of several learning algorithms, most notably the so-called Super Learner, which makes our framework less sensitive to the choice of a particular learning algorithm. Our approach takes advantage of the ease to forward simulate from mechanistic network models to circumvent their intractable likelihoods. The overall process is flexible and widely applicable. Our simulation results demonstrate the approach's ability to accurately discriminate between competing mechanistic models. Finally, we showcase our approach with a protein-protein interaction network model from the literature for yeast (Saccharomyces cerevisiae).
Collapse
Affiliation(s)
- Sixing Chen
- Department of Biostatistics, T.H. Chan School of Public Health, Harvard University 655 Huntington Avenue, Building 2, 4th Floor, Boston, MA 02115, USA
| | - Antonietta Mira
- Data Science Lab, Institute of Computational Science, Università della Svizzera italiana Via Buffi 6, 6900 Lugano, Switzerland and Dipartimento di Scienza e Alta Tecnologia, Università degli Studi dell'Insubria Via Valleggio, 11 - 22100 Como, Italy
| | | |
Collapse
|
132
|
Warne DJ, Baker RE, Simpson MJ. A practical guide to pseudo-marginal methods for computational inference in systems biology. J Theor Biol 2020; 496:110255. [PMID: 32223995 DOI: 10.1016/j.jtbi.2020.110255] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2020] [Revised: 03/11/2020] [Accepted: 03/18/2020] [Indexed: 01/07/2023]
Abstract
For many stochastic models of interest in systems biology, such as those describing biochemical reaction networks, exact quantification of parameter uncertainty through statistical inference is intractable. Likelihood-free computational inference techniques enable parameter inference when the likelihood function for the model is intractable but the generation of many sample paths is feasible through stochastic simulation of the forward problem. The most common likelihood-free method in systems biology is approximate Bayesian computation that accepts parameters that result in low discrepancy between stochastic simulations and measured data. However, it can be difficult to assess how the accuracy of the resulting inferences are affected by the choice of acceptance threshold and discrepancy function. The pseudo-marginal approach is an alternative likelihood-free inference method that utilises a Monte Carlo estimate of the likelihood function. This approach has several advantages, particularly in the context of noisy, partially observed, time-course data typical in biochemical reaction network studies. Specifically, the pseudo-marginal approach facilitates exact inference and uncertainty quantification, and may be efficiently combined with particle filters for low variance, high-accuracy likelihood estimation. In this review, we provide a practical introduction to the pseudo-marginal approach using inference for biochemical reaction networks as a series of case studies. Implementations of key algorithms and examples are provided using the Julia programming language; a high performance, open source programming language for scientific computing (https://github.com/davidwarne/Warne2019_GuideToPseudoMarginal).
Collapse
Affiliation(s)
- David J Warne
- School of Mathematical Sciences, Queensland University of Technology, Brisbane, Queensland 4001, Australia.
| | - Ruth E Baker
- Mathematical Institute, University of Oxford, Oxford, OX2 6GG, United Kingdom
| | - Matthew J Simpson
- School of Mathematical Sciences, Queensland University of Technology, Brisbane, Queensland 4001, Australia
| |
Collapse
|
133
|
Warne DJ, Baker RE, Simpson MJ. Simulation and inference algorithms for stochastic biochemical reaction networks: from basic concepts to state-of-the-art. J R Soc Interface 2020; 16:20180943. [PMID: 30958205 DOI: 10.1098/rsif.2018.0943] [Citation(s) in RCA: 33] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Stochasticity is a key characteristic of intracellular processes such as gene regulation and chemical signalling. Therefore, characterizing stochastic effects in biochemical systems is essential to understand the complex dynamics of living things. Mathematical idealizations of biochemically reacting systems must be able to capture stochastic phenomena. While robust theory exists to describe such stochastic models, the computational challenges in exploring these models can be a significant burden in practice since realistic models are analytically intractable. Determining the expected behaviour and variability of a stochastic biochemical reaction network requires many probabilistic simulations of its evolution. Using a biochemical reaction network model to assist in the interpretation of time-course data from a biological experiment is an even greater challenge due to the intractability of the likelihood function for determining observation probabilities. These computational challenges have been subjects of active research for over four decades. In this review, we present an accessible discussion of the major historical developments and state-of-the-art computational techniques relevant to simulation and inference problems for stochastic biochemical reaction network models. Detailed algorithms for particularly important methods are described and complemented with Matlab® implementations. As a result, this review provides a practical and accessible introduction to computational methods for stochastic models within the life sciences community.
Collapse
Affiliation(s)
- David J Warne
- 1 School of Mathematical Sciences, Queensland University of Technology , Brisbane, Queensland 4001 , Australia
| | - Ruth E Baker
- 2 Mathematical Institute, University of Oxford , Oxford OX2 6GG , UK
| | - Matthew J Simpson
- 1 School of Mathematical Sciences, Queensland University of Technology , Brisbane, Queensland 4001 , Australia
| |
Collapse
|
134
|
Märkle H, Tellier A. Inference of coevolutionary dynamics and parameters from host and parasite polymorphism data of repeated experiments. PLoS Comput Biol 2020; 16:e1007668. [PMID: 32203545 PMCID: PMC7156111 DOI: 10.1371/journal.pcbi.1007668] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2019] [Revised: 04/14/2020] [Accepted: 01/19/2020] [Indexed: 01/27/2023] Open
Abstract
There is a long-standing interest in understanding host-parasite coevolutionary dynamics and associated fitness effects. Increasing amounts of genomic data for both interacting species offer a promising source to identify candidate loci and to infer the main parameters of the past coevolutionary history. However, so far no method exists to perform the latter. By coupling a gene-for-gene model with coalescent simulations, we first show that three types of biological costs, namely, resistance, infectivity and infection, define the allele frequencies at the internal equilibrium point of the coevolution model. These in return determine the strength of selective signatures at the coevolving host and parasite loci. We apply an Approximate Bayesian Computation (ABC) approach on simulated datasets to infer these costs by jointly integrating host and parasite polymorphism data at the coevolving loci. To control for the effect of genetic drift on coevolutionary dynamics, we assume that 10 or 30 repetitions are available from controlled experiments or several natural populations. We study two scenarios: 1) the cost of infection and population sizes (host and parasite) are unknown while costs of infectivity and resistance are known, and 2) all three costs are unknown while populations sizes are known. Using the ABC model choice procedure, we show that for both scenarios, we can distinguish with high accuracy pairs of coevolving host and parasite loci from pairs of neutrally evolving loci, though the statistical power decreases with higher cost of infection. The accuracy of parameter inference is high under both scenarios especially when using both host and parasite data because parasite polymorphism data do inform on costs applying to the host and vice-versa. As the false positive rate to detect pairs of genes under coevolution is small, we suggest that our method complements recently developed methods to identify host and parasite candidate loci for functional studies.
Collapse
Affiliation(s)
- Hanna Märkle
- Section of Population Genetics, TUM School of Life Sciences Weihenstephan, Technical University of Munich, Freising, Germany
| | - Aurélien Tellier
- Section of Population Genetics, TUM School of Life Sciences Weihenstephan, Technical University of Munich, Freising, Germany
| |
Collapse
|
135
|
Vihola M, Franks J. On the use of approximate Bayesian computation Markov chain Monte Carlo with inflated tolerance and post-correction. Biometrika 2020. [DOI: 10.1093/biomet/asz078] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
SummaryApproximate Bayesian computation enables inference for complicated probabilistic models with intractable likelihoods using model simulations. The Markov chain Monte Carlo implementation of approximate Bayesian computation is often sensitive to the tolerance parameter: low tolerance leads to poor mixing and large tolerance entails excess bias. We propose an approach that involves using a relatively large tolerance for the Markov chain Monte Carlo sampler to ensure sufficient mixing and post-processing the output, leading to estimators for a range of finer tolerances. We introduce an approximate confidence interval for the related post-corrected estimators and propose an adaptive approximate Bayesian computation Markov chain Monte Carlo algorithm, which finds a balanced tolerance level automatically based on acceptance rate optimization. Our experiments show that post-processing-based estimators can perform better than direct Markov chain Monte Carlo targeting a fine tolerance, that our confidence intervals are reliable, and that our adaptive algorithm leads to reliable inference with little user specification.
Collapse
Affiliation(s)
- Matti Vihola
- Department of Mathematics and Statistics, University of Jyväskylä, P.O. Box 35, FI-40014 University of Jyväskylä, Finland
| | - Jordan Franks
- Department of Mathematics and Statistics, University of Jyväskylä, P.O. Box 35, FI-40014 University of Jyväskylä, Finland
| |
Collapse
|
136
|
Jay F, Boitard S, Austerlitz F. An ABC Method for Whole-Genome Sequence Data: Inferring Paleolithic and Neolithic Human Expansions. Mol Biol Evol 2020; 36:1565-1579. [PMID: 30785202 DOI: 10.1093/molbev/msz038] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
Species generally undergo a complex demographic history consisting, in particular, of multiple changes in population size. Genome-wide sequencing data are potentially highly informative for reconstructing this demographic history. A crucial point is to extract the relevant information from these very large data sets. Here, we design an approach for inferring past demographic events from a moderate number of fully sequenced genomes. Our new approach uses Approximate Bayesian Computation, a simulation-based statistical framework that allows 1) identifying the best demographic scenario among several competing scenarios and 2) estimating the best-fitting parameters under the chosen scenario. Approximate Bayesian Computation relies on the computation of summary statistics. Using a cross-validation approach, we show that statistics such as the lengths of haplotypes shared between individuals, or the decay of linkage disequilibrium with distance, can be combined with classical statistics (e.g., heterozygosity and Tajima's D) to accurately infer complex demographic scenarios including bottlenecks and expansion periods. We also demonstrate the importance of simultaneously estimating the genotyping error rate. Applying our method on genome-wide human-sequence databases, we finally show that a model consisting in a bottleneck followed by a Paleolithic and a Neolithic expansion is the most relevant for Eurasian populations.
Collapse
Affiliation(s)
- Flora Jay
- Laboratoire EcoAnthropologie et Ethnobiologie, CNRS/MNHN/Université Paris Diderot, Paris, France.,Laboratoire de Recherche en Informatique, CNRS/Université Paris-Sud/Université Paris-Saclay, Orsay, France
| | - Simon Boitard
- GenPhySE, Université de Toulouse, INRA, INPT, INP-ENVT, Castanet Tolosan, France
| | - Frédéric Austerlitz
- Laboratoire EcoAnthropologie et Ethnobiologie, CNRS/MNHN/Université Paris Diderot, Paris, France
| |
Collapse
|
137
|
Rybiński M, Möller S, Sunnåker M, Lormeau C, Stelling J. TopoFilter: a MATLAB package for mechanistic model identification in systems biology. BMC Bioinformatics 2020; 21:34. [PMID: 31996136 PMCID: PMC6990465 DOI: 10.1186/s12859-020-3343-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2019] [Accepted: 01/08/2020] [Indexed: 12/27/2022] Open
Abstract
Background To develop mechanistic dynamic models in systems biology, one often needs to identify all (or minimal) representations of the biological processes that are consistent with experimental data, out of a potentially large set of hypothetical mechanisms. However, a simple enumeration of all alternatives becomes quickly intractable when the number of model parameters grows. Selecting appropriate dynamic models out of a large ensemble of models, taking the uncertainty in our biological knowledge and in the experimental data into account, is therefore a key current problem in systems biology. Results The TopoFilter package addresses this problem in a heuristic and automated fashion by implementing the previously described topological filtering method for Bayesian model selection. It includes a core heuristic for searching the space of submodels of a parametrized model, coupled with a sampling-based exploration of the parameter space. Recent developments of the method allow to balance exhaustiveness and speed of the model space search, to efficiently re-sample parameters, to parallelize the search, and to use custom scoring functions. We use a theoretical example to motivate these features and then demonstrate TopoFilter’s applicability for a yeast signaling network with more than 250’000 possible model structures. Conclusions TopoFilter is a flexible software framework that makes Bayesian model selection and reduction efficient and scalable to network models of a complexity that represents contemporary problems in, for example, cell signaling. TopoFilter is open-source, available under the GPL-3.0 license at https://gitlab.com/csb.ethz/TopoFilter. It includes installation instructions, a quickstart guide, a description of all package options, and multiple examples.
Collapse
Affiliation(s)
- Mikołaj Rybiński
- Department of Biosystems Science and Engineering and SIB Swiss Institute of Bioinformatics, ETH Zurich, Mattenstr. 26, Basel, 4058, Switzerland.,ID Scientific IT Services, ETH Zurich, Zurich, 8092, Switzerland
| | - Simon Möller
- Department of Biosystems Science and Engineering and SIB Swiss Institute of Bioinformatics, ETH Zurich, Mattenstr. 26, Basel, 4058, Switzerland
| | - Mikael Sunnåker
- Department of Biosystems Science and Engineering and SIB Swiss Institute of Bioinformatics, ETH Zurich, Mattenstr. 26, Basel, 4058, Switzerland
| | - Claude Lormeau
- Department of Biosystems Science and Engineering and SIB Swiss Institute of Bioinformatics, ETH Zurich, Mattenstr. 26, Basel, 4058, Switzerland.,Life Science Zurich Ph.D. program "Systems Biology", Zurich, 8092, Switzerland
| | - Jörg Stelling
- Department of Biosystems Science and Engineering and SIB Swiss Institute of Bioinformatics, ETH Zurich, Mattenstr. 26, Basel, 4058, Switzerland.
| |
Collapse
|
138
|
Buetti-Dinh A, Herold M, Christel S, El Hajjami M, Delogu F, Ilie O, Bellenberg S, Wilmes P, Poetsch A, Sand W, Vera M, Pivkin IV, Friedman R, Dopson M. Reverse engineering directed gene regulatory networks from transcriptomics and proteomics data of biomining bacterial communities with approximate Bayesian computation and steady-state signalling simulations. BMC Bioinformatics 2020; 21:23. [PMID: 31964336 PMCID: PMC6975020 DOI: 10.1186/s12859-019-3337-9] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2019] [Accepted: 12/30/2019] [Indexed: 12/12/2022] Open
Abstract
BACKGROUND Network inference is an important aim of systems biology. It enables the transformation of OMICs datasets into biological knowledge. It consists of reverse engineering gene regulatory networks from OMICs data, such as RNAseq or mass spectrometry-based proteomics data, through computational methods. This approach allows to identify signalling pathways involved in specific biological functions. The ability to infer causality in gene regulatory networks, in addition to correlation, is crucial for several modelling approaches and allows targeted control in biotechnology applications. METHODS We performed simulations according to the approximate Bayesian computation method, where the core model consisted of a steady-state simulation algorithm used to study gene regulatory networks in systems for which a limited level of details is available. The simulations outcome was compared to experimentally measured transcriptomics and proteomics data through approximate Bayesian computation. RESULTS The structure of small gene regulatory networks responsible for the regulation of biological functions involved in biomining were inferred from multi OMICs data of mixed bacterial cultures. Several causal inter- and intraspecies interactions were inferred between genes coding for proteins involved in the biomining process, such as heavy metal transport, DNA damage, replication and repair, and membrane biogenesis. The method also provided indications for the role of several uncharacterized proteins by the inferred connection in their network context. CONCLUSIONS The combination of fast algorithms with high-performance computing allowed the simulation of a multitude of gene regulatory networks and their comparison to experimentally measured OMICs data through approximate Bayesian computation, enabling the probabilistic inference of causality in gene regulatory networks of a multispecies bacterial system involved in biomining without need of single-cell or multiple perturbation experiments. This information can be used to influence biological functions and control specific processes in biotechnology applications.
Collapse
Affiliation(s)
- Antoine Buetti-Dinh
- Institute of Computational Science, Faculty of Informatics, Università della Svizzera Italiana, Via Giuseppe Buffi 13, Lugano, CH-6900 Switzerland
- Swiss Institute of Bioinformatics, Quartier Sorge – Batiment Genopode, Lausanne, CH-1015 Switzerland
- Department of Chemistry and Biomedical Sciences, Linnæus University, Hus Vita, Kalmar, SE-391 82 Sweden
- Linnæus University Centre for Biomaterials Chemistry, Linnæus University, Hus Vita, Kalmar, SE-391 82 Sweden
- Centre for Ecology and Evolution in Microbial Model Systems, Linnæus University, Hus Vita, Kalmar, SE-391 82 Sweden
| | - Malte Herold
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Belvaux, Luxembourg
| | - Stephan Christel
- Centre for Ecology and Evolution in Microbial Model Systems, Linnæus University, Hus Vita, Kalmar, SE-391 82 Sweden
| | | | - Francesco Delogu
- Faculty of Chemistry, Biotechnology and Food Science, Norwegian University of Life Sciences, Oslo, Norway
| | - Olga Ilie
- Institute of Computational Science, Faculty of Informatics, Università della Svizzera Italiana, Via Giuseppe Buffi 13, Lugano, CH-6900 Switzerland
- Swiss Institute of Bioinformatics, Quartier Sorge – Batiment Genopode, Lausanne, CH-1015 Switzerland
| | - Sören Bellenberg
- Centre for Ecology and Evolution in Microbial Model Systems, Linnæus University, Hus Vita, Kalmar, SE-391 82 Sweden
| | - Paul Wilmes
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Belvaux, Luxembourg
| | - Ansgar Poetsch
- Plant Biochemistry, Ruhr University Bochum, Bochum, Germany
- Center for Marine and Molecular Biotechnology, QNLM, Qingdao, China
- College of Marine Life Sciences, Ocean University of China, Qingdao, China
| | - Wolfgang Sand
- Faculty of Chemistry, Essen, Germany
- College of Environmental Science and Engineering, Donghua University, Shanghai, People’s Republic of China
- Mining Academy and Technical University Freiberg, Freiberg, Germany
| | - Mario Vera
- Institute for Biological and Medical Engineering. Schools of Engineering, Medicine & Biological Sciences, Pontificia Universidad Católica de Chile, Santiago, Chile
- Department of Hydraulic & Environmental Engineering, Pontificia Universidad Católica de Chile, Santiago, Chile
| | - Igor V. Pivkin
- Institute of Computational Science, Faculty of Informatics, Università della Svizzera Italiana, Via Giuseppe Buffi 13, Lugano, CH-6900 Switzerland
- Swiss Institute of Bioinformatics, Quartier Sorge – Batiment Genopode, Lausanne, CH-1015 Switzerland
| | - Ran Friedman
- Department of Chemistry and Biomedical Sciences, Linnæus University, Hus Vita, Kalmar, SE-391 82 Sweden
- Linnæus University Centre for Biomaterials Chemistry, Linnæus University, Hus Vita, Kalmar, SE-391 82 Sweden
| | - Mark Dopson
- Centre for Ecology and Evolution in Microbial Model Systems, Linnæus University, Hus Vita, Kalmar, SE-391 82 Sweden
| |
Collapse
|
139
|
Krzhizhanovskaya VV, Závodszky G, Lees MH, Dongarra JJ, Sloot PMA, Brissos S, Teixeira J. Supermodeling: The Next Level of Abstraction in the Use of Data Assimilation. LECTURE NOTES IN COMPUTER SCIENCE 2020. [PMCID: PMC7304721 DOI: 10.1007/978-3-030-50433-5_11] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/05/2022]
Abstract
Data assimilation (DA) is a key procedure that synchronizes a computer model with real observations. However, in the case of overparametrized complex systems modeling, the task of parameter-estimation through data assimilation can expand exponentially. It leads to unacceptable computational overhead, substantial inaccuracies in parameter matching, and wrong predictions. Here we define a Supermodel as a kind of ensembling scheme, which consists of a few sub-models representing various instances of the baseline model. The sub-models differ in parameter sets and are synchronized through couplings between the most sensitive dynamical variables. We demonstrate that after a short pretraining of the fully parametrized small sub-model ensemble, and then training a few latent parameters of the low-parameterized Supermodel, we can outperform in efficiency and accuracy the baseline model matched to data by a classical DA procedure.
Collapse
|
140
|
Scanlon LA, Lobb A, Tehrani JJ, Kendal JR. Unknotting the interactive effects of learning processes on cultural evolutionary dynamics. EVOLUTIONARY HUMAN SCIENCES 2019; 1:e17. [PMID: 37588399 PMCID: PMC10427294 DOI: 10.1017/ehs.2019.17] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022] Open
Abstract
Forms of non-random copying error provide sources of inherited variation yet their effects on cultural evolutionary dynamics are poorly understood. Focusing on variation in granny and reef knot forms, we present a mathematical model that specifies how these variant frequencies are affected by non-linear interactions between copying fidelity, mirroring, handedness and repetition biases. Experiments on adult humans allowed these effects to be estimated using approximate Bayesian computation and the model is iterated to explain the prevalence of granny over reef knots in the wild. Our study system also serves to show conditions under which copying fidelity drives heterogeneity in cultural variants at equilibrium, and that interaction between unbiased forms of copying error can skew cultural variation.
Collapse
Affiliation(s)
- Lauren A. Scanlon
- Department Of Mathematics, Durham University, Durham, UK
- Department Of Anthropology And Durham Cultural Evolution Research Centre, Durham University, Durham, UK
| | - Andrew Lobb
- Department Of Mathematics, Durham University, Durham, UK
| | - Jamshid J. Tehrani
- Department Of Anthropology And Durham Cultural Evolution Research Centre, Durham University, Durham, UK
| | - Jeremy R. Kendal
- Department Of Anthropology And Durham Cultural Evolution Research Centre, Durham University, Durham, UK
- Durham Research Methods Centre
| |
Collapse
|
141
|
Han JH, Weston JF, Heuer C, Gates MC. Estimation of the within-herd transmission rates of bovine viral diarrhoea virus in extensively grazed beef cattle herds. Vet Res 2019; 50:103. [PMID: 31783904 PMCID: PMC6884759 DOI: 10.1186/s13567-019-0723-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2019] [Accepted: 11/06/2019] [Indexed: 11/12/2022] Open
Abstract
Many research groups have developed mathematical models to simulate the dynamics of BVDV infections in cattle herds. However, most models use estimates for within-herd BVDV transmission rates that are either based on expert opinion or adapted from other dairy herd simulation models presented in the literature. There is currently little information on the transmission rates for BVDV in extensively grazed beef herds partly due to the logistical challenges in obtaining longitudinal data of individual animal’s seroconversion, and it may not be appropriate to apply the same transmission rates from intensive dairy herds given the significant differences in herd demographics and management. To address this knowledge gap, we measured BVDV antibody levels in 15 replacement heifers in each of 75 New Zealand beef breeding farms after their first calving and again at pregnancy scanning or weaning to check for seroconversion. Among these, data from 9 farms were used to infer the within-herd BVDV transmission rate with an approximate Bayesian computation method. The most probable within-herd BVDV transmission rate was estimated as 0.11 per persistently infected (PI) animal per day with a 95% highest posterior density interval between 0.03 and 0.34. This suggests that BVDV transmission in extensively grazed beef herds is generally slower than in dairy herds where the transmission rate has been estimated at 0.50 per PI animal per day and therefore may not be sufficient to ensure that all susceptible breeding females gain adequate immunity to the virus before the risk period of early pregnancy for generating new PI calves.
Collapse
Affiliation(s)
- Jun-Hee Han
- EpiCentre, School of Veterinary Science, Massey University, Private Bag 11-222, Palmerston North, New Zealand.
| | - Jenny F Weston
- School of Veterinary Science, Massey University, Private Bag 11-222, Palmerston North, New Zealand
| | - Cord Heuer
- EpiCentre, School of Veterinary Science, Massey University, Private Bag 11-222, Palmerston North, New Zealand
| | - M Carolyn Gates
- EpiCentre, School of Veterinary Science, Massey University, Private Bag 11-222, Palmerston North, New Zealand
| |
Collapse
|
142
|
Wilson RC, Collins AG. Ten simple rules for the computational modeling of behavioral data. eLife 2019; 8:49547. [PMID: 31769410 PMCID: PMC6879303 DOI: 10.7554/elife.49547] [Citation(s) in RCA: 245] [Impact Index Per Article: 49.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2019] [Accepted: 10/09/2019] [Indexed: 02/06/2023] Open
Abstract
Computational modeling of behavior has revolutionized psychology and neuroscience. By fitting models to experimental data we can probe the algorithms underlying behavior, find neural correlates of computational variables and better understand the effects of drugs, illness and interventions. But with great power comes great responsibility. Here, we offer ten simple rules to ensure that computational modeling is used with care and yields meaningful insights. In particular, we present a beginner-friendly, pragmatic and details-oriented introduction on how to relate models to data. What, exactly, can a model tell us about the mind? To answer this, we apply our rules to the simplest modeling techniques most accessible to beginning modelers and illustrate them with examples and code available online. However, most rules apply to more advanced techniques. Our hope is that by following our guidelines, researchers will avoid many pitfalls and unleash the power of computational modeling on their own data.
Collapse
Affiliation(s)
- Robert C Wilson
- Department of Psychology, University of Arizona, Tucson, United States.,Cognitive Science Program, University of Arizona, Tucson, United States
| | - Anne Ge Collins
- Department of Psychology, University of California, Berkeley, Berkeley, United States.,Helen Wills Neuroscience Institute, University of California, Berkeley, Berkeley, United States
| |
Collapse
|
143
|
Hazra I, Pandey MD, Jyrkama MI. Estimation of Flow-Accelerated Corrosion Rate in Nuclear Piping System. JOURNAL OF NUCLEAR ENGINEERING AND RADIATION SCIENCE 2019. [DOI: 10.1115/1.4044407] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Abstract
Flow-accelerated corrosion (FAC) is a life-limiting factor for the piping network of the primary heat transport system (PHTS) in CANDU® reactors. The pipe wall thinning caused by FAC is monitored by carrying out periodic in-service inspections (ISI) to ensure the fitness-for-service of the piping system. Accurate prediction of the lifetime of various components in the PHTS piping network requires estimation of FAC thinning rate. The traditional Bayesian inference techniques commonly employed for parameter estimation are computationally costly. This paper presents an inexpensive and intuitive simulation-based Bayesian approach to FAC rate estimation, called approximate Bayesian computation using Markov chain Monte Carlo (ABC-MCMC). ABC-MCMC is a likelihood-free Bayesian computation scheme that generates samples directly from an approximate posterior distribution by simulating data sets from a forward model. The efficiency of ABC-MCMC is demonstrated by presenting a comparison with a likelihood-based Bayesian computation scheme, Metropolis-Hastings (MH) algorithm, using a practical data-based example. Furthermore, an innovative step has been proposed for reducing the Markov chain burn-in time in the proposed scheme. To indicate the need of a Bayesian approach in quantifying the uncertainties related to the FAC model parameters, results from the linear regression method, a common industrial approach, are also presented in this study. The numerical results show a notable reduction in computational time, suggesting that ABC-MCMC is an efficient alternative to the traditional Bayesian inference methods, specifically for handling noisy degradation data.
Collapse
Affiliation(s)
- Indranil Hazra
- Department of Civil and Environmental Engineering, University of Waterloo, Waterloo, ON N2 L 3G1, Canada
| | - Mahesh D. Pandey
- Department of Civil and Environmental Engineering, University of Waterloo, Waterloo, ON N2 L 3G1, Canada
| | - Mikko I. Jyrkama
- Department of Civil and Environmental Engineering, University of Waterloo, Waterloo, ON N2 L 3G1, Canada
| |
Collapse
|
144
|
Götte H. Handbook of Approximate Bayesian Computation. Edited by Scott A.Sisson, YananFan, Mark A.Beaumont (2019). London, UK: Chapman & Hall/CRC Press. 662 pages, ISBN: 978‐1‐4398‐8150‐7. Biom J 2019. [DOI: 10.1002/bimj.201900141] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
145
|
Eriksson O, Jauhiainen A, Maad Sasane S, Kramer A, Nair AG, Sartorius C, Hellgren Kotaleski J. Uncertainty quantification, propagation and characterization by Bayesian analysis combined with global sensitivity analysis applied to dynamical intracellular pathway models. Bioinformatics 2019; 35:284-292. [PMID: 30010712 PMCID: PMC6330009 DOI: 10.1093/bioinformatics/bty607] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2017] [Accepted: 07/10/2018] [Indexed: 11/14/2022] Open
Abstract
Motivation Dynamical models describing intracellular phenomena are increasing in size and complexity as more information is obtained from experiments. These models are often over-parameterized with respect to the quantitative data used for parameter estimation, resulting in uncertainty in the individual parameter estimates as well as in the predictions made from the model. Here we combine Bayesian analysis with global sensitivity analysis (GSA) in order to give better informed predictions; to point out weaker parts of the model that are important targets for further experiments, as well as to give guidance on parameters that are essential in distinguishing different qualitative output behaviours. Results We used approximate Bayesian computation (ABC) to estimate the model parameters from experimental data, as well as to quantify the uncertainty in this estimation (inverse uncertainty quantification), resulting in a posterior distribution for the parameters. This parameter uncertainty was next propagated to a corresponding uncertainty in the predictions (forward uncertainty propagation), and a GSA was performed on the predictions using the posterior distribution as the possible values for the parameters. This methodology was applied on a relatively large model relevant for synaptic plasticity, using experimental data from several sources. We could hereby point out those parameters that by themselves have the largest contribution to the uncertainty of the prediction as well as identify parameters important to separate between qualitatively different predictions. This approach is useful both for experimental design as well as model building. Availability and implementation Source code is freely available at https://github.com/alexjau/uqsa. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Olivia Eriksson
- Science for Life Laboratory, Electrical Engineering and Computer Science, KTH Royal Institute of Technology, Stockholm, Sweden.,Science for Life Laboratory, Department of Numerical Analysis and Computer Science, Stockholm University, Stockholm, Sweden.,Swedish e-Science Research Centre (SeRC), KTH Royal Institute of Technology, Stockholm, Sweden
| | - Alexandra Jauhiainen
- Biometrics, Early Clinical Development, IMED Biotech Unit, AstraZeneca, Gothenburg, Sweden
| | | | - Andrei Kramer
- Science for Life Laboratory, Electrical Engineering and Computer Science, KTH Royal Institute of Technology, Stockholm, Sweden
| | - Anu G Nair
- Science for Life Laboratory, Electrical Engineering and Computer Science, KTH Royal Institute of Technology, Stockholm, Sweden
| | | | - Jeanette Hellgren Kotaleski
- Science for Life Laboratory, Electrical Engineering and Computer Science, KTH Royal Institute of Technology, Stockholm, Sweden.,Science for Life Laboratory, Department of Numerical Analysis and Computer Science, Stockholm University, Stockholm, Sweden.,Swedish e-Science Research Centre (SeRC), KTH Royal Institute of Technology, Stockholm, Sweden
| |
Collapse
|
146
|
Götte H, Kirchner M, Kieser M. Adjustment for exploratory cut‐off selection in randomized clinical trials with survival endpoint. Biom J 2019; 62:627-642. [DOI: 10.1002/bimj.201800302] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2018] [Revised: 09/04/2019] [Accepted: 09/06/2019] [Indexed: 11/08/2022]
Affiliation(s)
| | - Marietta Kirchner
- Institute of Medical Biometry and Informatics University of Heidelberg Heidelberg Germany
| | - Meinhard Kieser
- Institute of Medical Biometry and Informatics University of Heidelberg Heidelberg Germany
| |
Collapse
|
147
|
Minter A, Retkute R. Approximate Bayesian Computation for infectious disease modelling. Epidemics 2019; 29:100368. [PMID: 31563466 DOI: 10.1016/j.epidem.2019.100368] [Citation(s) in RCA: 35] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2019] [Revised: 08/20/2019] [Accepted: 08/30/2019] [Indexed: 12/23/2022] Open
Abstract
Approximate Bayesian Computation (ABC) techniques are a suite of model fitting methods which can be implemented without a using likelihood function. In order to use ABC in a time-efficient manner users must make several design decisions including how to code the ABC algorithm and the type of ABC algorithm to use. Furthermore, ABC relies on a number of user defined choices which can greatly effect the accuracy of estimation. Having a clear understanding of these factors in reducing computation time and improving accuracy allows users to make more informed decisions when planning analyses. In this paper, we present an introduction to ABC with a focus of application to infectious disease models. We present a tutorial on coding practice for ABC in R and three case studies to illustrate the application of ABC to infectious disease models.
Collapse
Affiliation(s)
- Amanda Minter
- Centre for the Mathematical Modelling of Infectious Diseases, London School of Hygiene and Tropical Medicine, London, UK.
| | - Renata Retkute
- The Zeeman Institute for Systems Biology and Infectious Disease Epidemiology Research, University of Warwick, UK
| |
Collapse
|
148
|
Filipe JA, Kyriazakis I. Bayesian, Likelihood-Free Modelling of Phenotypic Plasticity and Variability in Individuals and Populations. Front Genet 2019; 10:727. [PMID: 31616460 PMCID: PMC6764410 DOI: 10.3389/fgene.2019.00727] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2018] [Accepted: 07/11/2019] [Indexed: 12/17/2022] Open
Abstract
There is a paradigm shift from the traditional focus on the "average" individual towards the definition and analysis of trait variation within individual life-history and among individuals in populations. This is a result of increasing availability of individual phenotypic data. The shift allows the use of genetic and environment-driven variations to assess robustness to challenge, gain greater understanding of organismal biological processes, or deliver individual-targeted treatments or genetic selection. These consequences apply, in particular, to variation in ontogenetic growth. We propose an approach to parameterise mathematical models of individual traits (e.g., reaction norms, growth curves) that address two challenges: 1) Estimation of individual traits while making minimal assumptions about data distribution and correlation, addressed via Approximate Bayesian Computation (a form of nonparametric inference). We are motivated by the fact that available information on distribution of biological data is often less precise than assumed by conventional likelihood functions. 2) Scaling-up to population phenotype distributions while facilitating unbiased use of individual data; this is addressed via a probabilistic framework where population distributions build on separately-inferred individual distributions and individual-trait interpretability is preserved. The approach is tested against Bayesian likelihood-based inference, by fitting weight and energy intake growth models to animal data and normal- and skewed-distributed simulated data. i) Individual inferences were accurate and robust to changes in data distribution and sample size; in particular, median-based predictions were more robust than maximum- likelihood-based curves. These results suggest that the approach gives reliable inferences using few observations and monitoring resources. ii) At the population level, each individual contributed via a specific data distribution, and population phenotype estimates were not disproportionally influenced by outlier individuals. Indices measuring population phenotype variation can be derived for study comparisons. The approach offers an alternative for estimating trait variability in biological systems that may be reliable for various applications, for example, in genetics, health, and individualised nutrition, while using fewer assumptions and fewer empirical observations. In livestock breeding, the potentially greater accuracy of trait estimation (without specification of multitrait variance-covariance parameters) could lead to improved selection and to more decisive estimates of trait heritability.
Collapse
Affiliation(s)
- Joao A.N. Filipe
- Agriculture, School of Natural and Environmental Sciences, Newcastle University, Newcastle upon Tyne, United Kingdom
| | | |
Collapse
|
149
|
Miskovic L, Béal J, Moret M, Hatzimanikatis V. Uncertainty reduction in biochemical kinetic models: Enforcing desired model properties. PLoS Comput Biol 2019; 15:e1007242. [PMID: 31430276 PMCID: PMC6716680 DOI: 10.1371/journal.pcbi.1007242] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2019] [Revised: 08/30/2019] [Accepted: 07/03/2019] [Indexed: 11/18/2022] Open
Abstract
A persistent obstacle for constructing kinetic models of metabolism is uncertainty in the kinetic properties of enzymes. Currently, available methods for building kinetic models can cope indirectly with uncertainties by integrating data from different biological levels and origins into models. In this study, we use the recently proposed computational approach iSCHRUNK (in Silico Approach to Characterization and Reduction of Uncertainty in the Kinetic Models), which combines Monte Carlo parameter sampling methods and machine learning techniques, in the context of Bayesian inference. Monte Carlo parameter sampling methods allow us to exploit synergies between different data sources and generate a population of kinetic models that are consistent with the available data and physicochemical laws. The machine learning allows us to data-mine the a priori generated kinetic parameters together with the integrated datasets and derive posterior distributions of kinetic parameters consistent with the observed physiology. In this work, we used iSCHRUNK to address a design question: can we identify which are the kinetic parameters and what are their values that give rise to a desired metabolic behavior? Such information is important for a wide variety of studies ranging from biotechnology to medicine. To illustrate the proposed methodology, we performed Metabolic Control Analysis, computed the flux control coefficients of the xylose uptake (XTR), and identified parameters that ensure a rate improvement of XTR in a glucose-xylose co-utilizing S. cerevisiae strain. Our results indicate that only three kinetic parameters need to be accurately characterized to describe the studied physiology, and ultimately to design and control the desired responses of the metabolism. This framework paves the way for a new generation of methods that will systematically integrate the wealth of available omics data and efficiently extract the information necessary for metabolic engineering and synthetic biology decisions. Kinetic models are the most promising tool for understanding the complex dynamic behavior of living cells. The primary goal of kinetic models is to capture the properties of the metabolic networks as a whole, and thus we need large-scale models for dependable in silico analyses of metabolism. However, uncertainty in kinetic parameters impedes the development of kinetic models, and uncertainty levels increase with the model size. Tools that will address the issues with parameter uncertainty and that will be able to reduce the uncertainty propagation through the system are therefore needed. In this work, we applied a method called iSCHRUNK that combines parameter sampling and machine learning techniques to characterize the uncertainties and uncover intricate relationships between the parameters of kinetic models and the responses of the metabolic network. The proposed method allowed us to identify a small number of parameters that determine the responses in the network regardless of the values of other parameters. As a consequence, in future studies of metabolism, it will be sufficient to explore a reduced kinetic space, and more comprehensive analyses of large-scale and genome-scale metabolic networks will be computationally tractable.
Collapse
Affiliation(s)
- Ljubisa Miskovic
- Laboratory of Computational Systems Biology (LCSB), EPFL, CH, Lausanne, Switzerland
| | - Jonas Béal
- Master's Program in Life Sciences and Technology, EPFL, CH, Lausanne, Switzerland
| | - Michael Moret
- Master's Program in Life Sciences and Technology, EPFL, CH, Lausanne, Switzerland
| | | |
Collapse
|
150
|
Bekara MEA, Bareille N. Quantification by simulation of the effect of herd management practices and cow fertility on the reproductive and economic performance of Holstein dairy herds. J Dairy Sci 2019; 102:9435-9457. [PMID: 31421872 DOI: 10.3168/jds.2018-15484] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2018] [Accepted: 06/19/2019] [Indexed: 11/19/2022]
Abstract
The performance of dairy herds is affected mainly by factors related to cows' characteristics and herd management practices. However, these factors are interrelated, and as such, the estimation of their individual effect on the performance of dairy herds remains difficult. The aim of this study was to estimate the weight of these factors as well the interactions between them on the reproductive and economic performance of dairy farms. A stochastic dynamic model was used to simulate most physiological and management processes occurring on a dairy farm. A herd of 60 Holstein cows, with a milk yield of 8,000 L/cow-year, representative of French Holstein dairy herds, was simulated. A total of 216 scenarios were run by combining 2 levels of postpartum cyclicity resumption (average: 45 d, high: 75 d), 3 levels of 21-d conception rate of the herd (i.e., proportion of cows pregnant 21 d after insemination; low: 25%, average: 45%, high: 70%), 3 levels of probability of pregnancy loss until 120 d (low: 3%, average: 15%, high: 43%), 3 levels of sensitivity of estrus detection by the farmer (low: 20%, average: 50%, high: 90%), 2 alternative managerial goals (constant number of cows or constant volume of milk sold), and 2 types of management for the sale and purchase of animals (closed or open herd). The effect of each factor was estimated by sensitivity analysis. The parameter that had the greatest effect on reproductive performance was the sensitivity of estrus detection: a 10-percentage-point increase between the low and average levels and between the average and high levels reduced the calving interval by 16 and 5.7 d, respectively. However, the factor that had the greatest effect on economic performance was the 21-d conception rate: a 10-percentage-point increase between the low and average levels and between the average and high levels increased the gross margin by €62.2 and €22.3/cow-year, respectively. The pregnancy loss until 120 d had an effect on economic performance: an increase of 1 percentage point of this parameter decreased the gross margin by €2/cow-year. The other factors studied, and their interactions, did not have a major effect (low value of sensitivity indices). Closed herds or farms with a constant number of cows had economic losses of €58/cow-year compared with open herds or to farms with constant volume of milk sold. Altogether, our data suggest that, in a typical French dairy farm, farmers' efforts on estrus detection will be more profitable when associated with improvement of the conception rate of the cows.
Collapse
Affiliation(s)
- M E A Bekara
- BIOEPAR, INRA, Oniris, La Chantrerie, 44307 Nantes, France; Laboratory of Molecular Biology, Genomics and Bioinformatics, Department of Biology, Faculty of Nature and Life Sciences, University Hassiba Benbouali of Chlef, 02000 Chlef, Algeria.
| | - N Bareille
- BIOEPAR, INRA, Oniris, La Chantrerie, 44307 Nantes, France
| |
Collapse
|