1
|
Goyal R, Carnegie N, Slipher S, Turk P, Little SJ, De Gruttola V. Estimating contact network properties by integrating multiple data sources associated with infectious diseases. Stat Med 2023; 42:3593-3615. [PMID: 37392149 PMCID: PMC10825904 DOI: 10.1002/sim.9816] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2022] [Revised: 05/09/2023] [Accepted: 05/19/2023] [Indexed: 07/03/2023]
Abstract
To effectively mitigate the spread of communicable diseases, it is necessary to understand the interactions that enable disease transmission among individuals in a population; we refer to the set of these interactions as a contact network. The structure of the contact network can have profound effects on both the spread of infectious diseases and the effectiveness of control programs. Therefore, understanding the contact network permits more efficient use of resources. Measuring the structure of the network, however, is a challenging problem. We present a Bayesian approach to integrate multiple data sources associated with the transmission of infectious diseases to more precisely and accurately estimate important properties of the contact network. An important aspect of the approach is the use of the congruence class models for networks. We conduct simulation studies modeling pathogens resembling SARS-CoV-2 and HIV to assess the method; subsequently, we apply our approach to HIV data from the University of California San Diego Primary Infection Resource Consortium. Based on simulation studies, we demonstrate that the integration of epidemiological and viral genetic data with risk behavior survey data can lead to large decreases in mean squared error (MSE) in contact network estimates compared to estimates based strictly on risk behavior information. This decrease in MSE is present even in settings where the risk behavior surveys contain measurement error. Through these simulations, we also highlight certain settings where the approach does not improve MSE.
Collapse
Affiliation(s)
- Ravi Goyal
- Division of Infectious Diseases and Global Public, University of California San Diego, San Diego, California, USA
| | | | - Sally Slipher
- Department of Mathematical Sciences, Montana State University, Bozeman, Montana, USA
| | - Philip Turk
- Department of Data Science, University of Mississippi Medical Center, Jackson, Mississippi, USA
| | - Susan J Little
- Division of Infectious Diseases and Global Public, University of California San Diego, La Jolla, California, USA
| | - Victor De Gruttola
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, USA
| |
Collapse
|
2
|
Pujante-Otalora L, Canovas-Segura B, Campos M, Juarez JM. The use of networks in spatial and temporal computational models for outbreak spread in epidemiology: A systematic review. J Biomed Inform 2023; 143:104422. [PMID: 37315830 DOI: 10.1016/j.jbi.2023.104422] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2022] [Revised: 06/05/2023] [Accepted: 06/09/2023] [Indexed: 06/16/2023]
Abstract
OBJECTIVES To examine recent literature in order to present a comprehensive overview of the current trends as regards the computational models used to represent the propagation of an infectious outbreak in a population, paying particular attention to those that represent network-based transmission. METHODS a systematic review was conducted following the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines. Papers published in English between 2010 and September 2021 were sought in the ACM Digital Library, IEEE Xplore, PubMed and Scopus databases. RESULTS Upon considering their titles and abstracts, 832 papers were obtained, of which 192 were selected for a full content-body check. Of these, 112 studies were eventually deemed suitable for quantitative and qualitative analysis. Emphasis was placed on the spatial and temporal scales studied, the use of networks or graphs, and the granularity of the data used to evaluate the models. The models principally used to represent the spreading of outbreaks have been stochastic (55.36%), while the type of networks most frequently used are relationship networks (32.14%). The most common spatial dimension used is a region (19.64%) and the most used unit of time is a day (28.57%). Synthetic data as opposed to an external source were used in 51.79% of the papers. With regard to the granularity of the data sources, aggregated data such as censuses or transportation surveys are the most common. CONCLUSION We identified a growing interest in the use of networks to represent disease transmission. We detected that research is focused on only certain combinations of the computational model, type of network (in both the expressive and the structural sense) and spatial scale, while the search for other interesting combinations has been left for the future.
Collapse
Affiliation(s)
- Lorena Pujante-Otalora
- AIKE Research Group (INTICO), University of Murcia, Campus Espinardo, Murcia 30100, Spain.
| | | | - Manuel Campos
- AIKE Research Group (INTICO), University of Murcia, Campus Espinardo, Murcia 30100, Spain; Murcian Bio-Health Institute (IMIB-Arrixaca), El Palmar, Murcia 30120, Spain.
| | - Jose M Juarez
- AIKE Research Group (INTICO), University of Murcia, Campus Espinardo, Murcia 30100, Spain.
| |
Collapse
|
3
|
Silk MJ, Carrignon S, Bentley RA, Fefferman NH. Improving pandemic mitigation policies across communities through coupled dynamics of risk perception and infection. Proc Biol Sci 2021; 288:20210834. [PMID: 34284634 PMCID: PMC8292781 DOI: 10.1098/rspb.2021.0834] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2021] [Accepted: 06/23/2021] [Indexed: 02/07/2023] Open
Abstract
Capturing the coupled dynamics between individual behavioural decisions that affect disease transmission and the epidemiology of outbreaks is critical to pandemic mitigation strategy. We develop a multiplex network approach to model how adherence to health-protective behaviours that impact COVID-19 spread are shaped by perceived risks and resulting community norms. We focus on three synergistic dynamics governing individual behavioural choices: (i) social construction of concern, (ii) awareness of disease incidence, and (iii) reassurance by lack of disease. We show why policies enacted early or broadly can cause communities to become reassured and therefore unwilling to maintain or adopt actions. Public health policies for which success relies on collective action should therefore exploit the behaviourally receptive phase; the period between the generation of sufficient concern to foster adoption of novel actions and the relaxation of adherence driven by reassurance fostered by avoidance of negative outcomes over time.
Collapse
Affiliation(s)
- M. J. Silk
- Centre for Ecology and Conservation, University of Exeter, Penryn Campus, UK
| | - S. Carrignon
- Center for the Dynamics of Social Complexity, University of Tennessee, Knoxville, TN, USA
- Department of Anthropology, University of Tennessee, Knoxville, TN, USA
- School of Information Sciences, University of Tennessee, Knoxville, TN, USA
| | - R. A. Bentley
- Department of Anthropology, University of Tennessee, Knoxville, TN, USA
| | - N. H. Fefferman
- Department of Ecology and Evolutionary Biology, University of Tennessee, Knoxville, TN, USA
- Department of Mathematics, University of Tennessee, Knoxville, TN, USA
| |
Collapse
|
4
|
Brethouwer JT, van de Rijt A, Lindelauf R, Fokkink R. "Stay nearby or get checked": A Covid-19 control strategy. Infect Dis Model 2020; 6:36-45. [PMID: 33225114 PMCID: PMC7669247 DOI: 10.1016/j.idm.2020.10.013] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2020] [Revised: 10/13/2020] [Accepted: 10/31/2020] [Indexed: 11/25/2022] Open
Abstract
This paper repurposes the classic insight from network theory that long-distance connections drive disease propagation into a strategy for controlling a second wave of Covid-19. We simulate a scenario in which a lockdown is first imposed on a population and then partly lifted while long-range transmission is kept at a minimum. Simulated spreading patterns resemble contemporary distributions of Covid- 19 across EU member states, German and Italian regions, and through New York City, providing some model validation. Results suggest that our proposed strategy may significantly reduce peak infection. We also find that post-lockdown flare-ups remain local longer, aiding geographical containment. These results suggest a tailored policy in which individuals who frequently travel to places where they interact with many people are offered greater protection, tracked more closely, and are regularly tested. This policy can be communicated to the general public as a simple and reasonable principle: Stay nearby or get checked.
Collapse
Affiliation(s)
| | - Arnout van de Rijt
- European University Institute, Political and Social Sciences, Italy
- Utrecht University, Sociology, Netherlands
| | - Roy Lindelauf
- Netherlands Defence Academy, Faculty of Military Science, Intelligence and Security, Netherlands
| | - Robbert Fokkink
- TU Delft, Delft Institute of Applied Mathematics, Netherlands
| |
Collapse
|
5
|
Zhou Z, Wang B. Identification of male infertility-related long non-coding RNAs and their functions based on a competing endogenous RNA network. J Int Med Res 2020; 48:300060520961277. [PMID: 33054493 PMCID: PMC7580164 DOI: 10.1177/0300060520961277] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
Abstract
OBJECTIVE To identify male infertility-related long non-coding (lnc)RNAs and an lncRNA-related competing endogenous (ce)RNA network. METHODS Expression data including 13 normospermic and eight teratozoospermic samples from postmortem donors were downloaded from the GEO database (GSE6872). The limma R package was used to discriminate dysregulated lncRNA and micro (m)RNA profiles. Gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analyses of differentially expressed (DE) mRNAs were performed using the clusterProfiler R package. The ceRNA network of dysregulated genes was visualized by Cytoscape. RESULTS A total of 101 DE lncRNAs and 1722 mRNAs were identified as male infertility-specific RNAs with thresholds of |log2FoldChange| >2.0 and adjusted P-value <0.05. GO and KEGG pathways were analyzed for DE mRNAs. Gene set enrichment analysis revealed that DE genes were enriched in embryonic skeletal system development and cytokine-cytokine receptor interactions. A ceRNA network was constructed with 26 key lncRNAs, 33 microRNAs, and 133 mRNAs. DE lncRNAs in male sterility were mainly associated with transferring phosphorus-containing groups and complexes of histone methyltransferases, methyltransferases, PcG proteins, and serine/threonine protein kinases. CONCLUSION This provides a novel perspective to study lncRNA-related ceRNA networks in male infertility and assist in identifying new potential biomarkers for diagnostic purposes.
Collapse
Affiliation(s)
- Zuo Zhou
- Department of Obstetrics, Maternal and Child Health Hospital of Zibo City, Shandong Province, China
| | - Bing Wang
- Center of Reproductive Medicine, Maternal and Child Health Hospital of Zibo City, Shandong Province, China
| |
Collapse
|
6
|
Barido-Sottani J, Vaughan TG, Stadler T. Detection of HIV transmission clusters from phylogenetic trees using a multi-state birth-death model. J R Soc Interface 2019; 15:rsif.2018.0512. [PMID: 30185544 PMCID: PMC6170769 DOI: 10.1098/rsif.2018.0512] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2018] [Accepted: 08/13/2018] [Indexed: 12/03/2022] Open
Abstract
HIV patients form clusters in HIV transmission networks. Accurate identification of these transmission clusters is essential to effectively target public health interventions. One reason for clustering is that the underlying contact network contains many local communities. We present a new maximum-likelihood method for identifying transmission clusters caused by community structure, based on phylogenetic trees. The method employs a multi-state birth–death (MSBD) model which detects changes in transmission rate, which are interpreted as the introduction of the epidemic into a new susceptible community, i.e. the formation of a new cluster. We show that the MSBD method is able to reliably infer the clusters and the transmission parameters from a pathogen phylogeny based on our simulations. In contrast to existing cutpoint-based methods for cluster identification, our method does not require that clusters be monophyletic nor is it dependent on the selection of a difficult-to-interpret cutpoint parameter. We present an application of our method to data from the Swiss HIV Cohort Study. The method is available as an easy-to-use R package.
Collapse
Affiliation(s)
- Joëlle Barido-Sottani
- Department of Biosystems Science and Engineering, ETH Zürich, Basel, Switzerland .,Swiss Institute of Bioinformatics (SIB), Switzerland
| | - Timothy G Vaughan
- Department of Biosystems Science and Engineering, ETH Zürich, Basel, Switzerland.,Swiss Institute of Bioinformatics (SIB), Switzerland
| | - Tanja Stadler
- Department of Biosystems Science and Engineering, ETH Zürich, Basel, Switzerland.,Swiss Institute of Bioinformatics (SIB), Switzerland
| |
Collapse
|
7
|
Hidano A, Gates MC. Assessing biases in phylodynamic inferences in the presence of super-spreaders. Vet Res 2019; 50:74. [PMID: 31558163 PMCID: PMC6764146 DOI: 10.1186/s13567-019-0692-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2019] [Accepted: 08/28/2019] [Indexed: 12/03/2022] Open
Abstract
Phylodynamic analyses using pathogen genetic data have become popular for making epidemiological inferences. However, many methods assume that the underlying host population follows homogenous mixing patterns. Nevertheless, in real disease outbreaks, a small number of individuals infect a disproportionately large number of others (super-spreaders). Our objective was to quantify the degree of bias in estimating the epidemic starting date in the presence of super-spreaders using different sample selection strategies. We simulated 100 epidemics of a hypothetical pathogen (fast evolving foot and mouth disease virus-like) over a real livestock movement network allowing the genetic mutations in pathogen sequence. Genetic sequences were sampled serially over the epidemic, which were then used to estimate the epidemic starting date using Extended Bayesian Coalescent Skyline plot (EBSP) and Birth–death skyline plot (BDSKY) models. Our results showed that the degree of bias varies over different epidemic situations, with substantial overestimations on the epidemic duration occurring in some occasions. While the accuracy and precision of BDSKY were deteriorated when a super-spreader generated a larger proportion of secondary cases, those of EBSP were deteriorated when epidemics were shorter. The accuracies of the inference were similar irrespective of whether the analysis used all sampled sequences or only a subset of them, although the former required substantially longer computational times. When phylodynamic analyses need to be performed under a time constraint to inform policy makers, we suggest multiple phylodynamics models to be used simultaneously for a subset of data to ascertain the robustness of inferences.
Collapse
Affiliation(s)
- Arata Hidano
- EpiCentre, School of Veterinary Science, Massey University, Palmerston North, New Zealand.
| | - M Carolyn Gates
- EpiCentre, School of Veterinary Science, Massey University, Palmerston North, New Zealand
| |
Collapse
|
8
|
Gilbertson MLJ, Fountain-Jones NM, Craft ME. Incorporating genomic methods into contact networks to reveal new insights into animal behavior and infectious disease dynamics. BEHAVIOUR 2019; 155:759-791. [PMID: 31680698 DOI: 10.1163/1568539x-00003471] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
Utilization of contact networks has provided opportunities for assessing the dynamic interplay between pathogen transmission and host behavior. Genomic techniques have, in their own right, provided new insight into complex questions in disease ecology, and the increasing accessibility of genomic approaches means more researchers may seek out these tools. The integration of network and genomic approaches provides opportunities to examine the interaction between behavior and pathogen transmission in new ways and with greater resolution. While a number of studies have begun to incorporate both contact network and genomic approaches, a great deal of work has yet to be done to better integrate these techniques. In this review, we give a broad overview of how network and genomic approaches have each been used to address questions regarding the interaction of social behavior and infectious disease, and then discuss current work and future horizons for the merging of these techniques.
Collapse
Affiliation(s)
- Marie L J Gilbertson
- Department of Veterinary Population Medicine, University of Minnesota, Minneapolis, Minnesota 55455, USA
| | - Nicholas M Fountain-Jones
- Department of Veterinary Population Medicine, University of Minnesota, Minneapolis, Minnesota 55455, USA
| | - Meggan E Craft
- Department of Veterinary Population Medicine, University of Minnesota, Minneapolis, Minnesota 55455, USA
| |
Collapse
|
9
|
Metzig C, Ratmann O, Bezemer D, Colijn C. Phylogenies from dynamic networks. PLoS Comput Biol 2019; 15:e1006761. [PMID: 30807578 PMCID: PMC6420041 DOI: 10.1371/journal.pcbi.1006761] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2018] [Revised: 03/15/2019] [Accepted: 01/07/2019] [Indexed: 12/12/2022] Open
Abstract
The relationship between the underlying contact network over which a pathogen spreads and the pathogen phylogenetic trees that are obtained presents an opportunity to use sequence data to learn about contact networks that are difficult to study empirically. However, this relationship is not explicitly known and is usually studied in simulations, often with the simplifying assumption that the contact network is static in time, though human contact networks are dynamic. We simulate pathogen phylogenetic trees on dynamic Erdős-Renyi random networks and on two dynamic networks with skewed degree distribution, of which one is additionally clustered. We use tree shape features to explore how adding dynamics changes the relationships between the overall network structure and phylogenies. Our tree features include the number of small substructures (cherries, pitchforks) in the trees, measures of tree imbalance (Sackin index, Colless index), features derived from network science (diameter, closeness), as well as features using the internal branch lengths from the tip to the root. Using principal component analysis we find that the network dynamics influence the shapes of phylogenies, as does the network type. We also compare dynamic and time-integrated static networks. We find, in particular, that static network models like the widely used Barabasi-Albert model can be poor approximations for dynamic networks. We explore the effects of mis-specifying the network on the performance of classifiers trained identify the transmission rate (using supervised learning methods). We find that both mis-specification of the underlying network and its parameters (mean degree, turnover rate) have a strong adverse effect on the ability to estimate the transmission parameter. We illustrate these results by classifying HIV trees with a classifier that we trained on simulated trees from different networks, infection rates and turnover rates. Our results point to the importance of correctly estimating and modelling contact networks with dynamics when using phylodynamic tools to estimate epidemiological parameters.
Collapse
Affiliation(s)
- Cornelia Metzig
- Dept of Electronic Engineering and Computer Science, Queen Mary University of London, London, United Kingdom
| | - Oliver Ratmann
- Dept of Mathematics, Imperial College London, London, United Kingdom
| | | | - Caroline Colijn
- Dept of Mathematics, Simon Fraser University, Burnaby, Canada
| |
Collapse
|