1
|
Caruso G, Alaimo Di Loro P, Mingione M, Tardella L, Pace DS, Jona Lasinio G. Finite mixtures in capture-recapture surveys for modeling residency patterns in marine wildlife populations. Biom J 2024; 66:e2200350. [PMID: 38285406 DOI: 10.1002/bimj.202200350] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2022] [Revised: 09/13/2023] [Accepted: 09/25/2023] [Indexed: 01/30/2024]
Abstract
This work aims to show how prior knowledge about the structure of a heterogeneous animal population can be leveraged to improve the abundance estimation from capture-recapture survey data. We combine the Open Jolly-Seber model with finite mixtures and propose a parsimonious specification tailored to the residency patterns of the common bottlenose dolphin. We employ a Bayesian framework for our inference, discussing the appropriate choice of priors to mitigate label-switching and nonidentifiability issues, commonly associated with finite mixture models. We conduct a series of simulation experiments to illustrate the competitive advantage of our proposal over less specific alternatives. The proposed approach is applied to data collected on the common bottlenose dolphin population inhabiting the Tiber River estuary (Mediterranean Sea). Our results provide novel insights into this population's size and structure, shedding light on some of the ecological processes governing its dynamics.
Collapse
Affiliation(s)
- Gianmarco Caruso
- MRC Biostatistics Unit, University of Cambridge, Cambridge, UK
- Department of Statistical Sciences, Sapienza University of Rome, Rome, Italy
| | | | - Marco Mingione
- Department of Political Sciences, University of Roma Tre, Rome, Italy
| | - Luca Tardella
- Department of Statistical Sciences, Sapienza University of Rome, Rome, Italy
| | - Daniela Silvia Pace
- Department of of Environmental Biology, Sapienza University of Rome, Rome, Italy
- Institute for the Study of Anthropogenic Impacts and Sustainability in the Marine Environment, CNR, Trapani, Italy
| | | |
Collapse
|
2
|
Gosky RM, Sanqui J. A Simulation Study on Increasing Capture Periods in Bayesian Closed Population Capture-Recapture Models with Heterogeneity. JOURNAL OF MODERN APPLIED STATISTICAL METHODS 2020. [DOI: 10.22237/jmasm/1556668920] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
Capture-Recapture models are useful in estimating unknown population sizes. A common modeling challenge for closed population models involves modeling unequal animal catchability in each capture period, referred to as animal heterogeneity. Inference about population size N is dependent on the assumed distribution of animal capture probabilities in the population, and that different models can fit a data set equally well but provide contradictory inferences about N. Three common Bayesian Capture-Recapture heterogeneity models are studied with simulated data to study the prevalence of contradictory inferences is in different population sizes with relatively low capture probabilities, specifically at different numbers of capture periods in the study.
Collapse
|
3
|
Worthington H, McCrea RS, King R, Griffiths RA. Estimation of Population Size When Capture Probability Depends on Individual States. JOURNAL OF AGRICULTURAL, BIOLOGICAL AND ENVIRONMENTAL STATISTICS 2018. [DOI: 10.1007/s13253-018-00347-x] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
4
|
Alunni Fegatelli D, Tardella L. Flexible behavioral capture-recapture modeling. Biometrics 2015; 72:125-35. [DOI: 10.1111/biom.12417] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/01/2014] [Revised: 07/01/2015] [Accepted: 08/01/2015] [Indexed: 11/30/2022]
Affiliation(s)
| | - Luca Tardella
- Dipartimento di Scienze Statistiche -Sapienza Università di Roma (Italy)
| |
Collapse
|
5
|
Akanda MAS, Alpizar-Jara R. Estimation of capture probabilities using generalized estimating equations and mixed effects approaches. Ecol Evol 2014; 4:1158-65. [PMID: 24772290 PMCID: PMC3997329 DOI: 10.1002/ece3.1000] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2014] [Accepted: 01/23/2014] [Indexed: 11/21/2022] Open
Abstract
Modeling individual heterogeneity in capture probabilities has been one of the most challenging tasks in capture–recapture studies. Heterogeneity in capture probabilities can be modeled as a function of individual covariates, but correlation structure among capture occasions should be taking into account. A proposed generalized estimating equations (GEE) and generalized linear mixed modeling (GLMM) approaches can be used to estimate capture probabilities and population size for capture–recapture closed population models. An example is used for an illustrative application and for comparison with currently used methodology. A simulation study is also conducted to show the performance of the estimation procedures. Our simulation results show that the proposed quasi-likelihood based on GEE approach provides lower SE than partial likelihood based on either generalized linear models (GLM) or GLMM approaches for estimating population size in a closed capture–recapture experiment. Estimator performance is good if a large proportion of individuals are captured. For cases where only a small proportion of individuals are captured, the estimates become unstable, but the GEE approach outperforms the other methods.
Collapse
Affiliation(s)
- Md Abdus Salam Akanda
- Department of Mathematics, Research Center in Mathematics and Applications, University of Évora 7000-671, Évora, Portugal ; Department of Statistics, Biostatistics & Informatics, University of Dhaka Dhaka, 1000, Bangladesh
| | - Russell Alpizar-Jara
- Department of Mathematics, Research Center in Mathematics and Applications, University of Évora 7000-671, Évora, Portugal
| |
Collapse
|
6
|
|
7
|
Zhang H, Ghosh K, Ghosh P. Sampling designs via a multivariate hypergeometric-Dirichlet process model for a multi-species assemblage with unknown heterogeneity. Comput Stat Data Anal 2012. [DOI: 10.1016/j.csda.2012.02.013] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
8
|
Zhang S. A GEE approach for estimating size of hard-to-reach population by using capture–recapture data. STATISTICS-ABINGDON 2012. [DOI: 10.1080/02331888.2010.500735] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
9
|
Gosky RM, Ghosh SK. A Comparative Study of Bayes Estimators of Closed Population Size from Capture-Recapture Data. JOURNAL OF STATISTICAL THEORY AND PRACTICE 2011. [DOI: 10.1080/15598608.2011.10412027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
|
10
|
Heisey DM, Osnas EE, Cross PC, Joly DO, Langenberg JA, Miller MW. Linking process to pattern: estimating spatiotemporal dynamics of a wildlife epidemic from cross-sectional data. ECOL MONOGR 2010. [DOI: 10.1890/09-0052.1] [Citation(s) in RCA: 59] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
|
11
|
|
12
|
Grosbois V, Gimenez O, Gaillard JM, Pradel R, Barbraud C, Clobert J, Møller AP, Weimerskirch H. Assessing the impact of climate variation on survival in vertebrate populations. Biol Rev Camb Philos Soc 2008; 83:357-99. [PMID: 18715402 DOI: 10.1111/j.1469-185x.2008.00047.x] [Citation(s) in RCA: 310] [Impact Index Per Article: 19.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
The impact of the ongoing rapid climate change on natural systems is a major issue for human societies. An important challenge for ecologists is to identify the climatic factors that drive temporal variation in demographic parameters, and, ultimately, the dynamics of natural populations. The analysis of long-term monitoring data at the individual scale is often the only available approach to estimate reliably demographic parameters of vertebrate populations. We review statistical procedures used in these analyses to study links between climatic factors and survival variation in vertebrate populations. We evaluated the efficiency of various statistical procedures from an analysis of survival in a population of white stork, Ciconia ciconia, a simulation study and a critical review of 78 papers published in the ecological literature. We identified six potential methodological problems: (i) the use of statistical models that are not well-suited to the analysis of long-term monitoring data collected at the individual scale; (ii) low ratios of number of statistical units to number of candidate climatic covariates; (iii) collinearity among candidate climatic covariates; (iv) the use of statistics, to assess statistical support for climatic covariates effects, that deal poorly with unexplained variation in survival; (v) spurious detection of effects due to the co-occurrence of trends in survival and the climatic covariate time series; and (vi) assessment of the magnitude of climatic effects on survival using measures that cannot be compared across case studies. The critical review of the ecological literature revealed that five of these six methodological problems were often poorly tackled. As a consequence we concluded that many of these studies generated hypotheses but only few provided solid evidence for impacts of climatic factors on survival or reliable measures of the magnitude of such impacts. We provide practical advice to solve efficiently most of the methodological problems identified. The only frequent issue that still lacks a straightforward solution was the low ratio of the number of statistical units to the number of candidate climatic covariates. In the perspective of increasing this ratio and therefore of producing more robust analyses of the links between climate and demography, we suggest leads to improve the procedures for designing field protocols and selecting a set of candidate climatic covariates. Finally, we present recent statistical methods with potential interest for assessing the impact of climatic factors on demographic parameters.
Collapse
Affiliation(s)
- V Grosbois
- Centre d'Ecologie Fonctionnelle et Evolutive, UMR 5175, 1919 Route de Mende, F-34293 Montpellier Cedex 5, France.
| | | | | | | | | | | | | | | |
Collapse
|
13
|
Buenconsejo J, Fish D, Childs JE, Holford TR. A Bayesian hierarchical model for the estimation of two incomplete surveillance data sets. Stat Med 2008; 27:3269-85. [PMID: 18314934 DOI: 10.1002/sim.3190] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
A model-based approach to analyze two incomplete disease surveillance datasets is described. Such data typically consist of case counts, each originating from a specific geographical area. A Bayesian hierarchical model is proposed for estimating the total number of cases with disease while simultaneously adjusting for spatial variation. This approach explicitly accounts for model uncertainty and can make use of covariates.The method is applied to two surveillance datasets maintained by the Centers for Disease Control and Prevention on Rocky Mountain spotted fever (RMSF). An inference is drawn using Markov Chain Monte Carlo simulation techniques in a fully Bayesian framework. The central feature of the model is the ability to calculate and estimate the total number of cases and disease incidence for geographical regions where RMSF is endemic.The information generated by this model could significantly reduce the public health impact of RMSF and other vector-borne zoonoses, as well as other infectious or chronic diseases, by improving knowledge of the spatial distribution of disease risk of public health officials and medical practitioners. More accurate information on populations at high risk would focus attention and resources on specific areas, thereby reducing the morbidity and mortality caused by some of the preventable and treatable diseases.
Collapse
Affiliation(s)
- Joan Buenconsejo
- Center for Drugs, Evaluation and Research, US Food and Drug Administration, 10903 New Hampshire Avenue, Bldg. 22, Rm. 3241, Silver Spring, MD 20993-0002, USA.
| | | | | | | |
Collapse
|
14
|
King R, Brooks SP. On the Bayesian estimation of a closed population size in the presence of heterogeneity and model uncertainty. Biometrics 2007; 64:816-824. [PMID: 18047534 DOI: 10.1111/j.1541-0420.2007.00938.x] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
We consider the estimation of the size of a closed population, often of interest for wild animal populations, using a capture-recapture study. The estimate of the total population size can be very sensitive to the choice of model used to fit to the data. We consider a Bayesian approach, in which we consider all eight plausible models initially described by Otis et al. (1978, Wildlife Monographs 62, 1-135) within a single framework, including models containing an individual heterogeneity component. We show how we are able to obtain a model-averaged estimate of the total population, incorporating both parameter and model uncertainty. To illustrate the methodology we initially perform a simulation study and analyze two datasets where the population size is known, before considering a real example relating to a population of dolphins off northeast Scotland.
Collapse
Affiliation(s)
- R King
- School of Mathematics and Statistics, University of St. Andrews, North Haugh, St. Andrews, Fife KY16 9SS, U.K
| | - S P Brooks
- The Statistical Laboratory, University of Cambridge, Cambridge, CB3 0WB, U.K
| |
Collapse
|
15
|
Ohlssen DI, Sharples LD, Spiegelhalter DJ. Flexible random-effects models using Bayesian semi-parametric models: applications to institutional comparisons. Stat Med 2007; 26:2088-112. [PMID: 16906554 DOI: 10.1002/sim.2666] [Citation(s) in RCA: 126] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
Random effects models are used in many applications in medical statistics, including meta-analysis, cluster randomized trials and comparisons of health care providers. This paper provides a tutorial on the practical implementation of a flexible random effects model based on methodology developed in Bayesian non-parametrics literature, and implemented in freely available software. The approach is applied to the problem of hospital comparisons using routine performance data, and among other benefits provides a diagnostic to detect clusters of providers with unusual results, thus avoiding problems caused by masking in traditional parametric approaches. By providing code for Winbugs we hope that the model can be used by applied statisticians working in a wide variety of applications.
Collapse
Affiliation(s)
- D I Ohlssen
- MRC Biostatistics Unit, Institute of Public Health, Robinson Way, Cambridge CB2 2SR, UK.
| | | | | |
Collapse
|