1
|
Byrnes JEK, Dee LE. Causal Inference With Observational Data and Unobserved Confounding Variables. Ecol Lett 2025; 28:e70023. [PMID: 39836442 PMCID: PMC11750058 DOI: 10.1111/ele.70023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2024] [Revised: 10/15/2024] [Accepted: 10/16/2024] [Indexed: 01/22/2025]
Abstract
Experiments have long been the gold standard for causal inference in Ecology. As Ecology tackles progressively larger problems, however, we are moving beyond the scales at which randomised controlled experiments are feasible. To answer causal questions at scale, we need to also use observational data -something Ecologists tend to view with great scepticism. The major challenge using observational data for causal inference is confounding variables: variables affecting both a causal variable and response of interest. Unmeasured confounders-known or unknown-lead to statistical bias, creating spurious correlations and masking true causal relationships. To combat this omitted variable bias, other disciplines have developed rigorous approaches for causal inference from observational data that flexibly control for broad suites of confounding variables. We show how ecologists can harness some of these methods-causal diagrams to identify confounders coupled with nested sampling and statistical designs-to reduce risks of omitted variable bias. Using an example of estimating warming effects on snails, we show how current methods in Ecology (e.g., mixed models) produce incorrect inferences due to omitted variable bias and how alternative methods can eliminate it, improving causal inferences with weaker assumptions. Our goal is to expand tools for causal inference using observational and imperfect experimental data in Ecology.
Collapse
Affiliation(s)
| | - Laura E. Dee
- Department of Ecology and Evolutionary BiologyUniversity of Colorado BoulderBoulderColoradoUSA
| |
Collapse
|
2
|
Harwell LC, McMillion CA, Lamper AM, Summers JK. Development of a generalized pseudo-probabilistic approach for characterizing ecological conditions in estuaries using secondary data. ENVIRONMENTAL MONITORING AND ASSESSMENT 2024; 196:753. [PMID: 39030312 PMCID: PMC11271375 DOI: 10.1007/s10661-024-12877-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/04/2023] [Accepted: 06/28/2024] [Indexed: 07/21/2024]
Abstract
Under the best circumstances, achieving or sustaining optimum ecological conditions in estuaries is challenging. Persistent information gaps in estuarine data make it difficult to differentiate natural variability from potential regime shifts. Long-term monitoring is critical for tracking ecological change over time. In the United States (US), many resource management programs are working at maximum capacity to address existing state and federal water quality mandates (e.g., pollutant load limits, climate impact mitigation, and fisheries management) and have little room to expand routine sampling efforts to conduct periodic ecological baseline assessments, especially at state and local scales. Alternative design, monitoring, and assessment approaches are needed to help offset the burden of addressing additional data needs to increase understanding about estuarine system resilience when existing monitoring data are sparse or spatially limited. Research presented here offers a pseudo-probabilistic approach that allows for the use of found or secondary data, such as data on hand and other acquired data, to generate statistically robust characterizations of ecological conditions in estuaries. Our approach uses a generalized pseudo-probabilistic framework to synthesize data from different contributors to inform probabilistic-like baseline assessments. The methodology relies on simple geospatial techniques and existing tools (R package functions) developed for the US Environmental Protection Agency to support ecological monitoring and assessment programs like the National Coastal Condition Assessment. Using secondary estuarine water quality data collected in the Northwest Florida (US) estuaries, demonstrations suggest that the pseudo-probabilistic approach produces estuarine condition assessment results with reasonable statistical confidence, improved spatial representativeness, and value-added information. While the pseudo-probabilistic framework is not a substitute for fully evolved monitoring, it offers a scalable alternative to bridge the gap between limitations in resource management capability and optimal monitoring strategies to track ecological baselines in estuaries over time.
Collapse
Affiliation(s)
- Linda C Harwell
- US Environmental Protection Agency, 1 Sabine Island Drive, Gulf Breeze, FL, 32561, USA.
| | - Courtney A McMillion
- Oak Ridge Associated Universities, 1 Sabine Island Drive, Gulf Breeze, FL, 32561, USA
- Santa Rosa County Florida, Development Services Department, 6051 Old Bagdad Highway, Suite 202, Milton, FL, 32583, USA
| | - Andrea M Lamper
- Oak Ridge Associated Universities, 1 Sabine Island Drive, Gulf Breeze, FL, 32561, USA
- CDM Smith, 670 N Commercial St, Unit 208, Manchester, NH, 03101, USA
| | - J Kevin Summers
- US Environmental Protection Agency, 1 Sabine Island Drive, Gulf Breeze, FL, 32561, USA
| |
Collapse
|
3
|
Robertson B, Price C, Reale M. Well-spread samples with dynamic sample sizes. Biometrics 2024; 80:ujae026. [PMID: 38591365 DOI: 10.1093/biomtc/ujae026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2023] [Revised: 03/04/2024] [Accepted: 03/25/2024] [Indexed: 04/10/2024]
Abstract
A spatial sampling design determines where sample locations are placed in a study area so that population parameters can be estimated with relatively high precision. If the response variable has spatial trends, spatially balanced or well-spread designs give precise results for commonly used estimators. This article proposes a new method that draws well-spread samples over arbitrary auxiliary spaces and can be used for master sampling applications. All we require is a measure of the distance between population units. Numerical results show that the method generates well-spread samples and compares favorably with existing designs. We provide an example application using several auxiliary variables to estimate total aboveground biomass over a large study area in Eastern Amazonia, Brazil. Multipurpose surveys are also considered, where the totals of aboveground biomass, primary production, and clay content (3 responses) are estimated from a single well-spread sample over the auxiliary space.
Collapse
Affiliation(s)
- Blair Robertson
- School of Mathematics and Statistics, University of Canterbury, Private Bag 4800, Christchurch, NZ
| | - Chris Price
- School of Mathematics and Statistics, University of Canterbury, Private Bag 4800, Christchurch, NZ
| | - Marco Reale
- School of Mathematics and Statistics, University of Canterbury, Private Bag 4800, Christchurch, NZ
| |
Collapse
|
4
|
Hook SE, Foster S, Althaus F, Bearham D, Angel BM, Revill AT, Simpson SL, Strzelecki J, Cresswell T, Hayes KR. The distribution of metal and petroleum-derived contaminants within sediments around oil and gas infrastructure in the Gippsland Basin, Australia. MARINE POLLUTION BULLETIN 2023; 193:115196. [PMID: 37421917 DOI: 10.1016/j.marpolbul.2023.115196] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/18/2022] [Revised: 06/14/2023] [Accepted: 06/15/2023] [Indexed: 07/10/2023]
Abstract
As oil and gas infrastructure comes to the end of its working life, a decommissioning decision must be made: should the infrastructure be abandoned in situ, repurposed, partially removed, or fully removed? Environmental contaminants around oil and gas infrastructure could influence these decisions because contaminants in sediments could degrade the value of the infrastructure as habitat, enter the seafood supply if the area is re-opened for commercial and/or recreational fishing, or be made biologically available as sediment is resuspended when the structures are moved. An initial risk hypothesis, however, may postulate that these concerns are only relevant if contaminant concentrations are above screening values that predict the possibility of environmental harm or contaminant bioaccumulation. To determine whether a substantive contaminants-based risk assessment is needed for infrastructure in the Gippsland Basin (South-eastern Australia), we measured the concentration of metals and polycyclic aromatic hydrocarbons (PAHs) in benthic sediments collected around eight platforms earmarked for decommissioning. The measurements were compared to preset screening values and to background contaminant concentrations in reference sites. Lead (Pb), zinc (Zn), PAHs and other contaminants were occasionally measured at concentrations that exceeded reference values, most often within 150 m of the platforms. The exceedance of a few screening values by contaminants at some platforms indicates that these platforms require further analysis to determine the contaminant risks associated with any decommissioning option.
Collapse
Affiliation(s)
| | | | | | | | - Brad M Angel
- CSIRO Environment, Lucas Heights, NSW, Australia
| | | | | | | | - Tom Cresswell
- Australian Nuclear Science and Technology Organisation, Lucas Heights, NSW, Australia
| | | |
Collapse
|
5
|
Affiliation(s)
| | - Brian H. McArdle
- Department of Statistics University of Auckland Auckland New Zealand
| |
Collapse
|
6
|
Howard TG, White K, Goren J. Thirteen Years of Rare Plant Population Changes in the Adirondack Alpine. Northeast Nat (Steuben) 2021. [DOI: 10.1656/045.028.s1103] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Affiliation(s)
- Timothy G. Howard
- New York Natural Heritage Program, College of Environmental Science and Forestry, State University of New York, 625 Broadway, Albany, NY 12233-4757
| | - Kayla White
- Adirondack High Peaks Summit Stewardship Program, Adirondack Mountain Club, PO Box 867, Lake Placid, NY 12946
| | - Julia Goren
- Adirondack High Peaks Summit Stewardship Program, Adirondack Mountain Club, PO Box 867, Lake Placid, NY 12946
| |
Collapse
|
7
|
Benedetti R, Dickson MM, Espa G, Pantalone F, Piersimoni F. A simulated annealing-based algorithm for selecting balanced samples. Comput Stat 2021. [DOI: 10.1007/s00180-021-01113-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Abstract
AbstractBalanced sampling is a random method for sample selection, the use of which is preferable when auxiliary information is available for all units of a population. However, implementing balanced sampling can be a challenging task, and this is due in part to the computational efforts required and the necessity to respect balancing constraints and inclusion probabilities. In the present paper, a new algorithm for selecting balanced samples is proposed. This method is inspired by simulated annealing algorithms, as a balanced sample selection can be interpreted as an optimization problem. A set of simulation experiments and an example using real data shows the efficiency and the accuracy of the proposed algorithm.
Collapse
|
8
|
Altieri L, Cocchi D. Spatial Sampling for Non‐compact Patterns. Int Stat Rev 2021. [DOI: 10.1111/insr.12445] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Linda Altieri
- Department of Statistical Sciences University of Bologna via Belle Arti 41 Bologna 40126 Italy
| | - Daniela Cocchi
- Department of Statistical Sciences University of Bologna via Belle Arti 41 Bologna 40126 Italy
| |
Collapse
|
9
|
|
10
|
Kermorvant C, D’Amico F, L’Ambert G, Dossou-Gbete S. Setting up an efficient survey of Aedes albopictus in an unfamiliar urban area. Urban Ecosyst 2020. [DOI: 10.1007/s11252-020-01041-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|