1
|
Hansul S, Fettweis A, Smolders E, Schamphelaere KD. Extrapolating Metal (Cu, Ni, Zn) Toxicity from Individuals to Populations Across Daphnia Species Using Mechanistic Models: The Roles of Uncertainty Propagation and Combined Physiological Modes of Action. ENVIRONMENTAL TOXICOLOGY AND CHEMISTRY 2024; 43:338-358. [PMID: 37921584 DOI: 10.1002/etc.5782] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/02/2023] [Revised: 07/31/2023] [Accepted: 10/31/2023] [Indexed: 11/04/2023]
Abstract
Mechanistic effect modeling is a promising tool to improve the ecological realism of environmental risk assessment. An open question for the mechanistic modeling of metal toxicity is whether the same physiological mode of action (PMoA) could be assumed for closely related species. The implications of various modeling choices, such as the use of parameter point estimates and assumption of simplistic toxicodynamic models, are largely unexplored. We conducted life-table experiments with Daphnia longispina, Daphnia magna, and Daphnia pulex exposed to the single metals Cu, Ni, and Zn, and calibrated toxicokinetic-toxicodynamic (TKTD) models based on dynamic energy budget theory. We developed TKTD models with single and combined PMoAs to compare their goodness-of-fit and predicted population-level sensitivity. We identified the PMoA reproduction efficiency as most probable in all species for Ni and Zn, but not for Cu, and found that combined-PMoA models predicted higher population-level sensitivity than single-PMoA models, which was related to the predicted individual-level sensitivity, rather than to mechanistic differences between models. Using point estimates of parameters, instead of sampling from the probability distributions of parameters, could also lead to differences in the predicted population-level sensitivity. According to model predictions, apical chronic endpoints (cumulative reproduction, survival) are conservative for single-metal population effects across metals and species. We conclude that the assumption of an identical PMoA for different species of Daphnia could be justified for Ni and Zn, but not for Cu. Single-PMoA models are more appropriate than combined-PMoA models from a model selection perspective, but propagation of the associated uncertainty should be considered. More accurate predictions of effects at low concentrations may nevertheless motivate the use of combined-PMoA models. Environ Toxicol Chem 2024;43:338-358. © 2023 The Authors. Environmental Toxicology and Chemistry published by Wiley Periodicals LLC on behalf of SETAC.
Collapse
Affiliation(s)
- Simon Hansul
- Laboratory of Environmental Toxicology and Aquatic Ecology, Ghent University, Ghent, Belgium
| | | | - Erik Smolders
- Soil and Water Management, KU Leuven, Leuven, Belgium
| | - Karel De Schamphelaere
- Laboratory of Environmental Toxicology and Aquatic Ecology, Ghent University, Ghent, Belgium
| |
Collapse
|
2
|
Lee F, Gallo MV, Schell LM, Jennings J, Lawrence DA, On The Environment ATF. Exposure of Akwesasne Mohawk women to polychlorinated biphenyls and hexachlorobenzene is associated with increased serum levels of thyroid peroxidase autoantibodies. JOURNAL OF TOXICOLOGY AND ENVIRONMENTAL HEALTH. PART A 2023; 86:597-613. [PMID: 37335069 DOI: 10.1080/15287394.2023.2226685] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/21/2023]
Abstract
Persistent organic pollutants (POPs) including polychlorinated biphenyls (PCBs), hexachlorobenzene (HCB), and dichlorodiphenyltrichloroethane (p,p'-DDT) were reported to influence immunological activity. As endocrine-disrupting chemicals (EDC), these pollutants may disrupt normal thyroid function and act as catalysts for development of autoimmune thyroid disease by directly and indirectly affecting levels of thyroid peroxidase antibodies (TPOAbs). Native American communities are disproportionately exposed to harmful toxicants and are at an increased risk of developing an autoimmune disease. The aim of this study was to determine the association between POPs and TPOAbs in serum obtained from Native American women. This assessment was used to measure whether increased risk of autoimmune thyroid disease occurred as a result of exposure to POPs. Data were collected from 183 Akwesasne Mohawk women, 21-38 years of age, between 2009 and 2013. Multivariate analyses were conducted to determine the association between toxicant exposure and levels of TPOAbs. In multiple logistic regression analyses, exposure to PCB congener 33 was related to elevated risk of individuals possessing above normal levels of TPOAbs. Further, HCB was associated with more than 2-fold higher risk of possessing above normal levels of TPOAbs compared to women with normal levels of TPOAbs. p,p'-DDE was not associated with TPOAb levels within this study. Exposure to PCB congener 33 and HCB was correlated with above normal levels of TPOAbs, a marker of autoimmune thyroid disease. Additional investigations are needed to establish the causes and factors surrounding autoimmune thyroid disease which are multiple and complex.
Collapse
Affiliation(s)
- Florence Lee
- Department of Anthropology, University at Albany, Albany, NY, USA
| | - Mia V Gallo
- Department of Anthropology, University at Albany, Albany, NY, USA
- Center for the Elimination of Minority Health Disparities, University at Albany, Albany, NY, USA
| | - Lawrence M Schell
- Department of Anthropology, University at Albany, Albany, NY, USA
- Center for the Elimination of Minority Health Disparities, University at Albany, Albany, NY, USA
- Department of Epidemiology and Biostatistics, University at Albany, Albany, NY, USA
| | - Julia Jennings
- Department of Anthropology, University at Albany, Albany, NY, USA
| | - David A Lawrence
- Wadsworth Center/New York State Department of Health, Albany, NY, USA
- Biomedical Sciences and Environmental Health Sciences, University at Albany, Albany, NY, USA
| | | |
Collapse
|
3
|
Fu J, Koslovsky MD, Neophytou AM, Vannucci M. A Bayesian joint model for compositional mediation effect selection in microbiome data. Stat Med 2023. [PMID: 37173609 DOI: 10.1002/sim.9764] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2022] [Revised: 04/17/2023] [Accepted: 04/26/2023] [Indexed: 05/15/2023]
Abstract
Analyzing multivariate count data generated by high-throughput sequencing technology in microbiome research studies is challenging due to the high-dimensional and compositional structure of the data and overdispersion. In practice, researchers are often interested in investigating how the microbiome may mediate the relation between an assigned treatment and an observed phenotypic response. Existing approaches designed for compositional mediation analysis are unable to simultaneously determine the presence of direct effects, relative indirect effects, and overall indirect effects, while quantifying their uncertainty. We propose a formulation of a Bayesian joint model for compositional data that allows for the identification, estimation, and uncertainty quantification of various causal estimands in high-dimensional mediation analysis. We conduct simulation studies and compare our method's mediation effects selection performance with existing methods. Finally, we apply our method to a benchmark data set investigating the sub-therapeutic antibiotic treatment effect on body weight in early-life mice.
Collapse
Affiliation(s)
- Jingyan Fu
- Department of Statistics, Rice University, Houston, Texas, USA
| | - Matthew D Koslovsky
- Department of Statistics, Colorado State University, Fort Collins, Colorado, USA
| | - Andreas M Neophytou
- Department of Environmental & Radiological Health Sciences, Colorado State University, Fort Collins, Colorado, USA
| | - Marina Vannucci
- Department of Statistics, Rice University, Houston, Texas, USA
| |
Collapse
|
4
|
Hossain KB, Lin Y, Chen K, Zhang M, Liu M, Zhao W, Ke H, Liu F, Wang C, Cai M. Policy impact on microplastic reduction in China: Observation and prediction using statistical model in an intensive mariculture bay. THE SCIENCE OF THE TOTAL ENVIRONMENT 2023; 858:160075. [PMID: 36372178 DOI: 10.1016/j.scitotenv.2022.160075] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/10/2022] [Revised: 11/01/2022] [Accepted: 11/04/2022] [Indexed: 06/16/2023]
Abstract
Plastic pollution in the environment has spurred debate among scientists, policymakers, and the general public over how industrialization and consumerism are wreaking havoc on our ecosystem, but some policies might assist to ameliorate the problem in the near future. In this study, the decision tree classifier and Bayesian Structural Time Series (BSTS) model was used to anticipate the possible sources of microplastics and their near future state in 26 surface sediment and a sediment core, respectively in Sansha Bay, which has been criticized for its intensive mariculture applications. An inventory of microplastics in the sediment core was estimated, and it was discovered that during the previous six decades, an average of 181.95 tons of microplastics were deposited, with an average deposition (by a layer of sediment) of 179.44 tons/cm. According to the DT classifier, mariculture was the primary source of microplastics, whereas urban and industrial areas were the primary sources of POPs. The Bayesian Structural Time Series (BSTS) model revealed a microplastic downward slope, indicating that regional and national strategies implemented might successfully reduce microplastic pollution regionally.
Collapse
Affiliation(s)
- Kazi Belayet Hossain
- Coastal and Ocean Management Institute, Xiamen University, Xiamen 361102, China; College of Environment and Ecology, Xiamen University, Xiamen 361102, China; State Key Laboratory of Marine Environmental Science, Xiamen University, Xiamen 361102, China
| | - Yan Lin
- College of Ocean and Earth Science, Xiamen University, Xiamen 361102, China; State Key Laboratory of Marine Environmental Science, Xiamen University, Xiamen 361102, China; College of Environmental Science and Engineering, Xiamen University of Technology, Xiamen 361102, China
| | - Kai Chen
- College of Environment and Ecology, Xiamen University, Xiamen 361102, China; College of Ocean and Earth Science, Xiamen University, Xiamen 361102, China
| | - Mingyu Zhang
- College of Ocean and Earth Science, Xiamen University, Xiamen 361102, China; State Key Laboratory of Marine Environmental Science, Xiamen University, Xiamen 361102, China
| | - Mengyang Liu
- College of Ocean and Earth Science, Xiamen University, Xiamen 361102, China; State Key Laboratory of Marine Environmental Science, Xiamen University, Xiamen 361102, China; State Key Laboratory of Marine Pollution, City University of Hong Kong, 999077, Hong Kong, China
| | - Wenlu Zhao
- School of Environmental Science and Engineering, Zhejiang Gongshang University, Hangzhou 310018, China
| | - Hongwei Ke
- College of Ocean and Earth Science, Xiamen University, Xiamen 361102, China; State Key Laboratory of Marine Environmental Science, Xiamen University, Xiamen 361102, China
| | - Fengjiao Liu
- College of Chemistry, Chemical Engineering & Environment, Minnan Normal University, Zhangzhou, Fujian Province, China
| | - Chunhui Wang
- College of Ocean and Earth Science, Xiamen University, Xiamen 361102, China
| | - Minggang Cai
- Coastal and Ocean Management Institute, Xiamen University, Xiamen 361102, China; Key Laboratory of Marine Chemistry and Application (Xiamen University), Fujian Province University, China; College of Environment and Ecology, Xiamen University, Xiamen 361102, China; College of Ocean and Earth Science, Xiamen University, Xiamen 361102, China; Fujian Provincial Key Laboratory for Coastal Ecology and Environmental Studies, Xiamen University, Xiamen 361102, China; State Key Laboratory of Marine Environmental Science, Xiamen University, Xiamen 361102, China.
| |
Collapse
|
5
|
Ding E, Wang Y, Liu J, Tang S, Shi X. A review on the application of the exposome paradigm to unveil the environmental determinants of age-related diseases. Hum Genomics 2022; 16:54. [DOI: 10.1186/s40246-022-00428-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2022] [Accepted: 10/29/2022] [Indexed: 11/11/2022] Open
Abstract
AbstractAge-related diseases account for almost half of all diseases among adults worldwide, and their incidence is substantially affected by the exposome, which is the sum of all exogenous and endogenous environmental exposures and the human body’s response to these exposures throughout the entire lifespan. Herein, we perform a comprehensive review of the epidemiological literature to determine the key elements of the exposome that affect the development of age-related diseases and the roles of aging hallmarks in this process. We find that most exposure assessments in previous aging studies have used a reductionist approach, whereby the effect of only a single environmental factor or a specific class of environmental factors on the development of age-related diseases has been examined. As such, there is a lack of a holistic and unbiased understanding of the effect of multiple environmental factors on the development of age-related diseases. To address this, we propose several research strategies based on an exposomic framework that could advance our understanding—in particular, from a mechanistic perspective—of how environmental factors affect the development of age-related diseases. We discuss the statistical methods and other methods that have been used in exposome-wide association studies, with a particular focus on multiomics technologies. We also address future challenges and opportunities in the realm of multidisciplinary approaches and genome–exposome epidemiology. Furthermore, we provide perspectives on precise public health services for vulnerable populations, public communications, the integration of risk exposure information, and the bench-to-bedside translation of research on age-related diseases.
Collapse
|
6
|
Kim H, Lee JT. On inferences about lag effects using lag models in air pollution time-series studies. ENVIRONMENTAL RESEARCH 2019; 171:134-144. [PMID: 30660919 DOI: 10.1016/j.envres.2018.12.032] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/23/2018] [Revised: 12/05/2018] [Accepted: 12/14/2018] [Indexed: 06/09/2023]
Abstract
The choice of lag length is a matter of uncertainty in air pollution time-series studies. Lag models and model selections are widely used for inferences about lag effects, but there is lack of discussion on the integration of the two. We aimed to provide theoretical discussion on the performance of lag models, and the impact of model selections on inferences about lag effects. Bias and model selections based upon information criteria, statistical significance, effect size, and model averaging were discussed in the context of lag analysis. A simulation with eight of PM2.5-mortality relation scenarios was also conducted in order to explore the performances of lag models and to compare the model selections. The application of lag models with an insufficient lag interval taken into account (i.e. insufficient lag models) provides biased estimates. We provided features of the model selections and showed their pitfalls in lag analysis of air pollution time-series studies. We also discussed limitations of meta-analysis which fails to consider the application of different lag models in individual studies. To foster exploration on air pollution-lag-response relations with relevant tools, we encourage researchers to compare different lag models in terms of effect estimates and variance estimates, and to report their favored models and competing models together based upon scientific knowledge supporting lag-response relations.
Collapse
Affiliation(s)
- Honghyok Kim
- BK21PLUS Program in 'Embodiment: Health -Society Interaction', Department of Public Health Science, Graduate School, Korea University, Seoul, Republic of Korea
| | - Jong-Tae Lee
- BK21PLUS Program in 'Embodiment: Health -Society Interaction', Department of Public Health Science, Graduate School, Korea University, Seoul, Republic of Korea; Department of Environmental Health, Korea University, Seoul, Republic of Korea; School of Health Policy and Management, College of Health Science, Korea University, Seoul, Republic of Korea.
| |
Collapse
|
7
|
Tran H, Kim J, Kim D, Choi M, Choi M. Impact of air pollution on cause-specific mortality in Korea: Results from Bayesian Model Averaging and Principle Component Regression approaches. THE SCIENCE OF THE TOTAL ENVIRONMENT 2018; 636:1020-1031. [PMID: 29729505 DOI: 10.1016/j.scitotenv.2018.04.273] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/08/2018] [Revised: 04/17/2018] [Accepted: 04/20/2018] [Indexed: 06/08/2023]
Abstract
Health effects related to air pollution are a major global concern. Related studies based on reliable exposure assessment methods would potentially enable policy makers to propose appropriate environmental management policies. In this study, integrated Bayesian Model Averaging (BMA) and Principle Component Regression (PCR) were adopted to assess the severity of air pollution impacts on mortality related to circulatory, respiratory and skin diseases in 25 districts of Seoul, South Korea for the years 2005-2015. These methods were consistent in determining the best regression models and most important pollutants related to mortality in those highly susceptible to poor air quality. Specifically, the results demonstrated that pneumonia was highly associated with air pollution, with a large determination coefficient (BMA: 0.46, PCR: 0.51) and high model's posterior probability (0.47). The most reliable prediction model for pneumonia was indicated by the lowest Bayesian Information Criterion. Among the pollutants, particulate matter with an aerodynamic diameter of 10 μm or less (PM10) was associated with serious health risks on evaluation, with the highest posterior inclusion probabilities (range, 80.20 to 100.00%) and significantly positive correlation coefficients (range, 0.14 to 0.34, p < 0.05). In addition, excessive PM10 concentration (approximately 2.54 times the threshold) and a continuous increase in mortality due to respiratory diseases (approximately 1.50-fold in 10 years) were also exhibited. Overall, the results of this study suggest that currently, socio-environmental policies and international collaboration to mitigate health effects of air pollution is necessary in Seoul, Korea. Moreover, consideration of uncertainty of the regression model, which was verified in this research, will facilitate further application of this approach and enable optimal prediction of interactions between human and environmental factors.
Collapse
Affiliation(s)
- Hien Tran
- Graduate School of Water Resources, Sungkyunkwan University, Suwon 440-746, Republic of Korea
| | - Jeongyeong Kim
- Graduate School of Water Resources, Sungkyunkwan University, Suwon 440-746, Republic of Korea
| | - Daeun Kim
- Graduate School of Water Resources, Sungkyunkwan University, Suwon 440-746, Republic of Korea
| | - Minyoung Choi
- Department of Medical Business Administration, Kyunghee University, Republic of Korea
| | - Minha Choi
- Graduate School of Water Resources, Sungkyunkwan University, Suwon 440-746, Republic of Korea.
| |
Collapse
|
8
|
Davalos AD, Luben TJ, Herring AH, Sacks JD. Current approaches used in epidemiologic studies to examine short-term multipollutant air pollution exposures. Ann Epidemiol 2017; 27:145-153.e1. [PMID: 28040377 PMCID: PMC5313327 DOI: 10.1016/j.annepidem.2016.11.016] [Citation(s) in RCA: 49] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2016] [Revised: 11/08/2016] [Accepted: 11/27/2016] [Indexed: 11/17/2022]
Abstract
PURPOSE Air pollution epidemiology traditionally focuses on the relationship between individual air pollutants and health outcomes (e.g., mortality). To account for potential copollutant confounding, individual pollutant associations are often estimated by adjusting or controlling for other pollutants in the mixture. Recently, the need to characterize the relationship between health outcomes and the larger multipollutant mixture has been emphasized in an attempt to better protect public health and inform more sustainable air quality management decisions. METHODS New and innovative statistical methods to examine multipollutant exposures were identified through a broad literature search, with a specific focus on those statistical approaches currently used in epidemiologic studies of short-term exposures to criteria air pollutants (i.e., particulate matter, carbon monoxide, sulfur dioxide, nitrogen dioxide, and ozone). RESULTS Five broad classes of statistical approaches were identified for examining associations between short-term multipollutant exposures and health outcomes, specifically additive main effects, effect measure modification, unsupervised dimension reduction, supervised dimension reduction, and nonparametric methods. These approaches are characterized including advantages and limitations in different epidemiologic scenarios. DISCUSSION By highlighting the characteristics of various studies in which multipollutant statistical methods have been used, this review provides epidemiologists and biostatisticians with a resource to aid in the selection of the most optimal statistical method to use when examining multipollutant exposures.
Collapse
Affiliation(s)
- Angel D Davalos
- Department of Biostatistics, University of North Carolina, Chapel Hill
| | - Thomas J Luben
- National Center for Environmental Assessment, Office of Research and Development, US Environmental Protection Agency, Research Triangle Park, NC
| | - Amy H Herring
- Department of Biostatistics, University of North Carolina, Chapel Hill
| | - Jason D Sacks
- National Center for Environmental Assessment, Office of Research and Development, US Environmental Protection Agency, Research Triangle Park, NC.
| |
Collapse
|
9
|
Banner KM, Higgs MD. Considerations for assessing model averaging of regression coefficients. ECOLOGICAL APPLICATIONS : A PUBLICATION OF THE ECOLOGICAL SOCIETY OF AMERICA 2017; 27:78-93. [PMID: 27874997 DOI: 10.1002/eap.1419] [Citation(s) in RCA: 31] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/21/2016] [Revised: 04/21/2016] [Accepted: 05/31/2016] [Indexed: 05/11/2023]
Abstract
Model choice is usually an inevitable source of uncertainty in model-based statistical analyses. While the focus of model choice was traditionally on methods for choosing a single model, methods to formally account for multiple models within a single analysis are now accessible to many researchers. The specific technique of model averaging was developed to improve predictive ability by combining predictions from a set of models. However, it is now often used to average regression coefficients across multiple models with the ultimate goal of capturing a variable's overall effect. This use of model averaging implicitly assumes the same parameter exists across models so that averaging is sensible. While this assumption may initially seem tenable, regression coefficients associated with particular explanatory variables may not hold equivalent interpretations across all of the models in which they appear, making explanatory inference about covariates challenging. Accessibility to easily implementable software, concerns about being criticized for ignoring model uncertainty, and the chance to avoid having to justify choice of a final model have all led to the increasing popularity of model averaging in practice. We see a gap between the theoretical development of model averaging and its current use in practice, potentially leaving well-intentioned researchers with unclear inferences or difficulties justifying reasons for using (or not using) model averaging. We attempt to narrow this gap by revisiting some relevant foundations of regression modeling, suggesting more explicit notation and graphical tools, and discussing how individual model results are combined to obtain a model averaged result. Our goal is to help researchers make informed decisions about model averaging and to encourage question-focused modeling over method-focused modeling.
Collapse
Affiliation(s)
- Katharine M Banner
- Department of Mathematical Sciences, Montana State University, Wilson Hall 2-214, P.O. Box 172400, Bozeman, Montana, 59717, USA
| | - Megan D Higgs
- Department of Mathematical Sciences, Montana State University, Wilson Hall 2-214, P.O. Box 172400, Bozeman, Montana, 59717, USA
| |
Collapse
|
10
|
Toti G, Vilalta R, Lindner P, Lefer B, Macias C, Price D. Analysis of correlation between pediatric asthma exacerbation and exposure to pollutant mixtures with association rule mining. Artif Intell Med 2016; 74:44-52. [PMID: 27964802 DOI: 10.1016/j.artmed.2016.11.003] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2015] [Revised: 11/22/2016] [Accepted: 11/23/2016] [Indexed: 12/23/2022]
Abstract
OBJECTIVES Traditional studies on effects of outdoor pollution on asthma have been criticized for questionable statistical validity and inefficacy in exploring the effects of multiple air pollutants, alone and in combination. Association rule mining (ARM), a method easily interpretable and suitable for the analysis of the effects of multiple exposures, could be of use, but the traditional interest metrics of support and confidence need to be substituted with metrics that focus on risk variations caused by different exposures. METHODS We present an ARM-based methodology that produces rules associated with relevant odds ratios and limits the number of final rules even at very low support levels (0.5%), thanks to post-pruning criteria that limit rule redundancy and control for statistical significance. The methodology has been applied to a case-crossover study to explore the effects of multiple air pollutants on risk of asthma in pediatric subjects. RESULTS We identified 27 rules with interesting odds ratio among more than 10,000 having the required support. The only rule including only one chemical is exposure to ozone on the previous day of the reported asthma attack (OR=1.14). 26 combinatory rules highlight the limitations of air quality policies based on single pollutant thresholds and suggest that exposure to mixtures of chemicals is more harmful, with odds ratio as high as 1.54 (associated with the combination day0 SO2, day0 NO, day0 NO2, day1 PM). CONCLUSIONS The proposed method can be used to analyze risk variations caused by single and multiple exposures. The method is reliable and requires fewer assumptions on the data than parametric approaches. Rules including more than one pollutant highlight interactions that deserve further investigation, while helping to limit the search field.
Collapse
Affiliation(s)
- Giulia Toti
- Department of Computer Science, University of Houston, Philip Guthrie Hoffman Hall, 3551 Cullen Blvd., Room 501, Houston, TX 77204-3010, USA.
| | - Ricardo Vilalta
- Department of Computer Science, University of Houston, Philip Guthrie Hoffman Hall, 3551 Cullen Blvd., Room 501, Houston, TX 77204-3010, USA
| | - Peggy Lindner
- Honors College, University of Houston, M.D Anderson Library, 4333 University Dr, Room 212, Houston, TX 77204-2001, USA
| | - Barry Lefer
- Department of Earth and Atmospheric Sciences, University of Houston, Science & Research Building 1, 3507 Cullen Blvd, Room 312, Houston, TX 77204-5007, USA; Now at: Earth Sciences Division, NASA Headquarters, 300 E St SW, Washington, DC 20546, USA
| | - Charles Macias
- Department of Pediatrics, Baylor College of Medicine/Texas Children's Hospital, One Baylor Plaza, Houston, TX 77030, USA
| | - Daniel Price
- Honors College, University of Houston, M.D Anderson Library, 4333 University Dr, Room 212, Houston, TX 77204-2001, USA
| |
Collapse
|
11
|
Boehm Vock LF, Reich BJ, Fuentes M, Dominici F. Spatial variable selection methods for investigating acute health effects of fine particulate matter components. Biometrics 2014; 71:167-177. [PMID: 25303336 DOI: 10.1111/biom.12254] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2013] [Revised: 09/01/2014] [Accepted: 09/01/2014] [Indexed: 11/27/2022]
Abstract
Multi-site time series studies have reported evidence of an association between short term exposure to particulate matter (PM) and adverse health effects, but the effect size varies across the United States. Variability in the effect may partially be due to differing community level exposure and health characteristics, but also due to the chemical composition of PM which is known to vary greatly by location and time. The objective of this article is to identify particularly harmful components of this chemical mixture. Because of the large number of highly-correlated components, we must incorporate some regularization into a statistical model. We assume that, at each spatial location, the regression coefficients come from a mixture model with the flavor of stochastic search variable selection, but utilize a copula to share information about variable inclusion and effect magnitude across locations. The model differs from current spatial variable selection techniques by accommodating both local and global variable selection. The model is used to study the association between fine PM (PM <2.5μm) components, measured at 115 counties nationally over the period 2000-2008, and cardiovascular emergency room admissions among Medicare patients.
Collapse
Affiliation(s)
| | - Brian J Reich
- North Carolina State University, Raleigh, North Carolina 27695, U.S.A
| | | | | |
Collapse
|
12
|
Gass K, Klein M, Chang HH, Flanders WD, Strickland MJ. Classification and regression trees for epidemiologic research: an air pollution example. Environ Health 2014; 13:17. [PMID: 24625053 PMCID: PMC3977944 DOI: 10.1186/1476-069x-13-17] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2013] [Accepted: 03/07/2014] [Indexed: 05/18/2023]
Abstract
BACKGROUND Identifying and characterizing how mixtures of exposures are associated with health endpoints is challenging. We demonstrate how classification and regression trees can be used to generate hypotheses regarding joint effects from exposure mixtures. METHODS We illustrate the approach by investigating the joint effects of CO, NO2, O3, and PM2.5 on emergency department visits for pediatric asthma in Atlanta, Georgia. Pollutant concentrations were categorized as quartiles. Days when all pollutants were in the lowest quartile were held out as the referent group (n = 131) and the remaining 3,879 days were used to estimate the regression tree. Pollutants were parameterized as dichotomous variables representing each ordinal split of the quartiles (e.g. comparing CO quartile 1 vs. CO quartiles 2-4) and considered one at a time in a Poisson case-crossover model with control for confounding. The pollutant-split resulting in the smallest P-value was selected as the first split and the dataset was partitioned accordingly. This process repeated for each subset of the data until the P-values for the remaining splits were not below a given alpha, resulting in the formation of a "terminal node". We used the case-crossover model to estimate the adjusted risk ratio for each terminal node compared to the referent group, as well as the likelihood ratio test for the inclusion of the terminal nodes in the final model. RESULTS The largest risk ratio corresponded to days when PM2.5 was in the highest quartile and NO2 was in the lowest two quartiles (RR: 1.10, 95% CI: 1.05, 1.16). A simultaneous Wald test for the inclusion of all terminal nodes in the model was significant, with a chi-square statistic of 34.3 (p = 0.001, with 13 degrees of freedom). CONCLUSIONS Regression trees can be used to hypothesize about joint effects of exposure mixtures and may be particularly useful in the field of air pollution epidemiology for gaining a better understanding of complex multipollutant exposures.
Collapse
Affiliation(s)
- Katherine Gass
- Department of Epidemiology, Rollins School of Public Health, Emory University, 1518 Clifton Rd, Atlanta, GA 30322, USA
| | - Mitch Klein
- Department of Environmental Health, Rollins School of Public Health, Emory University, 1518 Clifton Rd, Atlanta, GA 30322, USA
| | - Howard H Chang
- Department of Biostatistics and Bioinformatics, Rollins School of Public Health, Emory University, 1518 Clifton Rd, Atlanta, GA 30322, USA
| | - W Dana Flanders
- Department of Epidemiology, Rollins School of Public Health, Emory University, 1518 Clifton Rd, Atlanta, GA 30322, USA
| | - Matthew J Strickland
- Department of Environmental Health, Rollins School of Public Health, Emory University, 1518 Clifton Rd, Atlanta, GA 30322, USA
| |
Collapse
|
13
|
Sun Z, Tao Y, Li S, Ferguson KK, Meeker JD, Park SK, Batterman SA, Mukherjee B. Statistical strategies for constructing health risk models with multiple pollutants and their interactions: possible choices and comparisons. Environ Health 2013; 12:85. [PMID: 24093917 PMCID: PMC3857674 DOI: 10.1186/1476-069x-12-85] [Citation(s) in RCA: 99] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2013] [Accepted: 10/02/2013] [Indexed: 05/19/2023]
Abstract
BACKGROUND As public awareness of consequences of environmental exposures has grown, estimating the adverse health effects due to simultaneous exposure to multiple pollutants is an important topic to explore. The challenges of evaluating the health impacts of environmental factors in a multipollutant model include, but are not limited to: identification of the most critical components of the pollutant mixture, examination of potential interaction effects, and attribution of health effects to individual pollutants in the presence of multicollinearity. METHODS In this paper, we reviewed five methods available in the statistical literature that are potentially helpful for constructing multipollutant models. We conducted a simulation study and presented two data examples to assess the performance of these methods on feature selection, effect estimation and interaction identification using both cross-sectional and time-series designs. We also proposed and evaluated a two-step strategy employing an initial screening by a tree-based method followed by further dimension reduction/variable selection by the aforementioned five approaches at the second step. RESULTS Among the five methods, least absolute shrinkage and selection operator regression performs well in general for identifying important exposures, but will yield biased estimates and slightly larger model dimension given many correlated candidate exposures and modest sample size. Bayesian model averaging, and supervised principal component analysis are also useful in variable selection when there is a moderately strong exposure-response association. Substantial improvements on reducing model dimension and identifying important variables have been observed for all the five statistical methods using the two-step modeling strategy when the number of candidate variables is large. CONCLUSIONS There is no uniform dominance of one method across all simulation scenarios and all criteria. The performances differ according to the nature of the response variable, the sample size, the number of pollutants involved, and the strength of exposure-response association/interaction. However, the two-step modeling strategy proposed here is potentially applicable under a multipollutant framework with many covariates by taking advantage of both the screening feature of an initial tree-based method and dimension reduction/variable selection property of the subsequent method. The choice of the method should also depend on the goal of the study: risk prediction, effect estimation or screening for important predictors and their interactions.
Collapse
Affiliation(s)
- Zhichao Sun
- Department of Biostatistics, University of Michigan School of Public Health,
Ann Arbor, MI USA
| | - Yebin Tao
- Department of Biostatistics, University of Michigan School of Public Health,
Ann Arbor, MI USA
| | - Shi Li
- Department of Biostatistics, University of Michigan School of Public Health,
Ann Arbor, MI USA
| | - Kelly K Ferguson
- Department of Environmental Health Sciences, University of Michigan School of
Public Health, Ann Arbor, MI USA
| | - John D Meeker
- Department of Environmental Health Sciences, University of Michigan School of
Public Health, Ann Arbor, MI USA
| | - Sung Kyun Park
- Department of Environmental Health Sciences, University of Michigan School of
Public Health, Ann Arbor, MI USA
| | - Stuart A Batterman
- Department of Environmental Health Sciences, University of Michigan School of
Public Health, Ann Arbor, MI USA
| | - Bhramar Mukherjee
- Department of Biostatistics, University of Michigan School of Public Health,
Ann Arbor, MI USA
| |
Collapse
|
14
|
Billionnet C, Sherrill D, Annesi-Maesano I. Estimating the health effects of exposure to multi-pollutant mixture. Ann Epidemiol 2012; 22:126-41. [PMID: 22226033 DOI: 10.1016/j.annepidem.2011.11.004] [Citation(s) in RCA: 199] [Impact Index Per Article: 16.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2011] [Revised: 11/10/2011] [Accepted: 11/15/2011] [Indexed: 01/08/2023]
Abstract
PURPOSE Air pollution constitutes a major public health concern because of its ubiquity and of its potential health impact. Because individuals are exposed to many air pollutants at once that are highly correlated with each other, there is a need to consider the multi-pollutant exposure phenomenon. The characteristics of multiple pollutants that make statistical analysis of health-related effects of air pollution complex include the high correlation between pollutants prevents the use of standard statistical methods, the potential existence of interaction between pollutants, the common measurement errors, the importance of the number of pollutants to consider, and the potential nonlinear relationship between exposure and health. METHODS We made a review of statistical methods either used in the literature to study the effect of multiple pollutants or identified as potentially applicable to this problem. We reported the results of investigations that applied such methods. RESULTS Eighteen publications have investigated the multi-pollutant effects, 5 on indoor pollution, 10 on outdoor pollution, and 3 on statistical methodology with application on outdoor pollution. Some other publications have only addressed statistical methodology. CONCLUSIONS The use of Hierarchical Bayesian approach, dimension reduction methods, clustering, recursive partitioning, and logic regression are some potential methods described. Methods that provide figures for risk assessments should be put forward in public health decisions.
Collapse
|
15
|
Cox LA. Reassessing the human health benefits from cleaner air. RISK ANALYSIS : AN OFFICIAL PUBLICATION OF THE SOCIETY FOR RISK ANALYSIS 2012; 32:816-829. [PMID: 22050234 DOI: 10.1111/j.1539-6924.2011.01698.x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
Recent proposals to further reduce permitted levels of air pollution emissions are supported by high projected values of resulting public health benefits. For example, the Environmental Protection Agency recently estimated that the 1990 Clean Air Act Amendment (CAAA) will produce human health benefits in 2020, from reduced mortality rates, valued at nearly $2 trillion per year, compared to compliance costs of $65 billion ($0.065 trillion). However, while compliance costs can be measured, health benefits are unproved: they depend on a series of uncertain assumptions. Among these are that additional life expectancy gained by a beneficiary (with median age of about 80 years) should be valued at about $80,000 per month; that there is a 100% probability that a positive, linear, no-threshold, causal relation exists between PM(2.5) concentration and mortality risk; and that progress in medicine and disease prevention will not greatly diminish this relationship. We present an alternative uncertainty analysis that assigns a positive probability of error to each assumption. This discrete uncertainty analysis suggests (with probability >90% under plausible alternative assumptions) that the costs of CAAA exceed its benefits. Thus, instead of suggesting to policymakers that CAAA benefits are almost certainly far larger than its costs, we believe that accuracy requires acknowledging that the costs purchase a relatively uncertain, possibly much smaller, benefit. The difference between these contrasting conclusions is driven by different approaches to uncertainty analysis, that is, excluding or including discrete uncertainties about the main assumptions required for nonzero health benefits to exist at all.
Collapse
|
16
|
Bobb JF, Dominici F, Peng RD. A Bayesian model averaging approach for estimating the relative risk of mortality associated with heat waves in 105 U.S. cities. Biometrics 2011; 67:1605-16. [PMID: 21447046 PMCID: PMC3128186 DOI: 10.1111/j.1541-0420.2011.01583.x] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Estimating the risks heat waves pose to human health is a critical part of assessing the future impact of climate change. In this article, we propose a flexible class of time series models to estimate the relative risk of mortality associated with heat waves and conduct Bayesian model averaging (BMA) to account for the multiplicity of potential models. Applying these methods to data from 105 U.S. cities for the period 1987-2005, we identify those cities having a high posterior probability of increased mortality risk during heat waves, examine the heterogeneity of the posterior distributions of mortality risk across cities, assess sensitivity of the results to the selection of prior distributions, and compare our BMA results to a model selection approach. Our results show that no single model best predicts risk across the majority of cities, and that for some cities heat-wave risk estimation is sensitive to model choice. Although model averaging leads to posterior distributions with increased variance as compared to statistical inference conditional on a model obtained through model selection, we find that the posterior mean of heat wave mortality risk is robust to accounting for model uncertainty over a broad class of models.
Collapse
Affiliation(s)
- Jennifer F Bobb
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland 21205, USA.
| | | | | |
Collapse
|
17
|
Young J. Re: Duc, H., Jalaludin, B. & Morgan, G. (2009). Associations between air pollution and hospital visits for cardiovascular diseases in the elderly in Sydney using Bayesian statistical methods. AUST NZ J STAT 2011. [DOI: 10.1111/j.1467-842x.2011.00620.x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
18
|
Louis TA. Discussion of "conundrums with uncertainty factors". RISK ANALYSIS : AN OFFICIAL PUBLICATION OF THE SOCIETY FOR RISK ANALYSIS 2010; 30:346-353. [PMID: 20487392 DOI: 10.1111/j.1539-6924.2010.01365.x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2023]
|
19
|
Roberts S, Martin MA. Bootstrap-after-bootstrap model averaging for reducing model uncertainty in model selection for air pollution mortality studies. ENVIRONMENTAL HEALTH PERSPECTIVES 2010; 118:131-6. [PMID: 20056588 PMCID: PMC2831957 DOI: 10.1289/ehp.0901007] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/20/2009] [Accepted: 09/17/2009] [Indexed: 05/08/2023]
Abstract
BACKGROUND Concerns have been raised about findings of associations between particulate matter (PM) air pollution and mortality that have been based on a single "best" model arising from a model selection procedure, because such a strategy may ignore model uncertainty inherently involved in searching through a set of candidate models to find the best model. Model averaging has been proposed as a method of allowing for model uncertainty in this context. OBJECTIVES To propose an extension (double BOOT) to a previously described bootstrap model-averaging procedure (BOOT) for use in time series studies of the association between PM and mortality. We compared double BOOT and BOOT with Bayesian model averaging (BMA) and a standard method of model selection [standard Akaike's information criterion (AIC)]. METHOD Actual time series data from the United States are used to conduct a simulation study to compare and contrast the performance of double BOOT, BOOT, BMA, and standard AIC. RESULTS Double BOOT produced estimates of the effect of PM on mortality that have had smaller root mean squared error than did those produced by BOOT, BMA, and standard AIC. This performance boost resulted from estimates produced by double BOOT having smaller variance than those produced by BOOT and BMA. CONCLUSIONS Double BOOT is a viable alternative to BOOT and BMA for producing estimates of the mortality effect of PM.
Collapse
Affiliation(s)
- Steven Roberts
- School of Finance and Applied Statistics, College of Business and Economics, Australian National University, Australian Capital Territory, Australia.
| | | |
Collapse
|
20
|
Haucke F, Brückner U. First approaches to the monetary impact of environmental health disturbances in Germany. Health Policy 2010; 94:34-44. [DOI: 10.1016/j.healthpol.2009.07.013] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2009] [Revised: 07/27/2009] [Accepted: 07/27/2009] [Indexed: 10/20/2022]
|
21
|
Crainiceanu CM, Dominici F, Parmigiani G. Adjustment uncertainty in effect estimation. Biometrika 2008. [DOI: 10.1093/biomet/asn015] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
|
22
|
Abstract
In air pollution epidemiology, improvements in statistical analysis tools can help improve signal-to-noise ratios, and untangle large correlations between exposures and confounders. For this reason, we welcome a novel model-selection approach that helps to identify the time-windows of exposure to pollutants that produces adverse health effects. However, there are concerns about approaches that select a model based on a given data set, and then estimate health effects in the same data. This can create problems when (1) the sample size is small in relation to the magnitude of the health effects; and (2) candidate predictors are highly correlated and likely to have similar effects. Bayesian Model Averaging has been advocated as a way to estimate health effects that accounts for model uncertainty. However, implementations where posterior model probabilities are approximated using BIC, as well as other default choices, may not reflect the ability of each model to provide an estimate of the health effect that is properly adjusted for confounding. Air pollution studies need to focus on estimating health effects while accounting for the uncertainty in the adjustment for confounding factors. This is true especially when model choice and estimation are performed on the same data. The development of appropriate statistical tools remains an open area of investigation.
Collapse
|
23
|
Bateson TF, Coull BA, Hubbell B, Ito K, Jerrett M, Lumley T, Thomas D, Vedal S, Ross M. Panel discussion review: session three--issues involved in interpretation of epidemiologic analyses--statistical modeling. JOURNAL OF EXPOSURE SCIENCE & ENVIRONMENTAL EPIDEMIOLOGY 2007; 17 Suppl 2:S90-6. [PMID: 18079770 DOI: 10.1038/sj.jes.7500631] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/12/2007] [Accepted: 09/12/2007] [Indexed: 05/20/2023]
Abstract
The Clean Air Act mandates that the US Environmental Protection Agency (EPA) develop National Ambient Air Quality Standards for criteria air pollutants and conduct periodic reviews of the standards based on new scientific evidence. In recent reviews, evidence from epidemiologic studies has played a key role. Epidemiologic studies often provide evidence for effects of several air pollutants. Determining whether there are independent effects of the separate pollutants is a challenge. Among the many issues confronting the interpretation of epidemiologic studies of multi-pollutant exposures and health effects are those specifically related to statistical modeling. The EPA convened a workshop on 13 and 14 December 2006 in Chapel Hill, North Carolina, USA, to discuss these and other issues; Session Three of the workshop was devoted specifically to statistical modeling. Prominent statistical modeling issues in epidemiologic studies of air pollution include (1) measurement error across the co-pollutants; (2) correlation and multi-collinearity among the co-pollutants; (3) the timing of the concentration-response function; (4) confounding; and (5) spatial analyses.
Collapse
Affiliation(s)
- Thomas F Bateson
- National Center for Environmental Assessment, US Environmental Protection Agency, Washington, District of Columbia 20460, USA.
| | | | | | | | | | | | | | | | | |
Collapse
|
24
|
Thomas DC. Viewpoint: using gene-environment interactions to dissect the effects of complex mixtures. JOURNAL OF EXPOSURE SCIENCE & ENVIRONMENTAL EPIDEMIOLOGY 2007; 17 Suppl 2:S71-S74. [PMID: 18079767 DOI: 10.1038/sj.jes.7500630] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/12/2007] [Accepted: 09/12/2007] [Indexed: 05/25/2023]
Abstract
Teasing out the health effects of constituents of complex mixtures poses formidable statistical challenges owing to the problem of multicollinearity. While statistical devices such as regression on principal components, model selection, and model averaging offer some approaches to this problem, incorporation of external information is likely to be more helpful. I explore a general hierarchical modeling framework that would allow such information as sources, genetic interactions, and toxicology to be included in the higher levels of the model.
Collapse
Affiliation(s)
- Duncan C Thomas
- Department of Preventive Medicine, University of Southern California, 1540 Alcazar Street, CHP-220, Los Angeles, California 90089-9011, USA.
| |
Collapse
|