1
|
Feng Y, Wei Y, Coull BA, Schwartz JD. Measurement error correction for ambient PM 2.5 exposure using stratified regression calibration: Effects on all-cause mortality. Environ Res 2023; 216:114792. [PMID: 36375508 PMCID: PMC9729458 DOI: 10.1016/j.envres.2022.114792] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/21/2022] [Revised: 11/01/2022] [Accepted: 11/10/2022] [Indexed: 06/16/2023]
Abstract
BACKGROUND Previous studies on the impact of measurement error for PM2.5 were mostly simulation studies, did not control for other pollutants, or used a single regression calibration model to correct for measurement error. However, the relationship between actual and error-prone PM2.5 concentration may vary by time and region. We aim to correct the measurement error of PM2.5 predictions using stratified regression calibration and investigate how the measurement error biases the association between PM2.5 and mortality in the Medicare Cohort. METHODS The "gold-standard" measurements of PM2.5 were defined as daily monitoring data. We regressed daily monitoring PM2.5 on modeled PM2.5 using the simple linear regression by strata of season, elevation, census division and time period. Calibrated PM2.5 was calculated with stratum-specific calibration parameters β0 (intercept) and β1 (slope) for each strata and aggregated to annual level. Associations between calibrated and error-prone annual PM2.5 and all-cause mortality among Medicare beneficiaries were estimated with Quasi-Poisson regression models. RESULTS Across 208 strata, the median of β0 and β1 were 0.62 (25% 0.0.20, 75% 1.06) and 0.93 (25% 0.87, 75% 0.99). From calibrated and error-prone PM2.5 data, we estimated that each 10 μg/m3 increase in PM2.5 was respectively associated with 4.9% (95%CI 4.6-5.2) and 4.6% (95%CI 4.4-4.9) increases in the mortality rate among Medicare beneficiaries, conditional on confounders. CONCLUSIONS Regression calibration parameters of PM2.5 varied by time and region. Using error-prone measures of PM2.5 underestimated the association between PM2.5 and all-cause mortality. Modern exposure models produce relatively small bias.
Collapse
Affiliation(s)
- Yijing Feng
- Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA.
| | - Yaguang Wei
- Department of Environmental Health, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Brent A Coull
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Joel D Schwartz
- Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA; Department of Environmental Health, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| |
Collapse
|
2
|
Nab L, Groenwold RHH. Sensitivity analysis for random measurement error using regression calibration and simulation-extrapolation. Glob Epidemiol 2021; 3:100067. [PMID: 37635717 PMCID: PMC10446124 DOI: 10.1016/j.gloepi.2021.100067] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2021] [Revised: 11/17/2021] [Accepted: 11/18/2021] [Indexed: 12/27/2022] Open
Abstract
Objective Sensitivity analysis for random measurement error can be applied in the absence of validation data by means of regression calibration and simulation-extrapolation. These have not been compared for this purpose. Study design and setting A simulation study was conducted comparing the performance of regression calibration and simulation-extrapolation for linear and logistic regression. The performance of the two methods was evaluated in terms of bias, mean squared error (MSE) and confidence interval coverage, for various values of reliability of the error-prone measurement (0.05-0.91), sample size (125-4000), number of replicates (2-10), and R-squared (0.03-0.75). It was assumed that no validation data were available about the error-free measures, while correct information about the measurement error variance was available. Results Regression calibration was unbiased while simulation-extrapolation was biased: median bias was 0.8% (interquartile range (IQR): -0.6;1.7%), and -19.0% (IQR: -46.4;-12.4%), respectively. A small gain in efficiency was observed for simulation-extrapolation (median MSE: 0.005, IQR: 0.004;0.006) versus regression calibration (median MSE: 0.006, IQR: 0.005;0.009). Confidence interval coverage was at the nominal level of 95% for regression calibration, and smaller than 95% for simulation-extrapolation (median coverage: 85%, IQR: 73;93%). The application of regression calibration and simulation-extrapolation for a sensitivity analysis was illustrated using an example of blood pressure and kidney function. Conclusion Our results support the use of regression calibration over simulation-extrapolation for sensitivity analysis for random measurement error.
Collapse
Affiliation(s)
- Linda Nab
- Department of Clinical Epidemiology, Leiden University Medical Center, Leiden, the Netherlands
| | - Rolf H H Groenwold
- Department of Clinical Epidemiology, Leiden University Medical Center, Leiden, the Netherlands
- Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, the Netherlands
| |
Collapse
|
3
|
Nab L, van Smeden M, Keogh RH, Groenwold RHH. Mecor: An R package for measurement error correction in linear regression models with a continuous outcome. Comput Methods Programs Biomed 2021; 208:106238. [PMID: 34311414 DOI: 10.1016/j.cmpb.2021.106238] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/05/2021] [Accepted: 06/06/2021] [Indexed: 06/13/2023]
Abstract
Measurement error in a covariate or the outcome of regression models is common, but is often ignored, even though measurement error can lead to substantial bias in the estimated covariate-outcome association. While several texts on measurement error correction methods are available, these methods remain seldomly applied. To improve the use of measurement error correction methodology, we developed mecor, an R package that implements measurement error correction methods for regression models with a continuous outcome. Measurement error correction requires information about the measurement error model and its parameters. This information can be obtained from four types of studies, used to estimate the parameters of the measurement error model: an internal validation study, a replicates study, a calibration study and an external validation study. In the package mecor, regression calibration methods and a maximum likelihood method are implemented to correct for measurement error in a continuous covariate in regression analyses. Additionally, methods of moments methods are implemented to correct for measurement error in the continuous outcome in regression analyses. Variance estimation of the corrected estimators is provided in closed form and using the bootstrap.
Collapse
Affiliation(s)
- Linda Nab
- Department of Clinical Epidemiology, Leiden University Medical Center, Leiden, Netherlands.
| | - Maarten van Smeden
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht, Netherlands
| | - Ruth H Keogh
- Department of Medical Statistics, London School of Hygiene and Tropical Medicine, London, United Kingdom
| | - Rolf H H Groenwold
- Department of Clinical Epidemiology, Leiden University Medical Center, Leiden, Netherlands; Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, Netherlands
| |
Collapse
|
4
|
Chen S, Lin X. Analysis in case-control sequencing association studies with different sequencing depths. Biostatistics 2021; 21:577-593. [PMID: 30590456 PMCID: PMC7308042 DOI: 10.1093/biostatistics/kxy073] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2018] [Revised: 10/17/2018] [Accepted: 10/21/2018] [Indexed: 01/09/2023] Open
Abstract
With the advent of next-generation sequencing, investigators have access to higher quality sequencing data. However, to sequence all samples in a study using next generation sequencing can still be prohibitively expensive. One potential remedy could be to combine next generation sequencing data from cases with publicly available sequencing data for controls, but there could be a systematic difference in quality of sequenced data, such as sequencing depths, between sequenced study cases and publicly available controls. We propose a regression calibration (RC)-based method and a maximum-likelihood method for conducting an association study with such a combined sample by accounting for differential sequencing errors between cases and controls. The methods allow for adjusting for covariates, such as population stratification as confounders. Both methods control type I error and have comparable power to analysis conducted using the true genotype with sufficiently high but different sequencing depths. We show that the RC method allows for analysis using naive variance estimate (closely approximates true variance in practice) and standard software under certain circumstances. We evaluate the performance of the proposed methods using simulation studies and apply our methods to a combined data set of exome sequenced acute lung injury cases and healthy controls from the 1000 Genomes project.
Collapse
Affiliation(s)
- Sixing Chen
- Department of Biostatistics, Harvard TH Chan School of Public Health, 655 Huntington Avenue, Building 2, 4th Floor, Boston, MA 02115, USA
| | - Xihong Lin
- Department of Biostatistics, Harvard TH Chan School of Public Health, 655 Huntington Avenue, Building 2, 4th Floor, Boston, MA 02115, USA.,Department of Statistics, Harvard University, One Oxford Street, Suite 400, Cambridge, MA 02138-2901, USA
| |
Collapse
|
5
|
Bassett JK, Swain CTV, Hodge AM, Mahmood S, Csizmadi I, Owen N, Dunstan DW, Lynch BM. Calibration of the Active Australia questionnaire and application to a logistic regression model. J Sci Med Sport 2020; 24:474-480. [PMID: 33281094 DOI: 10.1016/j.jsams.2020.11.007] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2020] [Revised: 10/09/2020] [Accepted: 11/10/2020] [Indexed: 11/24/2022]
Abstract
OBJECTIVES To estimate the extent of measurement error in the Active Australia questionnaire, and to examine the impact of measurement error on the association of moderate-vigorous physical activity (MVPA) with obesity. DESIGN Accelerometer Validation Study, cross-sectional; data from the third wave of a prospective cohort (Australian Diabetes, Obesity, and Lifestyle (AusDiab) Study)). METHODS Self-reported physical activity data were obtained from 4005 participants of the third wave of the AusDiab study via the Active Australia questionnaire. Accelerometer-derived physical activity data were obtained from a subsample of 670 participants. Validity coefficients and attenuation factors were estimated from a measurement error model. A regression calibration method was applied to a logistic regression model examining the association between self-reported MVPA and obesity to adjust observed odds ratios (OR) for measurement error. RESULTS The validity coefficient was 0.35 (0.28, 0.43) and the attenuation factor was 0.16 (0.13, 0.20) in models adjusted for age and sex. The uncorrected OR for obesity for 210min/week of MVPA (50th percentile) relative to 80min/week (25th percentile) was 0.87 (0.85, 0.90). The attenuation factor was used to adjust this OR for measurement error, giving a corrected OR of 0.43 (0.32, 0.55). CONCLUSIONS Substantial measurement error (relative to accelerometry) was evident in the Active Australia questionnaire, leading to attenuation of the association of MVPA with obesity. A regression-calibration method can be used to adjust risk estimates for associations between self-reported MVPA and health-related outcomes for measurement error specific to self-report. These corrected risk estimates reflect associations that would be expected if MVPA were measured by accelerometry.
Collapse
Affiliation(s)
- Julie K Bassett
- Cancer Epidemiology Division, Cancer Council Victoria, Australia.
| | | | - Allison M Hodge
- Cancer Epidemiology Division, Cancer Council Victoria, Australia; Melbourne School of Population and Global Health, The University of Melbourne, Australia
| | - Shahid Mahmood
- Melbourne School of Population and Global Health, The University of Melbourne, Australia
| | - Ilona Csizmadi
- Department of Surgery, Cedars-Sinai Medical Center, USA; Department of Community Health Sciences, University of Calgary, Canada
| | - Neville Owen
- Behavioural Epidemiology Laboratory, Baker Heart and Diabetes Institute, Australia; Centre for Urban Transitions, Swinburne University of Technology, Australia; The University of Queensland, School of Public Health, Australia; Department of Medicine, Monash University, Australia
| | - David W Dunstan
- The University of Queensland, School of Public Health, Australia; Department of Medicine, Monash University, Australia; Physical Activity Laboratory, Baker Heart and Diabetes Institute, Australia; Institute of Physical Activity and Nutrition Research, School of Exercise and Nutrition Sciences, Deakin University, Australia; Mary MacKillop Institute of Health Research, Australian Catholic University, Australia; School of Sport Science, Exercise and Health, The University of Western Australia, Australia
| | - Brigid M Lynch
- Cancer Epidemiology Division, Cancer Council Victoria, Australia; Melbourne School of Population and Global Health, The University of Melbourne, Australia; Physical Activity Laboratory, Baker Heart and Diabetes Institute, Australia
| |
Collapse
|
6
|
van Smeden M, Penning de Vries BBL, Nab L, Groenwold RHH. Approaches to addressing missing values, measurement error, and confounding in epidemiologic studies. J Clin Epidemiol 2021; 131:89-100. [PMID: 33176189 DOI: 10.1016/j.jclinepi.2020.11.006] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2020] [Revised: 10/24/2020] [Accepted: 11/04/2020] [Indexed: 01/13/2023]
Abstract
OBJECTIVES Epidemiologic studies often suffer from incomplete data, measurement error (or misclassification), and confounding. Each of these can cause bias and imprecision in estimates of exposure-outcome relations. We describe and compare statistical approaches that aim to control all three sources of bias simultaneously. STUDY DESIGN AND SETTING We illustrate four statistical approaches that address all three sources of bias, namely, multiple imputation for missing data and measurement error, multiple imputation combined with regression calibration, full information maximum likelihood within a structural equation modeling framework, and a Bayesian model. In a simulation study, we assess the performance of the four approaches compared with more commonly used approaches that do not account for measurement error, missing values, or confounding. RESULTS The results demonstrate that the four approaches consistently outperform the alternative approaches on all performance metrics (bias, mean squared error, and confidence interval coverage). Even in simulated data of 100 subjects, these approaches perform well. CONCLUSION There can be a large benefit of addressing measurement error, missing values, and confounding to improve the estimation of exposure-outcome relations, even when the available sample size is relatively small.
Collapse
|
7
|
Follmann DA, Dodd L. Immune correlates analysis using vaccinees from test negative designs. Biostatistics 2020; 23:507-521. [PMID: 32968765 DOI: 10.1093/biostatistics/kxaa037] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2020] [Revised: 08/10/2020] [Accepted: 08/15/2020] [Indexed: 11/14/2022] Open
Abstract
Determining the effect of vaccine-induced immune response on disease risk is an important goal of vaccinology. Typically, immune correlates analyses are conducted prospectively with immune response measured shortly after vaccination and subsequent disease status regressed on immune response. In outbreaks and rare disease settings, collecting samples from all vaccinees is not feasible. The test negative design is a retrospective design used to measure vaccine efficacy where symptomatic individuals who present at a clinic are assessed for relevant disease (cases) or some other disease (controls) and vaccination status ascertained. This article proposes that test negative vaccinees have immune response to vaccine assessed both for relevant (e.g., Ebola) and irrelevant (e.g., vector) proteins. If the latter immune response is unaffected by active (Ebola) infection, and is correlated with the relevant immune response, it can serve as a proxy for the immune response of interest proximal to infection. We show that logistic regression using imputed immune response as the covariate and case disease as outcome can estimate the prospective immune response slope and detail the assumptions needed for unbiased inference. The method is evaluated by simulation under various scenarios including constant and decaying immune response. A simulated dataset motivated by ring vaccination for an ongoing Ebola outbreak is analyzed.
Collapse
Affiliation(s)
- Dean A Follmann
- Biostatistics Research Branch, National Institute of Allergy and Infectious Diseases, Bethesda MD
| | - Lori Dodd
- Biostatistics Research Branch, National Institute of Allergy and Infectious Diseases, Bethesda MD
| |
Collapse
|
8
|
Strand M, Rabinovitch N. Health effects of concurrent ambient and tobacco smoke-derived particle exposures at low concentrations in children with asthma. J Expo Sci Environ Epidemiol 2020; 30:785-794. [PMID: 32071391 DOI: 10.1038/s41370-020-0201-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/10/2019] [Revised: 11/01/2019] [Accepted: 12/23/2019] [Indexed: 06/10/2023]
Abstract
Exposure to particulate matter less than 2.5 microns from either ambient pollution (AMB-PM2.5) or secondhand smoke (SHS-PM2.5) have been associated with asthma worsening, but there is little information on effects and relative potency with concurrent exposures. We studied health effects of concurrent exposures to AMB-PM2.5 and SHS-PM2.5 over a 6-year period in schoolchildren with asthma. Regression calibration with instrumental variables (RCIV) was utilized to estimate effects of personal exposure to low-level SHS and AMB-PM2.5 on daily albuterol usage and urinary leukotriene E4 (uLTE4; a biomarker of asthma-related inflammation) using urine cotinine and concentrations from fixed and personal pollution monitors. Each IQR increase in SHS-PM2.5 exposure was associated with a 6.7% increase (95% CI: 1.0-12.8%) in uLTE4 on the same day and 9.4% increase (95% CI: -2.6 to 22.7%) in albuterol use the next day, when children were co-exposed to mean levels of AMB-PM2.5. The dose-response relationship between health outcomes and one pollutant was higher at lower levels of the other pollutant. For example, at lower levels of predicted SHS-PM2.5 exposure, increases in health outcomes per IQR increase in AMB-PM2.5 ranged between 2 and 5%, but were negligible at higher SHS-PM2.5 levels. Comparing at equivalent co-exposure levels, SHS-PM2.5 was 1.6 times more potent than AMB-PM2.5 for uLTE4 (95% CI: 1.1-2.3); estimates for albuterol usage were similar but less significant. Effects at mean co-exposure levels were closer [SHS to AMB-PM2.5 potency ratio = 1.2 (95% CI: 0.9-1.5) for uLTE4 and 1.2 (95% CI: 0.7-1.9) for albuterol usage]. In summary, concurrent exposure to relatively low levels of SHS and AMB-PM2.5 were associated with health outcomes in asthmatic schoolchildren. Dose responses varied with changes in the relative amounts of each pollutant; SHS-PM2.5 was observed to be more potent than AMB-PM2.5 when co-exposure levels were equivalent.
Collapse
Affiliation(s)
- Matthew Strand
- Division of Biostatistics, National Jewish Health, Denver, CO, USA.
| | | |
Collapse
|
9
|
Agier L, Slama R, Basagaña X. Relying on repeated biospecimens to reduce the effects of classical-type exposure measurement error in studies linking the exposome to health. Environ Res 2020; 186:109492. [PMID: 32330767 DOI: 10.1016/j.envres.2020.109492] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/15/2019] [Revised: 03/11/2020] [Accepted: 04/04/2020] [Indexed: 05/27/2023]
Abstract
The exposome calls for assessing numerous exposures, typically using biomarkers with varying amounts of measurement error, which can be assumed to be of classical type. We evaluated the impact of classical-type measurement error on the performance of exposome-health studies, and the efficiency of two measurement error correction methods relying on the collection of repeated biospecimens: within-subject biospecimens pooling and regression calibration. In a simulation study, we generated 237 exposures from a realistic correlation matrix, with various amounts of classical-type measurement error, and a continuous health outcome linearly influenced by exposures. Measurement error decreased the sensitivity to identify exposures influencing health from a value of 75% down to 46%, increased false discovery proportion from 26% to 49% and increased attenuation bias in the slope of true predictors from 45% to 66%. Assuming that repeated biospecimens were available, within-subject pooling and regression calibration improved sensitivity (which increased to 63%), false discovery proportion (down to 37%) and bias (down to 49%) compared to an error-prone study with a single biospecimen per subject. Performances were poorer for the exposures with the largest amount of measurement error, and increased with the number of available biospecimens. Relying on repeated biospecimens only for the exposures with the largest amount of measurement error provided similar performance improvement. Exposome studies relying on spot exposure biospecimens suffer from decreased performances if some biomarkers suffer from measurement error due to their temporal variability; performances can be improved by collecting repeated biospecimens per subject, in particular for non persistent chemicals.
Collapse
Affiliation(s)
- Lydiane Agier
- Team of Environmental Epidemiology Applied to Reproduction and Respiratory Health, Inserm, CNRS, University Grenoble Alpes, Institute for Advanced Biosciences (IAB), U1209 Joint Research Center, Grenoble, France.
| | - Rémy Slama
- Team of Environmental Epidemiology Applied to Reproduction and Respiratory Health, Inserm, CNRS, University Grenoble Alpes, Institute for Advanced Biosciences (IAB), U1209 Joint Research Center, Grenoble, France.
| | - Xavier Basagaña
- ISGlobal, Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain; CIBER Epidemiología y Salud Pública (CIBERESP), Spain
| |
Collapse
|
10
|
Looman M, Boshuizen HC, Feskens EJ, Geelen A. Using enhanced regression calibration to combine dietary intake estimates from 24 h recall and FFQ reduces bias in diet-disease associations. Public Health Nutr 2019; 22:2738-46. [PMID: 31262375 DOI: 10.1017/S1368980019001563] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
OBJECTIVE To illustrate the impact of combining 24 h recall (24hR) and FFQ estimates using regression calibration (RC) and enhanced regression calibration (ERC) on diet-disease associations. SETTING Wageningen area, the Netherlands, 2011-2013. DESIGN Five approaches for obtaining self-reported dietary intake estimates of protein and K were compared: (i) uncorrected FFQ intakes (FFQ); (ii) uncorrected average of two 24hR ( $\overline {\rm R}$ ); (iii) average of FFQ and $\overline {\rm R}$ ( ${\overline {\rm F}}\,\overline {\rm R}}$ ); (iv) RC from regression of 24hR v. FFQ; and (v) ERC by adding individual random effects to the RC approach. Empirical attenuation factors (AF) were derived by regression of urinary biomarker measurements v. the resulting intake estimates. PARTICIPANTS Data of 236 individuals collected within the National Dietary Assessment Reference Database. RESULTS Both FFQ and 24hR dietary intake estimates were measured with substantial error. Using statistical techniques to correct for measurement error (i.e. RC and ERC) reduced bias in diet-disease associations as indicated by their AF approaching 1 (RC 1·14, ERC 0·95 for protein; RC 1·28, ERC 1·34 for K). The larger sd and narrower 95% CI of AF obtained with ERC compared with RC indicated that using ERC has more power than using RC. However, the difference in AF between RC and ERC was not statistically significant, indicating no significantly better de-attenuation by using ERC compared with RC. AF larger than 1, observed for the ERC for K, indicated possible overcorrection. CONCLUSIONS Our study highlights the potential of combining FFQ and 24hR data. Using RC and ERC resulted in less biased associations for protein and K.
Collapse
|
11
|
Wu Y, Hoffman FO, Apostoaei AI, Kwon D, Thomas BA, Glass R, Zablotska LB. Methods to account for uncertainties in exposure assessment in studies of environmental exposures. Environ Health 2019; 18:31. [PMID: 30961632 PMCID: PMC6454753 DOI: 10.1186/s12940-019-0468-4] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2018] [Accepted: 03/20/2019] [Indexed: 06/09/2023]
Abstract
BACKGROUND Accurate exposure estimation in environmental epidemiological studies is crucial for health risk assessment. Failure to account for uncertainties in exposure estimation could lead to biased results in exposure-response analyses. Assessment of the effects of uncertainties in exposure estimation on risk estimates received a lot of attention in radiation epidemiology and in several studies of diet and air pollution. The objective of this narrative review is to examine the commonly used statistical approaches to account for exposure estimation errors in risk analyses and to suggest how each could be applied in environmental epidemiological studies. MAIN TEXT We review two main error types in estimating exposures in epidemiological studies: shared and unshared errors and their subtypes. We describe the four main statistical approaches to adjust for exposure estimation uncertainties (regression calibration, simulation-extrapolation, Monte Carlo maximum likelihood and Bayesian model averaging) along with examples to give readers better understanding of their advantages and limitations. We also explain the advantages of using a 2-dimensional Monte-Carlo (2DMC) simulation method to quantify the effect of uncertainties in exposure estimates using full-likelihood methods. For exposures that are estimated independently between subjects and are more likely to introduce unshared errors, regression calibration and SIMEX methods are able to adequately account for exposure uncertainties in risk analyses. When an uncalibrated measuring device is used or estimation parameters with uncertain mean values are applied to a group of people, shared errors could potentially be large. In this case, Monte Carlo maximum likelihood and Bayesian model averaging methods based on estimates of exposure from the 2DMC simulations would work well. The majority of reviewed studies show relatively moderate changes (within 100%) in risk estimates after accounting for uncertainties in exposure estimates, except for the two studies which doubled/tripled naïve estimates. CONCLUSIONS In this paper, we demonstrate various statistical methods to account for uncertain exposure estimates in risk analyses. The differences in the results of various adjustment methods could be due to various error structures in datasets and whether or not a proper statistical method was applied. Epidemiological studies of environmental exposures should include exposure-response analyses accounting for uncertainties in exposure estimates.
Collapse
Affiliation(s)
- You Wu
- Department of Epidemiology and Biostatistics, University of California, San Francisco, 550 16th Street, 2nd floor, Box 0560, San Francisco, CA 94143 USA
- Center for Design and Analysis, Amgen, Inc., 1 Amgen Center Dr., Thousand Oaks, CA 91320 USA
| | - F. Owen Hoffman
- Oak Ridge Center for Risk Analysis, Inc., 102 Donner Drive, Oak Ridge, TN USA
| | - A. Iulian Apostoaei
- Oak Ridge Center for Risk Analysis, Inc., 102 Donner Drive, Oak Ridge, TN USA
| | - Deukwoo Kwon
- Sylvester Comprehensive Cancer Center, University of Miami, 1475 NW 12th Avenue, Miami, FL USA
| | - Brian A. Thomas
- Oak Ridge Center for Risk Analysis, Inc., 102 Donner Drive, Oak Ridge, TN USA
| | - Racquel Glass
- Department of Epidemiology and Biostatistics, University of California, San Francisco, 550 16th Street, 2nd floor, Box 0560, San Francisco, CA 94143 USA
| | - Lydia B. Zablotska
- Department of Epidemiology and Biostatistics, University of California, San Francisco, 550 16th Street, 2nd floor, Box 0560, San Francisco, CA 94143 USA
| |
Collapse
|
12
|
Lester F, Arbuckle TE, Peng Y, McIsaac MA. Impact of exposure to phenols during early pregnancy on birth weight in two Canadian cohort studies subject to measurement errors. Environ Int 2018; 120:231-237. [PMID: 30103122] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Subscribe] [Scholar Register] [Received: 02/17/2018] [Revised: 08/02/2018] [Accepted: 08/02/2018] [Indexed: 06/08/2023]
Abstract
BACKGROUND It is of interest to know whether early pregnancy exposure to phenols such as bisphenol-A (BPA) or triclosan (TCS) negatively impacts birth weight outcomes. Exposure to these chemicals is widespread in the Canadian population but obtaining accurate measurements of average exposure is difficult because these chemicals are rapidly excreted from the body, causing body levels to fluctuate both within and between days, as observed in a recent Canadian study (P4). This measurement error can attenuate the estimated effects of exposures. METHODS Data from two Canadian cohort studies, the Plastics and Personal-care Products use in Pregnancy (P4) Study and the Maternal-Infant Research on Environmental Chemicals (MIREC) Study, such that all participants with complete BPA or TCS exposure and outcome data were used (MIREC n = 1822, P4 n = 68). We used regression calibration to correct for the attenuating effects of exposure measurement error when modeling the effect of first trimester BPA or TCS exposure on four birth weight outcomes: birth weight (BW), low birth weight (LBW), small for gestational age (SGA) and large for gestational age (LGA). Specific gravity, time of day, and time since last urine void were also controlled in the analysis. RESULTS TCS exposure has a marginally significant association with SGA only with odds ratio 0.87 and 95% confidence interval (0.74, 1.00). It also has a marginally significant association with LGA in male offspring with odds ratio 1.11 and 95% confidence interval (1.00, 1.25). The effects of BPA on the four birth outcomes were insignificant. CONCLUSIONS Increased TCS exposure during pregnancy is marginally associated with decreased odds of having SGA offspring. It is possibly associated with decreased BW in males and decreased odds of LBW, though these associations were not present in measurement error corrected models. TCS is possibly associated with increased odds in male offspring of being LGA, though this relationship was not present in models not corrected for measurement error. The study finds no significant effects of BPA on birth weight outcomes, which may be due to more severe measurement error in a single observation of BPA.
Collapse
Affiliation(s)
- Fiona Lester
- Department of Mathematics and Statistics, Queen's University, Kingston, Ontario, Canada
| | - Tye E Arbuckle
- Health Canada, Population Studies Division, Environmental Health Science and Research Bureau, Health Canada, 101 Tunney's Pasture Drive, Tunney's Pasture, Ottawa, Ontario, Canada
| | - Yingwei Peng
- Department of Public Health Sciences, Queen's University, Kingston, Ontario, Canada; Department of Mathematics and Statistics, Queen's University, Kingston, Ontario, Canada.
| | - Michael A McIsaac
- School of Mathematical and Computational Sciences, University of Prince Edward Island, Charlottetown, PEI, Canada
| |
Collapse
|
13
|
Padoan A, Basso D, Zambon CF, Prayer-Galetti T, Arrigoni G, Bozzato D, Moz S, Zattoni F, Bellocco R, Plebani M. MALDI-TOF peptidomic analysis of serum and post-prostatic massage urine specimens to identify prostate cancer biomarkers. Clin Proteomics 2018; 15:23. [PMID: 30065622 PMCID: PMC6060548 DOI: 10.1186/s12014-018-9199-8] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2017] [Accepted: 07/16/2018] [Indexed: 12/25/2022] Open
Abstract
Background Lower urinary tract symptoms (LUTS) and prostate specific antigen-based parameters seem to have only a limited utility for the differential diagnosis of prostate cancer (PCa). MALDI-TOF/MS peptidomic profiling could be a useful diagnostic tool for biomarker discovery, although reproducibility issues have limited its applicability until now. The current study aimed to evaluate a new MALDI-TOF/MS candidate biomarker. Methods Within- and between-subject variability of MALDI-TOF/MS-based peptidomic urine and serum analyses were evaluated in 20 and 15 healthy donors, respectively. Normalizations and approaches for accounting below limit of detection (LOD) values were utilized to enhance reproducibility, while Monte Carlo experiments were performed to verify whether measurement error can be dealt with LOD data. Post-prostatic massage urine and serum samples from 148 LUTS patients were analysed using MALDI-TOF/MS. Regression-calibration and simulation and extrapolation methods were used to derive the unbiased association between peptidomic features and PCa. Results Although the median normalized peptidomic variability was 24.9%, the within- and between-subject variability showed that median normalization, LOD adjustment, and log2 data transformation were the best combination in terms of reliability; in measurement error conditions, intraclass correlation coefficient was a reliable estimate when the LOD/2 was substituted for below LOD values. In the patients studied, 43 peptides were shared by the urine and serum, and several features were found to be associated with PCa. Only few serum features, however, show statistical significance after the multiple testing procedures were completed. Two serum fragmentation patterns corresponded to the complement C4-A. Conclusions MALDI-TOF/MS serum peptidome profiling was more efficacious with respect to post-prostatic massage urine analysis in discriminating PCa. Electronic supplementary material The online version of this article (10.1186/s12014-018-9199-8) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Andrea Padoan
- 1Department of Medicine-DIMED, University of Padova, Via Giustiniani 2, 35128 Padua, Italy
| | - Daniela Basso
- 1Department of Medicine-DIMED, University of Padova, Via Giustiniani 2, 35128 Padua, Italy
| | | | - Tommaso Prayer-Galetti
- 3Department of Surgical, Oncological and Gastroenterological Sciences, University of Padova, Padua, Italy
| | - Giorgio Arrigoni
- 2Department of Biomedical Sciences, University of Padova, Padua, Italy.,4Proteomic Center, University of Padova, Padua, Italy
| | - Dania Bozzato
- 1Department of Medicine-DIMED, University of Padova, Via Giustiniani 2, 35128 Padua, Italy
| | - Stefania Moz
- 1Department of Medicine-DIMED, University of Padova, Via Giustiniani 2, 35128 Padua, Italy
| | - Filiberto Zattoni
- 3Department of Surgical, Oncological and Gastroenterological Sciences, University of Padova, Padua, Italy
| | - Rino Bellocco
- 5Department of Statistics and Quantitative Methods, University of Milano-Bicocca, Milan, Italy.,6Department of Medical Epidemiology and Biostatistics (MEB), Karolinska Institute, Stockholm, Sweden
| | - Mario Plebani
- 1Department of Medicine-DIMED, University of Padova, Via Giustiniani 2, 35128 Padua, Italy
| |
Collapse
|
14
|
Abstract
PURPOSE OF REVIEW Outdoor air pollution exposures used in epidemiological studies are commonly predicted from spatiotemporal models incorporating limited measurements, temporal factors, geographic information system variables, and/or satellite data. Measurement error in these exposure estimates leads to imprecise estimation of health effects and their standard errors. We reviewed methods for measurement error correction that have been applied in epidemiological studies that use model-derived air pollution data. RECENT FINDINGS We identified seven cohort studies and one panel study that have employed measurement error correction methods. These methods included regression calibration, risk set regression calibration, regression calibration with instrumental variables, the simulation extrapolation approach (SIMEX), and methods under the non-parametric or parameter bootstrap. Corrections resulted in small increases in the absolute magnitude of the health effect estimate and its standard error under most scenarios. Limited application of measurement error correction methods in air pollution studies may be attributed to the absence of exposure validation data and the methodological complexity of the proposed methods. Future epidemiological studies should consider in their design phase the requirements for the measurement error correction method to be later applied, while methodological advances are needed under the multi-pollutants setting.
Collapse
Affiliation(s)
- Evangelia Samoli
- Department of Hygiene, Epidemiology and Medical Statistics, Medical School, National and Kapodistrian University of Athens, 75 Mikras Asias Str, 115 27, Athens, Greece.
| | - Barbara K Butland
- Population Health Research Institute and MRC-PHE Centre for Environment and Health, St George's, University of London, London, UK
| |
Collapse
|
15
|
Agogo GO. A zero-augmented generalized gamma regression calibration to adjust for covariate measurement error: A case of an episodically consumed dietary intake. Biom J 2016; 59:94-109. [PMID: 27704599 DOI: 10.1002/bimj.201600043] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2016] [Revised: 05/27/2016] [Accepted: 07/21/2016] [Indexed: 11/11/2022]
Abstract
Measurement error in exposure variables is a serious impediment in epidemiological studies that relate exposures to health outcomes. In nutritional studies, interest could be in the association between long-term dietary intake and disease occurrence. Long-term intake is usually assessed with food frequency questionnaire (FFQ), which is prone to recall bias. Measurement error in FFQ-reported intakes leads to bias in parameter estimate that quantifies the association. To adjust for bias in the association, a calibration study is required to obtain unbiased intake measurements using a short-term instrument such as 24-hour recall (24HR). The 24HR intakes are used as response in regression calibration to adjust for bias in the association. For foods not consumed daily, 24HR-reported intakes are usually characterized by excess zeroes, right skewness, and heteroscedasticity posing serious challenge in regression calibration modeling. We proposed a zero-augmented calibration model to adjust for measurement error in reported intake, while handling excess zeroes, skewness, and heteroscedasticity simultaneously without transforming 24HR intake values. We compared the proposed calibration method with the standard method and with methods that ignore measurement error by estimating long-term intake with 24HR and FFQ-reported intakes. The comparison was done in real and simulated datasets. With the 24HR, the mean increase in mercury level per ounce fish intake was about 0.4; with the FFQ intake, the increase was about 1.2. With both calibration methods, the mean increase was about 2.0. Similar trend was observed in the simulation study. In conclusion, the proposed calibration method performs at least as good as the standard method.
Collapse
Affiliation(s)
- George O Agogo
- Department of Internal Medicine, Yale University, 300 George St, Suite 775, New Haven, CT, 06511, USA
| |
Collapse
|
16
|
Huque MH, Bondell HD, Carroll RJ, Ryan LM. Spatial regression with covariate measurement error: A semiparametric approach. Biometrics 2016; 72:678-86. [PMID: 26788930 PMCID: PMC4956600 DOI: 10.1111/biom.12474] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2015] [Revised: 11/01/2015] [Accepted: 11/01/2015] [Indexed: 11/26/2022]
Abstract
Spatial data have become increasingly common in epidemiology and public health research thanks to advances in GIS (Geographic Information Systems) technology. In health research, for example, it is common for epidemiologists to incorporate geographically indexed data into their studies. In practice, however, the spatially defined covariates are often measured with error. Naive estimators of regression coefficients are attenuated if measurement error is ignored. Moreover, the classical measurement error theory is inapplicable in the context of spatial modeling because of the presence of spatial correlation among the observations. We propose a semiparametric regression approach to obtain bias-corrected estimates of regression parameters and derive their large sample properties. We evaluate the performance of the proposed method through simulation studies and illustrate using data on Ischemic Heart Disease (IHD). Both simulation and practical application demonstrate that the proposed method can be effective in practice.
Collapse
Affiliation(s)
- Md Hamidul Huque
- School of Mathematical and Physical Sciences, University of Technology Sydney, Australia, 15 Broadway, Ultimo, NSW, 2007, Australia.
| | - Howard D Bondell
- Department of Statistics, North Carolina State University, 2311 Stinson Drive, Campus Box 8203, Raleigh, NC 27695-8203, USA
| | - Raymond J Carroll
- Department of Statistics, 447 Blocker Building, Texas A&M University College Station, TX 77843-3143, USA
| | - Louise M Ryan
- School of Mathematical and Physical Sciences, University of Technology Sydney, Australia, 15 Broadway, Ultimo, NSW, 2007, Australia
| |
Collapse
|
17
|
Yan Y, Yi GY. Analysis of error-prone survival data under additive hazards models: measurement error effects and adjustments. Lifetime Data Anal 2016; 22:321-342. [PMID: 26328545 DOI: 10.1007/s10985-015-9340-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/19/2014] [Accepted: 08/17/2015] [Indexed: 06/05/2023]
Abstract
Covariate measurement error occurs commonly in survival analysis. Under the proportional hazards model, measurement error effects have been well studied, and various inference methods have been developed to correct for error effects under such a model. In contrast, error-contaminated survival data under the additive hazards model have received relatively less attention. In this paper, we investigate this problem by exploring measurement error effects on parameter estimation and the change of the hazard function. New insights of measurement error effects are revealed, as opposed to well-documented results for the Cox proportional hazards model. We propose a class of bias correction estimators that embraces certain existing estimators as special cases. In addition, we exploit the regression calibration method to reduce measurement error effects. Theoretical results for the developed methods are established, and numerical assessments are conducted to illustrate the finite sample performance of our methods.
Collapse
Affiliation(s)
- Ying Yan
- Department of Statistics and Actuarial Science, University of Waterloo, 200 University Avenue West, Waterloo, ON, N2L 3G1, Canada
| | - Grace Y Yi
- Department of Statistics and Actuarial Science, University of Waterloo, 200 University Avenue West, Waterloo, ON, N2L 3G1, Canada.
| |
Collapse
|
18
|
Agogo GO, van der Voet H, Van't Veer P, van Eeuwijk FA, Boshuizen HC. Evaluation of a two-part regression calibration to adjust for dietary exposure measurement error in the Cox proportional hazards model: A simulation study. Biom J 2016; 58:766-82. [PMID: 27003183 DOI: 10.1002/bimj.201500009] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2015] [Revised: 11/09/2015] [Accepted: 11/14/2015] [Indexed: 11/09/2022]
Abstract
Dietary questionnaires are prone to measurement error, which bias the perceived association between dietary intake and risk of disease. Short-term measurements are required to adjust for the bias in the association. For foods that are not consumed daily, the short-term measurements are often characterized by excess zeroes. Via a simulation study, the performance of a two-part calibration model that was developed for a single-replicate study design was assessed by mimicking leafy vegetable intake reports from the multicenter European Prospective Investigation into Cancer and Nutrition (EPIC) study. In part I of the fitted two-part calibration model, a logistic distribution was assumed; in part II, a gamma distribution was assumed. The model was assessed with respect to the magnitude of the correlation between the consumption probability and the consumed amount (hereafter, cross-part correlation), the number and form of covariates in the calibration model, the percentage of zero response values, and the magnitude of the measurement error in the dietary intake. From the simulation study results, transforming the dietary variable in the regression calibration to an appropriate scale was found to be the most important factor for the model performance. Reducing the number of covariates in the model could be beneficial, but was not critical in large-sample studies. The performance was remarkably robust when fitting a one-part rather than a two-part model. The model performance was minimally affected by the cross-part correlation.
Collapse
Affiliation(s)
- George O Agogo
- Biometris, Wageningen University and Research Centre, Postbus 16, 6700 AA, Wageningen, The Netherlands.,National Institute for Public Health and the Environment, Postbus 1, 3720 BA Bilthoven, The Netherlands
| | - Hilko van der Voet
- Biometris, Wageningen University and Research Centre, Postbus 16, 6700 AA, Wageningen, The Netherlands
| | - Pieter Van't Veer
- Division of Human Nutrition, Wageningen University, Postbus 8129, 6700 EV, Wageningen, The Netherlands
| | - Fred A van Eeuwijk
- Biometris, Wageningen University and Research Centre, Postbus 16, 6700 AA, Wageningen, The Netherlands
| | - Hendriek C Boshuizen
- Biometris, Wageningen University and Research Centre, Postbus 16, 6700 AA, Wageningen, The Netherlands.,National Institute for Public Health and the Environment, Postbus 1, 3720 BA Bilthoven, The Netherlands.,Division of Human Nutrition, Wageningen University, Postbus 8129, 6700 EV, Wageningen, The Netherlands
| |
Collapse
|
19
|
Lee SM, Hwang WH, de Dieu Tapsoba J. Estimation in closed capture-recapture models when covariates are missing at random. Biometrics 2016; 72:1294-1304. [PMID: 26909877 DOI: 10.1111/biom.12498] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2015] [Revised: 01/01/2016] [Accepted: 01/01/2016] [Indexed: 11/29/2022]
Abstract
Individual covariates are commonly used in capture-recapture models as they can provide important information for population size estimation. However, in practice, one or more covariates may be missing at random for some individuals, which can lead to unreliable inference if records with missing data are treated as missing completely at random. We show that, in general, such a naive complete-case analysis in closed capture-recapture models with some covariates missing at random underestimates the population size. We develop methods for estimating regression parameters and population size using regression calibration, inverse probability weighting, and multiple imputation without any distributional assumptions about the covariates. We show that the inverse probability weighting and multiple imputation approaches are asymptotically equivalent. We present a simulation study to investigate the effects of missing covariates and to evaluate the performance of the proposed methods. We also illustrate an analysis using data on the bird species yellow-bellied prinia collected in Hong Kong.
Collapse
Affiliation(s)
- Shen-Ming Lee
- Department of Statistics, Feng Chia University, Taichung City, Taiwan
| | - Wen-Han Hwang
- Institute of Statistics, National Chung Hsing University, Taichung City, Taiwan
| | - Jean de Dieu Tapsoba
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, U.S.A
| |
Collapse
|
20
|
Masiuk SV, Shklyar SV, Kukush AG, Carroll RJ, Kovgan LN, Likhtarov IA. Estimation of radiation risk in presence of classical additive and Berkson multiplicative errors in exposure doses. Biostatistics 2016; 17:422-36. [PMID: 26795191 DOI: 10.1093/biostatistics/kxv052] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2015] [Accepted: 12/02/2015] [Indexed: 11/14/2022] Open
Abstract
In this paper, the influence of measurement errors in exposure doses in a regression model with binary response is studied. Recently, it has been recognized that uncertainty in exposure dose is characterized by errors of two types: classical additive errors and Berkson multiplicative errors. The combination of classical additive and Berkson multiplicative errors has not been considered in the literature previously. In a simulation study based on data from radio-epidemiological research of thyroid cancer in Ukraine caused by the Chornobyl accident, it is shown that ignoring measurement errors in doses leads to overestimation of background prevalence and underestimation of excess relative risk. In the work, several methods to reduce these biases are proposed. They are new regression calibration, an additive version of efficient SIMEX, and novel corrected score methods.
Collapse
Affiliation(s)
- S V Masiuk
- State Institution "National Research Center for Radiation Medicine of the National Academy of Medical Sciences of Ukraine", Melnykova str., 53, Kyiv, 04050, Ukraine; Ukrainian Radiation Protection Institute, Melnykova str., 53, Kyiv, 04050, Ukraine
| | - S V Shklyar
- Taras Shevchenko National University of Kyiv, Volodymyrska Str. 64, Kyiv 01601, Ukraine
| | - A G Kukush
- Taras Shevchenko National University of Kyiv, Volodymyrska Str. 64, Kyiv 01601, Ukraine
| | - R J Carroll
- Texas A&M University, College Station, TX 77843, USA
| | - L N Kovgan
- State Institution "National Research Center for Radiation Medicine of the National Academy of Medical Sciences of Ukraine", Melnykova str., 53, Kyiv, 04050, Ukraine; Ukrainian Radiation Protection Institute, Melnykova str., 53, Kyiv, 04050, Ukraine
| | - I A Likhtarov
- State Institution "National Research Center for Radiation Medicine of the National Academy of Medical Sciences of Ukraine", Melnykova str., 53, Kyiv, 04050, Ukraine; Ukrainian Radiation Protection Institute, Melnykova str., 53, Kyiv, 04050, Ukraine
| |
Collapse
|
21
|
Kipnis V, Freedman LS, Carroll RJ, Midthune D. A bivariate measurement error model for semicontinuous and continuous variables: Application to nutritional epidemiology. Biometrics 2015; 72:106-15. [PMID: 26332011 DOI: 10.1111/biom.12377] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2014] [Revised: 06/01/2015] [Accepted: 06/01/2015] [Indexed: 11/27/2022]
Abstract
Semicontinuous data in the form of a mixture of a large portion of zero values and continuously distributed positive values frequently arise in many areas of biostatistics. This article is motivated by the analysis of relationships between disease outcomes and intakes of episodically consumed dietary components. An important aspect of studies in nutritional epidemiology is that true diet is unobservable and commonly evaluated by food frequency questionnaires with substantial measurement error. Following the regression calibration approach for measurement error correction, unknown individual intakes in the risk model are replaced by their conditional expectations given mismeasured intakes and other model covariates. Those regression calibration predictors are estimated using short-term unbiased reference measurements in a calibration substudy. Since dietary intakes are often "energy-adjusted," e.g., by using ratios of the intake of interest to total energy intake, the correct estimation of the regression calibration predictor for each energy-adjusted episodically consumed dietary component requires modeling short-term reference measurements of the component (a semicontinuous variable), and energy (a continuous variable) simultaneously in a bivariate model. In this article, we develop such a bivariate model, together with its application to regression calibration. We illustrate the new methodology using data from the NIH-AARP Diet and Health Study (Schatzkin et al., 2001, American Journal of Epidemiology 154, 1119-1125), and also evaluate its performance in a simulation study.
Collapse
Affiliation(s)
- Victor Kipnis
- Biometry Research Group, Division of Cancer Prevention, National Cancer Institute, Bethesda, Maryland
| | - Laurence S Freedman
- Information Management Services, Inc., Rockville, Maryland and Gertner Institute for Epidemiology and Health Policy Research, Sheba Medical Center, Tel Hashomer, Israel
| | - Raymond J Carroll
- Department of Statistics, Texas A&M University, College Station, Texas
| | - Douglas Midthune
- Biometry Research Group, Division of Cancer Prevention, National Cancer Institute, Bethesda, Maryland
| |
Collapse
|
22
|
Abstract
Covariate measurement imprecision or errors arise frequently in many areas. It is well known that ignoring such errors can substantially degrade the quality of inference or even yield erroneous results. Although in practice both covariates subject to measurement error and covariates subject to misclassification can occur, research attention in the literature has mainly focused on addressing either one of these problems separately. To fill this gap, we develop estimation and inference methods that accommodate both characteristics simultaneously. Specifically, we consider measurement error and misclassification in generalized linear models under the scenario that an external validation study is available, and systematically develop a number of effective functional and structural methods. Our methods can be applied to different situations to meet various objectives.
Collapse
Affiliation(s)
- Grace Y Yi
- Department of Statistics and Actuarial Science, University of Waterloo, Waterloo, Ontario, Canada N2L 3G1
| | - Yanyuan Ma
- Department of Statistics, Texas A&M University, TAMU 3143, College Station, TX 77843-3143,
| | - Donna Spiegelman
- Departments of Epidemiology and Biostatistics, Harvard School of Public Health, 677 Huntington Ave, Boston, MA 02115,
| | - Raymond J Carroll
- Department of Statistics, Texas A&M University, TAMU 3143, College Station, TX 77843-3143, and School of Mathematical Sciences, University of Technology, Sydney, Broadway NSW 2007,
| |
Collapse
|
23
|
Siddig AA, Ellison AM, Jackson S. Calibrating abundance indices with population size estimators of red back salamanders (Plethodon cinereus) in a New England forest. PeerJ 2015; 3:e952. [PMID: 26020008 PMCID: PMC4435476 DOI: 10.7717/peerj.952] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2015] [Accepted: 04/21/2015] [Indexed: 11/20/2022] Open
Abstract
Herpetologists and conservation biologists frequently use convenient and cost-effective, but less accurate, abundance indices (e.g., number of individuals collected under artificial cover boards or during natural objects surveys) in lieu of more accurate, but costly and destructive, population size estimators to detect and monitor size, state, and trends of amphibian populations. Although there are advantages and disadvantages to each approach, reliable use of abundance indices requires that they be calibrated with accurate population estimators. Such calibrations, however, are rare. The red back salamander, Plethodon cinereus, is an ecologically useful indicator species of forest dynamics, and accurate calibration of indices of salamander abundance could increase the reliability of abundance indices used in monitoring programs. We calibrated abundance indices derived from surveys of P. cinereus under artificial cover boards or natural objects with a more accurate estimator of their population size in a New England forest. Average densities/m2 and capture probabilities of P. cinereus under natural objects or cover boards in independent, replicate sites at the Harvard Forest (Petersham, Massachusetts, USA) were similar in stands dominated by Tsuga canadensis (eastern hemlock) and deciduous hardwood species (predominantly Quercus rubra [red oak] and Acer rubrum [red maple]). The abundance index based on salamanders surveyed under natural objects was significantly associated with density estimates of P. cinereus derived from depletion (removal) surveys, but underestimated true density by 50%. In contrast, the abundance index based on cover-board surveys overestimated true density by a factor of 8 and the association between the cover-board index and the density estimates was not statistically significant. We conclude that when calibrated and used appropriately, some abundance indices may provide cost-effective and reliable measures of P. cinereus abundance that could be used in conservation assessments and long-term monitoring at Harvard Forest and other northeastern USA forests.
Collapse
Affiliation(s)
- Ahmed A Siddig
- Department of Environmental Conservation, University of Massachusetts Amherst , Amherst, MA , USA ; Harvard University, Harvard Forest , Petersham, MA , USA
| | | | - Scott Jackson
- Department of Environmental Conservation, University of Massachusetts Amherst , Amherst, MA , USA
| |
Collapse
|
24
|
Zhao S, Prentice RL. Covariate measurement error correction methods in mediation analysis with failure time data. Biometrics 2014; 70:835-44. [PMID: 25139469 PMCID: PMC4276494 DOI: 10.1111/biom.12205] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2013] [Revised: 04/01/2014] [Accepted: 05/01/2014] [Indexed: 11/29/2022]
Abstract
Mediation analysis is important for understanding the mechanisms whereby one variable causes changes in another. Measurement error could obscure the ability of the potential mediator to explain such changes. This article focuses on developing correction methods for measurement error in the mediator with failure time outcomes. We consider a broad definition of measurement error, including technical error, and error associated with temporal variation. The underlying model with the "true" mediator is assumed to be of the Cox proportional hazards model form. The induced hazard ratio for the observed mediator no longer has a simple form independent of the baseline hazard function, due to the conditioning event. We propose a mean-variance regression calibration approach and a follow-up time regression calibration approach, to approximate the partial likelihood for the induced hazard function. Both methods demonstrate value in assessing mediation effects in simulation studies. These methods are generalized to multiple biomarkers and to both case-cohort and nested case-control sampling designs. We apply these correction methods to the Women's Health Initiative hormone therapy trials to understand the mediation effect of several serum sex hormone measures on the relationship between postmenopausal hormone therapy and breast cancer risk.
Collapse
Affiliation(s)
- Shanshan Zhao
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, U.S.A
| | - Ross L. Prentice
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, U.S.A
| |
Collapse
|
25
|
Thomas L, Stefanski LA, Davidian M. Moment Adjusted Imputation for Multivariate Measurement Error Data with Applications to Logistic Regression. Comput Stat Data Anal 2013; 67:15-24. [PMID: 24072947 DOI: 10.1016/j.csda.2013.04.017] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Abstract
In clinical studies, covariates are often measured with error due to biological fluctuations, device error and other sources. Summary statistics and regression models that are based on mismeasured data will differ from the corresponding analysis based on the "true" covariate. Statistical analysis can be adjusted for measurement error, however various methods exhibit a tradeo between convenience and performance. Moment Adjusted Imputation (MAI) is method for measurement error in a scalar latent variable that is easy to implement and performs well in a variety of settings. In practice, multiple covariates may be similarly influenced by biological fluctuastions, inducing correlated multivariate measurement error. The extension of MAI to the setting of multivariate latent variables involves unique challenges. Alternative strategies are described, including a computationally feasible option that is shown to perform well.
Collapse
Affiliation(s)
- Laine Thomas
- Department of Biostatistics and Bioinformatics, Duke University, Durham, North Carolina 27705, U.S.A
| | | | | |
Collapse
|