1
|
Cheng C, Spiegelman D, Li F. Mediation analysis in the presence of continuous exposure measurement error. Stat Med 2023; 42:1669-1686. [PMID: 36869626 PMCID: PMC11320713 DOI: 10.1002/sim.9693] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2022] [Revised: 01/06/2023] [Accepted: 02/16/2023] [Indexed: 03/05/2023]
Abstract
The difference method is used in mediation analysis to quantify the extent to which a mediator explains the mechanisms underlying the pathway between an exposure and an outcome. In many health science studies, the exposures are almost never measured without error, which can result in biased effect estimates. This article investigates methods for mediation analysis when a continuous exposure is mismeasured. Under a linear exposure measurement error model, we prove that the bias of indirect effect and mediation proportion can go in either direction but the mediation proportion is usually be less biased when the associations between the exposure and its error-prone counterpart are similar with and without adjustment for the mediator. We further propose methods to adjust for exposure measurement error with continuous and binary outcomes. The proposed approaches require a main study/validation study design where in the validation study, data are available for characterizing the relationship between the true exposure and its error-prone counterpart. The proposed approaches are then applied to the Health Professional Follow-up Study, 1986-2016, to investigate the impact of body mass index (BMI) as a mediator for mediating the effect of physical activity on the risk of cardiovascular diseases. Our results reveal that physical activity is significantly associated with a lower risk of cardiovascular disease incidence, and approximately half of the total effect of physical activity is mediated by BMI after accounting for exposure measurement error. Extensive simulation studies are conducted to demonstrate the validity and efficiency of the proposed approaches in finite samples.
Collapse
Affiliation(s)
- Chao Cheng
- Department of Biostatistics, Yale School of Public Health, New Haven, Connecticut, USA
- Center for Methods in Implementation and Prevention Science, Yale School of Public Health, New Haven, Connecticut, USA
| | - Donna Spiegelman
- Department of Biostatistics, Yale School of Public Health, New Haven, Connecticut, USA
- Center for Methods in Implementation and Prevention Science, Yale School of Public Health, New Haven, Connecticut, USA
| | - Fan Li
- Department of Biostatistics, Yale School of Public Health, New Haven, Connecticut, USA
- Center for Methods in Implementation and Prevention Science, Yale School of Public Health, New Haven, Connecticut, USA
| |
Collapse
|
2
|
Yan Y, Ren M. Consistent inverse probability of treatment weighted estimation of the average treatment effect with mismeasured time-dependent confounders. Stat Med 2023; 42:517-535. [PMID: 36513267 DOI: 10.1002/sim.9629] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2021] [Revised: 11/17/2022] [Accepted: 12/02/2022] [Indexed: 12/15/2022]
Abstract
In longitudinal studies, the inverse probability of treatment weighted (IPTW) method is commonly employed to estimate the effect of time-dependent treatments on an outcome of interest. However, it has been documented that when the confounders are subject to measurement error, the naive IPTW method which simply ignores measurement error leads to biased treatment effect estimation. In the existing literature, there is a lack of measurement error correction methods that fully remove measurement error effect and produce consistent treatment effect estimation. In this article, we develop a novel consistent IPTW estimation procedure for longitudinal studies. The key step of the proposed method is to use the observed data to construct a corrected function that is unbiased of the unknown IPTW function. Simulation studies reveal that the proposed method outperforms the existing consistent and approximate measurement error correction methods for IPTW estimation of the average treatment effect. Finally, we apply the proposed method to analyze a real dataset.
Collapse
Affiliation(s)
- Ying Yan
- School of Mathematics, Sun Yat-sen University, Guangzhou, China
| | - Mingchen Ren
- Department of Mathematics and Statistics, University of Calgary, Calgary, Alberta, Canada
| |
Collapse
|
3
|
Chen LP, Yi GY. Analysis of noisy survival data with graphical proportional hazards measurement error models. Biometrics 2020; 77:956-969. [PMID: 32687216 DOI: 10.1111/biom.13331] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2019] [Revised: 04/15/2020] [Accepted: 07/09/2020] [Indexed: 01/07/2023]
Abstract
In survival data analysis, the Cox proportional hazards (PH) model is perhaps the most widely used model to feature the dependence of survival times on covariates. While many inference methods have been developed under such a model or its variants, those models are not adequate for handling data with complex structured covariates. High-dimensional survival data often entail several features: (1) many covariates are inactive in explaining the survival information, (2) active covariates are associated in a network structure, and (3) some covariates are error-contaminated. To hand such kinds of survival data, we propose graphical PH measurement error models and develop inferential procedures for the parameters of interest. Our proposed models significantly enlarge the scope of the usual Cox PH model and have great flexibility in characterizing survival data. Theoretical results are established to justify the proposed methods. Numerical studies are conducted to assess the performance of the proposed methods.
Collapse
Affiliation(s)
- Li-Pang Chen
- Department of Statistical and Actuarial Sciences, University of Western Ontario, London, Ontario, Canada
| | - Grace Y Yi
- Department of Statistical and Actuarial Sciences, University of Western Ontario, London, Ontario, Canada.,Department of Computer Science, University of Western Ontario, London, Ontario, Canada
| |
Collapse
|
4
|
Semiparametric methods for left-truncated and right-censored survival data with covariate measurement error. ANN I STAT MATH 2020. [DOI: 10.1007/s10463-020-00755-2] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
5
|
Prentice RL, Aragaki AK, Van Horn L, Thomson CA, Beresford SAA, Robinson J, Snetselaar L, Anderson GL, Manson JE, Allison MA, Rossouw JE, Howard BV. Low-fat dietary pattern and cardiovascular disease: results from the Women's Health Initiative randomized controlled trial. Am J Clin Nutr 2017; 106:35-43. [PMID: 28515068 PMCID: PMC5486201 DOI: 10.3945/ajcn.117.153270] [Citation(s) in RCA: 60] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2017] [Accepted: 04/14/2017] [Indexed: 11/14/2022] Open
Abstract
Background: The influence of a low-fat dietary pattern on the cardiovascular health of postmenopausal women continues to be of public health interest.Objective: This report evaluates low-fat dietary pattern influences on cardiovascular disease (CVD) incidence and mortality during the intervention and postintervention phases of the Women's Health Initiative Dietary Modification Trial.Design: Participants comprised 48,835 postmenopausal women aged 50-79 y; 40% were randomly assigned to a low-fat dietary pattern intervention (target of 20% of energy from fat), and 60% were randomly assigned to a usual diet comparison group. The 8.3-y intervention period ended in March 2005, after which >80% of surviving participants consented to additional active follow-up through September 2010; all participants were followed for mortality through 2013. Breast and colorectal cancer were the primary trial outcomes, and coronary heart disease (CHD) and overall CVD were additional designated outcomes.Results: Incidence rates for CHD and total CVD did not differ between the intervention and comparison groups in either the intervention or postintervention period. However, CHD HRs comparing these groups varied strongly with baseline CVD and hypertension status. Participants without prior CVD had an intervention period CHD HR of 0.70 (95% CI: 0.56, 0.87) or 1.04 (95% CI: 0.90, 1.19) if they were normotensive or hypertensive, respectively (P-interaction = 0.003). The CHD benefit among healthy normotensive women was partially offset by an increase in ischemic stroke risk. Corresponding HRs in the postintervention period were close to null. Participants with CVD at baseline (3.4%) had CHD HRs of 1.47 (95% CI: 1.12, 1.93) and 1.61 (95% CI: 1.02, 2.55) in the intervention and postintervention periods, respectively. However, various lines of evidence suggest that results in women with CVD or hypertension at baseline are confounded by postrandomization use of cholesterol-lowering medications.Conclusions: CVD risk in postmenopausal women appears to be sensitive to a change to a low-fat dietary pattern and, among healthy women, includes both CHD benefit and stroke risk. This trial was registered at clinicaltrials.gov as NCT00000611.
Collapse
Affiliation(s)
- Ross L Prentice
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA;
| | - Aaron K Aragaki
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA
| | - Linda Van Horn
- Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL
| | - Cynthia A Thomson
- Health Promotion Sciences, Mel and Enid Zuckerman College of Public Health, University of Arizona, Tucson, AZ
| | - Shirley AA Beresford
- Department of Epidemiology, School of Public Health, University of Washington, Seattle, WA
| | | | | | - Garnet L Anderson
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA
| | - JoAnn E Manson
- Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA
| | - Matthew A Allison
- Department of Family Medicine and Public Health, University of California, San Diego, La Jolla, CA
| | | | - Barbara V Howard
- MedStar Health Research Institute and Georgetown/Howard Universities Center for Clinical and Translational Research, Washington, DC
| |
Collapse
|
6
|
Liao X, Zhou X, Wang M, Hart JE, Laden F, Spiegelman D. Survival analysis with functions of mismeasured covariate histories: the case of chronic air pollution exposure in relation to mortality in the nurses' health study. J R Stat Soc Ser C Appl Stat 2017; 67:307-327. [PMID: 29430064 DOI: 10.1111/rssc.12229] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
Environmental epidemiologists are often interested in estimating the effect of functions of time-varying exposure histories, such as the 12-month moving average, in relation to chronic disease incidence or mortality. The individual exposure measurements that comprise such an exposure history are usually mis-measured, at least moderately, and, often, more substantially. To obtain unbiased estimates of Cox model hazard ratios for these complex mis-measured exposure functions, an extended risk set regression calibration method for Cox models is developed and applied to a study of long-term exposure to the fine particulate matter (PM2.5) component of air pollution in relation to all-cause mortality in the Nurses' Health Study. Simulation studies under several realistic assumptions about the measurement error model and about the correlation structure of the repeated exposure measurements were conducted to assess the finite sample properties of this new method, and found that the method has good performance in terms of finite sample bias reduction and nominal confidence interval coverage.
Collapse
Affiliation(s)
- Xiaomei Liao
- Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, MA, 02115, USA.,Department of Epidemiology, Harvard T. H. Chan School of Public Health, Boston, MA, 02115, USA
| | - Xin Zhou
- Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, MA, 02115, USA.,Department of Epidemiology, Harvard T. H. Chan School of Public Health, Boston, MA, 02115, USA
| | - Molin Wang
- Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, MA, 02115, USA.,Department of Epidemiology, Harvard T. H. Chan School of Public Health, Boston, MA, 02115, USA.,Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, 02115, USA
| | - Jaime E Hart
- Department of Environmental Health, Harvard T. H. Chan School of Public Health, Boston, MA, 02215, USA.,Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, 02115, USA
| | - Francine Laden
- Department of Epidemiology, Harvard T. H. Chan School of Public Health, Boston, MA, 02115, USA.,Department of Environmental Health, Harvard T. H. Chan School of Public Health, Boston, MA, 02215, USA.,Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, 02115, USA
| | - Donna Spiegelman
- Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, MA, 02115, USA.,Department of Epidemiology, Harvard T. H. Chan School of Public Health, Boston, MA, 02115, USA.,Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, 02115, USA.,Department of Nutrition, Harvard T. H. Chan School of Public Health, Boston, MA, 02115, USA.,Department of Global Health and Population, Harvard T. H. Chan School of Public Health, Boston, MA, 02115, USA
| |
Collapse
|
7
|
Zhang H, Zheng Y, Zhang Z, Gao T, Joyce B, Yoon G, Zhang W, Schwartz J, Just A, Colicino E, Vokonas P, Zhao L, Lv J, Baccarelli A, Hou L, Liu L. Estimating and testing high-dimensional mediation effects in epigenetic studies. Bioinformatics 2016; 32:3150-3154. [PMID: 27357171 DOI: 10.1093/bioinformatics/btw351] [Citation(s) in RCA: 95] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2015] [Accepted: 05/24/2016] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION High-dimensional DNA methylation markers may mediate pathways linking environmental exposures with health outcomes. However, there is a lack of analytical methods to identify significant mediators for high-dimensional mediation analysis. RESULTS Based on sure independent screening and minimax concave penalty techniques, we use a joint significance test for mediation effect. We demonstrate its practical performance using Monte Carlo simulation studies and apply this method to investigate the extent to which DNA methylation markers mediate the causal pathway from smoking to reduced lung function in the Normative Aging Study. We identify 2 CpGs with significant mediation effects. AVAILABILITY AND IMPLEMENTATION R package, source code, and simulation study are available at https://github.com/YinanZheng/HIMA CONTACT: lei.liu@northwestern.edu.
Collapse
Affiliation(s)
- Haixiang Zhang
- Center for Applied Mathematics, Tianjin University, Tianjin 300072, China
| | | | | | - Tao Gao
- Department of Preventive Medicine
| | | | - Grace Yoon
- Department of Statistics, Northwestern University, Chicago, IL 60611, USA
| | | | - Joel Schwartz
- Department of Environmental Health, Harvard University, Boston, MA 02115, USA
| | - Allan Just
- Department of Preventive Medicine, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Elena Colicino
- Department of Environmental Health, Harvard University, Boston, MA 02115, USA
| | - Pantel Vokonas
- Veterans Affairs Boston Healthcare System and Boston University School of Medicine, VA Normative Aging Study, Boston, MA 02118, USA
| | | | - Jinchi Lv
- Data Sciences and Operations Department, University of Southern California, Los Angeles, CA 90089, USA
| | - Andrea Baccarelli
- Department of Environmental Health, Harvard University, Boston, MA 02115, USA
| | | | - Lei Liu
- Department of Preventive Medicine
| |
Collapse
|
8
|
Agogo GO, van der Voet H, Van't Veer P, van Eeuwijk FA, Boshuizen HC. Evaluation of a two-part regression calibration to adjust for dietary exposure measurement error in the Cox proportional hazards model: A simulation study. Biom J 2016; 58:766-82. [PMID: 27003183 DOI: 10.1002/bimj.201500009] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2015] [Revised: 11/09/2015] [Accepted: 11/14/2015] [Indexed: 11/09/2022]
Abstract
Dietary questionnaires are prone to measurement error, which bias the perceived association between dietary intake and risk of disease. Short-term measurements are required to adjust for the bias in the association. For foods that are not consumed daily, the short-term measurements are often characterized by excess zeroes. Via a simulation study, the performance of a two-part calibration model that was developed for a single-replicate study design was assessed by mimicking leafy vegetable intake reports from the multicenter European Prospective Investigation into Cancer and Nutrition (EPIC) study. In part I of the fitted two-part calibration model, a logistic distribution was assumed; in part II, a gamma distribution was assumed. The model was assessed with respect to the magnitude of the correlation between the consumption probability and the consumed amount (hereafter, cross-part correlation), the number and form of covariates in the calibration model, the percentage of zero response values, and the magnitude of the measurement error in the dietary intake. From the simulation study results, transforming the dietary variable in the regression calibration to an appropriate scale was found to be the most important factor for the model performance. Reducing the number of covariates in the model could be beneficial, but was not critical in large-sample studies. The performance was remarkably robust when fitting a one-part rather than a two-part model. The model performance was minimally affected by the cross-part correlation.
Collapse
Affiliation(s)
- George O Agogo
- Biometris, Wageningen University and Research Centre, Postbus 16, 6700 AA, Wageningen, The Netherlands.,National Institute for Public Health and the Environment, Postbus 1, 3720 BA Bilthoven, The Netherlands
| | - Hilko van der Voet
- Biometris, Wageningen University and Research Centre, Postbus 16, 6700 AA, Wageningen, The Netherlands
| | - Pieter Van't Veer
- Division of Human Nutrition, Wageningen University, Postbus 8129, 6700 EV, Wageningen, The Netherlands
| | - Fred A van Eeuwijk
- Biometris, Wageningen University and Research Centre, Postbus 16, 6700 AA, Wageningen, The Netherlands
| | - Hendriek C Boshuizen
- Biometris, Wageningen University and Research Centre, Postbus 16, 6700 AA, Wageningen, The Netherlands.,National Institute for Public Health and the Environment, Postbus 1, 3720 BA Bilthoven, The Netherlands.,Division of Human Nutrition, Wageningen University, Postbus 8129, 6700 EV, Wageningen, The Netherlands
| |
Collapse
|
9
|
Wang H, Zhang Y, Feng C, Tu XM. On the Misuse of Taylor Expansion. Biometrics 2016; 71:1195. [PMID: 26769151 DOI: 10.1111/biom.12425] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Affiliation(s)
- Hongyue Wang
- Department of Biostatistics and Computational Biology, University of Rochester Medical Center, 601 Elmwood Ave., Box 630, Rochester, New York 14642, U.S.A
| | - Yun Zhang
- Department of Biostatistics and Computational Biology, University of Rochester Medical Center, 601 Elmwood Ave., Box 630, Rochester, New York 14642, U.S.A
| | - Changyong Feng
- Department of Biostatistics and Computational Biology, University of Rochester Medical Center, 601 Elmwood Ave., Box 630, Rochester, New York 14642, U.S.A
| | - Xin M Tu
- Department of Biostatistics and Computational Biology, University of Rochester Medical Center, 601 Elmwood Ave., Box 630, Rochester, New York 14642, U.S.A
| |
Collapse
|
10
|
Zhao S, Prentice RL. Rejoinder to "On the Misuse of Taylor Expansion". Biometrics 2016; 71:1195. [PMID: 26769150 DOI: 10.1111/biom.12426] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Shanshan Zhao
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, U.S.S
| | - Ross L Prentice
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, U.S.S
| |
Collapse
|
11
|
Kipnis V, Freedman LS, Carroll RJ, Midthune D. A bivariate measurement error model for semicontinuous and continuous variables: Application to nutritional epidemiology. Biometrics 2015; 72:106-15. [PMID: 26332011 DOI: 10.1111/biom.12377] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2014] [Revised: 06/01/2015] [Accepted: 06/01/2015] [Indexed: 11/27/2022]
Abstract
Semicontinuous data in the form of a mixture of a large portion of zero values and continuously distributed positive values frequently arise in many areas of biostatistics. This article is motivated by the analysis of relationships between disease outcomes and intakes of episodically consumed dietary components. An important aspect of studies in nutritional epidemiology is that true diet is unobservable and commonly evaluated by food frequency questionnaires with substantial measurement error. Following the regression calibration approach for measurement error correction, unknown individual intakes in the risk model are replaced by their conditional expectations given mismeasured intakes and other model covariates. Those regression calibration predictors are estimated using short-term unbiased reference measurements in a calibration substudy. Since dietary intakes are often "energy-adjusted," e.g., by using ratios of the intake of interest to total energy intake, the correct estimation of the regression calibration predictor for each energy-adjusted episodically consumed dietary component requires modeling short-term reference measurements of the component (a semicontinuous variable), and energy (a continuous variable) simultaneously in a bivariate model. In this article, we develop such a bivariate model, together with its application to regression calibration. We illustrate the new methodology using data from the NIH-AARP Diet and Health Study (Schatzkin et al., 2001, American Journal of Epidemiology 154, 1119-1125), and also evaluate its performance in a simulation study.
Collapse
Affiliation(s)
- Victor Kipnis
- Biometry Research Group, Division of Cancer Prevention, National Cancer Institute, Bethesda, Maryland
| | - Laurence S Freedman
- Information Management Services, Inc., Rockville, Maryland and Gertner Institute for Epidemiology and Health Policy Research, Sheba Medical Center, Tel Hashomer, Israel
| | - Raymond J Carroll
- Department of Statistics, Texas A&M University, College Station, Texas
| | - Douglas Midthune
- Biometry Research Group, Division of Cancer Prevention, National Cancer Institute, Bethesda, Maryland
| |
Collapse
|