1
|
Roy S, Daniels MJ, Roy J. A Bayesian nonparametric approach for multiple mediators with applications in mental health studies. Biostatistics 2024; 25:919-932. [PMID: 38332624 PMCID: PMC11247183 DOI: 10.1093/biostatistics/kxad038] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2023] [Revised: 12/14/2023] [Accepted: 12/15/2023] [Indexed: 02/10/2024] Open
Abstract
Mediation analysis with contemporaneously observed multiple mediators is a significant area of causal inference. Recent approaches for multiple mediators are often based on parametric models and thus may suffer from model misspecification. Also, much of the existing literature either only allow estimation of the joint mediation effect or estimate the joint mediation effect just as the sum of individual mediator effects, ignoring the interaction among the mediators. In this article, we propose a novel Bayesian nonparametric method that overcomes the two aforementioned drawbacks. We model the joint distribution of the observed data (outcome, mediators, treatment, and confounders) flexibly using an enriched Dirichlet process mixture with three levels. We use standardization (g-computation) to compute all possible mediation effects, including pairwise and all other possible interaction among the mediators. We thoroughly explore our method via simulations and apply our method to a mental health data from Wisconsin Longitudinal Study, where we estimate how the effect of births from unintended pregnancies on later life mental depression (CES-D) among the mothers is mediated through lack of self-acceptance and autonomy, employment instability, lack of social participation, and increased family stress. Our method identified significant individual mediators, along with some significant pairwise effects.
Collapse
Affiliation(s)
- Samrat Roy
- Operations and Decision Sciences, Indian Institute of Management Ahmedabad, Gujarat, India
| | | | - Jason Roy
- Department of Biostatistics and Epidemiology, Rutgers University, New Brunswick, USA
| |
Collapse
|
2
|
Chen X, Harhay MO, Tong G, Li F. A BAYESIAN MACHINE LEARNING APPROACH FOR ESTIMATING HETEROGENEOUS SURVIVOR CAUSAL EFFECTS: APPLICATIONS TO A CRITICAL CARE TRIAL. Ann Appl Stat 2024; 18:350-374. [PMID: 38455841 PMCID: PMC10919396 DOI: 10.1214/23-aoas1792] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/09/2024]
Abstract
Assessing heterogeneity in the effects of treatments has become increasingly popular in the field of causal inference and carries important implications for clinical decision-making. While extensive literature exists for studying treatment effect heterogeneity when outcomes are fully observed, there has been limited development in tools for estimating heterogeneous causal effects when patient-centered outcomes are truncated by a terminal event, such as death. Due to mortality occurring during study follow-up, the outcomes of interest are unobservable, undefined, or not fully observed for many participants in which case principal stratification is an appealing framework to draw valid causal conclusions. Motivated by the Acute Respiratory Distress Syndrome Network (ARDSNetwork) ARDS respiratory management (ARMA) trial, we developed a flexible Bayesian machine learning approach to estimate the average causal effect and heterogeneous causal effects among the always-survivors stratum when clinical outcomes are subject to truncation. We adopted Bayesian additive regression trees (BART) to flexibly specify separate mean models for the potential outcomes and latent stratum membership. In the analysis of the ARMA trial, we found that the low tidal volume treatment had an overall benefit for participants sustaining acute lung injuries on the outcome of time to returning home but substantial heterogeneity in treatment effects among the always-survivors, driven most strongly by biologic sex and the alveolar-arterial oxygen gradient at baseline (a physiologic measure of lung function and degree of hypoxemia). These findings illustrate how the proposed methodology could guide the prognostic enrichment of future trials in the field.
Collapse
Affiliation(s)
- Xinyuan Chen
- Department of Mathematics and Statistics, Mississippi State University
| | - Michael O. Harhay
- Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania
| | - Guangyu Tong
- Department of Biostatistics, Yale School of Public Health
| | - Fan Li
- Department of Biostatistics, Yale School of Public Health
| |
Collapse
|
3
|
Zeng S, Lange EC, Archie EA, Campos FA, Alberts SC, Li F. A Causal Mediation Model for Longitudinal Mediators and Survival Outcomes with an Application to Animal Behavior. JOURNAL OF AGRICULTURAL, BIOLOGICAL, AND ENVIRONMENTAL STATISTICS 2023; 28:197-218. [PMID: 37415781 PMCID: PMC10321498 DOI: 10.1007/s13253-022-00490-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/15/2021] [Revised: 01/26/2022] [Accepted: 01/28/2022] [Indexed: 07/08/2023]
Abstract
In animal behavior studies, a common goal is to investigate the causal pathways between an exposure and outcome, and a mediator that lies in between. Causal mediation analysis provides a principled approach for such studies. Although many applications involve longitudinal data, the existing causal mediation models are not directly applicable to settings where the mediators are measured on irregular time grids. In this paper, we propose a causal mediation model that accommodates longitudinal mediators on arbitrary time grids and survival outcomes simultaneously. We take a functional data analysis perspective and view longitudinal mediators as realizations of underlying smooth stochastic processes. We define causal estimands of direct and indirect effects accordingly and provide corresponding identification assumptions. We employ a functional principal component analysis approach to estimate the mediator process and propose a Cox hazard model for the survival outcome that flexibly adjusts the mediator process. We then derive a g-computation formula to express the causal estimands using the model coefficients. The proposed method is applied to a longitudinal data set from the Amboseli Baboon Research Project to investigate the causal relationships between early adversity, adult physiological stress responses, and survival among wild female baboons. We find that adversity experienced in early life has a significant direct effect on females' life expectancy and survival probability, but find little evidence that these effects were mediated by markers of the stress response in adulthood. We further developed a sensitivity analysis method to assess the impact of potential violation to the key assumption of sequential ignorability. Supplementary materials accompanying this paper appear on-line.
Collapse
Affiliation(s)
| | | | - Elizabeth A Archie
- Department of Biological Sciences, University of Notre Dame, Notre Dame, IN, USA
| | - Fernando A Campos
- Department of Antropology, University of Texas at San Antonio, San Antonio, TX, USA
| | - Susan C Alberts
- Department of Biology, Duke University, Durham, NC, USA.; Department of Evolutionary Anthropology, Duke University, Durham, NC, USA
| | - Fan Li
- Department of Statistical Science, Duke University, 214 Old Chemistry Building, Durham, NC 27708, USA
| |
Collapse
|
4
|
Fu J, Koslovsky MD, Neophytou AM, Vannucci M. A Bayesian joint model for compositional mediation effect selection in microbiome data. Stat Med 2023. [PMID: 37173609 DOI: 10.1002/sim.9764] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2022] [Revised: 04/17/2023] [Accepted: 04/26/2023] [Indexed: 05/15/2023]
Abstract
Analyzing multivariate count data generated by high-throughput sequencing technology in microbiome research studies is challenging due to the high-dimensional and compositional structure of the data and overdispersion. In practice, researchers are often interested in investigating how the microbiome may mediate the relation between an assigned treatment and an observed phenotypic response. Existing approaches designed for compositional mediation analysis are unable to simultaneously determine the presence of direct effects, relative indirect effects, and overall indirect effects, while quantifying their uncertainty. We propose a formulation of a Bayesian joint model for compositional data that allows for the identification, estimation, and uncertainty quantification of various causal estimands in high-dimensional mediation analysis. We conduct simulation studies and compare our method's mediation effects selection performance with existing methods. Finally, we apply our method to a benchmark data set investigating the sub-therapeutic antibiotic treatment effect on body weight in early-life mice.
Collapse
Affiliation(s)
- Jingyan Fu
- Department of Statistics, Rice University, Houston, Texas, USA
| | - Matthew D Koslovsky
- Department of Statistics, Colorado State University, Fort Collins, Colorado, USA
| | - Andreas M Neophytou
- Department of Environmental & Radiological Health Sciences, Colorado State University, Fort Collins, Colorado, USA
| | - Marina Vannucci
- Department of Statistics, Rice University, Houston, Texas, USA
| |
Collapse
|
5
|
Zhao Y, Chen T, Cai J, Lichenstein S, Potenza MN, Yip SW. Bayesian network mediation analysis with application to the brain functional connectome. Stat Med 2022; 41:3991-4005. [PMID: 35795965 PMCID: PMC10131252 DOI: 10.1002/sim.9488] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2021] [Revised: 04/12/2022] [Accepted: 05/18/2022] [Indexed: 11/10/2022]
Abstract
The brain functional connectome, the collection of interconnected neural circuits along functional networks, facilitates a cutting-edge understanding of brain functioning, and has a potential to play a mediating role within the effect pathway between an exposure and an outcome. While existing mediation analytic approaches are capable of providing insight into complex processes, they mainly focus on a univariate mediator or mediator vector, without considering network-variate mediators. To fill the methodological gap and accomplish this exciting and urgent application, in the article, we propose an integrative mediation analysis under a Bayesian paradigm with networks entailing the mediation effect. To parameterize the network measurements, we introduce individually specified stochastic block models with unknown block allocation, and naturally bridge effect elements through the latent network mediators induced by the connectivity weights across network modules. To enable the identification of truly active mediating components, we simultaneously impose a feature selection across network mediators. We show the superiority of our model in estimating different effect components and selecting active mediating network structures. As a practical illustration of this approach's application to network neuroscience, we characterize the relationship between a therapeutic intervention and opioid abstinence as mediated by brain functional sub-networks.
Collapse
Affiliation(s)
- Yize Zhao
- Department of Biostatistics, Yale University School of Public Health, New Haven, Connecticut, USA
- Yale Center for Analytical Sciences, Yale University School of Public Health, New Haven, Connecticut, USA
| | - Tianqi Chen
- Department of Biostatistics, Yale University School of Public Health, New Haven, Connecticut, USA
| | - Jiachen Cai
- Department of Biostatistics, Yale University School of Public Health, New Haven, Connecticut, USA
| | - Sarah Lichenstein
- Department of Psychiatry, Yale University School of Medicine, New Haven, Connecticut, USA
| | - Marc N Potenza
- Department of Psychiatry, Yale University School of Medicine, New Haven, Connecticut, USA
- Child Study Center, Yale University School of Medicine, New Haven, Connecticut, USA
- Department of Neuroscience, Yale University School of Medicine, New Haven, Connecticut, USA
- Connecticut Mental Health Center, New Haven, Connecticut, USA
- Connecticut Council on Problem Gambling, Wethersfield, Connecticut, USA
- Wu Tsai Institute, Yale University School of Medicine, New Haven, Connecticut, USA
| | - Sarah W Yip
- Department of Psychiatry, Yale University School of Medicine, New Haven, Connecticut, USA
- Child Study Center, Yale University School of Medicine, New Haven, Connecticut, USA
| |
Collapse
|
6
|
Lipkovich I, Ratitch B, Qu Y, Zhang X, Shan M, Mallinckrodt C. Using principal stratification in analysis of clinical trials. Stat Med 2022; 41:3837-3877. [PMID: 35851717 DOI: 10.1002/sim.9439] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2021] [Revised: 03/06/2022] [Accepted: 05/03/2022] [Indexed: 11/08/2022]
Abstract
The ICH E9(R1) addendum (2019) proposed principal stratification (PS) as one of five strategies for dealing with intercurrent events. Therefore, understanding the strengths, limitations, and assumptions of PS is important for the broad community of clinical trialists. Many approaches have been developed under the general framework of PS in different areas of research, including experimental and observational studies. These diverse applications have utilized a diverse set of tools and assumptions. Thus, need exists to present these approaches in a unifying manner. The goal of this tutorial is threefold. First, we provide a coherent and unifying description of PS. Second, we emphasize that estimation of effects within PS relies on strong assumptions and we thoroughly examine the consequences of these assumptions to understand in which situations certain assumptions are reasonable. Finally, we provide an overview of a variety of key methods for PS analysis and use a real clinical trial example to illustrate them. Examples of code for implementation of some of these approaches are given in Supplemental Materials.
Collapse
Affiliation(s)
| | | | - Yongming Qu
- Eli Lilly and Company, Indianapolis, Indiana, USA
| | - Xiang Zhang
- CSL Behring, King of Prussia, Pennsylvania, USA
| | | | | |
Collapse
|
7
|
Kim C. Bayesian additive regression trees in spatial data analysis with sparse observations. J STAT COMPUT SIM 2022. [DOI: 10.1080/00949655.2022.2102633] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/15/2022]
Affiliation(s)
- Chanmin Kim
- Department of Statistics, SungKyunKwan University, Seoul, South Korea
| |
Collapse
|
8
|
Zigler CM. Invited Commentary: The Promise and Pitfalls of Causal Inference With Multivariate Environmental Exposures. Am J Epidemiol 2021; 190:2658-2661. [PMID: 34079988 DOI: 10.1093/aje/kwab142] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2021] [Revised: 03/01/2021] [Accepted: 03/29/2021] [Indexed: 01/15/2023] Open
Abstract
The accompanying article by Keil et al. (Am J Epidemiol. 2021;190(12):2647-2657) deploys Bayesian g-computation to investigate the causal effect of 6 airborne metal exposures linked to power-plant emissions on birth weight. In so doing, it articulates the potential value of framing the analysis of environmental mixtures as an explicit contrast between exposure distributions that might arise in response to a well-defined intervention-here, the decommissioning of coal plants. Framing the mixture analysis as that of an approximate "target trial" is an important approach that deserves incorporation into the already rich literature on the analysis of environmental mixtures. However, its deployment in the power plant example highlights challenges that can arise when the target trial is at odds with the exposure distribution observed in the data, a discordance that seems particularly difficult in studies of environmental mixtures. Bayesian methodology such as model averaging and informative priors can help, but they are ultimately limited for overcoming this salient challenge.
Collapse
|
9
|
Song Y, Zhou X, Kang J, Aung MT, Zhang M, Zhao W, Needham BL, Kardia SLR, Liu Y, Meeker JD, Smith JA, Mukherjee B. Bayesian Sparse Mediation Analysis with Targeted Penalization of Natural Indirect Effects. J R Stat Soc Ser C Appl Stat 2021; 70:1391-1412. [PMID: 34887595 PMCID: PMC8653861 DOI: 10.1111/rssc.12518] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
Causal mediation analysis aims to characterize an exposure's effect on an outcome and quantify the indirect effect that acts through a given mediator or a group of mediators of interest. With the increasing availability of measurements on a large number of potential mediators, like the epigenome or the microbiome, new statistical methods are needed to simultaneously accommodate high-dimensional mediators while directly target penalization of the natural indirect effect (NIE) for active mediator identification. Here, we develop two novel prior models for identification of active mediators in high-dimensional mediation analysis through penalizing NIEs in a Bayesian paradigm. Both methods specify a joint prior distribution on the exposure-mediator effect and mediator-outcome effect with either (a) a four-component Gaussian mixture prior or (b) a product threshold Gaussian prior. By jointly modeling the two parameters that contribute to the NIE, the proposed methods enable penalization on their product in a targeted way. Resultant inference can take into account the four-component composite structure underlying the NIE. We show through simulations that the proposed methods improve both selection and estimation accuracy compared to other competing methods. We applied our methods for an in-depth analysis of two ongoing epidemiologic studies: the Multi-Ethnic Study of Atherosclerosis (MESA) and the LIFECODES birth cohort. The identified active mediators in both studies reveal important biological pathways for understanding disease mechanisms.
Collapse
Affiliation(s)
- Yanyi Song
- University of Michigan, Ann Arbor, MI, USA
| | - Xiang Zhou
- University of Michigan, Ann Arbor, MI, USA
| | - Jian Kang
- University of Michigan, Ann Arbor, MI, USA
| | - Max T Aung
- University of Michigan, Ann Arbor, MI, USA
| | - Min Zhang
- University of Michigan, Ann Arbor, MI, USA
| | - Wei Zhao
- University of Michigan, Ann Arbor, MI, USA
| | | | | | | | | | | | | |
Collapse
|
10
|
Zeng S, Rosenbaum S, Alberts SC, Archie EA, Li F. Causal mediation analysis for sparse and irregular longitudinal data. Ann Appl Stat 2021. [DOI: 10.1214/20-aoas1427] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Affiliation(s)
- Shuxi Zeng
- Department of Statistical Science, Duke University
| | | | - Susan C. Alberts
- Departments of Biology and Evolutionary Anthropology, Duke University
| | | | - Fan Li
- Department of Statistical Science, Duke University
| |
Collapse
|
11
|
Kim C, Lin X, Nelson KP. Measuring rater bias in diagnostic tests with ordinal ratings. Stat Med 2021; 40:4014-4033. [PMID: 33969509 DOI: 10.1002/sim.9011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2020] [Revised: 04/12/2021] [Accepted: 04/12/2021] [Indexed: 11/11/2022]
Abstract
Diagnostic tests are frequently reliant upon the interpretation of images by skilled raters. In many clinical settings, however, the variability observed between experts' ratings plays a detrimental role in the degree of confidence in these interpretations, leading to uncertainty in the diagnostic process. For example, in breast cancer testing, radiologists interpret mammographic images, while breast biopsy results are examined by pathologists. Each of these procedures involves elements of subjectivity. We propose here a flexible two-stage Bayesian latent variable model to investigate how the skills of individual raters impact the diagnostic accuracy of image-related testing in large-scale medical testing studies. A strength of the proposed model is that the true disease status of a patient within a reasonable time frame may or may not be known. In these studies, many raters each contribute classifications on a large sample of patients using a defined ordinal grading scale, leading to a complex correlation structure between ratings. Our modeling approach considers the different sources of variability contributed by experts and patients while accounting for correlations present between ratings and patients, in contrast to currently available methods. We propose a novel measure of a rater's ability (magnifier) that, in contrast to conventional measures of sensitivity and specificity, is robust to the underlying prevalence of disease in the population, providing an alternative measure of diagnostic accuracy across patient populations. Extensive simulation studies demonstrate lower bias in estimation of parameters and measures of accuracy, and illustrate outperformance of the proposed model when compared with existing models. Receiver operator characteristic curves are derived to assess the diagnostic accuracy of individual experts and their overall performance. Our proposed modeling approach is applied to a large breast imaging study for known disease status and a uterine cancer dataset for unknown disease status.
Collapse
Affiliation(s)
- Chanmin Kim
- Department of Statistics, SungKyunKwan University, Jongno-gu, South Korea
| | - Xiaoyan Lin
- Department of Statistics, University of South Carolina, Columbia, South Carolina, USA
| | - Kerrie P Nelson
- Department of Biostatistics, Boston University, Boston, Massachusetts, USA
| |
Collapse
|
12
|
Abstract
Statistical methods to evaluate the effectiveness of interventions are increasingly challenged by the inherent interconnectedness of units. Specifically, a recent flurry of methods research has addressed the problem of interference between observations, which arises when one observational unit's outcome depends not only on its treatment but also the treatment assigned to other units. We introduce the setting of bipartite causal inference with interference, which arises when 1) treatments are defined on observational units that are distinct from those at which outcomes are measured and 2) there is interference between units in the sense that outcomes for some units depend on the treatments assigned to many other units. The focus of this work is to formulate definitions and several possible causal estimands for this setting, highlighting similarities and differences with more commonly considered settings of causal inference with interference. Towards an empirical illustration, an inverse probability of treatment weighted estimator is adapted from existing literature to estimate a subset of simplified, but interesting, estimands. The estimators are deployed to evaluate how interventions to reduce air pollution from 473 power plants in the U.S. causally affect cardiovascular hospitalization among Medicare beneficiaries residing at 18,807 zip code locations.
Collapse
Affiliation(s)
- Corwin M Zigler
- Department of Statistical Science, Duke University, 206 Old Chem Bldg, Durham, NC 27708
| | - Georgia Papadogeorgou
- Department of Statistical Science, Duke University, 206 Old Chem Bldg, Durham, NC 27708
| |
Collapse
|
13
|
Jérolon A, Baglietto L, Birmelé E, Alarcon F, Perduca V. Causal mediation analysis in presence of multiple mediators uncausally related. Int J Biostat 2020; 17:191-221. [PMID: 32990647 DOI: 10.1515/ijb-2019-0088] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2019] [Accepted: 08/06/2020] [Indexed: 11/15/2022]
Abstract
Mediation analysis aims at disentangling the effects of a treatment on an outcome through alternative causal mechanisms and has become a popular practice in biomedical and social science applications. The causal framework based on counterfactuals is currently the standard approach to mediation, with important methodological advances introduced in the literature in the last decade, especially for simple mediation, that is with one mediator at the time. Among a variety of alternative approaches, Imai et al. showed theoretical results and developed an R package to deal with simple mediation as well as with multiple mediation involving multiple mediators conditionally independent given the treatment and baseline covariates. This approach does not allow to consider the often encountered situation in which an unobserved common cause induces a spurious correlation between the mediators. In this context, which we refer to as mediation with uncausally related mediators, we show that, under appropriate hypothesis, the natural direct and joint indirect effects are non-parametrically identifiable. Moreover, we adopt the quasi-Bayesian algorithm developed by Imai et al. and propose a procedure based on the simulation of counterfactual distributions to estimate not only the direct and joint indirect effects but also the indirect effects through individual mediators. We study the properties of the proposed estimators through simulations. As an illustration, we apply our method on a real data set from a large cohort to assess the effect of hormone replacement treatment on breast cancer risk through three mediators, namely dense mammographic area, nondense area and body mass index.
Collapse
Affiliation(s)
- Allan Jérolon
- Laboratoire MAP5 (UMR CNRS 8145), Université de Paris, Paris, Île-de-France, France
| | - Laura Baglietto
- Department of Clinical and Experimental Medicine, Università di Pisa, Pisa, Italy
| | - Etienne Birmelé
- Laboratoire MAP5 (UMR CNRS 8145), Université de Paris, Paris, Île-de-France, France
| | - Flora Alarcon
- Laboratoire MAP5 (UMR CNRS 8145), Université de Paris, Paris, Île-de-France, France
| | - Vittorio Perduca
- Laboratoire MAP5 (UMR CNRS 8145), Université de Paris, Paris, Île-de-France, France
| |
Collapse
|
14
|
Song Y, Zhou X, Zhang M, Zhao W, Liu Y, Kardia SLR, Diez Roux AV, Needham BL, Smith JA, Mukherjee B. Bayesian shrinkage estimation of high dimensional causal mediation effects in omics studies. Biometrics 2020; 76:700-710. [PMID: 31733066 PMCID: PMC7228845 DOI: 10.1111/biom.13189] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2018] [Revised: 10/30/2019] [Accepted: 11/04/2019] [Indexed: 11/29/2022]
Abstract
Causal mediation analysis aims to examine the role of a mediator or a group of mediators that lie in the pathway between an exposure and an outcome. Recent biomedical studies often involve a large number of potential mediators based on high-throughput technologies. Most of the current analytic methods focus on settings with one or a moderate number of potential mediators. With the expanding growth of -omics data, joint analysis of molecular-level genomics data with epidemiological data through mediation analysis is becoming more common. However, such joint analysis requires methods that can simultaneously accommodate high-dimensional mediators and that are currently lacking. To address this problem, we develop a Bayesian inference method using continuous shrinkage priors to extend previous causal mediation analysis techniques to a high-dimensional setting. Simulations demonstrate that our method improves the power of global mediation analysis compared to simpler alternatives and has decent performance to identify true nonnull contributions to the mediation effects of the pathway. The Bayesian method also helps us to understand the structure of the composite null cases for inactive mediators in the pathway. We applied our method to Multi-Ethnic Study of Atherosclerosis and identified DNA methylation regions that may actively mediate the effect of socioeconomic status on cardiometabolic outcomes.
Collapse
Affiliation(s)
- Yanyi Song
- Department of Biostatistics, University of Michigan, Ann Arbor, MI, U.S.A
| | - Xiang Zhou
- Department of Biostatistics, University of Michigan, Ann Arbor, MI, U.S.A
| | - Min Zhang
- Department of Biostatistics, University of Michigan, Ann Arbor, MI, U.S.A
| | - Wei Zhao
- Department of Epidemiology, University of Michigan, Ann Arbor, MI, U.S.A
| | - Yongmei Liu
- Department of Epidemiology and Prevention, Wake Forest School of Medicine, Winston-Salem, NC, U.S.A
| | | | - Ana V. Diez Roux
- Department of Epidemiology and Biostatistics, Drexel University, Philadelphia, PA, U.S.A
| | - Belinda L. Needham
- Department of Epidemiology, University of Michigan, Ann Arbor, MI, U.S.A
| | - Jennifer A. Smith
- Department of Epidemiology, University of Michigan, Ann Arbor, MI, U.S.A
| | - Bhramar Mukherjee
- Department of Biostatistics, University of Michigan, Ann Arbor, MI, U.S.A
| |
Collapse
|