1
|
Luo C, Daniels MJ. Variable Selection Using Bayesian Additive Regression Trees. Stat Sci 2024; 39:286-304. [PMID: 39281973 PMCID: PMC11395240 DOI: 10.1214/23-sts900] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/18/2024]
Abstract
Variable selection is an important statistical problem. This problem becomes more challenging when the candidate predictors are of mixed type (e.g. continuous and binary) and impact the response variable in nonlinear and/or non-additive ways. In this paper, we review existing variable selection approaches for the Bayesian additive regression trees (BART) model, a nonparametric regression model, which is flexible enough to capture the interactions between predictors and nonlinear relationships with the response. An emphasis of this review is on the ability to identify relevant predictors. We also propose two variable importance measures which can be used in a permutation-based variable selection approach, and a backward variable selection procedure for BART. We introduce these variations as a way of illustrating limitations and opportunities for improving current approaches and assess these via simulations.
Collapse
Affiliation(s)
- Chuji Luo
- Google LLC, Mountain View, California 94043,USA
| | - Michael J Daniels
- Department of Statistics, University of Florida, Gainesville, Florida 32611, USA
| |
Collapse
|
2
|
Xing H, Yau C. Bayesian inference for identifying tumour-specific cancer dependencies through integration of ex-vivo drug response assays and drug-protein profiling. BMC Bioinformatics 2024; 25:104. [PMID: 38459430 PMCID: PMC10921766 DOI: 10.1186/s12859-024-05682-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2023] [Accepted: 01/29/2024] [Indexed: 03/10/2024] Open
Abstract
The identification of tumor-specific molecular dependencies is essential for the development of effective cancer therapies. Genetic and chemical perturbations are powerful tools for discovering these dependencies. Even though chemical perturbations can be applied to primary cancer samples at large scale, the interpretation of experiment outcomes is often complicated by the fact that one chemical compound can affect multiple proteins. To overcome this challenge, Batzilla et al. (PLoS Comput Biol 18(8): e1010438, 2022) proposed DepInfeR, a regularized multi-response regression model designed to identify and estimate specific molecular dependencies of individual cancers from their ex-vivo drug sensitivity profiles. Inspired by their work, we propose a Bayesian extension to DepInfeR. Our proposed approach offers several advantages over DepInfeR, including e.g. the ability to handle missing values in both protein-drug affinity and drug sensitivity profiles without the need for data pre-processing steps such as imputation. Moreover, our approach uses Gaussian Processes to capture more complex molecular dependency structures, and provides probabilistic statements about whether a protein in the protein-drug affinity profiles is informative to the drug sensitivity profiles. Simulation studies demonstrate that our proposed approach achieves better prediction accuracy, and is able to discover unreported dependency structures.
Collapse
Affiliation(s)
- Hanwen Xing
- Nuffield Department of Women's and Reproductive Health, University of Oxford, Oxford, UK
| | - Christopher Yau
- Nuffield Department of Women's and Reproductive Health, University of Oxford, Oxford, UK.
- Health Data Research UK, London, UK.
| |
Collapse
|
3
|
Di Loro PA, Mingione M, Lipsitt J, Batteate CM, Jerrett M, Banerjee S. BAYESIAN HIERARCHICAL MODELING AND ANALYSIS FOR ACTIGRAPH DATA FROM WEARABLE DEVICES. Ann Appl Stat 2023; 17:2865-2886. [PMID: 38283128 PMCID: PMC10815935 DOI: 10.1214/23-aoas1742] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2024]
Abstract
The majority of Americans fail to achieve recommended levels of physical activity, which leads to numerous preventable health problems such as diabetes, hypertension, and heart diseases. This has generated substantial interest in monitoring human activity to gear interventions toward environmental features that may relate to higher physical activity. Wearable devices, such as wrist-worn sensors that monitor gross motor activity (actigraph units) continuously record the activity levels of a subject, producing massive amounts of high-resolution measurements. Analyzing actigraph data needs to account for spatial and temporal information on trajectories or paths traversed by subjects wearing such devices. Inferential objectives include estimating a subject's physical activity levels along a given trajectory; identifying trajectories that are more likely to produce higher levels of physical activity for a given subject; and predicting expected levels of physical activity in any proposed new trajectory for a given set of health attributes. Here, we devise a Bayesian hierarchical modeling framework for spatial-temporal actigraphy data to deliver fully model-based inference on trajectories while accounting for subject-level health attributes and spatial-temporal dependencies. We undertake a comprehensive analysis of an original dataset from the Physical Activity through Sustainable Transport Approaches in Los Angeles (PASTA-LA) study to ascertain spatial zones and trajectories exhibiting significantly higher levels of physical activity while accounting for various sources of heterogeneity.
Collapse
Affiliation(s)
| | | | - Jonah Lipsitt
- Department of Environmental Health Sciences, University of California, Los Angeles
| | - Christina M. Batteate
- Center of Occupational and Environmental Health, University of California, Los Angeles
| | - Michael Jerrett
- Department of Environmental Health Sciences, University of California, Los Angeles
| | - Sudipto Banerjee
- Department of Biostatistics, University of California, Los Angeles
| |
Collapse
|
4
|
Kim C, Tec M, Zigler C. Bayesian nonparametric adjustment of confounding. Biometrics 2023; 79:3252-3265. [PMID: 36718599 DOI: 10.1111/biom.13833] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2022] [Accepted: 01/19/2023] [Indexed: 02/01/2023]
Abstract
Analysis of observational studies increasingly confronts the challenge of determining which of a possibly high-dimensional set of available covariates are required to satisfy the assumption of ignorable treatment assignment for estimation of causal effects. We propose a Bayesian nonparametric approach that simultaneously (1) prioritizes inclusion of adjustment variables in accordance with existing principles of confounder selection; (2) estimates causal effects in a manner that permits complex relationships among confounders, exposures, and outcomes; and (3) provides causal estimates that account for uncertainty in the nature of confounding. The proposal relies on specification of multiple Bayesian additive regression trees models, linked together with a common prior distribution that accrues posterior selection probability to covariates on the basis of association with both the exposure and the outcome of interest. A set of extensive simulation studies demonstrates that the proposed method performs well relative to similarly-motivated methodologies in a variety of scenarios. We deploy the method to investigate the causal effect of emissions from coal-fired power plants on ambient air pollution concentrations, where the prospect of confounding due to local and regional meteorological factors introduces uncertainty around the confounding role of a high-dimensional set of measured variables. Ultimately, we show that the proposed method produces more efficient and more consistent results across adjacent years than alternative methods, lending strength to the evidence of the causal relationship between SO2 emissions and ambient particulate pollution.
Collapse
Affiliation(s)
- Chanmin Kim
- Department of Statistics, SungKyunKwan University, Seoul, South Korea
| | - Mauricio Tec
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, USA
| | - Corwin Zigler
- Department of Statistics and Data Science, The University of Texas, Austin, Texas, USA
| |
Collapse
|
5
|
Mork D, Kioumourtzoglou MA, Weisskopf M, Coull BA, Wilson A. Heterogeneous Distributed Lag Models to Estimate Personalized Effects of Maternal Exposures to Air Pollution. J Am Stat Assoc 2023; 119:14-26. [PMID: 38835505 PMCID: PMC11147136 DOI: 10.1080/01621459.2023.2258595] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2021] [Accepted: 09/07/2023] [Indexed: 06/06/2024]
Abstract
Children's health studies support an association between maternal environmental exposures and children's birth outcomes. A common goal is to identify critical windows of susceptibility-periods during gestation with increased association between maternal exposures and a future outcome. The timing of the critical windows and magnitude of the associations are likely heterogeneous across different levels of individual, family, and neighborhood characteristics. Using an administrative Colorado birth cohort we estimate the individualized relationship between weekly exposures to fine particulate matter (PM 2.5) during gestation and birth weight. To achieve this goal, we propose a statistical learning method combining distributed lag models and Bayesian additive regression trees to estimate critical windows at the individual level and identify characteristics that induce heterogeneity from a high-dimensional set of potential modifying factors. We find evidence of heterogeneity in the PM 2.5 -birth weight relationship, with some mother-child dyads showing a 3 times larger decrease in birth weight for an IQR increase in exposure (5.9 to 8.5 PM 2.5 μg/m3) compared to the population average. Specifically, we find increased vulnerabilitity for non-Hispanic mothers who are either younger, have higher body mass index or lower educational attainment. Our case study is the first precision health study of critical windows.
Collapse
Affiliation(s)
- Daniel Mork
- Department of Biostatistics, Harvard T.H. Chan School of Public Health
| | | | - Marc Weisskopf
- Department of Environmental Health, Harvard T.H. Chan School of Public Health
| | - Brent A Coull
- Department of Biostatistics, Harvard T.H. Chan School of Public Health
| | - Ander Wilson
- Department of Statistics, Colorado State University
| |
Collapse
|
6
|
Um S, Linero AR, Sinha D, Bandyopadhyay D. Bayesian additive regression trees for multivariate skewed responses. Stat Med 2023; 42:246-263. [PMID: 36433639 PMCID: PMC9851978 DOI: 10.1002/sim.9613] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2021] [Revised: 10/06/2022] [Accepted: 11/10/2022] [Indexed: 11/27/2022]
Abstract
This paper introduces a nonparametric regression approach for univariate and multivariate skewed responses using Bayesian additive regression trees (BART). Existing BART methods use ensembles of decision trees to model a mean function, and have become popular recently due to their high prediction accuracy and ease of use. The usual assumption of a univariate Gaussian error distribution, however, is restrictive in many biomedical applications. Motivated by an oral health study, we provide a useful extension of BART, the skewBART model, to address this problem. We then extend skewBART to allow for multivariate responses, with information shared across the decision trees associated with different responses within the same subject. The methodology accommodates within-subject association, and allows varying skewness parameters for the varying multivariate responses. We illustrate the benefits of our multivariate skewBART proposal over existing alternatives via simulation studies and application to the oral health dataset with bivariate highly skewed responses. Our methodology is implementable via the R package skewBART, available on GitHub.
Collapse
Affiliation(s)
- Seungha Um
- Department of Statistics, Florida State University, FL, USA
| | - Antonio R. Linero
- Department of Statistics and Data Sciences, University of Texas at Austin, TX, USA
| | | | | |
Collapse
|
7
|
Linero AR, Du J. Gibbs Priors for Bayesian Nonparametric Variable Selection with Weak Learners. J Comput Graph Stat 2022. [DOI: 10.1080/10618600.2022.2142594] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Affiliation(s)
- Antonio R. Linero
- Department of Statistics and Data Sciences, University of Texas at Austin
| | - Junliang Du
- Department of Statistics, Florida State University
| |
Collapse
|
8
|
Kim C. Bayesian additive regression trees in spatial data analysis with sparse observations. J STAT COMPUT SIM 2022. [DOI: 10.1080/00949655.2022.2102633] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/15/2022]
Affiliation(s)
- Chanmin Kim
- Department of Statistics, SungKyunKwan University, Seoul, South Korea
| |
Collapse
|
9
|
Roy A. Multivariate Gaussian RBF‐net for smooth function estimation and variable selection. Stat Anal Data Min 2021. [DOI: 10.1002/sam.11540] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Affiliation(s)
- Arkaprava Roy
- Department of Biostatistics University of Florida Gainesville Florida USA
| |
Collapse
|
10
|
Pinto-Ledezma JN, Cavender-Bares J. Predicting species distributions and community composition using satellite remote sensing predictors. Sci Rep 2021; 11:16448. [PMID: 34385574 PMCID: PMC8361206 DOI: 10.1038/s41598-021-96047-7] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2021] [Accepted: 08/04/2021] [Indexed: 02/07/2023] Open
Abstract
Biodiversity is rapidly changing due to changes in the climate and human related activities; thus, the accurate predictions of species composition and diversity are critical to developing conservation actions and management strategies. In this paper, using satellite remote sensing products as covariates, we constructed stacked species distribution models (S-SDMs) under a Bayesian framework to build next-generation biodiversity models. Model performance of these models was assessed using oak assemblages distributed across the continental United States obtained from the National Ecological Observatory Network (NEON). This study represents an attempt to evaluate the integrated predictions of biodiversity models-including assemblage diversity and composition-obtained by stacking next-generation SDMs. We found that applying constraints to assemblage predictions, such as using the probability ranking rule, does not improve biodiversity prediction models. Furthermore, we found that independent of the stacking procedure (bS-SDM versus pS-SDM versus cS-SDM), these kinds of next-generation biodiversity models do not accurately recover the observed species composition at the plot level or ecological-community scales (NEON plots are 400 m2). However, these models do return reasonable predictions at macroecological scales, i.e., moderately to highly correct assignments of species identities at the scale of NEON sites (mean area ~ 27 km2). Our results provide insights for advancing the accuracy of prediction of assemblage diversity and composition at different spatial scales globally. An important task for future studies is to evaluate the reliability of combining S-SDMs with direct detection of species using image spectroscopy to build a new generation of biodiversity models that accurately predict and monitor ecological assemblages through time and space.
Collapse
Affiliation(s)
- Jesús N Pinto-Ledezma
- Department of Ecology, Evolution and Behavior, University of Minnesota, 1479 Gortner Ave, Saint Paul, MN, 55108, USA.
| | - Jeannine Cavender-Bares
- Department of Ecology, Evolution and Behavior, University of Minnesota, 1479 Gortner Ave, Saint Paul, MN, 55108, USA
| |
Collapse
|
11
|
Murray JS. Log-Linear Bayesian Additive Regression Trees for Multinomial Logistic and Count Regression Models. J Am Stat Assoc 2021. [DOI: 10.1080/01621459.2020.1813587] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
Affiliation(s)
- Jared S. Murray
- Department of Information, Risk, & Operations Management and Department of Statistics & Data Science, University of Texas at Austin, Austin, TX
| |
Collapse
|
12
|
Application of Bayesian Additive Regression Trees for Estimating Daily Concentrations of PM 2.5 Components. ATMOSPHERE 2020; 11. [PMID: 34322279 PMCID: PMC8315111 DOI: 10.3390/atmos11111233] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/04/2022]
Abstract
Bayesian additive regression tree (BART) is a recent statistical method that combines ensemble learning and nonparametric regression. BART is constructed under a probabilistic framework that also allows for model-based prediction uncertainty quantification. We evaluated the application of BART in predicting daily concentrations of four fine particulate matter (PM2.5) components (elemental carbon, organic carbon, nitrate, and sulfate) in California during the period 2005 to 2014. We demonstrate in this paper how BART can be tuned to optimize prediction performance and how to evaluate variable importance. Our BART models included, as predictors, a large suite of land-use variables, meteorological conditions, satellite-derived aerosol optical depth parameters, and simulations from a chemical transport model. In cross-validation experiments, BART demonstrated good out-of-sample prediction performance at monitoring locations (R2 from 0.62 to 0.73). More importantly, prediction intervals associated with concentration estimates from BART showed good coverage probability at locations with and without monitoring data. In our case study, major PM2.5 components could be estimated with good accuracy, especially when collocated PM2.5 total mass observations were available. In conclusion, BART is an attractive approach for modeling ambient air pollution levels, especially for its ability to provide uncertainty in estimates that may be useful for subsequent health impact and health effect analyses.
Collapse
|
13
|
Krueger R, Bansal P, Buddhavarapu P. A new spatial count data model with Bayesian additive regression trees for accident hot spot identification. ACCIDENT; ANALYSIS AND PREVENTION 2020; 144:105623. [PMID: 32562928 DOI: 10.1016/j.aap.2020.105623] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/02/2020] [Revised: 05/24/2020] [Accepted: 05/28/2020] [Indexed: 06/11/2023]
Abstract
The identification of accident hot spots is a central task of road safety management. Bayesian count data models have emerged as the workhorse method for producing probabilistic rankings of hazardous sites in road networks. Typically, these methods assume simple linear link function specifications, which, however, limit the predictive power of a model. Furthermore, extensive specification searches are precluded by complex model structures arising from the need to account for unobserved heterogeneity and spatial correlations. Modern machine learning (ML) methods offer ways to automate the specification of the link function. However, these methods do not capture estimation uncertainty, and it is also difficult to incorporate spatial correlations. In light of these gaps in the literature, this paper proposes a new spatial negative binomial model which uses Bayesian additive regression trees to endogenously select the specification of the link function. Posterior inference in the proposed model is made feasible with the help of the Pólya-Gamma data augmentation technique. We test the performance of this new model on a crash count data set from a metropolitan highway network. The empirical results show that the proposed model performs at least as well as a baseline spatial count data model with random parameters in terms of goodness of fit and site ranking ability.
Collapse
Affiliation(s)
- Rico Krueger
- Transport and Mobility Laboratory, Ecole Polytechnique Fédérale de Lausanne, Switzerland.
| | - Prateek Bansal
- Department of Civil and Environmental Engineering, Imperial College London, UK.
| | - Prasad Buddhavarapu
- Department of Civil Architectural and Environmental Engineering, The University of Texas at Austin, United States.
| |
Collapse
|
14
|
Starling JE, Murray JS, Carvalho CM, Bukowski RK, Scott JG. BART with targeted smoothing: An analysis of patient-specific stillbirth risk. Ann Appl Stat 2020. [DOI: 10.1214/19-aoas1268] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
15
|
Demetriou EA, Park SH, Ho N, Pepper KL, Song YJC, Naismith SL, Thomas EE, Hickie IB, Guastella AJ. Machine Learning for Differential Diagnosis Between Clinical Conditions With Social Difficulty: Autism Spectrum Disorder, Early Psychosis, and Social Anxiety Disorder. Front Psychiatry 2020; 11:545. [PMID: 32636768 PMCID: PMC7319094 DOI: 10.3389/fpsyt.2020.00545] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/26/2020] [Accepted: 05/27/2020] [Indexed: 12/14/2022] Open
Abstract
Differential diagnosis in adult cohorts with social difficulty is confounded by comorbid mental health conditions, common etiologies, and shared phenotypes. Identifying shared and discriminating profiles can facilitate intervention and remediation strategies. The objective of the study was to identify salient features of a composite test battery of cognitive and mood measures using a machine learning paradigm in clinical cohorts with social interaction difficulties. We recruited clinical participants who met standardized diagnostic criteria for autism spectrum disorder (ASD: n = 62), early psychosis (EP: n = 48), or social anxiety disorder (SAD: N = 83) and compared them with a neurotypical comparison group (TYP: N = 43). Using five machine-learning algorithms and repeated cross-validation, we trained and tested classification models using measures of cognitive and executive function, lower- and higher-order social cognition and mood severity. Performance metrics were the area under the curve (AUC) and Brier Scores. Sixteen features successfully differentiated between the groups. The control versus social impairment cohorts (ASD, EP, SAD) were differentiated by social cognition, visuospatial memory and mood measures. Importantly, a distinct profile cluster drawn from social cognition, visual learning, executive function and mood, distinguished the neurodevelopmental cohort (EP and ASD) from the SAD group. The mean AUC range was between 0.891 and 0.916 for social impairment versus control cohorts and, 0.729 to 0.781 for SAD vs neurodevelopmental cohorts. This is the first study that compares an extensive battery of neuropsychological and self-report measures using a machine learning protocol in clinical and neurodevelopmental cohorts characterized by social impairment. Findings are relevant for diagnostic, intervention and remediation strategies for these groups.
Collapse
Affiliation(s)
- Eleni A Demetriou
- Autism Clinic for Translational Research, Child Neurodevelopment and Mental Health Team, Brain and Mind Centre, Children's Hospital Westmead l Clinical School, Faculty of Medicine and Health, University of Sydney, Sydney, NSW, Autstralia
| | - Shin H Park
- Autism Clinic for Translational Research, Child Neurodevelopment and Mental Health Team, Brain and Mind Centre, Children's Hospital Westmead l Clinical School, Faculty of Medicine and Health, University of Sydney, Sydney, NSW, Autstralia
| | - Nicholas Ho
- Autism Clinic for Translational Research, Child Neurodevelopment and Mental Health Team, Brain and Mind Centre, Children's Hospital Westmead l Clinical School, Faculty of Medicine and Health, University of Sydney, Sydney, NSW, Autstralia
| | - Karen L Pepper
- Autism Clinic for Translational Research, Child Neurodevelopment and Mental Health Team, Brain and Mind Centre, Children's Hospital Westmead l Clinical School, Faculty of Medicine and Health, University of Sydney, Sydney, NSW, Autstralia
| | - Yun J C Song
- Autism Clinic for Translational Research, Child Neurodevelopment and Mental Health Team, Brain and Mind Centre, Children's Hospital Westmead l Clinical School, Faculty of Medicine and Health, University of Sydney, Sydney, NSW, Autstralia
| | | | - Emma E Thomas
- Autism Clinic for Translational Research, Child Neurodevelopment and Mental Health Team, Brain and Mind Centre, Children's Hospital Westmead l Clinical School, Faculty of Medicine and Health, University of Sydney, Sydney, NSW, Autstralia
| | - Ian B Hickie
- Autism Clinic for Translational Research, Child Neurodevelopment and Mental Health Team, Brain and Mind Centre, Children's Hospital Westmead l Clinical School, Faculty of Medicine and Health, University of Sydney, Sydney, NSW, Autstralia.,Youth Mental Health Unit, Brain and Mind Centre, Central Clinical School, Faculty of Medicine and Health, University of Sydney, Sydney, NSW, Autstralia
| | - Adam J Guastella
- Autism Clinic for Translational Research, Child Neurodevelopment and Mental Health Team, Brain and Mind Centre, Children's Hospital Westmead l Clinical School, Faculty of Medicine and Health, University of Sydney, Sydney, NSW, Autstralia.,Youth Mental Health Unit, Brain and Mind Centre, Central Clinical School, Faculty of Medicine and Health, University of Sydney, Sydney, NSW, Autstralia
| |
Collapse
|
16
|
|
17
|
Lang S, Adebayo SB, Fahrmeir L, Steiner WJ. Bayesian Geoadditive Seemingly Unrelated Regression. Comput Stat 2019. [DOI: 10.1007/s001800300144] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
18
|
Zeldow B, Lo Re V, Roy J. A SEMIPARAMETRIC MODELING APPROACH USING BAYESIAN ADDITIVE REGRESSION TREES WITH AN APPLICATION TO EVALUATE HETEROGENEOUS TREATMENT EFFECTS. Ann Appl Stat 2019; 13:1989-2010. [PMID: 33072236 DOI: 10.1214/19-aoas1266] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Abstract
Bayesian Additive Regression Trees (BART) is a flexible machine learning algorithm capable of capturing nonlinearities between an outcome and covariates and interactions among covariates. We extend BART to a semiparametric regression framework in which the conditional expectation of an outcome is a function of treatment, its effect modifiers, and confounders. The confounders are allowed to have unspecified functional form, while treatment and effect modifiers that are directly related to the research question are given a linear form. The result is a Bayesian semiparametric linear regression model where the posterior distribution of the parameters of the linear part can be interpreted as in parametric Bayesian regression. This is useful in situations where a subset of the variables are of substantive interest and the others are nuisance variables that we would like to control for. An example of this occurs in causal modeling with the structural mean model (SMM). Under certain causal assumptions, our method can be used as a Bayesian SMM. Our methods are demonstrated with simulation studies and an application to dataset involving adults with HIV/Hepatitis C coinfection who newly initiate antiretroviral therapy. The methods are available in an R package called semibart.
Collapse
Affiliation(s)
- Bret Zeldow
- Department of Health Care Policy, Harvard Medical School, 180 Longwood Ave, Boston, Massachusetts 02115, USA
| | - Vincent Lo Re
- Department of Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA
| | - Jason Roy
- Department of Biostatistics and Epidemiology, Rutgers School of Public Health, Piscataway, New Jersey 08854, USA
| |
Collapse
|
19
|
Bellot A, van der Schaar M. A Hierarchical Bayesian Model for Personalized Survival Predictions. IEEE J Biomed Health Inform 2019; 23:72-80. [DOI: 10.1109/jbhi.2018.2832599] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
20
|
Rauch S, Bradman A, Coker E, Chevrier J, An S, Bornman R, Eskenazi B. Determinants of Exposure to Pyrethroid Insecticides in the VHEMBE Cohort, South Africa. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2018; 52:12108-12121. [PMID: 30991471 DOI: 10.1021/acs.est.8b02767] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
Exposure to pyrethroid insecticides has been linked to adverse health effects, and can originate from several sources, including indoor residual spraying (IRS) for malaria control, home pest control, food contamination, and occupational exposure. We aimed to explore the determinants of urinary pyrethroid metabolite concentrations in a rural population with high pesticide use. The Venda Health Examination of Mothers, Babies and their Environment (VHEMBE) is a birth cohort of 752 mother-child pairs in Limpopo, South Africa. We measured pyrethroid metabolites in maternal urine and collected information on several factors possibly related to pesticide exposure, including IRS, home pesticide use, and maternal factors (e.g., dietary habits and body composition). We performed statistical analysis using both conventional bivariate regressions and Bayesian variable selection methods. Urinary pyrethroid metabolites are consistently associated with pesticide factors around homes, including pesticide application in yards and food stocks, and IRS in the home during pregnancy, while more distant factors such as village spraying are not. High fat intake is associated with higher metabolite concentrations, and women from homes drawing water from wells or springs had marginally higher levels. Home pesticide use is the most consistent correlate of pyrethroid metabolite concentrations, but IRS, dietary habits, and household water source may also be important exposure determinants.
Collapse
Affiliation(s)
- Stephen Rauch
- Center for Environmental Research and Children's Health (CERCH), School of Public Health , University of California at Berkeley , Berkeley , California 94720 , United States
| | - Asa Bradman
- Center for Environmental Research and Children's Health (CERCH), School of Public Health , University of California at Berkeley , Berkeley , California 94720 , United States
| | - Eric Coker
- Center for Environmental Research and Children's Health (CERCH), School of Public Health , University of California at Berkeley , Berkeley , California 94720 , United States
| | - Jonathan Chevrier
- Department of Epidemiology, Biostatistics and Occupational Health , McGill University , Montréal , Quebec H3A 1A2 , Canada
| | - Sookee An
- Center for Environmental Research and Children's Health (CERCH), School of Public Health , University of California at Berkeley , Berkeley , California 94720 , United States
| | - Riana Bornman
- Department of Urology , University of Pretoria , Pretoria 0028 , South Africa
- University of Pretoria Institute for Sustainable Malaria Control and School of Health Systems and Public Health , University of Pretoria , Pretoria 0028 , South Africa
| | - Brenda Eskenazi
- Center for Environmental Research and Children's Health (CERCH), School of Public Health , University of California at Berkeley , Berkeley , California 94720 , United States
| |
Collapse
|
21
|
Extracting a Common Signal in Tree Ring Widths with a Semi-parametric Bayesian Hierarchical Model. JOURNAL OF AGRICULTURAL, BIOLOGICAL AND ENVIRONMENTAL STATISTICS 2018. [DOI: 10.1007/s13253-018-0330-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
|
22
|
|
23
|
|
24
|
Zhao Y, Zheng W, Zhuo DY, Lu Y, Ma X, Liu H, Zeng Z, Laird G. Bayesian additive decision trees of biomarker by treatment interactions for predictive biomarker detection and subgroup identification. J Biopharm Stat 2017; 28:534-549. [PMID: 29020511 DOI: 10.1080/10543406.2017.1372770] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Abstract
Personalized medicine, or tailored therapy, has been an active and important topic in recent medical research. Many methods have been proposed in the literature for predictive biomarker detection and subgroup identification. In this article, we propose a novel decision tree-based approach applicable in randomized clinical trials. We model the prognostic effects of the biomarkers using additive regression trees and the biomarker-by-treatment effect using a single regression tree. Bayesian approach is utilized to periodically revise the split variables and the split rules of the decision trees, which provides a better overall fitting. Gibbs sampler is implemented in the MCMC procedure, which updates the prognostic trees and the interaction tree separately. We use the posterior distribution of the interaction tree to construct the predictive scores of the biomarkers and to identify the subgroup where the treatment is superior to the control. Numerical simulations show that our proposed method performs well under various settings comparing to existing methods. We also demonstrate an application of our method in a real clinical trial.
Collapse
Affiliation(s)
| | | | - Daisy Y Zhuo
- b Operations Research Center , Massachusetts Institute of Technology , Cambridge , MA , USA
| | | | | | - Hengchang Liu
- c Department of Computer Science , University of Science and Technology of China , Suzhou , China
| | - Zhen Zeng
- d Department of Biostatistics , University of Pittsburgh , Pittsburgh , PA , USA
| | | |
Collapse
|
25
|
He J, Wanik DW, Hartman BM, Anagnostou EN, Astitha M, Frediani MEB. Nonparametric Tree-Based Predictive Modeling of Storm Outages on an Electric Distribution Network. RISK ANALYSIS : AN OFFICIAL PUBLICATION OF THE SOCIETY FOR RISK ANALYSIS 2017; 37:441-458. [PMID: 28418593 DOI: 10.1111/risa.12652] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
This article compares two nonparametric tree-based models, quantile regression forests (QRF) and Bayesian additive regression trees (BART), for predicting storm outages on an electric distribution network in Connecticut, USA. We evaluated point estimates and prediction intervals of outage predictions for both models using high-resolution weather, infrastructure, and land use data for 89 storm events (including hurricanes, blizzards, and thunderstorms). We found that spatially BART predicted more accurate point estimates than QRF. However, QRF produced better prediction intervals for high spatial resolutions (2-km grid cells and towns), while BART predictions aggregated to coarser resolutions (divisions and service territory) more effectively. We also found that the predictive accuracy was dependent on the season (e.g., tree-leaf condition, storm characteristics), and that the predictions were most accurate for winter storms. Given the merits of each individual model, we suggest that BART and QRF be implemented together to show the complete picture of a storm's potential impact on the electric distribution network, which would allow for a utility to make better decisions about allocating prestorm resources.
Collapse
Affiliation(s)
- Jichao He
- Department of Mathematics, University of Connecticut, Storrs, CT, USA
| | - David W Wanik
- Department of Civil and Environmental Engineering, University of Connecticut, Storrs, CT, USA
| | - Brian M Hartman
- Department of Statistics, Brigham Young University, Provo, UT, USA
| | - Emmanouil N Anagnostou
- Department of Civil and Environmental Engineering, University of Connecticut, Storrs, CT, USA
| | - Marina Astitha
- Department of Civil and Environmental Engineering, University of Connecticut, Storrs, CT, USA
| | - Maria E B Frediani
- Department of Civil and Environmental Engineering, University of Connecticut, Storrs, CT, USA
| |
Collapse
|
26
|
Abstract
Varying-coefficient models provide a flexible framework for semi- and nonparametric generalized regression analysis. We present a fully Bayesian B-spline basis function approach with adaptive knot selection. For each of the unknown regression functions or varying coefficients, the number and location of knots and the B-spline coefficients are estimated simultaneously using reversible jump Markov chain Monte Carlo sampling. The overall procedure can therefore be viewed as a kind of Bayesian model averaging. Although Gaussian responses are covered by the general framework, the method is particularly useful for fundamentally non-Gaussian responses, where less alternatives are available. We illustrate the approach with a thorough application to two data sets analysed previously in the literature: the kyphosis data set with a binary response and survival data from the Veteran’s Administration lung cancer trial.
Collapse
|
27
|
Waldmann P. Genome-wide prediction using Bayesian additive regression trees. Genet Sel Evol 2016; 48:42. [PMID: 27286957 PMCID: PMC4901500 DOI: 10.1186/s12711-016-0219-8] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2016] [Accepted: 05/26/2016] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The goal of genome-wide prediction (GWP) is to predict phenotypes based on marker genotypes, often obtained through single nucleotide polymorphism (SNP) chips. The major problem with GWP is high-dimensional data from many thousands of SNPs scored on several thousands of individuals. A large number of methods have been developed for GWP, which are mostly parametric methods that assume statistical linearity and only additive genetic effects. The Bayesian additive regression trees (BART) method was recently proposed and is based on the sum of nonparametric regression trees with the priors being used to regularize the parameters. Each regression tree is based on a recursive binary partitioning of the predictor space that approximates an unknown function, which will automatically model nonlinearities within SNPs (dominance) and interactions between SNPs (epistasis). In this study, we introduced BART and compared its predictive performance with that of the LASSO, Bayesian LASSO (BLASSO), genomic best linear unbiased prediction (GBLUP), reproducing kernel Hilbert space (RKHS) regression and random forest (RF) methods. RESULTS Tests on the QTLMAS2010 simulated data, which are mainly based on additive genetic effects, show that cross-validated optimization of BART provides a smaller prediction error than the RF, BLASSO, GBLUP and RKHS methods, and is almost as accurate as the LASSO method. If dominance and epistasis effects are added to the QTLMAS2010 data, the accuracy of BART relative to the other methods was increased. We also showed that BART can produce importance measures on the SNPs through variable inclusion proportions. In evaluations using real data on pigs, the prediction error was smaller with BART than with the other methods. CONCLUSIONS BART was shown to be an accurate method for GWP, in which the regression trees guarantee a very sparse representation of additive and complex non-additive genetic effects. Moreover, the Markov chain Monte Carlo algorithm with Bayesian back-fitting provides a computationally efficient procedure that is suitable for high-dimensional genomic data.
Collapse
Affiliation(s)
- Patrik Waldmann
- Department of Animal Breeding and Genetics, Swedish University of Agricultural Sciences (SLU), Box 7023, 750 07, Uppsala, Sweden.
| |
Collapse
|
28
|
|
29
|
Swihart BJ, Goldsmith J, Crainiceanu CM. Restricted Likelihood Ratio Tests for Functional Effects in the Functional Linear Model. Technometrics 2014. [DOI: 10.1080/00401706.2013.863163] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
30
|
Nonparametric Vector Autoregressions: Specification, Estimation, and Inference. ACTA ACUST UNITED AC 2014. [DOI: 10.1108/s0731-9053(2013)0000031009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register]
|
31
|
Santos E, Barrios E. Nonparametric Decomposition of Time Series Data with Inputs. COMMUN STAT-SIMUL C 2012. [DOI: 10.1080/03610918.2011.615970] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
|
32
|
|
33
|
Ryu D, Li E, Mallick BK. Bayesian nonparametric regression analysis of data with random effects covariates from longitudinal measurements. Biometrics 2010; 67:454-66. [PMID: 20880012 DOI: 10.1111/j.1541-0420.2010.01489.x] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
We consider nonparametric regression analysis in a generalized linear model (GLM) framework for data with covariates that are the subject-specific random effects of longitudinal measurements. The usual assumption that the effects of the longitudinal covariate processes are linear in the GLM may be unrealistic and if this happens it can cast doubt on the inference of observed covariate effects. Allowing the regression functions to be unknown, we propose to apply Bayesian nonparametric methods including cubic smoothing splines or P-splines for the possible nonlinearity and use an additive model in this complex setting. To improve computational efficiency, we propose the use of data-augmentation schemes. The approach allows flexible covariance structures for the random effects and within-subject measurement errors of the longitudinal processes. The posterior model space is explored through a Markov chain Monte Carlo (MCMC) sampler. The proposed methods are illustrated and compared to other approaches, the "naive" approach and the regression calibration, via simulations and by an application that investigates the relationship between obesity in adulthood and childhood growth curves.
Collapse
Affiliation(s)
- Duchwan Ryu
- Department of Biostatistics, Medical College of Georgia, Augusta, Georgia 30912-4900, USA.
| | | | | |
Collapse
|
34
|
Ting JA, D'Souza A, Vijayakumar S, Schaal S. Efficient Learning and Feature Selection in High-Dimensional Regression. Neural Comput 2010; 22:831-86. [DOI: 10.1162/neco.2009.02-08-702] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
Abstract
We present a novel algorithm for efficient learning and feature selection in high-dimensional regression problems. We arrive at this model through a modification of the standard regression model, enabling us to derive a probabilistic version of the well-known statistical regression technique of backfitting. Using the expectation-maximization algorithm, along with variational approximation methods to overcome intractability, we extend our algorithm to include automatic relevance detection of the input features. This variational Bayesian least squares (VBLS) approach retains its simplicity as a linear model, but offers a novel statistically robust black-box approach to generalized linear regression with high-dimensional inputs. It can be easily extended to nonlinear regression and classification problems. In particular, we derive the framework of sparse Bayesian learning, the relevance vector machine, with VBLS at its core, offering significant computational and robustness advantages for this class of methods. The iterative nature of VBLS makes it most suitable for real-time incremental learning, which is crucial especially in the application domain of robotics, brain-machine interfaces, and neural prosthetics, where real-time learning of models for control is needed. We evaluate our algorithm on synthetic and neurophysiological data sets, as well as on standard regression and classification benchmark data sets, comparing it with other competitive statistical approaches and demonstrating its suitability as a drop-in replacement for other generalized linear regression techniques.
Collapse
Affiliation(s)
| | | | | | - Stefan Schaal
- University of Southern California, Los Angeles, CA 90089, U.S.A., and ATR Computational Neuroscience Laboratories, Kyoto, Japan
| |
Collapse
|
35
|
|
36
|
Linton OB, Jacho-Chávez DT. On internally corrected and symmetrized kernel estimators for nonparametric regression. TEST-SPAIN 2009. [DOI: 10.1007/s11749-009-0145-y] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
37
|
Investigating Determinants of Multiple Sclerosis in Longitunal Studies: A Bayesian Approach. JOURNAL OF PROBABILITY AND STATISTICS 2009. [DOI: 10.1155/2009/198320] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Modelling data from Multiple Sclerosis longitudinal studies is a challenging topic since the phenotype of interest is typically ordinal; time intervals between two consecutive measurements are nonconstant and they can vary among individuals. Due to these unobservable sources of heterogeneity statistical models for analysis of Multiple Sclerosis severity evolve as a difficult feature. A few proposals have been provided in the biostatistical literature (Heijtan (1991); Albert, (1994)) to address the issue of investigating Multiple Sclerosis course. In this paper Bayesian P-Splines (Brezger and Lang, (2006); Fahrmeir and Lang (2001)) are indicated as an appropriate tool since they account for nonlinear smooth effects of covariates on the change in Multiple Sclerosis disability. By means of Bayesian P-Spline model we investigate both the randomness affecting Multiple Sclerosis data as well as the ordinal nature of the response variable.
Collapse
|
38
|
Berhane K, Molitor NT. A Bayesian approach to functional-based multilevel modeling of longitudinal data: applications to environmental epidemiology. Biostatistics 2008; 9:686-99. [PMID: 18349036 PMCID: PMC2733176 DOI: 10.1093/biostatistics/kxm059] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2007] [Revised: 11/07/2007] [Accepted: 12/17/2007] [Indexed: 11/13/2022] Open
Abstract
Flexible multilevel models are proposed to allow for cluster-specific smooth estimation of growth curves in a mixed-effects modeling format that includes subject-specific random effects on the growth parameters. Attention is then focused on models that examine between-cluster comparisons of the effects of an ecologic covariate of interest (e.g. air pollution) on nonlinear functionals of growth curves (e.g. maximum rate of growth). A Gibbs sampling approach is used to get posterior mean estimates of nonlinear functionals along with their uncertainty estimates. A second-stage ecologic random-effects model is used to examine the association between a covariate of interest (e.g. air pollution) and the nonlinear functionals. A unified estimation procedure is presented along with its computational and theoretical details. The models are motivated by, and illustrated with, lung function and air pollution data from the Southern California Children's Health Study.
Collapse
Affiliation(s)
- Kiros Berhane
- Department of Preventive Medicine, University of Southern California, Los Angeles, CA 90033-9987, USA.
| | | |
Collapse
|
39
|
Gold D, Mallick B, Coombes K. Real-Time Gene Expression: Statistical Challenges in Design and Inference. J Comput Biol 2008; 15:611-23. [DOI: 10.1089/cmb.2007.0220] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Affiliation(s)
- David Gold
- Department of Mathematics and Statistics, Boston University, Boston, Massachusetts
| | - Bani Mallick
- Department of Statistics, Texas A&M University, College Station, Texas
| | - Kevin Coombes
- Department of Bioinformatics, The University of Texas M.D. Anderson Cancer Center, Houston, Texas
| |
Collapse
|
40
|
Reithinger F, Jank W, Tutz G, Shmueli G. Modelling price paths in on-line auctions: smoothing sparse and unevenly sampled curves by using semiparametric mixed models. J R Stat Soc Ser C Appl Stat 2008. [DOI: 10.1111/j.1467-9876.2007.00605.x] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
|
41
|
|
42
|
Sun D, Speckman PL. Bayesian hierarchical linear mixed models for additive smoothing splines. ANN I STAT MATH 2007. [DOI: 10.1007/s10463-007-0127-3] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
43
|
Wood SN. ON CONFIDENCE INTERVALS FOR GENERALIZED ADDITIVE MODELS BASED ON PENALIZED REGRESSION SPLINES. AUST NZ J STAT 2006. [DOI: 10.1111/j.1467-842x.2006.00450.x] [Citation(s) in RCA: 108] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
44
|
Rigby RA, Stasinopoulos DM. Generalized additive models for location, scale and shape (with discussion). J R Stat Soc Ser C Appl Stat 2005. [DOI: 10.1111/j.1467-9876.2005.00510.x] [Citation(s) in RCA: 1166] [Impact Index Per Article: 61.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
45
|
Berhane K, Gauderman WJ, Stram DO, Thomas DC. Statistical Issues in Studies of the Long-Term Effects of Air Pollution: The Southern California Children’s Health Study. Stat Sci 2004. [DOI: 10.1214/088342304000000413] [Citation(s) in RCA: 37] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
46
|
|
47
|
KNORR-HELD LEONHARD, RUE HAVARD. On Block Updating in Markov Random Field Models for Disease Mapping. Scand Stat Theory Appl 2002. [DOI: 10.1111/1467-9469.00308] [Citation(s) in RCA: 119] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
48
|
|
49
|
|