1
|
Lovblom LE, Briollais L, Perkins BA, Tomlinson G. Modeling multiple correlated end-organ disease trajectories: A tutorial for multistate and joint models with applications in diabetes complications. Stat Med 2024; 43:1048-1082. [PMID: 38118464 DOI: 10.1002/sim.9984] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2022] [Revised: 11/13/2023] [Accepted: 11/22/2023] [Indexed: 12/22/2023]
Abstract
State-of-the-art biostatistics methods allow for the simultaneous modeling of several correlated non-fatal disease processes over time, but there is no clear guidance on the optimal analysis in most settings. An example occurs in diabetes, where it is not known with certainty how microvascular complications of the eyes, kidneys, and nerves co-develop over time. In this article, we propose and contrast two general model frameworks for studying complications (sequential state and parallel trajectory frameworks) and review multivariate methods for their analysis, focusing on multistate and joint modeling. We illustrate these methods in a tutorial format using the long-term follow-up from the Diabetes Control and Complications Trial and Epidemiology of Diabetes Interventions and Complications study public data repository. A formal comparison of prediction error and discrimination is included. Multistate models are particularly advantageous for determining the order and timing of complications, but require discretization of the longitudinal outcomes and possibly a very complex state space process. Intermittent observation of the states must be accounted for, and discretization is a probable disadvantage in this setting. In contrast, joint models can account for variations of continuous biomarkers over time and are particularly designed for modeling complex association structures between the complications and for performing dynamic predictions of an outcome of interest to inform clinical decisions (eg, a late-stage complication). We found that both models have helpful features that can better-inform our understanding of the complex trajectories that complications may take and can therefore help with decision making for patients presenting with diabetes complications.
Collapse
Affiliation(s)
- Leif Erik Lovblom
- Biostatistics Department, University Health Network, Toronto, Ontario, Canada
- Lunenfeld-Tanenbaum Research Institute, Sinai Health, Toronto, Ontario, Canada
| | - Laurent Briollais
- Lunenfeld-Tanenbaum Research Institute, Sinai Health, Toronto, Ontario, Canada
- Biostatistics Division, Dalla Lana School of Public Health, University of Toronto, Toronto, Ontario, Canada
| | - Bruce A Perkins
- Lunenfeld-Tanenbaum Research Institute, Sinai Health, Toronto, Ontario, Canada
- Leadership Sinai Centre for Diabetes, Sinai Health, Toronto, Ontario, Canada
- Institute of Health Policy, Management and Evaluation, University of Toronto, Toronto, Ontario, Canada
| | - George Tomlinson
- Biostatistics Division, Dalla Lana School of Public Health, University of Toronto, Toronto, Ontario, Canada
- Institute of Health Policy, Management and Evaluation, University of Toronto, Toronto, Ontario, Canada
- Department of Medicine at UHN/Sinai Health, University of Toronto, Toronto, Ontario, Canada
| |
Collapse
|
2
|
Mews S, Surmann B, Hasemann L, Elkenkamp S. Markov-modulated marked Poisson processes for modeling disease dynamics based on medical claims data. Stat Med 2023; 42:3804-3815. [PMID: 37308135 DOI: 10.1002/sim.9832] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2022] [Revised: 05/26/2023] [Accepted: 06/01/2023] [Indexed: 06/14/2023]
Abstract
We explore Markov-modulated marked Poisson processes (MMMPPs) as a natural framework for modeling patients' disease dynamics over time based on medical claims data. In claims data, observations do not only occur at random points in time but are also informative, that is, driven by unobserved disease levels, as poor health conditions usually lead to more frequent interactions with the health care system. Therefore, we model the observation process as a Markov-modulated Poisson process, where the rate of health care interactions is governed by a continuous-time Markov chain. Its states serve as proxies for the patients' latent disease levels and further determine the distribution of additional data collected at each observation time, the so-called marks. Overall, MMMPPs jointly model observations and their informative time points by comprising two state-dependent processes: the observation process (corresponding to the event times) and the mark process (corresponding to event-specific information), which both depend on the underlying states. The approach is illustrated using claims data from patients diagnosed with chronic obstructive pulmonary disease by modeling their drug use and the interval lengths between consecutive physician consultations. The results indicate that MMMPPs are able to detect distinct patterns of health care utilization related to disease processes and reveal interindividual differences in the state-switching dynamics.
Collapse
Affiliation(s)
- Sina Mews
- Department of Business Administration and Economics, Bielefeld University, Bielefeld, Germany
| | - Bastian Surmann
- Department for Health Economics and Health Care Management, Bielefeld University, Bielefeld, Germany
| | - Lena Hasemann
- Department for Health Economics and Health Care Management, Bielefeld University, Bielefeld, Germany
| | - Svenja Elkenkamp
- Department for Health Economics and Health Care Management, Bielefeld University, Bielefeld, Germany
| |
Collapse
|
3
|
Skourlis N, Crowther MJ, Andersson TML, Lu D, Lambe M, Lambert PC. Exploring different research questions via complex multi-state models when using registry-based repeated prescriptions of antidepressants in women with breast cancer and a matched population comparison group. BMC Med Res Methodol 2023; 23:87. [PMID: 37038100 PMCID: PMC10084660 DOI: 10.1186/s12874-023-01905-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2022] [Accepted: 03/29/2023] [Indexed: 04/12/2023] Open
Abstract
BACKGROUND Multi-state models are used to study several clinically meaningful research questions. Depending on the research question of interest and the information contained in the data, different multi-state structures and modelling choices can be applied. We aim to explore different research questions using a series of multi-state models of increasing complexity when studying repeated prescriptions data, while also evaluating different modelling choices. METHODS We develop a series of research questions regarding the probability of being under antidepressant medication across time using multi-state models, among Swedish women diagnosed with breast cancer (n = 18,313) and an age-matched population comparison group of cancer-free women (n = 92,454) using a register-based database (Breast Cancer Data Base Sweden 2.0). Research questions were formulated ranging from simple to more composite ones. Depending on the research question, multi-state models were built with structures ranging from simpler ones, like single-event survival analysis and competing risks, up to complex bidirectional and recurrent multi-state structures that take into account the recurring start and stop of medication. We also investigate modelling choices, such as choosing a time-scale for the transition rates and borrowing information across transitions. RESULTS Each structure has its own utility and answers a specific research question. However, the more complex structures (bidirectional, recurrent) enable accounting for the intermittent nature of prescribed medication data. These structures deliver estimates of the probability of being under medication and total time spent under medication over the follow-up period. Sensitivity analyses over different definitions of the medication cycle and different choices of timescale when modelling the transition intensity rates show that the estimates of total probabilities of being in a medication cycle over follow-up derived from the complex structures are quite stable. CONCLUSIONS Each research question requires the definition of an appropriate multi-state structure, with more composite ones requiring such an increase in the complexity of the multi-state structure. When a research question is related with an outcome of interest that repeatedly changes over time, such as the medication status based on prescribed medication, the use of novel multi-state models of adequate complexity coupled with sensible modelling choices can successfully address composite, more realistic research questions.
Collapse
Affiliation(s)
- Nikolaos Skourlis
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden.
| | | | - Therese M-L Andersson
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
| | - Donghao Lu
- Institute of Environmental Medicine, Karolinska Institutet, Stockholm, Sweden
| | - Mats Lambe
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
- Regional Cancer Centre Central Sweden, Uppsala, Sweden
| | - Paul C Lambert
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
- Biostatistics Research Group, Department of Health Sciences, University of Leicester, Leicester, UK
| |
Collapse
|
4
|
Cheung LC, Albert PS, Das S, Cook RJ. Multistate models for the natural history of cancer progression. Br J Cancer 2022; 127:1279-1288. [PMID: 35821296 DOI: 10.1038/s41416-022-01904-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2022] [Revised: 06/21/2022] [Accepted: 06/28/2022] [Indexed: 11/09/2022] Open
Abstract
BACKGROUND Multistate models can be effectively used to characterise the natural history of cancer. Inference from such models has previously been useful for setting screening policies. METHODS We introduce the basic elements of multistate models and the challenges of applying these models to cancer data. Through simulation studies, we examine (1) the impact of assuming time-homogeneous Markov transition intensities when the intensities depend on the time since entry to the current state (i.e., the process is time-inhomogenous semi-Markov) and (2) the effect on precancer risk estimation when observation times depend on an unmodelled intermediate disease state. RESULTS In the settings we examined, we found that misspecifying a time-inhomogenous semi-Markov process as a time-homogeneous Markov process resulted in biased estimates of the mean sojourn times. When screen-detection of the intermediate disease leads to more frequent future screening assessments, there was minimal bias induced compared to when screen-detection of the intermediate disease leads to less frequent screening. CONCLUSIONS Multistate models are useful for estimating parameters governing the process dynamics in cancer such as transition rates, sojourn time distributions, and absolute and relative risks. As with most statistical models, to avoid incorrect inference, care should be given to use the appropriate specifications and assumptions.
Collapse
Affiliation(s)
- Li C Cheung
- Biostatistics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Rockville, MD, USA.
| | - Paul S Albert
- Biostatistics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Rockville, MD, USA
| | - Shrutikona Das
- Biostatistics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Rockville, MD, USA
| | - Richard J Cook
- Department of Statistics and Actuarial Science, University of Waterloo, Waterloo, ON, Canada
| |
Collapse
|
5
|
Skourlis N, Crowther MJ, Andersson TML, Lambert PC. Development of a dynamic interactive web tool to enhance understanding of multi-state model analyses: MSMplus. BMC Med Res Methodol 2021; 21:262. [PMID: 34837946 PMCID: PMC8627614 DOI: 10.1186/s12874-021-01420-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2021] [Accepted: 09/30/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Multi-state models are used in complex disease pathways to describe a process where an individual moves from one state to the next, taking into account competing states during each transition. In a multi-state setting, there are various measures to be estimated that are of great epidemiological importance. However, increased complexity of the multi-state setting and predictions over time for individuals with different covariate patterns may lead to increased difficulty in communicating the estimated measures. The need for easy and meaningful communication of the analysis results motivated the development of a web tool to address these issues. RESULTS MSMplus is a publicly available web tool, developed via the Shiny R package, with the aim of enhancing the understanding of multi-state model analyses results. The results from any multi-state model analysis are uploaded to the application in a pre-specified format. Through a variety of user-tailored interactive graphs, the application contributes to an improvement in communication, reporting and interpretation of multi-state analysis results as well as comparison between different approaches. The predicted measures that can be supported by MSMplus include, among others, the transition probabilities, the transition intensity rates, the length of stay in each state, the probability of ever visiting a state and user defined measures. Representation of differences, ratios and confidence intervals of the aforementioned measures are also supported. MSMplus is a useful tool that enhances communication and understanding of multi-state model analyses results. CONCLUSIONS Further use and development of web tools should be encouraged in the future as a means to communicate scientific research.
Collapse
Affiliation(s)
- Nikolaos Skourlis
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Nobels Väg 12A, Stockholm, Sweden
| | - Michael J. Crowther
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Nobels Väg 12A, Stockholm, Sweden
| | - Therese M-L. Andersson
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Nobels Väg 12A, Stockholm, Sweden
| | - Paul C. Lambert
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Nobels Väg 12A, Stockholm, Sweden
- Biostatistics Research Group, Department of Health Sciences, University of Leicester, University Road, Leicester, UK
| |
Collapse
|
6
|
Kim SW, Schumacher M, Augustin NH. A shared frailty model for multivariate longitudinal data on adverse event of radiation therapy. Biom J 2021; 63:1493-1506. [PMID: 33949712 DOI: 10.1002/bimj.202000237] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2020] [Revised: 02/01/2021] [Accepted: 04/06/2021] [Indexed: 11/07/2022]
Abstract
Oral mucositis is an inflammatory adverse event when treating head and neck cancer patients with radiation therapy (RT). The severity of its occurrence is believed to mainly depend on its site and the distribution of a cumulative radiation dose in the mouth area. The motivating study investigating differences in radiosensitivities (mucositis progression) at distinct sites where the severity of mucositis is assessed regularly at eight distinct sites on an ordinal scale results in multivariate longitudinal data and thus poses certain challenges. To deal with the multivariate longitudinal data in this particular setting, we take a time-to-event approach focusing on the first occurrence of severe mucositis at the distinct sites using the fact that the site-specific cumulative radiation dose thought to be the main driver of oral mucositis develops over time. Thereby, we may address multivariate longitudinal processes in a simpler and more compact fashion. In this article, to find out differences in mucositis progression at eight distinct sites we propose a shared frailty model for multivariate parallel processes within individuals. The shared frailty model directly incorporating 'process indicators' as covariates turns out to adequately explain the differences in the parallel processes (here, mucositis progressions at distinct sites) while taking individual effects into account. The parallel result with the one from the previous analysis based on the same data but conducted with an alternative statistical methodology shows adequacy of the proposed approach.
Collapse
Affiliation(s)
- Sung Won Kim
- Institute of Medical Biometry and Statistics, Medical Center-University of Freiburg, Freiburg, Germany
| | - Martin Schumacher
- Institute of Medical Biometry and Statistics, Medical Center-University of Freiburg, Freiburg, Germany
| | - Nicole H Augustin
- School of Mathematics, University of Edinburgh, James Clark Maxwell Building, The King's Buildings, Edinburgh, UK
| |
Collapse
|
7
|
Hill M, Lambert PC, Crowther MJ. Relaxing the assumption of constant transition rates in a multi-state model in hospital epidemiology. BMC Med Res Methodol 2021; 21:16. [PMID: 33430778 PMCID: PMC7798316 DOI: 10.1186/s12874-020-01192-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2020] [Accepted: 12/11/2020] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Multi-state models are being increasingly used to capture complex disease pathways. The convenient formula of the exponential multi-state model can facilitate a quick and accessible understanding of the data. However, assuming time constant transition rates is not always plausible. On the other hand, obtaining predictions from a fitted model with time-dependent transitions can be challenging. One proposed solution is to utilise a general simulation algorithm to calculate predictions from a fitted multi-state model. METHODS Predictions obtained from an exponential multi-state model were compared to those obtained from two different parametric models and to non-parametric Aalen-Johansen estimates. The first comparative approach fitted a multi-state model with transition-specific distributions, chosen separately based on the Akaike Information Criterion. The second approach was a Royston-Parmar multi-state model with 4 degrees of freedom, which was chosen as a reference model flexible enough to capture complex hazard shapes. All quantities were obtained analytically for the exponential and Aalen-Johansen approaches. The transition rates for the two comparative approaches were also obtained analytically, while all other quantities were obtained from the fitted models via a general simulation algorithm. Metrics investigated were: transition probabilities, attributable mortality (AM), population attributable fraction (PAF) and expected length of stay. This work was performed on previously analysed hospital acquired infection (HAI) data. By definition, a HAI takes three days to develop and therefore selected metrics were also predicted from time 3 (delayed entry). RESULTS Despite clear deviations from the constant transition rates assumption, the empirical estimates of the transition probabilities were approximated reasonably well by the exponential model. However, functions of the transition probabilities, e.g. AM and PAF, were not well approximated and the comparative models offered considerable improvements for these metrics. They also provided consistent predictions with the empirical estimates in the case of delayed entry time, unlike the exponential model. CONCLUSION We conclude that methods and software are readily available for obtaining predictions from multi-state models that do not assume constant transition rates. The multistate package in Stata facilitates a range of predictions with confidence intervals, which can provide a more comprehensive understanding of the data. User-friendly code is provided.
Collapse
Affiliation(s)
- Micki Hill
- Biostatistics Research Group, Department of Health Sciences, University of Leicester, George Davies Centre, University Road, Leicester, LE1 7RH, UK.
| | - Paul C Lambert
- Biostatistics Research Group, Department of Health Sciences, University of Leicester, George Davies Centre, University Road, Leicester, LE1 7RH, UK.,Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Box 281, Stockholm, 17177, Sweden
| | - Michael J Crowther
- Biostatistics Research Group, Department of Health Sciences, University of Leicester, George Davies Centre, University Road, Leicester, LE1 7RH, UK.,Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Box 281, Stockholm, 17177, Sweden
| |
Collapse
|
8
|
Jiang S, Cook RJ, Zeng L. Mitigating bias from intermittent measurement of time-dependent covariates in failure time analysis. Stat Med 2020; 39:1833-1845. [PMID: 32126155 DOI: 10.1002/sim.8517] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2019] [Revised: 01/07/2020] [Accepted: 02/09/2020] [Indexed: 11/05/2022]
Abstract
Cox regression models are routinely fitted to examine the association between time-dependent markers and a failure time when analyzing data from clinical registries. Typically, the marker values are measured periodically at clinic visits with the recorded value carried forward until the next assessment. We examine the asymptotic behavior of estimators from Cox regression models under this observation and data handling scheme when the true relationship is based on a Cox model using the current value of the marker. Specifically, we explore the impact of the marker process dynamics, the clinic visit intensity, and the marginal failure rate on the limiting value of the estimator of the marker effect from the Cox model. We also illustrate how a joint multistate model that accommodates intermittent observation of the time-varying marker can be formulated. Simulation studies demonstrate that the finite sample performance of the naive estimator aligns with the asymptotic results and shows good performance of the estimators from the joint model. We apply both methods to data from a study of bone markers and their effect on the development of skeletal complications in metastatic cancer.
Collapse
Affiliation(s)
- Shu Jiang
- Division of Public Health Sciences, Department of Surgery, Washington University School of Medicine in St. Louis, St. Louis, Missouri
| | - Richard J Cook
- Department of Statistics and Actuarial Science, University of Waterloo, Waterloo, Ontario
| | - Leilei Zeng
- Department of Statistics and Actuarial Science, University of Waterloo, Waterloo, Ontario
| |
Collapse
|
9
|
Zeng L, Cook RJ, Lee J. Multistate analysis from cross-sectional and auxiliary samples. Stat Med 2020; 39:387-408. [PMID: 31820469 DOI: 10.1002/sim.8411] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2019] [Revised: 09/12/2019] [Accepted: 09/30/2019] [Indexed: 11/10/2022]
Abstract
Epidemiological studies routinely involve cross-sectional sampling of a population comprised of individuals progressing through life history processes. We consider features of a cross-sectional sample in terms of the intensity functions of a progressive multistate disease process under stationarity assumptions. The limiting values of estimators for regression coefficients in naive logistic regression models are studied, and simulations confirm the key asymptotic results that are relevant in finite samples. We also consider the need for and the use of data from auxiliary samples, which enable one to fit the full multistate life history process. We conclude with an application to data from a national cross-sectional sample assessing marker effects on psoriatic arthritis among individuals with psoriasis.
Collapse
Affiliation(s)
- Leilei Zeng
- Department of Statistics and Actuarial Science, University of Waterloo, Waterloo, ON, Canada
| | - Richard J Cook
- Department of Statistics and Actuarial Science, University of Waterloo, Waterloo, ON, Canada
| | - Jooyoung Lee
- Department of Statistics and Actuarial Science, University of Waterloo, Waterloo, ON, Canada
| |
Collapse
|
10
|
Cook RJ, Lawless JF. Independence conditions and the analysis of life history studies with intermittent observation. Biostatistics 2019; 22:455-481. [PMID: 31711113 DOI: 10.1093/biostatistics/kxz047] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2018] [Revised: 10/08/2019] [Accepted: 10/09/2019] [Indexed: 11/12/2022] Open
Abstract
Multistate models provide a powerful framework for the analysis of life history processes when the goal is to characterize transition intensities, transition probabilities, state occupancy probabilities, and covariate effects thereon. Data on such processes are often only available at random visit times occurring over a finite period. We formulate a joint multistate model for the life history process, the recurrent visit process, and a random loss to follow-up time at which the visit process terminates. This joint model is helpful when discussing the independence conditions necessary to justify the use of standard likelihoods involving the life history model alone and provides a basis for analyses that accommodate dependence. We consider settings with disease-driven visits and routinely scheduled visits and develop likelihoods that accommodate partial information on the types of visits. Simulation studies suggest that suitably constructed joint models can yield consistent estimates of parameters of interest even under dependent visit processes, providing the models are correctly specified; identifiability and estimability issues are also discussed. An application is given to a cohort of individuals attending a rheumatology clinic where interest lies in progression of joint damage.
Collapse
Affiliation(s)
- Richard J Cook
- Department of Statistics and Actuarial Science, University of Waterloo, Waterloo, ON N2L 3G1, Canada
| | - Jerald F Lawless
- Department of Statistics and Actuarial Science, University of Waterloo, Waterloo, ON N2L 3G1, Canada
| |
Collapse
|
11
|
Nabizadeh R, Yousefian F, Moghadam VK, Hadei M. Characteristics of cohort studies of long-term exposure to PM 2.5: a systematic review. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2019; 26:30755-30771. [PMID: 31494855 DOI: 10.1007/s11356-019-06382-6] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/07/2018] [Accepted: 08/29/2019] [Indexed: 06/10/2023]
Abstract
This study systematically reviewed all the cohort studies investigating the relationship between long-term exposure to PM2.5 and any health outcome until February 2018. We searched ISI Web of Knowledge, Pubmed, and Scopus databases for peer-reviewed journal research articles published in English. We only extracted the results of the single-pollutant main analysis of each study, excluding the effect modifications and sensitivity analyses. Out of the initial 9523 articles, 203 articles were ultimately included for analysis. Based on the different characteristics of studies such as study design, outcome, exposure assessment method, and statistical model, we calculated the number and relative frequency of analyses with statistically significant and insignificant results. Most of the studies were prospective (84.8%), assessed both genders (66.5%), and focused on a specific age range (86.8%). Most of the articles (78.1%) had used modeling techniques for exposure assessment of cohorts' participants. Among the total of 317 health outcomes, the most investigated outcomes include mortality due to cardiovascular disease (6.19%), all causes (5.48%), lung cancer (4.00%), ischemic heart disease (3.50%), and non-accidental causes (3.50%). The percentage of analyses with statistically significant results were higher among studies that used prospective design, mortality as the outcome, fixed stations as exposure assessment method, hazard ratio as risk measure, and no covariate adjustment. We can somehow conclude that the choice of right characteristics for cohort studies can make a difference in their results.
Collapse
Affiliation(s)
- Ramin Nabizadeh
- Department of Environmental Health Engineering, School of Public Health, Tehran University of Medical Sciences, Tehran, Iran
- Center for Air Pollution Research (CAPR), Institute for Environmental Research (IER), Tehran University of Medical Sciences, Tehran, Iran
| | - Fatemeh Yousefian
- Department of Environmental Health Engineering, School of Public Health, Tehran University of Medical Sciences, Tehran, Iran
| | - Vahid Kazemi Moghadam
- Department of Environmental Health Engineering, Neyshabur University of Medical Sciences, Neyshabur, Iran
| | - Mostafa Hadei
- Department of Environmental Health Engineering, School of Public Health, Tehran University of Medical Sciences, Tehran, Iran.
- Students' Scientific Research Center (SSRC), Tehran University of Medical Sciences, Tehran, Iran.
| |
Collapse
|
12
|
Jiang S, Cook RJ. Score tests based on a finite mixture model of Markov processes under intermittent observation. Stat Med 2019; 38:3013-3025. [PMID: 30972787 DOI: 10.1002/sim.8155] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2018] [Revised: 01/07/2019] [Accepted: 03/08/2019] [Indexed: 11/09/2022]
Abstract
A mixture model is described, which accommodates different Markov processes governing disease progression in a finite set of latent classes. We give special attention to the setting in which individuals are examined intermittently and transition times are consequently interval censored. A score test is developed to identify genetic markers associated with class membership. Simulation studies are conducted to validate the algorithm, assess the finite sample properties of the estimators, and assess the frequency properties of the score tests. A permutation test is recommended for settings when there is concern that the asymptotic approximation to the score test is poor. An application involving progression in joint damage in psoriatic arthritis (PsA) provides illustration and identifies human leukocyte antigen markers associated with unilateral and bilateral sacroiliac damage in individuals with PsA.
Collapse
Affiliation(s)
- Shu Jiang
- Department of Statistics and Actuarial Science, University of Waterloo, Waterloo, Canada
| | - Richard J Cook
- Department of Statistics and Actuarial Science, University of Waterloo, Waterloo, Canada
| |
Collapse
|
13
|
Aralis H, Brookmeyer R. A stochastic estimation procedure for intermittently-observed semi-Markov multistate models with back transitions. Stat Methods Med Res 2017; 28:770-787. [PMID: 29117850 DOI: 10.1177/0962280217736342] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Multistate models provide an important method for analyzing a wide range of life history processes including disease progression and patient recovery following medical intervention. Panel data consisting of the states occupied by an individual at a series of discrete time points are often used to estimate transition intensities of the underlying continuous-time process. When transition intensities depend on the time elapsed in the current state and back transitions between states are possible, this intermittent observation process presents difficulties in estimation due to intractability of the likelihood function. In this manuscript, we present an iterative stochastic expectation-maximization algorithm that relies on a simulation-based approximation to the likelihood function and implement this algorithm using rejection sampling. In a simulation study, we demonstrate the feasibility and performance of the proposed procedure. We then demonstrate application of the algorithm to a study of dementia, the Nun Study, consisting of intermittently-observed elderly subjects in one of four possible states corresponding to intact cognition, impaired cognition, dementia, and death. We show that the proposed stochastic expectation-maximization algorithm substantially reduces bias in model parameter estimates compared to an alternative approach used in the literature, minimal path estimation. We conclude that in estimating intermittently observed semi-Markov models, the proposed approach is a computationally feasible and accurate estimation procedure that leads to substantial improvements in back transition estimates.
Collapse
Affiliation(s)
- Hilary Aralis
- UCLA Department of Biostatistics, Fielding School of Public Health, Los Angeles, CA, USA
| | - Ron Brookmeyer
- UCLA Department of Biostatistics, Fielding School of Public Health, Los Angeles, CA, USA
| |
Collapse
|
14
|
Nazeri Rad N, Lawless JF. Estimation of state occupancy probabilities in multistate models with dependent intermittent observation, with application to HIV viral rebounds. Stat Med 2016; 36:1256-1271. [PMID: 27896823 DOI: 10.1002/sim.7189] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2016] [Revised: 11/01/2016] [Accepted: 11/02/2016] [Indexed: 11/06/2022]
Abstract
In follow-up studies on chronic disease cohorts, individuals are often observed at irregular visit times that may be related to their previous disease history and other factors. This can produce bias in standard methods of estimation. Working in the context of multistate models, we consider a method of nonparametric estimation for state occupancy probabilities that adjusts for dependent follow-up through the use of inverse-intensity-of-visit weighted estimating functions and smoothing. The methodology is applied to the estimation of viral rebound probabilities in the Canadian Observational Cohort on HIV-positive persons. Copyright © 2016 John Wiley & Sons, Ltd.
Collapse
Affiliation(s)
- N Nazeri Rad
- Prosserman Centre for Health Research, Lunenfeld-Tanenbaum Research Institute, 60 Murray Street, Toronto, M5T 3L9, ON, Canada
| | - J F Lawless
- Department of Statistics and Actuarial Science, University of Waterloo, 200 University Avenue West, Waterloo, N2L 3G1, ON, Canada
| |
Collapse
|
15
|
Abstract
flexsurv is an R package for fully-parametric modeling of survival data. Any parametric time-to-event distribution may be fitted if the user supplies a probability density or hazard function, and ideally also their cumulative versions. Standard survival distributions are built in, including the three and four-parameter generalized gamma and F distributions. Any parameter of any distribution can be modeled as a linear or log-linear function of covariates. The package also includes the spline model of Royston and Parmar (2002), in which both baseline survival and covariate effects can be arbitrarily flexible parametric functions of time. The main model-fitting function, flexsurvreg, uses the familiar syntax of survreg from the standard survival package (Therneau 2016). Censoring or left-truncation are specified in 'Surv' objects. The models are fitted by maximizing the full log-likelihood, and estimates and confidence intervals for any function of the model parameters can be printed or plotted. flexsurv also provides functions for fitting and predicting from fully-parametric multi-state models, and connects with the mstate package (de Wreede, Fiocco, and Putter 2011). This article explains the methods and design principles of the package, giving several worked examples of its use.
Collapse
Affiliation(s)
- Christopher H Jackson
- Christopher Jackson, MRC Biostatistics Unit, Cambridge Institute of Public Health, Robinson Way, Cambridge, CB2 0SR, United Kingdom
| |
Collapse
|
16
|
Shen H, Cook RJ. Analysis of interval-censored recurrent event processes subject to resolution. Biom J 2015; 57:725-42. [PMID: 26148993 DOI: 10.1002/bimj.201400162] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2014] [Revised: 11/30/2014] [Accepted: 01/31/2015] [Indexed: 11/06/2022]
Abstract
Interval-censored recurrent event data arise when the event of interest is not readily observed but the cumulative event count can be recorded at periodic assessment times. In some settings, chronic disease processes may resolve, and individuals will cease to be at risk of events at the time of disease resolution. We develop an expectation-maximization algorithm for fitting a dynamic mover-stayer model to interval-censored recurrent event data under a Markov model with a piecewise-constant baseline rate function given a latent process. The model is motivated by settings in which the event times and the resolution time of the disease process are unobserved. The likelihood and algorithm are shown to yield estimators with small empirical bias in simulation studies. Data are analyzed on the cumulative number of damaged joints in patients with psoriatic arthritis where individuals experience disease remission.
Collapse
Affiliation(s)
- Hua Shen
- Department of Statistics and Actuarial Science, University of Waterloo, Waterloo, Ontario, Canada
| | - Richard J Cook
- Department of Statistics and Actuarial Science, University of Waterloo, Waterloo, Ontario, Canada
| |
Collapse
|
17
|
Lawless JF, Nazeri Rad N. Estimation and assessment of markov multistate models with intermittent observations on individuals. LIFETIME DATA ANALYSIS 2015; 21:160-179. [PMID: 25332076 DOI: 10.1007/s10985-014-9310-z] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/27/2014] [Accepted: 10/06/2014] [Indexed: 06/04/2023]
Abstract
Multistate models provide important methods of analysis for many life history processes, and this is an area where John Klein made numerous contributions. When individuals in a study group are observed continuously so that all transitions between states, and their times, are known, estimation and model checking is fairly straightforward. However, individuals in many studies are observed intermittently, and only the states occupied at the observation times are known. We review methods of estimation and assessment for Markov models in this situation. Numerical studies that show the effects of inter-observation times are provided, and new methods for assessing fit are given. An illustration involving viral load dynamics for HIV-positive persons is presented.
Collapse
Affiliation(s)
- J F Lawless
- Department of Statistics and Actuarial Science, University of Waterloo, 200 University Avenue West, Waterloo, ON, N2L 3G1, Canada,
| | | |
Collapse
|