1
|
Fung A, Loutet M, Roth DE, Wong E, Gill PJ, Morris SK, Beyene J. Clinical prediction models in children that use repeated measurements with time-varying covariates: a scoping review. Acad Pediatr 2024; 24:728-740. [PMID: 38561061 DOI: 10.1016/j.acap.2024.03.016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/18/2023] [Revised: 02/29/2024] [Accepted: 03/27/2024] [Indexed: 04/04/2024]
Abstract
BACKGROUND Emerging evidence suggests that clinical prediction models that use repeated (time-varying) measurements within each patient may have higher predictive accuracy than models that use patient information from a single measurement. OBJECTIVE To determine the breadth of the published literature reporting the development of clinical prediction models in children that use time-varying predictors. DATA SOURCES MEDLINE, EMBASE and Cochrane databases. ELIGIBILITY CRITERIA We included studies reporting the development of a multivariable clinical prediction model in children, with or without validation, to predict a repeatedly measured binary or time-to-event outcome and utilizing at least one repeatedly measured predictor. SYNTHESIS METHODS We categorized included studies by the method used to model time-varying predictors. RESULTS Of 99 clinical prediction model studies that had a repeated measurements data structure, only 27 (27%) used methods that incorporated the repeated measurements as time-varying predictors in a single model. Among these 27 time-varying prediction model studies, we grouped model types into nine categories: time-dependent Cox regression, generalized estimating equations, random effects model, landmark model, joint model, neural network, K-nearest neighbor, support vector machine and tree-based algorithms. Where there was comparison of time-varying models to single measurement models, using time-varying predictors improved predictive accuracy. CONCLUSIONS Various methods have been used to develop time-varying prediction models in children, but there is a paucity of pediatric time-varying models in the literature. Incorporating time-varying covariates in pediatric prediction models may improve predictive accuracy. Future research in pediatric prediction model development should further investigate whether incorporation of time-varying covariates improves predictive accuracy.
Collapse
Affiliation(s)
- Alastair Fung
- Division of Paediatric Medicine (A Fung, DE Roth, and PJ Gill), Hospital for Sick Children, Toronto, Ontario, Canada; Dalla Lana School of Public Health (A Fung, M Loutet, DE Roth, PJ Gill, SK Morris, and J Beyene), University of Toronto, Toronto, Ontario, Canada; Centre for Global Child Health (A Fung, M Loutet, DE Roth, and SK Morris), Hospital for Sick Children, Toronto, Ontario, Canada.
| | - Miranda Loutet
- Dalla Lana School of Public Health (A Fung, M Loutet, DE Roth, PJ Gill, SK Morris, and J Beyene), University of Toronto, Toronto, Ontario, Canada; Centre for Global Child Health (A Fung, M Loutet, DE Roth, and SK Morris), Hospital for Sick Children, Toronto, Ontario, Canada
| | - Daniel E Roth
- Division of Paediatric Medicine (A Fung, DE Roth, and PJ Gill), Hospital for Sick Children, Toronto, Ontario, Canada; Dalla Lana School of Public Health (A Fung, M Loutet, DE Roth, PJ Gill, SK Morris, and J Beyene), University of Toronto, Toronto, Ontario, Canada; Centre for Global Child Health (A Fung, M Loutet, DE Roth, and SK Morris), Hospital for Sick Children, Toronto, Ontario, Canada; Temerty Faculty of Medicine (DE Roth, E Wong, PJ Gill, and SK Morris), University of Toronto, Toronto, Ontario, Canada; Child Health Evaluative Sciences (DE Roth, PJ Gill, and SK Morris), Hospital for Sick Children Research Institute, Toronto, Ontario, Canada
| | - Elliott Wong
- Temerty Faculty of Medicine (DE Roth, E Wong, PJ Gill, and SK Morris), University of Toronto, Toronto, Ontario, Canada
| | - Peter J Gill
- Division of Paediatric Medicine (A Fung, DE Roth, and PJ Gill), Hospital for Sick Children, Toronto, Ontario, Canada; Dalla Lana School of Public Health (A Fung, M Loutet, DE Roth, PJ Gill, SK Morris, and J Beyene), University of Toronto, Toronto, Ontario, Canada; Temerty Faculty of Medicine (DE Roth, E Wong, PJ Gill, and SK Morris), University of Toronto, Toronto, Ontario, Canada; Child Health Evaluative Sciences (DE Roth, PJ Gill, and SK Morris), Hospital for Sick Children Research Institute, Toronto, Ontario, Canada
| | - Shaun K Morris
- Dalla Lana School of Public Health (A Fung, M Loutet, DE Roth, PJ Gill, SK Morris, and J Beyene), University of Toronto, Toronto, Ontario, Canada; Centre for Global Child Health (A Fung, M Loutet, DE Roth, and SK Morris), Hospital for Sick Children, Toronto, Ontario, Canada; Temerty Faculty of Medicine (DE Roth, E Wong, PJ Gill, and SK Morris), University of Toronto, Toronto, Ontario, Canada; Child Health Evaluative Sciences (DE Roth, PJ Gill, and SK Morris), Hospital for Sick Children Research Institute, Toronto, Ontario, Canada; Division of Infectious Diseases (SK Morris), Hospital for Sick Children, Toronto, Ontario, Canada
| | - Joseph Beyene
- Dalla Lana School of Public Health (A Fung, M Loutet, DE Roth, PJ Gill, SK Morris, and J Beyene), University of Toronto, Toronto, Ontario, Canada; Department of Health Research Methods, Evidence and Impact (J Beyene), Faculty of Health Sciences, McMaster University, Hamilton, Ontario, Canada
| |
Collapse
|
2
|
Amiri S, Li YC, Buchwald D, Pandey G. Machine learning-driven identification of air toxic combinations associated with asthma symptoms among elementary school children in Spokane, Washington, USA. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 921:171102. [PMID: 38387571 PMCID: PMC10939716 DOI: 10.1016/j.scitotenv.2024.171102] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/04/2023] [Revised: 02/16/2024] [Accepted: 02/18/2024] [Indexed: 02/24/2024]
Abstract
Air toxics are atmospheric pollutants with hazardous effects on health and the environment. Although methodological constraints have limited the number of air toxics assessed for associations with health and disease, advances in machine learning (ML) enable the assessment of a much larger set of environmental exposures. We used ML methods to conduct a retrospective study to identify combinations of 109 air toxics associated with asthma symptoms among 269 elementary school students in Spokane, Washington. Data on the frequency of asthma symptoms for these children were obtained from Spokane Public Schools. Their exposure to air toxics was estimated by using the Environmental Protection Agency's Air Toxics Screening Assessment and National Air Toxics Assessment. We defined three exposure periods: the most recent year (2019), the last three years (2017-2019), and the last five years (2014-2019). We analyzed the data using the ML-based Data-driven ExposurE Profile (DEEP) extraction method. DEEP identified 25 air toxic combinations associated with asthma symptoms in at least one exposure period. Three combinations (1,1,1-trichloroethane, 2-nitropropane, and 2,4,6-trichlorophenol) were significantly associated with asthma symptoms in all three exposure periods. Four air toxics (1,1,1-trichloroethane, 1,1,2,2-tetrachloroethane, BIS (2-ethylhexyl) phthalate (DEHP), and 2,4-dinitrophenol) were associated only in combination with other toxics, and would not have been identified by traditional statistical methods. The application of DEEP also identified a vulnerable subpopulation of children who were exposed to 13 of the 25 significant combinations in at least one exposure period. On average, these children experienced the largest number of asthma symptoms in our sample. By providing evidence on air toxic combinations associated with childhood asthma, our findings may contribute to the regulation of these toxics to improve children's respiratory health.
Collapse
Affiliation(s)
- Solmaz Amiri
- Institute for Research and Education to Advance Community Health (IREACH), Elson S. Floyd College of Medicine, Washington State University, Seattle, WA, USA.
| | - Yan-Chak Li
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Dedra Buchwald
- Institute for Research and Education to Advance Community Health (IREACH), Elson S. Floyd College of Medicine, Washington State University, Seattle, WA, USA
| | - Gaurav Pandey
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| |
Collapse
|
3
|
Rahman MM, Liu FF, Eckel SP, Sankaranarayanan I, Shafiei-Jahani P, Howard E, Baronikian L, Sattler F, Lurmann FW, Allayee H, Akbari O, McConnell R. Near-roadway air pollution, immune cells and adipokines among obese young adults. Environ Health 2022; 21:36. [PMID: 35305663 PMCID: PMC8933931 DOI: 10.1186/s12940-022-00842-2] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2021] [Accepted: 02/22/2022] [Indexed: 06/14/2023]
Abstract
BACKGROUND Air pollution has been associated with metabolic disease and obesity. Adipokines are potential mediators of these effects, but studies of air pollution-adipokine relationships are inconclusive. Macrophage and T cells in adipose tissue (AT) and blood modulate inflammation; however, the role of immune cells in air pollution-induced dysregulation of adipokines has not been studied. We examined the association between air pollution exposure and circulating and AT adipokine concentrations, and whether these relationships were modified by macrophage and T cell numbers in the blood and AT. METHODS Fasting blood and abdominal subcutaneous AT biopsies were collected from 30 overweight/obese 18-26 year-old volunteers. Flow cytometry was used to quantify T effector (Teff, inflammatory) and regulatory (Treg, anti-inflammatory) lymphocytes and M1 [inflammatory] and M2 [anti-inflammatory]) macrophage cell number. Serum and AT leptin and adiponectin were measured using enzyme-linked immunosorbent assay (ELISA). Exposure to near-roadway air pollution (NRAP) from freeway and non-freeway vehicular sources and to regional particulate matter, nitrogen dioxide and ozone were estimated for the year prior to biopsy, based on participants' residential addresses. Linear regression models were used to examine the association between air pollution exposures and adipokines and to evaluate effect modification by immune cell counts. RESULTS An interquartile increase in non-freeway NRAP exposure during 1 year prior to biopsy was associated with higher leptin levels in both serum [31.7% (95% CI: 10.4, 52.9%)] and AT [19.4% (2.2, 36.6%)]. Non-freeway NRAP exposure effect estimates were greater among participants with greater than median Teff/Treg ratio and M1/M2 ratio in blood, and with greater M1 counts in AT. No adipokine associations with regional air pollutants were found. DISCUSSION Our results suggest that NRAP may increase serum leptin levels in obese young adults, and this association may be promoted in a pro-inflammatory immune cell environment in blood and AT.
Collapse
Affiliation(s)
- Md Mostafijur Rahman
- Department of Preventive Medicine, Keck School of Medicine, University of Southern California, 2001 N. Soto Street Building: SSB, Los Angeles, CA, 90032, USA
| | - Fei Fei Liu
- Department of Preventive Medicine, Keck School of Medicine, University of Southern California, 2001 N. Soto Street Building: SSB, Los Angeles, CA, 90032, USA
| | - Sandrah P Eckel
- Department of Preventive Medicine, Keck School of Medicine, University of Southern California, 2001 N. Soto Street Building: SSB, Los Angeles, CA, 90032, USA
| | - Ishwarya Sankaranarayanan
- Department of Molecular and Cellular Immunology, University of Southern California, Los Angeles, California, USA
| | - Pedram Shafiei-Jahani
- Department of Molecular and Cellular Immunology, University of Southern California, Los Angeles, California, USA
| | - Emily Howard
- Department of Molecular and Cellular Immunology, University of Southern California, Los Angeles, California, USA
| | - Lilit Baronikian
- Department of Medicine, Keck School of Medicine, University of Southern California, Keck School of Medicine, Los Angeles, CA, USA
| | - Fred Sattler
- Department of Medicine, Keck School of Medicine, University of Southern California, Keck School of Medicine, Los Angeles, CA, USA
| | | | - Hooman Allayee
- Department of Preventive Medicine, Keck School of Medicine, University of Southern California, 2001 N. Soto Street Building: SSB, Los Angeles, CA, 90032, USA
| | - Omid Akbari
- Department of Molecular and Cellular Immunology, University of Southern California, Los Angeles, California, USA
| | - Rob McConnell
- Department of Preventive Medicine, Keck School of Medicine, University of Southern California, 2001 N. Soto Street Building: SSB, Los Angeles, CA, 90032, USA.
| |
Collapse
|
4
|
Li YC, Hsu HHL, Chun Y, Chiu PH, Arditi Z, Claudio L, Pandey G, Bunyavanich S. Machine learning-driven identification of early-life air toxic combinations associated with childhood asthma outcomes. J Clin Invest 2021; 131:152088. [PMID: 34609967 DOI: 10.1172/jci152088] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2021] [Accepted: 09/23/2021] [Indexed: 01/19/2023] Open
Abstract
Air pollution is a well-known contributor to asthma. Air toxics are hazardous air pollutants that cause or may cause serious health effects. Although individual air toxics have been associated with asthma, only a limited number of studies have specifically examined combinations of air toxics associated with the disease. We geocoded air toxic levels from the US National Air Toxics Assessment (NATA) to residential locations for participants of our AiRway in Asthma (ARIA) study. We then applied Data-driven ExposurE Profile extraction (DEEP), a machine learning-based method, to discover combinations of early-life air toxics associated with current use of daily asthma controller medication, lifetime emergency department visit for asthma, and lifetime overnight hospitalization for asthma. We discovered 20 multi-air toxic combinations and 18 single air toxics associated with at least 1 outcome. The multi-air toxic combinations included those containing acrylic acid, ethylidene dichloride, and hydroquinone, and they were significantly associated with asthma outcomes. Several air toxic members of the combinations would not have been identified by single air toxic analyses, supporting the use of machine learning-based methods designed to detect combinatorial effects. Our findings provide knowledge about air toxic combinations associated with childhood asthma.
Collapse
Affiliation(s)
| | - Hsiao-Hsien Leon Hsu
- Department of Environmental Medicine and Public Health.,Institute for Exposomic Research, and
| | | | | | - Zoe Arditi
- Department of Genetics and Genomic Sciences.,Division of Allergy and Immunology, Department of Pediatrics, Icahn School of Medicine at Mount Sinai, New York, New York, USA
| | - Luz Claudio
- Department of Environmental Medicine and Public Health.,Institute for Exposomic Research, and
| | - Gaurav Pandey
- Department of Genetics and Genomic Sciences.,Institute for Exposomic Research, and
| | - Supinda Bunyavanich
- Department of Genetics and Genomic Sciences.,Division of Allergy and Immunology, Department of Pediatrics, Icahn School of Medicine at Mount Sinai, New York, New York, USA
| |
Collapse
|
5
|
Zhang AS, Veeramani A, Quinn MS, Alsoof D, Kuris EO, Daniels AH. Machine Learning Prediction of Length of Stay in Adult Spinal Deformity Patients Undergoing Posterior Spine Fusion Surgery. J Clin Med 2021; 10:jcm10184074. [PMID: 34575182 PMCID: PMC8471961 DOI: 10.3390/jcm10184074] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2021] [Revised: 09/06/2021] [Accepted: 09/07/2021] [Indexed: 11/16/2022] Open
Abstract
(1) Background: Length of stay (LOS) is a commonly reported metric used to assess surgical success, patient outcomes, and economic impact. The focus of this study is to use a variety of machine learning algorithms to reliably predict whether a patient undergoing posterior spinal fusion surgery treatment for Adult Spine Deformity (ASD) will experience a prolonged LOS. (2) Methods: Patients undergoing treatment for ASD with posterior spinal fusion surgery were selected from the American College of Surgeon's NSQIP dataset. Prolonged LOS was defined as a LOS greater than or equal to 9 days. Data was analyzed with the Logistic Regression, Decision Tree, Random Forest, XGBoost, and Gradient Boosting functions in Python with the Sci-Kit learn package. Prediction accuracy and area under the curve (AUC) were calculated. (3) Results: 1281 posterior patients were analyzed. The five algorithms had prediction accuracies between 68% and 83% for posterior cases (AUC: 0.566-0.821). Multivariable regression indicated that increased Work Relative Value Units (RVU), elevated American Society of Anesthesiologists (ASA) class, and longer operating times were linked to longer LOS. (4) Conclusions: Machine learning algorithms can predict if patients will experience an increased LOS following ASD surgery. Therefore, medical resources can be more appropriately allocated towards patients who are at risk of prolonged LOS.
Collapse
Affiliation(s)
- Andrew S Zhang
- Department of Orthopaedic Surgery, Warren Alpert Medical School of Brown University, Rhode Island Hospital, Providence, RI 02912, USA; (A.S.Z.); (M.S.Q.); (D.A.); (E.O.K.)
| | - Ashwin Veeramani
- Division of Applied Mathematics, Brown University, Providence, RI 02912, USA;
| | - Matthew S. Quinn
- Department of Orthopaedic Surgery, Warren Alpert Medical School of Brown University, Rhode Island Hospital, Providence, RI 02912, USA; (A.S.Z.); (M.S.Q.); (D.A.); (E.O.K.)
| | - Daniel Alsoof
- Department of Orthopaedic Surgery, Warren Alpert Medical School of Brown University, Rhode Island Hospital, Providence, RI 02912, USA; (A.S.Z.); (M.S.Q.); (D.A.); (E.O.K.)
| | - Eren O. Kuris
- Department of Orthopaedic Surgery, Warren Alpert Medical School of Brown University, Rhode Island Hospital, Providence, RI 02912, USA; (A.S.Z.); (M.S.Q.); (D.A.); (E.O.K.)
| | - Alan H. Daniels
- Department of Orthopaedic Surgery, Warren Alpert Medical School of Brown University, Rhode Island Hospital, Providence, RI 02912, USA; (A.S.Z.); (M.S.Q.); (D.A.); (E.O.K.)
- Correspondence:
| |
Collapse
|
6
|
Arnold KF, Davies V, de Kamps M, Tennant PWG, Mbotwa J, Gilthorpe MS. Reflection on modern methods: generalized linear models for prognosis and intervention-theory, practice and implications for machine learning. Int J Epidemiol 2021; 49:2074-2082. [PMID: 32380551 PMCID: PMC7825942 DOI: 10.1093/ije/dyaa049] [Citation(s) in RCA: 27] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2020] [Accepted: 03/09/2020] [Indexed: 01/28/2023] Open
Abstract
Prediction and causal explanation are fundamentally distinct tasks of data analysis. In health applications, this difference can be understood in terms of the difference between prognosis (prediction) and prevention/treatment (causal explanation). Nevertheless, these two concepts are often conflated in practice. We use the framework of generalized linear models (GLMs) to illustrate that predictive and causal queries require distinct processes for their application and subsequent interpretation of results. In particular, we identify five primary ways in which GLMs for prediction differ from GLMs for causal inference: (i) the covariates that should be considered for inclusion in (and possibly exclusion from) the model; (ii) how a suitable set of covariates to include in the model is determined; (iii) which covariates are ultimately selected and what functional form (i.e. parameterization) they take; (iv) how the model is evaluated; and (v) how the model is interpreted. We outline some of the potential consequences of failing to acknowledge and respect these differences, and additionally consider the implications for machine learning (ML) methods. We then conclude with three recommendations that we hope will help ensure that both prediction and causal modelling are used appropriately and to greatest effect in health research.
Collapse
Affiliation(s)
- Kellyn F Arnold
- Leeds Institute for Data Analytics, University of Leeds, Leeds, UK.,School of Medicine, University of Leeds, Leeds, UK
| | - Vinny Davies
- School of Computing Science, University of Glasgow, Glasgow, UK
| | - Marc de Kamps
- Leeds Institute for Data Analytics, University of Leeds, Leeds, UK.,School of Computing, University of Leeds, Leeds, UK
| | - Peter W G Tennant
- Leeds Institute for Data Analytics, University of Leeds, Leeds, UK.,School of Medicine, University of Leeds, Leeds, UK.,The Alan Turing Institute, London, UK
| | - John Mbotwa
- Leeds Institute for Data Analytics, University of Leeds, Leeds, UK.,School of Medicine, University of Leeds, Leeds, UK
| | - Mark S Gilthorpe
- Leeds Institute for Data Analytics, University of Leeds, Leeds, UK.,School of Medicine, University of Leeds, Leeds, UK.,The Alan Turing Institute, London, UK
| |
Collapse
|
7
|
Oskar S, Stingone JA. Machine Learning Within Studies of Early-Life Environmental Exposures and Child Health: Review of the Current Literature and Discussion of Next Steps. Curr Environ Health Rep 2021; 7:170-184. [PMID: 32578067 DOI: 10.1007/s40572-020-00282-5] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
PURPOSE OF REVIEW The goal of this article is to review the use of machine learning (ML) within studies of environmental exposures and children's health, identify common themes across studies, and provide recommendations to advance their use in research and practice. RECENT FINDINGS We identified 42 articles reporting upon the use of ML within studies of environmental exposures and children's health between 2017 and 2019. The common themes among the articles were analysis of mixture data, exposure prediction, disease prediction and forecasting, analysis of complex data, and causal inference. With the increasing complexity of environmental health data, we anticipate greater use of ML to address the challenges that cannot be handled by traditional analytics. In order for these methods to beneficially impact public health, the ML techniques we use need to be appropriate for our study questions, rigorously evaluated and reported in a way that can be critically assessed by the scientific community.
Collapse
Affiliation(s)
- Sabine Oskar
- Department of Epidemiology, Columbia University Mailman School of Public Health, 722 West 168th St, Room 1608, New York, NY, 10032, USA
| | - Jeanette A Stingone
- Department of Epidemiology, Columbia University Mailman School of Public Health, 722 West 168th St, Room 1608, New York, NY, 10032, USA.
| |
Collapse
|