1
|
Wongyikul P, Tantraworasin A, Suwannasom P, Srisuwan T, Wannasopha Y, Phinyo P. Prediction model for recommending coronary artery calcium score screening (CAC-prob) in cardiology outpatient units: A development study. PLoS One 2024; 19:e0308890. [PMID: 39348344 PMCID: PMC11441643 DOI: 10.1371/journal.pone.0308890] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2024] [Accepted: 07/31/2024] [Indexed: 10/02/2024] Open
Abstract
Despite the well-established significance of the CAC score as a cardiovascular risk marker, the timing of using CAC score in routine clinical practice remains unclear. We aim to develop a prediction model for patients visiting outpatient cardiology units, which can recommend whether CAC score screening is necessary. A prediction model using retrospective cross-sectional design was conducted. Patients who underwent CAC score screening were included. Eight candidate predictors were preselected, including age, gender, DM or primary hypertension, angina chest pain, LDL-C (≥130 mg/dl), presence of low HDL-C, triglyceride (≥150 mg/dl), and eGFR. The outcome of interest was the level of CAC score (CAC score 0, CAC score 1-99, CAC score ≥100). The model was developed using ordinal logistic regression, and model performance was evaluated in terms of discriminative ability and calibration. A total of 360 patients were recruited for analysis, comprising 136 with CAC score 0, 133 with CAC score 1-99, and 111 with CAC score ≥100. The final predictors identified were age, male gender, presence of hypertension or DM, and low HDL-C. The model demonstrated excellent discriminative ability (Ordinal C-statistics of 0.81) with visually good agreement on calibration plots. The implementation of this model (CAC-prob) has the potential to enhance precision in recommending CAC screening. However, external validation is necessary to assess its robustness in new patient cohorts.
Collapse
Affiliation(s)
- Pakpoom Wongyikul
- Center for Clinical Epidemiology and Clinical Statistics, Faculty of Medicine, Chiang Mai University, Chiang Mai, Thailand
| | - Apichat Tantraworasin
- General Thoracic Unit, Department of Surgery, Faculty of Medicine, Chiang Mai University Hospital, Chiang Mai, Thailand
| | - Pannipa Suwannasom
- Division of Cardiology, Department of Internal Medicine, Faculty of Medicine, Chiang Mai University, Chiang Mai, Thailand
| | - Tanop Srisuwan
- Department of Radiology, Faculty of Medicine, Chiang Mai University, Chiang Mai, Thailand
| | - Yutthaphan Wannasopha
- Department of Radiology, Faculty of Medicine, Chiang Mai University, Chiang Mai, Thailand
| | - Phichayut Phinyo
- Center for Clinical Epidemiology and Clinical Statistics, Faculty of Medicine, Chiang Mai University, Chiang Mai, Thailand
- Department of Family Medicine, Faculty of Medicine, Chiang Mai University, Chiang Mai, Thailand
- Center of Multidisciplinary Technology for Advanced Medicine, Faculty of Medicine, Chiang Mai University, Chiang Mai, Thailand
| |
Collapse
|
2
|
Baker WL, Sharma M, Cohen A, Ouwens M, Christoph MJ, Koch B, Moore TE, Frady G, Coleman CI. Using 30-day modified rankin scale score to predict 90-day score in patients with intracranial hemorrhage: Derivation and validation of prediction model. PLoS One 2024; 19:e0303757. [PMID: 38771834 PMCID: PMC11108121 DOI: 10.1371/journal.pone.0303757] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Accepted: 04/30/2024] [Indexed: 05/23/2024] Open
Abstract
Whether 30-day modified Rankin Scale (mRS) scores can predict 90-day scores is unclear. This study derived and validated a model to predict ordinal 90-day mRS score in an intracerebral hemorrhage (ICH) population using 30-day mRS values and routinely available baseline variables. Adults enrolled in the Antihypertensive Treatment of Acute Cerebral Hemorrhage-2 (ATACH-2) trial between May 2011 and September 2015 with acute ICH, who were alive at 30 days and had mRS scores reported at both 30 and 90 days were included in this post-hoc analysis. A proportional odds regression model for predicting ordinal 90-day mRS scores was developed and internally validated using bootstrapping. Variables in the model included: mRS score at 30 days, age (years), hematoma volume (cm3), hematoma location (deep [basal ganglia, thalamus], lobar, or infratentorial), presence of intraventricular hemorrhage (IVH), baseline Glasgow Coma Scale (GCS) score, and National Institutes of Health Stroke Scale (NIHSS) score at randomization. We assessed model fit, calibration, discrimination, and agreement (ordinal, dichotomized functional independence), and EuroQol-5D ([EQ-5D] utility weighted) between predicted and observed 90-day mRS. A total of 898/1000 participants were included. Following bootstrap internal validation, our model (calibration slope = 0.967) had an optimism-corrected c-index of 0.884 (95% CI = 0.873-0.896) and R2 = 0.712 for 90-day mRS score. The weighted ĸ for agreement between observed and predicted ordinal 90-day mRS score was 0.811 (95% CI = 0.787-0.834). Agreement between observed and predicted functional independence (mRS score of 0-2) at 90 days was 74.3% (95% CI = 69.9-78.7%). The mean ± SD absolute difference between predicted and observed EQ-5D-weighted mRS score was negligible (0.005 ± 0.145). This tool allows practitioners and researchers to utilize clinically available information along with the mRS score 30 days after ICH to reliably predict the mRS score at 90 days.
Collapse
Affiliation(s)
- William L. Baker
- University of Connecticut School of Pharmacy, Storrs, CT, United States of America
- Evidence-Based Practice Center, Hartford Hospital, Hartford, CT, United States of America
| | - Mukul Sharma
- Division of Neurology, Population Health Research Institute, McMaster University, Hamilton, Ontario, Canada
| | - Alexander Cohen
- Guy’s and St. Thomas’ Hospitals, King’s College London, London, United Kingdom
| | - Mario Ouwens
- Medical and Payer Evidence, BioPharmaceuticals Medical, AstraZeneca, Cambridge, United Kingdom
| | - Mary J. Christoph
- AstraZeneca Pharmaceuticals, Wilmington, DE, United States of America
| | - Bruce Koch
- AstraZeneca Pharmaceuticals, Wilmington, DE, United States of America
| | - Timothy E. Moore
- Statistical Consulting Services, Center for Open Research Resources & Equipment, University of Connecticut, Storrs, CT, United States of America
| | - Garrett Frady
- Department of Statistics, University of Connecticut, Storrs, CT, United States of America
| | - Craig I. Coleman
- University of Connecticut School of Pharmacy, Storrs, CT, United States of America
- Evidence-Based Practice Center, Hartford Hospital, Hartford, CT, United States of America
| |
Collapse
|
3
|
Nylén MC, Abzhandadze T, Persson HC, Sunnerhagen KS. Prediction of long-term functional outcome following different rehabilitation pathways after stroke unit discharge. J Rehabil Med 2024; 56:jrm19458. [PMID: 38770699 PMCID: PMC11135335 DOI: 10.2340/jrm.v56.19458] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2023] [Accepted: 04/10/2024] [Indexed: 05/22/2024] Open
Abstract
OBJECTIVE To investigate whether referral for different types of rehabilitation on discharge from Swedish stroke units can predict functional outcomes at 1 and 5 years after a stroke. DESIGN A longitudinal and registry-based study. SUBJECTS/PATIENTS A total of 5,118 participants with index stroke in 2011 were followed-up at 1 and 5 years after the stroke. METHODS Ordinal logistic regression models were developed to predict the category of functional outcome: independent, dependent, or dead. The primary predictors were planned rehabilitation in a home setting, inpatient rehabilitation, and outpatient rehabilitation, with no planned rehabilitation as the reference category. RESULTS Planned outpatient rehabilitation predicted independence (compared with death) at 1 year. Planned rehabilitation in the home setting predicted independence (compared with death) at 1 and 5 years. Compared with other planned pathways, participants planned for inpatient rehabilitation had more severe conditions, and planned inpatient rehabilitation did not predict independence. CONCLUSION Planning for outpatient or home-based rehabilitation appeared to lead more effectively to participants achieving independence over the course of 1-5 years. This may have been due to the less severe nature of these participants' conditions, compared with those requiring inpatient rehabilitation.
Collapse
Affiliation(s)
- Malin C Nylén
- Institute of Neuroscience and Physiology, Rehabilitation Medicine, University of Gothenburg, Gothenburg, Sweden.
| | - Tamar Abzhandadze
- Institute of Neuroscience and Physiology, Rehabilitation Medicine, University of Gothenburg, Gothenburg, Sweden; Department of Occupational Therapy and Physiotherapy, Sahlgrenska University Hospital, Gothenburg, Sweden
| | - Hanna C Persson
- Institute of Neuroscience and Physiology, Rehabilitation Medicine, University of Gothenburg, Gothenburg, Sweden; Department of Occupational Therapy and Physiotherapy, Sahlgrenska University Hospital, Gothenburg, Sweden
| | - Katharina S Sunnerhagen
- Institute of Neuroscience and Physiology, Rehabilitation Medicine, University of Gothenburg, Gothenburg, Sweden; Neurocare, Sahlgrenska University Hospital, Gothenburg, Sweden
| |
Collapse
|
4
|
Tritt A, Yue JK, Ferguson AR, Torres Espin A, Nelson LD, Yuh EL, Markowitz AJ, Manley GT, Bouchard KE. Data-driven distillation and precision prognosis in traumatic brain injury with interpretable machine learning. Sci Rep 2023; 13:21200. [PMID: 38040784 PMCID: PMC10692236 DOI: 10.1038/s41598-023-48054-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2023] [Accepted: 11/21/2023] [Indexed: 12/03/2023] Open
Abstract
Traumatic brain injury (TBI) affects how the brain functions in the short and long term. Resulting patient outcomes across physical, cognitive, and psychological domains are complex and often difficult to predict. Major challenges to developing personalized treatment for TBI include distilling large quantities of complex data and increasing the precision with which patient outcome prediction (prognoses) can be rendered. We developed and applied interpretable machine learning methods to TBI patient data. We show that complex data describing TBI patients' intake characteristics and outcome phenotypes can be distilled to smaller sets of clinically interpretable latent factors. We demonstrate that 19 clusters of TBI outcomes can be predicted from intake data, a ~ 6× improvement in precision over clinical standards. Finally, we show that 36% of the outcome variance across patients can be predicted. These results demonstrate the importance of interpretable machine learning applied to deeply characterized patients for data-driven distillation and precision prognosis.
Collapse
Affiliation(s)
- Andrew Tritt
- Applied Math and Computational Research Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - John K Yue
- Brain and Spinal Injury Center, Zuckerberg San Francisco General Hospital and Trauma Center, San Francisco, CA, USA
- Department of Neurosurgery, Weill Institute for Neurosciences, University of California San Francisco, San Francisco, CA, USA
| | - Adam R Ferguson
- Brain and Spinal Injury Center, Zuckerberg San Francisco General Hospital and Trauma Center, San Francisco, CA, USA
- Department of Neurosurgery, Weill Institute for Neurosciences, University of California San Francisco, San Francisco, CA, USA
- San Francisco Veterans Affairs Healthcare System, San Francisco, CA, USA
| | - Abel Torres Espin
- Brain and Spinal Injury Center, Zuckerberg San Francisco General Hospital and Trauma Center, San Francisco, CA, USA
- Department of Neurosurgery, Weill Institute for Neurosciences, University of California San Francisco, San Francisco, CA, USA
| | - Lindsay D Nelson
- Departments of Neurosurgery and Neurology, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Esther L Yuh
- Brain and Spinal Injury Center, Zuckerberg San Francisco General Hospital and Trauma Center, San Francisco, CA, USA
- Department of Neurosurgery, Weill Institute for Neurosciences, University of California San Francisco, San Francisco, CA, USA
| | - Amy J Markowitz
- Brain and Spinal Injury Center, Zuckerberg San Francisco General Hospital and Trauma Center, San Francisco, CA, USA
- Department of Neurosurgery, Weill Institute for Neurosciences, University of California San Francisco, San Francisco, CA, USA
| | - Geoffrey T Manley
- Brain and Spinal Injury Center, Zuckerberg San Francisco General Hospital and Trauma Center, San Francisco, CA, USA
- Department of Neurosurgery, Weill Institute for Neurosciences, University of California San Francisco, San Francisco, CA, USA
- Weill Neurohub, University of California San Francisco, San Francisco, CA, USA
- Weill Neurohub, University of California Berkeley, Berkeley, CA, USA
| | - Kristofer E Bouchard
- Weill Neurohub, University of California Berkeley, Berkeley, CA, USA.
- Scientific Data Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA.
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA.
- Helen Wills Neuroscience Institute and Redwood Center for Theoretical Neuroscience, University of California Berkeley, Berkeley, CA, USA.
| |
Collapse
|
5
|
Levine AC, Gainey M, Qu K, Nasrin S, Sharif MBE, Noor SS, Barry MA, Garbern SC, Schmid CH, Rosen RK, Nelson EJ, Alam NH. A comparison of the NIRUDAK models and WHO algorithm for dehydration assessment in older children and adults with acute diarrhoea: a prospective, observational study. Lancet Glob Health 2023; 11:e1725-e1733. [PMID: 37776870 PMCID: PMC10593153 DOI: 10.1016/s2214-109x(23)00403-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2023] [Revised: 08/04/2023] [Accepted: 08/09/2023] [Indexed: 10/02/2023]
Abstract
BACKGROUND Despite the importance of accurate and rapid assessment of hydration status in patients with acute diarrhoea, no validated tools exist to help clinicians assess dehydration severity in older children and adults. The aim of this study is to validate a clinical decision support tool (CDST) and a simplified score for dehydration severity in older children and adults with acute diarrhoea (both developed during the NIRUDAK study) and compare their accuracy and reliability with current WHO guidelines. METHODS A random sample of patients aged 5 years or older presenting with diarrhoea to the icddr,b Dhaka Hospital in Bangladesh between Jan 30 and Dec 13, 2022 were included in this prospective cohort study. Patients with fewer than three loose stools per day, more than 7 days of symptoms, previous enrolment in the study, or a diagnosis other than acute gastroenteritis were excluded. Patients were weighed on arrival and assessed separately by two nurses using both our novel clinical tools and WHO guidelines. Patients were weighed every 4 h to determine their percent weight change with rehydration, our criterion standard for dehydration. Accuracy for the diagnosis of dehydration category (none, some, or severe) was assessed using the ordinal c-index (ORC). Reliability was assessed by comparing the prediction of severe dehydration from each nurse's independent assessment using the intraclass correlation coefficient (ICC). FINDINGS 1580 patients were included in our primary analysis, of whom 921 (58·3%) were female and 659 (41·7%) male. The ORC was 0·74 (95% CI 0·71-0·77) for the CDST, 0·75 (0·71-0·78) for the simplified score, and 0·64 (0·61-0·67) for the WHO guidelines. The ICC was 0·98 (95% CI 0·97-0·98) for the CDST, 0·94 (0·93-0·95) for the simplified score, and 0·56 (0·52-0·60) for the WHO guidelines. INTERPRETATION Use of our CDST or simplified score by clinicians could reduce undertreatment and overtreatment of older children and adults with acute diarrhoea, potentially reducing morbidity and mortality for this common disease. FUNDING US National Institutes of Health. TRANSLATION For the Bangla translation of the abstract see Supplementary Materials section.
Collapse
Affiliation(s)
- Adam C Levine
- Department of Emergency Medicine, Warren Alpert Medical School, Brown University, Providence, RI, USA.
| | - Monique Gainey
- Department of Emergency Medicine, Rhode Island Hospital, Providence, RI, USA
| | - Kexin Qu
- Department of Biostatistics, School of Public Health, Brown University, Providence, RI, USA
| | - Sabiha Nasrin
- Nutrition and Clinical Services Division, International Centre for Diarrhoeal Disease Research, Bangladesh (icddr,b), Dhaka, Bangladesh
| | - Mohsena Bint-E Sharif
- Nutrition and Clinical Services Division, International Centre for Diarrhoeal Disease Research, Bangladesh (icddr,b), Dhaka, Bangladesh
| | - Syada S Noor
- Nutrition and Clinical Services Division, International Centre for Diarrhoeal Disease Research, Bangladesh (icddr,b), Dhaka, Bangladesh
| | - Meagan A Barry
- Department of Emergency Medicine, Warren Alpert Medical School, Brown University, Providence, RI, USA
| | - Stephanie C Garbern
- Department of Emergency Medicine, Warren Alpert Medical School, Brown University, Providence, RI, USA
| | - Christopher H Schmid
- Department of Biostatistics, School of Public Health, Brown University, Providence, RI, USA
| | - Rochelle K Rosen
- Department of Behavioral and Social Sciences, School of Public Health, Brown University, Providence, RI, USA
| | - Eric J Nelson
- Departments of Pediatrics and Environmental and Global Health, Emerging Pathogens Institute, University of Florida, Gainesville, FL, USA
| | - Nur H Alam
- Nutrition and Clinical Services Division, International Centre for Diarrhoeal Disease Research, Bangladesh (icddr,b), Dhaka, Bangladesh
| |
Collapse
|
6
|
Bhattacharyay S, Caruso PF, Åkerlund C, Wilson L, Stevens RD, Menon DK, Steyerberg EW, Nelson DW, Ercole A. Mining the contribution of intensive care clinical course to outcome after traumatic brain injury. NPJ Digit Med 2023; 6:154. [PMID: 37604980 PMCID: PMC10442346 DOI: 10.1038/s41746-023-00895-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2023] [Accepted: 08/01/2023] [Indexed: 08/23/2023] Open
Abstract
Existing methods to characterise the evolving condition of traumatic brain injury (TBI) patients in the intensive care unit (ICU) do not capture the context necessary for individualising treatment. Here, we integrate all heterogenous data stored in medical records (1166 pre-ICU and ICU variables) to model the individualised contribution of clinical course to 6-month functional outcome on the Glasgow Outcome Scale -Extended (GOSE). On a prospective cohort (n = 1550, 65 centres) of TBI patients, we train recurrent neural network models to map a token-embedded time series representation of all variables (including missing values) to an ordinal GOSE prognosis every 2 h. The full range of variables explains up to 52% (95% CI: 50-54%) of the ordinal variance in functional outcome. Up to 91% (95% CI: 90-91%) of this explanation is derived from pre-ICU and admission information (i.e., static variables). Information collected in the ICU (i.e., dynamic variables) increases explanation (by up to 5% [95% CI: 4-6%]), though not enough to counter poorer overall performance in longer-stay (>5.75 days) patients. Highest-contributing variables include physician-based prognoses, CT features, and markers of neurological function. Whilst static information currently accounts for the majority of functional outcome explanation after TBI, data-driven analysis highlights investigative avenues to improve the dynamic characterisation of longer-stay patients. Moreover, our modelling strategy proves useful for converting large patient records into interpretable time series with missing data integration and minimal processing.
Collapse
Affiliation(s)
- Shubhayu Bhattacharyay
- Division of Anaesthesia, University of Cambridge, Cambridge, UK.
- Department of Clinical Neurosciences, University of Cambridge, Cambridge, UK.
- Laboratory of Computational Intensive Care Medicine, Johns Hopkins University, Baltimore, MD, USA.
| | - Pier Francesco Caruso
- Division of Anaesthesia, University of Cambridge, Cambridge, UK
- Department of Biomedical Sciences, Humanitas University, Via Rita Levi Montalcini 4, Pieve Emanuele, Milan, 20072, Italy
| | - Cecilia Åkerlund
- Department of Physiology and Pharmacology, Section for Perioperative Medicine and Intensive Care, Karolinska Institutet, Stockholm, Sweden
| | - Lindsay Wilson
- Division of Psychology, University of Stirling, Stirling, UK
| | - Robert D Stevens
- Laboratory of Computational Intensive Care Medicine, Johns Hopkins University, Baltimore, MD, USA
- Department of Anesthesiology and Critical Care Medicine, Johns Hopkins University, Baltimore, MD, USA
| | - David K Menon
- Division of Anaesthesia, University of Cambridge, Cambridge, UK
| | - Ewout W Steyerberg
- Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, The Netherlands
| | - David W Nelson
- Department of Physiology and Pharmacology, Section for Perioperative Medicine and Intensive Care, Karolinska Institutet, Stockholm, Sweden
| | - Ari Ercole
- Division of Anaesthesia, University of Cambridge, Cambridge, UK
- Cambridge Centre for Artificial Intelligence in Medicine, Cambridge, UK
| |
Collapse
|
7
|
Biziaev T, Aktary ML, Wang Q, Chekouo T, Bhatti P, Shack L, Robson PJ, Kopciuk KA. Development and External Validation of Partial Proportional Odds Risk Prediction Models for Cancer Stage at Diagnosis among Males and Females in Canada. Cancers (Basel) 2023; 15:3545. [PMID: 37509208 PMCID: PMC10377619 DOI: 10.3390/cancers15143545] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2023] [Revised: 07/04/2023] [Accepted: 07/05/2023] [Indexed: 07/30/2023] Open
Abstract
Risk prediction models for cancer stage at diagnosis may identify individuals at higher risk of late-stage cancer diagnoses. Partial proportional odds risk prediction models for cancer stage at diagnosis for males and females were developed using data from Alberta's Tomorrow Project (ATP). Prediction models were validated on the British Columbia Generations Project (BCGP) cohort using discrimination and calibration measures. Among ATP males, older age at diagnosis was associated with an earlier stage at diagnosis, while full- or part-time employment, prostate-specific antigen testing, and former/current smoking were associated with a later stage at diagnosis. Among ATP females, mammogram and sigmoidoscopy or colonoscopy were associated with an earlier stage at diagnosis, while older age at diagnosis, number of pregnancies, and hysterectomy were associated with a later stage at diagnosis. On external validation, discrimination results were poor for both males and females while calibration results indicated that the models did not over- or under-fit to derivation data or over- or under-predict risk. Multiple factors associated with cancer stage at diagnosis were identified among ATP participants. While the prediction model calibration was acceptable, discrimination was poor when applied to BCGP data. Updating our models with additional predictors may help improve predictive performance.
Collapse
Affiliation(s)
- Timofei Biziaev
- Department of Mathematics and Statistics, University of Calgary, Calgary, AB T2N 4N2, Canada
| | - Michelle L Aktary
- Faculty of Kinesiology, University of Calgary, Calgary, AB T2N 1N4, Canada
| | - Qinggang Wang
- Cancer Epidemiology and Prevention Research, Cancer Care Alberta, Alberta Health Services, Calgary, AB T2S 3C3, Canada
| | - Thierry Chekouo
- Department of Mathematics and Statistics, University of Calgary, Calgary, AB T2N 4N2, Canada
- Division of Biostatistics, School of Public Health, University of Minnesota, Minneapolis, MN 55455, USA
| | - Parveen Bhatti
- Cancer Control Research, BC Cancer, Vancouver, BC V5Z 1L3, Canada
- School of Population and Public Health, University of British Columbia, Vancouver, BC V6T 1Z3, Canada
| | - Lorraine Shack
- Cancer Surveillance and Reporting, Alberta Health Services, Calgary, AB T2S 3C3, Canada
| | - Paula J Robson
- Department of Agricultural, Food and Nutritional Science and School of Public Health, University of Alberta, Edmonton, AB T6G 2P5, Canada
- Cancer Care Alberta and Cancer Strategic Clinical Network, Alberta Health Services, Edmonton, AB T5J 3H1, Canada
| | - Karen A Kopciuk
- Department of Mathematics and Statistics, University of Calgary, Calgary, AB T2N 4N2, Canada
- Cancer Epidemiology and Prevention Research, Cancer Care Alberta, Alberta Health Services, Calgary, AB T2S 3C3, Canada
- Departments of Oncology, Community Health Sciences, University of Calgary, Calgary, AB T2N 4N2, Canada
| |
Collapse
|
8
|
Saffari SE, Ning Y, Xie F, Chakraborty B, Volovici V, Vaughan R, Ong MEH, Liu N. AutoScore-Ordinal: an interpretable machine learning framework for generating scoring models for ordinal outcomes. BMC Med Res Methodol 2022; 22:286. [PMID: 36333672 PMCID: PMC9636613 DOI: 10.1186/s12874-022-01770-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2022] [Accepted: 10/25/2022] [Indexed: 11/06/2022] Open
Abstract
Background Risk prediction models are useful tools in clinical decision-making which help with risk stratification and resource allocations and may lead to a better health care for patients. AutoScore is a machine learning–based automatic clinical score generator for binary outcomes. This study aims to expand the AutoScore framework to provide a tool for interpretable risk prediction for ordinal outcomes. Methods The AutoScore-Ordinal framework is generated using the same 6 modules of the original AutoScore algorithm including variable ranking, variable transformation, score derivation (from proportional odds models), model selection, score fine-tuning, and model evaluation. To illustrate the AutoScore-Ordinal performance, the method was conducted on electronic health records data from the emergency department at Singapore General Hospital over 2008 to 2017. The model was trained on 70% of the data, validated on 10% and tested on the remaining 20%. Results This study included 445,989 inpatient cases, where the distribution of the ordinal outcome was 80.7% alive without 30-day readmission, 12.5% alive with 30-day readmission, and 6.8% died inpatient or by day 30 post discharge. Two point-based risk prediction models were developed using two sets of 8 predictor variables identified by the flexible variable selection procedure. The two models indicated reasonably good performance measured by mean area under the receiver operating characteristic curve (0.758 and 0.793) and generalized c-index (0.737 and 0.760), which were comparable to alternative models. Conclusion AutoScore-Ordinal provides an automated and easy-to-use framework for development and validation of risk prediction models for ordinal outcomes, which can systematically identify potential predictors from high-dimensional data.
Collapse
|
9
|
Bhattacharyay S, Milosevic I, Wilson L, Menon DK, Stevens RD, Steyerberg EW, Nelson DW, Ercole A. The leap to ordinal: Detailed functional prognosis after traumatic brain injury with a flexible modelling approach. PLoS One 2022; 17:e0270973. [PMID: 35788768 PMCID: PMC9255749 DOI: 10.1371/journal.pone.0270973] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2022] [Accepted: 06/21/2022] [Indexed: 11/30/2022] Open
Abstract
When a patient is admitted to the intensive care unit (ICU) after a traumatic brain injury (TBI), an early prognosis is essential for baseline risk adjustment and shared decision making. TBI outcomes are commonly categorised by the Glasgow Outcome Scale–Extended (GOSE) into eight, ordered levels of functional recovery at 6 months after injury. Existing ICU prognostic models predict binary outcomes at a certain threshold of GOSE (e.g., prediction of survival [GOSE > 1]). We aimed to develop ordinal prediction models that concurrently predict probabilities of each GOSE score. From a prospective cohort (n = 1,550, 65 centres) in the ICU stratum of the Collaborative European NeuroTrauma Effectiveness Research in TBI (CENTER-TBI) patient dataset, we extracted all clinical information within 24 hours of ICU admission (1,151 predictors) and 6-month GOSE scores. We analysed the effect of two design elements on ordinal model performance: (1) the baseline predictor set, ranging from a concise set of ten validated predictors to a token-embedded representation of all possible predictors, and (2) the modelling strategy, from ordinal logistic regression to multinomial deep learning. With repeated k-fold cross-validation, we found that expanding the baseline predictor set significantly improved ordinal prediction performance while increasing analytical complexity did not. Half of these gains could be achieved with the addition of eight high-impact predictors to the concise set. At best, ordinal models achieved 0.76 (95% CI: 0.74–0.77) ordinal discrimination ability (ordinal c-index) and 57% (95% CI: 54%– 60%) explanation of ordinal variation in 6-month GOSE (Somers’ Dxy). Model performance and the effect of expanding the predictor set decreased at higher GOSE thresholds, indicating the difficulty of predicting better functional outcomes shortly after ICU admission. Our results motivate the search for informative predictors that improve confidence in prognosis of higher GOSE and the development of ordinal dynamic prediction models.
Collapse
Affiliation(s)
- Shubhayu Bhattacharyay
- Division of Anaesthesia, University of Cambridge, Cambridge, United Kingdom
- Department of Clinical Neurosciences, University of Cambridge, Cambridge, United Kingdom
- Laboratory of Computational Intensive Care Medicine, Johns Hopkins University, Baltimore, MD, United States of America
- * E-mail:
| | - Ioan Milosevic
- Division of Anaesthesia, University of Cambridge, Cambridge, United Kingdom
| | - Lindsay Wilson
- Division of Psychology, University of Stirling, Stirling, United Kingdom
| | - David K. Menon
- Division of Anaesthesia, University of Cambridge, Cambridge, United Kingdom
| | - Robert D. Stevens
- Laboratory of Computational Intensive Care Medicine, Johns Hopkins University, Baltimore, MD, United States of America
- Department of Anesthesiology and Critical Care Medicine, Johns Hopkins University, Baltimore, MD, United States of America
| | - Ewout W. Steyerberg
- Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, The Netherlands
| | - David W. Nelson
- Department of Physiology and Pharmacology, Section for Perioperative Medicine and Intensive Care, Karolinska Institutet, Stockholm, Sweden
| | - Ari Ercole
- Division of Anaesthesia, University of Cambridge, Cambridge, United Kingdom
- Cambridge Centre for Artificial Intelligence in Medicine, Cambridge, United Kingdom
| | | |
Collapse
|
10
|
Classical Regression and Predictive Modeling. World Neurosurg 2022; 161:251-264. [PMID: 35505542 DOI: 10.1016/j.wneu.2022.02.030] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2021] [Revised: 02/05/2022] [Accepted: 02/07/2022] [Indexed: 11/21/2022]
Abstract
BACKGROUND With the advent of personalized and stratified medicine, there has been much discussion about predictive modeling and the role of classical regression in modern medical research. We describe and distinguish the goals in these 2 frameworks for analysis. METHODS The assumptions underlying and utility of classical regression are reviewed for continuous and binary outcomes. The tenets of predictive modeling are then discussed and contrasted. Principles are illustrated by simulation and through application of methods to a neurosurgical study. RESULTS Classical regression can be used for insights into causal mechanisms if careful thought is given to the role of variables of interest and potential confounders. In predictive modeling, interest lies more in accuracy of predictions and so alternative metrics are used to judge adequacy of models and methods; methods which average predictions over several contending models can improve predictive performance but these do not admit a single risk score. CONCLUSIONS Both classical regression and predictive modeling have important roles in modern medical research. Understanding the distinction between the 2 frameworks for analysis is important to place them in their appropriate context and interpreting findings from published studies appropriately.
Collapse
|
11
|
van Doorn T, Reuvers SH, Roobol MJ, Remmers S, Verbeek JF, Scheepe JR, Wolterbeek JH, van der Schoot DK, Nieboer D, ‘t Hoen LA, Blok BF. Development of a prediction model in female pure or predominant urge urinary incontinence: a retrospective cohort study. Ther Adv Urol 2022; 14:17562872221090319. [PMID: 35464652 PMCID: PMC9024161 DOI: 10.1177/17562872221090319] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2021] [Accepted: 03/10/2022] [Indexed: 12/02/2022] Open
Abstract
Background: Urinary incontinence is a prevalent form of pelvic floor dysfunction, with a non-negligible impact on a patient’s quality of life. There are several treatment options, varying from conservative to invasive. The aim of this study is to predict treatment outcomes of pure or predominant urge urinary incontinence (UUI) in women to support shared decision-making and manage patient expectations. Methods: Data on patient characteristics, disease history, and investigations of 512 consecutive women treated for UUI in three hospitals in the Netherlands were retrospectively collected. The predicted outcome was the short-term subjective continence outcome, defined as patient-reported continence 3 months after treatment categorized as cure (no urinary leakage), improvement (any degree of improvement of urinary leakage), and failure (no improvement or worsening of urinary leakage). Multivariable ordinal regression with backward stepwise selection was performed to analyze association between outcome and patient’s characteristics. Interactions between patient characteristics and treatment were added to estimate individual treatment benefit. Discriminative ability was assessed with the ordinal c-statistic. Results: Conservative treatment was applied in 12% of the patients, pharmacological in 62%, and invasive in 26%. Subjective continence outcome was cure, improvement, and failure in 20%, 49%, and 31%, respectively. Number of incontinence episodes per day, voiding frequency during the day, subjective quantity of UI, coexistence of stress urinary incontinence (SUI), night incontinence, and bladder capacity and the interactions between these variables were included in the model. After internal validation, the ordinal c-statistic was 0.699. Conclusions: Six variables were of value to predict pure or predominant UUI treatment outcome in women. Further development into a comprehensive set of models for the use in various pelvic floor disorders and treatments is recommended to optimize individualized care. This model requires external validation before implementation in clinical practice.
Collapse
Affiliation(s)
- Tess van Doorn
- Department of Urology, Erasmus MC, Wytemaweg 80, Room Na 1524, 3015 CN Rotterdam, The Netherlands
| | - Sarah H.M. Reuvers
- Department of Urology, Erasmus MC Cancer Institute, University Medical Center Rotterdam, The Netherlands
| | - Monique J. Roobol
- Department of Urology, Erasmus MC Cancer Institute, University Medical Center Rotterdam, The Netherlands
| | - Sebastiaan Remmers
- Department of Urology, Erasmus MC Cancer Institute, University Medical Center Rotterdam, The Netherlands
| | - Jan F.M. Verbeek
- Department of Urology, Erasmus MC Cancer Institute, University Medical Center Rotterdam, The Netherlands
| | - Jeroen R. Scheepe
- Department of Urology, Erasmus MC Cancer Institute, University Medical Center Rotterdam, The Netherlands
| | - Josien H. Wolterbeek
- Department of Urology, Franciscus Gasthuis & Vlietland, Rotterdam, The Netherlands
| | | | - Daan Nieboer
- Department of Urology, Erasmus MC Cancer Institute, University Medical Center Rotterdam, The NetherlandsDepartment of Public Health, Erasmus MC, Rotterdam, the Netherlands
| | - Lisette A. ‘t Hoen
- Department of Urology, Erasmus MC Cancer Institute, University Medical Center Rotterdam, The Netherlands
| | - Bertil F.M. Blok
- Department of Urology, Erasmus MC Cancer Institute, University Medical Center Rotterdam, The Netherlands
| |
Collapse
|
12
|
Zou G, Smith E, Jairath V. A nonparametric approach to confidence intervals for concordance index and difference between correlated indices. J Biopharm Stat 2022; 32:740-767. [PMID: 35216545 DOI: 10.1080/10543406.2022.2030747] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Concordance refers to the probability that subjects with high values on one variable also have high values on another variable. This index has wide application in practice, as a measure of effect size in group-comparison studies, an index of accuracy in diagnostic studies, and a discrimination index for prediction models. Herein, we provide a unified framework for statistical inference involving concordance indices for standard variables of binary, ordinal, and continuous types. In particular, we develop confidence interval procedures for a single concordance index and differences between two correlated indices. Simulation results show that procedures based on logit-transformation for a single index and Fisher's z-transformation for a difference between indices perform very well in terms of coverage and tail errors even when the sample size is as small as 30, unless the concordance is high and the standard is a binary variable for which at least 50 subjects are needed. We illustrate the procedures for a variety of standard variables with previously published data. Illustrative SAS code is provided.
Collapse
Affiliation(s)
- Guangyong Zou
- Department of Epidemiology and Biostatistics, Schulich School of Medicine & Dentistry, Western University, London, Ontario, Canada.,Robarts Research Institute, Western University, London, Ontario, Canada
| | - Emma Smith
- Department of Epidemiology and Biostatistics, Schulich School of Medicine & Dentistry, Western University, London, Ontario, Canada
| | - Vipul Jairath
- Department of Epidemiology and Biostatistics, Schulich School of Medicine & Dentistry, Western University, London, Ontario, Canada.,Department of Medicine, Schulich School of Medicine & Dentistry, Western University, London, Ontario, Canada
| |
Collapse
|
13
|
Edlinger M, van Smeden M, Alber HF, Wanitschek M, Van Calster B. Risk prediction models for discrete ordinal outcomes: Calibration and the impact of the proportional odds assumption. Stat Med 2021; 41:1334-1360. [PMID: 34897756 PMCID: PMC9299669 DOI: 10.1002/sim.9281] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2021] [Revised: 10/08/2021] [Accepted: 11/22/2021] [Indexed: 12/28/2022]
Abstract
Calibration is a vital aspect of the performance of risk prediction models, but research in the context of ordinal outcomes is scarce. This study compared calibration measures for risk models predicting a discrete ordinal outcome, and investigated the impact of the proportional odds assumption on calibration and overfitting. We studied the multinomial, cumulative, adjacent category, continuation ratio, and stereotype logit/logistic models. To assess calibration, we investigated calibration intercepts and slopes, calibration plots, and the estimated calibration index. Using large sample simulations, we studied the performance of models for risk estimation under various conditions, assuming that the true model has either a multinomial logistic form or a cumulative logit proportional odds form. Small sample simulations were used to compare the tendency for overfitting between models. As a case study, we developed models to diagnose the degree of coronary artery disease (five categories) in symptomatic patients. When the true model was multinomial logistic, proportional odds models often yielded poor risk estimates, with calibration slopes deviating considerably from unity even on large model development datasets. The stereotype logistic model improved the calibration slope, but still provided biased risk estimates for individual patients. When the true model had a cumulative logit proportional odds form, multinomial logistic regression provided biased risk estimates, although these biases were modest. Nonproportional odds models require more parameters to be estimated from the data, and hence suffered more from overfitting. Despite larger sample size requirements, we generally recommend multinomial logistic regression for risk prediction modeling of discrete ordinal outcomes.
Collapse
Affiliation(s)
- Michael Edlinger
- Department of Development and Regeneration, KU Leuven, Leuven, Belgium.,Department of Medical Statistics, Informatics, and Health Economics, Medical University Innsbruck, Innsbruck, Austria
| | - Maarten van Smeden
- Julius Centre for Health Science and Primary Care, University Medical Centre Utrecht, Utrecht, The Netherlands.,Department of Clinical Epidemiology, Leiden University Medical Centre, Leiden, The Netherlands
| | - Hannes F Alber
- Department of Internal Medicine and Cardiology, Klinikum Klagenfurt am Wörthersee, Klagenfurt, Austria.,Karl Landsteiner Institute for Interdisciplinary Science, Rehabilitation Centre, Münster, Austria
| | - Maria Wanitschek
- Department of Internal Medicine III-Cardiology and Angiology, Tirol Kliniken, Innsbruck, Austria
| | - Ben Van Calster
- Department of Development and Regeneration, KU Leuven, Leuven, Belgium.,EPI-Centre, KU Leuven, Leuven, Belgium.,Department of Biomedical Data Sciences, Leiden University Medical Centre, Leiden, The Netherlands
| |
Collapse
|
14
|
Sainani KL. Multinomial and ordinal logistic regression. PM R 2021; 13:1050-1055. [PMID: 33905601 DOI: 10.1002/pmrj.12622] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2021] [Accepted: 04/15/2021] [Indexed: 11/05/2022]
Affiliation(s)
- Kristin L Sainani
- Department of Epidemiology and Population Health, Stanford University, Stanford, California, USA
| |
Collapse
|
15
|
Singh V, Dwivedi SN, Deo SVS. Ordinal logistic regression model describing factors associated with extent of nodal involvement in oral cancer patients and its prospective validation. BMC Med Res Methodol 2020; 20:95. [PMID: 32336269 PMCID: PMC7183690 DOI: 10.1186/s12874-020-00985-1] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2019] [Accepted: 04/20/2020] [Indexed: 01/09/2023] Open
Abstract
Background Oral cancer is the most common cancer among Indian men, and has strong tendency of metastatic spread to neck lymph node which strongly influences prognosis especially 5 year survival-rate and also guides the related managements more effectively. Therefore, a reliable and accurate means of preoperative evaluation of extent of nodal involvement becomes crucial. However, earlier researchers have preferred to address mainly its dichotomous form (involved/not-involved) instead of ordinal form while dealing with epidemiology of nodal involvement. As a matter of fact, consideration of ordinal form appropriately may increase not only the efficiency of the developed model but also accuracy in the results and related implications. Hence, to develop a model describing factors associated with ordinal form of nodal involvement was major focus of this study. Methods The data for model building were taken from the Department of Surgical Oncology, Dr.BRA-IRCH, AIIMS, New Delhi, India. All the OSCC patients (duly operated including neck dissection) and confirmed histopathologically from 1995 to 2013 were included. Further, another data of 204 patients collected prospectively from 2014 to 2015 was considered for the validation of the developed model. To assess the factors associated with extent of nodal involvement, as a first attempt in the field of OSCC, stepwise multivariable regression procedure was used and results are presented as odds-ratio and corresponding 95% confidence interval (CI). For appropriate accounting of ordinal form, the ordinal models were assessed and compared. Also, performance of the developed model was validated on a prospectively collected another data. Results Under multivariable proportional odds model, pain at the time of presentation, sub mucous fibrosis, palpable neck node, oral site and degree of differentiation were found to be significantly associated factors with extent of nodal involvement. In addition, tumor size also emerged to be significant under partial-proportional odds model. Conclusions The analytical results under the present study reveal that in case of ordinal form of the outcome, appropriate ordinal regression may be a preferred choice. Present data suggest that, pain, sub mucous fibrosis, palpable neck node, oral site, degree of differentiation and tumor size are the most probable associated factors with extent of nodal involvement.
Collapse
Affiliation(s)
- Vishwajeet Singh
- Department of Biostatistics, All India Institute of Medical Sciences, New Delhi, 110029, India
| | - Sada Nand Dwivedi
- Department of Biostatistics, All India Institute of Medical Sciences, New Delhi, 110029, India.
| | - S V S Deo
- Department of Surgical Oncology, Dr BRA-IRCH, All India Institute of Medical Sciences, New Delhi, India
| |
Collapse
|
16
|
Duan Y, Xie X, Li Q, Mercaldo N, Samir AE, Kuang M, Lin M. Differentiation of regenerative nodule, dysplastic nodule, and small hepatocellular carcinoma in cirrhotic patients: a contrast-enhanced ultrasound-based multivariable model analysis. Eur Radiol 2020; 30:4741-4751. [PMID: 32307563 DOI: 10.1007/s00330-020-06834-5] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2019] [Revised: 03/03/2020] [Accepted: 03/25/2020] [Indexed: 12/13/2022]
Abstract
OBJECTIVE To develop a contrast-enhanced ultrasound (CEUS)-based model for differentiating cirrhotic liver lesions and for active surveillance of hepatocellular carcinoma (HCC). METHODS Patients with focal liver lesions (FLLs) with biopsy/resection-proven pathology and pre-procedure CEUS were enrolled from our institution between January 2011 and November 2014. Univariable and multivariable regression models were constructed using qualitative CEUS features and/or contrast arrival time ratio (CATR). The optimism-adjusted Harrell's generalized concordance index (CH) was used to quantify the discriminatory ability of each CEUS feature and model. RESULTS A total of 149 patients (113 men and 36 women) with 162 FLLs were enrolled with mean age 53.4 ± 12.7 years. A 0.1-unit reduction in CATR was associated with a 68% increase in the odds of having a higher nodule ranking (RN < DN < small HCC) (OR, 0.32; 95% CI, 0.20-0.50, p < .001). Arterial phase hypoenhancement and isoenhancement were inversely associated with a higher nodule ranking compared to hyperenhancement. Late-phase isoenhancement was associated with lower odds of a higher nodule ranking. The CEUS + CATR model (CH 0.92, 0.89-0.95) provided greater discriminatory ability when compared to the CATR model (ΔCH 0.09, 0.04-0.13, p < .001) and the CEUS model (ΔCH 0.03, 0.01-0.05, p = .02). CONCLUSIONS Our results provide preliminary evidence that multivariable regression model constructed using both qualitative CEUS features and CATR provides the greatest discriminatory ability to differentiate RN, DN, and small HCC in patients with cirrhosis, and might allow for active surveillance of the progression of cirrhotic liver lesions. KEY POINTS • Proportional odds logistic regression models based on qualitative CEUS features and/or CATR can be used for differentiating cirrhotic liver lesions and for active surveillance of HCC. • The reduction of CATR (RN < DN < small HCC) was strongly associated with high-risk cirrhotic liver nodules. • Inclusion of CATR in the CEUS prediction model significantly improved its performance for cirrhotic liver lesions risk-stratification.
Collapse
Affiliation(s)
- Yu Duan
- Department of Medical Ultrasonics, The First Affiliated Hospital, Sun Yat-sen University, 58 Zhongshan Road 2, Guangzhou, 510080, China
- Department of Radiology, Massachusetts General Hospital, Harvard Medical School, 55 Fruit Street, Boston, MA, 02114, USA
| | - Xiaoyan Xie
- Department of Medical Ultrasonics, The First Affiliated Hospital, Sun Yat-sen University, 58 Zhongshan Road 2, Guangzhou, 510080, China
| | - Qian Li
- Department of Radiology, Massachusetts General Hospital, Harvard Medical School, 55 Fruit Street, Boston, MA, 02114, USA
| | - Nathaniel Mercaldo
- Institute for Technology Assessment, Massachusetts General Hospital, Harvard Medical School, 101 Merrimac Street, Suite 1010, Boston, MA, 02114, USA
| | - Anthony E Samir
- Department of Radiology, Massachusetts General Hospital, Harvard Medical School, 55 Fruit Street, Boston, MA, 02114, USA
| | - Ming Kuang
- Department of Medical Ultrasonics, The First Affiliated Hospital, Sun Yat-sen University, 58 Zhongshan Road 2, Guangzhou, 510080, China
| | - Manxia Lin
- Department of Medical Ultrasonics, The First Affiliated Hospital, Sun Yat-sen University, 58 Zhongshan Road 2, Guangzhou, 510080, China.
| |
Collapse
|
17
|
Cicuendez M, Castaño-León A, Ramos A, Hilario A, Gómez PA, Lagares A. The added prognostic value of magnetic resonance imaging in traumatic brain injury: The importance of traumatic axonal injury when performing ordinal logistic regression. J Neuroradiol 2019; 46:299-306. [DOI: 10.1016/j.neurad.2018.08.001] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2018] [Revised: 07/30/2018] [Accepted: 08/15/2018] [Indexed: 12/01/2022]
|
18
|
Mijderwijk HJ, Steyerberg EW, Steiger HJ, Fischer I, Kamp MA. Fundamentals of Clinical Prediction Modeling for the Neurosurgeon. Neurosurgery 2019; 85:302-311. [DOI: 10.1093/neuros/nyz282] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2018] [Accepted: 05/26/2019] [Indexed: 01/18/2023] Open
Abstract
AbstractClinical prediction models in neurosurgery are increasingly reported. These models aim to provide an evidence-based approach to the estimation of the probability of a neurosurgical outcome by combining 2 or more prognostic variables. Model development and model reporting are often suboptimal. A basic understanding of the methodology of clinical prediction modeling is needed when interpreting these models. We address basic statistical background, 7 modeling steps, and requirements of these models such that they may fulfill their potential for major impact for our daily clinical practice and for future scientific work.
Collapse
Affiliation(s)
- Hendrik-Jan Mijderwijk
- Department of Neurosurgery, Heinrich-Heine University Medical Center, Düsseldorf, Germany
| | - Ewout W Steyerberg
- Department of Public Health, Erasmus MC, Rotterdam, The Netherlands
- Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, The Netherlands
| | - Hans-Jakob Steiger
- Department of Neurosurgery, Heinrich-Heine University Medical Center, Düsseldorf, Germany
| | - Igor Fischer
- Division of Informatics and Data Science, Department of Neurosurgery, Heinrich-Heine University, Düsseldorf, Germany
| | - Marcel A Kamp
- Department of Neurosurgery, Heinrich-Heine University Medical Center, Düsseldorf, Germany
| |
Collapse
|
19
|
van de Maat J, Nieboer D, Thompson M, Lakhanpaul M, Moll H, Oostenbrink R. Can clinical prediction models assess antibiotic need in childhood pneumonia? A validation study in paediatric emergency care. PLoS One 2019; 14:e0217570. [PMID: 31194750 PMCID: PMC6563975 DOI: 10.1371/journal.pone.0217570] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2019] [Accepted: 05/14/2019] [Indexed: 11/18/2022] Open
Abstract
Objectives Pneumonia is the most common bacterial infection in children at the emergency department (ED). Clinical prediction models for childhood pneumonia have been developed (using chest x-ray as their reference standard), but without implementation in clinical practice. Given current insights in the diagnostic limitations of chest x-ray, this study aims to validate these prediction models for a clinical diagnosis of pneumonia, and to explore their potential to guide decisions on antibiotic treatment at the ED. Methods We systematically identified clinical prediction models for childhood pneumonia and assessed their quality. We evaluated the validity of these models in two populations, using a clinical reference standard (1. definite/probable bacterial, 2. bacterial syndrome, 3. unknown bacterial/viral, 4. viral syndrome, 5. definite/probable viral), measuring performance by the ordinal c-statistic (ORC). Validation populations included prospectively collected data of children aged 1 month to 5 years attending the ED of Rotterdam (2012–2013) or Coventry (2005–2006) with fever and cough or dyspnoea. Results We identified eight prediction models and could evaluate the validity of seven, with original good performance. In the Dutch population 22/248 (9%) had a bacterial infection, in Coventry 53/301 (17%), antibiotic prescription was 21% and 35% respectively. Three models predicted a higher risk in children with bacterial infections than in those with viral disease (ORC ≥0.55) and could identify children at low risk of bacterial infection. Conclusions Three clinical prediction models for childhood pneumonia could discriminate fairly well between a clinical reference standard of bacterial versus viral infection. However, they all require the measurement of biomarkers, raising questions on the exact target population when implementing these models in clinical practice. Moreover, choosing optimal thresholds to guide antibiotic prescription is challenging and requires careful consideration of potential harms and benefits.
Collapse
Affiliation(s)
- Josephine van de Maat
- Department of General Paediatrics, Erasmus MC–Sophia Children’s Hospital, Rotterdam, The Netherlands
| | - Daan Nieboer
- Department of Public Health, Erasmus MC, Rotterdam, The Netherlands
| | - Matthew Thompson
- University of Washington, Department of Family Medicine, Seattle, United States of America
| | - Monica Lakhanpaul
- Population, Policy, Practice Program, UCL Great Ormond Street Institute of Child Health, London, United Kingdom
| | - Henriette Moll
- Department of General Paediatrics, Erasmus MC–Sophia Children’s Hospital, Rotterdam, The Netherlands
| | - Rianne Oostenbrink
- Department of General Paediatrics, Erasmus MC–Sophia Children’s Hospital, Rotterdam, The Netherlands
- * E-mail:
| |
Collapse
|
20
|
An ordinal prediction model of the diagnosis of non-obstructive coronary artery and multi-vessel disease in the CARDIIGAN cohort. Int J Cardiol 2018; 267:8-12. [DOI: 10.1016/j.ijcard.2018.05.092] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/09/2018] [Revised: 05/22/2018] [Accepted: 05/23/2018] [Indexed: 01/09/2023]
|
21
|
Murphy K, Murphy BT, Boyce S, Flynn L, Gilgunn S, O'Rourke CJ, Rooney C, Stöckmann H, Walsh AL, Finn S, O'Kennedy RJ, O'Leary J, Pennington SR, Perry AS, Rudd PM, Saldova R, Sheils O, Shields DC, Watson RW. Integrating biomarkers across omic platforms: an approach to improve stratification of patients with indolent and aggressive prostate cancer. Mol Oncol 2018; 12:1513-1525. [PMID: 29927052 PMCID: PMC6120220 DOI: 10.1002/1878-0261.12348] [Citation(s) in RCA: 39] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2018] [Revised: 05/28/2018] [Accepted: 06/13/2018] [Indexed: 12/22/2022] Open
Abstract
Classifying indolent prostate cancer represents a significant clinical challenge. We investigated whether integrating data from different omic platforms could identify a biomarker panel with improved performance compared to individual platforms alone. DNA methylation, transcripts, protein and glycosylation biomarkers were assessed in a single cohort of patients treated by radical prostatectomy. Novel multiblock statistical data integration approaches were used to deal with missing data and modelled via stepwise multinomial logistic regression, or LASSO. After applying leave‐one‐out cross‐validation to each model, the probabilistic predictions of disease type for each individual panel were aggregated to improve prediction accuracy using all available information for a given patient. Through assessment of three performance parameters of area under the curve (AUC) values, calibration and decision curve analysis, the study identified an integrated biomarker panel which predicts disease type with a high level of accuracy, with Multi AUC value of 0.91 (0.89, 0.94) and Ordinal C‐Index (ORC) value of 0.94 (0.91, 0.96), which was significantly improved compared to the values for the clinical panel alone of 0.67 (0.62, 0.72) Multi AUC and 0.72 (0.67, 0.78) ORC. Biomarker integration across different omic platforms significantly improves prediction accuracy. We provide a novel multiplatform approach for the analysis, determination and performance assessment of novel panels which can be applied to other diseases. With further refinement and validation, this panel could form a tool to help inform appropriate treatment strategies impacting on patient outcome in early stage prostate cancer.
Collapse
Affiliation(s)
- Keefe Murphy
- UCD School of Mathematics and Statistics, University College Dublin, Ireland
| | - Brendan T Murphy
- UCD School of Mathematics and Statistics, University College Dublin, Ireland
| | - Susie Boyce
- UCD School of Mathematics and Statistics, University College Dublin, Ireland.,UCD School of Medicine, Conway Institute of Biomolecular and Biomedical Research, University College Dublin, Ireland
| | - Louise Flynn
- Department of Histopathology, Central Pathology Laboratory, Trinity College, St James Hospital, University of Dublin, Ireland
| | - Sarah Gilgunn
- School of Biotechnology, Dublin City University, Ireland.,Biomedical Diagnostics Institute, National Centre for Sensor Research, Dublin City University, Ireland
| | - Colm J O'Rourke
- Prostate Molecular Oncology, Institute of Molecular Medicine, Trinity College Dublin, St James Hospital, Dublin, Ireland
| | - Cathy Rooney
- UCD School of Medicine, Conway Institute of Biomolecular and Biomedical Research, University College Dublin, Ireland
| | - Henning Stöckmann
- NIBRT GlycoScience Group, National Institute for Bioprocessing Research and Training, Dublin, Ireland
| | - Anna L Walsh
- Prostate Molecular Oncology, Institute of Molecular Medicine, Trinity College Dublin, St James Hospital, Dublin, Ireland
| | - Stephen Finn
- Department of Histopathology, Central Pathology Laboratory, Trinity College, St James Hospital, University of Dublin, Ireland
| | - Richard J O'Kennedy
- School of Biotechnology, Dublin City University, Ireland.,Biomedical Diagnostics Institute, National Centre for Sensor Research, Dublin City University, Ireland
| | - John O'Leary
- Department of Histopathology, Central Pathology Laboratory, Trinity College, St James Hospital, University of Dublin, Ireland
| | - Stephen R Pennington
- UCD School of Medicine, Conway Institute of Biomolecular and Biomedical Research, University College Dublin, Ireland
| | - Antoinette S Perry
- Prostate Molecular Oncology, Institute of Molecular Medicine, Trinity College Dublin, St James Hospital, Dublin, Ireland.,Cancer Biology and Therapeutics Lab, UCD School of Biomolecular and Biomedical Science, University College Dublin, Ireland
| | - Pauline M Rudd
- NIBRT GlycoScience Group, National Institute for Bioprocessing Research and Training, Dublin, Ireland
| | - Radka Saldova
- NIBRT GlycoScience Group, National Institute for Bioprocessing Research and Training, Dublin, Ireland
| | - Orla Sheils
- Department of Histopathology, Central Pathology Laboratory, Trinity College, St James Hospital, University of Dublin, Ireland
| | - Denis C Shields
- UCD School of Medicine, Conway Institute of Biomolecular and Biomedical Research, University College Dublin, Ireland
| | - R William Watson
- UCD School of Medicine, Conway Institute of Biomolecular and Biomedical Research, University College Dublin, Ireland
| |
Collapse
|
22
|
Epstein E, Fischerova D, Valentin L, Testa AC, Franchi D, Sladkevicius P, Frühauf F, Lindqvist PG, Mascilini F, Fruscio R, Haak LA, Opolskiene G, Pascual MA, Alcazar JL, Chiappa V, Guerriero S, Carlson JW, Van Holsbeke C, Leone FPG, De Moor B, Bourne T, van Calster B, Installe A, Timmerman D, Verbakel JY, Van den Bosch T. Ultrasound characteristics of endometrial cancer as defined by International Endometrial Tumor Analysis (IETA) consensus nomenclature: prospective multicenter study. ULTRASOUND IN OBSTETRICS & GYNECOLOGY : THE OFFICIAL JOURNAL OF THE INTERNATIONAL SOCIETY OF ULTRASOUND IN OBSTETRICS AND GYNECOLOGY 2018; 51:818-828. [PMID: 28944985 DOI: 10.1002/uog.18909] [Citation(s) in RCA: 57] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/29/2017] [Revised: 08/25/2017] [Accepted: 09/01/2017] [Indexed: 06/07/2023]
Abstract
OBJECTIVE To describe the sonographic features of endometrial cancer in relation to tumor stage, grade and histological type, using the International Endometrial Tumor Analysis (IETA) terminology. METHODS This was a prospective multicenter study of 1714 women with biopsy-confirmed endometrial cancer undergoing standardized transvaginal grayscale and Doppler ultrasound examination according to the IETA study protocol, by experienced ultrasound examiners using high-end ultrasound equipment. Clinical and sonographic data were entered into a web-based database. We assessed how strongly sonographic characteristics, according to IETA, were associated with outcome at hysterectomy, i.e. tumor stage, grade and histological type, using univariable logistic regression and the c-statistic. RESULTS In total, 1538 women were included in the final analysis. Median age was 65 (range, 27-98) years, median body mass index was 28.4 (range 16-67) kg/m2 , 1377 (89.5%) women were postmenopausal and 1296 (84.3%) reported abnormal vaginal bleeding. Grayscale and color Doppler features varied according to grade and stage of tumor. High-risk tumors, compared with low-risk tumors, were less likely to have regular endometrial-myometrial junction (difference of -23%; 95% CI, -27 to -18%), were larger (mean endometrial thickness; difference of +9%; 95% CI, +8 to +11%), and were more likely to have non-uniform echogenicity (difference of +7%; 95% CI, +1 to +13%), a multiple, multifocal vessel pattern (difference of +21%; 95% CI, +16 to +26%) and a moderate or high color score (difference of +22%; 95% CI, +18 to +27%). CONCLUSION Grayscale and color Doppler sonographic features are associated with grade and stage of tumor, and differ between high- and low-risk endometrial cancer. Copyright © 2017 ISUOG. Published by John Wiley & Sons Ltd.
Collapse
Affiliation(s)
- E Epstein
- Department of Clinical Science and Education, Karolinska Institutet, and Department of Obstetrics and Gynecology, Södersjukhuset, Stockholm, Sweden
| | - D Fischerova
- Department of Obstetrics and Gynecology, First Faculty of Medicine, Charles University, Prague, Czech Republic
| | - L Valentin
- Department of Obstetrics and Gynecology, Skåne University Hospital, Malmö, Lund University, Sweden
| | - A C Testa
- Department of Gynecological Oncology, Catholic University of the Sacred Heart, Rome, Italy
| | - D Franchi
- Department of Gynecological Oncology, European Institute of Oncology, Milan, Italy
| | - P Sladkevicius
- Department of Obstetrics and Gynecology, Skåne University Hospital, Malmö, Lund University, Sweden
| | - F Frühauf
- Department of Obstetrics and Gynecology, First Faculty of Medicine, Charles University, Prague, Czech Republic
| | - P G Lindqvist
- Department of Obstetrics and Gynecology, Karolinska University Hospital Huddinge, Stockholm, Sweden
| | - F Mascilini
- Department of Gynecological Oncology, Catholic University of the Sacred Heart, Rome, Italy
| | - R Fruscio
- Clinic of Obstetrics and Gynecology, University of Milan Bicocca, San Gerardo Hospital, Monza, Italy
| | - L A Haak
- Institute for the Care of Mother and Child, Prague and Third Faculty of Medicine, Charles University, Prague, Czech Republic
| | - G Opolskiene
- Center of Obstetrics and Gynecology, Vilnius University Hospital, Santariskiu Clinic, Vilnius, Lithuania
| | - M A Pascual
- Department of Obstetrics, Gynecology, and Reproduction, Hospital Universitario Dexeus, Barcelona, Spain
| | - J L Alcazar
- Department of Obstetrics and Gynecology, Clinica Universidad de Navarra, Pamplona, Spain
| | - V Chiappa
- Department of Obstetrics and Gynecology, National Cancer Institute, Milan, Italy
| | - S Guerriero
- Department of Obstetrics and Gynecology, University of Cagliari, Policlinico Universitario Duilio Casula, Monserrato, Cagliari, Italy
| | - J W Carlson
- Department of Pathology, Karolinska University Hospital, Stockholm, Sweden
| | - C Van Holsbeke
- Department of Obstetrics and Gynecology, Ziekenhuis Oost-Limburg, Genk, Belgium
| | - F P G Leone
- Department of Obstetrics and Gynecology, Clinical Sciences Institute, L. Sacco, Milan, Italy
| | - B De Moor
- Department of Electrical Engineering, ESAT-SCD, STADIUS Center for Dynamical Systems, Signal Processing and Data Analysis, KU Leuven, and imec, Leuven, Belgium
| | - T Bourne
- Department of Obstetrics and Gynaecology, Queen Charlotte's and Chelsea Hospital, Imperial College London, London, UK
- Department of Development and Regeneration, KU Leuven, Leuven, Belgium
| | - B van Calster
- Department of Development and Regeneration, KU Leuven, Leuven, Belgium
| | - A Installe
- Department of Electrical Engineering, ESAT-SCD, STADIUS Center for Dynamical Systems, Signal Processing and Data Analysis, KU Leuven, and imec, Leuven, Belgium
| | - D Timmerman
- Department of Development and Regeneration, KU Leuven, Leuven, Belgium
- Department of Obstetrics and Gynecology, University Hospital Leuven, Leuven, Belgium
| | - J Y Verbakel
- Department of Development and Regeneration, KU Leuven, Leuven, Belgium
- Nuffield Department of Primary Care Health Sciences, University of Oxford, Oxford, UK
| | - T Van den Bosch
- Department of Obstetrics and Gynecology, University Hospital Leuven, Leuven, Belgium
| |
Collapse
|
23
|
de Ridder IR, Dijkland SA, Scheele M, den Hertog HM, Dirks M, Westendorp WF, Nederkoorn PJ, van de Beek D, Ribbers GM, Steyerberg EW, Lingsma HF, Dippel DW. Development and validation of the Dutch Stroke Score for predicting disability and functional outcome after ischemic stroke: A tool to support efficient discharge planning. Eur Stroke J 2018; 3:165-173. [PMID: 29900414 PMCID: PMC5992735 DOI: 10.1177/2396987318754591] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2017] [Accepted: 12/23/2017] [Indexed: 12/23/2022] Open
Abstract
Introduction We aimed to develop and validate a prognostic score for disability at
discharge and functional outcome at three months in patients with acute
ischemic stroke based on clinical information available on admission. Patients and methods The Dutch Stroke Score (DSS) was developed in 1227 patients with ischemic
stroke included in the Paracetamol (Acetaminophen) In Stroke study.
Predictors for Barthel Index (BI) at discharge (‘DSS-discharge’) and
modified Rankin Scale (mRS) at three months (‘DSS-3 months’) were identified
in multivariable ordinal regression. The models were internally validated
with bootstrapping techniques. The DSS-3 months was externally validated in
the PRomoting ACute Thrombolysis in Ischemic StrokE study (1589 patients)
and the Preventive Antibiotics in Stroke Study (2107 patients). Model
performance was assessed in terms of discrimination, expressed by the area
under the receiver operating characteristic curve (AUC), and
calibration. Results At model development, the strongest predictors of Barthel Index at discharge
were age per decade over 60 (odds ratio = 1.55, 95% confidence interval (CI)
1.41–1.68), National Institutes of Health Stroke Scale (odds ratio = 1.24
per point, 95% CI 1.22–1.26) and diabetes (odds ratio = 1.62, 95% CI
1.32–1.91). The internally validated AUC was 0.76 (95% CI 0.75–0.79). The
DSS-3 months, additionally consisting of previous stroke and atrial
fibrillation, performed similarly at internal (AUC 0.75, 95% CI 0.74–0.77)
and external validation (AUC 0.74 in PRomoting ACute Thrombolysis in
Ischemic StrokE (95% CI 0.72–0.76) and 0.69 in Preventive Antibiotics in
Stroke Study (95% CI 0.69–0.72)). Observed outcome was slightly better than
predicted. Discussion: The DSS had satisfactory performance in predicting
BI at discharge and mRS at three months in ischemic stroke patients. Conclusion If further validated, the DSS may contribute to efficient stroke unit
discharge planning alongside patients' contextual factors and therapeutic
needs.
Collapse
Affiliation(s)
- Inger R de Ridder
- 1Department of Neurology, Erasmus MC-University Medical Center Rotterdam, Rotterdam, Netherlands
| | - Simone A Dijkland
- 2Department of Public Health, Center for Medical Decision Making, Erasmus MC-University Medical Center Rotterdam, Rotterdam, Netherlands
| | - Maaike Scheele
- 1Department of Neurology, Erasmus MC-University Medical Center Rotterdam, Rotterdam, Netherlands
| | | | - Maaike Dirks
- 1Department of Neurology, Erasmus MC-University Medical Center Rotterdam, Rotterdam, Netherlands.,4Department of Neurology, University Medical Center Utrecht, Utrecht, Netherlands
| | - Willeke F Westendorp
- 5Department of Neurology, Academic Medical Center, University of Amsterdam, Amsterdam Neuroscience, Amsterdam, Netherlands
| | - Paul J Nederkoorn
- 5Department of Neurology, Academic Medical Center, University of Amsterdam, Amsterdam Neuroscience, Amsterdam, Netherlands
| | - Diederik van de Beek
- 5Department of Neurology, Academic Medical Center, University of Amsterdam, Amsterdam Neuroscience, Amsterdam, Netherlands
| | - Gerard M Ribbers
- 6Department of Rehabilitation Medicine, Erasmus MC-University Medical Center Rotterdam, Rotterdam, Netherlands
| | - Ewout W Steyerberg
- 2Department of Public Health, Center for Medical Decision Making, Erasmus MC-University Medical Center Rotterdam, Rotterdam, Netherlands.,Department of Medical Statistics and Bioinformatics, Leiden University Medical Center, Leiden, Netherlands
| | - Hester F Lingsma
- 2Department of Public Health, Center for Medical Decision Making, Erasmus MC-University Medical Center Rotterdam, Rotterdam, Netherlands
| | - Diederik Wj Dippel
- 1Department of Neurology, Erasmus MC-University Medical Center Rotterdam, Rotterdam, Netherlands
| |
Collapse
|
24
|
Meisner A, Parikh CR, Kerr KF. Using ordinal outcomes to construct and select biomarker combinations for single-level prediction. Diagn Progn Res 2018; 2:8. [PMID: 31093558 PMCID: PMC6460803 DOI: 10.1186/s41512-018-0028-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/27/2017] [Accepted: 04/16/2018] [Indexed: 01/18/2023] Open
Abstract
BACKGROUND Biomarker studies may involve an ordinal outcome, such as no, mild, or severe disease. There is often interest in predicting one particular level of the outcome due to its clinical significance. METHODS A simple approach to constructing biomarker combinations in this context involves dichotomizing the outcome and using a binary logistic regression model. We assessed whether more sophisticated methods offer advantages over this simple approach. It is often necessary to select among several candidate biomarker combinations. One strategy involves selecting a combination based on its ability to predict the outcome level of interest. We propose an algorithm that leverages the ordinal outcome to inform combination selection. We apply this algorithm to data from a study of acute kidney injury after cardiac surgery, where kidney injury may be absent, mild, or severe. RESULTS Using more sophisticated modeling approaches to construct combinations provided gains over the simple binary logistic regression approach in specific settings. In the examples considered, the proposed algorithm for combination selection tended to reduce the impact of bias due to selection and to provide combinations with improved performance. CONCLUSIONS Methods that utilize the ordinal nature of the outcome in the construction and/or selection of biomarker combinations have the potential to yield better combinations.
Collapse
Affiliation(s)
- Allison Meisner
- 0000 0001 2171 9311grid.21107.35Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD USA
| | - Chirag R. Parikh
- 0000000419368710grid.47100.32Program of Applied Translational Research, Department of Medicine, Yale School of Medicine, New Haven, CT USA
- Department of Internal Medicine, Veterans Affairs Medical Center, West Haven, CT USA
| | - Kathleen F. Kerr
- 0000000122986657grid.34477.33Department of Biostatistics, University of Washington, Seattle, WA USA
| |
Collapse
|
25
|
Zachariasse JM, Nieboer D, Oostenbrink R, Moll HA, Steyerberg EW. Multiple performance measures are needed to evaluate triage systems in the emergency department. J Clin Epidemiol 2017; 94:27-34. [PMID: 29154810 DOI: 10.1016/j.jclinepi.2017.11.004] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2017] [Revised: 09/15/2017] [Accepted: 11/08/2017] [Indexed: 10/18/2022]
Abstract
OBJECTIVES Emergency department triage systems can be considered prediction rules with an ordinal outcome, where different directions of misclassification have different clinical consequences. We evaluated strategies to compare the performance of triage systems and aimed to propose a set of performance measures that should be used in future studies. STUDY DESIGN AND SETTING We identified performance measures based on literature review and expert knowledge. Their properties are illustrated in a case study evaluating two triage modifications in a cohort of 14,485 pediatric emergency department visits. Strengths and weaknesses of the performance measures were systematically appraised. RESULTS Commonly reported performance measures are measures of statistical association (34/60 studies) and diagnostic accuracy (17/60 studies). The case study illustrates that none of the performance measures fulfills all criteria for triage evaluation. Decision curves are the performance measures with the most attractive features but require dichotomization. In addition, paired diagnostic accuracy measures can be recommended for dichotomized analysis, and the triage-weighted kappa and Nagelkerke's R2 for ordinal analyses. Other performance measures provide limited additional information. CONCLUSION When comparing modifications of triage systems, decision curves and diagnostic accuracy measures should be used in a dichotomized analysis, and the triage-weighted kappa and Nagelkerke's R2 in an ordinal approach.
Collapse
Affiliation(s)
- Joany M Zachariasse
- Department of General Paediatrics, Erasmus MC-Sophia Children's Hospital, University Medical Center Rotterdam, P.O. Box 2040, 3000 CB, Rotterdam, The Netherlands
| | - Daan Nieboer
- Department of Public Health, Erasmus MC, University Medical Center Rotterdam, P.O. Box 2040, 3000 CA, Rotterdam, The Netherlands
| | - Rianne Oostenbrink
- Department of General Paediatrics, Erasmus MC-Sophia Children's Hospital, University Medical Center Rotterdam, P.O. Box 2040, 3000 CB, Rotterdam, The Netherlands
| | - Henriëtte A Moll
- Department of General Paediatrics, Erasmus MC-Sophia Children's Hospital, University Medical Center Rotterdam, P.O. Box 2040, 3000 CB, Rotterdam, The Netherlands
| | - Ewout W Steyerberg
- Department of Public Health, Erasmus MC, University Medical Center Rotterdam, P.O. Box 2040, 3000 CA, Rotterdam, The Netherlands.
| |
Collapse
|
26
|
Melesse DY, Lix LM, Nugent Z, Targownik LE, Singh H, Blanchard JF, Bernstein CN. Estimates of Disease Course in Inflammatory Bowel Disease Using Administrative Data: A Population-level Study. J Crohns Colitis 2017; 11:562-570. [PMID: 28453762 DOI: 10.1093/ecco-jcc/jjw201] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/18/2016] [Accepted: 11/02/2016] [Indexed: 02/08/2023]
Abstract
BACKGROUND AND AIMS We sought develop a predictive model of disease course in inflammatory bowel disease [IBD] using health care utilization measures from administrative health data, and to apply this model to estimate disease course at a population level over time. METHODS Study participants were IBD patients who were prospectively followed for a 1-year period between 2009 and 2010 in a Canadian clinic setting to assess their IBD disease course [i.e. remission, mild, moderate, severe]. Clinic data were linked with population-based administrative health data. A multivariable partial proportional odds model tested health care utilization measures that discriminated disease course groups. The model was applied to project the distribution of disease course for the Manitoba IBD population for 1995-2013. RESULTS There were 407 participants (54.3% females, 64.4% Crohn's disease [CD]) with mean age at diagnosis of 29.8 years [SD 14.9]. Forty-one per cent of participants were clinically in remission, while 14.0% had severe IBD. Mild, moderate or severe disease was associated with three or more gastroenterologist visits (odds ratio [OR] = 3.33, 95% confidence interval [CI]: 2.03-5.54) or three or more general practitioner visits [OR = 2.97, 95% CI: 1.44-6.37] with an IBD diagnosis and ≥1 radiology test [OR = 2.22, 95% CI: 1.31-3.80]. The percentages of the Manitoba IBD population in remission rose steadily from 1995 to 2013 [43.6 to 59.9%], while the percentages of individuals with mild, moderate or severe disease declined. CONCLUSION This study demonstrated that health care utilization measures from administrative data can be used to predict disease course in the IBD population.
Collapse
Affiliation(s)
- Dessalegn Y Melesse
- University of Manitoba Inflammatory Bowel Disease Clinical and Research Centre, University of Manitoba, Canada
- Department of Community Health Sciences, College of Medicine, University of Manitoba, Canada
| | - Lisa M Lix
- University of Manitoba Inflammatory Bowel Disease Clinical and Research Centre, University of Manitoba, Canada
- Department of Community Health Sciences, College of Medicine, University of Manitoba, Canada
- George and Fay Yee Centre for Healthcare Innovation, University of Manitoba, Canada
| | - Zoann Nugent
- Department of Community Health Sciences, College of Medicine, University of Manitoba, Canada
- CancerCare Manitoba, University of Manitoba, Canada
| | - Laura E Targownik
- University of Manitoba Inflammatory Bowel Disease Clinical and Research Centre, University of Manitoba, Canada
- Department of Internal Medicine, College of Medicine, University of Manitoba, Canada
| | - Harminder Singh
- University of Manitoba Inflammatory Bowel Disease Clinical and Research Centre, University of Manitoba, Canada
- Department of Internal Medicine, College of Medicine, University of Manitoba, Canada
| | - James F Blanchard
- University of Manitoba Inflammatory Bowel Disease Clinical and Research Centre, University of Manitoba, Canada
- Department of Community Health Sciences, College of Medicine, University of Manitoba, Canada
| | - Charles N Bernstein
- University of Manitoba Inflammatory Bowel Disease Clinical and Research Centre, University of Manitoba, Canada
- Department of Internal Medicine, College of Medicine, University of Manitoba, Canada
| |
Collapse
|
27
|
Li J, Feng Q, Fine JP, Pencina MJ, Van Calster B. Nonparametric estimation and inference for polytomous discrimination index. Stat Methods Med Res 2017; 27:3092-3103. [DOI: 10.1177/0962280217692830] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Polytomous discrimination index is a novel and important diagnostic accuracy measure for multi-category classification. After reconstructing its probabilistic definition, we propose a nonparametric approach to the estimation of polytomous discrimination index based on an empirical sample of biomarker values. In this paper, we provide the finite-sample and asymptotic properties of the proposed estimators and such analytic results may facilitate the statistical inference. Simulation studies are performed to examine the performance of the nonparametric estimators. Two real data examples are analysed to illustrate our methodology.
Collapse
Affiliation(s)
- Jialiang Li
- National University of Singapore, Singapore, Singapore
- Duke-NUS Graduate Medical School, Singapore, Singapore
- Singapore Eye Research Institute, Singapore, Singapore
| | - Qunqiang Feng
- National University of Singapore, Singapore, Singapore
- University of Science and Technology of China, Hefei Shi, China
| | | | | | | |
Collapse
|
28
|
Levine AC, Glavis-Bloom J, Modi P, Nasrin S, Atika B, Rege S, Robertson S, Schmid CH, Alam NH. External validation of the DHAKA score and comparison with the current IMCI algorithm for the assessment of dehydration in children with diarrhoea: a prospective cohort study. LANCET GLOBAL HEALTH 2016; 4:e744-51. [PMID: 27567350 DOI: 10.1016/s2214-109x(16)30150-4] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/23/2016] [Revised: 06/23/2016] [Accepted: 07/01/2016] [Indexed: 02/06/2023]
Abstract
BACKGROUND Dehydration due to diarrhoea is a leading cause of child death worldwide, yet no clinical tools for assessing dehydration have been validated in resource-limited settings. The Dehydration: Assessing Kids Accurately (DHAKA) score was derived for assessing dehydration in children with diarrhoea in a low-income country setting. In this study, we aimed to externally validate the DHAKA score in a new population of children and compare its accuracy and reliability to the current Integrated Management of Childhood Illness (IMCI) algorithm. METHODS DHAKA was a prospective cohort study done in children younger than 60 months presenting to the International Centre for Diarrhoeal Disease Research, Bangladesh, with acute diarrhoea (defined by WHO as three or more loose stools per day for less than 14 days). Local nurses assessed children and classified their dehydration status using both the DHAKA score and the IMCI algorithm. Serial weights were obtained and dehydration status was established by percentage weight change with rehydration. We did regression analyses to validate the DHAKA score and compared the accuracy and reliability of the DHAKA score and IMCI algorithm with receiver operator characteristic (ROC) curves and the weighted κ statistic. This study was registered with ClinicalTrials.gov, number NCT02007733. FINDINGS Between March 22, 2015, and May 15, 2015, 496 patients were included in our primary analyses. On the basis of our criterion standard, 242 (49%) of 496 children had no dehydration, 184 (37%) of 496 had some dehydration, and 70 (14%) of 496 had severe dehydration. In multivariable regression analyses, each 1-point increase in the DHAKA score predicted an increase of 0·6% in the percentage dehydration of the child and increased the odds of both some and severe dehydration by a factor of 1·4. Both the accuracy and reliability of the DHAKA score were significantly greater than those of the IMCI algorithm. INTERPRETATION The DHAKA score is the first clinical tool for assessing dehydration in children with acute diarrhoea to be externally validated in a low-income country. Further validation studies in a diverse range of settings and paediatric populations are warranted. FUNDING National Institutes of Health Fogarty International Center.
Collapse
Affiliation(s)
- Adam C Levine
- Warren Alpert Medical School of Brown University, Providence, RI, USA.
| | | | - Payal Modi
- University of Massachusetts Medical School, Worcester, MA, USA
| | - Sabiha Nasrin
- International Centre for Diarrhoeal Disease Research, Bangladesh, Dhaka, Bangladesh
| | - Bita Atika
- International Centre for Diarrhoeal Disease Research, Bangladesh, Dhaka, Bangladesh
| | - Soham Rege
- Warren Alpert Medical School of Brown University, Providence, RI, USA
| | - Sarah Robertson
- Department of Biostatistics, Brown University School of Public Health, Providence, RI, USA
| | - Christopher H Schmid
- Department of Biostatistics, Brown University School of Public Health, Providence, RI, USA
| | - Nur H Alam
- International Centre for Diarrhoeal Disease Research, Bangladesh, Dhaka, Bangladesh
| |
Collapse
|
29
|
Mijderwijk H, Stolker RJ, Duivenvoorden HJ, Klimek M, Steyerberg EW. Clinical prediction model to identify vulnerable patients in ambulatory surgery: towards optimal medical decision-making. Can J Anaesth 2016; 63:1022-32. [PMID: 27282374 DOI: 10.1007/s12630-016-0673-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2015] [Revised: 04/22/2016] [Accepted: 05/12/2016] [Indexed: 10/21/2022] Open
Abstract
BACKGROUND Ambulatory surgery patients are at risk of adverse psychological outcomes such as anxiety, aggression, fatigue, and depression. We developed and validated a clinical prediction model to identify patients who were vulnerable to these psychological outcome parameters. METHODS We prospectively assessed 383 mixed ambulatory surgery patients for psychological vulnerability, defined as the presence of anxiety (state/trait), aggression (state/trait), fatigue, and depression seven days after surgery. Three psychological vulnerability categories were considered-i.e., none, one, or multiple poor scores, defined as a score exceeding one standard deviation above the mean for each single outcome according to normative data. The following determinants were assessed preoperatively: sociodemographic (age, sex, level of education, employment status, marital status, having children, religion, nationality), medical (heart rate and body mass index), and psychological variables (self-esteem and self-efficacy), in addition to anxiety, aggression, fatigue, and depression. A prediction model was constructed using ordinal polytomous logistic regression analysis, and bootstrapping was applied for internal validation. The ordinal c-index (ORC) quantified the discriminative ability of the model, in addition to measures for overall model performance (Nagelkerke's R (2) ). RESULTS In this population, 137 (36%) patients were identified as being psychologically vulnerable after surgery for at least one of the psychological outcomes. The most parsimonious and optimal prediction model combined sociodemographic variables (level of education, having children, and nationality) with psychological variables (trait anxiety, state/trait aggression, fatigue, and depression). Model performance was promising: R (2) = 30% and ORC = 0.76 after correction for optimism. CONCLUSION This study identified a substantial group of vulnerable patients in ambulatory surgery. The proposed clinical prediction model could allow healthcare professionals the opportunity to identify vulnerable patients in ambulatory surgery, although additional modification and validation are needed. (ClinicalTrials.gov number, NCT01441843).
Collapse
Affiliation(s)
- Herjan Mijderwijk
- Department of Anesthesiology, Erasmus University Medical Center, Room HS-203, PO Box 2040, 3000 CA, Rotterdam, The Netherlands.
| | - Robert Jan Stolker
- Department of Anesthesiology, Erasmus University Medical Center, Room HS-203, PO Box 2040, 3000 CA, Rotterdam, The Netherlands
| | - Hugo J Duivenvoorden
- Department of Anesthesiology, Erasmus University Medical Center, Room HS-203, PO Box 2040, 3000 CA, Rotterdam, The Netherlands
| | - Markus Klimek
- Department of Anesthesiology, Erasmus University Medical Center, Room HS-203, PO Box 2040, 3000 CA, Rotterdam, The Netherlands
| | - Ewout W Steyerberg
- Department of Public Health, Erasmus University Medical Center, Rotterdam, The Netherlands
| |
Collapse
|
30
|
Chang CH, Lin LC, Chen IC, Yen CH. Assessing the suitability of sets-based approaches: estimating the discriminative power of risk models for ordinal outcome treatments. Int Health 2015; 9:69-75. [PMID: 26409872 DOI: 10.1093/inthealth/ihv058] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2015] [Revised: 06/29/2015] [Accepted: 07/14/2015] [Indexed: 11/14/2022] Open
Abstract
BACKGROUND In order to evaluate the discrimination performance of an ordinal model for improved disease screening, a new test was proposed where information was obtained across all samples simultaneously. METHODS The ordinal c-index builds upon the volume under the surface methodology without focusing on the accompanying receiver operating characteristic surfaces. However, it can be simplified to an average of pairwise c-indexes. In this paper, a set-based estimate (information was obtained across all samples simultaneously) was proposed by summing all correctly ordered groups. The asymptotic distribution of this proposed estimate was derived using U-statistics. RESULTS A predictive model was applied using the blood urea nitrogen/creatinine ratio to discriminate stroke in evolution in acute ischemic stroke patients, which could potentially be life-saving in emergency departments. CONCLUSIONS By conducting Monte Carlo simulations, it was concluded that the measure proposed herein is a better choice for clinical use because of the asymmetry of the predicted probabilities of groups.
Collapse
Affiliation(s)
- Chia-Hao Chang
- College of Nursing & the Chronic Diseases and Health Promotion Research Center, Chang Gung University of Science and Technology, Chiayi Campus, Chiayi, Taiwan .,Department of Nursing, Chang Gung University of Science and Technology, Chiayi Campus, Chiayi, Taiwan
| | - Leng-Chieh Lin
- Department of Nursing, Chang Gung University of Science and Technology, Chiayi Campus, Chiayi, Taiwan.,Department of Emergency Medicine, Chang Gung Memorial Hospital, Chiayi, Taiwan
| | - I-Chuan Chen
- Department of Nursing, Chang Gung University of Science and Technology, Chiayi Campus, Chiayi, Taiwan.,Department of Emergency Medicine, Chang Gung Memorial Hospital, Chiayi, Taiwan
| | - Ching-Ho Yen
- Department of Industrial Engineering & Management Information, Huafan University, New Taipei City, Taiwan
| |
Collapse
|
31
|
Hsu MJ, Chen YH. Optimal linear combination of biomarkers for multi-category diagnosis. Stat Med 2015; 35:202-13. [DOI: 10.1002/sim.6622] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2014] [Revised: 06/02/2015] [Accepted: 07/26/2015] [Indexed: 11/11/2022]
Affiliation(s)
- Man-Jen Hsu
- Institute of Statistical Science; Academia Sinica; Taipei 11529 Taiwan
| | - Yi-Hau Chen
- Institute of Statistical Science; Academia Sinica; Taipei 11529 Taiwan
| |
Collapse
|
32
|
Reeuwijk KG, Robroek SJW, Niessen MAJ, Kraaijenhagen RA, Vergouwe Y, Burdorf A. The Prognostic Value of the Work Ability Index for Sickness Absence among Office Workers. PLoS One 2015; 10:e0126969. [PMID: 26017387 PMCID: PMC4446207 DOI: 10.1371/journal.pone.0126969] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2014] [Accepted: 04/09/2015] [Indexed: 11/18/2022] Open
Abstract
BACKGROUND The work ability index (WAI) is a frequently used tool in occupational health to identify workers at risk for a reduced work performance and for work-related disability. However, information about the prognostic value of the WAI to identify workers at risk for sickness absence is scarce. OBJECTIVES To investigate the prognostic value of the WAI for sickness absence, and whether the discriminative ability differs across demographic subgroups. METHODS At baseline, the WAI (score 7-49) was assessed among 1,331 office workers from a Dutch financial service company. Sickness absence was registered during 12-months follow-up and categorised as 0 days, 0<days<5, 5≤days<15, and ≥15 days in one year. Associations between WAI and sickness absence were estimated by multinomial regression analyses. Discriminative ability of the WAI was assessed by the Area Under the Curve (AUC) and Ordinal c-index (ORC). Test characteristics were determined for dichotomised outcomes. Additional analyses were performed for separate WAI dimensions, and subgroup analyses for demographic groups. RESULTS A lower WAI was associated with sickness absence (≥15 days vs. 0 days: per point lower WAI score OR=1.27; 95%CI 1.21-1.33). The WAI showed reasonable ability to discriminate between categories of sickness absence (ORC=0.65; 95%CI 0.63-0.68). Highest discrimination was found for comparing workers with ≥15 sick days with 0 sick days (AUC=0.77) or with 1-5 sick days (AUC=0.69). At the cut-off for poor work ability (WAI≤27) the sensitivity to identify workers at risk for ≥15 sick days was 7.5%, the specificity 99.6%, and the positive predictive value 82%. The performance was similar across demographic subgroups. CONCLUSIONS The WAI could be used to identify workers at high risk for prolonged sickness absence. However, due to low sensitivity many workers will be missed. Hence, additional factors are required to better identify workers at highest risk.
Collapse
Affiliation(s)
| | - Suzan J. W. Robroek
- Department of Public Health, Erasmus MC, Rotterdam, The Netherlands
- * E-mail:
| | | | | | - Yvonne Vergouwe
- Department of Public Health, Erasmus MC, Rotterdam, The Netherlands
| | - Alex Burdorf
- Department of Public Health, Erasmus MC, Rotterdam, The Netherlands
| |
Collapse
|
33
|
Wu Y, Jiang X, Wang S, Jiang W, Li P, Ohno-Machado L. Grid multi-category response logistic models. BMC Med Inform Decis Mak 2015; 15:10. [PMID: 25886151 PMCID: PMC4342889 DOI: 10.1186/s12911-015-0133-y] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2014] [Accepted: 01/15/2015] [Indexed: 11/28/2022] Open
Abstract
Background Multi-category response models are very important complements to binary logistic models in medical decision-making. Decomposing model construction by aggregating computation developed at different sites is necessary when data cannot be moved outside institutions due to privacy or other concerns. Such decomposition makes it possible to conduct grid computing to protect the privacy of individual observations. Methods This paper proposes two grid multi-category response models for ordinal and multinomial logistic regressions. Grid computation to test model assumptions is also developed for these two types of models. In addition, we present grid methods for goodness-of-fit assessment and for classification performance evaluation. Results Simulation results show that the grid models produce the same results as those obtained from corresponding centralized models, demonstrating that it is possible to build models using multi-center data without losing accuracy or transmitting observation-level data. Two real data sets are used to evaluate the performance of our proposed grid models. Conclusions The grid fitting method offers a practical solution for resolving privacy and other issues caused by pooling all data in a central site. The proposed method is applicable for various likelihood estimation problems, including other generalized linear models. Electronic supplementary material The online version of this article (doi:10.1186/s12911-015-0133-y) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Yuan Wu
- Department of Biostatistics & Bioinformatics, Duke University, Durham, NC, 27708, USA.
| | - Xiaoqian Jiang
- Division of Biomedical Informatics, Department of Medicine, University of California, San Diego, La Jolla, CA, 92093, USA
| | - Shuang Wang
- Division of Biomedical Informatics, Department of Medicine, University of California, San Diego, La Jolla, CA, 92093, USA
| | - Wenchao Jiang
- Department of Electronic Engineering, Shanghai Jiaotong University, Shanghai, 200240, China
| | - Pinghao Li
- Department of Electronic Engineering, Shanghai Jiaotong University, Shanghai, 200240, China
| | - Lucila Ohno-Machado
- Division of Biomedical Informatics, Department of Medicine, University of California, San Diego, La Jolla, CA, 92093, USA
| |
Collapse
|
34
|
Thompson DD, Murray GD, Sudlow CLM, Dennis M, Whiteley WN. Comparison of statistical and clinical predictions of functional outcome after ischemic stroke. PLoS One 2014; 9:e110189. [PMID: 25299053 PMCID: PMC4192583 DOI: 10.1371/journal.pone.0110189] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2014] [Accepted: 09/09/2014] [Indexed: 11/18/2022] Open
Abstract
Background To determine whether the predictions of functional outcome after ischemic stroke made at the bedside using a doctor’s clinical experience were more or less accurate than the predictions made by clinical prediction models (CPMs). Methods and Findings A prospective cohort study of nine hundred and thirty one ischemic stroke patients recruited consecutively at the outpatient, inpatient and emergency departments of the Western General Hospital, Edinburgh between 2002 and 2005. Doctors made informal predictions of six month functional outcome on the Oxford Handicap Scale (OHS). Patients were followed up at six months with a validated postal questionnaire. For each patient we calculated the absolute predicted risk of death or dependence (OHS≥3) using five previously described CPMs. The specificity of a doctor’s informal predictions of OHS≥3 at six months was good 0.96 (95% CI: 0.94 to 0.97) and similar to CPMs (range 0.94 to 0.96); however the sensitivity of both informal clinical predictions 0.44 (95% CI: 0.39 to 0.49) and clinical prediction models (range 0.38 to 0.45) was poor. The prediction of the level of disability after stroke was similar for informal clinical predictions (ordinal c-statistic 0.74 with 95% CI 0.72 to 0.76) and CPMs (range 0.69 to 0.75). No patient or clinician characteristic affected the accuracy of informal predictions, though predictions were more accurate in outpatients. Conclusions CPMs are at least as good as informal clinical predictions in discriminating between good and bad functional outcome after ischemic stroke. The place of these models in clinical practice has yet to be determined.
Collapse
Affiliation(s)
- Douglas D. Thompson
- Centre for Population Health Sciences, University of Edinburgh, Edinburgh, United Kingdom
- * E-mail:
| | - Gordon D. Murray
- Centre for Population Health Sciences, University of Edinburgh, Edinburgh, United Kingdom
| | - Cathie L. M. Sudlow
- Centre for Clinical Brain Sciences, University of Edinburgh, Edinburgh, United Kingdom
| | - Martin Dennis
- Centre for Clinical Brain Sciences, University of Edinburgh, Edinburgh, United Kingdom
| | - William N. Whiteley
- Centre for Clinical Brain Sciences, University of Edinburgh, Edinburgh, United Kingdom
| |
Collapse
|
35
|
Coolen-Maturi T, Elkhafifi FF, Coolen FP. Three-group ROC analysis: A nonparametric predictive approach. Comput Stat Data Anal 2014. [DOI: 10.1016/j.csda.2014.04.005] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
36
|
Parikh CR, Thiessen-Philbrook H. Key concepts and limitations of statistical methods for evaluating biomarkers of kidney disease. J Am Soc Nephrol 2014; 25:1621-9. [PMID: 24790177 DOI: 10.1681/asn.2013121300] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
Interest in developing and using novel markers of kidney injury is increasing. To maintain scientific rigour in these endeavors, a comprehensive understanding of statistical methodology is required to rigorously assess the incremental value of novel biomarkers in existing clinical risk prediction models. Such knowledge is especially relevant, because no single statistical method is sufficient to evaluate a novel biomarker. In this review, we highlight the strengths and limitations of various traditional and novel statistical methods used in the literature for biomarker studies and use biomarkers of AKI as examples to show limitations of some popular statistical methods.
Collapse
Affiliation(s)
- Chirag R Parikh
- Section of Nephrology, Yale University School of Medicine, Veterans Affairs Connecticut Healthcare System and the Program of Applied Translational Research, New Haven, Connecticut; and
| | | |
Collapse
|
37
|
Van Hoorde K, Vergouwe Y, Timmerman D, Van Huffel S, Steyerberg EW, Van Calster B. Assessing calibration of multinomial risk prediction models. Stat Med 2014; 33:2585-96. [PMID: 24549725 DOI: 10.1002/sim.6114] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2013] [Revised: 12/16/2013] [Accepted: 01/27/2014] [Indexed: 11/07/2022]
Abstract
Calibration, that is, whether observed outcomes agree with predicted risks, is important when evaluating risk prediction models. For dichotomous outcomes, several tools exist to assess different aspects of model calibration, such as calibration-in-the-large, logistic recalibration, and (non-)parametric calibration plots. We aim to extend these tools to prediction models for polytomous outcomes. We focus on models developed using multinomial logistic regression (MLR): outcome Y with k categories is predicted using k - 1 equations comparing each category i (i = 2, … ,k) with reference category 1 using a set of predictors, resulting in k - 1 linear predictors. We propose a multinomial logistic recalibration framework that involves an MLR fit where Y is predicted using the k - 1 linear predictors from the prediction model. A non-parametric alternative may use vector splines for the effects of the linear predictors. The parametric and non-parametric frameworks can be used to generate multinomial calibration plots. Further, the parametric framework can be used for the estimation and statistical testing of calibration intercepts and slopes. Two illustrative case studies are presented, one on the diagnosis of malignancy of ovarian tumors and one on residual mass diagnosis in testicular cancer patients treated with cisplatin-based chemotherapy. The risk prediction models were developed on data from 2037 and 544 patients and externally validated on 1107 and 550 patients, respectively. We conclude that calibration tools can be extended to polytomous outcomes. The polytomous calibration plots are particularly informative through the visual summary of the calibration performance.
Collapse
Affiliation(s)
- Kirsten Van Hoorde
- KU Leuven Department of Electrical Engineering (ESAT), STADIUS Center for Dynamical Systems, Signal Processing, and Data Analytics, Leuven, Belgium; KU Leuven iMinds Future Health Department, Leuven, Belgium
| | | | | | | | | | | |
Collapse
|
38
|
PET/CT in the staging of patients with a pelvic mass suspicious for ovarian cancer. Gynecol Oncol 2013; 131:694-700. [DOI: 10.1016/j.ygyno.2013.08.020] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2013] [Revised: 08/16/2013] [Accepted: 08/17/2013] [Indexed: 11/19/2022]
|
39
|
Assessing the discriminative ability of risk models for more than two outcome categories. Eur J Epidemiol 2012; 27:761-70. [PMID: 23054032 DOI: 10.1007/s10654-012-9733-3] [Citation(s) in RCA: 66] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2012] [Accepted: 09/14/2012] [Indexed: 12/21/2022]
Abstract
The discriminative ability of risk models for dichotomous outcomes is often evaluated with the concordance index (c-index). However, many medical prediction problems are polytomous, meaning that more than two outcome categories need to be predicted. Unfortunately such problems are often dichotomized in prediction research. We present a perspective on the evaluation of discriminative ability of polytomous risk models, which may instigate researchers to consider polytomous prediction models more often. First, we suggest a "discrimination plot" as a tool to visualize the model's discriminative ability. Second, we discuss the use of one overall polytomous c-index versus a set of dichotomous measures to summarize the performance of the model. Third, we address several aspects to consider when constructing a polytomous c-index. These involve the assessment of concordance in pairs versus sets of patients, weighting by outcome prevalence, the value related to models with random performance, the reduction to the dichotomous c-index for dichotomous problems, and interpretation. We illustrate these issues on case studies dealing with ovarian cancer (four outcome categories) and testicular cancer (three categories). We recommend the use of a discrimination plot together with an overall c-index such as the Polytomous Discrimination Index. If the overall c-index suggests that the model has relevant discriminative ability, pairwise c-indexes for each pair of outcome categories are informative. For pairwise c-indexes we recommend the 'conditional-risk' method which is consistent with the analytical approach of the multinomial logistic regression used to develop polytomous risk models.
Collapse
|