Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Van Calster B, van Smeden M, De Cock B, Steyerberg EW. Regression shrinkage methods for clinical prediction models do not guarantee improved performance: Simulation study. Stat Methods Med Res 2020;29:3166-3178. [PMID: 32401702 DOI: 10.1177/0962280220921415] [Citation(s) in RCA: 52] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

For:	Van Calster B, van Smeden M, De Cock B, Steyerberg EW. Regression shrinkage methods for clinical prediction models do not guarantee improved performance: Simulation study. Stat Methods Med Res 2020;29:3166-3178. [PMID: 32401702 DOI: 10.1177/0962280220921415] [Citation(s) in RCA: 52] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Number

Cited by Other Article(s)

Ali F, Clark H, Machulda M, Senjem ML, Lowe VJ, Jack CR, Josephs KA, Whitwell J, Botha H. Patterns of brain volume and metabolism predict clinical features in the progressive supranuclear palsy spectrum. Brain Commun 2024;6:fcae233. [PMID: 39056025 PMCID: PMC11272075 DOI: 10.1093/braincomms/fcae233] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2023] [Revised: 03/26/2024] [Accepted: 07/14/2024] [Indexed: 07/28/2024] Open

Abstract

Progressive supranuclear palsy (PSP) is a neurodegenerative tauopathy that presents with highly heterogenous clinical syndromes. We perform cross-sectional data-driven discovery of independent patterns of brain atrophy and hypometabolism across the entire PSP spectrum. We then use these patterns to predict specific clinical features and to assess their relationship to phenotypic heterogeneity. We included 111 patients with PSP (60 with Richardson syndrome and 51 with cortical and subcortical variant subtypes). Ninety-one were used as the training set and 20 as a test set. The presence and severity of granular clinical variables such as postural instability, parkinsonism, apraxia and supranuclear gaze palsy were noted. Domains of akinesia, ocular motor impairment, postural instability and cognitive dysfunction as defined by the Movement Disorders Society criteria for PSP were also recorded. Non-negative matrix factorization was used on cross-sectional MRI and fluorodeoxyglucose-positron emission tomography (FDG-PET) scans. Independent models for each as well as a combined model for MRI and FDG-PET were developed and used to predict the granular clinical variables. Both MRI and FDG-PET were better at predicting presence of a symptom than severity, suggesting identification of disease state may be more robust than disease stage. FDG-PET predicted predominantly cortical abnormalities better than MRI such as ideomotor apraxia, apraxia of speech and frontal dysexecutive syndrome. MRI demonstrated prediction of cortical and more so sub-cortical abnormalities, such as parkinsonism. Distinct neuroanatomical foci were predictive in MRI- and FDG-PET-based models. For example, vertical gaze palsy was predicted by midbrain atrophy on MRI, but frontal eye field hypometabolism on FDG-PET. Findings also differed by scale or instrument used. For example, prediction of ocular motor abnormalities using the PSP Saccadic Impairment Scale was stronger than with the Movement Disorders Society Diagnostic criteria for PSP oculomotor impairment designation. Combination of MRI and FDG-PET demonstrated enhanced detection of parkinsonism and frontal syndrome presence and apraxia, cognitive impairment and bradykinesia severity. Both MRI and FDG-PET patterns were able to predict some measures in the test set; however, prediction of global cognition measured by Montreal Cognitive Assessment was the strongest. MRI predictions generalized more robustly to the test set. PSP leads to neurodegeneration in motor, cognitive and ocular motor networks at cortical and subcortical foci, leading to diverse yet overlapping clinical syndromes. To advance understanding of phenotypic heterogeneity in PSP, it is essential to consider data-driven approaches to clinical neuroimaging analyses.

Collapse

Pavlou M, Ambler G, Qu C, Seaman SR, White IR, Omar RZ. An evaluation of sample size requirements for developing risk prediction models with binary outcomes. BMC Med Res Methodol 2024;24:146. [PMID: 38987715 PMCID: PMC11234534 DOI: 10.1186/s12874-024-02268-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2023] [Accepted: 06/24/2024] [Indexed: 07/12/2024] Open

Abstract

BACKGROUND

Risk prediction models are routinely used to assist in clinical decision making. A small sample size for model development can compromise model performance when the model is applied to new patients. For binary outcomes, the calibration slope (CS) and the mean absolute prediction error (MAPE) are two key measures on which sample size calculations for the development of risk models have been based. CS quantifies the degree of model overfitting while MAPE assesses the accuracy of individual predictions.

METHODS

Recently, two formulae were proposed to calculate the sample size required, given anticipated features of the development data such as the outcome prevalence and c-statistic, to ensure that the expectation of the CS and MAPE (over repeated samples) in models fitted using MLE will meet prespecified target values. In this article, we use a simulation study to evaluate the performance of these formulae.

RESULTS

We found that both formulae work reasonably well when the anticipated model strength is not too high (c-statistic < 0.8), regardless of the outcome prevalence. However, for higher model strengths the CS formula underestimates the sample size substantially. For example, for c-statistic = 0.85 and 0.9, the sample size needed to be increased by at least 50% and 100%, respectively, to meet the target expected CS. On the other hand, the MAPE formula tends to overestimate the sample size for high model strengths. These conclusions were more pronounced for higher prevalence than for lower prevalence. Similar results were drawn when the outcome was time to event with censoring. Given these findings, we propose a simulation-based approach, implemented in the new R package 'samplesizedev', to correctly estimate the sample size even for high model strengths. The software can also calculate the variability in CS and MAPE, thus allowing for assessment of model stability.

CONCLUSIONS

The calibration and MAPE formulae suggest sample sizes that are generally appropriate for use when the model strength is not too high. However, they tend to be biased for higher model strengths, which are not uncommon in clinical risk prediction studies. On those occasions, our proposed adjustments to the sample size calculations will be relevant.

Collapse

Chalkou K, Hamza T, Benkert P, Kuhle J, Zecca C, Simoneau G, Pellegrini F, Manca A, Egger M, Salanti G. Combining randomized and non-randomized data to predict heterogeneous effects of competing treatments. Res Synth Methods 2024;15:641-656. [PMID: 38501273 DOI: 10.1002/jrsm.1717] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2022] [Revised: 01/26/2024] [Accepted: 02/16/2024] [Indexed: 03/20/2024]

Pavlou M, Omar RZ, Ambler G. Penalized Regression Methods With Modified Cross-Validation and Bootstrap Tuning Produce Better Prediction Models. Biom J 2024;66:e202300245. [PMID: 38922968 DOI: 10.1002/bimj.202300245] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2023] [Revised: 04/22/2024] [Accepted: 05/06/2024] [Indexed: 06/28/2024]

Abstract

Risk prediction models fitted using maximum likelihood estimation (MLE) are often overfitted resulting in predictions that are too extreme and a calibration slope (CS) less than 1. Penalized methods, such as Ridge and Lasso, have been suggested as a solution to this problem as they tend to shrink regression coefficients toward zero, resulting in predictions closer to the average. The amount of shrinkage is regulated by a tuning parameter,λ , $\lambda ,$ commonly selected via cross-validation ("standard tuning"). Though penalized methods have been found to improve calibration on average, they often over-shrink and exhibit large variability in the selected λ $\lambda $ and hence the CS. This is a problem, particularly for small sample sizes, but also when using sample sizes recommended to control overfitting. We consider whether these problems are partly due to selecting λ $\lambda $ using cross-validation with "training" datasets of reduced size compared to the original development sample, resulting in an over-estimation of λ $\lambda $ and, hence, excessive shrinkage. We propose a modified cross-validation tuning method ("modified tuning"), which estimates λ $\lambda $ from a pseudo-development dataset obtained via bootstrapping from the original dataset, albeit of larger size, such that the resulting cross-validation training datasets are of the same size as the original dataset. Modified tuning can be easily implemented in standard software and is closely related to bootstrap selection of the tuning parameter ("bootstrap tuning"). We evaluated modified and bootstrap tuning for Ridge and Lasso in simulated and real data using recommended sample sizes, and sizes slightly lower and higher. They substantially improved the selection of λ $\lambda $ , resulting in improved CS compared to the standard tuning method. They also improved predictions compared to MLE.

Collapse

Gutman R, Karavani E, Shimoni Y. Improving Inverse Probability Weighting by Post-calibrating Its Propensity Scores. Epidemiology 2024;35:473-480. [PMID: 38619218 PMCID: PMC11191550 DOI: 10.1097/ede.0000000000001733] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2023] [Accepted: 03/18/2024] [Indexed: 04/16/2024]

Fan Q, Wang Y, Cheng J, Pan B, Zang X, Liu R, Deng Y. Single-cell RNA-seq reveals T cell exhaustion and immune response landscape in osteosarcoma. Front Immunol 2024;15:1362970. [PMID: 38629071 PMCID: PMC11018946 DOI: 10.3389/fimmu.2024.1362970] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2023] [Accepted: 03/18/2024] [Indexed: 04/19/2024] Open

Dunias ZS, Van Calster B, Timmerman D, Boulesteix AL, van Smeden M. A comparison of hyperparameter tuning procedures for clinical prediction models: A simulation study. Stat Med 2024;43:1119-1134. [PMID: 38189632 DOI: 10.1002/sim.9932] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2022] [Revised: 09/10/2023] [Accepted: 09/21/2023] [Indexed: 01/09/2024]

Abstract

Tuning hyperparameters, such as the regularization parameter in Ridge or Lasso regression, is often aimed at improving the predictive performance of risk prediction models. In this study, various hyperparameter tuning procedures for clinical prediction models were systematically compared and evaluated in low-dimensional data. The focus was on out-of-sample predictive performance (discrimination, calibration, and overall prediction error) of risk prediction models developed using Ridge, Lasso, Elastic Net, or Random Forest. The influence of sample size, number of predictors and events fraction on performance of the hyperparameter tuning procedures was studied using extensive simulations. The results indicate important differences between tuning procedures in calibration performance, while generally showing similar discriminative performance. The one-standard-error rule for tuning applied to cross-validation (1SE CV) often resulted in severe miscalibration. Standard non-repeated and repeated cross-validation (both 5-fold and 10-fold) performed similarly well and outperformed the other tuning procedures. Bootstrap showed a slight tendency to more severe miscalibration than standard cross-validation-based tuning procedures. Differences between tuning procedures were larger for smaller sample sizes, lower events fractions and fewer predictors. These results imply that the choice of tuning procedure can have a profound influence on the predictive performance of prediction models. The results support the application of standard 5-fold or 10-fold cross-validation that minimizes out-of-sample prediction error. Despite an increased computational burden, we found no clear benefit of repeated over non-repeated cross-validation for hyperparameter tuning. We warn against the potentially detrimental effects on model calibration of the popular 1SE CV rule for tuning prediction models in low-dimensional settings.

Collapse

Hoogland J, Debray TPA, Crowther MJ, Riley RD, IntHout J, Reitsma JB, Zwinderman AH. Regularized parametric survival modeling to improve risk prediction models. Biom J 2024;66:e2200319. [PMID: 37775946 DOI: 10.1002/bimj.202200319] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2022] [Revised: 04/30/2023] [Accepted: 09/17/2023] [Indexed: 10/01/2023]

Lohmann A, Groenwold RHH, van Smeden M. Comparison of likelihood penalization and variance decomposition approaches for clinical prediction models: A simulation study. Biom J 2024;66:e2200108. [PMID: 37199142 DOI: 10.1002/bimj.202200108] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2022] [Revised: 09/30/2022] [Accepted: 11/10/2022] [Indexed: 05/19/2023]

Riley RD, Pate A, Dhiman P, Archer L, Martin GP, Collins GS. Clinical prediction models and the multiverse of madness. BMC Med 2023;21:502. [PMID: 38110939 PMCID: PMC10729337 DOI: 10.1186/s12916-023-03212-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/25/2023] [Accepted: 12/05/2023] [Indexed: 12/20/2023] Open

Abstract

BACKGROUND

Each year, thousands of clinical prediction models are developed to make predictions (e.g. estimated risk) to inform individual diagnosis and prognosis in healthcare. However, most are not reliable for use in clinical practice.

MAIN BODY

We discuss how the creation of a prediction model (e.g. using regression or machine learning methods) is dependent on the sample and size of data used to develop it-were a different sample of the same size used from the same overarching population, the developed model could be very different even when the same model development methods are used. In other words, for each model created, there exists a multiverse of other potential models for that sample size and, crucially, an individual's predicted value (e.g. estimated risk) may vary greatly across this multiverse. The more an individual's prediction varies across the multiverse, the greater the instability. We show how small development datasets lead to more different models in the multiverse, often with vastly unstable individual predictions, and explain how this can be exposed by using bootstrapping and presenting instability plots. We recommend healthcare researchers seek to use large model development datasets to reduce instability concerns. This is especially important to ensure reliability across subgroups and improve model fairness in practice.

CONCLUSIONS

Instability is concerning as an individual's predicted value is used to guide their counselling, resource prioritisation, and clinical decision making. If different samples lead to different models with very different predictions for the same individual, then this should cast doubt into using a particular model for that individual. Therefore, visualising, quantifying and reporting the instability in individual-level predictions is essential when proposing a new model.

Collapse

Riley RD, Collins GS. Stability of clinical prediction models developed using statistical or machine learning methods. Biom J 2023;65:e2200302. [PMID: 37466257 PMCID: PMC10952221 DOI: 10.1002/bimj.202200302] [Citation(s) in RCA: 14] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2022] [Revised: 04/26/2023] [Accepted: 05/02/2023] [Indexed: 07/20/2023]

Buick JE, Austin PC, Cheskes S, Ko DT, Atzema CL. Prediction models in prehospital and emergency medicine research: How to derive and internally validate a clinical prediction model. Acad Emerg Med 2023;30:1150-1160. [PMID: 37266925 DOI: 10.1111/acem.14756] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2023] [Revised: 05/24/2023] [Accepted: 05/29/2023] [Indexed: 06/03/2023]

Schmidt AF, Leinveber P, Panovsky R, Soukup L, Machac P, van de Leur RR, Sammani A, Lekadir K, Ter Riele A, Asselbergs FW, Boonstra MJ. DCM-PROGRESS: predicting end-stage heart failure in non-ischemic dilated cardiomyopathy patients. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.09.10.23295251. [PMID: 37745419 PMCID: PMC10516079 DOI: 10.1101/2023.09.10.23295251] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/26/2023]

Affiliation(s)

A F Schmidt Department of Cardiology, Amsterdam Cardiovascular Sciences, Amsterdam University Medical Centres, University of Amsterdam, Amsterdam, the Netherlands Institute of Cardiovascular Science, Faculty of Population Health, University College London, London, United Kingdom UCL British Heart Foundation Research Accelerator, London, United Kingdom Department of Cardiology, Division Heart and Lungs, University Medical Center Utrecht, Utrecht University, Utrecht, the Netherlands
P Leinveber International Clinical Research Center, St. Anne's University Hospital Brno, Brno, Czech Republic
R Panovsky Department of Internal Medicine-Cardioangiology, International Clinical Research Center, St. Anne's University Hospital Brno, Czech Republic International Clinical Research Center, Faculty of Medicine, Masaryk University, Brno, Czech Republic
L Soukup International Clinical Research Center, St. Anne's University Hospital Brno, Brno, Czech Republic
P Machac International Clinical Research Center, St. Anne's University Hospital Brno, Brno, Czech Republic
R R van de Leur Department of Cardiology, Division Heart and Lungs, University Medical Center Utrecht, Utrecht University, Utrecht, the Netherlands
A Sammani Department of Cardiology, Division Heart and Lungs, University Medical Center Utrecht, Utrecht University, Utrecht, the Netherlands
K Lekadir Department de Matemàtiques i Informàtica, Universitat de Barcelona, Barcelona, Spain
A Ter Riele Department of Cardiology, Division Heart and Lungs, University Medical Center Utrecht, Utrecht University, Utrecht, the Netherlands
F W Asselbergs Department of Cardiology, Amsterdam Cardiovascular Sciences, Amsterdam University Medical Centres, University of Amsterdam, Amsterdam, the Netherlands Department of Cardiology, Division Heart and Lungs, University Medical Center Utrecht, Utrecht University, Utrecht, the Netherlands Institute of Health Informatics, Faculty of Population Health, University College London, London, UK
M J Boonstra Department of Cardiology, Amsterdam Cardiovascular Sciences, Amsterdam University Medical Centres, University of Amsterdam, Amsterdam, the Netherlands Department of Cardiology, Division Heart and Lungs, University Medical Center Utrecht, Utrecht University, Utrecht, the Netherlands

Collapse

Rentroia-Pacheco B, Tokez S, Bramer EM, Venables ZC, van de Werken HJ, Bellomo D, van Klaveren D, Mooyaart AL, Hollestein LM, Wakkee M. Personalised decision making to predict absolute metastatic risk in cutaneous squamous cell carcinoma: development and validation of a clinico-pathological model. EClinicalMedicine 2023;63:102150. [PMID: 37662519 PMCID: PMC10468358 DOI: 10.1016/j.eclinm.2023.102150] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Revised: 07/14/2023] [Accepted: 07/25/2023] [Indexed: 09/05/2023] Open

Abstract

Background

Cutaneous squamous cell carcinoma (cSCC) is a common skin cancer, affecting more than 2 million people worldwide yearly and metastasising in 2-5% of patients. However, current clinical staging systems do not provide estimates of absolute metastatic risk, hence missing the opportunity for more personalised treatment advice. We aimed to develop a clinico-pathological model that predicts the probability of metastasis in patients with cSCC.

Methods

Nationwide cohorts from (1) all patients with a first primary cSCC in The Netherlands in 2007-2008 and (2) all patients with a cSCC in 2013-2015 in England were used to derive nested case-control cohorts. Pathology records of primary cSCCs that originated a loco-regional or distant metastasis were identified, and these cSCCs were matched to primary cSCCs of controls without metastasis (1:1 ratio). The model was developed on the Dutch cohort (n = 390) using a weighted Cox regression model with backward selection and validated on the English cohort (n = 696). Model performance was assessed using weighted versions of the C-index, calibration metrics, and decision curve analysis; and compared to the Brigham and Women's Hospital (BWH) and the American Joint Committee on Cancer (AJCC) staging systems. Members of the multidisciplinary Skin Cancer Outcomes (SCOUT) consortium were surveyed to interpret metastatic risk cutoffs in a clinical context.

Findings

Eight out of eleven clinico-pathological variables were selected. The model showed good discriminative ability, with an optimism-corrected C-index of 0.80 (95% Confidence interval (CI) 0.75-0.85) in the development cohort and a C-index of 0.84 (95% CI 0.81-0.87) in the validation cohort. Model predictions were well-calibrated: the calibration slope was 0.96 (95% CI 0.76-1.16) in the validation cohort. Decision curve analysis showed improved net benefit compared to current staging systems, particularly for thresholds relevant for decisions on follow-up and adjuvant treatment. The model is available as an online web-based calculator (https://emc-dermatology.shinyapps.io/cscc-abs-met-risk/).

Interpretation

This validated model assigns personalised metastatic risk predictions to patients with cSCC, using routinely reported histological and patient-specific risk factors. The model can empower clinicians and healthcare systems in identifying patients with high-risk cSCC and offering personalised care/treatment and follow-up. Use of the model for clinical decision-making in different patient populations must be further investigated.

Funding

PPP Allowance made available by Health-Holland, Top Sector Life Sciences & Health, to stimulate public-private partnerships.

Collapse

Dhiman P, Ma J, Qi C, Bullock G, Sergeant JC, Riley RD, Collins GS. Sample size requirements are not being considered in studies developing prediction models for binary outcomes: a systematic review. BMC Med Res Methodol 2023;23:188. [PMID: 37598153 PMCID: PMC10439652 DOI: 10.1186/s12874-023-02008-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2023] [Accepted: 08/04/2023] [Indexed: 08/21/2023] Open

Abstract

BACKGROUND

Having an appropriate sample size is important when developing a clinical prediction model. We aimed to review how sample size is considered in studies developing a prediction model for a binary outcome.

METHODS

We searched PubMed for studies published between 01/07/2020 and 30/07/2020 and reviewed the sample size calculations used to develop the prediction models. Using the available information, we calculated the minimum sample size that would be needed to estimate overall risk and minimise overfitting in each study and summarised the difference between the calculated and used sample size.

RESULTS

A total of 119 studies were included, of which nine studies provided sample size justification (8%). The recommended minimum sample size could be calculated for 94 studies: 73% (95% CI: 63-82%) used sample sizes lower than required to estimate overall risk and minimise overfitting including 26% studies that used sample sizes lower than required to estimate overall risk only. A similar number of studies did not meet the ≥ 10EPV criteria (75%, 95% CI: 66-84%). The median deficit of the number of events used to develop a model was 75 [IQR: 234 lower to 7 higher]) which reduced to 63 if the total available data (before any data splitting) was used [IQR:225 lower to 7 higher]. Studies that met the minimum required sample size had a median c-statistic of 0.84 (IQR:0.80 to 0.9) and studies where the minimum sample size was not met had a median c-statistic of 0.83 (IQR: 0.75 to 0.9). Studies that met the ≥ 10 EPP criteria had a median c-statistic of 0.80 (IQR: 0.73 to 0.84).

CONCLUSIONS

Prediction models are often developed with no sample size calculation, as a consequence many are too small to precisely estimate the overall risk. We encourage researchers to justify, perform and report sample size calculations when developing a prediction model.

Collapse

Lewis MW, Webb CA, Kuhn M, Akman E, Jobson SA, Rosso IM. Predicting Fear Extinction in Posttraumatic Stress Disorder. Brain Sci 2023;13:1131. [PMID: 37626488 PMCID: PMC10452660 DOI: 10.3390/brainsci13081131] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2023] [Revised: 07/21/2023] [Accepted: 07/26/2023] [Indexed: 08/27/2023] Open

Chiorino G, Petracci E, Sehovic E, Gregnanin I, Camussi E, Mello-Grand M, Ostano P, Riggi E, Vergini V, Russo A, Berrino E, Ortale A, Garena F, Venesio T, Gallo F, Favettini E, Frigerio A, Matullo G, Segnan N, Giordano L. Plasma microRNA ratios associated with breast cancer detection in a nested case-control study from a mammography screening cohort. Sci Rep 2023;13:12040. [PMID: 37491482 PMCID: PMC10368693 DOI: 10.1038/s41598-023-38886-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2022] [Accepted: 07/17/2023] [Indexed: 07/27/2023] Open

Abstract

Mammographic breast cancer screening is effective in reducing breast cancer mortality. Nevertheless, several limitations are known. Therefore, developing an alternative or complementary non-invasive tool capable of increasing the accuracy of the screening process is highly desirable. The objective of this study was to identify circulating microRNA (miRs) ratios associated with BC in women attending mammography screening. A nested case-control study was conducted within the ANDROMEDA cohort (women of age 46-67 attending BC screening). Pre-diagnostic plasma samples, information on life-styles and common BC risk factors were collected. Small-RNA sequencing was carried out on plasma samples from 65 cases and 66 controls. miR ratios associated with BC were selected by two-sample Wilcoxon test and lasso logistic regression. Subsequent assessment by RT-qPCR of the miRs contained in the selected miR ratios was carried out as a platform validation. To identify the most promising biomarkers, penalised logistic regression was further applied to candidate miR ratios alone, or in combination with non-molecular factors. Small-RNA sequencing yielded 20 candidate miR ratios associated with BC, which were further assessed by RT-qPCR. In the resulting model, penalised logistic regression selected seven miR ratios (miR-199a-3p_let-7a-5p, miR-26b-5p_miR-142-5p, let-7b-5p_miR-19b-3p, miR-101-3p_miR-19b-3p, miR-93-5p_miR-19b-3p, let-7a-5p_miR-22-3p and miR-21-5p_miR-23a-3p), together with body mass index (BMI), menopausal status (MS), the interaction term BMI * MS, life-style score and breast density. The ROC AUC of the model was 0.79 with a sensitivity and specificity of 71.9% and 76.6%, respectively. We identified biomarkers potentially useful for BC screening measured through a widespread and low-cost technique. This is the first study reporting circulating miRs for BC detection in a screening setting. Validation in a wider sample is warranted.Trial registration: The Andromeda prospective cohort study protocol was retrospectively registered on 27-11-2015 (NCT02618538).

Collapse

Affiliation(s)

Giovanna Chiorino Cancer Genomics Lab, Fondazione Edo ed Elvo Tempia, Via Malta 3, 13900, Biella, Italy
Elisabetta Petracci Unit of Biostatistics and Clinical Trials, IRCCS Istituto Romagnolo per lo Studio dei Tumori (IRST) "Dino Amadori", Meldola, Italy
Emir Sehovic Cancer Genomics Lab, Fondazione Edo ed Elvo Tempia, Via Malta 3, 13900, Biella, Italy. Department of Life Sciences and Systems Biology, University of Turin, Turin, Italy.
Ilaria Gregnanin Cancer Genomics Lab, Fondazione Edo ed Elvo Tempia, Via Malta 3, 13900, Biella, Italy
Elisa Camussi SSD Epidemiologia Screening, CPO-AOU Città della Salute e della Scienza di Torino, Via Camillo Benso Di Cavour 31, 10123, Turin, Italy
Maurizia Mello-Grand Cancer Genomics Lab, Fondazione Edo ed Elvo Tempia, Via Malta 3, 13900, Biella, Italy
Paola Ostano Cancer Genomics Lab, Fondazione Edo ed Elvo Tempia, Via Malta 3, 13900, Biella, Italy
Emilia Riggi SSD Epidemiologia Screening, CPO-AOU Città della Salute e della Scienza di Torino, Via Camillo Benso Di Cavour 31, 10123, Turin, Italy
Viviana Vergini SSD Epidemiologia Screening, CPO-AOU Città della Salute e della Scienza di Torino, Via Camillo Benso Di Cavour 31, 10123, Turin, Italy
Alessia Russo Department of Medical Sciences, University of Turin, Turin, Italy
Enrico Berrino Department of Medical Sciences, University of Turin, Turin, Italy Pathology Unit, Candiolo Cancer Institute, FPO IRCCS, Candiolo, Italy
Andrea Ortale SSD Epidemiologia Screening, CPO-AOU Città della Salute e della Scienza di Torino, Via Camillo Benso Di Cavour 31, 10123, Turin, Italy
Francesca Garena SSD Epidemiologia Screening, CPO-AOU Città della Salute e della Scienza di Torino, Via Camillo Benso Di Cavour 31, 10123, Turin, Italy
Tiziana Venesio Pathology Unit, Candiolo Cancer Institute, FPO IRCCS, Candiolo, Italy
Federica Gallo Epidemiology Unit, Staff Health Direction, Local Health Authority 1 of Cuneo, Cuneo, Italy
Elisabetta Favettini Diagnostic Radiology Unit, Nuovo Ospedale Degli Infermi, Ponderano, Italy
Alfonso Frigerio SSD Epidemiologia Screening, CPO-AOU Città della Salute e della Scienza di Torino, Via Camillo Benso Di Cavour 31, 10123, Turin, Italy
Giuseppe Matullo Department of Medical Sciences, University of Turin, Turin, Italy
Nereo Segnan SSD Epidemiologia Screening, CPO-AOU Città della Salute e della Scienza di Torino, Via Camillo Benso Di Cavour 31, 10123, Turin, Italy.
Livia Giordano SSD Epidemiologia Screening, CPO-AOU Città della Salute e della Scienza di Torino, Via Camillo Benso Di Cavour 31, 10123, Turin, Italy

Collapse

Blythe R, Parsons R, Barnett AG, McPhail SM, White NM. Vital signs-based deterioration prediction model assumptions can lead to losses in prediction performance. J Clin Epidemiol 2023;159:106-115. [PMID: 37245699 DOI: 10.1016/j.jclinepi.2023.05.020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2023] [Revised: 04/11/2023] [Accepted: 05/22/2023] [Indexed: 05/30/2023]

Venkatasubramaniam A, Mateen BA, Shields BM, Hattersley AT, Jones AG, Vollmer SJ, Dennis JM. Comparison of causal forest and regression-based approaches to evaluate treatment effect heterogeneity: an application for type 2 diabetes precision medicine. BMC Med Inform Decis Mak 2023;23:110. [PMID: 37328784 PMCID: PMC10276367 DOI: 10.1186/s12911-023-02207-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2022] [Accepted: 06/01/2023] [Indexed: 06/18/2023] Open

Abstract

OBJECTIVE

Precision medicine requires reliable identification of variation in patient-level outcomes with different available treatments, often termed treatment effect heterogeneity. We aimed to evaluate the comparative utility of individualized treatment selection strategies based on predicted individual-level treatment effects from a causal forest machine learning algorithm and a penalized regression model.

METHODS

Cohort study characterizing individual-level glucose-lowering response (6 month reduction in HbA1c) in people with type 2 diabetes initiating SGLT2-inhibitor or DPP4-inhibitor therapy. Model development set comprised 1,428 participants in the CANTATA-D and CANTATA-D2 randomised clinical trials of SGLT2-inhibitors versus DPP4-inhibitors. For external validation, calibration of observed versus predicted differences in HbA1c in patient strata defined by size of predicted HbA1c benefit was evaluated in 18,741 patients in UK primary care (Clinical Practice Research Datalink).

RESULTS

Heterogeneity in treatment effects was detected in clinical trial participants with both approaches (proportion predicted to have a benefit on SGLT2-inhibitor therapy over DPP4-inhibitor therapy: causal forest: 98.6%; penalized regression: 81.7%). In validation, calibration was good with penalized regression but sub-optimal with causal forest. A strata with an HbA1c benefit > 10 mmol/mol with SGLT2-inhibitors (3.7% of patients, observed benefit 11.0 mmol/mol [95%CI 8.0-14.0]) was identified using penalized regression but not causal forest, and a much larger strata with an HbA1c benefit 5-10 mmol with SGLT2-inhibitors was identified with penalized regression (regression: 20.9% of patients, observed benefit 7.8 mmol/mol (95%CI 6.7-8.9); causal forest 11.6%, observed benefit 8.7 mmol/mol (95%CI 7.4-10.1).

CONCLUSIONS

Consistent with recent results for outcome prediction with clinical data, when evaluating treatment effect heterogeneity researchers should not rely on causal forest or other similar machine learning algorithms alone, and must compare outputs with standard regression, which in this evaluation was superior.

Collapse

Pate A, Riley RD, Collins GS, van Smeden M, Van Calster B, Ensor J, Martin GP. Minimum sample size for developing a multivariable prediction model using multinomial logistic regression. Stat Methods Med Res 2023;32:555-571. [PMID: 36660777 PMCID: PMC10012398 DOI: 10.1177/09622802231151220] [Citation(s) in RCA: 13] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]

Debray TPA, Collins GS, Riley RD, Snell KIE, Van Calster B, Reitsma JB, Moons KGM. Transparent reporting of multivariable prediction models developed or validated using clustered data (TRIPOD-Cluster): explanation and elaboration. BMJ 2023;380:e071058. [PMID: 36750236 PMCID: PMC9903176 DOI: 10.1136/bmj-2022-071058] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 10/07/2022] [Indexed: 02/09/2023]

Wells J, Wang C, Dolgin K, Kayyali R. SPUR: A Patient-Reported Medication Adherence Model as a Predictor of Admission and Early Readmission in Patients Living with Type 2 Diabetes. Patient Prefer Adherence 2023;17:441-455. [PMID: 36844798 PMCID: PMC9948632 DOI: 10.2147/ppa.s397424] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/21/2022] [Accepted: 01/14/2023] [Indexed: 02/20/2023] Open

Abstract

PURPOSE

Poor medication adherence (MA) is linked to an increased likelihood of hospital admission. Early interventions to address MA may reduce this risk and associated health-care costs. This study aimed to evaluate a holistic Patient Reported Outcome Measure (PROM) of MA, known as SPUR, as a predictor of general admission and early readmission in patients living with Type 2 Diabetes.

PATIENTS AND METHODS

An observational study design was used to assess data collected over a 12-month period including 6-month retrospective and 6-month prospective monitoring of the number of admissions and early readmissions (admissions occurring within 30 days of discharge) across the cohort. Patients (n = 200) were recruited from a large South London NHS Trust. Covariates of interest included: age, ethnicity, gender, level of education, income, the number of medicines and medical conditions, and a Covid-19 diagnosis. A Poisson or negative binomial model was employed for count outcomes, with the exponentiated coefficient indicating incident ratios (IR) [95% CI]. For binary outcomes (Coefficient, [95% CI]), a logistic regression model was developed.

RESULTS

Higher SPUR scores (increased adherence) were significantly associated with a lower number of admissions (IR = 0.98, [0.96, 1.00]). The number of medical conditions (IR = 1.07, [1.01, 1.13]), age ≥80 years (IR = 5.18, [1.01, 26.55]), a positive Covid-19 diagnosis during follow-up (IR = 1.83, [1.11, 3.02]) and GCSE education (IR = 2.11, [1.15,3.87]) were factors associated with a greater risk of admission. When modelled as a binary variable, only the SPUR score (-0.051, [-0.094, -0.007]) was significantly predictive of an early readmission, with patients reporting higher SPUR scores being less likely to experience an early readmission.

CONCLUSION

Higher levels of MA, as determined by SPUR, were significantly associated with a lower risk of general admissions and early readmissions among patients living with Type 2 Diabetes.

Collapse

Zhang H, Zhang N, Wu W, Zhou R, Li S, Wang Z, Dai Z, Zhang L, Liu Z, Zhang J, Luo P, Liu Z, Cheng Q. Machine learning-based tumor-infiltrating immune cell-associated lncRNAs for predicting prognosis and immunotherapy response in patients with glioblastoma. Brief Bioinform 2022;23:6711411. [PMID: 36136350 DOI: 10.1093/bib/bbac386] [Citation(s) in RCA: 46] [Impact Index Per Article: 23.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2022] [Revised: 07/29/2022] [Accepted: 08/10/2022] [Indexed: 12/14/2022] Open

Affiliation(s)

Hao Zhang Department of Neurosurgery, Xiangya Hospital, Central South University, China.,National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, China.,Department of Neurosurgery, The Second Affiliated Hospital, Chongqing Medical University, China
Nan Zhang Department of Neurosurgery, Xiangya Hospital, Central South University, China.,One-third Lab, College of Bioinformatics Science and Technology, Harbin Medical University, China.,National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, China
Wantao Wu Department of Neurosurgery, Xiangya Hospital, Central South University, China.,Department of Oncology, Xiangya Hospital, Central South University, China.,National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, China
Ran Zhou Division of Neuroscience and Experimental Psychology, Faculty of Biology, Medicine and Health, University of Manchester, UK
Shuyu Li Department of Thyroid and Breast Surgery, Tongji Hospital, Tongji Medical College of Huazhong University of Science and Technology, China
Zeyu Wang Department of Neurosurgery, Xiangya Hospital, Central South University, China.,National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, China
Ziyu Dai Department of Neurosurgery, Xiangya Hospital, Central South University, China.,National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, China
Liyang Zhang Department of Neurosurgery, Xiangya Hospital, Central South University, China.,National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, China
Zaoqu Liu Department of Interventional Radiology, The First Affiliated Hospital of Zhengzhou, China
Jian Zhang Department of Oncology, Zhujiang Hospital, Southern Medical University, China
Peng Luo Department of Oncology, Zhujiang Hospital, Southern Medical University, China
Zhixiong Liu Department of Neurosurgery, Xiangya Hospital, Central South University, China.,National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, China
Quan Cheng Department of Neurosurgery, Xiangya Hospital, Central South University, China.,National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, China

Collapse

Virdee PS, Patnick J, Watkinson P, Holt T, Birks J. Full Blood Count Trends for Colorectal Cancer Detection in Primary Care: Development and Validation of a Dynamic Prediction Model. Cancers (Basel) 2022;14:cancers14194779. [PMID: 36230702 PMCID: PMC9563332 DOI: 10.3390/cancers14194779] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2022] [Revised: 09/22/2022] [Accepted: 09/27/2022] [Indexed: 11/24/2022] Open

Abstract

Simple Summary

Colorectal cancer is the fourth most common cancer and second most common cause of cancer-death in the UK. If diagnosed and treated early-stage, when the cancer has not spread, 9 in 10 patients are alive five years later. If diagnosed at a late-stage, when the cancer has spread, this drops to 1 in 10 alive. Early detection can save lives, but more than half of colorectal cancers are diagnosed late-stage in the UK. Growing tumours often cause subtle changes in blood test results that could help with earlier detection. For example, patients diagnosed with colorectal cancer often have an increasingly lowering haemoglobin for a few years before their diagnosis, which is not seen in patients without colorectal cancer. These differences as subtle so may be difficult for doctors in primary care to spot from a series of blood tests. We developed a computer-based tool to do this. This tool checks the changes in a patient’s blood test results over the last five years to see how likely they are to have colorectal cancer. We report this tool here and describe how well it works in identifying colorectal cancer cases using blood tests performed in primary care.

Abstract

Colorectal cancer has low survival rates when late-stage, so earlier detection is important. The full blood count (FBC) is a common blood test performed in primary care. Relevant trends in repeated FBCs are related to colorectal cancer presence. We developed and internally validated dynamic prediction models utilising trends for early detection. We performed a cohort study. Sex-stratified multivariate joint models included age at baseline (most recent FBC) and simultaneous trends over historical haemoglobin, mean corpuscular volume (MCV), and platelet measurements up to baseline FBC for two-year risk of diagnosis. Performance measures included the c-statistic and calibration slope. We analysed 250,716 males and 246,695 females in the development cohort and 312,444 males and 462,900 females in the validation cohort, with 0.4% of males and 0.3% of females diagnosed two years after baseline FBC. Compared to average population trends, patient-level declines in haemoglobin and MCV and rise in platelets up to baseline FBC increased risk of diagnosis in two years. C-statistic: 0.751 (males) and 0.763 (females). Calibration slope: 1.06 (males) and 1.05 (females). Our models perform well, with low miscalibration. Utilising trends could bring forward diagnoses to earlier stages and improve survival rates. External validation is now required.

Collapse

Stolarski AE, Kim J, Rop K, Wee K, Zhang Q, Remick DG. Machine learning and murine models explain failures of clinical sepsis trials. J Trauma Acute Care Surg 2022;93:187-194. [PMID: 35881034 PMCID: PMC9335891 DOI: 10.1097/ta.0000000000003691] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]

Abstract

BACKGROUND

Multiple clinical trials failed to demonstrate the efficacy of hydrocortisone, ascorbic acid, and thiamine (HAT) in sepsis. These trials were dominated by patients with pulmonary sepsis and have not accounted for differences in the inflammatory responses across varying etiologies of injury/illness. Hydrocortisone, ascorbic acid, and thiamine have previously revealed tremendous benefits in animal peritonitis sepsis models (cecal ligation and puncture [CLP]) in contradiction to the various clinical trials. The impact of HAT remains unclear in pulmonary sepsis. Our objective was to investigate the impact of HAT in pneumonia, consistent with the predominate etiology in the discordant clinical trials. We hypothesized that, in a pulmonary sepsis model, HAT would act synergistically to reduce end-organ dysfunction by the altering the inflammatory response, in a unique manner compared with CLP.

METHODS

Using Pseudomonas aeruginosa pneumonia, a pulmonary sepsis model (pneumonia [PNA]) was compared directly to previously investigated intra-abdominal sepsis models. Machine learning applied to early vital signs stratified animals into those predicted to die (pDie) versus predicted to live (pLive). Animals were then randomized to receive antibiotics and fluids (vehicle [VEH]) vs. HAT). Vitals, cytokines, vitamin C, and markers of liver and kidney function were assessed in the blood, bronchoalveolar lavage, and organ homogenates.

RESULTS

PNA was induced in 119 outbred wild-type Institute of Cancer Research mice (predicted mortality approximately 50%) similar to CLP. In PNA, interleukin 1 receptor antagonist in 72-hour bronchoalveolar lavage was lower with HAT (2.36 ng/mL) compared with VEH (4.88 ng/mL; p = 0.04). The remaining inflammatory cytokines and markers of liver/renal function showed no significant difference with HAT in PNA. PNA vitamin C levels were 0.62 mg/dL (pDie HAT), lower than vitamin C levels after CLP (1.195 mg/dL). Unlike CLP, PNA mice did not develop acute kidney injury (blood urea nitrogen: pDie, 33.5 mg/dL vs. pLive, 27.6 mg/dL; p = 0.17). Furthermore, following PNA, HAT did not significantly reduce microscopic renal oxidative stress (mean gray area: pDie, 16.64 vs. pLive, 6.88; p = 0.93). Unlike CLP where HAT demonstrated a survival benefit, HAT had no impact on survival in PNA.

CONCLUSION

Hydrocortisone, ascorbic acid, and thiamine therapy has minimal benefits in pneumonia. The inflammatory response induced by pulmonary sepsis is unique compared with the response during intra-abdominal sepsis. Consequently, different etiologies of sepsis respond differently to HAT therapy.

Collapse

Loohuis AMM, Burger H, Wessels N, Dekker J, Malmberg AG, Berger MY, Blanker MH, van der Worp H. Prediction model study focusing on eHealth in the management of urinary incontinence: the Personalised Advantage Index as a decision-making aid. BMJ Open 2022;12:e051827. [PMID: 35879013 PMCID: PMC9328108 DOI: 10.1136/bmjopen-2021-051827] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

Abstract

OBJECTIVE

To develop a prediction model and illustrate the practical potential of personalisation of treatment decisions between app-based treatment and care as usual for urinary incontinence (UI).

DESIGN

A prediction model study using data from a pragmatic, randomised controlled, non-inferiority trial.

SETTING

Dutch primary care from 2015, with social media included from 2017. Enrolment ended on July 2018.

PARTICIPANTS

Adult women were eligible if they had ≥2 episodes of UI per week, access to mobile apps and wanted treatment. Of the 350 screened women, 262 were eligible and randomised to app-based treatment or care as usual; 195 (74%) attended follow-up.

PREDICTORS

Literature review and expert opinion identified 13 candidate predictors, categorised into two groups: Prognostic factors (independent of treatment type), such as UI severity, postmenopausal state, vaginal births, general physical health status, pelvic floor muscle function and body mass index; and modifiers (dependent on treatment type), such as age, UI type and duration, impact on quality of life, previous physical therapy, recruitment method and educational level.

MAIN OUTCOME MEASURE

Primary outcome was symptom severity after a 4-month follow-up period, measured by the International Consultation on Incontinence Questionnaire the Urinary Incontinence Short Form. Prognostic factors and modifiers were combined into a final prediction model. For each participant, we then predicted treatment outcomes and calculated a Personalised Advantage Index (PAI).

RESULTS

Baseline UI severity (prognostic) and age, educational level and impact on quality of life (modifiers) independently affected treatment effect of eHealth. The mean PAI was 0.99±0.79 points, being of clinical relevance in 21% of individuals. Applying the PAI also significantly improved treatment outcomes at the group level.

CONCLUSIONS

Personalising treatment choice can support treatment decision making between eHealth and care as usual through the practical application of prediction modelling. Concerning eHealth for UI, this could facilitate the choice between app-based treatment and care as usual.

TRIAL REGISTRATION NUMBER

NL4948t.

Collapse

van Royen FS, Moons KGM, Geersing GJ, van Smeden M. Developing, validating, updating and judging the impact of prognostic models for respiratory diseases. Eur Respir J 2022;60:13993003.00250-2022. [PMID: 35728976 DOI: 10.1183/13993003.00250-2022] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2022] [Accepted: 05/27/2022] [Indexed: 11/05/2022]

Hafermann L, Klein N, Rauch G, Kammer M, Heinze G. Using Background Knowledge from Preceding Studies for Building a Random Forest Prediction Model: A Plasmode Simulation Study. ENTROPY 2022;24:e24060847. [PMID: 35741566 PMCID: PMC9222226 DOI: 10.3390/e24060847] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/29/2022] [Revised: 06/14/2022] [Accepted: 06/15/2022] [Indexed: 12/05/2022]

van den Goorbergh R, van Smeden M, Timmerman D, Van Calster B. The harm of class imbalance corrections for risk prediction models: illustration and simulation using logistic regression. J Am Med Inform Assoc 2022;29:1525-1534. [PMID: 35686364 PMCID: PMC9382395 DOI: 10.1093/jamia/ocac093] [Citation(s) in RCA: 58] [Impact Index Per Article: 29.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2022] [Revised: 05/12/2022] [Accepted: 05/27/2022] [Indexed: 12/23/2022] Open

Defining Sepsis Phenotypes-Two Murine Models of Sepsis and Machine Learning. Shock 2022;57:268-273. [PMID: 35759307 DOI: 10.1097/shk.0000000000001935] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]

Abstract

INTRODUCTION

The immunobiology defining the clinically apparent differences in response to sepsis remains unclear. We hypothesize that in murine models of sepsis we can identify phenotypes of sepsis using non-invasive physiologic parameters (NIPP) early after infection to distinguish between different inflammatory states.

METHODS

Two murine models of sepsis were used: gram-negative pneumonia (PNA) and cecal ligation and puncture (CLP). All mice were treated with broad spectrum antibiotics and fluid resuscitation. High-risk sepsis responders (pDie) were defined as those predicted to die within 72 h following infection. Low-risk responders (pLive) were expected to survive the initial 72 h of sepsis. Statistical modeling in R was used for statistical analysis and machine learning.

RESULTS

NIPP obtained at 6 and 24 h after infection of 291 mice (85 PNA and 206 CLP) were used to define the sepsis phenotypes. Lasso regression for variable selection with 10-fold cross-validation was used to define the optimal shrinkage parameters. The variables selected to discriminate between phenotypes included 6-h temperature and 24-h pulse distention, heart rate (HR), and temperature. Applying the model to fit test data (n = 55), area under the curve (AUC) for the receiver operating characteristics (ROC) curve was 0.93. Subgroup analysis of 120 CLP mice revealed a HR of <620 bpm at 24 h as a univariate predictor of pDie. (AUC of ROC curve = 0.90). Subgroup analysis of PNA exposed mice (n = 121) did not reveal a single predictive variable highlighting the complex physiological alterations in response to sepsis.

CONCLUSION

In murine models with various etiologies of sepsis, non-invasive vitals assessed just 6 and 24 h after infection can identify different sepsis phenotypes. Stratification by sepsis phenotypes can transform future studies investigating novel therapies for sepsis.

Collapse

Modern Learning from Big Data in Critical Care: Primum Non Nocere. Neurocrit Care 2022;37:174-184. [PMID: 35513752 PMCID: PMC9071245 DOI: 10.1007/s12028-022-01510-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2021] [Accepted: 04/06/2022] [Indexed: 12/13/2022]

Abstract

Large and complex data sets are increasingly available for research in critical care. To analyze these data, researchers use techniques commonly referred to as statistical learning or machine learning (ML). The latter is known for large successes in the field of diagnostics, for example, by identification of radiological anomalies. In other research areas, such as clustering and prediction studies, there is more discussion regarding the benefit and efficiency of ML techniques compared with statistical learning. In this viewpoint, we aim to explain commonly used statistical learning and ML techniques and provide guidance for responsible use in the case of clustering and prediction questions in critical care. Clustering studies have been increasingly popular in critical care research, aiming to inform how patients can be characterized, classified, or treated differently. An important challenge for clustering studies is to ensure and assess generalizability. This limits the application of findings in these studies toward individual patients. In the case of predictive questions, there is much discussion as to what algorithm should be used to most accurately predict outcome. Aspects that determine usefulness of ML, compared with statistical techniques, include the volume of the data, the dimensionality of the preferred model, and the extent of missing data. There are areas in which modern ML methods may be preferred. However, efforts should be made to implement statistical frameworks (e.g., for dealing with missing data or measurement error, both omnipresent in clinical data) in ML methods. To conclude, there are important opportunities but also pitfalls to consider when performing clustering or predictive studies with ML techniques. We advocate careful valuation of new data-driven findings. More interaction is needed between the engineer mindset of experts in ML methods, the insight in bias of epidemiologists, and the probabilistic thinking of statisticians to extract as much information and knowledge from data as possible, while avoiding harm.

Collapse

McNamara ME, Zisser M, Beevers CG, Shumake J. Not just “big” data: Importance of sample size, measurement error, and uninformative predictors for developing prognostic models for digital interventions. Behav Res Ther 2022;153:104086. [DOI: 10.1016/j.brat.2022.104086] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2021] [Revised: 03/11/2022] [Accepted: 04/05/2022] [Indexed: 11/24/2022]

Fonseca de Freitas D, Kadra-Scalzo G, Agbedjro D, Francis E, Ridler I, Pritchard M, Shetty H, Segev A, Casetta C, Smart SE, Downs J, Christensen SR, Bak N, Kinon BJ, Stahl D, MacCabe JH, Hayes RD. Using a statistical learning approach to identify sociodemographic and clinical predictors of response to clozapine. J Psychopharmacol 2022;36:498-506. [PMID: 35212240 PMCID: PMC9066692 DOI: 10.1177/02698811221078746] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]

Affiliation(s)

Daniela Fonseca de Freitas Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, UK
Giouliana Kadra-Scalzo Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, UK
Deborah Agbedjro Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, UK
Emma Francis Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, UK
Isobel Ridler Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, UK
Megan Pritchard Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, UK
Hitesh Shetty Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, UK
Aviv Segev Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, UK Shalvata Mental Health Center, Hod Hasharon, Israel Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
Cecilia Casetta Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, UK Department of Health Sciences, Università degli Studi di Milano, Milan, Italy
Sophie E Smart Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, UK MRC Centre for Neuropsychiatric Genetics & Genomics, Cardiff University, Cardiff, UK
Johnny Downs Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, UK
Søren Rahn Christensen H. Lundbeck A/S, Copenhagen, Denmark
Nikolaj Bak H. Lundbeck A/S, Copenhagen, Denmark
Bruce J Kinon Cyclerion Therapeutics, Massachusetts, USA
Daniel Stahl Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, UK
James H MacCabe Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, UK
Richard D Hayes Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, UK

Collapse

Oosterhoff JHF, Gravesteijn BY, Karhade AV, Jaarsma RL, Kerkhoffs GMMJ, Ring D, Schwab JH, Steyerberg EW, Doornberg JN. Feasibility of Machine Learning and Logistic Regression Algorithms to Predict Outcome in Orthopaedic Trauma Surgery. J Bone Joint Surg Am 2022;104:544-551. [PMID: 34921550 DOI: 10.2106/jbjs.21.00341] [Citation(s) in RCA: 21] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]

Gregorich M, Melograna F, Sunqvist M, Michiels S, Van Steen K, Heinze G. Individual-specific networks for prediction modelling – A scoping review of methods. BMC Med Res Methodol 2022;22:62. [PMID: 35249534 PMCID: PMC8898441 DOI: 10.1186/s12874-022-01544-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2021] [Accepted: 02/11/2022] [Indexed: 11/10/2022] Open

Abstract

Background

Recent advances in biotechnology enable the acquisition of high-dimensional data on individuals, posing challenges for prediction models which traditionally use covariates such as clinical patient characteristics. Alternative forms of covariate representations for the features derived from these modern data modalities should be considered that can utilize their intrinsic interconnection. The connectivity information between these features can be represented as an individual-specific network defined by a set of nodes and edges, the strength of which can vary from individual to individual. Global or local graph-theoretical features describing the network may constitute potential prognostic biomarkers instead of or in addition to traditional covariates and may replace the often unsuccessful search for individual biomarkers in a high-dimensional predictor space.

Methods

We conducted a scoping review to identify, collate and critically appraise the state-of-art in the use of individual-specific networks for prediction modelling in medicine and applied health research, published during 2000–2020 in the electronic databases PubMed, Scopus and Embase.

Results

Our scoping review revealed the main application areas namely neurology and pathopsychology, followed by cancer research, cardiology and pathology (N = 148). Network construction was mainly based on Pearson correlation coefficients of repeated measurements, but also alternative approaches (e.g. partial correlation, visibility graphs) were found. For covariates measured only once per individual, network construction was mostly based on quantifying an individual’s contribution to the overall group-level structure. Despite the multitude of identified methodological approaches for individual-specific network inference, the number of studies that were intended to enable the prediction of clinical outcomes for future individuals was quite limited, and most of the models served as proof of concept that network characteristics can in principle be useful for prediction.

Conclusion

The current body of research clearly demonstrates the value of individual-specific network analysis for prediction modelling, but it has not yet been considered as a general tool outside the current areas of application. More methodological research is still needed on well-founded strategies for network inference, especially on adequate network sparsification and outcome-guided graph-theoretical feature extraction and selection, and on how networks can be exploited efficiently for prediction modelling.

Supplementary Information

The online version contains supplementary material available at 10.1186/s12874-022-01544-6.

Collapse

Yan Y, Yang Z, Semenkovich TR, Kozower BD, Meyers BF, Nava RG, Kreisel D, Puri V. Comparison of standard and penalized logistic regression in risk model development. JTCVS OPEN 2022;9:303-316. [PMID: 36003440 PMCID: PMC9390725 DOI: 10.1016/j.xjon.2022.01.016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/05/2021] [Accepted: 01/13/2022] [Indexed: 11/26/2022]

Chen YJ, Wang WF, Jhang KM, Chang MC, Chang CC, Liao YC. Prediction of Institutionalization for Patients With Dementia in Taiwan According to Condition at Entry to Dementia Collaborative Care. J Appl Gerontol 2022;41:1357-1364. [PMID: 35220779 DOI: 10.1177/07334648211073129] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open

de Hond AAH, Leeuwenberg AM, Hooft L, Kant IMJ, Nijman SWJ, van Os HJA, Aardoom JJ, Debray TPA, Schuit E, van Smeden M, Reitsma JB, Steyerberg EW, Chavannes NH, Moons KGM. Guidelines and quality criteria for artificial intelligence-based prediction models in healthcare: a scoping review. NPJ Digit Med 2022;5:2. [PMID: 35013569 PMCID: PMC8748878 DOI: 10.1038/s41746-021-00549-7] [Citation(s) in RCA: 105] [Impact Index Per Article: 52.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2021] [Accepted: 12/13/2021] [Indexed: 12/23/2022] Open

Affiliation(s)

Anne A H de Hond Department of Information Technology and Digital Innovation, Leiden University Medical Center, Leiden, The Netherlands. Clinical AI Implementation and Research Lab, Leiden University Medical Center, Leiden, The Netherlands. Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, The Netherlands.
Artuur M Leeuwenberg Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands.
Lotty Hooft Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands Cochrane Netherlands, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
Ilse M J Kant Department of Information Technology and Digital Innovation, Leiden University Medical Center, Leiden, The Netherlands Clinical AI Implementation and Research Lab, Leiden University Medical Center, Leiden, The Netherlands Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, The Netherlands
Steven W J Nijman Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
Hendrikus J A van Os Clinical AI Implementation and Research Lab, Leiden University Medical Center, Leiden, The Netherlands National eHealth Living Lab, Leiden, The Netherlands
Jiska J Aardoom National eHealth Living Lab, Leiden, The Netherlands Department of Public Health and Primary Care, Leiden University Medical Center, Leiden, The Netherlands
Thomas P A Debray Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
Ewoud Schuit Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
Maarten van Smeden Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
Johannes B Reitsma Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
Ewout W Steyerberg Clinical AI Implementation and Research Lab, Leiden University Medical Center, Leiden, The Netherlands Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, The Netherlands
Niels H Chavannes National eHealth Living Lab, Leiden, The Netherlands Department of Public Health and Primary Care, Leiden University Medical Center, Leiden, The Netherlands
Karel G M Moons Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands

Collapse

Joshi A, Geroldinger A, Jiricka L, Senchaudhuri P, Corcoran C, Heinze G. Solutions to problems of nonexistence of parameter estimates and sparse data bias in Poisson regression. Stat Methods Med Res 2021;31:253-266. [PMID: 34931909 PMCID: PMC8829730 DOI: 10.1177/09622802211065405] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Strömer A, Staerk C, Klein N, Weinhold L, Titze S, Mayr A. Deselection of base-learners for statistical boosting-with an application to distributional regression. Stat Methods Med Res 2021;31:207-224. [PMID: 34882438 DOI: 10.1177/09622802211051088] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]

Martin GP, Riley RD, Collins GS, Sperrin M. Developing clinical prediction models when adhering to minimum sample size recommendations: The importance of quantifying bootstrap variability in tuning parameters and predictive performance. Stat Methods Med Res 2021;30:2545-2561. [PMID: 34623193 PMCID: PMC8649413 DOI: 10.1177/09622802211046388] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]

Hoogland J, IntHout J, Belias M, Rovers MM, Riley RD, E. Harrell Jr F, Moons KGM, Debray TPA, Reitsma JB. A tutorial on individualized treatment effect prediction from randomized trials with a binary endpoint. Stat Med 2021;40:5961-5981. [PMID: 34402094 PMCID: PMC9291969 DOI: 10.1002/sim.9154] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2019] [Revised: 06/08/2021] [Accepted: 07/19/2021] [Indexed: 12/23/2022]

Šinkovec H, Heinze G, Blagus R, Geroldinger A. To tune or not to tune, a case study of ridge logistic regression in small or sparse datasets. BMC Med Res Methodol 2021;21:199. [PMID: 34592945 PMCID: PMC8482588 DOI: 10.1186/s12874-021-01374-y] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2021] [Accepted: 08/19/2021] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

For finite samples with binary outcomes penalized logistic regression such as ridge logistic regression has the potential of achieving smaller mean squared errors (MSE) of coefficients and predictions than maximum likelihood estimation. There is evidence, however, that ridge logistic regression can result in highly variable calibration slopes in small or sparse data situations.

METHODS

In this paper, we elaborate this issue further by performing a comprehensive simulation study, investigating the performance of ridge logistic regression in terms of coefficients and predictions and comparing it to Firth's correction that has been shown to perform well in low-dimensional settings. In addition to tuned ridge regression where the penalty strength is estimated from the data by minimizing some measure of the out-of-sample prediction error or information criterion, we also considered ridge regression with pre-specified degree of shrinkage. We included 'oracle' models in the simulation study in which the complexity parameter was chosen based on the true event probabilities (prediction oracle) or regression coefficients (explanation oracle) to demonstrate the capability of ridge regression if truth was known.

RESULTS

Performance of ridge regression strongly depends on the choice of complexity parameter. As shown in our simulation and illustrated by a data example, values optimized in small or sparse datasets are negatively correlated with optimal values and suffer from substantial variability which translates into large MSE of coefficients and large variability of calibration slopes. In contrast, in our simulations pre-specifying the degree of shrinkage prior to fitting led to accurate coefficients and predictions even in non-ideal settings such as encountered in the context of rare outcomes or sparse predictors.

CONCLUSIONS

Applying tuned ridge regression in small or sparse datasets is problematic as it results in unstable coefficients and predictions. In contrast, determining the degree of shrinkage according to some meaningful prior assumptions about true effects has the potential to reduce bias and stabilize the estimates.

Collapse

Li Y, Liang M, Mao L, Wang S. Robust estimation and variable selection for the accelerated failure time model. Stat Med 2021;40:4473-4491. [PMID: 34031919 PMCID: PMC8364878 DOI: 10.1002/sim.9042] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2020] [Revised: 04/25/2021] [Accepted: 04/26/2021] [Indexed: 11/10/2022]

Cornelissen LL, Caram‐Deelder C, Fustolo‐Gunnink SF, Groenwold RHH, Stanworth SJ, Zwaginga JJ, van der Bom JG. Expected individual benefit of prophylactic platelet transfusions in hemato-oncology patients based on bleeding risks. Transfusion 2021;61:2578-2587. [PMID: 34263930 PMCID: PMC8518514 DOI: 10.1111/trf.16587] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2021] [Revised: 06/15/2021] [Accepted: 06/23/2021] [Indexed: 01/01/2023]

Heinze G, van Smeden M, Wynants L, Steyerberg E, van Calster B. Prediction models: stepwise development and simultaneous validation is a step back. J Clin Epidemiol 2021;142:330-331. [PMID: 34348179 DOI: 10.1016/j.jclinepi.2021.07.019] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2021] [Accepted: 07/28/2021] [Indexed: 12/23/2022]

About Model Validation in Bioprocessing. Processes (Basel) 2021. [DOI: 10.3390/pr9060961] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open

Srisuwarn P, Srisuma S, Sriapha C, Tongpoo A, Rittilert P, Pradoo A, Tanpudsa Y, Wananukul W. Clinical effects and factors associated with adverse clinical outcomes of hymenopteran stings treated in a Thai Poison Centre: a retrospective cross-sectional study. Clin Toxicol (Phila) 2021;60:168-174. [PMID: 33960850 DOI: 10.1080/15563650.2021.1918705] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]

Abstract

OBJECTIVE

To describe clinical effects and outcomes of hymenopteran stings and to explore the non-laboratory factors associated with adverse clinical outcomes, a composite outcome including death, respiratory failure requiring intubation, acute kidney injury (AKI) requiring dialysis and hypotension requiring vasopressor use.

METHODS

A retrospective cross-sectional study was performed at the Ramathibodi Poison Center, a poison centre of a tertiary care hospital in Thailand. All cases of hymenopteran sting consultations from January 2015 to June 2019 were consecutively enrolled, and charts were reviewed. Demographics, initial clinical characteristics and outcomes were collected. Factors associated with adverse clinical outcome were explored.

RESULTS

One hundred and fourteen hymenopteran stings cases (wasp 48%, bee 33%, hornet 14% and carpenter bee 8.8%) were included (median age, 36.5 years (interquartile range 9-55); male 63%). The prevalence of adverse clinical outcomes was 12.3% (95%CI 6.88-12.8). At initial presentation, 100% of cases had local skin reactions, 11.4% were clinical anaphylaxis, and 8% had red urine. Adverse clinical outcomes included death (n = 10), respiratory failure requiring intubation (n = 9), AKI requiring dialysis (n = 6) and hypotension requiring vasopressor use (n = 2). None of the patients with carpenter bee or hornet stings developed adverse clinical outcomes. In univariable analysis, urticaria, wheezing, red urine, wasp sting and sting number > 10 were significantly associated with adverse clinical outcomes. In multivariable analysis, red urine (adjusted OR 11.1 (95% CI 1.57-216)), wheezing (adjusted OR 16.7 (95% CI 1.43-402)) and a number of stings > 10 (adjusted OR 21.5 (95% CI2.13-2557)) were significant.

CONCLUSIONS

Adverse clinical outcomes in hymenopteran stings were not uncommon among cases inquiring to a national Thai poison centre. At initial presentation, red urine, wheezing and a number stings >10 were significantly associated with adverse clinical outcomes. Larger epidemiologic studies are required to confirm these associations.

Collapse

Riley RD, Snell KIE, Martin GP, Whittle R, Archer L, Sperrin M, Collins GS. Penalization and shrinkage methods produced unreliable clinical prediction models especially when sample size was small. J Clin Epidemiol 2021;132:88-96. [PMID: 33307188 PMCID: PMC8026952 DOI: 10.1016/j.jclinepi.2020.12.005] [Citation(s) in RCA: 49] [Impact Index Per Article: 16.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2020] [Revised: 11/15/2020] [Accepted: 12/02/2020] [Indexed: 12/14/2022]

Christodoulou E, van Smeden M, Edlinger M, Timmerman D, Wanitschek M, Steyerberg EW, Van Calster B. Adaptive sample size determination for the development of clinical prediction models. Diagn Progn Res 2021;5:6. [PMID: 33745449 PMCID: PMC7983402 DOI: 10.1186/s41512-021-00096-5] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/01/2020] [Accepted: 02/15/2021] [Indexed: 12/23/2022] Open

Abstract

BACKGROUND

We suggest an adaptive sample size calculation method for developing clinical prediction models, in which model performance is monitored sequentially as new data comes in.

METHODS

We illustrate the approach using data for the diagnosis of ovarian cancer (n = 5914, 33% event fraction) and obstructive coronary artery disease (CAD; n = 4888, 44% event fraction). We used logistic regression to develop a prediction model consisting only of a priori selected predictors and assumed linear relations for continuous predictors. We mimicked prospective patient recruitment by developing the model on 100 randomly selected patients, and we used bootstrapping to internally validate the model. We sequentially added 50 random new patients until we reached a sample size of 3000 and re-estimated model performance at each step. We examined the required sample size for satisfying the following stopping rule: obtaining a calibration slope ≥ 0.9 and optimism in the c-statistic (or AUC) < = 0.02 at two consecutive sample sizes. This procedure was repeated 500 times. We also investigated the impact of alternative modeling strategies: modeling nonlinear relations for continuous predictors and correcting for bias on the model estimates (Firth's correction).

RESULTS

Better discrimination was achieved in the ovarian cancer data (c-statistic 0.9 with 7 predictors) than in the CAD data (c-statistic 0.7 with 11 predictors). Adequate calibration and limited optimism in discrimination was achieved after a median of 450 patients (interquartile range 450-500) for the ovarian cancer data (22 events per parameter (EPP), 20-24) and 850 patients (750-900) for the CAD data (33 EPP, 30-35). A stricter criterion, requiring AUC optimism < = 0.01, was met with a median of 500 (23 EPP) and 1500 (59 EPP) patients, respectively. These sample sizes were much higher than the well-known 10 EPP rule of thumb and slightly higher than a recently published fixed sample size calculation method by Riley et al. Higher sample sizes were required when nonlinear relationships were modeled, and lower sample sizes when Firth's correction was used.

CONCLUSIONS

Adaptive sample size determination can be a useful supplement to fixed a priori sample size calculations, because it allows to tailor the sample size to the specific prediction modeling context in a dynamic fashion.

Collapse