Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Richter AN, Khoshgoftaar TM. A review of statistical and machine learning methods for modeling cancer risk using structured clinical data. Artif Intell Med 2018;90:1-14. [DOI: 10.1016/j.artmed.2018.06.002] [Citation(s) in RCA: 68] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2017] [Revised: 09/08/2017] [Accepted: 06/13/2018] [Indexed: 02/06/2023]

For:	Richter AN, Khoshgoftaar TM. A review of statistical and machine learning methods for modeling cancer risk using structured clinical data. Artif Intell Med 2018;90:1-14. [DOI: 10.1016/j.artmed.2018.06.002] [Citation(s) in RCA: 68] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2017] [Revised: 09/08/2017] [Accepted: 06/13/2018] [Indexed: 02/06/2023]

Number

Cited by Other Article(s)

Chang TG, Park S, Schäffer AA, Jiang P, Ruppin E. Hallmarks of artificial intelligence contributions to precision oncology. NATURE CANCER 2025;6:417-431. [PMID: 40055572 PMCID: PMC11957836 DOI: 10.1038/s43018-025-00917-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/04/2024] [Accepted: 01/21/2025] [Indexed: 03/29/2025]

Garwe T, Choi J. An introduction to clinical prediction models using logistic regression in acute care surgery research: Methodologic considerations and common pitfalls. J Trauma Acute Care Surg 2025:01586154-990000000-00923. [PMID: 40012096 DOI: 10.1097/ta.0000000000004584] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/28/2025]

Grau-Jurado P, Mostafaei S, Xu H, Mo M, Petek B, Kalar I, Naia L, Kele J, Maioli S, Pereira JB, Eriksdotter M, Chatterjee S, Garcia-Ptacek S. Medications and cognitive decline in Alzheimer's disease: Cohort cluster analysis of 15,428 patients. J Alzheimers Dis 2025;103:931-940. [PMID: 39772858 DOI: 10.1177/13872877241307870] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2025]

Abstract

BACKGROUND

Medications for comorbid conditions may affect cognition in Alzheimer's disease (AD).

OBJECTIVE

To explore the association between common medications and cognition, measured with the Mini-Mental State Examination.

METHODS

Cohort study including persons with AD from the Swedish Registry for Cognitive/Dementia Disorders (SveDem). Medications were included if they were used by ≥5% of patients (26 individual drugs). Each follow-up was analyzed independently by performing 100 Monte-Carlo simulations of two steps each 1) k-means clustering of patients according to Mini-Mental State Examination at follow-up and its decline since previous measure, and 2) Identification of medications presenting statistically significant differences in the proportion of users in the different clusters.

RESULTS

15,428 patients (60.38% women) were studied. Four clusters were identified. Medications associated with the best cognition cluster (relative to the worse) were atorvastatin (point estimate 1.44 95% confidence interval [1.15-1.83] at first follow-up, simvastatin (1.41 [1.11-1.78] at second follow-up), warfarin (1.56 [1.22-2.01] first follow-up), zopiclone (1.35 [1.15-1.58], and metformin (2.08 [1.35-3.33] second follow-up. Oxazepam (0.60 [0.50-0.73] first follow-up), paracetamol (0.83 [0.73-0.95] first follow-up), cyanocobalamin, felodipine and furosemide were associated with the worst cluster. Cholinesterase inhibitors were associated with the best cognition clusters, whereas memantine appeared in the worse cognition clusters, consistent with its indication in moderate to severe dementia.

CONCLUSIONS

We performed unsupervised clustering to classify patients based on their current cognition and cognitive decline from previous testing. Atorvastatin, simvastatin, warfarin, metformin, and zopiclone presented a positive and statistically significant associations with cognition, while oxazepam, cyanocobalamin, felodipine, furosemide and paracetamol, were associated with the worst cluster.

Collapse

Affiliation(s)

Pol Grau-Jurado Division of Clinical Geriatrics, Department of Neurobiology, Care Sciences and Society (NVS), Karolinska Institutet, Stockholm, Sweden
Shayan Mostafaei Division of Clinical Geriatrics, Department of Neurobiology, Care Sciences and Society (NVS), Karolinska Institutet, Stockholm, Sweden Departmenet of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
Hong Xu Division of Clinical Geriatrics, Department of Neurobiology, Care Sciences and Society (NVS), Karolinska Institutet, Stockholm, Sweden
Minjia Mo Division of Clinical Geriatrics, Department of Neurobiology, Care Sciences and Society (NVS), Karolinska Institutet, Stockholm, Sweden
Bojana Petek Division of Clinical Geriatrics, Department of Neurobiology, Care Sciences and Society (NVS), Karolinska Institutet, Stockholm, Sweden Division of Neurogeriatrics, Department of Neurobiology, Care Sciences and Society (NVS), Karolinska Institutet, Stockholm, Sweden Faculty of Medicine, University of Ljubljana, Ljubljana, Slovenia Clinical Institute of Genomic Medicine, University Medical Centre Ljubljana, Ljubljana, Slovenia
Irena Kalar Division of Clinical Geriatrics, Department of Neurobiology, Care Sciences and Society (NVS), Karolinska Institutet, Stockholm, Sweden Faculty of Medicine, University of Ljubljana, Ljubljana, Slovenia Department of Neurology, University Medical Centre Ljubljana, Ljubljana, Slovenia
Luana Naia Division of Neurogeriatrics, Department of Neurobiology, Care Sciences and Society (NVS), Karolinska Institutet, Stockholm, Sweden
Julianna Kele Team Neurovascular Biology and Health, Clinical Immunology, Department of Laboratory Medicine, Karolinska Institutet, Stockholm, Sweden
Silvia Maioli Division of Neurogeriatrics, Department of Neurobiology, Care Sciences and Society (NVS), Karolinska Institutet, Stockholm, Sweden
Joana B Pereira Neuro Division, Department of Clinical Neurosciences, Karolinska Institutet, Stockholm, Sweden
Maria Eriksdotter Division of Clinical Geriatrics, Department of Neurobiology, Care Sciences and Society (NVS), Karolinska Institutet, Stockholm, Sweden Aging and Inflammation Theme, Karolinska University Hospital, Stockholm, Sweden
Saikat Chatterjee School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology, Stockholm, Sweden
Sara Garcia-Ptacek Division of Clinical Geriatrics, Department of Neurobiology, Care Sciences and Society (NVS), Karolinska Institutet, Stockholm, Sweden Aging and Inflammation Theme, Karolinska University Hospital, Stockholm, Sweden

Collapse

Mohammed M, Zainal H, Ong SC, Tangiisuran B, Aziz FA, Sidek NN, Sha'aban A, Ibrahim UI, Muhammad S, Looi I, Aziz ZA. Prognostic Models of Mortality Following First-Ever Acute Ischemic Stroke: A Population-Based Retrospective Cohort Study. Health Sci Rep 2025;8:e70445. [PMID: 39957974 PMCID: PMC11825595 DOI: 10.1002/hsr2.70445] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2024] [Revised: 01/20/2025] [Accepted: 01/27/2025] [Indexed: 02/18/2025] Open

Abstract

Background and Aims

There is a lack of population-based studies focusing on guideline-based prognostic models for stroke. This study aimed to develop and validate a prognostic model that predicts mortality following a first-ever acute ischemic stroke.

Methods

The study included 899 adult patients ( ≥ 18 years) with confirmed diagnosis of first-ever acute ischemic stroke enrolled in the Malaysian National Stroke Registry (NSR) from January 2009 to December 2019. The primary outcome was mortality within 90 days post-stroke (266 events [29.6%]). The prognostic model was developed using logistic regression (75%, n = 674) and internally validated (25%, n = 225). Model performance was assessed using discrimination (area under the curve (AUC]) and calibration (Hosmer-Lemeshow test [HL]).

Results

The final model includes factors associated with increased risk of mortality, such as age (adjusted odds ratio, aOR 1.06 [95% confidence interval, CI 1.03, 1.10; p < 0.001]), National Institutes of Health Stroke Scale (NIHSS) score aOR 1.08 (95% CI 1.08, 1.13; p = 0.004), and diabetes aOR 2.29 (95% CI 1.20, 4.37; p = 0.012). The protective factors were antiplatelet within 48 h. aOR 0.40 (95% CI 0.19, 0.81; p = 0.01), dysphagia screening aOR 0.30 (95% CI 0.15, 0.61; p = 0.001), antiplatelets upon discharge aOR 0.17 (95% CI 0.08, 0.35; p < 0.001), lipid-lowering therapy aOR 0.37 (95% CI 0.17, 0.82; p = 0.01), stroke education aOR 0.02 (95% CI 0.01, 0.05; p < 0.001) and rehabilitation aOR 0.08 (95% CI 0.04, 0.16; p < 0.001). The model demonstrated excellent performance (discrimination [AUC = 0.94] and calibration [HL, X 2 p = 0.63]).

Conclusion

The study developed a validated prognostic model that excellently predicts mortality after a first-ever acute ischemic stroke with potential clinical utility in acute stroke care decision-making. The predictors could be valuable for creating risk calculators and aiding healthcare providers and patients in making well-informed clinical decisions during the stroke care process.

Collapse

Pislar N, Gasljevic G, Matos E, Pilko G, Zgajnar J, Perhavec A. Predicting nodal response to neoadjuvant treatment in breast cancer with core biopsy biomarkers of tumor microenvironment using data mining. Breast Cancer Res Treat 2025;210:87-94. [PMID: 39496911 PMCID: PMC11787214 DOI: 10.1007/s10549-024-07539-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2024] [Accepted: 10/22/2024] [Indexed: 11/06/2024]

Nemlander E, Abedi E, Ljungman P, Hasselström J, Carlsson AC, Rosenblad A. The Stockholm early detection of cancer study (STEADY-CAN): rationale, design, data collection, and baseline characteristics for 2.7 million participants. Eur J Epidemiol 2025;40:123-136. [PMID: 39755982 PMCID: PMC11799118 DOI: 10.1007/s10654-024-01192-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2024] [Accepted: 12/09/2024] [Indexed: 01/07/2025]

Abstract

The Stockholm Early Detection of Cancer Study (STEADY-CAN) cohort was established to investigate strategies for early cancer detection in a population-based context within Stockholm County, the capital region of Sweden. Utilising real-world data to explore cancer-related healthcare patterns and outcomes, the cohort links extensive clinical and laboratory data from both inpatient and outpatient care in the region. The dataset includes demographic information, detailed diagnostic codes, laboratory results, prescribed medications, and healthcare utilisation data. Since its inception, STEADY-CAN has collected longitudinal data on 2,732,005 individuals aged ≥ 18 years old living in or having access to health care in Stockholm County during the years 2011-2021. Focusing on cancer, the cohort includes 140,042 (5.1%) individuals with incident cancer and a control group of 2,591,963 (94.9%) cancer-free individuals. The cohort's diverse adult population enables robust analyses of early symptom detection, incidental findings, and the impact of comorbidities on cancer diagnoses. Utilizing the wide range of available laboratory data and clinical variables allow for advanced statistical analyses and adjustments for important confounding factors. The cohort's primary focus is to improve understanding of the early diagnostic phase of cancer, offering a crucial resource for studying cancer detection in clinical practice. Its comprehensive data collection provides unique opportunities for research into comorbidities and cancer outcomes, making the cohort a useful resource for ongoing cancer surveillance and public health strategies. The present study gives a detailed description of the rationale for creating the STEADY-CAN cohort, its design, the data collection procedure, and baseline characteristics of collected data.

Collapse

Wojcik KM, Caswell-Jin JL, Wilson OWA, Schechter C, Kamil D, Kurian AW, Jayasekera J. The population-level effects of omitting chemotherapy guided by a 21-gene expression assay in node-positive breast cancer: a simulation modeling study. BMC Cancer 2024;24:975. [PMID: 39118050 PMCID: PMC11308572 DOI: 10.1186/s12885-024-12719-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2024] [Accepted: 07/26/2024] [Indexed: 08/10/2024] Open

Abstract

BACKGROUND

A recent trial showed that postmenopausal women diagnosed with hormone receptor-positive, human epidermal growth factor receptor-2 (HER2)-negative, lymph node-positive (1-3 nodes) breast cancer with a 21-gene recurrence score of ≤ 25 could safely omit chemotherapy. However, there are limited data on population-level long-term outcomes associated with omitting chemotherapy among diverse women seen in real-world practice.

METHODS

We adapted an established, validated simulation model to generate the joint distributions of population-level characteristics of women diagnosed with early-stage breast cancer in the U.S. Input parameters were derived from cancer registry, meta-analyses, and clinical trial data. The effects of omitting chemotherapy on 10-year distant recurrence-free survival, life-years, and quality adjusted life-years (QALYs) were modeled for premenopausal and postmenopausal women. QALYs were discounted at 3%. Results were evaluated for subgroups stratified by race and ethnicity. Sensitivity analyses included testing results across a range of inputs. The model was validated using the published RxPONDER trial data.

RESULTS

In premenopausal women, the 10-year distant recurrence-free survival rates were 85.3% with chemo-endocrine and 80.1% with endocrine therapy. The estimated life-years and QALYs gained with chemotherapy in premenopausal women were 2.1 and 0.6, respectively. There was no chemotherapy benefit in postmenopausal women. There was no variation in the absolute benefit of chemotherapy across racial or ethnic subgroups. However, there were differences in distant recurrence-free survival rates, life-years, and QALYs across groups. Sensitivity analysis showed similar results. The model closely replicated the RxPONDER trial.

CONCLUSIONS

Modeled population-level outcomes show a small chemotherapy benefit in premenopausal women, but no benefit among postmenopausal women. Simulation modeling provides a useful tool to extend trial data and evaluate population-level outcomes.

Collapse

Izadi Z, Gianfrancesco M, Anastasiou C, Schmajuk G, Yazdany J. Development and validation of a risk scoring system to identify patients with lupus nephritis in electronic health record data. Lupus Sci Med 2024;11:e001170. [PMID: 38769054 PMCID: PMC11110552 DOI: 10.1136/lupus-2024-001170] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2024] [Accepted: 04/20/2024] [Indexed: 05/22/2024]

Abstract

OBJECTIVE

Accurate identification of lupus nephritis (LN) cases is essential for patient management, research and public health initiatives. However, LN diagnosis codes in electronic health records (EHRs) are underused, hindering efficient identification. We investigated the current performance of International Classification of Diseases (ICD) codes, 9th and 10th editions (ICD9/10), for identifying prevalent LN, and developed scoring systems to increase identification of LN that are adaptable to settings with and without LN ICD codes.

METHODS

Training and test sets derived from EHR data from a large health system. An external set comprised data from the EHR of a second large health system. Adults with ICD9/10 codes for SLE were included. LN cases were ascertained through manual chart reviews conducted by rheumatologists. Two definitions of LN were used: strict (definite LN) and inclusive (definite, potential or diagnostic uncertainty). Gradient boosting models including structured EHR fields were used for predictor selection. Two logistic regression-based scoring systems were developed ('LN-Code' included LN ICD codes and 'LN-No Code' did not), calibrated and validated using standard performance metrics.

RESULTS

A total of 4152 patients from University of California San Francisco Medical Center and 370 patients from Zuckerberg San Francisco General Hospital and Trauma Center met the eligibility criteria. Mean age was 50 years, 87% were female. LN diagnosis codes demonstrated low sensitivity (43-73%) but high specificity (92-97%). LN-Code achieved an area under the curve (AUC) of 0.93 and a sensitivity of 0.88 for identifying LN using the inclusive definition. LN-No Code reached an AUC of 0.91 and a sensitivity of 0.95 (0.97 for the strict definition). Both scoring systems had good external validity, calibration and performance across racial and ethnic groups.

CONCLUSIONS

This study quantified the underutilisation of LN diagnosis codes in EHRs and introduced two adaptable scoring systems to enhance LN identification. Further validation in diverse healthcare settings is essential to ensure their broader applicability.

Collapse

Tesfie TK, Anlay DZ, Abie B, Chekol YM, Gelaw NB, Tebeje TM, Animut Y. Nomogram to predict risk of neonatal mortality among preterm neonates admitted with sepsis at University of Gondar Comprehensive Specialized Hospital: risk prediction model development and validation. BMC Pregnancy Childbirth 2024;24:139. [PMID: 38360591 PMCID: PMC10868119 DOI: 10.1186/s12884-024-06306-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Accepted: 01/29/2024] [Indexed: 02/17/2024] Open

Abstract

BACKGROUND

Mortality in premature neonates is a global public health problem. In developing countries, nearly 50% of preterm births ends with death. Sepsis is one of the major causes of death in preterm neonates. Risk prediction model for mortality in preterm septic neonates helps for directing the decision making process made by clinicians.

OBJECTIVE

We aimed to develop and validate nomogram for the prediction of neonatal mortality. Nomograms are tools which assist the clinical decision making process through early estimation of risks prompting early interventions.

METHODS

A three year retrospective follow up study was conducted at University of Gondar Comprehensive Specialized Hospital and a total of 603 preterm neonates with sepsis were included. Data was collected using KoboCollect and analyzed using STATA version 16 and R version 4.2.1. Lasso regression was used to select the most potent predictors and to minimize the problem of overfitting. Nomogram was developed using multivariable binary logistic regression analysis. Model performance was evaluated using discrimination and calibration. Internal model validation was done using bootstrapping. Net benefit of the nomogram was assessed through decision curve analysis (DCA) to assess the clinical relevance of the model.

RESULT

The nomogram was developed using nine predictors: gestational age, maternal history of premature rupture of membrane, hypoglycemia, respiratory distress syndrome, perinatal asphyxia, necrotizing enterocolitis, total bilirubin, platelet count and kangaroo-mother care. The model had discriminatory power of 96.7% (95% CI: 95.6, 97.9) and P-value of 0.165 in the calibration test before and after internal validation with brier score of 0.07. Based on the net benefit analysis the nomogram was found better than treat all and treat none conditions.

CONCLUSION

The developed nomogram can be used for individualized mortality risk prediction with excellent performance, better net benefit and have been found to be useful in clinical practice with contribution in preterm neonatal mortality reduction by giving better emphasis for those at high risk.

Collapse

Zhang J, Liu Y, Zhang C, Chen Y, Hu Y, Yang X, Liu W, Zhang W, Liu D, Song H. Predicting suicidal behavior in individuals with depression over 50 years of age: Evidence from the UK biobank. Digit Health 2024;10:20552076241287450. [PMID: 39411544 PMCID: PMC11475109 DOI: 10.1177/20552076241287450] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2024] [Accepted: 09/10/2024] [Indexed: 10/19/2024] Open

Abstract

Objective

To construct applicable models suitable for predicting the risk of suicidal behavior among individuals with depression, particularly on the progression from no history of suicidal behavior to suicide attempts, as well as from suicidal ideation to suicide attempts.

Methods

Based on a prospective cohort from the UK Biobank, a total of 55,139 individuals aged 50 and above with depression were enrolled in the study, among whom 29,528 exhibited suicidal behavior. Specifically, they were divided into control (25,611), suicidal ideation (24,361), and suicide attempt (5167) groups. Least absolute shrinkage and selection operator (LASSO) regression was used to identify a subset of important features for distinguishing suicidal ideation and suicide attempts. We used the Gradient Boosting Decision Tree (GBDT) algorithm with stratified 10-fold cross-validation and grid-search to construct the prediction models for suicidal ideation or suicide attempts. To address the dataset imbalance in classifying suicide attempts, we used random under-sampling. The SHapley Additive exPlanations (SHAP) were used to estimate the important variables in the GBDT model.

Results

Significant differences in sociodemographic, economic, lifestyle, and psychological factors were observed across the three groups. Each classifier optimally utilized 8-11 features. Overall, the algorithms predicting suicide attempts demonstrated slightly higher performance than those predicting suicidal ideation. The GBDT classifier achieved the highest accuracy, with AUROC scores of 0.914 for suicide attempts and 0.803 for suicidal ideation. Distinctive predictive factors were identified for each group: while depression's inherent characteristics crucially distinguished the suicidal ideation group from controls, some key predictors, including the age of depression onset and childhood trauma events, were identified for suicide attempts.

Conclusions

We established applicable machine learning-based models for predicting suicidal behavior, particularly suicide attempts, in individuals with depression, and clarified the differences in predictors between suicidal ideation and suicide attempts.

Collapse

Affiliation(s)

Jian Zhang Mental Health Center, West China Hospital, Sichuan University, Chengdu, China West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, China
Yujun Liu West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, China Med-X Center for Informatics, Sichuan University, Chengdu, China
Chao Zhang West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, China Med-X Center for Informatics, Sichuan University, Chengdu, China
Yilong Chen West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, China Med-X Center for Informatics, Sichuan University, Chengdu, China
Yao Hu West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, China Med-X Center for Informatics, Sichuan University, Chengdu, China
Xiujia Yang University of Illinois at Urbana and Champaign, Urbana, IL, USA
Wentao Liu West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, China Med-X Center for Informatics, Sichuan University, Chengdu, China
Wei Zhang Mental Health Center, West China Hospital, Sichuan University, Chengdu, China West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, China Med-X Center for Informatics, Sichuan University, Chengdu, China
Di Liu West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, China Med-X Center for Informatics, Sichuan University, Chengdu, China Industrial Engineering, Pittsburgh Institute, Sichuan University, Chengdu, China
Huan Song West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, China Med-X Center for Informatics, Sichuan University, Chengdu, China Center of Public Health Sciences, Faculty of Medicine, University of Iceland, Reykjavík, Iceland

Collapse

Chen YL, Kraus SW, Freeman MJ, Freeman AJ. A Machine-Learning Approach to Assess Factors Associated With Hospitalization of Children and Youths in Psychiatric Crisis. Psychiatr Serv 2023;74:943-949. [PMID: 36916060 DOI: 10.1176/appi.ps.20220201] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 03/16/2023]

Abstract

OBJECTIVE

The authors used a machine-learning approach to model clinician decision making regarding psychiatric hospitalization of children and youths in crisis and to identify factors associated with the decision to hospitalize.

METHODS

Data consisted of 4,786 mobile crisis response team assessments of children and youths, ages 4.0-19.5 years (mean±SD=14.0±2.7 years, 56% female), in Nevada. The sample assessments were split into training and testing data sets. A random-forest machine-learning algorithm was used to identify variables related to the decision to hospitalize a child or youth after the crisis assessment. Results from the training sample were externally validated in the testing sample.

RESULTS

The random-forest model had good performance (area under the curve training sample=0.91, testing sample=0.92). Variables found to be important in the decision to hospitalize a child or youth were acute suicidality, followed by poor judgment or decision making, danger to others, impulsivity, runaway behavior, other risky behaviors, nonsuicidal self-injury, psychotic or depressive symptoms, sleep problems, oppositional behavior, poor functioning at home or with peers, depressive or schizophrenia spectrum disorders, and age.

CONCLUSIONS

In crisis settings, clinicians were found to mostly focus on acute factors that increased risk for danger to self or others (e.g., suicidality, poor judgment), current psychiatric symptoms (e.g., psychotic symptoms), and functioning (e.g., poor home functioning, problems with peer relationships) when deciding whether to hospitalize or stabilize a child or youth. To reduce psychiatric hospitalization, community-based services should target interventions to address these important factors associated with the need for a higher level of care among youths in psychiatric crisis.

Collapse

Chen SL, Chin SC, Chan KC, Ho CY. A Machine Learning Approach to Assess Patients with Deep Neck Infection Progression to Descending Mediastinitis: Preliminary Results. Diagnostics (Basel) 2023;13:2736. [PMID: 37685275 PMCID: PMC10486957 DOI: 10.3390/diagnostics13172736] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2023] [Revised: 07/25/2023] [Accepted: 08/22/2023] [Indexed: 09/10/2023] Open

Timilsina M, Fey D, Buosi S, Janik A, Costabello L, Carcereny E, Abreu DR, Cobo M, Castro RL, Bernabé R, Minervini P, Torrente M, Provencio M, Nováček V. Synergy between imputed genetic pathway and clinical information for predicting recurrence in early stage non-small cell lung cancer. J Biomed Inform 2023;144:104424. [PMID: 37352900 DOI: 10.1016/j.jbi.2023.104424] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2022] [Revised: 06/06/2023] [Accepted: 06/11/2023] [Indexed: 06/25/2023]

Reyes-Santias F, García-García C, Aibar-Guzmán B, García-Campos A, Cordova-Arevalo O, Mendoza-Pintos M, Cinza-Sanjurjo S, Portela-Romero M, Mazón-Ramos P, Gonzalez-Juanatey JR. Cost Analysis of Magnetic Resonance Imaging and Computed Tomography in Cardiology: A Case Study of a University Hospital Complex in the Euro Region. Healthcare (Basel) 2023;11:2084. [PMID: 37510526 PMCID: PMC10379578 DOI: 10.3390/healthcare11142084] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2023] [Revised: 07/12/2023] [Accepted: 07/17/2023] [Indexed: 07/30/2023] Open

Affiliation(s)

Francisco Reyes-Santias Servicio de Cardiología, Complejo Hospitalario Universitario de Santiago de Compostela, Choupana s/n, 15706 Santiago de Compostela, Spain Instituto de Investigación Sanitaria de Santiago de Compostela (IDIS), Choupana s/n, 15706 Santiago de Compostela, Spain Centro de Investigación Biomédica en Red-Enfermedades Cardiovasculares (CIBERCV), Av. Monforte de Lemos, 3-5. Pabellón 11. Planta 0, 28029 Madrid, Spain Department of Business, University of Vigo, 36310 Vigo, Spain
Carlos García-García Department of Pharmacology, Pharmacy and Pharmaceutical Technology, R+D Pharma Group (GI-1645), Faculty of Pharmacy, Health Research Institute of Santiago de Compostela (IDIS), University of Santiago de Compostela, 15782 Santiago de Compostela, Spain
Beatriz Aibar-Guzmán Departamento de Economía Financiera y Contabilidad, Facultad de Ciencias Económicas y Empresariales, Universidad de Santiago de Compostela, Av. Burgo, s/n, 15782 Santiago Compostela, Spain
Ana García-Campos Servicio de Cardiología, Complejo Hospitalario Universitario de Santiago de Compostela, Choupana s/n, 15706 Santiago de Compostela, Spain Instituto de Investigación Sanitaria de Santiago de Compostela (IDIS), Choupana s/n, 15706 Santiago de Compostela, Spain Centro de Investigación Biomédica en Red-Enfermedades Cardiovasculares (CIBERCV), Av. Monforte de Lemos, 3-5. Pabellón 11. Planta 0, 28029 Madrid, Spain
Octavio Cordova-Arevalo Department of Business, University of Vigo, 36310 Vigo, Spain
Margarita Mendoza-Pintos Medtronic España, S.A., 15753 Santiago de Compostela, Spain
Sergio Cinza-Sanjurjo Instituto de Investigación Sanitaria de Santiago de Compostela (IDIS), Choupana s/n, 15706 Santiago de Compostela, Spain Centro de Investigación Biomédica en Red-Enfermedades Cardiovasculares (CIBERCV), Av. Monforte de Lemos, 3-5. Pabellón 11. Planta 0, 28029 Madrid, Spain CS Milladoiro, Área Sanitaria Integrada Santiago de Compostela, 15895 Travesía do Porto, Spain
Manuel Portela-Romero Instituto de Investigación Sanitaria de Santiago de Compostela (IDIS), Choupana s/n, 15706 Santiago de Compostela, Spain Centro de Investigación Biomédica en Red-Enfermedades Cardiovasculares (CIBERCV), Av. Monforte de Lemos, 3-5. Pabellón 11. Planta 0, 28029 Madrid, Spain CS Concepción Arenal, Área Sanitaria Integrada Santiago de Compostela, Rúa de Santiago León de Caracas, 12, 15701 Santiago de Compostela, Spain
Pilar Mazón-Ramos Servicio de Cardiología, Complejo Hospitalario Universitario de Santiago de Compostela, Choupana s/n, 15706 Santiago de Compostela, Spain Instituto de Investigación Sanitaria de Santiago de Compostela (IDIS), Choupana s/n, 15706 Santiago de Compostela, Spain Centro de Investigación Biomédica en Red-Enfermedades Cardiovasculares (CIBERCV), Av. Monforte de Lemos, 3-5. Pabellón 11. Planta 0, 28029 Madrid, Spain
Jose Ramon Gonzalez-Juanatey Servicio de Cardiología, Complejo Hospitalario Universitario de Santiago de Compostela, Choupana s/n, 15706 Santiago de Compostela, Spain Instituto de Investigación Sanitaria de Santiago de Compostela (IDIS), Choupana s/n, 15706 Santiago de Compostela, Spain Centro de Investigación Biomédica en Red-Enfermedades Cardiovasculares (CIBERCV), Av. Monforte de Lemos, 3-5. Pabellón 11. Planta 0, 28029 Madrid, Spain

Collapse

Wang B, Tian P, Sun Q, Zhang H, Han L, Zhu B. A novel, effective machine learning-based RNA editing profile for predicting the prognosis of lower-grade gliomas. Heliyon 2023;9:e18075. [PMID: 37483735 PMCID: PMC10362151 DOI: 10.1016/j.heliyon.2023.e18075] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2023] [Revised: 07/02/2023] [Accepted: 07/05/2023] [Indexed: 07/25/2023] Open

Abstract

Patients with low-grade glioma (LGG) may survive for long time periods, but their tumors often progress to higher-grade lesions. Currently, no cure for LGG is available. A-to-I RNA editing accounts for nearly 90% of all RNA editing events in humans and plays a role in tumorigenesis in various cancers. However, little is known regarding its prognostic role in LGG. On the basis of The Cancer Genome Atlas (TCGA) data, we used LASSO and univariate Cox regression to construct an RNA editing site signature. The results derived from the TCGA dataset were further validated with Gene Expression Omnibus (GEO) and Chinese Glioma Genome Atlas (CGGA) datasets. Five machine learning algorithms (Decision Trees C5.0, XGboost, GBDT, Lightgbm, and Catboost) were used to confirm the prognosis associated with the RNA editing site signature. Finally, we explored immune function, immunotherapy, and potential therapeutic agents in the high- and low-risk groups by using multiple biological prediction websites. A total of 22,739 RNA editing sites were identified, and a signature model consisting of four RNA editing sites (PRKCSH|chr19:11561032, DSEL|chr18:65174489, UGGT1|chr2:128952084, and SOD2|chr6:160101723) was established. Cox regression analysis indicated that the RNA editing signature was an independent prognostic factor, according to the ROC curve (AUC = 0.823), and the nomogram model had good predictive power (C-index = 0.824). In addition, the predictive ability of the RNA editing signature was confirmed with the machine learning model. The sensitivity of PCI-34051 and Elephantin was significantly higher in the high-risk group than the low-risk group, thus potentially providing a marker to predict the effects of lung cancer drug treatment. RNA editing may serve as a novel survival prediction tool, thus offering hope for developing editing-based therapeutic strategies to combat LGG progression. In addition, this tool may help optimize survival risk assessment and individualized care for patients with low-grade gliomas.

Collapse

Liu YS, Thaliffdeen R, Han S, Park C. Use of machine learning to predict bladder cancer survival outcomes: a systematic literature review. Expert Rev Pharmacoecon Outcomes Res 2023;23:761-771. [PMID: 37306511 DOI: 10.1080/14737167.2023.2224963] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2022] [Accepted: 06/09/2023] [Indexed: 06/13/2023]

Popović Krneta M, Šobić Šaranović D, Mijatović Teodorović L, Krajčinović N, Avramović N, Bojović Ž, Bukumirić Z, Marković I, Rajšić S, Djorović BB, Artiko V, Karličić M, Tanić M. Prediction of Cervical Lymph Node Metastasis in Clinically Node-Negative T1 and T2 Papillary Thyroid Carcinoma Using Supervised Machine Learning Approach. J Clin Med 2023;12:jcm12113641. [PMID: 37297835 DOI: 10.3390/jcm12113641] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2023] [Revised: 05/19/2023] [Accepted: 05/22/2023] [Indexed: 06/12/2023] Open

Affiliation(s)

Marina Popović Krneta Department of Nuclear Medicine, Institute for Oncology and Radiology of Serbia, 11 000 Belgrade, Serbia
Dragana Šobić Šaranović Faculty of Medicine, University of Belgrade, 11 000 Belgrade, Serbia Center for Nuclear Medicine with PET, University Clinical Center of Serbia, 11 000 Belgrade, Serbia
Ljiljana Mijatović Teodorović Department of Nuclear Medicine, Institute for Oncology and Radiology of Serbia, 11 000 Belgrade, Serbia Faculty of Medical Sciences, University of Kragujevac, 34 000 Kragujevac, Serbia
Nemanja Krajčinović Department of Power, Electronics and Telecommunications, Faculty of Technical Sciences, University of Novi Sad, 21 000 Novi Sad, Serbia
Nataša Avramović Department of Power, Electronics and Telecommunications, Faculty of Technical Sciences, University of Novi Sad, 21 000 Novi Sad, Serbia
Živko Bojović Department of Power, Electronics and Telecommunications, Faculty of Technical Sciences, University of Novi Sad, 21 000 Novi Sad, Serbia
Zoran Bukumirić Institute of Medical Statistics and Informatics, Faculty of Medicine, University of Belgrade, 11 000 Belgrade, Serbia
Ivan Marković Faculty of Medicine, University of Belgrade, 11 000 Belgrade, Serbia Surgical Oncology Clinic, Institute for Oncology and Radiology of Serbia, 11 000 Belgrade, Serbia
Saša Rajšić Department of Anesthesiology and Intensive Care Medicine, Medical University Innsbruck, 6020 Innsbruck, Austria
Biljana Bazić Djorović Department of Nuclear Medicine, Institute for Oncology and Radiology of Serbia, 11 000 Belgrade, Serbia
Vera Artiko Faculty of Medicine, University of Belgrade, 11 000 Belgrade, Serbia Center for Nuclear Medicine with PET, University Clinical Center of Serbia, 11 000 Belgrade, Serbia
Mihajlo Karličić School of Electrical Engineering, University of Belgrade, 11 000 Belgrade, Serbia
Miljana Tanić Department of Experimental Oncology, Institute for Oncology and Radiology of Serbia, 11 000 Belgrade, Serbia UCL Cancer Institute, London WC1E 6DD, UK

Collapse

Brehon K, Carriere J, Churchill K, Loyola-Sanchez A, Papathanassoglou E, MacIsaac R, Tavakoli M, Ho C, Manhas KP. Evaluating Efficiency of a Provincial Telerehabilitation Service in Improving Access to Care During the COVID-19 Pandemic. Int J Telerehabil 2023;15:e6523. [PMID: 38046552 PMCID: PMC10687995 DOI: 10.5195/ijt.2023.6523] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/05/2023] Open

Manuel Román-Belmonte J, De la Corte-Rodríguez H, Adriana Rodríguez-Damiani B, Carlos Rodríguez-Merchán E. Artificial Intelligence in Musculoskeletal Conditions. ARTIF INTELL 2023. [DOI: 10.5772/intechopen.110696] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/05/2023]

Sheehy J, Rutledge H, Acharya UR, Loh HW, Gururajan R, Tao X, Zhou X, Li Y, Gurney T, Kondalsamy-Chennakesavan S. Gynecological cancer prognosis using machine learning techniques: A systematic review of last three decades (1990–2022). Artif Intell Med 2023;139:102536. [PMID: 37100507 DOI: 10.1016/j.artmed.2023.102536] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2021] [Revised: 03/19/2023] [Accepted: 03/23/2023] [Indexed: 03/30/2023]

Abstract

OBJECTIVE

Many Computer Aided Prognostic (CAP) systems based on machine learning techniques have been proposed in the field of oncology. The objective of this systematic review was to assess and critically appraise the methodologies and approaches used in predicting the prognosis of gynecological cancers using CAPs.

METHODS

Electronic databases were used to systematically search for studies utilizing machine learning methods in gynecological cancers. Study risk of bias (ROB) and applicability were assessed using the PROBAST tool. 139 studies met the inclusion criteria, of which 71 predicted outcomes for ovarian cancer patients, 41 predicted outcomes for cervical cancer patients, 28 predicted outcomes for uterine cancer patients, and 2 predicted outcomes for gynecological malignancies broadly.

RESULTS

Random forest (22.30 %) and support vector machine (21.58 %) classifiers were used most commonly. Use of clinicopathological, genomic and radiomic data as predictors was observed in 48.20 %, 51.08 % and 17.27 % of studies, respectively, with some studies using multiple modalities. 21.58 % of studies were externally validated. Twenty-three individual studies compared ML and non-ML methods. Study quality was highly variable and methodologies, statistical reporting and outcome measures were inconsistent, preventing generalized commentary or meta-analysis of performance outcomes.

CONCLUSION

There is significant variability in model development when prognosticating gynecological malignancies with respect to variable selection, machine learning (ML) methods and endpoint selection. This heterogeneity prevents meta-analysis and conclusions regarding the superiority of ML methods. Furthermore, PROBAST-mediated ROB and applicability analysis demonstrates concern for the translatability of existing models. This review identifies ways that this can be improved upon in future works to develop robust, clinically translatable models within this promising field.

Collapse

Scott-Fordsmand JJ, Amorim MJB. Using Machine Learning to make nanomaterials sustainable. THE SCIENCE OF THE TOTAL ENVIRONMENT 2023;859:160303. [PMID: 36410486 DOI: 10.1016/j.scitotenv.2022.160303] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/22/2022] [Revised: 11/06/2022] [Accepted: 11/15/2022] [Indexed: 06/16/2023]

Burnett B, Zhou SM, Brophy S, Davies P, Ellis P, Kennedy J, Bandyopadhyay A, Parker M, Lyons RA. Machine Learning in Colorectal Cancer Risk Prediction from Routinely Collected Data: A Review. Diagnostics (Basel) 2023;13:301. [PMID: 36673111 PMCID: PMC9858109 DOI: 10.3390/diagnostics13020301] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2022] [Revised: 01/05/2023] [Accepted: 01/07/2023] [Indexed: 01/15/2023] Open

Kotsyfakis S, Iliaki-Giannakoudaki E, Anagnostopoulos A, Papadokostaki E, Giannakoudakis K, Goumenakis M, Kotsyfakis M. The application of machine learning to imaging in hematological oncology: A scoping review. Front Oncol 2022;12:1080988. [PMID: 36605438 PMCID: PMC9808781 DOI: 10.3389/fonc.2022.1080988] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2022] [Accepted: 12/05/2022] [Indexed: 12/24/2022] Open

Abstract

Background

Here, we conducted a scoping review to (i) establish which machine learning (ML) methods have been applied to hematological malignancy imaging; (ii) establish how ML is being applied to hematological cancer radiology; and (iii) identify addressable research gaps.

Methods

The review was conducted according to the Preferred Reporting Items for Systematic Reviews and Meta-Analysis Extension for Scoping Reviews guidelines. The inclusion criteria were (i) pediatric and adult patients with suspected or confirmed hematological malignancy undergoing imaging (population); (ii) any study using ML techniques to derive models using radiological images to apply to the clinical management of these patients (concept); and (iii) original research articles conducted in any setting globally (context). Quality Assessment of Diagnostic Accuracy Studies 2 criteria were used to assess diagnostic and segmentation studies, while the Newcastle-Ottawa scale was used to assess the quality of observational studies.

Results

Of 53 eligible studies, 33 applied diverse ML techniques to diagnose hematological malignancies or to differentiate them from other diseases, especially discriminating gliomas from primary central nervous system lymphomas (n=18); 11 applied ML to segmentation tasks, while 9 applied ML to prognostication or predicting therapeutic responses, especially for diffuse large B-cell lymphoma. All studies reported discrimination statistics, but no study calculated calibration statistics. Every diagnostic/segmentation study had a high risk of bias due to their case-control design; many studies failed to provide adequate details of the reference standard; and only a few studies used independent validation.

Conclusion

To deliver validated ML-based models to radiologists managing hematological malignancies, future studies should (i) adhere to standardized, high-quality reporting guidelines such as the Checklist for Artificial Intelligence in Medical Imaging; (ii) validate models in independent cohorts; (ii) standardize volume segmentation methods for segmentation tasks; (iv) establish comprehensive prospective studies that include different tumor grades, comparisons with radiologists, optimal imaging modalities, sequences, and planes; (v) include side-by-side comparisons of different methods; and (vi) include low- and middle-income countries in multicentric studies to enhance generalizability and reduce inequity.

Collapse

Butner JD, Dogra P, Chung C, Pasqualini R, Arap W, Lowengrub J, Cristini V, Wang Z. Mathematical modeling of cancer immunotherapy for personalized clinical translation. NATURE COMPUTATIONAL SCIENCE 2022;2:785-796. [PMID: 38126024 PMCID: PMC10732566 DOI: 10.1038/s43588-022-00377-z] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/19/2022] [Accepted: 11/14/2022] [Indexed: 12/23/2023]

Hanis TM, Ruhaiyem NIR, Arifin WN, Haron J, Wan Abdul Rahman WF, Abdullah R, Musa KI. Over-the-Counter Breast Cancer Classification Using Machine Learning and Patient Registration Records. Diagnostics (Basel) 2022;12:diagnostics12112826. [PMID: 36428886 PMCID: PMC9689364 DOI: 10.3390/diagnostics12112826] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2022] [Revised: 10/13/2022] [Accepted: 10/15/2022] [Indexed: 11/18/2022] Open

Bai Z, Bai Y, Fang C, Chen W. Oxidative stress-related patterns determination for establishment of prognostic models, and characteristics of tumor microenvironment infiltration. Front Surg 2022;9:1013794. [PMID: 36386530 PMCID: PMC9665876 DOI: 10.3389/fsurg.2022.1013794] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2022] [Accepted: 10/14/2022] [Indexed: 12/02/2022] Open

Briggs E, de Kamps M, Hamilton W, Johnson O, McInerney CD, Neal RD. Machine Learning for Risk Prediction of Oesophago-Gastric Cancer in Primary Care: Comparison with Existing Risk-Assessment Tools. Cancers (Basel) 2022;14:cancers14205023. [PMID: 36291807 PMCID: PMC9600097 DOI: 10.3390/cancers14205023] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2022] [Revised: 10/12/2022] [Accepted: 10/12/2022] [Indexed: 11/18/2022] Open

Abstract

Simple Summary

Oesophago-gastric cancer is one of the commonest cancers worldwide, yet it can be particularly difficult to diagnose given that initial symptoms are often non-specific and routine screening is not available. Cancer risk-assessment tools, which calculate cancer risk based on symptoms and other risk factors present in the primary care record, can aid decisions on referrals for cancer investigations, facilitating earlier diagnosis. Diagnosing common cancers earlier could help improve survival rates. Using UK primary care electronic health record data, we compared five different machine learning techniques for probabilistic classification of cancer patients against a current widely used UK primary care cancer risk-assessment tool. The machine learning algorithms outperformed the current risk-assessment tool, with a higher overall accuracy and an ability to reasonably identify 11–25% more cancer patients. We conclude that machine-learning-based risk-assessment tools could help better identify suitable patients for further investigation and support earlier diagnosis.

Abstract

Oesophago-gastric cancer is difficult to diagnose in the early stages given its typical non-specific initial manifestation. We hypothesise that machine learning can improve upon the diagnostic performance of current primary care risk-assessment tools by using advanced analytical techniques to exploit the wealth of evidence available in the electronic health record. We used a primary care electronic health record dataset derived from the UK General Practice Research Database (7471 cases; 32,877 controls) and developed five probabilistic machine learning classifiers: Support Vector Machine, Random Forest, Logistic Regression, Naïve Bayes, and Extreme Gradient Boosted Decision Trees. Features included basic demographics, symptoms, and lab test results. The Logistic Regression, Support Vector Machine, and Extreme Gradient Boosted Decision Tree models achieved the highest performance in terms of accuracy and AUROC (0.89 accuracy, 0.87 AUROC), outperforming a current UK oesophago-gastric cancer risk-assessment tool (ogRAT). Machine learning also identified more cancer patients than the ogRAT: 11.0% more with little to no effect on false positives, or up to 25.0% more with a slight increase in false positives (for Logistic Regression, results threshold-dependent). Feature contribution estimates and individual prediction explanations indicated clinical relevance. We conclude that machine learning could improve primary care cancer risk-assessment tools, potentially helping clinicians to identify additional cancer cases earlier. This could, in turn, improve survival outcomes.

Collapse

Izadi Z, Gianfrancesco MA, Aguirre A, Strangfeld A, Mateus EF, Hyrich KL, Gossec L, Carmona L, Lawson‐Tovey S, Kearsley‐Fleet L, Schaefer M, Seet AM, Schmajuk G, Jacobsohn L, Katz P, Rush S, Al‐Emadi S, Sparks JA, Hsu TY, Patel NJ, Wise L, Gilbert E, Duarte‐García A, Valenzuela‐Almada MO, Ugarte‐Gil MF, Ribeiro SLE, de Oliveira Marinho A, de Azevedo Valadares LD, Giuseppe DD, Hasseli R, Richter JG, Pfeil A, Schmeiser T, Isnardi CA, Reyes Torres AA, Alle G, Saurit V, Zanetti A, Carrara G, Labreuche J, Barnetche T, Herasse M, Plassart S, Santos MJ, Rodrigues AM, Robinson PC, Machado PM, Sirotich E, Liew JW, Hausmann JS, Sufka P, Grainger R, Bhana S, Costello W, Wallace ZS, Yazdany J. Development of a Prediction Model for COVID-19 Acute Respiratory Distress Syndrome in Patients With Rheumatic Diseases: Results From the Global Rheumatology Alliance Registry. ACR Open Rheumatol 2022;4:872-882. [PMID: 35869686 PMCID: PMC9350083 DOI: 10.1002/acr2.11481] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2022] [Accepted: 05/31/2022] [Indexed: 11/09/2022] Open

Affiliation(s)

Zara Izadi University of CaliforniaSan Francisco
Milena A. Gianfrancesco University of CaliforniaSan Francisco
Alfredo Aguirre University of CaliforniaSan Francisco
Anja Strangfeld Deutsches Rheuma‐Forschungszentrum BerlinBerlinGermany
Elsa F. Mateus Portuguese League Against Rheumatic DiseasesLisbonPortugal
Kimme L. Hyrich The University of Manchester and National Institute for Health Research Manchester Biomedical Research Centre, Manchester University and NHS Foundation TrustManchesterUK
Laure Gossec INSERM, Sorbonne Universite and Hopital Universitaire Pitie Salpetriere, AP‐HPParisFrance
Loreto Carmona Instituto de Salud MusculoesqueléticaMadridSpain
Saskia Lawson‐Tovey The University of Manchester and National Institute for Health Research Manchester Biomedical Research Centre, Manchester University NHS Foundation Trust and Manchester Academic Health Science CentreManchesterUK
Lianne Kearsley‐Fleet The University of Manchester and Manchester Academic Health Science CentreManchesterUK
Martin Schaefer German Rheumatism Research CenterBerlinGermany
Andrea M. Seet University of CaliforniaSan Francisco
Gabriela Schmajuk University of CaliforniaSan Francisco and San Francisco Department of Veterans Affairs Medical Center
Lindsay Jacobsohn University of CaliforniaSan Francisco
Patricia Katz University of CaliforniaSan Francisco
Stephanie Rush University of CaliforniaSan Francisco
Samar Al‐Emadi Hamad Medical CorporationDohaQatar
Jeffrey A. Sparks Brigham and Women's Hospital and Harvard Medical SchoolBostonMassachusetts
Tiffany Y‐T Hsu Brigham and Women's Hospital and Harvard Medical SchoolBostonMassachusetts
Naomi J. Patel Massachusetts General Hospital and Harvard Medical SchoolBoston
Leanna Wise University of Southern CaliforniaLos Angeles
Emily Gilbert Mayo ClinicJacksonvilleFlorida
Alí Duarte‐García Mayo ClinicRochesterMinnesota
Maria O. Valenzuela‐Almada Mayo ClinicRochesterMinnesota
Manuel F. Ugarte‐Gil Universidad Científica del Sur and Hospital Nacional Guillermo Almenara IrigoyenEsSalud, LimaPeru
Sandra Lúcia Euzébio Ribeiro Universidade Federal do AmazonasManausBrazil
Adriana de Oliveira Marinho Fundação Hospitalar do AcreRio BrancoBrazil
Lilian David de Azevedo Valadares Universidade Federal de PernambucoRecifeBrazil
Daniela Di Giuseppe Karolinska InstitutetStockholmSweden
Rebecca Hasseli Justus‐Liebig University Giessen, Campus KerckhoffGiessenGermany
Jutta G. Richter Heinrich‐Heine‐University DüsseldorfDüsseldorfGermany
Alexander Pfeil Jena University Hospital and Friedrich Schiller University JenaJenaGermany
Tim Schmeiser Rheumatology im Veedel (Private Practice)CologneGermany
Carolina A. Isnardi Argentine Society of RheumatologyBuenos AiresArgentina
Alvaro A. Reyes Torres Hospital Italiano de Buenos AiresBuenos AiresArgentina
Gelsomina Alle Hospital Italiano de Buenos AiresBuenos AiresArgentina
Verónica Saurit Hospital Privado Universitario de CórdobaCórdobaArgentina
Anna Zanetti Italian Society for Rheumatology and University of Milano‐BicoccaMilanItaly
Greta Carrara Italian Society for Rheumatology and University of Milano‐BicoccaMilanItaly
Julien Labreuche Centre Hospitalier Universitaire de LilleLilleFrance
Thomas Barnetche FHU ACRONIM, Centre for Autoimmune Systemic Rare Diseases, Bordeaux University HospitalBordeauxFrance
Muriel Herasse Filière des Maladies Autoimmunes et Autoinflammatoires Rares, Hôpital Huriez, Centre Hospitalier Universitaire de LilleLilleFrance
Samira Plassart Filière des Maladies Autoimmunes et Autoinflammatoires Rares, Hôpital Huriez, Centre Hospitalier Universitaire de LilleLilleFrance
Maria José Santos Hospital Garcia de Orta, Almada, Portugal, and Instituto de Medicina Molecular Faculdade Medicina and Rheumatic Diseases Portuguese RegisterLisbonPortugal
Ana Maria Rodrigues Rheumatic Diseases Portuguese Register, Sociedade Portuguesa de Reumatologia, Nova Medical School, and Hospital dos LusiadasLisbonPortugal
Philip C. Robinson The University of Queensland, Brisbane, Queensland, Australia, and Royal Brisbane and Women's Hospital, Metro North Hospital and Health ServiceHerstonQueenslandAustralia
Pedro M. Machado University College London, University College London Hospitals NHS Foundation Trust and Northwick Park Hospital, London North West University Healthcare NHS TrustLondonUK
Emily Sirotich McMaster University, Hamilton, Ontario, Canada, and Canadian Arthritis Patient AllianceTorontoOntarioCanada
Jean W. Liew Boston University School of MedicineBostonMassachusetts
Jonathan S. Hausmann Beth Israel Deaconess Medical Center, Harvard Medical School and Boston Children's HospitalBostonMassachusetts
Paul Sufka HealthPartnersSt. PaulMinnesota
Rebecca Grainger University of OtagoWellingtonWellingtonNew Zealand
Suleman Bhana Pfizer Inc.New YorkNew York
Wendy Costello Irish Children's Arthritis NetworkTipperaryIreland
Zachary S. Wallace Massachusetts General Hospital and Harvard Medical SchoolBoston
Jinoos Yazdany University of CaliforniaSan Francisco
Global Rheumatology Alliance Registry

Collapse

Roman-Belmonte JM, De la Corte-Rodriguez H, Rodriguez-Merchan EC, Vazquez-Sasot A, Rodriguez-Damiani BA, Resino-Luís C, Sanchez-Laguna F. The three horizons model applied to medical science. Postgrad Med 2022;134:776-783. [DOI: 10.1080/00325481.2022.2124086] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/14/2022]

Lui TKL, Cheung KS, Leung WK. Machine learning models in the prediction of 1-year mortality in patients with advanced hepatocellular cancer on immunotherapy: a proof-of-concept study. Hepatol Int 2022;16:879-891. [PMID: 35779202 DOI: 10.1007/s12072-022-10370-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/02/2021] [Accepted: 05/22/2022] [Indexed: 11/28/2022]

Abstract

INTRODUCTION

Immunotherapy is a new promising treatment for patients with advanced hepatocellular carcinoma (HCC), but is costly and potentially associated with considerable side effects. This study aimed to evaluate the role of machine learning (ML) models in predicting the 1-year cancer-related mortality in advanced HCC patients treated with immunotherapy.

METHOD

395 HCC patients who had received immunotherapy (including nivolumab, pembrolizumab or ipilimumab) between 2014 and 2019 in Hong Kong were included. The whole data sets were randomly divided into training (n = 316) and internal validation (n = 79) set. The data set, including 47 clinical variables, was used to construct six different ML models in predicting the risk of 1-year mortality. The performances of ML models were measured by the area under receiver operating characteristic curve (AUC) and their performances were compared with C-Reactive protein and Alpha Fetoprotein in ImmunoTherapY score (CRAFITY) and albumin-bilirubin (ALBI) score. The ML models were further validated with an external cohort between 2020 and 2021.

RESULTS

The 1-year cancer-related mortality was 51.1%. Of the six ML models, the random forest (RF) has the highest AUC of 0.92 (95% CI 0.87-0.98), which was better than logistic regression (0.82, p = 0.01) as well as the CRAFITY (0.68, p < 0.01) and ALBI score (0.84, p = 0.04). RF had the lowest false positive (2.0%) and false negative rate (5.2%), and performed better than CRAFITY score in the external validation cohort (0.91 vs 0.66, p < 0.01). High baseline AFP, bilirubin and alkaline phosphatase were three common risk factors identified by all ML models.

CONCLUSION

ML models could predict 1-year cancer-related mortality in HCC patients treated with immunotherapy, which may help to select patients who would benefit from this treatment.

Collapse

Exploring the Utility of Anonymized EHR Datasets in Machine Learning Experiments in the Context of the MODELHealth Project. APPLIED SCIENCES-BASEL 2022. [DOI: 10.3390/app12125942] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/10/2022]

Greenberg JK, Otun A, Ghogawala Z, Yen PY, Molina CA, Limbrick DD, Foraker RE, Kelly MP, Ray WZ. Translating Data Analytics Into Improved Spine Surgery Outcomes: A Roadmap for Biomedical Informatics Research in 2021. Global Spine J 2022;12:952-963. [PMID: 33973491 PMCID: PMC9344511 DOI: 10.1177/21925682211008424] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open

Prediction of Trypanosoma evansi infection in dromedaries using artificial neural network (ANN). Vet Parasitol 2022;306:109716. [DOI: 10.1016/j.vetpar.2022.109716] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2021] [Revised: 05/05/2022] [Accepted: 05/06/2022] [Indexed: 11/20/2022]

Kwong JC, Khondker A, Kim JK, Chua M, Keefe DT, Dos Santos J, Skreta M, Erdman L, D'Souza N, Selman AF, Weaver J, Weiss DA, Long C, Tasian G, Teoh CW, Rickard M, Lorenzo AJ. Posterior Urethral Valves Outcomes Prediction (PUVOP): a machine learning tool to predict clinically relevant outcomes in boys with posterior urethral valves. Pediatr Nephrol 2022;37:1067-1074. [PMID: 34686914 DOI: 10.1007/s00467-021-05321-3] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/29/2021] [Revised: 09/11/2021] [Accepted: 09/28/2021] [Indexed: 10/20/2022]

Abstract

BACKGROUND

Early kidney and anatomic features may be predictive of future progression and need for additional procedures in patients with posterior urethral valve (PUV). The objective of this study was to use machine learning (ML) to predict clinically relevant outcomes in these patients.

METHODS

Patients diagnosed with PUV with kidney function measurements at our institution between 2000 and 2020 were included. Pertinent clinical measures were abstracted, including estimated glomerular filtration rate (eGFR) at each visit, initial vesicoureteral reflux grade, and renal dysplasia at presentation. ML models were developed to predict clinically relevant outcomes: progression in CKD stage, initiation of kidney replacement therapy (KRT), and need for clean-intermittent catheterization (CIC). Model performance was assessed by concordance index (c-index) and the model was externally validated.

RESULTS

A total of 103 patients were included with a median follow-up of 5.7 years. Of these patients, 26 (25%) had CKD progression, 18 (17%) required KRT, and 32 (31%) were prescribed CIC. Additionally, 22 patients were included for external validation. The ML model predicted CKD progression (c-index = 0.77; external C-index = 0.78), KRT (c-index = 0.95; external C-index = 0.89) and indicated CIC (c-index = 0.70; external C-index = 0.64), and all performed better than Cox proportional-hazards regression. The models have been packaged into a simple easy-to-use tool, available at https://share.streamlit.io/jcckwong/puvop/main/app.py CONCLUSION: ML-based approaches for predicting clinically relevant outcomes in PUV are feasible. Further validation is warranted, but this implementable model can act as a decision-making aid. A higher resolution version of the Graphical abstract is available as Supplementary information.

Collapse

Affiliation(s)

Jethro Cc Kwong Division of Urology, Department of Surgery, University of Toronto, Toronto, ON, Canada.,Division of Urology, Department of Surgery, Hospital for Sick Children, 555 University Avenue, Toronto, ON, M5G 1X8, Canada
Adree Khondker Division of Urology, Department of Surgery, Hospital for Sick Children, 555 University Avenue, Toronto, ON, M5G 1X8, Canada.,Temerty Faculty of Medicine, University of Toronto, Toronto, ON, Canada
Jin Kyu Kim Division of Urology, Department of Surgery, University of Toronto, Toronto, ON, Canada.,Division of Urology, Department of Surgery, Hospital for Sick Children, 555 University Avenue, Toronto, ON, M5G 1X8, Canada
Michael Chua Division of Urology, Department of Surgery, Hospital for Sick Children, 555 University Avenue, Toronto, ON, M5G 1X8, Canada
Daniel T Keefe Division of Urology, Department of Surgery, Hospital for Sick Children, 555 University Avenue, Toronto, ON, M5G 1X8, Canada
Joana Dos Santos Division of Urology, Department of Surgery, Hospital for Sick Children, 555 University Avenue, Toronto, ON, M5G 1X8, Canada
Marta Skreta Centre for Computational Medicine, The Hospital for Sick Children, Toronto, ON, Canada
Lauren Erdman Centre for Computational Medicine, The Hospital for Sick Children, Toronto, ON, Canada
Neeta D'Souza Division of Urology, Children's Hospital of Philadelphia, Philadelphia, PA, USA
Antoine Fermin Selman Division of Urology, Children's Hospital of Philadelphia, Philadelphia, PA, USA
John Weaver Division of Urology, Children's Hospital of Philadelphia, Philadelphia, PA, USA
Dana A Weiss Division of Urology, Children's Hospital of Philadelphia, Philadelphia, PA, USA
Christopher Long Division of Urology, Children's Hospital of Philadelphia, Philadelphia, PA, USA
Gregory Tasian Division of Urology, Children's Hospital of Philadelphia, Philadelphia, PA, USA
Chia Wei Teoh Division of Nephrology, Hospital for Sick Children, Toronto, ON, Canada.,Department of Paediatrics, University of Toronto, Toronto, ON, Canada
Mandy Rickard Division of Urology, Department of Surgery, Hospital for Sick Children, 555 University Avenue, Toronto, ON, M5G 1X8, Canada
Armando J Lorenzo Division of Urology, Department of Surgery, University of Toronto, Toronto, ON, Canada. .,Division of Urology, Department of Surgery, Hospital for Sick Children, 555 University Avenue, Toronto, ON, M5G 1X8, Canada.

Collapse

Appelbaum L, Kaplan ID, Palchuk MB, Kundrot S, Winer-Jones JP, Rinard M. Development and Experience with Cancer Risk Prediction Models Using Federated Databases and Electronic Health Records. Digit Health 2022. [DOI: 10.36255/exon-publications-digital-health-federated-databases] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open

Lin KW, Ang TL, Li JW. Role of artificial intelligence in early detection and screening for pancreatic adenocarcinoma. Artif Intell Med Imaging 2022;3:21-32. [DOI: 10.35711/aimi.v3.i2.21] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/16/2021] [Revised: 02/12/2022] [Accepted: 03/17/2022] [Indexed: 02/06/2023] Open

A review on machine learning techniques for the assessment of image grading in breast mammogram. INT J MACH LEARN CYB 2022. [DOI: 10.1007/s13042-022-01546-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]

Qiu H, Ding S, Liu J, Wang L, Wang X. Applications of Artificial Intelligence in Screening, Diagnosis, Treatment, and Prognosis of Colorectal Cancer. Curr Oncol 2022;29:1773-1795. [PMID: 35323346 PMCID: PMC8947571 DOI: 10.3390/curroncol29030146] [Citation(s) in RCA: 31] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2022] [Revised: 02/28/2022] [Accepted: 03/03/2022] [Indexed: 12/29/2022] Open

Machine Learning-Based Risk Prediction of Critical Care Unit Admission for Advanced Stage High Grade Serous Ovarian Cancer Patients Undergoing Cytoreductive Surgery: The Leeds-Natal Score. J Clin Med 2021;11:jcm11010087. [PMID: 35011828 PMCID: PMC8745521 DOI: 10.3390/jcm11010087] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2021] [Revised: 12/20/2021] [Accepted: 12/22/2021] [Indexed: 12/12/2022] Open

Francisco ME, Carvajal TM, Ryo M, Nukazawa K, Amalin DM, Watanabe K. Dengue disease dynamics are modulated by the combined influences of precipitation and landscape: A machine learning approach. THE SCIENCE OF THE TOTAL ENVIRONMENT 2021;792:148406. [PMID: 34157535 DOI: 10.1016/j.scitotenv.2021.148406] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/21/2021] [Revised: 05/25/2021] [Accepted: 06/08/2021] [Indexed: 06/13/2023]

Abstract

BACKGROUND

Dengue is an endemic vector-borne disease influenced by environmental factors such as landscape and climate. Previous studies separately assessed the effects of landscape and climate factors on mosquito occurrence and dengue incidence. However, both factors concurrently coexist in time and space and can interact, affecting mosquito development and dengue disease transmission. For example, eggs laid in a suitable environment can hatch after being submerged in rain water. It has been difficult for conventional statistical modeling approaches to demonstrate these combined influences due to mathematical constraints.

OBJECTIVES

To investigate the combined influences of landscape and climate factors on mosquito occurrence and dengue incidence.

METHODS

Entomological, epidemiological, and landscape data from the rainy season (July-December) were obtained from respective government agencies in Metropolitan Manila, Philippines, from 2012 to 2014. Temperature, precipitation and vegetation data were obtained through remote sensing. A random forest algorithm was used to select the landscape and climate variables. Afterward, using the identified key variables, a model-based (MOB) recursive partitioning was implemented to test the combined influences of landscape and climate factors on ovitrap index (vector mosquito occurrence) and dengue incidence.

RESULTS

The MOB recursive partitioning for ovitrap index indicated a high sensitivity of vector mosquito occurrence on environmental conditions generated by a combination of high residential density areas with low precipitation. Moreover, the MOB recursive partitioning indicated high sensitivity of dengue incidence to the effects of precipitation in areas with high proportions of residential density and commercial areas.

CONCLUSIONS

Dengue dynamics are not solely influenced by individual effects of either climate or landscape, but rather by their synergistic or combined effects. The presented findings have the potential to target vector surveillance in areas identified as suitable for mosquito occurrence under specific climatic conditions and may be relevant as part of urban planning strategies to control dengue.

Collapse

MI-MOTE: Multiple imputation-based minority oversampling technique for imbalanced and incomplete data classification. Inf Sci (N Y) 2021. [DOI: 10.1016/j.ins.2021.06.043] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]

Machine Learning in the Differentiation of Soft Tissue Neoplasms: Comparison of Fat-Suppressed T2WI and Apparent Diffusion Coefficient (ADC) Features-Based Models. J Digit Imaging 2021;34:1146-1155. [PMID: 34545474 PMCID: PMC8554992 DOI: 10.1007/s10278-021-00513-7] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2021] [Revised: 08/18/2021] [Accepted: 08/22/2021] [Indexed: 12/26/2022] Open

Abdullah Alfayez A, Kunz H, Grace Lai A. Predicting the risk of cancer in adults using supervised machine learning: a scoping review. BMJ Open 2021;11:e047755. [PMID: 34521662 PMCID: PMC8442074 DOI: 10.1136/bmjopen-2020-047755] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/20/2020] [Accepted: 09/01/2021] [Indexed: 12/20/2022] Open

Abstract

OBJECTIVES

The purpose of this scoping review is to: (1) identify existing supervised machine learning (ML) approaches on the prediction of cancer in asymptomatic adults; (2) to compare the performance of ML models with each other and (3) to identify potential gaps in research.

DESIGN

Scoping review using the population, concept and context approach.

SEARCH STRATEGY

PubMed search engine was used from inception to 10 November 2020 to identify literature meeting following inclusion criteria: (1) a general adult (≥18 years) population, either sex, asymptomatic (population); (2) any study using ML techniques to derive predictive models for future cancer risk using clinical and/or demographic and/or basic laboratory data (concept) and (3) original research articles conducted in all settings in any region of the world (context).

RESULTS

The search returned 627 unique articles, of which 580 articles were excluded because they did not meet the inclusion criteria, were duplicates or were related to benign neoplasm. Full-text reviews were conducted for 47 articles and a final set of 10 articles were included in this scoping review. These 10 very heterogeneous studies used ML to predict future cancer risk in asymptomatic individuals. All studies reported area under the receiver operating characteristics curve (AUC) values as metrics of model performance, but no study reported measures of model calibration.

CONCLUSIONS

Research gaps that must be addressed in order to deliver validated ML-based models to assist clinical decision-making include: (1) establishing model generalisability through validation in independent cohorts, including those from low-income and middle-income countries; (2) establishing models for all cancer types; (3) thorough comparisons of ML models with best available clinical tools to ensure transparency of their potential clinical utility; (4) reporting of model calibration performance and (5) comparisons of different methods on the same cohort to reveal important information about model generalisability and performance.

Collapse

Oei RW, Lyu Y, Ye L, Kong F, Du C, Zhai R, Xu T, Shen C, He X, Kong L, Hu C, Ying H. Progression-Free Survival Prediction in Patients with Nasopharyngeal Carcinoma after Intensity-Modulated Radiotherapy: Machine Learning vs. Traditional Statistics. J Pers Med 2021;11:jpm11080787. [PMID: 34442430 PMCID: PMC8398698 DOI: 10.3390/jpm11080787] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2021] [Revised: 08/08/2021] [Accepted: 08/10/2021] [Indexed: 12/24/2022] Open

Affiliation(s)

Ronald Wihal Oei Department of Radiation Oncology, Fudan University Shanghai Cancer Center, Shanghai 200032, China; (R.W.O.); (Y.L.); (L.Y.); (F.K.); (C.D.); (R.Z.); (T.X.); (C.S.); (X.H.); (L.K.); (C.H.) Department of Oncology, Shanghai Medical College, Fudan University, Shanghai 200032, China
Yingchen Lyu Department of Radiation Oncology, Fudan University Shanghai Cancer Center, Shanghai 200032, China; (R.W.O.); (Y.L.); (L.Y.); (F.K.); (C.D.); (R.Z.); (T.X.); (C.S.); (X.H.); (L.K.); (C.H.) Department of Oncology, Shanghai Medical College, Fudan University, Shanghai 200032, China
Lulu Ye Department of Radiation Oncology, Fudan University Shanghai Cancer Center, Shanghai 200032, China; (R.W.O.); (Y.L.); (L.Y.); (F.K.); (C.D.); (R.Z.); (T.X.); (C.S.); (X.H.); (L.K.); (C.H.) Department of Oncology, Shanghai Medical College, Fudan University, Shanghai 200032, China
Fangfang Kong Department of Radiation Oncology, Fudan University Shanghai Cancer Center, Shanghai 200032, China; (R.W.O.); (Y.L.); (L.Y.); (F.K.); (C.D.); (R.Z.); (T.X.); (C.S.); (X.H.); (L.K.); (C.H.) Department of Oncology, Shanghai Medical College, Fudan University, Shanghai 200032, China
Chengrun Du Department of Radiation Oncology, Fudan University Shanghai Cancer Center, Shanghai 200032, China; (R.W.O.); (Y.L.); (L.Y.); (F.K.); (C.D.); (R.Z.); (T.X.); (C.S.); (X.H.); (L.K.); (C.H.) Department of Oncology, Shanghai Medical College, Fudan University, Shanghai 200032, China
Ruiping Zhai Department of Radiation Oncology, Fudan University Shanghai Cancer Center, Shanghai 200032, China; (R.W.O.); (Y.L.); (L.Y.); (F.K.); (C.D.); (R.Z.); (T.X.); (C.S.); (X.H.); (L.K.); (C.H.) Department of Oncology, Shanghai Medical College, Fudan University, Shanghai 200032, China
Tingting Xu Department of Radiation Oncology, Fudan University Shanghai Cancer Center, Shanghai 200032, China; (R.W.O.); (Y.L.); (L.Y.); (F.K.); (C.D.); (R.Z.); (T.X.); (C.S.); (X.H.); (L.K.); (C.H.) Department of Oncology, Shanghai Medical College, Fudan University, Shanghai 200032, China
Chunying Shen Department of Radiation Oncology, Fudan University Shanghai Cancer Center, Shanghai 200032, China; (R.W.O.); (Y.L.); (L.Y.); (F.K.); (C.D.); (R.Z.); (T.X.); (C.S.); (X.H.); (L.K.); (C.H.) Department of Oncology, Shanghai Medical College, Fudan University, Shanghai 200032, China
Xiayun He Department of Radiation Oncology, Fudan University Shanghai Cancer Center, Shanghai 200032, China; (R.W.O.); (Y.L.); (L.Y.); (F.K.); (C.D.); (R.Z.); (T.X.); (C.S.); (X.H.); (L.K.); (C.H.) Department of Oncology, Shanghai Medical College, Fudan University, Shanghai 200032, China
Lin Kong Department of Radiation Oncology, Fudan University Shanghai Cancer Center, Shanghai 200032, China; (R.W.O.); (Y.L.); (L.Y.); (F.K.); (C.D.); (R.Z.); (T.X.); (C.S.); (X.H.); (L.K.); (C.H.) Department of Oncology, Shanghai Medical College, Fudan University, Shanghai 200032, China
Chaosu Hu Department of Radiation Oncology, Fudan University Shanghai Cancer Center, Shanghai 200032, China; (R.W.O.); (Y.L.); (L.Y.); (F.K.); (C.D.); (R.Z.); (T.X.); (C.S.); (X.H.); (L.K.); (C.H.) Department of Oncology, Shanghai Medical College, Fudan University, Shanghai 200032, China
Hongmei Ying Department of Radiation Oncology, Fudan University Shanghai Cancer Center, Shanghai 200032, China; (R.W.O.); (Y.L.); (L.Y.); (F.K.); (C.D.); (R.Z.); (T.X.); (C.S.); (X.H.); (L.K.); (C.H.) Department of Oncology, Shanghai Medical College, Fudan University, Shanghai 200032, China Correspondence: ; Tel.: +86-21-64175590; Fax: +86-21-6417477

Collapse

Nordin N, Zainol Z, Mohd Noor MH, Lai Fong C. A comparative study of machine learning techniques for suicide attempts predictive model. Health Informatics J 2021;27:1460458221989395. [PMID: 33745355 DOI: 10.1177/1460458221989395] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]

Chakraborty D, Ivan C, Amero P, Khan M, Rodriguez-Aguayo C, Başağaoğlu H, Lopez-Berestein G. Explainable Artificial Intelligence Reveals Novel Insight into Tumor Microenvironment Conditions Linked with Better Prognosis in Patients with Breast Cancer. Cancers (Basel) 2021;13:3450. [PMID: 34298668 PMCID: PMC8303703 DOI: 10.3390/cancers13143450] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2021] [Revised: 07/06/2021] [Accepted: 07/06/2021] [Indexed: 12/29/2022] Open

A Classification Approach for Cancer Survivors from Those Cancer-Free, Based on Health Behaviors: Analysis of the Lifelines Cohort. Cancers (Basel) 2021;13:cancers13102335. [PMID: 34066093 PMCID: PMC8151639 DOI: 10.3390/cancers13102335] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2021] [Revised: 05/06/2021] [Accepted: 05/07/2021] [Indexed: 12/29/2022] Open

Abstract

Simple Summary

Health behaviors affect health status in cancer survivors. We aimed to identify such key health behaviors using nonlinear algorithms and compare their classification performance with logistic regression, for distinguishing cancer survivors from those cancer-free in a population-based cohort. We used health behaviors and socioeconomic factors for analysis. Participants from the Lifelines population-based cohort were binary classified as cancer survivors or cancer-free using nonlinear algorithms or logistic regression. Data were collected for 107,624 cancer-free participants and 2760 cancer survivors. Using all variables, algorithms obtained an area under the receiver operator curve (AUC) of 0.75 ± 0.01. Using only health behaviors, the algorithms differentiated cancer survivors from cancer-free participants with AUCs of 0.62 ± 0.01 and 0.60 ± 0.01, respectively. In the case–control analyses, both algorithms produced AUCs of 0.52 ± 0.01. The main distinctive classifier was age. No key health behaviors were identified by linear and nonlinear algorithms to differentiate cancer survivors from cancer-free participants.

Abstract

Health behaviors affect health status in cancer survivors. We hypothesized that nonlinear algorithms would identify distinct key health behaviors compared to a linear algorithm and better classify cancer survivors. We aimed to use three nonlinear algorithms to identify such key health behaviors and compare their performances with that of a logistic regression for distinguishing cancer survivors from those without cancer in a population-based cohort study. We used six health behaviors and three socioeconomic factors for analysis. Participants from the Lifelines population-based cohort were binary classified into a cancer-survivors group and a cancer-free group using either nonlinear algorithms or logistic regression, and their performances were compared by the area under the curve (AUC). In addition, we performed case–control analyses (matched by age, sex, and education level) to evaluate classification performance only by health behaviors. Data were collected for 107,624 cancer free participants and 2760 cancer survivors. Using all variables resulted an AUC of 0.75 ± 0.01, using only six health behaviors, the logistic regression and nonlinear algorithms differentiated cancer survivors from cancer-free participants with AUCs of 0.62 ± 0.01 and 0.60 ± 0.01, respectively. The main distinctive classifier was age. Though not relevant to classification, the main distinctive health behaviors were body mass index and alcohol consumption. In the case–control analyses, algorithms produced AUCs of 0.52 ± 0.01. No key health behaviors were identified by linear and nonlinear algorithms to differentiate cancer survivors from cancer-free participants in this population-based cohort.

Collapse

Gupta S, Gupta MK. A comprehensive data‐level investigation of cancer diagnosis on imbalanced data. Comput Intell 2021. [DOI: 10.1111/coin.12452] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]

Prediction of Incident Cancers in the Lifelines Population-Based Cohort. Cancers (Basel) 2021;13:cancers13092133. [PMID: 33925159 PMCID: PMC8125183 DOI: 10.3390/cancers13092133] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2021] [Accepted: 04/23/2021] [Indexed: 12/23/2022] Open

Abstract

Simple Summary

The accurate prediction of incident cancers could be relevant to understanding and reducing cancer incidence. The aim of this study was to develop machine learning (ML) models that could predict an incident diagnosis of cancer. Data were available for 116,188 cancer-free participants and 4232 incident cancer cases. The main outcome was an incident cancer (excluding skin cancer) during follow-up assessment in a population-based cohort. The performance of three ML algorithms was evaluated using supervised binary classification to identify incident cancers among participants. An overall area under the receiver operator curve (AUC) < 0.75 was obtained; the highest AUC was for prostate cancer AUC > 0.80. Linear and non-linear ML algorithms including socioeconomic, lifestyle, and clinical variables produced a moderate predictive performance of incident cancers in the Lifelines cohort.

Abstract

Cancer incidence is rising, and accurate prediction of incident cancers could be relevant to understanding and reducing cancer incidence. The aim of this study was to develop machine learning (ML) models that could predict an incident diagnosis of cancer. Participants without any history of cancer within the Lifelines population-based cohort were followed for a median of 7 years. Data were available for 116,188 cancer-free participants and 4232 incident cancer cases. At baseline, socioeconomic, lifestyle, and clinical variables were assessed. The main outcome was an incident cancer during follow-up (excluding skin cancer), based on linkage with the national pathology registry. The performance of three ML algorithms was evaluated using supervised binary classification to identify incident cancers among participants. Elastic net regularization and Gini index were used for variables selection. An overall area under the receiver operator curve (AUC) <0.75 was obtained, the highest AUC value was for prostate cancer (random forest AUC = 0.82 (95% CI 0.77–0.87), logistic regression AUC = 0.81 (95% CI 0.76–0.86), and support vector machines AUC = 0.83 (95% CI 0.78–0.88), respectively); age was the most important predictor in these models. Linear and non-linear ML algorithms including socioeconomic, lifestyle, and clinical variables produced a moderate predictive performance of incident cancers in the Lifelines cohort.

Collapse

Li J, Zhou Z, Dong J, Fu Y, Li Y, Luan Z, Peng X. Predicting breast cancer 5-year survival using machine learning: A systematic review. PLoS One 2021;16:e0250370. [PMID: 33861809 PMCID: PMC8051758 DOI: 10.1371/journal.pone.0250370] [Citation(s) in RCA: 42] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2021] [Accepted: 04/06/2021] [Indexed: 12/24/2022] Open

Abstract

BACKGROUND

Accurately predicting the survival rate of breast cancer patients is a major issue for cancer researchers. Machine learning (ML) has attracted much attention with the hope that it could provide accurate results, but its modeling methods and prediction performance remain controversial. The aim of this systematic review is to identify and critically appraise current studies regarding the application of ML in predicting the 5-year survival rate of breast cancer.

METHODS

In accordance with the PRISMA guidelines, two researchers independently searched the PubMed (including MEDLINE), Embase, and Web of Science Core databases from inception to November 30, 2020. The search terms included breast neoplasms, survival, machine learning, and specific algorithm names. The included studies related to the use of ML to build a breast cancer survival prediction model and model performance that can be measured with the value of said verification results. The excluded studies in which the modeling process were not explained clearly and had incomplete information. The extracted information included literature information, database information, data preparation and modeling process information, model construction and performance evaluation information, and candidate predictor information.

RESULTS

Thirty-one studies that met the inclusion criteria were included, most of which were published after 2013. The most frequently used ML methods were decision trees (19 studies, 61.3%), artificial neural networks (18 studies, 58.1%), support vector machines (16 studies, 51.6%), and ensemble learning (10 studies, 32.3%). The median sample size was 37256 (range 200 to 659820) patients, and the median predictor was 16 (range 3 to 625). The accuracy of 29 studies ranged from 0.510 to 0.971. The sensitivity of 25 studies ranged from 0.037 to 1. The specificity of 24 studies ranged from 0.008 to 0.993. The AUC of 20 studies ranged from 0.500 to 0.972. The precision of 6 studies ranged from 0.549 to 1. All of the models were internally validated, and only one was externally validated.

CONCLUSIONS

Overall, compared with traditional statistical methods, the performance of ML models does not necessarily show any improvement, and this area of research still faces limitations related to a lack of data preprocessing steps, the excessive differences of sample feature selection, and issues related to validation. Further optimization of the performance of the proposed model is also needed in the future, which requires more standardization and subsequent validation.

Collapse