1
|
Chang TG, Park S, Schäffer AA, Jiang P, Ruppin E. Hallmarks of artificial intelligence contributions to precision oncology. NATURE CANCER 2025; 6:417-431. [PMID: 40055572 PMCID: PMC11957836 DOI: 10.1038/s43018-025-00917-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/04/2024] [Accepted: 01/21/2025] [Indexed: 03/29/2025]
Abstract
The integration of artificial intelligence (AI) into oncology promises to revolutionize cancer care. In this Review, we discuss ten AI hallmarks in precision oncology, organized into three groups: (1) cancer prevention and diagnosis, encompassing cancer screening, detection and profiling; (2) optimizing current treatments, including patient outcome prediction, treatment planning and monitoring, clinical trial design and matching, and developing response biomarkers; and (3) advancing new treatments by identifying treatment combinations, discovering cancer vulnerabilities and designing drugs. We also survey AI applications in interventional clinical trials and address key challenges to broader clinical adoption of AI: data quality and quantity, model accuracy, clinical relevance and patient benefit, proposing actionable solutions for each.
Collapse
Affiliation(s)
- Tian-Gen Chang
- Cancer Data Science Laboratory, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA.
| | - Seongyong Park
- Cancer Data Science Laboratory, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Alejandro A Schäffer
- Cancer Data Science Laboratory, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Peng Jiang
- Cancer Data Science Laboratory, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Eytan Ruppin
- Cancer Data Science Laboratory, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA.
| |
Collapse
|
2
|
Garwe T, Choi J. An introduction to clinical prediction models using logistic regression in acute care surgery research: Methodologic considerations and common pitfalls. J Trauma Acute Care Surg 2025:01586154-990000000-00923. [PMID: 40012096 DOI: 10.1097/ta.0000000000004584] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/28/2025]
Abstract
ABSTRACT Clinical prediction models can enhance timely clinical decision-making when appropriately developed and integrated within clinical workflows. A risk prediction model is typically a regression equation that uses patient risk factor data to estimate the probability of the presence of disease (diagnostic) or its future occurrence (prognostic). Risk prediction models are widely studied in the surgical literature and commonly developed using logistic regression. For a risk prediction model to be useful, it must balance statistical performance and clinical usefulness. This article provides a brief overview of the various methodologic issues to consider when developing or validating a risk prediction model and common pitfalls.
Collapse
Affiliation(s)
- Tabitha Garwe
- From the Department of Surgery (T.G.) and Department of Biostatistics and Epidemiology (T.G.), University of Oklahoma Health Sciences Center, Oklahoma City, Oklahoma; and Department of Surgery (J.C.), Stanford University, Stanford, California
| | | |
Collapse
|
3
|
Grau-Jurado P, Mostafaei S, Xu H, Mo M, Petek B, Kalar I, Naia L, Kele J, Maioli S, Pereira JB, Eriksdotter M, Chatterjee S, Garcia-Ptacek S. Medications and cognitive decline in Alzheimer's disease: Cohort cluster analysis of 15,428 patients. J Alzheimers Dis 2025; 103:931-940. [PMID: 39772858 DOI: 10.1177/13872877241307870] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2025]
Abstract
BACKGROUND Medications for comorbid conditions may affect cognition in Alzheimer's disease (AD). OBJECTIVE To explore the association between common medications and cognition, measured with the Mini-Mental State Examination. METHODS Cohort study including persons with AD from the Swedish Registry for Cognitive/Dementia Disorders (SveDem). Medications were included if they were used by ≥5% of patients (26 individual drugs). Each follow-up was analyzed independently by performing 100 Monte-Carlo simulations of two steps each 1) k-means clustering of patients according to Mini-Mental State Examination at follow-up and its decline since previous measure, and 2) Identification of medications presenting statistically significant differences in the proportion of users in the different clusters. RESULTS 15,428 patients (60.38% women) were studied. Four clusters were identified. Medications associated with the best cognition cluster (relative to the worse) were atorvastatin (point estimate 1.44 95% confidence interval [1.15-1.83] at first follow-up, simvastatin (1.41 [1.11-1.78] at second follow-up), warfarin (1.56 [1.22-2.01] first follow-up), zopiclone (1.35 [1.15-1.58], and metformin (2.08 [1.35-3.33] second follow-up. Oxazepam (0.60 [0.50-0.73] first follow-up), paracetamol (0.83 [0.73-0.95] first follow-up), cyanocobalamin, felodipine and furosemide were associated with the worst cluster. Cholinesterase inhibitors were associated with the best cognition clusters, whereas memantine appeared in the worse cognition clusters, consistent with its indication in moderate to severe dementia. CONCLUSIONS We performed unsupervised clustering to classify patients based on their current cognition and cognitive decline from previous testing. Atorvastatin, simvastatin, warfarin, metformin, and zopiclone presented a positive and statistically significant associations with cognition, while oxazepam, cyanocobalamin, felodipine, furosemide and paracetamol, were associated with the worst cluster.
Collapse
Affiliation(s)
- Pol Grau-Jurado
- Division of Clinical Geriatrics, Department of Neurobiology, Care Sciences and Society (NVS), Karolinska Institutet, Stockholm, Sweden
| | - Shayan Mostafaei
- Division of Clinical Geriatrics, Department of Neurobiology, Care Sciences and Society (NVS), Karolinska Institutet, Stockholm, Sweden
- Departmenet of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
| | - Hong Xu
- Division of Clinical Geriatrics, Department of Neurobiology, Care Sciences and Society (NVS), Karolinska Institutet, Stockholm, Sweden
| | - Minjia Mo
- Division of Clinical Geriatrics, Department of Neurobiology, Care Sciences and Society (NVS), Karolinska Institutet, Stockholm, Sweden
| | - Bojana Petek
- Division of Clinical Geriatrics, Department of Neurobiology, Care Sciences and Society (NVS), Karolinska Institutet, Stockholm, Sweden
- Division of Neurogeriatrics, Department of Neurobiology, Care Sciences and Society (NVS), Karolinska Institutet, Stockholm, Sweden
- Faculty of Medicine, University of Ljubljana, Ljubljana, Slovenia
- Clinical Institute of Genomic Medicine, University Medical Centre Ljubljana, Ljubljana, Slovenia
| | - Irena Kalar
- Division of Clinical Geriatrics, Department of Neurobiology, Care Sciences and Society (NVS), Karolinska Institutet, Stockholm, Sweden
- Faculty of Medicine, University of Ljubljana, Ljubljana, Slovenia
- Department of Neurology, University Medical Centre Ljubljana, Ljubljana, Slovenia
| | - Luana Naia
- Division of Neurogeriatrics, Department of Neurobiology, Care Sciences and Society (NVS), Karolinska Institutet, Stockholm, Sweden
| | - Julianna Kele
- Team Neurovascular Biology and Health, Clinical Immunology, Department of Laboratory Medicine, Karolinska Institutet, Stockholm, Sweden
| | - Silvia Maioli
- Division of Neurogeriatrics, Department of Neurobiology, Care Sciences and Society (NVS), Karolinska Institutet, Stockholm, Sweden
| | - Joana B Pereira
- Neuro Division, Department of Clinical Neurosciences, Karolinska Institutet, Stockholm, Sweden
| | - Maria Eriksdotter
- Division of Clinical Geriatrics, Department of Neurobiology, Care Sciences and Society (NVS), Karolinska Institutet, Stockholm, Sweden
- Aging and Inflammation Theme, Karolinska University Hospital, Stockholm, Sweden
| | - Saikat Chatterjee
- School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology, Stockholm, Sweden
| | - Sara Garcia-Ptacek
- Division of Clinical Geriatrics, Department of Neurobiology, Care Sciences and Society (NVS), Karolinska Institutet, Stockholm, Sweden
- Aging and Inflammation Theme, Karolinska University Hospital, Stockholm, Sweden
| |
Collapse
|
4
|
Mohammed M, Zainal H, Ong SC, Tangiisuran B, Aziz FA, Sidek NN, Sha'aban A, Ibrahim UI, Muhammad S, Looi I, Aziz ZA. Prognostic Models of Mortality Following First-Ever Acute Ischemic Stroke: A Population-Based Retrospective Cohort Study. Health Sci Rep 2025; 8:e70445. [PMID: 39957974 PMCID: PMC11825595 DOI: 10.1002/hsr2.70445] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2024] [Revised: 01/20/2025] [Accepted: 01/27/2025] [Indexed: 02/18/2025] Open
Abstract
Background and Aims There is a lack of population-based studies focusing on guideline-based prognostic models for stroke. This study aimed to develop and validate a prognostic model that predicts mortality following a first-ever acute ischemic stroke. Methods The study included 899 adult patients ( ≥ 18 years) with confirmed diagnosis of first-ever acute ischemic stroke enrolled in the Malaysian National Stroke Registry (NSR) from January 2009 to December 2019. The primary outcome was mortality within 90 days post-stroke (266 events [29.6%]). The prognostic model was developed using logistic regression (75%, n = 674) and internally validated (25%, n = 225). Model performance was assessed using discrimination (area under the curve (AUC]) and calibration (Hosmer-Lemeshow test [HL]). Results The final model includes factors associated with increased risk of mortality, such as age (adjusted odds ratio, aOR 1.06 [95% confidence interval, CI 1.03, 1.10; p < 0.001]), National Institutes of Health Stroke Scale (NIHSS) score aOR 1.08 (95% CI 1.08, 1.13; p = 0.004), and diabetes aOR 2.29 (95% CI 1.20, 4.37; p = 0.012). The protective factors were antiplatelet within 48 h. aOR 0.40 (95% CI 0.19, 0.81; p = 0.01), dysphagia screening aOR 0.30 (95% CI 0.15, 0.61; p = 0.001), antiplatelets upon discharge aOR 0.17 (95% CI 0.08, 0.35; p < 0.001), lipid-lowering therapy aOR 0.37 (95% CI 0.17, 0.82; p = 0.01), stroke education aOR 0.02 (95% CI 0.01, 0.05; p < 0.001) and rehabilitation aOR 0.08 (95% CI 0.04, 0.16; p < 0.001). The model demonstrated excellent performance (discrimination [AUC = 0.94] and calibration [HL, X 2 p = 0.63]). Conclusion The study developed a validated prognostic model that excellently predicts mortality after a first-ever acute ischemic stroke with potential clinical utility in acute stroke care decision-making. The predictors could be valuable for creating risk calculators and aiding healthcare providers and patients in making well-informed clinical decisions during the stroke care process.
Collapse
Affiliation(s)
| | - Hadzliana Zainal
- School of Pharmaceutical SciencesUniversiti Sains MalaysiaPulau PinangMalaysia
| | - Siew Chin Ong
- School of Pharmaceutical SciencesUniversiti Sains MalaysiaPulau PinangMalaysia
| | | | | | - Norsima N. Sidek
- Clinical Research CenterHospital Sultanah Nur ZahirahTerengganuMalaysia
| | | | - Umar Idris Ibrahim
- Faculty of PharmacyUniversiti Sultan Zainal AbidinKuala TerengganuMalaysia
| | | | - Irene Looi
- Clinical Research CenterHospital Seberang JayaPulau PinangMalaysia
| | - Zariah A. Aziz
- Clinical Research CenterHospital Sultanah Nur ZahirahTerengganuMalaysia
| |
Collapse
|
5
|
Pislar N, Gasljevic G, Matos E, Pilko G, Zgajnar J, Perhavec A. Predicting nodal response to neoadjuvant treatment in breast cancer with core biopsy biomarkers of tumor microenvironment using data mining. Breast Cancer Res Treat 2025; 210:87-94. [PMID: 39496911 PMCID: PMC11787214 DOI: 10.1007/s10549-024-07539-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2024] [Accepted: 10/22/2024] [Indexed: 11/06/2024]
Abstract
PURPOSE To generate a model for predicting nodal response to neoadjuvant systemic treatment (NAST) in biopsy-proven node-positive breast cancer patients (cN+) that incorporates tumor microenvironment (TME) characteristics and could be used for planning the axillary surgical staging procedure. METHODS Clinical and pathologic features were retrospectively collected for 437 patients. Core biopsy (CB) samples were reviewed for stromal content and tumor-infiltrating lymphocytes (TIL). Orange Datamining Toolbox was used for model generation and assessment. RESULTS 151/437 (34.6%) patients achieved nodal pCR (ypN0). The following 5 variables were included in the prediction model: ER, Her-2, grade, stroma content and TILs. After stratified tenfold cross-validation, the logistic regression algorithm achieved and area under the ROC curve (AUC) of 0.86 and F1 score of 0.72. Nomogram was used for visualization. CONCLUSIONS We developed a clinical tool to predict nodal pCR for cN+ patients after NAST that includes biomarkers of TME and achieves an AUC of 0.86 after tenfold cross-validation.
Collapse
Affiliation(s)
- Nina Pislar
- Department of Surgical Oncology, Institute of Oncology Ljubljana, Ljubljana, Slovenia
- Faculty of Medicine, University of Ljubljana, Ljubljana, Slovenia
| | - Gorana Gasljevic
- Department of Pathology, Institute of Oncology Ljubljana, Ljubljana, Slovenia
- Faculty of Medicine, University of Maribor, Maribor, Slovenia
| | - Erika Matos
- Faculty of Medicine, University of Ljubljana, Ljubljana, Slovenia
- Department of Medical Oncology, Institute of Oncology Ljubljana, Ljubljana, Slovenia
| | - Gasper Pilko
- Department of Surgical Oncology, Institute of Oncology Ljubljana, Ljubljana, Slovenia
- Faculty of Medicine, University of Ljubljana, Ljubljana, Slovenia
| | - Janez Zgajnar
- Department of Surgical Oncology, Institute of Oncology Ljubljana, Ljubljana, Slovenia
- Faculty of Medicine, University of Ljubljana, Ljubljana, Slovenia
| | - Andraz Perhavec
- Department of Surgical Oncology, Institute of Oncology Ljubljana, Ljubljana, Slovenia.
- Faculty of Medicine, University of Ljubljana, Ljubljana, Slovenia.
| |
Collapse
|
6
|
Nemlander E, Abedi E, Ljungman P, Hasselström J, Carlsson AC, Rosenblad A. The Stockholm early detection of cancer study (STEADY-CAN): rationale, design, data collection, and baseline characteristics for 2.7 million participants. Eur J Epidemiol 2025; 40:123-136. [PMID: 39755982 PMCID: PMC11799118 DOI: 10.1007/s10654-024-01192-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2024] [Accepted: 12/09/2024] [Indexed: 01/07/2025]
Abstract
The Stockholm Early Detection of Cancer Study (STEADY-CAN) cohort was established to investigate strategies for early cancer detection in a population-based context within Stockholm County, the capital region of Sweden. Utilising real-world data to explore cancer-related healthcare patterns and outcomes, the cohort links extensive clinical and laboratory data from both inpatient and outpatient care in the region. The dataset includes demographic information, detailed diagnostic codes, laboratory results, prescribed medications, and healthcare utilisation data. Since its inception, STEADY-CAN has collected longitudinal data on 2,732,005 individuals aged ≥ 18 years old living in or having access to health care in Stockholm County during the years 2011-2021. Focusing on cancer, the cohort includes 140,042 (5.1%) individuals with incident cancer and a control group of 2,591,963 (94.9%) cancer-free individuals. The cohort's diverse adult population enables robust analyses of early symptom detection, incidental findings, and the impact of comorbidities on cancer diagnoses. Utilizing the wide range of available laboratory data and clinical variables allow for advanced statistical analyses and adjustments for important confounding factors. The cohort's primary focus is to improve understanding of the early diagnostic phase of cancer, offering a crucial resource for studying cancer detection in clinical practice. Its comprehensive data collection provides unique opportunities for research into comorbidities and cancer outcomes, making the cohort a useful resource for ongoing cancer surveillance and public health strategies. The present study gives a detailed description of the rationale for creating the STEADY-CAN cohort, its design, the data collection procedure, and baseline characteristics of collected data.
Collapse
Affiliation(s)
- Elinor Nemlander
- Department of Neurobiology, Care Sciences and Society, Division of Family Medicine and Primary Care, Karolinska Institutet, Stockholm, Sweden.
- Academic Primary Health Care Centre, Region Stockholm, Stockholm, Sweden.
- Regional Cancer Centre Stockholm-Gotland, Region Stockholm, Stockholm, Sweden.
| | - Eliya Abedi
- Department of Neurobiology, Care Sciences and Society, Division of Family Medicine and Primary Care, Karolinska Institutet, Stockholm, Sweden
- Academic Primary Health Care Centre, Region Stockholm, Stockholm, Sweden
- Regional Cancer Centre Stockholm-Gotland, Region Stockholm, Stockholm, Sweden
| | - Per Ljungman
- Department of Cellular Therapy and Allogeneic Stem Cell Transplantation, Karolinska Comprehensive Cancer Centre, Karolinska University Hospital, and Division of Haematology, Department of Medicine Huddinge,, Karolinska Institutet, Stockholm, Sweden
| | - Jan Hasselström
- Department of Neurobiology, Care Sciences and Society, Division of Family Medicine and Primary Care, Karolinska Institutet, Stockholm, Sweden
- Academic Primary Health Care Centre, Region Stockholm, Stockholm, Sweden
| | - Axel C Carlsson
- Department of Neurobiology, Care Sciences and Society, Division of Family Medicine and Primary Care, Karolinska Institutet, Stockholm, Sweden
- Academic Primary Health Care Centre, Region Stockholm, Stockholm, Sweden
| | - Andreas Rosenblad
- Department of Neurobiology, Care Sciences and Society, Division of Family Medicine and Primary Care, Karolinska Institutet, Stockholm, Sweden
- Regional Cancer Centre Stockholm-Gotland, Region Stockholm, Stockholm, Sweden
- Department of Statistics, Uppsala University, Uppsala, Sweden
- Department of Medical Sciences, Division of Clinical Diabetology and Metabolism, Uppsala University, Uppsala, Sweden
| |
Collapse
|
7
|
Wojcik KM, Caswell-Jin JL, Wilson OWA, Schechter C, Kamil D, Kurian AW, Jayasekera J. The population-level effects of omitting chemotherapy guided by a 21-gene expression assay in node-positive breast cancer: a simulation modeling study. BMC Cancer 2024; 24:975. [PMID: 39118050 PMCID: PMC11308572 DOI: 10.1186/s12885-024-12719-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2024] [Accepted: 07/26/2024] [Indexed: 08/10/2024] Open
Abstract
BACKGROUND A recent trial showed that postmenopausal women diagnosed with hormone receptor-positive, human epidermal growth factor receptor-2 (HER2)-negative, lymph node-positive (1-3 nodes) breast cancer with a 21-gene recurrence score of ≤ 25 could safely omit chemotherapy. However, there are limited data on population-level long-term outcomes associated with omitting chemotherapy among diverse women seen in real-world practice. METHODS We adapted an established, validated simulation model to generate the joint distributions of population-level characteristics of women diagnosed with early-stage breast cancer in the U.S. Input parameters were derived from cancer registry, meta-analyses, and clinical trial data. The effects of omitting chemotherapy on 10-year distant recurrence-free survival, life-years, and quality adjusted life-years (QALYs) were modeled for premenopausal and postmenopausal women. QALYs were discounted at 3%. Results were evaluated for subgroups stratified by race and ethnicity. Sensitivity analyses included testing results across a range of inputs. The model was validated using the published RxPONDER trial data. RESULTS In premenopausal women, the 10-year distant recurrence-free survival rates were 85.3% with chemo-endocrine and 80.1% with endocrine therapy. The estimated life-years and QALYs gained with chemotherapy in premenopausal women were 2.1 and 0.6, respectively. There was no chemotherapy benefit in postmenopausal women. There was no variation in the absolute benefit of chemotherapy across racial or ethnic subgroups. However, there were differences in distant recurrence-free survival rates, life-years, and QALYs across groups. Sensitivity analysis showed similar results. The model closely replicated the RxPONDER trial. CONCLUSIONS Modeled population-level outcomes show a small chemotherapy benefit in premenopausal women, but no benefit among postmenopausal women. Simulation modeling provides a useful tool to extend trial data and evaluate population-level outcomes.
Collapse
Affiliation(s)
- Kaitlyn M Wojcik
- Health Equity and Decision Sciences Laboratory, National Institute on Minority Health and Health Disparities, Division of Intramural Research, National Institutes of Health, Bethesda, MD, USA
| | | | - Oliver W A Wilson
- Health Equity and Decision Sciences Laboratory, National Institute on Minority Health and Health Disparities, Division of Intramural Research, National Institutes of Health, Bethesda, MD, USA
| | - Clyde Schechter
- Department of Family & Social Medicine at Albert Einstein College of Medicine, Bronx, NY, USA
| | - Dalya Kamil
- Health Equity and Decision Sciences Laboratory, National Institute on Minority Health and Health Disparities, Division of Intramural Research, National Institutes of Health, Bethesda, MD, USA
| | - Allison W Kurian
- Department of Medicine, Stanford University School of Medicine, Stanford, CA, USA
- Department of Epidemiology and Population Health, Stanford University School of Medicine, Stanford, CA, USA
| | - Jinani Jayasekera
- Health Equity and Decision Sciences Laboratory, National Institute on Minority Health and Health Disparities, Division of Intramural Research, National Institutes of Health, Bethesda, MD, USA.
| |
Collapse
|
8
|
Izadi Z, Gianfrancesco M, Anastasiou C, Schmajuk G, Yazdany J. Development and validation of a risk scoring system to identify patients with lupus nephritis in electronic health record data. Lupus Sci Med 2024; 11:e001170. [PMID: 38769054 PMCID: PMC11110552 DOI: 10.1136/lupus-2024-001170] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2024] [Accepted: 04/20/2024] [Indexed: 05/22/2024]
Abstract
OBJECTIVE Accurate identification of lupus nephritis (LN) cases is essential for patient management, research and public health initiatives. However, LN diagnosis codes in electronic health records (EHRs) are underused, hindering efficient identification. We investigated the current performance of International Classification of Diseases (ICD) codes, 9th and 10th editions (ICD9/10), for identifying prevalent LN, and developed scoring systems to increase identification of LN that are adaptable to settings with and without LN ICD codes. METHODS Training and test sets derived from EHR data from a large health system. An external set comprised data from the EHR of a second large health system. Adults with ICD9/10 codes for SLE were included. LN cases were ascertained through manual chart reviews conducted by rheumatologists. Two definitions of LN were used: strict (definite LN) and inclusive (definite, potential or diagnostic uncertainty). Gradient boosting models including structured EHR fields were used for predictor selection. Two logistic regression-based scoring systems were developed ('LN-Code' included LN ICD codes and 'LN-No Code' did not), calibrated and validated using standard performance metrics. RESULTS A total of 4152 patients from University of California San Francisco Medical Center and 370 patients from Zuckerberg San Francisco General Hospital and Trauma Center met the eligibility criteria. Mean age was 50 years, 87% were female. LN diagnosis codes demonstrated low sensitivity (43-73%) but high specificity (92-97%). LN-Code achieved an area under the curve (AUC) of 0.93 and a sensitivity of 0.88 for identifying LN using the inclusive definition. LN-No Code reached an AUC of 0.91 and a sensitivity of 0.95 (0.97 for the strict definition). Both scoring systems had good external validity, calibration and performance across racial and ethnic groups. CONCLUSIONS This study quantified the underutilisation of LN diagnosis codes in EHRs and introduced two adaptable scoring systems to enhance LN identification. Further validation in diverse healthcare settings is essential to ensure their broader applicability.
Collapse
Affiliation(s)
- Zara Izadi
- University of California San Francisco, San Francisco, California, USA
| | | | | | - Gabriela Schmajuk
- University of California San Francisco, San Francisco, California, USA
- San Francisco VA Medical Center, San Francisco, California, USA
| | - Jinoos Yazdany
- University of California San Francisco, San Francisco, California, USA
| |
Collapse
|
9
|
Tesfie TK, Anlay DZ, Abie B, Chekol YM, Gelaw NB, Tebeje TM, Animut Y. Nomogram to predict risk of neonatal mortality among preterm neonates admitted with sepsis at University of Gondar Comprehensive Specialized Hospital: risk prediction model development and validation. BMC Pregnancy Childbirth 2024; 24:139. [PMID: 38360591 PMCID: PMC10868119 DOI: 10.1186/s12884-024-06306-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Accepted: 01/29/2024] [Indexed: 02/17/2024] Open
Abstract
BACKGROUND Mortality in premature neonates is a global public health problem. In developing countries, nearly 50% of preterm births ends with death. Sepsis is one of the major causes of death in preterm neonates. Risk prediction model for mortality in preterm septic neonates helps for directing the decision making process made by clinicians. OBJECTIVE We aimed to develop and validate nomogram for the prediction of neonatal mortality. Nomograms are tools which assist the clinical decision making process through early estimation of risks prompting early interventions. METHODS A three year retrospective follow up study was conducted at University of Gondar Comprehensive Specialized Hospital and a total of 603 preterm neonates with sepsis were included. Data was collected using KoboCollect and analyzed using STATA version 16 and R version 4.2.1. Lasso regression was used to select the most potent predictors and to minimize the problem of overfitting. Nomogram was developed using multivariable binary logistic regression analysis. Model performance was evaluated using discrimination and calibration. Internal model validation was done using bootstrapping. Net benefit of the nomogram was assessed through decision curve analysis (DCA) to assess the clinical relevance of the model. RESULT The nomogram was developed using nine predictors: gestational age, maternal history of premature rupture of membrane, hypoglycemia, respiratory distress syndrome, perinatal asphyxia, necrotizing enterocolitis, total bilirubin, platelet count and kangaroo-mother care. The model had discriminatory power of 96.7% (95% CI: 95.6, 97.9) and P-value of 0.165 in the calibration test before and after internal validation with brier score of 0.07. Based on the net benefit analysis the nomogram was found better than treat all and treat none conditions. CONCLUSION The developed nomogram can be used for individualized mortality risk prediction with excellent performance, better net benefit and have been found to be useful in clinical practice with contribution in preterm neonatal mortality reduction by giving better emphasis for those at high risk.
Collapse
Affiliation(s)
- Tigabu Kidie Tesfie
- Department of Epidemiology and Biostatistics, Institute of Public Health, College of Medicine and Health Sciences, University of Gondar, Gondar, Ethiopia.
| | - Degefaye Zelalem Anlay
- School of Nursing, College of Medicine and Health Sciences, University of Gondar, Gondar, Ethiopia
| | - Birhanu Abie
- Department of Pediatrics and Child Health, School of Medicine, College of Medicine and Health Sciences, University of Gondar, Gondar, Ethiopia
| | - Yazachew Moges Chekol
- Department of Health Information Technology, Mizan Aman College of Health Science, Mizan Aman, Ethiopia
| | - Negalgn Byadgie Gelaw
- Department of Public Health, Mizan Aman College of Health Science, Mizan Aman, Ethiopia
| | - Tsion Mulat Tebeje
- School of Public Health, College of Medicine and Health Sciences, Dilla University, Dilla, Ethiopia
| | - Yaregal Animut
- Department of Epidemiology and Biostatistics, Institute of Public Health, College of Medicine and Health Sciences, University of Gondar, Gondar, Ethiopia
| |
Collapse
|
10
|
Zhang J, Liu Y, Zhang C, Chen Y, Hu Y, Yang X, Liu W, Zhang W, Liu D, Song H. Predicting suicidal behavior in individuals with depression over 50 years of age: Evidence from the UK biobank. Digit Health 2024; 10:20552076241287450. [PMID: 39411544 PMCID: PMC11475109 DOI: 10.1177/20552076241287450] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2024] [Accepted: 09/10/2024] [Indexed: 10/19/2024] Open
Abstract
Objective To construct applicable models suitable for predicting the risk of suicidal behavior among individuals with depression, particularly on the progression from no history of suicidal behavior to suicide attempts, as well as from suicidal ideation to suicide attempts. Methods Based on a prospective cohort from the UK Biobank, a total of 55,139 individuals aged 50 and above with depression were enrolled in the study, among whom 29,528 exhibited suicidal behavior. Specifically, they were divided into control (25,611), suicidal ideation (24,361), and suicide attempt (5167) groups. Least absolute shrinkage and selection operator (LASSO) regression was used to identify a subset of important features for distinguishing suicidal ideation and suicide attempts. We used the Gradient Boosting Decision Tree (GBDT) algorithm with stratified 10-fold cross-validation and grid-search to construct the prediction models for suicidal ideation or suicide attempts. To address the dataset imbalance in classifying suicide attempts, we used random under-sampling. The SHapley Additive exPlanations (SHAP) were used to estimate the important variables in the GBDT model. Results Significant differences in sociodemographic, economic, lifestyle, and psychological factors were observed across the three groups. Each classifier optimally utilized 8-11 features. Overall, the algorithms predicting suicide attempts demonstrated slightly higher performance than those predicting suicidal ideation. The GBDT classifier achieved the highest accuracy, with AUROC scores of 0.914 for suicide attempts and 0.803 for suicidal ideation. Distinctive predictive factors were identified for each group: while depression's inherent characteristics crucially distinguished the suicidal ideation group from controls, some key predictors, including the age of depression onset and childhood trauma events, were identified for suicide attempts. Conclusions We established applicable machine learning-based models for predicting suicidal behavior, particularly suicide attempts, in individuals with depression, and clarified the differences in predictors between suicidal ideation and suicide attempts.
Collapse
Affiliation(s)
- Jian Zhang
- Mental Health Center, West China Hospital, Sichuan University, Chengdu,
China
- West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, China
| | - Yujun Liu
- West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, China
- Med-X Center for Informatics, Sichuan University, Chengdu, China
| | - Chao Zhang
- West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, China
- Med-X Center for Informatics, Sichuan University, Chengdu, China
| | - Yilong Chen
- West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, China
- Med-X Center for Informatics, Sichuan University, Chengdu, China
| | - Yao Hu
- West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, China
- Med-X Center for Informatics, Sichuan University, Chengdu, China
| | - Xiujia Yang
- University of Illinois at Urbana and Champaign, Urbana, IL, USA
| | - Wentao Liu
- West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, China
- Med-X Center for Informatics, Sichuan University, Chengdu, China
| | - Wei Zhang
- Mental Health Center, West China Hospital, Sichuan University, Chengdu,
China
- West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, China
- Med-X Center for Informatics, Sichuan University, Chengdu, China
| | - Di Liu
- West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, China
- Med-X Center for Informatics, Sichuan University, Chengdu, China
- Industrial Engineering, Pittsburgh Institute, Sichuan University, Chengdu, China
| | - Huan Song
- West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, China
- Med-X Center for Informatics, Sichuan University, Chengdu, China
- Center of Public Health Sciences, Faculty of Medicine, University of Iceland, Reykjavík, Iceland
| |
Collapse
|
11
|
Chen YL, Kraus SW, Freeman MJ, Freeman AJ. A Machine-Learning Approach to Assess Factors Associated With Hospitalization of Children and Youths in Psychiatric Crisis. Psychiatr Serv 2023; 74:943-949. [PMID: 36916060 DOI: 10.1176/appi.ps.20220201] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 03/16/2023]
Abstract
OBJECTIVE The authors used a machine-learning approach to model clinician decision making regarding psychiatric hospitalization of children and youths in crisis and to identify factors associated with the decision to hospitalize. METHODS Data consisted of 4,786 mobile crisis response team assessments of children and youths, ages 4.0-19.5 years (mean±SD=14.0±2.7 years, 56% female), in Nevada. The sample assessments were split into training and testing data sets. A random-forest machine-learning algorithm was used to identify variables related to the decision to hospitalize a child or youth after the crisis assessment. Results from the training sample were externally validated in the testing sample. RESULTS The random-forest model had good performance (area under the curve training sample=0.91, testing sample=0.92). Variables found to be important in the decision to hospitalize a child or youth were acute suicidality, followed by poor judgment or decision making, danger to others, impulsivity, runaway behavior, other risky behaviors, nonsuicidal self-injury, psychotic or depressive symptoms, sleep problems, oppositional behavior, poor functioning at home or with peers, depressive or schizophrenia spectrum disorders, and age. CONCLUSIONS In crisis settings, clinicians were found to mostly focus on acute factors that increased risk for danger to self or others (e.g., suicidality, poor judgment), current psychiatric symptoms (e.g., psychotic symptoms), and functioning (e.g., poor home functioning, problems with peer relationships) when deciding whether to hospitalize or stabilize a child or youth. To reduce psychiatric hospitalization, community-based services should target interventions to address these important factors associated with the need for a higher level of care among youths in psychiatric crisis.
Collapse
Affiliation(s)
- Yen-Ling Chen
- Department of Psychology, University of Nevada, Las Vegas, Las Vegas (Chen, Kraus); Boys and Girls Clubs of Southern Nevada, Las Vegas (M. J. Freeman); Inspiring Children Foundation, Las Vegas (A. J. Freeman)
| | - Shane W Kraus
- Department of Psychology, University of Nevada, Las Vegas, Las Vegas (Chen, Kraus); Boys and Girls Clubs of Southern Nevada, Las Vegas (M. J. Freeman); Inspiring Children Foundation, Las Vegas (A. J. Freeman)
| | - Megan J Freeman
- Department of Psychology, University of Nevada, Las Vegas, Las Vegas (Chen, Kraus); Boys and Girls Clubs of Southern Nevada, Las Vegas (M. J. Freeman); Inspiring Children Foundation, Las Vegas (A. J. Freeman)
| | - Andrew J Freeman
- Department of Psychology, University of Nevada, Las Vegas, Las Vegas (Chen, Kraus); Boys and Girls Clubs of Southern Nevada, Las Vegas (M. J. Freeman); Inspiring Children Foundation, Las Vegas (A. J. Freeman)
| |
Collapse
|
12
|
Chen SL, Chin SC, Chan KC, Ho CY. A Machine Learning Approach to Assess Patients with Deep Neck Infection Progression to Descending Mediastinitis: Preliminary Results. Diagnostics (Basel) 2023; 13:2736. [PMID: 37685275 PMCID: PMC10486957 DOI: 10.3390/diagnostics13172736] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2023] [Revised: 07/25/2023] [Accepted: 08/22/2023] [Indexed: 09/10/2023] Open
Abstract
BACKGROUND Deep neck infection (DNI) is a serious infectious disease, and descending mediastinitis is a fatal infection of the mediastinum. However, no study has applied artificial intelligence to assess progression to descending mediastinitis in DNI patients. Thus, we developed a model to assess the possible progression of DNI to descending mediastinitis. METHODS Between August 2017 and December 2022, 380 patients with DNI were enrolled; 75% of patients (n = 285) were assigned to the training group for validation, whereas the remaining 25% (n = 95) were assigned to the test group to determine the accuracy. The patients' clinical and computed tomography (CT) parameters were analyzed via the k-nearest neighbor method. The predicted and actual progression of DNI patients to descending mediastinitis were compared. RESULTS In the training and test groups, there was no statistical significance (all p > 0.05) noted at clinical variables (age, gender, chief complaint period, white blood cells, C-reactive protein, diabetes mellitus, and blood sugar), deep neck space (parapharyngeal, submandibular, retropharyngeal, and multiple spaces involved, ≥3), tracheostomy performance, imaging parameters (maximum diameter of abscess and nearest distance from abscess to level of sternum notch), or progression to mediastinitis. The model had a predictive accuracy of 82.11% (78/95 patients), with sensitivity and specificity of 41.67% and 87.95%, respectively. CONCLUSIONS Our model can assess the progression of DNI to descending mediastinitis depending on clinical and imaging parameters. It can be used to identify DNI patients who will benefit from prompt treatment.
Collapse
Affiliation(s)
- Shih-Lung Chen
- Department of Otorhinolaryngology & Head and Neck Surgery, Chang Gung Memorial Hospital, New Taipei City 333, Taiwan
- School of Medicine, Chang Gung University, Taoyuan 333, Taiwan
| | - Shy-Chyi Chin
- School of Medicine, Chang Gung University, Taoyuan 333, Taiwan
- Department of Medical Imaging and Intervention, Chang Gung Memorial Hospital, New Taipei City 333, Taiwan
| | - Kai-Chieh Chan
- Department of Otorhinolaryngology & Head and Neck Surgery, Chang Gung Memorial Hospital, New Taipei City 333, Taiwan
- School of Medicine, Chang Gung University, Taoyuan 333, Taiwan
| | - Chia-Ying Ho
- School of Medicine, Chang Gung University, Taoyuan 333, Taiwan
- Division of Chinese Internal Medicine, Center for Traditional Chinese Medicine, Chang Gung Memorial Hospital, Taoyuan 333, Taiwan
| |
Collapse
|
13
|
Timilsina M, Fey D, Buosi S, Janik A, Costabello L, Carcereny E, Abreu DR, Cobo M, Castro RL, Bernabé R, Minervini P, Torrente M, Provencio M, Nováček V. Synergy between imputed genetic pathway and clinical information for predicting recurrence in early stage non-small cell lung cancer. J Biomed Inform 2023; 144:104424. [PMID: 37352900 DOI: 10.1016/j.jbi.2023.104424] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2022] [Revised: 06/06/2023] [Accepted: 06/11/2023] [Indexed: 06/25/2023]
Abstract
OBJECTIVE Lung cancer exhibits unpredictable recurrence in low-stage tumors and variable responses to different therapeutic interventions. Predicting relapse in early-stage lung cancer can facilitate precision medicine and improve patient survivability. While existing machine learning models rely on clinical data, incorporating genomic information could enhance their efficiency. This study aims to impute and integrate specific types of genomic data with clinical data to improve the accuracy of machine learning models for predicting relapse in early-stage, non-small cell lung cancer patients. METHODS The study utilized a publicly available TCGA lung cancer cohort and imputed genetic pathway scores into the Spanish Lung Cancer Group (SLCG) data, specifically in 1348 early-stage patients. Initially, tumor recurrence was predicted without imputed pathway scores. Subsequently, the SLCG data were augmented with pathway scores imputed from TCGA. The integrative approach aimed to enhance relapse risk prediction performance. RESULTS The integrative approach achieved improved relapse risk prediction with the following evaluation metrics: an area under the precision-recall curve (PR-AUC) score of 0.75, an area under the ROC (ROC-AUC) score of 0.80, an F1 score of 0.61, and a Precision of 0.80. The prediction explanation model SHAP (SHapley Additive exPlanations) was employed to explain the machine learning model's predictions. CONCLUSION We conclude that our explainable predictive model is a promising tool for oncologists that addresses an unmet clinical need of post-treatment patient stratification based on the relapse risk while also improving the predictive power by incorporating proxy genomic data not available for specific patients.
Collapse
Affiliation(s)
- Mohan Timilsina
- Data Science Institute, Insight Centre for Data Analytics, University of Galway, Ireland.
| | - Dirk Fey
- Systems Biology Ireland, University College Dublin, Ireland.
| | - Samuele Buosi
- Data Science Institute, Insight Centre for Data Analytics, University of Galway, Ireland.
| | | | | | - Enric Carcereny
- Catalan Institute of Oncology, Hospital Universitari Germans Trias i Pujol, B-ARGO, IGTP, Badalona, Spain.
| | | | - Manuel Cobo
- Medical Oncology Intercenter Unit. Regional and Virgen de la Victoria University Hospitals. IBIMA. Málaga., Spain.
| | | | - Reyes Bernabé
- Hospital Universitario Virgen del Rocio, Sevilla, Spain.
| | | | - Maria Torrente
- Medical Oncology Department, Hospital Universitario Puerta de Hierro Majadahonda, Madrid, Spain.
| | - Mariano Provencio
- Medical Oncology Department, Hospital Universitario Puerta de Hierro Majadahonda, Madrid, Spain.
| | - Vít Nováček
- Data Science Institute, Insight Centre for Data Analytics, University of Galway, Ireland; Faculty of Informatics, Masaryk University Brno, Czech Republic; Masaryk Memorial Cancer Institute, Brno, Czech Republic.
| |
Collapse
|
14
|
Reyes-Santias F, García-García C, Aibar-Guzmán B, García-Campos A, Cordova-Arevalo O, Mendoza-Pintos M, Cinza-Sanjurjo S, Portela-Romero M, Mazón-Ramos P, Gonzalez-Juanatey JR. Cost Analysis of Magnetic Resonance Imaging and Computed Tomography in Cardiology: A Case Study of a University Hospital Complex in the Euro Region. Healthcare (Basel) 2023; 11:2084. [PMID: 37510526 PMCID: PMC10379578 DOI: 10.3390/healthcare11142084] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2023] [Revised: 07/12/2023] [Accepted: 07/17/2023] [Indexed: 07/30/2023] Open
Abstract
INTRODUCTION In recent years, several hospitals have incorporated MRI equipment managed directly by their cardiology departments. The aim of our work is to determine the total cost per test of both CT and MRI in the setting of a Cardiology Department of a tertiary hospital. MATERIALS AND METHODS The process followed for estimating the costs of CT and MRI tests consists of three phases: (1) Identification of the phases of the testing process; (2) Identification of the resources consumed in carrying out the tests; (3) Quantification and assessment of inputs. RESULTS MRI involves higher personnel (EUR 66.03 vs. EUR 49.17) and equipment (EUR 89.98 vs. EUR 33.73) costs, while CT consumes higher expenditures in consumables (EUR 93.28 vs. EUR 22.95) and overheads (EUR 1.64 vs. EUR 1.55). The total cost of performing each test is higher in MRI (EUR 180.60 vs. EUR 177.73). CONCLUSIONS We can conclude that the unit cost of each CT and MRI performed in that unit are EUR 177.73 and EUR 180.60, respectively, attributable to consumables in the case of CT and to amortization of equipment and staff time in the case of MRI.
Collapse
Affiliation(s)
- Francisco Reyes-Santias
- Servicio de Cardiología, Complejo Hospitalario Universitario de Santiago de Compostela, Choupana s/n, 15706 Santiago de Compostela, Spain
- Instituto de Investigación Sanitaria de Santiago de Compostela (IDIS), Choupana s/n, 15706 Santiago de Compostela, Spain
- Centro de Investigación Biomédica en Red-Enfermedades Cardiovasculares (CIBERCV), Av. Monforte de Lemos, 3-5. Pabellón 11. Planta 0, 28029 Madrid, Spain
- Department of Business, University of Vigo, 36310 Vigo, Spain
| | - Carlos García-García
- Department of Pharmacology, Pharmacy and Pharmaceutical Technology, R+D Pharma Group (GI-1645), Faculty of Pharmacy, Health Research Institute of Santiago de Compostela (IDIS), University of Santiago de Compostela, 15782 Santiago de Compostela, Spain
| | - Beatriz Aibar-Guzmán
- Departamento de Economía Financiera y Contabilidad, Facultad de Ciencias Económicas y Empresariales, Universidad de Santiago de Compostela, Av. Burgo, s/n, 15782 Santiago Compostela, Spain
| | - Ana García-Campos
- Servicio de Cardiología, Complejo Hospitalario Universitario de Santiago de Compostela, Choupana s/n, 15706 Santiago de Compostela, Spain
- Instituto de Investigación Sanitaria de Santiago de Compostela (IDIS), Choupana s/n, 15706 Santiago de Compostela, Spain
- Centro de Investigación Biomédica en Red-Enfermedades Cardiovasculares (CIBERCV), Av. Monforte de Lemos, 3-5. Pabellón 11. Planta 0, 28029 Madrid, Spain
| | | | | | - Sergio Cinza-Sanjurjo
- Instituto de Investigación Sanitaria de Santiago de Compostela (IDIS), Choupana s/n, 15706 Santiago de Compostela, Spain
- Centro de Investigación Biomédica en Red-Enfermedades Cardiovasculares (CIBERCV), Av. Monforte de Lemos, 3-5. Pabellón 11. Planta 0, 28029 Madrid, Spain
- CS Milladoiro, Área Sanitaria Integrada Santiago de Compostela, 15895 Travesía do Porto, Spain
| | - Manuel Portela-Romero
- Instituto de Investigación Sanitaria de Santiago de Compostela (IDIS), Choupana s/n, 15706 Santiago de Compostela, Spain
- Centro de Investigación Biomédica en Red-Enfermedades Cardiovasculares (CIBERCV), Av. Monforte de Lemos, 3-5. Pabellón 11. Planta 0, 28029 Madrid, Spain
- CS Concepción Arenal, Área Sanitaria Integrada Santiago de Compostela, Rúa de Santiago León de Caracas, 12, 15701 Santiago de Compostela, Spain
| | - Pilar Mazón-Ramos
- Servicio de Cardiología, Complejo Hospitalario Universitario de Santiago de Compostela, Choupana s/n, 15706 Santiago de Compostela, Spain
- Instituto de Investigación Sanitaria de Santiago de Compostela (IDIS), Choupana s/n, 15706 Santiago de Compostela, Spain
- Centro de Investigación Biomédica en Red-Enfermedades Cardiovasculares (CIBERCV), Av. Monforte de Lemos, 3-5. Pabellón 11. Planta 0, 28029 Madrid, Spain
| | - Jose Ramon Gonzalez-Juanatey
- Servicio de Cardiología, Complejo Hospitalario Universitario de Santiago de Compostela, Choupana s/n, 15706 Santiago de Compostela, Spain
- Instituto de Investigación Sanitaria de Santiago de Compostela (IDIS), Choupana s/n, 15706 Santiago de Compostela, Spain
- Centro de Investigación Biomédica en Red-Enfermedades Cardiovasculares (CIBERCV), Av. Monforte de Lemos, 3-5. Pabellón 11. Planta 0, 28029 Madrid, Spain
| |
Collapse
|
15
|
Wang B, Tian P, Sun Q, Zhang H, Han L, Zhu B. A novel, effective machine learning-based RNA editing profile for predicting the prognosis of lower-grade gliomas. Heliyon 2023; 9:e18075. [PMID: 37483735 PMCID: PMC10362151 DOI: 10.1016/j.heliyon.2023.e18075] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2023] [Revised: 07/02/2023] [Accepted: 07/05/2023] [Indexed: 07/25/2023] Open
Abstract
Patients with low-grade glioma (LGG) may survive for long time periods, but their tumors often progress to higher-grade lesions. Currently, no cure for LGG is available. A-to-I RNA editing accounts for nearly 90% of all RNA editing events in humans and plays a role in tumorigenesis in various cancers. However, little is known regarding its prognostic role in LGG. On the basis of The Cancer Genome Atlas (TCGA) data, we used LASSO and univariate Cox regression to construct an RNA editing site signature. The results derived from the TCGA dataset were further validated with Gene Expression Omnibus (GEO) and Chinese Glioma Genome Atlas (CGGA) datasets. Five machine learning algorithms (Decision Trees C5.0, XGboost, GBDT, Lightgbm, and Catboost) were used to confirm the prognosis associated with the RNA editing site signature. Finally, we explored immune function, immunotherapy, and potential therapeutic agents in the high- and low-risk groups by using multiple biological prediction websites. A total of 22,739 RNA editing sites were identified, and a signature model consisting of four RNA editing sites (PRKCSH|chr19:11561032, DSEL|chr18:65174489, UGGT1|chr2:128952084, and SOD2|chr6:160101723) was established. Cox regression analysis indicated that the RNA editing signature was an independent prognostic factor, according to the ROC curve (AUC = 0.823), and the nomogram model had good predictive power (C-index = 0.824). In addition, the predictive ability of the RNA editing signature was confirmed with the machine learning model. The sensitivity of PCI-34051 and Elephantin was significantly higher in the high-risk group than the low-risk group, thus potentially providing a marker to predict the effects of lung cancer drug treatment. RNA editing may serve as a novel survival prediction tool, thus offering hope for developing editing-based therapeutic strategies to combat LGG progression. In addition, this tool may help optimize survival risk assessment and individualized care for patients with low-grade gliomas.
Collapse
Affiliation(s)
- Boshen Wang
- Jiangsu Provincial Center for Disease Prevention and Control, Nanjing 210000, Jiangsu, China
- Key Laboratory of Environmental Medicine Engineering of Ministry of Education, School of Public Health, Southeast University, Nanjing 210009, Jiangsu, China
| | - Peijie Tian
- Department of Pathology, Weifang Medical University, China
| | - Qianyu Sun
- Key Laboratory of Environmental Medicine Engineering of Ministry of Education, School of Public Health, Southeast University, Nanjing 210009, Jiangsu, China
| | - Hengdong Zhang
- Jiangsu Provincial Center for Disease Prevention and Control, Nanjing 210000, Jiangsu, China
| | - Lei Han
- Jiangsu Provincial Center for Disease Prevention and Control, Nanjing 210000, Jiangsu, China
| | - Baoli Zhu
- Jiangsu Provincial Center for Disease Prevention and Control, Nanjing 210000, Jiangsu, China
- Key Laboratory of Environmental Medicine Engineering of Ministry of Education, School of Public Health, Southeast University, Nanjing 210009, Jiangsu, China
| |
Collapse
|
16
|
Liu YS, Thaliffdeen R, Han S, Park C. Use of machine learning to predict bladder cancer survival outcomes: a systematic literature review. Expert Rev Pharmacoecon Outcomes Res 2023; 23:761-771. [PMID: 37306511 DOI: 10.1080/14737167.2023.2224963] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2022] [Accepted: 06/09/2023] [Indexed: 06/13/2023]
Abstract
INTRODUCTION The objective of this systematic review is to summarize the use of machine learning (ML) in predicting overall survival (OS) in patients with bladder cancer. METHODS Search terms for bladder cancer, ML algorithms, and mortality were used to identify studies in PubMed and Web of Science as of February 2022. Notable inclusion/exclusion criteria contained the inclusion of studies that utilized patient-level datasets and exclusion of primary gene expression-related dataset studies. Study quality and bias were assessed using the International Journal of Medical Informatics (IJMEDI) checklist. RESULTS Of the 14 included studies, the most common algorithms were artificial neural networks (n = 8) and logistic regression (n = 4). Nine articles described missing data handling, with five articles removing patients with missing data entirely. With respect to feature selection, the most common sociodemographic variables were age (n = 9), gender (n = 9), and smoking status (n = 3), with clinical variables most commonly including tumor stage (n = 8), grade (n = 7), and lymph node involvement (n = 6). Most studies (n = 10) were of medium IJMEDI quality, with common areas of improvement being the descriptions of data preparation and deployment. CONCLUSIONS ML holds promise for optimizing bladder cancer care through accurate OS predictions, but challenges related to data processing, feature selection, and data source quality must be resolved to develop robust models. While this review is limited by its inability to compare models across studies, this systematic review will inform decision-making by various stakeholders to improve understanding of ML-based OS prediction in bladder cancer and foster interpretability of future models.
Collapse
Affiliation(s)
- Yi-Shao Liu
- College of Pharmacy, The University of Texas at Austin, 2409 University Ave, Austin, TX, USA
| | - Ryan Thaliffdeen
- College of Pharmacy, The University of Texas at Austin, 2409 University Ave, Austin, TX, USA
| | - Sola Han
- College of Pharmacy, The University of Texas at Austin, 2409 University Ave, Austin, TX, USA
| | - Chanhyun Park
- College of Pharmacy, The University of Texas at Austin, 2409 University Ave, Austin, TX, USA
| |
Collapse
|
17
|
Popović Krneta M, Šobić Šaranović D, Mijatović Teodorović L, Krajčinović N, Avramović N, Bojović Ž, Bukumirić Z, Marković I, Rajšić S, Djorović BB, Artiko V, Karličić M, Tanić M. Prediction of Cervical Lymph Node Metastasis in Clinically Node-Negative T1 and T2 Papillary Thyroid Carcinoma Using Supervised Machine Learning Approach. J Clin Med 2023; 12:jcm12113641. [PMID: 37297835 DOI: 10.3390/jcm12113641] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2023] [Revised: 05/19/2023] [Accepted: 05/22/2023] [Indexed: 06/12/2023] Open
Abstract
Papillary thyroid carcinoma (PTC) is generally considered an indolent cancer. However, patients with cervical lymph node metastasis (LNM) have a higher risk of local recurrence. This study evaluated and compared four machine learning (ML)-based classifiers to predict the presence of cervical LNM in clinically node-negative (cN0) T1 and T2 PTC patients. The algorithm was developed using clinicopathological data from 288 patients who underwent total thyroidectomy and prophylactic central neck dissection, with sentinel lymph node biopsy performed to identify lateral LNM. The final ML classifier was selected based on the highest specificity and the lowest degree of overfitting while maintaining a sensitivity of 95%. Among the models evaluated, the k-Nearest Neighbor (k-NN) classifier was found to be the best fit, with an area under the receiver operating characteristic curve of 0.72, and sensitivity, specificity, positive and negative predictive values, F1 and F2 scores of 98%, 27%, 56%, 93%, 72%, and 85%, respectively. A web application based on a sensitivity-optimized kNN classifier was also created to predict the potential of cervical LNM, allowing users to explore and potentially build upon the model. These findings suggest that ML can improve the prediction of LNM in cN0 T1 and T2 PTC patients, thereby aiding in individual treatment planning.
Collapse
Affiliation(s)
- Marina Popović Krneta
- Department of Nuclear Medicine, Institute for Oncology and Radiology of Serbia, 11 000 Belgrade, Serbia
| | - Dragana Šobić Šaranović
- Faculty of Medicine, University of Belgrade, 11 000 Belgrade, Serbia
- Center for Nuclear Medicine with PET, University Clinical Center of Serbia, 11 000 Belgrade, Serbia
| | - Ljiljana Mijatović Teodorović
- Department of Nuclear Medicine, Institute for Oncology and Radiology of Serbia, 11 000 Belgrade, Serbia
- Faculty of Medical Sciences, University of Kragujevac, 34 000 Kragujevac, Serbia
| | - Nemanja Krajčinović
- Department of Power, Electronics and Telecommunications, Faculty of Technical Sciences, University of Novi Sad, 21 000 Novi Sad, Serbia
| | - Nataša Avramović
- Department of Power, Electronics and Telecommunications, Faculty of Technical Sciences, University of Novi Sad, 21 000 Novi Sad, Serbia
| | - Živko Bojović
- Department of Power, Electronics and Telecommunications, Faculty of Technical Sciences, University of Novi Sad, 21 000 Novi Sad, Serbia
| | - Zoran Bukumirić
- Institute of Medical Statistics and Informatics, Faculty of Medicine, University of Belgrade, 11 000 Belgrade, Serbia
| | - Ivan Marković
- Faculty of Medicine, University of Belgrade, 11 000 Belgrade, Serbia
- Surgical Oncology Clinic, Institute for Oncology and Radiology of Serbia, 11 000 Belgrade, Serbia
| | - Saša Rajšić
- Department of Anesthesiology and Intensive Care Medicine, Medical University Innsbruck, 6020 Innsbruck, Austria
| | - Biljana Bazić Djorović
- Department of Nuclear Medicine, Institute for Oncology and Radiology of Serbia, 11 000 Belgrade, Serbia
| | - Vera Artiko
- Faculty of Medicine, University of Belgrade, 11 000 Belgrade, Serbia
- Center for Nuclear Medicine with PET, University Clinical Center of Serbia, 11 000 Belgrade, Serbia
| | - Mihajlo Karličić
- School of Electrical Engineering, University of Belgrade, 11 000 Belgrade, Serbia
| | - Miljana Tanić
- Department of Experimental Oncology, Institute for Oncology and Radiology of Serbia, 11 000 Belgrade, Serbia
- UCL Cancer Institute, London WC1E 6DD, UK
| |
Collapse
|
18
|
Brehon K, Carriere J, Churchill K, Loyola-Sanchez A, Papathanassoglou E, MacIsaac R, Tavakoli M, Ho C, Manhas KP. Evaluating Efficiency of a Provincial Telerehabilitation Service in Improving Access to Care During the COVID-19 Pandemic. Int J Telerehabil 2023; 15:e6523. [PMID: 38046552 PMCID: PMC10687995 DOI: 10.5195/ijt.2023.6523] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/05/2023] Open
Abstract
Scope Early in the COVID-19 pandemic, community rehabilitation stakeholders from a provincial health system designed a novel telerehabilitation service. The service provided wayfinding and self-management advice to individuals with musculoskeletal concerns, neurological conditions, or post-COVID-19 recovery needs. This study evaluated the efficiency of the service in improving access to care. Methodology We used multiple methods including secondary data analyses of call metrics, narrative analyses of clinical notes using artificial intelligence (AI) and machine learning (ML), and qualitative interviews. Conclusions Interviews revealed that the telerehabilitation service had the potential to positively impact access to rehabilitation during the COVID-19 pandemic, for individuals living rurally, and for individuals on wait lists. Call metric analyses revealed that efficiency may be enhanced if call handling time was reduced. AI/ML analyses found that pain was the most frequently-mentioned keyword in clinical notes, suggesting an area for additional telerehabilitation resources to ensure efficiency.
Collapse
Affiliation(s)
- Katelyn Brehon
- Department of Physical Therapy, University of Alberta, Edmonton, Alberta, Canada
| | - Jay Carriere
- Department of Electrical and Software Engineering, University of Calgary, Calgary, Alberta, Canada
| | - Katie Churchill
- Allied Health Professional Practice and Education, Alberta Health Services, Alberta, Canada
- Department of Occupational Therapy, University of Alberta, Edmonton, Alberta, Canada
| | | | - Elizabeth Papathanassoglou
- Neurosciences, Rehabilitation, and Vision Strategic Clinical Network, Alberta Health Services, Alberta, Canada
- Faculty of Nursing, University of Alberta, Edmonton, Alberta, Canada
| | - Rob MacIsaac
- Spinal Cord Injury Alberta, Edmonton, Alberta, Canada
| | - Mahdi Tavakoli
- Department of Electrical and Computer Engineering, University of Alberta, Edmonton, Alberta, Canada
| | - Chester Ho
- Department of Medicine, University of Alberta, Edmonton, Alberta, Canada
- Neurosciences, Rehabilitation, and Vision Strategic Clinical Network, Alberta Health Services, Alberta, Canada
| | - Kiran Pohar Manhas
- Neurosciences, Rehabilitation, and Vision Strategic Clinical Network, Alberta Health Services, Alberta, Canada
- Faculty of Nursing, University of Alberta, Edmonton, Alberta, Canada
- Department of Community Health Sciences, University of Calgary, Calgary, Alberta, Canada
| |
Collapse
|
19
|
Manuel Román-Belmonte J, De la Corte-Rodríguez H, Adriana Rodríguez-Damiani B, Carlos Rodríguez-Merchán E. Artificial Intelligence in Musculoskeletal Conditions. ARTIF INTELL 2023. [DOI: 10.5772/intechopen.110696] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/05/2023]
Abstract
Artificial intelligence (AI) refers to computer capabilities that resemble human intelligence. AI implies the ability to learn and perform tasks that have not been specifically programmed. Moreover, it is an iterative process involving the ability of computerized systems to capture information, transform it into knowledge, and process it to produce adaptive changes in the environment. A large labeled database is needed to train the AI system and generate a robust algorithm. Otherwise, the algorithm cannot be applied in a generalized way. AI can facilitate the interpretation and acquisition of radiological images. In addition, it can facilitate the detection of trauma injuries and assist in orthopedic and rehabilitative processes. The applications of AI in musculoskeletal conditions are promising and are likely to have a significant impact on the future management of these patients.
Collapse
|
20
|
Sheehy J, Rutledge H, Acharya UR, Loh HW, Gururajan R, Tao X, Zhou X, Li Y, Gurney T, Kondalsamy-Chennakesavan S. Gynecological cancer prognosis using machine learning techniques: A systematic review of last three decades (1990–2022). Artif Intell Med 2023; 139:102536. [PMID: 37100507 DOI: 10.1016/j.artmed.2023.102536] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2021] [Revised: 03/19/2023] [Accepted: 03/23/2023] [Indexed: 03/30/2023]
Abstract
OBJECTIVE Many Computer Aided Prognostic (CAP) systems based on machine learning techniques have been proposed in the field of oncology. The objective of this systematic review was to assess and critically appraise the methodologies and approaches used in predicting the prognosis of gynecological cancers using CAPs. METHODS Electronic databases were used to systematically search for studies utilizing machine learning methods in gynecological cancers. Study risk of bias (ROB) and applicability were assessed using the PROBAST tool. 139 studies met the inclusion criteria, of which 71 predicted outcomes for ovarian cancer patients, 41 predicted outcomes for cervical cancer patients, 28 predicted outcomes for uterine cancer patients, and 2 predicted outcomes for gynecological malignancies broadly. RESULTS Random forest (22.30 %) and support vector machine (21.58 %) classifiers were used most commonly. Use of clinicopathological, genomic and radiomic data as predictors was observed in 48.20 %, 51.08 % and 17.27 % of studies, respectively, with some studies using multiple modalities. 21.58 % of studies were externally validated. Twenty-three individual studies compared ML and non-ML methods. Study quality was highly variable and methodologies, statistical reporting and outcome measures were inconsistent, preventing generalized commentary or meta-analysis of performance outcomes. CONCLUSION There is significant variability in model development when prognosticating gynecological malignancies with respect to variable selection, machine learning (ML) methods and endpoint selection. This heterogeneity prevents meta-analysis and conclusions regarding the superiority of ML methods. Furthermore, PROBAST-mediated ROB and applicability analysis demonstrates concern for the translatability of existing models. This review identifies ways that this can be improved upon in future works to develop robust, clinically translatable models within this promising field.
Collapse
|
21
|
Scott-Fordsmand JJ, Amorim MJB. Using Machine Learning to make nanomaterials sustainable. THE SCIENCE OF THE TOTAL ENVIRONMENT 2023; 859:160303. [PMID: 36410486 DOI: 10.1016/j.scitotenv.2022.160303] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/22/2022] [Revised: 11/06/2022] [Accepted: 11/15/2022] [Indexed: 06/16/2023]
Abstract
Sustainable development is a key challenge for contemporary human societies; failure to achieve sustainability could threaten human survival. In this review article, we illustrate how Machine Learning (ML) could support more sustainable development, covering the basics of data gathering through each step of the Environmental Risk Assessment (ERA). The literature provides several examples showing how ML can be employed in most steps of a typical ERA.A key observation is that there are currently no clear guidance for using such autonomous technologies in ERAs or which standards/checks are required. Steering thus seems to be the most important task for supporting the use of ML in the ERA of nano- and smart-materials. Resources should be devoted to developing a strategy for implementing ML in ERA with a strong emphasis on data foundations, methodologies, and the related sensitivities/uncertainties. We should recognise historical errors and biases (e.g., in data) to avoid embedding them during ML programming.
Collapse
Affiliation(s)
| | - Mónica J B Amorim
- Department of Biology & CESAM, University of Aveiro, 3810-193 Aveiro, Portugal.
| |
Collapse
|
22
|
Burnett B, Zhou SM, Brophy S, Davies P, Ellis P, Kennedy J, Bandyopadhyay A, Parker M, Lyons RA. Machine Learning in Colorectal Cancer Risk Prediction from Routinely Collected Data: A Review. Diagnostics (Basel) 2023; 13:301. [PMID: 36673111 PMCID: PMC9858109 DOI: 10.3390/diagnostics13020301] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2022] [Revised: 01/05/2023] [Accepted: 01/07/2023] [Indexed: 01/15/2023] Open
Abstract
The inclusion of machine-learning-derived models in systematic reviews of risk prediction models for colorectal cancer is rare. Whilst such reviews have highlighted methodological issues and limited performance of the models included, it is unclear why machine-learning-derived models are absent and whether such models suffer similar methodological problems. This scoping review aims to identify machine-learning models, assess their methodology, and compare their performance with that found in previous reviews. A literature search of four databases was performed for colorectal cancer prediction and prognosis model publications that included at least one machine-learning model. A total of 14 publications were identified for inclusion in the scoping review. Data was extracted using an adapted CHARM checklist against which the models were benchmarked. The review found similar methodological problems with machine-learning models to that observed in systematic reviews for non-machine-learning models, although model performance was better. The inclusion of machine-learning models in systematic reviews is required, as they offer improved performance despite similar methodological omissions; however, to achieve this the methodological issues that affect many prediction models need to be addressed.
Collapse
Affiliation(s)
- Bruce Burnett
- Swansea University Medical School, Swansea SA2 8PP, UK
| | - Shang-Ming Zhou
- Faculty of Health, University of Plymouth, Plymouth PL4 8AA, UK
| | - Sinead Brophy
- Swansea University Medical School, Swansea SA2 8PP, UK
| | | | | | | | | | | | | |
Collapse
|
23
|
Kotsyfakis S, Iliaki-Giannakoudaki E, Anagnostopoulos A, Papadokostaki E, Giannakoudakis K, Goumenakis M, Kotsyfakis M. The application of machine learning to imaging in hematological oncology: A scoping review. Front Oncol 2022; 12:1080988. [PMID: 36605438 PMCID: PMC9808781 DOI: 10.3389/fonc.2022.1080988] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2022] [Accepted: 12/05/2022] [Indexed: 12/24/2022] Open
Abstract
Background Here, we conducted a scoping review to (i) establish which machine learning (ML) methods have been applied to hematological malignancy imaging; (ii) establish how ML is being applied to hematological cancer radiology; and (iii) identify addressable research gaps. Methods The review was conducted according to the Preferred Reporting Items for Systematic Reviews and Meta-Analysis Extension for Scoping Reviews guidelines. The inclusion criteria were (i) pediatric and adult patients with suspected or confirmed hematological malignancy undergoing imaging (population); (ii) any study using ML techniques to derive models using radiological images to apply to the clinical management of these patients (concept); and (iii) original research articles conducted in any setting globally (context). Quality Assessment of Diagnostic Accuracy Studies 2 criteria were used to assess diagnostic and segmentation studies, while the Newcastle-Ottawa scale was used to assess the quality of observational studies. Results Of 53 eligible studies, 33 applied diverse ML techniques to diagnose hematological malignancies or to differentiate them from other diseases, especially discriminating gliomas from primary central nervous system lymphomas (n=18); 11 applied ML to segmentation tasks, while 9 applied ML to prognostication or predicting therapeutic responses, especially for diffuse large B-cell lymphoma. All studies reported discrimination statistics, but no study calculated calibration statistics. Every diagnostic/segmentation study had a high risk of bias due to their case-control design; many studies failed to provide adequate details of the reference standard; and only a few studies used independent validation. Conclusion To deliver validated ML-based models to radiologists managing hematological malignancies, future studies should (i) adhere to standardized, high-quality reporting guidelines such as the Checklist for Artificial Intelligence in Medical Imaging; (ii) validate models in independent cohorts; (ii) standardize volume segmentation methods for segmentation tasks; (iv) establish comprehensive prospective studies that include different tumor grades, comparisons with radiologists, optimal imaging modalities, sequences, and planes; (v) include side-by-side comparisons of different methods; and (vi) include low- and middle-income countries in multicentric studies to enhance generalizability and reduce inequity.
Collapse
Affiliation(s)
| | | | | | | | | | | | - Michail Kotsyfakis
- Biology Center of the Czech Academy of Sciences, Budweis (Ceske Budejovice), Czechia
| |
Collapse
|
24
|
Butner JD, Dogra P, Chung C, Pasqualini R, Arap W, Lowengrub J, Cristini V, Wang Z. Mathematical modeling of cancer immunotherapy for personalized clinical translation. NATURE COMPUTATIONAL SCIENCE 2022; 2:785-796. [PMID: 38126024 PMCID: PMC10732566 DOI: 10.1038/s43588-022-00377-z] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/19/2022] [Accepted: 11/14/2022] [Indexed: 12/23/2023]
Abstract
Encouraging advances are being made in cancer immunotherapy modeling, especially in the key areas of developing personalized treatment strategies based on individual patient parameters, predicting treatment outcomes and optimizing immunotherapy synergy when used in combination with other treatment approaches. Here we present a focused review of the most recent mathematical modeling work on cancer immunotherapy with a focus on clinical translatability. It can be seen that this field is transitioning from pure basic science to applications that can make impactful differences in patients' lives. We discuss how researchers are integrating experimental and clinical data to fully inform models so that they can be applied for clinical predictions, and present the challenges that remain to be overcome if widespread clinical adaptation is to be realized. Lastly, we discuss the most promising future applications and areas that are expected to be the focus of extensive upcoming modeling studies.
Collapse
Affiliation(s)
- Joseph D. Butner
- Mathematics in Medicine Program, Houston Methodist Research Institute, Houston, TX, USA
| | - Prashant Dogra
- Mathematics in Medicine Program, Houston Methodist Research Institute, Houston, TX, USA
| | - Caroline Chung
- Department of Radiation Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Renata Pasqualini
- Rutgers Cancer Institute of New Jersey, Newark, NJ, USA
- Department of Radiation Oncology, Division of Cancer Biology, Rutgers New Jersey Medical School, Newark, NJ, USA
| | - Wadih Arap
- Rutgers Cancer Institute of New Jersey, Newark, NJ, USA
- Department of Medicine, Division of Hematology/Oncology, Rutgers New Jersey Medical School, Newark, NJ, USA
| | - John Lowengrub
- Department of Mathematics, University of California at Irvine, Irvine, CA, USA
| | - Vittorio Cristini
- Mathematics in Medicine Program, Houston Methodist Research Institute, Houston, TX, USA
- Neal Cancer Center, Houston Methodist Research Institute, Houston, TX, USA
- Department of Imaging Physics, University of Texas MD Anderson Cancer Center, Houston, TX, USA
- Physiology, Biophysics, and Systems Biology Program, Graduate School of Medical Sciences, Weill Cornell Medicine, New York, NY, USA
| | - Zhihui Wang
- Mathematics in Medicine Program, Houston Methodist Research Institute, Houston, TX, USA
- Neal Cancer Center, Houston Methodist Research Institute, Houston, TX, USA
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, USA
| |
Collapse
|
25
|
Hanis TM, Ruhaiyem NIR, Arifin WN, Haron J, Wan Abdul Rahman WF, Abdullah R, Musa KI. Over-the-Counter Breast Cancer Classification Using Machine Learning and Patient Registration Records. Diagnostics (Basel) 2022; 12:diagnostics12112826. [PMID: 36428886 PMCID: PMC9689364 DOI: 10.3390/diagnostics12112826] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2022] [Revised: 10/13/2022] [Accepted: 10/15/2022] [Indexed: 11/18/2022] Open
Abstract
This study aims to determine the feasibility of machine learning (ML) and patient registration record to be utilised to develop an over-the-counter (OTC) screening model for breast cancer risk estimation. Data were retrospectively collected from women who came to the Hospital Universiti Sains Malaysia, Malaysia for breast-related problems. Eight ML models were used: k-nearest neighbour (kNN), elastic-net logistic regression, multivariate adaptive regression splines, artificial neural network, partial least square, random forest, support vector machine (SVM), and extreme gradient boosting. Features utilised for the development of the screening models were limited to information in the patient registration form. The final model was evaluated in terms of performance across a mammographic density. Additionally, the feature importance of the final model was assessed using the model agnostic approach. kNN had the highest Youden J index, precision, and PR-AUC, while SVM had the highest F2 score. The kNN model was selected as the final model. The model had a balanced performance in terms of sensitivity, specificity, and PR-AUC across the mammographic density groups. The most important feature was the age at examination. In conclusion, this study showed that ML and patient registration information are feasible to be used as the OTC screening model for breast cancer.
Collapse
Affiliation(s)
- Tengku Muhammad Hanis
- Department of Community Medicine, School of Medical Sciences, Universiti Sains Malaysia, Kubang Kerian 16150, Kelantan, Malaysia
- Correspondence: (T.M.H.); (K.I.M.)
| | | | - Wan Nor Arifin
- Biostatistics and Research Methodology Unit, School of Medical Sciences, Universiti Sains Malaysia, Kubang Kerian 16150, Kelantan, Malaysia
| | - Juhara Haron
- Department of Radiology, School of Medical Sciences, Universiti Sains Malaysia, Kubang Kerian 16150, Kelantan, Malaysia
- Breast Cancer Awareness and Research Unit, Hospital Universiti Sains Malaysia, Kubang Kerian 16150, Kelantan, Malaysia
| | - Wan Faiziah Wan Abdul Rahman
- Breast Cancer Awareness and Research Unit, Hospital Universiti Sains Malaysia, Kubang Kerian 16150, Kelantan, Malaysia
- Department of Pathology, School of Medical Sciences, Universiti Sains Malaysia, Kubang Kerian 16150, Kelantan, Malaysia
| | - Rosni Abdullah
- School of Computer Sciences, Universiti Sains Malaysia, Gelugor 11800, Penang, Malaysia
| | - Kamarul Imran Musa
- Department of Community Medicine, School of Medical Sciences, Universiti Sains Malaysia, Kubang Kerian 16150, Kelantan, Malaysia
- Correspondence: (T.M.H.); (K.I.M.)
| |
Collapse
|
26
|
Bai Z, Bai Y, Fang C, Chen W. Oxidative stress-related patterns determination for establishment of prognostic models, and characteristics of tumor microenvironment infiltration. Front Surg 2022; 9:1013794. [PMID: 36386530 PMCID: PMC9665876 DOI: 10.3389/fsurg.2022.1013794] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2022] [Accepted: 10/14/2022] [Indexed: 12/02/2022] Open
Abstract
Oxidative stress-mediated excessive accumulation of ROS in the body destroys cell homeostasis and participates in various diseases. However, the relationship between oxidative stress-related genes (ORGs) and tumor microenvironment (TME) in gastric cancer remains poorly understood. For improving the treatment strategy of GC, it is necessary to explore the relationship among them. We describe the changes of ORGs in 732 gastric cancer samples from two data sets. The two different molecular subtypes revealed that the changes of ORGs were associated with clinical features, prognosis, and TME. Subsequently, the OE_score was related to RFS, as confirmed by the correlation between OE_score and TME, TMB, MSI, immunotherapy, stem cell analysis, chemotherapeutic drugs, etc. OE_score can be used as an independent predictive marker for the treatment and prognosis of gastric cancer. Further, a Norman diagram was established to improve clinical practicability. Our research showed a potential role of ORGs in clinical features, prognosis, and tumor microenvironment of gastric cancer. Our research findings broaden the understanding of gastric cancer ORGs as a potential target for individualized treatment of gastric cancer and a new direction to evaluate the prognosis.
Collapse
Affiliation(s)
- Zihao Bai
- Graduate Department, Shanxi Medical University, Taiyuan, China
| | - Yihua Bai
- Graduate Department, Shanxi Medical University, Taiyuan, China
| | - Changzhong Fang
- Graduate Department, Shanxi Medical University, Taiyuan, China
| | - Wenliang Chen
- Department of General Surgery, The 2nd Affiliated Hospital of Shanxi Medical University, Taiyuan, China,Correspondence: Wenliang Chen
| |
Collapse
|
27
|
Briggs E, de Kamps M, Hamilton W, Johnson O, McInerney CD, Neal RD. Machine Learning for Risk Prediction of Oesophago-Gastric Cancer in Primary Care: Comparison with Existing Risk-Assessment Tools. Cancers (Basel) 2022; 14:cancers14205023. [PMID: 36291807 PMCID: PMC9600097 DOI: 10.3390/cancers14205023] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2022] [Revised: 10/12/2022] [Accepted: 10/12/2022] [Indexed: 11/18/2022] Open
Abstract
Simple Summary Oesophago-gastric cancer is one of the commonest cancers worldwide, yet it can be particularly difficult to diagnose given that initial symptoms are often non-specific and routine screening is not available. Cancer risk-assessment tools, which calculate cancer risk based on symptoms and other risk factors present in the primary care record, can aid decisions on referrals for cancer investigations, facilitating earlier diagnosis. Diagnosing common cancers earlier could help improve survival rates. Using UK primary care electronic health record data, we compared five different machine learning techniques for probabilistic classification of cancer patients against a current widely used UK primary care cancer risk-assessment tool. The machine learning algorithms outperformed the current risk-assessment tool, with a higher overall accuracy and an ability to reasonably identify 11–25% more cancer patients. We conclude that machine-learning-based risk-assessment tools could help better identify suitable patients for further investigation and support earlier diagnosis. Abstract Oesophago-gastric cancer is difficult to diagnose in the early stages given its typical non-specific initial manifestation. We hypothesise that machine learning can improve upon the diagnostic performance of current primary care risk-assessment tools by using advanced analytical techniques to exploit the wealth of evidence available in the electronic health record. We used a primary care electronic health record dataset derived from the UK General Practice Research Database (7471 cases; 32,877 controls) and developed five probabilistic machine learning classifiers: Support Vector Machine, Random Forest, Logistic Regression, Naïve Bayes, and Extreme Gradient Boosted Decision Trees. Features included basic demographics, symptoms, and lab test results. The Logistic Regression, Support Vector Machine, and Extreme Gradient Boosted Decision Tree models achieved the highest performance in terms of accuracy and AUROC (0.89 accuracy, 0.87 AUROC), outperforming a current UK oesophago-gastric cancer risk-assessment tool (ogRAT). Machine learning also identified more cancer patients than the ogRAT: 11.0% more with little to no effect on false positives, or up to 25.0% more with a slight increase in false positives (for Logistic Regression, results threshold-dependent). Feature contribution estimates and individual prediction explanations indicated clinical relevance. We conclude that machine learning could improve primary care cancer risk-assessment tools, potentially helping clinicians to identify additional cancer cases earlier. This could, in turn, improve survival outcomes.
Collapse
Affiliation(s)
- Emma Briggs
- School of Computing, University of Leeds, Leeds LS2 9JT, UK
- Correspondence:
| | - Marc de Kamps
- School of Computing, University of Leeds, Leeds LS2 9JT, UK
- Leeds Institute for Data Analytics, University of Leeds, Leeds LS2 9NL, UK
- The Alan Turing Institute, London NW1 2DB, UK
| | - Willie Hamilton
- Department of Health and Community Sciences, University of Exeter, Exeter EX1 2LU, UK
| | - Owen Johnson
- School of Computing, University of Leeds, Leeds LS2 9JT, UK
- Leeds Institute for Data Analytics, University of Leeds, Leeds LS2 9NL, UK
| | - Ciarán D. McInerney
- Academic Unit of Primary Medical Care, University of Sheffield, Sheffield S10 2TN, UK
| | - Richard D. Neal
- Department of Health and Community Sciences, University of Exeter, Exeter EX1 2LU, UK
| |
Collapse
|
28
|
Izadi Z, Gianfrancesco MA, Aguirre A, Strangfeld A, Mateus EF, Hyrich KL, Gossec L, Carmona L, Lawson‐Tovey S, Kearsley‐Fleet L, Schaefer M, Seet AM, Schmajuk G, Jacobsohn L, Katz P, Rush S, Al‐Emadi S, Sparks JA, Hsu TY, Patel NJ, Wise L, Gilbert E, Duarte‐García A, Valenzuela‐Almada MO, Ugarte‐Gil MF, Ribeiro SLE, de Oliveira Marinho A, de Azevedo Valadares LD, Giuseppe DD, Hasseli R, Richter JG, Pfeil A, Schmeiser T, Isnardi CA, Reyes Torres AA, Alle G, Saurit V, Zanetti A, Carrara G, Labreuche J, Barnetche T, Herasse M, Plassart S, Santos MJ, Rodrigues AM, Robinson PC, Machado PM, Sirotich E, Liew JW, Hausmann JS, Sufka P, Grainger R, Bhana S, Costello W, Wallace ZS, Yazdany J. Development of a Prediction Model for COVID-19 Acute Respiratory Distress Syndrome in Patients With Rheumatic Diseases: Results From the Global Rheumatology Alliance Registry. ACR Open Rheumatol 2022; 4:872-882. [PMID: 35869686 PMCID: PMC9350083 DOI: 10.1002/acr2.11481] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2022] [Accepted: 05/31/2022] [Indexed: 11/09/2022] Open
Abstract
OBJECTIVE Some patients with rheumatic diseases might be at higher risk for coronavirus disease 2019 (COVID-19) acute respiratory distress syndrome (ARDS). We aimed to develop a prediction model for COVID-19 ARDS in this population and to create a simple risk score calculator for use in clinical settings. METHODS Data were derived from the COVID-19 Global Rheumatology Alliance Registry from March 24, 2020, to May 12, 2021. Seven machine learning classifiers were trained on ARDS outcomes using 83 variables obtained at COVID-19 diagnosis. Predictive performance was assessed in a US test set and was validated in patients from four countries with independent registries using area under the curve (AUC), accuracy, sensitivity, and specificity. A simple risk score calculator was developed using a regression model incorporating the most influential predictors from the best performing classifier. RESULTS The study included 8633 patients from 74 countries, of whom 523 (6%) had ARDS. Gradient boosting had the highest mean AUC (0.78; 95% confidence interval [CI]: 0.67-0.88) and was considered the top performing classifier. Ten predictors were identified as key risk factors and were included in a regression model. The regression model that predicted ARDS with 71% (95% CI: 61%-83%) sensitivity in the test set, and with sensitivities ranging from 61% to 80% in countries with independent registries, was used to develop the risk score calculator. CONCLUSION We were able to predict ARDS with good sensitivity using information readily available at COVID-19 diagnosis. The proposed risk score calculator has the potential to guide risk stratification for treatments, such as monoclonal antibodies, that have potential to reduce COVID-19 disease progression.
Collapse
Affiliation(s)
| | | | | | | | - Elsa F. Mateus
- Portuguese League Against Rheumatic DiseasesLisbonPortugal
| | - Kimme L. Hyrich
- The University of Manchester and National Institute for Health Research Manchester Biomedical Research Centre, Manchester University and NHS Foundation TrustManchesterUK
| | - Laure Gossec
- INSERM, Sorbonne Universite and Hopital Universitaire Pitie Salpetriere, AP‐HPParisFrance
| | | | - Saskia Lawson‐Tovey
- The University of Manchester and National Institute for Health Research Manchester Biomedical Research Centre, Manchester University NHS Foundation Trust and Manchester Academic Health Science CentreManchesterUK
| | - Lianne Kearsley‐Fleet
- The University of Manchester and Manchester Academic Health Science CentreManchesterUK
| | | | | | - Gabriela Schmajuk
- University of CaliforniaSan Francisco and San Francisco Department of Veterans Affairs Medical Center
| | | | | | | | | | - Jeffrey A. Sparks
- Brigham and Women's Hospital and Harvard Medical SchoolBostonMassachusetts
| | - Tiffany Y‐T Hsu
- Brigham and Women's Hospital and Harvard Medical SchoolBostonMassachusetts
| | - Naomi J. Patel
- Massachusetts General Hospital and Harvard Medical SchoolBoston
| | - Leanna Wise
- University of Southern CaliforniaLos Angeles
| | | | | | | | - Manuel F. Ugarte‐Gil
- Universidad Científica del Sur and Hospital Nacional Guillermo Almenara IrigoyenEsSalud, LimaPeru
| | | | | | | | | | - Rebecca Hasseli
- Justus‐Liebig University Giessen, Campus KerckhoffGiessenGermany
| | | | - Alexander Pfeil
- Jena University Hospital and Friedrich Schiller University JenaJenaGermany
| | - Tim Schmeiser
- Rheumatology im Veedel (Private Practice)CologneGermany
| | | | | | | | | | - Anna Zanetti
- Italian Society for Rheumatology and University of Milano‐BicoccaMilanItaly
| | - Greta Carrara
- Italian Society for Rheumatology and University of Milano‐BicoccaMilanItaly
| | | | - Thomas Barnetche
- FHU ACRONIM, Centre for Autoimmune Systemic Rare Diseases, Bordeaux University HospitalBordeauxFrance
| | - Muriel Herasse
- Filière des Maladies Autoimmunes et Autoinflammatoires Rares, Hôpital Huriez, Centre Hospitalier Universitaire de LilleLilleFrance
| | - Samira Plassart
- Filière des Maladies Autoimmunes et Autoinflammatoires Rares, Hôpital Huriez, Centre Hospitalier Universitaire de LilleLilleFrance
| | - Maria José Santos
- Hospital Garcia de Orta, Almada, Portugal, and Instituto de Medicina Molecular Faculdade Medicina and Rheumatic Diseases Portuguese RegisterLisbonPortugal
| | - Ana Maria Rodrigues
- Rheumatic Diseases Portuguese Register, Sociedade Portuguesa de Reumatologia, Nova Medical School, and Hospital dos LusiadasLisbonPortugal
| | - Philip C. Robinson
- The University of Queensland, Brisbane, Queensland, Australia, and Royal Brisbane and Women's Hospital, Metro North Hospital and Health ServiceHerstonQueenslandAustralia
| | - Pedro M. Machado
- University College London, University College London Hospitals NHS Foundation Trust and Northwick Park Hospital, London North West University Healthcare NHS TrustLondonUK
| | - Emily Sirotich
- McMaster University, Hamilton, Ontario, Canada, and Canadian Arthritis Patient AllianceTorontoOntarioCanada
| | - Jean W. Liew
- Boston University School of MedicineBostonMassachusetts
| | - Jonathan S. Hausmann
- Beth Israel Deaconess Medical Center, Harvard Medical School and Boston Children's HospitalBostonMassachusetts
| | | | | | | | | | | | | | | |
Collapse
|
29
|
Roman-Belmonte JM, De la Corte-Rodriguez H, Rodriguez-Merchan EC, Vazquez-Sasot A, Rodriguez-Damiani BA, Resino-Luís C, Sanchez-Laguna F. The three horizons model applied to medical science. Postgrad Med 2022; 134:776-783. [DOI: 10.1080/00325481.2022.2124086] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/14/2022]
Affiliation(s)
- Juan M. Roman-Belmonte
- Department of Physical Medicine and Rehabilitation, Cruz Roja San José y Santa Adela University Hospital, Madrid, Spain
| | | | - E. Carlos Rodriguez-Merchan
- Department of Orthopedic Surgery, La Paz University Hospital, Madrid, Spain
- Osteoarticular Surgery Research, Hospital La Paz Institute for Health Research – IdiPAZ (La Paz University Hospital – Autonomous University of Madrid), Madrid, Spain
| | - Aranzazu Vazquez-Sasot
- Department of Physical Medicine and Rehabilitation, Cruz Roja San José y Santa Adela University Hospital, Madrid, Spain
| | - Beatriz A. Rodriguez-Damiani
- Department of Physical Medicine and Rehabilitation, Cruz Roja San José y Santa Adela University Hospital, Madrid, Spain
| | - Cristina Resino-Luís
- Department of Physical Medicine and Rehabilitation, Cruz Roja San José y Santa Adela University Hospital, Madrid, Spain
| | | |
Collapse
|
30
|
Lui TKL, Cheung KS, Leung WK. Machine learning models in the prediction of 1-year mortality in patients with advanced hepatocellular cancer on immunotherapy: a proof-of-concept study. Hepatol Int 2022; 16:879-891. [PMID: 35779202 DOI: 10.1007/s12072-022-10370-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/02/2021] [Accepted: 05/22/2022] [Indexed: 11/28/2022]
Abstract
INTRODUCTION Immunotherapy is a new promising treatment for patients with advanced hepatocellular carcinoma (HCC), but is costly and potentially associated with considerable side effects. This study aimed to evaluate the role of machine learning (ML) models in predicting the 1-year cancer-related mortality in advanced HCC patients treated with immunotherapy. METHOD 395 HCC patients who had received immunotherapy (including nivolumab, pembrolizumab or ipilimumab) between 2014 and 2019 in Hong Kong were included. The whole data sets were randomly divided into training (n = 316) and internal validation (n = 79) set. The data set, including 47 clinical variables, was used to construct six different ML models in predicting the risk of 1-year mortality. The performances of ML models were measured by the area under receiver operating characteristic curve (AUC) and their performances were compared with C-Reactive protein and Alpha Fetoprotein in ImmunoTherapY score (CRAFITY) and albumin-bilirubin (ALBI) score. The ML models were further validated with an external cohort between 2020 and 2021. RESULTS The 1-year cancer-related mortality was 51.1%. Of the six ML models, the random forest (RF) has the highest AUC of 0.92 (95% CI 0.87-0.98), which was better than logistic regression (0.82, p = 0.01) as well as the CRAFITY (0.68, p < 0.01) and ALBI score (0.84, p = 0.04). RF had the lowest false positive (2.0%) and false negative rate (5.2%), and performed better than CRAFITY score in the external validation cohort (0.91 vs 0.66, p < 0.01). High baseline AFP, bilirubin and alkaline phosphatase were three common risk factors identified by all ML models. CONCLUSION ML models could predict 1-year cancer-related mortality in HCC patients treated with immunotherapy, which may help to select patients who would benefit from this treatment.
Collapse
Affiliation(s)
- Thomas Ka Luen Lui
- Department of Medicine, University of Hong Kong, 4/F, Professorial Block, Queen Mary Hospital, 102 Pokfulam Road, Hong Kong, China
| | - Ka Shing Cheung
- Department of Medicine, University of Hong Kong, 4/F, Professorial Block, Queen Mary Hospital, 102 Pokfulam Road, Hong Kong, China
| | - Wai Keung Leung
- Department of Medicine, University of Hong Kong, 4/F, Professorial Block, Queen Mary Hospital, 102 Pokfulam Road, Hong Kong, China.
| |
Collapse
|
31
|
Exploring the Utility of Anonymized EHR Datasets in Machine Learning Experiments in the Context of the MODELHealth Project. APPLIED SCIENCES-BASEL 2022. [DOI: 10.3390/app12125942] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/10/2022]
Abstract
The object of this paper was the application of machine learning to a clinical dataset that was anonymized using the Mondrian algorithm. (1) Background: The preservation of patient privacy is a necessity rising from the increasing digitization of health data; however, the effect of data anonymization on the performance of machine learning models remains to be explored. (2) Methods: The original EHR derived dataset was subjected to anonymization by applying the Mondrian algorithm for various k values and quasi identifier (QI) set attributes. The logistic regression, decision trees, k-nearest neighbors, Gaussian naive Bayes and support vector machine models were applied to the different dataset versions. (3) Results: The classifiers demonstrated different degrees of resilience to the anonymization, with the decision tree and the KNN models showing remarkably stable performance, as opposed to the Gaussian naïve Bayes model. The choice of the QI set attributes and the generalized information loss value played a more important role than the size of the QI set or the k value. (4) Conclusions: Data anonymization can reduce the performance of certain machine learning models, although the appropriate selection of classifier and parameter values can mitigate this effect.
Collapse
|
32
|
Greenberg JK, Otun A, Ghogawala Z, Yen PY, Molina CA, Limbrick DD, Foraker RE, Kelly MP, Ray WZ. Translating Data Analytics Into Improved Spine Surgery Outcomes: A Roadmap for Biomedical Informatics Research in 2021. Global Spine J 2022; 12:952-963. [PMID: 33973491 PMCID: PMC9344511 DOI: 10.1177/21925682211008424] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open
Abstract
STUDY DESIGN Narrative review. OBJECTIVES There is growing interest in the use of biomedical informatics and data analytics tools in spine surgery. Yet despite the rapid growth in research on these topics, few analytic tools have been implemented in routine spine practice. The purpose of this review is to provide a health information technology (HIT) roadmap to help translate data assets and analytics tools into measurable advances in spine surgical care. METHODS We conducted a narrative review of PubMed and Google Scholar to identify publications discussing data assets, analytical approaches, and implementation strategies relevant to spine surgery practice. RESULTS A variety of data assets are available for spine research, ranging from commonly used datasets, such as administrative billing data, to emerging resources, such as mobile health and biobanks. Both regression and machine learning techniques are valuable for analyzing these assets, and researchers should recognize the particular strengths and weaknesses of each approach. Few studies have focused on the implementation of HIT, and a variety of methods exist to help translate analytic tools into clinically useful interventions. Finally, a number of HIT-related challenges must be recognized and addressed, including stakeholder acceptance, regulatory oversight, and ethical considerations. CONCLUSIONS Biomedical informatics has the potential to support the development of new HIT that can improve spine surgery quality and outcomes. By understanding the development life-cycle that includes identifying an appropriate data asset, selecting an analytic approach, and leveraging an effective implementation strategy, spine researchers can translate this potential into measurable advances in patient care.
Collapse
Affiliation(s)
- Jacob K. Greenberg
- Department of Neurological Surgery, Washington University School of Medicine,
St. Louis, MO, USA,Jacob K. Greenberg, Department of
Neurosurgery, Washington University School of Medicine, 660S. Euclid Ave., Box
8057, St. Louis, MO 63 110, USA.
| | - Ayodamola Otun
- Department of Neurological Surgery, Washington University School of Medicine,
St. Louis, MO, USA
| | - Zoher Ghogawala
- Department of Neurosurgery, Lahey Hospital and Medical Center, Burlington, MA, USA
| | - Po-Yin Yen
- Institute for Informatics, Washington University School of Medicine,
St. Louis, MO, USA
| | - Camilo A. Molina
- Department of Neurological Surgery, Washington University School of Medicine,
St. Louis, MO, USA
| | - David D. Limbrick
- Department of Neurological Surgery, Washington University School of Medicine,
St. Louis, MO, USA
| | - Randi E Foraker
- Institute for Informatics, Washington University School of Medicine,
St. Louis, MO, USA
| | - Michael P. Kelly
- Department of Orthopaedic Surgery, Washington University School of Medicine,
St. Louis, MO, USA
| | - Wilson Z. Ray
- Department of Neurological Surgery, Washington University School of Medicine,
St. Louis, MO, USA
| |
Collapse
|
33
|
Prediction of Trypanosoma evansi infection in dromedaries using artificial neural network (ANN). Vet Parasitol 2022; 306:109716. [DOI: 10.1016/j.vetpar.2022.109716] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2021] [Revised: 05/05/2022] [Accepted: 05/06/2022] [Indexed: 11/20/2022]
|
34
|
Kwong JC, Khondker A, Kim JK, Chua M, Keefe DT, Dos Santos J, Skreta M, Erdman L, D'Souza N, Selman AF, Weaver J, Weiss DA, Long C, Tasian G, Teoh CW, Rickard M, Lorenzo AJ. Posterior Urethral Valves Outcomes Prediction (PUVOP): a machine learning tool to predict clinically relevant outcomes in boys with posterior urethral valves. Pediatr Nephrol 2022; 37:1067-1074. [PMID: 34686914 DOI: 10.1007/s00467-021-05321-3] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/29/2021] [Revised: 09/11/2021] [Accepted: 09/28/2021] [Indexed: 10/20/2022]
Abstract
BACKGROUND Early kidney and anatomic features may be predictive of future progression and need for additional procedures in patients with posterior urethral valve (PUV). The objective of this study was to use machine learning (ML) to predict clinically relevant outcomes in these patients. METHODS Patients diagnosed with PUV with kidney function measurements at our institution between 2000 and 2020 were included. Pertinent clinical measures were abstracted, including estimated glomerular filtration rate (eGFR) at each visit, initial vesicoureteral reflux grade, and renal dysplasia at presentation. ML models were developed to predict clinically relevant outcomes: progression in CKD stage, initiation of kidney replacement therapy (KRT), and need for clean-intermittent catheterization (CIC). Model performance was assessed by concordance index (c-index) and the model was externally validated. RESULTS A total of 103 patients were included with a median follow-up of 5.7 years. Of these patients, 26 (25%) had CKD progression, 18 (17%) required KRT, and 32 (31%) were prescribed CIC. Additionally, 22 patients were included for external validation. The ML model predicted CKD progression (c-index = 0.77; external C-index = 0.78), KRT (c-index = 0.95; external C-index = 0.89) and indicated CIC (c-index = 0.70; external C-index = 0.64), and all performed better than Cox proportional-hazards regression. The models have been packaged into a simple easy-to-use tool, available at https://share.streamlit.io/jcckwong/puvop/main/app.py CONCLUSION: ML-based approaches for predicting clinically relevant outcomes in PUV are feasible. Further validation is warranted, but this implementable model can act as a decision-making aid. A higher resolution version of the Graphical abstract is available as Supplementary information.
Collapse
Affiliation(s)
- Jethro Cc Kwong
- Division of Urology, Department of Surgery, University of Toronto, Toronto, ON, Canada.,Division of Urology, Department of Surgery, Hospital for Sick Children, 555 University Avenue, Toronto, ON, M5G 1X8, Canada
| | - Adree Khondker
- Division of Urology, Department of Surgery, Hospital for Sick Children, 555 University Avenue, Toronto, ON, M5G 1X8, Canada.,Temerty Faculty of Medicine, University of Toronto, Toronto, ON, Canada
| | - Jin Kyu Kim
- Division of Urology, Department of Surgery, University of Toronto, Toronto, ON, Canada.,Division of Urology, Department of Surgery, Hospital for Sick Children, 555 University Avenue, Toronto, ON, M5G 1X8, Canada
| | - Michael Chua
- Division of Urology, Department of Surgery, Hospital for Sick Children, 555 University Avenue, Toronto, ON, M5G 1X8, Canada
| | - Daniel T Keefe
- Division of Urology, Department of Surgery, Hospital for Sick Children, 555 University Avenue, Toronto, ON, M5G 1X8, Canada
| | - Joana Dos Santos
- Division of Urology, Department of Surgery, Hospital for Sick Children, 555 University Avenue, Toronto, ON, M5G 1X8, Canada
| | - Marta Skreta
- Centre for Computational Medicine, The Hospital for Sick Children, Toronto, ON, Canada
| | - Lauren Erdman
- Centre for Computational Medicine, The Hospital for Sick Children, Toronto, ON, Canada
| | - Neeta D'Souza
- Division of Urology, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | | | - John Weaver
- Division of Urology, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Dana A Weiss
- Division of Urology, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Christopher Long
- Division of Urology, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Gregory Tasian
- Division of Urology, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Chia Wei Teoh
- Division of Nephrology, Hospital for Sick Children, Toronto, ON, Canada.,Department of Paediatrics, University of Toronto, Toronto, ON, Canada
| | - Mandy Rickard
- Division of Urology, Department of Surgery, Hospital for Sick Children, 555 University Avenue, Toronto, ON, M5G 1X8, Canada
| | - Armando J Lorenzo
- Division of Urology, Department of Surgery, University of Toronto, Toronto, ON, Canada. .,Division of Urology, Department of Surgery, Hospital for Sick Children, 555 University Avenue, Toronto, ON, M5G 1X8, Canada.
| |
Collapse
|
35
|
Appelbaum L, Kaplan ID, Palchuk MB, Kundrot S, Winer-Jones JP, Rinard M. Development and Experience with Cancer Risk Prediction Models Using Federated Databases and Electronic Health Records. Digit Health 2022. [DOI: 10.36255/exon-publications-digital-health-federated-databases] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
|
36
|
Lin KW, Ang TL, Li JW. Role of artificial intelligence in early detection and screening for pancreatic adenocarcinoma. Artif Intell Med Imaging 2022; 3:21-32. [DOI: 10.35711/aimi.v3.i2.21] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/16/2021] [Revised: 02/12/2022] [Accepted: 03/17/2022] [Indexed: 02/06/2023] Open
Abstract
Pancreatic adenocarcinoma remains to be one of the deadliest malignancies in the world despite treatment advancement over the past few decades. Its low survival rates and poor prognosis can be attributed to ambiguity in recommendations for screening and late symptom onset, contributing to its late presentation. In the recent years, artificial intelligence (AI) as emerged as a field to aid in the process of clinical decision making. Considerable efforts have been made in the realm of AI to screen for and predict future development of pancreatic ductal adenocarcinoma. This review discusses the use of AI in early detection and screening for pancreatic adenocarcinoma, and factors which may limit its use in a clinical setting.
Collapse
Affiliation(s)
- Kenneth Weicong Lin
- Department of Gastroenterology and Hepatology, Changi General Hospital, Singapore 529889, Singapore
| | - Tiing Leong Ang
- Department of Gastroenterology and Hepatology, Changi General Hospital, Singapore 529889, Singapore
| | - James Weiquan Li
- Department of Gastroenterology and Hepatology, Changi General Hospital, Singapore 529889, Singapore
| |
Collapse
|
37
|
A review on machine learning techniques for the assessment of image grading in breast mammogram. INT J MACH LEARN CYB 2022. [DOI: 10.1007/s13042-022-01546-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
38
|
Qiu H, Ding S, Liu J, Wang L, Wang X. Applications of Artificial Intelligence in Screening, Diagnosis, Treatment, and Prognosis of Colorectal Cancer. Curr Oncol 2022; 29:1773-1795. [PMID: 35323346 PMCID: PMC8947571 DOI: 10.3390/curroncol29030146] [Citation(s) in RCA: 31] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2022] [Revised: 02/28/2022] [Accepted: 03/03/2022] [Indexed: 12/29/2022] Open
Abstract
Colorectal cancer (CRC) is one of the most common cancers worldwide. Accurate early detection and diagnosis, comprehensive assessment of treatment response, and precise prediction of prognosis are essential to improve the patients’ survival rate. In recent years, due to the explosion of clinical and omics data, and groundbreaking research in machine learning, artificial intelligence (AI) has shown a great application potential in clinical field of CRC, providing new auxiliary approaches for clinicians to identify high-risk patients, select precise and personalized treatment plans, as well as to predict prognoses. This review comprehensively analyzes and summarizes the research progress and clinical application value of AI technologies in CRC screening, diagnosis, treatment, and prognosis, demonstrating the current status of the AI in the main clinical stages. The limitations, challenges, and future perspectives in the clinical implementation of AI are also discussed.
Collapse
Affiliation(s)
- Hang Qiu
- Big Data Research Center, University of Electronic Science and Technology of China, Chengdu 611731, China;
- School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China
- Correspondence: (H.Q.); (X.W.)
| | - Shuhan Ding
- School of Electrical and Computer Engineering, Cornell University, Ithaca, NY 14853, USA;
| | - Jianbo Liu
- West China School of Medicine, Sichuan University, Chengdu 610041, China;
- Department of Gastrointestinal Surgery, West China Hospital, Sichuan University, Chengdu 610041, China
| | - Liya Wang
- Big Data Research Center, University of Electronic Science and Technology of China, Chengdu 611731, China;
| | - Xiaodong Wang
- West China School of Medicine, Sichuan University, Chengdu 610041, China;
- Department of Gastrointestinal Surgery, West China Hospital, Sichuan University, Chengdu 610041, China
- Correspondence: (H.Q.); (X.W.)
| |
Collapse
|
39
|
Machine Learning-Based Risk Prediction of Critical Care Unit Admission for Advanced Stage High Grade Serous Ovarian Cancer Patients Undergoing Cytoreductive Surgery: The Leeds-Natal Score. J Clin Med 2021; 11:jcm11010087. [PMID: 35011828 PMCID: PMC8745521 DOI: 10.3390/jcm11010087] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2021] [Revised: 12/20/2021] [Accepted: 12/22/2021] [Indexed: 12/12/2022] Open
Abstract
Achieving complete surgical cytoreduction in advanced stage high grade serous ovarian cancer (HGSOC) patients warrants an availability of Critical Care Unit (CCU) beds. Machine Learning (ML) could be helpful in monitoring CCU admissions to improve standards of care. We aimed to improve the accuracy of predicting CCU admission in HGSOC patients by ML algorithms and developed an ML-based predictive score. A cohort of 291 advanced stage HGSOC patients with fully curated data was selected. Several linear and non-linear distances, and quadratic discriminant ML methods, were employed to derive prediction information for CCU admission. When all the variables were included in the model, the prediction accuracies were higher for linear discriminant (0.90) and quadratic discriminant (0.93) methods compared with conventional logistic regression (0.84). Feature selection identified pre-treatment albumin, surgical complexity score, estimated blood loss, operative time, and bowel resection with stoma as the most significant prediction features. The real-time prediction accuracy of the Graphical User Interface CCU calculator reached 95%. Limited, potentially modifiable, mostly intra-operative factors contributing to CCU admission were identified and suggest areas for targeted interventions. The accurate quantification of CCU admission patterns is critical information when counseling patients about peri-operative risks related to their cytoreductive surgery.
Collapse
|
40
|
Francisco ME, Carvajal TM, Ryo M, Nukazawa K, Amalin DM, Watanabe K. Dengue disease dynamics are modulated by the combined influences of precipitation and landscape: A machine learning approach. THE SCIENCE OF THE TOTAL ENVIRONMENT 2021; 792:148406. [PMID: 34157535 DOI: 10.1016/j.scitotenv.2021.148406] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/21/2021] [Revised: 05/25/2021] [Accepted: 06/08/2021] [Indexed: 06/13/2023]
Abstract
BACKGROUND Dengue is an endemic vector-borne disease influenced by environmental factors such as landscape and climate. Previous studies separately assessed the effects of landscape and climate factors on mosquito occurrence and dengue incidence. However, both factors concurrently coexist in time and space and can interact, affecting mosquito development and dengue disease transmission. For example, eggs laid in a suitable environment can hatch after being submerged in rain water. It has been difficult for conventional statistical modeling approaches to demonstrate these combined influences due to mathematical constraints. OBJECTIVES To investigate the combined influences of landscape and climate factors on mosquito occurrence and dengue incidence. METHODS Entomological, epidemiological, and landscape data from the rainy season (July-December) were obtained from respective government agencies in Metropolitan Manila, Philippines, from 2012 to 2014. Temperature, precipitation and vegetation data were obtained through remote sensing. A random forest algorithm was used to select the landscape and climate variables. Afterward, using the identified key variables, a model-based (MOB) recursive partitioning was implemented to test the combined influences of landscape and climate factors on ovitrap index (vector mosquito occurrence) and dengue incidence. RESULTS The MOB recursive partitioning for ovitrap index indicated a high sensitivity of vector mosquito occurrence on environmental conditions generated by a combination of high residential density areas with low precipitation. Moreover, the MOB recursive partitioning indicated high sensitivity of dengue incidence to the effects of precipitation in areas with high proportions of residential density and commercial areas. CONCLUSIONS Dengue dynamics are not solely influenced by individual effects of either climate or landscape, but rather by their synergistic or combined effects. The presented findings have the potential to target vector surveillance in areas identified as suitable for mosquito occurrence under specific climatic conditions and may be relevant as part of urban planning strategies to control dengue.
Collapse
Affiliation(s)
- Micanaldo Ernesto Francisco
- Center for Marine Environmental Studies (CMES), Ehime University, Matsuyama 790-8577, Japan; Graduate School of Science and Engineering, Ehime University, Matsuyama 790-8577, Japan
| | - Thaddeus M Carvajal
- Center for Marine Environmental Studies (CMES), Ehime University, Matsuyama 790-8577, Japan; Graduate School of Science and Engineering, Ehime University, Matsuyama 790-8577, Japan; Biology Department, De La Salle University, Taft Ave, Manila 1004, Philippines; Biological Control Research Unit, Center for Natural Science and Environmental Research, De La Salle University, Taft Ave, Manila, Philippines
| | - Masahiro Ryo
- Leibniz Centre for Agricultural Landscape Research (ZALF), Eberswalder Str. 84, 15374 Müncheberg, Germany; Environment and Natural Sciences, Brandenburg University of Technology Cottbus-Senftenberg, 03046 Cottbus, Germany
| | - Kei Nukazawa
- Department of Civil and Environmental Engineering, University of Miyazaki, Miyazaki 889-2192, Japan
| | - Divina M Amalin
- Biology Department, De La Salle University, Taft Ave, Manila 1004, Philippines; Biological Control Research Unit, Center for Natural Science and Environmental Research, De La Salle University, Taft Ave, Manila, Philippines
| | - Kozo Watanabe
- Center for Marine Environmental Studies (CMES), Ehime University, Matsuyama 790-8577, Japan; Graduate School of Science and Engineering, Ehime University, Matsuyama 790-8577, Japan; Biology Department, De La Salle University, Taft Ave, Manila 1004, Philippines; Biological Control Research Unit, Center for Natural Science and Environmental Research, De La Salle University, Taft Ave, Manila, Philippines.
| |
Collapse
|
41
|
MI-MOTE: Multiple imputation-based minority oversampling technique for imbalanced and incomplete data classification. Inf Sci (N Y) 2021. [DOI: 10.1016/j.ins.2021.06.043] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
|
42
|
Machine Learning in the Differentiation of Soft Tissue Neoplasms: Comparison of Fat-Suppressed T2WI and Apparent Diffusion Coefficient (ADC) Features-Based Models. J Digit Imaging 2021; 34:1146-1155. [PMID: 34545474 PMCID: PMC8554992 DOI: 10.1007/s10278-021-00513-7] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2021] [Revised: 08/18/2021] [Accepted: 08/22/2021] [Indexed: 12/26/2022] Open
Abstract
Machine learning has been widely used in the characterization of tumors recently. This article aims to explore the feasibility of the whole tumor fat-suppressed (FS) T2WI and ADC features-based least absolute shrinkage and selection operator (LASSO)-logistic predictive models in the differentiation of soft tissue neoplasms (STN). The clinical and MR findings of 160 cases with 161 histologically proven STN were reviewed, retrospectively, 75 with diffusion-weighted imaging (DWI with b values of 50, 400, and 800 s/mm2). They were divided into benign and malignant groups and further divided into training (70%) and validation (30%) cohorts. The MR FS T2WI and ADC features-based LASSO-logistic models were built and compared. The AUC of the FS T2WI features-based LASSO-logistic regression model for benign and malignant prediction was 0.65 and 0.75 for the training and validation cohorts. The model's sensitivity, specificity, and accuracy of the validation cohort were 55%, 96%, and 76.6%. While the AUC of the ADC features-based model was 0.932 and 0.955 for the training and validation cohorts. The model's sensitivity, specificity, and accuracy were 83.3%, 100%, and 91.7%. The performances of these models were also validated by decision curve analysis (DCA). The AUC of the whole tumor ADC features-based LASSO-logistic regression predictive model was larger than that of FS T2WI features (p = 0.017). The whole tumor fat-suppressed T2WI and ADC features-based LASSO-logistic predictive models both can serve as useful tools in the differentiation of STN. ADC features-based LASSO-logistic regression predictive model did better than that of FS T2WI features.
Collapse
|
43
|
Abdullah Alfayez A, Kunz H, Grace Lai A. Predicting the risk of cancer in adults using supervised machine learning: a scoping review. BMJ Open 2021; 11:e047755. [PMID: 34521662 PMCID: PMC8442074 DOI: 10.1136/bmjopen-2020-047755] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/20/2020] [Accepted: 09/01/2021] [Indexed: 12/20/2022] Open
Abstract
OBJECTIVES The purpose of this scoping review is to: (1) identify existing supervised machine learning (ML) approaches on the prediction of cancer in asymptomatic adults; (2) to compare the performance of ML models with each other and (3) to identify potential gaps in research. DESIGN Scoping review using the population, concept and context approach. SEARCH STRATEGY PubMed search engine was used from inception to 10 November 2020 to identify literature meeting following inclusion criteria: (1) a general adult (≥18 years) population, either sex, asymptomatic (population); (2) any study using ML techniques to derive predictive models for future cancer risk using clinical and/or demographic and/or basic laboratory data (concept) and (3) original research articles conducted in all settings in any region of the world (context). RESULTS The search returned 627 unique articles, of which 580 articles were excluded because they did not meet the inclusion criteria, were duplicates or were related to benign neoplasm. Full-text reviews were conducted for 47 articles and a final set of 10 articles were included in this scoping review. These 10 very heterogeneous studies used ML to predict future cancer risk in asymptomatic individuals. All studies reported area under the receiver operating characteristics curve (AUC) values as metrics of model performance, but no study reported measures of model calibration. CONCLUSIONS Research gaps that must be addressed in order to deliver validated ML-based models to assist clinical decision-making include: (1) establishing model generalisability through validation in independent cohorts, including those from low-income and middle-income countries; (2) establishing models for all cancer types; (3) thorough comparisons of ML models with best available clinical tools to ensure transparency of their potential clinical utility; (4) reporting of model calibration performance and (5) comparisons of different methods on the same cohort to reveal important information about model generalisability and performance.
Collapse
Affiliation(s)
- Asma Abdullah Alfayez
- Institute of Health Informatics, University College London, London, UK
- King Abdullah International Medical Research Center, Riyadh, Saudi Arabia
| | - Holger Kunz
- Institute of Health Informatics, University College London, London, UK
| | - Alvina Grace Lai
- Institute of Health Informatics, University College London, London, UK
| |
Collapse
|
44
|
Oei RW, Lyu Y, Ye L, Kong F, Du C, Zhai R, Xu T, Shen C, He X, Kong L, Hu C, Ying H. Progression-Free Survival Prediction in Patients with Nasopharyngeal Carcinoma after Intensity-Modulated Radiotherapy: Machine Learning vs. Traditional Statistics. J Pers Med 2021; 11:jpm11080787. [PMID: 34442430 PMCID: PMC8398698 DOI: 10.3390/jpm11080787] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2021] [Revised: 08/08/2021] [Accepted: 08/10/2021] [Indexed: 12/24/2022] Open
Abstract
Background: The Cox proportional hazards (CPH) model is the most commonly used statistical method for nasopharyngeal carcinoma (NPC) prognostication. Recently, machine learning (ML) models are increasingly adopted for this purpose. However, only a few studies have compared the performances between CPH and ML models. This study aimed at comparing CPH with two state-of-the-art ML algorithms, namely, conditional survival forest (CSF) and DeepSurv for disease progression prediction in NPC. Methods: From January 2010 to March 2013, 412 eligible NPC patients were reviewed. The entire dataset was split into training cohort and testing cohort in a ratio of 90%:10%. Ten features from patient-related, disease-related, and treatment-related data were used to train the models for progression-free survival (PFS) prediction. The model performance was compared using the concordance index (c-index), Brier score, and log-rank test based on the risk stratification results. Results: DeepSurv (c-index = 0.68, Brier score = 0.13, log-rank test p = 0.02) achieved the best performance compared to CSF (c-index = 0.63, Brier score = 0.14, log-rank test p = 0.38) and CPH (c-index = 0.57, Brier score = 0.15, log-rank test p = 0.81). Conclusions: Both CSF and DeepSurv outperformed CPH in our relatively small dataset. ML-based survival prediction may guide physicians in choosing the most suitable treatment strategy for NPC patients.
Collapse
Affiliation(s)
- Ronald Wihal Oei
- Department of Radiation Oncology, Fudan University Shanghai Cancer Center, Shanghai 200032, China; (R.W.O.); (Y.L.); (L.Y.); (F.K.); (C.D.); (R.Z.); (T.X.); (C.S.); (X.H.); (L.K.); (C.H.)
- Department of Oncology, Shanghai Medical College, Fudan University, Shanghai 200032, China
| | - Yingchen Lyu
- Department of Radiation Oncology, Fudan University Shanghai Cancer Center, Shanghai 200032, China; (R.W.O.); (Y.L.); (L.Y.); (F.K.); (C.D.); (R.Z.); (T.X.); (C.S.); (X.H.); (L.K.); (C.H.)
- Department of Oncology, Shanghai Medical College, Fudan University, Shanghai 200032, China
| | - Lulu Ye
- Department of Radiation Oncology, Fudan University Shanghai Cancer Center, Shanghai 200032, China; (R.W.O.); (Y.L.); (L.Y.); (F.K.); (C.D.); (R.Z.); (T.X.); (C.S.); (X.H.); (L.K.); (C.H.)
- Department of Oncology, Shanghai Medical College, Fudan University, Shanghai 200032, China
| | - Fangfang Kong
- Department of Radiation Oncology, Fudan University Shanghai Cancer Center, Shanghai 200032, China; (R.W.O.); (Y.L.); (L.Y.); (F.K.); (C.D.); (R.Z.); (T.X.); (C.S.); (X.H.); (L.K.); (C.H.)
- Department of Oncology, Shanghai Medical College, Fudan University, Shanghai 200032, China
| | - Chengrun Du
- Department of Radiation Oncology, Fudan University Shanghai Cancer Center, Shanghai 200032, China; (R.W.O.); (Y.L.); (L.Y.); (F.K.); (C.D.); (R.Z.); (T.X.); (C.S.); (X.H.); (L.K.); (C.H.)
- Department of Oncology, Shanghai Medical College, Fudan University, Shanghai 200032, China
| | - Ruiping Zhai
- Department of Radiation Oncology, Fudan University Shanghai Cancer Center, Shanghai 200032, China; (R.W.O.); (Y.L.); (L.Y.); (F.K.); (C.D.); (R.Z.); (T.X.); (C.S.); (X.H.); (L.K.); (C.H.)
- Department of Oncology, Shanghai Medical College, Fudan University, Shanghai 200032, China
| | - Tingting Xu
- Department of Radiation Oncology, Fudan University Shanghai Cancer Center, Shanghai 200032, China; (R.W.O.); (Y.L.); (L.Y.); (F.K.); (C.D.); (R.Z.); (T.X.); (C.S.); (X.H.); (L.K.); (C.H.)
- Department of Oncology, Shanghai Medical College, Fudan University, Shanghai 200032, China
| | - Chunying Shen
- Department of Radiation Oncology, Fudan University Shanghai Cancer Center, Shanghai 200032, China; (R.W.O.); (Y.L.); (L.Y.); (F.K.); (C.D.); (R.Z.); (T.X.); (C.S.); (X.H.); (L.K.); (C.H.)
- Department of Oncology, Shanghai Medical College, Fudan University, Shanghai 200032, China
| | - Xiayun He
- Department of Radiation Oncology, Fudan University Shanghai Cancer Center, Shanghai 200032, China; (R.W.O.); (Y.L.); (L.Y.); (F.K.); (C.D.); (R.Z.); (T.X.); (C.S.); (X.H.); (L.K.); (C.H.)
- Department of Oncology, Shanghai Medical College, Fudan University, Shanghai 200032, China
| | - Lin Kong
- Department of Radiation Oncology, Fudan University Shanghai Cancer Center, Shanghai 200032, China; (R.W.O.); (Y.L.); (L.Y.); (F.K.); (C.D.); (R.Z.); (T.X.); (C.S.); (X.H.); (L.K.); (C.H.)
- Department of Oncology, Shanghai Medical College, Fudan University, Shanghai 200032, China
| | - Chaosu Hu
- Department of Radiation Oncology, Fudan University Shanghai Cancer Center, Shanghai 200032, China; (R.W.O.); (Y.L.); (L.Y.); (F.K.); (C.D.); (R.Z.); (T.X.); (C.S.); (X.H.); (L.K.); (C.H.)
- Department of Oncology, Shanghai Medical College, Fudan University, Shanghai 200032, China
| | - Hongmei Ying
- Department of Radiation Oncology, Fudan University Shanghai Cancer Center, Shanghai 200032, China; (R.W.O.); (Y.L.); (L.Y.); (F.K.); (C.D.); (R.Z.); (T.X.); (C.S.); (X.H.); (L.K.); (C.H.)
- Department of Oncology, Shanghai Medical College, Fudan University, Shanghai 200032, China
- Correspondence: ; Tel.: +86-21-64175590; Fax: +86-21-6417477
| |
Collapse
|
45
|
Nordin N, Zainol Z, Mohd Noor MH, Lai Fong C. A comparative study of machine learning techniques for suicide attempts predictive model. Health Informatics J 2021; 27:1460458221989395. [PMID: 33745355 DOI: 10.1177/1460458221989395] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
Current suicide risk assessments for predicting suicide attempts are time consuming, of low predictive value and have inadequate reliability. This paper aims to develop a predictive model for suicide attempts among patients with depression using machine learning algorithms as well as presents a comparative study on single predictive models with ensemble predictive models for differentiating depressed patients with suicide attempts from non-suicide attempters. We applied and trained eight different machine learning algorithms using a dataset that consists of 75 patients diagnosed with a depressive disorder. A recursive feature elimination was used to reduce the features via three-fold cross validation. An ensemble predictive models outperformed the single predictive models. Voting and bagging revealed the highest accuracy of 92% compared to other machine learning algorithms. Our findings indicate that history of suicide attempt, religion, race, suicide ideation and severity of clinical depression are useful factors for prediction of suicide attempts.
Collapse
Affiliation(s)
| | | | | | - Chan Lai Fong
- National University of Malaysia Medical Centre, Malaysia
| |
Collapse
|
46
|
Chakraborty D, Ivan C, Amero P, Khan M, Rodriguez-Aguayo C, Başağaoğlu H, Lopez-Berestein G. Explainable Artificial Intelligence Reveals Novel Insight into Tumor Microenvironment Conditions Linked with Better Prognosis in Patients with Breast Cancer. Cancers (Basel) 2021; 13:3450. [PMID: 34298668 PMCID: PMC8303703 DOI: 10.3390/cancers13143450] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2021] [Revised: 07/06/2021] [Accepted: 07/06/2021] [Indexed: 12/29/2022] Open
Abstract
We investigated the data-driven relationship between immune cell composition in the tumor microenvironment (TME) and the ≥5-year survival rates of breast cancer patients using explainable artificial intelligence (XAI) models. We acquired TCGA breast invasive carcinoma data from the cbioPortal and retrieved immune cell composition estimates from bulk RNA sequencing data from TIMER2.0 based on EPIC, CIBERSORT, TIMER, and xCell computational methods. Novel insights derived from our XAI model showed that B cells, CD8+ T cells, M0 macrophages, and NK T cells are the most critical TME features for enhanced prognosis of breast cancer patients. Our XAI model also revealed the inflection points of these critical TME features, above or below which ≥5-year survival rates improve. Subsequently, we ascertained the conditional probabilities of ≥5-year survival under specific conditions inferred from the inflection points. In particular, the XAI models revealed that the B cell fraction (relative to all cells in a sample) exceeding 0.025, M0 macrophage fraction (relative to the total immune cell content) below 0.05, and NK T cell and CD8+ T cell fractions (based on cancer type-specific arbitrary units) above 0.075 and 0.25, respectively, in the TME could enhance the ≥5-year survival in breast cancer patients. The findings could lead to accurate clinical predictions and enhanced immunotherapies, and to the design of innovative strategies to reprogram the breast TME.
Collapse
Affiliation(s)
- Debaditya Chakraborty
- Department of Construction Science, The University of Texas at San Antonio, San Antonio, TX 78249, USA
| | - Cristina Ivan
- Department of Experimental Therapeutics, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA; (C.I.); (P.A.); (C.R.-A.); (G.L.-B.)
- Center for RNA Interference and Non-Coding RNA, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| | - Paola Amero
- Department of Experimental Therapeutics, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA; (C.I.); (P.A.); (C.R.-A.); (G.L.-B.)
| | - Maliha Khan
- Department of Lymphoma and Myeloma, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA;
| | - Cristian Rodriguez-Aguayo
- Department of Experimental Therapeutics, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA; (C.I.); (P.A.); (C.R.-A.); (G.L.-B.)
- Center for RNA Interference and Non-Coding RNA, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| | | | - Gabriel Lopez-Berestein
- Department of Experimental Therapeutics, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA; (C.I.); (P.A.); (C.R.-A.); (G.L.-B.)
- Center for RNA Interference and Non-Coding RNA, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| |
Collapse
|
47
|
A Classification Approach for Cancer Survivors from Those Cancer-Free, Based on Health Behaviors: Analysis of the Lifelines Cohort. Cancers (Basel) 2021; 13:cancers13102335. [PMID: 34066093 PMCID: PMC8151639 DOI: 10.3390/cancers13102335] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2021] [Revised: 05/06/2021] [Accepted: 05/07/2021] [Indexed: 12/29/2022] Open
Abstract
Simple Summary Health behaviors affect health status in cancer survivors. We aimed to identify such key health behaviors using nonlinear algorithms and compare their classification performance with logistic regression, for distinguishing cancer survivors from those cancer-free in a population-based cohort. We used health behaviors and socioeconomic factors for analysis. Participants from the Lifelines population-based cohort were binary classified as cancer survivors or cancer-free using nonlinear algorithms or logistic regression. Data were collected for 107,624 cancer-free participants and 2760 cancer survivors. Using all variables, algorithms obtained an area under the receiver operator curve (AUC) of 0.75 ± 0.01. Using only health behaviors, the algorithms differentiated cancer survivors from cancer-free participants with AUCs of 0.62 ± 0.01 and 0.60 ± 0.01, respectively. In the case–control analyses, both algorithms produced AUCs of 0.52 ± 0.01. The main distinctive classifier was age. No key health behaviors were identified by linear and nonlinear algorithms to differentiate cancer survivors from cancer-free participants. Abstract Health behaviors affect health status in cancer survivors. We hypothesized that nonlinear algorithms would identify distinct key health behaviors compared to a linear algorithm and better classify cancer survivors. We aimed to use three nonlinear algorithms to identify such key health behaviors and compare their performances with that of a logistic regression for distinguishing cancer survivors from those without cancer in a population-based cohort study. We used six health behaviors and three socioeconomic factors for analysis. Participants from the Lifelines population-based cohort were binary classified into a cancer-survivors group and a cancer-free group using either nonlinear algorithms or logistic regression, and their performances were compared by the area under the curve (AUC). In addition, we performed case–control analyses (matched by age, sex, and education level) to evaluate classification performance only by health behaviors. Data were collected for 107,624 cancer free participants and 2760 cancer survivors. Using all variables resulted an AUC of 0.75 ± 0.01, using only six health behaviors, the logistic regression and nonlinear algorithms differentiated cancer survivors from cancer-free participants with AUCs of 0.62 ± 0.01 and 0.60 ± 0.01, respectively. The main distinctive classifier was age. Though not relevant to classification, the main distinctive health behaviors were body mass index and alcohol consumption. In the case–control analyses, algorithms produced AUCs of 0.52 ± 0.01. No key health behaviors were identified by linear and nonlinear algorithms to differentiate cancer survivors from cancer-free participants in this population-based cohort.
Collapse
|
48
|
Gupta S, Gupta MK. A comprehensive data‐level investigation of cancer diagnosis on imbalanced data. Comput Intell 2021. [DOI: 10.1111/coin.12452] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
Affiliation(s)
- Surbhi Gupta
- School of Computer Science and Engineering Department Shri Mata Vaishno Devi University Katra Jammu and Kashmir India
| | - Manoj Kumar Gupta
- School of Computer Science and Engineering Department Shri Mata Vaishno Devi University Katra Jammu and Kashmir India
| |
Collapse
|
49
|
Prediction of Incident Cancers in the Lifelines Population-Based Cohort. Cancers (Basel) 2021; 13:cancers13092133. [PMID: 33925159 PMCID: PMC8125183 DOI: 10.3390/cancers13092133] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2021] [Accepted: 04/23/2021] [Indexed: 12/23/2022] Open
Abstract
Simple Summary The accurate prediction of incident cancers could be relevant to understanding and reducing cancer incidence. The aim of this study was to develop machine learning (ML) models that could predict an incident diagnosis of cancer. Data were available for 116,188 cancer-free participants and 4232 incident cancer cases. The main outcome was an incident cancer (excluding skin cancer) during follow-up assessment in a population-based cohort. The performance of three ML algorithms was evaluated using supervised binary classification to identify incident cancers among participants. An overall area under the receiver operator curve (AUC) < 0.75 was obtained; the highest AUC was for prostate cancer AUC > 0.80. Linear and non-linear ML algorithms including socioeconomic, lifestyle, and clinical variables produced a moderate predictive performance of incident cancers in the Lifelines cohort. Abstract Cancer incidence is rising, and accurate prediction of incident cancers could be relevant to understanding and reducing cancer incidence. The aim of this study was to develop machine learning (ML) models that could predict an incident diagnosis of cancer. Participants without any history of cancer within the Lifelines population-based cohort were followed for a median of 7 years. Data were available for 116,188 cancer-free participants and 4232 incident cancer cases. At baseline, socioeconomic, lifestyle, and clinical variables were assessed. The main outcome was an incident cancer during follow-up (excluding skin cancer), based on linkage with the national pathology registry. The performance of three ML algorithms was evaluated using supervised binary classification to identify incident cancers among participants. Elastic net regularization and Gini index were used for variables selection. An overall area under the receiver operator curve (AUC) <0.75 was obtained, the highest AUC value was for prostate cancer (random forest AUC = 0.82 (95% CI 0.77–0.87), logistic regression AUC = 0.81 (95% CI 0.76–0.86), and support vector machines AUC = 0.83 (95% CI 0.78–0.88), respectively); age was the most important predictor in these models. Linear and non-linear ML algorithms including socioeconomic, lifestyle, and clinical variables produced a moderate predictive performance of incident cancers in the Lifelines cohort.
Collapse
|
50
|
Li J, Zhou Z, Dong J, Fu Y, Li Y, Luan Z, Peng X. Predicting breast cancer 5-year survival using machine learning: A systematic review. PLoS One 2021; 16:e0250370. [PMID: 33861809 PMCID: PMC8051758 DOI: 10.1371/journal.pone.0250370] [Citation(s) in RCA: 42] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2021] [Accepted: 04/06/2021] [Indexed: 12/24/2022] Open
Abstract
BACKGROUND Accurately predicting the survival rate of breast cancer patients is a major issue for cancer researchers. Machine learning (ML) has attracted much attention with the hope that it could provide accurate results, but its modeling methods and prediction performance remain controversial. The aim of this systematic review is to identify and critically appraise current studies regarding the application of ML in predicting the 5-year survival rate of breast cancer. METHODS In accordance with the PRISMA guidelines, two researchers independently searched the PubMed (including MEDLINE), Embase, and Web of Science Core databases from inception to November 30, 2020. The search terms included breast neoplasms, survival, machine learning, and specific algorithm names. The included studies related to the use of ML to build a breast cancer survival prediction model and model performance that can be measured with the value of said verification results. The excluded studies in which the modeling process were not explained clearly and had incomplete information. The extracted information included literature information, database information, data preparation and modeling process information, model construction and performance evaluation information, and candidate predictor information. RESULTS Thirty-one studies that met the inclusion criteria were included, most of which were published after 2013. The most frequently used ML methods were decision trees (19 studies, 61.3%), artificial neural networks (18 studies, 58.1%), support vector machines (16 studies, 51.6%), and ensemble learning (10 studies, 32.3%). The median sample size was 37256 (range 200 to 659820) patients, and the median predictor was 16 (range 3 to 625). The accuracy of 29 studies ranged from 0.510 to 0.971. The sensitivity of 25 studies ranged from 0.037 to 1. The specificity of 24 studies ranged from 0.008 to 0.993. The AUC of 20 studies ranged from 0.500 to 0.972. The precision of 6 studies ranged from 0.549 to 1. All of the models were internally validated, and only one was externally validated. CONCLUSIONS Overall, compared with traditional statistical methods, the performance of ML models does not necessarily show any improvement, and this area of research still faces limitations related to a lack of data preprocessing steps, the excessive differences of sample feature selection, and issues related to validation. Further optimization of the performance of the proposed model is also needed in the future, which requires more standardization and subsequent validation.
Collapse
Affiliation(s)
- Jiaxin Li
- School of Nursing, Jilin University, Jilin, China
| | - Zijun Zhou
- Breast Surgery, Jilin Province Tumor Hospital, Jilin, China
| | - Jianyu Dong
- School of Nursing, Jilin University, Jilin, China
| | - Ying Fu
- School of Nursing, Jilin University, Jilin, China
| | - Yuan Li
- School of Nursing, Jilin University, Jilin, China
| | - Ze Luan
- School of Nursing, Jilin University, Jilin, China
| | - Xin Peng
- School of Nursing, Jilin University, Jilin, China
- * E-mail:
| |
Collapse
|