1
|
Zhang C, Peng J, Wang L, Wang Y, Chen W, Sun MW, Jiang H. A deep learning-powered diagnostic model for acute pancreatitis. BMC Med Imaging 2024; 24:154. [PMID: 38902660 PMCID: PMC11188273 DOI: 10.1186/s12880-024-01339-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2024] [Accepted: 06/17/2024] [Indexed: 06/22/2024] Open
Abstract
BACKGROUND Acute pancreatitis is one of the most common diseases requiring emergency surgery. Rapid and accurate recognition of acute pancreatitis can help improve clinical outcomes. This study aimed to develop a deep learning-powered diagnostic model for acute pancreatitis. MATERIALS AND METHODS In this investigation, we enrolled a cohort of 190 patients with acute pancreatitis who were admitted to Sichuan Provincial People's Hospital between January 2020 and December 2021. Abdominal computed tomography (CT) scans were obtained from both patients with acute pancreatitis and healthy individuals. Our model was constructed using two modules: (1) the acute pancreatitis classifier module; (2) the pancreatitis lesion segmentation module. Each model's performance was assessed based on precision, recall rate, F1-score, Area Under the Curve (AUC), loss rate, frequency-weighted accuracy (fwavacc), and Mean Intersection over Union (MIOU). RESULTS Upon admission, significant variations were observed between patients with mild and severe acute pancreatitis in inflammatory indexes, liver, and kidney function indicators, as well as coagulation parameters. The acute pancreatitis classifier module exhibited commendable diagnostic efficacy, showing an impressive AUC of 0.993 (95%CI: 0.978-0.999) in the test set (comprising healthy examination patients vs. those with acute pancreatitis, P < 0.001) and an AUC of 0.850 (95%CI: 0.790-0.898) in the external validation set (healthy examination patients vs. patients with acute pancreatitis, P < 0.001). Furthermore, the acute pancreatitis lesion segmentation module demonstrated exceptional performance in the validation set. For pancreas segmentation, peripancreatic inflammatory exudation, peripancreatic effusion, and peripancreatic abscess necrosis, the MIOU values were 86.02 (84.52, 87.20), 61.81 (56.25, 64.83), 57.73 (49.90, 68.23), and 66.36 (55.08, 72.12), respectively. These findings underscore the robustness and reliability of the developed models in accurately characterizing and assessing acute pancreatitis. CONCLUSION The diagnostic model for acute pancreatitis, driven by deep learning, exhibits excellent efficacy in accurately evaluating the severity of the condition. TRIAL REGISTRATION This is a retrospective study.
Collapse
Affiliation(s)
- Chi Zhang
- Department of Intensive Care Medicine, The Third People's Hospital of Chengdu, Affiliated Hospital of Southwest Jiaotong University, Chengdu, China
| | - Jin Peng
- Institute for Emergency and Disaster Medicine, School of Medicine, Sichuan Provincial People's Hospital, University of Electronic Science and Technology of China, Chengdu, China
- Department of Histology and Neuroscience, Sichuan University, Chengdu, China
| | - Lu Wang
- Institute for Emergency and Disaster Medicine, School of Medicine, Sichuan Provincial People's Hospital, University of Electronic Science and Technology of China, Chengdu, China
- Department of Emergency Medicine, School of Medicine, Sichuan Provincial People's Hospital, University of Electronic Science and Technology of China, Chengdu, China
- Sichuan Provincial Clinical Research Center for Emergency and Critical Care, School of Medicine, University of Electronic Science and Technology of China, Chengdu, China
| | - Yu Wang
- Sichuan Provincial Clinical Research Center for Emergency and Critical Care, School of Medicine, University of Electronic Science and Technology of China, Chengdu, China
- Department of Clinical Nutrition, Peking Union Medical College Hospital, Beijing, China
| | - Wei Chen
- Department of Clinical Nutrition, Peking Union Medical College Hospital, Beijing, China
| | - Ming-Wei Sun
- Institute for Emergency and Disaster Medicine, School of Medicine, Sichuan Provincial People's Hospital, University of Electronic Science and Technology of China, Chengdu, China
| | - Hua Jiang
- Institute for Emergency and Disaster Medicine, School of Medicine, Sichuan Provincial People's Hospital, University of Electronic Science and Technology of China, Chengdu, China.
- Department of Emergency Medicine, School of Medicine, Sichuan Provincial People's Hospital, University of Electronic Science and Technology of China, Chengdu, China.
- Sichuan Provincial Clinical Research Center for Emergency and Critical Care, School of Medicine, University of Electronic Science and Technology of China, Chengdu, China.
| |
Collapse
|
2
|
Chen T, Kabir MF. Explainable machine learning approach for cancer prediction through binarilization of RNA sequencing data. PLoS One 2024; 19:e0302947. [PMID: 38728288 PMCID: PMC11086842 DOI: 10.1371/journal.pone.0302947] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2023] [Accepted: 04/15/2024] [Indexed: 05/12/2024] Open
Abstract
In recent years, researchers have proven the effectiveness and speediness of machine learning-based cancer diagnosis models. However, it is difficult to explain the results generated by machine learning models, especially ones that utilized complex high-dimensional data like RNA sequencing data. In this study, we propose the binarilization technique as a novel way to treat RNA sequencing data and used it to construct explainable cancer prediction models. We tested our proposed data processing technique on five different models, namely neural network, random forest, xgboost, support vector machine, and decision tree, using four cancer datasets collected from the National Cancer Institute Genomic Data Commons. Since our datasets are imbalanced, we evaluated the performance of all models using metrics designed for imbalance performance like geometric mean, Matthews correlation coefficient, F-Measure, and area under the receiver operating characteristic curve. Our approach showed comparative performance while relying on less features. Additionally, we demonstrated that data binarilization offers higher explainability by revealing how each feature affects the prediction. These results demonstrate the potential of data binarilization technique in improving the performance and explainability of RNA sequencing based cancer prediction models.
Collapse
Affiliation(s)
- Tianjie Chen
- Department of Computer Science, Pennsylvania State University Harrisburg, Middletown, Pennsylvania, United States of America
| | - Md Faisal Kabir
- Department of Computer Science, Pennsylvania State University Harrisburg, Middletown, Pennsylvania, United States of America
| |
Collapse
|
3
|
Nicolis O, De Los Angeles D, Taramasco C. A contemporary review of breast cancer risk factors and the role of artificial intelligence. Front Oncol 2024; 14:1356014. [PMID: 38699635 PMCID: PMC11063273 DOI: 10.3389/fonc.2024.1356014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2023] [Accepted: 03/25/2024] [Indexed: 05/05/2024] Open
Abstract
Background Breast cancer continues to be a significant global health issue, necessitating advancements in prevention and early detection strategies. This review aims to assess and synthesize research conducted from 2020 to the present, focusing on breast cancer risk factors, including genetic, lifestyle, and environmental aspects, as well as the innovative role of artificial intelligence (AI) in prediction and diagnostics. Methods A comprehensive literature search, covering studies from 2020 to the present, was conducted to evaluate the diversity of breast cancer risk factors and the latest advances in Artificial Intelligence (AI) in this field. The review prioritized high-quality peer-reviewed research articles and meta-analyses. Results Our analysis reveals a complex interplay of genetic, lifestyle, and environmental risk factors for breast cancer, with significant variability across different populations. Furthermore, AI has emerged as a promising tool in enhancing the accuracy of breast cancer risk prediction and the personalization of prevention strategies. Conclusion The review highlights the necessity for personalized breast cancer prevention and detection approaches that account for individual risk factor profiles. It underscores the potential of AI to revolutionize these strategies, offering clear recommendations for future research directions and clinical practice improvements.
Collapse
Affiliation(s)
- Orietta Nicolis
- Engineering Faculty, Universidad Andres Bello, Viña del Mar, Chile
- Centro para la Prevención y Control del Cáncer (CECAN), Santiago, Chile
| | - Denisse De Los Angeles
- Engineering Faculty, Universidad Andres Bello, Viña del Mar, Chile
- Centro para la Prevención y Control del Cáncer (CECAN), Santiago, Chile
| | - Carla Taramasco
- Engineering Faculty, Universidad Andres Bello, Viña del Mar, Chile
- Centro para la Prevención y Control del Cáncer (CECAN), Santiago, Chile
| |
Collapse
|
4
|
Yang F, Xu Z, Wang H, Sun L, Zhai M, Zhang J. A hybrid feature selection algorithm combining information gain and grouping particle swarm optimization for cancer diagnosis. PLoS One 2024; 19:e0290332. [PMID: 38466662 PMCID: PMC10927139 DOI: 10.1371/journal.pone.0290332] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2023] [Accepted: 08/04/2023] [Indexed: 03/13/2024] Open
Abstract
BACKGROUND Cancer diagnosis based on machine learning has become a popular application direction. Support vector machine (SVM), as a classical machine learning algorithm, has been widely used in cancer diagnosis because of its advantages in high-dimensional and small sample data. However, due to the high-dimensional feature space and high feature redundancy of gene expression data, SVM faces the problem of poor classification effect when dealing with such data. METHODS Based on this, this paper proposes a hybrid feature selection algorithm combining information gain and grouping particle swarm optimization (IG-GPSO). The algorithm firstly calculates the information gain values of the features and ranks them in descending order according to the value. Then, ranked features are grouped according to the information index, so that the features in the group are close, and the features outside the group are sparse. Finally, grouped features are searched using grouping PSO and evaluated according to in-group and out-group. RESULTS Experimental results show that the average accuracy (ACC) of the SVM on the feature subset selected by the IG-GPSO is 98.50%, which is significantly better than the traditional feature selection algorithm. Compared with KNN, the classification effect of the feature subset selected by the IG-GPSO is still optimal. In addition, the results of multiple comparison tests show that the feature selection effect of the IG-GPSO is significantly better than that of traditional feature selection algorithms. CONCLUSION The feature subset selected by IG-GPSO not only has the best classification effect, but also has the least feature scale (FS). More importantly, the IG-GPSO significantly improves the ACC of SVM in cancer diagnostic.
Collapse
Affiliation(s)
- Fangyuan Yang
- Department of Gynecologic Oncology, The First Affiliated Hospital of Henan Polytechnic University, Jiaozuo, Henan, China
| | - Zhaozhao Xu
- School of Computer Science and Technology, Henan Polytechnic University, Jiaozuo, Henan, China
| | - Hong Wang
- Department of Gynecologic Oncology, The First Affiliated Hospital of Henan Polytechnic University, Jiaozuo, Henan, China
| | - Lisha Sun
- Department of Gynecologic Oncology, The First Affiliated Hospital of Henan Polytechnic University, Jiaozuo, Henan, China
| | - Mengjiao Zhai
- Department of Gynecologic Oncology, The First Affiliated Hospital of Henan Polytechnic University, Jiaozuo, Henan, China
| | - Juan Zhang
- Department of Gynecologic Oncology, The First Affiliated Hospital of Henan Polytechnic University, Jiaozuo, Henan, China
| |
Collapse
|
5
|
Hao J, Tiles-Sar N, Habtewold TD, Liemburg EJ, Bruggeman R, van der Meer L, Alizadeh BZ. Shaping tomorrow's support: baseline clinical characteristics predict later social functioning and quality of life in schizophrenia spectrum disorder. Soc Psychiatry Psychiatr Epidemiol 2024:10.1007/s00127-024-02630-4. [PMID: 38456932 DOI: 10.1007/s00127-024-02630-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/28/2023] [Accepted: 01/28/2024] [Indexed: 03/09/2024]
Abstract
PURPOSE We aimed to explore the multidimensional nature of social inclusion (mSI) among patients diagnosed with schizophrenia spectrum disorder (SSD), and to identify the predictors of 3-year mSI and the mSI prediction using traditional and data-driven approaches. METHODS We used the baseline and 3-year follow-up data of 1119 patients from the Genetic Risk and Outcome in Psychosis (GROUP) cohort in the Netherlands. The outcome mSI was defined as clusters derived from combined analyses of thirteen subscales from the Social Functioning Scale and the brief version of World Health Organization Quality of Life questionnaires through K-means clustering. Prediction models were built through multinomial logistic regression (ModelMLR) and random forest (ModelRF), internally validated via bootstrapping and compared by accuracy and the discriminability of mSI subgroups. RESULTS We identified five mSI subgroups: "very low (social functioning)/very low (quality of life)" (8.58%), "low/low" (12.87%), "high/low" (49.24%), "medium/high" (18.05%), and "high/high" (11.26%). The mSI was robustly predicted by a genetic predisposition for SSD, premorbid adjustment, positive, negative, and depressive symptoms, number of met needs, and baseline satisfaction with the environment and social life. The ModelRF (61.61% [54.90%, 68.01%]; P =0.013) was cautiously considered outperform the ModelMLR (59.16% [55.75%, 62.58%]; P =0.994). CONCLUSION We introduced and distinguished meaningful subgroups of mSI, which were modestly predictable from baseline clinical characteristics. A possibility for early prediction of mSI at the clinical stage may unlock the potential for faster and more impactful social support that is specifically tailored to the unique characteristics of the mSI subgroup to which a given patient belongs.
Collapse
Affiliation(s)
- Jiasi Hao
- Department of Epidemiology, University Medical Centre Groningen, University of Groningen, Hanzeplein 1, 9713 GZ, Groningen, The Netherlands.
| | - Natalia Tiles-Sar
- Department of Epidemiology, University Medical Centre Groningen, University of Groningen, Hanzeplein 1, 9713 GZ, Groningen, The Netherlands
- Department of Psychiatry, University Medical Centre Groningen, University Centre for Psychiatry, Rob Giel Research Centre, University of Groningen, Groningen, The Netherlands
| | - Tesfa Dejenie Habtewold
- Department of Epidemiology, University Medical Centre Groningen, University of Groningen, Hanzeplein 1, 9713 GZ, Groningen, The Netherlands
| | - Edith J Liemburg
- Department of Psychiatry, University Medical Centre Groningen, University Centre for Psychiatry, Rob Giel Research Centre, University of Groningen, Groningen, The Netherlands
| | - Richard Bruggeman
- Department of Psychiatry, University Medical Centre Groningen, University Centre for Psychiatry, Rob Giel Research Centre, University of Groningen, Groningen, The Netherlands
- Department of Clinical and Developmental Neuropsychology, Faculty of Behavioural and Social Sciences, University of Groningen, Groningen, The Netherlands
- Department of Rehabilitation, Lentis Psychiatric Institute, Zuidlaren, The Netherlands
| | - Lisette van der Meer
- Department of Clinical and Developmental Neuropsychology, Faculty of Behavioural and Social Sciences, University of Groningen, Groningen, The Netherlands
- Department of Rehabilitation, Lentis Psychiatric Institute, Zuidlaren, The Netherlands
| | - Behrooz Z Alizadeh
- Department of Epidemiology, University Medical Centre Groningen, University of Groningen, Hanzeplein 1, 9713 GZ, Groningen, The Netherlands.
- Department of Psychiatry, University Medical Centre Groningen, University Centre for Psychiatry, Rob Giel Research Centre, University of Groningen, Groningen, The Netherlands.
| |
Collapse
|
6
|
Kolasa K, Admassu B, Hołownia-Voloskova M, Kędzior KJ, Poirrier JE, Perni S. Systematic reviews of machine learning in healthcare: a literature review. Expert Rev Pharmacoecon Outcomes Res 2024; 24:63-115. [PMID: 37955147 DOI: 10.1080/14737167.2023.2279107] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2023] [Accepted: 10/31/2023] [Indexed: 11/14/2023]
Abstract
INTRODUCTION The increasing availability of data and computing power has made machine learning (ML) a viable approach to faster, more efficient healthcare delivery. METHODS A systematic literature review (SLR) of published SLRs evaluating ML applications in healthcare settings published between1 January 2010 and 27 March 2023 was conducted. RESULTS In total 220 SLRs covering 10,462 ML algorithms were reviewed. The main application of AI in medicine related to the clinical prediction and disease prognosis in oncology and neurology with the use of imaging data. Accuracy, specificity, and sensitivity were provided in 56%, 28%, and 25% SLRs respectively. Internal and external validation was reported in 53% and less than 1% of the cases respectively. The most common modeling approach was neural networks (2,454 ML algorithms), followed by support vector machine and random forest/decision trees (1,578 and 1,522 ML algorithms, respectively). EXPERT OPINION The review indicated considerable reporting gaps in terms of the ML's performance, both internal and external validation. Greater accessibility to healthcare data for developers can ensure the faster adoption of ML algorithms into clinical practice.
Collapse
Affiliation(s)
- Katarzyna Kolasa
- Division of Health Economics and Healthcare Management, Kozminski University, Warsaw, Poland
| | - Bisrat Admassu
- Division of Health Economics and Healthcare Management, Kozminski University, Warsaw, Poland
| | | | | | | | | |
Collapse
|
7
|
Shadbahr T, Roberts M, Stanczuk J, Gilbey J, Teare P, Dittmer S, Thorpe M, Torné RV, Sala E, Lió P, Patel M, Preller J, Rudd JHF, Mirtti T, Rannikko AS, Aston JAD, Tang J, Schönlieb CB. The impact of imputation quality on machine learning classifiers for datasets with missing values. COMMUNICATIONS MEDICINE 2023; 3:139. [PMID: 37803172 PMCID: PMC10558448 DOI: 10.1038/s43856-023-00356-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2022] [Accepted: 09/13/2023] [Indexed: 10/08/2023] Open
Abstract
BACKGROUND Classifying samples in incomplete datasets is a common aim for machine learning practitioners, but is non-trivial. Missing data is found in most real-world datasets and these missing values are typically imputed using established methods, followed by classification of the now complete samples. The focus of the machine learning researcher is to optimise the classifier's performance. METHODS We utilise three simulated and three real-world clinical datasets with different feature types and missingness patterns. Initially, we evaluate how the downstream classifier performance depends on the choice of classifier and imputation methods. We employ ANOVA to quantitatively evaluate how the choice of missingness rate, imputation method, and classifier method influences the performance. Additionally, we compare commonly used methods for assessing imputation quality and introduce a class of discrepancy scores based on the sliced Wasserstein distance. We also assess the stability of the imputations and the interpretability of model built on the imputed data. RESULTS The performance of the classifier is most affected by the percentage of missingness in the test data, with a considerable performance decline observed as the test missingness rate increases. We also show that the commonly used measures for assessing imputation quality tend to lead to imputed data which poorly matches the underlying data distribution, whereas our new class of discrepancy scores performs much better on this measure. Furthermore, we show that the interpretability of classifier models trained using poorly imputed data is compromised. CONCLUSIONS It is imperative to consider the quality of the imputation when performing downstream classification as the effects on the classifier can be considerable.
Collapse
Affiliation(s)
- Tolou Shadbahr
- Research Program in Systems Oncology, Faculty of Medicine, University of Helsinki, Helsinki, Finland
| | - Michael Roberts
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, UK.
- Data Science & Artificial Intelligence, AstraZeneca, Cambridge, UK.
| | - Jan Stanczuk
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, UK
| | - Julian Gilbey
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, UK
| | - Philip Teare
- Data Science & Artificial Intelligence, AstraZeneca, Cambridge, UK
| | - Sören Dittmer
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, UK
- ZeTeM, University of Bremen, Bremen, Germany
| | - Matthew Thorpe
- Department of Mathematics, University of Manchester, Manchester, UK
| | - Ramon Viñas Torné
- Department of Computer Science and Technology, University of Cambridge, Cambridge, UK
| | - Evis Sala
- Department of Radiology, University of Cambridge, Cambridge, UK
| | - Pietro Lió
- Department of Mathematics, University of Manchester, Manchester, UK
| | - Mishal Patel
- Data Science & Artificial Intelligence, AstraZeneca, Cambridge, UK
- Clinical Pharmacology & Safety Sciences, AstraZeneca, Cambridge, UK
| | - Jacobus Preller
- Addenbrooke's Hospital, Cambridge University Hospitals NHS Trust, Cambridge, UK
| | - James H F Rudd
- Department of Medicine, University of Cambridge, Cambridge, UK
| | - Tuomas Mirtti
- Research Program in Systems Oncology, Faculty of Medicine, University of Helsinki, Helsinki, Finland
- Department of Pathology, University of Helsinki and Helsinki University Hospital, Helsinki, Finland
- iCAN-Digital Precision Cancer Medicine Flagship, Helsinki, Finland
| | - Antti Sakari Rannikko
- Research Program in Systems Oncology, Faculty of Medicine, University of Helsinki, Helsinki, Finland
- iCAN-Digital Precision Cancer Medicine Flagship, Helsinki, Finland
- Department of Urology, University of Helsinki and Helsinki University Hospital, Helsinki, Finland
| | - John A D Aston
- Department of Pure Mathematics and Mathematical Statistics, University of Cambridge, Cambridge, UK
| | - Jing Tang
- Research Program in Systems Oncology, Faculty of Medicine, University of Helsinki, Helsinki, Finland
| | - Carola-Bibiane Schönlieb
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, UK
| |
Collapse
|
8
|
Smith DL, Held P. Moving toward precision PTSD treatment: predicting veterans' intensive PTSD treatment response using continuously updating machine learning models. Psychol Med 2023; 53:5500-5509. [PMID: 36259132 PMCID: PMC10482723 DOI: 10.1017/s0033291722002689] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/15/2022] [Revised: 07/28/2022] [Accepted: 08/02/2022] [Indexed: 02/01/2023]
Abstract
BACKGROUND Considerable heterogeneity exists in treatment response to first-line posttraumatic stress disorder (PTSD) treatments, such as Cognitive Processing Therapy (CPT). Relatively little is known about the timing of when during a course of care the treatment response becomes apparent. Novel machine learning methods, especially continuously updating prediction models, have the potential to address these gaps in our understanding of response and optimize PTSD treatment. METHODS Using data from a 3-week (n = 362) CPT-based intensive PTSD treatment program (ITP), we explored three methods for generating continuously updating prediction models to predict endpoint PTSD severity. These included Mixed Effects Bayesian Additive Regression Trees (MixedBART), Mixed Effects Random Forest (MERF) machine learning models, and Linear Mixed Effects models (LMM). Models used baseline and self-reported PTSD symptom severity data collected every other day during treatment. We then validated our findings by examining model performances in a separate, equally established, 2-week CPT-based ITP (n = 108). RESULTS Results across approaches were very similar and indicated modest prediction accuracy at baseline (R2 ~ 0.18), with increasing accuracy of predictions of final PTSD severity across program timepoints (e.g. mid-program R2 ~ 0.62). Similar findings were obtained when the models were applied to the 2-week ITP. Neither the MERF nor the MixedBART machine learning approach outperformed LMM prediction, though benefits of each may differ based on the application. CONCLUSIONS Utilizing continuously updating models in PTSD treatments may be beneficial for clinicians in determining whether an individual is responding, and when this determination can be made.
Collapse
Affiliation(s)
- Dale L. Smith
- Department of Psychiatry and Behavioral Sciences, Rush University Medical Center, 325 S. Paulina St., Suite 200, Chicago, IL 60612, USA
- Behavioral Sciences, Olivet Nazarene University, 1 University Ave., Bourbonnais, Illinois 60914, USA
| | - Philip Held
- Department of Psychiatry and Behavioral Sciences, Rush University Medical Center, 325 S. Paulina St., Suite 200, Chicago, IL 60612, USA
| |
Collapse
|
9
|
Feng Y, McGuire N, Walton A, Fox S, Papa A, Lakhani SR, McCart Reed AE. Predicting breast cancer-specific survival in metaplastic breast cancer patients using machine learning algorithms. J Pathol Inform 2023; 14:100329. [PMID: 37664452 PMCID: PMC10470383 DOI: 10.1016/j.jpi.2023.100329] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2023] [Revised: 08/03/2023] [Accepted: 08/04/2023] [Indexed: 09/05/2023] Open
Abstract
Metaplastic breast cancer (MpBC) is a rare and aggressive subtype of breast cancer, with data emerging on prognostic factors and survival prediction. This study aimed to develop machine learning models to predict breast cancer-specific survival (BCSS) in MpBC patients, utilizing a dataset of 160 patients with clinical, pathological, and biological variables. An in-depth variable selection process was carried out using gain ratio and correlation-based methods, resulting in 10 variables for model estimation. Five models (decision tree with bagging; logistic regression; multilayer perceptron; naïve Bayes; and, random forest algorithms) were evaluated using 10-fold cross-validation. Despite the constraints posed by the absence of therapeutic information, the random forest model exhibited the highest performance in predicting BCSS, with an ROC area of 0.808. This study emphasizes the potential of machine learning algorithms in predicting prognosis for complex and heterogeneous cancer subtypes using clinical datasets, and their potential to contribute to patient management. Further research that incorporates additional variables, such as treatment response, and more advanced machine learning techniques will likely enhance the predictive power of MpBC prognostic models.
Collapse
Affiliation(s)
- Yufan Feng
- UQ Centre for Clinical Research, Faculty of Medicine, The University of Queensland, Brisbane 4029, Australia
| | - Natasha McGuire
- UQ Centre for Clinical Research, Faculty of Medicine, The University of Queensland, Brisbane 4029, Australia
| | - Alexandra Walton
- UQ Centre for Clinical Research, Faculty of Medicine, The University of Queensland, Brisbane 4029, Australia
- Pathology Queensland, The Royal Brisbane and Women’s Hospital, Brisbane 4029, Australia
| | | | - Stephen Fox
- Peter MacCallum Cancer Centre and University of Melbourne, Melbourne 3000, Australia
| | - Antonella Papa
- Monash Biomedicine Discovery Institute, Monash University, Melbourne 3800, Australia
| | - Sunil R. Lakhani
- UQ Centre for Clinical Research, Faculty of Medicine, The University of Queensland, Brisbane 4029, Australia
- Pathology Queensland, The Royal Brisbane and Women’s Hospital, Brisbane 4029, Australia
| | - Amy E. McCart Reed
- UQ Centre for Clinical Research, Faculty of Medicine, The University of Queensland, Brisbane 4029, Australia
| |
Collapse
|
10
|
Lin W, Wang J, Ge J, Zhou R, Hu Y, Xiao L, Peng Q, Zheng Z. The activity of cuproptosis pathway calculated by AUCell algorithm was employed to construct cuproptosis landscape in lung adenocarcinoma. Discov Oncol 2023; 14:135. [PMID: 37481739 PMCID: PMC10363522 DOI: 10.1007/s12672-023-00755-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/12/2023] [Accepted: 07/14/2023] [Indexed: 07/25/2023] Open
Abstract
Cuproptosis is a recently described copper-dependent cell death pathway. Consequently, there are still few studies on lung adenocarcinoma (LUAD)-related cuproptosis, and we aimed to deepen in this matter. In this study, data from 503 patients with lung cancer from the TCGA-LUAD cohort data collection and 11 LUAD single-cells from GSE131907 as well as from 10 genes associated with cuproptosis were analyzed. The AUCell R package was used to determine the copper-dependent cell death pathway activity for each cell subpopulation, calculate the CellChat score, and display cell communication for each cell subpopulation. The PROGENy score was calculated to show the scores of tumor-related pathways in different cell populations. GO and KEGG analyses were used to calculate pathway activity. Univariate COX and random forest analyses were used to screen prognosis-associated genes and construct models. The ssGSEA and xCell algorithms were used to calculate the immunocyte infiltration score. Based on data from the GDSC database, the drug sensitivity score was calculated using oncoPredict. Finally, in vitro experiments were performed to determine the role of TLE1, the most important gene in the prognostic model. The 11 LUAD single-cell samples were classified into 8 different cell populations, from which epithelial cells showed the highest copper-dependent cell death pathway activity. Epithelial cell subsets were significantly positively correlated with MAKP, hypoxia, and other pathways. In addition, cell subgroup communication showed highly active collagen and APP pathways. Using the Findmark algorithm, differentially expressed genes (DEGs) between epithelial and other cell types were identified. Combined with the bulk data in the TCGA-LUAD database, DEGs were enriched in pathways such as EGFR tyrosine kinase inhibitor resistance, Hippo signaling pathway, and tight junction. Subsequently, we selected 4 genes (out of 112) with prognostic significance, ANKRD29, RHOV, TLE1, and NPAS2, and used them to construct a prognostic model. The high- and low-risk groups, distinguished by the median risk score, showed significantly different prognoses. Finally, we chose TLE1 as a biomarker based on the relative importance score in the prognostic model. In vitro experiments showed that TLE1 promotes tumor proliferation and migration and inhibits apoptosis.
Collapse
Affiliation(s)
- Weixian Lin
- Department of Respiratory and Critical Care Medicine, Nanfang Hospital, Southern Medical University, Guangzhou, Guangdong, China
| | - Jiaren Wang
- The First Clinical Medical School, Southern Medical University, Guangdong, Guangzhou, China
| | - Jing Ge
- Department of Pediatrics, Nanfang Hospital, Southern Medical University, Guangdong, Guangzhou, China
| | - Rui Zhou
- Department of Oncology, Nanfang Hospital, Southern Medical University, Guangdong, Guangzhou, China
- Guangdong Province Key Laboratory of Molecular Tumor Pathology, Guangzhou, Guangdong, China
| | - Yahui Hu
- Department of Huiqiao Medical Centre, Nanfang Hospital, Southern Medical University, Guangzhou, Guangdong, China
| | - Lushan Xiao
- Department of Infectious Diseases, Nanfang Hospital, Southern Medical University, Guangdong, Guangzhou, China
| | - Quanzhou Peng
- Department of Pathology, Shenzhen People's Hospital, The Second Clinical Medical College of Jinan University, The First Affiliated Hospital of Southern University of Science and Technology, Shenzhen, China.
| | - Zemao Zheng
- Department of Respiratory and Critical Care Medicine, Nanfang Hospital, Southern Medical University, Guangzhou, Guangdong, China.
| |
Collapse
|
11
|
Hildebrand RD, Chang DT, Ewongwoo AN, Ramchandran KJ, Gensheimer MF. Study of Patient and Physician Attitudes Toward Automated Prognostic Models for Patients With Metastatic Cancer. JCO Clin Cancer Inform 2023; 7:e2300023. [PMID: 37478393 DOI: 10.1200/cci.23.00023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2023] [Revised: 04/27/2023] [Accepted: 05/25/2023] [Indexed: 07/23/2023] Open
Abstract
PURPOSE For patients with cancer and their doctors, prognosis is important for choosing treatments and supportive care. Oncologists' life expectancy estimates are often inaccurate, and many patients are not aware of their general prognosis. Machine learning (ML) survival models could be useful in the clinic, but there are potential concerns involving accuracy, provider training, and patient involvement. We conducted a qualitative study to learn about patient and oncologist views on potentially using a ML model for patient care. METHODS Patients with metastatic cancer (n = 15) and their family members (n = 5), radiation oncologists (n = 5), and medical oncologists (n = 5) were recruited from a single academic health system. Participants were shown an anonymized report from a validated ML survival model for another patient, which included a predicted survival curve and a list of variables influencing predicted survival. Semistructured interviews were conducted using a script. RESULTS Every physician and patient who completed their interview said that they would want the option for the model to be used in their practice or care. Physicians stated that they would use an AI prognosis model for patient triage and increasing patient understanding, but had concerns about accuracy and explainability. Patients generally said that they would trust model results completely if presented by their physician but wanted to know if the model was being used in their care. Some reacted negatively to being shown a median survival prediction. CONCLUSION Patients and physicians were supportive of use of the model in the clinic, but had various concerns, which should be addressed as predictive models are increasingly deployed in practice.
Collapse
|
12
|
Liu YS, Thaliffdeen R, Han S, Park C. Use of machine learning to predict bladder cancer survival outcomes: a systematic literature review. Expert Rev Pharmacoecon Outcomes Res 2023; 23:761-771. [PMID: 37306511 DOI: 10.1080/14737167.2023.2224963] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2022] [Accepted: 06/09/2023] [Indexed: 06/13/2023]
Abstract
INTRODUCTION The objective of this systematic review is to summarize the use of machine learning (ML) in predicting overall survival (OS) in patients with bladder cancer. METHODS Search terms for bladder cancer, ML algorithms, and mortality were used to identify studies in PubMed and Web of Science as of February 2022. Notable inclusion/exclusion criteria contained the inclusion of studies that utilized patient-level datasets and exclusion of primary gene expression-related dataset studies. Study quality and bias were assessed using the International Journal of Medical Informatics (IJMEDI) checklist. RESULTS Of the 14 included studies, the most common algorithms were artificial neural networks (n = 8) and logistic regression (n = 4). Nine articles described missing data handling, with five articles removing patients with missing data entirely. With respect to feature selection, the most common sociodemographic variables were age (n = 9), gender (n = 9), and smoking status (n = 3), with clinical variables most commonly including tumor stage (n = 8), grade (n = 7), and lymph node involvement (n = 6). Most studies (n = 10) were of medium IJMEDI quality, with common areas of improvement being the descriptions of data preparation and deployment. CONCLUSIONS ML holds promise for optimizing bladder cancer care through accurate OS predictions, but challenges related to data processing, feature selection, and data source quality must be resolved to develop robust models. While this review is limited by its inability to compare models across studies, this systematic review will inform decision-making by various stakeholders to improve understanding of ML-based OS prediction in bladder cancer and foster interpretability of future models.
Collapse
Affiliation(s)
- Yi-Shao Liu
- College of Pharmacy, The University of Texas at Austin, 2409 University Ave, Austin, TX, USA
| | - Ryan Thaliffdeen
- College of Pharmacy, The University of Texas at Austin, 2409 University Ave, Austin, TX, USA
| | - Sola Han
- College of Pharmacy, The University of Texas at Austin, 2409 University Ave, Austin, TX, USA
| | - Chanhyun Park
- College of Pharmacy, The University of Texas at Austin, 2409 University Ave, Austin, TX, USA
| |
Collapse
|
13
|
Pangarsa EA, Rizky D, Tandarto K, Setiawan B, Santosa D, Hadiyanto JN, Kyana S, Suharti C. The effect of multidisciplinary team on survival rates of women with breast cancer: a systematic review and meta-analysis. Ann Med Surg (Lond) 2023; 85:2940-2948. [PMID: 37363480 PMCID: PMC10289736 DOI: 10.1097/ms9.0000000000000914] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2023] [Accepted: 05/14/2023] [Indexed: 06/28/2023] Open
Abstract
Breast cancer is quite frequent all around the world. This disease was responsible for an estimated 2.1 million malignancies in 2022, making it the seventh-highest cause of cancer deaths globally. A multidisciplinary team (MDT) care policy was developed in the United Kingdom (UK) in 1995 to enhance the quality of care for cancer patients. The purpose of this systematic review and meta-analysis study is to assess the effects of MDT on breast cancer survival rates. Methods This study was conducted in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) 2020. Systematic search was conducted in several international databases including Google Scholar, PubMed, EBSCOhost, and Proquest from 2012 to 2022. The authors used RevMan 5.4 to do the meta-analysis of the pooled hazard ratio. Newcastle-Ottawa Scale to measure the risk of bias. Newcastle-Ottawa Scale evaluated participant selection, comparability, and reporting of results using eight subscale items. Egger's test funnel plot was used to assess the potential publication bias for this study. Results A total of 1187 studies were identified from research database. The authors found a total of six studies from six different countries (China, the UK, Taiwan, Australia, Africa, and France) included for this study. Based on the meta-analysis of the pooled hazard ratio of the included studies, the authors found that the overall effect size of the study was 0.80 (CI 95%: 0.73-0.88). Conclusions Breast cancer patients who participated in well-organized MDT discussions had a greater survival rate than those who did not.
Collapse
Affiliation(s)
- Eko A. Pangarsa
- Hematology Medical Oncology Division, Department of Internal Medicine, Faculty of Medicine, Diponegoro University/Dr. Kariadi General Hospital Semarang, Indonesia
| | - Daniel Rizky
- Hematology Medical Oncology Division, Department of Internal Medicine, Faculty of Medicine, Diponegoro University/Dr. Kariadi General Hospital Semarang, Indonesia
| | - Kevin Tandarto
- Hematology Medical Oncology Division, Department of Internal Medicine, Faculty of Medicine, Diponegoro University/Dr. Kariadi General Hospital Semarang, Indonesia
| | - Budi Setiawan
- Hematology Medical Oncology Division, Department of Internal Medicine, Faculty of Medicine, Diponegoro University/Dr. Kariadi General Hospital Semarang, Indonesia
| | - Damai Santosa
- Hematology Medical Oncology Division, Department of Internal Medicine, Faculty of Medicine, Diponegoro University/Dr. Kariadi General Hospital Semarang, Indonesia
| | - Jessica N. Hadiyanto
- Department of Internal Medicine, Dr. Kariadi General Hospital/Faculty of Medicine, Diponegoro University, Indonesia
| | - Salma Kyana
- Department of Internal Medicine, Dr. Kariadi General Hospital/Faculty of Medicine, Diponegoro University, Indonesia
| | - Catharina Suharti
- Hematology Medical Oncology Division, Department of Internal Medicine, Faculty of Medicine, Diponegoro University/Dr. Kariadi General Hospital Semarang, Indonesia
| |
Collapse
|
14
|
Zhao X, Jiang C. The prediction of distant metastasis risk for male breast cancer patients based on an interpretable machine learning model. BMC Med Inform Decis Mak 2023; 23:74. [PMID: 37085843 PMCID: PMC10120176 DOI: 10.1186/s12911-023-02166-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2023] [Accepted: 04/04/2023] [Indexed: 04/23/2023] Open
Abstract
OBJECTIVES This research was designed to compare the ability of different machine learning (ML) models and nomogram to predict distant metastasis in male breast cancer (MBC) patients and to interpret the optimal ML model by SHapley Additive exPlanations (SHAP) framework. METHODS Four powerful ML models were developed using data from male breast cancer (MBC) patients in the SEER database between 2010 and 2015 and MBC patients from our hospital between 2010 and 2020. The area under curve (AUC) and Brier score were used to assess the capacity of different models. The Delong test was applied to compare the performance of the models. Univariable and multivariable analysis were conducted using logistic regression. RESULTS Of 2351 patients were analyzed; 168 (7.1%) had distant metastasis (M1); 117 (5.0%) had bone metastasis, and 71 (3.0%) had lung metastasis. The median age at diagnosis is 68.0 years old. Most patients did not receive radiotherapy (1723, 73.3%) or chemotherapy (1447, 61.5%). The XGB model was the best ML model for predicting M1 in MBC patients. It showed the largest AUC value in the tenfold cross validation (AUC:0.884; SD:0.02), training (AUC:0.907; 95% CI: 0.899-0.917), testing (AUC:0.827; 95% CI: 0.802-0.857) and external validation (AUC:0.754; 95% CI: 0.739-0.771) sets. It also showed powerful ability in the prediction of bone metastasis (AUC: 0.880, 95% CI: 0.856-0.903 in the training set; AUC: 0.823, 95% CI:0.790-0.848 in the test set; AUC: 0.747, 95% CI: 0.727-0.764 in the external validation set) and lung metastasis (AUC: 0.906, 95% CI: 0.877-0.928 in training set; AUC: 0.859, 95% CI: 0.816-0.891 in the test set; AUC: 0.756, 95% CI: 0.732-0.777 in the external validation set). The AUC value of the XGB model was larger than that of nomogram in the training (0.907 vs 0.802) and external validation (0.754 vs 0.706) sets. CONCLUSIONS The XGB model is a better predictor of distant metastasis among MBC patients than other ML models and nomogram; furthermore, the XGB model is a powerful model for predicting bone and lung metastasis. Combining with SHAP values, it could help doctors intuitively understand the impact of each variable on outcome.
Collapse
Affiliation(s)
- Xuhai Zhao
- Department of Breast Surgery, Harbin Medical University Cancer Hospital, Harbin, China
| | - Cong Jiang
- Department of Breast Surgery, Harbin Medical University Cancer Hospital, Harbin, China.
| |
Collapse
|
15
|
Rahimi M, Akbari A, Asadi F, Emami H. Cervical cancer survival prediction by machine learning algorithms: a systematic review. BMC Cancer 2023; 23:341. [PMID: 37055741 PMCID: PMC10103471 DOI: 10.1186/s12885-023-10808-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2023] [Accepted: 04/05/2023] [Indexed: 04/15/2023] Open
Abstract
BACKGROUND Cervical cancer is a common malignant tumor of the female reproductive system and is considered a leading cause of mortality in women worldwide. The analysis of time to event, which is crucial for any clinical research, can be well done with the method of survival prediction. This study aims to systematically investigate the use of machine learning to predict survival in patients with cervical cancer. METHOD An electronic search of the PubMed, Scopus, and Web of Science databases was performed on October 1, 2022. All articles extracted from the databases were collected in an Excel file and duplicate articles were removed. The articles were screened twice based on the title and the abstract and checked again with the inclusion and exclusion criteria. The main inclusion criterion was machine learning algorithms for predicting cervical cancer survival. The information extracted from the articles included authors, publication year, dataset details, survival type, evaluation criteria, machine learning models, and the algorithm execution method. RESULTS A total of 13 articles were included in this study, most of which were published from 2018 onwards. The most common machine learning models were random forest (6 articles, 46%), logistic regression (4 articles, 30%), support vector machines (3 articles, 23%), ensemble and hybrid learning (3 articles, 23%), and Deep Learning (3 articles, 23%). The number of sample datasets in the study varied between 85 and 14946 patients, and the models were internally validated except for two articles. The area under the curve (AUC) range for overall survival (0.40 to 0.99), disease-free survival (0.56 to 0.88), and progression-free survival (0.67 to 0.81), respectively from (lowest to highest) received. Finally, 15 variables with an effective role in predicting cervical cancer survival were identified. CONCLUSION Combining heterogeneous multidimensional data with machine learning techniques can play a very influential role in predicting cervical cancer survival. Despite the benefits of machine learning, the problem of interpretability, explainability, and imbalanced datasets is still one of the biggest challenges. Providing machine learning algorithms for survival prediction as a standard requires further studies.
Collapse
Affiliation(s)
- Milad Rahimi
- Department of Health Information Technology and Management, Medical Informatics, School of Allied Medical Sciences, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Atieh Akbari
- Obstetrics and Gynecology, Cancer Research Center, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Farkhondeh Asadi
- Department of Health Information Technology and Management, Health Information Management, School of Allied Medical Sciences, Shahid Beheshti University of Medical Sciences, Tehran, Iran.
| | - Hassan Emami
- Department of Health Information Technology and Management, Information Technology, School of Allied Medical Sciences, Shahid Beheshti University of Medical Sciences, Tehran, Iran.
| |
Collapse
|
16
|
Guleria HV, Luqmani AM, Kothari HD, Phukan P, Patil S, Pareek P, Kotecha K, Abraham A, Gabralla LA. Enhancing the Breast Histopathology Image Analysis for Cancer Detection Using Variational Autoencoder. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2023; 20:ijerph20054244. [PMID: 36901255 PMCID: PMC10002012 DOI: 10.3390/ijerph20054244] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/21/2023] [Revised: 02/08/2023] [Accepted: 02/09/2023] [Indexed: 06/12/2023]
Abstract
A breast tissue biopsy is performed to identify the nature of a tumour, as it can be either cancerous or benign. The first implementations involved the use of machine learning algorithms. Random Forest and Support Vector Machine (SVM) were used to classify the input histopathological images into whether they were cancerous or non-cancerous. The implementations continued to provide promising results, and then Artificial Neural Networks (ANNs) were applied for this purpose. We propose an approach for reconstructing the images using a Variational Autoencoder (VAE) and the Denoising Variational Autoencoder (DVAE) and then use a Convolutional Neural Network (CNN) model. Afterwards, we predicted whether the input image was cancerous or non-cancerous. Our implementation provides predictions with 73% accuracy, which is greater than the results produced by our custom-built CNN on our dataset. The proposed architecture will prove to be a new field of research and a new area to be explored in the field of computer vision using CNN and Generative Modelling since it incorporates reconstructions of the original input images and provides predictions on them thereafter.
Collapse
Affiliation(s)
- Harsh Vardhan Guleria
- Symbiosis Institute of Technology, Symbiosis International University, Pune 412115, India
| | - Ali Mazhar Luqmani
- Symbiosis Institute of Technology, Symbiosis International University, Pune 412115, India
| | - Harsh Devendra Kothari
- Symbiosis Institute of Technology, Symbiosis International University, Pune 412115, India
| | - Priyanshu Phukan
- Symbiosis Institute of Technology, Symbiosis International University, Pune 412115, India
| | - Shruti Patil
- Symbiosis Institute of Technology, Symbiosis International University, Pune 412115, India
| | - Preksha Pareek
- Symbiosis Institute of Technology, Symbiosis International University, Pune 412115, India
| | - Ketan Kotecha
- Symbiosis Institute of Technology, Symbiosis International University, Pune 412115, India
| | - Ajith Abraham
- Faculty of Computing and Data Sciences, FLAME University, Lavale, Pune 412115, India
| | - Lubna Abdelkareim Gabralla
- Department of Computer Science and Information Technology, College of Applied, Nourah Bint Abdulrahman University, Riyadh 11671, Saudi Arabia
| |
Collapse
|
17
|
Kokabi M, Sui J, Gandotra N, Pournadali Khamseh A, Scharfe C, Javanmard M. Nucleic Acid Quantification by Multi-Frequency Impedance Cytometry and Machine Learning. BIOSENSORS 2023; 13:bios13030316. [PMID: 36979528 PMCID: PMC10046493 DOI: 10.3390/bios13030316] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/11/2023] [Revised: 02/15/2023] [Accepted: 02/20/2023] [Indexed: 06/10/2023]
Abstract
Determining nucleic acid concentrations in a sample is an important step prior to proceeding with downstream analysis in molecular diagnostics. Given the need for testing DNA amounts and its purity in many samples, including in samples with very small input DNA, there is utility of novel machine learning approaches for accurate and high-throughput DNA quantification. Here, we demonstrated the ability of a neural network to predict DNA amounts coupled to paramagnetic beads. To this end, a custom-made microfluidic chip is applied to detect DNA molecules bound to beads by measuring the impedance peak response (IPR) at multiple frequencies. We leveraged electrical measurements including the frequency and imaginary and real parts of the peak intensity within a microfluidic channel as the input of deep learning models to predict DNA concentration. Specifically, 10 different deep learning architectures are examined. The results of the proposed regression model indicate that an R_Squared of 97% with a slope of 0.68 is achievable. Consequently, machine learning models can be a suitable, fast, and accurate method to measure nucleic acid concentration in a sample. The results presented in this study demonstrate the ability of the proposed neural network to use the information embedded in raw impedance data to predict the amount of DNA concentration.
Collapse
Affiliation(s)
- Mahtab Kokabi
- Department of Electrical and Computer Engineering, Rutgers University, Piscataway, NJ 08854, USA
| | - Jianye Sui
- Department of Electrical and Computer Engineering, Rutgers University, Piscataway, NJ 08854, USA
| | - Neeru Gandotra
- Department of Genetics, Yale University School of Medicine, 333 Cedar Street, New Haven, CT 06520, USA
| | | | - Curt Scharfe
- Department of Genetics, Yale University School of Medicine, 333 Cedar Street, New Haven, CT 06520, USA
| | - Mehdi Javanmard
- Department of Electrical and Computer Engineering, Rutgers University, Piscataway, NJ 08854, USA
| |
Collapse
|
18
|
Massafra R, Fanizzi A, Amoroso N, Bove S, Comes MC, Pomarico D, Didonna V, Diotaiuti S, Galati L, Giotta F, La Forgia D, Latorre A, Lombardi A, Nardone A, Pastena MI, Ressa CM, Rinaldi L, Tamborra P, Zito A, Paradiso AV, Bellotti R, Lorusso V. Analyzing breast cancer invasive disease event classification through explainable artificial intelligence. Front Med (Lausanne) 2023; 10:1116354. [PMID: 36817766 PMCID: PMC9932275 DOI: 10.3389/fmed.2023.1116354] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2022] [Accepted: 01/13/2023] [Indexed: 02/05/2023] Open
Abstract
Introduction Recently, accurate machine learning and deep learning approaches have been dedicated to the investigation of breast cancer invasive disease events (IDEs), such as recurrence, contralateral and second cancers. However, such approaches are poorly interpretable. Methods Thus, we designed an Explainable Artificial Intelligence (XAI) framework to investigate IDEs within a cohort of 486 breast cancer patients enrolled at IRCCS Istituto Tumori "Giovanni Paolo II" in Bari, Italy. Using Shapley values, we determined the IDE driving features according to two periods, often adopted in clinical practice, of 5 and 10 years from the first tumor diagnosis. Results Age, tumor diameter, surgery type, and multiplicity are predominant within the 5-year frame, while therapy-related features, including hormone, chemotherapy schemes and lymphovascular invasion, dominate the 10-year IDE prediction. Estrogen Receptor (ER), proliferation marker Ki67 and metastatic lymph nodes affect both frames. Discussion Thus, our framework aims at shortening the distance between AI and clinical practice.
Collapse
Affiliation(s)
| | | | - Nicola Amoroso
- INFN, Sezione di Bari, Bari, Italy
- Dipartimento di Farmacia-Scienze del Farmaco, Università degli Studi di Bari Aldo Moro, Bari, Italy
| | - Samantha Bove
- IRCCS Istituto Tumori “Giovanni Paolo II”, Bari, Italy
| | | | - Domenico Pomarico
- INFN, Sezione di Bari, Bari, Italy
- Dipartimento di Fisica, Università degli Studi di Bari Aldo Moro, Bari, Italy
| | | | | | - Luisa Galati
- International Agency for Research on Cancer, Lyon, France
| | | | | | | | - Angela Lombardi
- Dipartimento di Ingegneria Elettrica e dell'Informazione, Politecnico di Bari, Bari, Italy
| | | | | | | | - Lucia Rinaldi
- IRCCS Istituto Tumori “Giovanni Paolo II”, Bari, Italy
| | | | - Alfredo Zito
- IRCCS Istituto Tumori “Giovanni Paolo II”, Bari, Italy
| | | | - Roberto Bellotti
- INFN, Sezione di Bari, Bari, Italy
- Dipartimento di Fisica, Università degli Studi di Bari Aldo Moro, Bari, Italy
| | - Vito Lorusso
- IRCCS Istituto Tumori “Giovanni Paolo II”, Bari, Italy
| |
Collapse
|
19
|
Nunez JJ, Leung B, Ho C, Bates AT, Ng RT. Predicting the Survival of Patients With Cancer From Their Initial Oncology Consultation Document Using Natural Language Processing. JAMA Netw Open 2023; 6:e230813. [PMID: 36848085 PMCID: PMC9972192 DOI: 10.1001/jamanetworkopen.2023.0813] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 03/01/2023] Open
Abstract
IMPORTANCE Predicting short- and long-term survival of patients with cancer may improve their care. Prior predictive models either use data with limited availability or predict the outcome of only 1 type of cancer. OBJECTIVE To investigate whether natural language processing can predict survival of patients with general cancer from a patient's initial oncologist consultation document. DESIGN, SETTING, AND PARTICIPANTS This retrospective prognostic study used data from 47 625 of 59 800 patients who started cancer care at any of the 6 BC Cancer sites located in the province of British Columbia between April 1, 2011, and December 31, 2016. Mortality data were updated until April 6, 2022, and data were analyzed from update until September 30, 2022. All patients with a medical or radiation oncologist consultation document generated within 180 days of diagnosis were included; patients seen for multiple cancers were excluded. EXPOSURES Initial oncologist consultation documents were analyzed using traditional and neural language models. MAIN OUTCOMES AND MEASURES The primary outcome was the performance of the predictive models, including balanced accuracy and receiver operating characteristics area under the curve (AUC). The secondary outcome was investigating what words the models used. RESULTS Of the 47 625 patients in the sample, 25 428 (53.4%) were female and 22 197 (46.6%) were male, with a mean (SD) age of 64.9 (13.7) years. A total of 41 447 patients (87.0%) survived 6 months, 31 143 (65.4%) survived 36 months, and 27 880 (58.5%) survived 60 months, calculated from their initial oncologist consultation. The best models achieved a balanced accuracy of 0.856 (AUC, 0.928) for predicting 6-month survival, 0.842 (AUC, 0.918) for 36-month survival, and 0.837 (AUC, 0.918) for 60-month survival, on a holdout test set. Differences in what words were important for predicting 6- vs 60-month survival were found. CONCLUSIONS AND RELEVANCE These findings suggest that models performed comparably with or better than previous models predicting cancer survival and that they may be able to predict survival using readily available data without focusing on 1 cancer type.
Collapse
Affiliation(s)
- John-Jose Nunez
- BC Cancer, Vancouver, British Columbia, Canada
- Department of Computer Science, University of British Columbia, Vancouver, British Columbia, Canada
- Department of Psychiatry, University of British Columbia, Vancouver, British Columbia, Canada
| | | | - Cheryl Ho
- BC Cancer, Vancouver, British Columbia, Canada
| | - Alan T. Bates
- BC Cancer, Vancouver, British Columbia, Canada
- Department of Psychiatry, University of British Columbia, Vancouver, British Columbia, Canada
| | - Raymond T. Ng
- Department of Computer Science, University of British Columbia, Vancouver, British Columbia, Canada
| |
Collapse
|
20
|
Dammu H, Ren T, Duong TQ. Deep learning prediction of pathological complete response, residual cancer burden, and progression-free survival in breast cancer patients. PLoS One 2023; 18:e0280148. [PMID: 36607982 PMCID: PMC9821469 DOI: 10.1371/journal.pone.0280148] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2022] [Accepted: 12/20/2022] [Indexed: 01/07/2023] Open
Abstract
The goal of this study was to employ novel deep-learning convolutional-neural-network (CNN) to predict pathological complete response (PCR), residual cancer burden (RCB), and progression-free survival (PFS) in breast cancer patients treated with neoadjuvant chemotherapy using longitudinal multiparametric MRI, demographics, and molecular subtypes as inputs. In the I-SPY-1 TRIAL, 155 patients with stage 2 or 3 breast cancer with breast tumors underwent neoadjuvant chemotherapy met the inclusion/exclusion criteria. The inputs were dynamic-contrast-enhanced (DCE) MRI, and T2- weighted MRI as three-dimensional whole-images without the tumor segmentation, as well as molecular subtypes and demographics. The outcomes were PCR, RCB, and PFS. Three ("Integrated", "Stack" and "Concatenation") CNN were evaluated using receiver-operating characteristics and mean absolute errors. The Integrated approach outperformed the "Stack" or "Concatenation" CNN. Inclusion of both MRI and non-MRI data outperformed either alone. The combined pre- and post-neoadjuvant chemotherapy data outperformed either alone. Using the best model and data combination, PCR prediction yielded an accuracy of 0.81±0.03 and AUC of 0.83±0.03; RCB prediction yielded an accuracy of 0.80±0.02 and Cohen's κ of 0.73±0.03; PFS prediction yielded a mean absolute error of 24.6±0.7 months (survival ranged from 6.6 to 127.5 months). Deep learning using longitudinal multiparametric MRI, demographics, and molecular subtypes accurately predicts PCR, RCB, and PFS in breast cancer patients. This approach may prove useful for treatment selection, planning, execution, and mid-treatment adjustment.
Collapse
Affiliation(s)
- Hongyi Dammu
- Department of Radiology, Montefiore Medical Center and Albert Einstein College of Medicine, Bronx, New York, United States of America
| | - Thomas Ren
- Department of Radiology, Montefiore Medical Center and Albert Einstein College of Medicine, Bronx, New York, United States of America
| | - Tim Q. Duong
- Department of Radiology, Montefiore Medical Center and Albert Einstein College of Medicine, Bronx, New York, United States of America
- * E-mail:
| |
Collapse
|
21
|
Sorayaie Azar A, Babaei Rikan S, Naemi A, Bagherzadeh Mohasefi J, Pirnejad H, Bagherzadeh Mohasefi M, Wiil UK. Application of machine learning techniques for predicting survival in ovarian cancer. BMC Med Inform Decis Mak 2022; 22:345. [PMID: 36585641 PMCID: PMC9801354 DOI: 10.1186/s12911-022-02087-y] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2022] [Accepted: 12/15/2022] [Indexed: 12/31/2022] Open
Abstract
BACKGROUND Ovarian cancer is the fifth leading cause of mortality among women in the United States. Ovarian cancer is also known as forgotten cancer or silent disease. The survival of ovarian cancer patients depends on several factors, including the treatment process and the prognosis. METHODS The ovarian cancer patients' dataset is compiled from the Surveillance, Epidemiology, and End Results (SEER) database. With the help of a clinician, the dataset is curated, and the most relevant features are selected. Pearson's second coefficient of skewness test is used to evaluate the skewness of the dataset. Pearson correlation coefficient is also used to investigate the associations between features. Statistical test is utilized to evaluate the significance of the features. Six Machine Learning (ML) models, including K-Nearest Neighbors , Support Vector Machine (SVM), Decision Tree (DT), Random Forest (RF), Adaptive Boosting (AdaBoost), and Extreme Gradient Boosting (XGBoost), are implemented for survival prediction in both classification and regression approaches. An interpretable method, Shapley Additive Explanations (SHAP), is applied to clarify the decision-making process and determine the importance of each feature in prediction. Additionally, DTs of the RF model are displayed to show how the model predicts the survival intervals. RESULTS Our results show that RF (Accuracy = 88.72%, AUC = 82.38%) and XGBoost (Root Mean Squad Error (RMSE)) = 20.61%, R2 = 0.4667) have the best performance for classification and regression approaches, respectively. Furthermore, using the SHAP method along with extracted DTs of the RF model, the most important features in the dataset are identified. Histologic type ICD-O-3, chemotherapy recode, year of diagnosis, age at diagnosis, tumor stage, and grade are the most important determinant factors in survival prediction. CONCLUSION To the best of our knowledge, our study is the first study that develops various ML models to predict ovarian cancer patients' survival on the SEER database in both classification and regression approaches. These ML algorithms also achieve more accurate results and outperform statistical methods. Furthermore, our study is the first study to use the SHAP method to increase confidence and transparency of the proposed models' prediction for clinicians. Moreover, our developed models, as an automated auxiliary tool, can help clinicians to have a better understanding of the estimated survival as well as important features that affect survival.
Collapse
Affiliation(s)
- Amir Sorayaie Azar
- grid.412763.50000 0004 0442 8645Department of Computer Engineering, Urmia University, Urmia, Iran
| | - Samin Babaei Rikan
- grid.412763.50000 0004 0442 8645Department of Computer Engineering, Urmia University, Urmia, Iran
| | - Amin Naemi
- grid.10825.3e0000 0001 0728 0170Center for Health Informatics and Technology, The Maersk Mc-Kinney Moller Institute, University of Southern Denmark, Odense, Denmark
| | | | - Habibollah Pirnejad
- grid.412763.50000 0004 0442 8645Patient Safety Research Center, Clinical Research Institute, Urmia University of Medical Sciences, Urmia, Iran ,grid.6906.90000000092621349Erasmus School of Health Policy and Management (ESHPM), Erasmus University Rotterdam, Rotterdam, The Netherlands
| | | | - Uffe Kock Wiil
- grid.10825.3e0000 0001 0728 0170Center for Health Informatics and Technology, The Maersk Mc-Kinney Moller Institute, University of Southern Denmark, Odense, Denmark
| |
Collapse
|
22
|
Nik Ab Kadir MN, Yaacob NM, Yusof SN, Ab Hadi IS, Musa KI, Mohd Isa SA, Bahtiar B, Adam F, Yahya MM, Hairon SM. Development of Predictive Models for Survival among Women with Breast Cancer in Malaysia. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2022; 19:15335. [PMID: 36430052 PMCID: PMC9690612 DOI: 10.3390/ijerph192215335] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/10/2022] [Revised: 11/17/2022] [Accepted: 11/18/2022] [Indexed: 06/16/2023]
Abstract
Prediction of survival probabilities based on models developed by other countries has shown inconsistent findings among Malaysian patients. This study aimed to develop predictive models for survival among women with breast cancer in Malaysia. A retrospective cohort study was conducted involving patients who were diagnosed between 2012 and 2016 in seven breast cancer centres, where their survival status was followed until 31 December 2021. A total of 13 predictors were selected to model five-year survival probabilities by applying Cox proportional hazards (PH), artificial neural networks (ANN), and decision tree (DT) classification analysis. The random-split dataset strategy was used to develop and measure the models' performance. Among 1006 patients, the majority were Malay, with ductal carcinoma, hormone-sensitive, HER2-negative, at T2-, N1-stage, without metastasis, received surgery and chemotherapy. The estimated five-year survival rate was 60.5% (95% CI: 57.6, 63.6). For Cox PH, the c-index was 0.82 for model derivation and 0.81 for validation. The model was well-calibrated. The Cox PH model outperformed the DT and ANN models in most performance indices, with the Cox PH model having the highest accuracy of 0.841. The accuracies of the DT and ANN models were 0.811 and 0.821, respectively. The Cox PH model is more useful for survival prediction in this study's setting.
Collapse
Affiliation(s)
- Mohd Nasrullah Nik Ab Kadir
- Department of Community Medicine, School of Medical Sciences, Universiti Sains Malaysia, Kubang Kerian 16150, Kelantan, Malaysia
| | - Najib Majdi Yaacob
- Biostatistics and Research Methodology Unit, School of Medical Sciences, Universiti Sains Malaysia, Kubang Kerian 16150, Kelantan, Malaysia
| | - Siti Norbayah Yusof
- Malaysian National Cancer Registry Department, National Cancer Institute, Ministry of Health Malaysia, Putrajaya 62250, Federal Territory of Putrajaya, Malaysia
| | - Imi Sairi Ab Hadi
- Breast and Endocrine Surgery Unit, Department of Surgery, Hospital Raja Perempuan Zainab II, Ministry of Health Malaysia, Kota Bharu 15586, Kelantan, Malaysia
| | - Kamarul Imran Musa
- Department of Community Medicine, School of Medical Sciences, Universiti Sains Malaysia, Kubang Kerian 16150, Kelantan, Malaysia
| | - Seoparjoo Azmel Mohd Isa
- Department of Pathology, School of Medical Sciences, Universiti Sains Malaysia, Kubang Kerian 16150, Kelantan, Malaysia
| | - Balqis Bahtiar
- Malaysian National Cancer Registry Department, National Cancer Institute, Ministry of Health Malaysia, Putrajaya 62250, Federal Territory of Putrajaya, Malaysia
| | - Farzaana Adam
- Public Health Division, Penang State Health Department, Ministry of Health Malaysia, Georgetown 10590, Penang, Malaysia
| | - Maya Mazuwin Yahya
- Department of Surgery, School of Medical Sciences, Universiti Sains Malaysia, Kubang Kerian 16150, Kelantan, Malaysia
| | - Suhaily Mohd Hairon
- Department of Community Medicine, School of Medical Sciences, Universiti Sains Malaysia, Kubang Kerian 16150, Kelantan, Malaysia
| |
Collapse
|
23
|
Hanis TM, Ruhaiyem NIR, Arifin WN, Haron J, Wan Abdul Rahman WF, Abdullah R, Musa KI. Over-the-Counter Breast Cancer Classification Using Machine Learning and Patient Registration Records. Diagnostics (Basel) 2022; 12:diagnostics12112826. [PMID: 36428886 PMCID: PMC9689364 DOI: 10.3390/diagnostics12112826] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2022] [Revised: 10/13/2022] [Accepted: 10/15/2022] [Indexed: 11/18/2022] Open
Abstract
This study aims to determine the feasibility of machine learning (ML) and patient registration record to be utilised to develop an over-the-counter (OTC) screening model for breast cancer risk estimation. Data were retrospectively collected from women who came to the Hospital Universiti Sains Malaysia, Malaysia for breast-related problems. Eight ML models were used: k-nearest neighbour (kNN), elastic-net logistic regression, multivariate adaptive regression splines, artificial neural network, partial least square, random forest, support vector machine (SVM), and extreme gradient boosting. Features utilised for the development of the screening models were limited to information in the patient registration form. The final model was evaluated in terms of performance across a mammographic density. Additionally, the feature importance of the final model was assessed using the model agnostic approach. kNN had the highest Youden J index, precision, and PR-AUC, while SVM had the highest F2 score. The kNN model was selected as the final model. The model had a balanced performance in terms of sensitivity, specificity, and PR-AUC across the mammographic density groups. The most important feature was the age at examination. In conclusion, this study showed that ML and patient registration information are feasible to be used as the OTC screening model for breast cancer.
Collapse
Affiliation(s)
- Tengku Muhammad Hanis
- Department of Community Medicine, School of Medical Sciences, Universiti Sains Malaysia, Kubang Kerian 16150, Kelantan, Malaysia
- Correspondence: (T.M.H.); (K.I.M.)
| | | | - Wan Nor Arifin
- Biostatistics and Research Methodology Unit, School of Medical Sciences, Universiti Sains Malaysia, Kubang Kerian 16150, Kelantan, Malaysia
| | - Juhara Haron
- Department of Radiology, School of Medical Sciences, Universiti Sains Malaysia, Kubang Kerian 16150, Kelantan, Malaysia
- Breast Cancer Awareness and Research Unit, Hospital Universiti Sains Malaysia, Kubang Kerian 16150, Kelantan, Malaysia
| | - Wan Faiziah Wan Abdul Rahman
- Breast Cancer Awareness and Research Unit, Hospital Universiti Sains Malaysia, Kubang Kerian 16150, Kelantan, Malaysia
- Department of Pathology, School of Medical Sciences, Universiti Sains Malaysia, Kubang Kerian 16150, Kelantan, Malaysia
| | - Rosni Abdullah
- School of Computer Sciences, Universiti Sains Malaysia, Gelugor 11800, Penang, Malaysia
| | - Kamarul Imran Musa
- Department of Community Medicine, School of Medical Sciences, Universiti Sains Malaysia, Kubang Kerian 16150, Kelantan, Malaysia
- Correspondence: (T.M.H.); (K.I.M.)
| |
Collapse
|
24
|
Ozcan I, Aydin H, Cetinkaya A. Comparison of Classification Success Rates of Different Machine Learning Algorithms in the Diagnosis of Breast Cancer. Asian Pac J Cancer Prev 2022; 23:3287-3297. [PMID: 36308351 PMCID: PMC9924317 DOI: 10.31557/apjcp.2022.23.10.3287] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2021] [Indexed: 02/18/2023] Open
Abstract
OBJECTIVE To identify which Machine Learning (ML) algorithms are the most successful in predicting and diagnosing breast cancer according to accuracy rates. METHODS The "College of Wisconsin Breast Cancer Dataset", which consists of 569 data and 30 features, was classified using Support Vector Machine (SVM), Naive Bayes (NB), Random Forest (RF), Decision Tree (DT), K-Nearest Neighbor (KNN), Logistic Regression (LR), Multilayer Perceptron (MLP), Linear Discriminant Analysis (LDA), XgBoost (XGB), Ada-Boost (ABC) and Gradient Boosting (GBC) ML algorithms. Before the classification process, the dataset was preprocessed. Sensitivity, accuracy, and definiteness metrics were used to measure the success of the methods. RESULT Compared to other ML algorithms used in the study, the GBC ML algorithm was found to be the most successful method in the classification of tumors with an accuracy of 99.12%. The XGB ML algorithm was found to be the lowest method with an accuracy rate of 88.10%. In addition, it was determined that the general accuracy rates of the 11 ML algorithms used in the study varied between 88-95%. CONCLUSION When the results obtained from the ML classifiers used in the study are evaluated, the efficiency of the GBC algorithm in the classification of tumors is obvious. It can be said that the success rates obtained from 11 different ML algorithms used in the study are valuable in terms of being used to predict different cancer types.
Collapse
Affiliation(s)
- Irem Ozcan
- Department of Computer Engineering, Faculty of Engineering and Architecture, Istanbul Gelisim University, Istanbul, Turkey.
| | - Hakan Aydin
- Department of Computer Engineering, Faculty of Engineering, Istanbul Topkapı University, Istanbul, Turkey.
| | - Ali Cetinkaya
- Department of Electronics Technology, Istanbul Gelisim Vocational School, Istanbul Gelisim University, Istanbul, Turkey. ,For Correspondence:
| |
Collapse
|
25
|
Gupta S, Gupta MK, Shabaz M, Sharma A. Deep learning techniques for cancer classification using microarray gene expression data. Front Physiol 2022; 13:952709. [PMID: 36246115 PMCID: PMC9563992 DOI: 10.3389/fphys.2022.952709] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2022] [Accepted: 09/01/2022] [Indexed: 11/28/2022] Open
Abstract
Cancer is one of the top causes of death globally. Recently, microarray gene expression data has been used to aid in cancer’s effective and early detection. The use of DNA microarray technology to uncover information from the expression levels of thousands of genes has enormous promise. The DNA microarray technique can determine the levels of thousands of genes simultaneously in a single experiment. The analysis of gene expression is critical in many disciplines of biological study to obtain the necessary information. This study analyses all the research studies focused on optimizing gene selection for cancer detection using artificial intelligence. One of the most challenging issues is figuring out how to extract meaningful information from massive databases. Deep Learning architectures have performed efficiently in numerous sectors and are used to diagnose many other chronic diseases and to assist physicians in making medical decisions. In this study, we have evaluated the results of different optimizers on a RNA sequence dataset. The Deep learning algorithm proposed in the study classifies five different forms of cancer, including kidney renal clear cell carcinoma (KIRC), Breast Invasive Carcinoma (BRCA), lung adenocarcinoma (LUAD), Prostate Adenocarcinoma (PRAD) and Colon Adenocarcinoma (COAD). The performance of different optimizers like Stochastic gradient descent (SGD), Root Mean Squared Propagation (RMSProp), Adaptive Gradient Optimizer (AdaGrad), and Adaptive Momentum (AdaM). The experimental results gathered on the dataset affirm that AdaGrad and Adam. Also, the performance analysis has been done using different learning rates and decay rates. This study discusses current advancements in deep learning-based gene expression data analysis using optimized feature selection methods.
Collapse
Affiliation(s)
- Surbhi Gupta
- Department of Computer Science and Engineering Department, SMVDU, Jammu, India
- Model Institute of Engineering and Technology, Jammu, India
| | - Manoj K. Gupta
- Department of Computer Science and Engineering Department, SMVDU, Jammu, India
| | - Mohammad Shabaz
- Model Institute of Engineering and Technology, Jammu, India
- *Correspondence: Mohammad Shabaz,
| | - Ashutosh Sharma
- School of Computer Science, University of Petroleum and Energy Studies, Dehradun, India
| |
Collapse
|
26
|
Syed AH, Khan T. Evolution of research trends in artificial intelligence for breast cancer diagnosis and prognosis over the past two decades: A bibliometric analysis. Front Oncol 2022; 12:854927. [PMID: 36267967 PMCID: PMC9578338 DOI: 10.3389/fonc.2022.854927] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2022] [Accepted: 08/30/2022] [Indexed: 01/27/2023] Open
Abstract
Objective In recent years, among the available tools, the concurrent application of Artificial Intelligence (AI) has improved the diagnostic performance of breast cancer screening. In this context, the present study intends to provide a comprehensive overview of the evolution of AI for breast cancer diagnosis and prognosis research using bibliometric analysis. Methodology Therefore, in the present study, relevant peer-reviewed research articles published from 2000 to 2021 were downloaded from the Scopus and Web of Science (WOS) databases and later quantitatively analyzed and visualized using Bibliometrix (R package). Finally, open challenges areas were identified for future research work. Results The present study revealed that the number of literature studies published in AI for breast cancer detection and survival prediction has increased from 12 to 546 between the years 2000 to 2021. The United States of America (USA), the Republic of China, and India are the most productive publication-wise in this field. Furthermore, the USA leads in terms of the total citations; however, hungry and Holland take the lead positions in average citations per year. Wang J is the most productive author, and Zhan J is the most relevant author in this field. Stanford University in the USA is the most relevant affiliation by the number of published articles. The top 10 most relevant sources are Q1 journals with PLOS ONE and computer in Biology and Medicine are the leading journals in this field. The most trending topics related to our study, transfer learning and deep learning, were identified. Conclusion The present findings provide insight and research directions for policymakers and academic researchers for future collaboration and research in AI for breast cancer patients.
Collapse
Affiliation(s)
- Asif Hassan Syed
- Department of Computer Science, Faculty of Computing and Information Technology Rabigh (FCITR), King Abdulaziz University, Jeddah, Saudi Arabia,*Correspondence: Asif Hassan Syed,
| | - Tabrej Khan
- Department of Information Systems, Faculty of Computing and Information Technology Rabigh (FCITR), King Abdulaziz University, Jeddah, Saudi Arabia
| |
Collapse
|
27
|
Sandarenu P, Millar EKA, Song Y, Browne L, Beretov J, Lynch J, Graham PH, Jonnagaddala J, Hawkins N, Huang J, Meijering E. Survival prediction in triple negative breast cancer using multiple instance learning of histopathological images. Sci Rep 2022; 12:14527. [PMID: 36008541 PMCID: PMC9411153 DOI: 10.1038/s41598-022-18647-1] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2022] [Accepted: 08/17/2022] [Indexed: 11/09/2022] Open
Abstract
Computational pathology is a rapidly expanding area for research due to the current global transformation of histopathology through the adoption of digital workflows. Survival prediction of breast cancer patients is an important task that currently depends on histopathology assessment of cancer morphological features, immunohistochemical biomarker expression and patient clinical findings. To facilitate the manual process of survival risk prediction, we developed a computational pathology framework for survival prediction using digitally scanned haematoxylin and eosin-stained tissue microarray images of clinically aggressive triple negative breast cancer. Our results show that the model can produce an average concordance index of 0.616. Our model predictions are analysed for independent prognostic significance in univariate analysis (hazard ratio = 3.12, 95% confidence interval [1.69,5.75], p < 0.005) and multivariate analysis using clinicopathological data (hazard ratio = 2.68, 95% confidence interval [1.44,4.99], p < 0.005). Through qualitative analysis of heatmaps generated from our model, an expert pathologist is able to associate tissue features highlighted in the attention heatmaps of high-risk predictions with morphological features associated with more aggressive behaviour such as low levels of tumour infiltrating lymphocytes, stroma rich tissues and high-grade invasive carcinoma, providing explainability of our method for triple negative breast cancer.
Collapse
Affiliation(s)
- Piumi Sandarenu
- School of Computer Science and Engineering, UNSW Sydney, Kensington, NSW, 2052, Australia
| | - Ewan K A Millar
- Department of Anatomical Pathology, NSW Health Pathology, St. George Hospital, Kogarah, NSW, 2217, Australia.,St. George and Sutherland Clinical School, UNSW Sydney, Kensington, NSW, 2052, Australia.,Faculty of Medicine and Health Sciences, Sydney Western University, Campbelltown, NSW, 2560, Australia.,University of Technology Sydney, Ultimo, NSW, 2007, Australia
| | - Yang Song
- School of Computer Science and Engineering, UNSW Sydney, Kensington, NSW, 2052, Australia
| | - Lois Browne
- Cancer Care Centre, St. George Hospital, Kogarah, NSW, 2217, Australia
| | - Julia Beretov
- Department of Anatomical Pathology, NSW Health Pathology, St. George Hospital, Kogarah, NSW, 2217, Australia.,St. George and Sutherland Clinical School, UNSW Sydney, Kensington, NSW, 2052, Australia.,Cancer Care Centre, St. George Hospital, Kogarah, NSW, 2217, Australia
| | - Jodi Lynch
- St. George and Sutherland Clinical School, UNSW Sydney, Kensington, NSW, 2052, Australia.,Cancer Care Centre, St. George Hospital, Kogarah, NSW, 2217, Australia
| | - Peter H Graham
- St. George and Sutherland Clinical School, UNSW Sydney, Kensington, NSW, 2052, Australia.,Cancer Care Centre, St. George Hospital, Kogarah, NSW, 2217, Australia
| | | | - Nicholas Hawkins
- School of Medical Sciences, UNSW Sydney, Kensington, NSW, 2052, Australia
| | - Junzhou Huang
- University of Texas at Arlington, Arlington, TX, 76019, USA
| | - Erik Meijering
- School of Computer Science and Engineering, UNSW Sydney, Kensington, NSW, 2052, Australia.
| |
Collapse
|
28
|
Hacking SM, Yakirevich E, Wang Y. From Immunohistochemistry to New Digital Ecosystems: A State-of-the-Art Biomarker Review for Precision Breast Cancer Medicine. Cancers (Basel) 2022; 14:cancers14143469. [PMID: 35884530 PMCID: PMC9315712 DOI: 10.3390/cancers14143469] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2022] [Revised: 07/13/2022] [Accepted: 07/15/2022] [Indexed: 02/04/2023] Open
Abstract
Simple Summary In this state-of-the-art breast biomarker review, we have tried to imagine and illustrate future, emerging digital breast cancer ecosystems which allow for greater incorporation of traditional immunohistochemical and molecular biomarkers, WSI, and radiomic features. Abstract Breast cancers represent complex ecosystem-like networks of malignant cells and their associated microenvironment. Estrogen receptor (ER), progesterone receptor (PR), and human epidermal growth factor receptor 2 (HER2) are biomarkers ubiquitous to clinical practice in evaluating prognosis and predicting response to therapy. Recent feats in breast cancer have led to a new digital era, and advanced clinical trials have resulted in a growing number of personalized therapies with corresponding biomarkers. In this state-of-the-art review, we included the latest 10-year updated recommendations for ER, PR, and HER2, along with the most salient information on tumor-infiltrating lymphocytes (TILs), Ki-67, PD-L1, and several prognostic/predictive biomarkers at genomic, transcriptomic, and proteomic levels recently developed for selection and optimization of breast cancer treatment. Looking forward, the multi-omic landscape of the tumor ecosystem could be integrated with computational findings from whole slide images and radiomics in predictive machine learning (ML) models. These are new digital ecosystems on the road to precision breast cancer medicine.
Collapse
Affiliation(s)
| | | | - Yihong Wang
- Correspondence: ; Tel.: +1-401-444-9897; Fax: +1-401-444-4377
| |
Collapse
|
29
|
Diagnostic Accuracy of Machine Learning Models on Mammography in Breast Cancer Classification: A Meta-Analysis. Diagnostics (Basel) 2022; 12:diagnostics12071643. [PMID: 35885548 PMCID: PMC9320089 DOI: 10.3390/diagnostics12071643] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2022] [Revised: 06/29/2022] [Accepted: 06/29/2022] [Indexed: 11/16/2022] Open
Abstract
In this meta-analysis, we aimed to estimate the diagnostic accuracy of machine learning models on digital mammograms and tomosynthesis in breast cancer classification and to assess the factors affecting its diagnostic accuracy. We searched for related studies in Web of Science, Scopus, PubMed, Google Scholar and Embase. The studies were screened in two stages to exclude the unrelated studies and duplicates. Finally, 36 studies containing 68 machine learning models were included in this meta-analysis. The area under the curve (AUC), hierarchical summary receiver operating characteristics (HSROC) curve, pooled sensitivity and pooled specificity were estimated using a bivariate Reitsma model. Overall AUC, pooled sensitivity and pooled specificity were 0.90 (95% CI: 0.85–0.90), 0.83 (95% CI: 0.78–0.87) and 0.84 (95% CI: 0.81–0.87), respectively. Additionally, the three significant covariates identified in this study were country (p = 0.003), source (p = 0.002) and classifier (p = 0.016). The type of data covariate was not statistically significant (p = 0.121). Additionally, Deeks’ linear regression test indicated that there exists a publication bias in the included studies (p = 0.002). Thus, the results should be interpreted with caution.
Collapse
|
30
|
Painuli D, Bhardwaj S, Köse U. Recent advancement in cancer diagnosis using machine learning and deep learning techniques: A comprehensive review. Comput Biol Med 2022; 146:105580. [PMID: 35551012 DOI: 10.1016/j.compbiomed.2022.105580] [Citation(s) in RCA: 33] [Impact Index Per Article: 16.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2022] [Revised: 04/14/2022] [Accepted: 04/30/2022] [Indexed: 02/07/2023]
Abstract
Being a second most cause of mortality worldwide, cancer has been identified as a perilous disease for human beings, where advance stage diagnosis may not help much in safeguarding patients from mortality. Thus, efforts to provide a sustainable architecture with proven cancer prevention estimate and provision for early diagnosis of cancer is the need of hours. Advent of machine learning methods enriched cancer diagnosis area with its overwhelmed efficiency & low error-rate then humans. A significant revolution has been witnessed in the development of machine learning & deep learning assisted system for segmentation & classification of various cancers during past decade. This research paper includes a review of various types of cancer detection via different data modalities using machine learning & deep learning-based methods along with different feature extraction techniques and benchmark datasets utilized in the recent six years studies. The focus of this study is to review, analyse, classify, and address the recent development in cancer detection and diagnosis of six types of cancers i.e., breast, lung, liver, skin, brain and pancreatic cancer, using machine learning & deep learning techniques. Various state-of-the-art technique are clustered into same group and results are examined through key performance indicators like accuracy, area under the curve, precision, sensitivity, dice score on benchmark datasets and concluded with future research work challenges.
Collapse
Affiliation(s)
- Deepak Painuli
- Department of Computer Science and Engineering, Gurukula Kangri Vishwavidyalaya, Haridwar, India.
| | - Suyash Bhardwaj
- Department of Computer Science and Engineering, Gurukula Kangri Vishwavidyalaya, Haridwar, India
| | - Utku Köse
- Department of Computer Engineering, Suleyman Demirel University, Isparta, Turkey
| |
Collapse
|
31
|
Dhiman P, Ma J, Andaur Navarro CL, Speich B, Bullock G, Damen JAA, Hooft L, Kirtley S, Riley RD, Van Calster B, Moons KGM, Collins GS. Methodological conduct of prognostic prediction models developed using machine learning in oncology: a systematic review. BMC Med Res Methodol 2022; 22:101. [PMID: 35395724 PMCID: PMC8991704 DOI: 10.1186/s12874-022-01577-x] [Citation(s) in RCA: 29] [Impact Index Per Article: 14.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2021] [Accepted: 03/18/2022] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND Describe and evaluate the methodological conduct of prognostic prediction models developed using machine learning methods in oncology. METHODS We conducted a systematic review in MEDLINE and Embase between 01/01/2019 and 05/09/2019, for studies developing a prognostic prediction model using machine learning methods in oncology. We used the Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD) statement, Prediction model Risk Of Bias ASsessment Tool (PROBAST) and CHecklist for critical Appraisal and data extraction for systematic Reviews of prediction Modelling Studies (CHARMS) to assess the methodological conduct of included publications. Results were summarised by modelling type: regression-, non-regression-based and ensemble machine learning models. RESULTS Sixty-two publications met inclusion criteria developing 152 models across all publications. Forty-two models were regression-based, 71 were non-regression-based and 39 were ensemble models. A median of 647 individuals (IQR: 203 to 4059) and 195 events (IQR: 38 to 1269) were used for model development, and 553 individuals (IQR: 69 to 3069) and 50 events (IQR: 17.5 to 326.5) for model validation. A higher number of events per predictor was used for developing regression-based models (median: 8, IQR: 7.1 to 23.5), compared to alternative machine learning (median: 3.4, IQR: 1.1 to 19.1) and ensemble models (median: 1.7, IQR: 1.1 to 6). Sample size was rarely justified (n = 5/62; 8%). Some or all continuous predictors were categorised before modelling in 24 studies (39%). 46% (n = 24/62) of models reporting predictor selection before modelling used univariable analyses, and common method across all modelling types. Ten out of 24 models for time-to-event outcomes accounted for censoring (42%). A split sample approach was the most popular method for internal validation (n = 25/62, 40%). Calibration was reported in 11 studies. Less than half of models were reported or made available. CONCLUSIONS The methodological conduct of machine learning based clinical prediction models is poor. Guidance is urgently needed, with increased awareness and education of minimum prediction modelling standards. Particular focus is needed on sample size estimation, development and validation analysis methods, and ensuring the model is available for independent validation, to improve quality of machine learning based clinical prediction models.
Collapse
Affiliation(s)
- Paula Dhiman
- Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford, OX3 7LD, UK.
- NIHR Oxford Biomedical Research Centre, Oxford University Hospitals NHS Foundation Trust, Oxford, UK.
| | - Jie Ma
- Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford, OX3 7LD, UK
| | - Constanza L Andaur Navarro
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
- Cochrane Netherlands, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
| | - Benjamin Speich
- Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford, OX3 7LD, UK
- Basel Institute for Clinical Epidemiology and Biostatistics, Department of Clinical Research, University Hospital Basel, University of Basel, Basel, Switzerland
| | - Garrett Bullock
- Nuffield Department of Orthopaedics, Rheumatology, and Musculoskeletal Sciences, University of Oxford, Oxford, UK
| | - Johanna A A Damen
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
- Cochrane Netherlands, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
| | - Lotty Hooft
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
- Cochrane Netherlands, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
| | - Shona Kirtley
- Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford, OX3 7LD, UK
| | - Richard D Riley
- Centre for Prognosis Research, School of Medicine, Keele University, Staffordshire, ST5 5BG, UK
| | - Ben Van Calster
- Department of Development and Regeneration, KU Leuven, Leuven, Belgium
- Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, the Netherlands
- EPI-centre, KU Leuven, Leuven, Belgium
| | - Karel G M Moons
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
- Cochrane Netherlands, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
| | - Gary S Collins
- Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford, OX3 7LD, UK
- NIHR Oxford Biomedical Research Centre, Oxford University Hospitals NHS Foundation Trust, Oxford, UK
| |
Collapse
|
32
|
Okagbue HI, Oguntunde PE, Adamu PI, Adejumo AO. Unique clusters of patterns of breast cancer survivorship. HEALTH AND TECHNOLOGY 2022. [DOI: 10.1007/s12553-021-00637-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
33
|
Performance evaluation of machine learning for breast cancer diagnosis: A case study. INFORMATICS IN MEDICINE UNLOCKED 2022. [DOI: 10.1016/j.imu.2022.101009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
|
34
|
Nwanosike EM, Conway BR, Merchant HA, Hasan SS. Potential applications and performance of machine learning techniques and algorithms in clinical practice: A systematic review. Int J Med Inform 2021; 159:104679. [PMID: 34990939 DOI: 10.1016/j.ijmedinf.2021.104679] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2021] [Revised: 12/08/2021] [Accepted: 12/27/2021] [Indexed: 12/11/2022]
Abstract
PURPOSE The advent of clinically adapted machine learning algorithms can solve numerous problems ranging from disease diagnosis and prognosis to therapy recommendations. This systematic review examines the performance of machine learning (ML) algorithms and evaluates the progress made to date towards their implementation in clinical practice. METHODS Systematic searching of databases (PubMed, MEDLINE, Scopus, Google Scholar, Cochrane Library and WHO Covid-19 database) to identify original articles published between January 2011 and October 2021. Studies reporting ML techniques in clinical practice involving humans and ML algorithms with a performance metric were considered. RESULTS Of 873 unique articles identified, 36 studies were eligible for inclusion. The XGBoost (extreme gradient boosting) algorithm showed the highest potential for clinical applications (n = 7 studies); this was followed jointly by random forest algorithm, logistic regression, and the support vector machine, respectively (n = 5 studies). Prediction of outcomes (n = 33), in particular Inflammatory diseases (n = 7) received the most attention followed by cancer and neuropsychiatric disorders (n = 5 for each) and Covid-19 (n = 4). Thirty-three out of the thirty-six included studies passed more than 50% of the selected quality assessment criteria in the TRIPOD checklist. In contrast, none of the studies could achieve an ideal overall bias rating of 'low' based on the PROBAST checklist. In contrast, only three studies showed evidence of the deployment of ML algorithm(s) in clinical practice. CONCLUSIONS ML is potentially a reliable tool for clinical decision support. Although advocated widely in clinical practice, work is still in progress to validate clinically adapted ML algorithms. Improving quality standards, transparency, and interpretability of ML models will further lower the barriers to acceptability.
Collapse
Affiliation(s)
- Ezekwesiri Michael Nwanosike
- Department of Pharmacy, School of Applied Sciences, University of Huddersfield, Queensgate Huddersfield HD1 3DH, West Yorkshire, United Kingdom
| | - Barbara R Conway
- Department of Pharmacy, School of Applied Sciences, University of Huddersfield, Queensgate Huddersfield HD1 3DH, West Yorkshire, United Kingdom
| | - Hamid A Merchant
- Department of Pharmacy, School of Applied Sciences, University of Huddersfield, Queensgate Huddersfield HD1 3DH, West Yorkshire, United Kingdom
| | - Syed Shahzad Hasan
- Department of Pharmacy, School of Applied Sciences, University of Huddersfield, Queensgate Huddersfield HD1 3DH, West Yorkshire, United Kingdom; School of Biomedical Sciences & Pharmacy, University of Newcastle, Callaghan, Australia.
| |
Collapse
|
35
|
Qurashi AA, Alanazi RK, Alhazmi YM, Almohammadi AS, Alsharif WM, Alshamrani KM. Saudi Radiology Personnel's Perceptions of Artificial Intelligence Implementation: A Cross-Sectional Study. J Multidiscip Healthc 2021; 14:3225-3231. [PMID: 34848967 PMCID: PMC8627310 DOI: 10.2147/jmdh.s340786] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2021] [Accepted: 10/29/2021] [Indexed: 12/15/2022] Open
Abstract
PURPOSE Artificial intelligence (AI) in radiology has been a subject of heated debate. The external perception is that algorithms and machines cannot offer better diagnosis than radiologists. Reluctance to implement AI maybe due to the opacity in how AI applications work and the challenging and lengthy validation process. In this study, Saudi radiology personnel's familiarity with AI applications and its usefulness in clinical practice were investigated. METHODS A cross-sectional study was conducted in Saudi Arabia among radiology personnel from March to April 2021. Radiology personnel nationwide were surveyed electronically using Google form. The questionnaire included 12-questions related to AI usefulness in clinical practice and participants' knowledge about AI and their acceptance level to learn and implement this technology into clinical practice. Participants' trust level was also measured; Kruskal-Wallis test was used to examine differences between groups. RESULTS A total of 224 respondents from various radiology-related occupations participated in the survey. The lowest trust level in AI applications was shown by radiologists (p = 0.033). Eighty-two percent of participants (n = 184) had never used AI in their departments. Most respondents (n = 160, 71.4%) reported lack of formal education regarding AI-based applications. Most participants (n = 214, 95.5%) showed strong interest in AI education and are willing to incorporate it into the clinical practice of radiology. Almost half of radiography students (22/46, 47.8%) believe that their job might be at risk due to AI application (p = 0.038). CONCLUSION Radiology personnel's knowledge of AI has a significant impact on their willingness to learn, use and adapt this technology in clinical practice. Participants demonstrated a positive attitude towards AI, showed a reasonable understanding and are highly motivated to learn and incorporate it into clinical practice. Some participants felt that their jobs were threatened by AI adaptation, but this belief might change with good training and education programmes.
Collapse
Affiliation(s)
- Abdulaziz A Qurashi
- Diagnostic Radiology Technology Department, College of Applied Medical Sciences, Taibah University, Madinah, Saudi Arabia
| | - Rashed K Alanazi
- Diagnostic Radiology Technology Department, College of Applied Medical Sciences, Taibah University, Madinah, Saudi Arabia
| | - Yasser M Alhazmi
- Diagnostic Radiology Technology Department, College of Applied Medical Sciences, Taibah University, Madinah, Saudi Arabia
| | - Ahmed S Almohammadi
- Diagnostic Radiology Technology Department, College of Applied Medical Sciences, Taibah University, Madinah, Saudi Arabia
| | - Walaa M Alsharif
- Diagnostic Radiology Technology Department, College of Applied Medical Sciences, Taibah University, Madinah, Saudi Arabia
| | - Khalid M Alshamrani
- College of Applied Medical Sciences, King Saud bin Abdulaziz University for Health Sciences, Jeddah, Saudi Arabia
- King Abdullah International Medical Research Center, Jeddah, Saudi Arabia
| |
Collapse
|
36
|
Machine learning predictive models for acute pancreatitis: A systematic review. Int J Med Inform 2021; 157:104641. [PMID: 34785488 DOI: 10.1016/j.ijmedinf.2021.104641] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2021] [Revised: 11/03/2021] [Accepted: 11/05/2021] [Indexed: 12/14/2022]
Abstract
INTRODUCTION Acute pancreatitis (AP) is a common clinical pancreatic disease. Patients with different severity levels have different clinical outcomes. With the advantages of algorithms, machine learning (ML) has gradually emerged in the field of disease prediction, assisting doctors in decision-making. METHODS A systematic review was conducted using the PubMed, Web of Science, Scopus, and Embase databases, following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses guidelines. Publication time was limited from inception to 29 May 2021. Studies that have used ML to establish predictive tools for AP were eligible for inclusion. Quality assessment of the included studies was conducted in accordance with the IJMEDI checklist. RESULTS In this systematic review, 24 of 2,913 articles, with a total of 8,327 patients and 47 models, were included. The studies could be divided into five categories: 10 studies (42%) reported severity prediction; 10 studies (42%), complication prediction; 3 studies (13%), mortality prediction; 2 studies (8%), recurrence prediction; and 2 studies (8%), surgery timing prediction. ML showed great accuracy in several prediction tasks. However, most of the included studies were retrospective in nature, conducted at a single centre, based on database data, and lacked external validation. According to the IJMEDI checklist and our scoring criteria, two studies were considered to be of high quality. Most studies had an obvious bias in the quality of data preparation, validation, and deployment dimensions. CONCLUSION In the prediction tasks for AP, ML has shown great potential in assisting decision-making. However, the existing studies still have some deficiencies in the process of model construction. Future studies need to optimize the deficiencies and further evaluate the comparability of the ML systems and model performance, so as to consequently develop high-quality ML-based models that can be used in clinical practice.
Collapse
|
37
|
Foo LL, Ng WY, Lim GYS, Tan TE, Ang M, Ting DSW. Artificial intelligence in myopia: current and future trends. Curr Opin Ophthalmol 2021; 32:413-424. [PMID: 34310401 DOI: 10.1097/icu.0000000000000791] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
PURPOSE OF REVIEW Myopia is one of the leading causes of visual impairment, with a projected increase in prevalence globally. One potential approach to address myopia and its complications is early detection and treatment. However, current healthcare systems may not be able to cope with the growing burden. Digital technological solutions such as artificial intelligence (AI) have emerged as a potential adjunct for myopia management. RECENT FINDINGS There are currently four significant domains of AI in myopia, including machine learning (ML), deep learning (DL), genetics and natural language processing (NLP). ML has been demonstrated to be a useful adjunctive for myopia prediction and biometry for cataract surgery in highly myopic individuals. DL techniques, particularly convoluted neural networks, have been applied to various image-related diagnostic and predictive solutions. Applications of AI in genomics and NLP appear to be at a nascent stage. SUMMARY Current AI research is mainly focused on disease classification and prediction in myopia. Through greater collaborative research, we envision AI will play an increasingly critical role in big data analysis by aggregating a greater variety of parameters including genomics and environmental factors. This may enable the development of generalizable adjunctive DL systems that could help realize predictive and individualized precision medicine for myopic patients.
Collapse
Affiliation(s)
- Li Lian Foo
- Singapore National Eye Centre, Singapore Eye Research Institute
- Duke-NUS Medical School, National University of Singapore, Singapore, Singapore
| | - Wei Yan Ng
- Singapore National Eye Centre, Singapore Eye Research Institute
- Duke-NUS Medical School, National University of Singapore, Singapore, Singapore
| | | | - Tien-En Tan
- Singapore National Eye Centre, Singapore Eye Research Institute
| | - Marcus Ang
- Singapore National Eye Centre, Singapore Eye Research Institute
- Duke-NUS Medical School, National University of Singapore, Singapore, Singapore
| | - Daniel Shu Wei Ting
- Singapore National Eye Centre, Singapore Eye Research Institute
- Duke-NUS Medical School, National University of Singapore, Singapore, Singapore
| |
Collapse
|
38
|
Ding D, Lang T, Zou D, Tan J, Chen J, Zhou L, Wang D, Li R, Li Y, Liu J, Ma C, Zhou Q. Machine learning-based prediction of survival prognosis in cervical cancer. BMC Bioinformatics 2021; 22:331. [PMID: 34134623 PMCID: PMC8207793 DOI: 10.1186/s12859-021-04261-x] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2020] [Accepted: 06/11/2021] [Indexed: 01/13/2023] Open
Abstract
BACKGROUND Accurately forecasting the prognosis could improve cervical cancer management, however, the currently used clinical features are difficult to provide enough information. The aim of this study is to improve forecasting capability by developing a miRNAs-based machine learning survival prediction model. RESULTS The expression characteristics of miRNAs were chosen as features for model development. The cervical cancer miRNA expression data was obtained from The Cancer Genome Atlas database. Preprocessing, including unquantified data removal, missing value imputation, samples normalization, log transformation, and feature scaling, was performed. In total, 42 survival-related miRNAs were identified by Cox Proportional-Hazards analysis. The patients were optimally clustered into four groups with three different 5-years survival outcome (≥ 90%, ≈ 65%, ≤ 40%) by K-means clustering algorithm base on top 10 survival-related miRNAs. According to the K-means clustering result, a prediction model with high performance was established. The pathways analysis indicated that the miRNAs used play roles involved in the regulation of cancer stem cells. CONCLUSION A miRNAs-based machine learning cervical cancer survival prediction model was developed that robustly stratifies cervical cancer patients into high survival rate (5-years survival rate ≥ 90%), moderate survival rate (5-years survival rate ≈ 65%), and low survival rate (5-years survival rate ≤ 40%).
Collapse
Affiliation(s)
- Dongyan Ding
- Key Laboratory of Biorheological Science and Technology (Chongqing University), Ministry of Education, Bioengineering College, Chongqing University, Chongqing, 400044, People's Republic of China
- Department of Gynecologic Oncology, School of Medicine, Chongqing University Cancer Hospital, , Chongqing University, Chongqing, 400030, People's Republic of China
| | - Tingyuan Lang
- Key Laboratory of Biorheological Science and Technology (Chongqing University), Ministry of Education, Bioengineering College, Chongqing University, Chongqing, 400044, People's Republic of China.
- Department of Gynecologic Oncology, School of Medicine, Chongqing University Cancer Hospital, , Chongqing University, Chongqing, 400030, People's Republic of China.
- Chongqing Key Laboratory of Translational Research for Cancer Metastasis and Individualized Treatment, School of Medicine, Chongqing University Cancer Hospital, Chongqing University, Chongqing, 400030, People's Republic of China.
| | - Dongling Zou
- Department of Gynecologic Oncology, School of Medicine, Chongqing University Cancer Hospital, , Chongqing University, Chongqing, 400030, People's Republic of China
| | - Jiawei Tan
- School of Mathematics and Statistics, Changchun University of Technology, Changchun, 130012, People's Republic of China
| | - Jia Chen
- School of Mathematics and Statistics, Changchun University of Technology, Changchun, 130012, People's Republic of China
| | - Lei Zhou
- Singapore Eye Research Institute, The academia, 20 College Road, Discovery Tower Level 6, Singapore, 169856, Singapore
- Department of Ophthalmology, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
- Duke-NUS Medical School, Ophthalmology and Visual Sciences Academic Clinical Research Program, National University of Singapore, Singapore, Singapore
| | - Dong Wang
- Department of Gynecologic Oncology, School of Medicine, Chongqing University Cancer Hospital, , Chongqing University, Chongqing, 400030, People's Republic of China
| | - Rong Li
- Department of Gynecologic Oncology, School of Medicine, Chongqing University Cancer Hospital, , Chongqing University, Chongqing, 400030, People's Republic of China
| | - Yunzhe Li
- Key Laboratory of Biorheological Science and Technology (Chongqing University), Ministry of Education, Bioengineering College, Chongqing University, Chongqing, 400044, People's Republic of China
- Department of Gynecologic Oncology, School of Medicine, Chongqing University Cancer Hospital, , Chongqing University, Chongqing, 400030, People's Republic of China
| | - Jingshu Liu
- Key Laboratory of Biorheological Science and Technology (Chongqing University), Ministry of Education, Bioengineering College, Chongqing University, Chongqing, 400044, People's Republic of China
- Department of Gynecologic Oncology, School of Medicine, Chongqing University Cancer Hospital, , Chongqing University, Chongqing, 400030, People's Republic of China
| | - Cui Ma
- Department of Pediatric Hematology, First Hospital of Jilin University, Changchun, 130023, Jilin, People's Republic of China
| | - Qi Zhou
- Key Laboratory of Biorheological Science and Technology (Chongqing University), Ministry of Education, Bioengineering College, Chongqing University, Chongqing, 400044, People's Republic of China.
- Department of Gynecologic Oncology, School of Medicine, Chongqing University Cancer Hospital, , Chongqing University, Chongqing, 400030, People's Republic of China.
- Chongqing Key Laboratory of Translational Research for Cancer Metastasis and Individualized Treatment, School of Medicine, Chongqing University Cancer Hospital, Chongqing University, Chongqing, 400030, People's Republic of China.
| |
Collapse
|