1
|
Zabaleta J, Aguinagalde B, Lopez I, Fernandez-Monge A, Lizarbe JA, Mainer M, Ferrer-Bonsoms JA, de Assas M. Utility of Artificial Intelligence for Decision Making in Thoracic Multidisciplinary Tumor Boards. J Clin Med 2025; 14:399. [PMID: 39860405 PMCID: PMC11765867 DOI: 10.3390/jcm14020399] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2024] [Revised: 12/28/2024] [Accepted: 01/07/2025] [Indexed: 01/27/2025] Open
Abstract
Background/Objectives: The aim of this study was to analyze whether the implementation of artificial intelligence (AI), specifically the Natural Language Processing (NLP) branch developed by OpenAI, could help a thoracic multidisciplinary tumor board (MTB) make decisions if provided with all of the patient data presented to the committee and supported by accepted clinical practice guidelines. Methods: This is a retrospective comparative study. The inclusion criteria were defined as all patients who presented at the thoracic MTB with a suspicious or first diagnosis of non-small-cell lung cancer between January 2023 and June 2023. Intervention: GPT 3.5 turbo chat was used, providing the clinical case summary presented in committee proceedings and the latest SEPAR lung cancer treatment guidelines. The application was asked to issue one of the following recommendations: follow-up, surgery, chemotherapy, radiotherapy, or chemoradiotherapy. Statistical analysis: A concordance analysis was performed by measuring the Kappa coefficient to evaluate the consistency between the results of the AI and the committee's decision. Results: Fifty-two patients were included in the study. The AI had an overall concordance of 76%, with a Kappa index of 0.59 and a consistency and replicability of 92.3% for the patients in whom it recommended surgery (after repeating the cases four times). Conclusions: AI is an interesting tool which could help in decision making in MTBs.
Collapse
Affiliation(s)
- Jon Zabaleta
- Department of Thoracic Surgery, Basque Health Service, Donostialdea Integrated Health Organisation, 20014 San Sebastian, Spain; (B.A.)
| | - Borja Aguinagalde
- Department of Thoracic Surgery, Basque Health Service, Donostialdea Integrated Health Organisation, 20014 San Sebastian, Spain; (B.A.)
| | - Iker Lopez
- Department of Thoracic Surgery, Basque Health Service, Donostialdea Integrated Health Organisation, 20014 San Sebastian, Spain; (B.A.)
| | - Arantza Fernandez-Monge
- Department of Thoracic Surgery, Basque Health Service, Donostialdea Integrated Health Organisation, 20014 San Sebastian, Spain; (B.A.)
| | - Jon A. Lizarbe
- Department of Thoracic Surgery, Basque Health Service, Donostialdea Integrated Health Organisation, 20014 San Sebastian, Spain; (B.A.)
| | - Maria Mainer
- Department of Thoracic Surgery, Basque Health Service, Donostialdea Integrated Health Organisation, 20014 San Sebastian, Spain; (B.A.)
| | | | - Mateo de Assas
- Tecnun, School of Engineering, University of Navarra, 20018 San Sebastian, Spain
| |
Collapse
|
2
|
Chowa SS, Bhuiyan MRI, Tahosin MS, Karim A, Montaha S, Hassan MM, Shah MA, Azam S. An automated privacy-preserving self-supervised classification of COVID-19 from lung CT scan images minimizing the requirements of large data annotation. Sci Rep 2025; 15:226. [PMID: 39747232 PMCID: PMC11696874 DOI: 10.1038/s41598-024-83972-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2024] [Accepted: 12/18/2024] [Indexed: 01/04/2025] Open
Abstract
This study presents a novel privacy-preserving self-supervised (SSL) framework for COVID-19 classification from lung CT scans, utilizing federated learning (FL) enhanced with Paillier homomorphic encryption (PHE) to prevent third-party attacks during training. The FL-SSL based framework employs two publicly available lung CT scan datasets which are considered as labeled and an unlabeled dataset. The unlabeled dataset is split into three subsets which are assumed to be collected from three hospitals. Training is done using the Bootstrap Your Own Latent (BYOL) contrastive learning SSL framework with a VGG19 encoder followed by attention CNN blocks (VGG19 + attention CNN). The input datasets are processed by selecting the largest lung portion of each lung CT scan using an automated selection approach and a 64 × 64 input size is utilized to reduce computational complexity. Healthcare privacy issues are addressed by collaborative training across decentralized datasets and secure aggregation with PHE, underscoring the effectiveness of this approach. Three subsets of the dataset are used to train the local BYOL model, which together optimizes the central encoder. The labeled dataset is employed to train the central encoder (updated VGG19 + attention CNN), resulting in an accuracy of 97.19%, a precision of 97.43%, and a recall of 98.18%. The reliability of the framework's performance is demonstrated through statistical analysis and five-fold cross-validation. The efficacy of the proposed framework is further showcased by showing its performance on three distinct modality datasets: skin cancer, breast cancer, and chest X-rays. In conclusion, this study offers a promising solution for accurate diagnosis of chest X-rays, preserving privacy and overcoming the challenges of dataset scarcity and computational complexity.
Collapse
Affiliation(s)
- Sadia Sultana Chowa
- Faculty of Science and Technology, Charles Darwin University, Casuarina, NT, 0909, Australia
- Health Informatics Research Laboratory (HIRL), Department of Computer Science and Engineering, Daffodil International University, Dhaka, 1341, Bangladesh
| | - Md Rahad Islam Bhuiyan
- Faculty of Science and Technology, Charles Darwin University, Casuarina, NT, 0909, Australia
- Health Informatics Research Laboratory (HIRL), Department of Computer Science and Engineering, Daffodil International University, Dhaka, 1341, Bangladesh
| | - Mst Sazia Tahosin
- Health Informatics Research Laboratory (HIRL), Department of Computer Science and Engineering, Daffodil International University, Dhaka, 1341, Bangladesh
| | - Asif Karim
- Faculty of Science and Technology, Charles Darwin University, Casuarina, NT, 0909, Australia
| | - Sidratul Montaha
- Department of Computer Science, University of Calgary, Calgary, Canada
| | - Md Mehedi Hassan
- Computer Science and Engineering Discipline, Khulna University, Khulna, 9208, Bangladesh
| | - Mohd Asif Shah
- Department of Economics, Bakhtar University, Kart-e-Char, Kabul, 1001, Afghanistan.
- Centre for Research Impact & Outcome, Chitkara University Institute of Engineering and Technology, Chitkara University, Rajpura, Punjab, 140401, India.
- Chitkara Centre for Research and Development, Chitkara University, Baddi, Himachal Pradesh, 174103, India.
- Division of Research and Development, Lovely Professional University, Phagwara, Punjab, 144001, India.
| | - Sami Azam
- Faculty of Science and Technology, Charles Darwin University, Casuarina, NT, 0909, Australia
| |
Collapse
|
3
|
Kumar KN, Mohan CK, Cenkeramaddi LR, Awasthi N. Minimal data poisoning attack in federated learning for medical image classification: An attacker perspective. Artif Intell Med 2025; 159:103024. [PMID: 39591879 DOI: 10.1016/j.artmed.2024.103024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2023] [Revised: 05/21/2024] [Accepted: 11/15/2024] [Indexed: 11/28/2024]
Abstract
The privacy-sensitive nature of medical image data is often bounded by strict data sharing regulations that necessitate the need for novel modeling and analysis techniques. Federated learning (FL) enables multiple medical institutions to collectively train a deep neural network without sharing sensitive patient information. In addition, FL uses its collaborative approach to address challenges related to the scarcity and non-uniform distribution of heterogeneous medical domain data. Nevertheless, the data-opaque nature and distributed setup make FL susceptible to data poisoning attacks. There are diverse FL data poisoning attacks for classification models on natural image data in the literature. But their primary focus is on the impact of the attack and they do not consider the attack budget and attack visibility. The attack budget is essential for adversaries to optimize resource utilization in real-world scenarios, which determines the number of manipulations or perturbations they can apply. Simultaneously, attack visibility is crucial to ensure covert execution, allowing attackers to achieve their objectives without triggering detection mechanisms. Generally, an attacker's aim is to create maximum attack impact with minimal resources and low visibility. So, considering these three entities can effectively comprehend the adversary's perspective in designing an attack for real-world scenarios. Further, data poisoning attacks on medical images are challenging compared to natural images due to the subjective nature of medical data. Hence, we develop an attack with a low budget, low visibility, and high impact for medical image classification in FL. We propose a federated learning attention guided minimal attack (FL-AGMA), that uses class attention maps to identify specific medical image regions for perturbation. We introduce image distortion degree (IDD) as a metric to assess the attack budget. Also, we develop a feedback mechanism to regulate the attack coefficient for low attack visibility. Later, we optimize the attack budget by adaptively changing the IDD based on attack visibility. We extensively evaluate three large-scale datasets, namely, Covid-chestxray, Camelyon17, and HAM10000, covering three different data modalities. We observe that our FL-AGMA method has resulted in 44.49% less test accuracy with only 24% of IDD attack budget and lower attack visibility compared to the other attacks.
Collapse
Affiliation(s)
- K Naveen Kumar
- Department of Computer Science and Engineering, Indian Institute of Technology Hyderabad (IITH), Hyderabad, 502284, India.
| | - C Krishna Mohan
- Department of Computer Science and Engineering, Indian Institute of Technology Hyderabad (IITH), Hyderabad, 502284, India.
| | | | - Navchetan Awasthi
- Department of Faculty of Science, Mathematics and Computer Science, Informatics Institute, University of Amsterdam, 1090 GH Amsterdam, The Netherlands; Department of Biomedical Engineering and Physics, Amsterdam UMC, Amsterdam, The Netherlands.
| |
Collapse
|
4
|
Rai HM, Yoo J, Razaque A. A depth analysis of recent innovations in non-invasive techniques using artificial intelligence approach for cancer prediction. Med Biol Eng Comput 2024; 62:3555-3580. [PMID: 39012415 DOI: 10.1007/s11517-024-03158-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2024] [Accepted: 06/22/2024] [Indexed: 07/17/2024]
Abstract
The fight against cancer, a relentless global health crisis, emphasizes the urgency for efficient and automated early detection methods. To address this critical need, this review assesses recent advances in non-invasive cancer prediction techniques, comparing conventional machine learning (CML) and deep neural networks (DNNs). Focusing on these seven major cancers, we analyze 310 publications spanning the years 2018 to 2024, focusing on detection accuracy as the key metric to identify the most effective predictive models, highlighting critical gaps in current methodologies, and suggesting directions for future research. We further delved into factors like datasets, features, and modalities to gain a comprehensive understanding of each approach's performance. Separate review tables for each cancer type and approach facilitated comparisons between top performers (accuracy exceeding 99%) and low performers (65.83 to 85.8%). Our exploration of public databases and commonly used classifiers revealed that optimal combinations of features, datasets, and models can achieve up to 100% accuracy for both CML and DNN. However, significant variations in accuracy (up to 35%) were observed, particularly when optimization was lacking. Notably, colorectal cancer exhibited the lowest accuracy (DNN 69%, CML 65.83%). A five-point comparative analysis (best/worst models, performance gap, average accuracy, and research trends) revealed that while DNN research is gaining momentum, CML approaches remain competitive, even outperforming DNN in some cases. This study presents an in-depth comparative analysis of CML and DNN techniques for cancer detection. This knowledge can inform future research directions and contribute to the development of increasingly accurate and reliable cancer detection tools.
Collapse
Affiliation(s)
- Hari Mohan Rai
- School of Computing, Gachon University, 1342 Seongnam-daero, Sujeong-Gu, Seongnam-Si, 13120, Gyeonggi-Do, Republic of Korea.
| | - Joon Yoo
- School of Computing, Gachon University, 1342 Seongnam-daero, Sujeong-Gu, Seongnam-Si, 13120, Gyeonggi-Do, Republic of Korea
| | - Abdul Razaque
- Department of Cyber Security, Information Processing and Storage, Satbayev University, Almaty, Kazakhstan
| |
Collapse
|
5
|
Düsing C, Cimiano P, Rehberg S, Scherer C, Kaup O, Köster C, Hellmich S, Herrmann D, Meier KL, Claßen S, Borgstedt R. Integrating federated learning for improved counterfactual explanations in clinical decision support systems for sepsis therapy. Artif Intell Med 2024; 157:102982. [PMID: 39277983 DOI: 10.1016/j.artmed.2024.102982] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2023] [Revised: 08/29/2024] [Accepted: 09/01/2024] [Indexed: 09/17/2024]
Abstract
In recent years, we have witnessed both artificial intelligence obtaining remarkable results in clinical decision support systems (CDSSs) and explainable artificial intelligence (XAI) improving the interpretability of these models. In turn, this fosters the adoption by medical personnel and improves trustworthiness of CDSSs. Among others, counterfactual explanations prove to be one such XAI technique particularly suitable for the healthcare domain due to its ease of interpretation, even for less technically proficient staff. However, the generation of high-quality counterfactuals relies on generative models for guidance. Unfortunately, training such models requires a huge amount of data that is beyond the means of ordinary hospitals. In this paper, we therefore propose to use federated learning to allow multiple hospitals to jointly train such generative models while maintaining full data privacy. We demonstrate the superiority of our approach compared to locally generated counterfactuals. Moreover, we prove that generative models for counterfactual generation that are trained using federated learning in a suitable environment perform only marginally worse compared to centrally trained ones while offering the benefit of data privacy preservation. Finally, we integrate our method into a prototypical CDSS for treatment recommendation for sepsis patients, thus providing a proof of concept for real-world application as well as insights and sanity checks from clinical application.
Collapse
Affiliation(s)
- Christoph Düsing
- Center for Cognitive Interaction Technology, Bielefeld University, Inspiration 1, Bielefeld, 33619, Germany.
| | - Philipp Cimiano
- Center for Cognitive Interaction Technology, Bielefeld University, Inspiration 1, Bielefeld, 33619, Germany.
| | - Sebastian Rehberg
- Department of Anaesthesiology, Intensive Care, Emergency Medicine, Transfusion Medicine and Pain Therapy, University Hospital OWL, Campus Bielefeld-Bethel, Protestant Hospital of the Bethel Foundation, Burgsteig 13, Bielefeld, 33617, Germany.
| | - Christiane Scherer
- Institute of Laboratory Medicine, Microbiology and Hygiene, University Hospital OWL, Campus Bielefeld-Bethel, Protestant Hospital of the Bethel Foundation, Burgsteig 13, Bielefeld, 33617, Germany.
| | - Olaf Kaup
- Institute of Laboratory Medicine, Microbiology and Transfusion Medicine, University Hospital OWL, Campus Bielefeld Hospital, Teutoburger Straße 50, Bielefeld, 33604, Germany.
| | - Christiane Köster
- University Clinic for Cardiology and Internal Intensive Care Medicine, University Hospital OWL, Campus Bielefeld Hospital, Teutoburger Straße 50, Bielefeld, 33604, Germany.
| | - Stefan Hellmich
- Department of Anesthesiology, Surgical Intensive Care Medicine, Emergency Medicine and Pain Therapy, University Hospital OWL, Campus Bielefeld Hospital, Teutoburger Straße 50, Bielefeld, 33604, Germany.
| | - Daniel Herrmann
- Department of Anesthesiology, Surgical Intensive Care Medicine, Emergency Medicine and Pain Therapy, University Hospital OWL, Campus Bielefeld Hospital, Teutoburger Straße 50, Bielefeld, 33604, Germany.
| | - Kirsten Laura Meier
- Department of Anesthesiology, Surgical Intensive Care Medicine, Emergency Medicine and Pain Therapy, University Hospital OWL, Campus Bielefeld Hospital, Teutoburger Straße 50, Bielefeld, 33604, Germany.
| | - Simon Claßen
- Department of Anesthesiology, Surgical Intensive Care Medicine, Emergency Medicine and Pain Therapy, University Hospital OWL, Campus Bielefeld Hospital, Teutoburger Straße 50, Bielefeld, 33604, Germany.
| | - Rainer Borgstedt
- Department of Anaesthesiology, Intensive Care, Emergency Medicine, Transfusion Medicine and Pain Therapy, University Hospital OWL, Campus Bielefeld-Bethel, Protestant Hospital of the Bethel Foundation, Burgsteig 13, Bielefeld, 33617, Germany.
| |
Collapse
|
6
|
Eliwa EHI, Mohamed El Koshiry A, Abd El-Hafeez T, Omar A. Secure and Transparent Lung and Colon Cancer Classification Using Blockchain and Microsoft Azure. Adv Respir Med 2024; 92:395-420. [PMID: 39452059 PMCID: PMC11505339 DOI: 10.3390/arm92050037] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2024] [Revised: 10/07/2024] [Accepted: 10/15/2024] [Indexed: 10/26/2024]
Abstract
BACKGROUND The global healthcare system faces challenges in diagnosing and managing lung and colon cancers, which are significant health burdens. Traditional diagnostic methods are inefficient and prone to errors, while data privacy and security concerns persist. OBJECTIVE This study aims to develop a secure and transparent framework for remote consultation and classification of lung and colon cancer, leveraging blockchain technology and Microsoft Azure cloud services. Dataset and Features: The framework utilizes the LC25000 dataset, containing 25,000 histopathological images, for training and evaluating advanced machine learning models. Key features include secure data upload, anonymization, encryption, and controlled access via blockchain and Azure services. METHODS The proposed framework integrates Microsoft Azure's cloud services with a permissioned blockchain network. Patients upload CT scans through a mobile app, which are then preprocessed, anonymized, and stored securely in Azure Blob Storage. Blockchain smart contracts manage data access, ensuring only authorized specialists can retrieve and analyze the scans. Azure Machine Learning is used to train and deploy state-of-the-art machine learning models for cancer classification. Evaluation Metrics: The framework's performance is evaluated using metrics such as accuracy, precision, recall, and F1-score, demonstrating the effectiveness of the integrated approach in enhancing diagnostic accuracy and data security. RESULTS The proposed framework achieves an impressive accuracy of 100% for lung and colon cancer classification using DenseNet, ResNet50, and MobileNet models with different split ratios (70-30, 80-20, 90-10). The F1-score and k-fold cross-validation accuracy (5-fold and 10-fold) also demonstrate exceptional performance, with values exceeding 99.9%. Real-time notifications and secure remote consultations enhance the efficiency and transparency of the diagnostic process, contributing to better patient outcomes and streamlined cancer care management.
Collapse
Affiliation(s)
- Entesar Hamed I. Eliwa
- Department of Mathematics and Statistics, College of Science, King Faisal University, P.O. Box 400, Al-Ahsa 31982, Saudi Arabia
- Department of Computer Science, Faculty of Science, Minia University, El Minia 61519, Egypt
| | - Amr Mohamed El Koshiry
- Department of Curricula and Teaching Methods, College of Education, King Faisal University, P.O. Box 400, Al-Ahsa 31982, Saudi Arabia;
- Faculty of Specific Education, Minia University, El-Minia 61519, Egypt
| | - Tarek Abd El-Hafeez
- Department of Computer Science, Faculty of Science, Minia University, El Minia 61519, Egypt
- Computer Science Unit, Deraya University, El-Minia 61765, Egypt
| | - Ahmed Omar
- Department of Computer Science, Faculty of Science, Minia University, El Minia 61519, Egypt
| |
Collapse
|
7
|
J M, K J. Enhancing Lung Nodule Classification: A Novel CViEBi-CBGWO Approach with Integrated Image Preprocessing. JOURNAL OF IMAGING INFORMATICS IN MEDICINE 2024; 37:2108-2125. [PMID: 38526706 PMCID: PMC11522259 DOI: 10.1007/s10278-024-01074-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/22/2023] [Revised: 02/16/2024] [Accepted: 03/01/2024] [Indexed: 03/27/2024]
Abstract
Cancer detection and accurate classification pose significant challenges for medical professionals, as it is described as a lethal illness. Diagnosing the malignant lung nodules in its initial stage significantly enhances the recovery and survival rates. Therefore, a novel model named convolutional vision Elman bidirectional-based crossover boosted grey wolf optimization (CViEBi-CBGWO) has been proposed to enhance classification accuracy. CT images selected for further preprocessing are obtained from the LUNA16 dataset and LIDC-IDRI dataset. The data undergoes preprocessing phases involving normalization, data augmentation, and filtering to improve the generalization ability as well as image quality. The local features within the preprocessed images are extracted by implementing the convolutional neural network (CNN). For extracting the global features within the preprocessed images, the vision transformer (ViT) model consists of five encoder blocks. The attained local and global features are combined to generate the feature map. The Elman bidirectional long short-term memory (EBiLSTM) model is applied to categorize the generated feature map as benign and malignant. The crossover operation is integrated with the grey wolf optimization (GWO) algorithm, and the combined form of CBGWO fine-tunes the parameters of the CViEBi model, eliminating the problem of local optima. Experimental validation is conducted using various evaluation measures to assess effectiveness. Comparative analysis demonstrates a superior classification accuracy of 98.72% in the proposed method compared to existing methods.
Collapse
Affiliation(s)
- Manikandan J
- Department of Information Technology, St. Joseph's College of Engineering, Chennai, India.
| | - Jayashree K
- Department of Artificial Intelligence and Data Science, Panimalar Engineering College, Chennai, India
| |
Collapse
|
8
|
Liu Y, Huang J, Chen JC, Chen W, Pan Y, Qiu J. Predicting treatment response in multicenter non-small cell lung cancer patients based on federated learning. BMC Cancer 2024; 24:688. [PMID: 38840081 PMCID: PMC11155008 DOI: 10.1186/s12885-024-12456-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2024] [Accepted: 05/30/2024] [Indexed: 06/07/2024] Open
Abstract
BACKGROUND Multicenter non-small cell lung cancer (NSCLC) patient data is information-rich. However, its direct integration becomes exceptionally challenging due to constraints involving different healthcare organizations and regulations. Traditional centralized machine learning methods require centralizing these sensitive medical data for training, posing risks of patient privacy leakage and data security issues. In this context, federated learning (FL) has attracted much attention as a distributed machine learning framework. It effectively addresses this contradiction by preserving data locally, conducting local model training, and aggregating model parameters. This approach enables the utilization of multicenter data with maximum benefit while ensuring privacy safeguards. Based on pre-radiotherapy planning target volume images of NSCLC patients, a multicenter treatment response prediction model is designed by FL for predicting the probability of remission of NSCLC patients. This approach ensures medical data privacy, high prediction accuracy and computing efficiency, offering valuable insights for clinical decision-making. METHODS We retrospectively collected CT images from 245 NSCLC patients undergoing chemotherapy and radiotherapy (CRT) in four Chinese hospitals. In a simulation environment, we compared the performance of the centralized deep learning (DL) model with that of the FL model using data from two sites. Additionally, due to the unavailability of data from one hospital, we established a real-world FL model using data from three sites. Assessments were conducted using measures such as accuracy, receiver operating characteristic curve, and confusion matrices. RESULTS The model's prediction performance obtained using FL methods outperforms that of traditional centralized learning methods. In the comparative experiment, the DL model achieves an AUC of 0.718/0.695, while the FL model demonstrates an AUC of 0.725/0.689, with real-world FL model achieving an AUC of 0.698/0.672. CONCLUSIONS We demonstrate that the performance of a FL predictive model, developed by combining convolutional neural networks (CNNs) with data from multiple medical centers, is comparable to that of a traditional DL model obtained through centralized training. It can efficiently predict CRT treatment response in NSCLC patients while preserving privacy.
Collapse
Affiliation(s)
- Yuan Liu
- School of Radiology, Shandong First Medical University and Shandong Academy of Medical Sciences, Taian, China
| | - Jinzao Huang
- Department of Radiology, Cathay General Hospital, Taipei, China
- Department of Biomedical Imaging and Radiological Sciences, National Yang-Ming Chiao- Tung University, Taipei, China
| | - Jyh-Cheng Chen
- Department of Biomedical Imaging and Radiological Sciences, National Yang-Ming Chiao- Tung University, Taipei, China
- Department of Biomedical Imaging and Radiological Science, China Medical University, Taichung, China
| | - Wei Chen
- School of Radiology, Shandong First Medical University and Shandong Academy of Medical Sciences, Taian, China
| | - Yuteng Pan
- School of Radiology, Shandong First Medical University and Shandong Academy of Medical Sciences, Taian, China
| | - Jianfeng Qiu
- School of Radiology, Second Affiliated Hospital of Shandong First Medical University and Shandong Academy of Medical Sciences, Taian, China.
| |
Collapse
|
9
|
Zhou J, Wang X, Li Y, Yang Y, Shi J. Federated-learning-based prognosis assessment model for acute pulmonary thromboembolism. BMC Med Inform Decis Mak 2024; 24:141. [PMID: 38802861 PMCID: PMC11131248 DOI: 10.1186/s12911-024-02543-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2024] [Accepted: 05/17/2024] [Indexed: 05/29/2024] Open
Abstract
BACKGROUND Acute pulmonary thromboembolism (PTE) is a common cardiovascular disease and recognizing low prognosis risk patients with PTE accurately is significant for clinical treatment. This study evaluated the value of federated learning (FL) technology in PTE prognosis risk assessment while ensuring the security of clinical data. METHODS A retrospective dataset consisted of PTE patients from 12 hospitals were collected, and 19 physical indicators of patients were included to train the FL-based prognosis assessment model to predict the 30-day death event. Firstly, multiple machine learning methods based on FL were compared to choose the superior model. And then performance of models trained on the independent (IID) and non-independent identical distributed(Non-IID) datasets was calculated and they were tested further on Real-world data. Besides, the optimal model was compared with pulmonary embolism severity index (PESI), simplified PESI (sPESI), Peking Union Medical College Hospital (PUMCH). RESULTS The area under the receiver operating characteristic curve (AUC) of logistic regression(0.842) outperformed convolutional neural network (0.819) and multi layer perceptron (0.784). Under IID, AUC of model trained using FL(Fed) on the training, validation and test sets was 0.852 ± 0.002, 0.867 ± 0.012 and 0.829 ± 0.004. Under Real-world, AUC of Fed was 0.855 ± 0.005, 0.882 ± 0.003 and 0.835 ± 0.005. Under IID and Real-world, AUC of Fed surpassed centralization model(NonFed) (0.847 ± 0.001, 0.841 ± 0.001 and 0.811 ± 0.001). Under Non-IID, although AUC of Fed (0.846 ± 0.047) outperformed NonFed (0.841 ± 0.001) on validation set, it (0.821 ± 0.016 and 0.799 ± 0.031) slightly lagged behind NonFed (0.847 ± 0.001 and 0.811 ± 0.001) on the training and test sets. In practice, AUC of Fed (0.853, 0.884 and 0.842) outshone PESI (0.812, 0.789 and 0.791), sPESI (0.817, 0.770 and 0.786) and PUMCH(0.848, 0.814 and 0.832) on the training, validation and test sets. Additionally, Fed (0.842) exhibited higher AUC values across test sets compared to those trained directly on the clients (0.758, 0.801, 0.783, 0.741, 0.788). CONCLUSIONS In this study, the FL based machine learning model demonstrated commendable efficacy on PTE prognostic risk prediction, rendering it well-suited for deployment in hospitals.
Collapse
Affiliation(s)
- Jun Zhou
- State Key Laboratory of Networking and Switching Technology, Beijing University of Posts and Telecommunications, Beijing, China
| | - Xin Wang
- Department of Ultrasound, Peking Union Medical College Hospital, Beijing, China
| | - Yiyao Li
- Department of Pulmonary and Critical Care Medicine, Peking Union Medical College Hospital, Beijing, China
| | - Yuqing Yang
- State Key Laboratory of Networking and Switching Technology, Beijing University of Posts and Telecommunications, Beijing, China.
| | - Juhong Shi
- Department of Pulmonary and Critical Care Medicine, Peking Union Medical College Hospital, Beijing, China.
| |
Collapse
|
10
|
Zhou H, Yin L, Su R, Zhang Y, Yuan Y, Xie P, Li X. STCGRU: A hybrid model based on CNN and BiGRU for mild cognitive impairment diagnosis. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2024; 248:108123. [PMID: 38471292 DOI: 10.1016/j.cmpb.2024.108123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/19/2023] [Revised: 11/28/2023] [Accepted: 03/07/2024] [Indexed: 03/14/2024]
Abstract
BACKGROUND AND OBJECTIVE Early diagnosis of mild cognitive impairment (MCI) is one of the essential measures to prevent its further development into Alzheimer's disease (AD). In this paper, we propose a hybrid deep learning model for early diagnosis of MCI, called spatio-temporal convolutional gated recurrent unit network (STCGRU). METHODS The STCGRU comprises three bespoke convolutional neural network (CNN) modules and a bi-directional gated recurrent unit (BiGRU) module, which can effectively extract the spatial and temporal features of EEG and obtain excellent diagnostic results. We use a publicly available EEG dataset that has not undergone pre-processing to verify the robustness and accuracy of the model. Ablation experiments on STCGRU are conducted to showcase the individual performance improvement of each module. RESULTS Compared with other state-of-the-art approaches using the same publicly available EEG dataset, the results show that STCGRU is more suitable for early diagnosis of MCI. After 10-fold cross-validation, the average classification accuracy of the hybrid model reached 99.95 %, while the average kappa value reached 0.9989. CONCLUSIONS The experimental results show that the hybrid model proposed in this paper can directly extract compelling spatio-temporal features from the raw EEG data for classification. The STCGRU allows for accurate diagnosis of patients with MCI and has a high practical value.
Collapse
Affiliation(s)
- Hao Zhou
- Key Laboratory of Measurement Technology and Instrumentation of Hebei Province, Institute of Electric Engineering, Yanshan University, Qinhuangdao, PR China
| | - Liyong Yin
- The First Hospital of Qinhuangdao, Qinhuangdao, PR China
| | - Rui Su
- Hebei Medical University, Shijiazhuang, PR China
| | - Ying Zhang
- Key Laboratory of Measurement Technology and Instrumentation of Hebei Province, Institute of Electric Engineering, Yanshan University, Qinhuangdao, PR China
| | - Yi Yuan
- Key Laboratory of Measurement Technology and Instrumentation of Hebei Province, Institute of Electric Engineering, Yanshan University, Qinhuangdao, PR China
| | - Ping Xie
- Key Laboratory of Measurement Technology and Instrumentation of Hebei Province, Institute of Electric Engineering, Yanshan University, Qinhuangdao, PR China
| | - Xin Li
- Key Laboratory of Measurement Technology and Instrumentation of Hebei Province, Institute of Electric Engineering, Yanshan University, Qinhuangdao, PR China.
| |
Collapse
|
11
|
Shyamala Bharathi P, Shalini C. Advanced hybrid attention-based deep learning network with heuristic algorithm for adaptive CT and PET image fusion in lung cancer detection. Med Eng Phys 2024; 126:104138. [PMID: 38621836 DOI: 10.1016/j.medengphy.2024.104138] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2023] [Revised: 02/17/2024] [Accepted: 03/02/2024] [Indexed: 04/17/2024]
Abstract
Lung cancer is one of the most deadly diseases in the world. Lung cancer detection can save the patient's life. Despite being the best imaging tool in the medical sector, clinicians find it challenging to interpret and detect cancer from Computed Tomography (CT) scan data. One of the most effective ways for the diagnosis of certain malignancies like lung tumours is Positron Emission Tomography (PET) imaging. So many diagnosis models have been implemented nowadays to diagnose various diseases. Early lung cancer identification is very important for predicting the severity level of lung cancer in cancer patients. To explore the effective model, an image fusion-based detection model is proposed for lung cancer detection using an improved heuristic algorithm of the deep learning model. Firstly, the PET and CT images are gathered from the internet. Further, these two collected images are fused for further process by using the Adaptive Dilated Convolution Neural Network (AD-CNN), in which the hyperparameters are tuned by the Modified Initial Velocity-based Capuchin Search Algorithm (MIV-CapSA). Subsequently, the abnormal regions are segmented by influencing the TransUnet3+. Finally, the segmented images are fed into the Hybrid Attention-based Deep Networks (HADN) model, encompassed with Mobilenet and Shufflenet. Therefore, the effectiveness of the novel detection model is analyzed using various metrics compared with traditional approaches. At last, the outcome evinces that it aids in early basic detection to treat the patients effectively.
Collapse
Affiliation(s)
- P Shyamala Bharathi
- Saveetha School of Engineering, Saveetha Institute of Medical and Technical Sciences, Chennai, India.
| | - C Shalini
- Saveetha School of Engineering, Saveetha Institute of Medical and Technical Sciences, Chennai, India
| |
Collapse
|
12
|
Li X, Chen K, Yang J, Wang C, Yang T, Luo C, Li N, Liu Z. TLDA: A transfer learning based dual-augmentation strategy for traditional Chinese Medicine syndrome differentiation in rare disease. Comput Biol Med 2024; 169:107808. [PMID: 38101119 DOI: 10.1016/j.compbiomed.2023.107808] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2023] [Revised: 11/30/2023] [Accepted: 12/01/2023] [Indexed: 12/17/2023]
Abstract
The Traditional Chinese Medicine (TCM) has demonstrated its significant medical value over the decades, particularly during the COVID-19 pandemic. TCM-AI interdisciplinary models have been proposed to model TCM knowledge, diagnosis, and treatment experiments in clinical practice. Among them, numerous models have been developed to simulate the syndrome differentiation process of human TCM doctors for automatic syndrome diagnosis. However, these models are designed for normal scenarios and trained using a supervised learning paradigm which needs tens of thousands of training samples. They fail to effectively differentiate syndromes in rare disease scenarios where the available TCM electronic medical records (EMRs) are very limited for each unique syndrome. To address the challenge of rare diseases, this study proposes a simple yet effective method called Transfer Learning based Dual-Augmentation (TLDA). TLDA aims to augment the limited EMRs at both the sample-level and feature-level, enriching the pathological and medical information during training. Extended experiments involving 11 comparison models, including the state-of-the-art model, demonstrate the effectiveness of TLDA. TLDA outperforms all comparison models by a significant margin. Furthermore, TLDA can also be extended to other medical tasks when the EMRs for diagnosis are limited in samples.
Collapse
Affiliation(s)
- Xiaochen Li
- Interdisciplinary Research Centers, Zhejiang Lab, Hangzhou, 311100, China
| | - Kui Chen
- Interdisciplinary Research Centers, Zhejiang Lab, Hangzhou, 311100, China
| | - Jiaxi Yang
- Interdisciplinary Research Centers, Zhejiang Lab, Hangzhou, 311100, China
| | - Cheng Wang
- Interdisciplinary Research Centers, Zhejiang Lab, Hangzhou, 311100, China
| | - Tao Yang
- TCM Department, Beijing Shijitan Hospital, Capital Medical University, Beijing, 100038, China
| | - Changyong Luo
- Infectious Fever Center, Dongfang Hospital of Beijing University of Chinese Medicine, Beijing, 100078, China
| | - Nan Li
- Interdisciplinary Research Centers, Zhejiang Lab, Hangzhou, 311100, China
| | - Zhi Liu
- Interdisciplinary Research Centers, Zhejiang Lab, Hangzhou, 311100, China.
| |
Collapse
|
13
|
Chauhan M, Singh RP, Sonali, Zia G, Shekhar S, Yadav B, Garg V, Dutt R. An Overview of Current Progress and Challenges in Brain Cancer Therapy Using Advanced Nanoparticles. RECENT PATENTS ON NANOTECHNOLOGY 2024; 18:295-304. [PMID: 37904557 DOI: 10.2174/1872210517666230815105031] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/31/2022] [Revised: 06/22/2023] [Accepted: 07/18/2023] [Indexed: 11/01/2023]
Abstract
Brain tumors pose significant challenges in terms of complete cure and early-stage prognosis. The complexity of brain tumors, including their location, infiltrative nature, and intricate tumor microenvironment (TME), contributes to the difficulties in achieving a complete cure. The primary objective of brain cancer therapy is to effectively treat brain tumors and improve the patient's quality of life. Nanoparticles (NPs) have emerged as promising tools in this regard. They can be designed to deliver therapeutic drugs to the brain tumor site while also incorporating imaging agents. The NPs with the 10-200 nm range can cross the blood-brain barrier (BBB) and blood-brain tumor barrier (BBTB) and facilitate drug bioavailability. NPs can be designed by several methods to improve the pharmaceutical and pharmacological aspects of encapsulated therapeutic agents. NPs can be developed in various dosage forms to suit different administration routes in brain cancer therapy. The unique properties and versatility of NPs make them essential tools in the fight against brain tumors, offering new opportunities to improve patient outcomes and care. Having the ability to target brain tumors directly, overcome the BBB, and minimize systemic side effects makes NPs valuable tools in improving patient outcomes and care. The review highlights the challenges associated with brain tumor treatment and emphasizes the importance of early detection and diagnosis. The use of NPs for drug delivery and imaging in brain tumors is a promising approach to improving patient outcomes and quality of life. The versatility and unique properties of NPs make them valuable tools in the fight against brain tumors, and innovative NP-related patents have the potential to revolutionize healthcare.
Collapse
Affiliation(s)
- Mahima Chauhan
- Department of Pharmacy, School of Medical & Allied Sciences, GD Goenka University, Gurugram, 122103, India
| | - Rahul Pratap Singh
- Department of Pharmacy, School of Medical & Allied Sciences, GD Goenka University, Gurugram, 122103, India
| | - Sonali
- Guru Teg Bahadur Hospital, GTB Enclave, Dilshad Garden, New Delhi, Delhi, 110095, India
| | - Ghazala Zia
- Department of Pharmaceutical Sciences, Maharshi Dayanand University, Rohtak, 124001, India
| | - Saurabh Shekhar
- Department of Pharmacy, School of Medical & Allied Sciences, GD Goenka University, Gurugram, 122103, India
| | - Bhavna Yadav
- Department of Pharmacy, School of Medical & Allied Sciences, GD Goenka University, Gurugram, 122103, India
| | - Vandana Garg
- Guru Teg Bahadur Hospital, GTB Enclave, Dilshad Garden, New Delhi, Delhi, 110095, India
| | - Rohit Dutt
- Department of Pharmacy, School of Medical & Allied Sciences, GD Goenka University, Gurugram, 122103, India
| |
Collapse
|
14
|
Dargahi H, Kooshkebaghi M, Mireshghollah M. Learner satisfaction with synchronous and asynchronous virtual learning systems during the COVID-19 pandemic in Tehran university of medical sciences: a comparative analysis. BMC MEDICAL EDUCATION 2023; 23:886. [PMID: 37990188 PMCID: PMC10661977 DOI: 10.1186/s12909-023-04872-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/24/2023] [Accepted: 11/14/2023] [Indexed: 11/23/2023]
Abstract
BACKGROUND The need for electronic learning and its systems, especially during specific circumstances and crises, is crucial and fundamental for users in universities. However, what is even more important is the awareness and familiarity of learners with different systems and their appropriate use in e-learning. Therefore, the present study was conducted to determine the satisfaction of learners with synchronous and asynchronous electronic learning systems during the COVID-19 period at Tehran University of Medical Sciences. METHODS The present study was a descriptive-analytical study conducted cross-sectionally from the first semester of 2019-2020 academic year until the end of the second semester of 2021-2022 academic year, coinciding with the COVID-19 pandemic. The sample size was determined to be 370 students and 650 staff members using the Krejcie and Morgan table. The face validity and reliability of the research tool, which was a researcher-made questionnaire, was confirmed. Considering a response rate of 75%, 280 completed questionnaires were received from students, and 500 completed questionnaires were collected from employees. For data analysis, absolute and relative frequencies, as well as independent t-test, analysis of variance (ANOVA), and Post Hoc tests in the SPSS software were utilized. RESULTS During the COVID-19 pandemic, both students and staff members at Tehran University of Medical Sciences showed a relatively decreasing level of satisfaction with electronic learning. There was a significant difference in satisfaction between these two groups of learners regarding electronic learning (P = 0/031). Learners were relatively more satisfied with the offline system called "Navid" compared to online learning systems. Among the online systems, the highest level of satisfaction was observed with the Skype platform. CONCLUSION Although learners expressed relative satisfaction with electronic learning during the COVID-19 period, it is necessary to strengthen infrastructure and provide support services, technical assistance, and continuous updates for electronic learning platforms. This can contribute to more effective and efficient utilization of electronic learning, especially during particular circumstances and crises, or in hybrid models combining online and face to face education and training.
Collapse
Affiliation(s)
- Hossein Dargahi
- Health Management, Policy Making and Economic Department, School of Public Health, Health Information Management Research Center, Tehran University of Medical Sciences, Tehran, Iran.
| | - Mahdi Kooshkebaghi
- Health Services Management, Yas Hospital, Tehran University of Medical Sciences, Tehran, Iran
| | - Masoumeh Mireshghollah
- Educational Management, School of Nursing and Midwifery, Tehran University of Medical Sciences, Tehran, Iran
| |
Collapse
|
15
|
Rai HM, Yoo J. A comprehensive analysis of recent advancements in cancer detection using machine learning and deep learning models for improved diagnostics. J Cancer Res Clin Oncol 2023; 149:14365-14408. [PMID: 37540254 DOI: 10.1007/s00432-023-05216-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2023] [Accepted: 07/26/2023] [Indexed: 08/05/2023]
Abstract
PURPOSE There are millions of people who lose their life due to several types of fatal diseases. Cancer is one of the most fatal diseases which may be due to obesity, alcohol consumption, infections, ultraviolet radiation, smoking, and unhealthy lifestyles. Cancer is abnormal and uncontrolled tissue growth inside the body which may be spread to other body parts other than where it has originated. Hence it is very much required to diagnose the cancer at an early stage to provide correct and timely treatment. Also, manual diagnosis and diagnostic error may cause of the death of many patients hence much research are going on for the automatic and accurate detection of cancer at early stage. METHODS In this paper, we have done the comparative analysis of the diagnosis and recent advancement for the detection of various cancer types using traditional machine learning (ML) and deep learning (DL) models. In this study, we have included four types of cancers, brain, lung, skin, and breast and their detection using ML and DL techniques. In extensive review we have included a total of 130 pieces of literature among which 56 are of ML-based and 74 are from DL-based cancer detection techniques. Only the peer reviewed research papers published in the recent 5-year span (2018-2023) have been included for the analysis based on the parameters, year of publication, feature utilized, best model, dataset/images utilized, and best accuracy. We have reviewed ML and DL-based techniques for cancer detection separately and included accuracy as the performance evaluation metrics to maintain the homogeneity while verifying the classifier efficiency. RESULTS Among all the reviewed literatures, DL techniques achieved the highest accuracy of 100%, while ML techniques achieved 99.89%. The lowest accuracy achieved using DL and ML approaches were 70% and 75.48%, respectively. The difference in accuracy between the highest and lowest performing models is about 28.8% for skin cancer detection. In addition, the key findings, and challenges for each type of cancer detection using ML and DL techniques have been presented. The comparative analysis between the best performing and worst performing models, along with overall key findings and challenges, has been provided for future research purposes. Although the analysis is based on accuracy as the performance metric and various parameters, the results demonstrate a significant scope for improvement in classification efficiency. CONCLUSION The paper concludes that both ML and DL techniques hold promise in the early detection of various cancer types. However, the study identifies specific challenges that need to be addressed for the widespread implementation of these techniques in clinical settings. The presented results offer valuable guidance for future research in cancer detection, emphasizing the need for continued advancements in ML and DL-based approaches to improve diagnostic accuracy and ultimately save more lives.
Collapse
Affiliation(s)
- Hari Mohan Rai
- School of Computing, Gachon University, 1342 Seongnam-daero, Sujeong-gu, Seongnam-si, 13120, Gyeonggi-do, Republic of Korea.
| | - Joon Yoo
- School of Computing, Gachon University, 1342 Seongnam-daero, Sujeong-gu, Seongnam-si, 13120, Gyeonggi-do, Republic of Korea
| |
Collapse
|
16
|
Aydin SG, Bilge HŞ. FPGA Implementation of Image Registration Using Accelerated CNN. SENSORS (BASEL, SWITZERLAND) 2023; 23:6590. [PMID: 37514883 PMCID: PMC10386551 DOI: 10.3390/s23146590] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/26/2023] [Revised: 07/17/2023] [Accepted: 07/19/2023] [Indexed: 07/30/2023]
Abstract
BACKGROUND Accurate and fast image registration (IR) is critical during surgical interventions where the ultrasound (US) modality is used for image-guided intervention. Convolutional neural network (CNN)-based IR methods have resulted in applications that respond faster than traditional iterative IR methods. However, general-purpose processors are unable to operate at the maximum speed possible for real-time CNN algorithms. Due to its reconfigurable structure and low power consumption, the field programmable gate array (FPGA) has gained prominence for accelerating the inference phase of CNN applications. METHODS This study proposes an FPGA-based ultrasound IR CNN (FUIR-CNN) to regress three rigid registration parameters from image pairs. To speed up the estimation process, the proposed design makes use of fixed-point data and parallel operations carried out by unrolling and pipelining techniques. Experiments were performed on three US datasets in real time using the xc7z020, and the xcku5p was also used during implementation. RESULTS The FUIR-CNN produced results for the inference phase 139 times faster than the software-based network while retaining a negligible drop in regression performance of under 200 MHz clock frequency. CONCLUSIONS Comprehensive experimental results demonstrate that the proposed end-to-end FPGA-based accelerated CNN achieves a negligible loss, a high speed for registration parameters, less power when compared to the CPU, and the potential for real-time medical imaging.
Collapse
Affiliation(s)
- Seda Guzel Aydin
- Department of Electrical and Electronics Engineering, Bingol University, Bingol 12000, Turkey
| | - Hasan Şakir Bilge
- Biomedical Calibration and Research Center (BIYOKAM), Gazi University, Ankara 06560, Turkey
| |
Collapse
|
17
|
Calle JLP, Punta-Sánchez I, González-de-Peredo AV, Ruiz-Rodríguez A, Ferreiro-González M, Palma M. Rapid and Automated Method for Detecting and Quantifying Adulterations in High-Quality Honey Using Vis-NIRs in Combination with Machine Learning. Foods 2023; 12:2491. [PMID: 37444229 DOI: 10.3390/foods12132491] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2023] [Revised: 06/20/2023] [Accepted: 06/21/2023] [Indexed: 07/15/2023] Open
Abstract
Honey is one of the most adulterated foods, usually through the addition of sweeteners or low-cost honeys. This study presents a method based on visible near infrared spectroscopy (Vis-NIRs), in combination with machine learning (ML) algorithms, for the correct identification and quantification of adulterants in honey. Honey samples from two botanical origins (orange blossom and sunflower) were evaluated and adulterated with low-cost honey in different percentages (5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, and 50%). The results of the exploratory analysis showed a tendency to group the samples according to botanical origin, as well as the presence of adulteration. A supervised analysis was performed to detect the presence of adulterations. The best performance with 100% accuracy was achieved by support vector machines (SVM) and random forests (RF). A regression study was also carried out to quantify the percentage of adulteration. The best result was obtained by support vector regression (SVR) with a coefficient of determination (R2) of 0.991 and a root mean squared error (RMSE) of 1.894. These results demonstrate the potential of combining ML with spectroscopic data as a method for the automated quality control of honey.
Collapse
Affiliation(s)
- José Luis P Calle
- Department of Analytical Chemistry, Faculty of Sciences, University of Cadiz, Agrifood Campus of International Excellence (ceiA3), IVAGRO, 11510 Puerto Real, Spain
| | - Irene Punta-Sánchez
- Department of Analytical Chemistry, Faculty of Sciences, University of Cadiz, Agrifood Campus of International Excellence (ceiA3), IVAGRO, 11510 Puerto Real, Spain
| | - Ana Velasco González-de-Peredo
- Department of Analytical Chemistry, Faculty of Sciences, University of Cadiz, Agrifood Campus of International Excellence (ceiA3), IVAGRO, 11510 Puerto Real, Spain
| | - Ana Ruiz-Rodríguez
- Department of Analytical Chemistry, Faculty of Sciences, University of Cadiz, Agrifood Campus of International Excellence (ceiA3), IVAGRO, 11510 Puerto Real, Spain
| | - Marta Ferreiro-González
- Department of Analytical Chemistry, Faculty of Sciences, University of Cadiz, Agrifood Campus of International Excellence (ceiA3), IVAGRO, 11510 Puerto Real, Spain
| | - Miguel Palma
- Department of Analytical Chemistry, Faculty of Sciences, University of Cadiz, Agrifood Campus of International Excellence (ceiA3), IVAGRO, 11510 Puerto Real, Spain
| |
Collapse
|