1
|
Hansun S, Argha A, Bakhshayeshi I, Wicaksana A, Alinejad-Rokny H, Fox GJ, Liaw ST, Celler BG, Marks GB. Diagnostic Performance of Artificial Intelligence-Based Methods for Tuberculosis Detection: Systematic Review. J Med Internet Res 2025; 27:e69068. [PMID: 40053773 PMCID: PMC11928776 DOI: 10.2196/69068] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2024] [Revised: 01/10/2025] [Accepted: 02/07/2025] [Indexed: 03/09/2025] Open
Abstract
BACKGROUND Tuberculosis (TB) remains a significant health concern, contributing to the highest mortality among infectious diseases worldwide. However, none of the various TB diagnostic tools introduced is deemed sufficient on its own for the diagnostic pathway, so various artificial intelligence (AI)-based methods have been developed to address this issue. OBJECTIVE We aimed to provide a comprehensive evaluation of AI-based algorithms for TB detection across various data modalities. METHODS Following PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analysis) 2020 guidelines, we conducted a systematic review to synthesize current knowledge on this topic. Our search across 3 major databases (Scopus, PubMed, Association for Computing Machinery [ACM] Digital Library) yielded 1146 records, of which we included 152 (13.3%) studies in our analysis. QUADAS-2 (Quality Assessment of Diagnostic Accuracy Studies version 2) was performed for the risk-of-bias assessment of all included studies. RESULTS Radiographic biomarkers (n=129, 84.9%) and deep learning (DL; n=122, 80.3%) approaches were predominantly used, with convolutional neural networks (CNNs) using Visual Geometry Group (VGG)-16 (n=37, 24.3%), ResNet-50 (n=33, 21.7%), and DenseNet-121 (n=19, 12.5%) architectures being the most common DL approach. The majority of studies focused on model development (n=143, 94.1%) and used a single modality approach (n=141, 92.8%). AI methods demonstrated good performance in all studies: mean accuracy=91.93% (SD 8.10%, 95% CI 90.52%-93.33%; median 93.59%, IQR 88.33%-98.32%), mean area under the curve (AUC)=93.48% (SD 7.51%, 95% CI 91.90%-95.06%; median 95.28%, IQR 91%-99%), mean sensitivity=92.77% (SD 7.48%, 95% CI 91.38%-94.15%; median 94.05% IQR 89%-98.87%), and mean specificity=92.39% (SD 9.4%, 95% CI 90.30%-94.49%; median 95.38%, IQR 89.42%-99.19%). AI performance across different biomarker types showed mean accuracies of 92.45% (SD 7.83%), 89.03% (SD 8.49%), and 84.21% (SD 0%); mean AUCs of 94.47% (SD 7.32%), 88.45% (SD 8.33%), and 88.61% (SD 5.9%); mean sensitivities of 93.8% (SD 6.27%), 88.41% (SD 10.24%), and 93% (SD 0%); and mean specificities of 94.2% (SD 6.63%), 85.89% (SD 14.66%), and 95% (SD 0%) for radiographic, molecular/biochemical, and physiological types, respectively. AI performance across various reference standards showed mean accuracies of 91.44% (SD 7.3%), 93.16% (SD 6.44%), and 88.98% (SD 9.77%); mean AUCs of 90.95% (SD 7.58%), 94.89% (SD 5.18%), and 92.61% (SD 6.01%); mean sensitivities of 91.76% (SD 7.02%), 93.73% (SD 6.67%), and 91.34% (SD 7.71%); and mean specificities of 86.56% (SD 12.8%), 93.69% (SD 8.45%), and 92.7% (SD 6.54%) for bacteriological, human reader, and combined reference standards, respectively. The transfer learning (TL) approach showed increasing popularity (n=89, 58.6%). Notably, only 1 (0.7%) study conducted domain-shift analysis for TB detection. CONCLUSIONS Findings from this review underscore the considerable promise of AI-based methods in the realm of TB detection. Future research endeavors should prioritize conducting domain-shift analyses to better simulate real-world scenarios in TB detection. TRIAL REGISTRATION PROSPERO CRD42023453611; https://www.crd.york.ac.uk/PROSPERO/view/CRD42023453611.
Collapse
Affiliation(s)
- Seng Hansun
- School of Clinical Medicine, South West Sydney, UNSW Medicine & Health, UNSW Sydney, Sydney, Australia
- Woolcock Vietnam Research Group, Woolcock Institute of Medical Research, Sydney, Australia
| | - Ahmadreza Argha
- Graduate School of Biomedical Engineering, UNSW Sydney, Sydney, Australia
- Tyree Institute of Health Engineering, UNSW Sydney, Sydney, Australia
- Ageing Future Institute, UNSW Sydney, Sydney, Australia
| | - Ivan Bakhshayeshi
- Graduate School of Biomedical Engineering, UNSW Sydney, Sydney, Australia
- BioMedical Machine Learning Lab, Graduate School of Biomedical Engineering, UNSW Sydney, Sydney, Australia
| | - Arya Wicaksana
- Informatics Department, Universitas Multimedia Nusantara, Tangerang, Indonesia
| | - Hamid Alinejad-Rokny
- Tyree Institute of Health Engineering, UNSW Sydney, Sydney, Australia
- Ageing Future Institute, UNSW Sydney, Sydney, Australia
- BioMedical Machine Learning Lab, Graduate School of Biomedical Engineering, UNSW Sydney, Sydney, Australia
| | - Greg J Fox
- NHMRC Clinical Trials Centre, Faculty of Medicine and Health, University of Sydney, Sydney, Australia
| | - Siaw-Teng Liaw
- School of Population Health and School of Clinical Medicine, UNSW Sydney, Sydney, Australia
| | - Branko G Celler
- Biomedical Systems Research Laboratory, School of Electrical Engineering and Telecommunications, UNSW Sydney, Sydney, Australia
| | - Guy B Marks
- School of Clinical Medicine, South West Sydney, UNSW Medicine & Health, UNSW Sydney, Sydney, Australia
- Woolcock Vietnam Research Group, Woolcock Institute of Medical Research, Sydney, Australia
- Burnet Institute, Melbourne, Australia
| |
Collapse
|
2
|
Hage Chehade A, Abdallah N, Marion JM, Hatt M, Oueidat M, Chauvet P. Advancing chest X-ray diagnostics: A novel CycleGAN-based preprocessing approach for enhanced lung disease classification in ChestX-Ray14. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2025; 259:108518. [PMID: 39615193 DOI: 10.1016/j.cmpb.2024.108518] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/21/2024] [Revised: 10/28/2024] [Accepted: 11/15/2024] [Indexed: 12/11/2024]
Abstract
BACKGROUND AND OBJECTIVE Chest radiography is a medical imaging technique widely used to diagnose thoracic diseases. However, X-ray images may contain artifacts such as irrelevant objects, medical devices, wires and electrodes that can introduce unnecessary noise, making difficult the distinction of relevant anatomical structures, and hindering accurate diagnoses. We aim in this study to address the issue of these artifacts in order to improve lung diseases classification results. METHODS In this paper we present a novel preprocessing approach which begins by detecting images that contain artifacts and then we reduce the artifacts' noise effect by generating sharper images using a CycleGAN model. The DenseNet-121 model, used for the classification, incorporates channel and spatial attention mechanisms to specifically focus on relevant parts of the image. Additional information contained in the dataset, namely clinical characteristics, were also integrated into the model. RESULTS We evaluated the performance of the classification model before and after applying our proposed artifact preprocessing approach. These results clearly demonstrate that our preprocessing approach significantly improves the model's AUC by 5.91% for pneumonia and 6.44% for consolidation classification, outperforming previous studies for the 14 diseases in the ChestX-Ray14 dataset. CONCLUSION This research highlights the importance of considering the presence of artifacts when diagnosing lung diseases from radiographic images. By eliminating unwanted noise, our approach enables models to focus on relevant diagnostic features, thereby improving their performance. The results demonstrated that our approach is promising, highlighting its potential for broader applications in lung disease classification.
Collapse
Affiliation(s)
| | | | | | - Mathieu Hatt
- LaTIM, INSERM UMR 1101, University of Brest, Brest, France
| | | | | |
Collapse
|
3
|
Mei L, Lian C, Han S, Jin S, He J, Dong L, Wang H, Shen H, Lei C, Xiong B. High-Accuracy and Lightweight Image Classification Network for Optimizing Lymphoblastic Leukemia Diagnosisy. Microsc Res Tech 2025; 88:489-500. [PMID: 39429031 DOI: 10.1002/jemt.24704] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2024] [Revised: 08/09/2024] [Accepted: 09/16/2024] [Indexed: 10/22/2024]
Abstract
Leukemia is a hematological malignancy that significantly impacts the human immune system. Early detection helps to effectively manage and treat cancer. Although deep learning techniques hold promise for early detection of blood disorders, their effectiveness is often limited by the physical constraints of available datasets and deployed devices. For this investigation, we collect an excellent-quality dataset of 17,826 morphological bone marrow cell images from 85 patients with lymphoproliferative neoplasms. We employ a progressive shrinking approach, which integrates a comprehensive pruning technique across multiple dimensions, including width, depth, resolution, and kernel size, to train our lightweight model. The proposed model achieves rapid identification of acute lymphoblastic leukemia, chronic lymphocytic leukemia, and other bone marrow cell types with an accuracy of 92.51% and a throughput of 111 slides per second, while comprising only 6.4 million parameters. This model significantly contributes to leukemia diagnosis, particularly in the rapid and accurate identification of lymphatic system diseases, and provides potential opportunities to enhance the efficiency and accuracy of medical experts in the diagnosis and treatment of lymphocytic leukemia.
Collapse
Affiliation(s)
- Liye Mei
- School of Computer Science, Hubei University of Technology, Wuhan, China
- The Institute of Technological Sciences, Wuhan University, Wuhan, China
| | - Chentao Lian
- School of Computer Science, Hubei University of Technology, Wuhan, China
| | - Suyang Han
- The Second Clinical School of Wuhan University, Zhongnan Hospital of Wuhan University, Wuhan, China
| | - Shuangtong Jin
- School of Computer Science, Hubei University of Technology, Wuhan, China
| | - Jing He
- The Department of Hematology, Zhongnan Hospital of Wuhan University, Wuhan, China
| | - Lan Dong
- Department of Gynecology, Renmin Hospital of Wuhan University, Wuhan, China
| | - Hongzhu Wang
- Faculty of Engineering, University of New South Wales, Kensington, New South Wales, Australia
| | - Hui Shen
- The Department of Hematology, Zhongnan Hospital of Wuhan University, Wuhan, China
| | - Cheng Lei
- The Institute of Technological Sciences, Wuhan University, Wuhan, China
- Suzhou Institute of Wuhan University, Suzhou, China
- Shenzhen Institute of Wuhan University, Shenzhen, China
| | - Bei Xiong
- The Department of Hematology, Zhongnan Hospital of Wuhan University, Wuhan, China
| |
Collapse
|
4
|
Visu P, Sathiya V, Ajitha P, Surendran R. Enhanced swin transformer based tuberculosis classification with segmentation using chest X-ray. JOURNAL OF X-RAY SCIENCE AND TECHNOLOGY 2025; 33:167-186. [PMID: 39973770 DOI: 10.1177/08953996241300018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/21/2025]
Abstract
BACKGROUND: Tuberculosis disease is the disease that causes significant morbidity and mortality worldwide. Thus, early detection of the disease is crucial for proper treatment and controlling the spread of Tuberculosis disease. Chest X-ray imaging is one of the most widely used diagnostic tools for detecting the Tuberculosis, which is time-consuming, and prone to errors. Nowadays, deep learning model provides the automated classification of medical images with promising outcome. OBJECTIVE: Thus, this research introduced a deep learning based segmentation and classification model. Initially, the Adaptive Gaussian Filtering based pre-processing and data augmentation is performed to remove artefacts and biased outcome. Then, Attention UNet (A_UNet) based segmentation is proposed for segmenting the required region of Chest X-ray. METHODS: Using the segmented outcome, Enhanced Swin Transformer (EnSTrans) model based Tuberculosis classification model is designed with Residual Pyramid Network based Multi-layer perceptron (MLP) layer for enhancing the classification accuracy. RESULTS: Enhanced Lotus Effect Optimization (EnLeO) Algorithm is employed for the loss function optimization of the EnSTrans model. CONCLUSIONS: The proposed methods acquired the Accuracy, Recall, Precision, F-score, and Specificity of 99.0576%, 98.9459%, 99.145%, 98.96%, and 99.152% respectively.
Collapse
Affiliation(s)
- P Visu
- Department of Artificial Intelligence and Data Science, Velammal Engineering College, Chennai, India
| | - V Sathiya
- Department of Computer Science and Engineering, Panimalar Engineering College, Chennai, India
| | - P Ajitha
- Department of Computer Science and Engineering, School of Computing, Sathyabama Institue of Science and Technology, Chennai, India
| | - R Surendran
- Department of Computer Science and Engineering, Saveetha School of Engineering, Saveetha Institute of Medical and Technical Sciences, Chennai, India
| |
Collapse
|
5
|
Chibuike O, Yang X. Convolutional Neural Network-Vision Transformer Architecture with Gated Control Mechanism and Multi-Scale Fusion for Enhanced Pulmonary Disease Classification. Diagnostics (Basel) 2024; 14:2790. [PMID: 39767151 PMCID: PMC11727035 DOI: 10.3390/diagnostics14242790] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2024] [Revised: 12/09/2024] [Accepted: 12/11/2024] [Indexed: 01/16/2025] Open
Abstract
BACKGROUND/OBJECTIVES Vision Transformers (ViTs) and convolutional neural networks (CNNs) have demonstrated remarkable performances in image classification, especially in the domain of medical imaging analysis. However, ViTs struggle to capture high-frequency components of images, which are critical in identifying fine-grained patterns, while CNNs have difficulties in capturing long-range dependencies due to their local receptive fields, which makes it difficult to fully capture the spatial relationship across lung regions. METHODS In this paper, we proposed a hybrid architecture that integrates ViTs and CNNs within a modular component block(s) to leverage both local feature extraction and global context capture. In each component block, the CNN is used to extract the local features, which are then passed through the ViT to capture the global dependencies. We implemented a gated attention mechanism that combines the channel-, spatial-, and element-wise attention to selectively emphasize the important features, thereby enhancing overall feature representation. Furthermore, we incorporated a multi-scale fusion module (MSFM) in the proposed framework to fuse the features at different scales for more comprehensive feature representation. RESULTS Our proposed model achieved an accuracy of 99.50% in the classification of four pulmonary conditions. CONCLUSIONS Through extensive experiments and ablation studies, we demonstrated the effectiveness of our approach in improving the medical image classification performance, while achieving good calibration results. This hybrid approach offers a promising framework for reliable and accurate disease diagnosis in medical imaging.
Collapse
Affiliation(s)
- Okpala Chibuike
- Department of Human Ecology & Technology, Handong Global University, Pohang 37554, Republic of Korea;
| | - Xiaopeng Yang
- Department of Human Ecology & Technology, Handong Global University, Pohang 37554, Republic of Korea;
- School of Global Entrepreneurship and Information Communication Technology, Handong Global University, Pohang 37554, Republic of Korea
| |
Collapse
|
6
|
Arjun KP, Kumar KS, Dhanaraj RK, Ravi V, Kumar TG. Optimizing time prediction and error classification in early melanoma detection using a hybrid RCNN-LSTM model. Microsc Res Tech 2024; 87:1789-1809. [PMID: 38515433 DOI: 10.1002/jemt.24559] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2023] [Revised: 01/13/2024] [Accepted: 03/14/2024] [Indexed: 03/23/2024]
Abstract
Skin cancer is a terrifying disorder that affects all individuals. Due to the significant increase in the rate of melanoma skin cancer, early detection of skin cancer is now more critical than ever before. Malignant melanoma is one of the most serious forms of skin cancer, and it is caused by abnormal melanocyte cell growth. In recent years, skin cancer predictive categorization has become more accurate and predictive due to multiple deep learning algorithms. Malignant melanoma is diagnosed using the Recurrent Convolution Neural Network-Long Short-Term Memory (RCNN-LSTM), which is one of the deep learning classification approaches. Using the International Skin Image Collection and the RCNN-LSTM, the data are categorized and analyzed to gain a better understanding of skin cancer. The method begins with data preprocessing, which prepares the dataset for classification. Additionally, the RCNN is employed to extract the features that are vital to the prediction process. The LSTM is accountable for the final step, classification. There are further factors to examine, such as the precision of 94.60%, the sensitivity of 95.67%, and the F1-score of 95.13%. Other benefits of the suggested study include shorter prediction durations of 95.314, 122.530, and 131.205 s and lower model loss of 0.25%, 0.19%, and 0.15% for input sizes 10, 15, and 20, respectively. Three datasets had a reduced categorization error of 5.11% and an accuracy of 95.42%. In comparison to previous approaches, the work discussed here produces superior outcomes. RESEARCH HIGHLIGHTS: Recurrent convolutional neural network (RCNN) deep learning approach for optimizing time prediction and error classification in early melanoma detection. It extracts a high number of specific features from the skin disease image, making the classification process easier and more accurate. To reduce classification errors in accurately detecting melanoma, context dependency is considered in this work. By accounting for context dependency, the deprivation state is avoided, preventing performance degradation in the model. To minimize melanoma detection model loss, a skin disease image augmentation or regularization process is performed in this work. This strategy improves the accuracy of the model when applied to fresh, previously unobserved data.
Collapse
Affiliation(s)
- K P Arjun
- Department of Computer Science and Engineering, GITAM University, Bangalore, India
| | - K Sampath Kumar
- Department of Computer Science and Engineering, AMET University, Chennai, India
| | - Rajesh Kumar Dhanaraj
- Symbiosis Institute of Computer Studies and Research (SICSR), Symbiosis International (Deemed University), Pune, India
| | - Vinayakumar Ravi
- Center for Artificial Intelligence, Prince Mohammad Bin Fahd University, Khobar, Saudi Arabia
| | - T Ganesh Kumar
- School of Computing Science and Engineering, Galgotias University, Greater Noida, India
| |
Collapse
|
7
|
Siddiqi R, Javaid S. Deep Learning for Pneumonia Detection in Chest X-ray Images: A Comprehensive Survey. J Imaging 2024; 10:176. [PMID: 39194965 DOI: 10.3390/jimaging10080176] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2024] [Revised: 07/15/2024] [Accepted: 07/19/2024] [Indexed: 08/29/2024] Open
Abstract
This paper addresses the significant problem of identifying the relevant background and contextual literature related to deep learning (DL) as an evolving technology in order to provide a comprehensive analysis of the application of DL to the specific problem of pneumonia detection via chest X-ray (CXR) imaging, which is the most common and cost-effective imaging technique available worldwide for pneumonia diagnosis. This paper in particular addresses the key period associated with COVID-19, 2020-2023, to explain, analyze, and systematically evaluate the limitations of approaches and determine their relative levels of effectiveness. The context in which DL is applied as both an aid to and an automated substitute for existing expert radiography professionals, who often have limited availability, is elaborated in detail. The rationale for the undertaken research is provided, along with a justification of the resources adopted and their relevance. This explanatory text and the subsequent analyses are intended to provide sufficient detail of the problem being addressed, existing solutions, and the limitations of these, ranging in detail from the specific to the more general. Indeed, our analysis and evaluation agree with the generally held view that the use of transformers, specifically, vision transformers (ViTs), is the most promising technique for obtaining further effective results in the area of pneumonia detection using CXR images. However, ViTs require extensive further research to address several limitations, specifically the following: biased CXR datasets, data and code availability, the ease with which a model can be explained, systematic methods of accurate model comparison, the notion of class imbalance in CXR datasets, and the possibility of adversarial attacks, the latter of which remains an area of fundamental research.
Collapse
Affiliation(s)
- Raheel Siddiqi
- Computer Science Department, Karachi Campus, Bahria University, Karachi 73500, Pakistan
| | - Sameena Javaid
- Computer Science Department, Karachi Campus, Bahria University, Karachi 73500, Pakistan
| |
Collapse
|
8
|
Shayegan MJ. A brief review and scientometric analysis on ensemble learning methods for handling COVID-19. Heliyon 2024; 10:e26694. [PMID: 38420425 PMCID: PMC10901105 DOI: 10.1016/j.heliyon.2024.e26694] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2022] [Revised: 02/07/2024] [Accepted: 02/19/2024] [Indexed: 03/02/2024] Open
Abstract
Numerous efforts and research have been conducted worldwide to combat the coronavirus disease 2019 (COVID-19) pandemic. In this regard, some researchers have focused on deep and machine-learning approaches to discover more about this disease. There have been many articles on using ensemble learning methods for COVID-19 detection. Still, there seems to be no scientometric analysis or a brief review of these researches. Hence, a combined method of scientometric analysis and brief review was used to study the published articles that employed an ensemble learning approach to detect COVID-19. This research used both methods to overcome their limitations, leading to enhanced and reliable outcomes. The related articles were retrieved from the Scopus database. Then a two-step procedure was employed. A concise review of the collected articles was conducted. Then they underwent scientometric and bibliometric analyses. The findings revealed that convolutional neural network (CNN) is the mostly employed algorithm, while support vector machine (SVM), random forest, Resnet, DenseNet, and visual geometry group (VGG) were also frequently used. Additionally, China has had a significant presence in the numerous top-ranking categories of this field of research. Both study phases yielded valuable results and rankings.
Collapse
|
9
|
Saju B, Tressa N, Dhanaraj RK, Tharewal S, Mathew JC, Pelusi D. Effective multi-class lungdisease classification using the hybridfeature engineering mechanism. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2023; 20:20245-20273. [PMID: 38052644 DOI: 10.3934/mbe.2023896] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/07/2023]
Abstract
The utilization of computational models in the field of medical image classification is an ongoing and unstoppable trend, driven by the pursuit of aiding medical professionals in achieving swift and precise diagnoses. Post COVID-19, many researchers are studying better classification and diagnosis of lung diseases particularly, as it was reported that one of the very few diseases greatly affecting human beings was related to lungs. This research study, as presented in the paper, introduces an advanced computer-assisted model that is specifically tailored for the classification of 13 lung diseases using deep learning techniques, with a focus on analyzing chest radiograph images. The work flows from data collection, image quality enhancement, feature extraction to a comparative classification performance analysis. For data collection, an open-source data set consisting of 112,000 chest X-Ray images was used. Since, the quality of the pictures was significant for the work, enhanced image quality is achieved through preprocessing techniques such as Otsu-based binary conversion, contrast limited adaptive histogram equalization-driven noise reduction, and Canny edge detection. Feature extraction incorporates connected regions, histogram of oriented gradients, gray-level co-occurrence matrix and Haar wavelet transformation, complemented by feature selection via regularized neighbourhood component analysis. The paper proposes an optimized hybrid model, improved Aquila optimization convolutional neural networks (CNN), which is a combination of optimized CNN and DENSENET121 with applied batch equalization, which provides novelty for the model compared with other similar works. The comparative evaluation of classification performance among CNN, DENSENET121 and the proposed hybrid model is also done to find the results. The findings highlight the proposed hybrid model's supremacy, boasting 97.00% accuracy, 94.00% precision, 96.00% sensitivity, 96.00% specificity and 95.00% F1-score. In the future, potential avenues encompass exploring explainable machine learning for discerning model decisions and optimizing performance through strategic model restructuring.
Collapse
Affiliation(s)
- Binju Saju
- Department of Master of Computer Applications, New Horizon College of Engineering, Bengaluru, India
| | - Neethu Tressa
- Department of Master of Computer Applications, New Horizon College of Engineering, Bengaluru, India
| | - Rajesh Kumar Dhanaraj
- Symbiosis Institute of Computer Studies and Research (SICSR), Symbiosis International University, Pune, India
| | - Sumegh Tharewal
- Symbiosis Institute of Computer Studies and Research (SICSR), Symbiosis International University, Pune, India
| | | | - Danilo Pelusi
- Department of Communication Sciences, University of Teramo, Teramo, Italy
| |
Collapse
|
10
|
Bakasa W, Viriri S. Stacked ensemble deep learning for pancreas cancer classification using extreme gradient boosting. Front Artif Intell 2023; 6:1232640. [PMID: 37876961 PMCID: PMC10591225 DOI: 10.3389/frai.2023.1232640] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2023] [Accepted: 09/04/2023] [Indexed: 10/26/2023] Open
Abstract
Ensemble learning aims to improve prediction performance by combining several models or forecasts. However, how much and which ensemble learning techniques are useful in deep learning-based pipelines for pancreas computed tomography (CT) image classification is a challenge. Ensemble approaches are the most advanced solution to many machine learning problems. These techniques entail training multiple models and combining their predictions to improve the predictive performance of a single model. This article introduces the idea of Stacked Ensemble Deep Learning (SEDL), a pipeline for classifying pancreas CT medical images. The weak learners are Inception V3, VGG16, and ResNet34, and we employed a stacking ensemble. By combining the first-level predictions, an input train set for XGBoost, the ensemble model at the second level of prediction, is created. Extreme Gradient Boosting (XGBoost), employed as a strong learner, will make the final classification. Our findings showed that SEDL performed better, with a 98.8% ensemble accuracy, after some adjustments to the hyperparameters. The Cancer Imaging Archive (TCIA) public access dataset consists of 80 pancreas CT scans with a resolution of 512 * 512 pixels, from 53 male and 27 female subjects. A sample of two hundred and twenty-two images was used for training and testing data. We concluded that implementing the SEDL technique is an effective way to strengthen the robustness and increase the performance of the pipeline for classifying pancreas CT medical images. Interestingly, grouping like-minded or talented learners does not make a difference.
Collapse
Affiliation(s)
| | - Serestina Viriri
- School of Mathematics Statistics & Computer Science, College of Agriculture, Engineering and Science, University of KwaZulu-Natal, Durban, South Africa
| |
Collapse
|
11
|
Al-Azzwi ZHN, Nazarov A. Brain Tumor Classification based on Improved Stacked Ensemble Deep Learning Methods. Asian Pac J Cancer Prev 2023; 24:2141-2148. [PMID: 37378946 PMCID: PMC10505861 DOI: 10.31557/apjcp.2023.24.6.2141] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2023] [Accepted: 06/21/2023] [Indexed: 06/29/2023] Open
Abstract
OBJECTIVE Brain Tumor diagnostic prediction is essential for assisting radiologists and other healthcare professionals in identifying and classifying brain tumors. For the diagnosis and treatment of cancer diseases, prediction and classification accuracy are crucial. The aim of this study was to improve ensemble deep learning models for classifing brain tumor and increase the performance of structure models by combining different model of deep learning to develop a model with more accurate predictions than the individual models. METHODS Convolutional neural networks (CNNs), which are made up of a single algorithm called CNN model, are the foundation of most current methods for classifying cancer illness images. The model CNN is combined with other models to create other methods of classification called ensemble method. However, compared to a single machine learning algorithm, ensemble machine learning models are more accurate. This study used stacked ensemble deep learning technology. The data set used in this study was obtained from Kaggle and included two categories: abnormal & normal brains. The data set was trained with three models: VGG19, Inception v3, and Resnet 10. RESULT The 96.6% accuracy for binary classification (0,1) have been achieved by stacked ensemble deep learning model with Loss binary cross entropy, and Adam optimizer take into consideration with stacking models. CONCLUSION The stacked ensemble deep learning model can be improved over a single framework.
Collapse
Affiliation(s)
- Zobeda Hatif Naji Al-Azzwi
- School of Radio Engineering and Computer Technology, Moscow Institute of Physics and Technology, Moscow, Russian Federation.
| | - A.N Nazarov
- Federal Research Center “Computer Science and Control” of the Russian Academy of Sciences, Moscow, Russian Federation.
| |
Collapse
|
12
|
A deep learning-based framework for automatic detection of drug resistance in tuberculosis patients. EGYPTIAN INFORMATICS JOURNAL 2023. [DOI: 10.1016/j.eij.2023.01.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
|