1
|
Rai HM, Trivedi A, Shukla S. ECG signal processing for abnormalities detection using multi-resolution wavelet transform and Artificial Neural Network classifier. MEASUREMENT 2013; 46:3238-3246. [DOI: 10.1016/j.measurement.2013.05.021] [Citation(s) in RCA: 57] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/08/2023] [Imported: 08/08/2023]
|
|
12 |
57 |
2
|
Rai HM, Chatterjee K. Hybrid CNN-LSTM deep learning model and ensemble technique for automatic detection of myocardial infarction using big ECG data. APPL INTELL 2022; 52:5366-5384. [DOI: 10.1007/s10489-021-02696-6] [Citation(s) in RCA: 30] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2021] [Accepted: 07/16/2021] [Indexed: 10/20/2022] [Imported: 08/08/2023]
|
|
3 |
30 |
3
|
Rai HM, Chatterjee K. A unique feature extraction using MRDWT for automatic classification of abnormal heartbeat from ECG big data with Multilayered Probabilistic Neural Network classifier. Appl Soft Comput 2018; 72:596-608. [DOI: 10.1016/j.asoc.2018.04.005] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] [Imported: 08/08/2023]
|
|
7 |
18 |
4
|
Rai HM, Chatterjee K. Hybrid adaptive algorithm based on wavelet transform and independent component analysis for denoising of MRI images. MEASUREMENT 2019; 144:72-82. [DOI: 10.1016/j.measurement.2019.05.028] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/08/2023] [Imported: 08/08/2023]
|
|
6 |
13 |
5
|
Rai HM, Chatterjee K. A Novel Adaptive Feature Extraction for Detection of Cardiac Arrhythmias Using Hybrid Technique MRDWT & MPNN Classifier from ECG Big Data. BIG DATA RESEARCH 2018; 12:13-22. [DOI: 10.1016/j.bdr.2018.02.003] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/08/2023] [Imported: 08/08/2023]
|
|
7 |
11 |
6
|
Rai HM, Trivedi A, Chatterjee K, Shukla S. R-Peak Detection using Daubechies Wavelet and ECG Signal Classification using Radial Basis Function Neural Network. JOURNAL OF THE INSTITUTION OF ENGINEERS (INDIA): SERIES B 2014; 95:63-71. [DOI: 10.1007/s40031-014-0073-4] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/08/2023] [Imported: 08/08/2023]
|
|
11 |
11 |
7
|
Rai HM, Chatterjee K, Dashkevich S. Automatic and accurate abnormality detection from brain MR images using a novel hybrid UnetResNext-50 deep CNN model. Biomed Signal Process Control 2021; 66:102477. [DOI: 10.1016/j.bspc.2021.102477] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] [Imported: 08/08/2023]
|
|
4 |
11 |
8
|
Rai HM, Chatterjee K, Dashkevych S. The prediction of cardiac abnormality and enhancement in minority class accuracy from imbalanced ECG signals using modified deep neural network models. Comput Biol Med 2022; 150:106142. [PMID: 36182760 DOI: 10.1016/j.compbiomed.2022.106142] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2022] [Revised: 09/04/2022] [Accepted: 09/18/2022] [Indexed: 11/03/2022] [Imported: 08/08/2023]
Abstract
Cardiovascular disease (CVD) is the most fatal disease in the world, so its accurate and automated detection in the early stages will certainly support the medical expert in timely diagnosis and treatment, which can save many lives. Many types of research have been carried out in this regard, but due to the problem of data imbalance in the medical and health care sector, it may not provide the desired results in all aspects. To overcome this problem, a sequential ensemble technique has been proposed that detects 6 types of cardiac arrhythmias on large ECG imbalanced datasets, and the data imbalanced issue of the ECG dataset has been addressed by using a hybrid data resampling technique called "Synthetically Minority Oversampling Technique and Tomek Link (SMOTE + Tomek)". The sequential ensemble technique employs two distinct deep learning models: Convolutional Neural Network (CNN) and a hybrid model, CNN with Long Short-Term Memory Network (CNN-LSTM). The two standard datasets "MIT-BIH arrhythmias database" (MITDB) and "PTB diagnostic database" (PTBDB) are combined and extracted 23, 998 ECG beats for the model validation. In this work, the three models CNN, CNN-LSTM, and ensemble approach were tested on four kinds of ECG datasets: the original data (imbalanced), the data sampled using a random oversampled technique, data sampled using SMOTE, and the dataset resampled using SMOTE + Tomek algorithm. The overall highest accuracy was obtained of 99.02% on the SMOTE + Tomek sampled dataset by ensemble technique and the minority class accuracy result (Recall) is improved by 20% as compared to the imbalanced data.
Collapse
|
|
3 |
9 |
9
|
Rai HM, Chatterjee K. Detection of brain abnormality by a novel Lu-Net deep neural CNN model from MR images. MACHINE LEARNING WITH APPLICATIONS 2020; 2:100004. [DOI: 10.1016/j.mlwa.2020.100004] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] [Imported: 08/09/2023] Open
|
|
5 |
6 |
10
|
Rai HM, Chatterjee K. 2D MRI image analysis and brain tumor detection using deep learning CNN model LeU-Net. MULTIMEDIA TOOLS AND APPLICATIONS 2021; 80:36111-36141. [DOI: 10.1007/s11042-021-11504-9] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/29/2020] [Revised: 07/03/2021] [Accepted: 08/19/2021] [Indexed: 08/08/2023] [Imported: 08/08/2023]
|
|
4 |
6 |
11
|
Rai HM. Cancer detection and segmentation using machine learning and deep learning techniques: a review. MULTIMEDIA TOOLS AND APPLICATIONS 2023; 83:27001-27035. [DOI: 10.1007/s11042-023-16520-5] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/16/2022] [Revised: 05/12/2023] [Accepted: 08/13/2023] [Indexed: 09/16/2023] [Imported: 09/16/2023]
|
|
2 |
5 |
12
|
Rai HM, Chatterjee K, Mukherjee C. Hybrid CNN-LSTM model for automatic prediction of cardiac arrhythmias from ECG big data. 2020 IEEE 7TH UTTAR PRADESH SECTION INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS AND COMPUTER ENGINEERING (UPCON) 2020:1-6. [DOI: 10.1109/upcon50219.2020.9376450] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/09/2023] [Imported: 08/09/2023]
|
|
5 |
4 |
13
|
Rai HM, Yoo J, Dashkevych S. GAN-SkipNet: A Solution for Data Imbalance in Cardiac Arrhythmia Detection Using Electrocardiogram Signals from a Benchmark Dataset. MATHEMATICS 2024; 12:2693. [DOI: 10.3390/math12172693] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/11/2025] [Imported: 01/11/2025]
Abstract
Electrocardiography (ECG) plays a pivotal role in monitoring cardiac health, yet the manual analysis of ECG signals is challenging due to the complex task of identifying and categorizing various waveforms and morphologies within the data. Additionally, ECG datasets often suffer from a significant class imbalance issue, which can lead to inaccuracies in detecting minority class samples. To address these challenges and enhance the effectiveness and efficiency of cardiac arrhythmia detection from imbalanced ECG datasets, this study proposes a novel approach. This research leverages the MIT-BIH arrhythmia dataset, encompassing a total of 109,446 ECG beats distributed across five classes following the Association for the Advancement of Medical Instrumentation (AAMI) standard. Given the dataset’s inherent class imbalance, a 1D generative adversarial network (GAN) model is introduced, incorporating the Bi-LSTM model to synthetically generate the two minority signal classes, which represent a mere 0.73% fusion (F) and 2.54% supraventricular (S) of the data. The generated signals are rigorously evaluated for similarity to real ECG data using three key metrics: mean squared error (MSE), structural similarity index (SSIM), and Pearson correlation coefficient (r). In addition to addressing data imbalance, the work presents three deep learning models tailored for ECG classification: SkipCNN (a convolutional neural network with skip connections), SkipCNN+LSTM, and SkipCNN+LSTM+Attention mechanisms. To further enhance efficiency and accuracy, the test dataset is rigorously assessed using an ensemble model, which consistently outperforms the individual models. The performance evaluation employs standard metrics such as precision, recall, and F1-score, along with their average, macro average, and weighted average counterparts. Notably, the SkipCNN+LSTM model emerges as the most promising, achieving remarkable precision, recall, and F1-scores of 99.3%, which were further elevated to an impressive 99.60% through ensemble techniques. Consequently, with this innovative combination of data balancing techniques, the GAN-SkipNet model not only resolves the challenges posed by imbalanced data but also provides a robust and reliable solution for cardiac arrhythmia detection. This model stands poised for clinical applications, offering the potential to be deployed in hospitals for real-time cardiac arrhythmia detection, thereby benefiting patients and healthcare practitioners alike.
Collapse
|
|
1 |
2 |
14
|
Rai HM, Chatterjee K, Dubey A, Srivastava P. Myocardial Infarction Detection Using Deep Learning and Ensemble Technique from ECG Signals. LECTURE NOTES IN NETWORKS AND SYSTEMS 2021:717-730. [DOI: 10.1007/978-981-16-0733-2_51] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/09/2023] [Imported: 08/09/2023]
|
|
4 |
2 |
15
|
Ahmed H, Shukla S, Rai HM. Static Handwritten Signature Recognition Using Discrete Random Transform and Combined Projection Based Technique. 2014 FOURTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING & COMMUNICATION TECHNOLOGIES 2014:37-41. [DOI: 10.1109/acct.2014.76] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/09/2023] [Imported: 08/09/2023]
|
|
11 |
1 |
16
|
Rai HM, Trivedi A. ECG signal classification using wavelet transform and Back Propagation Neural Network. 2012 5TH INTERNATIONAL CONFERENCE ON COMPUTERS AND DEVICES FOR COMMUNICATION (CODEC) 2012:1-4. [DOI: 10.1109/codec.2012.6509183] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/09/2023] [Imported: 08/09/2023]
|
|
13 |
1 |
17
|
Rai HM, Dashkevych S, Yoo J. Next-Generation Diagnostics: The Impact of Synthetic Data Generation on the Detection of Breast Cancer from Ultrasound Imaging. MATHEMATICS 2024; 12:2808. [DOI: 10.3390/math12182808] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/11/2025] [Imported: 01/11/2025]
Abstract
Breast cancer is one of the most lethal and widespread diseases affecting women worldwide. As a result, it is necessary to diagnose breast cancer accurately and efficiently utilizing the most cost-effective and widely used methods. In this research, we demonstrated that synthetically created high-quality ultrasound data outperformed conventional augmentation strategies for efficiently diagnosing breast cancer using deep learning. We trained a deep-learning model using the EfficientNet-B7 architecture and a large dataset of 3186 ultrasound images acquired from multiple publicly available sources, as well as 10,000 synthetically generated images using generative adversarial networks (StyleGAN3). The model was trained using five-fold cross-validation techniques and validated using four metrics: accuracy, recall, precision, and the F1 score measure. The results showed that integrating synthetically produced data into the training set increased the classification accuracy from 88.72% to 92.01% based on the F1 score, demonstrating the power of generative models to expand and improve the quality of training datasets in medical-imaging applications. This demonstrated that training the model using a larger set of data comprising synthetic images significantly improved its performance by more than 3% over the genuine dataset with common augmentation. Various data augmentation procedures were also investigated to improve the training set’s diversity and representativeness. This research emphasizes the relevance of using modern artificial intelligence and machine-learning technologies in medical imaging by providing an effective strategy for categorizing ultrasound images, which may lead to increased diagnostic accuracy and optimal treatment options. The proposed techniques are highly promising and have strong potential for future clinical application in the diagnosis of breast cancer.
Collapse
|
|
1 |
1 |
18
|
Gupta S, Sharmila, Rai HM. IoT-Based Automatic Irrigation System Using Robotic Vehicle. ALGORITHMS FOR INTELLIGENT SYSTEMS 2021:669-677. [DOI: 10.1007/978-981-15-4936-6_73] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/09/2023] [Imported: 08/09/2023]
|
|
4 |
1 |
19
|
Patrick U, Rao SK, Jagan BOL, Rai HM, Agarwal S, Pak W. Radar-Based Target Tracking Using Deep Learning Approaches with Unscented Kalman Filter. APPLIED SCIENCES 2024; 14:8332. [DOI: 10.3390/app14188332] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/11/2025] [Imported: 01/11/2025]
Abstract
Machine learning, a rapidly growing field, has attracted numerous researchers for its ability to automatically learn from and make predictions based on data. This manuscript presents an innovative approach to estimating the covariance matrix of noise in radar measurements for target tracking, resulting from collaborative efforts. Traditionally, researchers have assumed that the covariance matrix of noise in sonar measurements is present in the vast majority of literature related to target tracking. On the other hand, this research aims to estimate it by employing deep learning algorithms with noisy measurements in range, bearing, and elevation from radar sensors. This collaborative approach, involving multiple disciplines, provides a more precise and accurate covariance matrix estimate. Additionally, the unscented Kalman filter was combined with the gated recurrent unit, multilayer perceptron, convolutional neural network, and long short-term memory to accomplish the task of 3D target tracking in an airborne environment. The quantification of the results was achieved through the use of Monte Carlo simulations, which demonstrated that the convolutional neural network performed better than any other approach. The system was simulated using a Python program, and the proposed method offers higher accuracy and faster convergence time than conventional target tracking methods. This is a demonstration of the potential that collaboration can have in research.
Collapse
|
|
1 |
1 |
20
|
Rai HM, Chatterjee K, Nayyar A. Automatic Segmentation and Classification of Brain Tumor from MR Images Using DWT-RBFNN. STUDIES IN BIG DATA 2021:215-243. [DOI: 10.1007/978-3-030-75657-4_10] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/09/2023] [Imported: 08/09/2023]
|
|
4 |
|
21
|
Moqurrab SA, Rai HM, Yoo J. HRIDM: Hybrid Residual/Inception-Based Deeper Model for Arrhythmia Detection from Large Sets of 12-Lead ECG Recordings. ALGORITHMS 2024; 17:364. [DOI: 10.3390/a17080364] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/11/2024] [Imported: 09/11/2024]
Abstract
Heart diseases such as cardiovascular and myocardial infarction are the foremost reasons of death in the world. The timely, accurate, and effective prediction of heart diseases is crucial for saving lives. Electrocardiography (ECG) is a primary non-invasive method to identify cardiac abnormalities. However, manual interpretation of ECG recordings for heart disease diagnosis is a time-consuming and inaccurate process. For the accurate and efficient detection of heart diseases from the 12-lead ECG dataset, we have proposed a hybrid residual/inception-based deeper model (HRIDM). In this study, we have utilized ECG datasets from various sources, which are multi-institutional large ECG datasets. The proposed model is trained on 12-lead ECG data from over 10,000 patients. We have compared the proposed model with several state-of-the-art (SOTA) models, such as LeNet-5, AlexNet, VGG-16, ResNet-50, Inception, and LSTM, on the same training and test datasets. To show the effectiveness of the computational efficiency of the proposed model, we have only trained over 20 epochs without GPU support and we achieved an accuracy of 50.87% on the test dataset for 27 categories of heart abnormalities. We found that our proposed model outperformed the previous studies which participated in the official PhysioNet/CinC Challenge 2020 and achieved fourth place as compared with the 41 official ranking teams. The result of this study indicates that the proposed model is an implying new method for predicting heart diseases using 12-lead ECGs.
Collapse
|
|
1 |
|
22
|
Rai HM, Shukla KK, Tightiz L, Padmanaban S. Enhancing data security and privacy in energy applications: Integrating IoT and blockchain technologies. Heliyon 2024; 10:e38917. [PMID: 39430499 PMCID: PMC11490785 DOI: 10.1016/j.heliyon.2024.e38917] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2024] [Revised: 09/30/2024] [Accepted: 10/02/2024] [Indexed: 10/22/2024] [Imported: 01/11/2025] Open
Abstract
The integration of blockchain technology with the IoToffers numerous opportunities to enhance the privacy, security, and integrity. This study comprehensively analyze the challenges, scope, and potential solutions associated with integrating blockchain technology and the IoT, with a specific emphasis on nuclear energy applications. We discuss the roles and various aspects of blockchain and the IoT, highlighting their multiple dimensions and applications. Our study develops a secure data management framework that incorporates encryption, integrity verification, an integrated communication network, and a robust data flow architecture. We explore the several aspects of data security, privacy, and integrity, along with the potential solutions in the integration of blockchain and IoT. The study also investigates the secure transaction process, with a specific focus on cryptographic, mathematical, and algorithmic perspectives. We demonstrated the use of blockchain technology in the nuclear energy sector using flow charts, comprehensively addressing the associated security and privacy concerns. While emphasizing the applicability of our methodology to the nuclear sector, we also acknowledge limitations such as requirements for practical validation, challenges with resource-constrained IoT environments, increasing cyberthreats, and limited real-time data availability. The future scope of our study focuses on standardization, scalable blockchain, post-quantum cryptography, privacy, regulations, real-world testbeds, and deep learning for nuclear sector security. Our findings highlight that the integration of blockchain and IoT can significantly enhance the security and privacy of nuclear energy applications, although practical validation and optimization are necessary.
Collapse
|
Review |
1 |
|
23
|
Rai HM, Pal A, Shukla KK, Moqurrab SA, Amanzholova S. Transformative impacts of AI and the IoT on healthcare delivery. ENGINEERING REVIEW 2024; 44:116-137. [DOI: 10.30765/er.2523] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/02/2025] [Imported: 02/02/2025]
Abstract
The healthcare field is undergoing an enormous transformation mainly a result of the integration of artificial intelligence (AI) and the Internet of Things (IoT). This study investigates AI's varied responsibilities, which include patient authorization, scheduling appointments, billing, revenue management, improving patient experience, monitoring bed availability, symptoms triage, and enabling online consultations. AI makes significant improvements to personalized medical care by assisting with the development of individualized treatment plans depending on individual genetic profiles, resulting in improved patient care. Additionally, IoT connects a diverse set of devices, promoting continuous data sharing, efficient operation, and enhanced patient care. However, the extensive use of AI and IoT in healthcare creates serious ethical and privacy concerns, especially over confidentiality and Integrity. The study highlights the importance of a balanced approach that optimizes the advantages of technological advances while protecting patient rights. Furthermore, it explores global trends in healthcare IoT, emphasizing how these breakthroughs are creating substantial changes in the industry. As AI and IoT grow, they could spark additional transformations in healthcare, bringing both potential and challenges. This study explores at the methodology underlying these technologies, the ethical issues they raise, and the potential they have for improving healthcare accessibility and patient outcomes.
Collapse
|
|
1 |
|
24
|
Rai HM, Yoo J. Analysis of Colorectal and Gastric Cancer Classification: A Mathematical Insight Utilizing Traditional Machine Learning Classifiers. MATHEMATICS 2023; 11:4937. [DOI: 10.3390/math11244937] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/11/2024] [Imported: 09/11/2024]
Abstract
Cancer remains a formidable global health challenge, claiming millions of lives annually. Timely and accurate cancer diagnosis is imperative. While numerous reviews have explored cancer classification using machine learning and deep learning techniques, scant literature focuses on traditional ML methods. In this manuscript, we undertake a comprehensive review of colorectal and gastric cancer detection specifically employing traditional ML classifiers. This review emphasizes the mathematical underpinnings of cancer detection, encompassing preprocessing techniques, feature extraction, machine learning classifiers, and performance assessment metrics. We provide mathematical formulations for these key components. Our analysis is limited to peer-reviewed articles published between 2017 and 2023, exclusively considering medical imaging datasets. Benchmark and publicly available imaging datasets for colorectal and gastric cancers are presented. This review synthesizes findings from 20 articles on colorectal cancer and 16 on gastric cancer, culminating in a total of 36 research articles. A significant focus is placed on mathematical formulations for commonly used preprocessing techniques, features, ML classifiers, and assessment metrics. Crucially, we introduce our optimized methodology for the detection of both colorectal and gastric cancers. Our performance metrics analysis reveals remarkable results: 100% accuracy in both cancer types, but with the lowest sensitivity recorded at 43.1% for gastric cancer.
Collapse
|
|
2 |
|
25
|
Pal A, Rai HM, Frej MBH, Razaque A. Advanced Segmentation of Gastrointestinal (GI) Cancer Disease Using a Novel U-MaskNet Model. Life (Basel) 2024; 14:1488. [PMID: 39598286 PMCID: PMC11595444 DOI: 10.3390/life14111488] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2024] [Revised: 11/05/2024] [Accepted: 11/12/2024] [Indexed: 11/29/2024] [Imported: 01/11/2025] Open
Abstract
The purpose of this research is to contribute to the development of approaches for the classification and segmentation of various gastrointestinal (GI) cancer diseases, such as dyed lifted polyps, dyed resection margins, esophagitis, normal cecum, normal pylorus, normal Z line, polyps, and ulcerative colitis. This research is relevant and essential because of the current challenges related to the absence of efficient diagnostic tools for early diagnostics of GI cancers, which are fundamental for improving the diagnosis of these common diseases. To address the above challenges, we propose a new hybrid segmentation model, U-MaskNet, which is a combination of U-Net and Mask R-CNN models. Here, U-Net is utilized for pixel-wise classification and Mask R-CNN for instance segmentation, together forming a solution for classifying and segmenting GI cancer. The Kvasir dataset, which includes 8000 endoscopic images of various GI cancers, is utilized to validate the proposed methodology. The experimental results clearly demonstrated that the novel proposed model provided superior segmentation compared to other well-known models, such as DeepLabv3+, FCN, and DeepMask, as well as improved classification performance compared to state-of-the-art (SOTA) models, including LeNet-5, AlexNet, VGG-16, ResNet-50, and the Inception Network. The quantitative analysis revealed that our proposed model outperformed the other models, achieving a precision of 98.85%, recall of 98.49%, and F1 score of 98.68%. Additionally, the novel model achieved a Dice coefficient of 94.35% and IoU of 89.31%. Consequently, the developed model increased the accuracy and reliability in detecting and segmenting GI cancer, and it was proven that the proposed model can potentially be used for improving the diagnostic process and, consequently, patient care in the clinical environment. This work highlights the benefits of integrating the U-Net and Mask R-CNN models, opening the way for further research in medical image segmentation.
Collapse
|
research-article |
1 |
|