1
|
Ahmad I, Singh VP, Gore MM. Detection of Diabetic Retinopathy Using Discrete Wavelet-Based Center-Symmetric Local Binary Pattern and Statistical Features. JOURNAL OF IMAGING INFORMATICS IN MEDICINE 2024:10.1007/s10278-024-01243-2. [PMID: 39237836 DOI: 10.1007/s10278-024-01243-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/04/2024] [Revised: 07/19/2024] [Accepted: 08/19/2024] [Indexed: 09/07/2024]
Abstract
Computer-aided diagnosis (CAD) system assists ophthalmologists in early diabetic retinopathy (DR) detection by automating the analysis of retinal images, enabling timely intervention and treatment. This paper introduces a novel CAD system based on the global and multi-resolution analysis of retinal images. As a first step, we enhance the quality of the retinal images by applying a sequence of preprocessing techniques, which include the median filter, contrast limited adaptive histogram equalization (CLAHE), and the unsharp filter. These preprocessing steps effectively eliminate noise and enhance the contrast in the retinal images. Further, these images are represented at multi-scales using discrete wavelet transform (DWT), and center symmetric local binary pattern (CSLBP) features are extracted from each scale. The extracted CSLBP features from decomposed images capture the fine and coarse details of the retinal fundus images. Also, statistical features are extracted to capture the global characteristics and provide a comprehensive representation of retinal fundus images. The detection performances of these features are evaluated on a benchmark dataset using two machine learning models, i.e., SVM and k-NN, and found that the performance of the proposed work is considerably more encouraging than other existing methods. Furthermore, the results demonstrate that when wavelet-based CSLBP features are combined with statistical features, they yield notably improved detection performance compared to using these features individually.
Collapse
Affiliation(s)
- Imtiyaz Ahmad
- Department of Computer Science and Engineering, Motilal Nehru National Institute of Technology Allahabad, Prayagraj, 211004, UP, India.
| | - Vibhav Prakash Singh
- Department of Computer Science and Engineering, Motilal Nehru National Institute of Technology Allahabad, Prayagraj, 211004, UP, India
| | - Manoj Madhava Gore
- Department of Computer Science and Engineering, Motilal Nehru National Institute of Technology Allahabad, Prayagraj, 211004, UP, India
| |
Collapse
|
2
|
Vasireddi HK, K SD, Reddy GNVR. An enumerative pre-processing approach for retinopathy severity grading using an interpretable classifier: a comparative study. Graefes Arch Clin Exp Ophthalmol 2024; 262:2247-2267. [PMID: 38400856 DOI: 10.1007/s00417-024-06396-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2023] [Revised: 01/23/2024] [Accepted: 01/30/2024] [Indexed: 02/26/2024] Open
Abstract
BACKGROUND Diabetic retinopathy (DR) is a serious eye complication that results in permanent vision damage. As the number of patients suffering from DR increases, so does the delay in treatment for DR diagnosis. To bridge this gap, an efficient DR screening system that assists clinicians is required. Although many artificial intelligence (AI) screening systems have been deployed in recent years, accuracy remains a metric that can be improved. METHODS An enumerative pre-processing approach is implemented in the deep learning model to attain better accuracies for DR severity grading. The proposed approach is compared with various pre-trained models, and the necessary performance metrics were tabulated. This paper also presents the comparative analysis of various optimization algorithms that are utilized in the deep network model, and the results were outlined. RESULTS The experimental results are carried out on the MESSIDOR dataset to assess the performance. The experimental results show that an enumerative pipeline combination K1-K2-K3-DFNN-LOA shows better results when compared with other combinations. When compared with various optimization algorithms and pre-trained models, the proposed model has better performance with maximum accuracy, precision, recall, F1 score, and macro-averaged metric of 97.60%, 94.60%, 98.40%, 94.60%, and 0.97, respectively. CONCLUSION This study focussed on developing and implementing a DR screening system on color fundus photographs. This artificial intelligence-based system offers the possibility to enhance the efficacy and approachability of DR diagnosis.
Collapse
Affiliation(s)
- Hemanth Kumar Vasireddi
- Computer Science and Engineering, National Institute of Technology, Silchar, 788010, Assam, India
- Computer Science Engineering, Raghu Engineering College, Visakhapatnam, 531162, Andhra Pradesh, India
| | - Suganya Devi K
- Computer Science and Engineering, National Institute of Technology, Silchar, 788010, Assam, India.
| | - G N V Raja Reddy
- Computer Science and Engineering, National Institute of Technology, Silchar, 788010, Assam, India
- Computer Science Engineering, GITAM University, Visakhapatnam, 530045, Andhra Pradesh, India
| |
Collapse
|
3
|
Fu Y, Wei Y, Chen S, Chen C, Zhou R, Li H, Qiu M, Xie J, Huang D. UC-stack: a deep learning computer automatic detection system for diabetic retinopathy classification. Phys Med Biol 2024; 69:045021. [PMID: 38271723 DOI: 10.1088/1361-6560/ad22a1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2023] [Accepted: 01/25/2024] [Indexed: 01/27/2024]
Abstract
Object. The existing diagnostic paradigm for diabetic retinopathy (DR) greatly relies on subjective assessments by medical practitioners utilizing optical imaging, introducing susceptibility to individual interpretation. This work presents a novel system for the early detection and grading of DR, providing an automated alternative to the manual examination.Approach. First, we use advanced image preprocessing techniques, specifically contrast-limited adaptive histogram equalization and Gaussian filtering, with the goal of enhancing image quality and module learning capabilities. Second, a deep learning-based automatic detection system is developed. The system consists of a feature segmentation module, a deep learning feature extraction module, and an ensemble classification module. The feature segmentation module accomplishes vascular segmentation, the deep learning feature extraction module realizes the global feature and local feature extraction of retinopathy images, and the ensemble module performs the diagnosis and classification of DR for the extracted features. Lastly, nine performance evaluation metrics are applied to assess the quality of the model's performance.Main results. Extensive experiments are conducted on four retinal image databases (APTOS 2019, Messidor, DDR, and EyePACS). The proposed method demonstrates promising performance in the binary and multi-classification tasks for DR, evaluated through nine indicators, including AUC and quadratic weighted Kappa score. The system shows the best performance in the comparison of three segmentation methods, two convolutional neural network architecture models, four Swin Transformer structures, and the latest literature methods.Significance. In contrast to existing methods, our system demonstrates superior performance across multiple indicators, enabling accurate screening of DR and providing valuable support to clinicians in the diagnostic process. Our automated approach minimizes the reliance on subjective assessments, contributing to more consistent and reliable DR evaluations.
Collapse
Affiliation(s)
- Yong Fu
- The Life Sciences Research Institute of Guangxi Medical University, Nanning, 530021, People's Republic of China
| | - Yuekun Wei
- School of Information and Management, Guangxi Medical University, Nanning, 530021, People's Republic of China
| | - Siying Chen
- School of Basic Medical Sciences, Guangxi Medical University, Nanning, 530021, People's Republic of China
| | - Caihong Chen
- The Life Sciences Research Institute of Guangxi Medical University, Nanning, 530021, People's Republic of China
| | - Rong Zhou
- The Life Sciences Research Institute of Guangxi Medical University, Nanning, 530021, People's Republic of China
| | - Hongjun Li
- The Life Sciences Research Institute of Guangxi Medical University, Nanning, 530021, People's Republic of China
| | - Mochan Qiu
- School of Basic Medical Sciences, Guangxi Medical University, Nanning, 530021, People's Republic of China
| | - Jin Xie
- The Life Sciences Research Institute of Guangxi Medical University, Nanning, 530021, People's Republic of China
| | - Daizheng Huang
- The Life Sciences Research Institute of Guangxi Medical University, Nanning, 530021, People's Republic of China
| |
Collapse
|
4
|
Wang D, Lian J, Jiao W. Multi-label classification of retinal disease via a novel vision transformer model. Front Neurosci 2024; 17:1290803. [PMID: 38260025 PMCID: PMC10800810 DOI: 10.3389/fnins.2023.1290803] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Accepted: 12/18/2023] [Indexed: 01/24/2024] Open
Abstract
Introduction The precise identification of retinal disorders is of utmost importance in the prevention of both temporary and permanent visual impairment. Prior research has yielded encouraging results in the classification of retinal images pertaining to a specific retinal condition. In clinical practice, it is not uncommon for a single patient to present with multiple retinal disorders concurrently. Hence, the task of classifying retinal images into multiple labels remains a significant obstacle for existing methodologies, but its successful accomplishment would yield valuable insights into a diverse array of situations simultaneously. Methods This study presents a novel vision transformer architecture called retinal ViT, which incorporates the self-attention mechanism into the field of medical image analysis. To note that this study supposed to prove that the transformer-based models can achieve competitive performance comparing with the CNN-based models, hence the convolutional modules have been eliminated from the proposed model. The suggested model concludes with a multi-label classifier that utilizes a feed-forward network architecture. This classifier consists of two layers and employs a sigmoid activation function. Results and discussion The experimental findings provide evidence of the improved performance exhibited by the suggested model when compared to state-of-the-art approaches such as ResNet, VGG, DenseNet, and MobileNet, on the publicly available dataset ODIR-2019, and the proposed approach has outperformed the state-of-the-art algorithms in terms of Kappa, F1 score, AUC, and AVG.
Collapse
Affiliation(s)
- Dong Wang
- School of Information Science and Electrical Engineering, Shandong Jiaotong University, Jinan, China
| | - Jian Lian
- School of Intelligence Engineering, Shandong Management University, Jinan, China
| | - Wanzhen Jiao
- Department of Ophthalmology, Shandong Provincial Hospital Affiliated to Shandong First Medical University, Jinan, China
| |
Collapse
|
5
|
Jamali Dogahe S, Garmany A, Sadegh Mousavi S, Khanna CL. Predicting 60-4 visual field tests using 3D facial reconstruction. Br J Ophthalmol 2023; 108:112-116. [PMID: 36428007 PMCID: PMC10209349 DOI: 10.1136/bjo-2022-321651] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2022] [Accepted: 11/11/2022] [Indexed: 11/26/2022]
Abstract
BACKGROUND Despite, the potential clinical utility of 60-4 visual fields, they are not frequently used in clinical practice partly, due to the purported impact of facial contour on field defects. The purpose of this study was to design and test an artificial intelligence-driven platform to predict facial structure-dependent visual field defects on 60-4 visual field tests. METHODS Subjects with no ocular pathology were included. Participants were subject to optical coherence tomography, 60-4 Swedish interactive thresholding algorithm visual field tests and photography. The predicted visual field was compared with observed 60-4 visual field results in subjects. Average and point-specific sensitivity, specificity, precision, negative predictive value, accuracy, and F1-scores were primary outcome measures. RESULTS 30 healthy were enrolled. Three-dimensional facial reconstruction using a convolution neural network (CNN) was able to predict facial contour-dependent 60-4 visual field defects in 30 subjects without ocular pathology. Overall model accuracy was 97%±3% and 96%±3% and the F1-score, dependent on precision and sensitivity, was 58%±19% and 55%±15% for the right eye and left eye, respectively. Spatial-dependent model performance was observed with increased sensitivity and precision within the far inferior nasal field reflected by an average F1-score of 76%±20% and 70%±29% for the right eye and left eye, respectively. CONCLUSIONS This pilot study reports the development of a CNN-enhanced platform capable of predicting 60-4 visual field defects in healthy controls based on facial contour. Further study with this platform may enhance understanding of the influence of facial contour on 60-4 visual field testing.
Collapse
Affiliation(s)
| | - Armin Garmany
- Graduate School of Biomedical Sciences, Alix School of Medicine, Medical Scientist Training Program, Mayo Clinic, Rochester, Minnesota, USA
| | | | - Cheryl L Khanna
- Department of Ophthalmology, Mayo Clinic, Rochester, Minnesota, USA
| |
Collapse
|
6
|
Islam MM, Rahman MJ, Rabby MS, Alam MJ, Pollob SMAI, Ahmed NAMF, Tawabunnahar M, Roy DC, Shin J, Maniruzzaman M. Predicting the risk of diabetic retinopathy using explainable machine learning algorithms. Diabetes Metab Syndr 2023; 17:102919. [PMID: 38091881 DOI: 10.1016/j.dsx.2023.102919] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/19/2023] [Revised: 11/24/2023] [Accepted: 11/26/2023] [Indexed: 12/31/2023]
Abstract
BACKGROUND AND OBJECTIVE Diabetic retinopathy (DR) is a global health concern among diabetic patients. The objective of this study was to propose an explainable machine learning (ML)-based system for predicting the risk of DR. MATERIALS AND METHODS This study utilized publicly available cross-sectional data in a Chinese cohort of 6374 respondents. We employed boruta and least absolute shrinkage and selection operator (LASSO) based feature selection methods to identify the common predictors of DR. Using the identified predictors, we trained and optimized four widly applicable models (artificial neural network, support vector machine, random forest, and extreme gradient boosting (XGBoost) to predict patients with DR. Moreover, shapely additive explanation (SHAP) was adopted to show the contribution of each predictor of DR in the prediction. RESULTS Combining Boruta and LASSO method revealed that community, TCTG, HDLC, BUN, FPG, HbAlc, weight, and duration were the most important predictors of DR. The XGBoost-based model outperformed the other models, with an accuracy of 90.01%, precision of 91.80%, recall of 97.91%, F1 score of 94.86%, and AUC of 0.850. Moreover, SHAP method showed that HbA1c, community, FPG, TCTG, duration, and UA1b were the influencing predictors of DR. CONCLUSION The proposed integrating system will be helpful as a tool for selecting significant predictors, which can predict patients who are at high risk of DR at an early stage in China.
Collapse
Affiliation(s)
- Md Merajul Islam
- Department of Statistics, University of Rajshahi, Rajshahi-6205, Bangladesh; Department of Statistics, Jatiya Kabi Kazi Nazrul Islam University, Mymensingh-2224, Bangladesh.
| | - Md Jahanur Rahman
- Department of Statistics, University of Rajshahi, Rajshahi-6205, Bangladesh.
| | - Md Symun Rabby
- Department of Statistics, Jatiya Kabi Kazi Nazrul Islam University, Mymensingh-2224, Bangladesh.
| | - Md Jahangir Alam
- Department of Statistics, University of Rajshahi, Rajshahi-6205, Bangladesh.
| | | | - N A M Faisal Ahmed
- Instutite of Education and Research, University of Rajshahi, Rajshahi-6205, Bangladesh.
| | - Most Tawabunnahar
- Department of Statistics, Jatiya Kabi Kazi Nazrul Islam University, Mymensingh-2224, Bangladesh.
| | - Dulal Chandra Roy
- Department of Statistics, University of Rajshahi, Rajshahi-6205, Bangladesh.
| | - Junpil Shin
- School of Computer Science and Engineering, The University of Aizu, Aizuwakamatsu, 965-8580, Fukushima, Japan.
| | - Md Maniruzzaman
- Statistics Discipline, Khulna University, Khulna-9208, Bangladesh.
| |
Collapse
|
7
|
Rafay A, Asghar Z, Manzoor H, Hussain W. EyeCNN: exploring the potential of convolutional neural networks for identification of multiple eye diseases through retinal imagery. Int Ophthalmol 2023; 43:3569-3586. [PMID: 37291412 DOI: 10.1007/s10792-023-02764-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2023] [Accepted: 05/21/2023] [Indexed: 06/10/2023]
Abstract
BACKGROUND The eyes are the most important part of the human body as these are directly connected to the brain and help us perceive the imagery in daily life whereas, eye diseases are mostly ignored and underestimated until it is too late. Diagnosing eye disorders through manual diagnosis by the physician can be very costly and time taking. OBJECTIVE Thus, to tackle this, a novel method namely EyeCNN is proposed for identifying eye diseases through retinal images using EfficientNet B3. METHODS A dataset of retinal imagery of three diseases, i.e. Diabetic Retinopathy, Glaucoma, and Cataract is used to train 12 convolutional networks while EfficientNet B3 was the topperforming model out of all 12 models with a testing accuracy of 94.30%. RESULTS After preprocessing of the dataset and training of models, various experimentations were performed to see where our model stands. The evaluation was performed using some well-defined measures and the final model was deployed on the Streamlit server as a prototype for public usage. The proposed model has the potential to help diagnose eye diseases early, which can facilitate timely treatment. CONCLUSION The use of EyeCNN for classifying eye diseases has the potential to aid ophthalmologists in diagnosing conditions accurately and efficiently. This research may also lead to a deeper understanding of these diseases and it may lead to new treatments. The webserver of EyeCNN can be accessed at ( https://abdulrafay97-eyecnn-app-rd9wgz.streamlit.app/ ).
Collapse
Affiliation(s)
- Abdul Rafay
- Department of Computer Science, School of Systems and Technology, University of Management and Technology, Lahore, Pakistan
| | - Zaeem Asghar
- Department of Computer Science, School of Systems and Technology, University of Management and Technology, Lahore, Pakistan
| | - Hamza Manzoor
- Department of Computer Science, School of Systems and Technology, University of Management and Technology, Lahore, Pakistan
| | - Waqar Hussain
- Department of Artificial Intelligence, School of Systems and Technology, University of Management and Technology, Lahore, Pakistan.
| |
Collapse
|
8
|
Durai DBJ, Jaya T. Automatic severity grade classification of diabetic retinopathy using deformable ladder Bi attention U-net and deep adaptive CNN. Med Biol Eng Comput 2023:10.1007/s11517-023-02860-9. [PMID: 37338737 DOI: 10.1007/s11517-023-02860-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2023] [Accepted: 05/25/2023] [Indexed: 06/21/2023]
Abstract
Long-term exposure to diabetes mellitus leads to the formation of diabetic retinopathy (DR), which can cause vision loss in working-age adults. Early stage diagnosis of DR is highly essential for preventing vision loss and preserving vision in people with diabetes. The motivation behind the severity grade classification of DR is to develop an automated system that can assist ophthalmologists and healthcare professionals in the diagnosis and management of DR. However, existing methods suffer from variability in image quality, similar structures of the normal and lesion regions, high dimensional features, variability in disease manifestations, small datasets, high training loss, model complexity, and overfitting, which leads to high misclassification errors in the severity grading system. Hence, there is a need to develop an automated system using improved deep learning techniques to provide a reliable and consistent grading of DR severity with high classification accuracy using fundus images. To solve these issues, we proposes a Deformable Ladder Bi attention U-shaped encoder-decoder network and Deep Adaptive Convolutional Neural Network (DLBUnet-DACNN) for accurate severity classification of DR. The DLBUnet performs lesion segmentation that can be divided into three parts: the encoder, the central processing module and the decoder. In the encoder part, deformable convolution is used instead of convolution to learn different shapes of the lesion by understanding the offset location. Afterwards, Ladder Atrous Spatial Pyramidal Pooling (LASPP) using variable dilation rates is introduced in the central processing module. LASPP enhance the tiny lesion features and variable dilation rates avoid gridding effects and can learn better global context information. Then the decoder part uses a bi-attention layer contains spatial and channel attention, which can learn contour and edges of the lesion accurately. Finally, the severity of DR is classified using a DACNN by extracting the discriminative features from the segmentation results. Experiments are conducted on the Messidor-2, Kaggle, and Messidor datasets. Our proposed method DLBUnet-DACNN achieves better results in terms of accuracy of 98.2, recall of 0.987, kappa coefficient of 0.993, precision of 0.98, F1-score of 0.981, Matthews Correlation Coefficient (MCC) of 0.93 and Classification Success Index (CSI) of 0.96 when compared to existing methods.
Collapse
Affiliation(s)
- D Binny Jeba Durai
- Department of Electronics and Communication Engineering, Udaya School of Engineering, Vellamodi, India.
| | - T Jaya
- Department of Electronics and Communication Engineering, C.S.I. Institute of Technology, Thovalai, India
| |
Collapse
|