1
|
Selvaraj S, Alsayed AO, Ismail NA, Kavin BP, Onyema EM, Seng GH, Uchechi AQ. Super learner model for classifying leukemia through gene expression monitoring. Discov Oncol 2024; 15:499. [PMID: 39331180 PMCID: PMC11436508 DOI: 10.1007/s12672-024-01337-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/06/2024] [Accepted: 09/11/2024] [Indexed: 09/28/2024] Open
Abstract
Leukemia is a form of cancer that affects the bone marrow and lymphatic system, and it requires complex treatment strategies that vary with each subtype. Due to the subtle morphological differences among these types, monitoring gene expressions is crucial for accurate classification. Manual or pathological testing can be time-consuming and expensive. Therefore, data-driven methods and machine learning algorithms offer an efficient alternative for leukemia classification. This study introduced a novel super learning model that leverages heterogeneous machine learning models to analyze gene expression data and classify leukemia cells. The proposed approach incorporates an entropy-based feature importance technique to identify the gene profiles most significant to the labeling process. The strength of this super learning model lies in its final super learner, Random Forest, which effectively classifies cross-validated data from the candidate learners. Validation on a gene expression monitoring dataset demonstrates that this model outperforms other state-of-the-art models in predictive accuracy. The study contributes to the knowledge regarding the use of advanced machine learning techniques to improve the accuracy and reliability of leukemia classification using gene expression data, addressing the challenges of traditional methods that rely on clinical features and morphological examination.
Collapse
Affiliation(s)
- Sharanya Selvaraj
- Department of Data Science and Business Systems, SRM Institute of Science and Technology, Kattankulathur, Chennai, India, 603203
| | | | - Nor Azman Ismail
- Faculty of Computing, UniversitiTeknologi Malaysia, Johor Bahru, Malaysia
| | - Balasubramanian Prabhu Kavin
- Department of Data Science and Business Systems, SRM Institute of Science and Technology, Kattankulathur, Chennai, India, 603203
| | - Edeh Michael Onyema
- Department of Mathematics and Computer Science, Coal City University, Enugu, Nigeria.
- Adjunct Faculty, Saveetha School of Engineering, Saveetha Institute of Medical and Technical Sciences, Chennai, 602105, India.
| | - Gan Hong Seng
- School of AI and Advanced Computing, XJTLU Entrepreneur College (Taicang), Xi'an Jiaotong-Liverpool University, Suzhou, Jiangsu, People's Republic of China, 215400
| | | |
Collapse
|
2
|
Asar TO, Ragab M. Leukemia detection and classification using computer-aided diagnosis system with falcon optimization algorithm and deep learning. Sci Rep 2024; 14:21755. [PMID: 39294306 PMCID: PMC11410793 DOI: 10.1038/s41598-024-72900-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2024] [Accepted: 09/11/2024] [Indexed: 09/20/2024] Open
Abstract
Leukemia is a type of blood tumour that occurs because of abnormal enhancement in WBCs (white blood cells) in the bone marrow of the human body. Blood-forming tissue cancer influences the lymphatic and bone marrow system. The early diagnosis and detection of leukaemia, i.e., the accurate difference of malignant leukocytes with little expense at the beginning of the disease, is a primary challenge in the disease analysis field. Despite the higher occurrence of leukemia, there is a lack of flow cytometry tools, and the procedures accessible at medical diagnostics centres are time-consuming. Distinct researchers have implemented computer-aided diagnostic (CAD) and machine learning (ML) methods for laboratory image analysis, aiming to manage the restrictions of late leukemia analysis. This study proposes a new Falcon optimization algorithm with deep convolutional neural network for Leukemia detection and classification (FOADCNN-LDC) technique. The main objective of the FOADCNN-LDC technique is to classify and recognize leukemia. The FOADCNN-LDC technique utilizes a median filtering (MF) based noise removal process to eradicate the image noise. Besides, the FOADCNN-LDC technique employs the ShuffleNetv2 model for the feature extraction process. Moreover, the detection and classification of the leukemia process are performed by utilizing the convolutional denoising autoencoder (CDAE) model. The FOA is implemented to select the hyperparameter of the CDAE model. The simulation process of the FOADCNN-LDC approach is performed on a benchmark medical dataset. The investigational analysis of the FOADCNN-LDC approach highlighted a superior accuracy value of 99.62% over existing techniques.
Collapse
Affiliation(s)
- Turky Omar Asar
- Department of Biology, College of Science and Arts at Alkamil, University of Jeddah, Jeddah, Saudi Arabia
| | - Mahmoud Ragab
- Information Technology Department, Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah, 21589, Saudi Arabia.
| |
Collapse
|
3
|
Ashayeri H, Sobhi N, Pławiak P, Pedrammehr S, Alizadehsani R, Jafarizadeh A. Transfer Learning in Cancer Genetics, Mutation Detection, Gene Expression Analysis, and Syndrome Recognition. Cancers (Basel) 2024; 16:2138. [PMID: 38893257 PMCID: PMC11171544 DOI: 10.3390/cancers16112138] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2024] [Revised: 05/30/2024] [Accepted: 06/01/2024] [Indexed: 06/21/2024] Open
Abstract
Artificial intelligence (AI), encompassing machine learning (ML) and deep learning (DL), has revolutionized medical research, facilitating advancements in drug discovery and cancer diagnosis. ML identifies patterns in data, while DL employs neural networks for intricate processing. Predictive modeling challenges, such as data labeling, are addressed by transfer learning (TL), leveraging pre-existing models for faster training. TL shows potential in genetic research, improving tasks like gene expression analysis, mutation detection, genetic syndrome recognition, and genotype-phenotype association. This review explores the role of TL in overcoming challenges in mutation detection, genetic syndrome detection, gene expression, or phenotype-genotype association. TL has shown effectiveness in various aspects of genetic research. TL enhances the accuracy and efficiency of mutation detection, aiding in the identification of genetic abnormalities. TL can improve the diagnostic accuracy of syndrome-related genetic patterns. Moreover, TL plays a crucial role in gene expression analysis in order to accurately predict gene expression levels and their interactions. Additionally, TL enhances phenotype-genotype association studies by leveraging pre-trained models. In conclusion, TL enhances AI efficiency by improving mutation prediction, gene expression analysis, and genetic syndrome detection. Future studies should focus on increasing domain similarities, expanding databases, and incorporating clinical data for better predictions.
Collapse
Affiliation(s)
- Hamidreza Ashayeri
- Student Research Committee, Tabriz University of Medical Sciences, Tabriz 5165665811, Iran;
| | - Navid Sobhi
- Nikookari Eye Center, Tabriz University of Medical Sciences, Tabriz 5165665811, Iran; (N.S.); (A.J.)
| | - Paweł Pławiak
- Department of Computer Science, Faculty of Computer Science and Telecommunications, Cracow University of Technology, Warszawska 24, 31-155 Krakow, Poland
- Institute of Theoretical and Applied Informatics, Polish Academy of Sciences, Bałtycka 5, 44-100 Gliwice, Poland
| | - Siamak Pedrammehr
- Faculty of Design, Tabriz Islamic Art University, Tabriz 5164736931, Iran;
- Institute for Intelligent Systems Research and Innovation (IISRI), Deakin University, Burwood, VIC 3216, Australia;
| | - Roohallah Alizadehsani
- Institute for Intelligent Systems Research and Innovation (IISRI), Deakin University, Burwood, VIC 3216, Australia;
| | - Ali Jafarizadeh
- Nikookari Eye Center, Tabriz University of Medical Sciences, Tabriz 5165665811, Iran; (N.S.); (A.J.)
- Immunology Research Center, Tabriz University of Medical Sciences, Tabriz 5165665811, Iran
| |
Collapse
|
4
|
Elrefaie RM, Mohamed MA, Marzouk EA, Ata MM. A robust classification of acute lymphocytic leukemia-based microscopic images with supervised Hilbert-Huang transform. Microsc Res Tech 2024; 87:191-204. [PMID: 37715495 DOI: 10.1002/jemt.24425] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2023] [Revised: 08/19/2023] [Accepted: 09/06/2023] [Indexed: 09/17/2023]
Abstract
Acute lymphocytic leukemia (ALL) is a malignant condition characterized by the development of blast cells in the bone marrow and their quick dissemination into the bloodstream. It primarily affects children and individuals over the age of 60. Manual blood testing, which has been around for a long time, may be slow. The likelihood of recognizing ALL in its early stages was increased by automating the diagnosis. This research developed an improved criterion for classifying ALL microscopic images into two categories: normal images and blast images. First, to save processing time, innovative image preprocessing techniques were employed to gather data for data augmentation, enhancement, and conversion. The K-means clustering technique was also utilized to effectively segment the relevant nuclei from the background. Furthermore, the most salient features were extracted using an empirical mode decomposition (EMD) based on the Hilbert-Huang transform. MATLAB functions such as principal component analysis, gray level co-occurrence matrix, local binary pattern, shape features, discrete cosine transform, discrete Fourier transform, discrete wavelet transform, and independent component analysis have been used and compared with EMD. The Bayesian regularization (BR) method has been implemented in the neural networks (NNs) classifier. Along with NNs, other classifiers such as support vector machine, K-nearest neighbors, random forest, naive Bayes, logistic regression, and decision tree have been used, evaluated, and contrasted with NNs. According to experimental findings, the ALL-IDB2 (Image Database 2) dataset's NNs-based-EMD model classified objects with an accuracy of 98.7%, sensitivity of 99.3%, and specificity of 98.1%. RESEARCH HIGHLIGHTS: Implement a robust method for classifying normal and blast ALL images in the state of the art using the combination of the BR algorithm and the neural networks classifier. Perform robust data processing via data augmentation and conversion from RGB (Red, Green, and Blue) image LAB (Luminosity, A: color space, B: color space) image. Extract the nuclei correctly from the background image using k-means clustering. Extract the most salient features from the segmented images using EMD in the state of the art of HHT.
Collapse
Affiliation(s)
- Reem Magdy Elrefaie
- Department of Electronics and Communications Engineering, Faculty of Engineering, Mansoura University, Mansoura, Egypt
| | - Mohamed A Mohamed
- Department of Electronics and Communications Engineering, Faculty of Engineering, Mansoura University, Mansoura, Egypt
| | - Elsaid A Marzouk
- Department of Electronics and Communications Engineering, Faculty of Engineering, Mansoura University, Mansoura, Egypt
| | - Mohamed Maher Ata
- School of Computational Sciences and Artificial Intelligence (CSAI), Zewail City of Science and Technology, 6th of October City, Giza, Egypt
- Department of Communications and Electronics Engineering, MISR Higher Institute for Engineering and Technology, Mansoura, Egypt
| |
Collapse
|
5
|
Yadav DP, Kumar D, Jalal AS, Kumar A, Singh KU, Shah MA. Morphological diagnosis of hematologic malignancy using feature fusion-based deep convolutional neural network. Sci Rep 2023; 13:16988. [PMID: 37813973 PMCID: PMC10562409 DOI: 10.1038/s41598-023-44210-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2023] [Accepted: 10/05/2023] [Indexed: 10/11/2023] Open
Abstract
Leukemia is a cancer of white blood cells characterized by immature lymphocytes. Due to blood cancer, many people die every year. Hence, the early detection of these blast cells is necessary for avoiding blood cancer. A novel deep convolutional neural network (CNN) 3SNet that has depth-wise convolution blocks to reduce the computation costs has been developed to aid the diagnosis of leukemia cells. The proposed method includes three inputs to the deep CNN model. These inputs are grayscale and their corresponding histogram of gradient (HOG) and local binary pattern (LBP) images. The HOG image finds the local shape, and the LBP image describes the leukaemia cell's texture pattern. The suggested model was trained and tested with images from the AML-Cytomorphology_LMU dataset. The mean average precision (MAP) for the cell with less than 100 images in the dataset was 84%, whereas for cells with more than 100 images in the dataset was 93.83%. In addition, the ROC curve area for these cells is more than 98%. This confirmed proposed model could be an adjunct tool to provide a second opinion to a doctor.
Collapse
Affiliation(s)
- D P Yadav
- Department of Computer Engineering and Applications, G.L.A. University, Mathura, 281406, India
| | - Deepak Kumar
- Department of Computer Science, NIT Meghalaya, Shillong, 793003, India
| | - Anand Singh Jalal
- Department of Computer Engineering and Applications, G.L.A. University, Mathura, 281406, India
| | - Ankit Kumar
- Department of Computer Engineering and Applications, G.L.A. University, Mathura, 281406, India
| | - Kamred Udham Singh
- School of Computing, Graphic Era Hill University, Dehradun, 248002, India
| | - Mohd Asif Shah
- Kebri Dehar University, Kebri Dehar, Ethiopia.
- Woxsen University, Kamkole, Sadasivpet, Hyderabad, Telangana, 502345, India.
- Division of Research and Development, Lovely Professional University, Phagwara, Punjab, 144001, India.
- Research Fellow, INTI International University, Persiaran Perdana BBN Putra, Nilai, Negeri Sembilan, 71800, Malaysia.
| |
Collapse
|
6
|
Walter W, Haferlach C, Nadarajah N, Schmidts I, Kühn C, Kern W, Haferlach T. How artificial intelligence might disrupt diagnostics in hematology in the near future. Oncogene 2021; 40:4271-4280. [PMID: 34103684 PMCID: PMC8225509 DOI: 10.1038/s41388-021-01861-y] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2021] [Revised: 05/11/2021] [Accepted: 05/24/2021] [Indexed: 02/07/2023]
Abstract
Artificial intelligence (AI) is about to make itself indispensable in the health care sector. Examples of successful applications or promising approaches range from the application of pattern recognition software to pre-process and analyze digital medical images, to deep learning algorithms for subtype or disease classification, and digital twin technology and in silico clinical trials. Moreover, machine-learning techniques are used to identify patterns and anomalies in electronic health records and to perform ad-hoc evaluations of gathered data from wearable health tracking devices for deep longitudinal phenotyping. In the last years, substantial progress has been made in automated image classification, reaching even superhuman level in some instances. Despite the increasing awareness of the importance of the genetic context, the diagnosis in hematology is still mainly based on the evaluation of the phenotype. Either by the analysis of microscopic images of cells in cytomorphology or by the analysis of cell populations in bidimensional plots obtained by flow cytometry. Here, AI algorithms not only spot details that might escape the human eye, but might also identify entirely new ways of interpreting these images. With the introduction of high-throughput next-generation sequencing in molecular genetics, the amount of available information is increasing exponentially, priming the field for the application of machine learning approaches. The goal of all the approaches is to allow personalized and informed interventions, to enhance treatment success, to improve the timeliness and accuracy of diagnoses, and to minimize technically induced misclassifications. The potential of AI-based applications is virtually endless but where do we stand in hematology and how far can we go?
Collapse
|