1
|
Kadhim YA, Guzel MS, Mishra A. A Novel Hybrid Machine Learning-Based System Using Deep Learning Techniques and Meta-Heuristic Algorithms for Various Medical Datatypes Classification. Diagnostics (Basel) 2024; 14:1469. [PMID: 39061605 PMCID: PMC11275302 DOI: 10.3390/diagnostics14141469] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2024] [Revised: 06/27/2024] [Accepted: 07/02/2024] [Indexed: 07/28/2024] Open
Abstract
Medicine is one of the fields where the advancement of computer science is making significant progress. Some diseases require an immediate diagnosis in order to improve patient outcomes. The usage of computers in medicine improves precision and accelerates data processing and diagnosis. In order to categorize biological images, hybrid machine learning, a combination of various deep learning approaches, was utilized, and a meta-heuristic algorithm was provided in this research. In addition, two different medical datasets were introduced, one covering the magnetic resonance imaging (MRI) of brain tumors and the other dealing with chest X-rays (CXRs) of COVID-19. These datasets were introduced to the combination network that contained deep learning techniques, which were based on a convolutional neural network (CNN) or autoencoder, to extract features and combine them with the next step of the meta-heuristic algorithm in order to select optimal features using the particle swarm optimization (PSO) algorithm. This combination sought to reduce the dimensionality of the datasets while maintaining the original performance of the data. This is considered an innovative method and ensures highly accurate classification results across various medical datasets. Several classifiers were employed to predict the diseases. The COVID-19 dataset found that the highest accuracy was 99.76% using the combination of CNN-PSO-SVM. In comparison, the brain tumor dataset obtained 99.51% accuracy, the highest accuracy derived using the combination method of autoencoder-PSO-KNN.
Collapse
Affiliation(s)
- Yezi Ali Kadhim
- College of Engineering, University of Baghdad, Jadriyah, Baghdad 10071, Iraq;
- Department of Modeling and Design of Engineering Systems (MODES), Atilim University, Ankara 06830, Turkey
- Department of Electrical and Electronics Engineering, Atilim University, Incek, Ankara 06830, Turkey
| | - Mehmet Serdar Guzel
- Department of Computer Engineering, Ankara University, Yenimahalle, Ankara 06100, Turkey;
| | - Alok Mishra
- Faculty of Engineering, Norwegian University of Science and Technology, 7034 Trondheim, Norway
- Department of Software Engineering, Atilim University, Incek, Ankara 06830, Turkey
| |
Collapse
|
2
|
Kumar S, Gola KK, Jee N, Singh BM. Optimized feature fusion-based modified cascaded kernel extreme learning machine for heart disease prediction in E-healthcare. Comput Methods Biomech Biomed Engin 2024; 27:980-993. [PMID: 37272059 DOI: 10.1080/10255842.2023.2218520] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2022] [Accepted: 05/19/2023] [Indexed: 06/06/2023]
Abstract
In recent years, medical technological innovators have focused on diverse clinical therapies to find innovative ways to overcome clinical challenges. But still, there emerge certain drawbacks like high computational cost, increased error, less training ability, the requirement of high storage space and degraded accuracy. To conquer these drawbacks, the proposed research article presents an innovative cascaded extreme learning machine for effective heart disease (HD) prediction. Missing data filtering and normalization methods are carried out for data pre-processing. From the pre-processed data, the features are extracted using the Framingham risk factor extraction module, whereas the extracted features are fused to generate a feature vector. The most significant features are selected using Rhino Satin Herd optimization algorithm. Using a linear weight assignment approach, the feature weighting process is undertaken by allocating higher weights to significant features and less weight to unwanted features. Finally, classification is performed through the Cascaded kernel soft plus extreme learning machine with a stacked autoencoder model. The performance is analyzed using PYTHON to evaluate the superiority of the proposed model. The proposed model obtained an overall accuracy of 90%, precision of 94%, recall of 91.3% and F1 measure of 92.6% in the Cleveland-Hungarian dataset, which is comparatively superior to the existing methods. An accuracy of 92.6% is attained for predicting HD in terms of the heart patient dataset. The proposed model attains better performance because of effective accuracy outcome, reduced overfitting issues, fewer error rates, better convergence and training ability.
Collapse
Affiliation(s)
- Sumit Kumar
- COER University, Roorkee, Uttarakhand, 247667, India
| | | | - Narayan Jee
- COER University, Roorkee, Uttarakhand, 247667, India
| | | |
Collapse
|
3
|
Wang J, Xie F, Nie F, Li X. Generalized and Robust Least Squares Regression. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:7006-7020. [PMID: 36264726 DOI: 10.1109/tnnls.2022.3213594] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
As a simple yet effective method, least squares regression (LSR) is extensively applied for data regression and classification. Combined with sparse representation, LSR can be extended to feature selection (FS) as well, in which l1 regularization is often applied in embedded FS algorithms. However, because the loss function is in the form of squared error, LSR and its variants are sensitive to noises, which significantly degrades the effectiveness and performance of classification and FS. To cope with the problem, we propose a generalized and robust LSR (GRLSR) for classification and FS, which is made up of arbitrary concave loss function and the l2,p -norm regularization term. Meanwhile, an iterative algorithm is applied to efficiently deal with the nonconvex minimization problem, in which an additional weight to suppress the effect of noises is added to each data point. The weights can be automatically assigned according to the error of the samples. When the error is large, the value of the corresponding weight is small. It is this mechanism that allows GRLSR to reduce the impact of noises and outliers. According to the different formulations of the concave loss function, four specific methods are proposed to clarify the essence of the framework. Comprehensive experiments on corrupted datasets have proven the advantage of the proposed method.
Collapse
|
4
|
Al-Alshaikh HA, P P, Poonia RC, Saudagar AKJ, Yadav M, AlSagri HS, AlSanad AA. Comprehensive evaluation and performance analysis of machine learning in heart disease prediction. Sci Rep 2024; 14:7819. [PMID: 38570582 PMCID: PMC10991287 DOI: 10.1038/s41598-024-58489-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2023] [Accepted: 03/29/2024] [Indexed: 04/05/2024] Open
Abstract
Heart disease is a leading cause of mortality on a global scale. Accurately predicting cardiovascular disease poses a significant challenge within clinical data analysis. The present study introduces a prediction model that utilizes various combinations of information and employs multiple established classification approaches. The proposed technique combines the genetic algorithm (GA) and the recursive feature elimination method (RFEM) to select relevant features, thus enhancing the model's robustness. Techniques like the under sampling clustering oversampling method (USCOM) address the issue of data imbalance, thereby improving the model's predictive capabilities. The classification challenge employs a multilayer deep convolutional neural network (MLDCNN), trained using the adaptive elephant herd optimization method (AEHOM). The proposed machine learning-based heart disease prediction method (ML-HDPM) demonstrates outstanding performance across various crucial evaluation parameters, as indicated by its comprehensive assessment. During the training process, the ML-HDPM model exhibits a high level of performance, achieving an accuracy rate of 95.5% and a precision rate of 94.8%. The system's sensitivity (recall) performs with a high accuracy rate of 96.2%, while the F-score highlights its well-balanced performance, measuring 91.5%. It is worth noting that the specificity of ML-HDPM is recorded at a remarkable 89.7%. The findings underscore the potential of ML-HDPM to transform the prediction of heart disease and aid healthcare practitioners in providing precise diagnoses, exerting a substantial influence on patient care outcomes.
Collapse
Affiliation(s)
- Halah A Al-Alshaikh
- Information Systems Department, College of Computer and Information Sciences, Imam Mohammad Ibn Saud Islamic University (IMSIU), 11432, Riyadh, Saudi Arabia
| | - Prabu P
- Department of Computer Science, CHRIST University, Bangalore, 560029, India
| | | | - Abdul Khader Jilani Saudagar
- Information Systems Department, College of Computer and Information Sciences, Imam Mohammad Ibn Saud Islamic University (IMSIU), 11432, Riyadh, Saudi Arabia
| | - Manoj Yadav
- Department of Computer Science and Engineering, Guru Jambheshwar University of Science and Technology, Hisar, India
| | - Hatoon S AlSagri
- Information Systems Department, College of Computer and Information Sciences, Imam Mohammad Ibn Saud Islamic University (IMSIU), 11432, Riyadh, Saudi Arabia
| | - Abeer A AlSanad
- Information Systems Department, College of Computer and Information Sciences, Imam Mohammad Ibn Saud Islamic University (IMSIU), 11432, Riyadh, Saudi Arabia
| |
Collapse
|
5
|
Hasan M, Sahid MA, Uddin MP, Marjan MA, Kadry S, Kim J. Performance discrepancy mitigation in heart disease prediction for multisensory inter-datasets. PeerJ Comput Sci 2024; 10:e1917. [PMID: 38660196 PMCID: PMC11041935 DOI: 10.7717/peerj-cs.1917] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Accepted: 02/12/2024] [Indexed: 04/26/2024]
Abstract
Heart disease is one of the primary causes of morbidity and death worldwide. Millions of people have had heart attacks every year, and only early-stage predictions can help to reduce the number. Researchers are working on designing and developing early-stage prediction systems using different advanced technologies, and machine learning (ML) is one of them. Almost all existing ML-based works consider the same dataset (intra-dataset) for the training and validation of their method. In particular, they do not consider inter-dataset performance checks, where different datasets are used in the training and testing phases. In inter-dataset setup, existing ML models show a poor performance named the inter-dataset discrepancy problem. This work focuses on mitigating the inter-dataset discrepancy problem by considering five available heart disease datasets and their combined form. All potential training and testing mode combinations are systematically executed to assess discrepancies before and after applying the proposed methods. Imbalance data handling using SMOTE-Tomek, feature selection using random forest (RF), and feature extraction using principle component analysis (PCA) with a long preprocessing pipeline are used to mitigate the inter-dataset discrepancy problem. The preprocessing pipeline builds on missing value handling using RF regression, log transformation, outlier removal, normalization, and data balancing that convert the datasets to more ML-centric. Support vector machine, K-nearest neighbors, decision tree, RF, eXtreme Gradient Boosting, Gaussian naive Bayes, logistic regression, and multilayer perceptron are used as classifiers. Experimental results show that feature selection and classification using RF produce better results than other combination strategies in both single- and inter-dataset setups. In certain configurations of individual datasets, RF demonstrates 100% accuracy and 96% accuracy during the feature selection phase in an inter-dataset setup, exhibiting commendable precision, recall, F1 score, specificity, and AUC score. The results indicate that an effective preprocessing technique has the potential to improve the performance of the ML model without necessitating the development of intricate prediction models. Addressing inter-dataset discrepancies introduces a novel research avenue, enabling the amalgamation of identical features from various datasets to construct a comprehensive global dataset within a specific domain.
Collapse
Affiliation(s)
- Mahmudul Hasan
- Department of Computer Science and Engineering, Hajee Mohammad Danesh Science and Technology University, Dinajpur, Bangladesh
- School of Information Technology, Deakin University, Geelong, VIC, Australia
| | - Md Abdus Sahid
- Department of Computer Science and Engineering, Hajee Mohammad Danesh Science and Technology University, Dinajpur, Bangladesh
| | - Md Palash Uddin
- Department of Computer Science and Engineering, Hajee Mohammad Danesh Science and Technology University, Dinajpur, Bangladesh
- School of Information Technology, Deakin University, Geelong, VIC, Australia
| | - Md Abu Marjan
- Department of Computer Science and Engineering, Hajee Mohammad Danesh Science and Technology University, Dinajpur, Bangladesh
| | - Seifedine Kadry
- Department of Electrical and Computer Engineering, Lebanese American University, Byblos, Lebanon
- Department of Applied Data Science, Noroff University College, Kristiansand, Norway
- Artificial Intelligence Research Center (AIRC), Ajman University, Ajman, Norway
- MEU Research Unit, Middle East University, Amman, Jordan
| | - Jungeun Kim
- Department of Software, Kongju National University, Cheonan, Republic of South Korea
| |
Collapse
|
6
|
Abdullhadi S, Al-Qudah DA, Abu-Salih B. Time-aware forecasting of search volume categories and actual purchase. Heliyon 2024; 10:e25034. [PMID: 38317988 PMCID: PMC10838793 DOI: 10.1016/j.heliyon.2024.e25034] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2023] [Revised: 12/28/2023] [Accepted: 01/18/2024] [Indexed: 02/07/2024] Open
Abstract
The new e-commerce field has attracted businesses of all sizes, retailers, and individuals. Consequently, there is an ongoing necessity for applications that can offer predictions on trending products and optimal selling time. This research suggests aiding businesses in forecasting demand for various product categories by employing data mining algorithms on multivariate time series data. To ensure the most recent information, real-time data was gathered through APIs to build the first block in this research. While search volume was derived from the Keywords Everywhere tool, Amazon's search volume was derived from the Helium 10 tool and external features about actual purchased data. The harvested raw datasets went through multiple processes to generate the dataset and were validated. The models XGBoost, Linear Regression, Random Forest, long-short-term memory, and K-nearest neighbor were employed to predict the trends, and the performance is demonstrated using evaluation metrics, namely Mean Squared Error (MSE), Root Mean Squared Error (RMSE), Mean Absolute Error (MAE), and Coefficient of Determination (R2). Overall, Linear Regression outperformed, especially at a correlation coefficient of 0.9, with R2 = 90.688, MAE = 0.038, MSE = 0.003, and RMSE = 0.057. KNN outperformed on correlation coefficient of 0.7, R2 = 85.129, MAE = 0.045, MSE = 0.005, and RMSE = 0.068. XGBoost produced the best results with a correlation coefficient of 0.9, yielding R2 = 85.89, MAE = 0.042, MSE = 0.004, and RMSE = 0.062. Random Forest, on the other hand, achieves peak metrics with a correlation coefficient of 0.6, R2 = 84.854, MAE = 0.041, MSE = 0.004, and RMSE = 0.066.
Collapse
Affiliation(s)
- Shahed Abdullhadi
- King Abdullah II School of Information Technology, The University of Jordan, Jordan
| | - Dana A. Al-Qudah
- King Abdullah II School of Information Technology, The University of Jordan, Jordan
| | - Bilal Abu-Salih
- King Abdullah II School of Information Technology, The University of Jordan, Jordan
| |
Collapse
|
7
|
Zhang DJ, Gao YL, Zhao JX, Zheng CH, Liu JX. A New Graph Autoencoder-Based Consensus-Guided Model for scRNA-seq Cell Type Detection. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:2473-2483. [PMID: 35857730 DOI: 10.1109/tnnls.2022.3190289] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Single-cell RNA sequencing (scRNA-seq) technology is famous for providing a microscopic view to help capture cellular heterogeneity. This characteristic has advanced the field of genomics by enabling the delicate differentiation of cell types. However, the properties of single-cell datasets, such as high dropout events, noise, and high dimensionality, are still a research challenge in the single-cell field. To utilize single-cell data more efficiently and to better explore the heterogeneity among cells, a new graph autoencoder (GAE)-based consensus-guided model (scGAC) is proposed in this article. The data are preprocessed into multiple top-level feature datasets. Then, feature learning is performed by using GAEs to generate new feature matrices, followed by similarity learning based on distance fusion methods. The learned similarity matrices are fed back to the GAEs to guide their feature learning process. Finally, the abovementioned steps are iterated continuously to integrate the final consistent similarity matrix and perform other related downstream analyses. The scGAC model can accurately identify critical features and effectively preserve the internal structure of the data. This can further improve the accuracy of cell type identification.
Collapse
|
8
|
Talari P, N B, Kaur G, Alshahrani H, Al Reshan MS, Sulaiman A, Shaikh A. Hybrid feature selection and classification technique for early prediction and severity of diabetes type 2. PLoS One 2024; 19:e0292100. [PMID: 38236900 PMCID: PMC10796060 DOI: 10.1371/journal.pone.0292100] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2023] [Accepted: 09/12/2023] [Indexed: 01/22/2024] Open
Abstract
Diabetes prediction is an ongoing study topic in which medical specialists are attempting to forecast the condition with greater precision. Diabetes typically stays lethargic, and on the off chance that patients are determined to have another illness, like harm to the kidney vessels, issues with the retina of the eye, or a heart issue, it can cause metabolic problems and various complexities in the body. Various worldwide learning procedures, including casting a ballot, supporting, and sacking, have been applied in this review. The Engineered Minority Oversampling Procedure (Destroyed), along with the K-overlay cross-approval approach, was utilized to achieve class evening out and approve the discoveries. Pima Indian Diabetes (PID) dataset is accumulated from the UCI Machine Learning (UCI ML) store for this review, and this dataset was picked. A highlighted engineering technique was used to calculate the influence of lifestyle factors. A two-phase classification model has been developed to predict insulin resistance using the Sequential Minimal Optimisation (SMO) and SMOTE approaches together. The SMOTE technique is used to preprocess data in the model's first phase, while SMO classes are used in the second phase. All other categorization techniques were outperformed by bagging decision trees in terms of Misclassification Error rate, Accuracy, Specificity, Precision, Recall, F1 measures, and ROC curve. The model was created using a combined SMOTE and SMO strategy, which achieved 99.07% correction with 0.1 ms of runtime. The suggested system's result is to enhance the classifier's performance in spotting illness early.
Collapse
Affiliation(s)
- Praveen Talari
- Department of Computer Science and Engineering, Vignana Bharathi Institute of Technology, Hyderabad, India
| | - Bharathiraja N
- Chitkara University Institute of Engineering and Technology, Chitkara University Punjab, Rajpura, India
| | - Gaganpreet Kaur
- Chitkara University Institute of Engineering and Technology, Chitkara University Punjab, Rajpura, India
| | - Hani Alshahrani
- Department of Computer Science, College of Computer Science and Information Systems, Najran University, Najran, Saudi Arabia
| | - Mana Saleh Al Reshan
- Department of Information Systems, College of Computer Science and Information Systems, Najran University, Najran, Saudi Arabia
- Scientific and Engineering Research Centre, Najran University, Najran, Saudi Arabia
| | - Adel Sulaiman
- Department of Computer Science, College of Computer Science and Information Systems, Najran University, Najran, Saudi Arabia
| | - Asadullah Shaikh
- Department of Information Systems, College of Computer Science and Information Systems, Najran University, Najran, Saudi Arabia
| |
Collapse
|
9
|
Liu Y, Li F, Shang J, Liu J, Wang J, Ge D. scFED: Clustering Identifying Cell Types of scRNA-Seq Data Based on Feature Engineering Denoising. Interdiscip Sci 2023; 15:590-601. [PMID: 37402002 DOI: 10.1007/s12539-023-00574-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2023] [Revised: 05/31/2023] [Accepted: 06/06/2023] [Indexed: 07/05/2023]
Abstract
Recently developed single-cell RNA-seq (scRNA-seq) technology has given researchers the chance to investigate single-cell level of disease development. Clustering is one of the most essential strategies for analyzing scRNA-seq data. Choosing high-quality feature sets can significantly enhance the outcomes of single-cell clustering and classification. But computationally burdensome and highly expressed genes cannot afford a stabilized and predictive feature set for technical reasons. In this study, we introduce scFED, a feature-engineered gene selection framework. scFED identifies prospective feature sets to eliminate the noise fluctuation. And fuse them with existing knowledge from the tissue-specific cellular taxonomy reference database (CellMatch) to avoid the influence of subjective factors. Then present a reconstruction approach for noise reduction and crucial information amplification. We apply scFED on four genuine single-cell datasets and compare it with other techniques. According to the results, scFED improves clustering, decreases dimension of the scRNA-seq data, improves cell type identification when combined with clustering algorithms, and has higher performance than other methods. Therefore, scFED offers certain benefits in scRNA-seq data gene selection.
Collapse
Affiliation(s)
- Yang Liu
- School of Computer Science, Qufu Normal University, Rizhao, 276826, China
| | - Feng Li
- School of Computer Science, Qufu Normal University, Rizhao, 276826, China.
| | - Junliang Shang
- School of Computer Science, Qufu Normal University, Rizhao, 276826, China
| | - Jinxing Liu
- School of Computer Science, Qufu Normal University, Rizhao, 276826, China
| | - Juan Wang
- School of Computer Science, Qufu Normal University, Rizhao, 276826, China
| | - Daohui Ge
- School of Computer Science, Qufu Normal University, Rizhao, 276826, China
| |
Collapse
|
10
|
Cornaby C, Weimer ET. Utilizing principal component analysis in the identification of clinically relevant changes in patient HLA single antigen bead solid phase testing patterns. PLoS One 2023; 18:e0288743. [PMID: 37883384 PMCID: PMC10602234 DOI: 10.1371/journal.pone.0288743] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2023] [Accepted: 07/04/2023] [Indexed: 10/28/2023] Open
Abstract
BACKGROUND HLA antibody testing is essential for successful solid-organ allocation, patient monitoring post-transplant, and risk assessment for both solid-organ and hematopoietic transplant patients. Luminex solid-phase testing is the most common method for identifying HLA antibody specificities, making it one of the most complex immunoassays as each panel contains over 90 specificities for both HLA class I and HLA class II with most of the analysis being performed manually in the vendor-provided software. Principal component analysis (PCA), used in machine learning, is a feature extraction method often utilized to assess data with many variables. METHODS & FINDINGS In our study, solid organ transplant patients who exhibited HLA donor-specific antibodies (DSAs) were used to characterize the utility of PCA-derived analysis when compared to a control group of post-transplant and pre-transplant patients. ROC analysis was utilized to determine a potential threshold for the PCA-derived analysis that would indicate a significant change in a patient's single antigen bead pattern. To evaluate if the algorithm could identify differences in patterns on HLA class I and HLA class II single antigen bead results using the optimized threshold, HLA antibody test results were analyzed using PCA-derived analysis and compared to the clinical results for each patient sample. The PCA-derived algorithm had a sensitivity of 100% (95% CI, 73.54%-100%), a specificity of 75% (95% CI, 56.30%-92.54%), with a PPV of 65% (95% CI, 52.50%-83.90%) and an NPV of 100%, in identifying new reactivity that differed from the patients historic HLA antibody pattern. Additionally, PCA-derived analysis was utilized to assess the potential over-reactivity of single antigen beads for both HLA class I and HLA class II antibody panels. This assessment of antibody results identified several beads in both the HLA class I and HLA class II antibody panel which exhibit over reactivity from 2018 to the present time. CONCLUSIONS PCA-derived analysis would be ideal to help automatically identify patient samples that have an HLA antibody pattern of reactivity consistent with their history and those which exhibit changes in their antibody patterns which could include donor-specific antibodies, de novo HLA antibodies, and assay interference. A similar method could also be applied to evaluate the over-reactivity of beads in the HLA solid phase assays which would be beneficial for lot comparisons and instructive for transplant centers to better understand which beads are more prone to exhibiting over-reactivity and impact patient care.
Collapse
Affiliation(s)
- Caleb Cornaby
- Histocompatibility & Diagnostic Immunology Laboratory, Children's Hospital of Los Angeles, Los Angeles, California, United States of America
- Department of Pathology and Laboratory Medicine, University of Southern California Keck School of Medicine, Los Angeles, California, United States of America
| | - Eric T Weimer
- Molecular Immunology Laboratory, McLendon Clinical Laboratories, UNC Health, Chapel Hill, North Carolina, United States of America
- Department of Pathology & Laboratory Medicine, the University of North Carolina at Chapel Hill School of Medicine, Chapel Hill, North Carolina, United States of America
| |
Collapse
|
11
|
An intelligent heart disease prediction system using hybrid deep dense Aquila network. Biomed Signal Process Control 2023. [DOI: 10.1016/j.bspc.2023.104742] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/06/2023]
|
12
|
Karuppuchamy V, Palanivelrajan S. Efficient IoT-machine learning assisted heart failure prediction using adaptive fuzzy-based LSTM-RNN algorithm. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2023. [DOI: 10.3233/jifs-224298] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/03/2023]
Abstract
Chronic diseases like diabetes, Heart Failure (HF), malignancy, and severe respiratory sickness are the leading cause of mortality around the globe. Dissimilar indications or traits are extremely difficult to identify in HF patients. IoT solutions are becoming increasingly commonplace as smart wearable gadgets become more popular. Sudden heart attacks have a short life expectancy, which is terrible. As a result, a patient monitoring of heart patients based on IoT-centered Machine Learning (ML) is presented to help with HF prediction, and treatment is administered as necessary. Verification, Encryption, and Categorization are the three phases that make up this developed model. Initially, the datasets from the IoT sensor gadget are gathered by authenticating with a specific hospital through encryption. The patient’s integrated IoT sensor module then transfers sensing information to the cloud. The Improved Blowfish Encryption (IBE) approach is used to protect the sensor data transfer to the cloud. Then the encrypted data is decrypted, and the classification is performed using the Adaptive Fuzzy-Based Long Short-Term Memory with Recurrent Neural Network (AF-LSTM-RNN) algorithm. The results are classed as malignant or benign. It assesses the patient’s cardiac state and sends an alert text to the doctor for treatment. The AF-LSTM-RNN-based HF prediction outperforms the existing techniques. Accuracy, sensitivity, specificity, precision, F-measure and Matthews Correlation Coefficient (MCC) are compared to existing procedures to ensure the planned research is genuine. Using the Origin tool, these metrics are shown as research findings.
Collapse
Affiliation(s)
- V. Karuppuchamy
- Department of Information Technology, Kongunadu College of Engineering and Technology, Thottiam, Trichy, Tamilnadu, India
| | - S. Palanivelrajan
- Department of Electronics and Communication Engineering, M.Kumarasamy College of Engineering (Autonomous), Thalavapalayam, Karur, Tamilnadu, India
| |
Collapse
|
13
|
Ay Ş, Ekinci E, Garip Z. A comparative analysis of meta-heuristic optimization algorithms for feature selection on ML-based classification of heart-related diseases. THE JOURNAL OF SUPERCOMPUTING 2023; 79:11797-11826. [PMID: 37304052 PMCID: PMC9983547 DOI: 10.1007/s11227-023-05132-3] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Accepted: 02/21/2023] [Indexed: 06/13/2023]
Abstract
This study aims to use a machine learning (ML)-based enhanced diagnosis and survival model to predict heart disease and survival in heart failure by combining the cuckoo search (CS), flower pollination algorithm (FPA), whale optimization algorithm (WOA), and Harris hawks optimization (HHO) algorithms, which are meta-heuristic feature selection algorithms. To achieve this, experiments are conducted on the Cleveland heart disease dataset and the heart failure dataset collected from the Faisalabad Institute of Cardiology published at UCI. CS, FPA, WOA, and HHO algorithms for feature selection are applied for different population sizes and are realized based on the best fitness values. For the original dataset of heart disease, the maximum prediction F-score of 88% is obtained using K-nearest neighbour (KNN) when compared to logistic regression (LR), support vector machine (SVM), Gaussian Naive Bayes (GNB), and random forest (RF). With the proposed approach, the heart disease prediction F-score of 99.72% is obtained using KNN for population sizes 60 with FPA by selecting eight features. For the original dataset of heart failure, the maximum prediction F-score of 70% is obtained using LR and RF compared to SVM, GNB, and KNN. With the proposed approach, the heart failure prediction F-score of 97.45% is obtained using KNN for population sizes 10 with HHO by selecting five features. Experimental findings show that the applied meta-heuristic algorithms with ML algorithms significantly improve prediction performances compared to performances obtained from the original datasets. The motivation of this paper is to select the most critical and informative feature subset through meta-heuristic algorithms to improve classification accuracy.
Collapse
Affiliation(s)
- Şevket Ay
- Computer Engineering Department, Faculty of Technology, Sakarya University of Applied Sciences, Sakarya, 54187 Turkey
| | - Ekin Ekinci
- Computer Engineering Department, Faculty of Technology, Sakarya University of Applied Sciences, Sakarya, 54187 Turkey
| | - Zeynep Garip
- Computer Engineering Department, Faculty of Technology, Sakarya University of Applied Sciences, Sakarya, 54187 Turkey
| |
Collapse
|
14
|
Balakrishnan C, Ambeth Kumar VD. IoT-Enabled Classification of Echocardiogram Images for Cardiovascular Disease Risk Prediction with Pre-Trained Recurrent Convolutional Neural Networks. Diagnostics (Basel) 2023; 13:diagnostics13040775. [PMID: 36832263 PMCID: PMC9955174 DOI: 10.3390/diagnostics13040775] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2023] [Revised: 02/11/2023] [Accepted: 02/13/2023] [Indexed: 02/22/2023] Open
Abstract
Cardiovascular diseases currently present a key health concern, contributing to an increase in death rates worldwide. In this phase of increasing mortality rates, healthcare represents a major field of research, and the knowledge acquired from this analysis of health information will assist in the early identification of disease. The retrieval of medical information is becoming increasingly important to make an early diagnosis and provide timely treatment. Medical image segmentation and classification is an emerging field of research in medical image processing. In this research, the data collected from an Internet of Things (IoT)-based device, the health records of patients, and echocardiogram images are considered. The images are pre-processed and segmented, and then further processed using deep learning techniques for classification as well as forecasting the risk of heart disease. Segmentation is attained via fuzzy C-means clustering (FCM) and classification using a pretrained recurrent neural network (PRCNN). Based on the findings, the proposed approach achieves 99.5% accuracy, which is higher than the current state-of-the-art techniques.
Collapse
Affiliation(s)
- Chitra Balakrishnan
- Panimalar Engineering College, Anna University, Chennai 600123, India
- Correspondence: (C.B.); (V.D.A.K.)
| | - V. D. Ambeth Kumar
- Computer Engineering, Mizoram University, Aizawl 796004, India
- Correspondence: (C.B.); (V.D.A.K.)
| |
Collapse
|
15
|
Le TD, Noumeir R, Rambaud J, Sans G, Jouvet P. Adaptation of Autoencoder for Sparsity Reduction From Clinical Notes Representation Learning. IEEE JOURNAL OF TRANSLATIONAL ENGINEERING IN HEALTH AND MEDICINE 2023; 11:469-478. [PMID: 37817825 PMCID: PMC10561736 DOI: 10.1109/jtehm.2023.3241635] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/01/2022] [Revised: 01/08/2023] [Accepted: 01/30/2023] [Indexed: 10/12/2023]
Abstract
When dealing with clinical text classification on a small dataset, recent studies have confirmed that a well-tuned multilayer perceptron outperforms other generative classifiers, including deep learning ones. To increase the performance of the neural network classifier, feature selection for the learning representation can effectively be used. However, most feature selection methods only estimate the degree of linear dependency between variables and select the best features based on univariate statistical tests. Furthermore, the sparsity of the feature space involved in the learning representation is ignored. GOAL Our aim is, therefore, to access an alternative approach to tackle the sparsity by compressing the clinical representation feature space, where limited French clinical notes can also be dealt with effectively. METHODS This study proposed an autoencoder learning algorithm to take advantage of sparsity reduction in clinical note representation. The motivation was to determine how to compress sparse, high-dimensional data by reducing the dimension of the clinical note representation feature space. The classification performance of the classifiers was then evaluated in the trained and compressed feature space. RESULTS The proposed approach provided overall performance gains of up to 3% for each test set evaluation. Finally, the classifier achieved 92% accuracy, 91% recall, 91% precision, and 91% f1-score in detecting the patient's condition. Furthermore, the compression working mechanism and the autoencoder prediction process were demonstrated by applying the theoretic information bottleneck framework. Clinical and Translational Impact Statement- An autoencoder learning algorithm effectively tackles the problem of sparsity in the representation feature space from a small clinical narrative dataset. Significantly, it can learn the best representation of the training data because of its lossless compression capacity compared to other approaches. Consequently, its downstream classification ability can be significantly improved, which cannot be done using deep learning models.
Collapse
Affiliation(s)
- Thanh-Dung Le
- Biomedical Information Processing Laboratory, Ecole de Technologie SuperieureUniversity of QuebecMontrealQCH3C 1K3Canada
- Research Center at CHU Sainte-JustineUniversity of MontrealMontrealQCH3T 1J4Canada
| | - Rita Noumeir
- Biomedical Information Processing Laboratory, Ecole de Technologie SuperieureUniversity of QuebecMontrealQCH3C 1K3Canada
| | - Jerome Rambaud
- Research Center at CHU Sainte-JustineUniversity of MontrealMontrealQCH3T 1J4Canada
| | - Guillaume Sans
- Research Center at CHU Sainte-JustineUniversity of MontrealMontrealQCH3T 1J4Canada
| | - Philippe Jouvet
- Research Center at CHU Sainte-JustineUniversity of MontrealMontrealQCH3T 1J4Canada
| |
Collapse
|
16
|
Menshawi A, Hassan MM, Allheeib N, Fortino G. A Hybrid Generic Framework for Heart Problem Diagnosis Based on a Machine Learning Paradigm. SENSORS (BASEL, SWITZERLAND) 2023; 23:s23031392. [PMID: 36772430 PMCID: PMC9921250 DOI: 10.3390/s23031392] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/01/2022] [Revised: 11/25/2022] [Accepted: 11/27/2022] [Indexed: 05/31/2023]
Abstract
The early, valid prediction of heart problems would minimize life threats and save lives, while lack of prediction and false diagnosis can be fatal. Addressing a single dataset alone to build a machine learning model for the identification of heart problems is not practical because each country and hospital has its own data schema, structure, and quality. On this basis, a generic framework has been built for heart problem diagnosis. This framework is a hybrid framework that employs multiple machine learning and deep learning techniques and votes for the best outcome based on a novel voting technique with the intention to remove bias from the model. The framework contains two consequent layers. The first layer contains simultaneous machine learning models running over a given dataset. The second layer consolidates the outputs of the first layer and classifies them as a second classification layer based on novel voting techniques. Prior to the classification process, the framework selects the top features using a proposed feature selection framework. It starts by filtering the columns using multiple feature selection methods and considers the top common features selected. Results from the proposed framework, with 95.6% accuracy, show its superiority over the single machine learning model, classical stacking technique, and traditional voting technique. The main contribution of this work is to demonstrate how the prediction probabilities of multiple models can be exploited for the purpose of creating another layer for final output; this step neutralizes any model bias. Another experimental contribution is proving the complete pipeline's ability to be retrained and used for other datasets collected using different measurements and with different distributions.
Collapse
Affiliation(s)
- Alaa Menshawi
- Information Systems Department, College of Computer and Information Science, King Saud University, Riyadh 11543, Saudi Arabia
| | - Mohammad Mehedi Hassan
- Information Systems Department, College of Computer and Information Science, King Saud University, Riyadh 11543, Saudi Arabia
| | - Nasser Allheeib
- Information Systems Department, College of Computer and Information Science, King Saud University, Riyadh 11543, Saudi Arabia
| | - Giancarlo Fortino
- Department of Informatics, Modeling, Electronics, and Systems, University of Calabria, 87036 Rende, Italy
| |
Collapse
|
17
|
Abd El-Ghany S, A. Abd El-Aziz A. A Robust Tuned Random Forest Classifier Using Randomized Grid Search to Predict Coronary Artery Diseases. COMPUTERS, MATERIALS & CONTINUA 2023; 75:4633-4648. [DOI: 10.32604/cmc.2023.035779] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/03/2022] [Accepted: 12/08/2022] [Indexed: 09/01/2023]
|
18
|
Doan LMT, Angione C, Occhipinti A. Machine Learning Methods for Survival Analysis with Clinical and Transcriptomics Data of Breast Cancer. Methods Mol Biol 2023; 2553:325-393. [PMID: 36227551 DOI: 10.1007/978-1-0716-2617-7_16] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
Breast cancer is one of the most common cancers in women worldwide, which causes an enormous number of deaths annually. However, early diagnosis of breast cancer can improve survival outcomes enabling simpler and more cost-effective treatments. The recent increase in data availability provides unprecedented opportunities to apply data-driven and machine learning methods to identify early-detection prognostic factors capable of predicting the expected survival and potential sensitivity to treatment of patients, with the final aim of enhancing clinical outcomes. This tutorial presents a protocol for applying machine learning models in survival analysis for both clinical and transcriptomic data. We show that integrating clinical and mRNA expression data is essential to explain the multiple biological processes driving cancer progression. Our results reveal that machine-learning-based models such as random survival forests, gradient boosted survival model, and survival support vector machine can outperform the traditional statistical methods, i.e., Cox proportional hazard model. The highest C-index among the machine learning models was recorded when using survival support vector machine, with a value 0.688, whereas the C-index recorded using the Cox model was 0.677. Shapley Additive Explanation (SHAP) values were also applied to identify the feature importance of the models and their impact on the prediction outcomes.
Collapse
Affiliation(s)
- Le Minh Thao Doan
- School of Computing, Engineering and Digital Technologies, Teesside University, Middlesbrough, UK
| | - Claudio Angione
- School of Computing, Engineering and Digital Technologies, Teesside University, Middlesbrough, UK
- Centre for Digital Innovation, Teesside University, Middlesbrough, UK
- Healthcare Innovation Centre, Teesside University, Middlesbrough, UK
- National Horizons Centre, Teesside University, Darlington, UK
| | - Annalisa Occhipinti
- School of Computing, Engineering and Digital Technologies, Teesside University, Middlesbrough, UK.
- Centre for Digital Innovation, Teesside University, Middlesbrough, UK.
- National Horizons Centre, Teesside University, Darlington, UK.
| |
Collapse
|
19
|
Sonawane R, Patil H. A design and implementation of heart disease prediction model using data and ECG signal through hybrid clustering. COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING: IMAGING & VISUALIZATION 2022. [DOI: 10.1080/21681163.2022.2156927] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Affiliation(s)
- Ritesh Sonawane
- Computer Engineering, S.S.V.P.S B.S.Deore College of Engineering, Dhule, Maharashtra, India
| | - Hitendra Patil
- Computer Engineering, S.S.V.P.S B.S.Deore College of Engineering, Dhule, Maharashtra, India
| |
Collapse
|
20
|
Martinez G, Garduno A, Mahmud-Al-Rafat A, Ostadgavahi AT, Avery A, de Avila e Silva S, Cusack R, Cameron C, Cameron M, Martin-Loeches I, Kelvin D. An artificial neural network classification method employing longitudinally monitored immune biomarkers to predict the clinical outcome of critically ill COVID-19 patients. PeerJ 2022; 10:e14487. [PMID: 36530391 PMCID: PMC9753745 DOI: 10.7717/peerj.14487] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2022] [Accepted: 11/08/2022] [Indexed: 12/14/2022] Open
Abstract
Background The severe form of COVID-19 can cause a dysregulated host immune syndrome that might lead patients to death. To understand the underlying immune mechanisms that contribute to COVID-19 disease we have examined 28 different biomarkers in two cohorts of COVID-19 patients, aiming to systematically capture, quantify, and algorithmize how immune signals might be associated to the clinical outcome of COVID-19 patients. Methods The longitudinal concentration of 28 biomarkers of 95 COVID-19 patients was measured. We performed a dimensionality reduction analysis to determine meaningful biomarkers for explaining the data variability. The biomarkers were used as input of artificial neural network, random forest, classification and regression trees, k-nearest neighbors and support vector machines. Two different clinical cohorts were used to grant validity to the findings. Results We benchmarked the classification capacity of two COVID-19 clinicals studies with different models and found that artificial neural networks was the best classifier. From it, we could employ different sets of biomarkers to predict the clinical outcome of COVID-19 patients. First, all the biomarkers available yielded a satisfactory classification. Next, we assessed the prediction capacity of each protein separated. With a reduced set of biomarkers, our model presented 94% accuracy, 96.6% precision, 91.6% recall, and 95% of specificity upon the testing data. We used the same model to predict 83% and 87% (recovered and deceased) of unseen data, granting validity to the results obtained. Conclusions In this work, using state-of-the-art computational techniques, we systematically identified an optimal set of biomarkers that are related to a prediction capacity of COVID-19 patients. The screening of such biomarkers might assist in understanding the underlying immune response towards inflammatory diseases.
Collapse
Affiliation(s)
- Gustavo Martinez
- Immunology, Shantou University, Shantou, GD, China,Microbiology and Immunology, Dalhousie University, Halifax, Nova Scotia, Canada
| | - Alexis Garduno
- Department of Clinical Medicine, University of Dublin, Trinity College, Dublin, Ireland
| | | | | | - Ann Avery
- Division of Infectious Diseases, MetroHealth Medical Center, Cleveland, OH, United States of America
| | - Scheila de Avila e Silva
- Department of Biotechnology, Universidade de Caxias do Sul, Caxias do Sul, Rio Grande do Sul, Brazil
| | - Rachael Cusack
- Department of Clinical Medicine, University of Dublin, Trinity College, Dublin, Ireland
| | - Cheryl Cameron
- Department of Nutrition, Case Western Reserve University, Cleveland, OH, United States of America
| | - Mark Cameron
- Department of Population & Quantitative Health Sciences, Case Western Reserve University, Cleveland, OH, United States of America
| | | | - David Kelvin
- Immunology, Shantou University, Shantou, GD, China,Microbiology and Immunology, Dalhousie University, Halifax, Nova Scotia, Canada
| |
Collapse
|
21
|
Taha MA, Alsaidi SAAA, Hussein RA. Machine Learning Techniques for Predicting Heart Diseases. 2022 INTERNATIONAL SYMPOSIUM ON INNOVATIVE INFORMATICS OF BISKRA (ISNIB) 2022. [DOI: 10.1109/isnib57382.2022.10076238] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/01/2023]
Affiliation(s)
- Mohammed A. Taha
- Ministry of Education, Babylon Education Directorates,Babylon,Iraq
| | | | - Reem Ali Hussein
- Univirsity of Technology,Laser and Optoelectronics Eng. Dep.,Baghdad,Iraq
| |
Collapse
|
22
|
|
23
|
Deng T, Huang Y, Yang G, Wang C. Pointwise mutual information sparsely embedded feature selection. Int J Approx Reason 2022. [DOI: 10.1016/j.ijar.2022.09.012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
|
24
|
Kadhim YA, Khan MU, Mishra A. Deep Learning-Based Computer-Aided Diagnosis (CAD): Applications for Medical Image Datasets. SENSORS (BASEL, SWITZERLAND) 2022; 22:s22228999. [PMID: 36433595 PMCID: PMC9692938 DOI: 10.3390/s22228999] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/23/2022] [Revised: 11/17/2022] [Accepted: 11/18/2022] [Indexed: 05/26/2023]
Abstract
Computer-aided diagnosis (CAD) has proved to be an effective and accurate method for diagnostic prediction over the years. This article focuses on the development of an automated CAD system with the intent to perform diagnosis as accurately as possible. Deep learning methods have been able to produce impressive results on medical image datasets. This study employs deep learning methods in conjunction with meta-heuristic algorithms and supervised machine-learning algorithms to perform an accurate diagnosis. Pre-trained convolutional neural networks (CNNs) or auto-encoder are used for feature extraction, whereas feature selection is performed using an ant colony optimization (ACO) algorithm. Ant colony optimization helps to search for the best optimal features while reducing the amount of data. Lastly, diagnosis prediction (classification) is achieved using learnable classifiers. The novel framework for the extraction and selection of features is based on deep learning, auto-encoder, and ACO. The performance of the proposed approach is evaluated using two medical image datasets: chest X-ray (CXR) and magnetic resonance imaging (MRI) for the prediction of the existence of COVID-19 and brain tumors. Accuracy is used as the main measure to compare the performance of the proposed approach with existing state-of-the-art methods. The proposed system achieves an average accuracy of 99.61% and 99.18%, outperforming all other methods in diagnosing the presence of COVID-19 and brain tumors, respectively. Based on the achieved results, it can be claimed that physicians or radiologists can confidently utilize the proposed approach for diagnosing COVID-19 patients and patients with specific brain tumors.
Collapse
Affiliation(s)
- Yezi Ali Kadhim
- Department of Modeling and Design of Engineering Systems (MODES), Atilim University, Ankara 06830, Turkey
- Department of Electrical and Electronics Engineering, Atilim University, Incek, Ankara 06830, Turkey
| | - Muhammad Umer Khan
- Department of Mechatronics Engineering, Atilim University, Incek, Ankara 06830, Turkey
| | - Alok Mishra
- Department of Software Engineering, Atilim University, Incek, Ankara 06830, Turkey
- Informatics and Digitalization Group, Molde University College—Specialized University in Logistics, 6410 Molde, Norway
| |
Collapse
|
25
|
Chakraborty M, Biswas SK. Computer-Aided Heart Disease Diagnosis Using Recursive Rule Extraction Algorithms from Neural Networks. INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS 2022. [DOI: 10.1142/s1469026822500110] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Mortality rate due to fatal heart disease (HD) or cardiovascular disease (CVD) has increased drastically over the world in recent decades. HD is a very hazardous problem prevailing among people which is treatable if detected early. But in most of the cases, the disease is not diagnosed until it becomes severe. Hence, it is requisite to develop an effective system which can accurately diagnosis HD and provide a concise description for the underlying causes [risk factors (RFs)] of the disease, so that in future HD can be controlled only by managing the primary RFs. Recently, researchers are using various machine learning algorithms for HD diagnosis, and neural network (NN) is one among them which has attracted tons of people because of its high performance. But the main obstacle with a NN is its black-box nature, i.e., its incapability in explaining the decisions. So, as a solution to this pitfall, the rule extraction algorithms can be very effective as they can extract explainable decision rules from NNs with high prediction accuracies. Many neural-based rule extraction algorithms have been applied successfully in various medical diagnosis problems. This study assesses the performance of rule extraction algorithms for HD diagnosis, particularly those that construct rules recursively from NNs. Because they subdivide a rule’s subspace until the accuracy improves, recursive algorithms are known for delivering interpretable decisions with high accuracy. The recursive rule extraction algorithms’ efficacy in HD diagnosis is demonstrated by the results. Along with the significant data ranges for the primary RFs, a maximum accuracy of 82.59% is attained.
Collapse
Affiliation(s)
- Manomita Chakraborty
- School of Computer Science and Engineering, VIT-AP, Amaravati 522237, Andhra Pradesh, India
| | - Saroj Kumar Biswas
- Department of Computer Science & Engineering, NIT Silchar, Silchar 788010, Assam, India
| |
Collapse
|
26
|
Enhancement of Detection of Diabetic Retinopathy Using Harris Hawks Optimization with Deep Learning Model. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:8512469. [PMID: 35665292 PMCID: PMC9162819 DOI: 10.1155/2022/8512469] [Citation(s) in RCA: 20] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/04/2022] [Accepted: 04/29/2022] [Indexed: 02/01/2023]
Abstract
In today's world, diabetic retinopathy is a very severe health issue, which is affecting many humans of different age groups. Due to the high levels of blood sugar, the minuscule blood vessels in the retina may get damaged in no time and further may lead to retinal detachment and even sometimes lead to glaucoma blindness. If diabetic retinopathy can be diagnosed at the early stages, then many of the affected people will not be losing their vision and also human lives can be saved. Several machine learning and deep learning methods have been applied on the available data sets of diabetic retinopathy, but they were unable to provide the better results in terms of accuracy in preprocessing and optimizing the classification and feature extraction process. To overcome the issues like feature extraction and optimization in the existing systems, we have considered the Diabetic Retinopathy Debrecen Data Set from the UCI machine learning repository and designed a deep learning model with principal component analysis (PCA) for dimensionality reduction, and to extract the most important features, Harris hawks optimization algorithm is used further to optimize the classification and feature extraction process. The results shown by the deep learning model with respect to specificity, precision, accuracy, and recall are very much satisfactory compared to the existing systems.
Collapse
|
27
|
Deepika D, Balaji N. Effective heart disease prediction with Grey-wolf with Firefly algorithm-differential evolution (GF-DE) for feature selection and weighted ANN classification. Comput Methods Biomech Biomed Engin 2022; 25:1409-1427. [PMID: 35652537 DOI: 10.1080/10255842.2022.2078966] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]
Abstract
In recent time, heart disease has become common leading to mortality of many individuals. Hence, early and accurate prediction of this disease is vital to reduce death rate and enhance people's lives. Concurrently, Artificial Intelligence has gained more attention at present as it permits deeper understanding of the healthcare data thereby providing accurate prediction results. This efficient prediction will solve complicated queries regarding heart diseases and hence assists clinical practitioners to adopt smart medical decisions. Hence, this study intends to predict heart disease with high accuracy by proposing an improved feature selection and enhanced classification approach. The paper employs Grey-wolf with Firefly algorithm for effective feature selection and using Differential Evolution Algorithm for tuning the hyper parameters of Artificial Neural Network (ANN). Hence, it is named as Grey Wolf Firefly algorithm with Differential Evolution (GF-DE) for better classification of the selected features. This proposed classification model trains the neural network to obtain optimal weights and tunes huge number of hyper parameters in an efficiently. To prove this, the proposed system is comparatively analysed with existing methods in terms of performance metrics like accuracy, precision, recall and F1 score for Cleveland and Statlog dataset. In addition, statistical analysis is also undertaken to analyse the significance of proposed system. Outcomes revealed the efficiency of proposed method which makes it highly suitable for heart disease prediction in an efficient manner.
Collapse
Affiliation(s)
- D Deepika
- Research Scholar, Anna University, Chennai, India
| | - N Balaji
- Professor, Computer Science and Engineering, Velammal Institute of Technology, Chennai, India
| |
Collapse
|
28
|
Kaur J, Khehra BS. Fuzzy Logic and Hybrid based Approaches for the Risk of Heart Disease Detection: State-of-the-Art Review. JOURNAL OF THE INSTITUTION OF ENGINEERS (INDIA): SERIES B 2022. [PMCID: PMC8328141 DOI: 10.1007/s40031-021-00644-z] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
Artificial Intelligence, Machine Learning, Fuzzy Logic, Neural Network, Genetic Algorithm and their hybrid systems play vital role in the medical sciences to diagnose various diseases efficiently in the patients. The problems related to the heart are widely comon in today’s world. The risk of heart failure develops due to the narrowness and blockage in the coronary arteries of the heart as excess cholesterol deposits in the arteries and blood vessels that results in fatigue, chest pain, dyspnoea, sleeping difficulties and depression. This research aims to explore diverse work done on FL and Hybrid-based techniques to identify the risk of heart disease among the patients. The present study reveals publications along with the strength, operating system, accuracy rate and other specifications used in the identification of heart disease based on FL and Hybrid-based approaches since 2010. This survey contributes motivation for research scholars to generate more innovative ideas and continue their research work in the respective field. Moreover, the future model for direct service of the patients from old age homes to the Intensive Care Unit through ambulance services is also presented in this paper.
Collapse
Affiliation(s)
- Jagmohan Kaur
- IK Gujral Punjab Technical University, Jalandhar, Punjab India
| | | |
Collapse
|
29
|
|
30
|
Chen Z, Shi J, Pommier T, Cottin Y, Salomon M, Decourselle T, Lalande A, Couturier R. Prediction of Myocardial Infarction From Patient Features With Machine Learning. Front Cardiovasc Med 2022; 9:754609. [PMID: 35369326 PMCID: PMC8964399 DOI: 10.3389/fcvm.2022.754609] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2021] [Accepted: 02/02/2022] [Indexed: 11/24/2022] Open
Abstract
This study proposes machine learning-based models to automatically evaluate the severity of myocardial infarction (MI) from physiological, clinical, and paraclinical features. Two types of machine learning models are investigated for the MI assessment: the classification models classify the presence of the infarct and the persistent microvascular obstruction (PMO), and the regression models quantify the Percentage of Infarcted Myocardium (PIM) of patients suspected of having an acute MI during their reception in the emergency department. The ground truth labels for these supervised models are derived from the corresponding Delayed Enhancement MRI (DE-MRI) exams and manual annotations of the myocardium and scar tissues. Experiments were conducted on 150 cases and evaluated with cross-validation. Results showed that for the MI (PMO inclusive) and the PMO (infarct exclusive), the best models obtained respectively a mean error of 0.056 and 0.012 for the quantification, and 88.67 and 77.33% for the classification accuracy of the state of the myocardium. The study of the features' importance also revealed that the troponin value had the strongest correlation to the severity of the MI among the 12 selected features. For the proposal's translational perspective, in cardiac emergencies, qualitative and quantitative analysis can be obtained prior to the achievement of MRI by relying only on conventional tests and patient features, thus, providing an objective reference for further treatment by physicians.
Collapse
Affiliation(s)
- Zhihao Chen
- FEMTO-ST Institute, UMR 6174 CNRS, Univ. Bourgogne Franche-Comté, Belfort, France
| | - Jixi Shi
- FEMTO-ST Institute, UMR 6174 CNRS, Univ. Bourgogne Franche-Comté, Belfort, France
- IRSEEM, EA 4353, ESIGELEC, Univ. Normandie, Saint-Étienne-du-Rouvray, France
| | - Thibaut Pommier
- Department of Cardiology, University Hospital of Dijon, Dijon, France
| | - Yves Cottin
- Department of Cardiology, University Hospital of Dijon, Dijon, France
| | - Michel Salomon
- FEMTO-ST Institute, UMR 6174 CNRS, Univ. Bourgogne Franche-Comté, Belfort, France
| | | | - Alain Lalande
- Department of Medical Imaging, University Hospital of Dijon, Dijon, France
- ImViA Laboratory, EA 7535, Univ. Bourgogne Franche-Comté, Dijon, France
| | - Raphaël Couturier
- FEMTO-ST Institute, UMR 6174 CNRS, Univ. Bourgogne Franche-Comté, Belfort, France
| |
Collapse
|
31
|
Machine Learning Technology-Based Heart Disease Detection Models. JOURNAL OF HEALTHCARE ENGINEERING 2022; 2022:7351061. [PMID: 35265303 PMCID: PMC8898839 DOI: 10.1155/2022/7351061] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/20/2021] [Revised: 01/22/2022] [Accepted: 02/04/2022] [Indexed: 11/17/2022]
Abstract
At present, a multifaceted clinical disease known as heart failure disease can affect a greater number of people in the world. In the early stages, to evaluate and diagnose the disease of heart failure, cardiac centers and hospitals are heavily based on ECG. The ECG can be considered as a regular tool. Heart disease early detection is a critical concern in healthcare services (HCS). This paper presents the different machine learning technologies based on heart disease detection brief analysis. Firstly, Naïve Bayes with a weighted approach is used for predicting heart disease. The second one, according to the features of frequency domain, time domain, and information theory, is automatic and analyze ischemic heart disease localization/detection. Two classifiers such as support vector machine (SVM) with XGBoost with the best performance are selected for the classification in this method. The third one is the heart failure automatic identification method by using an improved SVM based on the duality optimization scheme also analyzed. Finally, for a clinical decision support system (CDSS), an effective heart disease prediction model (HDPM) is used, which includes density-based spatial clustering of applications with noise (DBSCAN) for outlier detection and elimination, a hybrid synthetic minority over-sampling technique-edited nearest neighbor (SMOTE-ENN) for balancing the training data distribution, and XGBoost for heart disease prediction. Machine learning can be applied in the medical industry for disease diagnosis, detection, and prediction. The major purpose of this paper is to give clinicians a tool to help them diagnose heart problems early on. As a result, it will be easier to treat patients effectively and avoid serious repercussions. This study uses XGBoost to test alternative decision tree classification algorithms in the hopes of improving the accuracy of heart disease diagnosis. In terms of precision, accuracy, f1-measure, and recall as performance parameters above mentioned, four types of machine learning (ML) models are compared.
Collapse
|
32
|
Prasanna SL, Challa NP. Heart Disease Prediction Using Optimal Mayfly Technique with Ensemble Models. INTERNATIONAL JOURNAL OF SWARM INTELLIGENCE RESEARCH 2022. [DOI: 10.4018/ijsir.313665] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
This paper proposes a methodology consisting of two phases: attributes selection and classification based on the attributes selected. Phase 1 uses the introduced new feature selection algorithm which is the optimal mayfly algorithm (OMA) to solve the feature selection technique problem. Mayfly algorithm has derived features of physiological and anatomical relevance, like ST depression, the highest heart rate, cholesterol, chest pain, and heart vessels. In the second phase, the selected attributes use the ensemble classifiers like random subspace, bagging, and boosting. Optimal mayfly algorithm (OMA) with boosting technique had the highest accuracy. Therefore, true disease, false disease, accuracy, and specificity are measured to evaluate the proposed system's efficiency. It has been discovered that the proposed method, which combines feature selection and ensemble techniques performs well, the performance of the optimal mayfly algorithm along with ensemble classifiers of boosting method with a model accuracy of 97.12% which is the highest accuracy value compared to any single model.
Collapse
|
33
|
Comparative Study on Heart Disease Prediction Using Feature Selection Techniques on Classification Algorithms. APPLIED COMPUTATIONAL INTELLIGENCE AND SOFT COMPUTING 2021. [DOI: 10.1155/2021/5581806] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Heart disease is recognized as one of the leading factors of death rate worldwide. Biomedical instruments and various systems in hospitals have massive quantities of clinical data. Therefore, understanding the data related to heart disease is very important to improve prediction accuracy. This article has conducted an experimental evaluation of the performance of models created using classification algorithms and relevant features selected using various feature selection approaches. For results of the exploratory analysis, ten feature selection techniques, i.e., ANOVA, Chi-square, mutual information, ReliefF, forward feature selection, backward feature selection, exhaustive feature selection, recursive feature elimination, Lasso regression, and Ridge regression, and six classification approaches, i.e., decision tree, random forest, support vector machine, K-nearest neighbor, logistic regression, and Gaussian naive Bayes, have been applied to Cleveland heart disease dataset. The feature subset selected by the backward feature selection technique has achieved the highest classification accuracy of 88.52%, precision of 91.30%, sensitivity of 80.76%, and f-measure of 85.71% with the decision tree classifier.
Collapse
|
34
|
Velswamy K, Velswamy R, Swamidason ITJ, Chinnaiyan S. Classification model for heart disease prediction with feature selection through modified bee algorithm. Soft comput 2021. [DOI: 10.1007/s00500-021-06330-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
35
|
Reddy KVV, Elamvazuthi I, Aziz AA, Paramasivam S, Chua HN. Heart Disease Risk Prediction using Machine Learning with Principal Component Analysis. 2020 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT AND ADVANCED SYSTEMS (ICIAS) 2021. [DOI: 10.1109/icias49414.2021.9642676] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/02/2023]
|
36
|
Rasheed J, Jamil A, Hameed AA, Al-Turjman F, Rasheed A. COVID-19 in the Age of Artificial Intelligence: A Comprehensive Review. Interdiscip Sci 2021; 13:153-175. [PMID: 33886097 PMCID: PMC8060789 DOI: 10.1007/s12539-021-00431-w] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2020] [Revised: 04/03/2021] [Accepted: 04/09/2021] [Indexed: 12/23/2022]
Abstract
The recent COVID-19 pandemic, which broke at the end of the year 2019 in Wuhan, China, has infected more than 98.52 million people by today (January 23, 2021) with over 2.11 million deaths across the globe. To combat the growing pandemic on urgent basis, there is need to design effective solutions using new techniques that could exploit recent technology, such as machine learning, deep learning, big data, artificial intelligence, Internet of Things, for identification and tracking of COVID-19 cases in near real time. These technologies have offered inexpensive and rapid solution for proper screening, analyzing, prediction and tracking of COVID-19 positive cases. In this paper, a detailed review of the role of AI as a decisive tool for prognosis, analyze, and tracking the COVID-19 cases is performed. We searched various databases including Google Scholar, IEEE Library, Scopus and Web of Science using a combination of different keywords consisting of COVID-19 and AI. We have identified various applications, where AI can help healthcare practitioners in the process of identification and monitoring of COVID-19 cases. A compact summary of the corona virus cases are first highlighted, followed by the application of AI. Finally, we conclude the paper by highlighting new research directions and discuss the research challenges. Even though scientists and researchers have gathered and exchanged sufficient knowledge over last couple of months, but this structured review also examined technological perspectives while encompassing the medical aspect to help the healthcare practitioners, policymakers, decision makers, policymakers, AI scientists and virologists to quell this infectious COVID-19 pandemic outbreak.
Collapse
Affiliation(s)
- Jawad Rasheed
- Department of Computer Engineering, Istanbul Aydin University, Istanbul, 34295, Turkey.
| | - Akhtar Jamil
- Department of Computer Engineering, Istanbul Sabahattin Zaim University, Istanbul, 34303, Turkey
| | - Alaa Ali Hameed
- Department of Computer Engineering, Istanbul Sabahattin Zaim University, Istanbul, 34303, Turkey
| | - Fadi Al-Turjman
- Artificial Intelligence Engineering Department, Research Center for AI and IoT, Near East University, Nicosia, Mersin 10, Turkey
| | - Ahmad Rasheed
- Department of Electrical and Electronics Engineering, Eastern Mediterranean University, Famagusta, Mersin 10, Turkey
| |
Collapse
|
37
|
Patel J, Ladani A, Sambamoorthi N, LeMasters T, Dwibedi N, Sambamoorthi U. Predictors of Co-occurring Cardiovascular and Gastrointestinal Disorders among Elderly with Osteoarthritis. OSTEOARTHRITIS AND CARTILAGE OPEN 2021; 3:100148. [DOI: 10.1016/j.ocarto.2021.100148] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2020] [Accepted: 02/22/2021] [Indexed: 01/22/2023] Open
|
38
|
Nandy S, Adhikari M, Balasubramanian V, Menon VG, Li X, Zakarya M. An intelligent heart disease prediction system based on swarm-artificial neural network. Neural Comput Appl 2021. [DOI: 10.1007/s00521-021-06124-1] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
39
|
Uddin MN, Halder RK. An ensemble method based multilayer dynamic system to predict cardiovascular disease using machine learning approach. INFORMATICS IN MEDICINE UNLOCKED 2021. [DOI: 10.1016/j.imu.2021.100584] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022] Open
|
40
|
Performance analysis of cost-sensitive learning methods with application to imbalanced medical data. INFORMATICS IN MEDICINE UNLOCKED 2021. [DOI: 10.1016/j.imu.2021.100690] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
|
41
|
Bharti R, Khamparia A, Shabaz M, Dhiman G, Pande S, Singh P. Prediction of Heart Disease Using a Combination of Machine Learning and Deep Learning. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2021; 2021:8387680. [PMID: 34306056 PMCID: PMC8266441 DOI: 10.1155/2021/8387680] [Citation(s) in RCA: 48] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/16/2021] [Revised: 06/15/2021] [Accepted: 06/21/2021] [Indexed: 02/07/2023]
Abstract
The correct prediction of heart disease can prevent life threats, and incorrect prediction can prove to be fatal at the same time. In this paper different machine learning algorithms and deep learning are applied to compare the results and analysis of the UCI Machine Learning Heart Disease dataset. The dataset consists of 14 main attributes used for performing the analysis. Various promising results are achieved and are validated using accuracy and confusion matrix. The dataset consists of some irrelevant features which are handled using Isolation Forest, and data are also normalized for getting better results. And how this study can be combined with some multimedia technology like mobile devices is also discussed. Using deep learning approach, 94.2% accuracy was obtained.
Collapse
Affiliation(s)
- Rohit Bharti
- 1School of Computer Science and Engineering, Lovely Professional University, Phagwara, India
| | - Aditya Khamparia
- 2Department of Computer Science, Babasaheb Bhimrao Ambedkar University, Lucknow, India
| | | | - Gaurav Dhiman
- 4Department of Computer Science, Government Bikram College of Commerce, Patiala, India
| | - Sagar Pande
- 1School of Computer Science and Engineering, Lovely Professional University, Phagwara, India
| | - Parneet Singh
- 5All India Institute of Medical Science, Rishikesh, India
| |
Collapse
|
42
|
Nkiruka O, Prasad R, Clement O. Prediction of malaria incidence using climate variability and machine learning. INFORMATICS IN MEDICINE UNLOCKED 2021. [DOI: 10.1016/j.imu.2020.100508] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
|
43
|
Kumar PR, Ravichandran S, Narayana S. Ensemble classification technique for heart disease prediction with meta-heuristic-enabled training system. BIO-ALGORITHMS AND MED-SYSTEMS 2020. [DOI: 10.1515/bams-2020-0033] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Abstract
Objectives
This research work exclusively aims to develop a novel heart disease prediction framework including three major phases, namely proposed feature extraction, dimensionality reduction, and proposed ensemble-based classification.
Methods
As the novelty, the training of NN is carried out by a new enhanced optimization algorithm referred to as Sea Lion with Canberra Distance (S-CDF) via tuning the optimal weights. The improved S-CDF algorithm is the extended version of the existing “Sea Lion Optimization (SLnO)”. Initially, the statistical and higher-order statistical features are extracted including central tendency, degree of dispersion, and qualitative variation, respectively. However, in this scenario, the “curse of dimensionality” seems to be the greatest issue, such that there is a necessity of dimensionality reduction in the extracted features. Hence, the principal component analysis (PCA)-based feature reduction approach is deployed here. Finally, the dimensional concentrated features are fed as the input to the proposed ensemble technique with “Support Vector Machine (SVM), Random Forest (RF), K-Nearest Neighbor (KNN)” with optimized Neural Network (NN) as the final classifier.
Results
An elaborative analyses as well as discussion have been provided by concerning the parameters, like evaluation metrics, year of publication, accuracy, implementation tool, and utilized datasets obtained by various techniques.
Conclusions
From the experiment outcomes, it is proved that the accuracy of the proposed work with the proposed feature set is 5, 42.85, and 10% superior to the performance with other feature sets like central tendency + dispersion feature, central tendency qualitative variation, and dispersion qualitative variation, respectively.
Results
Finally, the comparative evaluation shows that the presented work is appropriate for heart disease prediction as it has high accuracy than the traditional works.
Collapse
Affiliation(s)
- Parvathaneni Rajendra Kumar
- Department of Information Technology, Faculty of Engineering and Technology , Annamalai University Annamalainagar - 608002 , Tamil Nadu , India
| | - Suban Ravichandran
- Department of Information Technology, Faculty of Engineering and Technology , Annamalai University Annamalainagar - 608002 , Tamil Nadu , India
| | | |
Collapse
|
44
|
Lalmuanawma S, Hussain J, Chhakchhuak L. Applications of machine learning and artificial intelligence for Covid-19 (SARS-CoV-2) pandemic: A review. CHAOS, SOLITONS, AND FRACTALS 2020; 139:110059. [PMID: 32834612 PMCID: PMC7315944 DOI: 10.1016/j.chaos.2020.110059] [Citation(s) in RCA: 289] [Impact Index Per Article: 72.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/10/2020] [Accepted: 06/23/2020] [Indexed: 05/17/2023]
Abstract
BACKGROUND AND OBJECTIVE During the recent global urgency, scientists, clinicians, and healthcare experts around the globe keep on searching for a new technology to support in tackling the Covid-19 pandemic. The evidence of Machine Learning (ML) and Artificial Intelligence (AI) application on the previous epidemic encourage researchers by giving a new angle to fight against the novel Coronavirus outbreak. This paper aims to comprehensively review the role of AI and ML as one significant method in the arena of screening, predicting, forecasting, contact tracing, and drug development for SARS-CoV-2 and its related epidemic. METHOD A selective assessment of information on the research article was executed on the databases related to the application of ML and AI technology on Covid-19. Rapid and critical analysis of the three crucial parameters, i.e., abstract, methodology, and the conclusion was done to relate to the model's possibilities for tackling the SARS-CoV-2 epidemic. RESULT This paper addresses on recent studies that apply ML and AI technology towards augmenting the researchers on multiple angles. It also addresses a few errors and challenges while using such algorithms in real-world problems. The paper also discusses suggestions conveying researchers on model design, medical experts, and policymakers in the current situation while tackling the Covid-19 pandemic and ahead. CONCLUSION The ongoing development in AI and ML has significantly improved treatment, medication, screening, prediction, forecasting, contact tracing, and drug/vaccine development process for the Covid-19 pandemic and reduce the human intervention in medical practice. However, most of the models are not deployed enough to show their real-world operation, but they are still up to the mark to tackle the SARS-CoV-2 epidemic.
Collapse
Affiliation(s)
- Samuel Lalmuanawma
- Department of Mathematics & Computer Science, Mizoram University, Tanhril, Aizawl, Mizoram, 796004, India
| | - Jamal Hussain
- Department of Mathematics & Computer Science, Mizoram University, Tanhril, Aizawl, Mizoram, 796004, India
| | | |
Collapse
|