1
|
Almalki J, Alshahrani SM, Khan NA. A comprehensive secure system enabling healthcare 5.0 using federated learning, intrusion detection and blockchain. PeerJ Comput Sci 2024; 10:e1778. [PMID: 38259900 PMCID: PMC10803090 DOI: 10.7717/peerj-cs.1778] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2023] [Accepted: 12/05/2023] [Indexed: 01/24/2024]
Abstract
Recently, the use of the Internet of Medical Things (IoMT) has gained popularity across various sections of the health sector. The historical security risks of IoMT devices themselves and the data flowing from them are major concerns. Deploying many devices, sensors, services, and networks that connect the IoMT systems is gaining popularity. This study focuses on identifying the use of blockchain in innovative healthcare units empowered by federated learning. A collective use of blockchain with intrusion detection management (IDM) is beneficial to detect and prevent malicious activity across the storage nodes. Data accumulated at a centralized storage node is analyzed with the help of machine learning algorithms to diagnose disease and allow appropriate medication to be prescribed by a medical healthcare professional. The model proposed in this study focuses on the effective use of such models for healthcare monitoring. The amalgamation of federated learning and the proposed model makes it possible to reach 93.89 percent accuracy for disease analysis and addiction. Further, intrusion detection ensures a success rate of 97.13 percent in this study.
Collapse
Affiliation(s)
- Jameel Almalki
- Department of Computer Science, College of Computer in Al-Lith, Umm Al-Qura University, Makkah, Makkah, Saudi Arabia
| | - Saeed M. Alshahrani
- Department of Computer Science, College of Computing and Information Technology, Shaqra University, Shaqra, Riyadh, Saudi Arabia
| | - Nayyar Ahmed Khan
- Department of Computer Science, College of Computing and Information Technology, Shaqra University, Shaqra, Riyadh, Saudi Arabia
| |
Collapse
|
2
|
Huang R, Ma S, Dai S, Zheng J. Application of Data Fusion in Traditional Chinese Medicine: A Review. SENSORS (BASEL, SWITZERLAND) 2023; 24:106. [PMID: 38202967 PMCID: PMC10781265 DOI: 10.3390/s24010106] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/01/2023] [Revised: 12/22/2023] [Accepted: 12/22/2023] [Indexed: 01/12/2024]
Abstract
Traditional Chinese medicine is characterized by numerous chemical constituents, complex components, and unpredictable interactions among constituents. Therefore, a single analytical technique is usually unable to obtain comprehensive chemical information. Data fusion is an information processing technology that can improve the accuracy of test results by fusing data from multiple devices, which has a broad application prospect by utilizing chemometrics methods, adopting low-level, mid-level, and high-level data fusion techniques, and establishing final classification or prediction models. This paper summarizes the current status of the application of data fusion strategies based on spectroscopy, mass spectrometry, chromatography, and sensor technologies in traditional Chinese medicine (TCM) in light of the latest research progress of data fusion technology at home and abroad. It also gives an outlook on the development of data fusion technology in TCM analysis to provide references for the research and development of TCM.
Collapse
Affiliation(s)
- Rui Huang
- National Institutes for Food and Drug Control, Beijing 102629, China; (R.H.); (S.M.)
- School of Traditional Chinese Pharmacy, China Pharmaceutical University, Nanjing 211198, China
| | - Shuangcheng Ma
- National Institutes for Food and Drug Control, Beijing 102629, China; (R.H.); (S.M.)
| | - Shengyun Dai
- National Institutes for Food and Drug Control, Beijing 102629, China; (R.H.); (S.M.)
| | - Jian Zheng
- National Institutes for Food and Drug Control, Beijing 102629, China; (R.H.); (S.M.)
| |
Collapse
|
3
|
Wu Y, Zhang L, Bhatti UA, Huang M. Interpretable Machine Learning for Personalized Medical Recommendations: A LIME-Based Approach. Diagnostics (Basel) 2023; 13:2681. [PMID: 37627940 PMCID: PMC10453635 DOI: 10.3390/diagnostics13162681] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2023] [Revised: 07/10/2023] [Accepted: 08/10/2023] [Indexed: 08/27/2023] Open
Abstract
Chronic diseases are increasingly major threats to older persons, seriously affecting their physical health and well-being. Hospitals have accumulated a wealth of health-related data, including patients' test reports, treatment histories, and diagnostic records, to better understand patients' health, safety, and disease progression. Extracting relevant information from this data enables physicians to provide personalized patient-treatment recommendations. While collaborative filtering techniques and classical algorithms such as naive Bayes, logistic regression, and decision trees have had notable success in health-recommendation systems, most current systems primarily inform users of their likely preferences without providing explanations. This paper proposes an approach of deep learning with a local interpretable model-agnostic explanations (LIME)-based interpretable recommendation system to solve this problem. Specifically, we apply the proposed approach to two chronic diseases common in older adults: heart disease and diabetes. After data preprocessing, we use six deep-learning algorithms to form interpretations. In the heart-disease data set, the actual model recommendation of multi-layer perceptron and gradient-boosting algorithm differs from the local model's recommendation of LIME, which can be used as its approximate prediction. From the feature importance of these two algorithms, it can be seen that the CholCheck, GenHith, and HighBP features are the most important for predicting heart disease. In the diabetes data set, the actual model predictions of the multi-layer perceptron and logistic-regression algorithm were little different from the local model's prediction of LIME, which can be used as its approximate recommendation. Moreover, from the feature importance of the two algorithms, it can be seen that the three features of glucose, BMI, and age were the most important for predicting heart disease. Next, LIME is used to determine the importance of each feature that affected the results of the calculated model. Subsequently, we present the contribution coefficients of these features to the final recommendation. By analyzing the impact of different patient characteristics on the recommendations, our proposed system elucidates the underlying reasons behind these recommendations and enhances patient trust. This approach has important implications for medical recommendation systems and encourages informed decision-making in healthcare.
Collapse
Affiliation(s)
| | | | - Uzair Aslam Bhatti
- School of Information and Communication Engineering, Hainan University, Haikou 570100, China; (Y.W.); (L.Z.)
| | - Mengxing Huang
- School of Information and Communication Engineering, Hainan University, Haikou 570100, China; (Y.W.); (L.Z.)
| |
Collapse
|
4
|
Ragab M, Binyamin SS. Intelligent Energy-Aware Thermal Exchange Optimization with Deep Learning Model for IoT-Enabled Smart Healthcare. JOURNAL OF HEALTHCARE ENGINEERING 2023; 2023:3830857. [PMID: 37483302 PMCID: PMC10361830 DOI: 10.1155/2023/3830857] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/06/2022] [Revised: 04/14/2022] [Accepted: 04/27/2022] [Indexed: 07/25/2023]
Abstract
In recent years, Internet of Things (IoT) and advanced sensor technologies have gained considerable interest in linking different medical devices, patients, and healthcare professionals to improve the quality of medical services in a cost-effective manner. The evolution of the smart healthcare sector has considerably enhanced patient safety, accessibility, and operational competence while minimizing the costs incurred in healthcare services. In this background, the current study develops intelligent energy-aware thermal exchange optimization with deep learning (IEA-TEODL) model for IoT-enabled smart healthcare. The aim of the proposed IEA-TOEDL technique is to group the IoT devices into clusters and make decisions in the smart healthcare sector. The proposed IEA-TEODL technique constructs clusters using the energy-aware chaotic thermal exchange optimization-based clustering (EACTEO-C) scheme. In addition, the disease diagnosis model also intends to classify the collected healthcare data as either presence or absence of the disease. To accomplish this, the proposed IEA-TODL technique involves several subprocesses such as preprocessing, K-medoid clustering-based outlier removal, multihead attention bidirectional long short-term memory (MHA-BLSTM), and weighted salp swarm algorithm (WSSA). The utilization of outlier removal and WSSA-based hyperparameter tuning process assist in achieving enhanced classification outcomes. In order to demonstrate the enhanced outcomes of the IEA-TEODL approach, a wide range of simulations was conducted against benchmark datasets. The simulation results inferred the enhanced outcomes of the IEA-TEODL technique over recent techniques under distinct evaluation metrics.
Collapse
Affiliation(s)
- Mahmoud Ragab
- Information Technology Department, Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah 21589, Saudi Arabia
- Department, Faculty of Science, Al-Azhar University, Nasr City 11884, Cairo, Egypt
- Centre for Artificial Intelligence in Precision Medicines, King Abdulaziz University, Jeddah 21589, Saudi Arabia
| | - Sami Saeed Binyamin
- Computer and Information Technology Department, The Applied College, King Abdulaziz University, Jeddah 21589, Saudi Arabia
| |
Collapse
|
5
|
Outranking Relations based Multi-criteria Recommender System for Analysis of Health Risk using Multi-objective Feature Selection Approach. DATA KNOWL ENG 2023. [DOI: 10.1016/j.datak.2023.102144] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
|
6
|
Menon SP, Shukla PK, Sethi P, Alasiry A, Marzougui M, Alouane MTH, Khan AA. An Intelligent Diabetic Patient Tracking System Based on Machine Learning for E-Health Applications. SENSORS (BASEL, SWITZERLAND) 2023; 23:3004. [PMID: 36991714 PMCID: PMC10052330 DOI: 10.3390/s23063004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/23/2022] [Revised: 02/19/2023] [Accepted: 03/07/2023] [Indexed: 06/19/2023]
Abstract
BACKGROUND Continuous surveillance helps people with diabetes live better lives. A wide range of technologies, including the Internet of Things (IoT), modern communications, and artificial intelligence (AI), can assist in lowering the expense of health services. Due to numerous communication systems, it is now possible to provide customized and distant healthcare. MAIN PROBLEM Healthcare data grows daily, making storage and processing challenging. We provide intelligent healthcare structures for smart e-health apps to solve the aforesaid problem. The 5G network must offer advanced healthcare services to meet important requirements like large bandwidth and excellent energy efficacy. METHODOLOGY This research suggested an intelligent system for diabetic patient tracking based on machine learning (ML). The architectural components comprised smartphones, sensors, and smart devices, to gather body dimensions. Then, the preprocessed data is normalized using the normalization procedure. To extract features, we use linear discriminant analysis (LDA). To establish a diagnosis, the intelligent system conducted data classification utilizing the suggested advanced-spatial-vector-based Random Forest (ASV-RF) in conjunction with particle swarm optimization (PSO). RESULTS Compared to other techniques, the simulation's outcomes demonstrate that the suggested approach offers greater accuracy.
Collapse
Affiliation(s)
- Sindhu P. Menon
- School of Computing and Information Technology, Reva University, Bangalore 560064, Karnataka, India
| | - Prashant Kumar Shukla
- Department of Computer Science and Engineering, Koneru Lakshmaiah Education Foundation, Vaddeswaram, Guntur 522302, Andhra Pradesh, India
| | - Priyanka Sethi
- Department of Physiotherapy, Faculty of Allied Health Sciences, Manav Rachna International Institute of Research & Studies, Faridabad 121004, Haryana, India
| | - Areej Alasiry
- College of Computer Science, King Khalid University, Abha 61413, Saudi Arabia
| | - Mehrez Marzougui
- College of Computer Science, King Khalid University, Abha 61413, Saudi Arabia
| | | | - Arfat Ahmad Khan
- Department of Computer Science, College of Computing, Khon Kaen University, Khon Kaen 40002, Thailand
| |
Collapse
|
7
|
Naveena S, Bharathi A. Weighted entropy deep features on hybrid RNN with LSTM for glucose level and diabetes prediction. Comput Methods Biomech Biomed Engin 2022; 26:1-25. [PMID: 36448678 DOI: 10.1080/10255842.2022.2149263] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2021] [Accepted: 11/15/2022] [Indexed: 12/05/2022]
Abstract
Glucose level regulation with essential advice regarding diabetes must be provided to the patients to maintain their diet for diabetes treatment. Therefore, the academic community has focused on implementing novel glucose prediction techniques for decision support systems. Recent computational techniques for diagnosing diabetes have certain limitations, and also they are not evaluated under various datasets obtained from the different people of various countries. This generates inefficiency in the prediction systems to apply it in real-time applications. This paper plans to suggest a hybrid deep learning model for diabetes prediction and glucose level classification. Two benchmark datasets are used in the data collection process for experimenting. Initially, the deep selected features were extracted by the Convolutional Neural Network (CNN). Further, weighted entropy deep features are extracted, where the tuning of weight is taken place by the Modified Escaping Energy-based Harris Hawks Optimization. These features are processed in the glucose level classification using the modified Fuzzy classifier for classifying the high-level and low-level glucose. Further, glucose prediction is done by the Hybrid Recurrent Neural Network (RNN), and Long Short Term Memory (LSTM) termed R-LSTM with parameter optimization. From the experimental result, In the dataset 2 analyses on SMAPE, the MEE-HHO-R-LSTM is 12.5%, 87.5%, 50%, 12.5%, and 2.5% better than SVM, LSTM, DNN, RNN, and RNN-LSTM, at the learning percentage of 75%. The analytical results enforce that the suggested methods attain enhanced prediction performance concerning the evaluation metrics compared to conventional prediction models.
Collapse
Affiliation(s)
- Somasundaram Naveena
- Assistant Professor Senior Grade, Information Technology, Bannari Amman Institute of Technology, Sathyamangalam, India
| | - Ayyasamy Bharathi
- Professor, Information Technology, Bannari Amman Institute of Technology, Sathyamangalam, India
| |
Collapse
|
8
|
Rehman A, Abbas S, Khan MA, Ghazal TM, Adnan KM, Mosavi A. A secure healthcare 5.0 system based on blockchain technology entangled with federated learning technique. Comput Biol Med 2022; 150:106019. [PMID: 36162198 DOI: 10.1016/j.compbiomed.2022.106019] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2022] [Revised: 08/04/2022] [Accepted: 08/20/2022] [Indexed: 11/22/2022]
Abstract
In recent years, the global Internet of Medical Things (IoMT) industry has evolved at a tremendous speed. Security and privacy are key concerns on the IoMT, owing to the huge scale and deployment of IoMT networks. Machine learning (ML) and blockchain (BC) technologies have significantly enhanced the capabilities and facilities of healthcare 5.0, spawning a new area known as "Smart Healthcare." By identifying concerns early, a smart healthcare system can help avoid long-term damage. This will enhance the quality of life for patients while reducing their stress and healthcare costs. The IoMT enables a range of functionalities in the field of information technology, one of which is smart and interactive health care. However, combining medical data into a single storage location to train a powerful machine learning model raises concerns about privacy, ownership, and compliance with greater concentration. Federated learning (FL) overcomes the preceding difficulties by utilizing a centralized aggregate server to disseminate a global learning model. Simultaneously, the local participant keeps control of patient information, assuring data confidentiality and security. This article conducts a comprehensive analysis of the findings on blockchain technology entangled with federated learning in healthcare. 5.0. The purpose of this study is to construct a secure health monitoring system in healthcare 5.0 by utilizing a blockchain technology and Intrusion Detection System (IDS) to detect any malicious activity in a healthcare network and enables physicians to monitor patients through medical sensors and take necessary measures periodically by predicting diseases. The proposed system demonstrates that the approach is optimized effectively for healthcare monitoring. In contrast, the proposed healthcare 5.0 system entangled with FL Approach achieves 93.22% accuracy for disease prediction, and the proposed RTS-DELM-based secure healthcare 5.0 system achieves 96.18% accuracy for the estimation of intrusion detection.
Collapse
Affiliation(s)
- Abdur Rehman
- School of Computer Science, National College of Business Administration and Economics, Lahore, 54000, Pakistan.
| | - Sagheer Abbas
- School of Computer Science, National College of Business Administration and Economics, Lahore, 54000, Pakistan.
| | - M A Khan
- Riphah School of Computing and Innovation, Faculty of Computing, Riphah International University, Lahore Campus, Lahore, 54000, Pakistan.
| | - Taher M Ghazal
- School of Information Technology, Skyline University College, University City Sharjah, 1797, Sharjah, United Arab Emirates; Center for Cyber Security, Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia (UKM), 43600, Bangi, Selangor, Malaysia.
| | - Khan Muhammad Adnan
- Department of Software, Gachon University, Seongnam, 13120, Republic of Korea.
| | - Amir Mosavi
- Institute of Information Engineering, Automation and Mathematics, Slovak University of Technology in Bratislava, 81107 Bratislava, Slovakia; John von Neumann Faculty of Informatics, Obuda University, 1034, Budapest, Hungary; Faculty of Civil Engineering, TU-Dresden, 01062, Dresden, Germany.
| |
Collapse
|
9
|
Hybrid stacked ensemble combined with genetic algorithms for diabetes prediction. IRAN JOURNAL OF COMPUTER SCIENCE 2022. [PMCID: PMC8935256 DOI: 10.1007/s42044-022-00100-1] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Diabetes is currently one of the most common, dangerous, and costly diseases globally caused by increased blood sugar or a decrease in insulin in the body. Diabetes can have detrimental effects on people’s health if diagnosed late. Today, diabetes has become one of the challenges for health and government officials. Prevention is a priority, and taking care of people’s health without compromising their comfort is an essential need. In this study, the ensemble training methodology based on genetic algorithms was used to diagnose and predict the outcomes of diabetes mellitus accurately. This study uses the experimental data, actual data on Indian diabetics on the University of California website. Current developments in ICT, such as the Internet of Things, machine learning, and data mining, allow us to provide health strategies with more intelligent capabilities to accurately predict the outcomes of the disease in daily life and the hospital and prevent the progression of this disease and its many complications. The results show the high performance of the proposed method in diagnosing the disease, which has reached 98.8%, and 99% accuracy in this study.
Collapse
|
10
|
Mishra S, Shaw K, Mishra D, Patil S, Kotecha K, Kumar S, Bajaj S. Improving the Accuracy of Ensemble Machine Learning Classification Models Using a Novel Bit-Fusion Algorithm for Healthcare AI Systems. Front Public Health 2022; 10:858282. [PMID: 35602150 PMCID: PMC9114677 DOI: 10.3389/fpubh.2022.858282] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2022] [Accepted: 03/15/2022] [Indexed: 12/11/2022] Open
Abstract
Healthcare AI systems exclusively employ classification models for disease detection. However, with the recent research advances into this arena, it has been observed that single classification models have achieved limited accuracy in some cases. Employing fusion of multiple classifiers outputs into a single classification framework has been instrumental in achieving greater accuracy and performing automated big data analysis. The article proposes a bit fusion ensemble algorithm that minimizes the classification error rate and has been tested on various datasets. Five diversified base classifiers k- nearest neighbor (KNN), Support Vector Machine (SVM), Multi-Layer Perceptron (MLP), Decision Tree (D.T.), and Naïve Bayesian Classifier (N.B.), are used in the implementation model. Bit fusion algorithm works on the individual input from the classifiers. Decision vectors of the base classifier are weighted transformed into binary bits by comparing with high-reliability threshold parameters. The output of each base classifier is considered as soft class vectors (CV). These vectors are weighted, transformed and compared with a high threshold value of initialized δ = 0.9 for reliability. Binary patterns are extracted, and the model is trained and tested again. The standard fusion approach and proposed bit fusion algorithm have been compared by average error rate. The error rate of the Bit-fusion algorithm has been observed with the values 5.97, 12.6, 4.64, 0, 0, 27.28 for Leukemia, Breast cancer, Lung Cancer, Hepatitis, Lymphoma, Embryonal Tumors, respectively. The model is trained and tested over datasets from UCI, UEA, and UCR repositories as well which also have shown reduction in the error rates.
Collapse
Affiliation(s)
- Sashikala Mishra
- Symbiosis Institute of Technology, Symbiosis International University, Pune, India
| | - Kailash Shaw
- Symbiosis Institute of Technology, Symbiosis International University, Pune, India
| | - Debahuti Mishra
- Department of Computer Science and Engineering, Siksha O Anusandhan Deemed to be University, Bhubaneshwar, India
| | - Shruti Patil
- Symbiosis Centre for Applied Artificial Intelligence (SCAAI), Symbiosis Institute of Technology, Symbiosis International (Deemed University), Pune, India
| | - Ketan Kotecha
- Symbiosis Centre for Applied Artificial Intelligence (SCAAI), Symbiosis Institute of Technology, Symbiosis International (Deemed University), Pune, India
| | - Satish Kumar
- Symbiosis Centre for Applied Artificial Intelligence (SCAAI), Symbiosis Institute of Technology, Symbiosis International (Deemed University), Pune, India
| | - Simi Bajaj
- School of Computer Data and Mathematical Sciences, University of Western Sydney, Sydney, NSW, Australia
| |
Collapse
|
11
|
Rehman AU, Haroon HM, Malik JA, Saeed A, Ali A, Ateeq K. Fuzzy System For Covid-19 Disease Detection. 2022 INTERNATIONAL CONFERENCE ON BUSINESS ANALYTICS FOR TECHNOLOGY AND SECURITY (ICBATS) 2022. [DOI: 10.1109/icbats54253.2022.9759014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/02/2023]
Affiliation(s)
- Abd Ur Rehman
- Riphah International university,Department of Computer Science,Lahore,Pakistan
| | | | | | - Afia Saeed
- Agriculture University,Department of Computer Science,Faislabad,Pakistan
| | - Asad Ali
- NCBA&E,School of Computer Science,Lahore,Pakistan
| | | |
Collapse
|
12
|
Fregoso-Aparicio L, Noguez J, Montesinos L, García-García JA. Machine learning and deep learning predictive models for type 2 diabetes: a systematic review. Diabetol Metab Syndr 2021; 13:148. [PMID: 34930452 PMCID: PMC8686642 DOI: 10.1186/s13098-021-00767-9] [Citation(s) in RCA: 27] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/06/2021] [Accepted: 12/07/2021] [Indexed: 12/12/2022] Open
Abstract
Diabetes Mellitus is a severe, chronic disease that occurs when blood glucose levels rise above certain limits. Over the last years, machine and deep learning techniques have been used to predict diabetes and its complications. However, researchers and developers still face two main challenges when building type 2 diabetes predictive models. First, there is considerable heterogeneity in previous studies regarding techniques used, making it challenging to identify the optimal one. Second, there is a lack of transparency about the features used in the models, which reduces their interpretability. This systematic review aimed at providing answers to the above challenges. The review followed the PRISMA methodology primarily, enriched with the one proposed by Keele and Durham Universities. Ninety studies were included, and the type of model, complementary techniques, dataset, and performance parameters reported were extracted. Eighteen different types of models were compared, with tree-based algorithms showing top performances. Deep Neural Networks proved suboptimal, despite their ability to deal with big and dirty data. Balancing data and feature selection techniques proved helpful to increase the model's efficiency. Models trained on tidy datasets achieved almost perfect models.
Collapse
Affiliation(s)
- Luis Fregoso-Aparicio
- School of Engineering and Sciences, Tecnologico de Monterrey, Av Lago de Guadalupe KM 3.5, Margarita Maza de Juarez, 52926 Cd Lopez Mateos, Mexico
| | - Julieta Noguez
- School of Engineering and Sciences, Tecnologico de Monterrey, Ave. Eugenio Garza Sada 2501, 64849 Monterrey, Nuevo Leon Mexico
| | - Luis Montesinos
- School of Engineering and Sciences, Tecnologico de Monterrey, Ave. Eugenio Garza Sada 2501, 64849 Monterrey, Nuevo Leon Mexico
| | - José A. García-García
- Hospital General de Mexico Dr. Eduardo Liceaga, Dr. Balmis 148, Doctores, Cuauhtemoc, 06720 Mexico City, Mexico
| |
Collapse
|