1
|
Saleem MA, Javeed A, Akarathanawat W, Chutinet A, Suwanwela NC, Kaewplung P, Chaitusaney S, Deelertpaiboon S, Srisiri W, Benjapolakul W. An intelligent learning system based on electronic health records for unbiased stroke prediction. Sci Rep 2024; 14:23052. [PMID: 39367027 PMCID: PMC11452373 DOI: 10.1038/s41598-024-73570-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2024] [Accepted: 09/18/2024] [Indexed: 10/06/2024] Open
Abstract
Stroke has a negative impact on people's lives and is one of the leading causes of death and disability worldwide. Early detection of symptoms can significantly help predict stroke and promote a healthy lifestyle. Researchers have developed several methods to predict strokes using machine learning (ML) techniques. However, the proposed systems have suffered from the following two main problems. The first problem is that the machine learning models are biased due to the uneven distribution of classes in the dataset. Recent research has not adequately addressed this problem, and no preventive measures have been taken. Synthetic Minority Oversampling (SMOTE) has been used to remove bias and balance the training of the proposed ML model. The second problem is to solve the problem of lower classification accuracy of machine learning models. We proposed a learning system that combines an autoencoder with a linear discriminant analysis (LDA) model to increase the accuracy of the proposed ML model for stroke prediction. Relevant features are extracted from the feature space using the autoencoder, and the extracted subset is then fed into the LDA model for stroke classification. The hyperparameters of the LDA model are found using a grid search strategy. However, the conventional accuracy metric does not truly reflect the performance of ML models. Therefore, we employed several evaluation metrics to validate the efficiency of the proposed model. Consequently, we evaluated the proposed model's accuracy, sensitivity, specificity, area under the curve (AUC), and receiver operator characteristic (ROC). The experimental results show that the proposed model achieves a sensitivity and specificity of 98.51% and 97.56%, respectively, with an accuracy of 99.24% and a balanced accuracy of 98.00%.
Collapse
Affiliation(s)
- Muhammad Asim Saleem
- Center of Excellence in Artificial Intelligence, Machine Learning and Smart Grid Technology, Department of Electrical Engineering, Faculty of Engineering, Chulalongkorn University, Bangkok, 10330, Thailand
| | - Ashir Javeed
- Aging Research Center, Karolinska Institutet, 171 65, Stockholm, Sweden
| | - Wasan Akarathanawat
- Division of Neurology, Department of Medicine, Faculty of Medicine, Chulalongkorn University, Bangkok, 10330, Thailand
- Chulalongkorn Stroke Center, King Chulalongkorn Memorial Hospital, Thai Red Cross Society, Bangkok, 10330, Thailand
- Chula Neuroscience Center, King Chulalongkorn Memorial Hospital, Thai Red Cross Society, Bangkok, 10330, Thailand
| | - Aurauma Chutinet
- Division of Neurology, Department of Medicine, Faculty of Medicine, Chulalongkorn University, Bangkok, 10330, Thailand
- Chulalongkorn Stroke Center, King Chulalongkorn Memorial Hospital, Thai Red Cross Society, Bangkok, 10330, Thailand
- Chula Neuroscience Center, King Chulalongkorn Memorial Hospital, Thai Red Cross Society, Bangkok, 10330, Thailand
| | - Nijasri Charnnarong Suwanwela
- Division of Neurology, Department of Medicine, Faculty of Medicine, Chulalongkorn University, Bangkok, 10330, Thailand
- Chulalongkorn Stroke Center, King Chulalongkorn Memorial Hospital, Thai Red Cross Society, Bangkok, 10330, Thailand
- Chula Neuroscience Center, King Chulalongkorn Memorial Hospital, Thai Red Cross Society, Bangkok, 10330, Thailand
| | - Pasu Kaewplung
- Center of Excellence in Artificial Intelligence, Machine Learning and Smart Grid Technology, Department of Electrical Engineering, Faculty of Engineering, Chulalongkorn University, Bangkok, 10330, Thailand.
| | - Surachai Chaitusaney
- Center of Excellence in Artificial Intelligence, Machine Learning and Smart Grid Technology, Department of Electrical Engineering, Faculty of Engineering, Chulalongkorn University, Bangkok, 10330, Thailand
| | - Sunchai Deelertpaiboon
- Center of Excellence in Artificial Intelligence, Machine Learning and Smart Grid Technology, Department of Electrical Engineering, Faculty of Engineering, Chulalongkorn University, Bangkok, 10330, Thailand
| | - Wattanasak Srisiri
- Center of Excellence in Artificial Intelligence, Machine Learning and Smart Grid Technology, Department of Electrical Engineering, Faculty of Engineering, Chulalongkorn University, Bangkok, 10330, Thailand
| | - Watit Benjapolakul
- Center of Excellence in Artificial Intelligence, Machine Learning and Smart Grid Technology, Department of Electrical Engineering, Faculty of Engineering, Chulalongkorn University, Bangkok, 10330, Thailand.
| |
Collapse
|
2
|
Shareef SKK, Chaitanya RK, Chennupalli S, Chokkakula D, Kiran KVD, Pamula U, Vatambeti R. Enhanced botnet detection in IoT networks using zebra optimization and dual-channel GAN classification. Sci Rep 2024; 14:17148. [PMID: 39060369 PMCID: PMC11282287 DOI: 10.1038/s41598-024-67865-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2024] [Accepted: 07/16/2024] [Indexed: 07/28/2024] Open
Abstract
The Internet of Things (IoT) permeates various sectors, including healthcare, smart cities, and agriculture, alongside critical infrastructure management. However, its susceptibility to malware due to limited processing power and security protocols poses significant challenges. Traditional antimalware solutions fall short in combating evolving threats. To address this, the research work developed a feature selection-based classification model. At first stage, a preprocessing stage enhances dataset quality through data smoothing and consistency improvement. Feature selection via the Zebra Optimization Algorithm (ZOA) reduces dimensionality, while a classification phase integrates the Graph Attention Network (GAN), specifically the Dual-channel GAN (DGAN). DGAN incorporates Node Attention Networks and Semantic Attention Networks to capture intricate IoT device interactions and detect anomalous behaviors like botnet activity. The model's accuracy is further boosted by leveraging both structural and semantic data with the Sooty Tern Optimization Algorithm (STOA) for hyperparameter tuning. The proposed STOA-DGAN model achieves an impressive 99.87% accuracy in botnet activity classification, showcasing robustness and reliability compared to existing approaches.
Collapse
Affiliation(s)
- S K Khaja Shareef
- Department of Computer Science & Information Technology, Koneru Lakshmaiah Education Foundation, Hyderabad, India
| | | | | | - Devi Chokkakula
- Department of Computer Science and Engineering, Nadimpalli Satyanarayana Raju Institute of Technology, Visakhapatnam, 531173, India
| | - K V D Kiran
- Department of Computer Science and Engineering, Koneru Lakshmaiah Education Foundation, Vaddeswaram, India
| | - Udayaraju Pamula
- Department of Computer Science and Engineering, SRM University-AP, Amaravati, 522502, India
| | - Ramesh Vatambeti
- School of Computer Science and Engineering, VIT-AP University, Vijayawada, 522237, India.
| |
Collapse
|
3
|
Guo G, Liu P, Zheng Y. Early energy performance analysis of smart buildings by consolidated artificial neural network paradigms. Heliyon 2024; 10:e25848. [PMID: 38404842 PMCID: PMC10884448 DOI: 10.1016/j.heliyon.2024.e25848] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2023] [Revised: 02/02/2024] [Accepted: 02/04/2024] [Indexed: 02/27/2024] Open
Abstract
The assessment of energy performance in smart buildings has emerged as a prominent area of research driven by the increasing energy consumption trends worldwide. Analyzing the attributes of buildings using optimized machine learning models has been a highly effective approach for estimating the cooling load (CL) and heating load (HL) of the buildings. In this study, an artificial neural network (ANN) is used as the basic predictor that undergoes optimization using five metaheuristic algorithms, namely coati optimization algorithm (COA), gazelle optimization algorithm (GOA), incomprehensible but intelligible-in-time logics (IbIL), osprey optimization algorithm (OOA), and sooty tern optimization algorithm (STOA) to predict the CL and HL of a residential building. The models are trained and tested via an Energy Efficiency dataset (downloaded from UCI Repository). A score-based ranking system is built upon three accuracy evaluators including mean absolute percentage error (MAPE), root mean square error (RMSE), and percentage-Pearson correlation coefficient (PPCC) to compare the prediction accuracy of the models. Referring to the results, all models demonstrated high accuracy (e.g., PPCCs >89%) for predicting both CL and HL. However, the calculated final scores of the models (43, 20, 39, 38, and 10 in HL prediction and 36, 20, 42, 42, and 10 in CL prediction for the STOA, OOA, IbIL, GOA, and COA, respectively) indicated that the GOA, IbIL, and STOA perform better than COA and OOA. Moreover, a comparison with various algorithms used in earlier literature showed that the GOA, IbIL, and STOA provide a more accurate solution. Therefore, the use of ANN optimized by these three algorithms is recommended for practical early forecast of energy performance in buildings and optimizing the design of energy systems.
Collapse
Affiliation(s)
- Guoqing Guo
- Xi'an Jiaotong-liverpool University, Xi'an, Shannxi, 215123, China
| | - Peng Liu
- Xi'an Jiaotong-liverpool University, Xi'an, Shannxi, 215123, China
| | - Yuchen Zheng
- Chenyu Technology (Wuhan) Co., LTD, Wuhan, Hubei, 430074, China
| |
Collapse
|
4
|
Javeed A, Anderberg P, Ghazi AN, Noor A, Elmståhl S, Berglund JS. Breaking barriers: a statistical and machine learning-based hybrid system for predicting dementia. Front Bioeng Biotechnol 2024; 11:1336255. [PMID: 38260734 PMCID: PMC10801181 DOI: 10.3389/fbioe.2023.1336255] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2023] [Accepted: 12/05/2023] [Indexed: 01/24/2024] Open
Abstract
Introduction: Dementia is a condition (a collection of related signs and symptoms) that causes a continuing deterioration in cognitive function, and millions of people are impacted by dementia every year as the world population continues to rise. Conventional approaches for determining dementia rely primarily on clinical examinations, analyzing medical records, and administering cognitive and neuropsychological testing. However, these methods are time-consuming and costly in terms of treatment. Therefore, this study aims to present a noninvasive method for the early prediction of dementia so that preventive steps should be taken to avoid dementia. Methods: We developed a hybrid diagnostic system based on statistical and machine learning (ML) methods that used patient electronic health records to predict dementia. The dataset used for this study was obtained from the Swedish National Study on Aging and Care (SNAC), with a sample size of 43040 and 75 features. The newly constructed diagnostic extracts a subset of useful features from the dataset through a statistical method (F-score). For the classification, we developed an ensemble voting classifier based on five different ML models: decision tree (DT), naive Bayes (NB), logistic regression (LR), support vector machines (SVM), and random forest (RF). To address the problem of ML model overfitting, we used a cross-validation approach to evaluate the performance of the proposed diagnostic system. Various assessment measures, such as accuracy, sensitivity, specificity, receiver operating characteristic (ROC) curve, and Matthew's correlation coefficient (MCC), were used to thoroughly validate the devised diagnostic system's efficiency. Results: According to the experimental results, the proposed diagnostic method achieved the best accuracy of 98.25%, as well as sensitivity of 97.44%, specificity of 95.744%, and MCC of 0.7535. Discussion: The effectiveness of the proposed diagnostic approach is compared to various cutting-edge feature selection techniques and baseline ML models. From experimental results, it is evident that the proposed diagnostic system outperformed the prior feature selection strategies and baseline ML models regarding accuracy.
Collapse
Affiliation(s)
- Ashir Javeed
- Department of Health, Blekinge Institute of Technology, Karlskrona, Sweden
| | - Peter Anderberg
- Department of Health, Blekinge Institute of Technology, Karlskrona, Sweden
- School of Health Sciences, University of Skövde, Skövde, Sweden
| | - Ahmad Nauman Ghazi
- Department of Software Engineering, Blekinge Institute of Technology, Karlskrona, Sweden
| | - Adeeb Noor
- Department of Information Technology, Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah, Saudi Arabia
| | - Sölve Elmståhl
- EpiHealth: Epidemiology for Health, Lund University, SUS Malmö, Malmö, Sweden
| | | |
Collapse
|