1
|
Sembiring I, Wahyuni SN, Sediyono E. LSTM algorithm optimization for COVID-19 prediction model. Heliyon 2024; 10:e26158. [PMID: 38440291 PMCID: PMC10909716 DOI: 10.1016/j.heliyon.2024.e26158] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2023] [Revised: 01/29/2024] [Accepted: 02/08/2024] [Indexed: 03/06/2024] Open
Abstract
The development of predictive models for infectious diseases, specifically COVID-19, is an important step in early control efforts to reduce the mortality rate. However, traditional time series prediction models used to analyze the disease spread trends often encounter challenges related to accuracy, necessitating the need to develop prediction models with enhanced accuracy. Therefore, this research aimed to develop a prediction model based on the Long Short-Term Memory (LSTM) networks to better predict the number of confirmed COVID-19 cases. The proposed optimized LSTM (popLSTM) model was compared with Basic LSTM and improved MinMaxScaler developed earlier using COVID-19 dataset taken from previous research. The dataset was collected from four countries with a high daily increase in confirmed cases, including Hong Kong, South Korea, Italy, and Indonesia. The results showed significantly improved accuracy in the optimized model compared to the previous research methods. The contributions of popLSTM included 1) Incorporating the output results on the output gate to effectively filter more detailed information compared to the previous model, and 2) Reducing the error value by considering the hidden state on the output gate to improve accuracy. popLSTM in this experiment exhibited a significant 4% increase in accuracy.
Collapse
Affiliation(s)
- Irwan Sembiring
- Satya Wacana Christian University, 50711, Salatiga, Indonesia
| | | | - Eko Sediyono
- Satya Wacana Christian University, 50711, Salatiga, Indonesia
| |
Collapse
|
2
|
Martinez GS, Ostadgavahi AT, Al-Rafat AM, Garduno A, Cusack R, Bermejo-Martin JF, Martin-Loeches I, Kelvin D. Model-interpreted outcomes of artificial neural networks classifying immune biomarkers associated with severe infections in ICU. Front Immunol 2023; 14:1137850. [PMID: 36969221 PMCID: PMC10034398 DOI: 10.3389/fimmu.2023.1137850] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2023] [Accepted: 02/27/2023] [Indexed: 03/11/2023] Open
Abstract
IntroductionMillions of deaths worldwide are a result of sepsis (viral and bacterial) and septic shock syndromes which originate from microbial infections and cause a dysregulated host immune response. These diseases share both clinical and immunological patterns that involve a plethora of biomarkers that can be quantified and used to explain the severity level of the disease. Therefore, we hypothesize that the severity of sepsis and septic shock in patients is a function of the concentration of biomarkers of patients.MethodsIn our work, we quantified data from 30 biomarkers with direct immune function. We used distinct Feature Selection algorithms to isolate biomarkers to be fed into machine learning algorithms, whose mapping of the decision process would allow us to propose an early diagnostic tool.ResultsWe isolated two biomarkers, i.e., Programmed Death Ligand-1 and Myeloperoxidase, that were flagged by the interpretation of an Artificial Neural Network. The upregulation of both biomarkers was indicated as contributing to increase the severity level in sepsis (viral and bacterial induced) and septic shock patients.DiscussionIn conclusion, we built a function considering biomarker concentrations to explain severity among sepsis, sepsis COVID, and septic shock patients. The rules of this function include biomarkers with known medical, biological, and immunological activity, favoring the development of an early diagnosis system based in knowledge extracted from artificial intelligence.
Collapse
Affiliation(s)
- Gustavo Sganzerla Martinez
- Laboratory of Emerging Infectious Diseases, Department of Immunology and Microbiology, Dalhousie University, Halifax, NS, Canada
- Department of Pediatrics, Izaak Walton Killan (IWK) Health Center, CCfV, Halifax, NS, Canada
- *Correspondence: David Kelvin, ; Gustavo Sganzerla Martinez,
| | - Ali Toloue Ostadgavahi
- Laboratory of Emerging Infectious Diseases, Department of Immunology and Microbiology, Dalhousie University, Halifax, NS, Canada
- Department of Pediatrics, Izaak Walton Killan (IWK) Health Center, CCfV, Halifax, NS, Canada
| | - Abdullah Mahmud Al-Rafat
- Laboratory of Emerging Infectious Diseases, Department of Immunology and Microbiology, Dalhousie University, Halifax, NS, Canada
- Department of Pediatrics, Izaak Walton Killan (IWK) Health Center, CCfV, Halifax, NS, Canada
| | - Alexis Garduno
- Department of Clinical Medicine, Trinity College, University of Dublin, Dublin, Ireland
| | - Rachael Cusack
- Department of Clinical Medicine, Trinity College, University of Dublin, Dublin, Ireland
| | - Jesus Francisco Bermejo-Martin
- Instituto de Investigación Biomédica de Salamanca (IBSAL), Gerencia Regional de Salud de Castilla y León, Paseo de San Vicente, Salamanca, Spain
- Universidad de Salamanca, C. Alfonso X el Sabio, s/n, Salamanca, Spain
- Centro de Investigación Biomédica en Red en Enfermedades Respiratorias (CIBERES), CB22/06/00035, Instituto de Salud Carlos III, Avenida de Monforte de Lemos, Madrid, Spain
| | | | - David Kelvin
- Laboratory of Emerging Infectious Diseases, Department of Immunology and Microbiology, Dalhousie University, Halifax, NS, Canada
- Department of Pediatrics, Izaak Walton Killan (IWK) Health Center, CCfV, Halifax, NS, Canada
- *Correspondence: David Kelvin, ; Gustavo Sganzerla Martinez,
| |
Collapse
|
3
|
Alam Suha S, Islam MN. Exploring the Dominant Features and Data-driven Detection of Polycystic Ovary Syndrome through Modified Stacking Ensemble Machine Learning Technique. Heliyon 2023; 9:e14518. [PMID: 36994397 PMCID: PMC10040521 DOI: 10.1016/j.heliyon.2023.e14518] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2022] [Revised: 01/09/2023] [Accepted: 03/09/2023] [Indexed: 03/18/2023] Open
Abstract
Polycystic ovary syndrome (PCOS) is the most frequent endocrinological anomaly in reproductive women that causes persistent hormonal secretion disruption, leading to the formation of numerous cysts within the ovaries and serious health complications. But the real-world clinical detection technique for PCOS is very critical since the accuracy of interpretations being substantially dependent on the physician's expertise. Thus, an artificially intelligent PCOS prediction model might be a feasible additional technique to the error prone and time-consuming diagnostic technique. In this study, a modified ensemble machine learning (ML) classification approach is proposed utilizing state-of-the-art stacking technique for PCOS identification with patients' symptom data; employing five traditional ML models as base learners and then one bagging or boosting ensemble ML model as the meta-learner of the stacked model. Furthermore, three distinct types of feature selection strategies are applied to pick different sets of features with varied numbers and combinations of attributes. To evaluate and explore the dominant features necessary for predicting PCOS, the proposed technique with five variety of models and other ten types of classifiers is trained, tested and assessed utilizing different feature sets. As outcomes, the proposed stacking ensemble technique significantly enhances the accuracy in comparison to the other existing ML based techniques in case of all varieties of feature sets. However, among various models investigated to categorize PCOS and non-PCOS patients, the stacking ensemble model with 'Gradient Boosting' classifier as meta learner outperforms others with 95.7% accuracy while utilizing the top 25 features selected using Principal Component Analysis (PCA) feature selection technique.
Collapse
|
4
|
Majeed MA, Shafri HZM, Zulkafli Z, Wayayok A. A Deep Learning Approach for Dengue Fever Prediction in Malaysia Using LSTM with Spatial Attention. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2023; 20:4130. [PMID: 36901139 PMCID: PMC10002017 DOI: 10.3390/ijerph20054130] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/16/2023] [Revised: 02/19/2023] [Accepted: 02/21/2023] [Indexed: 06/18/2023]
Abstract
This research aims to predict dengue fever cases in Malaysia using machine learning techniques. A dataset consisting of weekly dengue cases at the state level in Malaysia from 2010 to 2016 was obtained from the Malaysia Open Data website and includes variables such as climate, geography, and demographics. Six different long short-term memory (LSTM) models were developed and compared for dengue prediction in Malaysia: LSTM, stacked LSTM (S-LSTM), LSTM with temporal attention (TA-LSTM), S-LSTM with temporal attention (STA-LSTM), LSTM with spatial attention (SA-LSTM), and S-LSTM with spatial attention (SSA-LSTM). The models were trained and evaluated on a dataset of monthly dengue cases in Malaysia from 2010 to 2016, with the task of predicting the number of dengue cases based on various climate, topographic, demographic, and land-use variables. The SSA-LSTM model, which used both stacked LSTM layers and spatial attention, performed the best, with an average root mean squared error (RMSE) of 3.17 across all lookback periods. When compared to three benchmark models (SVM, DT, ANN), the SSA-LSTM model had a significantly lower average RMSE. The SSA-LSTM model also performed well in different states in Malaysia, with RMSE values ranging from 2.91 to 4.55. When comparing temporal and spatial attention models, the spatial models generally performed better at predicting dengue cases. The SSA-LSTM model was also found to perform well at different prediction horizons, with the lowest RMSE at 4- and 5-month lookback periods. Overall, the results suggest that the SSA-LSTM model is effective at predicting dengue cases in Malaysia.
Collapse
Affiliation(s)
- Mokhalad A. Majeed
- Department of Civil Engineering, Faculty of Engineering, Universiti Putra Malaysia (UPM), Serdang 43400, Selangor, Malaysia
| | - Helmi Zulhaidi Mohd Shafri
- Department of Civil Engineering, Faculty of Engineering, Universiti Putra Malaysia (UPM), Serdang 43400, Selangor, Malaysia
- Geospatial Information Science Research Centre (GISRC), Faculty of Engineering, Universiti Putra Malaysia (UPM), Serdang 43400, Selangor, Malaysia
| | - Zed Zulkafli
- Department of Civil Engineering, Faculty of Engineering, Universiti Putra Malaysia (UPM), Serdang 43400, Selangor, Malaysia
| | - Aimrun Wayayok
- Department of Biological and Agricultural Engineering, Faculty of Engineering, Universiti Putra Malaysia (UPM), Serdang 43400, Selangor, Malaysia
| |
Collapse
|
5
|
Siamba S, Otieno A, Koech J. Application of ARIMA, and hybrid ARIMA Models in predicting and forecasting tuberculosis incidences among children in Homa Bay and Turkana Counties, Kenya. PLOS DIGITAL HEALTH 2023; 2:e0000084. [PMID: 36812585 PMCID: PMC9931286 DOI: 10.1371/journal.pdig.0000084] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/07/2022] [Accepted: 12/15/2022] [Indexed: 02/04/2023]
Abstract
Tuberculosis (TB) infections among children (below 15 years) is a growing concern, particularly in resource-limited settings. However, the TB burden among children is relatively unknown in Kenya where two-thirds of estimated TB cases are undiagnosed annually. Very few studies have used Autoregressive Integrated Moving Average (ARIMA), and hybrid ARIMA models to model infectious diseases globally. We applied ARIMA, and hybrid ARIMA models to predict and forecast TB incidences among children in Homa Bay and Turkana Counties in Kenya. The ARIMA, and hybrid models were used to predict and forecast monthly TB cases reported in the Treatment Information from Basic Unit (TIBU) system by health facilities in Homa Bay and Turkana Counties between 2012 and 2021. The best parsimonious ARIMA model that minimizes errors was selected based on a rolling window cross-validation procedure. The hybrid ARIMA-ANN model produced better predictive and forecast accuracy compared to the Seasonal ARIMA (0,0,1,1,0,1,12) model. Furthermore, using the Diebold-Mariano (DM) test, the predictive accuracy of ARIMA-ANN versus ARIMA (0,0,1,1,0,1,12) model were significantly different, p<0.001, respectively. The forecasts showed a TB incidence of 175 TB cases per 100,000 (161 to 188 TB incidences per 100,000 population) children in Homa Bay and Turkana Counties in 2022. The hybrid (ARIMA-ANN) model produces better predictive and forecast accuracy compared to the single ARIMA model. The findings show evidence that the incidence of TB among children below 15 years in Homa Bay and Turkana Counties is significantly under-reported and is potentially higher than the national average.
Collapse
Affiliation(s)
- Stephen Siamba
- University of Eldoret, School of Science, Department of Mathematics and Computer Science, Eldoret, Kenya
- * E-mail:
| | - Argwings Otieno
- University of Eldoret, School of Science, Department of Mathematics and Computer Science, Eldoret, Kenya
| | - Julius Koech
- University of Eldoret, School of Science, Department of Mathematics and Computer Science, Eldoret, Kenya
| |
Collapse
|
6
|
Predicting infectious disease for biopreparedness and response: A systematic review of machine learning and deep learning approaches. One Health 2022; 15:100439. [PMID: 36277100 PMCID: PMC9582566 DOI: 10.1016/j.onehlt.2022.100439] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2022] [Revised: 09/20/2022] [Accepted: 09/30/2022] [Indexed: 11/21/2022] Open
Abstract
The complex, unpredictable nature of pathogen occurrence has required substantial efforts to accurately predict infectious diseases (IDs). With rising popularity of Machine Learning (ML) and Deep Learning (DL) techniques combined with their unique ability to uncover connections between large amounts of diverse data, we conducted a PRISMA systematic review to investigate advances in ID prediction for human and animal diseases using ML and DL. This review included the type of IDs modeled, ML and DL techniques utilized, geographical distribution, prediction tasks performed, input features utilized, spatial and temporal scales, error metrics used, computational efficiency, uncertainty quantification, and missing data handling methods. Among 237 relevant articles published between January 2001 and May 2021, highly contagious diseases in humans were most often represented, including COVID-19 (37.1%), influenza/influenza-like illnesses (9.3%), dengue (8.9%), and malaria (5.1%). Out of 37 diseases identified, 51.4% were zoonotic, 37.8% were human-only, and 8.1% were animal-only, with only 1.6% economically significant, non-zoonotic livestock diseases. Despite the number of zoonoses, 86.5% of articles modeled humans whereas only a few articles (5.1%) contained more than one host species. Eastern Asia (32.5%), North America (17.7%), and Southern Asia (13.1%) were the most represented locations. Frequent approaches included tree-based ML (38.4%) and feed-forward neural networks (26.6%). Articles predicted temporal incidence (66.7%), disease risk (38.0%), and/or spatial movement (31.2%). Less than 10% of studies addressed uncertainty quantification, computational efficiency, and missing data, which are essential to operational use and deployment. This study highlights trends and gaps in ML and DL for ID prediction, providing guidelines for future works to better support biopreparedness and response. To fully utilize ML and DL for improved ID forecasting, models should include the full disease ecology in a One-Health context, important food and agricultural diseases, underrepresented hotspots, and important metrics required for operational deployment. An exponential increase in use of ML and DL models over the past decade. ML/DL can model highly complex systems with a wide variety of input features. ML/DL disease models should include the full disease ecology in a One-Health context. Important food & agricultural diseases and key disease hotspots are underrepresented. Studies must focus on operational deployment needs to better support biopreparedness.
Collapse
|
7
|
Observing flow of He II with unsupervised machine learning. Sci Rep 2022; 12:20383. [PMID: 36437248 PMCID: PMC9701805 DOI: 10.1038/s41598-022-21906-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2022] [Accepted: 10/05/2022] [Indexed: 11/29/2022] Open
Abstract
Time dependent observations of point-to-point correlations of the velocity vector field (structure functions) are necessary to model and understand fluid flow around complex objects. Using thermal gradients, we observed fluid flow by recording fluorescence of [Formula: see text] excimers produced by neutron capture throughout a ~ cm3 volume. Because the photon emitted by an excited excimer is unlikely to be recorded by the camera, the techniques of particle tracking (PTV) and particle imaging (PIV) velocimetry cannot be applied to extract information from the fluorescence of individual excimers. Therefore, we applied an unsupervised machine learning algorithm to identify light from ensembles of excimers (clusters) and then tracked the centroids of the clusters using a particle displacement determination algorithm developed for PTV.
Collapse
|
8
|
Palanivinayagam A, Kumar VV, Mahesh TR, Singh KK, Singh A. Machine Learning-Based COVID-19 Classification Using E-Adopted CT Scans. INTERNATIONAL JOURNAL OF E-ADOPTION 2022. [DOI: 10.4018/ijea.310001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
In recent years, several machine learning models were successfully deployed in various fields. However, a huge quantity of data is required for training good machine learning. Data are distributivity stored across multiple sources and centralizing those data leads to privacy and security issues. To solve this problem, the proposed federated-based method works by exchanging the parameters of three locally trained machine learning models without compromising privacy. Each machine learning model uses the e-adoption of CT scans for improving their training knowledge. The CT scans are electronically transferred between various medical centers. Proper care is taken to prevent identify loss from the e-adopted data. To normalize the parameters, a novel weighting scheme is also exchanged along with the parameters. Thus, the global model is trained with more heterogeneous samples to increase performance. Based on the experiment, the proposed algorithm has obtained 89% of accuracy, which is 32% more than the existing machine learning models.
Collapse
|
9
|
HFMD Cases Prediction Using Transfer One-Step-Ahead Learning. Neural Process Lett 2022. [DOI: 10.1007/s11063-022-10795-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
10
|
Wani SUD, Khan NA, Thakur G, Gautam SP, Ali M, Alam P, Alshehri S, Ghoneim MM, Shakeel F. Utilization of Artificial Intelligence in Disease Prevention: Diagnosis, Treatment, and Implications for the Healthcare Workforce. Healthcare (Basel) 2022; 10:608. [PMID: 35455786 PMCID: PMC9026833 DOI: 10.3390/healthcare10040608] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2022] [Revised: 03/15/2022] [Accepted: 03/18/2022] [Indexed: 01/27/2023] Open
Abstract
Artificial intelligence (AI) has been described as one of the extremely effective and promising scientific tools available to mankind. AI and its associated innovations are becoming more popular in industry and culture, and they are starting to show up in healthcare. Numerous facets of healthcare, as well as regulatory procedures within providers, payers, and pharmaceutical companies, may be transformed by these innovations. As a result, the purpose of this review is to identify the potential machine learning applications in the field of infectious diseases and the general healthcare system. The literature on this topic was extracted from various databases, such as Google, Google Scholar, Pubmed, Scopus, and Web of Science. The articles having important information were selected for this review. The most challenging task for AI in such healthcare sectors is to sustain its adoption in daily clinical practice, regardless of whether the programs are scalable enough to be useful. Based on the summarized data, it has been concluded that AI can assist healthcare staff in expanding their knowledge, allowing them to spend more time providing direct patient care and reducing weariness. Overall, we might conclude that the future of "conventional medicine" is closer than we realize, with patients seeing a computer first and subsequently a doctor.
Collapse
Affiliation(s)
- Shahid Ud Din Wani
- Department of Pharmaceutical Sciences, University of Kashmir, Jammu and Kashmir, Srinagar 190006, India;
| | - Nisar Ahmad Khan
- Department of Pharmaceutical Sciences, University of Kashmir, Jammu and Kashmir, Srinagar 190006, India;
| | - Gaurav Thakur
- Department of Pharmaceutics, CT Institute of Pharmaceutical Sciences, CT Group of Institutions, Jalandhar 144020, India; (G.T.); (S.P.G.)
| | - Surya Prakash Gautam
- Department of Pharmaceutics, CT Institute of Pharmaceutical Sciences, CT Group of Institutions, Jalandhar 144020, India; (G.T.); (S.P.G.)
| | - Mohammad Ali
- Department of Pharmacology, School of Pharmaceutical Sciences, University of Science & Technology, Meghalaya 793101, India;
| | - Prawez Alam
- Department of Pharmacognosy, College of Pharmacy, Prince Sattam Bin Abdulaziz University, Al-Kharj 11942, Saudi Arabia;
| | - Sultan Alshehri
- Department of Pharmaceutics, College of Pharmacy, King Saud University, Riyadh 11451, Saudi Arabia;
| | - Mohammed M. Ghoneim
- Department of Pharmacy Practice, College of Pharmacy, AlMaarefa University, Ad Diriyah 13713, Saudi Arabia;
| | - Faiyaz Shakeel
- Department of Pharmaceutics, College of Pharmacy, King Saud University, Riyadh 11451, Saudi Arabia;
| |
Collapse
|
11
|
Abstract
In the current era, Artificial Intelligence (AI) is becoming increasingly pervasive with applications in several applicative fields effectively changing our daily life. In this scenario, machine learning (ML), a subset of AI techniques, provides machines with the ability to programmatically learn from data to model a system while adapting to new situations as they learn more by data they are ingesting (on-line training). During the last several years, many papers have been published concerning ML applications in the field of solar systems. This paper presents the state of the art ML models applied in solar energy’s forecasting field i.e., for solar irradiance and power production forecasting (both point and interval or probabilistic forecasting), electricity price forecasting and energy demand forecasting. Other applications of ML into the photovoltaic (PV) field taken into account are the modelling of PV modules, PV design parameter extraction, tracking the maximum power point (MPP), PV systems efficiency optimization, PV/Thermal (PV/T) and Concentrating PV (CPV) system design parameters’ optimization and efficiency improvement, anomaly detection and energy management of PV’s storage systems. While many review papers already exist in this regard, they are usually focused only on one specific topic, while in this paper are gathered all the most relevant applications of ML for solar systems in many different fields. The paper gives an overview of the most recent and promising applications of machine learning used in the field of photovoltaic systems.
Collapse
|