1
|
Dostmohammadi M, Pedram MZ, Hoseinzadeh S, Garcia DA. A GA-stacking ensemble approach for forecasting energy consumption in a smart household: A comparative study of ensemble methods. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2024; 364:121264. [PMID: 38870783 DOI: 10.1016/j.jenvman.2024.121264] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/09/2023] [Revised: 05/21/2024] [Accepted: 05/26/2024] [Indexed: 06/15/2024]
Abstract
The considerable amount of energy utilized by buildings has led to various environmental challenges that adversely impact human existence. Predicting buildings' energy usage is commonly acknowledged as encouraging energy efficiency and enabling well-informed decision-making, ultimately leading to decreased energy consumption. Implementing eco-friendly architectural designs is paramount in mitigating energy consumption, particularly in recently constructed structures. This study utilizes clustering analysis on the original dataset to capture complex consumption patterns over various periods. The analysis yields two distinct subsets that represent low and high consumption patterns and an additional subset that exclusively encompasses weekends, attributed to the specific behavior of occupants. Ensemble models have become increasingly popular due to advancements in machine learning techniques. This research utilizes three discrete algorithms, namely Artificial Neural Network (ANN), K-nearest neighbors (KNN), and Decision Trees (DT). In addition, the application employs three more machine learning algorithms bagging and boosting: Random Forest (RF), Extreme Gradient Boosting (XGB), and Gradient Boosting Trees (GBT). To augment the accuracy of predictions, a stacking ensemble methodology is employed, wherein the forecasts generated by many algorithms are combined. Given the obtained outcomes, a thorough examination is undertaken, encompassing the techniques of stacking, bagging, and boosting, to conduct a comprehensive comparative study. It is pertinent to highlight that the stacking technique consistently exhibits superior performance relative to alternative ensemble methodologies across a spectrum of heterogeneous datasets. Furthermore, using a genetic algorithm enables the optimization of the combination of base learners, resulting in a notable enhancement in prediction accuracy. After implementing this optimization technique, GA-Stacking demonstrated remarkable performance in Mean Absolute Percentage Error (MAPE) scores. The improvement observed was substantial, surpassing 90 percent for all datasets. In addition, in subset-1, subset-2, and subset-3, the achieved R2 scores were 0.983, 0.985, and 0.999, respectively. This represents a substantial advancement in forecasting the energy consumption of residential buildings. Such progress underscores the potential advantages of integrating this framework into the practices of building designers, thereby fostering informed decision-making, design management, and optimization prior to construction.
Collapse
Affiliation(s)
- Mahziyar Dostmohammadi
- Energy System Division, Department of Mechanical Engineering, K.N, Toosi University, Tehran P.O.Box 19395-1999, Iran
| | - Mona Zamani Pedram
- Energy System Division, Department of Mechanical Engineering, K.N, Toosi University, Tehran P.O.Box 19395-1999, Iran.
| | - Siamak Hoseinzadeh
- Department of Planning, Design, Technology of Architecture, Sapienza University of Rome, Via Flaminia 72, 00196, Rome, Italy.
| | - Davide Astiaso Garcia
- Department of Planning, Design, Technology of Architecture, Sapienza University of Rome, Via Flaminia 72, 00196, Rome, Italy
| |
Collapse
|
2
|
Trabassi D, Castiglia SF, Bini F, Marinozzi F, Ajoudani A, Lorenzini M, Chini G, Varrecchia T, Ranavolo A, De Icco R, Casali C, Serrao M. Optimizing Rare Disease Gait Classification through Data Balancing and Generative AI: Insights from Hereditary Cerebellar Ataxia. SENSORS (BASEL, SWITZERLAND) 2024; 24:3613. [PMID: 38894404 PMCID: PMC11175240 DOI: 10.3390/s24113613] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/09/2024] [Revised: 05/28/2024] [Accepted: 05/31/2024] [Indexed: 06/21/2024]
Abstract
The interpretability of gait analysis studies in people with rare diseases, such as those with primary hereditary cerebellar ataxia (pwCA), is frequently limited by the small sample sizes and unbalanced datasets. The purpose of this study was to assess the effectiveness of data balancing and generative artificial intelligence (AI) algorithms in generating synthetic data reflecting the actual gait abnormalities of pwCA. Gait data of 30 pwCA (age: 51.6 ± 12.2 years; 13 females, 17 males) and 100 healthy subjects (age: 57.1 ± 10.4; 60 females, 40 males) were collected at the lumbar level with an inertial measurement unit. Subsampling, oversampling, synthetic minority oversampling, generative adversarial networks, and conditional tabular generative adversarial networks (ctGAN) were applied to generate datasets to be input to a random forest classifier. Consistency and explainability metrics were also calculated to assess the coherence of the generated dataset with known gait abnormalities of pwCA. ctGAN significantly improved the classification performance compared with the original dataset and traditional data augmentation methods. ctGAN are effective methods for balancing tabular datasets from populations with rare diseases, owing to their ability to improve diagnostic models with consistent explainability.
Collapse
Affiliation(s)
- Dante Trabassi
- Department of Medical and Surgical Sciences and Biotechnologies, “Sapienza” University of Rome, 04100 Latina, Italy; (D.T.); (C.C.); (M.S.)
| | - Stefano Filippo Castiglia
- Department of Medical and Surgical Sciences and Biotechnologies, “Sapienza” University of Rome, 04100 Latina, Italy; (D.T.); (C.C.); (M.S.)
- Department of Brain and Behavioral Sciences, University of Pavia, 27100 Pavia, Italy;
| | - Fabiano Bini
- Department of Mechanical and Aerospace Engineering, Sapienza University of Rome, 00184 Rome, Italy; (F.B.); (F.M.)
| | - Franco Marinozzi
- Department of Mechanical and Aerospace Engineering, Sapienza University of Rome, 00184 Rome, Italy; (F.B.); (F.M.)
| | - Arash Ajoudani
- Department of Advanced Robotics, Italian Institute of Technology, 16163 Genoa, Italy; (A.A.); (M.L.)
| | - Marta Lorenzini
- Department of Advanced Robotics, Italian Institute of Technology, 16163 Genoa, Italy; (A.A.); (M.L.)
| | - Giorgia Chini
- Department of Occupational and Environmental Medicine, Epidemiology and Hygiene, INAIL, Monte Porzio Catone, 00078 Rome, Italy; (G.C.); (T.V.); (A.R.)
| | - Tiwana Varrecchia
- Department of Occupational and Environmental Medicine, Epidemiology and Hygiene, INAIL, Monte Porzio Catone, 00078 Rome, Italy; (G.C.); (T.V.); (A.R.)
| | - Alberto Ranavolo
- Department of Occupational and Environmental Medicine, Epidemiology and Hygiene, INAIL, Monte Porzio Catone, 00078 Rome, Italy; (G.C.); (T.V.); (A.R.)
| | - Roberto De Icco
- Department of Brain and Behavioral Sciences, University of Pavia, 27100 Pavia, Italy;
- Headache Science & Neurorehabilitation Unit, IRCCS Mondino Foundation, 27100 Pavia, Italy
| | - Carlo Casali
- Department of Medical and Surgical Sciences and Biotechnologies, “Sapienza” University of Rome, 04100 Latina, Italy; (D.T.); (C.C.); (M.S.)
| | - Mariano Serrao
- Department of Medical and Surgical Sciences and Biotechnologies, “Sapienza” University of Rome, 04100 Latina, Italy; (D.T.); (C.C.); (M.S.)
- Movement Analysis Laboratory, Policlinico Italia, 00162 Rome, Italy
| |
Collapse
|
3
|
de Kok JWTM, van Bussel BCT, Schnabel R, van Herpt TTW, Driessen RGH, Meijs DAM, Goossens JA, Mertens HJMM, van Kuijk SMJ, Wynants L, van der Horst ICC, van Rosmalen F. Table 0; documenting the steps to go from clinical database to research dataset. J Clin Epidemiol 2024; 170:111342. [PMID: 38574979 DOI: 10.1016/j.jclinepi.2024.111342] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2023] [Revised: 02/01/2024] [Accepted: 03/26/2024] [Indexed: 04/06/2024]
Abstract
OBJECTIVES Data-driven decision support tools have been increasingly recognized to transform health care. However, such tools are often developed on predefined research datasets without adequate knowledge of the origin of this data and how it was selected. How a dataset is extracted from a clinical database can profoundly impact the validity, interpretability and interoperability of the dataset, and downstream analyses, yet is rarely reported. Therefore, we present a case study illustrating how a definitive patient list was extracted from a clinical source database and how this can be reported. STUDY DESIGN AND SETTING A single-center observational study was performed at an academic hospital in the Netherlands to illustrate the impact of selecting a definitive patient list for research from a clinical source database, and the importance of documenting this process. All admissions from the critical care database admitted between January 1, 2013, and January 1, 2023, were used. RESULTS An interdisciplinary team collaborated to identify and address potential sources of data insufficiency and uncertainty. We demonstrate a stepwise data preparation process, reducing the clinical source database of 54,218 admissions to a definitive patient list of 21,553 admissions. Transparent documentation of the data preparation process improves the quality of the definitive patient list before analysis of the corresponding patient data. This study generated seven important recommendations for preparing observational health-care data for research purposes. CONCLUSION Documenting data preparation is essential for understanding a research dataset originating from a clinical source database before analyzing health-care data. The findings contribute to establishing data standards and offer insights into the complexities of preparing health-care data for scientific investigation. Meticulous data preparation and documentation thereof will improve research validity and advance critical care.
Collapse
Affiliation(s)
- Jip W T M de Kok
- Department of Intensive Care Medicine, Maastricht University Medical Centre+, Maastricht, The Netherlands; Cardiovascular Research Institute Maastricht (CARIM), Maastricht University, Maastricht, The Netherlands
| | - Bas C T van Bussel
- Department of Intensive Care Medicine, Maastricht University Medical Centre+, Maastricht, The Netherlands; Cardiovascular Research Institute Maastricht (CARIM), Maastricht University, Maastricht, The Netherlands; Care and Public Health Research Institute (CAPHRI), Maastricht University, Maastricht, The Netherlands
| | - Ronny Schnabel
- Department of Intensive Care Medicine, Maastricht University Medical Centre+, Maastricht, The Netherlands
| | - Thijs T W van Herpt
- Department of Intensive Care Medicine, Maastricht University Medical Centre+, Maastricht, The Netherlands; Cardiovascular Research Institute Maastricht (CARIM), Maastricht University, Maastricht, The Netherlands
| | - Rob G H Driessen
- Department of Intensive Care Medicine, Maastricht University Medical Centre+, Maastricht, The Netherlands; Cardiovascular Research Institute Maastricht (CARIM), Maastricht University, Maastricht, The Netherlands; Department of Cardiology, Maastricht University Medical Centre+, Maastricht, The Netherlands
| | - Daniek A M Meijs
- Department of Intensive Care Medicine, Maastricht University Medical Centre+, Maastricht, The Netherlands; Cardiovascular Research Institute Maastricht (CARIM), Maastricht University, Maastricht, The Netherlands
| | - Joep A Goossens
- Department of Intensive Care Medicine, Maastricht University Medical Centre+, Maastricht, The Netherlands
| | | | - Sander M J van Kuijk
- Department of Clinical Epidemiology and Medical Technology Assessment (KEMTA), Maastricht University Medical Centre+, Maastricht, The Netherlands
| | - Laure Wynants
- Department of Epidemiology, CAPHRI Care and Public Health Research Institute, Maastricht University, Maastricht, The Netherlands; Department of Development and Regeneration, KU Leuven, Leuven, Belgium
| | - Iwan C C van der Horst
- Department of Intensive Care Medicine, Maastricht University Medical Centre+, Maastricht, The Netherlands; Cardiovascular Research Institute Maastricht (CARIM), Maastricht University, Maastricht, The Netherlands
| | - Frank van Rosmalen
- Department of Intensive Care Medicine, Maastricht University Medical Centre+, Maastricht, The Netherlands; Cardiovascular Research Institute Maastricht (CARIM), Maastricht University, Maastricht, The Netherlands.
| |
Collapse
|
4
|
Begum N, Rahman MM, Omar Faruk M. Machine learning prediction of nutritional status among pregnant women in Bangladesh: Evidence from Bangladesh demographic and health survey 2017-18. PLoS One 2024; 19:e0304389. [PMID: 38820295 PMCID: PMC11142495 DOI: 10.1371/journal.pone.0304389] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2024] [Accepted: 05/12/2024] [Indexed: 06/02/2024] Open
Abstract
AIM Malnutrition in pregnant women significantly affects both mother and child health. This research aims to identify the best machine learning (ML) techniques for predicting the nutritional status of pregnant women in Bangladesh and detect the most essential features based on the best-performed algorithm. METHODS This study used retrospective cross-sectional data from the Bangladeshi Demographic and Health Survey 2017-18. Different feature transformations and machine learning classifiers were applied to find the best transformation and classification model. RESULTS This investigation found that robust scaling outperformed all feature transformation methods. The result shows that the Random Forest algorithm with robust scaling outperforms all other machine learning algorithms with 74.75% accuracy, 57.91% kappa statistics, 73.36% precision, 73.08% recall, and 73.09% f1 score. In addition, the Random Forest algorithm had the highest precision (76.76%) and f1 score (71.71%) for predicting the underweight class, as well as an expected precision of 82.01% and f1 score of 83.78% for the overweight/obese class when compared to other algorithms with a robust scaling method. The respondent's age, wealth index, region, husband's education level, husband's age, and occupation were crucial features for predicting the nutritional status of pregnant women in Bangladesh. CONCLUSION The proposed classifier could help predict the expected outcome and reduce the burden of malnutrition among pregnant women in Bangladesh.
Collapse
Affiliation(s)
- Najma Begum
- Department of Statistics, Noakhali Science and Technology University, Noakhali, Bangladesh
| | | | - Mohammad Omar Faruk
- Department of Statistics, Noakhali Science and Technology University, Noakhali, Bangladesh
| |
Collapse
|
5
|
Ubeira-Gabellini MG, Mori M, Palazzo G, Cicchetti A, Mangili P, Pavarini M, Rancati T, Fodor A, Del Vecchio A, Di Muzio NG, Fiorino C. Comparing Performances of Predictive Models of Toxicity after Radiotherapy for Breast Cancer Using Different Machine Learning Approaches. Cancers (Basel) 2024; 16:934. [PMID: 38473296 DOI: 10.3390/cancers16050934] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2024] [Revised: 02/20/2024] [Accepted: 02/20/2024] [Indexed: 03/14/2024] Open
Abstract
PURPOSE Different ML models were compared to predict toxicity in RT on a large cohort (n = 1314). METHODS The endpoint was RTOG G2/G3 acute toxicity, resulting in 204/1314 patients with the event. The dataset, including 25 clinical, anatomical, and dosimetric features, was split into 984 for training and 330 for internal tests. The dataset was standardized; features with a high p-value at univariate LR and with Spearman ρ>0.8 were excluded; synthesized data of the minority were generated to compensate for class imbalance. Twelve ML methods were considered. Model optimization and sequential backward selection were run to choose the best models with a parsimonious feature number. Finally, feature importance was derived for every model. RESULTS The model's performance was compared on a training-test dataset over different metrics: the best performance model was LightGBM. Logistic regression with three variables (LR3) selected via bootstrapping showed performances similar to the best-performing models. The AUC of test data is slightly above 0.65 for the best models (highest value: 0.662 with LightGBM). CONCLUSIONS No model performed the best for all metrics: more complex ML models had better performances; however, models with just three features showed performances comparable to the best models using many (n = 13-19) features.
Collapse
Affiliation(s)
| | - Martina Mori
- Medical Physics, IRCCS San Raffaele Scientific Institute, 20132 Milan, Italy
| | - Gabriele Palazzo
- Medical Physics, IRCCS San Raffaele Scientific Institute, 20132 Milan, Italy
| | - Alessandro Cicchetti
- Data Science Unit, Fondazione IRCCS Istituto Nazionale dei Tumori, 20133 Milan, Italy
| | - Paola Mangili
- Medical Physics, IRCCS San Raffaele Scientific Institute, 20132 Milan, Italy
| | - Maddalena Pavarini
- Medical Physics, IRCCS San Raffaele Scientific Institute, 20132 Milan, Italy
| | - Tiziana Rancati
- Data Science Unit, Fondazione IRCCS Istituto Nazionale dei Tumori, 20133 Milan, Italy
| | - Andrei Fodor
- Radiotherapy, IRCCS San Raffaele Scientific Institute, 20132 Milan, Italy
| | | | - Nadia Gisella Di Muzio
- Radiotherapy, IRCCS San Raffaele Scientific Institute, 20132 Milan, Italy
- Department of Radiotherapy, Vita-Salute San Raffaele University, 20132 Milan, Italy
| | - Claudio Fiorino
- Medical Physics, IRCCS San Raffaele Scientific Institute, 20132 Milan, Italy
| |
Collapse
|
6
|
A G, M T, N S. Machine learning, a powerful tool for the prediction of BiVO 4 nanoparticles efficiency in photocatalytic degradation of organic dyes. JOURNAL OF ENVIRONMENTAL SCIENCE AND HEALTH. PART A, TOXIC/HAZARDOUS SUBSTANCES & ENVIRONMENTAL ENGINEERING 2024; 59:15-24. [PMID: 38400531 DOI: 10.1080/10934529.2024.2319510] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/18/2023] [Accepted: 02/08/2024] [Indexed: 02/25/2024]
Abstract
Wastewater pollution caused by organic dyes is a growing concern due to its negative impact on human health and aquatic life. To tackle this issue, the use of advanced wastewater treatment with nano photocatalysts has emerged as a promising solution. However, experimental procedures for identifying the optimal conditions for dye degradation could be time-consuming and expensive. To overcome this, machine learning methods have been employed to predict the degradation of organic dyes in a more efficient manner by recognizing patterns in the process and addressing its feasibility. The objective of this study is to develop a machine learning model to predict the degradation of organic dyes and identify the main variables affecting the photocatalytic degradation capacity and removal of organic dyes from wastewater. Nine machine learning algorithms were tested including multiple linear regression, polynomial regression, decision trees, random forest, adaptive boosting, extreme gradient boosting, k-nearest neighbors, support vector machine, and artificial neural network. The study found that the XGBoosting algorithm outperformed the other models, making it ideal for predicting the photocatalytic degradation capacity of BiVO4. The results suggest that XGBoost is a suitable model for predicting the photocatalytic degradation of wastewater using BiVO4 with different dopants.
Collapse
Affiliation(s)
- Gnanaprakasam A
- Department of Chemical Engineering, Coimbatore Institute of Technology, Coimbatore, Tamil Nadu, India
| | - Thirumarimurugan M
- Department of Chemical Engineering, Coimbatore Institute of Technology, Coimbatore, Tamil Nadu, India
| | - Shanmathi N
- Department of Chemical Engineering, Coimbatore Institute of Technology, Coimbatore, Tamil Nadu, India
| |
Collapse
|
7
|
Li A, Mullin S, Elkin PL. Improving Prediction of Survival for Extremely Premature Infants Born at 23 to 29 Weeks Gestational Age in the Neonatal Intensive Care Unit: Development and Evaluation of Machine Learning Models. JMIR Med Inform 2024; 12:e42271. [PMID: 38354033 PMCID: PMC10902770 DOI: 10.2196/42271] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2022] [Revised: 02/02/2023] [Accepted: 12/28/2023] [Indexed: 03/02/2024] Open
Abstract
BACKGROUND Infants born at extremely preterm gestational ages are typically admitted to the neonatal intensive care unit (NICU) after initial resuscitation. The subsequent hospital course can be highly variable, and despite counseling aided by available risk calculators, there are significant challenges with shared decision-making regarding life support and transition to end-of-life care. Improving predictive models can help providers and families navigate these unique challenges. OBJECTIVE Machine learning methods have previously demonstrated added predictive value for determining intensive care unit outcomes, and their use allows consideration of a greater number of factors that potentially influence newborn outcomes, such as maternal characteristics. Machine learning-based models were analyzed for their ability to predict the survival of extremely preterm neonates at initial admission. METHODS Maternal and newborn information was extracted from the health records of infants born between 23 and 29 weeks of gestation in the Medical Information Mart for Intensive Care III (MIMIC-III) critical care database. Applicable machine learning models predicting survival during the initial NICU admission were developed and compared. The same type of model was also examined using only features that would be available prepartum for the purpose of survival prediction prior to an anticipated preterm birth. Features most correlated with the predicted outcome were determined when possible for each model. RESULTS Of included patients, 37 of 459 (8.1%) expired. The resulting random forest model showed higher predictive performance than the frequently used Score for Neonatal Acute Physiology With Perinatal Extension II (SNAPPE-II) NICU model when considering extremely preterm infants of very low birth weight. Several other machine learning models were found to have good performance but did not show a statistically significant difference from previously available models in this study. Feature importance varied by model, and those of greater importance included gestational age; birth weight; initial oxygenation level; elements of the APGAR (appearance, pulse, grimace, activity, and respiration) score; and amount of blood pressure support. Important prepartum features also included maternal age, steroid administration, and the presence of pregnancy complications. CONCLUSIONS Machine learning methods have the potential to provide robust prediction of survival in the context of extremely preterm births and allow for consideration of additional factors such as maternal clinical and socioeconomic information. Evaluation of larger, more diverse data sets may provide additional clarity on comparative performance.
Collapse
Affiliation(s)
- Angie Li
- Department of Biomedical Informatics, Jacobs School of Medicine and Biomedical Sciences, University at Buffalo, Buffalo, NY, United States
| | - Sarah Mullin
- Department of Biomedical Informatics, Jacobs School of Medicine and Biomedical Sciences, University at Buffalo, Buffalo, NY, United States
| | - Peter L Elkin
- Department of Biomedical Informatics, Jacobs School of Medicine and Biomedical Sciences, University at Buffalo, Buffalo, NY, United States
| |
Collapse
|
8
|
Hassan J, Saeed SM, Deka L, Uddin MJ, Das DB. Applications of Machine Learning (ML) and Mathematical Modeling (MM) in Healthcare with Special Focus on Cancer Prognosis and Anticancer Therapy: Current Status and Challenges. Pharmaceutics 2024; 16:260. [PMID: 38399314 PMCID: PMC10892549 DOI: 10.3390/pharmaceutics16020260] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2023] [Revised: 01/29/2024] [Accepted: 02/07/2024] [Indexed: 02/25/2024] Open
Abstract
The use of data-driven high-throughput analytical techniques, which has given rise to computational oncology, is undisputed. The widespread use of machine learning (ML) and mathematical modeling (MM)-based techniques is widely acknowledged. These two approaches have fueled the advancement in cancer research and eventually led to the uptake of telemedicine in cancer care. For diagnostic, prognostic, and treatment purposes concerning different types of cancer research, vast databases of varied information with manifold dimensions are required, and indeed, all this information can only be managed by an automated system developed utilizing ML and MM. In addition, MM is being used to probe the relationship between the pharmacokinetics and pharmacodynamics (PK/PD interactions) of anti-cancer substances to improve cancer treatment, and also to refine the quality of existing treatment models by being incorporated at all steps of research and development related to cancer and in routine patient care. This review will serve as a consolidation of the advancement and benefits of ML and MM techniques with a special focus on the area of cancer prognosis and anticancer therapy, leading to the identification of challenges (data quantity, ethical consideration, and data privacy) which are yet to be fully addressed in current studies.
Collapse
Affiliation(s)
- Jasmin Hassan
- Drug Delivery & Therapeutics Lab, Dhaka 1212, Bangladesh; (J.H.); (S.M.S.)
| | | | - Lipika Deka
- Faculty of Computing, Engineering and Media, De Montfort University, Leicester LE1 9BH, UK;
| | - Md Jasim Uddin
- Department of Pharmaceutical Technology, Faculty of Pharmacy, Universiti Malaya, Kuala Lumpur 50603, Malaysia
| | - Diganta B. Das
- Department of Chemical Engineering, Loughborough University, Loughborough LE11 3TU, UK
| |
Collapse
|
9
|
Moaveninejad S, D'Onofrio V, Tecchio F, Ferracuti F, Iarlori S, Monteriù A, Porcaro C. Fractal Dimension as a discriminative feature for high accuracy classification in motor imagery EEG-based brain-computer interface. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2024; 244:107944. [PMID: 38064955 DOI: 10.1016/j.cmpb.2023.107944] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/06/2023] [Revised: 10/31/2023] [Accepted: 11/24/2023] [Indexed: 01/26/2024]
Abstract
BACKGROUND AND OBJECTIVE The brain-computer interface (BCI) technology acquires human brain electrical signals, which can be effectively and successfully used to control external devices, potentially supporting subjects suffering from motor impairments in the interaction with the environment. To this aim, BCI systems must correctly decode and interpret neurophysiological signals reflecting the intention of the subjects to move. Therefore, the accurate classification of single events in motor tasks represents a fundamental challenge in ensuring efficient communication and control between users and BCIs. Movement-associated changes in electroencephalographic (EEG) sensorimotor rhythms, such as event-related desynchronization (ERD), are well-known features of discriminating motor tasks. Fractal dimension (FD) can be used to evaluate the complexity and self-similarity in brain signals, potentially providing complementary information to frequency-based signal features. METHODS In the present work, we introduce FD as a novel feature for subject-independent event classification, and we test several machine learning (ML) models in behavioural tasks of motor imagery (MI) and motor execution (ME). RESULTS Our results show that FD improves the classification accuracy of ML compared to ERD. Furthermore, unilateral hand movements have higher classification accuracy than bilateral movements in both MI and ME tasks. CONCLUSIONS These results provide further insights into subject-independent event classification in BCI systems and demonstrate the potential of FD as a discriminative feature for EEG signals.
Collapse
Affiliation(s)
| | | | - Franca Tecchio
- Institute of Cognitive Sciences and Technologies (ISCT) - National Research Council (CNR), 00185 Rome, Italy
| | - Francesco Ferracuti
- Department of Information Engineering, Università Politecnica delle Marche, 60131 Ancona, Italy
| | - Sabrina Iarlori
- Department of Information Engineering, Università Politecnica delle Marche, 60131 Ancona, Italy
| | - Andrea Monteriù
- Department of Information Engineering, Università Politecnica delle Marche, 60131 Ancona, Italy
| | - Camillo Porcaro
- Department of Neuroscience, University of Padova, 35128 Padua, Italy; Padova Neuroscience Center (PNC), University of Padova, 35131 Padua, Italy; Institute of Cognitive Sciences and Technologies (ISCT) - National Research Council (CNR), 00185 Rome, Italy; Centre for Human Brain Health, School of Psychology, University of Birmingham, Birmingham B15 2TT, UK.
| |
Collapse
|
10
|
Olcay B, Ozdemir GD, Ozdemir MA, Ercan UK, Guren O, Karaman O. Prediction of the synergistic effect of antimicrobial peptides and antimicrobial agents via supervised machine learning. BMC Biomed Eng 2024; 6:1. [PMID: 38233957 DOI: 10.1186/s42490-024-00075-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2023] [Accepted: 01/09/2024] [Indexed: 01/19/2024] Open
Abstract
BACKGROUND Infectious diseases not only cause severe health problems but also burden the healthcare system. Therefore, the effective treatment of those diseases is crucial. Both conventional approaches, such as antimicrobial agents, and novel approaches, like antimicrobial peptides (AMPs), are used to treat infections. However, due to the drawbacks of current approaches, new solutions are still being investigated. One recent approach is the use of AMPs and antimicrobial agents in combination, but determining synergism is with a huge variety of AMPs time-consuming and requires multiple experimental studies. Machine learning (ML) algorithms are widely used to predict biological outcomes, particularly in the field of AMPs, but no previous research reported on predicting the synergistic effects of AMPs and antimicrobial agents. RESULTS Several supervised ML models were implemented to accurately predict the synergistic effect of AMPs and antimicrobial agents. The results demonstrated that the hyperparameter-optimized Light Gradient Boosted Machine Classifier (oLGBMC) yielded the best test accuracy of 76.92% for predicting the synergistic effect. Besides, the feature importance analysis reveals that the target microbial species, the minimum inhibitory concentrations (MICs) of the AMP and the antimicrobial agents, and the used antimicrobial agent were the most important features for the prediction of synergistic effect, which aligns with recent experimental studies in the literature. CONCLUSION This study reveals that ML algorithms can predict the synergistic activity of two different antimicrobial agents without the need for complex and time-consuming experimental procedures. The implications support that the ML models may not only reduce the experimental cost but also provide validation of experimental procedures.
Collapse
Affiliation(s)
- Basak Olcay
- Department of Biomedical Engineering, Graduate School of Natural and Applied Sciences, Izmir Katip Celebi University, Izmir, Turkey
| | - Gizem D Ozdemir
- Department of Biomedical Engineering, Faculty of Engineering and Architecture, Izmir Katip Celebi University, Izmir, Turkey
| | - Mehmet A Ozdemir
- Department of Biomedical Engineering, Faculty of Engineering and Architecture, Izmir Katip Celebi University, Izmir, Turkey.
| | - Utku K Ercan
- Department of Biomedical Engineering, Faculty of Engineering and Architecture, Izmir Katip Celebi University, Izmir, Turkey
| | - Onan Guren
- Department of Biomedical Engineering, Faculty of Engineering and Architecture, Izmir Katip Celebi University, Izmir, Turkey
| | - Ozan Karaman
- Department of Biomedical Engineering, Faculty of Engineering and Architecture, Izmir Katip Celebi University, Izmir, Turkey
| |
Collapse
|
11
|
Rodríguez Mallma MJ, Vilca-Aguilar M, Zuloaga-Rotta L, Borja-Rosales R, Salas-Ojeda M, Mauricio D. Machine Learning Approach for Analyzing 3-Year Outcomes of Patients with Brain Arteriovenous Malformation (AVM) after Stereotactic Radiosurgery (SRS). Diagnostics (Basel) 2023; 14:22. [PMID: 38201331 PMCID: PMC10871108 DOI: 10.3390/diagnostics14010022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2023] [Revised: 12/14/2023] [Accepted: 12/17/2023] [Indexed: 01/12/2024] Open
Abstract
A cerebral arteriovenous malformation (AVM) is a tangle of abnormal blood vessels that irregularly connects arteries and veins. Stereotactic radiosurgery (SRS) has been shown to be an effective treatment for AVM patients, but the factors associated with AVM obliteration remains a matter of debate. In this study, we aimed to develop a model that can predict whether patients with AVM will be cured 36 months after intervention by means of SRS and identify the most important predictors that explain the probability of being cured. A machine learning (ML) approach was applied using decision tree (DT) and logistic regression (LR) techniques on historical data (sociodemographic, clinical, treatment, angioarchitecture, and radiosurgery procedure) of 202 patients with AVM who underwent SRS at the Instituto de Radiocirugía del Perú (IRP) between 2005 and 2018. The LR model obtained the best results for predicting AVM cure with an accuracy of 0.92, sensitivity of 0.93, specificity of 0.89, and an area under the curve (AUC) of 0.98, which shows that ML models are suitable for predicting the prognosis of medical conditions such as AVM and can be a support tool for medical decision-making. In addition, several factors were identified that could explain whether patients with AVM would be cured at 36 months with the highest likelihood: the location of the AVM, the occupation of the patient, and the presence of hemorrhage.
Collapse
Affiliation(s)
| | - Marcos Vilca-Aguilar
- Instituto de Radiocirugía del Perú, Clínica San Pablo, Lima 15023, Peru
- Servicio de Neurocirugía, Hospital María Auxiliadora, Lima 15828, Peru
| | - Luis Zuloaga-Rotta
- Facultad de Ingeniería Industrial y de Sistemas, Universidad Nacional de Ingeniería, Lima 15333, Peru
| | - Rubén Borja-Rosales
- Facultad de Ingeniería Industrial y de Sistemas, Universidad Nacional de Ingeniería, Lima 15333, Peru
| | | | - David Mauricio
- Universidad Nacional Mayor de San Marcos, Lima 15081, Peru
| |
Collapse
|
12
|
Kloonen RMJS, Varisco G, de Kort E, Andriessen P, Niemarkt HJ, van Pul C. Predicting CPAP failure after less invasive surfactant administration (LISA) in preterm infants by machine learning model on vital parameter data: a pilot study. Physiol Meas 2023; 44:115005. [PMID: 37939392 DOI: 10.1088/1361-6579/ad0ab6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2023] [Accepted: 11/07/2023] [Indexed: 11/10/2023]
Abstract
Objective. Less invasive surfactant administration (LISA) has been introduced to preterm infants with respiratory distress syndrome on continuous positive airway pressure (CPAP) support in order to avoid intubation and mechanical ventilation. However, after this LISA procedure, a significant part of infants fails CPAP treatment (CPAP-F) and requires intubation in the first 72 h of life, which is associated with worse complication free survival chances. The aim of this study was to predict CPAP-F after LISA, based on machine learning (ML) analysis of high resolution vital parameter monitoring data surrounding the LISA procedure.Approach. Patients with a gestational age (GA) <32 weeks receiving LISA were included. Vital parameter data was obtained from a data warehouse. Physiological features (HR, RR, peripheral oxygen saturation (SpO2) and body temperature) were calculated in eight 0.5 h windows throughout a period 1.5 h before to 2.5 h after LISA. First, physiological data was analyzed to investigate differences between the CPAP-F and CPAP-Success (CPAP-S) groups. Next, the performance of two types of ML models (logistic regression: LR, support vector machine: SVM) for the prediction of CPAP-F were evaluated.Main results. Of 51 included patients, 18 (35%) had CPAP-F. Univariate analysis showed lower SpO2, temperature and heart rate variability (HRV) before and after the LISA procedure. The best performing ML model showed an area under the curve of 0.90 and 0.93 for LR and SVM respectively in the 0.5 h window directly after LISA, with GA, HRV, respiration rate and SpO2as most important features. Excluding GA decreased performance in both models.Significance. In this pilot study we were able to predict CPAP-F with a ML model of patient monitor signals, with best performance in the first 0.5 h after LISA. Using ML to predict CPAP-F based on vital signals gains insight in (possibly modifiable) factors that are associated with LISA failure and can help to guide personalized clinical decisions in early respiratory management.
Collapse
Affiliation(s)
- R M J S Kloonen
- Eindhoven University of Technology, PO Box 513, 5600 MB Eindhoven, The Netherlands
- Maxima Medical Centre Veldhoven, Department of Clinical Physics, Po Box 7777, 5600 MB, The Netherlands
| | - G Varisco
- Eindhoven University of Technology, PO Box 513, 5600 MB Eindhoven, The Netherlands
| | - E de Kort
- Maxima Medical Centre Veldhoven, Department of Pediatrics, Po Box 7777, 5600 MB, The Netherlands
| | - P Andriessen
- Maxima Medical Centre Veldhoven, Department of Pediatrics, Po Box 7777, 5600 MB, The Netherlands
| | - H J Niemarkt
- Eindhoven University of Technology, PO Box 513, 5600 MB Eindhoven, The Netherlands
- Maxima Medical Centre Veldhoven, Department of Pediatrics, Po Box 7777, 5600 MB, The Netherlands
| | - C van Pul
- Eindhoven University of Technology, PO Box 513, 5600 MB Eindhoven, The Netherlands
- Maxima Medical Centre Veldhoven, Department of Clinical Physics, Po Box 7777, 5600 MB, The Netherlands
| |
Collapse
|
13
|
Tran TS, Stitmannaithum B, Van Hong Bui L, Nguyen TT. Data-driven prediction of the shear capacity of ETS-FRP-strengthened beams in the hybrid 2PKT-ML approach. Sci Rep 2023; 13:19871. [PMID: 37963991 PMCID: PMC10646016 DOI: 10.1038/s41598-023-47064-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2023] [Accepted: 11/08/2023] [Indexed: 11/16/2023] Open
Abstract
A new approach that combines analytical two-parameter kinematic theory (2PKT) with machine learning (ML) models for estimating the shear capacity of embedded through-section (ETS)-strengthened reinforced concrete (RC) beams is proposed. The 2PKT was first developed to validate its representativeness and confidence against the available experimental data of ETS-retrofitted RC beams. Given the deficiency of the test data, the developed 2PKT was utilized to generate a large data pool with 2643 samples. The aim was to optimize the ML algorithms, namely, the random forest, extreme gradient boosting (XGBoost), light gradient boosting machine, and artificial neural network (ANN) algorithm. The optimized ANN model exhibited the highest accuracy in predicting the total shear strength of ETS-strengthened beams and ETS shear contribution. In terms of predicting the total shear strength of ETS-strengthened beams, the ANN model achieved R2 values of 0.99, 0.98, and 0.96 for the training, validation, and testing data, respectively. By contrast, the ANN model could predict ETS shear contribution with high accuracy, with R2 values of 0.99, 0.99, and 0.97 for the training, validation, and testing data, respectively. Then, the effects of all design variables on the shear capacity of the ETS-strengthened beams were investigated using the hybrid 2PKT-ML. The obtained trends could well appraise the reasonability of the proposed approach.
Collapse
Affiliation(s)
- Thai Son Tran
- Research and Development Department, IRIS Technology, 301 Daeyang AI Center, 209 Neungdong-ro, Gunja-dong, Gwangjin-gu, Seoul, 05006, South Korea
| | - Boonchai Stitmannaithum
- Center of Excellence in Innovative Construction Materials, Department of Civil Engineering, Faculty of Engineering, Chulalongkorn University, 254 Phayathai Road, Pathumwan, Bangkok, 10330, Thailand
| | - Linh Van Hong Bui
- Laboratory for Computational Civil Engineering, Institute for Computational Science and Artificial Intelligence, Van Lang University, Ho Chi Minh City, Vietnam.
- Faculty of Civil Engineering, School of Technology, Van Lang University, Ho Chi Minh City, Vietnam.
| | - Thanh-Truong Nguyen
- Industrial Maintenance Training Center, Ho Chi Minh City University of Technology (HCMUT), Ward 14, District 10, Ho Chi Minh City, Vietnam
- Vietnam National University Ho Chi Minh City, Linh Trung Ward, Thu Duc City, Ho Chi Minh City, Vietnam
| |
Collapse
|
14
|
Xiao X. DVNE-DRL: dynamic virtual network embedding algorithm based on deep reinforcement learning. Sci Rep 2023; 13:19789. [PMID: 37957350 PMCID: PMC10643368 DOI: 10.1038/s41598-023-47195-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Accepted: 11/10/2023] [Indexed: 11/15/2023] Open
Abstract
Virtual network embedding (VNE), as the key challenge of network resource management technology, lies in the contradiction between online embedding decision and pursuing long-term average revenue goals. Most of the previous work ignored the dynamics in Virtual Network (VN) modeling, or could not automatically detect the complex and time-varying network state to provide a reasonable network embedding scheme. In view of this, we model a network embedding framework where the topology and resource allocation change dynamically with the number of network users and workload, and then introduce a deep reinforcement learning method to solve the VNE problem. Further, a dynamic virtual network embedding algorithm based on Deep Reinforcement Learning (DRL), named DVNE-DRL, is proposed. In DVNE-DRL, VNE is modeled as a Markov Decision Process (MDP), and then deep learning is introduced to perceive the current network state through historical data and embedded knowledge, while utilizing reinforcement learning decision-making capabilities to implement the network embedding process. In addition, we improve the method of feature extraction and matrix optimization, and consider the characteristics of virtual network and physical network together to alleviate the problem of redundancy and slow convergence. The simulation results show that compared with the existing advanced algorithms, the acceptance rate and average revenue of DVNE-DRL are increased by about 25% and 35%, respectively.
Collapse
Affiliation(s)
- Xiancui Xiao
- School of Information Engineering, Shandong Management University, Ji'nan, 250357, China.
- Key Laboratory of TCM Data Cloud Service in Universities of Shandong, Ji'nan, China.
| |
Collapse
|
15
|
Yagin B, Yagin FH, Colak C, Inceoglu F, Kadry S, Kim J. Cancer Metastasis Prediction and Genomic Biomarker Identification through Machine Learning and eXplainable Artificial Intelligence in Breast Cancer Research. Diagnostics (Basel) 2023; 13:3314. [PMID: 37958210 PMCID: PMC10650093 DOI: 10.3390/diagnostics13213314] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2023] [Revised: 10/17/2023] [Accepted: 10/25/2023] [Indexed: 11/15/2023] Open
Abstract
AIM Method: This research presents a model combining machine learning (ML) techniques and eXplainable artificial intelligence (XAI) to predict breast cancer (BC) metastasis and reveal important genomic biomarkers in metastasis patients. METHOD A total of 98 primary BC samples was analyzed, comprising 34 samples from patients who developed distant metastases within a 5-year follow-up period and 44 samples from patients who remained disease-free for at least 5 years after diagnosis. Genomic data were then subjected to biostatistical analysis, followed by the application of the elastic net feature selection method. This technique identified a restricted number of genomic biomarkers associated with BC metastasis. A light gradient boosting machine (LightGBM), categorical boosting (CatBoost), Extreme Gradient Boosting (XGBoost), Gradient Boosting Trees (GBT), and Ada boosting (AdaBoost) algorithms were utilized for prediction. To assess the models' predictive abilities, the accuracy, F1 score, precision, recall, area under the ROC curve (AUC), and Brier score were calculated as performance evaluation metrics. To promote interpretability and overcome the "black box" problem of ML models, a SHapley Additive exPlanations (SHAP) method was employed. RESULTS The LightGBM model outperformed other models, yielding remarkable accuracy of 96% and an AUC of 99.3%. In addition to biostatistical evaluation, in XAI-based SHAP results, increased expression levels of TSPYL5, ATP5E, CA9, NUP210, SLC37A1, ARIH1, PSMD7, UBQLN1, PRAME, and UBE2T (p ≤ 0.05) were found to be associated with an increased incidence of BC metastasis. Finally, decreased levels of expression of CACTIN, TGFB3, SCUBE2, ARL4D, OR1F1, ALDH4A1, PHF1, and CROCC (p ≤ 0.05) genes were also determined to increase the risk of metastasis in BC. CONCLUSION The findings of this study may prevent disease progression and metastases and potentially improve clinical outcomes by recommending customized treatment approaches for BC patients.
Collapse
Affiliation(s)
- Burak Yagin
- Department of Biostatistics and Medical Informatics, Faculty of Medicine, Inonu University, Malatya 44280, Turkey; (B.Y.); (C.C.)
| | - Fatma Hilal Yagin
- Department of Biostatistics and Medical Informatics, Faculty of Medicine, Inonu University, Malatya 44280, Turkey; (B.Y.); (C.C.)
| | - Cemil Colak
- Department of Biostatistics and Medical Informatics, Faculty of Medicine, Inonu University, Malatya 44280, Turkey; (B.Y.); (C.C.)
| | - Feyza Inceoglu
- Department of Biostatistics, Faculty of Medicine, Malatya Turgut Ozal University, Malatya 44090, Turkey;
| | - Seifedine Kadry
- Department of applied Data science, Noroff University College, 4612 Kristiansand, Norway;
- Artificial Intelligence Research Center (AIRC), Ajman University, Ajman 346, United Arab Emirates
- Department of Electrical and Computer Engineering, Lebanese American University, Byblos 36, Lebanon
| | - Jungeun Kim
- Department of Software, Kongju National University, Cheonan 31080, Republic of Korea
| |
Collapse
|
16
|
Hasan MF, Smith R, Vajedian S, Pommerenke R, Majumdar S. Global land subsidence mapping reveals widespread loss of aquifer storage capacity. Nat Commun 2023; 14:6180. [PMID: 37794012 PMCID: PMC10550978 DOI: 10.1038/s41467-023-41933-z] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2023] [Accepted: 09/22/2023] [Indexed: 10/06/2023] Open
Abstract
Groundwater overdraft gives rise to multiple adverse impacts including land subsidence and permanent groundwater storage loss. Existing methods are unable to characterize groundwater storage loss at the global scale with sufficient resolution to be relevant for local studies. Here we explore the interrelation between groundwater stress, aquifer depletion, and land subsidence using remote sensing and model-based datasets with a machine learning approach. The developed model predicts global land subsidence magnitude at high spatial resolution (~2 km), provides a first-order estimate of aquifer storage loss due to consolidation of ~17 km3/year globally, and quantifies key drivers of subsidence. Roughly 73% of the mapped subsidence occurs over cropland and urban areas, highlighting the need for sustainable groundwater management practices over these areas. The results of this study aid in assessing the spatial extents of subsidence in known subsiding areas, and in locating unknown groundwater stressed regions.
Collapse
Affiliation(s)
- Md Fahim Hasan
- Department of Civil and Environmental Engineering, Colorado State University, Fort Collins, CO, 80523, USA.
| | - Ryan Smith
- Department of Civil and Environmental Engineering, Colorado State University, Fort Collins, CO, 80523, USA
| | - Sanaz Vajedian
- Department of Geosciences and Geological and Petroleum Engineering, Missouri University of Science and Technology, Rolla, MO, 65409, USA
| | - Rahel Pommerenke
- Department of Civil and Environmental Engineering, Colorado State University, Fort Collins, CO, 80523, USA
| | - Sayantan Majumdar
- Division of Hydrologic Sciences, Desert Research Institute, Reno, NV, 89512, USA
| |
Collapse
|
17
|
Zafar K, Siddiqui HUR, Majid A, Rustam F, Alfarhood S, Safran M, Ashraf I. Enhancing Diagnosis of Anterior and Inferior Myocardial Infarctions Using UWB Radar and AI-Driven Feature Fusion Approach. SENSORS (BASEL, SWITZERLAND) 2023; 23:7756. [PMID: 37765813 PMCID: PMC10537523 DOI: 10.3390/s23187756] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/28/2023] [Revised: 09/02/2023] [Accepted: 09/05/2023] [Indexed: 09/29/2023]
Abstract
Despite significant improvement in prognosis, myocardial infarction (MI) remains a major cause of morbidity and mortality around the globe. MI is a life-threatening cardiovascular condition that requires prompt diagnosis and appropriate treatment. The primary objective of this research is to identify instances of anterior and inferior myocardial infarction by utilizing data obtained from Ultra-wideband radar technology in a hospital for patients of anterior and inferior MI. The collected data is preprocessed to extract spectral features. A novel feature engineering approach is designed to fuse temporal features and class prediction probability features derived from the spectral feature dataset. Several well-known machine learning models are implemented and fine-tuned to obtain optimal performance in the detection of anterior and inferior MI. The results demonstrate that integration of the fused feature set with machine learning models results in a notable improvement in both the accuracy and precision of MI detection. Notably, random forest (RF) and k-nearest neighbor showed superb performance with an accuracy of 98.8%. For demonstrating the capacity of models to generalize, K-fold cross-validation is carried out, wherein RF exhibits a mean accuracy of 99.1%. Furthermore, the examination of computational complexity indicates a low computational complexity, thereby indicating computational efficiency.
Collapse
Affiliation(s)
- Kainat Zafar
- Institute of Computer Science, Khwaja Fareed University of Engineering and Information Technology, Abu Dhabi Road, Rahim Yar Khan 64200, Punjab, Pakistan; (K.Z.); (H.U.R.S.)
| | - Hafeez Ur Rehman Siddiqui
- Institute of Computer Science, Khwaja Fareed University of Engineering and Information Technology, Abu Dhabi Road, Rahim Yar Khan 64200, Punjab, Pakistan; (K.Z.); (H.U.R.S.)
| | - Abdul Majid
- Cardiology Department, Sheikh Zayed Medical College & Hospital, Rahim Yar Khan 64200, Punjab, Pakistan;
| | - Furqan Rustam
- School of Computer Science, University College Dublin, D04 V1W8 Dublin, Ireland;
| | - Sultan Alfarhood
- Department of Computer Science, College of Computer and Information Sciences, King Saud University, P.O. Box 51178, Riyadh 11543, Saudi Arabia;
| | - Mejdl Safran
- Department of Computer Science, College of Computer and Information Sciences, King Saud University, P.O. Box 51178, Riyadh 11543, Saudi Arabia;
| | - Imran Ashraf
- Department of Information and Communication Engineering, Yeungnam University, Gyeongsan 38541, Republic of Korea
| |
Collapse
|
18
|
Greenberg ZF, Graim KS, He M. Towards artificial intelligence-enabled extracellular vesicle precision drug delivery. Adv Drug Deliv Rev 2023:114974. [PMID: 37356623 DOI: 10.1016/j.addr.2023.114974] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2023] [Revised: 06/21/2023] [Accepted: 06/22/2023] [Indexed: 06/27/2023]
Abstract
Extracellular Vesicles (EVs), particularly exosomes, recently exploded into nanomedicine as an emerging drug delivery approach due to their superior biocompatibility, circulating stability, and bioavailability in vivo. However, EV heterogeneity makes molecular targeting precision a critical challenge. Deciphering key molecular drivers for controlling EV tissue targeting specificity is in great need. Artificial intelligence (AI) brings powerful prediction ability for guiding the rational design of engineered EVs in precision control for drug delivery. This review focuses on cutting-edge nano-delivery via integrating large-scale EV data with AI to develop AI-directed EV therapies and illuminate the clinical translation potential. We briefly review the current status of EVs in drug delivery, including the current frontier, limitations, and considerations to advance the field. Subsequently, we detail the future of AI in drug delivery and its impact on precision EV delivery. Our review discusses the current universal challenge of standardization and critical considerations when using AI combined with EVs for precision drug delivery. Finally, we will conclude this review with a perspective on future clinical translation led by a combined effort of AI and EV research.
Collapse
Affiliation(s)
- Zachary F Greenberg
- Department of Pharmaceutics, College of Pharmacy, University of Florida, Gainesville, Florida, 32610, USA
| | - Kiley S Graim
- Department of Computer & Information Science & Engineering, Herbert Wertheim College of Engineering, University of Florida, Gainesville, Florida, 32610, USA
| | - Mei He
- Department of Pharmaceutics, College of Pharmacy, University of Florida, Gainesville, Florida, 32610, USA.
| |
Collapse
|
19
|
Mehmood A, Lee KT, Kim DH. Energy Prediction and Optimization for Smart Homes with Weather Metric-Weight Coefficients. SENSORS (BASEL, SWITZERLAND) 2023; 23:3640. [PMID: 37050700 PMCID: PMC10099256 DOI: 10.3390/s23073640] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/25/2023] [Revised: 03/26/2023] [Accepted: 03/27/2023] [Indexed: 06/19/2023]
Abstract
Home appliances are considered to account for a large portion of smart homes' energy consumption. This is due to the abundant use of IoT devices. Various home appliances, such as heaters, dishwashers, and vacuum cleaners, are used every day. It is thought that proper control of these home appliances can reduce significant amounts of energy use. For this purpose, optimization techniques focusing mainly on energy reduction are used. Current optimization techniques somewhat reduce energy use but overlook user convenience, which was the main goal of introducing home appliances. Therefore, there is a need for an optimization method that effectively addresses the trade-off between energy saving and user convenience. Current optimization techniques should include weather metrics other than temperature and humidity to effectively optimize the energy cost of controlling the desired indoor setting of a smart home for the user. This research work involves an optimization technique that addresses the trade-off between energy saving and user convenience, including the use of air pressure, dew point, and wind speed. To test the optimization, a hybrid approach utilizing GWO and PSO was modeled. This work involved enabling proactive energy optimization using appliance energy prediction. An LSTM model was designed to test the appliances' energy predictions. Through predictions and optimized control, smart home appliances could be proactively and effectively controlled. First, we evaluated the RMSE score of the predictive model and found that the proposed model results in low RMSE values. Second, we conducted several simulations and found the proposed optimization results to provide energy cost savings used in appliance control to regulate the desired indoor setting of the smart home. Energy cost reduction goals using the optimization strategies were evaluated for seasonal and monthly patterns of data for result verification. Hence, the proposed work is considered a better candidate solution for proactively optimizing the energy of smart homes.
Collapse
Affiliation(s)
- Asif Mehmood
- Smart Information Technology Engineering Department, Kongju National University, Cheonan 31080, Republic of Korea;
| | - Kyu-Tae Lee
- Smart Information Technology Engineering Department, Kongju National University, Cheonan 31080, Republic of Korea;
| | - Do-Hyeun Kim
- Computer Engineering Department, Jeju National University, Jeju 63243, Republic of Korea
| |
Collapse
|
20
|
Shastry KA, Sattar SA. Logistic random forest boosting technique for Alzheimer’s diagnosis. INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY 2023; 15:1719-1731. [PMID: 37056794 PMCID: PMC9983513 DOI: 10.1007/s41870-023-01187-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/31/2022] [Accepted: 02/16/2023] [Indexed: 03/06/2023]
Abstract
Alzheimer's disease (AD) is a common and well-known neurodegenerative condition that causes cognitive impairment. In the field of medicine, it is the "nervous system" disorder that has received the most attention. Despite this extensive research, there is no treatment or strategy to slow or stop its spread. Nevertheless, there are a variety of options (medication and non-medication alternatives) that may aid in the treatment of AD symptoms at their various phases, thereby enhancing the patient's quality of life. As AD advances over time, it is necessary to treat patients at their various stages appropriately. As a result, detecting and classifying AD phases prior to symptom treatment can be beneficial. Approximately twenty years ago, the rate of progress in the field of machine learning (ML) accelerated dramatically. Using ML methods, this study focuses on early AD identification. The "Alzheimer's Disease Neuroimaging Initiative" (ADNI) dataset was subjected to exhaustive testing for AD identification. The purpose was to classify the dataset into three groups: AD, "Cognitive Normal" (CN), and "Late Mild Cognitive Impairment" (LMCI). In this paper, we present the ensemble model Logistic Random Forest Boosting (LRFB), representing the ensemble of “Logistic Regression” (LR), “Random Forest” (RF), and “Gradient Boost” (GB). The proposed LRFB outperformed LR, RF, GB, “k-Nearest Neighbour” (k-NN), “Multi-Layer Perceptron” (MLP), “Support Vector Machine” (SVM), “AdaBoost” (AB), “Naïve Bayes” (NB), “XGBoost” (XGB), “Decision Tree” (DT), and other ensemble ML models with respect to the performance metrics “Accuracy” (Acc), “Recall” (Rec), “Precision” (Prec), and “F1-Score” (FS).
Collapse
|
21
|
Identifying two distinct subphenotypes of patent ductus arteriosus in preterm infants using machine learning. Eur J Pediatr 2023; 182:2173-2179. [PMID: 36853570 DOI: 10.1007/s00431-023-04882-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/24/2023] [Revised: 02/09/2023] [Accepted: 02/15/2023] [Indexed: 03/01/2023]
Abstract
To use unsupervised machine learning to identify potential subphenotypes of preterm infants with patent ductus arteriosus (PDA). The study was conducted retrospectively at a neonatal intensive care unit in Brazil. Patients with a gestational age < 28 weeks who had undergone at least one echocardiogram within the first two weeks of life and had PDA size > 1.5 or LA/AO ratio > 1.5 were included. Agglomerative hierarchical clustering on principal components was used to divide the data into different clusters based on common characteristics. Two distinct subphenotypes of preterm infants with hemodynamically significant PDA were identified: "inflamed," characterized by high leukocyte, neutrophil, and neutrophil-to-lymphocyte ratio, and "respiratory acidosis," characterized by low pH and high pCO2 levels. Conclusions: This study suggests that there may be two distinct subphenotypes of preterm infants with hemodynamically significant PDA: "inflamed" and "respiratory acidosis." By dividing the population into different subgroups based on common characteristics, it is possible to get a more nuanced understanding of the effectiveness of PDA interventions. What is Known: • Treatment of PDA in preterm infants has been controversial. • Stratification of preterm infants with PDA into subgroups is important in order to determine the best treatment. What is New: • Unsupervised machine learning was used to identify two subphenotypes of preterm infants with hemodynamically significant PDA. • The 'inflamed' cluster was characterized by higher values of leukocyte, neutrophil, and neutrophil-to-lymphocyte ratio. The 'respiratory acidosis' cluster was characterized by lower pH values and higher pCO2 values.
Collapse
|
22
|
The Increase of Theta Power and Decrease of Alpha/Theta Ratio as a Manifestation of Cognitive Impairment in Parkinson's Disease. J Clin Med 2023; 12:jcm12041569. [PMID: 36836103 PMCID: PMC9965386 DOI: 10.3390/jcm12041569] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2022] [Revised: 02/06/2023] [Accepted: 02/13/2023] [Indexed: 02/18/2023] Open
Abstract
In this study, we aim to assess and examine cognitive functions in Parkinson's Disease patients using EEG recordings, with a central focus on characteristics associated with a cognitive decline. Based on neuropsychological evaluation using Mini-Mental State Examination, Montreal Cognitive Assessment, and Addenbrooke's Cognitive Examination-III, 98 participants were divided into three cognitive groups. All the particpants of the study underwent EEG recordings with spectral analysis. The results revealed an increase in the absolute theta power in patients with Parkinson's disease dementia (PD-D) compared to cognitively normal status (PD-CogN, p=0.00997) and a decrease in global relative beta power in PD-D compared to PD-CogN (p=0.0413). An increase in theta relative power in the left temporal region (p=0.0262), left occipital region (p=0.0109), and right occipital region (p=0.0221) were observed in PD-D compared to PD-N. The global alpha/theta ratio and global power spectral ratio significantly decreased in PD-D compared to PD-N (p = 0.001). In conclusion, the increase in relative theta power and the decrease in relative beta power are characteristic changes in EEG recordings in PD patients with cognitive impairment. Identifying these changes can be a useful biomarker and a complementary tool in the neuropsychological diagnosis of cognitive impairment in Parkinson's Disease.
Collapse
|
23
|
Ahmed B, Haque MA, Iquebal MA, Jaiswal S, Angadi UB, Kumar D, Rai A. DeepAProt: Deep learning based abiotic stress protein sequence classification and identification tool in cereals. FRONTIERS IN PLANT SCIENCE 2023; 13:1008756. [PMID: 36714750 PMCID: PMC9877618 DOI: 10.3389/fpls.2022.1008756] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/01/2022] [Accepted: 11/14/2022] [Indexed: 06/18/2023]
Abstract
The impact of climate change has been alarming for the crop growth. The extreme weather conditions can stress the crops and reduce the yield of major crops belonging to Poaceae family too, that sustains 50% of the world's food calorie and 20% of protein intake. Computational approaches, such as artificial intelligence-based techniques have become the forefront of prediction-based data interpretation and plant stress responses. In this study, we proposed a novel activation function, namely, Gaussian Error Linear Unit with Sigmoid (SIELU) which was implemented in the development of a Deep Learning (DL) model along with other hyper parameters for classification of unknown abiotic stress protein sequences from crops of Poaceae family. To develop this models, data pertaining to four different abiotic stress (namely, cold, drought, heat and salinity) responsive proteins of the crops belonging to poaceae family were retrieved from public domain. It was observed that efficiency of the DL models with our proposed novel SIELU activation function outperformed the models as compared to GeLU activation function, SVM and RF with 95.11%, 80.78%, 94.97%, and 81.69% accuracy for cold, drought, heat and salinity, respectively. Also, a web-based tool, named DeepAProt (http://login1.cabgrid.res.in:5500/) was developed using flask API, along with its mobile app. This server/App will provide researchers a convenient tool, which is rapid and economical in identification of proteins for abiotic stress management in crops Poaceae family, in endeavour of higher production for food security and combating hunger, ensuring UN SDG goal 2.0.
Collapse
Affiliation(s)
- Bulbul Ahmed
- Division of Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - Md Ashraful Haque
- Division of Computer Application, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - Mir Asif Iquebal
- Division of Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - Sarika Jaiswal
- Division of Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - U. B. Angadi
- Division of Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - Dinesh Kumar
- Division of Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
- Department of Biotechnology, School of Interdisciplinary and Applied Sciences, Central University of Haryana, Mahendergarh, Haryana, India
| | - Anil Rai
- Division of Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| |
Collapse
|
24
|
Hasnul MA, Ab. Aziz NA, Abd. Aziz A. Augmenting ECG Data with Multiple Filters for a Better Emotion Recognition System. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING 2023; 48:1-22. [PMID: 36685996 PMCID: PMC9838506 DOI: 10.1007/s13369-022-07585-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/24/2022] [Accepted: 12/18/2022] [Indexed: 01/13/2023]
Abstract
A physiological-based emotion recognition system (ERS) with a unimodal approach such as an electrocardiogram (ECG) is not as popular compared to a multimodal approach. However, a single modality has the advantage of lower development and computational cost. Therefore, this study focuses on a unimodal ECG-based ERS. The ECG-based ERS has the potential to become the next mass-adopted consumer application due to the wide availability of wearable and mobile ECG devices in the market. Currently, ECG-inclusive affective datasets are limited, and many of the existing datasets have small sample sizes. Hence, ECG-based ERS studies are stunted by the lack of quality data. A novel multi-filtering augmentation technique is proposed here to increase the sample size of the ECG data. This technique augments the ECG signals by cleaning the data in different ways. Three small ECG datasets labelled according to emotion state are used in this study. The benefit of the proposed augmentation techniques is measured using the classification accuracy of five machine learning algorithms; k-nearest neighbours (KNN), support vector machine, decision tree, random forest and multilayer perceptron. The results show that with the proposed technique, there is a significant improvement in performance for all the datasets and classifiers. KNN classifier improved the most with the augmented data and the reported classification accuracies of over 90%.
Collapse
Affiliation(s)
| | - Nor Azlina Ab. Aziz
- Faculty of Engineering and Technology, Multimedia University, Melaka, Malaysia
| | - Azlan Abd. Aziz
- Faculty of Engineering and Technology, Multimedia University, Melaka, Malaysia
| |
Collapse
|
25
|
Kanyongo W, Ezugwu AE. Machine learning approaches to medication adherence amongst NCD patients: A systematic literature review. INFORMATICS IN MEDICINE UNLOCKED 2023. [DOI: 10.1016/j.imu.2023.101210] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/11/2023] Open
|
26
|
Alam MS, Rashid MM, Roy R, Faizabadi AR, Gupta KD, Ahsan MM. Empirical Study of Autism Spectrum Disorder Diagnosis Using Facial Images by Improved Transfer Learning Approach. Bioengineering (Basel) 2022; 9:710. [PMID: 36421111 PMCID: PMC9687350 DOI: 10.3390/bioengineering9110710] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2022] [Revised: 11/09/2022] [Accepted: 11/14/2022] [Indexed: 09/29/2023] Open
Abstract
Autism spectrum disorder (ASD) is a neurological illness characterized by deficits in cognition, physical activities, and social skills. There is no specific medication to treat this illness; only early intervention can improve brain functionality. Since there is no medical test to identify ASD, a diagnosis might be challenging. In order to determine a diagnosis, doctors consider the child's behavior and developmental history. The human face can be used as a biomarker as it is one of the potential reflections of the brain and thus can be used as a simple and handy tool for early diagnosis. This study uses several deep convolutional neural network (CNN)-based transfer learning approaches to detect autistic children using the facial image. An empirical study is conducted to select the best optimizer and set of hyperparameters to achieve better prediction accuracy using the CNN model. After training and validating with the optimized setting, the modified Xception model demonstrates the best performance by achieving an accuracy of 95% on the test set, whereas the VGG19, ResNet50V2, MobileNetV2, and EfficientNetB0 achieved 86.5%, 94%, 92%, and 85.8%, accuracy, respectively. Our preliminary computational results demonstrate that our transfer learning approaches outperformed existing methods. Our modified model can be employed to assist doctors and practitioners in validating their initial screening to detect children with ASD disease.
Collapse
Affiliation(s)
- Md Shafiul Alam
- Department of Mechatronics Engineering, International Islamic University Malaysia, Kula Lumpur 43200, Malaysia
| | - Muhammad Mahbubur Rashid
- Department of Mechatronics Engineering, International Islamic University Malaysia, Kula Lumpur 43200, Malaysia
| | - Rupal Roy
- Department of Mechatronics Engineering, International Islamic University Malaysia, Kula Lumpur 43200, Malaysia
| | - Ahmed Rimaz Faizabadi
- Department of Mechatronics Engineering, International Islamic University Malaysia, Kula Lumpur 43200, Malaysia
| | - Kishor Datta Gupta
- Computer and Information Science, Clark Atlanta University, Atlanta, GA 30314, USA
| | - Md Manjurul Ahsan
- School of Industrial and Systems Engineering, University of Oklahoma, Norman, OK 73019, USA
| |
Collapse
|
27
|
Paepae T, Bokoro PN, Kyamakya K. A Virtual Sensing Concept for Nitrogen and Phosphorus Monitoring Using Machine Learning Techniques. SENSORS (BASEL, SWITZERLAND) 2022; 22:7338. [PMID: 36236438 PMCID: PMC9572788 DOI: 10.3390/s22197338] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/21/2022] [Revised: 09/20/2022] [Accepted: 09/24/2022] [Indexed: 06/16/2023]
Abstract
Harmful cyanobacterial bloom (HCB) is problematic for drinking water treatment, and some of its strains can produce toxins that significantly affect human health. To better control eutrophication and HCB, catchment managers need to continuously keep track of nitrogen (N) and phosphorus (P) in the water bodies. However, the high-frequency monitoring of these water quality indicators is not economical. In these cases, machine learning techniques may serve as viable alternatives since they can learn directly from the available surrogate data. In the present work, a random forest, extremely randomized trees (ET), extreme gradient boosting, k-nearest neighbors, a light gradient boosting machine, and bagging regressor-based virtual sensors were used to predict N and P in two catchments with contrasting land uses. The effect of data scaling and missing value imputation were also assessed, while the Shapley additive explanations were used to rank feature importance. A specification book, sensitivity analysis, and best practices for developing virtual sensors are discussed. Results show that ET, MinMax scaler, and a multivariate imputer were the best predictive model, scaler, and imputer, respectively. The highest predictive performance, reported in terms of R2, was 97% in the rural catchment and 82% in an urban catchment.
Collapse
Affiliation(s)
- Thulane Paepae
- Department of Electrical and Electronic Engineering Technology, University of Johannesburg, Doornfontein 2028, South Africa
| | - Pitshou N. Bokoro
- Department of Electrical and Electronic Engineering Technology, University of Johannesburg, Doornfontein 2028, South Africa
| | - Kyandoghere Kyamakya
- Institute for Smart Systems Technologies, Transportation Informatics, Alpen-Adria Universität Klagenfurt, 9020 Klagenfurt, Austria
| |
Collapse
|
28
|
Skin cancer diagnosis based on deep transfer learning and sparrow search algorithm. Neural Comput Appl 2022. [DOI: 10.1007/s00521-022-07762-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Abstract
AbstractSkin cancer affects the lives of millions of people every year, as it is considered the most popular form of cancer. In the USA alone, approximately three and a half million people are diagnosed with skin cancer annually. The survival rate diminishes steeply as the skin cancer progresses. Despite this, it is an expensive and difficult procedure to discover this cancer type in the early stages. In this study, a threshold-based automatic approach for skin cancer detection, classification, and segmentation utilizing a meta-heuristic optimizer named sparrow search algorithm (SpaSA) is proposed. Five U-Net models (i.e., U-Net, U-Net++, Attention U-Net, V-net, and Swin U-Net) with different configurations are utilized to perform the segmentation process. Besides this, the meta-heuristic SpaSA optimizer is used to perform the optimization of the hyperparameters using eight pre-trained CNN models (i.e., VGG16, VGG19, MobileNet, MobileNetV2, MobileNetV3Large, MobileNetV3Small, NASNetMobile, and NASNetLarge). The dataset is gathered from five public sources in which two types of datasets are generated (i.e., 2-classes and 10-classes). For the segmentation, concerning the “skin cancer segmentation and classification” dataset, the best reported scores by U-Net++ with DenseNet201 as a backbone architecture are 0.104, $$94.16\%$$
94.16
%
, $$91.39\%$$
91.39
%
, $$99.03\%$$
99.03
%
, $$96.08\%$$
96.08
%
, $$96.41\%$$
96.41
%
, $$77.19\%$$
77.19
%
, $$75.47\%$$
75.47
%
in terms of loss, accuracy, F1-score, AUC, IoU, dice, hinge, and squared hinge, respectively, while for the “PH2” dataset, the best reported scores by the Attention U-Net with DenseNet201 as backbone architecture are 0.137, $$94.75\%$$
94.75
%
, $$92.65\%$$
92.65
%
, $$92.56\%$$
92.56
%
, $$92.74\%$$
92.74
%
, $$96.20\%$$
96.20
%
, $$86.30\%$$
86.30
%
, $$92.65\%$$
92.65
%
, $$69.28\%$$
69.28
%
, and $$68.04\%$$
68.04
%
in terms of loss, accuracy, F1-score, precision, sensitivity, specificity, IoU, dice, hinge, and squared hinge, respectively. For the “ISIC 2019 and 2020 Melanoma” dataset, the best reported overall accuracy from the applied CNN experiments is $$98.27\%$$
98.27
%
by the MobileNet pre-trained model. Similarly, for the “Melanoma Classification (HAM10K)” dataset, the best reported overall accuracy from the applied CNN experiments is $$98.83\%$$
98.83
%
by the MobileNet pre-trained model. For the “skin diseases image” dataset, the best reported overall accuracy from the applied CNN experiments is $$85.87\%$$
85.87
%
by the MobileNetV2 pre-trained model. After computing the results, the suggested approach is compared with 13 related studies.
Collapse
|
29
|
Adams J, Agyenkwa-Mawuli K, Agyapong O, Wilson MD, Kwofie SK. EBOLApred: A machine learning-based web application for predicting cell entry inhibitors of the Ebola virus. Comput Biol Chem 2022; 101:107766. [DOI: 10.1016/j.compbiolchem.2022.107766] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2022] [Revised: 08/10/2022] [Accepted: 08/29/2022] [Indexed: 11/03/2022]
|
30
|
Accurate Numerical Treatment on a Stochastic SIR Epidemic Model with Optimal Control Strategy. TECHNOLOGIES 2022. [DOI: 10.3390/technologies10040082] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
In this paper, a numerical study has been undertaken on the susceptible-infected-recovered (SIR) epidemic model that encompasses the mechanisms of the evolution of disease transmission; a prophylactic vaccination strategy in the susceptible populations, depending on the infective individuals. We furnish numerical and graphical simulation combined with explicit series solutions of the proposed model using the New Iterative Method (NIM) and Modified New Iterative Method (MNIM). The analytic-numeric New Iterative Method failed to deliver accurate solution for the large time domain. A new reliable algorithm based on NIM, the coupling of the Laplace transforms, and the New Iterative method is called Modified New Iterative Method (MNIM) which is presented to enhance the validity domain of NIM techniques. The convergence analysis of the MNIM has also been illustrated. The simulation results show that the vaccination strategy can slow down the spread of the epidemic rapidly. Numerical results illustrate the excellent performance of the MNIM and show that the modified method is much more accurate than the NIM.
Collapse
|
31
|
A Two-Step Data Normalization Approach for Improving Classification Accuracy in the Medical Diagnosis Domain. MATHEMATICS 2022. [DOI: 10.3390/math10111942] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
Data normalization is a data preprocessing task and one of the first to be performed during intellectual analysis, particularly in the case of tabular data. The importance of its implementation is determined by the need to reduce the sensitivity of the artificial intelligence model to the values of the features in the dataset to increase the studied model’s adequacy. This paper focuses on the problem of effectively preprocessing data to improve the accuracy of intellectual analysis in the case of performing medical diagnostic tasks. We developed a new two-step method for data normalization of numerical medical datasets. It is based on the possibility of considering both the interdependencies between the features of each observation from the dataset and their absolute values to improve the accuracy when performing medical data mining tasks. We describe and substantiate each step of the algorithmic implementation of the method. We also visualize the results of the proposed method. The proposed method was modeled using six different machine learning methods based on decision trees when performing binary and multiclass classification tasks. We used six real-world, freely available medical datasets with different numbers of vectors, attributes, and classes to conduct experiments. A comparison between the effectiveness of the developed method and that of five existing data normalization methods was carried out. It was experimentally established that the developed method increases the accuracy of the Decision Tree and Extra Trees Classifier by 1–5% in the case of performing the binary classification task and the accuracy of the Bagging, Decision Tree, and Extra Trees Classifier by 1–6% in the case of performing the multiclass classification task. Increasing the accuracy of these classifiers only by using the new data normalization method satisfies all the prerequisites for its application in practice when performing various medical data mining tasks.
Collapse
|
32
|
Ahsan MM, Siddique Z. Machine learning-based heart disease diagnosis: A systematic literature review. Artif Intell Med 2022; 128:102289. [DOI: 10.1016/j.artmed.2022.102289] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2021] [Accepted: 03/22/2022] [Indexed: 01/01/2023]
|
33
|
A Modified Iterative Algorithm for Numerical Investigation of HIV Infection Dynamics. ALGORITHMS 2022. [DOI: 10.3390/a15050175] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
The human immunodeficiency virus (HIV) mainly attacks CD4+ T cells in the host. Chronic HIV infection gradually depletes the CD4+ T cell pool, compromising the host’s immunological reaction to invasive infections and ultimately leading to acquired immunodeficiency syndrome (AIDS). The goal of this study is not to provide a qualitative description of the rich dynamic characteristics of the HIV infection model of CD4+ T cells, but to produce accurate analytical solutions to the model using the modified iterative approach. In this research, a new efficient method using the new iterative method (NIM), the coupling of the standard NIM and Laplace transform, called the modified new iterative method (MNIM), has been introduced to resolve the HIV infection model as a class of system of ordinary differential equations (ODEs). A nonlinear HIV infection dynamics model is adopted as an instance to elucidate the identification process and the solution process of MNIM, only two iterations lead to ideal results. In addition, the model has also been solved using NIM and the fourth order Runge–Kutta (RK4) method. The results indicate that the solutions by MNIM match with those of RK4 method to a minimum of eight decimal places, whereas NIM solutions are not accurate enough. Numerical comparisons between the MNIM, NIM, the classical RK4 and other methods reveal that the modified technique has potential as a tool for the nonlinear systems of ODEs.
Collapse
|
34
|
Sensor Screening Methodology for Virtually Sensing Transmission Input Loads of a Wind Turbine Using Machine Learning Techniques and Drivetrain Simulations. SENSORS 2022; 22:s22103659. [PMID: 35632067 PMCID: PMC9145404 DOI: 10.3390/s22103659] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/31/2022] [Revised: 04/28/2022] [Accepted: 05/09/2022] [Indexed: 12/05/2022]
Abstract
The ongoing trend of building larger wind turbines (WT) to reach greater economies of scale is contributing to the reduction in cost of wind energy, as well as the increase in WT drivetrain input loads into uncharted territories. The resulting intensification of the load situation within the WT gearbox motivates the need to monitor WT transmission input loads. However, due to the high costs of direct measurement solutions, more economical solutions, such as virtual sensing of transmission input loads using stationary sensors mounted on the gearbox housing or other drivetrain locations, are of interest. As the number, type, and location of sensors needed for a virtual sensing solutions can vary considerably in cost, in this investigation, we aimed to identify optimal sensor locations for virtually sensing WT 6-degree of freedom (6-DOF) transmission input loads. Random forest (RF) models were designed and applied to a dataset containing simulated operational data of a Vestas V52 WT multibody simulation model undergoing simulated wind fields. The dataset contained the 6-DOF transmission input loads and signals from potential sensor locations covering deformations, misalignments, and rotational speeds at various drivetrain locations. The RF models were used to identify the sensor locations with the highest impact on accuracy of virtual load sensing following a known statistical test in order to prioritize and reduce the number of needed input signals. The performance of the models was assessed before and after reducing the number of input signals required. By allowing for a screening of sensors prior to real-world tests, the results demonstrate the high promise of the proposed method for optimizing the cost of future virtual WT transmission load sensors.
Collapse
|
35
|
A Comparative Analysis on Suicidal Ideation Detection Using NLP, Machine, and Deep Learning. TECHNOLOGIES 2022. [DOI: 10.3390/technologies10030057] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Social networks are essential resources to obtain information about people’s opinions and feelings towards various issues as they share their views with their friends and family. Suicidal ideation detection via online social network analysis has emerged as an essential research topic with significant difficulties in the fields of NLP and psychology in recent years. With the proper exploitation of the information in social media, the complicated early symptoms of suicidal ideations can be discovered and hence, it can save many lives. This study offers a comparative analysis of multiple machine learning and deep learning models to identify suicidal thoughts from the social media platform Twitter. The principal purpose of our research is to achieve better model performance than prior research works to recognize early indications with high accuracy and avoid suicide attempts. We applied text pre-processing and feature extraction approaches such as CountVectorizer and word embedding, and trained several machine learning and deep learning models for such a goal. Experiments were conducted on a dataset of 49,178 instances retrieved from live tweets by 18 suicidal and non-suicidal keywords using Python Tweepy API. Our experimental findings reveal that the RF model can achieve the highest classification score among machine learning algorithms, with an accuracy of 93% and an F1 score of 0.92. However, training the deep learning classifiers with word embedding increases the performance of ML models, where the BiLSTM model reaches an accuracy of 93.6% and a 0.93 F1 score.
Collapse
|
36
|
A Multi-Step Time-Series Clustering-Based Seq2Seq LSTM Learning for a Single Household Electricity Load Forecasting. ENERGIES 2022. [DOI: 10.3390/en15072623] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/04/2022]
Abstract
The deep learning (DL) approaches in smart grid (SG) describes the possibility of shifting the energy industry into a modern era of reliable and sustainable energy networks. This paper proposes a time-series clustering framework with multi-step time-series sequence to sequence (Seq2Seq) long short-term memory (LSTM) load forecasting strategy for households. Specifically, we investigate a clustering-based Seq2Seq LSTM electricity load forecasting model to undertake an energy load forecasting problem, where information input to the model contains individual appliances and aggregate energy as historical data of households. The original dataset is preprocessed, and forwarded to a multi-step time-series learning model which reduces the training time and guarantees convergence for energy forecasting. Furthermore, simulation results show the accuracy performance of the proposed model by validation and testing cluster data, which shows a promising potential of the proposed predictive model.
Collapse
|
37
|
Ahsan MM, Luna SA, Siddique Z. Machine-Learning-Based Disease Diagnosis: A Comprehensive Review. Healthcare (Basel) 2022; 10:541. [PMID: 35327018 PMCID: PMC8950225 DOI: 10.3390/healthcare10030541] [Citation(s) in RCA: 42] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2022] [Revised: 03/08/2022] [Accepted: 03/10/2022] [Indexed: 02/06/2023] Open
Abstract
Globally, there is a substantial unmet need to diagnose various diseases effectively. The complexity of the different disease mechanisms and underlying symptoms of the patient population presents massive challenges in developing the early diagnosis tool and effective treatment. Machine learning (ML), an area of artificial intelligence (AI), enables researchers, physicians, and patients to solve some of these issues. Based on relevant research, this review explains how machine learning (ML) is being used to help in the early identification of numerous diseases. Initially, a bibliometric analysis of the publication is carried out using data from the Scopus and Web of Science (WOS) databases. The bibliometric study of 1216 publications was undertaken to determine the most prolific authors, nations, organizations, and most cited articles. The review then summarizes the most recent trends and approaches in machine-learning-based disease diagnosis (MLBDD), considering the following factors: algorithm, disease types, data type, application, and evaluation metrics. Finally, in this paper, we highlight key results and provides insight into future trends and opportunities in the MLBDD area.
Collapse
Affiliation(s)
- Md Manjurul Ahsan
- School of Industrial and Systems Engineering, University of Oklahoma, Norman, OK 73019, USA
| | - Shahana Akter Luna
- Medicine & Surgery, Dhaka Medical College & Hospital, Dhaka 1000, Bangladesh;
| | - Zahed Siddique
- Department of Aerospace and Mechanical Engineering, University of Oklahoma, Norman, OK 73019, USA;
| |
Collapse
|
38
|
Abstract
Context: Energy utilization is one of the most closely related factors affecting many areas of the smart farm, plant growth, crop production, device automation, and energy supply to the same degree. Recently, 4th industrial revolution technologies such as IoT, artificial intelligence, and big data have been widely used in smart farm environments to efficiently use energy and control smart farms’ conditions. In particular, machine learning technologies with big data analysis are actively used as one of the most potent prediction methods supporting energy use in the smart farm. Purpose: This study proposes a machine learning-based prediction model for peak energy use by analyzing energy-related data collected from various environmental and growth devices in a smart paprika farm of the Jeonnam Agricultural Research and Extension Service in South Korea between 2019 and 2021. Scientific method: To find out the most optimized prediction model, comparative evaluation tests are performed using representative ML algorithms such as artificial neural network, support vector regression, random forest, K-nearest neighbors, extreme gradient boosting and gradient boosting machine, and time series algorithm ARIMA with binary classification for a different number of input features. Validate: This article can provide an effective and viable way for smart farm managers or greenhouse farmers who can better manage the problem of agricultural energy economically and environmentally. Therefore, we hope that the recommended ML method will help improve the smart farm’s energy use or their energy policies in various fields related to agricultural energy. Conclusion: The seven performance metrics including R-squared, root mean squared error, and mean absolute error, are associated with these two algorithms. It is concluded that the RF-based model is more successful than in the pre-others diction accuracy of 92%. Therefore, the proposed model may be contributed to the development of various applications for environment energy usage in a smart farm, such as a notification service for energy usage peak time or an energy usage control for each device.
Collapse
|
39
|
Abstract
Nowadays, many cities have problems with traffic congestion at certain peak hours, which produces more pollution, noise and stress for citizens. Neural networks (NN) and machine-learning (ML) approaches are increasingly used to solve real-world problems, overcoming analytical and statistical methods, due to their ability to deal with dynamic behavior over time and with a large number of parameters in massive data. In this paper, machine-learning (ML) and deep-learning (DL) algorithms are proposed for predicting traffic flow at an intersection, thus laying the groundwork for adaptive traffic control, either by remote control of traffic lights or by applying an algorithm that adjusts the timing according to the predicted flow. Therefore, this work only focuses on traffic flow prediction. Two public datasets are used to train, validate and test the proposed ML and DL models. The first one contains the number of vehicles sampled every five minutes at six intersections for 56 days using different sensors. For this research, four of the six intersections are used to train the ML and DL models. The Multilayer Perceptron Neural Network (MLP-NN) obtained better results (R-Squared and EV score of 0.93) and took less training time, followed closely by Gradient Boosting then Recurrent Neural Networks (RNNs), with good metrics results but the longer training time, and finally Random Forest, Linear Regression and Stochastic Gradient. All ML and DL algorithms scored good performance metrics, indicating that they are feasible for implementation on smart traffic light controllers.
Collapse
|
40
|
Winston L, McCann M, Onofrei G. ‘Exploring socioeconomic status as a global determinant of COVID-19 prevalence, using statistical, exploratory data analytic, and supervised machine learning techniques.’ (Preprint). JMIR Form Res 2021; 6:e35114. [PMID: 36001798 PMCID: PMC9518652 DOI: 10.2196/35114] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2021] [Revised: 04/12/2022] [Accepted: 04/27/2022] [Indexed: 11/21/2022] Open
Abstract
Background The COVID-19 pandemic represents the most unprecedented global challenge in recent times. As the global community attempts to manage the pandemic in the long term, it is pivotal to understand what factors drive prevalence rates and to predict the future trajectory of the virus. Objective This study had 2 objectives. First, it tested the statistical relationship between socioeconomic status and COVID-19 prevalence. Second, it used machine learning techniques to predict cumulative COVID-19 cases in a multicountry sample of 182 countries. Taken together, these objectives will shed light on socioeconomic status as a global risk factor of the COVID-19 pandemic. Methods This research used exploratory data analysis and supervised machine learning methods. Exploratory analysis included variable distribution, variable correlations, and outlier detection. Following this, the following 3 supervised regression techniques were applied: linear regression, random forest, and adaptive boosting (AdaBoost). Results were evaluated using k-fold cross-validation and subsequently compared to analyze algorithmic suitability. The analysis involved 2 models. First, the algorithms were trained to predict 2021 COVID-19 prevalence using only 2020 reported case data. Following this, socioeconomic indicators were added as features and the algorithms were trained again. The Human Development Index (HDI) metrics of life expectancy, mean years of schooling, expected years of schooling, and gross national income were used to approximate socioeconomic status. Results All variables correlated positively with the 2021 COVID-19 prevalence, with R2 values ranging from 0.55 to 0.85. Using socioeconomic indicators, COVID-19 prevalence was predicted with a reasonable degree of accuracy. Using 2020 reported case rates as a lone predictor to predict 2021 prevalence rates, the average predictive accuracy of the algorithms was low (R2=0.543). When socioeconomic indicators were added alongside 2020 prevalence rates as features, the average predictive performance improved considerably (R2=0.721) and all error statistics decreased. Thus, adding socioeconomic indicators alongside 2020 reported case data optimized the prediction of COVID-19 prevalence to a considerable degree. Linear regression was the strongest learner with R2=0.693 on the first model and R2=0.763 on the second model, followed by random forest (0.481 and 0.722) and AdaBoost (0.454 and 0.679). Following this, the second model was retrained using a selection of additional COVID-19 risk factors (population density, median age, and vaccination uptake) instead of the HDI metrics. However, average accuracy dropped to 0.649, which highlights the value of socioeconomic status as a predictor of COVID-19 cases in the chosen sample. Conclusions The results show that socioeconomic status is an important variable to consider in future epidemiological modeling, and highlights the reality of the COVID-19 pandemic as a social phenomenon and a health care phenomenon. This paper also puts forward new considerations about the application of statistical and machine learning techniques to understand and combat the COVID-19 pandemic.
Collapse
Affiliation(s)
- Luke Winston
- Department of Computing, Atlantic Technological University, Letterkenny, Ireland
| | - Michael McCann
- Department of Computing, Atlantic Technological University, Letterkenny, Ireland
| | - George Onofrei
- Department of Business, Atlantic Technological University, Letterkenny, Ireland
| |
Collapse
|
41
|
Bakhshian S, Romanak K. DeepSense: A Physics-Guided Deep Learning Paradigm for Anomaly Detection in Soil Gas Data at Geologic CO 2 Storage Sites. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2021; 55:15531-15541. [PMID: 34694136 DOI: 10.1021/acs.est.1c04048] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Driven by the collection of enormous amounts of streaming data from sensors, and with the emergence of the internet of things, the need for developing robust detection techniques to identify data anomalies has increased recently. The algorithms for anomaly detection are required to be selected based on the type of data. In this study, we propose a predictive anomaly detection technique, DeepSense, which is applied to soil gas concentration data acquired from sensors being used for environmental characterization at a prospective CO2 storage site in Queensland, Australia. DeepSense takes advantage of deep-learning algorithms as its predictor module and uses a process-based soil gas method as the basis of its anomaly detector module. The proposed predictor framework leverages the power of convolutional neural network algorithms for feature extraction and simultaneously captures the long-term temporal dependency through long short-term memory algorithms. The proposed process-based anomaly detection method is a cost-effective alternative to the conventional concentration-based soil gas methodologies which rely on long-term baseline surveys for defining the threshold level. The results indicate that the proposed framework performs well in diagnosing anomalous data in soil gas concentration data streams. The robustness and efficacy of the DeepSense were verified against data sets acquired from different monitoring stations of the storage site.
Collapse
Affiliation(s)
- Sahar Bakhshian
- Bureau of Economic Geology, Jackson School of Geosciences, The University of Texas at Austin, Austin, Texas 78758-4445, United States
| | - Katherine Romanak
- Bureau of Economic Geology, Jackson School of Geosciences, The University of Texas at Austin, Austin, Texas 78758-4445, United States
| |
Collapse
|
42
|
Choi JM, Seo SY, Kim PJ, Kim YS, Lee SH, Sohn JH, Kim DK, Lee JJ, Kim C. Prediction of Hemorrhagic Transformation after Ischemic Stroke Using Machine Learning. J Pers Med 2021; 11:863. [PMID: 34575640 PMCID: PMC8470833 DOI: 10.3390/jpm11090863] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2021] [Revised: 08/25/2021] [Accepted: 08/28/2021] [Indexed: 12/27/2022] Open
Abstract
Hemorrhagic transformation (HT) is one of the leading causes of a poor prognostic marker after acute ischemic stroke (AIS). We compared the performances of the several machine learning (ML) algorithms to predict HT after AIS using only structured data. A total of 2028 patients with AIS, who were admitted within seven days of symptoms onset, were included in this analysis. HT was defined based on the criteria of the European Co-operative Acute Stroke Study-II trial. The whole dataset was randomly divided into a training and a test dataset with a 7:3 ratio. Binary logistic regression, support vector machine, extreme gradient boosting, and artificial neural network (ANN) algorithms were used to assess the performance of predicting the HT occurrence after AIS. Five-fold cross validation and a grid search technique were used to optimize the hyperparameters of each ML model, which had its performance measured by the area under the receiver operating characteristic (AUROC) curve. Among the included AIS patients, the mean age and number of male subjects were 69.6 years and 1183 (58.3%), respectively. HT was observed in 318 subjects (15.7%). There were no significant differences in corresponding variables between the training and test dataset. Among all the ML algorithms, the ANN algorithm showed the best performance in terms of predicting the occurrence of HT in our dataset (0.844). Feature scaling including standardization and normalization, and the resampling strategy showed no additional improvement of the ANN's performance. The ANN-based prediction of HT after AIS showed better performance than the conventional ML algorithms. Deep learning may be used to predict important outcomes for structured data-based prediction.
Collapse
Affiliation(s)
- Jeong-Myeong Choi
- Department of Convergence Software, Hallym University, Chuncheon 24252, Korea; (J.-M.C.); (S.-Y.S.); (Y.-S.K.)
| | - Soo-Young Seo
- Department of Convergence Software, Hallym University, Chuncheon 24252, Korea; (J.-M.C.); (S.-Y.S.); (Y.-S.K.)
| | - Pum-Jun Kim
- Institute of New Frontier Research Team, Hallym University College of Medicine, Chuncheon 24252, Korea; (P.-J.K.); (S.-H.L.); (J.-H.S.); (D.-K.K.); (J.-J.L.)
| | - Yu-Seop Kim
- Department of Convergence Software, Hallym University, Chuncheon 24252, Korea; (J.-M.C.); (S.-Y.S.); (Y.-S.K.)
| | - Sang-Hwa Lee
- Institute of New Frontier Research Team, Hallym University College of Medicine, Chuncheon 24252, Korea; (P.-J.K.); (S.-H.L.); (J.-H.S.); (D.-K.K.); (J.-J.L.)
- Department of Neurology, Chuncheon Sacred Heart Hospital, Chuncheon 24253, Korea
| | - Jong-Hee Sohn
- Institute of New Frontier Research Team, Hallym University College of Medicine, Chuncheon 24252, Korea; (P.-J.K.); (S.-H.L.); (J.-H.S.); (D.-K.K.); (J.-J.L.)
- Department of Neurology, Chuncheon Sacred Heart Hospital, Chuncheon 24253, Korea
| | - Dong-Kyu Kim
- Institute of New Frontier Research Team, Hallym University College of Medicine, Chuncheon 24252, Korea; (P.-J.K.); (S.-H.L.); (J.-H.S.); (D.-K.K.); (J.-J.L.)
- Department of Otorhinolaryngology and Head and Neck Surgery, Chuncheon Sacred Heart Hospital, Chuncheon 24253, Korea
| | - Jae-Jun Lee
- Institute of New Frontier Research Team, Hallym University College of Medicine, Chuncheon 24252, Korea; (P.-J.K.); (S.-H.L.); (J.-H.S.); (D.-K.K.); (J.-J.L.)
- Department of Anesthesiology and Pain Medicine, Chuncheon Sacred Heart Hospital, Chuncheon 24253, Korea
| | - Chulho Kim
- Institute of New Frontier Research Team, Hallym University College of Medicine, Chuncheon 24252, Korea; (P.-J.K.); (S.-H.L.); (J.-H.S.); (D.-K.K.); (J.-J.L.)
- Department of Neurology, Chuncheon Sacred Heart Hospital, Chuncheon 24253, Korea
| |
Collapse
|