Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Ahsan MM, Mahmud MAP, Saha PK, Gupta KD, Siddique Z. Effect of Data Scaling Methods on Machine Learning Algorithms and Model Performance. Technologies 2021;9:52. [DOI: 10.3390/technologies9030052] [Citation(s) in RCA: 38] [Impact Index Per Article: 12.7] [Indexed: 01/17/2023]

Cited by Other Article(s)

Dostmohammadi M, Pedram MZ, Hoseinzadeh S, Garcia DA. A GA-stacking ensemble approach for forecasting energy consumption in a smart household: A comparative study of ensemble methods. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2024;364:121264. [PMID: 38870783 DOI: 10.1016/j.jenvman.2024.121264] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/09/2023] [Revised: 05/21/2024] [Accepted: 05/26/2024] [Indexed: 06/15/2024]

Abstract

The considerable amount of energy utilized by buildings has led to various environmental challenges that adversely impact human existence. Predicting buildings' energy usage is commonly acknowledged as encouraging energy efficiency and enabling well-informed decision-making, ultimately leading to decreased energy consumption. Implementing eco-friendly architectural designs is paramount in mitigating energy consumption, particularly in recently constructed structures. This study utilizes clustering analysis on the original dataset to capture complex consumption patterns over various periods. The analysis yields two distinct subsets that represent low and high consumption patterns and an additional subset that exclusively encompasses weekends, attributed to the specific behavior of occupants. Ensemble models have become increasingly popular due to advancements in machine learning techniques. This research utilizes three discrete algorithms, namely Artificial Neural Network (ANN), K-nearest neighbors (KNN), and Decision Trees (DT). In addition, the study employs three more machine learning algorithms based on bagging and boosting: Random Forest (RF), Extreme Gradient Boosting (XGB), and Gradient Boosting Trees (GBT). To augment the accuracy of predictions, a stacking ensemble methodology is employed, wherein the forecasts generated by many algorithms are combined. Given the obtained outcomes, a thorough examination is undertaken, encompassing the techniques of stacking, bagging, and boosting, to conduct a comprehensive comparative study. It is pertinent to highlight that the stacking technique consistently exhibits superior performance relative to alternative ensemble methodologies across a spectrum of heterogeneous datasets. Furthermore, using a genetic algorithm enables the optimization of the combination of base learners, resulting in a notable enhancement in prediction accuracy. After implementing this optimization technique, GA-Stacking demonstrated remarkable performance in Mean Absolute Percentage Error (MAPE) scores. The improvement observed was substantial, surpassing 90 percent for all datasets. In addition, in subset-1, subset-2, and subset-3, the achieved R² scores were 0.983, 0.985, and 0.999, respectively. This represents a substantial advancement in forecasting the energy consumption of residential buildings. Such progress underscores the potential advantages of integrating this framework into the practices of building designers, thereby fostering informed decision-making, design management, and optimization prior to construction.

Trabassi D, Castiglia SF, Bini F, Marinozzi F, Ajoudani A, Lorenzini M, Chini G, Varrecchia T, Ranavolo A, De Icco R, Casali C, Serrao M. Optimizing Rare Disease Gait Classification through Data Balancing and Generative AI: Insights from Hereditary Cerebellar Ataxia. SENSORS (BASEL, SWITZERLAND) 2024;24:3613. [PMID: 38894404 PMCID: PMC11175240 DOI: 10.3390/s24113613] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/09/2024] [Revised: 05/28/2024] [Accepted: 05/31/2024] [Indexed: 06/21/2024]

Affiliation(s)

Dante Trabassi Department of Medical and Surgical Sciences and Biotechnologies, “Sapienza” University of Rome, 04100 Latina, Italy; (D.T.); (C.C.); (M.S.)
Stefano Filippo Castiglia Department of Medical and Surgical Sciences and Biotechnologies, “Sapienza” University of Rome, 04100 Latina, Italy; (D.T.); (C.C.); (M.S.) Department of Brain and Behavioral Sciences, University of Pavia, 27100 Pavia, Italy;
Fabiano Bini Department of Mechanical and Aerospace Engineering, Sapienza University of Rome, 00184 Rome, Italy; (F.B.); (F.M.)
Franco Marinozzi Department of Mechanical and Aerospace Engineering, Sapienza University of Rome, 00184 Rome, Italy; (F.B.); (F.M.)
Arash Ajoudani Department of Advanced Robotics, Italian Institute of Technology, 16163 Genoa, Italy; (A.A.); (M.L.)
Marta Lorenzini Department of Advanced Robotics, Italian Institute of Technology, 16163 Genoa, Italy; (A.A.); (M.L.)
Giorgia Chini Department of Occupational and Environmental Medicine, Epidemiology and Hygiene, INAIL, Monte Porzio Catone, 00078 Rome, Italy; (G.C.); (T.V.); (A.R.)
Tiwana Varrecchia Department of Occupational and Environmental Medicine, Epidemiology and Hygiene, INAIL, Monte Porzio Catone, 00078 Rome, Italy; (G.C.); (T.V.); (A.R.)
Alberto Ranavolo Department of Occupational and Environmental Medicine, Epidemiology and Hygiene, INAIL, Monte Porzio Catone, 00078 Rome, Italy; (G.C.); (T.V.); (A.R.)
Roberto De Icco Department of Brain and Behavioral Sciences, University of Pavia, 27100 Pavia, Italy; Headache Science & Neurorehabilitation Unit, IRCCS Mondino Foundation, 27100 Pavia, Italy
Carlo Casali Department of Medical and Surgical Sciences and Biotechnologies, “Sapienza” University of Rome, 04100 Latina, Italy; (D.T.); (C.C.); (M.S.)
Mariano Serrao Department of Medical and Surgical Sciences and Biotechnologies, “Sapienza” University of Rome, 04100 Latina, Italy; (D.T.); (C.C.); (M.S.) Movement Analysis Laboratory, Policlinico Italia, 00162 Rome, Italy

de Kok JWTM, van Bussel BCT, Schnabel R, van Herpt TTW, Driessen RGH, Meijs DAM, Goossens JA, Mertens HJMM, van Kuijk SMJ, Wynants L, van der Horst ICC, van Rosmalen F. Table 0; documenting the steps to go from clinical database to research dataset. J Clin Epidemiol 2024;170:111342. [PMID: 38574979 DOI: 10.1016/j.jclinepi.2024.111342] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/07/2023] [Revised: 02/01/2024] [Accepted: 03/26/2024] [Indexed: 04/06/2024]

Abstract

OBJECTIVES

Data-driven decision support tools have been increasingly recognized to transform health care. However, such tools are often developed on predefined research datasets without adequate knowledge of the origin of this data and how it was selected. How a dataset is extracted from a clinical database can profoundly impact the validity, interpretability and interoperability of the dataset, and downstream analyses, yet is rarely reported. Therefore, we present a case study illustrating how a definitive patient list was extracted from a clinical source database and how this can be reported.

STUDY DESIGN AND SETTING

A single-center observational study was performed at an academic hospital in the Netherlands to illustrate the impact of selecting a definitive patient list for research from a clinical source database, and the importance of documenting this process. All admissions from the critical care database admitted between January 1, 2013, and January 1, 2023, were used.

RESULTS

An interdisciplinary team collaborated to identify and address potential sources of data insufficiency and uncertainty. We demonstrate a stepwise data preparation process, reducing the clinical source database of 54,218 admissions to a definitive patient list of 21,553 admissions. Transparent documentation of the data preparation process improves the quality of the definitive patient list before analysis of the corresponding patient data. This study generated seven important recommendations for preparing observational health-care data for research purposes.

CONCLUSION

Documenting data preparation is essential for understanding a research dataset originating from a clinical source database before analyzing health-care data. The findings contribute to establishing data standards and offer insights into the complexities of preparing health-care data for scientific investigation. Meticulous data preparation and documentation thereof will improve research validity and advance critical care.

Affiliation(s)

Jip W T M de Kok Department of Intensive Care Medicine, Maastricht University Medical Centre+, Maastricht, The Netherlands; Cardiovascular Research Institute Maastricht (CARIM), Maastricht University, Maastricht, The Netherlands
Bas C T van Bussel Department of Intensive Care Medicine, Maastricht University Medical Centre+, Maastricht, The Netherlands; Cardiovascular Research Institute Maastricht (CARIM), Maastricht University, Maastricht, The Netherlands; Care and Public Health Research Institute (CAPHRI), Maastricht University, Maastricht, The Netherlands
Ronny Schnabel Department of Intensive Care Medicine, Maastricht University Medical Centre+, Maastricht, The Netherlands
Thijs T W van Herpt Department of Intensive Care Medicine, Maastricht University Medical Centre+, Maastricht, The Netherlands; Cardiovascular Research Institute Maastricht (CARIM), Maastricht University, Maastricht, The Netherlands
Rob G H Driessen Department of Intensive Care Medicine, Maastricht University Medical Centre+, Maastricht, The Netherlands; Cardiovascular Research Institute Maastricht (CARIM), Maastricht University, Maastricht, The Netherlands; Department of Cardiology, Maastricht University Medical Centre+, Maastricht, The Netherlands
Daniek A M Meijs Department of Intensive Care Medicine, Maastricht University Medical Centre+, Maastricht, The Netherlands; Cardiovascular Research Institute Maastricht (CARIM), Maastricht University, Maastricht, The Netherlands
Joep A Goossens Department of Intensive Care Medicine, Maastricht University Medical Centre+, Maastricht, The Netherlands
Helen J M M Mertens Maastricht University Medical Centre+, Maastricht, The Netherlands
Sander M J van Kuijk Department of Clinical Epidemiology and Medical Technology Assessment (KEMTA), Maastricht University Medical Centre+, Maastricht, The Netherlands
Laure Wynants Department of Epidemiology, CAPHRI Care and Public Health Research Institute, Maastricht University, Maastricht, The Netherlands; Department of Development and Regeneration, KU Leuven, Leuven, Belgium
Iwan C C van der Horst Department of Intensive Care Medicine, Maastricht University Medical Centre+, Maastricht, The Netherlands; Cardiovascular Research Institute Maastricht (CARIM), Maastricht University, Maastricht, The Netherlands
Frank van Rosmalen Department of Intensive Care Medicine, Maastricht University Medical Centre+, Maastricht, The Netherlands; Cardiovascular Research Institute Maastricht (CARIM), Maastricht University, Maastricht, The Netherlands.

Begum N, Rahman MM, Omar Faruk M. Machine learning prediction of nutritional status among pregnant women in Bangladesh: Evidence from Bangladesh demographic and health survey 2017-18. PLoS One 2024;19:e0304389. [PMID: 38820295 PMCID: PMC11142495 DOI: 10.1371/journal.pone.0304389] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/24/2024] [Accepted: 05/12/2024] [Indexed: 06/02/2024]

Ubeira-Gabellini MG, Mori M, Palazzo G, Cicchetti A, Mangili P, Pavarini M, Rancati T, Fodor A, Del Vecchio A, Di Muzio NG, Fiorino C. Comparing Performances of Predictive Models of Toxicity after Radiotherapy for Breast Cancer Using Different Machine Learning Approaches. Cancers (Basel) 2024;16:934. [PMID: 38473296 DOI: 10.3390/cancers16050934] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/17/2024] [Revised: 02/20/2024] [Accepted: 02/20/2024] [Indexed: 03/14/2024]

A G, M T, N S. Machine learning, a powerful tool for the prediction of BiVO₄ nanoparticles efficiency in photocatalytic degradation of organic dyes. JOURNAL OF ENVIRONMENTAL SCIENCE AND HEALTH. PART A, TOXIC/HAZARDOUS SUBSTANCES & ENVIRONMENTAL ENGINEERING 2024;59:15-24. [PMID: 38400531 DOI: 10.1080/10934529.2024.2319510] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/18/2023] [Accepted: 02/08/2024] [Indexed: 02/25/2024]

Li A, Mullin S, Elkin PL. Improving Prediction of Survival for Extremely Premature Infants Born at 23 to 29 Weeks Gestational Age in the Neonatal Intensive Care Unit: Development and Evaluation of Machine Learning Models. JMIR Med Inform 2024;12:e42271. [PMID: 38354033 PMCID: PMC10902770 DOI: 10.2196/42271] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/30/2022] [Revised: 02/02/2023] [Accepted: 12/28/2023] [Indexed: 03/02/2024]

Abstract

BACKGROUND

Infants born at extremely preterm gestational ages are typically admitted to the neonatal intensive care unit (NICU) after initial resuscitation. The subsequent hospital course can be highly variable, and despite counseling aided by available risk calculators, there are significant challenges with shared decision-making regarding life support and transition to end-of-life care. Improving predictive models can help providers and families navigate these unique challenges.

OBJECTIVE

Machine learning methods have previously demonstrated added predictive value for determining intensive care unit outcomes, and their use allows consideration of a greater number of factors that potentially influence newborn outcomes, such as maternal characteristics. Machine learning-based models were analyzed for their ability to predict the survival of extremely preterm neonates at initial admission.

METHODS

Maternal and newborn information was extracted from the health records of infants born between 23 and 29 weeks of gestation in the Medical Information Mart for Intensive Care III (MIMIC-III) critical care database. Applicable machine learning models predicting survival during the initial NICU admission were developed and compared. The same type of model was also examined using only features that would be available prepartum for the purpose of survival prediction prior to an anticipated preterm birth. Features most correlated with the predicted outcome were determined when possible for each model.

RESULTS

Of included patients, 37 of 459 (8.1%) expired. The resulting random forest model showed higher predictive performance than the frequently used Score for Neonatal Acute Physiology With Perinatal Extension II (SNAPPE-II) NICU model when considering extremely preterm infants of very low birth weight. Several other machine learning models were found to have good performance but did not show a statistically significant difference from previously available models in this study. Feature importance varied by model, and those of greater importance included gestational age; birth weight; initial oxygenation level; elements of the APGAR (appearance, pulse, grimace, activity, and respiration) score; and amount of blood pressure support. Important prepartum features also included maternal age, steroid administration, and the presence of pregnancy complications.

CONCLUSIONS

Machine learning methods have the potential to provide robust prediction of survival in the context of extremely preterm births and allow for consideration of additional factors such as maternal clinical and socioeconomic information. Evaluation of larger, more diverse data sets may provide additional clarity on comparative performance.

Hassan J, Saeed SM, Deka L, Uddin MJ, Das DB. Applications of Machine Learning (ML) and Mathematical Modeling (MM) in Healthcare with Special Focus on Cancer Prognosis and Anticancer Therapy: Current Status and Challenges. Pharmaceutics 2024;16:260. [PMID: 38399314 PMCID: PMC10892549 DOI: 10.3390/pharmaceutics16020260] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/08/2023] [Revised: 01/29/2024] [Accepted: 02/07/2024] [Indexed: 02/25/2024]

Moaveninejad S, D'Onofrio V, Tecchio F, Ferracuti F, Iarlori S, Monteriù A, Porcaro C. Fractal Dimension as a discriminative feature for high accuracy classification in motor imagery EEG-based brain-computer interface. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2024;244:107944. [PMID: 38064955 DOI: 10.1016/j.cmpb.2023.107944] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/06/2023] [Revised: 10/31/2023] [Accepted: 11/24/2023] [Indexed: 01/26/2024]

Olcay B, Ozdemir GD, Ozdemir MA, Ercan UK, Guren O, Karaman O. Prediction of the synergistic effect of antimicrobial peptides and antimicrobial agents via supervised machine learning. BMC Biomed Eng 2024;6:1. [PMID: 38233957 DOI: 10.1186/s42490-024-00075-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/24/2023] [Accepted: 01/09/2024] [Indexed: 01/19/2024]

Abstract

BACKGROUND

Infectious diseases not only cause severe health problems but also burden the healthcare system. Therefore, the effective treatment of those diseases is crucial. Both conventional approaches, such as antimicrobial agents, and novel approaches, like antimicrobial peptides (AMPs), are used to treat infections. However, due to the drawbacks of current approaches, new solutions are still being investigated. One recent approach is the use of AMPs and antimicrobial agents in combination, but determining synergism across a huge variety of AMPs is time-consuming and requires multiple experimental studies. Machine learning (ML) algorithms are widely used to predict biological outcomes, particularly in the field of AMPs, but no previous research has reported on predicting the synergistic effects of AMPs and antimicrobial agents.

RESULTS

Several supervised ML models were implemented to accurately predict the synergistic effect of AMPs and antimicrobial agents. The results demonstrated that the hyperparameter-optimized Light Gradient Boosted Machine Classifier (oLGBMC) yielded the best test accuracy of 76.92% for predicting the synergistic effect. Besides, the feature importance analysis reveals that the target microbial species, the minimum inhibitory concentrations (MICs) of the AMP and the antimicrobial agents, and the used antimicrobial agent were the most important features for the prediction of synergistic effect, which aligns with recent experimental studies in the literature.

CONCLUSION

This study reveals that ML algorithms can predict the synergistic activity of two different antimicrobial agents without the need for complex and time-consuming experimental procedures. The implications support that the ML models may not only reduce the experimental cost but also provide validation of experimental procedures.

Rodríguez Mallma MJ, Vilca-Aguilar M, Zuloaga-Rotta L, Borja-Rosales R, Salas-Ojeda M, Mauricio D. Machine Learning Approach for Analyzing 3-Year Outcomes of Patients with Brain Arteriovenous Malformation (AVM) after Stereotactic Radiosurgery (SRS). Diagnostics (Basel) 2023;14:22. [PMID: 38201331 PMCID: PMC10871108 DOI: 10.3390/diagnostics14010022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/29/2023] [Revised: 12/14/2023] [Accepted: 12/17/2023] [Indexed: 01/12/2024]

Kloonen RMJS, Varisco G, de Kort E, Andriessen P, Niemarkt HJ, van Pul C. Predicting CPAP failure after less invasive surfactant administration (LISA) in preterm infants by machine learning model on vital parameter data: a pilot study. Physiol Meas 2023;44:115005. [PMID: 37939392 DOI: 10.1088/1361-6579/ad0ab6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/16/2023] [Accepted: 11/07/2023] [Indexed: 11/10/2023]

Abstract

Objective. Less invasive surfactant administration (LISA) has been introduced to preterm infants with respiratory distress syndrome on continuous positive airway pressure (CPAP) support in order to avoid intubation and mechanical ventilation. However, after this LISA procedure, a significant proportion of infants fail CPAP treatment (CPAP-F) and require intubation in the first 72 h of life, which is associated with worse complication-free survival chances. The aim of this study was to predict CPAP-F after LISA, based on machine learning (ML) analysis of high-resolution vital parameter monitoring data surrounding the LISA procedure.

Approach. Patients with a gestational age (GA) <32 weeks receiving LISA were included. Vital parameter data was obtained from a data warehouse. Physiological features (HR, RR, peripheral oxygen saturation (SpO2) and body temperature) were calculated in eight 0.5 h windows throughout a period 1.5 h before to 2.5 h after LISA. First, physiological data was analyzed to investigate differences between the CPAP-F and CPAP-Success (CPAP-S) groups. Next, the performance of two types of ML models (logistic regression: LR, support vector machine: SVM) for the prediction of CPAP-F was evaluated.

Main results. Of 51 included patients, 18 (35%) had CPAP-F. Univariate analysis showed lower SpO2, temperature and heart rate variability (HRV) before and after the LISA procedure. The best performing ML model showed an area under the curve of 0.90 and 0.93 for LR and SVM respectively in the 0.5 h window directly after LISA, with GA, HRV, respiration rate and SpO2 as most important features. Excluding GA decreased performance in both models.

Significance. In this pilot study we were able to predict CPAP-F with a ML model of patient monitor signals, with best performance in the first 0.5 h after LISA. Using ML to predict CPAP-F based on vital signals provides insight into (possibly modifiable) factors that are associated with LISA failure and can help to guide personalized clinical decisions in early respiratory management.

Tran TS, Stitmannaithum B, Van Hong Bui L, Nguyen TT. Data-driven prediction of the shear capacity of ETS-FRP-strengthened beams in the hybrid 2PKT-ML approach. Sci Rep 2023;13:19871. [PMID: 37963991 PMCID: PMC10646016 DOI: 10.1038/s41598-023-47064-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/28/2023] [Accepted: 11/08/2023] [Indexed: 11/16/2023]

Xiao X. DVNE-DRL: dynamic virtual network embedding algorithm based on deep reinforcement learning. Sci Rep 2023;13:19789. [PMID: 37957350 PMCID: PMC10643368 DOI: 10.1038/s41598-023-47195-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/06/2023] [Accepted: 11/10/2023] [Indexed: 11/15/2023]

Yagin B, Yagin FH, Colak C, Inceoglu F, Kadry S, Kim J. Cancer Metastasis Prediction and Genomic Biomarker Identification through Machine Learning and eXplainable Artificial Intelligence in Breast Cancer Research. Diagnostics (Basel) 2023;13:3314. [PMID: 37958210 PMCID: PMC10650093 DOI: 10.3390/diagnostics13213314] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 10/04/2023] [Revised: 10/17/2023] [Accepted: 10/25/2023] [Indexed: 11/15/2023]

Abstract

AIM

This research presents a model combining machine learning (ML) techniques and eXplainable artificial intelligence (XAI) to predict breast cancer (BC) metastasis and reveal important genomic biomarkers in metastasis patients.

METHOD

A total of 98 primary BC samples was analyzed, comprising 34 samples from patients who developed distant metastases within a 5-year follow-up period and 44 samples from patients who remained disease-free for at least 5 years after diagnosis. Genomic data were then subjected to biostatistical analysis, followed by the application of the elastic net feature selection method. This technique identified a restricted number of genomic biomarkers associated with BC metastasis. A light gradient boosting machine (LightGBM), categorical boosting (CatBoost), Extreme Gradient Boosting (XGBoost), Gradient Boosting Trees (GBT), and Ada boosting (AdaBoost) algorithms were utilized for prediction. To assess the models' predictive abilities, the accuracy, F1 score, precision, recall, area under the ROC curve (AUC), and Brier score were calculated as performance evaluation metrics. To promote interpretability and overcome the "black box" problem of ML models, a SHapley Additive exPlanations (SHAP) method was employed.

RESULTS

The LightGBM model outperformed other models, yielding remarkable accuracy of 96% and an AUC of 99.3%. In addition to biostatistical evaluation, in XAI-based SHAP results, increased expression levels of TSPYL5, ATP5E, CA9, NUP210, SLC37A1, ARIH1, PSMD7, UBQLN1, PRAME, and UBE2T (p ≤ 0.05) were found to be associated with an increased incidence of BC metastasis. Finally, decreased levels of expression of CACTIN, TGFB3, SCUBE2, ARL4D, OR1F1, ALDH4A1, PHF1, and CROCC (p ≤ 0.05) genes were also determined to increase the risk of metastasis in BC.

CONCLUSION

The findings of this study may prevent disease progression and metastases and potentially improve clinical outcomes by recommending customized treatment approaches for BC patients.

Hasan MF, Smith R, Vajedian S, Pommerenke R, Majumdar S. Global land subsidence mapping reveals widespread loss of aquifer storage capacity. Nat Commun 2023;14:6180. [PMID: 37794012 PMCID: PMC10550978 DOI: 10.1038/s41467-023-41933-z] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Received: 07/11/2023] [Accepted: 09/22/2023] [Indexed: 10/06/2023]

Zafar K, Siddiqui HUR, Majid A, Rustam F, Alfarhood S, Safran M, Ashraf I. Enhancing Diagnosis of Anterior and Inferior Myocardial Infarctions Using UWB Radar and AI-Driven Feature Fusion Approach. SENSORS (BASEL, SWITZERLAND) 2023;23:7756. [PMID: 37765813 PMCID: PMC10537523 DOI: 10.3390/s23187756] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/28/2023] [Revised: 09/02/2023] [Accepted: 09/05/2023] [Indexed: 09/29/2023]

Greenberg ZF, Graim KS, He M. Towards artificial intelligence-enabled extracellular vesicle precision drug delivery. Adv Drug Deliv Rev 2023:114974. [PMID: 37356623 DOI: 10.1016/j.addr.2023.114974] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Received: 02/06/2023] [Revised: 06/21/2023] [Accepted: 06/22/2023] [Indexed: 06/27/2023]

Mehmood A, Lee KT, Kim DH. Energy Prediction and Optimization for Smart Homes with Weather Metric-Weight Coefficients. SENSORS (BASEL, SWITZERLAND) 2023;23:3640. [PMID: 37050700 PMCID: PMC10099256 DOI: 10.3390/s23073640] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/25/2023] [Revised: 03/26/2023] [Accepted: 03/27/2023] [Indexed: 06/19/2023]

Abstract

Home appliances are considered to account for a large portion of smart homes' energy consumption. This is due to the abundant use of IoT devices. Various home appliances, such as heaters, dishwashers, and vacuum cleaners, are used every day. It is thought that proper control of these home appliances can significantly reduce energy use. For this purpose, optimization techniques focusing mainly on energy reduction are used. Current optimization techniques somewhat reduce energy use but overlook user convenience, which was the main goal of introducing home appliances. Therefore, there is a need for an optimization method that effectively addresses the trade-off between energy saving and user convenience. Current optimization techniques should include weather metrics other than temperature and humidity to effectively optimize the energy cost of controlling the desired indoor setting of a smart home for the user. This research work involves an optimization technique that addresses the trade-off between energy saving and user convenience, including the use of air pressure, dew point, and wind speed. To test the optimization, a hybrid approach utilizing GWO and PSO was modeled. This work involved enabling proactive energy optimization using appliance energy prediction. An LSTM model was designed to test the appliances' energy predictions. Through predictions and optimized control, smart home appliances could be proactively and effectively controlled. First, we evaluated the RMSE score of the predictive model and found that the proposed model results in low RMSE values. Second, we conducted several simulations and found the proposed optimization results to provide energy cost savings used in appliance control to regulate the desired indoor setting of the smart home. Energy cost reduction goals using the optimization strategies were evaluated for seasonal and monthly patterns of data for result verification. Hence, the proposed work is considered a better candidate solution for proactively optimizing the energy of smart homes.

Shastry KA, Sattar SA. Logistic random forest boosting technique for Alzheimer’s diagnosis. INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY 2023;15:1719-1731. [PMID: 37056794 PMCID: PMC9983513 DOI: 10.1007/s41870-023-01187-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/31/2022] [Accepted: 02/16/2023] [Indexed: 03/06/2023]

Identifying two distinct subphenotypes of patent ductus arteriosus in preterm infants using machine learning. Eur J Pediatr 2023;182:2173-2179. [PMID: 36853570 DOI: 10.1007/s00431-023-04882-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/24/2023] [Revised: 02/09/2023] [Accepted: 02/15/2023] [Indexed: 03/01/2023]

The Increase of Theta Power and Decrease of Alpha/Theta Ratio as a Manifestation of Cognitive Impairment in Parkinson's Disease. J Clin Med 2023;12:jcm12041569. [PMID: 36836103 PMCID: PMC9965386 DOI: 10.3390/jcm12041569] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2022] [Revised: 02/06/2023] [Accepted: 02/13/2023] [Indexed: 02/18/2023] Open

Ahmed B, Haque MA, Iquebal MA, Jaiswal S, Angadi UB, Kumar D, Rai A. DeepAProt: Deep learning based abiotic stress protein sequence classification and identification tool in cereals. FRONTIERS IN PLANT SCIENCE 2023;13:1008756. [PMID: 36714750 PMCID: PMC9877618 DOI: 10.3389/fpls.2022.1008756] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/01/2022] [Accepted: 11/14/2022] [Indexed: 06/18/2023]

Hasnul MA, Ab. Aziz NA, Abd. Aziz A. Augmenting ECG Data with Multiple Filters for a Better Emotion Recognition System. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING 2023;48:1-22. [PMID: 36685996 PMCID: PMC9838506 DOI: 10.1007/s13369-022-07585-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/24/2022] [Accepted: 12/18/2022] [Indexed: 01/13/2023]

Kanyongo W, Ezugwu AE. Machine learning approaches to medication adherence amongst NCD patients: A systematic literature review. INFORMATICS IN MEDICINE UNLOCKED 2023. [DOI: 10.1016/j.imu.2023.101210] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/11/2023] Open

Alam MS, Rashid MM, Roy R, Faizabadi AR, Gupta KD, Ahsan MM. Empirical Study of Autism Spectrum Disorder Diagnosis Using Facial Images by Improved Transfer Learning Approach. Bioengineering (Basel) 2022;9:710. [PMID: 36421111 PMCID: PMC9687350 DOI: 10.3390/bioengineering9110710] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2022] [Revised: 11/09/2022] [Accepted: 11/14/2022] [Indexed: 09/29/2023] Open

Paepae T, Bokoro PN, Kyamakya K. A Virtual Sensing Concept for Nitrogen and Phosphorus Monitoring Using Machine Learning Techniques. SENSORS (BASEL, SWITZERLAND) 2022;22:7338. [PMID: 36236438 PMCID: PMC9572788 DOI: 10.3390/s22197338] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/21/2022] [Revised: 09/20/2022] [Accepted: 09/24/2022] [Indexed: 06/16/2023]

Skin cancer diagnosis based on deep transfer learning and sparrow search algorithm. Neural Comput Appl 2022. [DOI: 10.1007/s00521-022-07762-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]

Abstract

Skin cancer affects the lives of millions of people every year, as it is considered the most common form of cancer. In the USA alone, approximately three and a half million people are diagnosed with skin cancer annually. The survival rate diminishes steeply as the skin cancer progresses. Despite this, it is an expensive and difficult procedure to discover this cancer type in the early stages. In this study, a threshold-based automatic approach for skin cancer detection, classification, and segmentation utilizing a meta-heuristic optimizer named sparrow search algorithm (SpaSA) is proposed. Five U-Net models (i.e., U-Net, U-Net++, Attention U-Net, V-net, and Swin U-Net) with different configurations are utilized to perform the segmentation process. Besides this, the meta-heuristic SpaSA optimizer is used to perform the optimization of the hyperparameters using eight pre-trained CNN models (i.e., VGG16, VGG19, MobileNet, MobileNetV2, MobileNetV3Large, MobileNetV3Small, NASNetMobile, and NASNetLarge). The dataset is gathered from five public sources in which two types of datasets are generated (i.e., 2-classes and 10-classes). For the segmentation, concerning the “skin cancer segmentation and classification” dataset, the best reported scores by U-Net++ with DenseNet201 as a backbone architecture are 0.104, 94.16%, 91.39%, 99.03%, 96.08%, 96.41%, 77.19%, and 75.47% in terms of loss, accuracy, F1-score, AUC, IoU, dice, hinge, and squared hinge, respectively, while for the “PH2” dataset, the best reported scores by the Attention U-Net with DenseNet201 as backbone architecture are 0.137, 94.75%, 92.65%, 92.56%, 92.74%, 96.20%, 86.30%, 92.65%, 69.28%, and 68.04% in terms of loss, accuracy, F1-score, precision, sensitivity, specificity, IoU, dice, hinge, and squared hinge, respectively. For the “ISIC 2019 and 2020 Melanoma” dataset, the best reported overall accuracy from the applied CNN experiments is 98.27% by the MobileNet pre-trained model. Similarly, for the “Melanoma Classification (HAM10K)” dataset, the best reported overall accuracy from the applied CNN experiments is 98.83% by the MobileNet pre-trained model. For the “skin diseases image” dataset, the best reported overall accuracy from the applied CNN experiments is 85.87% by the MobileNetV2 pre-trained model. After computing the results, the suggested approach is compared with 13 related studies.

Adams J, Agyenkwa-Mawuli K, Agyapong O, Wilson MD, Kwofie SK. EBOLApred: A machine learning-based web application for predicting cell entry inhibitors of the Ebola virus. Comput Biol Chem 2022;101:107766. [DOI: 10.1016/j.compbiolchem.2022.107766] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2022] [Revised: 08/10/2022] [Accepted: 08/29/2022] [Indexed: 11/03/2022]

Accurate Numerical Treatment on a Stochastic SIR Epidemic Model with Optimal Control Strategy. TECHNOLOGIES 2022. [DOI: 10.3390/technologies10040082] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]

A Two-Step Data Normalization Approach for Improving Classification Accuracy in the Medical Diagnosis Domain. MATHEMATICS 2022. [DOI: 10.3390/math10111942] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]

Abstract

Data normalization is a data preprocessing task and one of the first to be performed during intellectual analysis, particularly in the case of tabular data. The importance of its implementation is determined by the need to reduce the sensitivity of the artificial intelligence model to the values of the features in the dataset to increase the studied model’s adequacy. This paper focuses on the problem of effectively preprocessing data to improve the accuracy of intellectual analysis in the case of performing medical diagnostic tasks. We developed a new two-step method for data normalization of numerical medical datasets. It is based on the possibility of considering both the interdependencies between the features of each observation from the dataset and their absolute values to improve the accuracy when performing medical data mining tasks. We describe and substantiate each step of the algorithmic implementation of the method. We also visualize the results of the proposed method. The proposed method was modeled using six different machine learning methods based on decision trees when performing binary and multiclass classification tasks. We used six real-world, freely available medical datasets with different numbers of vectors, attributes, and classes to conduct experiments. A comparison between the effectiveness of the developed method and that of five existing data normalization methods was carried out. It was experimentally established that the developed method increases the accuracy of the Decision Tree and Extra Trees Classifier by 1–5% in the case of performing the binary classification task and the accuracy of the Bagging, Decision Tree, and Extra Trees Classifier by 1–6% in the case of performing the multiclass classification task. Increasing the accuracy of these classifiers only by using the new data normalization method satisfies all the prerequisites for its application in practice when performing various medical data mining tasks.

Ahsan MM, Siddique Z. Machine learning-based heart disease diagnosis: A systematic literature review. Artif Intell Med 2022;128:102289. [DOI: 10.1016/j.artmed.2022.102289] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2021] [Accepted: 03/22/2022] [Indexed: 01/01/2023]

A Modified Iterative Algorithm for Numerical Investigation of HIV Infection Dynamics. ALGORITHMS 2022. [DOI: 10.3390/a15050175] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]

Sensor Screening Methodology for Virtually Sensing Transmission Input Loads of a Wind Turbine Using Machine Learning Techniques and Drivetrain Simulations. SENSORS 2022;22:s22103659. [PMID: 35632067 PMCID: PMC9145404 DOI: 10.3390/s22103659] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/31/2022] [Revised: 04/28/2022] [Accepted: 05/09/2022] [Indexed: 12/05/2022]

A Comparative Analysis on Suicidal Ideation Detection Using NLP, Machine, and Deep Learning. TECHNOLOGIES 2022. [DOI: 10.3390/technologies10030057] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]

A Multi-Step Time-Series Clustering-Based Seq2Seq LSTM Learning for a Single Household Electricity Load Forecasting. ENERGIES 2022. [DOI: 10.3390/en15072623] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/04/2022]

Ahsan MM, Luna SA, Siddique Z. Machine-Learning-Based Disease Diagnosis: A Comprehensive Review. Healthcare (Basel) 2022;10:541. [PMID: 35327018 PMCID: PMC8950225 DOI: 10.3390/healthcare10030541] [Citation(s) in RCA: 42] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2022] [Revised: 03/08/2022] [Accepted: 03/10/2022] [Indexed: 02/06/2023] Open

A Machine Learning Based Model for Energy Usage Peak Prediction in Smart Farms. ELECTRONICS 2022. [DOI: 10.3390/electronics11020218] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]

Abstract

Context: Energy utilization is closely tied to many areas of the smart farm, affecting plant growth, crop production, device automation, and energy supply to the same degree. Recently, 4th industrial revolution technologies such as IoT, artificial intelligence, and big data have been widely used in smart farm environments to efficiently use energy and control smart farms’ conditions. In particular, machine learning technologies with big data analysis are actively used as one of the most potent prediction methods supporting energy use in the smart farm. Purpose: This study proposes a machine learning-based prediction model for peak energy use by analyzing energy-related data collected from various environmental and growth devices in a smart paprika farm of the Jeonnam Agricultural Research and Extension Service in South Korea between 2019 and 2021. Scientific method: To find the optimal prediction model, comparative evaluation tests are performed using representative ML algorithms such as artificial neural network, support vector regression, random forest, K-nearest neighbors, extreme gradient boosting and gradient boosting machine, and time series algorithm ARIMA with binary classification for a different number of input features. Validate: This article can provide an effective and viable way for smart farm managers or greenhouse farmers to better manage agricultural energy economically and environmentally. Therefore, we hope that the recommended ML method will help improve the smart farm’s energy use or their energy policies in various fields related to agricultural energy. Conclusion: The seven performance metrics, including R-squared, root mean squared error, and mean absolute error, are associated with these two algorithms. It is concluded that the RF-based model is more successful than the others, with a prediction accuracy of 92%. Therefore, the proposed model may contribute to the development of various applications for environment energy usage in a smart farm, such as a notification service for energy usage peak time or an energy usage control for each device.

Traffic Flow Prediction for Smart Traffic Lights Using Machine Learning Algorithms. TECHNOLOGIES 2022. [DOI: 10.3390/technologies10010005] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]

Winston L, McCann M, Onofrei G. ‘Exploring socioeconomic status as a global determinant of COVID-19 prevalence, using statistical, exploratory data analytic, and supervised machine learning techniques.’ (Preprint). JMIR Form Res 2021;6:e35114. [PMID: 36001798 PMCID: PMC9518652 DOI: 10.2196/35114] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2021] [Revised: 04/12/2022] [Accepted: 04/27/2022] [Indexed: 11/21/2022] Open

Abstract

Background

The COVID-19 pandemic represents the most unprecedented global challenge in recent times. As the global community attempts to manage the pandemic in the long term, it is pivotal to understand what factors drive prevalence rates and to predict the future trajectory of the virus.

Objective

This study had 2 objectives. First, it tested the statistical relationship between socioeconomic status and COVID-19 prevalence. Second, it used machine learning techniques to predict cumulative COVID-19 cases in a multicountry sample of 182 countries. Taken together, these objectives will shed light on socioeconomic status as a global risk factor of the COVID-19 pandemic.

Methods

This research used exploratory data analysis and supervised machine learning methods. Exploratory analysis included variable distribution, variable correlations, and outlier detection. Following this, the following 3 supervised regression techniques were applied: linear regression, random forest, and adaptive boosting (AdaBoost). Results were evaluated using k-fold cross-validation and subsequently compared to analyze algorithmic suitability. The analysis involved 2 models. First, the algorithms were trained to predict 2021 COVID-19 prevalence using only 2020 reported case data. Following this, socioeconomic indicators were added as features and the algorithms were trained again. The Human Development Index (HDI) metrics of life expectancy, mean years of schooling, expected years of schooling, and gross national income were used to approximate socioeconomic status.

Results

All variables correlated positively with the 2021 COVID-19 prevalence, with R² values ranging from 0.55 to 0.85. Using socioeconomic indicators, COVID-19 prevalence was predicted with a reasonable degree of accuracy. Using 2020 reported case rates as a lone predictor to predict 2021 prevalence rates, the average predictive accuracy of the algorithms was low (R²=0.543). When socioeconomic indicators were added alongside 2020 prevalence rates as features, the average predictive performance improved considerably (R²=0.721) and all error statistics decreased. Thus, adding socioeconomic indicators alongside 2020 reported case data optimized the prediction of COVID-19 prevalence to a considerable degree. Linear regression was the strongest learner with R²=0.693 on the first model and R²=0.763 on the second model, followed by random forest (0.481 and 0.722) and AdaBoost (0.454 and 0.679). Following this, the second model was retrained using a selection of additional COVID-19 risk factors (population density, median age, and vaccination uptake) instead of the HDI metrics. However, average accuracy dropped to 0.649, which highlights the value of socioeconomic status as a predictor of COVID-19 cases in the chosen sample.
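As a quick consistency check on the figures above, the following minimal sketch recomputes the two average R² values from the per-algorithm scores quoted in the Results. It assumes the reported averages are unweighted means over the three algorithms; the abstract does not state how they were averaged, and the names `r2_scores` and `mean_r2` are purely illustrative.

```python
# R² values as quoted in the Results paragraph above.
# First entry: model 1 (2020 reported case data only).
# Second entry: model 2 (2020 case data plus HDI socioeconomic indicators).
r2_scores = {
    "linear regression": (0.693, 0.763),
    "random forest": (0.481, 0.722),
    "AdaBoost": (0.454, 0.679),
}


def mean_r2(scores, model_index):
    """Unweighted mean of the R² values for one model across all algorithms."""
    values = [pair[model_index] for pair in scores.values()]
    return sum(values) / len(values)


# Assumption: the abstract's averages are simple means rounded to 3 decimals.
avg_model_1 = round(mean_r2(r2_scores, 0), 3)
avg_model_2 = round(mean_r2(r2_scores, 1), 3)
print(avg_model_1, avg_model_2)
```

Under that assumption, the recomputed means agree with the reported averages of 0.543 and 0.721.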

Conclusions

The results show that socioeconomic status is an important variable to consider in future epidemiological modeling, and highlight the reality of the COVID-19 pandemic as a social phenomenon and a health care phenomenon. This paper also puts forward new considerations about the application of statistical and machine learning techniques to understand and combat the COVID-19 pandemic.

Bakhshian S, Romanak K. DeepSense: A Physics-Guided Deep Learning Paradigm for Anomaly Detection in Soil Gas Data at Geologic CO₂ Storage Sites. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2021;55:15531-15541. [PMID: 34694136 DOI: 10.1021/acs.est.1c04048] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]

Choi JM, Seo SY, Kim PJ, Kim YS, Lee SH, Sohn JH, Kim DK, Lee JJ, Kim C. Prediction of Hemorrhagic Transformation after Ischemic Stroke Using Machine Learning. J Pers Med 2021;11:863. [PMID: 34575640 PMCID: PMC8470833 DOI: 10.3390/jpm11090863] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2021] [Revised: 08/25/2021] [Accepted: 08/28/2021] [Indexed: 12/27/2022] Open

Affiliation(s)

Jeong-Myeong Choi: Department of Convergence Software, Hallym University, Chuncheon 24252, Korea
Soo-Young Seo: Department of Convergence Software, Hallym University, Chuncheon 24252, Korea
Pum-Jun Kim: Institute of New Frontier Research Team, Hallym University College of Medicine, Chuncheon 24252, Korea
Yu-Seop Kim: Department of Convergence Software, Hallym University, Chuncheon 24252, Korea
Sang-Hwa Lee: Institute of New Frontier Research Team, Hallym University College of Medicine, Chuncheon 24252, Korea; Department of Neurology, Chuncheon Sacred Heart Hospital, Chuncheon 24253, Korea
Jong-Hee Sohn: Institute of New Frontier Research Team, Hallym University College of Medicine, Chuncheon 24252, Korea; Department of Neurology, Chuncheon Sacred Heart Hospital, Chuncheon 24253, Korea
Dong-Kyu Kim: Institute of New Frontier Research Team, Hallym University College of Medicine, Chuncheon 24252, Korea; Department of Otorhinolaryngology and Head and Neck Surgery, Chuncheon Sacred Heart Hospital, Chuncheon 24253, Korea
Jae-Jun Lee: Institute of New Frontier Research Team, Hallym University College of Medicine, Chuncheon 24252, Korea; Department of Anesthesiology and Pain Medicine, Chuncheon Sacred Heart Hospital, Chuncheon 24253, Korea
Chulho Kim: Institute of New Frontier Research Team, Hallym University College of Medicine, Chuncheon 24252, Korea; Department of Neurology, Chuncheon Sacred Heart Hospital, Chuncheon 24253, Korea
