Reference Citation Analysis

For:	Brereton RG, Lloyd GR. Support vector machines for classification and regression. Analyst 2009;135:230-67. [PMID: 20098757 DOI: 10.1039/b918972f] [Citation(s) in RCA: 245] [Impact Index Per Article: 16.3] [Indexed: 11/21/2022]

Cited by Other Article(s)

Gracida-Osorno C, Molina-Salinas GM, Góngora-Hernández R, Brito-Loeza C, Uc-Cachón AH, Paniagua-Sierra JR. Machine Learning for Predicting Chronic Renal Disease Progression in COVID-19 Patients with Acute Renal Injury: A Feasibility Study. Biomedicines 2024;12:1511. [PMID: 39062084 PMCID: PMC11274434 DOI: 10.3390/biomedicines12071511] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/23/2024] [Revised: 05/21/2024] [Accepted: 05/31/2024] [Indexed: 07/28/2024] Open

Wu W, Fukui S. Using Human Resources Data to Predict Turnover of Community Mental Health Employees: Prediction and Interpretation of Machine Learning Methods. Int J Ment Health Nurs 2024. [PMID: 38961607 DOI: 10.1111/inm.13387] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/19/2023] [Revised: 05/09/2024] [Accepted: 06/20/2024] [Indexed: 07/05/2024]

Shin S, Choi TY, Han DH, Choi B, Cho E, Seog Y, Koo BN. An explainable machine learning model to predict early and late acute kidney injury after major hepatectomy. HPB (Oxford) 2024;26:949-959. [PMID: 38705794 DOI: 10.1016/j.hpb.2024.04.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/19/2023] [Revised: 12/13/2023] [Accepted: 04/19/2024] [Indexed: 05/07/2024]

Li M, Yin S, Liu Z, Zhang H. Machine learning enables electrical resistivity modeling of printed lines in aerosol jet 3D printing. Sci Rep 2024;14:14614. [PMID: 38918598 PMCID: PMC11199662 DOI: 10.1038/s41598-024-65693-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/16/2024] [Accepted: 06/24/2024] [Indexed: 06/27/2024] Open

Haruna SI, Ibrahim YE, Hassan IH, Al-shawafi A, Zhu H. Bond Strength Assessment of Normal Strength Concrete-Ultra-High-Performance Fiber Reinforced Concrete Using Repeated Drop-Weight Impact Test: Experimental and Machine Learning Technique. Materials (Basel, Switzerland) 2024;17:3032. [PMID: 38930404 PMCID: PMC11205906 DOI: 10.3390/ma17123032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/26/2024] [Revised: 05/29/2024] [Accepted: 06/17/2024] [Indexed: 06/28/2024]

Liu B, Guo B, Zhuo R, Dai F. Estimation of soil organic carbon in LUCAS soil database using Vis-NIR spectroscopy based on hybrid kernel Gaussian process regression. Spectrochimica Acta. Part A, Molecular and Biomolecular Spectroscopy 2024;321:124687. [PMID: 38909558 DOI: 10.1016/j.saa.2024.124687] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/20/2024] [Revised: 06/02/2024] [Accepted: 06/18/2024] [Indexed: 06/25/2024]

Woodhouse AW, Kocaarslan A, Garden JA, Mutlu H. Unlocking the Potential of Polythioesters. Macromol Rapid Commun 2024:e2400260. [PMID: 38824417 DOI: 10.1002/marc.202400260] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/22/2024] [Revised: 05/20/2024] [Indexed: 06/03/2024]

Kushwaha NL, Kudnar NS, Vishwakarma DK, Subeesh A, Jatav MS, Gaddikeri V, Ahmed AA, Abdelaty I. Stacked hybridization to enhance the performance of artificial neural networks (ANN) for prediction of water quality index in the Bagh river basin, India. Heliyon 2024;10:e31085. [PMID: 38784559 PMCID: PMC11112320 DOI: 10.1016/j.heliyon.2024.e31085] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/07/2024] [Revised: 05/03/2024] [Accepted: 05/09/2024] [Indexed: 05/25/2024] Open

Fang Z, Ke H, Ma Y, Zhao S, Zhou R, Ma Z, Liu Z. Design optimization of groundwater circulation well based on numerical simulation and machine learning. Sci Rep 2024;14:11506. [PMID: 38769108 PMCID: PMC11106317 DOI: 10.1038/s41598-024-62545-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/11/2024] [Accepted: 05/17/2024] [Indexed: 05/22/2024] Open

Yang D, Zhou Y, Jie Y, Li Q, Shi T. Non-destructive detection of defective maize kernels using hyperspectral imaging and convolutional neural network with attention module. Spectrochimica Acta. Part A, Molecular and Biomolecular Spectroscopy 2024;313:124166. [PMID: 38493512 DOI: 10.1016/j.saa.2024.124166] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/08/2023] [Revised: 03/04/2024] [Accepted: 03/14/2024] [Indexed: 03/19/2024]

Bui TBC, Iida D, Kitamura Y, Kokawa M. Utilization of multiple-dilution fluorescence fingerprint facilitates prediction of chemical attributes in spice extracts. Food Chem 2024;438:138028. [PMID: 38091861 DOI: 10.1016/j.foodchem.2023.138028] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/28/2023] [Accepted: 11/14/2023] [Indexed: 12/28/2023]

Chappel JR, Kirkwood-Donelson KI, Reif DM, Baker ES. From big data to big insights: statistical and bioinformatic approaches for exploring the lipidome. Anal Bioanal Chem 2024;416:2189-2202. [PMID: 37875675 PMCID: PMC10954412 DOI: 10.1007/s00216-023-04991-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/30/2023] [Revised: 10/01/2023] [Accepted: 10/05/2023] [Indexed: 10/26/2023]

Grelet C, Larsen T, Crowe MA, Wathes DC, Ferris CP, Ingvartsen KL, Marchitelli C, Becker F, Vanlierde A, Leblois J, Schuler U, Auer FJ, Köck A, Dale L, Sölkner J, Christophe O, Hummel J, Mensching A, Fernández Pierna JA, Soyeurt H, Calmels M, Reding R, Gelé M, Chen Y, Gengler N, Dehareng F. Prediction of key milk biomarkers in dairy cows through milk mid-infrared spectra and international collaborations. J Dairy Sci 2024;107:1669-1684. [PMID: 37863287 DOI: 10.3168/jds.2023-23843] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/06/2023] [Accepted: 09/23/2023] [Indexed: 10/22/2023]

Abstract

At the individual cow level, suboptimum fertility, mastitis, negative energy balance, and ketosis are major issues in dairy farming. These problems are widespread on dairy farms and have an important economic impact. The objectives of this study were (1) to assess the potential of milk mid-infrared (MIR) spectra to predict key biomarkers of energy deficit (citrate, isocitrate, glucose-6 phosphate [glucose-6P], free glucose), ketosis (β-hydroxybutyrate [BHB] and acetone), mastitis (N-acetyl-β-d-glucosaminidase activity [NAGase] and lactate dehydrogenase), and fertility (progesterone); (2) to test alternative methodologies to partial least squares (PLS) regression to better account for the specific asymmetric distribution of the biomarkers; and (3) to create robust models by merging large datasets from 5 international or national projects. Benefiting from this international collaboration, the dataset comprised a total of 9,143 milk samples from 3,758 cows located in 589 herds across 10 countries and represented 7 breeds. The samples were analyzed by reference chemistry for biomarker contents, whereas the MIR analyses were performed on 30 instruments from different models and brands, with spectra harmonized into a common format. Four quantitative methodologies were evaluated to address the strongly skewed distribution of some biomarkers. Partial least squares regression was used as the reference basis, and compared with a random modification of distribution associated with PLS (random-downsampling-PLS), an optimized modification of distribution associated with PLS (KennardStone-downsampling-PLS), and support vector machine (SVM). When the ability of MIR to predict biomarkers was too low for quantification, different qualitative methodologies were tested to discriminate low versus high values of biomarkers. For each biomarker, 20% of the herds were randomly removed within all countries to be used as the validation dataset. 
The remaining 80% of herds were used as the calibration dataset. In calibration, the 3 alternative methodologies outperformed PLS for the majority of biomarkers. However, in the external herd validation, PLS provided the best results for isocitrate, glucose-6P, free glucose, and lactate dehydrogenase (coefficient of determination in external herd validation [R2v] = 0.48, 0.58, 0.28, and 0.24, respectively). For other molecules, PLS-random-downsampling and PLS-KennardStone-downsampling outperformed PLS in the majority of cases, but the best results were provided by SVM for citrate, BHB, acetone, NAGase, and progesterone (R2v = 0.94, 0.58, 0.76, 0.68, and 0.15, respectively). Hence, PLS and SVM based on the entire dataset provided the best results for normal and skewed distributions, respectively. Complementary to the quantitative methods, the qualitative discriminant models enabled the discrimination of high and low values for BHB, acetone, and NAGase with a global accuracy around 90%, and glucose-6P with an accuracy of 83%. In conclusion, MIR spectra of milk can enable quantitative screening of citrate as a biomarker of energy deficit and discrimination of low and high values of BHB, acetone, and NAGase, as biomarkers of ketosis and mastitis. Finally, progesterone could not be predicted with sufficient accuracy from milk MIR spectra to be further considered. Consequently, MIR spectrometry can bring valuable information regarding the occurrence of energy deficit, ketosis, and mastitis in dairy cows, which in turn have major influences on their fertility and survival.
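
The herd-level validation split described in this abstract can be illustrated with a minimal sketch. It assumes each sample carries a herd identifier; the `herd` and `bhb` keys, the function names, and the toy values are hypothetical and are not taken from the study. The sketch does not stratify by country as the authors did, and it uses the common 1 - SS_res/SS_tot definition of the coefficient of determination, which may differ from how the authors computed R2v.

```python
import random

def split_by_herd(samples, validation_fraction=0.2, seed=0):
    """Hold out whole herds so that no herd appears in both sets.

    `samples` is a list of dicts with a hypothetical "herd" key.
    """
    herds = sorted({s["herd"] for s in samples})
    rng = random.Random(seed)
    rng.shuffle(herds)
    n_val = max(1, round(len(herds) * validation_fraction))
    val_herds = set(herds[:n_val])
    calibration = [s for s in samples if s["herd"] not in val_herds]
    validation = [s for s in samples if s["herd"] in val_herds]
    return calibration, validation

def r_squared(observed, predicted):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_obs = sum(observed) / len(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - ss_res / ss_tot

# Toy data (made-up values): 10 herds with 3 samples each.
samples = [{"herd": h, "bhb": 0.1 * h + 0.01 * i}
           for h in range(10) for i in range(3)]
calibration, validation = split_by_herd(samples)
```

Splitting by herd rather than by sample keeps samples from the same farm out of both sets at once, which is what makes the validation "external" in the sense the abstract uses.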

Affiliation(s)

C Grelet Walloon Agricultural Research Center (CRA-W), Gembloux, Belgium, 5030
T Larsen Department of Animal and Veterinary Sciences, Aarhus University, Tjele, Denmark, DK-8830
M A Crowe University College Dublin (UCD), Dublin, Ireland, D04 C1P1
D C Wathes Royal Veterinary College (RVC), London, United Kingdom, CM24 1RW
C P Ferris Agri-Food and Biosciences Institute (AFBI), Belfast, Northern Ireland, BT9 5PX
K L Ingvartsen Department of Animal and Veterinary Sciences, Aarhus University, Tjele, Denmark, DK-8830
C Marchitelli Research Center for Animal Production and Aquaculture (CREA), Roma, Italy, 00184
F Becker Leibniz Institute for Farm Animal Biology (FBN), Dummerstorf, Germany, 18196
A Vanlierde Walloon Agricultural Research Center (CRA-W), Gembloux, Belgium, 5030
J Leblois EEIG European Milk Recording (EMR), Ciney, Belgium, 5590
U Schuler Qualitas, Zug, Switzerland, 6300
F J Auer LKV-Austria, Vienna, Austria, A-1200
A Köck ZuchtData, Vienna, Austria, A-1200
L Dale LKV Baden Württemberg, Stuttgart, Germany, D-70190
J Sölkner University of Natural Resources and Life Sciences, Vienna, Austria, A-1180
O Christophe Walloon Agricultural Research Center (CRA-W), Gembloux, Belgium, 5030
J Hummel University of Göttingen, Göttingen, Germany, D-37075
A Mensching University of Göttingen, Göttingen, Germany, D-37075
J A Fernández Pierna Walloon Agricultural Research Center (CRA-W), Gembloux, Belgium, 5030
H Soyeurt University of Liège, Gembloux Agro-Bio Tech (Ulg-GxABT), Gembloux, Belgium, 5030
M Calmels Seenovia, Saint Berthevin, France, 53940
R Reding Convis, Ettelbruck, Luxembourg, 9085
M Gelé Idele, Paris, France, 75012
Y Chen University of Liège, Gembloux Agro-Bio Tech (Ulg-GxABT), Gembloux, Belgium, 5030
N Gengler University of Liège, Gembloux Agro-Bio Tech (Ulg-GxABT), Gembloux, Belgium, 5030
F Dehareng Walloon Agricultural Research Center (CRA-W), Gembloux, Belgium, 5030.

AlHarkan K, Sultana N, Al Mulhim N, AlAbdulKader AM, Alsafwani N, Barnawi M, Alasqah K, Bazuhair A, Alhalwah Z, Bokhamseen D, Aljameel SS, Alamri S, Alqurashi Y, Ghamdi KA. Artificial intelligence approaches for early detection of neurocognitive disorders among older adults. Front Comput Neurosci 2024;18:1307305. [PMID: 38444404 PMCID: PMC10913197 DOI: 10.3389/fncom.2024.1307305] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/04/2023] [Accepted: 01/29/2024] [Indexed: 03/07/2024] Open

Abstract

Introduction

Dementia is one of the major global health issues among the aging population, characterized clinically by a progressive decline in higher cognitive functions. This paper aims to apply various artificial intelligence (AI) approaches to detect patients with mild cognitive impairment (MCI) or dementia accurately.

Methods

Quantitative research was conducted to address the objective of this study using 343 randomly selected Saudi patients. The Chi-square test was conducted to determine the association of the patients' cognitive function with various features, including demographic characteristics and medical history. Two widely used AI algorithms, logistic regression and support vector machine (SVM), were used for detecting cognitive decline. This study also assessed patients' cognitive function based on gender and developed the prediction models for males and females separately.
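
As an illustration of the Chi-square test of association mentioned above, the following minimal sketch computes the Pearson chi-square statistic and its degrees of freedom for a contingency table. The function name and the counts are made up for the example and are not the study's data; obtaining a p-value would additionally require the chi-square distribution, which is left out here.

```python
def chi_square_statistic(table):
    """Pearson chi-square statistic and degrees of freedom for an r x c table of counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count under independence of rows and columns.
            expected = row_totals[i] * col_totals[j] / total
            stat += (observed - expected) ** 2 / expected
    dof = (len(table) - 1) * (len(table[0]) - 1)
    return stat, dof

# Made-up counts: rows = feature present / absent,
# columns = normal cognition / MCI / dementia.
table = [[30, 20, 10], [60, 25, 5]]
stat, dof = chi_square_statistic(table)
```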

Results

Fifty-four percent of patients have normal cognitive function, 34% have MCI, and 12% have dementia. The prediction accuracies for all the developed models are greater than 71%, indicating good prediction capability. However, the developed SVM models performed the best, with an accuracy of 93.3% for all patients, 94.4% for males only, and 95.5% for females only. The top 10 significant predictors based on the developed SVM model are education, bedtime, taking pills for chronic pain, diabetes, stroke, gender, chronic pains, coronary artery diseases, and wake-up time.

Conclusion

The results of this study emphasize the higher accuracy and reliability of the proposed methods in cognitive decline prediction, which health practitioners can use for the early detection of dementia. This research can also provide substantial direction and supporting insights for scholars to enhance their understanding of crucial research, emerging trends, and new developments in future cognitive decline studies.

Affiliation(s)

Khalid AlHarkan Department of Family and Community Medicine, College of Medicine, Imam Abdulrahman Bin Faisal University, Dammam, Saudi Arabia
Nahid Sultana Department of Computer Science, College of Computer Science and Information Technology, Imam Abdulrahman Bin Faisal University, Dammam, Saudi Arabia
Noura Al Mulhim Department of Physiology, College of Medicine, Imam Abdulrahman Bin Faisal University, Dammam, Saudi Arabia
Assim M. AlAbdulKader Department of Family and Community Medicine, College of Medicine, Imam Abdulrahman Bin Faisal University, Dammam, Saudi Arabia
Noor Alsafwani Department of Pathology, College of Medicine, Imam Abdulrahman Bin Faisal University, Dammam, Saudi Arabia
Marwah Barnawi Department of Computer Science, College of Computer Science and Information Technology, Imam Abdulrahman Bin Faisal University, Dammam, Saudi Arabia
Khulud Alasqah Department of Computer Science, College of Computer Science and Information Technology, Imam Abdulrahman Bin Faisal University, Dammam, Saudi Arabia
Anhar Bazuhair Department of Computer Science, College of Computer Science and Information Technology, Imam Abdulrahman Bin Faisal University, Dammam, Saudi Arabia
Zainab Alhalwah Department of Computer Science, College of Computer Science and Information Technology, Imam Abdulrahman Bin Faisal University, Dammam, Saudi Arabia
Dina Bokhamseen Department of Computer Science, College of Computer Science and Information Technology, Imam Abdulrahman Bin Faisal University, Dammam, Saudi Arabia
Sumayh S. Aljameel Department of Computer Science, College of Computer Science and Information Technology, Imam Abdulrahman Bin Faisal University, Dammam, Saudi Arabia
Sultan Alamri Department of Family Medicine, College of Medicine, King Abdulaziz University, Jeddah, Saudi Arabia
Yousef Alqurashi Respiratory Care Department, College of Applied Medical Sciences, Imam Abdulrahman Bin Faisal University, Dammam, Saudi Arabia
Kholoud Al Ghamdi Department of Physiology, College of Medicine, Imam Abdulrahman Bin Faisal University, Dammam, Saudi Arabia

Zhang L, Ye L, Wang F, Gao W, Yu J, Zhang L. Prediction of Hydrogen Abstraction Rate Constants at the Allylic Site between Alkenes and OH with Multiple Machine Learning Models. J Phys Chem A 2024;128:761-772. [PMID: 38237153 DOI: 10.1021/acs.jpca.3c06917] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 02/02/2024]

Abstract

Hydrogen abstraction reactions between hydrocarbons and hydroxyl radicals are important propagation steps in radical chain reactions, playing a crucial role in atmospheric and combustion chemistry. This study focuses on predicting the rate constants of the prototype of the reaction class of hydrogen abstractions, i.e., the primary allylic hydrogen abstraction from alkenes by the OH radical, via utilizing machine learning (ML) methods. Specifically, three distinct models, namely, feedforward neural network (FNN), support vector regression (SVR), and Gaussian process regression (GPR), have been employed to construct robust ML models for prediction. We proposed a novel strategy that seamlessly integrates descriptor preprocessing, a pairwise linear correlation analysis, and a model-specific Wrapper method to enhance the effectiveness of the feature selection procedure. The selected feature subset was then evaluated using two cross-validation techniques, i.e., leave-one-group-out (LOGO) and K-fold cross-validations, for each of the three ML models (FNN, SVR, and GPR) to assess their predictive and stability performance. The results demonstrate that the FNN model, trained with seven representative descriptors, achieves superior performance compared to the other two methods. For the FNN model, the average percentage deviation is 39.06% on the test set by performing LOGO cross-validation, while the repeated 10-fold cross-validation achieves a percentage prediction deviation of 19.1%. Two larger alkenes with 10 carbons were selected to test the prediction performance of the trained FNN model on primary allylic hydrogen abstraction. Results show that the kinetic predictions follow well the modified three-parameter Arrhenius equation, indicating the reliable performance of FNN in predicting hydrogen abstraction rate constants, especially for the primary allylic site. 
Hopefully, this work can shed useful light on the application of ML in generating chemical kinetic parameters of hydrocarbon combustion chemistry.
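
The two cross-validation schemes named in this abstract, leave-one-group-out (LOGO) and K-fold, differ only in how the test indices are chosen. The sketch below shows both as plain index generators. Grouping the samples by alkene and the group labels themselves are assumptions made for illustration, since the abstract does not say how groups were defined, and the K-fold version neither shuffles nor repeats, unlike the repeated 10-fold procedure the authors report.

```python
def leave_one_group_out(groups):
    """Yield (train_idx, test_idx) pairs, holding out each distinct group once."""
    for held_out in sorted(set(groups)):
        test = [i for i, g in enumerate(groups) if g == held_out]
        train = [i for i, g in enumerate(groups) if g != held_out]
        yield train, test

def k_fold(n_samples, k):
    """Yield (train_idx, test_idx) pairs for k contiguous folds of near-equal size."""
    indices = list(range(n_samples))
    # The first (n_samples % k) folds get one extra sample.
    fold_sizes = [n_samples // k + (1 if f < n_samples % k else 0) for f in range(k)]
    start = 0
    for size in fold_sizes:
        test = indices[start:start + size]
        train = indices[:start] + indices[start + size:]
        yield train, test
        start += size

# Hypothetical group labels, one per sample.
groups = ["propene", "propene", "1-butene", "1-butene", "1-butene", "isobutene"]
logo_splits = list(leave_one_group_out(groups))
kfold_splits = list(k_fold(len(groups), 3))
```

LOGO tests the model on a group it has never seen, so it is usually the harsher of the two estimates, which is consistent with the larger deviation the abstract reports for LOGO than for 10-fold cross-validation.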

Uddin S, Lu H. Dataset meta-level and statistical features affect machine learning performance. Sci Rep 2024;14:1670. [PMID: 38238551 PMCID: PMC10796674 DOI: 10.1038/s41598-024-51825-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/18/2023] [Accepted: 01/09/2024] [Indexed: 01/22/2024] Open

Abstract

Which dataset features affect machine learning (ML) performance has largely been unknown in the current literature. This study examines the impact of tabular datasets' different meta-level and statistical features on the performance of various ML algorithms. The three meta-level features this study considered are the dataset size, the number of attributes and the ratio between the positive (class 1) and negative (class 0) class instances. It considered four statistical features for each dataset: mean, standard deviation, skewness and kurtosis. After applying the required scaling, this study averaged (uniform and weighted) each dataset's different attributes to quantify its four statistical features. We analysed 200 open-access tabular datasets from Kaggle (147) and the UCI Machine Learning Repository (53) and developed ML classification models (through classification implementation and hyperparameter tuning) for each dataset. Then, this study developed multiple regression models to explore the impact of dataset features on ML performance. We found that kurtosis has a statistically significant negative effect on the accuracy of the three non-tree-based ML algorithms, Support vector machine (SVM), Logistic regression (LR) and K-nearest neighbour (KNN), for their classical implementation with both uniform and weighted aggregations. This study observed similar findings in most cases for ML implementations through hyperparameter tuning, except for SVM with weighted aggregation. Meta-level and statistical features barely show any statistically significant impact on the accuracy of the two tree-based ML algorithms (Decision tree and Random forest), except for implementation through hyperparameter tuning for the weighted aggregation. 
When we excluded some datasets based on the imbalanced statistics and a significantly higher contribution of one attribute compared to others to the classification performance, we found a significant effect of the meta-level ratio feature and statistical mean and standard deviation features on SVM, LR and KNN accuracy in many cases. Our findings open a new research direction in understanding how dataset characteristics affect ML performance and will help researchers select appropriate ML algorithms for a possible optimal accuracy outcome.
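
A minimal sketch of the four per-dataset statistical features described in this abstract (mean, standard deviation, skewness and kurtosis), computed per attribute and then uniformly averaged across attributes. It assumes population moments and non-excess (Pearson) kurtosis; the function names are hypothetical, the scaling step and the weighted aggregation are omitted because the abstract does not specify them, and a constant attribute would cause a division by zero.

```python
def moments(values):
    """Return (mean, std, skewness, kurtosis) using population moments."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    m4 = sum((v - mean) ** 4 for v in values) / n
    std = m2 ** 0.5
    skewness = m3 / m2 ** 1.5
    kurtosis = m4 / m2 ** 2  # non-excess: a normal distribution gives 3
    return mean, std, skewness, kurtosis

def dataset_features(columns):
    """Uniformly average each of the four statistics across all attributes."""
    per_attribute = [moments(col) for col in columns]
    n_attrs = len(per_attribute)
    return tuple(sum(stats[k] for stats in per_attribute) / n_attrs
                 for k in range(4))

# Toy dataset with two attributes (made-up values).
columns = [[1, 2, 3, 4, 5], [2, 4, 6, 8, 10]]
features = dataset_features(columns)
```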

Sun W, Mo Z, Li Y, Xiao J, Jia L, Huang S, Liao C, Du J, He S, Chen L, Zhang W, Yang X. Machine learning-based ensemble prediction model for the gamma passing rate of VMAT-SBRT plan. Phys Med 2024;117:103204. [PMID: 38154373 DOI: 10.1016/j.ejmp.2023.103204] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/09/2023] [Revised: 10/29/2023] [Accepted: 12/21/2023] [Indexed: 12/30/2023] Open

Ghosh SK, Khandoker AH. A machine learning driven nomogram for predicting chronic kidney disease stages 3-5. Sci Rep 2023;13:21613. [PMID: 38062134 PMCID: PMC10703939 DOI: 10.1038/s41598-023-48815-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/27/2023] [Accepted: 11/30/2023] [Indexed: 12/18/2023] Open

Budiarto A, Tsang KCH, Wilson AM, Sheikh A, Shah SA. Machine Learning-Based Asthma Attack Prediction Models From Routinely Collected Electronic Health Records: Systematic Scoping Review. JMIR AI 2023;2:e46717. [PMID: 38875586 PMCID: PMC11041490 DOI: 10.2196/46717] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/23/2023] [Revised: 09/28/2023] [Accepted: 10/09/2023] [Indexed: 06/16/2024]

Abstract

BACKGROUND

An early warning tool to predict attacks could enhance asthma management and reduce the likelihood of serious consequences. Electronic health records (EHRs) providing access to historical data about patients with asthma coupled with machine learning (ML) provide an opportunity to develop such a tool. Several studies have developed ML-based tools to predict asthma attacks.

OBJECTIVE

This study aims to critically evaluate ML-based models derived using EHRs for the prediction of asthma attacks.

METHODS

We systematically searched PubMed and Scopus (the search period was between January 1, 2012, and January 31, 2023) for papers meeting the following inclusion criteria: (1) used EHR data as the main data source, (2) used asthma attack as the outcome, and (3) compared ML-based prediction models' performance. We excluded non-English papers and nonresearch papers, such as commentary and systematic review papers. In addition, we also excluded papers that did not provide any details about the respective ML approach and its result, including protocol papers. The selected studies were then summarized across multiple dimensions including data preprocessing methods, ML algorithms, model validation, model explainability, and model implementation.

RESULTS

Overall, 17 papers were included at the end of the selection process. There was considerable heterogeneity in how asthma attacks were defined. Of the 17 studies, 8 (47%) studies used routinely collected data both from primary care and secondary care practices together. Extreme imbalanced data was a notable issue in most studies (13/17, 76%), but only 38% (5/13) of them explicitly dealt with it in their data preprocessing pipeline. The gradient boosting-based method was the best ML method in 59% (10/17) of the studies. Of the 17 studies, 14 (82%) studies used a model explanation method to identify the most important predictors. None of the studies followed the standard reporting guidelines, and none were prospectively validated.

CONCLUSIONS

Our review indicates that this research field is still underdeveloped, given the limited body of evidence, heterogeneity of methods, lack of external validation, and suboptimally reported models. We highlighted several technical challenges (class imbalance, external validation, model explanation, and adherence to reporting guidelines to aid reproducibility) that need to be addressed to make progress toward clinical adoption.

Ali L, Sivaramakrishnan K, Kuttiyathil MS, Chandrasekaran V, Ahmed OH, Al-Harahsheh M, Altarawneh M. Prediction of Thermogravimetric Data in the Thermal Recycling of e-waste Using Machine Learning Techniques: A Data-driven Approach. ACS Omega 2023;8:43254-43270. [PMID: 38024703 PMCID: PMC10652257 DOI: 10.1021/acsomega.3c07228] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/20/2023] [Revised: 10/12/2023] [Accepted: 10/17/2023] [Indexed: 12/01/2023]

Abstract

The release of bromine-free hydrocarbons and gases is a major challenge faced in the thermal recycling of e-waste due to the corrosive effects of produced HBr. Metal oxides such as Fe2O3 (hematite) are excellent debrominating agents, and they are copyrolyzed along with tetrabromophenol (TBP), a lesser-used brominated flame retardant that is a constituent of printed circuit boards in electronic equipment. The pyrolytic (N2) and oxidative (O2) decomposition of TBP with Fe2O3 has been previously investigated with thermogravimetric analysis (TGA) at four different heating rates of 5, 10, 15, and 20 °C/min, and the mass loss data between room temperature and 800 °C were reported. The objective of our paper is to study the effectiveness of machine learning (ML) techniques to reproduce these TGA data so that the use of the instrument can be eliminated to enhance the potential of online monitoring of copyrolysis in e-waste treatment. This will reduce experimental and human errors as well as improve process time significantly. TGA data are both nonlinear and multidimensional, and hence, nonlinear regression techniques such as random forest (RF) and gradient boosting regression (GBR) showed the highest prediction accuracies of 0.999 and lowest prediction errors among all the ML models employed in this work. The large data sets allowed us to explore three different scenarios of model training and validation, where the number of training samples was varied from 10,000 to 40,000 for both TBP and TBP + hematite samples under N2 (pyrolysis) and O2 (combustion) environments. The novelty of our study is that ML techniques have not been employed for the copyrolysis of these compounds, while the significance is the excellent potential of enhanced online monitoring of e-waste treatment and extension to other characterization techniques such as spectroscopy and chromatography. 
Lastly, e-waste recycling could greatly benefit from ML applications since they have the potential to reduce total and operational costs and improve overall process time and efficiency, thereby encouraging more treatment plants to adopt these techniques and reducing the growing environmental footprint of e-waste.

He Q, Zhang H, Li T, Zhang X, Li X, Dong C. NIR Spectral Inversion of Soil Physicochemical Properties in Tea Plantations under Different Particle Size States. Sensors (Basel, Switzerland) 2023;23:9107. [PMID: 38005495 PMCID: PMC10675699 DOI: 10.3390/s23229107] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/06/2023] [Revised: 11/05/2023] [Accepted: 11/09/2023] [Indexed: 11/26/2023]

Lin R, Peng B, Li L, He X, Yan H, Tian C, Luo H, Yin G. Application of serum Raman spectroscopy combined with classification model for rapid breast cancer screening. Front Oncol 2023;13:1258436. [PMID: 37965448 PMCID: PMC10640987 DOI: 10.3389/fonc.2023.1258436] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/14/2023] [Accepted: 10/13/2023] [Indexed: 11/16/2023] Open

Sancar N, Tabrizi SS. Machine learning approach for the detection of vitamin D level: a comparative study. BMC Med Inform Decis Mak 2023;23:219. [PMID: 37845674 PMCID: PMC10580577 DOI: 10.1186/s12911-023-02323-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 03/15/2023] [Accepted: 10/03/2023] [Indexed: 10/18/2023] Open

Abstract

BACKGROUND

After the World Health Organization declared the COVID-19 pandemic, the role of vitamin D became even more critical for people worldwide. The most accurate way to determine vitamin D level is the 25-hydroxy vitamin D (25-OH-D) blood test. However, this blood test is not always feasible. Most data sets used in health science research contain highly correlated features, which is referred to as the multicollinearity problem. This problem can lead to misleading results and overfitting in the ML training process. Therefore, the proposed study aims to determine a clinically acceptable ML model that accurately detects the vitamin D status of adult participants in North Cyprus without the need to determine the 25-OH-D level, taking into account the multicollinearity problem.

METHOD

The study was conducted with 481 observations from participants who applied voluntarily to the Internal Medicine Department at NEU Hospital. The classification performance of four conventional supervised ML models, namely ordinal logistic regression (OLR), elastic-net ordinal regression (ENOR), support vector machine (SVM), and random forest (RF), was compared. The comparative analysis covers the models' sensitivity to the participants' metabolic syndrome (MtS) positive status, hyper-parameter tuning, sensitivity to the size of the training data, and classification performance.

RESULTS

The findings showed that, owing to the presence of multicollinearity, the performance of the SVM (RBF) is clearly degraded when the test results are examined. RF is also clearly more robust than the other models when the size of the training data is varied. In this experiment, the selected RF and ENOR performed better than the other two models when the number of training samples was reduced. Since multicollinearity is more severe in small samples, it can be concluded that RF and ENOR are not affected by the multicollinearity problem. The comparative analysis revealed that the RF classifier performed better and was more robust than the other proposed models in terms of accuracy (0.94), specificity (0.96), sensitivity or recall (0.94), precision (0.95), F1-score (0.95), and Cohen's kappa (0.90).

CONCLUSION

RF clearly performed better than the SVM (RBF), ENOR, and OLR. These comparison findings will be applied to develop an intelligent vitamin D level detection system that uses routine clinical and biochemical tests and lifestyle characteristics of individuals to decrease the cost and time of vitamin D level detection.
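The metrics reported in the results above (accuracy, specificity, sensitivity, precision, F1-score, Cohen's kappa) can all be derived from a confusion matrix. The following is a minimal sketch for the two-class case only; the study itself compares ordinal models, so the binary setting, the function name `binary_metrics`, and the example labels are illustrative assumptions rather than the authors' procedure.

```python
def binary_metrics(y_true, y_pred):
    """Compute common classification metrics for 0/1 labels.

    Assumes both classes occur in y_true and in y_pred, so that no
    denominator below is zero.
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives
    n = tp + tn + fp + fn

    accuracy = (tp + tn) / n
    sensitivity = tp / (tp + fn)   # also called recall
    specificity = tn / (tn + fp)
    precision = tp / (tp + fp)
    f1 = 2 * precision * sensitivity / (precision + sensitivity)

    # Cohen's kappa: observed agreement corrected for the agreement
    # expected by chance from the row and column totals.
    p_chance = ((tp + fp) * (tp + fn) + (tn + fn) * (tn + fp)) / (n * n)
    kappa = (accuracy - p_chance) / (1 - p_chance)

    return {
        "accuracy": accuracy,
        "sensitivity": sensitivity,
        "specificity": specificity,
        "precision": precision,
        "f1": f1,
        "kappa": kappa,
    }


# Hypothetical labels, not data from the study.
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 0, 0, 0, 0, 0, 1]
print(binary_metrics(y_true, y_pred))
```

Kappa is lower than accuracy here because part of the raw agreement would be expected by chance alone.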


Wang G, Zeng M, Li J, Liu Y, Wei D, Long Z, Chen H, Zang X, Yang J. Neural Representation of Collective Self-esteem in Resting-state Functional Connectivity and its Validation in Task-dependent Modality. Neuroscience 2023;530:66-78. [PMID: 37619767 DOI: 10.1016/j.neuroscience.2023.08.017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2023] [Revised: 08/01/2023] [Accepted: 08/09/2023] [Indexed: 08/26/2023]

Abstract

INTRODUCTION

Collective self-esteem (CSE) is an important personality variable, defined as self-worth derived from membership in social groups. A study explored the neural basis of CSE using a task-based functional magnetic resonance imaging (fMRI) paradigm; however, the task-independent neural basis of CSE remains to be explored, and it is unclear whether the neural basis of CSE in resting-state fMRI is consistent with that in task-based fMRI.

METHODS

We built support vector regression (SVR) models to predict CSE scores using topological metrics measured in the resting-state functional connectivity (RSFC) network as features. Then, to test the reliability of the SVR analysis, the activation patterns of the brain regions identified by the SVR analysis were used as features to distinguish collective self-worth from other conditions by multivariate pattern classification in a task-based fMRI dataset.

RESULTS

SVR analysis results showed that leverage centrality successfully decoded the individual differences in CSE. The ventromedial prefrontal cortex, anterior cingulate cortex, posterior cingulate gyrus, precuneus, orbitofrontal cortex, posterior insula, postcentral gyrus, inferior parietal lobule, temporoparietal junction, and inferior frontal gyrus, which are involved in self-referential processing, affective processing, and social cognition networks, participated in this prediction. Multivariate pattern classification analysis found that the activation pattern of the identified regions from the SVR analysis successfully distinguished collective self-worth from relational self-worth, personal self-worth and semantic control.

CONCLUSION

Our findings revealed the neural basis of CSE in the whole-brain RSFC network and established the concordance between leverage centrality and the activation pattern (evoked during the collective self-worth task) of the identified regions in representing CSE.
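For readers unfamiliar with SVR, the core idea is the epsilon-insensitive loss: prediction errors smaller than a tolerance epsilon cost nothing, and larger errors are penalized linearly. The snippet below is a generic sketch of that loss with made-up scores; the value of `epsilon` and the example numbers are assumptions, and nothing here reproduces the authors' model or features.

```python
def epsilon_insensitive_loss(y_true, y_pred, epsilon=0.1):
    """Per-sample epsilon-insensitive loss used in support vector regression.

    Errors with absolute value up to `epsilon` are ignored; beyond that,
    the loss grows linearly with the size of the error.
    """
    return [max(0.0, abs(t - p) - epsilon) for t, p in zip(y_true, y_pred)]


# Hypothetical observed and predicted scores.
observed = [1.0, 2.0, 3.0]
predicted = [1.05, 2.5, 2.0]
losses = epsilon_insensitive_loss(observed, predicted, epsilon=0.1)
print(losses)  # the first sample lies inside the tolerance tube, so its loss is 0.0
```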


Bian X, Zhao Z, Liu J, Liu P, Shi H, Tan X. Discretized butterfly optimization algorithm for variable selection in the rapid determination of cholesterol by near-infrared spectroscopy. ANALYTICAL METHODS : ADVANCING METHODS AND APPLICATIONS 2023;15:5190-5198. [PMID: 37779476 DOI: 10.1039/d3ay01636f] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/03/2023]

Abstract

The blood cholesterol level is strongly associated with cardiovascular disease. It is necessary to develop a rapid method to determine the cholesterol concentration of blood. In this study, a discretized butterfly optimization algorithm-partial least squares (BOA-PLS) method combined with near-infrared (NIR) spectroscopy is proposed for the first time for rapid determination of the cholesterol concentration in blood. In discretized BOA, each element of the butterfly vector is 1 or 0, which represents whether the corresponding variable is selected or not, respectively. In the optimization process, four transfer functions, i.e., arctangent, V-shaped, improved arctangent (I-atan) and improved V-shaped (I-V), are introduced and compared for discretization of the butterfly position. The partial least squares (PLS) model is established between the selected NIR variables and cholesterol concentrations. The iteration number, transfer functions and the performance of butterflies are investigated. The proposed method is compared with full-spectrum PLS, multiplicative scatter correction-PLS (MSC-PLS), max-min scaling-PLS (MMS-PLS), MSC-MMS-PLS, uninformative variable elimination-PLS (UVE-PLS), Monte Carlo uninformative variable elimination-PLS (MCUVE-PLS) and randomization test-PLS (RT-PLS). Results show that the I-V function is the best transfer function for discretization. Both preprocessing and variable selection can improve the prediction performance of PLS. Variable selection methods based on BOA are better than those based on statistics. Furthermore, I-V-BOA-PLS has the highest predictive accuracy among the seven variable selection methods. MSC-MMS can further improve the prediction ability of I-V-BOA-PLS. Therefore, BOA-PLS combined with NIR spectroscopy is promising for the rapid determination of cholesterol concentration in blood.
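The discretization step described above, in which a continuous butterfly position is mapped to a 0/1 variable-selection mask through a transfer function, can be sketched as follows. The abstract does not give the formulas, so the V-shaped form |tanh(x)|, the arctangent form |(2/π)·atan((π/2)·x)|, and the rule "select a variable with probability T(x)" are commonly used textbook choices assumed here for illustration; the improved I-atan and I-V variants are not reproduced.

```python
import math
import random


def v_shaped(x):
    """Assumed V-shaped transfer function: maps any real x into [0, 1]."""
    return abs(math.tanh(x))


def arctangent(x):
    """Assumed arctangent transfer function: maps any real x into [0, 1)."""
    return abs((2.0 / math.pi) * math.atan((math.pi / 2.0) * x))


def discretize(position, transfer, rng):
    """Turn a continuous position vector into a 0/1 selection mask.

    Assumed rule: variable i is selected (1) with probability
    transfer(position[i]); otherwise it is left out (0).
    """
    return [1 if rng.random() < transfer(x) else 0 for x in position]


# Hypothetical continuous position for five spectral variables.
rng = random.Random(42)
position = [-2.0, -0.3, 0.0, 0.4, 3.0]
mask = discretize(position, v_shaped, rng)
selected = [i for i, bit in enumerate(mask) if bit == 1]
print(mask, selected)
```

In a full variable-selection loop, the selected indices would pick the NIR wavelengths passed to the PLS model, and the model's prediction error would serve as the fitness of that butterfly.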


Keshtehgar A, Dahmardeh M, Ghanbari A, Khammari I. Prediction models of macro-nutrient content in plant organs of Cucumis melo in response to soil elements using support vector regression. PeerJ 2023;11:e15417. [PMID: 37810792 PMCID: PMC10552743 DOI: 10.7717/peerj.15417] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2022] [Accepted: 04/24/2023] [Indexed: 10/10/2023] Open

Abstract

Background

Food and food security are widely recognized as present and future challenges. The development of methods for monitoring the nutrient content of crop products is therefore essential for implementing sound management of soil properties. Modeling techniques can evaluate the soil properties of fields and relate crop yield to soil management. This study aims to predict fruit yield and macro-nutrient content in plant organs of Cucumis melo in response to soil elements using support vector regression (SVR).

Methodology

In the spring of 2020, this study was conducted as a factorial experiment in a randomized complete block design with three replications. The first factor was fertilizer use at six levels: no fertilizer (control), cow manure (30 t ha⁻¹), sheep manure (30 t ha⁻¹), nanobiomic foliar application (2 l ha⁻¹), silicone foliar application (3 l ha⁻¹), and chemical fertilizer from urea, triple superphosphate, and potassium sulfate sources (200, 100, and 150 kg ha⁻¹). The second factor was vermicompost at four levels: no vermicompost (control), 5, 10, and 15 t ha⁻¹. Input data sets such as fruit yield and nitrogen, phosphorus, and potassium levels in the seeds, fruits, leaves, and roots were used to calibrate the probabilistic model of SP using SVR.

Results

According to the results, when the nitrogen, phosphorus, and potassium data sets for the fruit were used as input, the accuracy of the models was higher than 80.0% (R² = 0.807 for predicting fruit nitrogen; R² = 0.999 for fruit phosphorus; R² = 0.968 for fruit potassium). Also, the results of the prediction models in response to soil elements showed that the soil nitrogen content ranged from 0.05 to 1.1%, soil phosphorus from 10 to 59 mg kg⁻¹, and soil potassium from 180 to 320 mg kg⁻¹, which offers a suitable macro-nutrient content in the soil. Likewise, the best fruit nitrogen content ranged from 1.27 to 4.33%, fruit phosphorus from 15.74 to 26.19%, fruit potassium from 15.19 to 19.67%, and fruit yield from 2.16 to 5.95 kg per plant, obtained under NPK chemical fertilizers and using 15 t ha⁻¹ of vermicompost.

Conclusions

Because the fruit values contributed more to the prediction than the other observed values, the fruit was identified as the best plant organ in response to soil elements. Based on our findings, fruit phosphorus was identified as a determinant that strongly influenced the melon prediction models. Higher values of soil elements do not further increase fruit yield and macro-nutrient content in plant organs, and excessive application may not be economical. Therefore, our study provides an efficient approach with potentially high accuracy to estimate fruit yield and macro-nutrient content in the fruits of Cucumis melo in response to soil elements, and it can save fertilizer during the growing season.


Chen X, He L, Shi K, Wu Y, Lin S, Fang Y. Interpretable Machine Learning for Fall Prediction Among Older Adults in China. Am J Prev Med 2023;65:579-586. [PMID: 37087076 DOI: 10.1016/j.amepre.2023.04.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/20/2022] [Revised: 04/14/2023] [Accepted: 04/14/2023] [Indexed: 04/24/2023]

Chhoa H, Chabriat H, Anato AJ, Bamba M, Zittoun F, Chevret S, Biard L. Improvement of an External Predictive Model Based on New Information Using a Synthetic Data Approach: Application to CADASIL. Neurol Genet 2023;9:e200091. [PMID: 38235365 PMCID: PMC10691224 DOI: 10.1212/nxg.0000000000200091] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2023] [Accepted: 06/07/2023] [Indexed: 01/19/2024]

Abstract

Background and Objectives

Cerebral autosomal dominant arteriopathy with subcortical infarcts and leukoencephalopathy (CADASIL) is the most frequent hereditary cerebral small vessel disease. It is caused by mutations of the NOTCH3 gene. The disease evolves progressively over decades leading to stroke, disability, cognitive decline, and functional dependency. The course and clinical severity of CADASIL seem heterogeneous. Predictive models are thus needed to improve prognostic evaluation and inform future clinical trials. A predictive model of the 3-year variation in the Mattis Dementia Rating Scale (MDRS), which reflects the global cognitive performance of patients with CADASIL, was previously proposed. This model made predictions based on demographic, clinical, and MRI data. We aimed to improve this existing predictive model by integrating a new potential factor, the location of the genetic mutation in the different epidermal growth factor (EGFr) domains of the NOTCH3 gene, dichotomized into EGFr domains 1 to 6 or 7 to 34.

Methods

We used a new synthetic data approach to improve the initial predictive model by incorporating additional genetic information. This method combined the predicted outcomes from the previous model and 5 "synthetic" data sets with the observed outcome in a new data set. We then applied a multiple imputation method for missing data on the mutation location.

Results

The new data set included 367 patients who were followed up for 30 to 42 months. In the multivariable model with synthetic data, patients with NOTCH3 mutations in EGFr domains 7 to 34 had an additional average decrease of -1.4 points (standard error 0.67, p = 0.035) in their MDRS score variation over 3 years compared with patients with mutations located in EGFr domains 1 to 6. Cross-validation results highlighted the improved predictive performance of the enhanced model. Moreover, the model estimation was found to be more robust than fitting a model without synthetic data.

Discussion

The use of synthetic data improved the predictive model of MDRS change over 3 years in CADASIL. The predictive performance and estimation robustness of the predictive model were enhanced using this approach, whether or not genetic information was used. A statistically significant association between the location of the mutation in the NOTCH3 gene and the 3-year MDRS score variation was detected.


Affiliation(s)

Henri Chhoa, Hugues Chabriat, Adelina Joanita Anato, Mamadou Bamba, Florent Zittoun, Sylvie Chevret, Lucie Biard: From the ECSTRRA Team (H. Chhoa, S.C., L.B.), Université Paris-Cité, UMR1153, INSERM; Translational Neurovascular Centre (H. Chabriat), GH Saint-Louis-Lariboisière, Assistance Publique des Hôpitaux de Paris APHP, Université Paris-Cité and DHU NeuroVasc Sorbonne Paris-Cité; UMR 1161 (H. Chabriat), INSERM; and ENSAI (A.J.A., M.B., F.Z.), Ecole d'ingénieur statistique, data science et big data, Bruz, France


Rácz A, Vincze A, Volk B, Balogh GT. Extending the limitations in the prediction of PAMPA permeability with machine learning algorithms. Eur J Pharm Sci 2023;188:106514. [PMID: 37402429 DOI: 10.1016/j.ejps.2023.106514] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2023] [Revised: 06/21/2023] [Accepted: 07/01/2023] [Indexed: 07/06/2023]

Li W, Shao C, Li C, Zhou H, Yu L, Yang J, Wan H, He Y. Metabolomics: A useful tool for ischemic stroke research. J Pharm Anal 2023;13:968-983. [PMID: 37842657 PMCID: PMC10568109 DOI: 10.1016/j.jpha.2023.05.015] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2023] [Revised: 05/14/2023] [Accepted: 05/29/2023] [Indexed: 10/17/2023] Open

Budiarto A, Sheikh A, Wilson A, Price DB, Shah SA. Handling Class Imbalance in Machine Learning-based Prediction Models: A Case Study in Asthma Management. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2023;2023:1-5. [PMID: 38083129 DOI: 10.1109/embc40787.2023.10340751] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/18/2023]

Abstract

A data-driven prediction tool has the potential to provide early warning of an asthma attack and improve asthma management and outcomes. Most previous machine learning (ML)-based studies for asthma attack prediction have reported a severe class imbalance, with major implications for model performance. We aimed to undertake a systematic comparison of several class imbalance handling techniques in the context of risk prediction models for asthma prognosis. We used data from 9,835 asthma patients extracted from the Medical Information Mart for Intensive Care (MIMIC) IV database and deployed five class imbalance handling methods based on synthetic minority oversampling technique (SMOTE) and cost function customisation. We then compared their performances in improving two-class classifier models developed using logistic regression (LR) and extreme gradient boosting (XGBoost) for three different prediction tasks with varying severity of class imbalance (proportion of majority class ranging from 90.86% to 98.98%). The cost function customisation technique substantially outperformed the SMOTE-based methods in all tasks. XGBoost combined with cost function customisation achieved the highest prediction performance for the outcome with the most extreme class imbalance ratio (AUC = 0.72). Our findings suggest that the cost function customisation-based approach to tackle class imbalance provides substantially better performance compared to oversampling in the context of asthma management. Clinical Relevance: This study underscores the challenge of class imbalance in the context of prediction tools to improve asthma management and outcomes and provides a methodological solution that addresses the challenge. Accurate asthma prediction tools can provide early warning and potentially prevent deterioration, thereby improving the quality of life of patients with asthma.
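One common form of cost function customisation is to weight the loss of the minority (positive) class more heavily, so that missing a rare event costs more than a false alarm. The abstract does not state which customisation the authors used, so the weighted log loss and the negatives-to-positives weight below are illustrative assumptions; XGBoost's `scale_pos_weight` parameter applies a similar idea, but whether it was used in the study is not stated.

```python
import math


def balanced_pos_weight(y_true):
    """Heuristic weight for the positive class: negatives / positives."""
    n_pos = sum(1 for y in y_true if y == 1)
    n_neg = len(y_true) - n_pos
    return n_neg / n_pos


def weighted_log_loss(y_true, p_pred, pos_weight=1.0):
    """Mean binary cross-entropy with extra weight on positive samples.

    With pos_weight=1.0 this is the ordinary log loss; larger values
    make errors on the rare positive class more expensive.
    """
    tiny = 1e-12  # keeps log() away from 0
    total = 0.0
    for y, p in zip(y_true, p_pred):
        p = min(max(p, tiny), 1.0 - tiny)
        total += -(pos_weight * y * math.log(p) + (1 - y) * math.log(1.0 - p))
    return total / len(y_true)


# Hypothetical imbalanced labels (1 = attack) and predicted probabilities.
labels = [1, 0, 0, 0, 0, 0, 0, 0, 0, 0]
probs = [0.3, 0.1, 0.2, 0.1, 0.05, 0.1, 0.2, 0.1, 0.1, 0.15]
w = balanced_pos_weight(labels)
print(w, weighted_log_loss(labels, probs), weighted_log_loss(labels, probs, w))
```

Unlike oversampling, this approach leaves the training data unchanged and only alters how errors are scored.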


Pchitskaya E, Vasiliev P, Smirnova D, Chukanov V, Bezprozvanny I. SpineTool is an open-source software for analysis of morphology of dendritic spines. Sci Rep 2023;13:10561. [PMID: 37386071 PMCID: PMC10310755 DOI: 10.1038/s41598-023-37406-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2023] [Accepted: 06/21/2023] [Indexed: 07/01/2023] Open

Ashraf WM, Uddin GM, Tariq R, Ahmed A, Farhan M, Nazeer MA, Hassan RU, Naeem A, Jamil H, Krzywanski J, Sosnowski M, Dua V. Artificial Intelligence Modeling-Based Optimization of an Industrial-Scale Steam Turbine for Moving toward Net-Zero in the Energy Sector. ACS OMEGA 2023;8:21709-21725. [PMID: 37360426 PMCID: PMC10285957 DOI: 10.1021/acsomega.3c01227] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/24/2023] [Accepted: 05/16/2023] [Indexed: 06/28/2023]

Abstract

Augmentation of energy efficiency in the power generation systems can aid in decarbonizing the energy sector, which is also recognized by the International Energy Agency (IEA) as a solution to attain net-zero from the energy sector. With this reference, this article presents a framework incorporating artificial intelligence (AI) for improving the isentropic efficiency of a high-pressure (HP) steam turbine installed at a supercritical power plant. The data of the operating parameters taken from a supercritical 660 MW coal-fired power plant is well-distributed in the input and output spaces of the operating parameters. Based on hyperparameter tuning, two advanced AI modeling algorithms, i.e., artificial neural network (ANN) and support vector machine (SVM), are trained and, subsequently, validated. The ANN, which turned out to be the better-performing model, is utilized to conduct the Monte Carlo technique-based sensitivity analysis toward the HP turbine efficiency. Subsequently, the ANN model is deployed for evaluating the impact of individual operating parameters or combinations of them on the HP turbine efficiency under three real-power generation capacities of the power plant. The parametric study and nonlinear programming-based optimization techniques are applied to optimize the HP turbine efficiency. It is estimated that the HP turbine efficiency can be improved by 1.43, 5.09, and 3.40% as compared to that of the average values of input parameters for half-load, mid-load, and full-load power generation modes, respectively. The annual reduction in CO₂ measuring 58.3, 123.5, and 70.8 kilo ton/year (kt/y) corresponds to half-load, mid-load, and full load, respectively, and noticeable mitigation of SO₂, CH₄, N₂O, and Hg emissions is estimated for the three power generation modes of the power plant. The AI-based modeling and optimization analysis is conducted to enhance the operation excellence of the industrial-scale steam turbine that promotes higher-energy efficiency and contributes to the net-zero target from the energy sector.


Bu Y, Jiang X, Tian J, Hu X, Han L, Huang D, Luo H. Rapid nondestructive detecting of sorghum varieties based on hyperspectral imaging and convolutional neural network. JOURNAL OF THE SCIENCE OF FOOD AND AGRICULTURE 2023;103:3970-3983. [PMID: 36397181 DOI: 10.1002/jsfa.12344] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/06/2022] [Revised: 10/24/2022] [Accepted: 11/18/2022] [Indexed: 05/03/2023]

Abstract

BACKGROUND

The purity of sorghum varieties is an important indicator of the quality of raw materials used in the distillation of liquors. Different varieties of sorghum may be mixed during the acquisition process, which will affect the flavor and quality of liquor. To facilitate the rapid identification of sorghum varieties, this study proposes a sorghum variety identification model using hyperspectral imaging (HSI) technology combined with convolutional neural network (AlexNet).

RESULTS

First, the watershed algorithm, modified with the extended-maxima transform, was used to segment the hyperspectral images into single sorghum grains. The isolation forest algorithm was used to eliminate abnormal spectral data from the complete spectral data. Secondly, the AlexNet model of sorghum variety identification was established based on the two-dimensional gray image data of sorghum grain in group 1. The effects of different preprocessing methods and different convolution kernel sizes on the performance of the AlexNet model were discussed. The features from the last layer of the AlexNet model were visualized using t-distributed stochastic neighbor embedding (t-SNE) to evaluate the separability of the features extracted by the AlexNet model. The performance differences between the optimal AlexNet model and traditional machine learning models for sorghum variety identification were compared. Finally, the varieties of sorghum grains in groups 2 and 3 were identified based on the optimal AlexNet model, and the average accuracy values of the test set reached 95.62% and 95.91% respectively.

CONCLUSION


Guo D, He W, Wei L, Song Y, Qi J, Yao Y, Chen X, Huang J, Lu Y, Zhu X. The Zhu-Lu formula: a machine learning-based intraocular lens power calculation formula for highly myopic eyes. EYE AND VISION (LONDON, ENGLAND) 2023;10:26. [PMID: 37259154 DOI: 10.1186/s40662-023-00342-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/14/2022] [Accepted: 04/12/2023] [Indexed: 06/02/2023]

Affiliation(s)

Dongling Guo, Wenwen He, Ling Wei, Jiao Qi, Yunqian Yao, Yi Lu, Xiangjia Zhu: Eye Institute and Department of Ophthalmology, Eye & ENT Hospital, Fudan University, Shanghai, 200031, China; NHC Key Laboratory of Myopia, Fudan University, Shanghai, China; Key Laboratory of Myopia, Chinese Academy of Medical Science, Shanghai, China; Shanghai Key Laboratory of Visual Impairment and Restoration, Shanghai, China
Yunxiao Song: University of Illinois at Urbana-Champaign, Illinois, USA
Xu Chen: Shanghai Aier Eye Hospital, Shanghai, China
Jinhai Huang: Eye Institute and Department of Ophthalmology, Eye & ENT Hospital, Fudan University, Shanghai, 200031, China; Eye Hospital of Wenzhou Medical University, Wenzhou, China


Kavaliauskas A, Žydelis R, Castaldi F, Auškalnienė O, Povilaitis V. Predicting Maize Theoretical Methane Yield in Combination with Ground and UAV Remote Data Using Machine Learning. PLANTS (BASEL, SWITZERLAND) 2023;12:1823. [PMID: 37176880 PMCID: PMC10181051 DOI: 10.3390/plants12091823] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/25/2023] [Revised: 04/26/2023] [Accepted: 04/26/2023] [Indexed: 05/15/2023]

Lu CH, Li BQ, Jing Q, Pei D, Huang XY. A classification and identification model of extra virgin olive oil adulterated with other edible oils based on pigment compositions and support vector machine. Food Chem 2023;420:136161. [PMID: 37080110 DOI: 10.1016/j.foodchem.2023.136161] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2023] [Revised: 04/04/2023] [Accepted: 04/11/2023] [Indexed: 04/22/2023]

Miao J, Chen Z, Zhang Z, Wang Z, Wang Q, Zhang Z, Pan Y. A web tool for the global identification of pig breeds. Genet Sel Evol 2023;55:18. [PMID: 36944938 PMCID: PMC10029154 DOI: 10.1186/s12711-023-00788-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2022] [Accepted: 02/14/2023] [Indexed: 03/23/2023] Open

Lakhouit A, Shaban M, Alatawi A, Abbas SYH, Asiri E, Al Juhni T, Elsawy M. Machine-learning approaches in geo-environmental engineering: Exploring smart solid waste management. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2023;330:117174. [PMID: 36586367 DOI: 10.1016/j.jenvman.2022.117174] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/07/2022] [Revised: 12/19/2022] [Accepted: 12/28/2022] [Indexed: 06/17/2023]

Abstract

Over the past few decades, increased attention has been paid to domestic waste (DW) generation. DW comprises a large percentage of municipal solid waste (MSW), and its handling and processing involves serious technical issues while also consuming a major portion of municipal budgets. The accurate estimation, prediction, and characterization of DW is an ongoing challenge for many cities, municipalities, and local governments as they strive to implement sustainable strategies for MSW. The main objective of the present study is to estimate and correctly predict DW quantities using machine-learning (ML) algorithms. Several different ML algorithms are used in the research, including linear regression, regression trees, Gaussian process regression, support vector machine, and autoregressive integrated moving average methods for time series analysis. Two case studies are presented in this paper. In the first, domestic waste data covering the period from 2010 to 2021 were collected from the Saudi and Bahrain authorities, and in the second, the domestic waste-generating behavior of a family of eleven members was followed for one month. The results show that the biodegradable and non-biodegradable wastes generated by the family were in the range of 1.7-7.9 kg and 0.0-2.0 kg, respectively, and promising outcomes were obtained using an appropriate selection of input predictors in conjunction with time series analysis. The trained models are validated and tested using several types of evaluation metrics, including calculated residuals, mean square error, root mean square error, and the coefficient of determination (R² score). The R² values are in the range of 0.67-0.85 for the training and testing datasets for many of the predicted waste quantities. The results obtained from the study show that these algorithms can be used to reduce the environmental, economic, and societal impacts of waste by designing a smart waste management engineering system.
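The evaluation metrics named above (residuals, mean square error, root mean square error, and the coefficient of determination) follow directly from observed and predicted values. The sketch below uses the standard definitions; the function name `regression_metrics` and the waste quantities are hypothetical and not taken from the study.

```python
import math


def regression_metrics(y_true, y_pred):
    """Residuals, MSE, RMSE and R² for paired observed/predicted values.

    Assumes y_true is not constant, so the total sum of squares is nonzero.
    """
    n = len(y_true)
    residuals = [t - p for t, p in zip(y_true, y_pred)]
    ss_res = sum(r * r for r in residuals)           # residual sum of squares
    mean_true = sum(y_true) / n
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)  # total sum of squares

    mse = ss_res / n
    rmse = math.sqrt(mse)
    r2 = 1.0 - ss_res / ss_tot
    return {"residuals": residuals, "mse": mse, "rmse": rmse, "r2": r2}


# Hypothetical daily waste quantities in kg.
observed = [1.0, 2.0, 3.0, 4.0]
predicted = [1.5, 2.0, 3.0, 3.5]
print(regression_metrics(observed, predicted))
```

An R² of 1 means perfect prediction, while 0 means the model does no better than always predicting the mean of the observed values.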

Jafari SM, Nikoo MR, Sadegh M, Chen M, Gandomi AH. Non-parametric severity-duration-frequency analysis of drought based on satellite-based product and model fusion techniques. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2023;30:42087-42107. [PMID: 36645590 DOI: 10.1007/s11356-023-25235-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/11/2022] [Accepted: 01/06/2023] [Indexed: 06/17/2023]

Chen X, Lin S, Zheng Y, He L, Fang Y. Long-term trajectories of depressive symptoms and machine learning techniques for fall prediction in older adults: Evidence from the China Health and Retirement Longitudinal Study (CHARLS). Arch Gerontol Geriatr 2023;111:105012. [PMID: 37030148 DOI: 10.1016/j.archger.2023.105012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2022] [Revised: 03/27/2023] [Accepted: 03/29/2023] [Indexed: 04/01/2023]

Olatunji SO, Alsheikh N, Alnajrani L, Alanazy A, Almusairii M, Alshammasi S, Alansari A, Zaghdoud R, Alahmadi A, Basheer Ahmed MI, Ahmed MS, Alhiyafi J. Comprehensible Machine-Learning-Based Models for the Pre-Emptive Diagnosis of Multiple Sclerosis Using Clinical Data: A Retrospective Study in the Eastern Province of Saudi Arabia. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2023;20:4261. [PMID: 36901273 PMCID: PMC10002108 DOI: 10.3390/ijerph20054261] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/27/2023] [Revised: 02/22/2023] [Accepted: 02/24/2023] [Indexed: 06/18/2023]

Abstract

Multiple Sclerosis (MS) is characterized by chronic deterioration of the nervous system, mainly the brain and the spinal cord. An individual with MS develops the condition when the immune system begins attacking nerve fibers and the myelin sheathing that covers them, affecting the communication between the brain and the rest of the body and eventually causing permanent damage to the nerve. Patients with MS (pwMS) might experience different symptoms depending on which nerve was damaged and how much damage it has sustained. Currently, there is no cure for MS; however, there are clinical guidelines that help control the disease and its accompanying symptoms. Additionally, no specific laboratory biomarker can precisely identify the presence of MS, leaving specialists with a differential diagnosis that relies on ruling out other possible diseases with similar symptoms. Since the emergence of Machine Learning (ML) in the healthcare industry, it has become an effective tool for uncovering hidden patterns that aid in diagnosing several ailments. Several studies have been conducted to diagnose MS using ML and Deep Learning (DL) models trained using MRI images, achieving promising results. However, complex and expensive diagnostic tools are needed to collect and examine imaging data. Thus, the intention of this study is to implement a cost-effective, clinical data-driven model that is capable of diagnosing pwMS. The dataset was obtained from King Fahad Specialty Hospital (KFSH) in Dammam, Saudi Arabia. Several ML algorithms were compared, namely Support Vector Machine (SVM), Decision Tree (DT), Logistic Regression (LR), Random Forest (RF), Extreme Gradient Boosting (XGBoost), Adaptive Boosting (AdaBoost), and Extra Trees (ET). The results indicated that the ET model outpaced the rest with an accuracy of 94.74%, recall of 97.26%, and precision of 94.67%.

Keshtegar B, Piri J, Asnida Abdullah R, Hasanipanah M, Muayad Sabri Sabri M, Nguyen Le B. Intelligent ground vibration prediction in surface mines using an efficient soft computing method based on field data. Front Public Health 2023;10:1094771. [PMID: 36817184 PMCID: PMC9929182 DOI: 10.3389/fpubh.2022.1094771] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2022] [Accepted: 11/29/2022] [Indexed: 02/04/2023] Open

Guo H, Zhou X, Dong Y, Wang Y, Li S. On the use of machine learning methods to improve the estimation of gross primary productivity of maize field with drip irrigation. Ecol Modell 2023. [DOI: 10.1016/j.ecolmodel.2022.110250] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]

Early Prediction in Classification of Cardiovascular Diseases with Machine Learning, Neuro-Fuzzy and Statistical Methods. BIOLOGY 2023;12:117. [PMID: 36671809 PMCID: PMC9855428 DOI: 10.3390/biology12010117] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/12/2022] [Revised: 01/06/2023] [Accepted: 01/08/2023] [Indexed: 01/15/2023]

Trinklein TJ, Cain CN, Ochoa GS, Schöneich S, Mikaliunaite L, Synovec RE. Recent Advances in GC×GC and Chemometrics to Address Emerging Challenges in Nontargeted Analysis. Anal Chem 2023;95:264-286. [PMID: 36625122 DOI: 10.1021/acs.analchem.2c04235] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]

Li D, Ren X, Su Y. Predicting COVID-19 using lioness optimization algorithm and graph convolution network. Soft comput 2023;27:5437-5501. [PMID: 36686544 PMCID: PMC9838306 DOI: 10.1007/s00500-022-07778-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/21/2022] [Indexed: 01/11/2023]

Li Y, Huang X, Zhao C, Ding P. A novel remaining useful life prediction method based on multi-support vector regression fusion and adaptive weight updating. ISA TRANSACTIONS 2022;131:444-459. [PMID: 35581022 DOI: 10.1016/j.isatra.2022.04.042] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/12/2021] [Revised: 04/23/2022] [Accepted: 04/23/2022] [Indexed: 06/15/2023]

Gan N, Sun M, Lu C, Li M, Wang Y, Song Y, Ning JM, Zhang ZZ. High-speed identification system for fresh tea leaves based on phenotypic characteristics utilizing an improved genetic algorithm. JOURNAL OF THE SCIENCE OF FOOD AND AGRICULTURE 2022;102:6858-6867. [PMID: 35654754 DOI: 10.1002/jsfa.12047] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/06/2022] [Revised: 05/27/2022] [Accepted: 06/06/2022] [Indexed: 06/15/2023]

Artificial intelligence-based analytics for impacts of COVID-19 and online learning on college students’ mental health. PLoS One 2022;17:e0276767. [DOI: 10.1371/journal.pone.0276767] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2022] [Accepted: 10/13/2022] [Indexed: 11/19/2022] Open