1
|
Yang L, Qiao C, Kanamori T, Calhoun VD, Stephen JM, Wilson TW, Wang YP. Tensor dictionary-based heterogeneous transfer learning to study emotion-related gender differences in brain. Neural Netw 2025; 183:106974. [PMID: 39657530 DOI: 10.1016/j.neunet.2024.106974] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2024] [Revised: 11/05/2024] [Accepted: 11/26/2024] [Indexed: 12/12/2024]
Abstract
In practice, collecting auxiliary labeled data with same feature space from multiple domains is difficult. Thus, we focus on the heterogeneous transfer learning to address the problem of insufficient sample sizes in neuroimaging. Viewing subjects, time, and features as dimensions, brain activation and dynamic functional connectivity data can be treated as high-order heterogeneous data with heterogeneity arising from distinct feature space. To use the heterogeneous priori knowledge from the low-dimensional brain activation data to improve the classification performance of high-dimensional dynamic functional connectivity data, we propose a tensor dictionary-based heterogeneous transfer learning framework. It combines supervised tensor dictionary learning with heterogeneous transfer learning for enhance high-order heterogeneous knowledge sharing. The former can encode the underlying discriminative features in high-order data into dictionaries, while the latter can transfer heterogeneous knowledge encoded in dictionaries through feature transformation derived from mathematical relationship between domains. The primary focus of this paper is gender classification using fMRI data to identify emotion-related brain gender differences during adolescence. Additionally, experiments on simulated data and EEG data are included to demonstrate the generalizability of the proposed method. Experimental results indicate that incorporating prior knowledge significantly enhances classification performance. Further analysis of brain gender differences suggests that temporal variability in brain activity explains differences in emotion regulation strategies between genders. By adopting the heterogeneous knowledge sharing strategy, the proposed framework can capture the multifaceted characteristics of the brain, improve the generalization of the model, and reduce training costs. Understanding the gender specific neural mechanisms of emotional cognition helps to develop the gender-specific treatments for neurological diseases.
Collapse
Affiliation(s)
- Lan Yang
- School of Mathematics and Statistics, Xi'an Jiaotong University, Xi'an 710049, PR China.
| | - Chen Qiao
- School of Mathematics and Statistics, Xi'an Jiaotong University, Xi'an 710049, PR China.
| | - Takafumi Kanamori
- Department of Mathematical and Computing Science, Tokyo Institute of Technology, Tokyo 152-8552, Japan; RIKEN AIP, Tokyo 103-0027, Japan.
| | - Vince D Calhoun
- Tri-Institutional Center for Translational Research in Neuroimaging and Data Science(TReNDS), Georgia State University, Georgia Institute of Technology, Atlanta, GA 30030, USA; Emory University, Atlanta, GA, USA.
| | | | - Tony W Wilson
- Institute for Human Neuroscience, Boys Town National Research Hospital, Boys Town, NE 68010, USA.
| | - Yu-Ping Wang
- Department of Biomedical Engineering, Tulane University, New Orleans, LA 70118, USA.
| |
Collapse
|
2
|
Hassan SU, Abdulkadir SJ, Zahid MSM, Al-Selwi SM. Local interpretable model-agnostic explanation approach for medical imaging analysis: A systematic literature review. Comput Biol Med 2025; 185:109569. [PMID: 39705792 DOI: 10.1016/j.compbiomed.2024.109569] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2024] [Revised: 10/30/2024] [Accepted: 12/10/2024] [Indexed: 12/23/2024]
Abstract
BACKGROUND The interpretability and explainability of machine learning (ML) and artificial intelligence systems are critical for generating trust in their outcomes in fields such as medicine and healthcare. Errors generated by these systems, such as inaccurate diagnoses or treatments, can have serious and even life-threatening effects on patients. Explainable Artificial Intelligence (XAI) is emerging as an increasingly significant area of research nowadays, focusing on the black-box aspect of sophisticated and difficult-to-interpret ML algorithms. XAI techniques such as Local Interpretable Model-Agnostic Explanations (LIME) can give explanations for these models, raising confidence in the systems and improving trust in their predictions. Numerous works have been published that respond to medical problems through the use of ML models in conjunction with XAI algorithms to give interpretability and explainability. The primary objective of the study is to evaluate the performance of the newly emerging LIME techniques within healthcare domains that require more attention in the realm of XAI research. METHOD A systematic search was conducted in numerous databases (Scopus, Web of Science, IEEE Xplore, ScienceDirect, MDPI, and PubMed) that identified 1614 peer-reviewed articles published between 2019 and 2023. RESULTS 52 articles were selected for detailed analysis that showed a growing trend in the application of LIME techniques in healthcare, with significant improvements in the interpretability of ML models used for diagnostic and prognostic purposes. CONCLUSION The findings suggest that the integration of XAI techniques, particularly LIME, enhances the transparency and trustworthiness of AI systems in healthcare, thereby potentially improving patient outcomes and fostering greater acceptance of AI-driven solutions among medical professionals.
Collapse
Affiliation(s)
- Shahab Ul Hassan
- Department of Computer and Information Sciences, Universiti Teknologi PETRONAS, Seri Iskandar, 32610, Perak, Malaysia; Centre for Intelligent Signal & Imaging Research (CISIR), Universiti Teknologi PETRONAS, Seri Iskandar, 32610, Perak, Malaysia.
| | - Said Jadid Abdulkadir
- Department of Computer and Information Sciences, Universiti Teknologi PETRONAS, Seri Iskandar, 32610, Perak, Malaysia; Center for Research in Data Science (CeRDaS), Universiti Teknologi PETRONAS, Seri Iskandar, 32610, Perak, Malaysia.
| | - M Soperi Mohd Zahid
- Department of Computer and Information Sciences, Universiti Teknologi PETRONAS, Seri Iskandar, 32610, Perak, Malaysia; Centre for Intelligent Signal & Imaging Research (CISIR), Universiti Teknologi PETRONAS, Seri Iskandar, 32610, Perak, Malaysia.
| | - Safwan Mahmood Al-Selwi
- Department of Computer and Information Sciences, Universiti Teknologi PETRONAS, Seri Iskandar, 32610, Perak, Malaysia; Center for Research in Data Science (CeRDaS), Universiti Teknologi PETRONAS, Seri Iskandar, 32610, Perak, Malaysia.
| |
Collapse
|
3
|
Li S, Liu J, Xu W, Zhang S, Zhao M, Miao L, Hui M, Wang Y, Hou Y, Cong B, Wang Z. A multi-class support vector machine classification model based on 14 microRNAs for forensic body fluid identification. Forensic Sci Int Genet 2025; 75:103180. [PMID: 39591840 DOI: 10.1016/j.fsigen.2024.103180] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2024] [Revised: 09/30/2024] [Accepted: 11/19/2024] [Indexed: 11/28/2024]
Abstract
MicroRNAs (miRNAs) are promising biomarkers for forensic body fluid identification owing to their small size, stability against degradation, and differential expression patterns. However, the expression of most body fluid-miRNAs is relative (differentially expressed in certain body fluids) rather than absolute (exclusively expressed in a specific body fluid). Moreover, different body fluids contain heterogeneous cell types, complicating their identification. Therefore, appropriate normalization strategies to eliminate non-biological variations and robust models to interpret expression levels accurately are necessary prerequisites for applying miRNAs in body fluid identification. In this study, the expression stability of six candidate reference genes (RGs) across five body fluids was validated using geNorm, NormFinder, BestKeeper and RankAggreg, and the most suitable combination of RGs (hsa-miR-484 and hsa-miR-191-5p) was identified under our experimental conditions. Subsequently, we systematically evaluated the expression patterns of the 28 most promising body fluid-specific miRNA markers using TaqMan RT-qPCR and selected the optimal combination of markers (12 miRNAs) to establish a multi-class support vector machine (MSVM) classification model. An independent test set (60 samples) was used to validate the accuracy of the proposed classification model, while an additional 30 casework samples were used to assess its robustness. The MSVM model accurately predicted the body fluid origin for almost all (59/60) single-source samples. Moreover, this model demonstrated the capability to identify aged forensic samples and to predict the primary components of mixed stains to a certain extent. In summary, this study presented a miRNA-based MSVM classification model for forensic body fluid identification using the qPCR platform. However, extensive validation, especially inter-laboratory collaborative exercises, is necessary before miRNA can be routinely applied in forensic identification practice.
Collapse
Affiliation(s)
- Suyu Li
- Institute of Forensic Medicine, West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu 610041, China
| | - Jing Liu
- Institute of Forensic Medicine, West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu 610041, China; College of Forensic Medicine, Hebei Medical University, Hebei Key Laboratory of Forensic Medicine, Shijiazhuang 050017, China
| | - Wei Xu
- Institute of Forensic Medicine, West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu 610041, China; Criminal Investigation Detachment of Huainan Public Security Bureau, Huainan 232000, China
| | - Shuyuan Zhang
- Institute of Forensic Medicine, West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu 610041, China
| | - Mengyao Zhao
- Institute of Forensic Medicine, West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu 610041, China
| | - Lu Miao
- Institute of Forensic Medicine, West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu 610041, China; Criminal Investigation Detachment of Huainan Public Security Bureau, Huainan 232000, China
| | - Minxiao Hui
- Institute of Forensic Medicine, West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu 610041, China
| | - Yuan Wang
- Institute of Forensic Medicine, West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu 610041, China; Anhui Hopegenerich Biotechnology, Hefei 230031, China
| | - Yiping Hou
- Institute of Forensic Medicine, West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu 610041, China
| | - Bin Cong
- College of Forensic Medicine, Hebei Medical University, Hebei Key Laboratory of Forensic Medicine, Shijiazhuang 050017, China.
| | - Zheng Wang
- Institute of Forensic Medicine, West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu 610041, China.
| |
Collapse
|
4
|
Hwang UJ, Kwon OY, Kim JH, Gwak GT. Machine learning for classifying chronic ankle instability based on ankle strength, range of motion, postural control and anatomical deformities in delivery service workers with a history of lateral ankle sprains. Musculoskelet Sci Pract 2025; 75:103230. [PMID: 39579676 DOI: 10.1016/j.msksp.2024.103230] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/08/2024] [Revised: 10/29/2024] [Accepted: 11/18/2024] [Indexed: 11/25/2024]
Abstract
OBJECTIVE Chronic ankle instability (CAI) frequently develops as a result of lateral ankle sprains (LAS) in delivery service workers (DSWs). Identifying risk factors for CAI is crucial for implementing targeted interventions. This study aimed to develop machine learning (ML) models for classifying CAI in DSWs with a history of LAS (DSWsLAS) and to identify key contributory factors. DESIGN Exploratory, cross-sectional design. SETTING and participants: A total of 121 DSWsLAS were screened for eligibility among 289 DSWs. METHODS A total of 121 DSWsLAS were assessed for demographic characteristics, including ankle strength, range of motion, postural control, and anatomical deformities. Seven ML algorithms were trained and tested for classifying CAI. Principal component analysis (PCA) was used for feature extraction, and feature permutation importance (FPI) and Shapley additive explanations (SHAP) were employed to identify influential features. MAIN OUTCOME MEASURES Model performances were assessed using area under the curve (AUC). To interpret the classifications, we used FPI and SHAP values. RESULTS PCA derived 7 principal components (PCs) accounting for 83.5% of the total variation in the data. The support vector machine (SVM) algorithm achieved the highest classifying performance (AUC = 0.817) among the ML models. FPI and SHAP revealed that PC1, PC2, PC5, and PC7 were the most influential features for classifying CAI in DSWsLAS. CONCLUSIONS The SVM algorithm, utilizing PCA-derived factors related to body mass index and ankle muscle strength demonstrated high classifying performance for diagnosis of CAI in DSWsLAS, emphasizing the importance of considering multiple contributory factors in the prevention and management of this condition.
Collapse
Affiliation(s)
- Ui-Jae Hwang
- Department of Physical Therapy, College of Health Science, Laboratory of KEMA AI Research (KAIR), Yonsei University, 234 Maeji-ri, Heungeop-Myeon, Wonju, Kangwon-Do, 220-710, South Korea.
| | - Oh-Yun Kwon
- Department of Physical Therapy, College of Health Science, Laboratory of Kinetic Ergocise Based on Movement Analysis, Yonsei University, 234 Maeji-ri, Heungeop-Myeon, Wonju, Kangwon-Do, 220-710, South Korea.
| | - Jun-Hee Kim
- Department of Physical Therapy, College of Health Science, Laboratory of KEMA AI Research (KAIR), Yonsei University, 234 Maeji-ri, Heungeop-Myeon, Wonju, Kangwon-Do, 220-710, South Korea.
| | - Gyeong-Tae Gwak
- Department of Physical Therapy, College of Health Science, Laboratory of KEMA AI Research (KAIR), Yonsei University, 234 Maeji-ri, Heungeop-Myeon, Wonju, Kangwon-Do, 220-710, South Korea.
| |
Collapse
|
5
|
Jarmoshti J, Siddique AB, Rane A, Mirhosseini S, Adair SJ, Bauer TW, Caselli F, Swami NS. Neural Network-Enabled Multiparametric Impedance Signal Templating for High throughput Single-Cell Deformability Cytometry Under Viscoelastic Extensional Flows. SMALL (WEINHEIM AN DER BERGSTRASSE, GERMANY) 2025; 21:e2407212. [PMID: 39439143 DOI: 10.1002/smll.202407212] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/17/2024] [Revised: 10/08/2024] [Indexed: 10/25/2024]
Abstract
Cellular biophysical metrics exhibit systematic alterations during processes, such as metastasis and immune cell activation, which can be used to identify and separate live cell subpopulations for targeting drug screening. Image-based biophysical cytometry under extensional flows can accurately quantify cell deformability based on cell shape alterations but needs extensive image reconstruction, which limits its inline utilization to activate cell sorting. Impedance cytometry can measure these cell shape alterations based on electric field screening, while its frequency response offers functional information on cell viability and interior structure, which are difficult to discern by imaging. Furthermore, 1-D temporal impedance signal trains exhibit characteristic shapes that can be rapidly templated in near real-time to extract single-cell biophysical metrics to activate sorting. We present a multilayer perceptron neural network signal templating approach that utilizes raw impedance signals from cells under extensional flow, alongside its training with image metrics from corresponding cells to derive net electrical anisotropy metrics that quantify cell deformability over wide anisotropy ranges and with minimal errors from cell size distributions. Deformability and electrical physiology metrics are applied in conjunction on the same cell for multiparametric classification of live pancreatic cancer cells versus cancer associated fibroblasts using the support vector machine model.
Collapse
Affiliation(s)
- Javad Jarmoshti
- Electrical & Computer Engineering, University of Virginia, Charlottesville, VA, 22904, USA
| | - Abdullah-Bin Siddique
- Electrical & Computer Engineering, University of Virginia, Charlottesville, VA, 22904, USA
| | - Aditya Rane
- Chemistry, University of Virginia, University of Virginia, Charlottesville, VA, 22904, USA
| | - Shaghayegh Mirhosseini
- Electrical & Computer Engineering, University of Virginia, Charlottesville, VA, 22904, USA
| | - Sara J Adair
- Surgery, School of Medicine, University of Virginia, Charlottesville, VA, 22903, USA
| | - Todd W Bauer
- Surgery, School of Medicine, University of Virginia, Charlottesville, VA, 22903, USA
| | - Federica Caselli
- Civil Engineering and Computer Science, University of Rome Tor Vergata, Rome, 00133, Italy
| | - Nathan S Swami
- Electrical & Computer Engineering, University of Virginia, Charlottesville, VA, 22904, USA
- Chemistry, University of Virginia, University of Virginia, Charlottesville, VA, 22904, USA
| |
Collapse
|
6
|
Cejudo A, Beristain A, Almeida A, Rebescher K, Martín C, Macía I. Smart home-assisted anomaly detection system for older adults: a deep learning approach with a comprehensive set of daily activities. Med Biol Eng Comput 2025:10.1007/s11517-025-03308-y. [PMID: 39888470 DOI: 10.1007/s11517-025-03308-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2024] [Accepted: 01/20/2025] [Indexed: 02/01/2025]
Abstract
Smart homes have the potential to enable remote monitoring of the health and well-being of older adults, leading to improved health outcomes and increased independence. However, current approaches only consider a limited set of daily activities and do not combine data from individuals. In this work, we propose the use of deep learning techniques to model behavior at the population level and detect significant deviations (i.e., anomalies) while taking into account the whole set of daily activities (41). We detect and visualize daily routine patterns, train a set of recurrent neural networks for behavior modelling with next-day prediction, and model errors with a normal distribution to identify significant deviations while considering the temporal component. Clustering of daily routines achieves a silhouette score of 0.18 and the best model obtains a mean squared error in next day routine prediction of 4.38%. The mean number of deviated activities for the anomalies in the train and test set are 3.6 and 3.0, respectively, with more than 60% of anomalies involving three or more deviated activities in the test set. The methodology is scalable and can incorporate additional activities into the analysis.
Collapse
Affiliation(s)
- Ander Cejudo
- Fundación Vicomtech, Basque Research and Technology Alliance (BRTA), Mikeletegi 57, 20009, Donostia-San Sebastián, Spain.
- Faculty of Engineering, University of Deusto, Avda. Universidades, 24, 48007, Bilbao, Spain.
| | - Andoni Beristain
- Fundación Vicomtech, Basque Research and Technology Alliance (BRTA), Mikeletegi 57, 20009, Donostia-San Sebastián, Spain
- e-Health Department, Biodonostia Health Research Institute, Paseo Dr Begiristain s/n, 20014, San Sebastián, Spain
- Computational Intelligence Group, Computer Science Faculty, University of the Basque Country, UPV/EHU, Leioa, Spain
| | - Aitor Almeida
- Faculty of Engineering, University of Deusto, Avda. Universidades, 24, 48007, Bilbao, Spain
| | - Kristin Rebescher
- Fundación Vicomtech, Basque Research and Technology Alliance (BRTA), Mikeletegi 57, 20009, Donostia-San Sebastián, Spain
| | - Cristina Martín
- Fundación Vicomtech, Basque Research and Technology Alliance (BRTA), Mikeletegi 57, 20009, Donostia-San Sebastián, Spain
- e-Health Department, Biodonostia Health Research Institute, Paseo Dr Begiristain s/n, 20014, San Sebastián, Spain
- Faculty of Engineering, University of Deusto, Avda. Universidades, 24, 48007, Bilbao, Spain
| | - Iván Macía
- Fundación Vicomtech, Basque Research and Technology Alliance (BRTA), Mikeletegi 57, 20009, Donostia-San Sebastián, Spain
- e-Health Department, Biodonostia Health Research Institute, Paseo Dr Begiristain s/n, 20014, San Sebastián, Spain
- Computational Intelligence Group, Computer Science Faculty, University of the Basque Country, UPV/EHU, Leioa, Spain
| |
Collapse
|
7
|
Jin L, Mu Q, Zhang Q, Li K, Wang Y, Jiang Z, Yan Y, He D, Zhu L, Li M, Gao X, Hui Q, Yang J, Wang X. Temperature-Enhanced Purine Metabolism-Based Versatile SERS Platform for Rapid Clinical Pathogens Diagnosis and Drug-Resistant Assessment. Anal Chem 2025. [PMID: 39876763 DOI: 10.1021/acs.analchem.4c04891] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2025]
Abstract
Label-free surface-enhanced Raman spectroscopy (SERS) combined with machine learning (ML) techniques presents a promising approach for rapid pathogen identification. Previous studies have demonstrated that purine degradation metabolites are the primary contributors to SERS spectra; however, generating these distinguishable spectra typically requires a long incubation time (>10 h) at room temperature. Moreover, the lack of attention to spectral variations between strains of the same bacterial species has limited the generalizability of ML models in real-world applications. To address these issues, we investigated temperature-induced alterations in bacterial purine metabolism and found that robust SERS spectra could be obtained within just 1 h by heating samples to 60 °C. Our study further revealed that pathogens exhibit multiple fingerprint patterns across strains, rather than a uniform spectral signature. To enhance practicality, we optimized ML models by training them on data sets capturing all relevant SERS fingerprints and validated them on separate bacterial strains. The SoftMax classifier achieved 100% accuracy in identifying both laboratory and clinical specimens within 17 h. Additionally, the platform demonstrated over 91% accuracy in distinguishing drug-resistant strains, such as methicillin-resistant Staphylococcus aureus and carbapenem-resistant Klebsiella pneumoniae, and achieved 99.66% accuracy in differentiating specific strains within a species, such as enterohemorrhagic Escherichia coli. This accelerated, purine metabolism-based SERS platform offers a highly promising alternative for the rapid diagnosis of bacterial infections.
Collapse
Affiliation(s)
- Lei Jin
- School of Pharmaceutical Sciences, Wenzhou Medical University, Wenzhou 325035, China
- Oujiang Laboratory (Zhejiang Lab for Regenerative Medicine, Vision and Brain Health), Wenzhou 325000, China
| | - Qiuqiu Mu
- Third Affiliated Hospital of Wenzhou Medical University, Wenzhou 325200, China
| | - Qing Zhang
- Third Affiliated Hospital of Wenzhou Medical University, Wenzhou 325200, China
| | - Kunxin Li
- School of Pharmaceutical Sciences, Wenzhou Medical University, Wenzhou 325035, China
| | - Ying Wang
- School of Pharmaceutical Sciences, Wenzhou Medical University, Wenzhou 325035, China
| | - Zelong Jiang
- School of Pharmaceutical Sciences, Wenzhou Medical University, Wenzhou 325035, China
| | - Yang Yan
- School of Pharmaceutical Sciences, Wenzhou Medical University, Wenzhou 325035, China
- Oujiang Laboratory (Zhejiang Lab for Regenerative Medicine, Vision and Brain Health), Wenzhou 325000, China
| | - Deyin He
- School of Pharmaceutical Sciences, Wenzhou Medical University, Wenzhou 325035, China
- Oujiang Laboratory (Zhejiang Lab for Regenerative Medicine, Vision and Brain Health), Wenzhou 325000, China
| | - Liqin Zhu
- School of Pharmaceutical Sciences, Wenzhou Medical University, Wenzhou 325035, China
| | - Mengyun Li
- School of Pharmaceutical Sciences, Wenzhou Medical University, Wenzhou 325035, China
| | - Xiangyun Gao
- School of Pharmaceutical Sciences, Wenzhou Medical University, Wenzhou 325035, China
| | - Qi Hui
- School of Pharmaceutical Sciences, Wenzhou Medical University, Wenzhou 325035, China
| | - Jinmei Yang
- National Engineering Research Center of Ophthalmology and Optometry, Eye Hospital, Wenzhou Medical University, Wenzhou 325027, China
| | - Xiaojie Wang
- School of Pharmaceutical Sciences, Wenzhou Medical University, Wenzhou 325035, China
- Oujiang Laboratory (Zhejiang Lab for Regenerative Medicine, Vision and Brain Health), Wenzhou 325000, China
| |
Collapse
|
8
|
Huo J, He F, Lu C, Zhu M, Bu Y, Kang D, Wang R, Feng W, Ma R. Nonlinear Small Sample Data Regression with a New Rational-Quadratic Minkowski Kernel for Tobacco Laser Perforation Process Tar Reduction Estimation. ACS OMEGA 2025; 10:2908-2918. [PMID: 39895722 PMCID: PMC11780416 DOI: 10.1021/acsomega.4c08978] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/07/2024] [Revised: 12/02/2024] [Accepted: 12/10/2024] [Indexed: 02/04/2025]
Abstract
This paper investigates the nonlinear relationship between tobacco harmful content tar reduction and laser perforation parameters. To find a model to demonstrate the relationship between the laser perforation parameters and the cigarette tar reduction level, an online platform based on Python Streamlit was built to collect and publish related data. After the initial analysis of the collected experimental data, the quadratic nonlinear regression model demonstrates a significant fit to the experimental data. However, although the nonlinear regression has much higher accuracy than the linear regression plane, the prediction normalized root mean squared error (NRMSE) is still high, over 10%, which indicates that the regression relationship is more complex than the simple quadratic function expression. On the other hand, the sample dataset used for modeling is very limited, which restricts its exploration and the development of a model comparable to those built with big data. To address this challenge for small sample size data in modeling this complex nonlinear relationship, a novel rational-quadratic Minkowski (RM)-based kernel was designed. This RM-kernel model acquires higher accuracy than other kernels in both SVM and Gaussian process regression. Furthermore, this new kernel also shows less sensitivity to hyperparameter change, the greater ability to capture complex relationships, and more flexibility than the RBF kernel and RQ kernel. Subsequently, the kernel-based RM regression model was successfully implemented for laser perforation parameter selection, yielding consistent results that align with human sensory test data.
Collapse
Affiliation(s)
- Juan Huo
- School
of Electrical and Information Engineering, Zhengzhou University, Zhengzhou 450001, China
| | - Feng He
- China
Tobacco Hebei Industrial Co., Ltd., Shijiazhuang 050051, China
| | - Changtong Lu
- China
Tobacco Henan Industrial Co., Ltd., Zhengzhou 450001, China
| | - Meng Zhu
- China
Tobacco Hebei Industrial Co., Ltd., Shijiazhuang 050051, China
| | - Yifan Bu
- China
Tobacco Hebei Industrial Co., Ltd., Shijiazhuang 050051, China
| | - Di Kang
- China
Tobacco Hebei Industrial Co., Ltd., Shijiazhuang 050051, China
| | - Rui Wang
- China
Tobacco Hebei Industrial Co., Ltd., Shijiazhuang 050051, China
| | - Wenning Feng
- China
Tobacco Hebei Industrial Co., Ltd., Shijiazhuang 050051, China
| | - Rong Ma
- China
Tobacco Hebei Industrial Co., Ltd., Shijiazhuang 050051, China
| |
Collapse
|
9
|
Khullar V, Kaur P, Gargrish S, Mishra AM, Singh P, Diwakar M, Bijalwan A, Gupta I. Minimal sourced and lightweight federated transfer learning models for skin cancer detection. Sci Rep 2025; 15:2605. [PMID: 39837883 PMCID: PMC11750969 DOI: 10.1038/s41598-024-82402-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2024] [Accepted: 12/05/2024] [Indexed: 01/23/2025] Open
Abstract
One of the most fatal diseases that affect people is skin cancer. Because nevus and melanoma lesions are so similar and there is a high likelihood of false negative diagnoses challenges in hospitals. The aim of this paper is to propose and develop a technique to classify type of skin cancer with high accuracy using minimal resources and lightweight federated transfer learning models. Here minimal resource based pre-trained deep learning models including EfficientNetV2S, EfficientNetB3, ResNet50, and NasNetMobile have been used to apply transfer learning on data of shape[Formula: see text]. To compare with applied minimal resource transfer learning, same methodology has been applied using best identified model i.e. EfficientNetV2S for images of shape[Formula: see text]. The identified minimal and lightweight resource based EfficientNetV2S with images of shape [Formula: see text] have been applied for federated learning ecosystem. Both, identically and non-identically distributed datasets of shape [Formula: see text] have been applied and analyzed through federated learning implementations. The results have been analyzed to show the impact of low-pixel images with non-identical distributions over clients using parameters such as accuracy, precision, recall and categorical losses. The classification of skin cancer shows an accuracy of IID 89.83% and Non-IID 90.64%.
Collapse
Affiliation(s)
- Vikas Khullar
- Chitkara University Institute of Engineering Technology, Chitkara University, Rajpura, Punjab, India
| | - Prabhjot Kaur
- Chitkara University Institute of Engineering Technology, Chitkara University, Rajpura, Punjab, India
| | - Shubham Gargrish
- Chitkara University Institute of Engineering Technology, Chitkara University, Rajpura, Punjab, India
| | - Anand Muni Mishra
- Chandigarh Engineering College, Chandigarh Group of Colleges, Jhanjeri, Mohali, India
| | - Prabhishek Singh
- School of Computer Science Engineering and Technology, Bennett University, Greater Noida, Uttar Pradesh, India
| | - Manoj Diwakar
- CSE Department, Graphic Era Deemed to be University, Dehradun, Uttrakhand, India
- Graphic Era Hill University, Dehradun, Uttrakhand, India
| | - Anchit Bijalwan
- Faculty of Electrical and Computer Engineering, Arba Minch University, Arba Minch, Ethiopia.
| | - Indrajeet Gupta
- School of Computer Science and AI, SR University, Warangal, Telangana, India
| |
Collapse
|
10
|
Sarikaya MA, Ince G. Improved BCI calibration in multimodal emotion recognition using heterogeneous adversarial transfer learning. PeerJ Comput Sci 2025; 11:e2649. [PMID: 39896041 PMCID: PMC11784743 DOI: 10.7717/peerj-cs.2649] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2024] [Accepted: 12/17/2024] [Indexed: 02/04/2025]
Abstract
The use of brain-computer interface (BCI) technology to identify emotional states has gained significant interest, especially with the rise of virtual reality (VR) applications. However, the extensive calibration required for precise emotion recognition models presents a significant challenge, particularly for sensitive groups such as children, elderly, and patients. This study presents a novel approach that utilizes heterogeneous adversarial transfer learning (HATL) to synthesize electroencephalography (EEG) data from various other signal modalities, reducing the need for lengthy calibration phases. We benchmark the efficacy of three generative adversarial network (GAN) architectures, such as conditional GAN (CGAN), conditional Wasserstein GAN (CWGAN), and CWGAN with gradient penalty (CWGAN-GP) within this framework. The proposed framework is rigorously tested on two conventional open sourced datasets, SEED-V and DEAP. Additionally, the framework was applied to an immersive three-dimensional (3D) dataset named GraffitiVR, which we collected to capture the emotional and behavioral reactions of individuals experiencing urban graffiti in a VR environment. This expanded application provides insights into emotion recognition frameworks in VR settings, providing a wider range of contexts for assessing our methodology. When the accuracy of emotion recognition classifiers trained with CWGAN-GP-generated EEG data combined with non-EEG sensory data was compared against those trained using a combination of real EEG and non-EEG sensory data, the accuracy ratios were 93% on the SEED-V dataset, 99% on the DEAP dataset, and 97% on the GraffitiVR dataset. Moreover, in the GraffitiVR dataset, using CWGAN-GP-generated EEG data with non-EEG sensory data for emotion recognition models resulted in up to a 30% reduction in calibration time compared to classifiers trained on real EEG data with non-EEG sensory data. These results underscore the robustness and versatility of the proposed approach, significantly enhancing emotion recognition processes across a variety of environmental settings.
Collapse
Affiliation(s)
- Mehmet Ali Sarikaya
- Department of Computer Engineering, Istanbul Technical University, Istanbul, Turkey
| | - Gökhan Ince
- Department of Computer Engineering, Istanbul Technical University, Istanbul, Turkey
| |
Collapse
|
11
|
Al-Obeidat F, Hafez W, Rashid A, Jallo MK, Gador M, Cherrez-Ojeda I, Simancas-Racines D. Artificial intelligence for the detection of acute myeloid leukemia from microscopic blood images; a systematic review and meta-analysis. Front Big Data 2025; 7:1402926. [PMID: 39897067 PMCID: PMC11782132 DOI: 10.3389/fdata.2024.1402926] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2024] [Accepted: 12/23/2024] [Indexed: 02/04/2025] Open
Abstract
Background Leukemia is the 11th most prevalent type of cancer worldwide, with acute myeloid leukemia (AML) being the most frequent malignant blood malignancy in adults. Microscopic blood tests are the most common methods for identifying leukemia subtypes. An automated optical image-processing system using artificial intelligence (AI) has recently been applied to facilitate clinical decision-making. Aim To evaluate the performance of all AI-based approaches for the detection and diagnosis of acute myeloid leukemia (AML). Methods Medical databases including PubMed, Web of Science, and Scopus were searched until December 2023. We used the "metafor" and "metagen" libraries in R to analyze the different models used in the studies. Accuracy and sensitivity were the primary outcome measures. Results Ten studies were included in our review and meta-analysis, conducted between 2016 and 2023. Most deep-learning models have been utilized, including convolutional neural networks (CNNs). The common- and random-effects models had accuracies of 1.0000 [0.9999; 1.0001] and 0.9557 [0.9312, and 0.9802], respectively. The common and random effects models had high sensitivity values of 1.0000 and 0.8581, respectively, indicating that the machine learning models in this study can accurately detect true-positive leukemia cases. Studies have shown substantial variations in accuracy and sensitivity, as shown by the Q values and I2 statistics. Conclusion Our systematic review and meta-analysis found an overall high accuracy and sensitivity of AI models in correctly identifying true-positive AML cases. Future research should focus on unifying reporting methods and performance assessment metrics of AI-based diagnostics. Systematic review registration https://www.crd.york.ac.uk/prospero/#recordDetails, CRD42024501980.
Collapse
Affiliation(s)
- Feras Al-Obeidat
- College of Technological Innovation, Zayed University, Abu Dhabi, United Arab Emirates
| | - Wael Hafez
- Internal Medicine Department, Medical Research and Clinical Studies Institute, The National Research Centre, Cairo, Egypt
- NMC Royal Hospital, Abu Dhabi, United Arab Emirates
| | - Asrar Rashid
- NMC Royal Hospital, Abu Dhabi, United Arab Emirates
| | - Mahir Khalil Jallo
- Department of Clinical Sciences, College of Medicine, Gulf Medical University, Ajman, United Arab Emirates
| | - Munier Gador
- NMC Royal Hospital, Abu Dhabi, United Arab Emirates
| | - Ivan Cherrez-Ojeda
- Department of Allergy and Immunology, Universidad Espiritu Santo, Samborondon, Ecuador
- Respiralab Research Group, Guayaquil, Ecuador
| | - Daniel Simancas-Racines
- Centro de Investigación de Salud Pública y Epidemiología Clínica (CISPEC), Universidad UTE, Quito, Ecuador
| |
Collapse
|
12
|
Wang J, Zhao J, Chen X, Yin B, Li X, Xie P. Alzheimer's disease diagnosis using rhythmic power changes and phase differences: a low-density EEG study. Front Aging Neurosci 2025; 16:1485132. [PMID: 39897456 PMCID: PMC11782140 DOI: 10.3389/fnagi.2024.1485132] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2024] [Accepted: 12/30/2024] [Indexed: 02/04/2025] Open
Abstract
Objectives The future emergence of disease-modifying treatments for dementia highlights the urgent need to identify reliable and easily accessible tools for diagnosing Alzheimer's disease (AD). Electroencephalography (EEG) is a non-invasive and cost-effective technique commonly used in the study of neurodegenerative disorders. However, the specific alterations in EEG biomarkers associated with AD remain unclear when using a limited number of electrodes. Methods We studied pathological characteristics of AD using low-density EEG data collected from 26 AD and 29 healthy controls (HC) during both eye closed (EC) and eye opened (EO) resting conditions. The analysis including power spectrum, phase lock value (PLV), and weighted lag phase index (wPLI) and power-to-power frequency coupling (theta/beta) analysis were applied to extract features in the delta, theta, alpha, and beta bands. Results During the EC condition, the AD group exhibited decreased alpha power compared to HC. Additionally, both analysis of PLV and wPLI in the theta band indicated that the alterations in the AD brain network predominantly involved in the frontal region with the opposite changes. Moreover, the AD group had increased frequency coupling in the frontal and central regions. Surprisingly, no group difference was found in the EO condition. Notably, decreased theta band functional connectivity within the fronto-central lobe and increased frequency coupling in frontal region were found in AD group from EC to EO. More importantly, the combination of EC and EO quantitative EEG features improved the inter-group classification accuracy when using support vector machine (SVM) in older adults with AD. These findings highlight the complementary nature of EC and EO conditions in assessing and differentiating AD cohorts. Conclusion Our results underscore the potential of utilizing low-density EEG data from resting-state paradigms, combined with machine learning techniques, to improve the identification and classification of AD.
Collapse
Affiliation(s)
- Juan Wang
- Institute of Electrical Engineering, Yanshan University, Qinhuangdao, China
- Key Laboratory of Intelligent Rehabilitation and Neuromodulation of Hebei Province, Qinhuangdao, China
| | - Jiamei Zhao
- Institute of Electrical Engineering, Yanshan University, Qinhuangdao, China
| | - Xiaoling Chen
- Institute of Electrical Engineering, Yanshan University, Qinhuangdao, China
- Key Laboratory of Intelligent Rehabilitation and Neuromodulation of Hebei Province, Qinhuangdao, China
| | - Bowen Yin
- Department of Neurology, The First Hospital of Qinhuangdao, Hebei Medical University, Qinhuangdao, China
| | - Xiaoli Li
- State Key Laboratory of Cognitive Neuroscience and Learning & IDG/McGovern Institute for Brain Research, Beijing Normal University, Beijing, China
- Center for Collaboration and Innovation in Brain and Learning Sciences, Beijing Normal University, Beijing, China
| | - Ping Xie
- Institute of Electrical Engineering, Yanshan University, Qinhuangdao, China
- Key Laboratory of Intelligent Rehabilitation and Neuromodulation of Hebei Province, Qinhuangdao, China
| |
Collapse
|
13
|
Olweny G, Ntayi ML, Kyalo E, Kayongo A. Protocol for identifying Mycobacterium tuberculosis infection status through airway microbiome profiling. STAR Protoc 2025; 6:103574. [PMID: 39826114 PMCID: PMC11787526 DOI: 10.1016/j.xpro.2024.103574] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2024] [Revised: 08/20/2024] [Accepted: 12/18/2024] [Indexed: 01/22/2025] Open
Abstract
This protocol describes the steps to determine an airway microbiome signature for identifying Mycobacterium tuberculosis infection status. We outline procedures for processing microbiome data, calculating diversity measures, and fitting Dirichlet multinomial mixture models. Additionally, we provide steps for analyzing taxonomic relative and differential abundances, as well as identifying potential biomarkers associated with infection status. For complete details on the use and execution of this protocol, please refer to Kayongo et al.1.
Collapse
Affiliation(s)
- Geoffrey Olweny
- Department of Immunology and Molecular Biology, Makerere University College of Health Sciences, Kampala 256, Uganda; Lung Institute, Makerere University College of Health Sciences, Kampala 256, Uganda.
| | - Moses Levi Ntayi
- Department of Immunology and Molecular Biology, Makerere University College of Health Sciences, Kampala 256, Uganda; Lung Institute, Makerere University College of Health Sciences, Kampala 256, Uganda
| | - Edward Kyalo
- Department of Immunology and Molecular Biology, Makerere University College of Health Sciences, Kampala 256, Uganda; Lung Institute, Makerere University College of Health Sciences, Kampala 256, Uganda
| | - Alex Kayongo
- Department of Immunology and Molecular Biology, Makerere University College of Health Sciences, Kampala 256, Uganda.
| |
Collapse
|
14
|
Mu G, Li J, Liu Z, Dai J, Qu J, Li X. MSBKA: A Multi-Strategy Improved Black-Winged Kite Algorithm for Feature Selection of Natural Disaster Tweets Classification. Biomimetics (Basel) 2025; 10:41. [PMID: 39851757 PMCID: PMC11763058 DOI: 10.3390/biomimetics10010041] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2024] [Revised: 01/03/2025] [Accepted: 01/08/2025] [Indexed: 01/26/2025] Open
Abstract
With the advancement of the Internet, social media platforms have gradually become powerful in spreading crisis-related content. Identifying informative tweets associated with natural disasters is beneficial for the rescue operation. When faced with massive text data, choosing the pivotal features, reducing the calculation expense, and increasing the model classification performance is a significant challenge. Therefore, this study proposes a multi-strategy improved black-winged kite algorithm (MSBKA) for feature selection of natural disaster tweets classification based on the wrapper method's principle. Firstly, BKA is improved by utilizing the enhanced Circle mapping, integrating the hierarchical reverse learning, and introducing the Nelder-Mead method. Then, MSBKA is combined with the excellent classifier SVM (RBF kernel function) to construct a hybrid model. Finally, the MSBKA-SVM model performs feature selection and tweet classification tasks. The empirical analysis of the data from four natural disasters shows that the proposed model has achieved an accuracy of 0.8822. Compared with GA, PSO, SSA, and BKA, the accuracy is increased by 4.34%, 2.13%, 2.94%, and 6.35%, respectively. This research proves that the MSBKA-SVM model can play a supporting role in reducing disaster risk.
Collapse
Affiliation(s)
- Guangyu Mu
- School of Management Science and Information Engineering, Jilin University of Finance and Economics, Changchun 130117, China; (G.M.); (J.L.)
- Key Laboratory of Financial Technology of Jilin Province, Changchun 130117, China
| | - Jiaxue Li
- School of Management Science and Information Engineering, Jilin University of Finance and Economics, Changchun 130117, China; (G.M.); (J.L.)
| | - Zhanhui Liu
- Changchun Community Official Staff College of Jilin Province, Changchun 130052, China
| | - Jiaxiu Dai
- School of Management Science and Information Engineering, Jilin University of Finance and Economics, Changchun 130117, China; (G.M.); (J.L.)
| | - Jiayi Qu
- School of Management Science and Information Engineering, Jilin University of Finance and Economics, Changchun 130117, China; (G.M.); (J.L.)
| | - Xiurong Li
- Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China
| |
Collapse
|
15
|
AlEssa GN, Alzahrani SI. A novel non-invasive EEG-SSVEP diagnostic tool for color vision deficiency in individuals with locked-in syndrome. Front Bioeng Biotechnol 2025; 12:1498401. [PMID: 39840131 PMCID: PMC11747784 DOI: 10.3389/fbioe.2024.1498401] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2024] [Accepted: 12/20/2024] [Indexed: 01/23/2025] Open
Abstract
Introduction Color vision deficiency (CVD), a common visual impairment, affects individuals' ability to differentiate between various colors due to malfunctioning or absent color photoreceptors in the retina. Currently available diagnostic tests require a behavioral response, rendering them unsuitable for individuals with limited physical and communication abilities, such as those with locked-in syndrome. This study introduces a novel, non-invasive method that employs brain signals, specifically Steady-State Visually Evoked Potentials (SSVEPs), along with Ishihara plates to diagnose CVD. This method aims to provide an alternative diagnostic tool that addresses the limitations of current tests. Methods Electroencephalography (EEG) recordings were obtained from 16 subjects, including 5 with CVD (specifically Deuteranomaly), using channels O1, O2, Pz, and Cz. The subjects were exposed to visual stimuli at frequencies of 15 Hz and 18 Hz to assess the proposed method. The subjects focused on specific visual stimuli in response to questions related to the Ishihara plates. Their responses were analyzed to determine the presence of CVD. Feature extraction was performed using Power Spectral Density (PSD), Canonical Correlation Analysis (CCA), and a combined PSD + CCA, followed by classification to categorize subjects into two classes: normal vision and CVD. Results The results indicate that the proposed method effectively diagnoses CVD in individuals with limited communication abilities. The classification accuracy of SSVEP exceeded 75% across the three classifiers: Decision Tree (DT), K-Nearest Neighbors (KNN), and Support Vector Machine (SVM). The SVM classifier demonstrated higher accuracy compared to the other classifiers, exceeding 90%. Discussion These observations suggest that the SVM classifier, utilizing the combined feature set of PSD + CCA, may be the most effective in this classification task. These findings demonstrate that the proposed method is an accurate and reliable diagnostic tool for CVD, particularly for individuals unable to communicate.
Collapse
Affiliation(s)
| | - Saleh I. Alzahrani
- Biomedical Engineering Department, College of Engineering, Imam Abdulrahman Bin Faisal University, Dammam, Saudi Arabia
| |
Collapse
|
16
|
Wang L, Sheng J, Zhang Q, Song Y, Zhang Q, Wang B, Zhang R. Diagnosis of Alzheimer's disease using FusionNet with improved secretary bird optimization algorithm for optimal MK-SVM based on imaging genetic data. Cereb Cortex 2025:bhae498. [PMID: 39756421 DOI: 10.1093/cercor/bhae498] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2024] [Revised: 11/19/2024] [Accepted: 12/22/2024] [Indexed: 01/07/2025] Open
Abstract
Alzheimer's disease is an irreversible central neurodegenerative disease, and early diagnosis of Alzheimer's disease is beneficial for its prevention and early intervention treatment. In this study, we propose a novel framework, FusionNet-ISBOA-MK-SVM, which integrates a fusion network (FusionNet) and improved secretary bird optimization algorithm to optimize multikernel support vector machine for Alzheimer's disease diagnosis. The model leverages multimodality data, including functional magnetic resonance imaging and genetic information (single-nucleotide polymorphisms). Specifically, FusionNet employs U-shaped hierarchical graph convolutional networks and sparse graph attention networks to select feature effectively. Extensive validation using the Alzheimer's Disease Neuroimaging Initiative dataset demonstrates the model's superior interpretability and classification performance. Compared to other state-of-the-art machine learning methods, FusionNet-ISBOA-MK-SVM achieves classification accuracies of 98.6%, 95.7%, 93.0%, 91.8%, 93.1%, and 95.4% for HC vs. AD, EMCI vs. AD, LMCI vs. AD, EMCI vs. AD, HC vs. EMCI, and HC vs. LMCI, respectively. Moreover, the proposed model identifies affected brain regions and pathogenic genes, offering deeper insights into the mechanisms and progression of Alzheimer's disease. These findings provide valuable scientific evidence to support early diagnosis and preventive strategies for Alzheimer's disease.
Collapse
Affiliation(s)
- Luyun Wang
- School of Computer Science and Technology, Hangzhou Dianzi University, 1158 2nd Street, Hangzhou, Zhejiang 310018, China
- Key Laboratory of Intelligent Image Analysis for Sensory and Cognitive Health, Ministry of Industry and Information Technology of China, 215 6th Street, Hangzhou, Zhejiang 310018, China
- Hangzhou Vocational & Technical College, 68 Xueyuan Street, Hangzhou, Zhejiang 310018, China
| | - Jinhua Sheng
- School of Computer Science and Technology, Hangzhou Dianzi University, 1158 2nd Street, Hangzhou, Zhejiang 310018, China
- Key Laboratory of Intelligent Image Analysis for Sensory and Cognitive Health, Ministry of Industry and Information Technology of China, 215 6th Street, Hangzhou, Zhejiang 310018, China
| | - Qiao Zhang
- Beijing Hospital, 1 Dahua Road, Beijing 100730, China
- National Center of Gerontology, 1 Dahua Road, Beijing 100730, China
- Institute of Geriatric Medicine, Chinese Academy of Medical Sciences, 1 Dahua Road, Beijing 100730, China
| | - Yan Song
- Beijing Hospital, 1 Dahua Road, Beijing 100730, China
- National Center of Gerontology, 1 Dahua Road, Beijing 100730, China
- Institute of Geriatric Medicine, Chinese Academy of Medical Sciences, 1 Dahua Road, Beijing 100730, China
| | - Qian Zhang
- School of Computer Science and Technology, Hangzhou Dianzi University, 1158 2nd Street, Hangzhou, Zhejiang 310018, China
- Key Laboratory of Intelligent Image Analysis for Sensory and Cognitive Health, Ministry of Industry and Information Technology of China, 215 6th Street, Hangzhou, Zhejiang 310018, China
| | - Binbing Wang
- School of Computer Science and Technology, Hangzhou Dianzi University, 1158 2nd Street, Hangzhou, Zhejiang 310018, China
- Key Laboratory of Intelligent Image Analysis for Sensory and Cognitive Health, Ministry of Industry and Information Technology of China, 215 6th Street, Hangzhou, Zhejiang 310018, China
| | - Rong Zhang
- School of Computer Science and Technology, Hangzhou Dianzi University, 1158 2nd Street, Hangzhou, Zhejiang 310018, China
- Key Laboratory of Intelligent Image Analysis for Sensory and Cognitive Health, Ministry of Industry and Information Technology of China, 215 6th Street, Hangzhou, Zhejiang 310018, China
| |
Collapse
|
17
|
Kumari A, Akhtar M, Shah R, Tanveer M. Support matrix machine: A review. Neural Netw 2025; 181:106767. [PMID: 39488110 DOI: 10.1016/j.neunet.2024.106767] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2023] [Revised: 07/31/2024] [Accepted: 09/26/2024] [Indexed: 11/04/2024]
Abstract
Support vector machine (SVM) is one of the most studied paradigms in the realm of machine learning for classification and regression problems. It relies on vectorized input data. However, a significant portion of the real-world data exists in matrix format, which is given as input to SVM by reshaping the matrices into vectors. The process of reshaping disrupts the spatial correlations inherent in the matrix data. Also, converting matrices into vectors results in input data with a high dimensionality, which introduces significant computational complexity. To overcome these issues in classifying matrix input data, support matrix machine (SMM) is proposed. It represents one of the emerging methodologies tailored for handling matrix input data. SMM preserves the structural information of the matrix data by using the spectral elastic net property which is a combination of the nuclear norm and Frobenius norm. This article provides the first in-depth analysis of the development of the SMM model, which can be used as a thorough summary by both novices and experts. We discuss numerous SMM variants, such as robust, sparse, class-imbalance, and multi-class classification models. We also analyze the applications of the SMM and conclude the article by outlining potential future research avenues and possibilities that may motivate researchers to advance the SMM algorithm.
Collapse
Affiliation(s)
- Anuradha Kumari
- Department of Mathematics, Indian Institute of Technology Indore, Simrol, Indore, 453552, Madhya Pradesh, India
| | - Mushir Akhtar
- Department of Mathematics, Indian Institute of Technology Indore, Simrol, Indore, 453552, Madhya Pradesh, India
| | - Rupal Shah
- Department of Electrical Engineering, Indian Institute of Technology Indore, Simrol, Indore, 453552, Madhya Pradesh, India
| | - M Tanveer
- Department of Mathematics, Indian Institute of Technology Indore, Simrol, Indore, 453552, Madhya Pradesh, India.
| |
Collapse
|
18
|
Hassan MM, Xu Y, Sayada J, Zareef M, Shoaib M, Chen X, Li H, Chen Q. Progress of machine learning-based biosensors for the monitoring of food safety: A review. Biosens Bioelectron 2025; 267:116782. [PMID: 39288707 DOI: 10.1016/j.bios.2024.116782] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2024] [Revised: 08/20/2024] [Accepted: 09/12/2024] [Indexed: 09/19/2024]
Abstract
Rapid urbanization and growing food demand caused people to be concerned about food safety. Biosensors have gained considerable attention for assessing food safety due to selectivity, and sensitivity but poor stability inherently limits their application. The emergence of machine learning (ML) has enhanced the efficiency of different sensors for food safety assessment. The ML combined with various noninvasive biosensors has been implemented efficiently to monitor food safety by considering the stability of bio-recognition molecules. This review comprehensively summarizes the application of ML-powered biosensors to investigate food safety. Initially, different detector-based biosensors using biological molecules with their advantages and disadvantages and biosensor-related various ML algorithms for food safety monitoring have been discussed. Next, the application of ML-powered biosensors to detect antibiotics, foodborne microorganisms, mycotoxins, pesticides, heavy metals, anions, and persistent organic pollutants has been highlighted for the last five years. The challenges and prospects have also been deliberated. This review provides a new prospect in developing various biosensors for multi-food contaminants powered by suitable ML algorithms to monitor in-situ food safety.
Collapse
Affiliation(s)
- Md Mehedi Hassan
- College of Ocean Food and Biological Engineering, Jimei University, Xiamen, 361021, PR China
| | - Yi Xu
- College of Ocean Food and Biological Engineering, Jimei University, Xiamen, 361021, PR China
| | - Jannatul Sayada
- College of Ocean Food and Biological Engineering, Jimei University, Xiamen, 361021, PR China
| | - Muhammad Zareef
- School of Food and Biological Engineering, Jiangsu University, Zhenjiang, 212013, PR China
| | - Muhammad Shoaib
- School of Food and Biological Engineering, Jiangsu University, Zhenjiang, 212013, PR China
| | - Xiaomei Chen
- College of Ocean Food and Biological Engineering, Jimei University, Xiamen, 361021, PR China
| | - Huanhuan Li
- School of Food and Biological Engineering, Jiangsu University, Zhenjiang, 212013, PR China
| | - Quansheng Chen
- College of Ocean Food and Biological Engineering, Jimei University, Xiamen, 361021, PR China; School of Food and Biological Engineering, Jiangsu University, Zhenjiang, 212013, PR China.
| |
Collapse
|
19
|
Felizzato G, Sabo M, Petrìk M, Romolo FS. Laser Desorption-Ion Mobility Spectrometry of Explosives for Forensic and Security Applications. Molecules 2025; 30:138. [PMID: 39795197 PMCID: PMC11722068 DOI: 10.3390/molecules30010138] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2024] [Revised: 12/10/2024] [Accepted: 12/24/2024] [Indexed: 01/13/2025] Open
Abstract
BACKGROUND The detection of explosives in crime scene investigations is critical for forensic science. This study explores the application of laser desorption (LD) ion mobility spectrometry (IMS) as a novel method for this purpose utilising a new IMS prototype developed by MaSaTECH. METHODS The LD sampling technique employs a laser diode module to vaporise explosive traces on surfaces, allowing immediate analysis by IMS without sample preparation. Chemometric approaches, including multivariate data analysis, were utilised for data processing and interpretation, including pre-processing of raw IMS plasmagrams and various pattern recognition techniques, such as linear discriminant analysis (LDA) and support vector machines (SVMs). RESULTS The IMS prototype was validated through experiments with pure explosives (TNT, RDX, PETN) and explosive products (SEMTEX 1A, C4) on different materials. The study found that the pre-processing method significantly impacts classification accuracy, with the PCA-LDA model demonstrating the best performance for real-world applications. CONCLUSIONS The LD-IMS prototype, coupled with effective chemometric techniques, presents a promising methodology for the detection of explosives in forensic investigations, enhancing the reliability of field applications.
Collapse
Affiliation(s)
- Giorgio Felizzato
- Department of Law, University of Bergamo, Via Moroni 255, 24127 Bergamo, Italy;
- Department of Drug Science and Technology, University of Turin, Via Giuria 9, 10125 Torino, Italy
| | - Martin Sabo
- MaSa Tech, s.r.o., Sadová 3018/10, 916 01 Stará Turá, Slovakia; (M.S.)
- Faculty of Informatics and Information Technologies, Slovak University of Technology in Bratislava, Ilkovičova 2, Bratislava 4, 842 16 Bratislava, Slovakia
| | - Matej Petrìk
- MaSa Tech, s.r.o., Sadová 3018/10, 916 01 Stará Turá, Slovakia; (M.S.)
- Faculty of Informatics and Information Technologies, Slovak University of Technology in Bratislava, Ilkovičova 2, Bratislava 4, 842 16 Bratislava, Slovakia
| | | |
Collapse
|
20
|
Bas Dit Nugues M, Lamassoure L, Rosi G, Flouzat-Lachaniette CH, Khonsari RH, Haiat G. An Instrumented Hammer to Detect the Rupture of the Pterygoid Plates. Ann Biomed Eng 2025; 53:59-70. [PMID: 39174762 PMCID: PMC11782435 DOI: 10.1007/s10439-024-03596-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2023] [Accepted: 07/29/2024] [Indexed: 08/24/2024]
Abstract
PURPOSE Craniofacial osteotomies involving pterygomaxillary disjunction are common procedures in maxillofacial surgery. Surgeons still rely on their proprioception to determine when to stop impacting the osteotome, which is important to avoid complications such as dental damage and bleeding. Our group has developed a technique consisting in using an instrumented hammer that can provide information on the mechanical properties of the tissue located around the osteotome tip. The aim of this study is to determine whether a mallet instrumented with a force sensor can be used to predict the crossing of the osteotome through the pterygoid plates. METHODS 31 osteotomies were carried out in 16 lamb skulls. For each impact, the force signal obtained was analysed using a dedicated signal processing technique. A prediction algorithm based on an SVM classifier and a cost matrix was applied to the database. RESULTS We showed that the device could always detect the crossing of the osteotome, sometimes before its occurrence. The prediction accuracy of the device was 94.7%. The method seemed to be sensitive to the thickness of the plate and to crack apparition and propagation. CONCLUSION These results pave the way for the development of a per-operative decision support system in maxillofacial surgery.
Collapse
Affiliation(s)
- Manon Bas Dit Nugues
- CNRS, Univ Paris Est Creteil, Univ Gustave Eiffel, UMR 8208, MSME, 61, Avenue du Général de Gaulle, 94010, Créteil Cedex, France
| | - Leo Lamassoure
- APHP, Hôpital Henri-Mondor, Service de Chirurgie Orthopédique, 94010, Créteil, France
| | - Giuseppe Rosi
- Univ Paris Est Creteil, Univ Gustave Eiffel, CNRS, UMR 8208, MSME, 94010, Créteil, France
| | - Charles Henri Flouzat-Lachaniette
- INSERM U955, IMRB Université Paris-Est, 51 avenue du Maréchal de Lattre de Tassigny, 94000, Créteil, France
- Service de Chirurgie Orthopédique et Traumatologique, Hôpital Henri Mondor AP-HP, Université Paris-Est, CHU Paris 12, 51 avenue du Maréchal de Lattre de Tassigny, 94000, Créteil, France
| | - Roman Hossein Khonsari
- APHP, Hôpital Necker-Enfants Malades, Service de Chirurgie maxillo-faciale et chirurgie plastique, Laboratoire 'Forme et Croissance du Crâne', 75015, Paris, France
| | - Guillaume Haiat
- CNRS, Univ Paris Est Creteil, Univ Gustave Eiffel, UMR 8208, MSME, 61, Avenue du Général de Gaulle, 94010, Créteil Cedex, France.
| |
Collapse
|
21
|
Jalili A, Pan F, Draayer JP, Chen AX, Ren Z. α-decay half-life predictions with support vector machine. Sci Rep 2024; 14:30776. [PMID: 39730474 DOI: 10.1038/s41598-024-80820-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2024] [Accepted: 11/21/2024] [Indexed: 12/29/2024] Open
Abstract
In this study, we investigate the application of support vector machines utilizing a radial basis function kernel for predicting nuclear α-decay half-lives. Our approach integrates a comprehensive set of physics-derived features, including characteristics derived from nuclear structure, to systematically evaluate their impact on predictive accuracy. In addition to traditional parameters such as proton and neutron numbers, as well as terms based on the liquid drop model (e.g., volume, surface, Coulomb features), we incorporate decay energies and orbital angular momentum quantum numbers for both parent and daughter nuclei. Our analysis of 2232 nuclear data points demonstrates that the use of the radial basis function kernel yields predictive models with root mean square errors of 0.819 (for set1) and 0.352 (for set2), aligning with results obtained from comparable machine learning methodologies. Furthermore, Shapley additive explanations values highlight the predominant role of parent nuclei in predicting α-decay half-lives within the support vector machines. These results highlight the effectiveness of machine learning in nuclear structure research, opening up new possibilities for predicting the α-decay half-lives of previously unstudied nuclei.
Collapse
Affiliation(s)
- Amir Jalili
- Department of Physics, Zhejiang Sci-Tech University, Hangzhou, 310018, People's Republic of China.
- School of Physics, Nankai University, Tianjin, 300071, People's Republic of China.
| | - Feng Pan
- Department of Physics, Liaoning Normal University, Dalian, 116029, People's Republic of China
- Department of Physics and Astronomy, Louisiana State University, Baton Rouge, LA, 70803-4001, USA
| | - Jerry P Draayer
- Department of Physics and Astronomy, Louisiana State University, Baton Rouge, LA, 70803-4001, USA
| | - Ai-Xi Chen
- Department of Physics, Zhejiang Sci-Tech University, Hangzhou, 310018, People's Republic of China.
| | - Zhongzhou Ren
- School of Physics Science and Engineering, Tongji University, Shanghai, 200092, People's Republic of China
| |
Collapse
|
22
|
Huang S, Ishii H. A Generalized Multi-Detector Combination Approach for Differential Item Functioning Detection. APPLIED PSYCHOLOGICAL MEASUREMENT 2024:01466216241310602. [PMID: 39713763 PMCID: PMC11660104 DOI: 10.1177/01466216241310602] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/29/2023] [Revised: 09/17/2024] [Accepted: 09/29/2024] [Indexed: 12/24/2024]
Abstract
Many studies on differential item functioning (DIF) detection rely on single detection methods (SDMs), each of which necessitates specific assumptions that may not always be validated. Using an inappropriate SDM can lead to diminished accuracy in DIF detection. To address this limitation, a novel multi-detector combination (MDC) approach is proposed. Unlike SDMs, MDC effectively evaluates the relevance of different SDMs under various test conditions and integrates them using supervised learning, thereby mitigating the risk associated with selecting a suboptimal SDM for DIF detection. This study aimed to validate the accuracy of the MDC approach by applying five types of SDMs and four distinct supervised learning methods in MDC modeling. Model performance was assessed using the area under the curve (AUC), which provided a comprehensive measure of the ability of the model to distinguish between classes across all threshold levels, with higher AUC values indicating higher accuracy. The MDC methods consistently achieved higher average AUC values compared to SDMs in both matched test sets (where test conditions align with the training set) and unmatched test sets. Furthermore, MDC outperformed all SDMs under each test condition. These findings indicated that MDC is highly accurate and robust across diverse test conditions, establishing it as a viable method for practical DIF detection.
Collapse
|
23
|
Che Y, Zhao M, Gao Y, Zhang Z, Zhang X. Application of machine learning for mass spectrometry-based multi-omics in thyroid diseases. Front Mol Biosci 2024; 11:1483326. [PMID: 39741929 PMCID: PMC11685090 DOI: 10.3389/fmolb.2024.1483326] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2024] [Accepted: 12/02/2024] [Indexed: 01/03/2025] Open
Abstract
Thyroid diseases, including functional and neoplastic diseases, bring a huge burden to people's health. Therefore, a timely and accurate diagnosis is necessary. Mass spectrometry (MS) based multi-omics has become an effective strategy to reveal the complex biological mechanisms of thyroid diseases. The exponential growth of biomedical data has promoted the applications of machine learning (ML) techniques to address new challenges in biology and clinical research. In this review, we presented the detailed review of applications of ML for MS-based multi-omics in thyroid disease. It is primarily divided into two sections. In the first section, MS-based multi-omics, primarily proteomics and metabolomics, and their applications in clinical diseases are briefly discussed. In the second section, several commonly used unsupervised learning and supervised algorithms, such as principal component analysis, hierarchical clustering, random forest, and support vector machines are addressed, and the integration of ML techniques with MS-based multi-omics data and its application in thyroid disease diagnosis is explored.
Collapse
Affiliation(s)
- Yanan Che
- School of Pharmaceutical Science and Technology, Tianjin University, Tianjin, China
| | - Meng Zhao
- School of Pharmaceutical Science and Technology, Tianjin University, Tianjin, China
| | - Yan Gao
- School of Pharmaceutical Science and Technology, Tianjin University, Tianjin, China
| | - Zhibin Zhang
- Department of General Surgery, Tianjin First Central Hospital, Tianjin, China
| | - Xiangyang Zhang
- School of Pharmaceutical Science and Technology, Tianjin University, Tianjin, China
| |
Collapse
|
24
|
Elashmawi WH, Djellal A, Sheta A, Surani S, Aljahdali S. Machine Learning for Enhanced COPD Diagnosis: A Comparative Analysis of Classification Algorithms. Diagnostics (Basel) 2024; 14:2822. [PMID: 39767182 PMCID: PMC11674508 DOI: 10.3390/diagnostics14242822] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2024] [Revised: 12/08/2024] [Accepted: 12/12/2024] [Indexed: 01/11/2025] Open
Abstract
Background: In the United States, chronic obstructive pulmonary disease (COPD) is a significant cause of mortality. As far as we know, it is a chronic, inflammatory lung condition that cuts off airflow to the lungs. Many symptoms have been reported for such a disease: breathing problems, coughing, wheezing, and mucus production. Patients with COPD might be at risk, since they are more susceptible to heart disease and lung cancer. Methods: This study reviews COPD diagnosis utilizing various machine learning (ML) classifiers, such as Logistic Regression (LR), Gradient Boosting Classifier (GBC), Support Vector Machine (SVM), Gaussian Naïve Bayes (GNB), Random Forest Classifier (RFC), K-Nearest Neighbors Classifier (KNC), Decision Tree (DT), and Artificial Neural Network (ANN). These models were applied to a dataset comprising 1603 patients after being referred for a pulmonary function test. Results: The RFC has achieved superior accuracy, reaching up to 82.06% in training and 70.47% in testing. Furthermore, it achieved a maximum F score in training and testing with an ROC value of 0.0.82. Conclusions: The results obtained with the utilized ML models align with previous work in the field, with accuracies ranging from 67.81% to 82.06% in training and from 66.73% to 71.46% in testing.
Collapse
Affiliation(s)
- Walaa H. Elashmawi
- Department of Computer Science, Suez Canal University, Ismailia 41522, Egypt
- Department of Computer Science, Misr International University, Cairo 11828, Egypt
| | - Adel Djellal
- Department of Electronics, Electrotechnics, and Automation (EEA), National Higher School of Technology and Engineering, Annaba 23000, Algeria;
| | - Alaa Sheta
- Computer Science Department, Southern Connecticut State University, New Haven, CT 06515, USA;
| | - Salim Surani
- Department of Pharmacy & Medicine, Texas A&M University, College Station, TX 75428, USA;
| | - Sultan Aljahdali
- Computer Science Department, Taif University, Taif 21944, Saudi Arabia;
| |
Collapse
|
25
|
de Jong TMH, Stamatelou E, Rosema NAM, Jansen IDC, Brandt BW, Angelakis A, Loos BG, van der Velden U, Danser MM. Effect of Daily Vitamin C Supplementation with or Without Flavonoids on Periodontal, Microbial, and Systemic Conditions Before and After Periodontal Therapy: A Case Series from an RCT. J Clin Med 2024; 13:7571. [PMID: 39768497 PMCID: PMC11678909 DOI: 10.3390/jcm13247571] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2024] [Revised: 11/22/2024] [Accepted: 12/07/2024] [Indexed: 01/11/2025] Open
Abstract
Purpose: To investigate the effect of vitamin C supplementation with or without flavonoids on periodontal conditions, and microbial and systemic variables before and after non-surgical periodontal treatment (NSPT). Materials and Methods: A case series derived from a randomized controlled trial was conducted to explore the effects of daily vitamin C supplementation, with or without flavonoids, on periodontal conditions. The study population was recruited from patients with periodontitis who had been referred to the Department of Periodontology at the Academic Centre for Dentistry Amsterdam (ACTA). The study consisted of a 2-month observation of untreated periodontitis followed by a 3-month period after NSPT. Descriptive statistics, correlation and clustering analyses, and dimensionality reduction methods were used to evaluate the interventions' impact. Results: Due to COVID-19, the study was prematurely terminated and reported findings from 13 patients. Results indicate a correlation between higher plasma vitamin C levels and reduced gingival inflammation, suggesting benefits for untreated periodontal conditions. Clustering analysis showed no differences based on supplementation type, indicating it did not affect outcomes, and microbiological data had limited effects. Principal Component Analysis visualized clusters and illustrated no distinct groups corresponding to supplementation types. Violin plots highlighted variability, with one cluster comprising individuals with more severe periodontal conditions. Conclusions: Higher plasma vitamin C levels were associated with lower gingival inflammation. However, daily vitamin C supplementation, with or without flavonoids, did not show additional benefits on periodontal conditions before or after treatment. Clustering suggests that periodontal severity, rather than supplementation, influenced patient profiles. The study's small sample size limits the generalizability of the findings.
Collapse
Affiliation(s)
- Thijs M. H. de Jong
- Department of Periodontology, Academic Centre for Dentistry Amsterdam (ACTA), University of Amsterdam and Vrije Universiteit Amsterdam, Gustav Mahlerlaan 3004, 1081 LA Amsterdam, The Netherlands
| | - Eleni Stamatelou
- Department of Periodontology, Academic Centre for Dentistry Amsterdam (ACTA), University of Amsterdam and Vrije Universiteit Amsterdam, Gustav Mahlerlaan 3004, 1081 LA Amsterdam, The Netherlands
| | - Nanning A. M. Rosema
- Department of Periodontology, Academic Centre for Dentistry Amsterdam (ACTA), University of Amsterdam and Vrije Universiteit Amsterdam, Gustav Mahlerlaan 3004, 1081 LA Amsterdam, The Netherlands
| | - Ineke D. C. Jansen
- Department of Periodontology, Academic Centre for Dentistry Amsterdam (ACTA), University of Amsterdam and Vrije Universiteit Amsterdam, Gustav Mahlerlaan 3004, 1081 LA Amsterdam, The Netherlands
| | - Bernd W. Brandt
- Department of Preventive Dentistry, Academic Centre for Dentistry Amsterdam (ACTA), University of Amsterdam and Vrije Universiteit Amsterdam, Gustav Mahlerlaan 3004, 1081 LA Amsterdam, The Netherlands
| | - Athanasios Angelakis
- Department of Epidemiology and Data Science, Amsterdam University Medical Center, Location AMC, Meibergdreef 9, 1105 AZ Amsterdam, The Netherlands
- Digital Health and Methodology, Amsterdam Public Health Research Institute, Location AMC, Meibergdreef 9, 1105 AZ Amsterdam, The Netherlands
- Data Science Center, University of Amsterdam, Singel 425, 1012 WP Amsterdam, The Netherlands
| | - Bruno G. Loos
- Department of Periodontology, Academic Centre for Dentistry Amsterdam (ACTA), University of Amsterdam and Vrije Universiteit Amsterdam, Gustav Mahlerlaan 3004, 1081 LA Amsterdam, The Netherlands
| | - Ubele van der Velden
- Department of Periodontology, Academic Centre for Dentistry Amsterdam (ACTA), University of Amsterdam and Vrije Universiteit Amsterdam, Gustav Mahlerlaan 3004, 1081 LA Amsterdam, The Netherlands
| | - Monique M. Danser
- Department of Periodontology, Academic Centre for Dentistry Amsterdam (ACTA), University of Amsterdam and Vrije Universiteit Amsterdam, Gustav Mahlerlaan 3004, 1081 LA Amsterdam, The Netherlands
| |
Collapse
|
26
|
Teklemariam TA, Chou F, Kumaravel P, Van Buskrik J. ATR-FTIR spectroscopy and machine/deep learning models for detecting adulteration in coconut water with sugars, sugar alcohols, and artificial sweeteners. SPECTROCHIMICA ACTA. PART A, MOLECULAR AND BIOMOLECULAR SPECTROSCOPY 2024; 322:124771. [PMID: 39032237 DOI: 10.1016/j.saa.2024.124771] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/06/2024] [Revised: 06/12/2024] [Accepted: 07/02/2024] [Indexed: 07/23/2024]
Abstract
Packaged coconut water offers various options, from pure to those with added sugars and other additives. While the purity of coconut water is esteemed for its health benefits, its popularity also exposes it to potential adulteration and misrepresentation. To address this concern, our study combines Fourier transform infrared spectroscopy (FTIR) and machine learning techniques to detect potential adulterants in coconut water through classification models. The dataset comprises infrared spectra from coconut water samples spiked with 15 different types of potential sugar substitutes, including: sugars, artificial sweeteners, and sugar alcohols. The interaction of infrared light with molecular bonds generates unique molecular fingerprints, forming the basis of our analysis. Departing from previous research predominantly reliant on linear-based chemometrics for adulterant detection, our study explored linear, non-linear, and combined feature extraction models. By developing an interactive application utilizing principal component analysis (PCA) and t-distributed stochastic neighbor embedding (t-SNE), non-targeted sugar adulterant detection was streamlined through enhanced visualization and pattern recognition. Targeted analysis using ensemble learning random forest (RF) and deep learning 1-dimensional convolutional neural network (1D CNN) achieved higher classification accuracies (95% and 96%, respectively) compared to sparse partial least squares discriminant analysis (sPLS-DA) at 77% and support vector machine (SVM) at 88% on the same dataset. The CNN's demonstrated classification accuracy is complemented by exceptional efficiency through its ability to train and test on raw data.
Collapse
Affiliation(s)
- Thomas A Teklemariam
- Canadian Food Inspection Agency, Greater Toronto Area Laboratory, 2301 Midland Avenue, Toronto, ON M1P 4R7, Canada.
| | - Faith Chou
- Canadian Food Inspection Agency, 1400 Merivale Road, Ottawa, ON K1A 0Y9, Canada
| | - Pavisha Kumaravel
- University of Guelph, Molecular and Cellular Biology, Guelph, ON N1G 2W1, Canada
| | - Jeremy Van Buskrik
- Canadian Food Inspection Agency, Greater Toronto Area Laboratory, 2301 Midland Avenue, Toronto, ON M1P 4R7, Canada
| |
Collapse
|
27
|
Odugbemi AI, Nyirenda C, Christoffels A, Egieyeh SA. Artificial intelligence in antidiabetic drug discovery: The advances in QSAR and the prediction of α-glucosidase inhibitors. Comput Struct Biotechnol J 2024; 23:2964-2977. [PMID: 39148608 PMCID: PMC11326494 DOI: 10.1016/j.csbj.2024.07.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2024] [Revised: 07/03/2024] [Accepted: 07/03/2024] [Indexed: 08/17/2024] Open
Abstract
Artificial Intelligence is transforming drug discovery, particularly in the hit identification phase of therapeutic compounds. One tool that has been instrumental in this transformation is Quantitative Structure-Activity Relationship (QSAR) analysis. This computer-aided drug design tool uses machine learning to predict the biological activity of new compounds based on the numerical representation of chemical structures against various biological targets. With diabetes mellitus becoming a significant health challenge in recent times, there is intense research interest in modulating antidiabetic drug targets. α-Glucosidase is an antidiabetic target that has gained attention due to its ability to suppress postprandial hyperglycaemia, a key contributor to diabetic complications. This review explored a detailed approach to developing QSAR models, focusing on strategies for generating input variables (molecular descriptors) and computational approaches ranging from classical machine learning algorithms to modern deep learning algorithms. We also highlighted studies that have used these approaches to develop predictive models for α-glucosidase inhibitors to modulate this critical antidiabetic drug target.
Collapse
Affiliation(s)
- Adeshina I Odugbemi
- South African Medical Research Council Bioinformatics Unit, South African National Bioinformatics Institute, University of the Western Cape, Bellville, Cape Town 7535, South Africa
- School of Pharmacy, University of the Western Cape, Bellville, Cape Town 7535, South Africa
- National Institute for Theoretical and Computational Sciences (NITheCS), South Africa
| | - Clement Nyirenda
- Department of Computer Science, University of the Western Cape, Cape Town 7535, South Africa
| | - Alan Christoffels
- South African Medical Research Council Bioinformatics Unit, South African National Bioinformatics Institute, University of the Western Cape, Bellville, Cape Town 7535, South Africa
- Africa Centres for Disease Control and Prevention, African Union, Addis Ababa, Ethiopia
| | - Samuel A Egieyeh
- School of Pharmacy, University of the Western Cape, Bellville, Cape Town 7535, South Africa
- National Institute for Theoretical and Computational Sciences (NITheCS), South Africa
| |
Collapse
|
28
|
Wang Y, Shangguan C, Li S, Zhang W. Negative Emotion Differentiation Promotes Cognitive Reappraisal: Evidence From Electroencephalogram Oscillations and Phase-Amplitude Coupling. Hum Brain Mapp 2024; 45:e70092. [PMID: 39651732 PMCID: PMC11626486 DOI: 10.1002/hbm.70092] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2024] [Revised: 11/06/2024] [Accepted: 11/15/2024] [Indexed: 12/11/2024] Open
Abstract
Cognitive reappraisal, an effective emotion regulation strategy, is influenced by various individual factors. Although previous studies have established a link between negative emotion differentiation (NED) and cognitive reappraisal, the underlying neural mechanisms remain largely unknown. Using electroencephalography, this study investigates the influence and neural basis of NED in cognitive reappraisal by integrating aspects of event-related potentials, neural oscillation rhythms, and cross-frequency coupling. The findings revealed that individuals with high NED demonstrated a significant decrease in parietal late positive potential amplitudes during cognitive reappraisal, suggesting enhanced cognitive reappraisal abilities. Moreover, high NED individuals displayed increased γ synchronization, parietal α-γ coupling, and frontal θ-γ coupling when reappraising negative emotions than those with low emotion differentiation ability. Machine learning analysis of these neural indicators highlighted the superior classification and predictive accuracy of multimodal indicators for NED as opposed to unimodal indicators. Overall, this multimodal evidence provides a comprehensive interpretation of the neurophysiological mechanisms through which NED influences cognitive reappraisal and provides preliminary empirical support for personalized cognitive reappraisal interventions to alleviate emotional problems.
Collapse
Affiliation(s)
- Yali Wang
- School of MarxismZhejiang University of Finance and EconomicsHangzhouChina
| | - Chenyu Shangguan
- College of Education Science and TechnologyNanjing University of Posts and TelecommunicationsNanjingChina
| | - Sijin Li
- School of PsychologyShenzhen UniversityShenzhenChina
| | - Wenhai Zhang
- Mental Health Education CenterYancheng Institute of TechnologyYanchengChina
- The Big Data Centre for Neuroscience and AIHengyang Normal UniversityHengyangChina
| |
Collapse
|
29
|
Herath A, Tiozon RJ, Kretzschmar T, Sreenivasulu N, Mahon P, Butardo V. Machine learning approach for high-throughput phenolic antioxidant screening in black Rice germplasm collection based on surface FTIR. Food Chem 2024; 460:140728. [PMID: 39121772 DOI: 10.1016/j.foodchem.2024.140728] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2024] [Revised: 07/13/2024] [Accepted: 07/30/2024] [Indexed: 08/12/2024]
Abstract
Pigmented rice contains beneficial phenolic antioxidants but analysing them across germplasm collections is laborious and time-consuming. Here we utilised rapid surface Fourier transform infrared (FTIR) spectroscopy and machine learning algorithms (ML) to predict and classify polyphenolic antioxidants. Total phenolics, flavonoids, anthocyanins, and proanthocyanidins were quantified biochemically from 270 diverse global coloured rice collection and attenuated total reflectance (ATR) FTIR spectra were obtained by scanning whole grain surfaces at 800-4000 cm-1. Five ML classification models were optimised using the biochemical and spectral data which performed predictions with 93.5%-100% accuracy. Random Forest and Support Vector Machine models identified key FTIR peaks linked to flavonols, flavones and anthocyanins as important model predictors. This research successfully established direct and non-destructive surface chemistry spectroscopy of the aleurone layer of pigmented rice integrated with ML models as a viable high-throughput platform to accelerate the analysis and profiling of nutritionally valuable coloured rice varieties.
Collapse
Affiliation(s)
- Achini Herath
- Department of Chemistry and Biotechnology, School of Science, Computing, and Engineering Technologies, Swinburne University of Technology, Hawthorn, Victoria, Australia
| | - Rhowell Jr Tiozon
- Consumer-driven Grain Quality and Nutrition Research Unit, International Rice Research Institute, Los Banos, Laguna, Philippines
| | - Tobias Kretzschmar
- Plant Science, Faculty of Science and Engineering, Southern Cross University, Lismore, NSW, Australia
| | - Nese Sreenivasulu
- Consumer-driven Grain Quality and Nutrition Research Unit, International Rice Research Institute, Los Banos, Laguna, Philippines
| | - Peter Mahon
- Department of Chemistry and Biotechnology, School of Science, Computing, and Engineering Technologies, Swinburne University of Technology, Hawthorn, Victoria, Australia
| | - Vito Butardo
- Department of Chemistry and Biotechnology, School of Science, Computing, and Engineering Technologies, Swinburne University of Technology, Hawthorn, Victoria, Australia.
| |
Collapse
|
30
|
Wisuthiphaet N, Zhang H, Liu X, Nitin N. Detection of Escherichia coli Using Bacteriophage T7 and Analysis of Excitation‑Emission Matrix Fluorescence Spectroscopy. J Food Prot 2024; 87:100396. [PMID: 39521134 DOI: 10.1016/j.jfp.2024.100396] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2024] [Revised: 10/11/2024] [Accepted: 10/29/2024] [Indexed: 11/16/2024]
Abstract
Conventional detection methods require the isolation and enrichment of bacteria, followed by molecular, biochemical, or culture-based analysis. To address some of the limitations of conventional methods, this study develops a machine learning (ML) approach to analyze the excitation-emission matrix (EEM) fluorescence data generated based on bacteriophage T7 and Escherichia coli interactions for in-situ detection of live bacteria in the presence of fresh produce homogenate. We trained classification models using various ML algorithms based on the 3-D EEM data generated with bacteria and their interactions with a T7 phage. These ML algorithms, including linear Support Vector Classifier (SVC) and Random Forest (RF), demonstrate high accuracy (>0.85) for detecting E. coli at 102 CFU/ml concentration within 6 h. Additionally, these ML models can differentiate among different E. coli concentration levels. For example, the Gaussian Process model achieved an accuracy of 92% in detecting different concentration levels of live E. coli. Application of these ML methods to detect E. coli in spinach homogenate yielded an accuracy of 89% using the linear-SVC model. Furthermore, feature selection techniques were employed to reduce the dimensionality of the data, revealing that only six features were necessary for achieving classification accuracy (>0.85) of spinach homogenate samples containing 102 CFU/ml of E. coli. These findings highlight the potential of this novel bacterial detection methodology, offering rapid, specific, and efficient solutions for applications in food safety and environmental monitoring.
Collapse
Affiliation(s)
- Nicharee Wisuthiphaet
- Department of Biotechnology, Faculty of Applied Science, King Mongkut's University of Technology North Bangkok, Bangkok, Thailand
| | - Huanle Zhang
- School of Computer Science and Technology, Shandong University, Shandong, China
| | - Xin Liu
- Department of Computer Science, University of California, Davis, Davis, California, United States
| | - Nitin Nitin
- Department of Food Science & Technology, University of California, Davis, Davis, California, United States; Department of Biological & Agricultural Engineering, University of California, Davis, Davis, California, United States.
| |
Collapse
|
31
|
Su S, Gao J, Dong J, Guo Q, Ma H, Luan S, Zheng X, Tao H, Zhou L, Dai Y. Prediction of mortality in hemodialysis patients based on autoencoders. Int J Med Inform 2024; 195:105744. [PMID: 39642591 DOI: 10.1016/j.ijmedinf.2024.105744] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2024] [Revised: 10/05/2024] [Accepted: 11/30/2024] [Indexed: 12/09/2024]
Abstract
BACKGROUND Patients with end-stage renal disease (ESRD) undergoing hemodialysis (HD) exhibit a high mortality risk, particularly at the onset of treatment. Conventional risk assessment models, dependent on extensive temporal data accumulation, frequently encounter issues of data incompleteness and lengthy collection periods. OBJECTIVE This study addresses the imbalance in short-term HD data and the issue of missing data features, achieving a robust assessment of mortality risk for HD patients over the subsequent 30 to 450 days. METHODS An autoencoder-based mortality prediction model for HD patients is proposed. Leveraging the manifold structure of the non-missing features and the intrinsic relationship between the features in the high-dimensional data space, the model infers the values of the missing features. Noise and redundant information typically distort the manifold structure, impacting the accuracy of inferences about missing features. Consequently, we generate feature dropping masks to simulate the missing data distribution in the deep learning framework and design an autoencoder, forming an adaptive feature extraction module. The module utilizes readily available short-term data for unsupervised learning, enabling the encoder to reconstruct missing features and derive latent representations. Finally, a classifier based on the latent representations achieves the mortality prediction. RESULTS Over a 30-day observation window, the model demonstrated superior mortality prediction performance compared to other models across all prediction windows. Feature importance analysis showed that creatinine and age are consistently the most critical features across all prediction windows. Glucose (fasting) and platelet count also remain significant, with their correlation with mortality risk increasing over time. Serum albumin, international standard ratio, and phosphate are particularly important for short-term predictions, while conjugated bilirubin and prothrombin time gain prominence in mid- and long-term predictions. CONCLUSION The proposed model proficiently leverages short-term HD data to provide precise mortality risk evaluations in HD patients, with particular efficacy in the short-term. Its application holds considerable value for clinical decision-making and risk management in this patient population.
Collapse
Affiliation(s)
- Shuzhi Su
- Joint Research Center for Occupational Medicine and Health of IHM, Anhui University of Science & Technology, Huainan 232001, PR China; School of Computer Science and Engineering, Anhui University of Science & Technology, Huainan, Anhui 232001, PR China; The First Hospital, Anhui University of Science & Technology, Huainan 232001, PR China
| | - Jisheng Gao
- School of Computer Science and Engineering, Anhui University of Science & Technology, Huainan, Anhui 232001, PR China
| | - Jingjing Dong
- Department of General Medicine, Zhejiang Cancer Hospital, Hangzhou Institute of Medicine (HIM), Chinese Academy of Sciences, Hangzhou 310022, PR China
| | - Qi Guo
- Department of Nephrology, the Second Affiliated Hospital of Jinan University, Shenzhen People's Hospital, Jinan University, Shenzhen 518020, PR China
| | - Hualin Ma
- Department of Nephrology, the Second Affiliated Hospital of Jinan University, Shenzhen People's Hospital, Jinan University, Shenzhen 518020, PR China
| | - Shaodong Luan
- Departments of Nephrology, Shenzhen Longhua District Central Hospital, Shenzhen 518020, PR China
| | - Xuejia Zheng
- The First Hospital, Anhui University of Science & Technology, Huainan 232001, PR China
| | - Huihui Tao
- School of Medicine, Anhui University of Science & Technology, Huainan 232001, PR China
| | - Lingling Zhou
- School of Medicine, Anhui University of Science & Technology, Huainan 232001, PR China
| | - Yong Dai
- School of Medicine, Anhui University of Science & Technology, Huainan 232001, PR China.
| |
Collapse
|
32
|
Amouei Sheshkal S, Gundersen M, Alexander Riegler M, Aass Utheim Ø, Gunnar Gundersen K, Rootwelt H, Prestø Elgstøen KB, Lewi Hammer H. Classifying Dry Eye Disease Patients from Healthy Controls Using Machine Learning and Metabolomics Data. Diagnostics (Basel) 2024; 14:2696. [PMID: 39682603 DOI: 10.3390/diagnostics14232696] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2024] [Revised: 11/16/2024] [Accepted: 11/21/2024] [Indexed: 12/18/2024] Open
Abstract
Background: Dry eye disease is a common disorder of the ocular surface, leading patients to seek eye care. Clinical signs and symptoms are currently used to diagnose dry eye disease. Metabolomics, a method for analyzing biological systems, has been found helpful in identifying distinct metabolites in patients and in detecting metabolic profiles that may indicate dry eye disease at early stages. In this study, we explored the use of machine learning and metabolomics data to identify cataract patients who suffer from dry eye disease, a topic that, to our knowledge, has not been previously explored. As there is no one-size-fits-all machine learning model for metabolomics data, choosing the most suitable model can significantly affect the quality of predictions and subsequent metabolomics analyses. Methods: To address this challenge, we conducted a comparative analysis of eight machine learning models on two metabolomics data sets from cataract patients with and without dry eye disease. The models were evaluated and optimized using nested k-fold cross-validation. To assess the performance of these models, we selected a set of suitable evaluation metrics tailored to the data set's challenges. Results: The logistic regression model overall performed the best, achieving the highest area under the curve score of 0.8378, balanced accuracy of 0.735, Matthew's correlation coefficient of 0.5147, an F1-score of 0.8513, and a specificity of 0.5667. Additionally, following the logistic regression, the XGBoost and Random Forest models also demonstrated good performance. Conclusions: The results show that the logistic regression model with L2 regularization can outperform more complex models on an imbalanced data set with a small sample size and a high number of features, while also avoiding overfitting and delivering consistent performance across cross-validation folds. Additionally, the results demonstrate that it is possible to identify dry eye in cataract patients from tear film metabolomics data using machine learning models.
Collapse
Affiliation(s)
- Sajad Amouei Sheshkal
- Department of Computer Science, Oslo Metropolitan University, 0166 Oslo, Norway
- Department of Holistic Systems, SimulaMet, 0167 Oslo, Norway
- Ifocus Eye Clinic, 5527 Haugesund, Norway
| | - Morten Gundersen
- Ifocus Eye Clinic, 5527 Haugesund, Norway
- Department of Life Sciences and Health, Oslo Metropolitan University, 0166 Oslo, Norway
| | - Michael Alexander Riegler
- Department of Computer Science, Oslo Metropolitan University, 0166 Oslo, Norway
- Department of Holistic Systems, SimulaMet, 0167 Oslo, Norway
| | - Øygunn Aass Utheim
- Department of Ophthalmology, Oslo University Hospital, 0450 Oslo, Norway
| | | | - Helge Rootwelt
- Department of Medical Biochemistry, Oslo University Hospital, 0450 Oslo, Norway
| | | | - Hugo Lewi Hammer
- Department of Computer Science, Oslo Metropolitan University, 0166 Oslo, Norway
- Department of Holistic Systems, SimulaMet, 0167 Oslo, Norway
| |
Collapse
|
33
|
Yagi M, Sakai A, Yasutomi S, Suzuki K, Kashikura H, Goto K. Assessment of Tail-Cutting in Frozen Albacore ( Thunnus alalunga) Through Ultrasound Inspection and Chemical Analysis. Foods 2024; 13:3860. [PMID: 39682932 DOI: 10.3390/foods13233860] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2024] [Revised: 11/18/2024] [Accepted: 11/20/2024] [Indexed: 12/18/2024] Open
Abstract
Fat content is the main criterion for evaluating albacore quality. However, no reports exist on the accuracy of the tail-cutting method, a method used to assess the fat content of albacore. Here, we evaluated this method by comparing it with chemical analysis and ultrasound inspection. We measured the actual fat content in albacore using chemical analysis and compared the results with those obtained using the tail-cutting method. Significant discrepancies (99% CI, t-test) were observed in fat content among the tail-cutting samples. Using chemical analysis as the ground truth, the accuracy of tail-cutting from two different companies was 70.0% for company A and 51.9% for company B. An ultrasound inspection revealed that a higher fat content reduced the amplitude of ultrasound signals with statistical significance (99% CI, t-test). Finally, machine learning algorithms were used to enforce the ultrasound inspection. The best combination of ultrasound inspection and a machine learning algorithm achieved an 84.2% accuracy for selecting fat-rich albacore, which is better than tail-cutting (73.6%). Our findings suggested that ultrasound inspection could be a valuable and non-destructive method for estimating the fat content of albacore, achieving better accuracy than the traditional tail-cutting method.
Collapse
Affiliation(s)
- Masafumi Yagi
- School of Marine Science and Technology, Tokai University, 3-20-1 Orido, Shimizu-ku, Shizuoka-shi 424-8610, Shizuoka, Japan
| | - Akira Sakai
- Artificial Intelligence Laboratory, Fujitsu Limited, 4-1-1 Kamikodanaka, Nakahara-ku, Kawasaki-shi 211-8588, Kanagawa, Japan
| | - Suguru Yasutomi
- Artificial Intelligence Laboratory, Fujitsu Limited, 4-1-1 Kamikodanaka, Nakahara-ku, Kawasaki-shi 211-8588, Kanagawa, Japan
| | - Kanata Suzuki
- Artificial Intelligence Laboratory, Fujitsu Limited, 4-1-1 Kamikodanaka, Nakahara-ku, Kawasaki-shi 211-8588, Kanagawa, Japan
| | - Hiroki Kashikura
- Graduate School of Marine Science and Technology, Tokai University, 3-20-1 Orido, Shimizu-ku, Shizuoka-shi 424-8610, Shizuoka, Japan
| | - Keiichi Goto
- School of Marine Science and Technology, Tokai University, 3-20-1 Orido, Shimizu-ku, Shizuoka-shi 424-8610, Shizuoka, Japan
| |
Collapse
|
34
|
Guo H, Wang J, Zhang D, Cui J, Yuan Y, Bao H, Yang M, Guo J, Chen F, Zhou W, Wu G, Guo Y, Wei H, Qiao B, Zhao S. Mapping surface soil organic carbon density of cultivated land using machine learning in Zhengzhou. ENVIRONMENTAL GEOCHEMISTRY AND HEALTH 2024; 47:1. [PMID: 39607621 PMCID: PMC11604695 DOI: 10.1007/s10653-024-02313-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/12/2024] [Accepted: 11/18/2024] [Indexed: 11/29/2024]
Abstract
Research on soil organic carbon (SOC) is crucial for improving soil carbon sinks and achieving the "double-carbon" goal. This study introduces ten auxiliary variables based on the data from a 2021 land quality survey in Zhengzhou and a multi-objective regional geochemical survey. It uses geostatistical ordinary kriging (OK) interpolation, as well as classical machine learning (ML) models, including random forest (RF) and support vector machine (SVM), to map soil organic carbon density (SOCD) in the topsoil layer (0 - 20 cm) of cultivated land. It partitions the sampling data to assess the generalization capability of the machine learning models, with Zhongmu County designated as an independent test set (dataset2) and the remaining data as the training set (dataset1). The three models are trained using dataset1, and the trained machine learning models are directly applied to dataset2 to evaluate and compare their generalization performance. The distribution of SOCD and SOCS in soils of various types and textures is analyzed using the optimal interpolation method. The results indicated that: (1) The average SOC densities predicted by OK interpolation, RF, and SVM are 3.70, 3.74, and 3.63 kg/m2, with test set precisions (R2) of 0.34, 0.60, and 0.81, respectively. (2) ML achieves a significantly higher predictive precision than traditional OK interpolation. The RF model's precision is 0.21 higher than the SVM model and more precise in estimating carbon stock. (3) When applied to the dataset2, the RF model exhibited superior generalization capabilities (R2 = 0.52, MSE = 0.32) over the SVM model (R2 = 0.32, MSE = 0.45). (4) The spatial distribution of surface SOCD in the study area exhibits a decreasing gradient from west to east and from south to north. The total carbon stock in the study area is estimated at approximately 10.76 × 106t. (5) The integration of soil attribute variables, climatic variables, remote sensing data, and machine learning techniques holds significant promise for the high-precision and high-quality mapping of soil organic carbon density (SOCD) in agricultural soils.
Collapse
Affiliation(s)
- Hengliang Guo
- National Supercomputing Center in Zhengzhou, Zhengzhou University, Zhengzhou, 450001, China
| | - Jinyang Wang
- School of Geoscience and Technology, Zhengzhou University, Zhengzhou, 450001, China
| | - Dujuan Zhang
- National Supercomputing Center in Zhengzhou, Zhengzhou University, Zhengzhou, 450001, China
| | - Jian Cui
- Henan Institute of Geological Survey, Zhengzhou, 450001, China.
- National Engineering Laboratory Geological Remote Sensing Center for Remote Sensing Satellite Application, Zhengzhou, 450001, China.
| | - Yonghao Yuan
- School of Geoscience and Technology, Zhengzhou University, Zhengzhou, 450001, China
| | - Haoming Bao
- School of Geoscience and Technology, Zhengzhou University, Zhengzhou, 450001, China
| | - Mengjiao Yang
- School of Geoscience and Technology, Zhengzhou University, Zhengzhou, 450001, China
| | - Jiahui Guo
- School of Geoscience and Technology, Zhengzhou University, Zhengzhou, 450001, China
| | - Feng Chen
- School of Geoscience and Technology, Zhengzhou University, Zhengzhou, 450001, China
| | - Wenge Zhou
- School of Geoscience and Technology, Zhengzhou University, Zhengzhou, 450001, China
| | - Gang Wu
- National Supercomputing Center in Zhengzhou, Zhengzhou University, Zhengzhou, 450001, China
| | - Yang Guo
- National Supercomputing Center in Zhengzhou, Zhengzhou University, Zhengzhou, 450001, China
| | - Haitao Wei
- School of Geoscience and Technology, Zhengzhou University, Zhengzhou, 450001, China
| | - Baojin Qiao
- School of Geoscience and Technology, Zhengzhou University, Zhengzhou, 450001, China
| | - Shan Zhao
- School of Geoscience and Technology, Zhengzhou University, Zhengzhou, 450001, China
| |
Collapse
|
35
|
Shen C, Lei X, Huang Z. Cold threat and moisture deficit induced individual tree mortality via 25-year monitoring in seminatural mixed forests, northeastern China. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 953:176048. [PMID: 39244065 DOI: 10.1016/j.scitotenv.2024.176048] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/11/2024] [Revised: 08/22/2024] [Accepted: 09/03/2024] [Indexed: 09/09/2024]
Abstract
Accurately predicting tree mortality in mixed forests sets a challenge for conventional models because of large uncertainty, especially under changing climate. Machine learning algorithms had potential for predicting individual tree mortality with higher accuracy via filtering the relevant climatic and environmental factors. In this study, the sensitivity of individual tree mortality to regional climate was validated by modeling in seminatural mixed coniferous forests based on 25-year observations in northeast of China. Three advanced machine learning and deep learning algorithms were employed, including support vector machines, multi-layer perceptron, and random forests. Mortality was predicted by the effects of multiple inherent and environmental factors, including tree size and growth, topography, competition, stand structure and regional climate. All three types of models performed satisfactorily with their values of the areas under receiving operating characteristic curve (AUC) > 0.9. With tree growth, competition and regional climate as input variables, a model based on random forests showed the highest values of the explained variance score (0.862) and AUC (0.914). Since the trees were vulnerable despite their species, mortality could occur after growth limit induced by insufficient or excessive sun radiation during growing seasons, cold threat caused thermal insufficiency in winters, and annual moisture constraints in these mixed coniferous forests. Our findings could enrich basic knowledge on individual tree mortality caused by water and heat inadequacy with the negative impacts of global warming. Successful individual tree mortality modeling via advanced algorithms in mixed forests could assist in adaptive forest ecology modeling in large areas.
Collapse
Affiliation(s)
- Chenchen Shen
- Ecology and Nature Conservation Institute, Chinese Academy of Forestry, Key Laboratory of Forest Ecology and Environment of National Forestry and Grassland Administration, Beijing 100091, China; Hubei Zigui Three Gorges Reservoir National Forest Ecosystem Observation and Research Station, Zigui 443600, China
| | - Xiangdong Lei
- State Key Laboratory of Efficient Production of Forest Resources, Institute of Forest Resource Information Techniques, Chinese Academy of Forestry, Beijing 100091, China.
| | - Zhilin Huang
- Ecology and Nature Conservation Institute, Chinese Academy of Forestry, Key Laboratory of Forest Ecology and Environment of National Forestry and Grassland Administration, Beijing 100091, China; Hubei Zigui Three Gorges Reservoir National Forest Ecosystem Observation and Research Station, Zigui 443600, China; Co-Innovation Center for Sustainable Forestry in Southern China, Nanjing Forestry University, Nanjing 210037, China
| |
Collapse
|
36
|
Luo X, Chi ASY, Lin AH, Ong TJ, Wong L, Rahman CR. Benchmarking recent computational tools for DNA-binding protein identification. Brief Bioinform 2024; 26:bbae634. [PMID: 39657630 PMCID: PMC11630855 DOI: 10.1093/bib/bbae634] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2024] [Revised: 10/29/2024] [Accepted: 11/20/2024] [Indexed: 12/12/2024] Open
Abstract
Identification of DNA-binding proteins (DBPs) is a crucial task in genome annotation, as it aids in understanding gene regulation, DNA replication, transcriptional control, and various cellular processes. In this paper, we conduct an unbiased benchmarking of 11 state-of-the-art computational tools as well as traditional tools such as ScanProsite, BLAST, and HMMER for identifying DBPs. We highlight the data leakage issue in conventional datasets leading to inflated performance. We introduce new evaluation datasets to support further development. Through a comprehensive evaluation pipeline, we identify potential limitations in models, feature extraction techniques, and training methods, and recommend solutions regarding these issues. We show that combining the predictions of the two best computational tools with BLAST-based prediction significantly enhances DBP identification capability. We provide this consensus method as user-friendly software. The datasets and software are available at https://github.com/Rafeed-bot/DNA_BP_Benchmarking.
Collapse
Affiliation(s)
- Xizi Luo
- School of Computing, National University of Singapore, Singapore 119077, Singapore
| | - Amadeus Song Yi Chi
- School of Computing, National University of Singapore, Singapore 119077, Singapore
| | - Andre Huikai Lin
- School of Computing, National University of Singapore, Singapore 119077, Singapore
| | - Tze Jet Ong
- School of Computing, National University of Singapore, Singapore 119077, Singapore
| | - Limsoon Wong
- School of Computing, National University of Singapore, Singapore 119077, Singapore
| | | |
Collapse
|
37
|
Elsayed A, Rixon S, Levison J, Binns A, Goel P. Machine learning models for prediction of nutrient concentrations in surface water in an agricultural watershed. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2024; 372:123305. [PMID: 39561445 DOI: 10.1016/j.jenvman.2024.123305] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/29/2024] [Revised: 09/19/2024] [Accepted: 11/08/2024] [Indexed: 11/21/2024]
Abstract
Prediction and quantification of nutrient concentrations in surface water has gained substantial attention during recent decades because excess nutrients released from agricultural and urban watersheds can significantly deteriorate surface water quality. Machine learning (ML) models are considered an effective tool for better understanding and characterization of nutrient release from agricultural fields to surface water. However, to date, no systematic investigations have examined the implementation of different classification and regression ML models in agricultural settings to predict nutrient concentrations in surface water using a group of input variables including climatological (e.g., precipitation), hydrological (e.g., stream flow) and field characteristics (i.e., land and crop use). In the current study, multiple classification (e.g., decision trees) and regression (e.g., regression trees) ML models were applied on a dataset pertaining to surface water quality in an agricultural watershed in southern Ontario, Canada (i.e., Upper Parkhill watershed). The target variables of these models were the nutrient concentrations in surface water including nitrate, total phosphorus, soluble reactive phosphorus, and total dissolved phosphorus. These target variables were predicted using physical and chemical water parameters of surface water (e.g., temperature and DO), climatological, hydrological, and field conditions as the input variables. The performance of these different models was assessed using various evaluation metrics such as classification accuracy (CA) and coefficient of determination (R2) for classification and regression models, respectively. In general, both the ensemble bagged trees and logistic regression (CA ≥ 0.72), and exponential Gaussian process regression (R2≥ 0.93) models were the optimal classification and regression ML algorithms, respectively, where they resulted in the highest prediction accuracy of the target variables. The insights and outcomes of the current study demonstrates that ML models can be employed to effectively predict and quantify the nutrient concentrations in surface waters to supplement field-collected monitoring data in agricultural watersheds, assisting in maintaining high quality of the available surface water resources.
Collapse
Affiliation(s)
- Ahmed Elsayed
- School of Engineering, Morwick G360 Groundwater Research Institute, University of Guelph, Guelph, Ontario, Canada; Irrigation and Hydraulics Department, Faculty of Engineering, Cairo University, Giza, Egypt.
| | - Sarah Rixon
- School of Engineering, Morwick G360 Groundwater Research Institute, University of Guelph, Guelph, Ontario, Canada
| | - Jana Levison
- School of Engineering, Morwick G360 Groundwater Research Institute, University of Guelph, Guelph, Ontario, Canada
| | - Andrew Binns
- School of Engineering, Morwick G360 Groundwater Research Institute, University of Guelph, Guelph, Ontario, Canada
| | - Pradeep Goel
- Ministry of the Environment, Conservation and Parks, Etobicoke, Ontario, Canada
| |
Collapse
|
38
|
Hategan AR, David M, Pirnau A, Cozar B, Cinta-Pinzaru S, Guyon F, Magdas DA. Fusing 1H NMR and Raman experimental data for the improvement of wine recognition models. Food Chem 2024; 458:140245. [PMID: 38954957 DOI: 10.1016/j.foodchem.2024.140245] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2024] [Revised: 06/12/2024] [Accepted: 06/25/2024] [Indexed: 07/04/2024]
Abstract
The present study proposes the development of new wine recognition models based on Artificial Intelligence (AI) applied to the mid-level data fusion of 1H NMR and Raman data. In this regard, a supervised machine learning method, namely Support Vector Machines (SVMs), was applied for classifying wine samples with respect to the cultivar, vintage, and geographical origin. Because the association between the two data sources generated an input space with a high dimensionality, a feature selection algorithm was employed to identify the most relevant discriminant markers for each wine classification criterion, before SVM modeling. The proposed data processing strategy allowed the classification of the wine sample set with accuracies up to 100% in both cross-validation and on an independent test set and highlighted the efficiency of 1H NMR and Raman data fusion as opposed to the use of a single-source data for differentiating wine concerning the cultivar and vintage.
Collapse
Affiliation(s)
- Ariana Raluca Hategan
- National Institute for Research and Development of Isotopic and Molecular Technologies, 67-103 Donat Street, 400293 Cluj-Napoca, Romania; Faculty of Physics, Babeș-Bolyai University, Kogălniceanu 1, 400084 Cluj-Napoca, Romania.
| | - Maria David
- National Institute for Research and Development of Isotopic and Molecular Technologies, 67-103 Donat Street, 400293 Cluj-Napoca, Romania; Faculty of Physics, Babeș-Bolyai University, Kogălniceanu 1, 400084 Cluj-Napoca, Romania.
| | - Adrian Pirnau
- National Institute for Research and Development of Isotopic and Molecular Technologies, 67-103 Donat Street, 400293 Cluj-Napoca, Romania.
| | - Bogdan Cozar
- National Institute for Research and Development of Isotopic and Molecular Technologies, 67-103 Donat Street, 400293 Cluj-Napoca, Romania.
| | - Simona Cinta-Pinzaru
- Faculty of Physics, Babeș-Bolyai University, Kogălniceanu 1, 400084 Cluj-Napoca, Romania.
| | - Francois Guyon
- Service Commun des Laboratoires, 146 Traverse Charles Susini, 13388 Marseille, France.
| | - Dana Alina Magdas
- National Institute for Research and Development of Isotopic and Molecular Technologies, 67-103 Donat Street, 400293 Cluj-Napoca, Romania; Faculty of Physics, Babeș-Bolyai University, Kogălniceanu 1, 400084 Cluj-Napoca, Romania.
| |
Collapse
|
39
|
Lee KH, Assassi S, Mohan C, Pedroza C. Addressing statistical challenges in the analysis of proteomics data with extremely small sample size: a simulation study. BMC Genomics 2024; 25:1086. [PMID: 39543503 PMCID: PMC11566501 DOI: 10.1186/s12864-024-11018-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2024] [Accepted: 11/08/2024] [Indexed: 11/17/2024] Open
Abstract
BACKGROUND One of the most promising approaches for early and more precise disease prediction and diagnosis is through the inclusion of proteomics data augmented with clinical data. Clinical proteomics data is often characterized by its high dimensionality and extremely limited sample size, posing a significant challenge when employing machine learning techniques for extracting only the most relevant information. Although there is a wide array of statistical techniques and numerous analysis pipelines employed in proteomics data analysis, it is unclear which of these methods produce the most efficient, reproducible, and clinically meaningful results. RESULTS In this study, we compared 9 unique analysis schemes comprised of different machine learning and dimensionality reduction methods for the analysis of simulated proteomics data consisting of 1317 proteins measured in 26 subjects (i.e., 13 controls and 13 cases). In scenarios where the sample size is extremely small (i.e., n < 30), all schemes resulted in an exceptionally high level of performance metrics, indicating potential overfitting. While performance metrics did not exhibit significant differences across schemes, the set of proteins selected to be discriminatory between groups demonstrated a substantial level of heterogeneity. However, despite heterogeneity in the selected proteins, their biological pathways and genetic diseases exhibited similarities. A sensitivity analysis conducted using varying sample sizes indicated that the stability of a set of selected biomarkers improves with larger sample sizes within a scheme. CONCLUSIONS When the aim of the study is to identify a statistical model that best distinguishes between cohort groups using proteomics data and to uncover the biological pathways and disorders common among the selected proteins, the majority of widely used analysis pipelines perform similarly. However, if the main objective is to pinpoint a set of selected proteins that wield significant influence in discriminating cohort groups and utilize them for subsequent investigations, meticulous consideration is necessary when opting for statistical models, due to the possibility of heterogeneity in the sets of selected proteins.
Collapse
Affiliation(s)
- Kyung Hyun Lee
- Institute for Clinical Research and Learning Health Care, Department of Pediatrics, McGovern Medical School, The University of Texas Health Science Center at Houston, Houston, TX, USA.
| | - Shervin Assassi
- Department of Internal Medicine - Rheumatology, University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Chandra Mohan
- Department of Biomedical Engineering, University of Houston, Houston, TX, USA
| | - Claudia Pedroza
- Institute for Clinical Research and Learning Health Care, Department of Pediatrics, McGovern Medical School, The University of Texas Health Science Center at Houston, Houston, TX, USA
| |
Collapse
|
40
|
Pigoli D, Baker K, Budd J, Butler L, Coppock H, Egglestone S, Gilmour SG, Holmes C, Hurley D, Jersakova R, Kiskin I, Koutra V, Mellor J, Nicholson G, Packham J, Patel S, Payne R, Roberts SJ, Schuller BW, Tendero-Cañadas A, Thornley T, Titcomb A. Assessing the Performance of Machine Learning Methods Trained on Public Health Observational Data: A Case Study From COVID-19. Stat Med 2024; 43:4861-4871. [PMID: 39237100 DOI: 10.1002/sim.10211] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2023] [Revised: 06/24/2024] [Accepted: 08/15/2024] [Indexed: 09/07/2024]
Abstract
From early in the coronavirus disease 2019 (COVID-19) pandemic, there was interest in using machine learning methods to predict COVID-19 infection status based on vocal audio signals, for example, cough recordings. However, early studies had limitations in terms of data collection and of how the performances of the proposed predictive models were assessed. This article describes how these limitations have been overcome in a study carried out by the Turing-RSS Health Data Laboratory and the UK Health Security Agency. As part of the study, the UK Health Security Agency collected a dataset of acoustic recordings, SARS-CoV-2 infection status and extensive study participant meta-data. This allowed us to rigorously assess state-of-the-art machine learning techniques to predict SARS-CoV-2 infection status based on vocal audio signals. The lessons learned from this project should inform future studies on statistical evaluation methods to assess the performance of machine learning techniques for public health tasks.
Collapse
Affiliation(s)
- Davide Pigoli
- Department of Mathematics, King's College London, UK
- The Alan Turing Institute, London, UK
| | - Kieran Baker
- Department of Mathematics, King's College London, UK
- The Alan Turing Institute, London, UK
| | - Jobie Budd
- Division of Medicine, University College London, UK
| | | | - Harry Coppock
- The Alan Turing Institute, London, UK
- Group on Language Audio & Music, Imperial College London, UK
| | | | - Steven G Gilmour
- Department of Mathematics, King's College London, UK
- The Alan Turing Institute, London, UK
| | - Chris Holmes
- The Alan Turing Institute, London, UK
- Department of Statistics, University of Oxford, UK
| | | | | | - Ivan Kiskin
- Centre for Vision, Speech and Signal Processing, University of Surrey, UK
| | - Vasiliki Koutra
- Department of Mathematics, King's College London, UK
- The Alan Turing Institute, London, UK
| | | | - George Nicholson
- The Alan Turing Institute, London, UK
- Department of Statistics, University of Oxford, UK
| | | | - Selina Patel
- Division of Medicine, University College London, UK
- UK Health Security Agency, London, UK
| | | | | | - Björn W Schuller
- The Alan Turing Institute, London, UK
- Group on Language Audio & Music, Imperial College London, UK
| | - Ana Tendero-Cañadas
- UK Health Security Agency, London, UK
- Centre for Lifelong Health, University of Brighton, UK
| | - Tracey Thornley
- Pharmacy Practice and Policy Division, University of Nottingham, UK
| | | |
Collapse
|
41
|
Luo JT, Hung YC, Chen GJ, Lin YS. Predicting Early Treatment Effectiveness in Bell's Palsy Using Machine Learning: A Focus on Corticosteroids and Antivirals. Int J Gen Med 2024; 17:5163-5174. [PMID: 39539927 PMCID: PMC11559179 DOI: 10.2147/ijgm.s488418] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2024] [Accepted: 11/02/2024] [Indexed: 11/16/2024] Open
Abstract
Purpose Facial nerve paralysis, particularly Bell's palsy, manifests as a rapid onset of unilateral facial weakness or paralysis. Despite most patients recovering within three to six months, a significant proportion experience poor recovery. This study utilized six machine learning models to investigate the effectiveness of early treatment in Bell's palsy. Patients and Methods We applied data from 17 hospitals in Scotland to predict treatment outcomes. Patients were randomized into four groups: Prednisolone (corticosteroids), Acyclovir (antivirals), both, and placebo. Outcomes, defined as full resolution of symptoms, were assessed using the House-Brackmann scale at 3 and 9 months post-treatment. We employed six different machine learning models to predict recovery outcomes and evaluated model performance using AUC, precision, recall, and F1-score. Results Among 493 patients, 72.6% recovered after three months and 89.5% after nine months. Logistic regression demonstrated the highest predictive performance for both 3-month (AUC = 0.751) and 9-month recovery (AUC = 0.720). Additionally, several models achieved Precision levels exceeding 0.9. We further employed the best-performing logistic regression for feature ranking, indicating that the patient's age and prednisolone administration are the most significant predictors of recovery. Conclusion The results highlight the potential of machine learning models in predicting the effectiveness of early treatment. This study conducted a comprehensive comparison of six different machine learning models, with the logistic regression showing the highest predictive performance for both 3-month and 9-month recovery. Additionally, feature ranking using logistic regression supported the importance of Prednisolone in treatment. Notably, our findings revealed the significance of age in prognosis evaluation for the first time. This suggests that future research should further develop age-specific prognostic models, enabling clinicians to tailor individualized treatment strategies more effectively. This previously unrecognized discovery provides a foundation for prognostic analysis in Bell's palsy patients.
Collapse
Affiliation(s)
- Jheng-Ting Luo
- In-Service Master Program in Artificial Intelligence in Medicine, College of Medicine, Taipei Medical University, Taipei, Taiwan
| | - Yung-Chun Hung
- In-Service Master Program in Artificial Intelligence in Medicine, College of Medicine, Taipei Medical University, Taipei, Taiwan
- School of Health Care Administration, College of Management, Taipei Medical University, Taipei, Taiwan
| | - Gina Jinna Chen
- Department of Electronic and Electrical Engineering, Southern University of Science and Technology, Shenzhen, People’s Republic of China
| | - Yu-Shiang Lin
- In-Service Master Program in Artificial Intelligence in Medicine, College of Medicine, Taipei Medical University, Taipei, Taiwan
| |
Collapse
|
42
|
Adiprakoso D, Katsimpokis D, Oerlemans S, Ezendam NPM, van Maaren MC, van Til JA, van der Heijden TGW, Mols F, Aben KKH, Vink GR, Koopman M, van de Poll-Franse LV, de Rooij BH. Development of a prediction model for clinically-relevant fatigue: a multi-cancer approach. Qual Life Res 2024:10.1007/s11136-024-03807-9. [PMID: 39516438 DOI: 10.1007/s11136-024-03807-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/03/2024] [Indexed: 11/16/2024]
Abstract
PURPOSE Fatigue is the most prevalent symptom across cancer types. To support clinicians in providing fatigue-related supportive care, this study aims to develop and compare models predicting clinically relevant fatigue (CRF) occurring between two and three years after diagnosis, and to assess the validity of the best-performing model across diverse cancer populations. METHODS Patients with non-metastatic bladder, colorectal, endometrial, ovarian, or prostate cancer who completed a questionnaire within three months after diagnosis and a subsequent questionnaire between two and three years thereafter, were included. Predictor variables included clinical, socio-demographic, and patient-reported variables. The outcome was CRF (EORTC QLQC30 fatigue ≥ 39). Logistic regression using LASSO selection was compared to more advanced Machine Learning (ML) based models, including Extreme gradient boosting (XGBoost), support vector machines (SVM), and artificial neural networks (ANN). Internal-external cross-validation was conducted on the best-performing model. RESULTS 3160 patients were included. The logistic regression model had the highest C-statistic (0.77) and balanced accuracy (0.65), both indicating good discrimination between patients with and without CRF. However, sensitivity was low across all models (0.22-0.37). Following internal-external validation, performance across cancer types was consistent (C-statistics 0.73-0.82). CONCLUSION Although the models' discrimination was good, the low balanced accuracy and poor calibration in the presence of CRF indicates a relatively high likelihood of underdiagnosis of future CRF. Yet, the clinical applicability of the model remains uncertain. The logistic regression performed better than the ML-based models and was robust across cohorts, suggesting an advantage of simpler models to predict CRF.
Collapse
Affiliation(s)
- Dhirendra Adiprakoso
- Department of Health Technology and Services Research (HTSR), Technical Medical Centre, University of Twente, Enschede, The Netherlands
| | - Dimitris Katsimpokis
- Department of Research and Development, Netherlands Comprehensive Cancer Organisation (IKNL), Utrecht, The Netherlands
| | - Simone Oerlemans
- Department of Research and Development, Netherlands Comprehensive Cancer Organisation (IKNL), Utrecht, The Netherlands
| | - Nicole P M Ezendam
- Department of Research and Development, Netherlands Comprehensive Cancer Organisation (IKNL), Utrecht, The Netherlands
- CoRPS-Center of Research On Psychological Disorders and Somatic Diseases, Department of Medical and Clinical Psychology, Tilburg University, Tilburg, The Netherlands
| | - Marissa C van Maaren
- Department of Health Technology and Services Research (HTSR), Technical Medical Centre, University of Twente, Enschede, The Netherlands
- Department of Research and Development, Netherlands Comprehensive Cancer Organisation (IKNL), Utrecht, The Netherlands
| | - Janine A van Til
- Department of Health Technology and Services Research (HTSR), Technical Medical Centre, University of Twente, Enschede, The Netherlands
| | - Thijs G W van der Heijden
- Division of Psychosocial Research & Epidemiology, The Netherlands Cancer Institute, Amsterdam, Netherlands
| | - Floortje Mols
- Department of Research and Development, Netherlands Comprehensive Cancer Organisation (IKNL), Utrecht, The Netherlands
- CoRPS-Center of Research On Psychological Disorders and Somatic Diseases, Department of Medical and Clinical Psychology, Tilburg University, Tilburg, The Netherlands
| | - Katja K H Aben
- Department of Research and Development, Netherlands Comprehensive Cancer Organisation (IKNL), Utrecht, The Netherlands
- Department of IQ Health, Radboud University Medical Centre, Nijmegen, The Netherlands
| | - Geraldine R Vink
- Department of Research and Development, Netherlands Comprehensive Cancer Organisation (IKNL), Utrecht, The Netherlands
- Department of Medical Oncology, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
| | - Miriam Koopman
- Department of Medical Oncology, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
| | - Lonneke V van de Poll-Franse
- Department of Research and Development, Netherlands Comprehensive Cancer Organisation (IKNL), Utrecht, The Netherlands
- CoRPS-Center of Research On Psychological Disorders and Somatic Diseases, Department of Medical and Clinical Psychology, Tilburg University, Tilburg, The Netherlands
- Division of Psychosocial Research & Epidemiology, The Netherlands Cancer Institute, Amsterdam, Netherlands
| | - Belle H de Rooij
- Department of Research and Development, Netherlands Comprehensive Cancer Organisation (IKNL), Utrecht, The Netherlands.
- CoRPS-Center of Research On Psychological Disorders and Somatic Diseases, Department of Medical and Clinical Psychology, Tilburg University, Tilburg, The Netherlands.
| |
Collapse
|
43
|
Tangorra FM, Lopez A, Ighina E, Bellagamba F, Moretti VM. Handheld NIR Spectroscopy Combined with a Hybrid LDA-SVM Model for Fast Classification of Retail Milk. Foods 2024; 13:3577. [PMID: 39593993 PMCID: PMC11594020 DOI: 10.3390/foods13223577] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2024] [Revised: 11/06/2024] [Accepted: 11/08/2024] [Indexed: 11/28/2024] Open
Abstract
The EU market offers different types of milk, distinguished by origin, production method, processing technology, fat content, and other characteristics, which are often detailed on product labels. In this context, ensuring the authenticity of milk is crucial for maintaining standards and preventing fraud. Various food authenticity techniques have been employed to achieve this. Among them, near-infrared (NIR) spectroscopy is valued for its non-destructive and rapid analysis capabilities. This study evaluates the effectiveness of a miniaturized NIR device combined with support vector machine (SVM) algorithms and LDA feature selection to discriminate between four commercial milk types: high-quality fresh milk, milk labeled as mountain product, extended shelf-life milk, and TSG hay milk. The results indicate that NIR spectroscopy can effectively classify milk based on the type of milk, relying on different production systems and heat treatments (pasteurization). This capability was greater in distinguishing high-quality mountain and hay milk from the other types, while resulting in less successful class assignment for extended shelf-life milk. This study demonstrated the potential of portable NIR spectroscopy for real-time and cost-effective milk authentication at the retail level.
Collapse
Affiliation(s)
| | - Annalaura Lopez
- Department of Veterinary Medicine and Animal Sciences (DIVAS), Università degli Studi di Milano, Via dell’Università 6, 26900 Lodi, Italy; (F.M.T.); (E.I.); (F.B.); (V.M.M.)
| | | | | | | |
Collapse
|
44
|
Han K, Zuo R, Xu D, Zhao X, Shi J, Xue Z, Xu Y, Wu Z, Wang J. Quantitative expression of LNAPL pollutant concentrations in capillary zone by coupling multiple environmental factors based on random forest algorithm. JOURNAL OF HAZARDOUS MATERIALS 2024; 479:135695. [PMID: 39217922 DOI: 10.1016/j.jhazmat.2024.135695] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/07/2024] [Revised: 08/19/2024] [Accepted: 08/27/2024] [Indexed: 09/04/2024]
Abstract
The capillary zone plays a crucial role in migration and transformation of pollutants. Light nonaqueous liquids (LNAPLs) have become the main organic pollutant in soil and groundwater environments. However, few studies have focused on the concentration distribution characteristics and quantitative expression of LNAPL pollutants within capillary zone. In this study, we conducted a sandbox-migration experiment using diesel oil as a typical LNAPL pollutant, with the capillary zone of silty sand as the research object. The variation characteristics of LNAPL pollutants (total petroleum hydrocarbon) concentration and environmental factors (moisture content, electrical conductivity, pH, and oxidationreduction potential) were essentially consistent at different locations with the same height. These characteristics differed within range of 10.0-50.0 cm and above 60.0 cm from groundwater. A model for quantitative expression of concentrations was constructed by coupling multiple environmental factors of 968 sets-7744 data via random forest algorithm. The goodness of fit (R2) for both training and test sets was greater than 0.90, and the mean absolute percentage error (MAPE) was less than 16.00 %. The absolute values of relative errors in predicting concentrations at characteristic points were less than 15.00 %. The constructed model can accurately and quantitatively express and predict concentrations in capillary zone.
Collapse
Affiliation(s)
- Kexue Han
- College of Water Sciences, Beijing Normal University, Beijing 100875, China; Engineering Research Center of Groundwater Pollution Control and Remediation, Ministry of Education, Beijing 100875, China
| | - Rui Zuo
- College of Water Sciences, Beijing Normal University, Beijing 100875, China; Engineering Research Center of Groundwater Pollution Control and Remediation, Ministry of Education, Beijing 100875, China.
| | - Donghui Xu
- China Institute of Geological Environment Monitoring, China Geological Survey, Beijing 100081, China
| | - Xiao Zhao
- College of Water Sciences, Beijing Normal University, Beijing 100875, China; Engineering Research Center of Groundwater Pollution Control and Remediation, Ministry of Education, Beijing 100875, China
| | - Jian Shi
- College of Water Sciences, Beijing Normal University, Beijing 100875, China; Engineering Research Center of Groundwater Pollution Control and Remediation, Ministry of Education, Beijing 100875, China
| | - Zhenkun Xue
- College of Water Sciences, Beijing Normal University, Beijing 100875, China; Engineering Research Center of Groundwater Pollution Control and Remediation, Ministry of Education, Beijing 100875, China
| | - Yunxiang Xu
- College of Water Sciences, Beijing Normal University, Beijing 100875, China; Engineering Research Center of Groundwater Pollution Control and Remediation, Ministry of Education, Beijing 100875, China
| | - Ziyi Wu
- College of Water Sciences, Beijing Normal University, Beijing 100875, China; Engineering Research Center of Groundwater Pollution Control and Remediation, Ministry of Education, Beijing 100875, China
| | - Jinsheng Wang
- College of Water Sciences, Beijing Normal University, Beijing 100875, China; Engineering Research Center of Groundwater Pollution Control and Remediation, Ministry of Education, Beijing 100875, China
| |
Collapse
|
45
|
Salih AM, Galazzo IB, Raisi-Estabragh Z, Petersen SE, Menegaz G, Radeva P. Characterizing the Contribution of Dependent Features in XAI Methods. IEEE J Biomed Health Inform 2024; 28:6466-6473. [PMID: 38696291 DOI: 10.1109/jbhi.2024.3395289] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/04/2024]
Abstract
Explainable Artificial Intelligence (XAI) provides tools to help understanding how AI models work and reach a particular decision or outcome. It helps to increase the interpretability of models and makes them more trustworthy and transparent. In this context, many XAI methods have been proposed to make black-box and complex models more digestible from a human perspective. However, one of the main issues that XAI methods have to face especially when dealing with a high number of features is the presence of multicollinearity, which casts shadows on the robustness of the XAI outcomes, such as the ranking of informative features. Most of the current XAI methods either do not consider the collinearity or assume the features are independent which, in general, is not necessarily true. Here, we propose a simple, yet useful, proxy that modifies the outcome of any XAI feature ranking method allowing to account for the dependency among the features, and to reveal their impact on the outcome. The proposed method was applied to SHAP, as an example of XAI method which assume that the features are independent. For this purpose, several models were exploited for a well-known classification task (males versus females) using nine cardiac phenotypes extracted from cardiac magnetic resonance imaging as features. Principal component analysis and biological plausibility were employed to validate the proposed method. Our results showed that the proposed proxy could lead to a more robust list of informative features compared to the original SHAP in presence of collinearity.
Collapse
|
46
|
Kapalaga G, Kivunike FN, Kerfua S, Jjingo D, Biryomumaisho S, Rutaisire J, Ssajjakambwe P, Mugerwa S, Abbey S, Aaron MH, Kiwala Y. Enhancing random forest predictive performance for foot and mouth disease outbreaks in Uganda: a calibrated uncertainty prediction approach for varying distributions. Front Artif Intell 2024; 7:1455331. [PMID: 39554990 PMCID: PMC11564173 DOI: 10.3389/frai.2024.1455331] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2024] [Accepted: 08/30/2024] [Indexed: 11/19/2024] Open
Abstract
Foot-and-mouth disease poses a significant threat to both domestic and wild cloven-hoofed animals, leading to severe economic losses and jeopardizing food security. While machine learning models have become essential for predicting foot-and-mouth disease outbreaks, their effectiveness is often compromised by distribution shifts between training and target datasets, especially in non-stationary environments. Despite the critical impact of these shifts, their implications in foot-and-mouth disease outbreak prediction have been largely overlooked. This study introduces the Calibrated Uncertainty Prediction approach, designed to enhance the performance of Random Forest models in predicting foot-and-mouth disease outbreaks across varying distributions. The Calibrated Uncertainty Prediction approach effectively addresses distribution shifts by calibrating uncertain instances for pseudo-label annotation, allowing the active learner to generalize more effectively to the target domain. By utilizing a probabilistic calibration model, Calibrated Uncertainty Prediction pseudo-annotates the most informative instances, refining the active learner iteratively and minimizing the need for human annotation and outperforming existing methods known to mitigate distribution shifts. This reduces costs, saves time, and lessens the dependence on domain experts while achieving outstanding predictive performance. The results demonstrate that Calibrated Uncertainty Prediction significantly enhances predictive performance in non-stationary environments, achieving an accuracy of 98.5%, Area Under the Curve of 0.842, recall of 0.743, precision of 0.855, and an F1 score of 0.791. These findings underscore Calibrated Uncertainty Prediction's ability to overcome the vulnerabilities of existing ML models, offering a robust solution for foot-and-mouth disease outbreak prediction and contributing to the broader field of predictive modeling in infectious disease management.
Collapse
Affiliation(s)
- Geofrey Kapalaga
- Department of Information Technology, College of Computing and Information Sciences, Makerere University, Kampala, Uganda
| | - Florence N Kivunike
- Department of Information Technology, College of Computing and Information Sciences, Makerere University, Kampala, Uganda
| | - Susan Kerfua
- Department of Vaccinology, National Livestock Resources Research Institute, Kampala, Uganda
| | - Daudi Jjingo
- African Center of Excellence in Bioinformatics (ACE-B), Makerere University, Kampala, Uganda
- Department of Computer Science, College of Computing and Information sciences, Makerere University, Kampala, Uganda
| | - Savino Biryomumaisho
- College of Veterinary Medicine, Animal Resources and Bio-security, Makerere University, Kampala, Uganda
| | - Justus Rutaisire
- Department of Vaccinology, National Livestock Resources Research Institute, Kampala, Uganda
| | - Paul Ssajjakambwe
- Department of Vaccinology, National Livestock Resources Research Institute, Kampala, Uganda
| | - Swidiq Mugerwa
- Department of Vaccinology, National Livestock Resources Research Institute, Kampala, Uganda
| | - Seguya Abbey
- Department of Vaccinology, National Livestock Resources Research Institute, Kampala, Uganda
| | - Mulindwa H Aaron
- Department of Vaccinology, National Livestock Resources Research Institute, Kampala, Uganda
| | - Yusuf Kiwala
- College of Business and Management Science, Makerere University, Kampala, Uganda
| |
Collapse
|
47
|
Martínez-López Y, Phoobane P, Jauriga Y, Castillo-Garit JA, Rodríguez-Gonzalez AY, Martínez-Santiago O, Barigye SJ, Madera J, Rodríguez-Maya NE, Duchowicz P. Exploring blood-brain barrier passage using atomic weighted vector and machine learning. J Mol Model 2024; 30:393. [PMID: 39485560 DOI: 10.1007/s00894-024-06188-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2024] [Accepted: 10/21/2024] [Indexed: 11/03/2024]
Abstract
CONTEXT This study investigates the potential of leveraging molecular properties, as determined by MD-LOVIs software and machine learning techniques, to predict the ability of compounds to cross the blood-brain barrier (BBB). Accurate prediction of BBB permeation is critical for the development of central nervous system (CNS) drugs. The study applies various machine learning models, including both classification and regression techniques, to predict BBB passage and molecular activity. Notably, classification models such as GBM-AWV (accuracy = 0.801), GLM-CN (accuracy = 0.808), SVMPoly-means (accuracy = 0.980), SVMPoly-AC (accuracy = 0.980), SVMPoly-MI_TI_SI (accuracy = 0.900), SVMPoly-GI (accuracy = 0.900), RF-means (accuracy = 0.870), and GLM-means (accuracy = 0.818) demonstrate high accuracy in predicting BBB passage. In contrast, regression models like ES-RLM-AG (R2 = 0.902), IB-IBK (R2 = 0.82), IB-Kstar (R2 = 0.834), IB-MLP (R2 = 0.843), and DRF-AWV (R2 = 0.810) exhibit strong performance in predicting molecular activity. The results show that classification models like GBM-AWV, GLM-CN, and SVMPoly variants, as well as regression models like ES-RLM-AG and IB-MLP, achieve high performance, demonstrating the effectiveness of machine learning in predicting BBB permeability. METHODS The computational methods employed in this study include the MD-LOVIs software for generating molecular descriptors and several machine learning algorithms, including gradient boosting machines (GBM), generalized linear models (GLM), support vector machines (SVM) with polynomial kernels, random forests (RF), ensemble regression models, and instance-based learning algorithms. These models were trained and validated using various datasets to predict BBB passage and molecular activity, with the performance metrics reported for each model. Standard computational techniques were employed throughout, ensuring the reliability of the predictions.
Collapse
Affiliation(s)
- Yoan Martínez-López
- Department of Computer Sciences, Faculty of Informatics, Camagüey University, 74650, Camagüey City, Cuba.
| | - Paulina Phoobane
- Walter Sisulu University, Mthatha, Eastern Cape, Republic of South Africa
| | - Yanaima Jauriga
- Department of Computer Sciences, Faculty of Informatics, Camagüey University, 74650, Camagüey City, Cuba
| | - Juan A Castillo-Garit
- Instituto Universitario de Investigación y Desarrollo Tecnológico (IDT), Universidad Tecnológica Metropolitana, Ignacio Valdivieso 2409, San Joaquín, Santiago de Chile, Chile
| | - Ansel Y Rodríguez-Gonzalez
- Unidad de Transferencia Tecnológica de Tepic, Centro de Investigación Científica y de Educación Superior de Ensenada, Ensenada, Baja California, Mexico
| | - Oscar Martínez-Santiago
- Alfa Vitamins Laboratories, Miami, FL, 33166, USA
- Laboratorio de Bioinformática y Química Computacional, Universidad Católica del Maule, Talca, Chile
| | - Stephen J Barigye
- Departamento de Química Física Aplicada, Facultad de Ciencias, Universidad Autónoma de Madrid (UAM), 28049, Madrid, Spain
| | - Julio Madera
- Department of Computer Sciences, Faculty of Informatics, Camagüey University, 74650, Camagüey City, Cuba
| | - Noel Enrique Rodríguez-Maya
- División de Estudios de Posgrado E Investigación, Instituto Tecnológico de Zitácuaro, Zitácuaro, Michoacán, Mexico
| | - Pablo Duchowicz
- Instituto de Investigaciones Fisicoquímicas Teóricas y Aplicadas (INIFTA), La Plata, Argentina
| |
Collapse
|
48
|
Oztekin PS, Katar O, Omma T, Erel S, Tokur O, Avci D, Aydogan M, Yildirim O, Avci E, Acharya UR. Comparison of Explainable Artificial Intelligence Model and Radiologist Review Performances to Detect Breast Cancer in 752 Patients. JOURNAL OF ULTRASOUND IN MEDICINE : OFFICIAL JOURNAL OF THE AMERICAN INSTITUTE OF ULTRASOUND IN MEDICINE 2024; 43:2051-2068. [PMID: 39051752 DOI: 10.1002/jum.16535] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/17/2024] [Revised: 07/11/2024] [Accepted: 07/13/2024] [Indexed: 07/27/2024]
Abstract
OBJECTIVES Breast cancer is a type of cancer caused by the uncontrolled growth of cells in the breast tissue. In a few cases, erroneous diagnosis of breast cancer by specialists and unnecessary biopsies can lead to various negative consequences. In some cases, radiologic examinations or clinical findings may raise the suspicion of breast cancer, but subsequent detailed evaluations may not confirm cancer. In addition to causing unnecessary anxiety and stress to patients, such diagnosis can also lead to unnecessary biopsy procedures, which are painful, expensive, and prone to misdiagnosis. Therefore, there is a need for the development of more accurate and reliable methods for breast cancer diagnosis. METHODS In this study, we proposed an artificial intelligence (AI)-based method for automatically classifying breast solid mass lesions as benign vs malignant. In this study, a new breast cancer dataset (Breast-XD) was created with 791 solid mass lesions belonging to 752 different patients aged 18 to 85 years, which were examined by experienced radiologists between 2017 and 2022. RESULTS Six classifiers, support vector machine (SVM), K-nearest neighbor (K-NN), random forest (RF), decision tree (DT), logistic regression (LR), and XGBoost, were trained on the training samples of the Breast-XD dataset. Then, each classifier made predictions on 159 test data that it had not seen before. The highest classification result was obtained using the explainable XGBoost model (X2GAI) with an accuracy of 94.34%. An explainable structure is also implemented to build the reliability of the developed model. CONCLUSIONS The results obtained by radiologists and the X2GAI model were compared according to the diagnosis obtained from the biopsy. It was observed that our developed model performed well in cases where experienced radiologists gave false positive results.
Collapse
Affiliation(s)
- Pelin Seher Oztekin
- Department of Radiology, University of Health Sciences, Ankara Training and Research Hospital, Ankara, Turkey
| | - Oguzhan Katar
- Department of Software Engineering, Firat University, Elazig, Turkey
| | - Tulay Omma
- Department of Endocrinology and Metabolism, University of Health Sciences, Ankara Training and Research Hospital, Ankara, Turkey
| | - Serap Erel
- Department of Surgery, University of Health Sciences, Ankara Training and Research Hospital, Ankara, Turkey
| | - Oguzhan Tokur
- Department of Radiology, University of Health Sciences, Ankara Training and Research Hospital, Ankara, Turkey
| | - Derya Avci
- Department of Computer Technology, Firat University, Elazig, Turkey
| | - Murat Aydogan
- Department of Software Engineering, Firat University, Elazig, Turkey
| | - Ozal Yildirim
- Department of Software Engineering, Firat University, Elazig, Turkey
| | - Engin Avci
- Department of Software Engineering, Firat University, Elazig, Turkey
| | - U Rajendra Acharya
- School of Mathematics, Physics, and Computing, University of Southern Queensland, Springfield, Queensland, Australia
- Centre for Health Research, University of Southern Queensland, Springfield, Queensland, Australia
| |
Collapse
|
49
|
Huang Q, Li G. Knowledge graph based reasoning in medical image analysis: A scoping review. Comput Biol Med 2024; 182:109100. [PMID: 39244959 DOI: 10.1016/j.compbiomed.2024.109100] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2024] [Revised: 08/04/2024] [Accepted: 08/31/2024] [Indexed: 09/10/2024]
Abstract
Automated computer-aided diagnosis (CAD) is becoming more significant in the field of medicine due to advancements in computer hardware performance and the progress of artificial intelligence. The knowledge graph is a structure for visually representing knowledge facts. In the last decade, a large body of work based on knowledge graphs has effectively improved the organization and interpretability of large-scale complex knowledge. Introducing knowledge graph inference into CAD is a research direction with significant potential. In this review, we briefly review the basic principles and application methods of knowledge graphs firstly. Then, we systematically organize and analyze the research and application of knowledge graphs in medical imaging-assisted diagnosis. We also summarize the shortcomings of the current research, such as medical data barriers and deficiencies, low utilization of multimodal information, and weak interpretability. Finally, we propose future research directions with possibilities and potentials to address the shortcomings of current approaches.
Collapse
Affiliation(s)
- Qinghua Huang
- School of Artificial Intelligence, OPtics and ElectroNics (iOPEN), Northwestern Polytechnical University, 127 West Youyi Road, Beilin District, Xi'an, 710072, Shaanxi, China.
| | - Guanghui Li
- School of Artificial Intelligence, OPtics and ElectroNics (iOPEN), Northwestern Polytechnical University, 127 West Youyi Road, Beilin District, Xi'an, 710072, Shaanxi, China; School of Computer Science, Northwestern Polytechnical University, 1 Dongxiang Road, Chang'an District, Xi'an, 710129, Shaanxi, China.
| |
Collapse
|
50
|
Procopio A, Rania M, Zaffino P, Cortese N, Giofrè F, Arturi F, Segura-Garcia C, Cosentino C. Physiological model-based machine learning for classifying patients with binge-eating disorder (BED) from the Oral Glucose Tolerance Test (OGTT) curve. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2024; 258:108477. [PMID: 39509761 DOI: 10.1016/j.cmpb.2024.108477] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/20/2024] [Revised: 10/04/2024] [Accepted: 10/23/2024] [Indexed: 11/15/2024]
Abstract
BACKGROUND AND OBJECTIVE Binge eating disorder (BED) is the most frequent eating disorder, often confused with obesity, with which it shares several characteristics. Early identification could enable targeted therapeutic interventions. In this study, we propose a hybrid pipeline that, starting from plasma glucose data acquired during the Oral Glucose Tolerance Test (OGTT), allows us to classify the two types of patients through computational modeling and artificial intelligence. METHODS The proposed hybrid pipeline integrates a classical mechanistic model of delayed differential equations (DDE) that describes glucose-insulin dynamics with machine learning (ML) methods. Ad hoc techniques, including structural identifiability analysis, have been employed for refining and evaluating the mathematical model. Additionally, a dedicated pipeline for identifying and optimizing model parameters has been applied to obtain reliable estimates. Robust feature extraction and classifier selection processes were developed to ensure the optimal choice of the best-performing classifier. RESULTS By leveraging parameters estimated from the mechanistic model alongside easily obtainable patient information (such as glucose levels at 30 and 120 min post-OGTT, glycated hemoglobin (Hb1Ac), body mass index (BMI), and waist circumference), our approach facilitates accurate classification of patients, enabling tailored therapeutic interventions. CONCLUSION Initial findings, focusing on correctly categorizing patients with BED based on metabolic data, demonstrate promising outcomes. These results suggest significant potential for refinement, including exploration of alternative mechanistic models and machine learning algorithms, to enhance classification accuracy and therapeutic strategies.
Collapse
Affiliation(s)
- Anna Procopio
- Department of Experimental and Clinical Medicine, Università degli Studi Magna Græcia, Catanzaro, Italy
| | - Marianna Rania
- Outpatient Unit for Clinical Research and Treatment of Eating Disorders, University Hospital Mater Domini, Catanzaro, Italy
| | - Paolo Zaffino
- Department of Experimental and Clinical Medicine, Università degli Studi Magna Græcia, Catanzaro, Italy
| | - Nicola Cortese
- Department of Experimental and Clinical Medicine, Università degli Studi Magna Græcia, Catanzaro, Italy
| | - Federica Giofrè
- Internal Medicine Unit, Department of Medical and Surgical Sciences, Università degli Studi Magna Græcia, Catanzaro, Italy
| | - Franco Arturi
- Internal Medicine Unit, Department of Medical and Surgical Sciences, Università degli Studi Magna Græcia, Catanzaro, Italy
| | - Cristina Segura-Garcia
- Outpatient Unit for Clinical Research and Treatment of Eating Disorders, University Hospital Mater Domini, Catanzaro, Italy; Department of Medical and Surgical Sciences, Università degli Studi Magna Græcia, Catanzaro, Italy
| | - Carlo Cosentino
- Department of Experimental and Clinical Medicine, Università degli Studi Magna Græcia, Catanzaro, Italy.
| |
Collapse
|