Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Remeseiro B, Bolon-Canedo V. A review of feature selection methods in medical applications. Comput Biol Med 2019;112:103375. [PMID: 31382212 DOI: 10.1016/j.compbiomed.2019.103375] [Citation(s) in RCA: 194] [Impact Index Per Article: 38.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2019] [Revised: 07/29/2019] [Accepted: 07/29/2019] [Indexed: 11/22/2022]

For:	Remeseiro B, Bolon-Canedo V. A review of feature selection methods in medical applications. Comput Biol Med 2019;112:103375. [PMID: 31382212 DOI: 10.1016/j.compbiomed.2019.103375] [Citation(s) in RCA: 194] [Impact Index Per Article: 38.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2019] [Revised: 07/29/2019] [Accepted: 07/29/2019] [Indexed: 11/22/2022]

Number

Cited by Other Article(s)

Zhou T, Guan Y, Lin X, Zhou X, Mao L, Ma Y, Fan B, Li J, Tu W, Liu S, Fan L. A clinical-radiomics nomogram based on automated segmentation of chest CT to discriminate PRISm and COPD patients. Eur J Radiol Open 2024;13:100580. [PMID: 38989052 PMCID: PMC11233899 DOI: 10.1016/j.ejro.2024.100580] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2024] [Revised: 05/31/2024] [Accepted: 06/11/2024] [Indexed: 07/12/2024] Open

Abstract

Purpose

It is vital to develop noninvasive approaches with high accuracy to discriminate the preserved ratio impaired spirometry (PRISm) group from the chronic obstructive pulmonary disease (COPD) groups. Radiomics has emerged as an image analysis technique. This study aims to develop and confirm the new radiomics-based noninvasive approach to discriminate these two groups.

Methods

Totally 1066 subjects from 4 centers were included in this retrospective research, and classified into training, internal validation or external validation sets. The chest computed tomography (CT) images were segmented by the fully automated deep learning segmentation algorithm (Unet231) for radiomics feature extraction. We established the radiomics signature (Rad-score) using the least absolute shrinkage and selection operator algorithm, then conducted ten-fold cross-validation using the training set. Last, we constructed a radiomics signature by incorporating independent risk factors using the multivariate logistic regression model. Model performance was evaluated by receiver operating characteristic (ROC) curve, calibration curve, and decision curve analyses (DCA).

Results

The Rad-score, including 15 radiomic features in whole-lung region, which was suitable for diffuse lung diseases, was demonstrated to be effective for discriminating between PRISm and COPD. Its diagnostic accuracy was improved through integrating Rad-score with a clinical model, and the area under the ROC (AUC) were 0.82(95 %CI 0.79-0.86), 0.77(95 %CI 0.72-0.83) and 0.841(95 %CI 0.78-0.91) for training, internal validation and external validation sets, respectively. As revealed by analysis, radiomics nomogram showed good fit and superior clinical utility.

Conclusions

The present work constructed the new radiomics-based nomogram and verified its reliability for discriminating between PRISm and COPD.

Collapse

Affiliation(s)

TaoHu Zhou Department of Radiology, Second Affiliated Hospital of Naval Medical University, No. 415 Fengyang Road, Shanghai 200003, China School of Medical Imaging, Shandong Second Medical University, Weifang, Shandong 261053, China
Yu Guan Department of Radiology, Second Affiliated Hospital of Naval Medical University, No. 415 Fengyang Road, Shanghai 200003, China
XiaoQing Lin Department of Radiology, Second Affiliated Hospital of Naval Medical University, No. 415 Fengyang Road, Shanghai 200003, China College of Health Sciences and Engineering, University of Shanghai for Science and Technology, No.516 Jungong Road, Shanghai 200093, China
XiuXiu Zhou Department of Radiology, Second Affiliated Hospital of Naval Medical University, No. 415 Fengyang Road, Shanghai 200003, China
Liang Mao Department of Medical Imaging, Affiliated Hospital of Ji Ning Medical University, Ji Ning 272000, China
YanQing Ma Department of Radiology, Zhejiang Provincial People's Hospital, Affiliated People's Hospital of Hangzhou Medical College, Hangzhou, ZJ, China
Bing Fan Department of Radiology, Jiangxi Provincial People's Hospital, The First Affiliated Hospital of Nanchang Medical College, Nanchang, China
Jie Li Department of Radiology, Second Affiliated Hospital of Naval Medical University, No. 415 Fengyang Road, Shanghai 200003, China College of Health Sciences and Engineering, University of Shanghai for Science and Technology, No.516 Jungong Road, Shanghai 200093, China
WenTing Tu Department of Radiology, Second Affiliated Hospital of Naval Medical University, No. 415 Fengyang Road, Shanghai 200003, China
ShiYuan Liu Department of Radiology, Second Affiliated Hospital of Naval Medical University, No. 415 Fengyang Road, Shanghai 200003, China
Li Fan Department of Radiology, Second Affiliated Hospital of Naval Medical University, No. 415 Fengyang Road, Shanghai 200003, China

Collapse

Kanchanapiboon P, Tunksook P, Tunksook P, Ritthipravat P, Boonpratham S, Satravaha Y, Chaweewannakorn C, Peanchitlertkajorn S. Classification of cervical vertebral maturation stages with machine learning models: leveraging datasets with high inter- and intra-observer agreement. Prog Orthod 2024;25:35. [PMID: 39279025 PMCID: PMC11402886 DOI: 10.1186/s40510-024-00535-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2024] [Accepted: 07/22/2024] [Indexed: 09/18/2024] Open

Meng Q, Chen B, Xu Y, Zhang Q, Ding R, Ma Z, Jin Z, Gao S, Qu F. A machine learning model for early candidemia prediction in the intensive care unit: Clinical application. PLoS One 2024;19:e0309748. [PMID: 39250466 PMCID: PMC11383240 DOI: 10.1371/journal.pone.0309748] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2024] [Accepted: 08/17/2024] [Indexed: 09/11/2024] Open

Attallah O. Skin cancer classification leveraging multi-directional compact convolutional neural network ensembles and gabor wavelets. Sci Rep 2024;14:20637. [PMID: 39232043 PMCID: PMC11375051 DOI: 10.1038/s41598-024-69954-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2024] [Accepted: 08/12/2024] [Indexed: 09/06/2024] Open

Abstract

Skin cancer (SC) is an important medical condition that necessitates prompt identification to ensure timely treatment. Although visual evaluation by dermatologists is considered the most reliable method, its efficacy is subjective and laborious. Deep learning-based computer-aided diagnostic (CAD) platforms have become valuable tools for supporting dermatologists. Nevertheless, current CAD tools frequently depend on Convolutional Neural Networks (CNNs) with huge amounts of deep layers and hyperparameters, single CNN model methodologies, large feature space, and exclusively utilise spatial image information, which restricts their effectiveness. This study presents SCaLiNG, an innovative CAD tool specifically developed to address and surpass these constraints. SCaLiNG leverages a collection of three compact CNNs and Gabor Wavelets (GW) to acquire a comprehensive feature vector consisting of spatial-textural-frequency attributes. SCaLiNG gathers a wide range of image details by breaking down these photos into multiple directional sub-bands using GW, and then learning several CNNs using those sub-bands and the original picture. SCaLiNG also combines attributes taken from various CNNs trained with the actual images and subbands derived from GW. This fusion process correspondingly improves diagnostic accuracy due to the thorough representation of attributes. Furthermore, SCaLiNG applies a feature selection approach which further enhances the model's performance by choosing the most distinguishing features. Experimental findings indicate that SCaLiNG maintains a classification accuracy of 0.9170 in categorising SC subcategories, surpassing conventional single-CNN models. The outstanding performance of SCaLiNG underlines its ability to aid dermatologists in swiftly and precisely recognising and classifying SC, thereby enhancing patient outcomes.

Collapse

Zhou T, Guan Y, Lin X, Zhou X, Mao L, Ma Y, Fan B, Li J, Liu S, Fan L. CT-based whole lung radiomics nomogram for identification of PRISm from non-COPD subjects. Respir Res 2024;25:329. [PMID: 39227894 PMCID: PMC11373438 DOI: 10.1186/s12931-024-02964-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2024] [Accepted: 08/28/2024] [Indexed: 09/05/2024] Open

Abstract

BACKGROUND

Preserved Ratio Impaired Spirometry (PRISm) is considered to be a precursor of chronic obstructive pulmonary disease. Radiomics nomogram can effectively identify the PRISm subjects from non-COPD subjects, especially when during large-scale CT lung cancer screening.

METHODS

Totally 1481 participants (864, 370 and 247 in training, internal validation, and external validation cohorts, respectively) were included. Whole lung on thin-section computed tomography (CT) was segmented with a fully automated segmentation algorithm. PyRadiomics was adopted for extracting radiomics features. Clinical features were also obtained. Moreover, Spearman correlation analysis, minimum redundancy maximum relevance (mRMR) feature ranking and least absolute shrinkage and selection operator (LASSO) classifier were adopted to analyze whether radiomics features could be used to build radiomics signatures. A nomogram that incorporated clinical features and radiomics signature was constructed through multivariable logistic regression. Last, calibration, discrimination and clinical usefulness were analyzed using validation cohorts.

RESULTS

The radiomics signature, which included 14 stable features, was related to PRISm of training and validation cohorts (p < 0.001). The radiomics nomogram incorporating independent predicting factors (radiomics signature, age, BMI, and gender) well discriminated PRISm from non-COPD subjects compared with clinical model or radiomics signature alone for training cohort (AUC 0.787 vs. 0.675 vs. 0.778), internal (AUC 0.773 vs. 0.682 vs. 0.767) and external validation cohorts (AUC 0.702 vs. 0.610 vs. 0.699). Decision curve analysis suggested that our constructed radiomics nomogram outperformed clinical model.

CONCLUSIONS

The CT-based whole lung radiomics nomogram could identify PRISm to help decision-making in clinic.

Collapse

Elkahwagy DMAS, Kiriacos CJ, Mansour M. Logistic regression and other statistical tools in diagnostic biomarker studies. Clin Transl Oncol 2024;26:2172-2180. [PMID: 38530558 PMCID: PMC11333519 DOI: 10.1007/s12094-024-03413-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2023] [Accepted: 02/16/2024] [Indexed: 03/28/2024]

Geng Y, Li Y, Deng C. An Improved Binary Walrus Optimizer with Golden Sine Disturbance and Population Regeneration Mechanism to Solve Feature Selection Problems. Biomimetics (Basel) 2024;9:501. [PMID: 39194480 DOI: 10.3390/biomimetics9080501] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2024] [Revised: 08/13/2024] [Accepted: 08/14/2024] [Indexed: 08/29/2024] Open

Abstract

Feature selection (FS) is a significant dimensionality reduction technique in machine learning and data mining that is adept at managing high-dimensional data efficiently and enhancing model performance. Metaheuristic algorithms have become one of the most promising solutions in FS owing to their powerful search capabilities as well as their performance. In this paper, the novel improved binary walrus optimizer (WO) algorithm utilizing the golden sine strategy, elite opposition-based learning (EOBL), and population regeneration mechanism (BGEPWO) is proposed for FS. First, the population is initialized using an iterative chaotic map with infinite collapses (ICMIC) chaotic map to improve the diversity. Second, a safe signal is obtained by introducing an adaptive operator to enhance the stability of the WO and optimize the trade-off between exploration and exploitation of the algorithm. Third, BGEPWO innovatively designs a population regeneration mechanism to continuously eliminate hopeless individuals and generate new promising ones, which keeps the population moving toward the optimal solution and accelerates the convergence process. Fourth, EOBL is used to guide the escape behavior of the walrus to expand the search range. Finally, the golden sine strategy is utilized for perturbing the population in the late iteration to improve the algorithm's capacity to evade local optima. The BGEPWO algorithm underwent evaluation on 21 datasets of different sizes and was compared with the BWO algorithm and 10 other representative optimization algorithms. The experimental results demonstrate that BGEPWO outperforms these competing algorithms in terms of fitness value, number of selected features, and F1-score in most datasets. The proposed algorithm achieves higher accuracy, better feature reduction ability, and stronger convergence by increasing population diversity, continuously balancing exploration and exploitation processes and effectively escaping local optimal traps.

Collapse

Liu F. Data Science Methods for Real-World Evidence Generation in Real-World Data. Annu Rev Biomed Data Sci 2024;7:201-224. [PMID: 38748863 DOI: 10.1146/annurev-biodatasci-102423-113220] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/25/2024]

Lu Y, Duong T, Miao Z, Thieu T, Lamichhane J, Ahmed A, Delen D. A novel hyperparameter search approach for accuracy and simplicity in disease prediction risk scoring. J Am Med Inform Assoc 2024;31:1763-1773. [PMID: 38899502 PMCID: PMC11258418 DOI: 10.1093/jamia/ocae140] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2023] [Revised: 05/07/2024] [Accepted: 05/28/2024] [Indexed: 06/21/2024] Open

Liao J, Misaki K, Sakamoto J. Impact Exploration of Spatiotemporal Feature Derivation and Selection on Machine Learning-Based Predictive Models for Post-Embolization Cerebral Aneurysm Recanalization. Cardiovasc Eng Technol 2024;15:394-404. [PMID: 38782877 DOI: 10.1007/s13239-024-00721-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/30/2023] [Accepted: 02/04/2024] [Indexed: 05/25/2024]

Ingle M, Sharma M, Verma S, Sharma N, Bhurane A, Rajendra Acharya U. Automated explainable wavelet-based sleep scoring system for a population suspected with insomnia, apnea and periodic leg movement. Med Eng Phys 2024;130:104208. [PMID: 39160031 DOI: 10.1016/j.medengphy.2024.104208] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2023] [Revised: 05/31/2024] [Accepted: 07/01/2024] [Indexed: 08/21/2024]

Abstract

Sleep is an integral and vital component of human life, contributing significantly to overall health and well-being, but a considerable number of people worldwide experience sleep disorders. Sleep disorder diagnosis heavily depends on accurately classifying sleep stages. Traditionally, this classification has been performed manually by trained sleep technologists that visually inspect polysomnography records. However, in order to mitigate the labor-intensive nature of this process, automated approaches have been developed. These automated methods aim to streamline and facilitate sleep stage classification. This study aims to classify sleep stages in a dataset comprising subjects with insomnia, PLM, and sleep apnea. The dataset consists of PSG recordings from the multi-ethnic study of atherosclerosis (MESA) cohort of the national sleep research resource (NSRR), including 2056 subjects. Among these subjects, 130 have insomnia, 39 suffer from PLM, 156 have sleep apnea, and the remaining 1731 are classified as good sleepers. This study proposes an automated computerized technique to classify sleep stages, developing a machine-learning model with explainable artificial intelligence (XAI) capabilities using wavelet-based Hjorth parameters. An optimal biorthogonal wavelet filter bank (BOWFB) has been employed to extract subbands (SBs) from 30 seconds of electroencephalogram (EEG) epochs. Three EEG channels, namely: Fz_Cz, Cz_Oz, and C4_M1, are employed to yield an optimum outcome. The Hjorth parameters extracted from SBs were then fed to different machine learning algorithms. To gain an understanding of the model, in this study, we used SHAP (Shapley Additive explanations) method. For subjects suffering from the aforementioned diseases, the model utilized features derived from all channels and employed an ensembled bagged trees (EnBT) classifier. The highest accuracy of 86.8%, 87.3%, 85.0%, 84.5%, and 83.8% is obtained for the insomniac, PLM, apniac, good sleepers and complete datasets, respectively. Using these techniques and datasets, the study aims to enhance sleep stage classification accuracy and improve understanding of sleep disorders such as insomnia, PLM, and sleep apnea.

Collapse

Li Y, Geng Y, Sheng H. An improved mountain gazelle optimizer based on chaotic map and spiral disturbance for medical feature selection. PLoS One 2024;19:e0307288. [PMID: 39012921 PMCID: PMC11251600 DOI: 10.1371/journal.pone.0307288] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2023] [Accepted: 07/03/2024] [Indexed: 07/18/2024] Open

Du Y, Niu J, Xing Y, Li B, Calhoun VD. Neuroimage Analysis Methods and Artificial Intelligence Techniques for Reliable Biomarkers and Accurate Diagnosis of Schizophrenia: Achievements Made by Chinese Scholars Around the Past Decade. Schizophr Bull 2024:sbae110. [PMID: 38982882 DOI: 10.1093/schbul/sbae110] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 07/11/2024]

Abstract

BACKGROUND AND HYPOTHESIS

Schizophrenia (SZ) is characterized by significant cognitive and behavioral disruptions. Neuroimaging techniques, particularly magnetic resonance imaging (MRI), have been widely utilized to investigate biomarkers of SZ, distinguish SZ from healthy conditions or other mental disorders, and explore biotypes within SZ or across SZ and other mental disorders, which aim to promote the accurate diagnosis of SZ. In China, research on SZ using MRI has grown considerably in recent years.

STUDY DESIGN

The article reviews advanced neuroimaging and artificial intelligence (AI) methods using single-modal or multimodal MRI to reveal the mechanism of SZ and promote accurate diagnosis of SZ, with a particular emphasis on the achievements made by Chinese scholars around the past decade.

STUDY RESULTS

Our article focuses on the methods for capturing subtle brain functional and structural properties from the high-dimensional MRI data, the multimodal fusion and feature selection methods for obtaining important and sparse neuroimaging features, the supervised statistical analysis and classification for distinguishing disorders, and the unsupervised clustering and semi-supervised learning methods for identifying neuroimage-based biotypes. Crucially, our article highlights the characteristics of each method and underscores the interconnections among various approaches regarding biomarker extraction and neuroimage-based diagnosis, which is beneficial not only for comprehending SZ but also for exploring other mental disorders.

CONCLUSIONS

We offer a valuable review of advanced neuroimage analysis and AI methods primarily focused on SZ research by Chinese scholars, aiming to promote the diagnosis, treatment, and prevention of SZ, as well as other mental disorders, both within China and internationally.

Collapse

Rajab MD, Taketa T, Wharton SB, Wang D. Ranking and filtering of neuropathology features in the machine learning evaluation of dementia studies. Brain Pathol 2024;34:e13247. [PMID: 38374326 PMCID: PMC11189772 DOI: 10.1111/bpa.13247] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2023] [Accepted: 01/30/2024] [Indexed: 02/21/2024] Open

Jing X, Wielema M, Monroy-Gonzalez AG, Stams TRG, Mahesh SVK, Oudkerk M, Sijens PE, Dorrius MD, van Ooijen PMA. Automated Breast Density Assessment in MRI Using Deep Learning and Radiomics: Strategies for Reducing Inter-Observer Variability. J Magn Reson Imaging 2024;60:80-91. [PMID: 37846440 DOI: 10.1002/jmri.29058] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2023] [Revised: 09/18/2023] [Accepted: 09/19/2023] [Indexed: 10/18/2023] Open

Abstract

BACKGROUND

Accurate breast density evaluation allows for more precise risk estimation but suffers from high inter-observer variability.

PURPOSE

To evaluate the feasibility of reducing inter-observer variability of breast density assessment through artificial intelligence (AI) assisted interpretation.

STUDY TYPE

Retrospective.

POPULATION

Six hundred and twenty-one patients without breast prosthesis or reconstructions were randomly divided into training (N = 377), validation (N = 98), and independent test (N = 146) datasets.

FIELD STRENGTH/SEQUENCE

1.5 T and 3.0 T; T1-weighted spectral attenuated inversion recovery.

ASSESSMENT

Five radiologists independently assessed each scan in the independent test set to establish the inter-observer variability baseline and to reach a reference standard. Deep learning and three radiomics models were developed for three classification tasks: (i) four Breast Imaging-Reporting and Data System (BI-RADS) breast composition categories (A-D), (ii) dense (categories C, D) vs. non-dense (categories A, B), and (iii) extremely dense (category D) vs. moderately dense (categories A-C). The models were tested against the reference standard on the independent test set. AI-assisted interpretation was performed by majority voting between the models and each radiologist's assessment.

STATISTICAL TESTS

Inter-observer variability was assessed using linear-weighted kappa (κ) statistics. Kappa statistics, accuracy, and area under the receiver operating characteristic curve (AUC) were used to assess models against reference standard.

RESULTS

In the independent test set, five readers showed an overall substantial agreement on tasks (i) and (ii), but moderate agreement for task (iii). The best-performing model showed substantial agreement with reference standard for tasks (i) and (ii), but moderate agreement for task (iii). With the assistance of the AI models, almost perfect inter-observer variability was obtained for tasks (i) (mean κ = 0.86), (ii) (mean κ = 0.94), and (iii) (mean κ = 0.94).

DATA CONCLUSION

Deep learning and radiomics models have the potential to help reduce inter-observer variability of breast density assessment.

LEVEL OF EVIDENCE

3 TECHNICAL EFFICACY: Stage 1.

Collapse

Xu C, Wu J, Zhang F, Freer J, Zhang Z, Cheng Y. A deep image classification model based on prior feature knowledge embedding and application in medical diagnosis. Sci Rep 2024;14:13244. [PMID: 38853158 PMCID: PMC11163012 DOI: 10.1038/s41598-024-63818-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2024] [Accepted: 06/03/2024] [Indexed: 06/11/2024] Open

Abstract

Aiming at the problem of image classification with insignificant morphological structural features, strong target correlation, and low signal-to-noise ratio, combined with prior feature knowledge embedding, a deep learning method based on ResNet and Radial Basis Probabilistic Neural Network (RBPNN) is proposed model. Taking ResNet50 as a visual modeling network, it uses feature pyramid and self-attention mechanism to extract appearance and semantic features of images at multiple scales, and associate and enhance local and global features. Taking into account the diversity of category features, channel cosine similarity attention and dynamic C-means clustering algorithms are used to select representative sample features in different category of sample subsets to implicitly express prior category feature knowledge, and use them as the kernel centers of radial basis probability neurons (RBPN) to realize the embedding of diverse prior feature knowledge. In the RBPNN pattern aggregation layer, the outputs of RBPN are selectively summed according to the category of the kernel center, that is, the subcategory features are combined into category features, and finally the image classification is implemented based on Softmax. The functional module of the proposed method is designed specifically for image characteristics, which can highlight the significance of local and structural features of the image, form a non-convex decision-making area, and reduce the requirements for the completeness of the sample set. Applying the proposed method to medical image classification, experiments were conducted based on the brain tumor MRI image classification public dataset and the actual cardiac ultrasound image dataset, and the accuracy rate reached 85.82% and 83.92% respectively. Compared with the three mainstream image classification models, the performance indicators of this method have been significantly improved.

Collapse

Alsadi B, Musleh S, Al-Absi HRH, Refaee M, Qureshi R, El Hajj N, Alam T. An ensemble-based machine learning model for predicting type 2 diabetes and its effect on bone health. BMC Med Inform Decis Mak 2024;24:144. [PMID: 38811939 PMCID: PMC11134939 DOI: 10.1186/s12911-024-02540-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2023] [Accepted: 05/17/2024] [Indexed: 05/31/2024] Open

Abstract

BACKGROUND

Diabetes is a chronic condition that can result in many long-term physiological, metabolic, and neurological complications. Therefore, early detection of diabetes would help to determine a proper diagnosis and treatment plan.

METHODS

In this study, we employed machine learning (ML) based case-control study on a diabetic cohort size of 1000 participants form Qatar Biobank to predict diabetes using clinical and bone health indicators from Dual Energy X-ray Absorptiometry (DXA) machines. ML models were utilized to distinguish diabetes groups from non-diabetes controls. Recursive feature elimination (RFE) was leveraged to identify a subset of features to improve the performance of model. SHAP based analysis was used for the importance of features and support the explainability of the proposed model.

RESULTS

Ensemble based models XGboost and RF achieved over 84% accuracy for detecting diabetes. After applying RFE, we selected only 20 features which improved the model accuracy to 87.2%. From a clinical standpoint, higher HDL-Cholesterol and Neutrophil levels were observed in the diabetic group, along with lower vitamin B12 and testosterone levels. Lower sodium levels were found in diabetics, potentially stemming from clinical factors including specific medications, hormonal imbalances, unmanaged diabetes. We believe Dapagliflozin prescriptions in Qatar were associated with decreased Gamma Glutamyltransferase and Aspartate Aminotransferase enzyme levels, confirming prior research. We observed that bone area, bone mineral content, and bone mineral density were slightly lower in the Diabetes group across almost all body parts, but the difference against the control group was not statistically significant except in T12, troch and trunk area. No significant negative impact of diabetes progression on bone health was observed over a period of 5-15 yrs in the cohort.

CONCLUSION

This study recommends the inclusion of ML model which combines both DXA and clinical data for the early diagnosis of diabetes.

Collapse

Liu W, Jia L, Xu L, Yang F, Guo Z, Li J, Zhang D, Liu Y, Xiang H, Cheng H, Hou J, Li S, Li H. Prediction of early neurologic deterioration in patients with perforating artery territory infarction using machine learning: a retrospective study. Front Neurol 2024;15:1368902. [PMID: 38841697 PMCID: PMC11150528 DOI: 10.3389/fneur.2024.1368902] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2024] [Accepted: 04/24/2024] [Indexed: 06/07/2024] Open

Abstract

Background

Early neurological deterioration (END) is a frequent complication in patients with perforating artery territory infarction (PAI), leading to poorer outcomes. Therefore, we aimed to apply machine learning (ML) algorithms to predict the occurrence of END in PAI and investigate related risk factors.

Methods

This retrospective study analyzed a cohort of PAI patients, excluding those with severe stenosis of the parent artery. We included demographic characteristics, clinical features, laboratory data, and imaging variables. Recursive feature elimination with cross-validation (RFECV) was performed to identify critical features. Seven ML algorithms, namely logistic regression, random forest, adaptive boosting, gradient boosting decision tree, histogram-based gradient boosting, extreme gradient boosting, and category boosting, were developed to predict END in PAI patients using these critical features. We compared the accuracy of these models in predicting outcomes. Additionally, SHapley Additive exPlanations (SHAP) values were introduced to interpret the optimal model and assess the significance of input features.

Results

The study enrolled 1,020 PAI patients with a mean age of 60.46 (range 49.11-71.81) years. Of these, 30.39% were women, and 129 (12.65%) experienced END. RFECV selected 13 critical features, including blood urea nitrogen (BUN), total cholesterol (TC), low-density-lipoprotein cholesterol (LDL-C), apolipoprotein B (apoB), atrial fibrillation, loading dual antiplatelet therapy (DAPT), single antiplatelet therapy (SAPT), argatroban, the basal ganglia, the thalamus, the posterior choroidal arteries, maximal axial infarct diameter (measured at < 15 mm), and stroke subtype. The gradient-boosting decision tree had the highest area under the curve (0.914) among the seven ML algorithms. The SHAP analysis identified apoB as the most significant variable for END.

Conclusion

Our results suggest that ML algorithms, especially the gradient-boosting decision tree, are effective in predicting the occurrence of END in PAI patients.

Collapse

Bahameish M, Stockman T, Requena Carrión J. Strategies for Reliable Stress Recognition: A Machine Learning Approach Using Heart Rate Variability Features. SENSORS (BASEL, SWITZERLAND) 2024;24:3210. [PMID: 38794064 PMCID: PMC11126126 DOI: 10.3390/s24103210] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/19/2024] [Revised: 05/11/2024] [Accepted: 05/14/2024] [Indexed: 05/26/2024]

Burton RJ, Raffray L, Moet LM, Cuff SM, White DA, Baker SE, Moser B, O’Donnell VB, Ghazal P, Morgan MP, Artemiou A, Eberl M. Conventional and unconventional T-cell responses contribute to the prediction of clinical outcome and causative bacterial pathogen in sepsis patients. Clin Exp Immunol 2024;216:293-306. [PMID: 38430552 PMCID: PMC11097916 DOI: 10.1093/cei/uxae019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2023] [Revised: 02/12/2024] [Accepted: 02/28/2024] [Indexed: 03/04/2024] Open

Li Q, Lv H, Chen Y, Shen J, Shi J, Zhou C. Hybrid feature selection in a machine learning predictive model for perioperative myocardial injury in noncoronary cardiac surgery with cardiopulmonary bypass. Perfusion 2024:2676591241253459. [PMID: 38733257 DOI: 10.1177/02676591241253459] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/13/2024]

Abstract

BACKGROUND

Perioperative myocardial injury (PMI) is associated with increased mobility and mortality after noncoronary cardiac surgery. However, limited studies have developed a predictive model for PMI. Therefore, we used hybrid feature selection (FS) methods to establish a predictive model for PMI in noncoronary cardiac surgery with cardiopulmonary bypass (CPB).

METHODS

This was a single-center retrospective study conducted at the Fuwai Hospital in China. Patients aged 18-70 years who underwent elective noncoronary surgery with CPB at our institution from December 2018 to April 2021 were enrolled. The primary outcome was PMI, defined as the postoperative cardiac troponin I (cTnI) levels exceeding 220 times of upper reference limit (URL). Statistical analyses were conducted by Python (Python Software Foundation, version 3.9.7 and integrated development environment Jupyter Notebook 1.1.0) and SPSS software version 26.0 (IBM Corp., Armonk, New York, USA).

RESULTS

A total of 1130 patients were eventually eligible for this study. The incidence of PMI was 20.3% (229/1130) in the overall patients, 20.6% (163/791) in the training dataset, and 19.5% (66/339) in the testing dataset. The logistic regression model performed the best AUC of 0.6893 (95 CI%: 0.6371-0.7382) by the traditional selection method, and the random forest model performed the best AUC of 0.6937 (95 CI%: 0.6416-0.7423) by the union of Wrapper and Embedded method, and the CatBoost model performed the best AUC of 0.6828 (95 CI%: 0.6304-0.7320) by the union of Embedded and forward logistic regression technique, and the Naïve Bayes model achieved the best AUC with 0.7254 (95 CI%: 0.6746-0.7723) by forwarding logistic regression method. Moreover, the decision tree, KNeighborsClassifier, and support vector machine models performed the worse AUC in all selection forms. Furthermore, the SHapley Additive exPlanations plot showed that prolonged CPB, aortic clamp time, and preoperative low platelets count were strongly related to the PMI risk.

CONCLUSIONS

In total, four category feature selection methods were utilized, comprising five individual selection techniques and 15 combined methods. Notably, the combination of logistic regression and embedded methods demonstrated outstanding performance in predicting PMI risk. We also concluded that the machine learning model, including random forest, catboost, and Naive Bayes, were suitable candidates for establishing PMI predictive model. Nevertheless, additional investigation and validation are imperative for substantiating these finding.

Collapse

Bulut E, Arslan Yildiz U, Cengiz M, Yilmaz M, Kavakli AS, Arici AG, Ozturk N, Uslu S. Evaluation of the Effect of Morphological Structure on Dilatational Tracheostomy Interference Location and Complications with Ultrasonography and Fiberoptic Bronchoscopy. J Clin Med 2024;13:2788. [PMID: 38792330 PMCID: PMC11122435 DOI: 10.3390/jcm13102788] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2024] [Revised: 04/25/2024] [Accepted: 05/06/2024] [Indexed: 05/26/2024] Open

Li S, Xiang S, Ma Q, Cai W, Liu S, Fang F, Yu H. A decision support system for upper limb rehabilitation robot based on hybrid reasoning with RBR and CBR. Front Bioeng Biotechnol 2024;12:1400912. [PMID: 38720881 PMCID: PMC11076720 DOI: 10.3389/fbioe.2024.1400912] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2024] [Accepted: 04/08/2024] [Indexed: 05/12/2024] Open

Seyedtabib M, Najafi-Vosough R, Kamyari N. The predictive power of data: machine learning analysis for Covid-19 mortality based on personal, clinical, preclinical, and laboratory variables in a case-control study. BMC Infect Dis 2024;24:411. [PMID: 38637727 PMCID: PMC11025285 DOI: 10.1186/s12879-024-09298-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2023] [Accepted: 04/05/2024] [Indexed: 04/20/2024] Open

Abstract

BACKGROUND AND PURPOSE

The COVID-19 pandemic has presented unprecedented public health challenges worldwide. Understanding the factors contributing to COVID-19 mortality is critical for effective management and intervention strategies. This study aims to unlock the predictive power of data collected from personal, clinical, preclinical, and laboratory variables through machine learning (ML) analyses.

METHODS

A retrospective study was conducted in 2022 in a large hospital in Abadan, Iran. Data were collected and categorized into demographic, clinical, comorbid, treatment, initial vital signs, symptoms, and laboratory test groups. The collected data were subjected to ML analysis to identify predictive factors associated with COVID-19 mortality. Five algorithms were used to analyze the data set and derive the latent predictive power of the variables by the shapely additive explanation values.

RESULTS

Results highlight key factors associated with COVID-19 mortality, including age, comorbidities (hypertension, diabetes), specific treatments (antibiotics, remdesivir, favipiravir, vitamin zinc), and clinical indicators (heart rate, respiratory rate, temperature). Notably, specific symptoms (productive cough, dyspnea, delirium) and laboratory values (D-dimer, ESR) also play a critical role in predicting outcomes. This study highlights the importance of feature selection and the impact of data quantity and quality on model performance.

CONCLUSION

This study highlights the potential of ML analysis to improve the accuracy of COVID-19 mortality prediction and emphasizes the need for a comprehensive approach that considers multiple feature categories. It highlights the critical role of data quality and quantity in improving model performance and contributes to our understanding of the multifaceted factors that influence COVID-19 outcomes.

Collapse

Nishan A, M. Taslim Uddin Raju S, Hossain MI, Dipto SA, M. Tanvir Uddin S, Sijan A, Chowdhury MAS, Ahmad A, Mahamudul Hasan Khan M. A continuous cuffless blood pressure measurement from optimal PPG characteristic features using machine learning algorithms. Heliyon 2024;10:e27779. [PMID: 38533045 PMCID: PMC10963242 DOI: 10.1016/j.heliyon.2024.e27779] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2023] [Revised: 03/01/2024] [Accepted: 03/06/2024] [Indexed: 03/28/2024] Open

Abstract

Background and objective

Hypertension is a potentially dangerous health condition that can be detected by measuring blood pressure (BP). Blood pressure monitoring and measurement are essential for preventing and treating cardiovascular diseases. Cuff-based devices, on the other hand, are uncomfortable and prevent continuous BP measurement.

Methods

In this study, a new non-invasive and cuff-less method for estimating Systolic Blood Pressure (SBP), Mean Arterial Pressure (MAP), and Diastolic Blood Pressure (DBP) has been proposed using characteristic features of photoplethysmogram (PPG) signals and nonlinear regression algorithms. PPG signals were collected from 219 participants, which were then subjected to preprocessing and feature extraction steps. Analyzing PPG and its derivative signals, a total of 46 time, frequency, and time-frequency domain features were extracted. In addition, the age and gender of each subject were also included as features. Further, correlation-based feature selection (CFS) and Relief F feature selection (ReliefF) techniques were used to select the relevant features and reduce the possibility of over-fitting the models. Finally, support vector regression (SVR), K-nearest neighbour regression (KNR), decision tree regression (DTR), and random forest regression (RFR) were established to develop the BP estimation model. Regression models were trained and evaluated on all features as well as selected features. The best regression models for SBP, MAP, and DBP estimations were selected separately.

Results

The SVR model, along with the ReliefF-based feature selection algorithm, outperforms other algorithms in estimating the SBP, MAP, and DBP with the mean absolute error of 2.49, 1.62 and 1.43 mmHg, respectively. The proposed method meets the Advancement of Medical Instrumentation standard for BP estimations. Based on the British Hypertension Society standard, the results also fall within Grade A for SBP, MAP, and DBP.

Conclusion

The findings show that the method can be used to estimate blood pressure non-invasively, without using a cuff or calibration, and only by utilizing the PPG signal characteristic features.

Collapse

Tiwari AK, Saini R, Nath A, Singh P, Shah MA. Hybrid similarity relation based mutual information for feature selection in intuitionistic fuzzy rough framework and its applications. Sci Rep 2024;14:5958. [PMID: 38472266 PMCID: PMC10933482 DOI: 10.1038/s41598-024-55902-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2023] [Accepted: 02/28/2024] [Indexed: 03/14/2024] Open

Abstract

Fuzzy rough entropy established in the notion of fuzzy rough set theory, which has been effectively and efficiently applied for feature selection to handle the uncertainty in real-valued datasets. Further, Fuzzy rough mutual information has been presented by integrating information entropy with fuzzy rough set to measure the importance of features. However, none of the methods till date can handle noise, uncertainty and vagueness simultaneously due to both judgement and identification, which lead to degrade the overall performances of the learning algorithms with the increment in the number of mixed valued conditional features. In the current study, these issues are tackled by presenting a novel intuitionistic fuzzy (IF) assisted mutual information concept along with IF granular structure. Initially, a hybrid IF similarity relation is introduced. Based on this relation, an IF granular structure is introduced. Then, IF rough conditional and joint entropies are established. Further, mutual information based on these concepts are discussed. Next, mathematical theorems are proved to demonstrate the validity of the given notions. Thereafter, significance of the features subset is computed by using this mutual information, and corresponding feature selection is suggested to delete the irrelevant and redundant features. The current approach effectively handles noise and subsequent uncertainty in both nominal and mixed data (including both nominal and category variables). Moreover, comprehensive experimental performances are evaluated on real-valued benchmark datasets to demonstrate the practical validation and effectiveness of the addressed technique. Finally, an application of the proposed method is exhibited to improve the prediction of phospholipidosis positive molecules. RF(h2o) produces the most effective results till date based on our proposed methodology with sensitivity, accuracy, specificity, MCC, and AUC of 86.7%, 90.1%, 93.0% , 0.808, and 0.922 respectively.

Collapse

Yang F, Xu Z, Wang H, Sun L, Zhai M, Zhang J. A hybrid feature selection algorithm combining information gain and grouping particle swarm optimization for cancer diagnosis. PLoS One 2024;19:e0290332. [PMID: 38466662 PMCID: PMC10927139 DOI: 10.1371/journal.pone.0290332] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2023] [Accepted: 08/04/2023] [Indexed: 03/13/2024] Open

Deng F, Zhao L, Yu N, Lin Y, Zhang L. Union With Recursive Feature Elimination: A Feature Selection Framework to Improve the Classification Performance of Multicategory Causes of Death in Colorectal Cancer. J Transl Med 2024;104:100320. [PMID: 38158124 DOI: 10.1016/j.labinv.2023.100320] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2023] [Revised: 12/05/2023] [Accepted: 12/20/2023] [Indexed: 01/03/2024] Open

Nopour R, Kazemi-Arpanahi H. Developing an intelligent prediction system for successful aging based on artificial neural networks. Int J Prev Med 2024;15:10. [PMID: 38563039 PMCID: PMC10982733 DOI: 10.4103/ijpvm.ijpvm_47_23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2023] [Accepted: 10/04/2023] [Indexed: 04/04/2024] Open

Lyu H, Huang H, He J, Zhu S, Hong W, Lai J, Gao T, Shao J, Zhu J, Li Y, Hu S. Task-state skin potential abnormalities can distinguish major depressive disorder and bipolar depression from healthy controls. Transl Psychiatry 2024;14:110. [PMID: 38395985 PMCID: PMC10891315 DOI: 10.1038/s41398-024-02828-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/21/2023] [Revised: 02/07/2024] [Accepted: 02/13/2024] [Indexed: 02/25/2024] Open

Affiliation(s)

Hailong Lyu Department of Psychiatry, The First Affiliated Hospital, Zhejiang University School of Medicine; Key Laboratory of Mental Disorder's Management of Zhejiang Province, Hangzhou, 310003, China Brain Research Institute of Zhejiang University, Hangzhou, 310003, China Zhejiang Engineering Center for Mathematical Mental Health, Hangzhou, 310003, China
Huimin Huang The Third Affiliated Hospital of Wenzhou Medical University, Wenzhou, 325200, China Ruian People's Hospital, Wenzhou, 325200, China
Jiadong He College of Information Science and Electronic Engineering, Zhejiang University, Hangzhou, 310027, China
Sheng Zhu Department of Psychiatry, The Ruian Fifth People's Hospital, Wenzhou, 325200, China
Wanchu Hong Department of Psychiatry, The Ruian Fifth People's Hospital, Wenzhou, 325200, China
Jianbo Lai Department of Psychiatry, The First Affiliated Hospital, Zhejiang University School of Medicine; Key Laboratory of Mental Disorder's Management of Zhejiang Province, Hangzhou, 310003, China Brain Research Institute of Zhejiang University, Hangzhou, 310003, China Zhejiang Engineering Center for Mathematical Mental Health, Hangzhou, 310003, China
Tongsheng Gao Ningbo Psychiatric Hospital, Ningbo, 315032, China
Jiamin Shao Department of Psychiatry, The First Affiliated Hospital, Zhejiang University School of Medicine; Key Laboratory of Mental Disorder's Management of Zhejiang Province, Hangzhou, 310003, China Brain Research Institute of Zhejiang University, Hangzhou, 310003, China Zhejiang Engineering Center for Mathematical Mental Health, Hangzhou, 310003, China
Jianfeng Zhu Department of Psychiatry, The Ruian Fifth People's Hospital, Wenzhou, 325200, China
Yubo Li College of Information Science and Electronic Engineering, Zhejiang University, Hangzhou, 310027, China.
Shaohua Hu Department of Psychiatry, The First Affiliated Hospital, Zhejiang University School of Medicine; Key Laboratory of Mental Disorder's Management of Zhejiang Province, Hangzhou, 310003, China. Brain Research Institute of Zhejiang University, Hangzhou, 310003, China. Zhejiang Engineering Center for Mathematical Mental Health, Hangzhou, 310003, China. Ruian People's Hospital, Wenzhou, 325200, China.

Collapse

Zhou TH, Zhou XX, Ni J, Ma YQ, Xu FY, Fan B, Guan Y, Jiang XA, Lin XQ, Li J, Xia Y, Wang X, Wang Y, Huang WJ, Tu WT, Dong P, Li ZB, Liu SY, Fan L. CT whole lung radiomic nomogram: a potential biomarker for lung function evaluation and identification of COPD. Mil Med Res 2024;11:14. [PMID: 38374260 PMCID: PMC10877876 DOI: 10.1186/s40779-024-00516-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/04/2023] [Accepted: 01/22/2024] [Indexed: 02/21/2024] Open

Affiliation(s)

Tao-Hu Zhou Department of Radiology, the Second Affiliated Hospital of Naval Medical University, Shanghai, 200003, China School of Medical Imaging, Shandong Second Medical University, Weifang, 261053, Shandong, China
Xiu-Xiu Zhou Department of Radiology, the Second Affiliated Hospital of Naval Medical University, Shanghai, 200003, China
Jiong Ni Department of Radiology, School of Medicine, Tongji Hospital, Tongji University, Shanghai, 200065, China
Yan-Qing Ma Department of Radiology, Zhejiang Province People's Hospital, Affiliated People's Hospital of Hangzhou Medical College, Hangzhou, 310014, China
Fang-Yi Xu Department of Radiology, Sir Run Run Shaw Hospital, Zhejiang, 310018, China
Bing Fan Jiangxi Provincial People's Hospital, the First Affiliated Hospital of Nanchang Medical College, Nanchang, 330006, China
Yu Guan Department of Radiology, the Second Affiliated Hospital of Naval Medical University, Shanghai, 200003, China
Xin-Ang Jiang Department of Radiology, the Second Affiliated Hospital of Naval Medical University, Shanghai, 200003, China
Xiao-Qing Lin Department of Radiology, the Second Affiliated Hospital of Naval Medical University, Shanghai, 200003, China College of Health Sciences and Engineering, University of Shanghai for Science and Technology, Shanghai, 200093, China
Jie Li Department of Radiology, the Second Affiliated Hospital of Naval Medical University, Shanghai, 200003, China College of Health Sciences and Engineering, University of Shanghai for Science and Technology, Shanghai, 200093, China
Yi Xia Department of Radiology, the Second Affiliated Hospital of Naval Medical University, Shanghai, 200003, China
Xiang Wang Department of Radiology, the Second Affiliated Hospital of Naval Medical University, Shanghai, 200003, China
Yun Wang Department of Radiology, the Second Affiliated Hospital of Naval Medical University, Shanghai, 200003, China
Wen-Jun Huang Department of Radiology, the Second Affiliated Hospital of Naval Medical University, Shanghai, 200003, China Department of Radiology, the Second People's Hospital of Deyang, Deyang, 618000, Sichuan, China
Wen-Ting Tu Department of Radiology, the Second Affiliated Hospital of Naval Medical University, Shanghai, 200003, China
Peng Dong School of Medical Imaging, Shandong Second Medical University, Weifang, 261053, Shandong, China
Zhao-Bin Li Department of Radiation Oncology, Shanghai Jiao Tong University Affiliated Sixth People's Hospital, Shanghai, 200233, China
Shi-Yuan Liu Department of Radiology, the Second Affiliated Hospital of Naval Medical University, Shanghai, 200003, China
Li Fan Department of Radiology, the Second Affiliated Hospital of Naval Medical University, Shanghai, 200003, China.

Collapse

Reduwan NH, Abdul Aziz AA, Mohd Razi R, Abdullah ERMF, Mazloom Nezhad SM, Gohain M, Ibrahim N. Application of deep learning and feature selection technique on external root resorption identification on CBCT images. BMC Oral Health 2024;24:252. [PMID: 38373931 PMCID: PMC10875886 DOI: 10.1186/s12903-024-03910-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2023] [Accepted: 01/17/2024] [Indexed: 02/21/2024] Open

Abstract

BACKGROUND

Artificial intelligence has been proven to improve the identification of various maxillofacial lesions. The aim of the current study is two-fold: to assess the performance of four deep learning models (DLM) in external root resorption (ERR) identification and to assess the effect of combining feature selection technique (FST) with DLM on their ability in ERR identification.

METHODS

External root resorption was simulated on 88 extracted premolar teeth using tungsten bur in different depths (0.5 mm, 1 mm, and 2 mm). All teeth were scanned using a Cone beam CT (Carestream Dental, Atlanta, GA). Afterward, a training (70%), validation (10%), and test (20%) dataset were established. The performance of four DLMs including Random Forest (RF) + Visual Geometry Group 16 (VGG), RF + EfficienNetB4 (EFNET), Support Vector Machine (SVM) + VGG, and SVM + EFNET) and four hybrid models (DLM + FST: (i) FS + RF + VGG, (ii) FS + RF + EFNET, (iii) FS + SVM + VGG and (iv) FS + SVM + EFNET) was compared. Five performance parameters were assessed: classification accuracy, F1-score, precision, specificity, and error rate. FST algorithms (Boruta and Recursive Feature Selection) were combined with the DLMs to assess their performance.

RESULTS

RF + VGG exhibited the highest performance in identifying ERR, followed by the other tested models. Similarly, FST combined with RF + VGG outperformed other models with classification accuracy, F1-score, precision, and specificity of 81.9%, weighted accuracy of 83%, and area under the curve (AUC) of 96%. Kruskal Wallis test revealed a significant difference (p = 0.008) in the prediction accuracy among the eight DLMs.

CONCLUSION

In general, all DLMs have similar performance on ERR identification. However, the performance can be improved by combining FST with DLMs.

Collapse

Zhang H, Yin J, Zhou C, Qiu J, Wang J, Lv Q, Luo T. Identification of ipsilateral supraclavicular lymph node metastasis in breast cancer based on LASSO regression with a high penalty factor. Front Oncol 2024;14:1349315. [PMID: 38371618 PMCID: PMC10869533 DOI: 10.3389/fonc.2024.1349315] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2023] [Accepted: 01/19/2024] [Indexed: 02/20/2024] Open

Jenul A, Stokmo HL, Schrunner S, Hjortland GO, Revheim ME, Tomic O. Novel ensemble feature selection techniques applied to high-grade gastroenteropancreatic neuroendocrine neoplasms for the prediction of survival. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2024;244:107934. [PMID: 38016391 DOI: 10.1016/j.cmpb.2023.107934] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Revised: 09/05/2023] [Accepted: 11/17/2023] [Indexed: 11/30/2023]

Abstract

BACKGROUND AND OBJECTIVE

Determining the most informative features for predicting the overall survival of patients diagnosed with high-grade gastroenteropancreatic neuroendocrine neoplasms is crucial to improve individual treatment plans for patients, as well as the biological understanding of the disease. The main objective of this study is to evaluate the use of modern ensemble feature selection techniques for this purpose with respect to (a) quantitative performance measures such as predictive performance, (b) clinical interpretability, and (c) the effect of integrating prior expert knowledge.

METHODS

The Repeated Elastic Net Technique for Feature Selection (RENT) and the User-Guided Bayesian Framework for Feature Selection (UBayFS) are recently developed ensemble feature selectors investigated in this work. Both allow the user to identify informative features in datasets with low sample sizes and focus on model interpretability. While RENT is purely data-driven, UBayFS can integrate expert knowledge a priori in the feature selection process. In this work, we compare both feature selectors on a dataset comprising 63 patients and 110 features from multiple sources, including baseline patient characteristics, baseline blood values, tumor histology, imaging, and treatment information.

RESULTS

Our experiments involve data-driven and expert-driven setups, as well as combinations of both. In a five-fold cross-validated experiment without expert knowledge, our results demonstrate that both feature selectors allow accurate predictions: A reduction from 110 to approximately 20 features (around 82%) delivers near-optimal predictive performances with minor variations according to the choice of the feature selector, the predictive model, and the fold. Thereafter, we use findings from clinical literature as a source of expert knowledge. In addition, expert knowledge has a stabilizing effect on the feature set (an increase in stability of approximately 40%), while the impact on predictive performance is limited.

CONCLUSIONS

The features WHO Performance Status, Albumin, Platelets, Ki-67, Tumor Morphology, Total MTV, Total TLG, and SUVmax are the most stable and predictive features in our study. Overall, this study demonstrated the practical value of feature selection in medical applications not only to improve quantitative performance but also to deliver potentially new insights to experts.

Collapse

Aragones DG, Palomino-Segura M, Sicilia J, Crainiciuc G, Ballesteros I, Sánchez-Cabo F, Hidalgo A, Calvo GF. Variable selection for nonlinear dimensionality reduction of biological datasets through bootstrapping of correlation networks. Comput Biol Med 2024;168:107827. [PMID: 38086138 DOI: 10.1016/j.compbiomed.2023.107827] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2023] [Revised: 11/15/2023] [Accepted: 12/04/2023] [Indexed: 01/10/2024]

Ding X, Li Y, Chen S. Maximum margin and global criterion based-recursive feature selection. Neural Netw 2024;169:597-606. [PMID: 37956576 DOI: 10.1016/j.neunet.2023.10.037] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2023] [Revised: 06/19/2023] [Accepted: 10/22/2023] [Indexed: 11/15/2023]

Williams TL, Gonen M, Wray R, Do RKG, Simpson AL. Quantitation of Oncologic Image Features for Radiomic Analyses in PET. Methods Mol Biol 2024;2729:409-421. [PMID: 38006509 DOI: 10.1007/978-1-0716-3499-8_23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2023]

Arian R, Vard A, Kafieh R, Plonka G, Rabbani H. A new convolutional neural network based on combination of circlets and wavelets for macular OCT classification. Sci Rep 2023;13:22582. [PMID: 38114582 PMCID: PMC10730902 DOI: 10.1038/s41598-023-50164-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2023] [Accepted: 12/15/2023] [Indexed: 12/21/2023] Open

Abstract

Artificial intelligence (AI) algorithms, encompassing machine learning and deep learning, can assist ophthalmologists in early detection of various ocular abnormalities through the analysis of retinal optical coherence tomography (OCT) images. Despite considerable progress in these algorithms, several limitations persist in medical imaging fields, where a lack of data is a common issue. Accordingly, specific image processing techniques, such as time-frequency transforms, can be employed in conjunction with AI algorithms to enhance diagnostic accuracy. This research investigates the influence of non-data-adaptive time-frequency transforms, specifically X-lets, on the classification of OCT B-scans. For this purpose, each B-scan was transformed using every considered X-let individually, and all the sub-bands were utilized as the input for a designed 2D Convolutional Neural Network (CNN) to extract optimal features, which were subsequently fed to the classifiers. Evaluating per-class accuracy shows that the use of the 2D Discrete Wavelet Transform (2D-DWT) yields superior outcomes for normal cases, whereas the circlet transform outperforms other X-lets for abnormal cases characterized by circles in their retinal structure (due to the accumulation of fluid). As a result, we propose a novel transform named CircWave by concatenating all sub-bands from the 2D-DWT and the circlet transform. The objective is to enhance the per-class accuracy of both normal and abnormal cases simultaneously. Our findings show that classification results based on the CircWave transform outperform those derived from original images or any individual transform. Furthermore, Grad-CAM class activation visualization for B-scans reconstructed from CircWave sub-bands highlights a greater emphasis on circular formations in abnormal cases and straight lines in normal cases, in contrast to the focus on irrelevant regions in original B-scans. To assess the generalizability of our method, we applied it to another dataset obtained from a different imaging system. We achieved promising accuracies of 94.5% and 90% for the first and second datasets, respectively, which are comparable with results from previous studies. The proposed CNN based on CircWave sub-bands (i.e. CircWaveNet) not only produces superior outcomes but also offers more interpretable results with a heightened focus on features crucial for ophthalmologists.

Collapse

Augustine J, Jereesh AS. Identification of gene-level methylation for disease prediction. Interdiscip Sci 2023;15:678-695. [PMID: 37603212 DOI: 10.1007/s12539-023-00584-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2023] [Revised: 07/30/2023] [Accepted: 08/01/2023] [Indexed: 08/22/2023]

Chatterjee A, Pahari N, Prinz A, Riegler M. AI and semantic ontology for personalized activity eCoaching in healthy lifestyle recommendations: a meta-heuristic approach. BMC Med Inform Decis Mak 2023;23:278. [PMID: 38041041 PMCID: PMC10693173 DOI: 10.1186/s12911-023-02364-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2022] [Accepted: 11/03/2023] [Indexed: 12/03/2023] Open

Abstract

BACKGROUND

Automated coaches (eCoach) can help people lead a healthy lifestyle (e.g., reduction of sedentary bouts) with continuous health status monitoring and personalized recommendation generation with artificial intelligence (AI). Semantic ontology can play a crucial role in knowledge representation, data integration, and information retrieval.

METHODS

This study proposes a semantic ontology model to annotate the AI predictions, forecasting outcomes, and personal preferences to conceptualize a personalized recommendation generation model with a hybrid approach. This study considers a mixed activity projection method that takes individual activity insights from the univariate time-series prediction and ensemble multi-class classification approaches. We have introduced a way to improve the prediction result with a residual error minimization (REM) technique and make it meaningful in recommendation presentation with a Naïve-based interval prediction approach. We have integrated the activity prediction results in an ontology for semantic interpretation. A SPARQL query protocol and RDF Query Language (SPARQL) have generated personalized recommendations in an understandable format. Moreover, we have evaluated the performance of the time-series prediction and classification models against standard metrics on both imbalanced and balanced public PMData and private MOX2-5 activity datasets. We have used Adaptive Synthetic (ADASYN) to generate synthetic data from the minority classes to avoid bias. The activity datasets were collected from healthy adults (n = 16 for public datasets; n = 15 for private datasets). The standard ensemble algorithms have been used to investigate the possibility of classifying daily physical activity levels into the following activity classes: sedentary (0), low active (1), active (2), highly active (3), and rigorous active (4). The daily step count, low physical activity (LPA), medium physical activity (MPA), and vigorous physical activity (VPA) serve as input for the classification models. Subsequently, we re-verify the classifiers on the private MOX2-5 dataset. The performance of the ontology has been assessed with reasoning and SPARQL query execution time. Additionally, we have verified our ontology for effective recommendation generation.

RESULTS

We have tested several standard AI algorithms and selected the best-performing model with optimized configuration for our use case by empirical testing. We have found that the autoregression model with the REM method outperforms the autoregression model without the REM method for both datasets. Gradient Boost (GB) classifier outperforms other classifiers with a mean accuracy score of 98.00%, and 99.00% for imbalanced PMData and MOX2-5 datasets, respectively, and 98.30%, and 99.80% for balanced PMData and MOX2-5 datasets, respectively. Hermit reasoner performs better than other ontology reasoners under defined settings. Our proposed algorithm shows a direction to combine the AI prediction forecasting results in an ontology to generate personalized activity recommendations in eCoaching.

CONCLUSION

The proposed method combining step-prediction, activity-level classification techniques, and personal preference information with semantic rules is an asset for generating personalized recommendations.

Collapse

Huang MW, Tsai CF, Tsui SC, Lin WC. Combining data discretization and missing value imputation for incomplete medical datasets. PLoS One 2023;18:e0295032. [PMID: 38033140 PMCID: PMC10688879 DOI: 10.1371/journal.pone.0295032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2023] [Accepted: 11/14/2023] [Indexed: 12/02/2023] Open

van Dartel D, Wang Y, Hegeman JH, Vollenbroek-Hutten MMR. Prediction of Physical Activity Patterns in Older Patients Rehabilitating After Hip Fracture Surgery: Exploratory Study. JMIR Rehabil Assist Technol 2023;10:e45307. [PMID: 38032703 PMCID: PMC10727481 DOI: 10.2196/45307] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2022] [Revised: 06/25/2023] [Accepted: 07/27/2023] [Indexed: 12/01/2023] Open

Abstract

BACKGROUND

Building up physical activity is a highly important aspect in an older patient's rehabilitation process after hip fracture surgery. The patterns of physical activity during rehabilitation are associated with the duration of rehabilitation stay. Predicting physical activity patterns early in the rehabilitation phase can provide patients and health care professionals an early indication of the duration of rehabilitation stay as well as insight into the degree of patients' recovery for timely adaptive interventions.

OBJECTIVE

This study aims to explore the early prediction of physical activity patterns in older patients rehabilitating after hip fracture surgery at a skilled nursing home.

METHODS

The physical activity of patients aged ≥70 years with surgically treated hip fracture was continuously monitored using an accelerometer during rehabilitation at a skilled nursing home. Physical activity patterns were described in our previous study, and the 2 most common patterns were used in this study for pattern prediction: the upward linear pattern (n=15) and the S-shape pattern (n=23). Features from the intensity of physical activity were calculated for time windows with different window sizes of the first 5, 6, 7, and 8 days to assess the early rehabilitation moment in which the patterns could be predicted most accurately. Those features were statistical features, amplitude features, and morphological features. Furthermore, the Barthel Index, Fracture Mobility Score, Functional Ambulation Categories, and the Montreal Cognitive Assessment score were used as clinical features. With the correlation-based feature selection method, relevant features were selected that were highly correlated with the physical activity patterns and uncorrelated with other features. Multiple classifiers were used: decision trees, discriminant analysis, logistic regression, support vector machines, nearest neighbors, and ensemble classifiers. The performance of the prediction models was assessed by calculating precision, recall, and F1-score (accuracy measure) for each individual physical activity pattern. Furthermore, the overall performance of the prediction model was calculated by calculating the F1-score for all physical activity patterns together.

RESULTS

The amplitude feature describing the overall intensity of physical activity on the first day of rehabilitation and the morphological features describing the shape of the patterns were selected as relevant features for all time windows. Relevant features extracted from the first 7 days with a cosine k-nearest neighbor model reached the highest overall prediction performance (micro F1-score=1) and a 100% correct classification of the 2 most common physical activity patterns.

CONCLUSIONS

Continuous monitoring of the physical activity of older patients in the first week of hip fracture rehabilitation results in an early physical activity pattern prediction. In the future, continuous physical activity monitoring can offer the possibility to predict the duration of rehabilitation stay, assess the recovery progress during hip fracture rehabilitation, and benefit health care organizations, health care professionals, and patients themselves.

Collapse

Alghushairy O, Ali F, Alghamdi W, Khalid M, Alsini R, Asiry O. Machine learning-based model for accurate identification of druggable proteins using light extreme gradient boosting. J Biomol Struct Dyn 2023:1-12. [PMID: 37850427 DOI: 10.1080/07391102.2023.2269280] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Accepted: 10/04/2023] [Indexed: 10/19/2023]

de Lima Gonçalves V, Ribeiro CT, Cavalheiro GL, Zaruz MJF, da Silva DH, Milagre ST, de Oliveira Andrade A, Pereira AA. A hybrid linear discriminant analysis and genetic algorithm to create a linear model of aging when performing motor tasks through inertial sensors positioned on the hand and forearm. Biomed Eng Online 2023;22:98. [PMID: 37845723 PMCID: PMC10580547 DOI: 10.1186/s12938-023-01161-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2023] [Accepted: 10/01/2023] [Indexed: 10/18/2023] Open

Abstract

BACKGROUND

During the aging process, cognitive functions and performance of the muscular and neural system show signs of decline, thus making the elderly more susceptible to disease and death. These alterations, which occur with advanced age, affect functional performance in both the lower and upper members, and consequently human motor functions. Objective measurements are important tools to help understand and characterize the dysfunctions and limitations that occur due to neuromuscular changes related to advancing age. Therefore, the objective of this study is to attest to the difference between groups of young and old individuals through manual movements and whether the combination of features can produce a linear correlation concerning the different age groups.

METHODS

This study counted on 99 participants, these were divided into 8 groups, which were grouped by age. The data collection was performed using inertial sensors (positioned on the back of the hand and on the back of the forearm). Firstly, the participants were divided into groups of young and elderly to verify if the groups could be distinguished through the features alone. Following this, the features were combined using the linear discriminant analysis (LDA), which gave rise to a singular feature called the LDA-value that aided in verifying the correlation between the different age ranges and the LDA-value.

RESULTS

The results demonstrated that 125 features are able to distinguish the difference between the groups of young and elderly individuals. The use of the LDA-value allows for the obtaining of a linear model of the changes that occur with aging in the performance of tasks in line with advancing age, the correlation obtained, using Pearson's coefficient, was 0.86.

CONCLUSION

When we compare only the young and elderly groups, the results indicate that there is a difference in the way tasks are performed between young and elderly individuals. When the 8 groups were analyzed, the linear correlation obtained was strong, with the LDA-value being effective in obtaining a linear correlation of the eight groups, demonstrating that although the features alone do not demonstrate gradual changes as a function of age, their combination established these changes.

Collapse

Affiliation(s)

Veronica de Lima Gonçalves Postgraduate Program in Electrical and Biomedical Engineering, Faculty of Electrical Engineering, Centre for Innovation and Technology Assessment in Health, Federal University of Uberlândia, Uberlândia, Brazil
Caio Tonus Ribeiro Postgraduate Program in Electrical and Biomedical Engineering, Faculty of Electrical Engineering, Centre for Innovation and Technology Assessment in Health, Federal University of Uberlândia, Uberlândia, Brazil
Guilherme Lopes Cavalheiro Postgraduate Program in Electrical and Biomedical Engineering, Faculty of Electrical Engineering, Centre for Innovation and Technology Assessment in Health, Federal University of Uberlândia, Uberlândia, Brazil
Maria José Ferreira Zaruz Postgraduate Program in Electrical and Biomedical Engineering, Faculty of Electrical Engineering, Centre for Innovation and Technology Assessment in Health, Federal University of Uberlândia, Uberlândia, Brazil
Daniel Hilário da Silva Postgraduate Program in Electrical and Biomedical Engineering, Faculty of Electrical Engineering, Centre for Innovation and Technology Assessment in Health, Federal University of Uberlândia, Uberlândia, Brazil
Selma Terezinha Milagre Postgraduate Program in Electrical and Biomedical Engineering, Faculty of Electrical Engineering, Centre for Innovation and Technology Assessment in Health, Federal University of Uberlândia, Uberlândia, Brazil
Adriano de Oliveira Andrade Postgraduate Program in Electrical and Biomedical Engineering, Faculty of Electrical Engineering, Centre for Innovation and Technology Assessment in Health, Federal University of Uberlândia, Uberlândia, Brazil
Adriano Alves Pereira Postgraduate Program in Electrical and Biomedical Engineering, Faculty of Electrical Engineering, Centre for Innovation and Technology Assessment in Health, Federal University of Uberlândia, Uberlândia, Brazil.

Collapse

Luo KH, Wu CH, Yang CC, Chen TH, Tu HP, Yang CH, Chuang HY. Exploring the association of metal mixture in blood to the kidney function and tumor necrosis factor alpha using machine learning methods. ECOTOXICOLOGY AND ENVIRONMENTAL SAFETY 2023;265:115528. [PMID: 37783110 DOI: 10.1016/j.ecoenv.2023.115528] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/27/2023] [Revised: 09/09/2023] [Accepted: 09/25/2023] [Indexed: 10/04/2023]

Abstract

This research aimed to approach relationships between metal mixture in blood and kidney function, tumor necrosis factor alpha (TNF-α) by machine learning. Metals levels were measured by Inductively Couple Plasma Mass Spectrometry in blood from 421 participants. We applied K Nearest Neighbor (KNN), Naive Bayes classifier (NB), Support Vector Machines (SVM), random forest (RF), Gradient Boosting Decision Tree (GBDT), Categorical boosting (CatBoost), eXtreme Gradient Boosting (XGBoost), Whale Optimization-based XGBoost (WXGBoost) to identify the effect of plasma metals, TNF-α, and estimated glomerular filtration rate (eGFR by CKD-EPI equation). We conducted not only toxic metals, lead (Pb), arsenic (As), cadmium (Cd) but also included trace essential metals, selenium (Se), copper (Cu), zinc (Zn), cobalt (Co), to predict the interaction of TNF-α, TNF-α/white blood count, and eGFR. The high average TNF-α level group was observed among subjects with higher Pb, As, Cd, Cu, and Zn levels in blood. No associations were shown between the low and high TNF-α level group in blood Se and Co levels. Those with lower eGFR group had high Pb, As, Cd, Co, Cu, and Zn levels. The crucial predictor of TNF-α level in metals was blood Pb, and then Cd, As, Cu, Se, Zn and Co. The machine learning revealed that As was the major role among predictors of eGFR after feature selection. The levels of kidney function and TNF-α were modified by co-exposure metals. We were able to acquire highest accuracy of over 85% in the multi-metals exposure model. The higher Pb and Zn levels had strongest interaction with declined eGFR. In addition, As and Cd had synergistic with prediction model of TNF-α. We explored the potential of machine learning approaches for predicting health outcomes with multi-metal exposure. XGBoost model added SHAP could give an explicit explanation of individualized and precision risk prediction and insight of the interaction of key features in the multi-metal exposure.

Collapse

Affiliation(s)

Kuei-Hau Luo Graduate Institute of Medicine, College of Medicine, Kaohsiung Medicine University, Kaohsiung City 807, Taiwan
Chih-Hsien Wu Department of Electronic Engineering, National Kaohsiung University of Science and Technology, Kaohsiung 80778, Taiwan
Chen-Cheng Yang Graduate Institute of Medicine, College of Medicine, Kaohsiung Medicine University, Kaohsiung City 807, Taiwan; Department of Occupational Medicine, Kaohsiung Municipal Siaogang Hospital, Kaohsiung Medical University, Kaohsiung 812, Taiwan
Tzu-Hua Chen Department of Family Medicine, Kaohsiung Municipal Ta-Tung Hospital, Kaohsiung 801, Taiwan
Hung-Pin Tu Department of Public Health and Environmental Medicine, School of Medicine, College of Medicine, Kaohsiung Medical University, Kaohsiung 807, Taiwan
Cheng-Hong Yang Department of Electronic Engineering, National Kaohsiung University of Science and Technology, Kaohsiung 80778, Taiwan; Department of Information Management, Tainan University of Technology, Tainan 71002, Taiwan; Drug Development and Value Creation Research Center, Kaohsiung Medical University, Kaohsiung 80708, Taiwan; Ph. D. Program in Biomedical Engineering, Kaohsiung Medical University, Kaohsiung 80708, Taiwan; School of Dentistry, Kaohsiung Medical University, Kaohsiung 80708, Taiwan
Hung-Yi Chuang Graduate Institute of Medicine, College of Medicine, Kaohsiung Medicine University, Kaohsiung City 807, Taiwan; Department of Public Health and Environmental Medicine, School of Medicine, College of Medicine, Kaohsiung Medical University, Kaohsiung 807, Taiwan; Department of Occupational and Environmental Medicine, Kaohsiung Medicine University Hospital, Kaohsiung Medicine University, Kaohsiung City 807, Taiwan; Ph.D. Program in Environmental and Occupational Medicine, and Research Center for Precision Environmental Medicine, College of Medicine, Kaohsiung Medical University, Kaohsiung 807, Taiwan.

Collapse

Pacheco J, Saiz O, Casado S, Ubillos S. A multistart tabu search-based method for feature selection in medical applications. Sci Rep 2023;13:17140. [PMID: 37816874 PMCID: PMC10564765 DOI: 10.1038/s41598-023-44437-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2023] [Accepted: 10/08/2023] [Indexed: 10/12/2023] Open

Abstract

In the design of classification models, irrelevant or noisy features are often generated. In some cases, there may even be negative interactions among features. These weaknesses can degrade the performance of the models. Feature selection is a task that searches for a small subset of relevant features from the original set that generate the most efficient models possible. In addition to improving the efficiency of the models, feature selection confers other advantages, such as greater ease in the generation of the necessary data as well as clearer and more interpretable models. In the case of medical applications, feature selection may help to distinguish which characteristics, habits, and factors have the greatest impact on the onset of diseases. However, feature selection is a complex task due to the large number of possible solutions. In the last few years, methods based on different metaheuristic strategies, mainly evolutionary algorithms, have been proposed. The motivation of this work is to develop a method that outperforms previous methods, with the benefits that this implies especially in the medical field. More precisely, the present study proposes a simple method based on tabu search and multistart techniques. The proposed method was analyzed and compared to other methods by testing their performance on several medical databases. Specifically, eight databases belong to the well-known repository of the University of California in Irvine and one of our own design were used. In these computational tests, the proposed method outperformed other recent methods as gauged by various metrics and classifiers. The analyses were accompanied by statistical tests, the results of which showed that the superiority of our method is significant and therefore strengthened these conclusions. In short, the contribution of this work is the development of a method that, on the one hand, is based on different strategies than those used in recent methods, and on the other hand, improves the performance of these methods.

Collapse

Lee H, Lee Y, Jo M, Nam S, Jo J, Lee C. Enhancing Diagnosis of Rotating Elements in Roll-to-Roll Manufacturing Systems through Feature Selection Approach Considering Overlapping Data Density and Distance Analysis. SENSORS (BASEL, SWITZERLAND) 2023;23:7857. [PMID: 37765913 PMCID: PMC10534779 DOI: 10.3390/s23187857] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Revised: 09/01/2023] [Accepted: 09/11/2023] [Indexed: 09/29/2023]

Guo B, Liu H, Niu L. Integration of natural and deep artificial cognitive models in medical images: BERT-based NER and relation extraction for electronic medical records. Front Neurosci 2023;17:1266771. [PMID: 37732304 PMCID: PMC10507183 DOI: 10.3389/fnins.2023.1266771] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2023] [Accepted: 08/14/2023] [Indexed: 09/22/2023] Open

Mahmoud AY, Neagu D, Scrimieri D, Abdullatif ARA. Early diagnosis and personalised treatment focusing on synthetic data modelling: Novel visual learning approach in healthcare. Comput Biol Med 2023;164:107295. [PMID: 37557053 DOI: 10.1016/j.compbiomed.2023.107295] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2023] [Revised: 07/26/2023] [Accepted: 07/28/2023] [Indexed: 08/11/2023]

Abstract

The early diagnosis and personalised treatment of diseases are facilitated by machine learning. The quality of data has an impact on diagnosis because medical data are usually sparse, imbalanced, and contain irrelevant attributes, resulting in suboptimal diagnosis. To address the impacts of data challenges, improve resource allocation, and achieve better health outcomes, a novel visual learning approach is proposed. This study contributes to the visual learning approach by determining whether less or more synthetic data are required to improve the quality of a dataset, such as the number of observations and features, according to the intended personalised treatment and early diagnosis. In addition, numerous visualisation experiments are conducted, including using statistical characteristics, cumulative sums, histograms, correlation matrix, root mean square error, and principal component analysis in order to visualise both original and synthetic data to address the data challenges. Real medical datasets for cancer, heart disease, diabetes, cryotherapy and immunotherapy are selected as case studies. As a benchmark and point of classification comparison in terms of such as accuracy, sensitivity, and specificity, several models are implemented such as k-Nearest Neighbours and Random Forest. To simulate algorithm implementation and data, Generative Adversarial Network is used to create and manipulate synthetic data, whilst, Random Forest is implemented to classify the data. An amendable and adaptable system is constructed by combining Generative Adversarial Network and Random Forest models. The system model presents working steps, overview and flowchart. Experiments reveal that the majority of data-enhancement scenarios allow for the application of visual learning in the first stage of data analysis as a novel approach. To achieve meaningful adaptable synergy between appropriate quality data and optimal classification performance while maintaining statistical characteristics, visual learning provides researchers and practitioners with practical human-in-the-loop machine learning visualisation tools. Prior to implementing algorithms, the visual learning approach can be used to actualise early, and personalised diagnosis. For the immunotherapy data, the Random Forest performed best with precision, recall, f-measure, accuracy, sensitivity, and specificity of 81%, 82%, 81%, 88%, 95%, and 60%, as opposed to 91%, 96%, 93%, 93%, 96%, and 73% for synthetic data, respectively. Future studies might examine the optimal strategies to balance the quantity and quality of medical data.

Collapse

Munir N, McMorrow R, Mulrennan K, Whitaker D, McLoone S, Kellomäki M, Talvitie E, Lyyra I, McAfee M. Interpretable Machine Learning Methods for Monitoring Polymer Degradation in Extrusion of Polylactic Acid. Polymers (Basel) 2023;15:3566. [PMID: 37688192 PMCID: PMC10489772 DOI: 10.3390/polym15173566] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2023] [Revised: 08/17/2023] [Accepted: 08/24/2023] [Indexed: 09/10/2023] Open

Abstract

This work investigates real-time monitoring of extrusion-induced degradation in different grades of PLA across a range of process conditions and machine set-ups. Data on machine settings together with in-process sensor data, including temperature, pressure, and near-infrared (NIR) spectra, are used as inputs to predict the molecular weight and mechanical properties of the product. Many soft sensor approaches based on complex spectral data are essentially 'black-box' in nature, which can limit industrial acceptability. Hence, the focus here is on identifying an optimal approach to developing interpretable models while achieving high predictive accuracy and robustness across different process settings. The performance of a Recursive Feature Elimination (RFE) approach was compared to more common dimension reduction and regression approaches including Partial Least Squares (PLS), iterative PLS (i-PLS), Principal Component Regression (PCR), ridge regression, Least Absolute Shrinkage and Selection Operator (LASSO), and Random Forest (RF). It is shown that for medical-grade PLA processed under moisture-controlled conditions, accurate prediction of molecular weight is possible over a wide range of process conditions and different machine settings (different nozzle types for downstream fibre spinning) with an RFE-RF algorithm. Similarly, for the prediction of yield stress, RFE-RF achieved excellent predictive performance, outperforming the other approaches in terms of simplicity, interpretability, and accuracy. The features selected by the RFE model provide important insights to the process. It was found that change in molecular weight was not an important factor affecting the mechanical properties of the PLA, which is primarily related to the pressure and temperature at the latter stages of the extrusion process. The temperature at the extruder exit was also the most important predictor of degradation of the polymer molecular weight, highlighting the importance of accurate melt temperature control in the process. RFE not only outperforms more established methods as a soft sensor method, but also has significant advantages in terms of computational efficiency, simplicity, and interpretability. RFE-based soft sensors are promising for better quality control in processing thermally sensitive polymers such as PLA, in particular demonstrating for the first time the ability to monitor molecular weight degradation during processing across various machine settings.

Collapse

Affiliation(s)

Nimra Munir Centre for Mathematical Modelling and Intelligent Systems for Health and Environment (MISHE), Atlantic Technological University, ATU Sligo, Ash Lane, F91 YW50 Sligo, Ireland; Centre for Precision Engineering, Materials and Manufacturing (PEM Centre), Atlantic Technological University, ATU Sligo, Ash Lane, F91 YW50 Sligo, Ireland
Ross McMorrow Department of Mechatronic Engineering, Atlantic Technological University, ATU Sligo, Ash Lane, F91 YW50 Sligo, Ireland;
Konrad Mulrennan Centre for Mathematical Modelling and Intelligent Systems for Health and Environment (MISHE), Atlantic Technological University, ATU Sligo, Ash Lane, F91 YW50 Sligo, Ireland; Centre for Precision Engineering, Materials and Manufacturing (PEM Centre), Atlantic Technological University, ATU Sligo, Ash Lane, F91 YW50 Sligo, Ireland
Darren Whitaker Perceptive Engineering-An Applied Materials Company, Keckwick Lane, Daresbury WA4 4AB, UK;
Seán McLoone Centre for Intelligent Autonomous Manufacturing Systems, Queen’s University Belfast, Belfast BT7 1NN, UK;
Minna Kellomäki Biomaterials and Tissue Engineering Group, Faculty of Medicine and Health Technology, BioMediTech, Tampere University, 33720 Tampere, Finland; (M.K.); (E.T.); (I.L.)
Elina Talvitie Biomaterials and Tissue Engineering Group, Faculty of Medicine and Health Technology, BioMediTech, Tampere University, 33720 Tampere, Finland; (M.K.); (E.T.); (I.L.)
Inari Lyyra Biomaterials and Tissue Engineering Group, Faculty of Medicine and Health Technology, BioMediTech, Tampere University, 33720 Tampere, Finland; (M.K.); (E.T.); (I.L.)
Marion McAfee Centre for Mathematical Modelling and Intelligent Systems for Health and Environment (MISHE), Atlantic Technological University, ATU Sligo, Ash Lane, F91 YW50 Sligo, Ireland; Centre for Precision Engineering, Materials and Manufacturing (PEM Centre), Atlantic Technological University, ATU Sligo, Ash Lane, F91 YW50 Sligo, Ireland

Collapse