1
|
Pan Z, Wang H, Wan J, Zhang L, Huang J, Shen Y. Efficient federated learning for pediatric pneumonia on chest X-ray classification. Sci Rep 2024; 14:23272. [PMID: 39375440 PMCID: PMC11458595 DOI: 10.1038/s41598-024-74491-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2024] [Accepted: 09/26/2024] [Indexed: 10/09/2024] Open
Abstract
According to the World Health Organization (WHO), pneumonia kills about 2 million children under the age of 5 every year. Traditional machine learning methods can be used to diagnose chest X-rays of pneumonia in children, but there is a privacy and security issue in centralizing the data for training. Federated learning prevents data privacy leakage by sharing only the model and not the data, and it has a wide range of application in the medical field. We use federated learning method for classification, which effectively protects data security. And for the data heterogeneity phenomenon existing in the actual scenario, which will seriously affect the classification effect, we propose a method based on two-end control variables. Specifically, based on the classical federated learning FedAvg algorithm, we modify the loss function on the client side by adding a regular term or a penalty term, and add momentum after the average aggregation on the server side. The federated learning approach prevents the data privacy leakage problem compared to the traditional machine learning approach. In order to solve the problem of low classification accuracy due to data heterogeneity, our proposed method based on two-end control variables achieves an average improvement of 2% and an accuracy of 98% on average, and 99% individually, compared to the previous federated learning algorithms and the latest diffusion model-based method. The classification results and methodology of this study can be utilized by clinicians worldwide to improve the overall detection of pediatric pneumonia.
Collapse
Affiliation(s)
- Zegang Pan
- School of Information and Electronic Engineering, Zhejiang University of Science and Technology, Hangzhou, 310023, Zhejiang, China
| | - Haijiang Wang
- School of Information and Electronic Engineering, Zhejiang University of Science and Technology, Hangzhou, 310023, Zhejiang, China.
- Zhejiang Key Laboratory of Biomedical Intelligent Computing Technology, Hangzhou, China.
| | - Jian Wan
- Zhejiang Institute of Water Resources and Hydropower, Hangzhou, 310018, Zhejiang, China
- Zhejiang Key Laboratory of Biomedical Intelligent Computing Technology, Hangzhou, China
| | - Lei Zhang
- School of Information and Electronic Engineering, Zhejiang University of Science and Technology, Hangzhou, 310023, Zhejiang, China
- Zhejiang Key Laboratory of Biomedical Intelligent Computing Technology, Hangzhou, China
| | - Jie Huang
- School of Information and Electronic Engineering, Zhejiang University of Science and Technology, Hangzhou, 310023, Zhejiang, China
- Zhejiang Key Laboratory of Biomedical Intelligent Computing Technology, Hangzhou, China
| | - Yangyu Shen
- School of Information and Electronic Engineering, Zhejiang University of Science and Technology, Hangzhou, 310023, Zhejiang, China
| |
Collapse
|
2
|
Jin Z, Liu YH. Metabolic-related gene signatures for survival prediction and immune cell subtypes associated with prognosis in intrahepatic cholangiocarcinoma. Cancer Genet 2023; 274-275:84-93. [PMID: 37099969 DOI: 10.1016/j.cancergen.2023.04.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2023] [Revised: 03/26/2023] [Accepted: 04/11/2023] [Indexed: 04/28/2023]
Abstract
OBJECTIVES Our study aimed to reveal the metabolic-related gene signatures for survival prediction and immune cell subtypes associated with IHCC prognosis. METHODS Differentially expressed metabolic genes were identified between survival group and dead group which were divided according to survival at discharge. Recursive feature elimination (RFE) and randomForest (RF) algorithms were applied to optimize the combination of feature metabolic genes, which were used to generate SVM classifier. Performance of SVM classifier was evaluated by receiver operating characteristic (ROC) curves. Gene set enrichment analysis (GSEA) was conducted to uncover the activated pathways in high risk group, and differences in immune cell distributions were revealed. RESULTS There were 143 differentially expressed metabolic gens. RFE and RF identified 21 overlapping differentially expressed metabolic genes, and the constructed SVM classifier had excellent accuracy in training and validation dataset. RS survival prediction model was consisted of 10 metabolic genes. RS model had reliable predictive capability in the training and validation dataset. GSEA revealed 15 significant KEGG pathways that were relatively activated in the high risk group. High risk group had obviously lower counts of B cell naive and T cell CD4+ memory resting, while higher counts of B cell plasma and macrophage M2. CONCLUSION Prognostic prediction model of 10 metabolic genes could accurately predict the prognosis of IHCC patients.
Collapse
Affiliation(s)
- Zhe Jin
- Department of Hepatobiliary and Pancreatic Surgery, The First Hospital of Jilin University, No. 1, Xinmin Street, Chaoyang District, Changchun, Jilin 130021, China
| | - Ya-Hui Liu
- Department of Hepatobiliary and Pancreatic Surgery, The First Hospital of Jilin University, No. 1, Xinmin Street, Chaoyang District, Changchun, Jilin 130021, China.
| |
Collapse
|
3
|
Dutta A, Hasan MK, Ahmad M, Awal MA, Islam MA, Masud M, Meshref H. Early Prediction of Diabetes Using an Ensemble of Machine Learning Models. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2022; 19:ijerph191912378. [PMID: 36231678 PMCID: PMC9566114 DOI: 10.3390/ijerph191912378] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/28/2022] [Revised: 09/20/2022] [Accepted: 09/24/2022] [Indexed: 05/15/2023]
Abstract
Diabetes is one of the most rapidly spreading diseases in the world, resulting in an array of significant complications, including cardiovascular disease, kidney failure, diabetic retinopathy, and neuropathy, among others, which contribute to an increase in morbidity and mortality rate. If diabetes is diagnosed at an early stage, its severity and underlying risk factors can be significantly reduced. However, there is a shortage of labeled data and the occurrence of outliers or data missingness in clinical datasets that are reliable and effective for diabetes prediction, making it a challenging endeavor. Therefore, we introduce a newly labeled diabetes dataset from a South Asian nation (Bangladesh). In addition, we suggest an automated classification pipeline that includes a weighted ensemble of machine learning (ML) classifiers: Naive Bayes (NB), Random Forest (RF), Decision Tree (DT), XGBoost (XGB), and LightGBM (LGB). Grid search hyperparameter optimization is employed to tune the critical hyperparameters of these ML models. Furthermore, missing value imputation, feature selection, and K-fold cross-validation are included in the framework design. A statistical analysis of variance (ANOVA) test reveals that the performance of diabetes prediction significantly improves when the proposed weighted ensemble (DT + RF + XGB + LGB) is executed with the introduced preprocessing, with the highest accuracy of 0.735 and an area under the ROC curve (AUC) of 0.832. In conjunction with the suggested ensemble model, our statistical imputation and RF-based feature selection techniques produced the best results for early diabetes prediction. Moreover, the presented new dataset will contribute to developing and implementing robust ML models for diabetes prediction utilizing population-level data.
Collapse
Affiliation(s)
- Aishwariya Dutta
- Department of Biomedical Engineering (BME), Khulna University of Engineering & Technology (KUET), Khulna 9203, Bangladesh
- Department of Biomedical Engineering (BME), Military Institute of Science and Technology (MIST), Mirpur Cantonment, Dhaka 1216, Bangladesh
| | - Md. Kamrul Hasan
- Department of Electrical and Electronic Engineering (EEE), Khulna University of Engineering & Technology (KUET), Khulna 9203, Bangladesh
| | - Mohiuddin Ahmad
- Department of Electrical and Electronic Engineering (EEE), Khulna University of Engineering & Technology (KUET), Khulna 9203, Bangladesh
| | - Md. Abdul Awal
- School of Information Technology and Electrical Engineering, The University of Queensland, Brisbane, QLD 4072, Australia
- Electronics and Communication Engineering (ECE) Discipline, Khulna University (KU), Khulna 9208, Bangladesh
- Correspondence:
| | | | - Mehedi Masud
- Department of Computer Science, College of Computers and Information Technology, Taif University, P.O. Box 11099, Taif 21944, Saudi Arabia
| | - Hossam Meshref
- Department of Computer Science, College of Computers and Information Technology, Taif University, P.O. Box 11099, Taif 21944, Saudi Arabia
| |
Collapse
|
4
|
Padash S, Mohebbian MR, Adams SJ, Henderson RDE, Babyn P. Pediatric chest radiograph interpretation: how far has artificial intelligence come? A systematic literature review. Pediatr Radiol 2022; 52:1568-1580. [PMID: 35460035 PMCID: PMC9033522 DOI: 10.1007/s00247-022-05368-w] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/11/2021] [Revised: 02/28/2022] [Accepted: 03/24/2022] [Indexed: 10/24/2022]
Abstract
Most artificial intelligence (AI) studies have focused primarily on adult imaging, with less attention to the unique aspects of pediatric imaging. The objectives of this study were to (1) identify all publicly available pediatric datasets and determine their potential utility and limitations for pediatric AI studies and (2) systematically review the literature to assess the current state of AI in pediatric chest radiograph interpretation. We searched PubMed, Web of Science and Embase to retrieve all studies from 1990 to 2021 that assessed AI for pediatric chest radiograph interpretation and abstracted the datasets used to train and test AI algorithms, approaches and performance metrics. Of 29 publicly available chest radiograph datasets, 2 datasets included solely pediatric chest radiographs, and 7 datasets included pediatric and adult patients. We identified 55 articles that implemented an AI model to interpret pediatric chest radiographs or pediatric and adult chest radiographs. Classification of chest radiographs as pneumonia was the most common application of AI, evaluated in 65% of the studies. Although many studies report high diagnostic accuracy, most algorithms were not validated on external datasets. Most AI studies for pediatric chest radiograph interpretation have focused on a limited number of diseases, and progress is hindered by a lack of large-scale pediatric chest radiograph datasets.
Collapse
Affiliation(s)
- Sirwa Padash
- Department of Medical Imaging, University of Saskatchewan, 103 Hospital Drive, Saskatoon, Saskatchewan, S7N 0W8, Canada.
- Department of Radiology, Mayo Clinic, Rochester, MN, USA.
| | - Mohammad Reza Mohebbian
- Department of Electrical and Computer Engineering, University of Saskatchewan, Saskatoon, Saskatchewan, Canada
| | - Scott J Adams
- Department of Medical Imaging, University of Saskatchewan, 103 Hospital Drive, Saskatoon, Saskatchewan, S7N 0W8, Canada
| | - Robert D E Henderson
- Department of Medical Imaging, University of Saskatchewan, 103 Hospital Drive, Saskatoon, Saskatchewan, S7N 0W8, Canada
| | - Paul Babyn
- Department of Medical Imaging, University of Saskatchewan, 103 Hospital Drive, Saskatoon, Saskatchewan, S7N 0W8, Canada
| |
Collapse
|
5
|
Pneumonia Classification from X-ray Images with Inception-V3 and Convolutional Neural Network. Diagnostics (Basel) 2022; 12:diagnostics12051280. [PMID: 35626436 PMCID: PMC9140837 DOI: 10.3390/diagnostics12051280] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2022] [Revised: 05/16/2022] [Accepted: 05/16/2022] [Indexed: 02/04/2023] Open
Abstract
Pneumonia is one of the leading causes of death in both infants and elderly people, with approximately 4 million deaths each year. It may be a virus, bacterial, or fungal, depending on the contagious pathogen that damages the lung’s tiny air sacs (alveoli). Patients with underlying disorders such as asthma, a weakened immune system, hospitalized babies, and older persons on ventilators are all at risk, particularly if pneumonia is not detected early. Despite the existing approaches for its diagnosis, low accuracy and efficiency require further research for more accurate systems. This study is a similar endeavor for the detection of pneumonia by the use of X-ray images. The dataset is preprocessed to make it suitable for transfer learning tasks. Different pre-trained convolutional neural network (CNN) variants are utilized, including VGG16, Inception-v3, and ResNet50. Ensembles are made by incorporating CNN with Inception-V3, VGG-16, and ResNet50. Besides the common evaluation metrics, the performance of the pre-trained and ensemble deep learning models is measured with Cohen’s kappa as well as the area under the curve (AUC). Experimental results show that Inception-V3 with CNN attained the highest accuracy and recall score of 99.29% and 99.73%, respectively.
Collapse
|
6
|
Feng Y, Yang X, Qiu D, Zhang H, Wei D, Liu J. PCXRNet: Pneumonia diagnosis from Chest X-Ray Images using Condense attention block and Multiconvolution attention block. IEEE J Biomed Health Inform 2022; 26:1484-1495. [PMID: 35120015 DOI: 10.1109/jbhi.2022.3148317] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Coronavirus disease 2019 (COVID-19) has become a global pandemic. Many recognition approaches based on convolutional neural networks have been proposed for COVID-19 chest X-ray images. However, only a few of them make good use of the potential inter- and intra-relationships of feature maps. Considering the limitation mentioned above, this paper proposes an attention-based convolutional neural network, called PCXRNet, for diagnosis of pneumonia using chest X-ray images. To utilize the information from the channels of the feature maps, we added a novel condense attention module (CDSE) that comprised of two steps: condensation step and squeeze-excitation step. Unlike traditional channel attention modules, CDSE first downsamples the feature map channel by channel to condense the information, followed by the squeeze-excitation step, in which the channel weights are calculated. To make the model pay more attention to informative spatial parts in every feature map, we proposed a multi-convolution spatial attention module (MCSA). It reduces the number of parameters and introduces more nonlinearity. The CDSE and MCSA complement each other in series to tackle the problem of redundancy in feature maps and provide useful information from and between feature maps. We used the ChestXRay2017 dataset to explore the internal structure of PCXRNet, and the proposed network was applied to COVID-19 diagnosis. Additional experiments were conducted on a tuberculosis dataset to verify the effectiveness of PCXRNet. As a result, the network achieves an accuracy of 94.619%, recall of 94.753%, precision of 95.286%, and F1-score of 94.996% on the COVID-19 dataset.
Collapse
|
7
|
Chattopadhyay S, Kundu R, Singh PK, Mirjalili S, Sarkar R. Pneumonia detection from lung X‐ray images using local search aided sine cosine algorithm based deep feature selection method. INT J INTELL SYST 2021. [DOI: 10.1002/int.22703] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Affiliation(s)
| | - Rohit Kundu
- Department of Electrical Engineering Jadavpur University Kolkata India
| | - Pawan Kumar Singh
- Department of Information Technology Jadavpur University Kolkata India
| | - Seyedali Mirjalili
- Centre for Artificial Intelligence Research and Optimization Torrens University Fortitude Valley Queensland Australia
- Yonser Frontier Lab Yonsei University Seoul Korea
| | - Ram Sarkar
- Department of Computer Science and Engineering Jadavpur University Kolkata India
| |
Collapse
|
8
|
Stokes K, Castaldo R, Franzese M, Salvatore M, Fico G, Pokvic LG, Badnjevic A, Pecchia L. A machine learning model for supporting symptom-based referral and diagnosis of bronchitis and pneumonia in limited resource settings. Biocybern Biomed Eng 2021. [DOI: 10.1016/j.bbe.2021.09.002] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
|
9
|
Noor ST, Asad ST, Khan MM, Gaba GS, Al-Amri JF, Masud M. Predicting the Risk of Depression Based on ECG Using RNN. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2021; 2021:1299870. [PMID: 34367269 PMCID: PMC8342171 DOI: 10.1155/2021/1299870] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/17/2021] [Revised: 07/15/2021] [Accepted: 07/21/2021] [Indexed: 11/18/2022]
Abstract
This paper presents a model to predict the risk of depression based on electrocardiogram (ECG). This proposed model uses a Recurrent Neural Network (RNN) and Long Short-Term Memory (LSTM) autoencoder to predict normal, abnormal, and PVC heartbeats. The RNN model is a deep learning-based model to classify normal, abnormal, and PVC heartbeats. We used the model as a classifier. The model uses a heart rates dataset to predict abnormal and PVC heartbeats. As for the dataset, we have used 5000 ECG samples. The model was trained on a training dataset and validation dataset. After that, it was tested on a test dataset. The model is trained on normal heartbeat rates, so the model can predict any heartbeat rates other than normal. Our contribution here is to build a model that can differentiate between "normal," "abnormal," and "risky" heartbeats. Our model predicts "normal" heartbeats with 97.24% accuracy and can predict "PVC" heartbeats with 100% accuracy. Other than the accuracy, we evaluated our model on the training loss graphs. These two types of training loss graphs were evaluated as "normal" versus "risky" and "abnormal" versus "risky." We have seen great results there as well. The best losses for "normal," "abnormal," and "risky" are 5.71, 33.36, and 34.78. However, these results may improve if a larger dataset is used. In studies, it was found that patients suffering from depression may have a different kind of heartbeat than the normal ones. In most cases, it is PVC (Premature Ventricular Contraction) heartbeats. Therefore, the target is to predict abnormal heartbeats and PVC heartbeats.
Collapse
Affiliation(s)
- Sumaiya Tarannum Noor
- Department of Electrical and Computer Engineering, North South University, Bashundhara, Dhaka 1229, Bangladesh
| | - Syeda Tasmiah Asad
- Department of Electrical and Computer Engineering, North South University, Bashundhara, Dhaka 1229, Bangladesh
| | - Mohammad Monirujjaman Khan
- Department of Electrical and Computer Engineering, North South University, Bashundhara, Dhaka 1229, Bangladesh
| | - Gurjot Singh Gaba
- School of Electronics and Electrical Engineering, Lovely Professional University, Phagwara, India
| | - Jehad F. Al-Amri
- Department of Information Technology, College of Computers and Information Technology, Taif University, P. O. Box 11099, Taif 21944, Saudi Arabia
| | - Mehedi Masud
- Department of Computer Science, College of Computers and Information Technology, Taif University, P. O. Box 11099, Taif 21944, Saudi Arabia
| |
Collapse
|
10
|
A Comparative Analysis of Machine Learning Algorithms to Predict Alzheimer's Disease. JOURNAL OF HEALTHCARE ENGINEERING 2021; 2021:9917919. [PMID: 34336171 PMCID: PMC8289609 DOI: 10.1155/2021/9917919] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/17/2021] [Revised: 05/29/2021] [Accepted: 06/23/2021] [Indexed: 02/06/2023]
Abstract
Alzheimer's disease has been one of the major concerns recently. Around 45 million people are suffering from this disease. Alzheimer's is a degenerative brain disease with an unspecified cause and pathogenesis which primarily affects older people. The main cause of Alzheimer's disease is Dementia, which progressively damages the brain cells. People lost their thinking ability, reading ability, and many more from this disease. A machine learning system can reduce this problem by predicting the disease. The main aim is to recognize Dementia among various patients. This paper represents the result and analysis regarding detecting Dementia from various machine learning models. The Open Access Series of Imaging Studies (OASIS) dataset has been used for the development of the system. The dataset is small, but it has some significant values. The dataset has been analyzed and applied in several machine learning models. Support vector machine, logistic regression, decision tree, and random forest have been used for prediction. First, the system has been run without fine-tuning and then with fine-tuning. Comparing the results, it is found that the support vector machine provides the best results among the models. It has the best accuracy in detecting Dementia among numerous patients. The system is simple and can easily help people by detecting Dementia among them.
Collapse
|
11
|
Severity Classification of Diabetic Retinopathy Using an Ensemble Learning Algorithm through Analyzing Retinal Images. Symmetry (Basel) 2021. [DOI: 10.3390/sym13040670] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023] Open
Abstract
Diabetic Retinopathy (DR) refers to the damages endured by the retina as an effect of diabetes. DR has become a severe health concern worldwide, as the number of diabetes patients is soaring uncountably. Periodic eye examination allows doctors to detect DR in patients at an early stage to initiate proper treatments. Advancements in artificial intelligence and camera technology have allowed us to automate the diagnosis of DR, which can benefit millions of patients indeed. This paper inscribes a novel method for DR diagnosis based on the gray-level intensity and texture features extracted from fundus images using a decision tree-based ensemble learning technique. This study primarily works with the Asia Pacific Tele-Ophthalmology Society 2019 Blindness Detection (APTOS 2019 BD) dataset. We undertook several steps to curate its contents to make them more suitable for machine learning applications. Our approach incorporates several image processing techniques, two feature extraction techniques, and one feature selection technique, which results in a classification accuracy of 94.20% (margin of error: ±0.32%) and an F-measure of 93.51% (margin of error: ±0.5%). Several other parameters regarding the proposed method’s performance have been presented to manifest its robustness and reliability. Details on each employed technique have been included to make the provided results reproducible. This method can be a valuable tool for mass retinal screening to detect DR, thus drastically reducing the rate of vision loss attributed to it.
Collapse
|