1
|
Zhou Y, Zhang Z, Li Q, Mao G, Zhou Z. Construction and validation of machine learning algorithm for predicting depression among home-quarantined individuals during the large-scale COVID-19 outbreak: based on Adaboost model. BMC Psychol 2024; 12:230. [PMID: 38659077 PMCID: PMC11044386 DOI: 10.1186/s40359-024-01696-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2024] [Accepted: 03/29/2024] [Indexed: 04/26/2024] Open
Abstract
OBJECTIVES COVID-19 epidemics often lead to elevated levels of depression. To accurately identify and predict depression levels in home-quarantined individuals during a COVID-19 epidemic, this study constructed a depression prediction model based on multiple machine learning algorithms and validated its effectiveness. METHODS A cross-sectional method was used to examine the depression status of individuals quarantined at home during the epidemic via the network. Characteristics included variables on sociodemographics, COVID-19 and its prevention and control measures, impact on life, work, health and economy after the city was sealed off, and PHQ-9 scale scores. The home-quarantined subjects were randomly divided into training set and validation set according to the ratio of 7:3, and the performance of different machine learning models were compared by 10-fold cross-validation, and the model algorithm with the best performance was selected from 15 models to construct and validate the depression prediction model for home-quarantined subjects. The validity of different models was compared based on accuracy, precision, receiver operating characteristic (ROC) curve, and area under the ROC curve (AUC), and the best model suitable for the data framework of this study was identified. RESULTS The prevalence of depression among home-quarantined individuals during the epidemic was 31.66% (202/638), and the constructed Adaboost depression prediction model had an ACC of 0.7917, an accuracy of 0.7180, and an AUC of 0.7803, which was better than the other 15 models on the combination of various performance measures. In the validation sets, the AUC was greater than 0.83. CONCLUSIONS The Adaboost machine learning algorithm developed in this study can be used to construct a depression prediction model for home-quarantined individuals that has better machine learning performance, as well as high effectiveness, robustness, and generalizability.
Collapse
Affiliation(s)
- Yiwei Zhou
- Business School, University of Shanghai for Science and Technology, 200093, Shanghai, China
- School of Intelligent Emergency Management, University of Shanghai for Science and Technology, 200093, Shanghai, China
- Smart Urban Mobility Institute, University of Shanghai for Science and Technology, 200093, Shanghai, China
| | - Zejie Zhang
- Wenzhou Center for Disease Control and Prevention, 325000, Wenzhou, China
| | - Qin Li
- The Affiliated Kangning Hospital of Wenzhou Medical University Zhejiang Provincial Clinical Research Center for Mental Disorders, 325007, Wenzhou, China
| | - Guangyun Mao
- Department of Preventive Medicine, School of Public Health, Wenzhou Medical University, 325035, Wenzhou, China
| | - Zumu Zhou
- The Affiliated Kangning Hospital of Wenzhou Medical University Zhejiang Provincial Clinical Research Center for Mental Disorders, 325007, Wenzhou, China.
| |
Collapse
|
2
|
Talari P, N B, Kaur G, Alshahrani H, Al Reshan MS, Sulaiman A, Shaikh A. Hybrid feature selection and classification technique for early prediction and severity of diabetes type 2. PLoS One 2024; 19:e0292100. [PMID: 38236900 PMCID: PMC10796060 DOI: 10.1371/journal.pone.0292100] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2023] [Accepted: 09/12/2023] [Indexed: 01/22/2024] Open
Abstract
Diabetes prediction is an ongoing study topic in which medical specialists are attempting to forecast the condition with greater precision. Diabetes typically stays lethargic, and on the off chance that patients are determined to have another illness, like harm to the kidney vessels, issues with the retina of the eye, or a heart issue, it can cause metabolic problems and various complexities in the body. Various worldwide learning procedures, including casting a ballot, supporting, and sacking, have been applied in this review. The Engineered Minority Oversampling Procedure (Destroyed), along with the K-overlay cross-approval approach, was utilized to achieve class evening out and approve the discoveries. Pima Indian Diabetes (PID) dataset is accumulated from the UCI Machine Learning (UCI ML) store for this review, and this dataset was picked. A highlighted engineering technique was used to calculate the influence of lifestyle factors. A two-phase classification model has been developed to predict insulin resistance using the Sequential Minimal Optimisation (SMO) and SMOTE approaches together. The SMOTE technique is used to preprocess data in the model's first phase, while SMO classes are used in the second phase. All other categorization techniques were outperformed by bagging decision trees in terms of Misclassification Error rate, Accuracy, Specificity, Precision, Recall, F1 measures, and ROC curve. The model was created using a combined SMOTE and SMO strategy, which achieved 99.07% correction with 0.1 ms of runtime. The suggested system's result is to enhance the classifier's performance in spotting illness early.
Collapse
Affiliation(s)
- Praveen Talari
- Department of Computer Science and Engineering, Vignana Bharathi Institute of Technology, Hyderabad, India
| | - Bharathiraja N
- Chitkara University Institute of Engineering and Technology, Chitkara University Punjab, Rajpura, India
| | - Gaganpreet Kaur
- Chitkara University Institute of Engineering and Technology, Chitkara University Punjab, Rajpura, India
| | - Hani Alshahrani
- Department of Computer Science, College of Computer Science and Information Systems, Najran University, Najran, Saudi Arabia
| | - Mana Saleh Al Reshan
- Department of Information Systems, College of Computer Science and Information Systems, Najran University, Najran, Saudi Arabia
- Scientific and Engineering Research Centre, Najran University, Najran, Saudi Arabia
| | - Adel Sulaiman
- Department of Computer Science, College of Computer Science and Information Systems, Najran University, Najran, Saudi Arabia
| | - Asadullah Shaikh
- Department of Information Systems, College of Computer Science and Information Systems, Najran University, Najran, Saudi Arabia
| |
Collapse
|
3
|
Kuang A, Kouznetsova VL, Kesari S, Tsigelny IF. Diagnostics of Thyroid Cancer Using Machine Learning and Metabolomics. Metabolites 2023; 14:11. [PMID: 38248814 PMCID: PMC10818630 DOI: 10.3390/metabo14010011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2023] [Revised: 12/14/2023] [Accepted: 12/18/2023] [Indexed: 01/23/2024] Open
Abstract
The objective of this research is, with the analysis of existing data of thyroid cancer (TC) metabolites, to develop a machine-learning model that can diagnose TC using metabolite biomarkers. Through data mining, pathway analysis, and machine learning (ML), the model was developed. We identified seven metabolic pathways related to TC: Pyrimidine metabolism, Tyrosine metabolism, Glycine, serine, and threonine metabolism, Pantothenate and CoA biosynthesis, Arginine biosynthesis, Phenylalanine metabolism, and Phenylalanine, tyrosine, and tryptophan biosynthesis. The ML classifications' accuracies were confirmed through 10-fold cross validation, and the most accurate classification was 87.30%. The metabolic pathways identified in relation to TC and the changes within such pathways can contribute to more pattern recognition for diagnostics of TC patients and assistance with TC screening. With independent testing, the model's accuracy for other unique TC metabolites was 92.31%. The results also point to a possibility for the development of using ML methods for TC diagnostics and further applications of ML in general cancer-related metabolite analysis.
Collapse
Affiliation(s)
- Alyssa Kuang
- Haas Business School, University of California at Berkeley, Berkeley, CA 94720, USA;
| | - Valentina L. Kouznetsova
- San Diego Supercomputer Center, University of California at San Diego, La Jolla, CA 92093, USA;
- BiAna, La Jolla, CA 92038, USA
- CureScience Institute, San Diego, CA 92121, USA
| | - Santosh Kesari
- Pacific Neuroscience Institute, Santa Monica, CA 90404, USA;
| | - Igor F. Tsigelny
- San Diego Supercomputer Center, University of California at San Diego, La Jolla, CA 92093, USA;
- BiAna, La Jolla, CA 92038, USA
- CureScience Institute, San Diego, CA 92121, USA
- Department of Neurosciences, University of California at San Diego, La Jolla, CA 92093, USA
| |
Collapse
|
4
|
Abnoosian K, Farnoosh R, Behzadi MH. Prediction of diabetes disease using an ensemble of machine learning multi-classifier models. BMC Bioinformatics 2023; 24:337. [PMID: 37697283 PMCID: PMC10496262 DOI: 10.1186/s12859-023-05465-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2023] [Accepted: 09/04/2023] [Indexed: 09/13/2023] Open
Abstract
BACKGROUND AND OBJECTIVE Diabetes is a life-threatening chronic disease with a growing global prevalence, necessitating early diagnosis and treatment to prevent severe complications. Machine learning has emerged as a promising approach for diabetes diagnosis, but challenges such as limited labeled data, frequent missing values, and dataset imbalance hinder the development of accurate prediction models. Therefore, a novel framework is required to address these challenges and improve performance. METHODS In this study, we propose an innovative pipeline-based multi-classification framework to predict diabetes in three classes: diabetic, non-diabetic, and prediabetes, using the imbalanced Iraqi Patient Dataset of Diabetes. Our framework incorporates various pre-processing techniques, including duplicate sample removal, attribute conversion, missing value imputation, data normalization and standardization, feature selection, and k-fold cross-validation. Furthermore, we implement multiple machine learning models, such as k-NN, SVM, DT, RF, AdaBoost, and GNB, and introduce a weighted ensemble approach based on the Area Under the Receiver Operating Characteristic Curve (AUC) to address dataset imbalance. Performance optimization is achieved through grid search and Bayesian optimization for hyper-parameter tuning. RESULTS Our proposed model outperforms other machine learning models, including k-NN, SVM, DT, RF, AdaBoost, and GNB, in predicting diabetes. The model achieves high average accuracy, precision, recall, F1-score, and AUC values of 0.9887, 0.9861, 0.9792, 0.9851, and 0.999, respectively. CONCLUSION Our pipeline-based multi-classification framework demonstrates promising results in accurately predicting diabetes using an imbalanced dataset of Iraqi diabetic patients. The proposed framework addresses the challenges associated with limited labeled data, missing values, and dataset imbalance, leading to improved prediction performance. This study highlights the potential of machine learning techniques in diabetes diagnosis and management, and the proposed framework can serve as a valuable tool for accurate prediction and improved patient care. Further research can build upon our work to refine and optimize the framework and explore its applicability in diverse datasets and populations.
Collapse
Affiliation(s)
- Karlo Abnoosian
- Department of Statistics, Science and Research Branch, Islamic Azad University, Tehran, Iran
| | - Rahman Farnoosh
- School of Mathematics, Iran University of Science and Technology, Tehran, Iran.
| | - Mohammad Hassan Behzadi
- Department of Statistics, Science and Research Branch, Islamic Azad University, Tehran, Iran
| |
Collapse
|
5
|
Zhou H, Zhang PY, Zou X, Liu J, Wang WJ. Chronic disease diagnosis model based on convolutional neural network and ensemble learning method. Digit Health 2023; 9:20552076231198643. [PMID: 37667686 PMCID: PMC10475259 DOI: 10.1177/20552076231198643] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2022] [Accepted: 08/15/2023] [Indexed: 09/06/2023] Open
Abstract
Introduction Chronic diseases have become one of the main causes of premature death all around the world in recent years. The diagnosis of chronic diseases is time-consuming and costly. Therefore, timely diagnosis and prediction of chronic diseases are very necessary. Methods In this paper, a new method for chronic disease diagnosis is proposed by combining convolutional neural network (CNN) and ensemble learning. This method utilizes random forest (RF) as the base classifier to improve classification performance and diagnostic accuracy, and then combines AdaBoost to successfully replace the Softmax layer of CNN to generate multiple accurate base classifiers while determining their optimal attributes, achieving high-quality classification and prediction of chronic diseases. Results To verify the effectiveness of the proposed method, real-world Electronic Medical Records dataset (C-EMRs) was used for experimental analysis. The results show that compared with other traditional machine learning methods such as CNN, K-Nearest Neighbor, and RF, the proposed method can effectively improve the accuracy of diagnosis and reduce the occurrence of missed diagnosis and misdiagnosis. Conclusions This study will provide effective information for the diagnosis of chronic diseases, assist doctors in making clinical decisions, develop targeted intervention measures, and reduce the probability of misdiagnosis.
Collapse
Affiliation(s)
- Huan Zhou
- School of Business, Hunan University of Technology, Zhuzhou, Hunan, China
| | - Pei-Ying Zhang
- School of Business, Hunan University of Technology, Zhuzhou, Hunan, China
| | - Xiao Zou
- School of Business, Hunan University of Technology, Zhuzhou, Hunan, China
| | - Jia Liu
- School of Business, Hunan University of Technology, Zhuzhou, Hunan, China
| | - Wen-Jie Wang
- School of Business, Hunan University of Technology, Zhuzhou, Hunan, China
| |
Collapse
|
6
|
Zhang X, Gavaldà R, Baixeries J. Interpretable prediction of mortality in liver transplant recipients based on machine learning. Comput Biol Med 2022; 151:106188. [PMID: 36306583 DOI: 10.1016/j.compbiomed.2022.106188] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2022] [Revised: 09/24/2022] [Accepted: 10/08/2022] [Indexed: 12/27/2022]
Abstract
BACKGROUND Accurate prediction of the mortality of post-liver transplantation is an important but challenging task. It relates to optimizing organ allocation and estimating the risk of possible dysfunction. Existing risk scoring models, such as the Balance of Risk (BAR) score and the Survival Outcomes Following Liver Transplantation (SOFT) score, do not predict the mortality of post-liver transplantation with sufficient accuracy. In this study, we evaluate the performance of machine learning models and establish an explainable machine learning model for predicting mortality in liver transplant recipients. METHOD The optimal feature set for the prediction of the mortality was selected by a wrapper method based on binary particle swarm optimization (BPSO). With the selected optimal feature set, seven machine learning models were applied to predict mortality over different time windows. The best-performing model was used to predict mortality through a comprehensive comparison and evaluation. An interpretable approach based on machine learning and SHapley Additive exPlanations (SHAP) is used to explicitly explain the model's decision and make new discoveries. RESULTS With regard to predictive power, our results demonstrated that the feature set selected by BPSO outperformed both the feature set in the existing risk score model (BAR score, SOFT score) and the feature set processed by principal component analysis (PCA). The best-performing model, extreme gradient boosting (XGBoost), was found to improve the Area Under a Curve (AUC) values for mortality prediction by 6.7%, 11.6%, and 17.4% at 3 months, 3 years, and 10 years, respectively, compared to the SOFT score. The main predictors of mortality and their impact were discussed for different age groups and different follow-up periods. CONCLUSIONS Our analysis demonstrates that XGBoost can be an ideal method to assess the mortality risk in liver transplantation. In combination with the SHAP approach, the proposed framework provides a more intuitive and comprehensive interpretation of the predictive model, thereby allowing the clinician to better understand the decision-making process of the model and the impact of factors associated with mortality risk in liver transplantation.
Collapse
Affiliation(s)
- Xiao Zhang
- Department of Computer Science, Universitat Politècnica de Catalunya, Barcelona, 08034, Spain.
| | | | - Jaume Baixeries
- Department of Computer Science, Universitat Politècnica de Catalunya, Barcelona, 08034, Spain
| |
Collapse
|
7
|
Lee J, Wanyan T, Chen Q, Keenan TDL, Glicksberg BS, Chew EY, Lu Z, Wang F, Peng Y. Predicting Age-related Macular Degeneration Progression with Longitudinal Fundus Images Using Deep Learning. MACHINE LEARNING IN MEDICAL IMAGING. MLMI (WORKSHOP) 2022; 13583:11-20. [PMID: 36656604 PMCID: PMC9842432 DOI: 10.1007/978-3-031-21014-3_2] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
Accurately predicting a patient's risk of progressing to late age-related macular degeneration (AMD) is difficult but crucial for personalized medicine. While existing risk prediction models for progression to late AMD are useful for triaging patients, none utilizes longitudinal color fundus photographs (CFPs) in a patient's history to estimate the risk of late AMD in a given subsequent time interval. In this work, we seek to evaluate how deep neural networks capture the sequential information in longitudinal CFPs and improve the prediction of 2-year and 5-year risk of progression to late AMD. Specifically, we proposed two deep learning models, CNN-LSTM and CNN-Transformer, which use a Long-Short Term Memory (LSTM) and a Transformer, respectively with convolutional neural networks (CNN), to capture the sequential information in longitudinal CFPs. We evaluated our models in comparison to baselines on the Age-Related Eye Disease Study, one of the largest longitudinal AMD cohorts with CFPs. The proposed models outperformed the baseline models that utilized only single-visit CFPs to predict the risk of late AMD (0.879 vs 0.868 in AUC for 2-year prediction, and 0.879 vs 0.862 for 5-year prediction). Further experiments showed that utilizing longitudinal CFPs over a longer time period was helpful for deep learning models to predict the risk of late AMD. We made the source code available at https://github.com/bionlplab/AMD_prognosis_mlmi2022 to catalyze future works that seek to develop deep learning models for late AMD prediction.
Collapse
Affiliation(s)
- Junghwan Lee
- Columbia University, New York, USA,Weill Cornell Medicine, New York, USA
| | - Tingyi Wanyan
- Indiana University, Bloomington, USA,Ichan School of Medicine at Mount Sinai, New York, USA,Weill Cornell Medicine, New York, USA
| | - Qingyu Chen
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, USA
| | | | | | - Emily Y. Chew
- National Eye Institute, National Institutes of Health, Bethesda, USA
| | - Zhiyong Lu
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, USA
| | - Fei Wang
- Weill Cornell Medicine, New York, USA
| | | |
Collapse
|
8
|
Wanyan T, Lin M, Klang E, Menon KM, Gulamali FF, Azad A, Zhang Y, Ding Y, Wang Z, Wang F, Glicksberg B, Peng Y. Supervised Pretraining through Contrastive Categorical Positive Samplings to Improve COVID-19 Mortality Prediction. ACM-BCB ... ... : THE ... ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY AND BIOMEDICINE. ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY AND BIOMEDICINE 2022; 2022:9. [PMID: 35960866 PMCID: PMC9365529 DOI: 10.1145/3535508.3545541] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Clinical EHR data is naturally heterogeneous, where it contains abundant sub-phenotype. Such diversity creates challenges for outcome prediction using a machine learning model since it leads to high intra-class variance. To address this issue, we propose a supervised pre-training model with a unique embedded k-nearest-neighbor positive sampling strategy. We demonstrate the enhanced performance value of this framework theoretically and show that it yields highly competitive experimental results in predicting patient mortality in real-world COVID-19 EHR data with a total of over 7,000 patients admitted to a large, urban health system. Our method achieves a better AUROC prediction score of 0.872, which outperforms the alternative pre-training models and traditional machine learning methods. Additionally, our method performs much better when the training data size is small (345 training instances).
Collapse
Affiliation(s)
- Tingyi Wanyan
- Population Health Sciences, Weill Cornell Medicine, New York, NY, USA
| | - Mingquan Lin
- Population Health Sciences, Weill Cornell Medicine, New York, NY, USA
| | - Eyal Klang
- Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | | | | | - Ariful Azad
- Intelligent Systems Engineering, Indiana University, Bloomington, Bloomington, IN, USA
| | - Yiye Zhang
- Population Health Sciences, Weill Cornell Medicine, New York, NY, USA
| | - Ying Ding
- School of Information, University of Texus Austin, Austin, TX, USA
| | - Zhangyang Wang
- Electrical and Computer Engineering, University of Texus Austin, Austin, TX, USA
| | - Fei Wang
- Population Health Sciences, Weill Cornell Medicine, New York, NY, USA
| | | | - Yifan Peng
- Population Health Sciences, Weill Cornell Medicine, New York, NY, USA
| |
Collapse
|
9
|
Vision for Improving Pregnancy Health: Innovation and the Future of Pregnancy Research. Reprod Sci 2022; 29:2908-2920. [PMID: 35534766 PMCID: PMC9537127 DOI: 10.1007/s43032-022-00951-w] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2022] [Accepted: 04/15/2022] [Indexed: 10/25/2022]
Abstract
Understanding, predicting, and preventing pregnancy disorders have been a major research target. Nonetheless, the lack of progress is illustrated by research results related to preeclampsia and other hypertensive pregnancy disorders. These remain a major cause of maternal and infant mortality worldwide. There is a general consensus that the rate of progress toward understanding pregnancy disorders lags behind progress in other aspects of human health. In this presentation, we advance an explanation for this failure and suggest solutions. We propose that progress has been impeded by narrowly focused research training and limited imagination and innovation, resulting in the failure to think beyond conventional research approaches and analytical strategies. Investigations have been largely limited to hypothesis-generating approaches constrained by attempts to force poorly defined complex disorders into a single "unifying" hypothesis. Future progress could be accelerated by rethinking this approach. We advise taking advantage of innovative approaches that will generate new research strategies for investigating pregnancy abnormalities. Studies should begin before conception, assessing pregnancy longitudinally, before, during, and after pregnancy. Pregnancy disorders should be defined by pathophysiology rather than phenotype, and state of the art agnostic assessment of data should be adopted to generate new ideas. Taking advantage of new approaches mandates emphasizing innovation, inclusion of large datasets, and use of state of the art experimental and analytical techniques. A revolution in understanding pregnancy-associated disorders will depend on networks of scientists who are driven by an intense biological curiosity, a team spirit, and the tools to make new discoveries.
Collapse
|
10
|
A Comprehensive Review of Computation-Based Metal-Binding Prediction Approaches at the Residue Level. BIOMED RESEARCH INTERNATIONAL 2022; 2022:8965712. [PMID: 35402609 PMCID: PMC8989566 DOI: 10.1155/2022/8965712] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/02/2022] [Accepted: 03/04/2022] [Indexed: 12/29/2022]
Abstract
Clear evidence has shown that metal ions strongly connect and delicately tune the dynamic homeostasis in living bodies. They have been proved to be associated with protein structure, stability, regulation, and function. Even small changes in the concentration of metal ions can shift their effects from natural beneficial functions to harmful. This leads to degenerative diseases, malignant tumors, and cancers. Accurate characterizations and predictions of metalloproteins at the residue level promise informative clues to the investigation of intrinsic mechanisms of protein-metal ion interactions. Compared to biophysical or biochemical wet-lab technologies, computational methods provide open web interfaces of high-resolution databases and high-throughput predictors for efficient investigation of metal-binding residues. This review surveys and details 18 public databases of metal-protein binding. We collect a comprehensive set of 44 computation-based methods and classify them into four categories, namely, learning-, docking-, template-, and meta-based methods. We analyze the benchmark datasets, assessment criteria, feature construction, and algorithms. We also compare several methods on two benchmark testing datasets and include a discussion about currently publicly available predictive tools. Finally, we summarize the challenges and underlying limitations of the current studies and propose several prospective directions concerning the future development of the related databases and methods.
Collapse
|
11
|
Reported Adverse Effects and Attitudes among Arab Populations Following COVID-19 Vaccination: A Large-Scale Multinational Study Implementing Machine Learning Tools in Predicting Post-Vaccination Adverse Effects Based on Predisposing Factors. Vaccines (Basel) 2022; 10:vaccines10030366. [PMID: 35334998 PMCID: PMC8955470 DOI: 10.3390/vaccines10030366] [Citation(s) in RCA: 32] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2022] [Revised: 02/23/2022] [Accepted: 02/24/2022] [Indexed: 02/04/2023] Open
Abstract
Background: The unprecedented global spread of coronavirus disease 2019 (COVID-19) has imposed huge challenges on the healthcare facilities, and impacted every aspect of life. This has led to the development of several vaccines against COVID-19 within one year. This study aimed to assess the attitudes and the side effects among Arab communities after receiving a COVID-19 vaccine and use of machine learning (ML) tools to predict post-vaccination side effects based on predisposing factors. Methods: An online-based multinational survey was carried out via social media platforms from 14 June to 31 August 2021, targeting individuals who received at least one dose of a COVID-19 vaccine from 22 Arab countries. Descriptive statistics, correlation, and chi-square tests were used to analyze the data. Moreover, extensive ML tools were utilized to predict 30 post vaccination adverse effects and their severity based on 15 predisposing factors. The importance of distinct predisposing factors in predicting particular side effects was determined using global feature importance employing gradient boost as AutoML. Results: A total of 10,064 participants from 19 Arab countries were included in this study. Around 56% were female and 59% were aged from 20 to 39 years old. A high rate of vaccine hesitancy (51%) was reported among participants. Almost 88% of the participants were vaccinated with one of three COVID-19 vaccines, including Pfizer-BioNTech (52.8%), AstraZeneca (20.7%), and Sinopharm (14.2%). About 72% of participants experienced post-vaccination side effects. This study reports statistically significant associations (p < 0.01) between various predisposing factors and post-vaccinations side effects. In terms of predicting post-vaccination side effects, gradient boost, random forest, and XGBoost outperformed other ML methods. The most important predisposing factors for predicting certain side effects (i.e., tiredness, fever, headache, injection site pain and swelling, myalgia, and sleepiness and laziness) were revealed to be the number of doses, gender, type of vaccine, age, and hesitancy to receive a COVID-19 vaccine. Conclusions: The reported side effects following COVID-19 vaccination among Arab populations are usually non-life-threatening; flu-like symptoms and injection site pain. Certain predisposing factors have greater weight and importance as input data in predicting post-vaccination side effects. Based on the most significant input data, ML can also be used to predict these side effects; people with certain predicted side effects may require additional medical attention, or possibly hospitalization.
Collapse
|
12
|
Zemmal N, Benzebouchi NE, Azizi N, Schwab D, Belhaouari SB. Unbalanced Learning for Diabetes Diagnosis Based on Enhanced Resampling and Stacking Classifier. INTERNATIONAL JOURNAL OF INTELLIGENT INFORMATION TECHNOLOGIES 2022. [DOI: 10.4018/ijiit.309583] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Diabetes is characterized by an abnormally enhanced concentration of glucose in the blood serum. It has a damaging impact on several noble body systems. Today, the concept of unbalanced learning has developed considerably in the domain of medical diagnosis, which greatly reduces the generation of erroneous classification results. The paper takes a hybrid approach to imbalanced learning by proposing an enhanced multimodal meta-learning method called IRESAMPLE+St to distinguish between normal and diabetic patients. This approach relies on the Stacking paradigm by utilizing the complementarity that may exist between classifiers. In the same focus of this study, a modified RESAMPLE-based technique referred to as IRESAMPLE+ and the SMOTE method are integrated as a preliminary resampling step to overcome and resolve the problem of unbalanced data. The suggested IRESAMPLE+St provides a computerized diabetes diagnostic system with impressive results, comparing it to the principal related studies, reflecting the design and engineering successes achieved.
Collapse
Affiliation(s)
- Nawel Zemmal
- Mathematics and Computer Science Department, Mohamed Cherif Messaadia University, Souk-Ahras, Algeria & Labged Laboratory, Badji Mokhtar Annaba University, Annaba, Algeria
| | - Nacer Eddine Benzebouchi
- Labged Laboratory, Computer Science Department, Badji Mokhtar Annaba University, Annaba, Algeria
| | - Nabiha Azizi
- Labged Laboratory, Computer Science Department, Badji Mokhtar Annaba University, Annaba, Algeria
| | | | | |
Collapse
|
13
|
Miller A, Panneerselvam J, Liu L. A review of regression and classification techniques for analysis of common and rare variants and gene-environmental factors. Neurocomputing 2021. [DOI: 10.1016/j.neucom.2021.08.150] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
14
|
Beinecke J, Heider D. Gaussian noise up-sampling is better suited than SMOTE and ADASYN for clinical decision making. BioData Min 2021; 14:49. [PMID: 34844620 PMCID: PMC8628399 DOI: 10.1186/s13040-021-00283-6] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2021] [Accepted: 11/10/2021] [Indexed: 02/08/2023] Open
Abstract
Clinical data sets have very special properties and suffer from many caveats in machine learning. They typically show a high-class imbalance, have a small number of samples and a large number of parameters, and have missing values. While feature selection approaches and imputation techniques address the former problems, the class imbalance is typically addressed using augmentation techniques. However, these techniques have been developed for big data analytics, and their suitability for clinical data sets is unclear.This study analyzed different augmentation techniques for use in clinical data sets and subsequent employment of machine learning-based classification. It turns out that Gaussian Noise Up-Sampling (GNUS) is not always but generally, is as good as SMOTE and ADASYN and even outperform those on some datasets. However, it has also been shown that augmentation does not improve classification at all in some cases.
Collapse
Affiliation(s)
- Jacqueline Beinecke
- Department of Mathematics and Computer Science, Philipps-University of Marburg, Hans-Meerwein-Str. 6, 35043, Marburg, Germany
| | - Dominik Heider
- Department of Mathematics and Computer Science, Philipps-University of Marburg, Hans-Meerwein-Str. 6, 35043, Marburg, Germany.
| |
Collapse
|
15
|
Gandouz M, Holzmann H, Heider D. Machine learning with asymmetric abstention for biomedical decision-making. BMC Med Inform Decis Mak 2021; 21:294. [PMID: 34702225 PMCID: PMC8549182 DOI: 10.1186/s12911-021-01655-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2021] [Accepted: 10/13/2021] [Indexed: 02/08/2023] Open
Abstract
Machine learning and artificial intelligence have entered biomedical decision-making for diagnostics, prognostics, or therapy recommendations. However, these methods need to be interpreted with care because of the severe consequences for patients. In contrast to human decision-making, computational models typically make a decision also with low confidence. Machine learning with abstention better reflects human decision-making by introducing a reject option for samples with low confidence. The abstention intervals are typically symmetric intervals around the decision boundary. In the current study, we use asymmetric abstention intervals, which we demonstrate to be better suited for biomedical data that is typically highly imbalanced. We evaluate symmetric and asymmetric abstention on three real-world biomedical datasets and show that both approaches can significantly improve classification performance. However, asymmetric abstention rejects as many or fewer samples compared to symmetric abstention and thus, should be used in imbalanced data.
Collapse
Affiliation(s)
- Mariem Gandouz
- Department of Data Science in Biomedicine, Faculty of Mathematics and Computer Science, University of Marburg, 35032, Marburg, Germany
| | - Hajo Holzmann
- Department of Statistics, Faculty of Mathematics and Computer Science, University of Marburg, 35032, Marburg, Germany
| | - Dominik Heider
- Department of Data Science in Biomedicine, Faculty of Mathematics and Computer Science, University of Marburg, 35032, Marburg, Germany.
| |
Collapse
|
16
|
Hatmal MM, Alshaer W, Mahmoud IS, Al-Hatamleh MAI, Al-Ameer HJ, Abuyaman O, Zihlif M, Mohamud R, Darras M, Al Shhab M, Abu-Raideh R, Ismail H, Al-Hamadi A, Abdelhay A. Investigating the association of CD36 gene polymorphisms (rs1761667 and rs1527483) with T2DM and dyslipidemia: Statistical analysis, machine learning based prediction, and meta-analysis. PLoS One 2021; 16:e0257857. [PMID: 34648514 PMCID: PMC8516279 DOI: 10.1371/journal.pone.0257857] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2021] [Accepted: 09/11/2021] [Indexed: 12/15/2022] Open
Abstract
CD36 (cluster of differentiation 36) is a membrane protein involved in lipid metabolism and has been linked to pathological conditions associated with metabolic disorders, such as diabetes and dyslipidemia. A case-control study was conducted and included 177 patients with type-2 diabetes mellitus (T2DM) and 173 control subjects to study the involvement of CD36 gene rs1761667 (G>A) and rs1527483 (C>T) polymorphisms in the pathogenesis of T2DM and dyslipidemia among Jordanian population. Lipid profile, blood sugar, gender and age were measured and recorded. Also, genotyping analysis for both polymorphisms was performed. Following statistical analysis, 10 different neural networks and machine learning (ML) tools were used to predict subjects with diabetes or dyslipidemia. Towards further understanding of the role of CD36 protein and gene in T2DM and dyslipidemia, a protein-protein interaction network and meta-analysis were carried out. For both polymorphisms, the genotypic frequencies were not significantly different between the two groups (p > 0.05). On the other hand, some ML tools like multilayer perceptron gave high prediction accuracy (≥ 0.75) and Cohen's kappa (κ) (≥ 0.5). Interestingly, in K-star tool, the accuracy and Cohen's κ values were enhanced by including the genotyping results as inputs (0.73 and 0.46, respectively, compared to 0.67 and 0.34 without including them). This study confirmed, for the first time, that there is no association between CD36 polymorphisms and T2DM or dyslipidemia among Jordanian population. Prediction of T2DM and dyslipidemia, using these extensive ML tools and based on such input data, is a promising approach for developing diagnostic and prognostic prediction models for a wide spectrum of diseases, especially based on large medical databases.
Collapse
Affiliation(s)
- Ma’mon M. Hatmal
- Department of Medical Laboratory Sciences, Faculty of Applied Medical Sciences, The Hashemite University, Zarqa, Jordan
- * E-mail:
| | - Walhan Alshaer
- Cell Therapy Centre, The University of Jordan, Amman, Jordan
| | - Ismail S. Mahmoud
- Department of Medical Laboratory Sciences, Faculty of Applied Medical Sciences, The Hashemite University, Zarqa, Jordan
| | - Mohammad A. I. Al-Hatamleh
- Department of Immunology, School of Medical Sciences, Universiti Sains Malaysia, Kubang Kerian, Kelantan, Malaysia
| | - Hamzeh J. Al-Ameer
- Department of Biology and Biotechnology, American University of Madaba, Madaba, Jordan
- Department of Pharmacology, Faculty of Medicine, The University of Jordan, Amman, Jordan
| | - Omar Abuyaman
- Department of Medical Laboratory Sciences, Faculty of Applied Medical Sciences, The Hashemite University, Zarqa, Jordan
| | - Malek Zihlif
- Department of Pharmacology, Faculty of Medicine, The University of Jordan, Amman, Jordan
| | - Rohimah Mohamud
- Department of Immunology, School of Medical Sciences, Universiti Sains Malaysia, Kubang Kerian, Kelantan, Malaysia
| | - Mais Darras
- Department of Medical Laboratory Sciences, Faculty of Applied Medical Sciences, The Hashemite University, Zarqa, Jordan
| | - Mohammad Al Shhab
- Department of Pharmacology, Faculty of Medicine, The University of Jordan, Amman, Jordan
| | - Rand Abu-Raideh
- Department of Medical Laboratory Sciences, Faculty of Applied Medical Sciences, The Hashemite University, Zarqa, Jordan
| | - Hilweh Ismail
- Department of Medical Laboratory Sciences, Faculty of Applied Medical Sciences, The Hashemite University, Zarqa, Jordan
| | - Ali Al-Hamadi
- Department of Medical Laboratory Sciences, Faculty of Applied Medical Sciences, The Hashemite University, Zarqa, Jordan
| | - Ali Abdelhay
- Department of Pharmacology, Faculty of Medicine, The University of Jordan, Amman, Jordan
| |
Collapse
|
17
|
Ylenia C, Lauri Chiara D, Giovanni I, Lucia R, Donatella V, Tiziana S, Vincenzo G, Ciro V, Stefania S. A Clinical Decision Support System based on fuzzy rules and classification algorithms for monitoring the physiological parameters of type-2 diabetic patients. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2021; 18:2653-2674. [PMID: 33892565 DOI: 10.3934/mbe.2021135] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
The use of different types of Clinical Decision Support Systems (CDSS) makes possible the improvement of the quality of the therapeutic and diagnostic efficiency in health field. Those systems, properly implemented, are able to simulate human expert clinician reasoning in order to suggest decisions on treatment of patients. In this paper, we exploit fuzzy inference machines to improve the quality of the day-by-day clinical care of type-2 diabetic patients of Anti-Diabetes Centre (CAD) of the Local Health Authority ASL Naples 1 (Naples, Italy). All the designed functionalities were developed thanks to the experience on the field, through different phases (data collection and adjustment, Fuzzy Inference System development and its validation on real cases) executed by an interdisciplinary research team comprising doctors, clinicians and IT engineers. The proposed approach also allows the remote monitoring of patients' clinical conditions and, hence, can help to reduce hospitalizations.
Collapse
Affiliation(s)
- Colella Ylenia
- Department of Electronic Engineering and Information Technology, Faculty of Engineering, University of Naples Federico II, Naples, Italy
| | - De Lauri Chiara
- Department of Electronic Engineering and Information Technology, Faculty of Engineering, University of Naples Federico II, Naples, Italy
| | - Improta Giovanni
- Department of Public Health of the University Hospital, University of Naples Federico II, Naples, Italy
- Interdepartmental Center for Research in Health Management and Innovation in Health (CIRMIS), University of Naples Federico II, Naples, Italy
| | - Rossano Lucia
- Department of Electronic Engineering and Information Technology, Faculty of Engineering, University of Naples Federico II, Naples, Italy
| | - Vecchione Donatella
- Department of Electronic Engineering and Information Technology, Faculty of Engineering, University of Naples Federico II, Naples, Italy
| | | | | | | | - Santini Stefania
- Department of Electronic Engineering and Information Technology, Faculty of Engineering, University of Naples Federico II, Naples, Italy
| |
Collapse
|
18
|
Zhou Y, Ma XL, Zhang T, Wang J, Zhang T, Tian R. Use of radiomics based on 18F-FDG PET/CT and machine learning methods to aid clinical decision-making in the classification of solitary pulmonary lesions: an innovative approach. Eur J Nucl Med Mol Imaging 2021; 48:2904-2913. [PMID: 33547553 DOI: 10.1007/s00259-021-05220-7] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2020] [Accepted: 01/25/2021] [Indexed: 02/06/2023]
Abstract
PURPOSE This study was designed and performed to assess the ability of 18F-fluorodeoxyglucose (FDG) positron emission tomography (PET) and computed tomography (CT) radiomics features combined with machine learning methods to differentiate between primary and metastatic lung lesions and to classify histological subtypes. Moreover, we identified the optimal machine learning method. METHODS A total of 769 patients pathologically diagnosed with primary or metastatic lung cancers were enrolled. We used the LIFEx package to extract radiological features from semiautomatically segmented PET and CT images within the same volume of interest. Patients were randomly distributed in training and validation sets. Through the evaluation of five feature selection methods and nine classification methods, discriminant models were established. The robustness of the procedure was controlled by tenfold cross-validation. The model's performance was evaluated using the area under the receiver operating characteristic curve (AUC). RESULTS Based on the radiomics features extracted from PET and CT images, forty-five discriminative models were established. Combined with appropriate feature selection methods, most classifiers showed excellent discriminative ability with AUCs greater than 0.75. In the differentiation between primary and metastatic lung lesions, the feature selection method gradient boosting decision tree (GBDT) combined with the classifier GBDT achieved the highest classification AUC of 0.983 in the PET dataset. In contrast, the feature selection method eXtreme gradient boosting combined with the classifier random forest (RF) achieved the highest AUC of 0.828 in the CT dataset. In the discrimination between squamous cell carcinoma and adenocarcinoma, the combination of GBDT feature selection method with GBDT classification had the highest AUC of 0.897 in the PET dataset. In contrast, the combination of the GBDT feature selection method with the RF classification had the highest AUC of 0.839 in the CT dataset. Most of the decision tree (DT)-based models were overfitted, suggesting that the classification method was not appropriate for practical application. CONCLUSION 18F-FDG PET/CT radiomics features combined with machine learning methods can distinguish between primary and metastatic lung lesions and identify histological subtypes in lung cancer. GBDT and RF were considered optimal classification methods for the PET and CT datasets, respectively, and GBDT was considered the optimal feature selection method in our analysis.
Collapse
Affiliation(s)
- Yi Zhou
- Department of Nuclear Medicine, West China Hospital, Sichuan University, 37# GuoXueLane, Chengdu, 610041, China
| | - Xue-Lei Ma
- Department of Biotherapy, Cancer Center, State Key Laboratory of Biotherapy, West China Hospital, Sichuan University, 37# GuoXueLane, Chengdu, 610041, China
| | - Ting Zhang
- West China School of Medicine, West China Hospital, Sichuan University, 37# GuoXueLane, Chengdu, 610041, China
| | - Jian Wang
- School of Computer Science, Nanjing University of Science and Technology, No. 200, Xiaolinwei Road, Nanjing, 210094, China
| | - Tao Zhang
- West China School of Medicine, West China Hospital, Sichuan University, 37# GuoXueLane, Chengdu, 610041, China
| | - Rong Tian
- Department of Nuclear Medicine, West China Hospital, Sichuan University, 37# GuoXueLane, Chengdu, 610041, China.
| |
Collapse
|
19
|
Comparison of machine learning tools for the prediction of AMD based on genetic, age, and diabetes-related variables in the Chinese population. Regen Ther 2021; 15:180-186. [PMID: 33426217 PMCID: PMC7770346 DOI: 10.1016/j.reth.2020.09.001] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2020] [Revised: 09/01/2020] [Accepted: 09/09/2020] [Indexed: 11/23/2022] Open
Abstract
Introduction Age-related macular degeneration (AMD) is the main cause of visual impairment and the most important cause of blindness in older people. However, there is currently no effective treatment for this disease, so it is necessary to establish a risk model to predict AMD development. Methods This study included a total of 202 subjects, comprising 82 AMD patients and 120 control subjects. Sixty-six single-nucleotide polymorphisms (SNPs) were identified using the MassArray assay. Considering 14 independent clinical variables as well as SNPs, four predictive models were established in the training set and evaluated by the confusion matrix, area under the receiver operating characteristic (ROC) curve (AUROC). The difference distributions of the 14 independent clinical features between the AMD and control groups were tested using the chi-squared test. Age and diabetes were adjusted using logistic regression analysis and the “genomic-control” method was used for multiple testing correction. Results Three SNPs (rs10490924, OR = 1.686, genomic-control corrected p-value (GC) = 0.030; rs2338104, OR = 1.794, GC = 0.025 and rs1864163, OR = 2.125, GC = 0.038) were significant risk factors for AMD development. In the training set, four models obtained AUROC values above 0.72. Conclusions We believe machine learning tools will be useful for the early prediction of AMD and for the development of relevant intervention strategies.
Collapse
|
20
|
Liberda EN, Zuk AM, Martin ID, Tsuji LJS. Fisher's Linear Discriminant Function Analysis and its Potential Utility as a Tool for the Assessment of Health-and-Wellness Programs in Indigenous Communities. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2020; 17:ijerph17217894. [PMID: 33126498 PMCID: PMC7663610 DOI: 10.3390/ijerph17217894] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/28/2020] [Revised: 10/22/2020] [Accepted: 10/25/2020] [Indexed: 11/16/2022]
Abstract
Diabetes mellitus is a growing public health problem affecting persons in both developed and developing nations. The prevalence of type 2 diabetes mellitus (T2DM) is reported to be several times higher among Indigenous populations compared to their non-Indigenous counterparts. Discriminant function analysis (DFA) is a potential tool that can be used to quantitatively evaluate the effectiveness of Indigenous health-and-wellness programs (e.g., on-the-land programs, T2DM interventions), by creating a type of pre-and-post-program scoring system. As the communities of the Eeyou Istchee territory, subarctic Quebec, Canada, have varying degrees of isolation, we derived a DFA tool for point-of-contact evaluations to aid in monitoring and assessment of health-and-wellness programs in rural and remote locations. We developed several DFA models to discriminate between those with and without T2DM status using age, fasting blood glucose, body mass index, waist girth, systolic and diastolic blood pressure, high-density lipoprotein, triglycerides, and total cholesterol in participants from the Eeyou Istchee. The models showed a ~97% specificity (i.e., true positives for non-T2DM) in classification. This study highlights how varying risk factor models can be used to discriminate those without T2DM with high specificity among James Bay Cree communities in Canada.
Collapse
Affiliation(s)
- Eric N. Liberda
- School of Occupational and Public Health, Ryerson University, Toronto, ON M5B 2K3, Canada
- Correspondence: ; Tel.: +1-416-979-5000
| | - Aleksandra M. Zuk
- Department of Physical and Environmental Sciences, University of Toronto, Toronto, ON M1C 1A4, Canada; (A.M.Z.); (I.D.M.); (L.J.S.T.)
- School of Nursing, Faculty of Health Sciences, Queen’s University, Kingston, ON K7L 3N6, Canada
| | - Ian D. Martin
- Department of Physical and Environmental Sciences, University of Toronto, Toronto, ON M1C 1A4, Canada; (A.M.Z.); (I.D.M.); (L.J.S.T.)
| | - Leonard J. S. Tsuji
- Department of Physical and Environmental Sciences, University of Toronto, Toronto, ON M1C 1A4, Canada; (A.M.Z.); (I.D.M.); (L.J.S.T.)
| |
Collapse
|
21
|
Liu J, Wang L, Zhang L, Zhang Z, Zhang S. Predictive analytics for blood glucose concentration: an empirical study using the tree-based ensemble approach. LIBRARY HI TECH 2020. [DOI: 10.1108/lht-08-2019-0171] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
PurposeThe primary objective of this study was to recognize critical indicators in predicting blood glucose (BG) through data-driven methods and to compare the prediction performance of four tree-based ensemble models, i.e. bagging with tree regressors (bagging-decision tree [Bagging-DT]), AdaBoost with tree regressors (Adaboost-DT), random forest (RF) and gradient boosting decision tree (GBDT).Design/methodology/approachThis study proposed a majority voting feature selection method by combining lasso regression with the Akaike information criterion (AIC) (LR-AIC), lasso regression with the Bayesian information criterion (BIC) (LR-BIC) and RF to select indicators with excellent predictive performance from initial 38 indicators in 5,642 samples. The selected features were deployed to build the tree-based ensemble models. The 10-fold cross-validation (CV) method was used to evaluate the performance of each ensemble model.FindingsThe results of feature selection indicated that age, corpuscular hemoglobin concentration (CHC), red blood cell volume distribution width (RBCVDW), red blood cell volume and leucocyte count are five most important clinical/physical indicators in BG prediction. Furthermore, this study also found that the GBDT ensemble model combined with the proposed majority voting feature selection method is better than other three models with respect to prediction performance and stability.Practical implicationsThis study proposed a novel BG prediction framework for better predictive analytics in health care.Social implicationsThis study incorporated medical background and machine learning technology to reduce diabetes morbidity and formulate precise medical schemes.Originality/valueThe majority voting feature selection method combined with the GBDT ensemble model provides an effective decision-making tool for predicting BG and detecting diabetes risk in advance.
Collapse
|
22
|
Anand PK, Shin DR, Memon ML. Adaptive Boosting Based Personalized Glucose Monitoring System (PGMS) for Non-Invasive Blood Glucose Prediction with Improved Accuracy. Diagnostics (Basel) 2020; 10:diagnostics10050285. [PMID: 32392841 PMCID: PMC7278000 DOI: 10.3390/diagnostics10050285] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2020] [Revised: 04/28/2020] [Accepted: 05/04/2020] [Indexed: 12/13/2022] Open
Abstract
In this paper, we present an architecture of a personalized glucose monitoring system (PGMS). PGMS consists of both invasive and non-invasive sensors on a single device. Initially, blood glucose is measured invasively and non-invasively, to train the machine learning models. Then, paired data and corresponding errors are divided scientifically into six different clusters based on blood glucose ranges as per the patient’s diabetic conditions. Each cluster is trained to build the unique error prediction model using an adaptive boosting (AdaBoost) algorithm. Later, these error prediction models undergo personalized calibration based on the patient’s characteristics. Once, the errors in predicted non-invasive values are within the acceptable error range, the device gets personalized for a patient to measure the blood glucose non-invasively. We verify PGMS on two different datasets. Performance analysis shows that the mean absolute relative difference (MARD) is reduced exceptionally to 7.3% and 7.1% for predicted values as compared to 25.4% and 18.4% for measured non-invasive glucose values. The Clarke error grid analysis (CEGA) plot for non-invasive predicted values shows 97% data in Zone A and 3% data in Zone B for dataset 1. Moreover, for dataset 2 results echoed with 98% and 2% in Zones A and B, respectively.
Collapse
Affiliation(s)
- Pradeep Kumar Anand
- College of Information and Communication Engineering, Sungkyunkwan University, Suwon 16419, Korea;
| | - Dong Ryeol Shin
- College of Software, Sungkyunkwan University, Suwon 16419, Korea
- Correspondence: ; Tel.: +82-103-015-7125
| | - Mudasar Latif Memon
- IBA Community College Naushahro Feroze, Sukkur IBA University, Sindh 65200, Pakistan;
| |
Collapse
|
23
|
Zou Y, Wang D, Liu L. Research on Human Movement Target Recognition Algorithm in Complex Traffic Environment. INT J PATTERN RECOGN 2020. [DOI: 10.1142/s0218001420500123] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
With the increase in the total population of the society and the continuous increase in the number of trips, the traffic pressures faced by people are increasing. With the development and advancement of computer technology, the emergence of intelligent transportation provides a better way to solve the problem of effectively alleviating traffic pressure and reducing the incidence of traffic accidents. In recent years, intelligent traffic monitoring system, as one of the important branches in the field of intelligent transportation, has also received more and more attention. Among them, video-based moving target recognition technology involves theoretical knowledge in various fields such as artificial intelligence, image processing, pattern recognition and computer vision. It is an important means to realize “safe city” and “smart city” and a key technology for intelligent monitoring. Therefore, the research on human motion target recognition algorithm in complex traffic environment has important theoretical and practical value. In the field of intelligent traffic monitoring, the moving target detection and recognition effect of video images will have certain influence on the classification and behavior understanding of subsequent moving targets. In this paper, the commonly used moving target detection methods are studied first, and the convergence problem of the traditional Adaboost algorithm is improved. An Adaboost algorithm based on adaptive weight update is proposed, and then the support vector machine (SVM) is used. The algorithm identifies the detected moving target. Finally, through simulation experiments on the acquired video images, the results show that the proposed human motion target recognition algorithm based on adaptive weight update Adaboost and SVM has good feasibility and rationality.
Collapse
Affiliation(s)
- Ying Zou
- College of Information Science and Technology, Zhongkai University of Agriculture and Engineering, Guangzhou, P. R. China
| | - Dahu Wang
- School of Electrical Engineering and Automation, Henan Polytechnic University, Jiaozuo, Henan, P. R. China
| | - Leian Liu
- College of Information Science and Technology, Zhongkai University of Agriculture and Engineering, Guangzhou, P. R. China
| |
Collapse
|
24
|
Construction of cascaded depth model based on boosting feature selection and classification. EVOLUTIONARY INTELLIGENCE 2020. [DOI: 10.1007/s12065-020-00413-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
25
|
Choubey DK, Kumar M, Shukla V, Tripathi S, Dhandhania VK. Comparative Analysis of Classification Methods with PCA and LDA for Diabetes. Curr Diabetes Rev 2020; 16:833-850. [PMID: 31971112 DOI: 10.2174/1573399816666200123124008] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/27/2019] [Revised: 09/30/2019] [Accepted: 11/11/2019] [Indexed: 12/20/2022]
Abstract
BACKGROUND The modern society is extremely prone to many life-threatening diseases, which can be easily controlled as well as cured if diagnosed at an early stage. The development and implementation of a disease diagnostic system have gained huge popularity over the years. In the current scenario, there are certain factors such as environment, sedentary lifestyle, genetic (hereditary) are the major factors behind the life threatening diseases such as 'diabetes.' Moreover, diabetes has achieved the status of the modern man's leading chronic disease. So one of the prime needs of this generation is to develop a state-of-the-art expert system which can predict diabetes at a very early stage with a minimum of complexity and in an expedited manner. The primary objective of this work is to develop an indigenous and efficient diagnostic technique for detection of diabetes. Method & Discussion: The proposed methodology comprises of two phases: In the first phase The Pima Indian Diabetes Dataset (PIDD) has been collected from the UCI machine learning repository databases and Localized Diabetes Dataset (LDD) has been gathered from Bombay Medical Hall, Upper Bazar Ranchi, Jharkhand, India. In the second phase, the dataset has been processed through two different approaches. The first approach entails classification through Adaboost, Classification via Regression (CVR), Radial Basis Function Network (RBFN), K-Nearest Neighbor (KNN) on Pima Indian Diabetes Dataset and Localized Diabetes Dataset. In the second approach, Principal Component Analysis (PCA) and Linear Discriminant Analysis (LDA) have been applied as a feature reduction method followed by using the same set of classification methods used in the first approach. Among all of the implemented classification methods, PCA_CVR achieves the maximum performance for both the above mentioned datasets. CONCLUSION In this article, comparative analysis of outcomes obtained by with and without the use of PCA and LDA for the same set of classification method has been done w.r.t performance assessment. Finally, it has been concluded that PCA & LDA both are useful to remove the insignificant features, decreasing the expense and computation time while improving the ROC and accuracy. The used methodology may similarly be applied to other medical diseases.
Collapse
Affiliation(s)
- Dilip Kumar Choubey
- School of Computer Science and Engineering, Vellore Institute of Technology, Vellore, India
| | - Manish Kumar
- Department of E.C.E & Biomedical Engineering, Mody University of Science and Technology, Sikar, India
| | | | - Sudhakar Tripathi
- Depatment of Information Technology, Rajkiya Engineering College, Ambedkar Nagar, India
| | | |
Collapse
|
26
|
|
27
|
Comparison of Machine Learning Techniques for Prediction of Hospitalization in Heart Failure Patients. J Clin Med 2019; 8:jcm8091298. [PMID: 31450546 PMCID: PMC6780582 DOI: 10.3390/jcm8091298] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2019] [Revised: 08/20/2019] [Accepted: 08/22/2019] [Indexed: 12/23/2022] Open
Abstract
The present study aims to compare the performance of eight Machine Learning Techniques (MLTs) in the prediction of hospitalization among patients with heart failure, using data from the Gestione Integrata dello Scompenso Cardiaco (GISC) study. The GISC project is an ongoing study that takes place in the region of Puglia, Southern Italy. Patients with a diagnosis of heart failure are enrolled in a long-term assistance program that includes the adoption of an online platform for data sharing between general practitioners and cardiologists working in hospitals and community health districts. Logistic regression, generalized linear model net (GLMN), classification and regression tree, random forest, adaboost, logitboost, support vector machine, and neural networks were applied to evaluate the feasibility of such techniques in predicting hospitalization of 380 patients enrolled in the GISC study, using data about demographic characteristics, medical history, and clinical characteristics of each patient. The MLTs were compared both without and with missing data imputation. Overall, models trained without missing data imputation showed higher predictive performances. The GLMN showed better performance in predicting hospitalization than the other MLTs, with an average accuracy, positive predictive value and negative predictive value of 81.2%, 87.5%, and 75%, respectively. Present findings suggest that MLTs may represent a promising opportunity to predict hospital admission of heart failure patients by exploiting health care information generated by the contact of such patients with the health care system.
Collapse
|
28
|
Spänig S, Emberger-Klein A, Sowa JP, Canbay A, Menrad K, Heider D. The virtual doctor: An interactive clinical-decision-support system based on deep learning for non-invasive prediction of diabetes. Artif Intell Med 2019; 100:101706. [PMID: 31607340 DOI: 10.1016/j.artmed.2019.101706] [Citation(s) in RCA: 48] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2019] [Revised: 06/27/2019] [Accepted: 08/18/2019] [Indexed: 11/16/2022]
Abstract
Artificial intelligence (AI) will pave the way to a new era in medicine. However, currently available AI systems do not interact with a patient, e.g., for anamnesis, and thus are only used by the physicians for predictions in diagnosis or prognosis. However, these systems are widely used, e.g., in diabetes or cancer prediction. In the current study, we developed an AI that is able to interact with a patient (virtual doctor) by using a speech recognition and speech synthesis system and thus can autonomously interact with the patient, which is particularly important for, e.g., rural areas, where the availability of primary medical care is strongly limited by low population densities. As a proof-of-concept, the system is able to predict type 2 diabetes mellitus (T2DM) based on non-invasive sensors and deep neural networks. Moreover, the system provides an easy-to-interpret probability estimation for T2DM for a given patient. Besides the development of the AI, we further analyzed the acceptance of young people for AI in healthcare to estimate the impact of such a system in the future.
Collapse
Affiliation(s)
- Sebastian Spänig
- Department of Bioinformatics, Faculty of Mathematics and Computer Science, Philipps-University of Marburg, Hans-Meerwein-Str. 6, 35037 Marburg, Germany
| | - Agnes Emberger-Klein
- Chair of Marketing and Management of Biogenic Resources, Weihenstephan-Triesdorf University of Applied Sciences/TUM Campus Straubing for Biotechnology and Sustainability, Petersgasse 18, 94315 Straubing, Germany
| | - Jan-Peter Sowa
- Department of Gastroenterology, Hepatology and Infectious Diseases, Otto-von-Guericke University Magdeburg, Leipziger Str. 44, 39120 Magdeburg, Germany
| | - Ali Canbay
- Department of Gastroenterology, Hepatology and Infectious Diseases, Otto-von-Guericke University Magdeburg, Leipziger Str. 44, 39120 Magdeburg, Germany
| | - Klaus Menrad
- Chair of Marketing and Management of Biogenic Resources, Weihenstephan-Triesdorf University of Applied Sciences/TUM Campus Straubing for Biotechnology and Sustainability, Petersgasse 18, 94315 Straubing, Germany
| | - Dominik Heider
- Department of Bioinformatics, Faculty of Mathematics and Computer Science, Philipps-University of Marburg, Hans-Meerwein-Str. 6, 35037 Marburg, Germany.
| |
Collapse
|
29
|
A hybrid Forecast Cost Benefit Classification of diabetes mellitus prevalence based on epidemiological study on Real-life patient's data. Sci Rep 2019; 9:10103. [PMID: 31300715 PMCID: PMC6626127 DOI: 10.1038/s41598-019-46631-9] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2018] [Accepted: 07/01/2019] [Indexed: 12/12/2022] Open
Abstract
The increasing ratio of diabetes is found risky across the planet. Therefore, the diagnosis is important in population with extreme risk of diabetes. In this study, a decision-making classifier (J48) is applied over a data-mining platform (Weka) to measure accuracy and linear regression on classification results to forecast cost/benefit ratio in diabetes mellitus patients along with prevalence. In total 108 invasive and non-invasive medical features are considered from 251 patients for assessment, and the real-time data are gathered from Pakistan over a time span of June 2017 to April 2018. The results indicate that J48 classifiers achieved the best accuracy of (99.28%), whereas, error rate (0.08%), Kappa stats, PRC, and MCC are (0.98%), precision, recall, and F-matrix are (0.99%). In addition, true positive rate is (0.99%) and false positive is (0.08%). The regression forecast decision indicates blood pressure and glucose level are key features for diabetes. The cost/benefit matrix indicates two predictions for positive test with accuracy (66.68%) and (30.60%), and key attributes with total Gain (118.13%). The study confirmed the proposed prediction is practical for screening of diabetes mellitus patients at the initial stage without invasive medical tests and found effectual in the early diagnosis of diabetes.
Collapse
|
30
|
Yoo TK, Ryu IH, Lee G, Kim Y, Kim JK, Lee IS, Kim JS, Rim TH. Adopting machine learning to automatically identify candidate patients for corneal refractive surgery. NPJ Digit Med 2019; 2:59. [PMID: 31304405 PMCID: PMC6586803 DOI: 10.1038/s41746-019-0135-8] [Citation(s) in RCA: 36] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2019] [Accepted: 05/30/2019] [Indexed: 12/26/2022] Open
Abstract
Recently, it has become more important to screen candidates that undergo corneal refractive surgery to prevent complications. Until now, there is still no definitive screening method to confront the possibility of a misdiagnosis. We evaluate the possibilities of machine learning as a clinical decision support to determine the suitability to corneal refractive surgery. A machine learning architecture was built with the aim of identifying candidates combining the large multi-instrument data from patients and clinical decisions of highly experienced experts. Five heterogeneous algorithms were used to predict candidates for surgery. Subsequently, an ensemble classifier was developed to improve the performance. Training (10,561 subjects) and internal validation (2640 subjects) were conducted using subjects who had visited between 2016 and 2017. External validation (5279 subjects) was performed using subjects who had visited in 2018. The best model, i.e., the ensemble classifier, had a high prediction performance with the area under the receiver operating characteristic curves of 0.983 (95% CI, 0.977-0.987) and 0.972 (95% CI, 0.967-0.976) when tested in the internal and external validation set, respectively. The machine learning models were statistically superior to classic methods including the percentage of tissue ablated and the Randleman ectatic score. Our model was able to correctly reclassify a patient with postoperative ectasia as an ectasia-risk group. Machine learning algorithms using a wide range of preoperative information achieved a comparable performance to screen candidates for corneal refractive surgery. An automated machine learning analysis of preoperative data can provide a safe and reliable clinical decision for refractive surgery.
Collapse
Affiliation(s)
- Tae Keun Yoo
- B&VIIt Eye Center, Seoul, South Korea.,2Institute of Vision Research, Department of Ophthalmology, Yonsei University College of Medicine, Seoul, South Korea
| | | | | | | | | | | | | | - Tyler Hyungtaek Rim
- Singapore Eye Research Institute, Singapore National Eye Centre, Duke-NUS Medical School, Singapore, Singapore.,5Department of Ophthalmology, Yonsei University College of Medicine, Graduate School, Seoul, Korea
| |
Collapse
|
31
|
Radiomics-based machine learning methods for isocitrate dehydrogenase genotype prediction of diffuse gliomas. J Cancer Res Clin Oncol 2019; 145:543-550. [PMID: 30719536 PMCID: PMC6394679 DOI: 10.1007/s00432-018-2787-1] [Citation(s) in RCA: 64] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2018] [Accepted: 11/01/2018] [Indexed: 12/20/2022]
Abstract
Purpose Reliable and accurate predictive models are necessary to drive the success of radiomics. Our aim was to identify the optimal radiomics-based machine learning method for isocitrate dehydrogenase (IDH) genotype prediction in diffuse gliomas. Methods Eight classical machine learning methods were evaluated in terms of their stability and performance for pre-operative IDH genotype prediction. A total of 126 patients were enrolled for analysis. Overall, 704 radiomic features extracted from the pre-operative MRI images were analyzed. The patients were randomly assigned to either the training set or the validation set at a ratio of 2:1. Feature selection and classification model training were done using the training set, whereas the predictive performance and stability of the model were independently assessed using the validation set. Results Random Forest (RF) showed high predictive performance (accuracy 0.885 ± 0.041, AUC 0.931 ± 0.036), whereas neural network (NN) (accuracy 0.829 ± 0.064, AUC 0.878 ± 0.052) and flexible discriminant analysis (FDA) (accuracy 0.851 ± 0.049, AUC 0.875 ± 0.057) displayed low predictive performance. With regard to stability, RF also showed high robustness against data perturbation (relative standard deviations, RSD 3.87%). Conclusions RF is a promising machine learning method in predicting IDH genotype. Development of an accurate and reliable model can assist in the initial diagnostic evaluation and treatment planning for diffuse glioma patients. Electronic supplementary material The online version of this article (10.1007/s00432-018-2787-1) contains supplementary material, which is available to authorized users.
Collapse
|
32
|
Farran B, AlWotayan R, Alkandari H, Al-Abdulrazzaq D, Channanath A, Thanaraj TA. Use of Non-invasive Parameters and Machine-Learning Algorithms for Predicting Future Risk of Type 2 Diabetes: A Retrospective Cohort Study of Health Data From Kuwait. Front Endocrinol (Lausanne) 2019; 10:624. [PMID: 31572303 PMCID: PMC6749017 DOI: 10.3389/fendo.2019.00624] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/14/2018] [Accepted: 08/28/2019] [Indexed: 12/12/2022] Open
Abstract
Objective: In recent decades, the Arab population has experienced an increase in the prevalence of type 2 diabetes (T2DM), particularly within the Gulf Cooperation Council. In this context, early intervention programmes rely on an ability to identify individuals at risk of T2DM. We aimed to build prognostic models for the risk of T2DM in the Arab population using machine-learning algorithms vs. conventional logistic regression (LR) and simple non-invasive clinical markers over three different time scales (3, 5, and 7 years from the baseline). Design: This retrospective cohort study used three models based on LR, k-nearest neighbours (k-NN), and support vector machines (SVM) with five-fold cross-validation. The models included the following baseline non-invasive parameters: age, sex, body mass index (BMI), pre-existing hypertension, family history of hypertension, and T2DM. Setting: This study was based on data from the Kuwait Health Network (KHN), which integrated primary health and hospital laboratory data into a single system. Participants: The study included 1,837 native Kuwaiti Arab individuals (equal proportion of men and women) with mean age as 59.5 ± 11.4 years. Among them, 647 developed T2DM within 7 years of the baseline non-invasive measurements. Analytical methods: The discriminatory power of each model for classifying people at risk of T2DM within 3, 5, or 7 years and the area under the receiver operating characteristic curve (AUC) were determined. Outcome measures: Onset of T2DM at 3, 5, and 7 years. Results: The k-NN machine-learning technique, which yielded AUC values of 0.83, 0.82, and 0.79 for 3-, 5-, and 7-year prediction horizons, respectively, outperformed the most commonly used LR method and other previously reported methods. Comparable results were achieved using the SVM and LR models with corresponding AUC values of (SVM: 0.73, LR: 0.74), (SVM: 0.68, LR: 0.72), and (SVM: 0.71, LR: 0.70) for 3-, 5-, and 7-year prediction horizons, respectively. For all models, the discriminatory power decreased as the prediction horizon increased from 3 to 7 years. Conclusions: Machine-learning techniques represent a useful addition to the commonly reported LR technique. Our prognostic models for the future risk of T2DM could be used to plan and implement early prevention programmes for at risk groups in the Arab population.
Collapse
Affiliation(s)
- Bassam Farran
- Research Division, Dasman Diabetes Institute, Kuwait City, Kuwait
| | - Rihab AlWotayan
- Research Division, Dasman Diabetes Institute, Kuwait City, Kuwait
- Department of Primary Health Care, Ministry of Health, Kuwait City, Kuwait
| | - Hessa Alkandari
- Research Division, Dasman Diabetes Institute, Kuwait City, Kuwait
- Department of Pediatrics, Farwaniya Hospital, Al Farwaniyah, Kuwait
| | - Dalia Al-Abdulrazzaq
- Research Division, Dasman Diabetes Institute, Kuwait City, Kuwait
- Department of Pediatrics, Faculty of Medicine, Kuwait University, Kuwait City, Kuwait
| | | | - Thangavel Alphonse Thanaraj
- Research Division, Dasman Diabetes Institute, Kuwait City, Kuwait
- *Correspondence: Thangavel Alphonse Thanaraj
| |
Collapse
|