1
|
Oliullah K, Rasel MH, Islam MM, Islam MR, Wadud MAH, Whaiduzzaman M. A stacked ensemble machine learning approach for the prediction of diabetes. J Diabetes Metab Disord 2024; 23:603-617. [PMID: 38932863 PMCID: PMC11196524 DOI: 10.1007/s40200-023-01321-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/06/2023] [Accepted: 09/22/2023] [Indexed: 06/28/2024]
Abstract
Objectives Diabetes has become a leading cause of mortality in both developed and developing countries, impacting a growing number of individuals worldwide. As the prevalence of the disease continues to rise, researchers have diligently worked towards developing accurate diabetes prediction models. The primary aim of this study is to utilize a diverse set of machine learning algorithms to detect the presence of diabetes, particularly in females, at an early stage. By leveraging these methods, this research seeks to provide physicians with valuable tools to identify the disease early, enabling timely interventions and improving patient outcomes. Methods In this study, some state-of-the-art machine learning techniques, such as random forest classifiers with gridsearchCV, XGBoost, NGBoost, Bagging, LightGBM, and AdaBoost classifiers, were employed. These models were chosen as the base layer of our proposed stacked ensemble model because of their high accuracy. Before feeding the data into the models, the dataset was preprocessed to ensure optimal performance and obtain improved results. Results The accuracy achieved in this study was 92.91%, which demonstrates its competitiveness with the existing approaches. Moreover, the utilization of the Shapley additive explanation (SHAP) facilitated the interpretation of machine learning models. Conclusion We anticipate that these findings will be beneficial to healthcare providers, stakeholders, students, and researchers involved in diabetes prediction research and development.
Collapse
Affiliation(s)
- Khondokar Oliullah
- Department of Computer Science and Engineering, Bangladesh University of Business and Technology, Dhaka, Bangladesh
| | - Mahedi Hasan Rasel
- Department of Computer Science and Engineering, Bangladesh University of Business and Technology, Dhaka, Bangladesh
| | - Md. Manzurul Islam
- Department of Computer Science and Engineering, Bangladesh University of Business and Technology, Dhaka, Bangladesh
| | - Md. Reazul Islam
- Department of Computer Science and Engineering, Bangladesh University of Business and Technology, Dhaka, Bangladesh
| | - Md. Anwar Hussen Wadud
- Department of Computer Science and Engineering, Bangladesh University of Business and Technology, Dhaka, Bangladesh
| | - Md. Whaiduzzaman
- School of Information Systems, Queensland University of Technology, Brisbane, Australia
| |
Collapse
|
2
|
Jiang L, Yang Z, Liu G, Xia Z, Yang G, Gong H, Wang J, Wang L. A feature optimization study based on a diabetes risk questionnaire. Front Public Health 2024; 12:1328353. [PMID: 38463161 PMCID: PMC10920272 DOI: 10.3389/fpubh.2024.1328353] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2023] [Accepted: 02/05/2024] [Indexed: 03/12/2024] Open
Abstract
Introduction The prevalence of diabetes, a common chronic disease, has shown a gradual increase, posing substantial burdens on both society and individuals. In order to enhance the effectiveness of diabetes risk prediction questionnaires, optimize the selection of characteristic variables, and raise awareness of diabetes risk among residents, this study utilizes survey data obtained from the risk factor monitoring system of the Centers for Disease Control and Prevention in the United States. Methods Following univariate analysis and meticulous screening, a more refined dataset was constructed. This dataset underwent preprocessing steps, including data distribution standardization, the application of the Synthetic Minority Oversampling Technique (SMOTE) in combination with the Round function for equilibration, and data standardization. Subsequently, machine learning (ML) techniques were employed, utilizing enumerated feature variables to evaluate the strength of the correlation among diabetes risk factors. Results The research findings effectively delineated the ranking of characteristic variables that significantly influence the risk of diabetes. Obesity emerges as the most impactful factor, overshadowing other risk factors. Additionally, psychological factors, advanced age, high cholesterol, high blood pressure, alcohol abuse, coronary heart disease or myocardial infarction, mobility difficulties, and low family income exhibit correlations with diabetes risk to varying degrees. Discussion The experimental data in this study illustrate that, while maintaining comparable accuracy, optimization of questionnaire variables and the number of questions can significantly enhance efficiency for subsequent follow-up and precise diabetes prevention. Moreover, the research methods employed in this study offer valuable insights into studying the risk correlation of other diseases, while the research results contribute to heightened societal awareness of populations at elevated risk of diabetes.
Collapse
Affiliation(s)
- Liangjun Jiang
- College of Information and Communication Engineering, State Key Lab of Marine Resource Utilization in South China Sea, Hainan University, Haikou, China
| | - Zerui Yang
- School of Electronics and Information, Yangtze University, Jingzhou, China
| | - Gang Liu
- Shenzhen Center for Disease Control and Prevention, Shenzhen, China
| | - Zhenhua Xia
- School of Electronics and Information, Yangtze University, Jingzhou, China
| | - Guangyao Yang
- School of Electronics and Information, Yangtze University, Jingzhou, China
| | - Haimei Gong
- College of Information and Communication Engineering, State Key Lab of Marine Resource Utilization in South China Sea, Hainan University, Haikou, China
| | - Jing Wang
- E-link Wisdom Co., Ltd., Shenzhen, China
| | - Lei Wang
- College of Information and Communication Engineering, State Key Lab of Marine Resource Utilization in South China Sea, Hainan University, Haikou, China
| |
Collapse
|
3
|
Thakur D, Gera T, Bhardwaj V, AlZubi AA, Ali F, Singh J. An enhanced diabetes prediction amidst COVID-19 using ensemble models. Front Public Health 2023; 11:1331517. [PMID: 38155892 PMCID: PMC10754515 DOI: 10.3389/fpubh.2023.1331517] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2023] [Accepted: 11/21/2023] [Indexed: 12/30/2023] Open
Abstract
In the contemporary landscape of healthcare, the early and accurate prediction of diabetes has garnered paramount importance, especially in the wake of the COVID-19 pandemic where individuals with diabetes exhibit increased vulnerability. This research embarked on a mission to enhance diabetes prediction by employing state-of-the-art machine learning techniques. Initial evaluations highlighted the Support Vector Machines (SVM) classifier as a promising candidate with an accuracy of 76.62%. To further optimize predictions, the study delved into advanced feature engineering techniques, generating interaction and polynomial features that unearthed hidden patterns in the data. Subsequent correlation analyses, visualized through heatmaps, revealed significant correlations, especially with attributes like Glucose. By integrating the strengths of Decision Trees, Gradient Boosting, and SVM in an ensemble model, we achieved an accuracy of 93.2%, showcasing the potential of harmonizing diverse algorithms. This research offers a robust blueprint for diabetes prediction, holding profound implications for early diagnosis, personalized treatments, and preventive care in the context of global health challenges and with the goal of increasing life expectancy.
Collapse
Affiliation(s)
- Deepak Thakur
- Chitkara University Institute of Engineering and Technology, Chitkara University, Punjab, India
| | - Tanya Gera
- Chitkara University Institute of Engineering and Technology, Chitkara University, Punjab, India
| | - Vivek Bhardwaj
- Department of Computer Science and Engineering, Manipal University Jaipur, Jaipur, Rajasthan
| | - Ahmad Ali AlZubi
- Department of Computer Science, Community College, King Saud University, Riyadh, Saudi Arabia
| | - Farman Ali
- Department of Computer Science and Engineering, School of Convergence, College of Computing and Informatics, Sungkyunkwan University, Seoul, Republic of Korea
| | - Jaiteg Singh
- Chitkara University Institute of Engineering and Technology, Chitkara University, Punjab, India
| |
Collapse
|
4
|
Feng X, Cai Y, Xin R. Optimizing diabetes classification with a machine learning-based framework. BMC Bioinformatics 2023; 24:428. [PMID: 37957549 PMCID: PMC10644638 DOI: 10.1186/s12859-023-05467-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2023] [Accepted: 09/04/2023] [Indexed: 11/15/2023] Open
Abstract
BACKGROUND Diabetes is a metabolic disorder usually caused by insufficient secretion of insulin from the pancreas or insensitivity of cells to insulin, resulting in long-term elevated blood sugar levels in patients. Patients usually present with frequent urination, thirst, and hunger. If left untreated, it can lead to various complications that can affect essential organs and even endanger life. Therefore, developing an intelligent diagnosis framework for diabetes is necessary. RESULT This paper proposes a machine learning-based diabetes classification framework machine learning optimized GAN. The framework encompasses several methodological approaches to address the diverse challenges encountered during the analysis. These approaches encompass the implementation of the mean and median joint filling method for handling missing values, the application of the cap method for outlier processing, and the utilization of SMOTEENN to mitigate sample imbalance. Additionally, the framework incorporates the employment of the proposed Diabetes Classification Model based on Generative Adversarial Network and employs logistic regression for detailed feature analysis. The effectiveness of the framework is evaluated using both the PIMA dataset and the diabetes dataset obtained from the GEO database. The experimental findings showcase our model achieved exceptional results, including a binary classification accuracy of 96.27%, tertiary classification accuracy of 99.31%, precision and f1 score of 0.9698, recall of 0.9698, and an AUC of 0.9702. CONCLUSION The experimental results show that the framework proposed in this paper can accurately classify diabetes and provide new ideas for intelligent diagnosis of diabetes.
Collapse
Affiliation(s)
- Xin Feng
- School of Science, Jilin Institute of Chemical Technology, Jilin, 130000, People's Republic of China
- State Key Laboratory of Inorganic Synthesis and Preparative Chemistry, College of Chemistry, Jilin University, Changchun, 130012, People's Republic of China
- Department of Epidemiology and Biostatistics, School of Public Health, Jilin University, Changchun, 130012, People's Republic of China
| | - Yihuai Cai
- School of Science, Jilin Institute of Chemical Technology, Jilin, 130000, People's Republic of China.
| | - Ruihao Xin
- College of Information and Control Engineering, Jilin Institute of Chemical Technology, Jilin, 130000, People's Republic of China.
- College of Computer Science and Technology, and Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun, 130012, People's Republic of China.
| |
Collapse
|
5
|
Patro KK, Allam JP, Sanapala U, Marpu CK, Samee NA, Alabdulhafith M, Plawiak P. An effective correlation-based data modeling framework for automatic diabetes prediction using machine and deep learning techniques. BMC Bioinformatics 2023; 24:372. [PMID: 37784049 PMCID: PMC10544445 DOI: 10.1186/s12859-023-05488-6] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2023] [Accepted: 09/19/2023] [Indexed: 10/04/2023] Open
Abstract
The rising risk of diabetes, particularly in emerging countries, highlights the importance of early detection. Manual prediction can be a challenging task, leading to the need for automatic approaches. The major challenge with biomedical datasets is data scarcity. Biomedical data is often difficult to obtain in large quantities, which can limit the ability to train deep learning models effectively. Biomedical data can be noisy and inconsistent, which can make it difficult to train accurate models. To overcome the above-mentioned challenges, this work presents a new framework for data modeling that is based on correlation measures between features and can be used to process data effectively for predicting diabetes. The standard, publicly available Pima Indians Medical Diabetes (PIMA) dataset is utilized to verify the effectiveness of the proposed techniques. Experiments using the PIMA dataset showed that the proposed data modeling method improved the accuracy of machine learning models by an average of 9%, with deep convolutional neural network models achieving an accuracy of 96.13%. Overall, this study demonstrates the effectiveness of the proposed strategy in the early and reliable prediction of diabetes.
Collapse
Affiliation(s)
- Kiran Kumar Patro
- Department of ECE, Aditya Institute of Technology and Management, Tekkali, AP, 532201, India
| | - Jaya Prakash Allam
- School of Computer Science and Engineering, VIT Vellore, Katpadi, Vellore, Tamil Nadu, 632014, India.
| | | | - Chaitanya Kumar Marpu
- Department of ECE, Aditya Institute of Technology and Management, Tekkali, AP, 532201, India
| | - Nagwan Abdel Samee
- Department of Information Technology, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, P.O. Box 84428, Riyadh, 11671, Saudi Arabia
| | - Maali Alabdulhafith
- Department of Information Technology, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, P.O. Box 84428, Riyadh, 11671, Saudi Arabia
| | - Pawel Plawiak
- Department of Computer Science, Faculty of Computer Science and Telecommunications, Cracow University of Technology, Warszawska 24, 31-155, Krakow, Poland
- Institute of Theoretical and Applied Informatics, Polish Academy of Sciences, Bałtycka 5, 44-100, Gliwice, Poland
| |
Collapse
|
6
|
Lilhore UK, Dalal S, Faujdar N, Margala M, Chakrabarti P, Chakrabarti T, Simaiya S, Kumar P, Thangaraju P, Velmurugan H. Hybrid CNN-LSTM model with efficient hyperparameter tuning for prediction of Parkinson's disease. Sci Rep 2023; 13:14605. [PMID: 37669970 PMCID: PMC10480168 DOI: 10.1038/s41598-023-41314-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2023] [Accepted: 08/24/2023] [Indexed: 09/07/2023] Open
Abstract
The patients' vocal Parkinson's disease (PD) changes could be identified early on, allowing for management before physically incapacitating symptoms appear. In this work, static as well as dynamic speech characteristics that are relevant to PD identification are examined. Speech changes or communication issues are among the challenges that Parkinson's individuals may encounter. As a result, avoiding the potential consequences of speech difficulties brought on by the condition depends on getting the appropriate diagnosis early. PD patients' speech signals change significantly from those of healthy individuals. This research presents a hybrid model utilizing improved speech signals with dynamic feature breakdown using CNN and LSTM. The proposed hybrid model employs a new, pre-trained CNN with LSTM to recognize PD in linguistic features utilizing Mel-spectrograms derived from normalized voice signal and dynamic mode decomposition. The proposed Hybrid model works in various phases, which include Noise removal, extraction of Mel-spectrograms, feature extraction using pre-trained CNN model ResNet-50, and the final stage is applied for classification. An experimental analysis was performed using the PC-GITA disease dataset. The proposed hybrid model is compared with traditional NN and well-known machine learning-based CART and SVM & XGBoost models. The accuracy level achieved in Neural Network, CART, SVM, and XGBoost models is 72.69%, 84.21%, 73.51%, and 90.81%. The results show that under these four machine approaches of tenfold cross-validation and dataset splitting without samples overlapping one individual, the proposed hybrid model achieves an accuracy of 93.51%, significantly outperforming traditional ML models utilizing static features in detecting Parkinson's disease.
Collapse
Affiliation(s)
- Umesh Kumar Lilhore
- Department of Computer Science and Engineering, Chandigarh University, Chandigarh, Punjab, India
| | - Surjeet Dalal
- Amity School of Engineering and Technology, Amity University Haryana, Gurugram, India
| | - Neetu Faujdar
- Department of Computer Engineering and Application, GLA University, Mathura, Uttar Pradesh, India
| | - Martin Margala
- School of Computing and Informatics, University of Louisiana at Lafayette, Lafayette, USA
| | - Prasun Chakrabarti
- Department of Computer Science and Engineering, Sir Padampat Singhania University, Udaipur, 313601, Rajasthan, India
| | | | - Sarita Simaiya
- Department of Computer Science and Engineering, Chandigarh University, Chandigarh, Punjab, India
- Apex Institute of Technology, Department of Computer Science and Engineering, Chandigarh University, Mohali, Punjab, India
| | - Pawan Kumar
- Department of Computer Science and Engineering, Chandigarh University, Chandigarh, Punjab, India
- College of Computing Sciences & IT, Teerthanker Mahaveer University, Moradabad, Uttar Pradesh, India
| | - Pugazhenthan Thangaraju
- Department of Pharmacology, All India Institute of Medical Sciences, Raipur, Chhattisgarh, India.
| | - Hemasri Velmurugan
- Department of Pharmacology, All India Institute of Medical Sciences, Raipur, Chhattisgarh, India
| |
Collapse
|
7
|
Pratap-Singh A, Guo Y, Baldelli A, Singh A. Concept for a Unidirectional Release Mucoadhesive Buccal Tablet for Oral Delivery of Antidiabetic Peptide Drugs Such as Insulin, Glucagon-like Peptide 1 (GLP-1), and their Analogs. Pharmaceutics 2023; 15:2265. [PMID: 37765234 PMCID: PMC10534625 DOI: 10.3390/pharmaceutics15092265] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2023] [Revised: 08/11/2023] [Accepted: 08/12/2023] [Indexed: 09/29/2023] Open
Abstract
Injectable peptides such as insulin, glucagon-like peptide 1 (GLP-1), and their agonists are being increasingly used for the treatment of diabetes. Currently, the most common route of administration is injection, which is linked to patient discomfort as well as being subjected to refrigerated storage and the requirement for efficient supply chain logistics. Buccal and sublingual routes are recognized as valid alternatives due to their high accessibility and easy administration. However, there can be several challenges, such as peptide selection, drug encapsulation, and delivery system design, which are linked to the enhancement of drug efficacy and efficiency. By using hydrophobic polymers that do not dissolve in saliva, and by using neutral or positively charged nanoparticles that show better adhesion to the negative charges generated by the sialic acid in the mucus, researchers have attempted to improve drug efficiency and efficacy in buccal delivery. Furthermore, unidirectional films and tablets seem to show the highest bioavailability as compared to sprays and other buccal delivery vehicles. This advantageous attribute can be attributed to their capability to mitigate the impact of saliva and inadvertent gastrointestinal enzymatic digestion, thereby minimizing drug loss. This is especially pertinent as these formulations ensure a more directed drug delivery trajectory, leading to heightened therapeutic outcomes. This communication describes the current state of the art with respect to the creation of nanoparticles containing peptides such as insulin, glucagon-like peptide 1 (GLP-1), and their agonists, and theorizes the production of mucoadhesive unidirectional release buccal tablets or films. Such an approach is more patient-friendly and can improve the lives of millions of diabetics around the world; in addition, these shelf-stable formulations ena a more environmentally friendly and sustainable supply chain network.
Collapse
Affiliation(s)
- Anubhav Pratap-Singh
- Food, Nutrition, and Health Program, Faculty of Land & Food Systems, The University of British Columbia, 2205 East Mall, Vancouver, BC V6T 1Z4, Canada
| | - Yigong Guo
- Food, Nutrition, and Health Program, Faculty of Land & Food Systems, The University of British Columbia, 2205 East Mall, Vancouver, BC V6T 1Z4, Canada
- Natural Health and Food Products Research Group, Centre for Applied Research & Innovation (CARI), British Columbia Institute of Technology, Burnaby, BC V5G 3H2, Canada
| | - Alberto Baldelli
- Food, Nutrition, and Health Program, Faculty of Land & Food Systems, The University of British Columbia, 2205 East Mall, Vancouver, BC V6T 1Z4, Canada
| | - Anika Singh
- Food, Nutrition, and Health Program, Faculty of Land & Food Systems, The University of British Columbia, 2205 East Mall, Vancouver, BC V6T 1Z4, Canada
- Natural Health and Food Products Research Group, Centre for Applied Research & Innovation (CARI), British Columbia Institute of Technology, Burnaby, BC V5G 3H2, Canada
| |
Collapse
|
8
|
Wajeed MA, Tiwari S, Gupta R, Ahmad AJ, Agarwal S, Jamal SS, Hinga SK. A Breast Cancer Image Classification Algorithm with 2c Multiclass Support Vector Machine. JOURNAL OF HEALTHCARE ENGINEERING 2023; 2023:3875525. [PMID: 37457494 PMCID: PMC10349674 DOI: 10.1155/2023/3875525] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/10/2022] [Revised: 03/14/2022] [Accepted: 03/17/2022] [Indexed: 07/18/2023]
Abstract
Breast cancer is the most frequent type of cancer in women; however, early identification has reduced the mortality rate associated with the condition. Studies have demonstrated that the earlier this sickness is detected by mammography, the lower the death rate. Breast mammography is a critical technique in the early identification of breast cancer since it can detect abnormalities in the breast months or years before a patient is aware of the presence of such abnormalities. Mammography is a type of breast scanning used in medical imaging that involves using x-rays to image the breasts. It is a method that produces high-resolution digital pictures of the breasts known as mammography. Immediately following the capture of digital images and transmission of those images to a piece of high-tech digital mammography equipment, our radiologists evaluate the photos to establish the specific position and degree of the sickness in the breast. When compared to the many classifiers typically used in the literature, the suggested Multiclass Support Vector Machine (MSVM) approach produces promising results, according to the authors. This method may pave the way for developing more advanced statistical characteristics based on most cancer prognostic models shortly. It is demonstrated in this paper that the suggested 2C algorithm with MSVM outperforms a decision tree model in terms of accuracy, which follows prior findings. According to our findings, new screening mammography technologies can increase the accuracy and accessibility of screening mammography around the world.
Collapse
Affiliation(s)
- Mohammed Abdul Wajeed
- Department of Computer Science and Engineering, Swami Vivekananda Institute of Technology, Secunderabad, Telangana, India
| | - Shivam Tiwari
- Department of Computer Science and Engineering, G L Bajaj Institute of Technology and Management, Greater Noida, Uttar Pradesh, India
| | - Rajat Gupta
- Engineering and Technology, Career Point University, Kota, Rajasthan, India
| | - Aamir Junaid Ahmad
- Department of Computer Science and Engineering, Maulana Azad College of Engineering and Technology, Patna, India
| | - Seema Agarwal
- SRM institute of Science and Technology, Delhi-NCR, Campus, Ghaziabad, India
| | - Sajjad Shaukat Jamal
- Department of Mathematics College of Science, King Khalid University, Abha, Saudi Arabia
| | - Simon Karanja Hinga
- Department of Electrical and Electronic Engineering, Technical University of Mombasa, Mombasa, Kenya
| |
Collapse
|
9
|
Healthcare Engineering JO. Retracted: A Novel Diabetes Healthcare Disease Prediction Framework Using Machine Learning Techniques. JOURNAL OF HEALTHCARE ENGINEERING 2023; 2023:9872970. [PMID: 37266255 PMCID: PMC10232100 DOI: 10.1155/2023/9872970] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/23/2023] [Accepted: 05/23/2023] [Indexed: 06/03/2023]
Abstract
[This retracts the article DOI: 10.1155/2022/1684017.].
Collapse
|
10
|
Robust Classification and Detection of Big Medical Data Using Advanced Parallel K-Means Clustering, YOLOv4, and Logistic Regression. Life (Basel) 2023; 13:life13030691. [PMID: 36983845 PMCID: PMC10056696 DOI: 10.3390/life13030691] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2023] [Revised: 02/24/2023] [Accepted: 02/28/2023] [Indexed: 03/08/2023] Open
Abstract
Big-medical-data classification and image detection are crucial tasks in the field of healthcare, as they can assist with diagnosis, treatment planning, and disease monitoring. Logistic regression and YOLOv4 are popular algorithms that can be used for these tasks. However, these techniques have limitations and performance issue with big medical data. In this study, we presented a robust approach for big-medical-data classification and image detection using logistic regression and YOLOv4, respectively. To improve the performance of these algorithms, we proposed the use of advanced parallel k-means pre-processing, a clustering technique that identified patterns and structures in the data. Additionally, we leveraged the acceleration capabilities of a neural engine processor to further enhance the speed and efficiency of our approach. We evaluated our approach on several large medical datasets and showed that it could accurately classify large amounts of medical data and detect medical images. Our results demonstrated that the combination of advanced parallel k-means pre-processing, and the neural engine processor resulted in a significant improvement in the performance of logistic regression and YOLOv4, making them more reliable for use in medical applications. This new approach offers a promising solution for medical data classification and image detection and may have significant implications for the field of healthcare.
Collapse
|
11
|
Application of Machine Learning for Cardiovascular Disease Risk Prediction. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2023. [DOI: 10.1155/2023/9418666] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/05/2023]
Abstract
Cardiovascular diseases (CVDs) are a common cause of heart failure globally. The need to explore possible ways to tackle the disease necessitated this study. The study designed a machine learning model for cardiovascular disease risk prediction in accordance with a dataset that contains 11 features which may be used to forecast the disease. The dataset from Kaggle on cardiovascular disease includes approximately 70,000 patient records that were used to determine the outcome. Compared to the UCI dataset, the Kaggle dataset has many more training and validation records. Models created using neural networks, random forests, Bayesian networks, C5.0, and QUEST were compared for this dataset. On training and testing data sets, the results acquired a high accuracy (99.1 percent), which is significantly superior to previous methods. Ahead-of-time detection and diagnosis of cardiac disease, as well as better treatment outcomes, are strong possibilities for the suggested prediction model. Additionally, it may help patients better manage their illness or life forms in order to increase their chances of recovery/survival. The result showed greater accuracy and promising signs that machine-learning algorithms can indeed assist in early identification of the disease and improvement of the treatment outcome.
Collapse
|
12
|
Olatunji SO, Alsheikh N, Alnajrani L, Alanazy A, Almusairii M, Alshammasi S, Alansari A, Zaghdoud R, Alahmadi A, Basheer Ahmed MI, Ahmed MS, Alhiyafi J. Comprehensible Machine-Learning-Based Models for the Pre-Emptive Diagnosis of Multiple Sclerosis Using Clinical Data: A Retrospective Study in the Eastern Province of Saudi Arabia. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2023; 20:4261. [PMID: 36901273 PMCID: PMC10002108 DOI: 10.3390/ijerph20054261] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/27/2023] [Revised: 02/22/2023] [Accepted: 02/24/2023] [Indexed: 06/18/2023]
Abstract
Multiple Sclerosis (MS) is characterized by chronic deterioration of the nervous system, mainly the brain and the spinal cord. An individual with MS develops the condition when the immune system begins attacking nerve fibers and the myelin sheathing that covers them, affecting the communication between the brain and the rest of the body and eventually causing permanent damage to the nerve. Patients with MS (pwMS) might experience different symptoms depending on which nerve was damaged and how much damage it has sustained. Currently, there is no cure for MS; however, there are clinical guidelines that help control the disease and its accompanying symptoms. Additionally, no specific laboratory biomarker can precisely identify the presence of MS, leaving specialists with a differential diagnosis that relies on ruling out other possible diseases with similar symptoms. Since the emergence of Machine Learning (ML) in the healthcare industry, it has become an effective tool for uncovering hidden patterns that aid in diagnosing several ailments. Several studies have been conducted to diagnose MS using ML and Deep Learning (DL) models trained using MRI images, achieving promising results. However, complex and expensive diagnostic tools are needed to collect and examine imaging data. Thus, the intention of this study is to implement a cost-effective, clinical data-driven model that is capable of diagnosing pwMS. The dataset was obtained from King Fahad Specialty Hospital (KFSH) in Dammam, Saudi Arabia. Several ML algorithms were compared, namely Support Vector Machine (SVM), Decision Tree (DT), Logistic Regression (LR), Random Forest (RF), Extreme Gradient Boosting (XGBoost), Adaptive Boosting (AdaBoost), and Extra Trees (ET). The results indicated that the ET model outpaced the rest with an accuracy of 94.74%, recall of 97.26%, and precision of 94.67%.
Collapse
Affiliation(s)
- Sunday O. Olatunji
- College of Computer Science and Information Technology, Imam Abdulrahman Bin Faisal University, P.O. Box 1982, Dammam 31441, Saudi Arabia
| | - Nawal Alsheikh
- College of Computer Science and Information Technology, Imam Abdulrahman Bin Faisal University, P.O. Box 1982, Dammam 31441, Saudi Arabia
| | - Lujain Alnajrani
- College of Computer Science and Information Technology, Imam Abdulrahman Bin Faisal University, P.O. Box 1982, Dammam 31441, Saudi Arabia
| | - Alhatoon Alanazy
- College of Computer Science and Information Technology, Imam Abdulrahman Bin Faisal University, P.O. Box 1982, Dammam 31441, Saudi Arabia
| | - Meshael Almusairii
- College of Computer Science and Information Technology, Imam Abdulrahman Bin Faisal University, P.O. Box 1982, Dammam 31441, Saudi Arabia
| | - Salam Alshammasi
- College of Computer Science and Information Technology, Imam Abdulrahman Bin Faisal University, P.O. Box 1982, Dammam 31441, Saudi Arabia
| | - Aisha Alansari
- College of Computer Science and Information Technology, Imam Abdulrahman Bin Faisal University, P.O. Box 1982, Dammam 31441, Saudi Arabia
| | - Rim Zaghdoud
- College of Computer Science and Information Technology, Imam Abdulrahman Bin Faisal University, P.O. Box 1982, Dammam 31441, Saudi Arabia
| | - Alaa Alahmadi
- College of Computer Science and Information Technology, Imam Abdulrahman Bin Faisal University, P.O. Box 1982, Dammam 31441, Saudi Arabia
| | - Mohammed Imran Basheer Ahmed
- College of Computer Science and Information Technology, Imam Abdulrahman Bin Faisal University, P.O. Box 1982, Dammam 31441, Saudi Arabia
| | - Mohammed Salih Ahmed
- College of Computer Science and Information Technology, Imam Abdulrahman Bin Faisal University, P.O. Box 1982, Dammam 31441, Saudi Arabia
| | - Jamal Alhiyafi
- Department of Computer Science, Kettering University, Flint, MI 48504, USA
| |
Collapse
|
13
|
Machine Learning Modeling of Disease Treatment Default: A Comparative Analysis of Classification Models. ADVANCES IN PUBLIC HEALTH 2023. [DOI: 10.1155/2023/4168770] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
Generally, treatment default of diseases by patients is regarded as the biggest threat to favourable disease treatment outcomes. It is seen as the reason for the resurgence of infectious diseases including tuberculosis in some developing countries. Sadly, its occurrence in chronic disease management is associated with high morbidity and mortality rates. Many reasons have been adduced for this phenomenon. Exploration of treatment default using biographic and behavioral metrics collected from patients and healthcare providers remains a challenge. The focus on contextual nonbiomedical measurements using a supervised machine learning modeling technique is aimed at creating an understanding of the reasons why treatment default occurs, including identifying important contextual parameters that contribute to treatment default. The predicted accuracy scores of four supervised machine learning algorithms, namely, gradient boosting, logistic regression, random forest, and support vector machine were 0.87, 0.90, 0.81, and 0.77, respectively. Additionally, performance indicators such as the positive predicted value score for the four models ranged between 98.72%–98.87%, and the negative predicted values of gradient boosting, logistic regression, random forest, and support vector machine were 50%, 75%, 22.22%, and 50%, respectively. Logistic regression appears to have the highest negative-predicted value score of 75%, with the smallest error margin of 25% and the highest accuracy score of 0.90, and the random forest had the lowest negative predicted value score of 22.22%, registering the highest error margin of 77.78%. By performing a chi-square correlation statistic test of variable independence, this study suggests that age, presence of comorbidities, concern for long queuing/waiting time at treatment facilities, availability of qualified clinicians, and the patient’s nutritional state whether on a controlled diet or not are likely to affect their adherence to disease treatment and could result in an increased risk of default.
Collapse
|
14
|
Investigation on the Mechanisms of Zanthoxylum bungeanum for Treating Diabetes Mellitus Based on Network Pharmacology, Molecular Docking, and Experiment Verification. BIOMED RESEARCH INTERNATIONAL 2023; 2023:9298728. [PMID: 36874926 PMCID: PMC9977524 DOI: 10.1155/2023/9298728] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/24/2022] [Revised: 10/25/2022] [Accepted: 11/24/2022] [Indexed: 02/24/2023]
Abstract
Objective The aim of the study was to explore the potential mechanism of Zanthoxylum bungeanum in the treatment of diabetes mellitus (DM) using network pharmacology. Methods The DrugBank database and TCMSP platform were used to search for the main chemical components and their targets of Zanthoxylum bungeanum, and the genes related to diabetes mellitus were obtained from the genecards database. Import the data into the Venny 2.1.0 platform for intersection analysis to obtain the Zanthoxylum bungeanum-DM-gene dataset. The protein-protein interaction (PPI) analysis of Zanthoxylum bungeanum-DM gene was performed using the String data platform, and the visualization and network topology analysis were performed using Cytoscape 3.8.2. The KEGG pathway enrichment and biological process of GO enrichment analysis were carried out using the David platform. The active ingredients and key targets of Zanthoxylum bungeanum were molecularly docked to verify their biological activities by using Discovery Studio 2019 software. Zanthoxylum bungeanum was extracted and isolated by ethanol and dichloromethane. HepG2 cells were cultured, and cell viability assay was utilized to choose the suitable concentration of Zanthoxylum bungeanum extract (ZBE). The western blot assay was used for measuring the expression of AKT1, IL6, HSP90AA1, FOS, and JUN proteins in HepG2 cells. Results A total of 5 main compounds, 339 targets, and 16656 disease genes were obtained and retrieved, respectively. A total of 187 common genes were screened, and 20 core genes were finally obtained after further screening. The antidiabetic active ingredients of Zanthoxylum bungeanum are kokusaginin, skimmianin, diosmetin, beta-sitosterol, and quercetin, respectively. The main targets for its antidiabetic effect are AKT1, IL6, HSP90AA1, FOS, and JUN, respectively. GO enrichment analysis revealed that the biological process of Zanthoxylum bungeanum and DM is related to a positive regulation of gene expression, positive regulation of transcription, positive regulation of transcription from RNA polymerase II promoter, response to drug, positive regulation of apoptotic process, and positive regulation of cell proliferation, etc. KEGG enrichment analysis revealed that common biological pathways mainly including the phospholipase D signaling pathway, MAPK signaling pathway, beta-alanine metabolism, estrogen signaling pathway, PPAR signaling pathway, and TNF signaling pathway. Molecular docking results showed that AKT1 with beta-sitosterol and quercetin, IL-6 with diosmetin and skimmianin, HSP90AA1 with diosmetin and quercetin, FOS with beta-sitosterol and quercetin, and JUN with beta-sitosterol and diosmetin have relatively strong binding activity, respectively. Experiment verification results showed that DM could be significantly improved by downregulating the expression of AKT1, IL6, HSP90AA1, FOS, and JUN proteins after being treated at concentrations of 20 μmol/L and 40 μmol/L of ZBE. Conclusion The active components of Zanthoxylum bungeanum mainly including kokusaginin, skimmianin, diosmetin, beta-sitosterol, and quercetin. The therapeutic effect of Zanthoxylum bungeanum on DM may be achieved by downregulating core target genes including AKT1, IL6, HSP90AA1, FOS, and JUN, respectively. Zanthoxylum bungeanum is an effective drug in treatment of DM related to the above targets.
Collapse
|
15
|
Glaucoma Detection and Classification Using Improved U-Net Deep Learning Model. Healthcare (Basel) 2022; 10:healthcare10122497. [PMID: 36554021 PMCID: PMC9778546 DOI: 10.3390/healthcare10122497] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2022] [Revised: 11/19/2022] [Accepted: 12/05/2022] [Indexed: 12/14/2022] Open
Abstract
Glaucoma is prominent in a variety of nations, with the United States and Europe being two of the most famous. Glaucoma now affects around 78 million people throughout the world (2020). By the year 2040, it is expected that there will be 111.8 million cases of glaucoma worldwide. In countries that are still building enough healthcare infrastructure to cope with glaucoma, the ailment is misdiagnosed nine times out of ten. To aid in the early diagnosis of glaucoma, the creation of a detection system is necessary. In this work, the researchers propose using a technology known as deep learning to identify and predict glaucoma before symptoms appear. The glaucoma dataset is used in this deep learning algorithm that has been proposed for analyzing glaucoma images. To get the required results when using deep learning principles for the job of segmenting the optic cup, pretrained transfer learning models are integrated with the U-Net architecture. For feature extraction, the DenseNet-201 deep convolution neural network (DCNN) is used. The DCNN approach is used to determine whether a person has glaucoma. The fundamental goal of this line of research is to recognize glaucoma in retinal fundus images, which will aid in assessing whether a patient has the condition. Because glaucoma can affect the model in both positive and negative ways, the model's outcome might be either positive or negative. Accuracy, precision, recall, specificity, the F-measure, and the F-score are some of the metrics used in the model evaluation process. An extra comparison study is performed as part of the process of establishing whether the suggested model is accurate. The findings are compared to convolution neural network classification methods based on deep learning. When used for training, the suggested model has an accuracy of 98.82 percent and an accuracy of 96.90 percent when used for testing. All assessments show that the new paradigm that has been proposed is more successful than the one that is currently in use.
Collapse
|
16
|
Ragab M, Choudhry H, Al-Rabia MW, Binyamin SS, Aldarmahi AA, Mansour RF. Early and accurate detection of melanoma skin cancer using hybrid level set approach. Front Physiol 2022; 13:965630. [PMID: 36545278 PMCID: PMC9760861 DOI: 10.3389/fphys.2022.965630] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2022] [Accepted: 11/21/2022] [Indexed: 12/12/2022] Open
Abstract
Digital dermoscopy is used to identify cancer in skin lesions, and sun exposure is one of the leading causes of melanoma. It is crucial to distinguish between healthy skin and malignant lesions when using computerised lesion detection and classification. Lesion segmentation influences categorization accuracy and precision. This study introduces a novel way of classifying lesions. Hair filters, gel, bubbles, and specular reflection are all options. An improved levelling method is employed in an innovative method for detecting and removing cancerous hairs. The lesion is distinguished from the surrounding skin by the adaptive sigmoidal function; this function considers the severity of localised lesions. An improved technique for identifying a lesion from surrounding tissue is proposed in the article, followed by a classifier and available features that resulted in 94.40% accuracy and 93% success. According to research, the best method for selecting features and classifications can produce more accurate predictions before and during treatment. When the recommended strategy is put to the test using the Melanoma Skin Cancer Dataset, the recommended technique outperforms the alternative.
Collapse
Affiliation(s)
- Mahmoud Ragab
- Information Technology Department, Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah, Saudi Arabia,Centre for Artificial Intelligence in Precision Medicines, King Abdulaziz University, Jeddah, Saudi Arabia,Mathematics Department, Faculty of Science, Al-Azhar University, Nasr City, Egypt,*Correspondence: Mahmoud Ragab,
| | - Hani Choudhry
- Centre for Artificial Intelligence in Precision Medicines, King Abdulaziz University, Jeddah, Saudi Arabia,Biochemistry Department, Faculty of Science, King Abdulaziz University, Jeddah, Saudi Arabia
| | - Mohammed W. Al-Rabia
- Department of Medical Microbiology and Parasitology, Faculty of Medicine, King Abdulaziz University, Jeddah, Saudi Arabia,Health Promotion Center, King Abdulaziz University, Jeddah, Saudi Arabia
| | - Sami Saeed Binyamin
- Computer and Information Technology Department, The Applied College, King Abdulaziz University, Jeddah, Saudi Arabia
| | - Ahmed A. Aldarmahi
- Basic Science Department, College of Science and Health Professions, King Saud Bin Abdulaziz University for Health Sciences, Jeddah, Saudi Arabia,King Abdullah International Medical Research Center, Ministry of National Guard—Health Affairs, Jeddah, Saudi Arabia
| | - Romany F. Mansour
- Department of Mathematics, Faculty of Science, New Valley University, El-Kharga, Egypt
| |
Collapse
|
17
|
Mary SR, Pachar S, Srivastava PK, Malik M, Sharma A, G. Almutiri T, Atal Z. Deep Learning Model for the Image Fusion and Accurate Classification of Remote Sensing Images. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:2668567. [PMID: 36458232 PMCID: PMC9708323 DOI: 10.1155/2022/2668567] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/03/2022] [Revised: 09/10/2022] [Accepted: 09/30/2022] [Indexed: 07/30/2023]
Abstract
Deep learning is widely used for the classification of images that have various attributes. Image data are used to extract colour, texture, form, and local features. These features are combined in feature-level image fusion to create a merged remote sensing image. A trained depth belief network (DBN) processes and divides fusion images, while a Softmax classifier determines the land type. As tested, the proposed approach can categorise all types of land. Traditional methods of detecting distant sensing photographs have limitations that can be overcome by using convolutional neural networks (CNN). Traditional techniques are incapable of combining deep learning elements while doing badly in classification. After PCA decreases data dimensionality, deep learning is applied to generate effective features that employ deep learning after PCA has reduced the dimensionality of the data. Principal component analysis is commonly used because of its effectiveness in attaining linear dimension reduction. It may be used on its own or as a starting point for further study into various different dimensionality reduction approaches. Data can be altered by remapping onto a new set of orthogonal axes using a process known as projection-based principal component analysis. Following remote sensing of land resources, the pictures were classified using a support vector machine. Euroset satellite images are used to assess the suggested approach. Accuracy and kappa have both increased. It was accurate and within 95.83 % of the planned figures. The classification findings' kappa value and reasoning time were 95.87 % and 128 milliseconds, respectively. Both the model's performance and the classification effect are excellent.
Collapse
Affiliation(s)
- S. Roselin Mary
- Department of CSE, Anand Institute of Higher Technology, Kalasalingam Nagar, Kazhipattur, OMR, Chennai 603103, India
| | - Sunita Pachar
- President Vasundhara Blessing Foundation, Academician GLA University, Chaumuhan, India
| | | | - Medhavi Malik
- School of Computing Science and Engineering, Galgotias University, Noida, India
| | - Avani Sharma
- Department of Information Technology, Manipal University Jaipur, Jaipur, India
| | - Tariq G. Almutiri
- Department of Computer Science, Community College, King Saud University, Riyadh 11437, Saudi Arabia
| | - Zabihullah Atal
- Department of Computer Science, Faculty of Computer Science, Kardan University, Afghanistan
| |
Collapse
|
18
|
Region-Based Segmentation and Classification for Ovarian Cancer Detection Using Convolution Neural Network. CONTRAST MEDIA & MOLECULAR IMAGING 2022; 2022:5968939. [PMID: 36475297 PMCID: PMC9701126 DOI: 10.1155/2022/5968939] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/15/2022] [Revised: 07/15/2022] [Accepted: 07/18/2022] [Indexed: 11/21/2022]
Abstract
Ovarian cancer is a serious sickness for elderly women. According to data, it is the seventh leading cause of death in women as well as the fifth most frequent disease worldwide. Many researchers classified ovarian cancer using Artificial Neural Networks (ANNs). Doctors consider classification accuracy to be an important aspect of making decisions. Doctors consider improved classification accuracy for providing proper treatment. Early and precise diagnosis lowers mortality rates and saves lives. On basis of ROI (region of interest) segmentation, this research presents a novel annotated ovarian image classification utilizing FaRe-ConvNN (rapid region-based Convolutional neural network). The input photos were divided into three categories: epithelial, germ, and stroma cells. This image is segmented as well as preprocessed. After that, FaRe-ConvNN is used to perform the annotation procedure. For region-based classification, the method compares manually annotated features as well as trained feature in FaRe-ConvNN. This will aid in the analysis of higher accuracy in disease identification, as human annotation has lesser accuracy in previous studies; therefore, this effort will empirically prove that ML classification will provide higher accuracy. Classification is done using a combination of SVC and Gaussian NB classifiers after the region-based training in FaRe-ConvNN. The ensemble technique was employed in feature classification due to better data indexing. To diagnose ovarian cancer, the simulation provides an accurate portion of the input image. FaRe-ConvNN has a precision value of more than 95%, SVC has a precision value of 95.96%, and Gaussian NB has a precision value of 97.7%, with FR-CNN enhancing precision in Gaussian NB. For recall/sensitivity, SVC is 94.31 percent and Gaussian NB is 97.7 percent, while for specificity, SVC is 97.39 percent and Gaussian NB is 98.69 percent using FaRe-ConvNN.
Collapse
|
19
|
Qin Y, Wu J, Xiao W, Wang K, Huang A, Liu B, Yu J, Li C, Yu F, Ren Z. Machine Learning Models for Data-Driven Prediction of Diabetes by Lifestyle Type. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2022; 19:ijerph192215027. [PMID: 36429751 PMCID: PMC9690067 DOI: 10.3390/ijerph192215027] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/06/2022] [Revised: 11/04/2022] [Accepted: 11/10/2022] [Indexed: 06/01/2023]
Abstract
The prevalence of diabetes has been increasing in recent years, and previous research has found that machine-learning models are good diabetes prediction tools. The purpose of this study was to compare the efficacy of five different machine-learning models for diabetes prediction using lifestyle data from the National Health and Nutrition Examination Survey (NHANES) database. The 1999-2020 NHANES database yielded data on 17,833 individuals data based on demographic characteristics and lifestyle-related variables. To screen training data for machine models, the Akaike Information Criterion (AIC) forward propagation algorithm was utilized. For predicting diabetes, five machine-learning models (CATBoost, XGBoost, Random Forest (RF), Logistic Regression (LR), and Support Vector Machine (SVM)) were developed. Model performance was evaluated using accuracy, sensitivity, specificity, precision, F1 score, and receiver operating characteristic (ROC) curve. Among the five machine-learning models, the dietary intake levels of energy, carbohydrate, and fat, contributed the most to the prediction of diabetes patients. In terms of model performance, CATBoost ranks higher than RF, LG, XGBoost, and SVM. The best-performing machine-learning model among the five is CATBoost, which achieves an accuracy of 82.1% and an AUC of 0.83. Machine-learning models based on NHANES data can assist medical institutions in identifying diabetes patients.
Collapse
Affiliation(s)
- Yifan Qin
- College of Physical Education, Shenzhen University, Shenzhen 518000, China
| | - Jinlong Wu
- College of Physical Education, Southwest University, Chongqing 400715, China
| | - Wen Xiao
- College of Physical Education, Shenzhen University, Shenzhen 518000, China
| | - Kun Wang
- Physical Education College, Yanching Institute of Technology, Langfang 065201, China
| | - Anbing Huang
- College of Physical Education, Shenzhen University, Shenzhen 518000, China
| | - Bowen Liu
- College of Physical Education, Shenzhen University, Shenzhen 518000, China
| | - Jingxuan Yu
- College of Physical Education, Shenzhen University, Shenzhen 518000, China
| | - Chuhao Li
- College of Physical Education, Shenzhen University, Shenzhen 518000, China
| | - Fengyu Yu
- College of Physical Education, Shenzhen University, Shenzhen 518000, China
| | - Zhanbing Ren
- College of Physical Education, Shenzhen University, Shenzhen 518000, China
| |
Collapse
|
20
|
Kibria HB, Nahiduzzaman M, Goni MOF, Ahsan M, Haider J. An Ensemble Approach for the Prediction of Diabetes Mellitus Using a Soft Voting Classifier with an Explainable AI. SENSORS (BASEL, SWITZERLAND) 2022; 22:7268. [PMID: 36236367 PMCID: PMC9571784 DOI: 10.3390/s22197268] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/14/2022] [Revised: 09/20/2022] [Accepted: 09/21/2022] [Indexed: 06/16/2023]
Abstract
Diabetes is a chronic disease that continues to be a primary and worldwide health concern since the health of the entire population has been affected by it. Over the years, many academics have attempted to develop a reliable diabetes prediction model using machine learning (ML) algorithms. However, these research investigations have had a minimal impact on clinical practice as the current studies focus mainly on improving the performance of complicated ML models while ignoring their explainability to clinical situations. Therefore, the physicians find it difficult to understand these models and rarely trust them for clinical use. In this study, a carefully constructed, efficient, and interpretable diabetes detection method using an explainable AI has been proposed. The Pima Indian diabetes dataset was used, containing a total of 768 instances where 268 are diabetic, and 500 cases are non-diabetic with several diabetic attributes. Here, six machine learning algorithms (artificial neural network (ANN), random forest (RF), support vector machine (SVM), logistic regression (LR), AdaBoost, XGBoost) have been used along with an ensemble classifier to diagnose the diabetes disease. For each machine learning model, global and local explanations have been produced using the Shapley additive explanations (SHAP), which are represented in different types of graphs to help physicians in understanding the model predictions. The balanced accuracy of the developed weighted ensemble model was 90% with a F1 score of 89% using a five-fold cross-validation (CV). The median values were used for the imputation of the missing values and the synthetic minority oversampling technique (SMOTETomek) was used to balance the classes of the dataset. The proposed approach can improve the clinical understanding of a diabetes diagnosis and help in taking necessary action at the very early stages of the disease.
Collapse
Affiliation(s)
- Hafsa Binte Kibria
- Department of Electrical & Computer Engineering, Rajshahi University of Engineering & Technology, Rajshahi 6204, Bangladesh
| | - Md Nahiduzzaman
- Department of Electrical & Computer Engineering, Rajshahi University of Engineering & Technology, Rajshahi 6204, Bangladesh
| | - Md. Omaer Faruq Goni
- Department of Electrical & Computer Engineering, Rajshahi University of Engineering & Technology, Rajshahi 6204, Bangladesh
| | - Mominul Ahsan
- Department of Computer Science, University of York, Deramore Lane, Heslington, York YO10 5GH, UK
| | - Julfikar Haider
- Department of Engineering, Manchester Metropolitan University, Manchester M1 5GD, UK
| |
Collapse
|
21
|
Goel S, Bharti RK, Rao ALN. Advancement in Healthcare Systems by Automated Disease Diagnostic Process Using Machine Learning. INTERNATIONAL JOURNAL OF E-ADOPTION 2022. [DOI: 10.4018/ijea.310002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
E-adoption of emerging technology plays an important role during the pandemic. The COVID-19 pandemic taught us that everyone must make himself healthy and immune to viral disease. Diabetes is the most common disease in the Indian population found in people of every age. The objective of this research work is to use the emerging technologies such as machine learning to implement e-adoption in the healthcare system. The proposed methodology can predict the diabetes disease by using vital parameters like age, glucose level, blood pressure, etc. This proposed model is implemented into Python programming language and various machine learning classifiers such as random forest, decision tree, logistic regression, and XGBoost are used on PIMA database. Thereafter, comparative analysis is performed to test which technique is better for predicting and diagnosing diabetes disease. The method founds XGBoost classifier gives the highest accuracy (i.e., 84%) among all classifiers with single database and single classifier.
Collapse
Affiliation(s)
- Sachin Goel
- Veer Madho Singh Bhandari Uttarakhand Technical University, Dehradun, India
| | - R. K. Bharti
- Bipin Tripathi Kumaon Institute of Technology, Dwarahat, India
| | - A. L. N. Rao
- Lloyd institute of Engineering and Technology, Greater Noida, India
| |
Collapse
|
22
|
Deep Sentiment Analysis of Twitter Data Using a Hybrid Ghost Convolution Neural Network Model. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:6595799. [PMID: 35898769 PMCID: PMC9313995 DOI: 10.1155/2022/6595799] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/15/2022] [Revised: 05/31/2022] [Accepted: 06/13/2022] [Indexed: 11/17/2022]
Abstract
Several problems remain, despite the evident advantages of sentiment analysis of public opinion represented on Twitter and Facebook. On complicated training data, hybrid approaches may reduce sentiment mistakes. This research assesses the dependability of numerous hybrid approaches on a variety of datasets. Across domains and datasets, we compare hybrid models to singles. Text tweets and reviews are included in our deep sentiment analysis learning systems. The support vector machine (SVM), Long Short-Term Memory (LSTM), and ghost model convolution neural network (CNN) are combined to get the hybrid model. The dependability and computation time of each approach were evaluated. On all datasets, hybrid models outperform single models when deep learning and SVM are combined. The traditional models were less trustworthy, and deep learning algorithms have recently shown their enormous promise in sentiment analysis. Linear transformations are used in feature maps to eliminate duplicate or related features. The ghost unit makes ghost features by taking away attributes that are both similar and duplicated from each intrinsic feature. LSTM produces higher results but takes longer to process, while CNN needs less hyperparameter adjusting and monitoring. The effectiveness of the integrated model varies depending on the work, and all performed better than the others. For hybrid deep sentiment analysis learning models, LSTM networks, CNNs, and SVMs are needed. Hybrid models are used to compare SVM, LSTM, and CNN, and we tested each method's accuracy and errors. Deep learning-SVM hybrid models improve sentiment analysis accuracy. Experimental results have shown the accuracy of the proposed model shown 91.3 percent and 91.5 percent for datasets type 1 and 8, respectively.
Collapse
|
23
|
Prediction of Diabetes through Retinal Images Using Deep Neural Network. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:7887908. [PMID: 35694596 PMCID: PMC9187442 DOI: 10.1155/2022/7887908] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/25/2022] [Revised: 04/19/2022] [Accepted: 05/17/2022] [Indexed: 12/15/2022]
Abstract
Microvascular problems of diabetes, such as diabetic retinopathy and macular edema, can be seen in the eye's retina, and the retinal images are being used to screen for and diagnose the illness manually. Using deep learning to automate this time-consuming process might be quite beneficial. In this paper, a deep neural network, i.e., convolutional neural network, has been proposed for predicting diabetes through retinal images. Before applying the deep neural network, the dataset is preprocessed and normalised for classification. Deep neural network is constructed by using 7 layers, 5 kernels, and ReLU activation function, and MaxPooling is implemented to combine important features. Finally, the model is implemented to classify whether the retinal image belongs to a diabetic or nondiabetic class. The parameters used for evaluating the model are accuracy, precision, recall, and F1 score. The implemented model has achieved a training accuracy of more than 95%, which is much better than the other states of the art algorithms.
Collapse
|
24
|
Al Mudawi N, Alazeb A. A Model for Predicting Cervical Cancer Using Machine Learning Algorithms. SENSORS 2022; 22:s22114132. [PMID: 35684753 PMCID: PMC9185380 DOI: 10.3390/s22114132] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/05/2022] [Revised: 05/26/2022] [Accepted: 05/26/2022] [Indexed: 12/27/2022]
Abstract
A growing number of individuals and organizations are turning to machine learning (ML) and deep learning (DL) to analyze massive amounts of data and produce actionable insights. Predicting the early stages of serious illnesses using ML-based schemes, including cancer, kidney failure, and heart attacks, is becoming increasingly common in medical practice. Cervical cancer is one of the most frequent diseases among women, and early diagnosis could be a possible solution for preventing this cancer. Thus, this study presents an astute way to predict cervical cancer with ML algorithms. Research dataset, data pre-processing, predictive model selection (PMS), and pseudo-code are the four phases of the proposed research technique. The PMS section reports experiments with a range of classic machine learning methods, including decision tree (DT), logistic regression (LR), support vector machine (SVM), K-nearest neighbors algorithm (KNN), adaptive boosting, gradient boosting, random forest, and XGBoost. In terms of cervical cancer prediction, the highest classification score of 100% is achieved with random forest (RF), decision tree (DT), adaptive boosting, and gradient boosting algorithms. In contrast, 99% accuracy has been found with SVM. The computational complexity of classic machine learning techniques is computed to assess the efficacy of the models. In addition, 132 Saudi Arabian volunteers were polled as part of this study to learn their thoughts about computer-assisted cervical cancer prediction, to focus attention on the human papillomavirus (HPV).
Collapse
|
25
|
IoT-Enabled Framework for Early Detection and Prediction of COVID-19 Suspects by Leveraging Machine Learning in Cloud. JOURNAL OF HEALTHCARE ENGINEERING 2022; 2022:7713939. [PMID: 35432824 PMCID: PMC9006083 DOI: 10.1155/2022/7713939] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/13/2022] [Revised: 02/12/2022] [Accepted: 03/14/2022] [Indexed: 01/08/2023]
Abstract
COVID-19 is the repugnant but the most searched word since its outbreak in November 2019 across the globe. The world has to battle with it until an effective solution is developed. Due to the advancement in mobile and sensor technology, it is possible to come up with Internet of things-based healthcare systems. These novel healthcare systems can be proactive and preventive rather than traditional reactive healthcare systems. This article proposes a real-time IoT-enabled framework for the detection and prediction of COVID-19 suspects in early stages, by collecting symptomatic data and analyzing the nature of the virus in a better manner. The framework computes the presence of COVID-19 virus by mining the health parameters collected in real time from sensors and other IoT devices. The framework is comprised of four main components: user system or data collection center, data analytic center, diagnostic system, and cloud system. To point out and detect the COVID-19 suspected in real time, this work proposes the five machine learning techniques, namely support vector machine (SVM), decision tree, naïve Bayes, logistic regression, and neural network. In our proposed framework, the real and primary dataset collected from SKIMS, Srinagar, is used to validate our work. The experiment on the primary dataset was conducted using different machine learning techniques on selected symptoms. The efficiency of algorithms is calculated by computing the results of performance metrics such as accuracy, precision, recall, F1 score, root-mean-square error, and area under the curve score. The employed machine learning techniques have shown the accuracy of above 95% on the primary symptomatic data. Based on the experiment conducted, the proposed framework would be effective in the early identification and prediction of COVID-19 suspect realizing the nature of the disease in better way.
Collapse
|
26
|
Aggarwal A, Srivastava A, Agarwal A, Chahal N, Singh D, Alnuaim AA, Alhadlaq A, Lee HN. Two-Way Feature Extraction for Speech Emotion Recognition Using Deep Learning. SENSORS 2022; 22:s22062378. [PMID: 35336548 PMCID: PMC8949356 DOI: 10.3390/s22062378] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/31/2022] [Revised: 03/03/2022] [Accepted: 03/15/2022] [Indexed: 02/01/2023]
Abstract
Recognizing human emotions by machines is a complex task. Deep learning models attempt to automate this process by rendering machines to exhibit learning capabilities. However, identifying human emotions from speech with good performance is still challenging. With the advent of deep learning algorithms, this problem has been addressed recently. However, most research work in the past focused on feature extraction as only one method for training. In this research, we have explored two different methods of extracting features to address effective speech emotion recognition. Initially, two-way feature extraction is proposed by utilizing super convergence to extract two sets of potential features from the speech data. For the first set of features, principal component analysis (PCA) is applied to obtain the first feature set. Thereafter, a deep neural network (DNN) with dense and dropout layers is implemented. In the second approach, mel-spectrogram images are extracted from audio files, and the 2D images are given as input to the pre-trained VGG-16 model. Extensive experiments and an in-depth comparative analysis over both the feature extraction methods with multiple algorithms and over two datasets are performed in this work. The RAVDESS dataset provided significantly better accuracy than using numeric features on a DNN.
Collapse
Affiliation(s)
- Apeksha Aggarwal
- Department of Computer Science Engineering & Information Technology, Jaypee Institute of Information Technology, A 10, Sector 62, Noida 201307, India; or
| | - Akshat Srivastava
- School of Computer Science Engineering and Technology, Bennett University, Plot Nos 8-11, TechZone 2, Greater Noida 201310, India;
| | - Ajay Agarwal
- Department of Information Technology, KIET Group of Institutions, Delhi-NCR, Meerut Road (NH-58), Ghaziabad 201206, India;
| | - Nidhi Chahal
- Nidhi Chahal, NIIT Limited, Gurugram 110019, India;
| | - Dilbag Singh
- School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology, Gwangju 61005, Korea; or
| | - Abeer Ali Alnuaim
- Department of Computer Science and Engineering, College of Applied Studies and Community Services, King Saud University, P.O. Box 22459, Riyadh 11495, Saudi Arabia; (A.A.A.); (A.A.)
| | - Aseel Alhadlaq
- Department of Computer Science and Engineering, College of Applied Studies and Community Services, King Saud University, P.O. Box 22459, Riyadh 11495, Saudi Arabia; (A.A.A.); (A.A.)
| | - Heung-No Lee
- School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology, Gwangju 61005, Korea; or
- Correspondence:
| |
Collapse
|
27
|
Detection of Breast Cancer Using Histopathological Image Classification Dataset with Deep Learning Techniques. BIOMED RESEARCH INTERNATIONAL 2022; 2022:8363850. [PMID: 35281604 PMCID: PMC8913119 DOI: 10.1155/2022/8363850] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/04/2021] [Revised: 01/02/2022] [Accepted: 02/07/2022] [Indexed: 02/06/2023]
Abstract
Cancer is one of the top causes of mortality, and it arises when cells in the body grow abnormally, like in the case of breast cancer. For people all around the world, it has now become a huge issue and a threat to their safety and wellbeing. Breast cancer is one of the major causes of death among females all over the globe, and it is particularly prevalent in the United States. It is possible to diagnose breast cancer using a variety of imaging modalities including mammography, computerized tomography (CT), magnetic resonance imaging (MRI), ultrasound, and biopsies, among others. To analyze the picture, a histopathology study (biopsy) is often performed, which assists in the diagnosis of breast cancer. The goal of this study is to develop improved strategies for various CAD phases that will play a critical role in minimizing the variability gap between and among observers. It created an automatic segmentation approach that is then followed by self-driven post-processing activities to successfully identify the Fourier Transform based Segmentation in the CAD system to improve its performance. When compared to existing techniques, the proposed segmentation technique has several advantages: spatial information is incorporated, there is no need to set any initial parameters beforehand, it is independent of magnification, it automatically determines the inputs for morphological operations to enhance segmented images so that pathologists can analyze the image with greater clarity, and it is fast. Extensive tests were conducted to determine the most effective feature extraction techniques and to investigate how textural, morphological, and graph characteristics impact the accuracy of categorization classification. In addition, a classification strategy for breast cancer detection has been developed that is based on weighted feature selection and uses an upgraded version of the Genetic Algorithm in conjunction with a Convolutional Neural Network Classifier. The practical application of the suggested improved segmentation and classification algorithms for the CAD framework may reduce the number of incorrect diagnoses and increase the accuracy of classification. So, it may serve as a second opinion tool for pathologists and aid in the early detection of diseases.
Collapse
|
28
|
On the Use of Wavelet Domain and Machine Learning for the Analysis of Epileptic Seizure Detection from EEG Signals. JOURNAL OF HEALTHCARE ENGINEERING 2022; 2022:8928021. [PMID: 35251581 PMCID: PMC8896918 DOI: 10.1155/2022/8928021] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/07/2022] [Revised: 01/26/2022] [Accepted: 01/31/2022] [Indexed: 11/24/2022]
Abstract
Epileptic patients suffer from an epileptic brain seizure caused by the temporary and unpredicted electrical interruption. Conventionally, the electroencephalogram (EEG) signals are manually studied by medical practitioners as it records the electrical activities from the brain. This technique consumes a lot of time, and the outputs are unreliable. In a bid to address this problem, a new structure for detecting an epileptic seizure is proposed in this study. The EEG signals obtained from the University of Bonn, Germany, and real-time medical records from the Senthil Multispecialty Hospital, India, were used. These signals were disintegrated into six frequency subbands that employed discrete wavelet transform (DWT) and extracted twelve statistical functions. In particular, seven best features were identified and further fed into k-Nearest Neighbor (kNN), naïve Bayes, Support Vector Machine (SVM), and Decision Tree classifiers for two-type and three-type classifications. Six statistical parameters were employed to measure the performance of these classifications. It has been found that different combinations of features and classifiers produce different results. Overall, the study is a first attempt to find the best combination feature set and classifier for 16 different 2-class and 3-class classification challenges of the Bonn and Senthil real-time clinical dataset.
Collapse
|
29
|
Hu Y, Han J, Ding S, Liu S, Wang H. Identification of ferroptosis-associated biomarkers for the potential diagnosis and treatment of postmenopausal osteoporosis. Front Endocrinol (Lausanne) 2022; 13:986384. [PMID: 36105394 PMCID: PMC9464919 DOI: 10.3389/fendo.2022.986384] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/05/2022] [Accepted: 08/05/2022] [Indexed: 11/13/2022] Open
Abstract
OBJECTIVE Postmenopausal osteoporosis (PMOP) is one of the most commonly occurring conditions worldwide and is characterized by estrogen deficiency as well as persistent calcium loss with age. The aim of our study was to identify significant ferroptosis-associated biomarkers for PMOP. METHODS AND MATERIALS We obtained our training dataset from the Gene Expression Omnibus (GEO) database using GSE56815 expression profiling data. Meanwhile, we extracted ferroptosis-associated genes for further analysis. Differentially expressed ferroptosis-associated genes (DEFAGs) between OP patients and normal controls were selected using the "limma" package. We established a ferroptosis-associated gene signature using training models, specifically, random forest (RF) and support vector machine (SVM) models. It was further validated in another dataset (GSE56814) which also showed a high AUC: 0.98, indicating high diagnostic value. Using consensus clustering, the OP patient subtypes were identified. A ferroptosis associated gene (FAG)-Scoring scheme was developed by PCA. The important candidate genes associated with OP were also compared between different ferrclusters and geneclusters. RESULTS There were significant DEFAGs acquired, of which five (HMOX1, HAMP, LPIN1, MAP3K5, FLT3) were selected for establishing a ferroptosis-associated gene signature. Analyzed from the ROC curve, our established RF model had a higher AUC value than the SVM model (RF model AUC:1.00). Considering these results, the established RF model was chosen to be the most appropriate training model. Later, based on the expression levels of the five DEFAGs, a clinical application nomogram was established. The OP patients were divided into two subtypes (ferrcluster A, B and genecluster A, B, respectively) according to the consensus clustering method based on DEFAGs and differentially expressed genes (DEGs). Ferrcluster B and genecluster B had higher ferroptosis score than ferrcluster A and genecluster A, respectively. The expression of COL1A1 gene was significantly higher in ferrcluster B and gencluster B compared with ferrcluster A and gencluster A, respectively, while there is no statistical difference in term of VDR gene, COL1A2 genes, and PTH gene expressions between ferrcluster A and B, together with gencluster A and B. CONCLUSIONS On the basis of five explanatory variables (HMOX1, HAMP, LPIN1, MAP3K5 and FLT3), we developed a diagnostic ferroptosis-associated gene signature and identified two differently categorized OP subtypes that may potentially be applied for the early diagnosis and individualized treatment of PMOP. The ER gene, VDR gene, IL-6 gene, COL1A1 and COL1A2 genes, and PTH gene are important candidate gene of OP, however, more studies are still anticipated to further elucidate the relationship between these genes and ferroptosis in OP.
Collapse
Affiliation(s)
- Yunxiang Hu
- Department of Orthopedics, Dalian Municipal Central Hospital Affiliated of Dalian Medical University, Dalian, China
- School of Graduates, Dalian Medical University, Dalian, China
| | - Jun Han
- Department of Orthopedics, Dalian Municipal Central Hospital Affiliated of Dalian Medical University, Dalian, China
- School of Graduates, Dalian Medical University, Dalian, China
- Department of Spine Surgery, the First Affiliated Hospital of Dalian Medical University, Dalian, China
| | - Shengqiang Ding
- Department of Spine Surgery, The People’s Hospital of Liuyang City, Changsha, China
| | - Sanmao Liu
- Department of Orthopedics, Dalian Municipal Central Hospital Affiliated of Dalian Medical University, Dalian, China
- School of Graduates, Dalian Medical University, Dalian, China
| | - Hong Wang
- Department of Orthopedics, Dalian Municipal Central Hospital Affiliated of Dalian Medical University, Dalian, China
- School of Graduates, Dalian Medical University, Dalian, China
- *Correspondence: Hong Wang,
| |
Collapse
|