1
|
Wang YY, Yang WX, Du QJ, Liu ZH, Lu MH, You CG. Construction and evaluation of a liver cancer risk prediction model based on machine learning. World J Gastrointest Oncol 2024; 16:3839-3850. [PMID: 39350987 PMCID: PMC11438789 DOI: 10.4251/wjgo.v16.i9.3839] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/29/2024] [Revised: 07/31/2024] [Accepted: 08/07/2024] [Indexed: 09/09/2024] Open
Abstract
BACKGROUND Liver cancer is one of the most prevalent malignant tumors worldwide, and its early detection and treatment are crucial for enhancing patient survival rates and quality of life. However, the early symptoms of liver cancer are often not obvious, resulting in a late-stage diagnosis in many patients, which significantly reduces the effectiveness of treatment. Developing a highly targeted, widely applicable, and practical risk prediction model for liver cancer is crucial for enhancing the early diagnosis and long-term survival rates among affected individuals. AIM To develop a liver cancer risk prediction model by employing machine learning techniques, and subsequently assess its performance. METHODS In this study, a total of 550 patients were enrolled, with 190 hepatocellular carcinoma (HCC) and 195 cirrhosis patients serving as the training cohort, and 83 HCC and 82 cirrhosis patients forming the validation cohort. Logistic regression (LR), support vector machine (SVM), random forest (RF), and least absolute shrinkage and selection operator (LASSO) regression models were developed in the training cohort. Model performance was assessed in the validation cohort. Additionally, this study conducted a comparative evaluation of the diagnostic efficacy between the ASAP model and the model developed in this study using receiver operating characteristic curve, calibration curve, and decision curve analysis (DCA) to determine the optimal predictive model for assessing liver cancer risk. RESULTS Six variables including age, white blood cell, red blood cell, platelet counts, alpha-fetoprotein and protein induced by vitamin K absence or antagonist II levels were used to develop LR, SVM, RF, and LASSO regression models. The RF model exhibited superior discrimination, and the area under curve of the training and validation sets was 0.969 and 0.858, respectively. These values significantly surpassed those of the LR (0.850 and 0.827), SVM (0.860 and 0.803), LASSO regression (0.845 and 0.831), and ASAP (0.866 and 0.813) models. Furthermore, calibration and DCA indicated that the RF model exhibited robust calibration and clinical validity. CONCLUSION The RF model demonstrated excellent prediction capabilities for HCC and can facilitate early diagnosis of HCC in clinical practice.
Collapse
Affiliation(s)
- Ying-Ying Wang
- Laboratory Medicine Center, The Second Hospital & Clinical Medical School, Lanzhou University, Lanzhou 730030, Gansu Province, China
| | - Wan-Xia Yang
- Laboratory Medicine Center, The Second Hospital & Clinical Medical School, Lanzhou University, Lanzhou 730030, Gansu Province, China
| | - Qia-Jun Du
- Laboratory Medicine Center, The Second Hospital & Clinical Medical School, Lanzhou University, Lanzhou 730030, Gansu Province, China
| | - Zhen-Hua Liu
- Laboratory Medicine Center, The Second Hospital & Clinical Medical School, Lanzhou University, Lanzhou 730030, Gansu Province, China
| | - Ming-Hua Lu
- Laboratory Medicine Center, The Second Hospital & Clinical Medical School, Lanzhou University, Lanzhou 730030, Gansu Province, China
| | - Chong-Ge You
- Laboratory Medicine Center, The Second Hospital & Clinical Medical School, Lanzhou University, Lanzhou 730030, Gansu Province, China
| |
Collapse
|
2
|
Hu T, Li K, Ma C, Zhou N, Chen Q, Qi C. Improved classification of soil As contamination at continental scale: Resolving class imbalances using machine learning approach. CHEMOSPHERE 2024; 363:142697. [PMID: 38925515 DOI: 10.1016/j.chemosphere.2024.142697] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/21/2023] [Revised: 06/11/2024] [Accepted: 06/23/2024] [Indexed: 06/28/2024]
Abstract
The identification of arsenic (As)-contaminated areas is an important prerequisite for soil management and reclamation. Although previous studies have attempted to identify soil As contamination via machine learning (ML) methods combined with soil spectroscopy, they have ignored the rarity of As-contaminated soil samples, leading to an imbalanced learning problem. A novel ML framework was thus designed herein to solve the imbalance issue in identifying soil As contamination from soil visible and near-infrared spectra. Spectral preprocessing, imbalanced dataset resampling, and model comparisons were combined in the ML framework, and the optimal combination was selected based on the recall. In addition, Bayesian optimization was used to tune the model hyperparameters. The optimized model achieved recall, area under the curve, and balanced accuracy values of 0.83, 0.88, and 0.79, respectively, on the testing set. The recall was further improved to 0.87 with the threshold adjustment, indicating the model's excellent performance and generalization capability in classifying As-contaminated soil samples. The optimal model was applied to a global soil spectral dataset to predict areas at a high risk of soil As contamination on a global scale. The ML framework established in this study represents a milestone in the classification of soil As contamination and can serve as a valuable reference for contamination management in soil science.
Collapse
Affiliation(s)
- Tao Hu
- School of Resources and Safety Engineering, Central South University, Changsha, 410083, China
| | - Kechao Li
- School of Resources and Safety Engineering, Central South University, Changsha, 410083, China
| | - Chundi Ma
- School of Resources and Safety Engineering, Central South University, Changsha, 410083, China
| | - Nana Zhou
- School of Resources and Safety Engineering, Central South University, Changsha, 410083, China
| | - Qiusong Chen
- School of Resources and Safety Engineering, Central South University, Changsha, 410083, China
| | - Chongchong Qi
- School of Resources and Safety Engineering, Central South University, Changsha, 410083, China; School of Metallurgy and Environment, Central South University, Changsha, 410083, China; Fankou Lead-Zinc Mine, NONFEMET, Shaoguan, 511100, China.
| |
Collapse
|
3
|
Geeitha S, Ravishankar K, Cho J, Easwaramoorthy SV. Integrating cat boost algorithm with triangulating feature importance to predict survival outcome in recurrent cervical cancer. Sci Rep 2024; 14:19828. [PMID: 39191808 DOI: 10.1038/s41598-024-67562-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2024] [Accepted: 07/12/2024] [Indexed: 08/29/2024] Open
Abstract
Cervical cancer is one of the most dangerous malignancies in women. Prolonged survival times are made possible by breakthroughs in early recognition and efficient treatment of a disease.The existing methods are lagging on finding the important attributes to predict the survival outcome. The main objective of this study is to find individuals with cervical cancer who are at greater risk of death from recurrence by predicting the survival.A novel approach in a proposed technique is Triangulating feature importance to find the important risk factors through which the treatment may vary to improve the survival outcome.Five algorithms Support vector machine, Naive Bayes, supervised logistic regression, decision tree algorithm, Gradient boosting, and random forest are used to build the concept. Conventional attribute selection methods like information gain (IG), FCBF, and ReliefFare employed. The recommended classifier is evaluated for Precision, Recall, F1, Mathews Correlation Coefficient (MCC), Classification Accuracy (CA), and Area under curve (AUC) using various methods. Gradient boosting algorithm (CAT BOOST) attains the highest accuracy value of 0.99 to predict survival outcome of recurrence cervical cancer patients. The proposed outcome of the research is to identify the important risk factors through which the survival outcome of the patients improved.
Collapse
Affiliation(s)
- S Geeitha
- Department of Information Technology, M. Kumarasamy College of Engineering, Thalavapalayam, Karur, Tamil Nadu, India
| | - K Ravishankar
- Department of Computer Science and Engineering, Sathyabama Institute of Science and Technology, Chennai, Tamil Nadu, India
| | - Jaehyuk Cho
- Department of Software Engineering and Division of Electronics and Information Engineering, Jeonbuk National University, Jeonju-Si, Republic of Korea.
| | | |
Collapse
|
4
|
He S, Zhu G, Zhou Y, Yang B, Wang J, Wang Z, Wang T. Predictive models for personalized precision medical intervention in spontaneous regression stages of cervical precancerous lesions. J Transl Med 2024; 22:686. [PMID: 39061062 PMCID: PMC11282852 DOI: 10.1186/s12967-024-05417-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2024] [Accepted: 06/19/2024] [Indexed: 07/28/2024] Open
Abstract
BACKGROUND During the prolonged period from Human Papillomavirus (HPV) infection to cervical cancer development, Low-Grade Squamous Intraepithelial Lesion (LSIL) stage provides a critical opportunity for cervical cancer prevention, giving the high potential for reversal in this stage. However, there is few research and a lack of clear guidelines on appropriate intervention strategies at this stage, underscoring the need for real-time prognostic predictions and personalized treatments to promote lesion reversal. METHODS We have established a prospective cohort. Since 2018, we have been collecting clinical data and pathological images of HPV-infected patients, followed by tracking the progression of their cervical lesions. In constructing our predictive models, we applied logistic regression and six machine learning models, evaluating each model's predictive performance using metrics such as the Area Under the Curve (AUC). We also employed the SHAP method for interpretative analysis of the prediction results. Additionally, the model identifies key factors influencing the progression of the lesions. RESULTS Model comparisons highlighted the superior performance of Random Forests (RF) and Support Vector Machines (SVM), both in clinical parameter and pathological image-based predictions. Notably, the RF model, which integrates pathological images and clinical multi-parameters, achieved the highest AUC of 0.866. Another significant finding was the substantial impact of sleep quality on the spontaneous clearance of HPV and regression of LSIL. CONCLUSIONS In contrast to current cervical cancer prediction models, our model's prognostic capabilities extend to the spontaneous regression stage of cervical cancer. This model aids clinicians in real-time monitoring of lesions and in developing personalized treatment or follow-up plans by assessing individual risk factors, thus fostering lesion spontaneous reversal and aiding in cervical cancer prevention and reduction.
Collapse
Affiliation(s)
- Simin He
- Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, 030001, China
- Key Laboratory of Coal Environmental Pathogenicity and Prevention, Shanxi Medical University, Ministry of Education, Taiyuan, 030001, China
| | - Guiming Zhu
- Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, 030001, China
- Key Laboratory of Coal Environmental Pathogenicity and Prevention, Shanxi Medical University, Ministry of Education, Taiyuan, 030001, China
| | - Ying Zhou
- Department of Obstetrics and Gynecology, First Hospital of Shanxi Medical University, Taiyuan, 030001, China
| | - Boran Yang
- Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, 030001, China
- Key Laboratory of Coal Environmental Pathogenicity and Prevention, Shanxi Medical University, Ministry of Education, Taiyuan, 030001, China
| | - Juping Wang
- Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, 030001, China
- Key Laboratory of Coal Environmental Pathogenicity and Prevention, Shanxi Medical University, Ministry of Education, Taiyuan, 030001, China
| | - Zhaoxia Wang
- Department of Obstetrics and Gynecology, First Hospital of Shanxi Medical University, Taiyuan, 030001, China
| | - Tong Wang
- Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, 030001, China.
- Key Laboratory of Coal Environmental Pathogenicity and Prevention, Shanxi Medical University, Ministry of Education, Taiyuan, 030001, China.
| |
Collapse
|
5
|
Alabdulqader EA, Alarfaj AA, Umer M, Eshmawi AA, Alsubai S, Kim TH, Ashraf I. Improving prediction of blood cancer using leukemia microarray gene data and Chi2 features with weighted convolutional neural network. Sci Rep 2024; 14:15625. [PMID: 38972881 PMCID: PMC11228030 DOI: 10.1038/s41598-024-65315-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2024] [Accepted: 06/19/2024] [Indexed: 07/09/2024] Open
Abstract
Blood cancer has emerged as a growing concern over the past decade, necessitating early diagnosis for timely and effective treatment. The present diagnostic method, which involves a battery of tests and medical experts, is costly and time-consuming. For this reason, it is crucial to establish an automated diagnostic system for accurate predictions. A particular field of focus in medical research is the use of machine learning and leukemia microarray gene data for blood cancer diagnosis. Even with a great deal of research, more improvements are needed to reach the appropriate levels of accuracy and efficacy. This work presents a supervised machine-learning algorithm for blood cancer prediction. This work makes use of the 22,283-gene leukemia microarray gene data. Chi-squared (Chi2) feature selection methods and the synthetic minority oversampling technique (SMOTE)-Tomek resampling is used to overcome issues with imbalanced and high-dimensional datasets. To balance the dataset for each target class, SMOTE-Tomek creates synthetic data, and Chi2 chooses the most important features to train the learning models from 22,283 genes. A novel weighted convolutional neural network (CNN) model is proposed for classification, utilizing the support of three separate CNN models. To determine the importance of the proposed approach, extensive experiments are carried out on the datasets, including a performance comparison with the most advanced techniques. Weighted CNN demonstrates superior performance over other models when coupled with SMOTE-Tomek and Chi2 techniques, achieving a remarkable 99.9% accuracy. Results from k-fold cross-validation further affirm the supremacy of the proposed model.
Collapse
Affiliation(s)
- Ebtisam Abdullah Alabdulqader
- Department of Information Technology, College of Computer and Information Sciences, King Saud University, P. O. Box 800, 11421, Riyadh, Saudi Arabia
| | - Aisha Ahmed Alarfaj
- Department of Information Systems, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, P.O. Box 84428, 11671, Riyadh, Saudi Arabia
| | - Muhammad Umer
- Department of Computer Science and Information Technology, The Islamia University of Bahawalpur, Bahawalpur, 63100, Pakistan
| | - Ala' Abdulmajid Eshmawi
- Department of Cybersecurity, College of Computer Science and Engineering, University of Jeddah, Jeddah, 23218, Saudi Arabia
| | - Shtwai Alsubai
- Department of Computer Science, College of Computer Engineering and Sciences, Prince Sattam bin Abdulaziz University, P.O. Box 151, 11942, Al-Kharj, Saudi Arabia
| | - Tai-Hoon Kim
- School of Electrical and Computer Engineering, Yeosu Campus, Chonnam National University, 50, Daehak-ro, Yeosu-si, 59626, Jeollanam-do, Republic of Korea.
| | - Imran Ashraf
- Department of Information and Communication Engineering, Yeungnam University, Gyeongsan, 38541, Korea.
| |
Collapse
|
6
|
Yang T, Hu H, Li X, Meng Q, Lu H, Huang Q. An efficient Fusion-Purification Network for Cervical pap-smear image classification. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2024; 251:108199. [PMID: 38728830 DOI: 10.1016/j.cmpb.2024.108199] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/27/2023] [Revised: 02/28/2024] [Accepted: 04/21/2024] [Indexed: 05/12/2024]
Abstract
BACKGROUND AND OBJECTIVES In cervical cell diagnostics, autonomous screening technology constitutes the foundation of automated diagnostic systems. Currently, numerous deep learning-based classification techniques have been successfully implemented in the analysis of cervical cell images, yielding favorable outcomes. Nevertheless, efficient discrimination of cervical cells continues to be challenging due to large intra-class and small inter-class variations. The key to dealing with this problem is to capture localized informative differences from cervical cell images and to represent discriminative features efficiently. Existing methods neglect the importance of global morphological information, resulting in inadequate feature representation capability. METHODS To address this limitation, we propose a novel cervical cell classification model that focuses on purified fusion information. Specifically, we first integrate the detailed texture information and morphological structure features, named cervical pathology information fusion. Second, in order to enhance the discrimination of cervical cell features and address the data redundancy and bias inherent after fusion, we design a cervical purification bottleneck module. This model strikes a balance between leveraging purified features and facilitating high-efficiency discrimination. Furthermore, we intend to unveil a more intricate cervical cell dataset: Cervical Cytopathology Image Dataset (CCID). RESULTS Extensive experiments on two real-world datasets show that our proposed model outperforms state-of-the-art cervical cell classification models. CONCLUSIONS The results show that our method can well help pathologists to accurately evaluate cervical smears.
Collapse
Affiliation(s)
- Tianjin Yang
- College of Computer and Information, Hohai University, Nanjing, 211100, PR China.
| | - Hexuan Hu
- College of Computer and Information, Hohai University, Nanjing, 211100, PR China.
| | - Xing Li
- College of information Science and Technology & College of Artificial Intelligence, Nanjing Forestry University, Nanjing 210037, PR China.
| | - Qing Meng
- College of Computer and Information, Hohai University, Nanjing, 211100, PR China.
| | - Hao Lu
- College of Computer and Information, Hohai University, Nanjing, 211100, PR China.
| | - Qian Huang
- College of Computer and Information, Hohai University, Nanjing, 211100, PR China.
| |
Collapse
|
7
|
Hasan M, Sahid MA, Uddin MP, Marjan MA, Kadry S, Kim J. Performance discrepancy mitigation in heart disease prediction for multisensory inter-datasets. PeerJ Comput Sci 2024; 10:e1917. [PMID: 38660196 PMCID: PMC11041935 DOI: 10.7717/peerj-cs.1917] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Accepted: 02/12/2024] [Indexed: 04/26/2024]
Abstract
Heart disease is one of the primary causes of morbidity and death worldwide. Millions of people have had heart attacks every year, and only early-stage predictions can help to reduce the number. Researchers are working on designing and developing early-stage prediction systems using different advanced technologies, and machine learning (ML) is one of them. Almost all existing ML-based works consider the same dataset (intra-dataset) for the training and validation of their method. In particular, they do not consider inter-dataset performance checks, where different datasets are used in the training and testing phases. In inter-dataset setup, existing ML models show a poor performance named the inter-dataset discrepancy problem. This work focuses on mitigating the inter-dataset discrepancy problem by considering five available heart disease datasets and their combined form. All potential training and testing mode combinations are systematically executed to assess discrepancies before and after applying the proposed methods. Imbalance data handling using SMOTE-Tomek, feature selection using random forest (RF), and feature extraction using principle component analysis (PCA) with a long preprocessing pipeline are used to mitigate the inter-dataset discrepancy problem. The preprocessing pipeline builds on missing value handling using RF regression, log transformation, outlier removal, normalization, and data balancing that convert the datasets to more ML-centric. Support vector machine, K-nearest neighbors, decision tree, RF, eXtreme Gradient Boosting, Gaussian naive Bayes, logistic regression, and multilayer perceptron are used as classifiers. Experimental results show that feature selection and classification using RF produce better results than other combination strategies in both single- and inter-dataset setups. In certain configurations of individual datasets, RF demonstrates 100% accuracy and 96% accuracy during the feature selection phase in an inter-dataset setup, exhibiting commendable precision, recall, F1 score, specificity, and AUC score. The results indicate that an effective preprocessing technique has the potential to improve the performance of the ML model without necessitating the development of intricate prediction models. Addressing inter-dataset discrepancies introduces a novel research avenue, enabling the amalgamation of identical features from various datasets to construct a comprehensive global dataset within a specific domain.
Collapse
Affiliation(s)
- Mahmudul Hasan
- Department of Computer Science and Engineering, Hajee Mohammad Danesh Science and Technology University, Dinajpur, Bangladesh
- School of Information Technology, Deakin University, Geelong, VIC, Australia
| | - Md Abdus Sahid
- Department of Computer Science and Engineering, Hajee Mohammad Danesh Science and Technology University, Dinajpur, Bangladesh
| | - Md Palash Uddin
- Department of Computer Science and Engineering, Hajee Mohammad Danesh Science and Technology University, Dinajpur, Bangladesh
- School of Information Technology, Deakin University, Geelong, VIC, Australia
| | - Md Abu Marjan
- Department of Computer Science and Engineering, Hajee Mohammad Danesh Science and Technology University, Dinajpur, Bangladesh
| | - Seifedine Kadry
- Department of Electrical and Computer Engineering, Lebanese American University, Byblos, Lebanon
- Department of Applied Data Science, Noroff University College, Kristiansand, Norway
- Artificial Intelligence Research Center (AIRC), Ajman University, Ajman, Norway
- MEU Research Unit, Middle East University, Amman, Jordan
| | - Jungeun Kim
- Department of Software, Kongju National University, Cheonan, Republic of South Korea
| |
Collapse
|
8
|
Rajadurai S, Perumal K, Ijaz MF, Chowdhary CL. PrecisionLymphoNet: Advancing Malignant Lymphoma Diagnosis via Ensemble Transfer Learning with CNNs. Diagnostics (Basel) 2024; 14:469. [PMID: 38472941 DOI: 10.3390/diagnostics14050469] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2023] [Revised: 01/26/2024] [Accepted: 01/27/2024] [Indexed: 03/14/2024] Open
Abstract
Malignant lymphoma, which impacts the lymphatic system, presents diverse challenges in accurate diagnosis due to its varied subtypes-chronic lymphocytic leukemia (CLL), follicular lymphoma (FL), and mantle cell lymphoma (MCL). Lymphoma is a form of cancer that begins in the lymphatic system, impacting lymphocytes, which are a specific type of white blood cell. This research addresses these challenges by proposing ensemble and non-ensemble transfer learning models employing pre-trained weights from VGG16, VGG19, DenseNet201, InceptionV3, and Xception. For the ensemble technique, this paper adopts a stack-based ensemble approach. It is a two-level classification approach and best suited for accuracy improvement. Testing on a multiclass dataset of CLL, FL, and MCL reveals exceptional diagnostic accuracy, with DenseNet201, InceptionV3, and Xception exceeding 90% accuracy. The proposed ensemble model, leveraging InceptionV3 and Xception, achieves an outstanding 99% accuracy over 300 epochs, surpassing previous prediction methods. This study demonstrates the feasibility and efficiency of the proposed approach, showcasing its potential in real-world medical applications for precise lymphoma diagnosis.
Collapse
Affiliation(s)
- Sivashankari Rajadurai
- School of Computer Science Engineering and Information Systems, Vellore Institute of Technology, Vellore 632014, India
| | - Kumaresan Perumal
- School of Computer Science Engineering and Information Systems, Vellore Institute of Technology, Vellore 632014, India
| | - Muhammad Fazal Ijaz
- School of IT and Engineering, Melbourne Institute of Technology, Melbourne, VIC 3000, Australia
| | - Chiranji Lal Chowdhary
- School of Computer Science Engineering and Information Systems, Vellore Institute of Technology, Vellore 632014, India
| |
Collapse
|
9
|
Munshi RM. Novel ensemble learning approach with SVM-imputed ADASYN features for enhanced cervical cancer prediction. PLoS One 2024; 19:e0296107. [PMID: 38198475 PMCID: PMC10781159 DOI: 10.1371/journal.pone.0296107] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Accepted: 12/06/2023] [Indexed: 01/12/2024] Open
Abstract
Cervical cancer remains a leading cause of female mortality, particularly in developing regions, underscoring the critical need for early detection and intervention guided by skilled medical professionals. While Pap smear images serve as valuable diagnostic tools, many available datasets for automated cervical cancer detection contain missing data, posing challenges for machine learning models' efficacy. To address these hurdles, this study presents an automated system adept at managing missing information using ADASYN characteristics, resulting in exceptional accuracy. The proposed methodology integrates a voting classifier model harnessing the predictive capacity of three distinct machine learning models. It further incorporates SVM Imputer and ADASYN up-sampled features to mitigate missing value concerns, while leveraging CNN-generated features to augment the model's capabilities. Notably, this model achieves remarkable performance metrics, boasting a 99.99% accuracy, precision, recall, and F1 score. A comprehensive comparative analysis evaluates the proposed model against various machine learning algorithms across four scenarios: original dataset usage, SVM imputation, ADASYN feature utilization, and CNN-generated features. Results indicate the superior efficacy of the proposed model over existing state-of-the-art techniques. This research not only introduces a novel approach but also offers actionable suggestions for refining automated cervical cancer detection systems. Its impact extends to benefiting medical practitioners by enabling earlier detection and improved patient care. Furthermore, the study's findings have substantial societal implications, potentially reducing the burden of cervical cancer through enhanced diagnostic accuracy and timely intervention.
Collapse
Affiliation(s)
- Raafat M. Munshi
- Department of Medical Laboratory Technology (MLT), Faculty of Applied Medical Sciences, King Abdulaziz University, Rabigh, Saudi Arabia
| |
Collapse
|
10
|
Aljrees T. Improving prediction of cervical cancer using KNN imputer and multi-model ensemble learning. PLoS One 2024; 19:e0295632. [PMID: 38170713 PMCID: PMC10763959 DOI: 10.1371/journal.pone.0295632] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2023] [Accepted: 11/23/2023] [Indexed: 01/05/2024] Open
Abstract
Cervical cancer is a leading cause of women's mortality, emphasizing the need for early diagnosis and effective treatment. In line with the imperative of early intervention, the automated identification of cervical cancer has emerged as a promising avenue, leveraging machine learning techniques to enhance both the speed and accuracy of diagnosis. However, an inherent challenge in the development of these automated systems is the presence of missing values in the datasets commonly used for cervical cancer detection. Missing data can significantly impact the performance of machine learning models, potentially leading to inaccurate or unreliable results. This study addresses a critical challenge in automated cervical cancer identification-handling missing data in datasets. The study present a novel approach that combines three machine learning models into a stacked ensemble voting classifier, complemented by the use of a KNN Imputer to manage missing values. The proposed model achieves remarkable results with an accuracy of 0.9941, precision of 0.98, recall of 0.96, and an F1 score of 0.97. This study examines three distinct scenarios: one involving the deletion of missing values, another utilizing KNN imputation, and a third employing PCA for imputing missing values. This research has significant implications for the medical field, offering medical experts a powerful tool for more accurate cervical cancer therapy and enhancing the overall effectiveness of testing procedures. By addressing missing data challenges and achieving high accuracy, this work represents a valuable contribution to cervical cancer detection, ultimately aiming to reduce the impact of this disease on women's health and healthcare systems.
Collapse
Affiliation(s)
- Turki Aljrees
- College of Computer Science and Engineering, University of Hafr Al-Batin, Hafar Al-Batin, Saudi Arabia
| |
Collapse
|
11
|
Razzak MA, Islam MN, Aadeeb MS, Tasnim T. Digital health interventions for cervical cancer care: A systematic review and future research opportunities. PLoS One 2023; 18:e0296015. [PMID: 38100494 PMCID: PMC10723694 DOI: 10.1371/journal.pone.0296015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2023] [Accepted: 12/04/2023] [Indexed: 12/17/2023] Open
Abstract
BACKGROUND Cervical cancer is a malignancy among women worldwide, which is responsible for innumerable deaths every year. The primary objective of this review study is to offer a comprehensive and synthesized overview of the existing literature concerning digital interventions in cervical cancer care. As such, we aim to uncover prevalent research gaps and highlight prospective avenues for future investigations. METHODS This study adopted a Systematic Literature Review (SLR) methodology where a total of 26 articles were reviewed from an initial set of 1110 articles following an inclusion-exclusion criterion. RESULTS The review highlights a deficiency in existing studies that address awareness dissemination, screening facilitation, and treatment provision for cervical cancer. The review also reveals future research opportunities like explore innovative approaches using emerging technologies to enhance awareness campaigns and treatment accessibility, consider diverse study contexts, develop sophisticated machine learning models for screening, incorporate additional features in machine learning research, investigate the impact of treatments across different stages of cervical cancer, and create more user-friendly applications for cervical cancer care. CONCLUSIONS The findings of this study can contribute to mitigating the adverse effects of cervical cancer and improving patient outcomes. It also highlights the untapped potential of Artificial Intelligence and Machine Learning, which could significantly impact our society.
Collapse
Affiliation(s)
- Md Abdur Razzak
- Faculty of Science and Technology, Bangladesh University of Professionals, Dhaka, Bangladesh
| | - Muhammad Nazrul Islam
- Department of Computer Science and Engineering, Military Institute of Science and Technology, Dhaka, Bangladesh
| | - Md Shadman Aadeeb
- Department of Computer Science and Engineering, Military Institute of Science and Technology, Dhaka, Bangladesh
| | - Tasfia Tasnim
- Department of Computer Science and Engineering, Military Institute of Science and Technology, Dhaka, Bangladesh
| |
Collapse
|
12
|
Karamti H, Alharthi R, Anizi AA, Alhebshi RM, Eshmawi AA, Alsubai S, Umer M. Improving Prediction of Cervical Cancer Using KNN Imputed SMOTE Features and Multi-Model Ensemble Learning Approach. Cancers (Basel) 2023; 15:4412. [PMID: 37686692 PMCID: PMC10486648 DOI: 10.3390/cancers15174412] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2023] [Revised: 08/02/2023] [Accepted: 08/09/2023] [Indexed: 09/10/2023] Open
Abstract
Objective: Cervical cancer ranks among the top causes of death among females in developing countries. The most important procedures that should be followed to guarantee the minimizing of cervical cancer's aftereffects are early identification and treatment under the finest medical guidance. One of the best methods to find this sort of malignancy is by looking at a Pap smear image. For automated detection of cervical cancer, the available datasets often have missing values, which can significantly affect the performance of machine learning models. Methods: To address these challenges, this study proposes an automated system for predicting cervical cancer that efficiently handles missing values with SMOTE features to achieve high accuracy. The proposed system employs a stacked ensemble voting classifier model that combines three machine learning models, along with KNN Imputer and SMOTE up-sampled features for handling missing values. Results: The proposed model achieves 99.99% accuracy, 99.99% precision, 99.99% recall, and 99.99% F1 score when using KNN imputed SMOTE features. The study compares the performance of the proposed model with multiple other machine learning algorithms under four scenarios: with missing values removed, with KNN imputation, with SMOTE features, and with KNN imputed SMOTE features. The study validates the efficacy of the proposed model against existing state-of-the-art approaches. Conclusions: This study investigates the issue of missing values and class imbalance in the data collected for cervical cancer detection and might aid medical practitioners in timely detection and providing cervical cancer patients with better care.
Collapse
Affiliation(s)
- Hanen Karamti
- Department of Computer Sciences, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, P.O. Box 84428, Riyadh 11671, Saudi Arabia;
| | - Raed Alharthi
- Department of Computer Science and Engineering, University of Hafr Al-Batin, Hafar Al-Batin 39524, Saudi Arabia;
| | - Amira Al Anizi
- Department of Computer Sciences, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, P.O. Box 84428, Riyadh 11671, Saudi Arabia;
| | - Reemah M. Alhebshi
- Department of Computer Science, Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah 21589, Saudi Arabia;
| | - Ala’ Abdulmajid Eshmawi
- Department of Cybersecurity, College of Computer Science and Engineering, University of Jeddah, Jeddah 23218, Saudi Arabia;
| | - Shtwai Alsubai
- Department of Computer Science, College of Computer Engineering and Sciences, Prince Sattam bin Abdulaziz University, P.O. Box 151, Al-Kharj 11942, Saudi Arabia;
| | - Muhammad Umer
- Department of Computer Science & Information Technology, The Islamia University of Bahawalpur, Bahawalpur 63100, Pakistan
| |
Collapse
|
13
|
Iqbal U, Imtiaz R, Saudagar AKJ, Alam KA. CRV-NET: Robust Intensity Recognition of Coronavirus in Lung Computerized Tomography Scan Images. Diagnostics (Basel) 2023; 13:diagnostics13101783. [PMID: 37238266 DOI: 10.3390/diagnostics13101783] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2023] [Revised: 05/01/2023] [Accepted: 05/10/2023] [Indexed: 05/28/2023] Open
Abstract
The early diagnosis of infectious diseases is demanded by digital healthcare systems. Currently, the detection of the new coronavirus disease (COVID-19) is a major clinical requirement. For COVID-19 detection, deep learning models are used in various studies, but the robustness is still compromised. In recent years, deep learning models have increased in popularity in almost every area, particularly in medical image processing and analysis. The visualization of the human body's internal structure is critical in medical analysis; many imaging techniques are in use to perform this job. A computerized tomography (CT) scan is one of them, and it has been generally used for the non-invasive observation of the human body. The development of an automatic segmentation method for lung CT scans showing COVID-19 can save experts time and can reduce human error. In this article, the CRV-NET is proposed for the robust detection of COVID-19 in lung CT scan images. A public dataset (SARS-CoV-2 CT Scan dataset), is used for the experimental work and customized according to the scenario of the proposed model. The proposed modified deep-learning-based U-Net model is trained on a custom dataset with 221 training images and their ground truth, which was labeled by an expert. The proposed model is tested on 100 test images, and the results show that the model segments COVID-19 with a satisfactory level of accuracy. Moreover, the comparison of the proposed CRV-NET with different state-of-the-art convolutional neural network models (CNNs), including the U-Net Model, shows better results in terms of accuracy (96.67%) and robustness (low epoch value in detection and the smallest training data size).
Collapse
Affiliation(s)
- Uzair Iqbal
- Department of Artificial Intelligence and Data Science, National University of Computer and Emerging Sciences, Islamabad Campus, Islamabad 44000, Pakistan
| | - Romil Imtiaz
- Information and Communication Engineering, Northwestern Polytechnical University, Xi'an 710072, China
| | - Abdul Khader Jilani Saudagar
- Information Systems Department, College of Computer and Information Sciences, Imam Mohammad Ibn Saud Islamic University (IMSIU), Riyadh 11432, Saudi Arabia
| | - Khubaib Amjad Alam
- Department of Software Engineering, National University of Computer and Emerging Sciences, Islamabad Campus, Islamabad 44000, Pakistan
| |
Collapse
|
14
|
Mustafa WA, Ismail S, Mokhtar FS, Alquran H, Al-Issa Y. Cervical Cancer Detection Techniques: A Chronological Review. Diagnostics (Basel) 2023; 13:diagnostics13101763. [PMID: 37238248 DOI: 10.3390/diagnostics13101763] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2023] [Revised: 05/12/2023] [Accepted: 05/15/2023] [Indexed: 05/28/2023] Open
Abstract
Cervical cancer is known as a major health problem globally, with high mortality as well as incidence rates. Over the years, there have been significant advancements in cervical cancer detection techniques, leading to improved accuracy, sensitivity, and specificity. This article provides a chronological review of cervical cancer detection techniques, from the traditional Pap smear test to the latest computer-aided detection (CAD) systems. The traditional method for cervical cancer screening is the Pap smear test. It consists of examining cervical cells under a microscope for abnormalities. However, this method is subjective and may miss precancerous lesions, leading to false negatives and a delayed diagnosis. Therefore, a growing interest has been in shown developing CAD methods to enhance cervical cancer screening. However, the effectiveness and reliability of CAD systems are still being evaluated. A systematic review of the literature was performed using the Scopus database to identify relevant studies on cervical cancer detection techniques published between 1996 and 2022. The search terms used included "(cervix OR cervical) AND (cancer OR tumor) AND (detect* OR diagnosis)". Studies were included if they reported on the development or evaluation of cervical cancer detection techniques, including traditional methods and CAD systems. The results of the review showed that CAD technology for cervical cancer detection has come a long way since it was introduced in the 1990s. Early CAD systems utilized image processing and pattern recognition techniques to analyze digital images of cervical cells, with limited success due to low sensitivity and specificity. In the early 2000s, machine learning (ML) algorithms were introduced to the CAD field for cervical cancer detection, allowing for more accurate and automated analysis of digital images of cervical cells. ML-based CAD systems have shown promise in several studies, with improved sensitivity and specificity reported compared to traditional screening methods. In summary, this chronological review of cervical cancer detection techniques highlights the significant advancements made in this field over the past few decades. ML-based CAD systems have shown promise for improving the accuracy and sensitivity of cervical cancer detection. The Hybrid Intelligent System for Cervical Cancer Diagnosis (HISCCD) and the Automated Cervical Screening System (ACSS) are two of the most promising CAD systems. Still, deeper validation and research are required before being broadly accepted. Continued innovation and collaboration in this field may help enhance cervical cancer detection as well as ultimately reduce the disease's burden on women worldwide.
Collapse
Affiliation(s)
- Wan Azani Mustafa
- Faculty of Electrical Engineering Technology, Campus Pauh Putra, Universiti Malaysia Perlis, Arau 02600, Perlis, Malaysia
- Advanced Computing (AdvComp), Centre of Excellence (CoE), Universiti Malaysia Perlis, Arau 02600, Perlis, Malaysia
| | - Shahrina Ismail
- Faculty of Science and Technology, Universiti Sains Islam Malaysia (USIM), Bandar Baru Nilai 71800, Negeri Sembilan, Malaysia
| | - Fahirah Syaliza Mokhtar
- Faculty of Business, Economy and Social Development, Universiti Malaysia Terengganu, Kuala Nerus 21300, Terengganu, Malaysia
| | - Hiam Alquran
- Department of Biomedical Systems and Informatics Engineering, Yarmouk University, 556, Irbid 21163, Jordan
| | - Yazan Al-Issa
- Department of Computer Engineering, Yarmouk University, Irbid 22110, Jordan
| |
Collapse
|
15
|
Farhan AMQ, Yang S. Automatic lung disease classification from the chest X-ray images using hybrid deep learning algorithm. MULTIMEDIA TOOLS AND APPLICATIONS 2023:1-27. [PMID: 37362647 PMCID: PMC10030349 DOI: 10.1007/s11042-023-15047-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/04/2022] [Revised: 08/30/2022] [Accepted: 02/27/2023] [Indexed: 06/28/2023]
Abstract
The chest X-ray images provide vital information about the congestion cost-effectively. We propose a novel Hybrid Deep Learning Algorithm (HDLA) framework for automatic lung disease classification from chest X-ray images. The model consists of steps including pre-processing of chest X-ray images, automatic feature extraction, and detection. In a pre-processing step, our goal is to improve the quality of raw chest X-ray images using the combination of optimal filtering without data loss. The robust Convolutional Neural Network (CNN) is proposed using the pre-trained model for automatic lung feature extraction. We employed the 2D CNN model for the optimum feature extraction in minimum time and space requirements. The proposed 2D CNN model ensures robust feature learning with highly efficient 1D feature estimation from the input pre-processed image. As the extracted 1D features have suffered from significant scale variations, we optimized them using min-max scaling. We classify the CNN features using the different machine learning classifiers such as AdaBoost, Support Vector Machine (SVM), Random Forest (RM), Backpropagation Neural Network (BNN), and Deep Neural Network (DNN). The experimental results claim that the proposed model improves the overall accuracy by 3.1% and reduces the computational complexity by 16.91% compared to state-of-the-art methods.
Collapse
Affiliation(s)
- Abobaker Mohammed Qasem Farhan
- School of information and Software Engineering, University of Electronic Science and Technology of China, Chengdu, China
| | - Shangming Yang
- School of information and Software Engineering, University of Electronic Science and Technology of China, Chengdu, China
| |
Collapse
|
16
|
Jia L, Wu W, Hou G, Zhao J, Qiang Y, Zhang Y, Cai M. Residual neural network with mixed loss based on batch training technique for identification of EGFR mutation status in lung cancer. MULTIMEDIA TOOLS AND APPLICATIONS 2023; 82:1-21. [PMID: 37362735 PMCID: PMC10020767 DOI: 10.1007/s11042-023-14876-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/26/2021] [Revised: 11/11/2022] [Accepted: 02/06/2023] [Indexed: 06/28/2023]
Abstract
Epidermal growth factor receptor (EGFR) is the key to targeted therapy with tyrosine kinase inhibitors in lung cancer. Traditional identification of EGFR mutation status requires biopsy and sequence testing, which may not be suitable for certain groups who cannot perform biopsy. In this paper, using easily accessible and non-invasive CT images, the residual neural network (ResNet) with mixed loss based on batch training technique is proposed for identification of EGFR mutation status in lung cancer. In this model, the ResNet is regarded as the baseline for feature extraction to avoid the gradient disappearance. Besides, a new mixed loss based on the batch similarity and the cross entropy is proposed to guide the network to better learn the model parameters. The proposed mixed loss utilizes the similarity among batch samples to evaluate the distribution of training data, which can reduce the similarity of different classes and the difference of the same classes. In the experiments, VGG16Net, DenseNet, ResNet18, ResNet34 and ResNet50 models with the mixed loss are trained on the public CT dataset with 155 patients including EGFR mutation status from TCIA. The trained networks are employed to the collected preoperative CT dataset with 56 patients from the cooperative hospital for validating the efficiency of the proposed models. Experimental results show that the proposed models are more appropriate and effective on the lung cancer dataset for identifying the EGFR mutation status. In these models, the ResNet34 with mixed loss is optimal (accuracy = 81.58%, AUC = 0.8861, sensitivity = 80.02%, specificity = 82.90%).
Collapse
Affiliation(s)
- Liye Jia
- College of Information and Computer, Taiyuan University of Technology, Taiyuan, 030600 China
| | - Wei Wu
- Department of Physiology, Shanxi Medical University, Taiyuan, 030051 China
| | - Guojie Hou
- College of Information and Computer, Taiyuan University of Technology, Taiyuan, 030600 China
| | - Juanjuan Zhao
- College of Information and Computer, Taiyuan University of Technology, Taiyuan, 030600 China
| | - Yan Qiang
- College of Information and Computer, Taiyuan University of Technology, Taiyuan, 030600 China
| | - Yanan Zhang
- College of Information and Computer, Taiyuan University of Technology, Taiyuan, 030600 China
| | - Meiling Cai
- College of Information and Computer, Taiyuan University of Technology, Taiyuan, 030600 China
| |
Collapse
|
17
|
Ou SM, Tsai MT, Lee KH, Tseng WC, Yang CY, Chen TH, Bin PJ, Chen TJ, Lin YP, Sheu WHH, Chu YC, Tarng DC. Prediction of the risk of developing end-stage renal diseases in newly diagnosed type 2 diabetes mellitus using artificial intelligence algorithms. BioData Min 2023; 16:8. [PMID: 36899426 PMCID: PMC10007785 DOI: 10.1186/s13040-023-00324-2] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2022] [Accepted: 02/17/2023] [Indexed: 03/12/2023] Open
Abstract
OBJECTIVES Type 2 diabetes mellitus (T2DM) imposes a great burden on healthcare systems, and these patients experience higher long-term risks for developing end-stage renal disease (ESRD). Managing diabetic nephropathy becomes more challenging when kidney function starts declining. Therefore, developing predictive models for the risk of developing ESRD in newly diagnosed T2DM patients may be helpful in clinical settings. METHODS We established machine learning models constructed from a subset of clinical features collected from 53,477 newly diagnosed T2DM patients from January 2008 to December 2018 and then selected the best model. The cohort was divided, with 70% and 30% of patients randomly assigned to the training and testing sets, respectively. RESULTS The discriminative ability of our machine learning models, including logistic regression, extra tree classifier, random forest, gradient boosting decision tree (GBDT), extreme gradient boosting (XGBoost), and light gradient boosting machine were evaluated across the cohort. XGBoost yielded the highest area under the receiver operating characteristic curve (AUC) of 0.953, followed by extra tree and GBDT, with AUC values of 0.952 and 0.938 on the testing dataset. The SHapley Additive explanation summary plot in the XGBoost model illustrated that the top five important features included baseline serum creatinine, mean serum creatine within 1 year before the diagnosis of T2DM, high-sensitivity C-reactive protein, spot urine protein-to-creatinine ratio and female gender. CONCLUSIONS Because our machine learning prediction models were based on routinely collected clinical features, they can be used as risk assessment tools for developing ESRD. By identifying high-risk patients, intervention strategies may be provided at an early stage.
Collapse
Affiliation(s)
- Shuo-Ming Ou
- Division of Nephrology, Department of Medicine, Taipei Veterans General Hospital, 201, Section 2, Shih-Pai Road, Taipei, 11217, Taiwan.,School of Medicine, College of Medicine, National Yang Ming Chiao Tung University, Taipei, Taiwan.,Institute of Clinical Medicine, National Yang Ming Chiao Tung University, Taipei, Taiwan
| | - Ming-Tsun Tsai
- Division of Nephrology, Department of Medicine, Taipei Veterans General Hospital, 201, Section 2, Shih-Pai Road, Taipei, 11217, Taiwan.,School of Medicine, College of Medicine, National Yang Ming Chiao Tung University, Taipei, Taiwan.,Institute of Clinical Medicine, National Yang Ming Chiao Tung University, Taipei, Taiwan
| | - Kuo-Hua Lee
- Division of Nephrology, Department of Medicine, Taipei Veterans General Hospital, 201, Section 2, Shih-Pai Road, Taipei, 11217, Taiwan.,School of Medicine, College of Medicine, National Yang Ming Chiao Tung University, Taipei, Taiwan.,Institute of Clinical Medicine, National Yang Ming Chiao Tung University, Taipei, Taiwan
| | - Wei-Cheng Tseng
- Division of Nephrology, Department of Medicine, Taipei Veterans General Hospital, 201, Section 2, Shih-Pai Road, Taipei, 11217, Taiwan.,School of Medicine, College of Medicine, National Yang Ming Chiao Tung University, Taipei, Taiwan.,Institute of Clinical Medicine, National Yang Ming Chiao Tung University, Taipei, Taiwan
| | - Chih-Yu Yang
- Division of Nephrology, Department of Medicine, Taipei Veterans General Hospital, 201, Section 2, Shih-Pai Road, Taipei, 11217, Taiwan.,School of Medicine, College of Medicine, National Yang Ming Chiao Tung University, Taipei, Taiwan.,Institute of Clinical Medicine, National Yang Ming Chiao Tung University, Taipei, Taiwan
| | - Tz-Heng Chen
- Division of Nephrology, Department of Medicine, Taipei Veterans General Hospital, 201, Section 2, Shih-Pai Road, Taipei, 11217, Taiwan.,School of Medicine, College of Medicine, National Yang Ming Chiao Tung University, Taipei, Taiwan
| | - Pin-Jie Bin
- Graduate Institute of Medicine, College of Medicine, Kaohsiung Medical University, Kaohsiung, Taiwan
| | - Tzeng-Ji Chen
- School of Medicine, College of Medicine, National Yang Ming Chiao Tung University, Taipei, Taiwan.,Department of Family Medicine, Taipei Veterans General Hospital, Taipei, Taiwan.,Department of Family Medicine, Taipei Veterans General Hospital, Hsinchu Branch, Hsinchu, Taiwan.,Institute of Hospital and Health Care Administration, National Yang Ming Chiao Tung University, Taipei, Taiwan
| | - Yao-Ping Lin
- Division of Nephrology, Department of Medicine, Taipei Veterans General Hospital, 201, Section 2, Shih-Pai Road, Taipei, 11217, Taiwan.,School of Medicine, College of Medicine, National Yang Ming Chiao Tung University, Taipei, Taiwan.,Institute of Clinical Medicine, National Yang Ming Chiao Tung University, Taipei, Taiwan
| | - Wayne Huey-Herng Sheu
- School of Medicine, College of Medicine, National Yang Ming Chiao Tung University, Taipei, Taiwan.,Division of Endocrinology and Metabolism, Department of Internal Medicine, Taipei Veterans General Hospital, Taipei, Taiwan.,Institute of Molecular and Genetic Medicine, National Health Research Institute, Miaoli, Taiwan
| | - Yuan-Chia Chu
- Information Management Office, Taipei Veterans General Hospital, 201, Section 2, Shih-Pai Road, Taipei, 11217, Taiwan. .,Big Data Center, Taipei Veterans General Hospital, Taipei, Taiwan. .,Department of Information Management, National Taipei University of Nursing and Health Sciences, Taipei, Taiwan.
| | - Der-Cherng Tarng
- Division of Nephrology, Department of Medicine, Taipei Veterans General Hospital, 201, Section 2, Shih-Pai Road, Taipei, 11217, Taiwan. .,School of Medicine, College of Medicine, National Yang Ming Chiao Tung University, Taipei, Taiwan. .,Institute of Clinical Medicine, National Yang Ming Chiao Tung University, Taipei, Taiwan. .,Department and Institute of Physiology, National Yang Ming Chiao Tung University, Taipei, Taiwan.
| |
Collapse
|
18
|
Goel A, Goel AK, Kumar A. Performance analysis of multiple input single layer neural network hardware chip. MULTIMEDIA TOOLS AND APPLICATIONS 2023; 82:1-22. [PMID: 36846531 PMCID: PMC9939870 DOI: 10.1007/s11042-023-14627-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/12/2021] [Revised: 02/24/2022] [Accepted: 02/03/2023] [Indexed: 06/18/2023]
Abstract
An artificial neural network (ANN) is a computational system that is designed to replicate and process the behavior of the human brain using neuron nodes. ANNs are made up of thousands of processing neurons with input and output modules that self-learn and compute data to offer the best results. The hardware realization of the massive neuron system is a difficult task. The research article emphasizes the design and realization of multiple input perceptron chips in Xilinx integrated system environment (ISE) 14.7 software. The proposed single-layer ANN architecture is scalable and accepts variable 64 inputs. The design is distributed in eight parallel blocks of ANN in which one block consists of eight neurons. The performance of the chip is analyzed based on the hardware utilization, memory, combinational delay, and different processing elements with targeted hardware Virtex-5 field-programmable gate array (FPGA). The chip simulation is performed in Modelsim 10.0 software. Artificial intelligence has a wide range of applications, and cutting-edge computing technology has a vast market. Hardware processors that are fast, affordable, and suited for ANN applications and accelerators are being developed by the industries. The novelty of the work is that it provides a parallel and scalable design platform on FPGA for fast switching, which is the current need in the forthcoming neuromorphic hardware.
Collapse
Affiliation(s)
- Akash Goel
- Department of Computer Science & Engineering, Galgotia’s University, Greater Noida, NCR India
| | - Amit Kumar Goel
- Department of Computer Science & Engineering, Galgotia’s University, Greater Noida, NCR India
| | - Adesh Kumar
- Department of Electrical & Electronics Engineering, University of Petroleum and Energy Studies, Dehradun, India
| |
Collapse
|
19
|
Du X, Zuo E, Chu Z, He Z, Yu J. Fluctuation-based outlier detection. Sci Rep 2023; 13:2408. [PMID: 36765095 PMCID: PMC9918462 DOI: 10.1038/s41598-023-29549-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2022] [Accepted: 02/06/2023] [Indexed: 02/12/2023] Open
Abstract
Outlier detection is an important topic in machine learning and has been used in a wide range of applications. Outliers are objects that are few in number and deviate from the majority of objects. As a result of these two properties, we show that outliers are susceptible to a mechanism called fluctuation. This article proposes a method called fluctuation-based outlier detection (FBOD) that achieves a low linear time complexity and detects outliers purely based on the concept of fluctuation without employing any distance, density or isolation measure. Fundamentally different from all existing methods. FBOD first converts the Euclidean structure datasets into graphs by using random links, then propagates the feature value according to the connection of the graph. Finally, by comparing the difference between the fluctuation of an object and its neighbors, FBOD determines the object with a larger difference as an outlier. The results of experiments comparing FBOD with eight state-of-the-art algorithms on eight real-worlds tabular datasets and three video datasets show that FBOD outperforms its competitors in the majority of cases and that FBOD has only 5% of the execution time of the fastest algorithm. The experiment codes are available at: https://github.com/FluctuationOD/Fluctuation-based-Outlier-Detection .
Collapse
Affiliation(s)
- Xusheng Du
- School of Information Science and Engineering, Xinjiang University, Ürümqi, 830046, China.
| | - Enguang Zuo
- grid.413254.50000 0000 9544 7024School of Information Science and Engineering, Xinjiang University, Ürümqi, 830046 China
| | - Zheng Chu
- grid.413254.50000 0000 9544 7024School of Information Science and Engineering, Xinjiang University, Ürümqi, 830046 China
| | - Zhenzhen He
- grid.413254.50000 0000 9544 7024School of Information Science and Engineering, Xinjiang University, Ürümqi, 830046 China
| | - Jiong Yu
- grid.413254.50000 0000 9544 7024School of Information Science and Engineering, Xinjiang University, Ürümqi, 830046 China
| |
Collapse
|
20
|
Tumor Localization and Classification from MRI of Brain using Deep Convolution Neural Network and Salp Swarm Algorithm. Cognit Comput 2023. [DOI: 10.1007/s12559-022-10096-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
|
21
|
Personal Health Train Architecture with Dynamic Cloud Staging. SN COMPUTER SCIENCE 2023; 4:14. [PMID: 36274815 PMCID: PMC9574821 DOI: 10.1007/s42979-022-01422-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/17/2022] [Accepted: 09/16/2022] [Indexed: 11/05/2022]
Abstract
Scientific advances, especially in the healthcare domain, can be accelerated by making data available for analysis. However, in traditional data analysis systems, data need to be moved to a central processing unit that performs analyses, which may be undesirable, e.g. due to privacy regulations in case these data contain personal information. This paper discusses the Personal Health Train (PHT) approach in which data processing is brought to the (personal health) data rather than the other way around, allowing (private) data accessed to be controlled, and to observe ethical and legal concerns. This paper introduces the PHT architecture and discusses the data staging solution that allows processing to be delegated to components spawned in a private cloud environment in case the (health) organisation hosting the data has limited resources to execute the required processing. This paper shows the feasibility and suitability of the solution with a relatively simple, yet representative, case study of data analysis of Covid-19 infections, which is performed by components that are created on demand and run in the Amazon Web Services platform. This paper also shows that the performance of our solution is acceptable, and that our solution is scalable. This paper demonstrates that the PHT approach enables data analysis with controlled access, preserving privacy and complying with regulations such as GDPR, while the solution is deployed in a private cloud environment.
Collapse
|
22
|
Chen X, Aljrees T, Umer M, Saidani O, Almuqren L, Mzoughi O, Ishaq A, Ashraf I. Cervical cancer detection using K nearest neighbor imputer and stacked ensemble learningmodel. Digit Health 2023; 9:20552076231203802. [PMID: 37799501 PMCID: PMC10548812 DOI: 10.1177/20552076231203802] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2023] [Accepted: 09/08/2023] [Indexed: 10/07/2023] Open
Abstract
Objective Cervical cancer stands as a leading cause of mortality among women in developing nations. To ensure the reduction of its adverse consequences, the primary protocols to be adhered to involve early detection and treatment under the guidance of expert medical professionals. An effective approach for identifying this form of malignancy involves the examination of Pap smear images. However, in the context of automating cervical cancer detection, many of the existing datasets frequently exhibit missing data points, a factor that can substantially impact the effectiveness of machine learning models. Methods In response to these hurdles, this research introduces an automated system designed to predict cervical cancer with a dual focus: adeptly managing missing data while attaining remarkable accuracy. The system's core is built upon a stacked ensemble voting classifier model, which amalgamates three distinct machine learning models, all harmoniously integrated with the KNN Imputer to address the issue of missing values. Results The model put forth attains an accuracy of 99.41%, precision of 97.63%, recall of 95.96%, and an F1 score of 96.76% when incorporating the KNN imputation method. The investigation conducts a comparative analysis, contrasting the performance of this model with seven alternative machine learning algorithms in two scenarios: one where missing values are eliminated, and another employing KNN imputation. This study offers validation of the effectiveness of the proposed model in comparison to current state-of-the-art methodologies. Conclusions This research delves into the challenge of handling missing data in the dataset utilized for cervical cancer detection. The findings have the potential to assist healthcare professionals in achieving early detection and enhancing the quality of care provided to individuals affected by cervical cancer.
Collapse
Affiliation(s)
- Xiaoyuan Chen
- Huzhou Key Laboratory of Green Energy Materials and Battery Cascade Utilization, School of Intelligent Manufacturing, Huzhou College, Huzhou, P.R. China
| | - Turki Aljrees
- Department College of Computer Science and Engineering, University of Hafr Al-Batin, Hafar Al-Batin, Saudi Arabia
| | - Muhammad Umer
- Department of Computer Science & Information Technology, The Islamia University of Bahawalpur, Bahawalpur, Pakistan
| | - Oumaima Saidani
- Department of Information Systems, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia
| | - Latifah Almuqren
- Department of Information Systems, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia
| | - Olfa Mzoughi
- Department of Computer Science, College of Sciences and Humanities-Aflaj, Prince Sattam bin Abdulaziz University, Aflaj, Saudi Arabia
| | - Abid Ishaq
- Department of Computer Science & Information Technology, The Islamia University of Bahawalpur, Bahawalpur, Pakistan
| | - Imran Ashraf
- Department of Information and Communication Engineering, Yeungnam University, Gyeongsan, South Korea
| |
Collapse
|
23
|
Abdolkarimi V, Sari A, Shokri S. A hybrid multiscale filter along with an improved adaptive SVR technique for fault diagnosis and machine learning modeling: forecasting the octane number of gasoline in isomerization reactor. Neural Comput Appl 2022. [DOI: 10.1007/s00521-022-08128-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
|
24
|
Srinivasu PN, Shafi J, Krishna TB, Sujatha CN, Praveen SP, Ijaz MF. Using Recurrent Neural Networks for Predicting Type-2 Diabetes from Genomic and Tabular Data. Diagnostics (Basel) 2022; 12:3067. [PMID: 36553074 PMCID: PMC9776641 DOI: 10.3390/diagnostics12123067] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2022] [Revised: 12/01/2022] [Accepted: 12/04/2022] [Indexed: 12/12/2022] Open
Abstract
The development of genomic technology for smart diagnosis and therapies for various diseases has lately been the most demanding area for computer-aided diagnostic and treatment research. Exponential breakthroughs in artificial intelligence and machine intelligence technologies could pave the way for identifying challenges afflicting the healthcare industry. Genomics is paving the way for predicting future illnesses, including cancer, Alzheimer's disease, and diabetes. Machine learning advancements have expedited the pace of biomedical informatics research and inspired new branches of computational biology. Furthermore, knowing gene relationships has resulted in developing more accurate models that can effectively detect patterns in vast volumes of data, making classification models important in various domains. Recurrent Neural Network models have a memory that allows them to quickly remember knowledge from previous cycles and process genetic data. The present work focuses on type 2 diabetes prediction using gene sequences derived from genomic DNA fragments through automated feature selection and feature extraction procedures for matching gene patterns with training data. The suggested model was tested using tabular data to predict type 2 diabetes based on several parameters. The performance of neural networks incorporating Recurrent Neural Network (RNN) components, Long Short-Term Memory (LSTM), and Gated Recurrent Units (GRU) was tested in this research. The model's efficiency is assessed using the evaluation metrics such as Sensitivity, Specificity, Accuracy, F1-Score, and Mathews Correlation Coefficient (MCC). The suggested technique predicted future illnesses with fair Accuracy. Furthermore, our research showed that the suggested model could be used in real-world scenarios and that input risk variables from an end-user Android application could be kept and evaluated on a secure remote server.
Collapse
Affiliation(s)
- Parvathaneni Naga Srinivasu
- Department of Computer Science and Engineering, Prasad V. Potluri Siddhartha Institute of Technology, Vijayawada 520007, Andhra Pradesh, India
| | - Jana Shafi
- Department of Computer Science, College of Arts and Science, Prince Sattam bin Abdul Aziz University, Wadi Ad-Dawasir 11991, Saudi Arabia
| | - T Balamurali Krishna
- Department of Computer Science and Engineering, Dhanekula Institute of Engineering and Technology, Vijayawada 521139, Andhra Pradesh, India
| | - Canavoy Narahari Sujatha
- Department of Electronics and Communication Engineering, Sreenidhi Institute of Science and Technology, Hyderabad 501301, Telangana, India
| | - S Phani Praveen
- Department of Computer Science and Engineering, Prasad V. Potluri Siddhartha Institute of Technology, Vijayawada 520007, Andhra Pradesh, India
| | - Muhammad Fazal Ijaz
- Department of Intelligent Mechatronics Engineering, Sejong University, Seoul 05006, Republic of Korea
| |
Collapse
|
25
|
Praveen SP, Srinivasu PN, Shafi J, Wozniak M, Ijaz MF. ResNet-32 and FastAI for diagnoses of ductal carcinoma from 2D tissue slides. Sci Rep 2022; 12:20804. [PMID: 36460697 PMCID: PMC9716161 DOI: 10.1038/s41598-022-25089-2] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2022] [Accepted: 11/23/2022] [Indexed: 12/03/2022] Open
Abstract
Carcinoma is a primary source of morbidity in women globally, with metastatic disease accounting for most deaths. Its early discovery and diagnosis may significantly increase the odds of survival. Breast cancer imaging is critical for early identification, clinical staging, management choices, and treatment planning. In the current study, the FastAI technology is used with the ResNet-32 model to precisely identify ductal carcinoma. ResNet-32 is having few layers comparted to majority of its counterparts with almost identical performance. FastAI offers a rapid approximation toward the outcome for deep learning models via GPU acceleration and a faster callback mechanism, which would result in faster execution of the model with lesser code and yield better precision in classifying the tissue slides. Residual Network (ResNet) is proven to handle the vanishing gradient and effective feature learning better. Integration of two computationally efficient technologies has yielded a precision accuracy with reasonable computational efforts. The proposed model has shown considerable efficiency in the evaluating parameters like sensitivity, specificity, accuracy, and F1 Score against the other dominantly used deep learning models. These insights have shown that the proposed approach might assist practitioners in analyzing Breast Cancer (BC) cases appropriately, perhaps saving future complications and death. Clinical and pathological analysis and predictive accuracy have been improved with digital image processing.
Collapse
Affiliation(s)
- S Phani Praveen
- Department of Computer Science and Engineering, Prasad V Potluri Siddhartha Institute of Technology, Vijayawada, 520007, India
| | - Parvathaneni Naga Srinivasu
- Department of Computer Science and Engineering, Prasad V Potluri Siddhartha Institute of Technology, Vijayawada, 520007, India
| | - Jana Shafi
- Department of Computer Science, College of Arts and Science, Prince Sattam bin Abdul Aziz University, Wadi Ad-Dawasir, 11991, Saudi Arabia
| | - Marcin Wozniak
- Faculty of Applied Mathematics, Silesian University of Technology, 44-100, Gliwice, Poland.
| | - Muhammad Fazal Ijaz
- Department of Mechanical Engineering, Faculty of Engineering and Information Technology, The University of Melbourne, Grattam Street, Parkville, VIC, 3010, Australia.
| |
Collapse
|
26
|
Allahqoli L, Laganà AS, Mazidimoradi A, Salehiniya H, Günther V, Chiantera V, Karimi Goghari S, Ghiasvand MM, Rahmani A, Momenimovahed Z, Alkatout I. Diagnosis of Cervical Cancer and Pre-Cancerous Lesions by Artificial Intelligence: A Systematic Review. Diagnostics (Basel) 2022; 12:2771. [PMID: 36428831 PMCID: PMC9689914 DOI: 10.3390/diagnostics12112771] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2022] [Revised: 11/06/2022] [Accepted: 11/10/2022] [Indexed: 11/16/2022] Open
Abstract
OBJECTIVE The likelihood of timely treatment for cervical cancer increases with timely detection of abnormal cervical cells. Automated methods of detecting abnormal cervical cells were established because manual identification requires skilled pathologists and is time consuming and prone to error. The purpose of this systematic review is to evaluate the diagnostic performance of artificial intelligence (AI) technologies for the prediction, screening, and diagnosis of cervical cancer and pre-cancerous lesions. MATERIALS AND METHODS Comprehensive searches were performed on three databases: Medline, Web of Science Core Collection (Indexes = SCI-EXPANDED, SSCI, A & HCI Timespan) and Scopus to find papers published until July 2022. Articles that applied any AI technique for the prediction, screening, and diagnosis of cervical cancer were included in the review. No time restriction was applied. Articles were searched, screened, incorporated, and analyzed in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-analyses guidelines. RESULTS The primary search yielded 2538 articles. After screening and evaluation of eligibility, 117 studies were incorporated in the review. AI techniques were found to play a significant role in screening systems for pre-cancerous and cancerous cervical lesions. The accuracy of the algorithms in predicting cervical cancer varied from 70% to 100%. AI techniques make a distinction between cancerous and normal Pap smears with 80-100% accuracy. AI is expected to serve as a practical tool for doctors in making accurate clinical diagnoses. The reported sensitivity and specificity of AI in colposcopy for the detection of CIN2+ were 71.9-98.22% and 51.8-96.2%, respectively. CONCLUSION The present review highlights the acceptable performance of AI systems in the prediction, screening, or detection of cervical cancer and pre-cancerous lesions, especially when faced with a paucity of specialized centers or medical resources. In combination with human evaluation, AI could serve as a helpful tool in the interpretation of cervical smears or images.
Collapse
Affiliation(s)
- Leila Allahqoli
- Midwifery Department, Ministry of Health and Medical Education, Tehran 1467664961, Iran
| | - Antonio Simone Laganà
- Unit of Gynecologic Oncology, ARNAS “Civico-Di Cristina-Benfratelli”, Department of Health Promotion, Mother and Child Care, Internal Medicine and Medical Specialties (PROMISE), University of Palermo, 90127 Palermo, Italy
| | - Afrooz Mazidimoradi
- Neyriz Public Health Clinic, Shiraz University of Medical Sciences, Shiraz 7134814336, Iran
| | - Hamid Salehiniya
- Social Determinants of Health Research Center, Birjand University of Medical Sciences, Birjand 9717853577, Iran
| | - Veronika Günther
- University Hospitals Schleswig-Holstein, Campus Kiel, Kiel School of Gynaecological Endoscopy, Arnold-Heller-Str. 3, Haus 24, 24105 Kiel, Germany
| | - Vito Chiantera
- Unit of Gynecologic Oncology, ARNAS “Civico-Di Cristina-Benfratelli”, Department of Health Promotion, Mother and Child Care, Internal Medicine and Medical Specialties (PROMISE), University of Palermo, 90127 Palermo, Italy
| | - Shirin Karimi Goghari
- School of Industrial and Systems Engineering, Tarbiat Modares University (TMU), Tehran 1411713114, Iran
| | - Mohammad Matin Ghiasvand
- Department of Computer Engineering, Amirkabir University of Technology (AUT), Tehran 1591634311, Iran
| | - Azam Rahmani
- Nursing and Midwifery Care Research Centre, School of Nursing and Midwifery, Tehran University of Medical Sciences, Tehran 141973317, Iran
| | - Zohre Momenimovahed
- Reproductive Health Department, Qom University of Medical Sciences, Qom 3716993456, Iran
| | - Ibrahim Alkatout
- University Hospitals Schleswig-Holstein, Campus Kiel, Kiel School of Gynaecological Endoscopy, Arnold-Heller-Str. 3, Haus 24, 24105 Kiel, Germany
| |
Collapse
|
27
|
Tang J, Wang X, Wan H, Lin C, Shao Z, Chang Y, Wang H, Wu Y, Zhang T, Du Y. Joint modeling strategy for using electronic medical records data to build machine learning models: an example of intracerebral hemorrhage. BMC Med Inform Decis Mak 2022; 22:278. [PMID: 36284327 PMCID: PMC9594939 DOI: 10.1186/s12911-022-02018-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2022] [Accepted: 10/10/2022] [Indexed: 11/28/2022] Open
Abstract
Background Outliers and class imbalance in medical data could affect the accuracy of machine learning models. For physicians who want to apply predictive models, how to use the data at hand to build a model and what model to choose are very thorny problems. Therefore, it is necessary to consider outliers, imbalanced data, model selection, and parameter tuning when modeling. Methods This study used a joint modeling strategy consisting of: outlier detection and removal, data balancing, model fitting and prediction, performance evaluation. We collected medical record data for all ICH patients with admissions in 2017–2019 from Sichuan Province. Clinical and radiological variables were used to construct models to predict mortality outcomes 90 days after discharge. We used stacking ensemble learning to combine logistic regression (LR), random forest (RF), artificial neural network (ANN), support vector machine (SVM), and k-nearest neighbors (KNN) models. Accuracy, sensitivity, specificity, AUC, precision, and F1 score were used to evaluate model performance. Finally, we compared all 84 combinations of the joint modeling strategy, including training set with and without cross-validated committees filter (CVCF), five resampling techniques (random under-sampling (RUS), random over-sampling (ROS), adaptive synthetic sampling (ADASYN), Borderline synthetic minority oversampling technique (Borderline SMOTE), synthetic minority oversampling technique and edited nearest neighbor (SMOTEENN)) and no resampling, seven models (LR, RF, ANN, SVM, KNN, Stacking, AdaBoost). Results Among 4207 patients with ICH, 2909 (69.15%) survived 90 days after discharge, and 1298 (30.85%) died within 90 days after discharge. The performance of all models improved with removing outliers by CVCF except sensitivity. For data balancing processing, the performance of training set without resampling was better than that of training set with resampling in terms of accuracy, specificity, and precision. And the AUC of ROS was the best. For seven models, the average accuracy, specificity, AUC, and precision of RF were the highest. Stacking performed best in F1 score. Among all 84 combinations of joint modeling strategy, eight combinations performed best in terms of accuracy (0.816). For sensitivity, the best performance was SMOTEENN + Stacking (0.662). For specificity, the best performance was CVCF + KNN (0.987). Stacking and AdaBoost had the best performances in AUC (0.756) and F1 score (0.602), respectively. For precision, the best performance was CVCF + SVM (0.938). Conclusion This study proposed a joint modeling strategy including outlier detection and removal, data balancing, model fitting and prediction, performance evaluation, in order to provide a reference for physicians and researchers who want to build their own models. This study illustrated the importance of outlier detection and removal for machine learning and showed that ensemble learning might be a good modeling strategy. Due to the low imbalanced ratio (IR, the ratio of majority class and minority class) in this study, we did not find any improvement in models with resampling in terms of accuracy, specificity, and precision, while ROS performed best on AUC. Supplementary Information The online version contains supplementary material available at 10.1186/s12911-022-02018-x.
Collapse
Affiliation(s)
- Jianxiang Tang
- Department of Epidemiology and Health Statistics, West China School of Public Health and West China Fourth Hospital, Sichuan University, Chengdu, Sichuan, People's Republic of China
| | - Xiaoyu Wang
- Department of Neurosurgery, West China Hospital of Sichuan University, Chengdu, Sichuan, People's Republic of China
| | - Hongli Wan
- Department of Epidemiology and Health Statistics, West China School of Public Health and West China Fourth Hospital, Sichuan University, Chengdu, Sichuan, People's Republic of China.,Health Emergency Management Research Center, West China-PUMC C.C. Chen Institute of Health, Sichuan University, Chengdu, Sichuan, People's Republic of China
| | - Chunying Lin
- Department of Epidemiology and Health Statistics, West China School of Public Health and West China Fourth Hospital, Sichuan University, Chengdu, Sichuan, People's Republic of China.,Health Emergency Management Research Center, West China-PUMC C.C. Chen Institute of Health, Sichuan University, Chengdu, Sichuan, People's Republic of China
| | - Zilun Shao
- Department of Epidemiology and Health Statistics, West China School of Public Health and West China Fourth Hospital, Sichuan University, Chengdu, Sichuan, People's Republic of China.,Health Emergency Management Research Center, West China-PUMC C.C. Chen Institute of Health, Sichuan University, Chengdu, Sichuan, People's Republic of China
| | - Yang Chang
- Department of Epidemiology and Health Statistics, West China School of Public Health and West China Fourth Hospital, Sichuan University, Chengdu, Sichuan, People's Republic of China.,Health Emergency Management Research Center, West China-PUMC C.C. Chen Institute of Health, Sichuan University, Chengdu, Sichuan, People's Republic of China
| | - Hexuan Wang
- Department of Epidemiology and Health Statistics, West China School of Public Health and West China Fourth Hospital, Sichuan University, Chengdu, Sichuan, People's Republic of China.,Health Emergency Management Research Center, West China-PUMC C.C. Chen Institute of Health, Sichuan University, Chengdu, Sichuan, People's Republic of China
| | - Yi Wu
- Department of Epidemiology and Health Statistics, West China School of Public Health and West China Fourth Hospital, Sichuan University, Chengdu, Sichuan, People's Republic of China.,Health Emergency Management Research Center, West China-PUMC C.C. Chen Institute of Health, Sichuan University, Chengdu, Sichuan, People's Republic of China
| | - Tao Zhang
- Department of Epidemiology and Health Statistics, West China School of Public Health and West China Fourth Hospital, Sichuan University, Chengdu, Sichuan, People's Republic of China. .,Health Emergency Management Research Center, West China-PUMC C.C. Chen Institute of Health, Sichuan University, Chengdu, Sichuan, People's Republic of China.
| | - Yu Du
- Health Emergency Management Research Center, West China-PUMC C.C. Chen Institute of Health, Sichuan University, Chengdu, Sichuan, People's Republic of China. .,Department of Emergency and Critical Care Medicine, West China School of Public Health, West China Fourth Hospital, Sichuan University, Chengdu, Sichuan, People's Republic of China.
| |
Collapse
|
28
|
Detection of lung cancer in CT scans using grey wolf optimization algorithm and recurrent neural network. HEALTH AND TECHNOLOGY 2022. [DOI: 10.1007/s12553-022-00700-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
|
29
|
Multistrategy Improved Sparrow Search Algorithm Optimized Deep Neural Network for Esophageal Cancer. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:1036913. [PMID: 36203733 PMCID: PMC9532078 DOI: 10.1155/2022/1036913] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/23/2022] [Revised: 09/05/2022] [Accepted: 09/09/2022] [Indexed: 01/09/2023]
Abstract
Deep neural network is a complex pattern recognition network system. It is widely favored by scholars for its strong nonlinear fitting ability. However, training deep neural network models on small datasets typically realizes worse performance than shallow neural network. In this study, a strategy to improve the sparrow search algorithm based on the iterative map, iterative perturbation, and Gaussian mutation is developed. This optimized strategy improved the sparrow search algorithm validated by fourteen benchmark functions, and the algorithm has the best search accuracy and the fastest convergence speed. An algorithm based on the iterative map, iterative perturbation, and Gaussian mutation improved sparrow search algorithm is designed to optimize deep neural networks. The modified sparrow algorithm is exploited to search for the optimal connection weights of deep neural network. This algorithm is implemented for the esophageal cancer dataset along with the other six algorithms. The proposed model is able to achieve 0.92 under all the eight scoring criteria, which is better than the performance of the other six algorithms. Therefore, an optimized deep neural network based on an improved sparrow search algorithm with iterative map, iterative perturbation, and Gaussian mutation is an effective approach to predict the survival rate of esophageal cancer.
Collapse
|
30
|
Purohit H, Dadhich M, Ajmera PK. Analytical study on users' awareness and acceptability towards adoption of multimodal biometrics (MMB) mechanism in online transactions: a two-stage SEM-ANN approach. MULTIMEDIA TOOLS AND APPLICATIONS 2022; 82:14239-14263. [PMID: 36157357 PMCID: PMC9489485 DOI: 10.1007/s11042-022-13786-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/16/2021] [Revised: 08/10/2022] [Accepted: 09/05/2022] [Indexed: 06/16/2023]
Abstract
The study analyses user awareness of multimodal biometrics and its acceptability for online transactions in the current dynamic world. The study was performed on the five underlying perspectives: User Acceptability, Cognizant Factors towards Biometrics, Technological factors, Perceptional Factors (Fingerprints, Iris, Face Recognition and Voice) and Data Privacy Factors. A questionnaire was prepared and circulated to the 530 biometrics users; on that basis, the corresponding answer was obtained for analysis. SEM is first employed to gauge the research model and test the prominent hypothesized predictors, which are then used as inputs in the neural network to evaluate the relative significance of each predictor variable. By considering the standardized significance of the feed-for-back-propagation of ANN algorithms, the study found a significant effect of DPF_3 (93%), DPF_2 (50%) and DPF_4 (34%) on the adoption of MMB. In the Perceptional construct, PRF_2 (49%) and PRF_3 (33%) was relatively the most important predictor, whereas, in User Acceptability, UAC_2 (37%), UAC_3 & UAC_5 (41%) was vital to be considered. Only one item, TCF_2 (35%), from Technological Factors, followed by Cognizant factors, i.e., CFG_1 (33%), confirmed the best fit model to adopt MMB. The research is a novel effort when compared to past studies as it considered cognizant and perceptual factors in the proposed model, thereby expanding the analytical outlook of MMB literature. Thus, the study also explored several new and valuable practical implications for adopting multimodal instruments of biometrics along with certain limitations.
Collapse
Affiliation(s)
- Himanshu Purohit
- EEE, Birla Institute of Technology & Science Pilani, Pilani, India
| | | | - Pawan K Ajmera
- EEE, Birla Institute of Technology & Science Pilani, Pilani, India
| |
Collapse
|
31
|
Particle Swarm Optimization and Two-Way Fixed-Effects Analysis of Variance for Efficient Brain Tumor Segmentation. Cancers (Basel) 2022; 14:cancers14184399. [PMID: 36139559 PMCID: PMC9496881 DOI: 10.3390/cancers14184399] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2022] [Revised: 09/04/2022] [Accepted: 09/07/2022] [Indexed: 11/29/2022] Open
Abstract
Simple Summary Segmentation of brain tumor images from magnetic resonance imaging (MRI) is a challenging topic in medical image analysis. The brain tumor can take many shapes, and MRI images vary considerably in intensity, making lesion detection difficult for radiologists. This paper proposes a three-step approach to solving this problem: (1) pre-processing, based on morphological operations, is applied to remove the skull bone from the image; (2) the particle swarm optimization (PSO) algorithm, with a two-way fixed-effects analysis of variance (ANOVA)-based fitness function, is used to find the optimal block containing the brain lesion; (3) the K-means clustering algorithm is adopted, to classify the detected block as tumor or non-tumor. An extensive experimental analysis, including visual and statistical evaluations, was conducted, using two MRI databases: a private database provided by the Kouba imaging center—Algiers (KICA)—and the multimodal brain tumor segmentation challenge (BraTS) 2015 database. The results show that the proposed methodology achieved impressive performance, compared to several competing approaches. Abstract Segmentation of brain tumor images, to refine the detection and understanding of abnormal masses in the brain, is an important research topic in medical imaging. This paper proposes a new segmentation method, consisting of three main steps, to detect brain lesions using magnetic resonance imaging (MRI). In the first step, the parts of the image delineating the skull bone are removed, to exclude insignificant data. In the second step, which is the main contribution of this study, the particle swarm optimization (PSO) technique is applied, to detect the block that contains the brain lesions. The fitness function, used to determine the best block among all candidate blocks, is based on a two-way fixed-effects analysis of variance (ANOVA). In the last step of the algorithm, the K-means segmentation method is used in the lesion block, to classify it as a tumor or not. A thorough evaluation of the proposed algorithm was performed, using: (1) a private MRI database provided by the Kouba imaging center—Algiers (KICA); (2) the multimodal brain tumor segmentation challenge (BraTS) 2015 database. Estimates of the selected fitness function were first compared to those based on the sum-of-absolute-differences (SAD) dissimilarity criterion, to demonstrate the efficiency and robustness of the ANOVA. The performance of the optimized brain tumor segmentation algorithm was then compared to the results of several state-of-the-art techniques. The results obtained, by using the Dice coefficient, Jaccard distance, correlation coefficient, and root mean square error (RMSE) measurements, demonstrated the superiority of the proposed optimized segmentation algorithm over equivalent techniques.
Collapse
|
32
|
Cancerous Tumor Controlled Treatment Using Search Heuristic (GA)-Based Sliding Mode and Synergetic Controller. Cancers (Basel) 2022; 14:cancers14174191. [PMID: 36077727 PMCID: PMC9454425 DOI: 10.3390/cancers14174191] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2022] [Revised: 08/11/2022] [Accepted: 08/25/2022] [Indexed: 11/27/2022] Open
Abstract
Simple Summary Cancer is basically a tough condition on a patient’s body where cell grows uncontrollably. Normal cells are affected, which destroys the health of the patient. The main problem in cancer is spreading from one part to another. Therefore, the mathematical modeling of cancerous tumors integrates to check overall stability. A novel approach is introduced such as Bernstein polynomial with combination of genetic algorithm, sliding mode controller, and synergetic control. The proposed solution has easily eliminated cancerous cells within five days using synergetic control. In addition, five cases are incorporated to evaluate error function. In addition, a brief comparative study is added to contrast the simulation results with theoretical modeling. Abstract Cancerous tumor cells divide uncontrollably, which results in either tumor or harm to the immune system of the body. Due to the destructive effects of chemotherapy, optimal medications are needed. Therefore, possible treatment methods should be controlled to maintain the constant/continuous dose for affecting the spreading of cancerous tumor cells. Rapid growth of cells is classified into primary and secondary types. In giving a proper response, the immune system plays an important role. This is considered a natural process while fighting against tumors. In recent days, achieving a better method to treat tumors is the prime focus of researchers. Mathematical modeling of tumors uses combined immune, vaccine, and chemotherapies to check performance stability. In this research paper, mathematical modeling is utilized with reference to cancerous tumor growth, the immune system, and normal cells, which are directly affected by the process of chemotherapy. This paper presents novel techniques, which include Bernstein polynomial (BSP) with genetic algorithm (GA), sliding mode controller (SMC), and synergetic control (SC), for giving a possible solution to the cancerous tumor cells (CCs) model. Through GA, random population is generated to evaluate fitness. SMC is used for the continuous exponential dose of chemotherapy to reduce CCs in about forty-five days. In addition, error function consists of five cases that include normal cells (NCs), immune cells (ICs), CCs, and chemotherapy. Furthermore, the drug control process is explained in all the cases. In simulation results, utilizing SC has completely eliminated CCs in nearly five days. The proposed approach reduces CCs as early as possible.
Collapse
|
33
|
Hazarika RA, Maji AK, Syiem R, Sur SN, Kandar D. Hippocampus Segmentation Using U-Net Convolutional Network from Brain Magnetic Resonance Imaging (MRI). J Digit Imaging 2022; 35:893-909. [PMID: 35304675 PMCID: PMC9485390 DOI: 10.1007/s10278-022-00613-y] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2021] [Revised: 01/04/2022] [Accepted: 01/14/2022] [Indexed: 12/21/2022] Open
Abstract
Hippocampus is a part of the limbic system in human brain that plays an important role in forming memories and dealing with intellectual abilities. In most of the neurological disorders related to dementia, such as, Alzheimer's disease, hippocampus is one of the earliest affected regions. Because there are no effective dementia drugs, an ambient assisted living approach may help to prevent or slow the progression of dementia. By segmenting and analyzing the size/shape of hippocampus, it may be possible to classify the early dementia stages. Because of complex structure, traditional image segmentation techniques can't segment hippocampus accurately. Machine learning (ML) is a well known tool in medical image processing that can predict and deliver the outcomes accurately by learning from it's previous results. Convolutional Neural Networks (CNN) is one of the most popular ML algorithms. In this work, a U-Net Convolutional Network based approach is used for hippocampus segmentation from 2D brain images. It is observed that, the original U-Net architecture can segment hippocampus with an average performance rate of 93.6%, which outperforms all other discussed state-of-arts. By using a filter size of [Formula: see text], the original U-Net architecture performs a sequence of convolutional processes. We tweaked the architecture further to extract more relevant features by replacing all [Formula: see text] kernels with three alternative kernels of sizes [Formula: see text], [Formula: see text], and [Formula: see text]. It is observed that, the modified architecture achieved an average performance rate of 96.5%, which outperforms the original U-Net model convincingly.
Collapse
Affiliation(s)
- Ruhul Amin Hazarika
- Department of Information Technology, North Eastern Hill University, Shillong, Meghalaya 793022 India
| | - Arnab Kumar Maji
- Department of Information Technology, North Eastern Hill University, Shillong, Meghalaya 793022 India
| | - Raplang Syiem
- Department of Information Technology, North Eastern Hill University, Shillong, Meghalaya 793022 India
| | - Samarendra Nath Sur
- Department of Electronics and Communication Engineering, Sikkim Manipal Institute of Technology, Sikkim Manipal University, Majitar, Sikkim 737136 India
| | - Debdatta Kandar
- Department of Information Technology, North Eastern Hill University, Shillong, Meghalaya 793022 India
| |
Collapse
|
34
|
Analysis of Vessel Segmentation Based on Various Enhancement Techniques for Improvement of Vessel Intensity Profile. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:7086632. [PMID: 35800676 PMCID: PMC9256369 DOI: 10.1155/2022/7086632] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/13/2022] [Revised: 05/31/2022] [Accepted: 06/07/2022] [Indexed: 11/27/2022]
Abstract
It is vital to develop an appropriate prediction model and link carefully to measurable events such as clinical parameters and patient outcomes to analyze the severity of the disease. Timely identifying retinal diseases is becoming more vital to prevent blindness among young and adults. Investigation of blood vessels delivers preliminary information on the existence and treatment of glaucoma, retinopathy, and so on. During the analysis of diabetic retinopathy, one of the essential steps is to extract the retinal blood vessel accurately. This study presents an improved Gabor filter through various enhancement approaches. The degraded images with the enhancement of certain features can simplify image interpretation both for a human observer and for machine recognition. Thus, in this work, few enhancement approaches such as Gamma corrected adaptively with distributed weight (GCADW), joint equalization of histogram (JEH), homomorphic filter, unsharp masking filter, adaptive unsharp masking filter, and particle swarm optimization (PSO) based unsharp masking filter are taken into consideration. In this paper, an effort has been made to improve the performance of the Gabor filter by combining it with different enhancement methods and to enhance the detection of blood vessels. The performance of all the suggested approaches is assessed on publicly available databases such as DRIVE and CHASE_DB1. The results of all the integrated enhanced techniques are analyzed, discussed, and compared. The best result is delivered by PSO unsharp masking filter combined with the Gabor filter with an accuracy of 0.9593 for the DRIVE database and 0.9685 for the CHASE_DB1 database. The results illustrate the robustness of the recommended model in automatic blood vessel segmentation that makes it possible to be a clinical support decision tool in diabetic retinopathy diagnosis.
Collapse
|
35
|
An Intelligent Fusion Model with Portfolio Selection and Machine Learning for Stock Market Prediction. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:7588303. [PMID: 35785077 PMCID: PMC9246624 DOI: 10.1155/2022/7588303] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/12/2022] [Revised: 05/17/2022] [Accepted: 05/26/2022] [Indexed: 11/18/2022]
Abstract
Developing reliable equity market models allows investors to make more informed decisions. A trading model can reduce the risks associated with investment and allow traders to choose the best-paying stocks. However, stock market analysis is complicated with batch processing techniques since stock prices are highly correlated. In recent years, advances in machine learning have given us a lot of chances to use forecasting theory and risk optimization together. The study postulates a unique two-stage framework. First, the mean-variance approach is utilized to select probable stocks (portfolio construction), thereby minimizing investment risk. Second, we present an online machine learning technique, a combination of “perceptron” and “passive-aggressive algorithm,” to predict future stock price movements for the upcoming period. We have calculated the classification reports, AUC score, accuracy, and Hamming loss for the proposed framework in the real-world datasets of 20 health sector indices for four different geographical reasons for the performance evaluation. Lastly, we conduct a numerical comparison of our method's outcomes to those generated via conventional solutions by previous studies. Our aftermath reveals that learning-based ensemble strategies with portfolio selection are effective in comparison.
Collapse
|
36
|
Dimitriadis I, Mavroudopoulos I, Kyrama S, Toliopoulos T, Gounaris A, Vakali A, Billis A, Bamidis P. Scalable real-time health data sensing and analysis enabling collaborative care delivery. SOCIAL NETWORK ANALYSIS AND MINING 2022. [DOI: 10.1007/s13278-022-00891-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
37
|
|
38
|
Zhu L, Yan T, Alimu G, Zhang L, Ma R, Alifu N, Zhang X, Wang D. Liposome-Loaded Targeted Theranostic Fluorescent Nano-Probes for Diagnosis and Treatment of Cervix Carcinoma. J Biomed Nanotechnol 2022. [DOI: 10.1166/jbn.2022.3332] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
Near-infrared fluorescence imaging, with its high sensitivity, non-invasiveness, and superior real-time feedback properties, has become a powerful skill for accurate diagnosis in the clinic. Nanoparticle-assisted chemotherapy is an effective cure for cancer. Specifically, the combination
of near-infrared fluorescence imaging with chemotherapy represents a promising method for precise diagnosis and treatment of cervical cancer. To realize this approach, it is necessary to design and synthesize therapeutic nano-probes with detection abilities. In this work, an organic NIRF emissive
heptamethine cyanine dye, IR783, was utilized and encapsulated in biocompatible drug-carrier liposomes). Then, the anticancer drug doxorubicin was loaded, to form LP-IR783-DOX nanoparticles. The LP-IR783-DOX nanoparticles had spherical shapes and were smoothly dispersed in aqueous solutions.
Favorable absorption (a peak of 800 nm) and fluorescence (a peak of 896 nm) features were obtained from LP-IR783-DOX nanoparticles in the near-infrared region. Moreover, the specific detection abilities of nanoparticles were confirmed in different cell lines, and nanoparticles exhibited strong
detection abilities in human cervix carcinoma cells in particular. To analyze the chemotherapeutic properties of LP-IR783-DOX nanoparticles, live HeLa cells were studied in detail, and the application of these NPs resulted in a chemotherapeutic efficiency of 56.75% based on fluorescein isothiocyanate
staining and flow cytometry. The results indicate that nanoparticles have great potential for theranostic application of fluorescence imaging and chemotherapy in cases of cervical cancer.
Collapse
Affiliation(s)
- Lijun Zhu
- State Key Laboratory of Pathogenesis, Prevention and Treatment of High Incidence Diseases in Central Asia, School of Medical Engineering and Technology, Xinjiang Medical University, Urumqi 830054, China
| | - Ting Yan
- State Key Laboratory of Pathogenesis, Prevention and Treatment of High Incidence Diseases in Central Asia, School of Medical Engineering and Technology, Xinjiang Medical University, Urumqi 830054, China
| | - Gulinigaer Alimu
- State Key Laboratory of Pathogenesis, Prevention and Treatment of High Incidence Diseases in Central Asia, School of Medical Engineering and Technology, Xinjiang Medical University, Urumqi 830054, China
| | - Linxue Zhang
- State Key Laboratory of Pathogenesis, Prevention and Treatment of High Incidence Diseases in Central Asia, School of Medical Engineering and Technology, Xinjiang Medical University, Urumqi 830054, China
| | - Rong Ma
- Department of Gynecology, The First Affiliated Hospital of Xinjiang Medical University, Urumqi 830054, China
| | - Nuernisha Alifu
- State Key Laboratory of Pathogenesis, Prevention and Treatment of High Incidence Diseases in Central Asia, School of Medical Engineering and Technology, Xinjiang Medical University, Urumqi 830054, China
| | - Xueliang Zhang
- State Key Laboratory of Pathogenesis, Prevention and Treatment of High Incidence Diseases in Central Asia, School of Medical Engineering and Technology, Xinjiang Medical University, Urumqi 830054, China
| | - Duoqiang Wang
- State Key Laboratory of Pathogenesis, Prevention and Treatment of High Incidence Diseases in Central Asia, School of Medical Engineering and Technology, Xinjiang Medical University, Urumqi 830054, China
| |
Collapse
|
39
|
Datta D, Mallick PK, Bhoi AK, Ijaz MF, Shafi J, Choi J. Hyperspectral Image Classification: Potentials, Challenges, and Future Directions. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:3854635. [PMID: 35528334 PMCID: PMC9071975 DOI: 10.1155/2022/3854635] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/04/2022] [Revised: 03/22/2022] [Accepted: 03/30/2022] [Indexed: 12/14/2022]
Abstract
Recent imaging science and technology discoveries have considered hyperspectral imagery and remote sensing. The current intelligent technologies, such as support vector machines, sparse representations, active learning, extreme learning machines, transfer learning, and deep learning, are typically based on the learning of the machines. These techniques enrich the processing of such three-dimensional, multiple bands, and high-resolution images with their precision and fidelity. This article presents an extensive survey depicting machine-dependent technologies' contributions and deep learning on landcover classification based on hyperspectral images. The objective of this study is three-fold. First, after reading a large pool of Web of Science (WoS), Scopus, SCI, and SCIE-indexed and SCIE-related articles, we provide a novel approach for review work that is entirely systematic and aids in the inspiration of finding research gaps and developing embedded questions. Second, we emphasize contemporary advances in machine learning (ML) methods for identifying hyperspectral images, with a brief, organized overview and a thorough assessment of the literature involved. Finally, we draw the conclusions to assist researchers in expanding their understanding of the relationship between machine learning and hyperspectral images for future research.
Collapse
Affiliation(s)
- Debaleena Datta
- School of Computer Engineering, Kalinga Institute of Industrial Technology, Deemed to be University, Bhubaneswar 751024, India
| | - Pradeep Kumar Mallick
- School of Computer Engineering, Kalinga Institute of Industrial Technology, Deemed to be University, Bhubaneswar 751024, India
| | - Akash Kumar Bhoi
- KIET Group of Institutions, Delhi-NCR, Ghaziabad-201206, India
- Directorate of Research, Sikkim Manipal University, Gangtok 737102, Sikkim, India
- AB-Tech eResearch (ABTeR), Sambalpur, Burla 768018, India
| | - Muhammad Fazal Ijaz
- Department of Intelligent Mechatronics Engineering, Sejong University, Seoul 05006, Republic of Korea
| | - Jana Shafi
- Department of Computer Science, College of Arts and Science, Prince Sattam Bin Abdul Aziz University, Wadi Ad-Dawasir 11991, Saudi Arabia
| | - Jaeyoung Choi
- School of Computing, Gachon University, Seongnam-si 13120, Republic of Korea
| |
Collapse
|
40
|
Lin LS, Hu SC, Lin YS, Li DC, Siao LR. A new approach to generating virtual samples to enhance classification accuracy with small data-a case of bladder cancer. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2022; 19:6204-6233. [PMID: 35603398 DOI: 10.3934/mbe.2022290] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
In the medical field, researchers are often unable to obtain the sufficient samples in a short period of time necessary to build a stable data-driven forecasting model used to classify a new disease. To address the problem of small data learning, many studies have demonstrated that generating virtual samples intended to augment the amount of training data is an effective approach, as it helps to improve forecasting models with small datasets. One of the most popular methods used in these studies is the mega-trend-diffusion (MTD) technique, which is widely used in various fields. The effectiveness of the MTD technique depends on the degree of data diffusion. However, data diffusion is seriously affected by extreme values. In addition, the MTD method only considers data fitted using a unimodal triangular membership function. However, in fact, data may come from multiple distributions in the real world. Therefore, considering the fact that data comes from multi-distributions, in this paper, a distance-based mega-trend-diffusion (DB-MTD) technique is proposed to appropriately estimate the degree of data diffusion with less impacts from extreme values. In the proposed method, it is assumed that the data is fitted by the triangular and trapezoidal membership functions to generate virtual samples. In addition, a possibility evaluation mechanism is proposed to measure the applicability of the virtual samples. In our experiment, two bladder cancer datasets are used to verify the effectiveness of the proposed DB-MTD method. The experimental results demonstrated that the proposed method outperforms other VSG techniques in classification and regression items for small bladder cancer datasets.
Collapse
Affiliation(s)
- Liang-Sian Lin
- Department of Information Management, National Taipei University of Nursing and Health Sciences, Ming-te Road, Taipei 112303, Taiwan
| | - Susan C Hu
- Department of Public Health, College of Medicine, National Cheng Kung University, University Road, Tainan 70101, Taiwan
| | - Yao-San Lin
- Singapore Centre for Chinese Language, Nanyang Technological University, Ghim Moh Road Singapore 279623, Singapore
| | - Der-Chiang Li
- Department of Industrial and Information Management, National Cheng Kung University, University Road, Tainan 70101, Taiwan
| | - Liang-Ren Siao
- Department of Industrial and Information Management, National Cheng Kung University, University Road, Tainan 70101, Taiwan
| |
Collapse
|
41
|
Fine-Tuned DenseNet-169 for Breast Cancer Metastasis Prediction Using FastAI and 1-Cycle Policy. SENSORS 2022; 22:s22082988. [PMID: 35458972 PMCID: PMC9025766 DOI: 10.3390/s22082988] [Citation(s) in RCA: 54] [Impact Index Per Article: 27.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/16/2022] [Revised: 04/09/2022] [Accepted: 04/12/2022] [Indexed: 12/02/2022]
Abstract
Lymph node metastasis in breast cancer may be accurately predicted using a DenseNet-169 model. However, the current system for identifying metastases in a lymph node is manual and tedious. A pathologist well-versed with the process of detection and characterization of lymph nodes goes through hours investigating histological slides. Furthermore, because of the massive size of most whole-slide images (WSI), it is wise to divide a slide into batches of small image patches and apply methods independently on each patch. The present work introduces a novel method for the automated diagnosis and detection of metastases from whole slide images using the Fast AI framework and the 1-cycle policy. Additionally, it compares this new approach to previous methods. The proposed model has surpassed other state-of-art methods with more than 97.4% accuracy. In addition, a mobile application is developed for prompt and quick response. It collects user information and models to diagnose metastases present in the early stages of cancer. These results indicate that the suggested model may assist general practitioners in accurately analyzing breast cancer situations, hence preventing future complications and mortality. With digital image processing, histopathologic interpretation and diagnostic accuracy have improved considerably.
Collapse
|
42
|
Kapoor S, Kumar T. Fusing traditionally extracted features with deep learned features from the speech spectrogram for anger and stress detection using convolution neural network. MULTIMEDIA TOOLS AND APPLICATIONS 2022; 81:31107-31128. [PMID: 35431609 PMCID: PMC8993039 DOI: 10.1007/s11042-022-12886-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/23/2021] [Revised: 01/24/2022] [Accepted: 03/10/2022] [Indexed: 06/14/2023]
Abstract
Stress and anger are two negative emotions that affect individuals both mentally and physically; there is a need to tackle them as soon as possible. Automated systems are highly required to monitor mental states and to detect early signs of emotional health issues. In the present work convolutional neural network is proposed for anger and stress detection using handcrafted features and deep learned features from the spectrogram. The objective of using a combined feature set is gathering information from two different representations of speech signals to obtain more prominent features and to boost the accuracy of recognition. The proposed method of emotion assessment is more computationally efficient than similar approaches used for emotion assessment. The preliminary results obtained on experimental evaluation of the proposed approach on three datasets Toronto Emotional Speech Set (TESS), Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS), and Berlin Emotional Database (EMO-DB) indicate that categorical accuracy is boosted and cross-entropy loss is reduced to a considerable extent. The proposed convolutional neural network (CNN) obtains training (T) and validation (V) categorical accuracy of T = 93.7%, V = 95.6% for TESS, T = 97.5%, V = 95.6% for EMO-DB and T = 96.7%, V = 96.7% for RAVDESS dataset.
Collapse
Affiliation(s)
- Shalini Kapoor
- Research Scholar, Dr. A.P.J Abdul Kalam Technical University, Lucknow, India
| | - Tarun Kumar
- Department of Computer Science & Engineering, Radha Govind Group of Institution, Meerut, India
| |
Collapse
|
43
|
Tang S, Yu X, Cheang CF, Hu Z, Fang T, Choi IC, Yu HH. Diagnosis of Esophageal Lesions by Multi-Classification and Segmentation Using an Improved Multi-Task Deep Learning Model. SENSORS (BASEL, SWITZERLAND) 2022; 22:s22041492. [PMID: 35214396 PMCID: PMC8876234 DOI: 10.3390/s22041492] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/16/2021] [Revised: 01/26/2022] [Accepted: 02/08/2022] [Indexed: 05/03/2023]
Abstract
It is challenging for endoscopists to accurately detect esophageal lesions during gastrointestinal endoscopic screening due to visual similarities among different lesions in terms of shape, size, and texture among patients. Additionally, endoscopists are busy fighting esophageal lesions every day, hence the need to develop a computer-aided diagnostic tool to classify and segment the lesions at endoscopic images to reduce their burden. Therefore, we propose a multi-task classification and segmentation (MTCS) model, including the Esophageal Lesions Classification Network (ELCNet) and Esophageal Lesions Segmentation Network (ELSNet). The ELCNet was used to classify types of esophageal lesions, and the ELSNet was used to identify lesion regions. We created a dataset by collecting 805 esophageal images from 255 patients and 198 images from 64 patients to train and evaluate the MTCS model. Compared with other methods, the proposed not only achieved a high accuracy (93.43%) in classification but achieved a dice similarity coefficient (77.84%) in segmentation. In conclusion, the MTCS model can boost the performance of endoscopists in the detection of esophageal lesions as it can accurately multi-classify and segment the lesions and is a potential assistant for endoscopists to reduce the risk of oversight.
Collapse
Affiliation(s)
- Suigu Tang
- Faculty of Information Technology, Macau University of Science and Technology, Macau 999078, China; (S.T.); (X.Y.); (Z.H.); (T.F.)
| | - Xiaoyuan Yu
- Faculty of Information Technology, Macau University of Science and Technology, Macau 999078, China; (S.T.); (X.Y.); (Z.H.); (T.F.)
| | - Chak-Fong Cheang
- Faculty of Information Technology, Macau University of Science and Technology, Macau 999078, China; (S.T.); (X.Y.); (Z.H.); (T.F.)
- Correspondence:
| | - Zeming Hu
- Faculty of Information Technology, Macau University of Science and Technology, Macau 999078, China; (S.T.); (X.Y.); (Z.H.); (T.F.)
| | - Tong Fang
- Faculty of Information Technology, Macau University of Science and Technology, Macau 999078, China; (S.T.); (X.Y.); (Z.H.); (T.F.)
| | - I-Cheong Choi
- Kiang Wu Hospital, Macau 999078, China; (I.-C.C.); (H.-H.Y.)
| | - Hon-Ho Yu
- Kiang Wu Hospital, Macau 999078, China; (I.-C.C.); (H.-H.Y.)
| |
Collapse
|
44
|
Abstract
Cervical cancer is one of the leading causes of premature mortality among women worldwide and more than 85% of these deaths are in developing countries. There are several risk factors associated with cervical cancer. In this paper, we developed a predictive model for predicting the outcome of patients with cervical cancer, given risk patterns from individual medical records and preliminary screening. This work presents a decision tree (DT) classification algorithm to analyze the risk factors of cervical cancer. Recursive feature elimination (RFE) and least absolute shrinkage and selection operator (LASSO) feature selection techniques were fully explored to determine the most important attributes for cervical cancer prediction. The dataset employed here contains missing values and is highly imbalanced. Therefore, a combination of under and oversampling techniques called SMOTETomek was employed. A comparative analysis of the proposed model has been performed to show the effectiveness of feature selection and class imbalance based on the classifier’s accuracy, sensitivity, and specificity. The DT with the selected features from RFE and SMOTETomek has better results with an accuracy of 98.72% and sensitivity of 100%. DT classifier is shown to have better performance in handling classification problems when the features are reduced, and the problem of high class imbalance is addressed.
Collapse
|
45
|
Agarwal P, Yadav A, Mathur P, Pal V, Chakrabarty A. BID-Net: An Automated System for Bone Invasion Detection Occurring at Stage T4 in Oral Squamous Carcinoma Using Deep Learning. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:4357088. [PMID: 35140773 PMCID: PMC8818426 DOI: 10.1155/2022/4357088] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/14/2021] [Revised: 11/26/2021] [Accepted: 01/03/2022] [Indexed: 11/22/2022]
Abstract
Detection of the presence and absence of bone invasion by the tumor in oral squamous cell carcinoma (OSCC) patients is very significant for their treatment planning and surgical resection. For bone invasion detection, CT scan imaging is the preferred choice of radiologists because of its high sensitivity and specificity. In the present work, deep learning algorithm based model, BID-Net, has been proposed for the automation of bone invasion detection. BID-Net performs the binary classification of CT scan images as the images with bone invasion and images without bone invasion. The proposed BID-Net model has achieved an outstanding accuracy of 93.62%. The model is also compared with six Transfer Learning models like VGG16, VGG19, ResNet-50, MobileNetV2, DenseNet-121, ResNet-101 and BID-Net outperformed over the other models. As there exists no previous studies on bone invasion detection using Deep Learning models, so the results of the proposed model have been validated from the experts of practitioner radiologists, S.M.S. hospital, Jaipur, India.
Collapse
Affiliation(s)
| | | | | | - Vipin Pal
- Department of Computer Science and Engineering, National Institute of Technology Meghalaya, Shillong, India
| | - Amitabha Chakrabarty
- Department of Computer Science and Engineering, Brac University, Dhaka, Bangladesh
| |
Collapse
|
46
|
Stochastic Analysis of Nonlinear Cancer Disease Model through Virotherapy and Computational Methods. MATHEMATICS 2022. [DOI: 10.3390/math10030368] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Cancer is a common term for many diseases that can affect anybody. A worldwide leading cause of death is cancer, according to the World Health Organization (WHO) report. In 2020, ten million people died from cancer. This model identifies the interaction of cancer cells, viral therapy, and immune response. In this model, the cell population has four parts, namely uninfected cells (x), infected cells (y), virus-free cells (v), and immune cells (z). This study presents the analysis of the stochastic cancer virotherapy model in the cell population dynamics. The model results have restored the properties of the biological problem, such as dynamical consistency, positivity, and boundedness, which are the considerable requirements of the models in these fields. The existing computational methods, such as the Euler Maruyama, Stochastic Euler, and Stochastic Runge Kutta, fail to restore the abovementioned properties. The proposed stochastic nonstandard finite difference method is efficient, cost-effective, and accommodates all the desired feasible properties. The existing standard stochastic methods converge conditionally or diverge in the long run. The solution by the nonstandard finite difference method is stable and convergent over all time steps.
Collapse
|
47
|
Guidance Image-Based Enhanced Matched Filter with Modified Thresholding for Blood Vessel Extraction. Symmetry (Basel) 2022. [DOI: 10.3390/sym14020194] [Citation(s) in RCA: 23] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
Fundus images have been established as an important factor in analyzing and recognizing many cardiovascular and ophthalmological diseases. Consequently, precise segmentation of blood using computer vision is vital in the recognition of ailments. Although clinicians have adopted computer-aided diagnostics (CAD) in day-to-day diagnosis, it is still quite difficult to conduct fully automated analysis based exclusively on information contained in fundus images. In fundus image applications, one of the methods for conducting an automatic analysis is to ascertain symmetry/asymmetry details from corresponding areas of the retina and investigate their association with positive clinical findings. In the field of diabetic retinopathy, matched filters have been shown to be an established technique for vessel extraction. However, there is reduced efficiency in matched filters due to noisy images. In this work, a joint model of a fast guided filter and a matched filter is suggested for enhancing abnormal retinal images containing low vessel contrasts. Extracting all information from an image correctly is one of the important factors in the process of image enhancement. A guided filter has an excellent property in edge-preserving, but still tends to suffer from halo artifacts near the edges. Fast guided filtering is a technique that subsamples the filtering input image and the guidance image and calculates the local linear coefficients for upsampling. In short, the proposed technique applies a fast guided filter and a matched filter for attaining improved performance measures for vessel extraction. The recommended technique was assessed on DRIVE and CHASE_DB1 datasets and achieved accuracies of 0.9613 and 0.960, respectively, both of which are higher than the accuracy of the original matched filter and other suggested vessel segmentation algorithms.
Collapse
|
48
|
Kumar Y, Koul A, Singla R, Ijaz MF. Artificial intelligence in disease diagnosis: a systematic literature review, synthesizing framework and future research agenda. JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING 2022; 14:8459-8486. [PMID: 35039756 PMCID: PMC8754556 DOI: 10.1007/s12652-021-03612-z] [Citation(s) in RCA: 143] [Impact Index Per Article: 71.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/04/2020] [Accepted: 11/18/2021] [Indexed: 05/03/2023]
Abstract
Artificial intelligence can assist providers in a variety of patient care and intelligent health systems. Artificial intelligence techniques ranging from machine learning to deep learning are prevalent in healthcare for disease diagnosis, drug discovery, and patient risk identification. Numerous medical data sources are required to perfectly diagnose diseases using artificial intelligence techniques, such as ultrasound, magnetic resonance imaging, mammography, genomics, computed tomography scan, etc. Furthermore, artificial intelligence primarily enhanced the infirmary experience and sped up preparing patients to continue their rehabilitation at home. This article covers the comprehensive survey based on artificial intelligence techniques to diagnose numerous diseases such as Alzheimer, cancer, diabetes, chronic heart disease, tuberculosis, stroke and cerebrovascular, hypertension, skin, and liver disease. We conducted an extensive survey including the used medical imaging dataset and their feature extraction and classification process for predictions. Preferred reporting items for systematic reviews and Meta-Analysis guidelines are used to select the articles published up to October 2020 on the Web of Science, Scopus, Google Scholar, PubMed, Excerpta Medical Database, and Psychology Information for early prediction of distinct kinds of diseases using artificial intelligence-based techniques. Based on the study of different articles on disease diagnosis, the results are also compared using various quality parameters such as prediction rate, accuracy, sensitivity, specificity, the area under curve precision, recall, and F1-score.
Collapse
Affiliation(s)
- Yogesh Kumar
- Department of Computer Engineering, Indus Institute of Technology and Engineering, Indus University, Ahmedabad, 382115 India
| | | | - Ruchi Singla
- Department of Research, Innovations, Sponsored Projects and Entrepreneurship, CGC Landran, Mohali, India
| | - Muhammad Fazal Ijaz
- Department of Intelligent Mechatronics Engineering, Sejong University, Seoul, 05006 South Korea
| |
Collapse
|
49
|
Tian JX, Zhang J. Breast cancer diagnosis using feature extraction and boosted C5.0 decision tree algorithm with penalty factor. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2022; 19:2193-2205. [PMID: 35240781 DOI: 10.3934/mbe.2022102] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
To overcome the two class imbalance problem among breast cancer diagnosis, a hybrid method by combining principal component analysis (PCA) and boosted C5.0 decision tree algorithm with penalty factor is proposed to address this issue. PCA is used to reduce the dimension of feature subset. The boosted C5.0 decision tree algorithm is utilized as an ensemble classifier for classification. Penalty factor is used to optimize the classification result. To demonstrate the efficiency of the proposed method, it is implemented on biased-representative breast cancer datasets from the University of California Irvine(UCI) machine learning repository. Given the experimental results and further analysis, our proposal is a promising method for breast cancer and can be used as an alternative method in class imbalance learning. Indeed, we observe that the feature extraction process has helped us improve diagnostic accuracy. We also demonstrate that the extracted features considering breast cancer issues are essential to high diagnostic accuracy.
Collapse
Affiliation(s)
- Jian-Xue Tian
- School of Information Engineer, Yulin University, Road chongwen, Yulin 719000, China
| | - Jue Zhang
- School of Information Engineer, Yulin University, Road chongwen, Yulin 719000, China
| |
Collapse
|
50
|
Zhang F, Li Q. Constructing ontologies by mining deep semantics from XML Schemas and XML instance documents. INT J INTELL SYST 2022. [DOI: 10.1002/int.22643] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Affiliation(s)
- Fu Zhang
- School of Computer Science & Engineering Northeastern University Shenyang China
| | - Qiang Li
- School of Computer Science & Engineering Northeastern University Shenyang China
| |
Collapse
|