1
|
Li F, Wang B, Li H, Kong L, Zhu B. G6PD and machine learning algorithms as prognostic and diagnostic indicators of liver hepatocellular carcinoma. BMC Cancer 2024; 24:157. [PMID: 38297250 PMCID: PMC10829225 DOI: 10.1186/s12885-024-11887-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Accepted: 01/16/2024] [Indexed: 02/02/2024] Open
Abstract
BACKGROUND Liver Hepatocellular carcinoma (LIHC) exhibits a high incidence of liver cancer with escalating mortality rates over time. Despite this, the underlying pathogenic mechanism of LIHC remains poorly understood. MATERIALS & METHODS To address this gap, we conducted a comprehensive investigation into the role of G6PD in LIHC using a combination of bioinformatics analysis with database data and rigorous cell experiments. LIHC samples were obtained from TCGA, ICGC and GEO databases, and the differences in G6PD expression in different tissues were investigated by differential expression analysis, followed by the establishment of Nomogram to determine the percentage of G6PD in causing LIHC by examining the relationship between G6PD and clinical features, and the subsequent validation of the effect of G6PD on the activity, migration, and invasive ability of hepatocellular carcinoma cells by using the low expression of LI-7 and SNU-449. Additionally, we employed machine learning to validate and compare the predictive capacity of four algorithms for LIHC patient prognosis. RESULTS Our findings revealed significantly elevated G6PD expression levels in liver cancer tissues as compared to normal tissues. Meanwhile, Nomogram and Adaboost, Catboost, and Gbdt Regression analyses showed that G6PD accounted for 46%, 31%, and 49% of the multiple factors leading to LIHC. Furthermore, we observed that G6PD knockdown in hepatocellular carcinoma cells led to reduced proliferation, migration, and invasion abilities. Remarkably, the Decision Tree C5.0 decision tree algorithm demonstrated superior discriminatory performance among the machine learning methods assessed. CONCLUSION The potential diagnostic utility of G6PD and Decision Tree C5.0 for LIHC opens up a novel avenue for early detection and improved treatment strategies for hepatocellular carcinoma.
Collapse
Affiliation(s)
- Fei Li
- Key Laboratory of Environmental Medicine Engineering of Ministry of Education, School of Public Health, Southeast University, 87 Dingjiaqiao, Nanjing, 210009, Jiangsu, China
| | - Boshen Wang
- Key Laboratory of Environmental Medicine Engineering of Ministry of Education, School of Public Health, Southeast University, 87 Dingjiaqiao, Nanjing, 210009, Jiangsu, China
- Institute of Occupational Disease Prevention, Jiangsu Provincial Center for Disease Prevention and Control, Nanjing, Jiangsu, 210009, China
| | - Hao Li
- Key Laboratory of Environmental Medicine Engineering of Ministry of Education, School of Public Health, Southeast University, 87 Dingjiaqiao, Nanjing, 210009, Jiangsu, China
| | - Lu Kong
- Key Laboratory of Environmental Medicine Engineering of Ministry of Education, School of Public Health, Southeast University, 87 Dingjiaqiao, Nanjing, 210009, Jiangsu, China.
| | - Baoli Zhu
- Institute of Occupational Disease Prevention, Jiangsu Provincial Center for Disease Prevention and Control, Nanjing, Jiangsu, 210009, China.
- Jiangsu Preventive Medical Association, Nanjing, 210000, Jiangsu, China.
- Center for Global Health, Nanjing Medical University, Nanjing, 211112, China.
- Jiangsu Province Engineering Research Center of Public Health Emergency, Nanjing, 210000, Jiangsu, China.
| |
Collapse
|
2
|
Vanmathi P, Jose D. An ensemble-based serial cascaded attention network and improved variational auto encoder for breast cancer prognosis prediction using data. Comput Methods Biomech Biomed Engin 2024; 27:98-115. [PMID: 38006210 DOI: 10.1080/10255842.2023.2280883] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2023] [Accepted: 11/02/2023] [Indexed: 11/26/2023]
Abstract
Breast cancer is one of the most common types of cancer in women and it produces a huge amount of death rate in the world. Early recognition is lessening its impact. The early recognition of breast cancer could convince patients to receive surgical therapy, which will significantly improve the chance of restoration. This information is used by the machine learning technique to find links between them and appraise our forecasts of fresh occurrences. Later recognition of breast cancer can lead to death. An accurate prescient framework for breast cancer prediction is urgently needed in the current era. In order to accomplish the objective, an adaptive ensemble model is proposed for breast cancer prognosis prediction using data. At the initial stage, the raw data are fetched from benchmark datasets. It is then followed by data cleaning and preprocessing. Subsequently, the pre-processed data is fed into the Improved Variational Autoencoder (IVAE), where the deep features are extracted. Finally, the resultant features are given as input to the Ensemble-based Serial Cascaded Attention Network (ESCANet), which is built with Deep Temporal Convolution Network (DTCN), Bi-directional Long Short-Term Memory (BiLSTM), and Recurrent Neural Network (RNN). The effectiveness of the model is validated and compared with conventional methodologies. Therefore, the results elucidate that the proposed methodology achieves extensive results; thus, it increases the system's efficiency.
Collapse
Affiliation(s)
- P Vanmathi
- Full time Research Scholar, Department of ECE, KCG College of Technology, Karapakkam, Chennai, Tamil Nadu, India
| | - Deepa Jose
- Professor, Department of ECE, KCG College of Technology, Karapakkam, Chennai, Tamil Nadu, India
| |
Collapse
|
3
|
Petrillo A, Fusco R, Barretta ML, Granata V, Mattace Raso M, Porto A, Sorgente E, Fanizzi A, Massafra R, Lafranceschina M, La Forgia D, Trombadori CML, Belli P, Trecate G, Tenconi C, De Santis MC, Greco L, Ferranti FR, De Soccio V, Vidiri A, Botta F, Dominelli V, Cassano E, Boldrini L. Radiomics and artificial intelligence analysis by T2-weighted imaging and dynamic contrast-enhanced magnetic resonance imaging to predict Breast Cancer Histological Outcome. LA RADIOLOGIA MEDICA 2023; 128:1347-1371. [PMID: 37801198 DOI: 10.1007/s11547-023-01718-2] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/06/2023] [Accepted: 09/01/2023] [Indexed: 10/07/2023]
Abstract
OBJECTIVE The objective of the study was to evaluate the accuracy of radiomics features obtained by MR images to predict Breast Cancer Histological Outcome. METHODS A total of 217 patients with malignant lesions were analysed underwent MRI examinations. Considering histological findings as the ground truth, four different types of findings were used in both univariate and multivariate analyses: (1) G1 + G2 vs G3 classification; (2) presence of human epidermal growth factor receptor 2 (HER2 + vs HER2 -); (3) presence of the hormone receptor (HR + vs HR -); and (4) presence of luminal subtypes of breast cancer. RESULTS The best accuracy for discriminating HER2 + versus HER2 - breast cancers was obtained considering nine predictors by early phase T1-weighted subtraction images and a decision tree (accuracy of 88% on validation set). The best accuracy for discriminating HR + versus HR - breast cancers was obtained considering nine predictors by T2-weighted subtraction images and a decision tree (accuracy of 90% on validation set). The best accuracy for discriminating G1 + G2 versus G3 breast cancers was obtained considering 16 predictors by early phase T1-weighted subtraction images in a linear regression model with an accuracy of 75%. The best accuracy for discriminating luminal versus non-luminal breast cancers was obtained considering 27 predictors by early phase T1-weighted subtraction images and a decision tree (accuracy of 94% on validation set). CONCLUSIONS The combination of radiomics analysis and artificial intelligence techniques could be used to support physician decision-making in prediction of Breast Cancer Histological Outcome.
Collapse
Affiliation(s)
- Antonella Petrillo
- Radiology Division, Istituto Nazionale Tumori-IRCCS-Fondazione G. Pascale, 80131, Naples, Italy.
| | - Roberta Fusco
- Medical Oncology Division, Igea SpA, 80013, Naples, Italy
| | - Maria Luisa Barretta
- Radiology Division, Istituto Nazionale Tumori-IRCCS-Fondazione G. Pascale, 80131, Naples, Italy
| | - Vincenza Granata
- Radiology Division, Istituto Nazionale Tumori-IRCCS-Fondazione G. Pascale, 80131, Naples, Italy
| | - Mauro Mattace Raso
- Radiology Division, Istituto Nazionale Tumori-IRCCS-Fondazione G. Pascale, 80131, Naples, Italy
| | - Annamaria Porto
- Radiology Division, Istituto Nazionale Tumori-IRCCS-Fondazione G. Pascale, 80131, Naples, Italy
| | - Eugenio Sorgente
- Radiology Division, Istituto Nazionale Tumori-IRCCS-Fondazione G. Pascale, 80131, Naples, Italy
| | - Annarita Fanizzi
- Direzione Scientifica-IRCCS, Istituto Tumori Giovanni Paolo II-Via Orazio Flacco 65, 70124, Bari, Italy
| | - Raffaella Massafra
- SSD Fisica Sanitaria-IRCCS Istituto Tumori Giovanni Paolo II-Via Orazio Flacco 65, 70124, Bari, Italy
| | - Miria Lafranceschina
- Struttura Semplice Dipartimentale di Radiodiagnostica Senologica-IRCCS Istituto Tumori Giovanni Paolo II-Via Orazio Flacco 65, 70124, Bari, Italy
| | - Daniele La Forgia
- Struttura Semplice Dipartimentale di Radiodiagnostica Senologica-IRCCS Istituto Tumori Giovanni Paolo II-Via Orazio Flacco 65, 70124, Bari, Italy
| | | | - Paolo Belli
- Dipartimento di Diagnostica per Immagini, Radioterapia Oncologica ed Ematologia, Fondazione Policlinico Universitario A. Gemelli IRCCS, 00168, Rome, Italy
| | - Giovanna Trecate
- Department of Radiodiagnostic and Magnetic Resonance, Fondazione IRCCS Istituto Nazionale dei Tumori, 20133, Milan, Italy
| | - Chiara Tenconi
- Department of Medical Physics, Fondazione IRCCS Istituto Nazionale dei Tumori, 20133, Milan, Italy
| | - Maria Carmen De Santis
- De Santis Radiation Oncology, Fondazione IRCCS Istituto Nazionale dei Tumori, 20133, Milan, Italy
| | - Laura Greco
- Radiology and Diagnostic Imaging, Istituto di Ricovero E Cura a Carattere Scientifico (IRCCS) Regina Elena National Cancer Institute, Rome, Italy
| | - Francesca Romana Ferranti
- Radiology and Diagnostic Imaging, Istituto di Ricovero E Cura a Carattere Scientifico (IRCCS) Regina Elena National Cancer Institute, Rome, Italy
| | - Valeria De Soccio
- Radiology and Diagnostic Imaging, Istituto di Ricovero E Cura a Carattere Scientifico (IRCCS) Regina Elena National Cancer Institute, Rome, Italy
| | - Antonello Vidiri
- Radiology and Diagnostic Imaging, Istituto di Ricovero E Cura a Carattere Scientifico (IRCCS) Regina Elena National Cancer Institute, Rome, Italy
| | - Francesca Botta
- Breast Imaging Division, IEO Istituto Europeo di Oncologia, 20141, Milan, Italy
| | - Valeria Dominelli
- Breast Imaging Division, IEO Istituto Europeo di Oncologia, 20141, Milan, Italy
| | - Enrico Cassano
- Breast Imaging Division, IEO Istituto Europeo di Oncologia, 20141, Milan, Italy
| | - Luca Boldrini
- Dipartimento di Diagnostica per Immagini, Radioterapia Oncologica ed Ematologia, Fondazione Policlinico Universitario A. Gemelli IRCCS, 00168, Rome, Italy
| |
Collapse
|
4
|
Sun K, Zhu H, Chai W, Yan F. TP53 Mutation Estimation Based on MRI Radiomics Analysis for Breast Cancer. J Magn Reson Imaging 2023; 57:1095-1103. [PMID: 35771720 DOI: 10.1002/jmri.28323] [Citation(s) in RCA: 12] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2022] [Revised: 06/11/2022] [Accepted: 06/16/2022] [Indexed: 11/09/2022] Open
Abstract
BACKGROUND Noninvasive detection of TP53 mutations is useful for the molecular stratification of breast cancer. PURPOSE To explore MRI radiomics features reflecting TP53 mutations in breast cancer and propose a classifier for detecting such mutations. STUDY TYPE Retrospective. POPULATION/SUBJECTS A total of 139 breast cancer patients with TP53 expression profiling (98 with TP53 mutations and 41 without TP53 mutations). FIELD STRENGTH/SEQUENCE 1.5 T, T1-weighted (T1W) DCE-MRI. ASSESSMENT Lesions were manually segmented using subtracted T1WI. A total of 944 radiomics features (including 744 wavelet-related features) and 7 clinicopathological features were extracted from each lesion. Principal component analysis and Pearson's correlation analysis were used to preprocess the features. Linear discriminant analysis, logistic regression (LR), support vector machine (SVM), and random forest (RF) were used as the classifiers. STATISTICAL TESTS Analysis of variance, Kruskal-Wallis and recursive features elimination were used to select features. Receiver operating characteristic (ROC) analysis was performed to compare the diagnostic accuracy. RESULTS For the radiomics model, the validation cohorts AUCs of the four classifiers ranged from 0.69 (RF) to 0.74 (LR), and LR (0.74) attained the highest AUCs. For the clinicopathological-radiomics combined model, the validation AUCs of the four classifiers ranged from 0.68 (RF) to 0.86 (SVM), and SVM (0.86) attained highest AUCs. In the subgroup analysis of triple-negative (TN) and luminal type breast cancer, RF achieved the highest AUCs (0.83 and 0.94). DATA CONCLUSION Clinicopathological-radiomics combined model with SVM could be used as noninvasive biomarkers for predicting TP53 mutations. RF was recommended for the detection of TP53 mutations in TN and luminal type breast cancer. LEVEL OF EVIDENCE 3 TECHNICAL EFFICACY: Stage 2.
Collapse
Affiliation(s)
- Kun Sun
- Department of Radiology, Ruijin Hospital, Shanghai Jiaotong University School of Medicine, Shanghai, China
| | - Hong Zhu
- Department of Radiology, Ruijin Hospital, Shanghai Jiaotong University School of Medicine, Shanghai, China
| | - Weimin Chai
- Department of Radiology, Ruijin Hospital, Shanghai Jiaotong University School of Medicine, Shanghai, China
| | - Fuhua Yan
- Department of Radiology, Ruijin Hospital, Shanghai Jiaotong University School of Medicine, Shanghai, China
| |
Collapse
|
5
|
Nilashi M, Abumalloh RA, Alyami S, Alghamdi A, Alrizq M. Parkinson’s Disease Diagnosis Using Laplacian Score, Gaussian Process Regression and Self-Organizing Maps. Brain Sci 2023; 13:brainsci13040543. [PMID: 37190508 DOI: 10.3390/brainsci13040543] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2023] [Revised: 03/10/2023] [Accepted: 03/18/2023] [Indexed: 03/29/2023] Open
Abstract
Parkinson’s disease (PD) is a complex degenerative brain disease that affects nerve cells in the brain responsible for body movement. Machine learning is widely used to track the progression of PD in its early stages by predicting unified Parkinson’s disease rating scale (UPDRS) scores. In this paper, we aim to develop a new method for PD diagnosis with the aid of supervised and unsupervised learning techniques. Our method is developed using the Laplacian score, Gaussian process regression (GPR) and self-organizing maps (SOM). SOM is used to segment the data to handle large PD datasets. The models are then constructed using GPR for the prediction of the UPDRS scores. To select the important features in the PD dataset, we use the Laplacian score in the method. We evaluate the developed approach on a PD dataset including a set of speech signals. The method was evaluated through root-mean-square error (RMSE) and adjusted R-squared (adjusted R²). Our findings reveal that the proposed method is efficient in the prediction of UPDRS scores through a set of speech signals (dysphonia measures). The method evaluation showed that SOM combined with the Laplacian score and Gaussian process regression with the exponential kernel provides the best results for R-squared (Motor-UPDRS = 0.9489; Total-UPDRS = 0.9516) and RMSE (Motor-UPDRS = 0.5144; Total-UPDRS = 0.5105) in predicting UPDRS compared with the other kernels in Gaussian process regression.
Collapse
|
6
|
Ghaffar Nia N, Kaplanoglu E, Nasab A. Evaluation of artificial intelligence techniques in disease diagnosis and prediction. DISCOVER ARTIFICIAL INTELLIGENCE 2023. [PMCID: PMC9885935 DOI: 10.1007/s44163-023-00049-5] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
AbstractA broad range of medical diagnoses is based on analyzing disease images obtained through high-tech digital devices. The application of artificial intelligence (AI) in the assessment of medical images has led to accurate evaluations being performed automatically, which in turn has reduced the workload of physicians, decreased errors and times in diagnosis, and improved performance in the prediction and detection of various diseases. AI techniques based on medical image processing are an essential area of research that uses advanced computer algorithms for prediction, diagnosis, and treatment planning, leading to a remarkable impact on decision-making procedures. Machine Learning (ML) and Deep Learning (DL) as advanced AI techniques are two main subfields applied in the healthcare system to diagnose diseases, discover medication, and identify patient risk factors. The advancement of electronic medical records and big data technologies in recent years has accompanied the success of ML and DL algorithms. ML includes neural networks and fuzzy logic algorithms with various applications in automating forecasting and diagnosis processes. DL algorithm is an ML technique that does not rely on expert feature extraction, unlike classical neural network algorithms. DL algorithms with high-performance calculations give promising results in medical image analysis, such as fusion, segmentation, recording, and classification. Support Vector Machine (SVM) as an ML method and Convolutional Neural Network (CNN) as a DL method is usually the most widely used techniques for analyzing and diagnosing diseases. This review study aims to cover recent AI techniques in diagnosing and predicting numerous diseases such as cancers, heart, lung, skin, genetic, and neural disorders, which perform more precisely compared to specialists without human error. Also, AI's existing challenges and limitations in the medical area are discussed and highlighted.
Collapse
Affiliation(s)
- Nafiseh Ghaffar Nia
- College of Engineering and Computer Science, The University of Tennessee at Chattanooga, Chattanooga, TN 37403 USA
| | - Erkan Kaplanoglu
- College of Engineering and Computer Science, The University of Tennessee at Chattanooga, Chattanooga, TN 37403 USA
| | - Ahad Nasab
- College of Engineering and Computer Science, The University of Tennessee at Chattanooga, Chattanooga, TN 37403 USA
| |
Collapse
|
7
|
Breast Cancer Prediction Using Fine Needle Aspiration Features and Upsampling with Supervised Machine Learning. Cancers (Basel) 2023; 15:cancers15030681. [PMID: 36765642 PMCID: PMC9913345 DOI: 10.3390/cancers15030681] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2022] [Revised: 01/13/2023] [Accepted: 01/17/2023] [Indexed: 01/24/2023] Open
Abstract
Breast cancer is one of the most common invasive cancers in women and it continues to be a worldwide medical problem since the number of cases has significantly increased over the past decade. Breast cancer is the second leading cause of death from cancer in women. The early detection of breast cancer can save human life but the traditional approach for detecting breast cancer disease needs various laboratory tests involving medical experts. To reduce human error and speed up breast cancer detection, an automatic system is required that would perform the diagnosis accurately and timely. Despite the research efforts for automated systems for cancer detection, a wide gap exists between the desired and provided accuracy of current approaches. To overcome this issue, this research proposes an approach for breast cancer prediction by selecting the best fine needle aspiration features. To enhance the prediction accuracy, several feature selection techniques are applied to analyze their efficacy, such as principal component analysis, singular vector decomposition, and chi-square (Chi2). Extensive experiments are performed with different features and different set sizes of features to investigate the optimal feature set. Additionally, the influence of imbalanced and balanced data using the SMOTE approach is investigated. Six classifiers including random forest, support vector machine, gradient boosting machine, logistic regression, multilayer perceptron, and K-nearest neighbors (KNN) are tuned to achieve increased classification accuracy. Results indicate that KNN outperforms all other classifiers on the used dataset with 20 features using SVD and with the 15 most important features using a PCA with a 100% accuracy score.
Collapse
|
8
|
Exploring different computational approaches for effective diagnosis of breast cancer. PROGRESS IN BIOPHYSICS AND MOLECULAR BIOLOGY 2023; 177:141-150. [PMID: 36509230 DOI: 10.1016/j.pbiomolbio.2022.11.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/25/2022] [Revised: 10/28/2022] [Accepted: 11/10/2022] [Indexed: 12/13/2022]
Abstract
Breast cancer has been identified as one among the top causes of female death worldwide. According to recent research, earlier detection plays an important role toward fortunate medicaments and thus, decreasing the mortality rate due to breast cancer among females. This review provides a fleeting summary involving traditional diagnostic procedures from the past and today, and also modern computational tools that have greatly aided in the identification of breast cancer. Computational techniques involving different algorithms such as Support vector machines, deep learning techniques and robotics are popular among the academicians for detection of breast cancer. They discovered that Convolutional neural network was a common option for categorization among such approaches. Deep learning techniques are evaluated using performance indicators such as accuracy, sensitivity, specificity, or measure. Furthermore, molecular docking, homology modeling and Molecular dynamics Simulation gives a road map for future discussions about developing improved early detection approaches that holds greater potential in increasing the survival rate of cancer patients. The different computational techniques can be a new dominion among researchers and combating the challenges associated with breast cancer.
Collapse
|
9
|
Thawkar S. Feature selection and classification in mammography using hybrid crow search algorithm with Harris hawks optimization. Biocybern Biomed Eng 2022. [DOI: 10.1016/j.bbe.2022.09.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
|
10
|
Prediction of Breast Cancer Histological Outcome by Radiomics and Artificial Intelligence Analysis in Contrast-Enhanced Mammography. Cancers (Basel) 2022; 14:cancers14092132. [PMID: 35565261 PMCID: PMC9102628 DOI: 10.3390/cancers14092132] [Citation(s) in RCA: 23] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2022] [Revised: 03/30/2022] [Accepted: 04/21/2022] [Indexed: 02/06/2023] Open
Abstract
Simple Summary The assessment of breast lesions through mammographic images is currently challenging, especially in dense breasts. Contrast-enhanced mammography has been shown to overcome the limitations of standard mammography but it greatly depends on the interpretative skills of the physician. The aim of this study was to evaluate the potentialities of statistical and artificial intelligence algorithms as a tool for helping the radiologists in the interpretation of images. The most remarkable results were achieved in discriminating benign from malignant lesions and in the identification of the presence of the hormone receptor. A tool to support the physician’s decision-making process may be designed starting from simple logistic regression and tree-based algorithms. This type of tool may help the radiologist in assessing the investigated breast and in choosing the appropriate follow-up without resorting to histology. Abstract Purpose: To evaluate radiomics features in order to: differentiate malignant versus benign lesions; predict low versus moderate and high grading; identify positive or negative hormone receptors; and discriminate positive versus negative human epidermal growth factor receptor 2 related to breast cancer. Methods: A total of 182 patients with known breast lesions and that underwent Contrast-Enhanced Mammography were enrolled in this retrospective study. The reference standard was pathology (118 malignant lesions and 64 benign lesions). A total of 837 textural metrics were extracted by manually segmenting the region of interest from both craniocaudally (CC) and mediolateral oblique (MLO) views. Non-parametric Wilcoxon–Mann–Whitney test, receiver operating characteristic, logistic regression and tree-based machine learning algorithms were used. The Adaptive Synthetic Sampling balancing approach was used and a feature selection process was implemented. Results: In univariate analysis, the classification of malignant versus benign lesions achieved the best performance when considering the original_gldm_DependenceNonUniformity feature extracted on CC view (accuracy of 88.98%). An accuracy of 83.65% was reached in the classification of grading, whereas a slightly lower value of accuracy (81.65%) was found in the classification of the presence of the hormone receptor; the features extracted were the original_glrlm_RunEntropy and the original_gldm_DependenceNonUniformity, respectively. The results of multivariate analysis achieved the best performances when using two or more features as predictors for classifying malignant versus benign lesions from CC view images (max test accuracy of 95.83% with a non-regularized logistic regression). Considering the features extracted from MLO view images, the best test accuracy (91.67%) was obtained when predicting the grading using a classification-tree algorithm. Combinations of only two features, extracted from both CC and MLO views, always showed test accuracy values greater than or equal to 90.00%, with the only exception being the prediction of the human epidermal growth factor receptor 2, where the best performance (test accuracy of 89.29%) was obtained with the random forest algorithm. Conclusions: The results confirm that the identification of malignant breast lesions and the differentiation of histological outcomes and some molecular subtypes of tumors (mainly positive hormone receptor tumors) can be obtained with satisfactory accuracy through both univariate and multivariate analysis of textural features extracted from Contrast-Enhanced Mammography images.
Collapse
|
11
|
Monirujjaman Khan M, Islam S, Sarkar S, Ayaz FI, Kabir MM, Tazin T, Albraikan AA, Almalki FA. Machine Learning Based Comparative Analysis for Breast Cancer Prediction. JOURNAL OF HEALTHCARE ENGINEERING 2022; 2022:4365855. [PMID: 35449836 PMCID: PMC9017446 DOI: 10.1155/2022/4365855] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/27/2021] [Revised: 03/02/2022] [Accepted: 03/25/2022] [Indexed: 11/17/2022]
Abstract
One of the most prevalent and leading causes of cancer in women is breast cancer. It has now become a frequent health problem, and its prevalence has recently increased. The easiest approach to dealing with breast cancer findings is to recognize them early on. Early detection of breast cancer is facilitated by computer-aided detection and diagnosis (CAD) technologies, which can help people live longer lives. The major goal of this work is to take advantage of recent developments in CAD systems and related methodologies. In 2011, the United States reported that one out of every eight women was diagnosed with cancer. Breast cancer originates as a result of aberrant cell division in the breast, which leads to either benign or malignant cancer formation. As a result, early detection of breast cancer is critical, and with effective treatment, many lives can be saved. This research covers the findings and analyses of multiple machine learning models for identifying breast cancer. The Wisconsin Breast Cancer Diagnostic (WBCD) dataset was used to develop the method. Despite its small size, the dataset provides some interesting data. The information was analyzed and put to use in a number of machine learning models. For prediction, random forest, logistic regression, decision tree, and K-nearest neighbor were utilized. When the results are compared, the logistic regression model is found to offer the best results. Logistic regression achieves 98% accuracy, which is better than the previous method reported.
Collapse
Affiliation(s)
- Mohammad Monirujjaman Khan
- Department of Electrical and Computer Engineering, North South University, Bashundhara, Dhaka-1229, Bangladesh
| | - Somayea Islam
- Department of Electrical and Computer Engineering, North South University, Bashundhara, Dhaka-1229, Bangladesh
| | - Srobani Sarkar
- Department of Electrical and Computer Engineering, North South University, Bashundhara, Dhaka-1229, Bangladesh
| | - Fozayel Ibn Ayaz
- Department of Electrical and Computer Engineering, North South University, Bashundhara, Dhaka-1229, Bangladesh
| | - Md. Mursalin Kabir
- Department of Electrical and Computer Engineering, North South University, Bashundhara, Dhaka-1229, Bangladesh
| | - Tahia Tazin
- Department of Electrical and Computer Engineering, North South University, Bashundhara, Dhaka-1229, Bangladesh
| | - Amani Abdulrahman Albraikan
- Department of Computer Science, College of Computer and Information Sciences, Princess Nourah Bin Abdulrahman University, P.O. Box 84428, Riyadh 11671, Saudi Arabia
| | - Faris A. Almalki
- Department of Computer Engineering, College of Computers and Information Technology, Taif University, P.O. Box 11099, Taif 21944, Saudi Arabia
| |
Collapse
|
12
|
Xia J, Wang Z, Yang D, Li R, Liang G, Chen H, Heidari AA, Turabieh H, Mafarja M, Pan Z. Performance optimization of support vector machine with oppositional grasshopper optimization for acute appendicitis diagnosis. Comput Biol Med 2022; 143:105206. [PMID: 35101730 DOI: 10.1016/j.compbiomed.2021.105206] [Citation(s) in RCA: 32] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2021] [Revised: 12/27/2021] [Accepted: 12/30/2021] [Indexed: 12/13/2022]
Abstract
Preoperative differentiation of complicated and uncomplicated appendicitis is challenging. The research goal was to construct a new intelligent diagnostic rule that is accurate, fast, noninvasive, and cost-effective, distinguishing between complicated and uncomplicated appendicitis. Overall, 298 patients with acute appendicitis from the Wenzhou Central Hospital were recruited, and information on their demographic characteristics, clinical findings, and laboratory data was retrospectively reviewed and applied in this study. First, the most significant variables, including C-reactive protein (CRP), heart rate, body temperature, and neutrophils discriminating complicated from uncomplicated appendicitis, were identified using random forest analysis. Second, an improved grasshopper optimization algorithm-based support vector machine was used to construct the diagnostic model to discriminate complicated appendicitis (CAP) from uncomplicated appendicitis (UAP). The resultant optimal model can produce an average of 83.56% accuracy, 81.71% sensitivity, 85.33% specificity, and 0.6732 Matthews correlation coefficients. Based on existing routinely available markers, the proposed intelligent diagnosis model is highly reliable. Thus, the model can potentially be used to assist doctors in making correct clinical decisions.
Collapse
Affiliation(s)
- Jianfu Xia
- Department of General Surgery, The Second Affiliated Hospital of Shanghai University (Wenzhou Central Hospital), Wenzhou, Zhejiang, 325000, China.
| | - Zhifei Wang
- Department of Hepatobiliary, Pancreatic and Minimally Invasive Surgery, Zhejiang Provincial People's Hospital, Hangzhou, 310014, China.
| | - Daqing Yang
- Department of General Surgery, The Second Affiliated Hospital of Shanghai University (Wenzhou Central Hospital), Wenzhou, Zhejiang, 325000, China.
| | - Rizeng Li
- Department of General Surgery, The Second Affiliated Hospital of Shanghai University (Wenzhou Central Hospital), Wenzhou, Zhejiang, 325000, China.
| | - Guoxi Liang
- Department of Information Technology, Wenzhou Polytechnic, Wenzhou, 325035, China.
| | - Huiling Chen
- Department of Computer Science and Artificial Intelligence, Wenzhou University, Wenzhou, 325035, China.
| | - Ali Asghar Heidari
- Department of Computer Science and Artificial Intelligence, Wenzhou University, Wenzhou, 325035, China.
| | - Hamza Turabieh
- Department of Information Technology, College of Computers and Information Technology, Taif University, P.O. Box 11099, Taif, 21944, Taif, Saudi Arabia.
| | - Majdi Mafarja
- Department of Computer Science, Birzeit University, Birzeit, 72439, Palestine.
| | - Zhifang Pan
- The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, 325000, PR China.
| |
Collapse
|
13
|
Fusco R, Di Bernardo E, Piccirillo A, Rubulotta MR, Petrosino T, Barretta ML, Mattace Raso M, Vallone P, Raiano C, Di Giacomo R, Siani C, Avino F, Scognamiglio G, Di Bonito M, Granata V, Petrillo A. Radiomic and Artificial Intelligence Analysis with Textural Metrics Extracted by Contrast-Enhanced Mammography and Dynamic Contrast Magnetic Resonance Imaging to Detect Breast Malignant Lesions. Curr Oncol 2022; 29:1947-1966. [PMID: 35323359 PMCID: PMC8947713 DOI: 10.3390/curroncol29030159] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2022] [Revised: 03/07/2022] [Accepted: 03/10/2022] [Indexed: 11/16/2022] Open
Abstract
Purpose:The purpose of this study was to discriminate between benign and malignant breast lesions through several classifiers using, as predictors, radiomic metrics extracted from CEM and DCE-MRI images. In order to optimize the analysis, balancing and feature selection procedures were performed. Methods: Fifty-four patients with 79 histo-pathologically proven breast lesions (48 malignant lesions and 31 benign lesions) underwent both CEM and DCE-MRI. The lesions were retrospectively analyzed with radiomic and artificial intelligence approaches. Forty-eight textural metrics were extracted, and univariate and multivariate analyses were performed: non-parametric statistical test, receiver operating characteristic (ROC) and machine learning classifiers. Results: Considering the single metrics extracted from CEM, the best predictors were KURTOSIS (area under ROC curve (AUC) = 0.71) and SKEWNESS (AUC = 0.71) calculated on late MLO view. Considering the features calculated from DCE-MRI, the best predictors were RANGE (AUC = 0.72), ENERGY (AUC = 0.72), ENTROPY (AUC = 0.70) and GLN (gray-level nonuniformity) of the gray-level run-length matrix (AUC = 0.72). Considering the analysis with classifiers and an unbalanced dataset, no significant results were obtained. After the balancing and feature selection procedures, higher values of accuracy, specificity and AUC were reached. The best performance was obtained considering 18 robust features among all metrics derived from CEM and DCE-MRI, using a linear discriminant analysis (accuracy of 0.84 and AUC = 0.88). Conclusions: Classifiers, adjusted with adaptive synthetic sampling and feature selection, allowed for increased diagnostic performance of CEM and DCE-MRI in the differentiation between benign and malignant lesions.
Collapse
Affiliation(s)
- Roberta Fusco
- Medical Oncolody Division, Igea SpA, 80013 Naples, Italy; (R.F.); (E.D.B.)
| | - Elio Di Bernardo
- Medical Oncolody Division, Igea SpA, 80013 Naples, Italy; (R.F.); (E.D.B.)
| | - Adele Piccirillo
- Department of Electrical Engineering and Information Technologies, Università degli Studi di Napoli Federico II, 80125 Naples, Italy;
| | - Maria Rosaria Rubulotta
- Radiology Division, Istituto Nazionale Tumori-IRCCS-Fondazione G. Pascale, 80131 Naples, Italy; (M.R.R.); (T.P.); (M.L.B.); (M.M.R.); (P.V.); (C.R.); (A.P.)
| | - Teresa Petrosino
- Radiology Division, Istituto Nazionale Tumori-IRCCS-Fondazione G. Pascale, 80131 Naples, Italy; (M.R.R.); (T.P.); (M.L.B.); (M.M.R.); (P.V.); (C.R.); (A.P.)
| | - Maria Luisa Barretta
- Radiology Division, Istituto Nazionale Tumori-IRCCS-Fondazione G. Pascale, 80131 Naples, Italy; (M.R.R.); (T.P.); (M.L.B.); (M.M.R.); (P.V.); (C.R.); (A.P.)
| | - Mauro Mattace Raso
- Radiology Division, Istituto Nazionale Tumori-IRCCS-Fondazione G. Pascale, 80131 Naples, Italy; (M.R.R.); (T.P.); (M.L.B.); (M.M.R.); (P.V.); (C.R.); (A.P.)
| | - Paolo Vallone
- Radiology Division, Istituto Nazionale Tumori-IRCCS-Fondazione G. Pascale, 80131 Naples, Italy; (M.R.R.); (T.P.); (M.L.B.); (M.M.R.); (P.V.); (C.R.); (A.P.)
| | - Concetta Raiano
- Radiology Division, Istituto Nazionale Tumori-IRCCS-Fondazione G. Pascale, 80131 Naples, Italy; (M.R.R.); (T.P.); (M.L.B.); (M.M.R.); (P.V.); (C.R.); (A.P.)
| | - Raimondo Di Giacomo
- Senology Surgical Division, Istituto Nazionale Tumori-IRCCS-Fondazione G. Pascale, 80131 Naples, Italy; (R.D.G.); (C.S.); (F.A.)
| | - Claudio Siani
- Senology Surgical Division, Istituto Nazionale Tumori-IRCCS-Fondazione G. Pascale, 80131 Naples, Italy; (R.D.G.); (C.S.); (F.A.)
| | - Franca Avino
- Senology Surgical Division, Istituto Nazionale Tumori-IRCCS-Fondazione G. Pascale, 80131 Naples, Italy; (R.D.G.); (C.S.); (F.A.)
| | - Giosuè Scognamiglio
- Pathology Division, Istituto Nazionale Tumori-IRCCS-Fondazione G. Pascale, 80131 Naples, Italy; (G.S.); (M.D.B.)
| | - Maurizio Di Bonito
- Pathology Division, Istituto Nazionale Tumori-IRCCS-Fondazione G. Pascale, 80131 Naples, Italy; (G.S.); (M.D.B.)
| | - Vincenza Granata
- Radiology Division, Istituto Nazionale Tumori-IRCCS-Fondazione G. Pascale, 80131 Naples, Italy; (M.R.R.); (T.P.); (M.L.B.); (M.M.R.); (P.V.); (C.R.); (A.P.)
- Correspondence: ; Tel.: +39-081-590-714; Fax: +39-081-590-3825
| | - Antonella Petrillo
- Radiology Division, Istituto Nazionale Tumori-IRCCS-Fondazione G. Pascale, 80131 Naples, Italy; (M.R.R.); (T.P.); (M.L.B.); (M.M.R.); (P.V.); (C.R.); (A.P.)
| |
Collapse
|
14
|
Rasool A, Bunterngchit C, Tiejian L, Islam MR, Qu Q, Jiang Q. Improved Machine Learning-Based Predictive Models for Breast Cancer Diagnosis. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2022; 19:ijerph19063211. [PMID: 35328897 PMCID: PMC8949437 DOI: 10.3390/ijerph19063211] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/06/2021] [Revised: 03/02/2022] [Accepted: 03/03/2022] [Indexed: 12/24/2022]
Abstract
Breast cancer death rates are higher than any other cancer in American women. Machine learning-based predictive models promise earlier detection techniques for breast cancer diagnosis. However, making an evaluation for models that efficiently diagnose cancer is still challenging. In this work, we proposed data exploratory techniques (DET) and developed four different predictive models to improve breast cancer diagnostic accuracy. Prior to models, four-layered essential DET, e.g., feature distribution, correlation, elimination, and hyperparameter optimization, were deep-dived to identify the robust feature classification into malignant and benign classes. These proposed techniques and classifiers were implemented on the Wisconsin Diagnostic Breast Cancer (WDBC) and Breast Cancer Coimbra Dataset (BCCD) datasets. Standard performance metrics, including confusion matrices and K-fold cross-validation techniques, were applied to assess each classifier's efficiency and training time. The models' diagnostic capability improved with our DET, i.e., polynomial SVM gained 99.3%, LR with 98.06%, KNN acquired 97.35%, and EC achieved 97.61% accuracy with the WDBC dataset. We also compared our significant results with previous studies in terms of accuracy. The implementation procedure and findings can guide physicians to adopt an effective model for a practical understanding and prognosis of breast cancer tumors.
Collapse
Affiliation(s)
- Abdur Rasool
- University of Chinese Academy of Sciences, Beijing 101408, China; (A.R.); (C.B.)
- Shenzhen Key Lab for High Performance Data Mining, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China;
| | - Chayut Bunterngchit
- University of Chinese Academy of Sciences, Beijing 101408, China; (A.R.); (C.B.)
- State Key Laboratory of Management and Control for Complex Systems, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
| | - Luo Tiejian
- University of Chinese Academy of Sciences, Beijing 101408, China; (A.R.); (C.B.)
- Correspondence: (L.T.); (Q.J.); Tel.: +86-137-0127-2380 (L.T.); +86-755-8639-2340 (Q.J.)
| | - Md. Ruhul Islam
- Department of Electrical Engineering and Computer Science, University of Stavanger, 4044 Stavanger, Norway;
| | - Qiang Qu
- Shenzhen Key Lab for High Performance Data Mining, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China;
| | - Qingshan Jiang
- Shenzhen Key Lab for High Performance Data Mining, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China;
- Correspondence: (L.T.); (Q.J.); Tel.: +86-137-0127-2380 (L.T.); +86-755-8639-2340 (Q.J.)
| |
Collapse
|
15
|
Gomathi S, Malarvizhi K, Kavitha MS. Detection of Mammogram Using Improved Watershed Segmentation Algorithm and Classifying with Feed Forward Neural Network (FNN). JOURNAL OF MEDICAL IMAGING AND HEALTH INFORMATICS 2022. [DOI: 10.1166/jmihi.2022.3939] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Segmentation of breast tumors with more accuracy using computerized methods is essential for breast cancer monitoring and quantification. Both segmentation and classification of breast tumors using a fully automated or Computer-Aided Diagnosis system poses various problems in terms
of imaging properties. In this work, a new hybrid algorithm is proposed for segmentation with a two-step process. Initially, a watershed transformation is applied to separate all basins based on pixel density variation from the mass present in tumors, since it has been quite booming in the
presence of tumors in all circumstances. Though this is very perceptive to tiny fluctuations in the size of the image, large numbers of areas are produced unacceptably, and the boundaries after segmentations are also quite hard. The second level set is an effective method of segmenting all
types of medical images because; it easily flows with, cavities, folds, splits, and merges. To make the recognition step easier and more accurate, the result of segmentation is considered the beginning position of the curve, and the same will be used at the next step of the level set. This
produces a closed, smooth, and accurately placed contour or surface. As a result, the present research uses watershed segmentation to isolate tumor regions and performs classification using Feed Forward Neural Network (FNN) to extract features for classification. Experimental results are evaluated
based on performance and quality analysis. In the classification process, the study obtained an accuracy rate of 91.2% in the learning model and 71.8% in a testing model.
Collapse
Affiliation(s)
- S. Gomathi
- Department of Computer Science & Engineering, SNS College of Technology, Coimbatore, 641035, Tamil Nadu, India
| | - K. Malarvizhi
- Department of Electrical and Electronics Engineering, Kumaraguru College of Technology, Coimbatore, 641049, India
| | - M. S. Kavitha
- Department of Computer Science & Engineering, SNS College of Technology, Coimbatore, 641035, Tamil Nadu, India
| |
Collapse
|
16
|
A survey on artificial intelligence techniques for chronic diseases: open issues and challenges. Artif Intell Rev 2021. [DOI: 10.1007/s10462-021-10084-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
|
17
|
Makond B, Wang KJ, Wang KM. Benchmarking prognosis methods for survivability - A case study for patients with contingent primary cancers. Comput Biol Med 2021; 138:104888. [PMID: 34610552 DOI: 10.1016/j.compbiomed.2021.104888] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2021] [Accepted: 09/17/2021] [Indexed: 11/18/2022]
Abstract
BACKGROUND There is an increasing number of patients with a first primary cancer who are diagnosed with a second primary cancer, but prognosis methods to predict the survivability of a patient with multiple primary cancers have not been fully benchmarked. METHODS This study investigated the five-year survivability prognosis performances of six machine learning approaches. These approaches are: artificial neural network, decision tree (DT), logistic regression, support vector machine, naïve Bayes (NB), and Bayesian network (BN). A synthetic minority over-sampling technique (SMOTE) was used to solve the imbalanced problem, and a nationwide cancer patient database containing 7,845 subjects in Taiwan was used as a sample source. Ten primary and secondary cancers and their key variables affecting the survivability of the patients were identified. RESULTS All the models using SMOTE improved sensitivity and specificity significantly. NB has the highest performance in terms of accuracy and specificity, whereas BN has the highest performance in terms of sensitivity. Further, the computational time and the power of knowledge representation of NB, BN, and DT outperformed the others. CONCLUSIONS Selecting the appropriate prognosis models to predict survivability of patients with two contingent primary cancers can aid precise prediction and can support appropriate treatment advice.
Collapse
Affiliation(s)
- Bunjira Makond
- Faculty of Commerce and Management, Prince of Songkla University, Trang, Thailand.
| | - Kung-Jeng Wang
- Department of Industrial Management National Taiwan University of Science and Technology, Taipei, 106, ROC, Taiwan.
| | - Kung-Min Wang
- Department of Surgery, Shin-Kong Wu Ho-Su Memorial Hospital, Taipei, R.O.C, Taiwan.
| |
Collapse
|
18
|
Yang X, Stamp M. Computer-aided diagnosis of low grade endometrial stromal sarcoma (LGESS). Comput Biol Med 2021; 138:104874. [PMID: 34571437 DOI: 10.1016/j.compbiomed.2021.104874] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2021] [Revised: 09/09/2021] [Accepted: 09/13/2021] [Indexed: 10/20/2022]
Abstract
Low grade endometrial stromal sarcoma (LGESS) accounts for about 0.2% of all uterine cancer cases. Approximately 75% of LGESS patients are initially misdiagnosed with leiomyoma, which is a type of benign tumor, also known as fibroids. In this research, uterine tissue biopsy images of potential LGESS patients are preprocessed using segmentation and stain normalization algorithms. We then apply a variety of classic machine learning and advanced deep learning models to classify tissue images as either benign or cancerous. For the classic techniques considered, the highest classification accuracy we attain is about 0.85, while our best deep learning model achieves an accuracy of approximately 0.87. These results clearly indicate that properly trained learning algorithms can aid in the diagnosis of LGESS.
Collapse
Affiliation(s)
- Xinxin Yang
- Department of Computer Science, San Jose State University, USA
| | - Mark Stamp
- Department of Computer Science, San Jose State University, USA.
| |
Collapse
|
19
|
Ramesh Dhanaseelan F, Jeya Sutha M. Detection of Breast Cancer Based on Fuzzy Frequent Itemsets Mining. Ing Rech Biomed 2021. [DOI: 10.1016/j.irbm.2020.05.002] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
|
20
|
Cheraghlou S, Sadda P, Agogo GO, Girardi M. A machine-learning modified CART algorithm informs Merkel cell carcinoma prognosis. Australas J Dermatol 2021; 62:323-330. [PMID: 34028790 DOI: 10.1111/ajd.13624] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2021] [Accepted: 04/17/2021] [Indexed: 11/30/2022]
Abstract
BACKGROUND Merkel cell carcinoma (MCC) is a rare neuroendocrine skin cancer with a high mortality rate. MCC staging is currently based on tumour primary size, clinical detectability of lymph node metastases, performance of a lymph node biopsy, and presence of distant metastases. OBJECTIVE We aimed to use a modified classification and regression tree (CART) algorithm using available data points in the National Cancer Database (NCDB) to elucidate novel prognostic factors for MCC. METHODS Retrospective cohort study of the NCDB and Surveillance, Epidemiology, and End Results (SEER) registries. Cases from the NCDB were randomly assigned to either the training or validation cohorts. A modified CART algorithm was created with data from the training cohort and used to identify prognostic groups that were validated in the NCDB validation and SEER cohorts. RESULTS A modified CART algorithm using tumour variables available in the NCDB identified prognostic strata as follows: I: local disease, II: ≤3 positive nodes, III: ≥4 positive nodes, and IV: presence of distant metastases. Three-year survival for these groups in the NCDB validation cohort were 81.2% (SE: 1.7), 59.6% (SE: 3.0), 38.0% (SE: 6.0), and 20.2% (SE: 7.0), respectively. These strata were exhibited greater within-group homogeneity than AJCC groups and were more predictive of survival. CONCLUSIONS Risk-stratified grouping of MCC patients incorporating positive lymph node count were strongly predictive of survival and demonstrated a high degree of within-group homogeneity and survival prediction. Incorporation of positive lymph node count within overall staging or sub-staging may help to improve future MCC staging criteria.
Collapse
Affiliation(s)
- Shayan Cheraghlou
- Department of Dermatology, Yale School of Medicine, New Haven, Connecticut, USA
| | - Praneeth Sadda
- Department of Biomedical Imaging, Yale School of Medicine, New Haven, Connecticut, USA.,Department of Internal Medicine, Tulane University School of Medicine, New Orleans, Louisiana, USA
| | - George O Agogo
- Department of Internal Medicine, Yale School of Medicine, New Haven, Connecticut, USA
| | - Michael Girardi
- Department of Dermatology, Yale School of Medicine, New Haven, Connecticut, USA
| |
Collapse
|
21
|
Zhang R, Xu Z, Hao J, Yu J, Liu Z, Liu S, Chen W, Zhou J, Li H, Lin Z, Zheng W. Label-free identification of human coronary atherosclerotic plaque based on a three-dimensional quantitative assessment of multiphoton microscopy images. BIOMEDICAL OPTICS EXPRESS 2021; 12:2979-2995. [PMID: 34168910 PMCID: PMC8194630 DOI: 10.1364/boe.422525] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/16/2021] [Revised: 04/12/2021] [Accepted: 04/15/2021] [Indexed: 06/13/2023]
Abstract
The rupture of coronary atherosclerotic plaque (CAP) and the resulting intracoronary thrombosis account for most acute coronary syndromes. Thus, the early identification and risk assessment of CAP is crucial for timely medical intervention. In this study, we propose a quantitative and label-free method for human CAP identification using multiphoton microscopy (MPM) and three-dimensional (3D) image analysis techniques. By detecting the intrinsic MPM signals, the microstructures of collagen and elastin fibers within normal and CAP-lesioned human coronary artery walls were imaged. Using a 3D gray level co-occurrence matrix method and 3D weighted vector summation algorithm, quantitative indicators that characterize the spatial texture and orientation features of the fibers were extracted. We demonstrate that these indicators show superior accuracy and repeatability over 2D texture features in CAP discrimination. Furthermore, by combining the 3D microstructural indicators, a support vector machine model that classifies CAP from the normal arterial wall with an accuracy of >97% was established. In conjunction with advances in multiphoton endoscopy, the proposed method shows great potential in providing a quantitative, label-free, and real-time tool for the early identification and risk assessment of CAP in the future.
Collapse
Affiliation(s)
- Rongli Zhang
- Guangdong Provincial Geriatrics Institute, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, Guangzhou, Guangdong 510080, China
- Department of Cardiology, Guangdong Provincial Cardiovascular Institute, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, Guangzhou, Guangdong 510080, China
- Research Center for Biomedical Optics and Molecular Imaging, Shenzhen Key Laboratory for Molecular Imaging, Guangdong Provincial Key Laboratory of Biomedical Optical Imaging Technology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China
- CAS Key Laboratory of Health Informatics, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China
| | - Zhongbiao Xu
- Department of Radiotherapy, Cancer Center, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, Guangzhou, Guangdong 510080, China
| | - Junhai Hao
- Department of Intensive Care Unit of Cardiovascular Surgery, Guangdong Cardiovascular Institute, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, Guangzhou, Guangdong 510080, China
| | - Jia Yu
- Research Center for Biomedical Optics and Molecular Imaging, Shenzhen Key Laboratory for Molecular Imaging, Guangdong Provincial Key Laboratory of Biomedical Optical Imaging Technology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China
- CAS Key Laboratory of Health Informatics, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China
| | - Zhiyi Liu
- State Key Laboratory of Modern Optical Instrumentation, College of Optical Science and Engineering, International Research Center for Advanced Photonics, Zhejiang University, Hangzhou, Zhejiang 310058, China
| | - Shun Liu
- Research Center for Biomedical Optics and Molecular Imaging, Shenzhen Key Laboratory for Molecular Imaging, Guangdong Provincial Key Laboratory of Biomedical Optical Imaging Technology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China
- CAS Key Laboratory of Health Informatics, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China
- School of Optoelectronic Engineering, Xi'an Technological University, Xi'an 710021, China
| | - Wanwen Chen
- Department of Cardiology, Guangdong Provincial Cardiovascular Institute, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, Guangzhou, Guangdong 510080, China
| | - Jiahui Zhou
- Department of Cardiology, Guangdong Provincial Cardiovascular Institute, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, Guangzhou, Guangdong 510080, China
| | - Hui Li
- Research Center for Biomedical Optics and Molecular Imaging, Shenzhen Key Laboratory for Molecular Imaging, Guangdong Provincial Key Laboratory of Biomedical Optical Imaging Technology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China
- CAS Key Laboratory of Health Informatics, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China
| | - Zhanyi Lin
- Guangdong Provincial Geriatrics Institute, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, Guangzhou, Guangdong 510080, China
- Department of Cardiology, Guangdong Provincial Cardiovascular Institute, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, Guangzhou, Guangdong 510080, China
| | - Wei Zheng
- Research Center for Biomedical Optics and Molecular Imaging, Shenzhen Key Laboratory for Molecular Imaging, Guangdong Provincial Key Laboratory of Biomedical Optical Imaging Technology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China
- CAS Key Laboratory of Health Informatics, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China
| |
Collapse
|
22
|
Ghiasi MM, Zendehboudi S. Application of decision tree-based ensemble learning in the classification of breast cancer. Comput Biol Med 2020; 128:104089. [PMID: 33338982 DOI: 10.1016/j.compbiomed.2020.104089] [Citation(s) in RCA: 41] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2020] [Revised: 10/22/2020] [Accepted: 10/22/2020] [Indexed: 11/25/2022]
Abstract
As a common screening and diagnostic tool, Fine Needle Aspiration Biopsy (FNAB) of the suspicious breast lumps can be used to distinguish between malignant and benign breast cytology. In this study, we first review published works on the classification of breast cancer where the machine learning and data mining algorithms have been applied by using the Wisconsin Breast Cancer Database (WBCD). This work then introduces useful new tools, based on Random Forest (RF) and Extremely Randomized Trees or Extra Trees (ET) algorithms to classify breast cancer. The RF and ET strategies use the decision trees as proper classifiers to attain the ultimate classification. The RF and ET approaches include four main stages: input identification, determination of the optimal number of trees, voting analysis, and final decision. The models implemented in this research consider important factors such as uniformity of cell size, bland chromatin, mitoses, and clump thickness as the input parameters. According to the statistical analysis, the proposed methods are able to classify the type of breast cancer accurately. The error analysis results reveal that the designed RF and ET models offer easy-to-use outcomes and the highest diagnostic performance, compared to previous tools/models in the literature for the WBCD classification. The highest and lowest magnitudes of relative importance are attributed to the uniformity of cell size and mitoses among the factors. It is expected that the RF and ET algorithms play an important role in medicine and health systems for screening and diagnosis in the near future.
Collapse
Affiliation(s)
- Mohammad M Ghiasi
- Faculty of Engineering and Applied Science, Memorial University, St. John's, NL A1B 3X5, Canada.
| | - Sohrab Zendehboudi
- Faculty of Engineering and Applied Science, Memorial University, St. John's, NL A1B 3X5, Canada.
| |
Collapse
|
23
|
A Comparative Performance Evaluation of Classification Algorithms for Clinical Decision Support Systems. MATHEMATICS 2020. [DOI: 10.3390/math8101814] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
Classification algorithms are widely taken into account for clinical decision support systems. However, it is not always straightforward to understand the behavior of such algorithms on a multiple disease prediction task. When a new classifier is introduced, we, in most cases, will ask ourselves whether the classifier performs well on a particular clinical dataset or not. The decision to utilize classifiers mostly relies upon the type of data and classification task, thus making it often made arbitrarily. In this study, a comparative evaluation of a wide-array classifier pertaining to six different families, i.e., tree, ensemble, neural, probability, discriminant, and rule-based classifiers are dealt with. A number of real-world publicly datasets ranging from different diseases are taken into account in the experiment in order to demonstrate the generalizability of the classifiers in multiple disease prediction. A total of 25 classifiers, 14 datasets, and three different resampling techniques are explored. This study reveals that the classifier that is likely to become the best performer is the conditional inference tree forest (cforest), followed by linear discriminant analysis, generalize linear model, random forest, and Gaussian process classifier. This work contributes to existing literature regarding a thorough benchmark of classification algorithms for multiple diseases prediction.
Collapse
|
24
|
Aruleba K, Obaido G, Ogbuokiri B, Fadaka AO, Klein A, Adekiya TA, Aruleba RT. Applications of Computational Methods in Biomedical Breast Cancer Imaging Diagnostics: A Review. J Imaging 2020; 6:105. [PMID: 34460546 PMCID: PMC8321173 DOI: 10.3390/jimaging6100105] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2020] [Revised: 09/12/2020] [Accepted: 09/14/2020] [Indexed: 12/14/2022] Open
Abstract
With the exponential increase in new cases coupled with an increased mortality rate, cancer has ranked as the second most prevalent cause of death in the world. Early detection is paramount for suitable diagnosis and effective treatment of different kinds of cancers, but this is limited to the accuracy and sensitivity of available diagnostic imaging methods. Breast cancer is the most widely diagnosed cancer among women across the globe with a high percentage of total cancer deaths requiring an intensive, accurate, and sensitive imaging approach. Indeed, it is treatable when detected at an early stage. Hence, the use of state of the art computational approaches has been proposed as a potential alternative approach for the design and development of novel diagnostic imaging methods for breast cancer. Thus, this review provides a concise overview of past and present conventional diagnostics approaches in breast cancer detection. Further, we gave an account of several computational models (machine learning, deep learning, and robotics), which have been developed and can serve as alternative techniques for breast cancer diagnostics imaging. This review will be helpful to academia, medical practitioners, and others for further study in this area to improve the biomedical breast cancer imaging diagnosis.
Collapse
Affiliation(s)
- Kehinde Aruleba
- School of Computer Science and Applied Mathematics, University of the Witwatersrand, Johannesburg 2001, South Africa; (K.A.); (G.O.); (B.O.)
| | - George Obaido
- School of Computer Science and Applied Mathematics, University of the Witwatersrand, Johannesburg 2001, South Africa; (K.A.); (G.O.); (B.O.)
| | - Blessing Ogbuokiri
- School of Computer Science and Applied Mathematics, University of the Witwatersrand, Johannesburg 2001, South Africa; (K.A.); (G.O.); (B.O.)
| | - Adewale Oluwaseun Fadaka
- Department of Biotechnology, Faculty of Natural Sciences, University of the Western Cape, Private Bag X17, Bellville, Cape Town 7535, South Africa;
| | - Ashwil Klein
- Department of Biotechnology, Faculty of Natural Sciences, University of the Western Cape, Private Bag X17, Bellville, Cape Town 7535, South Africa;
| | - Tayo Alex Adekiya
- Department of Pharmacy and Pharmacology, School of Therapeutic Science, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, 7 York Road, Parktown 2193, South Africa;
| | - Raphael Taiwo Aruleba
- Department of Molecular and Cell Biology, Faculty of Science, University of Cape Town, Cape Town 7701, South Africa
| |
Collapse
|
25
|
Design of an integrated model for diagnosis and classification of pediatric acute leukemia using machine learning. Proc Inst Mech Eng H 2020; 234:1051-1069. [DOI: 10.1177/0954411920938567] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Applying artificial intelligence techniques for diagnosing diseases in hospitals often provides advanced medical services to patients such as the diagnosis of leukemia. On the other hand, surgery and bone marrow sampling, especially in the diagnosis of childhood leukemia, are even more complex and difficult, resulting in increased human error and procedure time decreased patient satisfaction and increased costs. This study investigates the use of neuro-fuzzy and group method of data handling, for the diagnosis of acute leukemia in children based on the complete blood count test. Furthermore, a principal component analysis is applied to increase the accuracy of the diagnosis. The results show that distinguishing between patient and non-patient individuals can easily be done with adaptive neuro-fuzzy inference system, whereas for classifying between the types of diseases themselves, more pre-processing operations such as reduction of features may be needed. The proposed approach may help to distinguish between two types of leukemia including acute lymphoblastic leukemia and acute myeloid leukemia. Based on the sensitivity of the diagnosis, experts can use the proposed algorithm to help identify the disease earlier and lessen the cost.
Collapse
|
26
|
Cloud-Based Breast Cancer Prediction Empowered with Soft Computing Approaches. JOURNAL OF HEALTHCARE ENGINEERING 2020; 2020:8017496. [PMID: 32509260 PMCID: PMC7254089 DOI: 10.1155/2020/8017496] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/18/2020] [Accepted: 04/30/2020] [Indexed: 11/17/2022]
Abstract
The developing countries are still starving for the betterment of health sector. The disease commonly found among the women is breast cancer, and past researches have proven results that if the cancer is detected at a very early stage, the chances to overcome the disease are higher than the disease treated or detected at a later stage. This article proposed cloud-based intelligent BCP-T1F-SVM with 2 variations/models like BCP-T1F and BCP-SVM. The proposed BCP-T1F-SVM system has employed two main soft computing algorithms. The proposed BCP-T1F-SVM expert system specifically defines the stage and the type of cancer a person is suffering from. Expert system will elaborate the grievous stages of the cancer, to which extent a patient has suffered. The proposed BCP-SVM gives the higher precision of the proposed breast cancer detection model. In the limelight of breast cancer, the proposed BCP-T1F-SVM expert system gives out the higher precision rate. The proposed BCP-T1F expert system is being employed in the diagnosis of breast cancer at an initial stage. Taking different stages of cancer into account, breast cancer is being dealt by BCP-T1F expert system. The calculations and the evaluation done in this research have revealed that BCP-SVM is better than BCP-T1F. The BCP-T1F concludes out the 96.56 percentage accuracy, whereas the BCP-SVM gives accuracy of 97.06 percentage. The above unleashed research is wrapped up with the conclusion that BCP-SVM is better than the BCP-T1F. The opinions have been recommended by the medical expertise of Sheikh Zayed Hospital Lahore, Pakistan, and Cavan General Hospital, Lisdaran, Cavan, Ireland.
Collapse
|
27
|
Wang M, Chen H. Chaotic multi-swarm whale optimizer boosted support vector machine for medical diagnosis. Appl Soft Comput 2020. [DOI: 10.1016/j.asoc.2019.105946] [Citation(s) in RCA: 215] [Impact Index Per Article: 53.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
28
|
Learning Biased SVM with Weighted Within-Class Scatter for Imbalanced Classification. Neural Process Lett 2020. [DOI: 10.1007/s11063-019-10096-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
29
|
Arzehgar A, Khalilzadeh MM, Varshoei F. Assessment and Classification of Mass Lesions Based on Expert Knowledge Using Mammographic Analysis. Curr Med Imaging 2020; 15:199-208. [PMID: 31975666 DOI: 10.2174/1573405614666171213161559] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2017] [Revised: 11/29/2017] [Accepted: 12/02/2017] [Indexed: 11/22/2022]
Abstract
BACKGROUND Masses are one of the most important indicators of breast cancer in mammograms, and their classification into two groups as benign and malignant is highly necessary. Computer Aided Diagnosis (CADx) helps radiologists enhance the accuracy of their decision. Hence, the system is required to support and assess with radiologist's interaction as an expert. METHODS In this research, classification of breast masses using mammography in the two main views which include MLO and CC, is evaluated with respect to the shape, texture and asymmetry aspect. Additionally, a method was developed and proposed using the classification of breast tissue density based on the decision tree. DISCUSSION This study therefore, aims to provide a method based on the human decision-making model that will help in designing the perfect tool for radiologists, regardless of the complexity of computing, costly procedures and also reducing the diagnosis error. CONCLUSION Results show that the proposed system for entirely fat, scattered fibroglandular densities, heterogeneously dense, and extremely dense breast achieved 100, 99, 99 and 98% true malignant rate, respectively with cross-validation procedure.
Collapse
Affiliation(s)
- Afrooz Arzehgar
- Department of Biomedical Engineering, Islamic Azad University, Mashhad Branch, Mashhad, Iran
| | | | | |
Collapse
|
30
|
Dalwinder S, Birmohan S, Manpreet K. Simultaneous feature weighting and parameter determination of Neural Networks using Ant Lion Optimization for the classification of breast cancer. Biocybern Biomed Eng 2020. [DOI: 10.1016/j.bbe.2019.12.004] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
|
31
|
Shaikh TA, Ali R. An intelligent healthcare system for optimized breast cancer diagnosis using harmony search and simulated annealing (HS-SA) algorithm. INFORMATICS IN MEDICINE UNLOCKED 2020. [DOI: 10.1016/j.imu.2020.100408] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022] Open
|
32
|
Yu D, Liu Z, Su C, Han Y, Duan X, Zhang R, Liu X, Yang Y, Xu S. Copy number variation in plasma as a tool for lung cancer prediction using Extreme Gradient Boosting (XGBoost) classifier. Thorac Cancer 2019; 11:95-102. [PMID: 31694073 PMCID: PMC6938748 DOI: 10.1111/1759-7714.13204] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2019] [Revised: 09/05/2019] [Accepted: 09/05/2019] [Indexed: 01/21/2023] Open
Abstract
Background The main cause of cancer death is lung cancer (LC) which usually presents at an advanced stage, but its early detection would increase the benefits of treatment. Blood is particularly favored in clinical research given the possibility of using it for relatively noninvasive analyses. Copy number variation (CNV) is a common genetic change in tumor genomes, and many studies have indicated that CNV‐derived cell‐free DNA (cfDNA) from plasma could be feasible as a biomarker for cancer diagnosis. Methods In this study, we determined the possibility of using chromosomal arm‐level CNV from cfDNA as a biomarker for lung cancer diagnosis in a small cohort of 40 patients and 41 healthy controls. Arm‐level CNV distributions were analyzed based on z score, and the machine‐learning algorithm Extreme Gradient Boosting (XGBoost) was applied for cancer prediction. Results The results showed that amplifications tended to emerge on chromosomes 3q, 8q, 12p, and 7q. Deletions were frequently detected on chromosomes 22q, 3p, 5q, 16q, 10q, and 15q. Upon applying a trained XGBoost classifier, specificity and sensitivity of 100% were finally achieved in the test group (12 patients and 13 healthy controls). In addition, five‐fold cross‐validation proved the stability of the model. Finally, our results suggested that the integration of four arm‐level CNVs and the concentration of cfDNA into the trained XGBoost classifier provides a potential method for detecting lung cancer. Conclusion Our results suggested that the integration of four arm‐level CNVs and the concentration from of cfDNA integrated withinto the trained XGBoost classifier could become provides a potentially method for detecting lung cancer detection. Key points Significant findings of the study:Healthy individuals have different arm‐level CNV profiles from cancer patients. Amplifications tend to emerge on chromosome 3q, 8q, 12p, 7q and deletions tend to emerge on chromosome 22q, 3p, 5q, 16q, 10q, 15q.
What this study adds:
CfDNA concentration, arm 10q, 3q, 8q, 3p, and 22q are key features for prediction. Trained XGBoost classifier is a potential method for lung cancer detection.
Collapse
Affiliation(s)
- Daping Yu
- Thoracic Surgery Department, Beijing Chest Hospital, Capital Medical University; Beijing Tuberculosis and Thoracic Tumor Research Institute, Beijing, China
| | - Zhidong Liu
- Thoracic Surgery Department, Beijing Chest Hospital, Capital Medical University; Beijing Tuberculosis and Thoracic Tumor Research Institute, Beijing, China
| | - Chongyu Su
- Thoracic Surgery Department, Beijing Chest Hospital, Capital Medical University; Beijing Tuberculosis and Thoracic Tumor Research Institute, Beijing, China
| | - Yi Han
- Thoracic Surgery Department, Beijing Chest Hospital, Capital Medical University; Beijing Tuberculosis and Thoracic Tumor Research Institute, Beijing, China
| | - XinChun Duan
- Thoracic Surgery Department, Beijing Chest Hospital, Capital Medical University; Beijing Tuberculosis and Thoracic Tumor Research Institute, Beijing, China
| | - Rui Zhang
- Thoracic Surgery Department, Beijing Chest Hospital, Capital Medical University; Beijing Tuberculosis and Thoracic Tumor Research Institute, Beijing, China
| | | | - Yang Yang
- Beijing Gencode Diagnostics Laboratory, Beijing, China
| | - Shaofa Xu
- Thoracic Surgery Department, Beijing Chest Hospital, Capital Medical University; Beijing Tuberculosis and Thoracic Tumor Research Institute, Beijing, China
| |
Collapse
|
33
|
An Optimisation-Driven Prediction Method for Automated Diagnosis and Prognosis. MATHEMATICS 2019. [DOI: 10.3390/math7111051] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
This article presents a novel hybrid classification paradigm for medical diagnoses and prognoses prediction. The core mechanism of the proposed method relies on a centroid classification algorithm whose logic is exploited to formulate the classification task as a real-valued optimisation problem. A novel metaheuristic combining the algorithmic structure of Swarm Intelligence optimisers with the probabilistic search models of Estimation of Distribution Algorithms is designed to optimise such a problem, thus leading to high-accuracy predictions. This method is tested over 11 medical datasets and compared against 14 cherry-picked classification algorithms. Results show that the proposed approach is competitive and superior to the state-of-the-art on several occasions.
Collapse
|
34
|
Shi Y, Wong WK, Goldin JG, Brown MS, Kim GHJ. Prediction of progression in idiopathic pulmonary fibrosis using CT scans at baseline: A quantum particle swarm optimization - Random forest approach. Artif Intell Med 2019; 100:101709. [PMID: 31607341 DOI: 10.1016/j.artmed.2019.101709] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2018] [Revised: 08/10/2019] [Accepted: 08/19/2019] [Indexed: 11/28/2022]
Abstract
Idiopathic pulmonary fibrosis (IPF) is a fatal lung disease characterized by an unpredictable progressive decline in lung function. Natural history of IPF is unknown and the prediction of disease progression at the time of diagnosis is notoriously difficult. High resolution computed tomography (HRCT) has been used for the diagnosis of IPF, but not generally for monitoring purpose. The objective of this work is to develop a novel predictive model for the radiological progression pattern at voxel-wise level using only baseline HRCT scans. Mainly, there are two challenges: (a) obtaining a data set of features for region of interest (ROI) on baseline HRCT scans and their follow-up status; and (b) simultaneously selecting important features from high-dimensional space, and optimizing the prediction performance. We resolved the first challenge by implementing a study design and having an expert radiologist contour ROIs at baseline scans, depending on its progression status in follow-up visits. For the second challenge, we integrated the feature selection with prediction by developing an algorithm using a wrapper method that combines quantum particle swarm optimization to select a small number of features with random forest to classify early patterns of progression. We applied our proposed algorithm to analyze anonymized HRCT images from 50 IPF subjects from a multi-center clinical trial. We showed that it yields a parsimonious model with 81.8% sensitivity, 82.2% specificity and an overall accuracy rate of 82.1% at the ROI level. These results are superior to other popular feature selections and classification methods, in that our method produces higher accuracy in prediction of progression and more balanced sensitivity and specificity with a smaller number of selected features. Our work is the first approach to show that it is possible to use only baseline HRCT scans to predict progressive ROIs at 6 months to 1year follow-ups using artificial intelligence.
Collapse
Affiliation(s)
- Yu Shi
- Department of Biostatistics, University of California Los Angeles, USA
| | - Weng Kee Wong
- Department of Biostatistics, University of California Los Angeles, USA
| | - Jonathan G Goldin
- Department of Radiological Sciences, University of California Los Angeles, USA
| | - Matthew S Brown
- Department of Radiological Sciences, University of California Los Angeles, USA
| | - Grace Hyun J Kim
- Department of Biostatistics, University of California Los Angeles, USA; Department of Radiological Sciences, University of California Los Angeles, USA.
| |
Collapse
|
35
|
Huang H, Feng X, Zhou S, Jiang J, Chen H, Li Y, Li C. A new fruit fly optimization algorithm enhanced support vector machine for diagnosis of breast cancer based on high-level features. BMC Bioinformatics 2019; 20:290. [PMID: 31182028 PMCID: PMC6557762 DOI: 10.1186/s12859-019-2771-z] [Citation(s) in RCA: 60] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND It is of great clinical significance to develop an accurate computer aided system to accurately diagnose the breast cancer. In this study, an enhanced machine learning framework is established to diagnose the breast cancer. The core of this framework is to adopt fruit fly optimization algorithm (FOA) enhanced by Levy flight (LF) strategy (LFOA) to optimize two key parameters of support vector machine (SVM) and build LFOA-based SVM (LFOA-SVM) for diagnosing the breast cancer. The high-level features abstracted from the volunteers are utilized to diagnose the breast cancer for the first time. RESULTS In order to verify the effectiveness of the proposed method, 10-fold cross-validation method is used to make comparison among the proposed method, FOA-SVM (model based on original FOA), PSO-SVM (model based on original particle swarm optimization), GA-SVM (model based on genetic algorithm), random forest, back propagation neural network and SVM. The main novelty of LFOA-SVM lies in the combination of FOA with LF strategy that enhances the quality for FOA, thus improving the convergence rate of the FOA optimization process as well as the probability of escaping from local optimal solution. CONCLUSIONS The experimental results demonstrate that the proposed LFOA-SVM method can beat other counterparts in terms of various performance metrics. It can very well distinguish malignant breast cancer from benign ones and assist the doctor with clinical diagnosis.
Collapse
Affiliation(s)
- Hui Huang
- School of Marine Science and Technology, Northwestern Polytechnical University, Xi'an, 710072, China
| | - Xi'an Feng
- School of Marine Science and Technology, Northwestern Polytechnical University, Xi'an, 710072, China
| | - Suying Zhou
- Pathology Department of Wenzhou People's Hospital, Wenzhou, 325035, China
| | - Jionghui Jiang
- Zhijiang College of Zhejiang University of Technology, Hangzhou, 310024, China
| | - Huiling Chen
- Department of Computer Science, Wenzhou University, Wenzhou, 325035, China.
| | - Yuping Li
- Department of Pulmonary and Critical Care Medicine, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, 325000, China
| | - Chengye Li
- Department of Pulmonary and Critical Care Medicine, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, 325000, China.
| |
Collapse
|
36
|
Hybrid Cascade Forward Neural Network with Elman Neural Network for Disease Prediction. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING 2019. [DOI: 10.1007/s13369-019-03829-3] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
37
|
Short-Term Traffic Flow Prediction Based on Least Square Support Vector Machine with Hybrid Optimization Algorithm. Neural Process Lett 2019. [DOI: 10.1007/s11063-019-09994-8] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
38
|
Ghasemzadeh A, Sarbazi Azad S, Esmaeili E. Breast cancer detection based on Gabor-wavelet transform and machine learning methods. INT J MACH LEARN CYB 2018. [DOI: 10.1007/s13042-018-0837-2] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
39
|
Klidbary SH, Shouraki SB. A novel adaptive learning algorithm for low-dimensional feature space using memristor-crossbar implementation and on-chip training. APPL INTELL 2018. [DOI: 10.1007/s10489-018-1202-6] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/14/2022]
|
40
|
Patrício M, Pereira J, Crisóstomo J, Matafome P, Gomes M, Seiça R, Caramelo F. Using Resistin, glucose, age and BMI to predict the presence of breast cancer. BMC Cancer 2018; 18:29. [PMID: 29301500 PMCID: PMC5755302 DOI: 10.1186/s12885-017-3877-1] [Citation(s) in RCA: 120] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2016] [Accepted: 12/05/2017] [Indexed: 12/11/2022] Open
Abstract
Background The goal of this exploratory study was to develop and assess a prediction model which can potentially be used as a biomarker of breast cancer, based on anthropometric data and parameters which can be gathered in routine blood analysis. Methods For each of the 166 participants several clinical features were observed or measured, including age, BMI, Glucose, Insulin, HOMA, Leptin, Adiponectin, Resistin and MCP-1. Machine learning algorithms (logistic regression, random forests, support vector machines) were implemented taking in as predictors different numbers of variables. The resulting models were assessed with a Monte Carlo Cross-Validation approach to determine 95% confidence intervals for the sensitivity, specificity and AUC of the models. Results Support vector machines models using Glucose, Resistin, Age and BMI as predictors allowed predicting the presence of breast cancer in women with sensitivity ranging between 82 and 88% and specificity ranging between 85 and 90%. The 95% confidence interval for the AUC was [0.87, 0.91]. Conclusions These findings provide promising evidence that models combining age, BMI and metabolic parameters may be a powerful tool for a cheap and effective biomarker of breast cancer. Electronic supplementary material The online version of this article (10.1186/s12885-017-3877-1) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Miguel Patrício
- Laboratory of Biostatistics and Medical Informatics and IBILI - Faculty of Medicine, University of Coimbra, Azinhaga Santa Comba, Celas, 3000-548, Coimbra, Portugal.
| | - José Pereira
- Faculty of Medicine, University of Coimbra, Coimbra, Portugal
| | - Joana Crisóstomo
- Laboratory of Physiology, IBILI - Faculty of Medicine of University of Coimbra, Coimbra, Portugal
| | - Paulo Matafome
- Laboratory of Physiology, IBILI - Faculty of Medicine of University of Coimbra, Coimbra, Portugal.,Department of Complementary Sciences, Coimbra Health School - Instituto Politécnico de Coimbra, Coimbra, Portugal
| | - Manuel Gomes
- Department of Internal Medicine, University Hospital Centre of Coimbra, Coimbra, Portugal
| | - Raquel Seiça
- Laboratory of Physiology, IBILI - Faculty of Medicine of University of Coimbra, Coimbra, Portugal
| | - Francisco Caramelo
- Laboratory of Biostatistics and Medical Informatics and IBILI - Faculty of Medicine, University of Coimbra, Azinhaga Santa Comba, Celas, 3000-548, Coimbra, Portugal
| |
Collapse
|
41
|
Wei TY, Yen TH, Cheng CM. Point-of-care testing in the early diagnosis of acute pesticide intoxication: The example of paraquat. BIOMICROFLUIDICS 2018; 12:011501. [PMID: 29430271 PMCID: PMC5775096 DOI: 10.1063/1.5003848] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/07/2017] [Accepted: 01/04/2018] [Indexed: 05/09/2023]
Abstract
Acute pesticide intoxication is a common method of suicide globally. This article reviews current diagnostic methods and makes suggestions for future development. In the case of paraquat intoxication, it is characterized by multi-organ failure, causing substantial mortality and morbidity. Early diagnosis may save the life of a paraquat intoxication patient. Conventional paraquat intoxication diagnostic methods, such as symptom review and urine sodium dithionite assay, are time-consuming and impractical in resource-scarce areas where most intoxication cases occur. Several experimental and clinical studies have shown the potential of portable Surface Enhanced Raman Scattering (SERS), paper-based devices, and machine learning for paraquat intoxication diagnosis. Portable SERS and new SERS substrates maintain the sensitivity of SERS while being less costly and more convenient than conventional SERS. Paper-based devices provide the advantages of price and portability. Machine learning algorithms can be implemented as a mobile phone application and facilitate diagnosis in resource-limited areas. Although these methods have not yet met all features of an ideal diagnostic method, the combination and development of these methods offer much promise.
Collapse
Affiliation(s)
- Ting-Yen Wei
- Interdisciplinary Program of Life Science, National Tsing Hua University, Hsinchu 300, Taiwan
| | - Tzung-Hai Yen
- Department of Nephrology, Clinical Poison Center, Kidney Research Center, Center for Tissue Engineering, Chang Gung Memorial Hospital and Chang Gung University, Linkou 333, Taiwan
| | - Chao-Min Cheng
- Institute of Biomedical Engineering, National Tsing Hua University, Hsinchu 300, Taiwan
| |
Collapse
|
42
|
Jiang H, Ching WK, Cheung WS, Hou W, Yin H. Hadamard Kernel SVM with applications for breast cancer outcome predictions. BMC SYSTEMS BIOLOGY 2017; 11:138. [PMID: 29322919 PMCID: PMC5763304 DOI: 10.1186/s12918-017-0514-1] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
BACKGROUND Breast cancer is one of the leading causes of deaths for women. It is of great necessity to develop effective methods for breast cancer detection and diagnosis. Recent studies have focused on gene-based signatures for outcome predictions. Kernel SVM for its discriminative power in dealing with small sample pattern recognition problems has attracted a lot attention. But how to select or construct an appropriate kernel for a specified problem still needs further investigation. RESULTS Here we propose a novel kernel (Hadamard Kernel) in conjunction with Support Vector Machines (SVMs) to address the problem of breast cancer outcome prediction using gene expression data. Hadamard Kernel outperform the classical kernels and correlation kernel in terms of Area under the ROC Curve (AUC) values where a number of real-world data sets are adopted to test the performance of different methods. CONCLUSIONS Hadamard Kernel SVM is effective for breast cancer predictions, either in terms of prognosis or diagnosis. It may benefit patients by guiding therapeutic options. Apart from that, it would be a valuable addition to the current SVM kernel families. We hope it will contribute to the wider biology and related communities.
Collapse
Affiliation(s)
- Hao Jiang
- Department of Mathematics, School of Information, Renmin University of China, No.59 Zhong Guan Cun Avenue, Hai Dian District, Beijing, 100872, China
| | - Wai-Ki Ching
- Department of Mathematics, The University of Hong Kong, Pokfulam Road, Hong Kong, Hong Kong
| | - Wai-Shun Cheung
- Department of Mathematics, The University of Hong Kong, Pokfulam Road, Hong Kong, Hong Kong
| | - Wenpin Hou
- Department of Mathematics, The University of Hong Kong, Pokfulam Road, Hong Kong, Hong Kong
| | - Hong Yin
- Department of Mathematics, School of Information, Renmin University of China, No.59 Zhong Guan Cun Avenue, Hai Dian District, Beijing, 100872, China.
| |
Collapse
|
43
|
Yang L, Xu Z. Feature extraction by PCA and diagnosis of breast tumors using SVM with DE-based parameter tuning. INT J MACH LEARN CYB 2017. [DOI: 10.1007/s13042-017-0741-1] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
44
|
A multi-verse optimizer approach for feature selection and optimizing SVM parameters based on a robust system architecture. Neural Comput Appl 2017. [DOI: 10.1007/s00521-016-2818-2] [Citation(s) in RCA: 131] [Impact Index Per Article: 18.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
45
|
Hu L, Lin F, Li H, Tong C, Pan Z, Li J, Chen H. An intelligent prognostic system for analyzing patients with paraquat poisoning using arterial blood gas indexes. J Pharmacol Toxicol Methods 2016; 84:78-85. [PMID: 27884773 DOI: 10.1016/j.vascn.2016.11.004] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2016] [Revised: 10/26/2016] [Accepted: 11/19/2016] [Indexed: 10/20/2022]
Abstract
The arterial blood gas (ABG) test is used to assess gas exchange in the lung, and the acid-base level in the blood. However, it is still unclear whether or not ABG test indexes correlate with paraquat (PQ) poisoning. This study investigates the predictive value of ABG tests in prognosing patients with PQ poisoning; it also identifies the most significant indexes of the ABG test. An intelligent machine learning-based system was established to effectively give prognostic analysis of patients with PQ poisoning based on ABG indexes. In the proposed system, an enhanced support vector machine combined with a feature selection strategy was developed to predict the risk status from a pool of 103 patients (56 males and 47 females); of these, 52 subjects were deceased and 51 patients were alive. The proposed method was rigorously evaluated against the real-life dataset in terms of accuracy, sensitivity, and specificity. Additionally, the feature selection was investigated to identify correlating factors for the risk status. The results demonstrated that there were significant differences in ABG indexes between deceased and alive subjects (p-value <0.01). According to the feature selection, we found that the most important correlated indexes were associated with partial pressure of carbon dioxide (PCO2). This study discovered the relationship between ABG test and poisoning degree to provide a new avenue for prognosing PQ poisoning.
Collapse
Affiliation(s)
- Lufeng Hu
- Department of Pharmacy, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou 325000, China
| | - Feiyan Lin
- Centre Laboratory, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou 325000, China
| | - Huaizhong Li
- Department of Computing, Lishui University, Lishui 323000, Zhejiang, China
| | - Changfei Tong
- College of Physics and Electronic Information Engineering, Wenzhou University, Wenzhou 325035, China
| | - Zhifang Pan
- Information Technology Center, Wenzhou Medical University, 325035, China
| | - Jun Li
- College of Physics and Electronic Information Engineering, Wenzhou University, Wenzhou 325035, China
| | - Huiling Chen
- College of Physics and Electronic Information Engineering, Wenzhou University, Wenzhou 325035, China.
| |
Collapse
|
46
|
A Novel Hybrid Feature Selection Model for Classification of Neuromuscular Dystrophies Using Bhattacharyya Coefficient, Genetic Algorithm and Radial Basis Function Based Support Vector Machine. Interdiscip Sci 2016; 10:244-250. [PMID: 27637476 DOI: 10.1007/s12539-016-0183-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2016] [Revised: 08/07/2016] [Accepted: 08/30/2016] [Indexed: 10/21/2022]
Abstract
An accurate classification of neuromuscular disorders is important in providing proper treatment facilities to the patients. Recently, the microarray technology is employed to monitor the level of activity or expression of large number of genes simultaneously. The gene expression data derived from the microarray experiment usually involve a large number of genes but a very few number of samples. There is a need to reduce the dimension of gene expression data which intends to find a small set of discriminative genes that accurately classifies the samples of various kinds of diseases. So, our goal is to find a small subset of genes which ensures the accurate classification of neuromuscular disorders. In the present paper, we propose a novel hybrid feature selection model for classification of neuromuscular disorders. The process of feature selection is done in two phases by integrating Bhattacharyya coefficient and genetic algorithm (GA). In the first phase, we find Bhattacharyya coefficient to choose a candidate gene subset by removing the most redundant genes. In the second phase, the target gene subset is created by selecting the most discriminative gene subset by applying GA wherein the fitness function is calculated using radial basis function support vector machine (RBF SVM). The proposed hybrid algorithm is applied on two publicly available microarray neuromuscular disorders datasets. The results are compared with two individual techniques of feature selection, namely Bhattacharyya coefficient and GA, and one integrated technique, i.e., Bhattacharyya-GA wherein the fitness function of GA is calculated using four other classifiers, which shows that the proposed integrated method is capable of giving the better classification accuracy.
Collapse
|
47
|
Fusco R, Sansone M, Filice S, Carone G, Amato DM, Sansone C, Petrillo A. Pattern Recognition Approaches for Breast Cancer DCE-MRI Classification: A Systematic Review. J Med Biol Eng 2016; 36:449-459. [PMID: 27656117 PMCID: PMC5016558 DOI: 10.1007/s40846-016-0163-7] [Citation(s) in RCA: 53] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2015] [Accepted: 03/29/2016] [Indexed: 11/26/2022]
Abstract
We performed a systematic review of several pattern analysis approaches for classifying breast lesions using dynamic, morphological, and textural features in dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI). Several machine learning approaches, namely artificial neural networks (ANN), support vector machines (SVM), linear discriminant analysis (LDA), tree-based classifiers (TC), and Bayesian classifiers (BC), and features used for classification are described. The findings of a systematic review of 26 studies are presented. The sensitivity and specificity are respectively 91 and 83 % for ANN, 85 and 82 % for SVM, 96 and 85 % for LDA, 92 and 87 % for TC, and 82 and 85 % for BC. The sensitivity and specificity are respectively 82 and 74 % for dynamic features, 93 and 60 % for morphological features, 88 and 81 % for textural features, 95 and 86 % for a combination of dynamic and morphological features, and 88 and 84 % for a combination of dynamic, morphological, and other features. LDA and TC have the best performance. A combination of dynamic and morphological features gives the best performance.
Collapse
Affiliation(s)
- Roberta Fusco
- Department of Diagnostic Imaging, metabolic and radiant Therapy, National Cancer Institute of Naples “Pascale Foundation”, Via Mariano Semmola 80131, Naples, Italy
- Department of Electrical Engineering and Information Technologies, University ‘Federico II’, Via Claudio 80125, Naples, Italy
| | - Mario Sansone
- Department of Electrical Engineering and Information Technologies, University ‘Federico II’, Via Claudio 80125, Naples, Italy
| | - Salvatore Filice
- Department of Diagnostic Imaging, metabolic and radiant Therapy, National Cancer Institute of Naples “Pascale Foundation”, Via Mariano Semmola 80131, Naples, Italy
| | - Guglielmo Carone
- Department of Diagnostic Imaging, metabolic and radiant Therapy, National Cancer Institute of Naples “Pascale Foundation”, Via Mariano Semmola 80131, Naples, Italy
| | - Daniela Maria Amato
- Department of Diagnostic Imaging, metabolic and radiant Therapy, National Cancer Institute of Naples “Pascale Foundation”, Via Mariano Semmola 80131, Naples, Italy
| | - Carlo Sansone
- Department of Electrical Engineering and Information Technologies, University ‘Federico II’, Via Claudio 80125, Naples, Italy
| | - Antonella Petrillo
- Department of Diagnostic Imaging, metabolic and radiant Therapy, National Cancer Institute of Naples “Pascale Foundation”, Via Mariano Semmola 80131, Naples, Italy
| |
Collapse
|
48
|
Chen H, Hu L, Li H, Hong G, Zhang T, Ma J, Lu Z. An Effective Machine Learning Approach for Prognosis of Paraquat Poisoning Patients Using Blood Routine Indexes. Basic Clin Pharmacol Toxicol 2016; 120:86-96. [PMID: 27390221 DOI: 10.1111/bcpt.12638] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2016] [Accepted: 06/30/2016] [Indexed: 12/18/2022]
Abstract
The early identification of toxic paraquat (PQ) poisoning in patients is critical to ensure timely and accurate prognosis. Although plasma PQ concentration has been reported as a clinical indicator of PQ poisoning, it is not commonly applied in practice due to the inconvenient necessary instruments and operation. In this study, we explored the use of blood routine indexes to identify the degree of PQ toxicity and/or diagnose PQ poisoning in patients via machine learning approach. Specifically, we developed a method based on support vector machine combined with the feature selection technique to accurately predict PQ poisoning risk status, then tested the method on 79 (42 male and 37 female; 41 living and 38 deceased) patients. The detection method was rigorously evaluated against a real-world data set to determine its accuracy, sensitivity and specificity. Feature selection was also applied to identify the factors correlated with risk status, and the results showed that there are significant differences in blood routine indexes between dead and living PQ-poisoned individuals (p-value < 0.01). Feature selection also showed that the most important correlated indexes are white blood cell and neutrophils. In conclusion, the toxicity or prognosis of PQ poisoning can be preliminarily ascertained by blood routine testing without PQ concentration data, representing an additional tool and innovative approach to assess the prognosis of PQ poisoning.
Collapse
Affiliation(s)
- Huiling Chen
- College of Physics and Electronic Information Engineering, Wenzhou University, Wenzhou, China
| | - Lufeng Hu
- Department of Pharmacy, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, China
| | - Huaizhong Li
- Department of Computing, Lishui University, Lishui, 323000, Zhejiang, P. R. China
| | - Guangliang Hong
- Department of Emergency, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, China
| | - Tao Zhang
- Department of Emergency, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, China.,Department of Intensive Care Unit, Lishui Central Hospital, Lishui, China
| | - Jianshe Ma
- Analytical and Testing Center, Wenzhou Medical University, Wenzhou, China
| | - Zhongqiu Lu
- Department of Emergency, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, China
| |
Collapse
|
49
|
Sheikhpour R, Sarram MA, Sheikhpour R. Particle swarm optimization for bandwidth determination and feature selection of kernel density estimation based classifiers in diagnosis of breast cancer. Appl Soft Comput 2016. [DOI: 10.1016/j.asoc.2015.10.005] [Citation(s) in RCA: 110] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|
50
|
Shen L, Chen H, Yu Z, Kang W, Zhang B, Li H, Yang B, Liu D. Evolving support vector machines using fruit fly optimization for medical data classification. Knowl Based Syst 2016. [DOI: 10.1016/j.knosys.2016.01.002] [Citation(s) in RCA: 287] [Impact Index Per Article: 35.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|