1
|
Yang J, Yee PL, Khan AA, Karamti H, Eldin ET, Aldweesh A, Jery AE, Hussain L, Omar A. Intelligent lung cancer MRI prediction analysis based on cluster prominence and posterior probabilities utilizing intelligent Bayesian methods on extracted gray-level co-occurrence (GLCM) features. Digit Health 2023; 9:20552076231172632. [PMID: 37256015 PMCID: PMC10226179 DOI: 10.1177/20552076231172632] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2023] [Accepted: 04/12/2023] [Indexed: 06/01/2023] Open
Abstract
Lung cancer is the second foremost cause of cancer due to which millions of deaths occur worldwide. Developing automated tools is still a challenging task to improve the prediction. This study is specifically conducted for detailed posterior probabilities analysis to unfold the network associations among the gray-level co-occurrence matrix (GLCM) features. We then ranked the features based on t-test. The Cluster Prominence is selected as target node. The association and arc analysis were determined based on mutual information. The occurrence and reliability of selected cluster states were computed. The Cluster Prominence at state ≤330.85 yielded ROC index of 100%, relative Gini index of 99.98%, and relative Gini index of 100%. The proposed method further unfolds the dynamics and to detailed analysis of computed features based on GLCM features for better understanding of the hidden dynamics for proper diagnosis and prognosis of lung cancer.
Collapse
Affiliation(s)
- Jing Yang
- Faculty of Computer Science and
Information Technology, Universiti Malaya, Kuala Lumpur, Malaysia
| | - Por Lip Yee
- Faculty of Computer Science and
Information Technology, Universiti Malaya, Kuala Lumpur, Malaysia
| | - Abdullah Ayub Khan
- Department of Computer Science and
Information Technology, Benazir Bhutto Shaheed University Lyari, Karachi,
Pakistan
| | - Hanen Karamti
- Department of Computer Sciences,
College of Computer and Information Sciences, Princess Nourah bint Abdulrahman
University, Riyadh, Saudi Arabia
| | - Elsayed Tag Eldin
- Faculty of Engineering and Technology, Future University in Egypt, New Cairo, Cairo, Egypt
| | - Amjad Aldweesh
- College of Computer Science and
Information Technology, Shaqra University, Shaqra, Saudi Arabia
| | - Atef El Jery
- Department of Chemical Engineering,
College of Engineering, King Khalid University, Abha, Saudi Arabia
- National Engineering School of Gabes,
Gabes University, Zrig Gabes, Tunisia
| | - Lal Hussain
- Department of Computer Science and
Information Technology, King Abdullah Campus Chatter Kalas, University of Azad Jammu
and Kashmir, Muzaffarabad, Azad Kashmir, Pakistan
- Department of Computer Science and
Information Technology, University of Azad Jammu and Kashmir, Athmuqam, Azad
Kashmir, Pakistan
| | - Abdulfattah Omar
- Department of English, College of
Science & Humanities, Prince Sattam Bin Abdulaziz
University, Al-Kharj, Saudi Arabia
| |
Collapse
|
2
|
Liu M, Wu J, Wang N, Zhang X, Bai Y, Guo J, Zhang L, Liu S, Tao K. The value of artificial intelligence in the diagnosis of lung cancer: A systematic review and meta-analysis. PLoS One 2023; 18:e0273445. [PMID: 36952523 PMCID: PMC10035910 DOI: 10.1371/journal.pone.0273445] [Citation(s) in RCA: 12] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2022] [Accepted: 02/03/2023] [Indexed: 03/25/2023] Open
Abstract
Lung cancer is a common malignant tumor disease with high clinical disability and death rates. Currently, lung cancer diagnosis mainly relies on manual pathology section analysis, but the low efficiency and subjective nature of manual film reading can lead to certain misdiagnoses and omissions. With the continuous development of science and technology, artificial intelligence (AI) has been gradually applied to imaging diagnosis. Although there are reports on AI-assisted lung cancer diagnosis, there are still problems such as small sample size and untimely data updates. Therefore, in this study, a large amount of recent data was included, and meta-analysis was used to evaluate the value of AI for lung cancer diagnosis. With the help of STATA16.0, the value of AI-assisted lung cancer diagnosis was assessed by specificity, sensitivity, negative likelihood ratio, positive likelihood ratio, diagnostic ratio, and plotting the working characteristic curves of subjects. Meta-regression and subgroup analysis were used to investigate the value of AI-assisted lung cancer diagnosis. The results of the meta-analysis showed that the combined sensitivity of the AI-aided diagnosis system for lung cancer diagnosis was 0.87 [95% CI (0.82, 0.90)], specificity was 0.87 [95% CI (0.82, 0.91)] (CI stands for confidence interval.), the missed diagnosis rate was 13%, the misdiagnosis rate was 13%, the positive likelihood ratio was 6.5 [95% CI (4.6, 9.3)], the negative likelihood ratio was 0.15 [95% CI (0.11, 0.21)], a diagnostic ratio of 43 [95% CI (24, 76)] and a sum of area under the combined subject operating characteristic (SROC) curve of 0.93 [95% CI (0.91, 0.95)]. Based on the results, the AI-assisted diagnostic system for CT (Computerized Tomography), imaging has considerable diagnostic accuracy for lung cancer diagnosis, which is of significant value for lung cancer diagnosis and has greater feasibility of realizing the extension application in the field of clinical diagnosis.
Collapse
Affiliation(s)
- Mingsi Liu
- Department of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou, Henan, China
| | - Jinghui Wu
- College of Life Science, Sichuan University, Chengdu, Sichuan, China
| | - Nian Wang
- School of Basic Medical Sciences, Chengdu Medical College, Chengdu, Sichuan, China
| | - Xianqin Zhang
- School of Basic Medical Sciences, Chengdu Medical College, Chengdu, Sichuan, China
| | - Yujiao Bai
- School of Basic Medical Sciences, Chengdu Medical College, Chengdu, Sichuan, China
- Non-Coding RNA and Drug Discovery Key Laboratory of Sichuan Province, Chengdu Medical College, Chengdu, Sichuan, China
| | - Jinlin Guo
- Chongqing Key Laboratory of Sichuan-Chongqing Co-construction for Diagnosis and Treatment of Infectious Diseases Integrated Traditional Chinese and Western Medicine, Chengdu University of Traditional Chinese Medicine, Chengdu, China
| | - Lin Zhang
- Department of Pharmacy, Shaoxing people's Hospital, Shaoxing, Zhejiang, China
| | - Shulin Liu
- Department of the First Affiliated Hospital of Chengdu Medical College, Sichuan, China
| | - Ke Tao
- College of Life Science, Sichuan University, Chengdu, Sichuan, China
| |
Collapse
|
3
|
Liu M, Zhou Z, Liu F, Wang M, Wang Y, Gao M, Sun H, Zhang X, Yang T, Ji L, Li J, Si Q, Dai L, Ouyang S. CT and CEA-based machine learning model for predicting malignant pulmonary nodules. Cancer Sci 2022; 113:4363-4373. [PMID: 36056603 DOI: 10.1111/cas.15561] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2022] [Revised: 08/22/2022] [Accepted: 08/25/2022] [Indexed: 12/15/2022] Open
Abstract
Computed tomography (CT), an efficient radiological technology, is used to detect lung cancer in the clinic. Carcinoembryonic antigen (CEA), a common tumor biomarker, is applied in the detection of various tumors. To highlight the advantages of two-dimensional techniques and assist clinicians in optimizing lung cancer diagnostic schemes, we established a favorable model combining CT and CEA. In the study, univariate analysis was performed to screen independent predictors in a training cohort of 271 patients with malignant pulmonary nodules (MPNs) and 92 with benign pulmonary nodules (BPNs). Six machine learning-based models involving five CT predictors (mediastinal lymph node enlargement, lobulation, vascular notch sign, spiculation, and nodule number) and lnCEA were constructed and validated in an independent cohort of 129 participants (92 MPNs and 37 BPNs) by SPSS Modeler. A nomogram and the Delong test were generated by R software. Finally, the model established by logistic regression had highest diagnostic efficiency (area under the curve [AUC] = 0.912). Moreover, the diagnostic ability of the logistic model in the validation cohort (AUC = 0.882, 80.4% sensitivity, 75.7% specificity) was higher than that of the Peking University model (AUC = 0.712, 68.5% sensitivity, 70.3% specificity) and the Mayo model (AUC = 0.745, 62.0% sensitivity, 75.7% specificity). Interestingly, for the participants with intermediate (10-30 mm) and CEA-negative nodule, the model reached an AUC of 0.835 (72.3% sensitivity, 83.3% specificity). The AUC for the early lung cancer was as high as 0.822 with 67.3% sensitivity and 78.9% specificity. As a conclusion, this promising model presents a new diagnostic strategy for the clinic to distinguish MPNs from BPNs.
Collapse
Affiliation(s)
- Man Liu
- Department of Respiratory and Sleep Medicine, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, China.,Henan Institute of Medical and Pharmaceutical Sciences & Henan Key Medical Laboratory of Tumor Molecular Biomarkers, Zhengzhou University, Zhengzhou, China
| | - Zhigang Zhou
- Department of Radiology, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, China
| | - Fenghui Liu
- Department of Respiratory and Sleep Medicine, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, China
| | - Meng Wang
- Department of Radiology, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, China
| | - Yulin Wang
- Henan Institute of Medical and Pharmaceutical Sciences & Henan Key Medical Laboratory of Tumor Molecular Biomarkers, Zhengzhou University, Zhengzhou, China
| | - Mengyu Gao
- Department of Radiology, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, China
| | - Huifang Sun
- Department of Radiology, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, China
| | - Xue Zhang
- Henan Institute of Medical and Pharmaceutical Sciences & Henan Key Medical Laboratory of Tumor Molecular Biomarkers, Zhengzhou University, Zhengzhou, China
| | - Ting Yang
- Henan Institute of Medical and Pharmaceutical Sciences & Henan Key Medical Laboratory of Tumor Molecular Biomarkers, Zhengzhou University, Zhengzhou, China.,BGI College, Zhengzhou University, Zhengzhou, China
| | - Longtao Ji
- Henan Institute of Medical and Pharmaceutical Sciences & Henan Key Medical Laboratory of Tumor Molecular Biomarkers, Zhengzhou University, Zhengzhou, China.,BGI College, Zhengzhou University, Zhengzhou, China
| | - Jiaqi Li
- Henan Institute of Medical and Pharmaceutical Sciences & Henan Key Medical Laboratory of Tumor Molecular Biomarkers, Zhengzhou University, Zhengzhou, China
| | - Qiufang Si
- Henan Institute of Medical and Pharmaceutical Sciences & Henan Key Medical Laboratory of Tumor Molecular Biomarkers, Zhengzhou University, Zhengzhou, China.,BGI College, Zhengzhou University, Zhengzhou, China
| | - Liping Dai
- Henan Institute of Medical and Pharmaceutical Sciences & Henan Key Medical Laboratory of Tumor Molecular Biomarkers, Zhengzhou University, Zhengzhou, China.,BGI College, Zhengzhou University, Zhengzhou, China
| | - Songyun Ouyang
- Department of Respiratory and Sleep Medicine, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, China
| |
Collapse
|
4
|
An Automatic Random Walker Algorithm for Segmentation of Ground Glass Opacity Pulmonary Nodules. JOURNAL OF HEALTHCARE ENGINEERING 2022; 2022:6727957. [PMID: 36212245 PMCID: PMC9537033 DOI: 10.1155/2022/6727957] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/11/2021] [Revised: 07/02/2021] [Accepted: 01/06/2022] [Indexed: 11/24/2022]
Abstract
Automatic and accurate segmentation of ground glass opacity (GGO) nodules still remains challenging due to inhomogeneous interiors, irregular shapes, and blurred boundaries from different patients. Despite successful applications in the image processing domains, the random walk has some limitations for segmentation of GGO pulmonary nodules. In this paper, an improved random walker method is proposed for the segmentation of GGO nodules. To calculate a new affinity matrix, intensity, spatial, and texture features are incorporated. It strengthens discriminative power between two adjacent nodes on the graph. To address the problem of robustness in seed acquisition, the geodesic distance is introduced and a novel local search strategy is presented to automatically acquire reliable seeds. For segmentation, a label constraint term is introduced to the energy function of original random walker, which alleviates the accumulation of errors caused by the initial seeds acquisition. Massive experiments conducted on Lung Images Dataset Consortium (LIDC) demonstrate that the proposed method achieves visually satisfactory results without user interactions. Both qualitative and quantitative evaluations also demonstrate that the proposed method obtains better performance compared with conventional random walker method and state-of-the-art segmentation methods in terms of the overlap score and F-measure.
Collapse
|
5
|
Tuberculosis Disease Diagnosis Based on an Optimized Machine Learning Model. JOURNAL OF HEALTHCARE ENGINEERING 2022; 2022:8950243. [PMID: 35494520 PMCID: PMC9041161 DOI: 10.1155/2022/8950243] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/25/2022] [Revised: 02/21/2022] [Accepted: 03/01/2022] [Indexed: 11/30/2022]
Abstract
Computer science plays an important role in modern dynamic health systems. Given the collaborative nature of the diagnostic process, computer technology provides important services to healthcare professionals and organizations, as well as to patients and their families, researchers, and decision-makers. Thus, any innovations that improve the diagnostic process while maintaining quality and safety are crucial to the development of the healthcare field. Many diseases can be tentatively diagnosed during their initial stages. In this study, all developed techniques were applied to tuberculosis (TB). Thus, we propose an optimized machine learning-based model that extracts optimal texture features from TB-related images and selects the hyper-parameters of the classifiers. Increasing the accuracy rate and minimizing the number of characteristics extracted are our goals. In other words, this is a multitask optimization issue. A genetic algorithm (GA) is used to choose the best features, which are then fed into a support vector machine (SVM) classifier. Using the ImageCLEF 2020 data set, we conducted experiments using the proposed approach and achieved significantly higher accuracy and better outcomes in comparison with the state-of-the-art works. The obtained experimental results highlight the efficiency of modified SVM classifier compared with other standard ones.
Collapse
|
6
|
Zhang Y, Meng L. Study on Identification Method of Pulmonary Nodules: Improved Random Walk Pulmonary Parenchyma Segmentation and Fusion Multi-Feature VGG16 Nodule Classification. Front Oncol 2022; 12:822827. [PMID: 35371983 PMCID: PMC8966585 DOI: 10.3389/fonc.2022.822827] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2021] [Accepted: 02/18/2022] [Indexed: 11/13/2022] Open
Abstract
Purpose The purpose of this study was to realize automatic segmentation of lung parenchyma based on random walk algorithm to ensure the accuracy of lung parenchyma segmentation. The explicable features of pulmonary nodules were added into VGG16 neural network to improve the classification accuracy of pulmonary nodules. Materials and Methods LIDC-IDRI, a public dataset containing lung Computed Tomography images/pulmonary nodules, was used as experimental data. In lung parenchyma segmentation, the maximum Between-Class Variance method (OTSU), corrosion and expansion methods were used to automatically obtain the foreground and background seed points of random walk algorithm in lung parenchyma region. The shortest distance between point sets was added as one of the criteria of prospect probability in the calculation of random walk weight function to achieve accurate segmentation of pulmonary parenchyma. According to the location of the nodules marked by the doctor, the nodules were extracted. The texture features and grayscale features were extracted by Volume Local Direction Ternary Pattern (VLDTP) method and gray histogram. The explicable features were input into VGG16 network in series mode and fused with depth features to achieve accurate classification of nodules. Intersection of Union (IOU) and false positive rate (FPR) were used to measure the segmentation results. Accuracy, Sensitivity, Specificity, Accuracy and F1 score were used to evaluate the results of nodule classification. Results The automatic random walk algorithm is effective in lung parenchyma segmentation, and its segmentation efficiency is improved obviously. In VGG16 network, the accuracy of nodular classification is 0.045 higher than that of single depth feature classification. Conclusion The method proposed in this paper can effectively and accurately achieve automatic segmentation of lung parenchyma. In addition, the fusion of multi-feature VGG16 network is effective in the classification of pulmonary nodules, which can improve the accuracy of nodular classification.
Collapse
Affiliation(s)
- Yanrong Zhang
- Heilongjiang Key Laboratory of Electronic Commerce and Information Processing, Computer and Information Engineering College, Harbin University of Commerce, Harbin, China
| | - Lingyue Meng
- Heilongjiang Key Laboratory of Electronic Commerce and Information Processing, Computer and Information Engineering College, Harbin University of Commerce, Harbin, China
| |
Collapse
|
7
|
Improved cost-sensitive multikernel learning support vector machine algorithm based on particle swarm optimization in pulmonary nodule recognition. Soft comput 2022. [DOI: 10.1007/s00500-021-06718-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
AbstractIn the lung computer-aided detection (Lung CAD) system, the region of interest (ROI) of lung nodules has more false positives, making the imbalance between positive and negative (true positive and false positive) samples more likely to lead to misclassification of true positive nodules, a cost-sensitive multikernel learning support vector machine (CS-MKL-SVM) algorithm is proposed. Different penalty coefficients are assigned to positive and negative samples, so that the model can better learn the features of true positive nodules and improve the classification effect. To further improve the detection rate of pulmonary nodules and overall recognition accuracy, a score function named F-new based on the harmonic mean of accuracy (ACC) and sensitivity (SEN) is proposed as a fitness function for subsequent particle swarm optimization (PSO) parameter optimization, and a feasibility analysis of this function is performed. Compared with the fitness function that considers only accuracy or sensitivity, both the detection rate and the recognition accuracy of pulmonary nodules can be improved by this new algorithm. Compared with the grid search algorithm, using PSO for parameter search can reduce the model training time by nearly 20 times and achieve rapid parameter optimization. The maximum F-new obtained on the test set is 0.9357 for the proposed algorithm. When the maximum value of F-new is achieved, the corresponding recognition ACC is 91%, and SEN is 96.3%. Compared with the radial basis function in the single kernel, the F-new of the algorithm in this paper is 2.16% higher, ACC is 1.00% higher and SEN is equal. Compared with the polynomial kernel function in the single kernel, the F-new of the algorithm is 3.64% higher, ACC is 1.00% higher and SEN is 7.41% higher. The experimental results show that the F-new, ACC and SEN of the proposed algorithm is the best among them, and the results obtained by using multikernel function combined with F-new index are better than the single kernel function. Compared with the MKL-SVM algorithm of grid search, the ACC of the algorithm in this paper is reduced by 1%, and the results are equal to those of the MKL-SVM algorithm based on PSO only. Compared with the above two algorithms, SEN is increased by 3.71% and 7.41%, respectively. Therefore, it can be seen that the cost sensitive method can effectively reduce the missed detection of nodules, and the availability of the new algorithm can be further verified.
Collapse
|
8
|
Kong L, Cheng J. Based on improved deep convolutional neural network model pneumonia image classification. PLoS One 2021; 16:e0258804. [PMID: 34735483 PMCID: PMC8568342 DOI: 10.1371/journal.pone.0258804] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2021] [Accepted: 10/06/2021] [Indexed: 12/03/2022] Open
Abstract
Pneumonia remains the leading infectious cause of death in children under the age of five, killing about 700,000 children each year and affecting 7% of the world's population. X-ray images of lung become the key to the diagnosis of this disease, skilled doctors in the diagnosis of a certain degree of subjectivity, if the use of computer-aided medical diagnosis to automatically detect lung abnormalities, will improve the accuracy of diagnosis. This research aims to introduce a deep learning technology based on the combination of Xception neural network and long-term short-term memory (LSTM), which can realize automatic diagnosis of patients with pneumonia in X-ray images. First, the model uses the Xception network to extract the deep features of the data, passes the extracted features to the LSTM, and then the LSTM detects the extracted features, and finally selects the most needed features. Secondly, in the training set samples, the traditional cross-entropy loss cannot more balance the mismatch between categories. Therefore, this research combines Pearson's feature selection ideas, fusion of the correlation between the two loss functions, and optimizes the problem. The experimental results show that the accuracy rate of this paper is 96%, the receiver operator characteristic curve accuracy rate is 99%, the precision rate is 98%, the recall rate is 91%, and the F1 score accuracy rate is 94%. Compared with the existing technical methods, the research has achieved expected results on the currently available datasets. And assist doctors to provide higher reliability in the classification task of childhood pneumonia.
Collapse
Affiliation(s)
- Lingzhi Kong
- School of Computer Science and Technology, Qilu University of Technology (Shandong Academy of Sciences), Jinan, China
| | - Jinyong Cheng
- School of Computer Science and Technology, Qilu University of Technology (Shandong Academy of Sciences), Jinan, China
| |
Collapse
|
9
|
Zhang X, Liu X, Zhang B, Dong J, Zhang B, Zhao S, Li S. Accurate segmentation for different types of lung nodules on CT images using improved U-Net convolutional network. Medicine (Baltimore) 2021; 100:e27491. [PMID: 34622882 PMCID: PMC8500581 DOI: 10.1097/md.0000000000027491] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/27/2020] [Revised: 09/02/2021] [Accepted: 09/23/2021] [Indexed: 01/05/2023] Open
Abstract
ABSTRACT Since lung nodules on computed tomography images can have different shapes, contours, textures or locations and may be attached to neighboring blood vessels or pleural surfaces, accurate segmentation is still challenging. In this study, we propose an accurate segmentation method based on an improved U-Net convolutional network for different types of lung nodules on computed tomography images.The first phase is to segment lung parenchyma and correct the lung contour by applying α-hull algorithm. The second phase is to extract image pairs of patches containing lung nodules in the center and the corresponding ground truth and build an improved U-Net network with introduction of batch normalization.A large number of experiments manifest that segmentation performance of Dice loss has superior results than mean square error and Binary_crossentropy loss. The α-hull algorithm and batch normalization can improve the segmentation performance effectively. Our best result for Dice similar coefficient (0.8623) is also more competitive than other state-of-the-art segmentation algorithms.In order to segment different types of lung nodules accurately, we propose an improved U-Net network, which can improve the segmentation accuracy effectively. Moreover, this work also has practical value in helping radiologists segment lung nodules and diagnose lung cancer.
Collapse
|
10
|
Yeh MCH, Wang YH, Yang HC, Bai KJ, Wang HH, Li YCJ. Artificial Intelligence-Based Prediction of Lung Cancer Risk Using Nonimaging Electronic Medical Records: Deep Learning Approach. J Med Internet Res 2021; 23:e26256. [PMID: 34342588 PMCID: PMC8371476 DOI: 10.2196/26256] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2020] [Revised: 04/03/2021] [Accepted: 05/04/2021] [Indexed: 01/20/2023] Open
Abstract
Background Artificial intelligence approaches can integrate complex features and can be used to predict a patient’s risk of developing lung cancer, thereby decreasing the need for unnecessary and expensive diagnostic interventions. Objective The aim of this study was to use electronic medical records to prescreen patients who are at risk of developing lung cancer. Methods We randomly selected 2 million participants from the Taiwan National Health Insurance Research Database who received care between 1999 and 2013. We built a predictive lung cancer screening model with neural networks that were trained and validated using pre-2012 data, and we tested the model prospectively on post-2012 data. An age- and gender-matched subgroup that was 10 times larger than the original lung cancer group was used to assess the predictive power of the electronic medical record. Discrimination (area under the receiver operating characteristic curve [AUC]) and calibration analyses were performed. Results The analysis included 11,617 patients with lung cancer and 1,423,154 control patients. The model achieved AUCs of 0.90 for the overall population and 0.87 in patients ≥55 years of age. The AUC in the matched subgroup was 0.82. The positive predictive value was highest (14.3%) among people aged ≥55 years with a pre-existing history of lung disease. Conclusions Our model achieved excellent performance in predicting lung cancer within 1 year and has potential to be deployed for digital patient screening. Convolution neural networks facilitate the effective use of EMRs to identify individuals at high risk for developing lung cancer.
Collapse
Affiliation(s)
- Marvin Chia-Han Yeh
- Department of Dermatology, Wan Fang Hospital, Taipei Medical University, Taipei, Taiwan.,Research Center of Big Data and Meta-analysis, Wan Fang Hospital, Taipei Medical University, Taipei, Taiwan
| | - Yu-Hsiang Wang
- School of Medicine, Taipei Medical University, Taipei, Taiwan
| | - Hsuan-Chia Yang
- Graduate Institute of Biomedical Informatics, College of Medical Science and Technology, Taipei Medical University, Taipei, Taiwan.,International Center for Health Information Technology, Taipei Medical University, Taipei, Taiwan
| | - Kuan-Jen Bai
- Division of Pulmonary Medicine, Department of Internal Medicine, Wan Fang Hospital, Taipei Medical University, Taipei, Taiwan.,School of Respiratory Therapy, College of Medicine, Taipei Medical University, Taipei, Taiwan.,Pulmonary Research Center, Wan Fang Hospital, Taipei Medical University, Taipei, Taiwan
| | - Hsiao-Han Wang
- Department of Dermatology, Wan Fang Hospital, Taipei Medical University, Taipei, Taiwan.,Research Center of Big Data and Meta-analysis, Wan Fang Hospital, Taipei Medical University, Taipei, Taiwan.,Graduate Institute of Biomedical Informatics, College of Medical Science and Technology, Taipei Medical University, Taipei, Taiwan.,Department of Dermatology, School of Medicine, Taipei Medical University, Taipei, Taiwan
| | - Yu-Chuan Jack Li
- Department of Dermatology, Wan Fang Hospital, Taipei Medical University, Taipei, Taiwan.,Research Center of Big Data and Meta-analysis, Wan Fang Hospital, Taipei Medical University, Taipei, Taiwan.,Graduate Institute of Biomedical Informatics, College of Medical Science and Technology, Taipei Medical University, Taipei, Taiwan.,International Center for Health Information Technology, Taipei Medical University, Taipei, Taiwan.,Department of Dermatology, School of Medicine, Taipei Medical University, Taipei, Taiwan
| |
Collapse
|
11
|
Abbas A, Abdelsamea MM, Gaber MM. 4S-DT: Self-Supervised Super Sample Decomposition for Transfer Learning With Application to COVID-19 Detection. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2021; 32:2798-2808. [PMID: 34038371 PMCID: PMC8544943 DOI: 10.1109/tnnls.2021.3082015] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/15/2020] [Revised: 01/23/2021] [Accepted: 05/17/2021] [Indexed: 05/12/2023]
Abstract
Due to the high availability of large-scale annotated image datasets, knowledge transfer from pretrained models showed outstanding performance in medical image classification. However, building a robust image classification model for datasets with data irregularity or imbalanced classes can be a very challenging task, especially in the medical imaging domain. In this article, we propose a novel deep convolutional neural network, which we called self-supervised super sample decomposition for transfer learning (4S-DT) model. The 4S-DT encourages a coarse-to-fine transfer learning from large-scale image recognition tasks to a specific chest X-ray image classification task using a generic self-supervised sample decomposition approach. Our main contribution is a novel self-supervised learning mechanism guided by a super sample decomposition of unlabeled chest X-ray images. 4S-DT helps in improving the robustness of knowledge transformation via a downstream learning strategy with a class-decomposition (CD) layer to simplify the local structure of the data. The 4S-DT can deal with any irregularities in the image dataset by investigating its class boundaries using a downstream CD mechanism. We used 50000 unlabeled chest X-ray images to achieve our coarse-to-fine transfer learning with an application to COVID-19 detection, as an exemplar. The 4S-DT has achieved a high accuracy of 99.8% on the larger of the two datasets used in the experimental study and an accuracy of 97.54% on the smaller dataset, which was enriched by augmented images, out of which all real COVID-19 cases were detected.
Collapse
Affiliation(s)
- Asmaa Abbas
- Department of MathematicsUniversity of AssiutAsyut71515Egypt
| | - Mohammed M. Abdelsamea
- School of Computing and Digital TechnologyBirmingham City UniversityBirminghamB4 7APU.K.
- Department of Computer ScienceFaculty of Computers and InformationUniversity of AssiutAsyut71515Egypt
| | - Mohamed Medhat Gaber
- School of Computing and Digital TechnologyBirmingham City UniversityBirminghamB4 7APU.K.
| |
Collapse
|
12
|
Akter O, Moni MA, Islam MM, Quinn JMW, Kamal AHM. Lung cancer detection using enhanced segmentation accuracy. APPL INTELL 2021. [DOI: 10.1007/s10489-020-02046-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
|
13
|
P SAB, Annavarapu CSR. Deep learning-based improved snapshot ensemble technique for COVID-19 chest X-ray classification. APPL INTELL 2021; 51:3104-3120. [PMID: 34764590 PMCID: PMC7986181 DOI: 10.1007/s10489-021-02199-4] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/06/2021] [Indexed: 12/22/2022]
Abstract
COVID-19 has proven to be a deadly virus, and unfortunately, it triggered a worldwide pandemic. Its detection for further treatment poses a severe threat to researchers, scientists, health professionals, and administrators worldwide. One of the daunting tasks during the pandemic for doctors in radiology is the use of chest X-ray or CT images for COVID-19 diagnosis. Time is required to inspect each report manually. While a CT scan is the better standard, an X-ray is still useful because it is cheaper, faster, and more widely used. To diagnose COVID-19, this paper proposes to use a deep learning-based improved Snapshot Ensemble technique for efficient COVID-19 chest X-ray classification. In addition, the proposed method takes advantage of the transfer learning technique using the ResNet-50 model, which is a pre-trained model. The proposed model uses the publicly accessible COVID-19 chest X-ray dataset consisting of 2905 images, which include COVID-19, viral pneumonia, and normal chest X-ray images. For performance evaluation, the model applied the metrics such as AU-ROC, AU-PR, and Jaccard Index. Furthermore, it also obtained a multi-class micro-average of 97% specificity, 95% f 1-score, and 95% classification accuracy. The obtained results demonstrate that the performance of the proposed method outperformed those of several existing methods. This method appears to be a suitable and efficient approach for COVID-19 chest X-ray classification.
Collapse
Affiliation(s)
- Samson Anosh Babu P
- Department of Computer Science and Engineering, Indian Institute of Technology (ISM), Dhanbad, 826004 India
| | | |
Collapse
|
14
|
Heydari F, Rafsanjani MK. A Review on Lung Cancer Diagnosis Using Data Mining Algorithms. Curr Med Imaging 2021; 17:16-26. [PMID: 32586255 DOI: 10.2174/1573405616666200625153017] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2019] [Revised: 05/01/2020] [Accepted: 05/11/2020] [Indexed: 11/22/2022]
Abstract
Due to the serious consequences of lung cancer, medical associations use computer-aided diagnostic procedures to diagnose this disease more accurately. Despite the damaging effects of lung cancer on the body, the lifetime of cancer patients can be extended by early diagnosis. Data mining techniques are practical in diagnosing lung cancer in its first stages. This paper surveys a number of leading data mining-based cancer diagnosis approaches. Moreover, this review draws a comparison between data mining approaches in terms of selection criteria and presents the advantages and disadvantages of each method.
Collapse
Affiliation(s)
- Farzad Heydari
- Department of Computer Science, Faculty of Mathematics and Computer, Shahid Bahonar University of Kerman, Kerman, Iran
| | - Marjan Kuchaki Rafsanjani
- Department of Computer Science, Faculty of Mathematics and Computer, Shahid Bahonar University of Kerman, Kerman, Iran
| |
Collapse
|
15
|
Ziyad SR, Radha V, Vayyapuri T. Overview of Computer Aided Detection and Computer Aided Diagnosis Systems for Lung Nodule Detection in Computed Tomography. Curr Med Imaging 2020; 16:16-26. [PMID: 31989890 DOI: 10.2174/1573405615666190206153321] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2018] [Revised: 01/02/2019] [Accepted: 01/10/2019] [Indexed: 11/22/2022]
Abstract
BACKGROUND Lung cancer has become a major cause of cancer-related deaths. Detection of potentially malignant lung nodules is essential for the early diagnosis and clinical management of lung cancer. In clinical practice, the interpretation of Computed Tomography (CT) images is challenging for radiologists due to a large number of cases. There is a high rate of false positives in the manual findings. Computer aided detection system (CAD) and computer aided diagnosis systems (CADx) enhance the radiologists in accurately delineating the lung nodules. OBJECTIVES The objective is to analyze CAD and CADx systems for lung nodule detection. It is necessary to review the various techniques followed in CAD and CADx systems proposed and implemented by various research persons. This study aims at analyzing the recent application of various concepts in computer science to each stage of CAD and CADx. METHODS This review paper is special in its own kind because it analyses the various techniques proposed by different eminent researchers in noise removal, contrast enhancement, thorax removal, lung segmentation, bone suppression, segmentation of trachea, classification of nodule and nonnodule and final classification of benign and malignant nodules. RESULTS A comparison of the performance of different techniques implemented by various researchers for the classification of nodule and non-nodule has been tabulated in the paper. CONCLUSION The findings of this review paper will definitely prove to be useful to the research community working on automation of lung nodule detection.
Collapse
Affiliation(s)
- Shabana Rasheed Ziyad
- Department of Computer Science, College of Computer Engineering and Sciences, Prince Sattam Bin AbdulAziz University, Al Kharj, Saudi Arabia
| | - Venkatachalam Radha
- Department of Computer Science, Avinashilingam Institute for Home Science and Higher Education for Women, Coimbatore, India
| | - Thavavel Vayyapuri
- Department of Computer Science, College of Computer Engineering and Sciences, Prince Sattam Bin AbdulAziz University, Al Kharj, Saudi Arabia
| |
Collapse
|
16
|
Hybrid bio-inspired algorithm and convolutional neural network for automatic lung tumor detection. Neural Comput Appl 2020. [DOI: 10.1007/s00521-020-05362-z] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
AbstractIn this paper, we have proposed a hybrid bio-inspired algorithm which takes the merits of whale optimization algorithm (WOA) and adaptive particle swarm optimization (APSO). The proposed algorithm is referred as the hybrid WOA_APSO algorithm. We utilize a convolutional neural network (CNN) for classification purposes. Extensive experiments are performed to evaluate the performance of the proposed model. Here, pre-processing and segmentation are performed on 120 lung CT images for obtaining the segmented tumored and non-tumored region nodule. The statistical, texture, geometrical and structural features are extracted from the processed image using different techniques. The optimized feature selection plays a crucial role in determining the accuracy of the classification algorithm. The novel variant of whale optimization algorithm and adaptive particle swarm optimization, hybrid bio-inspired WOA_APSO, is proposed for selecting optimized features. The feature selection grouping is applied by embedding linear discriminant analysis which helps in determining the reduced dimensions of subsets. Twofold performance comparisons are done. First, we compare the performance against the different classification techniques such as support vector machine, artificial neural network (ANN) and CNN. Second, the computational cost of the hybrid WOA_APSO is compared with the standard WOA and APSO algorithms. The experimental result reveals that the proposed algorithm is capable of automatic lung tumor detection and it outperforms the other state-of-the-art methods on standard quality measures such as accuracy (97.18%), sensitivity (97%) and specificity (98.66%). The results reported in this paper are encouraging; hence, these results will motivate other researchers to explore more in this direction.
Collapse
|
17
|
Abstract
Chest X-ray is the first imaging technique that plays an important role in the diagnosis of COVID-19 disease. Due to the high availability of large-scale annotated image datasets, great success has been achieved using convolutional neural networks (CNN s) for image recognition and classification. However, due to the limited availability of annotated medical images, the classification of medical images remains the biggest challenge in medical diagnosis. Thanks to transfer learning, an effective mechanism that can provide a promising solution by transferring knowledge from generic object recognition tasks to domain-specific tasks. In this paper, we validate and a deep CNN, called Decompose, Transfer, and Compose (DeTraC), for the classification of COVID-19 chest X-ray images. DeTraC can deal with any irregularities in the image dataset by investigating its class boundaries using a class decomposition mechanism. The experimental results showed the capability of DeTraC in the detection of COVID-19 cases from a comprehensive image dataset collected from several hospitals around the world. High accuracy of 93.1% (with a sensitivity of 100%) was achieved by DeTraC in the detection of COVID-19 X-ray images from normal, and severe acute respiratory syndrome cases.
Collapse
|
18
|
Abstract
There is a lot of abnormal information in the development of lung cancer, and how to extract useful knowledge is urgent from massive information. Data mining technology has become a popular tool for medical classification and prediction. However, each technology has its advantage and disadvantage, and several data mining methods will be applied to conduct the in-depth analysis step by step. And the prediction results of different models are compared. A total of 180 lung cancer patients and 243 lung benign individuals were collected from the First Affiliated Hospital of Zhengzhou University from October 2014 to March 2016, and the prediction models based on epidemiological data, clinical features and tumor markers were developed by artificial neural network (ANN), decision tree C5.0 and support vector machine (SVM). The results showed that there were significant differences between the lung cancer group and the lung benign group in terms of seven tumor markers and 10 epidemiological and clinical indicators. The accuracy rates of ANN, C5.0 and SVM were 76.47, 89.92 and 85.71%, respectively. The results of receiver operating characteristic curve (ROC) curve revealed the area under the ROC curve (AUC) of ANN was 0.811 (0.770-0.847), the AUC of C5.0 was 0.897 (0.864-0.924) and the AUC of SVM was 0.878 (0.843-0.908). It was shown that the decision tree C5.0 model has the least error rate and highest accuracy, and it could be used to diagnose lung cancer.
Collapse
|
19
|
Classification of Lung Nodules Based on Deep Residual Networks and Migration Learning. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2020; 2020:8975078. [PMID: 32318102 PMCID: PMC7149413 DOI: 10.1155/2020/8975078] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/04/2019] [Revised: 01/30/2020] [Accepted: 02/12/2020] [Indexed: 01/22/2023]
Abstract
The classification process of lung nodule detection in a traditional computer-aided detection (CAD) system is complex, and the classification result is heavily dependent on the performance of each step in lung nodule detection, causing low classification accuracy and high false positive rate. In order to alleviate these issues, a lung nodule classification method based on a deep residual network is proposed. Abandoning traditional image processing methods and taking the 50-layer ResNet network structure as the initial model, the deep residual network is constructed by combining residual learning and migration learning. The proposed approach is verified by conducting experiments on the lung computed tomography (CT) images from the publicly available LIDC-IDRI database. An average accuracy of 98.23% and a false positive rate of 1.65% are obtained based on the ten-fold cross-validation method. Compared with the conventional support vector machine (SVM)-based CAD system, the accuracy of our method improved by 9.96% and the false positive rate decreased by 6.95%, while the accuracy improved by 1.75% and 2.42%, respectively, and the false positive rate decreased by 2.07% and 2.22%, respectively, in contrast to the VGG19 model and InceptionV3 convolutional neural networks. The experimental results demonstrate the effectiveness of our proposed method in lung nodule classification for CT images.
Collapse
|
20
|
Kuo CFJ, Huang CC, Siao JJ, Hsieh CW, Huy VQ, Ko KH, Hsu HH. Automatic lung nodule detection system using image processing techniques in computed tomography. Biomed Signal Process Control 2020. [DOI: 10.1016/j.bspc.2019.101659] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
|
21
|
A systematic review and meta-analysis on performance of intelligent systems in lung cancer: Where are we? Artif Intell Rev 2019. [DOI: 10.1007/s10462-019-09764-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
22
|
Automatic nodule detection for lung cancer in CT images: A review. Comput Biol Med 2018; 103:287-300. [PMID: 30415174 DOI: 10.1016/j.compbiomed.2018.10.033] [Citation(s) in RCA: 57] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2018] [Revised: 10/29/2018] [Accepted: 10/29/2018] [Indexed: 12/18/2022]
Abstract
Automatic lung nodule detection has great significance for treating lung cancer and increasing patient survival. This work summarizes a critical review of recent techniques for automatic lung nodule detection in computed tomography images. This review indicates the current tendency and obtained progress as well as future challenges in this field. This research covered the databases including Web of Science, PubMed, and the Press, including IEEE Xplore and Science Direct, up to May 2018. Each part of the paper is summarized carefully in terms of the method and validation results for better comparison. Based on the results, some techniques show better performance for lung nodule detection. However, researchers should pay attention to the existing challenges, such as high sensitivity with a low false positive rate, large and different patient databases, developing or optimizing the detection technique of various types of lung nodules with different sizes, shapes, textures and locations, combining electronic medical records and picture archiving and communication systems, building efficient feature sets for better classification and promoting the cooperation and communication between academic institutions and medical organizations. We believe that automatic computer-aided detection systems will be developed with strong robustness, high efficiency and security assurance. This review will be helpful for professional researchers and radiologists to further learn about the latest techniques in computer-aided detection systems.
Collapse
|
23
|
Pulmonary Nodule Recognition Based on Multiple Kernel Learning Support Vector Machine-PSO. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2018; 2018:1461470. [PMID: 29853983 PMCID: PMC5949190 DOI: 10.1155/2018/1461470] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/28/2017] [Accepted: 03/12/2018] [Indexed: 11/30/2022]
Abstract
Pulmonary nodule recognition is the core module of lung CAD. The Support Vector Machine (SVM) algorithm has been widely used in pulmonary nodule recognition, and the algorithm of Multiple Kernel Learning Support Vector Machine (MKL-SVM) has achieved good results therein. Based on grid search, however, the MKL-SVM algorithm needs long optimization time in course of parameter optimization; also its identification accuracy depends on the fineness of grid. In the paper, swarm intelligence is introduced and the Particle Swarm Optimization (PSO) is combined with MKL-SVM algorithm to be MKL-SVM-PSO algorithm so as to realize global optimization of parameters rapidly. In order to obtain the global optimal solution, different inertia weights such as constant inertia weight, linear inertia weight, and nonlinear inertia weight are applied to pulmonary nodules recognition. The experimental results show that the model training time of the proposed MKL-SVM-PSO algorithm is only 1/7 of the training time of the MKL-SVM grid search algorithm, achieving better recognition effect. Moreover, Euclidean norm of normalized error vector is proposed to measure the proximity between the average fitness curve and the optimal fitness curve after convergence. Through statistical analysis of the average of 20 times operation results with different inertial weights, it can be seen that the dynamic inertial weight is superior to the constant inertia weight in the MKL-SVM-PSO algorithm. In the dynamic inertial weight algorithm, the parameter optimization time of nonlinear inertia weight is shorter; the average fitness value after convergence is much closer to the optimal fitness value, which is better than the linear inertial weight. Besides, a better nonlinear inertial weight is verified.
Collapse
|
24
|
Ngadi M, Amine A, Hachimi H, El-Attar A. Vicinal support vector classifier: A novel approach for robust classification based on SKDA. PATTERN RECOGNITION AND IMAGE ANALYSIS 2017. [DOI: 10.1134/s1054661817030245] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|