1
|
Hierarchical Voting-Based Feature Selection and Ensemble Learning Model Scheme for Glioma Grading with Clinical and Molecular Characteristics. Int J Mol Sci 2022; 23:ijms232214155. [PMID: 36430631 PMCID: PMC9697273 DOI: 10.3390/ijms232214155] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2022] [Revised: 10/31/2022] [Accepted: 11/12/2022] [Indexed: 11/18/2022] Open
Abstract
Determining the aggressiveness of gliomas, termed grading, is a critical step toward treatment optimization to increase the survival rate and decrease treatment toxicity for patients. Streamlined grading using molecular information has the potential to facilitate decision making in the clinic and aid in treatment planning. In recent years, molecular markers have increasingly gained importance in the classification of tumors. In this study, we propose a novel hierarchical voting-based methodology for improving the performance results of the feature selection stage and machine learning models for glioma grading with clinical and molecular predictors. To identify the best scheme for the given soft-voting-based ensemble learning model selections, we utilized publicly available TCGA and CGGA datasets and employed four dimensionality reduction methods to carry out a voting-based ensemble feature selection and five supervised models, with a total of sixteen combination sets. We also compared our proposed feature selection method with the LASSO feature selection method in isolation. The computational results indicate that the proposed method achieves 87.606% and 79.668% accuracy rates on TCGA and CGGA datasets, respectively, outperforming the LASSO feature selection method.
Collapse
|
2
|
Saihood A, Karshenas H, Nilchi ARN. Deep fusion of gray level co-occurrence matrices for lung nodule classification. PLoS One 2022; 17:e0274516. [PMID: 36174073 PMCID: PMC9521911 DOI: 10.1371/journal.pone.0274516] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2022] [Accepted: 08/28/2022] [Indexed: 11/19/2022] Open
Abstract
Lung cancer is a serious threat to human health, with millions dying because of its late diagnosis. The computerized tomography (CT) scan of the chest is an efficient method for early detection and classification of lung nodules. The requirement for high accuracy in analyzing CT scan images is a significant challenge in detecting and classifying lung cancer. In this paper, a new deep fusion structure based on the long short-term memory (LSTM) has been introduced, which is applied to the texture features computed from lung nodules through new volumetric grey-level-co-occurrence-matrices (GLCMs), classifying the nodules into benign, malignant, and ambiguous. Also, an improved Otsu segmentation method combined with the water strider optimization algorithm (WSA) is proposed to detect the lung nodules. WSA-Otsu thresholding can overcome the fixed thresholds and time requirement restrictions in previous thresholding methods. Extended experiments are used to assess this fusion structure by considering 2D-GLCM based on 2D-slices and approximating the proposed 3D-GLCM computations based on volumetric 2.5D-GLCMs. The proposed methods are trained and assessed through the LIDC-IDRI dataset. The accuracy, sensitivity, and specificity obtained for 2D-GLCM fusion are 94.4%, 91.6%, and 95.8%, respectively. For 2.5D-GLCM fusion, the accuracy, sensitivity, and specificity are 97.33%, 96%, and 98%, respectively. For 3D-GLCM, the accuracy, sensitivity, and specificity of the proposed fusion structure reached 98.7%, 98%, and 99%, respectively, outperforming most state-of-the-art counterparts. The results and analysis also indicate that the WSA-Otsu method requires a shorter execution time and yields a more accurate thresholding process.
Collapse
Affiliation(s)
- Ahmed Saihood
- Artificial Intelligence Department, Faculty of Computer Engineering, University of Isfahan, Isfahan, Iran
- Faculty of Computer Science and Mathematics, University of Thi-Qar, Nasiriyah, Thi-Qar, Iraq
| | - Hossein Karshenas
- Artificial Intelligence Department, Faculty of Computer Engineering, University of Isfahan, Isfahan, Iran
| | - Ahmad Reza Naghsh Nilchi
- Artificial Intelligence Department, Faculty of Computer Engineering, University of Isfahan, Isfahan, Iran
| |
Collapse
|
3
|
An ISHAP-based interpretation-model-guided classification method for malignant pulmonary nodule. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2021.107778] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
|
4
|
Krauze AV, Zhuge Y, Zhao R, Tasci E, Camphausen K. AI-Driven Image Analysis in Central Nervous System Tumors-Traditional Machine Learning, Deep Learning and Hybrid Models. JOURNAL OF BIOTECHNOLOGY AND BIOMEDICINE 2022; 5:1-19. [PMID: 35106480 PMCID: PMC8802234 DOI: 10.26502/jbb.2642-91280046] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
The interpretation of imaging in medicine in general and in oncology specifically remains problematic due to several limitations which include the need to incorporate detailed clinical history, patient and disease-specific history, clinical exam features, previous and ongoing treatment, and account for the dependency on reproducible human interpretation of multiple factors with incomplete data linkage. To standardize reporting, minimize bias, expedite management, and improve outcomes, the use of Artificial Intelligence (AI) has gained significant prominence in imaging analysis. In oncology, AI methods have as a result been explored in most cancer types with ongoing progress in employing AI towards imaging for oncology treatment, assessing treatment response, and understanding and communicating prognosis. Challenges remain with limited available data sets, variability in imaging changes over time augmented by a growing heterogeneity in analysis approaches. We review the imaging analysis workflow and examine how hand-crafted features also referred to as traditional Machine Learning (ML), Deep Learning (DL) approaches, and hybrid analyses, are being employed in AI-driven imaging analysis in central nervous system tumors. ML, DL, and hybrid approaches coexist, and their combination may produce superior results although data in this space is as yet novel, and conclusions and pitfalls have yet to be fully explored. We note the growing technical complexities that may become increasingly separated from the clinic and enforce the acute need for clinician engagement to guide progress and ensure that conclusions derived from AI-driven imaging analysis reflect that same level of scrutiny lent to other avenues of clinical research.
Collapse
Affiliation(s)
- A V Krauze
- Center for Cancer Research, National Cancer Institute, NIH, Building 10, Room B2-3637, Bethesda, USA
| | - Y Zhuge
- Center for Cancer Research, National Cancer Institute, NIH, Building 10, Room B2-3637, Bethesda, USA
| | - R Zhao
- University of British Columbia, Faculty of Medicine, 317 - 2194 Health Sciences Mall, Vancouver, Canada
| | - E Tasci
- Center for Cancer Research, National Cancer Institute, NIH, Building 10, Room B2-3637, Bethesda, USA
| | - K Camphausen
- Center for Cancer Research, National Cancer Institute, NIH, Building 10, Room B2-3637, Bethesda, USA
| |
Collapse
|
5
|
|
6
|
Lung Nodule Detection from Feature Engineering to Deep Learning in Thoracic CT Images: a Comprehensive Review. J Digit Imaging 2021; 33:655-677. [PMID: 31997045 DOI: 10.1007/s10278-020-00320-6] [Citation(s) in RCA: 36] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
This paper presents a systematic review of the literature focused on the lung nodule detection in chest computed tomography (CT) images. Manual detection of lung nodules by the radiologist is a sequential and time-consuming process. The detection is subjective and depends on the radiologist's experiences. Owing to the variation in shapes and appearances of a lung nodule, it is very difficult to identify the proper location of the nodule from a huge number of slices generated by the CT scanner. Small nodules (< 10 mm in diameter) may be missed by this manual detection process. Therefore, computer-aided diagnosis (CAD) system acts as a "second opinion" for the radiologists, by making final decision quickly with higher accuracy and greater confidence. The goal of this survey work is to present the current state of the artworks and their progress towards lung nodule detection to the researchers and readers in this domain. This review paper has covered the published works from 2009 to April 2018. Different nodule detection approaches are described elaborately in this work. Recently, it is observed that deep learning (DL)-based approaches are applied extensively for nodule detection and characterization. Therefore, emphasis has been given to convolutional neural network (CNN)-based DL approaches by describing different CNN-based networks.
Collapse
|
7
|
Rapid Assessment of Acute Ischemic Stroke by Computed Tomography Using Deep Convolutional Neural Networks. J Digit Imaging 2021; 34:637-646. [PMID: 33963421 DOI: 10.1007/s10278-021-00457-y] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2020] [Revised: 03/10/2021] [Accepted: 04/27/2021] [Indexed: 01/01/2023] Open
Abstract
Acute stroke is one of the leading causes of disability and death worldwide. Regarding clinical diagnoses, a rapid and accurate procedure is necessary for patients suffering from acute stroke. This study proposes an automatic identification scheme for acute ischemic stroke using deep convolutional neural networks (DCNNs) based on non-contrast computed tomographic (NCCT) images. Our image database for the classification model was composed of 1254 grayscale NCCT images from 96 patients (573 images) with acute ischemic stroke and 121 normal controls (681 images). According to the consensus of critical stroke findings by two neuroradiologists, a gold standard was established and used to train the proposed DCNN using machine-generated image features. Including the earliest DCNN, AlexNet, the popular Inception-v3, and ResNet-101 were proposed. To train the limited data size, transfer learning with ImageNet parameters was also used. The established models were evaluated by tenfold cross-validation and tested on an independent dataset containing 50 patients with acute ischemic stroke (108 images) and 58 normal controls (117 images) from another institution. AlexNet without pretrained parameters achieved an accuracy of 97.12%, a sensitivity of 98.11%, a specificity of 96.08%, and an area under the receiver operating characteristic curve (AUC) of 0.9927. Using transfer learning, transferred AlexNet, transferred Inception-v3, and transferred ResNet-101 achieved accuracies between 90.49 and 95.49%. Tested with a dataset from another institution, AlexNet showed an accuracy of 60.89%, a sensitivity of 18.52%, and a specificity of 100%. Transferred AlexNet, Inception-v3, and ResNet-101 achieved accuracies of 81.77%, 85.78%, and 80.89%, respectively. The proposed DCNN architecture as a computer-aided diagnosis system showed that training from scratch can generate a customized model for a specific scanner, and transfer learning can generate a more generalized model to provide diagnostic suggestions of acute ischemic stroke to radiologists.
Collapse
|
8
|
Heydari F, Rafsanjani MK. A Review on Lung Cancer Diagnosis Using Data Mining Algorithms. Curr Med Imaging 2021; 17:16-26. [PMID: 32586255 DOI: 10.2174/1573405616666200625153017] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2019] [Revised: 05/01/2020] [Accepted: 05/11/2020] [Indexed: 11/22/2022]
Abstract
Due to the serious consequences of lung cancer, medical associations use computer-aided diagnostic procedures to diagnose this disease more accurately. Despite the damaging effects of lung cancer on the body, the lifetime of cancer patients can be extended by early diagnosis. Data mining techniques are practical in diagnosing lung cancer in its first stages. This paper surveys a number of leading data mining-based cancer diagnosis approaches. Moreover, this review draws a comparison between data mining approaches in terms of selection criteria and presents the advantages and disadvantages of each method.
Collapse
Affiliation(s)
- Farzad Heydari
- Department of Computer Science, Faculty of Mathematics and Computer, Shahid Bahonar University of Kerman, Kerman, Iran
| | - Marjan Kuchaki Rafsanjani
- Department of Computer Science, Faculty of Mathematics and Computer, Shahid Bahonar University of Kerman, Kerman, Iran
| |
Collapse
|
9
|
Pehrson LM, Nielsen MB, Ammitzbøl Lauridsen C. Automatic Pulmonary Nodule Detection Applying Deep Learning or Machine Learning Algorithms to the LIDC-IDRI Database: A Systematic Review. Diagnostics (Basel) 2019; 9:E29. [PMID: 30866425 PMCID: PMC6468920 DOI: 10.3390/diagnostics9010029] [Citation(s) in RCA: 51] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2018] [Revised: 01/29/2019] [Accepted: 02/19/2019] [Indexed: 12/27/2022] Open
Abstract
The aim of this study was to provide an overview of the literature available on machine learning (ML) algorithms applied to the Lung Image Database Consortium Image Collection (LIDC-IDRI) database as a tool for the optimization of detecting lung nodules in thoracic CT scans. This systematic review was compiled according to Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines. Only original research articles concerning algorithms applied to the LIDC-IDRI database were included. The initial search yielded 1972 publications after removing duplicates, and 41 of these articles were included in this study. The articles were divided into two subcategories describing their overall architecture. The majority of feature-based algorithms achieved an accuracy >90% compared to the deep learning (DL) algorithms that achieved an accuracy in the range of 82.2%⁻97.6%. In conclusion, ML and DL algorithms are able to detect lung nodules with a high level of accuracy, sensitivity, and specificity using ML, when applied to an annotated archive of CT scans of the lung. However, there is no consensus on the method applied to determine the efficiency of ML algorithms.
Collapse
Affiliation(s)
- Lea Marie Pehrson
- Department of Diagnostic Radiology, Copenhagen University Hospital, Rigshospitalet, 2100 Copenhagen, Denmark.
- Department of Technology, Faculty of Health and Technology, University College Copenhagen, 2200 Copenhagen, Denmark.
| | - Michael Bachmann Nielsen
- Department of Diagnostic Radiology, Copenhagen University Hospital, Rigshospitalet, 2100 Copenhagen, Denmark.
| | - Carsten Ammitzbøl Lauridsen
- Department of Diagnostic Radiology, Copenhagen University Hospital, Rigshospitalet, 2100 Copenhagen, Denmark.
- Department of Technology, Faculty of Health and Technology, University College Copenhagen, 2200 Copenhagen, Denmark.
| |
Collapse
|
10
|
Tan J, Huo Y, Liang Z, Li L. Expert knowledge-infused deep learning for automatic lung nodule detection. JOURNAL OF X-RAY SCIENCE AND TECHNOLOGY 2019; 27:17-35. [PMID: 30452432 PMCID: PMC6453714 DOI: 10.3233/xst-180426] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]
Abstract
BACKGROUND Computer aided detection (CADe) of pulmonary nodules from computed tomography (CT) is crucial for early diagnosis of lung cancer. Self-learned features obtained by training datasets via deep learning have facilitated CADe of the nodules. However, the complexity of CT lung images renders a challenge of extracting effective features by self-learning only. This condition is exacerbated for limited size of datasets. On the other hand, the engineered features have been widely studied. OBJECTIVE We proposed a novel nodule CADe which aims to relieve the challenge by the use of available engineered features to prevent convolution neural networks (CNN) from overfitting under dataset limitation and reduce the running-time complexity of self-learning. METHODS The CADe methodology infuses adequately the engineered features, particularly texture features, into the deep learning process. RESULTS The methodology was validated on 208 patients with at least one juxta-pleural nodule from the public LIDC-IDRI database. Results demonstrated that the methodology achieves a sensitivity of 88% with 1.9 false positives per scan and a sensitivity of 94.01% with 4.01 false positives per scan. CONCLUSIONS The methodology shows high performance compared with the state-of-the-art results, in terms of accuracy and efficiency, from both existing CNN-based approaches and engineered feature-based classifications.
Collapse
Affiliation(s)
- Jiaxing Tan
- Department of Computer Science, City University of New York, the Graduate Center, NY, USA
| | - Yumei Huo
- Department of Computer Science, City University of New York at CSI, NY, USA
| | - Zhengrong Liang
- Department of Radiology, State University of New York at Stony Brook, NY, USA
- Corresponding author: Zhengrong Liang, Department of Radiology, Electrical and Computer Engineering, and Biomedical Engineering, Stony Brook University, Stony Brook, NY 11794, USA. .
| | - Lihong Li
- Department of Engineering Science and Physics, City University of New York at CSI, NY, USA
| |
Collapse
|
11
|
Lung nodule detection and classification based on geometric fit in parametric form and deep learning. Neural Comput Appl 2018. [DOI: 10.1007/s00521-018-3773-x] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
|
12
|
Abstract
Lung cancer mortality is currently the highest among all kinds of fatal cancers. With the help of computer-aided detection systems, a timely detection of malignant pulmonary nodule at early stage could improve the patient survival rate efficiently. However, the sizes of the pulmonary nodules are usually various, and it is more difficult to detect small diameter nodules. The traditional convolution neural network uses pooling layers to reduce the resolution progressively, but it hampers the network’s ability to capture the tiny but vital features of the pulmonary nodules. To tackle this problem, we propose a novel 3D spatial pyramid dilated convolution network to classify the malignancy of the pulmonary nodules. Instead of using the pooling layers, we use 3D dilated convolution to learn the detailed characteristic information of the pulmonary nodules. Furthermore, we show that the fusion of multiple receptive fields from different dilated convolutions could further improve the classification performance of the model. Extensive experimental results demonstrate that our model achieves a better result with an accuracy of 88 . 6 % , which outperforms other state-of-the- art methods.
Collapse
|
13
|
da Silva GLF, Valente TLA, Silva AC, de Paiva AC, Gattass M. Convolutional neural network-based PSO for lung nodule false positive reduction on CT images. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2018; 162:109-118. [PMID: 29903476 DOI: 10.1016/j.cmpb.2018.05.006] [Citation(s) in RCA: 50] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/01/2017] [Revised: 09/15/2017] [Accepted: 05/03/2018] [Indexed: 06/08/2023]
Abstract
BACKGROUND AND OBJECTIVE Detection of lung nodules is critical in CAD systems; this is because of their similar contrast with other structures and low density, which result in the generation of numerous false positives (FPs). Therefore, this study proposes a methodology to reduce the FP number using a deep learning technique in conjunction with an evolutionary technique. METHOD The particle swarm optimization (PSO) algorithm was used to optimize the network hyperparameters in the convolutional neural network (CNN) in order to enhance the network performance and eliminate the requirement of manual search. RESULTS The methodology was tested on computed tomography (CT) scans from the Lung Image Database Consortium and Image Database Resource Initiative (LIDC-IDRI) with the highest accuracy of 97.62%, sensitivity of 92.20%, specificity of 98.64%, and area under the receiver operating characteristic (ROC) curve of 0.955. CONCLUSION The results demonstrate the high performance-potential of the PSO algorithm in the identification of optimal CNN hyperparameters for lung nodule candidate classification into nodules and non-nodules, increasing the sensitivity rates in the FP reduction step of CAD systems.
Collapse
Affiliation(s)
- Giovanni Lucca França da Silva
- Federal University of Maranhão - UFMA, Applied Computing Group - NCA Av. dos Portugueses, SN, Bacanga, São Luís, MA 65085-580, Brazil.
| | - Thales Levi Azevedo Valente
- Pontifical Catholic University of Rio de Janeiro - PUC - Rio R. São Vicente, 225, Gávea, Rio de Janeiro, RJ 22453-900, Brazil.
| | - Aristófanes Corrêa Silva
- Federal University of Maranhão - UFMA, Applied Computing Group - NCA Av. dos Portugueses, SN, Bacanga, São Luís, MA 65085-580, Brazil.
| | - Anselmo Cardoso de Paiva
- Federal University of Maranhão - UFMA, Applied Computing Group - NCA Av. dos Portugueses, SN, Bacanga, São Luís, MA 65085-580, Brazil.
| | - Marcelo Gattass
- Pontifical Catholic University of Rio de Janeiro - PUC - Rio R. São Vicente, 225, Gávea, Rio de Janeiro, RJ 22453-900, Brazil.
| |
Collapse
|
14
|
Multistage segmentation model and SVM-ensemble for precise lung nodule detection. Int J Comput Assist Radiol Surg 2018; 13:1083-1095. [DOI: 10.1007/s11548-018-1715-9] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2017] [Accepted: 02/16/2018] [Indexed: 10/17/2022]
|
15
|
Juxta-Vascular Pulmonary Nodule Segmentation in PET-CT Imaging Based on an LBF Active Contour Model with Information Entropy and Joint Vector. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2018. [PMID: 29531575 PMCID: PMC5817370 DOI: 10.1155/2018/2183847] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
The accurate segmentation of pulmonary nodules is an important preprocessing step in computer-aided diagnoses of lung cancers. However, the existing segmentation methods may cause the problem of edge leakage and cannot segment juxta-vascular pulmonary nodules accurately. To address this problem, a novel automatic segmentation method based on an LBF active contour model with information entropy and joint vector is proposed in this paper. Our method extracts the interest area of pulmonary nodules by a standard uptake value (SUV) in Positron Emission Tomography (PET) images, and automatic threshold iteration is used to construct an initial contour roughly. The SUV information entropy and the gray-value joint vector of Positron Emission Tomography–Computed Tomography (PET-CT) images are calculated to drive the evolution of contour curve. At the edge of pulmonary nodules, evolution will be stopped and accurate results of pulmonary nodule segmentation can be obtained. Experimental results show that our method can achieve 92.35% average dice similarity coefficient, 2.19 mm Hausdorff distance, and 3.33% false positive with the manual segmentation results. Compared with the existing methods, our proposed method that segments juxta-vascular pulmonary nodules in PET-CT images is more accurate and efficient.
Collapse
|
16
|
Takahashi R, Kajikawa Y. Computer-aided diagnosis: A survey with bibliometric analysis. Int J Med Inform 2017; 101:58-67. [DOI: 10.1016/j.ijmedinf.2017.02.004] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2016] [Revised: 01/28/2017] [Accepted: 02/04/2017] [Indexed: 12/18/2022]
|
17
|
Liu JK, Jiang HY, Gao MD, He CG, Wang Y, Wang P, Ma H, Li Y. An Assisted Diagnosis System for Detection of Early Pulmonary Nodule in Computed Tomography Images. J Med Syst 2016; 41:30. [PMID: 28032305 DOI: 10.1007/s10916-016-0669-0] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2016] [Accepted: 12/07/2016] [Indexed: 11/28/2022]
Abstract
Lung cancer is still the most concerned disease around the world. Lung nodule generates in the pulmonary parenchyma which indicates the latent risk of lung cancer. Computer-aided pulmonary nodules detection system is necessary, which can reduce diagnosis time and decrease mortality of patients. In this study, we have proposed a new computer aided diagnosis (CAD) system for detection of early pulmonary nodule, which can help radiologists quickly locate suspected nodules and make judgments. This system consists of four main sections: pulmonary parenchyma segmentation, nodule candidate detection, features extraction (total 22 features) and nodule classification. The publicly available data set created by the Lung Image Database Consortium (LIDC) is used for training and testing. This study selects 6400 slices from 80 CT scans containing totally 978 nodules, which is labeled by four radiologists. Through a fast segmentation method proposed in this paper, pulmonary nodules including 888 true nodules and 11,379 false positive nodules are segmented. By means of an ensemble classifier, Random Forest (RF), this study acquires 93.2, 92.4, 94.8, 97.6% of accuracy, sensitivity, specificity, area under the curve (AUC), respectively. Compared with support vector machine (SVM) classifier, RF can reduce more false positive nodules and acquire larger AUC. With the help of this CAD system, radiologist can be provided with a great reference for pulmonary nodule diagnosis timely.
Collapse
Affiliation(s)
- Ji-Kui Liu
- Key Laboratory for Health Informatics of the Chinese Academy of Sciences (HICAS), Shenzhen Institutes of Advanced Technology, Shenzhen, 518055, Guangdong, China
| | - Hong-Yang Jiang
- Sino-Dutch Biomedical and Information Engineering School, Hunnan Campus, Northeastern University, Shenyang, 110169, Liaoning, China
| | - Meng-di Gao
- Sino-Dutch Biomedical and Information Engineering School, Hunnan Campus, Northeastern University, Shenyang, 110169, Liaoning, China
| | - Chen-Guang He
- Software School, North China University of Water Resources and Electric Power, Zhengzhou, 450045, Henan, China
| | - Yu Wang
- Sino-Dutch Biomedical and Information Engineering School, Hunnan Campus, Northeastern University, Shenyang, 110169, Liaoning, China
| | - Pu Wang
- Key Laboratory for Health Informatics of the Chinese Academy of Sciences (HICAS), Shenzhen Institutes of Advanced Technology, Shenzhen, 518055, Guangdong, China
| | - He Ma
- Sino-Dutch Biomedical and Information Engineering School, Hunnan Campus, Northeastern University, Shenyang, 110169, Liaoning, China.
| | - Ye Li
- Key Laboratory for Health Informatics of the Chinese Academy of Sciences (HICAS), Shenzhen Institutes of Advanced Technology, Shenzhen, 518055, Guangdong, China.
| |
Collapse
|
18
|
Hancock MC, Magnan JF. Lung nodule malignancy classification using only radiologist-quantified image features as inputs to statistical learning algorithms: probing the Lung Image Database Consortium dataset with two statistical learning methods. J Med Imaging (Bellingham) 2016; 3:044504. [PMID: 27990453 DOI: 10.1117/1.jmi.3.4.044504] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2016] [Accepted: 11/14/2016] [Indexed: 01/12/2023] Open
Abstract
In the assessment of nodules in CT scans of the lungs, a number of image-derived features are diagnostically relevant. Currently, many of these features are defined only qualitatively, so they are difficult to quantify from first principles. Nevertheless, these features (through their qualitative definitions and interpretations thereof) are often quantified via a variety of mathematical methods for the purpose of computer-aided diagnosis (CAD). To determine the potential usefulness of quantified diagnostic image features as inputs to a CAD system, we investigate the predictive capability of statistical learning methods for classifying nodule malignancy. We utilize the Lung Image Database Consortium dataset and only employ the radiologist-assigned diagnostic feature values for the lung nodules therein, as well as our derived estimates of the diameter and volume of the nodules from the radiologists' annotations. We calculate theoretical upper bounds on the classification accuracy that are achievable by an ideal classifier that only uses the radiologist-assigned feature values, and we obtain an accuracy of 85.74 [Formula: see text], which is, on average, 4.43% below the theoretical maximum of 90.17%. The corresponding area-under-the-curve (AUC) score is 0.932 ([Formula: see text]), which increases to 0.949 ([Formula: see text]) when diameter and volume features are included and has an accuracy of 88.08 [Formula: see text]. Our results are comparable to those in the literature that use algorithmically derived image-based features, which supports our hypothesis that lung nodules can be classified as malignant or benign using only quantified, diagnostic image features, and indicates the competitiveness of this approach. We also analyze how the classification accuracy depends on specific features and feature subsets, and we rank the features according to their predictive power, statistically demonstrating the top four to be spiculation, lobulation, subtlety, and calcification.
Collapse
Affiliation(s)
- Matthew C Hancock
- Florida State University , Department of Mathematics, 208 Love Building, 1017 Academic Way, Tallahassee, Florida 32306-4510, United States
| | - Jerry F Magnan
- Florida State University , Department of Mathematics, 208 Love Building, 1017 Academic Way, Tallahassee, Florida 32306-4510, United States
| |
Collapse
|
19
|
Computer-aided detection of pulmonary nodules using dynamic self-adaptive template matching and a FLDA classifier. Phys Med 2016; 32:1502-1509. [PMID: 27856118 DOI: 10.1016/j.ejmp.2016.11.001] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/13/2016] [Revised: 11/01/2016] [Accepted: 11/01/2016] [Indexed: 11/24/2022] Open
Abstract
Improving the performance of computer-aided detection (CAD) system for pulmonary nodules is still an important issue for its future clinical applications. This study aims to develop a new CAD scheme for pulmonary nodule detection based on dynamic self-adaptive template matching and Fisher linear discriminant analysis (FLDA) classifier. We first segment and repair lung volume by using OTSU algorithm and three-dimensional (3D) region growing. Next, the suspicious regions of interest (ROIs) are extracted and filtered by applying 3D dot filtering and thresholding method. Then, pulmonary nodule candidates are roughly detected with 3D dynamic self-adaptive template matching. Finally, we optimally select 11 image features and apply FLDA classifier to reduce false positive detections. The performance of the new method is validated by comparing with other methods through experiments using two groups of public datasets from Lung Image Database Consortium (LIDC) and ANODE09. By a 10-fold cross-validation experiment, the new CAD scheme finally has achieved a sensitivity of 90.24% and a false-positive (FP) of 4.54 FP/scan on average for the former dataset, and a sensitivity of 84.1% with 5.59 FP/scan for the latter. By comparing with other previously reported CAD schemes tested on the same datasets, the study proves that this new scheme can yield higher and more robust results in detecting pulmonary nodules.
Collapse
|
20
|
Wu P, Xia K, Yu H. Correlation coefficient based supervised locally linear embedding for pulmonary nodule recognition. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2016; 136:97-106. [PMID: 27686707 PMCID: PMC5076559 DOI: 10.1016/j.cmpb.2016.08.009] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/19/2015] [Revised: 07/01/2016] [Accepted: 08/11/2016] [Indexed: 06/06/2023]
Abstract
BACKGROUND AND OBJECTIVE Dimensionality reduction techniques are developed to suppress the negative effects of high dimensional feature space of lung CT images on classification performance in computer aided detection (CAD) systems for pulmonary nodule detection. METHODS An improved supervised locally linear embedding (SLLE) algorithm is proposed based on the concept of correlation coefficient. The Spearman's rank correlation coefficient is introduced to adjust the distance metric in the SLLE algorithm to ensure that more suitable neighborhood points could be identified, and thus to enhance the discriminating power of embedded data. The proposed Spearman's rank correlation coefficient based SLLE (SC(2)SLLE) is implemented and validated in our pilot CAD system using a clinical dataset collected from the publicly available lung image database consortium and image database resource initiative (LICD-IDRI). Particularly, a representative CAD system for solitary pulmonary nodule detection is designed and implemented. After a sequential medical image processing steps, 64 nodules and 140 non-nodules are extracted, and 34 representative features are calculated. The SC(2)SLLE, as well as SLLE and LLE algorithm, are applied to reduce the dimensionality. Several quantitative measurements are also used to evaluate and compare the performances. RESULTS Using a 5-fold cross-validation methodology, the proposed algorithm achieves 87.65% accuracy, 79.23% sensitivity, 91.43% specificity, and 8.57% false positive rate, on average. Experimental results indicate that the proposed algorithm outperforms the original locally linear embedding and SLLE coupled with the support vector machine (SVM) classifier. CONCLUSIONS Based on the preliminary results from a limited number of nodules in our dataset, this study demonstrates the great potential to improve the performance of a CAD system for nodule detection using the proposed SC(2)SLLE.
Collapse
Affiliation(s)
- Panpan Wu
- School of Electronic and Information Engineering, Hebei University of Technology, Tianjin, 300401, China; Department of Electrical and Computer Engineering, University of Massachusetts Lowell, Lowell, MA 01854, USA
| | - Kewen Xia
- School of Electronic and Information Engineering, Hebei University of Technology, Tianjin, 300401, China
| | - Hengyong Yu
- Department of Electrical and Computer Engineering, University of Massachusetts Lowell, Lowell, MA 01854, USA.
| |
Collapse
|
21
|
Wang Q, Kang W, Hu H, Wang B. HOSVD-Based 3D Active Appearance Model: Segmentation of Lung Fields in CT Images. J Med Syst 2016; 40:176. [PMID: 27277277 DOI: 10.1007/s10916-016-0535-0] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2015] [Accepted: 06/01/2016] [Indexed: 11/24/2022]
Abstract
An Active Appearance Model (AAM) is a computer vision model which can be used to effectively segment lung fields in CT images. However, the fitting result is often inadequate when the lungs are affected by high-density pathologies. To overcome this problem, we propose a Higher-order Singular Value Decomposition (HOSVD)-based Three-dimensional (3D) AAM. An evaluation was performed on 310 diseased lungs form the Lung Image Database Consortium Image Collection. Other contemporary AAMs operate directly on patterns represented by vectors, i.e., before applying the AAM to a 3D lung volume,it has to be vectorized first into a vector pattern by some technique like concatenation. However, some implicit structural or local contextual information may be lost in this transformation. According to the nature of the 3D lung volume, HOSVD is introduced to represent and process the lung in tensor space. Our method can not only directly operate on the original 3D tensor patterns, but also efficiently reduce the computer memory usage. The evaluation resulted in an average Dice coefficient of 97.0 % ± 0.59 %, a mean absolute surface distance error of 1.0403 ± 0.5716 mm, a mean border positioning errors of 0.9187 ± 0.5381 pixel, and a Hausdorff Distance of 20.4064 ± 4.3855, respectively. Experimental results showed that our methods delivered significant and better segmentation results, compared with the three other model-based lung segmentation approaches, namely 3D Snake, 3D ASM and 3D AAM.
Collapse
Affiliation(s)
- Qingzhu Wang
- School of Information Engineering, Northeast Dianli University, Jilin, 132012, China.
| | - Wanjun Kang
- School of Information Engineering, Northeast Dianli University, Jilin, 132012, China
| | - Haihui Hu
- School of Information Engineering, Northeast Dianli University, Jilin, 132012, China
| | - Bin Wang
- Jilin Tumor Hospital, Changchun, China
| |
Collapse
|
22
|
Computer-aided detection (CADe) and diagnosis (CADx) system for lung cancer with likelihood of malignancy. Biomed Eng Online 2016; 15:2. [PMID: 26759159 PMCID: PMC5002110 DOI: 10.1186/s12938-015-0120-7] [Citation(s) in RCA: 107] [Impact Index Per Article: 13.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2015] [Accepted: 12/22/2015] [Indexed: 01/04/2023] Open
Abstract
Background CADe and CADx systems for the detection and diagnosis of lung cancer have been important areas of research in recent decades. However, these areas are being worked on separately. CADe systems do not present the radiological characteristics of tumors, and CADx systems do not detect nodules and do not have good levels of automation. As a result, these systems are not yet widely used in clinical settings. Methods The purpose of this article is to develop a new system for detection and diagnosis of pulmonary nodules on CT images, grouping them into a single system for the identification and characterization of the nodules to improve the level of automation. The article also presents as contributions: the use of Watershed and Histogram of oriented Gradients (HOG) techniques for distinguishing the possible nodules from other structures and feature extraction for pulmonary nodules, respectively. For the diagnosis, it is based on the likelihood of malignancy allowing more aid in the decision making by the radiologists. A rule-based classifier and Support Vector Machine (SVM) have been used to eliminate false positives. Results The database used in this research consisted of 420 cases obtained randomly from LIDC-IDRI. The segmentation method achieved an accuracy of 97 % and the detection system showed a sensitivity of 94.4 % with 7.04 false positives per case. Different types of nodules (isolated, juxtapleural, juxtavascular and ground-glass) with diameters between 3 mm and 30 mm have been detected. For the diagnosis of malignancy our system presented ROC curves with areas of: 0.91 for nodules highly unlikely of being malignant, 0.80 for nodules moderately unlikely of being malignant, 0.72 for nodules with indeterminate malignancy, 0.67 for nodules moderately suspicious of being malignant and 0.83 for nodules highly suspicious of being malignant. Conclusions From our preliminary results, we believe that our system is promising for clinical applications assisting radiologists in the detection and diagnosis of lung cancer.
Collapse
|