1
|
Two-way threshold-based intelligent water drops feature selection algorithm for accurate detection of breast cancer. Soft comput 2021. [DOI: 10.1007/s00500-021-06498-3] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
|
2
|
Mohammadi M, Hourfar F, Elkamel A, Leonenko Y. Economic Optimization Design of CO 2 Pipeline Transportation with Booster Stations. Ind Eng Chem Res 2019. [DOI: 10.1021/acs.iecr.9b02348] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Mohammad Mohammadi
- Department of Chemical Engineering, University of Waterloo, Waterloo, Ontario N2L 3G1, Canada
| | - Farzad Hourfar
- Control & Intelligent Processing Center of Excellence (CIPCE), School of Electrical & Computer Engineering, University of Tehran, 1417466191 Tehran, Iran
| | - Ali Elkamel
- Department of Chemical Engineering, University of Waterloo, Waterloo, Ontario N2L 3G1, Canada
- Department of Chemical Engineering, Khalifa University, Abu Dhabi 127788, United Arab Emirates
| | - Yuri Leonenko
- Department of Chemical Engineering, University of Waterloo, Waterloo, Ontario N2L 3G1, Canada
- Department of Earth and Environmental Sciences, University of Waterloo, Waterloo, Ontario N2L 3G1, Canada
- Department of Geography and Environmental Management, University of Waterloo, Waterloo, Ontario N2L 3G1, Canada
| |
Collapse
|
3
|
Hussain L, Ahmed A, Saeed S, Rathore S, Awan IA, Shah SA, Majid A, Idris A, Awan AA. Prostate cancer detection using machine learning techniques by employing combination of features extracting strategies. Cancer Biomark 2018; 21:393-413. [PMID: 29226857 DOI: 10.3233/cbm-170643] [Citation(s) in RCA: 38] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
Abstract
Prostate is a second leading causes of cancer deaths among men. Early detection of cancer can effectively reduce the rate of mortality caused by Prostate cancer. Due to high and multiresolution of MRIs from prostate cancer require a proper diagnostic systems and tools. In the past researchers developed Computer aided diagnosis (CAD) systems that help the radiologist to detect the abnormalities. In this research paper, we have employed novel Machine learning techniques such as Bayesian approach, Support vector machine (SVM) kernels: polynomial, radial base function (RBF) and Gaussian and Decision Tree for detecting prostate cancer. Moreover, different features extracting strategies are proposed to improve the detection performance. The features extracting strategies are based on texture, morphological, scale invariant feature transform (SIFT), and elliptic Fourier descriptors (EFDs) features. The performance was evaluated based on single as well as combination of features using Machine Learning Classification techniques. The Cross validation (Jack-knife k-fold) was performed and performance was evaluated in term of receiver operating curve (ROC) and specificity, sensitivity, Positive predictive value (PPV), negative predictive value (NPV), false positive rate (FPR). Based on single features extracting strategies, SVM Gaussian Kernel gives the highest accuracy of 98.34% with AUC of 0.999. While, using combination of features extracting strategies, SVM Gaussian kernel with texture + morphological, and EFDs + morphological features give the highest accuracy of 99.71% and AUC of 1.00.
Collapse
Affiliation(s)
- Lal Hussain
- QEC, The University of Azad Jammu and Kashmir, Muzaffarabad, Azad Kashmir, Pakistan.,Department of CS and IT, The University of Azad Jammu and Kashmir, Muzaffarabad, Azad Kashmir, Pakistan
| | - Adeel Ahmed
- Department of CS and IT, The University of Azad Jammu and Kashmir, Muzaffarabad, Azad Kashmir, Pakistan
| | - Sharjil Saeed
- Department of CS and IT, The University of Azad Jammu and Kashmir, Muzaffarabad, Azad Kashmir, Pakistan
| | - Saima Rathore
- Department of Radiology, University of Pennsylvania, Philadelphia, PA, USA
| | - Imtiaz Ahmed Awan
- Department of CS and IT, The University of Azad Jammu and Kashmir, Muzaffarabad, Azad Kashmir, Pakistan
| | - Saeed Arif Shah
- Department of CS and IT, The University of Azad Jammu and Kashmir, Muzaffarabad, Azad Kashmir, Pakistan
| | - Abdul Majid
- Department of CS and IT, The University of Azad Jammu and Kashmir, Muzaffarabad, Azad Kashmir, Pakistan
| | - Adnan Idris
- Department of CS and IT, University of Poonch Rawalakot, Rawalakot, Azad Kashmir, Pakistan
| | - Anees Ahmed Awan
- Department of CS and IT, The University of Azad Jammu and Kashmir, Muzaffarabad, Azad Kashmir, Pakistan
| |
Collapse
|
4
|
Zhao LP, Bolouri H. Object-oriented regression for building predictive models with high dimensional omics data from translational studies. J Biomed Inform 2016; 60:431-45. [PMID: 26972839 PMCID: PMC5097461 DOI: 10.1016/j.jbi.2016.03.001] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2015] [Revised: 02/23/2016] [Accepted: 03/01/2016] [Indexed: 12/31/2022]
Abstract
Maturing omics technologies enable researchers to generate high dimension omics data (HDOD) routinely in translational clinical studies. In the field of oncology, The Cancer Genome Atlas (TCGA) provided funding support to researchers to generate different types of omics data on a common set of biospecimens with accompanying clinical data and has made the data available for the research community to mine. One important application, and the focus of this manuscript, is to build predictive models for prognostic outcomes based on HDOD. To complement prevailing regression-based approaches, we propose to use an object-oriented regression (OOR) methodology to identify exemplars specified by HDOD patterns and to assess their associations with prognostic outcome. Through computing patient's similarities to these exemplars, the OOR-based predictive model produces a risk estimate using a patient's HDOD. The primary advantages of OOR are twofold: reducing the penalty of high dimensionality and retaining the interpretability to clinical practitioners. To illustrate its utility, we apply OOR to gene expression data from non-small cell lung cancer patients in TCGA and build a predictive model for prognostic survivorship among stage I patients, i.e., we stratify these patients by their prognostic survival risks beyond histological classifications. Identification of these high-risk patients helps oncologists to develop effective treatment protocols and post-treatment disease management plans. Using the TCGA data, the total sample is divided into training and validation data sets. After building up a predictive model in the training set, we compute risk scores from the predictive model, and validate associations of risk scores with prognostic outcome in the validation data (P-value=0.015).
Collapse
Affiliation(s)
- Lue Ping Zhao
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, United States; Department of Biostatistics and Epidemiology, University of Washington School of Public Health, Seattle, WA, United States.
| | - Hamid Bolouri
- Division of Human Biology, Fred Hutchinson Cancer Research Center, Seattle, WA, United States
| |
Collapse
|
5
|
Choi JY. A generalized multiple classifier system for improving computer-aided classification of breast masses in mammography. Biomed Eng Lett 2016. [DOI: 10.1007/s13534-015-0191-1] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022] Open
|
6
|
Tan M, Pu J, Zheng B. A new and fast image feature selection method for developing an optimal mammographic mass detection scheme. Med Phys 2015; 41:081906. [PMID: 25086537 DOI: 10.1118/1.4890080] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022] Open
Abstract
PURPOSE Selecting optimal features from a large image feature pool remains a major challenge in developing computer-aided detection (CAD) schemes of medical images. The objective of this study is to investigate a new approach to significantly improve efficacy of image feature selection and classifier optimization in developing a CAD scheme of mammographic masses. METHODS An image dataset including 1600 regions of interest (ROIs) in which 800 are positive (depicting malignant masses) and 800 are negative (depicting CAD-generated false positive regions) was used in this study. After segmentation of each suspicious lesion by a multilayer topographic region growth algorithm, 271 features were computed in different feature categories including shape, texture, contrast, isodensity, spiculation, local topological features, as well as the features related to the presence and location of fat and calcifications. Besides computing features from the original images, the authors also computed new texture features from the dilated lesion segments. In order to select optimal features from this initial feature pool and build a highly performing classifier, the authors examined and compared four feature selection methods to optimize an artificial neural network (ANN) based classifier, namely: (1) Phased Searching with NEAT in a Time-Scaled Framework, (2) A sequential floating forward selection (SFFS) method, (3) A genetic algorithm (GA), and (4) A sequential forward selection (SFS) method. Performances of the four approaches were assessed using a tenfold cross validation method. RESULTS Among these four methods, SFFS has highest efficacy, which takes 3%-5% of computational time as compared to GA approach, and yields the highest performance level with the area under a receiver operating characteristic curve (AUC) = 0.864 ± 0.034. The results also demonstrated that except using GA, including the new texture features computed from the dilated mass segments improved the AUC results of the ANNs optimized using other three feature selection methods. In addition, among 271 features, the shape, local morphological features, fat and calcification based features were the most frequently selected features to build ANNs. CONCLUSIONS Although conventional GA is a powerful tool in optimizing classifiers used in CAD schemes of medical images, it is very computationally intensive. This study demonstrated that using a new SFFS based approach enabled to significantly improve efficacy of image feature selection for developing CAD schemes.
Collapse
Affiliation(s)
- Maxine Tan
- School of Electrical and Computer Engineering, University of Oklahoma, Norman, Oklahoma 73019
| | - Jiantao Pu
- Department of Radiology, University of Pittsburgh, Pittsburgh, Pennsylvania 15213
| | - Bin Zheng
- School of Electrical and Computer Engineering, University of Oklahoma, Norman, Oklahoma 73019 and Department of Radiology, University of Pittsburgh, Pittsburgh, Pennsylvania 15213
| |
Collapse
|
7
|
Tan M, Pu J, Zheng B. Optimization of Network Topology in Computer-Aided Detection Schemes Using Phased Searching with NEAT in a Time-Scaled Framework. Cancer Inform 2014; 13:17-27. [PMID: 25392680 PMCID: PMC4216038 DOI: 10.4137/cin.s13885] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2014] [Revised: 04/01/2014] [Accepted: 04/02/2014] [Indexed: 11/05/2022] Open
Abstract
In the field of computer-aided mammographic mass detection, many different features and classifiers have been tested. Frequently, the relevant features and optimal topology for the artificial neural network (ANN)-based approaches at the classification stage are unknown, and thus determined by trial-and-error experiments. In this study, we analyzed a classifier that evolves ANNs using genetic algorithms (GAs), which combines feature selection with the learning task. The classifier named "Phased Searching with NEAT in a Time-Scaled Framework" was analyzed using a dataset with 800 malignant and 800 normal tissue regions in a 10-fold cross-validation framework. The classification performance measured by the area under a receiver operating characteristic (ROC) curve was 0.856 ± 0.029. The result was also compared with four other well-established classifiers that include fixed-topology ANNs, support vector machines (SVMs), linear discriminant analysis (LDA), and bagged decision trees. The results show that Phased Searching outperformed the LDA and bagged decision tree classifiers, and was only significantly outperformed by SVM. Furthermore, the Phased Searching method required fewer features and discarded superfluous structure or topology, thus incurring a lower feature computational and training and validation time requirement. Analyses performed on the network complexities evolved by Phased Searching indicate that it can evolve optimal network topologies based on its complexification and simplification parameter selection process. From the results, the study also concluded that the three classifiers - SVM, fixed-topology ANN, and Phased Searching with NeuroEvolution of Augmenting Topologies (NEAT) in a Time-Scaled Framework - are performing comparably well in our mammographic mass detection scheme.
Collapse
Affiliation(s)
- Maxine Tan
- School of Electrical and Computer Engineering, University of Oklahoma, Norman, OK, USA
| | - Jiantao Pu
- Department of Radiology, University of Pittsburgh, Pittsburgh, PA, USA
| | - Bin Zheng
- School of Electrical and Computer Engineering, University of Oklahoma, Norman, OK, USA. ; Department of Radiology, University of Pittsburgh, Pittsburgh, PA, USA
| |
Collapse
|
8
|
Choi JY, Kim DH, Plataniotis KN, Ro YM. Computer-aided detection (CAD) of breast masses in mammography: combined detection and ensemble classification. Phys Med Biol 2014; 59:3697-719. [DOI: 10.1088/0031-9155/59/14/3697] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
|
9
|
Tan M, Pu J, Zheng B. Optimization of breast mass classification using sequential forward floating selection (SFFS) and a support vector machine (SVM) model. Int J Comput Assist Radiol Surg 2014; 9:1005-20. [PMID: 24664267 DOI: 10.1007/s11548-014-0992-1] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2013] [Accepted: 03/06/2014] [Indexed: 12/13/2022]
Abstract
PURPOSE Improving radiologists' performance in classification between malignant and benign breast lesions is important to increase cancer detection sensitivity and reduce false-positive recalls. For this purpose, developing computer-aided diagnosis schemes has been attracting research interest in recent years. In this study, we investigated a new feature selection method for the task of breast mass classification. METHODS We initially computed 181 image features based on mass shape, spiculation, contrast, presence of fat or calcifications, texture, isodensity, and other morphological features. From this large image feature pool, we used a sequential forward floating selection (SFFS)-based feature selection method to select relevant features and analyzed their performance using a support vector machine (SVM) model trained for the classification task. On a database of 600 benign and 600 malignant mass regions of interest, we performed the study using a tenfold cross-validation method. Feature selection and optimization of the SVM parameters were conducted on the training subsets only. RESULTS The area under the receiver operating characteristic curve [Formula: see text] was obtained for the classification task. The results also showed that the most frequently selected features by the SFFS-based algorithm in tenfold iterations were those related to mass shape, isodensity, and presence of fat, which are consistent with the image features frequently used by radiologists in the clinical environment for mass classification. The study also indicated that accurately computing mass spiculation features from the projection mammograms was difficult, and failed to perform well for the mass classification task due to tissue overlap within the benign mass regions. CONCLUSION In conclusion, this comprehensive feature analysis study provided new and valuable information for optimizing computerized mass classification schemes that may have potential to be useful as a "second reader" in future clinical practice.
Collapse
Affiliation(s)
- Maxine Tan
- School of Electrical and Computer Engineering, University of Oklahoma, Norman, OK, 73019, USA.
| | - Jiantao Pu
- Department of Radiology, University of Pittsburgh, Pittsburgh, PA, 15213, USA
| | - Bin Zheng
- School of Electrical and Computer Engineering, University of Oklahoma, Norman, OK, 73019, USA
- Department of Radiology, University of Pittsburgh, Pittsburgh, PA, 15213, USA
| |
Collapse
|
10
|
Rakoczy M, McGaughey D, Korenberg MJ, Levman J, Martel AL. Feature selection in computer-aided breast cancer diagnosis via dynamic contrast-enhanced magnetic resonance images. J Digit Imaging 2013; 26:198-208. [PMID: 22828783 DOI: 10.1007/s10278-012-9506-2] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
The accuracy of computer-aided diagnosis (CAD) for early detection and classification of breast cancer in dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI) is dependent upon the features used by the CAD classifier. Here, we show that fast orthogonal search (FOS), which provides a more efficient iterative manner of computing stepwise regression feature selection, can select features with predictive value from a set of kinetic and texture candidate features computed from dynamic contrast-enhanced magnetic resonance images. FOS can in minutes search candidate feature sets of millions of terms, which may include cross-products of features up to second-, third- or fourth-order. This method is tested on a set of 83 DCE-MRI images, of which 20 are for cancerous and 63 for benign cases, using a leave-one-out trial. The features selected by FOS were used in a FOS predictor and nearest-neighbour predictor and had an area under the receiver operating curve (AUC) of 0.889 and 0.791 respectively. The FOS predictor AUC is significantly improved over the signal enhancement ratio predictor with an AUC of 0.706 (p = 0.0035 for the difference in the AUCs). Moreover, using FOS-selected features in a support vector machine increased the AUC over that resulting when the features were manually selected.
Collapse
Affiliation(s)
- Megan Rakoczy
- DLCSPM 4-5, National Defence, 101 Colonel By Dr., Ottawa, Canada, K1A 0K2.
| | | | | | | | | |
Collapse
|
11
|
Mohanty AK, Senapati M, Beberta S, Lenka SK. RETRACTED ARTICLE: Mass classification method in mammograms using correlated association rule mining. Neural Comput Appl 2012. [DOI: 10.1007/s00521-012-0857-x] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
12
|
Way T, Chan HP, Hadjiiski L, Sahiner B, Chughtai A, Song TK, Poopat C, Stojanovska J, Frank L, Attili A, Bogot N, Cascade PN, Kazerooni EA. Computer-aided diagnosis of lung nodules on CT scans: ROC study of its effect on radiologists' performance. Acad Radiol 2010; 17:323-32. [PMID: 20152726 DOI: 10.1016/j.acra.2009.10.016] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2009] [Revised: 10/02/2009] [Accepted: 10/13/2009] [Indexed: 10/19/2022]
Abstract
RATIONALE AND OBJECTIVES The aim of this study was to evaluate the effect of computer-aided diagnosis (CAD) on radiologists' estimates of the likelihood of malignancy of lung nodules on computed tomographic (CT) imaging. METHODS AND MATERIALS A total of 256 lung nodules (124 malignant, 132 benign) were retrospectively collected from the thoracic CT scans of 152 patients. An automated CAD system was developed to characterize and provide malignancy ratings for lung nodules on CT volumetric images. An observer study was conducted using receiver-operating characteristic analysis to evaluate the effect of CAD on radiologists' characterization of lung nodules. Six fellowship-trained thoracic radiologists served as readers. The readers rated the likelihood of malignancy on a scale of 0% to 100% and recommended appropriate action first without CAD and then with CAD. The observer ratings were analyzed using the Dorfman-Berbaum-Metz multireader, multicase method. RESULTS The CAD system achieved a test area under the receiver-operating characteristic curve (A(z)) of 0.857 +/- 0.023 using the perimeter, two nodule radii measures, two texture features, and two gradient field features. All six radiologists obtained improved performance with CAD. The average A(z) of the radiologists improved significantly (P < .01) from 0.833 (range, 0.817-0.847) to 0.853 (range, 0.834-0.887). CONCLUSION CAD has the potential to increase radiologists' accuracy in assessing the likelihood of malignancy of lung nodules on CT imaging.
Collapse
|
13
|
Elter M, Horsch A. CADx of mammographic masses and clustered microcalcifications: A review. Med Phys 2009; 36:2052-68. [PMID: 19610294 DOI: 10.1118/1.3121511] [Citation(s) in RCA: 141] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022] Open
Affiliation(s)
- Matthias Elter
- Fraunhofer Institute for Integrated Circuits, Am Wolfsmantel 33, 91058 Erlangen, Germany.
| | | |
Collapse
|
14
|
Mazurowski MA, Habas PA, Zurada JM, Tourassi GD. Decision optimization of case-based computer-aided decision systems using genetic algorithms with application to mammography. Phys Med Biol 2008; 53:895-908. [PMID: 18263947 DOI: 10.1088/0031-9155/53/4/005] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
This paper presents an optimization framework for improving case-based computer-aided decision (CB-CAD) systems. The underlying hypothesis of the study is that each example in the knowledge database of a medical decision support system has different importance in the decision making process. A new decision algorithm incorporating an importance weight for each example is proposed to account for these differences. The search for the best set of importance weights is defined as an optimization problem and a genetic algorithm is employed to solve it. The optimization process is tailored to maximize the system's performance according to clinically relevant evaluation criteria. The study was performed using a CAD system developed for the classification of regions of interests (ROIs) in mammograms as depicting masses or normal tissue. The system was constructed and evaluated using a dataset of ROIs extracted from the Digital Database for Screening Mammography (DDSM). Experimental results show that, according to receiver operator characteristic (ROC) analysis, the proposed method significantly improves the overall performance of the CAD system as well as its average specificity for high breast mass detection rates.
Collapse
Affiliation(s)
- Maciej A Mazurowski
- Department of Electrical and Computer Engineering, University of Louisville, Louisville, KY 40292, USA.
| | | | | | | |
Collapse
|
15
|
Verma B. Novel network architecture and learning algorithm for the classification of mass abnormalities in digitized mammograms. Artif Intell Med 2008; 42:67-79. [DOI: 10.1016/j.artmed.2007.09.003] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2007] [Revised: 09/18/2007] [Accepted: 09/27/2007] [Indexed: 10/22/2022]
|
16
|
Way TW, Hadjiiski LM, Sahiner B, Chan HP, Cascade PN, Kazerooni EA, Bogot N, Zhou C. Computer-aided diagnosis of pulmonary nodules on CT scans: segmentation and classification using 3D active contours. Med Phys 2006; 33:2323-37. [PMID: 16898434 PMCID: PMC2728558 DOI: 10.1118/1.2207129] [Citation(s) in RCA: 130] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
We are developing a computer-aided diagnosis (CAD) system to classify malignant and benign lung nodules found on CT scans. A fully automated system was designed to segment the nodule from its surrounding structured background in a local volume of interest (VOI) and to extract image features for classification. Image segmentation was performed with a three-dimensional (3D) active contour (AC) method. A data set of 96 lung nodules (44 malignant, 52 benign) from 58 patients was used in this study. The 3D AC model is based on two-dimensional AC with the addition of three new energy components to take advantage of 3D information: (1) 3D gradient, which guides the active contour to seek the object surface, (2) 3D curvature, which imposes a smoothness constraint in the z direction, and (3) mask energy, which penalizes contours that grow beyond the pleura or thoracic wall. The search for the best energy weights in the 3D AC model was guided by a simplex optimization method. Morphological and gray-level features were extracted from the segmented nodule. The rubber band straightening transform (RBST) was applied to the shell of voxels surrounding the nodule. Texture features based on run-length statistics were extracted from the RBST image. A linear discriminant analysis classifier with stepwise feature selection was designed using a second simplex optimization to select the most effective features. Leave-one-case-out resampling was used to train and test the CAD system. The system achieved a test area under the receiver operating characteristic curve (A(z)) of 0.83 +/- 0.04. Our preliminary results indicate that use of the 3D AC model and the 3D texture features surrounding the nodule is a promising approach to the segmentation and classification of lung nodules with CAD. The segmentation performance of the 3D AC model trained with our data set was evaluated with 23 nodules available in the Lung Image Database Consortium (LIDC). The lung nodule volumes segmented by the 3D AC model for best classification were generally larger than those outlined by the LIDC radiologists using visual judgment of nodule boundaries.
Collapse
Affiliation(s)
- Ted W Way
- Department of Radiology, University of Michigan, Ann Arbor, Michigan 48109, USA.
| | | | | | | | | | | | | | | |
Collapse
|
17
|
Varela C, Timp S, Karssemeijer N. Use of border information in the classification of mammographic masses. Phys Med Biol 2006; 51:425-41. [PMID: 16394348 DOI: 10.1088/0031-9155/51/2/016] [Citation(s) in RCA: 67] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
We are developing a new method to characterize the margin of a mammographic mass lesion to improve the classification of benign and malignant masses. Towards this goal, we designed features that measure the degree of sharpness and microlobulation of mass margins. We calculated these features in a border region of the mass defined as a thin band along the mass contour. The importance of these features in the classification of benign and malignant masses was studied in relation to existing features used for mammographic mass detection. Features were divided into three groups, each representing a different mass segment: the interior region of a mass, the border and the outer area. The interior and the outer area of a mass were characterized using contrast and spiculation measures. Classification was done in two steps. First, features representing each of the three mass segments were merged into a neural network classifier resulting in a single regional classification score for each segment. Secondly, a classifier combined the three single scores into a final output to discriminate between benign and malignant lesions. We compared the classification performance of each regional classifier and the combined classifier on a data set of 1076 biopsy proved masses (590 malignant and 486 benign) from 481 women included in the Digital Database for Screening Mammography. Receiver operating characteristic (ROC) analysis was used to evaluate the accuracy of the classifiers. The area under the ROC curve (A(z)) was 0.69 for the interior mass segment, 0.76 for the border segment and 0.75 for the outer mass segment. The performance of the combined classifier was 0.81 for image-based and 0.83 for case-based evaluation. These results show that the combination of information from different mass segments is an effective approach for computer-aided characterization of mammographic masses. An advantage of this approach is that it allows the assessment of the contribution of regions rather than individual features. Results suggest that the border and the outer areas contained the most valuable information for discrimination between benign and malignant masses.
Collapse
Affiliation(s)
- C Varela
- Department of Radiology, Radboud University, Nijmegen Medical Centre, The Netherlands
| | | | | |
Collapse
|
18
|
Abstract
It is well established that radiologists are better able to interpret mammograms when two mammographic views are available. Consequently, two mammographic projections are standard: mediolateral oblique (MLO) and craniocaudal (CC). Computer-aided diagnosis algorithms have been investigated for assisting in the detection and diagnosis of breast lesions in digitized/digital mammograms. A few previous studies suggest that computer-aided systems may also benefit from combining evidence from the two views. Intuitively, we expect that there would only be value in merging data from two views if they provide complementary information. A measure of the similarity of information is the correlation coefficient between corresponding features from the MLO and CC views. The purpose of this study was to investigate the correspondence in Haralick's texture features between the MLO and CC mammographic views of breast lesions. Features were ranked on the basis of correlation values and the two-view correlation of features for subgroups of data including masses versus calcification and benign versus malignant lesions were compared. All experiments were performed on a subset of mammography cases from the Digital Database for Screening Mammography (DDSM). It was observed that the texture features from the MLO and CC views were less strongly correlated for calcification lesions than for mass lesions. Similarly, texture features from the two views were less strongly correlated for benign lesions than for malignant lesions. These differences were statistically significant. The results suggest that the inclusion of texture features from multiple mammographic views in a CADx algorithm may impact the accuracy of diagnosis of calcification lesions and benign lesions.
Collapse
Affiliation(s)
- Shalini Gupta
- Department of Electrical and Computer Engineering, The University of Texas at Austin, Austin, Texas 78712-0240, USA
| | | |
Collapse
|
19
|
Giger ML. Computerized analysis of images in the detection and diagnosis of breast cancer. Semin Ultrasound CT MR 2005; 25:411-8. [PMID: 15559124 DOI: 10.1053/j.sult.2004.07.003] [Citation(s) in RCA: 54] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
Improvements in mammographic acquisition techniques have resulted in making the early signs of breast cancer more apparent on mammograms. However, the accuracy of the overall mammographic examination depends on both the quality of the mammographic images and the ability of the radiologist to interpret those images. While mammography is the best screening method for the early detection of breast cancer, radiologists do miss lesions on mammograms. Use of output, however, from a computerized analysis of an image by a radiologist may help him/her in the detection or diagnostic tasks, and potentially improve the overall interpretation of breast images and the subsequent patient care. Computer-aided detection and diagnosis (CAD) involves the application of computer technology to the process of medical image interpretation. CAD can be defined as a diagnosis made by a radiologist, who uses the output from a computerized analysis of medical images as a "second opinion" in detecting and diagnosing lesions, with the final diagnosis being made by the radiologist. The computer output must be at a sufficient performance level, and in addition, the output must be displayed in a user-friendly format for effective and efficient use by the radiologist. This chapter reviews CAD in breast cancer detection and diagnosis, including examples of image analyses, multi-modality approaches (i.e., special-view diagnostic mammography, ultrasound, and MRI), and means of communicating the computer output to the human.
Collapse
Affiliation(s)
- Maryellen L Giger
- Department of Radiology, University of Chicago, Chicago, IL 60637, USA.
| |
Collapse
|
20
|
Zellner BB, Rand SD, Prost R, Krouwer H, Chetty VK. A cost-minimizing diagnostic methodology for discrimination between neoplastic and non-neoplastic brain lesions. Acad Radiol 2004; 11:169-77. [PMID: 14974592 DOI: 10.1016/s1076-6332(03)00654-8] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
RATIONAL AND OBJECTIVES The purpose of this study was to make an improvement in the performance of a logistic regression model in predicting the presence of brain neoplasia with magnetic resonance spectroscopy data by using a new approach for logistic regression coefficient estimation. This new approach, termed cost minimizing (C-min), introduced by one of the authors (Chetty), uses the cost function for prediction outcomes to estimate model coefficients and the prediction decision rule. To do this requires use of a genetic algorithm. MATERIALS AND METHODS Consecutive patients with suspected brain neoplasms or recurrent neoplasia referred for magnetic resonance spectroscopy were enrolled once a final diagnosis was established by histopathology or clinical course, laboratory data, and serial imaging. For the same magnetic resonance spectroscopy explanatory (input) variables, logistic regression models were constructed with conventional and C-min coefficient estimates, and sensitivity and specificity outcomes were compared at alternative probability threshold levels. RESULTS The C-min approach dominated the conventional approach in 14 of 18 trials, in that C-min had either fewer of both false negatives and false positives, or it had the same number of one type, and less of the other type of diagnostic error. C-min was always less costly. CONCLUSION The C-min approach to logistic or other regression model estimation may be a step forward in reducing the cost and, often, the errors of diagnostic (and treatment) processes. However, this new approach must be validated on larger and more varied datasets, and its statistical performance characteristics determined before it can be implemented as a practical clinical tool.
Collapse
|
21
|
Gurcan MN, Chan HP, Sahiner B, Hadjiiski L, Petrick N, Helvie MA. Optimal neural network architecture selection: improvement in computerized detection of microcalcifications. Acad Radiol 2002; 9:420-9. [PMID: 11942656 DOI: 10.1016/s1076-6332(03)80187-3] [Citation(s) in RCA: 51] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Abstract
RATIONALE AND OBJECTIVES The authors evaluated the effect of optimal neural network architecture selection on the performance of a computer-aided diagnostic system designed to detect microcalcification clusters on digitized mammograms. MATERIALS AND METHODS The authors developed a computer program to detect microcalcification clusters automatically on digitized mammograms. Previously, they found that a properly selected and trained convolution neural network (CNN) could reduce false-positive (FP) findings and therefore improve the accuracy of microcalcification detection. In the current study, they evaluated the effectiveness of the CNN optimized with an automated optimization technique in improving the accuracy of the microcalcification detection program, comparing it with the manually selected CNN. An independent test data set was used, which included 472 mammograms selected from the University of South Florida public database and contained 253 biopsy-proved malignant clusters. RESULTS At an FP rate of 0.7 cluster per image, the film-based sensitivity was 84.6% for the optimized CNN, compared with 77.2% for the manually selected CNN. For clusters imaged on both craniocaudal and mediolateral oblique views, a cluster could be considered detected when it was detected on one or both views. For this case-based approach, at an FP rate of 0.7 per image, the sensitivity was 93.3% for the optimized and 87.0% for the manually selected CNN. CONCLUSION The classification of true and false signals is an important step in the microcalcification detection program. An optimized CNN can effectively reduce FP findings and improve the accuracy of the computer-aided detection system.
Collapse
Affiliation(s)
- Metin N Gurcan
- Department of Radiology, University of Michigan Hospitals, Ann Arbor 48109-0030, USA
| | | | | | | | | | | |
Collapse
|
22
|
Markey MK, Lo JY, Vargas-Voracek R, Tourassi GD, Floyd CE. Perceptron error surface analysis: a case study in breast cancer diagnosis. Comput Biol Med 2002; 32:99-109. [PMID: 11879823 DOI: 10.1016/s0010-4825(01)00035-x] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Perceptrons are typically trained to minimize mean square error (MSE). In computer-aided diagnosis (CAD), model performance is usually evaluated according to other more clinically relevant measures. The purpose of this study was to investigate the relationship between MSE and the area (A(z)) under the receiver operating characteristic (ROC) curve and the high-sensitivity partial ROC area ((0.90)A'(z)). A perceptron was used to predict lesion malignancy based on two mammographic findings and patient age. For each performance measure, the error surface in weight space was visualized. Comparison of the surfaces indicated that minimizing MSE tended to maximize A(z), but not (0.90)A'(z).
Collapse
Affiliation(s)
- Mia K Markey
- Department of Biomedical Engineering, Duke University, Durham, NC 27708, USA.
| | | | | | | | | |
Collapse
|
23
|
|
24
|
Tourassi GD, Frederick ED, Markey MK, Floyd CE. Application of the mutual information criterion for feature selection in computer-aided diagnosis. Med Phys 2001; 28:2394-402. [PMID: 11797941 DOI: 10.1118/1.1418724] [Citation(s) in RCA: 161] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022] Open
Abstract
The purpose of this study was to investigate an information theoretic approach to feature selection for computer-aided diagnosis (CAD). The approach is based on the mutual information (MI) concept. MI measures the general dependence of random variables without making any assumptions about the nature of their underlying relationships. Consequently, MI can potentially offer some advantages over feature selection techniques that focus only on the linear relationships of variables. This study was based on a database of statistical texture features extracted from perfusion lung scans. The ultimate goal was to select the optimal subset of features for the computer-aided diagnosis of acute pulmonary embolism (PE). Initially, the study addressed issues regarding the approximation of MI in a limited dataset as it is often the case in CAD applications. The MI selected features were compared to those features selected using stepwise linear discriminant analysis and genetic algorithms for the same PE database. Linear and nonlinear decision models were implemented to merge the selected features into a final diagnosis. Results showed that the MI is an effective feature selection criterion for nonlinear CAD models overcoming some of the well-known limitations and computational complexities of other popular feature selection techniques in the field.
Collapse
Affiliation(s)
- G D Tourassi
- Department of Radiology, Duke University Medical Center, Durham, North Carolina 27710, USA.
| | | | | | | |
Collapse
|
25
|
Li L, Zheng Y, Zhang L, Clark RA. False-positive reduction in CAD mass detection using a competitive classification strategy. Med Phys 2001; 28:250-8. [PMID: 11243350 DOI: 10.1118/1.1344203] [Citation(s) in RCA: 37] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022] Open
Abstract
High false-positive (FP) rate remains to be one of the major problems to be solved in CAD study because too many false-positively cued signals will potentially degrade the performance of detecting true-positive regions and increase the call-back rate in CAD environment. In this paper, we proposed a novel classification method for FP reduction, where the conventional "hard" decision classifier is cascaded with a "soft" decision classification with the objective to reduce false-positives in the cases with multiple FPs retained after the "hard" decision classification. The "soft" classification takes a competitive classification strategy in which only the "best" ones are selected from the pre-classified suspicious regions as the true mass in each case. A neural network structure is designed to implement the proposed competitive classification. Comparative studies of FP reduction on a database of 79 images by a "hard" decision classification and a combined "hard"-"soft" classification method demonstrated the efficiency of the proposed classification strategy. For example, for the high FP sub-database which has only 31.7% of total images but accounts for 63.5% of whole FPs generated in single "hard" classification, the FPs can be reduced for 56% (from 8.36 to 3.72 per image) by using the proposed method at the cost of 1% TP loss (from 69% to 68%) in whole database, while it can only be reduced for 27% (from 8.36 to 6.08 per image) by simply increasing the threshold of "hard" classifier with a cost of TP loss as high as 14% (from 69% to 55%). On the average in whole database, the FP reduction by hybrid "hard"-"soft" classification is 1.58 per image as compared to 1.11 by "hard" classification at the TP costs described above. Because the cases with high dense tissue are of higher risk of cancer incidence and false-negative detection in mammogram screening, and usually generate more FPs in CAD detection, the method proposed in this paper will be very helpful in improving the performance of early detection of breast cancer with CAD.
Collapse
Affiliation(s)
- L Li
- Department of Radiology, College of Medicine, and the H. Lee Moffitt Cancer Center and Research Institute at the University of South Florida, Tampa 33612, USA.
| | | | | | | |
Collapse
|
26
|
Abstract
The term evolutionary computation encompasses a host of methodologies inspired by natural evolution that are used to solve hard problems. This paper provides an overview of evolutionary computation as applied to problems in the medical domains. We begin by outlining the basic workings of six types of evolutionary algorithms: genetic algorithms, genetic programming, evolution strategies, evolutionary programming, classifier systems, and hybrid systems. We then describe how evolutionary algorithms are applied to solve medical problems, including diagnosis, prognosis, imaging, signal processing, planning, and scheduling. Finally, we provide an extensive bibliography, classified both according to the medical task addressed and according to the evolutionary technique used.
Collapse
Affiliation(s)
- C A Peña-Reyes
- Logic Systems Laboratory, Swiss Federal Institute of Technology, IN-Ecublens, CH-1015, Lausanne, Switzerland.
| | | |
Collapse
|
27
|
|
28
|
Madsen MT, Uppaluri R, Hoffman EA, McLennan G. Pulmonary CT image classification with evolutionary programming. Acad Radiol 1999; 6:736-41. [PMID: 10887895 DOI: 10.1016/s1076-6332(99)80470-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Abstract
RATIONALE AND OBJECTIVES It is often difficult to classify information in medical images from derived features. The purpose of this research was to investigate the use of evolutionary programming as a tool for selecting important features and generating algorithms to classify computed tomographic (CT) images of the lung. MATERIALS AND METHODS Training and test sets consisting of 11 features derived from multiple lung CT images were generated, along with an indicator of the target area from which features originated. The images included five parameters based on histogram analysis, 11 parameters based on run length and co-occurrence matrix measures, and the fractal dimension. Two classification experiments were performed. In the first, the classification task was to distinguish between the subtle but known differences between anterior and posterior portions of transverse lung CT sections. The second classification task was to distinguish normal lung CT images from emphysematous images. The performance of the evolutionary programming approach was compared with that of three statistical classifiers that used the same training and test sets. RESULTS Evolutionary programming produced solutions that compared favorably with those of the statistical classifiers. In separating the anterior from the posterior lung sections, the evolutionary programming results were better than two of the three statistical approaches. The evolutionary programming approach correctly identified all the normal and abnormal lung images and accomplished this by using less features than the best statistical method. CONCLUSION The results of this study demonstrate the utility of evolutionary programming as a tool for developing classification algorithms.
Collapse
Affiliation(s)
- M T Madsen
- Department of Radiology, University of Iowa, Iowa City 52242, USA
| | | | | | | |
Collapse
|
29
|
Chan HP, Sahiner B, Helvie MA, Petrick N, Roubidoux MA, Wilson TE, Adler DD, Paramagul C, Newman JS, Sanjay-Gopal S. Improvement of radiologists' characterization of mammographic masses by using computer-aided diagnosis: an ROC study. Radiology 1999; 212:817-27. [PMID: 10478252 DOI: 10.1148/radiology.212.3.r99au47817] [Citation(s) in RCA: 195] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
PURPOSE To evaluate the effects of computer-aided diagnosis (CAD) on radiologists' classification of malignant and benign masses seen on mammograms. MATERIALS AND METHODS The authors previously developed an automated computer program for estimation of the relative malignancy rating of masses. In the present study, the authors conducted observer performance experiments with receiver operating characteristic (ROC) methodology to evaluate the effects of computer estimates on radiologists' confidence ratings. Six radiologists assessed biopsy-proved masses with and without CAD. Two experiments, one with a single view and the other with two views, were conducted. The classification accuracy was quantified by using the area under the ROC curve, Az. RESULTS For the reading of 238 images, the Az value for the computer classifier was 0.92. The radiologists' Az values ranged from 0.79 to 0.92 without CAD and improved to 0.87-0.96 with CAD. For the reading of a subset of 76 paired views, the radiologists' Az values ranged from 0.88 to 0.95 without CAD and improved to 0.93-0.97 with CAD. Improvements in the reading of the two sets of images were statistically significant (P = .022 and .007, respectively). An improved positive predictive value as a function of the false-negative fraction was predicted from the improved ROC curves. CONCLUSION CAD may be useful for assisting radiologists in classification of masses and thereby potentially help reduce unnecessary biopsies.
Collapse
Affiliation(s)
- H P Chan
- Department of Radiology, University of Michigan Hospital, Ann Arbor 48109-0030, USA.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
30
|
Chan HP, Sahiner B, Lam KL, Petrick N, Helvie MA, Goodsitt MM, Adler DD. Computerized analysis of mammographic microcalcifications in morphological and texture feature spaces. Med Phys 1998; 25:2007-19. [PMID: 9800710 DOI: 10.1118/1.598389] [Citation(s) in RCA: 149] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022] Open
Abstract
We are developing computerized feature extraction and classification methods to analyze malignant and benign microcalcifications on digitized mammograms. Morphological features that described the size, contrast, and shape of microcalcifications and their variations within a cluster were designed to characterize microcalcifications segmented from the mammographic background. Texture features were derived from the spatial gray-level dependence (SGLD) matrices constructed at multiple distances and directions from tissue regions containing microcalcifications. A genetic algorithm (GA) based feature selection technique was used to select the best feature subset from the multi-dimensional feature spaces. The GA-based method was compared to the commonly used feature selection method based on the stepwise linear discriminant analysis (LDA) procedure. Linear discriminant classifiers using the selected features as input predictor variables were formulated for the classification task. The discriminant scores output from the classifiers were analyzed by receiver operating characteristic (ROC) methodology and the classification accuracy was quantified by the area, Az, under the ROC curve. We analyzed a data set of 145 mammographic microcalcification clusters in this study. It was found that the feature subsets selected by the GA-based method are comparable to or slightly better than those selected by the stepwise LDA method. The texture features (Az = 0.84) were more effective than morphological features (Az = 0.79) in distinguishing malignant and benign microcalcifications. The highest classification accuracy (Az = 0.89) was obtained in the combined texture and morphological feature space. The improvement was statistically significant in comparison to classification in either the morphological (p = 0.002) or the texture (p = 0.04) feature space alone. The classifier using the best feature subset from the combined feature space and an appropriate decision threshold could correctly identify 35% of the benign clusters without missing a malignant cluster. When the average discriminant score from all views of the same cluster was used for classification, the Az value increased to 0.93 and the classifier could identify 50% of the benign clusters at 100% sensitivity for malignancy. Alternatively, if the minimum discriminant score from all views of the same cluster was used, the Az value would be 0.90 and a specificity of 32% would be obtained at 100% sensitivity. The results of this study indicate the potential of using combined morphological and texture features for computer-aided classification of microcalcifications.
Collapse
Affiliation(s)
- H P Chan
- Department of Radiology, University of Michigan, Ann Arbor 48109, USA.
| | | | | | | | | | | | | |
Collapse
|