1
|
Wang J, Wang L, Yang Z, Tan W, Liu Y. Application of machine learning in the analysis of multiparametric MRI data for the differentiation of treatment responses in breast cancer: retrospective study. Eur J Cancer Prev 2025; 34:56-65. [PMID: 38743632 DOI: 10.1097/cej.0000000000000892] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/16/2024]
Abstract
OBJECTIVE The objective of this study is to develop and validate a multiparametric MRI model employing machine learning to predict the effectiveness of treatment and the stage of breast cancer. METHODS The study encompassed 400 female patients diagnosed with breast cancer, with 200 individuals allocated to both the control and experimental groups, undergoing examinations in Shenzhen, China, during the period 2017-2023. This study pertains to retrospective research. Multiparametric MRI was employed to extract data concerning tumor size, blood flow, and metabolism. RESULTS The model achieved high accuracy, predicting treatment outcomes with an accuracy of 92%, sensitivity of 88%, and specificity of 95%. The model effectively classified breast cancer stages: stage I, 38% ( P = 0.027); stage II, 72% ( P = 0.014); stage III, 50% ( P = 0.032); and stage IV, 45% ( P = 0.041). CONCLUSIONS The developed model, utilizing multiparametric MRI and machine learning, exhibits high accuracy in predicting the effectiveness of treatment and breast cancer staging. These findings affirm the model's potential to enhance treatment strategies and personalize approaches for patients diagnosed with breast cancer. Our study presents an innovative approach to the diagnosis and treatment of breast cancer, integrating MRI data with machine learning algorithms. We demonstrate that the developed model exhibits high accuracy in predicting treatment efficacy and differentiating cancer stages. This underscores the importance of utilizing MRI and machine learning algorithms to enhance the diagnosis and individualization of treatment for this disease.
Collapse
Affiliation(s)
- Jinhua Wang
- Medical Imaging Center, Shenzhen Hospital, Southern Medical University, Shenzhen, China
- The Third of Clinical Medicine, Southern Medical University
| | - Liang Wang
- Interventional Department, The University of Hong Kong-Shenzhen Hospital, Shenzhen
| | - Zhongxian Yang
- Medical Imaging Center, Shenzhen Hospital, Southern Medical University, Shenzhen, China
- The Third of Clinical Medicine, Southern Medical University
| | - Wanchang Tan
- Department of Radiology, The Sixth Affiliated Hospital, School of Medicine, South China University of Technology, Foshan, China
| | - Yubao Liu
- Medical Imaging Center, Shenzhen Hospital, Southern Medical University, Shenzhen, China
- The Third of Clinical Medicine, Southern Medical University
| |
Collapse
|
2
|
Fan Z, Song W, Ke Y, Jia L, Li S, Li JJ, Zhang Y, Lin J, Wang B. XGBoost-SHAP-based interpretable diagnostic framework for knee osteoarthritis: a population-based retrospective cohort study. Arthritis Res Ther 2024; 26:213. [PMID: 39696605 DOI: 10.1186/s13075-024-03450-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2024] [Accepted: 12/01/2024] [Indexed: 12/20/2024] Open
Abstract
OBJECTIVE To use routine demographic and clinical data to develop an interpretable individual-level machine learning (ML) model to diagnose knee osteoarthritis (KOA) and to identify highly ranked features. METHODS In this retrospective, population-based cohort study, anonymized questionnaire data was retrieved from the Wu Chuan KOA Study, Inner Mongolia, China. After feature selections, participants were divided in a 7:3 ratio into training and test sets. Class balancing was applied to the training set for data augmentation. Four ML classifiers were compared by cross-validation within the training set and their performance was further analyzed with an unseen test set. Classifications were evaluated using sensitivity, specificity, positive predictive value, negative predictive value, accuracy, area under the curve(AUC), G-means, and F1 scores. The best model was explained using Shapley values to extract highly ranked features. RESULTS A total of 1188 participants were investigated in this study, among whom 26.3% were diagnosed with KOA. Comparatively, XGBoost with Boruta exhibited the highest classification performance among the four models, with an AUC of 0.758, G-means of 0.800, and F1 scores of 0.703. The SHAP method reveals the top 17 features of KOA according to the importance ranking, and the average of the experience of joint pain was recognized as the most important features. CONCLUSIONS Our study highlights the usefulness of machine learning in unveiling important factors that influence the diagnosis of KOA to guide new prevention strategies. Further work is needed to validate this approach.
Collapse
Affiliation(s)
- Zijuan Fan
- Department of Orthopaedic Surgery, The First Affiliated Hospital, Zhejiang University School of Medicine, Qingchun Road No. 79, Hangzhou, China
- Department of Health Statistics, School of Public Health, Sun Yat-sen University, Guangzhou, China
| | - Wenzhu Song
- Department of Big Data in Health Science School of Public Health, Zhejiang University School of Medicine, Hangzhou, Zhejiang, China
| | - Yan Ke
- Arthritis Clinic & Research Center, Peking University People's Hospital, Beijing, China
| | - Ligan Jia
- School of Computer Science and Technology, Xinjiang University, Urumchi, China
| | - Songyan Li
- Department of Orthopaedic Surgery, The First Affiliated Hospital, Zhejiang University School of Medicine, Qingchun Road No. 79, Hangzhou, China
| | - Jiao Jiao Li
- School of Biomedical Engineering, Faculty of Engineering and IT, University of Technology Sydney, Sydney, Australia
| | - Yuqing Zhang
- Harvard Medical School, Boston Massachusetts, USA
| | - Jianhao Lin
- Arthritis Clinic & Research Center, Peking University People's Hospital, Beijing, China.
| | - Bin Wang
- Department of Orthopaedic Surgery, The First Affiliated Hospital, Zhejiang University School of Medicine, Qingchun Road No. 79, Hangzhou, China.
| |
Collapse
|
3
|
Ghazal H, El-Absawy ESA, Ead W, Hasan ME. Machine learning-guided differential gene expression analysis identifies a highly-connected seven-gene cluster in triple-negative breast cancer. Biomedicine (Taipei) 2024; 14:15-35. [PMID: 39777114 PMCID: PMC11703398 DOI: 10.37796/2211-8039.1467] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2024] [Revised: 08/30/2024] [Accepted: 09/02/2024] [Indexed: 01/11/2025] Open
Abstract
Background One of the most challenging cancers is triple-negative breast cancer, which is subdivided into many molecular subtypes. Due to the high degree of heterogeneity, the role of precision medicine remains challenging. With the use of machine learning (ML)-guided gene selection, the differential gene expression analysis can be optimized, and eventually, the process of precision medicine can see great advancement through biomarker discovery. Purpose Enhancing precision medicine in the oncology field by identification of the most representative differentially-expressed genes to be used as biomarkers or as novel drug targets. Methods By utilizing data from the Gene Expression Omnibus (GEO) repository and The Cancer Genome Atlas (TCGA), we identified the differentially expressed genes using the linear model for microarray analysis (LIMMA) and edgeR algorithms, and applied ML-based feature selection using several algorithms. Results A total of 27 genes were selected by merging features identified with both LIMMA and ML-based feature selection methods. The models with the highest area under the curve (AUC) are CatBoost, Extreme Gradient Boosting (XGBoost), Random Forest, and Multi-Layer Perceptron classifiers. ESR1, FOXA1, GATA3, XBP1, GREB1, AR, and AGR2 were identified as hub genes in a highly interconnected cluster. Conclusion ML-based gene selection shows a great impact on the identification of hub genes. The ML models built can improve precision oncology in diagnosis and prognosis. The identified hub genes can serve as biomarkers and warrant further research for potential drug target development.
Collapse
Affiliation(s)
- Hany Ghazal
- Bioinformatics Department, Genetic Engineering and Biotechnology Research Institute, University of Sadat City, Sadat City,
Egypt
| | - El-Sayed A. El-Absawy
- Bioinformatics Department, Genetic Engineering and Biotechnology Research Institute, University of Sadat City, Sadat City,
Egypt
| | - Waleed Ead
- Information Systems Department, Faculty of Computers and Artificial Intelligence, Beni-Suef University, Beni-Suef,
Egypt
| | - Mohamed E. Hasan
- Bioinformatics Department, Genetic Engineering and Biotechnology Research Institute, University of Sadat City, Sadat City,
Egypt
| |
Collapse
|
4
|
Buzatto IPC, Recife SA, Miguel L, Bonini RM, Onari N, Faim ALPA, Silvestre L, Carlotti DP, Fröhlich A, Tiezzi DG. Machine learning can reliably predict malignancy of breast lesions based on clinical and ultrasonographic features. Breast Cancer Res Treat 2024:10.1007/s10549-024-07429-0. [PMID: 39002069 DOI: 10.1007/s10549-024-07429-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2023] [Accepted: 07/02/2024] [Indexed: 07/15/2024]
Abstract
PURPOSE To establish a reliable machine learning model to predict malignancy in breast lesions identified by ultrasound (US) and optimize the negative predictive value to minimize unnecessary biopsies. METHODS We included clinical and ultrasonographic attributes from 1526 breast lesions classified as BI-RADS 3, 4a, 4b, 4c, 5, and 6 that underwent US-guided breast biopsy in four institutions. We selected the most informative attributes to train nine machine learning models, ensemble models and models with tuned threshold to make inferences about the diagnosis of BI-RADS 4a and 4b lesions (validation dataset). We tested the performance of the final model with 403 new suspicious lesions. RESULTS The most informative attributes were shape, margin, orientation and size of the lesions, the resistance index of the internal vessel, the age of the patient and the presence of a palpable lump. The highest mean negative predictive value (NPV) was achieved with the K-Nearest Neighbors algorithm (97.9%). Making ensembles did not improve the performance. Tuning the threshold did improve the performance of the models and we chose the algorithm XGBoost with the tuned threshold as the final one. The tested performance of the final model was: NPV 98.1%, false negative 1.9%, positive predictive value 77.1%, false positive 22.9%. Applying this final model, we would have missed 2 of the 231 malignant lesions of the test dataset (0.8%). CONCLUSION Machine learning can help physicians predict malignancy in suspicious breast lesions identified by the US. Our final model would be able to avoid 60.4% of the biopsies in benign lesions missing less than 1% of the cancer cases.
Collapse
Affiliation(s)
- I P C Buzatto
- Department of Obstetrics and Gynecology - Breast Disease Division, Ribeirão Preto Medical School, University of São Paulo, Ribeirão Preto, Brazil
| | - S A Recife
- Department of Gynecology & Obstetrics, Women's Health Reference Center of Ribeirão Preto (MATER), Ribeirão Preto Medical School, University of São Paulo, Ribeirão Preto, Brazil
| | - L Miguel
- Department of Gynecology & Obstetrics, Women's Health Reference Center of Ribeirão Preto (MATER), Ribeirão Preto Medical School, University of São Paulo, Ribeirão Preto, Brazil
| | - R M Bonini
- Department of Radiology, Hospital de Amor de Campo Grande, Campo Grande, Mato Grosso Do Sul, Brazil
| | - N Onari
- Department of Radiology, Hospital de Amor de Barretos, Barretos, Brazil
| | - A L P A Faim
- Department of Radiology, Hospital de Amor de Barretos, Barretos, Brazil
| | - L Silvestre
- Department of Obstetrics and Gynecology - Ribeirão Preto Medical School, University of São Paulo, Ribeirão Preto, Brazil
| | - D P Carlotti
- Institute of Mathematics and Statistics, University of São Paulo, São Paulo, Brazil
| | - A Fröhlich
- Department of Mathematics, Federal University of Santa Catarina, Florianópolis, Brazil
| | - D G Tiezzi
- Department of Obstetrics and Gynecology - Breast Disease Division and Laboratory for Translational Data Science, Ribeirão Preto Medical School, University of São Paulo, Avenida Bandeirantes 3.900, Monte Alegre, Ribeirão Preto, Ribeirão Preto, Brazil.
- Advanced Research Center in Medicine, Union of the Colleges of the Great Lakes (UNILAGO), São José Do Rio Preto, São Paulo, Brazil.
| |
Collapse
|
5
|
Guo Y, Zhang H, Yuan L, Chen W, Zhao H, Yu QQ, Shi W. Machine learning and new insights for breast cancer diagnosis. J Int Med Res 2024; 52:3000605241237867. [PMID: 38663911 PMCID: PMC11047257 DOI: 10.1177/03000605241237867] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2023] [Accepted: 02/21/2024] [Indexed: 04/28/2024] Open
Abstract
Breast cancer (BC) is the most prominent form of cancer among females all over the world. The current methods of BC detection include X-ray mammography, ultrasound, computed tomography, magnetic resonance imaging, positron emission tomography and breast thermographic techniques. More recently, machine learning (ML) tools have been increasingly employed in diagnostic medicine for its high efficiency in detection and intervention. The subsequent imaging features and mathematical analyses can then be used to generate ML models, which stratify, differentiate and detect benign and malignant breast lesions. Given its marked advantages, radiomics is a frequently used tool in recent research and clinics. Artificial neural networks and deep learning (DL) are novel forms of ML that evaluate data using computer simulation of the human brain. DL directly processes unstructured information, such as images, sounds and language, and performs precise clinical image stratification, medical record analyses and tumour diagnosis. Herein, this review thoroughly summarizes prior investigations on the application of medical images for the detection and intervention of BC using radiomics, namely DL and ML. The aim was to provide guidance to scientists regarding the use of artificial intelligence and ML in research and the clinic.
Collapse
Affiliation(s)
- Ya Guo
- Department of Oncology, Jining No.1 People’s Hospital, Shandong First Medical University, Jining, Shandong Province, China
| | - Heng Zhang
- Department of Laboratory Medicine, Shandong Daizhuang Hospital, Jining, Shandong Province, China
| | - Leilei Yuan
- Department of Oncology, Jining No.1 People’s Hospital, Shandong First Medical University, Jining, Shandong Province, China
| | - Weidong Chen
- Department of Oncology, Jining No.1 People’s Hospital, Shandong First Medical University, Jining, Shandong Province, China
| | - Haibo Zhao
- Department of Oncology, Jining No.1 People’s Hospital, Shandong First Medical University, Jining, Shandong Province, China
| | - Qing-Qing Yu
- Phase I Clinical Research Centre, Jining No.1 People’s Hospital, Shandong First Medical University, Jining, Shandong Province, China
| | - Wenjie Shi
- Molecular and Experimental Surgery, University Clinic for General-, Visceral-, Vascular- and Trans-Plantation Surgery, Medical Faculty University Hospital Magdeburg, Otto-von Guericke University, Magdeburg, Germany
| |
Collapse
|
6
|
Ghorbian M, Ghorbian S. Usefulness of machine learning and deep learning approaches in screening and early detection of breast cancer. Heliyon 2023; 9:e22427. [PMID: 38076050 PMCID: PMC10709063 DOI: 10.1016/j.heliyon.2023.e22427] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2023] [Revised: 11/07/2023] [Accepted: 11/13/2023] [Indexed: 10/16/2024] Open
Abstract
Breast cancer (BC) is one of the most common types of cancer in women, and its prevalence is on the rise. The diagnosis of this disease in the first steps can be highly challenging. Hence, early and rapid diagnosis of this disease in its early stages increases the likelihood of a patient's recovery and survival. This study presents a systematic and detailed analysis of the various ML approaches and mechanisms employed during the BC diagnosis process. Further, this study provides a comprehensive and accurate overview of techniques, approaches, challenges, solutions, and important concepts related to this process in order to provide healthcare professionals and technologists with a deeper understanding of new screening and diagnostic tools and approaches, as well as identify new challenges and popular approaches in this field. Therefore, this study has attempted to provide a comprehensive taxonomy of applying ML techniques to BC diagnosis, focusing on the data obtained from the clinical methods diagnosis. The taxonomy presented in this study has two major components. Clinical diagnostic methods such as MRI, mammography, and hybrid methods are presented in the first part of the taxonomy. The second part involves implementing machine learning approaches such as neural networks (NN), deep learning (DL), and hybrid on the dataset in the first part. Then, the taxonomy will be analyzed based on implementing ML approaches in clinical diagnosis methods. The findings of the study demonstrated that the approaches based on NN and DL are the most accurate and widely used models for BC diagnosis compared to other diagnostic techniques, and accuracy (ACC), sensitivity (SEN), and specificity (SPE) are the most commonly used performance evaluation criteria. Additionally, factors such as the advantages and disadvantages of using machine learning techniques, as well as the objectives of each research, separately for ML technology and BC detection, as well as evaluation criteria, are discussed in this study. Lastly, this study provides an overview of open and unresolved issues related to using ML for BC diagnosis, along with a proposal to resolve each issue to assist researchers and healthcare professionals.
Collapse
Affiliation(s)
- Mohsen Ghorbian
- Department of Computer Engineering, Qom Branch, Islamic Azad University, Qom, Iran
| | - Saeid Ghorbian
- Department of Molecular Genetics, Ahar Branch, Islamic Azad University, Ahar, Iran
| |
Collapse
|
7
|
Zhang YP, Zhang XY, Cheng YT, Li B, Teng XZ, Zhang J, Lam S, Zhou T, Ma ZR, Sheng JB, Tam VCW, Lee SWY, Ge H, Cai J. Artificial intelligence-driven radiomics study in cancer: the role of feature engineering and modeling. Mil Med Res 2023; 10:22. [PMID: 37189155 DOI: 10.1186/s40779-023-00458-8] [Citation(s) in RCA: 31] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/22/2022] [Accepted: 05/04/2023] [Indexed: 05/17/2023] Open
Abstract
Modern medicine is reliant on various medical imaging technologies for non-invasively observing patients' anatomy. However, the interpretation of medical images can be highly subjective and dependent on the expertise of clinicians. Moreover, some potentially useful quantitative information in medical images, especially that which is not visible to the naked eye, is often ignored during clinical practice. In contrast, radiomics performs high-throughput feature extraction from medical images, which enables quantitative analysis of medical images and prediction of various clinical endpoints. Studies have reported that radiomics exhibits promising performance in diagnosis and predicting treatment responses and prognosis, demonstrating its potential to be a non-invasive auxiliary tool for personalized medicine. However, radiomics remains in a developmental phase as numerous technical challenges have yet to be solved, especially in feature engineering and statistical modeling. In this review, we introduce the current utility of radiomics by summarizing research on its application in the diagnosis, prognosis, and prediction of treatment responses in patients with cancer. We focus on machine learning approaches, for feature extraction and selection during feature engineering and for imbalanced datasets and multi-modality fusion during statistical modeling. Furthermore, we introduce the stability, reproducibility, and interpretability of features, and the generalizability and interpretability of models. Finally, we offer possible solutions to current challenges in radiomics research.
Collapse
Affiliation(s)
- Yuan-Peng Zhang
- Department of Medical Informatics, Nantong University, Nantong, 226001, Jiangsu, China
- Department of Health Technology and Informatics, the Hong Kong Polytechnic University, Hong Kong, 999077, China
- The Hong Kong Polytechnic University Shenzhen Research Institute, Shenzhen, 518000, Guangdong, China
| | - Xin-Yun Zhang
- Department of Medical Informatics, Nantong University, Nantong, 226001, Jiangsu, China
| | - Yu-Ting Cheng
- Department of Medical Informatics, Nantong University, Nantong, 226001, Jiangsu, China
| | - Bing Li
- Department of Radiation Oncology, the Affiliated Cancer Hospital of Zhengzhou University and Henan Cancer Hospital, Zhengzhou, 450008, Henan, China
| | - Xin-Zhi Teng
- Department of Health Technology and Informatics, the Hong Kong Polytechnic University, Hong Kong, 999077, China
| | - Jiang Zhang
- Department of Health Technology and Informatics, the Hong Kong Polytechnic University, Hong Kong, 999077, China
| | - Saikit Lam
- Department of Health Technology and Informatics, the Hong Kong Polytechnic University, Hong Kong, 999077, China
| | - Ta Zhou
- Department of Health Technology and Informatics, the Hong Kong Polytechnic University, Hong Kong, 999077, China
| | - Zong-Rui Ma
- Department of Health Technology and Informatics, the Hong Kong Polytechnic University, Hong Kong, 999077, China
| | - Jia-Bao Sheng
- Department of Health Technology and Informatics, the Hong Kong Polytechnic University, Hong Kong, 999077, China
| | - Victor C W Tam
- Department of Health Technology and Informatics, the Hong Kong Polytechnic University, Hong Kong, 999077, China
| | - Shara W Y Lee
- Department of Health Technology and Informatics, the Hong Kong Polytechnic University, Hong Kong, 999077, China
| | - Hong Ge
- Department of Radiation Oncology, the Affiliated Cancer Hospital of Zhengzhou University and Henan Cancer Hospital, Zhengzhou, 450008, Henan, China
| | - Jing Cai
- Department of Health Technology and Informatics, the Hong Kong Polytechnic University, Hong Kong, 999077, China.
- The Hong Kong Polytechnic University Shenzhen Research Institute, Shenzhen, 518000, Guangdong, China.
| |
Collapse
|
8
|
Freehand 1.5T MR-Guided Vacuum-Assisted Breast Biopsy (MR-VABB): Contribution of Radiomics to the Differentiation of Benign and Malignant Lesions. Diagnostics (Basel) 2023; 13:diagnostics13061007. [PMID: 36980315 PMCID: PMC10047866 DOI: 10.3390/diagnostics13061007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2023] [Revised: 02/28/2023] [Accepted: 03/05/2023] [Indexed: 03/09/2023] Open
Abstract
Radiomics and artificial intelligence have been increasingly applied in breast MRI. However, the advantages of using radiomics to evaluate lesions amenable to MR-guided vacuum-assisted breast biopsy (MR-VABB) are unclear. This study includes patients scheduled for MR-VABB, corresponding to subjects with MRI-only visible lesions, i.e., with a negative second-look ultrasound. The first acquisition of the multiphase dynamic contrast-enhanced MRI (DCE-MRI) sequence was selected for image segmentation and radiomics analysis. A total of 80 patients with a mean age of 55.8 years ± 11.8 (SD) were included. The dataset was then split into a training set (50 patients) and a validation set (30 patients). Twenty out of the 30 patients with a positive histology for cancer were in the training set, while the remaining 10 patients with a positive histology were included in the test set. Logistic regression on the training set provided seven features with significant p values (<0.05): (1) ‘AverageIntensity’, (2) ‘Autocorrelation’, (3) ‘Contrast’, (4) ‘Compactness’, (5) ‘StandardDeviation’, (6) ‘MeanAbsoluteDeviation’ and (7) ‘InterquartileRange’. AUC values of 0.86 (95% C.I. 0.73–0.94) for the training set and 0.73 (95% C.I. 0.54–0.87) for the test set were obtained for the radiomics model. Radiological evaluation of the same lesions scheduled for MR-VABB had AUC values of 0.42 (95% C.I. 0.28–0.57) for the training set and 0.4 (0.23–0.59) for the test set. In this study, a radiomics logistic regression model applied to DCE-MRI images increased the diagnostic accuracy of standard radiological evaluation of MRI suspicious findings in women scheduled for MR-VABB. Confirming this performance in large multicentric trials would imply that using radiomics in the assessment of patients scheduled for MR-VABB has the potential to reduce the number of biopsies, in suspicious breast lesions where MR-VABB is required, with clear advantages for patients and healthcare resources.
Collapse
|
9
|
Collinearity and Dimensionality Reduction in Radiomics: Effect of Preprocessing Parameters in Hypertrophic Cardiomyopathy Magnetic Resonance T1 and T2 Mapping. BIOENGINEERING (BASEL, SWITZERLAND) 2023; 10:bioengineering10010080. [PMID: 36671652 PMCID: PMC9854492 DOI: 10.3390/bioengineering10010080] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/05/2022] [Accepted: 12/23/2022] [Indexed: 01/11/2023]
Abstract
Radiomics and artificial intelligence have the potential to become a valuable tool in clinical applications. Frequently, radiomic analyses through machine learning methods present issues caused by high dimensionality and multicollinearity, and redundant radiomic features are usually removed based on correlation analysis. We assessed the effect of preprocessing-in terms of voxel size resampling, discretization, and filtering-on correlation-based dimensionality reduction in radiomic features from cardiac T1 and T2 maps of patients with hypertrophic cardiomyopathy. For different combinations of preprocessing parameters, we performed a dimensionality reduction of radiomic features based on either Pearson's or Spearman's correlation coefficient, followed by the computation of the stability index. With varying resampling voxel size and discretization bin width, for both T1 and T2 maps, Pearson's and Spearman's dimensionality reduction produced a slightly different percentage of remaining radiomic features, with a relatively high stability index. For different filters, the remaining features' stability was instead relatively low. Overall, the percentage of eliminated radiomic features through correlation-based dimensionality reduction was more dependent on resampling voxel size and discretization bin width for textural features than for shape or first-order features. Notably, correlation-based dimensionality reduction was less sensitive to preprocessing when considering radiomic features from T2 compared with T1 maps.
Collapse
|
10
|
de la Luz Escobar M, De la Rosa JI, Galván-Tejada CE, Galvan-Tejada JI, Gamboa-Rosales H, de la Rosa Gomez D, Luna-García H, Celaya-Padilla JM. Breast Cancer Detection Using Automated Segmentation and Genetic Algorithms. Diagnostics (Basel) 2022; 12:diagnostics12123099. [PMID: 36553106 PMCID: PMC9777329 DOI: 10.3390/diagnostics12123099] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2022] [Revised: 11/30/2022] [Accepted: 12/03/2022] [Indexed: 12/13/2022] Open
Abstract
Breast cancer is the most common cancer among women worldwide, after lung cancer. However, early detection of breast cancer can help to reduce death rates in breast cancer patients and also prevent cancer from spreading to other parts of the body. This work proposes a new method to design a bio-marker integrating Bayesian predictive models, pyRadiomics System and genetic algorithms to classify the benign and malignant lesions. The method allows one to evaluate two types of images: The radiologist-segmented lesion, and a novel automated breast cancer detection by the analysis of the whole breast. The results demonstrate only a difference of 12% of effectiveness for the cases of calcification between the radiologist generated segmentation and the automatic whole breast analysis, and a 25% of difference between the lesion and the breast for the cases of masses. In addition, our approach was compared against other proposed methods in the literature, providing an AUC = 0.86 for the analysis of images with lesions in breast calcification, and AUC = 0.96 for masses.
Collapse
|
11
|
Homayoun H, Yee Chan W, Mohammadi A, Yusuf Kuzan T, Mirza-Aghazadeh-Attari M, Wai Ling L, Murzoglu Altintoprak K, Vijayananthan A, Rahmat K, Ab Mumin MRad N, Sam Leong S, Ejtehadifar S, Faeghi F, Abolghasemi J, Ciaccio EJ, Rajendra Acharya U, Abbasian Ardakani A. Artificial Intelligence, BI-RADS Evaluation and Morphometry: A Novel Combination to Diagnose Breast Cancer Using Ultrasonography, Results from Multi-Center Cohorts. Eur J Radiol 2022; 157:110591. [DOI: 10.1016/j.ejrad.2022.110591] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2022] [Revised: 10/07/2022] [Accepted: 11/01/2022] [Indexed: 11/07/2022]
|
12
|
Mammographic Classification of Breast Cancer Microcalcifications through Extreme Gradient Boosting. ELECTRONICS 2022. [DOI: 10.3390/electronics11152435] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
In this paper, we proposed an effective and efficient approach to the classification of breast cancer microcalcifications and evaluated the mathematical model for calcification on mammography with a large medical dataset. We employed several semi-automatic segmentation algorithms to extract 51 calcification features from mammograms, including morphologic and textural features. We adopted extreme gradient boosting (XGBoost) to classify microcalcifications. Then, we compared other machine learning techniques, including k-nearest neighbor (kNN), adaboostM1, decision tree, random decision forest (RDF), and gradient boosting decision tree (GBDT), with XGBoost. XGBoost showed the highest accuracy (90.24%) for classifying microcalcifications, and kNN demonstrated the lowest accuracy. This result demonstrates that it is essential for the classification of microcalcification to use the feature engineering method for the selection of the best composition of features. One of the contributions of this study is to present the best composition of features for efficient classification of breast cancers. This paper finds a way to select the best discriminative features as a collection to improve the accuracy. This study showed the highest accuracy (90.24%) for classifying microcalcifications with AUC = 0.89. Moreover, we highlighted the performance of various features from the dataset and found ideal parameters for classifying microcalcifications. Furthermore, we found that the XGBoost model is suitable both in theory and practice for the classification of calcifications on mammography.
Collapse
|
13
|
Basurto-Hurtado JA, Cruz-Albarran IA, Toledano-Ayala M, Ibarra-Manzano MA, Morales-Hernandez LA, Perez-Ramirez CA. Diagnostic Strategies for Breast Cancer Detection: From Image Generation to Classification Strategies Using Artificial Intelligence Algorithms. Cancers (Basel) 2022; 14:3442. [PMID: 35884503 PMCID: PMC9322973 DOI: 10.3390/cancers14143442] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2022] [Revised: 07/02/2022] [Accepted: 07/12/2022] [Indexed: 02/04/2023] Open
Abstract
Breast cancer is one the main death causes for women worldwide, as 16% of the diagnosed malignant lesions worldwide are its consequence. In this sense, it is of paramount importance to diagnose these lesions in the earliest stage possible, in order to have the highest chances of survival. While there are several works that present selected topics in this area, none of them present a complete panorama, that is, from the image generation to its interpretation. This work presents a comprehensive state-of-the-art review of the image generation and processing techniques to detect Breast Cancer, where potential candidates for the image generation and processing are presented and discussed. Novel methodologies should consider the adroit integration of artificial intelligence-concepts and the categorical data to generate modern alternatives that can have the accuracy, precision and reliability expected to mitigate the misclassifications.
Collapse
Affiliation(s)
- Jesus A. Basurto-Hurtado
- C.A. Mecatrónica, Facultad de Ingeniería, Campus San Juan del Río, Universidad Autónoma de Querétaro, Rio Moctezuma 249, San Cayetano, San Juan del Rio 76807, Mexico; (J.A.B.-H.); (I.A.C.-A.)
- Laboratorio de Dispositivos Médicos, Facultad de Ingeniería, Universidad Autónoma de Querétaro, Carretera a Chichimequillas S/N, Ejido Bolaños, Santiago de Querétaro 76140, Mexico
| | - Irving A. Cruz-Albarran
- C.A. Mecatrónica, Facultad de Ingeniería, Campus San Juan del Río, Universidad Autónoma de Querétaro, Rio Moctezuma 249, San Cayetano, San Juan del Rio 76807, Mexico; (J.A.B.-H.); (I.A.C.-A.)
- Laboratorio de Dispositivos Médicos, Facultad de Ingeniería, Universidad Autónoma de Querétaro, Carretera a Chichimequillas S/N, Ejido Bolaños, Santiago de Querétaro 76140, Mexico
| | - Manuel Toledano-Ayala
- División de Investigación y Posgrado de la Facultad de Ingeniería (DIPFI), Universidad Autónoma de Querétaro, Cerro de las Campanas S/N Las Campanas, Santiago de Querétaro 76010, Mexico;
| | - Mario Alberto Ibarra-Manzano
- Laboratorio de Procesamiento Digital de Señales, Departamento de Ingeniería Electrónica, Division de Ingenierias Campus Irapuato-Salamanca (DICIS), Universidad de Guanajuato, Carretera Salamanca-Valle de Santiago KM. 3.5 + 1.8 Km., Salamanca 36885, Mexico;
| | - Luis A. Morales-Hernandez
- C.A. Mecatrónica, Facultad de Ingeniería, Campus San Juan del Río, Universidad Autónoma de Querétaro, Rio Moctezuma 249, San Cayetano, San Juan del Rio 76807, Mexico; (J.A.B.-H.); (I.A.C.-A.)
| | - Carlos A. Perez-Ramirez
- Laboratorio de Dispositivos Médicos, Facultad de Ingeniería, Universidad Autónoma de Querétaro, Carretera a Chichimequillas S/N, Ejido Bolaños, Santiago de Querétaro 76140, Mexico
| |
Collapse
|