1
|
Wang H, Jia Q, Wang Y, Xue W, Jiang Q, Ning F, Wang J, Zhu Z, Tian L. Stacking learning based on micro-CT radiomics for outcome prediction in the early-stage of silica-induced pulmonary fibrosis model. Heliyon 2024; 10:e30651. [PMID: 38765063 PMCID: PMC11098827 DOI: 10.1016/j.heliyon.2024.e30651] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2023] [Revised: 02/28/2024] [Accepted: 05/01/2024] [Indexed: 05/21/2024] Open
Abstract
Silicosis is a progressive pulmonary fibrosis disease caused by long-term inhalation of silica. The early diagnosis and timely implementation of intervention measures are crucial in preventing silicosis deterioration further. However, the lack of screening and diagnostic measures for early-stage silicosis remains a significant challenge. In this study, silicosis models of varying severity were established through a single exposure to silica with different doses (2.5mg/mice or 5mg/mice) and durations (4 weeks or 12 weeks). The diagnostic performance of computed tomography (CT) quantitative analysis was assessed using lung density biomarkers and the lung density distribution histogram, with a particular focus on non-aerated lung volume. Subsequently, we developed and evaluated a stacking learning model for early diagnosis of silicosis after extracting and selecting features from CT images. The CT quantitative analysis reveals that while the lung densitometric biomarkers and lung density distribution histogram, as traditional indicators, effectively differentiate severe fibrosis models, they are unable to distinguish early-stage silicosis. Furthermore, these findings remained consistent even when employing non-aerated areas, which is a more sensitive indicator. By establishing a radiomics stacking learning model based on non-aerated areas, we can achieve remarkable diagnostic performance to distinguish early-stage silicosis, which can provide a valuable tool for clinical assistant diagnosis. This study reveals the potential of using non-aerated lung areas as a region of interest in stacking learning for early diagnosis of silicosis, providing new insights into early detection of this disease.
Collapse
Affiliation(s)
- Hongwei Wang
- Department of Occupational and Environmental Health, School of Public Health, Capital Medical University, Beijing, 100069, China
- Beijing Key Laboratory of Environmental Toxicology, Capital Medical University, Beijing, 100069, China
| | - Qiyue Jia
- Department of Occupational and Environmental Health, School of Public Health, Capital Medical University, Beijing, 100069, China
- Beijing Key Laboratory of Environmental Toxicology, Capital Medical University, Beijing, 100069, China
| | - Yan Wang
- Department of Occupational and Environmental Health, School of Public Health, Capital Medical University, Beijing, 100069, China
- Beijing Key Laboratory of Environmental Toxicology, Capital Medical University, Beijing, 100069, China
| | - Wenming Xue
- Department of Occupational and Environmental Health, School of Public Health, Capital Medical University, Beijing, 100069, China
- Beijing Key Laboratory of Environmental Toxicology, Capital Medical University, Beijing, 100069, China
| | - Qiyue Jiang
- Department of Occupational and Environmental Health, School of Public Health, Capital Medical University, Beijing, 100069, China
- Beijing Key Laboratory of Environmental Toxicology, Capital Medical University, Beijing, 100069, China
| | - Fuao Ning
- Department of Occupational and Environmental Health, School of Public Health, Capital Medical University, Beijing, 100069, China
- Beijing Key Laboratory of Environmental Toxicology, Capital Medical University, Beijing, 100069, China
| | - Jiaxin Wang
- Department of Occupational and Environmental Health, School of Public Health, Capital Medical University, Beijing, 100069, China
- Beijing Key Laboratory of Environmental Toxicology, Capital Medical University, Beijing, 100069, China
| | - Zhonghui Zhu
- Department of Occupational and Environmental Health, School of Public Health, Capital Medical University, Beijing, 100069, China
- Beijing Key Laboratory of Environmental Toxicology, Capital Medical University, Beijing, 100069, China
| | - Lin Tian
- Department of Occupational and Environmental Health, School of Public Health, Capital Medical University, Beijing, 100069, China
- Beijing Key Laboratory of Environmental Toxicology, Capital Medical University, Beijing, 100069, China
| |
Collapse
|
2
|
Hosseini Chagahi M, Mohammadi Dashtaki S, Moshiri B, Jalil Piran MD. Cardiovascular disease detection using a novel stack-based ensemble classifier with aggregation layer, DOWA operator, and feature transformation. Comput Biol Med 2024; 173:108345. [PMID: 38564852 DOI: 10.1016/j.compbiomed.2024.108345] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2023] [Revised: 03/14/2024] [Accepted: 03/17/2024] [Indexed: 04/04/2024]
Abstract
Due to their widespread prevalence and impact on quality of life, cardiovascular diseases (CVD) pose a considerable global health burden. Early detection and intervention can reduce the incidence, severity, and progression of CVD and prevent premature death. The application of machine learning (ML) techniques to early CVD detection is therefore a valuable approach. In this paper, A stack-based ensemble classifier with an aggregation layer and the dependent ordered weighted averaging (DOWA) operator is proposed for detecting cardiovascular diseases. We propose transforming features using the Johnson transformation technique and normalizing feature distributions. Three diverse first-level classifiers are selected based on their accuracy, and predictions are combined using the aggregation layer and DOWA. A linear support vector machine (SVM) meta-classifier makes the final classification. Adding the aggregation layer to the stacking classifier improves classification accuracy significantly, according to the study. The accuracy is enhanced by 5%, resulting in an impressive overall accuracy of 94.05%. Moreover, the proposed system significantly increases the area under the receiver operating characteristic (ROC) curve compared to recent studies, reaching 97.14%. It further reinforces the classifier's reliability and effectiveness in classifying cardiovascular disease by distinguishing between positive and negative instances. With improved accuracy and a high area under the curve (AUC), the proposed classifier exhibits robustness and superior performance in the detection of cardiovascular diseases.
Collapse
Affiliation(s)
- Mehdi Hosseini Chagahi
- School of Electrical and Computer Engineering, College of Engineering, University of Tehran, Tehran, Iran.
| | - Saeed Mohammadi Dashtaki
- School of Electrical and Computer Engineering, College of Engineering, University of Tehran, Tehran, Iran.
| | - Behzad Moshiri
- School of Electrical and Computer Engineering, College of Engineering, University of Tehran, Tehran, Iran; Department of Electrical and Computer Engineering, University of Waterloo, Waterloo, Canada.
| | - M D Jalil Piran
- Department of Computer Science and Engineering, Sejong University, Seoul 05006, South Korea.
| |
Collapse
|
3
|
Deng F, Zhao L, Yu N, Lin Y, Zhang L. Union With Recursive Feature Elimination: A Feature Selection Framework to Improve the Classification Performance of Multicategory Causes of Death in Colorectal Cancer. J Transl Med 2024; 104:100320. [PMID: 38158124 DOI: 10.1016/j.labinv.2023.100320] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2023] [Revised: 12/05/2023] [Accepted: 12/20/2023] [Indexed: 01/03/2024] Open
Abstract
Despite the use of machine learning tools, it is challenging to properly model cause-specific deaths in colorectal cancer (CRC) patients and choose appropriate treatments. Here, we propose an interesting feature selection framework, namely union with recursive feature elimination (U-RFE), to select the union feature sets that are crucial in CRC progression-specific mortality using The Cancer Genome Atlas (TCGA) dataset. Based on the union feature sets, we compared the performance of 5 classification algorithms, including logistic regression (LR), support vector machines (SVM), random forest (RF), eXtreme gradient boosting (XGBoost), and Stacking, to identify the best model for classifying 4-category deaths. In the first stage of U-RFE, LR, SVM, and RF were used as base estimators to obtain subsets containing the same number of features but not exactly the same specific features. Union analysis of the subsets was then performed to determine the final union feature set, effectively combining the advantages of different algorithms. We found that the U-RFE framework could improve various models' performance. Stacking outperformed LR, SVM, RF, and XGBoost in most scenarios. When the target feature number of the RFE was set to 50 and the union feature set contained 298 deterministic features, the Stacking model achieved F1_weighted, Recall_weighted, Precision_weighted, Accuracy, and Matthews correlation coefficient of 0.851, 0.864, 0.854, 0.864, and 0.717, respectively. The performance of the minority categories was also significantly improved. Therefore, this recursive feature elimination-based approach of feature selection improves performances of classifying CRC deaths using clinical and omics data or those using other data with high feature redundancy and imbalance.
Collapse
Affiliation(s)
- Fei Deng
- School of Electrical and Electronic Engineering, Shanghai Institute of Technology, Shanghai, China.
| | - Lin Zhao
- School of Electrical and Electronic Engineering, Shanghai Institute of Technology, Shanghai, China
| | - Ning Yu
- School of Electrical and Electronic Engineering, Shanghai Institute of Technology, Shanghai, China
| | - Yuxiang Lin
- School of Electrical and Electronic Engineering, Shanghai Institute of Technology, Shanghai, China
| | - Lanjing Zhang
- Department of Biological Sciences, Rutgers University, Newark, New Jersey; Department of Pathology, Princeton Medical Center, Plainsboro, New Jersey; Rutgers Cancer Institute of New Jersey, New Brunswick, New Jersey; Department of Chemical Biology, Ernest Mario School of Pharmacy, Rutgers University, Piscataway, New Jersey.
| |
Collapse
|
4
|
Special Issue on Recent Advances in Machine Learning and Applications. Processes (Basel) 2022. [DOI: 10.3390/pr10112411] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Digital technologies for cyber-physical systems are rapidly advancing, and the ubiquity of the Internet of Things (IoT) has created significant challenges for academic, industrial, and service applications due to high dimensionality, noise contamination, incompleteness, inconsistency, and massive amounts of data [...]
Collapse
|