1
|
Lu W, Zhao L, Wang S, Zhang H, Jiang K, Ji J, Chen S, Wang C, Wei C, Zhou R, Wang Z, Li X, Wang F, Wei X, Hou W. Explainable and visualizable machine learning models to predict biochemical recurrence of prostate cancer. Clin Transl Oncol 2024; 26:2369-2379. [PMID: 38602643 DOI: 10.1007/s12094-024-03480-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2024] [Accepted: 03/23/2024] [Indexed: 04/12/2024]
Abstract
PURPOSE Machine learning (ML) models presented an excellent performance in the prognosis prediction. However, the black box characteristic of ML models limited the clinical applications. Here, we aimed to establish explainable and visualizable ML models to predict biochemical recurrence (BCR) of prostate cancer (PCa). MATERIALS AND METHODS A total of 647 PCa patients were retrospectively evaluated. Clinical parameters were identified using LASSO regression. Then, cohort was split into training and validation datasets with a ratio of 0.75:0.25 and BCR-related features were included in Cox regression and five ML algorithm to construct BCR prediction models. The clinical utility of each model was evaluated by concordance index (C-index) values and decision curve analyses (DCA). Besides, Shapley Additive Explanation (SHAP) values were used to explain the features in the models. RESULTS We identified 11 BCR-related features using LASSO regression, then establishing five ML-based models, including random survival forest (RSF), survival support vector machine (SSVM), survival Tree (sTree), gradient boosting decision tree (GBDT), extreme gradient boosting (XGBoost), and a Cox regression model, C-index were 0.846 (95%CI 0.796-0.894), 0.774 (95%CI 0.712-0.834), 0.757 (95%CI 0.694-0.818), 0.820 (95%CI 0.765-0.869), 0.793 (95%CI 0.735-0.852), and 0.807 (95%CI 0.753-0.858), respectively. The DCA showed that RSF model had significant advantages over all models. In interpretability of ML models, the SHAP value demonstrated the tangible contribution of each feature in RSF model. CONCLUSIONS Our score system provide reference for the identification for BCR, and the crafting of a framework for making therapeutic decisions for PCa on a personalized basis.
Collapse
Affiliation(s)
- Wenhao Lu
- Collaborative Innovation Centre of Regenerative Medicine and Medical BioResource Development and Application Co-Constructed By the Province and Ministry, Guangxi Medical University, No. 22, Shuangyong Road, Qingxiu District, Nanning City, 530021, Guangxi Zhuang Autonomous Region, People's Republic of China
- Center for Genomic and Personalized Medicine, Guangxi Key Laboratory for Genomic and Personalized Medicine, Guangxi Collaborative Innovation Center for Genomic and Personalized Medicine, Guangxi Medical University, Nanning, 530021, Guangxi, People's Republic of China
- Department of Urology, the First Affiliated Hospital of Guangxi Medical University, Guangxi Medical University, Guangxi, 530021, People's Republic of China
- School of Life Sciences, Guangxi Medical University, Nanning, 530021, Guangxi, People's Republic of China
| | - Lin Zhao
- Department of Urology, Shanghai Changhai Hospital, Second Military Medical University, Shanghai, 200433, People's Republic of China
| | - Shenfan Wang
- Department of Urology, Shanghai Changhai Hospital, Second Military Medical University, Shanghai, 200433, People's Republic of China
| | - Huiyong Zhang
- Center for Genomic and Personalized Medicine, Guangxi Key Laboratory for Genomic and Personalized Medicine, Guangxi Collaborative Innovation Center for Genomic and Personalized Medicine, Guangxi Medical University, Nanning, 530021, Guangxi, People's Republic of China
- School of Life Sciences, Guangxi Medical University, Nanning, 530021, Guangxi, People's Republic of China
| | - Kangxian Jiang
- Department of Urology, The Second Affiliated Hospital of Fujian Medical University, Quanzhou, 362000, People's Republic of China
| | - Jin Ji
- Department of Urology, Shanghai Changhai Hospital, Second Military Medical University, Shanghai, 200433, People's Republic of China
| | - Shaohua Chen
- Center for Genomic and Personalized Medicine, Guangxi Key Laboratory for Genomic and Personalized Medicine, Guangxi Collaborative Innovation Center for Genomic and Personalized Medicine, Guangxi Medical University, Nanning, 530021, Guangxi, People's Republic of China
- Department of Urology, the First Affiliated Hospital of Guangxi Medical University, Guangxi Medical University, Guangxi, 530021, People's Republic of China
| | - Chengbang Wang
- Center for Genomic and Personalized Medicine, Guangxi Key Laboratory for Genomic and Personalized Medicine, Guangxi Collaborative Innovation Center for Genomic and Personalized Medicine, Guangxi Medical University, Nanning, 530021, Guangxi, People's Republic of China
| | - Chunmeng Wei
- Center for Genomic and Personalized Medicine, Guangxi Key Laboratory for Genomic and Personalized Medicine, Guangxi Collaborative Innovation Center for Genomic and Personalized Medicine, Guangxi Medical University, Nanning, 530021, Guangxi, People's Republic of China
- Department of Urology, the First Affiliated Hospital of Guangxi Medical University, Guangxi Medical University, Guangxi, 530021, People's Republic of China
- School of Life Sciences, Guangxi Medical University, Nanning, 530021, Guangxi, People's Republic of China
| | - Rongbin Zhou
- Collaborative Innovation Centre of Regenerative Medicine and Medical BioResource Development and Application Co-Constructed By the Province and Ministry, Guangxi Medical University, No. 22, Shuangyong Road, Qingxiu District, Nanning City, 530021, Guangxi Zhuang Autonomous Region, People's Republic of China
- Center for Genomic and Personalized Medicine, Guangxi Key Laboratory for Genomic and Personalized Medicine, Guangxi Collaborative Innovation Center for Genomic and Personalized Medicine, Guangxi Medical University, Nanning, 530021, Guangxi, People's Republic of China
- School of Life Sciences, Guangxi Medical University, Nanning, 530021, Guangxi, People's Republic of China
| | - Zuheng Wang
- Center for Genomic and Personalized Medicine, Guangxi Key Laboratory for Genomic and Personalized Medicine, Guangxi Collaborative Innovation Center for Genomic and Personalized Medicine, Guangxi Medical University, Nanning, 530021, Guangxi, People's Republic of China
- Department of Urology, the First Affiliated Hospital of Guangxi Medical University, Guangxi Medical University, Guangxi, 530021, People's Republic of China
| | - Xiao Li
- Collaborative Innovation Centre of Regenerative Medicine and Medical BioResource Development and Application Co-Constructed By the Province and Ministry, Guangxi Medical University, No. 22, Shuangyong Road, Qingxiu District, Nanning City, 530021, Guangxi Zhuang Autonomous Region, People's Republic of China
- Center for Genomic and Personalized Medicine, Guangxi Key Laboratory for Genomic and Personalized Medicine, Guangxi Collaborative Innovation Center for Genomic and Personalized Medicine, Guangxi Medical University, Nanning, 530021, Guangxi, People's Republic of China
- School of Life Sciences, Guangxi Medical University, Nanning, 530021, Guangxi, People's Republic of China
| | - Fubo Wang
- Collaborative Innovation Centre of Regenerative Medicine and Medical BioResource Development and Application Co-Constructed By the Province and Ministry, Guangxi Medical University, No. 22, Shuangyong Road, Qingxiu District, Nanning City, 530021, Guangxi Zhuang Autonomous Region, People's Republic of China.
- Center for Genomic and Personalized Medicine, Guangxi Key Laboratory for Genomic and Personalized Medicine, Guangxi Collaborative Innovation Center for Genomic and Personalized Medicine, Guangxi Medical University, Nanning, 530021, Guangxi, People's Republic of China.
- Department of Urology, the First Affiliated Hospital of Guangxi Medical University, Guangxi Medical University, Guangxi, 530021, People's Republic of China.
- School of Life Sciences, Guangxi Medical University, Nanning, 530021, Guangxi, People's Republic of China.
| | - Xuedong Wei
- Department of Urology, the First Affiliated Hospital of Soochow University, Suzhou, 210000, Jiangsu, People's Republic of China.
| | - Wenlei Hou
- Information Technology School of Guangxi Police College, Nanning, 530021, Guangxi, People's Republic of China.
| |
Collapse
|
2
|
Pergaris A, Levidou G, Mandrakis G, Christodoulou MI, Karamouzis MV, Klijanienko J, Theocharis S. The Impact of DAXX, HJURP and CENPA Expression in Uveal Melanoma Carcinogenesis and Associations with Clinicopathological Parameters. Biomedicines 2024; 12:1772. [PMID: 39200236 PMCID: PMC11351862 DOI: 10.3390/biomedicines12081772] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2024] [Revised: 07/31/2024] [Accepted: 08/02/2024] [Indexed: 09/02/2024] Open
Abstract
Uveal melanomas (UMs) represent rare malignant tumors associated with grim prognosis for the majority of patients. DAXX (Death Domain-Associated Protein), HJURP (Holliday Junction Recognition Protein) and CENPA (Centromere Protein A) proteins are implicated in epigenetic mechanisms, now in the spotlight of cancer research to better understand the molecular background of tumorigenesis. Herein, we investigated their expression in UM tissues using immunohistochemistry and explored possible correlations with a multitude of clinicopathological and survival parameters. The Cancer Genome Atlas Program (TCGA) was used for the investigation of their mRNA levels in UM cases. Nuclear DAXX expression correlated with an advanced T-stage (p = 0.004), while cytoplasmic expression marginally with decreased disease-free survival (DFS) (p = 0.084). HJURP nuclear positivity also correlated with advanced T-status (p = 0.054), chromosome 3 loss (p = 0.042) and increased tumor size (p = 0.03). More importantly, both nuclear and cytoplasmic HJURP immunopositivity correlated with decreased overall survival (OS) (p = 0.011 and 0.072, respectively) and worse DFS (p = 0.071 and 0.019, respectively). Lastly, nuclear CENPA overexpression was correlated with presence of irido-corneal angle involvement (p = 0.015) and loss of chromosome 3 (p = 0.041). Nuclear and cytoplasmic CENPA immunopositivity associated with decreased OS (p = 0.028) and DFS (p = 0.018), respectively. HJURP and CENPA mRNA overexpression exhibited strong association with tumor epithelioid histology and was linked to worse prognosis. Our results show the compounding role of DAXX, HJURP and CENPA in UM carcinogenesis, designating them as potential biomarkers for assessing prognosis and possible targets for novel therapeutic interventions.
Collapse
Affiliation(s)
- Alexandros Pergaris
- First Department of Pathology, Medical School, National and Kapodistrian University of Athens, 11527 Athens, Greece; (A.P.); (G.M.)
| | - Georgia Levidou
- Department of Pathology, Paracelsus Medical University, 90419 Nuremberg, Germany;
| | - Georgios Mandrakis
- First Department of Pathology, Medical School, National and Kapodistrian University of Athens, 11527 Athens, Greece; (A.P.); (G.M.)
| | - Maria-Ioanna Christodoulou
- Tumor Immunology and Biomarkers Laboratory, Basic and Translational Cancer Research Center, Department of Life Sciences, European University Cyprus, Nicosia 2404, Cyprus;
| | - Michail V. Karamouzis
- Molecular Oncology Unit, Department of Biological Chemistry, Medical School, National and Kapodistrian University of Athens, 11527 Athens, Greece;
| | | | - Stamatios Theocharis
- First Department of Pathology, Medical School, National and Kapodistrian University of Athens, 11527 Athens, Greece; (A.P.); (G.M.)
| |
Collapse
|
3
|
Zhu Y, Li Z, Wu Z, Zhuo T, Dai L, Liang G, Peng H, Lu H, Wang Y. MIS18A upregulation promotes cell viability, migration and tumor immune evasion in lung adenocarcinoma. Oncol Lett 2024; 28:376. [PMID: 38910901 PMCID: PMC11190817 DOI: 10.3892/ol.2024.14509] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2024] [Accepted: 05/13/2024] [Indexed: 06/25/2024] Open
Abstract
Lung adenocarcinoma (LUAD) presents a significant global health challenge owing to its poor prognosis and high mortality rates. Despite its involvement in the initiation and progression of a number of cancer types, the understanding of the precise impact of MIS18 kinetochore protein A (MIS18A) on LUAD remains incomplete. In the present study, the role of MIS18A in LUAD was investigated by analyzing the genomic and clinical data from multiple public datasets. The expression of MIS18A was validated using reverse transcription-quantitative polymerase chain reaction, and in vitro experiments involving small interfering RNA-induced downregulation of MIS18A in lung cancer cells were conducted to further explore its impact. These findings revealed that elevated MIS18A expression in LUAD was associated with advanced clinical features and poor prognosis. Functional analysis also revealed the role of MIS18A in regulating the cell cycle and immune-related pathways. Moreover, MIS18A altered the immune microenvironment in LUAD, influencing its response to immunotherapy and drug sensitivity. The results of the in vitro experiments indicated that suppression of MIS18A expression reduced the proliferative and migratory capacities of LUAD cells. In summary, MIS18A possesses potential as a biomarker and may serve as a possible therapeutic target for LUAD, with significant implications for tumor progression by influencing both cell cycle dynamics and immune infiltration.
Collapse
Affiliation(s)
- Yongjie Zhu
- Department of Cardio-Thoracic Surgery, The First Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi Zhuang Autonomous Region 530021, P.R. China
| | - Zihao Li
- Department of Thoracic Surgery, Liuzhou People's Hospital Affiliated to Guangxi Medical University, Liuzhou, Guangxi Zhuang Autonomous Region 545026, P.R. China
| | - Zuotao Wu
- Department of Cardio-Thoracic Surgery, The First Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi Zhuang Autonomous Region 530021, P.R. China
| | - Ting Zhuo
- Department of Pulmonary and Critical Care Medicine, The First Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi Zhuang Autonomous Region 530021, P.R. China
| | - Lei Dai
- Department of Cardio-Thoracic Surgery, The First Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi Zhuang Autonomous Region 530021, P.R. China
| | - Guanbiao Liang
- Department of Cardio-Thoracic Surgery, The First Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi Zhuang Autonomous Region 530021, P.R. China
| | - Huajian Peng
- Department of Cardio-Thoracic Surgery, The First Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi Zhuang Autonomous Region 530021, P.R. China
| | - Honglin Lu
- Department of Cardio-Thoracic Surgery, The First Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi Zhuang Autonomous Region 530021, P.R. China
| | - Yongyong Wang
- Department of Cardio-Thoracic Surgery, The First Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi Zhuang Autonomous Region 530021, P.R. China
| |
Collapse
|
4
|
Abbasi AF, Asim MN, Ahmed S, Vollmer S, Dengel A. Survival prediction landscape: an in-depth systematic literature review on activities, methods, tools, diseases, and databases. Front Artif Intell 2024; 7:1428501. [PMID: 39021434 PMCID: PMC11252047 DOI: 10.3389/frai.2024.1428501] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2024] [Accepted: 06/12/2024] [Indexed: 07/20/2024] Open
Abstract
Survival prediction integrates patient-specific molecular information and clinical signatures to forecast the anticipated time of an event, such as recurrence, death, or disease progression. Survival prediction proves valuable in guiding treatment decisions, optimizing resource allocation, and interventions of precision medicine. The wide range of diseases, the existence of various variants within the same disease, and the reliance on available data necessitate disease-specific computational survival predictors. The widespread adoption of artificial intelligence (AI) methods in crafting survival predictors has undoubtedly revolutionized this field. However, the ever-increasing demand for more sophisticated and effective prediction models necessitates the continued creation of innovative advancements. To catalyze these advancements, it is crucial to bring existing survival predictors knowledge and insights into a centralized platform. The paper in hand thoroughly examines 23 existing review studies and provides a concise overview of their scope and limitations. Focusing on a comprehensive set of 90 most recent survival predictors across 44 diverse diseases, it delves into insights of diverse types of methods that are used in the development of disease-specific predictors. This exhaustive analysis encompasses the utilized data modalities along with a detailed analysis of subsets of clinical features, feature engineering methods, and the specific statistical, machine or deep learning approaches that have been employed. It also provides insights about survival prediction data sources, open-source predictors, and survival prediction frameworks.
Collapse
Affiliation(s)
- Ahtisham Fazeel Abbasi
- Department of Computer Science, Rhineland-Palatinate Technical University of Kaiserslautern-Landau, Kaiserslautern, Germany
- Smart Data & Knowledge Services, Deutsches Forschungszentrum für Künstliche Intelligenz (DFKI), Kaiserslautern, Germany
| | - Muhammad Nabeel Asim
- Smart Data & Knowledge Services, Deutsches Forschungszentrum für Künstliche Intelligenz (DFKI), Kaiserslautern, Germany
| | - Sheraz Ahmed
- Smart Data & Knowledge Services, Deutsches Forschungszentrum für Künstliche Intelligenz (DFKI), Kaiserslautern, Germany
| | - Sebastian Vollmer
- Department of Computer Science, Rhineland-Palatinate Technical University of Kaiserslautern-Landau, Kaiserslautern, Germany
- Smart Data & Knowledge Services, Deutsches Forschungszentrum für Künstliche Intelligenz (DFKI), Kaiserslautern, Germany
| | - Andreas Dengel
- Department of Computer Science, Rhineland-Palatinate Technical University of Kaiserslautern-Landau, Kaiserslautern, Germany
- Smart Data & Knowledge Services, Deutsches Forschungszentrum für Künstliche Intelligenz (DFKI), Kaiserslautern, Germany
| |
Collapse
|
5
|
Kikuchi T, Hanaoka S, Nakao T, Takenaga T, Nomura Y, Mori H, Yoshikawa T. Synthesis of Hybrid Data Consisting of Chest Radiographs and Tabular Clinical Records Using Dual Generative Models for COVID-19 Positive Cases. JOURNAL OF IMAGING INFORMATICS IN MEDICINE 2024; 37:1217-1227. [PMID: 38351224 PMCID: PMC11169290 DOI: 10.1007/s10278-024-01015-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/26/2023] [Revised: 12/21/2023] [Accepted: 12/22/2023] [Indexed: 06/13/2024]
Abstract
To generate synthetic medical data incorporating image-tabular hybrid data by merging an image encoding/decoding model with a table-compatible generative model and assess their utility. We used 1342 cases from the Stony Brook University Covid-19-positive cases, comprising chest X-ray radiographs (CXRs) and tabular clinical data as a private dataset (pDS). We generated a synthetic dataset (sDS) through the following steps: (I) dimensionally reducing CXRs in the pDS using a pretrained encoder of the auto-encoding generative adversarial networks (αGAN) and integrating them with the correspondent tabular clinical data; (II) training the conditional tabular GAN (CTGAN) on this combined data to generate synthetic records, encompassing encoded image features and clinical data; and (III) reconstructing synthetic images from these encoded image features in the sDS using a pretrained decoder of the αGAN. The utility of sDS was assessed by the performance of the prediction models for patient outcomes (deceased or discharged). For the pDS test set, the area under the receiver operating characteristic (AUC) curve was calculated to compare the performance of prediction models trained separately with pDS, sDS, or a combination of both. We created an sDS comprising CXRs with a resolution of 256 × 256 pixels and tabular data containing 13 variables. The AUC for the outcome was 0.83 when the model was trained with the pDS, 0.74 with the sDS, and 0.87 when combining pDS and sDS for training. Our method is effective for generating synthetic records consisting of both images and tabular clinical data.
Collapse
Affiliation(s)
- Tomohiro Kikuchi
- Department of Computational Diagnostic Radiology and Preventive Medicine, The University of Tokyo Hospital, 7-3-1 Hongo, Bunkyo-ku, Tokyo, 113-8655, Japan.
- Department of Radiology, School of Medicine, Jichi Medical University, 3311-1 Yakushiji, Shimotsuke, Tochigi, 329-0498, Japan.
| | - Shouhei Hanaoka
- Department of Radiology, The University of Tokyo Hospital, 7-3-1 Hongo, Bunkyo-ku, Tokyo, Japan
| | - Takahiro Nakao
- Department of Computational Diagnostic Radiology and Preventive Medicine, The University of Tokyo Hospital, 7-3-1 Hongo, Bunkyo-ku, Tokyo, 113-8655, Japan
| | - Tomomi Takenaga
- Department of Radiology, The University of Tokyo Hospital, 7-3-1 Hongo, Bunkyo-ku, Tokyo, Japan
| | - Yukihiro Nomura
- Department of Computational Diagnostic Radiology and Preventive Medicine, The University of Tokyo Hospital, 7-3-1 Hongo, Bunkyo-ku, Tokyo, 113-8655, Japan
- Center for Frontier Medical Engineering, Chiba University, 1-33 Yayoi-cho, Inage-ku, Chiba, 263-8522, Japan
| | - Harushi Mori
- Department of Radiology, School of Medicine, Jichi Medical University, 3311-1 Yakushiji, Shimotsuke, Tochigi, 329-0498, Japan
| | - Takeharu Yoshikawa
- Department of Computational Diagnostic Radiology and Preventive Medicine, The University of Tokyo Hospital, 7-3-1 Hongo, Bunkyo-ku, Tokyo, 113-8655, Japan
| |
Collapse
|
6
|
Ko E, Kim Y, Shokoohi F, Mersha TB, Kang M. SPIN: sex-specific and pathway-based interpretable neural network for sexual dimorphism analysis. Brief Bioinform 2024; 25:bbae239. [PMID: 38807262 PMCID: PMC11133003 DOI: 10.1093/bib/bbae239] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2023] [Revised: 03/29/2024] [Accepted: 04/26/2024] [Indexed: 05/30/2024] Open
Abstract
Sexual dimorphism in prevalence, severity and genetic susceptibility exists for most common diseases. However, most genetic and clinical outcome studies are designed in sex-combined framework considering sex as a covariate. Few sex-specific studies have analyzed males and females separately, which failed to identify gene-by-sex interaction. Here, we propose a novel unified biologically interpretable deep learning-based framework (named SPIN) for sexual dimorphism analysis. We demonstrate that SPIN significantly improved the C-index up to 23.6% in TCGA cancer datasets, and it was further validated using asthma datasets. In addition, SPIN identifies sex-specific and -shared risk loci that are often missed in previous sex-combined/-separate analysis. We also show that SPIN is interpretable for explaining how biological pathways contribute to sexual dimorphism and improve risk prediction in an individual level, which can result in the development of precision medicine tailored to a specific individual's characteristics.
Collapse
Affiliation(s)
- Euiseong Ko
- Department of Computer Science, University of Nevada, Las Vegas, Las Vegas, NV, USA
| | - Youngsoon Kim
- Department of Information and Statistics and Department of Bio&Medical Bigdata (BK21 Four program), Gyeongsang National University, Jinju, Republic of Korea
| | - Farhad Shokoohi
- Department of Mathematical Sciences, University of Nevada, Las Vegas, Las Vegas, NV, USA
| | - Tesfaye B Mersha
- Department of Pediatrics, Cincinnati Children’s Hospital Medical Center, University of Cincinnati, Cincinnati, OH, USA
| | - Mingon Kang
- Department of Computer Science, University of Nevada, Las Vegas, Las Vegas, NV, USA
| |
Collapse
|
7
|
Deng F, Zhao L, Yu N, Lin Y, Zhang L. Union With Recursive Feature Elimination: A Feature Selection Framework to Improve the Classification Performance of Multicategory Causes of Death in Colorectal Cancer. J Transl Med 2024; 104:100320. [PMID: 38158124 DOI: 10.1016/j.labinv.2023.100320] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2023] [Revised: 12/05/2023] [Accepted: 12/20/2023] [Indexed: 01/03/2024] Open
Abstract
Despite the use of machine learning tools, it is challenging to properly model cause-specific deaths in colorectal cancer (CRC) patients and choose appropriate treatments. Here, we propose an interesting feature selection framework, namely union with recursive feature elimination (U-RFE), to select the union feature sets that are crucial in CRC progression-specific mortality using The Cancer Genome Atlas (TCGA) dataset. Based on the union feature sets, we compared the performance of 5 classification algorithms, including logistic regression (LR), support vector machines (SVM), random forest (RF), eXtreme gradient boosting (XGBoost), and Stacking, to identify the best model for classifying 4-category deaths. In the first stage of U-RFE, LR, SVM, and RF were used as base estimators to obtain subsets containing the same number of features but not exactly the same specific features. Union analysis of the subsets was then performed to determine the final union feature set, effectively combining the advantages of different algorithms. We found that the U-RFE framework could improve various models' performance. Stacking outperformed LR, SVM, RF, and XGBoost in most scenarios. When the target feature number of the RFE was set to 50 and the union feature set contained 298 deterministic features, the Stacking model achieved F1_weighted, Recall_weighted, Precision_weighted, Accuracy, and Matthews correlation coefficient of 0.851, 0.864, 0.854, 0.864, and 0.717, respectively. The performance of the minority categories was also significantly improved. Therefore, this recursive feature elimination-based approach of feature selection improves performances of classifying CRC deaths using clinical and omics data or those using other data with high feature redundancy and imbalance.
Collapse
Affiliation(s)
- Fei Deng
- School of Electrical and Electronic Engineering, Shanghai Institute of Technology, Shanghai, China.
| | - Lin Zhao
- School of Electrical and Electronic Engineering, Shanghai Institute of Technology, Shanghai, China
| | - Ning Yu
- School of Electrical and Electronic Engineering, Shanghai Institute of Technology, Shanghai, China
| | - Yuxiang Lin
- School of Electrical and Electronic Engineering, Shanghai Institute of Technology, Shanghai, China
| | - Lanjing Zhang
- Department of Biological Sciences, Rutgers University, Newark, New Jersey; Department of Pathology, Princeton Medical Center, Plainsboro, New Jersey; Rutgers Cancer Institute of New Jersey, New Brunswick, New Jersey; Department of Chemical Biology, Ernest Mario School of Pharmacy, Rutgers University, Piscataway, New Jersey.
| |
Collapse
|
8
|
Wang Z, Zheng C, Han X, Chen W, Lu L. An Innovative and Efficient Diagnostic Prediction Flow for Head and Neck Cancer: A Deep Learning Approach for Multi-Modal Survival Analysis Prediction Based on Text and Multi-Center PET/CT Images. Diagnostics (Basel) 2024; 14:448. [PMID: 38396486 PMCID: PMC10888043 DOI: 10.3390/diagnostics14040448] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2024] [Revised: 02/14/2024] [Accepted: 02/14/2024] [Indexed: 02/25/2024] Open
Abstract
Objective: To comprehensively capture intra-tumor heterogeneity in head and neck cancer (HNC) and maximize the use of valid information collected in the clinical field, we propose a novel multi-modal image-text fusion strategy aimed at improving prognosis. Method: We have developed a tailored diagnostic algorithm for HNC, leveraging a deep learning-based model that integrates both image and clinical text information. For the image fusion part, we used the cross-attention mechanism to fuse the image information between PET and CT, and for the fusion of text and image, we used the Q-former architecture to fuse the text and image information. We also improved the traditional prognostic model by introducing time as a variable in the construction of the model, and finally obtained the corresponding prognostic results. Result: We assessed the efficacy of our methodology through the compilation of a multicenter dataset, achieving commendable outcomes in multicenter validations. Notably, our results for metastasis-free survival (MFS), recurrence-free survival (RFS), overall survival (OS), and progression-free survival (PFS) were as follows: 0.796, 0.626, 0.641, and 0.691. Our results demonstrate a notable superiority over the utilization of CT and PET independently, and exceed the result derived without the clinical textual information. Conclusions: Our model not only validates the effectiveness of multi-modal fusion in aiding diagnosis, but also provides insights for optimizing survival analysis. The study underscores the potential of our approach in enhancing prognosis and contributing to the advancement of personalized medicine in HNC.
Collapse
Affiliation(s)
- Zhaonian Wang
- School of Biomedical Engineering, Southern Medical University, 1023 Shatai Road, Guangzhou 510515, China
- Guangdong Provincial Key Laboratory of Medical Image Processing, Southern Medical University, 1023 Shatai Road, Guangzhou 510515, China
- Guangdong Province Engineering Laboratory for Medical Imaging and Diagnostic Technology, Southern Medical University, 1023 Shatai Road, Guangzhou 510515, China
| | - Chundan Zheng
- School of Biomedical Engineering, Southern Medical University, 1023 Shatai Road, Guangzhou 510515, China
- Guangdong Provincial Key Laboratory of Medical Image Processing, Southern Medical University, 1023 Shatai Road, Guangzhou 510515, China
- Guangdong Province Engineering Laboratory for Medical Imaging and Diagnostic Technology, Southern Medical University, 1023 Shatai Road, Guangzhou 510515, China
- Pazhou Lab, Guangzhou 510330, China
| | - Xu Han
- School of Biomedical Engineering, Southern Medical University, 1023 Shatai Road, Guangzhou 510515, China
- Guangdong Provincial Key Laboratory of Medical Image Processing, Southern Medical University, 1023 Shatai Road, Guangzhou 510515, China
- Guangdong Province Engineering Laboratory for Medical Imaging and Diagnostic Technology, Southern Medical University, 1023 Shatai Road, Guangzhou 510515, China
- Pazhou Lab, Guangzhou 510330, China
| | - Wufan Chen
- School of Biomedical Engineering, Southern Medical University, 1023 Shatai Road, Guangzhou 510515, China
- Guangdong Provincial Key Laboratory of Medical Image Processing, Southern Medical University, 1023 Shatai Road, Guangzhou 510515, China
- Guangdong Province Engineering Laboratory for Medical Imaging and Diagnostic Technology, Southern Medical University, 1023 Shatai Road, Guangzhou 510515, China
| | - Lijun Lu
- School of Biomedical Engineering, Southern Medical University, 1023 Shatai Road, Guangzhou 510515, China
- Guangdong Provincial Key Laboratory of Medical Image Processing, Southern Medical University, 1023 Shatai Road, Guangzhou 510515, China
- Guangdong Province Engineering Laboratory for Medical Imaging and Diagnostic Technology, Southern Medical University, 1023 Shatai Road, Guangzhou 510515, China
- Pazhou Lab, Guangzhou 510330, China
| |
Collapse
|
9
|
Sofia D, Zhou Q, Shahriyari L. Mathematical and Machine Learning Models of Renal Cell Carcinoma: A Review. Bioengineering (Basel) 2023; 10:1320. [PMID: 38002445 PMCID: PMC10669004 DOI: 10.3390/bioengineering10111320] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2023] [Revised: 11/08/2023] [Accepted: 11/13/2023] [Indexed: 11/26/2023] Open
Abstract
This review explores the multifaceted landscape of renal cell carcinoma (RCC) by delving into both mechanistic and machine learning models. While machine learning models leverage patients' gene expression and clinical data through a variety of techniques to predict patients' outcomes, mechanistic models focus on investigating cells' and molecules' interactions within RCC tumors. These interactions are notably centered around immune cells, cytokines, tumor cells, and the development of lung metastases. The insights gained from both machine learning and mechanistic models encompass critical aspects such as signature gene identification, sensitive interactions in the tumors' microenvironments, metastasis development in other organs, and the assessment of survival probabilities. By reviewing the models of RCC, this study aims to shed light on opportunities for the integration of machine learning and mechanistic modeling approaches for treatment optimization and the identification of specific targets, all of which are essential for enhancing patient outcomes.
Collapse
Affiliation(s)
| | | | - Leili Shahriyari
- Department of Mathematics and Statistics, University of Massachusetts Amherst, Amherst, MA 01003, USA; (D.S.); (Q.Z.)
| |
Collapse
|
10
|
Pergaris A, Genaris I, Stergiou IE, Klijanienko J, Papadakos SP, Theocharis S. The Clinical Impact of Death Domain-Associated Protein and Holliday Junction Recognition Protein Expression in Cancer: Unmasking the Driving Forces of Neoplasia. Cancers (Basel) 2023; 15:5165. [PMID: 37958340 PMCID: PMC10650673 DOI: 10.3390/cancers15215165] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2023] [Revised: 10/22/2023] [Accepted: 10/24/2023] [Indexed: 11/15/2023] Open
Abstract
Death domain-associated protein (DAXX) and Holliday junction recognition protein (HJURP) act as chaperones of H3 histone variants H3.3 and centromere protein A (CENPA), respectively, and are implicated in many physiological processes, including aging and epigenetic regulation, by controlling various genes' transcription and subsequently protein expression. Research has highlighted both these biomolecules as participants in key procedures of tumorigenesis, including cell proliferation, chromosome instability, and oncogene expression. As cancer continues to exert a heavy impact on patients' well-being and bears substantial socioeconomic ramifications, the discovery of novel biomarkers for timely disease detection, estimation of prognosis, and therapy monitoring remains of utmost importance. In the present review, we present data reported from studies investigating DAXX and HJURP expression, either on mRNA or protein level, in human tissue samples from various types of neoplasia. Of note, the expression of DAXX and HJURP has been associated with a multitude of clinicopathological parameters, including disease stage, tumor grade, patients' overall and disease-free survival, as well as lymphovascular invasion. The data reveal the tumor-promoting properties of DAXX and HJURP in a number of organs as well as their potential use as diagnostic biomarkers and underline the important association between aberrations in their expression and patients' prognosis, rendering them as possible targets of future, personalized and precise therapeutic interventions.
Collapse
Affiliation(s)
- Alexandros Pergaris
- First Department of Pathology, Medical School, National and Kapodistrian University of Athens, 75 Mikras Asias Street, Bld 10, Goudi, 11527 Athens, Greece; (A.P.); (I.G.); (S.P.P.)
| | - Ioannis Genaris
- First Department of Pathology, Medical School, National and Kapodistrian University of Athens, 75 Mikras Asias Street, Bld 10, Goudi, 11527 Athens, Greece; (A.P.); (I.G.); (S.P.P.)
| | - Ioanna E. Stergiou
- Department of Pathophysiology, Medical School, National and Kapodistrian University of Athens, 11527 Athens, Greece;
| | | | - Stavros P. Papadakos
- First Department of Pathology, Medical School, National and Kapodistrian University of Athens, 75 Mikras Asias Street, Bld 10, Goudi, 11527 Athens, Greece; (A.P.); (I.G.); (S.P.P.)
| | - Stamatios Theocharis
- First Department of Pathology, Medical School, National and Kapodistrian University of Athens, 75 Mikras Asias Street, Bld 10, Goudi, 11527 Athens, Greece; (A.P.); (I.G.); (S.P.P.)
| |
Collapse
|
11
|
Al-Tashi Q, Saad MB, Sheshadri A, Wu CC, Chang JY, Al-Lazikani B, Gibbons C, Vokes NI, Zhang J, Lee JJ, Heymach JV, Jaffray D, Mirjalili S, Wu J. SwarmDeepSurv: swarm intelligence advances deep survival network for prognostic radiomics signatures in four solid cancers. PATTERNS (NEW YORK, N.Y.) 2023; 4:100777. [PMID: 37602223 PMCID: PMC10435962 DOI: 10.1016/j.patter.2023.100777] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/16/2022] [Revised: 04/18/2023] [Accepted: 05/26/2023] [Indexed: 08/22/2023]
Abstract
Survival models exist to study relationships between biomarkers and treatment effects. Deep learning-powered survival models supersede the classical Cox proportional hazards (CoxPH) model, but substantial performance drops were observed on high-dimensional features because of irrelevant/redundant information. To fill this gap, we proposed SwarmDeepSurv by integrating swarm intelligence algorithms with the deep survival model. Furthermore, four objective functions were designed to optimize prognostic prediction while regularizing selected feature numbers. When testing on multicenter sets (n = 1,058) of four different cancer types, SwarmDeepSurv was less prone to overfitting and achieved optimal patient risk stratification compared with popular survival modeling algorithms. Strikingly, SwarmDeepSurv selected different features compared with classical feature selection algorithms, including the least absolute shrinkage and selection operator (LASSO), with nearly no feature overlapping across these models. Taken together, SwarmDeepSurv offers an alternative approach to model relationships between radiomics features and survival endpoints, which can further extend to study other input data types including genomics.
Collapse
Affiliation(s)
- Qasem Al-Tashi
- Department of Imaging Physics, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| | - Maliazurina B. Saad
- Department of Imaging Physics, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| | - Ajay Sheshadri
- Department of Pulmonary Medicine, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| | - Carol C. Wu
- Department of Thoracic Imaging, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| | - Joe Y. Chang
- Department of Radiation Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| | - Bissan Al-Lazikani
- Department of Genomic Medicine, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| | - Christopher Gibbons
- Section of Patient-Centered Analytics, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| | - Natalie I. Vokes
- Department of Thoracic/Head and Neck Medical Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| | - Jianjun Zhang
- Department of Thoracic/Head and Neck Medical Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| | - J. Jack Lee
- Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| | - John V. Heymach
- Department of Thoracic/Head and Neck Medical Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| | - David Jaffray
- Office of the Chief Technology and Digital Officer, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| | - Seyedali Mirjalili
- Centre for Artificial Intelligence Research and Optimization, Torrens University Australia, Fortitude Valley, Brisbane, QLD 4006, Australia
- Yonsei Frontier Lab, Yonsei University, Seoul 03722, Korea
- University Research and Innovation Center, Obuda University, 1034 Budapest, Hungary
| | - Jia Wu
- Department of Imaging Physics, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
- Department of Thoracic/Head and Neck Medical Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| |
Collapse
|
12
|
Wu X, Li Z, Wang ZQ, Xu X. The neurological and non-neurological roles of the primary microcephaly-associated protein ASPM. Front Neurosci 2023; 17:1242448. [PMID: 37599996 PMCID: PMC10436222 DOI: 10.3389/fnins.2023.1242448] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2023] [Accepted: 07/24/2023] [Indexed: 08/22/2023] Open
Abstract
Primary microcephaly (MCPH), is a neurological disorder characterized by small brain size that results in numerous developmental problems, including intellectual disability, motor and speech delays, and seizures. Hitherto, over 30 MCPH causing genes (MCPHs) have been identified. Among these MCPHs, MCPH5, which encodes abnormal spindle-like microcephaly-associated protein (ASPM), is the most frequently mutated gene. ASPM regulates mitotic events, cell proliferation, replication stress response, DNA repair, and tumorigenesis. Moreover, using a data mining approach, we have confirmed that high levels of expression of ASPM correlate with poor prognosis in several types of tumors. Here, we summarize the neurological and non-neurological functions of ASPM and provide insight into its implications for the diagnosis and treatment of MCPH and cancer.
Collapse
Affiliation(s)
- Xingxuan Wu
- Guangdong Key Laboratory for Genome Stability and Disease Prevention and Marshall Laboratory of Biomedical Engineering, Shenzhen University Medical School, Shenzhen, Guangdong, China
- Shenzhen University-Friedrich Schiller Universität Jena Joint PhD Program in Biomedical Sciences, Shenzhen University School of Medicine, Shenzhen, Guangdong, China
- Laboratory of Genome Stability, Leibniz Institute on Aging-Fritz Lipmann Institute, Jena, Germany
| | - Zheng Li
- Guangdong Key Laboratory for Genome Stability and Disease Prevention and Marshall Laboratory of Biomedical Engineering, Shenzhen University Medical School, Shenzhen, Guangdong, China
| | - Zhao-Qi Wang
- Shenzhen University-Friedrich Schiller Universität Jena Joint PhD Program in Biomedical Sciences, Shenzhen University School of Medicine, Shenzhen, Guangdong, China
- Laboratory of Genome Stability, Leibniz Institute on Aging-Fritz Lipmann Institute, Jena, Germany
| | - Xingzhi Xu
- Guangdong Key Laboratory for Genome Stability and Disease Prevention and Marshall Laboratory of Biomedical Engineering, Shenzhen University Medical School, Shenzhen, Guangdong, China
- Shenzhen University-Friedrich Schiller Universität Jena Joint PhD Program in Biomedical Sciences, Shenzhen University School of Medicine, Shenzhen, Guangdong, China
| |
Collapse
|
13
|
Lee M. Deep Learning Techniques with Genomic Data in Cancer Prognosis: A Comprehensive Review of the 2021-2023 Literature. BIOLOGY 2023; 12:893. [PMID: 37508326 PMCID: PMC10376033 DOI: 10.3390/biology12070893] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/17/2023] [Revised: 06/16/2023] [Accepted: 06/20/2023] [Indexed: 07/30/2023]
Abstract
Deep learning has brought about a significant transformation in machine learning, leading to an array of novel methodologies and consequently broadening its influence. The application of deep learning in various sectors, especially biomedical data analysis, has initiated a period filled with noteworthy scientific developments. This trend has majorly influenced cancer prognosis, where the interpretation of genomic data for survival analysis has become a central research focus. The capacity of deep learning to decode intricate patterns embedded within high-dimensional genomic data has provoked a paradigm shift in our understanding of cancer survival. Given the swift progression in this field, there is an urgent need for a comprehensive review that focuses on the most influential studies from 2021 to 2023. This review, through its careful selection and thorough exploration of dominant trends and methodologies, strives to fulfill this need. The paper aims to enhance our existing understanding of applications of deep learning in cancer survival analysis, while also highlighting promising directions for future research. This paper undertakes aims to enrich our existing grasp of the application of deep learning in cancer survival analysis, while concurrently shedding light on promising directions for future research in this vibrant and rapidly proliferating field.
Collapse
Affiliation(s)
- Minhyeok Lee
- School of Electrical and Electronics Engineering, Chung-Ang University, Seoul 06974, Republic of Korea
| |
Collapse
|
14
|
Wang Y, Gao X, Wang J. Functional Proteomic Profiling Analysis in Four Major Types of Gastrointestinal Cancers. Biomolecules 2023; 13:biom13040701. [PMID: 37189448 DOI: 10.3390/biom13040701] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2023] [Revised: 04/05/2023] [Accepted: 04/18/2023] [Indexed: 05/17/2023] Open
Abstract
Gastrointestinal (GI) cancer accounts for one in four cancer cases and one in three cancer-related deaths globally. A deeper understanding of cancer development mechanisms can be applied to cancer medicine. Comprehensive sequencing applications have revealed the genomic landscapes of the common types of human cancer, and proteomics technology has identified protein targets and signalling pathways related to cancer growth and progression. This study aimed to explore the functional proteomic profiles of four major types of GI tract cancer based on The Cancer Proteome Atlas (TCPA). We provided an overview of functional proteomic heterogeneity by performing several approaches, including principal component analysis (PCA), partial least squares discriminant analysis (PLS-DA), t-stochastic neighbour embedding (t-SNE) analysis, and hierarchical clustering analysis in oesophageal carcinoma (ESCA), stomach adenocarcinoma (STAD), colon adenocarcinoma (COAD), and rectum adenocarcinoma (READ) tumours, to gain a system-wide understanding of the four types of GI cancer. The feature selection approach, mutual information feature selection (MIFS) method, was conducted to screen candidate protein signature subsets to better distinguish different cancer types. The potential clinical implications of candidate proteins in terms of tumour progression and prognosis were also evaluated based on TCPA and The Cancer Genome Atlas (TCGA) databases. The results suggested that functional proteomic profiling can identify different patterns among the four types of GI cancers and provide candidate proteins for clinical diagnosis and prognosis evaluation. We also highlighted the application of feature selection approaches in high-dimensional biological data analysis. Overall, this study could improve the understanding of the complexity of cancer phenotypes and genotypes and thus be applied to cancer medicine.
Collapse
Affiliation(s)
- Yangyang Wang
- School of Electronics and Information, Northwestern Polytechnical University, Xi'an 710129, China
| | - Xiaoguang Gao
- School of Electronics and Information, Northwestern Polytechnical University, Xi'an 710129, China
| | - Jihan Wang
- Institute of Medical Research, Northwestern Polytechnical University, Xi'an 710072, China
| |
Collapse
|
15
|
Feng HM, Zhao Y, Yan WJ, Li B. Genomic and immunogenomic analysis of three prognostic signature genes in LUAD. BMC Bioinformatics 2023; 24:19. [PMID: 36650426 PMCID: PMC9843910 DOI: 10.1186/s12859-023-05137-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2022] [Accepted: 01/04/2023] [Indexed: 01/18/2023] Open
Abstract
BACKGROUND Searching for immunotherapy-related markers is an important research content to screen for target populations suitable for immunotherapy. Prognosis-related genes in early stage lung cancer may also affect the tumor immune microenvironment, which in turn affects immunotherapy. RESULTS We analyzed the differential genes affecting lung cancer patients receiving immunotherapy through the Cancer Treatment Response gene signature DataBase (CTR-DB), and set a threshold to obtain a total of 176 differential genes between response and non-response to immunotherapy. Functional enrichment analysis found that these differential genes were mainly involved in immune regulation-related pathways. The early-stage lung adenocarcinoma (LUAD) prognostic model was constructed through the cancer genome atlas (TCGA) database, and three target genes (MMP12, NFE2, HOXC8) were screened to calculate the risk score of early-stage LUAD. The receiver operating characteristic (ROC) curve indicated that the model had good prognostic value, and the validation set (GSE50081, GSE11969 and GSE42127) from the gene expression omnibus (GEO) analysis indicated that the model had good stability, and the risk score was correlated with immune infiltrations to varying degrees. Multi-type survival analysis and immune infiltration analysis revealed that the transcriptome, methylation and the copy number variation (CNV) levels of the three genes were correlated with patient prognosis and some tumor microenvironment (TME) components. Drug sensitivity analysis found that the three genes may affect some anti-tumor drugs. The mRNA expression of immune checkpoint-related genes showed significant differences between the high and low group of the three genes, and there may be a mutual regulatory network between immune checkpoint-related genes and target genes. Tumor immune dysfunction and exclusion (TIDE) analysis found that three genes were associated with immunotherapy response and maybe the potential predictors to immunotherapy, consistent with the CTR-DB database analysis. CONCLUSIONS From the perspective of data mining, this study suggests that MMP12, NFE2, and HOXC8 may be involved in tumor immune regulation and affect immunotherapy. They are expected to become markers of immunotherapy and are worthy of further experimental research.
Collapse
Affiliation(s)
- Hai-Ming Feng
- grid.411294.b0000 0004 1798 9345Department of Thoracic Surgery, Lanzhou University Second Hospital, Lanzhou University Second Clinical Medical College, 82 Cuiyingmen, Chengguan District, Lanzhou, 730030 Gansu People’s Republic of China
| | - Ye Zhao
- grid.411634.50000 0004 0632 4559Department of Radiotherapy, Gansu Provincial People’s Hospital, Lanzhou City, 730030 China
| | - Wei-Jian Yan
- grid.411294.b0000 0004 1798 9345Department of Thoracic Surgery, Lanzhou University Second Hospital, Lanzhou University Second Clinical Medical College, 82 Cuiyingmen, Chengguan District, Lanzhou, 730030 Gansu People’s Republic of China
| | - Bin Li
- grid.411294.b0000 0004 1798 9345Department of Thoracic Surgery, Lanzhou University Second Hospital, Lanzhou University Second Clinical Medical College, 82 Cuiyingmen, Chengguan District, Lanzhou, 730030 Gansu People’s Republic of China
| |
Collapse
|
16
|
Survival Analysis with High-Dimensional Omics Data Using a Threshold Gradient Descent Regularization-Based Neural Network Approach. Genes (Basel) 2022; 13:genes13091674. [PMID: 36140842 PMCID: PMC9498566 DOI: 10.3390/genes13091674] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2022] [Revised: 09/13/2022] [Accepted: 09/16/2022] [Indexed: 11/17/2022] Open
Abstract
Analysis of data with a censored survival response and high-dimensional omics measurements is now common. Most of the existing analyses are based on specific (semi)parametric models, in particular the Cox model. Such analyses may be limited by not having sufficient flexibility, for example, in accommodating nonlinearity. For categorical and continuous responses, neural networks (NNs) have provided a highly competitive alternative. Comparatively, NNs for censored survival data remain limited. Omics measurements are usually high-dimensional, and only a small subset is expected to be survival-associated. As such, regularized estimation and selection are needed. In the existing NN studies, this is usually achieved via penalization. In this article, we propose adopting the threshold gradient descent regularization (TGDR) technique, which has competitive performance (for example, when compared to penalization) and unique advantages in regression analysis, but has not been adopted with NNs. The TGDR-based NN has a highly sensible formulation and an architecture different from the unregularized and penalization-based ones. Simulations show its satisfactory performance. Its practical effectiveness is further established via the analysis of two cancer omics datasets. Overall, this study can provide a practical and useful new way in the NN paradigm for survival analysis with high-dimensional omics measurements.
Collapse
|
17
|
Risk Stratification for Breast Cancer Patient by Simultaneous Learning of Molecular Subtype and Survival Outcome Using Genetic Algorithm-Based Gene Set Selection. Cancers (Basel) 2022; 14:cancers14174120. [PMID: 36077657 PMCID: PMC9454699 DOI: 10.3390/cancers14174120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2022] [Revised: 08/18/2022] [Accepted: 08/20/2022] [Indexed: 11/26/2022] Open
Abstract
Simple Summary Patient stratification is clinically important because it allows us to understand the characteristics and establish treatment strategies for a group. Transcriptomic data play an important role in determining molecular subtypes and predicting survival. In the case of breast cancer, although the order of prognosis according to molecular subtypes is well known, there is heterogeneity even within a subtype. Therefore, patient stratification considering both molecular subtypes and survival outcomes is required. In this study, a methodology to handle this problem is presented. A genetic algorithm is used to select a set of genes, and a risk score is assigned to each patient using their expression level. According to the risk score, patients are ordered and stratified considering molecular subtypes and survival outcomes. Consequently, informative genes for patient stratification with respect to both aspects could be nominated, and the usefulness of the risk score was shown through comparison with other indicators. Abstract Patient stratification is a clinically important task because it allows us to establish and develop efficient treatment strategies for particular groups of patients. Molecular subtypes have been successfully defined using transcriptomic profiles, and they are used effectively in clinical practice, e.g., PAM50 subtypes of breast cancer. Survival prediction contributed to understanding diseases and also identifying genes related to prognosis. It is desirable to stratify patients considering these two aspects simultaneously. However, there are no methods for patient stratification that consider molecular subtypes and survival outcomes at once. Here, we propose a methodology to deal with the problem. A genetic algorithm is used to select a gene set from transcriptome data, and their expression quantities are utilized to assign a risk score to each patient. The patients are ordered and stratified according to the score. A gene set was selected by our method on a breast cancer cohort (TCGA-BRCA), and we examined its clinical utility using an independent cohort (SCAN-B). In this experiment, our method was successful in stratifying patients with respect to both molecular subtype and survival outcome. We demonstrated that the orders of patients were consistent across repeated experiments, and prognostic genes were successfully nominated. Additionally, it was observed that the risk score can be used to evaluate the molecular aggressiveness of individual patients.
Collapse
|