1
|
Zhao K, Wen Q, Li Q, Li P, Liu T, Zhu F, Tan Q, Zhang L. Identification of oxidative stress-related hub genes for predicting prognosis in diffuse large B-cell lymphoma. Gene 2025; 935:149077. [PMID: 39500385 DOI: 10.1016/j.gene.2024.149077] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2024] [Revised: 10/21/2024] [Accepted: 10/31/2024] [Indexed: 11/15/2024]
Abstract
BACKGROUND Oxidative stress is a cellular characteristic that might induce the proliferation and differentiation of tumor cells and promote tumor progression in diffuse large B-cell lymphoma (DLBCL). METHODS The DLBCL gene sequencing dataset, tumor mutation burden data, copy number variation data of Somatic cell mutation data in TCGA were downloaded for data training analysis, along with four DLBCL datasets in GEO for validation analysis. The known oxidative stress related genes (OSRGs) were collected from websites. The weighted gene co-expression network analysis (WGCNA) was conducted on the TCGA DLBCL dataset to obtain gene modules related to oxidative stress and intersected with the known OSRGs to obtain the hub genes, which were used to perform consensus clustering on the samples to obtain new phenotypes. Next, the prognosis related OSRGs were selected through regression analysis algorithms and key genes were identified. These genes were used to establish the prognostic risk model and predictive model, and to compare functional and pathway differences among different risk groups. RESULTS Through website search, we obtained 297 known OSRGs, and after intersecting with WGCNA results, we obtained 26 OSRGs. The TCGA-DLBC samples were clustered into 2 subtypes with these genes and there were significant differences in immune infiltration between subtypes. After regression analysis, we obtained a total of four key genes, BMI1, CDKN1A, NOX1, and SESN1. The risk prediction model established with these four genes as variables has accurate prognostic prediction ability. The key genes interact with 65 miRNAs, 57 TFs, 47 RBPs, and 62 drugs, respectively, and are closely related to immune infiltration of the disease. Among them, CDKN1A and SESN1 had the highest variability. CONCLUSIONS The key genes involved in oxidative stress could predict the prognosis of DLBCL and potentially become therapeutic targets.
Collapse
Affiliation(s)
- Kewei Zhao
- Hubei Provincial Key Laboratory of Precision Radiation Oncology, Wuhan 430022, China; The First Clinical School, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430023, China
| | - Qiuyue Wen
- Hubei Provincial Key Laboratory of Precision Radiation Oncology, Wuhan 430022, China; The First Clinical School, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430023, China
| | - Qiuhui Li
- The First Clinical School, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430023, China; Cancer Center, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430022, China
| | - Pengye Li
- Hubei Provincial Key Laboratory of Precision Radiation Oncology, Wuhan 430022, China; The First Clinical School, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430023, China
| | - Tao Liu
- Hubei Provincial Key Laboratory of Precision Radiation Oncology, Wuhan 430022, China; The First Clinical School, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430023, China
| | - Fang Zhu
- Hubei Provincial Key Laboratory of Precision Radiation Oncology, Wuhan 430022, China; The First Clinical School, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430023, China
| | - Qiaoyun Tan
- Cancer Center, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430022, China
| | - Liling Zhang
- Hubei Provincial Key Laboratory of Precision Radiation Oncology, Wuhan 430022, China; The First Clinical School, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430023, China; Cancer Center, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430022, China.
| |
Collapse
|
2
|
Wang L, Cui G, Cai X. Fuzzy clustering optimal k selection method based on multi-objective optimization. Soft comput 2023. [DOI: 10.1007/s00500-022-07727-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
|
3
|
Kenidra B, Benmohammed M. An Ultra-Fast Method for Clustering of Big Genomic Data. INTERNATIONAL JOURNAL OF APPLIED METAHEURISTIC COMPUTING 2020. [DOI: 10.4018/ijamc.2020010104] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The clustering process is used to identify cancer subtypes based on gene expression and DNA methylation datasets, since cancer subtype information is critically important for understanding tumor heterogeneity, detecting previously unknown clusters of biological samples, which are usually associated with unknown types of cancer will, in turn, gives way to prescribe more effective treatments for patients. This is because cancer has varying subtypes which often respond disparately to the same treatment. While the DNA methylation database is extremely large-scale datasets, running time still remains a major challenge. Actually, traditional clustering algorithms are too slow to handle biological high-dimensional datasets, they usually require large amounts of computational time. The proposed clustering algorithm extraordinarily overcomes all others in terms of running time, it is able to rapidly identify a set of biologically relevant clusters in large-scale DNA methylation datasets, its superiority over the others has been demonstrated regarding its relative speed.
Collapse
Affiliation(s)
- Billel Kenidra
- National Superior Institute of Computer Science (ESI), Constantine, Algeria
| | | |
Collapse
|
4
|
Chen X, Huang JZ, Wu Q, Yang M. Subspace Weighting Co-Clustering of Gene Expression Data. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2019; 16:352-364. [PMID: 28541221 DOI: 10.1109/tcbb.2017.2705686] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
Microarray technology enables the collection of vast amounts of gene expression data from biological experiments. Clustering algorithms have been successfully applied to exploring the gene expression data. Since a set of genes may be only correlated to a subset of samples, it is useful to use co-clustering to recover co-clusters in the gene expression data. In this paper, we propose a novel algorithm, called Subspace Weighting Co-Clustering (SWCC), for high dimensional gene expression data. In SWCC, a gene subspace weight matrix is introduced to identify the contribution of gene objects in distinguishing different sample clusters. We design a new co-clustering objective function to recover the co-clusters in the gene expression data, in which the subspace weight matrix is introduced. An iterative algorithm is developed to solve the objective function, in which the subspace weight matrix is automatically computed during the iterative co-clustering process. Our empirical study shows encouraging results of the proposed algorithm in comparison with six state-of-the-art clustering algorithms on ten gene expression data sets. We also propose to use SWCC for gene clustering and selection. The experimental results show that the selected genes can improve the classification performance of Random Forests.
Collapse
|
6
|
Ji G, Lin Q, Long Y, Ye C, Ye W, Wu X. PAcluster: Clustering polyadenylation site data using canonical correlation analysis. J Bioinform Comput Biol 2017; 15:1750018. [PMID: 28874086 DOI: 10.1142/s0219720017500184] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Alternative polyadenylation (APA) is a pervasive mechanism that contributes to gene regulation. Increasing sequenced poly(A) sites are placing new demands for the development of computational methods to investigate APA regulation. Cluster analysis is important to identify groups of co-expressed genes. However, clustering of poly(A) sites has not been extensively studied in APA, where most APA studies failed to consider the distribution, abundance, and variation of APA sites in each gene. Here we constructed a two-layer model based on canonical correlation analysis (CCA) to explore the underlying biological mechanisms in APA regulation. The first layer quantifies the general correlation of APA sites across various conditions between each gene and the second layer identifies genes with statistically significant correlation on their APA patterns to infer APA-specific gene clusters. Using hierarchical clustering, we comprehensively compared our method with four other widely used distance measures based on three performance indexes. Results showed that our method significantly enhanced the clustering performance for both synthetic and real poly(A) site data and could generate clusters with more biological meaning. We have implemented the CCA-based method as a publically available R package called PAcluster, which provides an efficient solution to the clustering of large APA-specific biological dataset.
Collapse
Affiliation(s)
- Guoli Ji
- * Department of Automation, Xiamen University, Xiamen, Fujian, P. R. China
| | - Qianmin Lin
- * Department of Automation, Xiamen University, Xiamen, Fujian, P. R. China
| | - Yuqi Long
- * Department of Automation, Xiamen University, Xiamen, Fujian, P. R. China
| | - Congting Ye
- † College of the Environment and Ecology, Xiamen University, Xiamen, Fujian, P. R. China
| | - Wenbin Ye
- * Department of Automation, Xiamen University, Xiamen, Fujian, P. R. China
| | - Xiaohui Wu
- * Department of Automation, Xiamen University, Xiamen, Fujian, P. R. China
| |
Collapse
|