1
|
Hamamoto R, Takasawa K, Machino H, Kobayashi K, Takahashi S, Bolatkan A, Shinkai N, Sakai A, Aoyama R, Yamada M, Asada K, Komatsu M, Okamoto K, Kameoka H, Kaneko S. Application of non-negative matrix factorization in oncology: one approach for establishing precision medicine. Brief Bioinform 2022; 23:6628783. [PMID: 35788277 PMCID: PMC9294421 DOI: 10.1093/bib/bbac246] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2022] [Revised: 05/06/2022] [Accepted: 05/25/2022] [Indexed: 12/19/2022] Open
Abstract
The increase in the expectations of artificial intelligence (AI) technology has led to machine learning technology being actively used in the medical field. Non-negative matrix factorization (NMF) is a machine learning technique used for image analysis, speech recognition, and language processing; recently, it is being applied to medical research. Precision medicine, wherein important information is extracted from large-scale medical data to provide optimal medical care for every individual, is considered important in medical policies globally, and the application of machine learning techniques to this end is being handled in several ways. NMF is also introduced differently because of the characteristics of its algorithms. In this review, the importance of NMF in the field of medicine, with a focus on the field of oncology, is described by explaining the mathematical science of NMF and the characteristics of the algorithm, providing examples of how NMF can be used to establish precision medicine, and presenting the challenges of NMF. Finally, the direction regarding the effective use of NMF in the field of oncology is also discussed.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | - Rina Aoyama
- Showa University Graduate School of Medicine School of Medicine
| | | | - Ken Asada
- RIKEN Center for Advanced Intelligence Project
| | | | | | | | | |
Collapse
|
2
|
Wang J, Wang LH, Liu JX, Kong XZ, Li SJ. Multi-view Random-walk Graph regularization Low-Rank Representation for cancer clustering and Differentially Expressed Gene Selection. IEEE J Biomed Health Inform 2022; 26:3578-3589. [PMID: 35157604 DOI: 10.1109/jbhi.2022.3151333] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Cancer genome data generally consists of multiple views from different sources. These views provide different levels of information about gene activity, as well as more comprehensive cancer information. The low-rank representation (LRR) method, as a powerful subspace clustering method, has been extended and applied in cancer data research. However, most methods based on low-rank representation only study single-view data in cancer genome data, such as gene expression data. The methods based on single-view genome data usually ignore the complementary relationship between the views, which is not conducive to further study of cancer. Therefore, this paper proposes a new method named Multi-view Random-walk Graph regularization Low-Rank Representation (MRGLRR) to comprehensively analyze multi-view genomics data. This method uses multi-view model to find the common centroid of view. By constructing a joint affinity matrix to learn the low-rank subspace representation of multiple sets of data, the hidden information of each view is fully obtained. In addition, this method introduces random walk graph regularization constraint to obtain more accurate similarity between samples. Different from the traditional graph regularization constraint, after constructing the KNN graph, we use the random walk algorithm to obtain the weight matrix. The random walk algorithm can retain more local geometric information and better learn the topological structure of the data. What's more, a feature gene selection strategy suitable for multi-view model is proposed to find more differentially expressed genes with research value. Experimental results show that our method is better than other representative methods in terms of clustering and feature gene selection for cancer multi-omics data.
Collapse
|
3
|
Prognosis of Tumor Microenvironment in Luminal B-Type Breast Cancer. DISEASE MARKERS 2022; 2022:5621441. [PMID: 35242245 PMCID: PMC8886761 DOI: 10.1155/2022/5621441] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/03/2021] [Accepted: 01/12/2022] [Indexed: 12/09/2022]
Abstract
Objective Tumor microenvironment as an important element of malignancy could help predict cancer prognosis and therapeutic response; thus, a prognostic landscape map of the tumor microenvironment in luminal B breast cancers should be developed. Methods The GEO and TCGA databases were employed to retrieve clinical follow-up data and expression profiles of luminal B breast cancer. CIBERSORT was applied to assess the infiltration of the tumor microenvironment of 209 patients and to construct tumor microenvironment-based subtypes of luminal B breast cancer. We also conducted Cox multivariate regression analysis to select features that could be used to develop a microenvironment signature for cancer. Samples were categorized as having low and high TME scores according to the median TME score. The correlations of prognosis and TME score, expression levels of immune factors and genomic variation, and clinical features were further investigated. Results We found that high TME scores were correlated with poor prognosis. The current findings showed that the expressions of multiple immune-related genes, including CXCL9, CXCL10, GZMB, and PDCD1LG2, were upregulated in cancer with high TME scores. The high-risk group showed lower TP53 gene mutation frequency as opposed to that of the low-risk group. For the purpose of developing a TME scoring system, the TME infiltration levels of 209 patients with luminal B breast cancer from TCGA were comprehensively analyzed. Conclusions Our analysis revealed that the TME score was an indicator of patients' response to immune checkpoint modulators and an effective prognostic biomarker. TME scoring improves current immunotherapy on luminal B breast cancer.
Collapse
|
4
|
Nguyen H, Tran D, Tran B, Roy M, Cassell A, Dascalu S, Draghici S, Nguyen T. SMRT: Randomized Data Transformation for Cancer Subtyping and Big Data Analysis. Front Oncol 2021; 11:725133. [PMID: 34745946 PMCID: PMC8563705 DOI: 10.3389/fonc.2021.725133] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2021] [Accepted: 09/28/2021] [Indexed: 12/25/2022] Open
Abstract
Cancer is an umbrella term that includes a range of disorders, from those that are fast-growing and lethal to indolent lesions with low or delayed potential for progression to death. The treatment options, as well as treatment success, are highly dependent on the correct subtyping of individual patients. With the advancement of high-throughput platforms, we have the opportunity to differentiate among cancer subtypes from a holistic perspective that takes into consideration phenomena at different molecular levels (mRNA, methylation, etc.). This demands powerful integrative methods to leverage large multi-omics datasets for a better subtyping. Here we introduce Subtyping Multi-omics using a Randomized Transformation (SMRT), a new method for multi-omics integration and cancer subtyping. SMRT offers the following advantages over existing approaches: (i) the scalable analysis pipeline allows researchers to integrate multi-omics data and analyze hundreds of thousands of samples in minutes, (ii) the ability to integrate data types with different numbers of patients, (iii) the ability to analyze un-matched data of different types, and (iv) the ability to offer users a convenient data analysis pipeline through a web application. We also improve the efficiency of our ensemble-based, perturbation clustering to support analysis on machines with memory constraints. In an extensive analysis, we compare SMRT with eight state-of-the-art subtyping methods using 37 TCGA and two METABRIC datasets comprising a total of almost 12,000 patient samples from 28 different types of cancer. We also performed a number of simulation studies. We demonstrate that SMRT outperforms other methods in identifying subtypes with significantly different survival profiles. In addition, SMRT is extremely fast, being able to analyze hundreds of thousands of samples in minutes. The web application is available at http://SMRT.tinnguyen-lab.com. The R package will be deposited to CRAN as part of our PINSPlus software suite.
Collapse
Affiliation(s)
- Hung Nguyen
- Department of Computer Science and Engineering, University of Nevada Reno, Reno, NV, United States
| | - Duc Tran
- Department of Computer Science and Engineering, University of Nevada Reno, Reno, NV, United States
| | - Bang Tran
- Department of Computer Science and Engineering, University of Nevada Reno, Reno, NV, United States
| | - Monikrishna Roy
- Department of Computer Science and Engineering, University of Nevada Reno, Reno, NV, United States
| | - Adam Cassell
- Department of Computer Science and Engineering, University of Nevada Reno, Reno, NV, United States
| | - Sergiu Dascalu
- Department of Computer Science and Engineering, University of Nevada Reno, Reno, NV, United States
| | - Sorin Draghici
- Department of Computer Science, Wayne State University, Detroit, MI, United States
| | - Tin Nguyen
- Department of Computer Science and Engineering, University of Nevada Reno, Reno, NV, United States
| |
Collapse
|
5
|
Wang CY, Yu N, Wu MJ, Gao YL, Liu JX, Wang J. Dual Hyper-Graph Regularized Supervised NMF for Selecting Differentially Expressed Genes and Tumor Classification. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2021; 18:2375-2383. [PMID: 32086220 DOI: 10.1109/tcbb.2020.2975173] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Non-negative matrix factorization (NMF) is a dimensionality reduction technique based on high-dimensional mapping. It can learn part-based representations effectively. In this paper, we propose a method called Dual Hyper-graph Regularized Supervised Non-negative Matrix Factorization (HSNMF). To encode the geometric information of the data, the hyper-graph is introduced into the model as a regularization term. The advantage of hyper-graph learning is to find higher order data relationship to enhance data relevance. This method constructs the data hyper-graph and the feature hyper-graph to find the data manifold and the feature manifold simultaneously. The application of hyper-graph theory in cancer datasets can effectively find pathogenic genes. The discrimination information is further introduced into the objective function to obtain more information about the data. Supervised learning with label information greatly improves the classification effect. Furthermore, the real datasets of cancer usually contain sparse noise, so the L2,1-norm is applied to enhance the robustness of HSNMF algorithm. Experiments under The Cancer Genome Atlas (TCGA) datasets verify the feasibility of the HSNMF method.
Collapse
|
6
|
Ren C, Wu C, Wang N, Lian C, Yang C. Clonal Architectures Predict Clinical Outcome in Gastric Adenocarcinoma Based on Genomic Variation, Tumor Evolution, and Heterogeneity. Cell Transplant 2021; 30:963689721989606. [PMID: 33900127 PMCID: PMC8085378 DOI: 10.1177/0963689721989606] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
Abstract
Stomach adenocarcinoma (STAD) is a highly heterogeneous disease. Due to the lack of effective molecular markers and personalized treatment, the prognosis of gastric cancer patients is still very poor. The ABSOLUTE algorithm and cancer cell fraction were used to evaluate the clonal and subclonal status of 349 TCGA (The Cancer Genome Cancer Atlas)-STAD patients. Non-negative matrix factorization was used to identify the mutation characteristics of the samples. Univariate Cox regression analysis was used to determine the relationship between clonal/subclonal events and prognosis, and the Spearman correlation was used to evaluate the relationship of clonal/subclonal events to tumor mutation burden (TMB) and neoantigens. The evolution pattern of STAD demonstrated great tumor heterogeneity. TP53, USH2A, and GLI3 appeared earliest in STAD and may drive STAD. CTNNB1, LRP1B, and ERBB4 appeared the latest in STAD, and may be related to STAD’s progress. Univariate Cox regression analysis identified four early genes, eight intermediate genes, and seven late genes significantly associated with overall survival. The number of subclonal events in the T stage was significantly different. The N stage, gender, and histological type were significantly different for clonal events, and there was a significant correlation between clonal/subclonal events and TMB/neoantigens. Our results highlight the importance of systematic evaluation of evolutionary models in the clinical management of STAD and personalized gastric cancer treatment.
Collapse
Affiliation(s)
- Chenxia Ren
- Central Laboratory, 74652Changzhi Medical College, Shanxi Province, China
| | - Cuiling Wu
- Faculty of Basic Medicine, 74652Changzhi Medical College, Shanxi Province, China
| | - Niuniu Wang
- Central Laboratory, 74652Changzhi Medical College, Shanxi Province, China
| | - Changhong Lian
- Department of General Surgery, 117875Heping Hospital Affiliated to Changzhi Medical College, Shanxi Province, China
| | - Changqing Yang
- Department of Gastroenterology, 117875Heping Hospital Affiliated to Changzhi Medical College, Shanxi Province, China
| |
Collapse
|
7
|
Chen Y, Zhao J. Identification of an Immune Gene Signature Based on Tumor Microenvironment Characteristics in Colon Adenocarcinoma. Cell Transplant 2021; 30:9636897211001314. [PMID: 33787354 PMCID: PMC8020110 DOI: 10.1177/09636897211001314] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Tumor microenvironment (TME) changes are related to the occurrence and development of colon adenocarcinoma (COAD). This study aimed to analyze the characteristics of the immune microenvironment in CC, as well as the microenvironment's relationship with the clinical features of CC. Based on The Cancer Genome Atlas (TCGA) and GSE39582 cohorts, the scores of 22 tumor infiltrating lymphocytes (TILs) were calculated using CIBERSORT. ConsensusClusterPlus was used for unsupervised clustering. Three TME subtypes (TMEC1, TMEC2, and TME3) were identified based on TIL scores. TMEC2 was associated with the worst prognosis. Random forest, k-means clustering, and principal component analysis were used to construct the TME score risk signature. The median TME score was used to divide the samples into high- and low-risk groups. The prognoses of the patients with high TME scores were worse than those of the patients with low TME scores. A high TME score was an independent prognostic risk factor for patients with colon cancer. The Gene Set Enrichment Analysis (GSEA) results showed that those with high TME scores were enriched in FOCAL_ADHESION, ECM_RECEPTOR_INTERACTION, and PATHWAYS_IN_CANCER. Our findings will provide a new strategy for immunotherapy in patients with CC.
Collapse
Affiliation(s)
- Ying Chen
- Department of Medical Oncology, the First Hospital of China Medical University, Shenyang, China
- Key Laboratory of Anticancer Drugs and Biotherapy of Liaoning
Province, the First Hospital of China Medical University, Shenyang, China
| | - Jia Zhao
- Department of Medical Oncology, the First Hospital of China Medical University, Shenyang, China
- Key Laboratory of Anticancer Drugs and Biotherapy of Liaoning
Province, the First Hospital of China Medical University, Shenyang, China
- Jia Zhao, Department of Medical Oncology,
the First Hospital of China Medical University, Shenyang, China.
| |
Collapse
|
8
|
Ma M, Chen Y, Chong X, Jiang F, Gao J, Shen L, Zhang C. Integrative analysis of genomic, epigenomic and transcriptomic data identified molecular subtypes of esophageal carcinoma. Aging (Albany NY) 2021; 13:6999-7019. [PMID: 33638948 PMCID: PMC7993659 DOI: 10.18632/aging.202556] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2020] [Accepted: 12/29/2020] [Indexed: 12/16/2022]
Abstract
Esophageal cancer (EC) involves many genomic, epigenetic and transcriptomic disorders, which play key roles in the heterogeneous progression of cancer. However, the study of EC with multi-omics has not been conducted. This study identified a high consistency between DNA copy number variations and abnormal methylations in EC by analyzing genomics, epigenetics and transcriptomics data and investigating mutual correlations of DNA copy number variation, methylation and gene expressions, and stratified copy number variation genes (CNV-Gs) and methylation genes (MET-Gs). The methylation, CNVs and expression profiles of CNV-Gs and MET-Gs were analyzed by consistent clustering using iCluster integration, here, we determined three subtypes (iC1, iC2, iC3) with different molecular traits, prognostic characteristics and tumor immune microenvironment features. We also identified 4 prognostic genes (CLDN3, FAM221A, GDF15 and YBX2) differentially expressed in the three subtypes, and could therefore be used as representative biomarkers for the three subtypes of EC. In conclusion, by performing comprehensive analysis on genomic, epigenetic and transcriptomic regulations, the current study provided new insights into the multilayer molecular and pathological traits of EC, and contributed to the precision medication for EC patients.
Collapse
Affiliation(s)
- Mingyang Ma
- Department of Gastrointestinal Oncology, Key Laboratory of Carcinogenesis and Translational Research (Ministry of Education/Beijing), Peking University Cancer Hospital and Institute, Beijing 100142, China
| | - Yang Chen
- Department of Gastrointestinal Oncology, Key Laboratory of Carcinogenesis and Translational Research (Ministry of Education/Beijing), Peking University Cancer Hospital and Institute, Beijing 100142, China
| | - Xiaoyi Chong
- Department of Gastrointestinal Oncology, Key Laboratory of Carcinogenesis and Translational Research (Ministry of Education/Beijing), Peking University Cancer Hospital and Institute, Beijing 100142, China
| | - Fangli Jiang
- Department of Gastrointestinal Oncology, Key Laboratory of Carcinogenesis and Translational Research (Ministry of Education/Beijing), Peking University Cancer Hospital and Institute, Beijing 100142, China
| | - Jing Gao
- National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital and Shenzhen Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Shenzhen 518116, China
| | - Lin Shen
- Department of Gastrointestinal Oncology, Key Laboratory of Carcinogenesis and Translational Research (Ministry of Education/Beijing), Peking University Cancer Hospital and Institute, Beijing 100142, China
| | - Cheng Zhang
- Department of Gastrointestinal Oncology, Key Laboratory of Carcinogenesis and Translational Research (Ministry of Education/Beijing), Peking University Cancer Hospital and Institute, Beijing 100142, China
| |
Collapse
|
9
|
Song Y, Yang K, Sun T, Tang R. Development and validation of prognostic markers in sarcomas base on a multi-omics analysis. BMC Med Genomics 2021; 14:31. [PMID: 33509178 PMCID: PMC7841904 DOI: 10.1186/s12920-021-00876-4] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2020] [Accepted: 01/13/2021] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND In sarcomas, the DNA copy number and DNA methylation exhibit genomic aberrations. Transcriptome imbalances play a driving role in the heterogeneous progression of sarcomas. However, it is still unclear whether abnormalities of DNA copy numbers are systematically related to epigenetic DNA methylation, thus, a comprehensive analysis of sarcoma occurrence and development from the perspective of epigenetic and genomics is required. METHODS RNASeq, copy number variation (CNV), methylation data, clinical follow-up information were obtained from The Cancer Genome Atlas (TCGA) and GEO database. The association between methylation and CNV was analyzed to further identify methylation-related genes (MET-Gs) and CNV abnormality-related genes (CNV-Gs). Subsequently DNA copy number, methylation, and gene expression data associated with the MET-Gs and CNV-Gs were integrated to determine molecular subtypes and clinical and molecular characteristics of molecular subtypes. Finally, key biomarkers were determined and validated in independent validation sets. RESULTS A total of 5354 CNV-Gs and 4042 MET-Gs were screened and showed a high degree of consistency. Four molecular subtypes (iC1, iC2, iC3, and iC4) with different prognostic significances were identified by multiomics cluster analysis, specifically, iC2 had the worst prognosis and iC4 indicated an immune-enhancing state. Three potential prognostic markers (ENO1, ACVRL1 and APBB1IP) were determined after comparing the molecular characteristics of the four molecular subtypes. The expression of ENO1 gene was significantly correlated with CNV, and was noticeably higher in iC2 subtype with the worst prognosis than any other subtypes. The expressions of ACVRL1 and APBB1IP were negatively correlated with methylation, and were high-expressed in the iC4 subtype with the most favorable prognosis. In addition, the number of silent/nonsilent mutations and neoantigens in iC2 subtype were significantly more than those in iC1/iC3/iC4 subtype, and the same trend was also observed in CNV Gain/Loss. CONCLUSION The current comprehensive analysis of genomic and epigenomic regulation provides new insights into multilayered pathobiology of sarcomas. Four molecular subtypes and three prognostic markers developed in this study improve the current understanding of the molecular mechanisms underlying sarcoma.
Collapse
Affiliation(s)
- Yongchun Song
- Department of Oncology Surgery, The First Affiliated Hospital of Xi'an Jiaotong University, 277 West Yanta Road, Xi'an, 710061, Shaanxi, China
| | - Kui Yang
- Department of General Surgery, The First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, 710061, Shaanxi, China
| | - Tuanhe Sun
- Department of Oncology Surgery, The First Affiliated Hospital of Xi'an Jiaotong University, 277 West Yanta Road, Xi'an, 710061, Shaanxi, China
| | - Ruixiang Tang
- Department of Oncology Surgery, The First Affiliated Hospital of Xi'an Jiaotong University, 277 West Yanta Road, Xi'an, 710061, Shaanxi, China.
| |
Collapse
|
10
|
Ren LR, Gao YL, Liu JX, Shang J, Zheng CH. Correntropy induced loss based sparse robust graph regularized extreme learning machine for cancer classification. BMC Bioinformatics 2020; 21:445. [PMID: 33028187 PMCID: PMC7542897 DOI: 10.1186/s12859-020-03790-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2019] [Accepted: 09/30/2020] [Indexed: 01/17/2023] Open
Abstract
Background As a machine learning method with high performance and excellent generalization ability, extreme learning machine (ELM) is gaining popularity in various studies. Various ELM-based methods for different fields have been proposed. However, the robustness to noise and outliers is always the main problem affecting the performance of ELM. Results In this paper, an integrated method named correntropy induced loss based sparse robust graph regularized extreme learning machine (CSRGELM) is proposed. The introduction of correntropy induced loss improves the robustness of ELM and weakens the negative effects of noise and outliers. By using the L2,1-norm to constrain the output weight matrix, we tend to obtain a sparse output weight matrix to construct a simpler single hidden layer feedforward neural network model. By introducing the graph regularization to preserve the local structural information of the data, the classification performance of the new method is further improved. Besides, we design an iterative optimization method based on the idea of half quadratic optimization to solve the non-convex problem of CSRGELM. Conclusions The classification results on the benchmark dataset show that CSRGELM can obtain better classification results compared with other methods. More importantly, we also apply the new method to the classification problems of cancer samples and get a good classification effect.
Collapse
Affiliation(s)
- Liang-Rui Ren
- School of Computer Science, Qufu Normal University, Rizhao, 276826, China
| | - Ying-Lian Gao
- Qufu Normal University Library, Qufu Normal University, Rizhao, 276826, China
| | - Jin-Xing Liu
- School of Computer Science, Qufu Normal University, Rizhao, 276826, China.
| | - Junliang Shang
- School of Computer Science, Qufu Normal University, Rizhao, 276826, China
| | - Chun-Hou Zheng
- School of Computer Science, Qufu Normal University, Rizhao, 276826, China.,College of Computer Science and Technology, Anhui University, Hefei, 230601, China
| |
Collapse
|
11
|
Wang G, Wang D, Sun M, Liu X, Yang Q. Identification of prognostic and immune-related gene signatures in the tumor microenvironment of endometrial cancer. Int Immunopharmacol 2020; 88:106931. [PMID: 32889237 DOI: 10.1016/j.intimp.2020.106931] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2020] [Revised: 07/23/2020] [Accepted: 08/20/2020] [Indexed: 12/13/2022]
Abstract
Uterine corpus endometrial cancer (UCEC) is one of the most prevalent female malignancies in clinical practice. Due to the lack of effective biomarkers and personalized treatments, the prognosis of advanced-stage EC remains unfavorable. Modulation of the immune microenvironment is closely related to the onset and development of endometrial cancer. In the present study, we attempt to systematically analyze the characteristics of the immune microenvironment of endometrial cancer and investigate its association with clinical features by applying bioinformatics. RNA-Seq in TCGA (The Cancer Genome Atlas) and clinical follow-up information of patents were used for analysis. The Tumor Microenvironment (TME) score infiltration patterns of 523 endometrial cancer patients were evaluated using CIBERSORT. Random forest, multivariable cox analysis were used to build the TME score. Fisher's exact test was used to compare the genes that show significant differences in the frequency of mutations between groups. Two TME phenotypes were defined. There is a significant relationship between the TME score and grade. High TME score samples are highly expressed in immune activation, TGF pathway activation and immune checkpoint genes, and low TME score samples have high frequency mutations of PTEN, CSE1L and ITGB3. Therefore, describing the comprehensive landscape of UCEC's TME characteristics may help explain patients' response to immunotherapy and provide new strategies for cancer treatment.
Collapse
Affiliation(s)
- Guangwei Wang
- Department of Obstetrics and Gynecology, Shengjing Hospital of China Medical University, Shenyang 110004, China
| | - Dandan Wang
- Department of Obstetrics and Gynecology, Shengjing Hospital of China Medical University, Shenyang 110004, China
| | - Meige Sun
- Department of Obstetrics and Gynecology, Shengjing Hospital of China Medical University, Shenyang 110004, China
| | - Xiaofei Liu
- Department of Obstetrics and Gynecology, Shenyang Women's and Children's Hospital, Shenyang 110014, China
| | - Qing Yang
- Department of Obstetrics and Gynecology, Shengjing Hospital of China Medical University, Shenyang 110004, China.
| |
Collapse
|
12
|
Wang L, Li X. Identification of an energy metabolism‑related gene signature in ovarian cancer prognosis. Oncol Rep 2020; 43:1755-1770. [PMID: 32186777 PMCID: PMC7160557 DOI: 10.3892/or.2020.7548] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2019] [Accepted: 01/17/2020] [Indexed: 01/08/2023] Open
Abstract
Changes in energy metabolism may be potential biomarkers and therapeutic targets for cancer as they frequently occur within cancer cells. However, basic cancer research has failed to reach a consistent conclusion on the function(s) of mitochondria in energy metabolism. The significance of energy metabolism in the prognosis of ovarian cancer remains unclear; thus, there remains an urgent need to systematically analyze the characteristics and clinical value of energy metabolism in ovarian cancer. Based on gene expression patterns, the present study aimed to analyze energy metabolism‑associated characteristics to evaluate the prognosis of patients with ovarian cancer. A total of 39 energy metabolism‑related genes significantly associated with prognosis were obtained, and three molecular subtypes were identified by nonnegative matrix factorization clustering, among which the C1 subtype was associated with poor clinical outcomes of ovarian cancer. The immune response was enhanced in the tumor microenvironment. A total of 888 differentially expressed genes were identified in C1 compared with the other subtypes, and the results of the pathway enrichment analysis demonstrated that they were enriched in the 'PI3K‑Akt signaling pathway', 'cAMP signaling pathway', 'ECM‑receptor interaction' and other pathways associated with the development and progression of tumors. Finally, eight characteristic genes (tolloid‑like 1 gene, type XVI collagen, prostaglandin F2α, cartilage intermediate layer protein 2, kinesin family member 26b, interferon inducible protein 27, growth arrest‑specific gene 1 and chemokine receptor 7) were obtained through LASSO feature selection; and a number of them have been demonstrated to be associated with ovarian cancer progression. In addition, Cox regression analysis was performed to establish an 8‑gene signature, which was determined to be an independent prognostic factor for patients with ovarian cancer and could stratify sample risk in the training, test and external validation datasets (P<0.01; AUC >0.8). Gene Set Enrichment Analysis results revealed that the 8‑gene signature was involved in important biological processes and pathways of ovarian cancer. In conclusion, the present study established an 8‑gene signature associated with metabolic genes, which may provide new insights into the effects of energy metabolism on ovarian cancer. The 8‑gene signature may serve as an independent prognostic factor for ovarian cancer patients.
Collapse
Affiliation(s)
- Lei Wang
- Department of Obstetrics and Gynecology, ShengJing Hospital of China Medical University, Shenyang, Liaoning 110004, P.R. China
| | - Xiuqin Li
- Department of Obstetrics and Gynecology, ShengJing Hospital of China Medical University, Shenyang, Liaoning 110004, P.R. China
| |
Collapse
|
13
|
Wu MJ, Gao YL, Liu JX, Zheng CH, Wang J. Integrative Hypergraph Regularization Principal Component Analysis for Sample Clustering and Co-Expression Genes Network Analysis on Multi-Omics Data. IEEE J Biomed Health Inform 2020; 24:1823-1834. [DOI: 10.1109/jbhi.2019.2948456] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
14
|
Zhong Q, Fan J, Chu H, Pang M, Li J, Fan Y, Liu P, Wu C, Qiao J, Li R, Hang J. Integrative analysis of genomic and epigenetic regulation of endometrial cancer. Aging (Albany NY) 2020; 12:9260-9274. [PMID: 32412912 PMCID: PMC7288931 DOI: 10.18632/aging.103202] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2019] [Accepted: 04/17/2020] [Indexed: 11/25/2022]
Abstract
Endometrial carcinomas (EC) are characterized by high DNA copy numbers and DNA methylation aberrations. In this study, we sought to comprehensively explore the effect of these two factors on development and progression of EC by analyzing integrated genomic and epigenetic analysis to. We found high DNA copy number and DNA methylation abnormalities in EC, with 6308 copy-number variation genes (CNV-G) and 4376 methylation genes (MET-G). We used these CNV-G and MET-G to subcategorize the samples for prognostic analysis, and identified three molecular subtypes (iC1, iC2, iC3). Moreover, the subtypes exhibited different tumor immune microenvironment characteristics. A further analysis of their molecular characteristics revealed three potential prognostic markers (KIAA1324, nonexpresser of pathogenesis-related genes1 (NPR1) and idiopathic hypogonadotropic hypogonadism (IHH)). Notably, all three markers showed distinct CNV, DNA methylation, and gene expression profiles. Analysis of mutations among the three subtypes revealed that iC2 had fewer mutations than the other subtypes. Conversely, iC2 showed significantly higher CNV levels than other subtypes. This comprehensive analysis of genomic and epigenetic profiles identified three prognostic markers, therefore, provides new insights into the multi-layered pathology of EC. These can be utilized for accurate treatment of EC patients.
Collapse
Affiliation(s)
- Qihang Zhong
- Center for Reproductive Medicine, Department of Obstetrics and Gynecology, Peking University Third Hospital, Beijing 100191, China.,Institute of Systems Biomedicine, School of Basic Medical Sciences, Peking University Health Science Center, Peking University, Beijing 100191, China
| | - Junpeng Fan
- Department of Obstetrics and Gynecology, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430000, China
| | - Honglei Chu
- Center for Reproductive Medicine, Department of Obstetrics and Gynecology, Peking University Third Hospital, Beijing 100191, China
| | - Mujia Pang
- Center for Reproductive Medicine, Department of Obstetrics and Gynecology, Peking University Third Hospital, Beijing 100191, China
| | - Junsheng Li
- Center for Reproductive Medicine, Department of Obstetrics and Gynecology, Peking University Third Hospital, Beijing 100191, China.,Beijing Key Laboratory of Reproductive Endocrinology and Assisted Reproduction, Beijing 100191, China.,Key Laboratory of Assisted Reproduction, Ministry of Education, Beijing 100191, China
| | - Yong Fan
- Key Laboratory for Major Obstetric Diseases of Guangdong Province, The Third Affiliated Hospital of Guangzhou Medical University, Guangzhou 510150, China
| | - Ping Liu
- Center for Reproductive Medicine, Department of Obstetrics and Gynecology, Peking University Third Hospital, Beijing 100191, China.,Beijing Key Laboratory of Reproductive Endocrinology and Assisted Reproduction, Beijing 100191, China.,Key Laboratory of Assisted Reproduction, Ministry of Education, Beijing 100191, China
| | - Congying Wu
- Institute of Systems Biomedicine, School of Basic Medical Sciences, Peking University Health Science Center, Peking University, Beijing 100191, China
| | - Jie Qiao
- Center for Reproductive Medicine, Department of Obstetrics and Gynecology, Peking University Third Hospital, Beijing 100191, China.,Beijing Key Laboratory of Reproductive Endocrinology and Assisted Reproduction, Beijing 100191, China.,Key Laboratory of Assisted Reproduction, Ministry of Education, Beijing 100191, China.,Peking-Tsinghua Center for Life Sciences, Peking University, Beijing 100871, China
| | - Rong Li
- Center for Reproductive Medicine, Department of Obstetrics and Gynecology, Peking University Third Hospital, Beijing 100191, China.,Beijing Key Laboratory of Reproductive Endocrinology and Assisted Reproduction, Beijing 100191, China.,Key Laboratory of Assisted Reproduction, Ministry of Education, Beijing 100191, China
| | - Jing Hang
- Center for Reproductive Medicine, Department of Obstetrics and Gynecology, Peking University Third Hospital, Beijing 100191, China.,Beijing Key Laboratory of Reproductive Endocrinology and Assisted Reproduction, Beijing 100191, China.,Key Laboratory of Assisted Reproduction, Ministry of Education, Beijing 100191, China
| |
Collapse
|
15
|
Zhi J, Yi J, Tian M, Wang H, Kang N, Zheng X, Gao M. Immune gene signature delineates a subclass of thyroid cancer with unfavorable clinical outcomes. Aging (Albany NY) 2020; 12:5733-5750. [PMID: 32240105 PMCID: PMC7185138 DOI: 10.18632/aging.102963] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2019] [Accepted: 02/25/2020] [Indexed: 12/20/2022]
Abstract
Thyroid cancer (THCA) is a heterogeneous disease with multiple clinical outcomes Immune cells regulate its progression. Three immunomolecular subtypes (C1, C2, C3) were identified in gene expression data sets from TCGA and GEO databases. Among them, subtype C3 had highest frequency of BRAF mutations, lowest frequency of RAS mutations, highest mutation load and shorter progression-free survival. Functional enrichment analysis for the genes revealed that C1 was up-regulated in the ERK cascade pathway, C2 was up-regulated in cell migration and proliferation pathways, while C3 was enriched in body fluid, protein regulation and response to steroid hormones functions. Notably, the three molecular subtypes exhibit differences in immune microenvironments as shown by timer database and analysis of immune expression signatures. The abundance of B_cell, CD4_Tcell, Neutrophil, Macrophage and Dendritic cells in C2 subtype were lower than in C1 and C3 subtypes Leukocyte fraction, proliferation macrophage regulation, lymphocyte infiltration, IFN gamma response and TGF beta response scores were significantly higher in C3 compared with C1 and C2 subtypes. Unlike C3 subtype, it was observed that C1 and C2 subtypes were significantly negatively correlated with most immune checkpoint genes in two different cohorts. The characteristic genes were differentially expressed between cancer cells, adjacent tissues, and metastatic tissues in different cohorts. In summary, THCA can be subclassified into three molecular subtypes with distinct histological types, genetic and transcriptional phenotypes, all of which have potential clinical implications.
Collapse
Affiliation(s)
- Jingtai Zhi
- Department of Thyroid and Neck Tumor, Tianjin Medical University Cancer Institute and Hospital, National Clinical Research Center for Cancer, Key Laboratory of Cancer Prevention and Therapy, Tianjin's Clinical Research Center for Cancer, Tianjin 300060, People's Republic of China
| | - Jiaoyu Yi
- Department of Thyroid and Neck Tumor, Tianjin Medical University Cancer Institute and Hospital, National Clinical Research Center for Cancer, Key Laboratory of Cancer Prevention and Therapy, Tianjin's Clinical Research Center for Cancer, Tianjin 300060, People's Republic of China
| | - Mengran Tian
- Department of Thyroid and Neck Tumor, Tianjin Medical University Cancer Institute and Hospital, National Clinical Research Center for Cancer, Key Laboratory of Cancer Prevention and Therapy, Tianjin's Clinical Research Center for Cancer, Tianjin 300060, People's Republic of China
| | - Huijuan Wang
- Department of Thyroid and Neck Tumor, Tianjin Medical University Cancer Institute and Hospital, National Clinical Research Center for Cancer, Key Laboratory of Cancer Prevention and Therapy, Tianjin's Clinical Research Center for Cancer, Tianjin 300060, People's Republic of China
| | - Ning Kang
- Department of Thyroid and Neck Tumor, Tianjin Medical University Cancer Institute and Hospital, National Clinical Research Center for Cancer, Key Laboratory of Cancer Prevention and Therapy, Tianjin's Clinical Research Center for Cancer, Tianjin 300060, People's Republic of China
| | - Xiangqian Zheng
- Department of Thyroid and Neck Tumor, Tianjin Medical University Cancer Institute and Hospital, National Clinical Research Center for Cancer, Key Laboratory of Cancer Prevention and Therapy, Tianjin's Clinical Research Center for Cancer, Tianjin 300060, People's Republic of China
| | - Ming Gao
- Department of Thyroid and Neck Tumor, Tianjin Medical University Cancer Institute and Hospital, National Clinical Research Center for Cancer, Key Laboratory of Cancer Prevention and Therapy, Tianjin's Clinical Research Center for Cancer, Tianjin 300060, People's Republic of China
| |
Collapse
|
16
|
Abstract
The molecular mechanisms and functions in complex biological systems currently remain elusive. Recent high-throughput techniques, such as next-generation sequencing, have generated a wide variety of multiomics datasets that enable the identification of biological functions and mechanisms via multiple facets. However, integrating these large-scale multiomics data and discovering functional insights are, nevertheless, challenging tasks. To address these challenges, machine learning has been broadly applied to analyze multiomics. This review introduces multiview learning-an emerging machine learning field-and envisions its potentially powerful applications to multiomics. In particular, multiview learning is more effective than previous integrative methods for learning data's heterogeneity and revealing cross-talk patterns. Although it has been applied to various contexts, such as computer vision and speech recognition, multiview learning has not yet been widely applied to biological data-specifically, multiomics data. Therefore, this paper firstly reviews recent multiview learning methods and unifies them in a framework called multiview empirical risk minimization (MV-ERM). We further discuss the potential applications of each method to multiomics, including genomics, transcriptomics, and epigenomics, in an aim to discover the functional and mechanistic interpretations across omics. Secondly, we explore possible applications to different biological systems, including human diseases (e.g., brain disorders and cancers), plants, and single-cell analysis, and discuss both the benefits and caveats of using multiview learning to discover the molecular mechanisms and functions of these systems.
Collapse
Affiliation(s)
- Nam D. Nguyen
- Department of Computer Science, Stony Brook University, Stony Brook, New York, United States of America
| | - Daifeng Wang
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, Wisconsin, United States of America
- Waisman Center, University of Wisconsin-Madison, Madison, Wisconsin, United States of America
| |
Collapse
|
17
|
Jiao CN, Gao YL, Yu N, Liu JX, Qi LY. Hyper-Graph Regularized Constrained NMF for Selecting Differentially Expressed Genes and Tumor Classification. IEEE J Biomed Health Inform 2020; 24:3002-3011. [PMID: 32086224 DOI: 10.1109/jbhi.2020.2975199] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Non-negative Matrix Factorization (NMF) is a dimensionality reduction approach for learning a parts-based and linear representation of non-negative data. It has attracted more attention because of that. In practice, NMF not only neglects the manifold structure of data samples, but also overlooks the priori label information of different classes. In this paper, a novel matrix decomposition method called Hyper-graph regularized Constrained Non-negative Matrix Factorization (HCNMF) is proposed for selecting differentially expressed genes and tumor sample classification. The advantage of hyper-graph learning is to capture local spatial information in high dimensional data. This method incorporates a hyper-graph regularization constraint to consider the higher order data sample relationships. The application of hyper-graph theory can effectively find pathogenic genes in cancer datasets. Besides, the label information is further incorporated in the objective function to improve the discriminative ability of the decomposition matrix. Supervised learning with label information greatly improves the classification effect. We also provide the iterative update rules and convergence proofs for the optimization problems of HCNMF. Experiments under The Cancer Genome Atlas (TCGA) datasets confirm the superiority of HCNMF algorithm compared with other representative algorithms through a set of evaluations.
Collapse
|
18
|
Zhang J, Xiao X, Zhang X, Hua W. Tumor Microenvironment Characterization in Glioblastoma Identifies Prognostic and Immunotherapeutically Relevant Gene Signatures. J Mol Neurosci 2020; 70:738-750. [PMID: 32006162 DOI: 10.1007/s12031-020-01484-0] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2019] [Accepted: 01/17/2020] [Indexed: 12/13/2022]
Abstract
Tumor microenvironment (TME) cells are important elements in tumor tissue. There is increasing evidence that they have important clinical pathological significance in predicting tumor clinical outcomes and therapeutic effects. However, no systematic analysis of TME cell interactions in glioblastoma (GBM) has been reported. We systematically analyzed the transcriptional sequencing data of GBM to find an immune gene marker to predict the clinical results of GBM. First, we downloaded the expression profiles and clinical follow-up information of GBM from The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO). CIBERSORT was used to evaluate the infiltration mode of TME in 757 patients, systematically correlated TME phenotype with genomic characteristics and clinicopathological characteristics of GBM, defined four TME phenotypes, and TMEScore was constructed using algorithms such as random forest and principal component analysis. There is a significant correlation between TMEScore and age of onset. High TMEScore samples are characterized by immune activation, TGF pathway activation, and high expression of immune checkpoint genes, while low TMEScore samples are characterized by high-frequency IDH1 and MET mutations. Therefore, a comprehensive landscape depicting the TME characteristics of GBM may help explain GBM's response to immunotherapy and provide new strategies for cancer treatment. In this study, TMEScore can be used as a new prognostic marker to predict the survival of GBM patients, and as a potential predictor of immune checkpoint inhibitor response.
Collapse
Affiliation(s)
- Jinsen Zhang
- Department of Neurosurgery, Huashan Hospital, Fudan University, No.12 Wulumuqi Zhong Road, Shanghai, 200040, China
| | - Xing Xiao
- Department of Neurosurgery, Huashan Hospital, Fudan University, No.12 Wulumuqi Zhong Road, Shanghai, 200040, China
| | - Xin Zhang
- Department of Neurosurgery, Huashan Hospital, Fudan University, No.12 Wulumuqi Zhong Road, Shanghai, 200040, China
| | - Wei Hua
- Department of Neurosurgery, Huashan Hospital, Fudan University, No.12 Wulumuqi Zhong Road, Shanghai, 200040, China.
| |
Collapse
|
19
|
Cui Z, Liu JX, Gao YL, Zheng CH, Wang J. RCMF: a robust collaborative matrix factorization method to predict miRNA-disease associations. BMC Bioinformatics 2019; 20:686. [PMID: 31874608 PMCID: PMC6929455 DOI: 10.1186/s12859-019-3260-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Background Predicting miRNA-disease associations (MDAs) is time-consuming and expensive. It is imminent to improve the accuracy of prediction results. So it is crucial to develop a novel computing technology to predict new MDAs. Although some existing methods can effectively predict novel MDAs, there are still some shortcomings. Especially when the disease matrix is processed, its sparsity is an important factor affecting the final results. Results A robust collaborative matrix factorization (RCMF) is proposed to predict novel MDAs. The L2,1-norm are introduced to our method to achieve the highest AUC value than other advanced methods. Conclusions 5-fold cross validation is used to evaluate our method, and simulation experiments are used to predict novel associations on Gold Standard Dataset. Finally, our prediction accuracy is better than other existing advanced methods. Therefore, our approach is effective and feasible in predicting novel MDAs.
Collapse
Affiliation(s)
- Zhen Cui
- School of Information Science and Engineering, Qufu Normal University, Rizhao, 276826, China
| | - Jin-Xing Liu
- School of Information Science and Engineering, Qufu Normal University, Rizhao, 276826, China. .,Co-Innovation Center for Information Supply & Assurance Technology, Anhui University, Hefei, 230601, China.
| | - Ying-Lian Gao
- Qufu Normal University Library, Qufu Normal University, Rizhao, 276826, China
| | - Chun-Hou Zheng
- Co-Innovation Center for Information Supply & Assurance Technology, Anhui University, Hefei, 230601, China
| | - Juan Wang
- School of Information Science and Engineering, Qufu Normal University, Rizhao, 276826, China.
| |
Collapse
|