1
|
Song J, Song Z, Zhang J, Gong Y. Privacy-Preserving Identification of Cancer Subtype-Specific Driver Genes Based on Multigenomics Data with Privatedriver. J Comput Biol 2024; 31:99-116. [PMID: 38271572 DOI: 10.1089/cmb.2023.0115] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2024] Open
Abstract
Identifying cancer subtype-specific driver genes from a large number of irrelevant passengers is crucial for targeted therapy in cancer treatment. Recently, the rapid accumulation of large-scale cancer genomics data from multiple institutions has presented remarkable opportunities for identification of cancer subtype-specific driver genes. However, the insufficient subtype samples, privacy issues, and heterogenous of aberration events pose great challenges in precisely identifying cancer subtype-specific driver genes. To address this, we introduce privatedriver, the first model for identifying subtype-specific driver genes that integrates genomics data from multiple institutions in a data privacy-preserving collaboration manner. The process of identifying subtype-specific cancer driver genes using privatedriver involves the following two steps: genomics data integration and collaborative training. In the integration process, the aberration events from multiple genomics data sources are combined for each institution using the forward and backward propagation method of NetICS. In the collaborative training process, each institution utilizes the federated learning framework to upload encrypted model parameters instead of raw data of all institutions to train a global model by using the non-negative matrix factorization algorithm. We applied privatedriver on head and neck squamous cell and colon cancer from The Cancer Genome Atlas website and evaluated it with two benchmarks using macro-Fscore. The comparison analysis demonstrates that privatedriver achieves comparable results to centralized learning models and outperforms most other nonprivacy preserving models, all while ensuring the confidentiality of patient information. We also demonstrate that, for varying predicted driver gene distributions in subtype, our model fully considers the heterogeneity of subtype and identifies subtype-specific driver genes corresponding to the given prognosis and therapeutic effect. The success of privatedriver reveals the feasibility and effectiveness of identifying cancer subtype-specific driver genes in a data protection manner, providing new insights for future privacy-preserving driver gene identification studies.
Collapse
Affiliation(s)
- Junrong Song
- School of Information; Kunming, P.R. China
- Yunnan Key Laboratory of Service Computing; Yunnan University of Finance and Economics, Kunming, P.R. China
| | - Zhiming Song
- School of Information; Kunming, P.R. China
- Yunnan Key Laboratory of Service Computing; Yunnan University of Finance and Economics, Kunming, P.R. China
| | - Jinpeng Zhang
- School of Information; Kunming, P.R. China
- Yunnan Key Laboratory of Service Computing; Yunnan University of Finance and Economics, Kunming, P.R. China
- The School of Computer Science and Engineering, Yunnan University, Kunming, P.R. China
| | | |
Collapse
|
2
|
Cao Y, Wu S, Gu Y, Wong YH, Shi Y, Zhang L. Disulfidptosis-related PABPC3 promotes tumor progression and inhibits immune activity in osteosarcoma. J Gene Med 2024; 26:e3641. [PMID: 38058264 DOI: 10.1002/jgm.3641] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2023] [Revised: 10/10/2023] [Accepted: 11/05/2023] [Indexed: 12/08/2023] Open
Abstract
BACKGROUND Osteosarcoma is a very aggressive bone tumor mainly affecting teens and young adults. Disulfidptosis is a metabolic-related form of regulated cell death. However, the interconnection between disulfidptosis and osteosarcoma has not been explored. METHODS In the present study, disulfidptosis-related clusters were identified in osteosarcoma using the nonnegative matrix factorization clustering method. PABPC3 was identified as a hazardous gene in osteosarcoma using machine learning algorithms, CoxBoost, and Random Survival Forest. The prognostic value, pathway annotation, immune characteristics, and drug prediction of PABPC3 were systematically explored. MTT (i.e., 3-(4, 5-dimethyl thiazol-2-yl)-2,5-diphenytetrazolium bromide), EdU (ie. 5-ethyny-2'-deoxvuridine), and Transwell assays were used for in vitro validation of PABPC3. RESULTS The disulfidptosis-related clusters could distinguish survival outcomes of osteosarcoma patients. PABPC3 could predict survival outcomes, immune activity, and drug response in osteosarcoma patients. Besides, PABPC3 was proven to facilitate the proliferation and migration of osteosarcoma. CONCLUSIONS The present study is expected to establish the bridge between disulfidptosis and osteosarcoma. PABPC3 is expected to be further explored as a therapeutic target in osteosarcoma.
Collapse
Affiliation(s)
- Yangbo Cao
- Department of Orthopaedics, The Third Xiangya Hospital, Central South University, Changsha, China
| | - Song Wu
- Department of Orthopaedics, The Third Xiangya Hospital, Central South University, Changsha, China
| | - Yishan Gu
- Division of Life Science and the Biotechnology Research Institute, Hong Kong University of Science and Technology, Hong Kong, China
| | - Yung Hou Wong
- Division of Life Science and the Biotechnology Research Institute, Hong Kong University of Science and Technology, Hong Kong, China
| | - Yanbin Shi
- Department of Orthopaedics, The Third Xiangya Hospital, Central South University, Changsha, China
| | - Lina Zhang
- Department of Orthopaedics, Hunan Provincial People's Hospital, Hunan Normal University, Changsha, China
| |
Collapse
|
3
|
Zarei Ghobadi M, Afsaneh E, Emamzadeh R. Gene biomarkers and classifiers for various subtypes of HTLV-1-caused ATLL cancer identified by a combination of differential gene co‑expression and support vector machine algorithms. Med Microbiol Immunol 2023:10.1007/s00430-023-00767-8. [PMID: 37222763 DOI: 10.1007/s00430-023-00767-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2022] [Accepted: 05/12/2023] [Indexed: 05/25/2023]
Abstract
Adult T-cell leukemia/lymphoma (ATLL) is pathogen-caused cancer that is progressed after the infection by human T-cell leukemia virus type 1. Four significant subtypes comprising acute, lymphoma, chronic, and smoldering have been identified for this cancer. However, there are no trustworthy prognostic biomarkers for these subtypes. We utilized a combination of two powerful network-based and machine-learning algorithms including differential co-expressed genes (DiffCoEx) and support vector machine-recursive feature elimination with cross-validation (SVM-RFECV) methods to categorize disparate ATLL subtypes from asymptomatic carriers (ACs). The results disclosed the significant involvement of CBX6, CNKSR1, and MAX in chronic, MYH10 and P2RY1 in acute, C22orf46 and HNRNPA0 in smoldering subtypes. These genes also can classify each ATLL subtype from AC carriers. The integration of the results of two powerful algorithms led to the identification of reliable gene classifiers and biomarkers for diverse ATLL subtypes.
Collapse
Affiliation(s)
- Mohadeseh Zarei Ghobadi
- Department of Cell and Molecular Biology and Microbiology, Faculty of Biological Science and Technology, University of Isfahan, Isfahan, Iran.
| | | | - Rahman Emamzadeh
- Department of Cell and Molecular Biology and Microbiology, Faculty of Biological Science and Technology, University of Isfahan, Isfahan, Iran.
| |
Collapse
|
4
|
Ali R, Sultan A, Ishrat R, Haque S, Khan NJ, Prieto MA. Identification of New Key Genes and Their Association with Breast Cancer Occurrence and Poor Survival Using In Silico and In Vitro Methods. Biomedicines 2023; 11:biomedicines11051271. [PMID: 37238942 DOI: 10.3390/biomedicines11051271] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2023] [Revised: 02/20/2023] [Accepted: 03/07/2023] [Indexed: 05/28/2023] Open
Abstract
Breast cancer is one of the most prevalent types of cancer diagnosed globally and continues to have a significant impact on the global number of cancer deaths. Despite all efforts of epidemiological and experimental research, therapeutic concepts in cancer are still unsatisfactory. Gene expression datasets are widely used to discover the new biomarkers and molecular therapeutic targets in diseases. In the present study, we analyzed four datasets using R packages with accession number GSE29044, GSE42568, GSE89116, and GSE109169 retrieved from NCBI-GEO and differential expressed genes (DEGs) were identified. Protein-protein interaction (PPI) network was constructed to screen the key genes. Subsequently, the GO function and KEGG pathways were analyzed to determine the biological function of key genes. Expression profile of key genes was validated in MCF-7 and MDA-MB-231 human breast cancer cell lines using qRT-PCR. Overall expression level and stage wise expression pattern of key genes was determined by GEPIA. The bc-GenExMiner was used to compare expression level of genes among groups of patients with respect to age factor. OncoLnc was used to analyze the effect of expression levels of LAMA2, TIMP4, and TMTC1 on the survival of breast cancer patients. We identified nine key genes, of which COL11A1, MMP11, and COL10A1 were found up-regulated and PCOLCE2, LAMA2, TMTC1, ADAMTS5, TIMP4, and RSPO3 were found down-regulated. Similar expression pattern of seven among nine genes (except ADAMTS5 and RSPO3) was observed in MCF-7 and MDA-MB-231 cells. Further, we found that LAMA2, TMTC1, and TIMP4 were significantly expressed among different age groups of patients. LAMA2 and TIMP4 were found significantly associated and TMTC1 was found less correlated with breast cancer occurrence. We found that the expression level of LAMA2, TIMP4, and TMTC1 was abnormal in all TCGA tumors and significantly associated with poor survival.
Collapse
Affiliation(s)
- Rafat Ali
- Department of Biosciences, Jamia Millia Islamia (A Central University), New Delhi 110025, India
| | - Armiya Sultan
- Department of Biosciences, Jamia Millia Islamia (A Central University), New Delhi 110025, India
| | - Romana Ishrat
- Center for Interdisciplinary Research in Basic Sciences, Jamia Millia Islamia (A Central University), New Delhi 110025, India
| | - Shafiul Haque
- Research and Scientific Studies Unit, College of Nursing and Allied Health Sciences, Jazan University, Jazan 45142, Saudi Arabia
- Gilbert and Rose-Marie Chagoury School of Medicine, Lebanese American University, Beirut P.O. Box 36, Lebanon
- Centre of Medical and Bio-Allied Health Sciences Research, Ajman University, Ajman P.O. Box 346, United Arab Emirates
| | - Nida Jamil Khan
- Department of Biosciences, Jamia Millia Islamia (A Central University), New Delhi 110025, India
| | - Miguel Angel Prieto
- Nutrition and Bromatology Group, Department of Analytical Chemistry and Food Science, Faculty of Science, Universidade de Vigo, E32004 Ourense, Spain
| |
Collapse
|
5
|
Ghobadi MZ, Afsaneh E, Emamzadeh R, Soroush M. Potential miRNA-gene interactions determining progression of various ATLL cancer subtypes after infection by HTLV-1 oncovirus. BMC Med Genomics 2023; 16:62. [PMID: 36978083 PMCID: PMC10045051 DOI: 10.1186/s12920-023-01492-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2022] [Accepted: 03/17/2023] [Indexed: 03/30/2023] Open
Abstract
BACKGROUND Adult T-cell Leukemia/Lymphoma (ATLL) is a rapidly progressing type of T-cell non-Hodgkin lymphoma that is developed after the infection by human T-cell leukemia virus type 1 (HTLV-1). It could be categorized into four major subtypes, acute, lymphoma, chronic, and smoldering. These different subtypes have some shared clinical manifestations, and there are no trustworthy biomarkers for diagnosis of them. METHODS We applied weighted-gene co-expression network analysis to find the potential gene and miRNA biomarkers for various ATLL subtypes. Afterward, we found reliable miRNA-gene interactions by identifying the experimentally validated-target genes of miRNAs. RESULTS The outcomes disclosed the interactions of miR-29b-2-5p and miR-342-3p with LSAMP in ATLL_acute, miR-575 with UBN2, miR-342-3p with ZNF280B, and miR-342-5p with FOXRED2 in ATLL_chronic, miR-940 and miR-423-3p with C6orf141, miR-940 and miR-1225-3p with CDCP1, and miR-324-3p with COL14A1 in ATLL_smoldering. These miRNA-gene interactions determine the molecular factors involved in the pathogenesis of each ATLL subtype and the unique ones could be considered biomarkers. CONCLUSION The above-mentioned miRNAs-genes interactions are suggested as diagnostic biomarkers for different ATLL subtypes.
Collapse
Affiliation(s)
- Mohadeseh Zarei Ghobadi
- Department of Cell and Molecular Biology and Microbiology, Faculty of Biological Science and Technology, University of Isfahan, Isfahan, Iran.
| | | | - Rahman Emamzadeh
- Department of Cell and Molecular Biology and Microbiology, Faculty of Biological Science and Technology, University of Isfahan, Isfahan, Iran.
| | - Mona Soroush
- Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran, Iran
| |
Collapse
|
6
|
Shayeghpour A, Forghani-Ramandi MM, Solouki S, Hosseini A, Hosseini P, Khodayar S, Hasani M, Aghajanian S, Siami Z, Zarei Ghobadi M, Mozhgani SH. Identification of novel miRNAs potentially involved in the pathogenesis of adult T-cell leukemia/lymphoma using WGCNA followed by RT-qPCR test of hub genes. Infect Agent Cancer 2023; 18:12. [PMID: 36841815 PMCID: PMC9968414 DOI: 10.1186/s13027-023-00492-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2022] [Accepted: 02/17/2023] [Indexed: 02/27/2023] Open
Abstract
BACKGROUND Adult T-cell Lymphoma/Leukemia (ATLL) is characterized by the malignant proliferation of T-cells in Human T-Lymphotropic Virus Type 1 and a high mortality rate. Considering the emerging roles of microRNAs (miRNAs) in various malignancies, the analysis of high-throughput miRNA data employing computational algorithms helps to identify potential biomarkers. METHODS Weighted gene co-expression network analysis was utilized to analyze miRNA microarray data from ATLL and healthy uninfected samples. To identify miRNAs involved in the progression of ATLL, module preservation analysis was used. Subsequently, based on the target genes of the identified miRNAs, the STRING database was employed to construct protein-protein interaction networks (PPIN). Real-time quantitative PCR was also performed to validate the expression of identified hub genes in the PPIN network. RESULTS After constructing co-expression modules and then performing module preservation analysis, four out of 15 modules were determined as ATLL-specific modules. Next, the hub miRNA including hsa-miR-18a-3p, has-miR-187-5p, hsa-miR-196a-3p, and hsa-miR-346 were found as hub miRNAs. The protein-protein interaction networks were constructed for the target genes of each hub miRNA and hub genes were identified. Among them, UBB, RPS15A, and KMT2D were validated by Reverse-transcriptase PCR in ATLL patients. CONCLUSION The results of the network analysis of miRNAs and their target genes revealed the major players in the pathogenesis of ATLL. Further studies are required to confirm the role of these molecular factors and to discover their potential benefits as treatment targets and diagnostic biomarkers.
Collapse
Affiliation(s)
- Ali Shayeghpour
- grid.411705.60000 0001 0166 0922School of Medicine, Alborz University of Medical Sciences, Karaj, Iran
| | | | - Setayesh Solouki
- grid.411705.60000 0001 0166 0922School of Medicine, Alborz University of Medical Sciences, Karaj, Iran
| | - Amin Hosseini
- Department of Computer, Faculty of Engineering, Raja University, Qazvin, Iran
| | - Parastoo Hosseini
- grid.411705.60000 0001 0166 0922Department of Virology, School of Public Health, Tehran University of Medical Sciences, Tehran, Iran ,grid.411705.60000 0001 0166 0922Research Center for Clinical Virology, Tehran University of Medical Sciences, Tehran, Iran
| | - Sara Khodayar
- grid.411705.60000 0001 0166 0922Department of Microbiology, School of Medicine, Alborz University of Medical Sciences, Karaj, Iran
| | - Mahsa Hasani
- grid.411705.60000 0001 0166 0922School of Medicine, Alborz University of Medical Sciences, Karaj, Iran
| | - Sepehr Aghajanian
- grid.411705.60000 0001 0166 0922School of Medicine, Alborz University of Medical Sciences, Karaj, Iran
| | - Zeinab Siami
- grid.411705.60000 0001 0166 0922Department of Infectious Diseases, School of Medicine, Alborz University of Medical Sciences, Karaj, Iran
| | | | - Sayed-Hamidreza Mozhgani
- Department of Microbiology, School of Medicine, Alborz University of Medical Sciences, Karaj, Iran. .,Non-Communicable Disease Research Center, Alborz University of Medical Sciences, Karaj, Iran.
| |
Collapse
|
7
|
Ghobadi MZ, Emamzadeh R, Afsaneh E. Exploration of mRNAs and miRNA classifiers for various ATLL cancer subtypes using machine learning. BMC Cancer 2022; 22:433. [PMID: 35449091 PMCID: PMC9026691 DOI: 10.1186/s12885-022-09540-1] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2022] [Accepted: 04/14/2022] [Indexed: 11/29/2022] Open
Abstract
Background Adult T-cell Leukemia/Lymphoma (ATLL) is a cancer disease that is developed due to the infection by human T-cell leukemia virus type 1. It can be classified into four main subtypes including, acute, chronic, smoldering, and lymphoma. Despite the clinical manifestations, there are no reliable diagnostic biomarkers for the classification of these subtypes. Methods Herein, we employed a machine learning approach, namely, Support Vector Machine-Recursive Feature Elimination with Cross-Validation (SVM-RFECV) to classify the different ATLL subtypes from Asymptomatic Carriers (ACs). The expression values of multiple mRNAs and miRNAs were used as the features. Afterward, the reliable miRNA-mRNA interactions for each subtype were identified through exploring the experimentally validated-target genes of miRNAs. Results The results revealed that miR-21 and its interactions with DAAM1 and E2F2 in acute, SMAD7 in chronic, MYEF2 and PARP1 in smoldering subtypes could significantly classify the diverse subtypes. Conclusions Considering the high accuracy of the constructed model, the identified mRNAs and miRNA are proposed as the potential therapeutic targets and the prognostic biomarkers for various ATLL subtypes. Supplementary Information The online version contains supplementary material available at 10.1186/s12885-022-09540-1.
Collapse
Affiliation(s)
- Mohadeseh Zarei Ghobadi
- Department of Cell and Molecular Biology and Microbiology, Faculty of Biological Science and Technology, University of Isfahan, Isfahan, Iran.
| | - Rahman Emamzadeh
- Department of Cell and Molecular Biology and Microbiology, Faculty of Biological Science and Technology, University of Isfahan, Isfahan, Iran.
| | - Elaheh Afsaneh
- Department of Physics, University of Isfahan, Hezar Jarib, Isfahan, 81746, Iran
| |
Collapse
|