1
|
Hu J, Jiang Q, Mao W, Zhong S, Sun H, Mao K. STARD7 could be an immunological and prognostic biomarker: from pan-cancer analysis to hepatocellular carcinoma validation. Discov Oncol 2024; 15:543. [PMID: 39390226 PMCID: PMC11467145 DOI: 10.1007/s12672-024-01434-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/31/2024] [Accepted: 10/07/2024] [Indexed: 10/12/2024] Open
Abstract
BACKGROUND As the emergence of technologies such as sequencing and gene mapping, significant advancements have been made in understanding the landscape of tumors. However, the effective treatment of tumors continues to pose a tremendous challenge in clinical practice, which highlights the importance of predicting tumor markers and studying drug resistance mechanisms. The prognosis and differential expression of STARD7 in human pan-cancer were investigated by bioinformatic methods and experimental verification. METHODS The expression, diagnostic, and prognostic significance of the STARD7 gene were comprehensive analyzed using bioinformatics techniques. Furthermore, we validated our projected outcomes in liver cancer through experimental methodologies, including the use of qRT-PCR, CCK8 and transwell assays. RESULTS The STARD7 gene exhibits differential expression in 25 tumors, with high expression observed in 22 tumors. These distinct expression patterns within different tumor types are closely associated with poor prognosis and diagnosis. Furthermore, the STARD7 gene plays a role in regulating the tumor immune microenvironment. Methylation levels of STARD7 vary among 20 types of tumors and are correlated with survival outcomes. Furthermore, the experiment results demonstrated that STARD7 is highly expressed in hepatocellular carcinoma cells. Suppression of STARD7 significantly impedes the proliferation, migration, and invasion of HepG-2 and SMMC-7721 cells. CONCLUSIONS STARD7 has the potential to function as a crucial prognostic biomarker and exhibit correlation with tumor immunity in various types of human cancers. The implications of our findings extend to informing cancer immune-therapy and promoting the advancement of precision immune-oncology.
Collapse
Affiliation(s)
- Jie Hu
- Department of Pharmacy, The Quzhou Affiliated Hospital of Wenzhou Medical University, Quzhou People's Hospital, No.100 Minjiang Road, Kecheng District, Quzhou, 324000, Zhejiang, China
| | - Qiu Jiang
- Department of Pharmacy, The Quzhou Affiliated Hospital of Wenzhou Medical University, Quzhou People's Hospital, No.100 Minjiang Road, Kecheng District, Quzhou, 324000, Zhejiang, China
| | - Weili Mao
- Department of Pharmacy, The Quzhou Affiliated Hospital of Wenzhou Medical University, Quzhou People's Hospital, No.100 Minjiang Road, Kecheng District, Quzhou, 324000, Zhejiang, China
| | - Songyang Zhong
- Department of Pharmacy, The Quzhou Affiliated Hospital of Wenzhou Medical University, Quzhou People's Hospital, No.100 Minjiang Road, Kecheng District, Quzhou, 324000, Zhejiang, China
| | - Huayu Sun
- Department of Pharmacy, The Quzhou Affiliated Hospital of Wenzhou Medical University, Quzhou People's Hospital, No.100 Minjiang Road, Kecheng District, Quzhou, 324000, Zhejiang, China.
| | - Kaili Mao
- Department of Pharmacy, The Quzhou Affiliated Hospital of Wenzhou Medical University, Quzhou People's Hospital, No.100 Minjiang Road, Kecheng District, Quzhou, 324000, Zhejiang, China.
| |
Collapse
|
2
|
Wang T, Zhuo L, Chen Y, Fu X, Zeng X, Zou Q. ECD-CDGI: An efficient energy-constrained diffusion model for cancer driver gene identification. PLoS Comput Biol 2024; 20:e1012400. [PMID: 39213450 PMCID: PMC11392234 DOI: 10.1371/journal.pcbi.1012400] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2023] [Revised: 09/12/2024] [Accepted: 08/10/2024] [Indexed: 09/04/2024] Open
Abstract
The identification of cancer driver genes (CDGs) poses challenges due to the intricate interdependencies among genes and the influence of measurement errors and noise. We propose a novel energy-constrained diffusion (ECD)-based model for identifying CDGs, termed ECD-CDGI. This model is the first to design an ECD-Attention encoder by combining the ECD technique with an attention mechanism. ECD-Attention encoder excels at generating robust gene representations that reveal the complex interdependencies among genes while reducing the impact of data noise. We concatenate topological embedding extracted from gene-gene networks through graph transformers to these gene representations. We conduct extensive experiments across three testing scenarios. Extensive experiments show that the ECD-CDGI model possesses the ability to not only be proficient in identifying known CDGs but also efficiently uncover unknown potential CDGs. Furthermore, compared to the GNN-based approach, the ECD-CDGI model exhibits fewer constraints by existing gene-gene networks, thereby enhancing its capability to identify CDGs. Additionally, ECD-CDGI is open-source and freely available. We have also launched the model as a complimentary online tool specifically crafted to expedite research efforts focused on CDGs identification.
Collapse
Affiliation(s)
- Tao Wang
- School of Data Science and Artificial Intelligence, Wenzhou University of Technology, Wenzhou, China
| | - Linlin Zhuo
- School of Data Science and Artificial Intelligence, Wenzhou University of Technology, Wenzhou, China
| | - Yifan Chen
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, China
| | - Xiangzheng Fu
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, China
| | - Xiangxiang Zeng
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, China
| | - Quan Zou
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China
| |
Collapse
|
3
|
Taheri G, Habibi M. Uncovering driver genes in breast cancer through an innovative machine learning mutational analysis method. Comput Biol Med 2024; 171:108234. [PMID: 38430742 DOI: 10.1016/j.compbiomed.2024.108234] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2023] [Revised: 01/25/2024] [Accepted: 02/25/2024] [Indexed: 03/05/2024]
Abstract
Breast cancer has become a severe public health concern and one of the leading causes of cancer-related death in women worldwide. Several genes and mutations in these genes linked to breast cancer have been identified using sophisticated techniques, despite the fact that the exact cause of breast cancer is still unknown. A commonly used feature for identifying driver mutations is the recurrence of a mutation in patients. Nevertheless, some mutations are more likely to occur than others for various reasons. Sequencing analysis has shown that cancer-driving genes operate across complex networks, often with mutations appearing in a modular pattern. In this work, as a retrospective study, we used TCGA data, which is gathered from breast cancer patients. We introduced a new machine-learning approach to examine gene functionality in networks derived from mutation associations, gene-gene interactions, and graph clustering for breast cancer analysis. These networks have uncovered crucial biological components in critical pathways, particularly those that exhibit low-frequency mutations. The statistical strength of the clinical study is significantly boosted by evaluating the network as a whole instead of just single gene effects. Our method successfully identified essential driver genes with diverse mutation frequencies. We then explored the functions of these potential driver genes and their related pathways. By uncovering low-frequency genes, we shed light on understudied pathways associated with breast cancer. Additionally, we present a novel Monte Carlo-based algorithm to identify driver modules in breast cancer. Our findings highlight the significance and role of these modules in critical signaling pathways in breast cancer, providing a comprehensive understanding of breast cancer development. Materials and implementations are available at: [https://github.com/MahnazHabibi/BreastCancer].
Collapse
Affiliation(s)
- Golnaz Taheri
- Department of Computer and Systems Sciences, Stockholm University, Stockholm, Sweden; Science for Life Laboratory, Stockholm, Sweden.
| | - Mahnaz Habibi
- Department of Mathematics, Qazvin Branch, Islamic Azad University, Qazvin, Iran
| |
Collapse
|
4
|
Ali Shah A, Shaker ASA, Jabbar S, Abbas Q, Al-Balawi TS, Celebi ME. An ensemble-based deep learning model for detection of mutation causing cutaneous melanoma. Sci Rep 2023; 13:22251. [PMID: 38097641 PMCID: PMC10721601 DOI: 10.1038/s41598-023-49075-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2023] [Accepted: 12/04/2023] [Indexed: 12/17/2023] Open
Abstract
When the mutation affects the melanocytes of the body, a condition called melanoma results which is one of the deadliest skin cancers. Early detection of cutaneous melanoma is vital for raising the chances of survival. Melanoma can be due to inherited defective genes or due to environmental factors such as excessive sun exposure. The accuracy of the state-of-the-art computer-aided diagnosis systems is unsatisfactory. Moreover, the major drawback of medical imaging is the shortage of labeled data. Generalized classifiers are required to diagnose melanoma to avoid overfitting the dataset. To address these issues, blending ensemble-based deep learning (BEDLM-CMS) model is proposed to detect mutation of cutaneous melanoma by integrating long short-term memory (LSTM), Bi-directional LSTM (BLSTM) and gated recurrent unit (GRU) architectures. The dataset used in the proposed study contains 2608 human samples and 6778 mutations in total along with 75 types of genes. The most prominent genes that function as biomarkers for early diagnosis and prognosis are utilized. Multiple extraction techniques are used in this study to extract the most-prominent features. Afterwards, we applied different DL models optimized through grid search technique to diagnose melanoma. The validity of the results is confirmed using several techniques, including tenfold cross validation (10-FCVT), independent set (IST), and self-consistency (SCT). For validation of the results multiple metrics are used which include accuracy, specificity, sensitivity, and Matthews's correlation coefficient. BEDLM gives the highest accuracy of 97% in the independent set test whereas in self-consistency test and tenfold cross validation test it gives 94% and 93% accuracy, respectively. Accuracy of in self-consistency test, independent set test, and tenfold cross validation test is LSTM (96%, 94%, 92%), GRU (93%, 94%, 91%), and BLSTM (99%, 98%, 93%), respectively. The findings demonstrate that the proposed BEDLM-CMS can be used effectively applied for early diagnosis and treatment efficacy evaluation of cutaneous melanoma.
Collapse
Affiliation(s)
- Asghar Ali Shah
- Department of Computer Science, Bahria University, Islamabad, Pakistan
| | | | - Sohail Jabbar
- College of Computer and Information Sciences, Imam Mohammad Ibn Saud Islamic University (IMSIU), 11432, Riyadh, Saudi Arabia
| | - Qaisar Abbas
- College of Computer and Information Sciences, Imam Mohammad Ibn Saud Islamic University (IMSIU), 11432, Riyadh, Saudi Arabia.
| | - Talal Saad Al-Balawi
- College of Computer and Information Sciences, Imam Mohammad Ibn Saud Islamic University (IMSIU), 11432, Riyadh, Saudi Arabia
| | - M Emre Celebi
- Department of Computer Science and Engineering, University of Central Arkansas, 201 Donaghey Ave., Conway, AR, 72035, USA
| |
Collapse
|
5
|
Lazebnik T, Simon-Keren L. Cancer-inspired genomics mapper model for the generation of synthetic DNA sequences with desired genomics signatures. Comput Biol Med 2023; 164:107221. [PMID: 37478715 DOI: 10.1016/j.compbiomed.2023.107221] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2023] [Revised: 06/16/2023] [Accepted: 06/30/2023] [Indexed: 07/23/2023]
Abstract
Genome data are crucial in modern medicine, offering significant potential for diagnosis and treatment. Thanks to technological advancements, many millions of healthy and diseased genomes have already been sequenced; however, obtaining the most suitable data for a specific study, and specifically for validation studies, remains challenging with respect to scale and access. Therefore, in silico genomics sequence generators have been proposed as a possible solution. However, the current generators produce inferior data using mostly shallow (stochastic) connections, detected with limited computational complexity in the training data. This means they do not take the appropriate biological relations and constraints, that originally caused the observed connections, into consideration. To address this issue, we propose cancer-inspired genomics mapper model (CGMM), that combines genetic algorithm (GA) and deep learning (DL) methods to tackle this challenge. CGMM mimics processes that generate genetic variations and mutations to transform readily available control genomes into genomes with the desired phenotypes. We demonstrate that CGMM can generate synthetic genomes of selected phenotypes such as ancestry and cancer that are indistinguishable from real genomes of such phenotypes, based on unsupervised clustering. Our results show that CGMM outperforms four current state-of-the-art genomics generators on two different tasks, suggesting that CGMM will be suitable for a wide range of purposes in genomic medicine, especially for much-needed validation studies.
Collapse
Affiliation(s)
- Teddy Lazebnik
- Department of Cancer Biology, Cancer Institute, University College London, London, UK.
| | | |
Collapse
|