1
|
Wang X, Yang K, Jia T, Gu F, Wang C, Xu K, Shu Z, Xia J, Zhu Q, Zhou X. KDGene: knowledge graph completion for disease gene prediction using interactional tensor decomposition. Brief Bioinform 2024; 25:bbae161. [PMID: 38605639 PMCID: PMC11009469 DOI: 10.1093/bib/bbae161] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2023] [Revised: 02/20/2024] [Accepted: 03/13/2024] [Indexed: 04/13/2024] Open
Abstract
The accurate identification of disease-associated genes is crucial for understanding the molecular mechanisms underlying various diseases. Most current methods focus on constructing biological networks and utilizing machine learning, particularly deep learning, to identify disease genes. However, these methods overlook complex relations among entities in biological knowledge graphs. Such information has been successfully applied in other areas of life science research, demonstrating their effectiveness. Knowledge graph embedding methods can learn the semantic information of different relations within the knowledge graphs. Nonetheless, the performance of existing representation learning techniques, when applied to domain-specific biological data, remains suboptimal. To solve these problems, we construct a biological knowledge graph centered on diseases and genes, and develop an end-to-end knowledge graph completion framework for disease gene prediction using interactional tensor decomposition named KDGene. KDGene incorporates an interaction module that bridges entity and relation embeddings within tensor decomposition, aiming to improve the representation of semantically similar concepts in specific domains and enhance the ability to accurately predict disease genes. Experimental results show that KDGene significantly outperforms state-of-the-art algorithms, whether existing disease gene prediction methods or knowledge graph embedding methods for general domains. Moreover, the comprehensive biological analysis of the predicted results further validates KDGene's capability to accurately identify new candidate genes. This work proposes a scalable knowledge graph completion framework to identify disease candidate genes, from which the results are promising to provide valuable references for further wet experiments. Data and source codes are available at https://github.com/2020MEAI/KDGene.
Collapse
Affiliation(s)
| | - Kuo Yang
- Corresponding author: Kuo Yang and Xuezhong Zhou, Institute of Medical Intelligence, Beijing Key Lab of Traffic Data Analysis and Mining, School of Computer Science & Technology, Beijing Jiaotong University, Beijing 100044, China. E-mail: and
| | | | | | | | | | | | | | | | - Xuezhong Zhou
- Corresponding author: Kuo Yang and Xuezhong Zhou, Institute of Medical Intelligence, Beijing Key Lab of Traffic Data Analysis and Mining, School of Computer Science & Technology, Beijing Jiaotong University, Beijing 100044, China. E-mail: and
| |
Collapse
|
2
|
Zhang P, Zhang D, Zhou W, Wang L, Wang B, Zhang T, Li S. Network pharmacology: towards the artificial intelligence-based precision traditional Chinese medicine. Brief Bioinform 2023; 25:bbad518. [PMID: 38197310 PMCID: PMC10777171 DOI: 10.1093/bib/bbad518] [Citation(s) in RCA: 56] [Impact Index Per Article: 28.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2023] [Revised: 11/03/2023] [Accepted: 11/30/2023] [Indexed: 01/11/2024] Open
Abstract
Network pharmacology (NP) provides a new methodological perspective for understanding traditional medicine from a holistic perspective, giving rise to frontiers such as traditional Chinese medicine network pharmacology (TCM-NP). With the development of artificial intelligence (AI) technology, it is key for NP to develop network-based AI methods to reveal the treatment mechanism of complex diseases from massive omics data. In this review, focusing on the TCM-NP, we summarize involved AI methods into three categories: network relationship mining, network target positioning and network target navigating, and present the typical application of TCM-NP in uncovering biological basis and clinical value of Cold/Hot syndromes. Collectively, our review provides researchers with an innovative overview of the methodological progress of NP and its application in TCM from the AI perspective.
Collapse
Affiliation(s)
- Peng Zhang
- Institute for TCM-X, MOE Key Laboratory of Bioinformatics/Bioinformatics Division, BNRIST, Department of Automation, Tsinghua University, Beijing 100084, China
| | - Dingfan Zhang
- Institute for TCM-X, MOE Key Laboratory of Bioinformatics/Bioinformatics Division, BNRIST, Department of Automation, Tsinghua University, Beijing 100084, China
| | - Wuai Zhou
- China Mobile Information System Integration Co., Ltd, Beijing 100032, China
| | - Lan Wang
- Institute for TCM-X, MOE Key Laboratory of Bioinformatics/Bioinformatics Division, BNRIST, Department of Automation, Tsinghua University, Beijing 100084, China
| | - Boyang Wang
- Institute for TCM-X, MOE Key Laboratory of Bioinformatics/Bioinformatics Division, BNRIST, Department of Automation, Tsinghua University, Beijing 100084, China
| | - Tingyu Zhang
- Institute for TCM-X, MOE Key Laboratory of Bioinformatics/Bioinformatics Division, BNRIST, Department of Automation, Tsinghua University, Beijing 100084, China
| | - Shao Li
- Institute for TCM-X, MOE Key Laboratory of Bioinformatics/Bioinformatics Division, BNRIST, Department of Automation, Tsinghua University, Beijing 100084, China
| |
Collapse
|
3
|
Xu H, Fu X, Liu B, Weng S, Guo C, Quan L, Liu L, Wang L, Xing Z, Cheng Q, Luo P, Chen K, Liu Z, Han X. Immune perturbation network identifies an EMT subtype with chromosomal instability and tumor immune-desert microenvironment. iScience 2023; 26:107871. [PMID: 37766999 PMCID: PMC10520355 DOI: 10.1016/j.isci.2023.107871] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2023] [Revised: 08/11/2023] [Accepted: 09/06/2023] [Indexed: 09/29/2023] Open
Abstract
Most gastric cancer (GC) subtypes are identified through transcriptional profiling overlooking dynamic changes and interactions in gene expression. Based on the background network of global immune genes, we constructed sample-specific edge-perturbation matrices and identified four molecular network subtypes of GC (MNG). MNG-1 displayed the best prognosis and vigorous cell cycle activity. MNG-2 was enriched by immune-hot phenotype with the potential for immunotherapy response. MNG-3 and MNG-4 were identified with epithelial-mesenchymal transition (EMT) peculiarity and worse prognosis, termed EMT subtypes. MNG-3 was characterized by low mutational burden and stromal cells and considered a replica of previous subtypes associated with poor prognosis. Notably, MNG-4 was considered a previously undefined subtype with a dismal prognosis, characterized by chromosomal instability and immune-desert microenvironment. This subtype tended to metastasize and was resistant to respond to immunotherapy. Pharmacogenomics analysis showed three therapeutic agents (NVP-BEZ235, LY2606368, and rutin) were potential interventions for MNG-4.
Collapse
Affiliation(s)
- Hui Xu
- Department of Interventional Radiology, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, Henan 450052, China
| | - Xinyu Fu
- Genetic and Prenatal Diagnosis Center, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, China
| | - Ben Liu
- Key Laboratory of Molecular Cancer Epidemiology of Tianjin, Department of Epidemiology and Biostatistics, National Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin 300060, China
| | - Siyuan Weng
- Department of Interventional Radiology, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, Henan 450052, China
| | - Chunguang Guo
- Department of Endovascular Surgery, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, Henan 450052, China
| | - Libo Quan
- Department of Gastroenterology and Hepatology, First Affiliated Hospital of Zhengzhou University, Zhengzhou, Henan 450052, China
| | - Long Liu
- Department of Hepatobiliary and Pancreatic Surgery, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, Henan, China
| | - Libo Wang
- Department of Hepatobiliary and Pancreatic Surgery, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, Henan, China
| | - Zhe Xing
- Department of Neurosurgery, The Fifth Affiliated Hospital of Zhengzhou University, Zhengzhou, China
| | - Quan Cheng
- Department of Neurosurgery, Xiangya Hospital, Central South University, Changsha, China
| | - Peng Luo
- Department of Oncology, Zhujiang Hospital, Southern Medical University, Guangzhou, China
| | - Kexin Chen
- Key Laboratory of Molecular Cancer Epidemiology of Tianjin, Department of Epidemiology and Biostatistics, National Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin 300060, China
| | - Zaoqu Liu
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China
- State Key Laboratory of Medical Molecular Biology, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences, Department of Pathophysiology, Peking Union Medical College, Beijing, 100730, China
| | - Xinwei Han
- Department of Interventional Radiology, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, Henan 450052, China
| |
Collapse
|
4
|
Song E. Persistent homology analysis of type 2 diabetes genome-wide association studies in protein-protein interaction networks. Front Genet 2023; 14:1270185. [PMID: 37823029 PMCID: PMC10562725 DOI: 10.3389/fgene.2023.1270185] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2023] [Accepted: 09/12/2023] [Indexed: 10/13/2023] Open
Abstract
Genome-wide association studies (GWAS) involving increasing sample sizes have identified hundreds of genetic variants associated with complex diseases, such as type 2 diabetes (T2D); however, it is unclear how GWAS hits form unique topological structures in protein-protein interaction (PPI) networks. Using persistent homology, this study explores the evolution and persistence of the topological features of T2D GWAS hits in the PPI network with increasing p-value thresholds. We define an n-dimensional persistent disease module as a higher-order generalization of the largest connected component (LCC). The 0-dimensional persistent T2D disease module is the LCC of the T2D GWAS hits, which is significantly detected in the PPI network (196 nodes and 235 edges, P< 0.05). In the 1-dimensional homology group analysis, all 18 1-dimensional holes (loops) of the T2D GWAS hits persist over all p-value thresholds. The 1-dimensional persistent T2D disease module comprising these 18 persistent 1-dimensional holes is significantly larger than that expected by chance (59 nodes and 83 edges, P< 0.001), indicating a significant topological structure in the PPI network. Our computational topology framework potentially possesses broad applicability to other complex phenotypes in identifying topological features that play an important role in disease pathobiology.
Collapse
Affiliation(s)
- Euijun Song
- Yonsei University College of Medicine, Seoul, Republic of Korea
- Present: Independent Researcher, Gyeonggi, Republic of Korea
| |
Collapse
|
5
|
He Z, Gao K, Dong L, Liu L, Qu X, Zou Z, Wu Y, Bu D, Guo JC, Zhao Y. Drug screening and biomarker gene investigation in cancer therapy through the human transcriptional regulatory network. Comput Struct Biotechnol J 2023; 21:1557-1572. [PMID: 36879883 PMCID: PMC9984461 DOI: 10.1016/j.csbj.2023.02.005] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2022] [Revised: 01/19/2023] [Accepted: 02/04/2023] [Indexed: 02/10/2023] Open
Abstract
A complex and vast biological network regulates all biological functions in the human body in a sophisticated manner, and abnormalities in this network can lead to disease and even cancer. The construction of a high-quality human molecular interaction network is possible with the development of experimental techniques that facilitate the interpretation of the mechanisms of drug treatment for cancer. We collected 11 molecular interaction databases based on experimental sources and constructed a human protein-protein interaction (PPI) network and a human transcriptional regulatory network (HTRN). A random walk-based graph embedding method was used to calculate the diffusion profiles of drugs and cancers, and a pipeline was constructed by using five similarity comparison metrics combined with a rank aggregation algorithm, which can be implemented for drug screening and biomarker gene prediction. Taking NSCLC as an example, curcumin was identified as a potentially promising anticancer drug from 5450 natural small molecules, and combined with differentially expressed genes, survival analysis, and topological ranking, we obtained BIRC5 (survivin), which is both a biomarker for NSCLC and a key target for curcumin. Finally, the binding mode of curcumin and survivin was explored using molecular docking. This work has a guiding significance for antitumor drug screening and the identification of tumor markers.
Collapse
Affiliation(s)
- Zihao He
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing 100029, China
| | - Kai Gao
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing 100029, China
| | - Lei Dong
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing 100029, China
| | - Liu Liu
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing 100029, China
| | - Xinchi Qu
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing 100029, China
| | - Zhengkai Zou
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing 100029, China
| | - Yang Wu
- Research Center for Ubiquitous Computing Systems, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China
| | - Dechao Bu
- Research Center for Ubiquitous Computing Systems, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China
| | - Jin-Cheng Guo
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing 100029, China
| | - Yi Zhao
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing 100029, China.,Research Center for Ubiquitous Computing Systems, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China
| |
Collapse
|
6
|
Zhang M, Qiao J, Zhang S, Zeng P. Exploring the association between birthweight and breast cancer using summary statistics from a perspective of genetic correlation, mediation, and causality. J Transl Med 2022; 20:227. [PMID: 35568861 PMCID: PMC9107660 DOI: 10.1186/s12967-022-03435-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2021] [Accepted: 04/04/2022] [Indexed: 12/24/2022] Open
Abstract
BACKGROUND Previous studies demonstrated a positive relationship between birthweight and breast cancer; however, inconsistent, sometimes even controversial, observations also emerged, and the nature of such relationship remains unknown. METHODS Using summary statistics of birthweight and breast cancer, we assessed the fetal/maternal-specific genetic correlation between them via LDSC and prioritized fetal/maternal-specific pleiotropic genes through MAIUP. Relying on summary statistics we conducted Mendelian randomization (MR) to evaluate the fetal/maternal-specific origin of causal relationship between birthweight, age of menarche, age at menopause and breast cancer. RESULTS With summary statistics we identified a positive genetic correlation between fetal-specific birthweight and breast cancer (rg = 0.123 and P = 0.013) as well as a negative but insignificant correlation between maternal-specific birthweight and breast cancer (rg = - 0.068, P = 0.206); and detected 84 pleiotropic genes shared by fetal-specific birthweight and breast cancer, 49 shared by maternal-specific birthweight and breast cancer. We also revealed fetal-specific birthweight indirectly influenced breast cancer risk in adulthood via the path of age of menarche or age at menopause in terms of MR-based mediation analysis. CONCLUSION This study reveals that shared genetic foundation and causal mediation commonly drive the connection between the two traits, and that fetal/maternal-specific birthweight plays substantially distinct roles in such relationship. However, our work offers little supportive evidence for the fetal origins hypothesis of breast cancer originating in utero.
Collapse
Affiliation(s)
- Meng Zhang
- Department of Biostatistics, School of Public Health, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China
| | - Jiahao Qiao
- Department of Biostatistics, School of Public Health, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China
| | - Shuo Zhang
- Department of Biostatistics, School of Public Health, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China
| | - Ping Zeng
- Department of Biostatistics, School of Public Health, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China. .,Center for Medical Statistics and Data Analysis, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China. .,Key Laboratory of Human Genetics and Environmental Medicine, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China. .,Key Laboratory of Environment and Health, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China.
| |
Collapse
|