1
|
Liu Y, Zhang P, Che C, Wei Z. SDDSynergy: Learning Important Molecular Substructures for Explainable Anticancer Drug Synergy Prediction. J Chem Inf Model 2024. [PMID: 38687366 DOI: 10.1021/acs.jcim.4c00177] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/02/2024]
Abstract
Drug combination therapies are well-established strategies for the treatment of cancer with low toxicity and fewer adverse effects. Computational drug synergy prediction approaches can accelerate the discovery of novel combination therapies, but the existing methods do not explicitly consider the key role of important substructures in producing synergistic effects. To this end, we propose a significant substructure-aware anticancer drug synergy prediction method, named SDDSynergy, to adaptively identify critical functional groups in drug synergy. SDDSynergy splits the task of predicting drug synergy into predicting the effect of individual substructures on cancer cell lines and highlights the impact of important substructures through a novel drug-cell line attention mechanism. And a substructure pair attention mechanism is incorporated to capture the information on internal substructure pairs interaction in drug combinations, which aids in predicting synergy. The substructures of different sizes and shapes are directly obtained from the molecular graph of the drugs by multilayer substructure information passing networks. Extensive experiments on three real-world data sets demonstrate that SDDSynergy outperforms other state-of-the-art methods. We also verify that many of the novel drug combinations predicted by SDDSynergy are supported by previous studies or clinical trials through an in-depth literature survey.
Collapse
Affiliation(s)
- Yunjiong Liu
- Key Laboratory of Advanced Design and Intelligent Computing, Ministry of Education, Dalian University, Dalian 116622, China
- School of Software Engineering, Dalian University, Dalian 116622, China
| | - Peiliang Zhang
- School of Computer Science and Artificial Intelligence, Wuhan University of Technology, Wuhan 430070, China
| | - Chao Che
- Key Laboratory of Advanced Design and Intelligent Computing, Ministry of Education, Dalian University, Dalian 116622, China
- School of Software Engineering, Dalian University, Dalian 116622, China
| | - Ziqi Wei
- CAS Key Laboratory of Molecular Imaging, Institute of Automation, Chinese Academy of Sciences, Beijing 100864, China
| |
Collapse
|
2
|
Liu J, Yang M, Yu Y, Xu H, Li K, Zhou X. Large language models in bioinformatics: applications and perspectives. ARXIV 2024:arXiv:2401.04155v1. [PMID: 38259343 PMCID: PMC10802675] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 01/24/2024]
Abstract
Large language models (LLMs) are a class of artificial intelligence models based on deep learning, which have great performance in various tasks, especially in natural language processing (NLP). Large language models typically consist of artificial neural networks with numerous parameters, trained on large amounts of unlabeled input using self-supervised or semi-supervised learning. However, their potential for solving bioinformatics problems may even exceed their proficiency in modeling human language. In this review, we will present a summary of the prominent large language models used in natural language processing, such as BERT and GPT, and focus on exploring the applications of large language models at different omics levels in bioinformatics, mainly including applications of large language models in genomics, transcriptomics, proteomics, drug discovery and single cell analysis. Finally, this review summarizes the potential and prospects of large language models in solving bioinformatic problems.
Collapse
Affiliation(s)
- Jiajia Liu
- Center for Computational Systems Medicine, McWilliams School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, Texas, 77030, USA
| | - Mengyuan Yang
- School of Life Sciences, Zhengzhou University, Zhengzhou, Henan 450001, China
| | - Yankai Yu
- School of Computing and Artificial Intelligence, Southwest Jiaotong University, Chengdu, Sichuan 611756, China
| | - Haixia Xu
- The Center of Gerontology and Geriatrics, West China Hospital, Sichuan University, Chengdu, Sichuan 610041, China
- West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, Sichuan 610041, China
| | - Kang Li
- West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, Sichuan 610041, China
| | - Xiaobo Zhou
- Center for Computational Systems Medicine, McWilliams School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, Texas, 77030, USA
- McGovern Medical School, The University of Texas Health Science Center at Houston, Houston, TX 77030, USA
- School of Dentistry, The University of Texas Health Science Center at Houston, Houston, TX 77030, USA
| |
Collapse
|
3
|
Warner JL, Patt D. Cancer Informatics 2023: Data Sharing and Federating Learning Point Towards New Collaborative Opportunities. Yearb Med Inform 2023; 32:111-114. [PMID: 38147854 PMCID: PMC10751122 DOI: 10.1055/s-0043-1768744] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2023] Open
Abstract
OBJECTIVE To summarize significant research contributions on cancer informatics published in 2022. METHODS An extensive search using PubMed/MEDLINE was conducted to identify the scientific contributions published in 2022 that address topics in cancer informatics. The selection process comprised three steps: (i) ten candidate best papers were first selected by the two section editors, (ii) external reviewers from internationally renowned research teams reviewed each candidate best paper, and (iii) the final selection of three best papers was conducted by the editorial board of the Yearbook. RESULTS The three selected best papers demonstrate advances in federated learning, drug synergy prediction, and utilization of clinical note data. CONCLUSION Cancer informatics continues to mature as a subfield of biomedical informatics. Applications of informatics methods to data sharing and federated approaches are especially notable in 2022.
Collapse
Affiliation(s)
- Jeremy L. Warner
- Professor, Department of Medicine, Brown University, Providence, RI, USA
- Director, Center for Clinical Cancer Informatics and Data Science, Legorreta Cancer Center, Brown University, Providence, RI, USA
| | - Debra Patt
- Vice President, Texas Oncology, Austin, TX, USA
| | | |
Collapse
|