1
|
Tunc H, Yilmaz S, Darendeli Kiraz BN, Sari M, Kotil SE, Sensoy O, Durdagi S. Improving Predictive Efficacy for Drug Resistance in Novel HIV-1 Protease Inhibitors through Transfer Learning Mechanisms. J Chem Inf Model 2024; 64:7844-7863. [PMID: 39393002 DOI: 10.1021/acs.jcim.4c01037] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/13/2024]
Abstract
The human immunodeficiency virus presents a significant global health challenge due to its rapid mutation and the development of resistance mechanisms against antiretroviral drugs. Recent studies demonstrate the impressive performance of machine learning (ML) and deep learning (DL) models in predicting the drug resistance profile of specific FDA-approved inhibitors. However, generalizing ML and DL models to learn not only from isolates but also from inhibitor representations remains challenging for HIV-1 infection. We propose a novel drug-isolate-fold change (DIF) model framework that aims to predict drug resistance score directly from the protein sequence and inhibitor representation. Various ML and DL models, inhibitor representations, and protein representations were analyzed through realistic validation mechanisms. To enhance the molecular learning capacity of DIF models, we employ a transfer learning approach by pretraining a graph neural network (GNN) model for activity prediction on a data set of 4855 HIV-1 protease inhibitors (PIs). By performing various realistic validation strategies on internal and external genotype-phenotype data sets, we statistically show that the learned representations of inhibitors improve the predictive ability of DIF-based ML and DL models. We achieved an accuracy of 0.802, AUROC of 0.874, and r of 0.727 for the unseen external PIs. By comparing the DIF-based models with a null model consisting of isolate-fold change (IF) architecture, it is observed that the DIF models significantly benefit from molecular representations. Combined results from various testing strategies and statistical tests confirm the effectiveness of DIF models in testing novel PIs for drug resistance in the presence of an isolate.
Collapse
Affiliation(s)
- Huseyin Tunc
- Department of Biostatistics and Medical Informatics, School of Medicine, Bahcesehir University, Istanbul 34734, Turkey
| | - Sumeyye Yilmaz
- Department of Mathematical Engineering, Istanbul Technical University, Istanbul 34469, Turkey
| | | | - Murat Sari
- Department of Mathematical Engineering, Istanbul Technical University, Istanbul 34469, Turkey
- Istanbul Technical University TRNC Campus, Mersin 10, Famagusta 99450, Turkey
| | - Seyfullah Enes Kotil
- Department of Molecular Biology and Genetics, Bogazici University, Istanbul 34342, Turkey
| | - Ozge Sensoy
- Graduate School of Engineering and Natural Sciences, Istanbul Medipol University, Istanbul 34810, Turkey
- Regenerative and Restorative Medicine Research Center (REMER), Research Institute for Health Sciences and Technologies (SABITA), Istanbul Medipol University, Istanbul 34810, Turkey
| | - Serdar Durdagi
- Molecular Therapy Lab, Department of Pharmaceutical Chemistry, School of Pharmacy, Bahcesehir University, Istanbul 34353, Turkey
- Lab for Innovative Drugs (Lab4IND), Computational Drug Design Center (HITMER), Istanbul 34734, Turkey
| |
Collapse
|
2
|
Wei Z, Zhao C, Zhang M, Xu J, Xu N, Wu S, Xin X, Yu L, Feng W. PrCRS: a prediction model of severe CRS in CAR-T therapy based on transfer learning. BMC Bioinformatics 2024; 25:197. [PMID: 38769505 PMCID: PMC11106915 DOI: 10.1186/s12859-024-05804-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2024] [Accepted: 05/08/2024] [Indexed: 05/22/2024] Open
Abstract
BACKGROUND CAR-T cell therapy represents a novel approach for the treatment of hematologic malignancies and solid tumors. However, its implementation is accompanied by the emergence of potentially life-threatening adverse events known as cytokine release syndrome (CRS). Given the escalating number of patients undergoing CAR-T therapy, there is an urgent need to develop predictive models for severe CRS occurrence to prevent it in advance. Currently, all existing models are based on decision trees whose accuracy is far from meeting our expectations, and there is a lack of deep learning models to predict the occurrence of severe CRS more accurately. RESULTS We propose PrCRS, a deep learning prediction model based on U-net and Transformer. Given the limited data available for CAR-T patients, we employ transfer learning using data from COVID-19 patients. The comprehensive evaluation demonstrates the superiority of the PrCRS model over other state-of-the-art methods for predicting CRS occurrence. We propose six models to forecast the probability of severe CRS for patients with one, two, and three days in advance. Additionally, we present a strategy to convert the model's output into actual probabilities of severe CRS and provide corresponding predictions. CONCLUSIONS Based on our findings, PrCRS effectively predicts both the likelihood and timing of severe CRS in patients, thereby facilitating expedited and precise patient assessment, thus making a significant contribution to medical research. There is little research on applying deep learning algorithms to predict CRS, and our study fills this gap. This makes our research more novel and significant. Our code is publicly available at https://github.com/wzy38828201/PrCRS . The website of our prediction platform is: http://prediction.unicar-therapy.com/index-en.html .
Collapse
Affiliation(s)
- Zhenyu Wei
- Intelligent Systems Science and Engineering College, Harbin Engineering University, Liaoyuan Street, Harbin, 150006, Heilongjiang Province, People's Republic of China.
| | - Chengkui Zhao
- Intelligent Systems Science and Engineering College, Harbin Engineering University, Liaoyuan Street, Harbin, 150006, Heilongjiang Province, People's Republic of China
- Shanghai Unicar-Therapy BioMedicine Technology Co., Ltd, Shanghai, China
| | - Min Zhang
- Intelligent Systems Science and Engineering College, Harbin Engineering University, Liaoyuan Street, Harbin, 150006, Heilongjiang Province, People's Republic of China
| | - Jiayu Xu
- Intelligent Systems Science and Engineering College, Harbin Engineering University, Liaoyuan Street, Harbin, 150006, Heilongjiang Province, People's Republic of China
| | - Nan Xu
- School of Chemical and Molecular Engineering, East China Normal University, Zhongshan North Street, Shanghai, 200000, People's Republic of China
- Shanghai Unicar-Therapy BioMedicine Technology Co., Ltd, Shanghai, China
| | - Shiwei Wu
- Intelligent Systems Science and Engineering College, Harbin Engineering University, Liaoyuan Street, Harbin, 150006, Heilongjiang Province, People's Republic of China
| | - Xiaohui Xin
- Intelligent Systems Science and Engineering College, Harbin Engineering University, Liaoyuan Street, Harbin, 150006, Heilongjiang Province, People's Republic of China
| | - Lei Yu
- School of Chemical and Molecular Engineering, East China Normal University, Zhongshan North Street, Shanghai, 200000, People's Republic of China.
- Shanghai Unicar-Therapy BioMedicine Technology Co., Ltd, Shanghai, China.
| | - Weixing Feng
- Intelligent Systems Science and Engineering College, Harbin Engineering University, Liaoyuan Street, Harbin, 150006, Heilongjiang Province, People's Republic of China.
| |
Collapse
|
3
|
Qiu X, Wang H, Tan X, Fang Z. G-K BertDTA: A graph representation learning and semantic embedding-based framework for drug-target affinity prediction. Comput Biol Med 2024; 173:108376. [PMID: 38552281 DOI: 10.1016/j.compbiomed.2024.108376] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2023] [Revised: 03/21/2024] [Accepted: 03/24/2024] [Indexed: 04/17/2024]
Abstract
Developing new drugs is costly, time-consuming, and risky. Drug-target affinity (DTA), indicating the binding capability between drugs and target proteins, is a crucial indicator for drug development. Accurately predicting interaction strength between new drug-target pairs by analyzing previous experiments aids in screening potential drug molecules, repurposing them, and developing safe and effective medicines. Existing computational models for DTA prediction rely on strings or single-graph neural networks, lacking consideration of protein structure and molecular semantic information, leading to limited accuracy. Our experiments demonstrate that string-based methods may overlook protein conformations, causing a high root mean square error (RMSE) of 3.584 in affinity due to a lack of spatial context. Single graph networks also underperform on topology features, with a 6% lower confidence interval (CI) for activity classification. Absent semantic information also limits generalization across diverse compounds, resulting in 18% increment in RMSE and 5% in misclassifications within quantifications study, restricting potential drug discovery. To address these limitations, we propose G-K BertDTA, a novel framework for accurate DTA prediction incorporating protein features, molecular semantic features, and molecular structural information. In this proposed model, we represent drugs as graphs, with a GIN employed to learn the molecular topological information. For the extraction of protein structural features, we utilize a DenseNet architecture. A knowledge-based BERT semantic model is incorporated to obtain rich pre-trained semantic embeddings, thereby enhancing the feature information. We extensively evaluated our proposed approach on the publicly available benchmark datasets (i.e., KIBA and Davis), and experimental results demonstrate the promising performance of our method, which consistently outperforms previous state-of-the-art approaches. Code is available at https://github.com/AmbitYuki/G-K-BertDTA.
Collapse
Affiliation(s)
- Xihe Qiu
- School of Electronic and Electrical Engineering, Shanghai University of Engineering Science, Shanghai, China
| | - Haoyu Wang
- School of Electronic and Electrical Engineering, Shanghai University of Engineering Science, Shanghai, China
| | - Xiaoyu Tan
- INF Technology (Shanghai) Co., Ltd., Shanghai, China
| | - Zhijun Fang
- School of Computer Science and Technology, Donghua University, Shanghai, China.
| |
Collapse
|