1
|
Shahzad M, Tahir MA, Alhussein M, Mobin A, Shams Malick RA, Anwar MS. NeuPD-A Neural Network-Based Approach to Predict Antineoplastic Drug Response. Diagnostics (Basel) 2023; 13:2043. [PMID: 37370938 DOI: 10.3390/diagnostics13122043] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Revised: 06/01/2023] [Accepted: 06/05/2023] [Indexed: 06/29/2023] Open
Abstract
With the beginning of the high-throughput screening, in silico-based drug response analysis has opened lots of research avenues in the field of personalized medicine. For a decade, many different predicting techniques have been recommended for the antineoplastic (anti-cancer) drug response, but still, there is a need for improvements in drug sensitivity prediction. The intent of this research study is to propose a framework, namely NeuPD, to validate the potential anti-cancer drugs against a panel of cancer cell lines in publicly available datasets. The datasets used in this work are Genomics of Drug Sensitivity in Cancer (GDSC) and Cancer Cell Line Encyclopedia (CCLE). As not all drugs are effective on cancer cell lines, we have worked on 10 essential drugs from the GDSC dataset that have achieved the best modeling results in previous studies. We also extracted 1610 essential oncogene expressions from 983 cell lines from the same dataset. Whereas, from the CCLE dataset, 16,383 gene expressions from 1037 cell lines and 24 drugs have been used in our experiments. For dimensionality reduction, Pearson correlation is applied to best fit the model. We integrate the genomic features of cell lines and drugs' fingerprints to fit the neural network model. For evaluation of the proposed NeuPD framework, we have used repeated K-fold cross-validation with 5 times repeats where K = 10 to demonstrate the performance in terms of root mean square error (RMSE) and coefficient determination (R2). The results obtained on the GDSC dataset that were measured using these cost functions show that our proposed NeuPD framework has outperformed existing approaches with an RMSE of 0.490 and R2 of 0.929.
Collapse
Affiliation(s)
- Muhammad Shahzad
- FAST School of Computing, National University of Computer and Emerging Sciences (NUCES-FAST), Karachi 75030, Pakistan
| | - Muhammad Atif Tahir
- FAST School of Computing, National University of Computer and Emerging Sciences (NUCES-FAST), Karachi 75030, Pakistan
| | - Musaed Alhussein
- Department of Computer Engineering, College of Computer and Information Sciences, King Saud University, P.O. Box 51178, Riyadh 11543, Saudi Arabia
| | - Ansharah Mobin
- FAST School of Computing, National University of Computer and Emerging Sciences (NUCES-FAST), Karachi 75030, Pakistan
| | - Rauf Ahmed Shams Malick
- FAST School of Computing, National University of Computer and Emerging Sciences (NUCES-FAST), Karachi 75030, Pakistan
| | - Muhammad Shahid Anwar
- Department of AI and Software, Gachon University, Seongnam-si 13120, Republic of Korea
| |
Collapse
|
2
|
Chu T, Nguyen TT, Hai BD, Nguyen QH, Nguyen T. Graph Transformer for Drug Response Prediction. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023; 20:1065-1072. [PMID: 36107906 DOI: 10.1109/tcbb.2022.3206888] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
BACKGROUND Previous models have shown that learning drug features from their graph representation is more efficient than learning from their strings or numeric representations. Furthermore, integrating multi-omics data of cell lines increases the performance of drug response prediction. However, these models have shown drawbacks in extracting drug features from graph representation and incorporating redundancy information from multi-omics data. This paper proposes a deep learning model, GraTransDRP, to better drug representation and reduce information redundancy. First, the Graph transformer was utilized to extract the drug representation more efficiently. Next, Convolutional neural networks were used to learn the mutation, meth, and transcriptomics features. However, the dimension of transcriptomics features was up to 17737. Therefore, KernelPCA was applied to transcriptomics features to reduce the dimension and transform them into a dense presentation before putting them through the CNN model. Finally, drug and omics features were combined to predict a response value by a fully connected network. Experimental results show that our model outperforms some state-of-the-art methods, including GraphDRP and GraOmicDRP.
Collapse
|
3
|
Puerta A, González-Bakker A, Santos G, Padrón JM. Early Pharmacological Profiling of Antiproliferative Compounds by Live Cell Imaging. Molecules 2022; 27:molecules27165261. [PMID: 36014500 PMCID: PMC9415461 DOI: 10.3390/molecules27165261] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2022] [Revised: 08/11/2022] [Accepted: 08/16/2022] [Indexed: 12/04/2022] Open
Abstract
Natural products represent an excellent source of unprecedented anticancer compounds. However, the identification of the mechanism of action remains a major challenge. Several techniques and methodologies have been considered, but with limited success. In this work, we explored the combination of live cell imaging and machine learning techniques as a promising tool to depict in a fast and affordable test the mode of action of natural compounds with antiproliferative activity. To develop the model, we selected the non-small cell lung cancer cell line SW1573, which was exposed to the known antimitotic drugs paclitaxel, colchicine and vinblastine. The novelty of our methodology focuses on two main features with the highest relevance, (a) meaningful phenotypic metrics, and (b) fast Fourier transform (FFT) of the time series of the phenotypic parameters into their corresponding amplitudes and phases. The resulting algorithm was able to cluster the microtubule disruptors, and meanwhile showed a negative correlation between paclitaxel and the other treatments. The FFT approach was able to group the samples as efficiently as checking by eye. This methodology could easily scale to group a large amount of data without visual supervision.
Collapse
Affiliation(s)
- Adrián Puerta
- BioLab, Instituto Universitario de Bio-Orgánica “Antonio González”, Universidad de La Laguna, Avenida Astrofísico Francisco Sánchez 2, 38206 La Laguna, Spain
| | - Aday González-Bakker
- BioLab, Instituto Universitario de Bio-Orgánica “Antonio González”, Universidad de La Laguna, Avenida Astrofísico Francisco Sánchez 2, 38206 La Laguna, Spain
| | - Guido Santos
- Departament of Biochemistry, Microbiology, Cell Biology and Genetics, Faculty of Sciences, Universidad de La Laguna, Avenida Astrofísico Francisco Sánchez s/n, 38206 La Laguna, Spain
| | - José M. Padrón
- BioLab, Instituto Universitario de Bio-Orgánica “Antonio González”, Universidad de La Laguna, Avenida Astrofísico Francisco Sánchez 2, 38206 La Laguna, Spain
- Correspondence: ; Tel.: +34-922-316-502 (ext. 6126)
| |
Collapse
|
4
|
Leveraging Deep Learning Techniques and Integrated Omics Data for Tailored Treatment of Breast Cancer. J Pers Med 2022; 12:jpm12050674. [PMID: 35629097 PMCID: PMC9147748 DOI: 10.3390/jpm12050674] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2022] [Revised: 03/06/2022] [Accepted: 04/14/2022] [Indexed: 12/12/2022] Open
Abstract
Multiomics data of cancer patients and cell lines, in synergy with deep learning techniques, have aided in unravelling predictive problems related to cancer research and treatment. However, there is still room for improvement in the performance of the existing models based on the aforementioned combination. In this work, we propose two models that complement the treatment of breast cancer patients. First, we discuss our deep learning-based model for breast cancer subtype classification. Second, we propose DCNN-DR, a deep convolute.ion neural network-drug response method for predicting the effectiveness of drugs on in vitro and in vivo breast cancer datasets. Finally, we applied DCNN-DR for predicting effective drugs for the basal-like breast cancer subtype and validated the results with the information available in the literature. The models proposed use late integration methods and have fairly better predictive performance compared to the existing methods. We use the Pearson correlation coefficient and accuracy as the performance measures for the regression and classification models, respectively.
Collapse
|
5
|
Automatic identification of drug sensitivity of cancer cell with novel regression-based ensemble convolution neural network model. Soft comput 2022. [DOI: 10.1007/s00500-022-07098-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
|
6
|
Functional stratification of cancer drugs through integrated network similarity. NPJ Syst Biol Appl 2022; 8:11. [PMID: 35440787 PMCID: PMC9018743 DOI: 10.1038/s41540-022-00219-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2021] [Accepted: 01/21/2022] [Indexed: 11/30/2022] Open
Abstract
Drugs not only perturb their immediate protein targets but also modulate multiple signaling pathways. In this study, we explored networks modulated by several drugs across multiple cancer cell lines by integrating their targets with transcriptomic and phosphoproteomic data. As a result, we obtained 236 reconstructed networks covering five cell lines and 70 drugs. A rigorous topological and pathway analysis showed that chemically and functionally different drugs may modulate overlapping networks. Additionally, we revealed a set of tumor-specific hidden pathways with the help of drug network models that are not detectable from the initial data. The difference in the target selectivity of the drugs leads to disjoint networks despite sharing a similar mechanism of action, e.g., HDAC inhibitors. We also used the reconstructed network models to study potential drug combinations based on the topological separation and found literature evidence for a set of drug pairs. Overall, network-level exploration of drug-modulated pathways and their deep comparison may potentially help optimize treatment strategies and suggest new drug combinations.
Collapse
|
7
|
Nguyen GTT, Vu HD, Le DH. Integrating Molecular Graph Data of Drugs and Multiple -Omic Data of Cell Lines for Drug Response Prediction. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:710-717. [PMID: 34260355 DOI: 10.1109/tcbb.2021.3096960] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Previous studies have either learned drug's features from their string or numeric representations, which are not natural forms of drugs, or only used genomic data of cell lines for the drug response prediction problem. Here, we proposed a deep learning model, GraOmicDRP, to learn drug's features from their graph representation and integrate multiple -omic data of cell lines. In GraOmicDRP, drugs are represented as graphs of bindings among atoms; meanwhile, cell lines are depicted by not only genomic but also transcriptomic and epigenomic data. Graph convolutional and convolutional neural networks were used to learn the representation of drugs and cell lines, respectively. A combination of the two representations was then used to be representative of each pair of drug-cell line. Finally, the response value of each pair was predicted by a fully connected network. Experimental results indicate that transcriptomic data shows the best among single -omic data; meanwhile, the combinations of transcriptomic and other -omic data achieved the best performance overall in terms of both Root Mean Square Error and Pearson correlation coefficient. In addition, we also show that GraOmicDRP outperforms some state-of-the-art methods, including ones integrating -omic data with drug information such as GraphDRP, and ones using -omic data without drug information such as DeepDR and MOLI.
Collapse
|
8
|
Strybol PP, Larmuseau M, de Schaetzen van Brienen L, Van den Bulcke T, Marchal K. Extracting functional insights from loss-of-function screens using deep link prediction. CELL REPORTS METHODS 2022; 2:100171. [PMID: 35474966 PMCID: PMC9017186 DOI: 10.1016/j.crmeth.2022.100171] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/21/2021] [Revised: 12/09/2021] [Accepted: 01/25/2022] [Indexed: 11/10/2022]
Abstract
We present deep link prediction (DLP), a method for the interpretation of loss-of-function screens. Our approach uses representation-based link prediction to reprioritize phenotypic readouts by integrating screening experiments with gene-gene interaction networks. We validate on 2 different loss-of-function technologies, RNAi and CRISPR, using datasets obtained from DepMap. Extensive benchmarking shows that DLP-DeepWalk outperforms other methods in recovering cell-specific dependencies, achieving an average precision well above 90% across 7 different cancer types and on both RNAi and CRISPR data. We show that the genes ranked highest by DLP-DeepWalk are appreciably more enriched in drug targets compared to the ranking based on original screening scores. Interestingly, this enrichment is more pronounced on RNAi data compared to CRISPR data, consistent with the greater inherent noise of RNAi screens. Finally, we demonstrate how DLP-DeepWalk can infer the molecular mechanism through which putative targets trigger cell line mortality.
Collapse
Affiliation(s)
- Pieter-Paul Strybol
- Department of Plant Biotechnology and Bioinformatics, Department of Information Technology, IDLab, imec, iGent Toren, 9000 Gent, Belgium
| | - Maarten Larmuseau
- Department of Plant Biotechnology and Bioinformatics, Department of Information Technology, IDLab, imec, iGent Toren, 9000 Gent, Belgium
| | - Louise de Schaetzen van Brienen
- Department of Plant Biotechnology and Bioinformatics, Department of Information Technology, IDLab, imec, iGent Toren, 9000 Gent, Belgium
| | | | - Kathleen Marchal
- Department of Plant Biotechnology and Bioinformatics, Department of Information Technology, IDLab, imec, iGent Toren, 9000 Gent, Belgium
| |
Collapse
|
9
|
Nguyen T, Nguyen GTT, Nguyen T, Le DH. Graph Convolutional Networks for Drug Response Prediction. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:146-154. [PMID: 33606633 DOI: 10.1109/tcbb.2021.3060430] [Citation(s) in RCA: 39] [Impact Index Per Article: 19.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
BACKGROUND Drug response prediction is an important problem in computational personalized medicine. Many machine-learning-based methods, especially deep learning-based ones, have been proposed for this task. However, these methods often represent the drugs as strings, which are not a natural way to depict molecules. Also, interpretation (e.g., what are the mutation or copy number aberration contributing to the drug response) has not been considered thoroughly. METHODS In this study, we propose a novel method, GraphDRP, based on graph convolutional network for the problem. In GraphDRP, drugs were represented in molecular graphs directly capturing the bonds among atoms, meanwhile cell lines were depicted as binary vectors of genomic aberrations. Representative features of drugs and cell lines were learned by convolution layers, then combined to represent for each drug-cell line pair. Finally, the response value of each drug-cell line pair was predicted by a fully-connected neural network. Four variants of graph convolutional networks were used for learning the features of drugs. RESULTS We found that GraphDRP outperforms tCNNS in all performance measures for all experiments. Also, through saliency maps of the resulting GraphDRP models, we discovered the contribution of the genomic aberrations to the responses. CONCLUSION Representing drugs as graphs can improve the performance of drug response prediction. Availability of data and materials: Data and source code can be downloaded athttps://github.com/hauldhut/GraphDRP.
Collapse
|
10
|
Qiu X, Zhong X, Zhang H. Applied Research on the Combination of Weighted Network and Supervised Learning in Acupoints Compatibility. JOURNAL OF HEALTHCARE ENGINEERING 2021; 2021:4699420. [PMID: 34745499 PMCID: PMC8570846 DOI: 10.1155/2021/4699420] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/23/2021] [Revised: 10/13/2021] [Accepted: 10/20/2021] [Indexed: 11/17/2022]
Abstract
To enhance the depth of excavation and promote the intelligence of acupoint compatibility, a method of constructing weighted network, which combines the attributes of acupoints and supervised learning, is proposed for link prediction. Medical cases of cervical spondylosis with acupuncture treatment are standardized, and a weighted network is constructed according to acupoint attributes. Multiple similarity features are extracted from the network and input into a supervised learning model for prediction. And, the performance of the algorithm is evaluated through evaluation indicators. The experiment finally screened 67 eligible medical cases, and the network model involved 141 acupoint nodes with 1048 edge. Except for the Preferential Attachment similarity index and the Decision Tree model, all other similarity indexes performed well in the model, among which the combination of PI index and Multilayer Perception model had the best prediction effect with an AUC value of 0.9351, confirming the feasibility of weighted networks combined with supervised learning for link prediction, also as a strong support for clinical point selection.
Collapse
Affiliation(s)
- Xia Qiu
- College of Medical Information Engineering, Guangzhou University of Chinese Medicine, Guangzhou 510006, China
| | - Xiaoying Zhong
- College of Medical Information Engineering, Guangzhou University of Chinese Medicine, Guangzhou 510006, China
| | - Honglai Zhang
- College of Medical Information Engineering, Guangzhou University of Chinese Medicine, Guangzhou 510006, China
| |
Collapse
|
11
|
Borisov N, Sergeeva A, Suntsova M, Raevskiy M, Gaifullin N, Mendeleeva L, Gudkov A, Nareiko M, Garazha A, Tkachev V, Li X, Sorokin M, Surin V, Buzdin A. Machine Learning Applicability for Classification of PAD/VCD Chemotherapy Response Using 53 Multiple Myeloma RNA Sequencing Profiles. Front Oncol 2021; 11:652063. [PMID: 33937058 PMCID: PMC8083158 DOI: 10.3389/fonc.2021.652063] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2021] [Accepted: 03/19/2021] [Indexed: 12/17/2022] Open
Abstract
Multiple myeloma (MM) affects ~500,000 people and results in ~100,000 deaths annually, being currently considered treatable but incurable. There are several MM chemotherapy treatment regimens, among which eleven include bortezomib, a proteasome-targeted drug. MM patients respond differently to bortezomib, and new prognostic biomarkers are needed to personalize treatments. However, there is a shortage of clinically annotated MM molecular data that could be used to establish novel molecular diagnostics. We report new RNA sequencing profiles for 53 MM patients annotated with responses on two similar chemotherapy regimens: bortezomib, doxorubicin, dexamethasone (PAD), and bortezomib, cyclophosphamide, dexamethasone (VCD), or with responses to their combinations. Fourteen patients received both PAD and VCD; six received only PAD, and 33 received only VCD. We compared profiles for the good and poor responders and found five genes commonly regulated here and in the previous datasets for other bortezomib regimens (all upregulated in the good responders): FGFR3, MAF, IGHA2, IGHV1-69, and GRB14. Four of these genes are linked with known immunoglobulin locus rearrangements. We then used five machine learning (ML) methods to build a classifier distinguishing good and poor responders for two cohorts: PAD + VCD (53 patients), and separately VCD (47 patients). We showed that the application of FloWPS dynamic data trimming was beneficial for all ML methods tested in both cohorts, and also in the previous MM bortezomib datasets. However, the ML models build for the different datasets did not allow cross-transferring, which can be due to different treatment regimens, experimental profiling methods, and MM heterogeneity.
Collapse
Affiliation(s)
- Nicolas Borisov
- Moscow Institute of Physics and Technology, Laboratory for Translational Genomic Bioinformatics, Dolgoprudny, Russia
| | - Anna Sergeeva
- National Research Center for Hematology, Ministry of Health of the Russian Federation, Moscow, Russia
| | - Maria Suntsova
- I.M. Sechenov First Moscow State Medical University, Institute of Personalized Medicine, Moscow, Russia
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Group for Genomic Analysis of Cell Signaling Systems, Moscow, Russia
| | - Mikhail Raevskiy
- Moscow Institute of Physics and Technology, Laboratory for Translational Genomic Bioinformatics, Dolgoprudny, Russia
| | - Nurshat Gaifullin
- Department of Pathology, Faculty of Medicine, Lomonosov Moscow State University, Moscow, Russia
| | - Larisa Mendeleeva
- National Research Center for Hematology, Ministry of Health of the Russian Federation, Moscow, Russia
| | - Alexander Gudkov
- I.M. Sechenov First Moscow State Medical University, Institute of Personalized Medicine, Moscow, Russia
| | - Maria Nareiko
- National Research Center for Hematology, Ministry of Health of the Russian Federation, Moscow, Russia
| | - Andrew Garazha
- Omicsway Corp., Research Department, Walnut, CA, United States
- Oncobox Ltd., Research Department, Moscow, Russia
| | - Victor Tkachev
- Omicsway Corp., Research Department, Walnut, CA, United States
- Oncobox Ltd., Research Department, Moscow, Russia
| | - Xinmin Li
- Department of Pathology and Laboratory Medicine, University of California Los Angeles, Los Angeles, CA, United States
| | - Maxim Sorokin
- I.M. Sechenov First Moscow State Medical University, Institute of Personalized Medicine, Moscow, Russia
- Omicsway Corp., Research Department, Walnut, CA, United States
- Oncobox Ltd., Research Department, Moscow, Russia
| | - Vadim Surin
- National Research Center for Hematology, Ministry of Health of the Russian Federation, Moscow, Russia
| | - Anton Buzdin
- I.M. Sechenov First Moscow State Medical University, Institute of Personalized Medicine, Moscow, Russia
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Group for Genomic Analysis of Cell Signaling Systems, Moscow, Russia
- Omicsway Corp., Research Department, Walnut, CA, United States
| |
Collapse
|
12
|
Liu Q, Hu Z, Jiang R, Zhou M. DeepCDR: a hybrid graph convolutional network for predicting cancer drug response. Bioinformatics 2021; 36:i911-i918. [PMID: 33381841 DOI: 10.1093/bioinformatics/btaa822] [Citation(s) in RCA: 84] [Impact Index Per Article: 28.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
MOTIVATION Accurate prediction of cancer drug response (CDR) is challenging due to the uncertainty of drug efficacy and heterogeneity of cancer patients. Strong evidences have implicated the high dependence of CDR on tumor genomic and transcriptomic profiles of individual patients. Precise identification of CDR is crucial in both guiding anti-cancer drug design and understanding cancer biology. RESULTS In this study, we present DeepCDR which integrates multi-omics profiles of cancer cells and explores intrinsic chemical structures of drugs for predicting CDR. Specifically, DeepCDR is a hybrid graph convolutional network consisting of a uniform graph convolutional network and multiple subnetworks. Unlike prior studies modeling hand-crafted features of drugs, DeepCDR automatically learns the latent representation of topological structures among atoms and bonds of drugs. Extensive experiments showed that DeepCDR outperformed state-of-the-art methods in both classification and regression settings under various data settings. We also evaluated the contribution of different types of omics profiles for assessing drug response. Furthermore, we provided an exploratory strategy for identifying potential cancer-associated genes concerning specific cancer types. Our results highlighted the predictive power of DeepCDR and its potential translational value in guiding disease-specific drug design. AVAILABILITY AND IMPLEMENTATION DeepCDR is freely available at https://github.com/kimmo1019/DeepCDR. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Qiao Liu
- Ministry of Education Key Laboratory of Bioinformatics, Research Department of Bioinformatics, Beijing National Research Center, Information Science and Technology, Center for Synthetic and Systems Biology.,Department of Automation, Tsinghua University, Beijing 100084, China
| | - Zhiqiang Hu
- Department of Automation, Tsinghua University, Beijing 100084, China.,SenseTime Research, Shanghai 200233, China
| | - Rui Jiang
- Ministry of Education Key Laboratory of Bioinformatics, Research Department of Bioinformatics, Beijing National Research Center, Information Science and Technology, Center for Synthetic and Systems Biology.,Department of Automation, Tsinghua University, Beijing 100084, China
| | - Mu Zhou
- SenseBrain Research, San Jose, CA 95131, USA
| |
Collapse
|
13
|
In Silico Inference of Synthetic Cytotoxic Interactions from Paclitaxel Responses. Int J Mol Sci 2021; 22:ijms22031097. [PMID: 33499282 PMCID: PMC7865701 DOI: 10.3390/ijms22031097] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2020] [Revised: 01/18/2021] [Accepted: 01/19/2021] [Indexed: 11/16/2022] Open
Abstract
To exploit negatively interacting pairs of cancer somatic mutations in chemotherapy responses or synthetic cytotoxicity (SC), we systematically determined mutational pairs that had significantly lower paclitaxel half maximal inhibitory concentration (IC50) values. We evaluated 407 cell lines with somatic mutation profiles and estimated their copy number and drug-inhibitory concentrations in Genomics of Drug Sensitivity in Cancer (GDSC) database. The SC effect of 142 mutated gene pairs on response to paclitaxel was successfully cross-validated using human cancer datasets for urogenital cancers available in The Cancer Genome Atlas (TCGA) database. We further analyzed the cumulative effect of increasing SC pair numbers on the TP53 tumor suppressor gene. Patients with TCGA bladder and urogenital cancer exhibited improved cancer survival rates as the number of disrupted SC partners (i.e., SYNE2, SON, and/or PRY) of TP53 increased. The prognostic effect of SC burden on response to paclitaxel treatment could be differentiated from response to other cytotoxic drugs. Thus, the concept of pairwise SC may aid the identification of novel therapeutic and prognostic targets.
Collapse
|
14
|
Sharma A, Rani R. Ensembled machine learning framework for drug sensitivity prediction. IET Syst Biol 2020; 14:39-46. [PMID: 31931480 DOI: 10.1049/iet-syb.2018.5094] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Drug sensitivity prediction is one of the critical tasks involved in drug designing and discovery. Recently several online databases and consortiums have contributed to providing open access to pharmacogenomic data. These databases have helped in developing computational approaches for drug sensitivity prediction. Cancer is a complex disease involving the heterogeneous behaviour of same tumour-type patients towards the same kind of drug therapy. Several methods have been proposed in the literature to predict drug sensitivity. However, these methods are not efficient enough to predict drug sensitivity. The present study has proposed an ensemble learning framework for drug-response prediction using a modified rotation forest. The proposed framework is further compared with three state-of-the-art algorithms and two baseline methods using Genomics of Drug Sensitivity in Cancer (GDSC) and Cancer Cell Line Encyclopedia (CCLE) drug screens. The authors have also predicted missing drug response values in the data set using the proposed approach. The proposed approach outperforms other counterparts even though gene mutation data is not incorporated while designing the approach. An average mean square error of 3.14 and 0.404 is achieved using GDSC and CCLE drug screens, respectively. The obtained results show that the proposed framework has considerable potential to improve anti-cancer drug response prediction.
Collapse
|
15
|
Cancer gene expression profiles associated with clinical outcomes to chemotherapy treatments. BMC Med Genomics 2020; 13:111. [PMID: 32948183 PMCID: PMC7499993 DOI: 10.1186/s12920-020-00759-0] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2020] [Accepted: 07/27/2020] [Indexed: 12/18/2022] Open
Abstract
Background Machine learning (ML) methods still have limited applicability in personalized oncology due to low numbers of available clinically annotated molecular profiles. This doesn’t allow sufficient training of ML classifiers that could be used for improving molecular diagnostics. Methods We reviewed published datasets of high throughput gene expression profiles corresponding to cancer patients with known responses on chemotherapy treatments. We browsed Gene Expression Omnibus (GEO), The Cancer Genome Atlas (TCGA) and Tumor Alterations Relevant for GEnomics-driven Therapy (TARGET) repositories. Results We identified data collections suitable to build ML models for predicting responses on certain chemotherapeutic schemes. We identified 26 datasets, ranging from 41 till 508 cases per dataset. All the datasets identified were checked for ML applicability and robustness with leave-one-out cross validation. Twenty-three datasets were found suitable for using ML that had balanced numbers of treatment responder and non-responder cases. Conclusions We collected a database of gene expression profiles associated with clinical responses on chemotherapy for 2786 individual cancer cases. Among them seven datasets included RNA sequencing data (for 645 cases) and the others – microarray expression profiles. The cases represented breast cancer, lung cancer, low-grade glioma, endothelial carcinoma, multiple myeloma, adult leukemia, pediatric leukemia and kidney tumors. Chemotherapeutics included taxanes, bortezomib, vincristine, trastuzumab, letrozole, tipifarnib, temozolomide, busulfan and cyclophosphamide.
Collapse
|
16
|
Yu L, Zhou D, Gao L, Zha Y. Prediction of drug response in multilayer networks based on fusion of multiomics data. Methods 2020; 192:85-92. [PMID: 32798653 DOI: 10.1016/j.ymeth.2020.08.006] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2020] [Revised: 06/22/2020] [Accepted: 08/09/2020] [Indexed: 12/14/2022] Open
Abstract
Predicting the response of each individual patient to a drug is a key issue assailing personalized medicine. Our study predicted drug response based on the fusion of multiomics data with low-dimensional feature vector representation on a multilayer network model. We named this new method DREMO (Drug Response prEdiction based on MultiOmics data fusion). DREMO fuses similarities between cell lines and similarities between drugs, thereby improving the ability to predict the response of cancer cell lines to therapeutic agents. First, a multilayer similarity network related to cell lines and drugs was constructed based on gene expression profiles, somatic mutation, copy number variation (CNV), drug chemical structures, and drug targets. Next, low-dimensional feature vector representation was used to fuse the biological information in the multilayer network. Then, a machine learning model was applied to predict new drug-cell line associations. Finally, our results were validated using the well-established GDSC/CCLE databases, literature, and the functional pathway database. Furthermore, a comparison was made between DREMO and other methods. Results of the comparison showed that DREMO improves predictive capabilities significantly.
Collapse
Affiliation(s)
- Liang Yu
- School of Computer Science and Technology, Xidian University, Xi'an 710071, Shaanxi, China.
| | - Dandan Zhou
- School of Computer Science and Technology, Xidian University, Xi'an 710071, Shaanxi, China
| | - Lin Gao
- School of Computer Science and Technology, Xidian University, Xi'an 710071, Shaanxi, China
| | - Yunhong Zha
- Department of Neurology, Institute of Neural Regeneration and Repair, Three Gorges University College of Medicine, The First Hospital of Yichang, Yichang, China.
| |
Collapse
|
17
|
Wang W, Lv H, Zhao Y, Liu D, Wang Y, Zhang Y. DLS: A Link Prediction Method Based on Network Local Structure for Predicting Drug-Protein Interactions. Front Bioeng Biotechnol 2020; 8:330. [PMID: 32391341 PMCID: PMC7193019 DOI: 10.3389/fbioe.2020.00330] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2019] [Accepted: 03/25/2020] [Indexed: 12/22/2022] Open
Abstract
The studies on drug-protein interactions (DPIs) had significant for drug repositioning, drug discovery, and clinical medicine. The biochemical experimentation (in vitro) requires a long time and high cost to be confirmed because it is difficult to estimate. Therefore, a feasible solution is to predict DPIs efficiently with computers. We propose a link prediction method based on drug-protein interaction (DPI) local structural similarity (DLS) for predicting the DPIs. The DLS method combines link prediction and binary network structure to predict DPIs. The ten-fold cross-validation method was applied in the experiment. After comparing the predictive capability of DLS with the improved similarity-based network prediction method, the results of DLS on the test set are significantly better. Moreover, several candidate proteins were predicted for three approved drugs, namely captopril, desferrioxamine and losartan, and these predictions are further validated by the literature. In addition, the combination of the Common Neighborhood (CN) method and the DLS method provides a new idea for the integrated application of the link prediction method.
Collapse
Affiliation(s)
- Wei Wang
- Department of Computer Science and Technology, College of Computer and Information Engineering, Henan Normal University, Xinxiang, China.,Big Data Engineering Laboratory for Teaching Resources and Assessment of Education Quality, Xinxiang, China
| | - Hehe Lv
- Department of Computer Science and Technology, College of Computer and Information Engineering, Henan Normal University, Xinxiang, China
| | - Yuan Zhao
- Department of Computer Science and Technology, College of Computer and Information Engineering, Henan Normal University, Xinxiang, China
| | - Dong Liu
- Department of Computer Science and Technology, College of Computer and Information Engineering, Henan Normal University, Xinxiang, China
| | - Yongqing Wang
- Department of Computer Science and Technology, College of Computer and Information Engineering, Henan Normal University, Xinxiang, China.,Big Data Engineering Laboratory for Teaching Resources and Assessment of Education Quality, Xinxiang, China
| | - Yu Zhang
- Department of Computer Science and Technology, College of Computer and Information Engineering, Henan Normal University, Xinxiang, China
| |
Collapse
|
18
|
Gene regulatory network analysis with drug sensitivity reveals synergistic effects of combinatory chemotherapy in gastric cancer. Sci Rep 2020; 10:3932. [PMID: 32127608 PMCID: PMC7054272 DOI: 10.1038/s41598-020-61016-z] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2019] [Accepted: 02/19/2020] [Indexed: 12/14/2022] Open
Abstract
The combination of docetaxel, cisplatin, and fluorouracil (DCF) is highly synergistic in advanced gastric cancer. We aimed to explain these synergistic effects at the molecular level. Thus, we constructed a weighted correlation network using the differentially expressed genes between Stage I and IV gastric cancer based on The Cancer Genome Atlas (TCGA), and three modules were derived. Next, we investigated the correlation between the eigengene of the expression of the gene network modules and the chemotherapeutic drug response to DCF from the Genomics of Drug Sensitivity in Cancer (GDSC) database. The three modules were associated with functions related to cell migration, angiogenesis, and the immune response. The eigengenes of the three modules had a high correlation with DCF (−0.41, −0.40, and −0.15). The eigengenes of the three modules tended to increase as the stage increased. Advanced gastric cancer was affected by the interaction the among modules with three functions, namely cell migration, angiogenesis, and the immune response, all of which are related to metastasis. The weighted correlation network analysis model proved the complementary effects of DCF at the molecular level and thus, could be used as a unique methodology to determine the optimal combination of chemotherapy drugs for patients with gastric cancer.
Collapse
|
19
|
Turki T, Taguchi YH. SCGRNs: Novel supervised inference of single-cell gene regulatory networks of complex diseases. Comput Biol Med 2020; 118:103656. [PMID: 32174324 DOI: 10.1016/j.compbiomed.2020.103656] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2020] [Revised: 02/06/2020] [Accepted: 02/07/2020] [Indexed: 12/19/2022]
|
20
|
Tkachev V, Sorokin M, Borisov C, Garazha A, Buzdin A, Borisov N. Flexible Data Trimming Improves Performance of Global Machine Learning Methods in Omics-Based Personalized Oncology. Int J Mol Sci 2020; 21:ijms21030713. [PMID: 31979006 PMCID: PMC7037338 DOI: 10.3390/ijms21030713] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2019] [Revised: 01/16/2020] [Accepted: 01/17/2020] [Indexed: 12/21/2022] Open
Abstract
(1) Background: Machine learning (ML) methods are rarely used for an omics-based prescription of cancer drugs, due to shortage of case histories with clinical outcome supplemented by high-throughput molecular data. This causes overtraining and high vulnerability of most ML methods. Recently, we proposed a hybrid global-local approach to ML termed floating window projective separator (FloWPS) that avoids extrapolation in the feature space. Its core property is data trimming, i.e., sample-specific removal of irrelevant features. (2) Methods: Here, we applied FloWPS to seven popular ML methods, including linear SVM, k nearest neighbors (kNN), random forest (RF), Tikhonov (ridge) regression (RR), binomial naïve Bayes (BNB), adaptive boosting (ADA) and multi-layer perceptron (MLP). (3) Results: We performed computational experiments for 21 high throughput gene expression datasets (41–235 samples per dataset) totally representing 1778 cancer patients with known responses on chemotherapy treatments. FloWPS essentially improved the classifier quality for all global ML methods (SVM, RF, BNB, ADA, MLP), where the area under the receiver-operator curve (ROC AUC) for the treatment response classifiers increased from 0.61–0.88 range to 0.70–0.94. We tested FloWPS-empowered methods for overtraining by interrogating the importance of different features for different ML methods in the same model datasets. (4) Conclusions: We showed that FloWPS increases the correlation of feature importance between the different ML methods, which indicates its robustness to overtraining. For all the datasets tested, the best performance of FloWPS data trimming was observed for the BNB method, which can be valuable for further building of ML classifiers in personalized oncology.
Collapse
Affiliation(s)
- Victor Tkachev
- OmicsWayCorp, Walnut, CA 91788, USA; (V.T.); (M.S.); (A.G.)
| | - Maxim Sorokin
- OmicsWayCorp, Walnut, CA 91788, USA; (V.T.); (M.S.); (A.G.)
- Institute for Personailzed Medicine, I.M. Sechenov First Moscow State Medical University, 119991 Moscow, Russia
| | - Constantin Borisov
- National Research University—Higher School of Economics, 101000 Moscow, Russia;
| | - Andrew Garazha
- OmicsWayCorp, Walnut, CA 91788, USA; (V.T.); (M.S.); (A.G.)
| | - Anton Buzdin
- OmicsWayCorp, Walnut, CA 91788, USA; (V.T.); (M.S.); (A.G.)
- Institute for Personailzed Medicine, I.M. Sechenov First Moscow State Medical University, 119991 Moscow, Russia
- Moscow Institute of Physics and Technology, 141701 Moscow Oblast, Russia
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, 117997 Moscow, Russia
| | - Nicolas Borisov
- OmicsWayCorp, Walnut, CA 91788, USA; (V.T.); (M.S.); (A.G.)
- Institute for Personailzed Medicine, I.M. Sechenov First Moscow State Medical University, 119991 Moscow, Russia
- Moscow Institute of Physics and Technology, 141701 Moscow Oblast, Russia
- Correspondence: ; Tel.: +7-903-218-7261
| |
Collapse
|
21
|
Keikha MM, Rahgozar M, Asadpour M. DeepLink: A novel link prediction framework based on deep learning. J Inf Sci 2019. [DOI: 10.1177/0165551519891345] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Recently, link prediction has attracted more attention from various disciplines such as computer science, bioinformatics and economics. In link prediction, numerous information such as network topology, profile information and user-generated contents are considered to discover missing links between nodes. Whereas numerous previous researches had focused on the structural features of the networks for link prediction, recent studies have shown more interest in profile and content information, too. So, some of these researches combine structural and content information. However, some issues such as scalability and feature engineering need to be investigated to solve a few remaining problems. Moreover, most of the previous researches are presented only for undirected and unweighted networks. In this article, a novel link prediction framework named ‘DeepLink’ is presented, which is based on deep learning techniques. While deep learning has the advantage of extracting automatically the best features for link prediction, many other link prediction algorithms need manual feature engineering. Moreover, in the proposed framework, both structural and content information are employed. The framework is capable of using different structural feature vectors that are prepared by various link prediction methods. It learns all proximity orders that are presented on a network during the structural feature learning. We have evaluated the effectiveness of DeepLink on two real social network datasets, Telegram and irBlogs. On both datasets, the proposed framework outperforms several other structural and hybrid approaches for link prediction.
Collapse
Affiliation(s)
- Mohammad Mehdi Keikha
- University of Sistan and Baluchestan, Iran; Database Research Group, Control and Intelligent Processing Center of Excellence, School of Electrical and Computer Engineering, College of Engineering, University of Tehran, Iran
| | - Maseud Rahgozar
- Database Research Group, Control and Intelligent Processing Center of Excellence, School of Electrical and Computer Engineering, College of Engineering, University of Tehran, Iran
| | - Masoud Asadpour
- Database Research Group, Control and Intelligent Processing Center of Excellence, School of Electrical and Computer Engineering, College of Engineering, University of Tehran, Iran
| |
Collapse
|
22
|
Manica M, Oskooei A, Born J, Subramanian V, Sáez-Rodríguez J, Rodríguez Martínez M. Toward Explainable Anticancer Compound Sensitivity Prediction via Multimodal Attention-Based Convolutional Encoders. Mol Pharm 2019; 16:4797-4806. [DOI: 10.1021/acs.molpharmaceut.9b00520] [Citation(s) in RCA: 59] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Affiliation(s)
| | | | - Jannis Born
- IBM Research, 8803 Zürich, Switzerland
- ETH Zürich, 8092 Zürich, Switzerland
- University of Zürich, 8006 Zürich, Switzerland
| | | | | | | |
Collapse
|
23
|
Guan NN, Zhao Y, Wang CC, Li JQ, Chen X, Piao X. Anticancer Drug Response Prediction in Cell Lines Using Weighted Graph Regularized Matrix Factorization. MOLECULAR THERAPY. NUCLEIC ACIDS 2019; 17:164-174. [PMID: 31265947 PMCID: PMC6610642 DOI: 10.1016/j.omtn.2019.05.017] [Citation(s) in RCA: 48] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/25/2019] [Revised: 05/17/2019] [Accepted: 05/20/2019] [Indexed: 12/14/2022]
Abstract
Precision medicine has become a novel and rising concept, which depends much on the identification of individual genomic signatures for different patients. The cancer cell lines could reflect the “omic” diversity of primary tumors, based on which many works have been carried out to study the cancer biology and drug discovery both in experimental and computational aspects. In this work, we presented a novel method to utilize weighted graph regularized matrix factorization (WGRMF) for inferring anticancer drug response in cell lines. We constructed a p-nearest neighbor graph to sparsify drug similarity matrix and cell line similarity matrix, respectively. Using the sparsified matrices in the graph regularization terms, we performed matrix factorization to generate the latent matrices for drug and cell line. The graph regularization terms including neighbor information could help to exclude the noisy ingredient and improve the prediction accuracy. The 10-fold cross-validation was implemented, and the Pearson correlation coefficient (PCC), root-mean-square error (RMSE), PCCsr, and RMSEsr averaged over all drugs were calculated to evaluate the performance of WGRMF. The results on the Genomics of Drug Sensitivity in Cancer (GDSC) dataset are 0.64 ± 0.16, 1.37 ± 0.35, 0.73 ± 0.14, and 1.71 ± 0.44 for PCC, RMSE, PCCsr, and RMSEsr in turn. And for the Cancer Cell Line Encyclopedia (CCLE) dataset, WGRMF got results of 0.72 ± 0.09, 0.56 ± 0.19, 0.79 ± 0.07, and 0.69 ± 0.19, respectively. The results showed the superiority of WGRMF compared with previous methods. Besides, based on the prediction results using the GDSC dataset, three types of case studies were carried out. The results from both cross-validation and case studies have shown the effectiveness of WGRMF on the prediction of drug response in cell lines.
Collapse
Affiliation(s)
- Na-Na Guan
- College of Computer Science and Software Engineering, Shenzhen University, Shenzhen 518060, China
| | - Yan Zhao
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou 221116, China
| | - Chun-Chun Wang
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou 221116, China
| | - Jian-Qiang Li
- College of Computer Science and Software Engineering, Shenzhen University, Shenzhen 518060, China.
| | - Xing Chen
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou 221116, China.
| | - Xue Piao
- School of Medical Informatics, Xuzhou Medical University, Xuzhou 221004, China.
| |
Collapse
|
24
|
Abstract
In recent years, endless link prediction algorithms based on network representation learning have emerged. Network representation learning mainly constructs feature vectors by capturing the neighborhood structure information of network nodes for link prediction. However, this type of algorithm only focuses on learning topology information from the simple neighbor network node. For example, DeepWalk takes a random walk path as the neighborhood of nodes. In addition, such algorithms only take advantage of the potential features of nodes, but the explicit features of nodes play a good role in link prediction. In this paper, a link prediction method based on deep convolutional neural network is proposed. It constructs a model of the residual attention network to capture the link structure features from the sub-graph. Further study finds that the information flow transmission efficiency of the residual attention mechanism was not high, so a densely convolutional neural network model was proposed for link prediction. We evaluate our proposed method on four published data sets. The results show that our method is better than several other benchmark algorithms on link prediction.
Collapse
|
25
|
Singh H, Rana PS, Singh U. Prediction of drug synergy score using ensemble based differential evolution. IET Syst Biol 2019; 13:24-29. [PMID: 30774113 PMCID: PMC8687263 DOI: 10.1049/iet-syb.2018.5023] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2018] [Revised: 07/23/2018] [Accepted: 09/05/2018] [Indexed: 12/23/2022] Open
Abstract
Prediction of drug synergy score is an ill-posed problem. It plays an efficient role in the medical field for inhibiting specific cancer agents. An efficient regression-based machine learning technique has an ability to minimise the drug synergy prediction errors. Therefore, in this study, an efficient machine learning technique for drug synergy prediction technique is designed by using ensemble based differential evolution (DE) for optimising the support vector machine (SVM). Because the tuning of the attributes of SVM kernel regulates the prediction precision. The ensemble based DE employs two trial vector generation techniques and two control attributes settings. The initial generation technique has the best solution and the other is without the best solution. The proposed and existing competitive machine learning techniques are applied to drug synergy data. The extensive analysis demonstrates that the proposed technique outperforms others in terms of accuracy, root mean square error and coefficient of correlation.
Collapse
Affiliation(s)
- Harpreet Singh
- Computer Science and Engineering Department, Thapar Institute of Engineering and Technology, Patiala, Punjab, 147004, India.
| | - Prashant Singh Rana
- Computer Science and Engineering Department, Thapar Institute of Engineering and Technology, Patiala, Punjab, 147004, India
| | - Urvinder Singh
- Electronics & Communication Engineering Department, Thapar Institute of Engineering and Technology, Patiala, Punjab, 147004, India
| |
Collapse
|
26
|
Clinical intelligence: New machine learning techniques for predicting clinical drug response. Comput Biol Med 2019; 107:302-322. [PMID: 30771879 DOI: 10.1016/j.compbiomed.2018.12.017] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2018] [Revised: 12/29/2018] [Accepted: 12/31/2018] [Indexed: 12/13/2022]
Abstract
Predicting the response, or sensitivity, of a clinical drug to a specific cancer type is an important research problem. By predicting the clinical drug response correctly, clinicians are able to understand patient-to-patient differences in drug sensitivity outcomes, which in turn results in lesser time spent and lower cost associated with identifying effective drug candidates. Although technological advances in high-throughput drug screening in cells led to the generation of a substantial amount of relevant data, the analysis of such data would be a challenging task. There is a critical need for advanced machine learning (ML) algorithms to generate accurate predictions of clinical drug response. A major goal of this work is to provide advanced ML tools to data analysts, who would in turn build prediction calculators to be incorporated into intelligent clinical decision support systems. Such innovative tools could be used to enhance patient-care, among other uses. To achieve this goal, we develop new ML techniques, including a transfer learning approach coupled with or without a boosting technique. Experimental results on real clinical data pertaining to breast cancer, multiple myeloma, and triple-negative cancer patients demonstrate the effectiveness and superiority of the proposed approaches compared to baseline approaches, including existing transfer learning methods.
Collapse
|
27
|
Graim K, Friedl V, Houlahan KE, Stuart JM. PLATYPUS: A Multiple-View Learning Predictive Framework for Cancer Drug Sensitivity Prediction. PACIFIC SYMPOSIUM ON BIOCOMPUTING. PACIFIC SYMPOSIUM ON BIOCOMPUTING 2019; 24:136-147. [PMID: 30864317 PMCID: PMC6417802] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
Abstract
Cancer is a complex collection of diseases that are to some degree unique to each patient. Precision oncology aims to identify the best drug treatment regime using molecular data on tumor samples. While omics-level data is becoming more widely available for tumor specimens, the datasets upon which computational learning methods can be trained vary in coverage from sample to sample and from data type to data type. Methods that can 'connect the dots' to leverage more of the information provided by these studies could offer major advantages for maximizing predictive potential. We introduce a multi-view machinelearning strategy called PLATYPUS that builds 'views' from multiple data sources that are all used as features for predicting patient outcomes. We show that a learning strategy that finds agreement across the views on unlabeled data increases the performance of the learning methods over any single view. We illustrate the power of the approach by deriving signatures for drug sensitivity in a large cancer cell line database. Code and additional information are available from the PLATYPUS website https://sysbiowiki.soe.ucsc.edu/platypus.
Collapse
Affiliation(s)
| | - Verena Friedl
- Dept. of Biomolecular Engineering, University of California, Santa Cruz, CA 95064, USA
| | | | - Joshua M. Stuart
- Dept. of Biomolecular Engineering, University of California, Santa Cruz, CA 95064, USA
| |
Collapse
|
28
|
Boosting support vector machines for cancer discrimination tasks. Comput Biol Med 2018; 101:236-249. [DOI: 10.1016/j.compbiomed.2018.08.006] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2018] [Revised: 07/31/2018] [Accepted: 08/04/2018] [Indexed: 01/17/2023]
|
29
|
Tan M, Özgül OF, Bardak B, Ekşioğlu I, Sabuncuoğlu S. Drug response prediction by ensemble learning and drug-induced gene expression signatures. Genomics 2018; 111:1078-1088. [PMID: 31533900 DOI: 10.1016/j.ygeno.2018.07.002] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2018] [Revised: 06/12/2018] [Accepted: 07/03/2018] [Indexed: 12/14/2022]
Abstract
Chemotherapeutic response of cancer cells to a given compound is one of the most fundamental information one requires to design anti-cancer drugs. Recently, considerable amount of drug-induced gene expression data has become publicly available, in addition to cytotoxicity databases. These large sets of data provided an opportunity to apply machine learning methods to predict drug activity. However, due to the complexity of cancer drug mechanisms, none of the existing methods is perfect. In this paper, we propose a novel ensemble learning method to predict drug response. In addition, we attempt to use the drug screen data together with two novel signatures produced from the drug-induced gene expression profiles of cancer cell lines. Finally, we evaluate predictions by in vitro experiments in addition to the tests on data sets. The predictions of the methods, the signatures and the software are available from http://mtan.etu.edu.tr/drug-response-prediction/.
Collapse
Affiliation(s)
- Mehmet Tan
- Department of Computer Engineering, TOBB University of Economics and Technology, Ankara, Turkey.
| | - Ozan Fırat Özgül
- Department of Computer Engineering, TOBB University of Economics and Technology, Ankara, Turkey
| | - Batuhan Bardak
- Department of Computer Engineering, TOBB University of Economics and Technology, Ankara, Turkey
| | - Işıksu Ekşioğlu
- Department of Computer Engineering, TOBB University of Economics and Technology, Ankara, Turkey
| | - Suna Sabuncuoğlu
- Department of Toxicology, Faculty of Pharmacy, Hacettepe University, Ankara, Turkey
| |
Collapse
|
30
|
Zheng WJ, Ruan J, Xu H, Zhao Z, Liu Z. The International Conference on Intelligent Biology and Medicine (ICIBM) 2016: putting systems biology to work. BMC SYSTEMS BIOLOGY 2017; 11:88. [PMID: 28984194 PMCID: PMC5629615 DOI: 10.1186/s12918-017-0461-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Between December 8–10, 2016, the International Conference on Intelligent Biology and Medicine (ICIBM 2016) was held in Houston, Texas, USA. The conference included eight scientific sessions, four tutorials, one poster session, four highlighted talks and four keynotes that covered topics in 3D genome structure analysis and visualization, next generation sequencing analysis, computational drug discovery, medical informatics, cancer genomics and systems biology. Systems biology has been a main theme in ICIBM 2016, with exciting advances were presented in many areas of systems biology. Here, we selected seven high quality papers to be published in BMC Systems Biology.
Collapse
Affiliation(s)
- W Jim Zheng
- School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, TX, 77030, USA.
| | - Jianhua Ruan
- Department of Computer Science, The University of Texas at San Antonio, San Antonio, TX, 78249, USA
| | - Hua Xu
- School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, TX, 77030, USA
| | - Zhongming Zhao
- Center for Precision Health, School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, 77030, USA.,Human Genetics Center, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, 77030, USA
| | - Zhangdong Liu
- The Jan and Dan Duncan Neurological Research Institute, Baylor College of Medicine, Houston, TX, 77030, USA
| |
Collapse
|