1
|
Lee S, Sun M, Hu Y, Wang Y, Islam MN, Goerlitz D, Lucas PC, Lee AV, Swain SM, Tang G, Wang XS. iGenSig-Rx: an integral genomic signature based white-box tool for modeling cancer therapeutic responses using multi-omics data. BMC Bioinformatics 2024; 25:220. [PMID: 38898383 PMCID: PMC11186173 DOI: 10.1186/s12859-024-05835-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2024] [Accepted: 06/10/2024] [Indexed: 06/21/2024] Open
Abstract
Multi-omics sequencing is poised to revolutionize clinical care in the coming decade. However, there is a lack of effective and interpretable genome-wide modeling methods for the rational selection of patients for personalized interventions. To address this, we present iGenSig-Rx, an integral genomic signature-based approach, as a transparent tool for modeling therapeutic response using clinical trial datasets. This method adeptly addresses challenges related to cross-dataset modeling by capitalizing on high-dimensional redundant genomic features, analogous to reinforcing building pillars with redundant steel rods. Moreover, it integrates adaptive penalization of feature redundancy on a per-sample basis to prevent score flattening and mitigate overfitting. We then developed a purpose-built R package to implement this method for modeling clinical trial datasets. When applied to genomic datasets for HER2 targeted therapies, iGenSig-Rx model demonstrates consistent and reliable predictive power across four independent clinical trials. More importantly, the iGenSig-Rx model offers the level of transparency much needed for clinical application, allowing for clear explanations as to how the predictions are produced, how the features contribute to the prediction, and what are the key underlying pathways. We anticipate that iGenSig-Rx, as an interpretable class of multi-omics modeling methods, will find broad applications in big-data based precision oncology. The R package is available: https://github.com/wangxlab/iGenSig-Rx .
Collapse
Affiliation(s)
- Sanghoon Lee
- UPMC Hillman Cancer Center, University of Pittsburgh, Pittsburgh, PA, 15213, USA
- Department of Pathology, School of Medicine, University of Pittsburgh, Pittsburgh, PA, 15213, USA
- Department of Biomedical Informatics, School of Medicine, University of Pittsburgh, Pittsburgh, PA, 15206, USA
| | - Min Sun
- UPMC Hillman Cancer Center, University of Pittsburgh, Pittsburgh, PA, 15213, USA
- Department of Medicine, School of Medicine, University of Pittsburgh, Pittsburgh, PA, 15261, USA
| | - Yiheng Hu
- UPMC Hillman Cancer Center, University of Pittsburgh, Pittsburgh, PA, 15213, USA
- Department of Pathology, School of Medicine, University of Pittsburgh, Pittsburgh, PA, 15213, USA
| | - Yue Wang
- UPMC Hillman Cancer Center, University of Pittsburgh, Pittsburgh, PA, 15213, USA
- Department of Pathology, School of Medicine, University of Pittsburgh, Pittsburgh, PA, 15213, USA
| | - Md N Islam
- Genomics and Epigenomics Shared Resource (GESR), Georgetown University Medical Center, Washington, DC, 20057, USA
| | - David Goerlitz
- Lombardi Comprehensive Cancer Center, Georgetown University Medical Center, Washington, DC, 20057, USA
| | - Peter C Lucas
- UPMC Hillman Cancer Center, University of Pittsburgh, Pittsburgh, PA, 15213, USA
- Department of Pathology, School of Medicine, University of Pittsburgh, Pittsburgh, PA, 15213, USA
- National Surgical Adjuvant Breast and Bowel Project (NSABP), Pittsburgh, PA, 15213, USA
| | - Adrian V Lee
- UPMC Hillman Cancer Center, University of Pittsburgh, Pittsburgh, PA, 15213, USA
- Department of Pharmacology and Chemical Biology, University of Pittsburgh, Pittsburgh, PA, 15213, USA
| | - Sandra M Swain
- National Surgical Adjuvant Breast and Bowel Project (NSABP), Pittsburgh, PA, 15213, USA
| | - Gong Tang
- Department of Biostatistics, School of Public Health, University of Pittsburgh, Pittsburgh, PA, 15261, USA
- National Surgical Adjuvant Breast and Bowel Project (NSABP), Pittsburgh, PA, 15213, USA
| | - Xiao-Song Wang
- UPMC Hillman Cancer Center, University of Pittsburgh, Pittsburgh, PA, 15213, USA.
- Department of Pathology, School of Medicine, University of Pittsburgh, Pittsburgh, PA, 15213, USA.
- Department of Biomedical Informatics, School of Medicine, University of Pittsburgh, Pittsburgh, PA, 15206, USA.
| |
Collapse
|
2
|
Yang S, Wang Z, Wang C, Li C, Wang B. Comparative Evaluation of Machine Learning Models for Subtyping Triple-Negative Breast Cancer: A Deep Learning-Based Multi-Omics Data Integration Approach. J Cancer 2024; 15:3943-3957. [PMID: 38911381 PMCID: PMC11190774 DOI: 10.7150/jca.93215] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2023] [Accepted: 05/19/2024] [Indexed: 06/25/2024] Open
Abstract
Objective: Triple-negative breast cancer (TNBC) poses significant diagnostic challenges due to its aggressive nature. This research develops an innovative deep learning (DL) model based on the latest multi-omics data to enhance the accuracy of TNBC subtype and prognosis prediction. The study focuses on addressing the constraints of prior studies by showcasing a model with substantial advancements in data integration, statistical performance, and algorithmic optimization. Methods: Breast cancer-related molecular characteristic data, including mRNA, miRNA, gene mutations, DNA methylation, and magnetic resonance imaging (MRI) images, were retrieved from the TCGA and TCIA databases. This study not only compared single-omics with multi-omics machine learning models but also applied Bayesian optimization to innovatively optimize the neural network structure of a DL model for multi-omics data. Results: The DL model for multi-omics data significantly outperformed single-omics models in subtype prediction, achieving a 98.0% accuracy in cross-validation, 97.0% in the validation set, and 91.0% in an external test set. Additionally, the MRI radiomics model showed promising performance, especially with the training set; however, a decrease in performance during transfer testing underscored the advantages of the DL model for multi-omics data in data consistency and digital processing. Conclusion: Our multi-omics DL model presents notable innovations in statistical performance and transfer learning capability, bearing significant clinical relevance for TNBC classification and prognosis prediction. While the MRI radiomics model proved effective, it requires further optimization for cross-dataset application to enhance accuracy and consistency. Our findings offer new insights into improving TNBC classification and prognosis through multi-omics data and DL algorithms.
Collapse
Affiliation(s)
| | | | | | | | - Binjie Wang
- Department of Imaging, Huaihe Hospital of Henan University, Kaifeng 475000, P. R. China
| |
Collapse
|
3
|
Hajim WI, Zainudin S, Mohd Daud K, Alheeti K. Optimized models and deep learning methods for drug response prediction in cancer treatments: a review. PeerJ Comput Sci 2024; 10:e1903. [PMID: 38660174 PMCID: PMC11042005 DOI: 10.7717/peerj-cs.1903] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2023] [Accepted: 01/31/2024] [Indexed: 04/26/2024]
Abstract
Recent advancements in deep learning (DL) have played a crucial role in aiding experts to develop personalized healthcare services, particularly in drug response prediction (DRP) for cancer patients. The DL's techniques contribution to this field is significant, and they have proven indispensable in the medical field. This review aims to analyze the diverse effectiveness of various DL models in making these predictions, drawing on research published from 2017 to 2023. We utilized the VOS-Viewer 1.6.18 software to create a word cloud from the titles and abstracts of the selected studies. This study offers insights into the focus areas within DL models used for drug response. The word cloud revealed a strong link between certain keywords and grouped themes, highlighting terms such as deep learning, machine learning, precision medicine, precision oncology, drug response prediction, and personalized medicine. In order to achieve an advance in DRP using DL, the researchers need to work on enhancing the models' generalizability and interoperability. It is also crucial to develop models that not only accurately represent various architectures but also simplify these architectures, balancing the complexity with the predictive capabilities. In the future, researchers should try to combine methods that make DL models easier to understand; this will make DRP reviews more open and help doctors trust the decisions made by DL models in cancer DRP.
Collapse
Affiliation(s)
- Wesam Ibrahim Hajim
- Department of Applied Geology, College of Sciences, Tirkit University, Tikrit, Salah ad Din, Iraq
- Center for Artificial Intelligence Technology, Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia, Selangor, Malaysia
| | - Suhaila Zainudin
- Center for Artificial Intelligence Technology, Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia, Selangor, Malaysia
| | - Kauthar Mohd Daud
- Center for Artificial Intelligence Technology, Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia, Selangor, Malaysia
| | - Khattab Alheeti
- Department of Computer Networking Systems, College of Computer Sciences and Information Technology, University of Anbar, Al Anbar, Ramadi, Iraq
| |
Collapse
|
4
|
Hussain S, Ali M, Naseem U, Nezhadmoghadam F, Jatoi MA, Gulliver TA, Tamez-Peña JG. Breast cancer risk prediction using machine learning: a systematic review. Front Oncol 2024; 14:1343627. [PMID: 38571502 PMCID: PMC10987819 DOI: 10.3389/fonc.2024.1343627] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2023] [Accepted: 02/26/2024] [Indexed: 04/05/2024] Open
Abstract
Background Breast cancer is the leading cause of cancer-related fatalities among women worldwide. Conventional screening and risk prediction models primarily rely on demographic and patient clinical history to devise policies and estimate likelihood. However, recent advancements in artificial intelligence (AI) techniques, particularly deep learning (DL), have shown promise in the development of personalized risk models. These models leverage individual patient information obtained from medical imaging and associated reports. In this systematic review, we thoroughly investigated the existing literature on the application of DL to digital mammography, radiomics, genomics, and clinical information for breast cancer risk assessment. We critically analyzed these studies and discussed their findings, highlighting the promising prospects of DL techniques for breast cancer risk prediction. Additionally, we explored ongoing research initiatives and potential future applications of AI-driven approaches to further improve breast cancer risk prediction, thereby facilitating more effective screening and personalized risk management strategies. Objective and methods This study presents a comprehensive overview of imaging and non-imaging features used in breast cancer risk prediction using traditional and AI models. The features reviewed in this study included imaging, radiomics, genomics, and clinical features. Furthermore, this survey systematically presented DL methods developed for breast cancer risk prediction, aiming to be useful for both beginners and advanced-level researchers. Results A total of 600 articles were identified, 20 of which met the set criteria and were selected. Parallel benchmarking of DL models, along with natural language processing (NLP) applied to imaging and non-imaging features, could allow clinicians and researchers to gain greater awareness as they consider the clinical deployment or development of new models. This review provides a comprehensive guide for understanding the current status of breast cancer risk assessment using AI. Conclusion This study offers investigators a different perspective on the use of AI for breast cancer risk prediction, incorporating numerous imaging and non-imaging features.
Collapse
Affiliation(s)
- Sadam Hussain
- School of Engineering and Sciences, Tecnologico de Monterrey, Monterrey, Mexico
- Department of Electrical and Computer Engineering, University of Victoria, Victoria, BC, Canada
| | - Mansoor Ali
- School of Engineering and Sciences, Tecnologico de Monterrey, Monterrey, Mexico
| | - Usman Naseem
- College of Science and Engineering, James Cook University, Cairns, QLD, Australia
| | | | - Munsif Ali Jatoi
- Department of Biomedical Engineering, Salim Habib University, Karachi, Pakistan
| | - T. Aaron Gulliver
- Department of Electrical and Computer Engineering, University of Victoria, Victoria, BC, Canada
| | | |
Collapse
|
5
|
Tong L, Shi W, Isgut M, Zhong Y, Lais P, Gloster L, Sun J, Swain A, Giuste F, Wang MD. Integrating Multi-Omics Data With EHR for Precision Medicine Using Advanced Artificial Intelligence. IEEE Rev Biomed Eng 2024; 17:80-97. [PMID: 37824325 DOI: 10.1109/rbme.2023.3324264] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/14/2023]
Abstract
With the recent advancement of novel biomedical technologies such as high-throughput sequencing and wearable devices, multi-modal biomedical data ranging from multi-omics molecular data to real-time continuous bio-signals are generated at an unprecedented speed and scale every day. For the first time, these multi-modal biomedical data are able to make precision medicine close to a reality. However, due to data volume and the complexity, making good use of these multi-modal biomedical data requires major effort. Researchers and clinicians are actively developing artificial intelligence (AI) approaches for data-driven knowledge discovery and causal inference using a variety of biomedical data modalities. These AI-based approaches have demonstrated promising results in various biomedical and healthcare applications. In this review paper, we summarize the state-of-the-art AI models for integrating multi-omics data and electronic health records (EHRs) for precision medicine. We discuss the challenges and opportunities in integrating multi-omics data with EHRs and future directions. We hope this review can inspire future research and developing in integrating multi-omics data with EHRs for precision medicine.
Collapse
|
6
|
Ogunleye A, Piyawajanusorn C, Ghislat G, Ballester PJ. Large-Scale Machine Learning Analysis Reveals DNA Methylation and Gene Expression Response Signatures for Gemcitabine-Treated Pancreatic Cancer. HEALTH DATA SCIENCE 2024; 4:0108. [PMID: 38486621 PMCID: PMC10904073 DOI: 10.34133/hds.0108] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/15/2023] [Accepted: 12/08/2023] [Indexed: 03/17/2024]
Abstract
Background: Gemcitabine is a first-line chemotherapy for pancreatic adenocarcinoma (PAAD), but many PAAD patients do not respond to gemcitabine-containing treatments. Being able to predict such nonresponders would hence permit the undelayed administration of more promising treatments while sparing gemcitabine life-threatening side effects for those patients. Unfortunately, the few predictors of PAAD patient response to this drug are weak, none of them exploiting yet the power of machine learning (ML). Methods: Here, we applied ML to predict the response of PAAD patients to gemcitabine from the molecular profiles of their tumors. More concretely, we collected diverse molecular profiles of PAAD patient tumors along with the corresponding clinical data (gemcitabine responses and clinical features) from the Genomic Data Commons resource. From systematically combining 8 tumor profiles with 16 classification algorithms, each of the resulting 128 ML models was evaluated by multiple 10-fold cross-validations. Results: Only 7 of these 128 models were predictive, which underlines the importance of carrying out such a large-scale analysis to avoid missing the most predictive models. These were here random forest using 4 selected mRNAs [0.44 Matthews correlation coefficient (MCC), 0.785 receiver operating characteristic-area under the curve (ROC-AUC)] and XGBoost combining 12 DNA methylation probes (0.32 MCC, 0.697 ROC-AUC). By contrast, the hENT1 marker obtained much worse random-level performance (practically 0 MCC, 0.5 ROC-AUC). Despite not being trained to predict prognosis (overall and progression-free survival), these ML models were also able to anticipate this patient outcome. Conclusions: We release these promising ML models so that they can be evaluated prospectively on other gemcitabine-treated PAAD patients.
Collapse
Affiliation(s)
- Adeolu Ogunleye
- Department of Organismal Biology,
Uppsala University, Uppsala, Sweden
| | | | - Ghita Ghislat
- Department of Life Sciences,
Imperial College London, London, UK
| | | |
Collapse
|
7
|
Grieb N, Schmierer L, Kim HU, Strobel S, Schulz C, Meschke T, Kubasch AS, Brioli A, Platzbecker U, Neumuth T, Merz M, Oeser A. A digital twin model for evidence-based clinical decision support in multiple myeloma treatment. Front Digit Health 2023; 5:1324453. [PMID: 38173909 PMCID: PMC10761485 DOI: 10.3389/fdgth.2023.1324453] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2023] [Accepted: 12/05/2023] [Indexed: 01/05/2024] Open
Abstract
The treatment landscape for multiple myeloma (MM) has experienced substantial progress over the last decade. Despite the efficacy of new substances, patient responses tend to still be highly unpredictable. With increasing cognitive burden that is introduced through a complex and evolving treatment landscape, data-driven assistance tools are becoming more and more popular. Model-based approaches, such as digital twins (DT), enable simulation of probable responses to a set of input parameters based on retrospective observations. In the context of treatment decision-support, those mechanisms serve the goal to predict therapeutic outcomes to distinguish a favorable option from a potential failure. In the present work, we propose a similarity-based multiple myeloma digital twin (MMDT) that emphasizes explainability and interpretability in treatment outcome evaluation. We've conducted a requirement specification process using scientific literature from the medical and methodological domains to derive an architectural blueprint for the design and implementation of the MMDT. In a subsequent stage, we've implemented a four-layer concept where for each layer, we describe the utilized implementation procedure and interfaces to the surrounding DT environment. We further specify our solutions regarding the adoption of multi-line treatment strategies, the integration of external evidence and knowledge, as well as mechanisms to enable transparency in the data processing logic. Furthermore, we define an initial evaluation scenario in the context of patient characterization and treatment outcome simulation as an exemplary use case for our MMDT. Our derived MMDT instance is defined by 475 unique entities connected through 438 edges to form a MM knowledge graph. Using the MMRF CoMMpass real-world evidence database and a sample MM case, we processed a complete outcome assessment. The output shows a valid selection of potential treatment strategies for the integrated medical case and highlights the potential of the MMDT to be used for such applications. DT models face significant challenges in development, including availability of clinical data to algorithmically derive clinical decision support, as well as trustworthiness of the evaluated treatment options. We propose a collaborative approach that mitigates the regulatory and ethical concerns that are broadly discussed when automated decision-making tools are to be included into clinical routine.
Collapse
Affiliation(s)
- Nora Grieb
- Innovation Center Computer Assisted Surgery (ICCAS), University of Leipzig, Leipzig, Germany
| | - Lukas Schmierer
- Innovation Center Computer Assisted Surgery (ICCAS), University of Leipzig, Leipzig, Germany
| | - Hyeon Ung Kim
- Innovation Center Computer Assisted Surgery (ICCAS), University of Leipzig, Leipzig, Germany
| | - Sarah Strobel
- Innovation Center Computer Assisted Surgery (ICCAS), University of Leipzig, Leipzig, Germany
| | - Christian Schulz
- Innovation Center Computer Assisted Surgery (ICCAS), University of Leipzig, Leipzig, Germany
| | - Tim Meschke
- Innovation Center Computer Assisted Surgery (ICCAS), University of Leipzig, Leipzig, Germany
| | - Anne Sophie Kubasch
- Department of Hematology, Hemostaseology, Cellular Therapy and Infectiology, University Hospital of Leipzig, Leipzig, Germany
| | - Annamaria Brioli
- Clinic of Internal Medicine C, Hematology and Oncology, Stem Cell Transplantation and Palliative Care, Greifswald University Medicine, Greifswald, Germany
| | - Uwe Platzbecker
- Department of Hematology, Hemostaseology, Cellular Therapy and Infectiology, University Hospital of Leipzig, Leipzig, Germany
| | - Thomas Neumuth
- Innovation Center Computer Assisted Surgery (ICCAS), University of Leipzig, Leipzig, Germany
| | - Maximilian Merz
- Department of Hematology, Hemostaseology, Cellular Therapy and Infectiology, University Hospital of Leipzig, Leipzig, Germany
| | - Alexander Oeser
- Innovation Center Computer Assisted Surgery (ICCAS), University of Leipzig, Leipzig, Germany
| |
Collapse
|
8
|
Li Y, Guo Z, Gao X, Wang G. MMCL-CDR: enhancing cancer drug response prediction with multi-omics and morphology images contrastive representation learning. Bioinformatics 2023; 39:btad734. [PMID: 38070154 PMCID: PMC10756335 DOI: 10.1093/bioinformatics/btad734] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2023] [Revised: 11/09/2023] [Indexed: 12/30/2023] Open
Abstract
MOTIVATION Cancer is a complex disease that results in a significant number of global fatalities. Treatment strategies can vary among patients, even if they have the same type of cancer. The application of precision medicine in cancer shows promise for treating different types of cancer, reducing healthcare expenses, and improving recovery rates. To achieve personalized cancer treatment, machine learning models have been developed to predict drug responses based on tumor and drug characteristics. However, current studies either focus on constructing homogeneous networks from single data source or heterogeneous networks from multiomics data. While multiomics data have shown potential in predicting drug responses in cancer cell lines, there is still a lack of research that effectively utilizes insights from different modalities. Furthermore, effectively utilizing the multimodal knowledge of cancer cell lines poses a challenge due to the heterogeneity inherent in these modalities. RESULTS To address these challenges, we introduce MMCL-CDR (Multimodal Contrastive Learning for Cancer Drug Responses), a multimodal approach for cancer drug response prediction that integrates copy number variation, gene expression, morphology images of cell lines, and chemical structure of drugs. The objective of MMCL-CDR is to align cancer cell lines across different data modalities by learning cell line representations from omic and image data, and combined with structural drug representations to enhance the prediction of cancer drug responses (CDR). We have carried out comprehensive experiments and show that our model significantly outperforms other state-of-the-art methods in CDR prediction. The experimental results also prove that the model can learn more accurate cell line representation by integrating multiomics and morphological data from cell lines, thereby improving the accuracy of CDR prediction. In addition, the ablation study and qualitative analysis also confirm the effectiveness of each part of our proposed model. Last but not least, MMCL-CDR opens up a new dimension for cancer drug response prediction through multimodal contrastive learning, pioneering a novel approach that integrates multiomics and multimodal drug and cell line modeling. AVAILABILITY AND IMPLEMENTATION MMCL-CDR is available at https://github.com/catly/MMCL-CDR.
Collapse
Affiliation(s)
- Yang Li
- College of Computer and Control Engineering, Northeast Forestry University, Harbin 150006, China
| | - Zihou Guo
- College of Computer and Control Engineering, Northeast Forestry University, Harbin 150006, China
| | - Xin Gao
- Computational Bioscience Research Center, King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia
- Computer Science Program, Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia
| | - Guohua Wang
- College of Computer and Control Engineering, Northeast Forestry University, Harbin 150006, China
| |
Collapse
|
9
|
Lee S, Sun M, Hu Y, Wang Y, Islam MN, Goerlitz D, Lucas PC, Lee AV, Swain SM, Tang G, Wang XS. iGenSig-Rx: an integral genomic signature based white-box tool for modeling cancer therapeutic responses using multi-omics data. RESEARCH SQUARE 2023:rs.3.rs-3649238. [PMID: 38077030 PMCID: PMC10705599 DOI: 10.21203/rs.3.rs-3649238/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/21/2023]
Abstract
Multi-omics sequencing is expected to become clinically routine within the next decade and transform clinical care. However, there is a paucity of viable and interpretable genome-wide modeling methods that can facilitate rational selection of patients for tailored intervention. Here we develop an integral genomic signature-based method called iGenSig-Rx as a white-box tool for modeling therapeutic response based on clinical trial datasets with improved cross-dataset applicability and tolerance to sequencing bias. This method leverages high-dimensional redundant genomic features to address the challenges of cross-dataset modeling, a concept similar to the use of redundant steel rods to reinforce the pillars of a building. Using genomic datasets for HER2 targeted therapies, the iGenSig-Rx model demonstrates stable predictive power across four independent clinical trials. More importantly, the iGenSig-Rx model offers the level of transparency much needed for clinical application, allowing for clear explanations as to how the predictions are produced, how the features contribute to the prediction, and what are the key underlying pathways. We expect that iGenSig-Rx as a class of biologically interpretable multi-omics modeling methods will have broad applications in big-data based precision oncology. The R package is available: https://github.com/wangxlab/iGenSig-Rx. NOTE: the Github website will be released upon publication and the R package is available for review through google drive: https://drive.google.com/drive/folders/1KgecmUoon9-h2Dg1rPCyEGFPOp28Ols3?usp=sharing.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | - Sandra M Swain
- National Surgical Adjuvant Breast and Bowel Project (NSABP)
| | | | | |
Collapse
|
10
|
Nguyen QTN, Nguyen P, Wang C, Phuc PT, Lin R, Hung C, Kuo N, Cheng Y, Lin S, Hsieh Z, Cheng C, Hsu M, Hsu JC. Machine learning approaches for predicting 5-year breast cancer survival: A multicenter study. Cancer Sci 2023; 114:4063-4072. [PMID: 37489252 PMCID: PMC10551582 DOI: 10.1111/cas.15917] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2023] [Revised: 06/27/2023] [Accepted: 07/05/2023] [Indexed: 07/26/2023] Open
Abstract
The study used clinical data to develop a prediction model for breast cancer survival. Breast cancer prognostic factors were explored using machine learning techniques. We conducted a retrospective study using data from the Taipei Medical University Clinical Research Database, which contains electronic medical records from three affiliated hospitals in Taiwan. The study included female patients aged over 20 years who were diagnosed with primary breast cancer and had medical records in hospitals between January 1, 2009 and December 31, 2020. The data were divided into training and external testing datasets. Nine different machine learning algorithms were applied to develop the models. The performances of the algorithms were measured using the area under the receiver operating characteristic curve (AUC), accuracy, sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and F1-score. A total of 3914 patients were included in the study. The highest AUC of 0.95 was observed with the artificial neural network model (accuracy, 0.90; sensitivity, 0.71; specificity, 0.73; PPV, 0.28; NPV, 0.94; and F1-score, 0.37). Other models showed relatively high AUC, ranging from 0.75 to 0.83. According to the optimal model results, cancer stage, tumor size, diagnosis age, surgery, and body mass index were the most critical factors for predicting breast cancer survival. The study successfully established accurate 5-year survival predictive models for breast cancer. Furthermore, the study found key factors that could affect breast cancer survival in Taiwanese women. Its results might be used as a reference for the clinical practice of breast cancer treatment.
Collapse
Affiliation(s)
- Quynh Thi Nhu Nguyen
- School of Pharmacy, College of PharmacyTaipei Medical UniversityTaipei CityTaiwan
| | - Phung‐Anh Nguyen
- Clinical Data Center, Office of Data ScienceTaipei Medical UniversityTaipei CityTaiwan
- Clinical Big Data Research CenterTaipei Medical University Hospital, Taipei Medical UniversityTaipei CityTaiwan
- Research Center of Health Care Industry Data Science, College of ManagementTaipei Medical UniversityTaipei CityTaiwan
| | - Chun‐Jung Wang
- School of Pharmacy, College of PharmacyTaipei Medical UniversityTaipei CityTaiwan
| | - Phan Thanh Phuc
- Research Center of Health Care Industry Data Science, College of ManagementTaipei Medical UniversityTaipei CityTaiwan
| | - Ruo‐Kai Lin
- School of Pharmacy, College of PharmacyTaipei Medical UniversityTaipei CityTaiwan
| | - Chin‐Sheng Hung
- Department of Surgery, School of Medicine, College of MedicineTaipei Medical UniversityTaipei CityTaiwan
| | - Nei‐Hui Kuo
- Oncology CenterTaipei Medical University HospitalTaipei CityTaiwan
| | - Yu‐Wen Cheng
- School of Pharmacy, College of PharmacyTaipei Medical UniversityTaipei CityTaiwan
| | - Shwu‐Jiuan Lin
- School of Pharmacy, College of PharmacyTaipei Medical UniversityTaipei CityTaiwan
| | - Zong‐You Hsieh
- Research Center of Health Care Industry Data Science, College of ManagementTaipei Medical UniversityTaipei CityTaiwan
| | - Chi‐Tsun Cheng
- Research Center of Health Care Industry Data Science, College of ManagementTaipei Medical UniversityTaipei CityTaiwan
| | - Min‐Huei Hsu
- Clinical Data Center, Office of Data ScienceTaipei Medical UniversityTaipei CityTaiwan
- Graduate Institute of Data Science, College of ManagementTaipei Medical UniversityTaipei CityTaiwan
| | - Jason C. Hsu
- Clinical Data Center, Office of Data ScienceTaipei Medical UniversityTaipei CityTaiwan
- Clinical Big Data Research CenterTaipei Medical University Hospital, Taipei Medical UniversityTaipei CityTaiwan
- Research Center of Health Care Industry Data Science, College of ManagementTaipei Medical UniversityTaipei CityTaiwan
- International Ph.D. Program in Biotech and Healthcare Management, College of ManagementTaipei Medical UniversityTaipei CityTaiwan
| |
Collapse
|
11
|
Xia L, Xu L, Pan S, Niu D, Zhang B, Li Z. Drug-target binding affinity prediction using message passing neural network and self supervised learning. BMC Genomics 2023; 24:557. [PMID: 37730555 PMCID: PMC10510145 DOI: 10.1186/s12864-023-09664-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2023] [Accepted: 09/09/2023] [Indexed: 09/22/2023] Open
Abstract
BACKGROUND Drug-target binding affinity (DTA) prediction is important for the rapid development of drug discovery. Compared to traditional methods, deep learning methods provide a new way for DTA prediction to achieve good performance without much knowledge of the biochemical background. However, there are still room for improvement in DTA prediction: (1) only focusing on the information of the atom leads to an incomplete representation of the molecular graph; (2) the self-supervised learning method could be introduced for protein representation. RESULTS In this paper, a DTA prediction model using the deep learning method is proposed, which uses an undirected-CMPNN for molecular embedding and combines CPCProt and MLM models for protein embedding. An attention mechanism is introduced to discover the important part of the protein sequence. The proposed method is evaluated on the datasets Ki and Davis, and the model outperformed other deep learning methods. CONCLUSIONS The proposed model improves the performance of the DTA prediction, which provides a novel strategy for deep learning-based virtual screening methods.
Collapse
Affiliation(s)
- Leiming Xia
- College of Computer Science and Technology, Qingdao University, Qingdao, China
| | - Lei Xu
- College of Computer Science and Technology, Qingdao University, Qingdao, China
| | - Shourun Pan
- College of Computer Science and Technology, Qingdao University, Qingdao, China
| | - Dongjiang Niu
- College of Computer Science and Technology, Qingdao University, Qingdao, China
| | - Beiyi Zhang
- College of Computer Science and Technology, Qingdao University, Qingdao, China
| | - Zhen Li
- College of Computer Science and Technology, Qingdao University, Qingdao, China.
| |
Collapse
|
12
|
Ochoa S, Hernández-Lemus E. Molecular mechanisms of multi-omic regulation in breast cancer. Front Oncol 2023; 13:1148861. [PMID: 37564937 PMCID: PMC10411627 DOI: 10.3389/fonc.2023.1148861] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2023] [Accepted: 07/05/2023] [Indexed: 08/12/2023] Open
Abstract
Breast cancer is a complex disease that is influenced by the concurrent influence of multiple genetic and environmental factors. Recent advances in genomics and other high throughput biomolecular techniques (-omics) have provided numerous insights into the molecular mechanisms underlying breast cancer development and progression. A number of these mechanisms involve multiple layers of regulation. In this review, we summarize the current knowledge on the role of multiple omics in the regulation of breast cancer, including the effects of DNA methylation, non-coding RNA, and other epigenomic changes. We comment on how integrating such diverse mechanisms is envisioned as key to a more comprehensive understanding of breast carcinogenesis and cancer biology with relevance to prognostics, diagnostics and therapeutics. We also discuss the potential clinical implications of these findings and highlight areas for future research. Overall, our understanding of the molecular mechanisms of multi-omic regulation in breast cancer is rapidly increasing and has the potential to inform the development of novel therapeutic approaches for this disease.
Collapse
Affiliation(s)
- Soledad Ochoa
- Computational Genomics Division, National Institute of Genomic Medicine, Mexico City, Mexico
- Department of Obstetrics and Gynecology, Cedars-Sinai Medical Center, Los Angeles, CA, United States
| | - Enrique Hernández-Lemus
- Computational Genomics Division, National Institute of Genomic Medicine, Mexico City, Mexico
- Center for Complexity Sciences, Universidad Nacional Autónoma de México, Mexico City, Mexico
| |
Collapse
|
13
|
Salimy S, Lanjanian H, Abbasi K, Salimi M, Najafi A, Tapak L, Masoudi-Nejad A. A deep learning-based framework for predicting survival-associated groups in colon cancer by integrating multi-omics and clinical data. Heliyon 2023; 9:e17653. [PMID: 37455955 PMCID: PMC10344710 DOI: 10.1016/j.heliyon.2023.e17653] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2023] [Revised: 05/30/2023] [Accepted: 06/25/2023] [Indexed: 07/18/2023] Open
Abstract
Precise prognostic classification of patients and identifying survival subgroups and their associated genes can be important clinical references when designing treatment strategies for cancer patients. Multi-omics and data integration techniques are powerful tools to achieve this goal. This study aimed to introduce a machine learning method to integrate three types of biological data, and investigate the performance of two other methods, in identifying the survival dependency of patients. The data included TCGA RNA-seq gene expression, DNA methylation, and clinical data from 368 patients with colon cancer also we use an independent external validation data set, containing 232 samples. Three methods including, hyper-parameter optimized autoencoders (HPOAE), normal autoencoder, and penalized principal component analysis (PPCA) were used for simultaneous data integration and estimation under a COX hazards model. The HPOAE was thought to outperform other methods. The HPOAE had the Log Rank Mantel-Cox value of 14.27 ± 2, and a Breslow-Generalized Wilcoxon value of 13.13 ± 1. Ten miRNA, 11 methylated genes, and 28 mRNA all by (importance of marginal cutoff > 0.95) were identified. The study demonstrated that hsa-miR-485-5p targets both ZMYM1 and tp53, the latter of which has been previously associated with cancer in numerous studies. Furthermore, compared to other methods, the HPOAE exhibited a greater capacity for identifying survival subgroups and the genes associated with them in patients with colon cancer. However, all of the results were obtained by computational methods, and clinical and experimental studies are needed to validate these results.
Collapse
Affiliation(s)
- Siamak Salimy
- Laboratory of System Biology and Bioinformatics (LBB), Department of Bioinformatics, University of Tehran, Kish International Campus, Kish, Iran
| | - Hossein Lanjanian
- Cellular and Molecular Endocrine Research Center, Research Institute for Endocrine Sciences, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Karim Abbasi
- Laboratory of System Biology, Bioinformatics & Artificial Intelligent in Medicine (LBBai), Faculty of Mathematics and Computer Science, Kharazmi University, Tehran, Iran
| | - Mahdieh Salimi
- Department of Medical Genetics, Institute of Medical Biotechnology, National Institute of Genetic Engineering and Biotechnology (NIGEB), Tehran, Iran
| | - Ali Najafi
- Molecular Biology Research Center, Systems Biology and Poisonings Institute, Tehran, Iran
| | - Leili Tapak
- Department of Biostatistics, School of Public Health and Modeling of Noncommunicable Diseases Research Center, Hamadan University of Medical Sciences, Hamadan, Iran
| | - Ali Masoudi-Nejad
- Laboratory of System Biology and Bioinformatics (LBB), Department of Bioinformatics, University of Tehran, Kish International Campus, Kish, Iran
| |
Collapse
|
14
|
Lee M. Deep Learning Techniques with Genomic Data in Cancer Prognosis: A Comprehensive Review of the 2021-2023 Literature. BIOLOGY 2023; 12:893. [PMID: 37508326 PMCID: PMC10376033 DOI: 10.3390/biology12070893] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/17/2023] [Revised: 06/16/2023] [Accepted: 06/20/2023] [Indexed: 07/30/2023]
Abstract
Deep learning has brought about a significant transformation in machine learning, leading to an array of novel methodologies and consequently broadening its influence. The application of deep learning in various sectors, especially biomedical data analysis, has initiated a period filled with noteworthy scientific developments. This trend has majorly influenced cancer prognosis, where the interpretation of genomic data for survival analysis has become a central research focus. The capacity of deep learning to decode intricate patterns embedded within high-dimensional genomic data has provoked a paradigm shift in our understanding of cancer survival. Given the swift progression in this field, there is an urgent need for a comprehensive review that focuses on the most influential studies from 2021 to 2023. This review, through its careful selection and thorough exploration of dominant trends and methodologies, strives to fulfill this need. The paper aims to enhance our existing understanding of applications of deep learning in cancer survival analysis, while also highlighting promising directions for future research. This paper undertakes aims to enrich our existing grasp of the application of deep learning in cancer survival analysis, while concurrently shedding light on promising directions for future research in this vibrant and rapidly proliferating field.
Collapse
Affiliation(s)
- Minhyeok Lee
- School of Electrical and Electronics Engineering, Chung-Ang University, Seoul 06974, Republic of Korea
| |
Collapse
|
15
|
Hostallero DE, Wei L, Wang L, Cairns J, Emad A. Preclinical-to-clinical Anti-cancer Drug Response Prediction and Biomarker Identification Using TINDL. GENOMICS, PROTEOMICS & BIOINFORMATICS 2023; 21:535-550. [PMID: 36775056 PMCID: PMC10787192 DOI: 10.1016/j.gpb.2023.01.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/03/2021] [Revised: 11/28/2022] [Accepted: 01/31/2023] [Indexed: 02/12/2023]
Abstract
Prediction of the response of cancer patients to different treatments and identification of biomarkers of drug response are two major goals of individualized medicine. Here, we developed a deep learning framework called TINDL, completely trained on preclinical cancer cell lines (CCLs), to predict the response of cancer patients to different treatments. TINDL utilizes a tissue-informed normalization to account for the tissue type and cancer type of the tumors and to reduce the statistical discrepancies between CCLs and patient tumors. Moreover, by making the deep learning black box interpretable, this model identifies a small set of genes whose expression levels are predictive of drug response in the trained model, enabling identification of biomarkers of drug response. Using data from two large databases of CCLs and cancer tumors, we showed that this model can distinguish between sensitive and resistant tumors for 10 (out of 14) drugs, outperforming various other machine learning models. In addition, our small interfering RNA (siRNA) knockdown experiments on 10 genes identified by this model for one of the drugs (tamoxifen) confirmed that tamoxifen sensitivity is substantially influenced by all of these genes in MCF7 cells, and seven of these genes in T47D cells. Furthermore, genes implicated for multiple drugs pointed to shared mechanism of action among drugs and suggested several important signaling pathways. In summary, this study provides a powerful deep learning framework for prediction of drug response and identification of biomarkers of drug response in cancer. The code can be accessed at https://github.com/ddhostallero/tindl.
Collapse
Affiliation(s)
- David Earl Hostallero
- Department of Electrical and Computer Engineering, McGill University, Montreal, QC H3A, Canada; Mila - Quebec Artificial Intelligence Institute, Montreal, QC H2S, Canada
| | - Lixuan Wei
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic, Rochester, MN 55905, USA
| | - Liewei Wang
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic, Rochester, MN 55905, USA
| | - Junmei Cairns
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic, Rochester, MN 55905, USA.
| | - Amin Emad
- Department of Electrical and Computer Engineering, McGill University, Montreal, QC H3A, Canada; Mila - Quebec Artificial Intelligence Institute, Montreal, QC H2S, Canada; The Rosalind and Morris Goodman Cancer Institute, McGill University, Montreal, QC H3A, Canada.
| |
Collapse
|
16
|
Pang J, Xiu W, Ma X. Application of Artificial Intelligence in the Diagnosis, Treatment, and Prognostic Evaluation of Mediastinal Malignant Tumors. J Clin Med 2023; 12:jcm12082818. [PMID: 37109155 PMCID: PMC10144939 DOI: 10.3390/jcm12082818] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2022] [Revised: 03/01/2023] [Accepted: 04/06/2023] [Indexed: 04/29/2023] Open
Abstract
Artificial intelligence (AI), also known as machine intelligence, is widely utilized in the medical field, promoting medical advances. Malignant tumors are the critical focus of medical research and improvement of clinical diagnosis and treatment. Mediastinal malignancy is an important tumor that attracts increasing attention today due to the difficulties in treatment. Combined with artificial intelligence, challenges from drug discovery to survival improvement are constantly being overcome. This article reviews the progress of the use of AI in the diagnosis, treatment, and prognostic prospects of mediastinal malignant tumors based on current literature findings.
Collapse
Affiliation(s)
- Jiyun Pang
- Division of Thoracic Tumor Multimodality Treatment, Cancer Center, West China Hospital, Sichuan University, Chengdu 610041, China
- State Key Laboratory of Biotherapy, Cancer Center, West China Hospital, Sichuan University, Chengdu 610041, China
- West China School of Medicine, Sichuan University, Chengdu 610041, China
| | - Weigang Xiu
- Division of Thoracic Tumor Multimodality Treatment, Cancer Center, West China Hospital, Sichuan University, Chengdu 610041, China
- State Key Laboratory of Biotherapy, Cancer Center, West China Hospital, Sichuan University, Chengdu 610041, China
- West China School of Medicine, Sichuan University, Chengdu 610041, China
| | - Xuelei Ma
- Department of Biotherapy, Cancer Center, West China Hospital, Sichuan University, Chengdu 610041, China
| |
Collapse
|
17
|
Wu P, Sun R, Fahira A, Chen Y, Jiangzhou H, Wang K, Yang Q, Dai Y, Pan D, Shi Y, Wang Z. DROEG: a method for cancer drug response prediction based on omics and essential genes integration. Brief Bioinform 2023; 24:7008798. [PMID: 36715269 DOI: 10.1093/bib/bbad003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2022] [Revised: 12/06/2022] [Accepted: 12/30/2022] [Indexed: 01/31/2023] Open
Abstract
Predicting therapeutic responses in cancer patients is a major challenge in the field of precision medicine due to high inter- and intra-tumor heterogeneity. Most drug response models need to be improved in terms of accuracy, and there is limited research to assess therapeutic responses of particular tumor types. Here, we developed a novel method DROEG (Drug Response based on Omics and Essential Genes) for prediction of drug response in tumor cell lines by integrating genomic, transcriptomic and methylomic data along with CRISPR essential genes, and revealed that the incorporation of tumor proliferation essential genes can improve drug sensitivity prediction. Concisely, DROEG integrates literature-based and statistics-based methods to select features and uses Support Vector Regression for model construction. We demonstrate that DROEG outperforms most state-of-the-art algorithms by both qualitative (prediction accuracy for drug-sensitive/resistant) and quantitative (Pearson correlation coefficient between the predicted and actual IC50) evaluation in Genomics of Drug Sensitivity in Cancer and Cancer Cell Line Encyclopedia datasets. In addition, DROEG is further applied to the pan-gastrointestinal tumor with high prevalence and mortality as a case study at both cell line and clinical levels to evaluate the model efficacy and discover potential prognostic biomarkers in Cisplatin and Epirubicin treatment. Interestingly, the CRISPR essential gene information is found to be the most important contributor to enhance the accuracy of the DROEG model. To our knowledge, this is the first study to integrate essential genes with multi-omics data to improve cancer drug response prediction and provide insights into personalized precision treatment.
Collapse
Affiliation(s)
- Peike Wu
- Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders (Ministry of Education), Shanghai Jiao Tong University, Shanghai, China
- Collaborative Innovation Centre for Brain Science, Shanghai Jiao Tong University, Shanghai, China
| | - Renliang Sun
- Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders (Ministry of Education), Shanghai Jiao Tong University, Shanghai, China
- CAS Key Laboratory of Computational Biology, Bio-Med Big Data Center, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China
| | - Aamir Fahira
- Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders (Ministry of Education), Shanghai Jiao Tong University, Shanghai, China
- Collaborative Innovation Centre for Brain Science, Shanghai Jiao Tong University, Shanghai, China
| | - Yongzhou Chen
- School of Mathematical Sciences, Shanghai Jiao Tong University, Shanghai, China
| | - Huiting Jiangzhou
- Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders (Ministry of Education), Shanghai Jiao Tong University, Shanghai, China
- Collaborative Innovation Centre for Brain Science, Shanghai Jiao Tong University, Shanghai, China
| | - Ke Wang
- Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders (Ministry of Education), Shanghai Jiao Tong University, Shanghai, China
- Collaborative Innovation Centre for Brain Science, Shanghai Jiao Tong University, Shanghai, China
| | - Qiangzhen Yang
- Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders (Ministry of Education), Shanghai Jiao Tong University, Shanghai, China
- Collaborative Innovation Centre for Brain Science, Shanghai Jiao Tong University, Shanghai, China
| | - Yang Dai
- Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders (Ministry of Education), Shanghai Jiao Tong University, Shanghai, China
- Collaborative Innovation Centre for Brain Science, Shanghai Jiao Tong University, Shanghai, China
| | - Dun Pan
- Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders (Ministry of Education), Shanghai Jiao Tong University, Shanghai, China
- Collaborative Innovation Centre for Brain Science, Shanghai Jiao Tong University, Shanghai, China
| | - Yongyong Shi
- Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders (Ministry of Education), Shanghai Jiao Tong University, Shanghai, China
- Collaborative Innovation Centre for Brain Science, Shanghai Jiao Tong University, Shanghai, China
| | - Zhuo Wang
- Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders (Ministry of Education), Shanghai Jiao Tong University, Shanghai, China
- Collaborative Innovation Centre for Brain Science, Shanghai Jiao Tong University, Shanghai, China
| |
Collapse
|
18
|
Partin A, Brettin TS, Zhu Y, Narykov O, Clyde A, Overbeek J, Stevens RL. Deep learning methods for drug response prediction in cancer: Predominant and emerging trends. Front Med (Lausanne) 2023; 10:1086097. [PMID: 36873878 PMCID: PMC9975164 DOI: 10.3389/fmed.2023.1086097] [Citation(s) in RCA: 19] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2022] [Accepted: 01/23/2023] [Indexed: 02/17/2023] Open
Abstract
Cancer claims millions of lives yearly worldwide. While many therapies have been made available in recent years, by in large cancer remains unsolved. Exploiting computational predictive models to study and treat cancer holds great promise in improving drug development and personalized design of treatment plans, ultimately suppressing tumors, alleviating suffering, and prolonging lives of patients. A wave of recent papers demonstrates promising results in predicting cancer response to drug treatments while utilizing deep learning methods. These papers investigate diverse data representations, neural network architectures, learning methodologies, and evaluations schemes. However, deciphering promising predominant and emerging trends is difficult due to the variety of explored methods and lack of standardized framework for comparing drug response prediction models. To obtain a comprehensive landscape of deep learning methods, we conducted an extensive search and analysis of deep learning models that predict the response to single drug treatments. A total of 61 deep learning-based models have been curated, and summary plots were generated. Based on the analysis, observable patterns and prevalence of methods have been revealed. This review allows to better understand the current state of the field and identify major challenges and promising solution paths.
Collapse
Affiliation(s)
- Alexander Partin
- Division of Data Science and Learning, Argonne National Laboratory, Lemont, IL, United States
| | - Thomas S. Brettin
- Division of Data Science and Learning, Argonne National Laboratory, Lemont, IL, United States
| | - Yitan Zhu
- Division of Data Science and Learning, Argonne National Laboratory, Lemont, IL, United States
| | - Oleksandr Narykov
- Division of Data Science and Learning, Argonne National Laboratory, Lemont, IL, United States
| | - Austin Clyde
- Division of Data Science and Learning, Argonne National Laboratory, Lemont, IL, United States
| | - Jamie Overbeek
- Division of Data Science and Learning, Argonne National Laboratory, Lemont, IL, United States
| | - Rick L. Stevens
- Division of Data Science and Learning, Argonne National Laboratory, Lemont, IL, United States
- Department of Computer Science, The University of Chicago, Chicago, IL, United States
| |
Collapse
|
19
|
Prediction of pathologic complete response to neoadjuvant systemic therapy in triple negative breast cancer using deep learning on multiparametric MRI. Sci Rep 2023; 13:1171. [PMID: 36670144 PMCID: PMC9859781 DOI: 10.1038/s41598-023-27518-2] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2022] [Accepted: 01/03/2023] [Indexed: 01/22/2023] Open
Abstract
Triple-negative breast cancer (TNBC) is an aggressive subtype of breast cancer. Neoadjuvant systemic therapy (NAST) followed by surgery are currently standard of care for TNBC with 50-60% of patients achieving pathologic complete response (pCR). We investigated ability of deep learning (DL) on dynamic contrast enhanced (DCE) MRI and diffusion weighted imaging acquired early during NAST to predict TNBC patients' pCR status in the breast. During the development phase using the images of 130 TNBC patients, the DL model achieved areas under the receiver operating characteristic curves (AUCs) of 0.97 ± 0.04 and 0.82 ± 0.10 for the training and the validation, respectively. The model achieved an AUC of 0.86 ± 0.03 when evaluated in the independent testing group of 32 patients. In an additional prospective blinded testing group of 48 patients, the model achieved an AUC of 0.83 ± 0.02. These results demonstrated that DL based on multiparametric MRI can potentially differentiate TNBC patients with pCR or non-pCR in the breast early during NAST.
Collapse
|
20
|
Wang S, Wang S, Wang Z. A survey on multi-omics-based cancer diagnosis using machine learning with the potential application in gastrointestinal cancer. Front Med (Lausanne) 2023; 9:1109365. [PMID: 36703893 PMCID: PMC9871466 DOI: 10.3389/fmed.2022.1109365] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2022] [Accepted: 12/28/2022] [Indexed: 01/12/2023] Open
Abstract
Gastrointestinal cancer is becoming increasingly common, which leads to over 3 million deaths every year. No typical symptoms appear in the early stage of gastrointestinal cancer, posing a significant challenge in the diagnosis and treatment of patients with gastrointestinal cancer. Many patients are in the middle and late stages of gastrointestinal cancer when they feel uncomfortable, unfortunately, most of them will die of gastrointestinal cancer. Recently, various artificial intelligence techniques like machine learning based on multi-omics have been presented for cancer diagnosis and treatment in the era of precision medicine. This paper provides a survey on multi-omics-based cancer diagnosis using machine learning with potential application in gastrointestinal cancer. Particularly, we make a comprehensive summary and analysis from the perspective of multi-omics datasets, task types, and multi-omics-based integration methods. Furthermore, this paper points out the remaining challenges of multi-omics-based cancer diagnosis using machine learning and discusses future topics.
Collapse
Affiliation(s)
- Suixue Wang
- School of Information and Communication Engineering, Hainan University, Haikou, China
| | - Shuling Wang
- Department of Neurology, Affiliated Haikou Hospital of Xiangya School of Medicine, Central South University, Haikou, China
| | - Zhengxia Wang
- School of Computer Science and Technology, Hainan University, Haikou, China
| |
Collapse
|
21
|
Singh DP, Kaushik B. A systematic literature review for the prediction of anticancer drug response using various machine-learning and deep-learning techniques. Chem Biol Drug Des 2023; 101:175-194. [PMID: 36303299 DOI: 10.1111/cbdd.14164] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2022] [Revised: 10/13/2022] [Accepted: 10/24/2022] [Indexed: 12/24/2022]
Abstract
Computational methods have gained prominence in healthcare research. The accessibility of healthcare data has greatly incited academicians and researchers to develop executions that help in prognosis of cancer drug response. Among various computational methods, machine-learning (ML) and deep-learning (DL) methods provide the most consistent and effectual approaches to handle the serious aftermaths of the deadly disease and drug administered to the patients. Hence, this systematic literature review has reviewed researches that have investigated drug discovery and prognosis of anticancer drug response using ML and DL algorithms. Fot this purpose, PRISMA guidelines have been followed to choose research papers from Google Scholar, PubMed, and Sciencedirect websites. A total count of 105 papers that align with the context of this review were chosen. Further, the review also presents accuracy of the existing ML and DL methods in the prediction of anticancer drug response. It has been found from the review that, amidst the availability of various studies, there are certain challenges associated with each method. Thus, future researchers can consider these limitations and challenges to develop a prominent anticancer drug response prediction method, and it would be greatly beneficial to the medical professionals in administering non-invasive treatment to the patients.
Collapse
Affiliation(s)
- Davinder Paul Singh
- School of Computer Science and Engineering, Shri Mata Vaishno Devi University, Katra, Jammu and Kashmir, India
| | - Baijnath Kaushik
- School of Computer Science and Engineering, Shri Mata Vaishno Devi University, Katra, Jammu and Kashmir, India
| |
Collapse
|
22
|
Lee M, Kim PJ, Joe H, Kim HG. Gene-centric multi-omics integration with convolutional encoders for cancer drug response prediction. Comput Biol Med 2022; 151:106192. [PMID: 36327883 DOI: 10.1016/j.compbiomed.2022.106192] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2022] [Revised: 08/26/2022] [Accepted: 10/08/2022] [Indexed: 12/27/2022]
Abstract
MOTIVATION Tumor heterogeneity, including genetic and transcriptomic characteristics, can reduce the efficacy of anticancer pharmacological therapy, resulting in clinical variability in patient response to therapeutic medications. Multi-omics integration can allow in silico models to provide an additional perspective on a biological system. METHODS In this study, we propose a gene-centric multi-channel (GCMC) architecture to integrate multi-omics for predicting cancer drug response. GCMC transformed multi-omics profiles into a three-dimensional tensor with an additional dimension for omics types. GCMC's convolutional encoders captures multi-omics profiles for each gene and yields gene-centric features to predict drug responses. RESULTS We evaluated GCMC on various datasets, including The Cancer Genome Atlas (TCGA) patients, patient-derived xenografts (PDX) mice models, and the Genomics of Drug Sensitivity in Cancer (GDSC) cell line datasets. GCMC achieved better performance than baseline models, including single-omics models, in more than 75% of 265 drugs from GDSC cell line datasets. Furthermore, as for the clinical applicability of GCMC, it achieved the best performance on TCGA and PDX datasets in terms of both AUPR and AUC. We also analyzed models' capability of integrating multi-omics profiles by measuring the contribution ratio of omics types. GCMC can incorporate multi-omics profiles in various manners to enhance performance for each drug type. These results suggested that GCMC can improve performance and feature extraction capability by integrating multi-omics profiles in a gene-centric manner.
Collapse
Affiliation(s)
- Munhwan Lee
- Biomedical Knowledge Engineering Lab., Seoul National University, 1 Gwanak-ro, Seoul, 08826, Republic of Korea.
| | - Pil-Jong Kim
- Biomedical Knowledge Engineering Lab., Seoul National University, 1 Gwanak-ro, Seoul, 08826, Republic of Korea.
| | - Hyunwhan Joe
- Biomedical Knowledge Engineering Lab., Seoul National University, 1 Gwanak-ro, Seoul, 08826, Republic of Korea.
| | - Hong-Gee Kim
- Biomedical Knowledge Engineering Lab., Seoul National University, 1 Gwanak-ro, Seoul, 08826, Republic of Korea.
| |
Collapse
|
23
|
Samal BR, Loers JU, Vermeirssen V, De Preter K. Opportunities and challenges in interpretable deep learning for drug sensitivity prediction of cancer cells. FRONTIERS IN BIOINFORMATICS 2022; 2:1036963. [PMID: 36466148 PMCID: PMC9714662 DOI: 10.3389/fbinf.2022.1036963] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2022] [Accepted: 11/03/2022] [Indexed: 01/02/2024] Open
Abstract
In precision oncology, therapy stratification is done based on the patients' tumor molecular profile. Modeling and prediction of the drug response for a given tumor molecular type will further improve therapeutic decision-making for cancer patients. Indeed, deep learning methods hold great potential for drug sensitivity prediction, but a major problem is that these models are black box algorithms and do not clarify the mechanisms of action. This puts a limitation on their clinical implementation. To address this concern, many recent studies attempt to overcome these issues by developing interpretable deep learning methods that facilitate the understanding of the logic behind the drug response prediction. In this review, we discuss strengths and limitations of recent approaches, and suggest future directions that could guide further improvement of interpretable deep learning in drug sensitivity prediction in cancer research.
Collapse
Affiliation(s)
- Bikash Ranjan Samal
- Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
- Center for Medical Genetics Ghent (CMGG), Ghent University, Ghent, Belgium
- Cancer Research Institute Ghent (CRIG), Ghent, Belgium
| | - Jens Uwe Loers
- Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
- Center for Medical Genetics Ghent (CMGG), Ghent University, Ghent, Belgium
- Cancer Research Institute Ghent (CRIG), Ghent, Belgium
- Department of Biomedical Molecular Biology, Ghent University, Ghent, Belgium
| | - Vanessa Vermeirssen
- Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
- Center for Medical Genetics Ghent (CMGG), Ghent University, Ghent, Belgium
- Cancer Research Institute Ghent (CRIG), Ghent, Belgium
- Department of Biomedical Molecular Biology, Ghent University, Ghent, Belgium
| | - Katleen De Preter
- Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
- Center for Medical Genetics Ghent (CMGG), Ghent University, Ghent, Belgium
- Cancer Research Institute Ghent (CRIG), Ghent, Belgium
| |
Collapse
|
24
|
Hacking SM, Yakirevich E, Wang Y. From Immunohistochemistry to New Digital Ecosystems: A State-of-the-Art Biomarker Review for Precision Breast Cancer Medicine. Cancers (Basel) 2022; 14:cancers14143469. [PMID: 35884530 PMCID: PMC9315712 DOI: 10.3390/cancers14143469] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2022] [Revised: 07/13/2022] [Accepted: 07/15/2022] [Indexed: 02/04/2023] Open
Abstract
Simple Summary In this state-of-the-art breast biomarker review, we have tried to imagine and illustrate future, emerging digital breast cancer ecosystems which allow for greater incorporation of traditional immunohistochemical and molecular biomarkers, WSI, and radiomic features. Abstract Breast cancers represent complex ecosystem-like networks of malignant cells and their associated microenvironment. Estrogen receptor (ER), progesterone receptor (PR), and human epidermal growth factor receptor 2 (HER2) are biomarkers ubiquitous to clinical practice in evaluating prognosis and predicting response to therapy. Recent feats in breast cancer have led to a new digital era, and advanced clinical trials have resulted in a growing number of personalized therapies with corresponding biomarkers. In this state-of-the-art review, we included the latest 10-year updated recommendations for ER, PR, and HER2, along with the most salient information on tumor-infiltrating lymphocytes (TILs), Ki-67, PD-L1, and several prognostic/predictive biomarkers at genomic, transcriptomic, and proteomic levels recently developed for selection and optimization of breast cancer treatment. Looking forward, the multi-omic landscape of the tumor ecosystem could be integrated with computational findings from whole slide images and radiomics in predictive machine learning (ML) models. These are new digital ecosystems on the road to precision breast cancer medicine.
Collapse
Affiliation(s)
| | | | - Yihong Wang
- Correspondence: ; Tel.: +1-401-444-9897; Fax: +1-401-444-4377
| |
Collapse
|
25
|
The Potential and Emerging Role of Quantitative Imaging Biomarkers for Cancer Characterization. Cancers (Basel) 2022; 14:cancers14143349. [PMID: 35884409 PMCID: PMC9321521 DOI: 10.3390/cancers14143349] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2022] [Revised: 07/07/2022] [Accepted: 07/08/2022] [Indexed: 12/10/2022] Open
Abstract
Simple Summary Modern, personalized therapy approaches are increasingly changing advanced cancer into a chronic disease. Compared to imaging, novel omics methodologies in molecular biology have already achieved an individual characterization of cancerous lesions. With quantitative imaging biomarkers, analyzed by radiomics or deep learning, an imaging-based assessment of tumoral biology can be brought into clinical practice. Combining these with other non-invasive methods, e.g., liquid profiling, could allow for more individual decision making regarding therapies and applications. Abstract Similar to the transformation towards personalized oncology treatment, emerging techniques for evaluating oncologic imaging are fostering a transition from traditional response assessment towards more comprehensive cancer characterization via imaging. This development can be seen as key to the achievement of truly personalized and optimized cancer diagnosis and treatment. This review gives a methodological introduction for clinicians interested in the potential of quantitative imaging biomarkers, treating of radiomics models, texture visualization, convolutional neural networks and automated segmentation, in particular. Based on an introduction to these methods, clinical evidence for the corresponding imaging biomarkers—(i) dignity and etiology assessment; (ii) tumoral heterogeneity; (iii) aggressiveness and response; and (iv) targeting for biopsy and therapy—is summarized. Further requirements for the clinical implementation of these imaging biomarkers and the synergistic potential of personalized molecular cancer diagnostics and liquid profiling are discussed.
Collapse
|
26
|
Combining Molecular, Imaging, and Clinical Data Analysis for Predicting Cancer Prognosis. Cancers (Basel) 2022; 14:cancers14133215. [PMID: 35804988 PMCID: PMC9265023 DOI: 10.3390/cancers14133215] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2022] [Revised: 06/24/2022] [Accepted: 06/27/2022] [Indexed: 02/04/2023] Open
Abstract
Simple Summary The rise of Big Data, the widespread use of Machine Learning, and the cheapening of omics techniques have allowed for the creation of more sophisticated and accurate models in biomedical research. This article presents the state-of-the-art predictive models of cancer prognosis that use multimodal data, considering clinical, molecular (omics and non-omics), and image data. The subject of study, the data modalities used, the data processing and modelling methods applied, the validation strategies involved, the integration strategies encompassed, and the evolution of prognostic predictive models are discussed. Finally, we discuss challenges and opportunities in this field of cancer research, with great potential impact on the clinical management of patients and, by extension, on the implementation of personalised and precision medicine. Abstract Cancer is one of the most detrimental diseases globally. Accordingly, the prognosis prediction of cancer patients has become a field of interest. In this review, we have gathered 43 state-of-the-art scientific papers published in the last 6 years that built cancer prognosis predictive models using multimodal data. We have defined the multimodality of data as four main types: clinical, anatomopathological, molecular, and medical imaging; and we have expanded on the information that each modality provides. The 43 studies were divided into three categories based on the modelling approach taken, and their characteristics were further discussed together with current issues and future trends. Research in this area has evolved from survival analysis through statistical modelling using mainly clinical and anatomopathological data to the prediction of cancer prognosis through a multi-faceted data-driven approach by the integration of complex, multimodal, and high-dimensional data containing multi-omics and medical imaging information and by applying Machine Learning and, more recently, Deep Learning techniques. This review concludes that cancer prognosis predictive multimodal models are capable of better stratifying patients, which can improve clinical management and contribute to the implementation of personalised medicine as well as provide new and valuable knowledge on cancer biology and its progression.
Collapse
|
27
|
Mo H, Breitling R, Francavilla C, Schwartz JM. Data integration and mechanistic modelling for breast cancer biology: Current state and future directions. CURRENT OPINION IN ENDOCRINE AND METABOLIC RESEARCH 2022; 24:None. [PMID: 36034741 PMCID: PMC9402443 DOI: 10.1016/j.coemr.2022.100350] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Breast cancer is one of the most common cancers threatening women worldwide. A limited number of available treatment options, frequent recurrence, and drug resistance exacerbate the prognosis of breast cancer patients. Thus, there is an urgent need for methods to investigate novel treatment options, while taking into account the vast molecular heterogeneity of breast cancer. Recent advances in molecular profiling technologies, including genomics, epigenomics, transcriptomics, proteomics and metabolomics data, enable approaching breast cancer biology at multiple levels of omics interaction networks. Systems biology approaches, including computational inference of ‘big data’ and mechanistic modelling of specific pathways, are emerging to identify potential novel combinations of breast cancer subtype signatures and more diverse targeted therapies.
Collapse
|
28
|
Zhu EY, Dupuy AJ. Machine learning approach informs biology of cancer drug response. BMC Bioinformatics 2022; 23:184. [PMID: 35581546 PMCID: PMC9112473 DOI: 10.1186/s12859-022-04720-z] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2021] [Accepted: 05/03/2022] [Indexed: 12/12/2022] Open
Abstract
Background The mechanism of action for most cancer drugs is not clear. Large-scale pharmacogenomic cancer cell line datasets offer a rich resource to obtain this knowledge. Here, we present an analysis strategy for revealing biological pathways that contribute to drug response using publicly available pharmacogenomic cancer cell line datasets. Methods We present a custom machine-learning based approach for identifying biological pathways involved in cancer drug response. We test the utility of our approach with a pan-cancer analysis of ML210, an inhibitor of GPX4, and a melanoma-focused analysis of inhibitors of BRAFV600. We apply our approach to reveal determinants of drug resistance to microtubule inhibitors. Results Our method implicated lipid metabolism and Rac1/cytoskeleton signaling in the context of ML210 and BRAF inhibitor response, respectively. These findings are consistent with current knowledge of how these drugs work. For microtubule inhibitors, our approach implicated Notch and Akt signaling as pathways that associated with response. Conclusions Our results demonstrate the utility of combining informed feature selection and machine learning algorithms in understanding cancer drug response. Supplementary Information The online version contains supplementary material available at 10.1186/s12859-022-04720-z.
Collapse
Affiliation(s)
- Eliot Y Zhu
- Department of Anatomy and Cell Biology, The University of Iowa, Iowa City, IA, USA.,Holden Comprehensive Cancer Center, The University of Iowa, Iowa City, IA, USA.,Cancer Biology Graduate Program, The University of Iowa, Iowa City, IA, USA.,The Medical Scientist Training Program, The University of Iowa, Iowa City, IA, USA
| | - Adam J Dupuy
- Department of Anatomy and Cell Biology, The University of Iowa, Iowa City, IA, USA. .,Holden Comprehensive Cancer Center, The University of Iowa, Iowa City, IA, USA.
| |
Collapse
|
29
|
Leveraging Deep Learning Techniques and Integrated Omics Data for Tailored Treatment of Breast Cancer. J Pers Med 2022; 12:jpm12050674. [PMID: 35629097 PMCID: PMC9147748 DOI: 10.3390/jpm12050674] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2022] [Revised: 03/06/2022] [Accepted: 04/14/2022] [Indexed: 12/12/2022] Open
Abstract
Multiomics data of cancer patients and cell lines, in synergy with deep learning techniques, have aided in unravelling predictive problems related to cancer research and treatment. However, there is still room for improvement in the performance of the existing models based on the aforementioned combination. In this work, we propose two models that complement the treatment of breast cancer patients. First, we discuss our deep learning-based model for breast cancer subtype classification. Second, we propose DCNN-DR, a deep convolute.ion neural network-drug response method for predicting the effectiveness of drugs on in vitro and in vivo breast cancer datasets. Finally, we applied DCNN-DR for predicting effective drugs for the basal-like breast cancer subtype and validated the results with the information available in the literature. The models proposed use late integration methods and have fairly better predictive performance compared to the existing methods. We use the Pearson correlation coefficient and accuracy as the performance measures for the regression and classification models, respectively.
Collapse
|
30
|
Effectiveness of Artificial Intelligence for Personalized Medicine in Neoplasms: A Systematic Review. BIOMED RESEARCH INTERNATIONAL 2022; 2022:7842566. [PMID: 35434134 PMCID: PMC9010213 DOI: 10.1155/2022/7842566] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/23/2021] [Revised: 01/29/2022] [Accepted: 03/06/2022] [Indexed: 02/07/2023]
Abstract
Purpose Artificial intelligence (AI) techniques are used in precision medicine to explore novel genotypes and phenotypes data. The main aims of precision medicine include early diagnosis, screening, and personalized treatment regime for a patient based on genetic-oriented features and characteristics. The main objective of this study was to review AI techniques and their effectiveness in neoplasm precision medicine. Materials and Methods A comprehensive search was performed in Medline (through PubMed), Scopus, ISI Web of Science, IEEE Xplore, Embase, and Cochrane databases from inception to December 29, 2021, in order to identify the studies that used AI methods for cancer precision medicine and evaluate outcomes of the models. Results Sixty-three studies were included in this systematic review. The main AI approaches in 17 papers (26.9%) were linear and nonlinear categories (random forest or decision trees), and in 21 citations, rule-based systems and deep learning models were used. Notably, 62% of the articles were done in the United States and China. R package was the most frequent software, and breast and lung cancer were the most selected neoplasms in the papers. Out of 63 papers, in 34 articles, genomic data like gene expression, somatic mutation data, phenotype data, and proteomics with drug-response which is functional data was used as input in AI methods; in 16 papers' (25.3%) drug response, functional data was utilized in personalization of treatment. The maximum values of the assessment indicators such as accuracy, sensitivity, specificity, precision, recall, and area under the curve (AUC) in included studies were 0.99, 1.00, 0.96, 0.98, 0.99, and 0.9929, respectively. Conclusion The findings showed that in many cases, the use of artificial intelligence methods had effective application in personalized medicine.
Collapse
|
31
|
Firoozbakht F, Yousefi B, Schwikowski B. An overview of machine learning methods for monotherapy drug response prediction. Brief Bioinform 2022; 23:bbab408. [PMID: 34619752 PMCID: PMC8769705 DOI: 10.1093/bib/bbab408] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2021] [Revised: 08/25/2021] [Accepted: 09/06/2021] [Indexed: 12/11/2022] Open
Abstract
For an increasing number of preclinical samples, both detailed molecular profiles and their responses to various drugs are becoming available. Efforts to understand, and predict, drug responses in a data-driven manner have led to a proliferation of machine learning (ML) methods, with the longer term ambition of predicting clinical drug responses. Here, we provide a uniquely wide and deep systematic review of the rapidly evolving literature on monotherapy drug response prediction, with a systematic characterization and classification that comprises more than 70 ML methods in 13 subclasses, their input and output data types, modes of evaluation, and code and software availability. ML experts are provided with a fundamental understanding of the biological problem, and how ML methods are configured for it. Biologists and biomedical researchers are introduced to the basic principles of applicable ML methods, and their application to the problem of drug response prediction. We also provide systematic overviews of commonly used data sources used for training and evaluation methods.
Collapse
Affiliation(s)
- Farzaneh Firoozbakht
- Systems Biology Group, Department of Computational Biology, Institut Pasteur, Paris, France
| | - Behnam Yousefi
- Systems Biology Group, Department of Computational Biology, Institut Pasteur, Paris, France
- Sorbonne Université, École Doctorale Complexite du Vivant, Paris, France
| | - Benno Schwikowski
- Systems Biology Group, Department of Computational Biology, Institut Pasteur, Paris, France
| |
Collapse
|
32
|
Subramanian A, Zakeri P, Mousa M, Alnaqbi H, Alshamsi FY, Bettoni L, Damiani E, Alsafar H, Saeys Y, Carmeliet P. Angiogenesis goes computational – The future way forward to discover new angiogenic targets? Comput Struct Biotechnol J 2022; 20:5235-5255. [PMID: 36187917 PMCID: PMC9508490 DOI: 10.1016/j.csbj.2022.09.019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2022] [Revised: 09/09/2022] [Accepted: 09/09/2022] [Indexed: 11/26/2022] Open
Abstract
Multi-omics technologies are being increasingly utilized in angiogenesis research. Yet, computational methods have not been widely used for angiogenic target discovery and prioritization in this field, partly because (wet-lab) vascular biologists are insufficiently familiar with computational biology tools and the opportunities they may offer. With this review, written for vascular biologists who lack expertise in computational methods, we aspire to break boundaries between both fields and to illustrate the potential of these tools for future angiogenic target discovery. We provide a comprehensive survey of currently available computational approaches that may be useful in prioritizing candidate genes, predicting associated mechanisms, and identifying their specificity to endothelial cell subtypes. We specifically highlight tools that use flexible, machine learning frameworks for large-scale data integration and gene prioritization. For each purpose-oriented category of tools, we describe underlying conceptual principles, highlight interesting applications and discuss limitations. Finally, we will discuss challenges and recommend some guidelines which can help to optimize the process of accurate target discovery.
Collapse
|
33
|
Jung HD, Sung YJ, Kim HU. Omics and Computational Modeling Approaches for the Effective Treatment of Drug-Resistant Cancer Cells. Front Genet 2021; 12:742902. [PMID: 34691155 PMCID: PMC8527086 DOI: 10.3389/fgene.2021.742902] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2021] [Accepted: 09/20/2021] [Indexed: 02/05/2023] Open
Abstract
Chemotherapy is a mainstream cancer treatment, but has a constant challenge of drug resistance, which consequently leads to poor prognosis in cancer treatment. For better understanding and effective treatment of drug-resistant cancer cells, omics approaches have been widely conducted in various forms. A notable use of omics data beyond routine data mining is to use them for computational modeling that allows generating useful predictions, such as drug responses and prognostic biomarkers. In particular, an increasing volume of omics data has facilitated the development of machine learning models. In this mini review, we highlight recent studies on the use of multi-omics data for studying drug-resistant cancer cells. We put a particular focus on studies that use computational models to characterize drug-resistant cancer cells, and to predict biomarkers and/or drug responses. Computational models covered in this mini review include network-based models, machine learning models and genome-scale metabolic models. We also provide perspectives on future research opportunities for combating drug-resistant cancer cells.
Collapse
Affiliation(s)
- Hae Deok Jung
- Department of Chemical and Biomolecular Engineering (BK21 four), Korea Advanced Institute of Science and Technology (KAIST), Daejeon, South Korea
| | - Yoo Jin Sung
- Department of Chemical and Biomolecular Engineering (BK21 four), Korea Advanced Institute of Science and Technology (KAIST), Daejeon, South Korea
| | - Hyun Uk Kim
- Department of Chemical and Biomolecular Engineering (BK21 four), Korea Advanced Institute of Science and Technology (KAIST), Daejeon, South Korea.,KAIST Institute for Artificial Intelligence, KAIST, Daejeon, South Korea.,BioProcess Engineering Research Center and BioInformatics Research Center KAIST, Daejeon, South Korea
| |
Collapse
|