1
|
Stephan S, Galland S, Labbani Narsis O, Shoji K, Vachenc S, Gerart S, Nicolle C. Agent-based approaches for biological modeling in oncology: A literature review. Artif Intell Med 2024; 152:102884. [PMID: 38703466 DOI: 10.1016/j.artmed.2024.102884] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2023] [Revised: 04/25/2024] [Accepted: 04/25/2024] [Indexed: 05/06/2024]
Abstract
CONTEXT Computational modeling involves the use of computer simulations and models to study and understand real-world phenomena. Its application is particularly relevant in the study of potential interactions between biological elements. It is a promising approach to understand complex biological processes and predict their behavior under various conditions. METHODOLOGY This paper is a review of the recent literature on computational modeling of biological systems. Our study focuses on the field of oncology and the use of artificial intelligence (AI) and, in particular, agent-based modeling (ABM), between 2010 and May 2023. RESULTS Most of the articles studied focus on improving the diagnosis and understanding the behaviors of biological entities, with metaheuristic algorithms being the models most used. Several challenges are highlighted regarding increasing and structuring knowledge about biological systems, developing holistic models that capture multiple scales and levels of organization, reproducing emergent behaviors of biological systems, validating models with experimental data, improving computational performance of models and algorithms, and ensuring privacy and personal data protection are discussed.
Collapse
Affiliation(s)
- Simon Stephan
- UTBM, CIAD UMR 7533, Belfort, F-90010, France; Université de Bourgogne, CIAD UMR 7533, Dijon, F-21000, France.
| | | | | | - Kenji Shoji
- Oncodesign Precision Medicine (OPM), 18 Rue Jean Mazen, Dijon, F-21000, France
| | - Sébastien Vachenc
- Oncodesign Precision Medicine (OPM), 18 Rue Jean Mazen, Dijon, F-21000, France
| | - Stéphane Gerart
- Oncodesign Precision Medicine (OPM), 18 Rue Jean Mazen, Dijon, F-21000, France
| | | |
Collapse
|
2
|
Dermawan JK, Rubin BP. The role of molecular profiling in the diagnosis and management of metastatic undifferentiated cancer of unknown primary ✰: Molecular profiling of metastatic cancer of unknown primary. Semin Diagn Pathol 2020; 38:193-198. [PMID: 33309276 DOI: 10.1053/j.semdp.2020.12.001] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/31/2020] [Revised: 11/24/2020] [Accepted: 12/02/2020] [Indexed: 12/17/2022]
Abstract
Cancer of unknown primary (CUP) refers to metastatic tumors for which the primary tumor of origin cannot be determined at the time of diagnosis, despite extensive clinicopathologic investigations. Molecular profiling is increasingly able to predict a probable primary tumor type for CUP when clinicopathologic workup is inconclusive. Numerous studies have explored the use of various molecular profiling techniques for identification of site/tissue of origin of CUP. These techniques include gene expression profiling utilizing microarray, reverse transcriptase polymerase chain reaction, RNA-sequencing, somatic gene mutation profiling with next-generation DNA sequencing, and epigenomics including DNA methylation profiling. Despite the generally poor prognosis of CUP, a minority of patients can expect to benefit from targeted therapy despite being agnostic to the tissue of origin. Studies have explored the use of various molecular profiling techniques to predict prognostic and therapeutic biomarkers, with the goal of improving outcome for patients with CUP. However, discordant results between non-randomized and randomized clinical trials in evaluating tumor-type specific therapies raise uncertainties of the benefits of molecularly-predicted tissue of origin-based treatment in routine clinical use. Nevertheless, the current overall trend is in favor of using molecular tools to refine the diagnosis and clinical management of patients with CUP. More large-cohort, randomized prospective studies are needed to assess and validate the utility and feasibility of molecular profiling to uncover potentially targetable genetic alterations. These efforts will also yield further biological insights into the biology and pathogenesis of CUP (Graphical Abstract).
Collapse
Affiliation(s)
- Josephine K Dermawan
- Robert J. Tomsich Pathology and Laboratory Medicine Institute, Cleveland Clinic, Cleveland, OH 44195, United States
| | - Brian P Rubin
- Robert J. Tomsich Pathology and Laboratory Medicine Institute, Cleveland Clinic, Cleveland, OH 44195, United States.
| |
Collapse
|
3
|
Zhang Y, Feng T, Wang S, Dong R, Yang J, Su J, Wang B. A Novel XGBoost Method to Identify Cancer Tissue-of-Origin Based on Copy Number Variations. Front Genet 2020; 11:585029. [PMID: 33329723 PMCID: PMC7716814 DOI: 10.3389/fgene.2020.585029] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2020] [Accepted: 10/05/2020] [Indexed: 01/18/2023] Open
Abstract
The discovery of cancer of unknown primary (CUP) is of great significance in designing more effective treatments and improving the diagnostic efficiency in cancer patients. In the study, we develop an appropriate machine learning model for tracing the tissue of origin of CUP with high accuracy after feature engineering and model evaluation. Based on a copy number variation data consisting of 4,566 training cases and 1,262 independent validation cases, an XGBoost classifier is applied to 10 types of cancer. Extremely randomized tree (Extra tree) is used for dimension reduction so that fewer variables replace the original high-dimensional variables. Features with top 300 weights are selected and principal component analysis is applied to eliminate noise. We find that XGBoost classifier achieves the highest overall accuracy of 0.8913 in the 10-fold cross-validation for training samples and 0.7421 on independent validation datasets for predicting tumor tissue of origin. Furthermore, by contrasting various performance indices, such as precision and recall rate, the experimental results show that XGBoost classifier significantly improves the classification performance of various tumors with less prediction error, as compared to other classifiers, such as K-nearest neighbors (KNN), Bayes, support vector machine (SVM), and Adaboost. Our method can infer tissue of origin for the 10 cancer types with acceptable accuracy in both cross-validation and independent validation data. It may be used as an auxiliary diagnostic method to determine the actual clinicopathological status of specific cancer.
Collapse
Affiliation(s)
- Yulin Zhang
- College of Mathematics and Systems Science, Shandong University of Science and Technology, Qingdao, China
| | - Tong Feng
- College of Mathematics and Systems Science, Shandong University of Science and Technology, Qingdao, China
| | - Shudong Wang
- College of Computer and Communication Engineering, China University of Petroleum (East China), Qingdao, China
| | - Ruyi Dong
- Geneis (Beijing) Co., Ltd., Beijing, China
| | | | - Jionglong Su
- School of AI and Advanced Computing, XJTLU Entrepreneur College (Taicang), Xi’an Jiaotong-Liverpool University, Suzhou, China
| | - Bo Wang
- Geneis (Beijing) Co., Ltd., Beijing, China
| |
Collapse
|
4
|
He B, Zhang Y, Zhou Z, Wang B, Liang Y, Lang J, Lin H, Bing P, Yu L, Sun D, Luo H, Yang J, Tian G. A Neural Network Framework for Predicting the Tissue-of-Origin of 15 Common Cancer Types Based on RNA-Seq Data. Front Bioeng Biotechnol 2020; 8:737. [PMID: 32850691 PMCID: PMC7419649 DOI: 10.3389/fbioe.2020.00737] [Citation(s) in RCA: 36] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2020] [Accepted: 06/10/2020] [Indexed: 12/19/2022] Open
Abstract
Sequencing-based identification of tumor tissue-of-origin (TOO) is critical for patients with cancer of unknown primary lesions. Even if the TOO of a tumor can be diagnosed by clinicopathological observation, reevaluations by computational methods can help avoid misdiagnosis. In this study, we developed a neural network (NN) framework using the expression of a 150-gene panel to infer the tumor TOO for 15 common solid tumor cancer types, including lung, breast, liver, colorectal, gastroesophageal, ovarian, cervical, endometrial, pancreatic, bladder, head and neck, thyroid, prostate, kidney, and brain cancers. To begin with, we downloaded the RNA-Seq data of 7,460 primary tumor samples across the above mentioned 15 cancer types, with each type of cancer having between 142 and 1,052 samples, from the cancer genome atlas. Then, we performed feature selection by the Pearson correlation method and performed a 150-gene panel analysis; the genes were significantly enriched in the GO:2001242 Regulation of intrinsic apoptotic signaling pathway and the GO:0009755 Hormone-mediated signaling pathway and other similar functions. Next, we developed a novel NN model using the 150 genes to predict tumor TOO for the 15 cancer types. The average prediction sensitivity and precision of the framework are 93.36 and 94.07%, respectively, for the 7,460 tumor samples based on the 10-fold cross-validation; however, the prediction sensitivity and precision for a few specific cancers, like prostate cancer, reached 100%. We also tested the trained model on a 20-sample independent dataset with metastatic tumor, and achieved an 80% accuracy. In summary, we present here a highly accurate method to infer tumor TOO, which has potential clinical implementation.
Collapse
Affiliation(s)
- Binsheng He
- Academician Workstation, Changsha Medical University, Changsha, China
| | | | - Zhen Zhou
- Department of Radiology, Beijing Chest Hospital, Capital Medical University, Beijing Tuberculosis and Thoracic Tumor Research Institute, Beijing, China
| | - Bo Wang
- Geneis (Beijing) Co., Ltd., Beijing, China
| | | | | | - Huixin Lin
- Geneis (Beijing) Co., Ltd., Beijing, China
| | - Pingping Bing
- Academician Workstation, Changsha Medical University, Changsha, China
| | - Lan Yu
- Inner Mongolia People's Hospital, Huhhot, China
| | - Dejun Sun
- Inner Mongolia People's Hospital, Huhhot, China
| | - Huaiqing Luo
- Academician Workstation, Changsha Medical University, Changsha, China
| | - Jialiang Yang
- Academician Workstation, Changsha Medical University, Changsha, China.,Geneis (Beijing) Co., Ltd., Beijing, China
| | - Geng Tian
- Geneis (Beijing) Co., Ltd., Beijing, China
| |
Collapse
|
5
|
Bhowmick SS, Bhattacharjee D, Rato L. In silico markers: an evolutionary and statistical approach to select informative genes of human breast cancer subtypes. Genes Genomics 2019; 41:1371-1382. [PMID: 31004329 DOI: 10.1007/s13258-019-00816-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2018] [Accepted: 04/02/2019] [Indexed: 10/27/2022]
Abstract
BACKGROUND Recent advancement in bioinformatics offers the ability to identify informative genes from high dimensional gene expression data. Selection of informative genes from these large datasets has emerged as an issue of major concern among researchers. OBJECTIVE Gene functionality and regulatory mechanisms can be understood through the analysis of these gene expression data. Here, we present a computational method to identify informative genes for breast cancer subtypes such as Basal, human epidermal growth factor receptor 2 (Her2), luminal A (LumA), and luminal B (LumB). METHODS The proposed In Silico Markers method is a wrapper feature selection method based on Least Absolute Shrinkage and Selection Operator (LASSO), Covariance Matrix Adaptation Evolution Strategy (CMA-ES) and Support Vector Machine (SVM) as a classifier. Moreover, the composite measure consisting of relevance, redundancy, and rank score of frequently appeared genes are used to select informative genes. RESULTS The informative genes are validated by statistical and biologically relevant criteria. For a comparative evaluation of the proposed approach, biological similarity score designed on semantic similarity measure of GO terms are investigated. Further, the proposed technique is evaluated with 7 existing gene selection techniques using two-class annotated breast cancer subtype datasets. CONCLUSION The utilization of this method can bring about the discovery of informative genes. Furthermore, under multiple criteria decision-making set-up, informative genes selected by the In Silico Markers are found to be admirable than the compared methods selected genes.
Collapse
Affiliation(s)
- Shib Sankar Bhowmick
- Department of Electronics and Communication Engineering, Heritage Institute of Technology, Kolkata, 700107, India.
| | - Debotosh Bhattacharjee
- Department of Computer Science and Engineering, Jadavpur University, Kolkata, 700032, India
| | - Luis Rato
- Department of Informatics, University of Evora, 7004-516, Evora, Portugal
| |
Collapse
|