1
|
Jurisica I. Explainable biology for improved therapies in precision medicine: AI is not enough. Best Pract Res Clin Rheumatol 2024; 38:102006. [PMID: 39332994 DOI: 10.1016/j.berh.2024.102006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2024] [Revised: 09/18/2024] [Accepted: 09/18/2024] [Indexed: 09/29/2024]
Abstract
Technological advances and high-throughput bio-chemical assays are rapidly changing ways how we formulate and test biological hypotheses, and how we treat patients. Most complex diseases arise on a background of genetics, lifestyle and environment factors, and manifest themselves as a spectrum of symptoms. To fathom intricate biological processes and their changes from healthy to disease states, we need to systematically integrate and analyze multi-omics datasets, ontologies, and diverse annotations. Without proper management of such complex biological and clinical data, artificial intelligence (AI) algorithms alone cannot be effectively trained, validated, and successfully applied to provide trustworthy and patient-centric diagnosis, prognosis and treatment. Precision medicine requires to use multi-omics approaches effectively, and offers many opportunities for using AI, "big data" analytics, and integrative computational biology workflows. Advances in optical and biochemical assay technologies including sequencing, mass spectrometry and imaging modalities have transformed research by empowering us to simultaneously view all genes expressed, identify proteome-wide changes, and assess interacting partners of each individual protein within a dynamically changing biological system, at an individual cell level. While such views are already having an impact on our understanding of healthy and disease conditions, it remains challenging to extract useful information comprehensively and systematically from individual studies, ensure that signal is separated from noise, develop models, and provide hypotheses for further research. Data remain incomplete and are often poorly connected using fragmented biological networks. In addition, statistical and machine learning models are developed at a cohort level and often not validated at the individual patient level. Combining integrative computational biology and AI has the potential to improve understanding and treatment of diseases by identifying biomarkers and building explainable models characterizing individual patients. From systematic data analysis to more specific diagnostic, prognostic and predictive biomarkers, drug mechanism of action, and patient selection, such analyses influence multiple steps from prevention to disease characterization, and from prognosis to drug discovery. Data mining, machine learning, graph theory and advanced visualization may help identify diagnostic, prognostic and predictive biomarkers, and create causal models of disease. Intertwining computational prediction and modeling with biological experiments leads to faster, more biologically and clinically relevant discoveries. However, computational analysis results and models are going to be only as accurate and useful as correct and comprehensive are the networks, ontologies and datasets used to build them. High quality, curated data portals provide the necessary foundation for translational research. They help to identify better biomarkers, new drugs, precision treatments, and should lead to improved patient outcomes and their quality of life. Intertwining computational prediction and modeling with biological experiments, efficiently and effectively leads to more useful findings faster.
Collapse
Affiliation(s)
- I Jurisica
- Division of Orthopaedics, Osteoarthritis Research Program, Schroeder Arthritis Institute, and Data Science Discovery Centre for Chronic Diseases, Krembil Research Institute, University Health Network, Toronto, ON, M5T 0S8, Canada; Departments of Medical Biophysics and Computer Science, and Faculty of Dentistry, University of Toronto, Toronto, ON, Canada; Institute of Neuroimmunology, Slovak Academy of Sciences, Bratislava, Slovakia.
| |
Collapse
|
2
|
Global microRNA expression profile in laryngeal carcinoma unveils new prognostic biomarkers and novel insights into field cancerization. Sci Rep 2022; 12:17051. [PMID: 36224266 PMCID: PMC9556831 DOI: 10.1038/s41598-022-20338-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2022] [Accepted: 09/12/2022] [Indexed: 12/30/2022] Open
Abstract
Laryngeal carcinoma is still a worldwide burden that has shown no significant improvement during the last few decades regarding definitive treatment strategies. The lack of suitable biomarkers for personalized treatment protocols and delineating field cancerization prevents further progress in clinical outcomes. In the light of this perspective, MicroRNAs could be promising biomarkers both in terms of diagnostic and prognostic value. The aim of this prospective study is to find strong prognostic microRNA biomarkers for advanced laryngeal carcinoma and molecular signatures of field cancerization. Sixty patients were enrolled and four samples were collected from each patient: tumor surface and depth, peritumor normal mucosa, and control distant laryngeal mucosa. Initially, a global microRNA profile was conducted in twelve patients from the whole cohort and subsequently, we validated a selected group of 12 microRNAs with RT-qPCR. The follow-up period was 24 months (SD ± 13 months). Microarray expression profile revealed 59 dysregulated microRNAs. The validated expression levels of miR-93-5p (χ2(2) = 4.68, log-rank p = 0.03), miR-144-3p (χ2(2) = 4.53, log-rank p = 0.03) and miR-210-3p (χ2(2) = 4.53, log-rank p = 0.03) in tumor samples exhibited strong association with recurrence-free survival as higher expression levels of these genes predict worse outcome. Tumor suppressor genes miR-144-3p (mean rank 1.58 vs 2.14 vs 2.29, p = 0.000) and miR-145-5p (mean rank 1.57 vs 2.15 vs 2.28, p = 0.000) were significantly dysregulated in peritumor mucosa with a pattern of expression consistent with paired tumor samples thus revealing a signature of field cancerization in laryngeal carcinoma. Additionally, miR-1260b, miR-21-3p, miR-31-3p and miR-31-5p were strongly associated with tumor grade. Our study reports the first global microRNA profile specifically in advanced laryngeal carcinoma that includes survival analysis and investigates the molecular signature of field cancerization. We report two strong biomarkers of field cancerization and three predictors for recurrence in advance stage laryngeal cancer.
Collapse
|
3
|
Tokar T, Pastrello C, Abovsky M, Rahmati S, Jurisica I. miRAnno-network-based functional microRNA annotation. Bioinformatics 2022; 38:592-593. [PMID: 34297061 DOI: 10.1093/bioinformatics/btab527] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2020] [Revised: 06/16/2021] [Accepted: 07/16/2021] [Indexed: 02/03/2023] Open
Abstract
MOTIVATION Functional annotation is a common part of microRNA (miRNA)-related research, typically carried as pathway enrichment analysis of the selected miRNA targets. Here, we propose miRAnno, a fast and easy-to-use web application for miRNA annotation. RESULTS miRAnno uses comprehensive molecular interaction network and random walks with restart to measure the association between miRNAs and individual pathways. Independent validation shows that miRAnno achieves higher signal-to-noise ratio compared to the standard enrichment analysis. AVAILABILITY AND IMPLEMENTATION miRAnno is freely available at https://ophid.utoronto.ca/miRAnno/. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Tomas Tokar
- Osteoarthritis Research Program Division of Orthopedic Surgery Schroeder Arthritis Institute, and Data Science Discovery Centre for Chronic Diseases Krembil Research Institute, University Health Network, Toronto, Ontario, Canada
| | - Chiara Pastrello
- Osteoarthritis Research Program Division of Orthopedic Surgery Schroeder Arthritis Institute, and Data Science Discovery Centre for Chronic Diseases Krembil Research Institute, University Health Network, Toronto, Ontario, Canada
| | - Mark Abovsky
- Osteoarthritis Research Program Division of Orthopedic Surgery Schroeder Arthritis Institute, and Data Science Discovery Centre for Chronic Diseases Krembil Research Institute, University Health Network, Toronto, Ontario, Canada
| | - Sara Rahmati
- Osteoarthritis Research Program Division of Orthopedic Surgery Schroeder Arthritis Institute, and Data Science Discovery Centre for Chronic Diseases Krembil Research Institute, University Health Network, Toronto, Ontario, Canada
| | - Igor Jurisica
- Osteoarthritis Research Program Division of Orthopedic Surgery Schroeder Arthritis Institute, and Data Science Discovery Centre for Chronic Diseases Krembil Research Institute, University Health Network, Toronto, Ontario, Canada.,Departments of Medical Biophysics and Computer Science, University of Toronto, Toronto, Ontario, Canada.,Institute of Neuroimmunology, Slovak Academy of Sciences, Bratislava, Slovakia
| |
Collapse
|
4
|
Uysal D, Kowalewski KF, Kriegmair MC, Wirtz R, Popovic ZV, Erben P. A comprehensive molecular characterization of the 8q22.2 region reveals the prognostic relevance of OSR2 mRNA in muscle invasive bladder cancer. PLoS One 2021; 16:e0248342. [PMID: 33711044 PMCID: PMC7954304 DOI: 10.1371/journal.pone.0248342] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2020] [Accepted: 02/25/2021] [Indexed: 12/27/2022] Open
Abstract
Technological advances in molecular profiling have enabled the comprehensive identification of common regions of gene amplification on chromosomes (amplicons) in muscle invasive bladder cancer (MIBC). One such region is 8q22.2, which is largely unexplored in MIBC and could harbor genes with potential for outcome prediction or targeted therapy. To investigate the prognostic role of 8q22.2 and to compare different amplicon definitions, an in-silico analysis of 357 patients from The Cancer Genome Atlas, who underwent radical cystectomy for MIBC, was performed. Amplicons were generated using the GISTIC2.0 algorithm for copy number alterations (DNA_Amplicon) and z-score normalization for mRNA gene overexpression (RNA_Amplicon). Kaplan-Meier survival analysis, univariable, and multivariable Cox proportional hazard ratios were used to relate amplicons, genes, and clinical parameters to overall (OS) and disease-free survival (DFS). Analyses of the biological functions of 8q22.2 genes and genomic events in MIBC were performed to identify potential targets. Genes with prognostic significance from the in silico analysis were validated using RT-qPCR of MIBC tumor samples (n = 46). High 8q22.2 mRNA expression (RNA-AMP) was associated with lymph node metastases. Furthermore, 8q22.2 DNA and RNA amplified patients were more likely to show a luminal subtype (DNA_Amplicon_core: p = 0.029; RNA_Amplicon_core: p = 0.01). Overexpression of the 8q22.2 gene OSR2 predicted shortened DFS in univariable (HR [CI] 1.97 [1.2; 3.22]; p = 0.01) and multivariable in silico analysis (HR [CI] 1.91 [1.15; 3.16]; p = 0.01) and decreased OS (HR [CI] 6.25 [1.37; 28.38]; p = 0.0177) in RT-qPCR data analysis. Alterations in different levels of the 8q22.2 region are associated with manifestation of different clinical characteristics in MIBC. An in-depth comprehensive molecular characterization of genomic regions involved in cancer should include multiple genetic levels, such as DNA copy number alterations and mRNA gene expression, and could lead to a better molecular understanding. In this study, OSR2 is identified as a potential biomarker for survival prognosis.
Collapse
Affiliation(s)
- Daniel Uysal
- Department of Urology and Urosurgery, Medical Faculty Mannheim, University of Heidelberg, Mannheim, Germany
| | - Karl-Friedrich Kowalewski
- Department of Urology and Urosurgery, Medical Faculty Mannheim, University of Heidelberg, Mannheim, Germany
| | | | - Ralph Wirtz
- STRATIFYER Molecular Pathology GmbH, Köln, Germany
| | - Zoran V. Popovic
- Institute of Pathology, Medical Faculty Mannheim, University of Heidelberg, Mannheim, Germany
| | - Philipp Erben
- Department of Urology and Urosurgery, Medical Faculty Mannheim, University of Heidelberg, Mannheim, Germany
- * E-mail:
| |
Collapse
|
5
|
Tokar T, Pastrello C, Jurisica I. GSOAP: a tool for visualization of gene set over-representation analysis. Bioinformatics 2020; 36:2923-2925. [PMID: 31977031 PMCID: PMC7203738 DOI: 10.1093/bioinformatics/btaa001] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2019] [Revised: 11/10/2019] [Accepted: 01/21/2020] [Indexed: 11/13/2022] Open
Abstract
Motivation Gene sets over-representation analysis (GSOA) is a common technique of enrichment analysis that measures the overlap between a gene set and selected instances (e.g. pathways). Despite its popularity, there is currently no established standard for visualization of GSOA results. Results Here, we propose a visual exploration of the GSOA results by showing the relationships among the enriched instances, while highlighting important instance attributes, such as significance, closeness (centrality) and clustering. Availability and implementation GSOAP is implemented as an R package and is available at https://github.com/tomastokar/gsoap.
Collapse
Affiliation(s)
- Tomas Tokar
- Krembil Research Institute, UHN, 60 Leonard Avenue, Toronto, ON M5T 0S8, Canada
| | - Chiara Pastrello
- Krembil Research Institute, UHN, 60 Leonard Avenue, Toronto, ON M5T 0S8, Canada
| | - Igor Jurisica
- Krembil Research Institute, UHN, 60 Leonard Avenue, Toronto, ON M5T 0S8, Canada.,Department of Computer Science, University of Toronto, Toronto, ON M5T 3A1, Canada.,Department of Medical Biophysics, University of Toronto, Toronto, ON M5G 1L7, Canada.,Institute of Neuroimmunology, Slovak Academy of Sciences, Dubravska cesta 9, SK-84510, Bratislava, Slovakia
| |
Collapse
|
6
|
Rock LD, Minatel BC, Marshall EA, Guisier F, Sage AP, Barros-Filho MC, Stewart GL, Garnis C, Lam WL. Expanding the Transcriptome of Head and Neck Squamous Cell Carcinoma Through Novel MicroRNA Discovery. Front Oncol 2019; 9:1305. [PMID: 31828039 PMCID: PMC6890850 DOI: 10.3389/fonc.2019.01305] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2019] [Accepted: 11/11/2019] [Indexed: 12/31/2022] Open
Abstract
Head and neck squamous cell carcinoma (HNSCC) has a poor survival rate mainly due to late stage diagnosis and recurrence. Despite genomic efforts to identify driver mutations and changes in protein-coding gene expression, developing effective diagnostic and prognostic biomarkers remains a priority to guide disease management and improve patient outcome. Recent reports of previously-unannotated microRNAs (miRNAs) from multiple somatic tissues have raised the possibility of HNSCC-specific miRNAs. In this study, we applied a customized in-silico analysis pipeline to identify novel miRNAs from raw small-RNA sequencing datasets from public repositories. We discovered 146 previously-unannotated sequences expressed in head and neck samples that share structural properties highly characteristic of miRNAs. The combined expression of the novel miRNAs revealed tissue and context-specific patterns. Furthermore, comparison of tumor with non-malignant tissue samples (n = 43 pairs) revealed 135 of these miRNAs as differentially expressed, most of which were overexpressed or exclusively found in tumor samples. Additionally, a subset of novel miRNAs was significantly associated with HPV infection status and patient outcome. A prognostic-model combining novel and known miRNA was developed (multivariate Cox regression analysis) leading to an improved death and relapse risk stratification (log rank p < 1e-7). The presence of these miRNAs was corroborated both in an independent dataset and by RT-qPCR analysis, supporting their potential involvement in HNSCC. In this study, we report the discovery of 146 novel miRNAs in head and neck tissues and demonstrate their potential biological significance and clinical relevance to head and neck cancer, providing a new resource for the study of HNSCC.
Collapse
Affiliation(s)
- Leigha D Rock
- Department of Cancer Control Research, British Columbia Cancer Research Centre, Vancouver, BC, Canada.,Faculty of Dentistry, University of British Columbia, Vancouver, BC, Canada.,Department of Integrative Oncology, British Columbia Cancer Research Centre, Vancouver, BC, Canada.,Faculty of Dentistry, Dalhousie University, Halifax, NS, Canada
| | - Brenda C Minatel
- Department of Integrative Oncology, British Columbia Cancer Research Centre, Vancouver, BC, Canada
| | - Erin A Marshall
- Department of Integrative Oncology, British Columbia Cancer Research Centre, Vancouver, BC, Canada
| | - Florian Guisier
- Department of Integrative Oncology, British Columbia Cancer Research Centre, Vancouver, BC, Canada.,Department of Pulmonology and CIC-CRB 1404, Rouen University Hospital, Rouen, France
| | - Adam P Sage
- Department of Integrative Oncology, British Columbia Cancer Research Centre, Vancouver, BC, Canada
| | - Mateus Camargo Barros-Filho
- Department of Integrative Oncology, British Columbia Cancer Research Centre, Vancouver, BC, Canada.,International Research Center-A.C.Camargo Cancer Center, São Paulo, Brazil
| | - Greg L Stewart
- Department of Integrative Oncology, British Columbia Cancer Research Centre, Vancouver, BC, Canada
| | - Cathie Garnis
- Department of Integrative Oncology, British Columbia Cancer Research Centre, Vancouver, BC, Canada
| | - Wan L Lam
- Department of Integrative Oncology, British Columbia Cancer Research Centre, Vancouver, BC, Canada
| |
Collapse
|
7
|
Why imaging data alone is not enough: AI-based integration of imaging, omics, and clinical data. Eur J Nucl Med Mol Imaging 2019; 46:2722-2730. [PMID: 31203421 DOI: 10.1007/s00259-019-04382-9] [Citation(s) in RCA: 52] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2019] [Accepted: 05/28/2019] [Indexed: 12/13/2022]
Abstract
Artificial intelligence (AI) is currently regaining enormous interest due to the success of machine learning (ML), and in particular deep learning (DL). Image analysis, and thus radiomics, strongly benefits from this research. However, effectively and efficiently integrating diverse clinical, imaging, and molecular profile data is necessary to understand complex diseases, and to achieve accurate diagnosis in order to provide the best possible treatment. In addition to the need for sufficient computing resources, suitable algorithms, models, and data infrastructure, three important aspects are often neglected: (1) the need for multiple independent, sufficiently large and, above all, high-quality data sets; (2) the need for domain knowledge and ontologies; and (3) the requirement for multiple networks that provide relevant relationships among biological entities. While one will always get results out of high-dimensional data, all three aspects are essential to provide robust training and validation of ML models, to provide explainable hypotheses and results, and to achieve the necessary trust in AI and confidence for clinical applications.
Collapse
|