Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Ramirez R, Chiu YC, Hererra A, Mostavi M, Ramirez J, Chen Y, Huang Y, Jin YF. Classification of Cancer Types Using Graph Convolutional Neural Networks. Front Phys 2020;8:203. [PMID: 33437754 PMCID: PMC7799442 DOI: 10.3389/fphy.2020.00203] [Citation(s) in RCA: 38] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]

For:	Ramirez R, Chiu YC, Hererra A, Mostavi M, Ramirez J, Chen Y, Huang Y, Jin YF. Classification of Cancer Types Using Graph Convolutional Neural Networks. Front Phys 2020;8:203. [PMID: 33437754 PMCID: PMC7799442 DOI: 10.3389/fphy.2020.00203] [Citation(s) in RCA: 38] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]

Number

Cited by Other Article(s)

Park S, Hong CH, Son SJ, Roh HW, Kim D, Shin H, Woo HG. Identification of molecular subtypes of dementia by using blood-proteins interaction-aware graph propagational network. Brief Bioinform 2024;25:bbae428. [PMID: 39226887 PMCID: PMC11370639 DOI: 10.1093/bib/bbae428] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2024] [Revised: 07/26/2024] [Accepted: 08/15/2024] [Indexed: 09/05/2024] Open

Mi H, Sivagnanam S, Ho WJ, Zhang S, Bergman D, Deshpande A, Baras AS, Jaffee EM, Coussens LM, Fertig EJ, Popel AS. Computational methods and biomarker discovery strategies for spatial proteomics: a review in immuno-oncology. Brief Bioinform 2024;25:bbae421. [PMID: 39179248 PMCID: PMC11343572 DOI: 10.1093/bib/bbae421] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2024] [Revised: 07/11/2024] [Accepted: 08/09/2024] [Indexed: 08/26/2024] Open

Affiliation(s)

Haoyang Mi Department of Biomedical Engineering, Johns Hopkins University School of Medicine, Baltimore, MD 21205, United States
Shamilene Sivagnanam The Knight Cancer Institute, Oregon Health and Science University, Portland, OR 97201, United States Department of Cell, Development and Cancer Biology, Oregon Health and Science University, Portland, OR 97201, United States
Won Jin Ho Department of Oncology, Johns Hopkins University School of Medicine, MD 21205, United States Convergence Institute, Johns Hopkins University, Baltimore, MD 21205, United States
Shuming Zhang Department of Biomedical Engineering, Johns Hopkins University School of Medicine, Baltimore, MD 21205, United States
Daniel Bergman Department of Oncology, Johns Hopkins University School of Medicine, MD 21205, United States Convergence Institute, Johns Hopkins University, Baltimore, MD 21205, United States
Atul Deshpande Department of Oncology, Johns Hopkins University School of Medicine, MD 21205, United States Convergence Institute, Johns Hopkins University, Baltimore, MD 21205, United States Bloomberg-Kimmel Institute for Cancer Immunotherapy, Johns Hopkins University School of Medicine, Baltimore, MD 21205, United States
Alexander S Baras Bloomberg-Kimmel Institute for Cancer Immunotherapy, Johns Hopkins University School of Medicine, Baltimore, MD 21205, United States Department of Pathology, Johns Hopkins University School of Medicine, MD 21205, United States The Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins University School of Medicine, Baltimore, MD 21205, United States
Elizabeth M Jaffee Department of Oncology, Johns Hopkins University School of Medicine, MD 21205, United States Convergence Institute, Johns Hopkins University, Baltimore, MD 21205, United States Bloomberg-Kimmel Institute for Cancer Immunotherapy, Johns Hopkins University School of Medicine, Baltimore, MD 21205, United States
Lisa M Coussens The Knight Cancer Institute, Oregon Health and Science University, Portland, OR 97201, United States Department of Cell, Development and Cancer Biology, Oregon Health and Science University, Portland, OR 97201, United States Brenden-Colson Center for Pancreatic Care, Oregon Health and Science University, Portland, OR 97201, United States
Elana J Fertig Department of Biomedical Engineering, Johns Hopkins University School of Medicine, Baltimore, MD 21205, United States Department of Oncology, Johns Hopkins University School of Medicine, MD 21205, United States Convergence Institute, Johns Hopkins University, Baltimore, MD 21205, United States Bloomberg-Kimmel Institute for Cancer Immunotherapy, Johns Hopkins University School of Medicine, Baltimore, MD 21205, United States Department of Applied Mathematics and Statistics, Johns Hopkins University Whiting School of Engineering, Baltimore, MD 21218, United States
Aleksander S Popel Department of Biomedical Engineering, Johns Hopkins University School of Medicine, Baltimore, MD 21205, United States Department of Oncology, Johns Hopkins University School of Medicine, MD 21205, United States

Collapse

Chereda H, Leha A, Beißbarth T. Stable feature selection utilizing Graph Convolutional Neural Network and Layer-wise Relevance Propagation for biomarker discovery in breast cancer. Artif Intell Med 2024;151:102840. [PMID: 38658129 DOI: 10.1016/j.artmed.2024.102840] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Revised: 03/05/2024] [Accepted: 03/10/2024] [Indexed: 04/26/2024]

Abstract

High-throughput technologies are becoming increasingly important in discovering prognostic biomarkers and in identifying novel drug targets. With Mammaprint, Oncotype DX, and many other prognostic molecular signatures breast cancer is one of the paradigmatic examples of the utility of high-throughput data to deliver prognostic biomarkers, that can be represented in a form of a rather short gene list. Such gene lists can be obtained as a set of features (genes) that are important for the decisions of a Machine Learning (ML) method applied to high-dimensional gene expression data. Several studies have identified predictive gene lists for patient prognosis in breast cancer, but these lists are unstable and have only a few genes in common. Instability of feature selection impedes biological interpretability: genes that are relevant for cancer pathology should be members of any predictive gene list obtained for the same clinical type of patients. Stability and interpretability of selected features can be improved by including information on molecular networks in ML methods. Graph Convolutional Neural Network (GCNN) is a contemporary deep learning approach applicable to gene expression data structured by a prior knowledge molecular network. Layer-wise Relevance Propagation (LRP) and SHapley Additive exPlanations (SHAP) are methods to explain individual decisions of deep learning models. We used both GCNN+LRP and GCNN+SHAP techniques to construct feature sets by aggregating individual explanations. We suggest a methodology to systematically and quantitatively analyze the stability, the impact on the classification performance, and the interpretability of the selected feature sets. We used this methodology to compare GCNN+LRP to GCNN+SHAP and to more classical ML-based feature selection approaches. Utilizing a large breast cancer gene expression dataset we show that, while feature selection with SHAP is useful in applications where selected features have to be impactful for classification performance, among all studied methods GCNN+LRP delivers the most stable (reproducible) and interpretable gene lists.

Collapse

Jubran J, Slutsky R, Rozenblum N, Rokach L, Ben-David U, Yeger-Lotem E. Machine-learning analysis reveals an important role for negative selection in shaping cancer aneuploidy landscapes. Genome Biol 2024;25:95. [PMID: 38622679 PMCID: PMC11020441 DOI: 10.1186/s13059-024-03225-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2023] [Accepted: 03/26/2024] [Indexed: 04/17/2024] Open

Abstract

BACKGROUND

Aneuploidy, an abnormal number of chromosomes within a cell, is a hallmark of cancer. Patterns of aneuploidy differ across cancers, yet are similar in cancers affecting closely related tissues. The selection pressures underlying aneuploidy patterns are not fully understood, hindering our understanding of cancer development and progression.

RESULTS

Here, we apply interpretable machine learning methods to study tissue-selective aneuploidy patterns. We define 20 types of features corresponding to genomic attributes of chromosome-arms, normal tissues, primary tumors, and cancer cell lines (CCLs), and use them to model gains and losses of chromosome arms in 24 cancer types. To reveal the factors that shape the tissue-specific cancer aneuploidy landscapes, we interpret the machine learning models by estimating the relative contribution of each feature to the models. While confirming known drivers of positive selection, our quantitative analysis highlights the importance of negative selection for shaping aneuploidy landscapes. This is exemplified by tumor suppressor gene density being a better predictor of gain patterns than oncogene density, and vice versa for loss patterns. We also identify the importance of tissue-selective features and demonstrate them experimentally, revealing KLF5 as an important driver for chr13q gain in colon cancer. Further supporting an important role for negative selection in shaping the aneuploidy landscapes, we find compensation by paralogs to be among the top predictors of chromosome arm loss prevalence and demonstrate this relationship for one paralog interaction. Similar factors shape aneuploidy patterns in human CCLs, demonstrating their relevance for aneuploidy research.

CONCLUSIONS

Our quantitative, interpretable machine learning models improve the understanding of the genomic properties that shape cancer aneuploidy landscapes.

Collapse

Yan H, Weng D, Li D, Gu Y, Ma W, Liu Q. Prior knowledge-guided multilevel graph neural network for tumor risk prediction and interpretation via multi-omics data integration. Brief Bioinform 2024;25:bbae184. [PMID: 38670157 PMCID: PMC11052635 DOI: 10.1093/bib/bbae184] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2024] [Revised: 03/11/2024] [Accepted: 04/06/2024] [Indexed: 04/28/2024] Open

Luo H, Liang H, Liu H, Fan Z, Wei Y, Yao X, Cong S. TEMINET: A Co-Informative and Trustworthy Multi-Omics Integration Network for Diagnostic Prediction. Int J Mol Sci 2024;25:1655. [PMID: 38338932 PMCID: PMC10855161 DOI: 10.3390/ijms25031655] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2023] [Revised: 01/20/2024] [Accepted: 01/26/2024] [Indexed: 02/12/2024] Open

Brouard C, Mourad R, Vialaneix N. Should we really use graph neural networks for transcriptomic prediction? Brief Bioinform 2024;25:bbae027. [PMID: 38349060 PMCID: PMC10939369 DOI: 10.1093/bib/bbae027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2023] [Revised: 12/20/2023] [Accepted: 01/17/2024] [Indexed: 02/15/2024] Open

Li B, Nabavi S. A multimodal graph neural network framework for cancer molecular subtype classification. BMC Bioinformatics 2024;25:27. [PMID: 38225583 PMCID: PMC10789042 DOI: 10.1186/s12859-023-05622-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2023] [Accepted: 12/15/2023] [Indexed: 01/17/2024] Open

Abstract

BACKGROUND

The recent development of high-throughput sequencing has created a large collection of multi-omics data, which enables researchers to better investigate cancer molecular profiles and cancer taxonomy based on molecular subtypes. Integrating multi-omics data has been proven to be effective for building more precise classification models. Most current multi-omics integrative models use either an early fusion in the form of concatenation or late fusion with a separate feature extractor for each omic, which are mainly based on deep neural networks. Due to the nature of biological systems, graphs are a better structural representation of bio-medical data. Although few graph neural network (GNN) based multi-omics integrative methods have been proposed, they suffer from three common disadvantages. One is most of them use only one type of connection, either inter-omics or intra-omic connection; second, they only consider one kind of GNN layer, either graph convolution network (GCN) or graph attention network (GAT); and third, most of these methods have not been tested on a more complex classification task, such as cancer molecular subtypes.

RESULTS

In this study, we propose a novel end-to-end multi-omics GNN framework for accurate and robust cancer subtype classification. The proposed model utilizes multi-omics data in the form of heterogeneous multi-layer graphs, which combine both inter-omics and intra-omic connections from established biological knowledge. The proposed model incorporates learned graph features and global genome features for accurate classification. We tested the proposed model on the Cancer Genome Atlas (TCGA) Pan-cancer dataset and TCGA breast invasive carcinoma (BRCA) dataset for molecular subtype and cancer subtype classification, respectively. The proposed model shows superior performance compared to four current state-of-the-art baseline models in terms of accuracy, F1 score, precision, and recall. The comparative analysis of GAT-based models and GCN-based models reveals that GAT-based models are preferred for smaller graphs with less information and GCN-based models are preferred for larger graphs with extra information.

Collapse

Zou J, Shah O, Chiu YC, Ma T, Atkinson JM, Oesterreich S, Lee AV, Tseng GC. Systems approach for congruence and selection of cancer models towards precision medicine. PLoS Comput Biol 2024;20:e1011754. [PMID: 38198519 PMCID: PMC10805322 DOI: 10.1371/journal.pcbi.1011754] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2023] [Revised: 01/23/2024] [Accepted: 12/12/2023] [Indexed: 01/12/2024] Open

Affiliation(s)

Jian Zou Department of Statistics, School of Public Health, Chongqing Medical University, Chongqing, China
Osama Shah Women’s Cancer Research Center, UPMC Hillman Cancer Center (HCC), Pittsburgh, Pennsylvania, United States of America Magee-Womens Research Institute, Pittsburgh, Pennsylvania, United States of America Department of Pharmacology & Chemical Biology, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
Yu-Chiao Chiu Cancer Therapeutics Program, UPMC Hillman Cancer Center (HCC), Pittsburgh, Pennsylvania, United States of America Department of Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
Tianzhou Ma Department of Epidemiology and Biostatistics, University of Maryland, College Park, Maryland, United States of America
Jennifer M. Atkinson Women’s Cancer Research Center, UPMC Hillman Cancer Center (HCC), Pittsburgh, Pennsylvania, United States of America Magee-Womens Research Institute, Pittsburgh, Pennsylvania, United States of America Department of Pharmacology & Chemical Biology, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
Steffi Oesterreich Women’s Cancer Research Center, UPMC Hillman Cancer Center (HCC), Pittsburgh, Pennsylvania, United States of America Magee-Womens Research Institute, Pittsburgh, Pennsylvania, United States of America Department of Pharmacology & Chemical Biology, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
Adrian V. Lee Women’s Cancer Research Center, UPMC Hillman Cancer Center (HCC), Pittsburgh, Pennsylvania, United States of America Magee-Womens Research Institute, Pittsburgh, Pennsylvania, United States of America Department of Pharmacology & Chemical Biology, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
George C. Tseng Department of Biostatistics, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America

Collapse

Hassan T, Li Z, Javed S, Dias J, Werghi N. Neural Graph Refinement for Robust Recognition of Nuclei Communities in Histopathological Landscape. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2023;33:241-256. [PMID: 38064329 DOI: 10.1109/tip.2023.3337666] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/20/2023]

Bhonde SB, Wagh SK, Prasad JR. Identification of cancer types from gene expressions using learning techniques. Comput Methods Biomech Biomed Engin 2023;26:1951-1965. [PMID: 36562388 DOI: 10.1080/10255842.2022.2160243] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2022] [Revised: 10/15/2022] [Accepted: 11/15/2022] [Indexed: 12/24/2022]

Tran KA, Addala V, Johnston RL, Lovell D, Bradley A, Koufariotis LT, Wood S, Wu SZ, Roden D, Al-Eryani G, Swarbrick A, Williams ED, Pearson JV, Kondrashova O, Waddell N. Performance of tumour microenvironment deconvolution methods in breast cancer using single-cell simulated bulk mixtures. Nat Commun 2023;14:5758. [PMID: 37717006 PMCID: PMC10505141 DOI: 10.1038/s41467-023-41385-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2022] [Accepted: 09/01/2023] [Indexed: 09/18/2023] Open

Affiliation(s)

Khoa A Tran Cancer Program, QIMR Berghofer Medical Research Institute, Brisbane, QLD, 4006, Australia School of Biomedical Sciences, Queensland University of Technology (QUT), Brisbane, QLD, 4000, Australia
Venkateswar Addala Cancer Program, QIMR Berghofer Medical Research Institute, Brisbane, QLD, 4006, Australia
Rebecca L Johnston Cancer Program, QIMR Berghofer Medical Research Institute, Brisbane, QLD, 4006, Australia
David Lovell School of Computer Science, Queensland University of Technology, Brisbane, QLD, 4000, Australia QUT Centre for Data Science, Brisbane, QLD, 4000, Australia
Andrew Bradley Faculty of Engineering, Queensland University of Technology, Brisbane, QLD, 4000, Australia
Lambros T Koufariotis Cancer Program, QIMR Berghofer Medical Research Institute, Brisbane, QLD, 4006, Australia
Scott Wood Cancer Program, QIMR Berghofer Medical Research Institute, Brisbane, QLD, 4006, Australia
Sunny Z Wu Cancer Ecosystems Program, Garvan Institute of Medical Research, Darlinghurst, NSW, 2010, Australia School of Clinical Medicine, Faculty of Medicine and Health, UNSW Sydney, Kensington, NSW, 2052, Australia
Daniel Roden Cancer Ecosystems Program, Garvan Institute of Medical Research, Darlinghurst, NSW, 2010, Australia School of Clinical Medicine, Faculty of Medicine and Health, UNSW Sydney, Kensington, NSW, 2052, Australia
Ghamdan Al-Eryani Cancer Ecosystems Program, Garvan Institute of Medical Research, Darlinghurst, NSW, 2010, Australia School of Clinical Medicine, Faculty of Medicine and Health, UNSW Sydney, Kensington, NSW, 2052, Australia
Alexander Swarbrick Cancer Ecosystems Program, Garvan Institute of Medical Research, Darlinghurst, NSW, 2010, Australia School of Clinical Medicine, Faculty of Medicine and Health, UNSW Sydney, Kensington, NSW, 2052, Australia
Elizabeth D Williams School of Biomedical Sciences, Queensland University of Technology (QUT), Brisbane, QLD, 4000, Australia Australian Prostate Cancer Research Centre - Queensland (APCRC-Q) and Queensland Bladder Cancer Initiative (QBCI), Brisbane, QLD, 4000, Australia
John V Pearson Cancer Program, QIMR Berghofer Medical Research Institute, Brisbane, QLD, 4006, Australia
Olga Kondrashova Cancer Program, QIMR Berghofer Medical Research Institute, Brisbane, QLD, 4006, Australia
Nicola Waddell Cancer Program, QIMR Berghofer Medical Research Institute, Brisbane, QLD, 4006, Australia. School of Biomedical Sciences, Queensland University of Technology (QUT), Brisbane, QLD, 4000, Australia.

Collapse

Zolotovskaia M, Kovalenko M, Pugacheva P, Tkachev V, Simonov A, Sorokin M, Seryakov A, Garazha A, Gaifullin N, Sekacheva M, Zakharova G, Buzdin AA. Algorithmically Reconstructed Molecular Pathways as the New Generation of Prognostic Molecular Biomarkers in Human Solid Cancers. Proteomes 2023;11:26. [PMID: 37755705 PMCID: PMC10535530 DOI: 10.3390/proteomes11030026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2023] [Revised: 08/18/2023] [Accepted: 08/22/2023] [Indexed: 09/28/2023] Open

Affiliation(s)

Marianna Zolotovskaia Laboratory for Translational Genomic Bioinformatics, Moscow Institute of Physics and Technology (State University), 141701 Dolgoprudny, Russia Omicsway Corp., Walnut, CA 91789, USA Laboratory of Clinical and Genomic Bioinformatics, I.M. Sechenov First Moscow State Medical University, 119048 Moscow, Russia
Maks Kovalenko Laboratory for Translational Genomic Bioinformatics, Moscow Institute of Physics and Technology (State University), 141701 Dolgoprudny, Russia
Polina Pugacheva Laboratory for Translational Genomic Bioinformatics, Moscow Institute of Physics and Technology (State University), 141701 Dolgoprudny, Russia
Victor Tkachev Omicsway Corp., Walnut, CA 91789, USA
Alexander Simonov Laboratory for Translational Genomic Bioinformatics, Moscow Institute of Physics and Technology (State University), 141701 Dolgoprudny, Russia Omicsway Corp., Walnut, CA 91789, USA
Maxim Sorokin Laboratory for Translational Genomic Bioinformatics, Moscow Institute of Physics and Technology (State University), 141701 Dolgoprudny, Russia Laboratory of Clinical and Genomic Bioinformatics, I.M. Sechenov First Moscow State Medical University, 119048 Moscow, Russia PathoBiology Group, European Organization for Research and Treatment of Cancer (EORTC), 1200 Brussels, Belgium
Alexander Seryakov Medical Holding SM-Clinic, 105120 Moscow, Russia
Andrew Garazha Omicsway Corp., Walnut, CA 91789, USA
Nurshat Gaifullin Department of Pathology, Faculty of Medicine, Lomonosov Moscow State University, 119991 Moscow, Russia
Marina Sekacheva Laboratory of Clinical and Genomic Bioinformatics, I.M. Sechenov First Moscow State Medical University, 119048 Moscow, Russia
Galina Zakharova Laboratory of Clinical and Genomic Bioinformatics, I.M. Sechenov First Moscow State Medical University, 119048 Moscow, Russia
Anton A. Buzdin Laboratory for Translational Genomic Bioinformatics, Moscow Institute of Physics and Technology (State University), 141701 Dolgoprudny, Russia PathoBiology Group, European Organization for Research and Treatment of Cancer (EORTC), 1200 Brussels, Belgium World-Class Research Center “Digital Biodesign and Personalized Healthcare”, Sechenov First Moscow State Medical University, 119048 Moscow, Russia Laboratory of Systems Biology, Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, 117997 Moscow, Russia

Collapse

Duan M, Wang Y, Zhao D, Liu H, Zhang G, Li K, Zhang H, Huang L, Zhang R, Zhou F. Orchestrating information across tissues via a novel multitask GAT framework to improve quantitative gene regulation relation modeling for survival analysis. Brief Bioinform 2023;24:bbad238. [PMID: 37427963 DOI: 10.1093/bib/bbad238] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2023] [Revised: 05/29/2023] [Accepted: 06/08/2023] [Indexed: 07/11/2023] Open

Beaude A, Rafiee Vahid M, Augé F, Zehraoui F, Hanczar B. AttOmics: attention-based architecture for diagnosis and prognosis from omics data. Bioinformatics 2023;39:i94-i102. [PMID: 37387182 DOI: 10.1093/bioinformatics/btad232] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/01/2023] Open

Padegal G, Rao MK, Boggaram Ravishankar OA, Acharya S, Athri P, Srinivasa G. Analysis of RNA-Seq data using self-supervised learning for vital status prediction of colorectal cancer patients. BMC Bioinformatics 2023;24:241. [PMID: 37286944 DOI: 10.1186/s12859-023-05347-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2023] [Accepted: 05/21/2023] [Indexed: 06/09/2023] Open

Kesimoglu ZN, Bozdag S. SUPREME: multiomics data integration using graph convolutional networks. NAR Genom Bioinform 2023;5:lqad063. [PMID: 37680392 PMCID: PMC10481254 DOI: 10.1093/nargab/lqad063] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2023] [Revised: 05/08/2023] [Accepted: 06/07/2023] [Indexed: 09/09/2023] Open

Wysocka M, Wysocki O, Zufferey M, Landers D, Freitas A. A systematic review of biologically-informed deep learning models for cancer: fundamental trends for encoding and interpreting oncology data. BMC Bioinformatics 2023;24:198. [PMID: 37189058 DOI: 10.1186/s12859-023-05262-8] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2022] [Accepted: 03/30/2023] [Indexed: 05/17/2023] Open

Abstract

BACKGROUND

There is an increasing interest in the use of Deep Learning (DL) based methods as a supporting analytical framework in oncology. However, most direct applications of DL will deliver models with limited transparency and explainability, which constrain their deployment in biomedical settings.

METHODS

This systematic review discusses DL models used to support inference in cancer biology with a particular emphasis on multi-omics analysis. It focuses on how existing models address the need for better dialogue with prior knowledge, biological plausibility and interpretability, fundamental properties in the biomedical domain. For this, we retrieved and analyzed 42 studies focusing on emerging architectural and methodological advances, the encoding of biological domain knowledge and the integration of explainability methods.

RESULTS

We discuss the recent evolutionary arch of DL models in the direction of integrating prior biological relational and network knowledge to support better generalisation (e.g. pathways or Protein-Protein-Interaction networks) and interpretability. This represents a fundamental functional shift towards models which can integrate mechanistic and statistical inference aspects. We introduce a concept of bio-centric interpretability and according to its taxonomy, we discuss representational methodologies for the integration of domain prior knowledge in such models.

CONCLUSIONS

The paper provides a critical outlook into contemporary methods for explainability and interpretability used in DL for cancer. The analysis points in the direction of a convergence between encoding prior knowledge and improved interpretability. We introduce bio-centric interpretability which is an important step towards formalisation of biological interpretability of DL models and developing methods that are less problem- or application-specific.

Collapse

Zhang Z, Wei X. Artificial intelligence-assisted selection and efficacy prediction of antineoplastic strategies for precision cancer therapy. Semin Cancer Biol 2023;90:57-72. [PMID: 36796530 DOI: 10.1016/j.semcancer.2023.02.005] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2022] [Revised: 01/12/2023] [Accepted: 02/13/2023] [Indexed: 02/16/2023]

Bairakdar MD, Tewari A, Truttmann MC. A meta-analysis of RNA-Seq studies to identify novel genes that regulate aging. Exp Gerontol 2023;173:112107. [PMID: 36731807 PMCID: PMC10653729 DOI: 10.1016/j.exger.2023.112107] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2022] [Revised: 01/17/2023] [Accepted: 01/23/2023] [Indexed: 02/04/2023]

Liu C, Duan Y, Zhou Q, Wang Y, Gao Y, Kan H, Hu J. A classification method of gastric cancer subtype based on residual graph convolution network. Front Genet 2023;13:1090394. [PMID: 36685956 PMCID: PMC9845413 DOI: 10.3389/fgene.2022.1090394] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2022] [Accepted: 12/09/2022] [Indexed: 01/06/2023] Open

Deep-Learning Algorithm and Concomitant Biomarker Identification for NSCLC Prediction Using Multi-Omics Data Integration. Biomolecules 2022;12:biom12121839. [PMID: 36551266 PMCID: PMC9775093 DOI: 10.3390/biom12121839] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2022] [Revised: 12/05/2022] [Accepted: 12/05/2022] [Indexed: 12/14/2022] Open

Jones S, Beyers M, Shukla M, Xia F, Brettin T, Stevens R, Weil MR, Ranganathan Ganakammal S. TULIP: An RNA-seq-based Primary Tumor Type Prediction Tool Using Convolutional Neural Networks. Cancer Inform 2022;21:11769351221139491. [PMID: 36507076 PMCID: PMC9729992 DOI: 10.1177/11769351221139491] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2022] [Accepted: 10/28/2022] [Indexed: 12/12/2022] Open

Abstract

Background

With cancer as one of the leading causes of death worldwide, accurate primary tumor type prediction is critical in identifying genetic factors that can inhibit or slow tumor progression. There have been efforts to categorize primary tumor types with gene expression data using machine learning, and more recently with deep learning, in the last several years.

Methods

In this paper, we developed four 1-dimensional (1D) Convolutional Neural Network (CNN) models to classify RNA-seq count data as one of 17 highly represented primary tumor types or 32 primary tumor types regardless of imbalanced representation. Additionally, we adapted the models to take as input either all Ensembl genes (60,483) or protein coding genes only (19,758). Unlike previous work, we avoided selection bias by not filtering genes based on expression values. RNA-seq count data expressed as FPKM-UQ of 9,025 and 10,940 samples from The Cancer Genome Atlas (TCGA) were downloaded from the Genomic Data Commons (GDC) corresponding to 17 and 32 primary tumor types respectively for training and validating the models.

Results

All 4 1D-CNN models had an overall accuracy of 94.7% to 97.6% on the test dataset. Further evaluation indicates that the models with protein coding genes only as features performed with better accuracy compared to the models with all Ensembl genes for both 17 and 32 primary tumor types. For all models, the accuracy by primary tumor type was above 80% for most primary tumor types.

Conclusions

We packaged all 4 models as a Python-based deep learning classification tool called TULIP (TUmor CLassIfication Predictor) for performing quality control on primary tumor samples and characterizing cancer samples of unknown tumor type. Further optimization of the models is needed to improve the accuracy of certain primary tumor types.

Collapse

Kuang J, Scoglio C, Michel K. Feature learning and network structure from noisy node activity data. Phys Rev E 2022;106:064301. [PMID: 36671154 PMCID: PMC9869472 DOI: 10.1103/physreve.106.064301] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2021] [Accepted: 11/17/2022] [Indexed: 06/17/2023]

Characterizing Macrophages Diversity in COVID-19 Patients Using Deep Learning. Genes (Basel) 2022;13:genes13122264. [PMID: 36553530 PMCID: PMC9777824 DOI: 10.3390/genes13122264] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2022] [Revised: 11/23/2022] [Accepted: 11/28/2022] [Indexed: 12/04/2022] Open

Abstract

The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the etiological agent responsible for coronavirus disease 2019 (COVID-19), has affected the lives of billions and killed millions of infected people. This virus has been demonstrated to have different outcomes among individuals, with some of them presenting a mild infection, while others present severe symptoms or even death. The identification of the molecular states related to the severity of a COVID-19 infection has become of the utmost importance to understanding the differences in critical immune response. In this study, we computationally processed a set of publicly available single-cell RNA-Seq (scRNA-Seq) data of 12 Bronchoalveolar Lavage Fluid (BALF) samples diagnosed as having a mild, severe, or no infection, and generated a high-quality dataset that consists of 63,734 cells, each with 23,916 genes. We extended the cell-type and sub-type composition identification and our analysis showed significant differences in cell-type composition in mild and severe groups compared to the normal. Importantly, inflammatory responses were dramatically elevated in the severe group, which was evidenced by the significant increase in macrophages, from 10.56% in the normal group to 20.97% in the mild group and 34.15% in the severe group. As an indicator of immune defense, populations of T cells accounted for 24.76% in the mild group and decreased to 7.35% in the severe group. To verify these findings, we developed several artificial neural networks (ANNs) and graph convolutional neural network (GCNN) models. We showed that the GCNN models reach a prediction accuracy of the infection of 91.16% using data from subtypes of macrophages. Overall, our study indicates significant differences in the gene expression profiles of inflammatory response and immune cells of severely infected patients.

Collapse

Li MM, Huang K, Zitnik M. Graph representation learning in biomedicine and healthcare. Nat Biomed Eng 2022;6:1353-1369. [PMID: 36316368 PMCID: PMC10699434 DOI: 10.1038/s41551-022-00942-x] [Citation(s) in RCA: 30] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2021] [Accepted: 08/09/2022] [Indexed: 11/11/2022]

EpICC: A Bayesian neural network model with uncertainty correction for a more accurate classification of cancer. Sci Rep 2022;12:14628. [PMID: 36028643 PMCID: PMC9418241 DOI: 10.1038/s41598-022-18874-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2022] [Accepted: 08/22/2022] [Indexed: 11/09/2022] Open

Hanczar B, Bourgeais V, Zehraoui F. Assessment of deep learning and transfer learning for cancer prediction based on gene expression data. BMC Bioinformatics 2022;23:262. [PMID: 35786378 PMCID: PMC9250744 DOI: 10.1186/s12859-022-04807-7] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2022] [Accepted: 06/15/2022] [Indexed: 11/10/2022] Open

Pathway importance by graph convolutional network and Shapley additive explanations in gene expression phenotype of diffuse large B-cell lymphoma. PLoS One 2022;17:e0269570. [PMID: 35749395 PMCID: PMC9231717 DOI: 10.1371/journal.pone.0269570] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2021] [Accepted: 05/09/2022] [Indexed: 11/30/2022] Open

A Novel Attention-Mechanism Based Cox Survival Model by Exploiting Pan-Cancer Empirical Genomic Information. Cells 2022;11:cells11091421. [PMID: 35563727 PMCID: PMC9100007 DOI: 10.3390/cells11091421] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2022] [Revised: 04/15/2022] [Accepted: 04/19/2022] [Indexed: 01/27/2023] Open

Bourgeais V, Zehraoui F, Hanczar B. GraphGONet: a self-explaining neural network encapsulating the Gene Ontology graph for phenotype prediction on gene expression. Bioinformatics 2022;38:2504-2511. [PMID: 35266505 DOI: 10.1093/bioinformatics/btac147] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2021] [Revised: 02/02/2022] [Accepted: 03/07/2022] [Indexed: 11/13/2022] Open

Kakati T, Bhattacharyya DK, Kalita JK, Norden-Krichmar TM. DEGnext: classification of differentially expressed genes from RNA-seq data using a convolutional neural network with transfer learning. BMC Bioinformatics 2022;23:17. [PMID: 34991439 PMCID: PMC8734099 DOI: 10.1186/s12859-021-04527-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2020] [Accepted: 12/13/2021] [Indexed: 12/11/2022] Open

Abstract

BACKGROUND

A limitation of traditional differential expression analysis on small datasets involves the possibility of false positives and false negatives due to sample variation. Considering the recent advances in deep learning (DL) based models, we wanted to expand the state-of-the-art in disease biomarker prediction from RNA-seq data using DL. However, application of DL to RNA-seq data is challenging due to absence of appropriate labels and smaller sample size as compared to number of genes. Deep learning coupled with transfer learning can improve prediction performance on novel data by incorporating patterns learned from other related data. With the emergence of new disease datasets, biomarker prediction would be facilitated by having a generalized model that can transfer the knowledge of trained feature maps to the new dataset. To the best of our knowledge, there is no Convolutional Neural Network (CNN)-based model coupled with transfer learning to predict the significant upregulating (UR) and downregulating (DR) genes from both trained and untrained datasets.

RESULTS

We implemented a CNN model, DEGnext, to predict UR and DR genes from gene expression data obtained from The Cancer Genome Atlas database. DEGnext uses biologically validated data along with logarithmic fold change values to classify differentially expressed genes (DEGs) as UR and DR genes. We applied transfer learning to our model to leverage the knowledge of trained feature maps to untrained cancer datasets. DEGnext's results were competitive (ROC scores between 88 and 99[Formula: see text]) with those of five traditional machine learning methods: Decision Tree, K-Nearest Neighbors, Random Forest, Support Vector Machine, and XGBoost. DEGnext was robust and effective in terms of transferring learned feature maps to facilitate classification of unseen datasets. Additionally, we validated that the predicted DEGs from DEGnext were mapped to significant Gene Ontology terms and pathways related to cancer.

CONCLUSIONS

DEGnext can classify DEGs into UR and DR genes from RNA-seq cancer datasets with high performance. This type of analysis, using biologically relevant fine-tuning data, may aid in the exploration of potential biomarkers and can be adapted for other disease datasets.

Collapse

Ghandikota S, Jegga AG. gene2gauss: A multi-view gaussian gene embedding learner for analyzing transcriptomic networks. AMIA ... ANNUAL SYMPOSIUM PROCEEDINGS. AMIA SYMPOSIUM 2022;2022:206-215. [PMID: 35854722 PMCID: PMC9285176] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 05/01/2023]

Baranwal M, Krishnan S, Oneka M, Frankel T, Rao A. CGAT: Cell Graph ATtention Network for Grading of Pancreatic Disease Histology Images. Front Immunol 2021;12:727610. [PMID: 34671349 PMCID: PMC8522581 DOI: 10.3389/fimmu.2021.727610] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2021] [Accepted: 09/03/2021] [Indexed: 11/13/2022] Open

Tran KA, Kondrashova O, Bradley A, Williams ED, Pearson JV, Waddell N. Deep learning in cancer diagnosis, prognosis and treatment selection. Genome Med 2021;13:152. [PMID: 34579788 PMCID: PMC8477474 DOI: 10.1186/s13073-021-00968-x] [Citation(s) in RCA: 244] [Impact Index Per Article: 81.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2020] [Accepted: 09/12/2021] [Indexed: 12/13/2022] Open

Chiu YC, Zheng S, Wang LJ, Iskra BS, Rao MK, Houghton PJ, Huang Y, Chen Y. Predicting and characterizing a cancer dependency map of tumors with deep learning. SCIENCE ADVANCES 2021;7:7/34/eabh1275. [PMID: 34417181 PMCID: PMC8378822 DOI: 10.1126/sciadv.abh1275] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/26/2021] [Accepted: 06/29/2021] [Indexed: 05/14/2023]

Shawki MM, Azmy MM, Salama M, Shawki S. Mathematical and deep learning analysis based on tissue dielectric properties at low frequencies predict outcome in human breast cancer. Technol Health Care 2021;30:633-645. [PMID: 34366303 DOI: 10.3233/thc-213096] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]

Mostavi M, Chiu YC, Chen Y, Huang Y. CancerSiamese: one-shot learning for predicting primary and metastatic tumor types unseen during model training. BMC Bioinformatics 2021;22:244. [PMID: 33980137 PMCID: PMC8117642 DOI: 10.1186/s12859-021-04157-w] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2020] [Accepted: 04/27/2021] [Indexed: 02/06/2023] Open

Abstract

BACKGROUND

The state-of-the-art deep learning based cancer type prediction can only predict cancer types whose samples are available during the training where the sample size is commonly large. In this paper, we consider how to utilize the existing training samples to predict cancer types unseen during the training. We hypothesize the existence of a set of type-agnostic expression representations that define the similarity/dissimilarity between samples of the same/different types and propose a novel one-shot learning model called CancerSiamese to learn this common representation. CancerSiamese accepts a pair of query and support samples (gene expression profiles) and learns the representation of similar or dissimilar cancer types through two parallel convolutional neural networks joined by a similarity function.

RESULTS

We trained CancerSiamese for cancer type prediction for primary and metastatic tumors using samples from the Cancer Genome Atlas (TCGA) and MET500. Network transfer learning was utilized to facilitate the training of the CancerSiamese models. CancerSiamese was tested for different N-way predictions and yielded an average accuracy improvement of 8% and 4% over the benchmark 1-Nearest Neighbor (1-NN) classifier for primary and metastatic tumors, respectively. Moreover, we applied the guided gradient saliency map and feature selection to CancerSiamese to examine 100 and 200 top marker-gene candidates for the prediction of primary and metastatic cancers, respectively. Functional analysis of these marker genes revealed several cancer related functions between primary and metastatic tumors.

CONCLUSION

This work demonstrated, for the first time, the feasibility of predicting unseen cancer types whose samples are limited. Thus, it could inspire new and ingenious applications of one-shot and few-shot learning solutions for improving cancer diagnosis, prognostic, and our understanding of cancer.

Collapse

Gated Graph Attention Network for Cancer Prediction. SENSORS 2021;21:s21061938. [PMID: 33801894 PMCID: PMC7998488 DOI: 10.3390/s21061938] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/28/2020] [Revised: 03/02/2021] [Accepted: 03/05/2021] [Indexed: 01/17/2023]

Ramirez R, Chiu YC, Zhang S, Ramirez J, Chen Y, Huang Y, Jin YF. Prediction and interpretation of cancer survival using graph convolution neural networks. Methods 2021;192:120-130. [PMID: 33484826 DOI: 10.1016/j.ymeth.2021.01.004] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2020] [Revised: 01/07/2021] [Accepted: 01/12/2021] [Indexed: 12/13/2022] Open

Abstract

The survival rate of cancer has increased significantly during the past two decades for breast, prostate, testicular, and colon cancer, while the brain and pancreatic cancers have a much lower median survival rate that has not improved much over the last forty years. This has imposed the challenge of finding gene markers for early cancer detection and treatment strategies. Different methods including regression-based Cox-PH, artificial neural networks, and recently deep learning algorithms have been proposed to predict the survival rate for cancers. We established in this work a novel graph convolution neural network (GCNN) approach called Surv_GCNN to predict the survival rate for 13 different cancer types using the TCGA dataset. For each cancer type, 6 Surv_GCNN models with graphs generated by correlation analysis, GeneMania database, and correlation + GeneMania were trained with and without clinical data to predict the risk score (RS). The performance of the 6 Surv_GCNN models was compared with two other existing models, Cox-PH and Cox-nnet. The results showed that Cox-PH has the worst performance among 8 tested models across the 13 cancer types while Surv_GCNN models with clinical data reported the best overall performance, outperforming other competing models in 7 out of 13 cancer types including BLCA, BRCA, COAD, LUSC, SARC, STAD, and UCEC. A novel network-based interpretation of Surv_GCNN was also proposed to identify potential gene markers for breast cancer. The signatures learned by the nodes in the hidden layer of Surv_GCNN were identified and were linked to potential gene markers by network modularization. The identified gene markers for breast cancer have been compared to a total of 213 gene markers from three widely cited lists for breast cancer survival analysis. About 57% of gene markers obtained by Surv_GCNN with correlation + GeneMania graph either overlap or directly interact with the 213 genes, confirming the effectiveness of the identified markers by Surv_GCNN.

Collapse