Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Fortelny N, Bock C. Knowledge-primed neural networks enable biologically interpretable deep learning on single-cell sequencing data. Genome Biol 2020;21:190. [PMID: 32746932 PMCID: PMC7397672 DOI: 10.1186/s13059-020-02100-5] [Citation(s) in RCA: 54] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2020] [Accepted: 07/10/2020] [Indexed: 12/23/2022] Open

For:	Fortelny N, Bock C. Knowledge-primed neural networks enable biologically interpretable deep learning on single-cell sequencing data. Genome Biol 2020;21:190. [PMID: 32746932 PMCID: PMC7397672 DOI: 10.1186/s13059-020-02100-5] [Citation(s) in RCA: 54] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2020] [Accepted: 07/10/2020] [Indexed: 12/23/2022] Open

Number

Cited by Other Article(s)

Xi X, Li J, Jia J, Meng Q, Li C, Wang X, Wei L, Zhang X. A mechanism-informed deep neural network enables prioritization of regulators that drive cell state transitions. Nat Commun 2025;16:1284. [PMID: 39900922 PMCID: PMC11790924 DOI: 10.1038/s41467-025-56475-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2024] [Accepted: 01/15/2025] [Indexed: 02/05/2025] Open

Qi H, Zhao H, Li E, Lu X, Yu N, Liu J, Han J. DeepQA: A Unified Transcriptome-Based Aging Clock Using Deep Neural Networks. Aging Cell 2025:e14471. [PMID: 39757434 DOI: 10.1111/acel.14471] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2024] [Revised: 11/21/2024] [Accepted: 12/17/2024] [Indexed: 01/07/2025] Open

Zeng R, Li Z, Li J, Zhang Q. DNA promoter task-oriented dictionary mining and prediction model based on natural language technology. Sci Rep 2025;15:153. [PMID: 39747934 PMCID: PMC11697570 DOI: 10.1038/s41598-024-84105-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2024] [Accepted: 12/19/2024] [Indexed: 01/04/2025] Open

Cao G, Chen D. Unveiling Long Non-coding RNA Networks from Single-Cell Omics Data Through Artificial Intelligence. Methods Mol Biol 2025;2883:257-279. [PMID: 39702712 DOI: 10.1007/978-1-0716-4290-0_11] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2024]

Crawford J, Chikina M, Greene CS. Best holdout assessment is sufficient for cancer transcriptomic model selection. PATTERNS (NEW YORK, N.Y.) 2024;5:101115. [PMID: 39776849 PMCID: PMC11701843 DOI: 10.1016/j.patter.2024.101115] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/02/2024] [Revised: 08/01/2024] [Accepted: 11/13/2024] [Indexed: 01/11/2025]

Wu Y, Xu P, Wang L, Liu S, Hou Y, Lu H, Hu P, Li X, Yu X. scGO: interpretable deep neural network for cell status annotation and disease diagnosis. Brief Bioinform 2024;26:bbaf018. [PMID: 39820437 PMCID: PMC11737892 DOI: 10.1093/bib/bbaf018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2024] [Revised: 12/16/2024] [Accepted: 01/10/2025] [Indexed: 01/19/2025] Open

Wang J, Wen Y, Zhang Y, Wang Z, Jiang Y, Dai C, Wu L, Leng D, He S, Bo X. An interpretable artificial intelligence framework for designing synthetic lethality-based anti-cancer combination therapies. J Adv Res 2024;65:329-343. [PMID: 38043609 PMCID: PMC11519055 DOI: 10.1016/j.jare.2023.11.035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2023] [Revised: 11/27/2023] [Accepted: 11/29/2023] [Indexed: 12/05/2023] Open

Cao P, Dun Y, Xiang X, Wang D, Cheng W, Yan L, Li H. Machine learning-based individualized survival prediction model for prognosis in osteosarcoma: Data from the SEER database. Medicine (Baltimore) 2024;103:e39582. [PMID: 39331900 PMCID: PMC11441932 DOI: 10.1097/md.0000000000039582] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/29/2024] [Accepted: 08/15/2024] [Indexed: 09/29/2024] Open

Abstract

Patient outcomes of osteosarcoma vary because of tumor heterogeneity and treatment strategies. This study aimed to compare the performance of multiple machine learning (ML) models with the traditional Cox proportional hazards (CoxPH) model in predicting prognosis and explored the potential of ML models in clinical decision-making. From 2000 to 2018, 1243 patients with osteosarcoma were collected from the Surveillance, Epidemiology, and End Results (SEER) database. Three ML methods were chosen for model development (DeepSurv, neural multi-task logistic regression [NMTLR]) and random survival forest [RSF]) and compared them with the traditional CoxPH model and TNM staging systems. 871 samples were used for model training, and the rest were used for model validation. The models' overall performance and predictive accuracy for 3- and 5-year survival were assessed by several metrics, including the concordance index (C-index), the Integrated Brier Score (IBS), receiver operating characteristic curves (ROC), area under the ROC curves (AUC), calibration curves, and decision curve analysis. The efficacy of personalized recommendations by ML models was evaluated by the survival curves. The performance was highest in the DeepSurv model (C-index, 0.77; IBS, 0.14; 3-year AUC, 0.80; 5-year AUC, 0.78) compared with other methods (C-index, 0.73-0.74; IBS, 0.16-0.17; 3-year AUC, 0.73-0.78; 5-year AUC, 0.72-0.78). There are also significant differences in survival outcomes between patients who align with the treatment option recommended by the DeepSurv model and those who do not (hazard ratio, 1.88; P < .05). The DeepSurv model is available in an approachable web app format at https://survivalofosteosarcoma.streamlit.app/. We developed ML models capable of accurately predicting the survival of osteosarcoma, which can provide useful information for decision-making regarding the appropriate treatment.

Collapse

Venafra V, Sacco F, Perfetto L. SignalingProfiler 2.0 a network-based approach to bridge multi-omics data to phenotypic hallmarks. NPJ Syst Biol Appl 2024;10:95. [PMID: 39179556 PMCID: PMC11343843 DOI: 10.1038/s41540-024-00417-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2024] [Accepted: 07/31/2024] [Indexed: 08/26/2024] Open

van Hilten A, van Rooij J, Ikram MA, Niessen WJ, van Meurs JBJ, Roshchupkin GV. Phenotype prediction using biologically interpretable neural networks on multi-cohort multi-omics data. NPJ Syst Biol Appl 2024;10:81. [PMID: 39095438 PMCID: PMC11297229 DOI: 10.1038/s41540-024-00405-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2023] [Accepted: 07/12/2024] [Indexed: 08/04/2024] Open

Lobentanzer S, Rodriguez-Mier P, Bauer S, Saez-Rodriguez J. Molecular causality in the advent of foundation models. Mol Syst Biol 2024;20:848-858. [PMID: 38890548 PMCID: PMC11297329 DOI: 10.1038/s44320-024-00041-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2024] [Revised: 03/18/2024] [Accepted: 03/21/2024] [Indexed: 06/20/2024] Open

Chen V, Yang M, Cui W, Kim JS, Talwalkar A, Ma J. Applying interpretable machine learning in computational biology-pitfalls, recommendations and opportunities for new developments. Nat Methods 2024;21:1454-1461. [PMID: 39122941 PMCID: PMC11348280 DOI: 10.1038/s41592-024-02359-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2022] [Accepted: 06/24/2024] [Indexed: 08/12/2024]

Wagle MM, Long S, Chen C, Liu C, Yang P. Interpretable deep learning in single-cell omics. Bioinformatics 2024;40:btae374. [PMID: 38889275 PMCID: PMC11211213 DOI: 10.1093/bioinformatics/btae374] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2024] [Revised: 05/11/2024] [Accepted: 06/12/2024] [Indexed: 06/20/2024] Open

Kim Y, Han Y, Hopper C, Lee J, Joo JI, Gong JR, Lee CK, Jang SH, Kang J, Kim T, Cho KH. A gray box framework that optimizes a white box logical model using a black box optimizer for simulating cellular responses to perturbations. CELL REPORTS METHODS 2024;4:100773. [PMID: 38744288 PMCID: PMC11133856 DOI: 10.1016/j.crmeth.2024.100773] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/13/2023] [Revised: 03/19/2024] [Accepted: 04/19/2024] [Indexed: 05/16/2024]

Affiliation(s)

Yunseong Kim Laboratory for Systems Biology and Bio-inspired Engineering, Department of Bio and Brain Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon 34141, Korea
Younghyun Han Laboratory for Systems Biology and Bio-inspired Engineering, Department of Bio and Brain Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon 34141, Korea
Corbin Hopper Laboratory for Systems Biology and Bio-inspired Engineering, Department of Bio and Brain Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon 34141, Korea
Jonghoon Lee Laboratory for Systems Biology and Bio-inspired Engineering, Department of Bio and Brain Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon 34141, Korea
Jae Il Joo Laboratory for Systems Biology and Bio-inspired Engineering, Department of Bio and Brain Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon 34141, Korea
Jeong-Ryeol Gong Laboratory for Systems Biology and Bio-inspired Engineering, Department of Bio and Brain Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon 34141, Korea
Chun-Kyung Lee Laboratory for Systems Biology and Bio-inspired Engineering, Department of Bio and Brain Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon 34141, Korea
Seong-Hoon Jang Laboratory for Systems Biology and Bio-inspired Engineering, Department of Bio and Brain Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon 34141, Korea
Junsoo Kang Laboratory for Systems Biology and Bio-inspired Engineering, Department of Bio and Brain Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon 34141, Korea
Taeyoung Kim Laboratory for Systems Biology and Bio-inspired Engineering, Department of Bio and Brain Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon 34141, Korea
Kwang-Hyun Cho Laboratory for Systems Biology and Bio-inspired Engineering, Department of Bio and Brain Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon 34141, Korea.

Collapse

Meimetis N, Lauffenburger DA, Nilsson A. Inference of drug off-target effects on cellular signaling using interactome-based deep learning. iScience 2024;27:109509. [PMID: 38591003 PMCID: PMC11000001 DOI: 10.1016/j.isci.2024.109509] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2023] [Revised: 02/04/2024] [Accepted: 03/13/2024] [Indexed: 04/10/2024] Open

Girard C. The tri-flow adaptiveness of codes in major evolutionary transitions. Biosystems 2024;237:105133. [PMID: 38336225 DOI: 10.1016/j.biosystems.2024.105133] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2023] [Revised: 01/26/2024] [Accepted: 01/27/2024] [Indexed: 02/12/2024]

Abstract

Life codes increase in both number and variety with biological complexity. Although our knowledge of codes is constantly expanding, the evolutionary progression of organic, neural, and cultural codes in response to selection pressure remains poorly understood. Greater clarification of the selective mechanisms is achieved by investigating how major evolutionary transitions reduce spatiotemporal and energetic constraints on transmitting heritable code to offspring. Evolution toward less constrained flows is integral to enduring flow architecture everywhere, in both engineered and natural flow systems. Beginning approximately 4 billion years ago, the most basic level for transmitting genetic material to offspring was initiated by protocell division. Evidence from ribosomes suggests that protocells transmitted comma-free or circular codes, preceding the evolution of standard genetic code. This rudimentary information flow within protocells is likely to have first emerged within the geo-energetic and geospatial constraints of hydrothermal vents. A broad-gauged hypothesis is that major evolutionary transitions overcame such constraints with tri-flow adaptations. The interconnected triple flows incorporated energy-converting, spatiotemporal, and code-based informational dynamics. Such tri-flow adaptations stacked sequence splicing code on top of protein-DNA recognition code in eukaryotes, prefiguring the transition to sexual reproduction. Sex overcame the spatiotemporal-energetic constraints of binary fission with further code stacking. Examples are tubulin code and transcription initiation code in vertebrates. In a later evolutionary transition, language reduced metabolic-spatiotemporal constraints on inheritance by stacking phonetic, phonological, and orthographic codes. In organisms that reproduce sexually, each major evolutionary transition is shown to be a tri-flow adaptation that adds new levels of code-based informational exchange. Evolving biological complexity is also shown to increase the nongenetic transmissibility of code.

Collapse

Xiong G, Xie N, Nie M, Ling R, Yun B, Xie J, Ren L, Huang Y, Wang W, Yi C, Zhang M, Xu X, Zhang C, Zou B, Zhang L, Liu X, Huang H, Chen D, Cao W, Wang C. Single-cell transcriptomics reveals cell atlas and identifies cycling tumor cells responsible for recurrence in ameloblastoma. Int J Oral Sci 2024;16:21. [PMID: 38424060 PMCID: PMC10904398 DOI: 10.1038/s41368-024-00281-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2023] [Revised: 01/04/2024] [Accepted: 01/05/2024] [Indexed: 03/02/2024] Open

Affiliation(s)

Gan Xiong Hospital of Stomatology, Sun Yat-sen University, Guangzhou, China Guangdong Provincial Key Laboratory of Stomatology, Sun Yat-sen University, Guangzhou, China Guanghua School of Stomatology, Sun Yat-sen University, Guangzhou, China
Nan Xie Hospital of Stomatology, Sun Yat-sen University, Guangzhou, China Guangdong Provincial Key Laboratory of Stomatology, Sun Yat-sen University, Guangzhou, China Guanghua School of Stomatology, Sun Yat-sen University, Guangzhou, China
Min Nie Department of Periodontics, Affiliated Stomatology Hospital of Guangzhou Medical University, Guangzhou Key Laboratory of Basic and Applied Research of Oral Regenerative Medicine, Guangzhou, China
Rongsong Ling Institute for Advanced Study, Shenzhen University, Shenzhen, China
Bokai Yun Hospital of Stomatology, Sun Yat-sen University, Guangzhou, China Guangdong Provincial Key Laboratory of Stomatology, Sun Yat-sen University, Guangzhou, China Guanghua School of Stomatology, Sun Yat-sen University, Guangzhou, China
Jiaxiang Xie Hospital of Stomatology, Sun Yat-sen University, Guangzhou, China Guangdong Provincial Key Laboratory of Stomatology, Sun Yat-sen University, Guangzhou, China Guanghua School of Stomatology, Sun Yat-sen University, Guangzhou, China
Linlin Ren Hospital of Stomatology, Sun Yat-sen University, Guangzhou, China Guangdong Provincial Key Laboratory of Stomatology, Sun Yat-sen University, Guangzhou, China Guanghua School of Stomatology, Sun Yat-sen University, Guangzhou, China
Yaqi Huang Hospital of Stomatology, Sun Yat-sen University, Guangzhou, China Guangdong Provincial Key Laboratory of Stomatology, Sun Yat-sen University, Guangzhou, China Guanghua School of Stomatology, Sun Yat-sen University, Guangzhou, China
Wenjin Wang Hospital of Stomatology, Sun Yat-sen University, Guangzhou, China Guangdong Provincial Key Laboratory of Stomatology, Sun Yat-sen University, Guangzhou, China Guanghua School of Stomatology, Sun Yat-sen University, Guangzhou, China
Chen Yi Hospital of Stomatology, Sun Yat-sen University, Guangzhou, China Guangdong Provincial Key Laboratory of Stomatology, Sun Yat-sen University, Guangzhou, China Guanghua School of Stomatology, Sun Yat-sen University, Guangzhou, China
Ming Zhang Hospital of Stomatology, Sun Yat-sen University, Guangzhou, China Guangdong Provincial Key Laboratory of Stomatology, Sun Yat-sen University, Guangzhou, China Guanghua School of Stomatology, Sun Yat-sen University, Guangzhou, China
Xiuyun Xu Hospital of Stomatology, Sun Yat-sen University, Guangzhou, China Guangdong Provincial Key Laboratory of Stomatology, Sun Yat-sen University, Guangzhou, China Guanghua School of Stomatology, Sun Yat-sen University, Guangzhou, China
Caihua Zhang Center for Translational Medicine, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, China
Bin Zou State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, China
Leitao Zhang Department of Oral and Maxillofacial Surgery, Nanfang Hospital, Southern Medical University, Guangzhou, China
Xiqiang Liu Department of Oral and Maxillofacial Surgery, Nanfang Hospital, Southern Medical University, Guangzhou, China
Hongzhang Huang Hospital of Stomatology, Sun Yat-sen University, Guangzhou, China Guangdong Provincial Key Laboratory of Stomatology, Sun Yat-sen University, Guangzhou, China Guanghua School of Stomatology, Sun Yat-sen University, Guangzhou, China
Demeng Chen Center for Translational Medicine, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, China
Wei Cao Department of Oral and Maxillofacial & Head and Neck Oncology, Shanghai Ninth People's Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China. National Center for Stomatology, National Clinical Research Center for Oral diseases, Shanghai Key Laboratory of Stomatology, Shanghai, China.
Cheng Wang Hospital of Stomatology, Sun Yat-sen University, Guangzhou, China. Guangdong Provincial Key Laboratory of Stomatology, Sun Yat-sen University, Guangzhou, China. Guanghua School of Stomatology, Sun Yat-sen University, Guangzhou, China.

Collapse

Holton E, Muskovic W, Powell JE. Deciphering cancer cell state plasticity with single-cell genomics and artificial intelligence. Genome Med 2024;16:36. [PMID: 38409176 PMCID: PMC10897991 DOI: 10.1186/s13073-024-01309-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2023] [Accepted: 02/21/2024] [Indexed: 02/28/2024] Open

Brunnsåker D, Kronström F, Tiukova IA, King RD. Interpreting protein abundance in Saccharomyces cerevisiae through relational learning. Bioinformatics 2024;40:btae050. [PMID: 38273672 PMCID: PMC10868306 DOI: 10.1093/bioinformatics/btae050] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2023] [Revised: 01/16/2024] [Accepted: 01/23/2024] [Indexed: 01/27/2024] Open

Tjärnberg A, Beheler-Amass M, Jackson CA, Christiaen LA, Gresham D, Bonneau R. Structure-primed embedding on the transcription factor manifold enables transparent model architectures for gene regulatory network and latent activity inference. Genome Biol 2024;25:24. [PMID: 38238840 PMCID: PMC10797903 DOI: 10.1186/s13059-023-03134-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2023] [Accepted: 11/30/2023] [Indexed: 01/22/2024] Open

Xie L, Raj Y, Varathan P, He B, Yu M, Nho K, Salama P, Saykin AJ, Yan J. Deep Trans-Omic Network Fusion for Molecular Mechanism of Alzheimer's Disease. J Alzheimers Dis 2024;99:715-727. [PMID: 38728189 DOI: 10.3233/jad-240098] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/12/2024]

Abstract

Background

There are various molecular hypotheses regarding Alzheimer's disease (AD) like amyloid deposition, tau propagation, neuroinflammation, and synaptic dysfunction. However, detailed molecular mechanism underlying AD remains elusive. In addition, genetic contribution of these molecular hypothesis is not yet established despite the high heritability of AD.

Objective

The study aims to enable the discovery of functionally connected multi-omic features through novel integration of multi-omic data and prior functional interactions.

Methods

We propose a new deep learning model MoFNet with improved interpretability to investigate the AD molecular mechanism and its upstream genetic contributors. MoFNet integrates multi-omic data with prior functional interactions between SNPs, genes, and proteins, and for the first time models the dynamic information flow from DNA to RNA and proteins.

Results

When evaluated using the ROS/MAP cohort, MoFNet outperformed other competing methods in prediction performance. It identified SNPs, genes, and proteins with significantly more prior functional interactions, resulting in three multi-omic subnetworks. SNP-gene pairs identified by MoFNet were mostly eQTLs specific to frontal cortex tissue where gene/protein data was collected. These molecular subnetworks are enriched in innate immune system, clearance of misfolded proteins, and neurotransmitter release respectively. We validated most findings in an independent dataset. One multi-omic subnetwork consists exclusively of core members of SNARE complex, a key mediator of synaptic vesicle fusion and neurotransmitter transportation.

Conclusions

Our results suggest that MoFNet is effective in improving classification accuracy and in identifying multi-omic markers for AD with improved interpretability. Multi-omic subnetworks identified by MoFNet provided insights of AD molecular mechanism with improved details.

Collapse

Monshizadeh M, Ye Y. Incorporating metabolic activity, taxonomy and community structure to improve microbiome-based predictive models for host phenotype prediction. Gut Microbes 2024;16:2302076. [PMID: 38214657 PMCID: PMC10793686 DOI: 10.1080/19490976.2024.2302076] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/17/2023] [Accepted: 01/02/2024] [Indexed: 01/13/2024] Open

YOUSEF M, ALLMER J. Deep learning in bioinformatics. Turk J Biol 2023;47:366-382. [PMID: 38681776 PMCID: PMC11045206 DOI: 10.55730/1300-0152.2671] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Revised: 12/28/2023] [Accepted: 12/18/2023] [Indexed: 05/01/2024] Open

Kouba P, Kohout P, Haddadi F, Bushuiev A, Samusevich R, Sedlar J, Damborsky J, Pluskal T, Sivic J, Mazurenko S. Machine Learning-Guided Protein Engineering. ACS Catal 2023;13:13863-13895. [PMID: 37942269 PMCID: PMC10629210 DOI: 10.1021/acscatal.3c02743] [Citation(s) in RCA: 36] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Revised: 09/20/2023] [Indexed: 11/10/2023]

Affiliation(s)

Petr Kouba Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5, 625 00 Brno, Czech Republic Czech Institute of Informatics, Robotics and Cybernetics, Czech Technical University in Prague, Jugoslavskych partyzanu 1580/3, 160 00 Prague 6, Czech Republic Faculty of Electrical Engineering, Czech Technical University in Prague, Technicka 2, 166 27 Prague 6, Czech Republic
Pavel Kohout Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5, 625 00 Brno, Czech Republic International Clinical Research Center, St. Anne’s University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
Faraneh Haddadi Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5, 625 00 Brno, Czech Republic International Clinical Research Center, St. Anne’s University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
Anton Bushuiev Czech Institute of Informatics, Robotics and Cybernetics, Czech Technical University in Prague, Jugoslavskych partyzanu 1580/3, 160 00 Prague 6, Czech Republic
Raman Samusevich Czech Institute of Informatics, Robotics and Cybernetics, Czech Technical University in Prague, Jugoslavskych partyzanu 1580/3, 160 00 Prague 6, Czech Republic Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Flemingovo nám. 2, 160 00 Prague 6, Czech Republic
Jiri Sedlar Czech Institute of Informatics, Robotics and Cybernetics, Czech Technical University in Prague, Jugoslavskych partyzanu 1580/3, 160 00 Prague 6, Czech Republic
Jiri Damborsky Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5, 625 00 Brno, Czech Republic International Clinical Research Center, St. Anne’s University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
Tomas Pluskal Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Flemingovo nám. 2, 160 00 Prague 6, Czech Republic
Josef Sivic Czech Institute of Informatics, Robotics and Cybernetics, Czech Technical University in Prague, Jugoslavskych partyzanu 1580/3, 160 00 Prague 6, Czech Republic
Stanislav Mazurenko Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5, 625 00 Brno, Czech Republic International Clinical Research Center, St. Anne’s University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic

Collapse

Tarquino J, Arabyarmohammadi S, Tejada RE, Madabhushi A, Romero E. Intra-nucleus mosaic pattern (InMop) and whole-cell Haralick combined-descriptor for identifying and characterizing acute leukemia blasts on single cell peripheral blood images. Cytometry A 2023;103:857-867. [PMID: 37565838 PMCID: PMC10841385 DOI: 10.1002/cyto.a.24785] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2022] [Revised: 07/14/2023] [Accepted: 08/08/2023] [Indexed: 08/12/2023]

Abstract

Acute leukemia is usually diagnosed when a test of peripheral blood shows at least 20% of abnormal immature cells (blasts), a figure even lower in case of recurrent cytogenetic abnormalities. Blast identification is crucial for white blood cell (WBC) counting, which depends on both identifying the cell type and characterizing the cellular morphology, processes susceptible of inter- and intraobserver variability. The present work introduces an image combined-descriptor to detect blasts and determine their probable lineage. This strategy uses an intra-nucleus mosaic pattern (InMop) descriptor that captures subtle nuclei differences within WBCs, and Haralick's statistics which quantify the local structure of both nucleus and cytoplasm. The InMop captures WBC inner-nucleus structure by applying a multiscale Shearlet decomposition over a repetitive pattern (mosaic) of automatically-segmented nuclei. As a complement, Haralick's statistics characterize the local structure of the whole cell from an intensity co-occurrence matrix representation. Both InMoP and Haralick-based descriptors are calculated using the b-channel from Lab color-space. The combined-descriptor is assessed by differentiating blasts from nonleukemic cells with support vector machine (SVM) classifiers and different transformation kernels, in two public and independent databases. The first database-D1 (n = 260) is composed of healthy and acute lymphoid leukemia (ALL) single cell images, and second database-D2 contains acute myeloid leukemia (AML) blasts (n = 3294) and nonblast (n = 15,071) cell images. In a first experiment, blasts versus nonblast differentiation is performed by training with a subset of D2 (n = 6588) and testing in D1 (n = 260), obtaining a training AUC of 0.991 ± 0.002 and AUC = 0.782 for the independent validation. A second experiment automatically differentiates AML blasts (260 images from D2) from ALL blasts (260 images from D1), with an AUC of 0.93. In a third experiment, state-of-the-art strategies, VGG16 and RESNEXT convolutional neural networks (CNN), separate blast from nonblast cells in both databases. The VGG16 showed an AUC of 0.673 and the RESNEXT of 0.75. Reported metrics for all the experiments are area under the ROC curve (AUC), accuracy and F1-score.

Collapse

Wang K, Theeke LA, Liao C, Wang N, Lu Y, Xiao D, Xu C. Deep learning analysis of UPLC-MS/MS-based metabolomics data to predict Alzheimer's disease. J Neurol Sci 2023;453:120812. [PMID: 37776718 DOI: 10.1016/j.jns.2023.120812] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2023] [Revised: 08/22/2023] [Accepted: 09/14/2023] [Indexed: 10/02/2023]

Esser-Skala W, Fortelny N. Reliable interpretability of biology-inspired deep neural networks. NPJ Syst Biol Appl 2023;9:50. [PMID: 37816807 PMCID: PMC10564878 DOI: 10.1038/s41540-023-00310-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2023] [Accepted: 09/15/2023] [Indexed: 10/12/2023] Open

Halawani R, Buchert M, Chen YPP. Deep learning exploration of single-cell and spatially resolved cancer transcriptomics to unravel tumour heterogeneity. Comput Biol Med 2023;164:107274. [PMID: 37506451 DOI: 10.1016/j.compbiomed.2023.107274] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2023] [Revised: 07/03/2023] [Accepted: 07/16/2023] [Indexed: 07/30/2023]

Faure L, Mollet B, Liebermeister W, Faulon JL. A neural-mechanistic hybrid approach improving the predictive power of genome-scale metabolic models. Nat Commun 2023;14:4669. [PMID: 37537192 PMCID: PMC10400647 DOI: 10.1038/s41467-023-40380-0] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2022] [Accepted: 07/19/2023] [Indexed: 08/05/2023] Open

Wysocka M, Wysocki O, Zufferey M, Landers D, Freitas A. A systematic review of biologically-informed deep learning models for cancer: fundamental trends for encoding and interpreting oncology data. BMC Bioinformatics 2023;24:198. [PMID: 37189058 PMCID: PMC10186658 DOI: 10.1186/s12859-023-05262-8] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2022] [Accepted: 03/30/2023] [Indexed: 05/17/2023] Open

Abstract

BACKGROUND

There is an increasing interest in the use of Deep Learning (DL) based methods as a supporting analytical framework in oncology. However, most direct applications of DL will deliver models with limited transparency and explainability, which constrain their deployment in biomedical settings.

METHODS

This systematic review discusses DL models used to support inference in cancer biology with a particular emphasis on multi-omics analysis. It focuses on how existing models address the need for better dialogue with prior knowledge, biological plausibility and interpretability, fundamental properties in the biomedical domain. For this, we retrieved and analyzed 42 studies focusing on emerging architectural and methodological advances, the encoding of biological domain knowledge and the integration of explainability methods.

RESULTS

We discuss the recent evolutionary arch of DL models in the direction of integrating prior biological relational and network knowledge to support better generalisation (e.g. pathways or Protein-Protein-Interaction networks) and interpretability. This represents a fundamental functional shift towards models which can integrate mechanistic and statistical inference aspects. We introduce a concept of bio-centric interpretability and according to its taxonomy, we discuss representational methodologies for the integration of domain prior knowledge in such models.

CONCLUSIONS

The paper provides a critical outlook into contemporary methods for explainability and interpretability used in DL for cancer. The analysis points in the direction of a convergence between encoding prior knowledge and improved interpretability. We introduce bio-centric interpretability which is an important step towards formalisation of biological interpretability of DL models and developing methods that are less problem- or application-specific.

Collapse

Hosseini-Gerami L, Higgins IA, Collier DA, Laing E, Evans D, Broughton H, Bender A. Benchmarking causal reasoning algorithms for gene expression-based compound mechanism of action analysis. BMC Bioinformatics 2023;24:154. [PMID: 37072707 PMCID: PMC10111792 DOI: 10.1186/s12859-023-05277-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2022] [Accepted: 04/06/2023] [Indexed: 04/20/2023] Open

Abstract

BACKGROUND

Elucidating compound mechanism of action (MoA) is beneficial to drug discovery, but in practice often represents a significant challenge. Causal Reasoning approaches aim to address this situation by inferring dysregulated signalling proteins using transcriptomics data and biological networks; however, a comprehensive benchmarking of such approaches has not yet been reported. Here we benchmarked four causal reasoning algorithms (SigNet, CausalR, CausalR ScanR and CARNIVAL) with four networks (the smaller Omnipath network vs. 3 larger MetaBase™ networks), using LINCS L1000 and CMap microarray data, and assessed to what extent each factor dictated the successful recovery of direct targets and compound-associated signalling pathways in a benchmark dataset comprising 269 compounds. We additionally examined impact on performance in terms of the functions and roles of protein targets and their connectivity bias in the prior knowledge networks.

RESULTS

According to statistical analysis (negative binomial model), the combination of algorithm and network most significantly dictated the performance of causal reasoning algorithms, with the SigNet recovering the greatest number of direct targets. With respect to the recovery of signalling pathways, CARNIVAL with the Omnipath network was able to recover the most informative pathways containing compound targets, based on the Reactome pathway hierarchy. Additionally, CARNIVAL, SigNet and CausalR ScanR all outperformed baseline gene expression pathway enrichment results. We found no significant difference in performance between L1000 data or microarray data, even when limited to just 978 'landmark' genes. Notably, all causal reasoning algorithms also outperformed pathway recovery based on input DEGs, despite these often being used for pathway enrichment. Causal reasoning methods performance was somewhat correlated with connectivity and biological role of the targets.

CONCLUSIONS

Overall, we conclude that causal reasoning performs well at recovering signalling proteins related to compound MoA upstream from gene expression changes by leveraging prior knowledge networks, and that the choice of network and algorithm has a profound impact on the performance of causal reasoning algorithms. Based on the analyses presented here this is true for both microarray-based gene expression data as well as those based on the L1000 platform.

Collapse

Tjärnberg A, Beheler-Amass M, Jackson CA, Christiaen LA, Gresham D, Bonneau R. Structure primed embedding on the transcription factor manifold enables transparent model architectures for gene regulatory network and latent activity inference. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.02.02.526909. [PMID: 36778259 PMCID: PMC9915715 DOI: 10.1101/2023.02.02.526909] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]

Novakovsky G, Dexter N, Libbrecht MW, Wasserman WW, Mostafavi S. Obtaining genetics insights from deep learning via explainable artificial intelligence. Nat Rev Genet 2023;24:125-137. [PMID: 36192604 DOI: 10.1038/s41576-022-00532-2] [Citation(s) in RCA: 96] [Impact Index Per Article: 48.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/31/2022] [Indexed: 01/24/2023]

Lotfollahi M, Rybakov S, Hrovatin K, Hediyeh-Zadeh S, Talavera-López C, Misharin AV, Theis FJ. Biologically informed deep learning to query gene programs in single-cell atlases. Nat Cell Biol 2023;25:337-350. [PMID: 36732632 PMCID: PMC9928587 DOI: 10.1038/s41556-022-01072-x] [Citation(s) in RCA: 21] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2022] [Accepted: 12/08/2022] [Indexed: 02/04/2023]

Li MM, Huang K, Zitnik M. Graph representation learning in biomedicine and healthcare. Nat Biomed Eng 2022;6:1353-1369. [PMID: 36316368 PMCID: PMC10699434 DOI: 10.1038/s41551-022-00942-x] [Citation(s) in RCA: 58] [Impact Index Per Article: 19.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2021] [Accepted: 08/09/2022] [Indexed: 11/11/2022]

Ghosh Roy G, Geard N, Verspoor K, He S. MPVNN: Mutated Pathway Visible Neural Network architecture for interpretable prediction of cancer-specific survival risk. Bioinformatics 2022;38:5026-5032. [PMID: 36124954 DOI: 10.1093/bioinformatics/btac636] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2022] [Revised: 08/04/2022] [Accepted: 09/16/2022] [Indexed: 12/24/2022] Open

Kolmar L, Autour A, Ma X, Vergier B, Eduati F, Merten CA. Technological and computational advances driving high-throughput oncology. Trends Cell Biol 2022;32:947-961. [PMID: 35577671 DOI: 10.1016/j.tcb.2022.04.008] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2022] [Revised: 04/11/2022] [Accepted: 04/20/2022] [Indexed: 01/21/2023]

Brendel M, Su C, Bai Z, Zhang H, Elemento O, Wang F. Application of Deep Learning on Single-cell RNA Sequencing Data Analysis: A Review. GENOMICS, PROTEOMICS & BIOINFORMATICS 2022;20:814-835. [PMID: 36528240 PMCID: PMC10025684 DOI: 10.1016/j.gpb.2022.11.011] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/23/2022] [Revised: 08/17/2022] [Accepted: 11/24/2022] [Indexed: 12/23/2022]

Nussinov R, Zhang M, Liu Y, Jang H. AlphaFold, Artificial Intelligence (AI), and Allostery. J Phys Chem B 2022;126:6372-6383. [PMID: 35976160 PMCID: PMC9442638 DOI: 10.1021/acs.jpcb.2c04346] [Citation(s) in RCA: 55] [Impact Index Per Article: 18.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2022] [Revised: 08/03/2022] [Indexed: 02/08/2023]

Zhao X, Lan Y, Chen D. Exploring long non-coding RNA networks from single cell omics data. Comput Struct Biotechnol J 2022;20:4381-4389. [PMID: 36051880 PMCID: PMC9403499 DOI: 10.1016/j.csbj.2022.08.003] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2022] [Revised: 08/01/2022] [Accepted: 08/01/2022] [Indexed: 11/03/2022] Open

Garrido‐Rodriguez M, Zirngibl K, Ivanova O, Lobentanzer S, Saez‐Rodriguez J. Integrating knowledge and omics to decipher mechanisms via large-scale models of signaling networks. Mol Syst Biol 2022;18:e11036. [PMID: 35880747 PMCID: PMC9316933 DOI: 10.15252/msb.202211036] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2022] [Revised: 05/12/2022] [Accepted: 05/31/2022] [Indexed: 11/10/2022] Open

Wei Z, Han D, Zhang C, Wang S, Liu J, Chao F, Song Z, Chen G. Deep Learning-Based Multi-Omics Integration Robustly Predicts Relapse in Prostate Cancer. Front Oncol 2022;12:893424. [PMID: 35814412 PMCID: PMC9259796 DOI: 10.3389/fonc.2022.893424] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2022] [Accepted: 05/13/2022] [Indexed: 11/13/2022] Open

Abstract ObjectivePost-operative biochemical relapse (BCR) continues to occur in a significant percentage of patients with localized prostate cancer (PCa). Current stratification methods are not adequate to identify high-risk patients. The present study exploits the ability of deep learning (DL) algorithms using the H2O package to combine multi-omics data to resolve this problem.MethodsFive-omics data from 417 PCa patients from The Cancer Genome Atlas (TCGA) were used to construct the DL-based, relapse-sensitive model. Among them, 265 (63.5%) individuals experienced BCR. Five additional independent validation sets were applied to assess its predictive robustness. Bioinformatics analyses of two relapse-associated subgroups were then performed for identification of differentially expressed genes (DEGs), enriched pathway analysis, copy number analysis and immune cell infiltration analysis.ResultsThe DL-based model, with a significant difference (P = 6e-9) between two subgroups and good concordance index (C-index = 0.767), were proven to be robust by external validation. 1530 DEGs including 678 up- and 852 down-regulated genes were identified in the high-risk subgroup S2 compared with the low-risk subgroup S1. Enrichment analyses found five hallmark gene sets were up-regulated while 13 were down-regulated. Then, we found that DNA damage repair pathways were significantly enriched in the S2 subgroup. CNV analysis showed that 30.18% of genes were significantly up-regulated and gene amplification on chromosomes 7 and 8 was significantly elevated in the S2 subgroup. Moreover, enrichment analysis revealed that some DEGs and pathways were associated with immunity. Three tumor-infiltrating immune cell (TIIC) groups with a higher proportion in the S2 subgroup (p = 1e-05, p = 8.7e-06, p = 0.00014) and one TIIC group with a higher proportion in the S1 subgroup (P = 1.3e-06) were identified.ConclusionWe developed a novel, robust classification for understanding PCa relapse. This study validated the effectiveness of deep learning technique in prognosis prediction, and the method may benefit patients and prevent relapse by improving early detection and advancing early intervention. Collapse

Nilsson A, Peters JM, Meimetis N, Bryson B, Lauffenburger DA. Artificial neural networks enable genome-scale simulations of intracellular signaling. Nat Commun 2022;13:3069. [PMID: 35654811 PMCID: PMC9163072 DOI: 10.1038/s41467-022-30684-y] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2021] [Accepted: 05/11/2022] [Indexed: 12/14/2022] Open

Lee D, Kim S. Knowledge-guided artificial intelligence technologies for decoding complex multiomics interactions in cells. Clin Exp Pediatr 2022;65:239-249. [PMID: 34844399 PMCID: PMC9082244 DOI: 10.3345/cep.2021.01438] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/15/2021] [Revised: 10/19/2021] [Accepted: 10/21/2021] [Indexed: 11/27/2022] Open

Abstract

Cells survive and proliferate through complex interactions among diverse molecules across multiomics layers. Conventional experimental approaches for identifying these interactions have built a firm foundation for molecular biology, but their scalability is gradually becoming inadequate compared to the rapid accumulation of multiomics data measured by high-throughput technologies. Therefore, the need for data-driven computational modeling of interactions within cells has been highlighted in recent years. The complexity of multiomics interactions is primarily due to their nonlinearity. That is, their accurate modeling requires intricate conditional dependencies, synergies, or antagonisms between considered genes or proteins, which retard experimental validations. Artificial intelligence (AI) technologies, including deep learning models, are optimal choices for handling complex nonlinear relationships between features that are scalable and produce large amounts of data. Thus, they have great potential for modeling multiomics interactions. Although there exist many AI-driven models for computational biology applications, relatively few explicitly incorporate the prior knowledge within model architectures or training procedures. Such guidance of models by domain knowledge will greatly reduce the amount of data needed to train models and constrain their vast expressive powers to focus on the biologically relevant space. Therefore, it can enhance a model's interpretability, reduce spurious interactions, and prove its validity and utility. Thus, to facilitate further development of knowledge-guided AI technologies for the modeling of multiomics interactions, here we review representative bioinformatics applications of deep learning models for multiomics interactions developed to date by categorizing them by guidance mode.

Collapse

Trapotsi MA, Hosseini-Gerami L, Bender A. Computational analyses of mechanism of action (MoA): data, methods and integration. RSC Chem Biol 2022;3:170-200. [PMID: 35360890 PMCID: PMC8827085 DOI: 10.1039/d1cb00069a] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2021] [Accepted: 12/09/2021] [Indexed: 12/15/2022] Open

Gundogdu P, Loucera C, Alamo-Alvarez I, Dopazo J, Nepomuceno I. Integrating pathway knowledge with deep neural networks to reduce the dimensionality in single-cell RNA-seq data. BioData Min 2022;15:1. [PMID: 34980200 PMCID: PMC8722116 DOI: 10.1186/s13040-021-00285-4] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2021] [Accepted: 12/04/2021] [Indexed: 11/13/2022] Open

Abstract

Background

Single-cell RNA sequencing (scRNA-seq) data provide valuable insights into cellular heterogeneity which is significantly improving the current knowledge on biology and human disease. One of the main applications of scRNA-seq data analysis is the identification of new cell types and cell states. Deep neural networks (DNNs) are among the best methods to address this problem. However, this performance comes with the trade-off for a lack of interpretability in the results. In this work we propose an intelligible pathway-driven neural network to correctly solve cell-type related problems at single-cell resolution while providing a biologically meaningful representation of the data.

Results

In this study, we explored the deep neural networks constrained by several types of prior biological information, e.g. signaling pathway information, as a way to reduce the dimensionality of the scRNA-seq data. We have tested the proposed biologically-based architectures on thousands of cells of human and mouse origin across a collection of public datasets in order to check the performance of the model. Specifically, we tested the architecture across different validation scenarios that try to mimic how unknown cell types are clustered by the DNN and how it correctly annotates cell types by querying a database in a retrieval problem. Moreover, our approach demonstrated to be comparable to other less interpretable DNN approaches constrained by using protein-protein interactions gene regulation data. Finally, we show how the latent structure learned by the network could be used to visualize and to interpret the composition of human single cell datasets.

Conclusions

Here we demonstrate how the integration of pathways, which convey fundamental information on functional relationships between genes, with DNNs, that provide an excellent classification framework, results in an excellent alternative to learn a biologically meaningful representation of scRNA-seq data. In addition, the introduction of prior biological knowledge in the DNN reduces the size of the network architecture. Comparative results demonstrate a superior performance of this approach with respect to other similar approaches. As an additional advantage, the use of pathways within the DNN structure enables easy interpretability of the results by connecting features to cell functionalities by means of the pathway nodes, as demonstrated with an example with human melanoma tumor cells.

Supplementary Information

The online version contains supplementary material available at 10.1186/s13040-021-00285-4.

Collapse

Huminiecki Ł. Virtual Gene Concept and a Corresponding Pragmatic Research Program in Genetical Data Science. ENTROPY (BASEL, SWITZERLAND) 2021;24:17. [PMID: 35052043 PMCID: PMC8774939 DOI: 10.3390/e24010017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/30/2021] [Revised: 12/02/2021] [Accepted: 12/14/2021] [Indexed: 06/14/2023]

Shao D, Dai Y, Li N, Cao X, Zhao W, Cheng L, Rong Z, Huang L, Wang Y, Zhao J. Artificial intelligence in clinical research of cancers. Brief Bioinform 2021;23:6470966. [PMID: 34929741 PMCID: PMC8769909 DOI: 10.1093/bib/bbab523] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2021] [Revised: 11/06/2021] [Accepted: 11/13/2021] [Indexed: 12/16/2022] Open

Scherer P, Trębacz M, Simidjievski N, Viñas R, Shams Z, Terre HA, Jamnik M, Liò P. Unsupervised construction of computational graphs for gene expression data with explicit structural inductive biases. Bioinformatics 2021;38:1320-1327. [PMID: 34888618 PMCID: PMC8826027 DOI: 10.1093/bioinformatics/btab830] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2021] [Revised: 09/29/2021] [Accepted: 12/03/2021] [Indexed: 01/05/2023] Open

Abstract

MOTIVATION

Gene expression data are commonly used at the intersection of cancer research and machine learning for better understanding of the molecular status of tumour tissue. Deep learning predictive models have been employed for gene expression data due to their ability to scale and remove the need for manual feature engineering. However, gene expression data are often very high dimensional, noisy and presented with a low number of samples. This poses significant problems for learning algorithms: models often overfit, learn noise and struggle to capture biologically relevant information. In this article, we utilize external biological knowledge embedded within structures of gene interaction graphs such as protein-protein interaction (PPI) networks to guide the construction of predictive models.

RESULTS

We present Gene Interaction Network Constrained Construction (GINCCo), an unsupervised method for automated construction of computational graph models for gene expression data that are structurally constrained by prior knowledge of gene interaction networks. We employ this methodology in a case study on incorporating a PPI network in cancer phenotype prediction tasks. Our computational graphs are structurally constructed using topological clustering algorithms on the PPI networks which incorporate inductive biases stemming from network biology research on protein complex discovery. Each of the entities in the GINCCo computational graph represents biological entities such as genes, candidate protein complexes and phenotypes instead of arbitrary hidden nodes of a neural network. This provides a biologically relevant mechanism for model regularization yielding strong predictive performance while drastically reducing the number of model parameters and enabling guided post-hoc enrichment analyses of influential gene sets with respect to target phenotypes. Our experiments analysing a variety of cancer phenotypes show that GINCCo often outperforms support vector machine, Fully Connected Multi-layer Perceptrons (MLP) and Randomly Connected MLPs despite greatly reduced model complexity.

AVAILABILITY AND IMPLEMENTATION

https://github.com/paulmorio/gincco contains the source code for our approach. We also release a library with algorithms for protein complex discovery within PPI networks at https://github.com/paulmorio/protclus. This repository contains implementations of the clustering algorithms used in this article.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

Collapse

Bao S, Li K, Yan C, Zhang Z, Qu J, Zhou M. Deep learning-based advances and applications for single-cell RNA-sequencing data analysis. Brief Bioinform 2021;23:6444320. [PMID: 34849562 DOI: 10.1093/bib/bbab473] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2021] [Revised: 09/24/2021] [Accepted: 10/15/2021] [Indexed: 11/14/2022] Open