1
|
Qin Y, Huo M, Liu X, Li SC. Biomarkers and computational models for predicting efficacy to tumor ICI immunotherapy. Front Immunol 2024; 15:1368749. [PMID: 38524135 PMCID: PMC10957591 DOI: 10.3389/fimmu.2024.1368749] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2024] [Accepted: 02/27/2024] [Indexed: 03/26/2024] Open
Abstract
Numerous studies have shown that immune checkpoint inhibitor (ICI) immunotherapy has great potential as a cancer treatment, leading to significant clinical improvements in numerous cases. However, it benefits a minority of patients, underscoring the importance of discovering reliable biomarkers that can be used to screen for potential beneficiaries and ultimately reduce the risk of overtreatment. Our comprehensive review focuses on the latest advancements in predictive biomarkers for ICI therapy, particularly emphasizing those that enhance the efficacy of programmed cell death protein 1 (PD-1)/programmed cell death-ligand 1 (PD-L1) inhibitors and cytotoxic T-lymphocyte antigen-4 (CTLA-4) inhibitors immunotherapies. We explore biomarkers derived from various sources, including tumor cells, the tumor immune microenvironment (TIME), body fluids, gut microbes, and metabolites. Among them, tumor cells-derived biomarkers include tumor mutational burden (TMB) biomarker, tumor neoantigen burden (TNB) biomarker, microsatellite instability (MSI) biomarker, PD-L1 expression biomarker, mutated gene biomarkers in pathways, and epigenetic biomarkers. TIME-derived biomarkers include immune landscape of TIME biomarkers, inhibitory checkpoints biomarkers, and immune repertoire biomarkers. We also discuss various techniques used to detect and assess these biomarkers, detailing their respective datasets, strengths, weaknesses, and evaluative metrics. Furthermore, we present a comprehensive review of computer models for predicting the response to ICI therapy. The computer models include knowledge-based mechanistic models and data-based machine learning (ML) models. Among the knowledge-based mechanistic models are pharmacokinetic/pharmacodynamic (PK/PD) models, partial differential equation (PDE) models, signal networks-based models, quantitative systems pharmacology (QSP) models, and agent-based models (ABMs). ML models include linear regression models, logistic regression models, support vector machine (SVM)/random forest/extra trees/k-nearest neighbors (KNN) models, artificial neural network (ANN) and deep learning models. Additionally, there are hybrid models of systems biology and ML. We summarized the details of these models, outlining the datasets they utilize, their evaluation methods/metrics, and their respective strengths and limitations. By summarizing the major advances in the research on predictive biomarkers and computer models for the therapeutic effect and clinical utility of tumor ICI, we aim to assist researchers in choosing appropriate biomarkers or computer models for research exploration and help clinicians conduct precision medicine by selecting the best biomarkers.
Collapse
Affiliation(s)
- Yurong Qin
- Department of Computer Science, City University of Hong Kong, Kowloon, China
- City University of Hong Kong Shenzhen Research Institute, Shenzhen, Guangdong, China
| | - Miaozhe Huo
- Department of Computer Science, City University of Hong Kong, Kowloon, China
- City University of Hong Kong Shenzhen Research Institute, Shenzhen, Guangdong, China
| | - Xingwu Liu
- School of Mathematical Sciences, Dalian University of Technology, Dalian, Liaoning, China
| | - Shuai Cheng Li
- Department of Computer Science, City University of Hong Kong, Kowloon, China
- City University of Hong Kong Shenzhen Research Institute, Shenzhen, Guangdong, China
| |
Collapse
|
2
|
Azuma I, Mizuno T, Morita K, Suzuki Y, Kusuhara H. Investigation of the usefulness of liver-specific deconvolution method by establishing a liver benchmark dataset. NAR Genom Bioinform 2024; 6:lqad111. [PMID: 38187088 PMCID: PMC10768887 DOI: 10.1093/nargab/lqad111] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2023] [Revised: 10/31/2023] [Accepted: 12/16/2023] [Indexed: 01/09/2024] Open
Abstract
Immune responses in the liver are related to the development and progression of liver failure, and precise prediction of their behavior is important. Deconvolution is a methodology for estimating the immune cell proportions from the transcriptome, and it is mainly applied to blood-derived samples and tumor tissues. However, the influence of tissue-specific modeling on the estimation results has rarely been investigated. Here, we constructed a system to evaluate the performance of the deconvolution method on liver transcriptome data. We prepared seven mouse liver injury models using small-molecule compounds and established a benchmark dataset with corresponding liver bulk RNA-Seq and immune cell proportions. RNA-Seq expression for nine leukocyte subsets and four liver-associated cell types were obtained from the Gene Expression Omnibus to provide a reference. We found that the combination of reference cell sets affects the estimation results of reference-based deconvolution methods and established a liver-specific deconvolution by optimizing the reference cell set for each cell to be estimated. We applied this model to independent datasets and showed that liver-specific modeling is highly extrapolatable. We expect that this approach will enable sophisticated estimation from rich tissue data accumulated in public databases and to obtain information on aggregated immune cell trafficking.
Collapse
Affiliation(s)
- Iori Azuma
- Graduate School of Pharmaceutical Sciences, the University of Tokyo, Bunkyo, Tokyo 113-0033, Japan
| | - Tadahaya Mizuno
- To whom correspondence should be addressed. Tel: +81 3 5841 4771; Fax: +81 3 5841 4766;
| | - Katsuhisa Morita
- Graduate School of Pharmaceutical Sciences, the University of Tokyo, Bunkyo, Tokyo 113-0033, Japan
| | - Yutaka Suzuki
- Graduate School of Frontier Sciences, The University of Tokyo, Chiba 277-8561, Japan
| | - Hiroyuki Kusuhara
- Graduate School of Pharmaceutical Sciences, the University of Tokyo, Bunkyo, Tokyo 113-0033, Japan
| |
Collapse
|
3
|
Jiang Y, Chen Z, Han N, Shang J, Wu A. sc-ImmuCC: hierarchical annotation for immune cell types in single-cell RNA-seq. Front Immunol 2023; 14:1223471. [PMID: 37545533 PMCID: PMC10399579 DOI: 10.3389/fimmu.2023.1223471] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2023] [Accepted: 07/03/2023] [Indexed: 08/08/2023] Open
Abstract
Accurately identifying immune cell types in single-cell RNA-sequencing (scRNA-Seq) data is critical to uncovering immune responses in health or disease conditions. However, the high heterogeneity and sparsity of scRNA-Seq data, as well as the similarity in gene expression among immune cell types, poses a great challenge for accurate identification of immune cell types in scRNA-Seq data. Here, we developed a tool named sc-ImmuCC for hierarchical annotation of immune cell types from scRNA-Seq data, based on the optimized gene sets and ssGSEA algorithm. sc-ImmuCC simulates the natural differentiation of immune cells, and the hierarchical annotation includes three layers, which can annotate nine major immune cell types and 29 cell subtypes. The test results showed its stable performance and strong consistency among different tissue datasets with average accuracy of 71-90%. In addition, the optimized gene sets and hierarchical annotation strategy could be applied to other methods to improve their annotation accuracy and the spectrum of annotated cell types and subtypes. We also applied sc-ImmuCC to a dataset composed of COVID-19, influenza, and healthy donors, and found that the proportion of monocytes in patients with COVID-19 and influenza was significantly higher than that in healthy people. The easy-to-use sc-ImmuCC tool provides a good way to comprehensively annotate immune cell types from scRNA-Seq data, and will also help study the immune mechanism underlying physiological and pathological conditions.
Collapse
|
4
|
Nava A, Alves da Quinta D, Prato L, Girotti R, Moron G, Llera AS, Fernández EA. Novel evaluation approach for molecular signature-based deconvolution methods. J Biomed Inform 2023; 142:104387. [PMID: 37172634 DOI: 10.1016/j.jbi.2023.104387] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2022] [Revised: 03/17/2023] [Accepted: 05/07/2023] [Indexed: 05/15/2023]
Abstract
The tumoral immune microenvironment (TIME) plays a key role in prognosis, therapeutic approach and pathophysiological understanding over oncological processes. Several computational immune cell-type deconvolution methods (DM), supported by diverse molecular signatures (MS), have been developed to uncover such TIME interplay from RNA-seq tumor biopsies. MS-DM pairs were benchmarked against each other by means of different metrics, such as Pearson's correlation, R2 and RMSE, but these only evaluate the linear association of the estimated proportion related to the expected one, missing the analysis of prediction-dependent bias trends and cell identification accuracy. We present a novel protocol composed of four tests allowing appropriate evaluation of the cell type identification performance and proportion prediction accuracy of molecular signature-deconvolution method pair by means of certainty and confidence cell-type identification scores (F1-score, distance to the optimal point and error rates) as well the Bland-Altman method for error-trend analysis. Our protocol was used to benchmark six state-of-the-art DMs (CIBERSORTx, DCQ, DeconRNASeq, EPIC, MIXTURE and quanTIseq) paired to five murine tissue-specific MSs, revealing a systematic overestimation of the number of different cell types across almost all methods.
Collapse
Affiliation(s)
- A Nava
- Fundación Instituto Leloir-CONICET, Buenos Aires, Argentina; Fundación Huésped, Buenos Aires, Argentina
| | - D Alves da Quinta
- Fundación Instituto Leloir-CONICET, Buenos Aires, Argentina; Universidad Argentina de la Empresa (UADE). Instituto de Tecnología (INTEC), Buenos Aires, Argentina
| | - L Prato
- Universidad de Villa María, Córdoba, Argentina
| | - R Girotti
- Universidad Argentina de la Empresa (UADE). Instituto de Tecnología (INTEC), Buenos Aires, Argentina
| | - G Moron
- Departamento de Bioquímica Clínica, Facultad de Ciencias Químicas, Universidad Nacional de Córdoba, Córdoba, Argentina; Centro de Investigaciones en Bioquímica Clínica e Inmunología, CONICET, Córdoba, Argentina
| | - A S Llera
- Fundación Instituto Leloir-CONICET, Buenos Aires, Argentina
| | - E A Fernández
- Facultad de Ingeniería, Carrera de Bioinformática, Universidad Católica de Córdoba (UCC), Córdoba, Argentina; Facultad de Ciencias Exactas Físicas y Naturales, Universidad Nacional de Córdoba, Córdoba, Argentina; Centro de Investigaciòn en Inmunología y Enfermedades Infecciosas, UCC, CONICET, Córdoba, Argentina.
| |
Collapse
|
5
|
Chen Z, Na H, Wu A. ImmuCellDB: An Indicative Database of Immune Cell Composition From Different Tissues and Disease Conditions in Mouse and Human. Front Immunol 2021; 12:670070. [PMID: 34456903 PMCID: PMC8387820 DOI: 10.3389/fimmu.2021.670070] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2021] [Accepted: 07/27/2021] [Indexed: 11/13/2022] Open
Abstract
Immune cell composition is highly divergent across different tissues and diseases. A comprehensive resource of tissue immune cells across different conditions in mouse and human will thus provide great understanding of the immune microenvironment of many diseases. Recently, computational methods for estimating immune cell abundance from tissue transcriptome data have been developed and are now widely used. Using these computational tools, large-scale estimation of immune cell composition across tissues and conditions should be possible using gene expression data collected from public databases. In total, 266 tissue types and 706 disease types in humans, as well as 143 tissue types and 61 disease types, and 206 genotypes in mouse had been included in a database we have named ImmuCellDB (http://wap-lab.org:3200/ImmuCellDB/). In ImmuCellDB, users can search and browse immune cell proportions based on tissues, disease or genotype in mouse or humans. Additionally, the variation and correlation of immune cell abundance and gene expression level between different conditions can be compared and viewed in this database. We believe that ImmuCellDB provides not only an indicative view of tissue-dependent or disease-dependent immune cell profiles, but also represents an easy way to pre-determine immune cell abundance and gene expression profiles for specific situations.
Collapse
Affiliation(s)
- Ziyi Chen
- Institute of Systems Medicine, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China.,Suzhou Institute of Systems Medicine, Suzhou, Suzhou, China.,Department of Infectious Diseases, The Second Hospital of Nanjing, The Affiliated Hospital of Nanjing University of Chinese Medicine, Nanjing, China
| | - Han Na
- Institute of Systems Medicine, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China.,Suzhou Institute of Systems Medicine, Suzhou, Suzhou, China
| | - Aiping Wu
- Institute of Systems Medicine, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China.,Suzhou Institute of Systems Medicine, Suzhou, Suzhou, China
| |
Collapse
|
6
|
Xiao H, Zhang J, Wang K, Song K, Zheng H, Yang J, Li K, Yuan R, Zhao W, Hui Y. A Cancer-Specific Qualitative Method for Estimating the Proportion of Tumor-Infiltrating Immune Cells. Front Immunol 2021; 12:672031. [PMID: 34054849 PMCID: PMC8160514 DOI: 10.3389/fimmu.2021.672031] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2021] [Accepted: 04/22/2021] [Indexed: 11/13/2022] Open
Abstract
Tumor-infiltrating immune cells are important components in the tumor microenvironment (TME) and different types of these cells exert different effects on tumor development and progression; these effects depend upon the type of cancer involved. Several methods have been developed for estimating the proportion of immune cells using bulk transcriptome data. However, there is a distinct lack of methods that are capable of predicting the immune contexture in specific types of cancer. Furthermore, the existing methods are based on absolute gene expression and are susceptible to experimental batch effects, thus resulting in incomparability across different datasets. In this study, we considered two common neoplasms as examples (colorectal cancer [CRC] and melanoma) and introduced the Tumor-infiltrating Immune Cell Proportion Estimator (TICPE), a cancer-specific qualitative method for estimating the proportion of tumor-infiltrating immune cells. The TICPE was based on the relative expression orderings (REOs) of gene pairs within a sample and is notably insensitive to batch effects. Performance evaluation using public expression data with mRNA mixtures, single-cell RNA-Seq (scRNA-Seq) data, immunohistochemistry data, and simulated bulk RNA-seq samples, indicated that the TICPE can estimate the proportion of immune cells with levels of accuracy that are clearly superior to other methods. Furthermore, we showed that the TICPE could effectively detect prognostic signals in patients with tumors and changes in the fractions of immune cells during immunotherapy in melanoma. In conclusion, our work presented a unique novel method, TICPE, to estimate the proportion of immune cells in specific cancer types and explore the effect of the infiltration of immune cells on the efficacy of immunotherapy and the prognosis of cancer. The source code for TICPE is available at https://github.com/huitingxiao/TICPE.
Collapse
Affiliation(s)
- Huiting Xiao
- Department of Systems Biology, College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Jiashuai Zhang
- Department of Systems Biology, College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Kai Wang
- Department of Systems Biology, College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China.,Department of Biochemistry and Molecular Biology, Harbin Medical University, Harbin, China
| | - Kai Song
- Department of Systems Biology, College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Hailong Zheng
- Department of Systems Biology, College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Jing Yang
- Department of Systems Biology, College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Keru Li
- Department of Systems Biology, College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Rongqiang Yuan
- Department of Systems Biology, College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Wenyuan Zhao
- Department of Systems Biology, College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Yang Hui
- Department of Biochemistry and Molecular Biology, Harbin Medical University, Harbin, China
| |
Collapse
|