1
|
Yadav S, Vora DS, Sundar D, Dhanjal JK. TCR-ESM: Employing protein language embeddings to predict TCR-peptide-MHC binding. Comput Struct Biotechnol J 2024; 23:165-173. [PMID: 38146434 PMCID: PMC10749252 DOI: 10.1016/j.csbj.2023.11.037] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2023] [Revised: 11/19/2023] [Accepted: 11/20/2023] [Indexed: 12/27/2023] Open
Abstract
Cognate target identification for T-cell receptors (TCRs) is a significant barrier in T-cell therapy development, which may be overcome by accurately predicting TCR interaction with peptide-bound major histocompatibility complex (pMHC). In this study, we have employed peptide embeddings learned from a large protein language model- Evolutionary Scale Modeling (ESM), to predict TCR-pMHC binding. The TCR-ESM model presented outperforms existing predictors. The complementarity-determining region 3 (CDR3) of the hypervariable TCR is located at the center of the paratope and plays a crucial role in peptide recognition. TCR-ESM trained on paired TCR data with both CDR3α and CDR3β chain information performs significantly better than those trained on data with only CDR3β, suggesting that both TCR chains contribute to specificity, the relative importance however depends on the specific peptide-MHC targeted. The study illuminates the importance of MHC information in TCR-peptide binding which remained inconclusive so far and was thought dependent on the dataset characteristics. TCR-ESM outperforms existing approaches on external datasets, suggesting generalizability. Overall, the potential of deep learning for predicting TCR-pMHC interactions and improving the understanding of factors driving TCR specificity are highlighted. The prediction model is available at http://tcresm.dhanjal-lab.iiitd.edu.in/ as an online tool.
Collapse
Affiliation(s)
- Shashank Yadav
- Department of Biomedical Engineering, University of Arizona, Tucson 85721, AZ, USA
| | - Dhvani Sandip Vora
- Department of Biochemical Engineering and Biotechnology, Indian Institute of Technology Delhi, New Delhi 110016, India
- Department of Computational Biology, Indraprastha Institute of Information Technology, Delhi, New Delhi 110020, India
| | - Durai Sundar
- Department of Biochemical Engineering and Biotechnology, Indian Institute of Technology Delhi, New Delhi 110016, India
| | - Jaspreet Kaur Dhanjal
- Department of Computational Biology, Indraprastha Institute of Information Technology, Delhi, New Delhi 110020, India
| |
Collapse
|
2
|
Pham MDN, Su CTT, Nguyen TN, Nguyen HN, Nguyen DDA, Giang H, Nguyen DT, Phan MD, Nguyen V. epiTCR-KDA: knowledge distillation model on dihedral angles for TCR-peptide prediction. BIOINFORMATICS ADVANCES 2024; 4:vbae190. [PMID: 39678207 PMCID: PMC11646569 DOI: 10.1093/bioadv/vbae190] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/19/2024] [Revised: 11/03/2024] [Accepted: 11/27/2024] [Indexed: 12/17/2024]
Abstract
Motivation The prediction of the T-cell receptor (TCR) and antigen bindings is crucial for advancements in immunotherapy. However, most current TCR-peptide interaction predictors struggle to perform well on unseen data. This limitation may stem from the conventional use of TCR and/or peptide sequences as input, which may not adequately capture their structural characteristics. Therefore, incorporating the structural information of TCRs and peptides into the prediction model is necessary to improve its generalizability. Results We developed epiTCR-KDA (KDA stands for Knowledge Distillation model on Dihedral Angles), a new predictor of TCR-peptide binding that utilizes the dihedral angles between the residues of the peptide and the TCR as a structural descriptor. This structural information was integrated into a knowledge distillation model to enhance its generalizability. epiTCR-KDA demonstrated competitive prediction performance, with an area under the curve (AUC) of 1.00 for seen data and AUC of 0.91 for unseen data. On public datasets, epiTCR-KDA consistently outperformed other predictors, maintaining a median AUC of 0.93. Further analysis of epiTCR-KDA revealed that the cosine similarity of the dihedral angle vectors between the unseen testing data and training data is crucial for its stable performance. In conclusion, our epiTCR-KDA model represents a significant step forward in developing a highly effective pipeline for antigen-based immunotherapy. Availability and implementation epiTCR-KDA is available on GitHub (https://github.com/ddiem-ri-4D/epiTCR-KDA).
Collapse
Affiliation(s)
- My-Diem Nguyen Pham
- Faculty of Information Technology, University of Science, Ho Chi Minh City, Vietnam
- Vietnam National University, Ho Chi Minh City, Vietnam
- Medical Genetics Institute, Ho Chi Minh City, Vietnam
| | | | | | | | - Dinh Duy An Nguyen
- Department of Genetics and Genomic Sciences School of Medicine, Case Western Reserve University, Cleveland, Ohio, United States
| | - Hoa Giang
- Medical Genetics Institute, Ho Chi Minh City, Vietnam
| | - Dinh-Thuc Nguyen
- Faculty of Information Technology, University of Science, Ho Chi Minh City, Vietnam
- Vietnam National University, Ho Chi Minh City, Vietnam
| | - Minh-Duy Phan
- Medical Genetics Institute, Ho Chi Minh City, Vietnam
- NexCalibur Therapeutics, DE, United States
| | - Vy Nguyen
- Medical Genetics Institute, Ho Chi Minh City, Vietnam
| |
Collapse
|
3
|
Postovskaya A, Vercauteren K, Meysman P, Laukens K. tcrBLOSUM: an amino acid substitution matrix for sensitive alignment of distant epitope-specific TCRs. Brief Bioinform 2024; 26:bbae602. [PMID: 39576224 PMCID: PMC11583439 DOI: 10.1093/bib/bbae602] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2024] [Revised: 10/07/2024] [Accepted: 11/05/2024] [Indexed: 11/24/2024] Open
Abstract
Deciphering the specificity of T-cell receptor (TCR) repertoires is crucial for monitoring adaptive immune responses and developing targeted immunotherapies and vaccines. To elucidate the specificity of previously unseen TCRs, many methods employ the BLOSUM62 matrix to find TCRs with similar amino acid (AA) sequences. However, while BLOSUM62 reflects the AA substitutions within conserved regions of proteins with similar functions, the remarkable diversity of TCRs means that both TCRs with similar and dissimilar sequences can bind the same epitope. Therefore, reliance on BLOSUM62 may bias detection towards epitope-specific TCRs with similar biochemical properties, overlooking those with more diverse AA compositions. In this study, we introduce tcrBLOSUMa and tcrBLOSUMb, specialized AA substitution matrices for CDR3 alpha and CDR3 beta TCR chains, respectively. The matrices reflect AA frequencies and variations occurring within TCRs that bind the same epitope, revealing that both CDR3 alpha and CDR3 beta display tolerance to a wide range of AA substitutions and differ noticeably from the standard BLOSUM62. By accurately aligning distant TCRs employing tcrBLOSUMb, we were able to improve clustering performance and capture a large number of epitope-specific TCRs with diverse AA compositions and physicochemical profiles overlooked by BLOSUM62. Utilizing both the general BLOSUM62 and specialized tcrBLOSUM matrices in existing computational tools will broaden the range of TCRs that can be associated with their cognate epitopes, thereby enhancing TCR repertoire analysis.
Collapse
MESH Headings
- Receptors, Antigen, T-Cell/immunology
- Receptors, Antigen, T-Cell/genetics
- Receptors, Antigen, T-Cell/chemistry
- Amino Acid Substitution
- Humans
- Amino Acid Sequence
- Epitopes, T-Lymphocyte/immunology
- Epitopes, T-Lymphocyte/chemistry
- Sequence Alignment
- Complementarity Determining Regions/genetics
- Complementarity Determining Regions/immunology
- Complementarity Determining Regions/chemistry
- Computational Biology/methods
- Epitopes/immunology
- Epitopes/chemistry
- Algorithms
- Receptors, Antigen, T-Cell, alpha-beta/genetics
- Receptors, Antigen, T-Cell, alpha-beta/immunology
- Receptors, Antigen, T-Cell, alpha-beta/chemistry
Collapse
Affiliation(s)
- Anna Postovskaya
- Adrem Data Lab, Department of Computer Science, University of Antwerp, Antwerp, Belgium
- Antwerp Unit for Data Analysis and Computation in Immunology and Sequencing (AUDACIS), University of Antwerp, Antwerp, Belgium
- Clinical Virology Unit, Department of Clinical Sciences, Institute of Tropical Medicine, Antwerp, Belgium
| | - Koen Vercauteren
- Clinical Virology Unit, Department of Clinical Sciences, Institute of Tropical Medicine, Antwerp, Belgium
| | - Pieter Meysman
- Adrem Data Lab, Department of Computer Science, University of Antwerp, Antwerp, Belgium
- Antwerp Unit for Data Analysis and Computation in Immunology and Sequencing (AUDACIS), University of Antwerp, Antwerp, Belgium
| | - Kris Laukens
- Adrem Data Lab, Department of Computer Science, University of Antwerp, Antwerp, Belgium
- Antwerp Unit for Data Analysis and Computation in Immunology and Sequencing (AUDACIS), University of Antwerp, Antwerp, Belgium
- Biomedical Informatics Research Network Antwerp (BIOMINA), University of Antwerp, Antwerp, Belgium
| |
Collapse
|
4
|
Xu L, Yang Q, Dong W, Li X, Wang K, Dong S, Zhang X, Yang T, Luo G, Liao X, Gao X, Wang G. Meta learning for mutant HLA class I epitope immunogenicity prediction to accelerate cancer clinical immunotherapy. Brief Bioinform 2024; 26:bbae625. [PMID: 39656887 DOI: 10.1093/bib/bbae625] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2024] [Revised: 09/18/2024] [Accepted: 11/14/2024] [Indexed: 12/17/2024] Open
Abstract
Accurate prediction of binding between human leukocyte antigen (HLA) class I molecules and antigenic peptide segments is a challenging task and a key bottleneck in personalized immunotherapy for cancer. Although existing prediction tools have demonstrated significant results using established datasets, most can only predict the binding affinity of antigenic peptides to HLA and do not enable the immunogenic interpretation of new antigenic epitopes. This limitation results from the training data for the computational models relying heavily on a large amount of peptide-HLA (pHLA) eluting ligand data, in which most of the candidate epitopes lack immunogenicity. Here, we propose an adaptive immunogenicity prediction model, named MHLAPre, which is trained on the large-scale MS-derived HLA I eluted ligandome (mostly presented by epitopes) that are immunogenic. Allele-specific and pan-allelic prediction models are also provided for endogenous peptide presentation. Using a meta-learning strategy, MHLAPre rapidly assessed HLA class I peptide affinities across the whole pHLA pairs and accurately identified tumor-associated endogenous antigens. During the process of adaptive immune response of T-cells, pHLA-specific binding in the antigen presentation is only a pre-task for CD8+ T-cell recognition. The key factor in activating the immune response is the interaction between pHLA complexes and T-cell receptors (TCRs). Therefore, we performed transfer learning on the pHLA model using the pHLA-TCR dataset. In pHLA binding task, MHLAPre demonstrated significant improvement in identifying neoepitope immunogenicity compared with five state-of-the-art models, proving its effectiveness and robustness. After transfer learning of the pHLA-TCR data, MHLAPre also exhibited relatively superior performance in revealing the mechanism of immunotherapy. MHLAPre is a powerful tool to identify neoepitopes that can interact with TCR and induce immune responses. We believe that the proposed method will greatly contribute to clinical immunotherapy, such as anti-tumor immunity, tumor-specific T-cell engineering, and personalized tumor vaccine.
Collapse
Affiliation(s)
- Long Xu
- School of Computer Science and Technology, Harbin Institute of Technology, West DaZhi Street, 150001 Harbin, China
| | - Qiang Yang
- School of Computer Science and Technology, Harbin Institute of Technology, West DaZhi Street, 150001 Harbin, China
- School of Medicine and Health, Harbin Institute of Technology, Yikuang Street, 150000 Harbin, China
| | - Weihe Dong
- College of Computer and Control Engineering, Northeast Forestry University, Hexing Road, 150040 Harbin, China
| | - Xiaokun Li
- School of Computer Science and Technology, Harbin Institute of Technology, West DaZhi Street, 150001 Harbin, China
- School of Computer Science and Technology, Heilongjiang University, Xuefu Road, 150080 Harbin, China
- Postdoctoral Program of Heilongjiang Hengxun Technology Co., Ltd., Xuefu Road, 150090 Harbin, China
- Shandong Hengxun Technology Co., Ltd., Miaoling Road, 266100 Qingdao, China
| | - Kuanquan Wang
- School of Computer Science and Technology, Harbin Institute of Technology, West DaZhi Street, 150001 Harbin, China
| | - Suyu Dong
- College of Computer and Control Engineering, Northeast Forestry University, Hexing Road, 150040 Harbin, China
| | - Xianyu Zhang
- Department of Breast Surgery, Harbin Medical University Cancer Hospital, Haping Road, 150081 Harbin, China
| | - Tiansong Yang
- Department of Rehabilitation, The First Affiliated Hospital of Heilongjiang University of Traditional Chinese Medicine, Xuefu Road, 150040 Harbin, China
| | - Gongning Luo
- School of Computer Science and Technology, Harbin Institute of Technology, West DaZhi Street, 150001 Harbin, China
- Computer, Electrical and Mathematical Sciences & Engineering Division, King Abdullah University of Science and Technology, Thuwal 23955, 4700 KAUST Saudi, Arabia
| | - Xingyu Liao
- School of Computer Science, Northwestern Polytechnical University, 710072 Xian, China
| | - Xin Gao
- Computer, Electrical and Mathematical Sciences & Engineering Division, King Abdullah University of Science and Technology, Thuwal 23955, 4700 KAUST Saudi, Arabia
| | - Guohua Wang
- School of Computer Science and Technology, Harbin Institute of Technology, West DaZhi Street, 150001 Harbin, China
- College of Computer and Control Engineering, Northeast Forestry University, Hexing Road, 150040 Harbin, China
| |
Collapse
|
5
|
Drost F, Dorigatti E, Straub A, Hilgendorf P, Wagner KI, Heyer K, López Montes M, Bischl B, Busch DH, Schober K, Schubert B. Predicting T cell receptor functionality against mutant epitopes. CELL GENOMICS 2024; 4:100634. [PMID: 39151427 PMCID: PMC11480844 DOI: 10.1016/j.xgen.2024.100634] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/13/2023] [Revised: 04/22/2024] [Accepted: 07/22/2024] [Indexed: 08/19/2024]
Abstract
Cancer cells and pathogens can evade T cell receptors (TCRs) via mutations in immunogenic epitopes. TCR cross-reactivity (i.e., recognition of multiple epitopes with sequence similarities) can counteract such escape but may cause severe side effects in cell-based immunotherapies through targeting self-antigens. To predict the effect of epitope point mutations on T cell functionality, we here present the random forest-based model Predicting T Cell Epitope-Specific Activation against Mutant Versions (P-TEAM). P-TEAM was trained and tested on three datasets with TCR responses to single-amino-acid mutations of the model epitope SIINFEKL, the tumor neo-epitope VPSVWRSSL, and the human cytomegalovirus antigen NLVPMVATV, totaling 9,690 unique TCR-epitope interactions. P-TEAM was able to accurately classify T cell reactivities and quantitatively predict T cell functionalities for unobserved single-point mutations and unseen TCRs. Overall, P-TEAM provides an effective computational tool to study T cell responses against mutated epitopes.
Collapse
Affiliation(s)
- Felix Drost
- Institute of Computational Biology, Helmholtz Center Munich, 85764 Neuherberg, Germany; School of Life Sciences Weihenstephan, Technical University of Munich, 85354 Freising, Germany
| | - Emilio Dorigatti
- Institute of Computational Biology, Helmholtz Center Munich, 85764 Neuherberg, Germany; Department of Statistics, Ludwig Maximilian Universität, 80539 Munich, Germany; Munich Center for Machine Learning (MCML), Ludwig Maximilian Universität, 80538 Munich, Germany
| | - Adrian Straub
- Institute for Medical Microbiology, Immunology, and Hygiene, Technical University of Munich, 81675 Munich, Germany
| | - Philipp Hilgendorf
- Institute for Medical Microbiology, Immunology, and Hygiene, Technical University of Munich, 81675 Munich, Germany; Mikrobiologisches Institut-Klinische Mikrobiologie, Immunologie, und Hygiene, Universitätsklinikum Erlangen, Friedrich-Alexander-Universität Erlangen-Nürnberg, 91054 Erlangen, Germany
| | - Karolin I Wagner
- Institute for Medical Microbiology, Immunology, and Hygiene, Technical University of Munich, 81675 Munich, Germany
| | - Kersten Heyer
- Institute for Medical Microbiology, Immunology, and Hygiene, Technical University of Munich, 81675 Munich, Germany
| | - Marta López Montes
- Institute for Medical Microbiology, Immunology, and Hygiene, Technical University of Munich, 81675 Munich, Germany
| | - Bernd Bischl
- Department of Statistics, Ludwig Maximilian Universität, 80539 Munich, Germany; Munich Center for Machine Learning (MCML), Ludwig Maximilian Universität, 80538 Munich, Germany
| | - Dirk H Busch
- Institute for Medical Microbiology, Immunology, and Hygiene, Technical University of Munich, 81675 Munich, Germany; German Center for Infection Research, Deutschen Zentrum für Infektionsforschung (DZIF), Partner Site Munich, 81675 Munich, Germany
| | - Kilian Schober
- Institute for Medical Microbiology, Immunology, and Hygiene, Technical University of Munich, 81675 Munich, Germany; Mikrobiologisches Institut-Klinische Mikrobiologie, Immunologie, und Hygiene, Universitätsklinikum Erlangen, Friedrich-Alexander-Universität Erlangen-Nürnberg, 91054 Erlangen, Germany; Medical Immunology Campus Erlangen, Friedrich-Alexander-Universität (FAU) Erlangen-Nürnberg, 91054 Erlangen, Germany
| | - Benjamin Schubert
- Institute of Computational Biology, Helmholtz Center Munich, 85764 Neuherberg, Germany; School of Computation, Information, and Technology, Technical University of Munich, 85748 Garching bei München, Germany.
| |
Collapse
|
6
|
Zhang Y, Leng X, Shenhav A. Make or break: The influence of expected challenges and rewards on the motivation and experience associated with cognitive effort exertion. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.12.05.570154. [PMID: 38106182 PMCID: PMC10723292 DOI: 10.1101/2023.12.05.570154] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/19/2023]
Abstract
Challenging goals can induce harder work but also greater stress, in turn potentially undermining goal achievement. We sought to examine how mental effort and subjective experiences thereof interact as a function of challenge level and the size of the incentives at stake. Participants performed a task that rewarded individual units of effort investment (correctly performed Stroop trials) but only if they met a threshold number of correct trials within a fixed time interval (challenge level). We varied this challenge level (Study 1, N = 40), and the rewards at stake (Study 2, N = 79), and measured variability in task performance and self-reported affect across task intervals. Greater challenge and higher rewards facilitated greater effort investment but also induced greater stress, while higher rewards (and lower challenge) simultaneously induced greater positive affect. Within intervals, we observed an initial speed up then slowdown in performance, which could reflect dynamic reconfiguration of control. Collectively, these findings further our understanding of the influence of task demands and incentives on mental effort exertion and wellbeing.
Collapse
Affiliation(s)
- Yue Zhang
- Department of Psychology, University of Michigan, Ann Arbor, MI
| | - Xiamin Leng
- Department of Psychology, University of California, Berkeley, Berkeley, CA
- Helen Wills Neuroscience Institute, University of California, Berkeley, Berkeley, CA
| | - Amitai Shenhav
- Department of Psychology, University of California, Berkeley, Berkeley, CA
- Helen Wills Neuroscience Institute, University of California, Berkeley, Berkeley, CA
| |
Collapse
|
7
|
Ehrlich R, Glynn E, Singh M, Ghersi D. Computational Methods for Predicting Key Interactions in T Cell-Mediated Adaptive Immunity. Annu Rev Biomed Data Sci 2024; 7:295-316. [PMID: 38748864 DOI: 10.1146/annurev-biodatasci-102423-122741] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/25/2024]
Abstract
The adaptive immune system recognizes pathogen- and cancer-specific features and is endowed with memory, enabling it to respond quickly and efficiently to repeated encounters with the same antigens. T cells play a central role in the adaptive immune system by directly targeting intracellular pathogens and helping to activate B cells to secrete antibodies. Several fundamental protein interactions-including those between major histocompatibility complex (MHC) proteins and antigen-derived peptides as well as between T cell receptors and peptide-MHC complexes-underlie the ability of T cells to recognize antigens with great precision. Computational approaches to predict these interactions are increasingly being used for medically relevant applications, including vaccine design and prediction of patient response to cancer immunotherapies. We provide computational researchers with an accessible introduction to the adaptive immune system, review computational approaches to predict the key protein interactions underlying T cell-mediated adaptive immunity, and highlight remaining challenges.
Collapse
Affiliation(s)
- Ryan Ehrlich
- School of Interdisciplinary Informatics, University of Nebraska, Omaha, Nebraska, USA;
| | - Eric Glynn
- Lewis-Sigler Institute, Princeton University, Princeton, New Jersey, USA
| | - Mona Singh
- Department of Computer Science, Princeton University, Princeton, New Jersey, USA;
- Lewis-Sigler Institute, Princeton University, Princeton, New Jersey, USA
| | - Dario Ghersi
- School of Interdisciplinary Informatics, University of Nebraska, Omaha, Nebraska, USA;
| |
Collapse
|
8
|
Meynard-Piganeau B, Feinauer C, Weigt M, Walczak AM, Mora T. TULIP: A transformer-based unsupervised language model for interacting peptides and T cell receptors that generalizes to unseen epitopes. Proc Natl Acad Sci U S A 2024; 121:e2316401121. [PMID: 38838016 PMCID: PMC11181096 DOI: 10.1073/pnas.2316401121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Accepted: 04/29/2024] [Indexed: 06/07/2024] Open
Abstract
The accurate prediction of binding between T cell receptors (TCR) and their cognate epitopes is key to understanding the adaptive immune response and developing immunotherapies. Current methods face two significant limitations: the shortage of comprehensive high-quality data and the bias introduced by the selection of the negative training data commonly used in the supervised learning approaches. We propose a method, Transformer-based Unsupervised Language model for Interacting Peptides and T cell receptors (TULIP), that addresses both limitations by leveraging incomplete data and unsupervised learning and using the transformer architecture of language models. Our model is flexible and integrates all possible data sources, regardless of their quality or completeness. We demonstrate the existence of a bias introduced by the sampling procedure used in previous supervised approaches, emphasizing the need for an unsupervised approach. TULIP recognizes the specific TCRs binding an epitope, performing well on unseen epitopes. Our model outperforms state-of-the-art models and offers a promising direction for the development of more accurate TCR epitope recognition models.
Collapse
Affiliation(s)
- Barthelemy Meynard-Piganeau
- Laboratory of Computational and Quantitative Biology, Institut de Biologie Paris Seine, CNRS, Sorbonne Université, Paris75005, France
- Department of Computing Sciences, Bocconi University, Milan20100, Italy
| | | | - Martin Weigt
- Laboratory of Computational and Quantitative Biology, Institut de Biologie Paris Seine, CNRS, Sorbonne Université, Paris75005, France
| | - Aleksandra M. Walczak
- Laboratoire de Physique de l’Ecole Normale Supérieure, Université Paris Sciences et Lettres, CNRS, Sorbonne Université, Université de Paris Cité, Paris75005, France
| | - Thierry Mora
- Laboratoire de Physique de l’Ecole Normale Supérieure, Université Paris Sciences et Lettres, CNRS, Sorbonne Université, Université de Paris Cité, Paris75005, France
| |
Collapse
|
9
|
Lundgren S, Myllymäki M, Järvinen T, Keränen MAI, Theodoropoulos J, Smolander J, Kim D, Salmenniemi U, Walldin G, Savola P, Kelkka T, Rajala H, Hellström-Lindberg E, Itälä-Remes M, Kankainen M, Mustjoki S. Somatic mutations associate with clonal expansion of CD8 + T cells. SCIENCE ADVANCES 2024; 10:eadj0787. [PMID: 38848368 PMCID: PMC11160466 DOI: 10.1126/sciadv.adj0787] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/06/2023] [Accepted: 05/06/2024] [Indexed: 06/09/2024]
Abstract
Somatic mutations in T cells can cause cancer but also have implications for immunological diseases and cell therapies. The mutation spectrum in nonmalignant T cells is unclear. Here, we examined somatic mutations in CD4+ and CD8+ T cells from 90 patients with hematological and immunological disorders and used T cell receptor (TCR) and single-cell sequencing to link mutations with T cell expansions and phenotypes. CD8+ cells had a higher mutation burden than CD4+ cells. Notably, the biggest variant allele frequency (VAF) of non-synonymous variants was higher than synonymous variants in CD8+ T cells, indicating non-random occurrence. The non-synonymous VAF in CD8+ T cells strongly correlated with the TCR frequency, but not age. We identified mutations in pathways essential for T cell function and often affected lymphoid neoplasia. Single-cell sequencing revealed cytotoxic TEMRA phenotypes of mutated T cells. Our findings suggest that somatic mutations contribute to CD8+ T cell expansions without malignant transformation.
Collapse
Affiliation(s)
- Sofie Lundgren
- Hematology Research Unit Helsinki, University of Helsinki and Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland
| | - Mikko Myllymäki
- Hematology Research Unit Helsinki, University of Helsinki and Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland
| | - Timo Järvinen
- Hematology Research Unit Helsinki, University of Helsinki and Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland
- Medical and Clinical Genetics, University of Helsinki and Helsinki University Hospital, Helsinki, Finland
| | - Mikko A. I. Keränen
- Hematology Research Unit Helsinki, University of Helsinki and Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland
- Department of Hematology, Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland
| | - Jason Theodoropoulos
- Hematology Research Unit Helsinki, University of Helsinki and Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland
| | - Johannes Smolander
- Hematology Research Unit Helsinki, University of Helsinki and Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland
| | - Daehong Kim
- Hematology Research Unit Helsinki, University of Helsinki and Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland
| | - Urpu Salmenniemi
- Department of Hematology, Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland
- Stem Cell Transplantation Unit, Turku University Hospital, Turku, Finland
| | - Gunilla Walldin
- Center for Hematology and Regenerative Medicine, Department of Medicine, Karolinska Institute and Karolinska University Hospital, Stockholm, Sweden
| | - Paula Savola
- Hematology Research Unit Helsinki, University of Helsinki and Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland
- Department of Clinical Chemistry, HUS Diagnostic Center, Helsinki University Hospital and University of Helsinki, Helsinki, Finland
| | - Tiina Kelkka
- Hematology Research Unit Helsinki, University of Helsinki and Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland
| | - Hanna Rajala
- Hematology Research Unit Helsinki, University of Helsinki and Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland
- Department of Hematology, Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland
| | - Eva Hellström-Lindberg
- Center for Hematology and Regenerative Medicine, Department of Medicine, Karolinska Institute and Karolinska University Hospital, Stockholm, Sweden
| | - Maija Itälä-Remes
- Stem Cell Transplantation Unit, Turku University Hospital, Turku, Finland
| | - Matti Kankainen
- Hematology Research Unit Helsinki, University of Helsinki and Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland
- Medical and Clinical Genetics, University of Helsinki and Helsinki University Hospital, Helsinki, Finland
| | - Satu Mustjoki
- Hematology Research Unit Helsinki, University of Helsinki and Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland
- ICAN Digital Precision Cancer Medicine Flagship, Helsinki, Finland
| |
Collapse
|
10
|
Yu Z, Jiang M, Lan X. HeteroTCR: A heterogeneous graph neural network-based method for predicting peptide-TCR interaction. Commun Biol 2024; 7:684. [PMID: 38834836 PMCID: PMC11150398 DOI: 10.1038/s42003-024-06380-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2023] [Accepted: 05/23/2024] [Indexed: 06/06/2024] Open
Abstract
Identifying interactions between T-cell receptors (TCRs) and immunogenic peptides holds profound implications across diverse research domains and clinical scenarios. Unsupervised clustering models (UCMs) cannot predict peptide-TCR binding directly, while supervised predictive models (SPMs) often face challenges in identifying antigens previously unencountered by the immune system or possessing limited TCR binding repertoires. Therefore, we propose HeteroTCR, an SPM based on Heterogeneous Graph Neural Network (GNN), to accurately predict peptide-TCR binding probabilities. HeteroTCR captures within-type (TCR-TCR or peptide-peptide) similarity information and between-type (peptide-TCR) interaction insights for predictions on unseen peptides and TCRs, surpassing limitations of existing SPMs. Our evaluation shows HeteroTCR outperforms state-of-the-art models on independent datasets. Ablation studies and visual interpretation underscore the Heterogeneous GNN module's critical role in enhancing HeteroTCR's performance by capturing pivotal binding process features. We further demonstrate the robustness and reliability of HeteroTCR through validation using single-cell datasets, aligning with the expectation that pMHC-TCR complexes with higher predicted binding probabilities correspond to increased binding fractions.
Collapse
Affiliation(s)
- Zilan Yu
- School of Medicine, Tsinghua University, 100084, Beijing, China
- Centre for Life Sciences, Tsinghua University, 100084, Beijing, China
| | - Mengnan Jiang
- School of Medicine, Tsinghua University, 100084, Beijing, China
| | - Xun Lan
- School of Medicine, Tsinghua University, 100084, Beijing, China.
- Centre for Life Sciences, Tsinghua University, 100084, Beijing, China.
- Tsinghua-Peking Center for Life Sciences, MOE Key Laboratory of Tsinghua University, Beijing, China.
- MOE Key Laboratory of Bioinformatics, Tsinghua University, 100084, Beijing, China.
| |
Collapse
|
11
|
Machaca V, Goyzueta V, Cruz MG, Sejje E, Pilco LM, López J, Túpac Y. Transformers meets neoantigen detection: a systematic literature review. J Integr Bioinform 2024; 21:jib-2023-0043. [PMID: 38960869 PMCID: PMC11377031 DOI: 10.1515/jib-2023-0043] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2023] [Accepted: 03/20/2024] [Indexed: 07/05/2024] Open
Abstract
Cancer immunology offers a new alternative to traditional cancer treatments, such as radiotherapy and chemotherapy. One notable alternative is the development of personalized vaccines based on cancer neoantigens. Moreover, Transformers are considered a revolutionary development in artificial intelligence with a significant impact on natural language processing (NLP) tasks and have been utilized in proteomics studies in recent years. In this context, we conducted a systematic literature review to investigate how Transformers are applied in each stage of the neoantigen detection process. Additionally, we mapped current pipelines and examined the results of clinical trials involving cancer vaccines.
Collapse
Affiliation(s)
| | | | | | - Erika Sejje
- Universidad Nacional de San Agustín, Arequipa, Perú
| | | | | | - Yván Túpac
- 187038 Universidad Católica San Pablo , Arequipa, Perú
| |
Collapse
|
12
|
Bulashevska A, Nacsa Z, Lang F, Braun M, Machyna M, Diken M, Childs L, König R. Artificial intelligence and neoantigens: paving the path for precision cancer immunotherapy. Front Immunol 2024; 15:1394003. [PMID: 38868767 PMCID: PMC11167095 DOI: 10.3389/fimmu.2024.1394003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2024] [Accepted: 05/13/2024] [Indexed: 06/14/2024] Open
Abstract
Cancer immunotherapy has witnessed rapid advancement in recent years, with a particular focus on neoantigens as promising targets for personalized treatments. The convergence of immunogenomics, bioinformatics, and artificial intelligence (AI) has propelled the development of innovative neoantigen discovery tools and pipelines. These tools have revolutionized our ability to identify tumor-specific antigens, providing the foundation for precision cancer immunotherapy. AI-driven algorithms can process extensive amounts of data, identify patterns, and make predictions that were once challenging to achieve. However, the integration of AI comes with its own set of challenges, leaving space for further research. With particular focus on the computational approaches, in this article we have explored the current landscape of neoantigen prediction, the fundamental concepts behind, the challenges and their potential solutions providing a comprehensive overview of this rapidly evolving field.
Collapse
Affiliation(s)
- Alla Bulashevska
- Host-Pathogen-Interactions, Paul-Ehrlich-Institut, Langen, Germany
| | - Zsófia Nacsa
- Host-Pathogen-Interactions, Paul-Ehrlich-Institut, Langen, Germany
| | - Franziska Lang
- TRON - Translational Oncology at the University Medical Center of the Johannes Gutenberg University gGmbH, Mainz, Germany
| | - Markus Braun
- Host-Pathogen-Interactions, Paul-Ehrlich-Institut, Langen, Germany
| | - Martin Machyna
- Host-Pathogen-Interactions, Paul-Ehrlich-Institut, Langen, Germany
| | - Mustafa Diken
- TRON - Translational Oncology at the University Medical Center of the Johannes Gutenberg University gGmbH, Mainz, Germany
| | - Liam Childs
- Host-Pathogen-Interactions, Paul-Ehrlich-Institut, Langen, Germany
| | - Renate König
- Host-Pathogen-Interactions, Paul-Ehrlich-Institut, Langen, Germany
| |
Collapse
|
13
|
Jiang F, Guo Y, Ma H, Na S, Zhong W, Han Y, Wang T, Huang J. GTE: a graph learning framework for prediction of T-cell receptors and epitopes binding specificity. Brief Bioinform 2024; 25:bbae343. [PMID: 39007599 PMCID: PMC11247411 DOI: 10.1093/bib/bbae343] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2024] [Revised: 05/15/2024] [Accepted: 07/01/2024] [Indexed: 07/16/2024] Open
Abstract
The interaction between T-cell receptors (TCRs) and peptides (epitopes) presented by major histocompatibility complex molecules (MHC) is fundamental to the immune response. Accurate prediction of TCR-epitope interactions is crucial for advancing the understanding of various diseases and their prevention and treatment. Existing methods primarily rely on sequence-based approaches, overlooking the inherent topology structure of TCR-epitope interaction networks. In this study, we present $GTE$, a novel heterogeneous Graph neural network model based on inductive learning to capture the topological structure between TCRs and Epitopes. Furthermore, we address the challenge of constructing negative samples within the graph by proposing a dynamic edge update strategy, enhancing model learning with the nonbinding TCR-epitope pairs. Additionally, to overcome data imbalance, we adapt the Deep AUC Maximization strategy to the graph domain. Extensive experiments are conducted on four public datasets to demonstrate the superiority of exploring underlying topological structures in predicting TCR-epitope interactions, illustrating the benefits of delving into complex molecular networks. The implementation code and data are available at https://github.com/uta-smile/GTE.
Collapse
Affiliation(s)
- Feng Jiang
- Department of Computer Science and Engineering, University of Texas at Arlington, 701 S. Nedderman Drive, TX 76019, United States
| | - Yuzhi Guo
- Department of Computer Science and Engineering, University of Texas at Arlington, 701 S. Nedderman Drive, TX 76019, United States
| | - Hehuan Ma
- Department of Computer Science and Engineering, University of Texas at Arlington, 701 S. Nedderman Drive, TX 76019, United States
| | - Saiyang Na
- Department of Computer Science and Engineering, University of Texas at Arlington, 701 S. Nedderman Drive, TX 76019, United States
| | - Wenliang Zhong
- Department of Computer Science and Engineering, University of Texas at Arlington, 701 S. Nedderman Drive, TX 76019, United States
| | - Yi Han
- Public Health, University of Texas Southwestern Medical Center, 5323 Harry Hines Blvd, TX 75390, United States
| | - Tao Wang
- Public Health, University of Texas Southwestern Medical Center, 5323 Harry Hines Blvd, TX 75390, United States
| | - Junzhou Huang
- Department of Computer Science and Engineering, University of Texas at Arlington, 701 S. Nedderman Drive, TX 76019, United States
| |
Collapse
|
14
|
Jiang M, Yu Z, Lan X. VitTCR: A deep learning method for peptide recognition prediction. iScience 2024; 27:109770. [PMID: 38711451 PMCID: PMC11070698 DOI: 10.1016/j.isci.2024.109770] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2023] [Revised: 01/21/2024] [Accepted: 04/15/2024] [Indexed: 05/08/2024] Open
Abstract
This study introduces VitTCR, a predictive model based on the vision transformer (ViT) architecture, aimed at identifying interactions between T cell receptors (TCRs) and peptides, crucial for developing cancer immunotherapies and vaccines. VitTCR converts TCR-peptide interactions into numerical AtchleyMaps using Atchley factors for prediction, achieving AUROC (0.6485) and AUPR (0.6295) values. Benchmark analysis indicates VitTCR's performance is comparable to other models, with further comparative studies suggested to understand its effectiveness in varied contexts. Additionally, integrating a positional bias weight matrix (PBWM), derived from amino acid contact probabilities in structurally resolved pMHC-TCR complexes, slightly improves VitTCR's accuracy. The model's predictions show weak yet statistically significant correlations with immunological factors like T cell clonal expansion and activation percentages, underscoring the biological relevance of VitTCR's predictive capabilities. VitTCR emerges as a valuable computational tool for predicting TCR-peptide interactions, offering insights for immunotherapy and vaccine development.
Collapse
Affiliation(s)
- Mengnan Jiang
- School of Medicine, Tsinghua University, Beijing 100084, China
| | - Zilan Yu
- School of Medicine, Tsinghua University, Beijing 100084, China
- Centre for Life Sciences, Tsinghua University, Beijing 100084, China
| | - Xun Lan
- School of Medicine, Tsinghua University, Beijing 100084, China
- Centre for Life Sciences, Tsinghua University, Beijing 100084, China
- Tsinghua-Peking Center for Life Sciences, MOE Key Laboratory of Tsinghua University, Beijing, China
- MOE Key Laboratory of Bioinformatics, Tsinghua University, Beijing 100084, China
| |
Collapse
|
15
|
Marrer-Berger E, Nicastri A, Augustin A, Kramar V, Liao H, Hanisch LJ, Carpy A, Weinzierl T, Durr E, Schaub N, Nudischer R, Ortiz-Franyuti D, Breous-Nystrom E, Stucki J, Hobi N, Raggi G, Cabon L, Lezan E, Umaña P, Woodhouse I, Bujotzek A, Klein C, Ternette N. The physiological interactome of TCR-like antibody therapeutics in human tissues. Nat Commun 2024; 15:3271. [PMID: 38627373 PMCID: PMC11021511 DOI: 10.1038/s41467-024-47062-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2022] [Accepted: 03/19/2024] [Indexed: 04/19/2024] Open
Abstract
Selective binding of TCR-like antibodies that target a single tumour-specific peptide antigen presented by human leukocyte antigens (HLA) is the absolute prerequisite for their therapeutic suitability and patient safety. To date, selectivity assessment has been limited to peptide library screening and predictive modeling. We developed an experimental platform to de novo identify interactomes of TCR-like antibodies directly in human tissues using mass spectrometry. As proof of concept, we confirm the target epitope of a MAGE-A4-specific TCR-like antibody. We further determine cross-reactive peptide sequences for ESK1, a TCR-like antibody with known off-target activity, in human liver tissue. We confirm off-target-induced T cell activation and ESK1-mediated liver spheroid killing. Off-target sequences feature an amino acid motif that allows a structural groove-coordination mimicking that of the target peptide, therefore allowing the interaction with the engager molecule. We conclude that our strategy offers an accurate, scalable route for evaluating the non-clinical safety profile of TCR-like antibody therapeutics prior to first-in-human clinical application.
Collapse
Affiliation(s)
- Estelle Marrer-Berger
- Roche Pharma Research & Early Development, Roche Innovation Center Basel, 4070, Basel, Switzerland
| | - Annalisa Nicastri
- The Jenner Institute, Old Road Campus Research Building, Oxford, OX37DQ, UK
- Centre for Immuno-Oncology, Old Road Campus Research Building, Oxford, OX37DQ, UK
| | - Angelique Augustin
- Roche Pharma Research & Early Development, Roche Innovation Center Basel, 4070, Basel, Switzerland
| | - Vesna Kramar
- Roche Innovation Center Zürich, 8952, Schlieren, Switzerland
| | - Hanqing Liao
- The Jenner Institute, Old Road Campus Research Building, Oxford, OX37DQ, UK
- Centre for Immuno-Oncology, Old Road Campus Research Building, Oxford, OX37DQ, UK
| | | | - Alejandro Carpy
- Roche Pharma Research & Early Development, Roche Innovation Center Munich, 82377, Penzberg, Germany
| | - Tina Weinzierl
- Roche Innovation Center Zürich, 8952, Schlieren, Switzerland
| | - Evelyne Durr
- Roche Pharma Research & Early Development, Roche Innovation Center Basel, 4070, Basel, Switzerland
| | - Nathalie Schaub
- Roche Pharma Research & Early Development, Roche Innovation Center Basel, 4070, Basel, Switzerland
| | - Ramona Nudischer
- Roche Pharma Research & Early Development, Roche Innovation Center Basel, 4070, Basel, Switzerland
| | - Daniela Ortiz-Franyuti
- Roche Pharma Research & Early Development, Roche Innovation Center Basel, 4070, Basel, Switzerland
| | - Ekaterina Breous-Nystrom
- Roche Pharma Research & Early Development, Roche Innovation Center Basel, 4070, Basel, Switzerland
| | - Janick Stucki
- Alveolix AG, Swiss Organs-on-Chip Innovation, 3010, Bern, Switzerland
| | - Nina Hobi
- Alveolix AG, Swiss Organs-on-Chip Innovation, 3010, Bern, Switzerland
| | - Giulia Raggi
- Alveolix AG, Swiss Organs-on-Chip Innovation, 3010, Bern, Switzerland
| | - Lauriane Cabon
- Roche Pharma Research & Early Development, Roche Innovation Center Basel, 4070, Basel, Switzerland
| | - Emmanuelle Lezan
- Roche Pharma Research & Early Development, Roche Innovation Center Basel, 4070, Basel, Switzerland
| | - Pablo Umaña
- Roche Innovation Center Zürich, 8952, Schlieren, Switzerland
| | - Isaac Woodhouse
- The Jenner Institute, Old Road Campus Research Building, Oxford, OX37DQ, UK
- Centre for Immuno-Oncology, Old Road Campus Research Building, Oxford, OX37DQ, UK
| | - Alexander Bujotzek
- Roche Pharma Research & Early Development, Roche Innovation Center Munich, 82377, Penzberg, Germany
| | - Christian Klein
- Roche Innovation Center Zürich, 8952, Schlieren, Switzerland.
| | - Nicola Ternette
- The Jenner Institute, Old Road Campus Research Building, Oxford, OX37DQ, UK.
- Centre for Immuno-Oncology, Old Road Campus Research Building, Oxford, OX37DQ, UK.
- Department of Pharmaceutical Sciences, University of Utrecht, 3584, CH, Utrecht, The Netherlands.
| |
Collapse
|
16
|
Croce G, Bobisse S, Moreno DL, Schmidt J, Guillame P, Harari A, Gfeller D. Deep learning predictions of TCR-epitope interactions reveal epitope-specific chains in dual alpha T cells. Nat Commun 2024; 15:3211. [PMID: 38615042 PMCID: PMC11016097 DOI: 10.1038/s41467-024-47461-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Accepted: 04/03/2024] [Indexed: 04/15/2024] Open
Abstract
T cells have the ability to eliminate infected and cancer cells and play an essential role in cancer immunotherapy. T cell activation is elicited by the binding of the T cell receptor (TCR) to epitopes displayed on MHC molecules, and the TCR specificity is determined by the sequence of its α and β chains. Here, we collect and curate a dataset of 17,715 αβTCRs interacting with dozens of class I and class II epitopes. We use this curated data to develop MixTCRpred, an epitope-specific TCR-epitope interaction predictor. MixTCRpred accurately predicts TCRs recognizing several viral and cancer epitopes. MixTCRpred further provides a useful quality control tool for multiplexed single-cell TCR sequencing assays of epitope-specific T cells and pinpoints a substantial fraction of putative contaminants in public databases. Analysis of epitope-specific dual α T cells demonstrates that MixTCRpred can identify α chains mediating epitope recognition. Applying MixTCRpred to TCR repertoires from COVID-19 patients reveals enrichment of clonotypes predicted to bind an immunodominant SARS-CoV-2 epitope. Overall, MixTCRpred provides a robust tool to predict TCRs interacting with specific epitopes and interpret TCR-sequencing data from both bulk and epitope-specific T cells.
Collapse
Affiliation(s)
- Giancarlo Croce
- Department of Oncology UNIL CHUV, Ludwig Institute for Cancer Research, University of Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics (SIB), Lausanne, Switzerland
- Agora Cancer Research Centre, Lausanne, Switzerland
- Swiss Cancer Center Leman (SCCL), Lausanne, Switzerland
| | - Sara Bobisse
- Agora Cancer Research Centre, Lausanne, Switzerland
- Swiss Cancer Center Leman (SCCL), Lausanne, Switzerland
- Department of Oncology UNIL CHUV, Ludwig Institute for Cancer Research, University Hospital of Lausanne, Lausanne, Switzerland
| | - Dana Léa Moreno
- Department of Oncology UNIL CHUV, Ludwig Institute for Cancer Research, University of Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics (SIB), Lausanne, Switzerland
- Agora Cancer Research Centre, Lausanne, Switzerland
- Swiss Cancer Center Leman (SCCL), Lausanne, Switzerland
| | - Julien Schmidt
- Department of Oncology UNIL CHUV, Ludwig Institute for Cancer Research, University of Lausanne, Lausanne, Switzerland
- Swiss Cancer Center Leman (SCCL), Lausanne, Switzerland
- Department of Oncology UNIL CHUV, Ludwig Institute for Cancer Research, University Hospital of Lausanne, Lausanne, Switzerland
| | - Philippe Guillame
- Swiss Cancer Center Leman (SCCL), Lausanne, Switzerland
- Department of Oncology UNIL CHUV, Ludwig Institute for Cancer Research, University Hospital of Lausanne, Lausanne, Switzerland
| | - Alexandre Harari
- Department of Oncology UNIL CHUV, Ludwig Institute for Cancer Research, University of Lausanne, Lausanne, Switzerland
- Agora Cancer Research Centre, Lausanne, Switzerland
- Swiss Cancer Center Leman (SCCL), Lausanne, Switzerland
- Department of Oncology UNIL CHUV, Ludwig Institute for Cancer Research, University Hospital of Lausanne, Lausanne, Switzerland
| | - David Gfeller
- Department of Oncology UNIL CHUV, Ludwig Institute for Cancer Research, University of Lausanne, Lausanne, Switzerland.
- Swiss Institute of Bioinformatics (SIB), Lausanne, Switzerland.
- Agora Cancer Research Centre, Lausanne, Switzerland.
- Swiss Cancer Center Leman (SCCL), Lausanne, Switzerland.
| |
Collapse
|
17
|
Jensen MF, Nielsen M. Enhancing TCR specificity predictions by combined pan- and peptide-specific training, loss-scaling, and sequence similarity integration. eLife 2024; 12:RP93934. [PMID: 38437160 PMCID: PMC10942633 DOI: 10.7554/elife.93934] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/06/2024] Open
Abstract
Predicting the interaction between Major Histocompatibility Complex (MHC) class I-presented peptides and T-cell receptors (TCR) holds significant implications for vaccine development, cancer treatment, and autoimmune disease therapies. However, limited paired-chain TCR data, skewed towards well-studied epitopes, hampers the development of pan-specific machine-learning (ML) models. Leveraging a larger peptide-TCR dataset, we explore various alterations to the ML architectures and training strategies to address data imbalance. This leads to an overall improved performance, particularly for peptides with scant TCR data. However, challenges persist for unseen peptides, especially those distant from training examples. We demonstrate that such ML models can be used to detect potential outliers, which when removed from training, leads to augmented performance. Integrating pan-specific and peptide-specific models alongside with similarity-based predictions, further improves the overall performance, especially when a low false positive rate is desirable. In the context of the IMMREP22 benchmark, this modeling framework attained state-of-the-art performance. Moreover, combining these strategies results in acceptable predictive accuracy for peptides characterized with as little as 15 positive TCRs. This observation places great promise on rapidly expanding the peptide covering of the current models for predicting TCR specificity. The NetTCR 2.2 model incorporating these advances is available on GitHub (https://github.com/mnielLab/NetTCR-2.2) and as a web server at https://services.healthtech.dtu.dk/services/NetTCR-2.2/.
Collapse
Affiliation(s)
- Mathias Fynbo Jensen
- Department of Health Technology, Section for Bioinformatics, Technical University of DenmarkLyngbyDenmark
| | - Morten Nielsen
- Department of Health Technology, Section for Bioinformatics, Technical University of DenmarkLyngbyDenmark
| |
Collapse
|
18
|
Xiong P, Liang A, Cai X, Xia T. APTAnet: an atom-level peptide-TCR interaction affinity prediction model. BIOPHYSICS REPORTS 2024; 10:1-14. [PMID: 38737473 PMCID: PMC11079603 DOI: 10.52601/bpr.2023.230037] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2023] [Accepted: 01/26/2024] [Indexed: 05/14/2024] Open
Abstract
The prediction of affinity between TCRs and peptides is crucial for the further development of TIL (Tumor-Infiltrating Lymphocytes) immunotherapy. Inspired by the broader research of drug-protein interaction (DPI), we propose an atom-level peptide-TCR interaction (PTI) affinity prediction model APTAnet using natural language processing methods. APTAnet model achieved an average ROC-AUC and PR-AUC of 0.893 and 0.877, respectively, in ten-fold cross-validation on 25,675 pairs of PTI data. Furthermore, experimental results on an independent test set from the McPAS database showed that APTAnet outperformed the current mainstream models. Finally, through the validation on 11 cases of real tumor patient data, we found that the APTAnet model can effectively identify tumor peptides and screen tumor-specific TCRs.
Collapse
Affiliation(s)
- Peng Xiong
- School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan 430074, China
| | - Anyi Liang
- School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan 430074, China
| | - Xunhui Cai
- Institute of Pathology, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430030, China
| | - Tian Xia
- School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan 430074, China
- Institute of Pathology, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430030, China
| |
Collapse
|
19
|
Bravi B. Development and use of machine learning algorithms in vaccine target selection. NPJ Vaccines 2024; 9:15. [PMID: 38242890 PMCID: PMC10798987 DOI: 10.1038/s41541-023-00795-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2023] [Accepted: 12/07/2023] [Indexed: 01/21/2024] Open
Abstract
Computer-aided discovery of vaccine targets has become a cornerstone of rational vaccine design. In this article, I discuss how Machine Learning (ML) can inform and guide key computational steps in rational vaccine design concerned with the identification of B and T cell epitopes and correlates of protection. I provide examples of ML models, as well as types of data and predictions for which they are built. I argue that interpretable ML has the potential to improve the identification of immunogens also as a tool for scientific discovery, by helping elucidate the molecular processes underlying vaccine-induced immune responses. I outline the limitations and challenges in terms of data availability and method development that need to be addressed to bridge the gap between advances in ML predictions and their translational application to vaccine design.
Collapse
Affiliation(s)
- Barbara Bravi
- Department of Mathematics, Imperial College London, London, SW7 2AZ, UK.
| |
Collapse
|
20
|
Huuhtanen J, Adnan-Awad S, Theodoropoulos J, Forstén S, Warfvinge R, Dufva O, Bouhlal J, Dhapola P, Duàn H, Laajala E, Kasanen T, Klievink J, Ilander M, Jaatinen T, Olsson-Strömberg U, Hjorth-Hansen H, Burchert A, Karlsson G, Kreutzman A, Lähdesmäki H, Mustjoki S. Single-cell analysis of immune recognition in chronic myeloid leukemia patients following tyrosine kinase inhibitor discontinuation. Leukemia 2024; 38:109-125. [PMID: 37919606 PMCID: PMC10776410 DOI: 10.1038/s41375-023-02074-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2023] [Revised: 09/19/2023] [Accepted: 10/17/2023] [Indexed: 11/04/2023]
Abstract
Immunological control of residual leukemia cells is thought to occur in patients with chronic myeloid leukemia (CML) that maintain treatment-free remission (TFR) following tyrosine kinase inhibitor (TKI) discontinuation. To study this, we analyzed 55 single-cell RNA and T cell receptor (TCR) sequenced samples (scRNA+TCRαβ-seq) from patients with CML (n = 13, N = 25), other cancers (n = 28), and healthy (n = 7). The high number and active phenotype of natural killer (NK) cells in CML separated them from healthy and other cancers. Most NK cells in CML belonged to the active CD56dim cluster with high expression of GZMA/B, PRF1, CCL3/4, and IFNG, with interactions with leukemic cells via inhibitory LGALS9-TIM3 and PVR-TIGIT interactions. Accordingly, upregulation of LGALS9 was observed in CML target cells and TIM3 in NK cells when co-cultured together. Additionally, we created a classifier to identify TCRs targeting leukemia-associated antigen PR1 and quantified anti-PR1 T cells in 90 CML and 786 healthy TCRβ-sequenced samples. Anti-PR1 T cells were more prevalent in CML, enriched in bone marrow samples, and enriched in the mature, cytotoxic CD8 + TEMRA cluster, especially in a patient maintaining TFR. Our results highlight the role of NK cells and anti-PR1 T cells in anti-leukemic immune responses in CML.
Collapse
Affiliation(s)
- Jani Huuhtanen
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland.
- Hematology Research Unit Helsinki, Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland.
- Department of Computer Science, Aalto University, Espoo, Finland.
- iCAN Digital Precision Cancer Medicine Flagship, Helsinki, Finland.
| | - Shady Adnan-Awad
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland
- Hematology Research Unit Helsinki, Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland
- iCAN Digital Precision Cancer Medicine Flagship, Helsinki, Finland
- Foundation for the Finnish Cancer Institute, Helsinki, Finland
| | - Jason Theodoropoulos
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland
- Hematology Research Unit Helsinki, Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland
- iCAN Digital Precision Cancer Medicine Flagship, Helsinki, Finland
| | - Sofia Forstén
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland
- Hematology Research Unit Helsinki, Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland
- iCAN Digital Precision Cancer Medicine Flagship, Helsinki, Finland
| | - Rebecca Warfvinge
- Division of Molecular Hematology, Lund Stem Cell Center, Lund University, Lund, Sweden
| | - Olli Dufva
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland
- Hematology Research Unit Helsinki, Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland
- iCAN Digital Precision Cancer Medicine Flagship, Helsinki, Finland
| | - Jonas Bouhlal
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland
- Hematology Research Unit Helsinki, Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland
- iCAN Digital Precision Cancer Medicine Flagship, Helsinki, Finland
| | - Parashar Dhapola
- Division of Molecular Hematology, Lund Stem Cell Center, Lund University, Lund, Sweden
| | - Hanna Duàn
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland
- Hematology Research Unit Helsinki, Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland
- iCAN Digital Precision Cancer Medicine Flagship, Helsinki, Finland
| | - Essi Laajala
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland
- Hematology Research Unit Helsinki, Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland
- iCAN Digital Precision Cancer Medicine Flagship, Helsinki, Finland
| | - Tiina Kasanen
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland
- Hematology Research Unit Helsinki, Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland
- iCAN Digital Precision Cancer Medicine Flagship, Helsinki, Finland
| | - Jay Klievink
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland
- Hematology Research Unit Helsinki, Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland
- iCAN Digital Precision Cancer Medicine Flagship, Helsinki, Finland
| | - Mette Ilander
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland
- Hematology Research Unit Helsinki, Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland
| | - Taina Jaatinen
- Histocompatibility Testing Laboratory, Finnish Red Cross Blood Service, Helsinki, Finland
| | - Ulla Olsson-Strömberg
- Department of Medical Sciences, Uppsala University and Hematology Section, Uppsala University Hospital, Uppsala, Sweden
| | - Henrik Hjorth-Hansen
- Department of Hematology, St. Olavs Hospital, Trondheim, Norway
- Department of Cancer Research and Molecular Medicine, Norwegian University of Science and Technology (NTNU), Trondheim, Norway
| | - Andreas Burchert
- Department of Hematology, Oncology and Immunology, Philipps University Marburg, and University Medical Center Giessen and Marburg, Marburg, Germany
| | - Göran Karlsson
- Division of Molecular Hematology, Lund Stem Cell Center, Lund University, Lund, Sweden
| | - Anna Kreutzman
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland
- Hematology Research Unit Helsinki, Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland
| | - Harri Lähdesmäki
- Department of Computer Science, Aalto University, Espoo, Finland
| | - Satu Mustjoki
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland.
- Hematology Research Unit Helsinki, Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland.
- iCAN Digital Precision Cancer Medicine Flagship, Helsinki, Finland.
| |
Collapse
|
21
|
Han Y, Yang Y, Tian Y, Fattah FJ, von Itzstein MS, Hu Y, Zhang M, Kang X, Yang DM, Liu J, Xue Y, Liang C, Raman I, Zhu C, Xiao O, Dowell JE, Homsi J, Rashdan S, Yang S, Gwin ME, Hsiehchen D, Gloria-McCutchen Y, Pan K, Wu F, Gibbons D, Wang X, Yee C, Huang J, Reuben A, Cheng C, Zhang J, Gerber DE, Wang T. pan-MHC and cross-Species Prediction of T Cell Receptor-Antigen Binding. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.12.01.569599. [PMID: 38105939 PMCID: PMC10723300 DOI: 10.1101/2023.12.01.569599] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/19/2023]
Abstract
Profiling the binding of T cell receptors (TCRs) of T cells to antigenic peptides presented by MHC proteins is one of the most important unsolved problems in modern immunology. Experimental methods to probe TCR-antigen interactions are slow, labor-intensive, costly, and yield moderate throughput. To address this problem, we developed pMTnet-omni, an Artificial Intelligence (AI) system based on hybrid protein sequence and structure information, to predict the pairing of TCRs of αβ T cells with peptide-MHC complexes (pMHCs). pMTnet-omni is capable of handling peptides presented by both class I and II pMHCs, and capable of handling both human and mouse TCR-pMHC pairs, through information sharing enabled this hybrid design. pMTnet-omni achieves a high overall Area Under the Curve of Receiver Operator Characteristics (AUROC) of 0.888, which surpasses competing tools by a large margin. We showed that pMTnet-omni can distinguish binding affinity of TCRs with similar sequences. Across a range of datasets from various biological contexts, pMTnet-omni characterized the longitudinal evolution and spatial heterogeneity of TCR-pMHC interactions and their functional impact. We successfully developed a biomarker based on pMTnet-omni for predicting immune-related adverse events of immune checkpoint inhibitor (ICI) treatment in a cohort of 57 ICI-treated patients. pMTnet-omni represents a major advance towards developing a clinically usable AI system for TCR-pMHC pairing prediction that can aid the design and implementation of TCR-based immunotherapeutics.
Collapse
|
22
|
Zhao M, Xu SX, Yang Y, Yuan M. GGNpTCR: A Generative Graph Structure Neural Network for Predicting Immunogenic Peptides for T-cell Immune Response. J Chem Inf Model 2023; 63:7557-7567. [PMID: 37990917 DOI: 10.1021/acs.jcim.3c01293] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2023]
Abstract
Identifying the interactions between T-cell receptor (TCRs) and human antigens is a crucial step in developing new vaccines, diagnostics, and immunotherapy. Current methods primarily focus on learning binding patterns from known TCR binding repertoires by using sequence information alone without considering the binding specificity of new antigens or exogenous peptides that have not appeared in the training set. Furthermore, the spatial structure of antigens plays a critical role in immune studies and immunotherapy, which should be addressed properly in the identification of interacting TCR-antigen pairs. In this study, we introduced a novel deep learning framework based on generative graph structures, GGNpTCR, for predicting interactions between TCR and peptides from sequence information. Results of real data analysis indicate that our model achieved excellent prediction for new antigens unseen in the training data set, making significant improvements compared to existing methods. We also applied the model to a large COVID-19 data set with no antigens in the training data set, and the improvement was also significant. Furthermore, through incorporation of additional supervised mechanisms, GGNpTCR demonstrated the ability to precisely forecast the locations of peptide-TCR interactions within 3D configurations. This enhancement substantially improved the model's interpretability. In summary, based on the performance on multiple data sets, GGNpTCR has made significant progress in terms of performance, universality, and interpretability.
Collapse
Affiliation(s)
- Minghua Zhao
- Department of Statistics and Finance, University of Science and Technology of China, Hefei 230026, China
| | - Steven X Xu
- Genmab US, Inc., Princeton, New Jersey 08540, United States
| | - Yaning Yang
- Department of Statistics and Finance, University of Science and Technology of China, Hefei 230026, China
| | - Min Yuan
- School of Public Health Administration, Anhui Medical University, Hefei 230032, China
| |
Collapse
|
23
|
Korpela D, Jokinen E, Dumitrescu A, Huuhtanen J, Mustjoki S, Lähdesmäki H. EPIC-TRACE: predicting TCR binding to unseen epitopes using attention and contextualized embeddings. Bioinformatics 2023; 39:btad743. [PMID: 38070156 PMCID: PMC10963061 DOI: 10.1093/bioinformatics/btad743] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2023] [Revised: 11/20/2023] [Accepted: 12/07/2023] [Indexed: 12/21/2023] Open
Abstract
MOTIVATION T cells play an essential role in adaptive immune system to fight pathogens and cancer but may also give rise to autoimmune diseases. The recognition of a peptide-MHC (pMHC) complex by a T cell receptor (TCR) is required to elicit an immune response. Many machine learning models have been developed to predict the binding, but generalizing predictions to pMHCs outside the training data remains challenging. RESULTS We have developed a new machine learning model that utilizes information about the TCR from both α and β chains, epitope sequence, and MHC. Our method uses ProtBERT embeddings for the amino acid sequences of both chains and the epitope, as well as convolution and multi-head attention architectures. We show the importance of each input feature as well as the benefit of including epitopes with only a few TCRs to the training data. We evaluate our model on existing databases and show that it compares favorably against other state-of-the-art models. AVAILABILITY AND IMPLEMENTATION https://github.com/DaniTheOrange/EPIC-TRACE.
Collapse
Affiliation(s)
- Dani Korpela
- Department of Computer Science, Aalto University, 02150 Espoo, Finland
| | - Emmi Jokinen
- Department of Computer Science, Aalto University, 02150 Espoo, Finland
- Translational Immunology Research Program, Department of Clinical Chemistry and Hematology, University of Helsinki, 00290 Helsinki, Finland
- Hematology Research Unit Helsinki, Helsinki University Hospital Comprehensive Cancer Center, 00290 Helsinki, Finland
| | | | - Jani Huuhtanen
- Translational Immunology Research Program, Department of Clinical Chemistry and Hematology, University of Helsinki, 00290 Helsinki, Finland
- Hematology Research Unit Helsinki, Helsinki University Hospital Comprehensive Cancer Center, 00290 Helsinki, Finland
| | - Satu Mustjoki
- Translational Immunology Research Program, Department of Clinical Chemistry and Hematology, University of Helsinki, 00290 Helsinki, Finland
- Hematology Research Unit Helsinki, Helsinki University Hospital Comprehensive Cancer Center, 00290 Helsinki, Finland
- iCAN Digital Precision Cancer Medicine Flagship, Helsinki, Finland
| | - Harri Lähdesmäki
- Department of Computer Science, Aalto University, 02150 Espoo, Finland
| |
Collapse
|
24
|
Khan AR, Reinders MJT, Khatri I. Determining epitope specificity of T-cell receptors with transformers. Bioinformatics 2023; 39:btad632. [PMID: 37847663 PMCID: PMC10636277 DOI: 10.1093/bioinformatics/btad632] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2023] [Revised: 09/09/2023] [Accepted: 10/16/2023] [Indexed: 10/19/2023] Open
Abstract
SUMMARY T-cell receptors (TCRs) on T cells recognize and bind to epitopes presented by the major histocompatibility complex in case of an infection or cancer. However, the high diversity of TCRs, as well as their unique and complex binding mechanisms underlying epitope recognition, make it difficult to predict the binding between TCRs and epitopes. Here, we present the utility of transformers, a deep learning strategy that incorporates an attention mechanism that learns the informative features, and show that these models pre-trained on a large set of protein sequences outperform current strategies. We compared three pre-trained auto-encoder transformer models (ProtBERT, ProtAlbert, and ProtElectra) and one pre-trained auto-regressive transformer model (ProtXLNet) to predict the binding specificity of TCRs to 25 epitopes from the VDJdb database (human and murine). Two additional modifications were performed to incorporate gene usage of the TCRs in the four transformer models. Of all 12 transformer implementations (four models with three different modifications), a modified version of the ProtXLNet model could predict TCR-epitope pairs with the highest accuracy (weighted F1 score 0.55 simultaneously considering all 25 epitopes). The modification included additional features representing the gene names for the TCRs. We also showed that the basic implementation of transformers outperformed the previously available methods, i.e. TCRGP, TCRdist, and DeepTCR, developed for the same biological problem, especially for the hard-to-classify labels. We show that the proficiency of transformers in attention learning can be made operational in a complex biological setting like TCR binding prediction. Further ingenuity in utilizing the full potential of transformers, either through attention head visualization or introducing additional features, can extend T-cell research avenues. AVAILABILITY AND IMPLEMENTATION Data and code are available on https://github.com/InduKhatri/tcrformer.
Collapse
Affiliation(s)
- Abdul Rehman Khan
- Department of Intelligent Systems, Delft University of Technology, Delft 2600 GA, The Netherlands
| | - Marcel J T Reinders
- Department of Intelligent Systems, Delft University of Technology, Delft 2600 GA, The Netherlands
- Leiden Computational Biology Center, Department of Molecular Epidemiology, Leiden University Medical Center, Leiden 2333 ZA, The Netherlands
| | - Indu Khatri
- Leiden Computational Biology Center, Department of Molecular Epidemiology, Leiden University Medical Center, Leiden 2333 ZA, The Netherlands
- Department of Immunology, Leiden University Medical Center, Leiden 2333 ZA, The Netherlands
| |
Collapse
|
25
|
Flumens D, Gielis S, Bartholomeus E, Campillo-Davo D, van der Heijden S, Versteven M, De Reu H, Smits E, Ogunjimi B, Laukens K, Meysman P, Lion E. Training of epitope-TCR prediction models with healthy donor-derived cancer-specific T cells. Methods Cell Biol 2023; 183:143-160. [PMID: 38548410 DOI: 10.1016/bs.mcb.2023.08.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/02/2024]
Abstract
Discovery of epitope-specific T-cell receptors (TCRs) for cancer therapies is a time consuming and expensive procedure that usually requires a large amount of patient cells. To maximize information from and minimize the need of precious samples in cancer research, prediction models have been developed to identify in silico epitope-specific TCRs. In this chapter, we provide a step-by-step protocol to train a prediction model using the user-friendly TCRex webtool for the nearly universal tumor-associated antigen Wilms' tumor 1 (WT1)-specific TCR repertoire. WT1 is a self-antigen overexpressed in numerous solid and hematological malignancies with a high clinical relevance. Training of computational models starts from a list of known epitope-specific TCRs which is often not available for new cancer epitopes. Therefore, we describe a workflow to assemble a training data set consisting of TCR sequences obtained from WT137-45-reactive CD8 T cell clones expanded and sorted from healthy donor peripheral blood mononuclear cells.
Collapse
Affiliation(s)
- Donovan Flumens
- Laboratory of Experimental Hematology (LEH), Vaccine & Infectious Disease Institute (VAXINFECTIO), Faculty of Medicine and Health Sciences, University of Antwerp, Antwerp, Belgium
| | - Sofie Gielis
- Adrem Data Lab, Department of Computer Science, University of Antwerp, Antwerp, Belgium; Antwerp Unit for Data Analysis and Computation in Immunology and Sequencing (AUDACIS), University of Antwerp, Antwerp, Belgium; Biomedical Informatics Research Network Antwerp (Biomina), University of Antwerp, Antwerp, Belgium
| | - Esther Bartholomeus
- Antwerp Unit for Data Analysis and Computation in Immunology and Sequencing (AUDACIS), University of Antwerp, Antwerp, Belgium; Centre for Health Economics Research & Modeling Infectious Diseases (CHERMID), VAXINFECTIO, University of Antwerp, Antwerp, Belgium; Antwerp Center for Translational Immunology and Virology (ACTIV), Vaccine and Infectious Disease Institute (VAXINFECTIO), University of Antwerp, Antwerp, Belgium
| | - Diana Campillo-Davo
- Laboratory of Experimental Hematology (LEH), Vaccine & Infectious Disease Institute (VAXINFECTIO), Faculty of Medicine and Health Sciences, University of Antwerp, Antwerp, Belgium
| | - Sanne van der Heijden
- Laboratory of Experimental Hematology (LEH), Vaccine & Infectious Disease Institute (VAXINFECTIO), Faculty of Medicine and Health Sciences, University of Antwerp, Antwerp, Belgium; Center for Oncological Research (CORE), Integrated Personalized & Precision Oncology Network (IPPON), University of Antwerp, Antwerp, Belgium
| | - Maarten Versteven
- Laboratory of Experimental Hematology (LEH), Vaccine & Infectious Disease Institute (VAXINFECTIO), Faculty of Medicine and Health Sciences, University of Antwerp, Antwerp, Belgium
| | - Hans De Reu
- Laboratory of Experimental Hematology (LEH), Vaccine & Infectious Disease Institute (VAXINFECTIO), Faculty of Medicine and Health Sciences, University of Antwerp, Antwerp, Belgium
| | - Evelien Smits
- Antwerp Unit for Data Analysis and Computation in Immunology and Sequencing (AUDACIS), University of Antwerp, Antwerp, Belgium; Centre for Health Economics Research & Modeling Infectious Diseases (CHERMID), VAXINFECTIO, University of Antwerp, Antwerp, Belgium
| | - Benson Ogunjimi
- Antwerp Unit for Data Analysis and Computation in Immunology and Sequencing (AUDACIS), University of Antwerp, Antwerp, Belgium; Centre for Health Economics Research & Modeling Infectious Diseases (CHERMID), VAXINFECTIO, University of Antwerp, Antwerp, Belgium; Antwerp Center for Translational Immunology and Virology (ACTIV), Vaccine and Infectious Disease Institute (VAXINFECTIO), University of Antwerp, Antwerp, Belgium
| | - Kris Laukens
- Adrem Data Lab, Department of Computer Science, University of Antwerp, Antwerp, Belgium; Antwerp Unit for Data Analysis and Computation in Immunology and Sequencing (AUDACIS), University of Antwerp, Antwerp, Belgium; Biomedical Informatics Research Network Antwerp (Biomina), University of Antwerp, Antwerp, Belgium
| | - Pieter Meysman
- Adrem Data Lab, Department of Computer Science, University of Antwerp, Antwerp, Belgium; Antwerp Unit for Data Analysis and Computation in Immunology and Sequencing (AUDACIS), University of Antwerp, Antwerp, Belgium; Biomedical Informatics Research Network Antwerp (Biomina), University of Antwerp, Antwerp, Belgium
| | - Eva Lion
- Laboratory of Experimental Hematology (LEH), Vaccine & Infectious Disease Institute (VAXINFECTIO), Faculty of Medicine and Health Sciences, University of Antwerp, Antwerp, Belgium; Antwerp Unit for Data Analysis and Computation in Immunology and Sequencing (AUDACIS), University of Antwerp, Antwerp, Belgium; Center for Cell Therapy & Regenerative Medicine (CCRG), Antwerp University Hospital, Edegem, Belgium.
| |
Collapse
|
26
|
Ghoreyshi ZS, George JT. Quantitative approaches for decoding the specificity of the human T cell repertoire. Front Immunol 2023; 14:1228873. [PMID: 37781387 PMCID: PMC10539903 DOI: 10.3389/fimmu.2023.1228873] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2023] [Accepted: 08/17/2023] [Indexed: 10/03/2023] Open
Abstract
T cell receptor (TCR)-peptide-major histocompatibility complex (pMHC) interactions play a vital role in initiating immune responses against pathogens, and the specificity of TCRpMHC interactions is crucial for developing optimized therapeutic strategies. The advent of high-throughput immunological and structural evaluation of TCR and pMHC has provided an abundance of data for computational approaches that aim to predict favorable TCR-pMHC interactions. Current models are constructed using information on protein sequence, structures, or a combination of both, and utilize a variety of statistical learning-based approaches for identifying the rules governing specificity. This review examines the current theoretical, computational, and deep learning approaches for identifying TCR-pMHC recognition pairs, placing emphasis on each method's mathematical approach, predictive performance, and limitations.
Collapse
Affiliation(s)
- Zahra S. Ghoreyshi
- Department of Biomedical Engineering, Texas A&M University, College Station, TX, United States
| | - Jason T. George
- Department of Biomedical Engineering, Texas A&M University, College Station, TX, United States
- Engineering Medicine Program, Texas A&M University, Houston, TX, United States
- Center for Theoretical Biological Physics, Rice University, Houston, TX, United States
| |
Collapse
|
27
|
Hudson D, Fernandes RA, Basham M, Ogg G, Koohy H. Can we predict T cell specificity with digital biology and machine learning? Nat Rev Immunol 2023; 23:511-521. [PMID: 36755161 PMCID: PMC9908307 DOI: 10.1038/s41577-023-00835-3] [Citation(s) in RCA: 47] [Impact Index Per Article: 47.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/07/2022] [Indexed: 02/10/2023]
Abstract
Recent advances in machine learning and experimental biology have offered breakthrough solutions to problems such as protein structure prediction that were long thought to be intractable. However, despite the pivotal role of the T cell receptor (TCR) in orchestrating cellular immunity in health and disease, computational reconstruction of a reliable map from a TCR to its cognate antigens remains a holy grail of systems immunology. Current data sets are limited to a negligible fraction of the universe of possible TCR-ligand pairs, and performance of state-of-the-art predictive models wanes when applied beyond these known binders. In this Perspective article, we make the case for renewed and coordinated interdisciplinary effort to tackle the problem of predicting TCR-antigen specificity. We set out the general requirements of predictive models of antigen binding, highlight critical challenges and discuss how recent advances in digital biology such as single-cell technology and machine learning may provide possible solutions. Finally, we describe how predicting TCR specificity might contribute to our understanding of the broader puzzle of antigen immunogenicity.
Collapse
Affiliation(s)
- Dan Hudson
- MRC Human Immunology Unit, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
- The Rosalind Franklin Institute, Didcot, UK
| | - Ricardo A Fernandes
- Chinese Academy of Medical Sciences Oxford Institute, University of Oxford, Oxford, UK
| | | | - Graham Ogg
- MRC Human Immunology Unit, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
- Chinese Academy of Medical Sciences Oxford Institute, University of Oxford, Oxford, UK
| | - Hashem Koohy
- MRC Human Immunology Unit, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK.
- Centre for Computational Biology, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK.
| |
Collapse
|
28
|
Myronov A, Mazzocco G, Król P, Plewczynski D. BERTrand-peptide:TCR binding prediction using Bidirectional Encoder Representations from Transformers augmented with random TCR pairing. Bioinformatics 2023; 39:btad468. [PMID: 37535685 PMCID: PMC10444968 DOI: 10.1093/bioinformatics/btad468] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2023] [Revised: 06/28/2023] [Accepted: 08/01/2023] [Indexed: 08/05/2023] Open
Abstract
MOTIVATION The advent of T-cell receptor (TCR) sequencing experiments allowed for a significant increase in the amount of peptide:TCR binding data available and a number of machine-learning models appeared in recent years. High-quality prediction models for a fixed epitope sequence are feasible, provided enough known binding TCR sequences are available. However, their performance drops significantly for previously unseen peptides. RESULTS We prepare the dataset of known peptide:TCR binders and augment it with negative decoys created using healthy donors' T-cell repertoires. We employ deep learning methods commonly applied in Natural Language Processing to train part a peptide:TCR binding model with a degree of cross-peptide generalization (0.69 AUROC). We demonstrate that BERTrand outperforms the published methods when evaluated on peptide sequences not used during model training. AVAILABILITY AND IMPLEMENTATION The datasets and the code for model training are available at https://github.com/SFGLab/bertrand.
Collapse
Affiliation(s)
- Alexander Myronov
- Faculty of Mathematics and Information Science, Warsaw University of Technology, Warsaw, Poland
- Ardigen, Krakow, Poland
| | | | | | - Dariusz Plewczynski
- Faculty of Mathematics and Information Science, Warsaw University of Technology, Warsaw, Poland
| |
Collapse
|
29
|
Lee MH, Theodoropoulos J, Huuhtanen J, Bhattacharya D, Järvinen P, Tornberg S, Nísen H, Mirtti T, Uski I, Kumari A, Peltonen K, Draghi A, Donia M, Kreutzman A, Mustjoki S. Immunologic Characterization and T cell Receptor Repertoires of Expanded Tumor-infiltrating Lymphocytes in Patients with Renal Cell Carcinoma. CANCER RESEARCH COMMUNICATIONS 2023; 3:1260-1276. [PMID: 37484198 PMCID: PMC10361538 DOI: 10.1158/2767-9764.crc-22-0514] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/19/2022] [Revised: 03/27/2023] [Accepted: 06/21/2023] [Indexed: 07/25/2023]
Abstract
The successful use of expanded tumor-infiltrating lymphocytes (TIL) in adoptive TIL therapies has been reported, but the effects of the TIL expansion, immunophenotype, function, and T cell receptor (TCR) repertoire of the infused products relative to the tumor microenvironment (TME) are not well understood. In this study, we analyzed the tumor samples (n = 58) from treatment-naïve patients with renal cell carcinoma (RCC), "pre-rapidly expanded" TILs (pre-REP TIL, n = 15) and "rapidly expanded" TILs (REP TIL, n = 25) according to a clinical-grade TIL production protocol, with single-cell RNA (scRNA)+TCRαβ-seq (TCRαβ sequencing), TCRβ-sequencing (TCRβ-seq), and flow cytometry. REP TILs encompassed a greater abundance of CD4+ than CD8+ T cells, with increased LAG-3 and low PD-1 expressions in both CD4+ and CD8+ T cell compartments compared with the pre-REP TIL and tumor T cells. The REP protocol preferentially expanded small clones of the CD4+ phenotype (CD4, IL7R, KLRB1) in the TME, indicating that the largest exhausted T cell clones in the tumor do not expand during the expansion protocol. In addition, by generating a catalog of RCC-associated TCR motifs from >1,000 scRNA+TCRαβ-seq and TCRβ-seq RCC, healthy and other cancer sample cohorts, we quantified the RCC-associated TCRs from the expansion protocol. Unlike the low-remaining amount of anti-viral TCRs throughout the expansion, the quantity of the RCC-associated TCRs was high in the tumors and pre-REP TILs but decreased in the REP TILs. Our results provide an in-depth understanding of the origin, phenotype, and TCR specificity of RCC TIL products, paving the way for a more rationalized production of TILs. Significance TILs are a heterogenous group of immune cells that recognize and attack the tumor, thus are utilized in various clinical trials. In our study, we explored the TILs in patients with kidney cancer by expanding the TILs using a clinical-grade protocol, as well as observed their characteristics and ability to recognize the tumor using in-depth experimental and computational tools.
Collapse
Affiliation(s)
- Moon Hee Lee
- Hematology Research Unit Helsinki, Department of Clinical Chemistry and Hematology, University of Helsinki and Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland
- Translational Immunology Research Program, University of Helsinki, Helsinki, Finland
- iCAN Digital Precision Cancer Medicine Flagship, University of Helsinki, Helsinki, Finland
| | - Jason Theodoropoulos
- Hematology Research Unit Helsinki, Department of Clinical Chemistry and Hematology, University of Helsinki and Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland
- Translational Immunology Research Program, University of Helsinki, Helsinki, Finland
- iCAN Digital Precision Cancer Medicine Flagship, University of Helsinki, Helsinki, Finland
| | - Jani Huuhtanen
- Hematology Research Unit Helsinki, Department of Clinical Chemistry and Hematology, University of Helsinki and Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland
- Translational Immunology Research Program, University of Helsinki, Helsinki, Finland
- iCAN Digital Precision Cancer Medicine Flagship, University of Helsinki, Helsinki, Finland
- Department of Computer Science, Aalto University, Espoo, Finland
| | - Dipabarna Bhattacharya
- Hematology Research Unit Helsinki, Department of Clinical Chemistry and Hematology, University of Helsinki and Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland
- Translational Immunology Research Program, University of Helsinki, Helsinki, Finland
- iCAN Digital Precision Cancer Medicine Flagship, University of Helsinki, Helsinki, Finland
| | - Petrus Järvinen
- Abdominal Center, Urology, Helsinki University and Helsinki University Hospital, Helsinki, Finland
| | - Sara Tornberg
- Abdominal Center, Urology, Helsinki University and Helsinki University Hospital, Helsinki, Finland
| | - Harry Nísen
- Abdominal Center, Urology, Helsinki University and Helsinki University Hospital, Helsinki, Finland
| | - Tuomas Mirtti
- iCAN Digital Precision Cancer Medicine Flagship, University of Helsinki, Helsinki, Finland
- Department of Pathology, HUS Diagnostic Center, Helsinki University Hospital, Helsinki, Finland
- Research Program in Systems Oncology, Faculty of Medicine, University of Helsinki, Helsinki, Finland
- Department of Biomedical Engineering, School of Medicine, Emory University, Atlanta, Georgia
| | - Ilona Uski
- Hematology Research Unit Helsinki, Department of Clinical Chemistry and Hematology, University of Helsinki and Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland
- Translational Immunology Research Program, University of Helsinki, Helsinki, Finland
| | - Anita Kumari
- Hematology Research Unit Helsinki, Department of Clinical Chemistry and Hematology, University of Helsinki and Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland
- Translational Immunology Research Program, University of Helsinki, Helsinki, Finland
- iCAN Digital Precision Cancer Medicine Flagship, University of Helsinki, Helsinki, Finland
| | - Karita Peltonen
- Hematology Research Unit Helsinki, Department of Clinical Chemistry and Hematology, University of Helsinki and Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland
- Translational Immunology Research Program, University of Helsinki, Helsinki, Finland
- iCAN Digital Precision Cancer Medicine Flagship, University of Helsinki, Helsinki, Finland
| | - Arianna Draghi
- National Center for Cancer Immune Therapy, Department of Oncology, Copenhagen University Hospital, Herlev, Denmark
| | - Marco Donia
- National Center for Cancer Immune Therapy, Department of Oncology, Copenhagen University Hospital, Herlev, Denmark
| | - Anna Kreutzman
- Hematology Research Unit Helsinki, Department of Clinical Chemistry and Hematology, University of Helsinki and Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland
- Translational Immunology Research Program, University of Helsinki, Helsinki, Finland
| | - Satu Mustjoki
- Hematology Research Unit Helsinki, Department of Clinical Chemistry and Hematology, University of Helsinki and Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland
- Translational Immunology Research Program, University of Helsinki, Helsinki, Finland
- iCAN Digital Precision Cancer Medicine Flagship, University of Helsinki, Helsinki, Finland
| |
Collapse
|
30
|
Shen Y, Voigt A, Leng X, Rodriguez AA, Nguyen CQ. A current and future perspective on T cell receptor repertoire profiling. Front Genet 2023; 14:1159109. [PMID: 37408774 PMCID: PMC10319011 DOI: 10.3389/fgene.2023.1159109] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2023] [Accepted: 06/12/2023] [Indexed: 07/07/2023] Open
Abstract
T cell receptors (TCR) play a vital role in the immune system's ability to recognize and respond to foreign antigens, relying on the highly polymorphic rearrangement of TCR genes. The recognition of autologous peptides by adaptive immunity may lead to the development and progression of autoimmune diseases. Understanding the specific TCR involved in this process can provide insights into the autoimmune process. RNA-seq (RNA sequencing) is a valuable tool for studying TCR repertoires by providing a comprehensive and quantitative analysis of the RNA transcripts. With the development of RNA technology, transcriptomic data must provide valuable information to model and predict TCR and antigen interaction and, more importantly, identify or predict neoantigens. This review provides an overview of the application and development of bulk RNA-seq and single-cell (SC) RNA-seq to examine the TCR repertoires. Furthermore, discussed here are bioinformatic tools that can be applied to study the structural biology of peptide/TCR/MHC (major histocompatibility complex) and predict antigenic epitopes using advanced artificial intelligence tools.
Collapse
Affiliation(s)
- Yiran Shen
- Department of Infectious Diseases and Immunology, College of Veterinary Medicine, University of Florida, Gainesville, FL, United States
| | - Alexandria Voigt
- Department of Infectious Diseases and Immunology, College of Veterinary Medicine, University of Florida, Gainesville, FL, United States
| | - Xuebing Leng
- Department of Microbiology and Immunology, Miller School of Medicine, University of Miami, Miami, FL, United States
| | - Amy A. Rodriguez
- Department of Infectious Diseases and Immunology, College of Veterinary Medicine, University of Florida, Gainesville, FL, United States
| | - Cuong Q. Nguyen
- Department of Infectious Diseases and Immunology, College of Veterinary Medicine, University of Florida, Gainesville, FL, United States
- Department of Oral Biology, College of Dentistry, University of Florida, Gainesville, FL, United States
- Center of Orphaned Autoimmune Diseases, University of Florida, Gainesville, FL, United States
| |
Collapse
|
31
|
Pham MDN, Nguyen TN, Tran LS, Nguyen QTB, Nguyen TPH, Pham TMQ, Nguyen HN, Giang H, Phan MD, Nguyen V. epiTCR: a highly sensitive predictor for TCR-peptide binding. Bioinformatics 2023; 39:btad284. [PMID: 37094220 PMCID: PMC10159657 DOI: 10.1093/bioinformatics/btad284] [Citation(s) in RCA: 15] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2022] [Revised: 02/28/2023] [Accepted: 04/22/2023] [Indexed: 04/26/2023] Open
Abstract
MOTIVATION Predicting the binding between T-cell receptor (TCR) and peptide presented by human leucocyte antigen molecule is a highly challenging task and a key bottleneck in the development of immunotherapy. Existing prediction tools, despite exhibiting good performance on the datasets they were built with, suffer from low true positive rates when used to predict epitopes capable of eliciting T-cell responses in patients. Therefore, an improved tool for TCR-peptide prediction built upon a large dataset combining existing publicly available data is still needed. RESULTS We collected data from five public databases (IEDB, TBAdb, VDJdb, McPAS-TCR, and 10X) to form a dataset of >3 million TCR-peptide pairs, 3.27% of which were binding interactions. We proposed epiTCR, a Random Forest-based method dedicated to predicting the TCR-peptide interactions. epiTCR used simple input of TCR CDR3β sequences and antigen sequences, which are encoded by flattened BLOSUM62. epiTCR performed with area under the curve (0.98) and higher sensitivity (0.94) than other existing tools (NetTCR, Imrex, ATM-TCR, and pMTnet), while maintaining comparable prediction specificity (0.9). We identified seven epitopes that contributed to 98.67% of false positives predicted by epiTCR and exerted similar effects on other tools. We also demonstrated a considerable influence of peptide sequences on prediction, highlighting the need for more diverse peptides in a more balanced dataset. In conclusion, epiTCR is among the most well-performing tools, thanks to the use of combined data from public sources and its use will contribute to the quest in identifying neoantigens for precision cancer immunotherapy. AVAILABILITY AND IMPLEMENTATION epiTCR is available on GitHub (https://github.com/ddiem-ri-4D/epiTCR).
Collapse
Affiliation(s)
| | | | - Le Son Tran
- Medical Genetics Institute, Ho Chi Minh City, Vietnam
- NexCalibur Therapeutics, Wilmington, DE, United States
| | | | | | | | - Hoai-Nghia Nguyen
- NexCalibur Therapeutics, Wilmington, DE, United States
- University of Medicine & Pharmacy, Ho Chi Minh City, Vietnam
| | - Hoa Giang
- Medical Genetics Institute, Ho Chi Minh City, Vietnam
- NexCalibur Therapeutics, Wilmington, DE, United States
| | - Minh-Duy Phan
- Medical Genetics Institute, Ho Chi Minh City, Vietnam
- NexCalibur Therapeutics, Wilmington, DE, United States
| | - Vy Nguyen
- Medical Genetics Institute, Ho Chi Minh City, Vietnam
| |
Collapse
|
32
|
Huuhtanen J, Kasanen H, Peltola K, Lönnberg T, Glumoff V, Brück O, Dufva O, Peltonen K, Vikkula J, Jokinen E, Ilander M, Lee MH, Mäkelä S, Nyakas M, Li B, Hernberg M, Bono P, Lähdesmäki H, Kreutzman A, Mustjoki S. Single-cell characterization of anti-LAG-3 and anti-PD-1 combination treatment in patients with melanoma. J Clin Invest 2023; 133:164809. [PMID: 36719749 PMCID: PMC10014104 DOI: 10.1172/jci164809] [Citation(s) in RCA: 30] [Impact Index Per Article: 30.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2022] [Accepted: 01/25/2023] [Indexed: 02/01/2023] Open
Abstract
BackgroundRelatlimab plus nivolumab (anti-lymphocyte-activation gene 3 plus anti-programmed death 1 [anti-LAG-3+anti-PD-1]) has been approved by the FDA as a first-line therapy for stage III/IV melanoma, but its detailed effect on the immune system is unknown.MethodsWe evaluated blood samples from 40 immunotherapy-naive or prior immunotherapy-refractory patients with metastatic melanoma treated with anti-LAG-3+anti-PD-1 in a phase I trial using single-cell RNA and T cell receptor sequencing (scRNA+TCRαβ-Seq) combined with other multiomics profiling.ResultsThe highest LAG3 expression was noted in NK cells, Tregs, and CD8+ T cells, and these cell populations underwent the most significant changes during the treatment. Adaptive NK cells were enriched in responders and underwent profound transcriptomic changes during the therapy, resulting in an active phenotype. LAG3+ Tregs expanded, but based on the transcriptome profile, became metabolically silent during the treatment. Last, higher baseline TCR clonality was observed in responding patients, and their expanding CD8+ T cell clones gained a more cytotoxic and NK-like phenotype.ConclusionAnti-LAG-3+anti-PD-1 therapy has profound effects on NK cells and Tregs in addition to CD8+ T cells.Trial registrationClinicalTrials.gov (NCT01968109)FundingCancer Foundation Finland, Sigrid Juselius Foundation, Signe and Ane Gyllenberg Foundation, Relander Foundation, State funding for university-level health research in Finland, a Helsinki Institute of Life Sciences Fellow grant, Academy of Finland (grant numbers 314442, 311081, 335432, and 335436), and an investigator-initiated research grant from BMS.
Collapse
Affiliation(s)
- Jani Huuhtanen
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland.,Hematology Research Unit Helsinki, Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland.,Department of Computer Science, Aalto University, Espoo, Finland.,iCAN Digital Precision Cancer Medicine Flagship, Helsinki, Finland
| | - Henna Kasanen
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland.,Hematology Research Unit Helsinki, Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland.,iCAN Digital Precision Cancer Medicine Flagship, Helsinki, Finland
| | - Katriina Peltola
- iCAN Digital Precision Cancer Medicine Flagship, Helsinki, Finland.,Department of Oncology, Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland
| | - Tapio Lönnberg
- Turku Bioscience Centre, University of Turku and Åbo Akademi University, Turku, Finland
| | - Virpi Glumoff
- Research Unit of Biomedicine, Medical Microbiology and Immunology, University of Oulu, Oulu, Finland
| | - Oscar Brück
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland.,Hematology Research Unit Helsinki, Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland.,iCAN Digital Precision Cancer Medicine Flagship, Helsinki, Finland
| | - Olli Dufva
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland.,Hematology Research Unit Helsinki, Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland.,iCAN Digital Precision Cancer Medicine Flagship, Helsinki, Finland
| | - Karita Peltonen
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland.,Hematology Research Unit Helsinki, Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland.,iCAN Digital Precision Cancer Medicine Flagship, Helsinki, Finland
| | - Johanna Vikkula
- Department of Computer Science, Aalto University, Espoo, Finland
| | - Emmi Jokinen
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland.,Hematology Research Unit Helsinki, Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland.,Department of Computer Science, Aalto University, Espoo, Finland.,iCAN Digital Precision Cancer Medicine Flagship, Helsinki, Finland
| | - Mette Ilander
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland.,Hematology Research Unit Helsinki, Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland
| | - Moon Hee Lee
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland.,Hematology Research Unit Helsinki, Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland.,iCAN Digital Precision Cancer Medicine Flagship, Helsinki, Finland
| | - Siru Mäkelä
- Department of Oncology, Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland
| | - Marta Nyakas
- Oslo University Hospital-Radiumhospitalet, Oslo, Norway
| | - Bin Li
- Bristol Myers Squibb (BMS) Research and Development, Princeton, New Jersey, USA
| | - Micaela Hernberg
- Department of Oncology, Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland
| | - Petri Bono
- Department of Oncology, Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland
| | - Harri Lähdesmäki
- Department of Computer Science, Aalto University, Espoo, Finland
| | - Anna Kreutzman
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland.,Hematology Research Unit Helsinki, Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland
| | - Satu Mustjoki
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland.,Hematology Research Unit Helsinki, Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland.,iCAN Digital Precision Cancer Medicine Flagship, Helsinki, Finland
| |
Collapse
|
33
|
Frank ML, Lu K, Erdogan C, Han Y, Hu J, Wang T, Heymach JV, Zhang J, Reuben A. T-Cell Receptor Repertoire Sequencing in the Era of Cancer Immunotherapy. Clin Cancer Res 2023; 29:994-1008. [PMID: 36413126 PMCID: PMC10011887 DOI: 10.1158/1078-0432.ccr-22-2469] [Citation(s) in RCA: 17] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2022] [Revised: 10/07/2022] [Accepted: 11/14/2022] [Indexed: 11/23/2022]
Abstract
T cells are integral components of the adaptive immune system, and their responses are mediated by unique T-cell receptors (TCR) that recognize specific antigens from a variety of biological contexts. As a result, analyzing the T-cell repertoire offers a better understanding of immune responses and of diseases like cancer. Next-generation sequencing technologies have greatly enabled the high-throughput analysis of the TCR repertoire. On the basis of our extensive experience in the field from the past decade, we provide an overview of TCR sequencing, from the initial library preparation steps to sequencing and analysis methods and finally to functional validation techniques. With regards to data analysis, we detail important TCR repertoire metrics and present several computational tools for predicting antigen specificity. Finally, we highlight important applications of TCR sequencing and repertoire analysis to understanding tumor biology and developing cancer immunotherapies.
Collapse
Affiliation(s)
- Meredith L. Frank
- Department of Thoracic/Head and Neck Medical Oncology, The University of Texas MD Anderson Cancer Center, Houston, Texas
- The University of Texas MD Anderson Cancer Center UT Health Houston Graduate School of Biomedical Sciences, Houston, Texas
| | - Kaylene Lu
- Department of Thoracic/Head and Neck Medical Oncology, The University of Texas MD Anderson Cancer Center, Houston, Texas
- The University of Texas MD Anderson Cancer Center UT Health Houston Graduate School of Biomedical Sciences, Houston, Texas
- Department of Cancer Biology, The University of Texas MD Anderson Cancer Center, Houston, Texas
| | - Can Erdogan
- Department of Thoracic/Head and Neck Medical Oncology, The University of Texas MD Anderson Cancer Center, Houston, Texas
- Rice University, Houston, Texas
| | - Yi Han
- Quantitative Biomedical Research Center, Peter O'Donnell Jr. School of Public Health, University of Texas Southwestern Medical Center, Dallas, Texas
| | - Jian Hu
- The University of Texas MD Anderson Cancer Center UT Health Houston Graduate School of Biomedical Sciences, Houston, Texas
- Department of Cancer Biology, The University of Texas MD Anderson Cancer Center, Houston, Texas
| | - Tao Wang
- Quantitative Biomedical Research Center, Peter O'Donnell Jr. School of Public Health, University of Texas Southwestern Medical Center, Dallas, Texas
- Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, Texas
| | - John V. Heymach
- Department of Thoracic/Head and Neck Medical Oncology, The University of Texas MD Anderson Cancer Center, Houston, Texas
- The University of Texas MD Anderson Cancer Center UT Health Houston Graduate School of Biomedical Sciences, Houston, Texas
| | - Jianjun Zhang
- Department of Thoracic/Head and Neck Medical Oncology, The University of Texas MD Anderson Cancer Center, Houston, Texas
- The University of Texas MD Anderson Cancer Center UT Health Houston Graduate School of Biomedical Sciences, Houston, Texas
- Department of Genomic Medicine, The University of Texas MD Anderson Cancer Center, Houston, Texas
| | - Alexandre Reuben
- Department of Thoracic/Head and Neck Medical Oncology, The University of Texas MD Anderson Cancer Center, Houston, Texas
- The University of Texas MD Anderson Cancer Center UT Health Houston Graduate School of Biomedical Sciences, Houston, Texas
| |
Collapse
|
34
|
Gao Y, Gao Y, Fan Y, Zhu C, Wei Z, Zhou C, Chuai G, Chen Q, Zhang H, Liu Q. Pan-Peptide Meta Learning for T-cell receptor–antigen binding recognition. NAT MACH INTELL 2023. [DOI: 10.1038/s42256-023-00619-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/08/2023]
|
35
|
Akerman O, Isakov H, Levi R, Psevkin V, Louzoun Y. Counting is almost all you need. Front Immunol 2023; 13:1031011. [PMID: 36741395 PMCID: PMC9896581 DOI: 10.3389/fimmu.2022.1031011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2022] [Accepted: 12/27/2022] [Indexed: 01/21/2023] Open
Abstract
The immune memory repertoire encodes the history of present and past infections and immunological attributes of the individual. As such, multiple methods were proposed to use T-cell receptor (TCR) repertoires to detect disease history. We here show that the counting method outperforms two leading algorithms. We then show that the counting can be further improved using a novel attention model to weigh the different TCRs. The attention model is based on the projection of TCRs using a Variational AutoEncoder (VAE). Both counting and attention algorithms predict better than current leading algorithms whether the host had CMV and its HLA alleles. As an intermediate solution between the complex attention model and the very simple counting model, we propose a new Graph Convolutional Network approach that obtains the accuracy of the attention model and the simplicity of the counting model. The code for the models used in the paper is provided at: https://github.com/louzounlab/CountingIsAlmostAllYouNeed.
Collapse
Affiliation(s)
- Ofek Akerman
- Department of Mathematics, Bar-Ilan University, Ramat Gan, Israel
- Department of Computer Science, Bar-Ilan University, Ramat Gan, Israel
| | - Haim Isakov
- Department of Mathematics, Bar-Ilan University, Ramat Gan, Israel
| | - Reut Levi
- Department of Mathematics, Bar-Ilan University, Ramat Gan, Israel
| | - Vladimir Psevkin
- Department of Mathematics, Bar-Ilan University, Ramat Gan, Israel
| | - Yoram Louzoun
- Department of Mathematics, Bar-Ilan University, Ramat Gan, Israel
| |
Collapse
|
36
|
Valkiers S, Gielis S, Van Deuren VML, Laukens K, Meysman P. Clustering and Annotation of T Cell Receptor Repertoires. Methods Mol Biol 2023; 2673:33-51. [PMID: 37258905 DOI: 10.1007/978-1-0716-3239-0_3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/02/2023]
Abstract
Immunological protection against a wide variety of pathogens is largely mediated by the diverse and dynamic T cell receptor (TCR) repertoire, a crucial component of the adaptive immune system. An encounter with infectious agents stimulates specific T cells to initiate a direct immune response to combat intruders. Hence, the TCR repertoire may conceal crucial information regarding current and past infections and might assist in the development and monitoring of vaccines. To unlock its knowledge, we describe a computational workflow involving both supervised and unsupervised machine learning techniques to analyze and annotate full TCR repertoire data. The method is explained using data from a published yellow fever virus (YFV) vaccination study in healthy individuals. The TCR repertoire of one individual is studied before and 2 weeks after vaccination, using an efficient clustering method and identification of YFV-specific TCRs.
Collapse
Affiliation(s)
- Sebastiaan Valkiers
- Adrem Data Lab, Department of Computer Science, University of Antwerp, Antwerp, Belgium
- AUDACIS, Antwerp Unit for Data Analysis and Computation in Immunology and Sequencing, University of Antwerp, Antwerp, Belgium
| | - Sofie Gielis
- Adrem Data Lab, Department of Computer Science, University of Antwerp, Antwerp, Belgium
- AUDACIS, Antwerp Unit for Data Analysis and Computation in Immunology and Sequencing, University of Antwerp, Antwerp, Belgium
| | - Vincent M L Van Deuren
- Adrem Data Lab, Department of Computer Science, University of Antwerp, Antwerp, Belgium
- AUDACIS, Antwerp Unit for Data Analysis and Computation in Immunology and Sequencing, University of Antwerp, Antwerp, Belgium
| | - Kris Laukens
- Adrem Data Lab, Department of Computer Science, University of Antwerp, Antwerp, Belgium
- AUDACIS, Antwerp Unit for Data Analysis and Computation in Immunology and Sequencing, University of Antwerp, Antwerp, Belgium
| | - Pieter Meysman
- Adrem Data Lab, Department of Computer Science, University of Antwerp, Antwerp, Belgium.
- AUDACIS, Antwerp Unit for Data Analysis and Computation in Immunology and Sequencing, University of Antwerp, Antwerp, Belgium.
| |
Collapse
|
37
|
Kanduri C, Scheffer L, Pavlović M, Rand KD, Chernigovskaya M, Pirvandy O, Yaari G, Greiff V, Sandve GK. simAIRR: simulation of adaptive immune repertoires with realistic receptor sequence sharing for benchmarking of immune state prediction methods. Gigascience 2022; 12:giad074. [PMID: 37848619 PMCID: PMC10580376 DOI: 10.1093/gigascience/giad074] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2023] [Revised: 07/20/2023] [Accepted: 08/29/2023] [Indexed: 10/19/2023] Open
Abstract
BACKGROUND Machine learning (ML) has gained significant attention for classifying immune states in adaptive immune receptor repertoires (AIRRs) to support the advancement of immunodiagnostics and therapeutics. Simulated data are crucial for the rigorous benchmarking of AIRR-ML methods. Existing approaches to generating synthetic benchmarking datasets result in the generation of naive repertoires missing the key feature of many shared receptor sequences (selected for common antigens) found in antigen-experienced repertoires. RESULTS We demonstrate that a common approach to generating simulated AIRR benchmark datasets can introduce biases, which may be exploited for undesired shortcut learning by certain ML methods. To mitigate undesirable access to true signals in simulated AIRR datasets, we devised a simulation strategy (simAIRR) that constructs antigen-experienced-like repertoires with a realistic overlap of receptor sequences. simAIRR can be used for constructing AIRR-level benchmarks based on a range of assumptions (or experimental data sources) for what constitutes receptor-level immune signals. This includes the possibility of making or not making any prior assumptions regarding the similarity or commonality of immune state-associated sequences that will be used as true signals. We demonstrate the real-world realism of our proposed simulation approach by showing that basic ML strategies perform similarly on simAIRR-generated and real-world experimental AIRR datasets. CONCLUSIONS This study sheds light on the potential shortcut learning opportunities for ML methods that can arise with the state-of-the-art way of simulating AIRR datasets. simAIRR is available as a Python package: https://github.com/KanduriC/simAIRR.
Collapse
Affiliation(s)
- Chakravarthi Kanduri
- Centre for Bioinformatics, Department of Informatics, University of Oslo, 0373 Oslo, Norway
- UiORealArt Convergence Environment, University of Oslo, 0373 Oslo, Norway
| | - Lonneke Scheffer
- Centre for Bioinformatics, Department of Informatics, University of Oslo, 0373 Oslo, Norway
| | - Milena Pavlović
- Centre for Bioinformatics, Department of Informatics, University of Oslo, 0373 Oslo, Norway
- UiORealArt Convergence Environment, University of Oslo, 0373 Oslo, Norway
| | - Knut Dagestad Rand
- Centre for Bioinformatics, Department of Informatics, University of Oslo, 0373 Oslo, Norway
| | - Maria Chernigovskaya
- Department of Immunology and Oslo University Hospital, University of Oslo, 0373 Oslo, Norway
| | - Oz Pirvandy
- Faculty of Engineering, Bar-Ilan University, 5290002, Israel
| | - Gur Yaari
- Faculty of Engineering, Bar-Ilan University, 5290002, Israel
| | - Victor Greiff
- Department of Immunology and Oslo University Hospital, University of Oslo, 0373 Oslo, Norway
| | - Geir K Sandve
- Centre for Bioinformatics, Department of Informatics, University of Oslo, 0373 Oslo, Norway
- UiORealArt Convergence Environment, University of Oslo, 0373 Oslo, Norway
| |
Collapse
|
38
|
Jokinen E, Dumitrescu A, Huuhtanen J, Gligorijević V, Mustjoki S, Bonneau R, Heinonen M, Lähdesmäki H. TCRconv: predicting recognition between T cell receptors and epitopes using contextualized motifs. Bioinformatics 2022; 39:6881078. [PMID: 36477794 PMCID: PMC9825763 DOI: 10.1093/bioinformatics/btac788] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2022] [Revised: 11/01/2022] [Accepted: 12/06/2022] [Indexed: 12/12/2022] Open
Abstract
MOTIVATION T cells use T cell receptors (TCRs) to recognize small parts of antigens, called epitopes, presented by major histocompatibility complexes. Once an epitope is recognized, an immune response is initiated and T cell activation and proliferation by clonal expansion begin. Clonal populations of T cells with identical TCRs can remain in the body for years, thus forming immunological memory and potentially mappable immunological signatures, which could have implications in clinical applications including infectious diseases, autoimmunity and tumor immunology. RESULTS We introduce TCRconv, a deep learning model for predicting recognition between TCRs and epitopes. TCRconv uses a deep protein language model and convolutions to extract contextualized motifs and provides state-of-the-art TCR-epitope prediction accuracy. Using TCR repertoires from COVID-19 patients, we demonstrate that TCRconv can provide insight into T cell dynamics and phenotypes during the disease. AVAILABILITY AND IMPLEMENTATION TCRconv is available at https://github.com/emmijokinen/tcrconv. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
| | - Alexandru Dumitrescu
- Department of Computer Science, Aalto University, Espoo 02150, Finland,Helsinki Institute of Life Science, University of Helsinki, Helsinki 00014, Finland
| | - Jani Huuhtanen
- Department of Clinical Chemistry and Hematology, Translational Immunology Research Program, University of Helsinki, Helsinki 00290, Finland,Hematology Research Unit Helsinki, Helsinki University Hospital Comprehensive Cancer Center, Helsinki 00290, Finland
| | - Vladimir Gligorijević
- Center for Computational Biology (CCB), Flatiron Institute, Simons Foundation, New York, NY 10010, USA,Prescient Design, Genentech, New York, NY, USA
| | - Satu Mustjoki
- Department of Clinical Chemistry and Hematology, Translational Immunology Research Program, University of Helsinki, Helsinki 00290, Finland,Hematology Research Unit Helsinki, Helsinki University Hospital Comprehensive Cancer Center, Helsinki 00290, Finland,iCAN Digital Precision Cancer Medicine Flagship, Helsinki, Finland
| | - Richard Bonneau
- Center for Computational Biology (CCB), Flatiron Institute, Simons Foundation, New York, NY 10010, USA,Prescient Design, Genentech, New York, NY, USA,Center for Data Science, New York University, New York, NY 10011, USA,Department of Computer Science, New York University, Courant Institute of Mathematical Sciences, New York, NY 10012, USA
| | - Markus Heinonen
- Department of Computer Science, Aalto University, Espoo 02150, Finland
| | | |
Collapse
|
39
|
Fang Y, Liu X, Liu H. Attention-aware contrastive learning for predicting T cell receptor-antigen binding specificity. Brief Bioinform 2022; 23:6696141. [PMID: 36094087 DOI: 10.1093/bib/bbac378] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2022] [Revised: 07/06/2022] [Accepted: 08/09/2022] [Indexed: 12/14/2022] Open
Abstract
MOTIVATION It has been proven that only a small fraction of the neoantigens presented by major histocompatibility complex (MHC) class I molecules on the cell surface can elicit T cells. This restriction can be attributed to the binding specificity of T cell receptor (TCR) and peptide-MHC complex (pMHC). Computational prediction of T cells binding to neoantigens is a challenging and unresolved task. RESULTS In this paper, we proposed an attention-aware contrastive learning model, ATMTCR, to infer the TCR-pMHC binding specificity. For each TCR sequence, we used a transformer encoder to transform it to latent representation, and then masked a percentage of amino acids guided by attention weights to generate its contrastive view. Compared to fully-supervised baseline model, we verified that contrastive learning-based pretraining on large-scale TCR sequences significantly improved the prediction performance of downstream tasks. Interestingly, masking a percentage of amino acids with low attention weights yielded best performance compared to other masking strategies. Comparison experiments on two independent datasets demonstrated our method achieved better performance than other existing algorithms. Moreover, we identified important amino acids and their positional preference through attention weights, which indicated the potential interpretability of our proposed model.
Collapse
Affiliation(s)
- Yiming Fang
- School of Computer Science and Technology, Nanjing Tech University, 211816, Nanjing, China
| | - Xuejun Liu
- School of Computer Science and Technology, Nanjing Tech University, 211816, Nanjing, China
| | - Hui Liu
- School of Computer Science and Technology, Nanjing Tech University, 211816, Nanjing, China
| |
Collapse
|
40
|
Grazioli F, Mösch A, Machart P, Li K, Alqassem I, O’Donnell TJ, Min MR. On TCR binding predictors failing to generalize to unseen peptides. Front Immunol 2022; 13:1014256. [PMID: 36341448 PMCID: PMC9634250 DOI: 10.3389/fimmu.2022.1014256] [Citation(s) in RCA: 26] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2022] [Accepted: 10/05/2022] [Indexed: 11/18/2022] Open
Abstract
Several recent studies investigate TCR-peptide/-pMHC binding prediction using machine learning or deep learning approaches. Many of these methods achieve impressive results on test sets, which include peptide sequences that are also included in the training set. In this work, we investigate how state-of-the-art deep learning models for TCR-peptide/-pMHC binding prediction generalize to unseen peptides. We create a dataset including positive samples from IEDB, VDJdb, McPAS-TCR, and the MIRA set, as well as negative samples from both randomization and 10X Genomics assays. We name this collection of samples TChard. We propose the hard split, a simple heuristic for training/test split, which ensures that test samples exclusively present peptides that do not belong to the training set. We investigate the effect of different training/test splitting techniques on the models’ test performance, as well as the effect of training and testing the models using mismatched negative samples generated randomly, in addition to the negative samples derived from assays. Our results show that modern deep learning methods fail to generalize to unseen peptides. We provide an explanation why this happens and verify our hypothesis on the TChard dataset. We then conclude that robust prediction of TCR recognition is still far for being solved.
Collapse
Affiliation(s)
- Filippo Grazioli
- Biomedical AI Group, NEC Laboratories Europe, Heidelberg, Germany
- *Correspondence: Filippo Grazioli, ; Martin Renqiang Min,
| | - Anja Mösch
- Biomedical AI Group, NEC Laboratories Europe, Heidelberg, Germany
| | - Pierre Machart
- Biomedical AI Group, NEC Laboratories Europe, Heidelberg, Germany
| | - Kai Li
- Machine Learning Department, NEC Laboratories America, Princeton, NJ, United States
| | - Israa Alqassem
- Biomedical AI Group, NEC Laboratories Europe, Heidelberg, Germany
| | - Timothy J. O’Donnell
- Division of Hematology and Medical Oncology, Icahn School of Medicine at Mount Sinai, New York, NY, United States
| | - Martin Renqiang Min
- Machine Learning Department, NEC Laboratories America, Princeton, NJ, United States
- *Correspondence: Filippo Grazioli, ; Martin Renqiang Min,
| |
Collapse
|
41
|
Huuhtanen J, Chen L, Jokinen E, Kasanen H, Lönnberg T, Kreutzman A, Peltola K, Hernberg M, Wang C, Yee C, Lähdesmäki H, Davis MM, Mustjoki S. Evolution and modulation of antigen-specific T cell responses in melanoma patients. Nat Commun 2022; 13:5988. [PMID: 36220826 PMCID: PMC9553985 DOI: 10.1038/s41467-022-33720-z] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2022] [Accepted: 09/15/2022] [Indexed: 11/15/2022] Open
Abstract
Analyzing antigen-specific T cell responses at scale has been challenging. Here, we analyze three types of T cell receptor (TCR) repertoire data (antigen-specific TCRs, TCR-repertoire, and single-cell RNA + TCRαβ-sequencing data) from 515 patients with primary or metastatic melanoma and compare it to 783 healthy controls. Although melanoma-associated antigen (MAA) -specific TCRs are restricted to individuals, they share sequence similarities that allow us to build classifiers for predicting anti-MAA T cells. The frequency of anti-MAA T cells distinguishes melanoma patients from healthy and predicts metastatic recurrence from primary melanoma. Anti-MAA T cells have stem-like properties and frequent interactions with regulatory T cells and tumor cells via Galectin9-TIM3 and PVR-TIGIT -axes, respectively. In the responding patients, the number of expanded anti-MAA clones are higher after the anti-PD1(+anti-CTLA4) therapy and the exhaustion phenotype is rescued. Our systems immunology approach paves the way for understanding antigen-specific responses in human disorders. Previous studies have characterized the diversity and dynamics of the T cell receptor (TCR) repertoire in patients with solid cancer. Here, by analyzing TCR repertoire data from multiple datasets, the authors report that melanoma-associated antigen-specific TCRs can be used to separate metastatic melanoma patients from healthy controls and to follow anti-tumor responses in patients treated with immunotherapy.
Collapse
|
42
|
Mateyka LM, Strobl PM, Jarosch S, Scheu SJC, Busch DH, D'Ippolito E. Gene Signatures of T-Cell Activation Can Serve as Predictors of Functionality for SARS-CoV-2-Specific T-Cell Receptors. Vaccines (Basel) 2022; 10:vaccines10101617. [PMID: 36298482 PMCID: PMC9611811 DOI: 10.3390/vaccines10101617] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2022] [Revised: 09/15/2022] [Accepted: 09/23/2022] [Indexed: 11/16/2022] Open
Abstract
The importance of T cells in controlling SARS-CoV-2 infections has been demonstrated widely, but insights into the quality of these responses are still limited due to technical challenges. Indeed, understanding the functionality of the T-cell receptor (TCR) repertoire of a polyclonal antigen-specific population still requires the tedious work of T-cell cloning or TCR re-expression and subsequent characterization. In this work, we show that it is possible to discriminate highly functional and bystander TCRs based on gene signatures of T-cell activation induced by recent peptide stimulation. SARS-CoV-2-specific TCRs previously identified by cytokine release after peptide restimulation and subsequent single-cell RNA sequencing were re-expressed via CRISPR-Cas9-mediated gene editing into a Jurkat-based reporter cell line system suitable for high-throughput screening. We could observe differences in SARS-CoV-2 epitope recognition as well as a wide range of functional avidities. By correlating these in vitro TCR engineered functional data with the transcriptomic profiles of the corresponding TCR-expressing parental T cells, we could validate that gene signatures of recent T-cell activation accurately identify and predict truly SARS-CoV-2-specific TCRs. In summary, this work paves the way for alternative approaches useful for the functional analysis of global antigen-specific TCR repertoires with largely improved throughput.
Collapse
Affiliation(s)
- Laura M Mateyka
- Institute for Medical Microbiology, Immunology and Hygiene, Technical University of Munich (TUM), 81675 Munich, Germany
| | - Philipp M Strobl
- Institute for Medical Microbiology, Immunology and Hygiene, Technical University of Munich (TUM), 81675 Munich, Germany
| | - Sebastian Jarosch
- Institute for Medical Microbiology, Immunology and Hygiene, Technical University of Munich (TUM), 81675 Munich, Germany
| | - Sebastian J C Scheu
- Institute for Medical Microbiology, Immunology and Hygiene, Technical University of Munich (TUM), 81675 Munich, Germany
| | - Dirk H Busch
- Institute for Medical Microbiology, Immunology and Hygiene, Technical University of Munich (TUM), 81675 Munich, Germany
- German Center for Infection Research (DZIF), Partner Site Munich, 81675 Munich, Germany
| | - Elvira D'Ippolito
- Institute for Medical Microbiology, Immunology and Hygiene, Technical University of Munich (TUM), 81675 Munich, Germany
| |
Collapse
|
43
|
Huuhtanen J, Ilander M, Yadav B, Dufva OM, Lähteenmäki H, Kasanen T, Klievink J, Olsson-Strömberg U, Stentoft J, Richter J, Koskenvesa P, Höglund M, Söderlund S, Dreimane A, Porkka K, Gedde-Dahl T, Gjertsen BT, Stenke L, Myhr-Eriksson K, Markevärn B, Lübking A, Dimitrijevic A, Udby L, Bjerrum OW, Hjorth-Hansen H, Mustjoki S. IFN-α with dasatinib broadens the immune repertoire in patients with chronic-phase chronic myeloid leukemia. J Clin Invest 2022; 132:152585. [PMID: 36047494 PMCID: PMC9433106 DOI: 10.1172/jci152585] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2021] [Accepted: 07/07/2022] [Indexed: 11/24/2022] Open
Abstract
In chronic myeloid leukemia (CML), combination therapies with tyrosine kinase inhibitors (TKIs) aim to improve the achievement of deep molecular remission that would allow therapy discontinuation. IFN-α is one promising candidate, as it has long-lasting effects on both malignant and immune cells. In connection with a multicenter clinical trial combining dasatinib with IFN-α in 40 patients with chronic-phase CML (NordCML007, NCT01725204), we performed immune monitoring with single-cell RNA and T cell receptor (TCR) sequencing (n = 4, 12 samples), bulk TCRβ sequencing (n = 13, 26 samples), flow cytometry (n = 40, 106 samples), cytokine analyses (n = 17, 80 samples), and ex vivo functional studies (n = 39, 80 samples). Dasatinib drove the immune repertoire toward terminally differentiated NK and CD8+ T cells with dampened functional capabilities. Patients with dasatinib-associated pleural effusions had increased numbers of CD8+ recently activated effector memory T (Temra) cells. In vitro, dasatinib prevented CD3-induced cell death by blocking TCR signaling. The addition of IFN-α reversed the terminally differentiated phenotypes and increased the number of costimulatory intercellular interactions and the number of unique putative epitope-specific TCR clusters. In vitro IFN-α had costimulatory effects on TCR signaling. Our work supports the combination of IFN-α with TKI therapy, as IFN-α broadens the immune repertoire and restores immunological function.
Collapse
Affiliation(s)
- Jani Huuhtanen
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland.,Hematology Research Unit Helsinki and Helsinki University Hospital Comprehensive Cancer Center, Department of Hematology, Helsinki, Finland.,Department of Computer Science, Aalto University, Espoo, Finland
| | - Mette Ilander
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland.,Hematology Research Unit Helsinki and Helsinki University Hospital Comprehensive Cancer Center, Department of Hematology, Helsinki, Finland
| | - Bhagwan Yadav
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland.,Hematology Research Unit Helsinki and Helsinki University Hospital Comprehensive Cancer Center, Department of Hematology, Helsinki, Finland
| | - Olli Mj Dufva
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland.,Hematology Research Unit Helsinki and Helsinki University Hospital Comprehensive Cancer Center, Department of Hematology, Helsinki, Finland
| | - Hanna Lähteenmäki
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland.,Hematology Research Unit Helsinki and Helsinki University Hospital Comprehensive Cancer Center, Department of Hematology, Helsinki, Finland
| | - Tiina Kasanen
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland.,Hematology Research Unit Helsinki and Helsinki University Hospital Comprehensive Cancer Center, Department of Hematology, Helsinki, Finland
| | - Jay Klievink
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland.,Hematology Research Unit Helsinki and Helsinki University Hospital Comprehensive Cancer Center, Department of Hematology, Helsinki, Finland
| | - Ulla Olsson-Strömberg
- Department of Medical Sciences, Uppsala University and Hematology Section, Uppsala University Hospital, Uppsala, Sweden
| | - Jesper Stentoft
- Department of Hematology, Aarhus University Hospital, Aarhus, Denmark
| | - Johan Richter
- Department of Hematology, Oncology and Radiation Physics, Skåne University Hospital, Lund, Sweden
| | - Perttu Koskenvesa
- Hematology Research Unit Helsinki and Helsinki University Hospital Comprehensive Cancer Center, Department of Hematology, Helsinki, Finland
| | - Martin Höglund
- Department of Medical Sciences, Uppsala University and Hematology Section, Uppsala University Hospital, Uppsala, Sweden
| | - Stina Söderlund
- Department of Medical Sciences, Uppsala University and Hematology Section, Uppsala University Hospital, Uppsala, Sweden
| | - Arta Dreimane
- Department of Medical and Health Sciences, Linköping University, Department of Hematology, County Council of Östergötland, Linköping, Sweden
| | - Kimmo Porkka
- Hematology Research Unit Helsinki and Helsinki University Hospital Comprehensive Cancer Center, Department of Hematology, Helsinki, Finland.,iCAN Digital Precision Cancer Medicine Flagship, Helsinki, Finland
| | - Tobias Gedde-Dahl
- Department of Hematology, Oslo University Hospital, Rikshospitalet, Oslo, Norway
| | - Björn T Gjertsen
- Department of Internal Medicine, Hematology Section, Haukeland University Hospital and Department of Clinical Science, University of Bergen, Bergen, Norway
| | - Leif Stenke
- Department of Hematology, Karolinska University Hospital and Karolinska Institutet, Stockholm, Sweden
| | | | - Berit Markevärn
- Department of Hematology, Umeå University Hospital, Umeå, Sweden
| | - Anna Lübking
- Department of Hematology, Oncology and Radiation Physics, Skåne University Hospital, Lund, Sweden
| | | | - Lene Udby
- Department of Hematology, Zealand University Hospital, Roskilde, Denmark
| | - Ole Weis Bjerrum
- Department of Hematology, Rigshospitalet, University Hospital of Copenhagen, Copenhagen, Denmark
| | - Henrik Hjorth-Hansen
- Department of Hematology, St. Olavs Hospital, Trondheim, Norway.,Department of Cancer Research and Molecular Medicine, Norwegian University of Science and Technology (NTNU), Trondheim, Norway
| | - Satu Mustjoki
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland.,Hematology Research Unit Helsinki and Helsinki University Hospital Comprehensive Cancer Center, Department of Hematology, Helsinki, Finland.,iCAN Digital Precision Cancer Medicine Flagship, Helsinki, Finland
| |
Collapse
|
44
|
Bucktrout SL, Banovich NE, Butterfield LH, Cimen-Bozkus C, Giles JR, Good Z, Goodman D, Jonsson VD, Lareau C, Marson A, Maurer DM, Munson PV, Stubbington M, Taylor S, Cutchin A. Advancing T cell-based cancer therapy with single-cell technologies. Nat Med 2022; 28:1761-1764. [PMID: 36127419 DOI: 10.1038/s41591-022-01986-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Affiliation(s)
| | - Nicholas E Banovich
- Integrated Cancer Genomics Division, Translational Genomics Research Institute, Phoenix, AZ, USA
| | | | - Cansu Cimen-Bozkus
- Parker Institute of Cancer Immunotherapy, San Francisco, CA, USA
- Tisch Cancer Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Josephine R Giles
- Parker Institute of Cancer Immunotherapy, San Francisco, CA, USA
- Department of Systems Pharmacology and Translational Therapeutics, University of Pennsylvania, Philadelphia, PA, USA
- Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Zinaida Good
- Parker Institute of Cancer Immunotherapy, San Francisco, CA, USA
- Stanford Cancer Institute and Department of Biomedical Data Science, Stanford University School of Medicine, Stanford, CA, USA
| | - Daniel Goodman
- Parker Institute of Cancer Immunotherapy, San Francisco, CA, USA
- Microbiology and Immunology, School of Medicine, University of California San Francisco, San Francisco, CA, USA
| | - Vanessa D Jonsson
- Department of Applied Mathematics, University of California, Santa Cruz, Santa Cruz, CA, USA
- Department of Biomolecular Engineering, University of California, Santa Cruz, Santa Cruz, CA, USA
- Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
| | - Caleb Lareau
- Parker Institute of Cancer Immunotherapy, San Francisco, CA, USA
- Department of Pathology, Stanford University School of Medicine, Stanford, CA, USA
| | - Alexander Marson
- Parker Institute of Cancer Immunotherapy, San Francisco, CA, USA
- Gladstone-UCSF Institute for Genomic Immunology, San Francisco, CA, USA
| | - Deena M Maurer
- Parker Institute of Cancer Immunotherapy, San Francisco, CA, USA
| | - Paul V Munson
- Parker Institute of Cancer Immunotherapy, San Francisco, CA, USA
| | | | | | | |
Collapse
|
45
|
Cai M, Bang S, Zhang P, Lee H. ATM-TCR: TCR-Epitope Binding Affinity Prediction Using a Multi-Head Self-Attention Model. Front Immunol 2022; 13:893247. [PMID: 35874725 PMCID: PMC9299376 DOI: 10.3389/fimmu.2022.893247] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2022] [Accepted: 04/27/2022] [Indexed: 11/29/2022] Open
Abstract
TCR-epitope pair binding is the key component for T cell regulation. The ability to predict whether a given pair binds is fundamental to understanding the underlying biology of the binding mechanism as well as developing T-cell mediated immunotherapy approaches. The advent of large-scale public databases containing TCR-epitope binding pairs enabled the recent development of computational prediction methods for TCR-epitope binding. However, the number of epitopes reported along with binding TCRs is far too small, resulting in poor out-of-sample performance for unseen epitopes. In order to address this issue, we present our model ATM-TCR which uses a multi-head self-attention mechanism to capture biological contextual information and improve generalization performance. Additionally, we present a novel application of the attention map from our model to improve out-of-sample performance by demonstrating on recent SARS-CoV-2 data.
Collapse
Affiliation(s)
- Michael Cai
- School of Computing and Augmented Intelligence, Arizona State University, Tempe, AZ, United States.,Biodesign Institute, Arizona State University, Tempe, AZ, United States
| | - Seojin Bang
- Biodesign Institute, Arizona State University, Tempe, AZ, United States
| | - Pengfei Zhang
- School of Computing and Augmented Intelligence, Arizona State University, Tempe, AZ, United States.,Biodesign Institute, Arizona State University, Tempe, AZ, United States
| | - Heewook Lee
- School of Computing and Augmented Intelligence, Arizona State University, Tempe, AZ, United States.,Biodesign Institute, Arizona State University, Tempe, AZ, United States
| |
Collapse
|
46
|
Papadopoulou I, Nguyen AP, Weber A, Martínez MR. DECODE: a computational pipeline to discover T cell receptor binding rules. Bioinformatics 2022; 38:i246-i254. [PMID: 35758821 PMCID: PMC9235487 DOI: 10.1093/bioinformatics/btac257] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
Abstract
Motivation Understanding the mechanisms underlying T cell receptor (TCR) binding is of fundamental importance to understanding adaptive immune responses. A better understanding of the biochemical rules governing TCR binding can be used, e.g. to guide the design of more powerful and safer T cell-based therapies. Advances in repertoire sequencing technologies have made available millions of TCR sequences. Data abundance has, in turn, fueled the development of many computational models to predict the binding properties of TCRs from their sequences. Unfortunately, while many of these works have made great strides toward predicting TCR specificity using machine learning, the black-box nature of these models has resulted in a limited understanding of the rules that govern the binding of a TCR and an epitope. Results We present an easy-to-use and customizable computational pipeline, DECODE, to extract the binding rules from any black-box model designed to predict the TCR-epitope binding. DECODE offers a range of analytical and visualization tools to guide the user in the extraction of such rules. We demonstrate our pipeline on a recently published TCR-binding prediction model, TITAN, and show how to use the provided metrics to assess the quality of the computed rules. In conclusion, DECODE can lead to a better understanding of the sequence motifs that underlie TCR binding. Our pipeline can facilitate the investigation of current immunotherapeutic challenges, such as cross-reactive events due to off-target TCR binding. Availability and implementation Code is available publicly at https://github.com/phineasng/DECODE. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Iliana Papadopoulou
- IBM Research Europe, 8803 Rüschlikon, Switzerland.,ETH Zurich, Department of Biosystems Science and Engineering (D-BSSE), 4058 Basel, Switzerland
| | - An-Phi Nguyen
- IBM Research Europe, 8803 Rüschlikon, Switzerland.,ETH Zurich, Department of Mathematics (D-Math), 8092 Zurich, Switzerland
| | - Anna Weber
- IBM Research Europe, 8803 Rüschlikon, Switzerland.,ETH Zurich, Department of Biosystems Science and Engineering (D-BSSE), 4058 Basel, Switzerland
| | | |
Collapse
|
47
|
Glazer N, Akerman O, Louzoun Y. Naive and memory T cells TCR-HLA-binding prediction. OXFORD OPEN IMMUNOLOGY 2022; 3:iqac001. [PMID: 36846560 PMCID: PMC9914496 DOI: 10.1093/oxfimm/iqac001] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2022] [Revised: 05/01/2022] [Accepted: 05/17/2022] [Indexed: 11/12/2022] Open
Abstract
T cells recognize antigens through the interaction of their T cell receptor (TCR) with a peptide-major histocompatibility complex (pMHC) molecule. Following thymic-positive selection, TCRs in peripheral naive T cells are expected to bind MHC alleles of the host. Peripheral clonal selection is expected to further increase the frequency of antigen-specific TCRs that bind to the host MHC alleles. To check for a systematic preference for MHC-binding T cells in TCR repertoires, we developed Natural Language Processing-based methods to predict TCR-MHC binding independently of the peptide presented for Class I MHC alleles. We trained a classifier on published TCR-pMHC binding pairs and obtained a high area under curve (AUC) of over 0.90 on the test set. However, when applied to TCR repertoires, the accuracy of the classifier dropped. We thus developed a two-stage prediction model, based on large-scale naive and memory TCR repertoires, denoted TCR HLA-binding predictor (CLAIRE). Since each host carries multiple human leukocyte antigen (HLA) alleles, we first computed whether a TCR on a CD8 T cell binds an MHC from any of the host Class-I HLA alleles. We then performed an iteration, where we predict the binding with the most probable allele from the first round. We show that this classifier is more precise for memory than for naïve cells. Moreover, it can be transferred between datasets. Finally, we developed a CD4-CD8 T cell classifier to apply CLAIRE to unsorted bulk sequencing datasets and showed a high AUC of 0.96 and 0.90 on large datasets. CLAIRE is available through a GitHub at: https://github.com/louzounlab/CLAIRE, and as a server at: https://claire.math.biu.ac.il/Home.
Collapse
Affiliation(s)
- Neta Glazer
- Department of Mathematics, Bar-Ilan University, Ramat Gan, Israel
| | - Ofek Akerman
- Department of Mathematics, Bar-Ilan University, Ramat Gan, Israel
| | - Yoram Louzoun
- Correspondence address. Department of Mathematics, Bar-Ilan University, Ramat Gan, Israel. E-mail:
| |
Collapse
|
48
|
Yu J, Wang L, Kong X, Cao Y, Zhang M, Sun Z, Liu Y, Wang J, Shen B, Bo X, Feng J. CAD v1.0: Cancer Antigens Database Platform for Cancer Antigen Algorithm Development and Information Exploration. Front Bioeng Biotechnol 2022; 10:819583. [PMID: 35646870 PMCID: PMC9133807 DOI: 10.3389/fbioe.2022.819583] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2022] [Accepted: 04/06/2022] [Indexed: 12/02/2022] Open
Abstract
Cancer vaccines have gradually attracted attention for their tremendous preclinical and clinical performance. With the development of next-generation sequencing technologies and related algorithms, pipelines based on sequencing and machine learning methods have become mainstream in cancer antigen prediction; of particular focus are neoantigens, mutation peptides that only exist in tumor cells that lack central tolerance and have fewer side effects. The rapid prediction and filtering of neoantigen peptides are crucial to the development of neoantigen-based cancer vaccines. However, due to the lack of verified neoantigen datasets and insufficient research on the properties of neoantigens, neoantigen prediction algorithms still need to be improved. Here, we recruited verified cancer antigen peptides and collected as much relevant peptide information as possible. Then, we discussed the role of each dataset for algorithm improvement in cancer antigen research, especially neoantigen prediction. A platform, Cancer Antigens Database (CAD, http://cad.bio-it.cn/), was designed to facilitate users to perform a complete exploration of cancer antigens online.
Collapse
Affiliation(s)
- Jijun Yu
- State Key Laboratory of Toxicology and Medical Countermeasures, Beijing Institute of Pharmacology and Toxicology, Beijing, China
- Beijing Key Laboratory of Therapeutic Gene Engineering Antibody, Beijing, China
| | - Luoxuan Wang
- State Key Laboratory of Toxicology and Medical Countermeasures, Beijing Key Laboratory of Neuropsychopharmacology, Beijing Institute of Pharmacology and Toxicology, Beijing, China
| | - Xiangya Kong
- Beijing Geneworks Technology Co., Ltd., Beijing, China
| | - Yang Cao
- Center of Growth, Metabolism and Aging, Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, China
| | - Mengmeng Zhang
- State Key Laboratory of Toxicology and Medical Countermeasures, Beijing Institute of Pharmacology and Toxicology, Beijing, China
- Beijing Capital Agribusiness Future Biotechnology Co, Beijing, China
| | - Zhaolin Sun
- Beijing Capital Agribusiness Future Biotechnology Co, Beijing, China
| | - Yang Liu
- Center of Growth, Metabolism and Aging, Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, China
| | - Jing Wang
- State Key Laboratory of Toxicology and Medical Countermeasures, Beijing Institute of Pharmacology and Toxicology, Beijing, China
- Beijing Key Laboratory of Therapeutic Gene Engineering Antibody, Beijing, China
| | - Beifen Shen
- State Key Laboratory of Toxicology and Medical Countermeasures, Beijing Institute of Pharmacology and Toxicology, Beijing, China
- Beijing Key Laboratory of Therapeutic Gene Engineering Antibody, Beijing, China
| | - Xiaochen Bo
- Department of Biotechnology, Beijing Institute of Radiation Medicine, Beijing, China
- *Correspondence: Xiaochen Bo, ; Jiannan Feng,
| | - Jiannan Feng
- State Key Laboratory of Toxicology and Medical Countermeasures, Beijing Institute of Pharmacology and Toxicology, Beijing, China
- Beijing Key Laboratory of Therapeutic Gene Engineering Antibody, Beijing, China
- *Correspondence: Xiaochen Bo, ; Jiannan Feng,
| |
Collapse
|
49
|
Abondio P, De Intinis C, da Silva Gonçalves Vianez Júnior JL, Pace L. SINGLE CELL MULTIOMIC APPROACHES TO DISENTANGLE T CELL HETEROGENEITY. Immunol Lett 2022; 246:37-51. [DOI: 10.1016/j.imlet.2022.04.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2021] [Revised: 04/16/2022] [Accepted: 04/26/2022] [Indexed: 11/29/2022]
|
50
|
Huuhtanen J, Bhattacharya D, Lönnberg T, Kankainen M, Kerr C, Theodoropoulos J, Rajala H, Gurnari C, Kasanen T, Braun T, Teramo A, Zambello R, Herling M, Ishida F, Kawakami T, Salmi M, Loughran T, Maciejewski JP, Lähdesmäki H, Kelkka T, Mustjoki S. Single-cell characterization of leukemic and non-leukemic immune repertoires in CD8 + T-cell large granular lymphocytic leukemia. Nat Commun 2022; 13:1981. [PMID: 35411050 PMCID: PMC9001660 DOI: 10.1038/s41467-022-29173-z] [Citation(s) in RCA: 26] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2021] [Accepted: 02/17/2022] [Indexed: 12/13/2022] Open
Abstract
T cell large granular lymphocytic leukemia (T-LGLL) is a rare lymphoproliferative disorder of mature, clonally expanded T cells, where somatic-activating STAT3 mutations are common. Although T-LGLL has been described as a chronic T cell response to an antigen, the function of the non-leukemic immune system in this response is largely uncharacterized. Here, by utilizing single-cell RNA and T cell receptor profiling (scRNA+TCRαβ-seq), we show that irrespective of STAT3 mutation status, T-LGLL clonotypes are more cytotoxic and exhausted than healthy reactive clonotypes. In addition, T-LGLL clonotypes show more active cell communication than reactive clones with non-leukemic immune cells via costimulatory cell-cell interactions, monocyte-secreted proinflammatory cytokines, and T-LGLL-clone-secreted IFNγ. Besides the leukemic repertoire, the non-leukemic T cell repertoire in T-LGLL is also more mature, cytotoxic, and clonally restricted than in other cancers and autoimmune disorders. Finally, 72% of the leukemic T-LGLL clonotypes share T cell receptor similarities with their non-leukemic repertoire, linking the leukemic and non-leukemic repertoires together via possible common target antigens. Our results provide a rationale to prioritize therapies that target the entire immune repertoire and not only the T-LGLL clonotype.
Collapse
Affiliation(s)
- Jani Huuhtanen
- Hematology Research Unit Helsinki, University of Helsinki and Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland
- Department of Computer Science, Aalto University, Espoo, Finland
| | - Dipabarna Bhattacharya
- Hematology Research Unit Helsinki, University of Helsinki and Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland
| | - Tapio Lönnberg
- Turku Bioscience Centre, University of Turku and Åbo Akademi University, Turku, Finland
- InFlames Flagship Center, University of Turku, Turku, Finland
| | - Matti Kankainen
- Hematology Research Unit Helsinki, University of Helsinki and Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland
| | - Cassandra Kerr
- Translational Hematology and Oncology Department, Taussig Cancer Center, Cleveland Clinic, Cleveland, OH, USA
| | - Jason Theodoropoulos
- Hematology Research Unit Helsinki, University of Helsinki and Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland
- Department of Computer Science, Aalto University, Espoo, Finland
| | - Hanna Rajala
- Hematology Research Unit Helsinki, University of Helsinki and Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland
| | - Carmelo Gurnari
- Translational Hematology and Oncology Department, Taussig Cancer Center, Cleveland Clinic, Cleveland, OH, USA
| | - Tiina Kasanen
- Hematology Research Unit Helsinki, University of Helsinki and Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland
| | - Till Braun
- Department I of Internal Medicine, Center for Integrated Oncology (CIO), Aachen-Bonn-Cologne-Duesseldorf, University of Cologne (UoC), Cologne, Germany
| | - Antonella Teramo
- Department of Medicine (DIMED), Hematology and Clinical Immunology Branch, Padova University School of Medicine, Padova, Italy
- Veneto Institute of Molecular Medicine (VIMM), Padova, Italy
| | - Renato Zambello
- Department of Medicine (DIMED), Hematology and Clinical Immunology Branch, Padova University School of Medicine, Padova, Italy
- Veneto Institute of Molecular Medicine (VIMM), Padova, Italy
| | - Marco Herling
- Department I of Internal Medicine, Center for Integrated Oncology (CIO), Aachen-Bonn-Cologne-Duesseldorf, University of Cologne (UoC), Cologne, Germany
- Clinic of Hematology and Cellular Therapy, University of Leipzig, Leipzig, Germany
| | - Fumihiro Ishida
- Department of Biomedical Laboratory Sciences, Shinshu University School of Medicine, Matsumoto, Japan
- Division of Hematology, Department of Internal Medicine, Shinshu University School of Medicine, Matsumoto, Japan
| | - Toru Kawakami
- Department of Biomedical Laboratory Sciences, Shinshu University School of Medicine, Matsumoto, Japan
- Division of Hematology, Department of Internal Medicine, Shinshu University School of Medicine, Matsumoto, Japan
| | - Marko Salmi
- InFlames Flagship Center, University of Turku, Turku, Finland
- MediCity Research Laboratory and Institute of Biomedicine, University of Turku, Turku, Finland
| | - Thomas Loughran
- Division of Hematology/Oncology, Department of Medicine, UVA Cancer Center, University of Virginia, Charlottesville, VA, USA
| | - Jaroslaw P Maciejewski
- Translational Hematology and Oncology Department, Taussig Cancer Center, Cleveland Clinic, Cleveland, OH, USA
| | - Harri Lähdesmäki
- Department of Computer Science, Aalto University, Espoo, Finland
| | - Tiina Kelkka
- Hematology Research Unit Helsinki, University of Helsinki and Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland
| | - Satu Mustjoki
- Hematology Research Unit Helsinki, University of Helsinki and Helsinki University Hospital Comprehensive Cancer Center, Helsinki, Finland.
- Translational Immunology Research Program and Department of Clinical Chemistry and Hematology, University of Helsinki, Helsinki, Finland.
- iCAN Digital Precision Medicine Flagship, Helsinki, Finland.
| |
Collapse
|