1
|
Vegesana K, Thomas PG. Cracking the code of adaptive immunity: The role of computational tools. Cell Syst 2024; 15:1156-1167. [PMID: 39701033 DOI: 10.1016/j.cels.2024.11.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2024] [Revised: 06/14/2024] [Accepted: 11/14/2024] [Indexed: 12/21/2024]
Abstract
In recent years, the advances in high-throughput and deep sequencing have generated a diverse amount of adaptive immune repertoire data. This surge in data has seen a proportional increase in computational methods aimed to characterize T cell receptor (TCR) repertoires. In this perspective, we will provide a brief commentary on the various domains of TCR repertoire analysis, their respective computational methods, and the ongoing challenges. Given the breadth of methods and applications of TCR analysis, we will focus our perspective on sequence-based computational methods.
Collapse
Affiliation(s)
- Kasi Vegesana
- Department of Host-Microbe Interactions, St. Jude Children's Research Hospital, Memphis, TN, USA
| | - Paul G Thomas
- Department of Host-Microbe Interactions, St. Jude Children's Research Hospital, Memphis, TN, USA.
| |
Collapse
|
2
|
O'Donnell TJ, Kanduri C, Isacchini G, Limenitakis JP, Brachman RA, Alvarez RA, Haff IH, Sandve GK, Greiff V. Reading the repertoire: Progress in adaptive immune receptor analysis using machine learning. Cell Syst 2024; 15:1168-1189. [PMID: 39701034 DOI: 10.1016/j.cels.2024.11.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2024] [Revised: 08/16/2024] [Accepted: 11/14/2024] [Indexed: 12/21/2024]
Abstract
The adaptive immune system holds invaluable information on past and present immune responses in the form of B and T cell receptor sequences, but we are limited in our ability to decode this information. Machine learning approaches are under active investigation for a range of tasks relevant to understanding and manipulating the adaptive immune receptor repertoire, including matching receptors to the antigens they bind, generating antibodies or T cell receptors for use as therapeutics, and diagnosing disease based on patient repertoires. Progress on these tasks has the potential to substantially improve the development of vaccines, therapeutics, and diagnostics, as well as advance our understanding of fundamental immunological principles. We outline key challenges for the field, highlighting the need for software benchmarking, targeted large-scale data generation, and coordinated research efforts.
Collapse
Affiliation(s)
| | - Chakravarthi Kanduri
- Department of Informatics, University of Oslo, Oslo, Norway; UiO:RealArt Convergence Environment, University of Oslo, Oslo, Norway
| | | | | | - Rebecca A Brachman
- Imprint Labs, LLC, New York, NY, USA; Cornell Tech, Cornell University, New York, NY, USA
| | | | - Ingrid H Haff
- Department of Mathematics, University of Oslo, 0371 Oslo, Norway
| | - Geir K Sandve
- Department of Informatics, University of Oslo, Oslo, Norway; UiO:RealArt Convergence Environment, University of Oslo, Oslo, Norway
| | - Victor Greiff
- Imprint Labs, LLC, New York, NY, USA; Department of Immunology, University of Oslo and Oslo University Hospital, Oslo, Norway.
| |
Collapse
|
3
|
Mason DM, Reddy ST. Predicting adaptive immune receptor specificities by machine learning is a data generation problem. Cell Syst 2024; 15:1190-1197. [PMID: 39701035 DOI: 10.1016/j.cels.2024.11.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2024] [Revised: 06/14/2024] [Accepted: 11/14/2024] [Indexed: 12/21/2024]
Abstract
Determining the specificity of adaptive immune receptors-B cell receptors (BCRs), their secreted form antibodies, and T cell receptors (TCRs)-is critical for understanding immune responses and advancing immunotherapy and drug discovery. Immune receptors exhibit extensive diversity in their variable domains, enabling them to interact with a plethora of antigens. Despite the significant progress made by AI tools such as AlphaFold in predicting protein structures, challenges remain in accurately modeling the structure and specificity of immune receptors, primarily due to the limited availability of high-quality crystal structures and the complexity of immune receptor-antigen interactions. In this perspective, we highlight recent advancements in sequence-based and structure-based data generation for immune receptors, which are crucial for training machine learning models that predict receptor specificity. We discuss the current bottlenecks and potential future directions in generating and utilizing high-dimensional datasets for predicting and designing the specificity of antibodies and TCRs.
Collapse
Affiliation(s)
- Derek M Mason
- Botnar Institute of Immune Engineering, 4056 Basel, Switzerland
| | - Sai T Reddy
- Botnar Institute of Immune Engineering, 4056 Basel, Switzerland; Department of Biosystems Science and Engineering, ETH Zurich, 4056 Basel, Switzerland.
| |
Collapse
|
4
|
McMaster B, Thorpe CJ, Rossjohn J, Deane CM, Koohy H. Quantifying conformational changes in the TCR:pMHC-I binding interface. Front Immunol 2024; 15:1491656. [PMID: 39687625 PMCID: PMC11646856 DOI: 10.3389/fimmu.2024.1491656] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2024] [Accepted: 10/24/2024] [Indexed: 12/18/2024] Open
Abstract
Background T cells form one of the key pillars of adaptive immunity. Using their surface bound T cell antigen receptors (TCRs), these cells screen millions of antigens presented by major histocompatibility complex (MHC) or MHC-like molecules. In other protein families, the dynamics of protein-protein interactions have important implications for protein function. Case studies of TCR:class I peptide-MHCs (pMHC-Is) structures have reported mixed results on whether the binding interfaces undergo conformational change during engagement and no robust statistical quantification has been done to generalise these results. Thus, it remains an open question of whether movement occurs in the binding interface that enables the recognition and activation of T cells. Methods In this work, we quantify the conformational changes in the TCR:pMHC-I binding interface by creating a dataset of 391 structures, comprising 22 TCRs, 19 MHC alleles, and 79 peptide structures in both unbound (apo) and bound (holo) conformations. Results In support of some case studies, we demonstrate that all complementarity determining region (CDR) loops move to a certain extent but only CDR3α and CDR3β loops modify their shape when binding pMHC-Is. We also map the contacts between TCRs and pMHC-Is, generating a novel fingerprint of TCRs on MHC molecules and show that the CDR3α tends to bind the N-terminus of the peptide and the CDR3β tends to bind the C-terminus of the peptide. Finally, we show that the presented peptides can undergo conformational changes when engaged by TCRs, as has been reported in past literature, but novelly show these changes depend on how the peptides are anchored in the MHC binding groove. Conclusions Our work has implications in understanding the behaviour of TCR:pMHC-I interactions and providing insights that can be used for modelling Tcell antigen specificity, an ongoing grand challenge in immunology.
Collapse
Affiliation(s)
- Benjamin McMaster
- Koohy Lab, Medical Research Council Translational Immune Discovery Unit (MRC TIDU), Weatherall Institute of Molecular Medicine (WIMM), Radcliffe Department of Medicine, University of Oxford, Oxford, United Kingdom
- Oxford Protein Informatics Group, Department of Statistics, University of Oxford, Oxford, United Kingdom
| | - Christopher J. Thorpe
- Open Targets, Wellcome Genome Campus, Hinxton, United Kingdom
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, United Kingdom
| | - Jamie Rossjohn
- Rossjohn Lab, Infection and Immunity Program and Department of Biochemistry and Molecular Biology, Biomedicine Discovery Institute, Monash University, Melbourne, VIC, Australia
- Rossjohn Lab, Institute of Infection and Immunity, School of Medicine, Cardiff University, Cardiff, United Kingdom
| | - Charlotte M. Deane
- Oxford Protein Informatics Group, Department of Statistics, University of Oxford, Oxford, United Kingdom
| | - Hashem Koohy
- Koohy Lab, Medical Research Council Translational Immune Discovery Unit (MRC TIDU), Weatherall Institute of Molecular Medicine (WIMM), Radcliffe Department of Medicine, University of Oxford, Oxford, United Kingdom
| |
Collapse
|
5
|
Song K, Xu H, Shi Y, Zou X, Da LT, Hao J. Investigating TCR-pMHC interactions for TCRs without identified epitopes by constructing a computational pipeline. Int J Biol Macromol 2024; 282:136502. [PMID: 39423970 DOI: 10.1016/j.ijbiomac.2024.136502] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2024] [Revised: 10/04/2024] [Accepted: 10/09/2024] [Indexed: 10/21/2024]
Abstract
The molecular mechanisms underlying epitope recognition by T cell receptors (TCRs) are critical for activating T cell immune responses and rationally designing TCR-based therapeutics. Single-cell sequencing techniques vastly boost the accumulation of TCR sequences, while the limitation of available TCR-pMHC structures hampers further investigations. In this study, we proposed a computational pipeline that incorporates structural information and single-cell sequencing data to investigate the epitope-recognition mechanisms for TCRs without identified epitopes. By antigen specificity clustering, we mapped the epitope sequences between epitope-known and epitope-unknown TCRs from COVID-19 patients. One reported SARS-CoV-2 epitope, NQKLIANQF (S919-927), was identified for a TCR expressed by 614 T cells (TCR-614). Epitope screening also identified a potential cross-reactive epitope, KLKTLVATA (NSP31790-1798), for a TCR expressed by 204 T cells (TCR-204). By molecular dynamics (MD) simulations, we revealed the detailed epitope-recognition mechanisms for both TCRs. The structural motifs responsible for epitope recognition revealed by the MD simulations are consistent with the sequential features recognized by the sequence-based clustering method. We hope that this strategy could facilitate the discovery and optimization of TCR-based therapeutics.
Collapse
Affiliation(s)
- Kaiyuan Song
- Key Laboratory of Systems Biomedicine (Ministry of Education), Shanghai Center for Systems Biomedicine, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Honglin Xu
- School of Pharmacy, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Yi Shi
- Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders, Shanghai Jiao Tong University, 1954 Huashan Road, Shanghai 200030, China; Shanghai Key Laboratory of Psychotic Disorders, Brain Science and Technology Research Center, Shanghai Jiao Tong University, 1954 Huashan Road, Shanghai 200030, China
| | - Xin Zou
- Digital Diagnosis and Treatment Innovation Center for Cancer, Institute of Translational Medicine, Shanghai Jiao Tong University, Shanghai 200240, China; Ninth People's Hospital, Shanghai Key Laboratory of Stomatology and Shanghai Research Institute of Stomatology, National Clinical Research Center of Stomatology, Shanghai Jiao Tong University, School of Medicine, Shanghai 200011, China.
| | - Lin-Tai Da
- Key Laboratory of Systems Biomedicine (Ministry of Education), Shanghai Center for Systems Biomedicine, Shanghai Jiao Tong University, Shanghai 200240, China.
| | - Jie Hao
- Institute of Clinical Science, Zhongshan Hospital, Fudan University, Shanghai 200032, China.
| |
Collapse
|
6
|
Rollins ZA, Curtis MB, George SC, Faller R. A Computational Strategy for the Rapid Identification and Ranking of Patient-Specific T Cell Receptors Bound to Neoantigens. Macromol Rapid Commun 2024; 45:e2400225. [PMID: 38839076 PMCID: PMC11661661 DOI: 10.1002/marc.202400225] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2024] [Revised: 06/02/2024] [Indexed: 06/07/2024]
Abstract
T cell receptor (TCR) recognition of a peptide-major histocompatibility complex (pMHC) is crucial for adaptive immune response. The identification of therapeutically relevant TCR-pMHC protein pairs is a bottleneck in the implementation of TCR-based immunotherapies. The ability to computationally design TCRs to target a specific pMHC requires automated integration of next-generation sequencing, protein-protein structure prediction, molecular dynamics, and TCR ranking. A pipeline to evaluate patient-specific, sequence-based TCRs to a target pMHC is presented. Using the three most frequently expressed TCRs from 16 colorectal cancer patients, the protein-protein structure of the TCRs to the target CEA peptide-MHC is predicted using Modeller and ColabFold. TCR-pMHC structures are compared using automated equilibration and successive analysis. ColabFold generated configurations require an ≈2.5× reduction in equilibration time of TCR-pMHC structures compared to Modeller. The structural differences between Modeller and ColabFold are demonstrated by root mean square deviation (≈0.20 nm) between clusters of equilibrated configurations, which impact the number of hydrogen bonds and Lennard-Jones contacts between the TCR and pMHC. TCR ranking criteria that may prioritize TCRs for evaluation of in vitro immunogenicity are identified, and this ranking is validated by comparing to state-of-the-art machine learning-based methods trained to predict the probability of TCR-pMHC binding.
Collapse
Affiliation(s)
- Zachary A. Rollins
- Department of Chemical EngineeringUniversity of CaliforniaDavis, 1 Shields Ave, Bainer HallDavisCA95616USA
| | - Matthew B. Curtis
- Department of Biomedical EngineeringUniversity of CaliforniaDavis, 451 E. Health Sciences Dr., GBSF 2303DavisCA95616USA
| | - Steven C. George
- Department of Biomedical EngineeringUniversity of CaliforniaDavis, 451 E. Health Sciences Dr., GBSF 2303DavisCA95616USA
| | - Roland Faller
- Department of Chemical EngineeringUniversity of CaliforniaDavis, 1 Shields Ave, Bainer HallDavisCA95616USA
- Department of Chemical EngineeringTexas Tech UniversityLubbockTX79409USA
| |
Collapse
|
7
|
Postovskaya A, Vercauteren K, Meysman P, Laukens K. tcrBLOSUM: an amino acid substitution matrix for sensitive alignment of distant epitope-specific TCRs. Brief Bioinform 2024; 26:bbae602. [PMID: 39576224 PMCID: PMC11583439 DOI: 10.1093/bib/bbae602] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2024] [Revised: 10/07/2024] [Accepted: 11/05/2024] [Indexed: 11/24/2024] Open
Abstract
Deciphering the specificity of T-cell receptor (TCR) repertoires is crucial for monitoring adaptive immune responses and developing targeted immunotherapies and vaccines. To elucidate the specificity of previously unseen TCRs, many methods employ the BLOSUM62 matrix to find TCRs with similar amino acid (AA) sequences. However, while BLOSUM62 reflects the AA substitutions within conserved regions of proteins with similar functions, the remarkable diversity of TCRs means that both TCRs with similar and dissimilar sequences can bind the same epitope. Therefore, reliance on BLOSUM62 may bias detection towards epitope-specific TCRs with similar biochemical properties, overlooking those with more diverse AA compositions. In this study, we introduce tcrBLOSUMa and tcrBLOSUMb, specialized AA substitution matrices for CDR3 alpha and CDR3 beta TCR chains, respectively. The matrices reflect AA frequencies and variations occurring within TCRs that bind the same epitope, revealing that both CDR3 alpha and CDR3 beta display tolerance to a wide range of AA substitutions and differ noticeably from the standard BLOSUM62. By accurately aligning distant TCRs employing tcrBLOSUMb, we were able to improve clustering performance and capture a large number of epitope-specific TCRs with diverse AA compositions and physicochemical profiles overlooked by BLOSUM62. Utilizing both the general BLOSUM62 and specialized tcrBLOSUM matrices in existing computational tools will broaden the range of TCRs that can be associated with their cognate epitopes, thereby enhancing TCR repertoire analysis.
Collapse
MESH Headings
- Receptors, Antigen, T-Cell/immunology
- Receptors, Antigen, T-Cell/genetics
- Receptors, Antigen, T-Cell/chemistry
- Amino Acid Substitution
- Humans
- Amino Acid Sequence
- Epitopes, T-Lymphocyte/immunology
- Epitopes, T-Lymphocyte/chemistry
- Sequence Alignment
- Complementarity Determining Regions/genetics
- Complementarity Determining Regions/immunology
- Complementarity Determining Regions/chemistry
- Computational Biology/methods
- Epitopes/immunology
- Epitopes/chemistry
- Algorithms
- Receptors, Antigen, T-Cell, alpha-beta/genetics
- Receptors, Antigen, T-Cell, alpha-beta/immunology
- Receptors, Antigen, T-Cell, alpha-beta/chemistry
Collapse
Affiliation(s)
- Anna Postovskaya
- Adrem Data Lab, Department of Computer Science, University of Antwerp, Antwerp, Belgium
- Antwerp Unit for Data Analysis and Computation in Immunology and Sequencing (AUDACIS), University of Antwerp, Antwerp, Belgium
- Clinical Virology Unit, Department of Clinical Sciences, Institute of Tropical Medicine, Antwerp, Belgium
| | - Koen Vercauteren
- Clinical Virology Unit, Department of Clinical Sciences, Institute of Tropical Medicine, Antwerp, Belgium
| | - Pieter Meysman
- Adrem Data Lab, Department of Computer Science, University of Antwerp, Antwerp, Belgium
- Antwerp Unit for Data Analysis and Computation in Immunology and Sequencing (AUDACIS), University of Antwerp, Antwerp, Belgium
| | - Kris Laukens
- Adrem Data Lab, Department of Computer Science, University of Antwerp, Antwerp, Belgium
- Antwerp Unit for Data Analysis and Computation in Immunology and Sequencing (AUDACIS), University of Antwerp, Antwerp, Belgium
- Biomedical Informatics Research Network Antwerp (BIOMINA), University of Antwerp, Antwerp, Belgium
| |
Collapse
|
8
|
Lin V, Cheung M, Gowthaman R, Eisenberg M, Baker BM, Pierce BG. TCR3d 2.0: expanding the T cell receptor structure database with new structures, tools and interactions. Nucleic Acids Res 2024:gkae840. [PMID: 39329260 DOI: 10.1093/nar/gkae840] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2024] [Revised: 09/07/2024] [Accepted: 09/13/2024] [Indexed: 09/28/2024] Open
Abstract
Recognition of antigens by T cell receptors (TCRs) is a key component of adaptive immunity. Understanding the structures of these TCR interactions provides major insights into immune protection and diseases, and enables design of therapeutics, vaccines and predictive modeling algorithms. Previously, we released TCR3d, a database and resource for structures of TCRs and their recognition. Due to the growth of available structures and categories of complexes, the content of TCR3d has expanded substantially in the past 5 years. This expansion includes new tables dedicated to TCR mimic antibody complex structures, TCR-CD3 complexes and annotated Class I and II peptide-MHC complexes. Additionally, tools are available for users to calculate docking geometries for input TCR and TCR mimic complex structures. The core tables of TCR-peptide-MHC complexes have grown by 50%, and include binding affinity data for experimentally determined structures. These major content and feature updates enhance TCR3d as a resource for immunology, therapeutics and structural biology research, and enable advanced approaches for predictive TCR modeling and design. TCR3d is available at: https://tcr3d.ibbr.umd.edu.
Collapse
Affiliation(s)
- Valerie Lin
- Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, MD 20742, USA
- University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, MD 20850, USA
| | - Melyssa Cheung
- University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, MD 20850, USA
- Department of Chemistry and Biochemistry, University of Maryland, College Park, MD 20742, USA
| | - Ragul Gowthaman
- Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, MD 20742, USA
- University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, MD 20850, USA
| | - Maya Eisenberg
- Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, MD 20742, USA
- University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, MD 20850, USA
| | - Brian M Baker
- Department of Chemistry and Biochemistry, University of Notre Dame, Notre Dame, IN 46556, USA
- Harper Cancer Research Institute, University of Notre Dame, Notre Dame, IN 46556, USA
| | - Brian G Pierce
- Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, MD 20742, USA
- University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, MD 20850, USA
- University of Maryland Marlene and Stewart Greenebaum Comprehensive Cancer Center, Baltimore, MD 21201, USA
| |
Collapse
|
9
|
Raybould MIJ, Greenshields-Watson A, Agarwal P, Aguilar-Sanjuan B, Olsen TH, Turnbull OM, Quast NP, Deane CM. The Observed T Cell Receptor Space database enables paired-chain repertoire mining, coherence analysis, and language modeling. Cell Rep 2024; 43:114704. [PMID: 39216000 DOI: 10.1016/j.celrep.2024.114704] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2024] [Revised: 08/05/2024] [Accepted: 08/15/2024] [Indexed: 09/04/2024] Open
Abstract
T cell activation is governed through T cell receptors (TCRs), heterodimers of two sequence-variable chains (often an α and β chain) that synergistically recognize antigen fragments presented on cell surfaces. Despite this, there only exist repositories dedicated to collecting single-chain, not paired-chain, TCR sequence data. We addressed this gap by creating the Observed TCR Space (OTS) database, a source of consistently processed and annotated, full-length, paired-chain TCR sequences. Currently, OTS contains 5.35 million redundant (1.63 million non-redundant), predominantly human sequences from across 50 studies and at least 75 individuals. Using OTS, we identify pairing biases, public TCRs, and distinct chain coherence patterns relative to antibodies. We also release a paired-chain TCR language model, providing paired embedding representations and a method for residue in-filling conditional on the partner chain. OTS will be updated as a central community resource and is freely downloadable and available as a web application.
Collapse
Affiliation(s)
- Matthew I J Raybould
- Oxford Protein Informatics Group, Department of Statistics, University of Oxford, 24-29 St Giles', OX1 3LB Oxford, UK.
| | - Alexander Greenshields-Watson
- Oxford Protein Informatics Group, Department of Statistics, University of Oxford, 24-29 St Giles', OX1 3LB Oxford, UK
| | - Parth Agarwal
- Oxford Protein Informatics Group, Department of Statistics, University of Oxford, 24-29 St Giles', OX1 3LB Oxford, UK
| | - Broncio Aguilar-Sanjuan
- Oxford Protein Informatics Group, Department of Statistics, University of Oxford, 24-29 St Giles', OX1 3LB Oxford, UK
| | - Tobias H Olsen
- Oxford Protein Informatics Group, Department of Statistics, University of Oxford, 24-29 St Giles', OX1 3LB Oxford, UK
| | - Oliver M Turnbull
- Oxford Protein Informatics Group, Department of Statistics, University of Oxford, 24-29 St Giles', OX1 3LB Oxford, UK
| | - Nele P Quast
- Oxford Protein Informatics Group, Department of Statistics, University of Oxford, 24-29 St Giles', OX1 3LB Oxford, UK
| | - Charlotte M Deane
- Oxford Protein Informatics Group, Department of Statistics, University of Oxford, 24-29 St Giles', OX1 3LB Oxford, UK.
| |
Collapse
|
10
|
Kim HY, Kim S, Park WY, Kim D. TSpred: a robust prediction framework for TCR-epitope interactions using paired chain TCR sequence data. Bioinformatics 2024; 40:btae472. [PMID: 39052940 PMCID: PMC11297499 DOI: 10.1093/bioinformatics/btae472] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2023] [Revised: 06/11/2024] [Accepted: 07/25/2024] [Indexed: 07/27/2024] Open
Abstract
MOTIVATION Prediction of T-cell receptor (TCR)-epitope interactions is important for many applications in biomedical research, such as cancer immunotherapy and vaccine design. The prediction of TCR-epitope interactions remains challenging especially for novel epitopes, due to the scarcity of available data. RESULTS We propose TSpred, a new deep learning approach for the pan-specific prediction of TCR binding specificity based on paired chain TCR data. We develop a robust model that generalizes well to unseen epitopes by combining the predictive power of CNN and the attention mechanism. In particular, we design a reciprocal attention mechanism which focuses on extracting the patterns underlying TCR-epitope interactions. Upon a comprehensive evaluation of our model, we find that TSpred achieves state-of-the-art performances in both seen and unseen epitope specificity prediction tasks. Also, compared to other predictors, TSpred is more robust to bias related to peptide imbalance in the dataset. In addition, the reciprocal attention component of our model allows for model interpretability by capturing structurally important binding regions. Results indicate that TSpred is a robust and reliable method for the task of TCR-epitope binding prediction. AVAILABILITY AND IMPLEMENTATION Source code is available at https://github.com/ha01994/TSpred.
Collapse
Affiliation(s)
- Ha Young Kim
- Department of Bio and Brain Engineering, Korea Advanced Institute of Science and Technology, Daejeon 34141, South Korea
| | | | - Woong-Yang Park
- GENINUS Inc., Seoul 05836, South Korea
- Samsung Genome Institute, Samsung Medical Center, Seoul 06351, South Korea
- Department of Molecular Cell Biology, Sungkyunkwan University School of Medicine, Suwon 16419, South Korea
| | - Dongsup Kim
- Department of Bio and Brain Engineering, Korea Advanced Institute of Science and Technology, Daejeon 34141, South Korea
| |
Collapse
|
11
|
Ali S, Chourasia P, Patterson M. From PDB files to protein features: a comparative analysis of PDB bind and STCRDAB datasets. Med Biol Eng Comput 2024; 62:2449-2483. [PMID: 38622438 DOI: 10.1007/s11517-024-03074-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2023] [Accepted: 03/13/2024] [Indexed: 04/17/2024]
Abstract
Understanding protein structures is crucial for various bioinformatics research, including drug discovery, disease diagnosis, and evolutionary studies. Protein structure classification is a critical aspect of structural biology, where supervised machine learning algorithms classify structures based on data from databases such as Protein Data Bank (PDB). However, the challenge lies in designing numerical embeddings for protein structures without losing essential information. Although some effort has been made in the literature, researchers have not effectively and rigorously combined the structural and sequence-based features for efficient protein classification to the best of our knowledge. To this end, we propose numerical embeddings that extract relevant features for protein sequences fetched from PDB structures from popular datasets such as PDB Bind and STCRDAB. The features are physicochemical properties such as aromaticity, instability index, flexibility, Grand Average of Hydropathy (GRAVY), isoelectric point, charge at pH, secondary structure fracture, molar extinction coefficient, and molecular weight. We also incorporate scaling features for the sliding windows (e.g., k-mers), which include Kyte and Doolittle (KD) hydropathy scale, Eisenberg hydrophobicity scale, Hydrophilicity scale, Flexibility of the amino acids, and Hydropathy scale. Multiple-feature selection aims to improve the accuracy of protein classification models. The results showed that the selected features significantly improved the predictive performance of existing embeddings.
Collapse
Affiliation(s)
- Sarwan Ali
- Georgia State University, Atlanta, GA, USA.
| | | | | |
Collapse
|
12
|
Ananya, Panchariya DC, Karthic A, Singh SP, Mani A, Chawade A, Kushwaha S. Vaccine design and development: Exploring the interface with computational biology and AI. Int Rev Immunol 2024; 43:361-380. [PMID: 38982912 DOI: 10.1080/08830185.2024.2374546] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2024] [Revised: 04/29/2024] [Accepted: 06/26/2024] [Indexed: 07/11/2024]
Abstract
Computational biology involves applying computer science and informatics techniques in biology to understand complex biological data. It allows us to collect, connect, and analyze biological data at a large scale and build predictive models. In the twenty first century, computational resources along with Artificial Intelligence (AI) have been widely used in various fields of biological sciences such as biochemistry, structural biology, immunology, microbiology, and genomics to handle massive data for decision-making, including in applications such as drug design and vaccine development, one of the major areas of focus for human and animal welfare. The knowledge of available computational resources and AI-enabled tools in vaccine design and development can improve our ability to conduct cutting-edge research. Therefore, this review article aims to summarize important computational resources and AI-based tools. Further, the article discusses the various applications and limitations of AI tools in vaccine development.
Collapse
Affiliation(s)
- Ananya
- National Institute of Animal Biotechnology, Hyderabad, India
| | | | | | | | - Ashutosh Mani
- Motilal Nehru National Institute of Technology, Prayagraj, India
| | - Aakash Chawade
- Swedish University of Agricultural Sciences, Alnarp, Sweden
| | | |
Collapse
|
13
|
McMaster B, Thorpe C, Ogg G, Deane CM, Koohy H. Can AlphaFold's breakthrough in protein structure help decode the fundamental principles of adaptive cellular immunity? Nat Methods 2024; 21:766-776. [PMID: 38654083 DOI: 10.1038/s41592-024-02240-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2023] [Accepted: 03/08/2024] [Indexed: 04/25/2024]
Abstract
T cells are essential immune cells responsible for identifying and eliminating pathogens. Through interactions between their T-cell antigen receptors (TCRs) and antigens presented by major histocompatibility complex molecules (MHCs) or MHC-like molecules, T cells discriminate foreign and self peptides. Determining the fundamental principles that govern these interactions has important implications in numerous medical contexts. However, reconstructing a map between T cells and their antagonist antigens remains an open challenge for the field of immunology, and success of in silico reconstructions of this relationship has remained incremental. In this Perspective, we discuss the role that new state-of-the-art deep-learning models for predicting protein structure may play in resolving some of the unanswered questions the field faces linking TCR and peptide-MHC properties to T-cell specificity. We provide a comprehensive overview of structural databases and the evolution of predictive models, and highlight the breakthrough AlphaFold provided the field.
Collapse
Affiliation(s)
- Benjamin McMaster
- MRC Translational Immune Discovery Unit, MRC Weatherall Institute of Molecular Medicine, Radcliffe Department of Medicine, University of Oxford, Oxford, UK
- Department of Statistics, University of Oxford, Oxford, UK
| | - Christopher Thorpe
- Open Targets, Wellcome Genome Campus, Hinxton, UK
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, UK
| | - Graham Ogg
- MRC Translational Immune Discovery Unit, MRC Weatherall Institute of Molecular Medicine, Radcliffe Department of Medicine, University of Oxford, Oxford, UK
- Chinese Academy of Medical Sciences Oxford Institute, University of Oxford, Oxford, UK
| | | | - Hashem Koohy
- MRC Translational Immune Discovery Unit, MRC Weatherall Institute of Molecular Medicine, Radcliffe Department of Medicine, University of Oxford, Oxford, UK.
- Alan Turning Fellow in Health and Medicine, University of Oxford, Oxford, UK.
| |
Collapse
|
14
|
Greenshields-Watson A, Abanades B, Deane CM. Investigating the ability of deep learning-based structure prediction to extrapolate and/or enrich the set of antibody CDR canonical forms. Front Immunol 2024; 15:1352703. [PMID: 38482007 PMCID: PMC10933040 DOI: 10.3389/fimmu.2024.1352703] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2023] [Accepted: 01/30/2024] [Indexed: 04/13/2024] Open
Abstract
Deep learning models have been shown to accurately predict protein structure from sequence, allowing researchers to explore protein space from the structural viewpoint. In this paper we explore whether "novel" features, such as distinct loop conformations can arise from these predictions despite not being present in the training data. Here we have used ABodyBuilder2, a deep learning antibody structure predictor, to predict the structures of ~1.5M paired antibody sequences. We examined the predicted structures of the canonical CDR loops and found that most of these predictions fall into the already described CDR canonical form structural space. We also found a small number of "new" canonical clusters composed of heterogeneous sequences united by a common sequence motif and loop conformation. Analysis of these novel clusters showed their origins to be either shapes seen in the training data at very low frequency or shapes seen at high frequency but at a shorter sequence length. To evaluate explicitly the ability of ABodyBuilder2 to extrapolate, we retrained several models whilst withholding all antibody structures of a specific CDR loop length or canonical form. These "starved" models showed evidence of generalisation across CDRs of different lengths, but they did not extrapolate to loop conformations which were highly distinct from those present in the training data. However, the models were able to accurately predict a canonical form even if only a very small number of examples of that shape were in the training data. Our results suggest that deep learning protein structure prediction methods are unable to make completely out-of-domain predictions for CDR loops. However, in our analysis we also found that even minimal amounts of data of a structural shape allow the method to recover its original predictive abilities. We have made the ~1.5 M predicted structures used in this study available to download at https://doi.org/10.5281/zenodo.10280181.
Collapse
|
15
|
Bravi B. Development and use of machine learning algorithms in vaccine target selection. NPJ Vaccines 2024; 9:15. [PMID: 38242890 PMCID: PMC10798987 DOI: 10.1038/s41541-023-00795-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2023] [Accepted: 12/07/2023] [Indexed: 01/21/2024] Open
Abstract
Computer-aided discovery of vaccine targets has become a cornerstone of rational vaccine design. In this article, I discuss how Machine Learning (ML) can inform and guide key computational steps in rational vaccine design concerned with the identification of B and T cell epitopes and correlates of protection. I provide examples of ML models, as well as types of data and predictions for which they are built. I argue that interpretable ML has the potential to improve the identification of immunogens also as a tool for scientific discovery, by helping elucidate the molecular processes underlying vaccine-induced immune responses. I outline the limitations and challenges in terms of data availability and method development that need to be addressed to bridge the gap between advances in ML predictions and their translational application to vaccine design.
Collapse
Affiliation(s)
- Barbara Bravi
- Department of Mathematics, Imperial College London, London, SW7 2AZ, UK.
| |
Collapse
|
16
|
Chen J, Zhao B, Lin S, Sun H, Mao X, Wang M, Chu Y, Hong L, Wei D, Li M, Xiong Y. TEPCAM: Prediction of T-cell receptor-epitope binding specificity via interpretable deep learning. Protein Sci 2024; 33:e4841. [PMID: 37983648 PMCID: PMC10731497 DOI: 10.1002/pro.4841] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Revised: 10/11/2023] [Accepted: 11/16/2023] [Indexed: 11/22/2023]
Abstract
The recognition of T-cell receptor (TCR) on the surface of T cell to specific epitope presented by the major histocompatibility complex is the key to trigger the immune response. Identifying the binding rules of TCR-epitope pair is crucial for developing immunotherapies, including neoantigen vaccine and drugs. Accurate prediction of TCR-epitope binding specificity via deep learning remains challenging, especially in test cases which are unseen in the training set. Here, we propose TEPCAM (TCR-EPitope identification based on Cross-Attention and Multi-channel convolution), a deep learning model that incorporates self-attention, cross-attention mechanism, and multi-channel convolution to improve the generalizability and enhance the model interpretability. Experimental results demonstrate that our model outperformed several state-of-the-art models on two challenging tasks including a strictly split dataset and an external dataset. Furthermore, the model can learn some interaction patterns between TCR and epitope by extracting the interpretable matrix from cross-attention layer and mapping them to the three-dimensional structures. The source code and data are freely available at https://github.com/Chenjw99/TEPCAM.
Collapse
Affiliation(s)
- Junwei Chen
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and BiotechnologyShanghai Jiao Tong UniversityShanghaiChina
| | - Bowen Zhao
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and BiotechnologyShanghai Jiao Tong UniversityShanghaiChina
| | - Shenggeng Lin
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and BiotechnologyShanghai Jiao Tong UniversityShanghaiChina
| | - Heqi Sun
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and BiotechnologyShanghai Jiao Tong UniversityShanghaiChina
| | - Xueying Mao
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and BiotechnologyShanghai Jiao Tong UniversityShanghaiChina
| | - Meng Wang
- Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and EngineeringCentral South UniversityChangshaChina
| | - Yanyi Chu
- Department of PathologyStanford University School of MedicineStandfordCaliforniaUSA
| | - Liang Hong
- Institute of Natural Sciences, Shanghai Jiao Tong UniversityShanghaiChina
- Artificial Intelligence Biomedical Center, Zhangjiang Institute for Advanced Study, Shanghai Jiao Tong UniversityShanghaiChina
| | - Dong‐Qing Wei
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and BiotechnologyShanghai Jiao Tong UniversityShanghaiChina
| | - Min Li
- Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and EngineeringCentral South UniversityChangshaChina
| | - Yi Xiong
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and BiotechnologyShanghai Jiao Tong UniversityShanghaiChina
- Artificial Intelligence Biomedical Center, Zhangjiang Institute for Advanced Study, Shanghai Jiao Tong UniversityShanghaiChina
| |
Collapse
|
17
|
Ali S, Chourasia P, Patterson M. When Protein Structure Embedding Meets Large Language Models. Genes (Basel) 2023; 15:25. [PMID: 38254915 PMCID: PMC10815811 DOI: 10.3390/genes15010025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2023] [Revised: 12/16/2023] [Accepted: 12/21/2023] [Indexed: 01/24/2024] Open
Abstract
Protein structure analysis is essential in various bioinformatics domains such as drug discovery, disease diagnosis, and evolutionary studies. Within structural biology, the classification of protein structures is pivotal, employing machine learning algorithms to categorize structures based on data from databases like the Protein Data Bank (PDB). To predict protein functions, embeddings based on protein sequences have been employed. Creating numerical embeddings that preserve vital information while considering protein structure and sequence presents several challenges. The existing literature lacks a comprehensive and effective approach that combines structural and sequence-based features to achieve efficient protein classification. While large language models (LLMs) have exhibited promising outcomes for protein function prediction, their focus primarily lies on protein sequences, disregarding the 3D structures of proteins. The quality of embeddings heavily relies on how well the geometry of the embedding space aligns with the underlying data structure, posing a critical research question. Traditionally, Euclidean space has served as a widely utilized framework for embeddings. In this study, we propose a novel method for designing numerical embeddings in Euclidean space for proteins by leveraging 3D structure information, specifically employing the concept of contact maps. These embeddings are synergistically combined with features extracted from LLMs and traditional feature engineering techniques to enhance the performance of embeddings in supervised protein analysis. Experimental results on benchmark datasets, including PDB Bind and STCRDAB, demonstrate the superior performance of the proposed method for protein function prediction.
Collapse
Affiliation(s)
| | | | - Murray Patterson
- Department of Computer Science, Georgia State University, Atlanta, GA 30303, USA; (S.A.); (P.C.)
| |
Collapse
|
18
|
Klebanoff CA, Chandran SS, Baker BM, Quezada SA, Ribas A. T cell receptor therapeutics: immunological targeting of the intracellular cancer proteome. Nat Rev Drug Discov 2023; 22:996-1017. [PMID: 37891435 PMCID: PMC10947610 DOI: 10.1038/s41573-023-00809-z] [Citation(s) in RCA: 30] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/08/2023] [Indexed: 10/29/2023]
Abstract
The T cell receptor (TCR) complex is a naturally occurring antigen sensor that detects, amplifies and coordinates cellular immune responses to epitopes derived from cell surface and intracellular proteins. Thus, TCRs enable the targeting of proteins selectively expressed by cancer cells, including neoantigens, cancer germline antigens and viral oncoproteins. As such, TCRs have provided the basis for an emerging class of oncology therapeutics. Herein, we review the current cancer treatment landscape using TCRs and TCR-like molecules. This includes adoptive cell transfer of T cells expressing endogenous or engineered TCRs, TCR bispecific engagers and antibodies specific for human leukocyte antigen (HLA)-bound peptides (TCR mimics). We discuss the unique complexities associated with the clinical development of these therapeutics, such as HLA restriction, TCR retrieval, potency assessment and the potential for cross-reactivity. In addition, we highlight emerging clinical data that establish the antitumour potential of TCR-based therapies, including tumour-infiltrating lymphocytes, for the treatment of diverse human malignancies. Finally, we explore the future of TCR therapeutics, including emerging genome editing methods to safely enhance potency and strategies to streamline patient identification.
Collapse
Affiliation(s)
- Christopher A Klebanoff
- Memorial Sloan Kettering Cancer Center (MSKCC), Human Oncology and Pathogenesis Program, New York, NY, USA.
| | - Smita S Chandran
- Memorial Sloan Kettering Cancer Center (MSKCC), Human Oncology and Pathogenesis Program, New York, NY, USA
- Parker Institute for Cancer Immunotherapy, New York, NY, USA
- Weill Cornell Medical College, Cornell University, New York, NY, USA
| | - Brian M Baker
- Department of Chemistry and Biochemistry, University of Notre Dame, Notre Dame, ID, USA
- The Harper Cancer Research Institute, University of Notre Dame, Notre Dame, ID, USA
| | - Sergio A Quezada
- Cancer Immunology Unit, Research Department of Haematology, University College London Cancer Institute, London, UK
- Cancer Research UK Lung Cancer Centre of Excellence, University College London Cancer Institute, London, UK
- Achilles Therapeutics, London, UK
| | - Antoni Ribas
- Jonsson Comprehensive Cancer Center at the University of California, Los Angeles (UCLA), Los Angeles, CA, USA
| |
Collapse
|
19
|
Zhao Y, He B, Xu F, Li C, Xu Z, Su X, He H, Huang Y, Rossjohn J, Song J, Yao J. DeepAIR: A deep learning framework for effective integration of sequence and 3D structure to enable adaptive immune receptor analysis. SCIENCE ADVANCES 2023; 9:eabo5128. [PMID: 37556545 PMCID: PMC10411891 DOI: 10.1126/sciadv.abo5128] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/16/2022] [Accepted: 07/06/2023] [Indexed: 08/11/2023]
Abstract
Structural docking between the adaptive immune receptors (AIRs), including T cell receptors (TCRs) and B cell receptors (BCRs), and their cognate antigens are one of the most fundamental processes in adaptive immunity. However, current methods for predicting AIR-antigen binding largely rely on sequence-derived features of AIRs, omitting the structure features that are essential for binding affinity. In this study, we present a deep learning framework, termed DeepAIR, for the accurate prediction of AIR-antigen binding by integrating both sequence and structure features of AIRs. DeepAIR achieves a Pearson's correlation of 0.813 in predicting the binding affinity of TCR, and a median area under the receiver-operating characteristic curve (AUC) of 0.904 and 0.942 in predicting the binding reactivity of TCR and BCR, respectively. Meanwhile, using TCR and BCR repertoire, DeepAIR correctly identifies every patient with nasopharyngeal carcinoma and inflammatory bowel disease in test data. Thus, DeepAIR improves the AIR-antigen binding prediction that facilitates the study of adaptive immunity.
Collapse
Affiliation(s)
- Yu Zhao
- AI Lab, Tencent, Shenzhen, China
| | - Bing He
- AI Lab, Tencent, Shenzhen, China
| | - Fan Xu
- AI Lab, Tencent, Shenzhen, China
| | - Chen Li
- Biomedicine Discovery Institute and Monash Data Futures Institute, Monash University, Melbourne, VIC 3800, Australia
| | | | | | | | | | - Jamie Rossjohn
- Infection and Immunity Program and Department of Biochemistry and Molecular Biology, Biomedicine Discovery Institute, Monash University, Melbourne, VIC 3800, Australia
- Institute of Infection and Immunity, Cardiff University School of Medicine, Heath Park, Cardiff, UK
| | - Jiangning Song
- AI Lab, Tencent, Shenzhen, China
- Biomedicine Discovery Institute and Monash Data Futures Institute, Monash University, Melbourne, VIC 3800, Australia
| | | |
Collapse
|
20
|
Rappazzo CG, Fernández-Quintero ML, Mayer A, Wu NC, Greiff V, Guthmiller JJ. Defining and Studying B Cell Receptor and TCR Interactions. JOURNAL OF IMMUNOLOGY (BALTIMORE, MD. : 1950) 2023; 211:311-322. [PMID: 37459189 PMCID: PMC10495106 DOI: 10.4049/jimmunol.2300136] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/24/2023] [Accepted: 04/15/2023] [Indexed: 07/20/2023]
Abstract
BCRs (Abs) and TCRs (or adaptive immune receptors [AIRs]) are the means by which the adaptive immune system recognizes foreign and self-antigens, playing an integral part in host defense, as well as the emergence of autoimmunity. Importantly, the interaction between AIRs and their cognate Ags defies a simple key-in-lock paradigm and is instead a complex many-to-many mapping between an individual's massively diverse AIR repertoire, and a similarly diverse antigenic space. Understanding how adaptive immunity balances specificity with epitopic coverage is a key challenge for the field, and terms such as broad specificity, cross-reactivity, and polyreactivity remain ill-defined and are used inconsistently. In this Immunology Notes and Resources article, a group of experimental, structural, and computational immunologists define commonly used terms associated with AIR binding, describe methodologies to study these binding modes, as well as highlight the implications of these different binding modes for therapeutic design.
Collapse
Affiliation(s)
| | | | - Andreas Mayer
- Division of Infection and Immunity, University College London, London WC1E 6BT, UK
| | - Nicholas C. Wu
- Department of Biochemistry, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | - Victor Greiff
- Department of Immunology, University of Oslo and Oslo University Hospital, 0372 Oslo, Norway
| | - Jenna J. Guthmiller
- Department of Immunology and Microbiology, University of Colorado Anschutz Medical Campus, Aurora, CO 80045
| |
Collapse
|
21
|
Hudson D, Fernandes RA, Basham M, Ogg G, Koohy H. Can we predict T cell specificity with digital biology and machine learning? Nat Rev Immunol 2023; 23:511-521. [PMID: 36755161 PMCID: PMC9908307 DOI: 10.1038/s41577-023-00835-3] [Citation(s) in RCA: 47] [Impact Index Per Article: 23.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/07/2022] [Indexed: 02/10/2023]
Abstract
Recent advances in machine learning and experimental biology have offered breakthrough solutions to problems such as protein structure prediction that were long thought to be intractable. However, despite the pivotal role of the T cell receptor (TCR) in orchestrating cellular immunity in health and disease, computational reconstruction of a reliable map from a TCR to its cognate antigens remains a holy grail of systems immunology. Current data sets are limited to a negligible fraction of the universe of possible TCR-ligand pairs, and performance of state-of-the-art predictive models wanes when applied beyond these known binders. In this Perspective article, we make the case for renewed and coordinated interdisciplinary effort to tackle the problem of predicting TCR-antigen specificity. We set out the general requirements of predictive models of antigen binding, highlight critical challenges and discuss how recent advances in digital biology such as single-cell technology and machine learning may provide possible solutions. Finally, we describe how predicting TCR specificity might contribute to our understanding of the broader puzzle of antigen immunogenicity.
Collapse
Affiliation(s)
- Dan Hudson
- MRC Human Immunology Unit, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
- The Rosalind Franklin Institute, Didcot, UK
| | - Ricardo A Fernandes
- Chinese Academy of Medical Sciences Oxford Institute, University of Oxford, Oxford, UK
| | | | - Graham Ogg
- MRC Human Immunology Unit, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
- Chinese Academy of Medical Sciences Oxford Institute, University of Oxford, Oxford, UK
| | - Hashem Koohy
- MRC Human Immunology Unit, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK.
- Centre for Computational Biology, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK.
| |
Collapse
|
22
|
Shen Y, Voigt A, Leng X, Rodriguez AA, Nguyen CQ. A current and future perspective on T cell receptor repertoire profiling. Front Genet 2023; 14:1159109. [PMID: 37408774 PMCID: PMC10319011 DOI: 10.3389/fgene.2023.1159109] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2023] [Accepted: 06/12/2023] [Indexed: 07/07/2023] Open
Abstract
T cell receptors (TCR) play a vital role in the immune system's ability to recognize and respond to foreign antigens, relying on the highly polymorphic rearrangement of TCR genes. The recognition of autologous peptides by adaptive immunity may lead to the development and progression of autoimmune diseases. Understanding the specific TCR involved in this process can provide insights into the autoimmune process. RNA-seq (RNA sequencing) is a valuable tool for studying TCR repertoires by providing a comprehensive and quantitative analysis of the RNA transcripts. With the development of RNA technology, transcriptomic data must provide valuable information to model and predict TCR and antigen interaction and, more importantly, identify or predict neoantigens. This review provides an overview of the application and development of bulk RNA-seq and single-cell (SC) RNA-seq to examine the TCR repertoires. Furthermore, discussed here are bioinformatic tools that can be applied to study the structural biology of peptide/TCR/MHC (major histocompatibility complex) and predict antigenic epitopes using advanced artificial intelligence tools.
Collapse
Affiliation(s)
- Yiran Shen
- Department of Infectious Diseases and Immunology, College of Veterinary Medicine, University of Florida, Gainesville, FL, United States
| | - Alexandria Voigt
- Department of Infectious Diseases and Immunology, College of Veterinary Medicine, University of Florida, Gainesville, FL, United States
| | - Xuebing Leng
- Department of Microbiology and Immunology, Miller School of Medicine, University of Miami, Miami, FL, United States
| | - Amy A. Rodriguez
- Department of Infectious Diseases and Immunology, College of Veterinary Medicine, University of Florida, Gainesville, FL, United States
| | - Cuong Q. Nguyen
- Department of Infectious Diseases and Immunology, College of Veterinary Medicine, University of Florida, Gainesville, FL, United States
- Department of Oral Biology, College of Dentistry, University of Florida, Gainesville, FL, United States
- Center of Orphaned Autoimmune Diseases, University of Florida, Gainesville, FL, United States
| |
Collapse
|
23
|
Gouttefangeas C, Klein R, Maia A. The good and the bad of T cell cross-reactivity: challenges and opportunities for novel therapeutics in autoimmunity and cancer. Front Immunol 2023; 14:1212546. [PMID: 37409132 PMCID: PMC10319254 DOI: 10.3389/fimmu.2023.1212546] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2023] [Accepted: 05/24/2023] [Indexed: 07/07/2023] Open
Abstract
T cells are main actors of the immune system with an essential role in protection against pathogens and cancer. The molecular key event involved in this absolutely central task is the interaction of membrane-bound specific T cell receptors with peptide-MHC complexes which initiates T cell priming, activation and recall, and thus controls a range of downstream functions. While textbooks teach us that the repertoire of mature T cells is highly diverse, it is clear that this diversity cannot possibly cover all potential foreign peptides that might be encountered during life. TCR cross-reactivity, i.e. the ability of a single TCR to recognise different peptides, offers the best solution to this biological challenge. Reports have shown that indeed, TCR cross-reactivity is surprisingly high. Hence, the T cell dilemma is the following: be as specific as possible to target foreign danger and spare self, while being able to react to a large spectrum of body-threatening situations. This has major consequences for both autoimmune diseases and cancer, and significant implications for the development of T cell-based therapies. In this review, we will present essential experimental evidence of T cell cross-reactivity, implications for two opposite immune conditions, i.e. autoimmunity vs cancer, and how this can be differently exploited for immunotherapy approaches. Finally, we will discuss the tools available for predicting cross-reactivity and how improvements in this field might boost translational approaches.
Collapse
Affiliation(s)
- Cécile Gouttefangeas
- Nencki Institute of Experimental Biology of the Polish Academy of Sciences, Warsaw, Poland
- Cluster of Excellence iFIT (EXC2180) “Image-Guided and Functionally Instructed Tumor Therapies”, University of Tübingen, Tübingen, Germany
- German Cancer Consortium (DKTK) and German Cancer Research Center (DKFZ) partner site Tübingen, Tübingen, Germany
| | - Reinhild Klein
- Department of Hematology, Oncology, Clinical Immunology and Rheumatology, University Hospital Tübingen, Tübingen, Germany
| | - Ana Maia
- Nencki Institute of Experimental Biology of the Polish Academy of Sciences, Warsaw, Poland
- Cluster of Excellence iFIT (EXC2180) “Image-Guided and Functionally Instructed Tumor Therapies”, University of Tübingen, Tübingen, Germany
| |
Collapse
|
24
|
Abanades B, Wong WK, Boyles F, Georges G, Bujotzek A, Deane CM. ImmuneBuilder: Deep-Learning models for predicting the structures of immune proteins. Commun Biol 2023; 6:575. [PMID: 37248282 DOI: 10.1038/s42003-023-04927-7] [Citation(s) in RCA: 68] [Impact Index Per Article: 34.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2022] [Accepted: 05/11/2023] [Indexed: 05/31/2023] Open
Abstract
Immune receptor proteins play a key role in the immune system and have shown great promise as biotherapeutics. The structure of these proteins is critical for understanding their antigen binding properties. Here, we present ImmuneBuilder, a set of deep learning models trained to accurately predict the structure of antibodies (ABodyBuilder2), nanobodies (NanoBodyBuilder2) and T-Cell receptors (TCRBuilder2). We show that ImmuneBuilder generates structures with state of the art accuracy while being far faster than AlphaFold2. For example, on a benchmark of 34 recently solved antibodies, ABodyBuilder2 predicts CDR-H3 loops with an RMSD of 2.81Å, a 0.09Å improvement over AlphaFold-Multimer, while being over a hundred times faster. Similar results are also achieved for nanobodies, (NanoBodyBuilder2 predicts CDR-H3 loops with an average RMSD of 2.89Å, a 0.55Å improvement over AlphaFold2) and TCRs. By predicting an ensemble of structures, ImmuneBuilder also gives an error estimate for every residue in its final prediction. ImmuneBuilder is made freely available, both to download ( https://github.com/oxpig/ImmuneBuilder ) and to use via our webserver ( http://opig.stats.ox.ac.uk/webapps/newsabdab/sabpred ). We also make available structural models for ~150 thousand non-redundant paired antibody sequences ( https://doi.org/10.5281/zenodo.7258553 ).
Collapse
Affiliation(s)
| | - Wing Ki Wong
- Large Molecule Research, Roche Pharma Research and Early Development, Roche Innovation Center Munich, Penzberg, Germany
| | - Fergus Boyles
- Department of Statistics, University of Oxford, Oxford, UK
| | - Guy Georges
- Large Molecule Research, Roche Pharma Research and Early Development, Roche Innovation Center Munich, Penzberg, Germany
| | - Alexander Bujotzek
- Large Molecule Research, Roche Pharma Research and Early Development, Roche Innovation Center Munich, Penzberg, Germany
| | | |
Collapse
|
25
|
Xu AM, Chour W, DeLucia DC, Su Y, Pavlovitch-Bedzyk AJ, Ng R, Rasheed Y, Davis MM, Lee JK, Heath JR. Entropic analysis of antigen-specific CDR3 domains identifies essential binding motifs shared by CDR3s with different antigen specificities. Cell Syst 2023; 14:273-284.e5. [PMID: 37001518 PMCID: PMC10355346 DOI: 10.1016/j.cels.2023.03.001] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2021] [Revised: 09/01/2022] [Accepted: 03/01/2023] [Indexed: 04/22/2023]
Abstract
Antigen-specific T cell receptor (TCR) sequences can have prognostic, predictive, and therapeutic value, but decoding the specificity of TCR recognition remains challenging. Unlike DNA strands that base pair, TCRs bind to their targets with different orientations and different lengths, which complicates comparisons. We present scanning parametrized by normalized TCR length (SPAN-TCR) to analyze antigen-specific TCR CDR3 sequences and identify patterns driving TCR-pMHC specificity. Using entropic analysis, SPAN-TCR identifies 2-mer motifs that decrease the diversity (entropy) of CDR3s. These motifs are the most common patterns that can predict CDR3 composition, and we identify "essential" motifs that decrease entropy in the same CDR3 α or β chain containing the 2-mer, and "super-essential" motifs that decrease entropy in both chains. Molecular dynamics analysis further suggests that these motifs may play important roles in binding. We then employ SPAN-TCR to resolve similarities in TCR repertoires against different antigens using public databases of TCR sequences.
Collapse
Affiliation(s)
- Alexander M Xu
- Institute for Systems Biology, Seattle, WA 98109, USA; Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA 91125, USA; Department of Biomedical Sciences, Cedars-Sinai Medical Center, Los Angeles, CA 90048, USA; Samuel Oschin Comprehensive Cancer Institute, Cedars-Sinai Medical Center, Los Angeles, CA 90048, USA.
| | - William Chour
- Institute for Systems Biology, Seattle, WA 98109, USA; Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125, USA; Keck School of Medicine, University of Southern California, Los Angeles, CA 91125, USA
| | - Diana C DeLucia
- Division of Human Biology, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | - Yapeng Su
- Institute for Systems Biology, Seattle, WA 98109, USA; Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA 91125, USA
| | | | - Rachel Ng
- Institute for Systems Biology, Seattle, WA 98109, USA
| | - Yusuf Rasheed
- Institute for Systems Biology, Seattle, WA 98109, USA
| | - Mark M Davis
- Computational and Systems Immunology Program, Stanford University School of Medicine, Stanford, CA 94305, USA; Institute for Immunity, Transplantation and Infection, Stanford University School of Medicine, Stanford, CA 94305, USA; Department of Microbiology and Immunology, Stanford University School of Medicine, Stanford, CA 94305, USA; Howard Hughes Medical Institute, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - John K Lee
- Division of Human Biology, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA; Division of Medical Oncology, Department of Medicine, University of Washington, Seattle, WA 98195, USA
| | - James R Heath
- Institute for Systems Biology, Seattle, WA 98109, USA.
| |
Collapse
|
26
|
Peng X, Lei Y, Feng P, Jia L, Ma J, Zhao D, Zeng J. Characterizing the interaction conformation between T-cell receptors and epitopes with deep learning. NAT MACH INTELL 2023. [DOI: 10.1038/s42256-023-00634-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/29/2023]
|
27
|
Sanromán ÁF, Joshi K, Au L, Chain B, Turajlic S. TCR sequencing: applications in immuno-oncology research. IMMUNO-ONCOLOGY TECHNOLOGY 2023; 17:100373. [PMID: 36908996 PMCID: PMC9996383 DOI: 10.1016/j.iotech.2023.100373] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
•T-cell receptor (TCR) interaction with major histocompatibility complex-antigen complexes leads to antitumour responses.•TCR sequencing analysis allows characterisation of T cells that recognise tumour neoantigens.•T-cell clonal revival and clonal replacement potentially underpin immunotherapy responses.
Collapse
Affiliation(s)
- Á F Sanromán
- Cancer Dynamics Laboratory, The Francis Crick Institute, London, UK
| | - K Joshi
- Department of Medical Oncology, The Royal Marsden NHS Foundation Trust, London, UK.,Renal and Skin Unit, The Royal Marsden NHS Foundation Trust, London, UK
| | - L Au
- Cancer Dynamics Laboratory, The Francis Crick Institute, London, UK.,Department of Medical Oncology, Peter MacCallum Cancer Centre, Melbourne, Australia.,Cancer Immunology Program, Peter MacCallum Cancer Centre, Melbourne, Australia.,Sir Peter MacCallum Department of Oncology, The University of Melbourne, Australia
| | - B Chain
- Division of Infection and Immunity, University College London, London, UK.,Department of Computer Science, University College London, London, UK
| | - S Turajlic
- Renal and Skin Unit, The Royal Marsden NHS Foundation Trust, London, UK.,Melanoma and Kidney Cancer Team, The Institute of Cancer Research, London, UK
| |
Collapse
|
28
|
Raybould MIJ, Nissley DA, Kumar S, Deane CM. Computationally profiling peptide:MHC recognition by T-cell receptors and T-cell receptor-mimetic antibodies. Front Immunol 2023; 13:1080596. [PMID: 36700202 PMCID: PMC9868621 DOI: 10.3389/fimmu.2022.1080596] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2022] [Accepted: 12/07/2022] [Indexed: 01/11/2023] Open
Abstract
T-cell receptor-mimetic antibodies (TCRms) targeting disease-associated peptides presented by Major Histocompatibility Complexes (pMHCs) are set to become a major new drug modality. However, we lack a general understanding of how TCRms engage pMHC targets, which is crucial for predicting their specificity and safety. Several new structures of TCRm:pMHC complexes have become available in the past year, providing sufficient initial data for a holistic analysis of TCRms as a class of pMHC binding agents. Here, we profile the complete set of TCRm:pMHC complexes against representative TCR:pMHC complexes to quantify the TCR-likeness of their pMHC engagement. We find that intrinsic molecular differences between antibodies and TCRs lead to fundamentally different roles for their heavy/light chains and Complementarity-Determining Region loops during antigen recognition. The idiotypic properties of antibodies may increase the likelihood of TCRms engaging pMHCs with less peptide selectivity than TCRs. However, the pMHC recognition features of some TCRms, including the two TCRms currently in clinical trials, can be remarkably TCR-like. The insights gained from this study will aid in the rational design and optimisation of next-generation TCRms.
Collapse
Affiliation(s)
- Matthew I. J. Raybould
- Oxford Protein Informatics Group, Department of Statistics, University of Oxford, Oxford, United Kingdom
| | - Daniel A. Nissley
- Oxford Protein Informatics Group, Department of Statistics, University of Oxford, Oxford, United Kingdom
| | - Sandeep Kumar
- Biotherapeutics Discovery, Boehringer Ingelheim, Ridgefield, CT, United States
| | - Charlotte M. Deane
- Oxford Protein Informatics Group, Department of Statistics, University of Oxford, Oxford, United Kingdom,*Correspondence: Charlotte M. Deane,
| |
Collapse
|
29
|
Bi J, Zheng Y, Wang C, Ding Y. An Attention Based Bidirectional LSTM Method to Predict the Binding of TCR and Epitope. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:3272-3280. [PMID: 34559661 DOI: 10.1109/tcbb.2021.3115353] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
The T-cell epitope prediction has always been a long-term challenge in immunoinformatics and bioinformatics. Studying the specific recognition between T-cell receptor (TCR) and peptide-major histocompatibility complex (p-MHC) complexes can help us better understand the immune mechanism, it's also make a signification contribution in developing vaccines and targeted drugs. Meanwhile, more advanced methods are needed for distinguishing TCRs binding from different epitopes. In this paper, we introduce a hybrid model composed of bidirectional long short-term memory networks (BiLSTM), attention and convolutional neural networks (CNN) that can identified the binding of TCRs to epitopes. The BiLSTM can more completely extract amino acid forward and backward information in the sequence, and attention mechanism can focus on amino acids at certain positions from complex sequences to capture the most important feature, then CNN was used to further extract salient features to predict the binding of TCR-epitope. In McPAS dataset, the AUC value (the area under ROC curve) of naive TCR-epitope binding is 0.974 and specific TCR-epitope binding is 0.887. The model has achieved better prediction results than other existing models (TCRGP, ERGO, NetTCR), and some experiments are used to analyze the advantages of our model. The algorithm is available at https://github.com/bijingshu/BiAttCNN.git.
Collapse
|
30
|
T. RR, Smith JC. Structural patterns in class 1 major histocompatibility complex‐restricted nonamer peptide binding to T‐cell receptors. Proteins 2022; 90:1645-1654. [DOI: 10.1002/prot.26343] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2021] [Revised: 03/12/2022] [Accepted: 03/27/2022] [Indexed: 11/09/2022]
Affiliation(s)
- Rajitha Rajeshwar T.
- Department of Biochemistry and Cellular and Molecular Biology University of Tennessee Knoxville Tennessee USA
- UT/ORNL Center for Molecular Biophysics Oak Ridge National Laboratory Oak Ridge Tennessee USA
| | - Jeremy C. Smith
- Department of Biochemistry and Cellular and Molecular Biology University of Tennessee Knoxville Tennessee USA
- UT/ORNL Center for Molecular Biophysics Oak Ridge National Laboratory Oak Ridge Tennessee USA
| |
Collapse
|
31
|
Abstract
Antibodies and T cell receptors (TCRs) are the fundamental building blocks of adaptive immunity. Repertoire-scale functionality derives from their epitope-binding properties, just as macroscopic properties like temperature derive from microscopic molecular properties. However, most approaches to repertoire-scale measurement, including sequence diversity and entropy, are not based on antibody or TCR function in this way. Thus, they potentially overlook key features of immunological function. Here we present a framework that describes repertoires in terms of the epitope-binding properties of their constituent antibodies and TCRs, based on analysis of thousands of antibody-antigen and TCR-peptide-major-histocompatibility-complex binding interactions and over 400 high-throughput repertoires. We show that repertoires consist of loose overlapping classes of antibodies and TCRs with similar binding properties. We demonstrate the potential of this framework to distinguish specific responses vs. bystander activation in influenza vaccinees, stratify cytomegalovirus (CMV)-infected cohorts, and identify potential immunological "super-agers." Classes add a valuable dimension to the assessment of immune function.
Collapse
|
32
|
Cohen T, Halfon M, Schneidman-Duhovny D. NanoNet: Rapid and accurate end-to-end nanobody modeling by deep learning. Front Immunol 2022; 13:958584. [PMID: 36032123 PMCID: PMC9411858 DOI: 10.3389/fimmu.2022.958584] [Citation(s) in RCA: 33] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2022] [Accepted: 07/15/2022] [Indexed: 11/20/2022] Open
Abstract
Antibodies are a rapidly growing class of therapeutics. Recently, single domain camelid VHH antibodies, and their recognition nanobody domain (Nb) appeared as a cost-effective highly stable alternative to full-length antibodies. There is a growing need for high-throughput epitope mapping based on accurate structural modeling of the variable domains that share a common fold and differ in the Complementarity Determining Regions (CDRs). We develop a deep learning end-to-end model, NanoNet, that given a sequence directly produces the 3D coordinates of the backbone and Cβ atoms of the entire VH domain. For the Nb test set, NanoNet achieves 3.16Å average RMSD for the most variable CDR3 loops and 2.65Å, 1.73Å for the CDR1, CDR2 loops, respectively. The accuracy for antibody VH domains is even higher: 2.38Å RMSD for CDR3 and 0.89Å, 0.96Å for the CDR1, CDR2 loops, respectively. NanoNet run times allow generation of ∼1M nanobody structures in less than 4 hours on a standard CPU computer enabling high-throughput structure modeling. NanoNet is available at GitHub: https://github.com/dina-lab3D/NanoNet.
Collapse
Affiliation(s)
- Tomer Cohen
- The Rachel and Selim Benin School of Computer Science and Engineering, The Hebrew University of Jerusalem, Jerusalem, Israel
| | | | - Dina Schneidman-Duhovny
- The Rachel and Selim Benin School of Computer Science and Engineering, The Hebrew University of Jerusalem, Jerusalem, Israel
| |
Collapse
|
33
|
Joshi K, Milighetti M, Chain BM. Application of T cell receptor (TCR) repertoire analysis for the advancement of cancer immunotherapy. Curr Opin Immunol 2022; 74:1-8. [PMID: 34454284 DOI: 10.1016/j.coi.2021.07.006] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2021] [Revised: 07/13/2021] [Accepted: 07/13/2021] [Indexed: 12/14/2022]
Abstract
T cell receptor (TCR) sequencing has emerged as a powerful new technology in analysis of the host-tumour interaction. The advances in NextGen sequencing technologies, coupled with powerful novel bioinformatic tools, allow quantitative and reproducible characterisation of repertoires from tumour and blood samples from an increasing number of patients with a variety of solid cancers. In this review, we consider how global metrics such as T cell clonality and diversity can be extracted from these repertoires and used to give insight into the mechanism of action of immune checkpoint blockade. Furthermore, we explore how the analysis of TCR overlap between repertories can help define spatial and temporal heterogeneity of the anti-tumoural immune response. Finally, we review how analysis of TCR sequence and structure, either of individual TCRs or from sets of related TCRs can be used to annotate the antigenic specificity, with important implications for the development of personalised adoptive cellular immunotherapies.
Collapse
Affiliation(s)
- Kroopa Joshi
- Department of Medical Oncology, The Royal Marsden NHS Foundation Trust, London, United Kingdom
| | - Martina Milighetti
- Division of Infection and Immunity, University College London, London, United Kingdom
| | - Benjamin M Chain
- Division of Infection and Immunity, University College London, London, United Kingdom; Department of Computer Science, University College London, London, United Kingdom.
| |
Collapse
|
34
|
Perez MAS, Cuendet MA, Röhrig UF, Michielin O, Zoete V. Structural Prediction of Peptide-MHC Binding Modes. Methods Mol Biol 2022; 2405:245-282. [PMID: 35298818 DOI: 10.1007/978-1-0716-1855-4_13] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
The immune system is constantly protecting its host from the invasion of pathogens and the development of cancer cells. The specific CD8+ T-cell immune response against virus-infected cells and tumor cells is based on the T-cell receptor recognition of antigenic peptides bound to class I major histocompatibility complexes (MHC) at the surface of antigen presenting cells. Consequently, the peptide binding specificities of the highly polymorphic MHC have important implications for the design of vaccines, for the treatment of autoimmune diseases, and for personalized cancer immunotherapy. Evidence-based machine-learning approaches have been successfully used for the prediction of peptide binders and are currently being developed for the prediction of peptide immunogenicity. However, understanding and modeling the structural details of peptide/MHC binding is crucial for a better understanding of the molecular mechanisms triggering the immunological processes, estimating peptide/MHC affinity using universal physics-based approaches, and driving the design of novel peptide ligands. Unfortunately, due to the large diversity of MHC allotypes and possible peptides, the growing number of 3D structures of peptide/MHC (pMHC) complexes in the Protein Data Bank only covers a small fraction of the possibilities. Consequently, there is a growing need for rapid and efficient approaches to predict 3D structures of pMHC complexes. Here, we review the key characteristics of the 3D structure of pMHC complexes before listing databases and other sources of information on pMHC structures and MHC specificities. Finally, we discuss some of the most prominent pMHC docking software.
Collapse
Affiliation(s)
- Marta A S Perez
- Computer-aided Molecular Engineering Group, Department of Oncology UNIL-CHUV, Lausanne University, Lausanne, Switzerland
- Ludwig Institute for Cancer Research, Lausanne, Switzerland
- Molecular Modelling Group, SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Michel A Cuendet
- Molecular Modelling Group, SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland
- Oncology Department, Centre Hospitalier Universitaire Vaudois (CHUV), Precision Oncology Center, Lausanne, Switzerland
| | - Ute F Röhrig
- Molecular Modelling Group, SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Olivier Michielin
- Molecular Modelling Group, SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland.
- Oncology Department, Centre Hospitalier Universitaire Vaudois (CHUV), Precision Oncology Center, Lausanne, Switzerland.
| | - Vincent Zoete
- Computer-aided Molecular Engineering Group, Department of Oncology UNIL-CHUV, Lausanne University, Lausanne, Switzerland.
- Ludwig Institute for Cancer Research, Lausanne, Switzerland.
- Molecular Modelling Group, SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland.
| |
Collapse
|
35
|
Heather JM, Spindler MJ, Alonso M, Shui Y, Millar DG, Johnson D, Cobbold M, Hata A. OUP accepted manuscript. Nucleic Acids Res 2022; 50:e68. [PMID: 35325179 PMCID: PMC9262623 DOI: 10.1093/nar/gkac190] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2021] [Revised: 02/18/2022] [Accepted: 03/09/2022] [Indexed: 11/17/2022] Open
Abstract
The study and manipulation of T cell receptors (TCRs) is central to multiple fields across basic and translational immunology research. Produced by V(D)J recombination, TCRs are often only recorded in the literature and data repositories as a combination of their V and J gene symbols, plus their hypervariable CDR3 amino acid sequence. However, numerous applications require full-length coding nucleotide sequences. Here we present Stitchr, a software tool developed to specifically address this limitation. Given minimal V/J/CDR3 information, Stitchr produces complete coding sequences representing a fully spliced TCR cDNA. Due to its modular design, Stitchr can be used for TCR engineering using either published germline or novel/modified variable and constant region sequences. Sequences produced by Stitchr were validated by synthesizing and transducing TCR sequences into Jurkat cells, recapitulating the expected antigen specificity of the parental TCR. Using a companion script, Thimble, we demonstrate that Stitchr can process a million TCRs in under ten minutes using a standard desktop personal computer. By systematizing the production and modification of TCR sequences, we propose that Stitchr will increase the speed, repeatability, and reproducibility of TCR research. Stitchr is available on GitHub.
Collapse
Affiliation(s)
- James M Heather
- To whom correspondence should be addressed. Tel: +1 617 724 0104;
| | | | | | | | - David G Millar
- Massachusetts General Hospital Cancer Center, Charlestown, MA, USA
- Department of Medicine, Harvard Medical School, Boston, MA, USA
| | | | - Mark Cobbold
- Massachusetts General Hospital Cancer Center, Charlestown, MA, USA
- Department of Medicine, Harvard Medical School, Boston, MA, USA
| | - Aaron N Hata
- Correspondence may also be addressed to Aaron N. Hata. Tel: +1 617 724 3442;
| |
Collapse
|
36
|
Ghobadi Z, Mahnam K, Shakhsi-Niaei M. In-silico design of peptides for inhibition of HLA-A*03-KLIETYFSK complex as a new drug design for treatment of multiples sclerosis disease. J Mol Graph Model 2021; 111:108079. [PMID: 34837787 DOI: 10.1016/j.jmgm.2021.108079] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2021] [Revised: 11/03/2021] [Accepted: 11/14/2021] [Indexed: 10/19/2022]
Abstract
Multiple sclerosis is recognized as a chronic inflammatory disease. Human leukocyte antigen (HLA) plays an important role in initiating adaptive immune responses. HLA class I is present in almost all nucleated cells and presents the cleaved endogenous peptide antigens to cytotoxic T cells. HLA-A*03 is one of the HLA class I alleles, which is reported as substantially related HLA to MS disease. In 2011, the structure of the HLA-A*03 in complex was identified with an immunodominant proteolipid protein (PLP) epitope (KLIETYFSK). This complex has been reported as an important autoantigen-presenting complex in MS pathogenesis. In this study, new peptides were designed to bind to this complex that may prevent specific pathogenic cytotoxic T cell binding to this autoantigen-presenting complex and CNS demyelination. Herein, 14 new helical peptides containing 19 amino acids were designed and their structures were predicted using the PEP-FOLD server. The binding of each designed peptide to the mentioned complex was then performed. A mutation approach was used by the BeAtMuSiC server to improve the binding affinity of the designed peptide. In each position, amino acid substitutions leading to an increase in the binding affinity of the peptide to the mentioned complex were determined. Finally, the resulting complexes were simulated for 40 ns using AMBER18 software. The results revealed that out of 14 designed peptides, "WRYWWKDWAKQFRQFYRWF" peptide exhibited the highest affinity for binding to the mentioned complex. This peptide can be considered as a potential drug to control multiple sclerosis disease in patients carrying the HLA-A*03 allele.
Collapse
Affiliation(s)
- Zahra Ghobadi
- Department of Biology, Faculty of Basic Sciences, Shahrekord University, Shahrekord, Iran
| | - Karim Mahnam
- Department of Biology, Faculty of Basic Sciences, Shahrekord University, Shahrekord, Iran; Nanotechnology Research Center, Shahrekord University, Shahrekord, Iran.
| | - Mostafa Shakhsi-Niaei
- Department of Genetics, Faculty of Basic Sciences, Shahrekord University, Shahrekord, Iran
| |
Collapse
|
37
|
Riedel F, Aparicio-Soto M, Curato C, Thierse HJ, Siewert K, Luch A. Immunological Mechanisms of Metal Allergies and the Nickel-Specific TCR-pMHC Interface. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2021; 18:10867. [PMID: 34682608 PMCID: PMC8535423 DOI: 10.3390/ijerph182010867] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/31/2021] [Revised: 10/03/2021] [Accepted: 10/11/2021] [Indexed: 12/19/2022]
Abstract
Besides having physiological functions and general toxic effects, many metal ions can cause allergic reactions in humans. We here review the immune events involved in the mediation of metal allergies. We focus on nickel (Ni), cobalt (Co) and palladium (Pd), because these allergens are among the most prevalent sensitizers (Ni, Co) and immediate neighbors in the periodic table of the chemical elements. Co-sensitization between Ni and the other two metals is frequent while the knowledge on a possible immunological cross-reactivity using in vivo and in vitro approaches remains limited. At the center of an allergic reaction lies the capability of a metal allergen to form T cell epitopes that are recognized by specific T cell receptors (TCR). Technological advances such as activation-induced marker assays and TCR high-throughput sequencing recently provided new insights into the interaction of Ni2+ with the αβ TCR-peptide-major histocompatibility complex (pMHC) interface. Ni2+ functionally binds to the TCR gene segment TRAV9-2 or a histidine in the complementarity determining region 3 (CDR3), the main antigen binding region. Thus, we overview known, newly identified and hypothesized mechanisms of metal-specific T cell activation and discuss current knowledge on cross-reactivity.
Collapse
Affiliation(s)
- Franziska Riedel
- Department for Chemicals and Product Safety, Federal Institute for Risk Assessment, Max-Dohrn-Straße 8-10, 10589 Berlin, Germany; (M.A.-S.); (C.C.); (H.-J.T.); (K.S.); (A.L.)
- Institute of Pharmacy, Freie Universität Berlin, Königin-Luise-Straße 2, 14195 Berlin, Germany
| | - Marina Aparicio-Soto
- Department for Chemicals and Product Safety, Federal Institute for Risk Assessment, Max-Dohrn-Straße 8-10, 10589 Berlin, Germany; (M.A.-S.); (C.C.); (H.-J.T.); (K.S.); (A.L.)
| | - Caterina Curato
- Department for Chemicals and Product Safety, Federal Institute for Risk Assessment, Max-Dohrn-Straße 8-10, 10589 Berlin, Germany; (M.A.-S.); (C.C.); (H.-J.T.); (K.S.); (A.L.)
| | - Hermann-Josef Thierse
- Department for Chemicals and Product Safety, Federal Institute for Risk Assessment, Max-Dohrn-Straße 8-10, 10589 Berlin, Germany; (M.A.-S.); (C.C.); (H.-J.T.); (K.S.); (A.L.)
| | - Katherina Siewert
- Department for Chemicals and Product Safety, Federal Institute for Risk Assessment, Max-Dohrn-Straße 8-10, 10589 Berlin, Germany; (M.A.-S.); (C.C.); (H.-J.T.); (K.S.); (A.L.)
| | - Andreas Luch
- Department for Chemicals and Product Safety, Federal Institute for Risk Assessment, Max-Dohrn-Straße 8-10, 10589 Berlin, Germany; (M.A.-S.); (C.C.); (H.-J.T.); (K.S.); (A.L.)
- Institute of Pharmacy, Freie Universität Berlin, Königin-Luise-Straße 2, 14195 Berlin, Germany
| |
Collapse
|
38
|
Schaap-Johansen AL, Vujović M, Borch A, Hadrup SR, Marcatili P. T Cell Epitope Prediction and Its Application to Immunotherapy. Front Immunol 2021; 12:712488. [PMID: 34603286 PMCID: PMC8479193 DOI: 10.3389/fimmu.2021.712488] [Citation(s) in RCA: 37] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2021] [Accepted: 07/12/2021] [Indexed: 12/13/2022] Open
Abstract
T cells play a crucial role in controlling and driving the immune response with their ability to discriminate peptides derived from healthy as well as pathogenic proteins. In this review, we focus on the currently available computational tools for epitope prediction, with a particular focus on tools aimed at identifying neoepitopes, i.e. cancer-specific peptides and their potential for use in immunotherapy for cancer treatment. This review will cover how these tools work, what kind of data they use, as well as pros and cons in their respective applications.
Collapse
Affiliation(s)
| | - Milena Vujović
- Department of Health Technology, Technical University of Denmark, Lyngby, Denmark
| | - Annie Borch
- Department of Health Technology, Technical University of Denmark, Lyngby, Denmark
| | - Sine Reker Hadrup
- Department of Health Technology, Technical University of Denmark, Lyngby, Denmark
| | - Paolo Marcatili
- Department of Health Technology, Technical University of Denmark, Lyngby, Denmark
| |
Collapse
|
39
|
Milighetti M, Shawe-Taylor J, Chain B. Predicting T Cell Receptor Antigen Specificity From Structural Features Derived From Homology Models of Receptor-Peptide-Major Histocompatibility Complexes. Front Physiol 2021; 12:730908. [PMID: 34566692 PMCID: PMC8456106 DOI: 10.3389/fphys.2021.730908] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2021] [Accepted: 08/02/2021] [Indexed: 11/13/2022] Open
Abstract
The physical interaction between the T cell receptor (TCR) and its cognate antigen causes T cells to activate and participate in the immune response. Understanding this physical interaction is important in predicting TCR binding to a target epitope, as well as potential cross-reactivity. Here, we propose a way of collecting informative features of the binding interface from homology models of T cell receptor-peptide-major histocompatibility complex (TCR-pMHC) complexes. The information collected from these structures is sufficient to discriminate binding from non-binding TCR-pMHC pairs in multiple independent datasets. The classifier is limited by the number of crystal structures available for the homology modelling and by the size of the training set. However, the classifier shows comparable performance to sequence-based classifiers requiring much larger training sets.
Collapse
Affiliation(s)
- Martina Milighetti
- Division of Infection and Immunity, University College London, London, United Kingdom
- Cancer Institute, University College London, London, United Kingdom
| | - John Shawe-Taylor
- Department of Computer Science, University College London, London, United Kingdom
| | - Benny Chain
- Division of Infection and Immunity, University College London, London, United Kingdom
- Department of Computer Science, University College London, London, United Kingdom
| |
Collapse
|
40
|
Peacock T, Chain B. Information-Driven Docking for TCR-pMHC Complex Prediction. Front Immunol 2021; 12:686127. [PMID: 34177934 PMCID: PMC8219952 DOI: 10.3389/fimmu.2021.686127] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2021] [Accepted: 05/07/2021] [Indexed: 12/16/2022] Open
Abstract
T cell receptor (TCR) recognition of peptides presented by major histocompatibility complex (MHC) molecules is a fundamental process in the adaptive immune system. An understanding of this recognition process at the molecular level is crucial for TCR based therapeutics and vaccine design. The broad nature of TCR diversity and cross-reactivity presents a challenge for traditional structural resolution. Computational modelling of TCR-pMHC complexes offers an efficient alternative. This study compares the ability of four general-purpose docking platforms (ClusPro, LightDock, ZDOCK and HADDOCK) to make use of varying levels of binding interface information for accurate TCR-pMHC modelling. Each platform was tested on an expanded benchmark set of 44 TCR-pMHC docking cases. In general, HADDOCK is shown to be the best performer. Docking strategy guidance is provided to obtain the best models for each platform for future research. The TCR-pMHC docking cases used in this study can be downloaded from https://github.com/innate2adaptive/ExpandedBenchmark.
Collapse
Affiliation(s)
- Thomas Peacock
- Division of Infection and Immunity, University College London, London, United Kingdom.,The UCL Centre for Computation, Mathematics and Physics in the Life Sciences and Experimental Biology (CoMPLEX), Department Computer Science, University College London, London, United Kingdom
| | - Benny Chain
- Division of Infection and Immunity, University College London, London, United Kingdom
| |
Collapse
|
41
|
Luu AM, Leistico JR, Miller T, Kim S, Song JS. Predicting TCR-Epitope Binding Specificity Using Deep Metric Learning and Multimodal Learning. Genes (Basel) 2021; 12:genes12040572. [PMID: 33920780 PMCID: PMC8071129 DOI: 10.3390/genes12040572] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2021] [Revised: 04/12/2021] [Accepted: 04/13/2021] [Indexed: 12/18/2022] Open
Abstract
Understanding the recognition of specific epitopes by cytotoxic T cells is a central problem in immunology. Although predicting binding between peptides and the class I Major Histocompatibility Complex (MHC) has had success, predicting interactions between T cell receptors (TCRs) and MHC class I-peptide complexes (pMHC) remains elusive. This paper utilizes a convolutional neural network model employing deep metric learning and multimodal learning to perform two critical tasks in TCR-epitope binding prediction: identifying the TCRs that bind a given epitope from a TCR repertoire, and identifying the binding epitope of a given TCR from a list of candidate epitopes. Our model can perform both tasks simultaneously and reveals that inconsistent preprocessing of TCR sequences can confound binding prediction. Applying a neural network interpretation method identifies key amino acid sequence patterns and positions within the TCR, important for binding specificity. Contrary to common assumption, known crystal structures of TCR-pMHC complexes show that the predicted salient amino acid positions are not necessarily the closest to the epitopes, implying that physical proximity may not be a good proxy for importance in determining TCR-epitope specificity. Our work thus provides an insight into the learned predictive features of TCR-epitope binding specificity and advances the associated classification tasks.
Collapse
Affiliation(s)
- Alan M. Luu
- Department of Physics, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA; (A.M.L.); (J.R.L.); (T.M.); (S.K.)
- Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | - Jacob R. Leistico
- Department of Physics, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA; (A.M.L.); (J.R.L.); (T.M.); (S.K.)
- Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | - Tim Miller
- Department of Physics, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA; (A.M.L.); (J.R.L.); (T.M.); (S.K.)
- Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | - Somang Kim
- Department of Physics, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA; (A.M.L.); (J.R.L.); (T.M.); (S.K.)
- Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | - Jun S. Song
- Department of Physics, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA; (A.M.L.); (J.R.L.); (T.M.); (S.K.)
- Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
- Cancer Center at Illinois, University of Illinois, Urbana, IL 61801, USA
- Correspondence:
| |
Collapse
|
42
|
Pertseva M, Gao B, Neumeier D, Yermanos A, Reddy ST. Applications of Machine and Deep Learning in Adaptive Immunity. Annu Rev Chem Biomol Eng 2021; 12:39-62. [PMID: 33852352 DOI: 10.1146/annurev-chembioeng-101420-125021] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Adaptive immunity is mediated by lymphocyte B and T cells, which respectively express a vast and diverse repertoire of B cell and T cell receptors and, in conjunction with peptide antigen presentation through major histocompatibility complexes (MHCs), can recognize and respond to pathogens and diseased cells. In recent years, advances in deep sequencing have led to a massive increase in the amount of adaptive immune receptor repertoire data; additionally, proteomics techniques have led to a wealth of data on peptide-MHC presentation. These large-scale data sets are now making it possible to train machine and deep learning models, which can be used to identify complex and high-dimensional patterns in immune repertoires. This article introduces adaptive immune repertoires and machine and deep learning related to biological sequence data and then summarizes the many applications in this field, which span from predicting the immunological status of a host to the antigen specificity of individual receptors and the engineering of immunotherapeutics.
Collapse
Affiliation(s)
- Margarita Pertseva
- Department of Biosystems Science and Engineering, ETH Zurich, 4058 Basel, Switzerland; .,Life Science Zurich Graduate School, ETH Zurich and University of Zurich, 8006 Zurich, Switzerland
| | - Beichen Gao
- Department of Biosystems Science and Engineering, ETH Zurich, 4058 Basel, Switzerland;
| | - Daniel Neumeier
- Department of Biosystems Science and Engineering, ETH Zurich, 4058 Basel, Switzerland;
| | - Alexander Yermanos
- Department of Biosystems Science and Engineering, ETH Zurich, 4058 Basel, Switzerland; .,Department of Pathology and Immunology, University of Geneva, 1205 Geneva, Switzerland.,Department of Biology, Institute of Microbiology and Immunology, ETH Zurich, 8093 Zurich, Switzerland
| | - Sai T Reddy
- Department of Biosystems Science and Engineering, ETH Zurich, 4058 Basel, Switzerland;
| |
Collapse
|
43
|
Chen SY, Yue T, Lei Q, Guo AY. TCRdb: a comprehensive database for T-cell receptor sequences with powerful search function. Nucleic Acids Res 2021; 49:D468-D474. [PMID: 32990749 PMCID: PMC7778924 DOI: 10.1093/nar/gkaa796] [Citation(s) in RCA: 44] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2020] [Revised: 09/02/2020] [Accepted: 09/11/2020] [Indexed: 01/05/2023] Open
Abstract
T cells and the T-cell receptor (TCR) repertoire play pivotal roles in immune response and immunotherapy. TCR sequencing (TCR-Seq) technology has enabled accurate profiling TCR repertoire and currently a large number of TCR-Seq data are available in public. Based on the urgent need to effectively re-use these data, we developed TCRdb, a comprehensive human TCR sequences database, by a uniform pipeline to characterize TCR sequences on TCR-Seq data. TCRdb contains more than 277 million highly reliable TCR sequences from over 8265 TCR-Seq samples across hundreds of tissues/clinical conditions/cell types. The unique features of TCRdb include: (i) comprehensive and reliable sequences for TCR repertoire in different samples generated by a strict and uniform pipeline of TCRdb; (ii) powerful search function, allowing users to identify their interested TCR sequences in different conditions; (iii) categorized sample metadata, enabling comparison of TCRs in different sample types; (iv) interactive data visualization charts, describing the TCR repertoire in TCR diversity, length distribution and V-J gene utilization. The TCRdb database is freely available at http://bioinfo.life.hust.edu.cn/TCRdb/ and will be a useful resource in the research and application community of T cell immunology.
Collapse
Affiliation(s)
- Si-Yi Chen
- Center for Artificial Intelligence Biology, Hubei Bioinformatics & Molecular Imaging Key Laboratory, Key Laboratory of Molecular Biophysics of the Ministry of Education, College of Life Science and Technology, Huazhong University of Science and Technology; Wuhan, 430074, China
| | - Tao Yue
- Center for Artificial Intelligence Biology, Hubei Bioinformatics & Molecular Imaging Key Laboratory, Key Laboratory of Molecular Biophysics of the Ministry of Education, College of Life Science and Technology, Huazhong University of Science and Technology; Wuhan, 430074, China
| | - Qian Lei
- Center for Artificial Intelligence Biology, Hubei Bioinformatics & Molecular Imaging Key Laboratory, Key Laboratory of Molecular Biophysics of the Ministry of Education, College of Life Science and Technology, Huazhong University of Science and Technology; Wuhan, 430074, China
| | - An-Yuan Guo
- Center for Artificial Intelligence Biology, Hubei Bioinformatics & Molecular Imaging Key Laboratory, Key Laboratory of Molecular Biophysics of the Ministry of Education, College of Life Science and Technology, Huazhong University of Science and Technology; Wuhan, 430074, China
| |
Collapse
|
44
|
Wong WK, Marks C, Leem J, Lewis AP, Shi J, Deane CM. TCRBuilder: multi-state T-cell receptor structure prediction. Bioinformatics 2020; 36:3580-3581. [PMID: 32181809 DOI: 10.1093/bioinformatics/btaa194] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2020] [Revised: 02/21/2020] [Accepted: 03/13/2020] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION T-cell receptors (TCRs) are immune proteins that primarily target peptide antigens presented by the major histocompatibility complex. They tend to have lower specificity and affinity than their antibody counterparts, and their binding sites have been shown to adopt multiple conformations, which is potentially an important factor for their polyspecificity. None of the current TCR-modelling tools predict this variability which limits our ability to accurately predict TCR binding. RESULTS We present TCRBuilder, a multi-state TCR structure prediction tool. Given a paired αβTCR sequence, TCRBuilder returns a model or an ensemble of models covering the potential conformations of the binding site. This enables the analysis of structurally driven polyspecificity in TCRs, which is not possible with existing tools. AVAILABILITY AND IMPLEMENTATION http://opig.stats.ox.ac.uk/resources. CONTACT deane@stats.ox.ac.uk. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Wing Ki Wong
- Department of Statistics, University of Oxford, Oxford OX1 3LB, UK
| | - Claire Marks
- Department of Statistics, University of Oxford, Oxford OX1 3LB, UK
| | - Jinwoo Leem
- Department of Statistics, University of Oxford, Oxford OX1 3LB, UK
| | - Alan P Lewis
- Data and Computational Sciences, GlaxoSmithKline Research and Development, Stevenage SG1 2NY, UK
| | - Jiye Shi
- Department of Chemistry, UCB Pharma, Slough SL1 3WE, UK
| | | |
Collapse
|
45
|
Lee CH, Salio M, Napolitani G, Ogg G, Simmons A, Koohy H. Predicting Cross-Reactivity and Antigen Specificity of T Cell Receptors. Front Immunol 2020; 11:565096. [PMID: 33193332 PMCID: PMC7642207 DOI: 10.3389/fimmu.2020.565096] [Citation(s) in RCA: 45] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2020] [Accepted: 09/07/2020] [Indexed: 12/13/2022] Open
Abstract
Adaptive immune recognition is mediated by specific interactions between heterodimeric T cell receptors (TCRs) and their cognate peptide-MHC (pMHC) ligands, and the methods to accurately predict TCR:pMHC interaction would have profound clinical, therapeutic and pharmaceutical applications. Herein, we review recent developments in predicting cross-reactivity and antigen specificity of TCR recognition. We discuss current experimental and computational approaches to investigate cross-reactivity and antigen-specificity of TCRs and highlight how integrating kinetic, biophysical and structural features may offer valuable insights in modeling immunogenicity. We further underscore the close inter-relationship of these two interconnected notions and the need to investigate each in the light of the other for a better understanding of T cell responsiveness for the effective clinical applications.
Collapse
Affiliation(s)
- Chloe H. Lee
- MRC Human Immunology Unit, Medical Research Council (MRC) Weatherall Institute of Molecular Medicine (WIMM), John Radcliffe Hospital, University of Oxford, Oxford, United Kingdom
- MRC WIMM Centre for Computational Biology, Medical Research Council (MRC) Weatherall Institute of Molecular Medicine, John Radcliffe Hospital, University of Oxford, Oxford, United Kingdom
| | - Mariolina Salio
- MRC Human Immunology Unit, Medical Research Council (MRC) Weatherall Institute of Molecular Medicine (WIMM), John Radcliffe Hospital, University of Oxford, Oxford, United Kingdom
| | - Giorgio Napolitani
- MRC Human Immunology Unit, Medical Research Council (MRC) Weatherall Institute of Molecular Medicine (WIMM), John Radcliffe Hospital, University of Oxford, Oxford, United Kingdom
| | - Graham Ogg
- MRC Human Immunology Unit, Medical Research Council (MRC) Weatherall Institute of Molecular Medicine (WIMM), John Radcliffe Hospital, University of Oxford, Oxford, United Kingdom
| | - Alison Simmons
- MRC Human Immunology Unit, Medical Research Council (MRC) Weatherall Institute of Molecular Medicine (WIMM), John Radcliffe Hospital, University of Oxford, Oxford, United Kingdom
- Translational Gastroenterology Unit, John Radcliffe Hospital, Oxford, United Kingdom
| | - Hashem Koohy
- MRC Human Immunology Unit, Medical Research Council (MRC) Weatherall Institute of Molecular Medicine (WIMM), John Radcliffe Hospital, University of Oxford, Oxford, United Kingdom
- MRC WIMM Centre for Computational Biology, Medical Research Council (MRC) Weatherall Institute of Molecular Medicine, John Radcliffe Hospital, University of Oxford, Oxford, United Kingdom
| |
Collapse
|
46
|
Vujovic M, Degn KF, Marin FI, Schaap-Johansen AL, Chain B, Andresen TL, Kaplinsky J, Marcatili P. T cell receptor sequence clustering and antigen specificity. Comput Struct Biotechnol J 2020; 18:2166-2173. [PMID: 32952933 PMCID: PMC7473833 DOI: 10.1016/j.csbj.2020.06.041] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2020] [Revised: 06/25/2020] [Accepted: 06/27/2020] [Indexed: 11/17/2022] Open
Abstract
There has been increasing interest in the role of T cells and their involvement in cancer, autoimmune and infectious diseases. However, the nature of T cell receptor (TCR) epitope recognition at a repertoire level is not yet fully understood. Due to technological advances a plethora of TCR sequences from a variety of disease and treatment settings has become readily available. Current efforts in TCR specificity analysis focus on identifying characteristics in immune repertoires which can explain or predict disease outcome or progression, or can be used to monitor the efficacy of disease therapy. In this context, clustering of TCRs by sequence to reflect biological similarity, and especially to reflect antigen specificity have become of paramount importance. We review the main TCR sequence clustering methods and the different similarity measures they use, and discuss their performance and possible improvement. We aim to provide guidance for non-specialists who wish to use TCR repertoire sequencing for disease tracking, patient stratification or therapy prediction, and to provide a starting point for those aiming to develop novel techniques for TCR annotation through clustering.
Collapse
Affiliation(s)
- Milena Vujovic
- DTU HealthTech, Department of Health Technology, Technical University of Denmark, Ørsteds Plads, Building 345C, DK-2800 Kgs. Lyngby, Denmark
| | - Kristine Fredlund Degn
- DTU HealthTech, Department of Health Technology, Technical University of Denmark, Ørsteds Plads, Building 345C, DK-2800 Kgs. Lyngby, Denmark
| | - Frederikke Isa Marin
- DTU HealthTech, Department of Health Technology, Technical University of Denmark, Ørsteds Plads, Building 345C, DK-2800 Kgs. Lyngby, Denmark
| | - Anna-Lisa Schaap-Johansen
- DTU HealthTech, Department of Health Technology, Technical University of Denmark, Ørsteds Plads, Building 345C, DK-2800 Kgs. Lyngby, Denmark
| | - Benny Chain
- UCL Division of Infection and Immunity, University College London, Wing 3.2, Cruciform Building, Gower Street, London WC1E 6BT, United Kingdom
| | - Thomas Lars Andresen
- DTU HealthTech, Department of Health Technology, Technical University of Denmark, Ørsteds Plads, Building 345C, DK-2800 Kgs. Lyngby, Denmark
| | - Joseph Kaplinsky
- Ludwig Institute for Cancer Research Ltd, University of Oxford, Nuffield Department of Medicine, Old Road Campus Research Building, Roosevelt Drive, Oxford OX3 7DQ, United Kingdom
| | - Paolo Marcatili
- DTU HealthTech, Department of Health Technology, Technical University of Denmark, Ørsteds Plads, Building 345C, DK-2800 Kgs. Lyngby, Denmark
| |
Collapse
|
47
|
Gowthaman R, Pierce BG. TCR3d: The T cell receptor structural repertoire database. Bioinformatics 2020; 35:5323-5325. [PMID: 31240309 PMCID: PMC6954642 DOI: 10.1093/bioinformatics/btz517] [Citation(s) in RCA: 68] [Impact Index Per Article: 13.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2019] [Revised: 05/31/2019] [Accepted: 06/20/2019] [Indexed: 12/31/2022] Open
Abstract
Summary T cell receptors (TCRs) are critical molecules of the adaptive immune system, capable of recognizing diverse antigens, including peptides, lipids and small molecules, and represent a rapidly growing class of therapeutics. Determining the structural and mechanistic basis of TCR targeting of antigens is a major challenge, as each individual has a vast and diverse repertoire of TCRs. Despite shared general recognition modes, diversity in TCR sequence and recognition represents a challenge to predictive modeling and computational techniques being developed to predict antigen specificity and mechanistic basis of TCR targeting. To this end, we have developed the TCR3d database, a resource containing all known TCR structures, with a particular focus on antigen recognition. TCR3d provides key information on antigen binding mode, interface features, loop sequences and germline gene usage. Users can interactively view TCR complex structures, search sequences of interest against known structures and sequences, and download curated datasets of structurally characterized TCR complexes. This database is updated on a weekly basis, and can serve the community as a centralized resource for those studying T cell receptors and their recognition. Availability and implementation The TCR3d database is available at https://tcr3d.ibbr.umd.edu/.
Collapse
Affiliation(s)
- Ragul Gowthaman
- University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, MD, USA.,Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, MD, USA.,University of Maryland Marlene and Stewart Greenebaum Comprehensive Cancer Center, Baltimore, MD, USA
| | - Brian G Pierce
- University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, MD, USA.,Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, MD, USA.,University of Maryland Marlene and Stewart Greenebaum Comprehensive Cancer Center, Baltimore, MD, USA
| |
Collapse
|
48
|
Uncovering the Tumor Antigen Landscape: What to Know about the Discovery Process. Cancers (Basel) 2020; 12:cancers12061660. [PMID: 32585818 PMCID: PMC7352969 DOI: 10.3390/cancers12061660] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2020] [Revised: 06/11/2020] [Accepted: 06/20/2020] [Indexed: 12/14/2022] Open
Abstract
According to the latest available data, cancer is the second leading cause of death, highlighting the need for novel cancer therapeutic approaches. In this context, immunotherapy is emerging as a reliable first-line treatment for many cancers, particularly metastatic melanoma. Indeed, cancer immunotherapy has attracted great interest following the recent clinical approval of antibodies targeting immune checkpoint molecules, such as PD-1, PD-L1, and CTLA-4, that release the brakes of the immune system, thus reviving a field otherwise poorly explored. Cancer immunotherapy mainly relies on the generation and stimulation of cytotoxic CD8 T lymphocytes (CTLs) within the tumor microenvironment (TME), priming T cells and establishing efficient and durable anti-tumor immunity. Therefore, there is a clear need to define and identify immunogenic T cell epitopes to use in therapeutic cancer vaccines. Naturally presented antigens in the human leucocyte antigen-1 (HLA-I) complex on the tumor surface are the main protagonists in evocating a specific anti-tumor CD8+ T cell response. However, the methodologies for their identification have been a major bottleneck for their reliable characterization. Consequently, the field of antigen discovery has yet to improve. The current review is intended to define what are today known as tumor antigens, with a main focus on CTL antigenic peptides. We also review the techniques developed and employed to date for antigen discovery, exploring both the direct elution of HLA-I peptides and the in silico prediction of epitopes. Finally, the last part of the review analyses the future challenges and direction of the antigen discovery field.
Collapse
|
49
|
High-affinity oligoclonal TCRs define effective adoptive T cell therapy targeting mutant KRAS-G12D. Proc Natl Acad Sci U S A 2020; 117:12826-12835. [PMID: 32461371 DOI: 10.1073/pnas.1921964117] [Citation(s) in RCA: 70] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Complete cancer regression occurs in a subset of patients following adoptive T cell therapy (ACT) of ex vivo expanded tumor-infiltrating lymphocytes (TILs). However, the low success rate presents a great challenge to broader clinical application. To provide insight into TIL-based immunotherapy, we studied a successful case of ACT where regression was observed against tumors carrying the hotspot mutation G12D in the KRAS oncogene. Four T cell receptors (TCRs) made up the TIL infusion and recognized two KRAS-G12D neoantigens, a nonamer and a decamer, all restricted by human leukocyte antigen (HLA) C*08:02. Three of them (TCR9a, 9b, and 9c) were nonamer-specific, while one was decamer-specific (TCR10). We show that only mutant G12D but not the wild-type peptides stabilized HLA-C*08:02 due to the formation of a critical anchor salt bridge to HLA-C. Therapeutic TCRs exhibited high affinities, ranging from nanomolar to low micromolar. Intriguingly, TCR binding affinities to HLA-C inversely correlated with their persistence in vivo, suggesting the importance of antigenic affinity in the function of therapeutic T cells. Crystal structures of TCR-HLA-C complexes revealed that TCR9a to 9c recognized G12D nonamer with multiple conserved contacts through shared CDR2β and CDR3α. This allowed CDR3β variation to confer different affinities via a variable HLA-C contact, generating an oligoclonal response. TCR10 recognized an induced and distinct G12D decamer conformation. Thus, this successful case of ACT included oligoclonal TCRs of high affinity recognizing distinct conformations of neoantigens. Our study revealed the potential of a structural approach to inform clinical efforts in targeting KRAS-G12D tumors by immunotherapy and has general implications for T cell-based immunotherapies.
Collapse
|
50
|
Fernández-Quintero ML, Seidler CA, Liedl KR. T-Cell Receptor Variable β Domains Rigidify During Affinity Maturation. Sci Rep 2020; 10:4472. [PMID: 32161287 PMCID: PMC7066139 DOI: 10.1038/s41598-020-61433-0] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2019] [Accepted: 01/30/2020] [Indexed: 11/17/2022] Open
Abstract
We investigated T-cell receptor variable β chains binding to the superantigen staphylococcal enterotoxin C3 (SEC 3) with structure information in different stages of affinity maturation. Metadynamics in combination with molecular dynamics simulations allow to access the micro-to-millisecond timescale and reveal a strong effect of energetically significant mutations on the flexibility of the antigen-binding site. The observed changes in dynamics of the complementarity determining region (CDR) loops, especially the CDR 2, and HV 4 loop on this specific pathway of affinity maturation are reflected in their structural diversity, thermodynamics of conformations and kinetics of structural transitions. In addition, this affinity maturation pathway follows the concept of conformational selection, because even without the presence of the antigen the binding competent state is present in this pre-existing ensemble of conformations. In all stages of this affinity maturation process we observe a link between specificity and reduced flexibility.
Collapse
Affiliation(s)
- Monica L Fernández-Quintero
- Institute of General, Inorganic and Theoretical Chemistry, and Center for Molecular Biosciences Innsbruck (CMBI), University of Innsbruck, Innrain 80-82, A-6020, Innsbruck, Austria
| | - Clarissa A Seidler
- Institute of General, Inorganic and Theoretical Chemistry, and Center for Molecular Biosciences Innsbruck (CMBI), University of Innsbruck, Innrain 80-82, A-6020, Innsbruck, Austria
| | - Klaus R Liedl
- Institute of General, Inorganic and Theoretical Chemistry, and Center for Molecular Biosciences Innsbruck (CMBI), University of Innsbruck, Innrain 80-82, A-6020, Innsbruck, Austria.
| |
Collapse
|