1
|
Taksøe-Vester CA, Mikolaj K, Petersen OBB, Vejlstrup NG, Christensen AN, Feragen A, Nielsen M, Svendsen MBS, Tolsgaard MG. Role of artificial-intelligence-assisted automated cardiac biometrics in prenatal screening for coarctation of aorta. ULTRASOUND IN OBSTETRICS & GYNECOLOGY : THE OFFICIAL JOURNAL OF THE INTERNATIONAL SOCIETY OF ULTRASOUND IN OBSTETRICS AND GYNECOLOGY 2024; 64:36-43. [PMID: 38339776 DOI: 10.1002/uog.27608] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/01/2023] [Revised: 01/31/2024] [Accepted: 01/31/2024] [Indexed: 02/12/2024]
Abstract
OBJECTIVE Although remarkable strides have been made in fetal medicine and the prenatal diagnosis of congenital heart disease, around 60% of newborns with isolated coarctation of the aorta (CoA) are not identified prior to birth. The prenatal detection of CoA has been shown to have a notable impact on survival rates of affected infants. To this end, implementation of artificial intelligence (AI) in fetal ultrasound may represent a groundbreaking advance. We aimed to investigate whether the use of automated cardiac biometric measurements with AI during the 18-22-week anomaly scan would enhance the identification of fetuses that are at risk of developing CoA. METHODS We developed an AI model capable of identifying standard cardiac planes and conducting automated cardiac biometric measurements. Our data consisted of pregnancy ultrasound image and outcome data spanning from 2008 to 2018 and collected from four distinct regions in Denmark. Cases with a postnatal diagnosis of CoA were paired with healthy controls in a ratio of 1:100 and matched for gestational age within 2 days. Cardiac biometrics obtained from the four-chamber and three-vessel views were included in a logistic regression-based prediction model. To assess its predictive capabilities, we assessed sensitivity and specificity on receiver-operating-characteristics (ROC) curves. RESULTS At the 18-22-week scan, the right ventricle (RV) area and length, left ventricle (LV) diameter and the ratios of RV/LV areas and main pulmonary artery/ascending aorta diameters showed significant differences, with Z-scores above 0.7, when comparing subjects with a postnatal diagnosis of CoA (n = 73) and healthy controls (n = 7300). Using logistic regression and backward feature selection, our prediction model had an area under the ROC curve of 0.96 and a specificity of 88.9% at a sensitivity of 90.4%. CONCLUSIONS The integration of AI technology with automated cardiac biometric measurements obtained during the 18-22-week anomaly scan has the potential to enhance substantially the performance of screening for fetal CoA and subsequently the detection rate of CoA. Future research should clarify how AI technology can be used to aid in the screening and detection of congenital heart anomalies to improve neonatal outcomes. © 2024 The Authors. Ultrasound in Obstetrics & Gynecology published by John Wiley & Sons Ltd on behalf of International Society of Ultrasound in Obstetrics and Gynecology.
Collapse
|
2
|
Yan Z, Kim K, Kim H, Ha B, Gambiez A, Bennett J, de Almeida Mendes MF, Trevizani R, Mahita J, Richardson E, Marrama D, Blazeska N, Koşaloğlu-Yalçın Z, Nielsen M, Sette A, Peters B, Greenbaum JA. Next-generation IEDB tools: a platform for epitope prediction and analysis. Nucleic Acids Res 2024:gkae407. [PMID: 38783079 DOI: 10.1093/nar/gkae407] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2024] [Revised: 04/24/2024] [Accepted: 05/01/2024] [Indexed: 05/25/2024] Open
Abstract
The Next-Generation (NG) IEDB Tools website (https://nextgen-tools.iedb.org) provides users with a redesigned interface to many of the algorithms for epitope prediction and analysis that were originally released on the legacy IEDB Tools website. The initial release focuses on consolidation of all tools related to HLA class I epitopes (MHC binding, elution, immunogenicity, and processing), making all of these predictions accessible from a single application and allowing for their simultaneous execution with minimal user inputs. Additionally, the PEPMatch tool for identifying highly similar epitopes in a set of curated proteomes, as well as a tool for epitope clustering, are available on the site. The NG Tools site allows users to build data pipelines by sending the output of one tool as input for the next. Over the next several years, all pre-existing IEDB Tools, and any newly developed tools, will be integrated into this new site. Here we describe the philosophy behind the redesign and demonstrate the utility and productivity enhancements that are enabled by the new interface.
Collapse
|
3
|
Egholm Bruun Jensen E, Reynisson B, Barra C, Nielsen M. New light on the HLA-DR immunopeptidomic landscape. J Leukoc Biol 2024; 115:913-925. [PMID: 38214568 PMCID: PMC11057780 DOI: 10.1093/jleuko/qiae007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2022] [Revised: 12/18/2023] [Accepted: 12/27/2023] [Indexed: 01/13/2024] Open
Abstract
The set of peptides processed and presented by major histocompatibility complex class II molecules defines the immunopeptidome, and its characterization holds keys to understanding essential properties of the immune system. High-throughput mass spectrometry (MS) techniques enable interrogation of the diversity and complexity of the immunopeptidome at an unprecedented scale. Here, we analyzed a large set of MS immunopeptidomics data from 40 donors, 221 samples, covering 30 unique HLA-DR molecules. We identified likely co-immunoprecipitated HLA-DR irrelevant contaminants using state-of-the-art prediction methods and unveiled novel light on the properties of HLA antigen processing and presentation. The ligandome (HLA binders) was enriched in 15-mer peptides, and the contaminome (nonbinders) in longer peptides. Classification of singletons and nested sets showed that the first were enriched in contaminants. Investigating the source protein location of ligands revealed that only contaminants shared a positional bias. Regarding subcellular localization, nested peptides were found to be predominantly of endolysosomal origin, whereas singletons shared an equal distribution between the cytosolic and endolysosomal origin. According to antigen-processing signatures, no significant differences were observed between the cytosolic and endolysosomal ligands. Further, the sensitivity of MS immunopeptidomics was investigated by analyzing overlap and saturation between biological MS replicas, concluding that at least 5 replicas are needed to identify 80% of the immunopeptidome. Moreover, the overlap in immunopeptidome between donors was found to be very low both in terms of peptides and source proteins, the latter indicating a critical HLA bias in the antigen sampling in the HLA antigen presentation. Finally, the complementarity between MS and in silico approaches for comprehensively sampling the immunopeptidome was demonstrated.
Collapse
|
4
|
Borch A, Carri I, Reynisson B, Alvarez HMG, Munk KK, Montemurro A, Kristensen NP, Tvingsholm SA, Holm JS, Heeke C, Moss KH, Hansen UK, Schaap-Johansen AL, Bagger FO, de Lima VAB, Rohrberg KS, Funt SA, Donia M, Svane IM, Lassen U, Barra C, Nielsen M, Hadrup SR. IMPROVE: a feature model to predict neoepitope immunogenicity through broad-scale validation of T-cell recognition. Front Immunol 2024; 15:1360281. [PMID: 38633261 PMCID: PMC11021644 DOI: 10.3389/fimmu.2024.1360281] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2023] [Accepted: 03/07/2024] [Indexed: 04/19/2024] Open
Abstract
Background Mutation-derived neoantigens are critical targets for tumor rejection in cancer immunotherapy, and better tools for neoepitope identification and prediction are needed to improve neoepitope targeting strategies. Computational tools have enabled the identification of patient-specific neoantigen candidates from sequencing data, but limited data availability has hindered their capacity to predict which of the many neoepitopes will most likely give rise to T cell recognition. Method To address this, we make use of experimentally validated T cell recognition towards 17,500 neoepitope candidates, with 467 being T cell recognized, across 70 cancer patients undergoing immunotherapy. Results We evaluated 27 neoepitope characteristics, and created a random forest model, IMPROVE, to predict neoepitope immunogenicity. The presence of hydrophobic and aromatic residues in the peptide binding core were the most important features for predicting neoepitope immunogenicity. Conclusion Overall, IMPROVE was found to significantly advance the identification of neoepitopes compared to other current methods.
Collapse
|
5
|
Jensen MF, Nielsen M. Enhancing TCR specificity predictions by combined pan- and peptide-specific training, loss-scaling, and sequence similarity integration. eLife 2024; 12:RP93934. [PMID: 38437160 PMCID: PMC10942633 DOI: 10.7554/elife.93934] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/06/2024] Open
Abstract
Predicting the interaction between Major Histocompatibility Complex (MHC) class I-presented peptides and T-cell receptors (TCR) holds significant implications for vaccine development, cancer treatment, and autoimmune disease therapies. However, limited paired-chain TCR data, skewed towards well-studied epitopes, hampers the development of pan-specific machine-learning (ML) models. Leveraging a larger peptide-TCR dataset, we explore various alterations to the ML architectures and training strategies to address data imbalance. This leads to an overall improved performance, particularly for peptides with scant TCR data. However, challenges persist for unseen peptides, especially those distant from training examples. We demonstrate that such ML models can be used to detect potential outliers, which when removed from training, leads to augmented performance. Integrating pan-specific and peptide-specific models alongside with similarity-based predictions, further improves the overall performance, especially when a low false positive rate is desirable. In the context of the IMMREP22 benchmark, this modeling framework attained state-of-the-art performance. Moreover, combining these strategies results in acceptable predictive accuracy for peptides characterized with as little as 15 positive TCRs. This observation places great promise on rapidly expanding the peptide covering of the current models for predicting TCR specificity. The NetTCR 2.2 model incorporating these advances is available on GitHub (https://github.com/mnielLab/NetTCR-2.2) and as a web server at https://services.healthtech.dtu.dk/services/NetTCR-2.2/.
Collapse
|
6
|
Wan YTR, Koşaloğlu‐Yalçın Z, Peters B, Nielsen M. A large-scale study of peptide features defining immunogenicity of cancer neo-epitopes. NAR Cancer 2024; 6:zcae002. [PMID: 38288446 PMCID: PMC10823584 DOI: 10.1093/narcan/zcae002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2023] [Revised: 01/03/2024] [Accepted: 01/12/2024] [Indexed: 01/31/2024] Open
Abstract
Accurate prediction of immunogenicity for neo-epitopes arising from a cancer associated mutation is a crucial step in many bioinformatics pipelines that predict outcome of checkpoint blockade treatments or that aim to design personalised cancer immunotherapies and vaccines. In this study, we performed a comprehensive analysis of peptide features relevant for prediction of immunogenicity using the Cancer Epitope Database and Analysis Resource (CEDAR), a curated database of cancer epitopes with experimentally validated immunogenicity annotations from peer-reviewed publications. The developed model, ICERFIRE (ICore-based Ensemble Random Forest for neo-epitope Immunogenicity pREdiction), extracts the predicted ICORE from the full neo-epitope as input, i.e. the nested peptide with the highest predicted major histocompatibility complex (MHC) binding potential combined with its predicted likelihood of antigen presentation (%Rank). Key additional features integrated into the model include assessment of the BLOSUM mutation score of the neo-epitope, and antigen expression levels of the wild-type counterpart which is often reflecting a neo-epitope's abundance. We demonstrate improved and robust performance of ICERFIRE over existing immunogenicity and epitope prediction models, both in cross-validation and on external validation datasets.
Collapse
|
7
|
Høie MH, Gade FS, Johansen J, Würtzen C, Winther O, Nielsen M, Marcatili P. DiscoTope-3.0: improved B-cell epitope prediction using inverse folding latent representations. Front Immunol 2024; 15:1322712. [PMID: 38390326 PMCID: PMC10882062 DOI: 10.3389/fimmu.2024.1322712] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2023] [Accepted: 01/08/2024] [Indexed: 02/24/2024] Open
Abstract
Accurate computational identification of B-cell epitopes is crucial for the development of vaccines, therapies, and diagnostic tools. However, current structure-based prediction methods face limitations due to the dependency on experimentally solved structures. Here, we introduce DiscoTope-3.0, a markedly improved B-cell epitope prediction tool that innovatively employs inverse folding structure representations and a positive-unlabelled learning strategy, and is adapted for both solved and predicted structures. Our tool demonstrates a considerable improvement in performance over existing methods, accurately predicting linear and conformational epitopes across multiple independent datasets. Most notably, DiscoTope-3.0 maintains high predictive performance across solved, relaxed and predicted structures, alleviating the need for experimental structures and extending the general applicability of accurate B-cell epitope prediction by 3 orders of magnitude. DiscoTope-3.0 is made widely accessible on two web servers, processing over 100 structures per submission, and as a downloadable package. In addition, the servers interface with RCSB and AlphaFoldDB, facilitating large-scale prediction across over 200 million cataloged proteins. DiscoTope-3.0 is available at: https://services.healthtech.dtu.dk/service.php?DiscoTope-3.0.
Collapse
|
8
|
Yankilevich P, Nazerai L, Willis SC, Schmiegelow K, De Zio D, Nielsen M. An analysis pipeline for understanding 6-thioguanine effects on a mouse tumour genome. Cancer Immunol Immunother 2024; 73:22. [PMID: 38279992 PMCID: PMC10821971 DOI: 10.1007/s00262-023-03610-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2023] [Accepted: 12/07/2023] [Indexed: 01/29/2024]
Abstract
Mouse tumour models are extensively used as a pre-clinical research tool in the field of oncology, playing an important role in anticancer drugs discovery. Accordingly, in cancer genomics research, the demand for next-generation sequencing (NGS) is increasing, and consequently, the need for data analysis pipelines is likewise growing. Most NGS data analysis solutions to date do not support mouse data or require highly specific configuration for their use. Here, we present a genome analysis pipeline for mouse tumour NGS data including the whole-genome sequence (WGS) data analysis flow for somatic variant discovery, and the RNA-seq data flow for differential expression, functional analysis and neoantigen prediction. The pipeline is based on standards and best practices and integrates mouse genome references and annotations. In a recent study, the pipeline was applied to demonstrate the efficacy of low dose 6-thioguanine (6TG) treatment on low-mutation melanoma in a pre-clinical mouse model. Here, we further this study and describe in detail the pipeline and the results obtained in terms of tumour mutational burden (TMB) and number of predicted neoantigens, and correlate these with 6TG effects on tumour volume. Our pipeline was expanded to include a neoantigen analysis, resulting in neopeptide prediction and MHC class I antigen presentation evaluation. We observed that the number of predicted neoepitopes were more accurate indicators of tumour immune control than TMB. In conclusion, this study demonstrates the usability of the proposed pipeline, and suggests it could be an essential robust genome analysis platform for future mouse genomic analysis.
Collapse
|
9
|
Liao H, Barra C, Zhou Z, Peng X, Woodhouse I, Tailor A, Parker R, Carré A, Borrow P, Hogan MJ, Paes W, Eisenlohr LC, Mallone R, Nielsen M, Ternette N. MARS an improved de novo peptide candidate selection method for non-canonical antigen target discovery in cancer. Nat Commun 2024; 15:661. [PMID: 38253617 PMCID: PMC10803737 DOI: 10.1038/s41467-023-44460-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2022] [Accepted: 12/14/2023] [Indexed: 01/24/2024] Open
Abstract
Understanding the nature and extent of non-canonical human leukocyte antigen (HLA) presentation in tumour cells is a priority for target antigen discovery for the development of next generation immunotherapies in cancer. We here employ a de novo mass spectrometric sequencing approach with a refined, MHC-centric analysis strategy to detect non-canonical MHC-associated peptides specific to cancer without any prior knowledge of the target sequence from genomic or RNA sequencing data. Our strategy integrates MHC binding rank, Average local confidence scores, and peptide Retention time prediction for improved de novo candidate Selection; culminating in the machine learning model MARS. We benchmark our model on a large synthetic peptide library dataset and reanalysis of a published dataset of high-quality non-canonical MHC-associated peptide identifications in human cancer. We achieve almost 2-fold improvement for high quality spectral assignments in comparison to de novo sequencing alone with an estimated accuracy of above 85.7% when integrated with a stepwise peptide sequence mapping strategy. Finally, we utilize MARS to detect and validate lncRNA-derived peptides in human cervical tumour resections, demonstrating its suitability to discover novel, immunogenic, non-canonical peptide sequences in primary tumour tissue.
Collapse
|
10
|
Terlouw D, Boot A, Ducarmon QR, Nooij S, Jessurun MA, van Leerdam ME, Tops CM, Langers AMJ, Morreau H, van Wezel T, Nielsen M. Colibactin mutational signatures in NTHL1 tumor syndrome and MUTYH associated polyposis patients. Genes Chromosomes Cancer 2024; 63:e23208. [PMID: 37795928 DOI: 10.1002/gcc.23208] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2023] [Revised: 09/22/2023] [Accepted: 09/25/2023] [Indexed: 10/06/2023] Open
Abstract
Polyketide synthase (pks) island harboring Escherichia coli are, under the right circumstances, able to produce the genotoxin colibactin. Colibactin is a risk factor for the development of colorectal cancer and associated with mutational signatures SBS88 and ID18. This study explores colibactin-associated mutational signatures in biallelic NTHL1 and MUTYH patients. Targeted Next Generation Sequencing (NGS) was performed on colorectal adenomas and carcinomas of one biallelic NTHL and 12 biallelic MUTYH patients. Additional fecal metagenomics and genome sequencing followed by mutational signature analysis was conducted for the NTHL1 patient. Targeted NGS of the NTHL1 patient showed somatic APC variants fitting SBS88 which was confirmed using WGS. Furthermore, fecal metagenomics revealed pks genes. Also, in 1 out of 11 MUTYH patient a somatic variant was detected fitting SBS88. This report shows that colibactin may influence development of colorectal neoplasms in predisposed patients.
Collapse
|
11
|
Barra C, Nilsson JB, Saksager A, Carri I, Deleuran S, Garcia Alvarez HM, Høie MH, Li Y, Clifford JN, Wan YTR, Moreta LS, Nielsen M. In Silico Tools for Predicting Novel Epitopes. Methods Mol Biol 2024; 2813:245-280. [PMID: 38888783 DOI: 10.1007/978-1-0716-3890-3_17] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/20/2024]
Abstract
Identifying antigens within a pathogen is a critical task to develop effective vaccines and diagnostic methods, as well as understanding the evolution and adaptation to host immune responses. Historically, antigenicity was studied with experiments that evaluate the immune response against selected fragments of pathogens. Using this approach, the scientific community has gathered abundant information regarding which pathogenic fragments are immunogenic. The systematic collection of this data has enabled unraveling many of the fundamental rules underlying the properties defining epitopes and immunogenicity, and has resulted in the creation of a large panel of immunologically relevant predictive (in silico) tools. The development and application of such tools have proven to accelerate the identification of novel epitopes within biomedical applications reducing experimental costs. This chapter introduces some basic concepts about MHC presentation, T cell and B cell epitopes, the experimental efforts to determine those, and focuses on state-of-the-art methods for epitope prediction, highlighting their strengths and limitations, and catering instructions for their rational use.
Collapse
|
12
|
Friborg J, Jensen K, Eriksen JG, Samsøe E, Maare C, Farhadi M, Sibolt P, Nielsen M, Andersen M, Holm AIS, Skyt P, Smulders B, Johansen J, Overgaard J, Grau C, Hansen CR. Considerations for study design in the DAHANCA 35 trial of protons versus photons for head and neck cancer. Radiother Oncol 2024; 190:109958. [PMID: 37871751 DOI: 10.1016/j.radonc.2023.109958] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2022] [Revised: 08/10/2023] [Accepted: 09/26/2023] [Indexed: 10/25/2023]
Abstract
Proton radiotherapy offers a dosimetric advantage compared to photon therapy in sparing normal tissue, but the clinical evidence for toxicity reductions in the treatment of head and neck cancer is limited. The Danish Head and Neck Cancer Group (DAHANCA) has initiated the DAHANCA 35 randomised trial to clarify the value of proton therapy (NCT04607694). The DAHANCA 35 trial is performed in an enriched population of patients selected by an anticipated benefit of proton therapy to reduce the risk of late dysphagia or xerostomia based on normal tissue complication probability (NTCP) modelling. We present our considerations on the trial design and a test of the selection procedure conducted before initiating the randomised study.
Collapse
|
13
|
Marrama D, Chronister WD, Westernberg L, Vita R, Koşaloğlu-Yalçın Z, Sette A, Nielsen M, Greenbaum JA, Peters B. PEPMatch: a tool to identify short peptide sequence matches in large sets of proteins. BMC Bioinformatics 2023; 24:485. [PMID: 38110863 PMCID: PMC10726511 DOI: 10.1186/s12859-023-05606-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2023] [Accepted: 12/06/2023] [Indexed: 12/20/2023] Open
Abstract
BACKGROUND Numerous tools exist for biological sequence comparisons and search. One case of particular interest for immunologists is finding matches for linear peptide T cell epitopes, typically between 8 and 15 residues in length, in a large set of protein sequences. Both to find exact matches or matches that account for residue substitutions. The utility of such tools is critical in applications ranging from identifying conservation across viral epitopes, identifying putative epitope targets for allergens, and finding matches for cancer-associated neoepitopes to examine the role of tolerance in tumor recognition. RESULTS We defined a set of benchmarks that reflect the different practical applications of short peptide sequence matching. We evaluated a suite of existing methods for speed and recall and developed a new tool, PEPMatch. The tool uses a deterministic k-mer mapping algorithm that preprocesses proteomes before searching, achieving a 50-fold increase in speed over methods such as the Basic Local Alignment Search Tool (BLAST) without compromising recall. PEPMatch's code and benchmark datasets are publicly available. CONCLUSIONS PEPMatch offers significant speed and recall advantages for peptide sequence matching. While it is of immediate utility for immunologists, the developed benchmarking framework also provides a standard against which future tools can be evaluated for improvements. The tool is available at https://nextgen-tools.iedb.org , and the source code can be found at https://github.com/IEDB/PEPMatch .
Collapse
|
14
|
Nielsen M, Monberg T, Sundvold V, Albieri B, Hovgaard D, Petersen MM, Krarup-Hansen A, Met Ö, Camilio K, Clancy T, Stratford R, Sveinbjornsson B, Rekdal Ø, Junker N, Svane IM. LTX-315 and adoptive cell therapy using tumor-infiltrating lymphocytes generate tumor specific T cells in patients with metastatic soft tissue sarcoma. Oncoimmunology 2023; 13:2290900. [PMID: 38125722 PMCID: PMC10732595 DOI: 10.1080/2162402x.2023.2290900] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Accepted: 11/30/2023] [Indexed: 12/23/2023] Open
Abstract
LTX-315 is an oncolytic peptide that elicits both local and systemic immune responses upon intratumoral injection. In the present pilot trial, we treated patients with metastatic soft tissue sarcoma with the combination of LTX-315 and adoptive T-cell therapy using in vitro expanded tumor-infiltrating lymphocytes. Six heavily pretreated patients were included in the trial and treated with LTX-315 of which four patients proceeded to adoptive T-cell therapy. Overall, the treatment was considered safe with only expected and manageable toxicity. The best overall clinical response was stable disease for 208 days, and in this patient, we detected tumor-reactive T cells in the blood that lasted until disease progression. In three patients T-cell reactivity against in silico predicted neoantigens was demonstrated. Additionally, de novo T-cell clones were generated and expanded in the blood following LTX-315 injections. In conclusion, this pilot study provides proof that it is feasible to combine LTX-315 and adoptive T-cell therapy, and that this treatment can induce systemic immune responses that resulted in stabilization of the disease in sarcoma patients with otherwise progressive disease. Further optimization of the treatment protocol is warranted to increase clinical activity. ClinicalTrials.gov Identifier: NCT03725605.
Collapse
|
15
|
Nilsson JB, Kaabinejadian S, Yari H, Kester MG, van Balen P, Hildebrand WH, Nielsen M. Accurate prediction of HLA class II antigen presentation across all loci using tailored data acquisition and refined machine learning. SCIENCE ADVANCES 2023; 9:eadj6367. [PMID: 38000035 PMCID: PMC10672173 DOI: 10.1126/sciadv.adj6367] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/07/2023] [Accepted: 10/25/2023] [Indexed: 11/26/2023]
Abstract
Accurate prediction of antigen presentation by human leukocyte antigen (HLA) class II molecules is crucial for rational development of immunotherapies and vaccines targeting CD4+ T cell activation. So far, most prediction methods for HLA class II antigen presentation have focused on HLA-DR because of limited availability of immunopeptidomics data for HLA-DQ and HLA-DP while not taking into account alternative peptide binding modes. We present an update to the NetMHCIIpan prediction method, which closes the performance gap between all three HLA class II loci. We accomplish this by first integrating large immunopeptidomics datasets describing the HLA class II specificity space across all loci using a refined machine learning framework that accommodates inverted peptide binders. Next, we apply targeted immunopeptidomics assays to generate data that covers additional HLA-DP specificities. The final method, NetMHCIIpan-4.3, achieves high accuracy and molecular coverage across all HLA class II allotypes.
Collapse
|
16
|
Foos G, Blazeska N, Nielsen M, Carter H, Kosaloglu-Yalcin Z, Peters B, Sette A. A meta-analysis of epitopes in prostate-specific antigens identifies opportunities and knowledge gaps. Hum Immunol 2023; 84:578-589. [PMID: 37679223 PMCID: PMC11017785 DOI: 10.1016/j.humimm.2023.08.145] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2023] [Revised: 08/16/2023] [Accepted: 08/21/2023] [Indexed: 09/09/2023]
Abstract
BACKGROUND The Cancer Epitope Database and Analysis Resource (CEDAR) is a newly developed repository of cancer epitope data from peer-reviewed publications, which includes epitope-specific T cell, antibody, and MHC ligand assays. Here we focus on prostate cancer as our first cancer category to demonstrate the capabilities of CEDAR, and to shed light on the advances of epitope-related prostate cancer research. RESULTS The meta-analysis focused on a subset of data describing epitopes from 8 prostate-specific (PS) antigens. A total of 460 epitopes were associated with these proteins, 187 T cell, 109B cell, and 271 MHC ligand epitopes. The number of epitopes was not correlated with the length of the protein; however, we found a significant positive correlation between the number of references per specific PS antigen and the number of reported epitopes. Forty-four different class I and 27 class II restrictions were found, with the most epitopes described for HLA-A*02:01 and HLA-DRB1*01:01. Cytokine assays were mostly limited to IFNg assays and a very limited number of tetramer assays were performed. Monoclonal and polyclonal B cell responses were balanced, with the highest number of epitopes studied in ELISA/Western blot assays. Additionally, epitopes were generically described as associated with prostate cancer, with little granularity specifying diseases state. We found that in vivo and tumor recognition assays were sparse, and the number of epitopes with annotated B/T cell receptor information were limited. Potential immunodominant regions were identified by the use of the ImmunomeBrowser tool. CONCLUSION CEDAR provides a comprehensive repository of epitopes related to prostate-specific antigens. This inventory of epitope data with its wealth of searchable T cell, B cell and MHC ligand information provides a useful tool for the scientific community. At the same time, we identify significant knowledge gaps that could be addressed by experimental analysis.
Collapse
|
17
|
Li Y, Sackett PW, Nielsen M, Barra C. NetAllergen, a random forest model integrating MHC-II presentation propensity for improved allergenicity prediction. BIOINFORMATICS ADVANCES 2023; 3:vbad151. [PMID: 37901344 PMCID: PMC10603389 DOI: 10.1093/bioadv/vbad151] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/01/2023] [Revised: 09/28/2023] [Accepted: 10/13/2023] [Indexed: 10/31/2023]
Abstract
Motivation Allergy is a pathological immune reaction towards innocuous protein antigens. Although only a narrow fraction of plant or animal proteins induce allergy, atopic disorders affect millions of children and adults and cost billions in healthcare systems worldwide. In silico predictors can aid in the development of more innocuous food sources. Previous allergenicity predictors used sequence similarity, common structural domains, and amino acid physicochemical features. However, these predictors strongly rely on sequence similarity to known allergens and fail to predict protein allergenicity accurately when similarity diminishes. Results To overcome these limitations, we collected allergens from AllergenOnline, a curated database of IgE-inducing allergens, carefully removed allergen redundancy with a novel protein partitioning pipeline, and developed a new allergen prediction method, introducing MHC presentation propensity as a novel feature. NetAllergen outperformed a sequence similarity-based BLAST baseline approach, and previous allergenicity predictor AlgPred 2 when similarity to known allergens is limited. Availability and implementation The web service NetAllergen and the datasets are available at https://services.healthtech.dtu.dk/services/NetAllergen-1.0/.
Collapse
|
18
|
Brincker M, Jensen I, Rechner LA, Schut DA, Johansen TS, Nielsen M, Thomsen JB. Multi-center comparison between proton and photon plans for mediastinal lymphomas. Acta Oncol 2023; 62:1251-1255. [PMID: 37624751 DOI: 10.1080/0284186x.2023.2251089] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2023] [Accepted: 08/09/2023] [Indexed: 08/27/2023]
|
19
|
Montemurro A, Povlsen HR, Jessen LE, Nielsen M. Benchmarking data-driven filtering for denoising of TCRpMHC single-cell data. Sci Rep 2023; 13:16147. [PMID: 37752190 PMCID: PMC10522655 DOI: 10.1038/s41598-023-43048-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2023] [Accepted: 09/18/2023] [Indexed: 09/28/2023] Open
Abstract
Pairing of the T cell receptor (TCR) with its cognate peptide-MHC (pMHC) is a cornerstone in T cell-mediated immunity. Recently, single-cell sequencing coupled with DNA-barcoded MHC multimer staining has enabled high-throughput studies of T cell specificities. However, the immense variability of TCR-pMHC interactions combined with the relatively low signal-to-noise ratio in the data generated using current technologies are complicating these studies. Several approaches have been proposed for denoising single-cell TCR-pMHC specificity data. Here, we present a benchmark evaluating two such denoising methods, ICON and ITRAP. We applied and evaluated the methods on publicly available immune profiling data provided by 10x Genomics. We find that both methods identified approximately 75% of the raw data as noise. We analyzed both internal metrics developed for the purpose and performance on independent data using machine learning methods trained on the raw and denoised 10x data. We find an increased signal-to-noise ratio comparing the denoised to the raw data for both methods, and demonstrate an overall superior performance of the ITRAP method in terms of both data consistency and performance. In conclusion, this study demonstrates that Improving the data quality from high throughput studies of TCRpMHC-specificity by denoising is paramount in increasing our understanding of T cell-mediated immunity.
Collapse
|
20
|
Eltschkner S, Mellinger S, Buus S, Nielsen M, Paulsson KM, Lindkvist-Petersson K, Westerdahl H. The structure of songbird MHC class I reveals antigen binding that is flexible at the N-terminus and static at the C-terminus. Front Immunol 2023; 14:1209059. [PMID: 37483599 PMCID: PMC10360169 DOI: 10.3389/fimmu.2023.1209059] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2023] [Accepted: 06/08/2023] [Indexed: 07/25/2023] Open
Abstract
Long-distance migratory animals such as birds and bats have evolved to withstand selection imposed by pathogens across the globe, and pathogen richness is known to be particularly high in tropical regions. Immune genes, so-called Major Histocompatibility Complex (MHC) genes, are highly duplicated in songbirds compared to other vertebrates, and this high MHC diversity has been hypothesised to result in a unique adaptive immunity. To understand the rationale behind the evolution of the high MHC genetic diversity in songbirds, we determined the structural properties of an MHC class I protein, Acar3, from a long-distance migratory songbird, the great reed warbler Acrocephalus arundinaceus (in short: Acar). The structure of Acar3 was studied in complex with pathogen-derived antigens and shows an overall antigen presentation similar to human MHC class I. However, the peptides bound to Acar3 display an unusual conformation: Whereas the N-terminal ends of the peptides display enhanced flexibility, the conformation of their C-terminal halves is rather static. This uncommon peptide-binding mode in Acar3 is facilitated by a central Arg residue within the peptide-binding groove that fixes the backbone of the peptide at its central position, and potentially permits successful interactions between MHC class I and innate immune receptors. Our study highlights the importance of investigating the immune system of wild animals, such as birds and bats, to uncover unique immune mechanisms which may neither exist in humans nor in model organisms.
Collapse
|
21
|
Povlsen HR, Bentzen AK, Kadivar M, Jessen LE, Hadrup SR, Nielsen M. Improved T cell receptor antigen pairing through data-driven filtering of sequencing information from single cells. eLife 2023; 12:e81810. [PMID: 37133356 PMCID: PMC10156162 DOI: 10.7554/elife.81810] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2022] [Accepted: 03/13/2023] [Indexed: 05/04/2023] Open
Abstract
Novel single-cell-based technologies hold the promise of matching T cell receptor (TCR) sequences with their cognate peptide-MHC recognition motif in a high-throughput manner. Parallel capture of TCR transcripts and peptide-MHC is enabled through the use of reagents labeled with DNA barcodes. However, analysis and annotation of such single-cell sequencing (SCseq) data are challenged by dropout, random noise, and other technical artifacts that must be carefully handled in the downstream processing steps. We here propose a rational, data-driven method termed ITRAP (improved T cell Receptor Antigen Paring) to deal with these challenges, filtering away likely artifacts, and enable the generation of large sets of TCR-pMHC sequence data with a high degree of specificity and sensitivity, thus outputting the most likely pMHC target per T cell. We have validated this approach across 10 different virus-specific T cell responses in 16 healthy donors. Across these samples, we have identified up to 1494 high-confident TCR-pMHC pairs derived from 4135 single cells.
Collapse
|
22
|
Nilsson JB, Kaabinejadian S, Yari H, Peters B, Barra C, Gragert L, Hildebrand W, Nielsen M. Machine learning reveals limited contribution of trans-only encoded variants to the HLA-DQ immunopeptidome. Commun Biol 2023; 6:442. [PMID: 37085710 PMCID: PMC10121683 DOI: 10.1038/s42003-023-04749-7] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2022] [Accepted: 03/23/2023] [Indexed: 04/23/2023] Open
Abstract
Human leukocyte antigen (HLA) class II antigen presentation is key for controlling and triggering T cell immune responses. HLA-DQ molecules, which are believed to play a major role in autoimmune diseases, are heterodimers that can be formed as both cis and trans variants depending on whether the α- and β-chains are encoded on the same (cis) or opposite (trans) chromosomes. So far, limited progress has been made for predicting HLA-DQ antigen presentation. In addition, the contribution of trans-only variants (i.e. variants not observed in the population as cis) in shaping the HLA-DQ immunopeptidome remains largely unresolved. Here, we seek to address these issues by integrating state-of-the-art immunoinformatics data mining models with large volumes of high-quality HLA-DQ specific mass spectrometry immunopeptidomics data. The analysis demonstrates highly improved predictive power and molecular coverage for models trained including these novel HLA-DQ data. More importantly, investigating the role of trans-only HLA-DQ variants reveals a limited to no contribution to the overall HLA-DQ immunopeptidome. In conclusion, this study furthers our understanding of HLA-DQ specificities and casts light on the relative role of cis versus trans-only HLA-DQ variants in the HLA class II antigen presentation space. The developed method, NetMHCIIpan-4.2, is available at https://services.healthtech.dtu.dk/services/NetMHCIIpan-4.2 .
Collapse
|
23
|
Nazerai L, Willis SC, Yankilevich P, Di Leo L, Bosisio FM, Frias A, Bertolotto C, Nersting J, Thastrup M, Buus S, Thomsen AR, Nielsen M, Rohrberg KS, Schmiegelow K, De Zio D. Thiopurine 6TG treatment increases tumor immunogenicity and response to immune checkpoint blockade. Oncoimmunology 2022; 12:2158610. [PMID: 36545256 PMCID: PMC9762757 DOI: 10.1080/2162402x.2022.2158610] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
Immune-checkpoint inhibitors (ICI) are highly effective in reinvigorating T cells to attack cancer. Nevertheless, a large subset of patients fails to benefit from ICI, partly due to lack of the cancer neoepitopes necessary to trigger an immune response. In this study, we used the thiopurine 6-thioguanine (6TG) to induce random mutations and thus increase the level of neoepitopes presented by tumor cells. Thiopurines are prodrugs which are converted into thioguanine nucleotides that are incorporated into DNA (DNA-TG), where they can induce mutation through single nucleotide mismatching. In a pre-clinical mouse model of a mutation-low melanoma cell line, we demonstrated that 6TG induced clinical-grade DNA-TG integration resulting in an improved tumor control that was strongly T cell dependent. 6TG exposure increased the tumor mutational burden, without affecting tumor cell proliferation and cell death. Moreover, 6TG treatment re-shaped the tumor microenvironment by increasing T and NK immune cells, making the tumors more responsive to immune-checkpoint blockade. We further validated that 6TG exposure improved tumor control in additional mouse models of melanoma. These findings have paved the way for a phase I/II clinical trial that explores whether treatment with thiopurines can increase the proportion of otherwise treatment-resistant cancer patients who may benefit from ICI therapy (NCT05276284).
Collapse
|
24
|
Montemurro A, Jessen LE, Nielsen M. NetTCR-2.1: Lessons and guidance on how to develop models for TCR specificity predictions. Front Immunol 2022; 13:1055151. [PMID: 36561755 PMCID: PMC9763291 DOI: 10.3389/fimmu.2022.1055151] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2022] [Accepted: 11/22/2022] [Indexed: 12/12/2022] Open
Abstract
T cell receptors (TCR) define the specificity of T cells and are responsible for their interaction with peptide antigen targets presented in complex with major histocompatibility complex (MHC) molecules. Understanding the rules underlying this interaction hence forms the foundation for our understanding of basic adaptive immunology. Over the last decade, efforts have been dedicated to developing assays for high throughput identification of peptide-specific TCRs. Based on such data, several computational methods have been proposed for predicting the TCR-pMHC interaction. The general conclusion from these studies is that the prediction of TCR interactions with MHC-peptide complexes remains highly challenging. Several reasons form the basis for this including scarcity and quality of data, and ill-defined modeling objectives imposed by the high redundancy of the available data. In this work, we propose a framework for dealing with this redundancy, allowing us to address essential questions related to the modeling of TCR specificity including the use of peptide- versus pan-specific models, how to best define negative data, and the performance impact of integrating of CDR1 and 2 loops. Further, we illustrate how and why it is strongly recommended to include simple similarity-based modeling approaches when validating an improved predictive power of machine learning models, and that such validation should include a performance evaluation as a function of "distance" to the training data, to quantify the potential for generalization of the proposed model. The conclusion of the work is that, given current data, TCR specificity is best modeled using peptide-specific approaches, integrating information from all 6 CDR loops, and with negative data constructed from a combination of true and mislabeled negatives. Comparing such machine learning models to similarity-based approaches demonstrated an increased performance gain of the former as the "distance" to the training data was increased; thus demonstrating an improved generalization ability of the machine learning-based approaches. We believe these results demonstrate that the outlined modeling framework and proposed evaluation strategy form a solid basis for investigating the modeling of TCR specificities and that adhering to such a framework will allow for faster progress within the field. The final devolved model, NetTCR-2.1, is available at https://services.healthtech.dtu.dk/service.php?NetTCR-2.1.
Collapse
|
25
|
Connelley T, Nicastri A, Sheldrake T, Vrettou C, Fisch A, Reynisson B, Buus S, Hill A, Morrison I, Nielsen M, Ternette N. Immunopeptidomic Analysis of BoLA-I and BoLA-DR Presented Peptides from Theileria parva Infected Cells. Vaccines (Basel) 2022; 10:vaccines10111907. [PMID: 36423003 PMCID: PMC9699068 DOI: 10.3390/vaccines10111907] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2022] [Revised: 11/03/2022] [Accepted: 11/04/2022] [Indexed: 11/16/2022] Open
Abstract
The apicomplexan parasite Theileria parva is the causative agent of East Coast fever, usually a fatal disease for cattle, which is prevalent in large areas of eastern, central, and southern Africa. Protective immunity against T. parva is mediated by CD8+ T cells, with CD4+ T-cells thought to be important in facilitating the full maturation and development of the CD8+ T-cell response. T. parva has a large proteome, with >4000 protein-coding genes, making T-cell antigen identification using conventional screening approaches laborious and expensive. To date, only a limited number of T-cell antigens have been described. Novel approaches for identifying candidate antigens for T. parva are required to replace and/or complement those currently employed. In this study, we report on the use of immunopeptidomics to study the repertoire of T. parva peptides presented by both BoLA-I and BoLA-DR molecules on infected cells. The study reports on peptides identified from the analysis of 13 BoLA-I and 6 BoLA-DR datasets covering a range of different BoLA genotypes. This represents the most comprehensive immunopeptidomic dataset available for any eukaryotic pathogen to date. Examination of the immunopeptidome data suggested the presence of a large number of coprecipitated and non-MHC-binding peptides. As part of the work, a pipeline to curate the datasets to remove these peptides was developed and used to generate a final list of 74 BoLA-I and 15 BoLA-DR-presented peptides. Together, the data demonstrated the utility of immunopeptidomics as a method to identify novel T-cell antigens for T. parva and the importance of careful curation and the application of high-quality immunoinformatics to parse the data generated.
Collapse
|