1
|
Dashti M, Malik MZ, Al-Matrouk A, Bhatti S, Nizam R, Jacob S, Al-Mulla F, Thanaraj TA. HLA-B allele frequencies and implications for pharmacogenetics in the Kuwaiti population. Front Pharmacol 2024; 15:1423636. [PMID: 39464636 PMCID: PMC11502445 DOI: 10.3389/fphar.2024.1423636] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2024] [Accepted: 09/16/2024] [Indexed: 10/29/2024] Open
Abstract
Objective: This study explores the frequency of human leukocyte antigen (HLA) genes, particularly HLA-B alleles, within the Kuwaiti population. We aim to identify alleles with known associations to adverse drug reactions (ADRs) based on existing literature. We focus on the HLA-B gene due to its well-documented associations with severe cutaneous adverse reactions and the extensive pharmacogenetic research supporting its clinical relevance. Methods We utilized the HLA-HD tool to extract, annotate, and analyse HLA-B alleles from the exome data of 561 Kuwaiti individuals, sequenced on the Illumina HiSeq platform. HLA typing was conducted using the HLA-HD tool with a reference panel from the IPD-IMGT/HLA database. The major HLA-B pharmacogenetic markers were obtained from the HLA Adverse Drug Reaction Database, focusing on alleles with significant ADR associations in published literature. Results The distribution of HLA-B alleles in the Kuwaiti population revealed that the most frequent alleles were HLA-B*50:01 (10.52%), HLA-B*51:01 (9.89%), HLA-B*08:01 (6.06%), HLA-B*52:01 (4.55%), HLA-B*18:01 (3.92%), and HLA-B*41:01 (3.65%). Notably, alleles HLA-B*13:01, HLA-B*13:02, HLA-B*15:02, HLA-B*15:13, HLA-B*35:02, HLA-B*35:05, HLA-B*38:01, HLA-B*40:02, HLA-B*44:03, HLA-B*51:01, HLA-B*57:01 and HLA-B*58:01 were identified with known associations to various ADRs. For example, HLA-B*51:01 was associated with clindamycin, phenobarbital, and phenytoin, and was found in 18% of individuals. Conclusion Our study enriches the regional genetic landscape by delineating HLA-B allele variations within Kuwait and across the Arabian Peninsula. This genetic insight, along with the identification of markers previously linked to drug hypersensitivity, provides a foundation for future pharmacogenetic research and potential personalized medicine strategies in the region.
Collapse
Affiliation(s)
- Mohammed Dashti
- Genetics and Bioinformatics Department, Dasman Diabetes Institute, Kuwait City, Kuwait
| | - Md Zubbair Malik
- Genetics and Bioinformatics Department, Dasman Diabetes Institute, Kuwait City, Kuwait
| | - Abdullah Al-Matrouk
- Narcotic and Psychotropic Department, Ministry of Interior, Farwaniya, Kuwait
| | - Saeeda Bhatti
- College of Medical Veterinary and Life Sciences, University of Glasgow, Glasgow, United Kingdom
| | - Rasheeba Nizam
- Genetics and Bioinformatics Department, Dasman Diabetes Institute, Kuwait City, Kuwait
| | - Sindhu Jacob
- Genetics and Bioinformatics Department, Dasman Diabetes Institute, Kuwait City, Kuwait
| | - Fahd Al-Mulla
- Genetics and Bioinformatics Department, Dasman Diabetes Institute, Kuwait City, Kuwait
| | | |
Collapse
|
2
|
Dashti M, Malik MZ, Nizam R, Jacob S, Al-Mulla F, Thanaraj TA. Evaluation of HLA typing content of next-generation sequencing datasets from family trios and individuals of arab ethnicity. Front Genet 2024; 15:1407285. [PMID: 38859936 PMCID: PMC11163123 DOI: 10.3389/fgene.2024.1407285] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2024] [Accepted: 05/07/2024] [Indexed: 06/12/2024] Open
Abstract
Introduction: HLA typing is a critical tool in both clinical and research applications at the individual and population levels. Benchmarking studies have indicated HLA-HD as the preferred tool for accurate and comprehensive HLA allele calling. The advent of next-generation sequencing (NGS) has revolutionized genetic analysis by providing high-throughput sequencing data. This study aims to evaluate, using the HLA-HD tool, the HLA typing content of whole exome, whole genome, and HLA-targeted panel sequence data from the consanguineous population of Arab ethnicity, which has been underrepresented in prior benchmarking studies. Methods: We utilized sequence data from family trios and individuals, sequenced on one or more of the whole exome, whole genome, and HLA-targeted panel sequencing technologies. The performance and resolution across various HLA genes were evaluated. We incorporated a comparative quality control analysis, assessing the results obtained from HLA-HD by comparing them with those from the HLA-Twin tool to authenticate the accuracy of the findings. Results: Our analysis found that alleles across 29 HLA loci can be successfully and consistently typed from NGS datasets. Clinical-grade whole exome sequencing datasets achieved the highest consistency rate at three-field resolution, followed by targeted HLA panel, research-grade whole exome, and whole genome datasets. Discussion: The study catalogues HLA typing consistency across NGS datasets for a large array of HLA genes and highlights assessments regarding the feasibility of utilizing available NGS datasets in HLA allele studies. These findings underscore the reliability of HLA-HD for HLA typing in underrepresented populations and demonstrate the utility of various NGS technologies in achieving accurate HLA allele calling.
Collapse
Affiliation(s)
| | | | | | | | - Fahd Al-Mulla
- Department of Genetics and Bioinformatics, Dasman Diabetes Institute, Kuwait City, Kuwait
| | | |
Collapse
|
3
|
Zhou Y, Lauschke VM. Next-generation sequencing in pharmacogenomics - fit for clinical decision support? Expert Rev Clin Pharmacol 2024; 17:213-223. [PMID: 38247431 DOI: 10.1080/17512433.2024.2307418] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2023] [Accepted: 01/16/2024] [Indexed: 01/23/2024]
Abstract
INTRODUCTION The technological advances of sequencing methods during the past 20 years have fuelled the generation of large amounts of sequencing data that comprise common variations, as well as millions of rare and personal variants that would not be identified by conventional genotyping. While comprehensive sequencing is technically feasible, its clinical utility for guiding personalized treatment decisions remains controversial. AREAS COVERED We discuss the opportunities and challenges of comprehensive sequencing compared to targeted genotyping for pharmacogenomic applications. Current pharmacogenomic sequencing panels are heterogeneous and clinical actionability of the included genes is not a major focus. We provide a current overview and critical discussion of how current studies utilize sequencing data either retrospectively from biobanks, databases or repurposed diagnostic sequencing, or prospectively using pharmacogenomic sequencing. EXPERT OPINION While sequencing-based pharmacogenomics has provided important insights into genetic variations underlying the safety and efficacy of a multitude pharmacological treatments, important hurdles for the clinical implementation of pharmacogenomic sequencing remain. We identify gaps in the interpretation of pharmacogenetic variants, technical challenges pertaining to complex loci and variant phasing, as well as unclear cost-effectiveness and incomplete reimbursement. It is critical to address these challenges in order to realize the promising prospects of pharmacogenomic sequencing.
Collapse
Affiliation(s)
- Yitian Zhou
- Department of Physiology and Pharmacology, Karolinska Institutet, Stockholm, Sweden
| | - Volker M Lauschke
- Department of Physiology and Pharmacology, Karolinska Institutet, Stockholm, Sweden
- Dr Margarete Fischer-Bosch Institute of Clinical Pharmacology, Stuttgart, Germany
- University of Tübingen, Tübingen, Germany
| |
Collapse
|
4
|
Ricker CA, Meli K, Van Allen EM. Historical perspective and future directions: computational science in immuno-oncology. J Immunother Cancer 2024; 12:e008306. [PMID: 38191244 PMCID: PMC10826578 DOI: 10.1136/jitc-2023-008306] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/07/2023] [Indexed: 01/10/2024] Open
Abstract
Immuno-oncology holds promise for transforming patient care having achieved durable clinical response rates across a variety of advanced and metastatic cancers. Despite these achievements, only a minority of patients respond to immunotherapy, underscoring the importance of elucidating molecular mechanisms responsible for response and resistance to inform the development and selection of treatments. Breakthroughs in molecular sequencing technologies have led to the generation of an immense amount of genomic and transcriptomic sequencing data that can be mined to uncover complex tumor-immune interactions using computational tools. In this review, we discuss existing and emerging computational methods that contextualize the composition and functional state of the tumor microenvironment, infer the reactivity and clonal dynamics from reconstructed immune cell receptor repertoires, and predict the antigenic landscape for immune cell recognition. We further describe the advantage of multi-omics analyses for capturing multidimensional relationships and artificial intelligence techniques for integrating omics data with histopathological and radiological images to encapsulate patterns of treatment response and tumor-immune biology. Finally, we discuss key challenges impeding their widespread use and clinical application and conclude with future perspectives. We are hopeful that this review will both serve as a guide for prospective researchers seeking to use existing tools for scientific discoveries and inspire the optimization or development of novel tools to enhance precision, ultimately expediting advancements in immunotherapy that improve patient survival and quality of life.
Collapse
Affiliation(s)
- Cora A Ricker
- Medical Oncology, Dana-Farber Cancer Institute, Boston, Massachusetts, USA
| | - Kevin Meli
- Medical Oncology, Dana-Farber Cancer Institute, Boston, Massachusetts, USA
- Harvard Medical School, Boston, Massachusetts, USA
| | | |
Collapse
|
5
|
Claeys A, Merseburger P, Staut J, Marchal K, Van den Eynden J. Benchmark of tools for in silico prediction of MHC class I and class II genotypes from NGS data. BMC Genomics 2023; 24:247. [PMID: 37161318 PMCID: PMC10170851 DOI: 10.1186/s12864-023-09351-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2023] [Accepted: 04/30/2023] [Indexed: 05/11/2023] Open
Abstract
BACKGROUND The Human Leukocyte Antigen (HLA) genes are a group of highly polymorphic genes that are located in the Major Histocompatibility Complex (MHC) region on chromosome 6. The HLA genotype affects the presentability of tumour antigens to the immune system. While knowledge of these genotypes is of utmost importance to study differences in immune responses between cancer patients, gold standard, PCR-derived genotypes are rarely available in large Next Generation Sequencing (NGS) datasets. Therefore, a variety of methods for in silico NGS-based HLA genotyping have been developed, bypassing the need to determine these genotypes with separate experiments. However, there is currently no consensus on the best performing tool. RESULTS We evaluated 13 MHC class I and/or class II HLA callers that are currently available for free academic use and run on either Whole Exome Sequencing (WES) or RNA sequencing data. Computational resource requirements were highly variable between these tools. Three orthogonal approaches were used to evaluate the accuracy on several large publicly available datasets: a direct benchmark using PCR-derived gold standard HLA calls, a correlation analysis with population-based allele frequencies and an analysis of the concordance between the different tools. The highest MHC-I calling accuracies were found for Optitype (98.0%) and arcasHLA (99.4%) on WES and RNA sequencing data respectively, while for MHC-II HLA-HD was the most accurate tool for both data types (96.2% and 99.4% on WES and RNA data respectively). CONCLUSION The optimal strategy for HLA genotyping from NGS data depends on the availability of either WES or RNA data, the size of the dataset and the available computational resources. If sufficient resources are available, we recommend Optitype and HLA-HD for MHC-I and MHC-II genotype calling respectively.
Collapse
Affiliation(s)
- Arne Claeys
- Department of Human Structure and Repair, Ghent University, Ghent, Belgium
- Cancer Research Institute Ghent, Ghent, Belgium
| | - Peter Merseburger
- Department of Human Structure and Repair, Ghent University, Ghent, Belgium
- Cancer Research Institute Ghent, Ghent, Belgium
| | - Jasper Staut
- Department of Human Structure and Repair, Ghent University, Ghent, Belgium
| | - Kathleen Marchal
- Cancer Research Institute Ghent, Ghent, Belgium
- Department of Information Technology, Ghent University, IDLab, Ghent, Belgium
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
| | - Jimmy Van den Eynden
- Department of Human Structure and Repair, Ghent University, Ghent, Belgium.
- Cancer Research Institute Ghent, Ghent, Belgium.
| |
Collapse
|
6
|
Bruijnesteijn J. HLA/MHC and KIR characterization in humans and non-human primates using Oxford Nanopore Technologies and Pacific Biosciences sequencing platforms. HLA 2023; 101:205-221. [PMID: 36583332 DOI: 10.1111/tan.14957] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2022] [Revised: 12/12/2022] [Accepted: 12/28/2022] [Indexed: 12/31/2022]
Abstract
The gene products of the HLA/MHC and KIR multigene families are important modulators of the immune system and are associated with health and disease. Characterization of the genes encoding these receptors has been integrated into different biomedical applications, including transplantation and reproduction biology, immune therapies and in fundamental research into disease susceptibility or resistance. Conventional short-read sequencing strategies have shown their value in high throughput typing, but are insufficient to uncover the entire complexity of the highly polymorphic HLA/MHC and KIR gene systems. The implementation of single-molecule and real-time sequencing platforms, offered by Pacific Biosciences (PacBio) and Oxford Nanopore Technologies (ONT), revolutionized the fields of genomics and transcriptomics. Using fundamentally distinct principles, these platforms generate long-read data that can unwire the plasticity of the HLA/MHC and KIR genes, including high-resolution characterization of genes, alleles, phased haplotypes, transcription levels and epigenetics modification patterns. These insights might have profound clinical relevance, such as improved matching of donors and patients in clinical transplantation, but could also lift disease association studies to a higher level. Even more, a comprehensive characterization may refine animal models in preclinical studies. In this review, the different HLA/MHC and KIR characterization approaches using PacBio and ONT platforms are described and discussed.
Collapse
Affiliation(s)
- Jesse Bruijnesteijn
- Department of Comparative Genetics and Refinement, Biomedical Primate Research Centre, Rijswijk, The Netherlands
| |
Collapse
|
7
|
Thuesen NH, Klausen MS, Gopalakrishnan S, Trolle T, Renaud G. Benchmarking freely available HLA typing algorithms across varying genes, coverages and typing resolutions. Front Immunol 2022; 13:987655. [PMID: 36426357 PMCID: PMC9679531 DOI: 10.3389/fimmu.2022.987655] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2022] [Accepted: 10/10/2022] [Indexed: 11/02/2023] Open
Abstract
Identifying the specific human leukocyte antigen (HLA) allele combination of an individual is crucial in organ donation, risk assessment of autoimmune and infectious diseases and cancer immunotherapy. However, due to the high genetic polymorphism in this region, HLA typing requires specialized methods. We investigated the performance of five next-generation sequencing (NGS) based HLA typing tools with a non-restricted license namely HLA*LA, Optitype, HISAT-genotype, Kourami and STC-Seq. This evaluation was done for the five HLA loci, HLA-A, -B, -C, -DRB1 and -DQB1 using whole-exome sequencing (WES) samples from 829 individuals. The robustness of the tools to lower depth of coverage (DOC) was evaluated by subsampling and HLA typing 230 WES samples at DOC ranging from 1X to 100X. The HLA typing accuracy was measured across four typing resolutions. Among these, we present two clinically-relevant typing resolutions (P group and pseudo-sequence), which specifically focus on the peptide binding region. On average, across the five HLA loci examined, HLA*LA was found to have the highest typing accuracy. For the individual loci, HLA-A, -B and -C, Optitype's typing accuracy was the highest and HLA*LA had the highest typing accuracy for HLA-DRB1 and -DQB1. The tools' robustness to lower DOC data varied widely and further depended on the specific HLA locus. For all Class I loci, Optitype had a typing accuracy above 95% (according to the modification of the amino acids in the functionally relevant portion of the HLA molecule) at 50X, but increasing the DOC beyond even 100X could still improve the typing accuracy of HISAT-genotype, Kourami, and STC-seq across all five HLA loci as well as HLA*LA's typing accuracy for HLA-DQB1. HLA typing is also used in studies of ancient DNA (aDNA), which is often based on sequencing data with lower quality and DOC. Interestingly, we found that Optitype's typing accuracy is not notably impaired by short read length or by DNA damage, which is typical of aDNA, as long as the DOC is sufficiently high.
Collapse
Affiliation(s)
- Nikolas Hallberg Thuesen
- Evaxion Biotech, Copenhagen, Denmark
- Department of Health Technology, Section for Bioinformatics, Technical University of Denmark, Lyngby, Denmark
| | | | - Shyam Gopalakrishnan
- Section for Hologenomics, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | | | - Gabriel Renaud
- Department of Health Technology, Section for Bioinformatics, Technical University of Denmark, Lyngby, Denmark
| |
Collapse
|
8
|
Ritari J, Hyvärinen K, Partanen J, Koskela S. KIR gene content imputation from single-nucleotide polymorphisms in the Finnish population. PeerJ 2022; 10:e12692. [PMID: 35036093 PMCID: PMC8744484 DOI: 10.7717/peerj.12692] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2021] [Accepted: 12/06/2021] [Indexed: 01/07/2023] Open
Abstract
The killer cell immunoglobulin-like receptor (KIR) gene cluster on chromosome 19 encodes cell surface glycoproteins that bind class I human leukocyte antigen (HLA) molecules as well as some other ligands. Through regulation of natural killer (NK) cell activity KIRs participate in tumour surveillance and clearing viral infections. KIR gene gene copy number variation associates with the outcome of transplantations and susceptibility to immune-mediated diseases. Inferring KIR gene content from genetic variant data is therefore desirable for immunogenetic analysis, particularly in the context of growing biobank genome data collections that rely on genotyping by microarray. Here we describe a stand-alone and freely available gene content imputation for 12 KIR genes. The models were trained using 807 Finnish biobank samples genotyped for 5900 KIR-region SNPs and analysed for KIR gene content with targeted sequencing. Cross-validation results demonstrate a high mean overall accuracy of 98.5% (95% CI [97.0-99.2]%) which compares favourably with previous methods including short-read sequencing based approaches.
Collapse
Affiliation(s)
- Jarmo Ritari
- Finnish Red Cross Blood Service, Helsinki, Finland
| | | | | | - Satu Koskela
- Finnish Red Cross Blood Service, Helsinki, Finland
| |
Collapse
|
9
|
Douillard V, Castelli EC, Mack SJ, Hollenbach JA, Gourraud PA, Vince N, Limou S. Approaching Genetics Through the MHC Lens: Tools and Methods for HLA Research. Front Genet 2021; 12:774916. [PMID: 34925459 PMCID: PMC8677840 DOI: 10.3389/fgene.2021.774916] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2021] [Accepted: 11/08/2021] [Indexed: 01/11/2023] Open
Abstract
The current SARS-CoV-2 pandemic era launched an immediate and broad response of the research community with studies both about the virus and host genetics. Research in genetics investigated HLA association with COVID-19 based on in silico, population, and individual data. However, they were conducted with variable scale and success; convincing results were mostly obtained with broader whole-genome association studies. Here, we propose a technical review of HLA analysis, including basic HLA knowledge as well as available tools and advice. We notably describe recent algorithms to infer and call HLA genotypes from GWAS SNPs and NGS data, respectively, which opens the possibility to investigate HLA from large datasets without a specific initial focus on this region. We thus hope this overview will empower geneticists who were unfamiliar with HLA to run MHC-focused analyses following the footsteps of the Covid-19|HLA & Immunogenetics Consortium.
Collapse
Affiliation(s)
- Venceslas Douillard
- Centre de Recherche en Transplantation et Immunologie, CHU Nantes, Inserm, Centre de Recherche en Transplantation et Immunologie, Université de Nantes, Nantes, France
| | | | - Steven J. Mack
- Division of Allergy, Immunology and Bone Marrow Transplantation, Department of Pediatrics, School of Medicine, University of California, San Francisco, San Francisco, CA, United States
| | - Jill A. Hollenbach
- Department of Neurology, University of California, San Francisco, San Francisco, CA, United States
- Department of Epidemiology and Biostatistics, University of California, San Francisco, San Francisco, CA, United States
| | - Pierre-Antoine Gourraud
- Centre de Recherche en Transplantation et Immunologie, CHU Nantes, Inserm, Centre de Recherche en Transplantation et Immunologie, Université de Nantes, Nantes, France
| | - Nicolas Vince
- Centre de Recherche en Transplantation et Immunologie, CHU Nantes, Inserm, Centre de Recherche en Transplantation et Immunologie, Université de Nantes, Nantes, France
| | - Sophie Limou
- Centre de Recherche en Transplantation et Immunologie, CHU Nantes, Inserm, Centre de Recherche en Transplantation et Immunologie, Université de Nantes, Nantes, France
- Ecole Centrale de Nantes, Department of Computer Sciences and Mathematics in Biology, Nantes, France
| |
Collapse
|
10
|
Aguiar VRC, Augusto DG, Castelli EC, Hollenbach JA, Meyer D, Nunes K, Petzl-Erler ML. An immunogenetic view of COVID-19. Genet Mol Biol 2021; 44:e20210036. [PMID: 34436508 PMCID: PMC8388242 DOI: 10.1590/1678-4685-gmb-2021-0036] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2021] [Accepted: 06/12/2021] [Indexed: 02/06/2023] Open
Abstract
Meeting the challenges brought by the COVID-19 pandemic requires an interdisciplinary approach. In this context, integrating knowledge of immune function with an understanding of how genetic variation influences the nature of immunity is a key challenge. Immunogenetics can help explain the heterogeneity of susceptibility and protection to the viral infection and disease progression. Here, we review the knowledge developed so far, discussing fundamental genes for triggering the innate and adaptive immune responses associated with a viral infection, especially with the SARS-CoV-2 mechanisms. We emphasize the role of the HLA and KIR genes, discussing what has been uncovered about their role in COVID-19 and addressing methodological challenges of studying these genes. Finally, we comment on questions that arise when studying admixed populations, highlighting the case of Brazil. We argue that the interplay between immunology and an understanding of genetic associations can provide an important contribution to our knowledge of COVID-19.
Collapse
Affiliation(s)
- Vitor R. C. Aguiar
- Universidade de São Paulo, Departamento de Genética e Biologia
Evolutiva, São Paulo, SP, Brazil
| | - Danillo G. Augusto
- University of California, UCSF Weill Institute for Neurosciences,
Department of Neurology, San Francisco, CA, USA
- Universidade Federal do Paraná, Departamento de Genética, Curitiba,
PR, Brazil
| | - Erick C. Castelli
- Universidade Estadual Paulista, Faculdade de Medicina de Botucatu,
Departamento de Patologia, Botucatu, SP, Brazil
| | - Jill A. Hollenbach
- University of California, UCSF Weill Institute for Neurosciences,
Department of Neurology, San Francisco, CA, USA
| | - Diogo Meyer
- Universidade de São Paulo, Departamento de Genética e Biologia
Evolutiva, São Paulo, SP, Brazil
| | - Kelly Nunes
- Universidade de São Paulo, Departamento de Genética e Biologia
Evolutiva, São Paulo, SP, Brazil
| | | |
Collapse
|
11
|
Marin WM, Dandekar R, Augusto DG, Yusufali T, Heyn B, Hofmann J, Lange V, Sauter J, Norman PJ, Hollenbach JA. High-throughput Interpretation of Killer-cell Immunoglobulin-like Receptor Short-read Sequencing Data with PING. PLoS Comput Biol 2021; 17:e1008904. [PMID: 34339413 PMCID: PMC8360517 DOI: 10.1371/journal.pcbi.1008904] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2021] [Revised: 08/12/2021] [Accepted: 07/16/2021] [Indexed: 02/07/2023] Open
Abstract
The killer-cell immunoglobulin-like receptor (KIR) complex on chromosome 19 encodes receptors that modulate the activity of natural killer cells, and variation in these genes has been linked to infectious and autoimmune disease, as well as having bearing on pregnancy and transplant outcomes. The medical relevance and high variability of KIR genes makes short-read sequencing an attractive technology for interrogating the region, providing a high-throughput, high-fidelity sequencing method that is cost-effective. However, because this gene complex is characterized by extensive nucleotide polymorphism, structural variation including gene fusions and deletions, and a high level of homology between genes, its interrogation at high resolution has been thwarted by bioinformatic challenges, with most studies limited to examining presence or absence of specific genes. Here, we present the PING (Pushing Immunogenetics to the Next Generation) pipeline, which incorporates empirical data, novel alignment strategies and a custom alignment processing workflow to enable high-throughput KIR sequence analysis from short-read data. PING provides KIR gene copy number classification functionality for all KIR genes through use of a comprehensive alignment reference. The gene copy number determined per individual enables an innovative genotype determination workflow using genotype-matched references. Together, these methods address the challenges imposed by the structural complexity and overall homology of the KIR complex. To determine copy number and genotype determination accuracy, we applied PING to European and African validation cohorts and a synthetic dataset. PING demonstrated exceptional copy number determination performance across all datasets and robust genotype determination performance. Finally, an investigation into discordant genotypes for the synthetic dataset provides insight into misaligned reads, advancing our understanding in interpretation of short-read sequencing data in complex genomic regions. PING promises to support a new era of studies of KIR polymorphism, delivering high-resolution KIR genotypes that are highly accurate, enabling high-quality, high-throughput KIR genotyping for disease and population studies.
Collapse
Affiliation(s)
- Wesley M. Marin
- UCSF Weill Institute for Neurosciences, Department of Neurology, University of California, San Francisco, San Francisco, California, United States of America
| | - Ravi Dandekar
- UCSF Weill Institute for Neurosciences, Department of Neurology, University of California, San Francisco, San Francisco, California, United States of America
| | - Danillo G. Augusto
- UCSF Weill Institute for Neurosciences, Department of Neurology, University of California, San Francisco, San Francisco, California, United States of America
| | - Tasneem Yusufali
- UCSF Weill Institute for Neurosciences, Department of Neurology, University of California, San Francisco, San Francisco, California, United States of America
| | | | | | | | | | - Paul J. Norman
- Division of Biomedical Informatics and Personalized Medicine, and Department of Immunology and Microbiology, University of Colorado Anschutz Medical Campus, Aurora, Colorado, United States of America
| | - Jill A. Hollenbach
- UCSF Weill Institute for Neurosciences, Department of Neurology, University of California, San Francisco, San Francisco, California, United States of America
| |
Collapse
|
12
|
Migdal M, Ruan DF, Forrest WF, Horowitz A, Hammer C. MiDAS-Meaningful Immunogenetic Data at Scale. PLoS Comput Biol 2021; 17:e1009131. [PMID: 34228721 PMCID: PMC8284797 DOI: 10.1371/journal.pcbi.1009131] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2021] [Revised: 07/16/2021] [Accepted: 05/30/2021] [Indexed: 12/15/2022] Open
Abstract
Human immunogenetic variation in the form of HLA and KIR types has been shown to be strongly associated with a multitude of immune-related phenotypes. However, association studies involving immunogenetic loci most commonly involve simple analyses of classical HLA allelic diversity, resulting in limitations regarding the interpretability and reproducibility of results. We here present MiDAS, a comprehensive R package for immunogenetic data transformation and statistical analysis. MiDAS recodes input data in the form of HLA alleles and KIR types into biologically meaningful variables, allowing HLA amino acid fine mapping, analyses of HLA evolutionary divergence as well as experimentally validated HLA-KIR interactions. Further, MiDAS enables comprehensive statistical association analysis workflows with phenotypes of diverse measurement scales. MiDAS thus closes the gap between the inference of immunogenetic variation and its efficient utilization to make relevant discoveries related to immune and disease biology. It is freely available under a MIT license.
Collapse
Affiliation(s)
- Maciej Migdal
- Roche Global IT Solution Centre (RGITSC), Warsaw, Poland
| | - Dan Fu Ruan
- Department of Oncological Sciences, Precision Immunology Institute, Tisch Cancer Institute, Icahn School of Medicine at Mount Sinai, New York, New York, United States of America
| | - William F. Forrest
- Department of OMNI Bioinformatics, Genentech, South San Francisco, California, United States of America
| | - Amir Horowitz
- Department of Oncological Sciences, Precision Immunology Institute, Tisch Cancer Institute, Icahn School of Medicine at Mount Sinai, New York, New York, United States of America
| | - Christian Hammer
- Department of Cancer Immunology, Genentech, South San Francisco, California, United States of America
- Department of Human Genetics, Genentech, South San Francisco, California, United States of America
| |
Collapse
|
13
|
Roe D, Kuang R. Accurate and Efficient KIR Gene and Haplotype Inference From Genome Sequencing Reads With Novel K-mer Signatures. Front Immunol 2020; 11:583013. [PMID: 33324401 PMCID: PMC7727328 DOI: 10.3389/fimmu.2020.583013] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2020] [Accepted: 11/02/2020] [Indexed: 12/17/2022] Open
Abstract
The killer-cell immunoglobulin-like receptor (KIR) proteins evolve to fight viruses and mediate the body's reaction to pregnancy. These roles provide selection pressure for variation at both the structural/haplotype and base/allele levels. At the same time, the genes have evolved relatively recently by tandem duplication and therefore exhibit very high sequence similarity over thousands of bases. These variation-homology patterns make it impossible to interpret KIR haplotypes from abundant short-read genome sequencing data at population scale using existing methods. Here, we developed an efficient computational approach for in silico KIR probe interpretation (KPI) to accurately interpret individual's KIR genes and haplotype-pairs from KIR sequencing reads. We designed synthetic 25-base sequence probes by analyzing previously reported haplotype sequences, and we developed a bioinformatics pipeline to interpret the probes in the context of 16 KIR genes and 16 haplotype structures. We demonstrated its accuracy on a synthetic data set as well as a real whole genome sequences from 748 individuals from The Genome of the Netherlands (GoNL). The GoNL predictions were compared with predictions from SNP-based predictions. Our results show 100% accuracy rate for the synthetic tests and a 99.6% family-consistency rate in the GoNL tests. Agreement with the SNP-based calls on KIR genes ranges from 72%-100% with a mean of 92%; most differences occur in genes KIR2DS2, KIR2DL2, KIR2DS3, and KIR2DL5 where KPI predicts presence and the SNP-based interpretation predicts absence. Overall, the evidence suggests that KPI's accuracy is 97% or greater for both KIR gene and haplotype-pair predictions, and the presence/absence genotyping leads to ambiguous haplotype-pair predictions with 16 reference KIR haplotype structures. KPI is free, open, and easily executable as a Nextflow workflow supported by a Docker environment at https://github.com/droeatumn/kpi.
Collapse
Affiliation(s)
- David Roe
- Bioinformatics and Computational Biology, University of Minnesota, Rochester, MN, United States
| | - Rui Kuang
- Bioinformatics and Computational Biology, University of Minnesota, Rochester, MN, United States
- Department of Computer Science and Engineering, University of Minnesota, Minneapolis, MN, United States
| |
Collapse
|
14
|
Punta M, Jennings VA, Melcher AA, Lise S. The Immunogenic Potential of Recurrent Cancer Drug Resistance Mutations: An In Silico Study. Front Immunol 2020; 11:524968. [PMID: 33133066 PMCID: PMC7578429 DOI: 10.3389/fimmu.2020.524968] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2020] [Accepted: 09/14/2020] [Indexed: 12/13/2022] Open
Abstract
Cancer somatic mutations have been identified as a source of antigens that can be targeted by cancer immunotherapy. In this work, expanding on previous studies, we analyze the HLA-presentation properties of mutations that are known to drive resistance to cancer targeted-therapies. We survey a large dataset of mutations that confer resistance to different drugs and occur in numerous genes and tumor types. We show that a significant number of them are predicted in silico to be potentially immunogenic across a large proportion of the human population. Further, by analyzing a cohort of patients carrying a small subset of these resistance mutations, we provide evidence that what is observed in the general population may be indicative of the mutations' immunogenic potential in resistant patients. Two of the mutations in our dataset had previously been experimentally validated by others and it was confirmed that some of their associated neopeptides elicit T-cell responses in vitro. The identification of potent cancer-specific antigens can be instrumental for developing more effective immunotherapies. In this work, we propose a novel list of drug-resistance mutations, several of which are recurrent, that could be of particular interest in the context of off-the-shelf precision immunotherapies such as therapeutic cancer vaccines.
Collapse
Affiliation(s)
- Marco Punta
- Centre for Evolution and Cancer, The Institute of Cancer Research, London, United Kingdom
| | - Victoria A. Jennings
- Department of Immunity and Infection, Leeds Institute of Medical Research, Leeds, United Kingdom
- Division of Radiotherapy and Imaging, The Institute of Cancer Research, London, United Kingdom
| | - Alan A. Melcher
- Division of Radiotherapy and Imaging, The Institute of Cancer Research, London, United Kingdom
| | - Stefano Lise
- Centre for Evolution and Cancer, The Institute of Cancer Research, London, United Kingdom
| |
Collapse
|