1
|
Nameki R, Shetty A, Dareng E, Tyrer J, Lin X, Pharoah P, Corona RI, Kar S, Lawrenson K. chromMAGMA: regulatory element-centric interrogation of risk variants. Life Sci Alliance 2022; 5:e202201446. [PMID: 35777959 PMCID: PMC9251535 DOI: 10.26508/lsa.202201446] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2022] [Revised: 06/07/2022] [Accepted: 06/08/2022] [Indexed: 11/24/2022] Open
Abstract
Candidate causal risk variants from genome-wide association studies reside almost exclusively in noncoding regions of the genome and innovative approaches are necessary to understand their biological function. Multi-marker analysis of genomic annotation (MAGMA) is a widely used program that nominates candidate risk genes by mapping single-nucleotide polymorphism summary statistics from genome-wide association studies to gene bodies. We augmented MAGMA to create chromatin-MAGMA (chromMAGMA), a method to nominate candidate risk genes based on the presence of risk variants within noncoding regulatory elements (REs). We applied chromMAGMA to a genetic susceptibility dataset for epithelial ovarian cancer (EOC), a rare gynecologic malignancy characterized by high mortality. This identified 155 unique candidate EOC risk genes across five EOC histotypes; 83% (105/127) of high-grade serous ovarian cancer risk genes had not previously been implicated in this EOC histotype. Risk genes nominated by chromMAGMA converged on mRNA splicing and transcriptional dysregulation pathways. chromMAGMA is a pipeline that nominates candidate risk genes through a gene regulation-focused approach and helps interpret the biological mechanism of noncoding risk variants for complex diseases.
Collapse
Affiliation(s)
- Robbin Nameki
- Women's Cancer Research Program at the Samuel Oschin Comprehensive Cancer Center, Cedars-Sinai Medical Center, Los Angeles, CA, USA
- Division of Gynecologic Oncology, Department of Obstetrics and Gynecology, Cedars-Sinai Medical Center, Los Angeles, CA, USA
| | - Anamay Shetty
- Women's Cancer Research Program at the Samuel Oschin Comprehensive Cancer Center, Cedars-Sinai Medical Center, Los Angeles, CA, USA
- Division of Gynecologic Oncology, Department of Obstetrics and Gynecology, Cedars-Sinai Medical Center, Los Angeles, CA, USA
- School of Clinical Medicine, University of Cambridge, Cambridge, UK
| | - Eileen Dareng
- Centre for Cancer Genetic Epidemiology, Department of Public Health and Primary Care, University of Cambridge, Cambridge, UK
| | - Jonathan Tyrer
- Centre for Cancer Genetic Epidemiology, Department of Public Health and Primary Care, University of Cambridge, Cambridge, UK
- Centre for Cancer Genetic Epidemiology, Department of Oncology, University of Cambridge, Cambridge, UK
| | - Xianzhi Lin
- Women's Cancer Research Program at the Samuel Oschin Comprehensive Cancer Center, Cedars-Sinai Medical Center, Los Angeles, CA, USA
- Division of Gynecologic Oncology, Department of Obstetrics and Gynecology, Cedars-Sinai Medical Center, Los Angeles, CA, USA
| | - Paul Pharoah
- Centre for Cancer Genetic Epidemiology, Department of Public Health and Primary Care, University of Cambridge, Cambridge, UK
- Centre for Cancer Genetic Epidemiology, Department of Oncology, University of Cambridge, Cambridge, UK
| | - Rosario I Corona
- Women's Cancer Research Program at the Samuel Oschin Comprehensive Cancer Center, Cedars-Sinai Medical Center, Los Angeles, CA, USA
- Division of Gynecologic Oncology, Department of Obstetrics and Gynecology, Cedars-Sinai Medical Center, Los Angeles, CA, USA
| | - Siddhartha Kar
- Medical Research Council Integrative Epidemiology Unit, University of Bristol, Bristol, UK
- Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK
| | - Kate Lawrenson
- Women's Cancer Research Program at the Samuel Oschin Comprehensive Cancer Center, Cedars-Sinai Medical Center, Los Angeles, CA, USA
- Division of Gynecologic Oncology, Department of Obstetrics and Gynecology, Cedars-Sinai Medical Center, Los Angeles, CA, USA
- Cancer Prevention and Control Program, Samuel Oschin Comprehensive Cancer Center, Cedars-Sinai Medical Center, Los Angeles, CA, USA
- Center for Bioinformatics and Functional Genomics, Cedars-Sinai Medical Center, Los Angeles, CA, USA
| |
Collapse
|
2
|
Wang E, Adams S, Stroncek DF, Marincola FM. Human Leukocyte Antigen and Human Neutrophil Antigen Systems. Hematology 2018. [DOI: 10.1016/b978-0-323-35762-3.00113-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022] Open
|
3
|
Milius RP, Mack SJ, Hollenbach JA, Pollack J, Heuer ML, Gragert L, Spellman S, Guethlein LA, Trachtenberg EA, Cooley S, Bochtler W, Mueller CR, Robinson J, Marsh SGE, Maiers M. Genotype List String: a grammar for describing HLA and KIR genotyping results in a text string. TISSUE ANTIGENS 2013; 82:106-12. [PMID: 23849068 PMCID: PMC3715123 DOI: 10.1111/tan.12150] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/04/2013] [Accepted: 05/22/2013] [Indexed: 01/19/2023]
Abstract
Knowledge of an individual's human leukocyte antigen (HLA) genotype is essential for modern medical genetics, and is crucial for hematopoietic stem cell and solid-organ transplantation. However, the high levels of polymorphism known for the HLA genes make it difficult to generate an HLA genotype that unambiguously identifies the alleles that are present at a given HLA locus in an individual. For the last 20 years, the histocompatibility and immunogenetics community has recorded this HLA genotyping ambiguity using allele codes developed by the National Marrow Donor Program (NMDP). While these allele codes may have been effective for recording an HLA genotyping result when initially developed, their use today results in increased ambiguity in an HLA genotype, and they are no longer suitable in the era of rapid allele discovery and ultra-high allele polymorphism. Here, we present a text string format capable of fully representing HLA genotyping results. This Genotype List (GL) String format is an extension of a proposed standard for reporting killer-cell immunoglobulin-like receptor (KIR) genotype data that can be applied to any genetic data that use a standard nomenclature for identifying variants. The GL String format uses a hierarchical set of operators to describe the relationships between alleles, lists of possible alleles, phased alleles, genotypes, lists of possible genotypes, and multilocus unphased genotypes, without losing typing information or increasing typing ambiguity. When used in concert with appropriate tools to create, exchange, and parse these strings, we anticipate that GL Strings will replace NMDP allele codes for reporting HLA genotypes.
Collapse
Affiliation(s)
- R P Milius
- Department of Bioinformatics, National Marrow Donor Program, Minneapolis, MN 55413-1753, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
4
|
Bochtler W, Maiers M, Oudshoorn M, Marsh SGE, Raffoux C, Mueller C, Hurley CK. World Marrow Donor Association guidelines for use of HLA nomenclature and its validation in the data exchange among hematopoietic stem cell donor registries and cord blood banks. Bone Marrow Transplant 2007; 39:737-41. [PMID: 17438587 DOI: 10.1038/sj.bmt.1705672] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Since the advent of the European Marrow Donor Information System in the first half of the last decade, fully automated data exchange between registry computer systems has been playing an ever-increasing role in the international search for unrelated donors of blood progenitor cells. This exchange, however, was hampered by different local conventions used to present HLA data and complicated by the need to extend the official WHO nomenclature to accommodate the registries' information systems and to cross-validate HLA data obtained with different methods and/or at different loci. The guidelines presented here have been developed by the World Marrow Donor Association to standardize the nomenclature to be used and the validation checks to be applied in the international electronic exchange of HLA-typing data among unrelated volunteer hematopoietic stem cell donor registries and umbilical cord blood banks. Two reference web sites have been designated to maintain and update the approved HLA nomenclature and all the ancillary information needed by the conventions described here.
Collapse
Affiliation(s)
- W Bochtler
- Information Technology and Quality Assurance Working Groups of the World Marrow Donor Association, Leiden, The Netherlands
| | | | | | | | | | | | | |
Collapse
|
5
|
Dunn PPJ, Day S, Williams S, Bendukidze N. HLA-DQB1 sequencing-based typing using newly identified conserved nucleotide sequences in introns 1 and 2. ACTA ACUST UNITED AC 2005; 66:99-106. [PMID: 16029429 DOI: 10.1111/j.1399-0039.2005.00445.x] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Sequencing-based typing (SBT) human leukocyte antigen (HLA) class I and II genes should examine entire exon sequences where polymorphisms lie. Primers for the amplification of complete exons therefore anneal in introns and their design relies on accurate intron sequences being available. We decided to develop a SBT method for HLA-DQB1 using amplification primers which anneal in introns 1 and 2, yet the amount of intron sequence data previously available in databases was sparse. Therefore, we undertook a systematic sequencing of introns 1 and 2 using DNA from cell lines homozygous for DQB1. This study confirmed an earlier report that the non-coding regions of this gene are the most polymorphic seen in the human genome. Intron sequences within an allele group were largely identical, the exceptions being DQB1*0301 differing from other DQB1*03 allele groups and DQB1*0601 differing from all other DQB1*06 alleles. A retroviral Alu element, related to the AluYa5a2 subfamily, was identified uniquely inserted in intron 2 of DQB1*02 alleles. For the typing approach, six amplification primers were designed based on conserved allele group sequences covering all of the HLA DQB antigens, and two sequencing primers were also designed which anneal in intron 2. This method has proved to be very robust and has been used as part of a referral DNA sequencing service for a number of years.
Collapse
Affiliation(s)
- P P J Dunn
- DNA Reference Laboratory, National Blood Service, Southmead Road, Bristol BS10 5ND, UK. paul.dunn@nzblood. co.nz
| | | | | | | |
Collapse
|
6
|
Adams SD, Barracchini KC, Chen D, Robbins F, Wang L, Larsen P, Luhm R, Stroncek DF. Ambiguous allele combinations in HLA Class I and Class II sequence-based typing: when precise nucleotide sequencing leads to imprecise allele identification. J Transl Med 2004; 2:30. [PMID: 15363110 PMCID: PMC517951 DOI: 10.1186/1479-5876-2-30] [Citation(s) in RCA: 53] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2004] [Accepted: 09/13/2004] [Indexed: 11/21/2022] Open
Abstract
Sequence-based typing (SBT) is one of the most comprehensive methods utilized for HLA typing. However, one of the inherent problems with this typing method is the interpretation of ambiguous allele combinations which occur when two or more different allele combinations produce identical sequences. The purpose of this study is to investigate the probability of this occurrence. We performed HLA-A,-B SBT for Exons 2 and 3 on 676 donors. Samples were analyzed with a capillary sequencer. The racial distribution of the donors was as follows: 615-Caucasian, 13-Asian, 23-African American, 17-Hispanic and 8-Unknown. 672 donors were analyzed for HLA-A locus ambiguities and 666 donors were analyzed for HLA-B locus ambiguities. At the HLA-A locus a total of 548 total ambiguous allele combinations were identified (548/1344 = 41%). Most (278/548 = 51%) of these ambiguities were due to the fact that Exon 4 analysis was not performed. At the HLA-B locus 322 total ambiguous allele combinations were found (322/1332 = 24%). The HLA-B*07/08/15/27/35/44 antigens, common in Caucasians, produced a large portion of the ambiguities (279/322 = 87%). A large portion of HLA-A and B ambiguous allele combinations can be addressed by utilizing a group-specific primary amplification approach to produce an unambiguous homozygous sequence. Therefore, although the prevalence of ambiguous allele combinations is high, if the resolution of these ambiguities is clinically warranted, methods exist to compensate for this problem.
Collapse
Affiliation(s)
- Sharon D Adams
- HLA Laboratory, Department of Transfusion Medicine, National Institutes of Health, Warren G. Magnuson Clinical Center, Bethesda, Maryland USA
| | - Kathleen C Barracchini
- HLA Laboratory, Department of Transfusion Medicine, National Institutes of Health, Warren G. Magnuson Clinical Center, Bethesda, Maryland USA
| | - Deborah Chen
- HLA Laboratory, Department of Transfusion Medicine, National Institutes of Health, Warren G. Magnuson Clinical Center, Bethesda, Maryland USA
| | - FuMeei Robbins
- HLA Laboratory, Department of Transfusion Medicine, National Institutes of Health, Warren G. Magnuson Clinical Center, Bethesda, Maryland USA
| | - Lu Wang
- DYNAL Biotech, Brown Deer, Wisconsin USA
| | | | | | - David F Stroncek
- HLA Laboratory, Department of Transfusion Medicine, National Institutes of Health, Warren G. Magnuson Clinical Center, Bethesda, Maryland USA
| |
Collapse
|