1
|
Alper CA, Dawkins RL, Kulski JK, Larsen CE, Lloyd SS. Editorial: Population genomic architecture: Conserved polymorphic sequences (CPSs), not linkage disequilibrium. Front Genet 2023; 14:1140350. [PMID: 36777737 PMCID: PMC9911302 DOI: 10.3389/fgene.2023.1140350] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2023] [Accepted: 01/17/2023] [Indexed: 01/28/2023] Open
Affiliation(s)
- Chester A. Alper
- Program in Cellular and Molecular Medicine, Boston Children’s Hospital, Boston, MA, United States,Department of Pediatrics, Harvard Medical School, Boston, MA, United States,*Correspondence: Chester A. Alper, ; Roger L. Dawkins, ; Jerzy K. Kulski, ; Charles E. Larsen, ; Sally S. Lloyd,
| | - Roger L. Dawkins
- CY O’Connor ERADE Village Foundation, North Dandalup, WA, Australia,*Correspondence: Chester A. Alper, ; Roger L. Dawkins, ; Jerzy K. Kulski, ; Charles E. Larsen, ; Sally S. Lloyd,
| | - Jerzy K. Kulski
- Department of Molecular Life Sciences, Division of Basic Medical Science and Molecular Medicine, Tokai University School of Medicine, Isehara, Japan,*Correspondence: Chester A. Alper, ; Roger L. Dawkins, ; Jerzy K. Kulski, ; Charles E. Larsen, ; Sally S. Lloyd,
| | - Charles E. Larsen
- Program in Cellular and Molecular Medicine, Boston Children’s Hospital, Boston, MA, United States,Department of Pediatrics, Harvard Medical School, Boston, MA, United States,*Correspondence: Chester A. Alper, ; Roger L. Dawkins, ; Jerzy K. Kulski, ; Charles E. Larsen, ; Sally S. Lloyd,
| | - Sally S. Lloyd
- CY O’Connor ERADE Village Foundation, North Dandalup, WA, Australia,*Correspondence: Chester A. Alper, ; Roger L. Dawkins, ; Jerzy K. Kulski, ; Charles E. Larsen, ; Sally S. Lloyd,
| |
Collapse
|
2
|
Bypassing Mendel's First Law: Transmission Ratio Distortion in Mammals. Int J Mol Sci 2023; 24:ijms24021600. [PMID: 36675116 PMCID: PMC9863905 DOI: 10.3390/ijms24021600] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2022] [Revised: 01/10/2023] [Accepted: 01/11/2023] [Indexed: 01/15/2023] Open
Abstract
Mendel's law of segregation states that the two alleles at a diploid locus should be transmitted equally to the progeny. A genetic segregation distortion, also referred to as transmission ratio distortion (TRD), is a statistically significant deviation from this rule. TRD has been observed in several mammal species and may be due to different biological mechanisms occurring at diverse time points ranging from gamete formation to lethality at post-natal stages. In this review, we describe examples of TRD and their possible mechanisms in mammals based on current knowledge. We first focus on the differences between TRD in male and female gametogenesis in the house mouse, in which some of the most well studied TRD systems have been characterized. We then describe known TRD in other mammals, with a special focus on the farmed species and in the peculiar common shrew species. Finally, we discuss TRD in human diseases. Thus far, to our knowledge, this is the first time that such description is proposed. This review will help better comprehend the processes involved in TRD. A better understanding of these molecular mechanisms will imply a better comprehension of their impact on fertility and on genome evolution. In turn, this should allow for better genetic counseling and lead to better care for human families.
Collapse
|
3
|
Human leukocyte antigen super-locus: nexus of genomic supergenes, SNPs, indels, transcripts, and haplotypes. Hum Genome Var 2022; 9:49. [PMID: 36543786 PMCID: PMC9772353 DOI: 10.1038/s41439-022-00226-5] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2022] [Revised: 11/08/2022] [Accepted: 11/15/2022] [Indexed: 12/24/2022] Open
Abstract
The human Major Histocompatibility Complex (MHC) or Human Leukocyte Antigen (HLA) super-locus is a highly polymorphic genomic region that encodes more than 140 coding genes including the transplantation and immune regulatory molecules. It receives special attention for genetic investigation because of its important role in the regulation of innate and adaptive immune responses and its strong association with numerous infectious and/or autoimmune diseases. In recent years, MHC genotyping and haplotyping using Sanger sequencing and next-generation sequencing (NGS) methods have produced many hundreds of genomic sequences of the HLA super-locus for comparative studies of the genetic architecture and diversity between the same and different haplotypes. In this special issue on 'The Current Landscape of HLA Genomics and Genetics', we provide a short review of some of the recent analytical developments used to investigate the SNP polymorphisms, structural variants (indels), transcription and haplotypes of the HLA super-locus. This review highlights the importance of using reference cell-lines, population studies, and NGS methods to improve and update our understanding of the mechanisms, architectural structures and combinations of human MHC genomic alleles (SNPs and indels) that better define and characterise haplotypes and their association with various phenotypes and diseases.
Collapse
|
4
|
Alper CA. The Path to Conserved Extended Haplotypes: Megabase-Length Haplotypes at High Population Frequency. Front Genet 2021; 12:716603. [PMID: 34422017 PMCID: PMC8378214 DOI: 10.3389/fgene.2021.716603] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2021] [Accepted: 07/13/2021] [Indexed: 11/13/2022] Open
Abstract
This minireview describes the history of the conceptual development of conserved extended haplotypes (CEHs): megabase-length haplotypes that exist at high (≥0.5%) population frequency. My career began in internal medicine, shifted to pediatrics, and clinical practice changed to research. My research interest was initially in hematology: on plasma proteins, their metabolism, synthesis, and function. This narrowed to a focus on proteins of the human complement system, their role in immunity and their genetics, beginning with polymorphism and deficiency of C3. My group identified genetic polymorphisms and/or inherited deficiencies of C2, C4, C6, and C8. After defining glycine-rich beta glycoprotein as factor B (Bf) in the properdin system, we found that the genes for Bf (CFB), C2, C4A, and C4B were inherited as a single haplotypic unit which we named the "complotype." Complotypes are located within the major histocompatibility complex (MHC) between HLA-B and HLA-DRB1 and are designated (in arbitrary order) by their CFB, C2, C4A, and C4B types. Pedigree analysis revealed long stretches (several megabases) of apparently fixed DNA within the MHC that we referred to as "extended haplotypes" (later as "CEHs"). About 10 to 12 common CEHs constitute at least 25 - 30% of MHC haplotypes among European Caucasian populations. These CEHs contain virtually all the most common markers of MHC-associated diseases. In the case of type 1 diabetes, we have proposed a purely genetic and epigenetic model (with a small number of Mendelian recessive disease genes) that explains all the puzzling features of the disease, including its rising incidence.
Collapse
Affiliation(s)
- Chester A Alper
- Program in Cellular and Molecular Medicine, Boston Children's Hospital, Boston, MA, United States.,Department of Pediatrics, Harvard Medical School, Boston, MA, United States
| |
Collapse
|
5
|
Hernández-Doño S, Jakez-Ocampo J, Márquez-García JE, Ruiz D, Acuña-Alonzo V, Lima G, Llorente L, Tovar-Méndez VH, García-Silva R, Granados J, Zúñiga J, Vargas-Alarcón G. Heterogeneity of Genetic Admixture Determines SLE Susceptibility in Mexican. Front Genet 2021; 12:701373. [PMID: 34413879 PMCID: PMC8369992 DOI: 10.3389/fgene.2021.701373] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2021] [Accepted: 07/12/2021] [Indexed: 12/11/2022] Open
Abstract
Systemic Lupus Erythematosus (SLE) is an autoimmune inflammatory disorder for which Major Histocompatibility Complex (MHC) genes are well identified as risk factors. SLE patients present different clinical phenotypes, which are partly explained by admixture patterns variation among Mexicans. Population genetic has insight into the high genetic variability of Mexicans, mainly described through HLA gene studies with anthropological and biomedical importance. A prospective, case-control study was performed. In this study, we recruited 146 SLE patients, and 234 healthy individuals were included as a control group; both groups were admixed Mexicans from Mexico City. The HLA typing methods were based on Next Generation Sequencing and Sequence-Based Typing (SBT). The data analysis was performed with population genetic programs and statistical packages. The admixture estimations based on HLA-B and -DRB1 revealed that SLE patients have a higher Southwestern European ancestry proportion (48 ± 8%) than healthy individuals (30 ± 7%). In contrast, Mexican Native American components are diminished in SLE patients (44 ± 1%) and augmented in Healthy individuals (63 ± 4%). HLA alleles and haplotypes' frequency analysis found variants previously described in SLE patients from Mexico City. Moreover, a conserved extended haplotype that confers risk to develop SLE was found, the HLA-A∗29:02∼C∗16:01∼B∗44:03∼DRB1∗07:01∼DQB1∗02:02, pC = 0.02, OR = 1.41. Consistent with the admixture estimations, the origin of all risk alleles and haplotypes found in this study are European, while the protection alleles are Mexican Native American. The analysis of genetic distances supported that the SLE patient group is closer to the Southwestern European parental populace and farthest from Mexican Native Americans than healthy individuals. Heterogeneity of genetic admixture determines SLE susceptibility and protection in Mexicans. HLA sequencing is helpful to determine susceptibility alleles and haplotypes restricted to some populations.
Collapse
Affiliation(s)
- Susana Hernández-Doño
- Immunogenetics Division, Department of Transplant, Instituto Nacional de Ciencias Médicas y Nutrición Salvador Zubirán, Mexico City, Mexico
| | - Juan Jakez-Ocampo
- Department of Immunology and Rheumatology, Instituto Nacional de Ciencias Médicas y Nutrición Salvador Zubirán, Mexico City, Mexico
| | - José Eduardo Márquez-García
- Molecular Biology Core Facility, Instituto Nacional de Enfermedades Respiratorias Ismael Cosío Villegas, Mexico City, Mexico
| | - Daniela Ruiz
- Department of Dermatology, Hospital General Dr. Manuel Gea González, Mexico City, Mexico
| | - Víctor Acuña-Alonzo
- Laboratory of Physiology, Biochemistry, and Genetics, Escuela Nacional de Antropología e Historia, Mexico City, Mexico
| | - Guadalupe Lima
- Department of Immunology and Rheumatology, Instituto Nacional de Ciencias Médicas y Nutrición Salvador Zubirán, Mexico City, Mexico
| | - Luis Llorente
- Department of Immunology and Rheumatology, Instituto Nacional de Ciencias Médicas y Nutrición Salvador Zubirán, Mexico City, Mexico
| | - Víctor Hugo Tovar-Méndez
- Department of Endocrinology, Instituto Nacional de Ciencias Médicas y Nutrición Salvador Zubirán, Mexico City, Mexico
| | - Rafael García-Silva
- Department of Internal Medicine, Instituto Nacional de Ciencias Médicas y Nutrición Salvador Zubirán, Mexico City, Mexico
| | - Julio Granados
- Immunogenetics Division, Department of Transplant, Instituto Nacional de Ciencias Médicas y Nutrición Salvador Zubirán, Mexico City, Mexico
| | - Joaquín Zúñiga
- Laboratory of Immunobiology and Genetics, Instituto Nacional de Enfermedades Respiratorias Ismael Cosío Villegas, Mexico City, Mexico.,Tecnologico de Monterrey, Escuela de Medicina y Ciencias de la Salud, Mexico City, Mexico
| | | |
Collapse
|
6
|
Kulski JK, Suzuki S, Shiina T. Haplotype Shuffling and Dimorphic Transposable Elements in the Human Extended Major Histocompatibility Complex Class II Region. Front Genet 2021; 12:665899. [PMID: 34122517 PMCID: PMC8193847 DOI: 10.3389/fgene.2021.665899] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2021] [Accepted: 04/12/2021] [Indexed: 12/26/2022] Open
Abstract
The major histocompatibility complex (MHC) on chromosome 6p21 is one of the most single-nucleotide polymorphism (SNP)-dense regions of the human genome and a prime model for the study and understanding of conserved sequence polymorphisms and structural diversity of ancestral haplotypes/conserved extended haplotypes. This study aimed to follow up on a previous analysis of the MHC class I region by using the same set of 95 MHC haplotype sequences downloaded from a publicly available BioProject database at the National Center for Biotechnology Information to identify and characterize the polymorphic human leukocyte antigen (HLA)-class II genes, the MTCO3P1 pseudogene alleles, the indels of transposable elements as haplotypic lineage markers, and SNP-density crossover (XO) loci at haplotype junctions in DNA sequence alignments of different haplotypes across the extended class II region (∼1 Mb) from the telomeric PRRT1 gene in class III to the COL11A2 gene at the centromeric end of class II. We identified 42 haplotypic indels (20 Alu, 7 SVA, 13 LTR or MERs, and 2 indels composed of a mosaic of different transposable elements) linked to particular HLA-class II alleles. Comparative sequence analyses of 136 haplotype pairs revealed 98 unique XO sites between SNP-poor and SNP-rich genomic segments with considerable haplotype shuffling located in the proximity of putative recombination hotspots. The majority of XO sites occurred across various regions including in the vicinity of MTCO3P1 between HLA-DQB1 and HLA-DQB3, between HLA-DQB2 and HLA-DOB, between DOB and TAP2, and between HLA-DOA and HLA-DPA1, where most XOs were within a HERVK22 sequence. We also determined the genomic positions of the PRDM9-recombination suppression sequence motif ATCCATG/CATGGAT and the PRDM9 recombination activation partial binding motif CCTCCCCT/AGGGGAG in the class II region of the human reference genome (NC_ 000006) relative to published meiotic recombination positions. Both the recombination and anti-recombination PRDM9 binding motifs were widely distributed throughout the class II genomic regions with 50% or more found within repeat elements; the anti-recombination motifs were found mostly in L1 fragmented repeats. This study shows substantial haplotype shuffling between different polymorphic blocks and confirms the presence of numerous putative ancestral recombination sites across the class II region between various HLA class II genes.
Collapse
Affiliation(s)
- Jerzy K Kulski
- Faculty of Health and Medical Sciences, The University of Western Australia, Crawley, WA, Australia.,Department of Molecular Life Sciences, Division of Basic Medical Science and Molecular Medicine, Tokai University School of Medicine, Isehara, Japan
| | - Shingo Suzuki
- Department of Molecular Life Sciences, Division of Basic Medical Science and Molecular Medicine, Tokai University School of Medicine, Isehara, Japan
| | - Takashi Shiina
- Department of Molecular Life Sciences, Division of Basic Medical Science and Molecular Medicine, Tokai University School of Medicine, Isehara, Japan
| |
Collapse
|
7
|
Kulski JK, Suzuki S, Shiina T. SNP-Density Crossover Maps of Polymorphic Transposable Elements and HLA Genes Within MHC Class I Haplotype Blocks and Junction. Front Genet 2021; 11:594318. [PMID: 33537058 PMCID: PMC7848197 DOI: 10.3389/fgene.2020.594318] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2020] [Accepted: 11/24/2020] [Indexed: 12/12/2022] Open
Abstract
The genomic region (~4 Mb) of the human major histocompatibility complex (MHC) on chromosome 6p21 is a prime model for the study and understanding of conserved polymorphic sequences (CPSs) and structural diversity of ancestral haplotypes (AHs)/conserved extended haplotypes (CEHs). The aim of this study was to use a set of 95 MHC genomic sequences downloaded from a publicly available BioProject database at NCBI to identify and characterise polymorphic human leukocyte antigen (HLA) class I genes and pseudogenes, MICA and MICB, and retroelement indels as haplotypic lineage markers, and single-nucleotide polymorphism (SNP) crossover loci in DNA sequence alignments of different haplotypes across the Olfactory Receptor (OR) gene region (~1.2 Mb) and the MHC class I region (~1.8 Mb) from the GPX5 to the MICB gene. Our comparative sequence analyses confirmed the identity of 12 haplotypic retroelement markers and revealed that they partitioned the HLA-A/B/C haplotypes into distinct evolutionary lineages. Crossovers between SNP-poor and SNP-rich regions defined the sequence range of haplotype blocks, and many of these crossover junctions occurred within particular transposable elements, lncRNA, OR12D2, MUC21, MUC22, PSORS1A3, HLA-C, HLA-B, and MICA. In a comparison of more than 250 paired sequence alignments, at least 38 SNP-density crossover sites were mapped across various regions from GPX5 to MICB. In a homology comparison of 16 different haplotypes, seven CEH/AH (7.1, 8.1, 18.2, 51.x, 57.1, 62.x, and 62.1) had no detectable SNP-density crossover junctions and were SNP poor across the entire ~2.8 Mb of sequence alignments. Of the analyses between different recombinant haplotypes, more than half of them had SNP crossovers within 10 kb of LTR16B/ERV3-16A3_I, MLT1, Charlie, and/or THE1 sequences and were in close vicinity to structurally polymorphic Alu and SVA insertion sites. These studies demonstrate that (1) SNP-density crossovers are associated with putative ancestral recombination sites that are widely spread across the MHC class I genomic region from at least the telomeric OR12D2 gene to the centromeric MICB gene and (2) the genomic sequences of MHC homozygous cell lines are useful for analysing haplotype blocks, ancestral haplotypic landscapes and markers, CPSs, and SNP-density crossover junctions.
Collapse
Affiliation(s)
- Jerzy K. Kulski
- Faculty of Health and Medical Sciences, Medical School, The University of Western Australia, Crawley, WA, Australia
- Division of Basic Medical Science and Molecular Medicine, Department of Molecular Life Science, Tokai University School of Medicine, Isehara, Japan
| | - Shingo Suzuki
- Division of Basic Medical Science and Molecular Medicine, Department of Molecular Life Science, Tokai University School of Medicine, Isehara, Japan
| | - Takashi Shiina
- Division of Basic Medical Science and Molecular Medicine, Department of Molecular Life Science, Tokai University School of Medicine, Isehara, Japan
| |
Collapse
|
8
|
Vadva Z, Larsen CE, Propp BE, Trautwein MR, Alford DR, Alper CA. A New Pedigree-Based SNP Haplotype Method for Genomic Polymorphism and Genetic Studies. Cells 2019; 8:E835. [PMID: 31387299 PMCID: PMC6721696 DOI: 10.3390/cells8080835] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2019] [Revised: 07/30/2019] [Accepted: 07/31/2019] [Indexed: 12/25/2022] Open
Abstract
Single nucleotide polymorphisms (SNPs) are usually the most frequent genomic variants. Directly pedigree-phased multi-SNP haplotypes provide a more accurate view of polymorphic population genomic structure than individual SNPs. The former are, therefore, more useful in genetic correlation with subject phenotype. We describe a new pedigree-based methodology for generating non-ambiguous SNP haplotypes for genetic study. SNP data for haplotype analysis were extracted from a larger Type 1 Diabetes Genetics Consortium SNP dataset based on minor allele frequency variation and redundancy, coverage rate (the frequency of phased haplotypes in which each SNP is defined) and genomic location. Redundant SNPs were eliminated, overall haplotype polymorphism was optimized and the number of undefined haplotypes was minimized. These edited SNP haplotypes from a region containing HLA-DRB1 (DR) and HLA-DQB1 (DQ) both correlated well with HLA-typed DR,DQ haplotypes and differentiated HLA-DR,DQ fragments shared by three pairs of previously identified megabase-length conserved extended haplotypes. In a pedigree-based genetic association assay for type 1 diabetes, edited SNP haplotypes and HLA-typed HLA-DR,DQ haplotypes from the same families generated essentially identical qualitative and quantitative results. Therefore, this edited SNP haplotype method is useful for both genomic polymorphic architecture and genetic association evaluation using SNP markers with diverse minor allele frequencies.
Collapse
Affiliation(s)
- Zareen Vadva
- Program in Cellular and Molecular Medicine, Boston Children's Hospital, Boston, MA 02115, USA
| | - Charles E Larsen
- Program in Cellular and Molecular Medicine, Boston Children's Hospital, Boston, MA 02115, USA.
- Department of Pediatrics, Harvard Medical School, Boston, MA 02115, USA.
| | - Bennett E Propp
- Program in Cellular and Molecular Medicine, Boston Children's Hospital, Boston, MA 02115, USA
| | - Michael R Trautwein
- Program in Cellular and Molecular Medicine, Boston Children's Hospital, Boston, MA 02115, USA
| | - Dennis R Alford
- Program in Cellular and Molecular Medicine, Boston Children's Hospital, Boston, MA 02115, USA
| | - Chester A Alper
- Program in Cellular and Molecular Medicine, Boston Children's Hospital, Boston, MA 02115, USA.
- Department of Pediatrics, Harvard Medical School, Boston, MA 02115, USA.
| |
Collapse
|
9
|
Alper CA, Larsen CE, Trautwein MR, Alford DR. A stochastic epigenetic Mendelian oligogenic disease model for type 1 diabetes. J Autoimmun 2018; 96:123-133. [PMID: 30309752 DOI: 10.1016/j.jaut.2018.09.006] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2018] [Accepted: 09/12/2018] [Indexed: 01/14/2023]
Abstract
The incidence of type 1 diabetes (T1D) and some other complex diseases is increasing. The cause has been attributed to an undefined changing environment. We examine the role of the environment (or any changing non-genetic mechanism) in causing the rising incidence, and find much evidence against it: 1) Dizygotic twin T1D concordance is the same as siblings of patients in general; 2) If the environment is responsible for both the discordance among identical twins of patients with T1D and its rising incidence, the twin concordance rate should be rising, but it is not; 3) Migrants from high-to low-incidence countries continue to have high-incidence children; 4) TID incidence among the offspring of two T1D parents is identical to the monozygotic twin rate. On the other hand, genetic association studies of T1D have revealed strong susceptibility in the major histocompatibility complex and many optional additive genes of small effect throughout the human genome increasing T1D risk. We have, from an analysis of previously published family studies, developed a stochastic epigenetic Mendelian oligogenic (SEMO) model consistent with published observations. The model posits a few required recessive causal genes with incomplete penetrance explaining virtually all of the puzzling features of T1D, including its rising incidence and the specific low T1D incidence rates among first-degree relatives of patients. Since historic selection against any causal gene could prevent T1D, we postulate that the rising incidence is because of increasing population mixing of parents from some previously isolated populations that had selected against different causal genes.
Collapse
Affiliation(s)
- Chester A Alper
- Program in Cellular and Molecular Medicine, Boston Children's Hospital, 300 Longwood Avenue, Boston, MA, 02115, USA; Department of Pediatrics, Harvard Medical School, 25 Shattuck Street, Boston, MA, 02115, USA.
| | - Charles E Larsen
- Program in Cellular and Molecular Medicine, Boston Children's Hospital, 300 Longwood Avenue, Boston, MA, 02115, USA; Department of Pediatrics, Harvard Medical School, 25 Shattuck Street, Boston, MA, 02115, USA
| | - Michael R Trautwein
- Program in Cellular and Molecular Medicine, Boston Children's Hospital, 300 Longwood Avenue, Boston, MA, 02115, USA
| | - Dennis R Alford
- Program in Cellular and Molecular Medicine, Boston Children's Hospital, 300 Longwood Avenue, Boston, MA, 02115, USA
| |
Collapse
|
10
|
Lloyd SS, Steele EJ, Valenzuela JL, Dawkins RL. Haplotypes for Type, Degree, and Rate of Marbling in Cattle Are Syntenic with Human Muscular Dystrophy. Int J Genomics 2017; 2017:6532837. [PMID: 28913347 PMCID: PMC5585636 DOI: 10.1155/2017/6532837] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2017] [Accepted: 05/28/2017] [Indexed: 01/04/2023] Open
Abstract
Traditional analyses of a QTL on Bota 19 implicate a surfeit of candidates, but each is of marginal significance in explaining the deposition of healthy, low melting temperature fat within marbled muscle of Wagyu cattle. As an alternative approach, we have used genomic, multigenerational segregation to identify 14 conserved, ancestral 20 Mb haplotypes. These determine the degree and rate of marbling in Wagyu and other breeds of cattle. The melting temperature of intramuscular fat is highly heritable and traceable by haplotyping. Fortunately, for the production of healthy beef, some of these haplotypes are sufficiently penetrant to be expressed in heterozygous crossbreds, thereby allowing selection of sires which will improve the healthiness of beef produced under even harsh climatic conditions. The region of Bota 19 is syntenic to a region of Hosa 17 known to be important in muscle metabolism and in determining susceptibility to a form of human muscular dystrophy.
Collapse
Affiliation(s)
- Sally S. Lloyd
- CY O'Connor ERADE Village Foundation, P.O. Box 5100, Canning Vale South, WA 6155, Australia
- Melaleuka Stud, 24 Genomics Rise, Piara Waters, WA 6112, Australia
- Centre for Innovation in Agriculture, Murdoch University, Murdoch, WA 6150, Australia
| | - Edward J. Steele
- CY O'Connor ERADE Village Foundation, P.O. Box 5100, Canning Vale South, WA 6155, Australia
| | - Jose L. Valenzuela
- CY O'Connor ERADE Village Foundation, P.O. Box 5100, Canning Vale South, WA 6155, Australia
- Melaleuka Stud, 24 Genomics Rise, Piara Waters, WA 6112, Australia
| | - Roger L. Dawkins
- CY O'Connor ERADE Village Foundation, P.O. Box 5100, Canning Vale South, WA 6155, Australia
- Melaleuka Stud, 24 Genomics Rise, Piara Waters, WA 6112, Australia
- Centre for Innovation in Agriculture, Murdoch University, Murdoch, WA 6150, Australia
| |
Collapse
|
11
|
Abstract
A haplotype is a string of nucleotides or alleles at nearby loci on one chromosome, usually inherited as a unit. Within the major histocompatibility complex (MHC) region on human chromosome 6p, independent population studies of multiple families have identified conserved extended haplotypes (CEHs) that segregate as long stretches (≥1 megabase) of essentially identical DNA sequence at relatively high (≥0.5 %) population frequency ("genetic fixity"). CEHs were first identified through segregation analysis in the early 1980s. In European Caucasian populations, the most frequent 30 CEHs account for at least one-third of all MHC haplotypes. These CEHs provide all of the known individual MHC susceptibility and protective genetic markers within those populations for several complex genetic diseases. Haplotypes are rigorously determined directly by sequencing single chromosomes or by Mendelian segregation analysis using families with informative genotypes. Four parental haplotypes are assigned unambiguously using genotypes from the two parents and from two of their haploidentical (to each other) children. However, the most common current technique to phase haplotypes is probabilistic statistical imputation, using unrelated subjects. Such probabilistic techniques have failed to detect CEHs and are thus of questionable value in identifying long-range haplotype structure and, consequently, genetic structure-function relationships. Finally, with haplotypes rigorously defined, association studies can determine frequencies of alleles among unrelated patient haplotypes vs. those among only unaffected family members (i.e., control alleles/haplotypes). Such studies reduce, as much as possible, the confounding effects of population stratification common to all genetic studies.
Collapse
Affiliation(s)
- Chester A Alper
- Program in Cellular and Molecular Medicine, Boston Children's Hospital, CLS_03, 3 Blackfan Circle, Boston, MA, 02115, USA.
- Department of Pediatrics, Harvard Medical School, 25 Shattuck Street, Boston, MA, 02115, USA.
| | - Charles E Larsen
- Program in Cellular and Molecular Medicine, Boston Children's Hospital, CLS_03, 3 Blackfan Circle, Boston, MA, 02115, USA
- Department of Medicine, Harvard Medical School, 25 Shattuck Street, Boston, MA, 02115, USA
| |
Collapse
|
12
|
Intrahaplotypic Variants Differentiate Complex Linkage Disequilibrium within Human MHC Haplotypes. Sci Rep 2015; 5:16972. [PMID: 26593880 PMCID: PMC4655331 DOI: 10.1038/srep16972] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2015] [Accepted: 10/22/2015] [Indexed: 12/21/2022] Open
Abstract
Distinct regions of long-range genetic fixation in the human MHC region, known as conserved extended haplotypes (CEHs), possess unique genomic characteristics and are strongly associated with numerous diseases. While CEHs appear to be homogeneous by SNP analysis, the nature of fine variations within their genomic structure is unknown. Using multiple, MHC-homozygous cell lines, we demonstrate extensive sequence conservation in two common Asian MHC haplotypes: A33-B58-DR3 and A2-B46-DR9. However, characterization of phase-resolved MHC haplotypes revealed unique intra-CEH patterns of variation and uncovered 127 single nucleotide variants (SNVs) which are missing from public databases. We further show that the strong linkage disequilibrium structure within the human MHC that typically confounds precise identification of genetic features can be resolved using intra-CEH variants, as evidenced by rs3129063 and rs448489, which affect expression of ZFP57, a gene important in methylation and epigenetic regulation. This study demonstrates an improved strategy that can be used towards genetic dissection of diseases.
Collapse
|
13
|
Steele EJ, Lloyd SS. Soma-to-germline feedback is implied by the extreme polymorphism at IGHV relative to MHC: The manifest polymorphism of the MHC appears greatly exceeded at Immunoglobulin loci, suggesting antigen-selected somatic V mutants penetrate Weismann's Barrier. Bioessays 2015; 37:557-69. [PMID: 25810320 DOI: 10.1002/bies.201400213] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2014] [Revised: 02/15/2015] [Accepted: 02/24/2015] [Indexed: 01/22/2023]
Abstract
Soma-to-germline feedback is forbidden under the neo-Darwinian paradigm. Nevertheless, there is a growing realization it occurs frequently in immunoglobulin (Ig) variable (V) region genes. This is a surprising development. It arises from a most unlikely source in light of the exposure of co-author EJS to the haplotype data of RL Dawkins and others on the polymorphism of the Major Histocompatibility Complex, which is generally assumed to be the most polymorphic region in the genome (spanning ∼4 Mb). The comparison between the magnitude of MHC polymorphism with estimates for the human heavy chain immunoglobulin V locus (spanning ∼1 Mb), suggests IGHV could be many orders of magnitude more polymorphic than the MHC. This conclusion needs airing in the literature as it implies generational churn and soma-to-germline gene feedback. Pedigree-based experimental strategies to resolve the IGHV issue are outlined.
Collapse
Affiliation(s)
- Edward J Steele
- C.Y. O'Connor ERADE Village Foundation, Piara Waters, WA, Australia
| | | |
Collapse
|
14
|
Dominant sequences of human major histocompatibility complex conserved extended haplotypes from HLA-DQA2 to DAXX. PLoS Genet 2014; 10:e1004637. [PMID: 25299700 PMCID: PMC4191933 DOI: 10.1371/journal.pgen.1004637] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2013] [Accepted: 07/30/2014] [Indexed: 11/19/2022] Open
Abstract
We resequenced and phased 27 kb of DNA within 580 kb of the MHC class II region in 158 population chromosomes, most of which were conserved extended haplotypes (CEHs) of European descent or contained their centromeric fragments. We determined the single nucleotide polymorphism and deletion-insertion polymorphism alleles of the dominant sequences from HLA-DQA2 to DAXX for these CEHs. Nine of 13 CEHs remained sufficiently intact to possess a dominant sequence extending at least to DAXX, 230 kb centromeric to HLA-DPB1. We identified the regions centromeric to HLA-DQB1 within which single instances of eight “common” European MHC haplotypes previously sequenced by the MHC Haplotype Project (MHP) were representative of those dominant CEH sequences. Only two MHP haplotypes had a dominant CEH sequence throughout the centromeric and extended class II region and one MHP haplotype did not represent a known European CEH anywhere in the region. We identified the centromeric recombination transition points of other MHP sequences from CEH representation to non-representation. Several CEH pairs or groups shared sequence identity in small blocks but had significantly different (although still conserved for each separate CEH) sequences in surrounding regions. These patterns partly explain strong calculated linkage disequilibrium over only short (tens to hundreds of kilobases) distances in the context of a finite number of observed megabase-length CEHs comprising half a population's haplotypes. Our results provide a clearer picture of European CEH class II allelic structure and population haplotype architecture, improved regional CEH markers, and raise questions concerning regional recombination hotspots. The human major histocompatibility complex (MHC) is a gene-dense region highly enriched in immune response genes. MHC genetic variation is among the highest in the human genome and is associated with both tissue transplant compatibility and many genetic disorders. Long-range (1–3 Mb) MHC haplotypes of essentially identical DNA sequence at relatively high (≥0.5%) population frequency (“genetic fixity”), called conserved extended haplotypes (CEHs), comprise roughly half of all European population haplotypes. We sequenced an aggregate of 27 kb over 580 kb in the MHC class II region from HLA-DQA2 to DAXX in 158 European haplotypes to quantify the breakdown of this genetic fixity in the centromeric portion of the MHC and to determine the representative nature within that region of eight previously fully or nearly fully sequenced “common” European haplotypes. We identified the dominant sequences of 13 European CEHs and determined where the “common” sequences did (or did not) represent related CEHs. We found patterns of shared sequence identity among different CEHs surrounded by fixed (for each CEH) but differing sequence. Our direct observational results for population haplotypes explain the mutual occurrence of CEHs and short (5–200 kb) blocks of fixed sequence detected by the statistical measure of linkage disequilibrium.
Collapse
|
15
|
Abstract
Graft-versus-host disease (GVHD) is a potentially life-threatening complication of allogeneic hematopoietic cell transplantation. Many genes are presumed to be involved in GVHD, but the best characterized genetic system is that of the human major histocompatibility complex (MHC) located on chromosome 6. Among the hundreds of genes located within the MHC region, the best known and characterized are the classical HLA genes, HLA-A, C, B, DRB1, DQB1, and DPB1. They play a fundamental role in T cell immune responses, and HLA-A, C, and B also function as ligands for the natural killer cell immunoglobulin-like receptors involved in innate immunity. This review highlights the state-of-the art in the field of histocompatibility and immunogenetics of the MHC with respect to genetic risk factors for GVHD.
Collapse
|
16
|
Kawashima M, Ohashi J, Nishida N, Tokunaga K. Evolutionary analysis of classical HLA class I and II genes suggests that recent positive selection acted on DPB1*04:01 in Japanese population. PLoS One 2012; 7:e46806. [PMID: 23056460 PMCID: PMC3463557 DOI: 10.1371/journal.pone.0046806] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2012] [Accepted: 09/10/2012] [Indexed: 01/01/2023] Open
Abstract
The human leukocyte antigen (HLA) genes exhibit the highest degree of polymorphism in the human genome. This high degree of variation at classical HLA class I and class II loci has been maintained by balancing selection for a long evolutionary time. However, little is known about recent positive selection acting on specific HLA alleles in a local population. To detect the signature of recent positive selection, we genotyped six HLA loci, HLA-A, HLA-B, HLA-C, HLA-DRB1, HLA-DQB1, and HLA-DPB1 in 418 Japanese subjects, and then assessed the haplotype homozygosity (HH) of each HLA allele. There were 120 HLA alleles across the six loci. Among the 80 HLA alleles with frequencies of more than 1%, DPB1*04∶01, which had a frequency of 6.1%, showed exceptionally high HH (0.53). This finding raises the possibility that recent positive selection has acted on DPB1*04∶01. The DPB1*04∶01 allele, which was present in the most common 6-locus HLA haplotype (4.4%), A*33∶03-C*14∶03-B*44∶03-DRB1*13∶02-DQB1*06∶04-DPB1*04∶01, seems to have flowed from the Korean peninsula to the Japanese archipelago in the Yayoi period. A stochastic simulation approach indicated that the strong linkage disequilibrium between DQB1*06∶04 and DPB1*04∶01 observed in Japanese cannot be explained without positive selection favoring DPB1*04∶01. The selection coefficient of DPB1*04∶01 was estimated as 0.041 (95% credible interval 0.021–0.077). Our results suggest that DPB1*04∶01 has recently undergone strong positive selection in Japanese population.
Collapse
Affiliation(s)
- Minae Kawashima
- Department of Human Genetics, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan
- * E-mail: (MK); (JO)
| | - Jun Ohashi
- Molecular and Genetic Epidemiology, Faculty of Medicine, University of Tsukuba, Ibaraki, Japan
- * E-mail: (MK); (JO)
| | - Nao Nishida
- Department of Human Genetics, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan
- The Research Center for Hepatitis and Immunology, National Center for Global Health and Medicine, Ichikawa, Chiba, Japan
| | - Katsushi Tokunaga
- Department of Human Genetics, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan
| |
Collapse
|
17
|
Kingery SE, Wu YL, Zhou B, Hoffman RP, Yu CY. Gene CNVs and protein levels of complement C4A and C4B as novel biomarkers for partial disease remissions in new-onset type 1 diabetes patients. Pediatr Diabetes 2012; 13:408-18. [PMID: 22151770 PMCID: PMC4178531 DOI: 10.1111/j.1399-5448.2011.00836.x] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/21/2011] [Accepted: 10/17/2011] [Indexed: 12/30/2022] Open
Abstract
OBJECTIVE To determine the roles of complement C4A and C4B gene copy-number variations and their plasma protein concentrations in residual insulin secretion and loss of pancreatic β-cell function in new-onset type 1 diabetes (T1D) patients. METHODS We studied 34 patients of European ancestry with new-onset T1D, aged between 3 and 17 yr (10.7 ± 3.45), at Nationwide Children's Hospital in Columbus, Ohio. Gene copy-number and size variations of complement C4A and C4B were determined by genomic Southern blot analyses. C4A and C4B protein phenotypes were elucidated by immunofixation and radial immunodiffusion. Two-digit human leukocyte antigen (HLA)-DRB1 genotypes were determined by sequence-specific polymerase chain reaction. At 1- and 9-month post diagnosis, stimulated C-peptide levels were measured after a standardized mixed-meal tolerance test. RESULTS The diploid gene copy-numbers of C4A varied from 0 to 4, and those of C4B from 0 to 3. Patients with higher copy-number of C4A or higher C4A plasma protein concentrations at diagnosis had higher C-peptide levels at 1-month post diagnosis (p = 0.008; p = 0.008). When controlled by the Z-score of body mass index, C4A copy-numbers, C4A protein concentrations, the age of disease onset, and the number of HLA-DR3 but not DR4 alleles were significant parameters in determining C-peptide levels. At 9-month post diagnosis, 42.3% of patients remained in partial remission, and these patients were characterized by lower total C4B copy-numbers or lower C4B protein concentrations (p = 0.02; p = 0.0004). CONCLUSIONS C4A appears to associate with the protection of residual β-cell function in new-onset T1D; C4B is correlated with the end of disease remission at 9-month post diagnosis.
Collapse
Affiliation(s)
- Suzanne E. Kingery
- Center for Molecular and Human Genetics, The Research Institute at Nationwide Children's Hospital, Columbus, Ohio 43205,Division of Endocrinology, Nationwide Children's Hospital, Columbus, Ohio 43205
| | - Yee Ling Wu
- Center for Molecular and Human Genetics, The Research Institute at Nationwide Children's Hospital, Columbus, Ohio 43205
| | - Bi Zhou
- Center for Molecular and Human Genetics, The Research Institute at Nationwide Children's Hospital, Columbus, Ohio 43205
| | - Robert P. Hoffman
- Division of Endocrinology, Nationwide Children's Hospital, Columbus, Ohio 43205,Department of Pediatrics, The Ohio State University, Columbus, Ohio 43205
| | - C. Yung Yu
- Center for Molecular and Human Genetics, The Research Institute at Nationwide Children's Hospital, Columbus, Ohio 43205,Department of Pediatrics, The Ohio State University, Columbus, Ohio 43205
| |
Collapse
|
18
|
Baschal EE, Jasinski JM, Boyle TA, Fain PR, Eisenbarth GS, Siebert JC. Congruence as a measurement of extended haplotype structure across the genome. J Transl Med 2012; 10:32. [PMID: 22369243 PMCID: PMC3310717 DOI: 10.1186/1479-5876-10-32] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2011] [Accepted: 02/27/2012] [Indexed: 02/01/2023] Open
Abstract
Background Historically, extended haplotypes have been defined using only a few data points, such as alleles for several HLA genes in the MHC. High-density SNP data, and the increasing affordability of whole genome SNP typing, creates the opportunity to define higher resolution extended haplotypes. This drives the need for new tools that support quantification and visualization of extended haplotypes as defined by as many as 2000 SNPs. Confronted with high-density SNP data across the major histocompatibility complex (MHC) for 2,300 complete families, compiled by the Type 1 Diabetes Genetics Consortium (T1DGC), we developed software for studying extended haplotypes. Methods The software, called ExHap (Extended Haplotype), uses a similarity measurement we term congruence to identify and quantify long-range allele identity. Using ExHap, we analyzed congruence in both the T1DGC data and family-phased data from the International HapMap Project. Results Congruent chromosomes from the T1DGC data have between 96.5% and 99.9% allele identity over 1,818 SNPs spanning 2.64 megabases of the MHC (HLA-DRB1 to HLA-A). Thirty-three of 132 DQ-DR-B-A defined haplotype groups have > 50% congruent chromosomes in this region. For example, 92% of chromosomes within the DR3-B8-A1 haplotype are congruent from HLA-DRB1 to HLA-A (99.8% allele identity). We also applied ExHap to all 22 autosomes for both CEU and YRI cohorts from the International HapMap Project, identifying multiple candidate extended haplotypes. Conclusions Long-range congruence is not unique to the MHC region. Patterns of allele identity on phased chromosomes provide a simple, straightforward approach to visually and quantitatively inspect complex long-range structural patterns in the genome. Such patterns aid the biologist in appreciating genetic similarities and differences across cohorts, and can lead to hypothesis generation for subsequent studies.
Collapse
Affiliation(s)
- Erin E Baschal
- Barbara Davis Center for Childhood Diabetes, University of Colorado Denver, Denver, CO 80045, USA
| | | | | | | | | | | |
Collapse
|
19
|
Williamson JF, McLure CA, Guymer RH, Baird PN, Millman J, Cantsilieris S, Dawkins RL. Almost total protection from age-related macular degeneration by haplotypes of the Regulators of Complement Activation. Genomics 2011; 98:412-21. [PMID: 21855625 DOI: 10.1016/j.ygeno.2011.08.002] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2011] [Revised: 07/25/2011] [Accepted: 08/01/2011] [Indexed: 11/16/2022]
Abstract
Age-related macular degeneration (AMD) is the leading cause of blindness in developed countries. It has been proposed that the polymorphism encoding Y402H (T1277C) in the complement factor H gene (CFH) is one of the main determinants of disease. We genotyped the polymorphism at a number of loci in the region encompassing the Regulators of Complement Activation (RCA) on chromosome 1, including T1277C SNP, in 187 patients and 146 controls. Haplotypes have been classified as protective (P) or susceptible (S) with respect to AMD. This included the identification of an S haplotype with a T at 1277. The results show that no single locus should be assumed to be directly responsible for AMD, but rather argue for the existence of RCA haplotypes, which can be assigned meaningful predictive values for AMD. We conclude that the critical sequences are within a region 450 kb centromeric to 128 kb telomeric of CFH.
Collapse
Affiliation(s)
- Joseph F Williamson
- C.Y. O'Connor ERADE Village Foundation, Canning Vale, Western Australia, Australia
| | | | | | | | | | | | | |
Collapse
|
20
|
Crespi BJ, Thiselton DL. Comparative immunogenetics of autism and schizophrenia. GENES BRAIN AND BEHAVIOR 2011; 10:689-701. [DOI: 10.1111/j.1601-183x.2011.00710.x] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
21
|
Steele EJ, Williamson JF, Lester S, Stewart BJ, Millman JA, Carnegie P, Lindley RA, Pain GN, Dawkins RL. Genesis of ancestral haplotypes: RNA modifications and reverse transcription-mediated polymorphisms. Hum Immunol 2010; 72:283-293.e1. [PMID: 21156194 DOI: 10.1016/j.humimm.2010.12.005] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2010] [Revised: 11/15/2010] [Accepted: 12/06/2010] [Indexed: 11/30/2022]
Abstract
Understanding the genesis of the block haplotype structure of the genome is a major challenge. With the completion of the sequencing of the Human Genome and the initiation of the HapMap project the concept that the chromosomes of the mammalian genome are a mosaic, or patchwork, of conserved extended block haplotype sequences is now accepted by the mainstream genomics research community. Ancestral Haplotypes (AHs) can be viewed as a recombined string of smaller Polymorphic Frozen Blocks (PFBs). How have such variant extended DNA sequence tracts emerged in evolution? Here the relevant literature on the problem is reviewed from various fields of molecular and cell biology particularly molecular immunology and comparative and functional genomics. Based on our synthesis we then advance a testable molecular and cellular model. A critical part of the analysis concerns the origin of the strand biased mutation signatures in the transcribed regions of the human and higher primate genome, A-to-G versus T-to-C (ratio ∼ 1.5 fold) and C-to-T versus G-to-A (≥ 1.5 fold). A comparison and evaluation of the current state of the fields of immunoglobulin Somatic Hypermutation (SHM) and Transcription-Coupled DNA Repair focused on how mutations in newly synthesized RNA might be copied back to DNA thus accounting for some of the genome-wide strand biases (e.g., the A-to-G vs T-to-C component of the strand biased spectrum). We hypothesize that the genesis of PFBs and extended AHs occurs during mutagenic episodes in evolution (e.g., retroviral infections) and that many of the critical DNA sequence diversifying events occur first at the RNA level, e.g., recombination between RNA strings resulting in tandem and dispersed RNA duplications (retroduplications), RNA mutations via adenosine-to-inosine pre-mRNA editing events as well as error prone RNA synthesis. These are then copied back into DNA by a cellular reverse transcription process (also likely to be error-prone) that we have called "reverse transcription-mediated long DNA conversion." Finally we suggest that all these activities and others can be envisaged as being brought physically under the umbrella of special sites in the nucleus involved in transcription known as "transcription factories."
Collapse
Affiliation(s)
- Edward J Steele
- C.Y O'Connor ERADE Village Foundation, Canning Vale, Western Australia, Australia.
| | | | | | | | | | | | | | | | | |
Collapse
|
22
|
Fernando MM, Boteva L, Morris DL, Zhou B, Wu YL, Lokki ML, Yu CY, Rioux JD, Hollox EJ, Vyse TJ. Assessment of complement C4 gene copy number using the paralog ratio test. Hum Mutat 2010; 31:866-74. [PMID: 20506482 PMCID: PMC3567757 DOI: 10.1002/humu.21259] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
The complement C4 locus is in the class III region of the MHC, and exhibits copy number variation. Complement C4 null alleles have shown association with a number of diseases including systemic lupus erythematosus (SLE). However, most studies to date have used protein immunophenotyping and not direct interrogation of the genome to determine C4 null allele status. Moreover, a lack of accurate C4 gene copy number (GCN) estimation and tight linkage disequilibrium across the disease-associated MHC haplotypes has confounded attempts to establish whether or not these associations are causal. We have therefore developed a high throughput paralog ratio test (PRT) in association with two restriction enzyme digest variant ratio tests (REDVRs) to determine total C4 GCN, C4A GCN, and C4B GCN. In the densely genotyped CEU cohort we show that this method is accurate and reproducible when compared to gold standard Southern blot copy number estimation with a discrepancy rate of 9%. We find a broad range of C4 GCNs in the CEU and the 1958 British Birth Cohort populations under study. In addition, SNP-C4 CNV analyses show only moderate levels of correlation and therefore do not support the use of SNP genotypes as proxies for complement C4 GCN.
Collapse
Affiliation(s)
- Michelle M.A. Fernando
- Section of Rheumatology, Faculty of Medicine, Imperial College London, London, United Kingdom
| | - Lora Boteva
- Section of Rheumatology, Faculty of Medicine, Imperial College London, London, United Kingdom
| | - David L. Morris
- Section of Rheumatology, Faculty of Medicine, Imperial College London, London, United Kingdom
| | - Bi Zhou
- Center for Molecular and Human Genetics, The Research Institute at Nationwide Children’s Hospital, Columbus, Ohio
| | - Yee Ling Wu
- Center for Molecular and Human Genetics, The Research Institute at Nationwide Children’s Hospital, Columbus, Ohio
| | - Marja-Liisa Lokki
- Transplantation Laboratory, Haartman Institute, University of Helsinki, Helsinki, Finland
| | - Chack Yung Yu
- Center for Molecular and Human Genetics, The Research Institute at Nationwide Children’s Hospital, Columbus, Ohio
| | - John D. Rioux
- Montréal Heart Institute, Montréal, Quebec, Canada
- Université de Montréal, Montréal, Quebec, Canada
| | - Edward J. Hollox
- Department of Genetics, University of Leicester, Leicester, United Kingdom
| | - Timothy J. Vyse
- Section of Rheumatology, Faculty of Medicine, Imperial College London, London, United Kingdom
| |
Collapse
|
23
|
Szilágyi Á, Bánlaki Z, Pozsonyi É, Yunis EJ, Awdeh ZL, Hossó A, Rajczy K, Larsen CE, Fici DA, Alper CA, Füst G. Frequent occurrence of conserved extended haplotypes (CEHs) in two Caucasian populations. Mol Immunol 2010; 47:1899-904. [DOI: 10.1016/j.molimm.2010.03.013] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2010] [Revised: 03/17/2010] [Accepted: 03/18/2010] [Indexed: 10/19/2022]
|
24
|
Mohammadi J, Ramanujam R, Jarefors S, Rezaei N, Aghamohammadi A, Gregersen PK, Hammarström L. IgA deficiency and the MHC: assessment of relative risk and microheterogeneity within the HLA A1 B8, DR3 (8.1) haplotype. J Clin Immunol 2010; 30:138-43. [PMID: 19834793 PMCID: PMC11292587 DOI: 10.1007/s10875-009-9336-2] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2009] [Accepted: 09/24/2009] [Indexed: 10/20/2022]
Abstract
INTRODUCTION Selective IgA deficiency (IgAD; serum IgA concentration of <0.07 g/l) is the most common primary immunodeficiency in Caucasians with an estimated prevalence of 1/600. The frequency of the extended major histocompatibility complex haplotype HLA A1, B8, DR3, DQ2 (the "8.1" haplotype) is increased among patients with IgAD. MATERIALS AND METHODS We carried out a direct measurement of the relative risk of homozygosity of the 8.1 haplotype for IgA deficiency in a population-based sample of 117 B8, DR3 homozygous individuals. RESULTS AND DISCUSSION IgA deficiency was found to be present in 2 of 117 (1.7%) of these subjects, a figure that is concordant with estimates of relative risk from large case-control studies in the Swedish population. These data are consistent with a multiplicative model for the 8.1 haplotype contribution to IgA deficiency and contrasts with prior studies, suggesting a much higher risk for 8.1 homozygosity. Using a dense single nucleotide polymorphism marker analysis of the MHC region in HLA B8, DR3, DQ2 homozygous individuals, we did not observe consistent differences between cases (n = 26) and controls (n = 24). Overall, our results do not support the hypothesis that IgA deficiency is associated with a distinct subgroup of 8.1 related haplotypes, but rather indicate that risk is conferred by the common 8.1 haplotype acting in multiplicative manner.
Collapse
Affiliation(s)
- Javad Mohammadi
- Division of Clinical Immunology, Department of Laboratory Medicine, Karolinska Institutet at Karolinska University Hospital Huddinge, Stockholm, Sweden
| | | | | | | | | | | | | |
Collapse
|
25
|
A critical assessment of the factors affecting reporter gene assays for promoter SNP function: a reassessment of -308 TNF polymorphism function using a novel integrated reporter system. Eur J Hum Genet 2009; 17:1454-62. [PMID: 19471307 DOI: 10.1038/ejhg.2009.80] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
One of the greatest challenges facing genetics is the development of strategies to identify functionally relevant genetic variation. The most common test of function is the reporter gene assay, in which allelic regulatory regions are used to drive the expression of a reporter gene, and differences in expression in a cell line after transient transfection are taken to be a reflection of the polymorphism. Many studies have reported small differences in single nucleotide polymorphism (SNP)-specific reporter activity, including the tumor necrosis factor (TNF) G-308A polymorphism. However, we have established that many variables inherent in the reporter gene approach can account for the reported allelic differences. Variables, such as the amount of DNA used in transfection, the amount of transfection control vector used, the method of transfection, the growth history of the host cells, and the quality and purity of DNA used, all influence TNF -308 SNP-specific transient reporter gene assays and serve as a caution for those researchers who apply this method to the functional assessment of polymorphic promoter sequences. We have developed an integrated reporter system that obviates some of these problems and shows that the TNF G-308A polymorphism is functionally relevant in this improved assay, thus confirming that the -308A allele expresses at a higher level compared with the -308G allele.
Collapse
|
26
|
Baschal EE, Aly TA, Jasinski JM, Steck AK, Noble JA, Erlich HA, Eisenbarth GS. Defining multiple common "completely" conserved major histocompatibility complex SNP haplotypes. Clin Immunol 2009; 132:203-14. [PMID: 19427271 DOI: 10.1016/j.clim.2009.03.530] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2008] [Revised: 03/19/2009] [Accepted: 03/20/2009] [Indexed: 10/20/2022]
Abstract
The availability of both HLA data and genotypes for thousands of SNPs across the major histocompatibility complex (MHC) in 1240 complete families of the Type 1 Diabetes Genetics Consortium allowed us to analyze the occurrence and extent of megabase contiguous identity for founder chromosomes from unrelated individuals. We identified 82 HLA-defined haplotype groups, and within these groups, megabase regions of SNP identity were readily apparent. The conserved chromosomes within the 82 haplotype groups comprise approximately one third of the founder chromosomes. It is currently unknown whether such frequent conservation for groups of unrelated individuals is specific to the MHC, or if initial binning by highly polymorphic HLA alleles facilitated detection of a more general phenomenon within the MHC. Such common identity, specifically across the MHC, impacts type 1 diabetes susceptibility and may impact transplantation between unrelated individuals.
Collapse
Affiliation(s)
- Erin E Baschal
- Barbara Davis Center for Childhood Diabetes, University of Colorado Denver, Building M20, Aurora, CO 80045-6511, USA
| | | | | | | | | | | | | | | |
Collapse
|
27
|
Philipsen M, Kristensen B. Preliminary evidence of segregation distortion in the SLA system. ANIMAL BLOOD GROUPS AND BIOCHEMICAL GENETICS 2009; 16:125-33. [PMID: 4037425 DOI: 10.1111/j.1365-2052.1985.tb01460.x] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
The segregation of 11 well-defined SLA haplotypes was investigated in 40 Land-race and 48 Large White Danish and Swiss litters. Among the 11 haplotypes, the segregation of one (SLA 20-8.2.11) deviated significantly from the expected 1:1 segregation ratio in back-cross families. Tests indicated that these families were homogeneous with respect to segregation distortion, although the distortion was more pronounced in litters sired by heterozygous Danish boars than by heterozygous Swiss boars and Danish and Swiss sows. The data presented do not allow any definite conclusion about the cause of the segregation distortion. The possibility of the distortion being caused either by a complex similar to the T/t-complex found in mouse and contemplated in man or directly by the SLA region is discussed.
Collapse
|
28
|
Saxena K, Kitzmiller KJ, Wu YL, Zhou B, Esack N, Hiremath L, Chung EK, Yang Y, Yu CY. Great genotypic and phenotypic diversities associated with copy-number variations of complement C4 and RP-C4-CYP21-TNX (RCCX) modules: a comparison of Asian-Indian and European American populations. Mol Immunol 2009; 46:1289-303. [PMID: 19135723 PMCID: PMC2716727 DOI: 10.1016/j.molimm.2008.11.018] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2008] [Accepted: 11/22/2008] [Indexed: 01/26/2023]
Abstract
Inter-individual gene copy-number variations (CNVs) probably afford human populations the flexibility to respond to a variety of environmental challenges, but also lead to differential disease predispositions. We investigated gene CNVs for complement component C4 and steroid 21-hydroxylase from the RP-C4-CYP21-TNX (RCCX) modules located in the major histocompatibility complex among healthy Asian-Indian Americans (AIA) and compared them to European Americans. A combination of definitive techniques that yielded cross-confirmatory results was used. The medium gene copy-numbers for C4 and its isotypes, acidic C4A and basic C4B, were 4, 2 and 2, respectively, but their frequencies were only 53-56%. The distribution patterns for total C4 and C4A are skewed towards the high copy-number side. For example, the frequency of AIA-subjects with three copies of C4A (30.7%) was 3.92-fold of those with a single copy (7.83%). The monomodular-short haplotype with a single C4B gene and the absence of C4A, which is in linkage-disequilibrium with HLA DRB1*0301 in Europeans and a strong risk factor for autoimmune diseases, has a frequency of 0.012 in AIA but 0.106 among healthy European Americans (p=6.6x10(-8)). The copy-number and the size of C4 genes strongly determine the plasma C4 protein concentrations. Parallel variations in copy-numbers of CYP21A (CYP21A1P) and TNXA with total C4 were also observed. Notably, 13.1% of AIA-subjects had three copies of the functional CYP21B, which were likely generated by recombinations between monomodular and bimodular RCCX haplotypes. The high copy-numbers of C4 and the high frequency of RCCX recombinants offer important insights to the prevalence of autoimmune and genetic diseases.
Collapse
Affiliation(s)
- Kapil Saxena
- Center for Molecular and Human Genetics, The Research Institute at Nationwide Children’s Hospital, 700 Children’s Drive, Columbus Ohio 43205
| | - Kathryn J. Kitzmiller
- Center for Molecular and Human Genetics, The Research Institute at Nationwide Children’s Hospital, 700 Children’s Drive, Columbus Ohio 43205
- Department of Pediatrics, The Ohio State University, Columbus, Ohio
| | - Yee Ling Wu
- Center for Molecular and Human Genetics, The Research Institute at Nationwide Children’s Hospital, 700 Children’s Drive, Columbus Ohio 43205
- Department of Pediatrics, The Ohio State University, Columbus, Ohio
| | - Bi Zhou
- Center for Molecular and Human Genetics, The Research Institute at Nationwide Children’s Hospital, 700 Children’s Drive, Columbus Ohio 43205
| | - Nazreen Esack
- Center for Molecular and Human Genetics, The Research Institute at Nationwide Children’s Hospital, 700 Children’s Drive, Columbus Ohio 43205
| | - Leena Hiremath
- Department of Internal Medicine, The Ohio State University, Columbus, Ohio
| | - Erwin K. Chung
- Center for Molecular and Human Genetics, The Research Institute at Nationwide Children’s Hospital, 700 Children’s Drive, Columbus Ohio 43205
- Department of Molecular Virology, Immunology and Medical Genetics, The Ohio State University, Columbus, Ohio
| | - Yan Yang
- Center for Molecular and Human Genetics, The Research Institute at Nationwide Children’s Hospital, 700 Children’s Drive, Columbus Ohio 43205
- Department of Molecular Virology, Immunology and Medical Genetics, The Ohio State University, Columbus, Ohio
| | - C. Yung Yu
- Center for Molecular and Human Genetics, The Research Institute at Nationwide Children’s Hospital, 700 Children’s Drive, Columbus Ohio 43205
- Department of Pediatrics, The Ohio State University, Columbus, Ohio
- Department of Molecular Virology, Immunology and Medical Genetics, The Ohio State University, Columbus, Ohio
| |
Collapse
|
29
|
Wu Y, Yang Y, Chung E, Zhou B, Kitzmiller K, Savelli S, Nagaraja H, Birmingham D, Tsao B, Rovin B, Hebert L, Yu C. Phenotypes, genotypes and disease susceptibility associated with gene copy number variations: complement C4 CNVs in European American healthy subjects and those with systemic lupus erythematosus. Cytogenet Genome Res 2009; 123:131-41. [PMID: 19287147 PMCID: PMC2709077 DOI: 10.1159/000184700] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/13/2008] [Indexed: 08/25/2023] Open
Abstract
A new paradigm in human genetics is high frequencies of inter-individual variations in copy numbers of specific genomic DNA segments. Such common copy number variation (CNV) loci often contain genes engaged in host-environment interaction including those involved in immune effector functions. DNA sequences within a CNV locus often share a high degree of identity but beneficial or deleterious polymorphic variants are present among different individuals. Thus, common gene CNVs can contribute, both qualitatively and quantitatively, to a spectrum of phenotypic variants. In this review we describe the phenotypic and genotypic diversities of complement C4 created by copy number variations of RCCX modules (RP-C4-CYP21-TNX) and size dichotomy of C4 genes. A direct outcome of C4 CNV is the generation of two classes of polymorphic proteins, C4A and C4B, with differential chemical reactivities towards peptide or carbohydrate antigens, and a range of C4 plasma protein concentrations (from 15 to 70 mg/dl) among healthy subjects. Deliberate molecular genetic studies enabled development of definitive techniques to determine exact patterns of RCCX modular variations, copy numbers of long and short C4A and C4B genes by Southern blot analyses or by real-time quantitative PCR. It is found that in healthy European Americans, the total C4 gene copy number per diploid genome ranges from 2 to 6: 60.8% of people with four copies of C4 genes, 27.2% with less than four copies, and 12% with more than four copies. Such a distribution is skewed towards the low copy number side in patients with systemic lupus erythematosus (SLE), a prototypic autoimmune disease with complex etiology. In SLE, the frequency of individuals with less than four copies of C4 is significantly increased (42.2%), while the frequency of those with more than four copies is decreased (6%). This decrease in total C4 gene copy number in SLE is due to increases in homozygous and heterozygous deficiencies of C4A but not C4B. Therefore, it is concluded that lower copy number of C4 is a risk factor for and higher gene copy number of C4 is a protective factor against SLE disease susceptibility.
Collapse
Affiliation(s)
- Y.L. Wu
- Center for Molecular and Human Genetics, The Research Institute at Nationwide Children's Research Institute, Columbus, OH (USA)
- Integrated Biomedical Science Graduate Program, The Ohio State University, Columbus, OH (USA)
| | - Y. Yang
- Center for Molecular and Human Genetics, The Research Institute at Nationwide Children's Research Institute, Columbus, OH (USA)
| | - E.K. Chung
- Center for Molecular and Human Genetics, The Research Institute at Nationwide Children's Research Institute, Columbus, OH (USA)
| | - B. Zhou
- Center for Molecular and Human Genetics, The Research Institute at Nationwide Children's Research Institute, Columbus, OH (USA)
| | - K.J. Kitzmiller
- Center for Molecular and Human Genetics, The Research Institute at Nationwide Children's Research Institute, Columbus, OH (USA)
- Integrated Biomedical Science Graduate Program, The Ohio State University, Columbus, OH (USA)
| | - S.L. Savelli
- Center for Molecular and Human Genetics, The Research Institute at Nationwide Children's Research Institute, Columbus, OH (USA)
- Department of Pediatrics, The Ohio State University, Columbus, OH (USA)
| | - H.N. Nagaraja
- Department of Statistics, The Ohio State University, Columbus, OH (USA)
| | - D.J. Birmingham
- Integrated Biomedical Science Graduate Program, The Ohio State University, Columbus, OH (USA)
- Department of Internal Medicine, The Ohio State University, Columbus, OH (USA)
| | - B.P. Tsao
- University of California, Los Angeles, CA (USA)
| | - B.H. Rovin
- Department of Internal Medicine, The Ohio State University, Columbus, OH (USA)
| | - L.A. Hebert
- Department of Internal Medicine, The Ohio State University, Columbus, OH (USA)
| | - C.Y. Yu
- Center for Molecular and Human Genetics, The Research Institute at Nationwide Children's Research Institute, Columbus, OH (USA)
- Integrated Biomedical Science Graduate Program, The Ohio State University, Columbus, OH (USA)
- Department of Pediatrics, The Ohio State University, Columbus, OH (USA)
| |
Collapse
|
30
|
Baschal EE, Aly TA, Jasinski JM, Steck AK, Johnson KN, Noble JA, Erlich HA, Eisenbarth GS. The frequent and conserved DR3-B8-A1 extended haplotype confers less diabetes risk than other DR3 haplotypes. Diabetes Obes Metab 2009; 11 Suppl 1:25-30. [PMID: 19143812 PMCID: PMC2769935 DOI: 10.1111/j.1463-1326.2008.01000.x] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
AIM The goal of this study was to develop and implement methodology that would aid in the analysis of extended high-density single nucleotide polymorphism (SNP) major histocompatibility complex (MHC) haplotypes combined with human leucocyte antigen (HLA) alleles in relation to type 1 diabetes risk. METHODS High-density SNP genotype data (2918 SNPs) across the MHC from the Type 1 Diabetes Genetics Consortium (1240 families), in addition to HLA data, were processed into haplotypes using PedCheck and Merlin, and extended DR3 haplotypes were analysed. RESULTS With this large dense set of SNPs, the conservation of DR3-B8-A1 (8.1) haplotypes spanned the MHC (>/=99% SNP identity). Forty-seven individuals homozygous for the 8.1 haplotype also shared the same homozygous genotype at four 'sentinel' SNPs (rs2157678 'T', rs3130380 'A', rs3094628 'C' and rs3130352 'T'). Conservation extended from HLA-DQB1 to the telomeric end of the SNP panels (3.4 Mb total). In addition, we found that the 8.1 haplotype is associated with lower risk than other DR3 haplotypes by both haplotypic and genotypic analyses [haplotype: p = 0.009, odds ratio (OR) = 0.65; genotype: p = 6.3 x 10(-5), OR = 0.27]. The 8.1 haplotype (from genotypic analyses) is associated with lower risk than the high-risk DR3-B18-A30 haplotype (p = 0.01, OR = 0.23), but the DR3-B18-A30 haplotype did not differ from other non-8.1 DR3 haplotypes relative to diabetes association. CONCLUSION The 8.1 haplotype demonstrates extreme conservation (>3.4 Mb) and is associated with significantly lower risk for type 1 diabetes than other DR3 haplotypes.
Collapse
Affiliation(s)
- E E Baschal
- Barbara Davis Center for Childhood Diabetes, Department of Pediatrics, University of Colorado Denver, Aurora, CO, USA.
| | | | | | | | | | | | | | | |
Collapse
|
31
|
HEDRICK PHILIPW, THOMSON GLENYS, KLITZ WILLIAM. Evolutionary genetics and HLA: another classic example. Biol J Linn Soc Lond 2008. [DOI: 10.1111/j.1095-8312.1987.tb01996.x] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
32
|
Baschal EE, Eisenbarth GS. Extreme genetic risk for type 1A diabetes in the post-genome era. J Autoimmun 2008; 31:1-6. [PMID: 18450419 DOI: 10.1016/j.jaut.2008.03.003] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2008] [Revised: 03/19/2008] [Accepted: 03/19/2008] [Indexed: 02/07/2023]
Abstract
A series of genes and loci influencing the genetic risk of type 1A (immune-mediated) diabetes are now well characterized. These include genes of the major histocompatibility complex (MHC), polymorphisms 5' of the insulin gene, and PTPN22, as well as more recently defined loci from genome-wide association studies. By far the major determinants of risk for type 1A diabetes are genes within or linked to the MHC and in particular alleles of class II genes (HLA-DR, DQ, and DP). There is evidence that MHC class I alleles contribute and there are additional MHC-linked influences such that for a major subset of relatives of patients there is a risk as high as 80% for siblings, and for the general population a risk as high as 20% can be defined at birth just by analyzing the MHC. We believe the search for additional MHC loci will require analysis of the remarkable long-range identity (up to 9 million base pairs) of extended MHC haplotypes. Current prediction algorithms will likely be greatly improved for the general population when the additional contributing loci of the MHC are defined.
Collapse
Affiliation(s)
- Erin E Baschal
- Barbara Davis Center for Childhood Diabetes, University of Colorado Health Sciences Center, Aurora, CO 80045-6511, USA
| | | |
Collapse
|
33
|
Fernando MMA, Stevens CR, Walsh EC, De Jager PL, Goyette P, Plenge RM, Vyse TJ, Rioux JD. Defining the role of the MHC in autoimmunity: a review and pooled analysis. PLoS Genet 2008; 4:e1000024. [PMID: 18437207 PMCID: PMC2291482 DOI: 10.1371/journal.pgen.1000024] [Citation(s) in RCA: 387] [Impact Index Per Article: 24.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
The major histocompatibility complex (MHC) is one of the most extensively studied regions in the human genome because of the association of variants at this locus with autoimmune, infectious, and inflammatory diseases. However, identification of causal variants within the MHC for the majority of these diseases has remained difficult due to the great variability and extensive linkage disequilibrium (LD) that exists among alleles throughout this locus, coupled with inadequate study design whereby only a limited subset of about 20 from a total of approximately 250 genes have been studied in small cohorts of predominantly European origin. We have performed a review and pooled analysis of the past 30 years of research on the role of the MHC in six genetically complex disease traits - multiple sclerosis (MS), type 1 diabetes (T1D), systemic lupus erythematosus (SLE), ulcerative colitis (UC), Crohn's disease (CD), and rheumatoid arthritis (RA) - in order to consolidate and evaluate the current literature regarding MHC genetics in these common autoimmune and inflammatory diseases. We corroborate established MHC disease associations and identify predisposing variants that previously have not been appreciated. Furthermore, we find a number of interesting commonalities and differences across diseases that implicate both general and disease-specific pathogenetic mechanisms in autoimmunity.
Collapse
Affiliation(s)
- Michelle M. A. Fernando
- Section of Molecular Genetics and Rheumatology, Faculty of Medicine, Imperial College London, London, United Kingdom
| | - Christine R. Stevens
- Program in Medical and Population Genetics, Broad Institute, Massachusetts Institute of Technology and Harvard University, Cambridge, Massachusetts, United States of America
| | - Emily C. Walsh
- Program in Medical and Population Genetics, Broad Institute, Massachusetts Institute of Technology and Harvard University, Cambridge, Massachusetts, United States of America
| | - Philip L. De Jager
- Program in Medical and Population Genetics, Broad Institute, Massachusetts Institute of Technology and Harvard University, Cambridge, Massachusetts, United States of America
- Department of Neurology, Center for Neurologic Diseases, Brigham and Women's Hospital and Harvard Medical School, Boston, Massachusetts, United States of America
- Harvard Medical School/Partners Healthcare Center for Genetics and Genomics, Boston, Massachusetts, United States of America
| | - Philippe Goyette
- Université de Montréal, Montréal Heart Institute, Montréal, Québec, Canada
| | - Robert M. Plenge
- Program in Medical and Population Genetics, Broad Institute, Massachusetts Institute of Technology and Harvard University, Cambridge, Massachusetts, United States of America
- Harvard Medical School, Division of Rheumatology, Allergy and Immunology, Boston, Massachusetts, United States of America
| | - Timothy J. Vyse
- Section of Molecular Genetics and Rheumatology, Faculty of Medicine, Imperial College London, London, United Kingdom
| | - John D. Rioux
- Program in Medical and Population Genetics, Broad Institute, Massachusetts Institute of Technology and Harvard University, Cambridge, Massachusetts, United States of America
- Université de Montréal, Montréal Heart Institute, Montréal, Québec, Canada
| |
Collapse
|
34
|
Aly TA, Baschal EE, Jahromi MM, Fernando MS, Babu SR, Fingerlin TE, Kretowski A, Erlich HA, Fain PR, Rewers MJ, Eisenbarth GS. Analysis of single nucleotide polymorphisms identifies major type 1A diabetes locus telomeric of the major histocompatibility complex. Diabetes 2008; 57:770-6. [PMID: 18065518 DOI: 10.2337/db07-0900] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
OBJECTIVE HLA-DRB1*03-DQB1*0201/DRB1*04-DQB1*0302 (DR3/4-DQ8) siblings who share both major histocompatibility complex (MHC) haplotypes identical-by-descent with their proband siblings have a higher risk for type 1A diabetes than DR3/4-DQ8 siblings who do not share both MHC haplotypes identical-by-descent. Our goal was to search for non-DR/DQ MHC genetic determinants that cause the additional risk in the DR3/4-DQ8 siblings who share both MHC haplotypes. RESEARCH DESIGN AND METHODS We completed an extensive single nucleotide polymorphism (SNP) analysis of the extended MHC in 237 families with type 1A diabetes from the U.S. and 1,240 families from the Type 1 Diabetes Genetics Consortium. RESULTS We found evidence for an association with type 1A diabetes (rs1233478, P = 1.6 x 10(-23), allelic odds ratio 2.0) in the UBD/MAS1L region, telomeric of the classic MHC. We also observed over 99% conservation for up to 9 million nucleotides between chromosomes containing a common haplotype with the HLA-DRB1*03, HLA-B*08, and HLA-A*01 alleles, termed the "8.1 haplotype." The diabetes association in the UBD/MAS1L region remained significant both after chromosomes with the 8.1 haplotype were removed (rs1233478, P = 1.4 x 10(-12)) and after adjustment for known HLA risk factors HLA-DRB1, HLA-DQB1, HLA-B, and HLA-A (P = 0.01). CONCLUSIONS Polymorphisms in the region of the UBD/MAS1L genes are associated with type 1A diabetes independent of HLA class II and I alleles.
Collapse
Affiliation(s)
- Theresa A Aly
- Barbara Davis Center for Childhood Diabetes, University of Colorado at Denver and Health Sciences Center, Aurora, CO 80045-6511, USA
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
35
|
Husain Z, Kelly MA, Eisenbarth GS, Pugliese A, Awdeh ZL, Larsen CE, Alper CA. The MHC type 1 diabetes susceptibility gene is centromeric to HLA-DQB1. J Autoimmun 2007; 30:266-72. [PMID: 18065200 DOI: 10.1016/j.jaut.2007.10.006] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2007] [Revised: 10/19/2007] [Accepted: 10/19/2007] [Indexed: 10/22/2022]
Abstract
HLA-DQB1 is widely considered to be the major histocompatibility complex (MHC) susceptibility gene for type 1 diabetes (T1D). However, since inheritance of the gene in T1D is recessive, the presence of the protective HLA-DQB1 0602 allele with normal nucleotide sequence in some patients raises the question of whether HLA-DQB1 is not the susceptibility locus itself but merely a good marker. HLA-DQB1 0602 is part of a conserved extended haplotype (CEH) [HLA-B7, SC31, DR2] (B7, DR2) with fixed DNA over more than 1Mb of genomic DNA that normally carries a protective allele at the true susceptibility locus. We postulated that, in patients with HLA-DQB1 0602, the protective allele at the susceptibility locus has been replaced by a susceptibility allele through an ancient crossover at meiosis centromeric to HLA-DQB1. We analyzed single nucleotide polymorphisms (SNPs) distinguishing the HLA-DQA2 (the first expressed gene centromeric to HLA-DQB1) allele on the normal HLA-B7, DR2 CEH from those on susceptibility CEHs in T1D patients and controls with HLA-DQB1 0602. All but 1 of 20 healthy control HLA-DQB1 0602 haplotypes had identical (consensus) first intron HLA-DQA2 5-SNP haplotypes. Fifteen of 19 patients with HLA-DQB1 0602 were homozygous for 1 or more HLA-DQA2 SNPs differing from consensus HLA-DQA2 SNPs, providing evidence of crossover involving the HLA-DQA2 locus. The remaining 4 patients were heterozygous at all positions and therefore uninformative. The loss of dominant protection usually associated with HLA-DQB1 0602 haplotypes is consistent with a locus centromeric to HLA-DQB1 being a major determinant of MHC-associated susceptibility, and perhaps the true T1D susceptibility locus.
Collapse
Affiliation(s)
- Zaheed Husain
- Immune Disease Institute, 800 Huntington Avenue, Boston, MA 02115, USA
| | | | | | | | | | | | | |
Collapse
|
36
|
Wu YL, Savelli SL, Yang Y, Zhou B, Rovin BH, Birmingham DJ, Nagaraja HN, Hebert LA, Yu CY. Sensitive and specific real-time polymerase chain reaction assays to accurately determine copy number variations (CNVs) of human complement C4A, C4B, C4-long, C4-short, and RCCX modules: elucidation of C4 CNVs in 50 consanguineous subjects with defined HLA genotypes. THE JOURNAL OF IMMUNOLOGY 2007; 179:3012-25. [PMID: 17709516 DOI: 10.4049/jimmunol.179.5.3012] [Citation(s) in RCA: 64] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Abstract
Recent comparative genome hybridization studies revealed that hundreds to thousands of human genomic loci can have interindividual copy number variations (CNVs). One of such CNV loci in the HLA codes for the immune effector protein complement component C4. Sensitive, specific, and accurate assays to interrogate the C4 CNV and its associated polymorphisms by using submicrogram quantities of genomic DNA are needed for high throughput epidemiologic studies of C4 CNVs in autoimmune, infectious, and neurological diseases. Quantitative real-time PCR (qPCR) assays were developed using TaqMan chemistry and based on sequences specific for C4A and C4B genes, structural characteristics corresponding to the long and short forms of C4 genes, and the breakpoint region of RP-C4-CYP21-TNX (RCCX) modular duplication. Assignments for gene copy numbers were achieved by relative standard curve methods using cloned C4 genomic DNA covering 6 logs of DNA concentrations for calibrations. The accuracies of test results were cross-confirmed internally in each sample, as the sum of C4A plus C4B equals to the sum of C4L plus C4S or the total copy number of RCCX modules. These qPCR assays were applied to determine C4 CNVs from samples of 50 consanguineous subjects who were mostly homozygous in HLA genotypes. The results revealed eight HLA haplotypes with single C4 genes in monomodular RCCX that are associated with multiple autoimmune and infectious diseases and 32 bimodular, 4 trimodular, and one quadrimodular RCCX. These C4 qPCR assays are proven to be robust, sensitive, and reliable, as they have contributed to the elucidation of C4 CNVs in >1000 human samples with autoimmune and neurological diseases.
Collapse
Affiliation(s)
- Yee Ling Wu
- Center for Molecular and Human Genetics, Columbus Children's Research Institute, 700 Children's Drive, Columbus, OH 43205, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
37
|
Fernando MMA, Stevens CR, Sabeti PC, Walsh EC, McWhinnie AJM, Shah A, Green T, Rioux JD, Vyse TJ. Identification of two independent risk factors for lupus within the MHC in United Kingdom families. PLoS Genet 2007; 3:e192. [PMID: 17997607 PMCID: PMC2065882 DOI: 10.1371/journal.pgen.0030192] [Citation(s) in RCA: 118] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2007] [Accepted: 09/18/2007] [Indexed: 11/30/2022] Open
Abstract
The association of the major histocompatibility complex (MHC) with SLE is well established yet the causal variants arising from this region remain to be identified, largely due to inadequate study design and the strong linkage disequilibrium demonstrated by genes across this locus. The majority of studies thus far have identified strong association with classical class II alleles, in particular HLA-DRB1*0301 and HLA-DRB1*1501. Additional associations have been reported with class III alleles; specifically, complement C4 null alleles and a tumor necrosis factor promoter SNP (TNF-308G/A). However, the relative effects of these class II and class III variants have not been determined. We have thus used a family-based approach to map association signals across the MHC class II and class III regions in a cohort of 314 complete United Kingdom Caucasian SLE trios by typing tagging SNPs together with classical typing of the HLA-DRB1 locus. Using TDT and conditional regression analyses, we have demonstrated the presence of two distinct and independent association signals in SLE: HLA-DRB1*0301 (nominal p = 4.9 x 10(-8), permuted p < 0.0001, OR = 2.3) and the T allele of SNP rs419788 (nominal p = 4.3 x 10(-8), permuted p < 0.0001, OR = 2.0) in intron 6 of the class III region gene SKIV2L. Assessment of genotypic risk demonstrates a likely dominant model of inheritance for HLA-DRB1*0301, while rs419788-T confers susceptibility in an additive manner. Furthermore, by comparing transmitted and untransmitted parental chromosomes, we have delimited our class II signal to a 180 kb region encompassing the alleles HLA-DRB1*0301-HLA-DQA1*0501-HLA-DQB1*0201 alone. Our class III signal importantly excludes independent association at the TNF promoter polymorphism, TNF-308G/A, in our SLE cohort and provides a potentially novel locus for future genetic and functional studies.
Collapse
Affiliation(s)
- Michelle M. A Fernando
- Section of Molecular Genetics and Rheumatology, Imperial College London, London, United Kingdom
| | - Christine R Stevens
- The Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | - Pardis C Sabeti
- The Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | - Emily C Walsh
- The Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | - Alasdair J. M McWhinnie
- Histocompatibility Laboratories and Research Institute, The Anthony Nolan Trust, London, United Kingdom
| | - Anila Shah
- Histocompatibility Laboratories and Research Institute, The Anthony Nolan Trust, London, United Kingdom
| | - Todd Green
- The Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | - John D Rioux
- The Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
- Université de Montréal, Montréal Heart Institute, Montréal, Québec, Canada
| | - Timothy J Vyse
- Section of Molecular Genetics and Rheumatology, Imperial College London, London, United Kingdom
| |
Collapse
|
38
|
Yang Y, Chung EK, Wu YL, Savelli SL, Nagaraja HN, Zhou B, Hebert M, Jones KN, Shu Y, Kitzmiller K, Blanchong CA, McBride KL, Higgins GC, Rennebohm RM, Rice RR, Hackshaw KV, Roubey RAS, Grossman JM, Tsao BP, Birmingham DJ, Rovin BH, Hebert LA, Yu CY. Gene copy-number variation and associated polymorphisms of complement component C4 in human systemic lupus erythematosus (SLE): low copy number is a risk factor for and high copy number is a protective factor against SLE susceptibility in European Americans. Am J Hum Genet 2007; 80:1037-54. [PMID: 17503323 PMCID: PMC1867093 DOI: 10.1086/518257] [Citation(s) in RCA: 348] [Impact Index Per Article: 20.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2006] [Accepted: 03/07/2007] [Indexed: 12/18/2022] Open
Abstract
Interindividual gene copy-number variation (CNV) of complement component C4 and its associated polymorphisms in gene size (long and short) and protein isotypes (C4A and C4B) probably lead to different susceptibilities to autoimmune disease. We investigated the C4 gene CNV in 1,241 European Americans, including patients with systemic lupus erythematosus (SLE), their first-degree relatives, and unrelated healthy subjects, by definitive genotyping and phenotyping techniques. The gene copy number (GCN) varied from 2 to 6 for total C4, from 0 to 5 for C4A, and from 0 to 4 for C4B. Four copies of total C4, two copies of C4A, and two copies of C4B were the most common GCN counts, but each constituted only between one-half and three-quarters of the study populations. Long C4 genes were strongly correlated with C4A (R=0.695; P<.0001). Short C4 genes were correlated with C4B (R=0.437; P<.0001). In comparison with healthy subjects, patients with SLE clearly had the GCN of total C4 and C4A shifting to the lower side. The risk of SLE disease susceptibility significantly increased among subjects with only two copies of total C4 (patients 9.3%; unrelated controls 1.5%; odds ratio [OR] = 6.514; P=.00002) but decreased in those with > or =5 copies of C4 (patients 5.79%; controls 12%; OR=0.466; P=.016). Both zero copies (OR=5.267; P=.001) and one copy (OR=1.613; P=.022) of C4A were risk factors for SLE, whereas > or =3 copies of C4A appeared to be protective (OR=0.574; P=.012). Family-based association tests suggested that a specific haplotype with a single short C4B in tight linkage disequilibrium with the -308A allele of TNFA was more likely to be transmitted to patients with SLE. This work demonstrates how gene CNV and its related polymorphisms are associated with the susceptibility to a human complex disease.
Collapse
Affiliation(s)
- Yan Yang
- Center for Molecular and Human Genetics, Columbus Children's Research Institute, Columbus, OH 43205, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
39
|
Genetic fixity in the human major histocompatibility complex and block size diversity in the class I region including HLA-E. BMC Genet 2007; 8:14. [PMID: 17430593 PMCID: PMC1853106 DOI: 10.1186/1471-2156-8-14] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2006] [Accepted: 04/12/2007] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The definition of human MHC class I haplotypes through association of HLA-A, HLA-Cw and HLA-B has been used to analyze ethnicity, population migrations and disease association. RESULTS Here, we present HLA-E allele haplotype association and population linkage disequilibrium (LD) analysis within the ~1.3 Mb bounded by HLA-B/Cw and HLA-A to increase the resolution of identified class I haplotypes. Through local breakdown of LD, we inferred ancestral recombination points both upstream and downstream of HLA-E contributing to alternative block structures within previously identified haplotypes. Through single nucleotide polymorphism (SNP) analysis of the MHC region, we also confirmed the essential genetic fixity, previously inferred by MHC allele analysis, of three conserved extended haplotypes (CEHs), and we demonstrated that commercially-available SNP analysis can be used in the MHC to help define CEHs and CEH fragments. CONCLUSION We conclude that to generate high-resolution maps for relating MHC haplotypes to disease susceptibility, both SNP and MHC allele analysis must be conducted as complementary techniques.
Collapse
|
40
|
Alper CA, Husain Z, Larsen CE, Dubey DP, Stein R, Day C, Baker A, Beyan H, Hawa M, Ola TO, Leslie RD. Incomplete penetrance of susceptibility genes for MHC-determined immunoglobulin deficiencies in monozygotic twins discordant for type 1 diabetes. J Autoimmun 2006; 27:89-95. [PMID: 17029885 PMCID: PMC1810396 DOI: 10.1016/j.jaut.2006.07.007] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2006] [Revised: 07/21/2006] [Accepted: 07/23/2006] [Indexed: 01/31/2023]
Abstract
Incomplete intrinsic penetrance is the failure of some genetically susceptible individuals (e.g., monozygotic twins of those who have a trait) to exhibit that trait. For the first time, we examine penetrance of susceptibility genes for multiple MHC gene-determined traits in the same subjects. Serum levels of IgA, IgD, IgG3, but not IgG4, in 50 pairs of monozygotic twins discordant for type 1 diabetes (T1D) correlated more closely in the twins than in random paired controls. The frequencies of subjects deficient in IgA (6%), IgD (33%) and IgG4 (12%), but not in IgG3, were higher in the twins than in controls. We postulate that this was because the MHC haplotypes (and possible non-MHC genes) that predispose to T1D also carry susceptibility genes for certain immunoglobulin deficiencies. Immunoglobulin deficiencies were not associated with T1D. Pairwise concordance for the deficiencies in the twins was 50% for IgA, 57% for IgD and 50% for IgG4. There were no significant associations among the specific immunoglobulin deficiencies except that all IgA-deficient subjects had IgD deficiency. Thus, intrinsic penetrance is a random process independently affecting different MHC susceptibility genes. Because multiple different external triggers would be required to explain the results, differential environmental determinants appear unlikely.
Collapse
Affiliation(s)
- Chester A Alper
- The CBR Institute for Biomedical Research, Harvard Medical School, 800 Huntington Avenue, Boston, MA 02115, USA. . edu
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
41
|
Harley JB, Kelly JA, Kaufman KM. Unraveling the genetics of systemic lupus erythematosus. ACTA ACUST UNITED AC 2006; 28:119-30. [PMID: 17021721 DOI: 10.1007/s00281-006-0040-5] [Citation(s) in RCA: 102] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2006] [Accepted: 07/14/2006] [Indexed: 02/07/2023]
Abstract
The capacity to locate polymorphisms on a virtually complete map of the human genome coupled with the ability to accurately evaluate large numbers (by historical standards) of genetic markers has led to gene identification in complex diseases, such as systemic lupus erythematosus (SLE or lupus). While this is a phenotype with enormous clinical variation, the twin studies and the observed familial aggregation, along with the genetic effects now known, suggest a strong genetic component. Unlike type 1 diabetes, lupus genetics is not dominated by the powerful effect of a single locus. Instead, there are at least six known genetic association effects in lupus of smaller magnitude (odds ratio <2), and at least 17 robust linkages (established and arguably confirmed independently) defining potentially responsible genes that largely remain to be discovered. The more convincing genetic associations include the human leukocyte antigen region (with multiple genes), C1q, PTPN22, PDCD1, Fc receptor-like 3, FcgammaRIIA, FcgammaRIIIA, interferon regulatory factor 5, and others. How they contribute to disease risk remains yet to be clarified, beyond the obvious speculation derived from what has previously been learned about these genes. Certainly, they are expected to contribute to lupus risk independently and in combination with each other, with genes not yet identified, and with the environment. A substantial number of genes (>10) are expected to be identified to contribute to lupus or in its many subsets defined by clinical and laboratory features.
Collapse
Affiliation(s)
- John B Harley
- Department of Medicine, University of Oklahoma, Oklahoma City, OK, USA
| | | | | |
Collapse
|
42
|
Bilbao JR, Calvo B, Aransay AM, Martin-Pagola A, Perez de Nanclares G, Aly TA, Rica I, Vitoria JC, Gaztambide S, Noble J, Fain PR, Awdeh ZL, Alper CA, Castaño L. Conserved extended haplotypes discriminate HLA-DR3-homozygous Basque patients with type 1 diabetes mellitus and celiac disease. Genes Immun 2006; 7:550-4. [PMID: 16929349 DOI: 10.1038/sj.gene.6364328] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
The major susceptibility locus for type 1 diabetes mellitus (T1D) maps to the human lymphocyte antigen (HLA) class II region in the major histocompatibility complex on chromosome 6p21. In southern European populations, like the Basques, the greatest risk to T1D is associated with DR3 homo- and heterozygosity and is comparable to that of DR3/DR4, the highest risk genotype in northern European populations. Celiac disease (CD) is another DR3-associated autoimmune disorder showing certain overlap with T1D that has been explained by the involvement of common genetic determinants, a situation more frequent in DR3-rich populations, like the Basques. As both T1D- and CD-associated HLA alleles are part of conserved extended haplotypes (CEH), we compared DR3-homozygous T1D and CD patients to determine whether CEHs were equally distributed between both disorders or there was a differential contribution of different haplotypes. We observed a very pronounced distribution bias (P<10(-5)) of the two major DR3 CEHs, with DR3-B18 predominating in T1D and DR3-B8 in CD. Additionally, high-density single nucleotide polymorphism (SNP) analysis of the complete CEH [A*30-B*18-MICA*4-F1C30-DRB1*0301-DQB1*0201-DPB1*0202] revealed extraordinary conservation throughout the 4.9 Mbp analyzed supporting the existence of additional diabetogenic variants (other than HLA-DRB1*0301-DQB1*0201), conserved within the DR3-B18 CEH (but not in other DR3 haplotypes) that could explain its enhanced diabetogenicity.
Collapse
Affiliation(s)
- J R Bilbao
- Endocrinology and Diabetes Research Group, Hospital de Cruces, Barakaldo, Bizkaia, Spain
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
43
|
Abstract
Similar to other classical science disciplines, immunology has been embracing novel technologies and approaches giving rise to specialised sub-disciplines such as immunogenetics and, more recently, immunogenomics, which, in many ways, is the genome-wide application of immunogenetic approaches. Here, recent progress in the understanding of the immune sub-genome will be reviewed, and the ways in which immunogenomic datasets consisting of genetic and epigenetic variation, linkage disequilibrium and recombination can be harnessed for disease association and evolutionary studies will be discussed. The discussion will focus on data available for the major histocompatibility complex and the leukocyte receptor complex, the two most polymorphic regions of the human immune sub-genome.
Collapse
Affiliation(s)
- Marcos M Miretti
- Immunogenomics Laboratory, The Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Stephan Beck
- Immunogenomics Laboratory, The Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| |
Collapse
|
44
|
Vettriselvi V, Vijayalakshmi K, Suganya S, Krishnan M, Paul SFD, Jayanth V. Molecular diversity of HLA-A*19 group of alleles in south Indian population. Int J Immunogenet 2006; 33:69-72. [PMID: 16611249 DOI: 10.1111/j.1744-313x.2006.00570.x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
To determine the genetic diversity of the human leucocyte antigen (HLA)-A*19 group of alleles in the south Indian Tamil population, we studied 100 random healthy unrelated individuals. The frequency of HLA-A*19 was 37% with A*33 (45.9%), A*32 (29.7%), A*31 (16.2%), A*30 (5.4%), A*29 (2.7%) and A*74 (0%). The frequency distribution of the HLA-A*19 alleles was distinct and revealed marked similarities and variations with other populations.
Collapse
Affiliation(s)
- V Vettriselvi
- Dept. of Human Genetics, Sri Ramachandra Medical College and Research Institute, Chennai-600116, Tamil Nadu, India
| | | | | | | | | | | |
Collapse
|
45
|
Alper CA, Larsen CE, Dubey DP, Awdeh ZL, Fici DA, Yunis EJ. The Haplotype Structure of the Human Major Histocompatibility Complex. Hum Immunol 2006; 67:73-84. [PMID: 16698428 DOI: 10.1016/j.humimm.2005.11.006] [Citation(s) in RCA: 76] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2005] [Revised: 11/17/2005] [Accepted: 11/22/2005] [Indexed: 11/17/2022]
Abstract
There is great interest in the use of single-nucleotide polymorphisms (SNPs) and linkage disequilibrium (LD) analysis to localize human disease genes. The results suggest that the human genome, including the major histocompatibility complex (MHC), consists largely of 5- to 200-kb blocks of sequence fixity between which random recombination occurs. Direct determination of MHC haplotypes from family studies also demonstrates similar-sized blocks, but otherwise gives a very different picture, with a third to a half of Caucasian haplotypes fixed from HLA-B to HLA-DR/DQ (at least 1 Mb) as conserved extended haplotypes (CEHs), some of which encompass more than 3 Mb. These fixed haplotypes differ in frequency both in different Caucasian subpopulations and in Caucasian patients with HLA-associated diseases, complicating disease susceptibility gene localization. The inherent inability of LD analysis to "see" DNA fixity beyond three markers contributes to the failure of SNP/LD analysis to define in detail or even detect CEHs in the MHC and probably elsewhere in the genome. More importantly, the use of statistical analysis, rather than direct haplotype determination and counting, fails to reveal the details of haplotype structure essential for gene localization. Given the oversimplified picture of the MHC (and probably the rest of the genome) provided only by SNP/LD-defined blocks, it is questionable whether this approach will be of great help in disease susceptibility gene localization or identification.
Collapse
Affiliation(s)
- Chester A Alper
- CBR Institute for Biomedical Research, and Department of Pediatrics, Harvard Medical School, Boston, MA 02115, USA.
| | | | | | | | | | | |
Collapse
|
46
|
Traherne JA, Horton R, Roberts AN, Miretti MM, Hurles ME, Stewart CA, Ashurst JL, Atrazhev AM, Coggill P, Palmer S, Almeida J, Sims S, Wilming LG, Rogers J, de Jong PJ, Carrington M, Elliott JF, Sawcer S, Todd JA, Trowsdale J, Beck S. Genetic analysis of completely sequenced disease-associated MHC haplotypes identifies shuffling of segments in recent human history. PLoS Genet 2006; 2:e9. [PMID: 16440057 PMCID: PMC1331980 DOI: 10.1371/journal.pgen.0020009] [Citation(s) in RCA: 137] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2005] [Accepted: 12/13/2005] [Indexed: 11/23/2022] Open
Abstract
The major histocompatibility complex (MHC) is recognised as one of the most important genetic regions in relation to common human disease. Advancement in identification of MHC genes that confer susceptibility to disease requires greater knowledge of sequence variation across the complex. Highly duplicated and polymorphic regions of the human genome such as the MHC are, however, somewhat refractory to some whole-genome analysis methods. To address this issue, we are employing a bacterial artificial chromosome (BAC) cloning strategy to sequence entire MHC haplotypes from consanguineous cell lines as part of the MHC Haplotype Project. Here we present 4.25 Mb of the human haplotype QBL (HLA-A26-B18-Cw5-DR3-DQ2) and compare it with the MHC reference haplotype and with a second haplotype, COX (HLA-A1-B8-Cw7-DR3-DQ2), that shares the same HLA-DRB1, -DQA1, and -DQB1 alleles. We have defined the complete gene, splice variant, and sequence variation contents of all three haplotypes, comprising over 259 annotated loci and over 20,000 single nucleotide polymorphisms (SNPs). Certain coding sequences vary significantly between different haplotypes, making them candidates for functional and disease-association studies. Analysis of the two DR3 haplotypes allowed delineation of the shared sequence between two HLA class II-related haplotypes differing in disease associations and the identification of at least one of the sites that mediated the original recombination event. The levels of variation across the MHC were similar to those seen for other HLA-disparate haplotypes, except for a 158-kb segment that contained the HLA-DRB1, -DQA1, and -DQB1 genes and showed very limited polymorphism compatible with identity-by-descent and relatively recent common ancestry (<3,400 generations). These results indicate that the differential disease associations of these two DR3 haplotypes are due to sequence variation outside this central 158-kb segment, and that shuffling of ancestral blocks via recombination is a potential mechanism whereby certain DR-DQ allelic combinations, which presumably have favoured immunological functions, can spread across haplotypes and populations.
Collapse
Affiliation(s)
- James A Traherne
- Department of Pathology, Immunology Division, University of Cambridge, Cambridge, United Kingdom
| | - Roger Horton
- Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridge, United Kingdom
| | - Anne N Roberts
- Juvenile Diabetes Research Foundation/Wellcome Trust Diabetes and Inflammation Laboratory, Cambridge Institute for Medical Research, University of Cambridge, Wellcome Trust/MRC Building, Addenbrooke's Hospital, Cambridge, United Kingdom
| | - Marcos M Miretti
- Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridge, United Kingdom
| | - Matthew E Hurles
- Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridge, United Kingdom
| | - C. Andrew Stewart
- Department of Pathology, Immunology Division, University of Cambridge, Cambridge, United Kingdom
| | - Jennifer L Ashurst
- Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridge, United Kingdom
| | - Alexey M Atrazhev
- Alberta Diabetes Institute (ADI), Department of Medical Microbiology and Immunology, Division of Dermatology and Cutaneous Sciences, University of Alberta, Edmonton, Canada
| | - Penny Coggill
- Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridge, United Kingdom
| | - Sophie Palmer
- Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridge, United Kingdom
| | - Jeff Almeida
- Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridge, United Kingdom
| | - Sarah Sims
- Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridge, United Kingdom
| | - Laurens G Wilming
- Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridge, United Kingdom
| | - Jane Rogers
- Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridge, United Kingdom
| | - Pieter J. de Jong
- Children's Hospital Oakland Research Institute, Oakland, California, United States of America
| | - Mary Carrington
- Basic Research Program, SAIC-Frederick, Inc., Laboratory of Genomic Diversity, National Cancer Institute, Frederick, Maryland, United States of America
| | - John F Elliott
- Alberta Diabetes Institute (ADI), Department of Medical Microbiology and Immunology, Division of Dermatology and Cutaneous Sciences, University of Alberta, Edmonton, Canada
| | - Stephen Sawcer
- Department of Clinical Neurosciences, University of Cambridge, Addenbrooke's Hospital, Cambridge, United Kingdom
| | - John A Todd
- Juvenile Diabetes Research Foundation/Wellcome Trust Diabetes and Inflammation Laboratory, Cambridge Institute for Medical Research, University of Cambridge, Wellcome Trust/MRC Building, Addenbrooke's Hospital, Cambridge, United Kingdom
| | - John Trowsdale
- Department of Pathology, Immunology Division, University of Cambridge, Cambridge, United Kingdom
| | - Stephan Beck
- Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridge, United Kingdom
| |
Collapse
|
47
|
Abstract
Type 1 diabetes mellitus (T1D) remains the most intensively studied, and thus the best paradigm, of MHC-associated diseases. Accumulating evidence suggests that MHC susceptibility for T1D is recessive, with susceptibility alleles more common than protective alleles. Updated allele-level and nucleotide sequence analysis of MHC class II T1D susceptibility markers of conserved extended haplotypes underscore the uncertainty surrounding the actual T1D MHC susceptibility locus. Recent studies have established that disease concordance in dizygotic twins is the same as that in siblings generally, for both T1D and the MHC-associated autoimmune disease gluten-sensitive enteropathy, leaving little room for a differential environmental trigger. Epigenetic mechanisms are probably involved in many MHC-associated phenomena, including autoimmunity, and appear to be the best explanation for incomplete penetrance.
Collapse
Affiliation(s)
- Charles E Larsen
- The CBR Institute for Biomedical Research, 800 Huntington Avenue, Boston, MA 02115, USA
| | | |
Collapse
|
48
|
Cao K, Moormann AM, Lyke KE, Masaberg C, Sumba OP, Doumbo OK, Koech D, Lancaster A, Nelson M, Meyer D, Single R, Hartzman RJ, Plowe CV, Kazura J, Mann DL, Sztein MB, Thomson G, Fernández-Viña MA. Differentiation between African populations is evidenced by the diversity of alleles and haplotypes of HLA class I loci. ACTA ACUST UNITED AC 2004; 63:293-325. [PMID: 15009803 DOI: 10.1111/j.0001-2815.2004.00192.x] [Citation(s) in RCA: 148] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The allelic and haplotypic diversity of the HLA-A, HLA-B, and HLA-C loci was investigated in 852 subjects from five sub-Saharan populations from Kenya (Nandi and Luo), Mali (Dogon), Uganda, and Zambia. Distributions of genotypes at all loci and in all populations fit Hardy-Weinberg equilibrium expectations. There was not a single allele predominant at any of the loci in these populations, with the exception of A*3002 [allele frequency (AF) = 0.233] in Zambians and Cw*1601 (AF = 0.283) in Malians. This distribution was consistent with balancing selection for all class I loci in all populations, which was evidenced by the homozygosity F statistic that was less than that expected under neutrality. Only in the A locus in Zambians and the C locus in Malians, the AF distribution was very close to neutrality expectations. There were six instances in which there were significant deviations of allele distributions from neutrality in the direction of balancing selection. All allelic lineages from each of the class I loci were found in all the African populations. Several alleles of these loci have intermediate frequencies (AF = 0.020-0.150) and seem to appear only in the African populations. Most of these alleles are widely distributed in the African continent and their origin may predate the separation of linguistic groups. In contrast to native American and other populations, the African populations do not seem to show extensive allelic diversification within lineages, with the exception of the groups of alleles A*02, A*30, B*57, and B*58. The alleles of human leukocyte antigen (HLA)-B are in strong linkage disequilibrium (LD) with alleles of the C locus, and the sets of B/C haplotypes are found in several populations. The associations between A alleles with C-blocks are weaker, and only a few A/B/C haplotypes (A*0201-B*4501-Cw*1601; A*2301-B*1503-Cw*0202; A*7401-B* 1503-Cw*0202; A*2902-B*4201-Cw*1701; A*3001-B*4201-Cw*1701; and A*3601-B*5301-Cw*0401) are found in multiple populations with intermediate frequencies [haplotype frequency (HF) = 0.010-0.100]. The strength of the LD associations between alleles of HLA-A and HLA-B loci and those of HLA-B and HLA-C loci was on average of the same or higher magnitude as those observed in other non-African populations for the same pairs of loci. Comparison of the genetic distances measured by the distribution of alleles at the HLA class I loci in the sub-Saharan populations included in this and other studies indicate that the Luo population from western Kenya has the closest distance with virtually all sub-Saharan population so far studied for HLA-A, a finding consistent with the putative origin of modern humans in East Africa. In all African populations, the genetic distances between each other are greater than those observed between European populations. The remarkable current allelic and haplotypic diversity in the HLA system as well as their variable distribution in different sub-Saharan populations is probably the result of evolutionary forces and environments that have acted on each individual population or in their ancestors. In this regard, the genetic diversity of the HLA system in African populations poses practical challenges for the design of T-cell vaccines and for the transplantation medical community to find HLA-matched unrelated donors for patients in need of an allogeneic transplant.
Collapse
Affiliation(s)
- K Cao
- Department of Oncology, Georgetown University, Washington, DC, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
49
|
Yu CY, Chung EK, Yang Y, Blanchong CA, Jacobsen N, Saxena K, Yang Z, Miller W, Varga L, Fust G. Dancing with complement C4 and the RP-C4-CYP21-TNX (RCCX) modules of the major histocompatibility complex. ACTA ACUST UNITED AC 2004; 75:217-92. [PMID: 14604014 DOI: 10.1016/s0079-6603(03)75007-7] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
The number of the complement component C4 genes varies from 2 to 8 in a diploid genome among different human individuals. Three quarters of the C4 genes in Caucasian populations have the endogenous retrovirus, HERV-K(C4), in the ninth intron. The remainder does not. The C4 serum proteins are highly polymorphic and their concentrations vary from 100 to approximately 1000 microg/ml. There are two distinct classes of C4 protein, C4A and C4B, which have diversified to fulfill (a) the opsonization/immunoclearance purposes and (b) the well-known complement function in the killing of microbes by lysis and neutralization, respectively. Many infectious and autoimmune diseases are associated with complete or partial deficiency of C4A and/or C4B. The adverse effects of high C4 gene dosages, however, are just emerging, as the concepts of human C4 genetics are revised and accurate techniques are applied to distinguish partial deficiencies from differential expression caused by unequal C4A and C4B gene dosages and gene sizes. This review attempts to dissect the sophisticated genetics of complement C4A and C4B. The emphases are on the qualitative and quantitative diversities of C4 genotypes and phenotypes. The many allotypic variants and the processed products of human and mouse C4 proteins are described. The modular variation of C4 genes together with the serine/threonine nuclear kinase gene RP, the steroid 21-hydroxylase CYP21, and extracellular matrix protein TNX (RCCX modules) are investigated for the effects on homogenization of C4 protein polymorphisms, and on the unequal genetic crossovers that knocked out the functions of CYP21 and/or TNX. Furthermore, the influence of the endogenous retrovirus HERV-K(C4) on C4 gene expression and the dispersal of HERV-K(C4) family members in the human genome are discussed.
Collapse
Affiliation(s)
- C Yung Yu
- Center for Molecular and Human Genetics, Columbus Children's Research Institute, 700 Children's Drive, Columbus, OH 43205-2696, USA
| | | | | | | | | | | | | | | | | | | |
Collapse
|
50
|
Pinto C, Smith AG, Larsen CE, Fernández-Viña M, Husain Z, Clavijo OP, Wang ZC, Nisperos B, Hansen JA, Alper CA, Yunis EJ. HLA-Cw*0409N is associated with HLA-A*2301 and HLA-B*4403-carrying haplotypes. Hum Immunol 2004; 65:181-7. [PMID: 14969773 DOI: 10.1016/j.humimm.2003.11.001] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2003] [Accepted: 11/21/2003] [Indexed: 10/26/2022]
Abstract
The associations of HLA-B*4402 and HLA-B*4403 with alleles of HLA-A and HLA-Cw were investigated in panels of HLA-B*4403 and HLA-B*4402 homozygous individuals and in selected individuals carrying HLA-Cw*04 and HLA-B*4403. Some of these individuals were genotyped and also carried (HLA-DRB1*0701, DQB1*02). Among the latter, we studied individuals carrying the conserved extended haplotype (CEH) [HLA-Cw*04, B*4403, FC31, DRB1*0701, DQB1*02]. Four different common (HLA-Cw*, B*44) haplotypes were identified that extended to the HLA-A locus: HLA-A*0201, Cw*0501, B*4402; HLA-A*2902, Cw*1601, B*4403; HLA-A*2301, Cw*0401, B*4403; and HLA-A*2301, Cw*0409N, B*4403. We identified eight unrelated examples of the allele HLA-Cw*0409N. HLA-A*2301 was associated with both HLA-Cw*0401 and HLA-Cw*0409N, suggesting that HLA-Cw*0409N may have arisen from a mutation in a CEH. We estimate that approximately 2 to 5 in 1000 Caucasian individuals carry the allele HLA-Cw*0409N, making it one of the most frequent null HLA alleles known to date. Our findings demonstrate the first example of three different HLA-Cw-determined subtypes of a common or CEH carrying a shared HLA-B allele, in this case HLA-B*4403.
Collapse
Affiliation(s)
- C Pinto
- Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA 02115, USA
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|