1
|
Sun Q, Broadaway KA, Edmiston SN, Fajgenbaum K, Miller-Fleming T, Westerkam LL, Melendez-Gonzalez M, Bui H, Blum FR, Levitt B, Lin L, Hao H, Harris KM, Liu Z, Thomas NE, Cox NJ, Li Y, Mohlke KL, Sayed CJ. Genetic Variants Associated With Hidradenitis Suppurativa. JAMA Dermatol 2023; 159:930-938. [PMID: 37494057 PMCID: PMC10372759 DOI: 10.1001/jamadermatol.2023.2217] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2022] [Accepted: 04/25/2023] [Indexed: 07/27/2023]
Abstract
Importance Hidradenitis suppurativa (HS) is a common and severely morbid chronic inflammatory skin disease that is reported to be highly heritable. However, the genetic understanding of HS is insufficient, and limited genome-wide association studies (GWASs) have been performed for HS, which have not identified significant risk loci. Objective To identify genetic variants associated with HS and to shed light on the underlying genes and genetic mechanisms. Design, Setting, and Participants This genetic association study recruited 753 patients with HS in the HS Program for Research and Care Excellence (HS ProCARE) at the University of North Carolina Department of Dermatology from August 2018 to July 2021. A GWAS was performed for 720 patients (after quality control) with controls from the Add Health study and then meta-analyzed with 2 large biobanks, UK Biobank (247 cases) and FinnGen (673 cases). Variants at 3 loci were tested for replication in the BioVU biobank (290 cases). Data analysis was performed from September 2021 to December 2022. Main Outcomes and Measures Main outcome measures are loci identified, with association of P < 1 × 10-8 considered significant. Results A total of 753 patients were recruited, with 720 included in the analysis. Mean (SD) age at symptom onset was 20.3 (10.57) years and at enrollment was 35.3 (13.52) years; 360 (50.0%) patients were Black, and 575 (79.7%) were female. In a meta-analysis of the 4 studies, 2 HS-associated loci were identified and replicated, with lead variants rs10512572 (P = 2.3 × 10-11) and rs17090189 (P = 2.1 × 10-8) near the SOX9 and KLF5 genes, respectively. Variants at these loci are located in enhancer regulatory elements detected in skin tissue. Conclusions and Relevance In this genetic association study, common variants associated with HS located near the SOX9 and KLF5 genes were associated with risk of HS. These or other nearby genes may be associated with genetic risk of disease and the development of clinical features, such as cysts, comedones, and inflammatory tunnels, that are unique to HS. New insights into disease pathogenesis related to these genes may help predict disease progression and novel treatment approaches in the future.
Collapse
Affiliation(s)
- Quan Sun
- Department of Biostatistics, University of North Carolina at Chapel Hill
| | | | - Sharon N. Edmiston
- Department of Dermatology, University of North Carolina at Chapel Hill School of Medicine
- Lineberger Comprehensive Cancer Center, Chapel Hill, North Carolina
| | - Kristen Fajgenbaum
- Department of Dermatology, University of North Carolina at Chapel Hill School of Medicine
| | - Tyne Miller-Fleming
- Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, Tennessee
| | - Linnea Lackstrom Westerkam
- Department of Dermatology, University of North Carolina at Chapel Hill School of Medicine
- University of North Carolina at Chapel Hill School of Medicine
| | | | - Helen Bui
- Department of Internal Medicine, University of North Carolina at Chapel Hill School of Medicine
| | | | - Brandt Levitt
- Carolina Population Center, University of North Carolina at Chapel Hill
| | - Lan Lin
- Department of Dermatology, University of North Carolina at Chapel Hill School of Medicine
| | - Honglin Hao
- Department of Dermatology, University of North Carolina at Chapel Hill School of Medicine
| | - Kathleen Mullan Harris
- Carolina Population Center, University of North Carolina at Chapel Hill
- Sociology Department, University of North Carolina at Chapel Hill
| | - Zhi Liu
- Department of Dermatology, University of North Carolina at Chapel Hill School of Medicine
- Lineberger Comprehensive Cancer Center, Chapel Hill, North Carolina
| | - Nancy E. Thomas
- Department of Dermatology, University of North Carolina at Chapel Hill School of Medicine
- Carolina Population Center, University of North Carolina at Chapel Hill
| | - Nancy J. Cox
- Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, Tennessee
| | - Yun Li
- Department of Biostatistics, University of North Carolina at Chapel Hill
- Department of Genetics, University of North Carolina at Chapel Hill
| | - Karen L. Mohlke
- Department of Genetics, University of North Carolina at Chapel Hill
| | - Christopher J. Sayed
- Department of Dermatology, University of North Carolina at Chapel Hill School of Medicine
| |
Collapse
|
2
|
Sun Q, Yang Y, Rosen JD, Jiang MZ, Chen J, Liu W, Wen J, Raffield LM, Pace RG, Zhou YH, Wright FA, Blackman SM, Bamshad MJ, Gibson RL, Cutting GR, Knowles MR, Schrider DR, Fuchsberger C, Li Y. MagicalRsq: Machine-learning-based genotype imputation quality calibration. Am J Hum Genet 2022; 109:1986-1997. [PMID: 36198314 PMCID: PMC9674945 DOI: 10.1016/j.ajhg.2022.09.009] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2022] [Accepted: 09/16/2022] [Indexed: 01/26/2023] Open
Abstract
Whole-genome sequencing (WGS) is the gold standard for fully characterizing genetic variation but is still prohibitively expensive for large samples. To reduce costs, many studies sequence only a subset of individuals or genomic regions, and genotype imputation is used to infer genotypes for the remaining individuals or regions without sequencing data. However, not all variants can be well imputed, and the current state-of-the-art imputation quality metric, denoted as standard Rsq, is poorly calibrated for lower-frequency variants. Here, we propose MagicalRsq, a machine-learning-based method that integrates variant-level imputation and population genetics statistics, to provide a better calibrated imputation quality metric. Leveraging WGS data from the Cystic Fibrosis Genome Project (CFGP), and whole-exome sequence data from UK BioBank (UKB), we performed comprehensive experiments to evaluate the performance of MagicalRsq compared to standard Rsq for partially sequenced studies. We found that MagicalRsq aligns better with true R2 than standard Rsq in almost every situation evaluated, for both European and African ancestry samples. For example, when applying models trained from 1,992 CFGP sequenced samples to an independent 3,103 samples with no sequencing but TOPMed imputation from array genotypes, MagicalRsq, compared to standard Rsq, achieved net gains of 1.4 million rare, 117k low-frequency, and 18k common variants, where net gains were gained numbers of correctly distinguished variants by MagicalRsq over standard Rsq. MagicalRsq can serve as an improved post-imputation quality metric and will benefit downstream analysis by better distinguishing well-imputed variants from those poorly imputed. MagicalRsq is freely available on GitHub.
Collapse
Affiliation(s)
- Quan Sun
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Yingxi Yang
- Department of Statistics and Data Science, Yale University, New Haven, CT 06520, USA
| | - Jonathan D Rosen
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Min-Zhi Jiang
- Department of Applied Physical Sciences, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Jiawen Chen
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Weifang Liu
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Jia Wen
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Laura M Raffield
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Rhonda G Pace
- Marsico Lung Institute/UNC CF Research Center, School of Medicine, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Yi-Hui Zhou
- Department of Biological Sciences, North Carolina State University, Raleigh, NC 27695, USA
| | - Fred A Wright
- Department of Biological Sciences, North Carolina State University, Raleigh, NC 27695, USA; Bioinformatics Research Center and Department of Statistics, North Carolina State University, Raleigh, NC 27695, USA
| | - Scott M Blackman
- Division of Pediatric Endocrinology, Johns Hopkins University School of Medicine, Baltimore, MD 21287, USA
| | - Michael J Bamshad
- Department of Pediatrics, University of Washington, Seattle, WA 98105, USA; Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Ronald L Gibson
- Department of Pediatrics, University of Washington, Seattle, WA 98105, USA
| | - Garry R Cutting
- Department of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, MD 21287, USA
| | - Michael R Knowles
- Marsico Lung Institute/UNC CF Research Center, School of Medicine, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Daniel R Schrider
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Christian Fuchsberger
- Institute for Biomedicine, Eurac Research (affiliated with the University of Lübeck), Bolzano, Italy.
| | - Yun Li
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA; Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA.
| |
Collapse
|