1
|
Sun KY, Bai X, Chen S, Bao S, Zhang C, Kapoor M, Backman J, Joseph T, Maxwell E, Mitra G, Gorovits A, Mansfield A, Boutkov B, Gokhale S, Habegger L, Marcketta A, Locke AE, Ganel L, Hawes A, Kessler MD, Sharma D, Staples J, Bovijn J, Gelfman S, Di Gioia A, Rajagopal VM, Lopez A, Varela JR, Alegre J, Berumen J, Tapia-Conyer R, Kuri-Morales P, Torres J, Emberson J, Collins R, Cantor M, Thornton T, Kang HM, Overton JD, Shuldiner AR, Cremona ML, Nafde M, Baras A, Abecasis G, Marchini J, Reid JG, Salerno W, Balasubramanian S. A deep catalogue of protein-coding variation in 983,578 individuals. Nature 2024:10.1038/s41586-024-07556-0. [PMID: 38768635 DOI: 10.1038/s41586-024-07556-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2023] [Accepted: 05/10/2024] [Indexed: 05/22/2024]
Abstract
Rare coding variants that significantly impact function provide insights into the biology of a gene1-3. However, ascertaining their frequency requires large sample sizes4-8. Here, we present a catalogue of human protein-coding variation, derived from exome sequencing of 983,578 individuals across diverse populations. 23% of the Regeneron Genetics Center Million Exome data (RGC-ME) comes from non-European individuals of African, East Asian, Indigenous American, Middle Eastern, and South Asian ancestry. This catalogue includes over 10.4 million missense and 1.1 million predicted loss-of-function (pLOF) variants. We identify individuals with rare biallelic pLOF variants in 4,848 genes, 1,751 of which have not been previously reported. From precise quantitative estimates of selection against heterozygous loss-of-function, we identify 3,988 loss-of-function intolerant genes, including 86 that were previously assessed as tolerant and 1,153 lacking established disease annotation. We also define regions of missense depletion at high resolution. Notably, 1,482 genes have regions depleted of missense variants despite being tolerant to pLOF variants. Finally, we estimate that 3% of individuals have a clinically actionable genetic variant, and that 11,773 variants reported in ClinVar with unknown significance are likely to be deleterious cryptic splice sites. To facilitate variant interpretation and genetics-informed precision medicine, we make this important resource of coding variation from the RGC-ME accessible via a public variant allele frequency browser.
Collapse
|
2
|
Sun KY, Bai X, Chen S, Bao S, Kapoor M, Zhang C, Backman J, Joseph T, Maxwell E, Mitra G, Gorovits A, Mansfield A, Boutkov B, Gokhale S, Habegger L, Marcketta A, Locke A, Kessler MD, Sharma D, Staples J, Bovijn J, Gelfman S, Gioia AD, Rajagopal V, Lopez A, Varela JR, Alegre J, Berumen J, Tapia-Conyer R, Kuri-Morales P, Torres J, Emberson J, Collins R, Cantor M, Thornton T, Kang HM, Overton J, Shuldiner AR, Cremona ML, Nafde M, Baras A, Abecasis G, Marchini J, Reid JG, Salerno W, Balasubramanian S. A deep catalog of protein-coding variation in 985,830 individuals. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.09.539329. [PMID: 37214792 PMCID: PMC10197621 DOI: 10.1101/2023.05.09.539329] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
Coding variants that have significant impact on function can provide insights into the biology of a gene but are typically rare in the population. Identifying and ascertaining the frequency of such rare variants requires very large sample sizes. Here, we present the largest catalog of human protein-coding variation to date, derived from exome sequencing of 985,830 individuals of diverse ancestry to serve as a rich resource for studying rare coding variants. Individuals of African, Admixed American, East Asian, Middle Eastern, and South Asian ancestry account for 20% of this Exome dataset. Our catalog of variants includes approximately 10.5 million missense (54% novel) and 1.1 million predicted loss-of-function (pLOF) variants (65% novel, 53% observed only once). We identified individuals with rare homozygous pLOF variants in 4,874 genes, and for 1,838 of these this work is the first to document at least one pLOF homozygote. Additional insights from the RGC-ME dataset include 1) improved estimates of selection against heterozygous loss-of-function and identification of 3,459 genes intolerant to loss-of-function, 83 of which were previously assessed as tolerant to loss-of-function and 1,241 that lack disease annotations; 2) identification of regions depleted of missense variation in 457 genes that are tolerant to loss-of-function; 3) functional interpretation for 10,708 variants of unknown or conflicting significance reported in ClinVar as cryptic splice sites using splicing score thresholds based on empirical variant deleteriousness scores derived from RGC-ME; and 4) an observation that approximately 3% of sequenced individuals carry a clinically actionable genetic variant in the ACMG SF 3.1 list of genes. We make this important resource of coding variation available to the public through a variant allele frequency browser. We anticipate that this report and the RGC-ME dataset will serve as a valuable reference for understanding rare coding variation and help advance precision medicine efforts.
Collapse
|
3
|
Zhou S, Sosina OA, Bovijn J, Laurent L, Sharma V, Akbari P, Forgetta V, Jiang L, Kosmicki JA, Banerjee N, Morris JA, Oerton E, Jones M, LeBlanc MG, Idone V, Overton JD, Reid JG, Cantor M, Abecasis GR, Goltzman D, Greenwood CMT, Langenberg C, Baras A, Economides AN, Ferreira MAR, Hatsell S, Ohlsson C, Richards JB, Lotta LA. Converging evidence from exome sequencing and common variants implicates target genes for osteoporosis. Nat Genet 2023; 55:1277-1287. [PMID: 37558884 DOI: 10.1038/s41588-023-01444-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2022] [Accepted: 06/09/2023] [Indexed: 08/11/2023]
Abstract
In this study, we leveraged the combined evidence of rare coding variants and common alleles to identify therapeutic targets for osteoporosis. We undertook a large-scale multiancestry exome-wide association study for estimated bone mineral density, which showed that the burden of rare coding alleles in 19 genes was associated with estimated bone mineral density (P < 3.6 × 10-7). These genes were highly enriched for a set of known causal genes for osteoporosis (65-fold; P = 2.5 × 10-5). Exome-wide significant genes had 96-fold increased odds of being the top ranked effector gene at a given GWAS locus (P = 1.8 × 10-10). By integrating proteomics Mendelian randomization evidence, we prioritized CD109 (cluster of differentiation 109) as a gene for which heterozygous loss of function is associated with higher bone density. CRISPR-Cas9 editing of CD109 in SaOS-2 osteoblast-like cell lines showed that partial CD109 knockdown led to increased mineralization. This study demonstrates that the convergence of common and rare variants, proteomics and CRISPR can highlight new bone biology to guide therapeutic development.
Collapse
|
4
|
Manderstedt E, Lind‐Halldén C, Halldén C, Elf J, Svensson PJ, Engström G, Melander O, Baras A, Lotta LA, Zöller B, Abecasis G, Baras A, Cantor M, Coppola G, Economides A, Lotta LA, Overton JD, Reid JG, Shuldiner A, Beechert C, Forsythe C, Fuller ED, Gu Z, Lattari M, Lopez A, Overton JD, Schleicher TD, Padilla MS, Widom L, Wolf SE, Pradhan M, Manoochehri K, Ulloa RH, Bai X, Balasubramanian S, Blumenfeld A, Boutkov B, Eom G, Habegger L, Hawes A, Khalid S, Krasheninina O, Lanche R, Mansfield AJ, Maxwell EK, Nafde M, O’Keeffe S, Orelus M, Panea R, Polanco T, Rasool A, Reid JG, Salerno W, Staples JC, Jones MB, Mighty J, Mitnaul LJ. Genetic variation of the blood coagulation regulator tissue factor pathway inhibitor and venous thromboembolism among middle‐aged and older adults: A population‐based cohort study. Res Pract Thromb Haemost 2022; 6:e12842. [PMID: 36381289 PMCID: PMC9644338 DOI: 10.1002/rth2.12842] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2022] [Revised: 09/09/2022] [Accepted: 10/19/2022] [Indexed: 11/11/2022] Open
Abstract
Background Tissue factor is the main initiator of blood coagulation, and tissue factor pathway inhibitor (TFPI) is the primary inhibitor of the initiation of blood coagulation. The genetic variation of TFPI and the relation to venous thromboembolism (VTE), that is, venous thrombosis and pulmonary embolism, remains to be clarified. This exome sequencing study aimed to determine the molecular epidemiology of the TFPI gene and the relation to VTE in a large population‐based cohort of middle‐aged and older adults. Methods The exomes of TFPI were analyzed for variants in 28,794 subjects without previous VTE (born 1923–1950, 60% women), who participated in the Malmö Diet and Cancer Study (1991–1996). Patients were followed until the first event of VTE, death, or 2018. Qualifying variants were defined as loss‐of‐function or nonbenign (PolyPhen‐2) missense variants with minor allele frequency less than 0.1%. Results No common variant was associated with VTE. Nine rare variants (two loss‐of‐function and seven nonbenign missense) were classified as qualifying and included in collapsing analysis. Prevalence of qualifying variants was 0.09%. Five individuals with VTE compared to 17 individuals without VTE carried one qualifying variant. Cox multivariate regression analysis adjusted for age, sex, body mass index, systolic blood pressure, smoking and alcohol consumption, rs6025, rs1799963, and ancestry showed a hazard ratio of 2.9 (95% CI, 1.2–7.1) for rare qualifying variants. Conclusion Rare qualifying TFPI variants were associated with VTE, suggesting that rare variants in TFPI contribute to the development of VTE. The qualifying TFPI gene variants were very rare, suggesting a constrained gene.
Collapse
|
5
|
Akbari P, Sosina OA, Bovijn J, Landheer K, Nielsen JB, Kim M, Aykul S, De T, Haas ME, Hindy G, Lin N, Dinsmore IR, Luo JZ, Hectors S, Geraghty B, Germino M, Panagis L, Parasoglou P, Walls JR, Halasz G, Atwal GS, Jones M, LeBlanc MG, Still CD, Carey DJ, Giontella A, Orho-Melander M, Berumen J, Kuri-Morales P, Alegre-Díaz J, Torres JM, Emberson JR, Collins R, Rader DJ, Zambrowicz B, Murphy AJ, Balasubramanian S, Overton JD, Reid JG, Shuldiner AR, Cantor M, Abecasis GR, Ferreira MAR, Sleeman MW, Gusarova V, Altarejos J, Harris C, Economides AN, Idone V, Karalis K, Della Gatta G, Mirshahi T, Yancopoulos GD, Melander O, Marchini J, Tapia-Conyer R, Locke AE, Baras A, Verweij N, Lotta LA. Multiancestry exome sequencing reveals INHBE mutations associated with favorable fat distribution and protection from diabetes. Nat Commun 2022; 13:4844. [PMID: 35999217 PMCID: PMC9399235 DOI: 10.1038/s41467-022-32398-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2022] [Accepted: 07/28/2022] [Indexed: 12/13/2022] Open
Abstract
Body fat distribution is a major, heritable risk factor for cardiometabolic disease, independent of overall adiposity. Using exome-sequencing in 618,375 individuals (including 160,058 non-Europeans) from the UK, Sweden and Mexico, we identify 16 genes associated with fat distribution at exome-wide significance. We show 6-fold larger effect for fat-distribution associated rare coding variants compared with fine-mapped common alleles, enrichment for genes expressed in adipose tissue and causal genes for partial lipodystrophies, and evidence of sex-dimorphism. We describe an association with favorable fat distribution (p = 1.8 × 10-09), favorable metabolic profile and protection from type 2 diabetes (~28% lower odds; p = 0.004) for heterozygous protein-truncating mutations in INHBE, which encodes a circulating growth factor of the activin family, highly and specifically expressed in hepatocytes. Our results suggest that inhibin βE is a liver-expressed negative regulator of adipose storage whose blockade may be beneficial in fat distribution-associated metabolic disease.
Collapse
|
6
|
Verweij N, Haas ME, Nielsen JB, Sosina OA, Kim M, Akbari P, De T, Hindy G, Bovijn J, Persaud T, Miloscio L, Germino M, Panagis L, Watanabe K, Mbatchou J, Jones M, LeBlanc M, Balasubramanian S, Lammert C, Enhörning S, Melander O, Carey DJ, Still CD, Mirshahi T, Rader DJ, Parasoglou P, Walls JR, Overton JD, Reid JG, Economides A, Cantor MN, Zambrowicz B, Murphy AJ, Abecasis GR, Ferreira MAR, Smagris E, Gusarova V, Sleeman M, Yancopoulos GD, Marchini J, Kang HM, Karalis K, Shuldiner AR, Della Gatta G, Locke AE, Baras A, Lotta LA. Germline Mutations in CIDEB and Protection against Liver Disease. N Engl J Med 2022; 387:332-344. [PMID: 35939579 DOI: 10.1056/nejmoa2117872] [Citation(s) in RCA: 42] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
BACKGROUND Exome sequencing in hundreds of thousands of persons may enable the identification of rare protein-coding genetic variants associated with protection from human diseases like liver cirrhosis, providing a strategy for the discovery of new therapeutic targets. METHODS We performed a multistage exome sequencing and genetic association analysis to identify genes in which rare protein-coding variants were associated with liver phenotypes. We conducted in vitro experiments to further characterize associations. RESULTS The multistage analysis involved 542,904 persons with available data on liver aminotransferase levels, 24,944 patients with various types of liver disease, and 490,636 controls without liver disease. We found that rare coding variants in APOB, ABCB4, SLC30A10, and TM6SF2 were associated with increased aminotransferase levels and an increased risk of liver disease. We also found that variants in CIDEB, which encodes a structural protein found in hepatic lipid droplets, had a protective effect. The burden of rare predicted loss-of-function variants plus missense variants in CIDEB (combined carrier frequency, 0.7%) was associated with decreased alanine aminotransferase levels (beta per allele, -1.24 U per liter; 95% confidence interval [CI], -1.66 to -0.83; P = 4.8×10-9) and with 33% lower odds of liver disease of any cause (odds ratio per allele, 0.67; 95% CI, 0.57 to 0.79; P = 9.9×10-7). Rare coding variants in CIDEB were associated with a decreased risk of liver disease across different underlying causes and different degrees of severity, including cirrhosis of any cause (odds ratio per allele, 0.50; 95% CI, 0.36 to 0.70). Among 3599 patients who had undergone bariatric surgery, rare coding variants in CIDEB were associated with a decreased nonalcoholic fatty liver disease activity score (beta per allele in score units, -0.98; 95% CI, -1.54 to -0.41 [scores range from 0 to 8, with higher scores indicating more severe disease]). In human hepatoma cell lines challenged with oleate, CIDEB small interfering RNA knockdown prevented the buildup of large lipid droplets. CONCLUSIONS Rare germline mutations in CIDEB conferred substantial protection from liver disease. (Funded by Regeneron Pharmaceuticals.).
Collapse
|
7
|
Lin N, Damask A, Boyapati A, Hamilton JD, Hamon S, Ternes N, Nivens MC, Penn J, Lopez A, Reid JG, Overton J, Shuldiner AR, Abecasis G, Baras A, Paulding C. UGT1A1 genetic variants are associated with increases in bilirubin levels in rheumatoid arthritis patients treated with sarilumab. THE PHARMACOGENOMICS JOURNAL 2022; 22:160-165. [PMID: 35149777 PMCID: PMC9151390 DOI: 10.1038/s41397-022-00269-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/26/2020] [Revised: 12/16/2021] [Accepted: 01/24/2022] [Indexed: 11/09/2022]
Abstract
Sarilumab is a human monoclonal antibody against interleukin (IL)-6Rα that has been approved for the treatment of adult patients with moderately to severely active rheumatoid arthritis (RA) and an inadequate response or intolerance to one or more disease-modifying antirheumatic drugs (DMARDs). Mild liver function test abnormalities have been observed in patients treated with sarilumab. We describe a genome-wide association study of bilirubin elevations in RA patients treated with sarilumab. Array genotyping and exome sequencing were performed on DNA samples from 1075 patients. Variants in the UGT1A1 gene were strongly associated with maximum bilirubin elevations in sarilumab-treated patients (rs4148325; p = 2.88 × 10−41) but were not associated with aminotransferase elevations. No other independent loci showed evidence of association with bilirubin elevations after sarilumab treatment. These findings suggest that most bilirubin increases during sarilumab treatment are related to genetic variation in UGT1A1 rather than underlying liver injury.
Collapse
|
8
|
Finucane B, Oetjens MT, Johns A, Myers SM, Fisher C, Habegger L, Maxwell EK, Reid JG, Ledbetter DH, Kirchner HL, Martin CL. Medical manifestations and health care utilization among adult MyCode participants with neurodevelopmental psychiatric copy number variants. Genet Med 2021; 24:703-711. [PMID: 34906480 DOI: 10.1016/j.gim.2021.11.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2021] [Revised: 10/07/2021] [Accepted: 11/12/2021] [Indexed: 10/19/2022] Open
Abstract
PURPOSE Recurrent pathogenic copy number variants (pCNVs) have large-effect impacts on brain function and represent important etiologies of neurodevelopmental psychiatric disorders (NPDs), including autism and schizophrenia. Patterns of health care utilization in adults with pCNVs have gone largely unstudied and are likely to differ in significant ways from those of children. METHODS We compared the prevalence of NPDs and electronic health record-based medical conditions in 928 adults with 26 pCNVs to a demographically-matched cohort of pCNV-negative controls from >135,000 patient-participants in Geisinger's MyCode Community Health Initiative. We also evaluated 3 quantitative health care utilization measures (outpatient, inpatient, and emergency department visits) in both groups. RESULTS Adults with pCNVs (24.9%) were more likely than controls (16.0%) to have a documented NPD. They had significantly higher rates of several chronic diseases, including diabetes (29.3% in participants with pCNVs vs 20.4% in participants without pCNVs) and dementia (2.2% in participants with pCNVs vs 1.0% participants without pCNVs), and twice as many annual emergency department visits. CONCLUSION These findings highlight the potential for genetic information-specifically, pCNVs-to inform the study of health care outcomes and utilization in adults. If, as our findings suggest, adults with pCNVs have poorer health and require disproportionate health care resources, early genetic diagnosis paired with patient-centered interventions may help to anticipate problems, improve outcomes, and reduce the associated economic burden.
Collapse
|
9
|
Gao C, Marcketta A, Backman JD, O'Dushlaine C, Staples J, Ferreira MAR, Lotta LA, Overton JD, Reid JG, Mirshahi T, Regeneron Genetics Center, Geisinger Regeneron Discovehr Collaboration, Baras A, Abecasis G, Shuldiner AR, Van Hout CV, McCarthy S. Genome-wide association analysis of serum alanine and aspartate aminotransferase, and the modifying effects of BMI in 388k European individuals. Genet Epidemiol 2021; 45:664-681. [PMID: 34184762 PMCID: PMC8457092 DOI: 10.1002/gepi.22392] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2021] [Revised: 04/17/2021] [Accepted: 05/17/2021] [Indexed: 12/21/2022]
Abstract
Serum alanine aminotransferase (ALT) and aspartate aminotransferase (AST) are biomarkers for liver health. Here we report the largest genome-wide association analysis to date of serum ALT and AST levels in over 388k people of European ancestry from UK biobank and DiscovEHR. Eleven million imputed markers with a minor allele frequency (MAF) ≥ 0.5% were analyzed. Overall, 300 ALT and 336 AST independent genome-wide significant associations were identified. Among them, 81 ALT and 61 AST associations are reported for the first time. Genome-wide interaction study identified 9 ALT and 12 AST independent associations significantly modified by body mass index (BMI), including several previously reported potential liver disease therapeutic targets, for example, PNPLA3, HSD17B13, and MARC1. While further work is necessary to understand the effect of ALT and AST-associated variants on liver disease, the weighted burden of significant BMI-modified signals is significantly associated with liver disease outcomes. In summary, this study identifies genetic associations which offer an important step forward in understanding the genetic architecture of serum ALT and AST levels. Significant interactions between BMI and genetic loci not only highlight the important role of adiposity in liver damage but also shed light on the genetic etiology of liver disease in obese individuals.
Collapse
|
10
|
Adler C, Ahammed Z, Allgower C, Amonett J, Anderson BD, Anderson M, Averichev GS, Balewski J, Barannikova O, Barnby LS, Baudot J, Bekele S, Belaga VV, Bellwied R, Berger J, Bichsel H, Billmeier A, Bland LC, Blyth CO, Bonner BE, Boucham A, Brandin A, Bravar A, Cadman RV, Caines H, Calderón de la Barca Sánchez M, Cardenas A, Carroll J, Castillo J, Castro M, Cebra D, Chaloupka P, Chattopadhyay S, Chen Y, Chernenko SP, Cherney M, Chikanian A, Choi B, Christie W, Coffin JP, Cormier TM, Cramer JG, Crawford HJ, Csanád M, Deng WS, Derevschikov AA, Didenko L, Dietel T, Draper JE, Dunin VB, Dunlop JC, Eckardt V, Efimov LG, Emelianov V, Engelage J, Eppley G, Erazmus B, Fachini P, Faine V, Filimonov K, Finch E, Fisyak Y, Flierl D, Foley KJ, Fu J, Gagliardi CA, Gagunashvili N, Gans J, Gaudichet L, Germain M, Geurts F, Ghazikhanian V, Grachov O, Grigoriev V, Guedon M, Gushin E, Hallman TJ, Hardtke D, Harris JW, Henry TW, Heppelmann S, Herston T, Hippolyte B, Hirsch A, Hjort E, Hoffmann GW, Horsley M, Huang HZ, Humanic TJ, Igo G, Ishihara A, Ivanshin YI, Jacobs P, Jacobs WW, Janik M, Johnson I, Jones PG, Judd EG, Kaneta M, Kaplan M, Keane D, Kiryluk J, Kisiel A, Klay J, Klein SR, Klyachko A, Konstantinov AS, Kopytine M, Kotchenda L, Kovalenko AD, Kramer M, Kravtsov P, Krueger K, Kuhn C, Kulikov AI, Kunde GJ, Kunz CL, Kutuev RK, Kuznetsov AA, Lakehal-Ayat L, Lamont MAC, Landgraf JM, Lange S, Lansdell CP, Lasiuk B, Laue F, Lebedev A, Lednický R, Leontiev VM, LeVine MJ, Li Q, Lindenbaum SJ, Lisa MA, Liu F, Liu L, Liu Z, Liu QJ, Ljubicic T, Llope WJ, LoCurto G, Long H, Longacre RS, Lopez-Noriega M, Love WA, Ludlam T, Lynn D, Ma J, Ma R, Majka R, Margetis S, Markert C, Martin L, Marx J, Matis HS, Matulenko YA, McShane TS, Meissner F, Melnick Y, Meschanin A, Messer M, Miller ML, Milosevich Z, Minaev NG, Mitchell J, Moiseenko VA, Moore CF, Morozov V, de Moura MM, Munhoz MG, Nelson JM, Nevski P, Niida T, Nikitin VA, Nogach LV, Norman B, Nurushev SB, Odyniec G, Ogawa A, Okorokov V, Oldenburg M, Olson D, Paic G, Pandey SU, Panebratsev Y, Panitkin SY, Pavlinov AI, Pawlak T, Perevoztchikov V, Peryt W, Petrov VA, Planinic M, Pluta J, Porile N, Porter J, Poskanzer AM, Potrebenikova E, Prindle D, Pruneau C, Putschke J, Rai G, Rakness G, Ravel O, Ray RL, Razin SV, Reichhold D, Reid JG, Retiere F, Ridiger A, Ritter HG, Roberts JB, Rogachevski OV, Romero JL, Rose A, Roy C, Rykov V, Sakrejda I, Salur S, Sandweiss J, Saulys AC, Savin I, Schambach J, Scharenberg RP, Schmitz N, Schroeder LS, Schüttauf A, Schweda K, Seger J, Seliverstov D, Seyboth P, Shahaliev E, Shestermanov KE, Shimanskii SS, Shvetcov VS, Skoro G, Smirnov N, Snellings R, Sorensen P, Sowinski J, Spinka HM, Srivastava B, Stephenson EJ, Stock R, Stolpovsky A, Strikhanov M, Stringfellow B, Struck C, Suaide AAP, Sugarbaker E, Suire C, Šumbera M, Surrow B, Symons TJM, Szanto de Toledo A, Szarwas P, Tai A, Takahashi J, Tang AH, Thomas JH, Thompson M, Tikhomirov V, Todoroki T, Tokarev M, Tonjes MB, Trainor TA, Trentalange S, Tribble RE, Trofimov V, Tsai O, Ullrich T, Underwood DG, Van Buren G, VanderMolen AM, Vasilevski IM, Vasiliev AN, Vigdor SE, Voloshin SA, Wang F, Ward H, Watson JW, Wells R, Westfall GD, Whitten C, Wieman H, Willson R, Wissink SW, Witt R, Wood J, Xu N, Xu Z, Yakutin AE, Yamamoto E, Yang J, Yepes P, Yurevich VI, Zanevski YV, Zborovský I, Zhang H, Zhang WM, Zoulkarneev R, Zubarev AN. Erratum: Azimuthal Anisotropy of K_{S}^{0} and Λ+Λ[over ¯] Production at Midrapidity from Au+Au Collisions at sqrt[s]_{NN}=130 GeV [Phys. Rev. Lett. 89, 132301 (2002)]. PHYSICAL REVIEW LETTERS 2021; 127:089901. [PMID: 34477449 DOI: 10.1103/physrevlett.127.089901] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/27/2021] [Indexed: 06/13/2023]
Abstract
This corrects the article DOI: 10.1103/PhysRevLett.89.132301.
Collapse
|
11
|
Adams J, Adler C, Aggarwal MM, Ahammed Z, Amonett J, Anderson BD, Anderson M, Arkhipkin D, Averichev GS, Badyal SK, Balewski J, Barannikova O, Barnby LS, Baudot J, Bekele S, Belaga VV, Bellwied R, Berger J, Bezverkhny BI, Bhardwaj S, Bhaskar P, Bhati AK, Bichsel H, Billmeier A, Bland LC, Blyth CO, Bonner BE, Botje M, Boucham A, Brandin A, Bravar A, Cadman RV, Cai XZ, Caines H, Calderón de la Barca Sánchez M, Carroll J, Castillo J, Castro M, Cebra D, Chaloupka P, Chattopadhyay S, Chen HF, Chen Y, Chernenko SP, Cherney M, Chikanian A, Choi B, Christie W, Coffin JP, Cormier TM, Cramer JG, Crawford HJ, Csanád M, Das D, Das S, Derevschikov AA, Didenko L, Dietel T, Dong WJ, Dong X, Draper JE, Du F, Dubey AK, Dunin VB, Dunlop JC, Dutta Majumdar MR, Eckardt V, Efimov LG, Emelianov V, Engelage J, Eppley G, Erazmus B, Estienne M, Fachini P, Faine V, Faivre J, Fatemi R, Filimonov K, Filip P, Finch E, Fisyak Y, Flierl D, Foley KJ, Fu J, Gagliardi CA, Gagunashvili N, Gans J, Ganti MS, Gaudichet L, Germain M, Geurts F, Ghazikhanian V, Ghosh P, Gonzalez JE, Grachov O, Grigoriev V, Gronstal S, Grosnick D, Guedon M, Guertin SM, Gupta A, Gushin E, Gutierrez TD, Hallman TJ, Hardtke D, Harris JW, Heinz M, Henry TW, Heppelmann S, Herston T, Hippolyte B, Hirsch A, Hjort E, Hoffmann GW, Horsley M, Huang HZ, Huang SL, Humanic TJ, Igo G, Ishihara A, Jacobs P, Jacobs WW, Janik M, Jiang H, Johnson I, Jones PG, Judd EG, Kabana S, Kaneta M, Kaplan M, Keane D, Khodyrev VY, Kiryluk J, Kisiel A, Klay J, Klein SR, Klyachko A, Koetke DD, Kollegger T, Kopytine M, Kotchenda L, Kovalenko AD, Kramer M, Kravtsov P, Kravtsov VI, Krueger K, Kuhn C, Kulikov AI, Kumar A, Kunde GJ, Kunz CL, Kutuev RK, Kuznetsov AA, Lamont MAC, Landgraf JM, Lange S, Lansdell CP, Lasiuk B, Laue F, Lauret J, Lebedev A, Lednický R, LeVine MJ, Li C, Li Q, Lindenbaum SJ, Lisa MA, Liu F, Liu L, Liu Z, Liu QJ, Ljubicic T, Llope WJ, Long H, Longacre RS, Lopez-Noriega M, Love WA, Ludlam T, Lynn D, Ma J, Ma R, Ma YG, Magestro D, Mahajan S, Mangotra LK, Mahapatra DP, Majka R, Manweiler R, Margetis S, Markert C, Martin L, Marx J, Matis HS, Matulenko YA, McShane TS, Meissner F, Melnick Y, Meschanin A, Messer M, Miller ML, Milosevich Z, Minaev NG, Mironov C, Mishra D, Mitchell J, Mohanty B, Molnar L, Moore CF, Mora-Corral MJ, Morozov DA, Morozov V, de Moura MM, Munhoz MG, Nandi BK, Nayak SK, Nayak TK, Nelson JM, Nevski P, Niida T, Nikitin VA, Nogach LV, Norman B, Nurushev SB, Odyniec G, Ogawa A, Okorokov V, Oldenburg M, Olson D, Paic G, Pandey SU, Pal SK, Panebratsev Y, Panitkin SY, Pavlinov AI, Pawlak T, Perevoztchikov V, Perkins C, Peryt W, Petrov VA, Phatak SC, Picha R, Planinic M, Pluta J, Porile N, Porter J, Poskanzer AM, Potekhin M, Potrebenikova E, Potukuchi BVKS, Prindle D, Pruneau C, Putschke J, Rai G, Rakness G, Raniwala R, Raniwala S, Ravel O, Ray RL, Razin SV, Reichhold D, Reid JG, Renault G, Retiere F, Ridiger A, Ritter HG, Roberts JB, Rogachevski OV, Romero JL, Rose A, Roy C, Ruan LJ, Sahoo R, Sakrejda I, Salur S, Sandweiss J, Savin I, Schambach J, Scharenberg RP, Schmitz N, Schroeder LS, Schweda K, Seger J, Seliverstov D, Seyboth P, Shahaliev E, Shao M, Sharma M, Shestermanov KE, Shimanskii SS, Singaraju RN, Simon F, Skoro G, Smirnov N, Snellings R, Sood G, Sorensen P, Sowinski J, Spinka HM, Srivastava B, Stanislaus S, Stock R, Stolpovsky A, Strikhanov M, Stringfellow B, Struck C, Suaide AAP, Sugarbaker E, Suire C, Šumbera M, Surrow B, Symons TJM, Szanto de Toledo A, Szarwas P, Tai A, Takahashi J, Tang AH, Thein D, Thomas JH, Tikhomirov V, Todoroki T, Tokarev M, Tonjes MB, Trainor TA, Trentalange S, Tribble RE, Trivedi MD, Trofimov V, Tsai O, Ullrich T, Underwood DG, Van Buren G, VanderMolen AM, Vasiliev AN, Vasiliev M, Vigdor SE, Viyogi YP, Voloshin SA, Waggoner W, Wang F, Wang G, Wang XL, Wang ZM, Ward H, Watson JW, Wells R, Westfall GD, Whitten C, Wieman H, Willson R, Wissink SW, Witt R, Wood J, Wu J, Xu N, Xu Z, Xu ZZ, Yamamoto E, Yepes P, Yurevich VI, Zanevski YV, Zborovský I, Zhang H, Zhang WM, Zhang ZP, Żołnierczuk PA, Zoulkarneev R, Zoulkarneeva J, Zubarev AN. Erratum: Azimuthal Anisotropy at the Relativistic Heavy Ion Collider: The First and Fourth Harmonics [Phys. Rev. Lett. 92, 062301 (2004)]. PHYSICAL REVIEW LETTERS 2021; 127:069901. [PMID: 34420354 DOI: 10.1103/physrevlett.127.069901] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/27/2021] [Indexed: 06/13/2023]
Abstract
This corrects the article DOI: 10.1103/PhysRevLett.92.062301.
Collapse
|
12
|
Akbari P, Gilani A, Sosina O, Kosmicki JA, Khrimian L, Fang YY, Persaud T, Garcia V, Sun D, Li A, Mbatchou J, Locke AE, Benner C, Verweij N, Lin N, Hossain S, Agostinucci K, Pascale JV, Dirice E, Dunn M, Kraus WE, Shah SH, Chen YDI, Rotter JI, Rader DJ, Melander O, Still CD, Mirshahi T, Carey DJ, Berumen-Campos J, Kuri-Morales P, Alegre-Díaz J, Torres JM, Emberson JR, Collins R, Balasubramanian S, Hawes A, Jones M, Zambrowicz B, Murphy AJ, Paulding C, Coppola G, Overton JD, Reid JG, Shuldiner AR, Cantor M, Kang HM, Abecasis GR, Karalis K, Economides AN, Marchini J, Yancopoulos GD, Sleeman MW, Altarejos J, Della Gatta G, Tapia-Conyer R, Schwartzman ML, Baras A, Ferreira MAR, Lotta LA. Sequencing of 640,000 exomes identifies GPR75 variants associated with protection from obesity. Science 2021; 373:373/6550/eabf8683. [PMID: 34210852 DOI: 10.1126/science.abf8683] [Citation(s) in RCA: 107] [Impact Index Per Article: 35.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2020] [Accepted: 05/17/2021] [Indexed: 12/11/2022]
Abstract
Large-scale human exome sequencing can identify rare protein-coding variants with a large impact on complex traits such as body adiposity. We sequenced the exomes of 645,626 individuals from the United Kingdom, the United States, and Mexico and estimated associations of rare coding variants with body mass index (BMI). We identified 16 genes with an exome-wide significant association with BMI, including those encoding five brain-expressed G protein-coupled receptors (CALCR, MC4R, GIPR, GPR151, and GPR75). Protein-truncating variants in GPR75 were observed in ~4/10,000 sequenced individuals and were associated with 1.8 kilograms per square meter lower BMI and 54% lower odds of obesity in the heterozygous state. Knock out of Gpr75 in mice resulted in resistance to weight gain and improved glycemic control in a high-fat diet model. Inhibition of GPR75 may provide a therapeutic strategy for obesity.
Collapse
|
13
|
Kosmicki JA, Horowitz JE, Banerjee N, Lanche R, Marcketta A, Maxwell E, Bai X, Sun D, Backman JD, Sharma D, Kury FSP, Kang HM, O'Dushlaine C, Yadav A, Mansfield AJ, Li AH, Watanabe K, Gurski L, McCarthy SE, Locke AE, Khalid S, O'Keeffe S, Mbatchou J, Chazara O, Huang Y, Kvikstad E, O'Neill A, Nioi P, Parker MM, Petrovski S, Runz H, Szustakowski JD, Wang Q, Wong E, Cordova-Palomera A, Smith EN, Szalma S, Zheng X, Esmaeeli S, Davis JW, Lai YP, Chen X, Justice AE, Leader JB, Mirshahi T, Carey DJ, Verma A, Sirugo G, Ritchie MD, Rader DJ, Povysil G, Goldstein DB, Kiryluk K, Pairo-Castineira E, Rawlik K, Pasko D, Walker S, Meynert A, Kousathanas A, Moutsianas L, Tenesa A, Caulfield M, Scott R, Wilson JF, Baillie JK, Butler-Laporte G, Nakanishi T, Lathrop M, Richards JB, Jones M, Balasubramanian S, Salerno W, Shuldiner AR, Marchini J, Overton JD, Habegger L, Cantor MN, Reid JG, Baras A, Abecasis GR, Ferreira MAR. Pan-ancestry exome-wide association analyses of COVID-19 outcomes in 586,157 individuals. Am J Hum Genet 2021; 108:1350-1355. [PMID: 34115965 PMCID: PMC8173480 DOI: 10.1016/j.ajhg.2021.05.017] [Citation(s) in RCA: 62] [Impact Index Per Article: 20.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2021] [Accepted: 05/24/2021] [Indexed: 01/08/2023] Open
Abstract
Severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) causes coronavirus disease 2019 (COVID-19), a respiratory illness that can result in hospitalization or death. We used exome sequence data to investigate associations between rare genetic variants and seven COVID-19 outcomes in 586,157 individuals, including 20,952 with COVID-19. After accounting for multiple testing, we did not identify any clear associations with rare variants either exome wide or when specifically focusing on (1) 13 interferon pathway genes in which rare deleterious variants have been reported in individuals with severe COVID-19, (2) 281 genes located in susceptibility loci identified by the COVID-19 Host Genetics Initiative, or (3) 32 additional genes of immunologic relevance and/or therapeutic potential. Our analyses indicate there are no significant associations with rare protein-coding variants with detectable effect sizes at our current sample sizes. Analyses will be updated as additional data become available, and results are publicly available through the Regeneron Genetics Center COVID-19 Results Browser.
Collapse
|
14
|
Horowitz JE, Kosmicki JA, Damask A, Sharma D, Roberts GHL, Justice AE, Banerjee N, Coignet MV, Yadav A, Leader JB, Marcketta A, Park DS, Lanche R, Maxwell E, Knight SC, Bai X, Guturu H, Sun D, Baltzell A, Kury FSP, Backman JD, Girshick AR, O'Dushlaine C, McCurdy SR, Partha R, Mansfield AJ, Turissini DA, Li AH, Zhang M, Mbatchou J, Watanabe K, Gurski L, McCarthy SE, Kang HM, Dobbyn L, Stahl E, Verma A, Sirugo G, Ritchie MD, Jones M, Balasubramanian S, Siminovitch K, Salerno WJ, Shuldiner AR, Rader DJ, Mirshahi T, Locke AE, Marchini J, Overton JD, Carey DJ, Habegger L, Cantor MN, Rand KA, Hong EL, Reid JG, Ball CA, Baras A, Abecasis GR, Ferreira MA. Genome-wide analysis in 756,646 individuals provides first genetic evidence that ACE2 expression influences COVID-19 risk and yields genetic risk scores predictive of severe disease. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2021. [PMID: 33619501 PMCID: PMC7899471 DOI: 10.1101/2020.12.14.20248176] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
SARS-CoV-2 enters host cells by binding angiotensin-converting enzyme 2 (ACE2). Through a genome-wide association study, we show that a rare variant (MAF = 0.3%, odds ratio 0.60, P=4.5×10-13) that down-regulates ACE2 expression reduces risk of COVID-19 disease, providing human genetics support for the hypothesis that ACE2 levels influence COVID-19 risk. Further, we show that common genetic variants define a risk score that predicts severe disease among COVID-19 cases.
Collapse
|
15
|
Lin MF, Bai X, Salerno WJ, Reid JG. Sparse Project VCF: efficient encoding of population genotype matrices. Bioinformatics 2021; 36:5537-5538. [PMID: 33300997 PMCID: PMC8016461 DOI: 10.1093/bioinformatics/btaa1004] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2020] [Revised: 11/13/2020] [Accepted: 11/20/2020] [Indexed: 11/13/2022] Open
Abstract
SUMMARY Variant Call Format (VCF), the prevailing representation for germline genotypes in population sequencing, suffers rapid size growth as larger cohorts are sequenced and more rare variants are discovered. We present Sparse Project VCF (spVCF), an evolution of VCF with judicious entropy reduction and run-length encoding, delivering >10× size reduction for modern studies with practically minimal information loss. spVCF interoperates with VCF efficiently, including tabix-based random access. We demonstrate its effectiveness with the DiscovEHR and UK Biobank whole-exome sequencing cohorts. AVAILABILITY AND IMPLEMENTATION Apache-licensed reference implementation: github.com/mlin/spVCF. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
|
16
|
Kosmicki JA, Horowitz JE, Banerjee N, Lanche R, Marcketta A, Maxwell E, Bai X, Sun D, Backman JD, Sharma D, Kang HM, O'Dushlaine C, Yadav A, Mansfield AJ, Li AH, Watanabe K, Gurski L, McCarthy SE, Locke AE, Khalid S, O'Keeffe S, Mbatchou J, Chazara O, Huang Y, Kvikstad E, O'Neill A, Nioi P, Parker MM, Petrovski S, Runz H, Szustakowski JD, Wang Q, Wong E, Cordova-Palomera A, Smith EN, Szalma S, Zheng X, Esmaeeli S, Davis JW, Lai YP, Chen X, Justice AE, Leader JB, Mirshahi T, Carey DJ, Verma A, Sirugo G, Ritchie MD, Rader DJ, Povysil G, Goldstein DB, Kiryluk K, Pairo-Castineira E, Rawlik K, Pasko D, Walker S, Meynert A, Kousathanas A, Moutsianas L, Tenesa A, Caulfield M, Scott R, Wilson JF, Baillie JK, Butler-Laporte G, Nakanishi T, Lathrop M, Richards JB, Jones M, Balasubramanian S, Salerno W, Shuldiner AR, Marchini J, Overton JD, Habegger L, Cantor MN, Reid JG, Baras A, Abecasis GR, Ferreira MA. A catalog of associations between rare coding variants and COVID-19 outcomes. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2021:2020.10.28.20221804. [PMID: 33655273 PMCID: PMC7924298 DOI: 10.1101/2020.10.28.20221804] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
Severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) causes coronavirus disease-19 (COVID-19), a respiratory illness that can result in hospitalization or death. We investigated associations between rare genetic variants and seven COVID-19 outcomes in 543,213 individuals, including 8,248 with COVID-19. After accounting for multiple testing, we did not identify any clear associations with rare variants either exome-wide or when specifically focusing on (i) 14 interferon pathway genes in which rare deleterious variants have been reported in severe COVID-19 patients; (ii) 167 genes located in COVID-19 GWAS risk loci; or (iii) 32 additional genes of immunologic relevance and/or therapeutic potential. Our analyses indicate there are no significant associations with rare protein-coding variants with detectable effect sizes at our current sample sizes. Analyses will be updated as additional data become available, with results publicly browsable at https://rgc-covid19.regeneron.com.
Collapse
|
17
|
Backman JD, Li AH, Marcketta A, Sun D, Mbatchou J, Kessler MD, Benner C, Liu D, Locke AE, Balasubramanian S, Yadav A, Banerjee N, Gillies CE, Damask A, Liu S, Bai X, Hawes A, Maxwell E, Gurski L, Watanabe K, Kosmicki JA, Rajagopal V, Mighty J, Jones M, Mitnaul L, Stahl E, Coppola G, Jorgenson E, Habegger L, Salerno WJ, Shuldiner AR, Lotta LA, Overton JD, Cantor MN, Reid JG, Yancopoulos G, Kang HM, Marchini J, Baras A, Abecasis GR, Ferreira MAR. Exome sequencing and analysis of 454,787 UK Biobank participants. Nature 2021; 599:628-634. [PMID: 34662886 PMCID: PMC8596853 DOI: 10.1038/s41586-021-04103-z] [Citation(s) in RCA: 311] [Impact Index Per Article: 103.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2021] [Accepted: 10/06/2021] [Indexed: 12/19/2022]
Abstract
A major goal in human genetics is to use natural variation to understand the phenotypic consequences of altering each protein-coding gene in the genome. Here we used exome sequencing1 to explore protein-altering variants and their consequences in 454,787 participants in the UK Biobank study2. We identified 12 million coding variants, including around 1 million loss-of-function and around 1.8 million deleterious missense variants. When these were tested for association with 3,994 health-related traits, we found 564 genes with trait associations at P ≤ 2.18 × 10-11. Rare variant associations were enriched in loci from genome-wide association studies (GWAS), but most (91%) were independent of common variant signals. We discovered several risk-increasing associations with traits related to liver disease, eye disease and cancer, among others, as well as risk-lowering associations for hypertension (SLC9A3R2), diabetes (MAP3K15, FAM234A) and asthma (SLC27A3). Six genes were associated with brain imaging phenotypes, including two involved in neural development (GBE1, PLD1). Of the signals available and powered for replication in an independent cohort, 81% were confirmed; furthermore, association signals were generally consistent across individuals of European, Asian and African ancestry. We illustrate the ability of exome sequencing to identify gene-trait associations, elucidate gene function and pinpoint effector genes that underlie GWAS signals at scale.
Collapse
|
18
|
Martin CL, Wain KE, Oetjens MT, Tolwinski K, Palen E, Hare-Harris A, Habegger L, Maxwell EK, Reid JG, Walsh LK, Myers SM, Ledbetter DH. Identification of Neuropsychiatric Copy Number Variants in a Health Care System Population. JAMA Psychiatry 2020; 77:1276-1285. [PMID: 32697297 PMCID: PMC7376464 DOI: 10.1001/jamapsychiatry.2020.2159] [Citation(s) in RCA: 42] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
IMPORTANCE Population screening for medically relevant genomic variants that cause diseases such as hereditary cancer and cardiovascular disorders is increasing to facilitate early disease detection or prevention. Neuropsychiatric disorders (NPDs) are common, complex disorders with clear genetic causes; yet, access to genetic diagnosis is limited. We explored whether inclusion of NPD in population-based genomic screening programs is warranted by assessing 3 key factors: prevalence, penetrance, and personal utility. OBJECTIVE To evaluate the suitability of including pathogenic copy number variants (CNVs) associated with NPD in population screening by determining their prevalence and penetrance and exploring the personal utility of disclosing results. DESIGN, SETTING, AND PARTICIPANTS In this cohort study, the frequency of 31 NPD CNVs was determined in patient-participants via exome data. Associated clinical phenotypes were assessed using linked electronic health records. Nine CNVs were selected for disclosure by licensed genetic counselors, and participants' psychosocial reactions were evaluated using a mixed-methods approach. A primarily adult population receiving medical care at Geisinger, a large integrated health care system in the United States with the only population-based genomic screening program approved for medically relevant results disclosure, was included. The cohort was identified from the Geisinger MyCode Community Health Initiative. Exome and linked electronic health record data were available for this cohort, which was recruited from February 2007 to April 2017. Data were collected for the qualitative analysis April 2017 through February 2018. Analysis began February 2018 and ended December 2019. MAIN OUTCOMES AND MEASURES The planned outcomes of this study include (1) prevalence estimate of NPD-associated CNVs in an unselected health care system population; (2) penetrance estimate of NPD diagnoses in CNV-positive individuals; and (3) qualitative themes that describe participants' responses to receiving NPD-associated genomic results. RESULTS Of 90 595 participants with CNV data, a pathogenic CNV was identified in 708 (0.8%; 436 women [61.6%]; mean [SD] age, 50.04 [18.74] years). Seventy percent (n = 494) had at least 1 associated clinical symptom. Of these, 28.8% (204) of CNV-positive individuals had an NPD code in their electronic health record, compared with 13.3% (11 835 of 89 887) of CNV-negative individuals (odds ratio, 2.21; 95% CI, 1.86-2.61; P < .001); 66.4% (470) of CNV-positive individuals had a history of depression and anxiety compared with 54.6% (49 118 of 89 887) of CNV-negative individuals (odds ratio, 1.53; 95% CI, 1.31-1.80; P < .001). 16p13.11 (71 [0.078%]) and 22q11.2 (108 [0.119%]) were the most prevalent deletions and duplications, respectively. Only 5.8% of individuals (41 of 708) had a previously known genetic diagnosis. Results disclosure was completed for 141 individuals. Positive participant responses included poignant reactions to learning a medical reason for lifelong cognitive and psychiatric disabilities. CONCLUSIONS AND RELEVANCE This study informs critical factors central to the development of population-based genomic screening programs and supports the inclusion of NPD in future designs to promote equitable access to clinically useful genomic information.
Collapse
|
19
|
Van Hout CV, Tachmazidou I, Backman JD, Hoffman JD, Liu D, Pandey AK, Gonzaga-Jauregui C, Khalid S, Ye B, Banerjee N, Li AH, O'Dushlaine C, Marcketta A, Staples J, Schurmann C, Hawes A, Maxwell E, Barnard L, Lopez A, Penn J, Habegger L, Blumenfeld AL, Bai X, O'Keeffe S, Yadav A, Praveen K, Jones M, Salerno WJ, Chung WK, Surakka I, Willer CJ, Hveem K, Leader JB, Carey DJ, Ledbetter DH, Cardon L, Yancopoulos GD, Economides A, Coppola G, Shuldiner AR, Balasubramanian S, Cantor M, Nelson MR, Whittaker J, Reid JG, Marchini J, Overton JD, Scott RA, Abecasis GR, Yerges-Armstrong L, Baras A. Exome sequencing and characterization of 49,960 individuals in the UK Biobank. Nature 2020; 586:749-756. [PMID: 33087929 PMCID: PMC7759458 DOI: 10.1038/s41586-020-2853-0] [Citation(s) in RCA: 275] [Impact Index Per Article: 68.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2019] [Accepted: 08/25/2020] [Indexed: 12/12/2022]
Abstract
The UK Biobank is a prospective study of 502,543 individuals, combining extensive phenotypic and genotypic data with streamlined access for researchers around the world1. Here we describe the release of exome-sequence data for the first 49,960 study participants, revealing approximately 4 million coding variants (of which around 98.6% have a frequency of less than 1%). The data include 198,269 autosomal predicted loss-of-function (LOF) variants, a more than 14-fold increase compared to the imputed sequence. Nearly all genes (more than 97%) had at least one carrier with a LOF variant, and most genes (more than 69%) had at least ten carriers with a LOF variant. We illustrate the power of characterizing LOF variants in this population through association analyses across 1,730 phenotypes. In addition to replicating established associations, we found novel LOF variants with large effects on disease traits, including PIEZO1 on varicose veins, COL6A1 on corneal resistance, MEPE on bone density, and IQGAP2 and GMPR on blood cell traits. We further demonstrate the value of exome sequencing by surveying the prevalence of pathogenic variants of clinical importance, and show that 2% of this population has a medically actionable variant. Furthermore, we characterize the penetrance of cancer in carriers of pathogenic BRCA1 and BRCA2 variants. Exome sequences from the first 49,960 participants highlight the promise of genome sequencing in large population-based studies and are now accessible to the scientific community.
Collapse
|
20
|
Gonzaga-Jauregui C, Ge W, Staples J, Van Hout C, Yadav A, Colonie R, Leader JB, Kirchner HL, Murray MF, Reid JG, Carey DJ, Overton JD, Shuldiner AR, Gottesman O, Gao S, Gromada J, Baras A, Altarejos J. Clinical and Molecular Prevalence of Lipodystrophy in an Unascertained Large Clinical Care Cohort. Diabetes 2020; 69:249-258. [PMID: 31836692 DOI: 10.2337/db19-0447] [Citation(s) in RCA: 38] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/02/2019] [Accepted: 11/18/2019] [Indexed: 11/13/2022]
Abstract
Lipodystrophies are a group of disorders characterized by absence or loss of adipose tissue and abnormal fat distribution, commonly accompanied by metabolic dysregulation. Although considered rare disorders, their prevalence in the general population is not well understood. We aimed to evaluate the clinical and genetic prevalence of lipodystrophy disorders in a large clinical care cohort. We interrogated the electronic health record (EHR) information of >1.3 million adults from the Geisinger Health System for lipodystrophy diagnostic codes. We estimate a clinical prevalence of disease of 1 in 20,000 individuals. We performed genetic analyses in individuals with available genomic data to identify variants associated with inherited lipodystrophies and examined their EHR for comorbidities associated with lipodystrophy. We identified 16 individuals carrying the p.R482Q pathogenic variant in LMNA associated with Dunnigan familial partial lipodystrophy. Four had a clinical diagnosis of lipodystrophy, whereas the remaining had no documented clinical diagnosis despite having accompanying metabolic abnormalities. We observed a lipodystrophy-associated variant carrier frequency of 1 in 3,082 individuals in our cohort with substantial burden of metabolic dysregulation. We estimate a genetic prevalence of disease of ∼1 in 7,000 in the general population. Partial lipodystrophy is an underdiagnosed condition. and its prevalence, as defined molecularly, is higher than previously reported. Genetically guided stratification of patients with common metabolic disorders, like diabetes and dyslipidemia, is an important step toward precision medicine.
Collapse
|
21
|
Park J, Damrauer SM, Baras A, Reid JG, Overton JD, Gonzalez-Alegre P. Epidemiology of DYT1 dystonia: Estimating prevalence via genetic ascertainment. NEUROLOGY-GENETICS 2019; 5:e358. [PMID: 31583275 PMCID: PMC6745720 DOI: 10.1212/nxg.0000000000000358] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/04/2019] [Accepted: 08/02/2019] [Indexed: 11/17/2022]
Abstract
Objective To estimate the prevalence of TOR1A sequence variants associated with DYT1 dystonia. Methods We determined the frequency of the common trinucleotide deletion that causes DYT1 in the Genome Aggregation Database and the Penn Medicine Biobank, totaling exomes from over 135,000 individuals. We also evaluated the prevalence of other possible pathogenic variants in this gene and asked whether the D216H polymorphism is linked to a higher diagnostic rate for dystonia independent of the DYT1-causing mutation. Results The estimated range of prevalence of the most common pathogenic variant that causes DYT1 is ∼17.6–26.1 carriers per 100,000 individuals. Based on the different data sets used, we predict that there are between 54,366 and 80,891 mutation carriers in the United States, which, due to the reduced penetrance of this variant, would translate into 16,475–24,513 DYT1 patients. Conclusions Our data provide a prevalence estimate of the most common DYT1 mutation in the general population. This information is specifically important for those with interest in the development of precision therapeutics for dystonia.
Collapse
|
22
|
Williams KB, Brigatti KW, Puffenberger EG, Gonzaga-Jauregui C, Griffin LB, Martinez ED, Wenger OK, Yoder MA, Kandula VVR, Fox MD, Demczko MM, Poskitt L, Furuya KN, Reid JG, Overton JD, Baras A, Miles L, Radhakrishnan K, Carson VJ, Antonellis A, Jinks RN, Strauss KA. Homozygosity for a mutation affecting the catalytic domain of tyrosyl-tRNA synthetase (YARS) causes multisystem disease. Hum Mol Genet 2019; 28:525-538. [PMID: 30304524 DOI: 10.1093/hmg/ddy344] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2018] [Accepted: 09/21/2018] [Indexed: 12/21/2022] Open
Abstract
Aminoacyl-tRNA synthetases (ARSs) are critical for protein translation. Pathogenic variants of ARSs have been previously associated with peripheral neuropathy and multisystem disease in heterozygotes and homozygotes, respectively. We report seven related children homozygous for a novel mutation in tyrosyl-tRNA synthetase (YARS, c.499C > A, p.Pro167Thr) identified by whole exome sequencing. This variant lies within a highly conserved interface required for protein homodimerization, an essential step in YARS catalytic function. Affected children expressed a more severe phenotype than previously reported, including poor growth, developmental delay, brain dysmyelination, sensorineural hearing loss, nystagmus, progressive cholestatic liver disease, pancreatic insufficiency, hypoglycemia, anemia, intermittent proteinuria, recurrent bloodstream infections and chronic pulmonary disease. Related adults heterozygous for YARS p.Pro167Thr showed no evidence of peripheral neuropathy on electromyography, in contrast to previous reports for other YARS variants. Analysis of YARS p.Pro167Thr in yeast complementation assays revealed a loss-of-function, hypomorphic allele that significantly impaired growth. Recombinant YARS p.Pro167Thr demonstrated normal subcellular localization, but greatly diminished ability to homodimerize in human embryonic kidney cells. This work adds to a rapidly growing body of research emphasizing the importance of ARSs in multisystem disease and significantly expands the allelic and clinical heterogeneity of YARS-associated human disease. A deeper understanding of the role of YARS in human disease may inspire innovative therapies and improve care of affected patients.
Collapse
|
23
|
Gorvin CM, Metpally R, Stokes VJ, Hannan FM, Krishnamurthy SB, Overton JD, Reid JG, Breitwieser GE, Thakker RV. Large-scale exome datasets reveal a new class of adaptor-related protein complex 2 sigma subunit (AP2σ) mutations, located at the interface with the AP2 alpha subunit, that impair calcium-sensing receptor signalling. Hum Mol Genet 2019; 27:901-911. [PMID: 29325022 PMCID: PMC5982735 DOI: 10.1093/hmg/ddy010] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2017] [Accepted: 12/21/2017] [Indexed: 11/12/2022] Open
Abstract
Mutations of the sigma subunit of the heterotetrameric adaptor-related protein complex 2 (AP2σ) impair signalling of the calcium-sensing receptor (CaSR), and cause familial hypocalciuric hypercalcaemia type 3 (FHH3). To date, FHH3-associated AP2σ mutations have only been identified at one residue, Arg15. We hypothesized that additional rare AP2σ variants may also be associated with altered CaSR function and hypercalcaemia, and sought for these by analysing >111 995 exomes (>60 706 from ExAc and dbSNP, and 51 289 from the Geisinger Health System-Regeneron DiscovEHR dataset, which also contains clinical data). This identified 11 individuals to have 9 non-synonymous AP2σ variants (Arg3His, Arg15His (x3), Ala44Thr, Phe52Tyr, Arg61His, Thr112Met, Met117Ile, Glu122Gly and Glu142Lys) with 3 of the 4 individuals who had Arg15His and Met117Ile AP2σ variants having mild hypercalcaemia, thereby indicating a prevalence of FHH3-associated AP2σ mutations of ∼7.8 per 100 000 individuals. Structural modelling of the novel eight AP2σ variants (Arg3His, Ala44Thr, Phe52Tyr, Arg61His, Thr112Met, Met117Ile, Glu122Gly and Glu142Lys) predicted that the Arg3His, Thr112Met, Glu122Gly and Glu142Lys AP2σ variants would disrupt polar contacts within the AP2σ subunit or affect the interface between the AP2σ and AP2α subunits. Functional analyses of all eight AP2σ variants in CaSR-expressing cells demonstrated that the Thr112Met, Met117Ile and Glu142Lys variants, located in the AP2σ α4-α5 helical region that forms an interface with AP2α, impaired CaSR-mediated intracellular calcium (Cai2+) signalling, consistent with a loss of function, and this was rectified by treatment with the CaSR positive allosteric modulator cinacalcet. Thus, our studies demonstrate another potential class of FHH3-causing AP2σ mutations located at the AP2σ-AP2α interface.
Collapse
|
24
|
Chen C, Meng Q, Xia Y, Ding C, Wang L, Dai R, Cheng L, Gunaratne P, Gibbs RA, Min S, Coarfa C, Reid JG, Zhang C, Jiao C, Jiang Y, Giase G, Thomas A, Fitzgerald D, Brunetti T, Shieh A, Xia C, Wang Y, Wang Y, Badner JA, Gershon ES, White KP, Liu C. The transcription factor POU3F2 regulates a gene coexpression network in brain tissue from patients with psychiatric disorders. Sci Transl Med 2018; 10:scitranslmed.aat8178. [PMID: 30545964 DOI: 10.1126/scitranslmed.aat8178] [Citation(s) in RCA: 66] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2018] [Revised: 07/26/2018] [Accepted: 11/07/2018] [Indexed: 12/22/2022]
Abstract
Schizophrenia and bipolar disorder are complex psychiatric diseases with risks contributed by multiple genes. Dysregulation of gene expression has been implicated in these disorders, but little is known about such dysregulation in the human brain. We analyzed three transcriptome datasets from 394 postmortem brain tissue samples from patients with schizophrenia or bipolar disorder or from healthy control individuals without a known history of psychiatric disease. We built genome-wide coexpression networks that included microRNAs (miRNAs). We identified a coexpression network module that was differentially expressed in the brain tissue from patients compared to healthy control individuals. This module contained genes that were principally involved in glial and neural cell genesis and glial cell differentiation, and included schizophrenia risk genes carrying rare variants. This module included five miRNAs and 545 mRNAs, with six transcription factors serving as hub genes in this module. We found that the most connected transcription factor gene POU3F2, also identified on a genome-wide association study for bipolar disorder, could regulate the miRNA hsa-miR-320e and other putative target mRNAs. These regulatory relationships were replicated using PsychENCODE/BrainGVEX datasets and validated by knockdown and overexpression experiments in SH-SY5Y cells and human neural progenitor cells in vitro. Thus, we identified a brain gene expression module that was enriched for rare coding variants in genes associated with schizophrenia and that contained the putative bipolar disorder risk gene POU3F2 The transcription factor POU3F2 may be a key regulator of gene expression in this disease-associated gene coexpression module.
Collapse
|
25
|
Bohnen MS, Ma L, Zhu N, Qi H, McClenaghan C, Gonzaga-Jauregui C, Dewey FE, Overton JD, Reid JG, Shuldiner AR, Baras A, Sampson KJ, Bleda M, Hadinnapola C, Haimel M, Bogaard HJ, Church C, Coghlan G, Corris PA, Eyries M, Gibbs JSR, Girerd B, Houweling AC, Humbert M, Guignabert C, Kiely DG, Lawrie A, MacKenzie Ross RV, Martin JM, Montani D, Peacock AJ, Pepke-Zaba J, Soubrier F, Suntharalingam J, Toshner M, Treacy CM, Trembath RC, Vonk Noordegraaf A, Wharton J, Wilkins MR, Wort SJ, Yates K, Gräf S, Morrell NW, Krishnan U, Rosenzweig EB, Shen Y, Nichols CG, Kass RS, Chung WK. Loss-of-Function ABCC8 Mutations in Pulmonary Arterial Hypertension. CIRCULATION. GENOMIC AND PRECISION MEDICINE 2018; 11:e002087. [PMID: 30354297 PMCID: PMC6206877 DOI: 10.1161/circgen.118.002087] [Citation(s) in RCA: 50] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/24/2018] [Accepted: 08/01/2018] [Indexed: 11/16/2022]
Abstract
BACKGROUND In pulmonary arterial hypertension (PAH), pathological changes in pulmonary arterioles progressively raise pulmonary artery pressure and increase pulmonary vascular resistance, leading to right heart failure and high mortality rates. Recently, the first potassium channelopathy in PAH, because of mutations in KCNK3, was identified as a genetic cause and pharmacological target. METHODS Exome sequencing was performed to identify novel genes in a cohort of 99 pediatric and 134 adult-onset group I PAH patients. Novel rare variants in the gene identified were independently identified in a cohort of 680 adult-onset patients. Variants were expressed in COS cells and function assessed by patch-clamp and rubidium flux analysis. RESULTS We identified a de novo novel heterozygous predicted deleterious missense variant c.G2873A (p.R958H) in ABCC8 in a child with idiopathic PAH. We then evaluated all individuals in the original and a second cohort for rare or novel variants in ABCC8 and identified 11 additional heterozygous predicted damaging ABCC8 variants. ABCC8 encodes SUR1 (sulfonylurea receptor 1)-a regulatory subunit of the ATP-sensitive potassium channel. We observed loss of ATP-sensitive potassium channel function for all ABCC8 variants evaluated and pharmacological rescue of all channel currents in vitro by the SUR1 activator, diazoxide. CONCLUSIONS Novel and rare missense variants in ABCC8 are associated with PAH. Identified ABCC8 mutations decreased ATP-sensitive potassium channel function, which was pharmacologically recovered.
Collapse
|