1
|
Yu D, Koslovsky M, Steiner MC, Mohammadi K, Zhang C, Swartz MD. TRIO RVEMVS: A Bayesian framework for rare variant association analysis with expectation-maximization variable selection using family trio data. PLoS One 2024; 19:e0314502. [PMID: 39630689 PMCID: PMC11616829 DOI: 10.1371/journal.pone.0314502] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2024] [Accepted: 11/11/2024] [Indexed: 12/07/2024] Open
Abstract
It is commonly reported that rare variants may be more functionally related to complex diseases than common variants. However, individual rare variant association tests remain challenging due to low minor allele frequency in the available samples. This paper proposes an expectation maximization variable selection (EMVS) method to simultaneously detect common and rare variants at the individual variant level using family trio data. TRIO_RVEMVS was assessed in both large (1500 families) and small (350 families) datasets based on simulation. The performance of TRIO_RVEMVS was compared with gene-level kernel and burden association tests that use pedigree data (PedGene) and rare-variant extensions of the transmission disequilibrium test (RV-TDT). At the region level, TRIO_RVEMVS outperformed PedGene and RV-TDT when common variants were included. TRIO_RVEMVS performed competitively with PedGene and outperformed RV-TDT when the analysis was only restricted to rare variants. At the individual variants level, with 1,500 trios, the average true positive rate of individual rare variants that were polymorphic across 500 datasets was 12.20%, and the average false positive rate was 0.74%. In the datasets with 350 trios, the average true and false positive rates of individual rare variants were 13.10% and 1.30%, respectively. When applying TRIO_RVEMVS to real data from the Gabriella Miller Kids First Pediatric Research Program, it identified 3 rare variants in q24.21 and q24.22 associated with the risk of orofacial clefts in the Kids First European population.
Collapse
Affiliation(s)
- Duo Yu
- Division of Biostatistics, Data Science Institute, Medical College of Wisconsin, Milwaukee, Wisconsin, United States of America
| | - Matthew Koslovsky
- Department of Statistics, Colorado State University, Fort Collins, Colorado, United States of America
| | - Margaret C. Steiner
- Department of Human Genetics, University of Chicago, Chicago, Illinois, United States of America
| | - Kusha Mohammadi
- Department of Biostatistics and Data Management, Regeneron Pharmaceuticals, Inc., Tarrytown, New York, United States of America
| | - Chenguang Zhang
- Biostatistics and Research Decision Sciences, Merck & Co., Inc., North Wales, Pennsylvania, United States of America
| | - Michael D. Swartz
- Department of Biostatistics and Data Science, School of Public Health, The University of Texas Health Science Center at Houston, Houston, Texas, United States of America
| |
Collapse
|
2
|
Grinde KE, Browning BL, Reiner AP, Thornton TA, Browning SR. Adjusting for principal components can induce collider bias in genome-wide association studies. PLoS Genet 2024; 20:e1011242. [PMID: 39680601 PMCID: PMC11684764 DOI: 10.1371/journal.pgen.1011242] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2024] [Revised: 12/30/2024] [Accepted: 11/14/2024] [Indexed: 12/18/2024] Open
Abstract
Principal component analysis (PCA) is widely used to control for population structure in genome-wide association studies (GWAS). Top principal components (PCs) typically reflect population structure, but challenges arise in deciding how many PCs are needed and ensuring that PCs do not capture other artifacts such as regions with atypical linkage disequilibrium (LD). In response to the latter, many groups suggest performing LD pruning or excluding known high LD regions prior to PCA. However, these suggestions are not universally implemented and the implications for GWAS are not fully understood, especially in the context of admixed populations. In this paper, we investigate the impact of pre-processing and the number of PCs included in GWAS models in African American samples from the Women's Health Initiative SNP Health Association Resource and two Trans-Omics for Precision Medicine Whole Genome Sequencing Project contributing studies (Jackson Heart Study and Genetic Epidemiology of Chronic Obstructive Pulmonary Disease Study). In all three samples, we find the first PC is highly correlated with genome-wide ancestry whereas later PCs often capture local genomic features. The pattern of which, and how many, genetic variants are highly correlated with individual PCs differs from what has been observed in prior studies focused on European populations and leads to distinct downstream consequences: adjusting for such PCs yields biased effect size estimates and elevated rates of spurious associations due to the phenomenon of collider bias. Excluding high LD regions identified in previous studies does not resolve these issues. LD pruning proves more effective, but the optimal choice of thresholds varies across datasets. Altogether, our work highlights unique issues that arise when using PCA to control for ancestral heterogeneity in admixed populations and demonstrates the importance of careful pre-processing and diagnostics to ensure that PCs capturing multiple local genomic features are not included in GWAS models.
Collapse
Affiliation(s)
- Kelsey E. Grinde
- Department of Mathematics, Statistics, and Computer Science, Macalester College, Saint Paul, Minnesota, United States of America
| | - Brian L. Browning
- Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, Washington, United States of America
| | - Alexander P. Reiner
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, United States of America
- Department of Epidemiology, University of Washington, Seattle, Washington, United States of America
| | - Timothy A. Thornton
- Regeneron Genetics Center, Tarrytown, New York, United States of America
- Department of Biostatistics, University of Washington, Seattle, Washington, United States of America
| | - Sharon R. Browning
- Department of Biostatistics, University of Washington, Seattle, Washington, United States of America
| |
Collapse
|
3
|
Uttam V, Vohra V, Chhotaray S, Santhosh A, Diwakar V, Patel V, Gahlyan RK. Exome-wide comparative analyses revealed differentiating genomic regions for performance traits in Indian native buffaloes. Anim Biotechnol 2024; 35:2277376. [PMID: 37934017 DOI: 10.1080/10495398.2023.2277376] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2023]
Abstract
In India, 20 breeds of buffalo have been identified and registered, yet limited studies have been conducted to explore the performance potential of these breeds, especially in the Indian native breeds. This study is a maiden attempt to delineate the important variants and unique genes through exome sequencing for milk yield, milk composition, fertility, and adaptation traits in Indian local breeds of buffalo. In the present study, whole exome sequencing was performed on Chhattisgarhi (n = 3), Chilika (n = 4), Gojri (n = 3), and Murrah (n = 4) buffalo breeds and after stringent quality control, 4333, 6829, 4130, and 4854 InDels were revealed, respectively. Exome-wide FST along 100-kb sliding windows detected 27, 98, 38, and 35 outlier windows in Chhattisgarhi, Chilika, Gojri, and Murrah, respectively. The comparative exome analysis of InDels and subsequent gene ontology revealed unique breed specific genes for milk yield (CAMSAP3), milk composition (CLCN1, NUDT3), fertility (PTGER3) and adaptation (KCNA3, TH) traits. Study provides insight into mechanism of how these breeds have evolved under natural selection, the impact of these events on their respective genomes, and their importance in maintaining purity of these breeds for the traits under study. Additionally, this result will underwrite to the genetic acquaintance of these breeds for breeding application, and in understanding of evolution of these Indian local breeds.
Collapse
Affiliation(s)
- Vishakha Uttam
- Animal Genetics & Breeding Division, ICAR-National Dairy Research Institute, Karnal, Haryana, India
| | - Vikas Vohra
- Animal Genetics & Breeding Division, ICAR-National Dairy Research Institute, Karnal, Haryana, India
| | - Supriya Chhotaray
- Animal Genetics & Breeding Division, ICAR-National Dairy Research Institute, Karnal, Haryana, India
| | - Ameya Santhosh
- Animal Genetics & Breeding Division, ICAR-National Dairy Research Institute, Karnal, Haryana, India
| | - Vikas Diwakar
- Animal Genetics & Breeding Division, ICAR-National Dairy Research Institute, Karnal, Haryana, India
| | - Vaibhav Patel
- Animal Genetics & Breeding Division, ICAR-National Dairy Research Institute, Karnal, Haryana, India
| | - Rajesh Kumar Gahlyan
- Animal Genetics & Breeding Division, ICAR-National Dairy Research Institute, Karnal, Haryana, India
| |
Collapse
|
4
|
Shyr D, Dey R, Li X, Zhou H, Boerwinkle E, Buyske S, Daly M, Gibbs RA, Hall I, Matise T, Reeves C, Stitziel NO, Zody M, Neale BM, Lin X. Semi-supervised machine learning method for predicting homogeneous ancestry groups to assess Hardy-Weinberg equilibrium in diverse whole-genome sequencing studies. Am J Hum Genet 2024; 111:2129-2138. [PMID: 39270648 PMCID: PMC11480788 DOI: 10.1016/j.ajhg.2024.08.018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2024] [Revised: 08/22/2024] [Accepted: 08/22/2024] [Indexed: 09/15/2024] Open
Abstract
Large-scale, multi-ethnic whole-genome sequencing (WGS) studies, such as the National Human Genome Research Institute Genome Sequencing Program's Centers for Common Disease Genomics (CCDG), play an important role in increasing diversity for genetic research. Before performing association analyses, assessing Hardy-Weinberg equilibrium (HWE) is a crucial step in quality control procedures to remove low quality variants and ensure valid downstream analyses. Diverse WGS studies contain ancestrally heterogeneous samples; however, commonly used HWE methods assume that the samples are homogeneous. Therefore, directly applying these to the whole dataset can yield statistically invalid results. To account for this heterogeneity, HWE can be tested on subsets of samples that have genetically homogeneous ancestries and the results aggregated at each variant. To facilitate valid HWE subset testing, we developed a semi-supervised learning approach that predicts homogeneous ancestries based on the genotype. This method provides a convenient tool for estimating HWE in the presence of population structure and missing self-reported race and ethnicities in diverse WGS studies. In addition, assessing HWE within the homogeneous ancestries provides reliable HWE estimates that will directly benefit downstream analyses, including association analyses in WGS studies. We applied our proposed method on the CCDG dataset, predicting homogeneous genetic ancestry groups for 60,545 multi-ethnic WGS samples to assess HWE within each group.
Collapse
Affiliation(s)
- Derek Shyr
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA 02115, USA
| | - Rounak Dey
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA 02115, USA
| | - Xihao Li
- Department of Biostatistics, University of North Carolina at Chapel Hill Gillings School of Global Public Health, Chapel Hill, NC 27599, USA
| | - Hufeng Zhou
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA 02115, USA
| | - Eric Boerwinkle
- Department of Epidemiology, Human Genetics and Environmental Sciences, The University of Texas Health Science Center at Houston School of Public Health, Houston, TX 77030, USA
| | - Steve Buyske
- Department of Statistics, Rutgers University, Piscataway, NJ 08854, USA
| | - Mark Daly
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA; Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital, Boston, MA 02114, USA
| | - Richard A Gibbs
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA
| | - Ira Hall
- Center for Genomic Health, Department of Genetics, Yale School of Medicine, New Haven, CT 06510, USA
| | - Tara Matise
- Department of Genetics, Rutgers University, Piscataway, NJ 08854, USA
| | | | - Nathan O Stitziel
- Department of Medicine, Washington University School of Medicine, St. Louis, MO 63110, USA; Department of Genetics, Washington University School of Medicine, St. Louis, MO 63110, USA
| | | | - Benjamin M Neale
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA; Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital, Boston, MA 02114, USA
| | - Xihong Lin
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA 02115, USA; Department of Statistics, Harvard University, Cambridge, MA 02115, USA.
| |
Collapse
|
5
|
Davies NM, Hemani G, Neiderhiser JM, Martin HC, Mills MC, Visscher PM, Yengo L, Young AS, Keller MC. The importance of family-based sampling for biobanks. Nature 2024; 634:795-803. [PMID: 39443775 PMCID: PMC11623399 DOI: 10.1038/s41586-024-07721-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2023] [Accepted: 06/13/2024] [Indexed: 10/25/2024]
Abstract
Biobanks aim to improve our understanding of health and disease by collecting and analysing diverse biological and phenotypic information in large samples. So far, biobanks have largely pursued a population-based sampling strategy, where the individual is the unit of sampling, and familial relatedness occurs sporadically and by chance. This strategy has been remarkably efficient and successful, leading to thousands of scientific discoveries across multiple research domains, and plans for the next wave of biobanks are underway. In this Perspective, we discuss the strengths and limitations of a complementary sampling strategy for future biobanks based on oversampling of close genetic relatives. Such family-based samples facilitate research that clarifies causal relationships between putative risk factors and outcomes, particularly in estimates of genetic effects, because they enable analyses that reduce or eliminate confounding due to familial and demographic factors. Family-based biobank samples would also shed new light on fundamental questions across multiple fields that are often difficult to explore in population-based samples. Despite the potential for higher costs and greater analytical complexity, the many advantages of family-based samples should often outweigh their potential challenges.
Collapse
Affiliation(s)
- Neil M Davies
- Division of Psychiatry, University College London, London, UK.
- Department of Statistical Science, University College London, London, UK.
- Department of Public Health and Nursing, Norwegian University of Science and Technology, Trondheim, Norway.
| | - Gibran Hemani
- Medical Research Council Integrative Epidemiology Unit, University of Bristol, Bristol, UK
- Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK
| | - Jenae M Neiderhiser
- Department of Psychology, The Pennsylvania State University, University Park, PA, USA
| | - Hilary C Martin
- Human Genetics Programme, Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, UK
| | - Melinda C Mills
- Department of Economics, Econometrics & Finance, University of Groningen, Groningen, The Netherlands
- Department of Genetics, University Medical Centre Groningen, Groningen, The Netherlands
- Leverhulme Centre for Demographic Science, Nuffield Department of Population Health, University of Oxford, Oxford, UK
| | - Peter M Visscher
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland, Australia
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, Nuffield Department of Population Health, University of Oxford, Oxford, UK
| | - Loïc Yengo
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland, Australia
| | - Alexander Strudwick Young
- UCLA Anderson School of Management, Los Angeles, CA, USA
- Human Genetics Department, UCLA David Geffen School of Medicine, Los Angeles, CA, USA
| | - Matthew C Keller
- Institute for Behavioral Genetics, University of Colorado at Boulder, Boulder, CO, USA.
- Department of Psychology and Neuroscience, University of Colorado at Boulder, Boulder, CO, USA.
| |
Collapse
|
6
|
Alves I, Giemza J, Blum MGB, Bernhardsson C, Chatel S, Karakachoff M, Saint Pierre A, Herzig AF, Olaso R, Monteil M, Gallien V, Cabot E, Svensson E, Bacq D, Baron E, Berthelier C, Besse C, Blanché H, Bocher O, Boland A, Bonnaud S, Charpentier E, Dandine-Roulland C, Férec C, Fruchet C, Lecointe S, Le Floch E, Ludwig TE, Marenne G, Meyer V, Quellery E, Racimo F, Rouault K, Sandron F, Schott JJ, Velo-Suarez L, Violleau J, Willerslev E, Coativy Y, Jézéquel M, Le Bris D, Nicolas C, Pailler Y, Goldberg M, Zins M, Le Marec H, Jakobsson M, Darlu P, Génin E, Deleuze JF, Redon R, Dina C. Human genetic structure in Northwest France provides new insights into West European historical demography. Nat Commun 2024; 15:6710. [PMID: 39112481 PMCID: PMC11306750 DOI: 10.1038/s41467-024-51087-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2023] [Accepted: 07/29/2024] [Indexed: 08/10/2024] Open
Abstract
The demographical history of France remains largely understudied despite its central role toward understanding modern population structure across Western Europe. Here, by exploring publicly available Europe-wide genotype datasets together with the genomes of 3234 present-day and six newly sequenced medieval individuals from Northern France, we found extensive fine-scale population structure across Brittany and the downstream Loire basin and increased population differentiation between the northern and southern sides of the river Loire, associated with higher proportions of steppe vs. Neolithic-related ancestry. We also found increased allele sharing between individuals from Western Brittany and those associated with the Bell Beaker complex. Our results emphasise the need for investigating local populations to better understand the distribution of rare (putatively deleterious) variants across space and the importance of common genetic legacy in understanding the sharing of disease-related alleles between Brittany and people from western Britain and Ireland.
Collapse
Affiliation(s)
- Isabel Alves
- Nantes Université, CHU Nantes, CNRS, INSERM, l'institut du thorax, Nantes, France
- Université de Strasbourg, CNRS, GMGM, Strasbourg, France
| | - Joanna Giemza
- Nantes Université, CHU Nantes, CNRS, INSERM, l'institut du thorax, Nantes, France
| | - Michael G B Blum
- TIMC-IMAG, UMR 5525 CNRS, Univ. Grenoble Alpes, Grenoble, France
| | - Carolina Bernhardsson
- Department of Organismal Biology, Human Evolution, Uppsala University, Uppsala, Sweden
| | - Stéphanie Chatel
- Nantes Université, CHU Nantes, CNRS, INSERM, l'institut du thorax, Nantes, France
| | - Matilde Karakachoff
- Nantes Université, CHU Nantes, CNRS, INSERM, l'institut du thorax, Nantes, France
- Nantes Université, CHU Nantes, Pôle Hospitalo-Universitaire 11: Santé Publique, Clinique des données, INSERMCIC 1413, Nantes, France
| | | | | | - Robert Olaso
- Université Paris-Saclay, CEA, Centre National de Recherche en Génomique Humaine (CNRGH), Evry, France
- Labex GenMed, Evry, France
| | - Martial Monteil
- Nantes Université, CNRS, Ministère de la Culture, CReAAH, LARA, Nantes, France
| | - Véronique Gallien
- INRAP - Institut national de recherches archéologiques préventives, Paris, France
- CEPAM UMR7264 - Culture et Environnements, Préhistoire, Antiquité, Moyen-Age, Nice, France
| | - Elodie Cabot
- INRAP - Institut national de recherches archéologiques préventives, Paris, France
- Anthropologie Bio-Culturelle, Droit, Ethique et Santé, Faculté de Médecine Site Nord, Marseille, France
| | - Emma Svensson
- Department of Organismal Biology, Human Evolution, Uppsala University, Uppsala, Sweden
| | - Delphine Bacq
- Université Paris-Saclay, CEA, Centre National de Recherche en Génomique Humaine (CNRGH), Evry, France
- Labex GenMed, Evry, France
| | - Estelle Baron
- Nantes Université, CHU Nantes, CNRS, INSERM, l'institut du thorax, Nantes, France
| | - Charlotte Berthelier
- Nantes Université, CHU Nantes, CNRS, INSERM, l'institut du thorax, Nantes, France
| | - Céline Besse
- Université Paris-Saclay, CEA, Centre National de Recherche en Génomique Humaine (CNRGH), Evry, France
- Labex GenMed, Evry, France
| | | | - Ozvan Bocher
- Univ Brest, Inserm, EFS, UMR 1078, GGB, Brest, France
| | - Anne Boland
- Université Paris-Saclay, CEA, Centre National de Recherche en Génomique Humaine (CNRGH), Evry, France
- Labex GenMed, Evry, France
| | - Stéphanie Bonnaud
- Nantes Université, CHU Nantes, CNRS, INSERM, l'institut du thorax, Nantes, France
| | - Eric Charpentier
- Nantes Université, CHU Nantes, CNRS, INSERM, l'institut du thorax, Nantes, France
| | - Claire Dandine-Roulland
- Université Paris-Saclay, CEA, Centre National de Recherche en Génomique Humaine (CNRGH), Evry, France
- Labex GenMed, Evry, France
| | - Claude Férec
- Univ Brest, Inserm, EFS, UMR 1078, GGB, Brest, France
- CHRU Brest, Brest, France
| | - Christine Fruchet
- Nantes Université, CHU Nantes, CNRS, INSERM, l'institut du thorax, Nantes, France
| | - Simon Lecointe
- Nantes Université, CHU Nantes, CNRS, INSERM, l'institut du thorax, Nantes, France
| | - Edith Le Floch
- Université Paris-Saclay, CEA, Centre National de Recherche en Génomique Humaine (CNRGH), Evry, France
- Labex GenMed, Evry, France
| | - Thomas E Ludwig
- Univ Brest, Inserm, EFS, UMR 1078, GGB, Brest, France
- CHRU Brest, Brest, France
| | | | - Vincent Meyer
- Université Paris-Saclay, CEA, Centre National de Recherche en Génomique Humaine (CNRGH), Evry, France
| | - Elisabeth Quellery
- Nantes Université, CHU Nantes, CNRS, INSERM, l'institut du thorax, Nantes, France
| | - Fernando Racimo
- Section for Molecular Ecology and Evolution, Globe Institute, University of Copenhagen, Copenhagen, Denmark
| | - Karen Rouault
- Univ Brest, Inserm, EFS, UMR 1078, GGB, Brest, France
- CHRU Brest, Brest, France
| | - Florian Sandron
- Université Paris-Saclay, CEA, Centre National de Recherche en Génomique Humaine (CNRGH), Evry, France
- Labex GenMed, Evry, France
| | - Jean-Jacques Schott
- Nantes Université, CHU Nantes, CNRS, INSERM, l'institut du thorax, Nantes, France
| | | | - Jade Violleau
- Nantes Université, CHU Nantes, CNRS, INSERM, l'institut du thorax, Nantes, France
| | - Eske Willerslev
- Lundbeck GeoGenetics Centre, GLOBE Institute, University of Copenhagen, Copenhagen, Denmark
- Department of Zoology, University of Cambridge, Cambridge, UK
| | - Yves Coativy
- Centre de Recherche Bretonne et Celtique, UR 4451, Université de Bretagne Occidentale, Brest, France
| | - Mael Jézéquel
- Centre de Recherche Bretonne et Celtique, UR 4451, Université de Bretagne Occidentale, Brest, France
| | - Daniel Le Bris
- Centre de Recherche Bretonne et Celtique, UR 4451, Université de Bretagne Occidentale, Brest, France
| | - Clément Nicolas
- CNRS UMR 8215 Trajectoires, Université Paris 1 Panthéon-Sorbonne, Centre Malher, 9 rue Malher, Paris, France
| | - Yvan Pailler
- CPJ ArMeRIE UBO, UMR 6554 LETG, CNRS, Université de Brest, Université de Nantes, Université de Rennes 2, Institut Universitaire Européen de la Mer, Plouzané, France
| | - Marcel Goldberg
- Université Paris Cité, "Population-based Cohorts Unit", INSERM, Paris Saclay University, UVSQ, Paris, France
| | - Marie Zins
- Université Paris Cité, "Population-based Cohorts Unit", INSERM, Paris Saclay University, UVSQ, Paris, France
| | - Hervé Le Marec
- Nantes Université, CHU Nantes, CNRS, INSERM, l'institut du thorax, Nantes, France
| | - Mattias Jakobsson
- Department of Organismal Biology, Human Evolution, Uppsala University, Uppsala, Sweden
| | - Pierre Darlu
- UMR 7206 Eco-anthropologie, Musée de l'Homme, MNHN, CNRS, Université de Paris Cité, Paris, France
| | - Emmanuelle Génin
- Univ Brest, Inserm, EFS, UMR 1078, GGB, Brest, France
- CHRU Brest, Brest, France
| | - Jean-François Deleuze
- Université Paris-Saclay, CEA, Centre National de Recherche en Génomique Humaine (CNRGH), Evry, France
- Labex GenMed, Evry, France
- Fondation Jean Dausset, CEPH, Paris, France
| | - Richard Redon
- Nantes Université, CHU Nantes, CNRS, INSERM, l'institut du thorax, Nantes, France.
| | - Christian Dina
- Nantes Université, CHU Nantes, CNRS, INSERM, l'institut du thorax, Nantes, France.
| |
Collapse
|
7
|
Kosmicki JA, Marcketta A, Sharma D, Di Gioia SA, Batista S, Yang XM, Tzoneva G, Martinez H, Sidore C, Kessler MD, Horowitz JE, Roberts GHL, Justice AE, Banerjee N, Coignet MV, Leader JB, Park DS, Lanche R, Maxwell E, Knight SC, Bai X, Guturu H, Baltzell A, Girshick AR, McCurdy SR, Partha R, Mansfield AJ, Turissini DA, Zhang M, Mbatchou J, Watanabe K, Verma A, Sirugo G, Ritchie MD, Salerno WJ, Shuldiner AR, Rader DJ, Mirshahi T, Marchini J, Overton JD, Carey DJ, Habegger L, Reid JG, Economides A, Kyratsous C, Karalis K, Baum A, Cantor MN, Rand KA, Hong EL, Ball CA, Siminovitch K, Baras A, Abecasis GR, Ferreira MAR. Genetic risk factors for COVID-19 and influenza are largely distinct. Nat Genet 2024; 56:1592-1596. [PMID: 39103650 PMCID: PMC11319199 DOI: 10.1038/s41588-024-01844-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2022] [Accepted: 06/24/2024] [Indexed: 08/07/2024]
Abstract
Coronavirus disease 2019 (COVID-19) and influenza are respiratory illnesses caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and influenza viruses, respectively. Both diseases share symptoms and clinical risk factors1, but the extent to which these conditions have a common genetic etiology is unknown. This is partly because host genetic risk factors are well characterized for COVID-19 but not for influenza, with the largest published genome-wide association studies for these conditions including >2 million individuals2 and about 1,000 individuals3-6, respectively. Shared genetic risk factors could point to targets to prevent or treat both infections. Through a genetic study of 18,334 cases with a positive test for influenza and 276,295 controls, we show that published COVID-19 risk variants are not associated with influenza. Furthermore, we discovered and replicated an association between influenza infection and noncoding variants in B3GALT5 and ST6GAL1, neither of which was associated with COVID-19. In vitro small interfering RNA knockdown of ST6GAL1-an enzyme that adds sialic acid to the cell surface, which is used for viral entry-reduced influenza infectivity by 57%. These results mirror the observation that variants that downregulate ACE2, the SARS-CoV-2 receptor, protect against COVID-19 (ref. 7). Collectively, these findings highlight downregulation of key cell surface receptors used for viral entry as treatment opportunities to prevent COVID-19 and influenza.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | - Anurag Verma
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Giorgio Sirugo
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Marylyn D Ritchie
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | | | | | - Daniel J Rader
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | | | | | | | | | | | | | | | | | | | - Alina Baum
- Regeneron Genetics Center, Tarrytown, NY, USA
| | | | | | | | | | | | - Aris Baras
- Regeneron Genetics Center, Tarrytown, NY, USA
| | | | | |
Collapse
|
8
|
Blanc J, Berg JJ. Testing for differences in polygenic scores in the presence of confounding. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.03.12.532301. [PMID: 36993707 PMCID: PMC10055004 DOI: 10.1101/2023.03.12.532301] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
Polygenic scores have become an important tool in human genetics, enabling the prediction of individuals' phenotypes from their genotypes. Understanding how the pattern of differences in polygenic score predictions across individuals intersects with variation in ancestry can provide insights into the evolutionary forces acting on the trait in question, and is important for understanding health disparities. However, because most polygenic scores are computed using effect estimates from population samples, they are susceptible to confounding by both genetic and environmental effects that are correlated with ancestry. The extent to which this confounding drives patterns in the distribution of polygenic scores depends on patterns of population structure in both the original estimation panel and in the prediction/test panel. Here, we use theory from population and statistical genetics, together with simulations, to study the procedure of testing for an association between polygenic scores and axes of ancestry variation in the presence of confounding. We use a general model of genetic relatedness to describe how confounding in the estimation panel biases the distribution of polygenic scores in a way that depends on the degree of overlap in population structure between panels. We then show how this confounding can bias tests for associations between polygenic scores and important axes of ancestry variation in the test panel. Specifically, for any given test, there exists a single axis of population structure in the GWAS panel that needs to be controlled for in order to protect the test. Based on this result, we propose a new approach for directly estimating this axis of population structure in the GWAS panel. We then use simulations to compare the performance of this approach to the standard approach in which the principal components of the GWAS panel genotypes are used to control for stratification.
Collapse
Affiliation(s)
- Jennifer Blanc
- Department of Human Genetics, University of Chicago, Chicago, IL, USA
| | - Jeremy J. Berg
- Department of Human Genetics, University of Chicago, Chicago, IL, USA
| |
Collapse
|
9
|
Tran LN, Sun CK, Struck TJ, Sajan M, Gutenkunst RN. Computationally Efficient Demographic History Inference from Allele Frequencies with Supervised Machine Learning. Mol Biol Evol 2024; 41:msae077. [PMID: 38636507 PMCID: PMC11082913 DOI: 10.1093/molbev/msae077] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2023] [Revised: 04/08/2024] [Accepted: 04/12/2024] [Indexed: 04/20/2024] Open
Abstract
Inferring past demographic history of natural populations from genomic data is of central concern in many studies across research fields. Previously, our group had developed dadi, a widely used demographic history inference method based on the allele frequency spectrum (AFS) and maximum composite-likelihood optimization. However, dadi's optimization procedure can be computationally expensive. Here, we present donni (demography optimization via neural network inference), a new inference method based on dadi that is more efficient while maintaining comparable inference accuracy. For each dadi-supported demographic model, donni simulates the expected AFS for a range of model parameters then trains a set of Mean Variance Estimation neural networks using the simulated AFS. Trained networks can then be used to instantaneously infer the model parameters from future genomic data summarized by an AFS. We demonstrate that for many demographic models, donni can infer some parameters, such as population size changes, very well and other parameters, such as migration rates and times of demographic events, fairly well. Importantly, donni provides both parameter and confidence interval estimates from input AFS with accuracy comparable to parameters inferred by dadi's likelihood optimization while bypassing its long and computationally intensive evaluation process. donni's performance demonstrates that supervised machine learning algorithms may be a promising avenue for developing more sustainable and computationally efficient demographic history inference methods.
Collapse
Affiliation(s)
- Linh N Tran
- Genetics Graduate Interdisciplinary Program, University of Arizona, Tucson, AZ 85721, USA
- Department of Molecular & Cellular Biology, University of Arizona, Tucson, AZ 85721, USA
| | - Connie K Sun
- Department of Molecular & Cellular Biology, University of Arizona, Tucson, AZ 85721, USA
| | - Travis J Struck
- Department of Molecular & Cellular Biology, University of Arizona, Tucson, AZ 85721, USA
| | - Mathews Sajan
- Department of Molecular & Cellular Biology, University of Arizona, Tucson, AZ 85721, USA
| | - Ryan N Gutenkunst
- Department of Molecular & Cellular Biology, University of Arizona, Tucson, AZ 85721, USA
| |
Collapse
|
10
|
Grinde KE, Browning BL, Reiner AP, Thornton TA, Browning SR. Adjusting for principal components can induce spurious associations in genome-wide association studies in admixed populations. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.02.587682. [PMID: 38617337 PMCID: PMC11014513 DOI: 10.1101/2024.04.02.587682] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 04/24/2024]
Abstract
Principal component analysis (PCA) is widely used to control for population structure in genome-wide association studies (GWAS). Top principal components (PCs) typically reflect population structure, but challenges arise in deciding how many PCs are needed and ensuring that PCs do not capture other artifacts such as regions with atypical linkage disequilibrium (LD). In response to the latter, many groups suggest performing LD pruning or excluding known high LD regions prior to PCA. However, these suggestions are not universally implemented and the implications for GWAS are not fully understood, especially in the context of admixed populations. In this paper, we investigate the impact of pre-processing and the number of PCs included in GWAS models in African American samples from the Women's Women's Health Initiative SNP Health Association Resource and two Trans-Omics for Precision Medicine Whole Genome Sequencing Project contributing studies (Jackson Heart Study and Genetic Epidemiology of Chronic Obstructive Pulmonary Disease Study). In all three samples, we find the first PC is highly correlated with genome-wide ancestry whereas later PCs often capture local genomic features. The pattern of which, and how many, genetic variants are highly correlated with individual PCs differs from what has been observed in prior studies focused on European populations and leads to distinct downstream consequences: adjusting for such PCs yields biased effect size estimates and elevated rates of spurious associations due to the phenomenon of collider bias. Excluding high LD regions identified in previous studies does not resolve these issues. LD pruning proves more effective, but the optimal choice of thresholds varies across datasets. Altogether, our work highlights unique issues that arise when using PCA to control for ancestral heterogeneity in admixed populations and demonstrates the importance of careful pre-processing and diagnostics to ensure that PCs capturing multiple local genomic features are not included in GWAS models.
Collapse
Affiliation(s)
- Kelsey E. Grinde
- Department of Mathematics, Statistics, and Computer Science, Macalester College, Saint Paul, Minnesota, 55105, USA
| | - Brian L. Browning
- Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, Washington, 98195, USA
| | - Alexander P. Reiner
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, 98109, USA
- Department of Epidemiology, University of Washington, Seattle, Washington, 98195, USA
| | - Timothy A. Thornton
- Regeneron Genetics Center, Tarrytown, New York, 10591, USA
- Department of Biostatistics, University of Washington, Seattle, Washington, 98195, USA
| | - Sharon R. Browning
- Department of Biostatistics, University of Washington, Seattle, Washington, 98195, USA
| |
Collapse
|
11
|
Han SH, Camp SY, Chu H, Collins R, Gillani R, Park J, Bakouny Z, Ricker CA, Reardon B, Moore N, Kofman E, Labaki C, Braun D, Choueiri TK, AlDubayan SH, Van Allen EM. Integrative Analysis of Germline Rare Variants in Clear and Non-clear Cell Renal Cell Carcinoma. EUR UROL SUPPL 2024; 62:107-122. [PMID: 38496821 PMCID: PMC10940785 DOI: 10.1016/j.euros.2024.02.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/12/2024] [Indexed: 03/19/2024] Open
Abstract
Background and objective Previous germline studies on renal cell carcinoma (RCC) have usually pooled clear and non-clear cell RCCs and have not adequately accounted for population stratification, which might have led to an inaccurate estimation of genetic risk. Here, we aim to analyze the major germline drivers of RCC risk and clinically relevant but underexplored germline variant types. Methods We first characterized germline pathogenic variants (PVs), cryptic splice variants, and copy number variants (CNVs) in 1436 unselected RCC patients. To evaluate the enrichment of PVs in RCC, we conducted a case-control study of 1356 RCC patients ancestry matched with 16 512 cancer-free controls using approaches accounting for population stratification and histological subtypes, followed by characterization of secondary somatic events. Key findings and limitations Clear cell RCC patients (n = 976) exhibited a significant burden of PVs in VHL compared with controls (odds ratio [OR]: 39.1, p = 4.95e-05). Non-clear cell RCC patients (n = 380) carried enrichment of PVs in FH (OR: 77.9, p = 1.55e-08) and MET (OR: 1.98e11, p = 2.07e-05). In a CHEK2-focused analysis with European participants, clear cell RCC (n = 906) harbored nominal enrichment of low-penetrance CHEK2 variants-p.Ile157Thr (OR: 1.84, p = 0.049) and p.Ser428Phe (OR: 5.20, p = 0.045), while non-clear cell RCC (n = 295) exhibited nominal enrichment of CHEK2 loss of function PVs (OR: 3.51, p = 0.033). Patients with germline PVs in FH, MET, and VHL exhibited significantly earlier age of cancer onset than patients without germline PVs (mean: 46.0 vs 60.2 yr, p < 0.0001), and more than half had secondary somatic events affecting the same gene (n = 10/15, 66.7%). Conversely, CHEK2 PV carriers exhibited a similar age of onset to patients without germline PVs (mean: 60.1 vs 60.2 yr, p = 0.99), and only 30.4% carried somatic events in CHEK2 (n = 7/23). Finally, pathogenic germline cryptic splice variants were identified in SDHA and TSC1, and pathogenic germline CNVs were found in 18 patients, including CNVs in FH, SDHA, and VHL. Conclusions and clinical implications This analysis supports the existing link between several RCC risk genes and RCC risk manifesting in earlier age of onset. It calls for caution when assessing the role of CHEK2 due to the burden of founder variants with varying population frequency. It also broadens the definition of the RCC germline landscape of pathogenicity to incorporate previously understudied types of germline variants. Patient summary In this study, we carefully compared the frequency of rare inherited mutations with a focus on patients' genetic ancestry. We discovered that subtle variations in genetic background may confound a case-control analysis, especially in evaluating the cancer risk associated with specific genes, such as CHEK2. We also identified previously less explored forms of rare inherited mutations, which could potentially increase the risk of kidney cancer.
Collapse
Affiliation(s)
- Seung Hun Han
- Ph.D. Program in Biological and Biomedical Sciences, Harvard Medical School, Boston, MA, USA
- Department of Medical Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
- Cancer Program, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Sabrina Y. Camp
- Department of Medical Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
- Cancer Program, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Hoyin Chu
- Department of Medical Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
- Cancer Program, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Ryan Collins
- Department of Medical Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
- Cancer Program, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Medicine, Harvard Medical School, Boston, MA, USA
| | - Riaz Gillani
- Cancer Program, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Pediatric Oncology, Dana-Farber Cancer Institute, Boston, MA, USA
- Department of Pediatrics, Harvard Medical School, Boston, MA, USA
- Boston Children’s Hospital, Boston, MA, USA
| | - Jihye Park
- Department of Medical Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
- Cancer Program, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Ziad Bakouny
- Department of Medical Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
- Cancer Program, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Cora A. Ricker
- Department of Medical Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
- Cancer Program, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Brendan Reardon
- Department of Medical Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
- Cancer Program, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Nicholas Moore
- Department of Therapeutic Radiology, Yale School of Medicine, New Haven, CT, USA
| | - Eric Kofman
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA, USA
| | - Chris Labaki
- Department of Medical Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
| | - David Braun
- Center of Molecular and Cellular Oncology, Yale School of Medicine, New Haven, CT, USA
| | - Toni K. Choueiri
- Lank Center for Genitourinary Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
- Brigham and Women’s Hospital, Boston, MA, USA
| | - Saud H. AlDubayan
- Department of Medical Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
- Cancer Program, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Division of Genetics, Brigham and Women’s Hospital, Boston, MA, USA
- College of Medicine, King Saud bin Abdulaziz University for Health Sciences, Riyadh, Saudi Arabia
| | - Eliezer M. Van Allen
- Department of Medical Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
- Cancer Program, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Center for Cancer Genomics, Dana-Farber Cancer Institute, Boston, MA, USA
| |
Collapse
|
12
|
Rehmann CT, Ralph PL, Kern AD. Evaluating evidence for co-geography in the Anopheles-Plasmodium host-parasite system. G3 (BETHESDA, MD.) 2024; 14:jkae008. [PMID: 38230808 PMCID: PMC10917517 DOI: 10.1093/g3journal/jkae008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/08/2023] [Revised: 11/08/2023] [Accepted: 12/22/2023] [Indexed: 01/18/2024]
Abstract
The often tight association between parasites and their hosts means that under certain scenarios, the evolutionary histories of the two species can become closely coupled both through time and across space. Using spatial genetic inference, we identify a potential signal of common dispersal patterns in the Anopheles gambiae and Plasmodium falciparum host-parasite system as seen through a between-species correlation of the differences between geographic sampling location and geographic location predicted from the genome. This correlation may be due to coupled dispersal dynamics between host and parasite but may also reflect statistical artifacts due to uneven spatial distribution of sampling locations. Using continuous-space population genetics simulations, we investigate the degree to which uneven distribution of sampling locations leads to bias in prediction of spatial location from genetic data and implement methods to counter this effect. We demonstrate that while algorithmic bias presents a problem in inference from spatio-genetic data, the correlation structure between A. gambiae and P. falciparum predictions cannot be attributed to spatial bias alone and is thus likely a genetic signal of co-dispersal in a host-parasite system.
Collapse
Affiliation(s)
- Clara T Rehmann
- Institute of Ecology and Evolution and Department of Biology, University of Oregon, Eugene 97403, USA
| | - Peter L Ralph
- Institute of Ecology and Evolution and Department of Biology, University of Oregon, Eugene 97403, USA
- Department of Mathematics, University of Oregon, Eugene 97403, USA
| | - Andrew D Kern
- Institute of Ecology and Evolution and Department of Biology, University of Oregon, Eugene 97403, USA
| |
Collapse
|
13
|
Tran LN, Sun CK, Struck TJ, Sajan M, Gutenkunst RN. Computationally efficient demographic history inference from allele frequencies with supervised machine learning. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.05.24.542158. [PMID: 38405827 PMCID: PMC10888863 DOI: 10.1101/2023.05.24.542158] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/27/2024]
Abstract
Inferring past demographic history of natural populations from genomic data is of central concern in many studies across research fields. Previously, our group had developed dadi, a widely used demographic history inference method based on the allele frequency spectrum (AFS) and maximum composite likelihood optimization. However, dadi's optimization procedure can be computationally expensive. Here, we developed donni (demography optimization via neural network inference), a new inference method based on dadi that is more efficient while maintaining comparable inference accuracy. For each dadi-supported demographic model, donni simulates the expected AFS for a range of model parameters then trains a set of Mean Variance Estimation neural networks using the simulated AFS. Trained networks can then be used to instantaneously infer the model parameters from future input data AFS. We demonstrated that for many demographic models, donni can infer some parameters, such as population size changes, very well and other parameters, such as migration rates and times of demographic events, fairly well. Importantly, donni provides both parameter and confidence interval estimates from input AFS with accuracy comparable to parameters inferred by dadi's likelihood optimization while bypassing its long and computationally intensive evaluation process. donni's performance demonstrates that supervised machine learning algorithms may be a promising avenue for developing more sustainable and computationally efficient demographic history inference methods.
Collapse
Affiliation(s)
- Linh N. Tran
- Genetics Graduate Interdisciplinary Program, University of Arizona, Tucson, AZ, USA
- Department of Molecular & Cellular Biology, University of Arizona, Tucson, AZ, USA
| | - Connie K. Sun
- Department of Molecular & Cellular Biology, University of Arizona, Tucson, AZ, USA
| | - Travis J. Struck
- Department of Molecular & Cellular Biology, University of Arizona, Tucson, AZ, USA
| | - Mathews Sajan
- Department of Molecular & Cellular Biology, University of Arizona, Tucson, AZ, USA
| | - Ryan N. Gutenkunst
- Department of Molecular & Cellular Biology, University of Arizona, Tucson, AZ, USA
| |
Collapse
|
14
|
Marzouka NAD, Alnaqbi H, Al-Aamri A, Tay G, Alsafar H. Investigating the genetic makeup of the major histocompatibility complex (MHC) in the United Arab Emirates population through next-generation sequencing. Sci Rep 2024; 14:3392. [PMID: 38337023 PMCID: PMC10858242 DOI: 10.1038/s41598-024-53986-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2023] [Accepted: 02/07/2024] [Indexed: 02/12/2024] Open
Abstract
The Human leukocyte antigen (HLA) molecules are central to immune response and have associations with the phenotypes of various diseases and induced drug toxicity. Further, the role of HLA molecules in presenting antigens significantly affects the transplantation outcome. The objective of this study was to examine the extent of the diversity of HLA alleles in the population of the United Arab Emirates (UAE) using Next-Generation Sequencing methodologies and encompassing a larger cohort of individuals. A cohort of 570 unrelated healthy citizens of the UAE volunteered to provide samples for Whole Genome Sequencing and Whole Exome Sequencing. The definition of the HLA alleles was achieved through the application of the bioinformatics tools, HLA-LA and xHLA. Subsequently, the findings from this study were compared with other local and international datasets. A broad range of HLA alleles in the UAE population, of which some were previously unreported, was identified. A comparison with other populations confirmed the current population's unique intertwined genetic heritage while highlighting similarities with populations from the Middle East region. Some disease-associated HLA alleles were detected at a frequency of > 5%, such as HLA-B*51:01, HLA-DRB1*03:01, HLA-DRB1*15:01, and HLA-DQB1*02:01. The increase in allele homozygosity, especially for HLA class I genes, was identified in samples with a higher level of genome-wide homozygosity. This highlights a possible effect of consanguinity on the HLA homozygosity. The HLA allele distribution in the UAE population showcases a unique profile, underscoring the need for tailored databases for traditional activities such as unrelated transplant matching and for newer initiatives in precision medicine based on specific populations. This research is part of a concerted effort to improve the knowledge base, particularly in the fields of transplant medicine and investigating disease associations as well as in understanding human migration patterns within the Arabian Peninsula and surrounding regions.
Collapse
Affiliation(s)
- Nour Al Dain Marzouka
- Center for Biotechnology, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates
| | - Halima Alnaqbi
- Center for Biotechnology, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates
| | - Amira Al-Aamri
- Center for Biotechnology, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates
| | - Guan Tay
- Division of Psychiatry, Faculty of Health and Medical Sciences, Medical School, The University of Western Australia, Crawley, WA, Australia
- School of Medical and Health Sciences, Edith Cowan University, Joondalup, WA, Australia
| | - Habiba Alsafar
- Center for Biotechnology, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates.
- College of Medicine and Health Sciences, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates.
- Department of Biomedical Engineering, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates.
| |
Collapse
|
15
|
Rehmann CT, Ralph PL, Kern AD. Evaluating evidence for co-geography in the Anopheles-Plasmodium host-parasite system. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.07.17.549405. [PMID: 37503196 PMCID: PMC10370088 DOI: 10.1101/2023.07.17.549405] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/29/2023]
Abstract
The often tight association between parasites and their hosts means that under certain scenarios, the evolutionary histories of the two species can become closely coupled both through time and across space. Using spatial genetic inference, we identify a potential signal of common dispersal patterns in the Anopheles gambiae and Plasmodium falciparum host-parasite system as seen through a between-species correlation of the differences between geographic sampling location and geographic location predicted from the genome. This correlation may be due to coupled dispersal dynamics between host and parasite, but may also reflect statistical artifacts due to uneven spatial distribution of sampling locations. Using continuous-space population genetics simulations, we investigate the degree to which uneven distribution of sampling locations leads to bias in prediction of spatial location from genetic data and implement methods to counter this effect. We demonstrate that while algorithmic bias presents a problem in inference from spatio-genetic data, the correlation structure between A. gambiae and P. falciparum predictions cannot be attributed to spatial bias alone, and is thus likely a genetic signal of co-dispersal in a host-parasite system.
Collapse
Affiliation(s)
- Clara T Rehmann
- University of Oregon, Institute of Ecology and Evolution and Department of Biology
| | - Peter L Ralph
- University of Oregon, Institute of Ecology and Evolution and Department of Biology
- University of Oregon, Department of Mathematics
| | - Andrew D Kern
- University of Oregon, Institute of Ecology and Evolution and Department of Biology
| |
Collapse
|
16
|
Chan TF, Rui X, Conti DV, Fornage M, Graff M, Haessler J, Haiman C, Highland HM, Jung SY, Kenny EE, Kooperberg C, Le Marchand L, North KE, Tao R, Wojcik G, Gignoux CR, Chiang CWK, Mancuso N. Estimating heritability explained by local ancestry and evaluating stratification bias in admixture mapping from summary statistics. Am J Hum Genet 2023; 110:1853-1862. [PMID: 37875120 PMCID: PMC10645552 DOI: 10.1016/j.ajhg.2023.09.012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2023] [Revised: 09/20/2023] [Accepted: 09/21/2023] [Indexed: 10/26/2023] Open
Abstract
The heritability explained by local ancestry markers in an admixed population (hγ2) provides crucial insight into the genetic architecture of a complex disease or trait. Estimation of hγ2 can be susceptible to biases due to population structure in ancestral populations. Here, we present heritability estimation from admixture mapping summary statistics (HAMSTA), an approach that uses summary statistics from admixture mapping to infer heritability explained by local ancestry while adjusting for biases due to ancestral stratification. Through extensive simulations, we demonstrate that HAMSTA hγ2 estimates are approximately unbiased and are robust to ancestral stratification compared to existing approaches. In the presence of ancestral stratification, we show a HAMSTA-derived sampling scheme provides a calibrated family-wise error rate (FWER) of ∼5% for admixture mapping, unlike existing FWER estimation approaches. We apply HAMSTA to 20 quantitative phenotypes of up to 15,988 self-reported African American individuals in the Population Architecture using Genomics and Epidemiology (PAGE) study. We observe hˆγ2 in the 20 phenotypes range from 0.0025 to 0.033 (mean hˆγ2 = 0.012 ± 9.2 × 10-4), which translates to hˆ2 ranging from 0.062 to 0.85 (mean hˆ2 = 0.30 ± 0.023). Across these phenotypes we find little evidence of inflation due to ancestral population stratification in current admixture mapping studies (mean inflation factor of 0.99 ± 0.001). Overall, HAMSTA provides a fast and powerful approach to estimate genome-wide heritability and evaluate biases in test statistics of admixture mapping studies.
Collapse
Affiliation(s)
- Tsz Fung Chan
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Xinyue Rui
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - David V Conti
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Myriam Fornage
- Brown Foundation Institute for Molecular Medicine, The University of Texas Health Science Center, Houston, TX, USA
| | - Mariaelisa Graff
- Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina, Chapel Hill, NC, USA
| | - Jeffrey Haessler
- Division of Public Health Sciences, Fred Hutchinson Cancer Center, Seattle, WA, USA
| | - Christopher Haiman
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Heather M Highland
- Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina, Chapel Hill, NC, USA
| | - Su Yon Jung
- Translational Sciences Section, School of Nursing, Jonsson Comprehensive Cancer Center, University of California, Los Angeles, Los Angeles, CA, USA
| | - Eimear E Kenny
- Institute for Genomic Health, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Charles Kooperberg
- Division of Public Health Sciences, Fred Hutchinson Cancer Center, Seattle, WA, USA
| | - Loic Le Marchand
- Epidemiology Program, University of Hawaii Cancer Center, Honolulu, HI, USA
| | - Kari E North
- Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina, Chapel Hill, NC, USA
| | - Ran Tao
- Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN, USA; Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Genevieve Wojcik
- Department of Epidemiology, Bloomberg School of Public Health, John Hopkins University, Baltimore, MD, USA
| | - Christopher R Gignoux
- Colorado Center for Personalized Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Charleston W K Chiang
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA; Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
| | - Nicholas Mancuso
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA; Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA.
| |
Collapse
|
17
|
Liu Z, Turkmen AS, Lin S. Population stratification correction using Bayesian shrinkage priors for genetic association studies. Ann Hum Genet 2023; 87:302-315. [PMID: 37771252 PMCID: PMC11624906 DOI: 10.1111/ahg.12527] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2022] [Revised: 08/20/2023] [Accepted: 08/24/2023] [Indexed: 09/30/2023]
Abstract
INTRODUCTION Population stratification (PS) is a major source of confounding in population-based genetic association studies of quantitative traits. Principal component regression (PCR) and linear mixed model (LMM) are two commonly used approaches to account for PS in association studies. Previous studies have shown that LMM can be interpreted as including all principal components (PCs) as random-effect covariates. However, including all PCs in LMM may dilute the influence of relevant PCs in some scenarios, while including only a few preselected PCs in PCR may fail to fully capture the genetic diversity. MATERIALS AND METHODS To address these shortcomings, we introduce Bayestrat-a method to detect associated variants with PS correction under the Bayesian LASSO framework. To adjust for PS, Bayestrat accommodates a large number of PCs and utilizes appropriate shrinkage priors to shrink the effects of nonassociated PCs. RESULTS Simulation results show that Bayestrat consistently controls type I error rates and achieves higher power compared to its non-shrinkage counterparts, especially when the number of PCs included in the model is large. As a demonstration of the utility of Bayestrat, we apply it to the Multi-Ethnic Study of Atherosclerosis (MESA). Variants and genes associated with serum triglyceride or HDL cholesterol are identified in our analyses. DISCUSSION The automatic and self-selection features of Bayestrat make it particularly suited in situations with complex underlying PS scenarios, where it is unknown a priori which PCs are potential confounders, yet the number that needs to be considered could be large in order to fully account for PS.
Collapse
Affiliation(s)
- Zilu Liu
- Department of Statistics, The Ohio State University, Columbus, OH 43210, USA
| | - Asuman S. Turkmen
- Department of Statistics, The Ohio State University, Columbus, OH 43210, USA
| | - Shili Lin
- Department of Statistics, The Ohio State University, Columbus, OH 43210, USA
| |
Collapse
|
18
|
Fan C, Cahoon JL, Dinh BL, Ortega-Del Vecchyo D, Huber C, Edge MD, Mancuso N, Chiang CWK. A likelihood-based framework for demographic inference from genealogical trees. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.10.10.561787. [PMID: 37873208 PMCID: PMC10592779 DOI: 10.1101/2023.10.10.561787] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/25/2023]
Abstract
The demographic history of a population drives the pattern of genetic variation and is encoded in the gene-genealogical trees of the sampled alleles. However, existing methods to infer demographic history from genetic data tend to use relatively low-dimensional summaries of the genealogy, such as allele frequency spectra. As a step toward capturing more of the information encoded in the genome-wide sequence of genealogical trees, here we propose a novel framework called the genealogical likelihood (gLike), which derives the full likelihood of a genealogical tree under any hypothesized demographic history. Employing a graph-based structure, gLike summarizes across independent trees the relationships among all lineages in a tree with all possible trajectories of population memberships through time and efficiently computes the exact marginal probability under a parameterized demographic model. Through extensive simulations and empirical applications on populations that have experienced multiple admixtures, we showed that gLike can accurately estimate dozens of demographic parameters when the true genealogy is known, including ancestral population sizes, admixture timing, and admixture proportions. Moreover, when using genealogical trees inferred from genetic data, we showed that gLike outperformed conventional demographic inference methods that leverage only the allele-frequency spectrum and yielded parameter estimates that align with established historical knowledge of the past demographic histories for populations like Latino Americans and Native Hawaiians. Furthermore, our framework can trace ancestral histories by analyzing a sample from the admixed population without proxies for its source populations, removing the need to sample ancestral populations that may no longer exist. Taken together, our proposed gLike framework harnesses underutilized genealogical information to offer exceptional sensitivity and accuracy in inferring complex demographies for humans and other species, particularly as estimation of genome-wide genealogies improves.
Collapse
Affiliation(s)
- Caoqi Fan
- Department of Quantitative and Computational Biology, University of Southern California
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California
| | - Jordan L Cahoon
- Department of Quantitative and Computational Biology, University of Southern California
- Department of Computer Science, University of Southern California
| | - Bryan L Dinh
- Department of Quantitative and Computational Biology, University of Southern California
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California
| | - Diego Ortega-Del Vecchyo
- Laboratorio Internacional de Investigación sobre el Genoma Humano, Universidad Nacional Autónoma de México, Juriquilla, Querétaro, México
| | - Christian Huber
- Department of Biology, Penn State University, University Park, PA, USA
| | - Michael D Edge
- Department of Quantitative and Computational Biology, University of Southern California
| | - Nicholas Mancuso
- Department of Quantitative and Computational Biology, University of Southern California
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California
| | - Charleston W K Chiang
- Department of Quantitative and Computational Biology, University of Southern California
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California
| |
Collapse
|
19
|
Chi J, Xu M, Sheng X, Zhou Y. Association detection between multiple traits and rare variants based on family data via a nonparametric method. PeerJ 2023; 11:e16040. [PMID: 37780393 PMCID: PMC10541022 DOI: 10.7717/peerj.16040] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2023] [Accepted: 08/15/2023] [Indexed: 10/03/2023] Open
Abstract
Background The rapid development of next-generation sequencing technologies allow people to analyze human complex diseases at the molecular level. It has been shown that rare variants play important roles for human diseases besides common variants. Thus, effective statistical methods need to be proposed to test for the associations between traits (e.g., diseases) and rare variants. Currently, more and more rare genetic variants are being detected throughout the human genome, which demonstrates the possibility to study rare variants. Yet complex diseases are usually measured as a variety of forms, such as binary, ordinal, quantitative, or some mixture of them. Therefore, the genetic mapping problem can be attributable to the association detection between multiple traits and multiple loci, with sufficiently considering the correlated structure among multiple traits. Methods In this article, we construct a new non-parametric statistic by the generalized Kendall's τ theory based on family data. The new test statistic has an asymptotic distribution, it can be used to study the associations between multiple traits and rare variants, which broadens the way to identify genetic factors of human complex diseases. Results We apply our method (called Nonp-FAM) to analyze simulated data and GAW17 data, and conduct comprehensive comparison with some existing methods. Experimental results show that the proposed family-based method is powerful and robust for testing associations between multiple traits and rare variants, even if the data has some population stratification effect.
Collapse
Affiliation(s)
- Jinling Chi
- Department of Statistics, Heilongjiang University, Harbin, China
- School of Mathematics and Statistics, Xidian University, Xi’an, China
| | - Meijuan Xu
- Department of Statistics, Heilongjiang University, Harbin, China
| | - Xiaona Sheng
- School of Information Engineering, Harbin University, Harbin, China
| | - Ying Zhou
- Department of Statistics, Heilongjiang University, Harbin, China
| |
Collapse
|
20
|
Havdahl A, Hughes AM, Sanderson E, Ask H, Cheesman R, Reichborn-Kjennerud T, Andreassen OA, Corfield EC, Hannigan L, Magnus P, Njølstad PR, Stoltenberg C, Torvik FA, Brandlistuen R, Smith GD, Ystrom E, Davies NM. Intergenerational effects of parental educational attainment on parenting and childhood educational outcomes: Evidence from MoBa using within-family Mendelian randomization. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.02.22.23285699. [PMID: 36865116 PMCID: PMC9980223 DOI: 10.1101/2023.02.22.23285699] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/24/2023]
Abstract
The intergenerational transmission of educational attainment from parents to their children is one of the most important and studied relationships in social science. Longitudinal studies have found strong associations between parents' and their children's educational outcomes, which could be due to the effects of parents. Here we provide new evidence about whether parents' educational attainment affects their parenting behaviours and children's early educational outcomes using within-family Mendelian randomization and data from 40,879 genotyped parent-child trios from the Norwegian Mother, Father and Child Cohort (MoBa) study. We found evidence suggesting that parents' educational attainment affects their children's educational outcomes from age 5 to 14. More studies are needed to provide more samples of parent-child trios and assess the potential consequences of selection bias and grandparental effects.
Collapse
Affiliation(s)
- Alexandra Havdahl
- Medical Research Council Integrative Epidemiology Unit at the University of Bristol, BS8 2BN, United Kingdom
- Centre for Genetic Epidemiology and Mental Health, Norwegian Institute of Public Health, Oslo, Norway
- Nic Waals Institute, Lovisenberg Diaconal Hospital, Oslo, Norway
- PROMENTA Research Center, Department of Psychology, University of Oslo, Oslo, Norway
| | - Amanda M Hughes
- Medical Research Council Integrative Epidemiology Unit at the University of Bristol, BS8 2BN, United Kingdom
- Population Health Sciences, Bristol Medical School, University of Bristol, Barley House, Oakfield Grove, Bristol, BS8 2BN, United Kingdom
| | - Eleanor Sanderson
- Medical Research Council Integrative Epidemiology Unit at the University of Bristol, BS8 2BN, United Kingdom
- Population Health Sciences, Bristol Medical School, University of Bristol, Barley House, Oakfield Grove, Bristol, BS8 2BN, United Kingdom
| | - Helga Ask
- Centre for Genetic Epidemiology and Mental Health, Norwegian Institute of Public Health, Oslo, Norway
- PROMENTA Research Center, Department of Psychology, University of Oslo, Oslo, Norway
| | - Rosa Cheesman
- PROMENTA Research Center, Department of Psychology, University of Oslo, Oslo, Norway
| | - Ted Reichborn-Kjennerud
- Department of Child Health and Development, Norwegian Institute of Public Health, Oslo, Norway
| | - Ole A. Andreassen
- NORMENT Centre, Division of Mental Health and Addiction, Oslo University Hospital & Institute of Clinical Medicine, University of Oslo, Oslo, Norway
| | - Elizabeth C. Corfield
- Centre for Genetic Epidemiology and Mental Health, Norwegian Institute of Public Health, Oslo, Norway
- Nic Waals Institute, Lovisenberg Diaconal Hospital, Oslo, Norway
| | - Laurie Hannigan
- Medical Research Council Integrative Epidemiology Unit at the University of Bristol, BS8 2BN, United Kingdom
- Centre for Genetic Epidemiology and Mental Health, Norwegian Institute of Public Health, Oslo, Norway
- Nic Waals Institute, Lovisenberg Diaconal Hospital, Oslo, Norway
| | - Per Magnus
- Department of Child Health and Development, Norwegian Institute of Public Health, Oslo, Norway
| | - Pål R. Njølstad
- Mohn Center for Diabetes Precision Medicine, Department of Clinical Science, University of Bergen, Bergen, Norway
- Children and Youth Clinic, Haukeland University Hospital, Bergen, Norway
| | - Camilla Stoltenberg
- Department of Child Health and Development, Norwegian Institute of Public Health, Oslo, Norway
| | - Fartein Ask Torvik
- PROMENTA Research Center, Department of Psychology, University of Oslo, Oslo, Norway
- Centre for Fertility and Health, Norwegian Institute of Public Health, Oslo, Norway
| | - Ragnhild Brandlistuen
- Department of Child Health and Development, Norwegian Institute of Public Health, Oslo, Norway
| | - George Davey Smith
- Medical Research Council Integrative Epidemiology Unit at the University of Bristol, BS8 2BN, United Kingdom
- Population Health Sciences, Bristol Medical School, University of Bristol, Barley House, Oakfield Grove, Bristol, BS8 2BN, United Kingdom
| | - Eivind Ystrom
- PROMENTA Research Center, Department of Psychology, University of Oslo, Oslo, Norway
- Department of Child Health and Development, Norwegian Institute of Public Health, Oslo, Norway
| | - Neil M Davies
- Medical Research Council Integrative Epidemiology Unit at the University of Bristol, BS8 2BN, United Kingdom
- K.G. Jebsen Center for Genetic Epidemiology, Department of Public Health and Nursing, Norwegian University of Science and Technology, Norway
- Division of Psychiatry, University College London, Maple House, 149 Tottenham Court Rd, London W1T 7NF
- Department of Statistical Sciences, University College London, London WC1E 6BT, UK
| |
Collapse
|
21
|
Abstract
Admixed populations constitute a large portion of global human genetic diversity, yet they are often left out of genomics analyses. This exclusion is problematic, as it leads to disparities in the understanding of the genetic structure and history of diverse cohorts and the performance of genomic medicine across populations. Admixed populations have particular statistical challenges, as they inherit genomic segments from multiple source populations-the primary reason they have historically been excluded from genetic studies. In recent years, however, an increasing number of statistical methods and software tools have been developed to account for and leverage admixture in the context of genomics analyses. Here, we provide a survey of such computational strategies for the informed consideration of admixture to allow for the well-calibrated inclusion of mixed ancestry populations in large-scale genomics studies, and we detail persisting gaps in existing tools.
Collapse
Affiliation(s)
- Taotao Tan
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA;
| | - Elizabeth G Atkinson
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA;
| |
Collapse
|
22
|
Gloss AD, Steiner MC, Novembre J, Bergelson J. The design of mapping populations: Impacts of geographic scale on genetic architecture and mapping efficacy for defense and immunity. CURRENT OPINION IN PLANT BIOLOGY 2023; 74:102399. [PMID: 37307746 PMCID: PMC10441534 DOI: 10.1016/j.pbi.2023.102399] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/27/2022] [Revised: 04/29/2023] [Accepted: 05/15/2023] [Indexed: 06/14/2023]
Abstract
Genome-wide association studies (GWAS) have yielded tremendous insight into the genetic architecture of trait variation. However, the collections of loci they uncover are far from exhaustive. As many of the complicating factors that confound or limit the efficacy of GWAS are exaggerated over broad geographic scales, a shift toward more analyses using mapping panels sampled from narrow geographic localities ("local" populations) could provide novel, complementary insights. Here, we present an overview of the major complicating factors, review mounting evidence from genomic analyses that these factors are pervasive, and synthesize theoretical and empirical evidence for the power of GWAS in local populations.
Collapse
Affiliation(s)
- Andrew D Gloss
- Department of Biology, Center for Genomics and Systems Biology, New York University, New York, NY, USA.
| | | | - John Novembre
- Department of Human Genetics, University of Chicago, Chicago, IL, USA; Department of Ecology & Evolution, University of Chicago, Chicago, IL, USA
| | - Joy Bergelson
- Department of Biology, Center for Genomics and Systems Biology, New York University, New York, NY, USA.
| |
Collapse
|
23
|
Zhou Z, Ku HC, Manning SE, Zhang M, Xing C. A Varying Coefficient Model to Jointly Test Genetic and Gene-Environment Interaction Effects. Behav Genet 2023; 53:374-382. [PMID: 36622576 PMCID: PMC10277225 DOI: 10.1007/s10519-022-10131-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2021] [Accepted: 12/18/2022] [Indexed: 01/10/2023]
Abstract
Most human traits are influenced by the interplay between genetic and environmental factors. Many statistical methods have been proposed to screen for gene-environment interaction (GxE) in the post genome-wide association study era. However, most of the existing methods assume a linear interaction between genetic and environmental factors toward phenotypic variations, which diminishes statistical power in the case of nonlinear GxE. In this paper, we present a flexible statistical procedure to detect GxE regardless of whether the underlying relationship is linear or not. By modeling the joint genetic and GxE effects as a varying-coefficient function of the environmental factor, the proposed model is able to capture dynamic trajectories of GxE. We employ a likelihood ratio test with a fast Monte Carlo algorithm for hypothesis testing. Simulations were conducted to evaluate validity and power of the proposed model in various settings. Real data analysis was performed to illustrate its power, in particular, in the case of nonlinear GxE.
Collapse
Affiliation(s)
- Zhengyang Zhou
- Department of Biostatistics and Epidemiology, University of North Texas Health Science Center, Fort Worth, TX, USA.
| | - Hung-Chih Ku
- Department of Mathematical Sciences, DePaul University, Chicago, IL, USA
| | - Sydney E Manning
- Department of Pharmacotherapy, University of North Texas Health Science Center, Fort Worth, TX, USA
| | - Ming Zhang
- Department of Statistical Science, Southern Methodist University, Dallas, TX, USA
| | - Chao Xing
- McDermott Center for Human Growth and Development and Department of Bioinformatics, University of Texas Southwestern Medical Center, Dallas, TX, USA.
| |
Collapse
|
24
|
Haider M, Schilling MP, Moest MH, Steiner FM, Schlick‐Steiner BC, Arthofer W. Evolutionary history of an Alpine Archaeognath ( Machilis pallida): Insights from different variant. Ecol Evol 2023; 13:e10227. [PMID: 37404697 PMCID: PMC10316371 DOI: 10.1002/ece3.10227] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2022] [Revised: 05/31/2023] [Accepted: 06/09/2023] [Indexed: 07/06/2023] Open
Abstract
Reconstruction of species histories is a central aspect of evolutionary biology. Patterns of genetic variation within and among populations can be leveraged to elucidate evolutionary processes and demographic histories. However, interpreting genetic signatures and unraveling the contributing processes can be challenging, in particular for non-model organisms with complex reproductive modes and genome organization. One way forward is the combined consideration of patterns revealed by different molecular markers (nuclear vs. mitochondrial) and types of variants (common vs. rare) that differ in their age, mode, and rate of evolution. Here, we applied this approach to RNAseq data generated for Machilis pallida (Archaeognatha), an Alpine jumping bristletail considered parthenogenetic and triploid. We generated de novo transcriptome and mitochondrial assemblies to obtain high-density data to investigate patterns of mitochondrial and common and rare nuclear variation in 17 M. pallida individuals sampled from all known populations. We find that the different variant types capture distinct aspects of the evolutionary history and discuss the observed patterns in the context of parthenogenesis, polyploidy, and survival during glaciation. This study highlights the potential of different variant types to gain insights into evolutionary scenarios even from challenging but often available data and the suitability of M. pallida and the genus Machilis as a study system for the evolution of sexual strategies and polyploidization during environmental change. We also emphasize the need for further research which will be stimulated and facilitated by these newly generated resources and insights.
Collapse
Affiliation(s)
- Marlene Haider
- Department of Ecology, Molecular Ecology GroupUniversity of InnsbruckInnsbruckAustria
| | - Martin P. Schilling
- Department of Ecology, Molecular Ecology GroupUniversity of InnsbruckInnsbruckAustria
| | - Markus H. Moest
- Department of Ecology, Molecular Ecology GroupUniversity of InnsbruckInnsbruckAustria
| | - Florian M. Steiner
- Department of Ecology, Molecular Ecology GroupUniversity of InnsbruckInnsbruckAustria
| | | | - Wolfgang Arthofer
- Department of Ecology, Molecular Ecology GroupUniversity of InnsbruckInnsbruckAustria
| |
Collapse
|
25
|
Zabad S, Gravel S, Li Y. Fast and accurate Bayesian polygenic risk modeling with variational inference. Am J Hum Genet 2023; 110:741-761. [PMID: 37030289 PMCID: PMC10183379 DOI: 10.1016/j.ajhg.2023.03.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2022] [Accepted: 03/13/2023] [Indexed: 04/10/2023] Open
Abstract
The advent of large-scale genome-wide association studies (GWASs) has motivated the development of statistical methods for phenotype prediction with single-nucleotide polymorphism (SNP) array data. These polygenic risk score (PRS) methods use a multiple linear regression framework to infer joint effect sizes of all genetic variants on the trait. Among the subset of PRS methods that operate on GWAS summary statistics, sparse Bayesian methods have shown competitive predictive ability. However, most existing Bayesian approaches employ Markov chain Monte Carlo (MCMC) algorithms, which are computationally inefficient and do not scale favorably to higher dimensions, for posterior inference. Here, we introduce variational inference of polygenic risk scores (VIPRS), a Bayesian summary statistics-based PRS method that utilizes variational inference techniques to approximate the posterior distribution for the effect sizes. Our experiments with 36 simulation configurations and 12 real phenotypes from the UK Biobank dataset demonstrated that VIPRS is consistently competitive with the state-of-the-art in prediction accuracy while being more than twice as fast as popular MCMC-based approaches. This performance advantage is robust across a variety of genetic architectures, SNP heritabilities, and independent GWAS cohorts. In addition to its competitive accuracy on the "White British" samples, VIPRS showed improved transferability when applied to other ethnic groups, with up to 1.7-fold increase in R2 among individuals of Nigerian ancestry for low-density lipoprotein (LDL) cholesterol. To illustrate its scalability, we applied VIPRS to a dataset of 9.6 million genetic markers, which conferred further improvements in prediction accuracy for highly polygenic traits, such as height.
Collapse
Affiliation(s)
- Shadi Zabad
- School of Computer Science, McGill University, Montreal, QC, Canada
| | - Simon Gravel
- Department of Human Genetics, McGill University, Montreal, QC, Canada.
| | - Yue Li
- School of Computer Science, McGill University, Montreal, QC, Canada.
| |
Collapse
|
26
|
Yang J, Xu Y, Yao M, Wang G, Liu Z. ERStruct: a fast Python package for inferring the number of top principal components from whole genome sequencing data. BMC Bioinformatics 2023; 24:180. [PMID: 37131141 PMCID: PMC10155328 DOI: 10.1186/s12859-023-05305-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2022] [Accepted: 04/25/2023] [Indexed: 05/04/2023] Open
Abstract
BACKGROUND Large-scale multi-ethnic DNA sequencing data is increasingly available owing to decreasing cost of modern sequencing technologies. Inference of the population structure with such sequencing data is fundamentally important. However, the ultra-dimensionality and complicated linkage disequilibrium patterns across the whole genome make it challenging to infer population structure using traditional principal component analysis based methods and software. RESULTS We present the ERStruct Python Package, which enables the inference of population structure using whole-genome sequencing data. By leveraging parallel computing and GPU acceleration, our package achieves significant improvements in the speed of matrix operations for large-scale data. Additionally, our package features adaptive data splitting capabilities to facilitate computation on GPUs with limited memory. CONCLUSION Our Python package ERStruct is an efficient and user-friendly tool for estimating the number of top informative principal components that capture population structure from whole genome sequencing data.
Collapse
Affiliation(s)
- Jinghan Yang
- Department of Statistics and Actuarial Science, The University of Hong Kong, Pokfulam, Hong Kong SAR, China
| | - Yuyang Xu
- Department of Statistics and Actuarial Science, The University of Hong Kong, Pokfulam, Hong Kong SAR, China
| | - Minhao Yao
- Department of Statistics and Actuarial Science, The University of Hong Kong, Pokfulam, Hong Kong SAR, China
| | - Gao Wang
- Department of Neurology, Gertrude. H. Sergievsky Center, Columbia University, New York, NY, USA
| | - Zhonghua Liu
- Department of Biostatistics, Columbia University, New York, NY, USA.
| |
Collapse
|
27
|
Gilbert E, Zurel H, MacMillan ME, Demiriz S, Mirhendi S, Merrigan M, O'Reilly S, Molloy AM, Brody LC, Bodmer W, Leach RA, Scott REM, Mugford G, Randhawa R, Stephens JC, Symington AL, Cavalleri GL, Phillips MS. The Newfoundland and Labrador mosaic founder population descends from an Irish and British diaspora from 300 years ago. Commun Biol 2023; 6:469. [PMID: 37117635 PMCID: PMC10147672 DOI: 10.1038/s42003-023-04844-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2022] [Accepted: 03/28/2023] [Indexed: 04/30/2023] Open
Abstract
The founder population of Newfoundland and Labrador (NL) is a unique genetic resource, in part due to its geographic and cultural isolation, where historical records describe a migration of European settlers, primarily from Ireland and England, to NL in the 18th and 19th centuries. Whilst its historical isolation, and increased prevalence of certain monogenic disorders are well appreciated, details of the fine-scale genetic structure and ancestry of the population are lacking. Understanding the genetic origins and background of functional, disease causing, genetic variants would aid genetic mapping efforts in the Province. Here, we leverage dense genome-wide SNP data on 1,807 NL individuals to reveal fine-scale genetic structure in NL that is clustered around coastal communities and correlated with Christian denomination. We show that the majority of NL European ancestry can be traced back to the south-east and south-west of Ireland and England, respectively. We date a substantial population size bottleneck approximately 10-15 generations ago in NL, associated with increased haplotype sharing and autozygosity. Our results reveal insights into the population history of NL and demonstrate evidence of a population conducive to further genetic studies and biomarker discovery.
Collapse
Affiliation(s)
- Edmund Gilbert
- School of Pharmacy and Biomolecular Sciences, Royal College of Surgeons in Ireland, Dublin, Ireland.
- FutureNeuro SFI Research Centre, Royal College of Surgeons in Ireland, Dublin, Ireland.
| | - Heather Zurel
- Sequence Bioinformatics, Inc., St. John's, Newfoundland and Labrador, Canada
| | | | - Sedat Demiriz
- Sequence Bioinformatics, Inc., St. John's, Newfoundland and Labrador, Canada
| | - Sadra Mirhendi
- Sequence Bioinformatics, Inc., St. John's, Newfoundland and Labrador, Canada
| | | | | | - Anne M Molloy
- School of Medicine, Trinity College, Dublin, Ireland
| | - Lawrence C Brody
- Genome Technology Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, 20892, USA
| | - Walter Bodmer
- Weatherall Institute of Molecular Medicine, John Radcliffe Hospital, Oxford, UK
| | - Richard A Leach
- Sequence Bioinformatics, Inc., St. John's, Newfoundland and Labrador, Canada
| | - Roderick E M Scott
- Sequence Bioinformatics, Inc., St. John's, Newfoundland and Labrador, Canada
| | - Gerald Mugford
- Sequence Bioinformatics, Inc., St. John's, Newfoundland and Labrador, Canada
| | - Ranjit Randhawa
- Sequence Bioinformatics, Inc., St. John's, Newfoundland and Labrador, Canada
| | | | - Alison L Symington
- Sequence Bioinformatics, Inc., St. John's, Newfoundland and Labrador, Canada
| | - Gianpiero L Cavalleri
- School of Pharmacy and Biomolecular Sciences, Royal College of Surgeons in Ireland, Dublin, Ireland
- FutureNeuro SFI Research Centre, Royal College of Surgeons in Ireland, Dublin, Ireland
| | - Michael S Phillips
- Sequence Bioinformatics, Inc., St. John's, Newfoundland and Labrador, Canada
| |
Collapse
|
28
|
Chan TF, Rui X, Conti DV, Fornage M, Graff M, Haessler J, Haiman C, Highland HM, Jung SY, Kenny E, Kooperberg C, Marchland LL, North KE, Tao R, Wojcik G, Gignoux CR, Chiang CWK, Mancuso N. Estimating heritability explained by local ancestry and evaluating stratification bias in admixture mapping from summary statistics. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.04.10.536252. [PMID: 37131817 PMCID: PMC10153181 DOI: 10.1101/2023.04.10.536252] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
The heritability explained by local ancestry markers in an admixed population h γ 2 provides crucial insight into the genetic architecture of a complex disease or trait. Estimation of h γ 2 can be susceptible to biases due to population structure in ancestral populations. Here, we present a novel approach, Heritability estimation from Admixture Mapping Summary STAtistics (HAMSTA), which uses summary statistics from admixture mapping to infer heritability explained by local ancestry while adjusting for biases due to ancestral stratification. Through extensive simulations, we demonstrate that HAMSTA h γ 2 estimates are approximately unbiased and are robust to ancestral stratification compared to existing approaches. In the presence of ancestral stratification, we show a HAMSTA-derived sampling scheme provides a calibrated family-wise error rate (FWER) of ~5% for admixture mapping, unlike existing FWER estimation approaches. We apply HAMSTA to 20 quantitative phenotypes of up to 15,988 self-reported African American individuals in the Population Architecture using Genomics and Epidemiology (PAGE) study. We observe h ˆ γ 2 in the 20 phenotypes range from 0.0025 to 0.033 (mean h ˆ γ 2 = 0.012 + / - 9.2 × 10 - 4 ), which translates to h ˆ 2 ranging from 0.062 to 0.85 (mean h ˆ 2 = 0.30 + / - 0.023 ). Across these phenotypes we find little evidence of inflation due to ancestral population stratification in current admixture mapping studies (mean inflation factor of 0.99 +/- 0.001). Overall, HAMSTA provides a fast and powerful approach to estimate genome-wide heritability and evaluate biases in test statistics of admixture mapping studies.
Collapse
Affiliation(s)
- Tsz Fung Chan
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California
| | - Xinyue Rui
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California
| | - David V Conti
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California
| | - Myriam Fornage
- Brown Foundation Institute for Molecular Medicine, The University of Texas Health Science Center, Houston, TX, USA
| | - Mariaelisa Graff
- Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina, Chapel Hill, North Carolina, United States of America
| | - Jeffrey Haessler
- Division of Public Health Sciences, Fred Hutchinson Cancer Center, Seattle, WA, USA
| | - Christopher Haiman
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California
| | - Heather M Highland
- Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina, Chapel Hill, North Carolina, United States of America
| | - Su Yon Jung
- Translational Sciences Section, School of Nursing, Jonsson Comprehensive Cancer Center, University of California, Los Angeles, Los Angeles, CA, United States
| | - Eimear Kenny
- Institute for Genomic Health, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Charles Kooperberg
- Division of Public Health Sciences, Fred Hutchinson Cancer Center, Seattle, WA, USA
| | - Loic Le Marchland
- Epidemiology Program, University of Hawaii Cancer Center, Honolulu, HI, USA
| | - Kari E North
- Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina, Chapel Hill, North Carolina, United States of America
| | - Ran Tao
- Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN, USA
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Genevieve Wojcik
- Department of Epidemiology, Bloomberg School of Public Health, John Hopkins University, Baltimore, MD, USA
| | - Christopher R Gignoux
- Colorado Center for Personalized Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Charleston W K Chiang
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California
- Department of Quantitative and Computational Biology, University of Southern California
| | - Nicholas Mancuso
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California
- Department of Quantitative and Computational Biology, University of Southern California
| |
Collapse
|
29
|
Link V, Schraiber JG, Fan C, Dinh B, Mancuso N, Chiang CW, Edge MD. Tree-based QTL mapping with expected local genetic relatedness matrices. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.04.07.536093. [PMID: 37066144 PMCID: PMC10104234 DOI: 10.1101/2023.04.07.536093] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/18/2023]
Abstract
Understanding the genetic basis of complex phenotypes is a central pursuit of genetics. Genome-wide Association Studies (GWAS) are a powerful way to find genetic loci associated with phenotypes. GWAS are widely and successfully used, but they face challenges related to the fact that variants are tested for association with a phenotype independently, whereas in reality variants at different sites are correlated because of their shared evolutionary history. One way to model this shared history is through the ancestral recombination graph (ARG), which encodes a series of local coalescent trees. Recent computational and methodological breakthroughs have made it feasible to estimate approximate ARGs from large-scale samples. Here, we explore the potential of an ARG-based approach to quantitative-trait locus (QTL) mapping, echoing existing variance-components approaches. We propose a framework that relies on the conditional expectation of a local genetic relatedness matrix given the ARG (local eGRM). Simulations show that our method is especially beneficial for finding QTLs in the presence of allelic heterogeneity. By framing QTL mapping in terms of the estimated ARG, we can also facilitate the detection of QTLs in understudied populations. We use local eGRM to identify a large-effect BMI locus, the CREBRF gene, in a sample of Native Hawaiians in which it was not previously detectable by GWAS because of a lack of population-specific imputation resources. Our investigations can provide intuition about the benefits of using estimated ARGs in population- and statistical-genetic methods in general.
Collapse
Affiliation(s)
- Vivian Link
- Department of Quantitative and Computational Biology, University of Southern California
| | - Joshua G. Schraiber
- Department of Quantitative and Computational Biology, University of Southern California
| | - Caoqi Fan
- Department of Quantitative and Computational Biology, University of Southern California
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California
| | - Bryan Dinh
- Department of Quantitative and Computational Biology, University of Southern California
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California
| | - Nicholas Mancuso
- Department of Quantitative and Computational Biology, University of Southern California
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California
| | - Charleston W.K. Chiang
- Department of Quantitative and Computational Biology, University of Southern California
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California
| | - Michael D. Edge
- Department of Quantitative and Computational Biology, University of Southern California
| |
Collapse
|
30
|
Nudel R, Allesøe RL, Werge T, Thompson WK, Rasmussen S, Benros ME. An immunogenetic investigation of 30 autoimmune and autoinflammatory diseases and their links to psychiatric disorders in a nationwide sample. Immunology 2023; 168:622-639. [PMID: 36273265 DOI: 10.1111/imm.13597] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2022] [Accepted: 10/20/2022] [Indexed: 11/05/2022] Open
Abstract
Autoimmune and autoinflammatory diseases (AIIDs) involve a deficit in an individual's immune system function, whereby the immune reaction is directed against self-antigens. Many AIIDs have a strong genetic component, but they can also be triggered by environmental factors. AIIDs often have a highly negative impact on the individual's physical and mental wellbeing. Understanding the genetic underpinning of AIIDs is thus crucial both for diagnosis and for identifying individuals at high risk of an AIID and mental illness as a result thereof. The aim of the present study was to perform systematic statistical and genetic analyses to assess the role of human leukocyte antigen (HLA) alleles in 30 AIIDs and to study the links between AIIDs and psychiatric disorders. We leveraged the Danish iPSYCH Consortium sample comprising 65 534 individuals diagnosed with psychiatric disorders or selected as part of a random population sample, for whom we also had genetic data and diagnoses of AIIDs. We employed regression analysis to examine comorbidities between AIIDs and psychiatric disorders and associations between AIIDs and HLA alleles across seven HLA genes. Our comorbidity analyses showed that overall AIID and five specific AIIDs were associated with having a psychiatric diagnosis. Our genetic analyses found 81 significant associations between HLA alleles and AIIDs. Lastly, we show connections across AIIDs, psychiatric disorders and infection susceptibility through network analysis of significant HLA associations in these disease classes. Combined, our results include both novel associations as well as replications of previously reported associations in a large sample, and highlight the genetic and epidemiological links between AIIDs and psychiatric disorders.
Collapse
Affiliation(s)
- Ron Nudel
- CORE-Copenhagen Research Centre for Mental Health, Mental Health Centre Copenhagen, Copenhagen University Hospital, Copenhagen, Denmark
- iPSYCH, The Lundbeck Foundation Initiative for Integrative Psychiatric Research, Aarhus, Denmark
| | - Rosa Lundbye Allesøe
- CORE-Copenhagen Research Centre for Mental Health, Mental Health Centre Copenhagen, Copenhagen University Hospital, Copenhagen, Denmark
- iPSYCH, The Lundbeck Foundation Initiative for Integrative Psychiatric Research, Aarhus, Denmark
- Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Thomas Werge
- iPSYCH, The Lundbeck Foundation Initiative for Integrative Psychiatric Research, Aarhus, Denmark
- Institute of Biological Psychiatry, Mental Health Centre Sct. Hans, Mental Health Services Copenhagen, Roskilde, Denmark
- Department of Clinical Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Wesley K Thompson
- iPSYCH, The Lundbeck Foundation Initiative for Integrative Psychiatric Research, Aarhus, Denmark
- Institute of Biological Psychiatry, Mental Health Centre Sct. Hans, Mental Health Services Copenhagen, Roskilde, Denmark
- Division of Biostatistics, Herbert Wertheim School of Public Health and Human Longevity Science, University of California, San Diego, California, USA
| | - Simon Rasmussen
- Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Michael E Benros
- CORE-Copenhagen Research Centre for Mental Health, Mental Health Centre Copenhagen, Copenhagen University Hospital, Copenhagen, Denmark
- iPSYCH, The Lundbeck Foundation Initiative for Integrative Psychiatric Research, Aarhus, Denmark
- Department of Immunology and Microbiology, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| |
Collapse
|
31
|
Wei X, Robles CR, Pazokitoroudi A, Ganna A, Gusev A, Durvasula A, Gazal S, Loh PR, Reich D, Sankararaman S. The lingering effects of Neanderthal introgression on human complex traits. eLife 2023; 12:e80757. [PMID: 36939312 PMCID: PMC10076017 DOI: 10.7554/elife.80757] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2022] [Accepted: 03/17/2023] [Indexed: 03/21/2023] Open
Abstract
The genetic variants introduced into the ancestors of modern humans from interbreeding with Neanderthals have been suggested to contribute an unexpected extent to complex human traits. However, testing this hypothesis has been challenging due to the idiosyncratic population genetic properties of introgressed variants. We developed rigorous methods to assess the contribution of introgressed Neanderthal variants to heritable trait variation and applied these methods to analyze 235,592 introgressed Neanderthal variants and 96 distinct phenotypes measured in about 300,000 unrelated white British individuals in the UK Biobank. Introgressed Neanderthal variants make a significant contribution to trait variation (explaining 0.12% of trait variation on average). However, the contribution of introgressed variants tends to be significantly depleted relative to modern human variants matched for allele frequency and linkage disequilibrium (about 59% depletion on average), consistent with purifying selection on introgressed variants. Different from previous studies (McArthur et al., 2021), we find no evidence for elevated heritability across the phenotypes examined. We identified 348 independent significant associations of introgressed Neanderthal variants with 64 phenotypes. Previous work (Skov et al., 2020) has suggested that a majority of such associations are likely driven by statistical association with nearby modern human variants that are the true causal variants. Applying a customized fine-mapping led us to identify 112 regions across 47 phenotypes containing 4303 unique genetic variants where introgressed variants are highly likely to have a phenotypic effect. Examination of these variants reveals their substantial impact on genes that are important for the immune system, development, and metabolism.
Collapse
Affiliation(s)
- Xinzhu Wei
- Department of Computational Biology, Cornell UniversityNew YorkUnited States
| | - Christopher R Robles
- Department of Human Genetics, University of California, Los AngelesLos AngelesUnited States
| | - Ali Pazokitoroudi
- Department of Computer Science, University of California, Los AngelesLos AngelesUnited States
| | - Andrea Ganna
- Analytical and Translational Genetics Unit, Center for Genomic Medicine, Massachusetts General HospitalBostonUnited States
- Program in Medical and Population Genetics, Broad Institute of MIT and HarvardCambridgeUnited States
- Stanley Center for Psychiatric Research, Broad Institute of MIT and HarvardCambridgeUnited States
| | - Alexander Gusev
- Dana-Farber Cancer Institute, Harvard Medical SchoolBostonUnited States
| | - Arun Durvasula
- Department of Genetics, Harvard Medical SchoolBostonUnited States
- Department of Human Evolutionary Biology, Harvard UniversityCambridgeUnited States
| | - Steven Gazal
- Center for Genetic Epidemiology, Department of Public and Population Health Sciences, University of Southern CaliforniaLos AngelesUnited States
- Division of Genetics,Department of Medicine, Brigham and Women’s Hospital, Harvard Medical SchoolBostonUnited States
| | - Po-Ru Loh
- Program in Medical and Population Genetics, Broad Institute of MIT and HarvardCambridgeUnited States
| | - David Reich
- Program in Medical and Population Genetics, Broad Institute of MIT and HarvardCambridgeUnited States
- Department of Genetics, Harvard Medical SchoolBostonUnited States
- Department of Human Evolutionary Biology, Harvard UniversityCambridgeUnited States
- Howard Hughes Medical Institute, Harvard Medical SchoolBostonUnited States
| | - Sriram Sankararaman
- Department of Human Genetics, University of California, Los AngelesLos AngelesUnited States
- Department of Computer Science, University of California, Los AngelesLos AngelesUnited States
- Department of Computational Medicine, University of California, Los AngelesLos AngelesUnited States
| |
Collapse
|
32
|
Mortensen Ó, Thomsen E, Lydersen LN, Apol KD, Weihe P, Steig BÁ, Andorsdóttir G, Als TD, Gregersen NO. FarGen: Elucidating the distribution of coding variants in the isolated population of the Faroe Islands. Eur J Hum Genet 2023; 31:329-337. [PMID: 36404349 PMCID: PMC9995356 DOI: 10.1038/s41431-022-01227-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2022] [Revised: 09/30/2022] [Accepted: 10/27/2022] [Indexed: 11/22/2022] Open
Abstract
Here we present results from FarGen Phase I exomes. This dataset is based on the FarGen cohort, which consists of 1,541 individuals from the isolated population of the Faroe Islands. The purpose of this cohort is to serve as a reference catalog of coding variants, and to conduct population genetic studies to better understand the genetic contribution to various diseases in the Faroese population. The first whole-exome data set comprise 465 individuals and a total of 148,267 genetic variants were discovered. Principle Component Analysis indicates that the population is isolated and weakly structured. The distribution of variants in various functional classes was compared with populations in the gnomAD dataset; the results indicated that the proportions were consistent across the cohorts, but probably due to a small sample size, the FarGen dataset contained relatively few rare variants. We identified 19 variants that are classified as pathogenic or likely pathogenic in ClinVar; several of these variants are associated with monogenetic diseases with increased prevalence in the Faroe Islands. The results support previous studies, which indicate that the Faroe Islands is an isolated and weakly structured population. Future studies may elucidate the significance of the 19 pathogenic variants that were identified. The FarGen Phase I dataset is an important step for genetic research in the Faroese population, and the next phase of FarGen will increase the sample size and broaden the scope.
Collapse
Affiliation(s)
- Ólavur Mortensen
- The Genetic Biobank of the Faroe Islands, Tórshavn, Faroe Islands
| | - Elisabet Thomsen
- The Genetic Biobank of the Faroe Islands, Tórshavn, Faroe Islands
| | | | - Katrin D Apol
- The Genetic Biobank of the Faroe Islands, Tórshavn, Faroe Islands
| | - Pál Weihe
- Department of Occupational Medicine and Public Health, National Hospital of the Faroe Islands Tórshavn, Tórshavn, Faroe Islands
| | - Bjarni Á Steig
- Medical Department, National Hospital of the Faroe Islands, Tórshavn, Faroe Islands
| | - Guðrið Andorsdóttir
- The Genetic Biobank of the Faroe Islands, Tórshavn, Faroe Islands
- Centre of Health Science, Faculty of Health, University of the Faroe Islands, Tórshavn, Faroe Islands
| | - Thomas D Als
- Department of Biomedicine, Aarhus University, Aarhus, Denmark
- The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, Aarhus, Denmark
- Center for Genomics and Personalized Medicine, Aarhus, Denmark
| | - Noomi O Gregersen
- The Genetic Biobank of the Faroe Islands, Tórshavn, Faroe Islands.
- Centre of Health Science, Faculty of Health, University of the Faroe Islands, Tórshavn, Faroe Islands.
| |
Collapse
|
33
|
Dattani S, Sham PC, Jermy BS, Coleman JRI, Howard DM, Lewis CM. Common and rare variant associations with latent traits underlying depression, bipolar disorder, and schizophrenia. Transl Psychiatry 2023; 13:46. [PMID: 36746926 PMCID: PMC9902570 DOI: 10.1038/s41398-023-02324-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/13/2021] [Revised: 01/07/2023] [Accepted: 01/18/2023] [Indexed: 02/08/2023] Open
Abstract
Genetic studies in psychiatry have primarily focused on the effects of common genetic variants, but few have investigated the role of rare genetic variants, particularly for major depression. In order to explore the role of rare variants in the gap between estimates of single nucleotide polymorphism (SNP) heritability and twin study heritability, we examined the contribution of common and rare genetic variants to latent traits underlying psychiatric disorders using high-quality imputed genotype data from the UK Biobank. Using a pre-registered analysis, we used items from the UK Biobank Mental Health Questionnaire relevant to three psychiatric disorders: major depression (N = 134,463), bipolar disorder (N = 117,376) and schizophrenia (N = 130,013) and identified a general hierarchical factor for each that described participants' responses. We calculated participants' scores on these latent traits and conducted single-variant genetic association testing (MAF > 0.05%), gene-based burden testing and pathway association testing associations with these latent traits. We tested for enrichment of rare variants (MAF 0.05-1%) in genes that had been previously identified by common variant genome-wide association studies, and genes previously associated with Mendelian disorders having relevant symptoms. We found moderate genetic correlations between the latent traits in our study and case-control phenotypes in previous genome-wide association studies, and identified one common genetic variant (rs72657988, minor allele frequency = 8.23%, p = 1.01 × 10-9) associated with the general factor of schizophrenia, but no other single variants, genes or pathways passed significance thresholds in this analysis, and we did not find enrichment in previously identified genes.
Collapse
Affiliation(s)
- Saloni Dattani
- Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology & Neuroscience, King's College London, London, UK.
- Department of Psychiatry, Li Ka Shing (LKS) Faculty of Medicine, University of Hong Kong, Hong Kong SAR, China.
| | - Pak C Sham
- Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology & Neuroscience, King's College London, London, UK
- NIHR Maudsley Biomedical Research Centre, South London and Maudsley NHS Trust, London, UK
- Department of Psychiatry, State Key Laboratory of Brain and Cognitive Sciences, The University of Hong Kong, Hong Kong SAR, China
- Centre for PanorOmic Sciences, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China
| | - Bradley S Jermy
- Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology & Neuroscience, King's College London, London, UK
- NIHR Maudsley Biomedical Research Centre, South London and Maudsley NHS Trust, London, UK
| | - Jonathan R I Coleman
- Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology & Neuroscience, King's College London, London, UK
- NIHR Maudsley Biomedical Research Centre, South London and Maudsley NHS Trust, London, UK
| | - David M Howard
- Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology & Neuroscience, King's College London, London, UK
- Division of Psychiatry, University of Edinburgh, Royal Edinburgh Hospital, Edinburgh, UK
| | - Cathryn M Lewis
- Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology & Neuroscience, King's College London, London, UK
- Department of Medical and Molecular Genetics, Faculty of Life Sciences and Medicine, King's College London, London, UK
| |
Collapse
|
34
|
Abdellaoui A, Yengo L, Verweij KJH, Visscher PM. 15 years of GWAS discovery: Realizing the promise. Am J Hum Genet 2023; 110:179-194. [PMID: 36634672 PMCID: PMC9943775 DOI: 10.1016/j.ajhg.2022.12.011] [Citation(s) in RCA: 154] [Impact Index Per Article: 77.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023] Open
Abstract
It has been 15 years since the advent of the genome-wide association study (GWAS) era. Here, we review how this experimental design has realized its promise by facilitating an impressive range of discoveries with remarkable impact on multiple fields, including population genetics, complex trait genetics, epidemiology, social science, and medicine. We predict that the emergence of large-scale biobanks will continue to expand to more diverse populations and capture more of the allele frequency spectrum through whole-genome sequencing, which will further improve our ability to investigate the causes and consequences of human genetic variation for complex traits and diseases.
Collapse
Affiliation(s)
- Abdel Abdellaoui
- Department of Psychiatry, Amsterdam UMC, University of Amsterdam, Amsterdam, the Netherlands.
| | - Loic Yengo
- Institute for Molecular Bioscience, University of Queensland, Brisbane, QLD, Australia
| | - Karin J H Verweij
- Department of Psychiatry, Amsterdam UMC, University of Amsterdam, Amsterdam, the Netherlands
| | - Peter M Visscher
- Institute for Molecular Bioscience, University of Queensland, Brisbane, QLD, Australia
| |
Collapse
|
35
|
How rare mutations contribute to complex traits. Nature 2023; 614:418-419. [PMID: 36755145 DOI: 10.1038/d41586-023-00272-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/10/2023]
|
36
|
Nickchi P, Karunarathna C, Graham J. An exploration of linkage fine-mapping on sequences from case-control studies. Genet Epidemiol 2023; 47:78-94. [PMID: 36047334 PMCID: PMC10087369 DOI: 10.1002/gepi.22502] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2021] [Revised: 05/30/2022] [Accepted: 08/09/2022] [Indexed: 02/01/2023]
Abstract
Linkage analysis maps genetic loci for a heritable trait by identifying genomic regions with excess relatedness among individuals with similar trait values. Analysis may be conducted on related individuals from families, or on samples of unrelated individuals from a population. For allelically heterogeneous traits, population-based linkage analysis can be more powerful than genotypic-association analysis. Here, we focus on linkage analysis in a population sample, but use sequences rather than individuals as our unit of observation. Earlier investigations of sequence-based linkage mapping relied on known sequence relatedness, whereas we infer relatedness from the sequence data. We propose two ways to associate similarity in relatedness of sequences with similarity in their trait values and compare the resulting linkage methods to two genotypic-association methods. We also introduce a procedure to label case sequences as potential carriers or noncarriers of causal variants after an association has been found. This post hoc labeling of case sequences is based on inferred relatedness to other case sequences. Our simulation results indicate that methods based on sequence relatedness improve localization and perform as well as genotypic-association methods for detecting rare causal variants. Sequence-based linkage analysis therefore has potential to fine-map allelically heterogeneous disease traits.
Collapse
Affiliation(s)
- Payman Nickchi
- Department of Statistics and Actuarial Science, Simon Fraser University, Burnaby, British Columbia, Canada
| | - Charith Karunarathna
- Department of Statistics and Actuarial Science, Simon Fraser University, Burnaby, British Columbia, Canada.,Department of Mathematics and Statistics, Dalhousie University, Halifax, Nova Scotia, Canada
| | - Jinko Graham
- Department of Statistics and Actuarial Science, Simon Fraser University, Burnaby, British Columbia, Canada
| |
Collapse
|
37
|
Ballinger ML, Pattnaik S, Mundra PA, Zaheed M, Rath E, Priestley P, Baber J, Ray-Coquard I, Isambert N, Causeret S, van der Graaf WTA, Puri A, Duffaud F, Le Cesne A, Seddon B, Chandrasekar C, Schiffman JD, Brohl AS, James PA, Kurtz JE, Penel N, Myklebost O, Meza-Zepeda LA, Pickett H, Kansara M, Waddell N, Kondrashova O, Pearson JV, Barbour AP, Li S, Nguyen TL, Fatkin D, Graham RM, Giannoulatou E, Green MJ, Kaplan W, Ravishankar S, Copty J, Powell JE, Cuppen E, van Eijk K, Veldink J, Ahn JH, Kim JE, Randall RL, Tucker K, Judson I, Sarin R, Ludwig T, Genin E, Deleuze JF, Haber M, Marshall G, Cairns MJ, Blay JY, Thomas DM, Tattersall M, Neuhaus S, Lewis C, Tucker K, Carey-Smith R, Wood D, Porceddu S, Dickinson I, Thorne H, James P, Ray-Coquard I, Blay JY, Cassier P, Le Cesne A, Duffaud F, Penel N, Isambert N, Kurtz JE, Puri A, Sarin R, Ahn JH, Kim JE, Ward I, Judson I, van der Graaf W, Seddon B, Chandrasekar C, Rickar R, Hennig I, Schiffman J, Randall RL, Silvestri A, Zaratzian A, Tayao M, Walwyn K, Niedermayr E, Mang D, Clark R, Thorpe T, MacDonald J, Riddell K, Mar J, Fennelly V, Wicht A, Zielony B, Galligan E, Glavich G, Stoeckert J, Williams L, Djandjgava L, Buettner I, Osinki C, Stephens S, Rogasik M, Bouclier L, Girodet M, Charreton A, Fayet Y, Crasto S, Sandupatla B, Yoon Y, Je N, Thompson L, Fowler T, Johnson B, Petrikova G, Hambridge T, Hutchins A, Bottero D, Scanlon D, Stokes-Denson J, Génin E, Campion D, Dartigues JF, Deleuze JF, Lambert JC, Redon R, Ludwig T, Grenier-Boley B, Letort S, Lindenbaum P, Meyer V, Quenez O, Dina C, Bellenguez C, Le Clézio CC, Giemza J, Chatel S, Férec C, Le Marec H, Letenneur L, Nicolas G, Rouault K. Heritable defects in telomere and mitotic function selectively predispose to sarcomas. Science 2023; 379:253-260. [PMID: 36656928 DOI: 10.1126/science.abj4784] [Citation(s) in RCA: 19] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2021] [Accepted: 11/16/2022] [Indexed: 01/20/2023]
Abstract
Cancer genetics has to date focused on epithelial malignancies, identifying multiple histotype-specific pathways underlying cancer susceptibility. Sarcomas are rare malignancies predominantly derived from embryonic mesoderm. To identify pathways specific to mesenchymal cancers, we performed whole-genome germline sequencing on 1644 sporadic cases and 3205 matched healthy elderly controls. Using an extreme phenotype design, a combined rare-variant burden and ontologic analysis identified two sarcoma-specific pathways involved in mitotic and telomere functions. Variants in centrosome genes are linked to malignant peripheral nerve sheath and gastrointestinal stromal tumors, whereas heritable defects in the shelterin complex link susceptibility to sarcoma, melanoma, and thyroid cancers. These studies indicate a specific role for heritable defects in mitotic and telomere biology in risk of sarcomas.
Collapse
Affiliation(s)
- Mandy L Ballinger
- Garvan Institute of Medical Research, Sydney 2010, Australia
- St Vincent's Clinical School, University of New South Wales, Sydney 2010, Australia
| | - Swetansu Pattnaik
- Garvan Institute of Medical Research, Sydney 2010, Australia
- St Vincent's Clinical School, University of New South Wales, Sydney 2010, Australia
| | - Piyushkumar A Mundra
- Garvan Institute of Medical Research, Sydney 2010, Australia
- St Vincent's Clinical School, University of New South Wales, Sydney 2010, Australia
| | - Milita Zaheed
- Hereditary Cancer Centre, Prince of Wales Hospital, Sydney 2031, Australia
| | - Emma Rath
- Garvan Institute of Medical Research, Sydney 2010, Australia
| | - Peter Priestley
- Hartwig Medical Foundation, 1098 XH Amsterdam, Netherlands
- Hartwig Medical Foundation Australia, Sydney 2000, Australia
| | - Jonathan Baber
- Hartwig Medical Foundation, 1098 XH Amsterdam, Netherlands
- Hartwig Medical Foundation Australia, Sydney 2000, Australia
| | - Isabelle Ray-Coquard
- Department of Adult Medical Oncology, Centre Leon Berard, University Claude Bernard, 69373 Lyon, France
| | | | | | | | - Ajay Puri
- Department of Orthopedic Oncology, Tata Memorial Hospital, Mumbai, Maharashtra 400012, India
| | | | | | - Beatrice Seddon
- Sarcoma Unit, University College Hospital, London NW1 2BU, UK
| | | | - Joshua D Schiffman
- Division of Pediatric Hematology/Oncology, Department of Pediatrics, Huntsman Cancer Institute, University of Utah, Salt Lake City, UT 84112, USA
| | - Andrew S Brohl
- Sarcoma Department, Moffitt Cancer Center, Tampa, FL 33612, USA
| | - Paul A James
- The Sir Peter MacCallum Department of Oncology, University of Melbourne, Melbourne 3010, Australia
- Parkville Familial Cancer Centre, Peter MacCallum Cancer Centre and Royal Melbourne Hospital, Melbourne 3000, Australia
| | | | | | - Ola Myklebost
- Western Norway Familial Cancer Centre, Haukeland University Hospital, 5021 Bergen, Norway
- Department of Clinical Science, University of Bergen, 5007 Bergen, Norway
- Institute for Cancer Research, Oslo University Hospital, N-0424 Oslo, Norway
| | | | - Hilda Pickett
- Children's Medical Research Institute, The University of Sydney, Westmead 2145, Australia
| | - Maya Kansara
- Garvan Institute of Medical Research, Sydney 2010, Australia
- St Vincent's Clinical School, University of New South Wales, Sydney 2010, Australia
| | - Nicola Waddell
- QIMR Berghofer Medical Research Institute, Brisbane 4006, Australia
| | - Olga Kondrashova
- QIMR Berghofer Medical Research Institute, Brisbane 4006, Australia
| | - John V Pearson
- QIMR Berghofer Medical Research Institute, Brisbane 4006, Australia
| | - Andrew P Barbour
- Faculty of Medicine. The University of Queensland, Brisbane 4072, Australia
| | - Shuai Li
- Centre for Epidemiology and Biostatistics, Melbourne School of Population and Global Health, Faculty of Medicine, Dentistry and Health Sciences, University of Melbourne, Melbourne 3010, Australia
- Centre for Cancer Genetic Epidemiology, Department of Public Health and Primary Care, University of Cambridge, Cambridge CB1 8RN, UK
- Precision Medicine, School of Clinical Sciences at Monash Health, Monash University, Clayton 3800, Australia
- Murdoch Children's Research Institute, Royal Children's Hospital, Parkville 3051, Australia
| | - Tuong L Nguyen
- Centre for Epidemiology and Biostatistics, Melbourne School of Population and Global Health, Faculty of Medicine, Dentistry and Health Sciences, University of Melbourne, Melbourne 3010, Australia
| | - Diane Fatkin
- St Vincent's Clinical School, University of New South Wales, Sydney 2010, Australia
- Molecular Cardiology Division, Victor Chang Cardiac Research Institute, Darlinghurst 2010, Australia
- Cardiology Department, St Vincent's Hospital, Sydney 2010, Australia
| | - Robert M Graham
- St Vincent's Clinical School, University of New South Wales, Sydney 2010, Australia
- Molecular Cardiology Division, Victor Chang Cardiac Research Institute, Darlinghurst 2010, Australia
| | - Eleni Giannoulatou
- St Vincent's Clinical School, University of New South Wales, Sydney 2010, Australia
- Computational Genomics Division, Victor Chang Cardiac Research Institute, Sydney 2010, Australia
| | - Melissa J Green
- School of Psychiatry, University of New South Wales, Sydney 2052, Australia
- Neuorscience Research Australia, Sydney 2031, Australia
| | - Warren Kaplan
- Garvan Institute of Medical Research, Sydney 2010, Australia
- St Vincent's Clinical School, University of New South Wales, Sydney 2010, Australia
| | | | - Joseph Copty
- Garvan Institute of Medical Research, Sydney 2010, Australia
| | - Joseph E Powell
- Garvan Institute of Medical Research, Sydney 2010, Australia
- UNSW Cellular Genomics Futures Institute, University of New South Wales, Sydney 2052, Australia
| | - Edwin Cuppen
- Hartwig Medical Foundation, 1098 XH Amsterdam, Netherlands
| | - Kristel van Eijk
- Department of Neurology, University Medical Centre Utrecht Brain Center, Utrecht University, 3584 CX Utrecht, Netherlands
| | - Jan Veldink
- Department of Neurology, University Medical Centre Utrecht Brain Center, Utrecht University, 3584 CX Utrecht, Netherlands
| | - Jin-Hee Ahn
- Department of Oncology, Asan Medical Centre, Seoul 05505, South Korea
| | - Jeong Eun Kim
- Department of Oncology, Asan Medical Centre, Seoul 05505, South Korea
| | - R Lor Randall
- Department of Orthopaedic Surgery, University of California, Davis Health, Sacramento, CA 95817, USA
| | - Kathy Tucker
- Hereditary Cancer Centre, Prince of Wales Hospital, Sydney 2031, Australia
| | - Ian Judson
- Sarcoma Unit, The Royal Marsden NHS Foundation Trust, London SW3 6JJ, UK
| | - Rajiv Sarin
- Cancer Genetics Unit, ACTREC, Tata Memorial Centre, Mumbai, Maharashtra 410210, India
| | - Thomas Ludwig
- Université de Brest, Inserm, EFS, UMR 1078, GGB, CHU de Brest, 29200 Brest, France
| | - Emmanuelle Genin
- Université de Brest, Inserm, EFS, UMR 1078, GGB, CHU de Brest, 29200 Brest, France
| | - Jean-Francois Deleuze
- Centre National de Recherche en Génomique Humaine, Institut de Génomique, 91057 Evry, France
| | - Michelle Haber
- Children's Cancer Institute, Lowy Cancer Research Centre, University of New South Wales, Kensington 2033, Australia
| | - Glenn Marshall
- Children's Cancer Institute, Lowy Cancer Research Centre, University of New South Wales, Kensington 2033, Australia
- Kids Cancer Centre, Sydney Children's Hospital, Randwick 2031, Australia
| | - Murray J Cairns
- School of Biomedical Sciences and Pharmacy, The University of Newcastle, Callaghan 2308, Australia
- Centre for Brain and Mental Health Research, The Hunter Medical Research Institute, Newcastle 2305, Australia
| | - Jean-Yves Blay
- Department of Adult Medical Oncology, Centre Leon Berard, University Claude Bernard, 69373 Lyon, France
| | - David M Thomas
- Garvan Institute of Medical Research, Sydney 2010, Australia
- St Vincent's Clinical School, University of New South Wales, Sydney 2010, Australia
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
38
|
Han S, Camp SY, Chu H, Collins R, Gillani R, Park J, Bakouny Z, Ricker CA, Reardon B, Moore N, Kofman E, Labaki C, Braun D, Choueiri TK, AlDubayan SH, Van Allen EM. Integrative Analysis of Germline Rare Variants in Clear and Non-Clear Cell Renal Cell Carcinoma. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.01.18.23284664. [PMID: 36712083 PMCID: PMC9882438 DOI: 10.1101/2023.01.18.23284664] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]
Abstract
IMPORTANCE RCC encompasses a set of histologically distinct cancers with a high estimated genetic heritability, of which only a portion is currently explained. Previous rare germline variant studies in RCC have usually pooled clear and non-clear cell RCCs and have not adequately accounted for population stratification that may significantly impact the interpretation and discovery of certain candidate risk genes. OBJECTIVE To evaluate the enrichment of germline PVs in established cancer-predisposing genes (CPGs) in clear cell and non-clear cell RCC patients compared to cancer-free controls using approaches that account for population stratification and to identify unconventional types of germline RCC risk variants that confer an increased risk of developing RCC. DESIGN SETTING AND PARTICIPANTS In 1,436 unselected RCC patients with sufficient data quality, we systematically identified rare germline PVs, cryptic splice variants, and copy number variants (CNVs). From this unselected cohort, 1,356 patients were ancestry-matched with 16,512 cancer-free controls, and gene-level enrichment of rare germline PVs were assessed in 143 CPGs, followed by an investigation of somatic events in matching tumor samples. MAIN OUTCOMES AND MEASURES Gene-level burden of rare germline PVs, identification of secondary somatic events accompanying the germline PVs, and characterization of less-explored types of rare germline PVs in RCC patients. RESULTS In clear cell RCC (n = 976 patients), patients exhibited significantly higher prevalence of PVs in VHL compared to controls (OR: 39.1, 95% CI: 7.01-218.07, p-value:4.95e-05, q-value:0.00584). In non-clear cell RCC (n = 380 patients), patients carried enriched burden of PVs in FH (OR: 77.9, 95% CI: 18.68-324.97, p-value:1.55e-08, q-value: 1.83e-06) and MET (OR: 1.98e11, 95% CI: 0-inf, p-value: 2.07e-05, q-value: 3.50e-07). In a CHEK2-focused analysis with European cases and controls, clear cell RCC patients (n=906 European patients) harbored nominal enrichment of the previously reported low-penetrance CHEK2 variants, p.Ile157Thr (OR:1.84, 95% CI: 1.00-3.36, p-value:0.049) and p.Ser428Phe (OR:5.20, 95% CI: 1.00-26.40, p-value:0.045) while non-clear cell RCC patients (n=295 European patients) exhibited nominal enrichment of CHEK2 LOF germline PVs (OR: 3.51, 95% CI: 1.10-11.10, p-value: 0.033). RCC patients with germline PVs in FH, MET, and VHL exhibited significantly earlier age of cancer onset compared to patients without any germline PVs in CPGs (Mean: 46.0 vs 60.2 years old, Tukey adjusted p-value < 0.0001), and more than half had secondary somatic events affecting the same gene (n=10/15, 66.7%, 95% CI: 38.7-87.0%). Conversely, patients with rare germline PVs in CHEK2 exhibited a similar age of disease onset to patients without any identified germline PVs in CPGs (Mean: 60.1 vs 60.2 years old, Tukey adjusted p-value: 0.99), and only 30.4% of the patients carried secondary somatic events in CHEK2 (n=7/23, 95% CI: 14.1-53.0%). Finally, rare pathogenic germline cryptic splice variants underexplored in RCC were identified in SDHA and TSC1, and rare pathogenic germline CNVs were found in 18 patients, including CNVs in FH, SDHA, and VHL. CONCLUSIONS AND RELEVANCE This systematic analysis supports the existing link between several RCC risk genes and elevated RCC risk manifesting in earlier age of RCC onset. Our analysis calls for caution when assessing the role of germline PVs in CHEK2 due to the burden of founder variants with varying population frequency in different ancestry groups. It also broadens the definition of the RCC germline landscape of pathogenicity to incorporate previously understudied types of germline variants, such as cryptic splice variants and CNVs.
Collapse
Affiliation(s)
- Seunghun Han
- Ph.D. Program in Biological and Biomedical Sciences, Harvard Medical School, Boston, MA, USA
- Department of Medical Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
- Cancer Program, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Sabrina Y. Camp
- Department of Medical Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
- Cancer Program, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Hoyin Chu
- Department of Medical Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
- Cancer Program, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Ryan Collins
- Department of Medical Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
- Cancer Program, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Medicine, Harvard Medical School, Boston, MA, USA
| | - Riaz Gillani
- Cancer Program, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Pediatric Oncology, Dana-Farber Cancer Institute, Boston, MA, USA
- Department of Pediatrics, Harvard Medical School, Boston, MA, USA
- Boston Children’s Hospital, Boston, MA, USA
| | - Jihye Park
- Department of Medical Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
- Cancer Program, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Ziad Bakouny
- Department of Medical Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
- Cancer Program, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Cora A. Ricker
- Department of Medical Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
- Cancer Program, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Brendan Reardon
- Department of Medical Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
- Cancer Program, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Nicholas Moore
- Department of Therapeutic Radiology, Yale School of Medicine, New Haven, CT, USA
| | - Eric Kofman
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA, USA
| | - Chris Labaki
- Department of Medical Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
| | - David Braun
- Center of Molecular and Cellular Oncology, Yale School of Medicine, New Haven, CT, USA
| | - Toni K. Choueiri
- Lank Center for Genitourinary Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
- Brigham and Women’s Hospital, Boston, MA, USA
| | - Saud H. AlDubayan
- Department of Medical Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
- Cancer Program, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Division of Genetics, Brigham and Women’s Hospital, Boston, MA, USA
- College of Medicine, King Saud bin Abdulaziz University for Health Sciences, Riyadh, Saudi Arabia
| | - Eliezer M. Van Allen
- Department of Medical Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
- Cancer Program, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Center for Cancer Genomics, Dana-Farber Cancer Institute, Boston, MA, USA
| |
Collapse
|
39
|
Novembre J, Stein C, Asgari S, Gonzaga-Jauregui C, Landstrom A, Lemke A, Li J, Mighton C, Taylor M, Tishkoff S. Addressing the challenges of polygenic scores in human genetic research. Am J Hum Genet 2022; 109:2095-2100. [PMID: 36459976 PMCID: PMC9808501 DOI: 10.1016/j.ajhg.2022.10.012] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/04/2022] Open
Abstract
The genotyping of millions of human samples has made it possible to evaluate variants across the human genome for their possible association with risks for numerous diseases and other traits by using genome-wide association studies (GWASs). The associations between phenotype and genotype found in GWASs make possible the construction of polygenic scores (PGSs), which aim to predict a trait or disease outcome in an individual on the basis of their genotype (in the disease case, the term polygenic risk score [PRS] is often used). PGSs have shown promise for studying the biology of complex traits and as a tool for evaluating individual disease risks in clinical settings. Although the quantity and quality of data to compute PGSs are increasing, challenges remain in the technical aspects of developing PGSs and in the ethical and social issues that might arise from their use. This ASHG Guidance emphasizes three major themes for researchers working with or interested in the application of PGSs in their own research: (1) developing diverse research cohorts; (2) fostering robustness in the development, application, and interpretation of PGSs; and (3) improving the communication of PGS results and their implications to broad audiences.
Collapse
Affiliation(s)
- John Novembre
- Professional Practice and Social Implications Committee Polygenic Scores Guidance Writing Group, American Society of Human Genetics, Rockville MD, USA,Department of Human Genetics, University of Chicago, Chicago, IL, USA,Department of Ecology and Evolution, University of Chicago, Chicago, IL, USA,Corresponding author
| | - Catherine Stein
- Professional Practice and Social Implications Committee Polygenic Scores Guidance Writing Group, American Society of Human Genetics, Rockville MD, USA,Department of Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, OH, USA,Corresponding author
| | - Samira Asgari
- Professional Practice and Social Implications Committee Polygenic Scores Guidance Writing Group, American Society of Human Genetics, Rockville MD, USA,Institute for Genomic Health, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Claudia Gonzaga-Jauregui
- Professional Practice and Social Implications Committee Polygenic Scores Guidance Writing Group, American Society of Human Genetics, Rockville MD, USA,International Laboratory for Human Genome Research, Laboratorio Internacional de Investigación sobre el Genoma Humano, Universidad Nacional Autónoma de México, Juriquilla, Querétaro, México
| | - Andrew Landstrom
- Professional Practice and Social Implications Committee Polygenic Scores Guidance Writing Group, American Society of Human Genetics, Rockville MD, USA,Department of Pediatrics, Division of Cardiology, Duke University School of Medicine, Durham, NC, USA
| | - Amy Lemke
- Professional Practice and Social Implications Committee Polygenic Scores Guidance Writing Group, American Society of Human Genetics, Rockville MD, USA,Norton Children’s Research Institute, affiliated with the University of Louisville School of Medicine, Louisville, KY, USA
| | - Jun Li
- Professional Practice and Social Implications Committee Polygenic Scores Guidance Writing Group, American Society of Human Genetics, Rockville MD, USA,Department of Human Genetics, University of Michigan, Ann Arbor, MI, USA
| | - Chloe Mighton
- Professional Practice and Social Implications Committee Polygenic Scores Guidance Writing Group, American Society of Human Genetics, Rockville MD, USA,Genomics Health Services Research Program, St. Michael’s Hospital, Unity Health Toronto, Toronto, ON, Canada,Institute of Health Policy, Management and Evaluation, Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada
| | - Matthew Taylor
- Professional Practice and Social Implications Committee Polygenic Scores Guidance Writing Group, American Society of Human Genetics, Rockville MD, USA,Adult Medical Genetics Program, Department of Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Sarah Tishkoff
- Professional Practice and Social Implications Committee Polygenic Scores Guidance Writing Group, American Society of Human Genetics, Rockville MD, USA,Department of Genetics, Center for Global Genomics and Health Equity, University of Pennsylvania, Philadelphia, PA, USA,Department of Biology, Center for Global Genomics and Health Equity, University of Pennsylvania, Philadelphia, PA, USA
| |
Collapse
|
40
|
Yan S, Sha Q, Zhang S. Control for population stratification in genetic association studies based on GWAS summary statistics. Genet Epidemiol 2022; 46:604-614. [PMID: 35766057 DOI: 10.1002/gepi.22493] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2021] [Revised: 05/20/2022] [Accepted: 05/25/2022] [Indexed: 11/11/2022]
Abstract
Over the past years, genome-wide association studies (GWAS) have generated a wealth of new information. Summary data from many GWAS are now publicly available, promoting the development of many statistical methods for association studies based on GWAS summary statistics, which avoids the increasing challenges associated with individual-level genotype and phenotype data sharing. However, for population-based association studies such as GWAS, it has been long recognized that population stratification can seriously confound association results. For large GWAS, it is very likely that there exist population stratification and cryptic relatedness, which will result in inflated Type I error in association testing. Although many methods have been developed to control for population stratification, only two of these approaches can be used to control population stratification without individual-level data: one is based on genomic control (GC) and the other one is based on linkage disequilibrium score regression (LDSC). However, the performance of these two approaches is currently unknown. In this study, we use extensive simulation studies including populations with subpopulations, spatially structured populations, and populations with cryptic relatedness to compare the performance of these two approaches to control for population stratification using only GWAS summary statistics without individual-level data. Data sets from the genetic analysis workshop 19 and UK Biobank are also used to evaluate these two approaches. We demonstrate that the intercept of LDSC can be used as a more accurate correction factor than GC. The results from this study will provide very useful information for researchers using GWAS summary statistics while trying to control for population stratification.
Collapse
Affiliation(s)
- Shijia Yan
- Department of Mathematical Sciences, Michigan Technological University, Houghton, Michigan, USA
| | - Qiuying Sha
- Department of Mathematical Sciences, Michigan Technological University, Houghton, Michigan, USA
| | - Shuanglin Zhang
- Department of Mathematical Sciences, Michigan Technological University, Houghton, Michigan, USA
| |
Collapse
|
41
|
Butler-Laporte G, Povysil G, Kosmicki JA, Cirulli ET, Drivas T, Furini S, Saad C, Schmidt A, Olszewski P, Korotko U, Quinodoz M, Çelik E, Kundu K, Walter K, Jung J, Stockwell AD, Sloofman LG, Jordan DM, Thompson RC, Del Valle D, Simons N, Cheng E, Sebra R, Schadt EE, Kim-Schulze S, Gnjatic S, Merad M, Buxbaum JD, Beckmann ND, Charney AW, Przychodzen B, Chang T, Pottinger TD, Shang N, Brand F, Fava F, Mari F, Chwialkowska K, Niemira M, Pula S, Baillie JK, Stuckey A, Salas A, Bello X, Pardo-Seco J, Gómez-Carballa A, Rivero-Calle I, Martinón-Torres F, Ganna A, Karczewski KJ, Veerapen K, Bourgey M, Bourque G, Eveleigh RJM, Forgetta V, Morrison D, Langlais D, Lathrop M, Mooser V, Nakanishi T, Frithiof R, Hultström M, Lipcsey M, Marincevic-Zuniga Y, Nordlund J, Schiabor Barrett KM, Lee W, Bolze A, White S, Riffle S, Tanudjaja F, Sandoval E, Neveux I, Dabe S, Casadei N, Motameny S, Alaamery M, Massadeh S, Aljawini N, Almutairi MS, Arabi YM, Alqahtani SA, Al Harthi FS, Almutairi A, Alqubaishi F, Alotaibi S, Binowayn A, Alsolm EA, El Bardisy H, Fawzy M, Cai F, Soranzo N, Butterworth A, Geschwind DH, Arteaga S, Stephens A, Butte MJ, Boutros PC, Yamaguchi TN, Tao S, Eng S, Sanders T, Tung PJ, Broudy ME, Pan Y, Gonzalez A, Chavan N, Johnson R, Pasaniuc B, Yaspan B, Smieszek S, Rivolta C, Bibert S, Bochud PY, Dabrowski M, Zawadzki P, Sypniewski M, Kaja E, Chariyavilaskul P, Nilaratanakul V, Hirankarn N, Shotelersuk V, Pongpanich M, Phokaew C, Chetruengchai W, Tokunaga K, Sugiyama M, Kawai Y, Hasegawa T, Naito T, Namkoong H, Edahiro R, Kimura A, Ogawa S, Kanai T, Fukunaga K, Okada Y, Imoto S, Miyano S, Mangul S, Abedalthagafi MS, Zeberg H, Grzymski JJ, Washington NL, Ossowski S, Ludwig KU, Schulte EC, Riess O, Moniuszko M, Kwasniewski M, Mbarek H, Ismail SI, Verma A, Goldstein DB, Kiryluk K, Renieri A, Ferreira MAR, Richards JB. Exome-wide association study to identify rare variants influencing COVID-19 outcomes: Results from the Host Genetics Initiative. PLoS Genet 2022; 18:e1010367. [PMID: 36327219 PMCID: PMC9632827 DOI: 10.1371/journal.pgen.1010367] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2022] [Accepted: 07/29/2022] [Indexed: 11/05/2022] Open
Abstract
Host genetics is a key determinant of COVID-19 outcomes. Previously, the COVID-19 Host Genetics Initiative genome-wide association study used common variants to identify multiple loci associated with COVID-19 outcomes. However, variants with the largest impact on COVID-19 outcomes are expected to be rare in the population. Hence, studying rare variants may provide additional insights into disease susceptibility and pathogenesis, thereby informing therapeutics development. Here, we combined whole-exome and whole-genome sequencing from 21 cohorts across 12 countries and performed rare variant exome-wide burden analyses for COVID-19 outcomes. In an analysis of 5,085 severe disease cases and 571,737 controls, we observed that carrying a rare deleterious variant in the SARS-CoV-2 sensor toll-like receptor TLR7 (on chromosome X) was associated with a 5.3-fold increase in severe disease (95% CI: 2.75-10.05, p = 5.41x10-7). This association was consistent across sexes. These results further support TLR7 as a genetic determinant of severe disease and suggest that larger studies on rare variants influencing COVID-19 outcomes could provide additional insights.
Collapse
Grants
- T32 HL144442 NHLBI NIH HHS
- U24 CA224319 NCI NIH HHS
- RG/13/13/30194 British Heart Foundation
- C18281/A29019 Cancer Research UK
- MC_PC_20004 Medical Research Council
- UL1 TR001873 NCATS NIH HHS
- RG/18/13/33946 British Heart Foundation
- CH/12/2/29428 British Heart Foundation
- CanCOGeN HostSeq
- Fonds de Recherche Québec Santé (FRQS)
- Génome Québec
- Public Health Agency of Canada
- Canadian Institutes of Health Research (CIHR)
- Lady Davis Institute of the Jewish General Hospital
- Canadian Foundation for Innovation
- NIH Foundation
- McGill Interdisciplinary Initiative in Infection and Immunity (MI4)
- Jewish General Hospital Foundation
- McGill University
- Calcul Québec and Compute Canada
- Compute Canada
- Vagelos College of Physicians & Surgeons Office for Research
- Biomedical Informatics Resource of the Columbia University Irving Institute for Clinical and Translational Research (CTSA)
- National Center for Advancing Translational Sciences, National Institutes of Health
- German Research Foundation
- NGS Competence Center Tübingen
- West German Genome Center
- Stiftung Universitätsmedizin Essen
- Technical University of Munich
- BONFOR program of the Medical Faculty, University of Bonn
- Emmy-Noether programm of the German Research Foundation
- State of Saarland
- Dr. Rolf M. Schwiete Foundation
- Munich Clinician Scientist Programm
- Netzwerk-Universitaetsmedizin-COVIM
- Federal Ministry of Education and Research
- Swiss National Science Foundation
- Leenaards Foundation
- Santos-Suarez Foundation
- Carigest
- MIUR project “Dipartimenti di Eccellenza 2018-2020”
- Bando Ricerca COVID-19 Toscana
- charity fund 2020 from Intesa San Paolo
- Italian Ministry of University and Research
- Istituto Buddista Italiano Soka Gakkai
- Instituto de Salud Carlos III
- GePEM
- DIAVIR
- Resvi-Omics
- ReSVinext
- Enterogen
- Agencia Gallega para la Gestión del Conocimiento en Salud
- BI-BACVIR
- CovidPhy
- Agencia Gallega de Innovación (GAIN):
- GEN-COVID
- Framework Partnership Agreement between the Consellería de Sanidad de la XUNTA de Galicia
- GENVIP-IDIS
- consorcio Centro de Investigación Biomédica en Red de Enfermedades Respiratorias
- F. Hoffmann-La Roche Ltd
- U.S. Department of Health and Human Services, Office of the Assistant Secretary for Preparedness and Response, and Biomedical Advanced Research and Development Authority
- Nevada Governor's Office of Economic Development
- Renown Health and the Renown Health Foundation
- Ratchadapiseksompotch Fund, Faculty of Medicine, Chulalongkorn University
- Healthcare-associated Infection Research Group STAR (Special Task Force for Activating Research)
- Grant for Development of New Faculty Staff, Ratchadaphiseksomphot Endowment Fund
- e-ASIA Joint Research Program (National Science and Technology Development Agency)
- Health Systems Research Institute, TSRI Fund
- Thailand Research Fund
- Ratchadapiseksompotch Fund
- Ratchadapiseksompotch Fund, Faculty of Medicine,Chulalongkorn University, Bangkok, Thailand
- Health Systems Research Institute
- Ratchadapisek Sompoch Endowment Fund, Chulalongkorn University
- NHS Blood and Transplant
- National Institute for Health Research
- UK Medical Research Council
- Japan Agency for Medical Research and Development
- Japan Science and Technology Agency
- National Center for Global Health and Medicine
- Agency for Medical Research and Development
- Polish National Science Centre
- Medical Research Agency
- Perelman School of Medicine at University of Pennsylvania
- Smilow family
- National Center for Advancing Translational Sciences of the National Institutes of Health
- Polish Medical Research Agency
- Qatar Foundation for Education, Science and Community Development
- Saudi Ministry of Health
- King Abdulaziz City for Science and Technology
- European Union’s Horizon 2020 research and innovation program
- Science for Life Laboratory
- Swedish Research Council
- Knut and Alice Wallenberg Foundation
- OCRC
- Microsoft COVID Compute Funding
- Illumina
- UCLA David Geffen School of Medicine - Eli and Edythe Broad Center of Regenerative Medicine and Stem Cell Research Award Program
Collapse
Affiliation(s)
- Guillaume Butler-Laporte
- Department of Epidemiology, Biostatistics and Occupational Health, McGill University, Montréal, Québec, Canada
- Lady Davis Institute, Jewish General Hospital, McGill University, Montréal, Québec, Canada
| | - Gundula Povysil
- Institute for Genomic Medicine, Columbia University, New York city, New York, United States of America
| | - Jack A. Kosmicki
- Regeneron Genetics Center, Tarrytown, New York, United States of America
| | | | - Theodore Drivas
- Division of Human Genetics, Department of Pediatrics, Children’s Hospital of Philadelphia, Philadelphia, Pennsylvania, United States of America
- Division of Translational Medicine and Human Genetics, Department of Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
| | - Simone Furini
- Department of Medical Biotechnologies, Med Biotech Hub and Competence Center, University of Siena, Siena, Italy
| | - Chadi Saad
- Qatar Genome Program, Qatar Foundation Research, Development and Innovation, Qatar Foundation, Doha, Qatar
| | - Axel Schmidt
- Institute of Human Genetics, School of Medicine and University Hospital Bonn, University of Bonn, Bonn, Germany
| | | | - Urszula Korotko
- IMAGENE.ME SA, Bialystok, Poland
- Centre for Bioinformatics and Data Analysis, Medical University of Bialystok, Bialystok, Poland
| | - Mathieu Quinodoz
- Institute of Molecular and Clinical Ophthalmology Basel (IOB), Basel, Switzerland
- Department of Genetics and Genome Biology, University of Leicester, Leicester, United Kingdom
- Department of Ophthalmology, University Hospital Basel, Basel, Switzerland
| | - Elifnaz Çelik
- Institute of Molecular and Clinical Ophthalmology Basel (IOB), Basel, Switzerland
- Department of Ophthalmology, University Hospital Basel, Basel, Switzerland
| | - Kousik Kundu
- Department of Haematology, University of Cambridge, Cambridge, United Kingdom
- Department of Human Genetics, Wellcome Sanger Institute, Hinxton, United Kingdom
| | - Klaudia Walter
- Department of Human Genetics, Wellcome Sanger Institute, Hinxton, United Kingdom
| | - Junghyun Jung
- Department of Clinical Pharmacy, School of Pharmacy, University of Southern California, Los Angeles, California, United States of America
- Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, California, United States of America
| | - Amy D. Stockwell
- Genentech Inc, South San Francisco, California, United States of America
| | - Laura G. Sloofman
- Seaver Autism Center for Research and Treatment, Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York city, New York, United States of America
| | - Daniel M. Jordan
- Mount Sinai Clnical Intelligence Center, Charles Bronfman Institute for Personalized Medicine, Department of Genetics & Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York city, New York, United States of America
| | - Ryan C. Thompson
- Icahn Institute of Data Science and Genomics Technology, New York city, New York, United States of America
| | - Diane Del Valle
- Icahn School of Medicine at Mount Sinai, New York city, New York, United States of America
| | - Nicole Simons
- Icahn School of Medicine at Mount Sinai, New York city, New York, United States of America
| | - Esther Cheng
- Icahn School of Medicine at Mount Sinai, New York city, New York, United States of America
| | - Robert Sebra
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York city,New York, United States of America
| | - Eric E. Schadt
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York city,New York, United States of America
| | - Seunghee Kim-Schulze
- Department of Oncological Science, Human Immune Monitoring Center, Precision Immunology Institute, Icahn School of Medicine at Mount Sinai, New York city, New York, United States of America
| | - Sacha Gnjatic
- Icahn School of Medicine at Mount Sinai, New York city, New York, United States of America
| | - Miriam Merad
- Precision Immunology Institute, Tisch Cancer Institute, Icahn School of Medicine at Mount Sinai, New York city, New York, United States of America
| | - Joseph D. Buxbaum
- Seaver Autism Center for Research and Treatment, Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York city, New York, United States of America
| | - Noam D. Beckmann
- Precision Immunology Institute, Tisch Cancer Institute, Icahn School of Medicine at Mount Sinai, New York city, New York, United States of America
| | - Alexander W. Charney
- Mount Sinai Clinical Intelligence Center; Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York city, New York, United States of America
| | | | - Timothy Chang
- Department of Neurology, David Geffen School of Medicine, University of California—Los Angeles, Los Angeles, California, United States of America
| | - Tess D. Pottinger
- Institute for Genomic Medicine, Columbia University, New York city, New York, United States of America
| | - Ning Shang
- Division of Nephrology, Department of Medicine, Vagelos College of Physicians & Surgeons, Columbia University, New York city, New York, United States of America
| | - Fabian Brand
- Institute of Genomic Statistics and Bioinformatics, School of Medicine and University Hospital Bonn, University of Bonn, Bonn, Germany
| | - Francesca Fava
- Department of Medical Biotechnologies, Med Biotech Hub and Competence Center, University of Siena, Siena, Italy
- Genetica Medica, Azienda Ospedaliero-Universitaria Senese, Siena, Italy
- Medical Genetics, University of Siena, Siena, Italy
| | - Francesca Mari
- Department of Medical Biotechnologies, Med Biotech Hub and Competence Center, University of Siena, Siena, Italy
- Genetica Medica, Azienda Ospedaliero-Universitaria Senese, Siena, Italy
- Medical Genetics, University of Siena, Siena, Italy
| | - Karolina Chwialkowska
- IMAGENE.ME SA, Bialystok, Poland
- Centre for Bioinformatics and Data Analysis, Medical University of Bialystok, Bialystok, Poland
| | - Magdalena Niemira
- Centre for Clinical Research, Medical University of Bialystok, Bialystok, Poland
| | | | - J Kenneth Baillie
- Roslin Institute, University of Edinburgh, Edinburgh, United Kingdom
- MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, University of Edinburgh, Western General Hospital, Edinburgh, United Kingdom
- Centre for Inflammation Research, The Queen’s Medical Research Institute, University of Edinburgh, Edinburgh, United Kingdom
- Intensive Care Unit, Royal Infirmary of Edinburgh, Edinburgh, United Kingdom
| | | | - Antonio Salas
- Unidade de Xenética, Instituto de Ciencias Forenses (INCIFOR), Facultade de Medicina, Universidade de Santiago de Compostela, and GenPoB Research Group, Instituto de Investigaciones Sanitarias, Hospital Clínico Universitario de Santiago (SERGAS), Santiago de Compostela, Galicia, Spain
- Genetics, Vaccines and Infections Research Group (GENVIP), Instituto de Investigación Sanitaria de Santiago, Santiago de Compostela, Spain
- Centro de Investigación Biomédica en Red de Enfermedades Respiratorias (CIBER-ES), Madrid, Spain
| | - Xabier Bello
- Unidade de Xenética, Instituto de Ciencias Forenses (INCIFOR), Facultade de Medicina, Universidade de Santiago de Compostela, and GenPoB Research Group, Instituto de Investigaciones Sanitarias, Hospital Clínico Universitario de Santiago (SERGAS), Santiago de Compostela, Galicia, Spain
- Genetics, Vaccines and Infections Research Group (GENVIP), Instituto de Investigación Sanitaria de Santiago, Santiago de Compostela, Spain
- Centro de Investigación Biomédica en Red de Enfermedades Respiratorias (CIBER-ES), Madrid, Spain
| | - Jacobo Pardo-Seco
- Unidade de Xenética, Instituto de Ciencias Forenses (INCIFOR), Facultade de Medicina, Universidade de Santiago de Compostela, and GenPoB Research Group, Instituto de Investigaciones Sanitarias, Hospital Clínico Universitario de Santiago (SERGAS), Santiago de Compostela, Galicia, Spain
- Genetics, Vaccines and Infections Research Group (GENVIP), Instituto de Investigación Sanitaria de Santiago, Santiago de Compostela, Spain
- Centro de Investigación Biomédica en Red de Enfermedades Respiratorias (CIBER-ES), Madrid, Spain
| | - Alberto Gómez-Carballa
- Unidade de Xenética, Instituto de Ciencias Forenses (INCIFOR), Facultade de Medicina, Universidade de Santiago de Compostela, and GenPoB Research Group, Instituto de Investigaciones Sanitarias, Hospital Clínico Universitario de Santiago (SERGAS), Santiago de Compostela, Galicia, Spain
- Genetics, Vaccines and Infections Research Group (GENVIP), Instituto de Investigación Sanitaria de Santiago, Santiago de Compostela, Spain
- Centro de Investigación Biomédica en Red de Enfermedades Respiratorias (CIBER-ES), Madrid, Spain
| | - Irene Rivero-Calle
- Genetics, Vaccines and Infections Research Group (GENVIP), Instituto de Investigación Sanitaria de Santiago, Santiago de Compostela, Spain
- Centro de Investigación Biomédica en Red de Enfermedades Respiratorias (CIBER-ES), Madrid, Spain
- Translational Pediatrics and Infectious Diseases, Department of Pediatrics, Hospital Clínico Universitario de Santiago de Compostela, Santiago de Compostela, Spain
| | - Federico Martinón-Torres
- Genetics, Vaccines and Infections Research Group (GENVIP), Instituto de Investigación Sanitaria de Santiago, Santiago de Compostela, Spain
- Centro de Investigación Biomédica en Red de Enfermedades Respiratorias (CIBER-ES), Madrid, Spain
- Translational Pediatrics and Infectious Diseases, Department of Pediatrics, Hospital Clínico Universitario de Santiago de Compostela, Santiago de Compostela, Spain
| | - Andrea Ganna
- Institute for Molecular Medicine Finland, Helsinki Institute of Life Science, University of Helsinki, Helsinki, Finland
- Massachusetts General Hospital, Harvard Medical School, Boston, Massachussets, United States of America
| | - Konrad J. Karczewski
- Stanley Center for Psychiatric Genetics, Broad Institute of Harvard and MIT, Cambridge, Massachusetts, United States of America
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, Massachusetts, United States of America
| | - Kumar Veerapen
- Stanley Center for Psychiatric Genetics, Broad Institute of Harvard and MIT, Cambridge, Massachusetts, United States of America
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, Massachusetts, United States of America
| | - Mathieu Bourgey
- Canadian Centre for Computational Genomics, McGill University, Montréal, Québec, Canada
- McGill Genome Center, McGill University, Montréal, Québec, Canada
| | - Guillaume Bourque
- Canadian Centre for Computational Genomics, McGill University, Montréal, Québec, Canada
- McGill Genome Center, McGill University, Montréal, Québec, Canada
- Department of Human Genetics, McGill University, Montréal, Québec, Canada
| | - Robert JM Eveleigh
- Canadian Centre for Computational Genomics, McGill University, Montréal, Québec, Canada
- McGill Genome Center, McGill University, Montréal, Québec, Canada
| | - Vincenzo Forgetta
- Lady Davis Institute, Jewish General Hospital, McGill University, Montréal, Québec, Canada
| | - David Morrison
- Lady Davis Institute, Jewish General Hospital, McGill University, Montréal, Québec, Canada
| | - David Langlais
- McGill Genome Center, McGill University, Montréal, Québec, Canada
- Department of Human Genetics, McGill University, Montréal, Québec, Canada
| | - Mark Lathrop
- McGill Genome Center, McGill University, Montréal, Québec, Canada
- Department of Human Genetics, McGill University, Montréal, Québec, Canada
| | - Vincent Mooser
- Department of Human Genetics, McGill University, Montréal, Québec, Canada
| | - Tomoko Nakanishi
- Lady Davis Institute, Jewish General Hospital, McGill University, Montréal, Québec, Canada
- Department of Human Genetics, McGill University, Montréal, Québec, Canada
- Kyoto-McGill International Collaborative School in Genomic Medicine, Graduate School of Medicine, Kyoto University, Kyoto, Japan
- Research Fellow, Japan Society for the Promotion of Science, Tokyo, Japan
| | - Robert Frithiof
- Anaesthesiology and Intensive Care Medicine, Department of Surgical Sciences, Uppsala University, Uppsala, Sweden
| | - Michael Hultström
- Department of Epidemiology, Biostatistics and Occupational Health, McGill University, Montréal, Québec, Canada
- Lady Davis Institute, Jewish General Hospital, McGill University, Montréal, Québec, Canada
- Anaesthesiology and Intensive Care Medicine, Department of Surgical Sciences, Uppsala University, Uppsala, Sweden
- Integrative Physiology, Department of Medical Cell Biology, Uppsala University, Uppsala, Sweden
| | - Miklos Lipcsey
- Anaesthesiology and Intensive Care Medicine, Department of Surgical Sciences, Uppsala University, Uppsala, Sweden
- Hedenstierna Laboratory, CIRRUS, Anaesthesiology and Intensive Care Medicine, Department of Surgical Sciences, Uppsala University, Uppsala, Sweden
| | - Yanara Marincevic-Zuniga
- Department of Medical Sciences, Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| | - Jessica Nordlund
- Department of Medical Sciences, Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| | | | - William Lee
- Helix, San Mateo, California, United States of America
| | | | - Simon White
- Helix, San Mateo, California, United States of America
| | | | | | | | - Iva Neveux
- Center for Genomic Medicine, Desert Research Institute, Reno, Nevada United States of America
| | - Shaun Dabe
- Renown Health, Reno, Nevada, United States of America
| | - Nicolas Casadei
- Institute of Medical Genetics and Applied Genomics, University of Tuebingen, Tuebingen, Germany
- NGS Competence Center Tuebingen, Institute of Medical Genetics and Applied Genomics, University of Tuebingen, Tuebingen, Germany
| | - Susanne Motameny
- West German Genome Center, site Cologne, University of Cologne, Cologne, Germany
- Cologne Center for Genomics, University of Cologne, Cologne, Germany
| | - Manal Alaamery
- Developmental Medicine Department, King Abdullah International Medical Research Center, King Saud Bin Abdulaziz University for Health Sciences, King Abdulaziz Medical City, Ministry of National Guard Health Affairs, Riyadh, Saudi Arabia
- Saudi Human Genome Project at King Abdulaziz City for Science and Technology, Riyadh, Saudi Arabia
| | - Salam Massadeh
- Developmental Medicine Department, King Abdullah International Medical Research Center, King Saud Bin Abdulaziz University for Health Sciences, King Abdulaziz Medical City, Ministry of National Guard Health Affairs, Riyadh, Saudi Arabia
- Saudi Human Genome Project at King Abdulaziz City for Science and Technology, Riyadh, Saudi Arabia
| | - Nora Aljawini
- Developmental Medicine Department, King Abdullah International Medical Research Center, King Saud Bin Abdulaziz University for Health Sciences, King Abdulaziz Medical City, Ministry of National Guard Health Affairs, Riyadh, Saudi Arabia
- Saudi Human Genome Project at King Abdulaziz City for Science and Technology, Riyadh, Saudi Arabia
| | - Mansour S. Almutairi
- Developmental Medicine Department, King Abdullah International Medical Research Center, King Saud Bin Abdulaziz University for Health Sciences, King Abdulaziz Medical City, Ministry of National Guard Health Affairs, Riyadh, Saudi Arabia
- Saudi Human Genome Project at King Abdulaziz City for Science and Technology, Riyadh, Saudi Arabia
| | - Yaseen M. Arabi
- Ministry of the National Guard Health Affairs, King Abdullah International Medical Research Center and King Saud Bin Abdulaziz University for Health Sciences, Riyadh, Saudi Arabia
| | - Saleh A. Alqahtani
- Liver Transplant Unit, King Faisal Specialist Hospital and Research Centre, Riyadh, Saudi Arabia
- Division of Gastroenterology and Hepatology, Johns Hopkins University, Baltimore, Maryland, United States of America
| | - Fawz S. Al Harthi
- Genomics Research Department, Saudi Human Genome Project, King Fahad Medical City and King Abdulaziz City for Science and Technology, Riyadh, Saudi Arabia
| | - Amal Almutairi
- Genomics Research Department, Saudi Human Genome Project, King Fahad Medical City and King Abdulaziz City for Science and Technology, Riyadh, Saudi Arabia
| | - Fatima Alqubaishi
- Genomics Research Department, Saudi Human Genome Project, King Fahad Medical City and King Abdulaziz City for Science and Technology, Riyadh, Saudi Arabia
| | - Sarah Alotaibi
- Genomics Research Department, Saudi Human Genome Project, King Fahad Medical City and King Abdulaziz City for Science and Technology, Riyadh, Saudi Arabia
| | - Albandari Binowayn
- Genomics Research Department, Saudi Human Genome Project, King Fahad Medical City and King Abdulaziz City for Science and Technology, Riyadh, Saudi Arabia
| | - Ebtehal A. Alsolm
- Genomics Research Department, Saudi Human Genome Project, King Fahad Medical City and King Abdulaziz City for Science and Technology, Riyadh, Saudi Arabia
| | - Hadeel El Bardisy
- Genomics Research Department, Saudi Human Genome Project, King Fahad Medical City and King Abdulaziz City for Science and Technology, Riyadh, Saudi Arabia
| | - Mohammad Fawzy
- Genomics Research Department, Saudi Human Genome Project, King Fahad Medical City and King Abdulaziz City for Science and Technology, Riyadh, Saudi Arabia
| | - Fang Cai
- Genentech Inc, South San Francisco, California, United States of America
| | - Nicole Soranzo
- Department of Human Genetics, Wellcome Sanger Institute, Hinxton, United Kingdom
| | - Adam Butterworth
- Department of Human Genetics, Wellcome Sanger Institute, Hinxton, United Kingdom
| | | | | | | | | | | | | | | | | | - Daniel H. Geschwind
- Department of Neurology, David Geffen School of Medicine, University of California—Los Angeles, Los Angeles, California, United States of America
| | - Stephanie Arteaga
- Department of Neurology, David Geffen School of Medicine, University of California—Los Angeles, Los Angeles, California, United States of America
| | - Alexis Stephens
- Department of Pediatrics, David Geffen School of Medicine, University of California—Los Angeles, Los Angeles, California, United States of America
| | - Manish J. Butte
- Department of Pediatrics, David Geffen School of Medicine, University of California—Los Angeles, Los Angeles, California, United States of America
- Department of Microbiology, Immunology, and Molecular Genetics (MIMG), David Geffen School of Medicine, University of California—Los Angeles, Los Angeles, California, United States of America
| | - Paul C. Boutros
- Department of Human Genetics, David Geffen School of Medicine, University of California—Los Angeles, Los Angeles, California, United States of America
| | - Takafumi N. Yamaguchi
- Department of Human Genetics, David Geffen School of Medicine, University of California—Los Angeles, Los Angeles, California, United States of America
| | - Shu Tao
- Department of Human Genetics, David Geffen School of Medicine, University of California—Los Angeles, Los Angeles, California, United States of America
| | - Stefan Eng
- Department of Human Genetics, David Geffen School of Medicine, University of California—Los Angeles, Los Angeles, California, United States of America
| | - Timothy Sanders
- Office of Health Informatics and Analytics, David Geffen School of Medicine, University of California—Los Angeles, Los Angeles, California, United States of America
| | - Paul J. Tung
- Office of Health Informatics and Analytics, David Geffen School of Medicine, University of California—Los Angeles, Los Angeles, California, United States of America
| | - Michael E. Broudy
- Office of Health Informatics and Analytics, David Geffen School of Medicine, University of California—Los Angeles, Los Angeles, California, United States of America
| | - Yu Pan
- Office of Health Informatics and Analytics, David Geffen School of Medicine, University of California—Los Angeles, Los Angeles, California, United States of America
| | - Alfredo Gonzalez
- Department of Human Genetics, David Geffen School of Medicine, University of California—Los Angeles, Los Angeles, California, United States of America
| | - Nikhil Chavan
- Office of Health Informatics and Analytics, David Geffen School of Medicine, University of California—Los Angeles, Los Angeles, California, United States of America
| | - Ruth Johnson
- Department of Computer Science, McGill University, Montréal, Québec, Canada
| | - Bogdan Pasaniuc
- Department of Human Genetics, David Geffen School of Medicine, University of California—Los Angeles, Los Angeles, California, United States of America
- Department of Computational Medicine, David Geffen School of Medicine, University of California—Los Angeles, Los Angeles, California, United States of America
- Department of Pathology, David Geffen School of Medicine, University of California—Los Angeles, Los Angeles, California, United States of America
| | - Brian Yaspan
- Genentech Inc, South San Francisco, California, United States of America
| | - Sandra Smieszek
- Vanda Pharmaceuticals, Washington, District of Columbia, United States of America
| | - Carlo Rivolta
- Institute of Molecular and Clinical Ophthalmology Basel (IOB), Basel, Switzerland
- Department of Genetics and Genome Biology, University of Leicester, Leicester, United Kingdom
- Department of Ophthalmology, University Hospital Basel, Basel, Switzerland
| | - Stephanie Bibert
- Infectious Diseases Service, Department of Medicine, University Hospital and University of Lausanne, Lausanne, Switzerland
| | - Pierre-Yves Bochud
- Infectious Diseases Service, Department of Medicine, University Hospital and University of Lausanne, Lausanne, Switzerland
| | - Maciej Dabrowski
- MNM Bioscience Inc., Cambridge, Massachusetts, United States of America
| | - Pawel Zawadzki
- MNM Bioscience Inc., Cambridge, Massachusetts, United States of America
- Faculty of Physics, Adam Mickiewicz University, Poznan, Poland
| | | | - Elżbieta Kaja
- MNM Bioscience Inc., Cambridge, Massachusetts, United States of America
- Department of Medical Chemistry and Laboratory Medicine, Poznań University of Medical Sciences, Poznań, Poland
| | - Pajaree Chariyavilaskul
- Clinical Pharmacokinetics and Pharmacogenomics Research Unit, Department of Pharmacology, Faculty of Medicine, Chulalongkorn University, Bangkok, Thailand
| | - Voraphoj Nilaratanakul
- Healthcare-associated Infection Research Group STAR (Special Task Force for Activating Research) and Division of Infectious Diseases, Department of Medicine,Chulalongkorn University, Bangkok, Thailand
| | - Nattiya Hirankarn
- Center of Excellence in Immunology and Immune-mediated Diseases, Department of Microbiology, Faculty of Medicine, Chulalongkorn University, Bangkok, Thailand
| | - Vorasuk Shotelersuk
- Center of Excellence for Medical Genomics, Medical Genomics Cluster, and Department of Pediatrics, Faculty of Medicine, Chulalongkorn University, Bangkok, Thailand
| | - Monnat Pongpanich
- Department of Mathematics and Computer Science, Faculty of Science, Chulalongkorn University, Bangkok, Thailand
| | - Chureerat Phokaew
- Research Affairs, Faculty of Medicine, Chulalongkorn University, Bangkok, Thailand
| | - Wanna Chetruengchai
- Center of Excellence for Medical Genomics, Medical Genomics Cluster, Faculty of Medicine, Chulalongkorn University, Bangkok, Thailand
| | - Katsushi Tokunaga
- Genome Medical Science Project, Research Institute, National Center for Global Health and Medicine, Shinjuku-ku, Tokyo, Japan
| | - Masaya Sugiyama
- Genome Medical Science Project, Research Institute, National Center for Global Health and Medicine, Shinjuku-ku, Tokyo, Japan
| | - Yosuke Kawai
- Genome Medical Science Project, National Center for Global Health and Medicine (NCGM), Tokyo, Japan
| | - Takanori Hasegawa
- M&D Data Science Center, Tokyo Medical and Dental University, Tokyo, Japan
| | - Tatsuhiko Naito
- Department of Statistical Genetics, Osaka University Graduate School of Medicine, Suita, Japan
- Laboratory for Systems Genetics, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | - Ho Namkoong
- Department of Infectious Diseases, Keio University School of Medicine, Tokyo, Japan
| | - Ryuya Edahiro
- Department of Statistical Genetics, Osaka University Graduate School of Medicine, Suita, Japan
- Department of Respiratory Medicine and Clinical Immunology, Osaka University Graduate School of Medicine, Suita, Japan
| | - Akinori Kimura
- Institute of Research, Tokyo Medical and Dental University, Tokyo, Japan
| | - Seishi Ogawa
- Department of Pathology and Tumor Biology, Kyoto University, Kyoto, Japan
- Institute for the Advanced Study of Human Biology (WPI-ASHBi), Kyoto University, Kyoto, Japan
- Department of Medicine, Center for Hematology and Regenerative Medicine, Karolinska Institute, Stockholm, Sweden
| | - Takanori Kanai
- Division of Gastroenterology and Hepatology, Department of Medicine, Keio University School of Medicine, Tokyo, Japan
| | - Koichi Fukunaga
- Division of Pulmonary Medicine, Department of Medicine, Keio University School of Medicine, Tokyo, Japan
| | - Yukinori Okada
- Department of Statistical Genetics, Osaka University Graduate School of Medicine, Suita, Japan
- Laboratory for Systems Genetics, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
- Integrated Frontier Research for Medical Science Division, Institute for Open and Transdisciplinary Research Initiatives, Osaka University, Suita, Japan
- Laboratory of Statistical Immunology, Immunology Frontier Research Center (WPI-IFReC), Osaka University, Suita, Japan
- Center for Infectious Disease Education and Research (CiDER), Osaka University, Suita, Japan
| | - Seiya Imoto
- Division of Health Medical Intelligence, Human Genome Center, the Institute of Medical Science, the University of Tokyo, Tokyo, Japan
| | - Satoru Miyano
- M&D Data Science Center, Tokyo Medical and Dental University, Tokyo, Japan
| | - Serghei Mangul
- Department of Clinical Pharmacy, School of Pharmacy, University of Southern California, Los Angeles, California, United States of America
| | - Malak S. Abedalthagafi
- Genomics Research Department, Saudi Human Genome Project, King Fahad Medical City and King Abdulaziz City for Science and Technology, Riyadh, Saudi Arabia
| | - Hugo Zeberg
- Department of Neuroscience, Karolinska Institutet, Stockholm, Sweden
| | - Joseph J. Grzymski
- Center for Genomic Medicine, Desert Research Institute, Reno, Nevada United States of America
| | | | - Stephan Ossowski
- Institute of Medical Genetics and Applied Genomics, University of Tuebingen, Tuebingen, Germany
- NGS Competence Center Tuebingen, Institute of Medical Genetics and Applied Genomics, University of Tuebingen, Tuebingen, Germany
| | - Kerstin U. Ludwig
- Institute of Human Genetics, School of Medicine and University Hospital Bonn, University of Bonn, Bonn, Germany
- West German Genome Center, site Bonn, University of Bonn, Bonn, Germany
| | - Eva C. Schulte
- Institute of Psychiatric Phenomics and Genomics (IPPG), University Hospital, LMU Munich, Munich, Germany
- Department of Psychiatry and Psychotherapy, University Hospital, LMU Munich, Munich, Germany
- Institute of Virology, Technical University Munich/Helmholtz Zentrum München, Munich, Germany
| | - Olaf Riess
- Institute of Medical Genetics and Applied Genomics, University of Tuebingen, Tuebingen, Germany
- NGS Competence Center Tuebingen, Institute of Medical Genetics and Applied Genomics, University of Tuebingen, Tuebingen, Germany
| | - Marcin Moniuszko
- Department of Regenerative Medicine and Immune Regulation, Medical University of Bialystok, Bialystok, Poland
- Department of Allergology and Internal Medicine, Medical University of Bialystok, Bialystok, Poland
| | - Miroslaw Kwasniewski
- IMAGENE.ME SA, Bialystok, Poland
- Centre for Bioinformatics and Data Analysis, Medical University of Bialystok, Bialystok, Poland
| | - Hamdi Mbarek
- Qatar Genome Program, Qatar Foundation Research, Development and Innovation, Qatar Foundation, Doha, Qatar
| | - Said I. Ismail
- Qatar Genome Program, Qatar Foundation Research, Development and Innovation, Qatar Foundation, Doha, Qatar
| | - Anurag Verma
- Division of Translational Medicine and Human Genetics, Department of Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
- Corporal Michael Crescenz VA Medical Center, Philadelphia, Pennsylvania, United States of America
| | - David B. Goldstein
- Institute for Genomic Medicine, Columbia University, New York city, New York, United States of America
- Department of Genetics & Development, Columbia University, New York city, New York, United States of America
| | - Krzysztof Kiryluk
- Institute for Genomic Medicine, Columbia University, New York city, New York, United States of America
- Division of Nephrology, Department of Medicine, Vagelos College of Physicians & Surgeons, Columbia University, New York city, New York, United States of America
| | - Alessandra Renieri
- Department of Medical Biotechnologies, Med Biotech Hub and Competence Center, University of Siena, Siena, Italy
- Genetica Medica, Azienda Ospedaliero-Universitaria Senese, Siena, Italy
- Medical Genetics, University of Siena, Siena, Italy
| | | | - J Brent Richards
- Department of Epidemiology, Biostatistics and Occupational Health, McGill University, Montréal, Québec, Canada
- Lady Davis Institute, Jewish General Hospital, McGill University, Montréal, Québec, Canada
- Department of Human Genetics, McGill University, Montréal, Québec, Canada
- Infectious Diseases and Immunity in Global Health Program, Research Institute of the McGill University Health Centre, Montréal, Québec, Canada
- Department of Twin Research, King’s College London, London, United Kingdom
- 5 Prime Sciences Inc, Montreal, Quebec, Canada
| |
Collapse
|
42
|
Jang SK, Evans L, Fialkowski A, Arnett DK, Ashley-Koch AE, Barnes KC, Becker DM, Bis JC, Blangero J, Bleecker ER, Boorgula MP, Bowden DW, Brody JA, Cade BE, Jenkins BWC, Carson AP, Chavan S, Cupples LA, Custer B, Damrauer SM, David SP, de Andrade M, Dinardo CL, Fingerlin TE, Fornage M, Freedman BI, Garrett ME, Gharib SA, Glahn DC, Haessler J, Heckbert SR, Hokanson JE, Hou L, Hwang SJ, Hyman MC, Judy R, Justice AE, Kaplan RC, Kardia SLR, Kelly S, Kim W, Kooperberg C, Levy D, Lloyd-Jones DM, Loos RJF, Manichaikul AW, Gladwin MT, Martin LW, Nouraie M, Melander O, Meyers DA, Montgomery CG, North KE, Oelsner EC, Palmer ND, Payton M, Peljto AL, Peyser PA, Preuss M, Psaty BM, Qiao D, Rader DJ, Rafaels N, Redline S, Reed RM, Reiner AP, Rich SS, Rotter JI, Schwartz DA, Shadyab AH, Silverman EK, Smith NL, Smith JG, Smith AV, Smith JA, Tang W, Taylor KD, Telen MJ, Vasan RS, Gordeuk VR, Wang Z, Wiggins KL, Yanek LR, Yang IV, Young KA, Young KL, Zhang Y, Liu DJ, Keller MC, Vrieze S. Rare genetic variants explain missing heritability in smoking. Nat Hum Behav 2022; 6:1577-1586. [PMID: 35927319 PMCID: PMC9985486 DOI: 10.1038/s41562-022-01408-5] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2021] [Accepted: 06/10/2022] [Indexed: 12/11/2022]
Abstract
Common genetic variants explain less variation in complex phenotypes than inferred from family-based studies, and there is a debate on the source of this 'missing heritability'. We investigated the contribution of rare genetic variants to tobacco use with whole-genome sequences from up to 26,257 unrelated individuals of European ancestries and 11,743 individuals of African ancestries. Across four smoking traits, single-nucleotide-polymorphism-based heritability ([Formula: see text]) was estimated from 0.13 to 0.28 (s.e., 0.10-0.13) in European ancestries, with 35-74% of it attributable to rare variants with minor allele frequencies between 0.01% and 1%. These heritability estimates are 1.5-4 times higher than past estimates based on common variants alone and accounted for 60% to 100% of our pedigree-based estimates of narrow-sense heritability ([Formula: see text], 0.18-0.34). In the African ancestry samples, [Formula: see text] was estimated from 0.03 to 0.33 (s.e., 0.09-0.14) across the four smoking traits. These results suggest that rare variants are important contributors to the heritability of smoking.
Collapse
Affiliation(s)
- Seon-Kyeong Jang
- Department of Psychology, University of Minnesota, Minneapolis, MN, USA
| | - Luke Evans
- Institute for Behavioral Genetics, University of Colorado Boulder, Boulder, CO, USA
- Department of Ecology & Evolution, University of Colorado Boulder, Boulder, CO, USA
| | | | - Donna K Arnett
- Dean's Office, University of Kentucky College of Public Health, Lexington, KY, USA
| | | | - Kathleen C Barnes
- Division of Biomedical Informatics & Personalized Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Diane M Becker
- Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Joshua C Bis
- Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle, WA, USA
| | - John Blangero
- Department of Human Genetics, University of Texas Rio Grande Valley School of Medicine, Brownsville, TX, USA
| | | | - Meher Preethi Boorgula
- Division of Biomedical Informatics & Personalized Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Donald W Bowden
- Department of Biochemistry, Wake Forest School of Medicine, Winston-Salem, NC, USA
| | - Jennifer A Brody
- Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle, WA, USA
| | - Brian E Cade
- Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
| | - Brenda W Campbell Jenkins
- Jackson Heart Study Graduate Training and Education Center, Jackson State University School of Public Health, Jackson, MS, USA
| | - April P Carson
- Department of Medicine, University of Mississippi Medical Center, Jackson, MS, USA
| | - Sameer Chavan
- Division of Biomedical Informatics & Personalized Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - L Adrienne Cupples
- Department of Biostatistics, Boston University School of Public Health, Boston, MA, USA
| | - Brian Custer
- Vitalant Research Institute, San Francisco, CA, USA
| | - Scott M Damrauer
- Department of Surgery, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
- Department of Surgery, Corporal Michael Crescenz VA Medical Center, Philadelphia, PA, USA
| | - Sean P David
- Department of Family Medicine, Prtizker School of Medicine, University of Chicago, Chicago, IL, USA
- NorthShore University HealthSystem, Evanston, IL, USA
| | - Mariza de Andrade
- Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA
| | | | - Tasha E Fingerlin
- Colorado School of Public Health, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
- Center for Genes Environment and Health, National Jewish Health, Denver, CO, USA
| | - Myriam Fornage
- Brown Foundation Institute of Molecular Medicine, McGovern Medical School, University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Barry I Freedman
- Section on Nephrology, Department of Internal Medicine, Wake Forest School of Medicine, Winston-Salem, NC, USA
| | - Melanie E Garrett
- Department of Medicine, Duke University School of Medicine, Durham, NC, USA
| | - Sina A Gharib
- Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle, WA, USA
- Center for Lung Biology, Division of Pulmonary, Critical Care and Sleep Medicine, University of Washington, Seattle, WA, USA
| | - David C Glahn
- Department of Psychiatry, Boston Children's Hosptial and Harvard Medical School, Boston, MA, USA
| | - Jeffrey Haessler
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Susan R Heckbert
- Department of Epidemiology, University of Washington, Seattle, WA, USA
- Kaiser Permanente Washington Health Research Institute, Kaiser Permanente Washington, Seattle, WA, USA
| | - John E Hokanson
- Department of Epidemiology, Colorado School of Public Health, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Lifang Hou
- Department of Preventive Medicine, Northwestern University, Chicago, IL, USA
| | - Shih-Jen Hwang
- Population Sciences Branch, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD, USA
| | - Matthew C Hyman
- Department of Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Renae Judy
- Department of Surgery, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Anne E Justice
- Department of Population Health Sciences, Geisinger Health System, Danville, PA, USA
| | - Robert C Kaplan
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
- Department of Epidemiology and Population Health, Albert Einstein College of Medicine, Bronx, NY, USA
| | - Sharon L R Kardia
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI, USA
| | - Shannon Kelly
- Department of Pediatrics, UCSF Benioff Children's Hospital Oakland, Oakland, CA, USA
| | - Wonji Kim
- Channing Division of Network Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
| | - Charles Kooperberg
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Daniel Levy
- Population Sciences Branch, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD, USA
- Framingham Heart Study, Framingham, MA, USA
| | | | - Ruth J F Loos
- Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Mindich Child Health and Development Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Ani W Manichaikul
- Center for Public Health Genomics, School of Medicine, University of Virginia, Charlottesville, VA, USA
| | - Mark T Gladwin
- Department of Medicine, University of Pittsburgh School of Medicine, Pittsburgh, PA, USA
| | | | - Mehdi Nouraie
- Department of Medicine, University of Pittsburgh School of Medicine, Pittsburgh, PA, USA
| | - Olle Melander
- Department of Clinical Sciences, Lund University, Malmö, Sweden
- Department of Internal Medicine, Skåne University Hospital, Malmö, Sweden
| | | | - Courtney G Montgomery
- Genes and Human Disease Research Program, Oklahoma Medical Research Foundation, Oklahoma City, OK, USA
| | - Kari E North
- Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Elizabeth C Oelsner
- Division of General Medicine, Columbia University Irving Medical Center, Columbia University, New York, NY, USA
| | - Nicholette D Palmer
- Department of Biochemistry, Wake Forest School of Medicine, Winston-Salem, NC, USA
| | - Marinelle Payton
- Department of Epidemiology and Biostatistics, Jackson Heart Study Graduate Training and Education Center, Jackson State University School of Public Health, Jackson, MS, USA
| | - Anna L Peljto
- Department of Medicine, University of Colorado School of Medicine, Aurora, CO, USA
| | - Patricia A Peyser
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI, USA
| | - Michael Preuss
- Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Mindich Child Health and Development Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Bruce M Psaty
- Cardiovascular Health Research Unit, Department of Medicine, Epidemiology and Health Services, University of Washington, Seattle, WA, USA
| | - Dandi Qiao
- Channing Division of Network Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
| | - Daniel J Rader
- Department of Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Nicholas Rafaels
- Division of Biomedical Informatics & Personalized Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Susan Redline
- Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
| | - Robert M Reed
- University of Maryland School of Medicine, Baltimore, MD, USA
| | - Alexander P Reiner
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Stephen S Rich
- Center for Public Health Genomics, School of Medicine, University of Virginia, Charlottesville, VA, USA
| | - Jerome I Rotter
- Institute for Translational Genomics and Population Sciences, Department of Pediatrics, Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, USA
| | - David A Schwartz
- Department of Medicine, School of Medicine, University of Colorado Denver, Aurora, CO, USA
- Department of Immunology, School of Medicine, University of Colorado Denver, Aurora, CO, USA
| | - Aladdin H Shadyab
- Herbert Wertheim School of Public Health and Human Longevity Science, University of California, San Diego, La Jolla, CA, USA
| | - Edwin K Silverman
- Channing Division of Network Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
| | - Nicholas L Smith
- Department of Epidemiology, University of Washington, Seattle, WA, USA
- Kaiser Permanente Washington Health Research Institute, Kaiser Permanente Washington, Seattle, WA, USA
| | - J Gustav Smith
- Wallenberg Laboratory/Department of Molecular and Clinical Medicine, Institute of Medicine, Gothenburg University, Gothenburg, Sweden
- Department of Cardiology, Sahlgrenska University Hospital, Gothenburg, Sweden
| | - Albert V Smith
- Department of Biostatistics, School of Public Health, University of Michigan, Ann Arbor, MI, USA
| | - Jennifer A Smith
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI, USA
| | - Weihong Tang
- Division of Epidemiology and Community Health, School of Public Health, University of Minnesota, Minneapolis, MN, USA
| | - Kent D Taylor
- Institute for Translational Genomics and Population Sciences, Department of Pediatrics, Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, USA
| | - Marilyn J Telen
- Department of Medicine, Duke University School of Medicine, Durham, NC, USA
| | - Ramachandran S Vasan
- Sections of Preventive Medicine and Epidemiology and Cardiovascular Medicine, Department of Medicine, Boston University School of Medicine, Boston, MA, USA
- Department of Epidemiology, Boston University School of Public Health, Boston, MA, USA
| | - Victor R Gordeuk
- Department of Medicine, University of Illinois at Chicago, Chicago, IL, USA
| | - Zhe Wang
- Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Mindich Child Health and Development Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Kerri L Wiggins
- Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle, WA, USA
| | - Lisa R Yanek
- Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Ivana V Yang
- Department of Medicine, University of Colorado School of Medicine, Aurora, CO, USA
| | - Kendra A Young
- Department of Epidemiology, Colorado School of Public Health, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Kristin L Young
- Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Yingze Zhang
- Department of Medicine, University of Pittsburgh School of Medicine, Pittsburgh, PA, USA
| | - Dajiang J Liu
- Department of Public Health Sciences, Penn State College of Medicine, Hershey, PA, USA
| | - Matthew C Keller
- Institute for Behavioral Genetics, University of Colorado Boulder, Boulder, CO, USA
| | - Scott Vrieze
- Department of Psychology, University of Minnesota, Minneapolis, MN, USA.
| |
Collapse
|
43
|
Wojcik GL, Murphy J, Edelson JL, Gignoux CR, Ioannidis AG, Manning A, Rivas MA, Buyske S, Hendricks AE. Opportunities and challenges for the use of common controls in sequencing studies. Nat Rev Genet 2022; 23:665-679. [PMID: 35581355 PMCID: PMC9765323 DOI: 10.1038/s41576-022-00487-4] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/22/2022] [Indexed: 01/02/2023]
Abstract
Genome-wide association studies using large-scale genome and exome sequencing data have become increasingly valuable in identifying associations between genetic variants and disease, transforming basic research and translational medicine. However, this progress has not been equally shared across all people and conditions, in part due to limited resources. Leveraging publicly available sequencing data as external common controls, rather than sequencing new controls for every study, can better allocate resources by augmenting control sample sizes or providing controls where none existed. However, common control studies must be carefully planned and executed as even small differences in sample ascertainment and processing can result in substantial bias. Here, we discuss challenges and opportunities for the robust use of common controls in high-throughput sequencing studies, including study design, quality control and statistical approaches. Thoughtful generation and use of large and valuable genetic sequencing data sets will enable investigation of a broader and more representative set of conditions, environments and genetic ancestries than otherwise possible.
Collapse
Affiliation(s)
- Genevieve L Wojcik
- Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Jessica Murphy
- Biostatistics and Informatics, Colorado School of Public Health, Aurora, CO, USA
- Mathematical and Statistical Sciences, University of Colorado Denver, Denver, CO, USA
| | - Jacob L Edelson
- Department of Biomedical Data Science, Stanford Medical School, Stanford, CA, USA
| | - Christopher R Gignoux
- Biostatistics and Informatics, Colorado School of Public Health, Aurora, CO, USA
- Human Medical Genetics and Genomics Program, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
- Colorado Center for Personalized Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Alexander G Ioannidis
- Institute for Computational and Mathematical Engineering, Stanford University, Stanford, CA, USA
- Clinical and Translational Epidemiology Unit, Massachusetts General Hospital, Boston, MA, USA
| | - Alisa Manning
- Metabolism Program, Broad Institute, Cambridge, MA, USA
- Department of Medicine, Harvard Medical School, Boston, MA, USA
| | - Manuel A Rivas
- Department of Biomedical Data Science, Stanford Medical School, Stanford, CA, USA
| | - Steven Buyske
- Department of Statistics, Rutgers University, Piscataway, NJ, USA
| | - Audrey E Hendricks
- Biostatistics and Informatics, Colorado School of Public Health, Aurora, CO, USA.
- Mathematical and Statistical Sciences, University of Colorado Denver, Denver, CO, USA.
- Human Medical Genetics and Genomics Program, University of Colorado Anschutz Medical Campus, Aurora, CO, USA.
- Colorado Center for Personalized Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO, USA.
| |
Collapse
|
44
|
Chen W, Coombes BJ, Larson NB. Recent advances and challenges of rare variant association analysis in the biobank sequencing era. Front Genet 2022; 13:1014947. [PMID: 36276986 PMCID: PMC9582646 DOI: 10.3389/fgene.2022.1014947] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2022] [Accepted: 09/22/2022] [Indexed: 12/04/2022] Open
Abstract
Causal variants for rare genetic diseases are often rare in the general population. Rare variants may also contribute to common complex traits and can have much larger per-allele effect sizes than common variants, although power to detect these associations can be limited. Sequencing costs have steadily declined with technological advancements, making it feasible to adopt whole-exome and whole-genome profiling for large biobank-scale sample sizes. These large amounts of sequencing data provide both opportunities and challenges for rare-variant association analysis. Herein, we review the basic concepts of rare-variant analysis methods, the current state-of-the-art methods in utilizing variant annotations or external controls to improve the statistical power, and particular challenges facing rare variant analysis such as accounting for population structure, extremely unbalanced case-control design. We also review recent advances and challenges in rare variant analysis for familial sequencing data and for more complex phenotypes such as survival data. Finally, we discuss other potential directions for further methodology investigation.
Collapse
Affiliation(s)
- Wenan Chen
- Center for Applied Bioinformatics, St. Jude Children’s Research Hospital, Memphis, TN, United States
- *Correspondence: Wenan Chen, ; Brandon J. Coombes, ; Nicholas B. Larson,
| | - Brandon J. Coombes
- Department of Quantitative Health Sciences, Mayo Clinic, Rochester, MN, United States
- *Correspondence: Wenan Chen, ; Brandon J. Coombes, ; Nicholas B. Larson,
| | - Nicholas B. Larson
- Department of Quantitative Health Sciences, Mayo Clinic, Rochester, MN, United States
- *Correspondence: Wenan Chen, ; Brandon J. Coombes, ; Nicholas B. Larson,
| |
Collapse
|
45
|
Asensio EM, Ortega-Azorín C, Barragán R, Alvarez-Sala A, Sorlí JV, Pascual EC, Fernández-Carrión R, Villamil LV, Corella D, Coltell O. Association between Microbiome-Related Human Genetic Variants and Fasting Plasma Glucose in a High-Cardiovascular-Risk Mediterranean Population. MEDICINA (KAUNAS, LITHUANIA) 2022; 58:1238. [PMID: 36143914 PMCID: PMC9502852 DOI: 10.3390/medicina58091238] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/04/2022] [Revised: 09/02/2022] [Accepted: 09/05/2022] [Indexed: 11/29/2022]
Abstract
Background and Objectives: The gut microbiota has been increasingly recognized as a relevant factor associated with metabolic diseases. However, directly measuring the microbiota composition is a limiting factor for several studies. Therefore, using genetic variables as proxies for the microbiota composition is an important issue. Landmark microbiome-host genome-wide association studies (mbGWAS) have identified many SNPs associated with gut microbiota. Our aim was to analyze the association between relevant microbiome-related genetic variants (Mi-RSNPs) and fasting glucose and type 2 diabetes in a Mediterranean population, exploring the interaction with Mediterranean diet adherence. Materials and Methods: We performed a cross-sectional study in a high-cardiovascular-risk Mediterranean population (n = 1020), analyzing the association of Mi-RSNPs (from four published mbGWAS) with fasting glucose and type 2 diabetes. A single-variant approach was used for fitting fasting glucose and type 2 diabetes to a multivariable regression model. In addition, a Mendelian randomization analysis with multiple variants was performed as a sub-study. Results: We obtained several associations between Mi-RSNPs and fasting plasma glucose involving gut Gammaproteobacteria_HB, the order Rhizobiales, the genus Rumminococcus torques group, and the genus Tyzzerella as the top ranked. For type 2 diabetes, we also detected significant associations with Mi-RSNPs related to the order Rhizobiales, the family Desulfovibrionaceae, and the genus Romboutsia. In addition, some Mi-RSNPs and adherence to Mediterranean diet interactions were detected. Lastly, the formal Mendelian randomization analysis suggested combined effects. Conclusions: Although the use of Mi-RSNPs as proxies of the microbiome is still in its infancy, and although this is the first study analyzing such associations with fasting plasma glucose and type 2 diabetes in a Mediterranean population, some interesting associations, as well as modulations, with adherence to the Mediterranean diet were detected in these high-cardiovascular-risk subjects, eliciting new hypotheses.
Collapse
Affiliation(s)
- Eva M. Asensio
- Department of Preventive Medicine and Public Health, School of Medicine, University of Valencia, 46010 Valencia, Spain
- CIBER Fisiopatología de la Obesidad y Nutrición, Instituto de Salud Carlos III, 28029 Madrid, Spain
| | - Carolina Ortega-Azorín
- Department of Preventive Medicine and Public Health, School of Medicine, University of Valencia, 46010 Valencia, Spain
- CIBER Fisiopatología de la Obesidad y Nutrición, Instituto de Salud Carlos III, 28029 Madrid, Spain
- Department of Computer Languages and Systems, Universitat Jaume I, 12071 Castellón, Spain
| | - Rocío Barragán
- Department of Preventive Medicine and Public Health, School of Medicine, University of Valencia, 46010 Valencia, Spain
- CIBER Fisiopatología de la Obesidad y Nutrición, Instituto de Salud Carlos III, 28029 Madrid, Spain
| | - Andrea Alvarez-Sala
- Department of Preventive Medicine and Public Health, School of Medicine, University of Valencia, 46010 Valencia, Spain
| | - José V. Sorlí
- Department of Preventive Medicine and Public Health, School of Medicine, University of Valencia, 46010 Valencia, Spain
- CIBER Fisiopatología de la Obesidad y Nutrición, Instituto de Salud Carlos III, 28029 Madrid, Spain
| | - Eva C. Pascual
- Department of Preventive Medicine and Public Health, School of Medicine, University of Valencia, 46010 Valencia, Spain
| | - Rebeca Fernández-Carrión
- Department of Preventive Medicine and Public Health, School of Medicine, University of Valencia, 46010 Valencia, Spain
- CIBER Fisiopatología de la Obesidad y Nutrición, Instituto de Salud Carlos III, 28029 Madrid, Spain
| | - Laura V. Villamil
- Department of Phisiology, School of Medicine, University Antonio Nariño, Bogotá 111511, Colombia
| | - Dolores Corella
- Department of Preventive Medicine and Public Health, School of Medicine, University of Valencia, 46010 Valencia, Spain
- CIBER Fisiopatología de la Obesidad y Nutrición, Instituto de Salud Carlos III, 28029 Madrid, Spain
| | - Oscar Coltell
- CIBER Fisiopatología de la Obesidad y Nutrición, Instituto de Salud Carlos III, 28029 Madrid, Spain
- Department of Computer Languages and Systems, Universitat Jaume I, 12071 Castellón, Spain
| |
Collapse
|
46
|
Ju D, Hui D, Hammond DA, Wonkam A, Tishkoff SA. Importance of Including Non-European Populations in Large Human Genetic Studies to Enhance Precision Medicine. Annu Rev Biomed Data Sci 2022; 5:321-339. [PMID: 35576557 PMCID: PMC9904154 DOI: 10.1146/annurev-biodatasci-122220-112550] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
One goal of genomic medicine is to uncover an individual's genetic risk for disease, which generally requires data connecting genotype to phenotype, as done in genome-wide association studies (GWAS). While there may be clinical promise to employing prediction tools such as polygenic risk scores (PRS), it currently stands that individuals of non-European ancestry may not reap the benefits of genomic medicine because of underrepresentation in large-scale genetics studies. Here, we discuss why this inequity poses a problem for genomic medicine and the reasons for the low transferability of PRS across populations. We also survey the ancestry representation of published GWAS and investigate how estimates of ancestry diversity in GWASparticipants might be biased. We highlight the importance of expanding genetic research in Africa, one of the most underrepresented regions in human genomics research, and discuss issues of ethics, resources, and technology for equitable advancement of genomic medicine.
Collapse
Affiliation(s)
- Dan Ju
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA;
| | - Daniel Hui
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA;
- Graduate Program in Genomics and Computational Biology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Dorothy A Hammond
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA;
- Penn Center for Global Genomics & Health Equity, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Ambroise Wonkam
- Division of Human Genetics, Department of Medicine, Faculty of Health Sciences, University of Cape Town, Cape Town, South Africa
- Department of Genetic Medicine, Johns Hopkins School of Medicine, Baltimore, Maryland, USA;
| | - Sarah A Tishkoff
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA;
- Department of Biology, School of Arts and Sciences, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| |
Collapse
|
47
|
Dixon P, Harrison S, Hollingworth W, Davies NM, Davey Smith G. Estimating the causal effect of liability to disease on healthcare costs using Mendelian Randomization. ECONOMICS AND HUMAN BIOLOGY 2022; 46:101154. [PMID: 35803012 DOI: 10.1016/j.ehb.2022.101154] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/29/2021] [Revised: 06/15/2022] [Accepted: 06/28/2022] [Indexed: 05/27/2023]
Abstract
Accurate measurement of the effects of disease status on healthcare costs is important in the pragmatic evaluation of interventions but is complicated by endogeneity bias. Mendelian Randomization, the use of random perturbations in germline genetic variation as instrumental variables, can avoid these limitations. We used a novel Mendelian Randomization analysis to model the causal impact on inpatient hospital costs of liability to six prevalent diseases and health conditions: asthma, eczema, migraine, coronary heart disease, Type 2 diabetes, and depression. We identified genetic variants from replicated genome-wide associations studies and estimated their association with inpatient hospital costs on over 300,000 individuals. There was concordance of findings across varieties of sensitivity analyses, including stratification by sex and methods robust to violations of the exclusion restriction. Results overall were imprecise and we could not rule out large effects of liability to disease on healthcare costs. In particular, genetic liability to coronary heart disease had substantial impacts on costs.
Collapse
Affiliation(s)
- Padraig Dixon
- Nuffield Department of Primary Care Health Sciences, University of Oxford, United Kingdom; MRC Integrative Epidemiology Unit, University of Bristol, United Kingdom.
| | - Sean Harrison
- MRC Integrative Epidemiology Unit, University of Bristol, United Kingdom; Population Health Sciences, University of Bristol, United Kingdom
| | | | - Neil M Davies
- MRC Integrative Epidemiology Unit, University of Bristol, United Kingdom; Population Health Sciences, University of Bristol, United Kingdom; K.G. Jebsen Center for Genetic Epidemiology, Department of Public Health and Nursing, NTNU, Norwegian University of Science and Technology, Norway
| | - George Davey Smith
- MRC Integrative Epidemiology Unit, University of Bristol, United Kingdom; Population Health Sciences, University of Bristol, United Kingdom; NIHR Biomedical Research Centre, University of Bristol, United Kingdom
| |
Collapse
|
48
|
Mathey CM, Maj C, Scheer AB, Fazaal J, Wedi B, Wieczorek D, Amann PM, Löffler H, Koch L, Schöffl C, Dickel H, Ganjuur N, Hornung T, Forkel S, Greve J, Wurpts G, Hallberg P, Bygum A, Von Buchwald C, Karawajczyk M, Steffens M, Stingl J, Hoffmann P, Heilmann-Heimbach S, Mangold E, Ludwig KU, Rasmussen ER, Wadelius M, Sachs B, Nöthen MM, Forstner AJ. Molecular Genetic Screening in Patients With ACE Inhibitor/Angiotensin Receptor Blocker-Induced Angioedema to Explore the Role of Hereditary Angioedema Genes. Front Genet 2022; 13:914376. [PMID: 35923707 PMCID: PMC9339951 DOI: 10.3389/fgene.2022.914376] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2022] [Accepted: 06/21/2022] [Indexed: 11/13/2022] Open
Abstract
Angioedema is a relatively rare but potentially life-threatening adverse reaction to angiotensin-converting enzyme inhibitors (ACEi) and angiotensin receptor blockers (ARBs). As with hereditary forms of angioedema (HAE), this adverse reaction is mediated by bradykinin. Research suggests that ACEi/ARB-induced angioedema has a multifactorial etiology. In addition, recent case reports suggest that some ACEi/ARB-induced angioedema patients may carry pathogenic HAE variants. The aim of the present study was to investigate the possible association between ACEi/ARB-induced angioedema and HAE genes via systematic molecular genetic screening in a large cohort of ACEi/ARB-induced angioedema cases. Targeted re-sequencing of five HAE-associated genes (SERPING1, F12, PLG, ANGPT1, and KNG1) was performed in 212 ACEi/ARB-induced angioedema patients recruited in Germany/Austria, Sweden, and Denmark, and in 352 controls from a German cohort. Among patients, none of the identified variants represented a known pathogenic variant for HAE. Moreover, no significant association with ACEi/ARB-induced angioedema was found for any of the identified common [minor allele frequency (MAF) >5%] or rare (MAF < 5%) variants. However, several non-significant trends suggestive of possible protective effects were observed. The lowest p-value for an individual variant was found in PLG (rs4252129, p.R523W, p = 0.057, p.adjust > 0.999, Fisher’s exact test). Variant p.R523W was found exclusively in controls and has previously been associated with decreased levels of plasminogen, a precursor of plasmin which is part of a pathway directly involved in bradykinin production. In addition, rare, potentially functional variants (MAF < 5%, Phred-scaled combined annotation dependent depletion score >10) showed a nominally significant enrichment in controls both: 1) across all five genes; and 2) in the F12 gene alone. However, these results did not withstand correction for multiple testing. In conclusion, our results suggest that HAE-associated mutations are, at best, a rare cause of ACEi/ARB-induced angioedema. Furthermore, we were unable to identify a significant association between ACEi/ARB-induced angioedema and other variants in the investigated genes. Further studies with larger sample sizes are warranted to draw more definite conclusions concerning variants with limited effect sizes, including protective variants.
Collapse
Affiliation(s)
- Carina M. Mathey
- Institute of Human Genetics, University of Bonn, School of Medicine and University Hospital Bonn, Bonn, Germany
| | - Carlo Maj
- Institute for Genomic Statistics and Bioinformatics, University Hospital Bonn, Bonn, Germany
- Centre for Human Genetics, University of Marburg, Marburg, Germany
| | - Annika B. Scheer
- Institute of Human Genetics, University of Bonn, School of Medicine and University Hospital Bonn, Bonn, Germany
| | - Julia Fazaal
- Institute of Human Genetics, University of Bonn, School of Medicine and University Hospital Bonn, Bonn, Germany
| | - Bettina Wedi
- Department of Dermatology and Allergy, Comprehensive Allergy Center, Hannover Medical School, Hannover, Germany
| | - Dorothea Wieczorek
- Department of Dermatology and Allergy, Comprehensive Allergy Center, Hannover Medical School, Hannover, Germany
| | - Philipp M. Amann
- Department of Dermatology, SLK Hospital Heilbronn, Heilbronn, Germany
| | - Harald Löffler
- Department of Dermatology, SLK Hospital Heilbronn, Heilbronn, Germany
| | - Lukas Koch
- Department of Dermatology and Venereology, Medical University Graz, Graz, Austria
| | - Clemens Schöffl
- Department of Dermatology and Venereology, Medical University Graz, Graz, Austria
| | - Heinrich Dickel
- Department of Dermatology, Venereology and Allergology, St. Josef Hospital, University Medical Center, Ruhr University Bochum, Bochum, Germany
| | - Nomun Ganjuur
- Department of Dermatology, Venereology and Allergology, St. Josef Hospital, University Medical Center, Ruhr University Bochum, Bochum, Germany
| | - Thorsten Hornung
- Department of Dermatology and Allergy, University Hospital of Bonn, Bonn, Germany
| | - Susann Forkel
- Department of Dermatology, Venereology and Allergology, University Medical Center Göttingen, Göttingen, Germany
| | - Jens Greve
- Department of Otorhinolaryngology—Head and Neck Surgery, Ulm University Medical Center, Ulm, Germany
| | - Gerda Wurpts
- Department of Dermatology and Allergy, Aachen Comprehensive Allergy Center, University Hospital RWTH Aachen, Aachen, Germany
| | - Pär Hallberg
- Department of Medical Sciences, Clinical Pharmacology and Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| | - Anette Bygum
- Department of Clinical Genetics, Odense University Hospital, Odense, Denmark
- Clinical Institute, University of Southern Denmark, Odense, Denmark
| | - Christian Von Buchwald
- Department of Otorhinolaryngology—Head and Neck Surgery and Audiology, Rigshospitalet, University of Copenhagen, Copenhagen, Denmark
| | | | - Michael Steffens
- Research Division, Federal Institute for Drugs and Medical Devices, Bonn, Germany
| | - Julia Stingl
- Institute for Clinical Pharmacology, RWTH Aachen University, Aachen, Germany
| | - Per Hoffmann
- Institute of Human Genetics, University of Bonn, School of Medicine and University Hospital Bonn, Bonn, Germany
| | - Stefanie Heilmann-Heimbach
- Institute of Human Genetics, University of Bonn, School of Medicine and University Hospital Bonn, Bonn, Germany
| | - Elisabeth Mangold
- Institute of Human Genetics, University of Bonn, School of Medicine and University Hospital Bonn, Bonn, Germany
| | - Kerstin U. Ludwig
- Institute of Human Genetics, University of Bonn, School of Medicine and University Hospital Bonn, Bonn, Germany
| | - Eva R. Rasmussen
- Department of Otorhinolaryngology—Head and Neck Surgery and Audiology, Rigshospitalet, University of Copenhagen, Copenhagen, Denmark
| | - Mia Wadelius
- Department of Medical Sciences, Clinical Pharmacology and Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| | - Bernhardt Sachs
- Department of Dermatology and Allergy, Aachen Comprehensive Allergy Center, University Hospital RWTH Aachen, Aachen, Germany
- Research Division, Federal Institute for Drugs and Medical Devices, Bonn, Germany
| | - Markus M. Nöthen
- Institute of Human Genetics, University of Bonn, School of Medicine and University Hospital Bonn, Bonn, Germany
| | - Andreas J. Forstner
- Institute of Human Genetics, University of Bonn, School of Medicine and University Hospital Bonn, Bonn, Germany
- Institute of Neuroscience and Medicine (INM-1), Research Center Jülich, Jülich, Germany
- *Correspondence: Andreas J. Forstner,
| |
Collapse
|
49
|
Gloss AD, Vergnol A, Morton TC, Laurin PJ, Roux F, Bergelson J. Genome-wide association mapping within a local Arabidopsis thaliana population more fully reveals the genetic architecture for defensive metabolite diversity. Philos Trans R Soc Lond B Biol Sci 2022; 377:20200512. [PMID: 35634919 PMCID: PMC9149790 DOI: 10.1098/rstb.2020.0512] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2021] [Accepted: 03/08/2022] [Indexed: 12/16/2022] Open
Abstract
A paradoxical finding from genome-wide association studies (GWAS) in plants is that variation in metabolite profiles typically maps to a small number of loci, despite the complexity of underlying biosynthetic pathways. This discrepancy may partially arise from limitations presented by geographically diverse mapping panels. Properties of metabolic pathways that impede GWAS by diluting the additive effect of a causal variant, such as allelic and genetic heterogeneity and epistasis, would be expected to increase in severity with the geographical range of the mapping panel. We hypothesized that a population from a single locality would reveal an expanded set of associated loci. We tested this in a French Arabidopsis thaliana population (less than 1 km transect) by profiling and conducting GWAS for glucosinolates, a suite of defensive metabolites that have been studied in depth through functional and genetic mapping approaches. For two distinct classes of glucosinolates, we discovered more associations at biosynthetic loci than the previous GWAS with continental-scale mapping panels. Candidate genes underlying novel associations were supported by concordance between their observed effects in the TOU-A population and previous functional genetic and biochemical characterization. Local populations complement geographically diverse mapping panels to reveal a more complete genetic architecture for metabolic traits. This article is part of the theme issue 'Genetic basis of adaptation and speciation: from loci to causative mutations'.
Collapse
Affiliation(s)
- Andrew D. Gloss
- Department of Biology and Center for Genomics and Systems Biology, New York University, New York, NY, USA
- Department of Ecology and Evolution, University of Chicago, Chicago, IL, USA
| | - Amélie Vergnol
- Department of Ecology and Evolution, University of Chicago, Chicago, IL, USA
| | - Timothy C. Morton
- Department of Ecology and Evolution, University of Chicago, Chicago, IL, USA
| | - Peter J. Laurin
- Department of Biology and Center for Genomics and Systems Biology, New York University, New York, NY, USA
- Department of Ecology and Evolution, University of Chicago, Chicago, IL, USA
| | - Fabrice Roux
- LIPME, Université de Toulouse, INRAE, CNRS, Castanet-Tolosan, France
| | - Joy Bergelson
- Department of Biology and Center for Genomics and Systems Biology, New York University, New York, NY, USA
- Department of Ecology and Evolution, University of Chicago, Chicago, IL, USA
| |
Collapse
|
50
|
Burt CH. Challenging the utility of polygenic scores for social science: Environmental confounding, downward causation, and unknown biology. Behav Brain Sci 2022; 46:e207. [PMID: 35551690 PMCID: PMC9653522 DOI: 10.1017/s0140525x22001145] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
The sociogenomics revolution is upon us, we are told. Whether revolutionary or not, sociogenomics is poised to flourish given the ease of incorporating polygenic scores (or PGSs) as "genetic propensities" for complex traits into social science research. Pointing to evidence of ubiquitous heritability and the accessibility of genetic data, scholars have argued that social scientists not only have an opportunity but a duty to add PGSs to social science research. Social science research that ignores genetics is, some proponents argue, at best partial and likely scientifically flawed, misleading, and wasteful. Here, I challenge arguments about the value of genetics for social science and with it the claimed necessity of incorporating PGSs into social science models as measures of genetic influences. In so doing, I discuss the impracticability of distinguishing genetic influences from environmental influences because of non-causal gene-environment correlations, especially population stratification, familial confounding, and downward causation. I explain how environmental effects masquerade as genetic influences in PGSs, which undermines their raison d'être as measures of genetic propensity, especially for complex socially contingent behaviors that are the subject of sociogenomics. Additionally, I draw attention to the partial, unknown biology, while highlighting the persistence of an implicit, unavoidable reductionist genes versus environments approach. Leaving sociopolitical and ethical concerns aside, I argue that the potential scientific rewards of adding PGSs to social science are few and greatly overstated and the scientific costs, which include obscuring structural disadvantages and cultural influences, outweigh these meager benefits for most social science applications.
Collapse
Affiliation(s)
- Callie H Burt
- Department of Criminal Justice & Criminology, Center for Research on Interpersonal Violence (CRIV), Georgia State University, Atlanta, GA, USA ; www.callieburt.org
| |
Collapse
|