1
|
Lin KH, Granka JM, Shastri AJ, Bonham VL, Naik RP. Ancestry-independent risk of venous thromboembolism in individuals with sickle cell trait vs factor V Leiden. Blood Adv 2024; 8:5710-5718. [PMID: 39255335 DOI: 10.1182/bloodadvances.2024014252] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2024] [Revised: 08/08/2024] [Accepted: 08/11/2024] [Indexed: 09/12/2024] Open
Abstract
ABSTRACT Sickle cell trait (SCT) is a risk factor for venous thromboembolism (VTE). Prior studies investigating the association between SCT and VTE have been performed nearly exclusively in Black populations. However, race-based research can contribute to systemic racism in medicine. We leveraged data from the 23andMe research cohort (4 184 082 participants) to calculate the ancestry-independent risk of VTE associated with SCT as well as comparative risk estimates for heterozygous factor V Leiden (FVL). Odds ratios (ORs) were calculated using a meta-analysis of 3 genetic ancestry groups (European [n = 3 183 142], Latine [n = 597 539], and African [n = 202 281]) and a secondary full-cohort analysis including 2 additional groups (East Asian [n = 159 863] and South Asian [n = 41 257]). Among the full cohort, 94 323 participants (2.25%) reported a history of VTE. On meta-analysis, individuals with SCT had a 1.45-fold (confidence interval [CI], 1.32-1.60) increased risk of VTE compared with SCT noncarriers, which was similar to the full-cohort estimate. The risk of pulmonary embolism (PE) in SCT (OR, 1.95; CI, 1.72-2.20) was higher than that of isolated deep venous thrombosis (DVT; OR, 1.04; CI, 0.90-1.21). FVL carriers had 3.30-fold (CI, 3.24-3.37) increased risk of VTE compared with FVL noncarriers, with a higher risk of isolated DVT (OR, 3.59; CI, 3.51-3.68) than PE (OR, 2.72; CI, 2.64-2.81). In this large, diverse cohort, the risk of VTE was increased among individuals with SCT compared with those without, independent of race or genetic ancestry. The risk of VTE with SCT was lower than that observed in FVL; however, the pattern of VTE in SCT was PE predominant, which is the opposite to that observed in FVL.
Collapse
Affiliation(s)
| | | | | | - Vence L Bonham
- National Human Genome Research Institute, Social and Behavioral Research Branch, National Institutes of Health, Bethesda, MD
| | - Rakhi P Naik
- Division of Hematology, Department of Medicine, Johns Hopkins University, Baltimore, MD
| |
Collapse
|
2
|
Jewett EM. CORRECTING MODEL MISSPECIFICATION IN RELATIONSHIP ESTIMATES. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.05.13.594005. [PMID: 38868169 PMCID: PMC11167690 DOI: 10.1101/2024.05.13.594005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/14/2024]
Abstract
The datasets of large genotyping biobanks and direct-to-consumer genetic testing companies contain many related individuals. Until now, it has been widely accepted that the most distant relationships that can be detected are around fifteen degrees (approximately 8 th cousins) and that practical relationship estimates have a ceiling around ten degrees (approximately 5 th cousins). However, we show that these assumptions are incorrect and that they are due to a misapplication of relationship estimators. In particular, relationship estimators are applied almost exclusively to putative relatives who have been identified because they share detectable tracts of DNA identically by descent (IBD). However, no existing relationship estimator conditions on the event that two individuals share at least one detectable segment of IBD anywhere in the genome. As a result, the relationship estimates obtained using existing estimators are dramatically biased for distant relationships, inferring all sufficiently distant relationships to be around ten degrees regardless of the depth of the true relationship. Existing relationship estimators are derived under a model that assumes that each pair of related individuals shares a single common ancestor (or mating pair of ancestors). This model breaks down for relationships beyond 10 generations in the past because individuals share many thousands of cryptic common ancestors due to pedigree collapse. We first derive a corrected likelihood that conditions on the event that at least one segment is observed between a pair of putative relatives and we demonstrate that the corrected likelihood largely eliminates the bias in estimates of pairwise relationships and provides a more accurate characterization of the uncertainty in these estimates. We then reformulate the relationship inference problem to account for the fact that individuals share many common ancestors, not just one. We demonstrate that the most distant relationship that can be inferred using IBD may be 200 degrees or more, rather than ten, extending the time-to-common ancestor from approximately 300 years in the past to approximately 3,000 years in the past or more. This dramatic increase in the range of relationship estimators makes it possible to infer relationships whose common ancestors lived before historical events such as European settlement of the Americas, the Transatlantic Slave Trade, and the rise and fall of the Roman Empire.
Collapse
|
3
|
Budowle B, Baker L, Sajantila A, Mittelman K, Mittelman D. Prioritizing privacy and presentation of supportable hypothesis testing in forensic genetic genealogy investigations. Biotechniques 2024; 76:425-431. [PMID: 39119680 DOI: 10.1080/07366205.2024.2386218] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2024] [Accepted: 07/25/2024] [Indexed: 08/10/2024] Open
Abstract
Investigative leads are not generated by traditional forensic DNA testing, if the source of the forensic evidence or a 1st degree relative of unidentified human remains is not in the DNA database. In such cases, forensic genetic genealogy (FGG) can provide valuable leads. However, FGG generated genetic data contain private and sensitive information. Therefore, it is essential to deploy approaches that minimize unnecessary disclosure of these data to mitigate potential risks to individual privacy. We recommend protective practices that need not impact effective reporting of relationship identifications. Examples include performing one-to-one comparisons of DNA profiles of third-party samples and evidence samples offline with an "air gap" to the internet and shielding the specific shared single nucleotide polymorphisms (SNP) states and locations by binning adjacent SNPs in forensic reports. Such approaches reduce risk of unwanted access to or reverse engineering of third-party individuals' genetic data and can give these donors greater confidence to support use of their DNA profiles in FGG investigation.
Collapse
Affiliation(s)
- Bruce Budowle
- Othram Inc., The Woodlands, TX 77381, USA
- Department of Forensic Medicine, University of Helsinki, Finland
- Forensic Science Institute, Radford University, Radford, VA 24142, USA
| | - Lee Baker
- Othram Inc., The Woodlands, TX 77381, USA
| | - Antti Sajantila
- Department of Forensic Medicine, University of Helsinki, Finland
- Forensic Medicine Unit, Finnish Institute for Health & Welfare, Helsinki,Finland
| | | | | |
Collapse
|
4
|
Jewett EM. SIMULATING PEDIGREES ASCERTAINED ON THE BASIS OF OBSERVED IBD SHARING. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.05.13.594012. [PMID: 38872734 PMCID: PMC11170672 DOI: 10.1101/2024.05.13.594012] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/15/2024]
Abstract
In large genotyping datasets, individuals often have thousands of distant cousins with whom they share detectable segments of DNA identically by descent (IBD). The ability to simulate these distant relationships is important for developing and testing methods, carrying out power analyses, and performing population genetic analyses. Because distant relatives are unlikely to share detectable IBD segments by chance, many simulation replicates are needed to sample IBD between any given pair of distant relatives. Exponentially more samples are needed to simulate observable segments of IBD simultaneously among multiple pairs of distant relatives in a single pedigree. Using existing pedigree simulation methods that do not condition on the event that IBD is observed among certain pairs of relatives, the chances of sampling shared IBD patterns that reflect those observed in real data ascertained from large genotyping datasets are vanishingly small, even for pedigrees of modest size. Here, we show how to sample recombination breakpoints on a fixed pedigree while conditioning on the event that specified pairs of individuals share at least one observed segment of IBD. The resulting simulator makes it possible to sample genotypes and IBD segments on pedigrees that reflect those ascertained from biobank scale data.
Collapse
|
5
|
Qiao Y, Jewett EM, McManus KF, Freyman WA, Curran JE, Williams-Blangero S, Blangero J, Williams AL. Reconstructing parent genomes using siblings and other relatives. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.05.10.593578. [PMID: 38798596 PMCID: PMC11118276 DOI: 10.1101/2024.05.10.593578] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2024]
Abstract
Reconstructing the DNA of ancestors from their descendants has the potential to empower phenotypic analyses (including association and genetic nurture studies), improve pedigree reconstruction, and shed light on the ancestral population and phenotypes of ancestors. We developed HAPI-RECAP, a method that reconstructs the DNA of parents from full siblings and their relatives. This tool leverages HAPI2's output, a new phasing approach that applies to siblings (and optionally one or both parents) and reliably infers parent haplotypes but does not link the ungenotyped parents' DNA across chromosomes or between segments flanking ambiguities. By combining IBD between the reconstructed parents and the relatives, HAPI-RECAP resolves the source parent of these segments. Moreover, the method exploits crossovers the children inherited and sex-specific genetic maps to infer the reconstructed parents' sexes. We validated these methods on research participants from both 23andMe, Inc. and the San Antonio Mexican American Family Studies. Given data for one parent, HAPI2 reconstructs large fractions of the missing parent's DNA, between 77.6% and 99.97% among all families, and 90.3% on average in three- and four-child families. When reconstructing both parents, HAPI-RECAP inferred between 33.2% and 96.6% of the parents' genotypes, averaging 70.6% in four-child families. Reconstructed genotypes have average error rates < 10-3, or comparable to those from direct genotyping. HAPI-RECAP inferred the parent sexes 100% correctly given IBD-linked segments and can also reconstruct parents without any IBD. As datasets grow in size, more families will be implicitly collected; HAPI-RECAP holds promise to enable high quality parent genotype reconstruction.
Collapse
Affiliation(s)
- Ying Qiao
- Department of Computational Biology, Cornell University, Ithaca, NY 14853, USA
| | | | | | | | - Joanne E. Curran
- South Texas Diabetes and Obesity Institute and Department of Human Genetics, University of Texas Rio Grande Valley School of Medicine, Brownsville, TX 78520, USA
| | - Sarah Williams-Blangero
- South Texas Diabetes and Obesity Institute and Department of Human Genetics, University of Texas Rio Grande Valley School of Medicine, Brownsville, TX 78520, USA
| | - John Blangero
- South Texas Diabetes and Obesity Institute and Department of Human Genetics, University of Texas Rio Grande Valley School of Medicine, Brownsville, TX 78520, USA
| | | | - Amy L. Williams
- Department of Computational Biology, Cornell University, Ithaca, NY 14853, USA
- 23andMe, Inc., Sunnyvale, CA 94086, USA
| |
Collapse
|
6
|
Browning SR, Browning BL. Biobank-scale inference of multi-individual identity by descent and gene conversion. Am J Hum Genet 2024; 111:691-700. [PMID: 38513668 PMCID: PMC11023918 DOI: 10.1016/j.ajhg.2024.02.015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2023] [Revised: 02/26/2024] [Accepted: 02/27/2024] [Indexed: 03/23/2024] Open
Abstract
We present a method for efficiently identifying clusters of identical-by-descent haplotypes in biobank-scale sequence data. Our multi-individual approach enables much more computationally efficient inference of identity by descent (IBD) than approaches that infer pairwise IBD segments and provides locus-specific IBD clusters rather than IBD segments. Our method's computation time, memory requirements, and output size scale linearly with the number of individuals in the dataset. We also present a method for using multi-individual IBD to detect alleles changed by gene conversion. Application of our methods to the autosomal sequence data for 125,361 White British individuals in the UK Biobank detects more than 9 million converted alleles. This is 2,900 times more alleles changed by gene conversion than were detected in a previous analysis of familial data. We estimate that more than 250,000 sequenced probands and a much larger number of additional genomes from multi-generational family members would be required to find a similar number of alleles changed by gene conversion using a family-based approach. Our IBD clustering method is implemented in the open-source ibd-cluster software package.
Collapse
Affiliation(s)
- Sharon R Browning
- Department of Biostatistics, University of Washington, Seattle, WA, USA.
| | - Brian L Browning
- Department of Biostatistics, University of Washington, Seattle, WA, USA; Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, WA, USA.
| |
Collapse
|
7
|
Ji Q, Yao Y, Li Z, Zhou Z, Qian J, Tang Q, Xie J. Characterizing identity by descent segments in Chinese interpopulation unrelated individual pairs. Mol Genet Genomics 2024; 299:37. [PMID: 38494535 DOI: 10.1007/s00438-024-02132-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2023] [Accepted: 02/22/2024] [Indexed: 03/19/2024]
Abstract
Identity by descent (IBD) segments, uninterrupted DNA segments derived from the same ancestral chromosomes, are widely used as indicators of relationships in genetics. A great deal of research focuses on IBD segments between related pairs, while the statistical analyses of segments in irrelevant individuals are rare. In this study, we investigated the basic informative features of IBD segments in unrelated pairs in Chinese populations from the 1000 Genome Project. A total of 5922 IBD segments in Chinese interpopulation unrelated individual pairs were detected via IBIS and the average length of IBD was 3.71 Mb in length. It was found that 17.86% of unrelated pairs shared at least one IBD segment in the Chinese cohort. Furthermore, a total of 49 chromosomal regions where IBD segments clustered in high abundance were identified, which might be sharing hotspots in the human genome. Such regions could also be observed in other ancestry populations, which implies that similar IBD backgrounds also exist. Altogether, these results demonstrated the distribution of common background IBD segments, which helps improve the accuracy in pedigree studies based on IBD analysis.
Collapse
Affiliation(s)
- Qiqi Ji
- Department of Forensic Medicine, School of Basic Medical Sciences, Fudan University, 138 Yixueyuan Road, Shanghai, 200032, China
| | - Yining Yao
- Department of Forensic Medicine, School of Basic Medical Sciences, Fudan University, 138 Yixueyuan Road, Shanghai, 200032, China
| | - Zhimin Li
- Department of Forensic Medicine, School of Basic Medical Sciences, Fudan University, 138 Yixueyuan Road, Shanghai, 200032, China
| | - Zhihan Zhou
- Department of Forensic Medicine, School of Basic Medical Sciences, Fudan University, 138 Yixueyuan Road, Shanghai, 200032, China
| | - Jinglei Qian
- Department of Forensic Medicine, School of Basic Medical Sciences, Fudan University, 138 Yixueyuan Road, Shanghai, 200032, China
| | - Qiqun Tang
- Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Fudan University, Shanghai, 200032, China
| | - Jianhui Xie
- Department of Forensic Medicine, School of Basic Medical Sciences, Fudan University, 138 Yixueyuan Road, Shanghai, 200032, China.
| |
Collapse
|
8
|
Marquez-Romero JM, Romo-Martínez J, Hernández-Curiel B, Ruiz-Franco A, Krishnamurthi R, Feigin V. Assessing the individual risk of stroke in caregivers of patients with stroke. ARQUIVOS DE NEURO-PSIQUIATRIA 2024; 82:1-5. [PMID: 38467391 PMCID: PMC10927366 DOI: 10.1055/s-0044-1779691] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/26/2023] [Accepted: 12/29/2023] [Indexed: 03/13/2024]
Abstract
BACKGROUND Genetic factors influence the risk of developing stroke. Still, it is unclear whether this risk is intrinsically high in certain people or if nongenetic factors explain it entirely. OBJECTIVE To compare the risk of stroke in kin and nonkin caregivers. METHODS In a cross-sectional study using the Stroke Riskometer app (AUT Ventures Limited, Auckland, AUK, New Zealand), we determined the 5- and 10-year stroke risk (SR) among caregivers of stroke inpatients. The degree of kinship was rated with a score ranging from 0 to 50 points. RESULTS We studied 278 caregivers (69.4% of them female) with a mean age of 47.5 ± 14.2 years. Kin caregivers represented 70.1% of the sample, and 49.6% of them were offspring. The median SR at 5 years was of 2.1 (range: 0.35-17.3) versus 1.73 (range: 0.04-29.9), and of 4.0 (range: 0.45-38.6) versus 2.94 (range: 0.05-59.35) at 10 years for the nonkin and kin caregivers respectively. In linear logistic regression controlled for the age of the caregivers, adding the kinship score did not increase the overall variability of the model for the risk at 5 years (R2 = 0.271; p = 0.858) nor the risk at 10 years (R2 = 0.376; p = 0.78). CONCLUSION Caregivers of stroke patients carry a high SR regardless of their degree of kinship.
Collapse
Affiliation(s)
- Juan Manuel Marquez-Romero
- Instituto Mexicano del Seguro Social, Órgano de Operación Administrativa Desconcentrada, Hospital General de Zona #2, Departamento de Neurología, Aguascalientes AGS, Mexico.
| | - Jessica Romo-Martínez
- Instituto Mexicano del Seguro Social, Órgano de Operación Administrativa Desconcentrada, Centro Médico de Occidente, Departamento de Radiología, Guadalajara JAL, Mexico.
| | | | - Angélica Ruiz-Franco
- Hospital Juárez de México, Departamento de Neurología, Ciudad México CDMX, Mexico.
| | - Rita Krishnamurthi
- National Institute for Stroke & Applied Neurosciences, School of Clinical Sciences, Auckland AUK, New Zealand.
| | - Valery Feigin
- National Institute for Stroke & Applied Neurosciences, School of Clinical Sciences, Auckland AUK, New Zealand.
| |
Collapse
|
9
|
Diz AP, Skibinski DOF. Patterns of admixture and introgression in a mosaic Mytilus galloprovincialis and Mytilus edulis hybrid zone in SW England. Mol Ecol 2024; 33:e17233. [PMID: 38063472 DOI: 10.1111/mec.17233] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2023] [Revised: 11/21/2023] [Accepted: 11/24/2023] [Indexed: 01/25/2024]
Abstract
The study of hybrid zones offers important insights into speciation. Earlier studies on hybrid populations of the marine mussel species Mytilus edulis and Mytilus galloprovincialis in SW England provided evidence of admixture but were constrained by the limited number of molecular markers available. We use 57 ancestry-informative SNPs, most of which have been mapped genetically, to provide evidence of distinctive differences between admixed populations in SW England and asymmetrical introgression from M. edulis to M. galloprovincialis. We combine the genetic study with analysis of phenotypic traits of potential ecological and adaptive significance. We demonstrate that hybrid individuals have brown mantle edges unlike the white or purple in the parental species, suggesting allelic or non-allelic genomic interactions. We report differences in gonad development stage between the species consistent with a prezygotic barrier between the species. By incorporating results from publications dating back to 1980, we confirm the long-term stability of the hybrid zone despite higher viability of M. galloprovincialis. This stability coincides with a dramatic change in temperature of UK coastal waters and suggests that these hybrid populations might be resisting the effects of global warming. However, a single SNP locus associated with the Notch transmembrane signalling protein shows a markedly different pattern of variation to the others and might be associated with adaptation of M. galloprovincialis to colder northern temperatures.
Collapse
Affiliation(s)
- Angel P Diz
- Centro de Investigación Mariña, Universidade de Vigo (CIM-UVIGO), Vigo, Spain
- Department of Biochemistry, Genetics and Immunology, University of Vigo, Vigo, Spain
| | | |
Collapse
|
10
|
Pitz V, Makarious MB, Bandres-Ciga S, Iwaki H, Singleton AB, Nalls M, Heilbron K, Blauwendraat C. Analysis of rare Parkinson's disease variants in millions of people. NPJ Parkinsons Dis 2024; 10:11. [PMID: 38191580 PMCID: PMC10774311 DOI: 10.1038/s41531-023-00608-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2023] [Accepted: 11/24/2023] [Indexed: 01/10/2024] Open
Abstract
Although many rare variants have been reportedly associated with Parkinson's disease (PD), many have not been replicated or have failed to replicate. Here, we conduct a large-scale replication of rare PD variants. We assessed a total of 27,590 PD cases, 6701 PD proxies, and 3,106,080 controls from three data sets: 23andMe, Inc., UK Biobank, and AMP-PD. Based on well-known PD genes, 834 variants of interest were selected from the ClinVar annotated 23andMe dataset. We performed a meta-analysis using summary statistics of all three studies. The meta-analysis resulted in five significant variants after Bonferroni correction, including variants in GBA1 and LRRK2. Another eight variants are strong candidate variants for their association with PD. Here, we provide the largest rare variant meta-analysis to date, providing information on confirmed and newly identified variants for their association with PD using several large databases. Additionally we also show the complexities of studying rare variants in large-scale cohorts.
Collapse
Affiliation(s)
- Vanessa Pitz
- Integrative Neurogenomics Unit, Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD, USA.
| | - Mary B Makarious
- Molecular Genetics Section, Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD, USA
- Department of Clinical and Movement Neurosciences, UCL Queen Square Institute of Neurology, London, UK
- UCL Movement Disorders Centre, University College London, London, UK
| | - Sara Bandres-Ciga
- Center for Alzheimer's and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD, USA
| | - Hirotaka Iwaki
- Molecular Genetics Section, Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD, USA
- Center for Alzheimer's and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD, USA
- Data Tecnica International, Washington, DC, USA
| | - Andrew B Singleton
- Molecular Genetics Section, Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD, USA
- Center for Alzheimer's and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD, USA
| | - Mike Nalls
- Molecular Genetics Section, Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD, USA
- Center for Alzheimer's and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD, USA
- Data Tecnica International, Washington, DC, USA
| | | | - Cornelis Blauwendraat
- Integrative Neurogenomics Unit, Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD, USA
- Center for Alzheimer's and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD, USA
| |
Collapse
|
11
|
Browning SR, Browning BL. Biobank-scale inference of multi-individual identity by descent and gene conversion. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.03.565574. [PMID: 37961601 PMCID: PMC10635131 DOI: 10.1101/2023.11.03.565574] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/15/2023]
Abstract
We present a method for efficiently identifying clusters of identical-by-descent haplotypes in biobank-scale sequence data. Our multi-individual approach enables much more efficient collection and storage of identity by descent (IBD) information than approaches that detect and store pairwise IBD segments. Our method's computation time, memory requirements, and output size scale linearly with the number of individuals in the dataset. We also present a method for using multi-individual IBD to detect alleles changed by gene conversion. Application of our methods to the autosomal sequence data for 125,361 White British individuals in the UK Biobank detects more than 9 million converted alleles. This is 2900 times more alleles changed by gene conversion than were detected in a previous analysis of familial data. We estimate that more than 250,000 sequenced probands and a much larger number of additional genomes from multi-generational family members would be required to find a similar number of alleles changed by gene conversion using a family-based approach.
Collapse
Affiliation(s)
| | - Brian L. Browning
- Department of Biostatistics, University of Washington, Seattle, WA
- Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, WA
| |
Collapse
|
12
|
Malawsky DS, van Walree E, Jacobs BM, Heng TH, Huang QQ, Sabir AH, Rahman S, Sharif SM, Khan A, Mirkov MU, Kuwahara H, Gao X, Alkuraya FS, Posthuma D, Newman WG, Griffiths CJ, Mathur R, van Heel DA, Finer S, O'Connell J, Martin HC. Influence of autozygosity on common disease risk across the phenotypic spectrum. Cell 2023; 186:4514-4527.e14. [PMID: 37757828 PMCID: PMC10580289 DOI: 10.1016/j.cell.2023.08.028] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2023] [Revised: 07/11/2023] [Accepted: 08/23/2023] [Indexed: 09/29/2023]
Abstract
Autozygosity is associated with rare Mendelian disorders and clinically relevant quantitative traits. We investigated associations between the fraction of the genome in runs of homozygosity (FROH) and common diseases in Genes & Health (n = 23,978 British South Asians), UK Biobank (n = 397,184), and 23andMe. We show that restricting analysis to offspring of first cousins is an effective way of reducing confounding due to social/environmental correlates of FROH. Within this group in G&H+UK Biobank, we found experiment-wide significant associations between FROH and twelve common diseases. We replicated associations with type 2 diabetes (T2D) and post-traumatic stress disorder via within-sibling analysis in 23andMe (median n = 480,282). We estimated that autozygosity due to consanguinity accounts for 5%-18% of T2D cases among British Pakistanis. Our work highlights the possibility of widespread non-additive genetic effects on common diseases and has important implications for global populations with high rates of consanguinity.
Collapse
Affiliation(s)
| | - Eva van Walree
- Department of Clinical Genetics, Amsterdam UMC, University of Amsterdam, Amsterdam, the Netherlands; Department of Complex Trait Genetics Center for Neurogenomics and Cognitive Research, Amsterdam Neuroscience, VU Amsterdam, Amsterdam, the Netherlands
| | - Benjamin M Jacobs
- Blizard Institute, Barts and the London School of Medicine and Dentistry, Queen Mary University of London, London, UK; Wolfson Institute of Population Health, Queen Mary University of London, London, UK
| | - Teng Hiang Heng
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, UK
| | - Qin Qin Huang
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, UK
| | - Ataf H Sabir
- West Midlands Regional Clinical Genetics Unit, Birmingham Women's and Children's NHS FT, Birmingham, UK; Institute of Cancer and Genomics, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
| | - Saadia Rahman
- Queen Square Institute of Neurology, University College London, London, UK
| | - Saghira Malik Sharif
- Yorkshire Regional Genetics Service, Leeds Teaching Hospitals NHS Trust, Leeds, UK
| | - Ahsan Khan
- Waltham Forest Council, Waltham Forest Town Hall, Forest Road, Walthamstow E17 4JF, UK
| | - Maša Umićević Mirkov
- Congenica Limited, BioData Innovation Centre, Wellcome Genome Campus, Hinxton, UK
| | - Hiroyuki Kuwahara
- King Abdullah University of Science and Technology (KAUST), Computational Bioscience Research Center (CBRC), Thuwal 23955, Saudi Arabia
| | - Xin Gao
- King Abdullah University of Science and Technology (KAUST), Computational Bioscience Research Center (CBRC), Thuwal 23955, Saudi Arabia
| | - Fowzan S Alkuraya
- Department of Translational Genomics, Center for Genomic Medicine, King Faisal Specialist Hospital and Research Center, Riyadh, Saudi Arabia
| | - Danielle Posthuma
- Department of Complex Trait Genetics Center for Neurogenomics and Cognitive Research, Amsterdam Neuroscience, VU Amsterdam, Amsterdam, the Netherlands
| | - William G Newman
- Division of Evolution, Infection and Genomics, Faculty of Biology, Medicine and Human Sciences, University of Manchester, Manchester M13 9PL, UK; Manchester Centre for Genomic Medicine, Manchester University NHS Foundation Trust, Manchester M13 9WL, UK
| | - Christopher J Griffiths
- Wolfson Institute of Population Health, Queen Mary University of London, London, UK; MRC and Asthma UK Centre in Allergic Mechanisms of Asthma, King's College London, London, UK
| | - Rohini Mathur
- Wolfson Institute of Population Health, Queen Mary University of London, London, UK
| | - David A van Heel
- Blizard Institute, Barts and the London School of Medicine and Dentistry, Queen Mary University of London, London, UK
| | - Sarah Finer
- Blizard Institute, Barts and the London School of Medicine and Dentistry, Queen Mary University of London, London, UK; Wolfson Institute of Population Health, Queen Mary University of London, London, UK
| | | | - Hilary C Martin
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, UK.
| |
Collapse
|
13
|
Thorpe HHA, Fontanillas P, Pham BK, Meredith JJ, Jennings MV, Courchesne-Krak NS, Vilar-Ribó L, Bianchi SB, Mutz J, Elson SL, Khokhar JY, Abdellaoui A, Davis LK, Palmer AA, Sanchez-Roige S. Genome-Wide Association Studies of Coffee Intake in UK/US Participants of European Ancestry Uncover Gene-Cohort Influences. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.09.09.23295284. [PMID: 37745582 PMCID: PMC10516045 DOI: 10.1101/2023.09.09.23295284] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/26/2023]
Abstract
Coffee is one of the most widely consumed beverages. We performed a genome-wide association study (GWAS) of coffee intake in US-based 23andMe participants (N=130,153) and identified 7 significant loci, with many replicating in three multi-ancestral cohorts. We examined genetic correlations and performed a phenome-wide association study across thousands of biomarkers and health and lifestyle traits, then compared our results to the largest available GWAS of coffee intake from UK Biobank (UKB; N=334,659). The results of these two GWAS were highly discrepant. We observed positive genetic correlations between coffee intake and psychiatric illnesses, pain, and gastrointestinal traits in 23andMe that were absent or negative in UKB. Genetic correlations with cognition were negative in 23andMe but positive in UKB. The only consistent observations were positive genetic correlations with substance use and obesity. Our study shows that GWAS in different cohorts could capture cultural differences in the relationship between behavior and genetics.
Collapse
Affiliation(s)
- Hayley H A Thorpe
- Department of Anatomy and Cell Biology, Schulich School of Medicine and Dentistry, Western University, London, ON, Canada
- Department of Biomedical Sciences, Ontario Veterinary College, University of Guelph, Guelph, ON, Canada
| | | | - Benjamin K Pham
- Department of Psychiatry, University of California San Diego, La Jolla, CA, USA
| | - John J Meredith
- Department of Psychiatry, University of California San Diego, La Jolla, CA, USA
| | - Mariela V Jennings
- Department of Psychiatry, University of California San Diego, La Jolla, CA, USA
| | | | - Laura Vilar-Ribó
- Psychiatric Genetics Unit, Group of Psychiatry, Mental Health and Addiction, Vall d’Hebron Research Institute (VHIR), Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Sevim B Bianchi
- Department of Psychiatry, University of California San Diego, La Jolla, CA, USA
| | - Julian Mutz
- Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology & Neuroscience, King’s College London, London, UK
| | - 23andMe Research Team
- Department of Biomedical Sciences, Ontario Veterinary College, University of Guelph, Guelph, ON, Canada
| | - Sarah L Elson
- Department of Biomedical Sciences, Ontario Veterinary College, University of Guelph, Guelph, ON, Canada
| | - Jibran Y Khokhar
- Department of Anatomy and Cell Biology, Schulich School of Medicine and Dentistry, Western University, London, ON, Canada
- Department of Biomedical Sciences, Ontario Veterinary College, University of Guelph, Guelph, ON, Canada
| | - Abdel Abdellaoui
- Department of Psychiatry, Amsterdam UMC, University of Amsterdam, Amsterdam, The Netherlands
| | - Lea K Davis
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, USA
- Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
- Department of Psychiatry and Behavioral Sciences, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Abraham A Palmer
- Department of Psychiatry, University of California San Diego, La Jolla, CA, USA
- Institute for Genomic Medicine, University of California San Diego, La Jolla, CA, USA
| | - Sandra Sanchez-Roige
- Department of Psychiatry, University of California San Diego, La Jolla, CA, USA
- Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
| |
Collapse
|
14
|
Budowle B, Arnette A, Sajantila A. A cost-benefit analysis for use of large SNP panels and high throughput typing for forensic investigative genetic genealogy. Int J Legal Med 2023; 137:1595-1614. [PMID: 37341834 PMCID: PMC10421786 DOI: 10.1007/s00414-023-03029-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2023] [Accepted: 05/16/2023] [Indexed: 06/22/2023]
Abstract
Next-generation sequencing (NGS), also known as massively sequencing, enables large dense SNP panel analyses which generate the genetic component of forensic investigative genetic genealogy (FIGG). While the costs of implementing large SNP panel analyses into the laboratory system may seem high and daunting, the benefits of the technology may more than justify the investment. To determine if an infrastructural investment in public laboratories and using large SNP panel analyses would reap substantial benefits to society, a cost-benefit analysis (CBA) was performed. This CBA applied the logic that an increase of DNA profile uploads to a DNA database due to a sheer increase in number of markers and a greater sensitivity of detection afforded with NGS and a higher hit/association rate due to large SNP/kinship resolution and genealogy will increase investigative leads, will be more effective for identifying recidivists which in turn reduces future victims of crime, and will bring greater safety and security to communities. Analyses were performed for worst case/best case scenarios as well as by simulation sampling the range spaces with multiple input values simultaneously to generate best estimate summary statistics. This study shows that the benefits, both tangible and intangible, over the lifetime of an advanced database system would be huge and can be projected to be for less than $1 billion per year (over a 10-year period) investment can reap on average > $4.8 billion in tangible and intangible cost-benefits per year. More importantly, on average > 50,000 individuals need not become victims if FIGG were employed, assuming investigative associations generated were acted upon. The benefit to society is immense making the laboratory investment a nominal cost. The benefits likely are underestimated herein. There is latitude in the estimated costs, and even if they were doubled or tripled, there would still be substantial benefits gained with a FIGG-based approach. While the data used in this CBA are US centric (primarily because data were readily accessible), the model is generalizable and could be used by other jurisdictions to perform relevant and representative CBAs.
Collapse
Affiliation(s)
- Bruce Budowle
- Department of Forensic Medicine, University of Helsinki, Helsinki, Finland.
- Radford University Forensic Science Institute, Radford University, Radford, VA, USA.
| | - Andrew Arnette
- Department of Business Information Technology, Virginia Tech, Blacksburg, VA, USA
| | - Antti Sajantila
- Department of Forensic Medicine, University of Helsinki, Helsinki, Finland
- Forensic Medicine Unit, Finnish Institute for Health and Welfare, Helsinki, Finland
| |
Collapse
|
15
|
Caggiano C, Boudaie A, Shemirani R, Mefford J, Petter E, Chiu A, Ercelen D, He R, Tward D, Paul KC, Chang TS, Pasaniuc B, Kenny EE, Shortt JA, Gignoux CR, Balliu B, Arboleda VA, Belbin G, Zaitlen N. Disease risk and healthcare utilization among ancestrally diverse groups in the Los Angeles region. Nat Med 2023; 29:1845-1856. [PMID: 37464048 PMCID: PMC11121511 DOI: 10.1038/s41591-023-02425-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2022] [Accepted: 05/30/2023] [Indexed: 07/20/2023]
Abstract
An individual's disease risk is affected by the populations that they belong to, due to shared genetics and environmental factors. The study of fine-scale populations in clinical care is important for identifying and reducing health disparities and for developing personalized interventions. To assess patterns of clinical diagnoses and healthcare utilization by fine-scale populations, we leveraged genetic data and electronic medical records from 35,968 patients as part of the UCLA ATLAS Community Health Initiative. We defined clusters of individuals using identity by descent, a form of genetic relatedness that utilizes shared genomic segments arising due to a common ancestor. In total, we identified 376 clusters, including clusters with patients of Afro-Caribbean, Puerto Rican, Lebanese Christian, Iranian Jewish and Gujarati ancestry. Our analysis uncovered 1,218 significant associations between disease diagnoses and clusters and 124 significant associations with specialty visits. We also examined the distribution of pathogenic alleles and found 189 significant alleles at elevated frequency in particular clusters, including many that are not regularly included in population screening efforts. Overall, this work progresses the understanding of health in understudied communities and can provide the foundation for further study into health inequities.
Collapse
Affiliation(s)
- Christa Caggiano
- Interdepartmental Program in Bioinformatics, University of California, Los Angeles, Los Angeles, CA, USA
- Department of Neurology, University of California, Los Angeles, Los Angeles, CA, USA
| | | | - Ruhollah Shemirani
- Institute for Genomic Health, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Joel Mefford
- Semel Institute for Neuroscience and Human Behavior, University of California, Los Angeles, Los Angeles, CA, USA
| | - Ella Petter
- Department of Computer Science, University of California, Los Angeles, Los Angeles, CA, USA
| | - Alec Chiu
- Interdepartmental Program in Bioinformatics, University of California, Los Angeles, Los Angeles, CA, USA
| | - Defne Ercelen
- Computational and Systems Biology Interdepartmental Program, University of California, Los Angeles, Los Angeles, CA, USA
| | - Rosemary He
- Department of Computer Science, University of California, Los Angeles, Los Angeles, CA, USA
- Department of Computational Medicine, University of California, Los Angeles, Los Angeles, CA, USA
| | - Daniel Tward
- Department of Neurology, University of California, Los Angeles, Los Angeles, CA, USA
- Department of Computational Medicine, University of California, Los Angeles, Los Angeles, CA, USA
| | - Kimberly C Paul
- Department of Neurology, University of California, Los Angeles, Los Angeles, CA, USA
| | - Timothy S Chang
- Department of Neurology, University of California, Los Angeles, Los Angeles, CA, USA
| | - Bogdan Pasaniuc
- Department of Computational Medicine, University of California, Los Angeles, Los Angeles, CA, USA
- Institute of Precision Health, University of California, Los Angeles, Los Angeles, CA, USA
- Department of Pathology and Laboratory Medicine, University of California, Los Angeles, Los Angeles, CA, USA
- Department of Human Genetics, University of California, Los Angeles, Los Angeles, CA, USA
| | - Eimear E Kenny
- Institute for Genomic Health, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Jonathan A Shortt
- Colorado Center for Personalized Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
- Division of Bioinformatics and Personalized Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Christopher R Gignoux
- Colorado Center for Personalized Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
- Division of Bioinformatics and Personalized Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Brunilda Balliu
- Department of Computational Medicine, University of California, Los Angeles, Los Angeles, CA, USA
| | - Valerie A Arboleda
- Department of Pathology and Laboratory Medicine, University of California, Los Angeles, Los Angeles, CA, USA
- Department of Human Genetics, University of California, Los Angeles, Los Angeles, CA, USA
| | - Gillian Belbin
- Institute for Genomic Health, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Noah Zaitlen
- Department of Neurology, University of California, Los Angeles, Los Angeles, CA, USA.
- Department of Computational Medicine, University of California, Los Angeles, Los Angeles, CA, USA.
- Department of Human Genetics, University of California, Los Angeles, Los Angeles, CA, USA.
| |
Collapse
|
16
|
Wei Y, Naseri A, Zhi D, Zhang S. RaPID-Query for fast identity by descent search and genealogical analysis. Bioinformatics 2023; 39:btad312. [PMID: 37166451 PMCID: PMC10244210 DOI: 10.1093/bioinformatics/btad312] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2022] [Revised: 04/26/2023] [Accepted: 05/09/2023] [Indexed: 05/12/2023] Open
Abstract
MOTIVATION Due to the rapid growth of the genetic database size, genealogical search, a process of inferring familial relatedness by identifying DNA matches, has become a viable approach to help individuals finding missing family members or law enforcement agencies locating suspects. A fast and accurate method is needed to search an out-of-database individual against millions of individuals. Most existing approaches only offer all-versus-all within panel match. Some prototype algorithms offer one-versus-all query from out-of-panel individual, but they do not tolerate errors. RESULTS A new method, random projection-based identity-by-descent (IBD) detection (RaPID) query, is introduced to make fast genealogical search possible. RaPID-Query identifies IBD segments between a query haplotype and a panel of haplotypes. By integrating matches over multiple PBWT indexes, RaPID-Query manages to locate IBD segments quickly with a given cutoff length while allowing mismatched sites. A single query against all UK biobank autosomal chromosomes was completed within 2.76 seconds on average, with the minimum length 7 cM and 700 markers. RaPID-Query achieved a 0.016 false negative rate and a 0.012 false positive rate simultaneously on a chromosome 20 sequencing panel having 86 265 sites. This is comparable to the state-of-the-art IBD detection method TPBWT(out-of-sample) and Hap-IBD. The high-quality IBD segments yielded by RaPID-Query were able to distinguish up to fourth degree of the familial relatedness for a given individual pair, and the area under the receiver operating characteristic curve values are at least 97.28%. AVAILABILITY AND IMPLEMENTATION The RaPID-Query program is available at https://github.com/ucfcbb/RaPID-Query.
Collapse
Affiliation(s)
- Yuan Wei
- Department of Computer Science, University of Central Florida, Orlando, FL 32816, United States
| | - Ardalan Naseri
- School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, TX 77030, United States
| | - Degui Zhi
- School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, TX 77030, United States
| | - Shaojie Zhang
- Department of Computer Science, University of Central Florida, Orlando, FL 32816, United States
| |
Collapse
|
17
|
David LT. Addressing the feasibility of people of African descent finding living African relatives using direct-to-consumer genetic testing. AMERICAN JOURNAL OF BIOLOGICAL ANTHROPOLOGY 2023; 181:163-165. [PMID: 36790590 PMCID: PMC10192001 DOI: 10.1002/ajpa.24705] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/17/2022] [Revised: 01/22/2023] [Accepted: 01/23/2023] [Indexed: 02/05/2023]
Abstract
People of African descent use direct-to-consumer genomics services such as 23andMe and AncestryDNA for various family histories and health reasons, including identifying and interacting with the previously unknown living African genetic relatives. In this commentary, I argue that it is reasonable to consider that cousin pairs consisting of an African person and a descendant of an African person enslaved in the Americans during the Transatlantic Slave Trade (i.e., a person of African descent) have genealogical ancestors recent enough to be detected using autosomal DNA testing where the pair has shared ancestors in the range of 20-6 generations ago from the present.
Collapse
Affiliation(s)
- LaKisha T David
- Department of Medical Ethics and Health Policy, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| |
Collapse
|
18
|
Rizig M, Bandres-Ciga S, Makarious MB, Ojo O, Crea PW, Abiodun O, Levine KS, Abubakar S, Achoru C, Vitale D, Adeniji O, Agabi O, Koretsky MJ, Agulanna U, Hall DA, Akinyemi R, Xie T, Ali M, Shamim EA, Ani-Osheku I, Padmanaban M, Arigbodi O, Standaert DG, Bello A, Dean M, Erameh C, Elsayed I, Farombi T, Okunoye O, Fawale M, Billingsley KJ, Imarhiagbe F, Jerez PA, Iwuozo E, Baker B, Komolafe M, Malik L, Nwani P, Daida K, Nwazor E, Miano-Burkhardt A, Nyandaiti Y, Fang ZH, Obiabo Y, Kluss JH, Odeniyi O, Hernandez D, Odiase F, Tayebi N, Ojini F, Sidranksy E, Onwuegbuzie G, D’Souza AM, Osaigbovo G, Berhe B, Osemwegie N, Reed X, Oshinaike O, Leonard H, Otubogun F, Alvarado CX, Oyakhire S, Ozomma S, Samuel S, Taiwo F, Wahab K, Zubair Y, Iwaki H, Kim JJ, Morris HR, Hardy J, Nalls M, Heilbron K, Norcliffe-Kaufmann L, Blauwendraat C, Houlden H, Singleton A, Okubadejo N. Genome-wide Association Identifies Novel Etiological Insights Associated with Parkinson's Disease in African and African Admixed Populations. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.05.05.23289529. [PMID: 37398408 PMCID: PMC10312852 DOI: 10.1101/2023.05.05.23289529] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/04/2023]
Abstract
Background Understanding the genetic mechanisms underlying diseases in ancestrally diverse populations is a critical step towards the realization of the global application of precision medicine. The African and African admixed populations enable mapping of complex traits given their greater levels of genetic diversity, extensive population substructure, and distinct linkage disequilibrium patterns. Methods Here we perform a comprehensive genome-wide assessment of Parkinson's disease (PD) in 197,918 individuals (1,488 cases; 196,430 controls) of African and African admixed ancestry, characterizing population-specific risk, differential haplotype structure and admixture, coding and structural genetic variation and polygenic risk profiling. Findings We identified a novel common risk factor for PD and age at onset at the GBA1 locus (risk, rs3115534-G; OR=1.58, 95% CI = 1.37 - 1.80, P=2.397E-14; age at onset, BETA =-2.004, SE =0.57, P = 0.0005), that was found to be rare in non-African/African admixed populations. Downstream short- and long-read whole genome sequencing analyses did not reveal any coding or structural variant underlying the GWAS signal. However, we identified that this signal mediates PD risk via expression quantitative trait locus (eQTL) mechanisms. While previously identified GBA1 associated disease risk variants are coding mutations, here we suggest a novel functional mechanism consistent with a trend in decreasing glucocerebrosidase activity levels. Given the high population frequency of the underlying signal and the phenotypic characteristics of the homozygous carriers, we hypothesize that this variant may not cause Gaucher disease. Additionally, the prevalence of Gaucher's disease in Africa is low. Interpretation The present study identifies a novel African-ancestry genetic risk factor in GBA1 as a major mechanistic basis of PD in the African and African admixed populations. This striking result contrasts to previous work in Northern European populations, both in terms of mechanism and attributable risk. This finding highlights the importance of understanding population-specific genetic risk in complex diseases, a particularly crucial point as the field moves toward precision medicine in PD clinical trials and while recognizing the need for equitable inclusion of ancestrally diverse groups in such trials. Given the distinctive genetics of these underrepresented populations, their inclusion represents a valuable step towards insights into novel genetic determinants underlying PD etiology. This opens new avenues towards RNA-based and other therapeutic strategies aimed at reducing lifetime risk. Research in Context Evidence Before this Study Our current understanding of Parkinson's disease (PD) is disproportionately based on studying populations of European ancestry, leading to a significant gap in our knowledge about the genetics, clinical characteristics, and pathophysiology in underrepresented populations. This is particularly notable in individuals of African and African admixed ancestries. Over the last two decades, we have witnessed a revolution in the research area of complex genetic diseases. In the PD field, large-scale genome-wide association studies in the European, Asian, and Latin American populations have identified multiple risk loci associated with disease. These include 78 loci and 90 independent signals associated with PD risk in the European population, nine replicated loci and two novel population-specific signals in the Asian population, and a total of 11 novel loci recently nominated through multi-ancestry GWAS efforts.Nevertheless, the African and African admixed populations remain completely unexplored in the context of PD genetics. Added Value of this Study To address the lack of diversity in our research field, this study aimed to conduct the first genome-wide assessment of PD genetics in the African and African admixed populations. Here, we identified a genetic risk factor linked to PD etiology, dissected African-specific differences in risk and age at onset, characterized known genetic risk factors, and highlighted the utility of the African and African admixed risk haplotype substructure for future fine-mapping efforts. We identified a novel disease mechanism via expression changes consistent with decreased GBA1 activity levels. Future large scale single cell expression studies should investigate the neuronal populations in which expression differences are most prominent. This novel mechanism may hold promise for future efficient RNA-based therapeutic strategies such as antisense oligonucleotides or short interfering RNAs aimed at preventing and decreasing disease risk. We envisage that these data generated under the umbrella of the Global Parkinson's Genetics Program (GP2) will shed light on the molecular mechanisms involved in the disease process and might pave the way for future clinical trials and therapeutic interventions. This work represents a valuable resource in an underserved population, supporting pioneering research within GP2 and beyond. Deciphering causal and genetic risk factors in all these ancestries will help determine whether interventions, potential targets for disease modifying treatment, and prevention strategies that are being studied in the European populations are relevant to the African and African admixed populations. Implications of all the Available Evidence We nominate a novel signal impacting GBA1 as the major genetic risk factor for PD in the African and African admixed populations. The present study could inform future GBA1 clinical trials, improving patient stratification. In this regard, genetic testing can help to design trials likely to provide meaningful and actionable answers. It is our hope that these findings may ultimately have clinical utility for this underrepresented population.
Collapse
Affiliation(s)
- Mie Rizig
- Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology, London WC1N 3BG, UK
- UCL Movement Disorders Centre, University College London, London, WC1N 3BG, UK
| | - Sara Bandres-Ciga
- Center for Alzheimer’s and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD, USA, 20814
| | - Mary B Makarious
- UCL Movement Disorders Centre, University College London, London, WC1N 3BG, UK
- Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD, USA
| | - Oluwadamilola Ojo
- College of Medicine, University of Lagos, Idi Araba, Lagos State, Nigeria
| | - Peter Wild Crea
- Center for Alzheimer’s and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD, USA, 20814
- Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD, USA
| | | | - Kristin S Levine
- Center for Alzheimer’s and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD, USA, 20814
- Data Tecnica International, Washington, DC, USA
| | - Sani Abubakar
- Ahmadu Bello University, Zaria, Kaduna State, Nigeria
| | - Charles Achoru
- Jos University Teaching Hospital, Jos, Plateau State, Nigeria
| | - Dan Vitale
- Data Tecnica International, Washington, DC, USA
| | | | - Osigwe Agabi
- College of Medicine, University of Lagos, Idi Araba, Lagos State, Nigeria
| | - Mathew J Koretsky
- Center for Alzheimer’s and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD, USA, 20814
| | - Uchechi Agulanna
- Lagos University Teaching Hospital, Idi Araba, Lagos State, Nigeria
| | - Deborah A. Hall
- Department of Neurological Sciences, Rush University Medical Center, Chicago, IL, USA
| | - Rufus Akinyemi
- Neuroscience and Ageing Research Unit, Institute for Advanced Medical Research and Training, College of Medicine, University of Ibadan, Ibadan, Oyo State, Nigeria
| | - Tao Xie
- Department of Neurology, University of Chicago Medicine, Chicago, Illinois, USA
| | - Mohammed Ali
- Federal Teaching Hospital Gombe, Gombe State, Nigeria
| | - Ejaz A. Shamim
- Human Motor Control Section, National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, Maryland, USA
- Kaiser Permanente Mid-Atlantic States, Largo, Maryland, USA
- MidAtlantic Permanente Research Institute, Rockville, Maryland, USA
| | | | - Mahesh Padmanaban
- Department of Neurology, University of Chicago Medicine, Chicago, Illinois, USA
| | | | - David G Standaert
- Department of Neurology, University of Alabama at Birmingham, Birmingham, AL, USA
| | - Abiodun Bello
- University of Ilorin Teaching Hospital, Ilorin, Kwara State, Nigeria
| | - Marissa Dean
- Department of Neurology, University of Alabama at Birmingham, Birmingham, AL, USA
| | - Cyril Erameh
- Irrua Specialist Teaching Hospital, Irrua, Edo State, Nigeria
| | - Inas Elsayed
- Faculty of Pharmacy, University of Gezira, Wadmadani, 20, Sudan
| | | | - Olaitan Okunoye
- Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology, London WC1N 3BG, UK
| | - Michael Fawale
- Obafemi Awolowo University, Ile-Ife, Osun State, Nigeria
| | - Kimberley J Billingsley
- Center for Alzheimer’s and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD, USA, 20814
- Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD, USA
| | | | - Pilar Alvarez Jerez
- Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology, London WC1N 3BG, UK
- Center for Alzheimer’s and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD, USA, 20814
| | | | - Breeana Baker
- Center for Alzheimer’s and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD, USA, 20814
| | | | - Laksh Malik
- Center for Alzheimer’s and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD, USA, 20814
| | - Paul Nwani
- Nnamdi Azikiwe University Teaching Hospital, Nnewi, Anambra State, Nigeria
| | - Kensuke Daida
- Center for Alzheimer’s and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD, USA, 20814
- Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD, USA
| | - Ernest Nwazor
- Rivers State University Teaching Hospital, Port Harcourt, Rivers State, Nigeria
| | - Abigail Miano-Burkhardt
- Center for Alzheimer’s and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD, USA, 20814
- Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD, USA
| | - Yakub Nyandaiti
- University of Maiduguri Teaching Hospital, Maiduguri, Borno State, Nigeria
| | - Zih-Hua Fang
- German Center for Neurodegenerative Diseases (DZNE), Tuebingen, Germany
| | - Yahaya Obiabo
- Federal University of Health Sciences, Otukpo, Benue State, Nigeria
| | - Jillian H. Kluss
- Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD, USA
| | | | - Dena Hernandez
- Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD, USA
| | | | - Nahid Tayebi
- Medical Genetics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland, USA
| | - Francis Ojini
- College of Medicine, University of Lagos, Idi Araba, Lagos State, Nigeria
| | - Ellen Sidranksy
- Medical Genetics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland, USA
| | | | - Andrea M. D’Souza
- Medical Genetics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland, USA
| | | | - Bahafta Berhe
- Medical Genetics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland, USA
| | | | - Xylena Reed
- Center for Alzheimer’s and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD, USA, 20814
| | | | - Hampton Leonard
- Center for Alzheimer’s and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD, USA, 20814
- Data Tecnica International, Washington, DC, USA
| | | | - Chelsea X Alvarado
- Center for Alzheimer’s and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD, USA, 20814
- Data Tecnica International, Washington, DC, USA
| | | | - Simon Ozomma
- University of Calabar Teaching Hospital, Calabar, Cross River State, Nigeria
| | - Sarah Samuel
- University of Maiduguri Teaching Hospital, Maiduguri, Borno State, Nigeria
| | | | - Kolawole Wahab
- University of Ilorin Teaching Hospital, Ilorin, Kwara State, Nigeria
- University of Ilorin, Ilorin, Kwara State, Nigeria
| | - Yusuf Zubair
- National Hospital, Abuja, Federal Capital Territory, Nigeria
| | - Hirotaka Iwaki
- Center for Alzheimer’s and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD, USA, 20814
- Data Tecnica International, Washington, DC, USA
| | - Jonggeol Jeffrey Kim
- Center for Alzheimer’s and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD, USA, 20814
- Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD, USA
| | - Huw R Morris
- Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology, London WC1N 3BG, UK
- UCL Movement Disorders Centre, University College London, London, WC1N 3BG, UK
| | - John Hardy
- Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology, London WC1N 3BG, UK
| | - Mike Nalls
- Data Tecnica International, Washington, DC, USA
| | | | | | | | - Cornelis Blauwendraat
- Center for Alzheimer’s and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD, USA, 20814
- Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD, USA
| | - Henry Houlden
- Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology, London WC1N 3BG, UK
| | - Andrew Singleton
- Center for Alzheimer’s and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD, USA, 20814
- Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD, USA
| | - Njideka Okubadejo
- College of Medicine, University of Lagos, Idi Araba, Lagos State, Nigeria
| |
Collapse
|
19
|
Yagasaki K, Nishida N, Mabuchi A, Tokunaga K, Fujimoto A. Development of a novel microarray data analysis tool without normalization for genotyping degraded forensic DNA. Forensic Sci Int Genet 2023; 65:102885. [PMID: 37137205 DOI: 10.1016/j.fsigen.2023.102885] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2022] [Revised: 04/13/2023] [Accepted: 04/27/2023] [Indexed: 05/05/2023]
Abstract
Since the arrest of the Golden State Killer in the US in April 2018, forensic geneticists have been increasingly interested in the investigative genetic genealogy (IGG) method. While this method has already been in practical use as a powerful tool for criminal investigation, we have yet to know well the limitations and potential risks. In this current study, we performed an evaluation study focusing on degraded DNA using the Affymetrix Genome-Wide Human SNP Array 6.0 platform (Thermo Fisher Scientific). We revealed one of the potential problems that occur during SNP genotype determination using a microarray-based platform. Our analysis results indicated that the SNP profiles derived from degraded DNA contained many false heterozygous SNPs. In addition, it was confirmed that the total amount of probe signal intensity on microarray chips derived from degraded DNA decreased significantly. Because the conventional analysis algorithm performs normalization during genotype determination, we concluded that noise signals could be genotype-called. To address this issue, we proposed a novel microarray data analysis method without normalization (nMAP). Although the nMAP algorithm resulted in a low call rate, it substantially improved genotyping accuracy. Finally, we confirmed the usefulness of the nMAP algorithm for kinship inferences. These findings and the nMAP algorithm will make a contribution to the advance of the IGG method.
Collapse
Affiliation(s)
- Kayoko Yagasaki
- Department of Human Genetics, Graduate School of Medicine, The University of Tokyo, 7-3-1, Hongo, Bunkyo Ward, Tokyo 113-0033, Japan; Forensic Science Laboratory, Tokyo Metropolitan Police Department, 3-35-21, Shakujiidai, Nerima Ward, Tokyo 177-0045, Japan.
| | - Nao Nishida
- Department of Human Genetics, Graduate School of Medicine, The University of Tokyo, 7-3-1, Hongo, Bunkyo Ward, Tokyo 113-0033, Japan; Genome Medical Science Project, National Center for Global Health and Medicine, 1-7-1, Kohnodai, Ichikawa, Chiba, 272-8516, Japan
| | - Akihiko Mabuchi
- Department of Human Genetics, Graduate School of Medicine, The University of Tokyo, 7-3-1, Hongo, Bunkyo Ward, Tokyo 113-0033, Japan
| | - Katsushi Tokunaga
- Department of Human Genetics, Graduate School of Medicine, The University of Tokyo, 7-3-1, Hongo, Bunkyo Ward, Tokyo 113-0033, Japan; Genome Medical Science Project, National Center for Global Health and Medicine, 1-21-1, Toyama, Shinjuku word, Tokyo 162-8655, Japan
| | - Akihiro Fujimoto
- Department of Human Genetics, Graduate School of Medicine, The University of Tokyo, 7-3-1, Hongo, Bunkyo Ward, Tokyo 113-0033, Japan
| |
Collapse
|
20
|
Wagner JK, Yu JH, Fullwiley D, Moore C, Wilson JF, Bamshad MJ, Royal CD. Guidelines for genetic ancestry inference created through roundtable discussions. HGG ADVANCES 2023; 4:100178. [PMID: 36798092 PMCID: PMC9926022 DOI: 10.1016/j.xhgg.2023.100178] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2022] [Accepted: 01/03/2023] [Indexed: 01/15/2023] Open
Abstract
The use of genetic and genomic technology to infer ancestry is commonplace in a variety of contexts, particularly in biomedical research and for direct-to-consumer genetic testing. In 2013 and 2015, two roundtables engaged a diverse group of stakeholders toward the development of guidelines for inferring genetic ancestry in academia and industry. This report shares the stakeholder groups' work and provides an analysis of, commentary on, and views from the groundbreaking and sustained dialogue. We describe the engagement processes and the stakeholder groups' resulting statements and proposed guidelines. The guidelines focus on five key areas: application of genetic ancestry inference, assumptions and confidence/laboratory and statistical methods, terminology and population identifiers, impact on individuals and groups, and communication or translation of genetic ancestry inferences. We delineate the terms and limitations of the guidelines and discuss their critical role in advancing the development and implementation of best practices for inferring genetic ancestry and reporting the results. These efforts should inform both governmental regulation and self-regulation.
Collapse
Affiliation(s)
- Jennifer K. Wagner
- School of Engineering Design and Innovation, Pennsylvania State University, University Park, PA 16802, USA
- Institute for Computational and Data Science, Pennsylvania State University, University Park, PA 16802, USA
- Department of Biomedical Engineering, Pennsylvania State University, University Park, PA 16802, USA
- Rock Ethics Institute, Pennsylvania State University, University Park, PA 16802, USA
- Penn State Law, University Park, PA 16802, USA
- Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, PA 16802, USA
| | - Joon-Ho Yu
- Department of Pediatrics and Institute for Public Health Genetics, University of Washington, Seattle, WA 98195, USA
- Treuman Katz Center for Pediatric Bioethics, Seattle Children’s Hospital and Research Institute, Seattle, WA 98101, USA
| | - Duana Fullwiley
- Department of Anthropology, Stanford University, Stanford, CA 94305, USA
| | | | - James F. Wilson
- Centre for Global Health Research, Usher Institute, University of Edinburgh, Edinburgh EH8 9AG, Scotland
| | - Michael J. Bamshad
- Department of Pediatrics and Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
- Division of Genetic Medicine, Seattle Children’s Hospital, Seattle, WA 98101, USA
| | - Charmaine D. Royal
- Departments of African and African American Studies, Biology, Global Health, and Family Medicine and Community Health, Duke University, Durham, NC 27708, USA
| | - Genetic Ancestry Inference Roundtable Participants
- School of Engineering Design and Innovation, Pennsylvania State University, University Park, PA 16802, USA
- Institute for Computational and Data Science, Pennsylvania State University, University Park, PA 16802, USA
- Department of Biomedical Engineering, Pennsylvania State University, University Park, PA 16802, USA
- Rock Ethics Institute, Pennsylvania State University, University Park, PA 16802, USA
- Penn State Law, University Park, PA 16802, USA
- Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, PA 16802, USA
- Department of Pediatrics and Institute for Public Health Genetics, University of Washington, Seattle, WA 98195, USA
- Treuman Katz Center for Pediatric Bioethics, Seattle Children’s Hospital and Research Institute, Seattle, WA 98101, USA
- Department of Anthropology, Stanford University, Stanford, CA 94305, USA
- The DNA Detectives, Dana Point, CA, USA
- Centre for Global Health Research, Usher Institute, University of Edinburgh, Edinburgh EH8 9AG, Scotland
- Department of Pediatrics and Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
- Division of Genetic Medicine, Seattle Children’s Hospital, Seattle, WA 98101, USA
- Departments of African and African American Studies, Biology, Global Health, and Family Medicine and Community Health, Duke University, Durham, NC 27708, USA
| |
Collapse
|
21
|
Medvedev A, Lebedev M, Ponomarev A, Kosaretskiy M, Osipenko D, Tischenko A, Kosaretskiy E, Wang H, Kolobkov D, Chamberlain-Evans V, Vakhitov R, Nikonorov P. GRAPE: genomic relatedness detection pipeline. F1000Res 2023; 11:589. [PMID: 37224332 PMCID: PMC10182380 DOI: 10.12688/f1000research.111658.2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 03/22/2023] [Indexed: 05/26/2023] Open
Abstract
Classifying the degree of relatedness between pairs of individuals has both scientific and commercial applications. As an example, genome-wide association studies (GWAS) may suffer from high rates of false positive results due to unrecognized population structure. This problem becomes especially relevant with recent increases in large-cohort studies. Accurate relationship classification is also required for genetic linkage analysis to identify disease-associated loci. Additionally, DNA relatives matching service is one of the leading drivers for the direct-to-consumer genetic testing market. Despite the availability of scientific and research information on the methods for determining kinship and the accessibility of relevant tools, the assembly of the pipeline, which stably operates on a real-world genotypic data, requires significant research and development resources. Currently, there is no open source end-to-end solution for relatedness detection in genomic data, that is fast, reliable and accurate for both close and distant degrees of kinship, combines all the necessary processing steps to work on a real data, and is ready for production integration. To address this, we developed GRAPE: Genomic RelAtedness detection PipelinE. It combines data preprocessing, identity-by-descent (IBD) segments detection, and accurate relationship estimation. The project uses software development best practices, as well as Global Alliance for Genomics and Health (GA4GH) standards and tools. Pipeline efficiency is demonstrated on both simulated and real-world datasets. GRAPE is available from: https://github.com/genxnetwork/grape.
Collapse
Affiliation(s)
- Alexander Medvedev
- Skolkovo Institute of Science and Technology, Moscow, Russian Federation
- GENXT, Hinxton, UK
| | | | | | | | | | | | | | - Hui Wang
- GENXT, Hinxton, UK
- Huazhong Agricultural University, Wuhan, China
| | | | | | | | | |
Collapse
|
22
|
Genome-wide data from medieval German Jews show that the Ashkenazi founder event pre-dated the 14 th century. Cell 2022; 185:4703-4716.e16. [PMID: 36455558 PMCID: PMC9793425 DOI: 10.1016/j.cell.2022.11.002] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2022] [Revised: 08/26/2022] [Accepted: 11/01/2022] [Indexed: 12/05/2022]
Abstract
We report genome-wide data from 33 Ashkenazi Jews (AJ), dated to the 14th century, obtained following a salvage excavation at the medieval Jewish cemetery of Erfurt, Germany. The Erfurt individuals are genetically similar to modern AJ, but they show more variability in Eastern European-related ancestry than modern AJ. A third of the Erfurt individuals carried a mitochondrial lineage common in modern AJ and eight carried pathogenic variants known to affect AJ today. These observations, together with high levels of runs of homozygosity, suggest that the Erfurt community had already experienced the major reduction in size that affected modern AJ. The Erfurt bottleneck was more severe, implying substructure in medieval AJ. Overall, our results suggest that the AJ founder event and the acquisition of the main sources of ancestry pre-dated the 14th century and highlight late medieval genetic heterogeneity no longer present in modern AJ.
Collapse
|
23
|
Tang K, Naseri A, Wei Y, Zhang S, Zhi D. Open-source benchmarking of IBD segment detection methods for biobank-scale cohorts. Gigascience 2022; 11:giac111. [PMID: 36472573 PMCID: PMC9724555 DOI: 10.1093/gigascience/giac111] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2022] [Revised: 08/04/2022] [Accepted: 09/28/2022] [Indexed: 12/12/2022] Open
Abstract
In the recent biobank era of genetics, the problem of identical-by-descent (IBD) segment detection received renewed interest, as IBD segments in large cohorts offer unprecedented opportunities in the study of population and genealogical history, as well as genetic association of long haplotypes. While a new generation of efficient methods for IBD segment detection becomes available, direct comparison of these methods is difficult: existing benchmarks were often evaluated in different datasets, with some not openly accessible; methods benchmarked were run under suboptimal parameters; and benchmark performance metrics were not defined consistently. Here, we developed a comprehensive and completely open-source evaluation of the power, accuracy, and resource consumption of these IBD segment detection methods using realistic population genetic simulations with various settings. Our results pave the road for fair evaluation of IBD segment detection methods and provide an practical guide for users.
Collapse
Affiliation(s)
- Kecong Tang
- Department of Computer Science, University of Central Florida, Orlando, FL 32816, USA
| | - Ardalan Naseri
- School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX 77030, USA
| | - Yuan Wei
- Department of Computer Science, University of Central Florida, Orlando, FL 32816, USA
| | - Shaojie Zhang
- Department of Computer Science, University of Central Florida, Orlando, FL 32816, USA
| | - Degui Zhi
- School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX 77030, USA
| |
Collapse
|
24
|
Noto K, Ruiz L. Accurate genome-wide phasing from IBD data. BMC Bioinformatics 2022; 23:502. [PMID: 36424541 PMCID: PMC9686111 DOI: 10.1186/s12859-022-05066-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2022] [Accepted: 11/17/2022] [Indexed: 11/25/2022] Open
Abstract
As genotype databases increase in size, so too do the number of detectable segments of identity by descent (IBD): segments of the genome where two individuals share an identical copy of one of their two parental haplotypes, due to shared ancestry. We show that given a large enough genotype database, these segments of IBD collectively overlap entire chromosomes, including instances of IBD that span multiple chromosomes, and can be used to accurately separate the alleles inherited from each parent across the entire genome. The resulting phase is not an improvement over state-of-the-art local phasing methods, but provides accurate long-range phasing that indicates which of two haplotypes in different regions of the genome, including different chromosomes, was inherited from the same parent. We are able to separate the DNA inherited from each parent completely, across the entire genome, with 98% median accuracy in a test set of 30,000 individuals. We estimate the IBD data requirements for accurate genome-wide phasing, and we propose a method for estimating confidence in the resulting phase. We show that our methods do not require the genotypes of close family, and that they are robust to genotype errors and missing data. In fact, our method can impute missing data accurately and correct genotype errors.
Collapse
|
25
|
Practical forensic use of kinship determination using high-density SNP profiling based on a microarray platform, focusing on low-quantity DNA. Forensic Sci Int Genet 2022; 61:102752. [DOI: 10.1016/j.fsigen.2022.102752] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2022] [Revised: 07/16/2022] [Accepted: 07/27/2022] [Indexed: 11/20/2022]
|
26
|
Smith J, Qiao Y, Williams AL. Evaluating the utility of identity-by-descent segment numbers for relatedness inference via information theory and classification. G3 (BETHESDA, MD.) 2022; 12:jkac072. [PMID: 35348675 PMCID: PMC9157175 DOI: 10.1093/g3journal/jkac072] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/21/2021] [Accepted: 03/07/2022] [Indexed: 11/29/2022]
Abstract
Despite decades of methods development for classifying relatives in genetic studies, pairwise relatedness methods' recalls are above 90% only for first through third-degree relatives. The top-performing approaches, which leverage identity-by-descent segments, often use only kinship coefficients, while others, including estimation of recent shared ancestry (ERSA), use the number of segments relatives share. To quantify the potential for using segment numbers in relatedness inference, we leveraged information theory measures to analyze exact (i.e. produced by a simulator) identity-by-descent segments from simulated relatives. Over a range of settings, we found that the mutual information between the relatives' degree of relatedness and a tuple of their kinship coefficient and segment number is on average 4.6% larger than between the degree and the kinship coefficient alone. We further evaluated identity-by-descent segment number utility by building a Bayes classifier to predict first through sixth-degree relationships using different feature sets. When trained and tested with exact segments, the inclusion of segment numbers improves the recall by between 0.28% and 3% for second through sixth-degree relatives. However, the recalls improve by less than 1.8% per degree when using inferred segments, suggesting limitations due to identity-by-descent detection accuracy. Last, we compared our Bayes classifier that includes segment numbers with both ERSA and IBIS and found comparable recalls, with the Bayes classifier and ERSA slightly outperforming each other across different degrees. Overall, this study shows that identity-by-descent segment numbers can improve relatedness inference, but errors from current SNP array-based detection methods yield dampened signals in practice.
Collapse
Affiliation(s)
- Jesse Smith
- School of Applied and Engineering Physics, Cornell University, Ithaca, NY 14853, USA
| | - Ying Qiao
- Department of Computational Biology, Cornell University, Ithaca, NY 14853, USA
| | - Amy L Williams
- Department of Computational Biology, Cornell University, Ithaca, NY 14853, USA
| |
Collapse
|
27
|
Dalby M, Vitezic M, Plath N, Hammer-Helmich L, Jiang Y, Tian C, Dhamija D, Wilson CH, Hinds D, Sullivan PF, Buckholtz JW, Smoller JW. Characterizing mood disorders in the AFFECT study: a large, longitudinal, and phenotypically rich genetic cohort in the US. Transl Psychiatry 2022; 12:121. [PMID: 35338122 PMCID: PMC8956583 DOI: 10.1038/s41398-022-01877-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/06/2021] [Revised: 02/23/2022] [Accepted: 02/24/2022] [Indexed: 11/09/2022] Open
Abstract
There has recently been marked progress in identifying genetic risk factors for major depression (MD) and bipolar disorder (BD); however, few systematic efforts have been made to elucidate heterogeneity that exists within and across these diagnostic taxa. The Affective disorders, Environment, and Cognitive Trait (AFFECT) study presents an opportunity to identify and associate the structure of cognition and symptom-level domains across the mood disorder spectrum in a prospective study from a diverse US population.Participants were recruited from the 23andMe, Inc research participant database and through social media; self-reported diagnosis of MD or BD by a medical professional and medication status data were used to enrich for mood-disorder cases. Remote assessments were used to acquire an extensive range of phenotypes, including mood state, transdiagnostic symptom severity, task-based measures of cognition, environmental exposures, personality traits. In this paper we describe the study design, and the demographic and clinical characteristics of the cohort. In addition we report genetic ancestry, SNP heritability, and genetic correlations with other large cohorts of mood disorders.A total of 48,467 participants were enrolled: 14,768 with MD, 9864 with BD, and 23,835 controls. Upon enrollment, 47% of participants with MD and 27% with BD indicated being in an active mood episode. Cases reported early ages of onset (mean = 13.2 and 14.3 years for MD and BD, respectively), and high levels of recurrence (78.6% and 84.9% with >5 episodes), psychotherapy, and psychotropic medication use. SNP heritability on the liability scale for the ascertained MD participants (0.19-0.21) was consistent with the high level of disease severity in this cohort, while BD heritability estimates (0.16-0.22) were comparable to reports in other large scale genomic studies of mood disorders. Genetic correlations between the AFFECT cohort and other large-scale cohorts were high for MD but not for BD. By incorporating transdiagnostic symptom assessments, repeated measures, and genomic data, the AFFECT study represents a unique resource for dissecting the structure of mood disorders across multiple levels of analysis. In addition, the fully remote nature of the study provides valuable insights for future virtual and decentralized clinical trials within mood disorders.
Collapse
Affiliation(s)
- Maria Dalby
- H. Lundbeck A/S, Valby, Denmark.
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutete, Stockholm, Sweden.
| | | | | | | | | | | | | | | | | | - Patrick F Sullivan
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutete, Stockholm, Sweden
- Department of Genetics and Psychiatry, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Joshua W Buckholtz
- Department of Psychology, Harvard University, Cambridge, MA, USA
- Department of Psychiatry, Massachusetts General Hospital, Boston, MA, USA
| | - Jordan W Smoller
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Psychiatric and Neurodevelopmental Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
| |
Collapse
|
28
|
Rogalla-Ładniak U. The overview of forensic genetic genealogy. ARCHIVES OF FORENSIC MEDICINE AND CRIMINOLOGY 2022; 72:211-222. [PMID: 37405841 DOI: 10.4467/16891716amsik.22.023.17623] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/06/2023] Open
Abstract
Forensic genetic genealogy (FGG) benefits largely from popularity of genealogical research within (mostly) American society and the advent of new sequencing techniques that allow typing of challenging forensic samples. It is considered a true breakthrough for both active and especially cold cases where all other resources and methods have failed during investigation. Despite media coverage generally highlighting its powers, the method itself is considered very laborious and the investigation may easily got suspended at every stage due to many factors including no hits in the database or breaks in traceable lineages within the family tree. This review summarizes the scope of FGG use, mentions most concerns and misconceptions associated with the technique and points to the plausible solutions already suggested. It also brings together current guidelines and regulations intended to be followed by law enforcement authorities wishing to utilize genetic genealogy research.
Collapse
Affiliation(s)
- Urszula Rogalla-Ładniak
- Faculty of Medicine, Ludwik Rydygier Collegium Medicum in Bydgoszcz; Nicolaus Copernicus University in Toruń, Poland
| |
Collapse
|
29
|
|
30
|
Laurent FX, Fischer A, Oldt RF, Kanthaswamy S, Buckleton JS, Hitchin S. Streamlining the decision-making process for international DNA kinship matching using Worldwide allele frequencies and tailored cutoff log 10LR thresholds. Forensic Sci Int Genet 2021; 57:102634. [PMID: 34871915 DOI: 10.1016/j.fsigen.2021.102634] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2021] [Revised: 10/13/2021] [Accepted: 11/15/2021] [Indexed: 11/30/2022]
Abstract
The identification of human remains belonging to missing persons is one of the main challenges for forensic genetics. Although other means of identification can be applied to missing person investigations, DNA is often extremely valuable to further support or refute potential associations. When reference DNA samples cannot be collected from personal items belonging to a missing person, a direct DNA identification cannot be carried out. However, identifications can be made indirectly using DNA from the missing person's relatives. The ranking of likelihood ratio (LR) values, which measure the fit of a missing person for any given pedigree, is often the first step in selecting candidates in a DNA database. Although implementing DNA kinship matching in a national environment is feasible, many challenges need to be resolved before applying this method to an international configuration. In this study, we present an innovative and intuitive method to perform international DNA kinship matching and facilitate the comparison of DNA profiles when the ancestry is unknown or unsure and/or when different marker sets are used. This straightforward method, which is based on calculations performed with the DNA matching software BONAPARTE, Worldwide allele frequencies and tailored cutoff log10LR thresholds, allows for the classification of potential candidates according to the strength of the DNA evidence and the predicted proportion of adventitious matches. This is a powerful method for streamlining the decision-making process in missing person investigations and DVI processes, especially when there are low numbers of overlapping typed STRs. Intuitive interpretation tables and a decision tree will help strengthen international data comparison for the identification of reported missing individuals discovered outside their national borders.
Collapse
Affiliation(s)
- François-Xavier Laurent
- International Criminal Police Organization - INTERPOL, DNA Unit, 200 quai Charles de Gaulle, 69006 Lyon, France.
| | - Andrea Fischer
- International Criminal Police Organization - INTERPOL, DNA Unit, 200 quai Charles de Gaulle, 69006 Lyon, France; Landeskriminalamt Baden-Württemberg, Taubenheimstr. 85, 70372 Stuttgart, Germany
| | - Robert F Oldt
- School of Mathematical and Natural Sciences, Arizona State University, Phoenix, AZ 85004, USA
| | - Sree Kanthaswamy
- School of Mathematical and Natural Sciences, Arizona State University, Phoenix, AZ 85004, USA
| | - John S Buckleton
- University of Auckland, Department of Statistics, Private Bag, 92019 Auckland, New Zealand
| | - Susan Hitchin
- International Criminal Police Organization - INTERPOL, DNA Unit, 200 quai Charles de Gaulle, 69006 Lyon, France.
| |
Collapse
|
31
|
de Vries JH, Kling D, Vidaki A, Arp P, Kalamara V, Verbiest MMPJ, Piniewska-Róg D, Parsons TJ, Uitterlinden AG, Kayser M. Impact of SNP microarray analysis of compromised DNA on kinship classification success in the context of investigative genetic genealogy. Forensic Sci Int Genet 2021; 56:102625. [PMID: 34753062 DOI: 10.1016/j.fsigen.2021.102625] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2021] [Revised: 10/25/2021] [Accepted: 10/27/2021] [Indexed: 11/04/2022]
Abstract
Single nucleotide polymorphism (SNP) data generated with microarray technologies have been used to solve murder cases via investigative leads obtained from identifying relatives of the unknown perpetrator included in accessible genomic databases, an approach referred to as investigative genetic genealogy (IGG). However, SNP microarrays were developed for relatively high input DNA quantity and quality, while DNA typically obtainable from crime scene stains is of low DNA quantity and quality, and SNP microarray data obtained from compromised DNA are largely missing. By applying the Illumina Global Screening Array (GSA) to 264 DNA samples with systematically altered quantity and quality, we empirically tested the impact of SNP microarray analysis of compromised DNA on kinship classification success, as relevant in IGG. Reference data from manufacturer-recommended input DNA quality and quantity were used to estimate genotype accuracy in the compromised DNA samples and for simulating data of different degree relatives. Although stepwise decrease of input DNA amount from 200 ng to 6.25 pg led to decreased SNP call rates and increased genotyping errors, kinship classification success did not decrease down to 250 pg for siblings and 1st cousins, 1 ng for 2nd cousins, while at 25 pg and below kinship classification success was zero. Stepwise decrease of input DNA quality via increased DNA fragmentation resulted in the decrease of genotyping accuracy as well as kinship classification success, which went down to zero at the average DNA fragment size of 150 base pairs. Combining decreased DNA quantity and quality in mock casework and skeletal samples further highlighted possibilities and limitations. Overall, GSA analysis achieved maximal kinship classification success from 800 to 200 times lower input DNA quantities than manufacturer-recommended, although DNA quality plays a key role too, while compromised DNA produced false negative kinship classifications rather than false positive ones.
Collapse
Affiliation(s)
- Jard H de Vries
- Erasmus MC, University Medical Center Rotterdam, Department of Internal Medicine, Dr. Molewaterplein 40, 3015 GD Rotterdam, the Netherlands
| | - Daniel Kling
- Department of Forensic Genetics and Toxicology, National Board of Forensic Medicine, Artillerigatan 12, 587 58 Linköping, Sweden
| | - Athina Vidaki
- Erasmus MC, University Medical Center Rotterdam, Department of Genetic Identification, Dr. Molewaterplein 40, 3015 GD Rotterdam, the Netherlands
| | - Pascal Arp
- Erasmus MC, University Medical Center Rotterdam, Department of Internal Medicine, Dr. Molewaterplein 40, 3015 GD Rotterdam, the Netherlands
| | - Vivian Kalamara
- Erasmus MC, University Medical Center Rotterdam, Department of Genetic Identification, Dr. Molewaterplein 40, 3015 GD Rotterdam, the Netherlands
| | - Michael M P J Verbiest
- Erasmus MC, University Medical Center Rotterdam, Department of Internal Medicine, Dr. Molewaterplein 40, 3015 GD Rotterdam, the Netherlands
| | - Danuta Piniewska-Róg
- Malopolska Centre of Biotechnology, Jagiellonian University, 30-387 Krakow, Poland; Department of Forensic Medicine, Jagiellonian University Medical College, 31-531 Krakow, Poland
| | - Thomas J Parsons
- International Commission on Missing Persons, Koninginnegracht 12a, 2514 AA Den Haag, the Netherlands
| | - André G Uitterlinden
- Erasmus MC, University Medical Center Rotterdam, Department of Internal Medicine, Dr. Molewaterplein 40, 3015 GD Rotterdam, the Netherlands; Erasmus MC, University Medical Center Rotterdam, Department of Epidemiology, Dr. Molewaterplein 40, 3015 GD Rotterdam, the Netherlands
| | - Manfred Kayser
- Erasmus MC, University Medical Center Rotterdam, Department of Genetic Identification, Dr. Molewaterplein 40, 3015 GD Rotterdam, the Netherlands.
| |
Collapse
|
32
|
Jewett EM, McManus KF, Freyman WA, Auton A, Auton A. Bonsai: An efficient method for inferring large human pedigrees from genotype data. Am J Hum Genet 2021; 108:2052-2070. [PMID: 34739834 PMCID: PMC8595950 DOI: 10.1016/j.ajhg.2021.09.013] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2021] [Accepted: 09/24/2021] [Indexed: 11/29/2022] Open
Abstract
Pedigree inference from genotype data is a challenging problem, particularly when pedigrees are sparsely sampled and individuals may be distantly related to their closest genotyped relatives. We present a method that infers small pedigrees of close relatives and then assembles them into larger pedigrees. To assemble large pedigrees, we introduce several formulas and tools including a likelihood for the degree separating two small pedigrees, a generalization of the fast DRUID point estimate of the degree separating two pedigrees, a method for detecting individuals who share background identity-by-descent (IBD) that does not reflect recent common ancestry, and a method for identifying the ancestral branches through which distant relatives are connected. Our method also takes several approaches that help to improve the accuracy and efficiency of pedigree inference. In particular, we incorporate age information directly into the likelihood rather than using ages only for consistency checks and we employ a heuristic branch-and-bound-like approach to more efficiently explore the space of possible pedigrees. Together, these approaches make it possible to construct large pedigrees that are challenging or intractable for current inference methods.
Collapse
|
33
|
Sticca EL, Belbin GM, Gignoux CR. Current Developments in Detection of Identity-by-Descent Methods and Applications. Front Genet 2021; 12:722602. [PMID: 34567074 PMCID: PMC8461052 DOI: 10.3389/fgene.2021.722602] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2021] [Accepted: 08/24/2021] [Indexed: 01/23/2023] Open
Abstract
Identity-by-descent (IBD), the detection of shared segments inherited from a common ancestor, is a fundamental concept in genomics with broad applications in the characterization and analysis of genomes. While historically the concept of IBD was extensively utilized through linkage analyses and in studies of founder populations, applications of IBD-based methods subsided during the genome-wide association study era. This was primarily due to the computational expense of IBD detection, which becomes increasingly relevant as the field moves toward the analysis of biobank-scale datasets that encompass individuals from highly diverse backgrounds. To address these computational barriers, the past several years have seen new methodological advances enabling IBD detection for datasets in the hundreds of thousands to millions of individuals, enabling novel analyses at an unprecedented scale. Here, we describe the latest innovations in IBD detection and describe opportunities for the application of IBD-based methods across a broad range of questions in the field of genomics.
Collapse
Affiliation(s)
- Evan L Sticca
- Human Medical Genetics and Genomics Program and Colorado Center for Personalized Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO, United States
| | - Gillian M Belbin
- Institute for Genomic Health, Department of Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, United States
| | - Christopher R Gignoux
- Human Medical Genetics and Genomics Program and Colorado Center for Personalized Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO, United States
| |
Collapse
|
34
|
Luo Y, Li X, Wang X, Gazal S, Mercader JM, Neale BM, Florez JC, Auton A, Price AL, Finucane HK, Raychaudhuri S. Estimating heritability and its enrichment in tissue-specific gene sets in admixed populations. Hum Mol Genet 2021; 30:1521-1534. [PMID: 33987664 PMCID: PMC8330913 DOI: 10.1093/hmg/ddab130] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2021] [Revised: 04/29/2021] [Accepted: 04/29/2021] [Indexed: 01/07/2023] Open
Abstract
It is important to study the genetics of complex traits in diverse populations. Here, we introduce covariate-adjusted linkage disequilibrium (LD) score regression (cov-LDSC), a method to estimate SNP-heritability (${\boldsymbol{h}}_{\boldsymbol{g}}^{\mathbf{2}})$ and its enrichment in homogenous and admixed populations with summary statistics and in-sample LD estimates. In-sample LD can be estimated from a subset of the genome-wide association studies samples, allowing our method to be applied efficiently to very large cohorts. In simulations, we show that unadjusted LDSC underestimates ${\boldsymbol{h}}_{\boldsymbol{g}}^{\mathbf{2}}$ by 10-60% in admixed populations; in contrast, cov-LDSC is robustly accurate. We apply cov-LDSC to genotyping data from 8124 individuals, mostly of admixed ancestry, from the Slim Initiative in Genomic Medicine for the Americas study, and to approximately 161 000 Latino-ancestry individuals, 47 000 African American-ancestry individuals and 135 000 European-ancestry individuals, as classified by 23andMe. We estimate ${\boldsymbol{h}}_{\boldsymbol{g}}^{\mathbf{2}}$ and detect heritability enrichment in three quantitative and five dichotomous phenotypes, making this, to our knowledge, the most comprehensive heritability-based analysis of admixed individuals to date. Most traits have high concordance of ${\boldsymbol{h}}_{\boldsymbol{g}}^{\mathbf{2}}$ and consistent tissue-specific heritability enrichment among different populations. However, for age at menarche, we observe population-specific heritability estimates of ${\boldsymbol{h}}_{\boldsymbol{g}}^{\mathbf{2}}$. We observe consistent patterns of tissue-specific heritability enrichment across populations; for example, in the limbic system for BMI, the per-standardized-annotation effect size $ \tau $* is 0.16 ± 0.04, 0.28 ± 0.11 and 0.18 ± 0.03 in the Latino-, African American- and European-ancestry populations, respectively. Our approach is a powerful way to analyze genetic data for complex traits from admixed populations.
Collapse
Affiliation(s)
- Yang Luo
- Center for Data Sciences, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, USA
- Division of Rheumatology, Immunology, and Immunity, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, USA
- Division of Genetics, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Xinyi Li
- Center for Data Sciences, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, USA
- Division of Rheumatology, Immunology, and Immunity, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, USA
- Division of Genetics, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Xin Wang
- 23andMe, Inc., Mountain View, California, USA
| | - Steven Gazal
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Josep Maria Mercader
- Division of Genetics, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, USA
- Diabetes Unit and Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
- Department of Medicine, Harvard Medical School, Boston, Massachusetts, USA
| | | | | | - Benjamin M Neale
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
| | - Jose C Florez
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Diabetes Unit and Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
- Department of Medicine, Harvard Medical School, Boston, Massachusetts, USA
| | - Adam Auton
- 23andMe, Inc., Mountain View, California, USA
| | - Alkes L Price
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Hilary K Finucane
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Medicine, Harvard Medical School, Boston, Massachusetts, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
| | - Soumya Raychaudhuri
- Center for Data Sciences, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, USA
- Division of Rheumatology, Immunology, and Immunity, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, USA
- Division of Genetics, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Arthritis Research UK Centre for Genetics and Genomics, Manchester Academic Health Science Centre, University of Manchester, Manchester, UK
| |
Collapse
|
35
|
Rapid detection of identity-by-descent tracts for mega-scale datasets. Nat Commun 2021; 12:3546. [PMID: 34112768 PMCID: PMC8192555 DOI: 10.1038/s41467-021-22910-w] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2019] [Accepted: 04/01/2021] [Indexed: 01/08/2023] Open
Abstract
The ability to identify segments of genomes identical-by-descent (IBD) is a part of standard workflows in both statistical and population genetics. However, traditional methods for finding local IBD across all pairs of individuals scale poorly leading to a lack of adoption in very large-scale datasets. Here, we present iLASH, an algorithm based on similarity detection techniques that shows equal or improved accuracy in simulations compared to current leading methods and speeds up analysis by several orders of magnitude on genomic datasets, making IBD estimation tractable for millions of individuals. We apply iLASH to the PAGE dataset of ~52,000 multi-ethnic participants, including several founder populations with elevated IBD sharing, identifying IBD segments in ~3 minutes per chromosome compared to over 6 days for a state-of-the-art algorithm. iLASH enables efficient analysis of very large-scale datasets, as we demonstrate by computing IBD across the UK Biobank (~500,000 individuals), detecting 12.9 billion pairwise connections.
Collapse
|
36
|
Tillmar A, Fagerholm SA, Staaf J, Sjölund P, Ansell R. Getting the conclusive lead with investigative genetic genealogy - A successful case study of a 16 year old double murder in Sweden. Forensic Sci Int Genet 2021; 53:102525. [PMID: 33991867 DOI: 10.1016/j.fsigen.2021.102525] [Citation(s) in RCA: 27] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2021] [Revised: 04/26/2021] [Accepted: 04/28/2021] [Indexed: 12/13/2022]
Abstract
On the morning of October 19, 2004, an eight-year-old boy and a 56-year-old woman were stabbed to death on an open street in the city of Linköping, Sweden. The perpetrator left his DNA at the crime scene, and after 15 years of various investigation efforts, including more than 9000 interrogations and mass DNA screening of more than 6000 men, there were still no clues about the identity of the unknown murderer. The successful application of investigative genetic genealogy (IGG) in the US raised the interest for this tool within the Swedish Police Authority. After legal consultations it was decided that IGG could be applied in this double murder case as a pilot case study. From extensive DNA analysis, including whole-genome sequencing and genotype imputation, DNA data sets were established and searched within both GEDmatch and FamilyTree DNA genealogy databases. A number of fairly distant relatives were found from which family trees were created. The genealogy work resulted in two candidates, two brothers, one of whom matched the crime scene samples by routine STR profiling. The suspect confessed the murders at the initial police hearing and was later convicted of the murders. In this paper we describe the successful application of an emerging technology. We disclose details of the DNA analyses which, due to the poor quality and low quantity of the DNA, required reiterative sequencing and genotype imputation efforts. The successful application of IGG in this double murder case exemplifies its applicability not only in the US but also in Europe. The pressure is now high on the involved authorities to establish IGG as a tool for cold case criminal investigations and for missing person identifications. There is, however, a continuous need to accommodate legal, social and ethical aspects as well.
Collapse
Affiliation(s)
- Andreas Tillmar
- Department of Forensic Genetics and Forensic Toxicology, National Board of Forensic Medicine, Linköping, Sweden; Department of Biomedical and Clinical Sciences, Faculty of Medicine and Health Sciences, Linköping University, Linköping, Sweden.
| | | | - Jan Staaf
- Polisregion Öst, Swedish Police Authority, Linköping, Sweden
| | | | - Ricky Ansell
- National Forensic Centre, Swedish Police Authority, Linköping, Sweden; Department of Physics, Chemistry and Biology, Linköping University, Linköping, Sweden.
| |
Collapse
|
37
|
Freyman WA, McManus KF, Shringarpure SS, Jewett EM, Bryc K, Auton A. Fast and Robust Identity-by-Descent Inference with the Templated Positional Burrows-Wheeler Transform. Mol Biol Evol 2021; 38:2131-2151. [PMID: 33355662 PMCID: PMC8097300 DOI: 10.1093/molbev/msaa328] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
Estimating the genomic location and length of identical-by-descent (IBD) segments among individuals is a crucial step in many genetic analyses. However, the exponential growth in the size of biobank and direct-to-consumer genetic data sets makes accurate IBD inference a significant computational challenge. Here we present the templated positional Burrows-Wheeler transform (TPBWT) to make fast IBD estimates robust to genotype and phasing errors. Using haplotype data simulated over pedigrees with realistic genotyping and phasing errors, we show that the TPBWT outperforms other state-of-the-art IBD inference algorithms in terms of speed and accuracy. For each phase-aware method, we explore the false positive and false negative rates of inferring IBD by segment length and characterize the types of error commonly found. Our results highlight the fragility of most phased IBD inference methods; the accuracy of IBD estimates can be highly sensitive to the quality of haplotype phasing. Additionally, we compare the performance of the TPBWT against a widely used phase-free IBD inference approach that is robust to phasing errors. We introduce both in-sample and out-of-sample TPBWT-based IBD inference algorithms and demonstrate their computational efficiency on massive-scale data sets with millions of samples. Furthermore, we describe the binary file format for TPBWT-compressed haplotypes that results in fast and efficient out-of-sample IBD computes against very large cohort panels. Finally, we demonstrate the utility of the TPBWT in a brief empirical analysis, exploring geographic patterns of haplotype sharing within Mexico. Hierarchical clustering of IBD shared across regions within Mexico reveals geographically structured haplotype sharing and a strong signal of isolation by distance. Our software implementation of the TPBWT is freely available for noncommercial use in the code repository (https://github.com/23andMe/phasedibd, last accessed January 11, 2021).
Collapse
|
38
|
Pirastu N, Cordioli M, Nandakumar P, Mignogna G, Abdellaoui A, Hollis B, Kanai M, Rajagopal VM, Parolo PDB, Baya N, Carey CE, Karjalainen J, Als TD, Van der Zee MD, Day FR, Ong KK, Morisaki T, de Geus E, Bellocco R, Okada Y, Børglum AD, Joshi P, Auton A, Hinds D, Neale BM, Walters RK, Nivard MG, Perry JRB, Ganna A. Genetic analyses identify widespread sex-differential participation bias. Nat Genet 2021; 53:663-671. [PMID: 33888908 DOI: 10.1101/2020.03.22.001453v1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2020] [Accepted: 03/16/2021] [Indexed: 05/25/2023]
Abstract
Genetic association results are often interpreted with the assumption that study participation does not affect downstream analyses. Understanding the genetic basis of participation bias is challenging since it requires the genotypes of unseen individuals. Here we demonstrate that it is possible to estimate comparative biases by performing a genome-wide association study contrasting one subgroup versus another. For example, we showed that sex exhibits artifactual autosomal heritability in the presence of sex-differential participation bias. By performing a genome-wide association study of sex in approximately 3.3 million males and females, we identified over 158 autosomal loci spuriously associated with sex and highlighted complex traits underpinning differences in study participation between the sexes. For example, the body mass index-increasing allele at FTO was observed at higher frequency in males compared to females (odds ratio = 1.02, P = 4.4 × 10-36). Finally, we demonstrated how these biases can potentially lead to incorrect inferences in downstream analyses and propose a conceptual framework for addressing such biases. Our findings highlight a new challenge that genetic studies may face as sample sizes continue to grow.
Collapse
Affiliation(s)
- Nicola Pirastu
- Centre for Global Health Research, Usher Institute, University of Edinburgh, Edinburgh, UK
| | - Mattia Cordioli
- Institute for Molecular Medicine Finland, University of Helsinki, Helsinki, Finland
| | | | - Gianmarco Mignogna
- Institute for Molecular Medicine Finland, University of Helsinki, Helsinki, Finland
- Department of Statistics and Quantitative Methods, University of Milano Bicocca, Milan, Italy
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
| | - Abdel Abdellaoui
- Department of Psychiatry, Amsterdam Academic Medical Center, University of Amsterdam, Amsterdam, the Netherlands
| | - Benjamin Hollis
- MRC Epidemiology Unit, Institute of Metabolic Science, University of Cambridge, Cambridge, UK
- The Kennedy Institute of Rheumatology, University of Oxford, Oxford, UK
| | - Masahiro Kanai
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
- Department of Statistical Genetics, Osaka University Graduate School of Medicine, Suita, Japan
| | - Veera M Rajagopal
- Department of Biomedicine, Aarhus University, Aarhus, Denmark
- The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, Aarhus, Denmark
- Centre for Genomics and Personalized Medicine, Center for Genimics and Personalized Medice, Aarhus University, Aarhus, Denmark
- Centre for Integrative Sequencing, iSEQ, Aarhus University, Aarhus, Denmark
| | | | - Nikolas Baya
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
- Stanley Center for Psychiatric Disease, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Caitlin E Carey
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
- Stanley Center for Psychiatric Disease, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Juha Karjalainen
- Institute for Molecular Medicine Finland, University of Helsinki, Helsinki, Finland
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Thomas D Als
- Department of Biomedicine, Aarhus University, Aarhus, Denmark
- The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, Aarhus, Denmark
- Centre for Genomics and Personalized Medicine, Center for Genimics and Personalized Medice, Aarhus University, Aarhus, Denmark
- Centre for Integrative Sequencing, iSEQ, Aarhus University, Aarhus, Denmark
| | - Matthijs D Van der Zee
- Faculty of Behavioural and Movement Sciences, Biological Psychology, Vrije Universiteit, Amsterdam, the Netherlands
| | - Felix R Day
- MRC Epidemiology Unit, Institute of Metabolic Science, University of Cambridge, Cambridge, UK
| | - Ken K Ong
- MRC Epidemiology Unit, Institute of Metabolic Science, University of Cambridge, Cambridge, UK
- Department of Paediatrics, University of Cambridge, Cambridge, UK
| | - Takayuki Morisaki
- Division of Molecular Pathology, Institute of Medical Sciences, University of Tokyo, Tokyo, Japan
- BioBank Japan, Institute of Medical Science, University of Tokyo, Tokyo, Japan
- Department of Internal Medicine, Institute of Medical Science, University of Tokyo Hospital, Tokyo, Japan
| | - Eco de Geus
- Faculty of Behavioural and Movement Sciences, Biological Psychology, Vrije Universiteit, Amsterdam, the Netherlands
- Amsterdam Public Health Research institute, Amsterdam, the Netherlands
| | - Rino Bellocco
- Department of Statistics and Quantitative Methods, University of Milano Bicocca, Milan, Italy
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
| | - Yukinori Okada
- Department of Statistical Genetics, Osaka University Graduate School of Medicine, Suita, Japan
- Laboratory of Statistical Immunology, World Premier International Immunology Frontier Research Center, Osaka University, Suita, Japan
- Integrated Frontier Research for Medical Science Division, Institute for Open and Transdisciplinary Research Initiatives, Osaka University, Suita, Japan
| | - Anders D Børglum
- Department of Biomedicine, Aarhus University, Aarhus, Denmark
- The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, Aarhus, Denmark
- Centre for Genomics and Personalized Medicine, Center for Genimics and Personalized Medice, Aarhus University, Aarhus, Denmark
- Centre for Integrative Sequencing, iSEQ, Aarhus University, Aarhus, Denmark
| | - Peter Joshi
- Centre for Global Health Research, Usher Institute, University of Edinburgh, Edinburgh, UK
| | | | | | - Benjamin M Neale
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
- Stanley Center for Psychiatric Disease, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Raymond K Walters
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
- Stanley Center for Psychiatric Disease, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Michel G Nivard
- Faculty of Behavioural and Movement Sciences, Biological Psychology, Vrije Universiteit, Amsterdam, the Netherlands
- Amsterdam Public Health, Methodology Program, Amsterdam, the Netherlands
- Amsterdam Neuroscience-Mood, Anxiety, Psychosis, Stress & Sleep, Amsterdam, the Netherlands
| | - John R B Perry
- MRC Epidemiology Unit, Institute of Metabolic Science, University of Cambridge, Cambridge, UK.
| | - Andrea Ganna
- Institute for Molecular Medicine Finland, University of Helsinki, Helsinki, Finland.
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA.
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
| |
Collapse
|
39
|
Pirastu N, Cordioli M, Nandakumar P, Mignogna G, Abdellaoui A, Hollis B, Kanai M, Rajagopal VM, Parolo PDB, Baya N, Carey CE, Karjalainen J, Als TD, Van der Zee MD, Day FR, Ong KK, Morisaki T, de Geus E, Bellocco R, Okada Y, Børglum AD, Joshi P, Auton A, Hinds D, Neale BM, Walters RK, Nivard MG, Perry JRB, Ganna A. Genetic analyses identify widespread sex-differential participation bias. Nat Genet 2021; 53:663-671. [PMID: 33888908 DOI: 10.1038/s41588-021-00846-7] [Citation(s) in RCA: 113] [Impact Index Per Article: 37.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2020] [Accepted: 03/16/2021] [Indexed: 01/22/2023]
Abstract
Genetic association results are often interpreted with the assumption that study participation does not affect downstream analyses. Understanding the genetic basis of participation bias is challenging since it requires the genotypes of unseen individuals. Here we demonstrate that it is possible to estimate comparative biases by performing a genome-wide association study contrasting one subgroup versus another. For example, we showed that sex exhibits artifactual autosomal heritability in the presence of sex-differential participation bias. By performing a genome-wide association study of sex in approximately 3.3 million males and females, we identified over 158 autosomal loci spuriously associated with sex and highlighted complex traits underpinning differences in study participation between the sexes. For example, the body mass index-increasing allele at FTO was observed at higher frequency in males compared to females (odds ratio = 1.02, P = 4.4 × 10-36). Finally, we demonstrated how these biases can potentially lead to incorrect inferences in downstream analyses and propose a conceptual framework for addressing such biases. Our findings highlight a new challenge that genetic studies may face as sample sizes continue to grow.
Collapse
Affiliation(s)
- Nicola Pirastu
- Centre for Global Health Research, Usher Institute, University of Edinburgh, Edinburgh, UK
| | - Mattia Cordioli
- Institute for Molecular Medicine Finland, University of Helsinki, Helsinki, Finland
| | | | - Gianmarco Mignogna
- Institute for Molecular Medicine Finland, University of Helsinki, Helsinki, Finland.,Department of Statistics and Quantitative Methods, University of Milano Bicocca, Milan, Italy.,Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
| | - Abdel Abdellaoui
- Department of Psychiatry, Amsterdam Academic Medical Center, University of Amsterdam, Amsterdam, the Netherlands
| | - Benjamin Hollis
- MRC Epidemiology Unit, Institute of Metabolic Science, University of Cambridge, Cambridge, UK.,The Kennedy Institute of Rheumatology, University of Oxford, Oxford, UK
| | - Masahiro Kanai
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA.,Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.,Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.,Department of Statistical Genetics, Osaka University Graduate School of Medicine, Suita, Japan
| | - Veera M Rajagopal
- Department of Biomedicine, Aarhus University, Aarhus, Denmark.,The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, Aarhus, Denmark.,Centre for Genomics and Personalized Medicine, Center for Genimics and Personalized Medice, Aarhus University, Aarhus, Denmark.,Centre for Integrative Sequencing, iSEQ, Aarhus University, Aarhus, Denmark
| | | | - Nikolas Baya
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA.,Stanley Center for Psychiatric Disease, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Caitlin E Carey
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA.,Stanley Center for Psychiatric Disease, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Juha Karjalainen
- Institute for Molecular Medicine Finland, University of Helsinki, Helsinki, Finland.,Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA.,Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Thomas D Als
- Department of Biomedicine, Aarhus University, Aarhus, Denmark.,The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, Aarhus, Denmark.,Centre for Genomics and Personalized Medicine, Center for Genimics and Personalized Medice, Aarhus University, Aarhus, Denmark.,Centre for Integrative Sequencing, iSEQ, Aarhus University, Aarhus, Denmark
| | - Matthijs D Van der Zee
- Faculty of Behavioural and Movement Sciences, Biological Psychology, Vrije Universiteit, Amsterdam, the Netherlands
| | - Felix R Day
- MRC Epidemiology Unit, Institute of Metabolic Science, University of Cambridge, Cambridge, UK
| | - Ken K Ong
- MRC Epidemiology Unit, Institute of Metabolic Science, University of Cambridge, Cambridge, UK.,Department of Paediatrics, University of Cambridge, Cambridge, UK
| | | | | | | | - Takayuki Morisaki
- Division of Molecular Pathology, Institute of Medical Sciences, University of Tokyo, Tokyo, Japan.,BioBank Japan, Institute of Medical Science, University of Tokyo, Tokyo, Japan.,Department of Internal Medicine, Institute of Medical Science, University of Tokyo Hospital, Tokyo, Japan
| | - Eco de Geus
- Faculty of Behavioural and Movement Sciences, Biological Psychology, Vrije Universiteit, Amsterdam, the Netherlands.,Amsterdam Public Health Research institute, Amsterdam, the Netherlands
| | - Rino Bellocco
- Department of Statistics and Quantitative Methods, University of Milano Bicocca, Milan, Italy.,Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
| | - Yukinori Okada
- Department of Statistical Genetics, Osaka University Graduate School of Medicine, Suita, Japan.,Laboratory of Statistical Immunology, World Premier International Immunology Frontier Research Center, Osaka University, Suita, Japan.,Integrated Frontier Research for Medical Science Division, Institute for Open and Transdisciplinary Research Initiatives, Osaka University, Suita, Japan
| | - Anders D Børglum
- Department of Biomedicine, Aarhus University, Aarhus, Denmark.,The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, Aarhus, Denmark.,Centre for Genomics and Personalized Medicine, Center for Genimics and Personalized Medice, Aarhus University, Aarhus, Denmark.,Centre for Integrative Sequencing, iSEQ, Aarhus University, Aarhus, Denmark
| | - Peter Joshi
- Centre for Global Health Research, Usher Institute, University of Edinburgh, Edinburgh, UK
| | | | | | - Benjamin M Neale
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA.,Stanley Center for Psychiatric Disease, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Raymond K Walters
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA.,Stanley Center for Psychiatric Disease, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Michel G Nivard
- Faculty of Behavioural and Movement Sciences, Biological Psychology, Vrije Universiteit, Amsterdam, the Netherlands.,Amsterdam Public Health, Methodology Program, Amsterdam, the Netherlands.,Amsterdam Neuroscience-Mood, Anxiety, Psychosis, Stress & Sleep, Amsterdam, the Netherlands
| | - John R B Perry
- MRC Epidemiology Unit, Institute of Metabolic Science, University of Cambridge, Cambridge, UK.
| | - Andrea Ganna
- Institute for Molecular Medicine Finland, University of Helsinki, Helsinki, Finland. .,Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA. .,Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
| |
Collapse
|
40
|
Nandakumar P, Tian C, O'Connell J, Hinds D, Paterson AD, Sondheimer N. Nuclear genome-wide associations with mitochondrial heteroplasmy. SCIENCE ADVANCES 2021; 7:7/12/eabe7520. [PMID: 33731350 PMCID: PMC7968846 DOI: 10.1126/sciadv.abe7520] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/14/2020] [Accepted: 01/27/2021] [Indexed: 05/10/2023]
Abstract
The role of the nuclear genome in maintaining the stability of the mitochondrial genome (mtDNA) is incompletely known. mtDNA sequence variants can exist in a state of heteroplasmy, which denotes the coexistence of organellar genomes with different sequences. Heteroplasmic variants that impair mitochondrial capacity cause disease, and the state of heteroplasmy itself is deleterious. However, mitochondrial heteroplasmy may provide an intermediate state in the emergence of novel mitochondrial haplogroups. We used genome-wide genotyping data from 982,072 European ancestry individuals to evaluate variation in mitochondrial heteroplasmy and to identify the regions of the nuclear genome that affect it. Age, sex, and mitochondrial haplogroup were associated with the extent of heteroplasmy. GWAS identified 20 loci for heteroplasmy that exceeded genome-wide significance. This included a region overlapping mitochondrial transcription factor A (TFAM), which has multiple roles in mtDNA packaging, replication, and transcription. These results show that mitochondrial heteroplasmy has a heritable nuclear component.
Collapse
Affiliation(s)
| | - Chao Tian
- 23andMe Inc., 223 N Mathilda Ave, Sunnyvale, CA, USA
| | | | - David Hinds
- 23andMe Inc., 223 N Mathilda Ave, Sunnyvale, CA, USA.
| | - Andrew D Paterson
- Program in Genetics and Genome Biology, The Hospital for Sick Children, Toronto, ON, Canada
- Divisions of Epidemiology and Biostatistics, Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada
| | - Neal Sondheimer
- Program in Genetics and Genome Biology, The Hospital for Sick Children, Toronto, ON, Canada.
- Departments of Paediatrics and Molecular Genetics, University of Toronto, Toronto, ON, Canada
| |
Collapse
|
41
|
Simcoe M, Valdes A, Liu F, Furlotte NA, Evans DM, Hemani G, Ring SM, Smith GD, Duffy DL, Zhu G, Gordon SD, Medland SE, Vuckovic D, Girotto G, Sala C, Catamo E, Concas MP, Brumat M, Gasparini P, Toniolo D, Cocca M, Robino A, Yazar S, Hewitt A, Wu W, Kraft P, Hammond CJ, Shi Y, Chen Y, Zeng C, Klaver CCW, Uitterlinden AG, Ikram MA, Hamer MA, van Duijn CM, Nijsten T, Han J, Mackey DA, Martin NG, Cheng CY, Hinds DA, Spector TD, Kayser M, Hysi PG. Genome-wide association study in almost 195,000 individuals identifies 50 previously unidentified genetic loci for eye color. SCIENCE ADVANCES 2021; 7:7/11/eabd1239. [PMID: 33692100 PMCID: PMC7946369 DOI: 10.1126/sciadv.abd1239] [Citation(s) in RCA: 28] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/04/2020] [Accepted: 01/25/2021] [Indexed: 05/03/2023]
Abstract
Human eye color is highly heritable, but its genetic architecture is not yet fully understood. We report the results of the largest genome-wide association study for eye color to date, involving up to 192,986 European participants from 10 populations. We identify 124 independent associations arising from 61 discrete genomic regions, including 50 previously unidentified. We find evidence for genes involved in melanin pigmentation, but we also find associations with genes involved in iris morphology and structure. Further analyses in 1636 Asian participants from two populations suggest that iris pigmentation variation in Asians is genetically similar to Europeans, albeit with smaller effect sizes. Our findings collectively explain 53.2% (95% confidence interval, 45.4 to 61.0%) of eye color variation using common single-nucleotide polymorphisms. Overall, our study outcomes demonstrate that the genetic complexity of human eye color considerably exceeds previous knowledge and expectations, highlighting eye color as a genetically highly complex human trait.
Collapse
Affiliation(s)
- Mark Simcoe
- Department of Twins Research and Genetic Epidemiology, King's College London, London, UK
- Department of Ophthalmology, King's College London, London, UK
| | - Ana Valdes
- Department of Twins Research and Genetic Epidemiology, King's College London, London, UK
- Division of Rheumatology, Orthopaedics and Dermatology, School of Medicine, University of Nottingham, Nottingham, UK
| | - Fan Liu
- Department of Genetic Identification, Erasmus MC University Medical Center Rotterdam, Rotterdam, Netherlands
- Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | | | - David M Evans
- University of Queensland Diamantina Institute, University of Queensland, Brisbane, Queensland, Australia
- MRC Integrative Epidemiology Unit, University of Bristol, Bristol, UK
| | - Gibran Hemani
- MRC Integrative Epidemiology Unit, University of Bristol, Bristol, UK
- Population Health Sciences Bristol Medical School University of Bristol, Bristol, UK
| | - Susan M Ring
- MRC Integrative Epidemiology Unit, University of Bristol, Bristol, UK
- Population Health Sciences Bristol Medical School University of Bristol, Bristol, UK
| | - George Davey Smith
- MRC Integrative Epidemiology Unit, University of Bristol, Bristol, UK
- Population Health Sciences Bristol Medical School University of Bristol, Bristol, UK
| | - David L Duffy
- QIMR Berghofer Medical Research Institute, Brisbane, Queensland, Australia
| | - Gu Zhu
- QIMR Berghofer Medical Research Institute, Brisbane, Queensland, Australia
| | - Scott D Gordon
- QIMR Berghofer Medical Research Institute, Brisbane, Queensland, Australia
| | - Sarah E Medland
- QIMR Berghofer Medical Research Institute, Brisbane, Queensland, Australia
| | - Dragana Vuckovic
- Department of Medical Sciences, University of Trieste, Trieste, Italy
- Institute for Maternal and Child Health IRCCS "Burlo Garofolo", Trieste, Italy
- Epidemiology and Biostatistics Department, Faculty of Medicine, School of Public Health, Imperial College London, London, UK
| | - Giorgia Girotto
- Department of Medical Sciences, University of Trieste, Trieste, Italy
- Institute for Maternal and Child Health IRCCS "Burlo Garofolo", Trieste, Italy
| | - Cinzia Sala
- Division of Genetics of Common Disorders, S. Raffaele Scientific Institute, Milan, Italy
| | - Eulalia Catamo
- Department of Medical Sciences, University of Trieste, Trieste, Italy
| | - Maria Pina Concas
- Institute for Maternal and Child Health IRCCS "Burlo Garofolo", Trieste, Italy
| | - Marco Brumat
- Department of Medical Sciences, University of Trieste, Trieste, Italy
| | - Paolo Gasparini
- Department of Medical Sciences, University of Trieste, Trieste, Italy
- Institute for Maternal and Child Health IRCCS "Burlo Garofolo", Trieste, Italy
| | - Daniela Toniolo
- Division of Genetics of Common Disorders, S. Raffaele Scientific Institute, Milan, Italy
| | - Massimiliano Cocca
- Institute for Maternal and Child Health IRCCS "Burlo Garofolo", Trieste, Italy
| | - Antonietta Robino
- Institute for Maternal and Child Health IRCCS "Burlo Garofolo", Trieste, Italy
| | - Seyhan Yazar
- Centre for Ophthalmology and Visual Science, University of Western Australia, Lions Eye Institute, Perth, Australia
| | - Alex Hewitt
- Centre for Ophthalmology and Visual Science, University of Western Australia, Lions Eye Institute, Perth, Australia
- Centre for Eye Research Australia, University of Melbourne, Department of Ophthalmology, Royal Victorian Eye and Ear Hospital, Melbourne, Australia
- School of Medicine, Menzies Research Institute Tasmania, University of Tasmania, Hobart, Australia
| | - Wenting Wu
- Department of Epidemiology, Fairbanks School of Public Health, Indiana University, and Indiana University Melvin and Bren Simon Cancer Center, Indianapolis, IN, USA
| | - Peter Kraft
- Program in Genetic Epidemiology and Statistical Genetics, Harvard T.H. Chan School of Public Health, Harvard University, Boston, MA, USA
| | - Christopher J Hammond
- Department of Twins Research and Genetic Epidemiology, King's College London, London, UK
- Department of Ophthalmology, King's College London, London, UK
| | - Yuan Shi
- Singapore Eye Research Institute, Singapore National Eye Center, Singapore
| | - Yan Chen
- Department of Genetic Identification, Erasmus MC University Medical Center Rotterdam, Rotterdam, Netherlands
- Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Changqing Zeng
- Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China
| | - Caroline C W Klaver
- Department of Ophthalmology, Erasmus MC University Medical Center Rotterdam, Rotterdam, Netherlands
- Department of Epidemiology, Erasmus MC University Medical Center Rotterdam, Rotterdam, Netherlands
- Department of Ophthalmology, Radboud University Medical Center, Nijmegen, Netherlands
| | - Andre G Uitterlinden
- Department of Epidemiology, Erasmus MC University Medical Center Rotterdam, Rotterdam, Netherlands
- Department of Internal Medicine, Erasmus MC University Medical Center Rotterdam, Rotterdam, Netherlands
| | - M Arfan Ikram
- Department of Epidemiology, Erasmus MC University Medical Center Rotterdam, Rotterdam, Netherlands
| | - Merel A Hamer
- Department of Dermatology, Erasmus MC University Medical Center Rotterdam, Rotterdam, Netherlands
| | - Cornelia M van Duijn
- Department of Epidemiology, Erasmus MC University Medical Center Rotterdam, Rotterdam, Netherlands
- Nuffield Department of Population Health, University of Oxford, Oxford, UK
| | - Tamar Nijsten
- Department of Dermatology, Erasmus MC University Medical Center Rotterdam, Rotterdam, Netherlands
| | - Jiali Han
- Department of Epidemiology, Fairbanks School of Public Health, Indiana University, and Indiana University Melvin and Bren Simon Cancer Center, Indianapolis, IN, USA
| | - David A Mackey
- Centre for Ophthalmology and Visual Science, University of Western Australia, Lions Eye Institute, Perth, Australia
| | - Nicholas G Martin
- QIMR Berghofer Medical Research Institute, Brisbane, Queensland, Australia
| | - Ching-Yu Cheng
- Singapore Eye Research Institute, Singapore National Eye Center, Singapore
- Duke-NUS Medical School, Singapore
| | | | - Timothy D Spector
- Department of Twins Research and Genetic Epidemiology, King's College London, London, UK
| | - Manfred Kayser
- Department of Genetic Identification, Erasmus MC University Medical Center Rotterdam, Rotterdam, Netherlands.
| | - Pirro G Hysi
- Department of Twins Research and Genetic Epidemiology, King's College London, London, UK.
- Department of Ophthalmology, King's College London, London, UK
| |
Collapse
|
42
|
Kling D, Phillips C, Kennett D, Tillmar A. Investigative genetic genealogy: Current methods, knowledge and practice. Forensic Sci Int Genet 2021; 52:102474. [PMID: 33592389 DOI: 10.1016/j.fsigen.2021.102474] [Citation(s) in RCA: 52] [Impact Index Per Article: 17.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2020] [Revised: 01/12/2021] [Accepted: 01/27/2021] [Indexed: 12/15/2022]
Abstract
Investigative genetic genealogy (IGG) has emerged as a new, rapidly growing field of forensic science. We describe the process whereby dense SNP data, commonly comprising more than half a million markers, are employed to infer distant relationships. By distant we refer to degrees of relatedness exceeding that of first cousins. We review how methods of relationship matching and SNP analysis on an enlarged scale are used in a forensic setting to identify a suspect in a criminal investigation or a missing person. There is currently a strong need in forensic genetics not only to understand the underlying models to infer relatedness but also to fully explore the DNA technologies and data used in IGG. This review brings together many of the topics and examines their effectiveness and operational limits, while suggesting future directions for their forensic validation. We further investigated the methods used by the major direct-to-consumer (DTC) genetic ancestry testing companies as well as submitting a questionnaire where providers of forensic genetic genealogy summarized their operation/services. Although most of the DTC market, and genetic genealogy in general, has undisclosed, proprietary algorithms we review the current knowledge where information has been discussed and published more openly.
Collapse
Affiliation(s)
- Daniel Kling
- Department of Forensic Genetics and Forensic Toxicology, National Board of Forensic Medicine, Linköping, Sweden; Department of Forensic Sciences, Oslo University Hospital, Oslo, Norway.
| | - Christopher Phillips
- Forensic Genetics Unit, Institute of Forensic Sciences, University of Santiago de Compostela, Santiago de Compostela, Spain.
| | - Debbie Kennett
- Research Department of Genetics, Evolution and Environment, University College London, Gower Street, London WC1E 6BT, United Kingdom
| | - Andreas Tillmar
- Department of Forensic Genetics and Forensic Toxicology, National Board of Forensic Medicine, Linköping, Sweden; Department of Biomedical and Clinical Sciences, Faculty of Medicine and Health Sciences, Linköping University, Linköping, Sweden
| |
Collapse
|
43
|
Abstract
We trained and validated risk prediction models for the three major types of skin cancer- basal cell carcinoma (BCC), squamous cell carcinoma (SCC), and melanoma-on a cross-sectional and longitudinal dataset of 210,000 consented research participants who responded to an online survey covering personal and family history of skin cancer, skin susceptibility, and UV exposure. We developed a primary disease risk score (DRS) that combined all 32 identified genetic and non-genetic risk factors. Top percentile DRS was associated with an up to 13-fold increase (odds ratio per standard deviation increase >2.5) in the risk of developing skin cancer relative to the middle DRS percentile. To derive lifetime risk trajectories for the three skin cancers, we developed a second and age independent disease score, called DRSA. Using incident cases, we demonstrated that DRSA could be used in early detection programs for identifying high risk asymptotic individuals, and predicting when they are likely to develop skin cancer. High DRSA scores were not only associated with earlier disease diagnosis (by up to 14 years), but also with more severe and recurrent forms of skin cancer.
Collapse
|
44
|
Qiao Y, Sannerud JG, Basu-Roy S, Hayward C, Williams AL. Distinguishing pedigree relationships via multi-way identity by descent sharing and sex-specific genetic maps. Am J Hum Genet 2021; 108:68-83. [PMID: 33385324 PMCID: PMC7820736 DOI: 10.1016/j.ajhg.2020.12.004] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2019] [Accepted: 12/07/2020] [Indexed: 12/31/2022] Open
Abstract
The proportion of samples with one or more close relatives in a genetic dataset increases rapidly with sample size, necessitating relatedness modeling and enabling pedigree-based analyses. Despite this, relatives are generally unreported and current inference methods typically detect only the degree of relatedness of sample pairs and not pedigree relationships. We developed CREST, an accurate and fast method that identifies the pedigree relationships of close relatives. CREST utilizes identity by descent (IBD) segments shared between a pair of samples and their mutual relatives, leveraging the fact that sharing rates among these individuals differ across pedigree configurations. Furthermore, CREST exploits the profound differences in sex-specific genetic maps to classify pairs as maternally or paternally related-e.g., paternal half-siblings-using the locations of autosomal IBD segments shared between the pair. In simulated data, CREST correctly classifies 91.5%-100% of grandparent-grandchild (GP) pairs, 80.0%-97.5% of avuncular (AV) pairs, and 75.5%-98.5% of half-siblings (HS) pairs compared to PADRE's rates of 38.5%-76.0% of GP, 60.5%-92.0% of AV, 73.0%-95.0% of HS pairs. Turning to the real 20,032 sample Generation Scotland (GS) dataset, CREST identified seven pedigrees with incorrect relationship types or maternal/paternal parent sexes, five of which we confirmed as mistakes, and two with uncertain relationships. After correcting these, CREST correctly determines relationship types for 93.5% of GP, 97.7% of AV, and 92.2% of HS pairs that have sufficient mutual relative data; the parent sex in 100% of HS and 99.6% of GP pairs; and it completes this analysis in 2.8 h including IBD detection in eight threads.
Collapse
Affiliation(s)
- Ying Qiao
- Department of Computational Biology, Cornell University, Ithaca, NY 14853, USA
| | - Jens G Sannerud
- Department of Computational Biology, Cornell University, Ithaca, NY 14853, USA
| | - Sayantani Basu-Roy
- Department of Computational Biology, Cornell University, Ithaca, NY 14853, USA
| | - Caroline Hayward
- MRC Human Genetics Unit, MRC Institute of Genetic and Molecular Medicine, University of Edinburgh, Edinburgh EH4 2XU, UK
| | - Amy L Williams
- Department of Computational Biology, Cornell University, Ithaca, NY 14853, USA.
| |
Collapse
|
45
|
Cuellar-Partida G, Tung JY, Eriksson N, Albrecht E, Aliev F, Andreassen OA, Barroso I, Beckmann JS, Boks MP, Boomsma DI, Boyd HA, Breteler MMB, Campbell H, Chasman DI, Cherkas LF, Davies G, de Geus EJC, Deary IJ, Deloukas P, Dick DM, Duffy DL, Eriksson JG, Esko T, Feenstra B, Geller F, Gieger C, Giegling I, Gordon SD, Han J, Hansen TF, Hartmann AM, Hayward C, Heikkilä K, Hicks AA, Hirschhorn JN, Hottenga JJ, Huffman JE, Hwang LD, Ikram MA, Kaprio J, Kemp JP, Khaw KT, Klopp N, Konte B, Kutalik Z, Lahti J, Li X, Loos RJF, Luciano M, Magnusson SH, Mangino M, Marques-Vidal P, Martin NG, McArdle WL, McCarthy MI, Medina-Gomez C, Melbye M, Melville SA, Metspalu A, Milani L, Mooser V, Nelis M, Nyholt DR, O'Connell KS, Ophoff RA, Palmer C, Palotie A, Palviainen T, Pare G, Paternoster L, Peltonen L, Penninx BWJH, Polasek O, Pramstaller PP, Prokopenko I, Raikkonen K, Ripatti S, Rivadeneira F, Rudan I, Rujescu D, Smit JH, Smith GD, Smoller JW, Soranzo N, Spector TD, Pourcain BS, Starr JM, Stefánsson H, Steinberg S, Teder-Laving M, Thorleifsson G, Stefánsson K, Timpson NJ, Uitterlinden AG, van Duijn CM, van Rooij FJA, Vink JM, Vollenweider P, Vuoksimaa E, Waeber G, Wareham NJ, Warrington N, Waterworth D, Werge T, Wichmann HE, Widen E, Willemsen G, Wright AF, Wright MJ, Xu M, Zhao JH, Kraft P, Hinds DA, Lindgren CM, Mägi R, Neale BM, Evans DM, Medland SE. Genome-wide association study identifies 48 common genetic variants associated with handedness. Nat Hum Behav 2021; 5:59-70. [PMID: 32989287 PMCID: PMC7116623 DOI: 10.1038/s41562-020-00956-y] [Citation(s) in RCA: 68] [Impact Index Per Article: 22.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2020] [Accepted: 08/18/2020] [Indexed: 02/06/2023]
Abstract
Handedness has been extensively studied because of its relationship with language and the over-representation of left-handers in some neurodevelopmental disorders. Using data from the UK Biobank, 23andMe and the International Handedness Consortium, we conducted a genome-wide association meta-analysis of handedness (N = 1,766,671). We found 41 loci associated (P < 5 × 10-8) with left-handedness and 7 associated with ambidexterity. Tissue-enrichment analysis implicated the CNS in the aetiology of handedness. Pathways including regulation of microtubules and brain morphology were also highlighted. We found suggestive positive genetic correlations between left-handedness and neuropsychiatric traits, including schizophrenia and bipolar disorder. Furthermore, the genetic correlation between left-handedness and ambidexterity is low (rG = 0.26), which implies that these traits are largely influenced by different genetic mechanisms. Our findings suggest that handedness is highly polygenic and that the genetic variants that predispose to left-handedness may underlie part of the association with some psychiatric disorders.
Collapse
Affiliation(s)
- Gabriel Cuellar-Partida
- The University of Queensland Diamantina Institute, The University of Queensland, Woolloongabba, Queensland, Australia
- 23andMe, Inc., Sunnyvale, CA, USA
| | | | | | - Eva Albrecht
- Institute of Genetic Epidemiology, Helmholtz Zentrum München-German Research Center for Environmental Health, Neuherberg, Germany
| | - Fazil Aliev
- Department of Psychology, Virginia Commonwealth University, Richmond, VA, USA
- Karabuk University, Faculty of Business, Karabük, Turkey
| | - Ole A Andreassen
- NORMENT, Institute of Clinical Medicine, University of Oslo, Oslo, Norway
- Division of Mental Health and Addiction, Oslo University Hospital, Oslo, Norway
| | - Inês Barroso
- Human Genetics, Wellcome Sanger Institute, Hinxton, UK
- MRC Epidemiology Unit, University of Cambridge, Cambridge, UK
- Wellcome-MRC Institute of Metabolic Science, University of Cambridge, Cambridge, UK
| | - Jacques S Beckmann
- Service of Medical Genetics, Centre Hospitalier Universitaire Vaudois and University of Lausanne, Lausanne, Switzerland
| | - Marco P Boks
- Department of Psychiatry, UMC Utrecht Brain Center, University Utrecht, Utrecht, The Netherlands
| | - Dorret I Boomsma
- Department of Biological Psychology, Vrije Universiteit, Amsterdam, The Netherlands
- Amsterdam Public Health research institute, Amsterdam University Medical Centers, Amsterdam, The Netherlands
| | - Heather A Boyd
- Department of Epidemiology Research, Statens Serum Institut, Copenhagen, Denmark
| | - Monique M B Breteler
- Population Health Sciences, German Center for Neurodegenerative Diseases (DZNE), Bonn, Germany
| | - Harry Campbell
- Usher Institute of Population Health Sciences and Informatics, University of Edinburgh, Edinburgh, UK
| | - Daniel I Chasman
- Division of Preventive Medicine, Brigham and Women's Hospital, Boston, MA, USA
- Harvard Medical School, Boston, MA, USA
| | - Lynn F Cherkas
- Department of Twin Research and Genetic Epidemiology, King's College London, London, UK
| | - Gail Davies
- Department of Psychology, University of Edinburgh, Edinburgh, UK
- Centre for Cognitive Ageing and Cognitive Epidemiology, University of Edinburgh, Edinburgh, UK
| | - Eco J C de Geus
- Department of Biological Psychology, Vrije Universiteit, Amsterdam, The Netherlands
- Amsterdam Public Health research institute, Amsterdam University Medical Centers, Amsterdam, The Netherlands
| | - Ian J Deary
- Department of Psychology, University of Edinburgh, Edinburgh, UK
- Centre for Cognitive Ageing and Cognitive Epidemiology, University of Edinburgh, Edinburgh, UK
| | - Panos Deloukas
- William Harvey Research Institute, Barts and the London Medical School, and the Centre for Genomic Health, Queen Mary University of London, London, UK
| | - Danielle M Dick
- Department of Psychology, Virginia Commonwealth University, Richmond, VA, USA
- Department of Human and Molecular Genetics, Virginia Commonwealth University, Richmond, VA, USA
| | - David L Duffy
- Genetic Epidemiology, QIMR Berghofer Medical Research Institute, Brisbane, Queensland, Australia
| | - Johan G Eriksson
- Department of General Practice and Primary Health Care, University of Helsinki and Helsinki University Hospital, Helsinki, Finland
- Folkhälsan Research Center, Helsinki, Finland
- Singapore Institute for Clinical Sciences, Agency for Science Technology and Research, Singapore, Singapore
- Department of Obstetrics and Gynaecology, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
| | - Tõnu Esko
- Estonian Genome Centre, Institute of Genomics, University of Tartu, Tartu, Estonia
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Bjarke Feenstra
- Department of Epidemiology Research, Statens Serum Institut, Copenhagen, Denmark
| | - Frank Geller
- Department of Epidemiology Research, Statens Serum Institut, Copenhagen, Denmark
| | - Christian Gieger
- Research Unit of Molecular Epidemiology, Institute of Epidemiology, Helmholtz Zentrum München-German Research Center for Environmental Health, Neuherberg, Germany
- German Center for Diabetes Research (DZD), München-Neuherberg, Neuherberg, Germany
| | - Ina Giegling
- University Clinic and Outpatient Clinic for Psychiatry, Psychotherapy and Psychosomatics, Martin-Luther-University Halle-Wittenberg, Halle (Saale), Germany
| | - Scott D Gordon
- Genetic Epidemiology, QIMR Berghofer Medical Research Institute, Brisbane, Queensland, Australia
| | - Jiali Han
- Department of Epidemiology, Richard M. Fairbanks School of Public Health, Indiana University, Indianapolis, IN, USA
- Melvin and Bren Simon Cancer Center, Indiana University, Indianapolis, IN, USA
| | - Thomas F Hansen
- Institute of Biological Psychiatry, Mental Health Services of Copenhagen, Copenhagen, Denmark
- Danish Headache Center, Copenhagen University Hospital, Glostrup, Denmark
| | - Annette M Hartmann
- University Clinic and Outpatient Clinic for Psychiatry, Psychotherapy and Psychosomatics, Martin-Luther-University Halle-Wittenberg, Halle (Saale), Germany
| | - Caroline Hayward
- MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh, UK
| | - Kauko Heikkilä
- Institute for Molecular Medicine FIMM, University of Helsinki, Helsinki, Finland
| | - Andrew A Hicks
- Institute for Biomedicine, Eurac Research, Bolzano, Italy
| | - Joel N Hirschhorn
- Division of Endocrinology, Boston Children's Hospital, Boston, MA, USA
- Department of Genetics, Harvard Medical School, Boston, MA, USA
- Medical and Population Genetics, Broad Institute, Cambridge, MA, USA
| | - Jouke-Jan Hottenga
- Department of Biological Psychology, Vrije Universiteit, Amsterdam, The Netherlands
- Amsterdam Public Health research institute, Amsterdam University Medical Centers, Amsterdam, The Netherlands
| | - Jennifer E Huffman
- MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh, UK
| | - Liang-Dar Hwang
- The University of Queensland Diamantina Institute, The University of Queensland, Woolloongabba, Queensland, Australia
| | - M Arfan Ikram
- Department of Epidemiology, Erasmus MC, University Medical Center, Rotterdam, The Netherlands
| | - Jaakko Kaprio
- Institute for Molecular Medicine FIMM, University of Helsinki, Helsinki, Finland
- Department of Public Health, University of Helsinki, Helsinki, Finland
| | - John P Kemp
- The University of Queensland Diamantina Institute, The University of Queensland, Woolloongabba, Queensland, Australia
- Medical Research Council Integrative Epidemiology Unit, University of Bristol, Bristol, UK
| | - Kay-Tee Khaw
- Public Health and Primary Care, University of Cambridge, Cambridge, UK
| | - Norman Klopp
- Hannover Unified Biobank, Hannover Medical School, Hannover, Germany
| | - Bettina Konte
- University Clinic and Outpatient Clinic for Psychiatry, Psychotherapy and Psychosomatics, Martin-Luther-University Halle-Wittenberg, Halle (Saale), Germany
| | - Zoltan Kutalik
- Center for Primary Care and Public Health, University of Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Jari Lahti
- Department of Psychology and Logopedics, University of Helsinki, Helsinki, Finland
- Turku Institute for Advanced Studies, University of Turku, Turku, Finland
- Department of Psychology and Logopedics, Faculty of Medicine, University of Helsinki, Helsinki, Finland
| | - Xin Li
- Department of Epidemiology, Richard M. Fairbanks School of Public Health, Indiana University, Indianapolis, IN, USA
- Melvin and Bren Simon Cancer Center, Indiana University, Indianapolis, IN, USA
| | - Ruth J F Loos
- MRC Epidemiology Unit, University of Cambridge, Cambridge, UK
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- The Mindich Child Health and Development Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Michelle Luciano
- Department of Psychology, University of Edinburgh, Edinburgh, UK
- Centre for Cognitive Ageing and Cognitive Epidemiology, University of Edinburgh, Edinburgh, UK
| | | | - Massimo Mangino
- Department of Twin Research and Genetic Epidemiology, King's College London, London, UK
| | - Pedro Marques-Vidal
- Department of Medicine, Internal Medicine, Lausanne University Hospital and University of Lausanne, Lausanne, Switzerland
| | - Nicholas G Martin
- Genetic Epidemiology, QIMR Berghofer Medical Research Institute, Brisbane, Queensland, Australia
| | | | - Mark I McCarthy
- Wellcome Centre for Human Genetics, Nuffield Department of Medicine, University of Oxford, Oxford, UK
- Oxford Centre for Diabetes, Endocrinology and Metabolism, University of Oxford, Oxford, UK
- Oxford NIHR Biomedical Research Centre, Oxford University Hospitals Trust, Oxford, UK
- Human Genetics, Genentech, South San Francisco, CA, USA
| | - Carolina Medina-Gomez
- Department of Epidemiology, Erasmus MC, University Medical Center, Rotterdam, The Netherlands
- Department of Internal Medicine, Erasmus MC, University Medical Center, Rotterdam, The Netherlands
| | - Mads Melbye
- Department of Epidemiology Research, Statens Serum Institut, Copenhagen, Denmark
- Department of Clinical Medicine, University of Copenhagen, Copenhagen, Denmark
- Department of Medicine, Stanford University School of Medicine, Stanford, CA, USA
| | | | - Andres Metspalu
- Estonian Genome Centre, Institute of Genomics, University of Tartu, Tartu, Estonia
| | - Lili Milani
- Estonian Genome Centre, Institute of Genomics, University of Tartu, Tartu, Estonia
| | - Vincent Mooser
- Service of Clinical Chemistry, Lausanne University Hospital and University of Lausanne, Lausanne, Switzerland
| | - Mari Nelis
- Estonian Genome Centre, Institute of Genomics, University of Tartu, Tartu, Estonia
| | - Dale R Nyholt
- Genetic Epidemiology, QIMR Berghofer Medical Research Institute, Brisbane, Queensland, Australia
- School of Biomedical Sciences, Queensland University of Technology, Brisbane, Queensland, Australia
| | - Kevin S O'Connell
- NORMENT, Institute of Clinical Medicine, University of Oslo, Oslo, Norway
| | - Roel A Ophoff
- Department of Human Genetics, University California Los Angeles, Los Angeles, CA, USA
- Center for Neurobehavioral Genetics, Semel Institute for Neuroscience and Human Behavior, University California Los Angeles, Los Angeles, CA, USA
- Department of Psychiatry, Erasmus University Medical Center, Rotterdam, The Netherlands
| | - Cameron Palmer
- Department of Computer Science, Columbia University, New York, NY, USA
| | - Aarno Palotie
- Institute for Molecular Medicine FIMM, University of Helsinki, Helsinki, Finland
| | - Teemu Palviainen
- Institute for Molecular Medicine FIMM, University of Helsinki, Helsinki, Finland
| | - Guillaume Pare
- Pathology and Molecular Medicine, McMaster University, Hamilton, Ontario, Canada
| | | | - Leena Peltonen
- Drug Research Program, Division of Pharmaceutical Chemistry and Technology, Faculty of Pharmacy, University of Helsinki, Helsinki, Finland
| | - Brenda W J H Penninx
- Amsterdam Public Health research institute, Amsterdam University Medical Centers, Amsterdam, The Netherlands
- Department of Psychiatry, Amsterdam UMC, VU University, Amsterdam, The Netherlands
| | - Ozren Polasek
- Department of Public Health, University of Split School of Medicine, Split, Croatia
- Research Unit, Psychiatric Hospital Sveti Ivan, Zagreb, Croatia
| | | | - Inga Prokopenko
- Section of Statistical Multi-Omics, Department of Clinical and Experimental Medicine, University of Surrey, Guildford, UK
- Section of Genomics of Common Disease, Department of Medicine, Imperial College London, London, UK
| | - Katri Raikkonen
- Department of Psychology and Logopedics, University of Helsinki, Helsinki, Finland
| | - Samuli Ripatti
- Institute for Molecular Medicine FIMM, University of Helsinki, Helsinki, Finland
| | - Fernando Rivadeneira
- Department of Epidemiology, Erasmus MC, University Medical Center, Rotterdam, The Netherlands
- Department of Internal Medicine, Erasmus MC, University Medical Center, Rotterdam, The Netherlands
| | - Igor Rudan
- Centre for Global Health Research, The Usher Institute, University of Edinburgh, Edinburgh, UK
| | - Dan Rujescu
- University Clinic and Outpatient Clinic for Psychiatry, Psychotherapy and Psychosomatics, Martin-Luther-University Halle-Wittenberg, Halle (Saale), Germany
| | - Johannes H Smit
- Amsterdam Public Health research institute, Amsterdam University Medical Centers, Amsterdam, The Netherlands
- Department of Psychiatry, Amsterdam UMC, VU University, Amsterdam, The Netherlands
| | | | - Jordan W Smoller
- Department of Psychiatry and Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
- Stanley Center for Psychiatric Research, Broad Institute, Cambridge, MA, USA
| | | | - Tim D Spector
- Department of Twin Research and Genetic Epidemiology, King's College London, London, UK
| | - Beate St Pourcain
- MRC Integrative Epidemiology Unit, University of Bristol, Bristol, UK
- Max Planck Institute for Psycholinguistics, Wundtlaan, Nijmegen, The Netherlands
- Donders Institute for Brain, Cognition and Behaviour, Radboud University, Nijmegen, The Netherlands
| | - John M Starr
- Alzheimer Scotland Dementia Research Centre, University of Edinburgh, Edinburgh, UK
- Centre for Cognitive Ageing and Cognitive Epidemilogy, University of Edinburgh, Edinburgh, UK
| | | | | | - Maris Teder-Laving
- Estonian Genome Centre, Institute of Genomics, University of Tartu, Tartu, Estonia
| | | | | | | | - André G Uitterlinden
- Department of Epidemiology, Erasmus MC, University Medical Center, Rotterdam, The Netherlands
- Department of Internal Medicine, Erasmus MC, University Medical Center, Rotterdam, The Netherlands
| | - Cornelia M van Duijn
- Department of Epidemiology, Erasmus MC, University Medical Center, Rotterdam, The Netherlands
- Department of Internal Medicine, Erasmus MC, University Medical Center, Rotterdam, The Netherlands
| | - Frank J A van Rooij
- Department of Epidemiology, Erasmus MC, University Medical Center, Rotterdam, The Netherlands
| | - Jaqueline M Vink
- Department of Biological Psychology, Vrije Universiteit, Amsterdam, The Netherlands
- Behavioural Science Institute, Radboud University, Nijmegen, The Netherlands
| | - Peter Vollenweider
- Department of Medicine, Internal Medicine, Lausanne University Hospital and University of Lausanne, Lausanne, Switzerland
| | - Eero Vuoksimaa
- Institute for Molecular Medicine FIMM, University of Helsinki, Helsinki, Finland
| | - Gérard Waeber
- Department of Medicine, Internal Medicine, Lausanne University Hospital and University of Lausanne, Lausanne, Switzerland
| | | | - Nicole Warrington
- The University of Queensland Diamantina Institute, The University of Queensland, Woolloongabba, Queensland, Australia
| | | | - Thomas Werge
- Institute of Biological Psychiatry, Mental Health Services of Copenhagen, Copenhagen, Denmark
- Department of Clinical Medicine, University of Copenhagen, Copenhagen, Denmark
- The Lundbeck Foundation's IPSYCH Initiative, Copenhagen, Denmark
| | | | - Elisabeth Widen
- Institute for Molecular Medicine FIMM, University of Helsinki, Helsinki, Finland
| | - Gonneke Willemsen
- Department of Biological Psychology, Vrije Universiteit, Amsterdam, The Netherlands
| | - Alan F Wright
- MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh, UK
| | - Margaret J Wright
- Queensland Brain Institute, The University of Queensland, Brisbane, Queensland, Australia
- Centre for Advanced Imaging, The University of Queensland, Brisbane, Queensland, Australia
| | - Mousheng Xu
- Department of Epidemiology, Harvard T. H. Chan School of Public Health, Harvard Medical School, Boston, MA, USA
| | - Jing Hua Zhao
- Department of Public Health and Primary Care, University of Cambridge, Cambridge, UK
| | - Peter Kraft
- Department of Epidemiology, Harvard T. H. Chan School of Public Health, Harvard Medical School, Boston, MA, USA
| | | | - Cecilia M Lindgren
- Wellcome Centre for Human Genetics, Nuffield Department of Medicine, University of Oxford, Oxford, UK
- Big Data Institute at the Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, UK
| | - Reedik Mägi
- Estonian Genome Centre, Institute of Genomics, University of Tartu, Tartu, Estonia
| | - Benjamin M Neale
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
| | - David M Evans
- The University of Queensland Diamantina Institute, The University of Queensland, Woolloongabba, Queensland, Australia.
- MRC Integrative Epidemiology Unit, University of Bristol, Bristol, UK.
| | - Sarah E Medland
- The University of Queensland Diamantina Institute, The University of Queensland, Woolloongabba, Queensland, Australia.
- Psychiatric Genetics, QIMR Berghofer Medical Research Institute, Brisbane, Queensland, Australia.
| |
Collapse
|
46
|
Heilbron K, Jensen MP, Bandres-Ciga S, Fontanillas P, Blauwendraat C, Nalls MA, Singleton AB, Smith GD, Cannon P, Noyce AJ. Unhealthy Behaviours and Risk of Parkinson's Disease: A Mendelian Randomisation Study. JOURNAL OF PARKINSON'S DISEASE 2021; 11:1981-1993. [PMID: 34275906 PMCID: PMC8609708 DOI: 10.3233/jpd-202487] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Accepted: 06/28/2021] [Indexed: 12/17/2022]
Abstract
BACKGROUND Tobacco smoking and alcohol intake have been identified in observational studies as potentially protective factors against developing Parkinson's disease (PD); the impact of body mass index (BMI) on PD risk is debated. Whether such epidemiological associations are causal remains unclear. Mendelian randomsation (MR) uses genetic variants to explore the effects of exposures on outcomes; potentially reducing bias from residual confounding and reverse causation. OBJECTIVE Using MR, we examined relationships between PD risk and three unhealthy behaviours: tobacco smoking, alcohol intake, and higher BMI. METHODS 19,924 PD cases and 2,413,087 controls were included in the analysis. We performed genome-wide association studies to identify single nucleotide polymorphisms associated with tobacco smoking, alcohol intake, and BMI. MR analysis of the relationship between each exposure and PD was undertaken using a split-sample design. RESULTS Ever-smoking reduced the risk of PD (OR 0.955; 95%confidence interval [CI] 0.921-0.991; p = 0.013). Higher daily alcohol intake increased the risk of PD (OR 1.125, 95%CI 1.025-1.235; p = 0.013) and a 1 kg/m2 higher BMI reduced the risk of PD (OR 0.988, 95%CI 0.979-0.997; p = 0.008). Sensitivity analyses did not suggest bias from horizontal pleiotropy or invalid instruments. CONCLUSION Using split-sample MR in over 2.4 million participants, we observed a protective effect of smoking on risk of PD. In contrast to observational data, alcohol consumption appeared to increase the risk of PD. Higher BMI had a protective effect on PD, but the effect was small.
Collapse
Affiliation(s)
| | - Melanie P. Jensen
- Preventive Neurology Unit, Wolfson Institute of Preventive Medicine, Queen Mary University of London, London, UK
- Department of Cellular Pathology, Northwest London Pathology, Charing Cross Hospital, London, UK
| | - Sara Bandres-Ciga
- Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD, USA
| | | | - Cornelis Blauwendraat
- Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD, USA
| | - Mike A. Nalls
- Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD, USA
- Data Tecnica International, Glen Echo, MD, USA
| | - Andrew B. Singleton
- Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD, USA
| | - George Davey Smith
- MRC Integrative Epidemiology Unit, Bristol Medical School, University of Bristol, Bristol, UK, UK
| | | | - Alastair J. Noyce
- Preventive Neurology Unit, Wolfson Institute of Preventive Medicine, Queen Mary University of London, London, UK
- Department of Clinical and Movement Neurosciences, UCL Institute of Neurology, London, UK
| | - The 23andMe Research Team
- 23andMe, Inc., Sunnyvale, CA, USA
- Preventive Neurology Unit, Wolfson Institute of Preventive Medicine, Queen Mary University of London, London, UK
- Department of Cellular Pathology, Northwest London Pathology, Charing Cross Hospital, London, UK
- Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD, USA
- Data Tecnica International, Glen Echo, MD, USA
- MRC Integrative Epidemiology Unit, Bristol Medical School, University of Bristol, Bristol, UK, UK
- Department of Clinical and Movement Neurosciences, UCL Institute of Neurology, London, UK
| |
Collapse
|
47
|
Li QS, Tian C, Hinds D, Seabrook GR. The association of clinical phenotypes to known AD/FTD genetic risk loci and their inter-relationship. PLoS One 2020; 15:e0241552. [PMID: 33152005 PMCID: PMC7644002 DOI: 10.1371/journal.pone.0241552] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2020] [Accepted: 10/18/2020] [Indexed: 11/25/2022] Open
Abstract
To elucidate how variants in genetic risk loci previously implicated in Alzheimer’s Disease (AD) and/or frontotemporal dementia (FTD) contribute to expression of disease phenotypes, a phenome-wide association study was performed in two waves. In the first wave, we explored clinical traits associated with thirteen genetic variants previously reported to be linked to disease risk using both the 23andMe and UKB cohorts. We tested 30 additional AD variants in UKB cohort only in the second wave. APOE variants defining ε2/ε3/ε4 alleles and rs646776 were identified to be significantly associated with metabolic/cardiovascular and longevity traits. APOE variants were also significantly associated with neurological traits. ABI3 variant rs28394864 was significantly associated with cardiovascular (e.g. (hypertension, ischemic heart disease, coronary atherosclerosis, angina) and immune-related trait asthma. Both APOE variants and CLU variant were significantly associated with nearsightedness. HLA- DRB1 variant was associated with diseases with immune-related traits. Additionally, variants from 10+ AD genes (BZRAP1-AS1, ADAMTS4, ADAM10, APH1B, SCIMP, ABI3, SPPL2A, ZNF232, GRN, CD2AP, and CD33) were associated with hematological measurements such as white blood cell (leukocyte) count, monocyte count, neutrophill count, platelet count, and/or mean platelet (thrombocyte) volume (an autoimmune disease biomarker). Many of these genes are expressed specifically in microglia. The associations of ABI3 variant with cardiovascular and immune-related traits are one of the novel findings from this study. Taken together, it is evidenced that at least some AD and FTD variants are associated with multiple clinical phenotypes and not just dementia. These findings were discussed in the context of causal relationship versus pleiotropy via Mendelian randomization analysis.
Collapse
Affiliation(s)
- Qingqin S. Li
- Janssen Research & Development, LLC, Titusville, NJ, United States of America
- * E-mail:
| | - Chao Tian
- 23andMe, Inc., Mountain View, CA, United States of America
| | | | - David Hinds
- 23andMe, Inc., Mountain View, CA, United States of America
| | - Guy R. Seabrook
- Johnson & Johnson Innovation, South San Francisco, CA, United States of America
| |
Collapse
|
48
|
Ge J, Budowle B. Forensic investigation approaches of searching relatives in DNA databases. J Forensic Sci 2020; 66:430-443. [PMID: 33136341 DOI: 10.1111/1556-4029.14615] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2020] [Revised: 09/11/2020] [Accepted: 10/05/2020] [Indexed: 11/29/2022]
Abstract
There are several indirect database searching approaches to identify the potential source of a forensic biological sample. These DNA-based approaches are familial searching, Y-STR database searching, and investigative genetic genealogy (IGG). The first two strategies use forensic DNA databases managed by the government, and the latter uses databases managed by private citizens or companies. Each of these search strategies relies on DNA testing to identify relatives of the donor of the crime scene sample, provided such profiles reside in the DNA database(s). All three approaches have been successfully used to identify the donor of biological evidence, which assisted in solving criminal cases or identifying unknown human remains. This paper describes and compares these approaches in terms of genotyping technologies, searching methods, database structures, searching efficiency, data quality, data security, and costs, and raises some potential privacy and legal considerations for further discussion by stakeholders and scientists. Y-STR database searching and IGG are advantageous since they are able to assist in more cases than familial searching readily identifying distant relatives. In contrast, familial searching can be performed more readily with existing laboratory systems. Every country or state may have its own unique economic, technical, cultural, and legal considerations and should decide the best approach(es) to fit those circumstances. Regardless of the approach, the ultimate goal should be the same: generate investigative leads and solve active and cold criminal cases to public safety, under stringent policies and security practices designed to protect the privacy of its citizenry.
Collapse
Affiliation(s)
- Jianye Ge
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, TX, USA.,Department of Microbiology, Immunology, and Genetics, University of North Texas Health Science Center, Fort Worth, TX, USA
| | - Bruce Budowle
- Center for Human Identification, University of North Texas Health Science Center, Fort Worth, TX, USA.,Department of Microbiology, Immunology, and Genetics, University of North Texas Health Science Center, Fort Worth, TX, USA
| |
Collapse
|
49
|
Identification of a new genetic variant associated with cholecystitis: A multicenter genome-wide association study. J Trauma Acute Care Surg 2020; 89:173-178. [PMID: 32118827 DOI: 10.1097/ta.0000000000002647] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
BACKGROUND The genomic landscape of gallbladder disease remains poorly understood. We sought to examine the association between genetic variants and the development of cholecystitis. METHODS The Biobank of a large multi-institutional health care system was used. All patients with cholecystitis were identified using International Statistical Classification of Diseases, 10th Revision, codes and genotyped across six batches. To control for population stratification, data were restricted to that from individuals of European genomic ancestry using a multidimensional scaling approach. The association between single nucleotide polymorphisms and cholecystitis was evaluated with a mixed linear model-based analysis, controlling for age, sex, and obesity. The threshold for significance was set at 5 × 10. RESULTS Of 24,635 patients (mean ± SD age, 60.1 ± 16.7 years; 13,022 females [52.9%]), 900 had cholecystitis (mean ± SD age, 65.4 ± 14.3 years; 496 females [55.1%]). After meta-analysis, three single nucleotide polymorphisms on chromosome 5p15 exceeded the threshold for significance (p < 5 × 10). The phenotypic variance of cholecystitis explained by genetics and controlling for sex and obesity was estimated to be 17.9%. CONCLUSION Using a multi-institutional genomic Biobank, we report that a region on chromosome 5p15 is associated with the development of cholecystitis that can be used to identify patients at risk. LEVEL OF EVIDENCE Prognostic and epidemiological, Level III.
Collapse
|
50
|
Katsanis SH. Pedigrees and Perpetrators: Uses of DNA and Genealogy in Forensic Investigations. Annu Rev Genomics Hum Genet 2020; 21:535-564. [DOI: 10.1146/annurev-genom-111819-084213] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
In the past few years, cases with DNA evidence that could not be solved with direct matches in DNA databases have benefited from comparing single-nucleotide polymorphism data with private and public genomic databases. Using a combination of genome comparisons and traditional genealogical research, investigators can triangulate distant relatives to the contributor of DNA data from a crime scene, ultimately identifying perpetrators of violent crimes. This approach has also been successful in identifying unknown deceased persons and perpetrators of lesser crimes. Such advances are bringing into focus ethical questions on how much access to DNA databases should be granted to law enforcement and how best to empower public genome contributors with control over their data. The necessary policies will take time to develop but can be informed by reflection on the familial searching policies developed for searches of the federal DNA database and considerations of the anonymity and privacy interests of civilians.
Collapse
Affiliation(s)
- Sara H. Katsanis
- Mary Ann & J. Milburn Smith Child Health Research, Outreach, and Advocacy Center, Ann & Robert H. Lurie Children's Hospital of Chicago, Chicago, Illinois 60611, USA
- Department of Pediatrics, Northwestern University, Chicago, Illinois 60611, USA
| |
Collapse
|