1
|
Horimoto AR, Boyken LA, Blue EE, Grinde KE, Nafikov RA, Sohi HK, Nato AQ, Bis JC, Brusco LI, Morelli L, Ramirez A, Dalmasso MC, Temple S, Satizabal C, Browning SR, Seshadri S, Wijsman EM, Thornton TA. Admixture mapping implicates 13q33.3 as ancestry-of-origin locus for Alzheimer disease in Hispanic and Latino populations. HGG Adv 2023; 4:100207. [PMID: 37333771 PMCID: PMC10276158 DOI: 10.1016/j.xhgg.2023.100207] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2022] [Accepted: 05/16/2023] [Indexed: 06/20/2023] Open
Abstract
Alzheimer disease (AD) is the most common form of senile dementia, with high incidence late in life in many populations including Caribbean Hispanic (CH) populations. Such admixed populations, descended from more than one ancestral population, can present challenges for genetic studies, including limited sample sizes and unique analytical constraints. Therefore, CH populations and other admixed populations have not been well represented in studies of AD, and much of the genetic variation contributing to AD risk in these populations remains unknown. Here, we conduct genome-wide analysis of AD in multiplex CH families from the Alzheimer Disease Sequencing Project (ADSP). We developed, validated, and applied an implementation of a logistic mixed model for admixture mapping with binary traits that leverages genetic ancestry to identify ancestry-of-origin loci contributing to AD. We identified three loci on chromosome 13q33.3 associated with reduced risk of AD, where associations were driven by Native American (NAM) ancestry. This AD admixture mapping signal spans the FAM155A, ABHD13, TNFSF13B, LIG4, and MYO16 genes and was supported by evidence for association in an independent sample from the Alzheimer's Genetics in Argentina-Alzheimer Argentina consortium (AGA-ALZAR) study with considerable NAM ancestry. We also provide evidence of NAM haplotypes and key variants within 13q33.3 that segregate with AD in the ADSP whole-genome sequencing data. Interestingly, the widely used genome-wide association study approach failed to identify associations in this region. Our findings underscore the potential of leveraging genetic ancestry diversity in recently admixed populations to improve genetic mapping, in this case for AD-relevant loci.
Collapse
Affiliation(s)
| | - Lisa A. Boyken
- Department of Biostatistics, University of Washington, Seattle, WA 98195, USA
| | - Elizabeth E. Blue
- Division of Medical Genetics/Department of Medicine, University of Washington, Seattle, WA 98195, USA
- Brotman Baty Institute for Precision Medicine, Seattle, WA 98195, USA
| | - Kelsey E. Grinde
- Department of Biostatistics, University of Washington, Seattle, WA 98195, USA
- Department of Mathematics, Statistics and Computer Science, Macalester College, Saint Paul, MN 55105, USA
| | - Rafael A. Nafikov
- Department of Biostatistics, University of Washington, Seattle, WA 98195, USA
- Division of Medical Genetics/Department of Medicine, University of Washington, Seattle, WA 98195, USA
| | - Harkirat K. Sohi
- Division of Medical Genetics/Department of Medicine, University of Washington, Seattle, WA 98195, USA
- Biomedical and Health Informatics Program, University of Washington, Seattle, WA 98195, USA
| | - Alejandro Q. Nato
- Division of Medical Genetics/Department of Medicine, University of Washington, Seattle, WA 98195, USA
- Department of Biomedical Sciences, Joan C. Edwards School of Medicine, Marshall University, Huntington, WV 25755, USA
| | - Joshua C. Bis
- Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle, WA 98101, USA
| | - Luis I. Brusco
- CENECON - Center of Behavioural Neurology and Neuropsychiatry, School of Medicine, University of Buenos Aires, C1121A6B Buenos Aires, Argentina
| | - Laura Morelli
- Laboratory of Brain Aging and Neurodegeneration-Fundación Instituto Leloir-IIBBA- National Scientific and Technical Research Council (CONICET), C1405BWE Ciudad Autónoma de Buenos Aires, Argentina
| | - Alfredo Ramirez
- Division of Neurogenetics and Molecular Psychiatry, Department of Psychiatry and Psychotherapy, University of Cologne, Medical Faculty, 50937 Cologne, Germany
- Department of Neurodegeneration and Gerontopsychiatry, University of Bonn, 53127 Bonn, Germany
- German Center for Neurodegenerative Diseases (DZNE), 53127 Bonn, Germany
- Excellence Cluster on Cellular Stress Responses in Aging-Associated Diseases (CECAD) University of Cologne, 50674 Cologne, Germany
- Department of Psychiatry, UT Health San Antonio, San Antonio, TX 78229, USA
- Glenn Biggs Institute for Alzheimer’s and Neurodegenerative Diseases, UT Health San Antonio, San Antonio, TX 78229, USA
| | - Maria Carolina Dalmasso
- Division of Neurogenetics and Molecular Psychiatry, Department of Psychiatry and Psychotherapy, University of Cologne, Medical Faculty, 50937 Cologne, Germany
- Neurosciences and Complex Systems Unit (EnyS), CONICET, Hospital El Cruce, National University A. Jauretche (UNAJ), B1888AAE Florencio Varela, Argentina
| | - Seth Temple
- Department of Statistics, University of Washington, Seattle, WA 98195, USA
| | - Claudia Satizabal
- Glenn Biggs Institute for Alzheimer’s and Neurodegenerative Diseases, UT Health San Antonio, San Antonio, TX 78229, USA
- Department of Population Health Sciences, University of Texas, San Antonio, TX 78229, USA
- Department of Neurology, University of Texas, San Antonio, TX 78229, USA
| | - Sharon R. Browning
- Department of Biostatistics, University of Washington, Seattle, WA 98195, USA
| | - Sudha Seshadri
- Department of Neurology, University of Texas, San Antonio, TX 78229, USA
| | - Ellen M. Wijsman
- Department of Biostatistics, University of Washington, Seattle, WA 98195, USA
- Division of Medical Genetics/Department of Medicine, University of Washington, Seattle, WA 98195, USA
| | - Timothy A. Thornton
- Department of Biostatistics, University of Washington, Seattle, WA 98195, USA
- Department of Statistics, University of Washington, Seattle, WA 98195, USA
| |
Collapse
|
2
|
Wijsman EM, Day TR, Thornton TA, Horimoto AR, Blue EE, Bis JC, Sohi HK, Nato AQ, Nafikov RA, Navas P, Saad M, Tsuang DW, Barral S, Vardarajan BN, Beecham GW, Martin ER, van Duijn CM, Pericak‐Vance MA, Mayeux R. Analysis of individual families implicates noncoding DNA variation and multiple biological pathways in Alzheimer’s disease risk. Alzheimers Dement 2020. [DOI: 10.1002/alz.046456] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | | | - Debby W. Tsuang
- Geriatric Research, Education, and Clinical Center VA Puget Sound Health Care System Seattle WA USA
| | | | | | | | - Eden R. Martin
- University of Miami Miller School of Medicine Miami FL USA
| | | | - Margaret A. Pericak‐Vance
- Dr. John T. Macdonald Foundation Department of Human Genetics University of Miami Miller School of Medicine Miami FL USA
| | | | | |
Collapse
|
3
|
Naj AC, Lin H, Vardarajan BN, White S, Lancour D, Ma Y, Schmidt M, Sun F, Butkiewicz M, Bush WS, Kunkle BW, Malamon J, Amin N, Choi SH, Hamilton-Nelson KL, van der Lee SJ, Gupta N, Koboldt DC, Saad M, Wang B, Nato AQ, Sohi HK, Kuzma A, Wang LS, Cupples LA, van Duijn C, Seshadri S, Schellenberg GD, Boerwinkle E, Bis JC, Dupuis J, Salerno WJ, Wijsman EM, Martin ER, DeStefano AL. Quality control and integration of genotypes from two calling pipelines for whole genome sequence data in the Alzheimer's disease sequencing project. Genomics 2019; 111:808-818. [PMID: 29857119 PMCID: PMC6397097 DOI: 10.1016/j.ygeno.2018.05.004] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2017] [Revised: 04/03/2018] [Accepted: 05/06/2018] [Indexed: 12/30/2022]
Abstract
The Alzheimer's Disease Sequencing Project (ADSP) performed whole genome sequencing (WGS) of 584 subjects from 111 multiplex families at three sequencing centers. Genotype calling of single nucleotide variants (SNVs) and insertion-deletion variants (indels) was performed centrally using GATK-HaplotypeCaller and Atlas V2. The ADSP Quality Control (QC) Working Group applied QC protocols to project-level variant call format files (VCFs) from each pipeline, and developed and implemented a novel protocol, termed "consensus calling," to combine genotype calls from both pipelines into a single high-quality set. QC was applied to autosomal bi-allelic SNVs and indels, and included pipeline-recommended QC filters, variant-level QC, and sample-level QC. Low-quality variants or genotypes were excluded, and sample outliers were noted. Quality was assessed by examining Mendelian inconsistencies (MIs) among 67 parent-offspring pairs, and MIs were used to establish additional genotype-specific filters for GATK calls. After QC, 578 subjects remained. Pipeline-specific QC excluded ~12.0% of GATK and 14.5% of Atlas SNVs. Between pipelines, ~91% of SNV genotypes across all QCed variants were concordant; 4.23% and 4.56% of genotypes were exclusive to Atlas or GATK, respectively; the remaining ~0.01% of discordant genotypes were excluded. For indels, variant-level QC excluded ~36.8% of GATK and 35.3% of Atlas indels. Between pipelines, ~55.6% of indel genotypes were concordant; while 10.3% and 28.3% were exclusive to Atlas or GATK, respectively; and ~0.29% of discordant genotypes were. The final WGS consensus dataset contains 27,896,774 SNVs and 3,133,926 indels and is publicly available.
Collapse
Affiliation(s)
- Adam C Naj
- Department of Biostatistics, Epidemiology, and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA; Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA.
| | - Honghuang Lin
- Department of Medicine, Boston University School of Medicine, Boston, MA, USA
| | - Badri N Vardarajan
- Department of Neurology, Columbia University Medical Center, New York, NY, USA
| | - Simon White
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
| | - Daniel Lancour
- Department of Biomedical Genetics, Boston University School of Medicine, Boston, MA, USA
| | - Yiyi Ma
- Department of Biomedical Genetics, Boston University School of Medicine, Boston, MA, USA
| | - Michael Schmidt
- John P. Hussman Institute for Human Genetics, University of Miami Miller School of Medicine, Miami, FL, USA
| | - Fangui Sun
- Department of Biostatistics, Boston University School of Public Health, Boston, MA, USA
| | - Mariusz Butkiewicz
- Department of Epidemiology and Biostatistics, Case Western Reserve University, Cleveland, OH, USA
| | - William S Bush
- Department of Epidemiology and Biostatistics, Case Western Reserve University, Cleveland, OH, USA
| | - Brian W Kunkle
- John P. Hussman Institute for Human Genetics, University of Miami Miller School of Medicine, Miami, FL, USA
| | - John Malamon
- Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Najaf Amin
- Department of Epidemiology, Erasmus Medical Center, Rotterdam, the Netherlands
| | - Seung Hoan Choi
- Department of Biostatistics, Boston University School of Public Health, Boston, MA, USA
| | - Kara L Hamilton-Nelson
- John P. Hussman Institute for Human Genetics, University of Miami Miller School of Medicine, Miami, FL, USA
| | - Sven J van der Lee
- Department of Epidemiology, Erasmus Medical Center, Rotterdam, the Netherlands
| | - Namrata Gupta
- Medical and Population Genetics Program, Broad Institute, Cambridge, MA, USA
| | - Daniel C Koboldt
- Institute for Genomic Medicine, Nationwide Children's Hospital, Columbus, OH, USA
| | - Mohamad Saad
- Department of Biostatistics, University of Washington, Seattle, WA, USA; Division of Medical Genetics, University of Washington, Seattle, WA, USA
| | - Bowen Wang
- Department of Statistics, University of Washington, Seattle, WA, USA
| | - Alejandro Q Nato
- Division of Medical Genetics, University of Washington, Seattle, WA, USA
| | - Harkirat K Sohi
- Division of Medical Genetics, University of Washington, Seattle, WA, USA
| | - Amanda Kuzma
- Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Li-San Wang
- Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - L Adrienne Cupples
- Department of Biostatistics, Boston University School of Public Health, Boston, MA, USA; The Framingham Heart Study, Framingham, MA, USA
| | - Cornelia van Duijn
- Department of Epidemiology, Erasmus Medical Center, Rotterdam, the Netherlands
| | - Sudha Seshadri
- The Framingham Heart Study, Framingham, MA, USA; Department of Neurology, Boston University School of Medicine, Boston, MA, USA
| | - Gerard D Schellenberg
- Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Eric Boerwinkle
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA; Human Genetics Center, University of Texas Health Science Center, Houston, TX, USA
| | - Joshua C Bis
- Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle, WA, USA
| | - Josée Dupuis
- Department of Biostatistics, Boston University School of Public Health, Boston, MA, USA; The Framingham Heart Study, Framingham, MA, USA
| | - William J Salerno
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
| | - Ellen M Wijsman
- Department of Biostatistics, University of Washington, Seattle, WA, USA; Division of Medical Genetics, University of Washington, Seattle, WA, USA
| | - Eden R Martin
- John P. Hussman Institute for Human Genetics, University of Miami Miller School of Medicine, Miami, FL, USA
| | - Anita L DeStefano
- Department of Biostatistics, Boston University School of Public Health, Boston, MA, USA; The Framingham Heart Study, Framingham, MA, USA; Department of Neurology, Boston University School of Medicine, Boston, MA, USA
| |
Collapse
|
4
|
Kunji K, Ullah E, Nato AQ, Wijsman EM, Saad M. GIGI-Quick: a fast approach to impute missing genotypes in genome-wide association family data. Bioinformatics 2019; 34:1591-1593. [PMID: 29267877 DOI: 10.1093/bioinformatics/btx782] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2017] [Accepted: 12/15/2017] [Indexed: 11/12/2022] Open
Abstract
Summary Genome-wide association studies have become common over the last ten years, with a shift towards targeting rare variants, especially in pedigree-data. Despite lower costs, sequencing for rare variants still remains expensive. To have a relatively large sample with acceptable cost, imputation approaches may be used, such as GIGI for pedigree data. GIGI is an imputation method that handles large pedigrees and is particularly good for rare variant imputation. GIGI requires a subset of individuals in a pedigree to be fully sequenced, while other individuals are sequenced only at relevant markers. The imputation will infer the missing genotypes at untyped markers. Running GIGI on large pedigrees for large numbers of markers can be very time consuming. We present GIGI-Quick as a method to efficiently split GIGI's input, run GIGI in parallel and efficiently merge the output to reduce the runtime with the number of cores. This allows obtaining imputation results faster, and therefore all subsequent association analyses. Availability and and implementation GIGI-Quick is open source and publicly available via: https://cse-git.qcri.org/Imputation/GIGI-Quick. Contact msaad@hbku.edu.qa. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Khalid Kunji
- Data Analytics Department, Qatar Computing Research Institute, Hamad Bin Khalifa University, Doha, Qatar
| | - Ehsan Ullah
- Data Analytics Department, Qatar Computing Research Institute, Hamad Bin Khalifa University, Doha, Qatar
| | - Alejandro Q Nato
- Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, WA 98195-7720, USA
| | - Ellen M Wijsman
- Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, WA 98195-7720, USA
| | - Mohamad Saad
- Data Analytics Department, Qatar Computing Research Institute, Hamad Bin Khalifa University, Doha, Qatar
| |
Collapse
|
5
|
Ullah E, Mall R, Abbas MM, Kunji K, Nato AQ, Bensmail H, Wijsman EM, Saad M. Comparison and assessment of family- and population-based genotype imputation methods in large pedigrees. Genome Res 2018; 29:125-134. [PMID: 30514702 PMCID: PMC6314157 DOI: 10.1101/gr.236315.118] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2018] [Accepted: 11/30/2018] [Indexed: 01/19/2023]
Abstract
Genotype imputation is widely used in genome-wide association studies to boost variant density, allowing increased power in association testing. Many studies currently include pedigree data due to increasing interest in rare variants coupled with the availability of appropriate analysis tools. The performance of population-based (subjects are unrelated) imputation methods is well established. However, the performance of family- and population-based imputation methods on family data has been subject to much less scrutiny. Here, we extensively compare several family- and population-based imputation methods on family data of large pedigrees with both European and African ancestry. Our comparison includes many widely used family- and population-based tools and another method, Ped_Pop, which combines family- and population-based imputation results. We also compare four subject selection strategies for full sequencing to serve as the reference panel for imputation: GIGI-Pick, ExomePicks, PRIMUS, and random selection. Moreover, we compare two imputation accuracy metrics: the Imputation Quality Score and Pearson's correlation R 2 for predicting power of association analysis using imputation results. Our results show that (1) GIGI outperforms Merlin; (2) family-based imputation outperforms population-based imputation for rare variants but not for common ones; (3) combining family- and population-based imputation outperforms all imputation approaches for all minor allele frequencies; (4) GIGI-Pick gives the best selection strategy based on the R 2 criterion; and (5) R 2 is the best measure of imputation accuracy. Our study is the first to extensively evaluate the imputation performance of many available family- and population-based tools on the same family data and provides guidelines for future studies.
Collapse
Affiliation(s)
- Ehsan Ullah
- Qatar Computing Research Institute, Hamad Bin Khalifa University, Doha, Qatar
| | - Raghvendra Mall
- Qatar Computing Research Institute, Hamad Bin Khalifa University, Doha, Qatar
| | - Mostafa M Abbas
- Qatar Computing Research Institute, Hamad Bin Khalifa University, Doha, Qatar
| | - Khalid Kunji
- Qatar Computing Research Institute, Hamad Bin Khalifa University, Doha, Qatar
| | - Alejandro Q Nato
- Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, Washington 98195-9460, USA.,Department of Biomedical Sciences, Joan C. Edwards School of Medicine, Marshall University, Huntington, West Virginia 25755, USA
| | - Halima Bensmail
- Qatar Computing Research Institute, Hamad Bin Khalifa University, Doha, Qatar
| | - Ellen M Wijsman
- Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, Washington 98195-9460, USA.,Department of Biostatistics, University of Washington, Seattle, Washington 98195-9460, USA
| | - Mohamad Saad
- Qatar Computing Research Institute, Hamad Bin Khalifa University, Doha, Qatar
| |
Collapse
|
6
|
Nafikov RA, Nato AQ, Sohi H, Wang B, Brown L, Horimoto AR, Vardarajan BN, Barral SM, Tosto G, Mayeux RP, Thornton TA, Blue E, Wijsman EM. Analysis of pedigree data in populations with multiple ancestries: Strategies for dealing with admixture in Caribbean Hispanic families from the ADSP. Genet Epidemiol 2018; 42:500-515. [PMID: 29862559 PMCID: PMC6160322 DOI: 10.1002/gepi.22133] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2017] [Revised: 05/04/2018] [Accepted: 05/14/2018] [Indexed: 11/12/2022]
Abstract
Multipoint linkage analysis is an important approach for localizing disease-associated loci in pedigrees. Linkage analysis, however, is sensitive to misspecification of marker allele frequencies. Pedigrees from recently admixed populations are particularly susceptible to this problem because of the challenge of accurately accounting for population structure. Therefore, increasing emphasis on use of multiethnic samples in genetic studies requires reevaluation of best practices, given data currently available. Typical strategies have been to compute allele frequencies from the sample, or to use marker allele frequencies determined by admixture proportions averaged over the entire sample. However, admixture proportions vary among pedigrees and throughout the genome in a family-specific manner. Here, we evaluate several approaches to model admixture in linkage analysis, providing different levels of detail about ancestral origin. To perform our evaluations, for specification of marker allele frequencies, we used data on 67 Caribbean Hispanic admixed families from the Alzheimer's Disease Sequencing Project. Our results show that choice of admixture model has an effect on the linkage analysis results. Variant-specific admixture proportions, computed for individual families, provide the most detailed regional admixture estimates, and, as such, are the most appropriate allele frequencies for linkage analysis. This likely decreases the number of false-positive results, and is straightforward to implement.
Collapse
Affiliation(s)
- Rafael A Nafikov
- Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, Washington
| | - Alejandro Q Nato
- Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, Washington
| | - Harkirat Sohi
- Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, Washington
| | - Bowen Wang
- Department of Statistics, University of Washington, Seattle, Washington
| | - Lisa Brown
- Department of Biostatistics, University of Washington, Seattle, Washington
| | - Andrea R Horimoto
- Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, Washington
| | | | - Sandra M Barral
- Department of Neurology, Columbia University, New York, Washington
| | - Giuseppe Tosto
- Department of Neurology, Columbia University, New York, Washington
| | - Richard P Mayeux
- Department of Neurology, Columbia University, New York, Washington
| | - Timothy A Thornton
- Department of Biostatistics, University of Washington, Seattle, Washington
| | - Elizabeth Blue
- Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, Washington
| | - Ellen M Wijsman
- Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, Washington.,Department of Biostatistics, University of Washington, Seattle, Washington
| |
Collapse
|
7
|
Blue EM, Brown LA, Conomos MP, Kirk JL, Nato AQ, Popejoy AB, Raffa J, Ranola J, Wijsman EM, Thornton T. Estimating relationships between phenotypes and subjects drawn from admixed families. BMC Proc 2016; 10:357-362. [PMID: 27980662 PMCID: PMC5133521 DOI: 10.1186/s12919-016-0056-3] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022] Open
Abstract
Background Estimating relationships among subjects in a sample, within family structures or caused by population substructure, is complicated in admixed populations. Inaccurate allele frequencies can bias both kinship estimates and tests for association between subjects and a phenotype. We analyzed the simulated and real family data from Genetic Analysis Workshop 19, and were aware of the simulation model. Results We found that kinship estimation is more accurate when marker data include common variants whose frequencies are less variable across populations. Estimates of heritability and association vary with age for longitudinally measured traits. Accounting for local ancestry identified different true associations than those identified by a traditional approach. Principal components aid kinship estimation and tests for association, but their utility is influenced by the frequency of the markers used to generate them. Conclusions Admixed families can provide a powerful resource for detecting disease loci, as well as analytical challenges. Allele frequencies, although difficult to adequately estimate in admixed populations, have a strong impact on the estimation of kinship, ancestry, and association with phenotypes. Approaches that acknowledge population structure in admixed families outperform those which ignore it.
Collapse
Affiliation(s)
- Elizabeth M Blue
- Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, WA 98195 USA
| | - Lisa A Brown
- Department of Biostatistics, University of Washington, Seattle, WA 98195 USA
| | - Matthew P Conomos
- Department of Biostatistics, University of Washington, Seattle, WA 98195 USA
| | - Jennifer L Kirk
- Department of Biostatistics, University of Washington, Seattle, WA 98195 USA
| | - Alejandro Q Nato
- Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, WA 98195 USA
| | - Alice B Popejoy
- Institute for Public Health Genetics, University of Washington, Seattle, WA 98195 USA
| | - Jesse Raffa
- Department of Statistics, University of Washington, Seattle, WA 98195 USA
| | - John Ranola
- Department of Statistics, University of Washington, Seattle, WA 98195 USA
| | - Ellen M Wijsman
- Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, WA 98195 USA ; Department of Biostatistics, University of Washington, Seattle, WA 98195 USA
| | - Timothy Thornton
- Department of Biostatistics, University of Washington, Seattle, WA 98195 USA
| |
Collapse
|
8
|
Saad M, Nato AQ, Grimson FL, Lewis SM, Brown LA, Blue EM, Thornton TA, Thompson EA, Wijsman EM. Identity-by-descent estimation with population- and pedigree-based imputation in admixed family data. BMC Proc 2016; 10:295-301. [PMID: 27980652 PMCID: PMC5133511 DOI: 10.1186/s12919-016-0046-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/29/2023] Open
Abstract
Background In the past few years, imputation approaches have been mainly used in population-based designs of genome-wide association studies, although both family- and population-based imputation methods have been proposed. With the recent surge of family-based designs, family-based imputation has become more important. Imputation methods for both designs are based on identity-by-descent (IBD) information. Apart from imputation, the use of IBD information is also common for several types of genetic analysis, including pedigree-based linkage analysis. Methods We compared the performance of several family- and population-based imputation methods in large pedigrees provided by Genetic Analysis Workshop 19 (GAW19). We also evaluated the performance of a new IBD mapping approach that we propose, which combines IBD information from known pedigrees with information from unrelated individuals. Results Different combinations of the imputation methods have varied imputation accuracies. Moreover, we showed gains from the use of both known pedigrees and unrelated individuals with our IBD mapping approach over the use of known pedigrees only. Conclusions Our results represent accuracies of different combinations of imputation methods that may be useful for data sets similar to the GAW19 pedigree data. Our IBD mapping approach, which uses both known pedigree and unrelated individuals, performed better than classical linkage analysis.
Collapse
Affiliation(s)
- Mohamad Saad
- Department of Biostatistics, University of Washington, Seattle, WA USA ; Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, WA USA
| | - Alejandro Q Nato
- Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, WA USA
| | - Fiona L Grimson
- Department of Statistics, University of Washington, Seattle, WA USA
| | - Steven M Lewis
- Department of Statistics, University of Washington, Seattle, WA USA
| | - Lisa A Brown
- Department of Biostatistics, University of Washington, Seattle, WA USA
| | - Elizabeth M Blue
- Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, WA USA
| | | | | | - Ellen M Wijsman
- Department of Biostatistics, University of Washington, Seattle, WA USA ; Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, WA USA ; Department of Genome Sciences, University of Washington, Seattle, WA USA
| |
Collapse
|
9
|
Truong DT, Shriberg LD, Smith SD, Chapman KL, Scheer-Cohen AR, DeMille MMC, Adams AK, Nato AQ, Wijsman EM, Eicher JD, Gruen JR. Multipoint genome-wide linkage scan for nonword repetition in a multigenerational family further supports chromosome 13q as a locus for verbal trait disorders. Hum Genet 2016; 135:1329-1341. [PMID: 27535846 PMCID: PMC5065602 DOI: 10.1007/s00439-016-1717-z] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2016] [Accepted: 07/22/2016] [Indexed: 12/19/2022]
Abstract
Verbal trait disorders encompass a wide range of conditions and are marked by deficits in five domains that impair a person's ability to communicate: speech, language, reading, spelling, and writing. Nonword repetition is a robust endophenotype for verbal trait disorders that is sensitive to cognitive processes critical to verbal development, including auditory processing, phonological working memory, and motor planning and programming. In the present study, we present a six-generation extended pedigree with a history of verbal trait disorders. Using genome-wide multipoint variance component linkage analysis of nonword repetition, we identified a region spanning chromosome 13q14-q21 with LOD = 4.45 between 52 and 55 cM, spanning approximately 5.5 Mb on chromosome 13. This region overlaps with SLI3, a locus implicated in reading disability in families with a history of specific language impairment. Our study of a large multigenerational family with verbal trait disorders further implicates the SLI3 region in verbal trait disorders. Future studies will further refine the specific causal genetic factors in this locus on chromosome 13q that contribute to language traits.
Collapse
Affiliation(s)
- D T Truong
- Department of Pediatrics, Yale School of Medicine, New Haven, CT, 06510, USA
| | - L D Shriberg
- Waisman Center, University of Wisconsin-Madison, Madison, WI, 53705, USA
| | - S D Smith
- Department of Pediatrics, University of Nebraska at Omaha, Omaha, NE, 68182, USA
| | - K L Chapman
- Department of Communication Sciences and Disorders, University of Utah, Salt Lake City, UT, 84112, USA
| | - A R Scheer-Cohen
- Department of Speech-Language Pathology, California State University, San Marcos, CA, 92096, USA
| | - M M C DeMille
- Department of Pediatrics, Yale School of Medicine, New Haven, CT, 06510, USA
| | - A K Adams
- Department of Genetics, Yale School of Medicine, New Haven, CT, 06510, USA
| | - A Q Nato
- Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, WA, 98195, USA
| | - E M Wijsman
- Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, WA, 98195, USA
- Department of Biostatistics and Department of Genome Sciences, University of Washington, Seattle, WA, 98195, USA
| | - J D Eicher
- Department of Genetics, Yale School of Medicine, New Haven, CT, 06510, USA
| | - J R Gruen
- Department of Pediatrics, Yale School of Medicine, New Haven, CT, 06510, USA.
- Department of Genetics, Yale School of Medicine, New Haven, CT, 06510, USA.
- Investigative Medicine Program, Yale School of Medicine, New Haven, CT, 06510, USA.
| |
Collapse
|
10
|
Nato AQ, Chapman NH, Sohi HK, Nguyen HD, Brkanac Z, Wijsman EM. PBAP: a pipeline for file processing and quality control of pedigree data with dense genetic markers. Bioinformatics 2015; 31:3790-8. [PMID: 26231429 PMCID: PMC4668752 DOI: 10.1093/bioinformatics/btv444] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2015] [Revised: 07/07/2015] [Accepted: 07/25/2015] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION Huge genetic datasets with dense marker panels are now common. With the availability of sequence data and recognition of importance of rare variants, smaller studies based on pedigrees are again also common. Pedigree-based samples often start with a dense marker panel, a subset of which may be used for linkage analysis to reduce computational burden and to limit linkage disequilibrium between single-nucleotide polymorphisms (SNPs). Programs attempting to select markers for linkage panels exist but lack flexibility. RESULTS We developed a pedigree-based analysis pipeline (PBAP) suite of programs geared towards SNPs and sequence data. PBAP performs quality control, marker selection and file preparation. PBAP sets up files for MORGAN, which can handle analyses for small and large pedigrees, typically human, and results can be used with other programs and for downstream analyses. We evaluate and illustrate its features with two real datasets. AVAILABILITY AND IMPLEMENTATION PBAP scripts may be downloaded from http://faculty.washington.edu/wijsman/software.shtml. CONTACT wijsman@uw.edu. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
| | | | | | - Hiep D Nguyen
- Division of Medical Genetics, Department of Medicine
| | | | - Ellen M Wijsman
- Division of Medical Genetics, Department of Medicine, Department of Biostatistics and Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| |
Collapse
|
11
|
Chapman NH, Nato AQ, Bernier R, Ankenman K, Sohi H, Munson J, Patowary A, Archer M, Blue EM, Webb SJ, Coon H, Raskind WH, Brkanac Z, Wijsman EM. Whole exome sequencing in extended families with autism spectrum disorder implicates four candidate genes. Hum Genet 2015. [PMID: 26204995 DOI: 10.1007/s00439-015-1585-y] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
Autism spectrum disorders (ASDs) are a group of neurodevelopmental disorders, characterized by impairment in communication and social interactions, and by repetitive behaviors. ASDs are highly heritable, and estimates of the number of risk loci range from hundreds to >1000. We considered 7 extended families (size 12-47 individuals), each with ≥3 individuals affected by ASD. All individuals were genotyped with dense SNP panels. A small subset of each family was typed with whole exome sequence (WES). We used a 3-step approach for variant identification. First, we used family-specific parametric linkage analysis of the SNP data to identify regions of interest. Second, we filtered variants in these regions based on frequency and function, obtaining exactly 200 candidates. Third, we compared two approaches to narrowing this list further. We used information from the SNP data to impute exome variant dosages into those without WES. We regressed affected status on variant allele dosage, using pedigree-based kinship matrices to account for relationships. The p value for the test of the null hypothesis that variant allele dosage is unrelated to phenotype was used to indicate strength of evidence supporting the variant. A cutoff of p = 0.05 gave 28 variants. As an alternative third filter, we required Mendelian inheritance in those with WES, resulting in 70 variants. The imputation- and association-based approach was effective. We identified four strong candidate genes for ASD (SEZ6L, HISPPD1, FEZF1, SAMD11), all of which have been previously implicated in other studies, or have a strong biological argument for their relevance.
Collapse
Affiliation(s)
- Nicola H Chapman
- Division of Medical Genetics, School of Medicine, University of Washington, Seattle, WA, USA
| | - Alejandro Q Nato
- Division of Medical Genetics, School of Medicine, University of Washington, Seattle, WA, USA
| | - Raphael Bernier
- Department of Psychiatry and Behavioral Sciences, University of Washington, Seattle, WA, USA
| | - Katy Ankenman
- Department of Psychiatry, University of California, San Francisco, CA, USA
| | - Harkirat Sohi
- Division of Medical Genetics, School of Medicine, University of Washington, Seattle, WA, USA
| | - Jeff Munson
- Department of Psychiatry and Behavioral Sciences, University of Washington, Seattle, WA, USA.,Center on Child Development and Disability, University of Washington, Seattle, WA, USA
| | - Ashok Patowary
- Department of Psychiatry and Behavioral Sciences, University of Washington, Seattle, WA, USA
| | - Marilyn Archer
- Department of Psychiatry and Behavioral Sciences, University of Washington, Seattle, WA, USA
| | - Elizabeth M Blue
- Division of Medical Genetics, School of Medicine, University of Washington, Seattle, WA, USA
| | - Sara Jane Webb
- Department of Psychiatry and Behavioral Sciences, University of Washington, Seattle, WA, USA.,Center on Child Development and Disability, University of Washington, Seattle, WA, USA
| | - Hilary Coon
- Department of Internal Medicine, University of Utah, Salt Lake City, UT, USA.,Department of Psychiatry, School of Medicine, University of Utah, Salt Lake City, UT, USA
| | - Wendy H Raskind
- Division of Medical Genetics, School of Medicine, University of Washington, Seattle, WA, USA.,Department of Psychiatry and Behavioral Sciences, University of Washington, Seattle, WA, USA.,Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Zoran Brkanac
- Department of Psychiatry and Behavioral Sciences, University of Washington, Seattle, WA, USA
| | - Ellen M Wijsman
- Division of Medical Genetics, School of Medicine, University of Washington, Seattle, WA, USA. .,Department of Biostatistics, University of Washington, Seattle, WA, USA. .,Department of Genome Sciences, University of Washington, Seattle, WA, USA. .,University of Washington, University of Washington Tower, T15, 4333 Brooklyn Ave, NE, BOX 359460, Seattle, WA, 98195-9460, USA.
| |
Collapse
|
12
|
Li BI, Matteson PG, Ababon MF, Nato AQ, Lin Y, Nanda V, Matise TC, Millonig JH. The orphan GPCR, Gpr161, regulates the retinoic acid and canonical Wnt pathways during neurulation. Dev Biol 2015; 402:17-31. [PMID: 25753732 DOI: 10.1016/j.ydbio.2015.02.007] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2014] [Revised: 02/03/2015] [Accepted: 02/11/2015] [Indexed: 10/23/2022]
Abstract
The vacuolated lens (vl) mouse mutation arose on the C3H/HeSnJ background and results in lethality, neural tube defects (NTDs) and cataracts. The vl phenotypes are due to a deletion/frameshift mutation in the orphan GPCR, Gpr161. A recent study using a null allele demonstrated that Gpr161 functions in primary cilia and represses the Shh pathway. We show the hypomorphic Gpr161(vl) allele does not severely affect the Shh pathway. To identify additional pathways regulated by Gpr161 during neurulation, we took advantage of naturally occurring genetic variation in the mouse. Previously Gpr161(vl-C3H) was crossed to different inbred backgrounds including MOLF/EiJ and the Gpr161(vl) mutant phenotypes were rescued. Five modifiers were mapped (Modvl: Modifier of vl) including Modvl5(MOLF). In this study we demonstrate the Modvl5(MOLF) congenic rescues the Gpr161(vl)-associated lethality and NTDs but not cataracts. Bioinformatics determined the transcription factor, Cdx1, is the only annotated gene within the Modvl5 95% CI co-expressed with Gpr161 during neurulation and not expressed in the eye. Using Cdx1 as an entry point, we identified the retinoid acid (RA) and canonical Wnt pathways as downstream targets of Gpr161. QRT-PCR, ISH and IHC determined that expression of RA and Wnt genes are down-regulated in Gpr161(vl/vl) but rescued by the Modvl5(MOLF) congenic during neurulation. Intraperitoneal RA injection restores expression of canonical Wnt markers and rescues Gpr161(vl/vl) NTDs. These results establish the RA and canonical Wnt as pathways downstream of Gpr161 during neurulation, and suggest that Modvl5(MOLF) bypasses the Gpr161(vl) mutation by restoring the activity of these pathways.
Collapse
Affiliation(s)
- Bo I Li
- Center for Advanced Biotechnology and Medicine, Rutgers University, Piscataway, NJ, United States; Robert Wood Johnson Medical School, Rutgers University, Piscataway, NJ, United States.
| | - Paul G Matteson
- Center for Advanced Biotechnology and Medicine, Rutgers University, Piscataway, NJ, United States; Robert Wood Johnson Medical School, Rutgers University, Piscataway, NJ, United States
| | - Myka F Ababon
- Center for Advanced Biotechnology and Medicine, Rutgers University, Piscataway, NJ, United States; Robert Wood Johnson Medical School, Rutgers University, Piscataway, NJ, United States
| | - Alejandro Q Nato
- Department of Genetics; Rutgers University, Piscataway, NJ, United States
| | - Yong Lin
- Division of Biometrics, Rutgers Cancer Institute of New Jersey, New Brunswick, NJ, United States
| | - Vikas Nanda
- Center for Advanced Biotechnology and Medicine, Rutgers University, Piscataway, NJ, United States; Department of Biochemistry, Rutgers University, Piscataway, NJ, United States; Robert Wood Johnson Medical School, Rutgers University, Piscataway, NJ, United States
| | - Tara C Matise
- Department of Genetics; Rutgers University, Piscataway, NJ, United States
| | - James H Millonig
- Center for Advanced Biotechnology and Medicine, Rutgers University, Piscataway, NJ, United States; Department of Neuroscience and Cell Biology, Rutgers University, Piscataway, NJ, United States; Robert Wood Johnson Medical School, Rutgers University, Piscataway, NJ, United States; Department of Genetics; Rutgers University, Piscataway, NJ, United States.
| |
Collapse
|
13
|
Dumitrescu L, Carty CL, Franceschini N, Hindorff LA, Cole SA, Bůžková P, Schumacher FR, Eaton CB, Goodloe RJ, Duggan DJ, Haessler J, Cochran B, Henderson BE, Cheng I, Johnson KC, Carlson CS, Love SA, Brown-Gentry K, Nato AQ, Quibrera M, Anderson G, Shohet RV, Ambite JL, Wilkens LR, Marchand LL, Haiman CA, Buyske S, Kooperberg C, North KE, Fornage M, Crawford DC. Post-genome-wide association study challenges for lipid traits: describing age as a modifier of gene-lipid associations in the Population Architecture using Genomics and Epidemiology (PAGE) study. Ann Hum Genet 2013; 77:416-25. [PMID: 23808484 DOI: 10.1111/ahg.12027] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2012] [Accepted: 03/28/2013] [Indexed: 11/27/2022]
Abstract
Numerous common genetic variants that influence plasma high-density lipoprotein cholesterol, low-density lipoprotein cholesterol (LDL-C), and triglyceride distributions have been identified via genome-wide association studies (GWAS). However, whether or not these associations are age-dependent has largely been overlooked. We conducted an association study and meta-analysis in more than 22,000 European Americans between 49 previously identified GWAS variants and the three lipid traits, stratified by age (males: <50 or ≥50 years of age; females: pre- or postmenopausal). For each variant, a test of heterogeneity was performed between the two age strata and significant Phet values were used as evidence of age-specific genetic effects. We identified seven associations in females and eight in males that displayed suggestive heterogeneity by age (Phet < 0.05). The association between rs174547 (FADS1) and LDL-C in males displayed the most evidence for heterogeneity between age groups (Phet = 1.74E-03, I(2) = 89.8), with a significant association in older males (P = 1.39E-06) but not younger males (P = 0.99). However, none of the suggestive modifying effects survived adjustment for multiple testing, highlighting the challenges of identifying modifiers of modest SNP-trait associations despite large sample sizes.
Collapse
Affiliation(s)
- Logan Dumitrescu
- Center for Human Genetics Research, Vanderbilt University, Nashville, TN
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
14
|
Kim W, Londono D, Zhou L, Xing J, Nato AQ, Musolf A, Matise TC, Finch SJ, Gordon D. Single-variant and multi-variant trend tests for genetic association with next-generation sequencing that are robust to sequencing error. Hum Hered 2013; 74:172-83. [PMID: 23594495 DOI: 10.1159/000346824] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
As with any new technology, next-generation sequencing (NGS) has potential advantages and potential challenges. One advantage is the identification of multiple causal variants for disease that might otherwise be missed by SNP-chip technology. One potential challenge is misclassification error (as with any emerging technology) and the issue of power loss due to multiple testing. Here, we develop an extension of the linear trend test for association that incorporates differential misclassification error and may be applied to any number of SNPs. We call the statistic the linear trend test allowing for error, applied to NGS, or LTTae,NGS. This statistic allows for differential misclassification. The observed data are phenotypes for unrelated cases and controls, coverage, and the number of putative causal variants for every individual at all SNPs. We simulate data considering multiple factors (disease mode of inheritance, genotype relative risk, causal variant frequency, sequence error rate in cases, sequence error rate in controls, number of loci, and others) and evaluate type I error rate and power for each vector of factor settings. We compare our results with two recently published NGS statistics. Also, we create a fictitious disease model based on downloaded 1000 Genomes data for 5 SNPs and 388 individuals, and apply our statistic to those data. We find that the LTTae,NGS maintains the correct type I error rate in all simulations (differential and non-differential error), while the other statistics show large inflation in type I error for lower coverage. Power for all three methods is approximately the same for all three statistics in the presence of non-differential error. Application of our statistic to the 1000 Genomes data suggests that, for the data downloaded, there is a 1.5% sequence misclassification rate over all SNPs. Finally, application of the multi-variant form of LTTae,NGS shows high power for a number of simulation settings, although it can have lower power than the corresponding single-variant simulation results, most probably due to our specification of multi-variant SNP correlation values. In conclusion, our LTTae,NGS addresses two key challenges with NGS disease studies; first, it allows for differential misclassification when computing the statistic; and second, it addresses the multiple-testing issue in that there is a multi-variant form of the statistic that has only one degree of freedom, and provides a single p value, no matter how many loci.
Collapse
Affiliation(s)
- Wonkuk Kim
- Department of Mathematics and Statistics, University of South Florida, Tampa, FL, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
15
|
Zhang L, Buzkova P, Wassel CL, Roman MJ, North KE, Crawford DC, Boston J, Brown-Gentry KD, Cole SA, Deelman E, Goodloe R, Wilson S, Heiss G, Jenny NS, Jorgensen NW, Matise TC, McClellan BE, Nato AQ, Ritchie MD, Franceschini N, Kao WHL. Lack of associations of ten candidate coronary heart disease risk genetic variants and subclinical atherosclerosis in four US populations: the Population Architecture using Genomics and Epidemiology (PAGE) study. Atherosclerosis 2013; 228:390-9. [PMID: 23587283 DOI: 10.1016/j.atherosclerosis.2013.02.038] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/30/2012] [Revised: 02/26/2013] [Accepted: 02/27/2013] [Indexed: 12/30/2022]
Abstract
BACKGROUND A number of genetic variants have been discovered by recent genome-wide association studies for their associations with clinical coronary heart disease (CHD). However, it is unclear whether these variants are also associated with the development of CHD as measured by subclinical atherosclerosis phenotypes, ankle brachial index (ABI), carotid artery intima-media thickness (cIMT) and carotid plaque. METHODS Ten CHD risk single nucleotide polymorphisms (SNPs) were genotyped in individuals of European American (EA), African American (AA), American Indian (AI), and Mexican American (MA) ancestry in the Population Architecture using Genomics and Epidemiology (PAGE) study. In each individual study, we performed linear or logistic regression to examine population-specific associations between SNPs and ABI, common and internal cIMT, and plaque. The results from individual studies were meta-analyzed using a fixed effect inverse variance weighted model. RESULTS None of the ten SNPs was significantly associated with ABI and common or internal cIMT, after Bonferroni correction. In the sample of 13,337 EA, 3809 AA, and 5353 AI individuals with carotid plaque measurement, the GCKR SNP rs780094 was significantly associated with the presence of plaque in AI only (OR = 1.32, 95% confidence interval: 1.17, 1.49, P = 1.08 × 10(-5)), but not in the other populations (P = 0.90 in EA and P = 0.99 in AA). A 9p21 region SNP, rs1333049, was nominally associated with plaque in EA (OR = 1.07, P = 0.02) and in AI (OR = 1.10, P = 0.05). CONCLUSIONS We identified a significant association between rs780094 and plaque in AI populations, which needs to be replicated in future studies. There was little evidence that the index CHD risk variants identified through genome-wide association studies in EA influence the development of CHD through subclinical atherosclerosis as assessed by cIMT and ABI across ancestries.
Collapse
Affiliation(s)
- Lili Zhang
- Department of Internal Medicine, Jacobi Medical Center, Bronx, NY, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
16
|
Shimada MK, Panchapakesan K, Tishkoff SA, Nato AQ, Hey J. Divergent haplotypes and human history as revealed in a worldwide survey of X-linked DNA sequence variation. Mol Biol Evol 2006; 24:687-98. [PMID: 17175528 DOI: 10.1093/molbev/msl196] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The population genetic history of a 10.1-kbp noncoding region of the human X chromosome was studied using the males of the HGDP-CEPH Human Genome Diversity Panel (672 individuals from 52 populations). The geographic distribution of patterns of variation was roughly consistent with previous studies, with the major exception that 1 highly divergent haplotype (haplotype X, hX) was observed at low frequency in widely scattered non-African populations and not at all observed in sub-Saharan African populations. Microsatellite (short tandem repeat) variation within the sequenced region was low among copies of hX, even though the estimated time of ancestry of hX and other sequences was 1.44 Myr. The estimated age of the common ancestor of all hX copies was 5,230 years (95% consistency index: 2,000-75,480 years). To further address the presence of hX in Africa, additional samples from Chad and Tanzania were screened. Five additional copies of hX were observed, consistent with a history in which hX was present in Africa prior to the migration of modern humans out of Africa and with eastern Africa being the source of non-African modern human populations. Taken together, these features of hX-that it is much older than other haplotypes and uncommon and patchily distributed throughout Africa, Europe, and Asia-present a cautionary tale for interpretations of human history.
Collapse
|