1
|
Variants of Toll-like receptor 6 associated with tuberculosis susceptibility in the Chinese Tibetan population. Microb Pathog 2021; 162:105208. [PMID: 34563610 DOI: 10.1016/j.micpath.2021.105208] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2021] [Revised: 07/07/2021] [Accepted: 09/22/2021] [Indexed: 02/05/2023]
Abstract
BACKGROUND Our investigation attempted to understand the role of innate immunity related genes played in tuberculosis. The relationship between single-nucleotide polymorphisms (SNPs) of three innate immunity-related genes (TLR6, MyD88, and TIRAP) and tuberculosis (TB) risk in two Chinese populations were explored. METHODS Totally 1185 Chinese Han, consisting of 580 active TB cases and 605 healthy controls (HCs), and 1216 Chinese Tibetan individuals including 613 TB patients and 603 HCs were enrolled to conduct two case-control studies. TagSNPs of the three genes were selected based on the HapMap database and genotyped by the SNPscan™ Kit. Haploview software 4.2 was applied to perform linkage disequilibrium (LD) analysis and online software SHEsis was used to discover significant haplotype block. RegulomeDB and HaploReg were applied to predict potential functional SNPs of the three genes. RESULTS The results showed that minor alleles of rs5743808 and rs5743827 of TLR6 were related with increased TB risk (p = 0.001, OR 95%CI = 1.51 (1.18-1.95) and p = 0.002, OR 95%CI = 1.42 (1.14-1.77)), and significant association was also observed between rs5743827 and TB risk in male subgroup (p = 0.003, OR 95%CI = 1.67 (1.91-2.35)) in the Tibetan population. For the Tibetan population, frequency of haplotype ACGT of rs1039559-rs3775073-rs5743808-rs5743827 of TLR6 was significantly higher in the TB group (p = 0.0008), while haplotype ATAC was significantly higher in the control group (p = 0.0002). The above associations remained after permutation and Bonferroni correction. No significant association was found in the Han population. Probable functions of tagSNPs of TLR6 and some other linked variants were discovered after bioinformatic analysis. CONCLUSIONS This study suggested that variants of TLR6 might be associated with TB risk in the Tibetan population, while not in the Han population. The difference between Chinese Han and Tibetan people will provide better understanding of tuberculosis.
Collapse
|
2
|
Variants of TLR1 associated with tuberculosis susceptibility in the Chinese Tibetan population but not in Han Chinese. INFECTION GENETICS AND EVOLUTION 2018; 61:53-59. [DOI: 10.1016/j.meegid.2018.02.021] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/28/2017] [Revised: 02/13/2018] [Accepted: 02/14/2018] [Indexed: 12/19/2022]
|
3
|
de Kovel CG, Mulder F, van Setten J, van ‘t Slot R, Al-Rubaish A, Alshehri AM, Al Faraidy K, Al-Ali A, Al-Madan M, Al Aqaili I, Larbi E, Al-Ali R, Alzahrani A, Asselbergs FW, Koeleman BPC, Al-Ali A. Exome-Wide Association Analysis of Coronary Artery Disease in the Kingdom of Saudi Arabia Population. PLoS One 2016; 11:e0146502. [PMID: 26849363 PMCID: PMC4744043 DOI: 10.1371/journal.pone.0146502] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2015] [Accepted: 12/17/2015] [Indexed: 11/28/2022] Open
Abstract
Coronary Artery Disease (CAD) remains the leading cause of mortality worldwide. Mortality rates associated with CAD have shown an exceptional increase particularly in fast developing economies like the Kingdom of Saudi Arabia (KSA). Over the past twenty years, CAD has become the leading cause of death in KSA and has reached epidemic proportions. This rise is undoubtedly caused by fast urbanization that is associated with a life-style that promotes CAD. However, the question remains whether genetics play a significant role and whether genetic susceptibility is increased in KSA compared to the well-studied Western European populations. Therefore, we performed an Exome-wide association study (EWAS) in 832 patients and 1,076 controls of Saudi Arabian origin to test whether population specific, strong genetic risk factors for CAD exist, or whether the polygenic risk score for known genetic risk factors for CAD, lipids, and Type 2 Diabetes show evidence for an enriched genetic burden. Our results do not show significant associations for a single genetic locus. However, the heritability estimate for CAD for this population was high (h(2) = 0.53, S.E. = 0.1, p = 4e(-12)) and we observed a significant association of the polygenic risk score for CAD that demonstrates that the population of KSA, at least in part, shares the genetic risk associated to CAD in Western populations.
Collapse
Affiliation(s)
- Carolien G. de Kovel
- Department of Genetics, University Medical Center Utrecht, Utrecht, The Netherlands
| | - Flip Mulder
- Department of Genetics, University Medical Center Utrecht, Utrecht, The Netherlands
| | - Jessica van Setten
- Department of Genetics, University Medical Center Utrecht, Utrecht, The Netherlands
| | - Ruben van ‘t Slot
- Department of Genetics, University Medical Center Utrecht, Utrecht, The Netherlands
| | - Abdullah Al-Rubaish
- Department of Medicine, King Fahd Hospital of the University, University of Dammam, Dammam, Kingdom of Saudi Arabia
| | - Abdullah M. Alshehri
- Department of Medicine, King Fahd Hospital of the University, University of Dammam, Dammam, Kingdom of Saudi Arabia
| | - Khalid Al Faraidy
- Department of Cardiology, King Fahd Military Medical Complex, Al-Khobar, Kingdom of Saudi Arabia
| | - Abdullah Al-Ali
- Prince Sultan Cardiac Center, Al-Ahssa, Kingdom of Saudi Arabia
| | - Mohammed Al-Madan
- King Fahd Hospital of the University, University of Dammam, Dammam, Kingdom of Saudi Arabia
| | - Issa Al Aqaili
- Department of Medicine, Qatif Central Hospital, Qatif, Kingdom of Saudi Arabia
| | - Emmanuel Larbi
- Department of Medicine, King Fahd Hospital of the University, University of Dammam, Dammam, Kingdom of Saudi Arabia
| | - Rudaynah Al-Ali
- Department of Medicine, King Fahd Hospital of the University, University of Dammam, Dammam, Kingdom of Saudi Arabia
| | - Alhusain Alzahrani
- College of Applied Medical Sciences, King Saud University, Riyadh, Kingdom of Saudi Arabia
| | - Folkert W. Asselbergs
- Department of Cardiology, Division Heart & Lungs, University Medical Center Utrecht, Utrecht, The Netherlands
| | - Bobby P. C. Koeleman
- Department of Genetics, University Medical Center Utrecht, Utrecht, The Netherlands
| | - Amein Al-Ali
- Prince Mohammed Center for Research & Consultation Studies, College of Medicine, University of Dammam, Dammam, Kingdom of Saudi Arabia
| |
Collapse
|
4
|
Kaur H, Jajodia A, Grover S, Agarwal N, Baghel R, Kukreti R. Pharmacogenomics of neuropsychiatric disorders: analysis of genetic variability in 162 identified neuroreceptors using 1000 Genomes Project data. Pharmacogenomics 2015; 15:1575-87. [PMID: 25340732 DOI: 10.2217/pgs.14.113] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
BACKGROUND Neuroreceptors are considered to be primary drug targets and their abrupt signaling is a notable cause of interindividual drug response variability and treatment failure for complex neuropsychiatric diseases. In view of recent evidence, it is believed that common genetic risk factors mainly highly polymorphic neuroreceptors are being shared among neuropsychiatric disorders. MATERIALS & METHODS We identified 162 neuroreceptors from the 639 known receptors in Homo sapiens and investigated 231,683 SNPs using 1000 Genomes Project data and evaluated their biological effect using in silico tools including RegulomeDB, SIFT, PolyPhen-2 and CAROL. Furthermore, data from the 1000 Genomes Project was utilized to retrieve minor allele frequency and calculate pairwise logartithm of the odds score among these SNPs for African, American, Asian and European populations separately as well as when combined together using Haploview v4.2. LRTag was used to identify tagSNPs in populations. RESULTS A total of 52,381 (22.60%) SNPs were predicted as functionally important genetic variations. We identified sets of 603, 495, 450, 453 and 646 informative tagSNPs for African, American, Asian, European and combined populations, respectively. We propose construction of a 'neuroreceptor variants array' with these informative SNPs for future pharmacogenomic studies of neuropsychiatric disorders. CONCLUSION Such an approach might improve genotype-phenotype correlation across different populations and lead to identification of reliable genetic markers and novel drug targets. Integration of these SNPs in literature would further provide evidence relevant to underlying mechanisms of genetics based nosology, pathophysiology and development of new drugs for the treatment of neuropsychiatric disorders.
Collapse
Affiliation(s)
- Harpreet Kaur
- Genomics & Molecular Medicine Unit, CSIR-Institute of Genomics & Integrative Biology, Mall Road, Delhi-110007, India
| | | | | | | | | | | |
Collapse
|
5
|
Juyal G, Mondal M, Luisi P, Laayouni H, Sood A, Midha V, Heutink P, Bertranpetit J, Thelma BK, Casals F. Population and genomic lessons from genetic analysis of two Indian populations. Hum Genet 2014; 133:1273-87. [PMID: 24980708 DOI: 10.1007/s00439-014-1462-0] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2014] [Accepted: 06/05/2014] [Indexed: 12/25/2022]
Abstract
Indian demographic history includes special features such as founder effects, interpopulation segregation, complex social structure with a caste system and elevated frequency of consanguineous marriages. It also presents a higher frequency for some rare mendelian disorders and in the last two decades increased prevalence of some complex disorders. Despite the fact that India represents about one-sixth of the human population, deep genetic studies from this terrain have been scarce. In this study, we analyzed high-density genotyping and whole-exome sequencing data of a North and a South Indian population. Indian populations show higher differentiation levels than those reported between populations of other continents. In this work, we have analyzed its consequences, by specifically assessing the transferability of genetic markers from or to Indian populations. We show that there is limited genetic marker portability from available genetic resources such as HapMap or the 1,000 Genomes Project to Indian populations, which also present an excess of private rare variants. Conversely, tagSNPs show a high level of portability between the two Indian populations, in contrast to the common belief that North and South Indian populations are genetically very different. By estimating kinship from mates and consanguinity in our data from trios, we also describe different patterns of assortative mating and inbreeding in the two populations, in agreement with distinct mating preferences and social structures. In addition, this analysis has allowed us to describe genomic regions under recent adaptive selection, indicating differential adaptive histories for North and South Indian populations. Our findings highlight the importance of considering demography for design and analysis of genetic studies, as well as the need for extending human genetic variation catalogs to new populations and particularly to those with particular demographic histories.
Collapse
Affiliation(s)
- Garima Juyal
- Department of Genetics, University of Delhi South Campus, New Delhi, 110 021, India
| | | | | | | | | | | | | | | | | | | |
Collapse
|
6
|
Abstract
Genomic medicine presents many opportunities for improved health outcomes in the future. African countries, however, face many challenges in harnessing these opportunities for the benefit of African patients. Unique aspects that fuel these challenges include the enormous genetic diversity found in African individuals and communities across the continent; a high burden of infectious diseases and prioritized commitment of scarce public health resources to primary healthcare; limited economic resources for genomic health research and translation; and a history of the one-way transfer of samples, human resources and research translation off the continent. While these challenges are significant, there are opportunities for African countries to harness the economic and health benefits of genomic medicine for people in Africa. An active and supported biotechnology sector can provide an avenue for translating the benefits of African genomic research to African patients and populations.
Collapse
Affiliation(s)
- Nicki Tiffin
- South African National Bioinformatics Institute/SA Medical Research Council Bioinformatics Unit, University of the Western Cape, Cape Town, Western Cape, South Africa.,
| |
Collapse
|
7
|
Bernig T, Chanock SJ. Challenges of SNP genotyping and genetic variation: its future role in diagnosis and treatment of cancer. Expert Rev Mol Diagn 2014; 6:319-31. [PMID: 16706736 DOI: 10.1586/14737159.6.3.319] [Citation(s) in RCA: 57] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Thorough annotation of common germline genetic variation in the human genome has generated a foundation for the investigation of the contribution of genetics to the etiology and pathogenesis of cancer. For many malignancies, it has become increasingly apparent that numerous alleles, with small-to-moderate effects, additively contribute to cancer susceptibility. The most common genetic variant in the genome, the single nucleotide polymorphism, is of special interest for the study of susceptibility to and protection from cancer. Similarly, intense effort has focused on genetic variants that can predict either response or toxicity to therapeutic interventions. This review discusses the challenges and prospects of genetic association studies in cancer research. On the basis of recent changes in genomics and high-throughput genotyping platforms, future genetic findings of association studies could impact clinical care and public health screening.
Collapse
Affiliation(s)
- Toralf Bernig
- National Cancer Institute, Section on Genomic Variation, Pediatric Oncology Branch, National Institutes of Health, Bethesda, MD 20892-4605, USA.
| | | |
Collapse
|
8
|
|
9
|
Stathias V, Sotiris GR, Karagiannidis I, Bourikas G, Martinis G, Papazoglou D, Tavridou A, Papanas N, Maltezos E, Theodoridis M, Vargemezis V, Manolopoulos VG, Speed WC, Kidd JR, Kidd KK, Drineas P, Paschou P. Exploring genomic structure differences and similarities between the Greek and European HapMap populations: implications for association studies. Ann Hum Genet 2013; 76:472-83. [PMID: 23061745 DOI: 10.1111/j.1469-1809.2012.00730.x] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
Abstract
Studies of the genomic structure of the Greek population and Southeastern Europe are limited, despite the central position of the area as a gateway for human migrations into Europe. HapMap has provided a unique tool for the analysis of human genetic variation. Europe is represented by the CEU (Northwestern Europe) and the TSI populations (Tuscan Italians from Southern Europe), which serve as reference for the design of genetic association studies. Furthermore, genetic association findings are often transferred to unstudied populations. Although initial studies support the fact that the CEU can, in general, be used as reference for the selection of tagging SNPs in European populations, this has not been extensively studied across Europe. We set out to explore the genomic structure of the Greek population (56 individuals) and compare it to the HapMap TSI and CEU populations. We studied 1112 SNPs (27 regions, 13 chromosomes). Although the HapMap European populations are, in general, a good reference for the Greek population, regions of population differentiation do exist and results should not be light-heartedly generalized. We conclude that, perhaps due to the individual evolutionary history of each genomic region, geographic proximity is not always a perfect guide for selecting a reference population for an unstudied population.
Collapse
Affiliation(s)
- Vasileios Stathias
- Department of Molecular Biology and Genetics, Democritus University of Thrace, Alexandroupoli, Greece
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
10
|
Talledo M, López G, Huyghe JR, Verdonck K, González E, Clark D, Vanham G, Gotuzzo E, Van Camp G, Van Laer L. Possible implication of NFKB1A and NKG2D genes in susceptibility to HTLV-1-associated myelopathy/tropical spastic paraparesis in Peruvian patients infected with HTLV-1. J Med Virol 2012; 84:319-26. [PMID: 22170554 DOI: 10.1002/jmv.22255] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
The human T-cell lymphotropic virus type 1 (HTLV-1) is the etiological agent of HTLV-1-associated myelopathy/tropical spastic paraparesis (HAM/TSP), a progressive disease causing paraparesis of the lower limbs. Only a minority of persons infected with HTLV-1 develop HAM/TSP. Universal susceptibility factors for HAM/TSP are not known. The viral genotype is similar in asymptomatic carriers and HAM/TSP patients. High proviral load has been associated consistently with HAM/TSP, but this factor does not explain fully the presence of disease in HTLV-1-infected subjects. Most likely, host genetic factors will play an important role in HAM/TSP development. A two-stage case-control study was carried out to evaluate the association between HAM/TSP and candidate single nucleotide polymorphisms (SNPs) from 45 genes in addition to six human leukocyte antigen (HLA) alleles. Ancestry-informative markers were used to correct for population stratification. Several SNPs belonging to NFKB1A and NKG2D showed a trend of association in both stages. The fact that the direction of the association observed in the first stage was the same in the second stage suggests that NFKB1A and NKG2D may be implicated in the development of HAM/TSP. Further replication studies in independent HTLV-1 patient groups should validate further these associations.
Collapse
Affiliation(s)
- Michael Talledo
- Instituto de Medicina Tropical Alexander von Humboldt, Universidad Peruana Cayetano Heredia, Lima, Peru.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
11
|
Trifonova E, Eremina E, Urnov F, Stepanov V. The Genetic Diversity and Structure of Linkage Disequilibrium of the MTHFR Gene in Populations of Northern Eurasia. Acta Naturae 2012; 4:53-69. [PMID: 22708063 PMCID: PMC3372994] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
The structure of the haplotypes and linkage disequilibrium (LD) of the methylenetetrahydrofolate reductase gene (MTHFR) in 9 population groups from Northern Eurasia and populations of the international HapMap project was investigated in the present study. The data suggest that the architecture of LD in the human genome is largely determined by the evolutionary history of populations; however, the results of phylogenetic and haplotype analyses seems to suggest that in fact there may be a common "old" mechanism for the formation of certain patterns of LD. Variability in the structure of LD and the level of diversity of MTHFRhaplotypes cause a certain set of tagSNPs with an established prognostic significance for each population. In our opinion, the results obtained in the present study are of considerable interest for understanding multiple genetic phenomena: namely, the association of interpopulation differences in the patterns of LD with structures possessing a genetic susceptibility to complex diseases, and the functional significance of the pleiotropicMTHFR gene effect. Summarizing the results of this study, a conclusion can be made that the genetic variability analysis with emphasis on the structure of LD in human populations is a powerful tool that can make a significant contribution to such areas of biomedical science as human evolutionary biology, functional genomics, genetics of complex diseases, and pharmacogenomics.
Collapse
Affiliation(s)
- E.A. Trifonova
- Research Institute of Medical Genetics, Siberian Branch, Russian Academy
of Medical Sciences
| | | | | | - V.A. Stepanov
- Research Institute of Medical Genetics, Siberian Branch, Russian Academy
of Medical Sciences
| |
Collapse
|
12
|
Luisi P, Alvarez-Ponce D, Dall'Olio GM, Sikora M, Bertranpetit J, Laayouni H. Network-Level and Population Genetics Analysis of the Insulin/TOR Signal Transduction Pathway Across Human Populations. Mol Biol Evol 2011; 29:1379-92. [DOI: 10.1093/molbev/msr298] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023] Open
|
13
|
Javed A, Drineas P, Mahoney MW, Paschou P. Efficient genomewide selection of PCA-correlated tSNPs for genotype imputation. Ann Hum Genet 2011; 75:707-22. [PMID: 21902678 DOI: 10.1111/j.1469-1809.2011.00673.x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The linkage disequilibrium structure of the human genome allows identification of small sets of single nucleotide polymorphisms (SNPs) (tSNPs) that efficiently represent dense sets of markers. This structure can be translated into linear algebraic terms as evidenced by the well documented principal components analysis (PCA)-based methods. Here we apply, for the first time, PCA-based methodology for efficient genomewide tSNP selection; and explore the linear algebraic structure of the human genome. Our algorithm divides the genome into contiguous nonoverlapping windows of high linear structure. Coupling this novel window definition with a PCA-based tSNP selection method, we analyze 2.5 million SNPs from the HapMap phase 2 dataset. We show that 10-25% of these SNPs suffice to predict the remaining genotypes with over 95% accuracy. A comparison with other popular methods in the ENCODE regions indicates significant genotyping savings. We evaluate the portability of genome-wide tSNPs across a diverse set of populations (HapMap phase 3 dataset). Interestingly, African populations are good reference populations for the rest of the world. Finally, we demonstrate the applicability of our approach in a real genome-wide disease association study. The chosen tSNP panels can be used toward genotype imputation using either a simple regression-based algorithm or more sophisticated genotype imputation methods.
Collapse
Affiliation(s)
- Asif Javed
- Computational Biology Center, IBM TJ Watson Research, Yorktown Heights, NY 10598, USA
| | | | | | | |
Collapse
|
14
|
Abstract
DNA variants in a 31-kb region of the human major histocompatibility complex, encompassing the tumor necrosis factor (TNF) gene cluster, were surveyed by direct sequencing of 283 unrelated individuals from six Chinese populations. A total of 273 polymorphic sites were identified, with nearly half of them novel. We observed an excess of rare variants and negative values of selection tests of the region, implying either that these populations experienced a historical expansion or that the surveyed region was subjected to natural selection. Different characteristics of the sequence variation in the six populations outline the genetic differentiation between Northern and Southern Chinese populations. The distributions of recombination rates are similar among all the populations, with variation in the magnitude and/or in the fine location of hot spots. Tag single-nucleotide polymorphisms (SNPs) selected from HapMap (Phase II) CHB data accounted for an average of 64% of common SNPs from the six Chinese populations. We also observed a limited transferability of tag SNPs between Chinese populations on the 31-kb region with an excess of untaggable SNPs and ragged linkage disequilibrium blocks. It suggested that the design and interpretation of future association studies should be more cautious, and that a resequencing approach may refine tag SNP selection on Chinese-specific disease mapping.
Collapse
|
15
|
de Bakker PIW. Selection and evaluation of Tag-SNPs using Tagger and HapMap. Cold Spring Harb Protoc 2010; 2009:pdb.ip67. [PMID: 20147177 DOI: 10.1101/pdb.ip67] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
16
|
Hastie CE, Padmanabhan S, Dominiczak AF. Genome-Wide Association Studies of Hypertension: Light at the End of the Tunnel. Int J Hypertens 2010; 2010:509581. [PMID: 20981355 PMCID: PMC2958365 DOI: 10.4061/2010/509581] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2010] [Accepted: 02/07/2010] [Indexed: 01/08/2023] Open
Abstract
Despite its significant genetic component, the study of hypertension by genome-wide
association presents more challenges than other common complex diseases. Its high
prevalence, heterogeneity, and somewhat unclear definition are the challenges that need
to be overcome on one hand. On the other hand, there are issues of small effect sizes and
pleiotropism that are not specific to hypertension alone but nonetheless magnify the
problems of genetic dissection when coupled with phenotypic misclassification. We
discuss issues of study design and summarise published genome-wide association studies
(GWASs) of hypertension and blood pressure. With careful study design and analysis
success is possible, as demonstrated by the recent large-scale studies. Following these, there
is still further scope to advance the field through high fidelity phenotyping and deep
sequencing.
Collapse
Affiliation(s)
- Claire E. Hastie
- British Heart Foundation Cardiovascular Research Centre, University of Glasgow, 126 University Place, Glasgow G12 8TA, UK
| | - Sandosh Padmanabhan
- British Heart Foundation Cardiovascular Research Centre, University of Glasgow, 126 University Place, Glasgow G12 8TA, UK
| | - Anna F. Dominiczak
- British Heart Foundation Cardiovascular Research Centre, University of Glasgow, 126 University Place, Glasgow G12 8TA, UK
| |
Collapse
|
17
|
Lins TC, Abreu BS, Pereira RW. TagSNP transferability and relative loss of variability prediction from HapMap to an admixed population. J Biomed Sci 2009; 16:73. [PMID: 19682379 PMCID: PMC2737315 DOI: 10.1186/1423-0127-16-73] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2009] [Accepted: 08/14/2009] [Indexed: 01/30/2023] Open
Abstract
Background The application of a subset of single nucleotide polymorphisms, the tagSNPs, can be useful in capturing untyped SNPs information in a genomic region. TagSNP transferability from the HapMap dataset to admixed populations is of uncertain value due population structure, admixture, drift and recombination effects. In this work an empirical dataset from a Brazilian admixed sample was evaluated against the HapMap population to measure tagSNP transferability and the relative loss of variability prediction. Methods The transferability study was carried out using SNPs dispersed over four genomic regions: the PTPN22, HMGCR, VDR and CETP genes. Variability coverage and the prediction accuracy for tagSNPs in the selected genomic regions of HapMap phase II were computed using a prediction accuracy algorithm. Transferability of tagSNPs and relative loss of prediction were evaluated according to the difference between the Brazilian sample and the pooled and single HapMap population estimates. Results Each population presented different levels of prediction per gene. On average, the Brazilian (BRA) sample displayed a lower power of prediction when compared to HapMap and the pooled sample. There was a relative loss of prediction for BRA when using single HapMap populations, but a pooled HapMap dataset generated minor loss of variability prediction and lower standard deviations, except at the VDR locus at which loss was minor using CEU tagSNPs. Conclusion Studies that involve tagSNP selection for an admixed population should not be generally correlated with any specific HapMap population and can be better represented with a pooled dataset in most cases.
Collapse
Affiliation(s)
- Tulio C Lins
- Programa de Pós-Graduação em Ciências Genômicas e Biotecnologia, Universidade Católica de Brasília, Brasília, DF, Brazil.
| | | | | |
Collapse
|
18
|
Bosch E, Laayouni H, Morcillo-Suarez C, Casals F, Moreno-Estrada A, Ferrer-Admetlla A, Gardner M, Rosa A, Navarro A, Comas D, Graffelman J, Calafell F, Bertranpetit J. Decay of linkage disequilibrium within genes across HGDP-CEPH human samples: most population isolates do not show increased LD. BMC Genomics 2009; 10:338. [PMID: 19638193 PMCID: PMC2723139 DOI: 10.1186/1471-2164-10-338] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2009] [Accepted: 07/28/2009] [Indexed: 01/04/2023] Open
Abstract
BACKGROUND It is well known that the pattern of linkage disequilibrium varies between human populations, with remarkable geographical stratification. Indirect association studies routinely exploit linkage disequilibrium around genes, particularly in isolated populations where it is assumed to be higher. Here, we explore both the amount and the decay of linkage disequilibrium with physical distance along 211 gene regions, most of them related to complex diseases, across 39 HGDP-CEPH population samples, focusing particularly on the populations defined as isolates. Within each gene region and population we use r2 between all possible single nucleotide polymorphism (SNP) pairs as a measure of linkage disequilibrium and focus on the proportion of SNP pairs with r2 greater than 0.8. RESULTS Although the average r2 was found to be significantly different both between and within continental regions, a much higher proportion of r2 variance could be attributed to differences between continental regions (2.8% vs. 0.5%, respectively). Similarly, while the proportion of SNP pairs with r2 > 0.8 was significantly different across continents for all distance classes, it was generally much more homogenous within continents, except in the case of Africa and the Americas. The only isolated populations with consistently higher LD in all distance classes with respect to their continent are the Kalash (Central South Asia) and the Surui (America). Moreover, isolated populations showed only slightly higher proportions of SNP pairs with r2 > 0.8 per gene region than non-isolated populations in the same continent. Thus, the number of SNPs in isolated populations that need to be genotyped may be only slightly less than in non-isolates. CONCLUSION The "isolated population" label by itself does not guarantee a greater genotyping efficiency in association studies, and properties other than increased linkage disequilibrium may make these populations interesting in genetic epidemiology.
Collapse
Affiliation(s)
- Elena Bosch
- Institut de Biologia Evolutiva (UPF-CSIC), Departament de Ciències Experimentals i de Salut, Universitat Pompeu Fabra, Parc de Recerca Biomèdica de Barcelona, Barcelona, Spain.
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
19
|
Dissanayake VHW, Weerasekera LY, Gammulla CG, Jayasekara RW. Prevalence of genetic thrombophilic polymorphisms in the Sri Lankan population--implications for association study design and clinical genetic testing services. Exp Mol Pathol 2009; 87:159-62. [PMID: 19591822 DOI: 10.1016/j.yexmp.2009.07.002] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2009] [Revised: 06/18/2009] [Accepted: 07/01/2009] [Indexed: 11/20/2022]
Abstract
We investigated the prevalence of genotypes/alleles of single nucleotide polymorphisms (SNP) and haplotypes defined by them in three genes in which variations are associated with venous thromboembolism in 80 Sinhalese, 80 Sri Lankan Tamils and 80 Moors in the Sri Lankan population and compared the SNP data with that of other populations in Southern India and haplotype data with that of HapMap populations. The genes and polymorphisms investigated were Methylenetetrahydrofolate reductase (MTHFR) - 677C>T (rs1801133), 1298A>C (rs1801131), 1317T>C, 1793G>A (rs2274976); Factor V (F5) - 1691G>A (rs6025) and 4070A>G (rs1800595); and prothrombin (F2) - 20210G>A (rs1799963). The polymorphisms were genotyped using PCR/RFLP methods. The prevalence of the variant alleles of each polymorphism in the Sinhalese, Tamils, and Moors was MTHFR 677T: Sinhalese - 13%, Tamils - 9%, Moors - 9%. 1317T>C: Sinhalese - 0%; Tamils - 0%; Moors - 0%. 1793A: Sinhalese - 19%, Tamils - 19%, Moors - 19%. F5 1691A: Sinhalese - 2%, Tamils - 3%, Moors - 2%. 4070G: Sinhalese - 6%, Tamils - 5%, Moors - 8%. F2 20210A: Sinhalese - 0%, Tamils - 0%, Moors - 0%. The frequencies observed were similar to data from other South Indian populations; the haplotype data showed haplotypes unique to the Sri Lankan population when compared to HapMap populations. rs9651118 was identified as a SNP that splits the haplotypes harbouring the functionally significant 677T allele in the MTHFR gene. This data would be useful in planning genetic association studies in the Sri Lankan population and in deciding on which genetic variants should be tested in a clinical genetic testing service.
Collapse
Affiliation(s)
- Vajira H W Dissanayake
- Human Genetics Unit, Faculty of Medicine, University of Colombo, Kynsey Road, Colombo 8, Sri Lanka.
| | | | | | | |
Collapse
|
20
|
Isolated populations as treasure troves in genetic epidemiology: the case of the Basques. Eur J Hum Genet 2009; 17:1490-4. [PMID: 19417765 DOI: 10.1038/ejhg.2009.69] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022] Open
Abstract
The Basques are a culturally isolated population, living across the western border between France and Spain and speaking a non-Indo-European language. They show outlier allele frequencies in the ABO, RH, and HLA loci. To test whether Basques are a genetic isolate with the features that would make them good candidates in genetic association studies, we genotyped 123 SNPs in a 1-Mb region in chromosome 22 in Basque samples from France and Spain, as well as in samples from northern and southern Spain, and in three North African samples. Both Basque samples showed similar levels of heterozygosity to the other populations, and the decay of linkage disequilibrium with physical distance was not different between Basques and non-Basques. Thus, Basques do not show the genetic properties expected in population isolates.
Collapse
|
21
|
Foster MW. Looking for race in all the wrong places: analyzing the lack of productivity in the ongoing debate about race and genetics. Hum Genet 2009; 126:355-62. [DOI: 10.1007/s00439-009-0674-1] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2009] [Accepted: 04/17/2009] [Indexed: 12/25/2022]
|
22
|
Abstract
In the last decade, molecular beacons have emerged to become a widely used tool in the multiplex typing of single nucleotide polymorphisms (SNPs). Improvements in detection technologies in instrumentation and chemistries to label these probes have made it possible to use up to six spectrally distinguishable probes per reaction well. With the remarkable advances made in the characterization of human genome diversity, it has been possible to describe empirical patterns of SNPs and haplotype variation in the genome of diverse human populations. These patterns have revealed that the human genome is structured in blocks of strong linkage disequilibrium (LD). Because SNPs tend to be in LD with each other, common haplotypes share common SNPs and thus the majority of the diversity in a region can be characterized by typing a very small number of SNPs; so-called tag SNPs. Herein lies the advantage of the multiplexing ability of molecular beacons, since it becomes possible to use as few as 30 probes to interrogate several haplotypes in a high-throughput approach. Thus, through the combined use of tag SNPs and molecular beacons it becomes possible to type individuals for clinically relevant haplotypes in a high-throughput manner at a cost that is orders of magnitude less than that for high throughput sequencing methods.
Collapse
Affiliation(s)
- Anton A. Komar
- Center for Gene Regulation in, Cleveland State University, Euclid Ave. 2121, Cleveland, 44115 U.S.A
| |
Collapse
|
23
|
Casals F, Ferrer-Admetlla A, Sikora M, Ramírez-Soriano A, Marquès-Bonet T, Despiau S, Roubinet F, Calafell F, Bertranpetit J, Blancher A. Human pseudogenes of the ABO family show a complex evolutionary dynamics and loss of function. Glycobiology 2009; 19:583-91. [PMID: 19218399 DOI: 10.1093/glycob/cwp017] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
The GT6 glycosyltransferases gene family, that includes the ABO blood group, shows a complex evolution pattern, with multiple events of gain and loss in different mammal species. In humans the ABO gene is considered the sole functional member although the O allele is null and is fixed in certain populations. Here, we analyze the human GT6 pseudogene sequences (Forssman, IGB3, GGTA1, GT6m5, GT6m6, and GT6m7) from an evolutionary perspective, by the study of (i) their diversity levels in populations through the resequencing analysis of European and African individuals; (ii) the interpopulation differentiation, with genotyping data from a survey of populations covering most of human genetic diversity; and (iii) the interespecific divergence, by the comparison of the human and some other primate species sequences. Since pseudogenes are expected to evolve under neutrality, they should show an evolutionary pattern different to that of functional sequences, with higher levels of diversity as well as a ratio of nonsynonymous to synonymous changes close to 1. We describe some departures from these expectations, including selection for inactivation in IGB3, GGTA1, and the interesting case of FS (Forssman) with a probable shift of its initial function in the primate lineage, which put it apart from a pure neutral pseudogene. These results suggest that some of these GT6 human pseudogenes may still be functional and retain some valuable unknown function in humans, in some case even at the protein level. The evolutionary analysis of all members of the GT6 family in humans allows an insight into their functional history, a process likely due to the interaction of the host glycans that they synthesize with pathogens; the past process that can be unraveled through the footprints left by natural selection in the extant genome variation.
Collapse
Affiliation(s)
- Ferran Casals
- Institut de Biologia Evolutiva (CSIC-UPF), CEXS-UPF-PRBB, Barcelona, Catalonia, Spain
| | | | | | | | | | | | | | | | | | | |
Collapse
|
24
|
Huang L, Li Y, Singleton AB, Hardy JA, Abecasis G, Rosenberg NA, Scheet P. Genotype-imputation accuracy across worldwide human populations. Am J Hum Genet 2009; 84:235-50. [PMID: 19215730 DOI: 10.1016/j.ajhg.2009.01.013] [Citation(s) in RCA: 182] [Impact Index Per Article: 12.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2008] [Revised: 01/09/2009] [Accepted: 01/16/2009] [Indexed: 11/19/2022] Open
Abstract
A current approach to mapping complex-disease-susceptibility loci in genome-wide association (GWA) studies involves leveraging the information in a reference database of dense genotype data. By modeling the patterns of linkage disequilibrium in a reference panel, genotypes not directly measured in the study samples can be imputed and tested for disease association. This imputation strategy has been successful for GWA studies in populations well represented by existing reference panels. We used genotypes at 513,008 autosomal single-nucleotide polymorphism (SNP) loci in 443 unrelated individuals from 29 worldwide populations to evaluate the "portability" of the HapMap reference panels for imputation in studies of diverse populations. When a single HapMap panel was leveraged for imputation of randomly masked genotypes, European populations had the highest imputation accuracy, followed by populations from East Asia, Central and South Asia, the Americas, Oceania, the Middle East, and Africa. For each population, we identified "optimal" mixtures of reference panels that maximized imputation accuracy, and we found that in most populations, mixtures including individuals from at least two HapMap panels produced the highest imputation accuracy. From a separate survey of additional SNPs typed in the same samples, we evaluated imputation accuracy in the scenario in which all genotypes at a given SNP position were unobserved and were imputed on the basis of data from a commercial "SNP chip," again finding that most populations benefited from the use of combinations of two or more HapMap reference panels. Our results can serve as a guide for selecting appropriate reference panels for imputation-based GWA analysis in diverse populations.
Collapse
Affiliation(s)
- Lucy Huang
- Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, USA.
| | | | | | | | | | | | | |
Collapse
|
25
|
Use of weighted reference panels based on empirical estimates of ancestry for capturing untyped variation. Hum Genet 2009; 125:295-303. [PMID: 19184111 DOI: 10.1007/s00439-009-0627-8] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2008] [Accepted: 01/11/2009] [Indexed: 12/30/2022]
Abstract
Many association methods use a subset of genotyped single nucleotide polymorphisms (SNPs) to capture or infer genotypes at other untyped SNPs. We and others previously showed that tag SNPs selected to capture common variation using data from The International HapMap Consortium (Nature 437:1299-1320, 2005), The International HapMap Consortium (Nature 449:851-861, 2007) could also capture variation in populations of similar ancestry to HapMap reference populations (de Bakker et al. in Nat Genet 38:1298-1303, 2006; González-Neira et al. in Genome Res 16:323-330, 2006; Montpetit et al. in PLoS Genet 2:282-290, 2006; Mueller et al. in Am J Hum Genet 76:387-398, 2005). To capture variation in admixed populations or populations less similar to HapMap panels, a "cosmopolitan approach," in which all samples from HapMap are used as a single reference panel, was proposed. Here we refine this suggestion and show that use of a "weighted reference panel," constructed based on empirical estimates of ancestry in the target population (relative to available reference panels), is more efficient than the cosmopolitan approach. Weighted reference panels capture, on average, only slightly fewer common variants (minor allele frequency > 5%) than the cosmopolitan approach (mean r (2) = 0.977 vs. 0.989, 94.5% variation captured vs. 96.8% at r (2) > 0.8), across the five populations of the Multiethnic Cohort, but entail approximately 25% fewer tag SNPs per panel (average 538 vs. 718). These results extend a recent study in two Indian populations (Pemberton et al. in Ann Hum Genet 72:535-546, 2008). Weighted reference panels are potentially useful for both the selection of tag SNPs in diverse populations and perhaps in the design of reference panels for imputation of untyped genotypes in genome-wide association studies in admixed populations.
Collapse
|
26
|
Global similarity with local differences in linkage disequilibrium between the Dutch and HapMap-CEU populations. Eur J Hum Genet 2009; 17:802-10. [PMID: 19127282 DOI: 10.1038/ejhg.2008.248] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022] Open
Abstract
The HapMap project has facilitated the selection of tagging single nucleotide polymorphisms (tagSNPs) for genome-wide association studies (GWAS) under the assumption that linkage disequilibrium (LD) in the HapMap populations is similar to the populations under investigation. Earlier reports support this assumption, although in most of these studies only a few loci were evaluated. We compared pair-wise LD and LD block structure across autosomes between the Dutch population and the CEU-HapMap reference panel. The impact of sampling distribution on the estimation of LD blocks was studied by bootstrapping. A high Pearson correlation (genome-wide; 0.93) between pair-wise r(2) for the Dutch and the CEU populations was found, indicating that tagSNPs from the CEU-HapMap panel capture common variation in the Dutch population. However, some genomic regions exhibited, significantly lower correlation than the genome-wide estimate. This might decrease the validity of HapMap tagSNPs in these regions and the power of GWAS. The LD block structure differed considerably between the Dutch and CEU-HapMap populations. This was not explained by demographic differences between the CEU and Dutch samples, as testing for population stratification was not significant. We also found that sampling variation had a large effect on the estimation of LD blocks, as shown by the bootstrapping analysis. Thus, in small samples, most of the observed differences in LD blocks between populations are most likely the result of sampling variation. This poor concordance in LD block structure suggests that large samples are required for robust estimations of local LD block structure in populations.
Collapse
|
27
|
Gardner M, Bertranpetit J, Comas D. Worldwide genetic variation in dopamine and serotonin pathway genes: implications for association studies. Am J Med Genet B Neuropsychiatr Genet 2008; 147B:1070-5. [PMID: 18270970 DOI: 10.1002/ajmg.b.30717] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
The dopamine and serotonin systems are two of the most important neurotransmitter pathways in the human nervous system and their roles in controlling behavior and mental status are well accepted. Genes from both systems have been widely implicated in psychiatric and behavioral disorders, with numerous reports of associations and almost equally as numerous reports of the failure to replicate a previous finding of association. We investigate a set of 21 dopamine and serotonin genes commonly tested for association with psychiatric disease in a set of 39 worldwide populations representing global genetic diversity to see whether the failure to replicate findings of association may be explained by population based differences in allele frequencies and linkage disequilibrium (LD) in this gene set. We present results demonstrating a surprising homogeneity of the allele frequencies across worldwide populations in these genes. LD both for populations within continent groupings and across continental regions also showed a remarkable similarity. These findings taken together suggest that ethnic differences in these parameters are not major generators of artifacts in genetic association studies of psychiatric disorders with genes from this set. Therefore, factors other than ethnic differences in genetic variation may explain the discrepancies reported among genetic association studies with this set of genes to date. The transferability of tagSNPs defined in the HapMap populations to other worldwide populations was also investigated and found to be high. A list of tagSNPs per gene and continental region is proposed providing a guide for future association studies with these genes.
Collapse
Affiliation(s)
- Michelle Gardner
- Unitat de Biologia Evolutiva, Universitat Pompeu Fabra, Doctor Aiguader, Barcelona, Spain
| | | | | |
Collapse
|
28
|
Makeeva O, Stepanov V, Puzyrev V, Goldstein DB, Grossman I. Global pharmacogenetics: genetic substructure of Eurasian populations and its effect on variants of drug-metabolizing enzymes. Pharmacogenomics 2008; 9:847-68. [PMID: 18597650 DOI: 10.2217/14622416.9.7.847] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open
Abstract
AIMS To study the frequency distribution of cytochrome P450 (CYP) functional genetic variants in five Eurasian populations from the territory of Siberia in Russia. MATERIALS & METHODS Unrelated healthy Tuvinians, Buryats, Altaians, Yakuts and Russians (n = 87-88) were genotyped for CYP2C9*2, CYP2C9*3, CYP2C19*2, CYP2C19*3, CYP3A5*3 and CYP3A5*6. Standard pairwise genetic distances, locus-specific and global Fst statistics were calculated. RESULTS CYP allele and genotype frequencies demonstrated significant variability. Overall, the degree of between-population variance displayed by CYP SNPs was lower than that recorded from neutral short tandem repeats and Alu-insertion polymorphism, indicating evolutionary conservation of CYP polymorphisms. CYP-based genetic distances were well correlated with the geographic distances across populations (r = 0.822, p = 0.008). CONCLUSIONS Although the tested variants were present in the neighboring, yet secluded, populations at the expected range of frequencies, the observed frequencies were significantly variable across Eurasian populations, indicating potential relevance to clinical decision making.
Collapse
Affiliation(s)
- Oksana Makeeva
- Tomsk Research Institute of Medical Genetics, The Russian Academy of Medical Sciences, 10 Nab. Ushaiky, Tomsk, 634050 Russia.
| | | | | | | | | |
Collapse
|
29
|
Nordgard SH, Johansen FE, Alnaes GIG, Bucher E, Syvänen AC, Naume B, Børresen-Dale AL, Kristensen VN. Genome-wide analysis identifies 16q deletion associated with survival, molecular subtypes, mRNA expression, and germline haplotypes in breast cancer patients. Genes Chromosomes Cancer 2008; 47:680-96. [PMID: 18398821 DOI: 10.1002/gcc.20569] [Citation(s) in RCA: 79] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023] Open
Abstract
Breast carcinomas are characterized by DNA copy number alterations (CNAs) with biological and clinical significance. This explorative study integrated CNA, expression, and germline genotype data of 112 early-stage breast cancer patients. Recurrent CNAs differed substantially between tumor subtypes classified according to expression pattern. Deletion of 16q was overrepresented in Luminal A, and a predictor of good prognosis, both overall and for the nonluminal A subgroups. The deleted region most significantly associated with survival mapped to 16q22.2, harboring the genes TXNL4B and DXH38, whose expression was strongly correlated with the deletion. The area most frequently deleted resided on 16q23.1, 3.5 MB downstream of the area most significantly associated with survival, and included the tumor suppressor gene ADAMTS18 and the cell recognition gene CNTNAP4. Whole-genome association analysis identified germline single nucleotide polymorphisms (SNPs) and their corresponding haplotypes, residing on several different chromosomes, to be associated with deletion of 16q. The genes where these SNPs reside encode proteins involved in the extracellular matrix (CHST3 and SPOCK2), in regulation of the cell cycle (JMY, PTPRN2, and Cwf19L2) and chromosome stability (KPNB1).
Collapse
Affiliation(s)
- Silje H Nordgard
- Department of Genetics, Institute for Cancer Research, Norwegian Radium Hospital, Rikshospitalet University Hospital, Oslo, Norway
| | | | | | | | | | | | | | | |
Collapse
|
30
|
Hao K, Schadt EE, Storey JD. Calibrating the performance of SNP arrays for whole-genome association studies. PLoS Genet 2008; 4:e1000109. [PMID: 18584036 PMCID: PMC2432039 DOI: 10.1371/journal.pgen.1000109] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2008] [Accepted: 05/28/2008] [Indexed: 11/18/2022] Open
Abstract
To facilitate whole-genome association studies (WGAS), several high-density SNP genotyping arrays have been developed. Genetic coverage and statistical power are the primary benchmark metrics in evaluating the performance of SNP arrays. Ideally, such evaluations would be done on a SNP set and a cohort of individuals that are both independently sampled from the original SNPs and individuals used in developing the arrays. Without utilization of an independent test set, previous estimates of genetic coverage and statistical power may be subject to an overfitting bias. Additionally, the SNP arrays' statistical power in WGAS has not been systematically assessed on real traits. One robust setting for doing so is to evaluate statistical power on thousands of traits measured from a single set of individuals. In this study, 359 newly sampled Americans of European descent were genotyped using both Affymetrix 500K (Affx500K) and Illumina 650Y (Ilmn650K) SNP arrays. From these data, we were able to obtain estimates of genetic coverage, which are robust to overfitting, by constructing an independent test set from among these genotypes and individuals. Furthermore, we collected liver tissue RNA from the participants and profiled these samples on a comprehensive gene expression microarray. The RNA levels were used as a large-scale set of quantitative traits to calibrate the relative statistical power of the commercial arrays. Our genetic coverage estimates are lower than previous reports, providing evidence that previous estimates may be inflated due to overfitting. The Ilmn650K platform showed reasonable power (50% or greater) to detect SNPs associated with quantitative traits when the signal-to-noise ratio (SNR) is greater than or equal to 0.5 and the causal SNP's minor allele frequency (MAF) is greater than or equal to 20% (N = 359). In testing each of the more than 40,000 gene expression traits for association to each of the SNPs on the Ilmn650K and Affx500K arrays, we found that the Ilmn650K yielded 15% times more discoveries than the Affx500K at the same false discovery rate (FDR) level. Advances in SNP genotyping array technologies have made whole-genome association studies (WGAS) a readily available approach. Genetic coverage and the statistical power are two key properties to evaluate on the arrays. In this study, 359 newly sampled individuals were genotyped using Affymetrix 500K and Illumina 650Y SNP arrays. From these data, we obtained new estimates of genetic coverage by constructing a test set from among these genotypes and individuals that is independent from the SNPs and individuals used to construct the arrays. These estimates are notably smaller than previous ones, which we argue is due to an overfitting bias in previous studies. We also collected liver tissue RNA from the participants and profiled these samples on a comprehensive gene expression microarray. The RNA levels were used as a large-scale set of quantitative traits to calibrate the relative statistical power of the commercial arrays. Through this dataset and simulations, we find that the SNP arrays provide adequate power to detect quantitative trait loci when the causal SNP's minor allele frequency is greater than 20%, but low power is less than 10%. Importantly, we provide evidence that sample size has a greater impact on the power of WGAS than SNP density or genetic coverage.
Collapse
Affiliation(s)
- Ke Hao
- Rosetta Inpharmatics, Seattle, Washington, United States of America
| | - Eric E. Schadt
- Rosetta Inpharmatics, Seattle, Washington, United States of America
| | - John D. Storey
- Rosetta Inpharmatics, Seattle, Washington, United States of America
- * E-mail:
| |
Collapse
|
31
|
Stobdan T, Karar J, Pasha MAQ. High Altitude Adaptation: Genetic Perspectives. High Alt Med Biol 2008; 9:140-7. [DOI: 10.1089/ham.2007.1076] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Affiliation(s)
- Tsering Stobdan
- Institute of Genomics and Integrative Biology, Delhi, India, and Department of Biotechnology, University of Pune, Pune, India
| | - Jayashree Karar
- Institute of Genomics and Integrative Biology, Delhi, India, and Department of Biotechnology, University of Pune, Pune, India
| | - M. A. Qadar Pasha
- Institute of Genomics and Integrative Biology, Delhi, India, and Department of Biotechnology, University of Pune, Pune, India
| |
Collapse
|
32
|
Xing J, Witherspoon DJ, Watkins WS, Zhang Y, Tolpinrud W, Jorde LB. HapMap tagSNP transferability in multiple populations: general guidelines. Genomics 2008; 92:41-51. [PMID: 18482828 DOI: 10.1016/j.ygeno.2008.03.011] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2008] [Revised: 03/26/2008] [Accepted: 03/28/2008] [Indexed: 11/30/2022]
Abstract
Linkage disequilibrium (LD) has received much attention recently because of its value in localizing disease-causing genes. Due to the extensive LD between neighboring loci in the human genome, it is believed that a subset of the single nucleotide polymorphisms in a region (tagSNPs) can be selected to capture most of the remaining SNP variants. In this study, we examined LD patterns and HapMap tagSNP transferability in more than 300 individuals. A South Indian sample and an African Mbuti Pygmy population sample were included to evaluate the performance of HapMap tagSNPs in geographically distinct and genetically isolated populations. Our results show that HapMap tagSNPs selected with r(2) >= 0.8 can capture more than 85% of the SNPs in populations that are from the same continental group. Combined tagSNPs from HapMap CEU and CHB+JPT serve as the best reference for the Indian sample. The HapMap YRI are a sufficient reference for tagSNP selection in the Pygmy sample. In addition to our findings, we reviewed over 25 recent studies of tagSNP transferability and propose a general guideline for selecting tagSNPs from HapMap populations.
Collapse
Affiliation(s)
- Jinchuan Xing
- Department of Human Genetics, Eccles Institute of Human Genetics, University of Utah, Salt Lake City, UT 84112, USA
| | | | | | | | | | | |
Collapse
|
33
|
Hodgkinson CA, Yuan Q, Xu K, Shen PH, Heinz E, Lobos EA, Binder EB, Cubells J, Ehlers CL, Gelernter J, Mann J, Riley B, Roy A, Tabakoff B, Todd RD, Zhou Z, Goldman D. Addictions biology: haplotype-based analysis for 130 candidate genes on a single array. Alcohol Alcohol 2008; 43:505-15. [PMID: 18477577 DOI: 10.1093/alcalc/agn032] [Citation(s) in RCA: 206] [Impact Index Per Article: 12.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
AIMS To develop a panel of markers able to extract full haplotype information for candidate genes in alcoholism, other addictions and disorders of mood and anxiety. METHODS A total of 130 genes were haplotype tagged and genotyped in 7 case/control populations and 51 reference populations using Illumina GoldenGate SNP genotyping technology, determining haplotype coverage. We also constructed and determined the efficacy of a panel of 186 ancestry informative markers. RESULTS An average of 1465 loci were genotyped at an average completion rate of 91.3%, with an average call rate of 98.3% and replication rate of 99.7%. Completion and call rates were lowered by the performance of two datasets, highlighting the importance of the DNA quality in high throughput assays. A comparison of haplotypes captured by the Addictions Array tagging SNPs and commercially available whole-genome arrays from Illumina and Affymetrix shows comparable performance of the tag SNPs to the best whole-genome array in all populations for which data are available. CONCLUSIONS Arrays of haplotype-tagged candidate genes, such as this addictions-focused array, represent a cost-effective approach to generate high-quality SNP genotyping data useful for the haplotype-based analysis of panels of genes such as these 130 genes of interest to alcohol and addictions researchers. The inclusion of the 186 ancestry informative markers allows for the detection and correction for admixture and further enhances the utility of the array.
Collapse
Affiliation(s)
- Colin A Hodgkinson
- Laboratory of Neurogenetics, NIAAA, 5625 Fishers Lane, Room 3S32 MSC9412, Rockville, MD 20852-1728, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
34
|
Neale BM, Sham PC, Purcell S, Banaschewski T, Buitelaar J, Franke B, Sonuga-Barke E, Ebstein R, Eisenberg J, Mulligan A, Gill M, Manor I, Miranda A, Mulas F, Oades RD, Roeyers H, Rothenberger A, Sergeant J, Steinhausen HC, Taylor E, Thompson M, Chen W, Zhou K, Asherson P, Faraone SV. Population differences in the International Multi-Centre ADHD Gene Project. Genet Epidemiol 2008; 32:98-107. [PMID: 17868146 DOI: 10.1002/gepi.20265] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
The International Multi-Centre ADHD Gene sample consists of 674 families from eight countries (Belgium, England, Germany, Holland, Ireland, Israel, Spain, and Switzerland) ascertained from clinics for combined-type attention definity hyperactivity disorder in an offspring. 863 SNPs were successfully genotyped across 47 autosomal genes implicated in psychiatric disorders yielding a single nucleotide polymorphism (SNP) density of approximately one SNP per 2.5 kb. A global test of heterogeneity showed 269 SNPs nominally significant (expected 43). Inclusion of the Israeli population accounted for approximately 70% of these nominally significant tests. Hardy-Weinberg equilibrium tests suggest that combining all these populations would induce stratification, but that the Northern European populations (Belgium, England, Germany, Holland, and Ireland) could be appropriate. Tag SNPs were generated using pair-wise and aggressive tagging from Carlson et al. [2004] and de Bakker et al. [2005], respectively, in each population and applied to the other populations. Cross-population performance across Northern Europe was consistent with within population comparisons. Smaller sample size for each population tended to yield more problems for the generation of aggressive tags and the application of pair-wise tags. Any case-control sample employing an Israeli sample with Northern Europeans must consider stratification. A Northern European tag set, however, appears to be appropriate for capturing the variation across populations.
Collapse
Affiliation(s)
- Benjamin M Neale
- MRC Social Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, King's College London, UK.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
35
|
Abstract
The question of tagging single nucleotide polymorphism (tagSNP) transferability is an important one because many ongoing and upcoming Genome-Wide Association studies rely critically upon the validity, and practical feasibility of using a universal core set of tagSNPs. A series of recent studies analyzed performance of tagSNPs selected based on the HapMap. While these studies showed largely satisfactory transferability of the tagSNPs, they also reported that the level of transferability varies, substantively sometimes, especially when tagSNPs selected in one population were used in another distant population. We present a review of the literature about where and why tagSNP transferability may become a problem and suggest research directions that may help the resolution.
Collapse
Affiliation(s)
- C Charles Gu
- Division of Biostatistics, Washington University School of Medicine, St Louis, MO 63110, USA.
| | | | | | | | | |
Collapse
|
36
|
Evaluating linkage disequilibrium and recombination provides a haplotype-tagging SNP panel of the major histocompatibility complex in African Americans. Genes Immun 2008; 9:271-3. [PMID: 18305489 DOI: 10.1038/gene.2008.6] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
The major histocompatibility complex (MHC) (Chromosome 6p21.3) is a dynamic, immune gene-rich region that is associated with multiple diseases. Haplotype-tagging single-nucleotide polymorphism (htSNP) panels for the MHC can aid association studies but have only been reported for African, Asian and Caucasian populations to date. We genotyped 2154 SNPs spanning a 3.8-Mb region of the classical MHC in 94 healthy African Americans using Illumina BeadArray technology. We describe the haplotype structure of the MHC in African Americans, calculate the recombination rate (0.35 cM Mb(-1)) across the region, identify recombination hot spots and develop a panel of htSNPs for future genetic association studies in this population. We conclude that while patterns of LD and recombination are similar within the MHC to that reported in other populations, differences in minor allele frequency at specific markers necessitates an htSNP panel unique to African Americans, which we provide here for use in future genetic association studies.
Collapse
|
37
|
Gu CC, Yu K, Rao DC. Characterization of LD structures and the utility of HapMap in genetic association studies. ADVANCES IN GENETICS 2008; 60:407-35. [PMID: 18358328 DOI: 10.1016/s0065-2660(07)00415-4] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Observed distribution of and variation in linkage disequilibrium (LD) with respect to the evolution history and disease transmission in a population is the driving force behind the current wave of genome-wide association (GWA) studies of complex human diseases. An extensive literature covers topics from haplotype analysis that utilizes local LD structures in candidate genes and regions to genome-wide organization of LD blocks (neighborhood) that led to the development of International HapMap Project and panels of "tagSNPs" used by current GWA studies. In this chapter, we examine the scenarios where each of the major types of analysis methods may be applicable and where the current popular genotyping platforms for GWA might come short. We discuss current association analysis methods by emphasizing their reliance on the local LD structures or the global organization of the LD structures, and highlight the need to consider individual marker information content in large-scale association mapping.
Collapse
Affiliation(s)
- C Charles Gu
- Division of Biostatistics and Department of Genetics, Washington University School of Medicine, St. Louis, MO 63110, USA
| | | | | |
Collapse
|
38
|
|
39
|
Dong C, Qian Z, Jia P, Wang Y, Huang W, Li Y. Gene-centric characteristics of genome-wide association studies. PLoS One 2007; 2:e1262. [PMID: 18060058 PMCID: PMC2092383 DOI: 10.1371/journal.pone.0001262] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2007] [Accepted: 10/12/2007] [Indexed: 11/19/2022] Open
Abstract
BACKGROUND The high-throughput genotyping chips have contributed greatly to genome-wide association (GWA) studies to identify novel disease susceptibility single nucleotide polymorphisms (SNPs). The high-density chips are designed using two different SNP selection approaches, the direct gene-centric approach, and the indirect quasi-random SNPs or linkage disequilibrium (LD)-based tagSNPs approaches. Although all these approaches can provide high genome coverage and ascertain variants in genes, it is not clear to which extent these approaches could capture the common genic variants. It is also important to characterize and compare the differences between these approaches. METHODOLOGY/PRINCIPAL FINDINGS In our study, by using both the Phase II HapMap data and the disease variants extracted from OMIM, a gene-centric evaluation was first performed to evaluate the ability of the approaches in capturing the disease variants in Caucasian population. Then the distribution patterns of SNPs were also characterized in genic regions, evolutionarily conserved introns and nongenic regions, ontologies and pathways. The results show that, no mater which SNP selection approach is used, the current high-density SNP chips provide very high coverage in genic regions and can capture most of known common disease variants under HapMap frame. The results also show that the differences between the direct and the indirect approaches are relatively small. Both have similar SNP distribution patterns in these gene-centric characteristics. CONCLUSIONS/SIGNIFICANCE This study suggests that the indirect approaches not only have the advantage of high coverage but also are useful for studies focusing on various functional SNPs either in genes or in the conserved regions that the direct approach supports. The study and the annotation of characteristics will be helpful for designing and analyzing GWA studies that aim to identify genetic risk factors involved in common diseases, especially variants in genes and conserved regions.
Collapse
Affiliation(s)
- Changzheng Dong
- Bioinformatics Center, Key Lab of Systems Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China
- Chinese National Human Genome Center at Shanghai, Shanghai, China
- Graduate School of the Chinese Academy of Sciences, Beijing, China
| | - Ziliang Qian
- Bioinformatics Center, Key Lab of Systems Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China
- Graduate School of the Chinese Academy of Sciences, Beijing, China
| | - Peilin Jia
- Bioinformatics Center, Key Lab of Systems Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China
- Graduate School of the Chinese Academy of Sciences, Beijing, China
| | - Ying Wang
- Chinese National Human Genome Center at Shanghai, Shanghai, China
| | - Wei Huang
- Chinese National Human Genome Center at Shanghai, Shanghai, China
- Rui Jin Hospital, School of Medicine, Shanghai Jiaotong University, Shanghai, China
| | - Yixue Li
- Bioinformatics Center, Key Lab of Systems Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China
- Shanghai Center for Bioinformation Technology, Shanghai, China
| |
Collapse
|
40
|
|
41
|
Nam MH, Won HH, Lee KA, Kim JW. Effectiveness of in silico tagSNP selection methods: virtual analysis of the genotypes of pharmacogenetic genes. Pharmacogenomics 2007; 8:1347-57. [PMID: 17979509 DOI: 10.2217/14622416.8.10.1347] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
INTRODUCTION SNP tagging has been recently introduced, and the use of this strategy reduces the dimension of disease association studies and eventually saves on genotyping costs. There is no single set of tagging SNPs (tagSNPs) that will satisfy every association study design; thus, many different methods have been introduced. We evaluated various tagSNP selection methods using known haplotype data of pharmacogenetic genes. We also compared the selected tagSNPs among different ethnic groups. METHODS We collected genotype data for the NAT2 and CYP2D6 genes from the previously published literature where the linkage phase was resolved directly through molecular haplotyping. Three computational tagSNP selection methods (ldSelect, Tagger and TagIT software) were evaluated with these data sets. RESULTS Tagging effectiveness and efficiency were variable in all three tagSNP selection methods. No tagSNP sets were identical among the different ethnic groups. The haplotype r(2)-based method was more effective in determining genotype-phenotype correlation than the other methods employed. CONCLUSION All of the three computational tagSNP selection methods showed acceptable efficiency and effectiveness. The selected tagSNPs were different from each other among the different ethnic groups.
Collapse
Affiliation(s)
- Myung-Hyun Nam
- College of Medicine, Korea University, Department of Laboratory Medicine, Seoul 136-705, South Korea.
| | | | | | | |
Collapse
|
42
|
Wang S, Lewis CM, Jakobsson M, Ramachandran S, Ray N, Bedoya G, Rojas W, Parra MV, Molina JA, Gallo C, Mazzotti G, Poletti G, Hill K, Hurtado AM, Labuda D, Klitz W, Barrantes R, Bortolini MC, Salzano FM, Petzl-Erler ML, Tsuneto LT, Llop E, Rothhammer F, Excoffier L, Feldman MW, Rosenberg NA, Ruiz-Linares A. Genetic variation and population structure in native Americans. PLoS Genet 2007; 3:e185. [PMID: 18039031 PMCID: PMC2082466 DOI: 10.1371/journal.pgen.0030185] [Citation(s) in RCA: 347] [Impact Index Per Article: 20.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2007] [Accepted: 09/10/2007] [Indexed: 01/04/2023] Open
Abstract
We examined genetic diversity and population structure in the American landmass using 678 autosomal microsatellite markers genotyped in 422 individuals representing 24 Native American populations sampled from North, Central, and South America. These data were analyzed jointly with similar data available in 54 other indigenous populations worldwide, including an additional five Native American groups. The Native American populations have lower genetic diversity and greater differentiation than populations from other continental regions. We observe gradients both of decreasing genetic diversity as a function of geographic distance from the Bering Strait and of decreasing genetic similarity to Siberians--signals of the southward dispersal of human populations from the northwestern tip of the Americas. We also observe evidence of: (1) a higher level of diversity and lower level of population structure in western South America compared to eastern South America, (2) a relative lack of differentiation between Mesoamerican and Andean populations, (3) a scenario in which coastal routes were easier for migrating peoples to traverse in comparison with inland routes, and (4) a partial agreement on a local scale between genetic similarity and the linguistic classification of populations. These findings offer new insights into the process of population dispersal and differentiation during the peopling of the Americas.
Collapse
Affiliation(s)
- Sijia Wang
- The Galton Laboratory, Department of Biology, University College London, London, United Kingdom
| | - Cecil M Lewis
- Department of Human Genetics, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Mattias Jakobsson
- Department of Human Genetics, University of Michigan, Ann Arbor, Michigan, United States of America
- Center for Computational Medicine and Biology, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Sohini Ramachandran
- Department of Biological Sciences, Stanford University, Stanford, California, United States of America
| | - Nicolas Ray
- Computational and Molecular Population Genetics Lab, University of Bern, Bern, Switzerland
| | - Gabriel Bedoya
- Laboratorio de Genética Molecular, Universidad de Antioquia, Medellín, Colombia
| | - Winston Rojas
- Laboratorio de Genética Molecular, Universidad de Antioquia, Medellín, Colombia
| | - Maria V Parra
- Laboratorio de Genética Molecular, Universidad de Antioquia, Medellín, Colombia
| | - Julio A Molina
- Center for Neurobehavioral Genetics, University of California Los Angeles, Los Angeles, United States of America
| | - Carla Gallo
- Laboratorios de Investigación y Desarrollo, Facultad de Ciencias y Filosofía, Universidad Peruana Cayetano Heredia, Lima, Perú
| | - Guido Mazzotti
- Laboratorios de Investigación y Desarrollo, Facultad de Ciencias y Filosofía, Universidad Peruana Cayetano Heredia, Lima, Perú
| | - Giovanni Poletti
- Facultad de Medicina, Universidad Peruana Cayetano Heredia, Lima, Perú
| | - Kim Hill
- Department of Anthropology, University of New Mexico, Albuquerque, New Mexico, United States of America
| | - Ana M Hurtado
- Department of Anthropology, University of New Mexico, Albuquerque, New Mexico, United States of America
| | - Damian Labuda
- Département de Pédiatrie, CHU Sainte-Justine, Université de Montréal, Montréal, Quebec, Canada
| | - William Klitz
- School of Public Health, University of California Berkeley, Berkeley, California, United States of America
- Public Health Institute, Oakland, California, United States of America
| | - Ramiro Barrantes
- Escuela de Biología, Universidad de Costa Rica, San José, Costa Rica
| | - Maria Cátira Bortolini
- Departamento de Genética, Instituto de Biociências, Universidade Federal do Rio Grande do Sul, Porto Alegre, Rio Grande do Sul, Brazil
| | - Francisco M Salzano
- Departamento de Genética, Instituto de Biociências, Universidade Federal do Rio Grande do Sul, Porto Alegre, Rio Grande do Sul, Brazil
| | | | - Luiza T Tsuneto
- Departamento de Genética, Universidade Federal do Paraná, Curitiba, Paraná, Brazil
| | - Elena Llop
- Programa de Genética Humana, Instituto de Ciencias Biomédicas, Facultad de Medicina, Universidad de Chile, Santiago, Chile
| | - Francisco Rothhammer
- Programa de Genética Humana, Instituto de Ciencias Biomédicas, Facultad de Medicina, Universidad de Chile, Santiago, Chile
- Instituto de Alta Investigación, Universidad de Tarapacá, Arica, Chile
| | - Laurent Excoffier
- Computational and Molecular Population Genetics Lab, University of Bern, Bern, Switzerland
| | - Marcus W Feldman
- Department of Biological Sciences, Stanford University, Stanford, California, United States of America
| | - Noah A Rosenberg
- Department of Human Genetics, University of Michigan, Ann Arbor, Michigan, United States of America
- Center for Computational Medicine and Biology, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Andrés Ruiz-Linares
- The Galton Laboratory, Department of Biology, University College London, London, United Kingdom
| |
Collapse
|
43
|
Khrunin A, Verbenko D, Nikitina K, Limborska S. Regional differences in the genetic variability of Finno-Ugric speaking Komi populations. Am J Hum Biol 2007; 19:741-50. [PMID: 17691096 DOI: 10.1002/ajhb.20620] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022] Open
Abstract
The Komi (Komi-Zyryan) people are one of the most numerous ethnic groups belonging to the Finno-Ugric linguistic community. They occupy an extensive territory in north Russia to the west of the Ural Mountains, in the northeast of the East European Plain. This is an area of long-term interactions between Europeans and North Asians. Genetic variability was evaluated in two geographically distinct populations, the Izhemski and Priluzski Komi. We searched for polymorphisms of the TP53 gene (a 16-bp duplication in intron 3 and three RFLPs: for Bsh1236I at codon 72, for MspI in intron 6, and for BamHI in the 3' flanking region) and for variable number tandem repeat (VNTR) polymorphisms of locus D1S80 and of the 3' untranslated region of the gene for apolipoprotein B (ApoB). Some data from our previous studies of TP53, 3'ApoB, and D1S80 variability were involved in the comparison of Komi with other Eastern European populations. Multidimensional scaling analysis of genetic distances was used for the evaluation of genetic relationships between populations. The results revealed some affinity between Priluzski Komi and Eastern Slavonic populations, and significant segregation of Izhemski Komi from other ethnic groups studied. The unique genetic features of Izhemski Komi may have been determined by their ethnogenesis or the pressure of environmental factors, such as special nutrition and adaptation to extreme climatic conditions.
Collapse
Affiliation(s)
- Andrey Khrunin
- Human Genetics Molecular Bases Department, Institute of Molecular Genetics, Russian Academy of Sciences, 123182 Moscow, Russia.
| | | | | | | |
Collapse
|
44
|
Hersh CP, Raby BA, Soto-Quirós ME, Murphy AJ, Avila L, Lasky-Su J, Sylvia JS, Klanderman BJ, Lange C, Weiss ST, Celedón JC. Comprehensive testing of positionally cloned asthma genes in two populations. Am J Respir Crit Care Med 2007; 176:849-57. [PMID: 17702965 PMCID: PMC2048676 DOI: 10.1164/rccm.200704-592oc] [Citation(s) in RCA: 67] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2007] [Accepted: 08/07/2007] [Indexed: 11/16/2022] Open
Abstract
RATIONALE Replication of gene-disease associations has become a requirement in complex trait genetics. OBJECTIVES In studies of childhood asthma from two different ethnic groups, we attempted to replicate associations with five potential asthma susceptibility genes previously identified by positional cloning. METHODS We analyzed two family-based samples ascertained through an asthmatic proband: 497 European-American children from the Childhood Asthma Management Program and 439 Hispanic children from the Central Valley of Costa Rica. We genotyped 98 linkage disequilibrium-tagging single-nucleotide polymorphisms (SNPs) in five genes: ADAM33, DPP10, GPR154 (HUGO name: NPSR1), HLA-G, and the PHF11 locus (includes genes SETDB2 and RCBTB1). SNPs were tested for association with asthma and two intermediate phenotypes: airway hyperresponsiveness and total serum immunoglobulin E levels. MEASUREMENTS AND MAIN RESULTS Despite differing ancestries, linkage disequilibrium patterns were similar in both cohorts. Of the five evaluated genes, SNP-level replication was found only for GPR154 (NPSR1). In this gene, three SNPs were associated with asthma in both cohorts, although the opposite alleles were associated in either study. Weak evidence for locus-level replication with asthma was found in the PHF11 locus, although there was no overlap in the associated SNP across the two cohorts. No consistent associations were observed for the three other genes. CONCLUSIONS These results provide some further support for the role of genetic variation in GPR154 (NPSR1) and PHF11 in asthma susceptibility and also highlight the challenges of replicating genetic associations in complex traits such as asthma, even for genes identified by linkage analysis.
Collapse
Affiliation(s)
- Craig P Hersh
- Channing Laboratory and Center for Genomic Medicine, Brigham and Women's Hospital, Boston, Massachusetts 02115, USA.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
45
|
An assessment of the Irish population for large-scale genetic mapping studies involving epilepsy and other complex diseases. Eur J Hum Genet 2007; 16:176-83. [PMID: 17971835 DOI: 10.1038/sj.ejhg.5201938] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022] Open
Abstract
The recent completion of the International HapMap Project has rapidly advanced our understanding of linkage disequilibrium (LD) in the human genome. Today, tagging SNPs (tSNPs) can be quickly and easily selected and consequently HapMap data are regularly applied to both small- and large-scale genetic mapping studies. However, to correctly interpret the application of HapMap-derived tSNPs in a genetic mapping study, an understanding of how well HapMap data represents LD in the study population is critical. The Irish population had not previously been characterised in this way. Here, we do so using a set of 4424 SNPs selected from 279 candidate genes for epilepsy genotyped across 1118 healthy individuals from the Irish, British, Finnish and Australian populations. By considering the Irish population alongside surrounding European populations, our results confirm that the HapMap European-derived population accurately estimates patterning of LD in European descent populations. The Irish population appears notably well matched to the European HapMap population, and is markedly similar to the neighbouring British population. Although we were unable to detect significant substructure within the Irish population (a favourable result for genetic mapping), methods for controlling stratification should always be incorporated. This analysis therefore confirms that the genetic architecture of the Irish population is well suited to the study of complex traits and that tSNPs selected using the HapMap data can be confidently applied to the Irish population.
Collapse
|
46
|
Graffelman J, Camarena JM. Graphical tests for Hardy-Weinberg equilibrium based on the ternary plot. Hum Hered 2007; 65:77-84. [PMID: 17898538 DOI: 10.1159/000108939] [Citation(s) in RCA: 56] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2007] [Accepted: 05/24/2007] [Indexed: 11/19/2022] Open
Abstract
OBJECTIVE We design a graphical test for Hardy-Weinberg equilibrium. This can circumvent the calculation of p values and the statistical (non)significance of a large number of bi-allelic markers can be inferred from their position in a graph. METHOD By rewriting expressions for the chi(2) statistic (with and without continuity correction) in terms of the heterozygote frequency an acceptance region for Hardy-Weinberg equilibrium is obtained that can be depicted in a ternary plot. RESULTS We obtain equations for curves in the ternary plot that separate markers that are out of Hardy-Weinberg equilibrium from those that are in equilibrium. The curves depend on the chosen significance level, the sample size and on a continuity correction parameter. Some examples of graphical tests using a set of 106 SNPs on the long arm of human chromosome 22 are described. Significant markers and poor markers with a lot of missing values are easily identified in the proposed plots. R software for making the diagrams is provided. CONCLUSION The proposed graphs can be used as control charts for spotting problematic markers in large scale genotyping studies, and constitute an excellent tool for the graphical exploration of bi-allelic marker data.
Collapse
Affiliation(s)
- Jan Graffelman
- Department of Statistics and Operations Research, Universitat Politècnica de Catalunya, Barcelona, Spain.
| | | |
Collapse
|
47
|
Wang P, Koehle MS, Rupert JL. Common Haplotypes in the β-2 Adrenergic Receptor Gene Are Not Associated with Acute Mountain Sickness Susceptibility in Nepalese. High Alt Med Biol 2007; 8:206-12. [PMID: 17824821 DOI: 10.1089/ham.2007.1018] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Acute Mountain Sickness (AMS), the most common and least serious of the altitude-related illnesses, is frequently experienced by sojourners traveling above 2500 m. Although altitude and rate of ascent are likely the most critical factors in determining whether the condition will develop in a person, interindividual variation and patterns of susceptibility suggest that there may be genetic risk factors as well. We hypothesized that variants in the gene that encodes the beta-2 adrenergic receptor (the principal catecholamine receptor in the lungs) are involved in the etiology of AMS and tested this hypothesis in cohorts of Nepalese individuals who developed or did not develop AMS when attending the Purnima Festival at Lake Gosain Kunda at 4380. Polymorphisms that could serve as markers for the common haplotypes encompassing the gene were chosen using the HapMap database. We found no association between any alleles at the seven highly informative polymorphic loci (tagSNPs) that we assayed and AMS status, suggesting that variants in, or near, the beta-2 adrenergic receptor gene do not contribute to AMS susceptibility in this population. This study is the first application of the HapMap database and associated haplotype mapping tools to the understanding of altitude-related pathologies.
Collapse
Affiliation(s)
- Pei Wang
- School of Human Kinetics, University of British Columbia, Vancouver, Canada
| | | | | |
Collapse
|
48
|
Marvelle AF, Lange LA, Qin L, Wang Y, Lange EM, Adair LS, Mohlke KL. Comparison of ENCODE region SNPs between Cebu Filipino and Asian HapMap samples. J Hum Genet 2007; 52:729-737. [PMID: 17636361 DOI: 10.1007/s10038-007-0175-9] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2007] [Accepted: 06/22/2007] [Indexed: 10/23/2022]
Abstract
Patterns of linkage disequilibrium (LD) act as the framework for designing efficient association studies; these patterns are being studied and catalogued by The International HapMap Project. The current study assessed the transferability of tag SNPs chosen from HapMap panels to a cohort of 80 individuals from metro Cebu, Philippines, who participated in the Cebu Longitudinal Health and Nutrition Survey (CLHNS). The analyses focused on 627 single nucleotide polymorphisms (SNPs) in the central 40 kb within each of the 10 HapMap ENCODE regions. The similarity between the genetic variants in Cebu Filipino samples and HapMap panels was examined using allele frequency estimates, measures of pairwise linkage disequilibrium (LD), and haplotype frequency estimates. For these measures, strong correlations were observed between the Cebu Filipino samples and the Asian panels from HapMap, with the strongest correlations observed with the Han Chinese from Beijing (CHB) panel. Tag SNPs selected using the HapMap CHB panel were particularly effective at representing the genetic variation in Cebu Filipino samples. These results suggest that the HapMap data will be an effective resource for future studies in Cebu Filipino samples.
Collapse
Affiliation(s)
- Amanda F Marvelle
- Department of Genetics, School of Medicine, University of North Carolina at Chapel Hill, 4109-F Neuroscience Research Building, 103 Mason Farm Road, Chapel Hill, NC, 27599-7264, USA
| | - Leslie A Lange
- Department of Genetics, School of Medicine, University of North Carolina at Chapel Hill, 4109-F Neuroscience Research Building, 103 Mason Farm Road, Chapel Hill, NC, 27599-7264, USA
| | - Li Qin
- Department of Genetics, School of Medicine, University of North Carolina at Chapel Hill, 4109-F Neuroscience Research Building, 103 Mason Farm Road, Chapel Hill, NC, 27599-7264, USA
| | - Yunfei Wang
- Department of Genetics, School of Medicine, University of North Carolina at Chapel Hill, 4109-F Neuroscience Research Building, 103 Mason Farm Road, Chapel Hill, NC, 27599-7264, USA
| | - Ethan M Lange
- Department of Genetics, School of Medicine, University of North Carolina at Chapel Hill, 4109-F Neuroscience Research Building, 103 Mason Farm Road, Chapel Hill, NC, 27599-7264, USA
- Department of Biostatistics, School of Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
| | - Linda S Adair
- Department of Nutrition, Schools of Medicine and Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
| | - Karen L Mohlke
- Department of Genetics, School of Medicine, University of North Carolina at Chapel Hill, 4109-F Neuroscience Research Building, 103 Mason Farm Road, Chapel Hill, NC, 27599-7264, USA.
| |
Collapse
|
49
|
Graffelman J, Balding DJ, Gonzalez-Neira A, Bertranpetit J. Variation in estimated recombination rates across human populations. Hum Genet 2007; 122:301-10. [PMID: 17609980 DOI: 10.1007/s00439-007-0391-6] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2006] [Accepted: 06/01/2007] [Indexed: 12/11/2022]
Abstract
Recently it has been reported that recombination hotspots appear to be highly variable between humans and chimpanzees, and there is evidence for between-person variability in hotspots, and evolutionary transience. To understand the nature of variation in human recombination rates, it is important to describe patterns of variability across populations. Direct measurement of recombination rates remains infeasible on a large scale, and population-genetic approaches can be imprecise, and are affected by demographic history. Reports to date have suggested broad similarity in recombination rates at large genomic scales and across human populations. Here, we examine recombination rate estimates at a finer population and genomic scale: 28 worldwide populations and 107 SNPs in a 1 Mb stretch of chromosome 22q. We employ analysis of variance of recombination rate estimates, corrected for differences in effective population size using genome-wide microsatellite mutation rate estimates. We find substantial variation in fine-scale rates between populations, but reduced variation within continental groups. All effects examined (SNP-pair, region, population and interactions) were highly significant. Adjustment for effective population size made little difference to the conclusions. Observed hotspots tended to be conserved across populations, albeit at varying intensities. This holds particularly for populations from the same region, and also to a considerable degree across geographical regions. However, some hotspots appear to be population-specific. Several results from studies on the population history of humans are in accordance with our analysis. Our results suggest that between-population variation in DNA sequences may underly recombination rate variation.
Collapse
Affiliation(s)
- Jan Graffelman
- Department of Statistics and Operations Research, Universitat Politècnica de Catalunya, Avinguda Diagonal 647, 08028 Barcelona, Spain.
| | | | | | | |
Collapse
|
50
|
A genetic association analysis of cognitive ability and cognitive ageing using 325 markers for 109 genes associated with oxidative stress or cognition. BMC Genet 2007; 8:43. [PMID: 17601350 PMCID: PMC1933580 DOI: 10.1186/1471-2156-8-43] [Citation(s) in RCA: 61] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2007] [Accepted: 07/02/2007] [Indexed: 12/20/2022] Open
Abstract
Background Non-pathological cognitive ageing is a distressing condition affecting an increasing number of people in our 'ageing society'. Oxidative stress is hypothesised to have a major role in cellular ageing, including brain ageing. Results Associations between cognitive ageing and 325 single nucleotide polymorphisms (SNPs), located in 109 genes implicated in oxidative stress and/or cognition, were examined in a unique cohort of relatively healthy older people, on whom we have cognitive ability scores at ages 11 and 79 years (LBC1921). SNPs showing a significant positive association were then genotyped in a second cohort for whom we have cognitive ability scores at the ages of 11 and 64 years (ABC1936). An intronic SNP in the APP gene (rs2830102) was significantly associated with cognitive ageing in both LBC1921 and a combined LBC1921/ABC1936 analysis (p < 0.01), but not in ABC1936 alone. Conclusion This study suggests a possible role for APP in normal cognitive ageing, in addition to its role in Alzheimer's disease.
Collapse
|