1
|
Sang S, Ling J, Liu X, Mei L, Cai X, Li T, Li W, Li M, Wen J, Liu X, Liu J, Liu Y, Chen H, He C, Feng Y. Proband Whole-Exome Sequencing Identified Genes Responsible for Autosomal Recessive Non-Syndromic Hearing Loss in 33 Chinese Nuclear Families. Front Genet 2019; 10:639. [PMID: 31379920 PMCID: PMC6650584 DOI: 10.3389/fgene.2019.00639] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2019] [Accepted: 06/18/2019] [Indexed: 11/22/2022] Open
Abstract
Autosomal recessive non-syndromic hearing loss (ARNSHL) is a highly heterogeneous disease involving more than 70 pathogenic genes. However, most ARNSHL families have small-sized pedigrees with limited genetic information, rendering challenges for the molecular diagnosis of these patients. Therefore, we attempted to establish a strategy for identifying deleterious variants associated with ARNSHL by applying proband whole-exome sequencing (proband-WES). Aside from desiring to improve molecular diagnostic rates, we also aimed to search for novel deafness genes shared by patients with similar phenotype, making up for the deficiency of small ARNSHL families. In this study, 48.5% (16/33) families were detected the pathogenic variants in eight known deafness genes, including 10 novel variants identified in TMPRSS3 (MIM 605551), MYO15A (MIM 602666), TMC1 (MIM 606706), ADGRV1 (MIM 602851), and PTPRQ (MIM 603317). Apart from six novel variants with a truncating effect (nonsense, deletion, insertion, and splice-site), four novel missense variants were not found in 200 unrelated control population by using Sanger sequencing. It is important to note that none of novel genes were shared across different pedigrees, indicating that a larger sample size might be needed. Proband-WES is a cost-effective and precise way of identifying causative variants in nuclear families with ARNSHL. This economical strategy may be appropriated as a clinical application to provide molecular diagnostics, genetic counseling, and individualized health maintenance measures for patients with ARNSHL at hearing clinics.
Collapse
Affiliation(s)
- Shushan Sang
- Department of Otolaryngology, Xiangya Hospital, Central South University, Changsha, China.,Key Laboratory of Otolaryngology Major Diseases Research of Hunan Province, Changsha, China
| | - Jie Ling
- Department of Otolaryngology, Xiangya Hospital, Central South University, Changsha, China.,Institute of Molecular Precision Medicine, Xiangya Hospital, Central South University, Changsha, China.,Hunan Key Laboratory of Molecular Precision Medicine, Changsha, China
| | - Xuezhong Liu
- Department of Otolaryngology, Xiangya Hospital, Central South University, Changsha, China.,Department of Otolaryngology, University of Miami Miller School of Medicine, Miami, FL, United States.,Dr. John T. Macdonald Foundation Department of Human Genetics, University of Miami Miller School of Medicine, Miami, FL, United States
| | - Lingyun Mei
- Department of Otolaryngology, Xiangya Hospital, Central South University, Changsha, China.,Key Laboratory of Otolaryngology Major Diseases Research of Hunan Province, Changsha, China
| | - Xinzhang Cai
- Department of Otolaryngology, Xiangya Hospital, Central South University, Changsha, China.,Key Laboratory of Otolaryngology Major Diseases Research of Hunan Province, Changsha, China
| | - Taoxi Li
- Department of Otolaryngology, Xiangya Hospital, Central South University, Changsha, China.,Key Laboratory of Otolaryngology Major Diseases Research of Hunan Province, Changsha, China.,Hunan Jiahui Genetics Hospital, Changsha, China
| | - Wu Li
- Department of Otolaryngology, Xiangya Hospital, Central South University, Changsha, China.,Key Laboratory of Otolaryngology Major Diseases Research of Hunan Province, Changsha, China
| | - Meng Li
- Department of Otolaryngology, Xiangya Hospital, Central South University, Changsha, China.,Key Laboratory of Otolaryngology Major Diseases Research of Hunan Province, Changsha, China
| | - Jie Wen
- Department of Otolaryngology, Xiangya Hospital, Central South University, Changsha, China.,Key Laboratory of Otolaryngology Major Diseases Research of Hunan Province, Changsha, China
| | - Xianlin Liu
- Department of Otolaryngology, Xiangya Hospital, Central South University, Changsha, China.,Key Laboratory of Otolaryngology Major Diseases Research of Hunan Province, Changsha, China
| | - Jing Liu
- Department of Otolaryngology, Xiangya Hospital, Central South University, Changsha, China.,Key Laboratory of Otolaryngology Major Diseases Research of Hunan Province, Changsha, China
| | - Yalan Liu
- Department of Otolaryngology, Xiangya Hospital, Central South University, Changsha, China.,Key Laboratory of Otolaryngology Major Diseases Research of Hunan Province, Changsha, China
| | - Hongsheng Chen
- Department of Otolaryngology, Xiangya Hospital, Central South University, Changsha, China.,Key Laboratory of Otolaryngology Major Diseases Research of Hunan Province, Changsha, China
| | - Chufeng He
- Department of Otolaryngology, Xiangya Hospital, Central South University, Changsha, China.,Key Laboratory of Otolaryngology Major Diseases Research of Hunan Province, Changsha, China
| | - Yong Feng
- Department of Otolaryngology, Xiangya Hospital, Central South University, Changsha, China.,Key Laboratory of Otolaryngology Major Diseases Research of Hunan Province, Changsha, China.,Hunan Jiahui Genetics Hospital, Changsha, China.,National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, Changsha, China
| |
Collapse
|
2
|
Qi YY, Zhou XJ, Nath SK, Sun C, Wang YN, Hou P, Mu R, Li C, Guo JP, Li ZG, Wang G, Xu HJ, Hao YJ, Zhang ZL, Yue WH, Zhang H, Zhao MH, Zhang H. A Rare Variant (rs933717) at FBXO31-MAP1LC3B in Chinese Is Associated With Systemic Lupus Erythematosus. Arthritis Rheumatol 2018; 70:287-297. [PMID: 29044928 DOI: 10.1002/art.40353] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2017] [Accepted: 10/10/2017] [Indexed: 12/20/2022]
Abstract
OBJECTIVE Recent evidence from genetic, cell biology, and animal model studies has suggested a pivotal role of autophagy in mediating systemic lupus erythematosus (SLE). However, the genetic basis has not yet been thoroughly examined. Therefore, the aim of the present study was to identify additional susceptibility variants in autophagy-related genes along with their functional significance. METHODS First, we performed a gene family-based genetic association analysis in SLE patients with the use of ImmunoChip arrays, and then we selected the most strongly associated polymorphisms for replication in additional cohorts. To identify regulatory clues, we analyzed publicly available blood expression quantitative trait locus data and Encyclopedia of DNA Elements data on transcription factor binding sites and cell type-specific differential expression. Functional effects were tested by luciferase reporter assays, electrophoretic mobility shift assays, and differential gene expression assays. RESULTS In 14,474 samples, we observed that the rare Chinese variant rs933717T was associated with susceptibility to SLE (0.11% in cases versus 0.87% in controls; P = 2.36 × 10-10 , odds ratio 0.13). The rs933717 risk allele C correlated with increased MAP1LC3B expression; increased MAP1LC3B messenger RNA was observed in SLE patients and in lupus-prone mice. In reporter gene constructs, the risk allele increased luciferase activity up to 2.7-3.8-fold in both HEK 293T and Jurkat cell lines, and the binding of HEK 293T and Jurkat cell nuclear extracts to the risk allele was also increased. CONCLUSION We observed a likely genetic association between light chain 3B, a widely used marker for autophagy, and susceptibility to SLE.
Collapse
Affiliation(s)
- Yuan-Yuan Qi
- Peking University First Hospital, Beijing, China
| | - Xu-Jie Zhou
- Peking University First Hospital, Beijing, China
| | | | - Celi Sun
- Oklahoma Medical Research Foundation, Oklahoma City
| | - Yan-Na Wang
- Peking University First Hospital, Beijing, China
| | - Ping Hou
- Peking University First Hospital, Beijing, China
| | - Rong Mu
- Peking University People's Hospital, Beijing, China
| | - Chun Li
- Peking University People's Hospital, Beijing, China
| | | | - Zhan-Guo Li
- Peking University People's Hospital, Beijing, China
| | - Geng Wang
- Changzheng Hospital, Naval Medical University, Shanghai, China
| | - Hu-Ji Xu
- Changzheng Hospital, Naval Medical University, Shanghai, China
| | - Yan-Jie Hao
- Peking University First Hospital, Beijing, China
| | | | - Wei-Hua Yue
- Peking University Sixth Hospital, Beijing, China
| | | | | | - Hong Zhang
- Peking University First Hospital, Beijing, China
| |
Collapse
|
3
|
Takahashi M, Singh RS, Stone J. A Theory for the Origin of Human Menopause. Front Genet 2017; 7:222. [PMID: 28111590 PMCID: PMC5216033 DOI: 10.3389/fgene.2016.00222] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2015] [Accepted: 12/13/2016] [Indexed: 11/25/2022] Open
Abstract
A complete and compelling evolutionary explanation for the origin of human menopause is wanting. Menopause onset is defined clinically as the final menses, confirmed after 1 year without menstruation. The theory proposed herein explains at multiple levels - ultimately genetic but involving (1) behavioral, (2) life history, and (3) social changes - the origin and evolution of menopause in women. Individuals in Lower Paleolithic human populations were characterized by short lifespans with diminished late-age survival and fertility, similar to contemporary chimpanzees, and thence were subject to three changes. (1) A mating behavior change was established in which only young women reproduced, thereby rendering as effectively neutral female-specific late-onset fertility-diminishing mutations, which accumulated subsequently. (2) A lifespan increase was manifested adaptively, revealing the reproductive senescence phenotype encoded in late-onset fertility-diminishing mutation genotypes, which, heretofore, had been unexpressed in the shorter lifespan. (3) A social interaction change emerged exaptively, when older non-reproductive women exclusively started assisting in rearing grandchildren rather than giving birth to and caring for their own children, ultimately leading to menstrual cycle cessation. The changes associate in a one-to-one manner with existing, non-mutually exclusive hypotheses for the origin of human menopause. Evidence for each hypothesis and its associated change having occurred are reviewed, and the hypotheses are combined in a synthetic theory for the origin of human menopause. The new theory simultaneously addresses the main theoretical problem with each hypothesis and yields predictions for future testing.
Collapse
Affiliation(s)
| | - Rama S. Singh
- Department of Biology, Origins Institute, McMaster University, HamiltonON, Canada
| | - John Stone
- Department of Biology, Origins Institute, McMaster University, HamiltonON, Canada
| |
Collapse
|
4
|
Schmid M, Smith J, Burt DW, Aken BL, Antin PB, Archibald AL, Ashwell C, Blackshear PJ, Boschiero C, Brown CT, Burgess SC, Cheng HH, Chow W, Coble DJ, Cooksey A, Crooijmans RPMA, Damas J, Davis RVN, de Koning DJ, Delany ME, Derrien T, Desta TT, Dunn IC, Dunn M, Ellegren H, Eöry L, Erb I, Farré M, Fasold M, Fleming D, Flicek P, Fowler KE, Frésard L, Froman DP, Garceau V, Gardner PP, Gheyas AA, Griffin DK, Groenen MAM, Haaf T, Hanotte O, Hart A, Häsler J, Hedges SB, Hertel J, Howe K, Hubbard A, Hume DA, Kaiser P, Kedra D, Kemp SJ, Klopp C, Kniel KE, Kuo R, Lagarrigue S, Lamont SJ, Larkin DM, Lawal RA, Markland SM, McCarthy F, McCormack HA, McPherson MC, Motegi A, Muljo SA, Münsterberg A, Nag R, Nanda I, Neuberger M, Nitsche A, Notredame C, Noyes H, O'Connor R, O'Hare EA, Oler AJ, Ommeh SC, Pais H, Persia M, Pitel F, Preeyanon L, Prieto Barja P, Pritchett EM, Rhoads DD, Robinson CM, Romanov MN, Rothschild M, Roux PF, Schmidt CJ, Schneider AS, Schwartz MG, Searle SM, Skinner MA, Smith CA, Stadler PF, Steeves TE, Steinlein C, Sun L, Takata M, Ulitsky I, Wang Q, Wang Y, Warren WC, Wood JMD, Wragg D, Zhou H. Third Report on Chicken Genes and Chromosomes 2015. Cytogenet Genome Res 2015; 145:78-179. [PMID: 26282327 PMCID: PMC5120589 DOI: 10.1159/000430927] [Citation(s) in RCA: 65] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Affiliation(s)
- Michael Schmid
- Department of Human Genetics, University of Würzburg, Würzburg, Germany
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
5
|
Eng L, Azad AK, Qiu X, Kong QQ, Cheng D, Ying N, Tse A, Kuang Q, Dodbiba L, Renouf DJ, Marsh S, Savas S, Mackay HJ, Knox JJ, Darling GE, Wong RK, Xu W, Liu G, Faluyi OO. Discovery and validation of vascular endothelial growth factor (VEGF) pathway polymorphisms in esophageal adenocarcinoma outcome. Carcinogenesis 2015; 36:956-62. [DOI: 10.1093/carcin/bgv073] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2014] [Accepted: 05/20/2015] [Indexed: 01/13/2023] Open
|
6
|
Şükür YE, Kıvançlı İB, Özmen B. Ovarian aging and premature ovarian failure. J Turk Ger Gynecol Assoc 2014; 15:190-6. [PMID: 25317048 PMCID: PMC4195330 DOI: 10.5152/jtgga.2014.0022] [Citation(s) in RCA: 50] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2014] [Accepted: 03/26/2014] [Indexed: 11/22/2022] Open
Abstract
Physiological reproductive aging occurs as a result of a decrease in the number and quality of oocytes in ovarian cortex follicles. Although the reason for the decrease in the quality of the pool and follicular oocytes is not fully understood, endocrine, paracrine, genetic, and metabolic factors are thought to be effective. Nowadays, in order to understand the mechanisms of ovarian aging, genomic research has gained importance. The effect of co-factors, such as telomerase and ceramide, in the ovarian aging process is only getting ascertained with new research studies. The most important tests in the assessment of ovarian aging are antral follicle count and anti-Mullerian hormone.
Collapse
Affiliation(s)
- Yavuz Emre Şükür
- Department of Obstetrics and Gynecology, Kayseri Training and Research Hospital, Kayseri, Turkey
| | - İçten Balık Kıvançlı
- Department of Obstetrics and Gynecology, Gazimagusa State Hospital, Gazimagusa, Turkish Republic of Northern Cyprus
| | - Batuhan Özmen
- Department of Obstetrics and Gynecology, Ankara University Faculty of Medicine, Ankara, Turkey
| |
Collapse
|
7
|
Stathias V, Sotiris GR, Karagiannidis I, Bourikas G, Martinis G, Papazoglou D, Tavridou A, Papanas N, Maltezos E, Theodoridis M, Vargemezis V, Manolopoulos VG, Speed WC, Kidd JR, Kidd KK, Drineas P, Paschou P. Exploring genomic structure differences and similarities between the Greek and European HapMap populations: implications for association studies. Ann Hum Genet 2013; 76:472-83. [PMID: 23061745 DOI: 10.1111/j.1469-1809.2012.00730.x] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
Abstract
Studies of the genomic structure of the Greek population and Southeastern Europe are limited, despite the central position of the area as a gateway for human migrations into Europe. HapMap has provided a unique tool for the analysis of human genetic variation. Europe is represented by the CEU (Northwestern Europe) and the TSI populations (Tuscan Italians from Southern Europe), which serve as reference for the design of genetic association studies. Furthermore, genetic association findings are often transferred to unstudied populations. Although initial studies support the fact that the CEU can, in general, be used as reference for the selection of tagging SNPs in European populations, this has not been extensively studied across Europe. We set out to explore the genomic structure of the Greek population (56 individuals) and compare it to the HapMap TSI and CEU populations. We studied 1112 SNPs (27 regions, 13 chromosomes). Although the HapMap European populations are, in general, a good reference for the Greek population, regions of population differentiation do exist and results should not be light-heartedly generalized. We conclude that, perhaps due to the individual evolutionary history of each genomic region, geographic proximity is not always a perfect guide for selecting a reference population for an unstudied population.
Collapse
Affiliation(s)
- Vasileios Stathias
- Department of Molecular Biology and Genetics, Democritus University of Thrace, Alexandroupoli, Greece
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
8
|
Li Q, Geng X, Zheng W, Tang J, Xu B, Shi Q. Current understanding of ovarian aging. SCIENCE CHINA-LIFE SCIENCES 2012; 55:659-69. [PMID: 22932881 DOI: 10.1007/s11427-012-4352-5] [Citation(s) in RCA: 69] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/23/2012] [Accepted: 06/22/2012] [Indexed: 02/07/2023]
Abstract
The reproductive system of human female exhibits a much faster rate of aging than other body systems. Ovarian aging is thought to be dominated by a gradual decreasing numbers of follicles, coinciding with diminished quality of oocytes. Menopause is the final step in the process of ovarian aging. This review focuses on the mechanisms underlying the ovarian aging involving a poor complement of follicles at birth and a high rate of attrition each month, as well as the alternated endocrine factors. We also discuss the possible causative factors that contribute to ovarian aging, e.g., genetic factors, accumulation of irreparable damage of microenvironment, pathological effect and other factors. The appropriate and reliable methods to assess ovarian aging, such as quantification of follicles, endocrine measurement and genetic testing have also been discussed. Increased knowledge of the ovarian aging mechanisms may improve the prevention of premature ovarian failure.
Collapse
Affiliation(s)
- Qian Li
- Hefei National Laboratory for Physical Sciences at Microscale and School of Life Sciences, University of Science and Technology of China, Hefei, 230027, China
| | | | | | | | | | | |
Collapse
|
9
|
Li Y, Willer CJ, Ding J, Scheet P, Abecasis GR. MaCH: using sequence and genotype data to estimate haplotypes and unobserved genotypes. Genet Epidemiol 2010; 34:816-34. [PMID: 21058334 PMCID: PMC3175618 DOI: 10.1002/gepi.20533] [Citation(s) in RCA: 1540] [Impact Index Per Article: 110.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Genome-wide association studies (GWAS) can identify common alleles that contribute to complex disease susceptibility. Despite the large number of SNPs assessed in each study, the effects of most common SNPs must be evaluated indirectly using either genotyped markers or haplotypes thereof as proxies. We have previously implemented a computationally efficient Markov Chain framework for genotype imputation and haplotyping in the freely available MaCH software package. The approach describes sampled chromosomes as mosaics of each other and uses available genotype and shotgun sequence data to estimate unobserved genotypes and haplotypes, together with useful measures of the quality of these estimates. Our approach is already widely used to facilitate comparison of results across studies as well as meta-analyses of GWAS. Here, we use simulations and experimental genotypes to evaluate its accuracy and utility, considering choices of genotyping panels, reference panel configurations, and designs where genotyping is replaced with shotgun sequencing. Importantly, we show that genotype imputation not only facilitates cross study analyses but also increases power of genetic association studies. We show that genotype imputation of common variants using HapMap haplotypes as a reference is very accurate using either genome-wide SNP data or smaller amounts of data typical in fine-mapping studies. Furthermore, we show the approach is applicable in a variety of populations. Finally, we illustrate how association analyses of unobserved variants will benefit from ongoing advances such as larger HapMap reference panels and whole genome shotgun sequencing technologies.
Collapse
Affiliation(s)
- Yun Li
- Department of Genetics, Department of Biostatistics, University of North Carolina, Chapel Hill, North Carolina
| | - Cristen J. Willer
- Center for Statistical Genetics, Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, Michigan
| | - Jun Ding
- Center for Statistical Genetics, Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, Michigan
| | - Paul Scheet
- Department of Epidemiology, University of Texas M.D. Anderson Cancer Center, Houston, Texas
| | - Gonçalo R. Abecasis
- Center for Statistical Genetics, Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, Michigan
| |
Collapse
|
10
|
LI CHAO, LIN ZHIMING, XIE YINGYING, GUO ZISHI, HUANG JINXIAN, WEI QIUJING, LI QIUXIA, WANG XINWEI, CAO SHUANGYAN, LIAO ZETAO, XU MANLONG, SHEN YAN, GU JIERUO. ERAP1 Is Associated with Ankylosing Spondylitis in Han Chinese. J Rheumatol 2010; 38:317-21. [DOI: 10.3899/jrheum.100013] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
Objective.Genetic components play important roles in the incidence and development of ankylosing spondylitis (AS). Aminopeptidase regulator of tumor necrosis factor receptor shedding 1 (ERAP1) was recently found to be associated with AS in North American and British cohorts. We evaluated whether ERAP1 is associated with AS in a Chinese Han population.Methods.A sample of 50 patients and 50 healthy controls was recruited for preliminary screening for informative single-nucleotide polymorphisms (SNP). Then 6 SNP of suggestive significance in the initial screening were followed up in a large sample of 471 patients with AS and 456 ethnically matched controls. Diagnosis of AS followed the 1984 modified New York criteria. Linkage disequilibrium coefficient (D’ and r2) and haplotypes were estimated by Haploview.Result.Two SNP (rs27434, p = 0.00039, and rs27529, p = 0.0083) in ERAP1 other than that reported previously were found to be significantly associated with AS. Haplotype analysis using 5 SNP within 1 linkage disequilibrium block identified 2 risk haplotypes (GATGT and GACGT) and 1 protective haplotype (GGTGT) for AS.Conclusion.Our study demonstrated that 2 novel SNP in ERAP1 were associated with AS in the Han Chinese population, suggesting that ERAP1 might confer genetic risk for AS in Han Chinese through the common mechanism shared by different populations, although the AS-associated SNP in ERAP1 might be population-specific.
Collapse
|
11
|
Abstract
Understanding genetic variation between populations is important because it affects the portability of human genome-wide analytical methods. We compared genetic variation and substructure between Malawians and other African and non-African HapMap populations. Allele frequencies and adjacent linkage disequilibrium (LD) were measured for 617 715 single nucleotide polymorphisms (SNPs) across subject genomes. Allele frequencies in the Malawian population (N=226) were highly correlated with allele frequencies in HapMap populations of African ancestry (AFA, N=376), namely Yoruban in Ibadan, Nigeria (Spearman's r(2)=0.97), Luhya in Webuye, Kenya (r(2)=0.97), African Americans in the southwest United States (r(2)=0.94) and Maasai in Kinyawa, Kenya (r(2)=0.91). This correlation was much lower between Malawians and other ancestry populations (r(2)<0.52). LD correlations between Malawians and HapMap populations were strongest for the populations of AFA (AFA r(2)>0.82, other ancestries r(2)<0.57). Principal components analyses revealed little population substructure within our Malawi sample but provided clear distinction between Malawians, AFA populations and two European populations. Five SNPs within the lactase gene (LCT) had substantially different allele frequencies between the Malawi population and Maasai in Kenyawa, Kenya (rs3769013, rs730005, rs3769012, rs2304370; P-values <1 x 10(-33)).
Collapse
|
12
|
Van Hout CV, Levin AM, Rampersaud E, Shen H, O'Connell JR, Mitchell BD, Shuldiner AR, Douglas JA. Extent and distribution of linkage disequilibrium in the Old Order Amish. Genet Epidemiol 2010; 34:146-50. [PMID: 19697356 PMCID: PMC2811753 DOI: 10.1002/gepi.20444] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Abstract
Knowledge of the extent and distribution of linkage disequilibrium (LD) is critical to the design and interpretation of gene mapping studies. Because the demographic history of each population varies and is often not accurately known, it is necessary to empirically evaluate LD on a population-specific basis. Here we present the first genome-wide survey of LD in the Old Order Amish (OOA) of Lancaster County Pennsylvania, a closed population derived from a modest number of founders. Specifically, we present a comparison of LD between OOA individuals and US Utah participants in the International HapMap project (abbreviated CEU) using a high-density single nucleotide polymorphism (SNP) map. Overall, the allele (and haplotype) frequency distributions and LD profiles were remarkably similar between these two populations. For example, the median absolute allele frequency difference for autosomal SNPs was 0.05, with an inter-quartile range of 0.02–0.09, and for autosomal SNPs 10–20 kb apart with common alleles (minor allele frequency≥0.05), the LD measure r2 was at least 0.8 for 15 and 14% of SNP pairs in the OOA and CEU, respectively. Moreover, tag SNPs selected from the HapMap CEU sample captured a substantial portion of the common variation in the OOA (∼88%) at r2≥0.8. These results suggest that the OOA and CEU may share similar LD profiles for other common but untyped SNPs. Thus, in the context of the common variant-common disease hypothesis, genetic variants discovered in gene mapping studies in the OOA may generalize to other populations.
Collapse
Affiliation(s)
- Cristopher V Van Hout
- Department of Human Genetics, University of Michigan School of Medicine, Ann Arbor, 48109-5618, USA
| | | | | | | | | | | | | | | |
Collapse
|
13
|
Andiappan AK, Anantharaman R, Nilkanth PP, Wang DY, Chew FT. Evaluating the transferability of Hapmap SNPs to a Singapore Chinese population. BMC Genet 2010; 11:36. [PMID: 20459637 PMCID: PMC2877651 DOI: 10.1186/1471-2156-11-36] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2009] [Accepted: 05/07/2010] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The International Hapmap project serves as a valuable resource for human genome variation data, however its applicability to other populations has yet to be exhaustively investigated. In this paper, we use high density genotyping chips and resequencing strategies to compare the Singapore Chinese population with the Hapmap populations. First we compared 1028 and 114 unrelated Singapore Chinese samples genotyped using the Illumina Human Hapmap 550 k chip and Affymetrix 500 k array respectively against the 270 samples from Hapmap. Secondly, data from 20 candidate genes on 5q31-33 resequenced for an asthma candidate gene based study was also used for the analysis. RESULTS A total of 237 SNPs were identified through resequencing of which only 95 SNPs (40%) were in Hapmap; however an additional 56 SNPs (24%) were not genotyped directly but had a proxy SNP in the Hapmap. At the genome-wide level, Singapore Chinese were highly correlated with Hapmap Han Chinese with correlation of 0.954 and 0.947 for the Illumina and Affymetrix platforms respectively with deviant SNPs randomly distributed within and across all chromosomes. CONCLUSIONS The high correlation between our population and Hapmap Han Chinese reaffirms the applicability of Hapmap based genome-wide chips for GWA studies. There is a clear population signature for the Singapore Chinese samples and they predominantly resemble the southern Han Chinese population; however when new migrants particularly those with northern Han Chinese background were included, population stratification issues may arise. Future studies needs to address population stratification within the sample collection while designing and interpreting GWAS in the Chinese population.
Collapse
Affiliation(s)
- Anand Kumar Andiappan
- Department of Biological Sciences, National University of Singapore, Science Drive 4, Singapore 117543
| | - Ramani Anantharaman
- Department of Biological Sciences, National University of Singapore, Science Drive 4, Singapore 117543
| | - Pallavi Parate Nilkanth
- Department of Biological Sciences, National University of Singapore, Science Drive 4, Singapore 117543
| | - De Yun Wang
- Department of Otolaryngology, National University of Singapore, 10 Lower Kent Ridge Road, Singapore 119260
| | - Fook Tim Chew
- Department of Biological Sciences, National University of Singapore, Science Drive 4, Singapore 117543
| |
Collapse
|
14
|
Li X, Shu YH, Xiang AH, Trigo E, Kuusisto J, Hartiala J, Swift AJ, Kawakubo M, Stringham HM, Bonnycastle LL, Lawrence JM, Laakso M, Allayee H, Buchanan TA, Watanabe RM. Additive effects of genetic variation in GCK and G6PC2 on insulin secretion and fasting glucose. Diabetes 2009; 58:2946-53. [PMID: 19741163 PMCID: PMC2780888 DOI: 10.2337/db09-0228] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
OBJECTIVE Glucokinase (GCK) and glucose-6-phosphatase catalytic subunit 2 (G6PC2) regulate the glucose-cycling step in pancreatic beta-cells and may regulate insulin secretion. GCK rs1799884 and G6PC2 rs560887 have been independently associated with fasting glucose, but their interaction on glucose-insulin relationships is not well characterized. RESEARCH DESIGN AND METHODS We tested whether these variants are associated with diabetes-related quantitative traits in Mexican Americans from the BetaGene Study and attempted to replicate our findings in Finnish men from the METabolic Syndrome in Men (METSIM) Study. RESULTS rs1799884 was not associated with any quantitative trait (corrected P > 0.1), whereas rs560887 was significantly associated with the oral glucose tolerance test 30-min incremental insulin response (30' Deltainsulin, corrected P = 0.021). We found no association between quantitative traits and the multiplicative interaction between rs1799884 and rs560887 (P > 0.26). However, the additive effect of these single nucleotide polymorphisms was associated with fasting glucose (corrected P = 0.03) and 30' Deltainsulin (corrected P = 0.027). This additive association was replicated in METSIM (fasting glucose, P = 3.5 x 10(-10) 30' Deltainsulin, P = 0.028). When we examined the relationship between fasting glucose and 30' Deltainsulin stratified by GCK and G6PC2, we noted divergent changes in these quantitative traits for GCK but parallel changes for G6PC2. We observed a similar pattern in METSIM. CONCLUSIONS Our data suggest that variation in GCK and G6PC2 have additive effects on both fasting glucose and insulin secretion.
Collapse
Affiliation(s)
- Xia Li
- Department of Preventive Medicine, Division of Biostatistics, Keck School of Medicine, University of Southern California, Los Angeles, California
| | - Yu-Hsiang Shu
- Department of Preventive Medicine, Division of Biostatistics, Keck School of Medicine, University of Southern California, Los Angeles, California
| | - Anny H. Xiang
- Department of Preventive Medicine, Division of Biostatistics, Keck School of Medicine, University of Southern California, Los Angeles, California
| | - Enrique Trigo
- Department of Medicine, Division of Endocrinology, Keck School of Medicine, University of Southern California, Los Angeles, California
| | - Johanna Kuusisto
- Department of Medicine, University of Kuopio and Kuopio University Hospital, Kuopio, Finland
| | - Jaana Hartiala
- Department of Preventive Medicine, Division of Biostatistics, Keck School of Medicine, University of Southern California, Los Angeles, California
- Institute for Genetic Medicine, Keck School of Medicine, University of Southern California, Los Angeles, California
| | - Amy J. Swift
- Genome Technology Branch, National Human Genome Research Institute, Bethesda, Maryland
| | - Miwa Kawakubo
- Department of Preventive Medicine, Division of Biostatistics, Keck School of Medicine, University of Southern California, Los Angeles, California
| | - Heather M. Stringham
- Department of Biostatistics, School of Public Health, University of Michigan, Ann Arbor, Michigan
| | - Lori L. Bonnycastle
- Genome Technology Branch, National Human Genome Research Institute, Bethesda, Maryland
| | - Jean M. Lawrence
- Research and Evaluation, Kaiser Permanente of Southern California, Pasadena, California
| | - Markku Laakso
- Department of Medicine, University of Kuopio and Kuopio University Hospital, Kuopio, Finland
| | - Hooman Allayee
- Department of Preventive Medicine, Division of Biostatistics, Keck School of Medicine, University of Southern California, Los Angeles, California
- Institute for Genetic Medicine, Keck School of Medicine, University of Southern California, Los Angeles, California
| | - Thomas A. Buchanan
- Department of Medicine, Division of Endocrinology, Keck School of Medicine, University of Southern California, Los Angeles, California
- Department of Physiology and Biophysics, Keck School of Medicine, University of Southern California, Los Angeles, California
| | - Richard M. Watanabe
- Department of Preventive Medicine, Division of Biostatistics, Keck School of Medicine, University of Southern California, Los Angeles, California
- Department of Physiology and Biophysics, Keck School of Medicine, University of Southern California, Los Angeles, California
- Corresponding author: Richard M. Watanabe,
| |
Collapse
|
15
|
Lins TC, Abreu BS, Pereira RW. TagSNP transferability and relative loss of variability prediction from HapMap to an admixed population. J Biomed Sci 2009; 16:73. [PMID: 19682379 PMCID: PMC2737315 DOI: 10.1186/1423-0127-16-73] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2009] [Accepted: 08/14/2009] [Indexed: 01/30/2023] Open
Abstract
Background The application of a subset of single nucleotide polymorphisms, the tagSNPs, can be useful in capturing untyped SNPs information in a genomic region. TagSNP transferability from the HapMap dataset to admixed populations is of uncertain value due population structure, admixture, drift and recombination effects. In this work an empirical dataset from a Brazilian admixed sample was evaluated against the HapMap population to measure tagSNP transferability and the relative loss of variability prediction. Methods The transferability study was carried out using SNPs dispersed over four genomic regions: the PTPN22, HMGCR, VDR and CETP genes. Variability coverage and the prediction accuracy for tagSNPs in the selected genomic regions of HapMap phase II were computed using a prediction accuracy algorithm. Transferability of tagSNPs and relative loss of prediction were evaluated according to the difference between the Brazilian sample and the pooled and single HapMap population estimates. Results Each population presented different levels of prediction per gene. On average, the Brazilian (BRA) sample displayed a lower power of prediction when compared to HapMap and the pooled sample. There was a relative loss of prediction for BRA when using single HapMap populations, but a pooled HapMap dataset generated minor loss of variability prediction and lower standard deviations, except at the VDR locus at which loss was minor using CEU tagSNPs. Conclusion Studies that involve tagSNP selection for an admixed population should not be generally correlated with any specific HapMap population and can be better represented with a pooled dataset in most cases.
Collapse
Affiliation(s)
- Tulio C Lins
- Programa de Pós-Graduação em Ciências Genômicas e Biotecnologia, Universidade Católica de Brasília, Brasília, DF, Brazil.
| | | | | |
Collapse
|
16
|
Abstract
Menopause is the final step in the process referred to as ovarian ageing. The age related decrease in follicle numbers dictates the onset of cycle irregularity and the final cessation of menses. The parallel decay in oocyte quality contributes to the gradual decline in fertility and the final occurrence of natural sterility. Endocrine changes mainly relate to the decline in the negative feedback from ovarian factors at the hypothalamo-pituitary unit. The declining cohort of antral follicles with age first results in gradually elevated FSH levels, followed by subsequent stages of overt cycle irregularity. The gradual decline in the size of the antral follicle cohort is best represented by decreasing levels of anti-Mullerian hormone. The variability of ovarian ageing among women is evident from the large variation in age at menopause. The identification of women who have severely decreased ovarian reserve for their age is clinically relevant. Ovarian reserve tests have appeared to be fairly accurate in predicting response to ovarian stimulation in the assisted reproductive technology (ART) setting. The capacity to predict the chances for spontaneous pregnancy or pregnancy after ART appears very limited. As menopause and the preceding decline in oocyte quality seem to have a fixed time interval, tests that predict the age at menopause may be useful to assess individual reproductive lifespan. Especially genetic studies, both addressing candidate gene and genome wide association, have identified several interesting loci of small genetic variation that may determine fetal follicle pool development and subsequent wastage of his pool over time. Improved knowledge of the ovarian ageing mechanisms may ultimately provide tools for prediction of menopause and manipulation of the early steps of folliculogenesis for the purpose of contraception and fertility lifespan extension.
Collapse
Affiliation(s)
- F J Broekmans
- Department of Reproductive Medicine and Gynecology, University Medical Center, Utrecht, The Netherlands.
| | | | | |
Collapse
|
17
|
Li M, Wang K, Grant SFA, Hakonarson H, Li C. ATOM: a powerful gene-based association test by combining optimally weighted markers. ACTA ACUST UNITED AC 2008; 25:497-503. [PMID: 19074959 DOI: 10.1093/bioinformatics/btn641] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
BACKGROUND Large-scale candidate-gene and genome-wide association studies genotype multiple SNPs within or surrounding a gene, including both tag and functional SNPs. The immense amount of data generated in these studies poses new challenges to analysis. One particularly challenging yet important question is how to best use all genetic information to test whether a gene or a region is associated with the trait of interest. METHODS Here we propose a powerful gene-based Association Test by combining Optimally Weighted Markers (ATOM) within a genomic region. Due to variation in linkage disequilibrium, different markers often associate with the trait of interest at different levels. To appropriately apportion their contributions, we assign a weight to each marker that is proportional to the amount of information it captures about the trait locus. We analytically derive the optimal weights for both quantitative and binary traits, and describe a procedure for estimating the weights from a reference database such as the HapMap. Compared with existing approaches, our method has several distinct advantages, including (i) the ability to borrow information from an external database to increase power, (ii) the theoretical derivation of optimal marker weights and (iii) the scalability to simultaneous analysis of all SNPs in candidate genes and pathways. RESULTS Through extensive simulations and analysis of the FTO gene in our ongoing genome-wide association study on childhood obesity, we demonstrate that ATOM increases the power to detect genetic association as compared with several commonly used multi-marker association tests.
Collapse
Affiliation(s)
- Mingyao Li
- Department of Biostatistics and Epidemiology, University of Pennsylvania School of Medicine, The Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA.
| | | | | | | | | |
Collapse
|
18
|
Hao K, Schadt EE, Storey JD. Calibrating the performance of SNP arrays for whole-genome association studies. PLoS Genet 2008; 4:e1000109. [PMID: 18584036 PMCID: PMC2432039 DOI: 10.1371/journal.pgen.1000109] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2008] [Accepted: 05/28/2008] [Indexed: 11/18/2022] Open
Abstract
To facilitate whole-genome association studies (WGAS), several high-density SNP genotyping arrays have been developed. Genetic coverage and statistical power are the primary benchmark metrics in evaluating the performance of SNP arrays. Ideally, such evaluations would be done on a SNP set and a cohort of individuals that are both independently sampled from the original SNPs and individuals used in developing the arrays. Without utilization of an independent test set, previous estimates of genetic coverage and statistical power may be subject to an overfitting bias. Additionally, the SNP arrays' statistical power in WGAS has not been systematically assessed on real traits. One robust setting for doing so is to evaluate statistical power on thousands of traits measured from a single set of individuals. In this study, 359 newly sampled Americans of European descent were genotyped using both Affymetrix 500K (Affx500K) and Illumina 650Y (Ilmn650K) SNP arrays. From these data, we were able to obtain estimates of genetic coverage, which are robust to overfitting, by constructing an independent test set from among these genotypes and individuals. Furthermore, we collected liver tissue RNA from the participants and profiled these samples on a comprehensive gene expression microarray. The RNA levels were used as a large-scale set of quantitative traits to calibrate the relative statistical power of the commercial arrays. Our genetic coverage estimates are lower than previous reports, providing evidence that previous estimates may be inflated due to overfitting. The Ilmn650K platform showed reasonable power (50% or greater) to detect SNPs associated with quantitative traits when the signal-to-noise ratio (SNR) is greater than or equal to 0.5 and the causal SNP's minor allele frequency (MAF) is greater than or equal to 20% (N = 359). In testing each of the more than 40,000 gene expression traits for association to each of the SNPs on the Ilmn650K and Affx500K arrays, we found that the Ilmn650K yielded 15% times more discoveries than the Affx500K at the same false discovery rate (FDR) level. Advances in SNP genotyping array technologies have made whole-genome association studies (WGAS) a readily available approach. Genetic coverage and the statistical power are two key properties to evaluate on the arrays. In this study, 359 newly sampled individuals were genotyped using Affymetrix 500K and Illumina 650Y SNP arrays. From these data, we obtained new estimates of genetic coverage by constructing a test set from among these genotypes and individuals that is independent from the SNPs and individuals used to construct the arrays. These estimates are notably smaller than previous ones, which we argue is due to an overfitting bias in previous studies. We also collected liver tissue RNA from the participants and profiled these samples on a comprehensive gene expression microarray. The RNA levels were used as a large-scale set of quantitative traits to calibrate the relative statistical power of the commercial arrays. Through this dataset and simulations, we find that the SNP arrays provide adequate power to detect quantitative trait loci when the causal SNP's minor allele frequency is greater than 20%, but low power is less than 10%. Importantly, we provide evidence that sample size has a greater impact on the power of WGAS than SNP density or genetic coverage.
Collapse
Affiliation(s)
- Ke Hao
- Rosetta Inpharmatics, Seattle, Washington, United States of America
| | - Eric E. Schadt
- Rosetta Inpharmatics, Seattle, Washington, United States of America
| | - John D. Storey
- Rosetta Inpharmatics, Seattle, Washington, United States of America
- * E-mail:
| |
Collapse
|
19
|
Xing J, Witherspoon DJ, Watkins WS, Zhang Y, Tolpinrud W, Jorde LB. HapMap tagSNP transferability in multiple populations: general guidelines. Genomics 2008; 92:41-51. [PMID: 18482828 DOI: 10.1016/j.ygeno.2008.03.011] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2008] [Revised: 03/26/2008] [Accepted: 03/28/2008] [Indexed: 11/30/2022]
Abstract
Linkage disequilibrium (LD) has received much attention recently because of its value in localizing disease-causing genes. Due to the extensive LD between neighboring loci in the human genome, it is believed that a subset of the single nucleotide polymorphisms in a region (tagSNPs) can be selected to capture most of the remaining SNP variants. In this study, we examined LD patterns and HapMap tagSNP transferability in more than 300 individuals. A South Indian sample and an African Mbuti Pygmy population sample were included to evaluate the performance of HapMap tagSNPs in geographically distinct and genetically isolated populations. Our results show that HapMap tagSNPs selected with r(2) >= 0.8 can capture more than 85% of the SNPs in populations that are from the same continental group. Combined tagSNPs from HapMap CEU and CHB+JPT serve as the best reference for the Indian sample. The HapMap YRI are a sufficient reference for tagSNP selection in the Pygmy sample. In addition to our findings, we reviewed over 25 recent studies of tagSNP transferability and propose a general guideline for selecting tagSNPs from HapMap populations.
Collapse
Affiliation(s)
- Jinchuan Xing
- Department of Human Genetics, Eccles Institute of Human Genetics, University of Utah, Salt Lake City, UT 84112, USA
| | | | | | | | | | | |
Collapse
|
20
|
Abstract
The question of tagging single nucleotide polymorphism (tagSNP) transferability is an important one because many ongoing and upcoming Genome-Wide Association studies rely critically upon the validity, and practical feasibility of using a universal core set of tagSNPs. A series of recent studies analyzed performance of tagSNPs selected based on the HapMap. While these studies showed largely satisfactory transferability of the tagSNPs, they also reported that the level of transferability varies, substantively sometimes, especially when tagSNPs selected in one population were used in another distant population. We present a review of the literature about where and why tagSNP transferability may become a problem and suggest research directions that may help the resolution.
Collapse
Affiliation(s)
- C Charles Gu
- Division of Biostatistics, Washington University School of Medicine, St Louis, MO 63110, USA.
| | | | | | | | | |
Collapse
|
21
|
Lundmark PE, Liljedahl U, Boomsma DI, Mannila H, Martin NG, Palotie A, Peltonen L, Perola M, Spector TD, Syvänen AC. Evaluation of HapMap data in six populations of European descent. Eur J Hum Genet 2008; 16:1142-50. [PMID: 18398430 DOI: 10.1038/ejhg.2008.77] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Open
Abstract
We studied how well the European CEU samples used in the Haplotype Mapping Project (HapMap) represent five European populations by analyzing nuclear family samples from the Swedish, Finnish, Dutch, British and Australian (European ancestry) populations. The number of samples from each population (about 30 parent-offspring trios) was similar to that in the HapMap sample sets. A panel of 186 single nucleotide polymorphisms (SNPs) distributed over the 1.5 Mb region of the GRID2 gene on chromosome 4 was genotyped. The genotype data were compared pair-wise between the HapMap sample and the other population samples. Principal component analysis (PCA) was used to cluster the data from different populations with respect to allele frequencies and to define the markers responsible for observed variance. The only sample with detectable differences in allele frequencies was that from Kuusamo, Finland. This sample also separated from the others, including the other Finnish sample, in the PCA analysis. A set of tagSNPs was defined based on the HapMap data and applied to the samples. The tagSNPs were found to capture the genetic variation in the analyzed region at r(2)>0.8 at levels ranging from 95% in the Kuusamo sample to 87% in the Australian sample. To capture the maximal genetic variation in the region, the Kuusamo, HapMap and Australian samples required 58, 63 and 73 native tagSNPs, respectively. The HapMap CEU sample represents the European samples well for tagSNP selection, with some caution regarding estimation of allele frequencies in the Finnish Kuusamo sample, and a slight reduction in tagging efficiency in the Australian sample.
Collapse
Affiliation(s)
- Per E Lundmark
- Molecular Medicine, Department of Medical Sciences, Uppsala University Hospital, Uppsala University, Uppsala, Sweden
| | | | | | | | | | | | | | | | | | | |
Collapse
|
22
|
An evaluation of the performance of HapMap SNP data in a Shanghai Chinese population: analyses of allele frequency, linkage disequilibrium pattern and tagging SNPs transferability on chromosome 1q21-q25. BMC Genet 2008; 9:19. [PMID: 18302794 PMCID: PMC2292209 DOI: 10.1186/1471-2156-9-19] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2007] [Accepted: 02/27/2008] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The HapMap project aimed to catalog millions of common single nucleotide polymorphisms (SNPs) in the human genome in four major populations, in order to facilitate association studies of complex diseases. To examine the transferability of Han Chinese in Beijing HapMap data to the Southern Han Chinese in Shanghai, we performed comparative analyses between genotypes from over 4,500 SNPs in a 21 Mb region on chromosome 1q21-q25 in 80 unrelated Shanghai Chinese and 45 HapMap Chinese data. RESULTS Three thousand and forty-two SNPs were analyzed after removal of SNPs that failed quality control and those not in the HapMap panel. We compared the allele frequency distributions, linkage disequilibrium patterns, haplotype frequency distributions and tagging SNP sets transferability between the HapMap population and Shanghai Chinese population. Among the four HapMap populations, Beijing Chinese showed the best correlation with Shanghai population on allele frequencies, linkage disequilibrium and haplotype frequencies. Tagging SNP sets selected from four HapMap populations at different thresholds were evaluated in the Shanghai sample. Under the threshold of r2 equal to 0.8 or 0.5, both HapMap Chinese and Japanese data showed better coverage and tagging efficiency than Caucasian and African data. CONCLUSION Our study supported the applicability of HapMap Beijing Chinese SNP data to the study of complex diseases among southern Chinese population.
Collapse
|
23
|
Abstract
Thanks to the recent revolutionary genomic advances such as the International HapMap consortium, resolution of the genetic architecture of common complex traits is beginning to look hopeful. While demonstrating the feasibility of genome-wide association (GWA) studies, the pathbreaking Wellcome Trust Case Control Consortium (WTCCC) study also serves to underscore the critical importance of very large sample sizes and draws attention to potential problems, which need to be addressed as part of the study design. Even the large WTCCC study had vastly inadequate power for several of the associations reported (and confirmed) and, therefore, most of the regions harboring relevant associations may not be identified anytime soon. This chapter provides an overview of some of the key developments in the methodological approaches to genetic dissection of common complex traits. Constrained Bayesian networks are suggested as especially useful for analysis of pathway-based SNPs. Likewise, composite likelihood is suggested as a promising method for modeling complex systems. It discusses the key steps in a study design, with an emphasis on GWA studies. Potential limitations highlighted by the WTCCC GWA study are discussed, including problems associated with massive genotype imputation, analysis of pooled national samples, shared controls, and the critical role of interactions. GWA studies clearly need massive sample sizes that are only possible through genuine collaborations. After all, for common complex traits, the question is not whether we can find some pieces of the puzzle, but how large and what kind of a sample we need to (nearly) solve the genetic puzzle.
Collapse
Affiliation(s)
- D C Rao
- Division of Biostatistics and Departments of Genetics, Psychiatry, and Mathematics, School of Medicine, Washington University in St. Louis, St. Louis, MO 63110, USA
| |
Collapse
|
24
|
|
25
|
Dong C, Qian Z, Jia P, Wang Y, Huang W, Li Y. Gene-centric characteristics of genome-wide association studies. PLoS One 2007; 2:e1262. [PMID: 18060058 PMCID: PMC2092383 DOI: 10.1371/journal.pone.0001262] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2007] [Accepted: 10/12/2007] [Indexed: 11/19/2022] Open
Abstract
BACKGROUND The high-throughput genotyping chips have contributed greatly to genome-wide association (GWA) studies to identify novel disease susceptibility single nucleotide polymorphisms (SNPs). The high-density chips are designed using two different SNP selection approaches, the direct gene-centric approach, and the indirect quasi-random SNPs or linkage disequilibrium (LD)-based tagSNPs approaches. Although all these approaches can provide high genome coverage and ascertain variants in genes, it is not clear to which extent these approaches could capture the common genic variants. It is also important to characterize and compare the differences between these approaches. METHODOLOGY/PRINCIPAL FINDINGS In our study, by using both the Phase II HapMap data and the disease variants extracted from OMIM, a gene-centric evaluation was first performed to evaluate the ability of the approaches in capturing the disease variants in Caucasian population. Then the distribution patterns of SNPs were also characterized in genic regions, evolutionarily conserved introns and nongenic regions, ontologies and pathways. The results show that, no mater which SNP selection approach is used, the current high-density SNP chips provide very high coverage in genic regions and can capture most of known common disease variants under HapMap frame. The results also show that the differences between the direct and the indirect approaches are relatively small. Both have similar SNP distribution patterns in these gene-centric characteristics. CONCLUSIONS/SIGNIFICANCE This study suggests that the indirect approaches not only have the advantage of high coverage but also are useful for studies focusing on various functional SNPs either in genes or in the conserved regions that the direct approach supports. The study and the annotation of characteristics will be helpful for designing and analyzing GWA studies that aim to identify genetic risk factors involved in common diseases, especially variants in genes and conserved regions.
Collapse
Affiliation(s)
- Changzheng Dong
- Bioinformatics Center, Key Lab of Systems Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China
- Chinese National Human Genome Center at Shanghai, Shanghai, China
- Graduate School of the Chinese Academy of Sciences, Beijing, China
| | - Ziliang Qian
- Bioinformatics Center, Key Lab of Systems Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China
- Graduate School of the Chinese Academy of Sciences, Beijing, China
| | - Peilin Jia
- Bioinformatics Center, Key Lab of Systems Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China
- Graduate School of the Chinese Academy of Sciences, Beijing, China
| | - Ying Wang
- Chinese National Human Genome Center at Shanghai, Shanghai, China
| | - Wei Huang
- Chinese National Human Genome Center at Shanghai, Shanghai, China
- Rui Jin Hospital, School of Medicine, Shanghai Jiaotong University, Shanghai, China
| | - Yixue Li
- Bioinformatics Center, Key Lab of Systems Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China
- Shanghai Center for Bioinformation Technology, Shanghai, China
| |
Collapse
|
26
|
Abstract
SUMMARY GWAsimulator implements a rapid moving-window algorithm to simulate genotype data for case-control or population samples from genomic SNP chips. For case-control data, the program generates cases and controls according to a user-specified multi-locus disease model, and can simulate specific regions if desired. The program uses phased genotype data as input and has the flexibility of simulating genotypes for different populations and different genomic SNP chips. When the HapMap phased data are used, the simulated data have similar local LD patterns as the HapMap data. As genome-wide association (GWA) studies become increasingly popular and new GWA data analysis methods are being developed, we anticipate that GWAsimulator will be an important tool for evaluating performance of new GWA analysis methods. AVAILABILITY The C++ source code, executables for Linux, Windows and MacOS, manual, example data sets and analysis program are available at http://biostat.mc.vanderbilt.edu/GWAsimulator
Collapse
Affiliation(s)
- Chun Li
- Department of Biostatistics, Vanderbilt University School of Medicine, Nashville, TN 37232, USA.
| | | |
Collapse
|
27
|
Loza MJ, McCall CE, Li L, Isaacs WB, Xu J, Chang BL. Assembly of inflammation-related genes for pathway-focused genetic analysis. PLoS One 2007; 2:e1035. [PMID: 17940599 PMCID: PMC2001184 DOI: 10.1371/journal.pone.0001035] [Citation(s) in RCA: 80] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2007] [Accepted: 09/25/2007] [Indexed: 12/20/2022] Open
Abstract
Recent identifications of associations between novel variants in inflammation-related genes and several common diseases emphasize the need for systematic evaluations of these genes in disease susceptibility. Considering that many genes are involved in the complex inflammation responses and many genetic variants in these genes have the potential to alter the functions and expression of these genes, we assembled a list of key inflammation-related genes to facilitate the identification of genetic associations of diseases with an inflammation-related etiology. We first reviewed various phases of inflammation responses, including the development of immune cells, sensing of danger, influx of cells to sites of insult, activation and functional responses of immune and non-immune cells, and resolution of the immune response. Assisted by the Ingenuity Pathway Analysis, we then identified 17 functional sub-pathways that are involved in one or multiple phases. This organization would greatly increase the chance of detecting gene-gene interactions by hierarchical clustering of genes with their functional closeness in a pathway. Finally, as an example application, we have developed tagging single nucleotide polymorphism (tSNP) arrays for populations of European and African descent to capture all the common variants of these key inflammation-related genes. Assays of these tSNPs have been designed and assembled into two Affymetrix ParAllele customized chips, one each for European (12,011 SNPs) and African (21,542 SNPs) populations. These tSNPs have greater coverage for these inflammation-related genes compared to the existing genome-wide arrays, particularly in the African population. These tSNP arrays can facilitate systematic evaluation of inflammation pathways in disease susceptibility. For additional applications, other genotyping platforms could also be employed. For existing genome-wide association data, this list of key inflammation-related genes and associated subpathways can facilitate comprehensive inflammation pathway- focused association analyses.
Collapse
Affiliation(s)
- Matthew J. Loza
- Center for Human Genomics, Department of Internal Medicine, Wake Forest University School of Medicine, Winston-Salem, North Carolina, United States of America
| | - Charles E. McCall
- Department of Internal Medicine, Wake Forest University School of Medicine, Winston-Salem, North Carolina, United States of America
| | - Liwu Li
- Department of Biology, Virginia Polytechnic Institute and State University, Blacksburg, Virginia, United States of America
| | - William B. Isaacs
- Department of Urology, Johns Hopkins University Medical Institutions, Baltimore, Maryland, United States of America
- Department of Oncology, Johns Hopkins University Medical Institutions, Baltimore, Maryland, United States of America
| | - Jianfeng Xu
- Center for Human Genomics, Department of Epidemiology and Prevention, Wake Forest University School of Medicine, Winston-Salem, North Carolina, United States of America
- * To whom correspondence should be addressed. E-mail: (JX); (BC)
| | - Bao-Li Chang
- Center for Human Genomics, Department of Pediatric Medicine, Wake Forest University School of Medicine, Winston-Salem, North Carolina, United States of America
- * To whom correspondence should be addressed. E-mail: (JX); (BC)
| |
Collapse
|
28
|
Wright FA, Huang H, Guan X, Gamiel K, Jeffries C, Barry WT, de Villena FPM, Sullivan PF, Wilhelmsen KC, Zou F. Simulating association studies: a data-based resampling method for candidate regions or whole genome scans. Bioinformatics 2007; 23:2581-8. [PMID: 17785348 DOI: 10.1093/bioinformatics/btm386] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION Reductions in genotyping costs have heightened interest in performing whole genome association scans and in the fine mapping of candidate regions. Improvements in study design and analytic techniques will require the simulation of datasets with realistic patterns of linkage disequilibrium and allele frequencies for typed SNPs. METHODS We describe a general approach to simulate genotyped datasets for standard case-control or affected child trio data, by resampling from existing phased datasets. The approach allows for considerable flexibility in disease models, potentially involving a large number of interacting loci. The method is most applicable for diseases caused by common variants that have not been under strong selection, a class specifically targeted by the International HapMap project. RESULTS Using the three population Phase I/II HapMap data as a testbed for our approach, we have implemented the approach in HAP-SAMPLE, a web-based simulation tool.
Collapse
Affiliation(s)
- Fred A Wright
- Department of Biostatistics, University of North Carolina, Chapel Hill, NC 27599, USA.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
29
|
Abstract
Personalized medicine introduces the promise to use molecular markers that signal the risk of disease or its presence before clinical signs and symptoms appear. This information underlies a new healthcare strategy focused on prevention and early intervention, rather than reaction to advanced stages of disease. Such a strategy can delay disease onset or minimize symptom severity. The molecular foundations that enable personalized medicine include detection of variation in nucleotide sequence of genes and in characteristic patterns of gene expression, proteins and metabolites. Genetic and molecular patterns are correlated with disease manifestations, drug responses, treatment prognosis, or prediction of predisposition to future disease states. However, the uncertainties for personalized medicine are considerable, including economic, ethical, legal, and societal questions. Although much of its promise remains unproven to date, the foundations of personalized medicine appear solid and evidence is accumulating rapidly pointing to its growing importance in healthcare (Fig. 1).
Collapse
Affiliation(s)
- Erwin P Bottinger
- Charles Bronfman Institute for Personalized Medicine, Department of Medicine, Mount Sinai School of Medicine, New York, New York 10029, USA.
| |
Collapse
|
30
|
Van Eyken E, Van Camp G, Van Laer L. The complexity of age-related hearing impairment: contributing environmental and genetic factors. Audiol Neurootol 2007; 12:345-58. [PMID: 17664866 DOI: 10.1159/000106478] [Citation(s) in RCA: 206] [Impact Index Per Article: 12.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2006] [Accepted: 05/18/2007] [Indexed: 12/24/2022] Open
Abstract
Age-related hearing impairment (ARHI) is the most common sensory impairment seen in the elderly. It is a complex disorder, with both environmental as well as genetic factors contributing to the impairment. The involvement of several environmental factors has been partially elucidated. A first step towards the identification of the genetic factors has been made, which will result in the identification of susceptibility genes, and will provide possible targets for the future treatment and/or prevention of ARHI. This paper aims to give a broad overview of the scientific findings related to ARHI, focusing mainly on environmental and genetic data in humans and in animal models. In addition, methods for the identification of contributing genetic factors as well as possible future therapeutic strategies are discussed.
Collapse
Affiliation(s)
- E Van Eyken
- Department of Medical Genetics, University of Antwerp, Antwerp, Belgium
| | | | | |
Collapse
|
31
|
Marvelle AF, Lange LA, Qin L, Wang Y, Lange EM, Adair LS, Mohlke KL. Comparison of ENCODE region SNPs between Cebu Filipino and Asian HapMap samples. J Hum Genet 2007; 52:729-737. [PMID: 17636361 DOI: 10.1007/s10038-007-0175-9] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2007] [Accepted: 06/22/2007] [Indexed: 10/23/2022]
Abstract
Patterns of linkage disequilibrium (LD) act as the framework for designing efficient association studies; these patterns are being studied and catalogued by The International HapMap Project. The current study assessed the transferability of tag SNPs chosen from HapMap panels to a cohort of 80 individuals from metro Cebu, Philippines, who participated in the Cebu Longitudinal Health and Nutrition Survey (CLHNS). The analyses focused on 627 single nucleotide polymorphisms (SNPs) in the central 40 kb within each of the 10 HapMap ENCODE regions. The similarity between the genetic variants in Cebu Filipino samples and HapMap panels was examined using allele frequency estimates, measures of pairwise linkage disequilibrium (LD), and haplotype frequency estimates. For these measures, strong correlations were observed between the Cebu Filipino samples and the Asian panels from HapMap, with the strongest correlations observed with the Han Chinese from Beijing (CHB) panel. Tag SNPs selected using the HapMap CHB panel were particularly effective at representing the genetic variation in Cebu Filipino samples. These results suggest that the HapMap data will be an effective resource for future studies in Cebu Filipino samples.
Collapse
Affiliation(s)
- Amanda F Marvelle
- Department of Genetics, School of Medicine, University of North Carolina at Chapel Hill, 4109-F Neuroscience Research Building, 103 Mason Farm Road, Chapel Hill, NC, 27599-7264, USA
| | - Leslie A Lange
- Department of Genetics, School of Medicine, University of North Carolina at Chapel Hill, 4109-F Neuroscience Research Building, 103 Mason Farm Road, Chapel Hill, NC, 27599-7264, USA
| | - Li Qin
- Department of Genetics, School of Medicine, University of North Carolina at Chapel Hill, 4109-F Neuroscience Research Building, 103 Mason Farm Road, Chapel Hill, NC, 27599-7264, USA
| | - Yunfei Wang
- Department of Genetics, School of Medicine, University of North Carolina at Chapel Hill, 4109-F Neuroscience Research Building, 103 Mason Farm Road, Chapel Hill, NC, 27599-7264, USA
| | - Ethan M Lange
- Department of Genetics, School of Medicine, University of North Carolina at Chapel Hill, 4109-F Neuroscience Research Building, 103 Mason Farm Road, Chapel Hill, NC, 27599-7264, USA
- Department of Biostatistics, School of Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
| | - Linda S Adair
- Department of Nutrition, Schools of Medicine and Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
| | - Karen L Mohlke
- Department of Genetics, School of Medicine, University of North Carolina at Chapel Hill, 4109-F Neuroscience Research Building, 103 Mason Farm Road, Chapel Hill, NC, 27599-7264, USA.
| |
Collapse
|
32
|
Montpetit A, Chagnon F. [The Haplotype Map of the human genome: a revolution in the genetics of complex diseases]. Med Sci (Paris) 2007; 22:1061-7. [PMID: 17156727 DOI: 10.1051/medsci/200622121061] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
More than 99.9 % of the sequence is identical when comparing the DNA from two individuals. The remaining 0.1 % is responsible, along with other factors such as the environment, for the risk level of developing complex diseases (such as asthma, diabetes or cancer) or for the different pharmacological response to drugs. Despite the incredible advances in genomics in the past few years, identifying the variants involved remains difficult because of the prodigious amount of information to process. The recent completion of the Haplotype Map of the human genome has raised great hopes in the field as it is expected to help reduce the complexity of association studies and thus accelerate the discovery of genes associated with complex diseases. This review details the rationale behind the HapMap project, gives a summary of the results and also describes potential applications of the Haplotype Map.
Collapse
Affiliation(s)
- Alexandre Montpetit
- Centre d'Innovation de Génome Québec et de l'Université McGill, 740 avenue Dr Penfield, Montréal, Québec, H3A 1A4 Canada.
| | | |
Collapse
|
33
|
Evaluating the performance of commercial whole-genome marker sets for capturing common genetic variation. BMC Genomics 2007; 8:159. [PMID: 17562002 PMCID: PMC1914356 DOI: 10.1186/1471-2164-8-159] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2007] [Accepted: 06/11/2007] [Indexed: 11/10/2022] Open
Abstract
Background New technologies have enabled genome-wide association studies to be conducted with hundreds of thousands of genotyped SNPs. Several different first-generation genome-wide panels of SNPs have been commercialized. The total amount of common genetic variation is still unknown; however, the coverage of commercial panels can be evaluated against reference population samples genotyped by the International HapMap project. Less information is available about coverage in samples from other populations. Results In this study we compare four commercial panels: the HumanHap 300 and HumanHap 550 Array Sets from the Illumina Infinium series and the Mapping 100 K and Mapping 500 K Array Sets from the Affymetrix GeneChip series. Tagging performance is compared among HapMap CEPH (CEU), Asian (JPT, CHB) and Yoruba (YRI) population samples. It is also evaluated in an Estonian population sample with more than 1000 individuals genotyped in two 500-kbp ENCODE regions of chromosome 2: ENr112 on 2p16.3 and ENr131 on 2p37.1. Conclusion We found that in a non-reference Caucasian population, commercial SNP panels provide levels of coverage similar to those in the HapMap CEPH population sample. We present the proportions of universal and population-specific SNPs in all the commercial platforms studied.
Collapse
|
34
|
Iyengar SK, Adler SG. The application of the HapMap to diabetic nephropathy and other causes of chronic renal failure. Semin Nephrol 2007; 27:223-36. [PMID: 17418690 DOI: 10.1016/j.semnephrol.2007.01.003] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Abstract
The human nuclear genome consists of approximately 3 billion nucleotides. Human beings are 99% similar in DNA sequence to each other, but natural genetic variation in approximately 1% of the DNA sequence is responsible for interindividual differences, including determining who will develop disease and who will remain healthy. The pace and timing of disease initiation also is regulated by exposure to individual-level environmental factors and other random causes. Therefore, an examination of the DNA sequences of individuals with and without diabetic nephropathy, or, more broadly, chronic renal failure, can predict which sequence differences vary with disease (or health). The technology is not yet economical enough to analyze large numbers of individuals down to each nucleotide, but standardized dense genotyping sets for interrogating 1 marker for every 5,000, 10,000, or 15,000 nucleotides now are affordable even in large samples. The swiftness with which disease-gene associations can be mined has improved radically as a result of the availability of discovery human genetic variation data from large-scale public and private initiatives, such as those provided by the International Haplotype Map Consortium and Perlegen Sciences, Inc. (Mountain View, CA). These projects have captured many of the common genetic variants (>1%) in the genome. This information has been buttressed with improvements in large-scale genotyping technologies and statistical methods for data analysis. In summary, the renal community is now poised for discovery of genes for chronic renal failure using these resources.
Collapse
Affiliation(s)
- Sudha K Iyengar
- Department of Epidemiology and Biostatistics, Case Western Reserve University, 2103 Cornell Road, Cleveland, OH 44106, USA.
| | | |
Collapse
|
35
|
Pemberton TJ, Jakobsson M, Conrad DF, Coop G, Wall JD, Pritchard JK, Patel PI, Rosenberg NA. Using population mixtures to optimize the utility of genomic databases: linkage disequilibrium and association study design in India. Ann Hum Genet 2007; 72:535-46. [PMID: 18513279 DOI: 10.1111/j.1469-1809.2008.00457.x] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
When performing association studies in populations that have not been the focus of large-scale investigations of haplotype variation, it is often helpful to rely on genomic databases in other populations for study design and analysis - such as in the selection of tag SNPs and in the imputation of missing genotypes. One way of improving the use of these databases is to rely on a mixture of database samples that is similar to the population of interest, rather than using the single most similar database sample. We demonstrate the effectiveness of the mixture approach in the application of African, European, and East Asian HapMap samples for tag SNP selection in populations from India, a genetically intermediate region underrepresented in genomic studies of haplotype variation.
Collapse
Affiliation(s)
- T J Pemberton
- Institute for Genetic Medicine, University of Southern California, 2250 Alcazar St., Los Angeles, California 90033, USA
| | | | | | | | | | | | | | | |
Collapse
|
36
|
Service S, Sabatti C, Freimer N. Tag SNPs chosen from HapMap perform well in several population isolates. Genet Epidemiol 2007; 31:189-94. [PMID: 17323370 DOI: 10.1002/gepi.20201] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Population isolates may be particularly useful for association studies of complex traits. This utility, however, largely depends on the transferability of tag SNPs chosen from reference samples, such as HapMap, to samples from such populations. Factors that characterize population isolates, such as widespread genetic drift, could impede such transferability. In this report, we show that tag SNPs chosen from HapMap perform well in several population isolates; this is true even for populations that differ substantially from the HapMap sample either in levels of linkage disequilibrium or in SNP allele frequency distributions.
Collapse
Affiliation(s)
- Susan Service
- Center for Neurobehavioral Genetics, University of California, Los Angeles, California 90095-1761, USA
| | | | | |
Collapse
|
37
|
Yeager M, Orr N, Hayes RB, Jacobs KB, Kraft P, Wacholder S, Minichiello MJ, Fearnhead P, Yu K, Chatterjee N, Wang Z, Welch R, Staats BJ, Calle EE, Feigelson HS, Thun MJ, Rodriguez C, Albanes D, Virtamo J, Weinstein S, Schumacher FR, Giovannucci E, Willett WC, Cancel-Tassin G, Cussenot O, Valeri A, Andriole GL, Gelmann EP, Tucker M, Gerhard DS, Fraumeni JF, Hoover R, Hunter DJ, Chanock SJ, Thomas G. Genome-wide association study of prostate cancer identifies a second risk locus at 8q24. Nat Genet 2007; 39:645-9. [PMID: 17401363 DOI: 10.1038/ng2022] [Citation(s) in RCA: 871] [Impact Index Per Article: 51.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2006] [Accepted: 03/07/2007] [Indexed: 02/06/2023]
Abstract
Recently, common variants on human chromosome 8q24 were found to be associated with prostate cancer risk. While conducting a genome-wide association study in the Cancer Genetic Markers of Susceptibility project with 550,000 SNPs in a nested case-control study (1,172 cases and 1,157 controls of European origin), we identified a new association at 8q24 with an independent effect on prostate cancer susceptibility. The most significant signal is 70 kb centromeric to the previously reported SNP, rs1447295, but shows little evidence of linkage disequilibrium with it. A combined analysis with four additional studies (total: 4,296 cases and 4,299 controls) confirms association with prostate cancer for rs6983267 in the centromeric locus (P = 9.42 x 10(-13); heterozygote odds ratio (OR): 1.26, 95% confidence interval (c.i.): 1.13-1.41; homozygote OR: 1.58, 95% c.i.: 1.40-1.78). Each SNP remained significant in a joint analysis after adjusting for the other (rs1447295 P = 1.41 x 10(-11); rs6983267 P = 6.62 x 10(-10)). These observations, combined with compelling evidence for a recombination hotspot between the two markers, indicate the presence of at least two independent loci within 8q24 that contribute to prostate cancer in men of European ancestry. We estimate that the population attributable risk of the new locus, marked by rs6983267, is higher than the locus marked by rs1447295 (21% versus 9%).
Collapse
Affiliation(s)
- Meredith Yeager
- SAIC-Frederick, National Cancer Institute (NCI)-Frederick Cancer Research and Development Center, Frederick, Maryland 21702, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
38
|
Visscher PM. Variation of estimates of SNP and haplotype diversity and linkage disequilibrium in samples from the same population due to experimental and evolutionary sample size. Ann Hum Genet 2007; 71:119-26. [PMID: 17227482 DOI: 10.1111/j.1469-1809.2006.00305.x] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Studies of genetic polymorphisms and diversity between and within human populations are increasingly characterised by a very large number of genetic markers but using a relatively small number of individuals from which DNA samples were taken. In this report we examine the limitations of a small experimental sample size relative to a large genomic sample size, and quantify the sampling variance of a number of measures of diversity and linkage disequilibrium. The relationship between sample size and observed levels of polymorphism and haplotype diversity at the level of a gene is investigated under a neutral model of sequence evolution, using coalescent simulations. It is shown that the effect of evolutionary sampling, as manifested by differences between samples (genes) in measures of diversity estimated using very large sample sizes, is substantial, with a coefficient of variation of the number of detected polymorphic SNPs or haplotypes in the order of 15%. The effect of experimental design (sample size) is also very large, and a number of 'significant' results reported in the literature can be explained by sampling alone. The expected correlation coefficient of measures of linkage disequilibrium across samples from the same population has been quantified and found to be consistent with empirical estimates from the literature.
Collapse
Affiliation(s)
- P M Visscher
- Queensland Institute of Medical Research, Brisbane, Australia.
| |
Collapse
|
39
|
Broekmans FJ, Knauff EAH, te Velde ER, Macklon NS, Fauser BC. Female reproductive ageing: current knowledge and future trends. Trends Endocrinol Metab 2007; 18:58-65. [PMID: 17275321 DOI: 10.1016/j.tem.2007.01.004] [Citation(s) in RCA: 276] [Impact Index Per Article: 16.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/04/2006] [Revised: 12/22/2006] [Accepted: 01/24/2007] [Indexed: 10/23/2022]
Abstract
Over the past few decades, postponement of childbearing has led to a decrease in family size and increased rates of age-related female subfertility. Age-related decrease in ovarian follicle numbers and a decay in oocyte quality dictate the occurrence of natural loss of fecundity and, ultimately, menopause. The rate of this ovarian ageing process is highly variable among women. Identification of women who have severely decreased ovarian reserve for their age is, therefore, clinically relevant. Endocrine and imaging tests for ovarian reserve relate mainly to the quantitative aspect of ovarian reserve, but their capacity to predict the chances for pregnancy is limited. Genetic factors regulating the size of the follicle pool and the rate of its depletion might be identified in the near future and, possibly, assist the accurate prediction of a woman's reproductive lifespan.
Collapse
Affiliation(s)
- Frank J Broekmans
- Department of Reproductive Medicine and Gynecology, University Medical Center Utrecht, Utrecht, The Netherlands.
| | | | | | | | | |
Collapse
|
40
|
Venken T, Del-Favero J. Chasing genes for mood disorders and schizophrenia in genetically isolated populations. Hum Mutat 2007; 28:1156-70. [PMID: 17659644 DOI: 10.1002/humu.20582] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
Major affective disorders and schizophrenia are among the most common brain diseases worldwide and their predisposition is influenced by a complex interaction of genetic and environmental factors. So far, traditional linkage mapping studies for these complex disorders have not achieved the same success as the positional cloning of genes for Mendelian diseases. The struggle to identify susceptibility genes for complex disorders has stimulated the development of alternative approaches, including studies in genetically isolated populations. Since isolated populations are likely to have both a reduced number of genetic vulnerability factors and environmental background and are therefore considered to be more homogeneous compared to outbred populations, the use of isolated populations in genetic studies is expected to improve the chance of finding susceptibility loci and genes. Here we review the role of isolated populations, based on linkage and association studies, in the identification of susceptibility genes for bipolar disorder and schizophrenia.
Collapse
Affiliation(s)
- Tine Venken
- Applied Molecular Genomics Group, Department of Molecular Genetics, VIB, Antwerpen, Belgium
| | | |
Collapse
|
41
|
Paschou P, Mahoney MW, Javed A, Kidd JR, Pakstis AJ, Gu S, Kidd KK, Drineas P. Intra- and interpopulation genotype reconstruction from tagging SNPs. Genome Res 2006; 17:96-107. [PMID: 17151345 PMCID: PMC1716273 DOI: 10.1101/gr.5741407] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
The optimal method to be used for tSNP selection, the applicability of a reference LD map to unassayed populations, and the scalability of these methods to genome-wide analysis, all remain subjects of debate. We propose novel, scalable matrix algorithms that address these issues and we evaluate them on genotypic data from 38 populations and four genomic regions (248 SNPs typed for approximately 2000 individuals). We also evaluate these algorithms on a second data set consisting of genotypes available from the HapMap database (1336 SNPs for four populations) over the same genomic regions. Furthermore, we test these methods in the setting of a real association study using a publicly available family data set. The algorithms we use for tSNP selection and unassayed SNP reconstruction do not require haplotype inference and they are, in principle, scalable even to genome-wide analysis. Moreover, they are greedy variants of recently developed matrix algorithms with provable performance guarantees. Using a small set of carefully selected tSNPs, we achieve very good reconstruction accuracy of "untyped" genotypes for most of the populations studied. Additionally, we demonstrate in a quantitative manner that the chosen tSNPs exhibit substantial transferability, both within and across different geographic regions. Finally, we show that reconstruction can be applied to retrieve significant SNP associations with disease, with important genotyping savings.
Collapse
Affiliation(s)
- Peristera Paschou
- Department of Genetics, Yale University School of Medicine, New Haven, CT 06511, USA.
| | | | | | | | | | | | | | | |
Collapse
|
42
|
Hao K, Di X, Cawley S. LdCompare: rapid computation of single- and multiple-marker r2 and genetic coverage. Bioinformatics 2006; 23:252-4. [PMID: 17148510 DOI: 10.1093/bioinformatics/btl574] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
UNLABELLED The scale of genetic-variation datasets has increased enormously and the linkage equilibrium (LD) structure of these polymorphisms, particularly in whole-genome association studies, is of great interest. The significant computational complexity of calculating single- and multiple-marker correlations at a genome-wide scale remains challenging. We have developed a program that efficiently characterizes whole-genome LD structure on large number of SNPs in terms of single- and multiple-marker correlations. AVAILABILITY LdCompare is licensed under the GNU General Public License (GPL). Source code, documentation, testing datasets and precompiled executables are available for download at: http://www.affymetrix.com/support/developer/tools/devnettools.affx
Collapse
Affiliation(s)
- K Hao
- Algorithm and Data Analysis, Affymetrix Inc. 3420 Central Expressway, Santa Clara, California, USA.
| | | | | |
Collapse
|
43
|
Conrad DF, Jakobsson M, Coop G, Wen X, Wall JD, Rosenberg NA, Pritchard JK. A worldwide survey of haplotype variation and linkage disequilibrium in the human genome. Nat Genet 2006; 38:1251-60. [PMID: 17057719 DOI: 10.1038/ng1911] [Citation(s) in RCA: 355] [Impact Index Per Article: 19.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2006] [Accepted: 09/22/2006] [Indexed: 12/30/2022]
Abstract
Recent genomic surveys have produced high-resolution haplotype information, but only in a small number of human populations. We report haplotype structure across 12 Mb of DNA sequence in 927 individuals representing 52 populations. The geographic distribution of haplotypes reflects human history, with a loss of haplotype diversity as distance increases from Africa. Although the extent of linkage disequilibrium (LD) varies markedly across populations, considerable sharing of haplotype structure exists, and inferred recombination hotspot locations generally match across groups. The four samples in the International HapMap Project contain the majority of common haplotypes found in most populations: averaging across populations, 83% of common 20-kb haplotypes in a population are also common in the most similar HapMap sample. Consequently, although the portability of tag SNPs based on the HapMap is reduced in low-LD Africans, the HapMap will be helpful for the design of genome-wide association mapping studies in nearly all human populations.
Collapse
Affiliation(s)
- Donald F Conrad
- Department of Human Genetics, University of Chicago, 920 East 58th Street, Chicago, Illinois 60637, USA
| | | | | | | | | | | | | |
Collapse
|
44
|
Tantoso E, Yang Y, Li KB. How well do HapMap SNPs capture the untyped SNPs? BMC Genomics 2006; 7:238. [PMID: 16982009 PMCID: PMC1586200 DOI: 10.1186/1471-2164-7-238] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2006] [Accepted: 09/19/2006] [Indexed: 01/17/2023] Open
Abstract
Background The recent advancement in human genome sequencing and genotyping has revealed millions of single nucleotide polymorphisms (SNP) which determine the variation among human beings. One of the particular important projects is The International HapMap Project which provides the catalogue of human genetic variation for disease association studies. In this paper, we analyzed the genotype data in HapMap project by using National Institute of Environmental Health Sciences Environmental Genome Project (NIEHS EGP) SNPs. We first determine whether the HapMap data are transferable to the NIEHS data. Then, we study how well the HapMap SNPs capture the untyped SNPs in the region. Finally, we provide general guidelines for determining whether the SNPs chosen from HapMap may be able to capture most of the untyped SNPs. Results Our analysis shows that HapMap data are not robust enough to capture the untyped variants for most of the human genes. The performance of SNPs for European and Asian samples are marginal in capturing the untyped variants, i.e. approximately 55%. Expectedly, the SNPs from HapMap YRI panel can only capture approximately 30% of the variants. Although the overall performance is low, however, the SNPs for some genes perform very well and are able to capture most of the variants along the gene. This is observed in the European and Asian panel, but not in African panel. Through observation, we concluded that in order to have a well covered SNPs reference panel, the SNPs density and the association among reference SNPs are important to estimate the robustness of the chosen SNPs. Conclusion We have analyzed the coverage of HapMap SNPs using NIEHS EGP data. The results show that HapMap SNPs are transferable to the NIEHS SNPs. However, HapMap SNPs cannot capture some of the untyped SNPs and therefore resequencing may be needed to uncover more SNPs in the missing region.
Collapse
Affiliation(s)
- Erwin Tantoso
- Bioinformatics Institute, 30 Biopolis Street, #07-01 Matrix, 138671, Singapore
| | - Yuchen Yang
- Bioinformatics Institute, 30 Biopolis Street, #07-01 Matrix, 138671, Singapore
| | - Kuo-Bin Li
- Bioinformatics Center, National Yang-Ming University, Taipei, 112, Taiwan
| |
Collapse
|
45
|
Smith E, Wang X, Littrell J, Eckert J, Cole R, Kissebah A, Olivier M. Comparison of linkage disequilibrium patterns between the HapMap CEPH samples and a family-based cohort of Northern European descent. Genomics 2006; 88:407-14. [PMID: 16713172 PMCID: PMC2803324 DOI: 10.1016/j.ygeno.2006.04.004] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2006] [Revised: 03/21/2006] [Accepted: 04/11/2006] [Indexed: 02/04/2023]
Abstract
The International HapMap Consortium has determined the linkage disequilibrium (LD) patterns of four major human populations. The aim of our investigation was to compare the LD patterns of the HapMap CEPH (Centre d'Etude du Polymorphisme Humain) samples with a family-based cohort of similar ancestry to determine its usefulness as a reference population for disease association studies. We examined four genomic regions on chromosomes 7q, 12p, and 14q totaling 14.3 Mb, initially identified in our linkage study of obesity and the metabolic syndrome. Near identical patterns of LD were detected in both populations. Furthermore, tagSNPs selected based on the HapMap CEPH cohort data capture over 98% of the variants at an r2 > 0.8 in the disease cohort. This confirms the usefulness of the CEPH cohort of the HapMap as a reference sample for further investigations into the genomic variation of populations of Northern European descent.
Collapse
Affiliation(s)
| | | | | | | | | | | | - M. Olivier
- Corresponding author. Fax: +1 414 456 6516. (M. Olivier)
| |
Collapse
|
46
|
Howie BN, Carlson CS, Rieder MJ, Nickerson DA. Efficient selection of tagging single-nucleotide polymorphisms in multiple populations. Hum Genet 2006; 120:58-68. [PMID: 16680432 DOI: 10.1007/s00439-006-0182-5] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2006] [Accepted: 03/30/2006] [Indexed: 10/24/2022]
Abstract
Common genetic polymorphism may explain a portion of the heritable risk for common diseases, so considerable effort has been devoted to finding and typing common single-nucleotide polymorphisms (SNPs) in the human genome. Many SNPs show correlated genotypes, or linkage disequilibrium (LD), suggesting that only a subset of all SNPs (known as tagging SNPs, or tagSNPs) need to be genotyped for disease association studies. Based on the genetic differences that exist among human populations, most tagSNP sets are defined in a single population and applied only in populations that are closely related. To improve the efficiency of multi-population analyses, we have developed an algorithm called MultiPop-TagSelect that finds a near-minimal union of population-specific tagSNP sets across an arbitrary number of populations. We present this approach as an extension of LD-select, a tagSNP selection method that uses a greedy algorithm to group SNPs into bins based on their pairwise association patterns, although the MultiPop-TagSelect algorithm could be used with any SNP tagging approach that allows choices between nearly equivalent SNPs. We evaluate the algorithm by considering tagSNP selection in candidate-gene resequencing data and lower density whole-chromosome data. Our analysis reveals that an exhaustive search is often intractable, while the developed algorithm can quickly and reliably find near-optimal solutions even for difficult tagSNP selection problems. Using populations of African, Asian, and European ancestry, we also show that an optimal multi-population set of tagSNPs can be substantially smaller (up to 44%) than a typical set obtained through independent or sequential selection.
Collapse
Affiliation(s)
- Bryan N Howie
- Department of Genome Sciences, University of Washington, Box 357730, Seattle, WA 98195, USA
| | | | | | | |
Collapse
|