Reference Citation Analysis

For: Zhang K, Calabrese P, Nordborg M, Sun F. Haplotype block structure and its applications to association studies: power and study designs. Am J Hum Genet 2002;71:1386-94. [PMID: 12439824 PMCID: PMC378580 DOI: 10.1086/344780] [Citation(s) in RCA: 207] [Impact Index Per Article: 9.4] [Received: 08/14/2002] [Accepted: 09/16/2002] [Indexed: 11/04/2022]

Number

Cited by Other Article(s)

101

Hirota Y, Ohara T, Zenibayashi M, Kuno SI, Fukuyama K, Teranishi T, Kouyama K, Miyake K, Maeda E, Kasuga M. Lack of association of CPT1A polymorphisms or haplotypes on hepatic lipid content or insulin resistance in Japanese individuals with type 2 diabetes mellitus. Metabolism 2007;56:656-61. [PMID: 17445541 DOI: 10.1016/j.metabol.2006.12.014] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Received: 10/26/2006] [Accepted: 12/18/2006] [Indexed: 10/23/2022]

102

Rakovski CS, Xu X, Lazarus R, Blacker D, Laird NM. A new multimarker test for family-based association studies. Genet Epidemiol 2007;31:9-17. [PMID: 17086514 DOI: 10.1002/gepi.20186] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.6] [Indexed: 11/11/2022]

103

Mao W, He J, Brinza D, Zelikovsky A. A combinatorial method for predicting genetic susceptibility to complex diseases. Conf Proc IEEE Eng Med Biol Soc 2007;2006:224-7. [PMID: 17282153 DOI: 10.1109/iembs.2005.1616384] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Indexed: 11/08/2022]

104

Sun YV, Levin AM, Boerwinkle E, Robertson H, Kardia SLR. A scan statistic for identifying chromosomal patterns of SNP association. Genet Epidemiol 2007;30:627-35. [PMID: 16858698 DOI: 10.1002/gepi.20173] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.9] [Indexed: 11/09/2022]

105

Zheng M, McPeek MS. Multipoint linkage-disequilibrium mapping with haplotype-block structure. Am J Hum Genet 2007;80:112-25. [PMID: 17160899 PMCID: PMC1785316 DOI: 10.1086/510685] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Received: 07/19/2006] [Accepted: 11/07/2006] [Indexed: 01/17/2023]

106

Yoshikawa Y, Nakayama T, Saito K, Hui P, Morita A, Sato N, Takahashi T, Tamura M, Sato I, Aoi N, Doba N, Hinohara S, Soma M, Usami R. Haplotype-Based Case-Control Study of the Association between the Guanylate Cyclase Activator 2B (GUCA2B, Uroguanylin) Gene and Essential Hypertension. Hypertens Res 2007;30:789-96. [DOI: 10.1291/hypres.30.789] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Indexed: 11/15/2022]

107

Ding K, Kullo IJ. Methods for the selection of tagging SNPs: a comparison of tagging efficiency and performance. Eur J Hum Genet 2006;15:228-36. [PMID: 17164795 DOI: 10.1038/sj.ejhg.5201755] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.4] [Indexed: 11/08/2022]

108

Ding J, Nicklas BJ, Fallin MD, de Rekeneire N, Kritchevsky SB, Pahor M, Rodondi N, Li R, Zmuda JM, Harris TB. Plasminogen activator inhibitor type 1 gene polymorphisms and haplotypes are associated with plasma plasminogen activator inhibitor type 1 levels but not with myocardial infarction or stroke. Am Heart J 2006;152:1109-15. [PMID: 17161063 DOI: 10.1016/j.ahj.2006.06.021] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.4] [Received: 10/06/2005] [Accepted: 06/07/2006] [Indexed: 11/29/2022]

109

Menon R, Fortunato SJ, Thorsen P, Williams S. Genetic associations in preterm birth: a primer of marker selection, study design, and data analysis. J Soc Gynecol Investig 2006;13:531-41. [PMID: 17088082 DOI: 10.1016/j.jsgi.2006.09.006] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.8] [Received: 02/21/2006] [Indexed: 01/16/2023]

110

Moskvina V, Schmidt KM. Individual SNP allele reconstruction from informative markers selected by a non-linear Gauss-type algorithm. Hum Hered 2006;62:97-106. [PMID: 17047339 DOI: 10.1159/000096097] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Received: 05/19/2006] [Accepted: 08/02/2006] [Indexed: 11/19/2022]

111

Li J, Zhang MQ, Zhang X. A new method for detecting human recombination hotspots and its applications to the HapMap ENCODE data. Am J Hum Genet 2006;79:628-39. [PMID: 16960799 PMCID: PMC1592557 DOI: 10.1086/508066] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.0] [Received: 05/09/2006] [Accepted: 07/25/2006] [Indexed: 11/03/2022]

112

Bardel C, Darlu P, Génin E. Clustering of haplotypes based on phylogeny: how good a strategy for association testing? Eur J Hum Genet 2006;14:202-6. [PMID: 16306882 DOI: 10.1038/sj.ejhg.5201501] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Indexed: 11/08/2022]

113

Kimmel G, Shamir R. A fast method for computing high-significance disease association in large population-based studies. Am J Hum Genet 2006;79:481-92. [PMID: 16909386 PMCID: PMC1559554 DOI: 10.1086/507317] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.1] [Received: 04/27/2006] [Accepted: 06/27/2006] [Indexed: 11/03/2022]

114

Ward K. Microarray technology in obstetrics and gynecology: a guide for clinicians. Am J Obstet Gynecol 2006;195:364-72. [PMID: 16615920 PMCID: PMC7093878 DOI: 10.1016/j.ajog.2005.12.014] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.1] [Received: 10/03/2005] [Revised: 11/29/2005] [Accepted: 12/05/2005] [Indexed: 11/28/2022]

115

Ayers KL, Sabatti C, Lange K. Reconstructing ancestral haplotypes with a dictionary model. J Comput Biol 2006;13:767-85. [PMID: 16706724 DOI: 10.1089/cmb.2006.13.767] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Indexed: 11/12/2022]

116

Hanson RL, Looker HC, Ma L, Muller YL, Baier LJ, Knowler WC. Design and analysis of genetic association studies to finely map a locus identified by linkage analysis: sample size and power calculations. Ann Hum Genet 2006;70:332-49. [PMID: 16674556 DOI: 10.1111/j.1529-8817.2005.00230.x] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.1] [Indexed: 11/26/2022]

Abstract

Association (e.g. case-control) studies are often used to finely map loci identified by linkage analysis. We investigated the influence of various parameters on power and sample size requirements for such a study. Calculations were performed for various values of the frequency of a high-risk functional allele (fA), the frequency of a marker allele associated with the high-risk allele (f1), the degree of linkage disequilibrium between functional and marker alleles (D') and the trait heritability attributable to the functional locus (h²). The calculations show that if cases and controls are selected from equal but opposite extreme quantiles of a quantitative trait, the primary determinants of power are h² and the specific quantiles selected. For a dichotomous trait, power also depends on population prevalence. Power is optimal if functional alleles are studied (fA = f1 and D' = 1.0) and can decrease substantially as D' diverges from 1.0 or as f1 diverges from fA. These analyses suggest that association studies to finely map loci are most powerful if potential functional polymorphisms are identified a priori or if markers are typed to maximize haplotypic diversity. In the absence of such information, expected minimum power at a given location for a given sample size can be calculated by specifying a range of potential frequencies for fA (e.g. 0.1-0.9) and determining power for all markers within the region with specification of the expected D' between the markers and the functional locus. This method is illustrated for a fine-mapping project with 662 single nucleotide polymorphisms in 24 Mb. Regions differed by marker density and allele frequencies. Thus, in some, power was near its theoretical maximum and little additional information is expected from additional markers, while in others, additional markers appear to be necessary. These methods may be useful in the analysis and interpretation of fine-mapping studies.
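
The abstract above treats D' as a key determinant of power. As a minimal sketch (not taken from the cited paper), the standard normalized disequilibrium coefficient between a functional allele and a marker allele can be computed from their frequencies and the frequency of the haplotype carrying both. The function name `d_prime` and its parameter names are illustrative assumptions.

```python
def d_prime(p_ab: float, p_a: float, p_b: float) -> float:
    """Return |D'| for two biallelic loci.

    p_ab : frequency of the haplotype carrying both allele A and allele B
    p_a  : frequency of allele A (e.g. the functional allele, fA)
    p_b  : frequency of allele B (e.g. the marker allele, f1)
    """
    # Raw disequilibrium: observed haplotype frequency minus the
    # frequency expected if the two loci were independent.
    d = p_ab - p_a * p_b
    # The largest |D| attainable depends on the sign of D and on the
    # allele frequencies.
    if d >= 0:
        d_max = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1 - p_a) * (1 - p_b))
    return 0.0 if d_max == 0 else abs(d) / d_max


# Marker and functional allele coincide (fA = f1, always co-inherited).
print(d_prime(p_ab=0.2, p_a=0.2, p_b=0.2))
# Partial disequilibrium between two common alleles.
print(d_prime(p_ab=0.4, p_a=0.5, p_b=0.5))
```

The first call corresponds to the case the abstract describes as optimal for power, where the marker is the functional allele itself.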

117

Liu PY, Lu Y, Deng HW. Accurate haplotype inference for multiple linked single-nucleotide polymorphisms using sibship data. Genetics 2006;174:499-509. [PMID: 16783022 PMCID: PMC1569787 DOI: 10.1534/genetics.105.054213] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Indexed: 11/18/2022]

118

Nicolas P, Sun F, Li LM. A model-based approach to selection of tag SNPs. BMC Bioinformatics 2006;7:303. [PMID: 16776821 PMCID: PMC1525207 DOI: 10.1186/1471-2105-7-303] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.1] [Received: 01/13/2006] [Accepted: 06/15/2006] [Indexed: 11/23/2022]

Abstract

Background

Single Nucleotide Polymorphisms (SNPs) are the most common type of polymorphisms found in the human genome. Effective genetic association studies require the identification of sets of tag SNPs that capture as much haplotype information as possible. Tag SNP selection is analogous to the problem of data compression in information theory. According to Shannon's framework, the optimal tag set maximizes the entropy of the tag SNPs subject to constraints on the number of SNPs. This approach requires an appropriate probabilistic model. Compared to simple measures of Linkage Disequilibrium (LD), a good model of haplotype sequences can more accurately account for LD structure. It also provides machinery for predicting tagged SNPs and thereby for assessing the performance of tag sets through their ability to predict larger SNP sets.

Results

Here, we compute the description code-lengths of SNP data for an array of models and we develop tag SNP selection methods based on these models and the strategy of entropy maximization. Using data sets from the HapMap and ENCODE projects, we show that the hidden Markov model introduced by Li and Stephens outperforms the other models in several aspects: description code-length of SNP data, information content of tag sets, and prediction of tagged SNPs. This is the first use of this model in the context of tag SNP selection.

Conclusion

Our study provides strong evidence that the tag sets selected by our best method, based on the Li and Stephens model, outperform those chosen by several existing methods. The results also suggest that information content evaluated with a good model is more sensitive for assessing the quality of a tagging set than the correct prediction rate of tagged SNPs. In addition, we show that haplotype phase uncertainty has an almost negligible impact on the ability of good tag sets to predict tagged SNPs. This justifies the selection of tag SNPs on the basis of haplotype informativeness, although genotyping studies do not directly assess haplotypes. Software that implements our approach is available.
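
The entropy-maximization idea described in this abstract can be illustrated with a small sketch. It is not the authors' method: the paper evaluates entropy under probabilistic models such as the Li and Stephens hidden Markov model, whereas the sketch below assumes the simplest possible model, the empirical haplotype frequencies, and searches exhaustively. The function names and the toy haplotypes are hypothetical.

```python
from collections import Counter
from itertools import combinations
from math import log2


def entropy(haplotypes, columns):
    """Empirical Shannon entropy (bits) of haplotypes restricted to `columns`."""
    counts = Counter(tuple(h[c] for c in columns) for h in haplotypes)
    n = len(haplotypes)
    return -sum((k / n) * log2(k / n) for k in counts.values())


def best_tag_set(haplotypes, k):
    """Exhaustively pick the k SNP columns with the largest joint entropy.

    Ties are resolved in favour of the first subset in lexicographic order.
    Only feasible for small numbers of SNPs.
    """
    n_snps = len(haplotypes[0])
    return max(combinations(range(n_snps), k),
               key=lambda cols: entropy(haplotypes, cols))


# Toy data: SNP 0 duplicates SNP 1, and SNP 2 duplicates SNP 3.
haps = ["0000", "0011", "1100", "1111"]
print(entropy(haps, (0, 1)))   # redundant pair
print(best_tag_set(haps, 2))   # one SNP from each redundant pair
```

In the toy data a redundant pair carries only half the information of the full haplotype, so the selected tag set takes one SNP from each correlated pair.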

119

Maekawa K, Itoda M, Sai K, Saito Y, Kaniwa N, Shirao K, Hamaguchi T, Kunitoh H, Yamamoto N, Tamura T, Minami H, Kubota K, Ohtsu A, Yoshida T, Saijo N, Kamatani N, Ozawa S, Sawada JI. Genetic variation and haplotype structure of the ABC transporter gene ABCG2 in a Japanese population. Drug Metab Pharmacokinet 2006;21:109-21. [PMID: 16702730 DOI: 10.2133/dmpk.21.109] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.9] [Indexed: 11/30/2022]

120

Kukita Y, Miyatake K, Stokowski R, Hinds D, Higasa K, Wake N, Hirakawa T, Kato H, Matsuda T, Pant K, Cox D, Tahira T, Hayashi K. Genome-wide definitive haplotypes determined using a collection of complete hydatidiform moles. Genome Res 2006;15:1511-8. [PMID: 16251461 PMCID: PMC1310639 DOI: 10.1101/gr.4371105] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.8] [Indexed: 12/22/2022]

121

Maksymowych WP, Rahman P, Reeve JP, Gladman DD, Peddle L, Inman RD. Association of the IL1 gene cluster with susceptibility to ankylosing spondylitis: an analysis of three Canadian populations. Arthritis Rheum 2006;54:974-85. [PMID: 16508980 DOI: 10.1002/art.21642] [Citation(s) in RCA: 55] [Impact Index Per Article: 3.1] [Indexed: 01/22/2023]

122

Liu PY, Zhang YY, Lu Y, Long JR, Shen H, Zhao LJ, Xu FH, Xiao P, Xiong DH, Liu YJ, Recker RR, Deng HW. A survey of haplotype variants at several disease candidate genes: the importance of rare variants for complex diseases. J Med Genet 2006;42:221-7. [PMID: 15744035 PMCID: PMC1736011 DOI: 10.1136/jmg.2004.024752] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.3] [Indexed: 11/04/2022]

123

Kimmel G, Shamir R. A block-free hidden Markov model for genotypes and its application to disease association. J Comput Biol 2006;12:1243-60. [PMID: 16379532 DOI: 10.1089/cmb.2005.12.1243] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.6] [Indexed: 12/29/2022]

124

Sabbagh A, Darlu P. SNP selection at the NAT2 locus for an accurate prediction of the acetylation phenotype. Genet Med 2006;8:76-85. [PMID: 16481889 DOI: 10.1097/01.gim.0000200951.54346.d6] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.0] [Indexed: 11/26/2022]

125

Yuan A, Chen G, Rotimi C, Bonney GE. A statistical framework for haplotype block inference. J Bioinform Comput Biol 2006;3:1021-38. [PMID: 16278945 DOI: 10.1142/s021972000500151x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Received: 11/13/2004] [Revised: 03/29/2005] [Accepted: 03/30/2005] [Indexed: 11/18/2022]

126

Greenspan G, Geiger D. Modeling haplotype block variation using Markov chains. Genetics 2005;172:2583-99. [PMID: 16361244 PMCID: PMC1456412 DOI: 10.1534/genetics.105.042978] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Indexed: 11/18/2022]

127

Nothnagel M, Rohde K. The effect of single-nucleotide polymorphism marker selection on patterns of haplotype blocks and haplotype frequency estimates. Am J Hum Genet 2005;77:988-98. [PMID: 16380910 PMCID: PMC1285181 DOI: 10.1086/498175] [Citation(s) in RCA: 64] [Impact Index Per Article: 3.4] [Received: 06/13/2004] [Accepted: 09/16/2004] [Indexed: 11/03/2022]

128

Iles MM. The Effect of SNP Marker Density on the Efficacy of Haplotype Tagging SNPs - a Warning. Ann Hum Genet 2005. [DOI: 10.1046/j.1469-1809.2004.00141.x] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Indexed: 11/20/2022]

129

Sutherland AM, Russell JA. Issues with Polymorphism Analysis in Sepsis. Clin Infect Dis 2005;41 Suppl 7:S396-402. [PMID: 16237637 DOI: 10.1086/431989] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.9] [Indexed: 11/03/2022]

130

Hüffmeier U, Lascorz J, Traupe H, Böhm B, Schürmeier-Horst F, Ständer M, Kelsch R, Baumann C, Küster W, Burkhardt H, Reis A. Systematic Linkage Disequilibrium Analysis of SLC12A8 at PSORS5 Confirms a Role in Susceptibility to Psoriasis Vulgaris. J Invest Dermatol 2005;125:906-12. [PMID: 16297188 DOI: 10.1111/j.0022-202x.2005.23847.x] [Citation(s) in RCA: 37] [Impact Index Per Article: 1.9] [Indexed: 01/07/2023]

131

Zhang K, Sun F. Assessing the power of tag SNPs in the mapping of quantitative trait loci (QTL) with extremal and random samples. BMC Genet 2005;6:51. [PMID: 16236175 PMCID: PMC1274312 DOI: 10.1186/1471-2156-6-51] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Received: 06/08/2005] [Accepted: 10/19/2005] [Indexed: 11/10/2022]

Abstract

BACKGROUND

Recent studies have indicated that the human genome could be divided into regions with low haplotype diversity interspersed with regions of high haplotype diversity. In regions of low haplotype diversity, a small fraction of SNPs (tag SNPs) are sufficient to account for most of the haplotype diversity of the human genome. These tag SNPs can be extremely useful for testing the association of a marker locus with a qualitative or quantitative trait locus in that it may not be necessary to genotype all the SNPs. When tag SNPs are used to reduce the genotyping effort in association studies, it is important to know how much power is lost. It is also important to know how much power is gained when tag SNPs instead of the same number of randomly chosen SNPs are used.

RESULTS

We design a simulation study to tackle these problems for a variety of quantitative association tests using either case-parent samples or unrelated population samples. First, the samples are generated based on the quantitative trait model with the assumption of either an extremal sampling scheme or a random sampling scheme. Second, a small number of samples are selected to determine the haplotype blocks and the tag SNPs. Third, the statistical power of the tests is evaluated using four kinds of data: (1) all the SNPs and the corresponding haplotypes, (2) the tag SNPs and the corresponding haplotypes, (3) the same number of evenly spaced SNPs with minor allele frequency greater than a threshold and the corresponding haplotypes, (4) the same number of randomly chosen SNPs and their corresponding haplotypes.

CONCLUSION

Our results suggest that in most situations genotyping efforts can be significantly reduced by using tag SNPs for mapping the QTL in association studies without much loss of power, which is consistent with previous studies on association mapping of qualitative traits. For all situations considered, two-locus haplotype analyses using tag SNPs are more powerful than those using the same number of randomly selected SNPs, but the size of this power difference depends upon the sampling scheme and the population history.

132

Palmer LJ, Cardon LR. Shaking the tree: mapping complex disease genes with linkage disequilibrium. Lancet 2005;366:1223-34. [PMID: 16198771 DOI: 10.1016/s0140-6736(05)67485-5] [Citation(s) in RCA: 165] [Impact Index Per Article: 8.7] [Indexed: 11/26/2022]

133

De La Vega FM, Gordon D, Su X, Scafe C, Isaac H, Gilbert DA, Spier EG. Power and Sample Size Calculations for Genetic Case/Control Studies Using Gene-Centric SNP Maps: Application to Human Chromosomes 6, 21, and 22 in Three Populations. Hum Hered 2005;60:43-60. [PMID: 16137993 DOI: 10.1159/000087918] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.0] [Received: 11/29/2004] [Accepted: 07/12/2005] [Indexed: 01/29/2023]

Abstract

Power and sample size calculations are critical parts of any research design for genetic association. We present a method that utilizes haplotype frequency information and average marker-marker linkage disequilibrium on SNPs typed in and around all genes on a chromosome. The test statistic used is the classic likelihood ratio test applied to haplotypes in case/control populations. Haplotype frequencies are computed through specification of genetic model parameters. Power is determined by computation of the test's non-centrality parameter. Power per gene is computed as a weighted average of the power assuming each haplotype is associated with the trait. We apply our method to genotype data from dense SNP maps across three entire chromosomes (6, 21, and 22) for three different human populations (African-American, Caucasian, Chinese), three different models of disease (additive, dominant, and multiplicative) and two trait allele frequencies (rare, common). We perform a regression analysis using these factors, average marker-marker disequilibrium, and the haplotype diversity across the gene region to determine which factors most significantly affect average power for a gene in our data. Also, as a 'proof of principle' calculation, we perform power and sample size calculations for all genes within 100 kb of the PSORS1 locus (chromosome 6) for a previously published association study of psoriasis. Results of our regression analysis indicate that four highly significant factors that determine average power to detect association are: disease model, average marker-marker disequilibrium, haplotype diversity, and the trait allele frequency. These findings may have important implications for the design of well-powered candidate gene association studies. Our power and sample size calculations for the PSORS1 gene appear consistent with published findings, namely that there is substantial power (>0.99) for most genes within 100 kb of the PSORS1 locus at the 0.01 significance level.

134

Hamblin MT, Salas Fernandez MG, Casa AM, Mitchell SE, Paterson AH, Kresovich S. Equilibrium processes cannot explain high levels of short- and medium-range linkage disequilibrium in the domesticated grass Sorghum bicolor. Genetics 2005;171:1247-56. [PMID: 16157678 PMCID: PMC1456844 DOI: 10.1534/genetics.105.041566] [Citation(s) in RCA: 63] [Impact Index Per Article: 3.3] [Indexed: 01/23/2023]

135

Terry KL, De Vivo I, Titus-Ernstoff L, Shih MC, Cramer DW. Androgen receptor cytosine, adenine, guanine repeats, and haplotypes in relation to ovarian cancer risk. Cancer Res 2005;65:5974-81. [PMID: 15994977 PMCID: PMC1364476 DOI: 10.1158/0008-5472.can-04-3885] [Citation(s) in RCA: 72] [Impact Index Per Article: 3.8] [Indexed: 11/16/2022]

136

Jones R, Pembrey M, Golding J, Herrick D. The search for genotype/phenotype associations and the phenome scan. Paediatr Perinat Epidemiol 2005;19:264-75. [PMID: 15958149 DOI: 10.1111/j.1365-3016.2005.00664.x] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.5] [Indexed: 11/29/2022]

137

Zhao J, Boerwinkle E, Xiong M. An entropy-based statistic for genomewide association studies. Am J Hum Genet 2005;77:27-40. [PMID: 15931594 PMCID: PMC1226192 DOI: 10.1086/431243] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.2] [Received: 12/01/2004] [Accepted: 04/19/2005] [Indexed: 11/04/2022]

138

Rinaldo A, Bacanu SA, Devlin B, Sonpar V, Wasserman L, Roeder K. Characterization of multilocus linkage disequilibrium. Genet Epidemiol 2005;28:193-206. [PMID: 15637716 DOI: 10.1002/gepi.20056] [Citation(s) in RCA: 94] [Impact Index Per Article: 4.9] [Indexed: 12/31/2022]

139

Cheng R, Ma JZ, Elston RC, Li MD. Fine mapping functional sites or regions from case-control data using haplotypes of multiple linked SNPs. Ann Hum Genet 2005;69:102-12. [PMID: 15638831 DOI: 10.1046/j.1529-8817.2004.00140.x] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.9] [Indexed: 11/20/2022]

140

Zeggini E, Barton A, Eyre S, Ward D, Ollier W, Worthington J, John S. Characterisation of the genomic architecture of human chromosome 17q and evaluation of different methods for haplotype block definition. BMC Genet 2005;6:21. [PMID: 15850495 PMCID: PMC1090572 DOI: 10.1186/1471-2156-6-21] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Received: 11/17/2004] [Accepted: 04/25/2005] [Indexed: 11/10/2022]

141

Ding K, Zhang J, Zhou K, Shen Y, Zhang X. htSNPer1.0: software for haplotype block partition and htSNPs selection. BMC Bioinformatics 2005;6:38. [PMID: 15740612 PMCID: PMC1274247 DOI: 10.1186/1471-2105-6-38] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.2] [Received: 06/25/2004] [Accepted: 03/01/2005] [Indexed: 01/17/2023]

142

van den Oord EJCG. Controlling false discoveries in candidate gene studies. Mol Psychiatry 2005;10:230-1. [PMID: 15738930 DOI: 10.1038/sj.mp.4001581] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.4] [Indexed: 11/08/2022]

143

Terry KL, De Vivo I, Titus-Ernstoff L, Sluss PM, Cramer DW. Genetic variation in the progesterone receptor gene and ovarian cancer risk. Am J Epidemiol 2005;161:442-51. [PMID: 15718480 PMCID: PMC1380205 DOI: 10.1093/aje/kwi064] [Citation(s) in RCA: 58] [Impact Index Per Article: 3.1] [Indexed: 11/13/2022]

144

Hu X, Schrodi SJ, Ross DA, Cargill M. Selecting tagging SNPs for association studies using power calculations from genotype data. Hum Hered 2005;57:156-70. [PMID: 15297809 DOI: 10.1159/000079246] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.1] [Received: 12/17/2003] [Accepted: 04/13/2004] [Indexed: 11/19/2022]

145

Nakajima Y, Saito Y, Shiseki K, Fukushima-Uesaka H, Hasegawa R, Ozawa S, Sugai K, Katoh M, Saitoh O, Ohnuma T, Kawai M, Ohtsuki T, Suzuki C, Minami N, Kimura H, Goto YI, Kamatani N, Kaniwa N, Sawada JI. Haplotype structures of EPHX1 and their effects on the metabolism of carbamazepine-10,11-epoxide in Japanese epileptic patients. Eur J Clin Pharmacol 2005;61:25-34. [PMID: 15692831 DOI: 10.1007/s00228-004-0878-1] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.3] [Received: 05/06/2004] [Accepted: 11/26/2004] [Indexed: 10/25/2022]

146

Sun X, Stephens JC, Zhao H. The impact of sample size and marker selection on the study of haplotype structures. Hum Genomics 2005;1:179-93. [PMID: 15588478 PMCID: PMC3525083 DOI: 10.1186/1479-7364-1-3-179] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.1] [Indexed: 12/02/2022]

147

Zhang X, Roeder K, Wallstrom G, Devlin B. Integration of association statistics over genomic regions using Bayesian adaptive regression splines. Hum Genomics 2005;1:20-9. [PMID: 15601530 PMCID: PMC3525002 DOI: 10.1186/1479-7364-1-1-20] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.2] [Indexed: 11/29/2022]

148

Roeder K, Bacanu SA, Sonpar V, Zhang X, Devlin B. Analysis of single-locus tests to detect gene/disease associations. Genet Epidemiol 2005;28:207-19. [PMID: 15637715 DOI: 10.1002/gepi.20050] [Citation(s) in RCA: 83] [Impact Index Per Article: 4.4] [Indexed: 11/06/2022]

Abstract

A goal of association analysis is to determine whether variation in a particular candidate region or gene is associated with liability to complex disease. To evaluate such candidates, ubiquitous Single Nucleotide Polymorphisms (SNPs) are useful. It is critical, however, to select a set of SNPs that are in substantial linkage disequilibrium (LD) with all other polymorphisms in the region. Whether there is an ideal statistical framework to test such a set of 'tag SNPs' for association is unknown. Compared to tests for association based on frequencies of haplotypes, recent evidence suggests tests for association based on linear combinations of the tag SNPs (Hotelling T² test) are more powerful. Following this logical progression, we wondered whether single-locus tests would prove generally more powerful than the regression-based tests. We answer this question by investigating four inferential procedures: the maximum of a series of test statistics corrected for multiple testing by the Bonferroni procedure, T_B, or by permutation of case-control status, T_P; a procedure that tests the maximum of a smoothed curve fitted to the series of test statistics, T_S; and the Hotelling T² procedure, which we call T_R. These procedures are evaluated by simulating data like that from human populations, including realistic levels of LD and realistic effects of alleles conferring liability to disease. We find that power depends on the correlation structure of SNPs within a gene, the density of tag SNPs, and the placement of the liability allele. The clearest pattern emerges between power and the number of SNPs selected. When a large fraction of the SNPs within a gene are tested, and multiple SNPs are highly correlated with the liability allele, T_S has better power. Using a SNP selection scheme that optimizes power but also requires a substantial number of SNPs to be genotyped (roughly 10-20 SNPs per gene), power of T_P is generally superior to that for the other procedures, including T_R. Finally, when a SNP selection procedure that targets a minimal number of SNPs per gene is applied, the average performances of T_P and T_R are indistinguishable.
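
The Bonferroni-corrected maximum statistic mentioned in this abstract can be sketched as follows. The abstract does not say which single-locus statistic is used, so this sketch assumes each SNP yields a z-score that is approximately standard normal under the null hypothesis. The function name `bonferroni_max_test` is hypothetical.

```python
from math import erfc, sqrt


def bonferroni_max_test(z_scores):
    """Bonferroni-adjusted p-value for the most extreme of m single-SNP tests.

    Assumes each z-score is approximately N(0, 1) under the null.
    """
    m = len(z_scores)
    z_max = max(abs(z) for z in z_scores)
    # Two-sided p-value of the largest |z|: P(|Z| > z_max) = erfc(z_max / sqrt(2)).
    p_min = erfc(z_max / sqrt(2))
    # Bonferroni: multiply by the number of tests and cap at 1.
    return min(1.0, m * p_min)


# Ten SNPs in a gene, one of which shows a strong signal.
print(bonferroni_max_test([3.0, 0.4, -1.1, 0.2, 0.9, -0.3, 1.5, 0.0, -0.7, 0.6]))
```

Because Bonferroni ignores correlation among SNPs, it is conservative when SNPs are in strong LD, which is one reason a permutation-based correction can have better power.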

149

Flanders WD, Khoury MJ, Yang QH, Austin H. Tests of trait-haplotype association when linkage phase is ambiguous, appropriate for matched case-control and cohort studies with competing risks. Stat Med 2005;24:2299-316. [PMID: 16015677 DOI: 10.1002/sim.2156] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Indexed: 11/11/2022]

150

Harrap SB. Blood Pressure Genetics. Hypertension 2005. [DOI: 10.1016/b978-0-7216-0258-5.50095-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/27/2022]