Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Schaid DJ. Genomic similarity and kernel methods I: advancements by building on mathematical and statistical foundations. Hum Hered 2010;70:109-31. [PMID: 20610906 DOI: 10.1159/000312641] [Citation(s) in RCA: 75] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2009] [Accepted: 03/09/2010] [Indexed: 01/05/2023] Open

For:	Schaid DJ. Genomic similarity and kernel methods I: advancements by building on mathematical and statistical foundations. Hum Hered 2010;70:109-31. [PMID: 20610906 DOI: 10.1159/000312641] [Citation(s) in RCA: 75] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2009] [Accepted: 03/09/2010] [Indexed: 01/05/2023] Open

Number

Cited by Other Article(s)

Rong Y, Zhao SD, Zheng X, Li Y. Kernel Cox partially linear regression: Building predictive models for cancer patients' survival. Stat Med 2024;43:1-15. [PMID: 37875428 DOI: 10.1002/sim.9938] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2023] [Revised: 09/30/2023] [Accepted: 10/03/2023] [Indexed: 10/26/2023]

Wendel B, Heidenreich M, Budde M, Heilbronner M, Oraki Kohshour M, Papiol S, Falkai P, Schulze TG, Heilbronner U, Bickeböller H. Kalpra: A kernel approach for longitudinal pathway regression analysis integrating network information with an application to the longitudinal PsyCourse Study. Front Genet 2022;13:1015885. [PMID: 36561312 PMCID: PMC9767414 DOI: 10.3389/fgene.2022.1015885] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2022] [Accepted: 11/24/2022] [Indexed: 12/12/2022] Open

Abstract

A popular approach to reduce the high dimensionality resulting from genome-wide association studies is to analyze a whole pathway in a single test for association with a phenotype. Kernel machine regression (KMR) is a highly flexible pathway analysis approach. Initially, KMR was developed to analyze a simple phenotype with just one measurement per individual. Recently, however, the investigation into the influence of genomic factors in the development of disease-related phenotypes across time (trajectories) has gained in importance. Thus, novel statistical approaches for KMR analyzing longitudinal data, i.e. several measurements at specific time points per individual are required. For longitudinal pathway analysis, we extend KMR to long-KMR using the estimation equivalence of KMR and linear mixed models. We include additional random effects to correct for the dependence structure. Moreover, within long-KMR we created a topology-based pathway analysis by combining this approach with a kernel including network information of the pathway. Most importantly, long-KMR not only allows for the investigation of the main genetic effect adjusting for time dependencies within an individual, but it also allows to test for the association of the pathway with the longitudinal course of the phenotype in the form of testing the genetic time-interaction effect. The approach is implemented as an R package, kalpra. Our simulation study demonstrates that the power of long-KMR exceeded that of another KMR method previously developed to analyze longitudinal data, while maintaining (slightly conservatively) the type I error. The network kernel improved the performance of long-KMR compared to the linear kernel. Considering different pathway densities, the power of the network kernel decreased with increasing pathway density. We applied long-KMR to cognitive data on executive function (Trail Making Test, part B) from the PsyCourse Study and 17 candidate pathways selected from Reactome. We identified seven nominally significant pathways.

Collapse

Arthur VL, Li Z, Cao R, Oetting WS, Israni AK, Jacobson PA, Ritchie MD, Guan W, Chen J. A Multi-Marker Test for Analyzing Paired Genetic Data in Transplantation. Front Genet 2021;12:745773. [PMID: 34721531 PMCID: PMC8548646 DOI: 10.3389/fgene.2021.745773] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2021] [Accepted: 09/23/2021] [Indexed: 12/02/2022] Open

Pluta D, Shen T, Xue G, Chen C, Ombao H, Yu Z. Ridge-penalized adaptive Mantel test and its application in imaging genetics. Stat Med 2021;40:5313-5332. [PMID: 34216035 DOI: 10.1002/sim.9127] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2020] [Revised: 06/01/2021] [Accepted: 06/16/2021] [Indexed: 01/23/2023]

Gong M, Liu P, Sciurba FC, Stojanov P, Tao D, Tseng GC, Zhang K, Batmanghelich K. Unpaired data empowers association tests. Bioinformatics 2021;37:785-792. [PMID: 33070196 PMCID: PMC8098021 DOI: 10.1093/bioinformatics/btaa886] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2020] [Revised: 09/07/2020] [Accepted: 10/05/2020] [Indexed: 11/25/2022] Open

Schaid DJ, Sinnwell JP, Larson NB, Chen J. Penalized variance components for association of multiple genes with traits. Genet Epidemiol 2021;44:665-675. [PMID: 33463755 DOI: 10.1002/gepi.22340] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2020] [Revised: 06/26/2020] [Accepted: 07/05/2020] [Indexed: 12/19/2022]

Deng Y, Wu S, Fan H. Genome-wide pathway-based quantitative multiple phenotypes analysis. PLoS One 2020;15:e0240910. [PMID: 33175855 PMCID: PMC7657528 DOI: 10.1371/journal.pone.0240910] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2020] [Accepted: 10/06/2020] [Indexed: 11/18/2022] Open

Yin T, König S. Genomic predictions of growth curves in Holstein dairy cattle based on parameter estimates from nonlinear models combined with different kernel functions. J Dairy Sci 2020;103:7222-7237. [PMID: 32534925 DOI: 10.3168/jds.2019-18010] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2019] [Accepted: 04/06/2020] [Indexed: 11/19/2022]

Abstract

Availability of longitudinal body weight (BW) records allows the application of nonlinear models (NLINM) to predict phenotypic and genomic growth curves in dairy cattle. In this regard, we considered a data set including 31,722 BW records from 4,952 female Holstein cattle, during the period from birth (mo 0) to approximately age at first calving (mo 24). Parameters of the growth curves were estimated using 3 NLINM: the logistic (LOG), the Gompertz (GOM), and the Richards (RICH) functions. Residuals for the growth curve parameters from the NLINM applications were used as pseudo-phenotypes in the ongoing genomic analyses with different similarity matrices, including 2 genomic relationship matrices (G1 and G2), a combined pedigree and genomic relationship matrix (H), and 3 kernel matrices. The kernels were a weighted "alike by state" kernel function (K1), an exponential dissimilarity kernel (K2), and a Gaussian kernel (K3). On the basis of G1 and G2 matrices, genomic heritabilities for the growth curve parameters birth weight (W₀), mature weight (W_m), and growth rate (k), and the shape parameter (m; only available from RICH) were moderate to large, in the range from 0.29 (m from RICH) to 0.46 (k from RICH). Fitting the similarity matrices based on kernel functions contributed to an increase of the ratio of the variance explained by the similarity matrix in relation to the total variance (compared with the heritability when modeling G1 or G2). Genetic correlations between W₀, W_m, and k were always positive (>0.30), especially for the same growth curve parameters estimated from different NLINM (>0.90). The shape parameter m from RICH was negatively correlated with other growth curve parameters, from -0.29 to -0.95. In a next step, estimated genomic breeding values for growth curve parameters were input data for the respective NLINM, aiming to construct genomic growth curves. Prediction accuracies were correlations between genomic growth curves and genomic breeding values from random regression models for sires and female cattle. Considering all genotyped female cattle with pseudo-phenotypes, prediction accuracies were larger from RICH than from LOG and GOM. However, differences in prediction accuracies from the NLINM × similarity matrix combinations were quite small. Accordingly, in 5-fold cross-validations using heifer groups with masked phenotypes, very similar prediction accuracies across modeling approaches were identified. Especially for specific age months, genomic growth curve predictions were more accurate for sires than for female cattle, indicating that the relationships between animals in training and validation sets are more important than the selection of specific NLINM × similarity matrix combinations.

Collapse

Agarwal D, Zhang NR. Semblance: An empirical similarity kernel on probability spaces. SCIENCE ADVANCES 2019;5:eaau9630. [PMID: 31840051 PMCID: PMC6892634 DOI: 10.1126/sciadv.aau9630] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/14/2018] [Accepted: 09/30/2019] [Indexed: 06/10/2023]

Shao F, Wang Y, Zhao Y, Yang S. Identifying and exploiting gene-pathway interactions from RNA-seq data for binary phenotype. BMC Genet 2019;20:36. [PMID: 30890140 PMCID: PMC6423879 DOI: 10.1186/s12863-019-0739-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2018] [Accepted: 03/12/2019] [Indexed: 11/29/2022] Open

Larson NB, Chen J, Schaid DJ. A review of kernel methods for genetic association studies. Genet Epidemiol 2019;43:122-136. [PMID: 30604442 DOI: 10.1002/gepi.22180] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2018] [Revised: 11/09/2018] [Accepted: 11/26/2018] [Indexed: 12/17/2022]

Budde M, Friedrichs S, Alliey-Rodriguez N, Ament S, Badner JA, Berrettini WH, Bloss CS, Byerley W, Cichon S, Comes AL, Coryell W, Craig DW, Degenhardt F, Edenberg HJ, Foroud T, Forstner AJ, Frank J, Gershon ES, Goes FS, Greenwood TA, Guo Y, Hipolito M, Hood L, Keating BJ, Koller DL, Lawson WB, Liu C, Mahon PB, McInnis MG, McMahon FJ, Meier SM, Mühleisen TW, Murray SS, Nievergelt CM, Nurnberger JI, Nwulia EA, Potash JB, Quarless D, Rice J, Roach JC, Scheftner WA, Schork NJ, Shekhtman T, Shilling PD, Smith EN, Streit F, Strohmaier J, Szelinger S, Treutlein J, Witt SH, Zandi PP, Zhang P, Zöllner S, Bickeböller H, Falkai PG, Kelsoe JR, Nöthen MM, Rietschel M, Schulze TG, Malzahn D. Efficient region-based test strategy uncovers genetic risk factors for functional outcome in bipolar disorder. Eur Neuropsychopharmacol 2019;29:156-170. [PMID: 30503783 DOI: 10.1016/j.euroneuro.2018.10.005] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/12/2018] [Revised: 10/16/2018] [Accepted: 10/23/2018] [Indexed: 11/21/2022]

Affiliation(s)

Monika Budde Institute of Psychiatric Phenomics and Genomics, University Hospital, LMU Munich, Nussbaumstr. 7, Munich 80336, Germany
Stefanie Friedrichs Department of Genetic Epidemiology, University Medical Center Göttingen, Georg-August-University, Göttingen 37099, Germany
Ney Alliey-Rodriguez Department of Psychiatry and Behavioral Neuroscience, University of Chicago, Chicago, IL 60637, United States
Seth Ament Institute for Systems Biology, Seattle, WA 98109, United States
Judith A Badner Department of Psychiatry, Rush University Medical Center, Chicago, IL 60612, United States
Wade H Berrettini Department of Psychiatry, University of Pennsylvania, Philadelphia, PA 19104, United States
Cinnamon S Bloss University of California San Diego, La Jolla, CA 92093, United States
William Byerley Department of Psychiatry, University of California at San Francisco, San Francisco, CA 94103, United States
Sven Cichon Human Genomics Research Group, Department of Biomedicine, University of Basel, Basel 4031, Switzerland; Institute of Medical Genetics and Pathology, University Hospital Basel, Basel 4031, Switzerland; Institute of Neuroscience and Medicine (INM-1), Research Centre Jülich, Jülich 52425, Germany
Ashley L Comes Institute of Psychiatric Phenomics and Genomics, University Hospital, LMU Munich, Nussbaumstr. 7, Munich 80336, Germany; International Max Planck Research School for Translational Psychiatry, Max Planck Institute of Psychiatry, Munich 80804, Germany
William Coryell University of Iowa Hospitals and Clinics, Iowa City, IA 52242, United States
David W Craig The Translational Genomics Research Institute, Phoenix, AZ 85004, United States
Franziska Degenhardt Institute of Human Genetics, School of Medicine & University Hospital Bonn, University of Bonn, Bonn 53127, Germany; Department of Genomics, Life & Brain Center, University of Bonn, Bonn 53127, Germany
Howard J Edenberg Department of Biochemistry and Molecular Biology, Indiana University School of Medicine, Indianapolis, IN 46202, United States; Department of Medical and Molecular Genetics, Indiana University School of Medicine, Indianapolis, IN 46202, United States
Tatiana Foroud Department of Medical and Molecular Genetics, Indiana University School of Medicine, Indianapolis, IN 46202, United States
Andreas J Forstner Institute of Human Genetics, School of Medicine & University Hospital Bonn, University of Bonn, Bonn 53127, Germany; Department of Genomics, Life & Brain Center, University of Bonn, Bonn 53127, Germany; Human Genomics Research Group, Department of Biomedicine, University of Basel, Basel 4031, Switzerland; Department of Psychiatry (UPK), University of Basel, Basel 4012, Switzerland
Josef Frank Department of Genetic Epidemiology in Psychiatry, Central Institute of Mental Health, Medical Faculty Mannheim, University of Heidelberg, Mannheim 68159, Germany
Elliot S Gershon Department of Psychiatry and Behavioral Neuroscience, University of Chicago, Chicago, IL 60637, United States
Fernando S Goes Department of Psychiatry and Behavioral Sciences, Johns Hopkins School of Medicine, Baltimore, MD 21287, United States
Tiffany A Greenwood Department of Psychiatry, University of California San Diego, San Diego, CA 92093, United States
Yiran Guo Center for Applied Genomics, Children's Hospital of Philadelphia, Abramson Research Center, Philadelphia, PA 19104, United States; Beijing Genomics Institute at Shenzhen, Shenzhen 518083, China
Maria Hipolito Department of Psychiatry and Behavioral Sciences, Howard University Hospital, Washington, DC 20060, United States
Leroy Hood Institute for Systems Biology, Seattle, WA 98109, United States
Brendan J Keating Cardiovascular Institute, University of Pennsylvania School of Medicine, Philadelphia, PA 19104-5159, United States; Institute for Translational Medicine and Therapeutics, School of Medicine, University of Pennsylvania, Philadelphia, PA 19104-5158, United States
Daniel L Koller Department of Medical and Molecular Genetics, Indiana University School of Medicine, Indianapolis, IN 46202, United States
William B Lawson Dell Medical School, University of Texas at Austin, Austin, TX 78723, United States
Chunyu Liu SUNY Upstate Medical University, Syracuse, NY 13210, United States
Pamela B Mahon Department of Psychiatry and Behavioral Sciences, Johns Hopkins School of Medicine, Baltimore, MD 21287, United States
Melvin G McInnis Department of Psychiatry, University of Michigan, Ann Arbor, MI 48105, United States
Francis J McMahon U.S. Department of Health & Human Services, Intramural Research Program, National Institute of Mental Health, National Institutes of Health, Bethesda, MD 20894, United States
Sandra M Meier Department of Genetic Epidemiology in Psychiatry, Central Institute of Mental Health, Medical Faculty Mannheim, University of Heidelberg, Mannheim 68159, Germany; National Centre for Register-Based Research, Aarhus University, Aarhus V 8210, Denmark
Thomas W Mühleisen Institute of Neuroscience and Medicine (INM-1), Research Centre Jülich, Jülich 52425, Germany; Human Genomics Research Group, Department of Biomedicine, University of Basel, Basel 4031, Switzerland
Sarah S Murray Scripps Genomic Medicine & The Scripps Translational Sciences Institute (STSI), La Jolla, CA 92037, United States; Department of Pathology, University of California San Diego, La Jolla, CA 92093, United States
Caroline M Nievergelt Department of Psychiatry, University of California San Diego, San Diego, CA 92093, United States
John I Nurnberger Department of Psychiatry, Indiana University School of Medicine, Indianapolis, IN 46202, United States
Evaristus A Nwulia Department of Psychiatry and Behavioral Sciences, Howard University Hospital, Washington, DC 20060, United States
James B Potash Department of Psychiatry, Carver College of Medicine, University of Iowa School of Medicine, Iowa City, IA 52242, United States
Danjuma Quarless J. Craig Venter Institute, La Jolla, CA 92037, United States; University of California San Diego, La Jolla, CA 92093, United States
John Rice Department of Psychiatry, Washington University School of Medicine in St. Louis, St. Louis, MO 63110, United States
Jared C Roach Institute for Systems Biology, Seattle, WA 98109, United States
William A Scheftner Rush University Medical Center, Chicago, IL 60612, United States
Nicholas J Schork J. Craig Venter Institute, La Jolla, CA 92037, United States; The Translational Genomics Research Institute, Phoenix, AZ 85004, United States; University of California San Diego, La Jolla, CA 92093, United States
Tatyana Shekhtman Department of Psychiatry, University of California San Diego, San Diego, CA 92093, United States
Paul D Shilling Department of Psychiatry, University of California San Diego, San Diego, CA 92093, United States
Erin N Smith Scripps Genomic Medicine & The Scripps Translational Sciences Institute (STSI), La Jolla, CA 92037, United States; Department of Pediatrics and Rady's Children's Hospital, School of Medicine, University of California San Diego, La Jolla, CA 92037, United States
Fabian Streit Department of Genetic Epidemiology in Psychiatry, Central Institute of Mental Health, Medical Faculty Mannheim, University of Heidelberg, Mannheim 68159, Germany
Jana Strohmaier Department of Genetic Epidemiology in Psychiatry, Central Institute of Mental Health, Medical Faculty Mannheim, University of Heidelberg, Mannheim 68159, Germany
Szabolcs Szelinger The Translational Genomics Research Institute, Phoenix, AZ 85004, United States
Jens Treutlein Department of Genetic Epidemiology in Psychiatry, Central Institute of Mental Health, Medical Faculty Mannheim, University of Heidelberg, Mannheim 68159, Germany
Stephanie H Witt Department of Genetic Epidemiology in Psychiatry, Central Institute of Mental Health, Medical Faculty Mannheim, University of Heidelberg, Mannheim 68159, Germany
Peter P Zandi Department of Mental Health, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD 21205, United States
Peng Zhang Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, United States
Sebastian Zöllner Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, United States; Department of Psychiatry, University of Michigan, Ann Arbor, MI 48105, United States
Heike Bickeböller Department of Genetic Epidemiology, University Medical Center Göttingen, Georg-August-University, Göttingen 37099, Germany
Peter G Falkai Department of Psychiatry and Psychotherapy, University Hospital, LMU Munich, Munich 80336, Germany
John R Kelsoe Department of Psychiatry, University of California San Diego, San Diego, CA 92093, United States
Markus M Nöthen Institute of Human Genetics, School of Medicine & University Hospital Bonn, University of Bonn, Bonn 53127, Germany; Department of Genomics, Life & Brain Center, University of Bonn, Bonn 53127, Germany
Marcella Rietschel Department of Genetic Epidemiology in Psychiatry, Central Institute of Mental Health, Medical Faculty Mannheim, University of Heidelberg, Mannheim 68159, Germany
Thomas G Schulze Institute of Psychiatric Phenomics and Genomics, University Hospital, LMU Munich, Nussbaumstr. 7, Munich 80336, Germany; Department of Genetic Epidemiology in Psychiatry, Central Institute of Mental Health, Medical Faculty Mannheim, University of Heidelberg, Mannheim 68159, Germany; Department of Psychiatry and Behavioral Sciences, Johns Hopkins School of Medicine, Baltimore, MD 21287, United States; U.S. Department of Health & Human Services, Intramural Research Program, National Institute of Mental Health, National Institutes of Health, Bethesda, MD 20894, United States.
Dörthe Malzahn Department of Genetic Epidemiology, University Medical Center Göttingen, Georg-August-University, Göttingen 37099, Germany.

Collapse

Yang H, Cao H, He T, Wang T, Cui Y. Multilevel heterogeneous omics data integration with kernel fusion. Brief Bioinform 2018;21:156-170. [PMID: 30496340 DOI: 10.1093/bib/bby115] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2018] [Revised: 10/25/2018] [Accepted: 10/26/2018] [Indexed: 01/26/2023] Open

He T, Li S, Zhong PS, Cui Y. An optimal kernel-based U -statistic method for quantitative gene-set association analysis. Genet Epidemiol 2018;43:137-149. [DOI: 10.1002/gepi.22170] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2018] [Revised: 08/19/2018] [Accepted: 09/26/2018] [Indexed: 11/09/2022]

Yasmeen S, Burger P, Friedrichs S, Papiol S, Bickeböller H. Relating drug response to epigenetic and genetic markers using a region-based kernel score test. BMC Proc 2018;12:47. [PMID: 30275895 PMCID: PMC6157113 DOI: 10.1186/s12919-018-0154-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open

Statistical methods and challenges in connectome genetics. Stat Probab Lett 2018. [DOI: 10.1016/j.spl.2018.02.048] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]

Reexamining Dis/Similarity-Based Tests for Rare-Variant Association with Case-Control Samples. Genetics 2018;209:105-113. [PMID: 29545466 DOI: 10.1534/genetics.118.300769] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2018] [Accepted: 03/02/2018] [Indexed: 11/18/2022] Open

Abstract

A properly designed distance-based measure can capture informative genetic differences among individuals with different phenotypes and can be used to detect variants responsible for the phenotypes. To detect associated variants, various tests have been designed to contrast genetic dissimilarity or similarity scores of certain subject groups in different ways, among which the most widely used strategy is to quantify the difference between the within-group genetic dissimilarity/similarity (i.e., case-case and control-control similarities) and the between-group dissimilarity/similarity (i.e., case-control similarities). While it has been noted that for common variants, the within-group and the between-group measures should all be included; in this work, we show that for rare variants, comparison based on the two within-group measures can more effectively quantify the genetic difference between cases and controls. The between-group measure tends to overlap with one of the two within-group measures for rare variants, although such overlap is not present for common variants. Consequently, a dissimilarity or similarity test that includes the between-group information tends to attenuate the association signals and leads to power loss. Based on these findings, we propose a dissimilarity test that compares the degree of SNP dissimilarity within cases to that within controls to better characterize the difference between two disease phenotypes. We provide the statistical properties, asymptotic distribution, and computation details for a small sample size of the proposed test. We use simulated and real sequence data to assess the performance of the proposed test, comparing it with other rare-variant methods including those similarity-based tests that use both within-group and between-group information. As similarity-based approaches serve as one of the dominating approaches in rare-variant analysis, our results provide some insight for the effective detection of rare variants.

Collapse

Randolph TW, Zhao S, Copeland W, Hullar M, Shojaie A. KERNEL-PENALIZED REGRESSION FOR ANALYSIS OF MICROBIOME DATA. Ann Appl Stat 2018;12:540-566. [PMID: 30224943 PMCID: PMC6138053 DOI: 10.1214/17-aoas1102] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

Zhu B, Song N, Shen R, Arora A, Machiela MJ, Song L, Landi MT, Ghosh D, Chatterjee N, Baladandayuthapani V, Zhao H. Integrating Clinical and Multiple Omics Data for Prognostic Assessment across Human Cancers. Sci Rep 2017;7:16954. [PMID: 29209073 PMCID: PMC5717223 DOI: 10.1038/s41598-017-17031-8] [Citation(s) in RCA: 60] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2017] [Accepted: 11/20/2017] [Indexed: 02/06/2023] Open

Xu Z, Wu C, Pan W. Imaging-wide association study: Integrating imaging endophenotypes in GWAS. Neuroimage 2017;159:159-169. [PMID: 28736311 PMCID: PMC5671364 DOI: 10.1016/j.neuroimage.2017.07.036] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2017] [Revised: 06/22/2017] [Accepted: 07/18/2017] [Indexed: 10/19/2022] Open

A Powerful Framework for Integrating eQTL and GWAS Summary Data. Genetics 2017;207:893-902. [PMID: 28893853 DOI: 10.1534/genetics.117.300270] [Citation(s) in RCA: 53] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2017] [Accepted: 09/05/2017] [Indexed: 01/26/2023] Open

Islam S, Anand S, Hamid J, Thabane L, Beyene J. Comparing the performance of linear and nonlinear principal components in the context of high-dimensional genomic data integration. Stat Appl Genet Mol Biol 2017;16:199-216. [PMID: 28727569 DOI: 10.1515/sagmb-2016-0066] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]

Powerful Genetic Association Analysis for Common or Rare Variants with High-Dimensional Structured Traits. Genetics 2017. [PMID: 28642271 DOI: 10.1534/genetics.116.199646] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open

Malzahn D, Friedrichs S, Bickeböller H. Comparing strategies for combined testing of rare and common variants in whole sequence and genome-wide genotype data. BMC Proc 2016;10:269-273. [PMID: 27980648 PMCID: PMC5133495 DOI: 10.1186/s12919-016-0042-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open

Dandine-Roulland C, Perdry H. The Use of the Linear Mixed Model in Human Genetics. Hum Hered 2016;80:196-206. [PMID: 27576760 DOI: 10.1159/000447634] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open

Xu HM, Xu LF, Hou TT, Luo LF, Chen GB, Sun XW, Lou XY. GMDR: Versatile Software for Detecting Gene-Gene and Gene-Environ- ment Interactions Underlying Complex Traits. Curr Genomics 2016;17:396-402. [PMID: 28479868 PMCID: PMC5320543 DOI: 10.2174/1389202917666160513102612] [Citation(s) in RCA: 41] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2015] [Revised: 04/10/2015] [Accepted: 04/15/2015] [Indexed: 11/22/2022] Open

Yang H, Li S, Cao H, Zhang C, Cui Y. Predicting disease trait with genomic data: a composite kernel approach. Brief Bioinform 2016;18:591-601. [DOI: 10.1093/bib/bbw043] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2016] [Indexed: 01/17/2023] Open

Friedrichs S, Malzahn D, Pugh EW, Almeida M, Liu XQ, Bailey JN. Filtering genetic variants and placing informative priors based on putative biological function. BMC Genet 2016;17 Suppl 2:8. [PMID: 26866982 PMCID: PMC4895695 DOI: 10.1186/s12863-015-0313-x] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open

Kanagawa M, Nishiyama Y, Gretton A, Fukumizu K. Filtering with State-Observation Examples via Kernel Monte Carlo Filter. Neural Comput 2015;28:382-444. [PMID: 26654205 DOI: 10.1162/neco_a_00806] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]

Zhu N, Heinrich V, Dickhaus T, Hecht J, Robinson PN, Mundlos S, Kamphans T, Krawitz PM. Strategies to improve the performance of rare variant association studies by optimizing the selection of controls. Bioinformatics 2015;31:3577-83. [PMID: 26249812 DOI: 10.1093/bioinformatics/btv457] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2015] [Accepted: 07/30/2015] [Indexed: 11/12/2022] Open

Chen F, He J, Zhang J, Chen GK, Thomas V, Ambrosone CB, Bandera EV, Berndt SI, Bernstein L, Blot WJ, Cai Q, Carpten J, Casey G, Chanock SJ, Cheng I, Chu L, Deming SL, Driver WR, Goodman P, Hayes RB, Hennis AJM, Hsing AW, Hu JJ, Ingles SA, John EM, Kittles RA, Kolb S, Leske MC, Millikan RC, Monroe KR, Murphy A, Nemesure B, Neslund-Dudas C, Nyante S, Ostrander EA, Press MF, Rodriguez-Gil JL, Rybicki BA, Schumacher F, Stanford JL, Signorello LB, Strom SS, Stevens V, Van Den Berg D, Wang Z, Witte JS, Wu SY, Yamamura Y, Zheng W, Ziegler RG, Stram AH, Kolonel LN, Marchand LL, Henderson BE, Haiman CA, Stram DO. Methodological Considerations in Estimation of Phenotype Heritability Using Genome-Wide SNP Data, Illustrated by an Analysis of the Heritability of Height in a Large Sample of African Ancestry Adults. PLoS One 2015;10:e0131106. [PMID: 26125186 PMCID: PMC4488332 DOI: 10.1371/journal.pone.0131106] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2014] [Accepted: 05/28/2015] [Indexed: 01/02/2023] Open

Abstract

Height has an extremely polygenic pattern of inheritance. Genome-wide association studies (GWAS) have revealed hundreds of common variants that are associated with human height at genome-wide levels of significance. However, only a small fraction of phenotypic variation can be explained by the aggregate of these common variants. In a large study of African-American men and women (n = 14,419), we genotyped and analyzed 966,578 autosomal SNPs across the entire genome using a linear mixed model variance components approach implemented in the program GCTA (Yang et al Nat Genet 2010), and estimated an additive heritability of 44.7% (se: 3.7%) for this phenotype in a sample of evidently unrelated individuals. While this estimated value is similar to that given by Yang et al in their analyses, we remain concerned about two related issues: (1) whether in the complete absence of hidden relatedness, variance components methods have adequate power to estimate heritability when a very large number of SNPs are used in the analysis; and (2) whether estimation of heritability may be biased, in real studies, by low levels of residual hidden relatedness. We addressed the first question in a semi-analytic fashion by directly simulating the distribution of the score statistic for a test of zero heritability with and without low levels of relatedness. The second question was addressed by a very careful comparison of the behavior of estimated heritability for both observed (self-reported) height and simulated phenotypes compared to imputation R² as a function of the number of SNPs used in the analysis. These simulations help to address the important question about whether today's GWAS SNPs will remain useful for imputing causal variants that are discovered using very large sample sizes in future studies of height, or whether the causal variants themselves will need to be genotyped de novo in order to build a prediction model that ultimately captures a large fraction of the variability of height, and by implication other complex phenotypes. Our overall conclusions are that when study sizes are quite large (5,000 or so) the additive heritability estimate for height is not apparently biased upwards using the linear mixed model; however there is evidence in our simulation that a very large number of causal variants (many thousands) each with very small effect on phenotypic variance will need to be discovered to fill the gap between the heritability explained by known versus unknown causal variants. We conclude that today's GWAS data will remain useful in the future for causal variant prediction, but that finding the causal variants that need to be predicted may be extremely laborious.

Collapse

Affiliation(s)

Fang Chen Department of Preventive Medicine, Keck School of Medicine and Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, CA, United States of America
Jing He Department of Preventive Medicine, Keck School of Medicine and Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, CA, United States of America
Jianqi Zhang Department of Preventive Medicine, Keck School of Medicine and Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, CA, United States of America
Gary K. Chen Department of Preventive Medicine, Keck School of Medicine and Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, CA, United States of America
Venetta Thomas Sylvester Comprehensive Cancer Center and Department of Epidemiology and Public Health, University of Miami Miller School of Medicine, Miami, FL, United States of America
Christine B. Ambrosone Department of Cancer Prevention and Control, Roswell Park Cancer Institute, Buffalo, NY, United States of America
Elisa V. Bandera The Cancer Institute of New Jersey, New Brunswick, NJ, United States of America
Sonja I. Berndt Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Bethesda, MD, United States of America
Leslie Bernstein Division of Cancer Etiology, Department of Population Science, Beckman Research Institute, City of Hope, CA, United States of America
William J. Blot International Epidemiology Institute, Rockville, MD, United States of America Division of Epidemiology, Department of Medicine, Vanderbilt Epidemiology Center, Vanderbilt University and the Vanderbilt-Ingram Cancer Center, Nashville, TN, United States of America
Qiuyin Cai Division of Epidemiology, Department of Medicine, Vanderbilt Epidemiology Center, Vanderbilt University and the Vanderbilt-Ingram Cancer Center, Nashville, TN, United States of America
John Carpten The Translational Genomics Research Institute, Phoenix, AZ, United States of America
Graham Casey Department of Preventive Medicine, Keck School of Medicine and Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, CA, United States of America
Stephen J. Chanock Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Bethesda, MD, United States of America
Iona Cheng Epidemiology Program, Cancer Research Center, University of Hawaii, Honolulu, HI, United States of America
Lisa Chu Cancer Prevention Institute of California, Fremont, CA, United States of America
Sandra L. Deming Division of Epidemiology, Department of Medicine, Vanderbilt Epidemiology Center, Vanderbilt University and the Vanderbilt-Ingram Cancer Center, Nashville, TN, United States of America
W. Ryan Driver Epidemiology Research Program, American Cancer Society, Atlanta, GA, United States of America
Phyllis Goodman Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, United States of America
Richard B. Hayes Division of Epidemiology, Department of Environmental Medicine, New York University Langone Medical Center, New York, NY, United States of America
Anselm J. M. Hennis Chronic Disease Research Centre and Faculty of Medical Sciences, University of the West Indies, Bridgetown, Barbados Department of Preventive Medicine, Stony Brook University, Stony Brook, NY, United States of America
Ann W. Hsing Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Bethesda, MD, United States of America
Jennifer J. Hu Sylvester Comprehensive Cancer Center and Department of Epidemiology and Public Health, University of Miami Miller School of Medicine, Miami, FL, United States of America
Sue A. Ingles Department of Preventive Medicine, Keck School of Medicine and Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, CA, United States of America
Esther M. John Cancer Prevention Institute of California, Fremont, CA, United States of America Division of Epidemiology, Department of Health Research & Policy, Stanford University School of Medicine and Stanford Cancer Institute, Stanford, CA, United States of America
Rick A. Kittles Department of Medicine, University of Illinois at Chicago, Chicago, IL, United States of America
Suzanne Kolb Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, United States of America
M. Cristina Leske Department of Preventive Medicine, Stony Brook University, Stony Brook, NY, United States of America
Robert C. Millikan Department of Epidemiology, Gillings School of Global Public Health, and Lineberger Comprehensive Cancer Center, University of North Carolina, Chapel Hill, NC, United States of America
Kristine R. Monroe Department of Preventive Medicine, Keck School of Medicine and Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, CA, United States of America
Adam Murphy Department of Urology, Northwestern University, Chicago, IL, United States of America
Barbara Nemesure Department of Preventive Medicine, Stony Brook University, Stony Brook, NY, United States of America
Christine Neslund-Dudas Department of Biostatistics and Research Epidemiology, Henry Ford Hospital, Detroit, MI, United States of America
Sarah Nyante Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Bethesda, MD, United States of America
Elaine A Ostrander Cancer Genetics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, United States of America
Michael F. Press Department of Pathology, Keck School of Medicine and Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, CA, United States of America
Jorge L. Rodriguez-Gil Sylvester Comprehensive Cancer Center and Department of Epidemiology and Public Health, University of Miami Miller School of Medicine, Miami, FL, United States of America
Ben A. Rybicki Department of Biostatistics and Research Epidemiology, Henry Ford Hospital, Detroit, MI, United States of America
Fredrick Schumacher Department of Preventive Medicine, Keck School of Medicine and Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, CA, United States of America
Janet L. Stanford Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, United States of America
Lisa B. Signorello Department of Epidemiology, Harvard School of Public Health, Boston, MA, United States of America
Sara S. Strom Department of Epidemiology, The University of Texas M.D. Anderson Cancer Center, Houston, TX, United States of America
Victoria Stevens Epidemiology Research Program, American Cancer Society, Atlanta, GA, United States of America
David Van Den Berg Department of Preventive Medicine, Keck School of Medicine and Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, CA, United States of America
Zhaoming Wang Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Bethesda, MD, United States of America
John S. Witte Institute for Human Genetics, Department of Epidemiology and Biostatistics, University of California San Francisco, San Francisco, CA, United States of America
Suh-Yuh Wu Department of Preventive Medicine, Stony Brook University, Stony Brook, NY, United States of America
Yuko Yamamura Institute for Human Genetics, Department of Epidemiology and Biostatistics, University of California San Francisco, San Francisco, CA, United States of America
Wei Zheng Division of Epidemiology, Department of Medicine, Vanderbilt Epidemiology Center, Vanderbilt University and the Vanderbilt-Ingram Cancer Center, Nashville, TN, United States of America
Regina G. Ziegler Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Bethesda, MD, United States of America
Alexander H. Stram Department of Preventive Medicine, Keck School of Medicine and Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, CA, United States of America
Laurence N. Kolonel Epidemiology Program, Cancer Research Center, University of Hawaii, Honolulu, HI, United States of America
Loïc Le Marchand Epidemiology Program, Cancer Research Center, University of Hawaii, Honolulu, HI, United States of America
Brian E. Henderson Department of Preventive Medicine, Keck School of Medicine and Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, CA, United States of America
Christopher A. Haiman Department of Preventive Medicine, Keck School of Medicine and Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, CA, United States of America
Daniel O. Stram Department of Preventive Medicine, Keck School of Medicine and Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, CA, United States of America * E-mail:

Collapse

Pan W, Chen YM, Wei P. Testing for polygenic effects in genome-wide association studies. Genet Epidemiol 2015;39:306-16. [PMID: 25847094 DOI: 10.1002/gepi.21899] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2014] [Revised: 01/30/2015] [Accepted: 02/23/2015] [Indexed: 12/20/2022]

Abstract

To confirm associations with a large number of single nucleotide polymorphisms (SNPs), each with only a small effect size, as hypothesized in the polygenic theory for schizophrenia, the International Schizophrenia Consortium (2009, Nature 460:748-752) proposed a polygenic risk score (PRS) test and demonstrated its effectiveness when applied to psychiatric disorders. The basic idea of the PRS test is to use a half of the sample to select and up-weight those more likely to be associated SNPs, and then use the other half of the sample to test for aggregated effects of the selected SNPs. Intrigued by the novelty and increasing use of the PRS test, we aimed to evaluate and improve its performance for GWAS data. First, by an analysis of the PRS test, we point out its connection with the Sum test [Chapman and Whittaker, Genet Epidemiol 32:560-566; Pan, Genet Epidemiol 33:497-507]; given the known advantages and disadvantages of the Sum test, this connection motivated the development of several other polygenic tests, some of which may be more powerful than the PRS test under certain situations. Second, more importantly, to overcome the low statistical efficiency of the data-splitting strategy as adopted in the PRS test, we reformulate and thus modify the PRS test, obtaining several adaptive tests, which are closely related to the adaptive sum of powered score (SPU) test studied in the context of rare variant analysis [Pan et al., 2014, Genetics 197:1081-1095]. We use both simulated data and a real GWAS dataset of alcohol dependence to show dramatically improved power of the new tests over the PRS test; due to its superior performance and simplicity, we recommend the whole sample-based adaptive SPU test for polygenic testing. We hope to raise the awareness of the limitations of the PRS test and potential power gain of the adaptive SPU test.

Collapse

Wang Z, Maity A, Hsiao CK, Voora D, Kaddurah-Daouk R, Tzeng JY. Module-based association analysis for omics data with network structure. PLoS One 2015;10:e0122309. [PMID: 25822417 PMCID: PMC4378989 DOI: 10.1371/journal.pone.0122309] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2014] [Accepted: 02/20/2015] [Indexed: 02/06/2023] Open

Wang X, Xing EP, Schaid DJ. Kernel methods for large-scale genomic data analysis. Brief Bioinform 2015;16:183-92. [PMID: 25053743 PMCID: PMC4375394 DOI: 10.1093/bib/bbu024] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2014] [Accepted: 05/20/2014] [Indexed: 11/12/2022] Open

Massively expedited genome-wide heritability analysis (MEGHA). Proc Natl Acad Sci U S A 2015;112:2479-84. [PMID: 25675487 DOI: 10.1073/pnas.1415603112] [Citation(s) in RCA: 50] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

Abstract

The discovery and prioritization of heritable phenotypes is a computational challenge in a variety of settings, including neuroimaging genetics and analyses of the vast phenotypic repositories in electronic health record systems and population-based biobanks. Classical estimates of heritability require twin or pedigree data, which can be costly and difficult to acquire. Genome-wide complex trait analysis is an alternative tool to compute heritability estimates from unrelated individuals, using genome-wide data that are increasingly ubiquitous, but is computationally demanding and becomes difficult to apply in evaluating very large numbers of phenotypes. Here we present a fast and accurate statistical method for high-dimensional heritability analysis using genome-wide SNP data from unrelated individuals, termed massively expedited genome-wide heritability analysis (MEGHA) and accompanying nonparametric sampling techniques that enable flexible inferences for arbitrary statistics of interest. MEGHA produces estimates and significance measures of heritability with several orders of magnitude less computational time than existing methods, making heritability-based prioritization of millions of phenotypes based on data from unrelated individuals tractable for the first time to our knowledge. As a demonstration of application, we conducted heritability analyses on global and local morphometric measurements derived from brain structural MRI scans, using genome-wide SNP data from 1,320 unrelated young healthy adults of non-Hispanic European ancestry. We also computed surface maps of heritability for cortical thickness measures and empirically localized cortical regions where thickness measures were significantly heritable. Our analyses demonstrate the unique capability of MEGHA for large-scale heritability-based screening and high-dimensional heritability profile construction.

Collapse

Pan W. Relationship between genomic distance-based regression and kernel machine regression for multi-marker association testing. Genet Epidemiol 2015;35:211-6. [PMID: 21308765 DOI: 10.1002/gepi.20567] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2010] [Revised: 11/21/2010] [Accepted: 01/04/2011] [Indexed: 11/10/2022]

Ge T, Nichols TE, Ghosh D, Mormino EC, Smoller JW, Sabuncu MR. A kernel machine method for detecting effects of interaction between multidimensional variable sets: an imaging genetics application. Neuroimage 2015;109:505-514. [PMID: 25600633 DOI: 10.1016/j.neuroimage.2015.01.029] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2014] [Revised: 01/06/2015] [Accepted: 01/09/2015] [Indexed: 11/19/2022] Open

Assessing gene-environment interactions for common and rare variants with binary traits using gene-trait similarity regression. Genetics 2015;199:695-710. [PMID: 25585620 DOI: 10.1534/genetics.114.171686] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

Family L, Bensen JT, Troester MA, Wu MC, Anders CK, Olshan AF. Single-nucleotide polymorphisms in DNA bypass polymerase genes and association with breast cancer and breast cancer subtypes among African Americans and Whites. Breast Cancer Res Treat 2015;149:181-90. [PMID: 25417172 PMCID: PMC4498665 DOI: 10.1007/s10549-014-3203-4] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2014] [Accepted: 11/09/2014] [Indexed: 01/18/2023]

Wang X, Epstein MP, Tzeng JY. Analysis of gene-gene interactions using gene-trait similarity regression. Hum Hered 2014;78:17-26. [PMID: 24969398 DOI: 10.1159/000360161] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2013] [Accepted: 01/30/2014] [Indexed: 12/14/2022] Open

Malzahn D, Friedrichs S, Rosenberger A, Bickeböller H. Kernel score statistic for dependent data. BMC Proc 2014;8:S41. [PMID: 25519324 PMCID: PMC4143755 DOI: 10.1186/1753-6561-8-s1-s41] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open

Tzeng JY, Lu W, Hsu FC. GENE-LEVEL PHARMACOGENETIC ANALYSIS ON SURVIVAL OUTCOMES USING GENE-TRAIT SIMILARITY REGRESSION. Ann Appl Stat 2014;8:1232-1255. [PMID: 25018788 DOI: 10.1214/14-aoas735] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

King CR, Nicolae DL. GWAS to Sequencing: Divergence in Study Design and Analysis. Genes (Basel) 2014;5:460-76. [PMID: 24879455 PMCID: PMC4094943 DOI: 10.3390/genes5020460] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2013] [Revised: 05/13/2014] [Accepted: 05/15/2014] [Indexed: 12/03/2022] Open

Vrieze SI, Feng S, Miller MB, Hicks BM, Pankratz N, Abecasis GR, Iacono WG, McGue M. Rare nonsynonymous exonic variants in addiction and behavioral disinhibition. Biol Psychiatry 2014;75:783-9. [PMID: 24094508 PMCID: PMC3975816 DOI: 10.1016/j.biopsych.2013.08.027] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/17/2013] [Revised: 08/02/2013] [Accepted: 08/26/2013] [Indexed: 10/26/2022]

Zeng P, Zhao Y, Zhang L, Huang S, Chen F. Rare variants detection with kernel machine learning based on likelihood ratio test. PLoS One 2014;9:e93355. [PMID: 24675868 PMCID: PMC3968153 DOI: 10.1371/journal.pone.0093355] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2013] [Accepted: 03/03/2014] [Indexed: 11/18/2022] Open

Røislien J, Samset E. A non-parametric permutation method for assessing agreement for distance matrix observations. Stat Med 2014;33:319-29. [PMID: 23946159 DOI: 10.1002/sim.5927] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2011] [Revised: 05/16/2013] [Accepted: 07/08/2013] [Indexed: 11/08/2022]

Larson NB, Jenkins GD, Larson MC, Vierkant RA, Sellers TA, Phelan CM, Schildkraut JM, Sutphen R, Pharoah PPD, Gayther SA, Wentzensen N, Goode EL, Fridley BL. Kernel canonical correlation analysis for assessing gene-gene interactions and application to ovarian cancer. Eur J Hum Genet 2014;22:126-31. [PMID: 23591404 PMCID: PMC3865403 DOI: 10.1038/ejhg.2013.69] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2012] [Revised: 01/11/2013] [Accepted: 01/16/2013] [Indexed: 01/24/2023] Open

Affiliation(s)

Nicholas B Larson Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA
Gregory D Jenkins Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA
Melissa C Larson Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA
Robert A Vierkant Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA
Thomas A Sellers Cancer Epidemiology, Moffitt Cancer Center, Tampa, FL, USA
Catherine M Phelan Cancer Epidemiology, Moffitt Cancer Center, Tampa, FL, USA
Joellen M Schildkraut Duke Comprehensive Cancer Center, Duke University, Durham, NC, USA
Rebecca Sutphen Department of Pediatrics, Universty of South Florida College of Medicine, Tampa, FL, USA
Paul P D Pharoah Department of Oncology, University of Cambridge, Cambridge, UK
Simon A Gayther Department of Preventative Medicine, University of Southern California, Los Angeles, CA, USA
Nicolas Wentzensen Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
Ovarian Cancer Association Consortium Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA Cancer Epidemiology, Moffitt Cancer Center, Tampa, FL, USA Duke Comprehensive Cancer Center, Duke University, Durham, NC, USA Department of Pediatrics, Universty of South Florida College of Medicine, Tampa, FL, USA Department of Oncology, University of Cambridge, Cambridge, UK Department of Preventative Medicine, University of Southern California, Los Angeles, CA, USA Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA Department of Biostatistics, University of Kansas Medical Center, Kansas City, KS, USA
Ellen L Goode Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA
Brooke L Fridley Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA Department of Biostatistics, University of Kansas Medical Center, Kansas City, KS, USA

Collapse

Thomas DC, Yang Z, Yang F. Two-phase and family-based designs for next-generation sequencing studies. Front Genet 2013;4:276. [PMID: 24379824 PMCID: PMC3861783 DOI: 10.3389/fgene.2013.00276] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2013] [Accepted: 11/19/2013] [Indexed: 12/21/2022] Open

Abstract

The cost of next-generation sequencing is now approaching that of early GWAS panels, but is still out of reach for large epidemiologic studies and the millions of rare variants expected poses challenges for distinguishing causal from non-causal variants. We review two types of designs for sequencing studies: two-phase designs for targeted follow-up of genomewide association studies using unrelated individuals; and family-based designs exploiting co-segregation for prioritizing variants and genes. Two-phase designs subsample subjects for sequencing from a larger case-control study jointly on the basis of their disease and carrier status; the discovered variants are then tested for association in the parent study. The analysis combines the full sequence data from the substudy with the more limited SNP data from the main study. We discuss various methods for selecting this subset of variants and describe the expected yield of true positive associations in the context of an on-going study of second breast cancers following radiotherapy. While the sharing of variants within families means that family-based designs are less efficient for discovery than sequencing unrelated individuals, the ability to exploit co-segregation of variants with disease within families helps distinguish causal from non-causal ones. Furthermore, by enriching for family history, the yield of causal variants can be improved and use of identity-by-descent information improves imputation of genotypes for other family members. We compare the relative efficiency of these designs with those using unrelated individuals for discovering and prioritizing variants or genes for testing association in larger studies. While associations can be tested with single variants, power is low for rare ones. Recent generalizations of burden or kernel tests for gene-level associations to family-based data are appealing. These approaches are illustrated in the context of a family-based study of colorectal cancer.

Collapse

Qu L, Guennel T, Marshall SL. Linear score tests for variance components in linear mixed models and applications to genetic association studies. Biometrics 2013;69:883-92. [PMID: 24328714 DOI: 10.1111/biom.12095] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2012] [Revised: 06/01/2013] [Accepted: 07/01/2013] [Indexed: 01/16/2023]

Hoffman GE. Correcting for population structure and kinship using the linear mixed model: theory and extensions. PLoS One 2013;8:e75707. [PMID: 24204578 PMCID: PMC3810480 DOI: 10.1371/journal.pone.0075707] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2013] [Accepted: 08/20/2013] [Indexed: 01/20/2023] Open