Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Cheung C, Thompson E, Wijsman E. GIGI: an approach to effective imputation of dense genotypes on large pedigrees. Am J Hum Genet 2013;92:504-16. [PMID: 23561844 DOI: 10.1016/j.ajhg.2013.02.011] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.0] [Received: 12/10/2012] [Revised: 01/15/2013] [Accepted: 02/27/2013] [Indexed: 12/11/2022] Open

Cited by Other Article(s)

Magalhães Borges V, Horimoto ARVR, Wijsman EM, Kimura L, Nunes K, Nato AQ, Mingroni-Netto RC. Genomic Exploration of Essential Hypertension in African-Brazilian Quilombo Populations: A Comprehensive Approach with Pedigree Analysis and Family-Based Association Studies. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.06.26.24309531. [PMID: 38978678 PMCID: PMC11230341 DOI: 10.1101/2024.06.26.24309531] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 07/10/2024]

Abstract

Essential Hypertension (EH) is a major global health concern, causing about 9.4 million deaths annually. Its prevalence varies across different regions, affecting 17% of the population in the Americas, 19.2% in the Western Pacific, 23.2% in Europe, 25.1% in Southeast Asia, 26.3% in the Eastern Mediterranean, and 27.2% in Africa. EH is a multifactorial disease influenced by both genetic and environmental factors. While genetic factors contribute 30-60% to blood pressure variation, the genetic complexity of EH remains largely unexplained due to limited knowledge of candidate genes and population-specific differences. Various methods, including candidate gene studies, genome-wide linkage analysis (GWLA), and genome-wide association studies (GWAS), have been employed to identify genetic factors, yet much of the heritability of EH is still unknown. This study aimed to investigate the genetic basis of EH by mapping regions of interest (ROIs) and identifying candidate genes and variants influencing EH in African-derived individuals from partially isolated populations of quilombo remnants in Vale do Ribeira, São Paulo, Brazil. Samples from 431 individuals (167 affected, 261 unaffected, 3 with unknown phenotype) from eight quilombo remnant populations were genotyped using a 650k SNP array. The global ancestry proportions were estimated at 47% African, 36% European, and 16% Native American. Genealogical information from 673 individuals was used to construct six pedigrees comprising 1104 individuals. The mapping strategy consisted of a multi-level computational approach. We constructed pedigrees based on interviews and kinship coefficients, pruned the dataset to obtain three non-overlapping marker subpanels, phased the haplotypes, and performed local ancestry inference to account for admixture.
We performed GWLA and dense linkage analyses using the marker subpanels and performed fine-mapping using family-based association studies (FBAS) based on population- and pedigree-imputed data, investigating EH-related genes and variants. The linkage analysis identified 22 ROIs with LOD scores of 1.45 to 3.03, containing markers co-segregating with the phenotype. These ROIs encompassed 2363 genes. Fine-mapping identified 60 EH-related candidate genes and 118 suggestive or significant variants (FBAS). Among these, 14 genes, including PHGDH, S100A10, MFN2, and RYR2, were highlighted with strong evidence of association with hypertension. These genes, harboring 29 SNPs, were implicated in regulating blood pressure, sodium and potassium levels, and the aldosterone pathway. Through a complementary approach that combines admixture-adjusted genome-wide linkage analysis based on Markov chain Monte Carlo (MCMC) methods, association studies on imputed data, and in silico investigations, this study revealed genetic regions, variants, and candidate genes that shed light on the genetic basis of essential hypertension, with significant potential to explain the genetic etiology in quilombo remnant populations.
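
For readers unfamiliar with the LOD scale reported above: in its classical two-point form, the LOD score is the base-10 logarithm of a likelihood ratio comparing linkage at recombination fraction θ against free recombination (θ = 1/2). This is the textbook definition, given here as background; it is an assumption that it carries over directly to the MCMC-based statistic used in the study.

```latex
\mathrm{LOD}(\theta) = \log_{10} \frac{L(\theta)}{L\left(\theta = \tfrac{1}{2}\right)}
```

Under this definition, a LOD score of 3.03 corresponds to a likelihood ratio of roughly 10^3.03 ≈ 1070 in favor of linkage, and a LOD score of 1.45 to a ratio of roughly 28.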

Qiao Y, Jewett EM, McManus KF, Freyman WA, Curran JE, Williams-Blangero S, Blangero J, Williams AL. Reconstructing parent genomes using siblings and other relatives. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.05.10.593578. [PMID: 38798596 PMCID: PMC11118276 DOI: 10.1101/2024.05.10.593578] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 05/29/2024]

Genotyping, the Usefulness of Imputation to Increase SNP Density, and Imputation Methods and Tools. METHODS IN MOLECULAR BIOLOGY (CLIFTON, N.J.) 2022;2467:113-138. [PMID: 35451774 DOI: 10.1007/978-1-0716-2205-6_4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Indexed: 12/19/2022]

Mdyogolo S, MacNeil MD, Neser FWC, Scholtz MM, Makgahlela ML. Assessing accuracy of genotype imputation in the Afrikaner and Brahman cattle breeds of South Africa. Trop Anim Health Prod 2022;54:90. [PMID: 35133512 DOI: 10.1007/s11250-022-03102-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 07/13/2021] [Accepted: 02/01/2022] [Indexed: 11/26/2022]

Lee D, Kim Y, Chung Y, Lee D, Seo D, Choi TJ, Lim D, Yoon D, Lee SH. Accuracy of genotype imputation based on reference population size and marker density in Hanwoo cattle. JOURNAL OF ANIMAL SCIENCE AND TECHNOLOGY 2021;63:1232-1246. [PMID: 34957440 PMCID: PMC8672260 DOI: 10.5187/jast.2021.e117] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/30/2021] [Revised: 10/13/2021] [Accepted: 10/14/2021] [Indexed: 11/20/2022]

Whole-genome sequencing identifies functional noncoding variation in SEMA3C that cosegregates with dyslexia in a multigenerational family. Hum Genet 2021;140:1183-1200. [PMID: 34076780 PMCID: PMC8263547 DOI: 10.1007/s00439-021-02289-w] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 11/16/2020] [Accepted: 04/27/2021] [Indexed: 12/11/2022]

Naj AC. Genotype Imputation in Genome-Wide Association Studies. Curr Protoc Hum Genet 2020;102:e84. [PMID: 31216114 DOI: 10.1002/cphg.84] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Indexed: 12/19/2022]

Sul JH, Service SK, Huang AY, Ramensky V, Hwang SG, Teshiba TM, Park Y, Ori APS, Zhang Z, Mullins N, Olde Loohuis LM, Fears SC, Araya C, Araya X, Spesny M, Bejarano J, Ramirez M, Castrillón G, Gomez-Makhinson J, Lopez MC, Montoya G, Montoya CP, Aldana I, Escobar JI, Ospina-Duque J, Kremeyer B, Bedoya G, Ruiz-Linares A, Cantor RM, Molina J, Coppola G, Ophoff RA, Macaya G, Lopez-Jaramillo C, Reus V, Bearden CE, Sabatti C, Freimer NB. Contribution of common and rare variants to bipolar disorder susceptibility in extended pedigrees from population isolates. Transl Psychiatry 2020;10:74. [PMID: 32094344 PMCID: PMC7039961 DOI: 10.1038/s41398-020-0758-1] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Received: 09/05/2019] [Revised: 09/24/2019] [Accepted: 11/04/2019] [Indexed: 12/13/2022] Open

Affiliation(s)

Jae Hoon Sul Department of Psychiatry and Biobehavioral Sciences, University of California, Los Angeles, Los Angeles, CA 90095 USA
Susan K. Service Department of Psychiatry and Biobehavioral Sciences, University of California, Los Angeles, Los Angeles, CA 90095 USA; Center for Neurobehavioral Genetics, Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, CA USA
Alden Y. Huang Department of Psychiatry and Biobehavioral Sciences, University of California, Los Angeles, Los Angeles, CA 90095 USA; Bioinformatics Interdepartmental Program, University of California, Los Angeles, Los Angeles, CA 90095 USA
Vasily Ramensky Department of Psychiatry and Biobehavioral Sciences, University of California, Los Angeles, Los Angeles, CA 90095 USA; Center for Neurobehavioral Genetics, Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, CA USA; Federal State Institution “National Medical Research Center for Preventive Medicine” of the Ministry of Healthcare of the Russian Federation, Petroverigskiy lane 10, Moscow, 101990 Russia
Sun-Goo Hwang Department of Psychiatry and Biobehavioral Sciences, University of California, Los Angeles, Los Angeles, CA 90095 USA
Terri M. Teshiba Department of Psychiatry and Biobehavioral Sciences, University of California, Los Angeles, Los Angeles, CA 90095 USA; Center for Neurobehavioral Genetics, Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, CA USA
YoungJun Park Department of Computer Science, University of California, Los Angeles, Los Angeles, CA 90095 USA
Anil P. S. Ori Center for Neurobehavioral Genetics, Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, CA USA
Zhongyang Zhang Department of Genetics and Genomic Sciences, Icahn Institute for Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY 10029 USA
Niamh Mullins King’s College London, Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology and Neuroscience, De Crespigny Park, Denmark Hill, London, SE5 8AF UK; Pamela Sklar Division of Psychiatric Genomics, Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029 USA
Loes M. Olde Loohuis Center for Neurobehavioral Genetics, Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, CA USA
Scott C. Fears Department of Psychiatry and Biobehavioral Sciences, University of California, Los Angeles, Los Angeles, CA 90095 USA
Carmen Araya Cell and Molecular Biology Research Center, Universidad de Costa Rica, San Pedro de Montes de Oca, San José, 11501 Costa Rica
Xinia Araya Cell and Molecular Biology Research Center, Universidad de Costa Rica, San Pedro de Montes de Oca, San José, 11501 Costa Rica
Mitzi Spesny Division of Pediatric Pulmonology, Hospital Nacional de Niños, San Jose, Costa Rica
Julio Bejarano Cell and Molecular Biology Research Center, Universidad de Costa Rica, San Pedro de Montes de Oca, San José, 11501 Costa Rica
Margarita Ramirez Cell and Molecular Biology Research Center, Universidad de Costa Rica, San Pedro de Montes de Oca, San José, 11501 Costa Rica
Gabriel Castrillón Instituto de Alta Tecnologia Medica, Medellín, Antioquia, Colombia; Department of Neuroradiology, Klinikum rechts der Isar, TUM, Munich, Germany
Juliana Gomez-Makhinson Grupo de Investigación en Psiquiatría (Research Group in Psychiatry; GIPSI), Departamento de Psiquiatría Facultad de Medicina, Universidad de Antioquia, Medellín, 050011 Colombia
Maria C. Lopez Grupo de Investigación en Psiquiatría (Research Group in Psychiatry; GIPSI), Departamento de Psiquiatría Facultad de Medicina, Universidad de Antioquia, Medellín, 050011 Colombia
Gabriel Montoya Grupo de Investigación en Psiquiatría (Research Group in Psychiatry; GIPSI), Departamento de Psiquiatría Facultad de Medicina, Universidad de Antioquia, Medellín, 050011 Colombia
Claudia P. Montoya Grupo de Investigación en Psiquiatría (Research Group in Psychiatry; GIPSI), Departamento de Psiquiatría Facultad de Medicina, Universidad de Antioquia, Medellín, 050011 Colombia
Ileana Aldana Department of Psychiatry and Biobehavioral Sciences, University of California, Los Angeles, Los Angeles, CA 90095 USA
Javier I. Escobar Department of Psychiatry and Family Medicine, Rutgers-Robert Wood Johnson Medical School, Rutgers University, New Brunswick, NJ 08901 USA
Jorge Ospina-Duque Grupo de Investigación en Psiquiatría (Research Group in Psychiatry; GIPSI), Departamento de Psiquiatría Facultad de Medicina, Universidad de Antioquia, Medellín, 050011 Colombia
Barbara Kremeyer Department of Genetics, Evolution and Environment, University College London, London, WC1E 6BT UK
Gabriel Bedoya Laboratory of Molecular Genetics, Institute of Biology, University of Antioquia, Medellín, 050010 Colombia
Andres Ruiz-Linares Ministry of Education Key Laboratory of Contemporary Anthropology and Collaborative Innovation Center of Genetics and Development, Fudan University, Shanghai, 200438 China; Aix Marseille Univ, CNRS, EFS, ADES, Marseille, France
Rita M. Cantor Department of Psychiatry and Biobehavioral Sciences, University of California, Los Angeles, Los Angeles, CA 90095 USA; Department of Human Genetics, University of California Los Angeles, Los Angeles, CA 90095 USA
Julio Molina BioCiencias Lab, 01010 Guatemala, Guatemala
Giovanni Coppola Department of Psychiatry and Biobehavioral Sciences, University of California, Los Angeles, Los Angeles, CA 90095 USA
Roel A. Ophoff Department of Psychiatry and Biobehavioral Sciences, University of California, Los Angeles, Los Angeles, CA 90095 USA; Center for Neurobehavioral Genetics, Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, CA USA; Department of Human Genetics, University of California Los Angeles, Los Angeles, CA 90095 USA; Department of Psychiatry, Brain Center Rudolf Magnus, University Medical Center Utrecht, Utrecht, Netherlands
Gabriel Macaya Cell and Molecular Biology Research Center, Universidad de Costa Rica, San Pedro de Montes de Oca, San José, 11501 Costa Rica
Carlos Lopez-Jaramillo Grupo de Investigación en Psiquiatría (Research Group in Psychiatry; GIPSI), Departamento de Psiquiatría Facultad de Medicina, Universidad de Antioquia, Medellín, 050011 Colombia; Mood Disorders Program, Hospital San Vicente Fundacion, Medellín, 050011 Colombia
Victor Reus Department of Psychiatry and UCSF Weill Institute for Neurosciences, University of California, San Francisco, CA 94143 USA
Carrie E. Bearden Department of Psychiatry and Biobehavioral Sciences, University of California, Los Angeles, Los Angeles, CA 90095 USA; Center for Neurobehavioral Genetics, Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, CA USA; Department of Psychology, University of California, Los Angeles, Los Angeles, CA 90095 USA
Chiara Sabatti Department of Health Research and Policy, Division of Biostatistics, Stanford University, Stanford, CA 94305 USA
Nelson B. Freimer Department of Psychiatry and Biobehavioral Sciences, University of California, Los Angeles, Los Angeles, CA 90095 USA; Center for Neurobehavioral Genetics, Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, CA USA; Department of Human Genetics, University of California Los Angeles, Los Angeles, CA 90095 USA

Abney M, ElSherbiny A. Kinpute: using identity by descent to improve genotype imputation. Bioinformatics 2019;35:4321-4326. [PMID: 30918937 PMCID: PMC6821425 DOI: 10.1093/bioinformatics/btz221] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Received: 08/23/2018] [Revised: 02/21/2019] [Accepted: 03/26/2019] [Indexed: 11/13/2022] Open

Abstract

MOTIVATION

Genotype imputation, though generally accurate, often results in many genotypes being poorly imputed, particularly in studies where the individuals are not well represented by standard reference panels. When individuals in the study share regions of the genome identical by descent (IBD), it is possible to use this information in combination with a study-specific reference panel (SSRP) to improve the imputation results. Kinpute uses IBD information (due to recent, familial relatedness or distant, unknown ancestors) in conjunction with the output from linkage disequilibrium (LD) based imputation methods to compute more accurate genotype probabilities. Kinpute uses a novel method for IBD imputation, which works even in the absence of a pedigree, and results in substantially improved imputation quality.

RESULTS

Given initial estimates of average IBD between subjects in the study sample, Kinpute uses a novel algorithm to select an optimal set of individuals to sequence and use as an SSRP. Kinpute is designed to use as input both this SSRP and the genotype probabilities output from other LD-based imputation software, and uses a new method to combine the LD imputed genotype probabilities with IBD configurations to substantially improve imputation. We tested Kinpute on a human population isolate where 98 individuals have been sequenced. In half of this sample, whose sequence data was masked, we used Impute2 to perform LD-based imputation and Kinpute was used to obtain higher accuracy genotype probabilities. Measures of imputation accuracy improved significantly, particularly for those genotypes that Impute2 imputed with low certainty.

AVAILABILITY AND IMPLEMENTATION

Kinpute is an open-source and freely available C++ software package that can be downloaded from.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

Naj AC, Lin H, Vardarajan BN, White S, Lancour D, Ma Y, Schmidt M, Sun F, Butkiewicz M, Bush WS, Kunkle BW, Malamon J, Amin N, Choi SH, Hamilton-Nelson KL, van der Lee SJ, Gupta N, Koboldt DC, Saad M, Wang B, Nato AQ, Sohi HK, Kuzma A, Wang LS, Cupples LA, van Duijn C, Seshadri S, Schellenberg GD, Boerwinkle E, Bis JC, Dupuis J, Salerno WJ, Wijsman EM, Martin ER, DeStefano AL. Quality control and integration of genotypes from two calling pipelines for whole genome sequence data in the Alzheimer's disease sequencing project. Genomics 2019;111:808-818. [PMID: 29857119 PMCID: PMC6397097 DOI: 10.1016/j.ygeno.2018.05.004] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Received: 11/29/2017] [Revised: 04/03/2018] [Accepted: 05/06/2018] [Indexed: 12/30/2022]

Affiliation(s)

Adam C Naj Department of Biostatistics, Epidemiology, and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA; Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA.
Honghuang Lin Department of Medicine, Boston University School of Medicine, Boston, MA, USA
Badri N Vardarajan Department of Neurology, Columbia University Medical Center, New York, NY, USA
Simon White Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
Daniel Lancour Department of Biomedical Genetics, Boston University School of Medicine, Boston, MA, USA
Yiyi Ma Department of Biomedical Genetics, Boston University School of Medicine, Boston, MA, USA
Michael Schmidt John P. Hussman Institute for Human Genetics, University of Miami Miller School of Medicine, Miami, FL, USA
Fangui Sun Department of Biostatistics, Boston University School of Public Health, Boston, MA, USA
Mariusz Butkiewicz Department of Epidemiology and Biostatistics, Case Western Reserve University, Cleveland, OH, USA
William S Bush Department of Epidemiology and Biostatistics, Case Western Reserve University, Cleveland, OH, USA
Brian W Kunkle John P. Hussman Institute for Human Genetics, University of Miami Miller School of Medicine, Miami, FL, USA
John Malamon Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
Najaf Amin Department of Epidemiology, Erasmus Medical Center, Rotterdam, the Netherlands
Seung Hoan Choi Department of Biostatistics, Boston University School of Public Health, Boston, MA, USA
Kara L Hamilton-Nelson John P. Hussman Institute for Human Genetics, University of Miami Miller School of Medicine, Miami, FL, USA
Sven J van der Lee Department of Epidemiology, Erasmus Medical Center, Rotterdam, the Netherlands
Namrata Gupta Medical and Population Genetics Program, Broad Institute, Cambridge, MA, USA
Daniel C Koboldt Institute for Genomic Medicine, Nationwide Children's Hospital, Columbus, OH, USA
Mohamad Saad Department of Biostatistics, University of Washington, Seattle, WA, USA; Division of Medical Genetics, University of Washington, Seattle, WA, USA
Bowen Wang Department of Statistics, University of Washington, Seattle, WA, USA
Alejandro Q Nato Division of Medical Genetics, University of Washington, Seattle, WA, USA
Harkirat K Sohi Division of Medical Genetics, University of Washington, Seattle, WA, USA
Amanda Kuzma Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
Li-San Wang Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
L Adrienne Cupples Department of Biostatistics, Boston University School of Public Health, Boston, MA, USA; The Framingham Heart Study, Framingham, MA, USA
Cornelia van Duijn Department of Epidemiology, Erasmus Medical Center, Rotterdam, the Netherlands
Sudha Seshadri The Framingham Heart Study, Framingham, MA, USA; Department of Neurology, Boston University School of Medicine, Boston, MA, USA
Gerard D Schellenberg Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
Eric Boerwinkle Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA; Human Genetics Center, University of Texas Health Science Center, Houston, TX, USA
Joshua C Bis Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle, WA, USA
Josée Dupuis Department of Biostatistics, Boston University School of Public Health, Boston, MA, USA; The Framingham Heart Study, Framingham, MA, USA
William J Salerno Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
Ellen M Wijsman Department of Biostatistics, University of Washington, Seattle, WA, USA; Division of Medical Genetics, University of Washington, Seattle, WA, USA
Eden R Martin John P. Hussman Institute for Human Genetics, University of Miami Miller School of Medicine, Miami, FL, USA
Anita L DeStefano Department of Biostatistics, Boston University School of Public Health, Boston, MA, USA; The Framingham Heart Study, Framingham, MA, USA; Department of Neurology, Boston University School of Medicine, Boston, MA, USA

Kunji K, Ullah E, Nato AQ, Wijsman EM, Saad M. GIGI-Quick: a fast approach to impute missing genotypes in genome-wide association family data. Bioinformatics 2019;34:1591-1593. [PMID: 29267877 DOI: 10.1093/bioinformatics/btx782] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Received: 07/11/2017] [Accepted: 12/15/2017] [Indexed: 11/12/2022] Open

Rediscovering the value of families for psychiatric genetics research. Mol Psychiatry 2019;24:523-535. [PMID: 29955165 PMCID: PMC7028329 DOI: 10.1038/s41380-018-0073-x] [Citation(s) in RCA: 38] [Impact Index Per Article: 7.6] [Received: 11/21/2017] [Revised: 01/11/2018] [Accepted: 03/26/2018] [Indexed: 01/09/2023]

Revisit Population-based and Family-based Genotype Imputation. Sci Rep 2019;9:1800. [PMID: 30755687 PMCID: PMC6372660 DOI: 10.1038/s41598-018-38469-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Received: 08/29/2018] [Accepted: 12/27/2018] [Indexed: 11/12/2022] Open

Whalen A, Ros-Freixedes R, Wilson DL, Gorjanc G, Hickey JM. Hybrid peeling for fast and accurate calling, phasing, and imputation with sequence data of any coverage in pedigrees. Genet Sel Evol 2018;50:67. [PMID: 30563452 PMCID: PMC6299538 DOI: 10.1186/s12711-018-0438-2] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Received: 04/03/2018] [Accepted: 12/11/2018] [Indexed: 12/31/2022] Open

Nelson D, Moreau C, de Vriendt M, Zeng Y, Preuss C, Vézina H, Milot E, Andelfinger G, Labuda D, Gravel S. Inferring Transmission Histories of Rare Alleles in Population-Scale Genealogies. Am J Hum Genet 2018;103:893-906. [PMID: 30526866 DOI: 10.1016/j.ajhg.2018.10.017] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Received: 06/08/2018] [Accepted: 10/22/2018] [Indexed: 01/06/2023] Open

Ullah E, Mall R, Abbas MM, Kunji K, Nato AQ, Bensmail H, Wijsman EM, Saad M. Comparison and assessment of family- and population-based genotype imputation methods in large pedigrees. Genome Res 2018;29:125-134. [PMID: 30514702 PMCID: PMC6314157 DOI: 10.1101/gr.236315.118] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Received: 02/26/2018] [Accepted: 11/30/2018] [Indexed: 01/19/2023]

Abstract

Genotype imputation is widely used in genome-wide association studies to boost variant density, allowing increased power in association testing. Many studies currently include pedigree data due to increasing interest in rare variants coupled with the availability of appropriate analysis tools. The performance of population-based (subjects are unrelated) imputation methods is well established. However, the performance of family- and population-based imputation methods on family data has been subject to much less scrutiny. Here, we extensively compare several family- and population-based imputation methods on family data of large pedigrees with both European and African ancestry. Our comparison includes many widely used family- and population-based tools and another method, Ped_Pop, which combines family- and population-based imputation results. We also compare four subject selection strategies for full sequencing to serve as the reference panel for imputation: GIGI-Pick, ExomePicks, PRIMUS, and random selection. Moreover, we compare two imputation accuracy metrics: the Imputation Quality Score and Pearson's correlation R² for predicting power of association analysis using imputation results. Our results show that (1) GIGI outperforms Merlin; (2) family-based imputation outperforms population-based imputation for rare variants but not for common ones; (3) combining family- and population-based imputation outperforms all imputation approaches for all minor allele frequencies; (4) GIGI-Pick gives the best selection strategy based on the R² criterion; and (5) R² is the best measure of imputation accuracy. Our study is the first to extensively evaluate the imputation performance of many available family- and population-based tools on the same family data and provides guidelines for future studies.
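
As an illustration of the Pearson R² accuracy metric mentioned in this abstract, the following is a minimal sketch that computes the squared correlation between true genotypes (coded 0, 1, 2) and imputed allele dosages at one variant. The function names, the 0/1/2 coding, and the dosage formula are assumptions made for illustration; the abstract does not describe the authors' implementation.

```python
import math


def expected_dosage(genotype_probs):
    """Expected allele count from (P(0 copies), P(1 copy), P(2 copies))."""
    p0, p1, p2 = genotype_probs
    if not math.isclose(p0 + p1 + p2, 1.0, abs_tol=1e-6):
        raise ValueError("genotype probabilities must sum to 1")
    return p1 + 2.0 * p2


def squared_pearson_r(true_genotypes, imputed_dosages):
    """Squared Pearson correlation between true genotypes and imputed dosages.

    Returns NaN when either vector has zero variance (e.g. a monomorphic
    variant), because the correlation is undefined in that case.
    """
    n = len(true_genotypes)
    if n != len(imputed_dosages) or n < 2:
        raise ValueError("need two equal-length vectors with at least 2 values")
    mean_t = sum(true_genotypes) / n
    mean_d = sum(imputed_dosages) / n
    cov = sum((t - mean_t) * (d - mean_d)
              for t, d in zip(true_genotypes, imputed_dosages))
    var_t = sum((t - mean_t) ** 2 for t in true_genotypes)
    var_d = sum((d - mean_d) ** 2 for d in imputed_dosages)
    if var_t == 0 or var_d == 0:
        return float("nan")
    return (cov * cov) / (var_t * var_d)


if __name__ == "__main__":
    # Hypothetical data: masked true genotypes and imputed probabilities.
    truth = [0, 1, 2, 1, 0]
    probs = [(0.9, 0.1, 0.0), (0.2, 0.7, 0.1), (0.0, 0.2, 0.8),
             (0.3, 0.6, 0.1), (0.8, 0.2, 0.0)]
    dosages = [expected_dosage(p) for p in probs]
    print(round(squared_pearson_r(truth, dosages), 3))
```

Because the correlation is scale-invariant, this metric rewards dosages that track the true genotypes even when they are shrunk toward the mean, which is one reason it is commonly related to association power.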

Nafikov RA, Nato AQ, Sohi H, Wang B, Brown L, Horimoto AR, Vardarajan BN, Barral SM, Tosto G, Mayeux RP, Thornton TA, Blue E, Wijsman EM. Analysis of pedigree data in populations with multiple ancestries: Strategies for dealing with admixture in Caribbean Hispanic families from the ADSP. Genet Epidemiol 2018;42:500-515. [PMID: 29862559 PMCID: PMC6160322 DOI: 10.1002/gepi.22133] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Received: 12/29/2017] [Revised: 05/04/2018] [Accepted: 05/14/2018] [Indexed: 11/12/2022]

Zheng C, Boer MP, van Eeuwijk FA. Accurate Genotype Imputation in Multiparental Populations from Low-Coverage Sequence. Genetics 2018;210:71-82. [PMID: 30045858 PMCID: PMC6116951 DOI: 10.1534/genetics.118.300885] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Received: 03/06/2018] [Accepted: 07/21/2018] [Indexed: 11/18/2022] Open

Miranda AM, Herman M, Cheng R, Nahmani E, Barrett G, Micevska E, Fontaine G, Potier MC, Head E, Schmitt FA, Lott IT, Jiménez-Velázquez IZ, Antonarakis SE, Di Paolo G, Lee JH, Hussaini SA, Marquer C. Excess Synaptojanin 1 Contributes to Place Cell Dysfunction and Memory Deficits in the Aging Hippocampus in Three Types of Alzheimer's Disease. Cell Rep 2018;23:2967-2975. [PMID: 29874583 PMCID: PMC6040810 DOI: 10.1016/j.celrep.2018.05.011] [Citation(s) in RCA: 36] [Impact Index Per Article: 6.0] [Received: 12/18/2017] [Revised: 03/01/2018] [Accepted: 05/02/2018] [Indexed: 12/11/2022] Open

Affiliation(s)

Andre M Miranda Taub Institute for Research on Alzheimer's Disease and the Aging Brain, Columbia University Medical Center, New York, NY 10032, USA; Department of Pathology and Cell Biology, Columbia University Medical Center, New York, NY 10032, USA; Life and Health Sciences Research Institute (ICVS), School of Medicine, University of Minho, 4710-057 Braga, Portugal; ICVS/3B's, PT Government Associate Laboratory, 4806-909 Braga/Guimarães, Portugal
Mathieu Herman Taub Institute for Research on Alzheimer's Disease and the Aging Brain, Columbia University Medical Center, New York, NY 10032, USA; Department of Pathology and Cell Biology, Columbia University Medical Center, New York, NY 10032, USA
Rong Cheng Taub Institute for Research on Alzheimer's Disease and the Aging Brain, Columbia University Medical Center, New York, NY 10032, USA; G. H. Sergievsky Center, Columbia University Medical Center, New York, NY 10032, USA
Eden Nahmani Taub Institute for Research on Alzheimer's Disease and the Aging Brain, Columbia University Medical Center, New York, NY 10032, USA; Department of Pathology and Cell Biology, Columbia University Medical Center, New York, NY 10032, USA
Geoffrey Barrett Taub Institute for Research on Alzheimer's Disease and the Aging Brain, Columbia University Medical Center, New York, NY 10032, USA; Department of Pathology and Cell Biology, Columbia University Medical Center, New York, NY 10032, USA
Elizabeta Micevska Taub Institute for Research on Alzheimer's Disease and the Aging Brain, Columbia University Medical Center, New York, NY 10032, USA; Department of Pathology and Cell Biology, Columbia University Medical Center, New York, NY 10032, USA
Gaelle Fontaine Sorbonne Universités, UPMC Univ Paris 06, Inserm U1127, CNRS UMR7225, ICM, 75013 Paris, France
Marie-Claude Potier Sorbonne Universités, UPMC Univ Paris 06, Inserm U1127, CNRS UMR7225, ICM, 75013 Paris, France
Elizabeth Head Sanders-Brown Center on Aging, University of Kentucky, Lexington, KY 40536-0230, USA; Department of Pharmacology & Nutritional Sciences, University of Kentucky, Lexington, KY 40506, USA
Frederick A Schmitt Sanders-Brown Center on Aging, University of Kentucky, Lexington, KY 40536-0230, USA; Department of Neurology, University of Kentucky, Lexington, KY 40506, USA
Ira T Lott Department of Physiology, University of Kentucky, Lexington, KY 40506, USA; Department of Pediatrics and Neurology, School of Medicine, University of California, Irvine (UCI), Orange, CA 92668, USA
Ivonne Z Jiménez-Velázquez Department of Internal Medicine, University of Puerto Rico School of Medicine, San Juan, PR, USA
Stylianos E Antonarakis Department of Genetic Medicine and Development, University of Geneva Medical School and University Hospitals of Geneva, 1211 Geneva, Switzerland
Gilbert Di Paolo Taub Institute for Research on Alzheimer's Disease and the Aging Brain, Columbia University Medical Center, New York, NY 10032, USA; Department of Pathology and Cell Biology, Columbia University Medical Center, New York, NY 10032, USA
Joseph H Lee Taub Institute for Research on Alzheimer's Disease and the Aging Brain, Columbia University Medical Center, New York, NY 10032, USA; G. H. Sergievsky Center, Columbia University Medical Center, New York, NY 10032, USA; Departments of Epidemiology and Neurology, Columbia University Medical Center, New York, NY 10032, USA
S Abid Hussaini Taub Institute for Research on Alzheimer's Disease and the Aging Brain, Columbia University Medical Center, New York, NY 10032, USA; Department of Pathology and Cell Biology, Columbia University Medical Center, New York, NY 10032, USA
Catherine Marquer Taub Institute for Research on Alzheimer's Disease and the Aging Brain, Columbia University Medical Center, New York, NY 10032, USA; Department of Pathology and Cell Biology, Columbia University Medical Center, New York, NY 10032, USA.

Torkamaneh D, Boyle B, Belzile F. Efficient genome-wide genotyping strategies and data integration in crop plants. Theor Appl Genet 2018;131:499-511. [PMID: 29352324 DOI: 10.1007/s00122-018-3056-z] [Citation(s) in RCA: 36] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/04/2017] [Accepted: 01/12/2018] [Indexed: 05/21/2023]

Herzig AF, Nutile T, Babron MC, Ciullo M, Bellenguez C, Leutenegger AL. Strategies for phasing and imputation in a population isolate. Genet Epidemiol 2018;42:201-213. [PMID: 29319195 DOI: 10.1002/gepi.22109] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2017] [Revised: 11/16/2017] [Accepted: 11/16/2017] [Indexed: 11/05/2022]

Tissier R, Tsonaka R, Mooijaart SP, Slagboom E, Houwing-Duistermaat JJ. Secondary phenotype analysis in ascertained family designs: application to the Leiden longevity study. Stat Med 2017;36:2288-2301. [PMID: 28303589 PMCID: PMC5485037 DOI: 10.1002/sim.7281] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2016] [Revised: 02/17/2017] [Accepted: 02/20/2017] [Indexed: 01/14/2023]

Genotype Imputation Methods and Their Effects on Genomic Predictions in Cattle. ACTA ACUST UNITED AC 2017. [DOI: 10.1007/s40362-017-0041-x] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]

Woodbury-Smith M, Bilder DA, Morgan J, Jerominski L, Darlington T, Dyer T, Paterson AD, Coon H. Combined genome-wide linkage and targeted association analysis of head circumference in autism spectrum disorder families. J Neurodev Disord 2017;9:5. [PMID: 28289475 PMCID: PMC5304400 DOI: 10.1186/s11689-017-9187-8] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/26/2016] [Accepted: 01/20/2017] [Indexed: 11/24/2022] Open

Abstract

Background

It has long been recognized that there is an association between enlarged head circumference (HC) and autism spectrum disorder (ASD), but the genetics of HC in ASD is not well understood. In order to investigate the genetic underpinning of HC in ASD, we undertook a genome-wide linkage study of HC followed by linkage signal targeted association among a sample of 67 extended pedigrees with ASD.

Methods

HC measurements on members of 67 multiplex ASD extended pedigrees were used as a quantitative trait in a genome-wide linkage analysis. The Illumina 6K SNP linkage panel was used, and analyses were carried out using the SOLAR implemented variance components model. Loci identified in this way formed the target for subsequent association analysis using the Illumina OmniExpress chip and imputed genotypes. A modification of the qTDT was used as implemented in SOLAR.

Results

We identified a linkage signal spanning 6p21.31 to 6p22.2 (maximum LOD = 3.4). Although targeted association did not find evidence of association with any SNP overall, in one family with the strongest evidence of linkage, there was evidence for association (rs17586672, p = 1.72E−07).

Conclusions

Although this region does not overlap with ASD linkage signals in these same samples, it has been associated with other psychiatric risk, including ADHD, developmental dyslexia, schizophrenia, specific language impairment, and juvenile bipolar disorder. The genome-wide significant linkage signal represents the first reported observation of a potential quantitative trait locus for HC in ASD and may be relevant in the context of complex multivariate risk likely leading to ASD.

Electronic supplementary material

The online version of this article (doi:10.1186/s11689-017-9187-8) contains supplementary material, which is available to authorized users.

Saad M, Nato AQ, Grimson FL, Lewis SM, Brown LA, Blue EM, Thornton TA, Thompson EA, Wijsman EM. Identity-by-descent estimation with population- and pedigree-based imputation in admixed family data. BMC Proc 2016;10:295-301. [PMID: 27980652 PMCID: PMC5133511 DOI: 10.1186/s12919-016-0046-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/29/2023] Open

Increasing Generality and Power of Rare-Variant Tests by Utilizing Extended Pedigrees. Am J Hum Genet 2016;99:846-859. [PMID: 27666371 DOI: 10.1016/j.ajhg.2016.08.015] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2016] [Accepted: 08/17/2016] [Indexed: 11/24/2022] Open

Ristov S, Brajkovic V, Cubric-Curik V, Michieli I, Curik I. MaGelLAn 1.0: a software to facilitate quantitative and population genetic analysis of maternal inheritance by combination of molecular and pedigree information. Genet Sel Evol 2016;48:65. [PMID: 27613390 PMCID: PMC5018160 DOI: 10.1186/s12711-016-0242-9] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2016] [Accepted: 08/29/2016] [Indexed: 11/23/2022] Open

Abstract

Background

Identification of genes or even nucleotides that are responsible for quantitative and adaptive trait variation is a difficult task due to the complex interdependence between a large number of genetic and environmental factors. The polymorphism of the mitogenome is one of the factors that can contribute to quantitative trait variation. However, the effects of the mitogenome have not been comprehensively studied, since large numbers of mitogenome sequences and recorded phenotypes are required to reach the adequate power of analysis. Current research in our group focuses on acquiring the necessary mitochondria sequence information and analysing its influence on the phenotype of a quantitative trait. To facilitate these tasks we have produced software for processing pedigrees that is optimised for maternal lineage analysis.

Results

We present MaGelLAn 1.0 (maternal genealogy lineage analyser), a suite of four Python scripts (modules) that is designed to facilitate the analysis of the impact of mitogenome polymorphism on quantitative trait variation by combining molecular and pedigree information. MaGelLAn 1.0 is primarily used to: (1) optimise the sampling strategy for molecular analyses; (2) identify and correct pedigree inconsistencies; and (3) identify maternal lineages and assign the corresponding mitogenome sequences to all individuals in the pedigree, this information being used as input to any of the standard software for quantitative genetic (association) analysis. In addition, MaGelLAn 1.0 allows computing the mitogenome (maternal) effective population sizes and probability of mitogenome (maternal) identity that are useful for conservation management of small populations.

Conclusions

MaGelLAn is the first tool for pedigree analysis that focuses on quantitative genetic analyses of mitogenome data. It is conceived with the purpose to significantly reduce the effort in handling and preparing large pedigrees for processing the information linked to maternal lines. The software source code, along with the manual and the example files can be downloaded at http://lissp.irb.hr/software/magellan-1-0/ and https://github.com/sristov/magellan.
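The maternal-lineage step described in this abstract can be sketched as follows. This is a minimal illustration of the general idea (follow dam links back to a founder female and group individuals by that founder), not MaGelLAn's actual code or interface; the `dam_of` mapping and both function names are hypothetical.

```python
def maternal_founder(individual, dam_of):
    """Follow dam (mother) links until reaching an individual with no recorded dam.

    dam_of maps each individual ID to its dam's ID, or to None if unknown.
    Individuals that appear only as dams are treated as founders.
    """
    seen = set()
    current = individual
    while dam_of.get(current) is not None:
        if current in seen:
            # An individual cannot be its own maternal ancestor.
            raise ValueError(f"pedigree loop detected at {current!r}")
        seen.add(current)
        current = dam_of[current]
    return current


def maternal_lineages(dam_of):
    """Group all individuals by the founder of their maternal line."""
    individuals = set(dam_of) | {d for d in dam_of.values() if d is not None}
    lineages = {}
    for ind in individuals:
        lineages.setdefault(maternal_founder(ind, dam_of), set()).add(ind)
    return lineages
```

Under this grouping, one mitogenome sequence obtained from any member of a lineage could in principle be assigned to every other member of it, which is the kind of sampling economy the abstract describes; the loop check stands in for a very simple form of pedigree-inconsistency detection.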

Electronic supplementary material

The online version of this article (doi:10.1186/s12711-016-0242-9) contains supplementary material, which is available to authorized users.

Bimber BN, Raboin MJ, Letaw J, Nevonen KA, Spindel JE, McCouch SR, Cervera-Juanes R, Spindel E, Carbone L, Ferguson B, Vinson A. Whole-genome characterization in pedigreed non-human primates using genotyping-by-sequencing (GBS) and imputation. BMC Genomics 2016;17:676. [PMID: 27558348 PMCID: PMC4997765 DOI: 10.1186/s12864-016-2966-x] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/01/2016] [Accepted: 07/22/2016] [Indexed: 01/29/2023] Open

Abstract

BACKGROUND

Rhesus macaques are widely used in biomedical research, but the application of genomic information in this species to better understand human disease is still in its infancy. Whole-genome sequence (WGS) data in large pedigreed macaque colonies could provide substantial experimental power for genetic discovery, but the collection of WGS data in large cohorts remains a formidable expense. Here, we describe a cost-effective approach that selects the most informative macaques in a pedigree for 30X WGS, followed by low-cost genotyping-by-sequencing (GBS) at 30X on the remaining macaques in order to generate sparse genotype data at high accuracy. Dense variants from the selected macaques with WGS data are then imputed into macaques having only sparse GBS data, resulting in dense genome-wide genotypes throughout the pedigree.

RESULTS

We developed GBS for the macaque genome using a digestion with PstI, followed by sequencing of size-selected fragments at 30X coverage. From GBS sequence data collected on all individuals in a 16-member pedigree, we characterized high-confidence genotypes at 22,455 single nucleotide variant (SNV) sites that were suitable for guiding imputation of dense sequence data from WGS. To characterize dense markers for imputation, we performed WGS at 30X coverage on nine of the 16 individuals, yielding 10,193,425 high-confidence SNVs. To validate the use of GBS data for facilitating imputation, we initially focused on chromosome 19 as a test case, using an optimized panel of 833 sparse, evenly-spaced markers from GBS and 5,010 dense markers from WGS. Using the method of "Genotype Imputation Given Inheritance" (GIGI), we evaluated the effects on imputation accuracy of 3 different strategies for selecting individuals for WGS, including 1) using "GIGI-Pick" to select the most informative individuals, 2) using the most recent generation, or 3) using founders only. We also evaluated the effects on imputation accuracy of using a range of 1 to 9 WGS individuals for imputation. We found that the GIGI-Pick algorithm for selection of WGS individuals outperformed common heuristic approaches, and that genotype numbers and accuracy improved very little when using >5 WGS individuals for imputation. Informed by our findings, we used 4 macaques with WGS data to impute variants at up to 7,655,491 sites spanning all 20 autosomes in the 12 remaining macaques, based on their GBS genotypes at only 17,158 loci. Using a strict confidence threshold, we imputed an average of 3,680,238 variants per individual at >99% accuracy, or an average of 4,458,883 variants per individual at a more relaxed threshold, yielding >97% accuracy.

CONCLUSIONS

We conclude that an optimal tradeoff between genotype accuracy, number of imputed genotypes, and overall cost exists at the ratio of one individual selected for WGS using the GIGI-Pick algorithm, per 3-5 relatives selected for GBS. This approach makes feasible the collection of accurate, dense genome-wide sequence data in large pedigreed macaque cohorts without the need for more expensive WGS data on all individuals.
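The tradeoff reported in this abstract between a strict and a relaxed confidence threshold (fewer genotypes called at higher accuracy, or more called at lower accuracy) can be illustrated with a small sketch. It assumes each imputed genotype arrives as three posterior probabilities and that a call is made only when the largest one reaches the threshold; the abstract does not state the exact calling rule used, so this rule and the function names are assumptions for illustration only.

```python
import math


def call_genotypes(probabilities, threshold):
    """Call the most probable genotype (0, 1, or 2) if it meets the threshold.

    probabilities: list of (p_hom_ref, p_het, p_hom_alt) tuples.
    Returns a list with a genotype index per entry, or None for a no-call.
    """
    calls = []
    for probs in probabilities:
        best = max(range(3), key=lambda g: probs[g])
        calls.append(best if probs[best] >= threshold else None)
    return calls


def call_rate_and_concordance(calls, truth):
    """Fraction of genotypes called, and fraction of called genotypes that match truth."""
    called = [(c, t) for c, t in zip(calls, truth) if c is not None]
    call_rate = len(called) / len(calls) if calls else 0.0
    if not called:
        return call_rate, math.nan
    concordance = sum(c == t for c, t in called) / len(called)
    return call_rate, concordance
```

Sweeping the threshold over held-out genotypes with known truth is one way to choose an operating point along this call-rate versus accuracy curve.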

Chung RH, Tsai WY, Kang CY, Yao PJ, Tsai HJ, Chen CH. FamPipe: An Automatic Analysis Pipeline for Analyzing Sequencing Data in Families for Disease Studies. PLoS Comput Biol 2016;12:e1004980. [PMID: 27272119 PMCID: PMC4894624 DOI: 10.1371/journal.pcbi.1004980] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2015] [Accepted: 05/12/2016] [Indexed: 11/18/2022] Open

Wijsman EM. Family-based approaches: design, imputation, analysis, and beyond. BMC Genet 2016;17 Suppl 2:9. [PMID: 26866700 PMCID: PMC4895701 DOI: 10.1186/s12863-015-0318-5] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

Nato AQ, Chapman NH, Sohi HK, Nguyen HD, Brkanac Z, Wijsman EM. PBAP: a pipeline for file processing and quality control of pedigree data with dense genetic markers. Bioinformatics 2015;31:3790-8. [PMID: 26231429 PMCID: PMC4668752 DOI: 10.1093/bioinformatics/btv444] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2015] [Revised: 07/07/2015] [Accepted: 07/25/2015] [Indexed: 11/13/2022] Open

Chapman NH, Nato AQ, Bernier R, Ankenman K, Sohi H, Munson J, Patowary A, Archer M, Blue EM, Webb SJ, Coon H, Raskind WH, Brkanac Z, Wijsman EM. Whole exome sequencing in extended families with autism spectrum disorder implicates four candidate genes. Hum Genet 2015;134:1055-68. [PMID: 26204995 PMCID: PMC4578871 DOI: 10.1007/s00439-015-1585-y] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2015] [Accepted: 07/11/2015] [Indexed: 12/26/2022]

Affiliation(s)

Nicola H Chapman Division of Medical Genetics, School of Medicine, University of Washington, Seattle, WA, USA
Alejandro Q Nato Division of Medical Genetics, School of Medicine, University of Washington, Seattle, WA, USA
Raphael Bernier Department of Psychiatry and Behavioral Sciences, University of Washington, Seattle, WA, USA
Katy Ankenman Department of Psychiatry, University of California, San Francisco, CA, USA
Harkirat Sohi Division of Medical Genetics, School of Medicine, University of Washington, Seattle, WA, USA
Jeff Munson Department of Psychiatry and Behavioral Sciences, University of Washington, Seattle, WA, USA; Center on Child Development and Disability, University of Washington, Seattle, WA, USA
Ashok Patowary Department of Psychiatry and Behavioral Sciences, University of Washington, Seattle, WA, USA
Marilyn Archer Department of Psychiatry and Behavioral Sciences, University of Washington, Seattle, WA, USA
Elizabeth M Blue Division of Medical Genetics, School of Medicine, University of Washington, Seattle, WA, USA
Sara Jane Webb Department of Psychiatry and Behavioral Sciences, University of Washington, Seattle, WA, USA; Center on Child Development and Disability, University of Washington, Seattle, WA, USA
Hilary Coon Department of Internal Medicine, University of Utah, Salt Lake City, UT, USA; Department of Psychiatry, School of Medicine, University of Utah, Salt Lake City, UT, USA
Wendy H Raskind Division of Medical Genetics, School of Medicine, University of Washington, Seattle, WA, USA; Department of Psychiatry and Behavioral Sciences, University of Washington, Seattle, WA, USA; Department of Genome Sciences, University of Washington, Seattle, WA, USA
Zoran Brkanac Department of Psychiatry and Behavioral Sciences, University of Washington, Seattle, WA, USA
Ellen M Wijsman Division of Medical Genetics, School of Medicine, University of Washington, Seattle, WA, USA; Department of Biostatistics, University of Washington, Seattle, WA, USA; Department of Genome Sciences, University of Washington, Seattle, WA, USA; University of Washington, University of Washington Tower, T15, 4333 Brooklyn Ave, NE, BOX 359460, Seattle, WA, 98195-9460, USA

Gribble MO, Voruganti VS, Cole SA, Haack K, Balakrishnan P, Laston SL, Tellez-Plaza M, Francesconi KA, Goessler W, Umans JG, Thomas DC, Gilliland F, North KE, Franceschini N, Navas-Acien A. Linkage Analysis of Urine Arsenic Species Patterns in the Strong Heart Family Study. Toxicol Sci 2015. [PMID: 26209557 DOI: 10.1093/toxsci/kfv164] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open

Affiliation(s)

Matthew O Gribble Department of Preventive Medicine, University of Southern California, Los Angeles, California
Venkata Saroja Voruganti Department of Nutrition, University of North Carolina, Chapel Hill, North Carolina; UNC Nutrition Research Institute, University of North Carolina at Chapel Hill, Kannapolis, North Carolina
Shelley A Cole Department of Genetics, Texas Biomedical Research Institute, San Antonio, Texas
Karin Haack Department of Genetics, Texas Biomedical Research Institute, San Antonio, Texas
Poojitha Balakrishnan Department of Environmental Health Sciences, Johns Hopkins University, Baltimore, Maryland; Department of Epidemiology, Johns Hopkins Medical Institutions, Baltimore, Maryland
Sandra L Laston South Texas Diabetes and Obesity Institute, University of Texas Health Science Center, San Antonio-Regional Academic Health Center, Brownsville, Texas
Maria Tellez-Plaza Department of Environmental Health Sciences, Johns Hopkins University, Baltimore, Maryland; Biomedical Research Institute, Hospital Clinic de Valencia-INCLIVA, Valencia, Spain
Kevin A Francesconi Institute of Chemistry-Analytical Chemistry, University of Graz, Graz, Austria
Walter Goessler Institute of Chemistry-Analytical Chemistry, University of Graz, Graz, Austria
Jason G Umans Georgetown-Howard Universities Center for Clinical and Translational Science, Washington, District of Columbia; MedStar Health Research Institute, Hyattsville, Maryland
Duncan C Thomas Department of Preventive Medicine, University of Southern California, Los Angeles, California
Frank Gilliland Department of Preventive Medicine, University of Southern California, Los Angeles, California
Kari E North Department of Epidemiology, University of North Carolina, Chapel Hill, North Carolina
Nora Franceschini Department of Epidemiology, University of North Carolina, Chapel Hill, North Carolina
Ana Navas-Acien Department of Environmental Health Sciences, Johns Hopkins University, Baltimore, Maryland; Department of Epidemiology, Johns Hopkins Medical Institutions, Baltimore, Maryland; Welch Center for Prevention, Epidemiology and Clinical Research, Johns Hopkins Medical Institutions, Baltimore, Maryland; Department of Oncology, Johns Hopkins Medical Institutions, Baltimore, Maryland

Leveraging Identity-by-Descent for Accurate Genotype Inference in Family Sequencing Data. PLoS Genet 2015;11:e1005271. [PMID: 26043085 PMCID: PMC4456389 DOI: 10.1371/journal.pgen.1005271] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2014] [Accepted: 05/12/2015] [Indexed: 12/23/2022] Open

Abstract

Sequencing family DNA samples provides an attractive alternative to population based designs to identify rare variants associated with human disease due to the enrichment of causal variants in pedigrees. Previous studies showed that genotype calling accuracy can be improved by modeling family relatedness compared to standard calling algorithms. Current family-based variant calling methods use sequencing data on single variants and ignore the identity-by-descent (IBD) sharing along the genome. In this study we describe a new computational framework to accurately estimate the IBD sharing from the sequencing data, and to utilize the inferred IBD among family members to jointly call genotypes in pedigrees. Through simulations and application to real data, we showed that IBD can be reliably estimated across the genome, even at very low coverage (e.g. 2X), and genotype accuracy can be dramatically improved. Moreover, the improvement is more pronounced for variants with low frequencies, especially at low to intermediate coverage (e.g. 10X to 20X), making our approach effective in studying rare variants in cost-effective whole genome sequencing in pedigrees. We hope that our tool is useful to the research community for identifying rare variants for human disease through family-based sequencing.

To identify disease variants that occur less frequently in the population, sequencing families in which multiple individuals are affected is more powerful due to the enrichment of causal variants. An important step in such studies is to infer individual genotypes from sequencing data. Existing methods do not utilize full familial transmission information and therefore result in reduced accuracy of inferred genotypes. In this study we describe a new method that infers shared genetic material among family members and then incorporates the shared genomic information in a novel algorithm that can accurately infer genotypes. Our method is particularly advantageous when inferring low-frequency variants from less sequence data, making it effective in analyzing genome-wide sequence data. We implemented the algorithm in a computationally efficient tool to facilitate cost-effective sequencing in families for identifying disease genetic variants.

Kember RL, Georgi B, Bailey-Wilson JE, Stambolian D, Paul SM, Bućan M. Copy number variants encompassing Mendelian disease genes in a large multigenerational family segregating bipolar disorder. BMC Genet 2015;16:27. [PMID: 25887117 PMCID: PMC4382929 DOI: 10.1186/s12863-015-0184-1] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2014] [Accepted: 02/19/2015] [Indexed: 12/20/2022] Open

Livne OE, Han L, Alkorta-Aranburu G, Wentworth-Sheilds W, Abney M, Ober C, Nicolae DL. PRIMAL: Fast and accurate pedigree-based imputation from sequence data in a founder population. PLoS Comput Biol 2015;11:e1004139. [PMID: 25735005 PMCID: PMC4348507 DOI: 10.1371/journal.pcbi.1004139] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2014] [Accepted: 01/19/2015] [Indexed: 12/31/2022] Open

Saad M, Wijsman EM. Combining family- and population-based imputation data for association analysis of rare and common variants in large pedigrees. Genet Epidemiol 2014;38:579-90. [PMID: 25132070 PMCID: PMC4190076 DOI: 10.1002/gepi.21844] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2014] [Revised: 05/24/2014] [Accepted: 06/27/2014] [Indexed: 12/27/2022]

Blue EM, Sun L, Tintle NL, Wijsman EM. Value of Mendelian laws of segregation in families: data quality control, imputation, and beyond. Genet Epidemiol 2014;38 Suppl 1:S21-8. [PMID: 25112184 DOI: 10.1002/gepi.21821] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]

Chen W, Schaid DJ. PedBLIMP: extending linear predictors to impute genotypes in pedigrees. Genet Epidemiol 2014;38:531-41. [PMID: 25044249 DOI: 10.1002/gepi.21838] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2014] [Revised: 05/15/2014] [Accepted: 05/19/2014] [Indexed: 12/13/2022]

Identity-by-descent graphs offer a flexible framework for imputation and both linkage and association analyses. BMC Proc 2014;8:S19. [PMID: 25519371 PMCID: PMC4143703 DOI: 10.1186/1753-6561-8-s1-s19] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open

Rubenstein K, Raskind WH, Berninger VW, Matsushita MM, Wijsman EM. Genome scan for cognitive trait loci of dyslexia: Rapid naming and rapid switching of letters, numbers, and colors. Am J Med Genet B Neuropsychiatr Genet 2014;165B:345-56. [PMID: 24807833 PMCID: PMC4053475 DOI: 10.1002/ajmg.b.32237] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/06/2013] [Accepted: 04/14/2014] [Indexed: 12/14/2022]

Cheung CYK, Thompson EA, Wijsman EM. Detection of Mendelian consistent genotyping errors in pedigrees. Genet Epidemiol 2014;38:291-9. [PMID: 24718985 DOI: 10.1002/gepi.21806] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2013] [Revised: 03/03/2014] [Accepted: 03/04/2014] [Indexed: 11/12/2022]

Genomic view of bipolar disorder revealed by whole genome sequencing in a genetic isolate. PLoS Genet 2014;10:e1004229. [PMID: 24625924 PMCID: PMC3953017 DOI: 10.1371/journal.pgen.1004229] [Citation(s) in RCA: 63] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2013] [Accepted: 01/24/2014] [Indexed: 11/19/2022] Open

Abstract

Bipolar disorder is a common, heritable mental illness characterized by recurrent episodes of mania and depression. Despite considerable effort to elucidate the genetic underpinnings of bipolar disorder, causative genetic risk factors remain elusive. We conducted a comprehensive genomic analysis of bipolar disorder in a large Old Order Amish pedigree. Microsatellite genotypes and high-density SNP-array genotypes of 388 family members were combined with whole genome sequence data for 50 of these subjects, comprising 18 parent-child trios. This study design permitted evaluation of candidate variants within the context of haplotype structure by resolving the phase in sequenced parent-child trios and by imputation of variants into multiple unsequenced siblings. Non-parametric and parametric linkage analysis of the entire pedigree as well as on smaller clusters of families identified several nominally significant linkage peaks, each of which included dozens of predicted deleterious variants. Close inspection of exonic and regulatory variants in genes under the linkage peaks using family-based association tests revealed additional credible candidate genes for functional studies and further replication in population-based cohorts. However, despite the in-depth genomic characterization of this unique, large and multigenerational pedigree from a genetic isolate, there was no convergence of evidence implicating a particular set of risk loci or common pathways. The striking haplotype and locus heterogeneity we observed has profound implications for the design of studies of bipolar and other related disorders.

Bipolar disorder is a common, heritable mental illness characterized by recurrent episodes of mania and depression. Despite considerable efforts, genetic studies have yet to reveal the precise genetic underpinnings of the disorder. In this study we have analyzed a large extended pedigree of Old Order Amish that segregates bipolar disorder. Our study design integrates both dense genotype and whole-genome sequence data. In a combined linkage and association analysis we identify five chromosomal regions with nominally significant or suggestive evidence for linkage, several of which constitute replication of earlier linkage findings for bipolar disorder in non-Amish families. Association analysis of genetic variants in each of the linkage regions yielded a number of plausible candidate genes for bipolar disorder. The striking genetic heterogeneity we observed in this genetic isolate has profound implications for the study of bipolar disorder in the general population.


A statistical framework to guide sequencing choices in pedigrees. Am J Hum Genet 2014;94:257-67. [PMID: 24507777 DOI: 10.1016/j.ajhg.2014.01.005] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Received: 11/14/2013] [Accepted: 01/13/2014] [Indexed: 11/23/2022] Open

Thomas DC, Yang Z, Yang F. Two-phase and family-based designs for next-generation sequencing studies. Front Genet 2013;4:276. [PMID: 24379824 PMCID: PMC3861783 DOI: 10.3389/fgene.2013.00276] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Received: 09/28/2013] [Accepted: 11/19/2013] [Indexed: 12/21/2022] Open

Abstract

The cost of next-generation sequencing is now approaching that of early GWAS panels but is still out of reach for large epidemiologic studies, and the millions of rare variants expected pose challenges for distinguishing causal from non-causal variants. We review two types of designs for sequencing studies: two-phase designs for targeted follow-up of genome-wide association studies using unrelated individuals; and family-based designs exploiting co-segregation for prioritizing variants and genes. Two-phase designs subsample subjects for sequencing from a larger case-control study jointly on the basis of their disease and carrier status; the discovered variants are then tested for association in the parent study. The analysis combines the full sequence data from the substudy with the more limited SNP data from the main study. We discuss various methods for selecting this subset of variants and describe the expected yield of true positive associations in the context of an ongoing study of second breast cancers following radiotherapy. While the sharing of variants within families means that family-based designs are less efficient for discovery than sequencing unrelated individuals, the ability to exploit co-segregation of variants with disease within families helps distinguish causal from non-causal ones. Furthermore, by enriching for family history, the yield of causal variants can be improved, and use of identity-by-descent information improves imputation of genotypes for other family members. We compare the relative efficiency of these designs with those using unrelated individuals for discovering and prioritizing variants or genes for testing association in larger studies. While associations can be tested with single variants, power is low for rare ones. Recent generalizations of burden or kernel tests for gene-level associations to family-based data are appealing. These approaches are illustrated in the context of a family-based study of colorectal cancer.


Saad M, Wijsman EM. Power of family-based association designs to detect rare variants in large pedigrees using imputed genotypes. Genet Epidemiol 2013;38:1-9. [PMID: 24243664 DOI: 10.1002/gepi.21776] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.2] [Received: 08/12/2013] [Revised: 09/30/2013] [Accepted: 10/15/2013] [Indexed: 01/09/2023]

Marchani EE, Chapman NH, Cheung CYK, Ankenman K, Stanaway IB, Coon HH, Nickerson D, Bernier R, Brkanac Z, Wijsman EM. Identification of rare variants from exome sequence in a large pedigree with autism. Hum Hered 2013;74:153-64. [PMID: 23594493 DOI: 10.1159/000346560] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Indexed: 12/25/2022] Open