1
|
Fu Y, Aganezov S, Mahmoud M, Beaulaurier J, Juul S, Treangen TJ, Sedlazeck FJ. MethPhaser: methylation-based long-read haplotype phasing of human genomes. Nat Commun 2024; 15:5327. [PMID: 38909018 PMCID: PMC11193733 DOI: 10.1038/s41467-024-49588-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2023] [Accepted: 06/11/2024] [Indexed: 06/24/2024] Open
Abstract
The assignment of variants across haplotypes, phasing, is crucial for predicting the consequences, interaction, and inheritance of mutations and is a key step in improving our understanding of phenotype and disease. However, phasing is limited by read length and stretches of homozygosity along the genome. To overcome this limitation, we designed MethPhaser, a method that utilizes methylation signals from Oxford Nanopore Technologies to extend Single Nucleotide Variation (SNV)-based phasing. We demonstrate that haplotype-specific methylations extensively exist in Human genomes and the advent of long-read technologies enabled direct report of methylation signals. For ONT R9 and R10 cell line data, we increase the phase length N50 by 78%-151% at a phasing accuracy of 83.4-98.7% To assess the impact of tissue purity and random methylation signals due to inactivation, we also applied MethPhaser on blood samples from 4 patients, still showing improvements over SNV-only phasing. MethPhaser further improves phasing across HLA and multiple other medically relevant genes, improving our understanding of how mutations interact across multiple phenotypes. The concept of MethPhaser can also be extended to non-human diploid genomes. MethPhaser is available at https://github.com/treangenlab/methphaser .
Collapse
Affiliation(s)
- Yilei Fu
- Department of Computer Science, Rice University, Houston, TX, USA
| | | | - Medhat Mahmoud
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA
| | | | - Sissel Juul
- Oxford Nanopore Technologies Inc, New York, NY, USA
| | - Todd J Treangen
- Department of Computer Science, Rice University, Houston, TX, USA.
- Department of Bioengineering, Rice University, Houston, TX, USA.
| | - Fritz J Sedlazeck
- Department of Computer Science, Rice University, Houston, TX, USA.
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA.
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA.
| |
Collapse
|
2
|
Jin X, Shi G. Cauchy combination methods for the detection of gene-environment interactions for rare variants related to quantitative phenotypes. Heredity (Edinb) 2023; 131:241-252. [PMID: 37481617 PMCID: PMC10539363 DOI: 10.1038/s41437-023-00640-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2022] [Revised: 07/09/2023] [Accepted: 07/12/2023] [Indexed: 07/24/2023] Open
Abstract
The characterization of gene-environment interactions (GEIs) can provide detailed insights into the biological mechanisms underlying complex diseases. Despite recent interest in GEIs for rare variants, published GEI tests are underpowered for an extremely small proportion of causal rare variants in a gene or a region. By extending the aggregated Cauchy association test (ACAT), we propose three GEI tests to address this issue: a Cauchy combination GEI test with fixed main effects (CCGEI-F), a Cauchy combination GEI test with random main effects (CCGEI-R), and an omnibus Cauchy combination GEI test (CCGEI-O). ACAT was applied to combine p values of single-variant GEI analyses to obtain CCGEI-F and CCGEI-R and p values of multiple GEI tests were combined in CCGEI-O. Through numerical simulations, for small numbers of causal variants, CCGEI-F, CCGEI-R and CCGEI-O provided approximately 5% higher power than the existing GEI tests INT-FIX and INT-RAN; however, they had slightly higher power than the existing GEI test TOW-GE. For large numbers of causal variants, although CCGEI-F and CCGEI-R exhibited comparable or slightly lower power values than the competing tests, the results were still satisfactory. Among all simulation conditions evaluated, CCGEI-O provided significantly higher power than that of competing GEI tests. We further applied our GEI tests in genome-wide analyses of systolic blood pressure or diastolic blood pressure to detect gene-body mass index (BMI) interactions, using whole-exome sequencing data from UK Biobank. At a suggestive significance level of 1.0 × 10-4, KCNC4, GAR1, FAM120AOS and NT5C3B showed interactions with BMI by our GEI tests.
Collapse
Affiliation(s)
- Xiaoqin Jin
- State Key Laboratory of Integrated Services Networks, Xidian University, 2 South Taibai Road, Xi'an, Shaanxi, 710071, China.
| | - Gang Shi
- State Key Laboratory of Integrated Services Networks, Xidian University, 2 South Taibai Road, Xi'an, Shaanxi, 710071, China
| |
Collapse
|
3
|
Mukherjee S, Cassini TA, Hu N, Yang T, Li B, Shen W, Moth CW, Rinker DC, Sheehan JH, Cogan JD, Newman JH, Hamid R, Macdonald RL, Roden DM, Meiler J, Kuenze G, Phillips JA, Capra JA. Personalized structural biology reveals the molecular mechanisms underlying heterogeneous epileptic phenotypes caused by de novo KCNC2 variants. HGG ADVANCES 2022; 3:100131. [PMID: 36035247 PMCID: PMC9399384 DOI: 10.1016/j.xhgg.2022.100131] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2022] [Accepted: 07/11/2022] [Indexed: 11/28/2022] Open
Abstract
Whole-exome sequencing (WES) in the clinic has identified several rare monogenic developmental and epileptic encephalopathies (DEE) caused by ion channel variants. However, WES often fails to provide actionable insight for rare diseases, such as DEEs, due to the challenges of interpreting variants of unknown significance (VUS). Here, we describe a "personalized structural biology" (PSB) approach that leverages recent innovations in the analysis of protein 3D structures to address this challenge. We illustrate this approach in an Undiagnosed Diseases Network (UDN) individual with DEE symptoms and a de novo VUS in KCNC2 (p.V469L), the Kv3.2 voltage-gated potassium channel. A nearby KCNC2 variant (p.V471L) was recently suggested to cause DEE-like phenotypes. Computational structural modeling suggests that both affect protein function. However, despite their proximity, the p.V469L variant is likely to sterically block the channel pore, while the p.V471L variant is likely to stabilize the open state. Biochemical and electrophysiological analyses demonstrate heterogeneous loss-of-function and gain-of-function effects, as well as differential response to 4-aminopyridine treatment. Molecular dynamics simulations illustrate that the pore of the p.V469L variant is more constricted, increasing the energetic barrier for K+ permeation, whereas the p.V471L variant stabilizes the open conformation. Our results implicate variants in KCNC2 as causative for DEE and guide the interpretation of a UDN individual. They further delineate the molecular basis for the heterogeneous clinical phenotypes resulting from two proximal pathogenic variants. This demonstrates how the PSB approach can provide an analytical framework for individualized hypothesis-driven interpretation of protein-coding VUS.
Collapse
Affiliation(s)
- Souhrid Mukherjee
- Department of Biological Sciences, Vanderbilt University, Nashville, TN 37235, USA
- Center for Structural Biology, Vanderbilt University, Nashville, TN 37235, USA
| | - Thomas A. Cassini
- Department of Internal Medicine, National Institutes of Health Clinical Center, Bethesda, MD 20814, USA
| | - Ningning Hu
- Department of Neurology, Vanderbilt University Medical Center, Nashville, TN 37232, USA
- Department of Genetic Medicine, Vanderbilt University Medical Center, Nashville, TN 37232, USA
| | - Tao Yang
- Division of Clinical Pharmacology, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37232, USA
| | - Bian Li
- Department of Biological Sciences, Vanderbilt University, Nashville, TN 37235, USA
- Division of Clinical Pharmacology, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37232, USA
- Center for Structural Biology, Vanderbilt University, Nashville, TN 37235, USA
| | - Wangzhen Shen
- Department of Neurology, Vanderbilt University Medical Center, Nashville, TN 37232, USA
| | - Christopher W. Moth
- Center for Structural Biology, Vanderbilt University, Nashville, TN 37235, USA
| | - David C. Rinker
- Department of Chemistry, Vanderbilt University, Nashville, TN 37235, USA
- Center for Structural Biology, Vanderbilt University, Nashville, TN 37235, USA
| | - Jonathan H. Sheehan
- Center for Structural Biology, Vanderbilt University, Nashville, TN 37235, USA
- John T. Milliken Department of Internal Medicine, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Joy D. Cogan
- Department of Pediatrics, Division of Medical Genetics and Genomic Medicine, Vanderbilt University School of Medicine, Nashville, TN 37232, USA
| | - Undiagnosed Diseases Network
- Department of Biological Sciences, Vanderbilt University, Nashville, TN 37235, USA
- Department of Pediatrics, Division of Medical Genetics and Genomic Medicine, Vanderbilt University School of Medicine, Nashville, TN 37232, USA
- Pulmonary Hypertension Center, Division of Allergy, Pulmonary and Critical Care Medicine, Vanderbilt University Medical Center, Nashville, TN 37232, USA
- Department of Chemistry, Vanderbilt University, Nashville, TN 37235, USA
- Division of Clinical Pharmacology, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37232, USA
- Center for Structural Biology, Vanderbilt University, Nashville, TN 37235, USA
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37232, USA
- Department of Neurology, Vanderbilt University Medical Center, Nashville, TN 37232, USA
- Department of Genetic Medicine, Vanderbilt University Medical Center, Nashville, TN 37232, USA
- John T. Milliken Department of Internal Medicine, Washington University School of Medicine, St. Louis, MO 63110, USA
- Department of Internal Medicine, National Institutes of Health Clinical Center, Bethesda, MD 20814, USA
- Institute for Drug Discovery, Leipzig University Medical School, Leipzig, SAC 04103, Germany
- Department of Chemistry, Leipzig University, Leipzig, SAC 04109, Germany
- Department of Computer Science, Leipzig University, Leipzig, SAC 04109, Germany
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN 37232, USA
- Bakar Computational Health Sciences Institute and Department of Epidemiology and Biostatistics, University of California, San Francisco, San Francisco, CA 94143, USA
| | - John H. Newman
- Pulmonary Hypertension Center, Division of Allergy, Pulmonary and Critical Care Medicine, Vanderbilt University Medical Center, Nashville, TN 37232, USA
| | - Rizwan Hamid
- Department of Pediatrics, Division of Medical Genetics and Genomic Medicine, Vanderbilt University School of Medicine, Nashville, TN 37232, USA
| | - Robert L. Macdonald
- Division of Clinical Pharmacology, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37232, USA
- Department of Neurology, Vanderbilt University Medical Center, Nashville, TN 37232, USA
| | - Dan M. Roden
- Division of Clinical Pharmacology, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37232, USA
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37232, USA
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN 37232, USA
| | - Jens Meiler
- Department of Chemistry, Vanderbilt University, Nashville, TN 37235, USA
- Division of Clinical Pharmacology, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37232, USA
- Center for Structural Biology, Vanderbilt University, Nashville, TN 37235, USA
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37232, USA
- Institute for Drug Discovery, Leipzig University Medical School, Leipzig, SAC 04103, Germany
- Department of Chemistry, Leipzig University, Leipzig, SAC 04109, Germany
- Department of Computer Science, Leipzig University, Leipzig, SAC 04109, Germany
| | - Georg Kuenze
- Department of Chemistry, Vanderbilt University, Nashville, TN 37235, USA
- Center for Structural Biology, Vanderbilt University, Nashville, TN 37235, USA
- Institute for Drug Discovery, Leipzig University Medical School, Leipzig, SAC 04103, Germany
| | - John A. Phillips
- Department of Pediatrics, Division of Medical Genetics and Genomic Medicine, Vanderbilt University School of Medicine, Nashville, TN 37232, USA
| | - John A. Capra
- Department of Biological Sciences, Vanderbilt University, Nashville, TN 37235, USA
- Center for Structural Biology, Vanderbilt University, Nashville, TN 37235, USA
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37232, USA
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN 37232, USA
- Bakar Computational Health Sciences Institute and Department of Epidemiology and Biostatistics, University of California, San Francisco, San Francisco, CA 94143, USA
| |
Collapse
|
4
|
Susgun S, Kasan K, Yucesan E. Gene Hunting Approaches through the Combination of Linkage Analysis with Whole-Exome Sequencing in Mendelian Diseases: From Darwin to the Present Day. Public Health Genomics 2022; 24:207-217. [PMID: 34237751 DOI: 10.1159/000517102] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2021] [Accepted: 04/27/2021] [Indexed: 11/19/2022] Open
Abstract
BACKGROUND In the context of medical genetics, gene hunting is the process of identifying and functionally characterizing genes or genetic variations that contribute to disease phenotypes. In this review, we would like to summarize gene hunting process in terms of historical aspects from Darwin to now. For this purpose, different approaches and recent developments will be detailed. SUMMARY Linkage analysis and association studies are the most common methods in use for explaining the genetic background of hereditary diseases and disorders. Although linkage analysis is a relatively old approach, it is still a powerful method to detect disease-causing rare variants using family-based data, particularly for consanguineous marriages. As is known that, consanguineous marriages or endogamy poses a social problem in developing countries, however, this same condition also provides a unique opportunity for scientists to identify and characterize pathogenic variants. The rapid advancements in sequencing technologies and their parallel implementation together with linkage analyses now allow us to identify the candidate variants related to diseases in a relatively short time. Furthermore, we can now go one step further and functionally characterize the causative variant through in vitro and in vivo studies and unveil the variant-phenotype relationships on a molecular level more robustly. Key Messages: Herein, we suggest that the combined analysis of linkage and exome analysis is a powerful and precise tool to diagnose clinically rare and recessively inherited conditions.
Collapse
Affiliation(s)
- Seda Susgun
- Department of Medical Biology, Faculty of Medicine, Bezmialem Vakif University, Istanbul, Turkey.,Graduate School of Health Sciences, Istanbul University, Istanbul, Turkey
| | - Koray Kasan
- Faculty of Medicine, Bezmialem Vakif University, Istanbul, Turkey
| | - Emrah Yucesan
- Department of Medical Biology, Faculty of Medicine, Bezmialem Vakif University, Istanbul, Turkey
| |
Collapse
|
5
|
Mukherjee S, Cogan JD, Newman JH, Phillips JA, Hamid R, Meiler J, Capra JA. Identifying digenic disease genes via machine learning in the Undiagnosed Diseases Network. Am J Hum Genet 2021; 108:1946-1963. [PMID: 34529933 PMCID: PMC8546038 DOI: 10.1016/j.ajhg.2021.08.010] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2020] [Accepted: 08/25/2021] [Indexed: 12/20/2022] Open
Abstract
Rare diseases affect millions of people worldwide, and discovering their genetic causes is challenging. More than half of the individuals analyzed by the Undiagnosed Diseases Network (UDN) remain undiagnosed. The central hypothesis of this work is that many of these rare genetic disorders are caused by multiple variants in more than one gene. However, given the large number of variants in each individual genome, experimentally evaluating combinations of variants for potential to cause disease is currently infeasible. To address this challenge, we developed the digenic predictor (DiGePred), a random forest classifier for identifying candidate digenic disease gene pairs by features derived from biological networks, genomics, evolutionary history, and functional annotations. We trained the DiGePred classifier by using DIDA, the largest available database of known digenic-disease-causing gene pairs, and several sets of non-digenic gene pairs, including variant pairs derived from unaffected relatives of UDN individuals. DiGePred achieved high precision and recall in cross-validation and on a held-out test set (PR area under the curve > 77%), and we further demonstrate its utility by using digenic pairs from the recent literature. In contrast to other approaches, DiGePred also appropriately controls the number of false positives when applied in realistic clinical settings. Finally, to enable the rapid screening of variant gene pairs for digenic disease potential, we freely provide the predictions of DiGePred on all human gene pairs. Our work enables the discovery of genetic causes for rare non-monogenic diseases by providing a means to rapidly evaluate variant gene pairs for the potential to cause digenic disease.
Collapse
Affiliation(s)
- Souhrid Mukherjee
- Department of Biological Sciences, Vanderbilt University, Nashville, TN 37235, USA
| | - Joy D Cogan
- Department of Pediatrics, Division of Medical Genetics and Genomic Medicine, Vanderbilt University School of Medicine, Nashville, TN 37232, USA
| | - John H Newman
- Pulmonary Hypertension Center, Division of Allergy, Pulmonary, and Critical Care Medicine, Vanderbilt University Medical Center, Nashville, TN 37232, USA
| | - John A Phillips
- Department of Pediatrics, Division of Medical Genetics and Genomic Medicine, Vanderbilt University School of Medicine, Nashville, TN 37232, USA
| | - Rizwan Hamid
- Department of Pediatrics, Division of Medical Genetics and Genomic Medicine, Vanderbilt University School of Medicine, Nashville, TN 37232, USA
| | - Jens Meiler
- Department of Chemistry, Vanderbilt University, Nashville, TN 37235, USA; Department of Pharmacology, Vanderbilt University, Nashville, TN 37235, USA; Center for Structural Biology, Vanderbilt University, Nashville, TN 37235, USA; Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37232, USA; Institute for Drug Discovery, Leipzig University Medical School, Leipzig 04103, Germany; Department of Chemistry, Leipzig University, Leipzig 04109, Germany; Department of Computer Science, Leipzig University, Leipzig 04109, Germany.
| | - John A Capra
- Department of Biological Sciences, Vanderbilt University, Nashville, TN 37235, USA; Center for Structural Biology, Vanderbilt University, Nashville, TN 37235, USA; Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37232, USA; Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN 37232, USA; Bakar Computational Health Sciences Institute and Department of Epidemiology and Biostatistics, University of California, San Francisco, San Francisco, CA 94143, USA.
| |
Collapse
|
6
|
Lim E, Chen H, Dupuis J, Liu CT. A unified method for rare variant analysis of gene-environment interactions. Stat Med 2020; 39:801-813. [PMID: 31799744 PMCID: PMC7261513 DOI: 10.1002/sim.8446] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2018] [Revised: 11/19/2019] [Accepted: 11/21/2019] [Indexed: 01/17/2023]
Abstract
Advanced technology in whole-genome sequencing has offered the opportunity to comprehensively investigate the genetic contribution, particularly rare variants, to complex traits. Several region-based tests have been developed to jointly model the marginal effect of rare variants, but methods to detect gene-environment (GE) interactions are underdeveloped. Identifying the modification effects of environmental factors on genetic risk poses a considerable challenge. To tackle this challenge, we develop a method to detect GE interactions for rare variants using generalized linear mixed effect model. The proposed method can accommodate either binary or continuous traits in related or unrelated samples. Under this model, genetic main effects, GE interactions, and sample relatedness are modeled as random effects. We adopt a kernel-based method to leverage the joint information across rare variants and implement variance component score tests to reduce the computational burden. Our simulation studies of continuous and binary traits show that the proposed method maintains correct type I error rates and appropriate power under various scenarios, such as genotype main effects and GE interaction effects in opposite directions and varying the proportion of causal variants in the model. We apply our method in the Framingham Heart Study to test GE interaction of smoking on body mass index or overweight status and replicate the Cholinergic Receptor Nicotinic Beta 4 gene association reported in previous large consortium meta-analysis of single nucleotide polymorphism-smoking interaction. Our proposed set-based GE test is computationally efficient and is applicable to both binary and continuous phenotypes, while appropriately accounting for familial or cryptic relatedness.
Collapse
Affiliation(s)
- Elise Lim
- Department of Biostatistics, Boston University, Boston, Massachusetts
| | - Han Chen
- Human Genetics Center, Department of Epidemiology, Human Genetics and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, Texas
- Center for Precision Health, School of Public Health and School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, Texas
| | - Josée Dupuis
- Department of Biostatistics, Boston University, Boston, Massachusetts
| | - Ching-Ti Liu
- Department of Biostatistics, Boston University, Boston, Massachusetts
| |
Collapse
|
7
|
Wang L, Lee S, Qiao D, Cho MH, Silverman EK, Lange C, Won S. metaFARVAT: An Efficient Tool for Meta-Analysis of Family-Based, Case-Control, and Population-Based Rare Variant Association Studies. Front Genet 2019; 10:572. [PMID: 31275357 PMCID: PMC6593391 DOI: 10.3389/fgene.2019.00572] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2018] [Accepted: 05/31/2019] [Indexed: 11/13/2022] Open
Abstract
Family-based designs have been shown to be powerful in detecting the significant rare variants associated with human diseases. However, very few significant results have been found owing to relatively small sample sizes and the fact that statistical analyses often suffer from high false-negative error rates. These limitations can be avoided by combining results from multiple studies via meta-analysis. However, statistical methods for meta-analysis with rare variants are limited for family-based samples. In this report, we propose a tool for the meta-analysis of family-based rare variant associations, metaFARVAT. metaFARVAT is based on a quasi-likelihood score for each variant. These scores are combined to generate burden test, variable-threshold test, sequence kernel association test (SKAT), and optimal SKAT statistics. The proposed method tests homogeneous and heterogeneous effects of variants among different studies and can be applied to both quantitative and dichotomous phenotypes. Simulation results demonstrated the robustness and efficiency of the proposed method in different scenarios. By applying metaFARVAT to data from a family-based study and a case-control study, we identified a few promising candidate genes, including DLEC1, which is associated with chronic obstructive pulmonary disease.
Collapse
Affiliation(s)
- Longfei Wang
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, South Korea
| | - Sungyoung Lee
- Center for Precision Medicine, Seoul National University Hospital, Seoul, South Korea
| | - Dandi Qiao
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, United States
| | - Michael H Cho
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, United States.,Division of Pulmonary and Critical Care Medicine, Brigham and Women's Hospital, Boston, MA, United States
| | - Edwin K Silverman
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, United States.,Division of Pulmonary and Critical Care Medicine, Brigham and Women's Hospital, Boston, MA, United States
| | - Christoph Lange
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, United States.,Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, United States
| | - Sungho Won
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, South Korea.,Department of Public Health Sciences, Seoul National University, Seoul, South Korea.,Institute of Health and Environment, Seoul National University, Seoul, South Korea
| |
Collapse
|
8
|
Bureau A, Begum F, Taub MA, Hetmanski J, Parker MM, Albacha-Hejazi H, Scott AF, Murray JC, Marazita ML, Bailey-Wilson JE, Beaty TH, Ruczinski I. Inferring disease risk genes from sequencing data in multiplex pedigrees through sharing of rare variants. Genet Epidemiol 2019; 43:37-49. [PMID: 30246882 PMCID: PMC6330140 DOI: 10.1002/gepi.22155] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2018] [Revised: 07/11/2018] [Accepted: 07/15/2018] [Indexed: 12/23/2022]
Abstract
We previously demonstrated how sharing of rare variants (RVs) in distant affected relatives can be used to identify variants causing a complex and heterogeneous disease. This approach tested whether single RVs were shared by all sequenced affected family members. However, as with other study designs, joint analysis of several RVs (e.g., within genes) is sometimes required to obtain sufficient statistical power. Further, phenocopies can lead to false negatives for some causal RVs if complete sharing among affected is required. Here, we extend our methodology (Rare Variant Sharing, RVS) to address these issues. Specifically, we introduce gene-based analyses, a partial sharing test based on RV sharing probabilities for subsets of affected relatives and a haplotype-based RV definition. RVS also has the desirable feature of not requiring external estimates of variant frequency or control samples, provides functionality to assess and address violations of key assumptions, and is available as open source software for genome-wide analysis. Simulations including phenocopies, based on the families of an oral cleft study, revealed the partial and complete sharing versions of RVS achieved similar statistical power compared with alternative methods (RareIBD and the Gene-Based Segregation Test), and had superior power compared with the pedigree Variant Annotation, Analysis, and Search Tool (pVAAST) linkage statistic. In studies of multiplex cleft families, analysis of rare single nucleotide variants in the exome of 151 affected relatives from 54 families revealed no significant excess sharing in any one gene, but highlighted different patterns of sharing revealed by the complete and partial sharing tests.
Collapse
Affiliation(s)
- Alexandre Bureau
- Département de Médecine Sociale et Préventive, Université Laval, Québec, Canada
- Centre de recherche CERVO, Québec, Canada
| | - Ferdouse Begum
- Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Margaret A. Taub
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Jacqueline Hetmanski
- Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Margaret M. Parker
- Channing Division of Network Medicine, Harvard Medical School, Boston, MA, USA
| | | | - Alan F. Scott
- Institute of Genetic Medicine, Johns Hopkins Medical Institutions, Baltimore, MD, USA
| | - Jeffrey C. Murray
- Department of Pediatrics, School of Medicine, University of Iowa, Iowa City, IA, USA
| | - Mary L. Marazita
- Department of Oral Biology, School of Dental Medicine, University of Pittsburgh, Pittsburgh, PA, USA
| | - Joan E. Bailey-Wilson
- Inherited Disease Research Branch, National Human Genome Research Institute, Baltimore, MD, USA
| | - Terri H. Beaty
- Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Ingo Ruczinski
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| |
Collapse
|
9
|
Shi WL, Zhang YB, Wei W, Gao HY, Huang YH. WITHDRAWN: Whole genome sequencing identifies novel NOTCH3 mutations for leukoaraiosis. Biochem Biophys Res Commun 2018:S0006-291X(18)30297-3. [PMID: 29428736 DOI: 10.1016/j.bbrc.2018.02.074] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2018] [Accepted: 02/07/2018] [Indexed: 10/18/2022]
Abstract
This article has been withdrawn at the request of the author(s) and/or editor. The Publisher apologizes for any inconvenience this may cause. The full Elsevier Policy on Article Withdrawal can be found at https://www.elsevier.com/about/our-business/policies/article-withdrawal.
Collapse
Affiliation(s)
- Wen-Lei Shi
- Department of Neurology, Beijing Military General Hospital, Beijing, 10070, China; Department of Neurology, Bethune International Peace Hospital, Shijiazhuang, HeBei, 050082, PR China
| | - Yong-Biao Zhang
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, 100029, China
| | - Wei Wei
- Department of Neurology, Beijing Military General Hospital, Beijing, 10070, China
| | - Hong-Yan Gao
- Section of Science Research and Training, Beijing Military General Hospital, Beijing 10070, China
| | - Yong-Hua Huang
- Department of Neurology, Beijing Military General Hospital, Beijing, 10070, China
| |
Collapse
|
10
|
Qiao D, Lange C, Beaty TH, Crapo JD, Barnes KC, Bamshad M, Hersh CP, Morrow J, Pinto-Plata VM, Marchetti N, Bueno R, Celli BR, Criner GJ, Silverman EK, Cho MH. Exome Sequencing Analysis in Severe, Early-Onset Chronic Obstructive Pulmonary Disease. Am J Respir Crit Care Med 2017; 193:1353-63. [PMID: 26736064 DOI: 10.1164/rccm.201506-1223oc] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
RATIONALE Genomic regions identified by genome-wide association studies explain only a small fraction of heritability for chronic obstructive pulmonary disease (COPD). Alpha-1 antitrypsin deficiency shows that rare coding variants of large effect also influence COPD susceptibility. We hypothesized that exome sequencing in families identified through a proband with severe, early-onset COPD would identify additional rare genetic determinants of large effect. OBJECTIVES To identify rare genetic determinants of severe COPD. METHODS We applied filtering approaches to identify potential causal variants for COPD in whole exomes from 347 subjects in 49 extended pedigrees from the Boston Early-Onset COPD Study. We assessed the power of this approach under different levels of genetic heterogeneity using simulations. We tested genes identified in these families using gene-based association tests in exomes of 204 cases with severe COPD and 195 resistant smokers from the COPDGene study. In addition, we examined previously described loci associated with COPD using these datasets. MEASUREMENTS AND MAIN RESULTS We identified 69 genes with predicted deleterious nonsynonymous, stop, or splice variants that segregated with severe COPD in at least two pedigrees. Four genes (DNAH8, ALCAM, RARS, and GBF1) also demonstrated an increase in rare nonsynonymous, stop, and/or splice mutations in cases compared with resistant smokers from the COPDGene study; however, these results were not statistically significant. We demonstrate the limitations of the power of this approach under genetic heterogeneity through simulation. CONCLUSIONS Rare deleterious coding variants may increase risk for COPD, but multiple genes likely contribute to COPD susceptibility.
Collapse
Affiliation(s)
| | - Christoph Lange
- 2 Department of Biostatistics, Harvard School of Public Health, Boston, Massachusetts
| | - Terri H Beaty
- 3 Johns Hopkins Bloomberg School of Public Health, and
| | | | - Kathleen C Barnes
- 5 Division of Allergy and Clinical Immunology, Department of Medicine, Johns Hopkins University, Baltimore, Maryland
| | - Michael Bamshad
- 6 Division of Genetic Medicine, Department of Pediatrics, University of Washington and Seattle Children's Hospital, Seattle, Washington
| | - Craig P Hersh
- 1 Channing Division of Network Medicine.,7 Division of Pulmonary and Critical Care Medicine, and
| | | | - Victor M Pinto-Plata
- 8 Department of Critical Care Medicine and Pulmonary Disease, Baystate Medical Center, Springfield, Massachusetts
| | | | - Raphael Bueno
- 10 Division of Thoracic Surgery, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, Massachusetts
| | | | - Gerald J Criner
- 11 Division of Pulmonary and Critical Care Medicine Temple University School of Medicine, Philadelphia, Pennsylvania
| | - Edwin K Silverman
- 1 Channing Division of Network Medicine.,7 Division of Pulmonary and Critical Care Medicine, and
| | - Michael H Cho
- 1 Channing Division of Network Medicine.,7 Division of Pulmonary and Critical Care Medicine, and
| | | | | | | |
Collapse
|
11
|
Qiao D, Lange C, Laird NM, Won S, Hersh CP, Morrow J, Hobbs BD, Lutz SM, Ruczinski I, Beaty TH, Silverman EK, Cho MH. Gene-based segregation method for identifying rare variants in family-based sequencing studies. Genet Epidemiol 2017; 41:309-319. [PMID: 28191685 DOI: 10.1002/gepi.22037] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2016] [Revised: 11/30/2016] [Accepted: 12/09/2016] [Indexed: 11/11/2022]
Abstract
Whole-exome sequencing using family data has identified rare coding variants in Mendelian diseases or complex diseases with Mendelian subtypes, using filters based on variant novelty, functionality, and segregation with the phenotype within families. However, formal statistical approaches are limited. We propose a gene-based segregation test (GESE) that quantifies the uncertainty of the filtering approach. It is constructed using the probability of segregation events under the null hypothesis of Mendelian transmission. This test takes into account different degrees of relatedness in families, the number of functional rare variants in the gene, and their minor allele frequencies in the corresponding population. In addition, a weighted version of this test allows incorporating additional subject phenotypes to improve statistical power. We show via simulations that the GESE and weighted GESE tests maintain appropriate type I error rate, and have greater power than several commonly used region-based methods. We apply our method to whole-exome sequencing data from 49 extended pedigrees with severe, early-onset chronic obstructive pulmonary disease (COPD) in the Boston Early-Onset COPD study (BEOCOPD) and identify several promising candidate genes. Our proposed methods show great potential for identifying rare coding variants of large effect and high penetrance for family-based sequencing data. The proposed tests are implemented in an R package that is available on CRAN (https://cran.r-project.org/web/packages/GESE/).
Collapse
Affiliation(s)
- Dandi Qiao
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, Massachusetts, United States of America
| | - Christoph Lange
- Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, Massachusetts, United States of America
| | - Nan M Laird
- Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, Massachusetts, United States of America
| | - Sungho Won
- Department of Public Health Science, Seoul National University, Seoul, Republic of Korea
| | - Craig P Hersh
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, Massachusetts, United States of America.,Division of Pulmonary and Critical Care Medicine, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Jarrett Morrow
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, Massachusetts, United States of America
| | - Brian D Hobbs
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, Massachusetts, United States of America.,Division of Pulmonary and Critical Care Medicine, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Sharon M Lutz
- Department of Biostatistics, Anschutz Medical Campus, University of Colorado, Aurora, Colorado, United States of America
| | - Ingo Ruczinski
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, United States of America
| | - Terri H Beaty
- Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, United States of America
| | - Edwin K Silverman
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, Massachusetts, United States of America.,Division of Pulmonary and Critical Care Medicine, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Michael H Cho
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, Massachusetts, United States of America.,Division of Pulmonary and Critical Care Medicine, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, United States of America.,University of Washington Center for Mendelian Genomics, Seattle, Washington, United States of America
| |
Collapse
|
12
|
Chung RH, Tsai WY, Kang CY, Yao PJ, Tsai HJ, Chen CH. FamPipe: An Automatic Analysis Pipeline for Analyzing Sequencing Data in Families for Disease Studies. PLoS Comput Biol 2016; 12:e1004980. [PMID: 27272119 PMCID: PMC4894624 DOI: 10.1371/journal.pcbi.1004980] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2015] [Accepted: 05/12/2016] [Indexed: 11/18/2022] Open
Abstract
In disease studies, family-based designs have become an attractive approach to analyzing next-generation sequencing (NGS) data for the identification of rare mutations enriched in families. Substantial research effort has been devoted to developing pipelines for automating sequence alignment, variant calling, and annotation. However, fewer pipelines have been designed specifically for disease studies. Most of the current analysis pipelines for family-based disease studies using NGS data focus on a specific function, such as identifying variants with Mendelian inheritance or identifying shared chromosomal regions among affected family members. Consequently, some other useful family-based analysis tools, such as imputation, linkage, and association tools, have yet to be integrated and automated. We developed FamPipe, a comprehensive analysis pipeline, which includes several family-specific analysis modules, including the identification of shared chromosomal regions among affected family members, prioritizing variants assuming a disease model, imputation of untyped variants, and linkage and association tests. We used simulation studies to compare properties of some modules implemented in FamPipe, and based on the results, we provided suggestions for the selection of modules to achieve an optimal analysis strategy. The pipeline is under the GNU GPL License and can be downloaded for free at http://fampipe.sourceforge.net.
Collapse
Affiliation(s)
- Ren-Hua Chung
- Division of Biostatistics and Bioinformatics, Institute of Population Health Sciences, National Health Research Institutes, Zhunan, Miaoli County, Taiwan
- * E-mail:
| | - Wei-Yun Tsai
- Division of Biostatistics and Bioinformatics, Institute of Population Health Sciences, National Health Research Institutes, Zhunan, Miaoli County, Taiwan
| | - Chen-Yu Kang
- Division of Biostatistics and Bioinformatics, Institute of Population Health Sciences, National Health Research Institutes, Zhunan, Miaoli County, Taiwan
| | - Po-Ju Yao
- Division of Biostatistics and Bioinformatics, Institute of Population Health Sciences, National Health Research Institutes, Zhunan, Miaoli County, Taiwan
| | - Hui-Ju Tsai
- Division of Biostatistics and Bioinformatics, Institute of Population Health Sciences, National Health Research Institutes, Zhunan, Miaoli County, Taiwan
- Department of Public Health, China Medical University, Taichung, Taiwan
- Department of Pediatrics, Feinberg School of Medicine, Northwestern University, Chicago, Illinois, United States of America
| | - Chia-Hsiang Chen
- Department of Psychiatry, Chang Gung Memorial Hospital-Linkou, Gueishan, Taoyuan, Taiwan
- Department and Graduate Institute of Biomedical Sciences, Chang Gung University, Taoyuan, Taiwan
| |
Collapse
|
13
|
Lin WY. Beyond Rare-Variant Association Testing: Pinpointing Rare Causal Variants in Case-Control Sequencing Study. Sci Rep 2016; 6:21824. [PMID: 26903168 PMCID: PMC4763184 DOI: 10.1038/srep21824] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2015] [Accepted: 02/01/2016] [Indexed: 12/31/2022] Open
Abstract
Rare-variant association testing usually requires some method of aggregation. The next important step is to pinpoint individual rare causal variants among a large number of variants within a genetic region. Recently Ionita-Laza et al. propose a backward elimination (BE) procedure that can identify individual causal variants among the many variants in a gene. The BE procedure removes a variant if excluding this variant can lead to a smaller P-value for the BURDEN test (referred to as "BE-BURDEN") or the SKAT test (referred to as "BE-SKAT"). We here use the adaptive combination of P-values (ADA) method to pinpoint causal variants. Unlike most gene-based association tests, the ADA statistic is built upon per-site P-values of individual variants. It is straightforward to select important variants given the optimal P-value truncation threshold found by ADA. We performed comprehensive simulations to compare ADA with BE-SKAT and BE-BURDEN. Ranking these three approaches according to positive predictive values (PPVs), the percentage of truly causal variants among the total selected variants, we found ADA > BE-SKAT > BE-BURDEN across all simulation scenarios. We therefore recommend using ADA to pinpoint plausible rare causal variants in a gene.
Collapse
Affiliation(s)
- Wan-Yu Lin
- Institute of Epidemiology and Preventive Medicine, College of Public Health, National Taiwan University, Taipei, Taiwan
- Department of Public Health, College of Public Health, National Taiwan University, Taipei, Taiwan
| |
Collapse
|
14
|
Abstract
Genomic DNA sequencing technologies have been one of the great advances of the 21st century, having decreased in cost by seven orders of magnitude and opening up new fields of investigation throughout research and clinical medicine. Genomics coupled with biochemical investigation has allowed the molecular definition of a growing number of new genetic diseases that reveal new concepts of immune regulation. Also, defining the genetic pathogenesis of these diseases has led to improved diagnosis, prognosis, genetic counseling, and, most importantly, new therapies. We highlight the investigational journey from patient phenotype to treatment using the newly defined XMEN disease, caused by the genetic loss of the MAGT1 magnesium transporter, as an example. This disease illustrates how genomics yields new fundamental immunoregulatory insights as well as how research genomics is integrated into clinical immunology. At the end, we discuss two other recently described diseases, CHAI/LATAIE (CTLA-4 deficiency) and PASLI (PI3K dysregulation), as additional examples of the journey from unknown immunological diseases to new precision medicine treatments using genomics.
Collapse
Affiliation(s)
- Michael Lenardo
- Molecular Development of the Immune System Section, Laboratory of Immunology, and Clinical Genomics Program, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, Maryland;
| | - Bernice Lo
- Molecular Development of the Immune System Section, Laboratory of Immunology, and Clinical Genomics Program, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, Maryland;
| | - Carrie L Lucas
- Molecular Development of the Immune System Section, Laboratory of Immunology, and Clinical Genomics Program, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, Maryland;
| |
Collapse
|
15
|
Smedley D, Jacobsen JOB, Jäger M, Köhler S, Holtgrewe M, Schubach M, Siragusa E, Zemojtel T, Buske OJ, Washington NL, Bone WP, Haendel MA, Robinson PN. Next-generation diagnostics and disease-gene discovery with the Exomiser. Nat Protoc 2015; 10:2004-15. [PMID: 26562621 DOI: 10.1038/nprot.2015.124] [Citation(s) in RCA: 247] [Impact Index Per Article: 24.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
Exomiser is an application that prioritizes genes and variants in next-generation sequencing (NGS) projects for novel disease-gene discovery or differential diagnostics of Mendelian disease. Exomiser comprises a suite of algorithms for prioritizing exome sequences using random-walk analysis of protein interaction networks, clinical relevance and cross-species phenotype comparisons, as well as a wide range of other computational filters for variant frequency, predicted pathogenicity and pedigree analysis. In this protocol, we provide a detailed explanation of how to install Exomiser and use it to prioritize exome sequences in a number of scenarios. Exomiser requires ∼3 GB of RAM and roughly 15-90 s of computing time on a standard desktop computer to analyze a variant call format (VCF) file. Exomiser is freely available for academic use from http://www.sanger.ac.uk/science/tools/exomiser.
Collapse
Affiliation(s)
- Damian Smedley
- Skarnes Faculty Group, Wellcome Trust Sanger Institute, Hinxton, UK
| | | | - Marten Jäger
- Institute for Medical and Human Genetics, Charité-Universitätsmedizin Berlin, Berlin, Germany.,Berlin Brandenburg Center for Regenerative Therapies (BCRT), Charité-Universitätsmedizin Berlin, Berlin, Germany
| | - Sebastian Köhler
- Institute for Medical and Human Genetics, Charité-Universitätsmedizin Berlin, Berlin, Germany
| | - Manuel Holtgrewe
- Institute for Medical and Human Genetics, Charité-Universitätsmedizin Berlin, Berlin, Germany.,Berlin Institute for Health, Berlin, Germany
| | - Max Schubach
- Institute for Medical and Human Genetics, Charité-Universitätsmedizin Berlin, Berlin, Germany
| | - Enrico Siragusa
- Institute for Medical and Human Genetics, Charité-Universitätsmedizin Berlin, Berlin, Germany.,Berlin Institute for Health, Berlin, Germany.,Max Planck Institute for Molecular Genetics, Berlin, Germany
| | - Tomasz Zemojtel
- Institute for Medical and Human Genetics, Charité-Universitätsmedizin Berlin, Berlin, Germany.,Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, Poland.,Labor Berlin - Charité Vivantes, Humangenetik, Berlin, Germany
| | - Orion J Buske
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada.,Genetics and Genome Biology, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Nicole L Washington
- Division of Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - William P Bone
- The National Institutes of Health (NIH) Undiagnosed Diseases Program, Common Fund, Office of the Director, NIH, Bethesda, Maryland, USA
| | - Melissa A Haendel
- Department of Medical Informatics and Clinical Epidemiology, Oregon Health &Science University, Portland, Oregon, USA
| | - Peter N Robinson
- Institute for Medical and Human Genetics, Charité-Universitätsmedizin Berlin, Berlin, Germany.,Berlin Brandenburg Center for Regenerative Therapies (BCRT), Charité-Universitätsmedizin Berlin, Berlin, Germany.,Max Planck Institute for Molecular Genetics, Berlin, Germany.,Department of Mathematics and Computer Science, Institute for Bioinformatics, Freie Universität Berlin, Berlin, Germany
| |
Collapse
|
16
|
Beyond Homozygosity Mapping: Family-Control analysis based on Hamming distance for prioritizing variants in exome sequencing. Sci Rep 2015; 5:12028. [PMID: 26143870 PMCID: PMC5155624 DOI: 10.1038/srep12028] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2014] [Accepted: 06/11/2015] [Indexed: 01/23/2023] Open
Abstract
A major challenge in current exome sequencing in autosomal recessive (AR) families is the lack of an effective method to prioritize single-nucleotide variants (SNVs). AR families are generally too small for linkage analysis, and length of homozygous regions is unreliable for identification of causative variants. Various common filtering steps usually result in a list of candidate variants that cannot be narrowed down further or ranked. To prioritize shortlisted SNVs we consider each homozygous candidate variant together with a set of SNVs flanking it. We compare the resulting array of genotypes between an affected family member and a number of control individuals and argue that, in a family, differences between family member and controls should be larger for a pathogenic variant and SNVs flanking it than for a random variant. We assess differences between arrays in two individuals by the Hamming distance and develop a suitable test statistic, which is expected to be large for a causative variant and flanking SNVs. We prioritize candidate variants based on this statistic and applied our approach to six patients with known pathogenic variants and found these to be in the top 2 to 10 percentiles of ranks.
Collapse
|
17
|
Federoff M, Schottlaender LV, Houlden H, Singleton A. Multiple system atrophy: the application of genetics in understanding etiology. Clin Auton Res 2015; 25:19-36. [PMID: 25687905 PMCID: PMC5217460 DOI: 10.1007/s10286-014-0267-5] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2014] [Accepted: 12/29/2014] [Indexed: 12/14/2022]
Abstract
Classically defined phenotypically by a triad of cerebellar ataxia, parkinsonism, and autonomic dysfunction in conjunction with pyramidal signs, multiple system atrophy (MSA) is a rare and progressive neurodegenerative disease affecting an estimated 3-4 per every 100,000 individuals among adults 50-99 years of age. With a pathological hallmark of alpha-synuclein-immunoreactive glial cytoplasmic inclusions (GCIs; Papp-Lantos inclusions), MSA patients exhibit marked neurodegenerative changes in the striatonigral and/or olivopontocerebellar structures of the brain. As a member of the alpha-synucleinopathy family, which is defined by its well-demarcated alpha-synuclein-immunoreactive inclusions and aggregation, MSA's clinical presentation exhibits several overlapping features with other members including Parkinson's disease (PD) and dementia with Lewy bodies (DLB). Given the extensive fund of knowledge regarding the genetic etiology of PD revealed within the past several years, a genetic investigation of MSA is warranted. While a current genome-wide association study is underway for MSA to further clarify the role of associated genetic loci and single-nucleotide polymorphisms, several cases have presented solid preliminary evidence of a genetic etiology. Naturally, genes and variants manifesting known associations with PD (and other phenotypically similar neurodegenerative disorders), including SNCA and MAPT, have been comprehensively investigated in MSA patient cohorts. More recently variants in COQ2 have been linked to MSA in the Japanese population although this finding awaits replication. Nonetheless, significant positive associations with subsequent independent replication studies have been scarce. With very limited information regarding genetic mutations or alterations in gene dosage as a cause of MSA, the search for novel risk genes, which may be in the form of common variants or rare variants, is the logical nexus for MSA research. We believe that the application of next generation genetic methods to MSA will provide valuable insight into the underlying causes of this disease, and will be central to the identification of etiologic-based therapies.
Collapse
Affiliation(s)
- Monica Federoff
- Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD, 20892, USA
| | | | | | | |
Collapse
|
18
|
Ionita-Laza I, Capanu M, De Rubeis S, McCallum K, Buxbaum JD. Identification of rare causal variants in sequence-based studies: methods and applications to VPS13B, a gene involved in Cohen syndrome and autism. PLoS Genet 2014; 10:e1004729. [PMID: 25502226 PMCID: PMC4263785 DOI: 10.1371/journal.pgen.1004729] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2014] [Accepted: 09/02/2014] [Indexed: 11/18/2022] Open
Abstract
Pinpointing the small number of causal variants among the abundant naturally occurring genetic variation is a difficult challenge, but a crucial one for understanding precise molecular mechanisms of disease and follow-up functional studies. We propose and investigate two complementary statistical approaches for identification of rare causal variants in sequencing studies: a backward elimination procedure based on groupwise association tests, and a hierarchical approach that can integrate sequencing data with diverse functional and evolutionary conservation annotations for individual variants. Using simulations, we show that incorporation of multiple bioinformatic predictors of deleteriousness, such as PolyPhen-2, SIFT and GERP++ scores, can improve the power to discover truly causal variants. As proof of principle, we apply the proposed methods to VPS13B, a gene mutated in the rare neurodevelopmental disorder called Cohen syndrome, and recently reported with recessive variants in autism. We identify a small set of promising candidates for causal variants, including two loss-of-function variants and a rare, homozygous probably-damaging variant that could contribute to autism risk.
Collapse
Affiliation(s)
- Iuliana Ionita-Laza
- Department of Biostatistics, Columbia University, New York, New York, United States of America
- * E-mail:
| | - Marinela Capanu
- Memorial Sloan-Kettering Cancer Center, New York, New York, United States of America
| | - Silvia De Rubeis
- Seaver Autism Center for Research and Treatment, Icahn School of Medicine at Mount Sinai, New York, New York, United States of America
- Departments of Psychiatry, Mount Sinai School of Medicine, New York, New York, United States of America
| | - Kenneth McCallum
- Department of Biostatistics, Columbia University, New York, New York, United States of America
| | - Joseph D. Buxbaum
- Seaver Autism Center for Research and Treatment, Icahn School of Medicine at Mount Sinai, New York, New York, United States of America
- Departments of Psychiatry, Mount Sinai School of Medicine, New York, New York, United States of America
- Departments of Genetics and Genomic Sciences, and Neuroscience, and Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, New York, United States of America
- Mindich Child Health and Development Institute, Mount Sinai School of Medicine, New York, New York, United States of America
| |
Collapse
|
19
|
Bao R, Huang L, Andrade J, Tan W, Kibbe WA, Jiang H, Feng G. Review of current methods, applications, and data management for the bioinformatics analysis of whole exome sequencing. Cancer Inform 2014; 13:67-82. [PMID: 25288881 PMCID: PMC4179624 DOI: 10.4137/cin.s13779] [Citation(s) in RCA: 79] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2014] [Revised: 07/06/2014] [Accepted: 07/07/2014] [Indexed: 12/21/2022] Open
Abstract
The advent of next-generation sequencing technologies has greatly promoted advances in the study of human diseases at the genomic, transcriptomic, and epigenetic levels. Exome sequencing, where the coding region of the genome is captured and sequenced at a deep level, has proven to be a cost-effective method to detect disease-causing variants and discover gene targets. In this review, we outline the general framework of whole exome sequence data analysis. We focus on established bioinformatics tools and applications that support five analytical steps: raw data quality assessment, pre-processing, alignment, post-processing, and variant analysis (detection, annotation, and prioritization). We evaluate the performance of open-source alignment programs and variant calling tools using simulated and benchmark datasets, and highlight the challenges posed by the lack of concordance among variant detection tools. Based on these results, we recommend adopting multiple tools and resources to reduce false positives and increase the sensitivity of variant calling. In addition, we briefly discuss the current status and solutions for big data management, analysis, and summarization in the field of bioinformatics.
Collapse
Affiliation(s)
- Riyue Bao
- Center for Research Informatics, The University of Chicago, Chicago, IL, USA
| | - Lei Huang
- Center for Research Informatics, The University of Chicago, Chicago, IL, USA
| | - Jorge Andrade
- Center for Research Informatics, The University of Chicago, Chicago, IL, USA
| | - Wei Tan
- IBM Thomas J. Watson Research Center, Yorktown Heights, New York, USA
| | - Warren A Kibbe
- Biomedical Informatics Center (NUBIC), Clinical and Translational Sciences Institute (NUCATS), Northwestern University, Chicago, IL, USA
| | - Hongmei Jiang
- Department of Statistics, Northwestern University, Evanston, IL, USA
| | - Gang Feng
- Biomedical Informatics Center (NUBIC), Clinical and Translational Sciences Institute (NUCATS), Northwestern University, Chicago, IL, USA
| |
Collapse
|
20
|
Abstract
The cost of next-generation sequencing is now approaching that of the first generation of genome-wide single-nucleotide genotyping panels, but this is still out of reach for large-scale epidemiologic studies with tens of thousands of subjects. Furthermore, the anticipated yield of millions of rare variants poses serious challenges for distinguishing causal from noncausal variants for disease. We explore the merits of using family-based designs for sequencing substudies to identify novel variants and prioritize them for their likelihood of causality. While the sharing of variants within families means that family-based designs may be less efficient for discovery than sequencing of a comparable number of unrelated individuals, the ability to exploit cosegregation of variants with disease within families helps distinguish causal from noncausal ones. We introduce a score test criterion for prioritizing discovered variants in terms of their likelihood of being functional. We compare the relative statistical efficiency of 2-stage versus1-stage family-based designs by application to the Genetic Analysis Workshop 18 simulated sequence data.
Collapse
Affiliation(s)
- Zhao Yang
- Department of Preventive Medicine, University of Southern California, Los Angeles, CA 90089-9234, USA
| | - Duncan C Thomas
- Department of Preventive Medicine, University of Southern California, Los Angeles, CA 90089-9234, USA
| |
Collapse
|
21
|
Shyr C, Kushniruk A, Wasserman WW. Usability study of clinical exome analysis software: top lessons learned and recommendations. J Biomed Inform 2014; 51:129-36. [PMID: 24860971 DOI: 10.1016/j.jbi.2014.05.004] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2013] [Revised: 04/30/2014] [Accepted: 05/06/2014] [Indexed: 10/25/2022]
Abstract
OBJECTIVES New DNA sequencing technologies have revolutionized the search for genetic disruptions. Targeted sequencing of all protein coding regions of the genome, called exome analysis, is actively used in research-oriented genetics clinics, with the transition to exomes as a standard procedure underway. This transition is challenging; identification of potentially causal mutation(s) amongst ∼10(6) variants requires specialized computation in combination with expert assessment. This study analyzes the usability of user interfaces for clinical exome analysis software. There are two study objectives: (1) To ascertain the key features of successful user interfaces for clinical exome analysis software based on the perspective of expert clinical geneticists, (2) To assess user-system interactions in order to reveal strengths and weaknesses of existing software, inform future design, and accelerate the clinical uptake of exome analysis. METHODS Surveys, interviews, and cognitive task analysis were performed for the assessment of two next-generation exome sequence analysis software packages. The subjects included ten clinical geneticists who interacted with the software packages using the "think aloud" method. Subjects' interactions with the software were recorded in their clinical office within an urban research and teaching hospital. All major user interface events (from the user interactions with the packages) were time-stamped and annotated with coding categories to identify usability issues in order to characterize desired features and deficiencies in the user experience. RESULTS We detected 193 usability issues, the majority of which concern interface layout and navigation, and the resolution of reports. Our study highlights gaps in specific software features typical within exome analysis. The clinicians perform best when the flow of the system is structured into well-defined yet customizable layers for incorporation within the clinical workflow. The results highlight opportunities to dramatically accelerate clinician analysis and interpretation of patient genomic data. CONCLUSION We present the first application of usability methods to evaluate software interfaces in the context of exome analysis. Our results highlight how the study of user responses can lead to identification of usability issues and challenges and reveal software reengineering opportunities for improving clinical next-generation sequencing analysis. While the evaluation focused on two distinctive software tools, the results are general and should inform active and future software development for genome analysis software. As large-scale genome analysis becomes increasingly common in healthcare, it is critical that efficient and effective software interfaces are provided to accelerate clinical adoption of the technology. Implications for improved design of such applications are discussed.
Collapse
Affiliation(s)
- Casper Shyr
- Centre for Molecular Medicine and Therapeutics, Child & Family Research Institute, 950 28th Ave W, Vancouver, BC V5Z 4H4, Canada; Bioinformatics Graduate Program, University of British Columbia, 2329 West Mall, Vancouver, BC V6T 1Z4, Canada
| | - Andre Kushniruk
- School of Health Information Science, University of Victoria, 3800 Finnerty Rd., Victoria, BC V8P 5C2, Canada
| | - Wyeth W Wasserman
- Centre for Molecular Medicine and Therapeutics, Child & Family Research Institute, 950 28th Ave W, Vancouver, BC V5Z 4H4, Canada; Department of Medical Genetics, University of British Columbia, 2329 West Mall, Vancouver, BC V6T 1Z4, Canada.
| |
Collapse
|
22
|
Koboldt D, Larson D, Sullivan L, Bowne S, Steinberg K, Churchill J, Buhr A, Nutter N, Pierce E, Blanton S, Weinstock G, Wilson R, Daiger S. Exome-based mapping and variant prioritization for inherited Mendelian disorders. Am J Hum Genet 2014; 94:373-84. [PMID: 24560519 DOI: 10.1016/j.ajhg.2014.01.016] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2013] [Accepted: 01/30/2014] [Indexed: 02/08/2023] Open
Abstract
Exome sequencing in families affected by rare genetic disorders has the potential to rapidly identify new disease genes (genes in which mutations cause disease), but the identification of a single causal mutation among thousands of variants remains a significant challenge. We developed a scoring algorithm to prioritize potential causal variants within a family according to segregation with the phenotype, population frequency, predicted effect, and gene expression in the tissue(s) of interest. To narrow the search space in families with multiple affected individuals, we also developed two complementary approaches to exome-based mapping of autosomal-dominant disorders. One approach identifies segments of maximum identity by descent among affected individuals; the other nominates regions on the basis of shared rare variants and the absence of homozygous differences between affected individuals. We showcase our methods by using exome sequence data from families affected by autosomal-dominant retinitis pigmentosa (adRP), a rare disorder characterized by night blindness and progressive vision loss. We performed exome capture and sequencing on 91 samples representing 24 families affected by probable adRP but lacking common disease-causing mutations. Eight of 24 families (33%) were revealed to harbor high-scoring, most likely pathogenic (by clinical assessment) mutations affecting known RP genes. Analysis of the remaining 17 families identified candidate variants in a number of interesting genes, some of which have withstood further segregation testing in extended pedigrees. To empower the search for Mendelian-disease genes in family-based sequencing studies, we implemented them in a cross-platform-compatible software package, MendelScan, which is freely available to the research community.
Collapse
|
23
|
González-Del Pozo M, Méndez-Vidal C, Santoyo-Lopez J, Vela-Boza A, Bravo-Gil N, Rueda A, García-Alonso L, Vázquez-Marouschek C, Dopazo J, Borrego S, Antiñolo G. Deciphering intrafamilial phenotypic variability by exome sequencing in a Bardet-Biedl family. Mol Genet Genomic Med 2013; 2:124-33. [PMID: 24689075 PMCID: PMC3960054 DOI: 10.1002/mgg3.50] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2013] [Revised: 10/09/2013] [Accepted: 10/16/2013] [Indexed: 12/15/2022] Open
Abstract
Bardet-Biedl syndrome (BBS) is a model ciliopathy characterized by a wide range of clinical variability. The heterogeneity of this condition is reflected in the number of underlying gene defects and the epistatic interactions between the proteins encoded. BBS is generally inherited in an autosomal recessive trait. However, in some families, mutations across different loci interact to modulate the expressivity of the phenotype. In order to investigate the magnitude of epistasis in one BBS family with remarkable intrafamilial phenotypic variability, we designed an exome sequencing-based approach using SOLID 5500xl platform. This strategy allowed the reliable detection of the primary causal mutations in our family consisting of two novel compound heterozygous mutations in McKusick-Kaufman syndrome (MKKS) gene (p.D90G and p.V396F). Additionally, exome sequencing enabled the detection of one novel heterozygous NPHP4 variant which is predicted to activate a cryptic acceptor splice site and is only present in the most severely affected patient. Here, we provide an exome sequencing analysis of a BBS family and show the potential utility of this tool, in combination with network analysis, to detect disease-causing mutations and second-site modifiers. Our data demonstrate how next-generation sequencing (NGS) can facilitate the dissection of epistatic phenomena, and shed light on the genetic basis of phenotypic variability.
Collapse
Affiliation(s)
- María González-Del Pozo
- Department of Genetics, Reproduction and Fetal Medicine, Institute of Biomedicine of Seville, University Hospital Virgen del Rocio/Consejo Superior de Investigaciones Científicas/University of Seville Seville, Spain ; Centre for Biomedical Network Research on Rare Diseases (CIBERER) Seville, Spain
| | - Cristina Méndez-Vidal
- Department of Genetics, Reproduction and Fetal Medicine, Institute of Biomedicine of Seville, University Hospital Virgen del Rocio/Consejo Superior de Investigaciones Científicas/University of Seville Seville, Spain ; Centre for Biomedical Network Research on Rare Diseases (CIBERER) Seville, Spain
| | - Javier Santoyo-Lopez
- Medical Genome Project, Genomics and Bioinformatics Platform of Andalusia (GBPA) Seville, Spain
| | - Alicia Vela-Boza
- Medical Genome Project, Genomics and Bioinformatics Platform of Andalusia (GBPA) Seville, Spain
| | - Nereida Bravo-Gil
- Department of Genetics, Reproduction and Fetal Medicine, Institute of Biomedicine of Seville, University Hospital Virgen del Rocio/Consejo Superior de Investigaciones Científicas/University of Seville Seville, Spain
| | - Antonio Rueda
- Medical Genome Project, Genomics and Bioinformatics Platform of Andalusia (GBPA) Seville, Spain
| | - Luz García-Alonso
- Department of Computational Genomics, Centro de Investigación Príncipe Felipe (CIPF) Valencia, Spain
| | | | - Joaquín Dopazo
- Medical Genome Project, Genomics and Bioinformatics Platform of Andalusia (GBPA) Seville, Spain ; Department of Computational Genomics, Centro de Investigación Príncipe Felipe (CIPF) Valencia, Spain ; Functional Genomics Node (INB), CIPF Valencia, Spain
| | - Salud Borrego
- Department of Genetics, Reproduction and Fetal Medicine, Institute of Biomedicine of Seville, University Hospital Virgen del Rocio/Consejo Superior de Investigaciones Científicas/University of Seville Seville, Spain ; Centre for Biomedical Network Research on Rare Diseases (CIBERER) Seville, Spain
| | - Guillermo Antiñolo
- Department of Genetics, Reproduction and Fetal Medicine, Institute of Biomedicine of Seville, University Hospital Virgen del Rocio/Consejo Superior de Investigaciones Científicas/University of Seville Seville, Spain ; Centre for Biomedical Network Research on Rare Diseases (CIBERER) Seville, Spain ; Medical Genome Project, Genomics and Bioinformatics Platform of Andalusia (GBPA) Seville, Spain
| |
Collapse
|
24
|
Robinson PN, Köhler S, Oellrich A, Wang K, Mungall CJ, Lewis SE, Washington N, Bauer S, Seelow D, Krawitz P, Gilissen C, Haendel M, Smedley D. Improved exome prioritization of disease genes through cross-species phenotype comparison. Genome Res 2013; 24:340-8. [PMID: 24162188 PMCID: PMC3912424 DOI: 10.1101/gr.160325.113] [Citation(s) in RCA: 248] [Impact Index Per Article: 20.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
Numerous new disease-gene associations have been identified by whole-exome sequencing studies in the last few years. However, many cases remain unsolved due to the sheer number of candidate variants remaining after common filtering strategies such as removing low quality and common variants and those deemed unlikely to be pathogenic. The observation that each of our genomes contains about 100 genuine loss-of-function variants makes identification of the causative mutation problematic when using these strategies alone. We propose using the wealth of genotype to phenotype data that already exists from model organism studies to assess the potential impact of these exome variants. Here, we introduce PHenotypic Interpretation of Variants in Exomes (PHIVE), an algorithm that integrates the calculation of phenotype similarity between human diseases and genetically modified mouse models with evaluation of the variants according to allele frequency, pathogenicity, and mode of inheritance approaches in our Exomiser tool. Large-scale validation of PHIVE analysis using 100,000 exomes containing known mutations demonstrated a substantial improvement (up to 54.1-fold) over purely variant-based (frequency and pathogenicity) methods with the correct gene recalled as the top hit in up to 83% of samples, corresponding to an area under the ROC curve of >95%. We conclude that incorporation of phenotype data can play a vital role in translational bioinformatics and propose that exome sequencing projects should systematically capture clinical phenotypes to take advantage of the strategy presented here.
Collapse
Affiliation(s)
- Peter N Robinson
- Institute for Medical and Human Genetics, Charité-Universitätsmedizin Berlin, Augustenburger Platz 1, 13353 Berlin, Germany
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
25
|
eXtasy: variant prioritization by genomic data fusion. Nat Methods 2013; 10:1083-4. [PMID: 24076761 DOI: 10.1038/nmeth.2656] [Citation(s) in RCA: 137] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2013] [Accepted: 08/26/2013] [Indexed: 01/01/2023]
Abstract
Massively parallel sequencing greatly facilitates the discovery of novel disease genes causing Mendelian and oligogenic disorders. However, many mutations are present in any individual genome, and identifying which ones are disease causing remains a largely open problem. We introduce eXtasy, an approach to prioritize nonsynonymous single-nucleotide variants (nSNVs) that substantially improves prediction of disease-causing variants in exome sequencing data by integrating variant impact prediction, haploinsufficiency prediction and phenotype-specific gene prioritization.
Collapse
|
26
|
The application of next-generation sequencing in the autozygosity mapping of human recessive diseases. Hum Genet 2013; 132:1197-211. [PMID: 23907654 DOI: 10.1007/s00439-013-1344-x] [Citation(s) in RCA: 98] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2013] [Accepted: 07/20/2013] [Indexed: 02/08/2023]
Abstract
Autozygosity, or the inheritance of two copies of an ancestral allele, has the potential to not only reveal phenotypes caused by biallelic mutations in autosomal recessive genes, but to also facilitate the mapping of such mutations by flagging the surrounding haplotypes as tractable runs of homozygosity (ROH), a process known as autozygosity mapping. Since SNPs replaced microsatellites as markers for the purpose of genomewide identification of ROH, autozygosity mapping of Mendelian genes has witnessed a significant acceleration. Historically, successful mapping traditionally required favorable family structure that permits the identification of an autozygous interval that is amenable to candidate gene selection and confirmation by Sanger sequencing. This requirement presented a major bottleneck that hindered the utilization of simplex cases and many multiplex families with autosomal recessive phenotypes. However, the advent of next-generation sequencing that enables massively parallel sequencing of DNA has largely bypassed this bottleneck and thus ushered in an era of unprecedented pace of Mendelian disease gene discovery. The ability to identify a single causal mutation among a massive number of variants that are uncovered by next-generation sequencing can be challenging, but applying autozygosity as a filter can greatly enhance the enrichment process and its throughput. This review will discuss the power of combining the best of both techniques in the mapping of recessive disease genes and offer some tips to troubleshoot potential limitations.
Collapse
|
27
|
Family-based association tests for sequence data, and comparisons with population-based association tests. Eur J Hum Genet 2013; 21:1158-62. [PMID: 23386037 DOI: 10.1038/ejhg.2012.308] [Citation(s) in RCA: 68] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2012] [Revised: 10/22/2012] [Accepted: 11/21/2012] [Indexed: 11/08/2022] Open
Abstract
Recent advances in high-throughput sequencing technologies make it increasingly more efficient to sequence large cohorts for many complex traits. We discuss here a class of sequence-based association tests for family-based designs that corresponds naturally to previously proposed population-based tests, including the classical Burden and variance-component tests. This framework allows for a direct comparison between the powers of sequence-based association tests with family- vs population-based designs. We show that for dichotomous traits using family-based controls results in similar power levels as the population-based design (although at an increased sequencing cost for the family-based design), while for continuous traits (in random samples, no ascertainment) the population-based design can be substantially more powerful. A possible disadvantage of population-based designs is that they can lead to increased false-positive rates in the presence of population stratification, while the family-based designs are robust to population stratification. We show also an application to a small exome-sequencing family-based study on autism spectrum disorders. The tests are implemented in publicly available software.
Collapse
|
28
|
Kang J, Huang KC, Xu Z, Wang Y, Abecasis GR, Li Y. AbCD: arbitrary coverage design for sequencing-based genetic studies. Bioinformatics 2013; 29:799-801. [PMID: 23357921 DOI: 10.1093/bioinformatics/btt041] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
Abstract
Recent advances in sequencing technologies have revolutionized genetic studies. Although high-coverage sequencing can uncover most variants present in the sequenced sample, low-coverage sequencing is appealing for its cost effectiveness. Here, we present AbCD (arbitrary coverage design) to aid the design of sequencing-based studies. AbCD is a user-friendly interface providing pre-estimated effective sample sizes, specific to each minor allele frequency category, for designs with arbitrary coverage (0.5-30×) and sample size (20-10 000), and for four major ethnic groups (Europeans, Africans, Asians and African Americans). In addition, we also present two software tools: ShotGun and DesignPlanner, which were used to generate the estimates behind AbCD. ShotGun is a flexible short-read simulator for arbitrary user-specified read length and average depth, allowing cycle-specific sequencing error rates and realistic read depth distributions. DesignPlanner is a full pipeline that uses ShotGun to generate sequence data and performs initial SNP discovery, uses our previously presented linkage disequilibrium-aware method to call genotypes, and, finally, provides minor allele frequency-specific effective sample sizes. ShotGun plus DesignPlanner can accommodate effective sample size estimate for any combination of high-depth and low-depth data (for example, whole-genome low-depth plus exonic high-depth) or combination of sequence and genotype data [for example, whole-exome sequencing plus genotyping from existing Genomewide Association Study (GWAS)].
Collapse
Affiliation(s)
- Jian Kang
- Faculty of Kinesiology, University of Calgary, Calgary, AB T2N1N4, Canada
| | | | | | | | | | | |
Collapse
|
29
|
Torkamani A, Pham P, Libiger O, Bansal V, Zhang G, Scott-Van Zeeland AA, Tewhey R, Topol EJ, Schork NJ. Clinical implications of human population differences in genome-wide rates of functional genotypes. Front Genet 2012; 3:211. [PMID: 23125845 PMCID: PMC3485509 DOI: 10.3389/fgene.2012.00211] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2012] [Accepted: 09/26/2012] [Indexed: 12/21/2022] Open
Abstract
There have been a number of recent successes in the use of whole genome sequencing and sophisticated bioinformatics techniques to identify pathogenic DNA sequence variants responsible for individual idiopathic congenital conditions. However, the success of this identification process is heavily influenced by the ancestry or genetic background of a patient with an idiopathic condition. This is so because potential pathogenic variants in a patient’s genome must be contrasted with variants in a reference set of genomes made up of other individuals’ genomes of the same ancestry as the patient. We explored the effect of ignoring the ancestries of both an individual patient and the individuals used to construct reference genomes. We pursued this exploration in two major steps. We first considered variation in the per-genome number and rates of likely functional derived (i.e., non-ancestral, based on the chimp genome) single nucleotide variants and small indels in 52 individual whole human genomes sampled from 10 different global populations. We took advantage of a suite of computational and bioinformatics techniques to predict the functional effect of over 24 million genomic variants, both coding and non-coding, across these genomes. We found that the typical human genome harbors ∼5.5–6.1 million total derived variants, of which ∼12,000 are likely to have a functional effect (∼5000 coding and ∼7000 non-coding). We also found that the rates of functional genotypes per the total number of genotypes in individual whole genomes differ dramatically between human populations. We then created tables showing how the use of comparator or reference genome panels comprised of genomes from individuals that do not have the same ancestral background as a patient can negatively impact pathogenic variant identification. Our results have important implications for clinical sequencing initiatives.
Collapse
Affiliation(s)
- Ali Torkamani
- The Scripps Translational Science La Jolla, CA, USA ; Scripps Health La Jolla, CA, USA ; Department of Molecular and Experimental Medicine, The Scripps Research Institute La Jolla, CA, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
30
|
Fier H, Won S, Prokopenko D, AlChawa T, Ludwig KU, Fimmers R, Silverman EK, Pagano M, Mangold E, Lange C. 'Location, Location, Location': a spatial approach for rare variant analysis and an application to a study on non-syndromic cleft lip with or without cleft palate. Bioinformatics 2012; 28:3027-33. [PMID: 23044548 PMCID: PMC3516147 DOI: 10.1093/bioinformatics/bts568] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Motivation: For the analysis of rare variants in sequence data, numerous approaches have been suggested. Fixed and flexible threshold approaches collapse the rare variant information of a genomic region into a test statistic with reduced dimensionality. Alternatively, the rare variant information can be combined in statistical frameworks that are based on suitable regression models, machine learning, etc. Although the existing approaches provide powerful tests that can incorporate information on allele frequencies and prior biological knowledge, differences in the spatial clustering of rare variants between cases and controls cannot be incorporated. Based on the assumption that deleterious variants and protective variants cluster or occur in different parts of the genomic region of interest, we propose a testing strategy for rare variants that builds on spatial cluster methodology and that guides the identification of the biological relevant segments of the region. Our approach does not require any assumption about the directions of the genetic effects. Results: In simulation studies, we assess the power of the clustering approach and compare it with existing methodology. Our simulation results suggest that the clustering approach for rare variants is well powered, even in situations that are ideal for standard methods. The efficiency of our spatial clustering approach is not affected by the presence of rare variants that have opposite effect size directions. An application to a sequencing study for non-syndromic cleft lip with or without cleft palate (NSCL/P) demonstrates its practical relevance. The proposed testing strategy is applied to a genomic region on chromosome 15q13.3 that was implicated in NSCL/P etiology in a previous genome-wide association study, and its results are compared with standard approaches. Availability: Source code and documentation for the implementation in R will be provided online. Currently, the R-implementation only supports genotype data. We currently are working on an extension for VCF files. Contact:heide.fier@googlemail.com
Collapse
Affiliation(s)
- Heide Fier
- Department of Genomic Mathematics, University of Bonn, 53127, Germany.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
31
|
Sifrim A, Van Houdt JKJ, Tranchevent LC, Nowakowska B, Sakai R, Pavlopoulos GA, Devriendt K, Vermeesch JR, Moreau Y, Aerts J. Annotate-it: a Swiss-knife approach to annotation, analysis and interpretation of single nucleotide variation in human disease. Genome Med 2012; 4:73. [PMID: 23013645 PMCID: PMC3580443 DOI: 10.1186/gm374] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2012] [Revised: 09/14/2012] [Accepted: 09/26/2012] [Indexed: 12/18/2022] Open
Abstract
The increasing size and complexity of exome/genome sequencing data requires new tools for clinical geneticists to discover disease-causing variants. Bottlenecks in identifying the causative variation include poor cross-sample querying, constantly changing functional annotation and not considering existing knowledge concerning the phenotype. We describe a methodology that facilitates exploration of patient sequencing data towards identification of causal variants under different genetic hypotheses. Annotate-it facilitates handling, analysis and interpretation of high-throughput single nucleotide variant data. We demonstrate our strategy using three case studies. Annotate-it is freely available and test data are accessible to all users at http://www.annotate-it.org.
Collapse
Affiliation(s)
- Alejandro Sifrim
- KU Leuven, Department of Electrical Engineering-ESAT, SCD-SISTA, Kasteelpark Arenberg 10, B-3001, Leuven, Belgium
- IBBT Future Health Department, Kasteelpark Arenberg 10, B-3001, Leuven, Belgium
| | - Jeroen KJ Van Houdt
- KU Leuven, Centre for Human Genetics, University Hospital Gasthuisberg, Herestraat 49, 3000 Leuven, Belgium
| | - Leon-Charles Tranchevent
- KU Leuven, Department of Electrical Engineering-ESAT, SCD-SISTA, Kasteelpark Arenberg 10, B-3001, Leuven, Belgium
- IBBT Future Health Department, Kasteelpark Arenberg 10, B-3001, Leuven, Belgium
| | - Beata Nowakowska
- KU Leuven, Centre for Human Genetics, University Hospital Gasthuisberg, Herestraat 49, 3000 Leuven, Belgium
| | - Ryo Sakai
- KU Leuven, Department of Electrical Engineering-ESAT, SCD-SISTA, Kasteelpark Arenberg 10, B-3001, Leuven, Belgium
- IBBT Future Health Department, Kasteelpark Arenberg 10, B-3001, Leuven, Belgium
| | - Georgios A Pavlopoulos
- KU Leuven, Department of Electrical Engineering-ESAT, SCD-SISTA, Kasteelpark Arenberg 10, B-3001, Leuven, Belgium
- IBBT Future Health Department, Kasteelpark Arenberg 10, B-3001, Leuven, Belgium
| | - Koen Devriendt
- KU Leuven, Centre for Human Genetics, University Hospital Gasthuisberg, Herestraat 49, 3000 Leuven, Belgium
| | - Joris R Vermeesch
- KU Leuven, Centre for Human Genetics, University Hospital Gasthuisberg, Herestraat 49, 3000 Leuven, Belgium
| | - Yves Moreau
- KU Leuven, Department of Electrical Engineering-ESAT, SCD-SISTA, Kasteelpark Arenberg 10, B-3001, Leuven, Belgium
- IBBT Future Health Department, Kasteelpark Arenberg 10, B-3001, Leuven, Belgium
| | - Jan Aerts
- KU Leuven, Department of Electrical Engineering-ESAT, SCD-SISTA, Kasteelpark Arenberg 10, B-3001, Leuven, Belgium
- IBBT Future Health Department, Kasteelpark Arenberg 10, B-3001, Leuven, Belgium
| |
Collapse
|
32
|
Swaminathan GJ, Bragin E, Chatzimichali EA, Corpas M, Bevan AP, Wright CF, Carter NP, Hurles ME, Firth HV. DECIPHER: web-based, community resource for clinical interpretation of rare variants in developmental disorders. Hum Mol Genet 2012; 21:R37-44. [PMID: 22962312 DOI: 10.1093/hmg/dds362] [Citation(s) in RCA: 66] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
Patients with developmental disorders often harbour sub-microscopic deletions or duplications that lead to a disruption of normal gene expression or perturbation in the copy number of dosage-sensitive genes. Clinical interpretation for such patients in isolation is hindered by the rarity and novelty of such disorders. The DECIPHER project (https://decipher.sanger.ac.uk) was established in 2004 as an accessible online repository of genomic and associated phenotypic data with the primary goal of aiding the clinical interpretation of rare copy-number variants (CNVs). DECIPHER integrates information from a variety of bioinformatics resources and uses visualization tools to identify potential disease genes within a CNV. A two-tier access system permits clinicians and clinical scientists to maintain confidential linked anonymous records of phenotypes and CNVs for their patients that, with informed consent, can subsequently be shared with the wider clinical genetics and research communities. Advances in next-generation sequencing technologies are making it practical and affordable to sequence the whole exome/genome of patients who display features suggestive of a genetic disorder. This approach enables the identification of smaller intragenic mutations including single-nucleotide variants that are not accessible even with high-resolution genomic array analysis. This article briefly summarizes the current status and achievements of the DECIPHER project and looks ahead to the opportunities and challenges of jointly analysing structural and sequence variation in the human genome.
Collapse
Affiliation(s)
- Ganesh J Swaminathan
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridgeshire CB10 1SA, UK
| | | | | | | | | | | | | | | | | |
Collapse
|
33
|
Yang IV, Schwartz DA. The next generation of complex lung genetic studies. Am J Respir Crit Care Med 2012; 186:1087-94. [PMID: 22936355 DOI: 10.1164/rccm.201207-1178pp] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
Common genetic risk variants identified by genome-wide association studies have explained a small portion of disease heritability in complex diseases. It is becoming apparent that each gene/locus is heterogeneous and that multiple rare independent risk alleles across the population contribute to disease risk. Next-generation sequencing technologies have reached the maturity and low cost necessary to perform whole genome, whole exome, and targeted region sequencing to identify all rare risk alleles across a population, a task that is not possible to achieve by genotyping. Design of whole genome, whole exome, and targeted sequencing projects to identify disease variants for complex lung diseases requires four main steps: library preparation, sequencing, sequence data analysis, and statistical analysis. Although data analysis approaches are still evolving, a number of published studies have successfully identified rare variants associated with complex disease. Despite many challenges that lie ahead in applying these technologies to lung disease, rare variants are likely to be a critical piece of the puzzle that needs to be solved to understand the genetic basis of complex lung disease and to use this information to develop better therapies.
Collapse
Affiliation(s)
- Ivana V Yang
- Department of Medicine, University of Colorado Denver, 12700 East 19th Avenue, 8611, Aurora, CO 80045, USA.
| | | |
Collapse
|
34
|
Abstract
PURPOSE OF REVIEW This review examines the application of next-generation sequencing (NGS) technologies in the identification of the causation of nonsyndromic genetic cardiomyopathies. RECENT FINDINGS NGS sequencing of the entire genetic coding sequence (the exome) has successfully identified five novel genes and causative variants for cardiomyopathies without previously known cause within the last 12 months. Continual rapidly decreasing costs of NGS will shortly allow cost-effective sequencing of the entire genomes of affected individuals and their relatives to include noncoding and regulatory variant discovery and epigenetic profiling. Despite this rapid technological progress with sequencing, analysis of these large data sets remains challenging, particularly for assigning causality to novel rare variants identified in DNA samples from patients with cardiomyopathy. SUMMARY NGS technologies are rapidly moving to identify novel rare variants in patients with cardiomyopathy, but assigning pathogenicity to these novel variants remains challenging.
Collapse
|
35
|
Epstein M, Duncan R, Jiang Y, Conneely K, Allen A, Satten G. A permutation procedure to correct for confounders in case-control studies, including tests of rare variation. Am J Hum Genet 2012; 91:215-23. [PMID: 22818855 DOI: 10.1016/j.ajhg.2012.06.004] [Citation(s) in RCA: 48] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2012] [Revised: 05/03/2012] [Accepted: 06/05/2012] [Indexed: 01/30/2023] Open
Abstract
Many case-control tests of rare variation are implemented in statistical frameworks that make correction for confounders like population stratification difficult. Simple permutation of disease status is unacceptable for resolving this issue because the replicate data sets do not have the same confounding as the original data set. These limitations make it difficult to apply rare-variant tests to samples in which confounding most likely exists, e.g., samples collected from admixed populations. To enable the use of such rare-variant methods in structured samples, as well as to facilitate permutation tests for any situation in which case-control tests require adjustment for confounding covariates, we propose to establish the significance of a rare-variant test via a modified permutation procedure. Our procedure uses Fisher's noncentral hypergeometric distribution to generate permuted data sets with the same structure present in the actual data set such that inference is valid in the presence of confounding factors. We use simulated sequence data based on coalescent models to show that our permutation strategy corrects for confounding due to population stratification that, if ignored, would otherwise inflate the size of a rare-variant test. We further illustrate the approach by using sequence data from the Dallas Heart Study of energy metabolism traits. Researchers can implement our permutation approach by using the R package BiasedUrn.
Collapse
|
36
|
Lyon GJ, Wang K. Identifying disease mutations in genomic medicine settings: current challenges and how to accelerate progress. Genome Med 2012; 4:58. [PMID: 22830651 PMCID: PMC3580414 DOI: 10.1186/gm359] [Citation(s) in RCA: 57] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
The pace of exome and genome sequencing is accelerating, with the identification of many new disease-causing mutations in research settings, and it is likely that whole exome or genome sequencing could have a major impact in the clinical arena in the relatively near future. However, the human genomics community is currently facing several challenges, including phenotyping, sample collection, sequencing strategies, bioinformatics analysis, biological validation of variant function, clinical interpretation and validity of variant data, and delivery of genomic information to various constituents. Here we review these challenges and summarize the bottlenecks for the clinical application of exome and genome sequencing, and we discuss ways for moving the field forward. In particular, we urge the need for clinical-grade sample collection, high-quality sequencing data acquisition, digitalized phenotyping, rigorous generation of variant calls, and comprehensive functional annotation of variants. Additionally, we suggest that a 'networking of science' model that encourages much more collaboration and online sharing of medical history, genomic data and biological knowledge, including among research participants and consumers/patients, will help establish causation and penetrance for disease causal variants and genes. As we enter this new era of genomic medicine, we envision that consumer-driven and consumer-oriented efforts will take center stage, thus allowing insights from the human genome project to translate directly back into individualized medicine.
Collapse
Affiliation(s)
- Gholson J Lyon
- Cold Spring Harbor Laboratory, New York, NY 11797, USA
- Institute for Genomic Medicine, Utah Foundation for Biomedical Research (UFBR), Salt Lake City, UT 84106, USA
| | - Kai Wang
- Institute for Genomic Medicine, Utah Foundation for Biomedical Research (UFBR), Salt Lake City, UT 84106, USA
- Zilkha Neurogenetic Institute, Department of Psychiatry and Preventive Medicine, University of Southern California, Los Angeles, CA 90089, USA
| |
Collapse
|
37
|
Curtis D. A rapid method for combined analysis of common and rare variants at the level of a region, gene, or pathway. Adv Appl Bioinform Chem 2012; 5:1-9. [PMID: 22888262 PMCID: PMC3413013 DOI: 10.2147/aabc.s33049] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
Abstract
Previously described methods for the combined analysis of common and rare variants have disadvantages such as requiring an arbitrary classification of variants or permutation testing to assess statistical significance. Here we propose a novel method which implements a weighting scheme based on allele frequencies observed in both cases and controls. Because the test is unbiased, scores can be analyzed with a standard t-test. To test its validity we applied it to data for common, rare, and very rare variants simulated under the null hypothesis. To test its power we applied it to simulated data in which association was present, including data using the observed allele frequencies of common and rare variants in NOD2 previously reported in cases of Crohn’s disease and controls. The method produced results that conformed well to those expected under the null hypothesis. It demonstrated more power to detect association when rare and common variants were analyzed jointly, the power further increasing when rare variants were assigned higher weights. 20,000 analyses of a gene containing 62 variants could be performed in 80 minutes on a laptop. This approach shows promise for the analysis of data currently emerging from genome wide sequencing studies.
Collapse
Affiliation(s)
- David Curtis
- Centre for Psychiatry, Barts and the London School of Medicine and Dentistry, London, UK
| |
Collapse
|
38
|
Coonrod EM, Durtschi JD, Margraf RL, Voelkerding KV. Developing genome and exome sequencing for candidate gene identification in inherited disorders: an integrated technical and bioinformatics approach. Arch Pathol Lab Med 2012; 137:415-33. [PMID: 22770468 DOI: 10.5858/arpa.2012-0107-ra] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
CONTEXT Advances in sequencing technology with the commercialization of next-generation sequencing (NGS) has substantially increased the feasibility of sequencing human genomes and exomes. Next-generation sequencing has been successfully applied to the discovery of disease-causing genes in rare, inherited disorders. By necessity, the advent of NGS has fostered the concurrent development of bioinformatics approaches to expeditiously analyze the large data sets generated. Next-generation sequencing has been used for important discoveries in the research setting and is now being implemented into the clinical diagnostic arena. OBJECTIVE To review the current literature on technical and bioinformatics approaches for exome and genome sequencing and highlight examples of successful disease gene discovery in inherited disorders. To discuss the challenges for implementing NGS in the clinical research and diagnostic arenas. DATA SOURCES Literature review and authors' experience. CONCLUSIONS Next-generation sequencing approaches are powerful and require an investment in infrastructure and personnel expertise for effective use; however, the potential for improvement of patient care through faster and more accurate molecular diagnoses is high.
Collapse
Affiliation(s)
- Emily M Coonrod
- Research and Development, ARUP Institute for Clinical and Experimental Pathology, Salt Lake City, UT 84108, USA
| | | | | | | |
Collapse
|
39
|
Statistical Challenges in Sequence-Based Association Studies with Population- and Family-Based Designs. STATISTICS IN BIOSCIENCES 2012. [DOI: 10.1007/s12561-012-9062-9] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|