101
|
Abstract
Herpes simplex viruses (HSV) cause chronic infection in humans that are characterized by periodic episodes of mucosal shedding and ulcerative disease. HSV causes millions of infections world-wide, with lifelong bouts of viral reactivation from latency in neuronal ganglia. Infected individuals experience different levels of disease severity and frequency of reactivation. There are two distantly related HSV species, with HSV-1 infections historically found most often in the oral niche and HSV-2 infections in the genital niche. Over the last two decades, HSV-1 has emerged as the leading cause of first-episode genital herpes in multiple countries. While HSV-1 has the highest level of genetic diversity among human alpha-herpesviruses, it is not yet known how quickly the HSV-1 viral population in a human host adapts over time, or if there are population bottlenecks associated with viral reactivation and/or transmission. It is also unknown how the ecological environments in which HSV infections occur influence their evolutionary trajectory, or that of co-occurring viruses and microbes. In this review, we explore how HSV accrues genetic diversity within each new infection, and yet maintains its ability to successfully infect most of the human population. A holistic examination of the ecological context of natural human infections can expand our awareness of how HSV adapts as it moves within and between human hosts, and reveal the complexity of these lifelong human-virus interactions. These insights may in turn suggest new areas of exploration for other chronic pathogens that successfully evolve and persist among their hosts.
Collapse
|
102
|
Choi L, An JY. Genetic architecture of autism spectrum disorder: Lessons from large-scale genomic studies. Neurosci Biobehav Rev 2021; 128:244-257. [PMID: 34166716 DOI: 10.1016/j.neubiorev.2021.06.028] [Citation(s) in RCA: 30] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2020] [Revised: 06/17/2021] [Accepted: 06/17/2021] [Indexed: 12/20/2022]
Abstract
Autism spectrum disorder (ASD) is a neurodevelopmental disorder with a strong genetic component. Recently developed genomic technologies, including microarray and next-generation sequencing (NGS), have enabled researchers to genetic analyses aimed at identifying genetic variations associated with ASD and to elucidate the genetic architecture of the disorder. Large-scale microarray, exome sequencing analyses, and robust statistical methods have resulted in successful gene discovery and identification of high-confidence ASD genes from among de novo and inherited variants. Efforts have been made to understand the genetic architecture of ASD using whole-genome sequencing and genome-wide association studies aimed at identifying noncoding mutations and common variants associated with ASD. In addition, the development of systems biology approaches has resulted in the integration of genetic findings with functional genomic datasets, thereby providing a unique insight into the functional convergence of ASD risk genes and their neurobiology. In this review, we summarize the latest findings of ASD genetic studies involving large cohorts and discuss their implications in ASD neurobiology and in clinical practice.
Collapse
Affiliation(s)
- Leejee Choi
- School of Biosystem and Biomedical Science, College of Health Science, Korea University, Seoul, 02841, Republic of Korea; Department of Integrated Biomedical and Life Science, Korea University, Seoul, 02841, Republic of Korea
| | - Joon-Yong An
- School of Biosystem and Biomedical Science, College of Health Science, Korea University, Seoul, 02841, Republic of Korea; Department of Integrated Biomedical and Life Science, Korea University, Seoul, 02841, Republic of Korea; Transdisciplinary Major in Learning Health Systems, Department of Healthcare Sciences, Graduate School, Korea University, Seoul, 02841, Republic of Korea; BK21FOUR R&E Center for Learning Health Systems, Korea University, Seoul, 02841, Republic of Korea.
| |
Collapse
|
103
|
Chintalaphani SR, Pineda SS, Deveson IW, Kumar KR. An update on the neurological short tandem repeat expansion disorders and the emergence of long-read sequencing diagnostics. Acta Neuropathol Commun 2021; 9:98. [PMID: 34034831 PMCID: PMC8145836 DOI: 10.1186/s40478-021-01201-x] [Citation(s) in RCA: 80] [Impact Index Per Article: 26.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2021] [Accepted: 05/17/2021] [Indexed: 12/14/2022] Open
Abstract
BACKGROUND Short tandem repeat (STR) expansion disorders are an important cause of human neurological disease. They have an established role in more than 40 different phenotypes including the myotonic dystrophies, Fragile X syndrome, Huntington's disease, the hereditary cerebellar ataxias, amyotrophic lateral sclerosis and frontotemporal dementia. MAIN BODY STR expansions are difficult to detect and may explain unsolved diseases, as highlighted by recent findings including: the discovery of a biallelic intronic 'AAGGG' repeat in RFC1 as the cause of cerebellar ataxia, neuropathy, and vestibular areflexia syndrome (CANVAS); and the finding of 'CGG' repeat expansions in NOTCH2NLC as the cause of neuronal intranuclear inclusion disease and a range of clinical phenotypes. However, established laboratory techniques for diagnosis of repeat expansions (repeat-primed PCR and Southern blot) are cumbersome, low-throughput and poorly suited to parallel analysis of multiple gene regions. While next generation sequencing (NGS) has been increasingly used, established short-read NGS platforms (e.g., Illumina) are unable to genotype large and/or complex repeat expansions. Long-read sequencing platforms recently developed by Oxford Nanopore Technology and Pacific Biosciences promise to overcome these limitations to deliver enhanced diagnosis of repeat expansion disorders in a rapid and cost-effective fashion. CONCLUSION We anticipate that long-read sequencing will rapidly transform the detection of short tandem repeat expansion disorders for both clinical diagnosis and gene discovery.
Collapse
Affiliation(s)
- Sanjog R. Chintalaphani
- School of Medicine, University of New South Wales, Sydney, 2052 Australia
- Kinghorn Centre for Clinical Genomics, Garvan Institute of Medical Research, Darlinghurst, NSW 2010 Australia
| | - Sandy S. Pineda
- Garvan-Weizmann Centre for Cellular Genomics, Garvan Institute of Medical Research, Darlinghurst, NSW 2010 Australia
- Brain and Mind Centre, University of Sydney, Camperdown, NSW 2050 Australia
| | - Ira W. Deveson
- Kinghorn Centre for Clinical Genomics, Garvan Institute of Medical Research, Darlinghurst, NSW 2010 Australia
- Faculty of Medicine, St Vincent’s Clinical School, University of New South Wales, Sydney, NSW 2010 Australia
| | - Kishore R. Kumar
- Kinghorn Centre for Clinical Genomics, Garvan Institute of Medical Research, Darlinghurst, NSW 2010 Australia
- Molecular Medicine Laboratory and Neurology Department, Central Clinical School, Concord Repatriation General Hospital, University of Sydney, Concord, NSW 2137 Australia
| |
Collapse
|
104
|
Eslami Rasekh M, Hernández Y, Drinan SD, Fuxman Bass J, Benson G. Genome-wide characterization of human minisatellite VNTRs: population-specific alleles and gene expression differences. Nucleic Acids Res 2021; 49:4308-4324. [PMID: 33849068 PMCID: PMC8096271 DOI: 10.1093/nar/gkab224] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2020] [Revised: 03/06/2021] [Accepted: 03/18/2021] [Indexed: 11/12/2022] Open
Abstract
Variable Number Tandem Repeats (VNTRs) are tandem repeat (TR) loci that vary in copy number across a population. Using our program, VNTRseek, we analyzed human whole genome sequencing datasets from 2770 individuals in order to detect minisatellite VNTRs, i.e., those with pattern sizes ≥7 bp. We detected 35 638 VNTR loci and classified 5676 as commonly polymorphic (i.e. with non-reference alleles occurring in >5% of the population). Commonly polymorphic VNTR loci were found to be enriched in genomic regions with regulatory function, i.e. transcription start sites and enhancers. Investigation of the commonly polymorphic VNTRs in the context of population ancestry revealed that 1096 loci contained population-specific alleles and that those could be used to classify individuals into super-populations with near-perfect accuracy. Search for quantitative trait loci (eQTLs), among the VNTRs proximal to genes, indicated that in 187 genes expression differences correlated with VNTR genotype. We validated our predictions in several ways, including experimentally, through the identification of predicted alleles in long reads, and by comparisons showing consistency between sequencing platforms. This study is the most comprehensive analysis of minisatellite VNTRs in the human population to date.
Collapse
Affiliation(s)
| | - Yözen Hernández
- Graduate Program in Bioinformatics, Boston University, Boston, MA 02215, USA
| | | | - Juan I Fuxman Bass
- Graduate Program in Bioinformatics, Boston University, Boston, MA 02215, USA
- Department of Biology, Boston University, Boston, MA 02215, USA
| | - Gary Benson
- Graduate Program in Bioinformatics, Boston University, Boston, MA 02215, USA
- Department of Biology, Boston University, Boston, MA 02215, USA
- Department of Computer Science, Boston University, Boston, MA 02215, USA
| |
Collapse
|
105
|
Depienne C, Mandel JL. 30 years of repeat expansion disorders: What have we learned and what are the remaining challenges? Am J Hum Genet 2021; 108:764-785. [PMID: 33811808 PMCID: PMC8205997 DOI: 10.1016/j.ajhg.2021.03.011] [Citation(s) in RCA: 178] [Impact Index Per Article: 59.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2020] [Accepted: 03/05/2021] [Indexed: 12/13/2022] Open
Abstract
Tandem repeats represent one of the most abundant class of variations in human genomes, which are polymorphic by nature and become highly unstable in a length-dependent manner. The expansion of repeat length across generations is a well-established process that results in human disorders mainly affecting the central nervous system. At least 50 disorders associated with expansion loci have been described to date, with half recognized only in the last ten years, as prior methodological difficulties limited their identification. These limitations still apply to the current widely used molecular diagnostic methods (exome or gene panels) and thus result in missed diagnosis detrimental to affected individuals and their families, especially for disorders that are very rare and/or clinically not recognizable. Most of these disorders have been identified through family-driven approaches and many others likely remain to be identified. The recent development of long-read technologies provides a unique opportunity to systematically investigate the contribution of tandem repeats and repeat expansions to the genetic architecture of human disorders. In this review, we summarize the current and most recent knowledge about the genetics of repeat expansion disorders and the diversity of their pathophysiological mechanisms and outline the perspectives of developing personalized treatments in the future.
Collapse
Affiliation(s)
- Christel Depienne
- Institute of Human Genetics, University Hospital Essen, University of Duisburg-Essen, Essen, Germany; Institut du Cerveau et de la Moelle épinière (ICM), Sorbonne Université, UMR S 1127, Inserm U1127, CNRS UMR 7225, 75013 Paris, France.
| | - Jean-Louis Mandel
- Institut de Génétique et de Biologie Moléculaire et Cellulaire, Illkirch 67400, France; Centre National de la Recherche Scientifique, UMR 7104, Illkirch 67400, France; Institut National de la Santé et de la Recherche Médicale, U 1258, Illkirch 67400, France; Université de Strasbourg, Illkirch 67400, France; USIAS University of Strasbourg Institute of Advanced study, 67000 Strasbourg, France.
| |
Collapse
|
106
|
Garg P, Martin-Trujillo A, Rodriguez OL, Gies SJ, Hadelia E, Jadhav B, Jain M, Paten B, Sharp AJ. Pervasive cis effects of variation in copy number of large tandem repeats on local DNA methylation and gene expression. Am J Hum Genet 2021; 108:809-824. [PMID: 33794196 DOI: 10.1016/j.ajhg.2021.03.016] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2020] [Accepted: 03/11/2021] [Indexed: 12/27/2022] Open
Abstract
Variable number tandem repeats (VNTRs) are composed of large tandemly repeated motifs, many of which are highly polymorphic in copy number. However, because of their large size and repetitive nature, they remain poorly studied. To investigate the regulatory potential of VNTRs, we used read-depth data from Illumina whole-genome sequencing to perform association analysis between copy number of ∼70,000 VNTRs (motif size ≥ 10 bp) with both gene expression (404 samples in 48 tissues) and DNA methylation (235 samples in peripheral blood), identifying thousands of VNTRs that are associated with local gene expression (eVNTRs) and DNA methylation levels (mVNTRs). Using an independent cohort, we validated 73%-80% of signals observed in the two discovery cohorts, while allelic analysis of VNTR length and CpG methylation in 30 Oxford Nanopore genomes gave additional support for mVNTR loci, thus providing robust evidence to support that these represent genuine associations. Further, conditional analysis indicated that many eVNTRs and mVNTRs act as QTLs independently of other local variation. We also observed strong enrichments of eVNTRs and mVNTRs for regulatory features such as enhancers and promoters. Using the Human Genome Diversity Panel, we define sets of VNTRs that show highly divergent copy numbers among human populations and show that these are enriched for regulatory effects and preferentially associate with genes that have been linked with human phenotypes through GWASs. Our study provides strong evidence supporting functional variation at thousands of VNTRs and defines candidate sets of VNTRs, copy number variation of which potentially plays a role in numerous human phenotypes.
Collapse
|
107
|
Mousavi N, Margoliash J, Pusarla N, Saini S, Yanicky R, Gymrek M. TRTools: a toolkit for genome-wide analysis of tandem repeats. Bioinformatics 2021; 37:731-733. [PMID: 32805020 PMCID: PMC8097685 DOI: 10.1093/bioinformatics/btaa736] [Citation(s) in RCA: 31] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2020] [Revised: 08/07/2020] [Accepted: 08/12/2020] [Indexed: 12/11/2022] Open
Abstract
SUMMARY A rich set of tools have recently been developed for performing genome-wide genotyping of tandem repeats (TRs). However, standardized tools for downstream analysis of these results are lacking. To facilitate TR analysis applications, we present TRTools, a Python library and suite of command line tools for filtering, merging and quality control of TR genotype files. TRTools utilizes an internal harmonization module, making it compatible with outputs from a wide range of TR genotypers. AVAILABILITY AND IMPLEMENTATION TRTools is freely available at https://github.com/gymreklab/TRTools. Detailed documentation is available at https://trtools.readthedocs.io. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Nima Mousavi
- Department of Electrical and Computer Engineering, University of California San Diego, La Jolla, 92093, USA
| | - Jonathan Margoliash
- Department of Medicine, University of California San Diego, La Jolla, 92093, USA
| | - Neha Pusarla
- Department of Bioengineering, University of California San Diego, La Jolla, 92093, USA
| | - Shubham Saini
- Department of Computer Science and Engineering, University of California San Diego, La Jolla, 92093, USA
| | - Richard Yanicky
- Department of Medicine, University of California San Diego, La Jolla, 92093, USA
| | - Melissa Gymrek
- Department of Medicine, University of California San Diego, La Jolla, 92093, USA
- Department of Computer Science and Engineering, University of California San Diego, La Jolla, 92093, USA
| |
Collapse
|
108
|
Costales M, Casanueva R, Suárez V, Asensi JM, Cifuentes GA, Diñeiro M, Cadiñanos J, López F, Álvarez-Marcos C, Otero A, Gómez J, Llorente JL, Cabanillas R. CANVAS: A New Genetic Entity in the Otorhinolaryngologist's Differential Diagnosis. Otolaryngol Head Neck Surg 2021; 166:74-79. [PMID: 33940977 DOI: 10.1177/01945998211008398] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
OBJECTIVE The biallelic inheritance of an expanded intronic pentamer (AAGGG)exp in the gene encoding replication factor C subunit 1 (RFC1) has been found to be a cause of cerebellar ataxia, neuropathy, and vestibular areflexia syndrome (CANVAS). This study describes clinical and genetic features of our patients with clinical suspicion of the syndrome. STUDY DESIGN A retrospective descriptive study from an ataxia database comprising 500 patients. SETTING The study was performed at the Otorhinolaryngology Department of a hospital in the north of Spain. METHODS Specific genetic testing for CANVAS was performed in 13 patients with clinical suspicion of complete or incomplete syndrome. The clinical diagnosis was supported by quantitative vestibular hypofunction, cerebellar atrophy, and abnormal sensory nerve conduction testing. RESULTS Nine of 13 (69%) patients met clinical diagnostic criteria for definite CANVAS disease. The first manifestation of the syndrome was lower limb dysesthesia in 8 of 13 patients and gait imbalance in 5 of 13. Eleven of 13 (85%) patients were carriers of the biallelic (AAGGG)exp in RFC1. CONCLUSION A genetic cause of CANVAS has recently been discovered. We propose genetic screening for biallelic expansions of the AAGGG pentamer of RFC1 in all patients with clinical suspicion of CANVAS, since accurate early diagnosis could improve the quality of life of these patients.
Collapse
Affiliation(s)
- María Costales
- Otorhinolaryngoly Department, Hospital Central de Asturias, Oviedo, Asturias, Spain
| | - Rodrigo Casanueva
- Otorhinolaryngoly Department, Hospital Central de Asturias, Oviedo, Asturias, Spain
| | - Vanessa Suárez
- Otorhinolaryngoly Department, Hospital Central de Asturias, Oviedo, Asturias, Spain
| | | | - Guadalupe A Cifuentes
- Instituto de Medicina Oncológica y Molecular de Asturias (IMOMA), Gijón, Asturias, Spain
| | - Marta Diñeiro
- Instituto de Medicina Oncológica y Molecular de Asturias (IMOMA), Gijón, Asturias, Spain
| | - Juan Cadiñanos
- Instituto de Medicina Oncológica y Molecular de Asturias (IMOMA), Gijón, Asturias, Spain
| | - Fernando López
- Otorhinolaryngoly Department, Hospital Central de Asturias, Oviedo, Asturias, Spain
| | - César Álvarez-Marcos
- Otorhinolaryngoly Department, Hospital Central de Asturias, Oviedo, Asturias, Spain
| | - Andrea Otero
- Instituto de Medicina Oncológica y Molecular de Asturias (IMOMA), Gijón, Asturias, Spain
| | - Justo Gómez
- Otorhinolaryngoly Department, Hospital Central de Asturias, Oviedo, Asturias, Spain
| | - José Luis Llorente
- Otorhinolaryngoly Department, Hospital Central de Asturias, Oviedo, Asturias, Spain
| | | |
Collapse
|
109
|
van der Sanden BPGH, Corominas J, de Groot M, Pennings M, Meijer RPP, Verbeek N, van de Warrenburg B, Schouten M, Yntema HG, Vissers LELM, Kamsteeg EJ, Gilissen C. Systematic analysis of short tandem repeats in 38,095 exomes provides an additional diagnostic yield. Genet Med 2021; 23:1569-1573. [PMID: 33846582 DOI: 10.1038/s41436-021-01174-1] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2020] [Revised: 03/29/2021] [Accepted: 03/30/2021] [Indexed: 12/31/2022] Open
Abstract
PURPOSE Expansions of a subset of short tandem repeats (STRs) have been implicated in approximately 30 different human genetic disorders. Despite extensive application of exome sequencing (ES) in routine diagnostic genetic testing, STRs are not routinely identified from these data. METHODS We assessed diagnostic utility of STR analysis in exome sequencing by applying ExpansionHunter to 2,867 exomes from movement disorder patients and 35,228 other clinical exomes. RESULTS We identified 38 movement disorder patients with a possible aberrant STR length. Validation by polymerase chain reaction (PCR) and/or repeat-primed PCR technologies confirmed the presence of aberrant expansion alleles for 13 (34%). For seven of these patients the genotype was compatible with the phenotypic description, resulting in a molecular diagnosis. We subsequently tested the remainder of our diagnostic ES cohort, including over 30 clinically and genetically heterogeneous disorders. Optimized manual curation yielded 167 samples with a likely aberrant STR length. Validations confirmed 93/167 (56%) aberrant expansion alleles, of which 48 were in the pathogenic range and 45 in the premutation range. CONCLUSION Our work provides guidance for the implementation of STR analysis in clinical ES. Our results show that systematic STR evaluation may increase diagnostic ES yield by 0.2%, and recommend making STR evaluation a routine part of ES interpretation in genetic testing laboratories.
Collapse
Affiliation(s)
- Bart P G H van der Sanden
- Department of Human Genetics, Radboud university medical center, Nijmegen, The Netherlands.,Donders Institute for Brain, Cognition and Behaviour, Nijmegen, The Netherlands
| | - Jordi Corominas
- Department of Human Genetics, Radboud university medical center, Nijmegen, The Netherlands
| | - Michelle de Groot
- Department of Human Genetics, Radboud university medical center, Nijmegen, The Netherlands
| | - Maartje Pennings
- Department of Human Genetics, Radboud university medical center, Nijmegen, The Netherlands
| | - Rowdy P P Meijer
- Department of Human Genetics, Radboud university medical center, Nijmegen, The Netherlands
| | - Nienke Verbeek
- Department of Genetics, University Medical Center Utrecht, Utrecht, The Netherlands
| | - Bart van de Warrenburg
- Department of Neurology, Donders Institute for Brain, Cognition and Behaviour, Radboud university medical center, Nijmegen, The Netherlands
| | - Meyke Schouten
- Department of Human Genetics, Radboud university medical center, Nijmegen, The Netherlands
| | - Helger G Yntema
- Department of Human Genetics, Radboud university medical center, Nijmegen, The Netherlands
| | - Lisenka E L M Vissers
- Department of Human Genetics, Radboud university medical center, Nijmegen, The Netherlands.,Donders Institute for Brain, Cognition and Behaviour, Nijmegen, The Netherlands
| | - Erik-Jan Kamsteeg
- Department of Human Genetics, Radboud university medical center, Nijmegen, The Netherlands.
| | - Christian Gilissen
- Department of Human Genetics, Radboud university medical center, Nijmegen, The Netherlands. .,Radboud Institute for Molecular Life Sciences, Nijmegen, The Netherlands.
| |
Collapse
|
110
|
Bakhtiari M, Park J, Ding YC, Shleizer-Burko S, Neuhausen SL, Halldórsson BV, Stefánsson K, Gymrek M, Bafna V. Variable number tandem repeats mediate the expression of proximal genes. Nat Commun 2021; 12:2075. [PMID: 33824302 PMCID: PMC8024321 DOI: 10.1038/s41467-021-22206-z] [Citation(s) in RCA: 41] [Impact Index Per Article: 13.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2020] [Accepted: 02/17/2021] [Indexed: 12/12/2022] Open
Abstract
Variable number tandem repeats (VNTRs) account for significant genetic variation in many organisms. In humans, VNTRs have been implicated in both Mendelian and complex disorders, but are largely ignored by genomic pipelines due to the complexity of genotyping and the computational expense. We describe adVNTR-NN, a method that uses shallow neural networks to genotype a VNTR in 18 seconds on 55X whole genome data, while maintaining high accuracy. We use adVNTR-NN to genotype 10,264 VNTRs in 652 GTEx individuals. Associating VNTR length with gene expression in 46 tissues, we identify 163 "eVNTRs". Of the 22 eVNTRs in blood where independent data is available, 21 (95%) are replicated in terms of significance and direction of association. 49% of the eVNTR loci show a strong and likely causal impact on the expression of genes and 80% have maximum effect size at least 0.3. The impacted genes are involved in diseases including Alzheimer's, obesity and familial cancers, highlighting the importance of VNTRs for understanding the genetic basis of complex diseases.
Collapse
Affiliation(s)
- Mehrdad Bakhtiari
- Department of Computer Science & Engineering, University of California, San Diego, La Jolla, CA, USA
| | - Jonghun Park
- Department of Computer Science & Engineering, University of California, San Diego, La Jolla, CA, USA
| | - Yuan-Chun Ding
- Department of Population Sciences, Beckman Research Institute of City of Hope, Duarte, CA, USA
| | | | - Susan L Neuhausen
- Department of Population Sciences, Beckman Research Institute of City of Hope, Duarte, CA, USA
| | | | | | - Melissa Gymrek
- Department of Computer Science & Engineering, University of California, San Diego, La Jolla, CA, USA
- Department of Medicine, University of California, San Diego, La Jolla, CA, USA
| | - Vineet Bafna
- Department of Computer Science & Engineering, University of California, San Diego, La Jolla, CA, USA.
| |
Collapse
|
111
|
Laboratory testing for fragile X, 2021 revision: a technical standard of the American College of Medical Genetics and Genomics (ACMG). Genet Med 2021; 23:799-812. [PMID: 33795824 DOI: 10.1038/s41436-021-01115-y] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2021] [Revised: 02/01/2021] [Accepted: 02/02/2021] [Indexed: 11/08/2022] Open
Abstract
Molecular genetic testing of the FMR1 gene is commonly performed in clinical laboratories. Pathogenic variants in the FMR1 gene are associated with fragile X syndrome, fragile X-associated tremor ataxia syndrome (FXTAS), and fragile X-associated primary ovarian insufficiency (FXPOI). This document provides updated information regarding FMR1 pathogenic variants, including prevalence, genotype-phenotype correlations, and variant nomenclature. Methodological considerations are provided for Southern blot analysis and polymerase chain reaction (PCR) amplification of FMR1, including triplet repeat-primed and methylation-specific PCR.The American College of Medical Genetics and Genomics (ACMG) Laboratory Quality Assurance Committee has the mission of maintaining high technical standards for the performance and interpretation of genetic tests. In part, this is accomplished by the publication of the document ACMG Technical Standards for Clinical Genetics Laboratories, which is now maintained online ( http://www.acmg.net ). This subcommittee also reviews the outcome of national proficiency testing in the genetics area and may choose to focus on specific diseases or methodologies in response to those results. Accordingly, the subcommittee selected fragile X syndrome to be the first topic in a series of supplemental sections, recognizing that it is one of the most frequently ordered genetic tests and that it has many alternative methods with different strengths and weaknesses. This document is the fourth update to the original standards and guidelines for fragile X testing that were published in 2001, with revisions in 2005 and 2013, respectively.This versionClarifies the clinical features associated with different FMRI variants (Section 2.3)Discusses important reporting considerations (Section 3.3.1.3)Provides updates on technology (Section 4.1).
Collapse
|
112
|
Kinney N, Kang L, Bains H, Lawson E, Husain M, Husain K, Sandhu I, Shin Y, Carter JK, Anandakrishnan R, Michalak P, Garner H. Ethnically biased microsatellites contribute to differential gene expression and glutathione metabolism in Africans and Europeans. PLoS One 2021; 16:e0249148. [PMID: 33765058 PMCID: PMC7993785 DOI: 10.1371/journal.pone.0249148] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2021] [Accepted: 03/11/2021] [Indexed: 12/28/2022] Open
Abstract
Approximately three percent of the human genome is occupied by microsatellites: a type of short tandem repeat (STR). Microsatellites have well established effects on (a) the genetic structure of diverse human populations and (b) expression of nearby genes. These lines of inquiry have uncovered 3,984 ethnically biased microsatellite loci (EBML) and 28,375 expression STRs (eSTRs), respectively. We hypothesize that a combination of EBML, eSTRs, and gene expression data (RNA-seq) can be used to show that microsatellites contribute to differential gene expression and phenotype in human populations. In fact, our previous study demonstrated a degree of mutual overlap between EBML and eSTRs but fell short of quantifying effects on gene expression. The present work aims to narrow the gap. First, we identify 313 overlapping EBML/eSTRs and recapitulate their mutual overlap. The 313 EBML/eSTRs are then characterized across ethnicity and tissue type. We use RNA-seq data to pursue validation of 49 regions that affect whole blood gene expression; 32 out of 54 affected genes are differentially expressed in Africans and Europeans. We quantify the relative contribution of these 32 genes to differential expression; fold change tends to be less than other differentially expressed genes. Repeat length correlates with expression for 15 of the 32 genes; two are conspicuously involved in glutathione metabolism. Finally, we repurpose a mathematical model of glutathione metabolism to investigate how a single polymorphic microsatellite affects phenotype. We conclude with a testable prediction that microsatellite polymorphisms affect GPX7 expression and oxidative stress in Africans and Europeans.
Collapse
Affiliation(s)
- Nick Kinney
- Edward Via College of Osteopathic Medicine, Blacksburg, Virginia, United States of America
- Gibbs Cancer Center & Research Institute, Spartanburg, South Carolina, United States of America
- * E-mail:
| | - Lin Kang
- Edward Via College of Osteopathic Medicine, Blacksburg, Virginia, United States of America
- Gibbs Cancer Center & Research Institute, Spartanburg, South Carolina, United States of America
| | - Harpal Bains
- Edward Via College of Osteopathic Medicine, Blacksburg, Virginia, United States of America
| | - Elizabeth Lawson
- Edward Via College of Osteopathic Medicine, Blacksburg, Virginia, United States of America
| | - Mesam Husain
- Edward Via College of Osteopathic Medicine, Blacksburg, Virginia, United States of America
| | - Kumayl Husain
- Edward Via College of Osteopathic Medicine, Blacksburg, Virginia, United States of America
| | - Inderjit Sandhu
- Edward Via College of Osteopathic Medicine, Blacksburg, Virginia, United States of America
| | - Yongdeok Shin
- Edward Via College of Osteopathic Medicine, Blacksburg, Virginia, United States of America
| | - Javan K. Carter
- University of Colorado Boulder, Boulder, Colorado, United States of America
| | - Ramu Anandakrishnan
- Edward Via College of Osteopathic Medicine, Blacksburg, Virginia, United States of America
- Gibbs Cancer Center & Research Institute, Spartanburg, South Carolina, United States of America
| | - Pawel Michalak
- Edward Via College of Osteopathic Medicine, Blacksburg, Virginia, United States of America
- Gibbs Cancer Center & Research Institute, Spartanburg, South Carolina, United States of America
- Institute of Evolution, University of Haifa, Haifa, Israel
| | - Harold Garner
- Edward Via College of Osteopathic Medicine, Blacksburg, Virginia, United States of America
- Gibbs Cancer Center & Research Institute, Spartanburg, South Carolina, United States of America
| |
Collapse
|
113
|
Annear DJ, Vandeweyer G, Elinck E, Sanchis-Juan A, French CE, Raymond L, Kooy RF. Abundancy of polymorphic CGG repeats in the human genome suggest a broad involvement in neurological disease. Sci Rep 2021; 11:2515. [PMID: 33510257 PMCID: PMC7844047 DOI: 10.1038/s41598-021-82050-5] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2020] [Accepted: 12/29/2020] [Indexed: 11/09/2022] Open
Abstract
Expanded CGG-repeats have been linked to neurodevelopmental and neurodegenerative disorders, including the fragile X syndrome and fragile X-associated tremor/ataxia syndrome (FXTAS). We hypothesized that as of yet uncharacterised CGG-repeat expansions within the genome contribute to human disease. To catalogue the CGG-repeats, 544 human whole genomes were analyzed. In total, 6101 unique CGG-repeats were detected of which more than 93% were highly variable in repeat length. Repeats with a median size of 12 repeat units or more were always polymorphic but shorter repeats were often polymorphic, suggesting a potential intergenerational instability of the CGG region even for repeats units with a median length of four or less. 410 of the CGG repeats were associated with known neurodevelopmental disease genes or with strong candidate genes. Based on their frequency and genomic location, CGG repeats may thus be a currently overlooked cause of human disease.
Collapse
Affiliation(s)
- Dale J Annear
- Department of Medical Genetics, University of Antwerp, Antwerp, Belgium
| | - Geert Vandeweyer
- Department of Medical Genetics, University of Antwerp, Antwerp, Belgium
| | - Ellen Elinck
- Department of Medical Genetics, University of Antwerp, Antwerp, Belgium
| | - Alba Sanchis-Juan
- NIHR BioResource, Cambridge University Hospitals NHS Foundation Trust, Cambridge Biomedical Campus, Cambridge, CB2 0QQ, UK.,Department of Haematology, NHS Blood and Transplant Centre, University of Cambridge, Cambridge, CB2 0PT, UK
| | - Courtney E French
- Department of Paediatrics, University of Cambridge, Cambridge, CB2 0QQ, UK
| | - Lucy Raymond
- NIHR BioResource, Cambridge University Hospitals NHS Foundation Trust, Cambridge Biomedical Campus, Cambridge, CB2 0QQ, UK.,Department of Medical Genetics, Cambridge Institute for Medical Research, University of Cambridge, Cambridge, CB2 0XY, UK
| | - R Frank Kooy
- Department of Medical Genetics, University of Antwerp, Antwerp, Belgium.
| |
Collapse
|
114
|
Oleksyk TK, Wolfsberger WW, Weber AM, Shchubelka K, Oleksyk OT, Levchuk O, Patrus A, Lazar N, Castro-Marquez SO, Hasynets Y, Boldyzhar P, Neymet M, Urbanovych A, Stakhovska V, Malyar K, Chervyakova S, Podoroha O, Kovalchuk N, Rodriguez-Flores JL, Zhou W, Medley S, Battistuzzi F, Liu R, Hou Y, Chen S, Yang H, Yeager M, Dean M, Mills RE, Smolanka V. Genome diversity in Ukraine. Gigascience 2021; 10:6079618. [PMID: 33438729 PMCID: PMC7804371 DOI: 10.1093/gigascience/giaa159] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2020] [Revised: 08/21/2020] [Accepted: 12/15/2020] [Indexed: 01/21/2023] Open
Abstract
Background The main goal of this collaborative effort is to provide genome-wide data for the previously underrepresented population in Eastern Europe, and to provide cross-validation of the data from genome sequences and genotypes of the same individuals acquired by different technologies. We collected 97 genome-grade DNA samples from consented individuals representing major regions of Ukraine that were consented for public data release. BGISEQ-500 sequence data and genotypes by an Illumina GWAS chip were cross-validated on multiple samples and additionally referenced to 1 sample that has been resequenced by Illumina NovaSeq6000 S4 at high coverage. Results The genome data have been searched for genomic variation represented in this population, and a number of variants have been reported: large structural variants, indels, copy number variations, single-nucletide polymorphisms, and microsatellites. To our knowledge, this study provides the largest to-date survey of genetic variation in Ukraine, creating a public reference resource aiming to provide data for medical research in a large understudied population. Conclusions Our results indicate that the genetic diversity of the Ukrainian population is uniquely shaped by evolutionary and demographic forces and cannot be ignored in future genetic and biomedical studies. These data will contribute a wealth of new information bringing forth a wealth of novel, endemic and medically related alleles.
Collapse
Affiliation(s)
- Taras K Oleksyk
- Department of Biological Sciences, Uzhhorod National University, 32 Voloshyna Str., Uzhhorod 88000, Ukraine.,Department of Biological Sciences,Oakland University, Dodge Hall, 118 Library Dr., Rochester, MI 48309, USA.,Departamento de Biología, Universidad de Puerto Rico, Mayagüez, PR 00682, USA
| | - Walter W Wolfsberger
- Department of Biological Sciences, Uzhhorod National University, 32 Voloshyna Str., Uzhhorod 88000, Ukraine.,Department of Biological Sciences,Oakland University, Dodge Hall, 118 Library Dr., Rochester, MI 48309, USA.,Departamento de Biología, Universidad de Puerto Rico, Mayagüez, PR 00682, USA
| | - Alexandra M Weber
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Khrystyna Shchubelka
- Department of Biological Sciences,Oakland University, Dodge Hall, 118 Library Dr., Rochester, MI 48309, USA.,Departamento de Biología, Universidad de Puerto Rico, Mayagüez, PR 00682, USA.,Department of Medicine, Uzhhorod National University, Uzhhorod 88000, Ukraine
| | - Olga T Oleksyk
- A. Novak Transcarpathian Regional Clinical Hospital, Uzhhorod 88000, Ukraine
| | | | | | | | - Stephanie O Castro-Marquez
- Department of Biological Sciences,Oakland University, Dodge Hall, 118 Library Dr., Rochester, MI 48309, USA.,Departamento de Biología, Universidad de Puerto Rico, Mayagüez, PR 00682, USA
| | - Yaroslava Hasynets
- Department of Biological Sciences, Uzhhorod National University, 32 Voloshyna Str., Uzhhorod 88000, Ukraine
| | - Patricia Boldyzhar
- Department of Medicine, Uzhhorod National University, Uzhhorod 88000, Ukraine
| | - Mikhailo Neymet
- Velyka Kopanya Family Hospital, Transcarpatia 90330, Ukraine
| | | | | | - Kateryna Malyar
- I.I.Mechnikov Dnipro Regional Clinical Hospital, Dnipro 49000, Ukraine
| | | | | | - Natalia Kovalchuk
- Rivne Regional Specialized Hospital of Radiation Protection, Rivne 33028, Ukraine
| | | | - Weichen Zhou
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Sarah Medley
- Department of Biological Sciences,Oakland University, Dodge Hall, 118 Library Dr., Rochester, MI 48309, USA
| | - Fabia Battistuzzi
- Department of Biological Sciences,Oakland University, Dodge Hall, 118 Library Dr., Rochester, MI 48309, USA
| | - Ryan Liu
- BGI Shenzhen, Shenzhen, 518083, China
| | - Yong Hou
- BGI Shenzhen, Shenzhen, 518083, China
| | - Siru Chen
- BGI Shenzhen, Shenzhen, 518083, China
| | | | - Meredith Yeager
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD 20892, USA
| | - Michael Dean
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD 20892, USA
| | - Ryan E Mills
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA.,Department of Human Genetics, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Volodymyr Smolanka
- Department of Medicine, Uzhhorod National University, Uzhhorod 88000, Ukraine
| |
Collapse
|
115
|
Mitra I, Huang B, Mousavi N, Ma N, Lamkin M, Yanicky R, Shleizer-Burko S, Lohmueller KE, Gymrek M. Patterns of de novo tandem repeat mutations and their role in autism. Nature 2021; 589:246-250. [PMID: 33442040 PMCID: PMC7810352 DOI: 10.1038/s41586-020-03078-7] [Citation(s) in RCA: 97] [Impact Index Per Article: 32.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2020] [Accepted: 11/23/2020] [Indexed: 12/19/2022]
Abstract
Autism Spectrum Disorder (ASD) is an early onset developmental disorder characterized by deficits in communication and social interaction and restrictive or repetitive behaviors1,2. Family studies demonstrate that ASD has a significant genetic basis with contributions both from inherited and de novo variants3,4. It has been estimated that de novo mutations may contribute to 30% of all simplex cases, in which only a single child is affected per family5. Tandem repeats (TRs), defined here as 1-20bp sequences repeated consecutively, comprise one of the largest sources of de novo mutations in humans6. TR expansions are implicated in dozens of neurological and psychiatric disorders7. Yet, de novo TR mutations have not been characterized on a genome-wide scale, and their contribution to ASD remains unexplored. We develop novel bioinformatics methods for identifying and prioritizing de novo TR mutations from sequencing data and then perform a genome-wide characterization of de novo TR mutations in ASD-affected probands and unaffected siblings. Compared to recent work on TRs in ASD8, we explicitly infer mutation events and their precise changes in repeat number, and primarily focus on more prevalent stepwise copy number changes rather than large expansions. Our results demonstrate a significant genome-wide excess of TR mutations in ASD probands. Mutations in probands tend to be larger, enriched in fetal brain regulatory regions, and predicted to be more evolutionarily deleterious. Overall, our results highlight the importance of considering repeat variants in future studies of de novo mutations.
Collapse
Affiliation(s)
- Ileena Mitra
- Bioinformatics and Systems Biology Program, University of California San Diego, La Jolla, CA, USA
| | - Bonnie Huang
- Department of Bioengineering, University of California San Diego, La Jolla, CA, USA
| | - Nima Mousavi
- Department of Electrical and Computer Engineering, University of California San Diego, La Jolla, CA, USA
| | - Nichole Ma
- Department of Medicine, University of California San Diego, La Jolla, CA, USA
| | - Michael Lamkin
- Department of Bioengineering, University of California San Diego, La Jolla, CA, USA
| | - Richard Yanicky
- Department of Medicine, University of California San Diego, La Jolla, CA, USA
| | | | - Kirk E Lohmueller
- Department of Ecology and Evolutionary Biology, University of California Los Angeles, Los Angeles, CA, USA. .,Department of Human Genetics, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA, USA.
| | - Melissa Gymrek
- Department of Medicine, University of California San Diego, La Jolla, CA, USA. .,Department of Computer Science and Engineering, University of California San Diego, La Jolla, CA, USA.
| |
Collapse
|
116
|
Koriath CAM, Kenny J, Ryan NS, Rohrer JD, Schott JM, Houlden H, Fox NC, Tabrizi SJ, Mead S. Genetic testing in dementia - utility and clinical strategies. Nat Rev Neurol 2021; 17:23-36. [PMID: 33168964 DOI: 10.1038/s41582-020-00416-1] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/11/2020] [Indexed: 02/07/2023]
Abstract
Techniques for clinical genetic testing in dementia disorders have advanced rapidly but remain to be more widely implemented in practice. A positive genetic test offers a precise molecular diagnosis, can help members of an affected family to determine personal risk, provides a basis for reproductive choices and can offer options for clinical trials. The likelihood of identifying a specific genetic cause of dementia depends on the clinical condition, the age at onset and family history. Attempts to match phenotypes to single genes are mostly inadvisable owing to clinical overlap between the dementias, genetic heterogeneity, pleiotropy and concurrent mutations. Currently, the appropriate genetic test in most cases of dementia is a next-generation sequencing gene panel, though some conditions necessitate specific types of test such as repeat expansion testing. Whole-exome and whole-genome sequencing are becoming financially feasible but raise or exacerbate complex issues such as variants of uncertain significance, secondary findings and the potential for re-analysis in light of new information. However, the capacity for data analysis and counselling is already restricting the provision of genetic testing. Patients and their relatives need to be given reliable information to enable them to make informed choices about tests, treatments and data sharing; the ability of patients with dementia to make decisions must be considered when providing this information.
Collapse
Affiliation(s)
| | - Joanna Kenny
- South West Thames Regional Genetics Service, London, UK
| | - Natalie S Ryan
- Dementia Research Centre, Department of Neurodegenerative Disease, UCL Queen Square Institute of Neurology, London, UK
- UK Dementia Research Institute, UCL Queen Square Institute of Neurology, London, UK
| | - Jonathan D Rohrer
- Dementia Research Centre, Department of Neurodegenerative Disease, UCL Queen Square Institute of Neurology, London, UK
| | - Jonathan M Schott
- Dementia Research Centre, Department of Neurodegenerative Disease, UCL Queen Square Institute of Neurology, London, UK
| | - Henry Houlden
- Neurogenetics Laboratory, National Hospital for Neurology and Neurosurgery, London, UK
| | - Nick C Fox
- Dementia Research Centre, Department of Neurodegenerative Disease, UCL Queen Square Institute of Neurology, London, UK
- UK Dementia Research Institute, UCL Queen Square Institute of Neurology, London, UK
| | - Sarah J Tabrizi
- Huntington's Disease Centre, Department of Neurodegenerative Disease, UCL Queen Square Institute of Neurology, London, UK
| | - Simon Mead
- MRC Prion Unit at UCL, UCL Institute of Prion Diseases, London, UK.
| |
Collapse
|
117
|
Russo R, Marra R, Rosato BE, Iolascon A, Andolfo I. Genetics and Genomics Approaches for Diagnosis and Research Into Hereditary Anemias. Front Physiol 2020; 11:613559. [PMID: 33414725 PMCID: PMC7783452 DOI: 10.3389/fphys.2020.613559] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2020] [Accepted: 12/03/2020] [Indexed: 01/19/2023] Open
Abstract
The hereditary anemias are a relatively heterogeneous set of disorders that can show wide clinical and genetic heterogeneity, which often hampers correct clinical diagnosis. The classical diagnostic workflow for these conditions generally used to start with analysis of the family and personal histories, followed by biochemical and morphological evaluations, and ending with genetic testing. However, the diagnostic framework has changed more recently, and genetic testing is now a suitable approach for differential diagnosis of these patients. There are several approaches to this genetic testing, the choice of which depends on phenotyping, genetic heterogeneity, and gene size. For patients who show complete phenotyping, single-gene testing remains recommended. However, genetic analysis now includes next-generation sequencing, which is generally based on custom-designed targeting panels and whole-exome sequencing. The use of next-generation sequencing also allows the identification of new causative genes, and of polygenic conditions and genetic factors that modify disease severity of hereditary anemias. In the research field, whole-genome sequencing is useful for the identification of non-coding causative mutations, which might account for the disruption of transcriptional factor occupancy sites and cis-regulatory elements. Moreover, advances in high-throughput sequencing techniques have now resulted in the identification of genome-wide profiling of the chromatin structures known as the topologically associating domains. These represent a recurrent disease mechanism that exposes genes to inappropriate regulatory elements, causing errors in gene expression. This review focuses on the challenges of diagnosis and research into hereditary anemias, with indications of both the advantages and disadvantages. Finally, we consider the future perspectives for the use of next-generation sequencing technologies in this era of precision medicine.
Collapse
Affiliation(s)
- Roberta Russo
- Department of Molecular Medicine and Medical Biotechnologies, University of Naples Federico II, Naples, Italy.,CEINGE Biotecnologie Avanzate, Naples, Italy
| | - Roberta Marra
- Department of Molecular Medicine and Medical Biotechnologies, University of Naples Federico II, Naples, Italy.,CEINGE Biotecnologie Avanzate, Naples, Italy
| | - Barbara Eleni Rosato
- Department of Molecular Medicine and Medical Biotechnologies, University of Naples Federico II, Naples, Italy.,CEINGE Biotecnologie Avanzate, Naples, Italy
| | - Achille Iolascon
- Department of Molecular Medicine and Medical Biotechnologies, University of Naples Federico II, Naples, Italy.,CEINGE Biotecnologie Avanzate, Naples, Italy
| | - Immacolata Andolfo
- Department of Molecular Medicine and Medical Biotechnologies, University of Naples Federico II, Naples, Italy.,CEINGE Biotecnologie Avanzate, Naples, Italy
| |
Collapse
|
118
|
Affiliation(s)
- Jan M Friedman
- Department of Medical Genetics, University of British Columbia, Vancouver, Canada
| | | | - John C Carey
- Department of Pediatrics, University of Utah Health, Salt Lake City
| |
Collapse
|
119
|
Ní Ghrálaigh F, Gallagher L, Lopez LM. Autism spectrum disorder genomics: The progress and potential of genomic technologies. Genomics 2020; 112:5136-5142. [DOI: 10.1016/j.ygeno.2020.09.022] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2020] [Revised: 09/01/2020] [Accepted: 09/08/2020] [Indexed: 12/27/2022]
|
120
|
Wan L, Chen Z, Wan N, Liu M, Xue J, Chen H, Zhang Y, Peng Y, Tang Z, Gong Y, Yuan H, Wang S, Deng Q, Hou X, Wang C, Peng H, Shi Y, Peng L, Lei L, Duan R, Xia K, Qiu R, Shen L, Tang B, Ashizawa T, Jiang H. Biallelic Intronic AAGGG Expansion of RFC1 is Related to Multiple System Atrophy. Ann Neurol 2020; 88:1132-1143. [PMID: 32939785 DOI: 10.1002/ana.25902] [Citation(s) in RCA: 39] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2020] [Revised: 09/14/2020] [Accepted: 09/14/2020] [Indexed: 12/13/2022]
Abstract
OBJECTIVE A recessive biallelic repeat expansion, (AAGGG)exp , in the RFC1 gene has been reported to be a frequent cause of late-onset ataxia. For cerebellar ataxia, neuropathy, and vestibular areflexia syndrome (CANVAS), the recessive biallelic (AAGGG)exp genotype was present in ~92% of cases. This study aimed to examine whether the pentanucleotide repeat (PNR) was related to multiple system atrophy (MSA), which shares a spectrum of symptoms with CANVAS. METHODS In this study, we screened the pathogenic (AAGGG)exp repeat and 5 other PNRs in 104 Chinese sporadic adult-onset ataxia of unknown aetiology (SAOA) patients, 282 MSA patients, and 203 unaffected individuals. Multiple molecular genetic tests were used, including long-range polymerase chain reaction (PCR), repeat-primed PCR (RP-PCR), Sanger sequencing, and Southern blot. Comprehensive clinical assessments were conducted, including neurological examination, neuroimaging, nerve electrophysiology, and examination of vestibular function. RESULTS We identified biallelic (AAGGG)exp in 1 SAOA patient and 3 MSA patients. Additionally, 1 MSA patient had the (AAGGG)exp /(AAAGG)exp genotype with uncertain pathogenicity. We also described the carrier frequency for different PNRs in our cohorts. Furthermore, we summarized the distinct phenotypes of affected patients, suggesting that biallelic (AAGGG)exp in RFC1 could be associated with MSA and should be screened routinely in the MSA diagnostic workflow. INTERPRETATION Our results expanded the clinical phenotypic spectrum of RFC1-related disorders and raised the possibility that MSA might share the same genetic background as CANVAS, which is crucial for re-evaluating the current CANVAS and MSA diagnostic criteria. ANN NEUROL 2020;88:1132-1143.
Collapse
Affiliation(s)
- Linlin Wan
- Department of Neurology, Xiangya Hospital, Central South University, Changsha, China
| | - Zhao Chen
- Department of Neurology, Xiangya Hospital, Central South University, Changsha, China.,Key Laboratory of Hunan Province in Neurodegenerative Disorders, Central South University, Changsha, China.,National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, Changsha, China
| | - Na Wan
- Department of Neurology, Xiangya Hospital, Central South University, Changsha, China
| | - Mingjie Liu
- Department of Neurology, Xiangya Hospital, Central South University, Changsha, China
| | - Jin Xue
- Center for Medical Genetics, School of Life Sciences, Central South University, Changsha, China
| | - Hongsheng Chen
- Department of Otolaryngology, Xiangya Hospital, Central South University, Changsha, China
| | - Youming Zhang
- Department of Radiology, Xiangya Hospital, Central South University, Changsha, China
| | - Yun Peng
- Department of Neurology, Xiangya Hospital, Central South University, Changsha, China
| | - Zhichao Tang
- Department of Neurology, Xiangya Hospital, Central South University, Changsha, China
| | - Yiqing Gong
- Department of Neurology, Xiangya Hospital, Central South University, Changsha, China
| | - Hongyu Yuan
- Department of Neurology, Xiangya Hospital, Central South University, Changsha, China
| | - Shang Wang
- Department of Neurology, Xiangya Hospital, Central South University, Changsha, China
| | - Qi Deng
- Department of Neurology, Xiangya Hospital, Central South University, Changsha, China
| | - Xuan Hou
- Department of Neurology, Xiangya Hospital, Central South University, Changsha, China
| | - Chunrong Wang
- Department of Pathology, Xiangya Hospital, Central South University, Changsha, China
| | - Huirong Peng
- Department of Neurology, Xiangya Hospital, Central South University, Changsha, China
| | - Yuting Shi
- Department of Neurology, Xiangya Hospital, Central South University, Changsha, China
| | - Linliu Peng
- Department of Neurology, Xiangya Hospital, Central South University, Changsha, China
| | - Lijing Lei
- Department of Neurology, Xiangya Hospital, Central South University, Changsha, China
| | - Ranhui Duan
- Center for Medical Genetics, School of Life Sciences, Central South University, Changsha, China.,Hunan Key Laboratory of Medical Genetics, Central South University, Changsha, China
| | - Kun Xia
- Center for Medical Genetics, School of Life Sciences, Central South University, Changsha, China.,Hunan Key Laboratory of Medical Genetics, Central South University, Changsha, China
| | - Rong Qiu
- School of Computer Science and Engineering, Central South University, Changsha, China
| | - Lu Shen
- Department of Neurology, Xiangya Hospital, Central South University, Changsha, China.,Key Laboratory of Hunan Province in Neurodegenerative Disorders, Central South University, Changsha, China.,National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, Changsha, China
| | - Beisha Tang
- Department of Neurology, Xiangya Hospital, Central South University, Changsha, China.,Key Laboratory of Hunan Province in Neurodegenerative Disorders, Central South University, Changsha, China.,National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, Changsha, China
| | - Tetsuo Ashizawa
- Neuroscience Research Program, Methodist Hospital Research Institute, Houston, TX, USA
| | - Hong Jiang
- Department of Neurology, Xiangya Hospital, Central South University, Changsha, China.,Key Laboratory of Hunan Province in Neurodegenerative Disorders, Central South University, Changsha, China.,National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, Changsha, China
| |
Collapse
|
121
|
Kristmundsdottir S, Eggertsson HP, Arnadottir GA, Halldorsson BV. popSTR2 enables clinical and population-scale genotyping of microsatellites. Bioinformatics 2020; 36:2269-2271. [PMID: 31804671 PMCID: PMC7141861 DOI: 10.1093/bioinformatics/btz913] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2019] [Revised: 11/27/2019] [Accepted: 12/03/2019] [Indexed: 01/01/2023] Open
Abstract
Summary popSTR2 is an update and augmentation of our previous work ‘popSTR: a population-based microsatellite genotyper’. To make genotyping sensitive to inter-sample differences, we supply a kernel to estimate sample-specific slippage rates. For clinical sequencing purposes, a panel of known pathogenic repeat expansions is provided along with a script that scans and flags for manual inspection markers indicative of a pathogenic expansion. Like its predecessor, popSTR2 allows for joint genotyping of samples at a population scale. We now provide a binning method that makes the microsatellite genotypes more amenable to analysis within standard association pipelines and can increase association power. Availability and implementation https://github.com/DecodeGenetics/popSTR. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Snædis Kristmundsdottir
- deCODE genetics/Amgen, Reykjavík 102, Iceland.,School of Science and Engineering, Reykjavík University, Reykjavík 102, Iceland
| | | | - Gudny A Arnadottir
- School of Science and Engineering, Reykjavík University, Reykjavík 102, Iceland
| | - Bjarni V Halldorsson
- deCODE genetics/Amgen, Reykjavík 102, Iceland.,School of Science and Engineering, Reykjavík University, Reykjavík 102, Iceland
| |
Collapse
|
122
|
Fazal S, Danzi MC, Cintra VP, Bis-Brewer DM, Dolzhenko E, Eberle MA, Zuchner S. Large scale in silico characterization of repeat expansion variation in human genomes. Sci Data 2020; 7:294. [PMID: 32901039 PMCID: PMC7479135 DOI: 10.1038/s41597-020-00633-9] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2020] [Accepted: 08/13/2020] [Indexed: 11/21/2022] Open
Abstract
Significant progress has been made in elucidating single nucleotide polymorphism diversity in the human population. However, the majority of the variation space in the genome is structural and remains partially elusive. One form of structural variation is tandem repeats (TRs). Expansion of TRs are responsible for over 40 diseases, but we hypothesize these represent only a fraction of the pathogenic repeat expansions that exist. Here we characterize long or expanded TR variation in 1,115 human genomes as well as a replication cohort of 2,504 genomes, identified using ExpansionHunter Denovo. We found that individual genomes typically harbor several rare, large TRs, generally in non-coding regions of the genome. We noticed that these large TRs are enriched in their proximity to Alu elements. The vast majority of these large TRs seem to be expansions of smaller TRs that are already present in the reference genome. We are providing this TR profile as a resource for comparison to undiagnosed rare disease genomes in order to detect novel disease-causing repeat expansions.
Collapse
Affiliation(s)
- Sarah Fazal
- Dr. John T. Macdonald Foundation Department of Human Genetics and John P. Hussman Institute for Human Genomics, University of Miami Miller School of Medicine, Miami, FL, USA
| | - Matt C Danzi
- Dr. John T. Macdonald Foundation Department of Human Genetics and John P. Hussman Institute for Human Genomics, University of Miami Miller School of Medicine, Miami, FL, USA
| | - Vivian P Cintra
- Dr. John T. Macdonald Foundation Department of Human Genetics and John P. Hussman Institute for Human Genomics, University of Miami Miller School of Medicine, Miami, FL, USA
| | - Dana M Bis-Brewer
- Dr. John T. Macdonald Foundation Department of Human Genetics and John P. Hussman Institute for Human Genomics, University of Miami Miller School of Medicine, Miami, FL, USA
| | | | | | - Stephan Zuchner
- Dr. John T. Macdonald Foundation Department of Human Genetics and John P. Hussman Institute for Human Genomics, University of Miami Miller School of Medicine, Miami, FL, USA.
| |
Collapse
|
123
|
Expanding genes, repeating themes and therapeutic schemes: The neurobiology of tandem repeat disorders. Neurobiol Dis 2020; 144:105053. [PMID: 32810583 DOI: 10.1016/j.nbd.2020.105053] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
|
124
|
Yagi Y, Yamada T, Nakatani K. Chemical Probing of Thymine in the TGG/CGG Triad to Explore the Deamination of 5-Methylcytosine in the CGG Repeat. Biochemistry 2020; 59:2679-2683. [PMID: 32628834 DOI: 10.1021/acs.biochem.0c00333] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
The methylation of cytosine in the full mutation of the expanded CGG repeat and subsequent deamination to thymine could be a measure of repeat instability. We report the synthesis of NCD-Bpy, which binds to the TGG/CGG site in the repeat hairpin. NCD-Bpy forces the thymine in the TGG/CGG site to flip out from the π-stack, recruits osmium tetroxide in the vicinity of the flipped-out T, and oxidizes the T. The piperidine-induced cleavage band successfully determined the position of the T in the expanded CGG repeat.
Collapse
Affiliation(s)
- Yuki Yagi
- Department of Regulatory Bioorganic Chemistry, The Institute of Scientific and Industrial Research (ISIR), Osaka University, Mihoga-oka, Ibaraki 567-0047, Japan
| | - Takeshi Yamada
- Department of Regulatory Bioorganic Chemistry, The Institute of Scientific and Industrial Research (ISIR), Osaka University, Mihoga-oka, Ibaraki 567-0047, Japan
| | - Kazuhiko Nakatani
- Department of Regulatory Bioorganic Chemistry, The Institute of Scientific and Industrial Research (ISIR), Osaka University, Mihoga-oka, Ibaraki 567-0047, Japan
| |
Collapse
|
125
|
Dolzhenko E, Deshpande V, Schlesinger F, Krusche P, Petrovski R, Chen S, Emig-Agius D, Gross A, Narzisi G, Bowman B, Scheffler K, van Vugt JJFA, French C, Sanchis-Juan A, Ibáñez K, Tucci A, Lajoie BR, Veldink JH, Raymond FL, Taft RJ, Bentley DR, Eberle MA. ExpansionHunter: a sequence-graph-based tool to analyze variation in short tandem repeat regions. Bioinformatics 2020; 35:4754-4756. [PMID: 31134279 PMCID: PMC6853681 DOI: 10.1093/bioinformatics/btz431] [Citation(s) in RCA: 175] [Impact Index Per Article: 43.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2019] [Revised: 04/26/2019] [Accepted: 05/23/2019] [Indexed: 12/16/2022] Open
Abstract
SUMMARY We describe a novel computational method for genotyping repeats using sequence graphs. This method addresses the long-standing need to accurately genotype medically important loci containing repeats adjacent to other variants or imperfect DNA repeats such as polyalanine repeats. Here we introduce a new version of our repeat genotyping software, ExpansionHunter, that uses this method to perform targeted genotyping of a broad class of such loci. AVAILABILITY AND IMPLEMENTATION ExpansionHunter is implemented in C++ and is available under the Apache License Version 2.0. The source code, documentation, and Linux/macOS binaries are available at https://github.com/Illumina/ExpansionHunter/. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
| | | | | | - Peter Krusche
- Illumina Cambridge Ltd, Illumina Centre, 19 Granta Park, Great Abington, Cambridge CB21 6DF, UK
| | - Roman Petrovski
- Illumina Cambridge Ltd, Illumina Centre, 19 Granta Park, Great Abington, Cambridge CB21 6DF, UK
| | - Sai Chen
- Illumina Inc., San Diego, CA 92122, USA
| | | | | | - Giuseppe Narzisi
- Computational Biology, New York Genome Center, New York, NY 10013, USA
| | | | | | - Joke J F A van Vugt
- UMC Utrecht Brain Center, Utrecht University, 3508 AB Utrecht, The Netherlands
| | - Courtney French
- Department of Medical Genetics, NHS Blood and Transplant Centre, Cambridge, CB2 0PT, UK
| | - Alba Sanchis-Juan
- Department of Haematology, University of Cambridge, NHS Blood and Transplant Centre, Cambridge, CB2 0PT, UK.,NIHR BioResource, Cambridge University Hospitals NHS Foundation Trust, Cambridge Biomedical Campus, Cambridge, CB2 0QQ, UK
| | - Kristina Ibáñez
- Genomics England, Queen Mary University London, London EC1M 6BQ, UK
| | - Arianna Tucci
- Genomics England, Queen Mary University London, London EC1M 6BQ, UK
| | | | - Jan H Veldink
- UMC Utrecht Brain Center, Utrecht University, 3508 AB Utrecht, The Netherlands
| | - F Lucy Raymond
- Department of Medical Genetics, NHS Blood and Transplant Centre, Cambridge, CB2 0PT, UK
| | | | - David R Bentley
- Illumina Cambridge Ltd, Illumina Centre, 19 Granta Park, Great Abington, Cambridge CB21 6DF, UK
| | | |
Collapse
|
126
|
Dolzhenko E, Bennett MF, Richmond PA, Trost B, Chen S, van Vugt JJFA, Nguyen C, Narzisi G, Gainullin VG, Gross AM, Lajoie BR, Taft RJ, Wasserman WW, Scherer SW, Veldink JH, Bentley DR, Yuen RKC, Bahlo M, Eberle MA. ExpansionHunter Denovo: a computational method for locating known and novel repeat expansions in short-read sequencing data. Genome Biol 2020; 21:102. [PMID: 32345345 PMCID: PMC7187524 DOI: 10.1186/s13059-020-02017-z] [Citation(s) in RCA: 103] [Impact Index Per Article: 25.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2019] [Accepted: 04/14/2020] [Indexed: 12/13/2022] Open
Abstract
Repeat expansions are responsible for over 40 monogenic disorders, and undoubtedly more pathogenic repeat expansions remain to be discovered. Existing methods for detecting repeat expansions in short-read sequencing data require predefined repeat catalogs. Recent discoveries emphasize the need for methods that do not require pre-specified candidate repeats. To address this need, we introduce ExpansionHunter Denovo, an efficient catalog-free method for genome-wide repeat expansion detection. Analysis of real and simulated data shows that our method can identify large expansions of 41 out of 44 pathogenic repeats, including nine recently reported non-reference repeat expansions not discoverable via existing methods.
Collapse
Affiliation(s)
- Egor Dolzhenko
- Illumina Inc., 5200 Illumina Way, San Diego, CA, 92122, USA
| | - Mark F Bennett
- Population Health and Immunity Division, The Walter and Eliza Hall Institute of Medical Research, 1G Royal Parade, Parkville, VIC, 3052, Australia.,Department of Medical Biology, The University of Melbourne, 1G Royal Parade, Parkville, VIC, 3052, Australia.,Epilepsy Research Centre, Department of Medicine, The University of Melbourne, Austin Health, 245 Burgundy Street, Heidelberg, VIC, 3084, Australia
| | - Phillip A Richmond
- Centre for Molecular Medicine and Therapeutics, BC Children's Hospital, University of British Columbia, Vancouver, BC, V5Z 4H4, Canada
| | - Brett Trost
- Genetics and Genome Biology, The Hospital for Sick Children, 686 Bay Street, Toronto, ON, M5G 0A4, Canada.,The Centre for Applied Genomics, The Hospital for Sick Children, 686 Bay Street, Toronto, ON, M5G 0A4, Canada
| | - Sai Chen
- Illumina Inc., 5200 Illumina Way, San Diego, CA, 92122, USA
| | - Joke J F A van Vugt
- Department of Neurology, UMC Utrecht Brain Center, Utrecht University, Universiteitsweg 100, 3584 CG, Utrecht, The Netherlands
| | - Charlotte Nguyen
- Genetics and Genome Biology, The Hospital for Sick Children, 686 Bay Street, Toronto, ON, M5G 0A4, Canada.,The Centre for Applied Genomics, The Hospital for Sick Children, 686 Bay Street, Toronto, ON, M5G 0A4, Canada.,Department of Molecular Genetics, University of Toronto, 1 King's College Circle, Toronto, ON, M5S 2E5, Canada
| | - Giuseppe Narzisi
- New York Genome Center, 101 Avenue of the Americas, New York, 10013, USA
| | | | - Andrew M Gross
- Illumina Inc., 5200 Illumina Way, San Diego, CA, 92122, USA
| | - Bryan R Lajoie
- Illumina Inc., 5200 Illumina Way, San Diego, CA, 92122, USA
| | - Ryan J Taft
- Illumina Inc., 5200 Illumina Way, San Diego, CA, 92122, USA
| | - Wyeth W Wasserman
- Centre for Molecular Medicine and Therapeutics, BC Children's Hospital, University of British Columbia, Vancouver, BC, V5Z 4H4, Canada
| | - Stephen W Scherer
- Genetics and Genome Biology, The Hospital for Sick Children, 686 Bay Street, Toronto, ON, M5G 0A4, Canada.,The Centre for Applied Genomics, The Hospital for Sick Children, 686 Bay Street, Toronto, ON, M5G 0A4, Canada.,Department of Molecular Genetics, University of Toronto, 1 King's College Circle, Toronto, ON, M5S 2E5, Canada.,The McLaughlin Centre, University of Toronto, 686 Bay Street, Toronto, ON, M5G 0A4, Canada
| | - Jan H Veldink
- Department of Neurology, UMC Utrecht Brain Center, Utrecht University, Universiteitsweg 100, 3584 CG, Utrecht, The Netherlands
| | - David R Bentley
- Illumina Cambridge Ltd, Illumina Centre, 19 Granta Park, Great Abington, Cambridge, CB21 6DF, UK
| | - Ryan K C Yuen
- Genetics and Genome Biology, The Hospital for Sick Children, 686 Bay Street, Toronto, ON, M5G 0A4, Canada.,The Centre for Applied Genomics, The Hospital for Sick Children, 686 Bay Street, Toronto, ON, M5G 0A4, Canada.,Department of Molecular Genetics, University of Toronto, 1 King's College Circle, Toronto, ON, M5S 2E5, Canada
| | - Melanie Bahlo
- Population Health and Immunity Division, The Walter and Eliza Hall Institute of Medical Research, 1G Royal Parade, Parkville, VIC, 3052, Australia.,Department of Medical Biology, The University of Melbourne, 1G Royal Parade, Parkville, VIC, 3052, Australia
| | | |
Collapse
|
127
|
Abstract
Epilepsy encompasses a group of heterogeneous brain diseases that affect more than 50 million people worldwide. Epilepsy may have discernible structural, infectious, metabolic, and immune etiologies; however, in most people with epilepsy, no obvious cause is identifiable. Based initially on family studies and later on advances in gene sequencing technologies and computational approaches, as well as the establishment of large collaborative initiatives, we now know that genetics plays a much greater role in epilepsy than was previously appreciated. Here, we review the progress in the field of epilepsy genetics and highlight molecular discoveries in the most important epilepsy groups, including those that have been long considered to have a nongenetic cause. We discuss where the field of epilepsy genetics is moving as it enters a new era in which the genetic architecture of common epilepsies is starting to be unraveled.
Collapse
Affiliation(s)
- Piero Perucca
- Department of Neuroscience, Central Clinical School, Monash University, Melbourne, Victoria 3000, Australia.,Departments of Medicine and Neurology, The Royal Melbourne Hospital, The University of Melbourne, Melbourne, Victoria 3050, Australia.,Department of Neurology, Alfred Health, Melbourne, Victoria 3000, Australia
| | - Melanie Bahlo
- Population Health and Immunity Division, The Walter and Eliza Hall Institute of Medical Research, Melbourne, Victoria 3052, Australia.,Department of Medical Biology, The University of Melbourne, Melbourne, Victoria 3010, Australia
| | - Samuel F Berkovic
- Epilepsy Research Centre, Department of Medicine, Austin Health, The University of Melbourne, Melbourne, Victoria 3084, Australia;
| |
Collapse
|
128
|
Abstract
Background: Short tandem repeats are an important source of genetic variation. They are highly mutable and repeat expansions are associated dozens of human disorders, such as Huntington's disease and spinocerebellar ataxias. Technical advantages in sequencing technology have made it possible to analyse these repeats at large scale; however, accurate genotyping is still a challenging task. We compared four different short tandem repeats genotyping tools on whole exome sequencing data to determine their genotyping performance and limits, which will aid other researchers in choosing a suitable tool and parameters for analysis. Methods: The analysis was performed on the Simons Simplex Collection dataset, where we used a novel method of evaluation with accuracy determined by the rate of homozygous calls on the X chromosome of male samples. In total we analysed 433 samples and around a million genotypes for evaluating tools on whole exome sequencing data. Results: We determined a relatively good performance of all tools when genotyping repeats of 3-6 bp in length, which could be improved with coverage and quality score filtering. However, genotyping homopolymers was challenging for all tools and a high error rate was present across different thresholds of coverage and quality scores. Interestingly, dinucleotide repeats displayed a high error rate as well, which was found to be mainly caused by the AC/TG repeats. Overall, LobSTR was able to make the most calls and was also the fastest tool, while RepeatSeq and HipSTR exhibited the lowest heterozygous error rate at low coverage. Conclusions: All tools have different strengths and weaknesses and the choice may depend on the application. In this analysis we demonstrated the effect of using different filtering parameters and offered recommendations based on the trade-off between the best accuracy of genotyping and the highest number of calls.
Collapse
Affiliation(s)
- Andreas Halman
- Murdoch Children’s Research Institute, Royal Children’s Hospital, Parkville, VIC, 3052, Australia
- Peter MacCallum Cancer Centre, 305 Grattan St, Melbourne, VIC, 3000, Australia
- Florey Institute of Neuroscience and Mental Health, University of Melbourne, Parkville, VIC, 3052, Australia
- School of Natural Sciences and Health, Tallinn University, Tallinn, 10120, Estonia
| | - Alicia Oshlack
- Murdoch Children’s Research Institute, Royal Children’s Hospital, Parkville, VIC, 3052, Australia
- Peter MacCallum Cancer Centre, 305 Grattan St, Melbourne, VIC, 3000, Australia
- School of BioSciences, University of Melbourne, Parkville, VIC, 3052, Australia
| |
Collapse
|
129
|
Bennett MF, Oliver KL, Regan BM, Bellows ST, Schneider AL, Rafehi H, Sikta N, Crompton DE, Coleman M, Hildebrand MS, Corbett MA, Kroes T, Gecz J, Scheffer IE, Berkovic SF, Bahlo M. Familial adult myoclonic epilepsy type 1 SAMD12 TTTCA repeat expansion arose 17,000 years ago and is present in Sri Lankan and Indian families. Eur J Hum Genet 2020; 28:973-978. [PMID: 32203200 DOI: 10.1038/s41431-020-0606-z] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2019] [Revised: 02/05/2020] [Accepted: 03/03/2020] [Indexed: 11/09/2022] Open
Abstract
Familial adult myoclonic epilepsy 1 (FAME1), first recognised in Japanese families, was recently shown to be caused by a TTTCA repeat insertion in intron 4 of SAMD12 on chromosome 8. We performed whole genome sequencing on two families with FAME, one of Sri Lankan origin and the other of Indian origin, and identified a TTTCA repeat insertion in SAMD12 in both families. Haplotype analysis revealed that both families shared the same core ancestral haplotype reported in Japanese and Chinese families with FAME1. Mutation dating, based on the length of shared haplotypes, estimated the age of the ancestral haplotype to be ~670 generations, or 17,000 years old. Our data extend the geographic range of this repeat expansion to Southern Asia and potentially implicate an even broader regional distribution given the age of the variant. This finding suggests patients of Asian ancestry with suspected FAME should be screened for the SAMD12 TTTCA expansion.
Collapse
Affiliation(s)
- Mark F Bennett
- Population Health and Immunity Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, VIC, 3052, Australia.,Department of Medical Biology, The University of Melbourne, Parkville, VIC, 3052, Australia.,Epilepsy Research Centre, Department of Medicine, The University of Melbourne, Austin Health, Heidelberg, VIC, 3084, Australia
| | - Karen L Oliver
- Population Health and Immunity Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, VIC, 3052, Australia.,Epilepsy Research Centre, Department of Medicine, The University of Melbourne, Austin Health, Heidelberg, VIC, 3084, Australia
| | - Brigid M Regan
- Epilepsy Research Centre, Department of Medicine, The University of Melbourne, Austin Health, Heidelberg, VIC, 3084, Australia
| | - Susannah T Bellows
- Epilepsy Research Centre, Department of Medicine, The University of Melbourne, Austin Health, Heidelberg, VIC, 3084, Australia
| | - Amy L Schneider
- Epilepsy Research Centre, Department of Medicine, The University of Melbourne, Austin Health, Heidelberg, VIC, 3084, Australia
| | - Haloom Rafehi
- Population Health and Immunity Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, VIC, 3052, Australia.,Department of Medical Biology, The University of Melbourne, Parkville, VIC, 3052, Australia.,Epilepsy Research Centre, Department of Medicine, The University of Melbourne, Austin Health, Heidelberg, VIC, 3084, Australia
| | - Neblina Sikta
- Population Health and Immunity Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, VIC, 3052, Australia
| | - Douglas E Crompton
- Epilepsy Research Centre, Department of Medicine, The University of Melbourne, Austin Health, Heidelberg, VIC, 3084, Australia.,Neurology Department, Northern Health, Melbourne, VIC, 3076, Australia
| | - Matthew Coleman
- Epilepsy Research Centre, Department of Medicine, The University of Melbourne, Austin Health, Heidelberg, VIC, 3084, Australia
| | - Michael S Hildebrand
- Epilepsy Research Centre, Department of Medicine, The University of Melbourne, Austin Health, Heidelberg, VIC, 3084, Australia.,Murdoch Children's Research Institute, Royal Children's Hospital, Parkville, VIC, 3052, Australia
| | - Mark A Corbett
- Robinson Research Institute & Adelaide Medical School, The University of Adelaide, Adelaide, SA, 5005, Australia
| | - Thessa Kroes
- Robinson Research Institute & Adelaide Medical School, The University of Adelaide, Adelaide, SA, 5005, Australia
| | - Jozef Gecz
- Robinson Research Institute & Adelaide Medical School, The University of Adelaide, Adelaide, SA, 5005, Australia.,South Australian Health and Medical Research Institute, Adelaide, SA, 5000, Australia
| | - Ingrid E Scheffer
- Epilepsy Research Centre, Department of Medicine, The University of Melbourne, Austin Health, Heidelberg, VIC, 3084, Australia.,Murdoch Children's Research Institute, Royal Children's Hospital, Parkville, VIC, 3052, Australia.,Department of Paediatrics, The University of Melbourne, Royal Children's Hospital, Parkville, VIC, 3052, Australia.,The Florey Institute, Parkville, VIC, 3052, Australia
| | - Samuel F Berkovic
- Epilepsy Research Centre, Department of Medicine, The University of Melbourne, Austin Health, Heidelberg, VIC, 3084, Australia
| | - Melanie Bahlo
- Population Health and Immunity Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, VIC, 3052, Australia. .,Department of Medical Biology, The University of Melbourne, Parkville, VIC, 3052, Australia.
| |
Collapse
|
130
|
Dolzhenko E, Deshpande V, Schlesinger F, Krusche P, Petrovski R, Chen S, Emig-Agius D, Gross A, Narzisi G, Bowman B, Scheffler K, van Vugt JJFA, French C, Sanchis-Juan A, Ibáñez K, Tucci A, Lajoie BR, Veldink JH, Raymond FL, Taft RJ, Bentley DR, Eberle MA. ExpansionHunter: a sequence-graph-based tool to analyze variation in short tandem repeat regions. BIOINFORMATICS (OXFORD, ENGLAND) 2019; 35:4754-4756. [PMID: 31134279 DOI: 10.1101/361162] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/10/2019] [Revised: 04/26/2019] [Accepted: 05/23/2019] [Indexed: 05/25/2023]
Abstract
SUMMARY We describe a novel computational method for genotyping repeats using sequence graphs. This method addresses the long-standing need to accurately genotype medically important loci containing repeats adjacent to other variants or imperfect DNA repeats such as polyalanine repeats. Here we introduce a new version of our repeat genotyping software, ExpansionHunter, that uses this method to perform targeted genotyping of a broad class of such loci. AVAILABILITY AND IMPLEMENTATION ExpansionHunter is implemented in C++ and is available under the Apache License Version 2.0. The source code, documentation, and Linux/macOS binaries are available at https://github.com/Illumina/ExpansionHunter/. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
| | | | | | - Peter Krusche
- Illumina Cambridge Ltd, Illumina Centre, 19 Granta Park, Great Abington, Cambridge CB21 6DF, UK
| | - Roman Petrovski
- Illumina Cambridge Ltd, Illumina Centre, 19 Granta Park, Great Abington, Cambridge CB21 6DF, UK
| | - Sai Chen
- Illumina Inc., San Diego, CA 92122, USA
| | | | | | - Giuseppe Narzisi
- Computational Biology, New York Genome Center, New York, NY 10013, USA
| | | | | | - Joke J F A van Vugt
- UMC Utrecht Brain Center, Utrecht University, 3508 AB Utrecht, The Netherlands
| | - Courtney French
- Department of Medical Genetics, NHS Blood and Transplant Centre, Cambridge, CB2 0PT, UK
| | - Alba Sanchis-Juan
- Department of Haematology, University of Cambridge, NHS Blood and Transplant Centre, Cambridge, CB2 0PT, UK
- NIHR BioResource, Cambridge University Hospitals NHS Foundation Trust, Cambridge Biomedical Campus, Cambridge, CB2 0QQ, UK
| | - Kristina Ibáñez
- Genomics England, Queen Mary University London, London EC1M 6BQ, UK
| | - Arianna Tucci
- Genomics England, Queen Mary University London, London EC1M 6BQ, UK
| | | | - Jan H Veldink
- UMC Utrecht Brain Center, Utrecht University, 3508 AB Utrecht, The Netherlands
| | - F Lucy Raymond
- Department of Medical Genetics, NHS Blood and Transplant Centre, Cambridge, CB2 0PT, UK
| | | | - David R Bentley
- Illumina Cambridge Ltd, Illumina Centre, 19 Granta Park, Great Abington, Cambridge CB21 6DF, UK
| | | |
Collapse
|
131
|
Abstract
A wave of technologies transformed sequencing over a decade ago into the high-throughput era, demanding research in new computational methods to analyze these data. The applications of these sequencing technologies have continuously expanded since then. The RECOMB Satellite Workshop on Massively Parallel Sequencing (RECOMB-Seq) meeting, established in 2011, brings together leading researchers in computational genomics and genomic biology to discuss emerging frontiers in algorithm development for massively parallel sequencing data. The ninth edition of this workshop was held in Washington, DC, in George Washington University on May 3 and 4, 2019. There was an exploration of several traditional topics in sequence analysis, including genome assembly, sequence alignment, and data compression, and development of methods for new sequencing technologies, including linked reads and single-molecule long-read sequencing. Here we revisit these topics and discuss the current status and perspectives of sequencing technologies and analyses.
Collapse
Affiliation(s)
- Vikas Bansal
- Department of Pediatrics, School of Medicine, University of California, San Diego, La Jolla, CA, USA
| | - Christina Boucher
- Department of Computer and Information Science and Engineering, University of Florida, Gainesville, FL, USA
| |
Collapse
|