1
|
Cui Y, Arnold FJ, Li JS, Wu J, Wang D, Philippe J, Colwin MR, Michels S, Chen C, Sallam T, Thompson LM, La Spada AR, Li W. Multi-omic quantitative trait loci link tandem repeat size variation to gene regulation in human brain. Nat Genet 2025:10.1038/s41588-024-02057-2. [PMID: 39809899 DOI: 10.1038/s41588-024-02057-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2024] [Accepted: 12/10/2024] [Indexed: 01/16/2025]
Abstract
Tandem repeat (TR) size variation is implicated in ~50 neurological disorders, yet its impact on gene regulation in the human brain remains largely unknown. In the present study, we quantified the impact of TR size variation on brain gene regulation across distinct molecular phenotypes, based on 4,412 multi-omics samples from 1,597 donors, including 1,586 newly sequenced ones. We identified ~2.2 million TR molecular quantitative trait loci (TR-xQTLs), linking ~139,000 unique TRs to nearby molecular phenotypes, including many known disease-risk TRs, such as the G2C4 expansion in C9orf72 associated with amyotrophic lateral sclerosis. Fine-mapping revealed ~18,700 TRs as potential causal variants. Our in vitro experiments further confirmed the causal and independent regulatory effects of three TRs. Additional colocalization analysis indicated the potential causal role of TR variation in brain-related phenotypes, highlighted by a 3'-UTR TR in NUDT14 linked to cortical surface area and a TG repeat in PLEKHA1, associated with Alzheimer's disease.
Collapse
Affiliation(s)
- Ya Cui
- Division of Computational Biomedicine, Department of Biological Chemistry, University of California, Irvine, Irvine, CA, USA.
| | - Frederick J Arnold
- Departments of Pathology & Laboratory Medicine, Neurology, Biological Chemistry, and Neurobiology & Behavior, University of California, Irvine, Irvine, CA, USA
| | - Jason Sheng Li
- Division of Computational Biomedicine, Department of Biological Chemistry, University of California, Irvine, Irvine, CA, USA
| | - Jie Wu
- Departments of Psychiatry and Human Behavior, Neurobiology and Behavior, and Biological Chemistry, University of California, Irvine, Irvine, CA, USA
| | - Dan Wang
- Division of Cardiology, Department of Medicine, University of California, Los Angeles, Los Angeles, CA, USA
| | - Julien Philippe
- Departments of Pathology & Laboratory Medicine, Neurology, Biological Chemistry, and Neurobiology & Behavior, University of California, Irvine, Irvine, CA, USA
| | - Michael R Colwin
- Departments of Pathology & Laboratory Medicine, Neurology, Biological Chemistry, and Neurobiology & Behavior, University of California, Irvine, Irvine, CA, USA
| | - Sebastian Michels
- Departments of Pathology & Laboratory Medicine, Neurology, Biological Chemistry, and Neurobiology & Behavior, University of California, Irvine, Irvine, CA, USA
- Department of Neurology, University of Ulm, Oberer Eselsberg, Ulm, Germany
| | - Chaorong Chen
- Division of Computational Biomedicine, Department of Biological Chemistry, University of California, Irvine, Irvine, CA, USA
| | - Tamer Sallam
- Division of Cardiology, Department of Medicine, University of California, Los Angeles, Los Angeles, CA, USA
| | - Leslie M Thompson
- Departments of Psychiatry and Human Behavior, Neurobiology and Behavior, and Biological Chemistry, University of California, Irvine, Irvine, CA, USA.
| | - Albert R La Spada
- Departments of Pathology & Laboratory Medicine, Neurology, Biological Chemistry, and Neurobiology & Behavior, University of California, Irvine, Irvine, CA, USA.
- UCI Center for Neurotherapeutics, University of California, Irvine, Irvine, CA, USA.
| | - Wei Li
- Division of Computational Biomedicine, Department of Biological Chemistry, University of California, Irvine, Irvine, CA, USA.
| |
Collapse
|
2
|
Kim JH, Koh IG, Lee H, Lee GH, Song DY, Kim SW, Kim Y, Han JH, Bong G, Lee J, Byun H, Son JH, Kim YR, Lee Y, Kim JJ, Park JW, Kim IB, Choi JK, Jang JH, Trost B, Lee J, Kim E, Yoo HJ, An JY. Short tandem repeat expansions in cortical layer-specific genes implicate in phenotypic severity and adaptability of autism spectrum disorder. Psychiatry Clin Neurosci 2024; 78:405-415. [PMID: 38751214 PMCID: PMC11488627 DOI: 10.1111/pcn.13676] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/05/2023] [Revised: 02/14/2024] [Accepted: 04/15/2024] [Indexed: 07/06/2024]
Abstract
AIM Short tandem repeats (STRs) are repetitive DNA sequences and highly mutable in various human disorders. While the involvement of STRs in various genetic disorders has been extensively studied, their role in autism spectrum disorder (ASD) remains largely unexplored. In this study, we aimed to investigate genetic association of STR expansions with ASD using whole genome sequencing (WGS) and identify risk loci associated with ASD phenotypes. METHODS We analyzed WGS data of 634 ASD families and performed genome-wide evaluation for 12,929 STR loci. We found rare STR expansions that exceeded normal repeat lengths in autism cases compared to unaffected controls. By integrating single cell RNA and ATAC sequencing datasets of human postmortem brains, we prioritized STR loci in genes specifically expressed in cortical development stages. A deep learning method was used to predict functionality of ASD-associated STR loci. RESULTS In ASD cases, rare STR expansions predominantly occurred in early cortical layer-specific genes involved in neurodevelopment, highlighting the cellular specificity of STR-associated genes in ASD risk. Leveraging deep learning prediction models, we demonstrated that these STR expansions disrupted the regulatory activity of enhancers and promoters, suggesting a potential mechanism through which they contribute to ASD pathogenesis. We found that individuals with ASD-associated STR expansions exhibited more severe ASD phenotypes and diminished adaptability compared to non-carriers. CONCLUSION Short tandem repeat expansions in cortical layer-specific genes are associated with ASD and could potentially be a risk genetic factor for ASD. Our study is the first to show evidence of STR expansion associated with ASD in an under-investigated population.
Collapse
Affiliation(s)
- Jae Hyun Kim
- Department of Integrated Biomedical and Life Science, Korea University, Seoul, Republic of Korea
- L-HOPE Program for Community-Based Total Learning Health Systems, Korea University, Seoul, Republic of Korea
| | - In Gyeong Koh
- Department of Integrated Biomedical and Life Science, Korea University, Seoul, Republic of Korea
- L-HOPE Program for Community-Based Total Learning Health Systems, Korea University, Seoul, Republic of Korea
| | - Hyeji Lee
- Department of Integrated Biomedical and Life Science, Korea University, Seoul, Republic of Korea
- L-HOPE Program for Community-Based Total Learning Health Systems, Korea University, Seoul, Republic of Korea
| | - Gang-Hee Lee
- Department of Integrated Biomedical and Life Science, Korea University, Seoul, Republic of Korea
- L-HOPE Program for Community-Based Total Learning Health Systems, Korea University, Seoul, Republic of Korea
| | - Da-Yea Song
- Department of Psychiatry, Seoul National University Bundang Hospital, Seongnam, Republic of Korea
- Department of Psychiatry, Seoul National University College of Medicine, Seoul, Republic of Korea
| | - Soo-Whee Kim
- Department of Integrated Biomedical and Life Science, Korea University, Seoul, Republic of Korea
- L-HOPE Program for Community-Based Total Learning Health Systems, Korea University, Seoul, Republic of Korea
| | - Yujin Kim
- Department of Integrated Biomedical and Life Science, Korea University, Seoul, Republic of Korea
- L-HOPE Program for Community-Based Total Learning Health Systems, Korea University, Seoul, Republic of Korea
| | - Jae Hyun Han
- Department of Psychiatry, Seoul National University Bundang Hospital, Seongnam, Republic of Korea
- Department of Psychiatry, College of Medicine, Soonchunhyang University Cheonan Hospital, Cheonan, Republic of Korea
| | - Guiyoung Bong
- Department of Psychiatry, Seoul National University College of Medicine, Seoul, Republic of Korea
| | - Jeewon Lee
- Department of Psychiatry, Soonchunhyang University College of Medicine, Asan, Republic of Korea
| | - Heejung Byun
- Department of Neuropsychiatry, Seoul Metropolitan Children's Hospital, Seoul, Republic of Korea
| | - Ji Hyun Son
- Department of Neuropsychiatry, Seoul Metropolitan Children's Hospital, Seoul, Republic of Korea
| | - Ye Rim Kim
- Department of Psychiatry, Seoul National University Bundang Hospital, Seongnam, Republic of Korea
- Department of Psychiatry, Seoul National University College of Medicine, Seoul, Republic of Korea
| | - Yoojeong Lee
- Department of Psychiatry, Seoul National University Bundang Hospital, Seongnam, Republic of Korea
| | - Justine Jaewon Kim
- Department of Psychiatry, Seoul National University Bundang Hospital, Seongnam, Republic of Korea
| | - Jung Woo Park
- Center for Biomedical Computing, Division of National Supercomputing, Korea Institute of Science and Technology Information, Daejeon, Republic of Korea
| | - Il Bin Kim
- Department of Psychiatry, Hanyang University Guri Hospital, Guri, Republic of Korea
| | - Jung Kyoon Choi
- Department of Bio and Brain Engineering, Korea Advanced Institute of Science and Technology, Daejeon, Republic of Korea
| | - Ja-Hyun Jang
- Department of Laboratory Medicine and Genetics, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, Republic of Korea
| | - Brett Trost
- Molecular Medicine Program, The Hospital for Sick Children, Toronto, Ontario, Canada
- Genetics and Genome Biology Program, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Junehawk Lee
- Center for Biomedical Computing, Division of National Supercomputing, Korea Institute of Science and Technology Information, Daejeon, Republic of Korea
| | - Eunjoon Kim
- Center for Synaptic Brain Dysfunctions, Institute for Basic Science, Daejeon, Republic of Korea
- Department of Biological Sciences, Korea Advanced Institute of Science and Technology, Daejeon, Republic of Korea
| | - Hee Jeong Yoo
- Department of Psychiatry, Seoul National University Bundang Hospital, Seongnam, Republic of Korea
- Department of Psychiatry, Seoul National University College of Medicine, Seoul, Republic of Korea
| | - Joon-Yong An
- Department of Integrated Biomedical and Life Science, Korea University, Seoul, Republic of Korea
- L-HOPE Program for Community-Based Total Learning Health Systems, Korea University, Seoul, Republic of Korea
- School of Biosystem and Biomedical Science, College of Health Science, Korea University, Seoul, Republic of Korea
| |
Collapse
|
3
|
Tanudisastro HA, Deveson IW, Dashnow H, MacArthur DG. Sequencing and characterizing short tandem repeats in the human genome. Nat Rev Genet 2024; 25:460-475. [PMID: 38366034 DOI: 10.1038/s41576-024-00692-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/06/2023] [Indexed: 02/18/2024]
Abstract
Short tandem repeats (STRs) are highly polymorphic sequences throughout the human genome that are composed of repeated copies of a 1-6-bp motif. Over 1 million variable STR loci are known, some of which regulate gene expression and influence complex traits, such as height. Moreover, variants in at least 60 STR loci cause genetic disorders, including Huntington disease and fragile X syndrome. Accurately identifying and genotyping STR variants is challenging, in particular mapping short reads to repetitive regions and inferring expanded repeat lengths. Recent advances in sequencing technology and computational tools for STR genotyping from sequencing data promise to help overcome this challenge and solve genetically unresolved cases and the 'missing heritability' of polygenic traits. Here, we compare STR genotyping methods, analytical tools and their applications to understand the effect of STR variation on health and disease. We identify emergent opportunities to refine genotyping and quality-control approaches as well as to integrate STRs into variant-calling workflows and large cohort analyses.
Collapse
Affiliation(s)
- Hope A Tanudisastro
- Centre for Population Genomics, Garvan Institute of Medical Research, Sydney, New South Wales, Australia
- Centre for Population Genomics, Murdoch Children's Research Institute, Melbourne, Victoria, Australia
- Faculty of Medicine and Health, University of New South Wales, Sydney, New South Wales, Australia
- Faculty of Medicine and Health, University of Sydney, Sydney, New South Wales, Australia
| | - Ira W Deveson
- Faculty of Medicine and Health, University of New South Wales, Sydney, New South Wales, Australia
- Genomics and Inherited Disease Program, Garvan Institute of Medical Research, Sydney, New South Wales, Australia
| | - Harriet Dashnow
- Department of Human Genetics, University of Utah, Salt Lake City, UT, USA.
| | - Daniel G MacArthur
- Centre for Population Genomics, Garvan Institute of Medical Research, Sydney, New South Wales, Australia.
- Centre for Population Genomics, Murdoch Children's Research Institute, Melbourne, Victoria, Australia.
- Faculty of Medicine and Health, University of New South Wales, Sydney, New South Wales, Australia.
| |
Collapse
|
4
|
Rajan-Babu IS, Dolzhenko E, Eberle MA, Friedman JM. Sequence composition changes in short tandem repeats: heterogeneity, detection, mechanisms and clinical implications. Nat Rev Genet 2024; 25:476-499. [PMID: 38467784 DOI: 10.1038/s41576-024-00696-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/19/2024] [Indexed: 03/13/2024]
Abstract
Short tandem repeats (STRs) are a class of repetitive elements, composed of tandem arrays of 1-6 base pair sequence motifs, that comprise a substantial fraction of the human genome. STR expansions can cause a wide range of neurological and neuromuscular conditions, known as repeat expansion disorders, whose age of onset, severity, penetrance and/or clinical phenotype are influenced by the length of the repeats and their sequence composition. The presence of non-canonical motifs, depending on the type, frequency and position within the repeat tract, can alter clinical outcomes by modifying somatic and intergenerational repeat stability, gene expression and mutant transcript-mediated and/or protein-mediated toxicities. Here, we review the diverse structural conformations of repeat expansions, technological advances for the characterization of changes in sequence composition, their clinical correlations and the impact on disease mechanisms.
Collapse
Affiliation(s)
- Indhu-Shree Rajan-Babu
- Department of Medical Genetics, The University of British Columbia, and Children's & Women's Hospital, Vancouver, British Columbia, Canada.
| | | | | | - Jan M Friedman
- Department of Medical Genetics, The University of British Columbia, and Children's & Women's Hospital, Vancouver, British Columbia, Canada
- BC Children's Hospital Research Institute, Vancouver, British Columbia, Canada
| |
Collapse
|
5
|
Fehlings DL, Zarrei M, Engchuan W, Sondheimer N, Thiruvahindrapuram B, MacDonald JR, Higginbotham EJ, Thapa R, Behlim T, Aimola S, Switzer L, Ng P, Wei J, Danthi PS, Pellecchia G, Lamoureux S, Ho K, Pereira SL, de Rijke J, Sung WWL, Mowjoodi A, Howe JL, Nalpathamkalam T, Manshaei R, Ghaffari S, Whitney J, Patel RV, Hamdan O, Shaath R, Trost B, Knights S, Samdup D, McCormick A, Hunt C, Kirton A, Kawamura A, Mesterman R, Gorter JW, Dlamini N, Merico D, Hilali M, Hirschfeld K, Grover K, Bautista NX, Han K, Marshall CR, Yuen RKC, Subbarao P, Azad MB, Turvey SE, Mandhane P, Moraes TJ, Simons E, Maxwell G, Shevell M, Costain G, Michaud JL, Hamdan FF, Gauthier J, Uguen K, Stavropoulos DJ, Wintle RF, Oskoui M, Scherer SW. Comprehensive whole-genome sequence analyses provide insights into the genomic architecture of cerebral palsy. Nat Genet 2024; 56:585-594. [PMID: 38553553 DOI: 10.1038/s41588-024-01686-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2022] [Accepted: 02/13/2024] [Indexed: 04/17/2024]
Abstract
We performed whole-genome sequencing (WGS) in 327 children with cerebral palsy (CP) and their biological parents. We classified 37 of 327 (11.3%) children as having pathogenic/likely pathogenic (P/LP) variants and 58 of 327 (17.7%) as having variants of uncertain significance. Multiple classes of P/LP variants included single-nucleotide variants (SNVs)/indels (6.7%), copy number variations (3.4%) and mitochondrial mutations (1.5%). The COL4A1 gene had the most P/LP SNVs. We also analyzed two pediatric control cohorts (n = 203 trios and n = 89 sib-pair families) to provide a baseline for de novo mutation rates and genetic burden analyses, the latter of which demonstrated associations between de novo deleterious variants and genes related to the nervous system. An enrichment analysis revealed previously undescribed plausible candidate CP genes (SMOC1, KDM5B, BCL11A and CYP51A1). A multifactorial CP risk profile and substantial presence of P/LP variants combine to support WGS in the diagnostic work-up across all CP and related phenotypes.
Collapse
Affiliation(s)
- Darcy L Fehlings
- Division of Developmental Paediatrics, Holland Bloorview Kids Rehabilitation Hospital, Toronto, Ontario, Canada
- Department of Paediatrics, Faculty of Medicine, University of Toronto, Toronto, Ontario, Canada
| | - Mehdi Zarrei
- The Centre for Applied Genomics, The Hospital for Sick Children, Toronto, Ontario, Canada
- Program in Genetics and Genome Biology, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Worrawat Engchuan
- The Centre for Applied Genomics, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Neal Sondheimer
- Department of Paediatrics, Faculty of Medicine, University of Toronto, Toronto, Ontario, Canada
- Program in Genetics and Genome Biology, The Hospital for Sick Children, Toronto, Ontario, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
| | | | - Jeffrey R MacDonald
- The Centre for Applied Genomics, The Hospital for Sick Children, Toronto, Ontario, Canada
- Program in Genetics and Genome Biology, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Edward J Higginbotham
- The Centre for Applied Genomics, The Hospital for Sick Children, Toronto, Ontario, Canada
- Genome Diagnostics, Department of Paediatric Laboratory Medicine, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Ritesh Thapa
- Division of Developmental Paediatrics, Holland Bloorview Kids Rehabilitation Hospital, Toronto, Ontario, Canada
| | - Tarannum Behlim
- Centre for Outcomes Research and Evaluation, Research Institute of the McGill University Health Centre, Montréal, Québec, Canada
| | - Sabrina Aimola
- Division of Developmental Paediatrics, Holland Bloorview Kids Rehabilitation Hospital, Toronto, Ontario, Canada
| | - Lauren Switzer
- Division of Developmental Paediatrics, Holland Bloorview Kids Rehabilitation Hospital, Toronto, Ontario, Canada
| | - Pamela Ng
- Centre for Outcomes Research and Evaluation, Research Institute of the McGill University Health Centre, Montréal, Québec, Canada
| | - John Wei
- The Centre for Applied Genomics, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Prakroothi S Danthi
- The Centre for Applied Genomics, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Giovanna Pellecchia
- The Centre for Applied Genomics, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Sylvia Lamoureux
- The Centre for Applied Genomics, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Karen Ho
- The Centre for Applied Genomics, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Sergio L Pereira
- The Centre for Applied Genomics, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Jill de Rijke
- The Centre for Applied Genomics, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Wilson W L Sung
- The Centre for Applied Genomics, The Hospital for Sick Children, Toronto, Ontario, Canada
- Program in Genetics and Genome Biology, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Alireza Mowjoodi
- The Centre for Applied Genomics, The Hospital for Sick Children, Toronto, Ontario, Canada
- Program in Genetics and Genome Biology, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Jennifer L Howe
- The Centre for Applied Genomics, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Thomas Nalpathamkalam
- The Centre for Applied Genomics, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Roozbeh Manshaei
- The Centre for Applied Genomics, The Hospital for Sick Children, Toronto, Ontario, Canada
- Ted Rogers Centre for Heart Research, Cardiac Genome Clinic, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Siavash Ghaffari
- The Centre for Applied Genomics, The Hospital for Sick Children, Toronto, Ontario, Canada
- Program in Genetics and Genome Biology, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Joseph Whitney
- The Centre for Applied Genomics, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Rohan V Patel
- The Centre for Applied Genomics, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Omar Hamdan
- The Centre for Applied Genomics, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Rulan Shaath
- The Centre for Applied Genomics, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Brett Trost
- The Centre for Applied Genomics, The Hospital for Sick Children, Toronto, Ontario, Canada
- Program in Genetics and Genome Biology, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Shannon Knights
- Department of Paediatrics, Faculty of Medicine, University of Toronto, Toronto, Ontario, Canada
- Grandview Children's Centre, Oshawa, Ontario, Canada
| | - Dawa Samdup
- Department of Pediatrics, Queen's University, Kingston, Ontario, Canada
| | - Anna McCormick
- Children's Hospital of Eastern Ontario and University of Ottawa, Ottawa, Ontario, Canada
| | - Carolyn Hunt
- Grandview Children's Centre, Oshawa, Ontario, Canada
| | - Adam Kirton
- Department of Pediatrics, Department of Clinical Neuroscience, University of Calgary, Calgary, Alberta, Canada
| | - Anne Kawamura
- Division of Developmental Paediatrics, Holland Bloorview Kids Rehabilitation Hospital, Toronto, Ontario, Canada
- Department of Paediatrics, Faculty of Medicine, University of Toronto, Toronto, Ontario, Canada
| | - Ronit Mesterman
- Department of Pediatrics, McMaster University, Hamilton, Ontario, Canada
| | - Jan Willem Gorter
- Department of Pediatrics, McMaster University, Hamilton, Ontario, Canada
| | - Nomazulu Dlamini
- Department of Paediatrics, Faculty of Medicine, University of Toronto, Toronto, Ontario, Canada
- Division of Neurology, The Hospital for Sick Children, Toronto, Ontario, Canada
- Neurosciences and Mental Health Program, Research Institute, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Daniele Merico
- The Centre for Applied Genomics, The Hospital for Sick Children, Toronto, Ontario, Canada
- Deep Genomics Inc., Toronto, Ontario, Canada
- Vevo Therapeutics Inc., San Francisco, CA, USA
| | - Murto Hilali
- The Centre for Applied Genomics, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Kyle Hirschfeld
- The Centre for Applied Genomics, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Kritika Grover
- The Centre for Applied Genomics, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Nelson X Bautista
- The Centre for Applied Genomics, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Kara Han
- The Centre for Applied Genomics, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Christian R Marshall
- Genome Diagnostics, Department of Paediatric Laboratory Medicine, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Ryan K C Yuen
- The Centre for Applied Genomics, The Hospital for Sick Children, Toronto, Ontario, Canada
- Program in Genetics and Genome Biology, The Hospital for Sick Children, Toronto, Ontario, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
| | - Padmaja Subbarao
- Department of Paediatrics, Faculty of Medicine, University of Toronto, Toronto, Ontario, Canada
- Department of Pediatrics, McMaster University, Hamilton, Ontario, Canada
- Department of Physiology, University of Toronto, Toronto, Ontario, Canada
| | - Meghan B Azad
- Department of Pediatrics and Child Health, University of Manitoba, Winnipeg, Manitoba, Canada
| | - Stuart E Turvey
- Department of Pediatrics, BC Children's Hospital, University of British Columbia, Vancouver, British Columbia, Canada
| | - Piush Mandhane
- Faculty of Medicine & Dentistry, Pediatrics Department, University of Alberta, Edmonton, Alberta, Canada
| | - Theo J Moraes
- Department of Paediatrics, Faculty of Medicine, University of Toronto, Toronto, Ontario, Canada
- Program in Translation Medicine & Division of Respiratory Medicine, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Elinor Simons
- Department of Pediatrics and Child Health, Section of Allergy and Clinical Immunology, University of Manitoba, Children's Hospital Research Institute of Manitoba, Winnipeg, Manitoba, Canada
| | - George Maxwell
- Women's Health Integrated Research Center, Inova Women's Service Line, Inova Health System, Falls Church, VA, USA
| | - Michael Shevell
- Centre for Outcomes Research and Evaluation, Research Institute of the McGill University Health Centre, Montréal, Québec, Canada
- Departments of Pediatrics and Department of Neurology and Neurosurgery, McGill University, Montréal, Québec, Canada
| | - Gregory Costain
- Department of Paediatrics, Faculty of Medicine, University of Toronto, Toronto, Ontario, Canada
- Program in Genetics and Genome Biology, The Hospital for Sick Children, Toronto, Ontario, Canada
- Genome Diagnostics, Department of Paediatric Laboratory Medicine, The Hospital for Sick Children, Toronto, Ontario, Canada
- Division of Clinical and Metabolic Genetics, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Jacques L Michaud
- Departments of Pediatrics and Neurosciences, Université de Montréal, Montréal, Québec, Canada
- CHU Sainte-Justine Azrieli Research Center, Montréal, Québec, Canada
| | - Fadi F Hamdan
- CHU Sainte-Justine Azrieli Research Center, Montréal, Québec, Canada
- Department of Pediatrics, Université de Montréal, Montréal, Québec, Canada
| | - Julie Gauthier
- CHU Sainte-Justine Azrieli Research Center, Montréal, Québec, Canada
- Department of Pediatrics, Université de Montréal, Montréal, Québec, Canada
| | - Kevin Uguen
- CHU Sainte-Justine Azrieli Research Center, Montréal, Québec, Canada
| | - Dimitri J Stavropoulos
- Genome Diagnostics, Department of Paediatric Laboratory Medicine, The Hospital for Sick Children, Toronto, Ontario, Canada
- Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, Ontario, Canada
| | - Richard F Wintle
- The Centre for Applied Genomics, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Maryam Oskoui
- Centre for Outcomes Research and Evaluation, Research Institute of the McGill University Health Centre, Montréal, Québec, Canada
- Departments of Pediatrics and Department of Neurology and Neurosurgery, McGill University, Montréal, Québec, Canada
| | - Stephen W Scherer
- The Centre for Applied Genomics, The Hospital for Sick Children, Toronto, Ontario, Canada.
- Program in Genetics and Genome Biology, The Hospital for Sick Children, Toronto, Ontario, Canada.
- Department of Molecular Genetics and McLaughlin Centre, University of Toronto, Toronto, Ontario, Canada.
| |
Collapse
|
6
|
Sanchez-Flores M, Corral-Juan M, Gasch-Navalón E, Cirillo D, Sanchez I, Matilla-Dueñas A. Novel genotype-phenotype correlations, differential cerebellar allele-specific methylation, and a common origin of the (ATTTC) n insertion in spinocerebellar ataxia type 37. Hum Genet 2024; 143:211-232. [PMID: 38396267 PMCID: PMC11043136 DOI: 10.1007/s00439-024-02644-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2023] [Accepted: 01/17/2024] [Indexed: 02/25/2024]
Abstract
Spinocerebellar ataxia subtype 37 (SCA37) is a rare disease originally identified in ataxia patients from the Iberian Peninsula with a pure cerebellar syndrome. SCA37 patients carry a pathogenic intronic (ATTTC)n repeat insertion flanked by two polymorphic (ATTTT)n repeats in the Disabled-1 (DAB1) gene leading to cerebellar dysregulation. Herein, we determine the precise configuration of the pathogenic 5'(ATTTT)n-(ATTTC)n-3'(ATTTT)n SCA37 alleles by CRISPR-Cas9 and long-read nanopore sequencing, reveal their epigenomic signatures in SCA37 lymphocytes, fibroblasts, and cerebellar samples, and establish new molecular and clinical correlations. The 5'(ATTTT)n-(ATTTC)n-3'(ATTTT)n pathogenic allele configurations revealed repeat instability and differential methylation signatures. Disease age of onset negatively correlated with the (ATTTC)n, and positively correlated with the 3'(ATTTT)n. Geographic origin and gender significantly correlated with age of onset. Furthermore, significant predictive regression models were obtained by machine learning for age of onset and disease evolution by considering gender, the (ATTTC)n, the 3'(ATTTT)n, and seven CpG positions differentially methylated in SCA37 cerebellum. A common 964-kb genomic region spanning the (ATTTC)n insertion was identified in all SCA37 patients analysed from Portugal and Spain, evidencing a common origin of the SCA37 mutation in the Iberian Peninsula originating 859 years ago (95% CI 647-1378). In conclusion, we demonstrate an accurate determination of the size and configuration of the regulatory 5'(ATTTT)n-(ATTTC)n-3'(ATTTT)n repeat tract, avoiding PCR bias amplification using CRISPR/Cas9-enrichment and nanopore long-read sequencing, resulting relevant for accurate genetic diagnosis of SCA37. Moreover, we determine novel significant genotype-phenotype correlations in SCA37 and identify differential cerebellar allele-specific methylation signatures that may underlie DAB1 pathogenic dysregulation.
Collapse
Affiliation(s)
- Marina Sanchez-Flores
- Neurogenetics Unit, Department of Neuroscience, Germans Trias i Pujol Research Institute (IGTP), Universitat Autònoma de Barcelona-Can Ruti Campus, Carretera de Can Ruti, Camí de les Escoles s/n, 08916, Badalona, Spain
| | - Marc Corral-Juan
- Neurogenetics Unit, Department of Neuroscience, Germans Trias i Pujol Research Institute (IGTP), Universitat Autònoma de Barcelona-Can Ruti Campus, Carretera de Can Ruti, Camí de les Escoles s/n, 08916, Badalona, Spain
| | - Esther Gasch-Navalón
- Neurogenetics Unit, Department of Neuroscience, Germans Trias i Pujol Research Institute (IGTP), Universitat Autònoma de Barcelona-Can Ruti Campus, Carretera de Can Ruti, Camí de les Escoles s/n, 08916, Badalona, Spain
| | | | - Ivelisse Sanchez
- Neurogenetics Unit, Department of Neuroscience, Germans Trias i Pujol Research Institute (IGTP), Universitat Autònoma de Barcelona-Can Ruti Campus, Carretera de Can Ruti, Camí de les Escoles s/n, 08916, Badalona, Spain
| | - Antoni Matilla-Dueñas
- Neurogenetics Unit, Department of Neuroscience, Germans Trias i Pujol Research Institute (IGTP), Universitat Autònoma de Barcelona-Can Ruti Campus, Carretera de Can Ruti, Camí de les Escoles s/n, 08916, Badalona, Spain.
| |
Collapse
|
7
|
Mitina A, Khan M, Lesurf R, Yin Y, Engchuan W, Hamdan O, Pellecchia G, Trost B, Backstrom I, Guo K, Pallotto LM, Lam Doong PH, Wang Z, Nalpathamkalam T, Thiruvahindrapuram B, Papaz T, Pearson CE, Ragoussis J, Subbarao P, Azad MB, Turvey SE, Mandhane P, Moraes TJ, Simons E, Scherer SW, Lougheed J, Mondal T, Smythe J, Altamirano-Diaz L, Oechslin E, Mital S, Yuen RKC. Genome-wide enhancer-associated tandem repeats are expanded in cardiomyopathy. EBioMedicine 2024; 101:105027. [PMID: 38418263 PMCID: PMC10944212 DOI: 10.1016/j.ebiom.2024.105027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Revised: 02/05/2024] [Accepted: 02/06/2024] [Indexed: 03/01/2024] Open
Abstract
BACKGROUND Cardiomyopathy is a clinically and genetically heterogeneous heart condition that can lead to heart failure and sudden cardiac death in childhood. While it has a strong genetic basis, the genetic aetiology for over 50% of cardiomyopathy cases remains unknown. METHODS In this study, we analyse the characteristics of tandem repeats from genome sequence data of unrelated individuals diagnosed with cardiomyopathy from Canada and the United Kingdom (n = 1216) and compare them to those found in the general population. We perform burden analysis to identify genomic and epigenomic features that are impacted by rare tandem repeat expansions (TREs), and enrichment analysis to identify functional pathways that are involved in the TRE-associated genes in cardiomyopathy. We use Oxford Nanopore targeted long-read sequencing to validate repeat size and methylation status of one of the most recurrent TREs. We also compare the TRE-associated genes to those that are dysregulated in the heart tissues of individuals with cardiomyopathy. FINDINGS We demonstrate that tandem repeats that are rarely expanded in the general population are predominantly expanded in cardiomyopathy. We find that rare TREs are disproportionately present in constrained genes near transcriptional start sites, have high GC content, and frequently overlap active enhancer H3K27ac marks, where expansion-related DNA methylation may reduce gene expression. We demonstrate the gene silencing effect of expanded CGG tandem repeats in DIP2B through promoter hypermethylation. We show that the enhancer-associated loci are found in genes that are highly expressed in human cardiomyocytes and are differentially expressed in the left ventricle of the heart in individuals with cardiomyopathy. INTERPRETATION Our findings highlight the underrecognized contribution of rare tandem repeat expansions to the risk of cardiomyopathy and suggest that rare TREs contribute to ∼4% of cardiomyopathy risk. FUNDING Government of Ontario (RKCY), The Canadian Institutes of Health Research PJT 175329 (RKCY), The Azrieli Foundation (RKCY), SickKids Catalyst Scholar in Genetics (RKCY), The University of Toronto McLaughlin Centre (RKCY, SM), Ted Rogers Centre for Heart Research (SM), Data Sciences Institute at the University of Toronto (SM), The Canadian Institutes of Health Research PJT 175034 (SM), The Canadian Institutes of Health Research ENP 161429 under the frame of ERA PerMed (SM, RL), Heart and Stroke Foundation of Ontario & Robert M Freedom Chair in Cardiovascular Science (SM), Bitove Family Professorship of Adult Congenital Heart Disease (EO), Canada Foundation for Innovation (SWS, JR), Canada Research Chair (PS), Genome Canada (PS, JR), The Canadian Institutes of Health Research (PS).
Collapse
Affiliation(s)
- Aleksandra Mitina
- Genetics and Genome Biology, The Hospital for Sick Children; Toronto, Ontario, Canada
| | - Mahreen Khan
- Genetics and Genome Biology, The Hospital for Sick Children; Toronto, Ontario, Canada; Department of Molecular Genetics, University of Toronto; Toronto, Ontario, Canada
| | - Robert Lesurf
- Genetics and Genome Biology, The Hospital for Sick Children; Toronto, Ontario, Canada
| | - Yue Yin
- Genetics and Genome Biology, The Hospital for Sick Children; Toronto, Ontario, Canada
| | - Worrawat Engchuan
- Genetics and Genome Biology, The Hospital for Sick Children; Toronto, Ontario, Canada; The Centre for Applied Genomics, The Hospital for Sick Children; Toronto, Ontario, Canada
| | - Omar Hamdan
- Genetics and Genome Biology, The Hospital for Sick Children; Toronto, Ontario, Canada; The Centre for Applied Genomics, The Hospital for Sick Children; Toronto, Ontario, Canada
| | - Giovanna Pellecchia
- Genetics and Genome Biology, The Hospital for Sick Children; Toronto, Ontario, Canada; The Centre for Applied Genomics, The Hospital for Sick Children; Toronto, Ontario, Canada
| | - Brett Trost
- Genetics and Genome Biology, The Hospital for Sick Children; Toronto, Ontario, Canada; The Centre for Applied Genomics, The Hospital for Sick Children; Toronto, Ontario, Canada
| | - Ian Backstrom
- Genetics and Genome Biology, The Hospital for Sick Children; Toronto, Ontario, Canada
| | - Keyi Guo
- Genetics and Genome Biology, The Hospital for Sick Children; Toronto, Ontario, Canada
| | - Linda M Pallotto
- Genetics and Genome Biology, The Hospital for Sick Children; Toronto, Ontario, Canada
| | - Phoenix Hoi Lam Doong
- Genetics and Genome Biology, The Hospital for Sick Children; Toronto, Ontario, Canada
| | - Zhuozhi Wang
- Genetics and Genome Biology, The Hospital for Sick Children; Toronto, Ontario, Canada; The Centre for Applied Genomics, The Hospital for Sick Children; Toronto, Ontario, Canada
| | - Thomas Nalpathamkalam
- Genetics and Genome Biology, The Hospital for Sick Children; Toronto, Ontario, Canada; The Centre for Applied Genomics, The Hospital for Sick Children; Toronto, Ontario, Canada
| | - Bhooma Thiruvahindrapuram
- Genetics and Genome Biology, The Hospital for Sick Children; Toronto, Ontario, Canada; The Centre for Applied Genomics, The Hospital for Sick Children; Toronto, Ontario, Canada
| | - Tanya Papaz
- Ted Rogers Centre for Heart Research; Toronto, Ontario, Canada; Division of Cardiology, Department of Pediatrics, The Hospital for Sick Children, University of Toronto; Toronto, Ontario, Canada
| | - Christopher E Pearson
- Genetics and Genome Biology, The Hospital for Sick Children; Toronto, Ontario, Canada; Department of Molecular Genetics, University of Toronto; Toronto, Ontario, Canada
| | - Jiannis Ragoussis
- McGill Genome Centre, Victor Phillip Dahdaleh Institute of Genomic Medicine, McGill University, Montreal, Quebec, Canada
| | - Padmaja Subbarao
- Department of Paediatrics, Faculty of Medicine, University of Toronto, Toronto, Ontario, Canada; Department of Physiology, University of Toronto, Toronto, Ontario, Canada; Program in Translation Medicine & Division of Respiratory Medicine, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Meghan B Azad
- Department of Pediatrics and Child Health, University of Manitoba, Winnipeg, Manitoba, Canada
| | - Stuart E Turvey
- Department of Pediatrics, BC Children's Hospital, University of British Columbia, Vancouver, British Columbia, Canada
| | - Piushkumar Mandhane
- Department of Pediatrics, Faculty of Medicine and Dentistry, University of Alberta, Edmonton, Alberta, Canada
| | - Theo J Moraes
- Department of Paediatrics, Faculty of Medicine, University of Toronto, Toronto, Ontario, Canada; Program in Translation Medicine & Division of Respiratory Medicine, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Elinor Simons
- Department of Pediatrics and Child Health, Section of Allergy and Clinical Immunology, University of Manitoba, Winnipeg, Manitoba, Canada; Children's Hospital Research Institute of Manitoba, Winnipeg, Manitoba, Canada
| | - Stephen W Scherer
- Genetics and Genome Biology, The Hospital for Sick Children; Toronto, Ontario, Canada; The Centre for Applied Genomics, The Hospital for Sick Children; Toronto, Ontario, Canada; Department of Molecular Genetics and McLaughlin Centre, University of Toronto, Toronto, Ontario, Canada
| | - Jane Lougheed
- Division of Cardiology, Children's Hospital of Eastern Ontario, Ottawa, Ontario, Canada
| | - Tapas Mondal
- Division of Cardiology, Department of Pediatrics, McMaster Children's Hospital, Hamilton, Ontario, Canada
| | - John Smythe
- Division of Cardiology, Department of Pediatrics, Kingston General Hospital, Kingston, Ontario, Canada
| | - Luis Altamirano-Diaz
- Division of Cardiology, Department of Pediatrics, London Health Sciences Centre, London, Ontario, Canada
| | - Erwin Oechslin
- Division of Cardiology, Toronto Adult Congenital Heart Disease Program at Peter Munk Cardiac Centre, Department of Medicine, University Health Network, and University of Toronto, Toronto, Ontario, Canada
| | - Seema Mital
- Genetics and Genome Biology, The Hospital for Sick Children; Toronto, Ontario, Canada; Ted Rogers Centre for Heart Research; Toronto, Ontario, Canada; Division of Cardiology, Department of Pediatrics, The Hospital for Sick Children, University of Toronto; Toronto, Ontario, Canada.
| | - Ryan K C Yuen
- Genetics and Genome Biology, The Hospital for Sick Children; Toronto, Ontario, Canada; Department of Molecular Genetics, University of Toronto; Toronto, Ontario, Canada; The Centre for Applied Genomics, The Hospital for Sick Children; Toronto, Ontario, Canada.
| |
Collapse
|
8
|
Hannan AJ. Repeating themes of plastic genes and therapeutic schemes targeting the 'tandem repeatome'. Brain Commun 2024; 6:fcae047. [PMID: 38449715 PMCID: PMC10917440 DOI: 10.1093/braincomms/fcae047] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2024] [Revised: 01/24/2024] [Accepted: 02/17/2024] [Indexed: 03/08/2024] Open
Abstract
This scientific commentary refers to 'Modification of Huntington's disease by short tandem repeats' by Hong et al. (https://doi.org/10.1093/braincomms/fcae016) in Brain Communications.
Collapse
Affiliation(s)
- Anthony J Hannan
- Florey Institute of Neuroscience and Mental Health, University of Melbourne, Parkville, Australia
- Department of Anatomy and Physiology, University of Melbourne, Parkville, Australia
| |
Collapse
|
9
|
Birnbaum R. Rediscovering tandem repeat variation in schizophrenia: challenges and opportunities. Transl Psychiatry 2023; 13:402. [PMID: 38123544 PMCID: PMC10733427 DOI: 10.1038/s41398-023-02689-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/04/2023] [Revised: 11/23/2023] [Accepted: 11/27/2023] [Indexed: 12/23/2023] Open
Abstract
Tandem repeats (TRs) are prevalent throughout the genome, constituting at least 3% of the genome, and often highly polymorphic. The high mutation rate of TRs, which can be orders of magnitude higher than single-nucleotide polymorphisms and indels, indicates that they are likely to make significant contributions to phenotypic variation, yet their contribution to schizophrenia has been largely ignored by recent genome-wide association studies (GWAS). Tandem repeat expansions are already known causative factors for over 50 disorders, while common tandem repeat variation is increasingly being identified as significantly associated with complex disease and gene regulation. The current review summarizes key background concepts of tandem repeat variation as pertains to disease risk, elucidating their potential for schizophrenia association. An overview of next-generation sequencing-based methods that may be applied for TR genome-wide identification is provided, and some key methodological challenges in TR analyses are delineated.
Collapse
Affiliation(s)
- Rebecca Birnbaum
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
- Department of Genetics and Genomics Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
| |
Collapse
|
10
|
Annear DJ, Kooy RF. Unravelling the link between neurodevelopmental disorders and short tandem CGG-repeat expansions. Emerg Top Life Sci 2023; 7:265-275. [PMID: 37768318 PMCID: PMC10754333 DOI: 10.1042/etls20230021] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Revised: 08/23/2023] [Accepted: 09/11/2023] [Indexed: 09/29/2023]
Abstract
Neurodevelopmental disorders (NDDs) encompass a diverse group of disorders characterised by impaired cognitive abilities and developmental challenges. Short tandem repeats (STRs), repetitive DNA sequences found throughout the human genome, have emerged as potential contributors to NDDs. Specifically, the CGG trinucleotide repeat has been implicated in a wide range of NDDs, including Fragile X Syndrome (FXS), the most common inherited form of intellectual disability and autism. This review focuses on CGG STR expansions associated with NDDs and their impact on gene expression through repeat expansion-mediated epigenetic silencing. We explore the molecular mechanisms underlying CGG-repeat expansion and the resulting epigenetic modifications, such as DNA hypermethylation and gene silencing. Additionally, we discuss the involvement of other CGG STRs in neurodevelopmental diseases. Several examples, including FMR1, AFF2, AFF3, XYLT1, FRA10AC1, CBL, and DIP2B, highlight the complex relationship between CGG STR expansions and NDDs. Furthermore, recent advancements in this field are highlighted, shedding light on potential future research directions. Understanding the role of STRs, particularly CGG-repeats, in NDDs has the potential to uncover novel diagnostic and therapeutic strategies for these challenging disorders.
Collapse
Affiliation(s)
- Dale J Annear
- Department of Medical Genetics, University of Antwerp, Antwerp, Belgium
| | - R Frank Kooy
- Department of Medical Genetics, University of Antwerp, Antwerp, Belgium
| |
Collapse
|
11
|
Panoyan MA, Wendt FR. The role of tandem repeat expansions in brain disorders. Emerg Top Life Sci 2023; 7:249-263. [PMID: 37401564 DOI: 10.1042/etls20230022] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2023] [Revised: 06/05/2023] [Accepted: 06/19/2023] [Indexed: 07/05/2023]
Abstract
The human genome contains numerous genetic polymorphisms contributing to different health and disease outcomes. Tandem repeat (TR) loci are highly polymorphic yet under-investigated in large genomic studies, which has prompted research efforts to identify novel variations and gain a deeper understanding of their role in human biology and disease outcomes. We summarize the current understanding of TRs and their implications for human health and disease, including an overview of the challenges encountered when conducting TR analyses and potential solutions to overcome these challenges. By shedding light on these issues, this article aims to contribute to a better understanding of the impact of TRs on the development of new disease treatments.
Collapse
Affiliation(s)
- Mary Anne Panoyan
- Department of Anthropology, University of Toronto, Mississauga, ON, Canada
| | - Frank R Wendt
- Department of Anthropology, University of Toronto, Mississauga, ON, Canada
- Biostatistics Division, Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada
- Forensic Science Program, University of Toronto, Mississauga, ON, Canada
| |
Collapse
|
12
|
Hannan AJ. Expanding horizons of tandem repeats in biology and medicine: Why 'genomic dark matter' matters. Emerg Top Life Sci 2023; 7:ETLS20230075. [PMID: 38088823 PMCID: PMC10754335 DOI: 10.1042/etls20230075] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2023] [Revised: 11/27/2023] [Accepted: 11/27/2023] [Indexed: 12/30/2023]
Abstract
Approximately half of the human genome includes repetitive sequences, and these DNA sequences (as well as their transcribed repetitive RNA and translated amino-acid repeat sequences) are known as the repeatome. Within this repeatome there are a couple of million tandem repeats, dispersed throughout the genome. These tandem repeats have been estimated to constitute ∼8% of the entire human genome. These tandem repeats can be located throughout exons, introns and intergenic regions, thus potentially affecting the structure and function of tandemly repetitive DNA, RNA and protein sequences. Over more than three decades, more than 60 monogenic human disorders have been found to be caused by tandem-repeat mutations. These monogenic tandem-repeat disorders include Huntington's disease, a variety of ataxias, amyotrophic lateral sclerosis and frontotemporal dementia, as well as many other neurodegenerative diseases. Furthermore, tandem-repeat disorders can include fragile X syndrome, related fragile X disorders, as well as other neurological and psychiatric disorders. However, these monogenic tandem-repeat disorders, which were discovered via their dominant or recessive modes of inheritance, may represent the 'tip of the iceberg' with respect to tandem-repeat contributions to human disorders. A previous proposal that tandem repeats may contribute to the 'missing heritability' of various common polygenic human disorders has recently been supported by a variety of new evidence. This includes genome-wide studies that associate tandem-repeat mutations with autism, schizophrenia, Parkinson's disease and various types of cancers. In this article, I will discuss how tandem-repeat mutations and polymorphisms could contribute to a wide range of common disorders, along with some of the many major challenges of tandem-repeat biology and medicine. Finally, I will discuss the potential of tandem repeats to be therapeutically targeted, so as to prevent and treat an expanding range of human disorders.
Collapse
Affiliation(s)
- Anthony J Hannan
- Florey Institute of Neuroscience and Mental Health, University of Melbourne, Parkville, Victoria 3010, Australia
- Department of Anatomy and Physiology, University of Melbourne, Parkville, Victoria 3010, Australia
| |
Collapse
|
13
|
Guo MH, Lee WP, Vardarajan B, Schellenberg GD, Phillips-Cremins J. Polygenic burden of short tandem repeat expansions promote risk for Alzheimer's disease. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.11.16.23298623. [PMID: 38014121 PMCID: PMC10680900 DOI: 10.1101/2023.11.16.23298623] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2023]
Abstract
Studies of the genetics of Alzheimer's disease (AD) have largely focused on single nucleotide variants and short insertions/deletions. However, most of the disease heritability has yet to be uncovered, suggesting that there is substantial genetic risk conferred by other forms of genetic variation. There are over one million short tandem repeats (STRs) in the genome, and their link to AD risk has not been assessed. As pathogenic expansions of STR cause over 30 neurologic diseases, it is important to ascertain whether STRs may also be implicated in AD risk. Here, we genotyped 321,742 polymorphic STR tracts genome-wide using PCR-free whole genome sequencing data from 2,981 individuals (1,489 AD case and 1,492 control individuals). We implemented an approach to identify STR expansions as STRs with tract lengths that are outliers from the population. We then tested for differences in aggregate burden of expansions in case versus control individuals. AD patients had a 1.19-fold increase of STR expansions compared to healthy elderly controls (p=8.27×10-3, two-sided Mann Whitney test). Individuals carrying > 30 STR expansions had 3.62-fold higher odds of having AD and had more severe AD neuropathology. AD STR expansions were highly enriched within active promoters in post-mortem hippocampal brain tissues and particularly within SINE-VNTR-Alu (SVA) retrotransposons. Together, these results demonstrate that expanded STRs within active promoter regions of the genome promote risk of AD.
Collapse
Affiliation(s)
- Michael H Guo
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
- Department of Neurology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Wan-Ping Lee
- Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA
| | - Badri Vardarajan
- Department of Neurology, College of Physicians and Surgeons, Columbia University, New York, NY
| | - Gerard D Schellenberg
- Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA
| | - Jennifer Phillips-Cremins
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
- Department of Bioengineering, University of Pennsylvania, Philadelphia, PA, USA
- Epigenetics Institute, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| |
Collapse
|
14
|
Lundström OS, Adriaan Verbiest M, Xia F, Jam HZ, Zlobec I, Anisimova M, Gymrek M. WebSTR: A Population-wide Database of Short Tandem Repeat Variation in Humans. J Mol Biol 2023; 435:168260. [PMID: 37678708 DOI: 10.1016/j.jmb.2023.168260] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2023] [Revised: 08/29/2023] [Accepted: 08/29/2023] [Indexed: 09/09/2023]
Abstract
Short tandem repeats (STRs) are consecutive repetitions of one to six nucleotide motifs. They are hypervariable due to the high prevalence of repeat unit insertions or deletions primarily caused by polymerase slippage during replication. Genetic variation at STRs has been shown to influence a range of traits in humans, including gene expression, cancer risk, and autism. Until recently STRs have been poorly studied since they pose significant challenges to bioinformatics analyses. Moreover, genome-wide analysis of STR variation in population-scale cohorts requires large amounts of data and computational resources. However, the recent advent of genome-wide analysis tools has resulted in multiple large genome-wide datasets of STR variation spanning nearly two million genomic loci in thousands of individuals from diverse populations. Here we present WebSTR, a database of genetic variation and other characteristics of genome-wide STRs across human populations. WebSTR is based on reference panels of more than 1.7 million human STRs created with state of the art repeat annotation methods and can easily be extended to include additional cohorts or species. It currently contains data based on STR genotypes for individuals from the 1000 Genomes Project, H3Africa, the Genotype-Tissue Expression (GTEx) Project and colorectal cancer patients from the TCGA dataset. WebSTR is implemented as a relational database with programmatic access available through an API and a web portal for browsing data. The web portal is publicly available at https://webstr.ucsd.edu.
Collapse
Affiliation(s)
- Oxana Sachenkova Lundström
- Department of Biochemistry and Biophysics, Stockholm University, Stockholm, Sweden; Vildly AB, Kalmar, Sweden; Institute of Computational Life Sciences, School of Life Sciences and Facility Management, Zürich University of Applied Sciences (ZHAW), Waedenswil, Switzerland. https://twitter.com/merenlin
| | - Max Adriaan Verbiest
- Institute of Computational Life Sciences, School of Life Sciences and Facility Management, Zürich University of Applied Sciences (ZHAW), Waedenswil, Switzerland; Swiss Institute of Bioinformatics (SIB), Lausanne, Switzerland; Department of Molecular Life Sciences, University of Zurich, Zurich, Switzerland
| | - Feifei Xia
- Institute of Computational Life Sciences, School of Life Sciences and Facility Management, Zürich University of Applied Sciences (ZHAW), Waedenswil, Switzerland; Swiss Institute of Bioinformatics (SIB), Lausanne, Switzerland; Department of Molecular Life Sciences, University of Zurich, Zurich, Switzerland. https://twitter.com/Feifeix97
| | - Helyaneh Ziaei Jam
- Department of Computer Science and Engineering, University of California San Diego, La Jolla, CA, USA
| | - Inti Zlobec
- Institute of Tissue Medicine and Pathology, University of Bern, Switzerland
| | - Maria Anisimova
- Institute of Computational Life Sciences, School of Life Sciences and Facility Management, Zürich University of Applied Sciences (ZHAW), Waedenswil, Switzerland; Swiss Institute of Bioinformatics (SIB), Lausanne, Switzerland.
| | - Melissa Gymrek
- Department of Computer Science and Engineering, University of California San Diego, La Jolla, CA, USA; Department of Medicine, University of California San Diego, La Jolla, CA, USA.
| |
Collapse
|