1
|
Uguen K, Michaud JL, Génin E. Short Tandem Repeats in the era of next-generation sequencing: from historical loci to population databases. Eur J Hum Genet 2024; 32:1037-1044. [PMID: 38982300 PMCID: PMC11369099 DOI: 10.1038/s41431-024-01666-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2024] [Revised: 06/20/2024] [Accepted: 06/27/2024] [Indexed: 07/11/2024] Open
Abstract
In this study, we explore the landscape of short tandem repeats (STRs) within the human genome through the lens of evolving technologies to detect genomic variations. STRs, which encompass approximately 3% of our genomic DNA, are crucial for understanding human genetic diversity, disease mechanisms, and evolutionary biology. The advent of high-throughput sequencing methods has revolutionized our ability to accurately map and analyze STRs, highlighting their significance in genetic disorders, forensic science, and population genetics. We review the current available methodologies for STR analysis, the challenges in interpreting STR variations across different populations, and the implications of STRs in medical genetics. Our findings underscore the urgent need for comprehensive STR databases that reflect the genetic diversity of global populations, facilitating the interpretation of STR data in clinical diagnostics, genetic research, and forensic applications. This work sets the stage for future studies aimed at harnessing STR variations to elucidate complex genetic traits and diseases, reinforcing the importance of integrating STRs into genetic research and clinical practice.
Collapse
Affiliation(s)
- Kevin Uguen
- Univ Brest, Inserm, EFS, UMR 1078, GGB, Brest, France.
- Service de Génétique Médicale et Biologie de la Reproduction, CHU de Brest, Brest, France.
- CHU Sainte-Justine Azrieli Research Centre, Montréal, QC, Canada.
| | - Jacques L Michaud
- CHU Sainte-Justine Azrieli Research Centre, Montréal, QC, Canada
- Department of Pediatrics, Université de Montréal, Montréal, QC, Canada
- Department of Neurosciences, Université de Montréal, Montréal, QC, Canada
| | | |
Collapse
|
2
|
Rajan-Babu IS, Dolzhenko E, Eberle MA, Friedman JM. Sequence composition changes in short tandem repeats: heterogeneity, detection, mechanisms and clinical implications. Nat Rev Genet 2024; 25:476-499. [PMID: 38467784 DOI: 10.1038/s41576-024-00696-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/19/2024] [Indexed: 03/13/2024]
Abstract
Short tandem repeats (STRs) are a class of repetitive elements, composed of tandem arrays of 1-6 base pair sequence motifs, that comprise a substantial fraction of the human genome. STR expansions can cause a wide range of neurological and neuromuscular conditions, known as repeat expansion disorders, whose age of onset, severity, penetrance and/or clinical phenotype are influenced by the length of the repeats and their sequence composition. The presence of non-canonical motifs, depending on the type, frequency and position within the repeat tract, can alter clinical outcomes by modifying somatic and intergenerational repeat stability, gene expression and mutant transcript-mediated and/or protein-mediated toxicities. Here, we review the diverse structural conformations of repeat expansions, technological advances for the characterization of changes in sequence composition, their clinical correlations and the impact on disease mechanisms.
Collapse
Affiliation(s)
- Indhu-Shree Rajan-Babu
- Department of Medical Genetics, The University of British Columbia, and Children's & Women's Hospital, Vancouver, British Columbia, Canada.
| | | | | | - Jan M Friedman
- Department of Medical Genetics, The University of British Columbia, and Children's & Women's Hospital, Vancouver, British Columbia, Canada
- BC Children's Hospital Research Institute, Vancouver, British Columbia, Canada
| |
Collapse
|
3
|
Scarano C, Veneruso I, De Simone RR, Di Bonito G, Secondino A, D’Argenio V. The Third-Generation Sequencing Challenge: Novel Insights for the Omic Sciences. Biomolecules 2024; 14:568. [PMID: 38785975 PMCID: PMC11117673 DOI: 10.3390/biom14050568] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2024] [Revised: 05/05/2024] [Accepted: 05/08/2024] [Indexed: 05/25/2024] Open
Abstract
The understanding of the human genome has been greatly improved by the advent of next-generation sequencing technologies (NGS). Despite the undeniable advantages responsible for their widespread diffusion, these methods have some constraints, mainly related to short read length and the need for PCR amplification. As a consequence, long-read sequencers, called third-generation sequencing (TGS), have been developed, promising to overcome NGS. Starting from the first prototype, TGS has progressively ameliorated its chemistries by improving both read length and base-calling accuracy, as well as simultaneously reducing the costs/base. Based on these premises, TGS is showing its potential in many fields, including the analysis of difficult-to-sequence genomic regions, structural variations detection, RNA expression profiling, DNA methylation study, and metagenomic analyses. Protocol standardization and the development of easy-to-use pipelines for data analysis will enhance TGS use, also opening the way for their routine applications in diagnostic contexts.
Collapse
Affiliation(s)
- Carmela Scarano
- Department of Molecular Medicine and Medical Biotechnologies, Federico II University, Via Sergio Pansini 5, 80131 Napoli, Italy
- CEINGE-Biotecnologie Avanzate Franco Salvatore, Via G. Salvatore 486, 80145 Napoli, Italy
| | - Iolanda Veneruso
- Department of Molecular Medicine and Medical Biotechnologies, Federico II University, Via Sergio Pansini 5, 80131 Napoli, Italy
- CEINGE-Biotecnologie Avanzate Franco Salvatore, Via G. Salvatore 486, 80145 Napoli, Italy
| | - Rosa Redenta De Simone
- Department of Molecular Medicine and Medical Biotechnologies, Federico II University, Via Sergio Pansini 5, 80131 Napoli, Italy
- CEINGE-Biotecnologie Avanzate Franco Salvatore, Via G. Salvatore 486, 80145 Napoli, Italy
| | - Gennaro Di Bonito
- Department of Molecular Medicine and Medical Biotechnologies, Federico II University, Via Sergio Pansini 5, 80131 Napoli, Italy
- CEINGE-Biotecnologie Avanzate Franco Salvatore, Via G. Salvatore 486, 80145 Napoli, Italy
| | - Angela Secondino
- Department of Molecular Medicine and Medical Biotechnologies, Federico II University, Via Sergio Pansini 5, 80131 Napoli, Italy
- CEINGE-Biotecnologie Avanzate Franco Salvatore, Via G. Salvatore 486, 80145 Napoli, Italy
| | - Valeria D’Argenio
- CEINGE-Biotecnologie Avanzate Franco Salvatore, Via G. Salvatore 486, 80145 Napoli, Italy
- Department of Human Sciences and Quality of Life Promotion, San Raffaele Open University, Via di Val Cannuta 247, 00166 Roma, Italy
| |
Collapse
|
4
|
Uppuluri L, Shi CH, Varapula D, Young E, Ehrlich RL, Wang Y, Piazza D, Mell JC, Yip KY, Xiao M. A long-read sequencing strategy with overlapping linkers on adjacent fragments (OLAF-Seq) for targeted resequencing and enrichment. Sci Rep 2024; 14:5583. [PMID: 38448490 PMCID: PMC10917763 DOI: 10.1038/s41598-024-56402-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2023] [Accepted: 03/06/2024] [Indexed: 03/08/2024] Open
Abstract
In this report, we present OLAF-Seq, a novel strategy to construct a long-read sequencing library such that adjacent fragments are linked with end-terminal duplications. We use the CRISPR-Cas9 nickase enzyme and a pool of multiple sgRNAs to perform non-random fragmentation of targeted long DNA molecules (> 300kb) into smaller library-sized fragments (about 20 kbp) in a manner so as to retain physical linkage information (up to 1000 bp) between adjacent fragments. DNA molecules targeted for fragmentation are preferentially ligated with adaptors for sequencing, so this method can enrich targeted regions while taking advantage of the long-read sequencing platforms. This enables the sequencing of target regions with significantly lower total coverage, and the genome sequence within linker regions provides information for assembly and phasing. We demonstrated the validity and efficacy of the method first using phage and then by sequencing a panel of 100 full-length cancer-related genes (including both exons and introns) in the human genome. When the designed linkers contained heterozygous genetic variants, long haplotypes could be established. This sequencing strategy can be readily applied in both PacBio and Oxford Nanopore platforms for both long and short genes with an easy protocol. This economically viable approach is useful for targeted enrichment of hundreds of target genomic regions and where long no-gap contigs need deep sequencing.
Collapse
Affiliation(s)
- Lahari Uppuluri
- School of Biomedical Engineering, Science and Health Systems, Drexel University, Philadelphia, PA, 19104, USA
| | - Christina Huan Shi
- Cancer Genome and Epigenetics Program, NCI-Designated Cancer Center, Sanford Burnham Prebys Medical Discovery Institute, La Jolla, CA, 92037, USA
| | - Dharma Varapula
- School of Biomedical Engineering, Science and Health Systems, Drexel University, Philadelphia, PA, 19104, USA
| | - Eleanor Young
- School of Biomedical Engineering, Science and Health Systems, Drexel University, Philadelphia, PA, 19104, USA
| | - Rachel L Ehrlich
- Department of Microbiology and Immunology, College of Medicine, Drexel University, Philadelphia, PA, 19104, USA
- Center for Genomic Sciences, Institute of Molecular Medicine and Infectious Disease, Drexel University, Philadelphia, PA, 19104, USA
| | - Yilin Wang
- School of Biomedical Engineering, Science and Health Systems, Drexel University, Philadelphia, PA, 19104, USA
| | - Danielle Piazza
- Department of Microbiology and Immunology, College of Medicine, Drexel University, Philadelphia, PA, 19104, USA
- Center for Genomic Sciences, Institute of Molecular Medicine and Infectious Disease, Drexel University, Philadelphia, PA, 19104, USA
| | - Joshua Chang Mell
- Department of Microbiology and Immunology, College of Medicine, Drexel University, Philadelphia, PA, 19104, USA
- Center for Genomic Sciences, Institute of Molecular Medicine and Infectious Disease, Drexel University, Philadelphia, PA, 19104, USA
| | - Kevin Y Yip
- Cancer Genome and Epigenetics Program, NCI-Designated Cancer Center, Sanford Burnham Prebys Medical Discovery Institute, La Jolla, CA, 92037, USA
| | - Ming Xiao
- School of Biomedical Engineering, Science and Health Systems, Drexel University, Philadelphia, PA, 19104, USA.
- Center for Genomic Sciences, Institute of Molecular Medicine and Infectious Disease, Drexel University, Philadelphia, PA, 19104, USA.
| |
Collapse
|
5
|
Yu SY, Xi YL, Xu FQ, Zhang J, Liu YS. Application of long read sequencing in rare diseases: The longer, the better? Eur J Med Genet 2023; 66:104871. [PMID: 38832911 DOI: 10.1016/j.ejmg.2023.104871] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2023] [Revised: 10/11/2023] [Accepted: 10/22/2023] [Indexed: 06/06/2024]
Abstract
Rare diseases encompass a diverse group of genetic disorders that affect a small proportion of the population. Identifying the underlying genetic causes of these conditions presents significant challenges due to their genetic heterogeneity and complexity. Conventional short-read sequencing (SRS) techniques have been widely used in diagnosing and investigating of rare diseases, with limitations due to the nature of short-read lengths. In recent years, long read sequencing (LRS) technologies have emerged as a valuable tool in overcoming these limitations. This minireview provides a concise overview of the applications of LRS in rare disease research and diagnosis, including the identification of disease-causing tandem repeat expansions, structural variations, and comprehensive analysis of pathogenic variants with LRS.
Collapse
Affiliation(s)
- Si-Yan Yu
- Department of Pediatric Laboratory, Affiliated Children's Hospital of Jiangnan University (Wuxi Children's Hospital), Wuxi, Jiangsu, China; The First School of Clinical Medicine, Nanjing Medical University, Nanjing, Jiangsu, China
| | - Yu-Lin Xi
- Wuxi School of Medicine, Jiangnan University, Wuxi, Jiangsu, China
| | - Fu-Qiang Xu
- Department of Gynecology, Beijing Youan Hospital, Capital Medical University, Beijing, China
| | - Jian Zhang
- Department of Medical Laboratory, Affiliated Children's Hospital of Jiangnan University (Wuxi Children's Hospital), Wuxi, Jiangsu, China.
| | - Yan-Shan Liu
- Department of Pediatric Laboratory, Affiliated Children's Hospital of Jiangnan University (Wuxi Children's Hospital), Wuxi, Jiangsu, China; Wuxi School of Medicine, Jiangnan University, Wuxi, Jiangsu, China.
| |
Collapse
|
6
|
Zou J, Wang F, Gong Z, Wang R, Chen S, Zhang H, Sun R, Gao C, Li W, Shang J, Zhang J. A Chinese SCA36 pedigree analysis of NOP56 expansion region based on long-read sequencing. Front Genet 2023; 14:1110307. [PMID: 37051597 PMCID: PMC10083286 DOI: 10.3389/fgene.2023.1110307] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2022] [Accepted: 03/15/2023] [Indexed: 03/28/2023] Open
Abstract
Introduction: Spinocerebellar ataxias 36 (SCA36) is the neurodegenerative disease caused by the GGCCTG Hexanucleotide repeat expansions in NOP56, which is too long to sequence using short-read sequencing. Single molecule real time (SMRT) sequencing can sequence across disease-causing repeat expansion. We report the first long-read sequencing data across the expansion region in SCA36.Methods: We collected and described the clinical manifestations and imaging features of Han Chinese pedigree with three generations of SCA36. Also, we focused on structural variation analysis for intron 1 of the NOP56 gene by SMRT sequencing in the assembled genome.Results: The main clinical features of this pedigree are late-onset ataxia symptoms, with a presymptomatic presence of affective and sleep disorders. In addition, the results of SMRT sequencing showed the specific repeat expansion region and demonstrated that the region was not composed of single GGCCTG hexanucleotides and there were random interruptions.Discussion: We extended the phenotypic spectrum of SCA36. We applied SMRT sequencing to reveal the correlation between genotype and phenotype of SCA36. Our findings indicated that long-read sequencing is well suited to characterize known repeat expansion.
Collapse
Affiliation(s)
- Jinlong Zou
- Department of Neurology, Henan University People’s Hospital, Henan Provincial People’s Hospital, Zhengzhou, China
| | - Fengyu Wang
- Department of Neurology, Henan Provincial People’s Hospital, Zhengzhou University People’s Hospital, Zhengzhou, China
| | - Zhenping Gong
- Department of Neurology, Xinxiang Medical University, Henan Provincial People’s Hospital, Zhengzhou, China
| | - Runrun Wang
- Department of Neurology, Henan Provincial People’s Hospital, Zhengzhou University People’s Hospital, Zhengzhou, China
| | - Shuai Chen
- Department of Neurology, Henan University People’s Hospital, Henan Provincial People’s Hospital, Zhengzhou, China
- Department of Neurology, Henan Provincial People’s Hospital, Zhengzhou University People’s Hospital, Zhengzhou, China
| | - Haohan Zhang
- Department of Neurology, Henan Provincial People’s Hospital, Zhengzhou University People’s Hospital, Zhengzhou, China
- Academy of Medical Sciences, Zhengzhou University, Zhengzhou, Henan, China
| | - Ruihua Sun
- Department of Neurology, Henan Provincial People’s Hospital, Zhengzhou University People’s Hospital, Zhengzhou, China
- Academy of Medical Sciences, Zhengzhou University, Zhengzhou, Henan, China
| | - Chenhao Gao
- Department of Neurology, Henan Provincial People’s Hospital, Zhengzhou University People’s Hospital, Zhengzhou, China
| | - Wei Li
- Department of Neurology, Henan Provincial People’s Hospital, Zhengzhou University People’s Hospital, Zhengzhou, China
| | - Junkui Shang
- Department of Neurology, Henan Provincial People’s Hospital, Zhengzhou University People’s Hospital, Zhengzhou, China
| | - Jiewen Zhang
- Department of Neurology, Henan University People’s Hospital, Henan Provincial People’s Hospital, Zhengzhou, China
- Department of Neurology, Henan Provincial People’s Hospital, Zhengzhou University People’s Hospital, Zhengzhou, China
- Department of Neurology, Xinxiang Medical University, Henan Provincial People’s Hospital, Zhengzhou, China
- Academy of Medical Sciences, Zhengzhou University, Zhengzhou, Henan, China
- *Correspondence: Jiewen Zhang,
| |
Collapse
|
7
|
Thymidine Kinase 2 and Mitochondrial Protein COX I in the Cerebellum of Patients with Spinocerebellar Ataxia Type 31 Caused by Penta-nucleotide Repeats (TTCCA) n. CEREBELLUM (LONDON, ENGLAND) 2023; 22:70-84. [PMID: 35084690 PMCID: PMC9883315 DOI: 10.1007/s12311-021-01364-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Accepted: 12/23/2021] [Indexed: 02/01/2023]
Abstract
Spinocerebellar ataxia type 31 (SCA31), an autosomal-dominant neurodegenerative disorder characterized by progressive cerebellar ataxia with Purkinje cell degeneration, is caused by a heterozygous 2.5-3.8 kilobase penta-nucleotide repeat of (TTCCA)n in intron 11 of the thymidine kinase 2 (TK2) gene. TK2 is an essential mitochondrial pyrimidine-deoxyribonucleoside kinase. Bi-allelic loss-of-function mutations of TK2 lead to mitochondrial DNA depletion syndrome (MDS) in humans through severe (~ 70%) reduction of mitochondrial electron-transport-chain activity, and tk2 knockout mice show Purkinje cell degeneration and ataxia through severe mitochondrial cytochrome-c oxidase subunit I (COX I) protein reduction. To clarify whether TK2 function is altered in SCA31, we investigated TK2 and COX I expression in human postmortem SCA31 cerebellum. We confirmed that canonical TK2 mRNA is transcribed from exons far upstream of the repeat site, and demonstrated that an extended version of TK2 mRNA ("TK2-EXT"), transcribed from exons spanning the repeat site, is expressed in human cerebellum. While canonical TK2 was conserved among vertebrates, TK2-EXT was specific to primates. Reverse transcription-PCR demonstrated that both TK2 mRNAs were preserved in SCA31 cerebella compared with control cerebella. The TK2 proteins, assessed with three different antibodies including our original polyclonal antibody against TK2-EXT, were detected as ~ 26 kilodalton proteins on western blot; their levels were similar in SCA31 and control cerebella. COX I protein level was preserved in SCA31 compared to nuclear DNA-encoded protein. We conclude that the expression and function of TK2 are preserved in SCA31, suggesting a mechanism distinct from that of MDS.
Collapse
|
8
|
Taylor A, Barros D, Gobet N, Schuepbach T, McAllister B, Aeschbach L, Randall E, Trofimenko E, Heuchan E, Barszcz P, Ciosi M, Morgan J, Hafford-Tear N, Davidson A, Massey T, Monckton D, Jones L, network REGISTRYH, Xenarios I, Dion V. Repeat Detector: versatile sizing of expanded tandem repeats and identification of interrupted alleles from targeted DNA sequencing. NAR Genom Bioinform 2022; 4:lqac089. [PMID: 36478959 PMCID: PMC9719798 DOI: 10.1093/nargab/lqac089] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2022] [Revised: 10/25/2022] [Accepted: 11/08/2022] [Indexed: 12/07/2022] Open
Abstract
Targeted DNA sequencing approaches will improve how the size of short tandem repeats is measured for diagnostic tests and preclinical studies. The expansion of these sequences causes dozens of disorders, with longer tracts generally leading to a more severe disease. Interrupted alleles are sometimes present within repeats and can alter disease manifestation. Determining repeat size mosaicism and identifying interruptions in targeted sequencing datasets remains a major challenge. This is in part because standard alignment tools are ill-suited for repetitive and unstable sequences. To address this, we have developed Repeat Detector (RD), a deterministic profile weighting algorithm for counting repeats in targeted sequencing data. We tested RD using blood-derived DNA samples from Huntington's disease and Fuchs endothelial corneal dystrophy patients sequenced using either Illumina MiSeq or Pacific Biosciences single-molecule, real-time sequencing platforms. RD was highly accurate in determining repeat sizes of 609 blood-derived samples from Huntington's disease individuals and did not require prior knowledge of the flanking sequences. Furthermore, RD can be used to identify alleles with interruptions and provide a measure of repeat instability within an individual. RD is therefore highly versatile and may find applications in the diagnosis of expanded repeat disorders and in the development of novel therapies.
Collapse
Affiliation(s)
- Alysha S Taylor
- UK Dementia Research Institute, Cardiff University, Hadyn Ellis Building, Maindy Road, Cardiff, CF24 4HQ, UK
| | - Dinis Barros
- Centre for Integrative Genomics, University of Lausanne, Bâtiment Génopode, 1015 Lausanne, Switzerland
| | - Nastassia Gobet
- Centre for Integrative Genomics, University of Lausanne, Bâtiment Génopode, 1015 Lausanne, Switzerland
| | - Thierry Schuepbach
- Vital-IT Group, Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
- Newbiologix, Ch. De la corniche 6-8, 1066 Epalinges, Switzerland
| | - Branduff McAllister
- MRC Centre for Neuropsychiatric Genetics and Genomics, Cardiff University, Hadyn Ellis Building, Maindy Road, Cardiff CF24 4HQ, UK
- Molecular Neurogenetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA 02114, USA
| | - Lorene Aeschbach
- Centre for Integrative Genomics, University of Lausanne, Bâtiment Génopode, 1015 Lausanne, Switzerland
| | - Emma L Randall
- UK Dementia Research Institute, Cardiff University, Hadyn Ellis Building, Maindy Road, Cardiff, CF24 4HQ, UK
| | - Evgeniya Trofimenko
- Centre for Integrative Genomics, University of Lausanne, Bâtiment Génopode, 1015 Lausanne, Switzerland
- Sorbonne Université, École normale supérieure, PSL University, CNRS, Laboratoire des biomolécules, LBM, 75005 Paris, France
| | - Eleanor R Heuchan
- UK Dementia Research Institute, Cardiff University, Hadyn Ellis Building, Maindy Road, Cardiff, CF24 4HQ, UK
| | - Paula Barszcz
- Centre for Integrative Genomics, University of Lausanne, Bâtiment Génopode, 1015 Lausanne, Switzerland
| | - Marc Ciosi
- School of Molecular Biosciences, College of Medical, Veterinary and Life Sciences, Davidson Building, University of Glasgow, Glasgow, G12 8QQ, UK
| | - Joanne Morgan
- MRC Centre for Neuropsychiatric Genetics and Genomics, Cardiff University, Hadyn Ellis Building, Maindy Road, Cardiff CF24 4HQ, UK
| | | | - Alice E Davidson
- UCL Institute of Ophthalmology, 11-43 Bath Street, London, EC1V 9EL UK
| | - Thomas H Massey
- MRC Centre for Neuropsychiatric Genetics and Genomics, Cardiff University, Hadyn Ellis Building, Maindy Road, Cardiff CF24 4HQ, UK
| | - Darren G Monckton
- School of Molecular Biosciences, College of Medical, Veterinary and Life Sciences, Davidson Building, University of Glasgow, Glasgow, G12 8QQ, UK
| | - Lesley Jones
- MRC Centre for Neuropsychiatric Genetics and Genomics, Cardiff University, Hadyn Ellis Building, Maindy Road, Cardiff CF24 4HQ, UK
| | | | - Ioannis Xenarios
- Centre for Integrative Genomics, University of Lausanne, Bâtiment Génopode, 1015 Lausanne, Switzerland
- Health2030 Genome Center, Ch des Mines 14, 1202 Genève, Switzerland
| | - Vincent Dion
- UK Dementia Research Institute, Cardiff University, Hadyn Ellis Building, Maindy Road, Cardiff, CF24 4HQ, UK
| |
Collapse
|
9
|
Conlin LK, Aref-Eshghi E, McEldrew DA, Luo M, Rajagopalan R. Long-read sequencing for molecular diagnostics in constitutional genetic disorders. Hum Mutat 2022; 43:1531-1544. [PMID: 36086952 PMCID: PMC9561063 DOI: 10.1002/humu.24465] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2022] [Revised: 09/03/2022] [Accepted: 09/06/2022] [Indexed: 11/11/2022]
Abstract
Long-read sequencing (LRS) has been around for more than a decade, but widespread adoption of the technology has been slow due to the perceived high error rates and high sequencing cost. This is changing due to the recent advancements to produce highly accurate sequences and the reducing costs. LRS promises significant improvement over short read sequencing in four major areas: (1) better detection of structural variation (2) better resolution of highly repetitive or nonunique regions (3) accurate long-range haplotype phasing and (4) the detection of base modifications natively from the sequencing data. Several successful applications of LRS have demonstrated its ability to resolve molecular diagnoses where short-read sequencing fails to identify a cause. However, the argument for increased diagnostic yield from LRS remains to be validated. Larger cohort studies may be required to establish the realistic boundaries of LRS's clinical utility and analytical validity, as well as the development of standards for clinical applications. We discuss the limitations of the current standard of care, and contrast with the applications and advantages of two major LRS platforms, PacBio and Oxford Nanopore, for molecular diagnostics of constitutional disorders, and present a critical argument about the potential of LRS in diagnostic settings.
Collapse
Affiliation(s)
- Laura K. Conlin
- Division of Genomic Diagnostics, Department of Pathology and Laboratory Medicine, Children’s Hospital of Philadelphia, Philadelphia, PA 19104
- Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104
| | - Erfan Aref-Eshghi
- Division of Genomic Diagnostics, Department of Pathology and Laboratory Medicine, Children’s Hospital of Philadelphia, Philadelphia, PA 19104
| | - Deborah A. McEldrew
- Division of Genomic Diagnostics, Department of Pathology and Laboratory Medicine, Children’s Hospital of Philadelphia, Philadelphia, PA 19104
| | - Minjie Luo
- Division of Genomic Diagnostics, Department of Pathology and Laboratory Medicine, Children’s Hospital of Philadelphia, Philadelphia, PA 19104
- Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104
| | - Ramakrishnan Rajagopalan
- Division of Genomic Diagnostics, Department of Pathology and Laboratory Medicine, Children’s Hospital of Philadelphia, Philadelphia, PA 19104
- Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104
| |
Collapse
|
10
|
ATTCT and ATTCC repeat expansions in the ATXN10 gene affect disease penetrance of spinocerebellar ataxia type 10. HGG ADVANCES 2022; 3:100137. [PMID: 36092952 PMCID: PMC9460507 DOI: 10.1016/j.xhgg.2022.100137] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2022] [Accepted: 08/11/2022] [Indexed: 11/21/2022] Open
Abstract
Spinocerebellar ataxia type 10 (SCA10) is an autosomal-dominant disorder caused by an expanded pentanucleotide repeat in the ATXN10 gene. This repeat expansion, when fully penetrant, has a size of 850–4,500 repeats. It has been shown that the repeat composition can be a modifier of disease, e.g., seizures. Here, we describe a Mexican kindred in which we identified both pure (ATTCT)n and mixed (ATTCT)n-(ATTCC)n expansions in the same family. We used amplification-free targeted sequencing and optical genome mapping to decipher the composition of these repeat expansions. We found a considerable degree of mosaicism of the repeat expansion. This mosaicism was confirmed in skin fibroblasts from individuals with ATXN10 expansions with RNAScope in situ hybridization. All affected family members with the mixed ATXN10 repeat expansion showed typical clinical signs of spinocerebellar ataxia and epilepsy. In contrast, individuals with the pure ATXN10 expansion present with Parkinson's disease or are unaffected, even in individuals more than 20 years older than the average age at onset for SCA10. Our findings suggest that the pure (ATTCT)n expansion is non-pathogenic, while repeat interruptions, e.g., (ATTCC)n, are necessary to cause SCA10. This mechanism has been recently described for several other repeat expansions including SCA31 (BEAN1), SCA37 (DAB1), and three loci for benign adult familial myoclonic epilepsy BAFME (SAMD12, TNRC6A, RAPGEF2). Therefore, long-read sequencing and optical genome mapping of the entire genomic structure of repeat expansions are critical for clinical practice and genetic counseling, as variations in the repeat can affect disease penetrance, symptoms, and disease trajectory.
Collapse
|
11
|
Kurosaki T, Ashizawa T. The genetic and molecular features of the intronic pentanucleotide repeat expansion in spinocerebellar ataxia type 10. Front Genet 2022; 13:936869. [PMID: 36199580 PMCID: PMC9528567 DOI: 10.3389/fgene.2022.936869] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2022] [Accepted: 08/25/2022] [Indexed: 11/13/2022] Open
Abstract
Spinocerebellar ataxia type 10 (SCA10) is characterized by progressive cerebellar neurodegeneration and, in many patients, epilepsy. This disease mainly occurs in individuals with Indigenous American or East Asian ancestry, with strong evidence supporting a founder effect. The mutation causing SCA10 is a large expansion in an ATTCT pentanucleotide repeat in intron 9 of the ATXN10 gene. The ATTCT repeat is highly unstable, expanding to 280–4,500 repeats in affected patients compared with the 9–32 repeats in normal individuals, one of the largest repeat expansions causing neurological disorders identified to date. However, the underlying molecular basis of how this huge repeat expansion evolves and contributes to the SCA10 phenotype remains largely unknown. Recent progress in next-generation DNA sequencing technologies has established that the SCA10 repeat sequence has a highly heterogeneous structure. Here we summarize what is known about the structure and origin of SCA10 repeats, discuss the potential contribution of variant repeats to the SCA10 disease phenotype, and explore how this information can be exploited for therapeutic benefit.
Collapse
Affiliation(s)
- Tatsuaki Kurosaki
- Department of Biochemistry and Biophysics, School of Medicine and Dentistry, University of Rochester, Rochester, NY, United States
- Center for RNA Biology, University of Rochester, Rochester, NY, United States
- *Correspondence: Tatsuaki Kurosaki, ; Tetsuo Ashizawa,
| | - Tetsuo Ashizawa
- Stanley H. Appel Department of Neurology, Houston Methodist Research Institute and Weil Cornell Medical College at Houston Methodist Houston, TX, United States
- *Correspondence: Tatsuaki Kurosaki, ; Tetsuo Ashizawa,
| |
Collapse
|
12
|
Jiang F, Mao AP, Liu YY, Liu FZ, Li YL, Li J, Zhou JY, Tang XW, Ju AP, Li FT, Wan JH, Zuo LD, Li DZ. Detection of rare thalassemia mutations using long-read single-molecule real-time sequencing. Gene 2022; 825:146438. [PMID: 35306112 DOI: 10.1016/j.gene.2022.146438] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2022] [Revised: 03/07/2022] [Accepted: 03/14/2022] [Indexed: 11/30/2022]
Abstract
Gap- polymerase chain reaction (PCR), reverse dot-blot assay (RDB), real-time PCR based multicolor melting curve analysis (MMCA assay), multiplex ligation-dependent probe amplification (MLPA) and Sanger sequencing are conventional methods to diagnose thalassemia but all of them have limitations. In this study, we applied single-molecule real-time (SMRT) sequencing following multiplex long-range PCR to uncover rare mutations in nine patients and their family members. The patients with different results between Gap-PCR and MMCA assay or with phenotype not matching genotype were included. Using SMRT sequencing, we first identified the carriers with αααanti3.7/HKαα, -α762bpα/αα (chr16:172,648-173,409), ααfusion/αQSα (in a trans configuration), two cases with novel gene rearrangements and another case with a novel 341 bp insertion in α-globin gene cluster, respectively. One carrier with --SEA/αααanti4.2, and two carriers with the coexistence of globin variant and an α-globin gene duplication were also found. Most importantly, we could determine two defects in α-globin gene cluster being a cis or trans configuration in a single test. Our results showed that SMRT has great advantages in detection of α-globin gene triplications, rare deletions and determination of a cis or trans configuration. SMRT is a comprehensive and one-step method for thalassemia screening and diagnosis, especially for detection of rare thalassemia mutations.
Collapse
Affiliation(s)
- Fan Jiang
- Prenatal Diagnostic Center, Guangzhou Women and Children's Medical Center, Guangzhou, Guangdong, China
| | - Ai-Ping Mao
- Berry Genomics Corporation, Beijing, 102200, China
| | - Yin-Yin Liu
- Berry Genomics Corporation, Beijing, 102200, China
| | - Feng-Zhi Liu
- Medical Genetics Laboratory, Foshan Maternal and Child Health Hospital, Foshan, Guangdong, China
| | - Yan-Lin Li
- Prenatal Diagnostic Center, Guangzhou Women and Children's Medical Center, Guangzhou, Guangdong, China
| | - Jian Li
- Prenatal Diagnostic Center, Guangzhou Women and Children's Medical Center, Guangzhou, Guangdong, China
| | - Jian-Ying Zhou
- Prenatal Diagnostic Center, Guangzhou Women and Children's Medical Center, Guangzhou, Guangdong, China
| | - Xue-Wei Tang
- Prenatal Diagnostic Center, Guangzhou Women and Children's Medical Center, Guangzhou, Guangdong, China
| | - Ai-Ping Ju
- Clinical Laboratory, Huadu District Maternal and Neonatal Healthcare Hospital of Guangzhou, Hu Zhong Hospital, Guangzhou, Guangdong, China
| | - Fa-Tao Li
- Prenatal Diagnostic Center, Guangzhou Women and Children's Medical Center, Guangzhou, Guangdong, China
| | - Jun-Hui Wan
- Prenatal Diagnostic Center, Guangzhou Women and Children's Medical Center, Guangzhou, Guangdong, China
| | - Lian-Dong Zuo
- Scientific Research Department, Guangzhou Women and Children's Medical Center, Guangzhou, Guangdong, China
| | - Dong-Zhi Li
- Prenatal Diagnostic Center, Guangzhou Women and Children's Medical Center, Guangzhou, Guangdong, China.
| |
Collapse
|
13
|
Mechanistic and Therapeutic Insights into Ataxic Disorders with Pentanucleotide Expansions. Cells 2022; 11:cells11091567. [PMID: 35563872 PMCID: PMC9099484 DOI: 10.3390/cells11091567] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2022] [Revised: 05/02/2022] [Accepted: 05/05/2022] [Indexed: 02/01/2023] Open
Abstract
Pentanucleotide expansion diseases constitute a special class of neurodegeneration. The repeat expansions occur in non-coding regions, have likely arisen from Alu elements, and often result in autosomal dominant or recessive phenotypes with underlying cerebellar neuropathology. When transcribed (potentially bidirectionally), the expanded RNA forms complex secondary and tertiary structures that can give rise to RNA-mediated toxicity, including protein sequestration, pentapeptide synthesis, and mRNA dysregulation. Since several of these diseases have recently been discovered, our understanding of their pathological mechanisms is limited, and their therapeutic interventions underexplored. This review aims to highlight new in vitro and in vivo insights into these incurable diseases.
Collapse
|
14
|
Third-Generation Cytogenetic Analysis: Diagnostic Application of Long-Read Sequencing. J Mol Diagn 2022; 24:711-718. [PMID: 35526834 DOI: 10.1016/j.jmoldx.2022.03.013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2021] [Revised: 02/23/2022] [Accepted: 03/25/2022] [Indexed: 11/21/2022] Open
Abstract
Copy number variants (CNVs) play important roles in the pathogenesis of several genetic syndromes. Traditional and molecular karyotyping are considered the first-tier diagnostic tests to detect macroscopic and cryptic deletions/duplications. However, their time-consuming and laborious experimental protocols protract diagnostic times from 3 to 15 days. Nanopore sequencing has the ability to reduce time to results for the detection of CNVs with the same resolution of current state-of-the-art diagnostic tests. Nanopore sequencing was compared to molecular karyotyping for the detection of pathogenic CNVs of seven patients with previously diagnosed causative CNVs of different sizes and cellular fractions. Larger chromosomal anomalies included trisomy 21 and mosaic tetrasomy 12p. Among smaller CNVs, two genomic imbalances of 1.3 Mb, a small deletion of 170 kb, and two mosaic deletions (1.2 Mb and 408 kb) were tested. DNA was sequenced and data generated during runs were analyzed in online mode. All pathogenic CNVs were identified with detection time inversely proportional to size and cellular fraction. Aneuploidies were called after only 30 minutes of sequencing, whereas 30 hours were needed to call small CNVs. These results demonstrate the clinical utility of our approach that allows the molecular diagnosis of genomic disorders within a 30-minute to 30-hour time frame and its easy implementation as a routinary diagnostic tool.
Collapse
|
15
|
Marsili L, Duque KR, Bode RL, Kauffman MA, Espay AJ. Uncovering Essential Tremor Genetics: The Promise of Long-Read Sequencing. Front Neurol 2022; 13:821189. [PMID: 35401394 PMCID: PMC8983820 DOI: 10.3389/fneur.2022.821189] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2021] [Accepted: 02/25/2022] [Indexed: 12/23/2022] Open
Abstract
Long-read sequencing (LRS) technologies have been recently introduced to overcome intrinsic limitations of widely-used next-generation sequencing (NGS) technologies, namely the sequencing limited to short-read fragments (150–300 base pairs). Since its introduction, LRS has permitted many successes in unraveling hidden mutational mechanisms. One area in clinical neurology in need of rethinking as it applies to genetic mechanisms is essential tremor (ET). This disorder, among the most common in neurology, is a syndrome often exhibiting an autosomal dominant pattern of inheritance whose large phenotypic spectrum suggest a multitude of genetic etiologies. Exome sequencing has revealed the genetic etiology only in rare ET families (FUS, SORT1, SCN4A, NOS3, KCNS2, HAPLN4/BRAL2, and USP46). We hypothesize that a reason for this shortcoming may be non-classical genetic mechanism(s) underpinning ET, among them trinucleotide, tetranucleotide, or pentanucleotide repeat disorders. In support of this hypothesis, trinucleotide (e.g., GGC repeats in NOTCH2NLC) and pentanucleotide repeat disorders (e.g., ATTTC repeats in STARD7) have been revealed as pathogenic in patients with a past history of what has come to be referred to as “ET plus,” bilateral hand tremor associated with epilepsy and/or leukoencephalopathy. A systematic review of LRS in neurodegenerative disorders showed that 10 of the 22 (45%) genetic etiologies ascertained by LRS include tremor in their phenotypic spectrum, suggesting that future clinical applications of LRS for tremor disorders may uncover genetic subtypes of familial ET that have eluded NGS, particularly those with associated leukoencephalopathy or family history of epilepsy. LRS provides a pathway for potentially uncovering novel genes and genetic mechanisms, helping narrow the large proportion of “idiopathic” ET.
Collapse
Affiliation(s)
- Luca Marsili
- James J. and Joan A. Gardner Center for Parkinson's Disease and Movement Disorders, Department of Neurology, University of Cincinnati, Cincinnati, OH, United States
| | - Kevin R. Duque
- James J. and Joan A. Gardner Center for Parkinson's Disease and Movement Disorders, Department of Neurology, University of Cincinnati, Cincinnati, OH, United States
| | - Rachel L. Bode
- James J. and Joan A. Gardner Center for Parkinson's Disease and Movement Disorders, Department of Neurology, University of Cincinnati, Cincinnati, OH, United States
| | - Marcelo A. Kauffman
- Consultorio y Laboratorio de Neurogenética, Centro Universitario de Neurología José María Ramos Mejía, Buenos Aires, Argentina
| | - Alberto J. Espay
- James J. and Joan A. Gardner Center for Parkinson's Disease and Movement Disorders, Department of Neurology, University of Cincinnati, Cincinnati, OH, United States
- *Correspondence: Alberto J. Espay
| |
Collapse
|
16
|
Loureiro JR, Castro AF, Figueiredo AS, Silveira I. Molecular Mechanisms in Pentanucleotide Repeat Diseases. Cells 2022; 11:cells11020205. [PMID: 35053321 PMCID: PMC8773600 DOI: 10.3390/cells11020205] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2021] [Revised: 01/04/2022] [Accepted: 01/05/2022] [Indexed: 02/01/2023] Open
Abstract
The number of neurodegenerative diseases resulting from repeat expansion has increased extraordinarily in recent years. In several of these pathologies, the repeat can be transcribed in RNA from both DNA strands producing, at least, one toxic RNA repeat that causes neurodegeneration by a complex mechanism. Recently, seven diseases have been found caused by a novel intronic pentanucleotide repeat in distinct genes encoding proteins highly expressed in the cerebellum. These disorders are clinically heterogeneous being characterized by impaired motor function, resulting from ataxia or epilepsy. The role that apparently normal proteins from these mutant genes play in these pathologies is not known. However, recent advances in previously known spinocerebellar ataxias originated by abnormal non-coding pentanucleotide repeats point to a gain of a toxic function by the pathogenic repeat-containing RNA that abnormally forms nuclear foci with RNA-binding proteins. In cells, RNA foci have been shown to be formed by phase separation. Moreover, the field of repeat expansions has lately achieved an extraordinary progress with the discovery that RNA repeats, polyglutamine, and polyalanine proteins are crucial for the formation of nuclear membraneless organelles by phase separation, which is perturbed when they are expanded. This review will cover the amazing advances on repeat diseases.
Collapse
Affiliation(s)
- Joana R. Loureiro
- Genetics of Cognitive Dysfunction Laboratory, i3S- Instituto de Investigação e Inovação em Saúde, Universidade do Porto, 4200-135 Porto, Portugal; (J.R.L.); (A.F.C.); (A.S.F.)
- Institute for Molecular and Cell Biology, Universidade do Porto, 4200-135 Porto, Portugal
| | - Ana F. Castro
- Genetics of Cognitive Dysfunction Laboratory, i3S- Instituto de Investigação e Inovação em Saúde, Universidade do Porto, 4200-135 Porto, Portugal; (J.R.L.); (A.F.C.); (A.S.F.)
- Institute for Molecular and Cell Biology, Universidade do Porto, 4200-135 Porto, Portugal
- Instituto de Ciências Biomédicas Abel Salazar, Universidade do Porto, 4050-313 Porto, Portugal
| | - Ana S. Figueiredo
- Genetics of Cognitive Dysfunction Laboratory, i3S- Instituto de Investigação e Inovação em Saúde, Universidade do Porto, 4200-135 Porto, Portugal; (J.R.L.); (A.F.C.); (A.S.F.)
- Institute for Molecular and Cell Biology, Universidade do Porto, 4200-135 Porto, Portugal
- Instituto de Ciências Biomédicas Abel Salazar, Universidade do Porto, 4050-313 Porto, Portugal
| | - Isabel Silveira
- Genetics of Cognitive Dysfunction Laboratory, i3S- Instituto de Investigação e Inovação em Saúde, Universidade do Porto, 4200-135 Porto, Portugal; (J.R.L.); (A.F.C.); (A.S.F.)
- Institute for Molecular and Cell Biology, Universidade do Porto, 4200-135 Porto, Portugal
- Correspondence: ; Tel.: +351-2240-8800
| |
Collapse
|
17
|
Grosso V, Marcolungo L, Maestri S, Alfano M, Lavezzari D, Iadarola B, Salviati A, Mariotti B, Botta A, D’Apice MR, Novelli G, Delledonne M, Rossato M. Characterization of FMR1 Repeat Expansion and Intragenic Variants by Indirect Sequence Capture. Front Genet 2021; 12:743230. [PMID: 34646309 PMCID: PMC8504923 DOI: 10.3389/fgene.2021.743230] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2021] [Accepted: 08/26/2021] [Indexed: 11/30/2022] Open
Abstract
Traditional methods for the analysis of repeat expansions, which underlie genetic disorders, such as fragile X syndrome (FXS), lack single-nucleotide resolution in repeat analysis and the ability to characterize causative variants outside the repeat array. These drawbacks can be overcome by long-read and short-read sequencing, respectively. However, the routine application of next-generation sequencing in the clinic requires target enrichment, and none of the available methods allows parallel analysis of long-DNA fragments using both sequencing technologies. In this study, we investigated the use of indirect sequence capture (Xdrop technology) coupled to Nanopore and Illumina sequencing to characterize FMR1, the gene responsible of FXS. We achieved the efficient enrichment (> 200×) of large target DNA fragments (~60-80 kbp) encompassing the entire FMR1 gene. The analysis of Xdrop-enriched samples by Nanopore long-read sequencing allowed the complete characterization of repeat lengths in samples with normal, pre-mutation, and full mutation status (> 1 kbp), and correctly identified repeat interruptions relevant for disease prognosis and transmission. Single-nucleotide variants (SNVs) and small insertions/deletions (indels) could be detected in the same samples by Illumina short-read sequencing, completing the mutational testing through the identification of pathogenic variants within the FMR1 gene, when no typical CGG repeat expansion is detected. The study successfully demonstrated the parallel analysis of repeat expansions and SNVs/indels in the FMR1 gene at single-nucleotide resolution by combining Xdrop enrichment with two next-generation sequencing approaches. With the appropriate optimization necessary for the clinical settings, the system could facilitate both the study of genotype-phenotype correlation in FXS and enable a more efficient diagnosis and genetic counseling for patients and their relatives.
Collapse
Affiliation(s)
- Valentina Grosso
- Department of Biotechnology, University of Verona, Verona, Italy
| | - Luca Marcolungo
- Department of Biotechnology, University of Verona, Verona, Italy
| | - Simone Maestri
- Department of Biotechnology, University of Verona, Verona, Italy
| | | | - Denise Lavezzari
- Department of Biotechnology, University of Verona, Verona, Italy
| | - Barbara Iadarola
- Department of Biotechnology, University of Verona, Verona, Italy
| | - Alessandro Salviati
- Department of Biotechnology, University of Verona, Verona, Italy
- GENARTIS srl, Verona, Italy
| | - Barbara Mariotti
- Department of Medicine, Section of General Pathology, University of Verona, Verona, Italy
| | - Annalisa Botta
- Department of Biomedicine and Prevention, Medical Genetics Section, University of Rome "Tor Vergata", Rome, Italy
| | | | - Giuseppe Novelli
- Department of Biomedicine and Prevention, Medical Genetics Section, University of Rome "Tor Vergata", Rome, Italy
- IRCCS Neuromed Mediterranean Neurological Institute, Pozzilli, Italy
- Department of Pharmacology, School of Medicine, University of Nevada, Reno, NV, United States
| | - Massimo Delledonne
- Department of Biotechnology, University of Verona, Verona, Italy
- GENARTIS srl, Verona, Italy
| | - Marzia Rossato
- Department of Biotechnology, University of Verona, Verona, Italy
- GENARTIS srl, Verona, Italy
| |
Collapse
|
18
|
Su Y, Fan L, Shi C, Wang T, Zheng H, Luo H, Zhang S, Hu Z, Fan Y, Dong Y, Yang J, Mao C, Xu Y. Deciphering Neurodegenerative Diseases Using Long-Read Sequencing. Neurology 2021; 97:423-433. [PMID: 34389649 PMCID: PMC8408508 DOI: 10.1212/wnl.0000000000012466] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2021] [Accepted: 06/23/2021] [Indexed: 11/15/2022] Open
Abstract
Neurodegenerative diseases exhibit chronic progressive lesions in the central and peripheral nervous systems with unclear causes. The search for pathogenic mutations in human neurodegenerative diseases has benefited from massively parallel short-read sequencers. However, genomic regions, including repetitive elements, especially with high/low GC content, are far beyond the capability of conventional approaches. Recently, long-read single-molecule DNA sequencing technologies have emerged and enabled researchers to study genomes, transcriptomes, and metagenomes at unprecedented resolutions. The identification of novel mutations in unresolved neurodegenerative disorders, the characterization of causative repeat expansions, and the direct detection of epigenetic modifications on naive DNA by virtue of long-read sequencers will further expand our understanding of neurodegenerative diseases. In this article, we review and compare 2 prevailing long-read sequencing technologies, Pacific Biosciences and Oxford Nanopore Technologies, and discuss their applications in neurodegenerative diseases.
Collapse
Affiliation(s)
- Yun Su
- Department of Neurology, The First Affiliated Hospital of Zhengzhou University, Zhengzhou University, Zhengzhou, Henan, P. R. China.,The Academy of Medical Sciences of Zhengzhou University, Zhengzhou University, Zhengzhou, Henan, P. R. China.,Henan Key Laboratory of Cerebrovascular Diseases, The First Affiliated Hospital of Zhengzhou University, Zhengzhou University, Zhengzhou, Henan, P. R. China
| | - Liyuan Fan
- Department of Neurology, The First Affiliated Hospital of Zhengzhou University, Zhengzhou University, Zhengzhou, Henan, P. R. China.,The Academy of Medical Sciences of Zhengzhou University, Zhengzhou University, Zhengzhou, Henan, P. R. China.,Henan Key Laboratory of Cerebrovascular Diseases, The First Affiliated Hospital of Zhengzhou University, Zhengzhou University, Zhengzhou, Henan, P. R. China
| | - Changhe Shi
- Department of Neurology, The First Affiliated Hospital of Zhengzhou University, Zhengzhou University, Zhengzhou, Henan, P. R. China.,The Academy of Medical Sciences of Zhengzhou University, Zhengzhou University, Zhengzhou, Henan, P. R. China.,Henan Key Laboratory of Cerebrovascular Diseases, The First Affiliated Hospital of Zhengzhou University, Zhengzhou University, Zhengzhou, Henan, P. R. China.,Sino-British Research Centre for Molecular Oncology, National Centre for International Research in Cell and Gene Therapy, School of Basic Medical Sciences, Academy of Medical Sciences, Zhengzhou University, Zhengzhou, Henan, P. R. China
| | - Tai Wang
- Department of Neurology, The First Affiliated Hospital of Zhengzhou University, Zhengzhou University, Zhengzhou, Henan, P. R. China.,The Academy of Medical Sciences of Zhengzhou University, Zhengzhou University, Zhengzhou, Henan, P. R. China
| | - Huimin Zheng
- Department of Neurology, The First Affiliated Hospital of Zhengzhou University, Zhengzhou University, Zhengzhou, Henan, P. R. China.,The Academy of Medical Sciences of Zhengzhou University, Zhengzhou University, Zhengzhou, Henan, P. R. China.,Henan Key Laboratory of Cerebrovascular Diseases, The First Affiliated Hospital of Zhengzhou University, Zhengzhou University, Zhengzhou, Henan, P. R. China
| | - Haiyang Luo
- Department of Neurology, The First Affiliated Hospital of Zhengzhou University, Zhengzhou University, Zhengzhou, Henan, P. R. China.,The Academy of Medical Sciences of Zhengzhou University, Zhengzhou University, Zhengzhou, Henan, P. R. China.,Henan Key Laboratory of Cerebrovascular Diseases, The First Affiliated Hospital of Zhengzhou University, Zhengzhou University, Zhengzhou, Henan, P. R. China
| | - Shuo Zhang
- Department of Neurology, The First Affiliated Hospital of Zhengzhou University, Zhengzhou University, Zhengzhou, Henan, P. R. China.,The Academy of Medical Sciences of Zhengzhou University, Zhengzhou University, Zhengzhou, Henan, P. R. China.,Henan Key Laboratory of Cerebrovascular Diseases, The First Affiliated Hospital of Zhengzhou University, Zhengzhou University, Zhengzhou, Henan, P. R. China
| | - Zhengwei Hu
- Department of Neurology, The First Affiliated Hospital of Zhengzhou University, Zhengzhou University, Zhengzhou, Henan, P. R. China.,The Academy of Medical Sciences of Zhengzhou University, Zhengzhou University, Zhengzhou, Henan, P. R. China.,Henan Key Laboratory of Cerebrovascular Diseases, The First Affiliated Hospital of Zhengzhou University, Zhengzhou University, Zhengzhou, Henan, P. R. China
| | - Yu Fan
- Department of Neurology, The First Affiliated Hospital of Zhengzhou University, Zhengzhou University, Zhengzhou, Henan, P. R. China.,The Academy of Medical Sciences of Zhengzhou University, Zhengzhou University, Zhengzhou, Henan, P. R. China.,Henan Key Laboratory of Cerebrovascular Diseases, The First Affiliated Hospital of Zhengzhou University, Zhengzhou University, Zhengzhou, Henan, P. R. China
| | - Yali Dong
- Department of Neurology, The First Affiliated Hospital of Zhengzhou University, Zhengzhou University, Zhengzhou, Henan, P. R. China.,The Academy of Medical Sciences of Zhengzhou University, Zhengzhou University, Zhengzhou, Henan, P. R. China.,Henan Key Laboratory of Cerebrovascular Diseases, The First Affiliated Hospital of Zhengzhou University, Zhengzhou University, Zhengzhou, Henan, P. R. China
| | - Jing Yang
- Department of Neurology, The First Affiliated Hospital of Zhengzhou University, Zhengzhou University, Zhengzhou, Henan, P. R. China.,Henan Key Laboratory of Cerebrovascular Diseases, The First Affiliated Hospital of Zhengzhou University, Zhengzhou University, Zhengzhou, Henan, P. R. China.,Institute of Neuroscience, Zhengzhou University, Zhengzhou, Henan, P. R. China
| | - Chengyuan Mao
- Department of Neurology, The First Affiliated Hospital of Zhengzhou University, Zhengzhou University, Zhengzhou, Henan, P. R. China.,The Academy of Medical Sciences of Zhengzhou University, Zhengzhou University, Zhengzhou, Henan, P. R. China.,Henan Key Laboratory of Cerebrovascular Diseases, The First Affiliated Hospital of Zhengzhou University, Zhengzhou University, Zhengzhou, Henan, P. R. China.,Sino-British Research Centre for Molecular Oncology, National Centre for International Research in Cell and Gene Therapy, School of Basic Medical Sciences, Academy of Medical Sciences, Zhengzhou University, Zhengzhou, Henan, P. R. China
| | - Yuming Xu
- Department of Neurology, The First Affiliated Hospital of Zhengzhou University, Zhengzhou University, Zhengzhou, Henan, P. R. China .,Henan Key Laboratory of Cerebrovascular Diseases, The First Affiliated Hospital of Zhengzhou University, Zhengzhou University, Zhengzhou, Henan, P. R. China.,Institute of Neuroscience, Zhengzhou University, Zhengzhou, Henan, P. R. China
| |
Collapse
|
19
|
Chintalaphani SR, Pineda SS, Deveson IW, Kumar KR. An update on the neurological short tandem repeat expansion disorders and the emergence of long-read sequencing diagnostics. Acta Neuropathol Commun 2021; 9:98. [PMID: 34034831 PMCID: PMC8145836 DOI: 10.1186/s40478-021-01201-x] [Citation(s) in RCA: 80] [Impact Index Per Article: 26.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2021] [Accepted: 05/17/2021] [Indexed: 12/14/2022] Open
Abstract
BACKGROUND Short tandem repeat (STR) expansion disorders are an important cause of human neurological disease. They have an established role in more than 40 different phenotypes including the myotonic dystrophies, Fragile X syndrome, Huntington's disease, the hereditary cerebellar ataxias, amyotrophic lateral sclerosis and frontotemporal dementia. MAIN BODY STR expansions are difficult to detect and may explain unsolved diseases, as highlighted by recent findings including: the discovery of a biallelic intronic 'AAGGG' repeat in RFC1 as the cause of cerebellar ataxia, neuropathy, and vestibular areflexia syndrome (CANVAS); and the finding of 'CGG' repeat expansions in NOTCH2NLC as the cause of neuronal intranuclear inclusion disease and a range of clinical phenotypes. However, established laboratory techniques for diagnosis of repeat expansions (repeat-primed PCR and Southern blot) are cumbersome, low-throughput and poorly suited to parallel analysis of multiple gene regions. While next generation sequencing (NGS) has been increasingly used, established short-read NGS platforms (e.g., Illumina) are unable to genotype large and/or complex repeat expansions. Long-read sequencing platforms recently developed by Oxford Nanopore Technology and Pacific Biosciences promise to overcome these limitations to deliver enhanced diagnosis of repeat expansion disorders in a rapid and cost-effective fashion. CONCLUSION We anticipate that long-read sequencing will rapidly transform the detection of short tandem repeat expansion disorders for both clinical diagnosis and gene discovery.
Collapse
Affiliation(s)
- Sanjog R. Chintalaphani
- School of Medicine, University of New South Wales, Sydney, 2052 Australia
- Kinghorn Centre for Clinical Genomics, Garvan Institute of Medical Research, Darlinghurst, NSW 2010 Australia
| | - Sandy S. Pineda
- Garvan-Weizmann Centre for Cellular Genomics, Garvan Institute of Medical Research, Darlinghurst, NSW 2010 Australia
- Brain and Mind Centre, University of Sydney, Camperdown, NSW 2050 Australia
| | - Ira W. Deveson
- Kinghorn Centre for Clinical Genomics, Garvan Institute of Medical Research, Darlinghurst, NSW 2010 Australia
- Faculty of Medicine, St Vincent’s Clinical School, University of New South Wales, Sydney, NSW 2010 Australia
| | - Kishore R. Kumar
- Kinghorn Centre for Clinical Genomics, Garvan Institute of Medical Research, Darlinghurst, NSW 2010 Australia
- Molecular Medicine Laboratory and Neurology Department, Central Clinical School, Concord Repatriation General Hospital, University of Sydney, Concord, NSW 2137 Australia
| |
Collapse
|
20
|
Rodríguez-Labrada R, Martins AC, Magaña JJ, Vazquez-Mojena Y, Medrano-Montero J, Fernandez-Ruíz J, Cisneros B, Teive H, McFarland KN, Saraiva-Pereira ML, Cerecedo-Zapata CM, Gomez CM, Ashizawa T, Velázquez-Pérez L, Jardim LB. Founder Effects of Spinocerebellar Ataxias in the American Continents and the Caribbean. THE CEREBELLUM 2021; 19:446-458. [PMID: 32086717 DOI: 10.1007/s12311-020-01109-7] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Spinocerebellar ataxias (SCAs) comprise a heterogeneous group of autosomal dominant disorders. The relative frequency of the different SCA subtypes varies broadly among different geographical and ethnic groups as result of genetic drifts. This review aims to provide an update regarding SCA founders in the American continents and the Caribbean as well as to discuss characteristics of these populations. Clusters of SCAs were detected in Eastern regions of Cuba for SCA2, in South Brazil for SCA3/MJD, and in Southeast regions of Mexico for SCA7. Prevalence rates were obtained and reached 154 (municipality of Báguano, Cuba), 166 (General Câmara, Brazil), and 423 (Tlaltetela, Mexico) patients/100,000 for SCA2, SCA3/MJD, and SCA7, respectively. In contrast, the scattered families with spinocerebellar ataxia type 10 (SCA10) reported all over North and South Americas have been associated to a common Native American ancestry that may have risen in East Asia and migrated to Americas 10,000 to 20,000 years ago. The comprehensive review showed that for each of these SCAs corresponded at least the development of one study group with a large production of scientific evidence often generalizable to all carriers of these conditions. Clusters of SCA populations in the American continents and the Caribbean provide unusual opportunity to gain insights into clinical and genetic characteristics of these disorders. Furthermore, the presence of large populations of patients living close to study centers can favor the development of meaningful clinical trials, which will impact on therapies and on quality of life of SCA carriers worldwide.
Collapse
Affiliation(s)
| | - Ana Carolina Martins
- Programa de Pós-Graduação em Genética e Biologia Molecular, Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre, RS, 91540-070, Brazil
| | - Jonathan J Magaña
- Department of Genetics, Laboratory of Genomic Medicine, National Rehabilitation Institute (INR-LGII), 14389, Mexico City, Mexico
| | - Yaimeé Vazquez-Mojena
- Centre for the Research and Rehabilitation of Hereditary Ataxias, 80100, Holguín, Cuba
| | | | - Juan Fernandez-Ruíz
- Departamento de Fisiología, Facultad de Medicina, Universidad Nacional Autonoma de Mexico, 04510, Mexico City, Mexico
| | - Bulmaro Cisneros
- Department of Genetics and Molecular Biology, Center of Research and Advanced Studies (CINVESTAV-IPN), 07360, Mexico City, Mexico
| | - Helio Teive
- Movement Disorders Unit, Neurology Service, Internal Medicine Department, Hospital de Clínicas Federal University of Paraná, Curitiba, PR, 80240-440, Brazil
| | | | - Maria Luiza Saraiva-Pereira
- Programa de Pós-Graduação em Genética e Biologia Molecular, Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre, RS, 91540-070, Brazil
- Hospital de Clínicas de Porto Alegre (HCPA), Porto Alegre, RS, 90035-903, Brazil
- Departamento de Bioquímica, Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre, RS, 90035-903, Brazil
| | - César M Cerecedo-Zapata
- Department of Genetics, Laboratory of Genomic Medicine, National Rehabilitation Institute (INR-LGII), 14389, Mexico City, Mexico
- Rehabilitation and Social Inclusion Center of Veracruz (CRIS-DIF), Xalapa, 91070, Veracruz, Mexico
| | | | - Tetsuo Ashizawa
- Program of Neuroscience, Houston Methodist Research Institute, Houston, TX, 77030, USA
| | - Luis Velázquez-Pérez
- Centre for the Research and Rehabilitation of Hereditary Ataxias, 80100, Holguín, Cuba.
- Cuban Academy of Sciences, 10100, La Havana, Cuba.
| | - Laura Bannach Jardim
- Programa de Pós-Graduação em Genética e Biologia Molecular, Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre, RS, 91540-070, Brazil
- Hospital de Clínicas de Porto Alegre (HCPA), Porto Alegre, RS, 90035-903, Brazil
- Departamento de Medicina Interna, Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre, RS, 90035-903, Brazil
| |
Collapse
|
21
|
The Role of Noncoding Variants in Heritable Disease. Trends Genet 2020; 36:880-891. [PMID: 32741549 DOI: 10.1016/j.tig.2020.07.004] [Citation(s) in RCA: 55] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2020] [Revised: 06/30/2020] [Accepted: 07/02/2020] [Indexed: 12/26/2022]
Abstract
The genetic basis of disease has largely focused on coding regions. However, it has become clear that a large proportion of the noncoding genome is functional and harbors genetic variants that contribute to disease etiology. Here, we review recent examples of inherited noncoding alterations that are responsible for Mendelian disorders or act to influence complex traits. We explore both rare and common genetic variants and discuss the wide range of mechanisms by which they affect gene regulation to promote disease. We also debate the challenges and progress associated with identifying and interpreting the functional and clinical significance of genetic variation in the context of the noncoding regulatory landscape.
Collapse
|
22
|
Mikheenko A, Bzikadze AV, Gurevich A, Miga KH, Pevzner PA. TandemTools: mapping long reads and assessing/improving assembly quality in extra-long tandem repeats. Bioinformatics 2020; 36:i75-i83. [PMID: 32657355 PMCID: PMC7355294 DOI: 10.1093/bioinformatics/btaa440] [Citation(s) in RCA: 33] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
MOTIVATION Extra-long tandem repeats (ETRs) are widespread in eukaryotic genomes and play an important role in fundamental cellular processes, such as chromosome segregation. Although emerging long-read technologies have enabled ETR assemblies, the accuracy of such assemblies is difficult to evaluate since there are no tools for their quality assessment. Moreover, since the mapping of error-prone reads to ETRs remains an open problem, it is not clear how to polish draft ETR assemblies. RESULTS To address these problems, we developed the TandemTools software that includes the TandemMapper tool for mapping reads to ETRs and the TandemQUAST tool for polishing ETR assemblies and their quality assessment. We demonstrate that TandemTools not only reveals errors in ETR assemblies but also improves the recently generated assemblies of human centromeres. AVAILABILITY AND IMPLEMENTATION https://github.com/ablab/TandemTools. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Alla Mikheenko
- Center for Algorithmic Biotechnology, Institute of Translational Biomedicine, Saint Petersburg State University, Saint Petersburg 199034, Russia
| | - Andrey V Bzikadze
- Graduate Program in Bioinformatics and Systems Biology, University of California, San Diego, CA 92093, USA
| | - Alexey Gurevich
- Center for Algorithmic Biotechnology, Institute of Translational Biomedicine, Saint Petersburg State University, Saint Petersburg 199034, Russia
| | - Karen H Miga
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, CA, USA
| | - Pavel A Pevzner
- Department of Computer Science and Engineering, University of California, San Diego, CA 92093, USA
| |
Collapse
|
23
|
Sturchio A, Marsili L, Mahajan A, Grimberg MB, Kauffman MA, Espay AJ. How have advances in genetic technology modified movement disorder nosology? Eur J Neurol 2020; 27:1461-1470. [PMID: 32356310 DOI: 10.1111/ene.14294] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2020] [Accepted: 04/27/2020] [Indexed: 01/03/2023]
Abstract
The role of genetics and its technological development have been fundamental in advancing the field of movement disorders, opening the door to precision medicine. Starting from the revolutionary discovery of the locus of the Huntington's disease gene, we review the milestones of genetic discoveries in movement disorders and their impact on clinical practice and research efforts. Before the 1980s, early techniques did not allow the identification of genetic alteration in complex diseases. Further advances increasingly defined a large number of pathogenic genetic alterations. Moreover, these techniques allowed epigenomic, transcriptomic and microbiome analyses. In the 2020s, these new technologies are poised to displace phenotype-based classifications towards a nosology based on genetic/biological data. Advances in genetic technologies are engineering a reversal of the phenotype-to-genotype order of nosology development, replacing convergent clinicopathological disease models with the genotypic divergence required for future precision medicine applications.
Collapse
Affiliation(s)
- A Sturchio
- Gardner Family Center for Parkinson's Disease and Movement Disorders, Department of Neurology, University of Cincinnati, Cincinnati, OH, USA
| | - L Marsili
- Gardner Family Center for Parkinson's Disease and Movement Disorders, Department of Neurology, University of Cincinnati, Cincinnati, OH, USA
| | - A Mahajan
- Gardner Family Center for Parkinson's Disease and Movement Disorders, Department of Neurology, University of Cincinnati, Cincinnati, OH, USA
| | - M B Grimberg
- Gardner Family Center for Parkinson's Disease and Movement Disorders, Department of Neurology, University of Cincinnati, Cincinnati, OH, USA
| | - M A Kauffman
- Consultorio y Laboratorio de Neurogenética, Centro Universitario de Neurología 'José María Ramos Mejía' y División Neurología, Hospital JM Ramos Mejía, Facultad de Medicina, UBA and Programa de Medicina de Precision y Genomica Clinica, Instituto de Investigaciones en Medicina Traslacional, Facultad de Ciencias Biomédicas, Universidad Austral-CONICET, Pilar, Argentina
| | - A J Espay
- Gardner Family Center for Parkinson's Disease and Movement Disorders, Department of Neurology, University of Cincinnati, Cincinnati, OH, USA
| |
Collapse
|
24
|
Ross JP, Dion PA, Rouleau GA. Exome sequencing in genetic disease: recent advances and considerations. F1000Res 2020; 9:F1000 Faculty Rev-336. [PMID: 32431803 PMCID: PMC7205110 DOI: 10.12688/f1000research.19444.1] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 04/30/2020] [Indexed: 12/14/2022] Open
Abstract
Over the past decade, exome sequencing (ES) has allowed significant advancements to the field of disease research. By targeting the protein-coding regions of the genome, ES combines the depth of knowledge on protein-altering variants with high-throughput data generation and ease of analysis. New discoveries continue to be made using ES, and medical science has benefitted both theoretically and clinically from its continued use. In this review, we describe recent advances and successes of ES in disease research. Through selected examples of recent publications, we explore how ES continues to be a valuable tool to find variants that might explain disease etiology or provide insight into the biology underlying the disease. We then discuss shortcomings of ES in terms of variant discoveries made by other sequencing technologies that would be missed because of the scope and techniques of ES. We conclude with a brief outlook on the future of ES, suggesting that although newer and more thorough sequencing methods will soon supplant ES, its results will continue to be useful for disease research.
Collapse
Affiliation(s)
- Jay P. Ross
- Department of Human Genetics, McGill University, 3640 University, Montréal, QC, H3A 0C7, Canada
- Montreal Neurological Institute and Hospital, McGill University, 3801 University, Montréal, QC, H3A 2B4, Canada
| | - Patrick A. Dion
- Montreal Neurological Institute and Hospital, McGill University, 3801 University, Montréal, QC, H3A 2B4, Canada
- Department of Neurology and Neurosurgery, McGill University, 3801 University, Montréal, QC, H3A 2B4, Canada
| | - Guy A. Rouleau
- Department of Human Genetics, McGill University, 3640 University, Montréal, QC, H3A 0C7, Canada
- Montreal Neurological Institute and Hospital, McGill University, 3801 University, Montréal, QC, H3A 2B4, Canada
- Department of Neurology and Neurosurgery, McGill University, 3801 University, Montréal, QC, H3A 2B4, Canada
| |
Collapse
|
25
|
Hashem V, Tiwari A, Bewick B, Teive HAG, Moscovich M, Schüele B, Bushara K, Bower M, Rasmussen A, Tsai YC, Clark T, McFarland K, Ashizawa T. Pulse-Field capillary electrophoresis of repeat-primed PCR amplicons for analysis of large repeats in Spinocerebellar Ataxia Type 10. PLoS One 2020; 15:e0228789. [PMID: 32160188 PMCID: PMC7065784 DOI: 10.1371/journal.pone.0228789] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2019] [Accepted: 01/23/2020] [Indexed: 12/13/2022] Open
Abstract
Large expansions of microsatellite DNA cause several neurological diseases. In Spinocerebellar ataxia type 10 (SCA10), the repeat interruptions change disease phenotype; an (ATTCC)n or a (ATCCT)n/(ATCCC)n interruption within the (ATTCT)n repeat is associated with the robust phenotype of ataxia and epilepsy while mostly pure (ATTCT)n may have reduced penetrance. Large repeat expansions of SCA10, and many other microsatellite expansions, can exceed 10,000 base pairs (bp) in size. Conventional next generation sequencing (NGS) technologies are ineffective in determining internal sequence contents or size of these expanded repeats. Using repeat primed PCR (RP-PCR) in conjunction with a high-sensitivity pulsed-field capillary electrophoresis fragment analyzer (FEMTO-Pulse, Agilent, Santa Clara, CA) (RP-FEMTO hereafter), we successfully determined sequence content of large expansion repeats in genomic DNA of SCA10 patients and transformed yeast artificial chromosomes containing SCA10 repeats. This RP-FEMTO is a simple and economical methodology which could complement emerging NGS for very long sequence reads such as Single Molecule, Real-Time (SMRT) and nanopore sequencing technologies.
Collapse
Affiliation(s)
- Vera Hashem
- Department of Neurology, Houston Methodist Research Institute, Houston, Texas, United States of America
| | - Anjana Tiwari
- Department of Neurology, Houston Methodist Research Institute, Houston, Texas, United States of America
| | - Brittani Bewick
- Department of Neurology, Houston Methodist Research Institute, Houston, Texas, United States of America
| | - Helio A. G. Teive
- Movement Disorders Unit, Neurology Service, Department of Internal Medicine, Hospital de Clinicas, Federal University of Paraná, Curitiba, PR, Brazil
| | - Mariana Moscovich
- Department of Internal Medicine, Federal University of Paraná, Curitiba, Paraná, Brazil
| | - Birgitt Schüele
- Department of Pathology, Stanford University School of Medicine, Stanford, California, United States of America
| | - Khalaf Bushara
- Department of Neurology, University of Minnesota Medical Center, Minneapolis, Minnesota, United States of America
| | - Matt Bower
- Institute of Human Genetics, University of Minnesota Medical Center, Minneapolis, Minnesota, United States of America
| | - Astrid Rasmussen
- Oklahoma Medical Research Foundation, Oklahoma City, OK, USA
- Instituto Nacional de Ciencias Medicas y Nutricion Salvador Zubiran, Mexico City, CDMX, Mexico
| | - Yu-Chih Tsai
- Pacific Biosciences of California, Inc, Menlo Park, California, United States of America
| | - Tyson Clark
- Pacific Biosciences of California, Inc, Menlo Park, California, United States of America
| | - Karen McFarland
- Department of Neurology and The McKnight Brain Institute, University of Florida, Gainesville, Florida, United States of America
| | - Tetsuo Ashizawa
- Department of Neurology, Houston Methodist Research Institute, Houston, Texas, United States of America
| |
Collapse
|
26
|
Chiara M, Zambelli F, Picardi E, Horner DS, Pesole G. Critical assessment of bioinformatics methods for the characterization of pathological repeat expansions with single-molecule sequencing data. Brief Bioinform 2019; 21:1971-1986. [DOI: 10.1093/bib/bbz099] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2019] [Revised: 06/22/2019] [Accepted: 07/09/2019] [Indexed: 01/19/2023] Open
Abstract
Abstract
A number of studies have reported the successful application of single-molecule sequencing technologies to the determination of the size and sequence of pathological expanded microsatellite repeats over the last 5 years. However, different custom bioinformatics pipelines were employed in each study, preventing meaningful comparisons and somewhat limiting the reproducibility of the results. In this review, we provide a brief summary of state-of-the-art methods for the characterization of expanded repeats alleles, along with a detailed comparison of bioinformatics tools for the determination of repeat length and sequence, using both real and simulated data. Our reanalysis of publicly available human genome sequencing data suggests a modest, but statistically significant, increase of the error rate of single-molecule sequencing technologies at genomic regions containing short tandem repeats. However, we observe that all the methods herein tested, irrespective of the strategy used for the analysis of the data (either based on the alignment or assembly of the reads), show high levels of sensitivity in both the detection of expanded tandem repeats and the estimation of the expansion size, suggesting that approaches based on single-molecule sequencing technologies are highly effective for the detection and quantification of tandem repeat expansions and contractions.
Collapse
Affiliation(s)
- Matteo Chiara
- Department of Biosciences, University of Milan, via Celoria 26, 20133 Milan, Italy
- Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies, National Research Council, Via Amendola e, 70126 Bari, Italy
| | - Federico Zambelli
- Department of Biosciences, University of Milan, via Celoria 26, 20133 Milan, Italy
- Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies, National Research Council, Via Amendola e, 70126 Bari, Italy
| | - Ernesto Picardi
- Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies, National Research Council, Via Amendola e, 70126 Bari, Italy
- Department of Biosciences, Biotechnology and Biopharmaceutics, University of Bari “A. Moro”, Via Orabona 4, 70126 Bari, Italy
| | - David S Horner
- Department of Biosciences, University of Milan, via Celoria 26, 20133 Milan, Italy
- Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies, National Research Council, Via Amendola e, 70126 Bari, Italy
| | - Graziano Pesole
- Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies, National Research Council, Via Amendola e, 70126 Bari, Italy
- Department of Biosciences, Biotechnology and Biopharmaceutics, University of Bari “A. Moro”, Via Orabona 4, 70126 Bari, Italy
| |
Collapse
|
27
|
Long-read sequencing for rare human genetic diseases. J Hum Genet 2019; 65:11-19. [PMID: 31558760 DOI: 10.1038/s10038-019-0671-8] [Citation(s) in RCA: 54] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2019] [Revised: 08/28/2019] [Accepted: 09/03/2019] [Indexed: 12/19/2022]
Abstract
During the past decade, the search for pathogenic mutations in rare human genetic diseases has involved huge efforts to sequence coding regions, or the entire genome, using massively parallel short-read sequencers. However, the approximate current diagnostic rate is <50% using these approaches, and there remain many rare genetic diseases with unknown cause. There may be many reasons for this, but one plausible explanation is that the responsible mutations are in regions of the genome that are difficult to sequence using conventional technologies (e.g., tandem-repeat expansion or complex chromosomal structural aberrations). Despite the drawbacks of high cost and a shortage of standard analytical methods, several studies have analyzed pathogenic changes in the genome using long-read sequencers. The results of these studies provide hope that further application of long-read sequencers to identify the causative mutations in unsolved genetic diseases may expand our understanding of the human genome and diseases. Such approaches may also be applied to molecular diagnosis and therapeutic strategies for patients with genetic diseases in the future.
Collapse
|
28
|
De Coster W, Van Broeckhoven C. Newest Methods for Detecting Structural Variations. Trends Biotechnol 2019; 37:973-982. [DOI: 10.1016/j.tibtech.2019.02.003] [Citation(s) in RCA: 31] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2018] [Revised: 02/08/2019] [Accepted: 02/11/2019] [Indexed: 01/28/2023]
|
29
|
Abstract
The spinocerebellar ataxias (SCAs) comprise more than 40 autosomal dominant neurodegenerative disorders that present principally with progressive ataxia. Within the past few years, studies of pathogenic mechanisms in the SCAs have led to the development of promising therapeutic strategies, especially for SCAs caused by polyglutamine-coding CAG repeats. Nucleotide-based gene-silencing approaches that target the first steps in the pathogenic cascade are one promising approach not only for polyglutamine SCAs but also for the many other SCAs caused by toxic mutant proteins or RNA. For these and other emerging therapeutic strategies, well-coordinated preparation is needed for fruitful clinical trials. To accomplish this goal, investigators from the United States and Europe are now collaborating to share data from their respective SCA cohorts. Increased knowledge of the natural history of SCAs, including of the premanifest and early symptomatic stages of disease, will improve the prospects for success in clinical trials of disease-modifying drugs. In addition, investigators are seeking validated clinical outcome measures that demonstrate responsiveness to changes in SCA populations. Findings suggest that MRI and magnetic resonance spectroscopy biomarkers will provide objective biological readouts of disease activity and progression, but more work is needed to establish disease-specific biomarkers that track target engagement in therapeutic trials. Together, these efforts suggest that the development of successful therapies for one or more SCAs is not far away.
Collapse
|
30
|
Ardui S, Ameur A, Vermeesch JR, Hestand MS. Single molecule real-time (SMRT) sequencing comes of age: applications and utilities for medical diagnostics. Nucleic Acids Res 2019; 46:2159-2168. [PMID: 29401301 PMCID: PMC5861413 DOI: 10.1093/nar/gky066] [Citation(s) in RCA: 396] [Impact Index Per Article: 79.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2017] [Accepted: 01/23/2018] [Indexed: 12/30/2022] Open
Abstract
Short read massive parallel sequencing has emerged as a standard diagnostic tool in the medical setting. However, short read technologies have inherent limitations such as GC bias, difficulties mapping to repetitive elements, trouble discriminating paralogous sequences, and difficulties in phasing alleles. Long read single molecule sequencers resolve these obstacles. Moreover, they offer higher consensus accuracies and can detect epigenetic modifications from native DNA. The first commercially available long read single molecule platform was the RS system based on PacBio's single molecule real-time (SMRT) sequencing technology, which has since evolved into their RSII and Sequel systems. Here we capsulize how SMRT sequencing is revolutionizing constitutional, reproductive, cancer, microbial and viral genetic testing.
Collapse
Affiliation(s)
- Simon Ardui
- Department of Human Genetics, KU Leuven, Leuven 3000, Belgium
| | - Adam Ameur
- Department of Immunology, Genetics and Pathology, Uppsala University, Science for Life Laboratory, Uppsala 75108, Sweden.,School of Public Health and Preventive Medicine, Monash University, Melbourne, Victoria, Australia
| | | | - Matthew S Hestand
- Department of Human Genetics, KU Leuven, Leuven 3000, Belgium.,Department of Clinical Genetics, VU University Medical Center, Amsterdam 1081 BT, The Netherlands
| |
Collapse
|
31
|
Mantere T, Kersten S, Hoischen A. Long-Read Sequencing Emerging in Medical Genetics. Front Genet 2019; 10:426. [PMID: 31134132 PMCID: PMC6514244 DOI: 10.3389/fgene.2019.00426] [Citation(s) in RCA: 226] [Impact Index Per Article: 45.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2018] [Accepted: 04/18/2019] [Indexed: 12/12/2022] Open
Abstract
The wide implementation of next-generation sequencing (NGS) technologies has revolutionized the field of medical genetics. However, the short read lengths of currently used sequencing approaches pose a limitation for the identification of structural variants, sequencing repetitive regions, phasing of alleles and distinguishing highly homologous genomic regions. These limitations may significantly contribute to the diagnostic gap in patients with genetic disorders who have undergone standard NGS, like whole exome or even genome sequencing. Now, the emerging long-read sequencing (LRS) technologies may offer improvements in the characterization of genetic variation and regions that are difficult to assess with the prevailing NGS approaches. LRS has so far mainly been used to investigate genetic disorders with previously known or strongly suspected disease loci. While these targeted approaches already show the potential of LRS, it remains to be seen whether LRS technologies can soon enable true whole genome sequencing routinely. Ultimately, this could allow the de novo assembly of individual whole genomes used as a generic test for genetic disorders. In this article, we summarize the current LRS-based research on human genetic disorders and discuss the potential of these technologies to facilitate the next major advancements in medical genetics.
Collapse
Affiliation(s)
- Tuomo Mantere
- Department of Human Genetics, Radboud University Medical Center, Nijmegen, Netherlands
- Laboratory of Cancer Genetics and Tumor Biology, Cancer and Translational Medicine Research Unit and Biocenter Oulu, University of Oulu, Oulu, Finland
| | - Simone Kersten
- Department of Human Genetics, Radboud University Medical Center, Nijmegen, Netherlands
- Department of Internal Medicine, Center for Infectious Diseases (RCI), Radboud University Medical Center, Nijmegen, Netherlands
- Radboud Institute for Molecular Life Sciences, Radboud University Medical Center, Nijmegen, Netherlands
| | - Alexander Hoischen
- Department of Human Genetics, Radboud University Medical Center, Nijmegen, Netherlands
- Department of Internal Medicine, Center for Infectious Diseases (RCI), Radboud University Medical Center, Nijmegen, Netherlands
- Radboud Institute for Molecular Life Sciences, Radboud University Medical Center, Nijmegen, Netherlands
| |
Collapse
|
32
|
Mitsuhashi S, Frith MC, Mizuguchi T, Miyatake S, Toyota T, Adachi H, Oma Y, Kino Y, Mitsuhashi H, Matsumoto N. Tandem-genotypes: robust detection of tandem repeat expansions from long DNA reads. Genome Biol 2019; 20:58. [PMID: 30890163 PMCID: PMC6425644 DOI: 10.1186/s13059-019-1667-6] [Citation(s) in RCA: 81] [Impact Index Per Article: 16.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2018] [Accepted: 03/01/2019] [Indexed: 01/03/2023] Open
Abstract
Tandemly repeated DNA is highly mutable and causes at least 31 diseases, but it is hard to detect pathogenic repeat expansions genome-wide. Here, we report robust detection of human repeat expansions from careful alignments of long but error-prone (PacBio and nanopore) reads to a reference genome. Our method is robust to systematic sequencing errors, inexact repeats with fuzzy boundaries, and low sequencing coverage. By comparing to healthy controls, we prioritize pathogenic expansions within the top 10 out of 700,000 tandem repeats in whole genome sequencing data. This may help to elucidate the many genetic diseases whose causes remain unknown.
Collapse
Affiliation(s)
- Satomi Mitsuhashi
- Department of Human Genetics, Yokohama City University Graduate School of Medicine, Fukuura 3-9, Kanazawa-ku, Yokohama, 236-0004, Japan.
| | - Martin C Frith
- Artificial Intelligence Research Center, National Institute of Advanced Industrial Science and Technology (AIST), 2-3-26 Aomi, Koto-ku, Tokyo, 135-0064, Japan.
- Graduate School of Frontier Sciences, University of Tokyo, Kashiwa, Chiba, Japan.
- Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), AIST, Shinjuku-ku, Tokyo, Japan.
| | - Takeshi Mizuguchi
- Department of Human Genetics, Yokohama City University Graduate School of Medicine, Fukuura 3-9, Kanazawa-ku, Yokohama, 236-0004, Japan
| | - Satoko Miyatake
- Department of Human Genetics, Yokohama City University Graduate School of Medicine, Fukuura 3-9, Kanazawa-ku, Yokohama, 236-0004, Japan
| | - Tomoko Toyota
- Department of Neurology, University of Occupational and Environmental Health School of Medicine, Kitakyushu, Fukuoka, Japan
| | - Hiroaki Adachi
- Department of Neurology, University of Occupational and Environmental Health School of Medicine, Kitakyushu, Fukuoka, Japan
| | - Yoko Oma
- Department of Liberal Arts, Faculty of Medicine, Saitama Medical University, Iruma, Saitama, Japan
| | - Yoshihiro Kino
- Department of Bioinformatics and Molecular Neuropathology, Meiji Pharmaceutical University, Kiyose, Tokyo, Japan
| | - Hiroaki Mitsuhashi
- Department of Applied Biochemistry, School of Engineering, Tokai University, Hiratsuka, Kanagawa, Japan
| | - Naomichi Matsumoto
- Department of Human Genetics, Yokohama City University Graduate School of Medicine, Fukuura 3-9, Kanazawa-ku, Yokohama, 236-0004, Japan
| |
Collapse
|
33
|
Single-Molecule Sequencing: Towards Clinical Applications. Trends Biotechnol 2019; 37:72-85. [DOI: 10.1016/j.tibtech.2018.07.013] [Citation(s) in RCA: 112] [Impact Index Per Article: 22.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2018] [Revised: 07/16/2018] [Accepted: 07/18/2018] [Indexed: 12/31/2022]
|
34
|
Zablotskaya A, Van Esch H, Verstrepen KJ, Froyen G, Vermeesch JR. Mapping the landscape of tandem repeat variability by targeted long read single molecule sequencing in familial X-linked intellectual disability. BMC Med Genomics 2018; 11:123. [PMID: 30567555 PMCID: PMC6299999 DOI: 10.1186/s12920-018-0446-7] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2018] [Accepted: 12/06/2018] [Indexed: 12/12/2022] Open
Abstract
BACKGROUND The etiology of more than half of all patients with X-linked intellectual disability remains elusive, despite array-based comparative genomic hybridization, whole exome or genome sequencing. Since short read massive parallel sequencing approaches do not allow the detection of larger tandem repeat expansions, we hypothesized that such expansions could be a hidden cause of X-linked intellectual disability. METHODS We selectively captured over 1800 tandem repeats on the X chromosome and characterized them by long read single molecule sequencing in 3 families with idiopathic X-linked intellectual disability. RESULTS In male DNA samples, full tandem repeat length sequences were obtained for 88-93% of the targets and up to 99.6% of the repeats with a moderate guanine-cytosine content. Read length and analysis pipeline allow to detect cases of > 900 bp tandem repeat expansion. In one family, one repeat expansion co-occurs with down-regulation of the neighboring MIR222 gene. This gene has previously been implicated in intellectual disability and is apparently linked to FMR1 and NEFH overexpression associated with neurological disorders. CONCLUSIONS This study demonstrates the power of single molecule sequencing to measure tandem repeat lengths and detect expansions, and suggests that tandem repeat mutations may be a hidden cause of X-linked intellectual disability.
Collapse
Affiliation(s)
- Alena Zablotskaya
- Department of Human Genetics and Center for Human Genetics, Laboratory for Cytogenetics and Genome Research, University Hospitals Leuven, KU Leuven, O&N I Herestraat 49 - box 606, 3000, Leuven, Belgium
| | - Hilde Van Esch
- Department of Human Genetics and Center for Human Genetics, Laboratory for Genetics of Cognition, University Hospitals Leuven, KU Leuven, O&N I Herestraat 49 - box 606, 3000, Leuven, Belgium
| | - Kevin J Verstrepen
- VIB Center for Microbiology and CMPG Lab for Genetics and Genomics, KU Leuven, Gaston Geenslaan 1 - box 2471, 3001, Leuven, Belgium
| | - Guy Froyen
- Clinical Biology, Laboratory for Molecular Diagnostics, Jessa Hospital, Stadsomvaart 11, 3500, Hasselt, Belgium
| | - Joris R Vermeesch
- Department of Human Genetics and Center for Human Genetics, Laboratory for Cytogenetics and Genome Research, University Hospitals Leuven, KU Leuven, O&N I Herestraat 49 - box 606, 3000, Leuven, Belgium.
| |
Collapse
|
35
|
Detecting a long insertion variant in SAMD12 by SMRT sequencing: implications of long-read whole-genome sequencing for repeat expansion diseases. J Hum Genet 2018; 64:191-197. [DOI: 10.1038/s10038-018-0551-7] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2018] [Revised: 11/12/2018] [Accepted: 11/27/2018] [Indexed: 01/21/2023]
|
36
|
Dastidar S, Ardui S, Singh K, Majumdar D, Nair N, Fu Y, Reyon D, Samara E, Gerli MF, Klein AF, De Schrijver W, Tipanee J, Seneca S, Tulalamba W, Wang H, Chai Y, In’t Veld P, Furling D, Tedesco F, Vermeesch JR, Joung JK, Chuah MK, VandenDriessche T. Efficient CRISPR/Cas9-mediated editing of trinucleotide repeat expansion in myotonic dystrophy patient-derived iPS and myogenic cells. Nucleic Acids Res 2018; 46:8275-8298. [PMID: 29947794 PMCID: PMC6144820 DOI: 10.1093/nar/gky548] [Citation(s) in RCA: 59] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2016] [Revised: 06/01/2018] [Accepted: 06/05/2018] [Indexed: 12/17/2022] Open
Abstract
CRISPR/Cas9 is an attractive platform to potentially correct dominant genetic diseases by gene editing with unprecedented precision. In the current proof-of-principle study, we explored the use of CRISPR/Cas9 for gene-editing in myotonic dystrophy type-1 (DM1), an autosomal-dominant muscle disorder, by excising the CTG-repeat expansion in the 3'-untranslated-region (UTR) of the human myotonic dystrophy protein kinase (DMPK) gene in DM1 patient-specific induced pluripotent stem cells (DM1-iPSC), DM1-iPSC-derived myogenic cells and DM1 patient-specific myoblasts. To eliminate the pathogenic gain-of-function mutant DMPK transcript, we designed a dual guide RNA based strategy that excises the CTG-repeat expansion with high efficiency, as confirmed by Southern blot and single molecule real-time (SMRT) sequencing. Correction efficiencies up to 90% could be attained in DM1-iPSC as confirmed at the clonal level, following ribonucleoprotein (RNP) transfection of CRISPR/Cas9 components without the need for selective enrichment. Expanded CTG repeat excision resulted in the disappearance of ribonuclear foci, a quintessential cellular phenotype of DM1, in the corrected DM1-iPSC, DM1-iPSC-derived myogenic cells and DM1 myoblasts. Consequently, the normal intracellular localization of the muscleblind-like splicing regulator 1 (MBNL1) was restored, resulting in the normalization of splicing pattern of SERCA1. This study validates the use of CRISPR/Cas9 for gene editing of repeat expansions.
Collapse
Affiliation(s)
- Sumitava Dastidar
- Department of Gene Therapy & Regenerative Medicine, Vrije Universiteit Brussel, Brussels 1090, Belgium
| | - Simon Ardui
- Department of Human Genetics, University of Leuven, Leuven 3000, Belgium
| | - Kshitiz Singh
- Department of Gene Therapy & Regenerative Medicine, Vrije Universiteit Brussel, Brussels 1090, Belgium
| | - Debanjana Majumdar
- Department of Gene Therapy & Regenerative Medicine, Vrije Universiteit Brussel, Brussels 1090, Belgium
| | - Nisha Nair
- Department of Gene Therapy & Regenerative Medicine, Vrije Universiteit Brussel, Brussels 1090, Belgium
| | - Yanfang Fu
- Molecular Pathology Unit, Center for Cancer Research and Center for Computational and Integrative Biology, Massachusetts General Hospital, Charlestown, MA02129, USA
- Department of Pathology, Harvard Medical School, Boston, MA 02115, USA
| | - Deepak Reyon
- Molecular Pathology Unit, Center for Cancer Research and Center for Computational and Integrative Biology, Massachusetts General Hospital, Charlestown, MA02129, USA
- Department of Pathology, Harvard Medical School, Boston, MA 02115, USA
| | - Ermira Samara
- Department of Gene Therapy & Regenerative Medicine, Vrije Universiteit Brussel, Brussels 1090, Belgium
| | - Mattia F M Gerli
- Department of Cell and Developmental Biology, University College London, London WC1E6DE, UK
| | - Arnaud F Klein
- Sorbonne Universités, INSERM, Association Institute de Myologie, Center de Recherche en Myologie, F-75013 , France
| | - Wito De Schrijver
- Department of Gene Therapy & Regenerative Medicine, Vrije Universiteit Brussel, Brussels 1090, Belgium
| | - Jaitip Tipanee
- Department of Gene Therapy & Regenerative Medicine, Vrije Universiteit Brussel, Brussels 1090, Belgium
| | - Sara Seneca
- Research Group Reproduction and Genetics (REGE), Center for Medical Genetics, UZ Brussels, Vrije Universiteit Brussel, Brussels 1090, Belgium
| | - Warut Tulalamba
- Department of Gene Therapy & Regenerative Medicine, Vrije Universiteit Brussel, Brussels 1090, Belgium
| | - Hui Wang
- Department of Gene Therapy & Regenerative Medicine, Vrije Universiteit Brussel, Brussels 1090, Belgium
| | - Yoke Chin Chai
- Department of Gene Therapy & Regenerative Medicine, Vrije Universiteit Brussel, Brussels 1090, Belgium
| | - Peter In’t Veld
- Department of Pathology, Vrije Universiteit Brussel, Brussels 1090, Belgium
| | - Denis Furling
- Sorbonne Universités, INSERM, Association Institute de Myologie, Center de Recherche en Myologie, F-75013 , France
| | | | - Joris R Vermeesch
- Department of Human Genetics, University of Leuven, Leuven 3000, Belgium
| | - J Keith Joung
- Molecular Pathology Unit, Center for Cancer Research and Center for Computational and Integrative Biology, Massachusetts General Hospital, Charlestown, MA02129, USA
- Department of Pathology, Harvard Medical School, Boston, MA 02115, USA
| | - Marinee K Chuah
- Department of Gene Therapy & Regenerative Medicine, Vrije Universiteit Brussel, Brussels 1090, Belgium
- Center for Molecular & Vascular Biology, Department of Cardiovascular Sciences, University of Leuven, Leuven 3000, Belgium
| | - Thierry VandenDriessche
- Department of Gene Therapy & Regenerative Medicine, Vrije Universiteit Brussel, Brussels 1090, Belgium
- Center for Molecular & Vascular Biology, Department of Cardiovascular Sciences, University of Leuven, Leuven 3000, Belgium
| |
Collapse
|
37
|
Genovese LM, Geraci F, Corrado L, Mangano E, D'Aurizio R, Bordoni R, Severgnini M, Manzini G, De Bellis G, D'Alfonso S, Pellegrini M. A Census of Tandemly Repeated Polymorphic Loci in Genic Regions Through the Comparative Integration of Human Genome Assemblies. Front Genet 2018; 9:155. [PMID: 29770143 PMCID: PMC5941971 DOI: 10.3389/fgene.2018.00155] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2017] [Accepted: 04/13/2018] [Indexed: 11/29/2022] Open
Abstract
Polymorphic Tandem Repeat (PTR) is a common form of polymorphism in the human genome. A PTR consists in a variation found in an individual (or in a population) of the number of repeating units of a Tandem Repeat (TR) locus of the genome with respect to the reference genome. Several phenotypic traits and diseases have been discovered to be strongly associated with or caused by specific PTR loci. PTR are further distinguished in two main classes: Short Tandem Repeats (STR) when the repeating unit has size up to 6 base pairs, and Variable Number Tandem Repeats (VNTR) for repeating units of size above 6 base pairs. As larger and larger populations are screened via high throughput sequencing projects, it becomes technically feasible and desirable to explore the association between PTR and a panoply of such traits and conditions. In order to facilitate these studies, we have devised a method for compiling catalogs of PTR from assembled genomes, and we have produced a catalog of PTR for genic regions (exons, introns, UTR and adjacent regions) of the human genome (GRCh38). We applied four different TR discovery software tools to uncover in the first phase 55,223,485 TR (after duplicate removal) in GRCh38, of which 373,173 were determined to be PTR in the second phase by comparison with five assembled human genomes. Of these, 263,266 are not included by state-of-the-art PTR catalogs. The new methodology is mainly based on a hierarchical and systematic application of alignment-based sequence comparisons to identify and measure the polymorphism of TR. While previous catalogs focus on the class of STR of small total size, we remove any size restrictions, aiming at the more general class of PTR, and we also target fuzzy TR by using specific detection tools. Similarly to other previous catalogs of human polymorphic loci, we focus our catalog toward applications in the discovery of disease-associated loci. Validation by cross-referencing with existing catalogs on common clinically-relevant loci shows good concordance. Overall, this proposed census of human PTR in genic regions is a shared resource (web accessible), complementary to existing catalogs, facilitating future genome-wide studies involving PTR.
Collapse
Affiliation(s)
| | - Filippo Geraci
- Institute for Informatics and Telematics of CNR, Pisa, Italy
| | - Lucia Corrado
- Department of Health Sciences, University of Eastern Piedmont Amedeo Avogadro, Novara, Italy
| | | | | | - Roberta Bordoni
- Institute for Biomedical Technologies of CNR, Segrate, Italy
| | | | - Giovanni Manzini
- Institute for Informatics and Telematics of CNR, Pisa, Italy.,Department of Science and Technological Innovation, University of Eastern Piedmont Amedeo Avogadro, Novara, Italy
| | | | - Sandra D'Alfonso
- Department of Health Sciences, University of Eastern Piedmont Amedeo Avogadro, Novara, Italy
| | | |
Collapse
|
38
|
Expression of Pyridoxal 5'-Phosphate-Independent Racemases Can Reduce 2-Aminoacrylate Stress in Salmonella enterica. J Bacteriol 2018; 200:JB.00751-17. [PMID: 29440254 DOI: 10.1128/jb.00751-17] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2017] [Accepted: 02/09/2018] [Indexed: 11/20/2022] Open
Abstract
The RidA protein (PF01042) from Salmonella enterica is a deaminase that quenches 2-aminoacrylate (2AA) and other reactive metabolites. In the absence of RidA, 2AA accumulates, damages cellular enzymes, and compromises the metabolic network. In vitro, RidA homologs from all domains of life deaminate 2AA, and RidA proteins from plants, bacteria, yeast, and humans complement the mutant phenotype of a ridA mutant strain of S. enterica In the present study, a methanogenic archaeon, Methanococcus maripaludis S2, was used to probe alternative mechanisms to restore metabolic balance. M. maripaludis MMP0739, which is annotated as an aspartate/glutamate racemase, complemented a ridA mutant strain and reduced the intracellular 2AA burden. The aspartate/glutamate racemase YgeA from Escherichia coli or S. enterica, when provided in trans, similarly restored wild-type growth to a ridA mutant. These results uncovered a new mechanism to ameliorate metabolic stress, and they suggest that direct quenching by RidA is not the only strategy to quench 2AA.IMPORTANCE 2-Aminoacrylate is an endogenously generated reactive metabolite that can damage cellular enzymes if not directly quenched by the conserved deaminase RidA. This study used an archaeon to identify a RidA-independent mechanism to prevent metabolic stress caused by 2AA. The data suggest that a gene product annotated as an aspartate/glutamate racemase (MMP0739) produces a metabolite that can quench 2AA, expanding our understanding of strategies available to quench reactive metabolites.
Collapse
|
39
|
Schüle B, McFarland KN, Lee K, Tsai YC, Nguyen KD, Sun C, Liu M, Byrne C, Gopi R, Huang N, Langston JW, Clark T, Gil FJJ, Ashizawa T. Parkinson's disease associated with pure ATXN10 repeat expansion. NPJ PARKINSONS DISEASE 2017; 3:27. [PMID: 28890930 PMCID: PMC5585403 DOI: 10.1038/s41531-017-0029-x] [Citation(s) in RCA: 50] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/30/2017] [Revised: 07/17/2017] [Accepted: 07/24/2017] [Indexed: 11/09/2022]
Abstract
Large, non-coding pentanucleotide repeat expansions of ATTCT in intron 9 of the ATXN10 gene typically cause progressive spinocerebellar ataxia with or without seizures and present neuropathologically with Purkinje cell loss resulting in symmetrical cerebellar atrophy. These ATXN10 repeat expansions can be interrupted by sequence motifs which have been attributed to seizures and are likely to act as genetic modifiers. We identified a Mexican kindred with multiple affected family members with ATXN10 expansions. Four affected family members showed clinical features of spinocerebellar ataxia type 10 (SCA10). However, one affected individual presented with early-onset levodopa-responsive parkinsonism, and one family member carried a large repeat ATXN10 expansion, but was clinically unaffected. To characterize the ATXN10 repeat, we used a novel technology of single-molecule real-time (SMRT) sequencing and CRISPR/Cas9-based capture. We sequenced the entire span of ~5.3-7.0 kb repeat expansions. The Parkinson's patient carried an ATXN10 expansion with no repeat interruption motifs as well as an unaffected sister. In the siblings with typical SCA10, we found a repeat pattern of ATTCC repeat motifs that have not been associated with seizures previously. Our data suggest that the absence of repeat interruptions is likely a genetic modifier for the clinical presentation of l-Dopa responsive parkinsonism, whereas repeat interruption motifs contribute clinically to epilepsy. Repeat interruptions are important genetic modifiers of the clinical phenotype in SCA10. Advanced sequencing techniques now allow to better characterize the underlying genetic architecture for determining accurate phenotype-genotype correlations.
Collapse
Affiliation(s)
- Birgitt Schüle
- Parkinson's Institute and Clinical Center, Sunnyvale, CA 94028 USA
| | - Karen N McFarland
- Center for Translational Research in Neurodegenerative Disease and The McKnight Brain Institute, University of Florida, College of Medicine, Department of Neurology, Gainesville, FL 32610 USA
| | - Kelsey Lee
- Parkinson's Institute and Clinical Center, Sunnyvale, CA 94028 USA
| | | | | | - Chao Sun
- Biogen Idec, Cambridge, MA 02142 USA
| | - Mei Liu
- Biogen Idec, Cambridge, MA 02142 USA
| | - Christie Byrne
- Parkinson's Institute and Clinical Center, Sunnyvale, CA 94028 USA
| | - Ramesh Gopi
- Silicon Valley Diagnostic Imaging, El Camino Hospital, Mountain View, CA 94040 USA
| | - Neng Huang
- Valley Parkinson Clinic, Los Gatos, CA 95032 USA
| | | | - Tyson Clark
- Pacific Biosciences, Menlo Park, CA 94025 USA
| | | | | |
Collapse
|
40
|
Liu Q, Zhang P, Wang D, Gu W, Wang K. Interrogating the "unsequenceable" genomic trinucleotide repeat disorders by long-read sequencing. Genome Med 2017; 9:65. [PMID: 28720120 PMCID: PMC5514472 DOI: 10.1186/s13073-017-0456-7] [Citation(s) in RCA: 54] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2016] [Accepted: 06/30/2017] [Indexed: 12/26/2022] Open
Abstract
Microsatellite expansion, such as trinucleotide repeat expansion (TRE), is known to cause a number of genetic diseases. Sanger sequencing and next-generation short-read sequencing are unable to interrogate TRE reliably. We developed a novel algorithm called RepeatHMM to estimate repeat counts from long-read sequencing data. Evaluation on simulation data, real amplicon sequencing data on two repeat expansion disorders, and whole-genome sequencing data generated by PacBio and Oxford Nanopore technologies showed superior performance over competing approaches. We concluded that long-read sequencing coupled with RepeatHMM can estimate repeat counts on microsatellites and can interrogate the “unsequenceable” genomic trinucleotide repeat disorders.
Collapse
Affiliation(s)
- Qian Liu
- Institute for Genomic Medicine, Columbia University, New York, NY, 10032, USA
| | - Peng Zhang
- Nextomics Biosciences, Wuhan, Hubei, 430000, China
| | - Depeng Wang
- Nextomics Biosciences, Wuhan, Hubei, 430000, China
| | - Weihong Gu
- China-Japan Friendship Hospital, Beijing, 100029, China
| | - Kai Wang
- Institute for Genomic Medicine, Columbia University, New York, NY, 10032, USA. .,Department of Biomedical Informatics, Columbia University, New York, NY, 10032, USA.
| |
Collapse
|
41
|
Landrian I, McFarland KN, Liu J, Mulligan CJ, Rasmussen A, Ashizawa T. Inheritance patterns of ATCCT repeat interruptions in spinocerebellar ataxia type 10 (SCA10) expansions. PLoS One 2017; 12:e0175958. [PMID: 28423040 PMCID: PMC5397023 DOI: 10.1371/journal.pone.0175958] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2016] [Accepted: 04/03/2017] [Indexed: 11/18/2022] Open
Abstract
Spinocerebellar ataxia type 10 (SCA10), an autosomal dominant cerebellar ataxia disorder, is caused by a non-coding ATTCT microsatellite repeat expansion in the ataxin 10 gene. In a subset of SCA10 families, the 5’-end of the repeat expansion contains a complex sequence of penta- and heptanucleotide interruption motifs which is followed by a pure tract of tandem ATCCT repeats of unknown length at its 3’-end. Intriguingly, expansions that carry these interruption motifs correlate with an epileptic seizure phenotype and are unstable despite the theory that interruptions are expected to stabilize expanded repeats. To examine the apparent contradiction of unstable, interruption-positive SCA10 expansion alleles and to determine whether the instability originates outside of the interrupted region, we sequenced approximately 1 kb of the 5’-end of SCA10 expansions using the ATCCT-PCR product in individuals across multiple generations from four SCA10 families. We found that the greatest instability within this region occurred in paternal transmissions of the allele in stretches of pure ATTCT motifs while the intervening interrupted sequences were stable. Overall, the ATCCT interruption changes by only one to three repeat units and therefore cannot account for the instability across the length of the disease allele. We conclude that the AT-rich interruptions locally stabilize the SCA10 expansion at the 5’-end but do not completely abolish instability across the entire span of the expansion. In addition, analysis of the interruption alleles across these families support a parsimonious single origin of the mutation with a shared distant ancestor.
Collapse
Affiliation(s)
- Ivette Landrian
- Department of Neurology, College of Medicine, and the McKnight Brain Institute, University of Florida, Gainesville, Florida, United States of America
| | - Karen N. McFarland
- Department of Neurology, College of Medicine, and the McKnight Brain Institute, University of Florida, Gainesville, Florida, United States of America
- Center for Translational Research in Neurodegenerative Disease, The University of Florida, Gainesville, Florida, United States of America
- * E-mail:
| | - Jilin Liu
- Department of Neurology, College of Medicine, and the McKnight Brain Institute, University of Florida, Gainesville, Florida, United States of America
| | - Connie J. Mulligan
- Department of Anthropology, College of Liberal Arts and Sciences, University of Florida, Gainesville, Florida, United States of America
- Genetics Institute, University of Florida, Gainesville, Florida, United States of America
| | - Astrid Rasmussen
- Arthritis and Clinical Immunology Research Program, Oklahoma Medical Research Foundation, Oklahoma City, Oklahoma, United States of America
| | - Tetsuo Ashizawa
- Department of Neurology, College of Medicine, and the McKnight Brain Institute, University of Florida, Gainesville, Florida, United States of America
| |
Collapse
|
42
|
Zhang N, Ashizawa T. RNA toxicity and foci formation in microsatellite expansion diseases. Curr Opin Genet Dev 2017; 44:17-29. [PMID: 28208060 DOI: 10.1016/j.gde.2017.01.005] [Citation(s) in RCA: 72] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2016] [Revised: 01/04/2017] [Accepted: 01/18/2017] [Indexed: 12/11/2022]
Abstract
More than 30 incurable neurological and neuromuscular diseases are caused by simple microsatellite expansions consisted of 3-6 nucleotides. These repeats can occur in non-coding regions and often result in a dominantly inherited disease phenotype that is characteristic of a toxic RNA gain-of-function. The expanded RNA adopts unusual secondary structures, sequesters various RNA binding proteins to form insoluble nuclear foci, and causes cellular defects at a multisystem level. Nuclear foci are dynamic in size, shape and colocalization of RNA binding proteins in different expansion diseases and tissue types. This review sets to provide new insights into the disease mechanisms of RNA toxicity and foci modulation, in light of recent advancement on bi-directional transcription, antisense RNA, repeat-associated non-ATG translation and beyond.
Collapse
Affiliation(s)
- Nan Zhang
- Neurosciences Research Program, Houston Methodist Research Institute, Houston, TX 77030, United States; Division of Cell and Molecular Biology, South Kensington Campus, Imperial College London, London SW7 2AZ, UK
| | - Tetsuo Ashizawa
- Neurosciences Research Program, Houston Methodist Research Institute, Houston, TX 77030, United States.
| |
Collapse
|
43
|
Jones L, Houlden H, Tabrizi SJ. DNA repair in the trinucleotide repeat disorders. Lancet Neurol 2017; 16:88-96. [PMID: 27979358 DOI: 10.1016/s1474-4422(16)30350-7] [Citation(s) in RCA: 65] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2016] [Revised: 09/22/2016] [Accepted: 10/27/2016] [Indexed: 02/07/2023]
Abstract
BACKGROUND Inherited diseases caused by unstable repeated DNA sequences are rare, but together represent a substantial cause of morbidity. Trinucleotide repeat disorders are severe, usually life-shortening, neurological disorders caused by nucleotide expansions, and most have no disease-modifying treatments. Longer repeat expansions are associated with genetic anticipation (ie, earlier disease onset in successive generations), although the differences in age at onset are not entirely accounted for by repeat length. Such phenotypic variation within disorders implies the existence of additional modifying factors in pathways that can potentially be modulated to treat disease. RECENT DEVELOPMENTS A genome-wide association study detected genetic modifiers of age at onset in Huntington's disease. Similar findings were seen in the spinocerebellar ataxias, indicating an association between DNA damage-response and repair pathways and the age at onset of disease. These studies also suggest that a common genetic mechanism modulates age at onset across polyglutamine diseases and could extend to other repeat expansion disorders. Genetic defects in DNA repair underlie other neurodegenerative disorders (eg, ataxia-telangiectasia), and DNA double-strand breaks are crucial to the modulation of early gene expression, which provides a mechanistic link between DNA repair and neurodegeneration. Mismatch and base-excision repair are important in the somatic expansion of repeated sequences in mouse models of trinucleotide repeat disorders, and somatic expansion of the expanded CAG tract in HTT correlates with age at onset of Huntington's disease and other trinucleotide repeat disorders. WHERE NEXT?: To understand the common genetic architecture of trinucleotide repeat disorders and any further genetic susceptibilities in individual disorders, genetic analysis with increased numbers of variants and sample sizes is needed, followed by sequencing approaches to define the phenotype-modifying variants. The findings must then be translated into cell biology analyses to elucidate the mechanisms through which the genetic variants operate. Genes that have roles in the DNA damage response could underpin a common DNA repeat-based mechanism and provide new therapeutic targets (and hence therapeutics) in multiple trinucleotide repeat disorders.
Collapse
Affiliation(s)
- Lesley Jones
- MRC Centre for Neuropsychiatric Genetics and Genomics, Institute of Psychological Medicine and Clinical Neurosciences, Cardiff University, Cardiff, UK.
| | - Henry Houlden
- Department of Molecular Neuroscience and MRC Centre for Neuromuscular Diseases, Institute of Neurology, Queen Square, London, UK
| | - Sarah J Tabrizi
- UCL Huntington's Disease Centre, Department of Neurodegenerative Disease, Institute of Neurology, University College London, London, UK
| |
Collapse
|
44
|
Genomic analyses of multidrug resistant Pseudomonas aeruginosa PA1 resequenced by single-molecule real-time sequencing. Biosci Rep 2016; 36:BSR20160282. [PMID: 27765811 PMCID: PMC5293553 DOI: 10.1042/bsr20160282] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2016] [Revised: 10/17/2016] [Accepted: 10/20/2016] [Indexed: 11/17/2022] Open
Abstract
As a third-generation sequencing (TGS) method, single-molecule real-time (SMRT) technology provides long read length, and it is well suited for resequencing projects and de novo assembly. In the present study, Pseudomonas aeruginosa PA1 was characterized and resequenced using SMRT technology. PA1 was also subjected to genomic, comparative and pan-genomic analyses. The multidrug resistant strain PA1 possesses a 6,498,072 bp genome and a sequence type of ST-782. The genome of PA1 was also visualized, and the results revealed the details of general genome annotations, virulence factors, regulatory proteins (RPs), secretion system proteins, type II toxin–antitoxin (T–A) pairs and genomic islands. Whole genome comparison analysis suggested that PA1 exhibits similarity to other P. aeruginosa strains but differs in terms of horizontal gene transfer (HGT) regions, such as prophages and genomic islands. Phylogenetic analyses based on 16S rRNA sequences demonstrated that PA1 is closely related to PAO1, and P. aeruginosa strains can be divided into two main groups. The pan-genome of P. aeruginosa consists of a core genome of approximately 4,000 genes and an accessory genome of at least 6,600 genes. The present study presented a detailed, visualized and comparative analysis of the PA1 genome, to enhance our understanding of this notorious pathogen.
Collapse
|