201
|
Zhang S, Liang F, Lei C, Wu J, Fu J, Yang Q, Luo X, Yu G, Wang D, Zhang Y, Lu D, Sun X, Liang Y, Xu C. Long-read sequencing and haplotype linkage analysis enabled preimplantation genetic testing for patients carrying pathogenic inversions. J Med Genet 2019; 56:741-749. [PMID: 31439719 PMCID: PMC6860410 DOI: 10.1136/jmedgenet-2018-105976] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2018] [Revised: 05/31/2019] [Accepted: 06/13/2019] [Indexed: 01/04/2023]
Abstract
Background Preimplantation genetic testing (PGT) has already been applied in patients known to carry chromosomal structural variants to improve the clinical outcome of assisted reproduction. However, conventional molecular techniques are not capable of reliably distinguishing embryos that carry balanced inversion from those with a normal karyotype. We aim to evaluate the use of long-read sequencing in combination with haplotype linkage analysis to address this challenge. Methods Long-read sequencing on Oxford Nanopore platform was employed to identify the precise positions of inversion break points in four patients. Comprehensive chromosomal screening and genome-wide haplotype linkage analysis were performed based on SNP microarray. The haplotypes, including the break point regions, the whole chromosomes involved in the inversion and the corresponding homologous chromosomes, were established using informative SNPs. Results All the inversion break points were successfully identified by long-read sequencing and validated by Sanger sequencing, and on average only 13 bp differences were observed between break points inferred by long-read sequencing and Sanger sequencing. Eighteen blastocysts were biopsied and tested, in which 10 were aneuploid or unbalanced and eight were diploid with normal or balanced inversion karyotypes. Diploid embryos were transferred back to patients, the predictive results of the current methodology were consistent with fetal karyotypes of amniotic fluid or cord blood. Conclusions Nanopore long-read sequencing is a powerful method to assay chromosomal inversions and identify exact break points. Identification of inversion break points combined with haplotype linkage analysis is an efficient strategy to distinguish embryos with normal or balanced inversion karyotypes, facilitating PGT applications.
Collapse
Affiliation(s)
- Shuo Zhang
- Shanghai Ji Ai Genetics & IVF Institute, Obstetrics and Gynecology Hospital, Fudan University, Shanghai, China.,Collaborative Innovation Center for Genetics and Development, State Key Laboratory of Genetic Engineering, School of Life Science, Fudan University, Shanghai, China
| | - Fan Liang
- GrandOmics Biosciences, Beijing, China
| | - Caixia Lei
- Shanghai Ji Ai Genetics & IVF Institute, Obstetrics and Gynecology Hospital, Fudan University, Shanghai, China
| | - Junping Wu
- Shanghai Ji Ai Genetics & IVF Institute, Obstetrics and Gynecology Hospital, Fudan University, Shanghai, China
| | - Jing Fu
- Shanghai Ji Ai Genetics & IVF Institute, Obstetrics and Gynecology Hospital, Fudan University, Shanghai, China
| | - Qi Yang
- GrandOmics Biosciences, Beijing, China
| | - Xiao Luo
- GrandOmics Biosciences, Beijing, China
| | | | | | - Yueping Zhang
- Shanghai Ji Ai Genetics & IVF Institute, Obstetrics and Gynecology Hospital, Fudan University, Shanghai, China
| | - Daru Lu
- Collaborative Innovation Center for Genetics and Development, State Key Laboratory of Genetic Engineering, School of Life Science, Fudan University, Shanghai, China
| | - Xiaoxi Sun
- Shanghai Ji Ai Genetics & IVF Institute, Obstetrics and Gynecology Hospital, Fudan University, Shanghai, China .,Shanghai Key Laboratory of Female Reproductive Endocrine Related Diseases, Obstetrics and Gynecology Hospital, Fudan University, Shanghai, China
| | - Yu Liang
- GrandOmics Biosciences, Beijing, China
| | - Congjian Xu
- Shanghai Ji Ai Genetics & IVF Institute, Obstetrics and Gynecology Hospital, Fudan University, Shanghai, China .,Shanghai Key Laboratory of Female Reproductive Endocrine Related Diseases, Obstetrics and Gynecology Hospital, Fudan University, Shanghai, China
| |
Collapse
|
202
|
Cao Y, Li J, Chu X, Liu H, Liu W, Liu D. Nanopore sequencing: a rapid solution for infectious disease epidemics. SCIENCE CHINA. LIFE SCIENCES 2019; 62:1101-1103. [PMID: 31372817 PMCID: PMC7089317 DOI: 10.1007/s11427-019-9596-x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 06/17/2019] [Accepted: 07/13/2019] [Indexed: 11/29/2022]
Affiliation(s)
- Ying Cao
- Center for Emerging Infectious Diseases, Wuhan Institute of Virology, Chinese Academy of Sciences, Wuhan, 430071, China
- Research Group of Computer Virology, Center for Bacteria and Virus Resources and Application, Wuhan Institute of Virology, Chinese Academy of Sciences, Wuhan, 430071, China
| | - Jing Li
- Institute of Microbiology, Chinese Academy of Sciences, Beijing, 100101, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Xin Chu
- Bureau of Major R&D Programs, Chinese Academy of Sciences, Beijing, 100864, China
| | - Haizhou Liu
- Center for Emerging Infectious Diseases, Wuhan Institute of Virology, Chinese Academy of Sciences, Wuhan, 430071, China
- Research Group of Computer Virology, Center for Bacteria and Virus Resources and Application, Wuhan Institute of Virology, Chinese Academy of Sciences, Wuhan, 430071, China
| | - Wenjun Liu
- Institute of Microbiology, Chinese Academy of Sciences, Beijing, 100101, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Di Liu
- Center for Emerging Infectious Diseases, Wuhan Institute of Virology, Chinese Academy of Sciences, Wuhan, 430071, China.
- Research Group of Computer Virology, Center for Bacteria and Virus Resources and Application, Wuhan Institute of Virology, Chinese Academy of Sciences, Wuhan, 430071, China.
- University of Chinese Academy of Sciences, Beijing, 100049, China.
| |
Collapse
|
203
|
Brinkmann A, Andrusch A, Belka A, Wylezich C, Höper D, Pohlmann A, Nordahl Petersen T, Lucas P, Blanchard Y, Papa A, Melidou A, Oude Munnink BB, Matthijnssens J, Deboutte W, Ellis RJ, Hansmann F, Baumgärtner W, van der Vries E, Osterhaus A, Camma C, Mangone I, Lorusso A, Marcacci M, Nunes A, Pinto M, Borges V, Kroneman A, Schmitz D, Corman VM, Drosten C, Jones TC, Hendriksen RS, Aarestrup FM, Koopmans M, Beer M, Nitsche A. Proficiency Testing of Virus Diagnostics Based on Bioinformatics Analysis of Simulated In Silico High-Throughput Sequencing Data Sets. J Clin Microbiol 2019; 57:e00466-19. [PMID: 31167846 PMCID: PMC6663916 DOI: 10.1128/jcm.00466-19] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2019] [Accepted: 05/28/2019] [Indexed: 12/22/2022] Open
Abstract
Quality management and independent assessment of high-throughput sequencing-based virus diagnostics have not yet been established as a mandatory approach for ensuring comparable results. The sensitivity and specificity of viral high-throughput sequence data analysis are highly affected by bioinformatics processing using publicly available and custom tools and databases and thus differ widely between individuals and institutions. Here we present the results of the COMPARE [Collaborative Management Platform for Detection and Analyses of (Re-)emerging and Foodborne Outbreaks in Europe] in silico virus proficiency test. An artificial, simulated in silico data set of Illumina HiSeq sequences was provided to 13 different European institutes for bioinformatics analysis to identify viral pathogens in high-throughput sequence data. Comparison of the participants' analyses shows that the use of different tools, programs, and databases for bioinformatics analyses can impact the correct identification of viral sequences from a simple data set. The identification of slightly mutated and highly divergent virus genomes has been shown to be most challenging. Furthermore, the interpretation of the results, together with a fictitious case report, by the participants showed that in addition to the bioinformatics analysis, the virological evaluation of the results can be important in clinical settings. External quality assessment and proficiency testing should become an important part of validating high-throughput sequencing-based virus diagnostics and could improve the harmonization, comparability, and reproducibility of results. There is a need for the establishment of international proficiency testing, like that established for conventional laboratory tests such as PCR, for bioinformatics pipelines and the interpretation of such results.
Collapse
Affiliation(s)
- Annika Brinkmann
- Robert Koch Institute, Centre for Biological Threats and Special Pathogens 1, Berlin, Germany
| | - Andreas Andrusch
- Robert Koch Institute, Centre for Biological Threats and Special Pathogens 1, Berlin, Germany
| | - Ariane Belka
- Friedrich-Loeffler-Institut, Institute of Diagnostic Virology, Greifswald-Insel Riems, Germany
| | - Claudia Wylezich
- Friedrich-Loeffler-Institut, Institute of Diagnostic Virology, Greifswald-Insel Riems, Germany
| | - Dirk Höper
- Friedrich-Loeffler-Institut, Institute of Diagnostic Virology, Greifswald-Insel Riems, Germany
| | - Anne Pohlmann
- Friedrich-Loeffler-Institut, Institute of Diagnostic Virology, Greifswald-Insel Riems, Germany
| | - Thomas Nordahl Petersen
- Technical University of Denmark, National Food Institute, WHO Collaborating Center for Antimicrobial Resistance in Foodborne Pathogens and Genomics and European Union Reference Laboratory for Antimicrobial Resistance, Kongens Lyngby, Denmark
| | - Pierrick Lucas
- French Agency for Food, Environmental and Occupational Health and Safety, Laboratory of Ploufragan, Unit of Viral Genetics and Biosafety, Ploufragan, France
| | - Yannick Blanchard
- French Agency for Food, Environmental and Occupational Health and Safety, Laboratory of Ploufragan, Unit of Viral Genetics and Biosafety, Ploufragan, France
| | - Anna Papa
- Microbiology Department, Aristotle University of Thessaloniki, School of Medicine, Thessaloniki, Greece
| | - Angeliki Melidou
- Microbiology Department, Aristotle University of Thessaloniki, School of Medicine, Thessaloniki, Greece
| | - Bas B Oude Munnink
- Department of Viroscience, Erasmus Medical Centre, Rotterdam, The Netherlands
| | | | | | | | - Florian Hansmann
- Department of Pathology, University of Veterinary Medicine Hannover, Hannover, Germany
| | - Wolfgang Baumgärtner
- Department of Pathology, University of Veterinary Medicine Hannover, Hannover, Germany
| | - Erhard van der Vries
- Department of Infectious Diseases and Immunology, University of Utrecht, Utrecht, The Netherlands
| | | | - Cesare Camma
- Istituto Zooprofilattico Sperimentale dell'Abruzzo e Molise G. Caporale, National Reference Center for Whole Genome Sequencing of Microbial Pathogens: Database and Bioinformatic Analysis, Teramo, Italy
| | - Iolanda Mangone
- Istituto Zooprofilattico Sperimentale dell'Abruzzo e Molise G. Caporale, National Reference Center for Whole Genome Sequencing of Microbial Pathogens: Database and Bioinformatic Analysis, Teramo, Italy
| | - Alessio Lorusso
- Istituto Zooprofilattico Sperimentale dell'Abruzzo e Molise G. Caporale, National Reference Center for Whole Genome Sequencing of Microbial Pathogens: Database and Bioinformatic Analysis, Teramo, Italy
| | - Maurilia Marcacci
- Istituto Zooprofilattico Sperimentale dell'Abruzzo e Molise G. Caporale, National Reference Center for Whole Genome Sequencing of Microbial Pathogens: Database and Bioinformatic Analysis, Teramo, Italy
| | - Alexandra Nunes
- Bioinformatics Unit, Department of Infectious Diseases, National Institute of Health (INSA), Lisbon, Portugal
| | - Miguel Pinto
- Bioinformatics Unit, Department of Infectious Diseases, National Institute of Health (INSA), Lisbon, Portugal
| | - Vítor Borges
- Bioinformatics Unit, Department of Infectious Diseases, National Institute of Health (INSA), Lisbon, Portugal
| | - Annelies Kroneman
- National Institute for Public Health and the Environment, Bilthoven, The Netherlands
| | - Dennis Schmitz
- Department of Viroscience, Erasmus Medical Centre, Rotterdam, The Netherlands
- National Institute for Public Health and the Environment, Bilthoven, The Netherlands
| | - Victor Max Corman
- Institute of Virology, Charité-Universitätsmedizin Berlin, Berlin, Germany
| | - Christian Drosten
- Institute of Virology, Charité-Universitätsmedizin Berlin, Berlin, Germany
| | - Terry C Jones
- Institute of Virology, Charité-Universitätsmedizin Berlin, Berlin, Germany
- Center for Pathogen Evolution, Department of Zoology, University of Cambridge, Cambridge, United Kingdom
| | - Rene S Hendriksen
- Technical University of Denmark, National Food Institute, WHO Collaborating Center for Antimicrobial Resistance in Foodborne Pathogens and Genomics and European Union Reference Laboratory for Antimicrobial Resistance, Kongens Lyngby, Denmark
| | - Frank M Aarestrup
- Technical University of Denmark, National Food Institute, WHO Collaborating Center for Antimicrobial Resistance in Foodborne Pathogens and Genomics and European Union Reference Laboratory for Antimicrobial Resistance, Kongens Lyngby, Denmark
| | - Marion Koopmans
- Department of Viroscience, Erasmus Medical Centre, Rotterdam, The Netherlands
| | - Martin Beer
- Friedrich-Loeffler-Institut, Institute of Diagnostic Virology, Greifswald-Insel Riems, Germany
| | - Andreas Nitsche
- Robert Koch Institute, Centre for Biological Threats and Special Pathogens 1, Berlin, Germany
| |
Collapse
|
204
|
|
205
|
Gabrieli T, Sharim H, Fridman D, Arbib N, Michaeli Y, Ebenstein Y. Selective nanopore sequencing of human BRCA1 by Cas9-assisted targeting of chromosome segments (CATCH). Nucleic Acids Res 2019; 46:e87. [PMID: 29788371 PMCID: PMC6101500 DOI: 10.1093/nar/gky411] [Citation(s) in RCA: 70] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2017] [Accepted: 05/04/2018] [Indexed: 12/15/2022] Open
Abstract
Next generation sequencing (NGS) is challenged by structural and copy number variations larger than the typical read length of several hundred bases. Third-generation sequencing platforms such as single-molecule real-time (SMRT) and nanopore sequencing provide longer reads and are able to characterize variations that are undetected in NGS data. Nevertheless, these technologies suffer from inherent low throughput which prohibits deep sequencing at reasonable cost without target enrichment. Here, we optimized Cas9-Assisted Targeting of CHromosome segments (CATCH) for nanopore sequencing of the breast cancer gene BRCA1. A 200 kb target containing the 80 kb BRCA1 gene body and its flanking regions was isolated intact from primary human peripheral blood cells, allowing long-range amplification and long-read nanopore sequencing. The target was enriched 237-fold and sequenced at up to 70× coverage on a single flow-cell. Overall performance and single-nucleotide polymorphism (SNP) calling were directly compared to Illumina sequencing of the same enriched sample, highlighting the benefits of CATCH for targeted sequencing. The CATCH enrichment scheme only requires knowledge of the target flanking sequence for Cas9 cleavage while providing contiguous data across both coding and non-coding sequence and holds promise for characterization of complex disease-related or highly variable genomic regions.
Collapse
Affiliation(s)
- Tslil Gabrieli
- School of Chemistry, Center for Nanoscience and Nanotechnology, Center for Light-Matter Interaction, Raymond and Beverly Sackler Faculty of Exact Sciences, Tel Aviv University, Tel Aviv, Israel
| | - Hila Sharim
- School of Chemistry, Center for Nanoscience and Nanotechnology, Center for Light-Matter Interaction, Raymond and Beverly Sackler Faculty of Exact Sciences, Tel Aviv University, Tel Aviv, Israel
| | - Dena Fridman
- School of Chemistry, Center for Nanoscience and Nanotechnology, Center for Light-Matter Interaction, Raymond and Beverly Sackler Faculty of Exact Sciences, Tel Aviv University, Tel Aviv, Israel
| | - Nissim Arbib
- Department of Obstetrics and Gynecology, Meir Hospital, Kfar Saba, Israel & Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
| | - Yael Michaeli
- School of Chemistry, Center for Nanoscience and Nanotechnology, Center for Light-Matter Interaction, Raymond and Beverly Sackler Faculty of Exact Sciences, Tel Aviv University, Tel Aviv, Israel
| | - Yuval Ebenstein
- School of Chemistry, Center for Nanoscience and Nanotechnology, Center for Light-Matter Interaction, Raymond and Beverly Sackler Faculty of Exact Sciences, Tel Aviv University, Tel Aviv, Israel
| |
Collapse
|
206
|
|
207
|
Abstract
Structural and quantitative chromosomal rearrangements, collectively referred to as structural variation (SV), contribute to a large extent to the genetic diversity of the human genome and thus are of high relevance for cancer genetics, rare diseases and evolutionary genetics. Recent studies have shown that SVs can not only affect gene dosage but also modulate basic mechanisms of gene regulation. SVs can alter the copy number of regulatory elements or modify the 3D genome by disrupting higher-order chromatin organization such as topologically associating domains. As a result of these position effects, SVs can influence the expression of genes distant from the SV breakpoints, thereby causing disease. The impact of SVs on the 3D genome and on gene expression regulation has to be considered when interpreting the pathogenic potential of these variant types.
Collapse
Affiliation(s)
- Malte Spielmann
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Darío G Lupiáñez
- Epigenetics and Sex Development Group, Berlin Institute for Medical Systems Biology, Max-Delbrück Center for Molecular Medicine, Berlin-Buch, Germany
| | - Stefan Mundlos
- Max Planck Institute for Molecular Genetics, RG Development & Disease, Berlin, Germany. .,Institute for Medical and Human Genetics, Charité Universitätsmedizin Berlin, Berlin, Germany.
| |
Collapse
|
208
|
Frith MC, Khan S. A survey of localized sequence rearrangements in human DNA. Nucleic Acids Res 2019; 46:1661-1673. [PMID: 29272440 PMCID: PMC5829575 DOI: 10.1093/nar/gkx1266] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2017] [Accepted: 12/07/2017] [Indexed: 01/29/2023] Open
Abstract
Genomes mutate and evolve in ways simple (substitution or deletion of bases) and complex (e.g. chromosome shattering). We do not fully understand what types of complex mutation occur, and we cannot routinely characterize arbitrarily-complex mutations in a high-throughput, genome-wide manner. Long-read DNA sequencing methods (e.g. PacBio, nanopore) are promising for this task, because one read may encompass a whole complex mutation. We describe an analysis pipeline to characterize arbitrarily-complex 'local' mutations, i.e. intrachromosomal mutations encompassed by one DNA read. We apply it to nanopore and PacBio reads from one human cell line (NA12878), and survey sequence rearrangements, both real and artifactual. Almost all the real rearrangements belong to recurring patterns or motifs: the most common is tandem multiplication (e.g. heptuplication), but there are also complex patterns such as localized shattering, which resembles DNA damage by radiation. Gene conversions are identified, including one between hemoglobin gamma genes. This study demonstrates a way to find intricate rearrangements with any number of duplications, deletions, and repositionings. It demonstrates a probability-based method to resolve ambiguous rearrangements involving highly similar sequences, as occurs in gene conversion. We present a catalog of local rearrangements in one human cell line, and show which rearrangement patterns occur.
Collapse
Affiliation(s)
- Martin C Frith
- Artificial Intelligence Research Center, AIST, Tokyo 135-0064, Japan.,Graduate School of Frontier Sciences, University of Tokyo, Chiba 277-8562, Japan.,Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), AIST, Tokyo 169-8555, Japan
| | - Sofia Khan
- Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), AIST, Tokyo 169-8555, Japan
| |
Collapse
|
209
|
De Coster W, De Rijk P, De Roeck A, De Pooter T, D'Hert S, Strazisar M, Sleegers K, Van Broeckhoven C. Structural variants identified by Oxford Nanopore PromethION sequencing of the human genome. Genome Res 2019; 29:1178-1187. [PMID: 31186302 PMCID: PMC6633254 DOI: 10.1101/gr.244939.118] [Citation(s) in RCA: 89] [Impact Index Per Article: 17.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2018] [Accepted: 06/06/2019] [Indexed: 01/17/2023]
Abstract
We sequenced the genome of the Yoruban reference individual NA19240 on the long-read sequencing platform Oxford Nanopore PromethION for evaluation and benchmarking of recently published aligners and germline structural variant calling tools, as well as a comparison with the performance of structural variant calling from short-read sequencing data. The structural variant caller Sniffles after NGMLR or minimap2 alignment provides the most accurate results, but additional confidence or sensitivity can be obtained by a combination of multiple variant callers. Sensitive and fast results can be obtained by minimap2 for alignment and a combination of Sniffles and SVIM for variant identification. We describe a scalable workflow for identification, annotation, and characterization of tens of thousands of structural variants from long-read genome sequencing of an individual or population. By discussing the results of this well-characterized reference individual, we provide an approximation of what can be expected in future long-read sequencing studies aiming for structural variant identification.
Collapse
Affiliation(s)
- Wouter De Coster
- Neurodegenerative Brain Diseases Group, Center for Molecular Neurology, VIB, 2610 Antwerp, Belgium
- Biomedical Sciences, University of Antwerp, 2610 Antwerp, Belgium
| | - Peter De Rijk
- Biomedical Sciences, University of Antwerp, 2610 Antwerp, Belgium
- Neuromics Support Facility, Center for Molecular Neurology, VIB, 2610 Antwerp, Belgium
| | - Arne De Roeck
- Neurodegenerative Brain Diseases Group, Center for Molecular Neurology, VIB, 2610 Antwerp, Belgium
- Biomedical Sciences, University of Antwerp, 2610 Antwerp, Belgium
| | - Tim De Pooter
- Biomedical Sciences, University of Antwerp, 2610 Antwerp, Belgium
- Neuromics Support Facility, Center for Molecular Neurology, VIB, 2610 Antwerp, Belgium
| | - Svenn D'Hert
- Biomedical Sciences, University of Antwerp, 2610 Antwerp, Belgium
- Neuromics Support Facility, Center for Molecular Neurology, VIB, 2610 Antwerp, Belgium
| | - Mojca Strazisar
- Biomedical Sciences, University of Antwerp, 2610 Antwerp, Belgium
- Neuromics Support Facility, Center for Molecular Neurology, VIB, 2610 Antwerp, Belgium
| | - Kristel Sleegers
- Neurodegenerative Brain Diseases Group, Center for Molecular Neurology, VIB, 2610 Antwerp, Belgium
- Biomedical Sciences, University of Antwerp, 2610 Antwerp, Belgium
| | - Christine Van Broeckhoven
- Neurodegenerative Brain Diseases Group, Center for Molecular Neurology, VIB, 2610 Antwerp, Belgium
- Biomedical Sciences, University of Antwerp, 2610 Antwerp, Belgium
| |
Collapse
|
210
|
Malmberg MM, Spangenberg GC, Daetwyler HD, Cogan NOI. Assessment of low-coverage nanopore long read sequencing for SNP genotyping in doubled haploid canola (Brassica napus L.). Sci Rep 2019; 9:8688. [PMID: 31213642 PMCID: PMC6582154 DOI: 10.1038/s41598-019-45131-0] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2019] [Accepted: 05/28/2019] [Indexed: 11/16/2022] Open
Abstract
Despite the high accuracy of short read sequencing (SRS), there are still issues with attaining accurate single nucleotide polymorphism (SNP) genotypes at low sequencing coverage and in highly duplicated genomes due to misalignment. Long read sequencing (LRS) systems, including the Oxford Nanopore Technologies (ONT) minION, have become popular options for de novo genome assembly and structural variant characterisation. The current high error rate often requires substantial post-sequencing correction and would appear to prevent the adoption of this system for SNP genotyping, but nanopore sequencing errors are largely random. Using low coverage ONT minION sequencing for genotyping of pre-validated SNP loci was examined in 9 canola doubled haploids. The minION genotypes were compared to the Illumina sequences to determine the extent and nature of genotype discrepancies between the two systems. The significant increase in read length improved alignment to the genome and the absence of classical SRS biases results in a more even representation of the genome. Sequencing errors are present, primarily in the form of heterozygous genotypes, which can be removed in completely homozygous backgrounds but requires more advanced bioinformatics in heterozygous genomes. Developments in this technology are promising for routine genotyping in the future.
Collapse
Affiliation(s)
- M M Malmberg
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, 5 Ring Road, Bundoora, Victoria, 3083, Australia.,School of Applied Systems Biology, La Trobe University, Bundoora, Victoria, 3086, Australia
| | - G C Spangenberg
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, 5 Ring Road, Bundoora, Victoria, 3083, Australia.,School of Applied Systems Biology, La Trobe University, Bundoora, Victoria, 3086, Australia
| | - H D Daetwyler
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, 5 Ring Road, Bundoora, Victoria, 3083, Australia.,School of Applied Systems Biology, La Trobe University, Bundoora, Victoria, 3086, Australia
| | - N O I Cogan
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, 5 Ring Road, Bundoora, Victoria, 3083, Australia. .,School of Applied Systems Biology, La Trobe University, Bundoora, Victoria, 3086, Australia.
| |
Collapse
|
211
|
Mantere T, Kersten S, Hoischen A. Long-Read Sequencing Emerging in Medical Genetics. Front Genet 2019; 10:426. [PMID: 31134132 PMCID: PMC6514244 DOI: 10.3389/fgene.2019.00426] [Citation(s) in RCA: 226] [Impact Index Per Article: 45.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2018] [Accepted: 04/18/2019] [Indexed: 12/12/2022] Open
Abstract
The wide implementation of next-generation sequencing (NGS) technologies has revolutionized the field of medical genetics. However, the short read lengths of currently used sequencing approaches pose a limitation for the identification of structural variants, sequencing repetitive regions, phasing of alleles and distinguishing highly homologous genomic regions. These limitations may significantly contribute to the diagnostic gap in patients with genetic disorders who have undergone standard NGS, like whole exome or even genome sequencing. Now, the emerging long-read sequencing (LRS) technologies may offer improvements in the characterization of genetic variation and regions that are difficult to assess with the prevailing NGS approaches. LRS has so far mainly been used to investigate genetic disorders with previously known or strongly suspected disease loci. While these targeted approaches already show the potential of LRS, it remains to be seen whether LRS technologies can soon enable true whole genome sequencing routinely. Ultimately, this could allow the de novo assembly of individual whole genomes used as a generic test for genetic disorders. In this article, we summarize the current LRS-based research on human genetic disorders and discuss the potential of these technologies to facilitate the next major advancements in medical genetics.
Collapse
Affiliation(s)
- Tuomo Mantere
- Department of Human Genetics, Radboud University Medical Center, Nijmegen, Netherlands
- Laboratory of Cancer Genetics and Tumor Biology, Cancer and Translational Medicine Research Unit and Biocenter Oulu, University of Oulu, Oulu, Finland
| | - Simone Kersten
- Department of Human Genetics, Radboud University Medical Center, Nijmegen, Netherlands
- Department of Internal Medicine, Center for Infectious Diseases (RCI), Radboud University Medical Center, Nijmegen, Netherlands
- Radboud Institute for Molecular Life Sciences, Radboud University Medical Center, Nijmegen, Netherlands
| | - Alexander Hoischen
- Department of Human Genetics, Radboud University Medical Center, Nijmegen, Netherlands
- Department of Internal Medicine, Center for Infectious Diseases (RCI), Radboud University Medical Center, Nijmegen, Netherlands
- Radboud Institute for Molecular Life Sciences, Radboud University Medical Center, Nijmegen, Netherlands
| |
Collapse
|
212
|
Kono N, Arakawa K. Nanopore sequencing: Review of potential applications in functional genomics. Dev Growth Differ 2019; 61:316-326. [DOI: 10.1111/dgd.12608] [Citation(s) in RCA: 164] [Impact Index Per Article: 32.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2019] [Revised: 03/26/2019] [Accepted: 03/26/2019] [Indexed: 12/17/2022]
Affiliation(s)
- Nobuaki Kono
- Institute for Advanced Biosciences Keio University Tsuruoka Yamagata Japan
| | - Kazuharu Arakawa
- Institute for Advanced Biosciences Keio University Tsuruoka Yamagata Japan
| |
Collapse
|
213
|
Bowden R, Davies RW, Heger A, Pagnamenta AT, de Cesare M, Oikkonen LE, Parkes D, Freeman C, Dhalla F, Patel SY, Popitsch N, Ip CLC, Roberts HE, Salatino S, Lockstone H, Lunter G, Taylor JC, Buck D, Simpson MA, Donnelly P. Sequencing of human genomes with nanopore technology. Nat Commun 2019; 10:1869. [PMID: 31015479 PMCID: PMC6478738 DOI: 10.1038/s41467-019-09637-5] [Citation(s) in RCA: 102] [Impact Index Per Article: 20.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2018] [Accepted: 03/19/2019] [Indexed: 12/17/2022] Open
Abstract
Whole-genome sequencing (WGS) is becoming widely used in clinical medicine in diagnostic contexts and to inform treatment choice. Here we evaluate the potential of the Oxford Nanopore Technologies (ONT) MinION long-read sequencer for routine WGS by sequencing the reference sample NA12878 and the genome of an individual with ataxia-pancytopenia syndrome and severe immune dysregulation. We develop and apply a novel reference panel-free analytical method to infer and then exploit phase information which improves single-nucleotide variant (SNV) calling performance from otherwise modest levels. In the clinical sample, we identify and directly phase two non-synonymous de novo variants in SAMD9L, (OMIM #159550) inferring that they lie on the same paternal haplotype. Whilst consensus SNV-calling error rates from ONT data remain substantially higher than those from short-read methods, we demonstrate the substantial benefits of analytical innovation. Ongoing improvements to base-calling and SNV-calling methodology must continue for nanopore sequencing to establish itself as a primary method for clinical WGS.
Collapse
Affiliation(s)
- Rory Bowden
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK
| | - Robert W Davies
- Genomics plc, Oxford, OX1 1JD, UK
- Program in Genetics and Genomic Biology and The Centre for Applied Genomics, Hospital for Sick Children, Toronto, M5G 0A4, Canada
| | | | - Alistair T Pagnamenta
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK
- National Institute for Health Research Oxford Biomedical Research Centre, Oxford, OX4 2PG, UK
| | | | - Laura E Oikkonen
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK
| | - Duncan Parkes
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK
| | - Colin Freeman
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK
| | - Fatima Dhalla
- Department of Clinical Immunology, Oxford University Hospitals, Oxford, OX3 9DU, UK
- Developmental Immunology Group, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, OX3 9DS, UK
| | - Smita Y Patel
- Department of Clinical Immunology, Oxford University Hospitals, Oxford, OX3 9DU, UK
- Clinical Immunology Group, National Institute for Health Research Oxford Biomedical Research Centre, Oxford, OX4 2PG, UK
| | - Niko Popitsch
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK
- National Institute for Health Research Oxford Biomedical Research Centre, Oxford, OX4 2PG, UK
- Children's Cancer Research Institute, St. Anna Kinderkrebsforschung, 1090, Vienna, Austria
| | - Camilla L C Ip
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK
| | - Hannah E Roberts
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK
| | - Silvia Salatino
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK
| | - Helen Lockstone
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK
| | - Gerton Lunter
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK
- Genomics plc, Oxford, OX1 1JD, UK
| | - Jenny C Taylor
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK
- National Institute for Health Research Oxford Biomedical Research Centre, Oxford, OX4 2PG, UK
| | - David Buck
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK
| | | | - Peter Donnelly
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK.
- Genomics plc, Oxford, OX1 1JD, UK.
- Department of Statistics, University of Oxford, Oxford, OX1 3LB, UK.
| |
Collapse
|
214
|
Zepeda-Mendoza CJ, Morton CC. The Iceberg under Water: Unexplored Complexity of Chromoanagenesis in Congenital Disorders. Am J Hum Genet 2019; 104:565-577. [PMID: 30951674 DOI: 10.1016/j.ajhg.2019.02.024] [Citation(s) in RCA: 41] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2018] [Accepted: 02/25/2019] [Indexed: 01/16/2023] Open
Abstract
Structural variation, composed of balanced and unbalanced genomic rearrangements, is an important contributor to human genetic diversity with prominent roles in somatic and congenital disease. At the nucleotide level, structural variants (SVs) have been shown to frequently harbor additional breakpoints and copy-number imbalances, a complexity predicted to emerge wholly as a single-cell division event. Chromothripsis, chromoplexy, and chromoanasynthesis, collectively referred to as chromoanagenesis, are three major mechanisms that explain the occurrence of complex germline and somatic SVs. While chromothripsis and chromoplexy have been shown to be key signatures of cancer, chromoanagenesis has been detected in numerous cases of developmental disease and phenotypically normal individuals. Such observations advocate for a deeper study of the polymorphic and pathogenic properties of complex germline SVs, many of which go undetected by traditional clinical molecular and cytogenetic methods. This review focuses on congenital chromoanagenesis, mechanisms leading to occurrence of these complex rearrangements, and their impact on chromosome organization and genome function. We highlight future applications of routine screening of complex and balanced SVs in the clinic, as these represent a potential and often neglected genetic disease source, a true "iceberg under water."
Collapse
Affiliation(s)
- Cinthya J Zepeda-Mendoza
- Division of Laboratory Genetics, Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN 55902, USA
| | - Cynthia C Morton
- Departments of Obstetrics and Gynecology and of Pathology, Brigham and Women's Hospital, Boston, MA 02115, USA; Harvard Medical School, Boston, MA 02115, USA; Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA; Manchester Center for Audiology and Deafness, School of Health Sciences, Manchester Academic Health Science Centre, University of Manchester, Manchester M13 9NT, UK.
| |
Collapse
|
215
|
Maroilley T, Tarailo-Graovac M. Uncovering Missing Heritability in Rare Diseases. Genes (Basel) 2019; 10:E275. [PMID: 30987386 PMCID: PMC6523881 DOI: 10.3390/genes10040275] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2019] [Revised: 03/29/2019] [Accepted: 04/01/2019] [Indexed: 12/14/2022] Open
Abstract
The problem of 'missing heritability' affects both common and rare diseases hindering: discovery, diagnosis, and patient care. The 'missing heritability' concept has been mainly associated with common and complex diseases where promising modern technological advances, like genome-wide association studies (GWAS), were unable to uncover the complete genetic mechanism of the disease/trait. Although rare diseases (RDs) have low prevalence individually, collectively they are common. Furthermore, multi-level genetic and phenotypic complexity when combined with the individual rarity of these conditions poses an important challenge in the quest to identify causative genetic changes in RD patients. In recent years, high throughput sequencing has accelerated discovery and diagnosis in RDs. However, despite the several-fold increase (from ~10% using traditional to ~40% using genome-wide genetic testing) in finding genetic causes of these diseases in RD patients, as is the case in common diseases-the majority of RDs are also facing the 'missing heritability' problem. This review outlines the key role of high throughput sequencing in uncovering genetics behind RDs, with a particular focus on genome sequencing. We review current advances and challenges of sequencing technologies, bioinformatics approaches, and resources.
Collapse
Affiliation(s)
- Tatiana Maroilley
- Departments of Biochemistry, Molecular Biology and Medical Genetics, Cumming School of Medicine, University of Calgary, Calgary, AB T2N 4N1, Canada.
- Alberta Children's Hospital Research Institute, University of Calgary, Calgary, AB T2N 4N1, Canada.
| | - Maja Tarailo-Graovac
- Departments of Biochemistry, Molecular Biology and Medical Genetics, Cumming School of Medicine, University of Calgary, Calgary, AB T2N 4N1, Canada.
- Alberta Children's Hospital Research Institute, University of Calgary, Calgary, AB T2N 4N1, Canada.
| |
Collapse
|
216
|
Córdova-Fletes C, Arámbula-Meraz E, Zarazúa-Niño AI, Madueña-Molina J, Sáinz-Barraza ME, González-García JR, León-León C, López-López G, Lugo-Guzmán EG, Picos-Cárdenas VJ. Familial 3-Way Balanced Translocation Causes 1q43→qter Loss and 10q25.2→qter Gain in a Severely Affected Male Toddler. Cytogenet Genome Res 2019; 157:231-238. [PMID: 30933949 DOI: 10.1159/000498998] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/29/2018] [Indexed: 11/19/2022] Open
Abstract
Constitutional complex chromosomal rearrangements (CCRs) are rare events that typically involve 2 or more chromosomes with at least 3 breakpoints and can result in normal or abnormal phenotypes depending on whether they disturb the euchromatic neighborhood. Here, we report an unusual balanced CCR involving chromosomes 1, 9, and 10 that causes an unbalanced karyotype in a severely affected toddler. The CCR was initially reported as a maternal 2-way translocation but was reclassified as a 3-way translocation after a microarray analysis of the propositus revealed the involvement of another chromosome not identified by G-banding in his phenotypically normal mother. FISH assays on maternal metaphase cells confirmed that the 1qter region of der(1) was translocated to der(10), whereas the 10qter segment was translocated to der(9), which in turn donated a segment to der(1). Subsequently, this CCR was also identified in her phenotypically normal father (the patient's grandfather). Thus, the patient inherited the previously unreported pathogenic combination of der(1) with a loss of 1q43→qter (including AKT3, ZBTB18, HNRNPU, and SMYD3) and der(9) with a gain of 10q25.2→qter (including FGFR2), leading to a compound phenotype with key features of the 1q43→qter deletion and distal 10q trisomy syndromes. Our observations suggest that the loss of SMYD3 accounts for cardiac defects in a subset of patients. Moreover, due to recurrent miscarriages in this family, our findings allowed improved genetic counseling.
Collapse
MESH Headings
- Abnormalities, Multiple/diagnostic imaging
- Abnormalities, Multiple/genetics
- Child, Preschool
- Chromosomes, Human, Pair 1/genetics
- Chromosomes, Human, Pair 10/genetics
- Chromosomes, Human, Pair 9/genetics
- Comparative Genomic Hybridization
- Genetic Counseling
- Histone-Lysine N-Methyltransferase/genetics
- Humans
- In Situ Hybridization, Fluorescence/methods
- Male
- Tomography, X-Ray Computed
- Translocation, Genetic
Collapse
|
217
|
Ly P, Brunner SF, Shoshani O, Kim DH, Lan W, Pyntikova T, Flanagan AM, Behjati S, Page DC, Campbell PJ, Cleveland DW. Chromosome segregation errors generate a diverse spectrum of simple and complex genomic rearrangements. Nat Genet 2019; 51:705-715. [PMID: 30833795 PMCID: PMC6441390 DOI: 10.1038/s41588-019-0360-8] [Citation(s) in RCA: 122] [Impact Index Per Article: 24.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2018] [Accepted: 01/23/2019] [Indexed: 01/05/2023]
Abstract
Cancer genomes are frequently characterized by numerical and structural chromosomal abnormalities. Here we integrated a centromere-specific inactivation approach with selection for a conditionally essential gene, a strategy termed CEN-SELECT, to systematically interrogate the structural landscape of mis-segregated chromosomes. We show that single-chromosome mis-segregation into a micronucleus can directly trigger a broad spectrum of genomic rearrangement types. Cytogenetic profiling revealed that mis-segregated chromosomes exhibit 120-fold-higher susceptibility to developing seven major categories of structural aberrations, including translocations, insertions, deletions, and complex reassembly through chromothripsis coupled to classical non-homologous end joining. Whole-genome sequencing of clonally propagated rearrangements identified random patterns of clustered breakpoints with copy-number alterations resulting in interspersed gene deletions and extrachromosomal DNA amplification events. We conclude that individual chromosome segregation errors during mitotic cell division are sufficient to drive extensive structural variations that recapitulate genomic features commonly associated with human disease.
Collapse
Affiliation(s)
- Peter Ly
- Ludwig Institute for Cancer Research, Department of Cellular and Molecular Medicine, University of California San Diego School of Medicine, La Jolla, CA, USA.
- Department of Pathology, University of Texas Southwestern Medical Center, Dallas, TX, USA.
| | | | - Ofer Shoshani
- Ludwig Institute for Cancer Research, Department of Cellular and Molecular Medicine, University of California San Diego School of Medicine, La Jolla, CA, USA
| | - Dong Hyun Kim
- Ludwig Institute for Cancer Research, Department of Cellular and Molecular Medicine, University of California San Diego School of Medicine, La Jolla, CA, USA
| | - Weijie Lan
- Ludwig Institute for Cancer Research, Department of Cellular and Molecular Medicine, University of California San Diego School of Medicine, La Jolla, CA, USA
| | | | - Adrienne M Flanagan
- University College London Cancer Institute, London, UK
- Department of Histopathology, Royal National Orthopaedic Hospital NHS Trust, Stanmore, UK
| | - Sam Behjati
- Wellcome Sanger Institute, Hinxton, UK
- Department of Paediatrics, University of Cambridge, Cambridge, UK
| | - David C Page
- Whitehead Institute for Biomedical Research, Cambridge, MA, USA
- Howard Hughes Medical Institute, Whitehead Institute for Biomedical Research, Cambridge, MA, USA
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA
| | | | - Don W Cleveland
- Ludwig Institute for Cancer Research, Department of Cellular and Molecular Medicine, University of California San Diego School of Medicine, La Jolla, CA, USA.
| |
Collapse
|
218
|
Parivesh A, Barseghyan H, Délot E, Vilain E. Translating genomics to the clinical diagnosis of disorders/differences of sex development. Curr Top Dev Biol 2019; 134:317-375. [PMID: 30999980 PMCID: PMC7382024 DOI: 10.1016/bs.ctdb.2019.01.005] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
The medical and psychosocial challenges faced by patients living with Disorders/Differences of Sex Development (DSD) and their families can be alleviated by a rapid and accurate diagnostic process. Clinical diagnosis of DSD is limited by a lack of standardization of anatomical and endocrine phenotyping and genetic testing, as well as poor genotype/phenotype correlation. Historically, DSD genes have been identified through positional cloning of disease-associated variants segregating in families and validation of candidates in animal and in vitro modeling of variant pathogenicity. Owing to the complexity of conditions grouped under DSD, genome-wide scanning methods are better suited for identifying disease causing gene variant(s) and providing a clinical diagnosis. Here, we review a number of established genomic tools (karyotyping, chromosomal microarrays and exome sequencing) used in clinic for DSD diagnosis, as well as emerging genomic technologies such as whole-genome (short-read) sequencing, long-read sequencing, and optical mapping used for novel DSD gene discovery. These, together with gene expression and epigenetic studies can potentiate the clinical diagnosis of DSD diagnostic rates and enhance the outcomes for patients and families.
Collapse
Affiliation(s)
- Abhinav Parivesh
- Center for Genetic Medicine Research, Children's National Medical Center, Washington, DC, United States
| | - Hayk Barseghyan
- Center for Genetic Medicine Research, Children's National Medical Center, Washington, DC, United States; Department of Genomics and Precision Medicine, The George Washington University, Washington, DC, United States
| | - Emmanuèle Délot
- Center for Genetic Medicine Research, Children's National Medical Center, Washington, DC, United States; Department of Genomics and Precision Medicine, The George Washington University, Washington, DC, United States.
| | - Eric Vilain
- Center for Genetic Medicine Research, Children's National Medical Center, Washington, DC, United States; Department of Genomics and Precision Medicine, The George Washington University, Washington, DC, United States.
| |
Collapse
|
219
|
Mitsuhashi S, Frith MC, Mizuguchi T, Miyatake S, Toyota T, Adachi H, Oma Y, Kino Y, Mitsuhashi H, Matsumoto N. Tandem-genotypes: robust detection of tandem repeat expansions from long DNA reads. Genome Biol 2019; 20:58. [PMID: 30890163 PMCID: PMC6425644 DOI: 10.1186/s13059-019-1667-6] [Citation(s) in RCA: 81] [Impact Index Per Article: 16.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2018] [Accepted: 03/01/2019] [Indexed: 01/03/2023] Open
Abstract
Tandemly repeated DNA is highly mutable and causes at least 31 diseases, but it is hard to detect pathogenic repeat expansions genome-wide. Here, we report robust detection of human repeat expansions from careful alignments of long but error-prone (PacBio and nanopore) reads to a reference genome. Our method is robust to systematic sequencing errors, inexact repeats with fuzzy boundaries, and low sequencing coverage. By comparing to healthy controls, we prioritize pathogenic expansions within the top 10 out of 700,000 tandem repeats in whole genome sequencing data. This may help to elucidate the many genetic diseases whose causes remain unknown.
Collapse
Affiliation(s)
- Satomi Mitsuhashi
- Department of Human Genetics, Yokohama City University Graduate School of Medicine, Fukuura 3-9, Kanazawa-ku, Yokohama, 236-0004, Japan.
| | - Martin C Frith
- Artificial Intelligence Research Center, National Institute of Advanced Industrial Science and Technology (AIST), 2-3-26 Aomi, Koto-ku, Tokyo, 135-0064, Japan.
- Graduate School of Frontier Sciences, University of Tokyo, Kashiwa, Chiba, Japan.
- Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), AIST, Shinjuku-ku, Tokyo, Japan.
| | - Takeshi Mizuguchi
- Department of Human Genetics, Yokohama City University Graduate School of Medicine, Fukuura 3-9, Kanazawa-ku, Yokohama, 236-0004, Japan
| | - Satoko Miyatake
- Department of Human Genetics, Yokohama City University Graduate School of Medicine, Fukuura 3-9, Kanazawa-ku, Yokohama, 236-0004, Japan
| | - Tomoko Toyota
- Department of Neurology, University of Occupational and Environmental Health School of Medicine, Kitakyushu, Fukuoka, Japan
| | - Hiroaki Adachi
- Department of Neurology, University of Occupational and Environmental Health School of Medicine, Kitakyushu, Fukuoka, Japan
| | - Yoko Oma
- Department of Liberal Arts, Faculty of Medicine, Saitama Medical University, Iruma, Saitama, Japan
| | - Yoshihiro Kino
- Department of Bioinformatics and Molecular Neuropathology, Meiji Pharmaceutical University, Kiyose, Tokyo, Japan
| | - Hiroaki Mitsuhashi
- Department of Applied Biochemistry, School of Engineering, Tokai University, Hiratsuka, Kanagawa, Japan
| | - Naomichi Matsumoto
- Department of Human Genetics, Yokohama City University Graduate School of Medicine, Fukuura 3-9, Kanazawa-ku, Yokohama, 236-0004, Japan
| |
Collapse
|
220
|
Characterization and evolutionary dynamics of complex regions in eukaryotic genomes. SCIENCE CHINA-LIFE SCIENCES 2019; 62:467-488. [PMID: 30810961 DOI: 10.1007/s11427-018-9458-0] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/01/2018] [Accepted: 11/05/2018] [Indexed: 01/07/2023]
Abstract
Complex regions in eukaryotic genomes are typically characterized by duplications of chromosomal stretches that often include one or more genes repeated in a tandem array or in relatively close proximity. Nevertheless, the repetitive nature of these regions, together with the often high sequence identity among repeats, have made complex regions particularly recalcitrant to proper molecular characterization, often being misassembled or completely absent in genome assemblies. This limitation has prevented accurate functional and evolutionary analyses of these regions. This is becoming increasingly relevant as evidence continues to support a central role for complex genomic regions in explaining human disease, developmental innovations, and ecological adaptations across phyla. With the advent of long-read sequencing technologies and suitable assemblers, the development of algorithms that can accommodate sample heterozygosity, and the adoption of a pangenomic-like view of these regions, accurate reconstructions of complex regions are now within reach. These reconstructions will finally allow for accurate functional and evolutionary studies of complex genomic regions, underlying the generation of genotype-phenotype maps of unprecedented resolution.
Collapse
|
221
|
Comprehensive structural variation genome map of individuals carrying complex chromosomal rearrangements. PLoS Genet 2019; 15:e1007858. [PMID: 30735495 PMCID: PMC6368290 DOI: 10.1371/journal.pgen.1007858] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2018] [Accepted: 11/28/2018] [Indexed: 11/19/2022] Open
Abstract
Complex chromosomal rearrangements (CCRs) are rearrangements involving more than two chromosomes or more than two breakpoints. Whole genome sequencing (WGS) allows for outstanding high resolution characterization on the nucleotide level in unique sequences of such rearrangements, but problems remain for mapping breakpoints in repetitive regions of the genome, which are known to be prone to rearrangements. Hence, multiple complementary WGS experiments are sometimes needed to solve the structures of CCRs. We have studied three individuals with CCRs: Case 1 and Case 2 presented with de novo karyotypically balanced, complex interchromosomal rearrangements (46,XX,t(2;8;15)(q35;q24.1;q22) and 46,XY,t(1;10;5)(q32;p12;q31)), and Case 3 presented with a de novo, extremely complex intrachromosomal rearrangement on chromosome 1. Molecular cytogenetic investigation revealed cryptic deletions in the breakpoints of chromosome 2 and 8 in Case 1, and on chromosome 10 in Case 2, explaining their clinical symptoms. In Case 3, 26 breakpoints were identified using WGS, disrupting five known disease genes. All rearrangements were subsequently analyzed using optical maps, linked-read WGS, and short-read WGS. In conclusion, we present a case series of three unique de novo CCRs where we by combining the results from the different technologies fully solved the structure of each rearrangement. The power in combining short-read WGS with long-molecule sequencing or optical mapping in these unique de novo CCRs in a clinical setting is demonstrated.
Collapse
|
222
|
Hyland CA, Roulis EV, Schoeman EM. Developments beyond blood group serology in the genomics era. Br J Haematol 2019; 184:897-911. [PMID: 30706459 DOI: 10.1111/bjh.15747] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
Abstract
Blood group serology and single nucleotide polymorphism-based genotyping platforms are accurate but do not provide a comprehensive cover for all 36 blood group systems and do not cover the antigen diversity observed among population groups. This review examines the extent to which genomics is shaping blood group serology. Resources for genomics include the Human Reference Genome Sequence assembly; curated blood group tables listing variants; public databases providing information on genetic variants from world-wide studies; and massively parallel sequencing technologies. Blood group genomic studies span the spectrum, from bioinformatic data mining of huge data sets containing whole genome and whole exome information to laboratory investigations utilising targeted sequencing approaches. Blood group predictions based on genome sequencing and genomic studies are proving accurate, and have shown utility in both research and reference settings. Overall, studies confirm the potential for blood group genomics to reshape donor and patient transfusion management strategies to provide more compatible blood transfusions.
Collapse
Affiliation(s)
- Catherine A Hyland
- Clinical Services and Research, Australian Red Cross Blood Service, Kelvin Grove, Queensland, Australia
| | - Eileen V Roulis
- Clinical Services and Research, Australian Red Cross Blood Service, Kelvin Grove, Queensland, Australia
| | - Elizna M Schoeman
- Clinical Services and Research, Australian Red Cross Blood Service, Kelvin Grove, Queensland, Australia
| |
Collapse
|
223
|
Piazza A, Heyer WD. Homologous Recombination and the Formation of Complex Genomic Rearrangements. Trends Cell Biol 2019; 29:135-149. [PMID: 30497856 PMCID: PMC6402879 DOI: 10.1016/j.tcb.2018.10.006] [Citation(s) in RCA: 58] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2018] [Revised: 10/28/2018] [Accepted: 10/29/2018] [Indexed: 12/13/2022]
Abstract
The maintenance of genome integrity involves multiple independent DNA damage avoidance and repair mechanisms. However, the origin and pathways of the focal chromosomal reshuffling phenomena collectively referred to as chromothripsis remain mechanistically obscure. We discuss here the role, mechanisms, and regulation of homologous recombination (HR) in the formation of simple and complex chromosomal rearrangements. We emphasize features of the recently characterized multi-invasion (MI)-induced rearrangement (MIR) pathway which uniquely amplifies the initial DNA damage. HR intermediates and cellular contexts that endanger genomic stability are discussed as well as the emerging roles of various classes of nucleases in the formation of genome rearrangements. Long-read sequencing and improved mapping of repeats should enable better appreciation of the significance of recombination in generating genomic rearrangements.
Collapse
Affiliation(s)
- Aurèle Piazza
- Department of Microbiology and Molecular Genetics, University of California, Davis, CA 95616, USA; Spatial Regulation of Genomes, Department of Genomes and Genetics, Centre National de la Recherche Scientifique (CNRS) Unité Mixte de Recherche 3525, Institut Pasteur, 75015 Paris, France
| | - Wolf-Dietrich Heyer
- Department of Microbiology and Molecular Genetics, University of California, Davis, CA 95616, USA; Department of Molecular and Cellular Biology, University of California, Davis, CA 95616, USA.
| |
Collapse
|
224
|
Abstract
A number of outstanding problems in genomics, such as identifying structural variations and sequencing through centromeres and telomeres, stand poised to benefit tremendously from emerging long-read genomics technologies such as nanopore sequencing and genome mapping in nanochannels. However, optimal application of these new genomics technologies requires facile methods for extracting long DNA from cells. These sample preparation tools should be amenable to automation and minimize fragmentation of the long DNA molecules by shear. We present one such approach in a poly(dimethylsiloxane) device, where gel-based high molecular weight DNA extraction and continuous flow purification in a 3D cell culture-inspired geometry is followed by electrophoretic extraction of the long DNA from the miniaturized gel. Molecular combing reveals that the device produces molecules that are typically in excess of 100 kilobase pairs in size, with the longest molecule extending up to 4 megabase pairs. The microfluidic format reduces the standard day-long and labor-intensive DNA extraction process to 4 hours, making it a promising prototype platform for routine long DNA sample preparation.
Collapse
Affiliation(s)
- Paridhi Agrawal
- Department of Chemical Engineering and Materials Science, University of Minnesota - Twin Cities, 421 Washington Ave. SE, Minneapolis, MN 55455, USA.
| | | |
Collapse
|
225
|
Single-Molecule Sequencing: Towards Clinical Applications. Trends Biotechnol 2019; 37:72-85. [DOI: 10.1016/j.tibtech.2018.07.013] [Citation(s) in RCA: 112] [Impact Index Per Article: 22.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2018] [Revised: 07/16/2018] [Accepted: 07/18/2018] [Indexed: 12/31/2022]
|
226
|
Kurtas NE, Xumerle L, Giussani U, Pansa A, Cardarelli L, Bertini V, Valetto A, Liehr T, Clara Bonaglia M, Errichiello E, Delledonne M, Zuffardi O. Insertional translocation involving an additional nonchromothriptic chromosome in constitutional chromothripsis: Rule or exception? Mol Genet Genomic Med 2018; 7:e00496. [PMID: 30565424 PMCID: PMC6393660 DOI: 10.1002/mgg3.496] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2018] [Revised: 09/06/2018] [Accepted: 10/02/2018] [Indexed: 01/17/2023] Open
Abstract
Background Chromothripsis, which is the local massive shattering of one or more chromosomes and their reassembly in a disordered array with frequent loss of some fragments, has been mainly reported in association with abnormal phenotypes. We report three unrelated healthy persons, two of which parenting a child with some degree of intellectual disability, carrying a chromothripsis involving respectively one, two, and three chromosomes, which was detected only after whole‐genome sequencing. Unexpectedly, in all three cases a fragment from one of the chromothripsed chromosomes resulted to be inserted within a nonchromothripsed one. Methods Conventional cytogenetic techniques, paired‐end whole‐genome sequencing, polymerase chain reaction, and Sanger sequencing were used to characterize complex rearrangements, copy‐number variations, and breakpoint sequences in all three families. Results In two families, one parent was carrier of a balanced chromothripsis causing in the index case a deletion and a noncontiguous duplication at 3q in case 1, and a t(6;14) translocation associated with interstitial 14q deletion in case 2. In the third family, an unbalanced chromothripsis involving chromosomes 6, 7, and 15 was inherited to the proband by the mosaic parent. In all three parents, the chromothripsis was concurrent with an insertional translocation of a portion of one of the chromothriptic chromosomes within a further chromosome that was not involved in the chromothripsis event. Conclusion Our findings show that (a) both simple and complex unbalanced rearrangements may result by the recombination of a cryptic parental balanced chromothripsis and that (b) insertional translocations are the spy of more complex rearrangements and not simply a three‐breakpoint event.
Collapse
Affiliation(s)
| | - Luciano Xumerle
- Personal Genomics srl, Department of Biotechnologies, University of Verona, Verona, Italy
| | | | | | | | | | | | - Thomas Liehr
- Institute of Human Genetics, Jena University Hospital, Jena, Germany
| | - Maria Clara Bonaglia
- Cytogenetics Laboratory, Scientific Institute, IRCCS Eugenio Medea, Lecco, Italy
| | | | - Massimo Delledonne
- Personal Genomics srl, Department of Biotechnologies, University of Verona, Verona, Italy
| | - Orsetta Zuffardi
- Department of Molecular Medicine, University of Pavia, Pavia, Italy
| |
Collapse
|
227
|
Jagannathan V, Gerber V, Rieder S, Tetens J, Thaller G, Drögemüller C, Leeb T. Comprehensive characterization of horse genome variation by whole-genome sequencing of 88 horses. Anim Genet 2018; 50:74-77. [PMID: 30525216 DOI: 10.1111/age.12753] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/24/2018] [Indexed: 11/29/2022]
Abstract
Whole-genome sequencing studies are vital to gain a thorough understanding of genomic variation. Here, we summarize the results of a whole-genome sequencing study comprising 88 horses and ponies from diverse breeds at 19.1× average coverage. The paired-end reads were mapped to the current EquCab3.0 horse reference genome assembly, and we identified approximately 23.5 million single nucleotide variants and 2.3 million short indel variants. Our dataset included at least 7 million variants that were not previously reported. On average, each individual horse genome carried ∼5.7 million single nucleotides and 0.8 million small indel variants with respect to the reference genome assembly. The variants were functionally annotated. We provide two examples for potentially deleterious recessive alleles that were identified in a heterozygous state in individual genome sequences. Appropriate management of such deleterious recessive alleles in horse breeding programs should help to improve fertility and reduce the prevalence of heritable diseases. This comprehensive dataset has been made publicly available, will represent a valuable resource for future horse genetic studies and supports the goal of accelerating the rates of genetic gain in domestic horse.
Collapse
Affiliation(s)
- V Jagannathan
- Institute of Genetics, Vetsuisse Faculty, University of Bern, 3001, Bern, Switzerland
| | - V Gerber
- Department of Clinical Veterinary Medicine, Swiss Institute of Equine Medicine, Vetsuisse Faculty, University of Bern, and Agroscope, Länggassstrasse 124, 3012, Bern, Switzerland
| | - S Rieder
- Swiss National Stud Farm, Agroscope, 1580, Avenches, Switzerland
| | - J Tetens
- Department of Animal Sciences, Functional Breeding Group, Georg-August University Göttingen, Burckhardtweg 2, 37077, Göttingen, Germany
| | - G Thaller
- Institute of Animal Breeding and Husbandry, Christian-Albrechts University Kiel, Hermann-Rodewald-Strasse 6, 24098, Kiel, Germany
| | - C Drögemüller
- Institute of Genetics, Vetsuisse Faculty, University of Bern, 3001, Bern, Switzerland
| | - T Leeb
- Institute of Genetics, Vetsuisse Faculty, University of Bern, 3001, Bern, Switzerland
| |
Collapse
|
228
|
Complex structural variants in Mendelian disorders: identification and breakpoint resolution using short- and long-read genome sequencing. Genome Med 2018; 10:95. [PMID: 30526634 PMCID: PMC6286558 DOI: 10.1186/s13073-018-0606-6] [Citation(s) in RCA: 93] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2018] [Accepted: 11/23/2018] [Indexed: 12/19/2022] Open
Abstract
Background Studies have shown that complex structural variants (cxSVs) contribute to human genomic variation and can cause Mendelian disease. We aimed to identify cxSVs relevant to Mendelian disease using short-read whole-genome sequencing (WGS), resolve the precise variant configuration and investigate possible mechanisms of cxSV formation. Methods We performed short-read WGS and analysis of breakpoint junctions to identify cxSVs in a cohort of 1324 undiagnosed rare disease patients. Long-read WGS and gene expression analysis were used to resolve one case. Results We identified three pathogenic cxSVs: a de novo duplication-inversion-inversion-deletion affecting ARID1B, a de novo deletion-inversion-duplication affecting HNRNPU and a homozygous deletion-inversion-deletion affecting CEP78. Additionally, a de novo duplication-inversion-duplication overlapping CDKL5 was resolved by long-read WGS demonstrating the presence of both a disrupted and an intact copy of CDKL5 on the same allele, and gene expression analysis showed both parental alleles of CDKL5 were expressed. Breakpoint analysis in all the cxSVs revealed both microhomology and longer repetitive elements. Conclusions Our results corroborate that cxSVs cause Mendelian disease, and we recommend their consideration during clinical investigations. We show that resolution of breakpoints can be critical to interpret pathogenicity and present evidence of replication-based mechanisms in cxSV formation. Electronic supplementary material The online version of this article (10.1186/s13073-018-0606-6) contains supplementary material, which is available to authorized users.
Collapse
|
229
|
Bamba M, Kawaguchi YW, Tsuchimatsu T. Plant adaptation and speciation studied by population genomic approaches. Dev Growth Differ 2018; 61:12-24. [DOI: 10.1111/dgd.12578] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2018] [Revised: 10/22/2018] [Accepted: 10/22/2018] [Indexed: 12/20/2022]
Affiliation(s)
- Masaru Bamba
- Department of Biology (Frontier Science Program); Graduate School of Science and Engineering; Chiba University; Chiba Japan
| | - Yawako W. Kawaguchi
- Department of Biology (Frontier Science Program); Graduate School of Science and Engineering; Chiba University; Chiba Japan
| | - Takashi Tsuchimatsu
- Department of Biology; Graduate School of Science; Chiba University; Chiba Japan
| |
Collapse
|
230
|
Hehir-Kwa JY, Tops BBJ, Kemmeren P. The clinical implementation of copy number detection in the age of next-generation sequencing. Expert Rev Mol Diagn 2018; 18:907-915. [PMID: 30221560 DOI: 10.1080/14737159.2018.1523723] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
INTRODUCTION The role of copy number variants (CNVs) in disease is now well established. In parallel NGS technologies, such as long-read technologies, there is continual development and data analysis methods continue to be refined. Clinical exome sequencing data is now a reality for many diagnostic laboratories in both congenital genetics and oncology. This provides the ability to detect and report both SNVs and structural variants, including CNVs, using a single assay for a wide range of patient cohorts. Areas covered: Currently, whole-genome sequencing is mainly restricted to research applications and clinical utility studies. Furthermore, detecting the full-size spectrum of CNVs as well as somatic events remains difficult for both exome and whole-genome sequencing. As a result, the full extent of genomic variants in an individual's genome is still largely unknown. Recently, new sequencing technologies have been introduced which maintain the long-range genomic context, aiding the detection of CNVs and structural variants. Expert commentary: The development of long-read sequencing promises to resolve many CNV and SV detection issues but is yet to become established. The current challenge for clinical CNV detection is how to fully exploit all the data which is generated by high throughput sequencing technologies.
Collapse
Affiliation(s)
- Jayne Y Hehir-Kwa
- a Princess Máxima Center for Pediatric Oncology , Utrecht , Netherlands
| | - Bastiaan B J Tops
- a Princess Máxima Center for Pediatric Oncology , Utrecht , Netherlands
| | - Patrick Kemmeren
- a Princess Máxima Center for Pediatric Oncology , Utrecht , Netherlands
| |
Collapse
|
231
|
Rang FJ, Kloosterman WP, de Ridder J. From squiggle to basepair: computational approaches for improving nanopore sequencing read accuracy. Genome Biol 2018; 19:90. [PMID: 30005597 PMCID: PMC6045860 DOI: 10.1186/s13059-018-1462-9] [Citation(s) in RCA: 338] [Impact Index Per Article: 56.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023] Open
Abstract
Nanopore sequencing is a rapidly maturing technology delivering long reads in real time on a portable instrument at low cost. Not surprisingly, the community has rapidly taken up this new way of sequencing and has used it successfully for a variety of research applications. A major limitation of nanopore sequencing is its high error rate, which despite recent improvements to the nanopore chemistry and computational tools still ranges between 5% and 15%. Here, we review computational approaches determining the nanopore sequencing error rate. Furthermore, we outline strategies for translation of raw sequencing data into base calls for detection of base modifications and for obtaining consensus sequences.
Collapse
Affiliation(s)
- Franka J Rang
- Department of Genetics, Center for Molecular Medicine, University Medical Center Utrecht, Utrecht University, 3584, CG, Utrecht, The Netherlands
| | - Wigard P Kloosterman
- Department of Genetics, Center for Molecular Medicine, University Medical Center Utrecht, Utrecht University, 3584, CG, Utrecht, The Netherlands.
| | - Jeroen de Ridder
- Department of Genetics, Center for Molecular Medicine, University Medical Center Utrecht, Utrecht University, 3584, CG, Utrecht, The Netherlands.
| |
Collapse
|
232
|
Beretta S, Patterson MD, Zaccaria S, Della Vedova G, Bonizzoni P. HapCHAT: adaptive haplotype assembly for efficiently leveraging high coverage in long reads. BMC Bioinformatics 2018; 19:252. [PMID: 29970002 PMCID: PMC6029272 DOI: 10.1186/s12859-018-2253-8] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2017] [Accepted: 06/18/2018] [Indexed: 01/08/2023] Open
Abstract
Background Haplotype assembly is the process of assigning the different alleles of the variants covered by mapped sequencing reads to the two haplotypes of the genome of a human individual. Long reads, which are nowadays cheaper to produce and more widely available than ever before, have been used to reduce the fragmentation of the assembled haplotypes since their ability to span several variants along the genome. These long reads are also characterized by a high error rate, an issue which may be mitigated, however, with larger sets of reads, when this error rate is uniform across genome positions. Unfortunately, current state-of-the-art dynamic programming approaches designed for long reads deal only with limited coverages. Results Here, we propose a new method for assembling haplotypes which combines and extends the features of previous approaches to deal with long reads and higher coverages. In particular, our algorithm is able to dynamically adapt the estimated number of errors at each variant site, while minimizing the total number of error corrections necessary for finding a feasible solution. This allows our method to significantly reduce the required computational resources, allowing to consider datasets composed of higher coverages. The algorithm has been implemented in a freely available tool, HapCHAT: Haplotype Assembly Coverage Handling by Adapting Thresholds. An experimental analysis on sequencing reads with up to 60 × coverage reveals improvements in accuracy and recall achieved by considering a higher coverage with lower runtimes. Conclusions Our method leverages the long-range information of sequencing reads that allows to obtain assembled haplotypes fragmented in a lower number of unphased haplotype blocks. At the same time, our method is also able to deal with higher coverages to better correct the errors in the original reads and to obtain more accurate haplotypes as a result. Availability HapCHAT is available at http://hapchat.algolab.euunder the GNU Public License (GPL).
Collapse
Affiliation(s)
- Stefano Beretta
- Department of Informatics, Systems, and Communication, University of Milano-Bicocca, Milan, Italy
| | - Murray D Patterson
- Department of Informatics, Systems, and Communication, University of Milano-Bicocca, Milan, Italy.
| | - Simone Zaccaria
- Department of Computer Science, Princeton University, Princeton, New Jersey, USA
| | - Gianluca Della Vedova
- Department of Informatics, Systems, and Communication, University of Milano-Bicocca, Milan, Italy
| | - Paola Bonizzoni
- Department of Informatics, Systems, and Communication, University of Milano-Bicocca, Milan, Italy
| |
Collapse
|
233
|
Gong L, Wong CH, Cheng WC, Tjong H, Menghi F, Ngan CY, Liu ET, Wei CL. Picky comprehensively detects high-resolution structural variants in nanopore long reads. Nat Methods 2018; 15:455-460. [PMID: 29713081 PMCID: PMC5990454 DOI: 10.1038/s41592-018-0002-6] [Citation(s) in RCA: 53] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2017] [Accepted: 03/14/2018] [Indexed: 12/11/2022]
Abstract
Acquired genomic structural variants (SVs) are major hallmarks of cancer genomes, but they are challenging to reconstruct from short-read sequencing data. Here we exploited the long reads of the nanopore platform using our customized pipeline, Picky ( https://github.com/TheJacksonLaboratory/Picky ), to reveal SVs of diverse architecture in a breast cancer model. We identified the full spectrum of SVs with superior specificity and sensitivity relative to short-read analyses, and uncovered repetitive DNA as the major source of variation. Examination of genome-wide breakpoints at nucleotide resolution uncovered micro-insertions as the common structural features associated with SVs. Breakpoint density across the genome is associated with the propensity for interchromosomal connectivity and was found to be enriched in promoters and transcribed regions of the genome. Furthermore, we observed an over-representation of reciprocal translocations from chromosomal double-crossovers through phased SVs. We demonstrate that Picky analysis is an effective tool for comprehensive detection of SVs in cancer genomes from long-read data.
Collapse
Affiliation(s)
- Liang Gong
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Chee-Hong Wong
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | | | - Harianto Tjong
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Francesca Menghi
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Chew Yee Ngan
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Edison T Liu
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Chia-Lin Wei
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA.
- China Medical University, Taichung, Taiwan.
| |
Collapse
|
234
|
Audebert C, Hot D, Caboche S. [Prospects for applications in human health of nanopore-based sequencing]. Med Sci (Paris) 2018; 34:319-325. [PMID: 29658474 DOI: 10.1051/medsci/20183404012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
High throughput sequencing has opened up new clinical opportunities moving towards a medicine of precision. Oncology, infectious diseases or human genomics, many applications have been developed in recent years. The introduction of a third generation of nanopore-based sequencing technology, addressing some of the weaknesses of the previous generation, heralds a new revolution. Portability, real time, long reads and marginal investment costs, these promising new technologies point to a new shift of paradigm. What are the perspectives opened up by nanopores for clinical applications?
Collapse
Affiliation(s)
- Christophe Audebert
- Gènes Diffusion, 3595, route de Tournai, 59501 Douai, France - Pegase-Biosciences, Institut Pasteur de Lille, 1, rue du professeur Calmette, 59019 Lille, France
| | - David Hot
- Univ Lille, CNRS, Inserm, CHU de Lille, Institut Pasteur de Lille, U1019-UMR 8204-CIIL-Centre d'Infection et d'Immunité de Lille, 1, rue du professeur Calmette, F-59000 Lille, France - Pegase-Biosciences, Institut Pasteur de Lille, 1, rue du professeur Calmette, 59019 Lille, France
| | - Ségolène Caboche
- Univ Lille, CNRS, Inserm, CHU de Lille, Institut Pasteur de Lille, U1019-UMR 8204-CIIL-Centre d'Infection et d'Immunité de Lille, 1, rue du professeur Calmette, F-59000 Lille, France - Pegase-Biosciences, Institut Pasteur de Lille, 1, rue du professeur Calmette, 59019 Lille, France
| |
Collapse
|
235
|
Chan EKF, Cameron DL, Petersen DC, Lyons RJ, Baldi BF, Papenfuss AT, Thomas DM, Hayes VM. Optical mapping reveals a higher level of genomic architecture of chained fusions in cancer. Genome Res 2018; 28:726-738. [PMID: 29618486 PMCID: PMC5932612 DOI: 10.1101/gr.227975.117] [Citation(s) in RCA: 33] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2017] [Accepted: 03/21/2018] [Indexed: 01/21/2023]
Abstract
Genomic rearrangements are common in cancer, with demonstrated links to disease progression and treatment response. These rearrangements can be complex, resulting in fusions of multiple chromosomal fragments and generation of derivative chromosomes. Although methods exist for detecting individual fusions, they are generally unable to reconstruct complex chained events. To overcome these limitations, we adopted a new optical mapping approach, allowing megabase-length genome maps to be reconstructed and rearranged genomes to be visualized without loss of integrity. Whole-genome mapping (Bionano Genomics) of a well-studied highly rearranged liposarcoma cell line resulted in 3338 assembled consensus genome maps, including 72 fusion maps. These fusion maps represent 112.3 Mb of highly rearranged genomic regions, illuminating the complex architecture of chained fusions, including content, order, orientation, and size. Spanning the junction of 147 chromosomal translocations, we found a total of 28 Mb of interspersed sequences that could not be aligned to the reference genome. Traversing these interspersed sequences using short-read sequencing breakpoint calls, we were able to identify and place 399 sequencing fragments within the optical mapping gaps, thus illustrating the complementary nature of optical mapping and short-read sequencing. We demonstrate that optical mapping provides a powerful new approach for capturing a higher level of complex genomic architecture, creating a scaffold for renewed interpretation of sequencing data of particular relevance to human cancer.
Collapse
Affiliation(s)
- Eva K F Chan
- Genomics and Epigenetics Division, Garvan Institute of Medical Research, New South Wales 2010, Australia.,St Vincent's Clinical School, University of New South Wales, New South Wales 2052, Australia
| | - Daniel L Cameron
- Bioinformatics Division, The Walter and Eliza Hall Institute of Medical Research, Victoria 3052, Australia.,Department of Medical Biology, University of Melbourne, Victoria 3010, Australia
| | - Desiree C Petersen
- Genomics and Epigenetics Division, Garvan Institute of Medical Research, New South Wales 2010, Australia.,St Vincent's Clinical School, University of New South Wales, New South Wales 2052, Australia
| | - Ruth J Lyons
- Genomics and Epigenetics Division, Garvan Institute of Medical Research, New South Wales 2010, Australia
| | - Benedetta F Baldi
- Genomics and Epigenetics Division, Garvan Institute of Medical Research, New South Wales 2010, Australia
| | - Anthony T Papenfuss
- Bioinformatics Division, The Walter and Eliza Hall Institute of Medical Research, Victoria 3052, Australia.,Department of Medical Biology, University of Melbourne, Victoria 3010, Australia.,Department of Mathematics and Statistics, University of Melbourne, Victoria 3010, Australia.,Sir Peter MacCallum Department of Oncology, University of Melbourne, Victoria 3010, Australia.,Bioinformatics and Cancer Genomics, Peter MacCallum Cancer Centre, Victoria 3002, Australia
| | - David M Thomas
- The Kinghorn Cancer Centre, Garvan Institute of Medical Research, New South Wales 2010, Australia.,Cancer Division, Garvan Institute of Medical Research, New South Wales 2010, Australia
| | - Vanessa M Hayes
- Genomics and Epigenetics Division, Garvan Institute of Medical Research, New South Wales 2010, Australia.,St Vincent's Clinical School, University of New South Wales, New South Wales 2052, Australia.,School of Health Systems and Public Health, University of Pretoria, Hatfield 0002, South Africa.,Central Clinical School, University of Sydney, New South Wales 2006, Australia
| |
Collapse
|
236
|
Bayega A, Fahiminiya S, Oikonomopoulos S, Ragoussis J. Current and Future Methods for mRNA Analysis: A Drive Toward Single Molecule Sequencing. Methods Mol Biol 2018; 1783:209-241. [PMID: 29767365 DOI: 10.1007/978-1-4939-7834-2_11] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
The transcriptome encompasses a range of species including messenger RNA, and other noncoding RNA such as rRNA, tRNA, and short and long noncoding RNAs. Due to the huge role played by mRNA in development and disease, several methods have been developed to sequence and characterize mRNA, with RNA sequencing (RNA-Seq) emerging as the current method of choice particularly for large high-throughput studies. Short-read RNA-Seq which involves sequencing of short cDNA fragments and computationally assembling them to reconstruct the transcriptome, or aligning them to a reference is the most widely used approach. However, due to inherent limitations of this approach in de novo transcriptome assembly and isoform quantification, long-read RNA-Seq approaches, which also happen to be single molecule sequencing approaches, are increasingly becoming the standard for de novo transcriptome assembly and isoform quantification. In this chapter, we review the technical aspects of the current methods of RNA-Seq, both short and long-read approaches, and data analysis methods available. We discuss recent advances in single-cell RNA-Seq and direct RNA-Seq approaches, which perhaps will dominate the future of RNA-Seq.
Collapse
Affiliation(s)
- Anthony Bayega
- McGill University and Genome Quebec Innovation Centre, Department of Human Genetics, McGill University, Montréal, QC, Canada
| | | | - Spyros Oikonomopoulos
- McGill University and Genome Quebec Innovation Centre, Department of Human Genetics, McGill University, Montréal, QC, Canada
| | - Jiannis Ragoussis
- McGill University and Genome Quebec Innovation Centre, Department of Human Genetics, McGill University, Montréal, QC, Canada.
- Department of Bioengineering, McGill University, Montréal, QC, Canada.
- Cancer and Mutagen Unit, Department of Biochemistry, Center of Innovation in Personalized Medicine, King Fahd Center for Medical Research, King Abdulaziz University, Jeddah, Saudi Arabia.
| |
Collapse
|
237
|
Euskirchen P, Bielle F, Labreche K, Kloosterman WP, Rosenberg S, Daniau M, Schmitt C, Masliah-Planchon J, Bourdeaut F, Dehais C, Marie Y, Delattre JY, Idbaih A. Same-day genomic and epigenomic diagnosis of brain tumors using real-time nanopore sequencing. Acta Neuropathol 2017; 134:691-703. [PMID: 28638988 PMCID: PMC5645447 DOI: 10.1007/s00401-017-1743-5] [Citation(s) in RCA: 109] [Impact Index Per Article: 15.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2017] [Revised: 06/07/2017] [Accepted: 06/10/2017] [Indexed: 12/13/2022]
Abstract
Molecular classification of cancer has entered clinical routine to inform diagnosis, prognosis, and treatment decisions. At the same time, new tumor entities have been identified that cannot be defined histologically. For central nervous system tumors, the current World Health Organization classification explicitly demands molecular testing, e.g., for 1p/19q-codeletion or IDH mutations, to make an integrated histomolecular diagnosis. However, a plethora of sophisticated technologies is currently needed to assess different genomic and epigenomic alterations and turnaround times are in the range of weeks, which makes standardized and widespread implementation difficult and hinders timely decision making. Here, we explored the potential of a pocket-size nanopore sequencing device for multimodal and rapid molecular diagnostics of cancer. Low-pass whole genome sequencing was used to simultaneously generate copy number (CN) and methylation profiles from native tumor DNA in the same sequencing run. Single nucleotide variants in IDH1, IDH2, TP53, H3F3A, and the TERT promoter region were identified using deep amplicon sequencing. Nanopore sequencing yielded ~0.1X genome coverage within 6 h and resulting CN and epigenetic profiles correlated well with matched microarray data. Diagnostically relevant alterations, such as 1p/19q codeletion, and focal amplifications could be recapitulated. Using ad hoc random forests, we could perform supervised pan-cancer classification to distinguish gliomas, medulloblastomas, and brain metastases of different primary sites. Single nucleotide variants in IDH1, IDH2, and H3F3A were identified using deep amplicon sequencing within minutes of sequencing. Detection of TP53 and TERT promoter mutations shows that sequencing of entire genes and GC-rich regions is feasible. Nanopore sequencing allows same-day detection of structural variants, point mutations, and methylation profiling using a single device with negligible capital cost. It outperforms hybridization-based and current sequencing technologies with respect to time to diagnosis and required laboratory equipment and expertise, aiming to make precision medicine possible for every cancer patient, even in resource-restricted settings.
Collapse
Affiliation(s)
- Philipp Euskirchen
- Inserm U 1127, CNRS UMR 7225, Sorbonne Universités, UPMC Univ Paris 06 UMR S 1127, Institut du Cerveau et de la Moelle épinière (ICM), Paris, France.
- Department of Neurology, Charité-Universitätsmedizin Berlin, Berlin, Germany.
- Berlin Institute of Health (BIH), Berlin, Germany.
| | - Franck Bielle
- Inserm U 1127, CNRS UMR 7225, Sorbonne Universités, UPMC Univ Paris 06 UMR S 1127, Institut du Cerveau et de la Moelle épinière (ICM), Paris, France
- Service de Neuropathologie, AP-HP, Hôpitaux Universitaires La Pitié Salpêtrière-Charles Foix, Paris, France
- OncoNeuroTek, Paris, France
| | - Karim Labreche
- Inserm U 1127, CNRS UMR 7225, Sorbonne Universités, UPMC Univ Paris 06 UMR S 1127, Institut du Cerveau et de la Moelle épinière (ICM), Paris, France
- Division of Genetics and Epidemiology, The Institute of Cancer Research, Sutton, Surrey, SM2 5NG, UK
| | - Wigard P Kloosterman
- Division of Biomedical Genetics, Center for Molecular Medicine, Department of Genetics, University Medical Center Utrecht, Utrecht, The Netherlands
| | - Shai Rosenberg
- Inserm U 1127, CNRS UMR 7225, Sorbonne Universités, UPMC Univ Paris 06 UMR S 1127, Institut du Cerveau et de la Moelle épinière (ICM), Paris, France
| | - Mailys Daniau
- Inserm U 1127, CNRS UMR 7225, Sorbonne Universités, UPMC Univ Paris 06 UMR S 1127, Institut du Cerveau et de la Moelle épinière (ICM), Paris, France
| | - Charlotte Schmitt
- Inserm U 1127, CNRS UMR 7225, Sorbonne Universités, UPMC Univ Paris 06 UMR S 1127, Institut du Cerveau et de la Moelle épinière (ICM), Paris, France
| | | | - Franck Bourdeaut
- Laboratory of Translational Research in Pediatric Oncology, Institut Curie, PSL Research University, Paris, France
| | - Caroline Dehais
- Service de Neurologie, AP-HP, Hôpitaux Universitaires La Pitié Salpêtrière-Charles Foix, 2-Mazarin, Paris, France
| | - Yannick Marie
- Inserm U 1127, CNRS UMR 7225, Sorbonne Universités, UPMC Univ Paris 06 UMR S 1127, Institut du Cerveau et de la Moelle épinière (ICM), Paris, France
| | - Jean-Yves Delattre
- Inserm U 1127, CNRS UMR 7225, Sorbonne Universités, UPMC Univ Paris 06 UMR S 1127, Institut du Cerveau et de la Moelle épinière (ICM), Paris, France
- Service de Neurologie, AP-HP, Hôpitaux Universitaires La Pitié Salpêtrière-Charles Foix, 2-Mazarin, Paris, France
| | - Ahmed Idbaih
- Inserm U 1127, CNRS UMR 7225, Sorbonne Universités, UPMC Univ Paris 06 UMR S 1127, Institut du Cerveau et de la Moelle épinière (ICM), Paris, France.
- Service de Neurologie, AP-HP, Hôpitaux Universitaires La Pitié Salpêtrière-Charles Foix, 2-Mazarin, Paris, France.
| |
Collapse
|