Reference Citation Analysis

For: Xia LC, Bell JM, Wood-Bouwens C, Chen JJ, Zhang NR, Ji HP. Identification of large rearrangements in cancer genomes with barcode linked reads. Nucleic Acids Res 2019;46:e19. [PMID: 29186506 PMCID: PMC5829571 DOI: 10.1093/nar/gkx1193] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Received: 09/10/2017] [Accepted: 11/17/2017] [Indexed: 01/08/2023] [Open Access]

Cited by Other Article(s)

Höps W, Rausch T, Jendrusch M, Korbel JO, Sedlazeck FJ. Impact and characterization of serial structural variations across humans and great apes. Nat Commun 2024;15:8007. [PMID: 39266513 PMCID: PMC11393467 DOI: 10.1038/s41467-024-52027-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/05/2024] [Accepted: 08/23/2024] [Indexed: 09/14/2024] [Open Access]

Foltz SM, Li Y, Yao L, Terekhanova NV, Weerasinghe A, Gao Q, Dong G, Schindler M, Cao S, Sun H, Jayasinghe RG, Fulton RS, Fronick CC, King J, Kohnen DR, Fiala MA, Chen K, DiPersio JF, Vij R, Ding L. Somatic mutation phasing and haplotype extension using linked-reads in multiple myeloma. bioRxiv 2024:2024.08.09.607342. [PMID: 39149342 PMCID: PMC11326269 DOI: 10.1101/2024.08.09.607342] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 08/17/2024]

Affiliation(s)

Steven M. Foltz: Department of Medicine, Washington University in St. Louis, St. Louis, MO, 63110, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO, 63108, USA
Yize Li: Department of Medicine, Washington University in St. Louis, St. Louis, MO, 63110, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO, 63108, USA
Lijun Yao: Department of Medicine, Washington University in St. Louis, St. Louis, MO, 63110, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO, 63108, USA
Nadezhda V. Terekhanova: Department of Medicine, Washington University in St. Louis, St. Louis, MO, 63110, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO, 63108, USA
Amila Weerasinghe: Department of Medicine, Washington University in St. Louis, St. Louis, MO, 63110, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO, 63108, USA
Qingsong Gao: Department of Medicine, Washington University in St. Louis, St. Louis, MO, 63110, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO, 63108, USA
Guanlan Dong: Department of Medicine, Washington University in St. Louis, St. Louis, MO, 63110, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO, 63108, USA
Moses Schindler: Department of Medicine, Washington University in St. Louis, St. Louis, MO, 63110, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO, 63108, USA
Song Cao: Department of Medicine, Washington University in St. Louis, St. Louis, MO, 63110, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO, 63108, USA
Hua Sun: Department of Medicine, Washington University in St. Louis, St. Louis, MO, 63110, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO, 63108, USA
Reyka G. Jayasinghe: Department of Medicine, Washington University in St. Louis, St. Louis, MO, 63110, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO, 63108, USA
Robert S. Fulton: McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO, 63108, USA
Catrina C. Fronick: McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO, 63108, USA
Justin King: Department of Medicine, Washington University in St. Louis, St. Louis, MO, 63110, USA
Daniel R. Kohnen: Department of Medicine, Washington University in St. Louis, St. Louis, MO, 63110, USA
Mark A. Fiala: Department of Medicine, Washington University in St. Louis, St. Louis, MO, 63110, USA
Ken Chen: Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA
John F. DiPersio: Department of Medicine, Washington University in St. Louis, St. Louis, MO, 63110, USA; Siteman Cancer Center, Washington University in St. Louis, St. Louis, MO, 63110, USA
Ravi Vij: Department of Medicine, Washington University in St. Louis, St. Louis, MO, 63110, USA; Siteman Cancer Center, Washington University in St. Louis, St. Louis, MO, 63110, USA
Li Ding: Department of Medicine, Washington University in St. Louis, St. Louis, MO, 63110, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO, 63108, USA; Siteman Cancer Center, Washington University in St. Louis, St. Louis, MO, 63110, USA; Department of Genetics, Washington University in St. Louis, St. Louis, MO, 63110, USA

Tan KT, Slevin MK, Leibowitz ML, Garrity-Janger M, Shan J, Li H, Meyerson M. Neotelomeres and telomere-spanning chromosomal arm fusions in cancer genomes revealed by long-read sequencing. Cell Genomics 2024;4:100588. [PMID: 38917803 PMCID: PMC11293586 DOI: 10.1016/j.xgen.2024.100588] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/14/2022] [Revised: 11/09/2023] [Accepted: 05/30/2024] [Indexed: 06/27/2024]

Bai X, Duren Z, Wan L, Xia LC. Joint inference of clonal structure using single-cell genome and transcriptome sequencing data. NAR Genom Bioinform 2024;6:lqae017. [PMID: 38486887 PMCID: PMC10939367 DOI: 10.1093/nargab/lqae017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/21/2023] [Revised: 11/19/2023] [Accepted: 01/29/2024] [Indexed: 03/17/2024] [Open Access]

Yang C, Zhang Z, Huang Y, Xie X, Liao H, Xiao J, Veldsman WP, Yin K, Fang X, Zhang L. LRTK: a platform agnostic toolkit for linked-read analysis of both human genome and metagenome. Gigascience 2024;13:giae028. [PMID: 38869148 PMCID: PMC11170215 DOI: 10.1093/gigascience/giae028] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/14/2023] [Revised: 03/15/2024] [Accepted: 05/09/2024] [Indexed: 06/14/2024] [Open Access]

Abstract

BACKGROUND

Linked-read sequencing technologies generate short reads with high base quality that carry extrapolative information on long-range DNA connectedness. These advantages are well known and have been demonstrated in many human genomic and metagenomic studies. However, existing linked-read analysis pipelines (e.g., Long Ranger) were primarily developed to process sequencing data from the human genome and are not suited to analyzing metagenomic sequencing data. Moreover, linked-read analysis pipelines are typically limited to one specific sequencing platform.

FINDINGS

To address these limitations, we present the Linked-Read ToolKit (LRTK), a unified and versatile toolkit for platform-agnostic processing of linked-read sequencing data from both the human genome and metagenomes. LRTK provides functions to perform linked-read simulation, barcode sequencing error correction, barcode-aware read alignment and metagenome assembly, reconstruction of long DNA fragments, taxonomic classification and quantification, and barcode-assisted genomic variant calling and phasing. LRTK can process multiple samples automatically and gives users the option to generate reproducible reports during processing of raw sequencing data and at multiple checkpoints throughout downstream analysis. We applied LRTK to linked reads from simulation, mock community, and real datasets for both the human genome and metagenomes. We showcased LRTK's ability to generate comparative performance results from preceding benchmark studies and to report these results in publication-ready HTML document plots.

CONCLUSIONS

LRTK provides comprehensive and flexible modules along with an easy-to-use Python-based workflow for processing linked-read sequencing datasets, thereby filling the current gap in the field caused by platform-centric genome-specific linked-read data analysis tools.

Tan KT, Slevin MK, Leibowitz ML, Garrity-Janger M, Li H, Meyerson M. Neotelomeres and Telomere-Spanning Chromosomal Arm Fusions in Cancer Genomes Revealed by Long-Read Sequencing. bioRxiv 2023:2023.11.30.569101. [PMID: 38077026 PMCID: PMC10705422 DOI: 10.1101/2023.11.30.569101] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Indexed: 12/23/2023]

Qu J, Li S, Yu D. Detection of complex chromosome rearrangements using optical genome mapping. Gene 2023;884:147688. [PMID: 37543218 DOI: 10.1016/j.gene.2023.147688] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/20/2023] [Revised: 07/15/2023] [Accepted: 08/02/2023] [Indexed: 08/07/2023]

Abstract

Chromosomal structural variations (SVs) are a main cause of human genetic disease. Karyotyping, chromosomal microarray analysis (CMA), and fluorescence in situ hybridization (FISH) form the backbone of current routine diagnostics (CRD). These methods have their own limitations: CRD cannot identify cryptic balanced SVs and complex SVs even when the techniques are performed simultaneously or sequentially. Optical genome mapping (OGM) is a novel technology that can identify several classes of SVs with higher resolution, but studies on the applicability of OGM to difficult and complicated chromosomal SVs, and on its comparison with CRD, are lacking. Herein, seven patients with definite complicated SVs involving at least two breakpoints (BPs) were recruited for this study. The BPs and SVs identified by OGM were compared with those from CRD. OGM detected all BPs in five samples and some of the BPs in two samples. The undetected BPs were all close to the repeat-rich gap region. OGM also detected additional SVs, including a cryptic balanced translocation and two additional complex chromosomal rearrangements (CCRs). OGM yielded additional information, such as the orientation of acentric fragments, BP positions, and genes mapped in the BP region, for all cases. The accuracy of the additional SVs and BPs detected by OGM was verified by a FISH panel, next-generation sequencing, and Sanger sequencing. Taken together, OGM exhibits better performance than CRD in detecting chromosomal SVs. We suggest that OGM be used in clinical examination to improve the efficiency and accuracy of genetic disease diagnosis, supplemented by FISH or karyotyping to compensate for SVs in the repeat-rich gap region if necessary.

Laufer VA, Glover TW, Wilson TE. Applications of advanced technologies for detecting genomic structural variation. Mutat Res Rev Mutat Res 2023;792:108475. [PMID: 37931775 PMCID: PMC10792551 DOI: 10.1016/j.mrrev.2023.108475] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/26/2023] [Revised: 09/07/2023] [Accepted: 11/02/2023] [Indexed: 11/08/2023]

Weisweiler M, Stich B. Benchmarking of structural variant detection in the tetraploid potato genome using linked-read sequencing. Genomics 2023;115:110568. [PMID: 36702293 DOI: 10.1016/j.ygeno.2023.110568] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/28/2022] [Revised: 01/12/2023] [Accepted: 01/18/2023] [Indexed: 01/25/2023]

Muñoz-Barrera A, Rubio-Rodríguez LA, Díaz-de Usera A, Jáspez D, Lorenzo-Salazar JM, González-Montelongo R, García-Olivares V, Flores C. From Samples to Germline and Somatic Sequence Variation: A Focus on Next-Generation Sequencing in Melanoma Research. Life (Basel) 2022;12:1939. [PMID: 36431075 PMCID: PMC9695713 DOI: 10.3390/life12111939] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/28/2022] [Revised: 11/12/2022] [Accepted: 11/16/2022] [Indexed: 11/24/2022] [Open Access]

Linked-read whole-genome sequencing resolves common and private structural variants in multiple myeloma. Blood Adv 2022;6:5009-5023. [PMID: 35675515 PMCID: PMC9631623 DOI: 10.1182/bloodadvances.2021006720] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/29/2021] [Accepted: 05/31/2022] [Indexed: 01/18/2023] [Open Access]

Abstract

Linked-read WGS can be performed without DNA purification and allows for resolution of the diverse structural variants found in MM.

Linked-read WGS can, as a standalone assay, provide comprehensive genetics in myeloma and other diseases with complex genomes.

Multiple myeloma (MM) is an incurable and aggressive plasma cell malignancy characterized by a complex karyotype with multiple structural variants (SVs) and copy-number variations (CNVs). Linked-read whole-genome sequencing (lrWGS) allows for refined detection and reconstruction of SVs by providing long-range genetic information from standard short-read sequencing. This makes lrWGS an attractive solution for capturing the full genomic complexity of MM. Here we show that high-quality lrWGS data can be generated from low numbers of cells subjected to fluorescence-activated cell sorting (FACS) without DNA purification. Using this protocol, we analyzed MM cells after FACS from 37 patients with MM using lrWGS. We found high concordance between lrWGS and fluorescence in situ hybridization (FISH) for the detection of recurrent translocations and CNVs. Outside of the regions investigated by FISH, we identified >150 additional SVs and CNVs across the cohort. Analysis of the lrWGS data allowed for resolution of the structure of diverse SVs affecting the MYC and t(11;14) loci, causing the duplication of genes and gene regulatory elements. In addition, we identified private SVs causing the dysregulation of genes recurrently involved in translocations with the IGH locus and show that these can alter the molecular classification of MM. Overall, we conclude that lrWGS allows for the detection of aberrations critical for MM prognostics and provides a feasible route for providing comprehensive genetics. Implementing lrWGS could provide more accurate clinical prognostics, facilitate genomic medicine initiatives, and greatly improve the stratification of patients included in clinical trials.

Guo J, Shi C, Chen X, Wang O, Liu P, Yang H, Xu X, Zhang W, Zhu H. stLFRsv: A Germline Structural Variant Analysis Pipeline Using Co-barcoded Reads. Front Genet 2021;12:636239. [PMID: 33815469 PMCID: PMC8012683 DOI: 10.3389/fgene.2021.636239] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 12/01/2020] [Accepted: 02/04/2021] [Indexed: 11/13/2022] [Open Access]

Noninvasive prenatal test of single-gene disorders by linked-read direct haplotyping: application in various diseases. Eur J Hum Genet 2020;29:463-470. [PMID: 33235377 DOI: 10.1038/s41431-020-00759-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Received: 02/04/2020] [Revised: 08/26/2020] [Accepted: 10/20/2020] [Indexed: 11/08/2022] [Open Access]

Integrative analysis of structural variations using short-reads and linked-reads yields highly specific and sensitive predictions. PLoS Comput Biol 2020;16:e1008397. [PMID: 33226985 PMCID: PMC7721175 DOI: 10.1371/journal.pcbi.1008397] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Received: 01/10/2020] [Revised: 12/07/2020] [Accepted: 09/24/2020] [Indexed: 11/19/2022] [Open Access]

Abstract

Genetic diseases are driven by aberrations of the human genome. Identifying such aberrations, including structural variations (SVs), is key to our understanding. Conventional short-read whole-genome sequencing (cWGS) can identify SVs at base-pair resolution, but it uses only short-range information and suffers from a high false discovery rate (FDR). Linked-read sequencing (10XWGS) uses long-range information by linking short reads that originate from the same large DNA molecule. This can mitigate alignment-based artefacts, especially in repetitive regions, and should enable better prediction of SVs. However, an unbiased evaluation of this technology is not available. In this study, we performed a comprehensive analysis of different types and sizes of SVs predicted by both technologies and validated with an independent PCR-based approach. The SVs identified by both technologies were highly specific, while the validation rate dropped for events not shared between them. A particularly high FDR was observed for SVs found only by 10XWGS. To improve FDR and sensitivity, statistical models were trained for both technologies. Using our approach, we characterized SVs from the MCF7 cell line and a primary breast cancer tumor with high precision. This approach improves SV prediction and can therefore help in understanding the underlying genetics of various diseases.

Cancer and many other diseases are often driven by structural rearrangements in patients' genomes. Their precise identification is necessary to understand how the disease evolves and how it might be cured. In this study, we compared two sequencing technologies for the identification of structural variations, i.e., Illumina's short-read and 10X Genomics linked-read sequencing. Short-read sequencing is already known to have a high false discovery rate for structural variations, while an unbiased performance evaluation of linked-read sequencing is missing. Hence, we evaluate the performance of these two technologies using computational and PCR-based methodologies. Moreover, we present a statistical approach to increase their performance, supporting better detection of structural variations and thus further research into disease biology.

Gallant J, Mouton J, Ummels R, Ten Hagen-Jongman C, Kriel N, Pain A, Warren RM, Bitter W, Heunis T, Sampson SL. Identification of gene fusion events in Mycobacterium tuberculosis that encode chimeric proteins. NAR Genom Bioinform 2020;2:lqaa033. [PMID: 33575588 PMCID: PMC7671302 DOI: 10.1093/nargab/lqaa033] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 02/06/2020] [Revised: 04/16/2020] [Accepted: 05/05/2020] [Indexed: 02/07/2023] [Open Access]

Affiliation(s)

James Gallant: DST/NRF Centre of Excellence for Biomedical Tuberculosis Research, South African Medical Research Council Centre for Tuberculosis Research, Division of Molecular Biology and Human Genetics, Department of Biomedical Science, Faculty of Medicine and Health Science, Stellenbosch University, Tygerberg, Cape Town 7505, South Africa; Section of Molecular Microbiology, Amsterdam Institute for Molecules, Medicines and Systems, Vrije Universiteit Amsterdam, 1081 HZ Amsterdam, The Netherlands
Jomien Mouton: DST/NRF Centre of Excellence for Biomedical Tuberculosis Research, South African Medical Research Council Centre for Tuberculosis Research, Division of Molecular Biology and Human Genetics, Department of Biomedical Science, Faculty of Medicine and Health Science, Stellenbosch University, Tygerberg, Cape Town 7505, South Africa
Roy Ummels: Medical Microbiology and Infection Control, Vrije Universiteit Amsterdam, Amsterdam UMC, 1081 HZ Amsterdam, The Netherlands
Corinne Ten Hagen-Jongman: Section of Molecular Microbiology, Amsterdam Institute for Molecules, Medicines and Systems, Vrije Universiteit Amsterdam, 1081 HZ Amsterdam, The Netherlands
Nastassja Kriel: DST/NRF Centre of Excellence for Biomedical Tuberculosis Research, South African Medical Research Council Centre for Tuberculosis Research, Division of Molecular Biology and Human Genetics, Department of Biomedical Science, Faculty of Medicine and Health Science, Stellenbosch University, Tygerberg, Cape Town 7505, South Africa
Arnab Pain: Biological and Environmental Sciences and Engineering (BESE) Division, King Abdullah University of Science and Technology, Thuwal 23955-6900, Kingdom of Saudi Arabia; Global Station for Zoonosis Control, GI-CoRE, Hokkaido University, 001-0020, N20 W10 Kita-ku, Sapporo, Japan
Robin M Warren: DST/NRF Centre of Excellence for Biomedical Tuberculosis Research, South African Medical Research Council Centre for Tuberculosis Research, Division of Molecular Biology and Human Genetics, Department of Biomedical Science, Faculty of Medicine and Health Science, Stellenbosch University, Tygerberg, Cape Town 7505, South Africa
Wilbert Bitter: Section of Molecular Microbiology, Amsterdam Institute for Molecules, Medicines and Systems, Vrije Universiteit Amsterdam, 1081 HZ Amsterdam, The Netherlands; Medical Microbiology and Infection Control, Vrije Universiteit Amsterdam, Amsterdam UMC, 1081 HZ Amsterdam, The Netherlands
Tiaan Heunis: DST/NRF Centre of Excellence for Biomedical Tuberculosis Research, South African Medical Research Council Centre for Tuberculosis Research, Division of Molecular Biology and Human Genetics, Department of Biomedical Science, Faculty of Medicine and Health Science, Stellenbosch University, Tygerberg, Cape Town 7505, South Africa; Biosciences Institute, Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne NE2 4HH, UK
Samantha L Sampson: DST/NRF Centre of Excellence for Biomedical Tuberculosis Research, South African Medical Research Council Centre for Tuberculosis Research, Division of Molecular Biology and Human Genetics, Department of Biomedical Science, Faculty of Medicine and Health Science, Stellenbosch University, Tygerberg, Cape Town 7505, South Africa

Karaoğlanoğlu F, Ricketts C, Ebren E, Rasekh ME, Hajirasouliha I, Alkan C. VALOR2: characterization of large-scale structural variants using linked-reads. Genome Biol 2020;21:72. [PMID: 32192518 PMCID: PMC7083023 DOI: 10.1186/s13059-020-01975-8] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Received: 12/17/2019] [Accepted: 02/24/2020] [Indexed: 12/31/2022] [Open Access]

Ho SS, Urban AE, Mills RE. Structural variation in the sequencing era. Nat Rev Genet 2020;21:171-189. [PMID: 31729472 PMCID: PMC7402362 DOI: 10.1038/s41576-019-0180-9] [Citation(s) in RCA: 280] [Impact Index Per Article: 70.0] [Accepted: 09/26/2019] [Indexed: 12/13/2022]

Zhang Y, Kang Z, Lv D, Zhang X, Liao Y, Li Y, Liu R, Li P, Tong M, Tian J, Shao Y, Huang C, Ge D, Zhang J, Bai W, Wang Y, Liu Q, Li Z, Yan J. Longitudinal whole-genome sequencing reveals the evolution of MPAL. Cancer Genet 2020;240:59-65. [DOI: 10.1016/j.cancergen.2019.11.007] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Received: 06/06/2019] [Revised: 10/21/2019] [Accepted: 11/21/2019] [Indexed: 12/30/2022]

Shin G, Greer SU, Xia LC, Lee H, Zhou J, Boles TC, Ji HP. Targeted short read sequencing and assembly of re-arrangements and candidate gene loci provide megabase diplotypes. Nucleic Acids Res 2019;47:e115. [PMID: 31350896 PMCID: PMC6821272 DOI: 10.1093/nar/gkz661] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Received: 12/07/2018] [Revised: 07/02/2019] [Accepted: 07/18/2019] [Indexed: 11/12/2022] [Open Access]

Wellenreuther M, Mérot C, Berdan E, Bernatchez L. Going beyond SNPs: The role of structural genomic variants in adaptive evolution and species diversification. Mol Ecol 2019;28:1203-1209. [PMID: 30834648 DOI: 10.1111/mec.15066] [Citation(s) in RCA: 120] [Impact Index Per Article: 24.0] [Received: 02/18/2019] [Accepted: 02/28/2019] [Indexed: 12/17/2022]

Nicolussi A, Belardinilli F, Silvestri V, Mahdavian Y, Valentini V, D'Inzeo S, Petroni M, Zani M, Ferraro S, Di Giulio S, Fabretti F, Fratini B, Gradilone A, Ottini L, Giannini G, Coppa A, Capalbo C. Identification of novel BRCA1 large genomic rearrangements by a computational algorithm of amplicon-based Next-Generation Sequencing data. PeerJ 2019;7:e7972. [PMID: 31741787 PMCID: PMC6859874 DOI: 10.7717/peerj.7972] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Received: 08/29/2019] [Accepted: 10/01/2019] [Indexed: 12/30/2022] [Open Access]

Abstract

Background

Genetic testing for BRCA1/2 germline mutations in hereditary breast/ovarian cancer patients requires screening for single nucleotide variants, small insertions/deletions and large genomic rearrangements (LGRs). These studies have long been run by Sanger sequencing and multiplex ligation-dependent probe amplification (MLPA). The recent introduction of next-generation sequencing (NGS) platforms dramatically improved the speed and efficiency of DNA testing for nucleotide variants, while the possibility of correctly detecting LGRs by this means is still debated. The purpose of this study was to establish whether, and to what extent, the development of an analytical algorithm could help us translate NGS sequencing via an Ion Torrent PGM platform into a tool suitable for identifying LGRs in hereditary breast-ovarian cancer patients.

Methods

We first used NGS data of a group of three patients (training set), previously screened in our laboratory by conventional methods, to develop an algorithm for the calculation of the dosage quotient (DQ) to be compared with the Ion Reporter (IR) analysis. Then, we tested the optimized pipeline with a consecutive cohort of 85 uncharacterized probands (validation set) also subjected to MLPA analysis. Characterization of the breakpoints of three novel BRCA1 LGRs was obtained via long-range PCR and direct sequencing of the DNA products.

Results

In our cohort, the newly defined DQ-based algorithm detected 3/3 BRCA1 LGRs, demonstrating 100% sensitivity and 100% negative predictive value (NPV) (95% CI [87.6–99.9]) compared to 2/3 cases detected by IR (66.7% sensitivity and 98.2% NPV (95% CI [85.6–99.9])). Interestingly, DQ and IR shared 12 positive results, but exon deletion calls matched in only five cases, two of which were confirmed by MLPA. The breakpoints of the 3 novel BRCA1 deletions, involving exons 16–17, 21–22 and 20, have been characterized.

Conclusions

Our study defined a DQ-based algorithm to identify BRCA1 LGRs using NGS data. If confirmed on larger data sets, this tool could guide the selection of samples to be subjected to MLPA analysis, leading to significant savings in time and money.
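
The sensitivity and NPV figures quoted in the Results above follow from simple ratio arithmetic. The sketch below is illustrative only and is not the authors' code: the function names are hypothetical, and the true-negative count of 55 used for the Ion Reporter (IR) NPV is an assumption, chosen because it is consistent with the reported 98.2%; the abstract does not state that count.

```python
def sensitivity(true_pos: int, false_neg: int) -> float:
    """Fraction of real LGR carriers that the method flags."""
    return true_pos / (true_pos + false_neg)


def npv(true_neg: int, false_neg: int) -> float:
    """Negative predictive value: fraction of negative calls that are truly negative."""
    return true_neg / (true_neg + false_neg)


# From the abstract: DQ detected 3 of 3 LGRs, IR detected 2 of 3.
dq_sens = sensitivity(true_pos=3, false_neg=0)
ir_sens = sensitivity(true_pos=2, false_neg=1)

# ASSUMPTION: 55 true negatives for IR (hypothetical; not stated in the abstract).
ir_npv = npv(true_neg=55, false_neg=1)

print(f"DQ sensitivity: {dq_sens:.1%}")
print(f"IR sensitivity: {ir_sens:.1%}")
print(f"IR NPV (assumed counts): {ir_npv:.1%}")
```

With no false negatives, the NPV is 100% regardless of the number of true negatives, which matches the value reported for the DQ-based algorithm.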

Iwata S, Nakadai H, Fukushi D, Jose M, Nagahara M, Iwamoto T. Simple and large-scale chromosomal engineering of mouse zygotes via in vitro and in vivo electroporation. Sci Rep 2019;9:14713. [PMID: 31604975 PMCID: PMC6789149 DOI: 10.1038/s41598-019-50900-y] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Received: 02/04/2019] [Accepted: 09/19/2019] [Indexed: 01/25/2023] [Open Access]

Darby CA, Fitch JR, Brennan PJ, Kelly BJ, Bir N, Magrini V, Leonard J, Cottrell CE, Gastier-Foster JM, Wilson RK, Mardis ER, White P, Langmead B, Schatz MC. Samovar: Single-Sample Mosaic Single-Nucleotide Variant Calling with Linked Reads. iScience 2019;18:1-10. [PMID: 31271967 PMCID: PMC6609817 DOI: 10.1016/j.isci.2019.05.037] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Received: 04/09/2019] [Revised: 05/06/2019] [Accepted: 05/24/2019] [Indexed: 12/25/2022] [Open Access]

Affiliation(s)

Charlotte A Darby: Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA
James R Fitch: The Institute for Genomic Medicine, Nationwide Children's Hospital, Columbus, OH, USA
Patrick J Brennan: The Institute for Genomic Medicine, Nationwide Children's Hospital, Columbus, OH, USA
Benjamin J Kelly: The Institute for Genomic Medicine, Nationwide Children's Hospital, Columbus, OH, USA
Natalie Bir: The Institute for Genomic Medicine, Nationwide Children's Hospital, Columbus, OH, USA
Vincent Magrini: The Institute for Genomic Medicine, Nationwide Children's Hospital, Columbus, OH, USA; Department of Pediatrics, The Ohio State University College of Medicine, Columbus, OH, USA
Jeffrey Leonard: Department of Pediatrics, The Ohio State University College of Medicine, Columbus, OH, USA; Department of Neurosurgery, Nationwide Children's Hospital, Columbus, OH, USA
Catherine E Cottrell: The Institute for Genomic Medicine, Nationwide Children's Hospital, Columbus, OH, USA; Department of Pediatrics, The Ohio State University College of Medicine, Columbus, OH, USA
Julie M Gastier-Foster: The Institute for Genomic Medicine, Nationwide Children's Hospital, Columbus, OH, USA; Department of Pediatrics, The Ohio State University College of Medicine, Columbus, OH, USA
Richard K Wilson: The Institute for Genomic Medicine, Nationwide Children's Hospital, Columbus, OH, USA; Department of Pediatrics, The Ohio State University College of Medicine, Columbus, OH, USA
Elaine R Mardis: The Institute for Genomic Medicine, Nationwide Children's Hospital, Columbus, OH, USA; Department of Pediatrics, The Ohio State University College of Medicine, Columbus, OH, USA
Peter White: The Institute for Genomic Medicine, Nationwide Children's Hospital, Columbus, OH, USA; Department of Pediatrics, The Ohio State University College of Medicine, Columbus, OH, USA
Ben Langmead: Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA
Michael C Schatz: Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA; Department of Biology, Johns Hopkins University, Baltimore, MD, USA; Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA


Levy SE, Boone BE. Next-Generation Sequencing Strategies. Cold Spring Harb Perspect Med 2019;9:cshperspect.a025791. [PMID: 30323017 DOI: 10.1101/cshperspect.a025791] [Citation(s) in RCA: 42] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]

Marks P, Garcia S, Barrio AM, Belhocine K, Bernate J, Bharadwaj R, Bjornson K, Catalanotti C, Delaney J, Fehr A, Fiddes IT, Galvin B, Heaton H, Herschleb J, Hindson C, Holt E, Jabara CB, Jett S, Keivanfar N, Kyriazopoulou-Panagiotopoulou S, Lek M, Lin B, Lowe A, Mahamdallie S, Maheshwari S, Makarewicz T, Marshall J, Meschi F, O'Keefe CJ, Ordonez H, Patel P, Price A, Royall A, Ruark E, Seal S, Schnall-Levin M, Shah P, Stafford D, Williams S, Wu I, Xu AW, Rahman N, MacArthur D, Church DM. Resolving the full spectrum of human genome variation using Linked-Reads. Genome Res 2019;29:635-645. [PMID: 30894395 PMCID: PMC6442396 DOI: 10.1101/gr.234443.118] [Citation(s) in RCA: 134] [Impact Index Per Article: 26.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2018] [Accepted: 02/21/2019] [Indexed: 02/07/2023]

Affiliation(s)

Patrick Marks 10x Genomics, Pleasanton, California 94566, USA
Sarah Garcia 10x Genomics, Pleasanton, California 94566, USA
Alvaro Martinez Barrio 10x Genomics, Pleasanton, California 94566, USA
Kamila Belhocine 10x Genomics, Pleasanton, California 94566, USA
Jorge Bernate 10x Genomics, Pleasanton, California 94566, USA
Rajiv Bharadwaj 10x Genomics, Pleasanton, California 94566, USA
Keith Bjornson 10x Genomics, Pleasanton, California 94566, USA
Claudia Catalanotti 10x Genomics, Pleasanton, California 94566, USA
Josh Delaney 10x Genomics, Pleasanton, California 94566, USA
Adrian Fehr 10x Genomics, Pleasanton, California 94566, USA
Ian T Fiddes 10x Genomics, Pleasanton, California 94566, USA
Brendan Galvin 10x Genomics, Pleasanton, California 94566, USA
Haynes Heaton 10x Genomics, Pleasanton, California 94566, USA
Jill Herschleb 10x Genomics, Pleasanton, California 94566, USA
Christopher Hindson 10x Genomics, Pleasanton, California 94566, USA
Esty Holt The Institute of Cancer Research, Division of Genetics and Epidemiology, London SM2 5NG, United Kingdom
Cassandra B Jabara 10x Genomics, Pleasanton, California 94566, USA
Susanna Jett 10x Genomics, Pleasanton, California 94566, USA
Nikka Keivanfar 10x Genomics, Pleasanton, California 94566, USA
Sofia Kyriazopoulou-Panagiotopoulou 10x Genomics, Pleasanton, California 94566, USA
Monkol Lek Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, Massachusetts 02114, USA; Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
Bill Lin 10x Genomics, Pleasanton, California 94566, USA
Adam Lowe 10x Genomics, Pleasanton, California 94566, USA
Shazia Mahamdallie The Institute of Cancer Research, Division of Genetics and Epidemiology, London SM2 5NG, United Kingdom
Shamoni Maheshwari 10x Genomics, Pleasanton, California 94566, USA
Tony Makarewicz 10x Genomics, Pleasanton, California 94566, USA
Jamie Marshall Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
Francesca Meschi 10x Genomics, Pleasanton, California 94566, USA
Christopher J O'Keefe 10x Genomics, Pleasanton, California 94566, USA
Heather Ordonez 10x Genomics, Pleasanton, California 94566, USA
Pranav Patel 10x Genomics, Pleasanton, California 94566, USA
Andrew Price 10x Genomics, Pleasanton, California 94566, USA
Ariel Royall 10x Genomics, Pleasanton, California 94566, USA
Elise Ruark The Institute of Cancer Research, Division of Genetics and Epidemiology, London SM2 5NG, United Kingdom
Sheila Seal The Institute of Cancer Research, Division of Genetics and Epidemiology, London SM2 5NG, United Kingdom
Michael Schnall-Levin 10x Genomics, Pleasanton, California 94566, USA
Preyas Shah 10x Genomics, Pleasanton, California 94566, USA
David Stafford 10x Genomics, Pleasanton, California 94566, USA
Stephen Williams 10x Genomics, Pleasanton, California 94566, USA
Indira Wu 10x Genomics, Pleasanton, California 94566, USA
Andrew Wei Xu 10x Genomics, Pleasanton, California 94566, USA
Nazneen Rahman The Institute of Cancer Research, Division of Genetics and Epidemiology, London SM2 5NG, United Kingdom
Daniel MacArthur Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, Massachusetts 02114, USA; Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
Deanna M Church 10x Genomics, Pleasanton, California 94566, USA


Ma ZS, Li L, Ye C, Peng M, Zhang YP. Hybrid assembly of ultra-long Nanopore reads augmented with 10x-Genomics contigs: Demonstrated with a human genome. Genomics 2018;111:1896-1901. [PMID: 30594583 DOI: 10.1016/j.ygeno.2018.12.013] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2018] [Revised: 11/17/2018] [Accepted: 12/24/2018] [Indexed: 10/27/2022]

Affiliation(s)

Zhanshan Sam Ma Computational Biology and Medical Ecology Lab, State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China; Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming 650223, China; Kunming College of Life Science, Chinese Academy of Sciences, Kunming, 650223, China.
Lianwei Li Computational Biology and Medical Ecology Lab, State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China; Kunming College of Life Science, Chinese Academy of Sciences, Kunming, 650223, China
Chengxi Ye Computational Biology and Medical Ecology Lab, State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China; Department of Computer Science, University of Maryland, College Park, MD, USA
Minsheng Peng Molecular Evolution and Genome Diversity Lab, State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China; Kunming College of Life Science, Chinese Academy of Sciences, Kunming, 650223, China; KIZ/CUHK Joint Laboratory of Bio-resources and Molecular Research in Common Diseases, Kunming 650223, China
Ya-Ping Zhang Molecular Evolution and Genome Diversity Lab, State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China; Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming 650223, China; Kunming College of Life Science, Chinese Academy of Sciences, Kunming, 650223, China; KIZ/CUHK Joint Laboratory of Bio-resources and Molecular Research in Common Diseases, Kunming 650223, China.


Xia LC, Ai D, Lee H, Andor N, Li C, Zhang NR, Ji HP. SVEngine: an efficient and versatile simulator of genome structural variations with features of cancer clonal evolution. Gigascience 2018;7:5049476. [PMID: 29982625 PMCID: PMC6057526 DOI: 10.1093/gigascience/giy081] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2018] [Revised: 05/22/2018] [Accepted: 06/26/2018] [Indexed: 11/29/2022] Open

Abstract

Background

Simulating genome sequence data with variant features facilitates the development and benchmarking of structural variant analysis programs. However, only a few data simulators provide structural variants in silico, and even fewer provide variants with different allelic fractions and haplotypes.

Findings

We developed SVEngine, an open-source tool to address this need. SVEngine simulates next-generation sequencing data with embedded structural variations. As input, SVEngine takes template haploid sequences (FASTA) together with an external variant file, a variant distribution file, and/or a clonal phylogeny tree file (NEWICK). It then simulates and outputs sequence contigs (FASTAs), sequence reads (FASTQs), and/or post-alignment files (BAMs). All of these files contain the desired variants and are accompanied by BED files containing the ground truth. SVEngine's flexible design enables one to specify size, position, and allelic fraction for deletions, insertions, duplications, inversions, and translocations. Finally, SVEngine simulates sequence data that replicate the characteristics of a sequencing library with mixed sizes of DNA insert molecules. SVEngine is highly parallelized to reduce simulation time.
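The idea of embedding a variant into a template sequence at a chosen allelic fraction, with a BED record as ground truth, can be illustrated with a minimal Python sketch. This is not SVEngine's code or API: the function names (`apply_deletion`, `bed_line`, `simulate_copies`), the use of a fixed number of template copies, and the `DEL;AF=` naming in the BED record are all assumptions made for illustration only.

```python
# Hypothetical illustration only; none of these names come from SVEngine.

def apply_deletion(template: str, start: int, end: int) -> str:
    """Return the template with the 0-based, half-open interval [start, end) removed."""
    if not 0 <= start < end <= len(template):
        raise ValueError("deletion interval lies outside the template")
    return template[:start] + template[end:]


def bed_line(chrom: str, start: int, end: int, name: str) -> str:
    """Format one tab-separated BED record (0-based start, exclusive end)."""
    return f"{chrom}\t{start}\t{end}\t{name}"


def simulate_copies(template: str, chrom: str, start: int, end: int,
                    allelic_fraction: float, n_copies: int):
    """Build n_copies sequences, a fraction of which carry the deletion.

    Returns the list of sequences and a BED ground-truth record that
    reports the allelic fraction actually realised after rounding.
    """
    if n_copies <= 0:
        raise ValueError("n_copies must be positive")
    if not 0.0 <= allelic_fraction <= 1.0:
        raise ValueError("allelic_fraction must be between 0 and 1")
    n_variant = round(allelic_fraction * n_copies)
    variant = apply_deletion(template, start, end)
    copies = [variant] * n_variant + [template] * (n_copies - n_variant)
    truth = bed_line(chrom, start, end, f"DEL;AF={n_variant / n_copies:.2f}")
    return copies, truth


if __name__ == "__main__":
    seqs, truth = simulate_copies("ACGTACGTAC", "chr1", 2, 6, 0.5, 4)
    print(seqs)
    print(truth)
```

A real simulator would go on to sample reads from these sequences; the sketch stops at the sequence level to keep the relationship between variant, allelic fraction, and ground-truth record visible.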

Conclusions

We demonstrated the versatile features of SVEngine and its improved runtime compared with other available simulators. SVEngine's features include the simulation of locus-specific variant frequency designed to mimic the phylogeny of cancer clonal evolution. We validated SVEngine's accuracy by simulating genome-wide structural variants of NA12878 and a heterogeneous cancer genome. Our evaluation included checking various sequence mapping features such as coverage change, read clipping, insert size shift, and neighboring hanging read pairs for representative variant types. Structural variant callers Lumpy and Manta and tumor heterogeneity estimator THetA2 were able to perform realistically on the simulated data. SVEngine is implemented as a standard Python package and is freely available for academic use.
