Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Sedlazeck FJ, Lee H, Darby CA, Schatz MC. Piercing the dark matter: bioinformatics of long-range sequencing and mapping. Nat Rev Genet 2018;19:329-346. [PMID: 29599501 DOI: 10.1038/s41576-018-0003-4] [Citation(s) in RCA: 291] [Impact Index Per Article: 58.2] [Indexed: 01/11/2023]

Number

Cited by Other Article(s)

101

Nicholas TJ, Al‐Sweel N, Farrell A, Mao R, Bayrak‐Toydemir P, Miller CE, Bentley D, Palmquist R, Moore B, Hernandez EJ, Cormier MJ, Fredrickson E, Noble K, Rynearson S, Holt C, Karren M, Bonkowsky JL, Tristani‐Firouzi M, Yandell M, Marth G, Quinlan AR, Brunelli L, Toydemir R, Shayota BJ, Carey JC, Boyden SE, Malone Jenkins S. Comprehensive variant calling from whole-genome sequencing identifies a complex inversion that disrupts ZFPM2 in familial congenital diaphragmatic hernia. Mol Genet Genomic Med 2022;10:e1888. [PMID: 35119225 PMCID: PMC9000945 DOI: 10.1002/mgg3.1888] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 11/05/2021] [Revised: 01/14/2022] [Accepted: 01/18/2022] [Indexed: 01/03/2023] [Open Access]

Affiliation(s)

Thomas J. Nicholas Department of Human Genetics, Utah Center for Genetic Discovery, University of Utah, Salt Lake City, USA
Najla Al‐Sweel ARUP Laboratories, Salt Lake City, USA; Department of Pathology, University of Utah, Salt Lake City, USA
Andrew Farrell Department of Human Genetics, Utah Center for Genetic Discovery, University of Utah, Salt Lake City, USA
Rong Mao ARUP Laboratories, Salt Lake City, USA; Department of Pathology, University of Utah, Salt Lake City, USA
Pinar Bayrak‐Toydemir ARUP Laboratories, Salt Lake City, USA; Department of Pathology, University of Utah, Salt Lake City, USA
Christine E. Miller ARUP Laboratories, Salt Lake City, USA
Dawn Bentley Division of Neonatology, Department of Pediatrics, University of Utah School of Medicine, Salt Lake City, USA
Rachel Palmquist Division of Pediatric Neurology, Department of Pediatrics, University of Utah School of Medicine, Salt Lake City, USA; Primary Children's Center for Personalized Medicine, Salt Lake City, USA
Barry Moore Department of Human Genetics, Utah Center for Genetic Discovery, University of Utah, Salt Lake City, USA
Edgar J. Hernandez Department of Human Genetics, Utah Center for Genetic Discovery, University of Utah, Salt Lake City, USA
Michael J. Cormier Department of Human Genetics, Utah Center for Genetic Discovery, University of Utah, Salt Lake City, USA
Eric Fredrickson ARUP Laboratories, Salt Lake City, USA
Katherine Noble ARUP Laboratories, Salt Lake City, USA
Shawn Rynearson Department of Human Genetics, Utah Center for Genetic Discovery, University of Utah, Salt Lake City, USA
Carson Holt Department of Human Genetics, Utah Center for Genetic Discovery, University of Utah, Salt Lake City, USA
Mary Anne Karren Department of Human Genetics, Utah Center for Genetic Discovery, University of Utah, Salt Lake City, USA
Joshua L. Bonkowsky Division of Pediatric Neurology, Department of Pediatrics, University of Utah School of Medicine, Salt Lake City, USA; Primary Children's Center for Personalized Medicine, Salt Lake City, USA
Martin Tristani‐Firouzi Division of Pediatric Cardiology, Department of Pediatrics, University of Utah School of Medicine, Salt Lake City, USA
Mark Yandell Department of Human Genetics, Utah Center for Genetic Discovery, University of Utah, Salt Lake City, USA
Gabor Marth Department of Human Genetics, Utah Center for Genetic Discovery, University of Utah, Salt Lake City, USA
Aaron R. Quinlan Department of Human Genetics, Utah Center for Genetic Discovery, University of Utah, Salt Lake City, USA; Department of Biomedical Informatics, University of Utah, Salt Lake City, USA
Luca Brunelli Division of Neonatology, Department of Pediatrics, University of Utah School of Medicine, Salt Lake City, USA
Reha M. Toydemir ARUP Laboratories, Salt Lake City, USA; Department of Pathology, University of Utah, Salt Lake City, USA
Brian J. Shayota Division of Medical Genetics, Department of Pediatrics, University of Utah School of Medicine, Salt Lake City, USA
John C. Carey Division of Medical Genetics, Department of Pediatrics, University of Utah School of Medicine, Salt Lake City, USA
Steven E. Boyden Department of Human Genetics, Utah Center for Genetic Discovery, University of Utah, Salt Lake City, USA
Sabrina Malone Jenkins Division of Neonatology, Department of Pediatrics, University of Utah School of Medicine, Salt Lake City, USA

102

Jobson E, Roberts R. Genomic structural variation in tomato and its role in plant immunity. Molecular Horticulture 2022;2:7. [PMID: 37789472 PMCID: PMC10515242 DOI: 10.1186/s43897-022-00029-w] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 11/17/2021] [Accepted: 02/22/2022] [Indexed: 10/05/2023]

103

Assessment of linkage disequilibrium patterns between structural variants and single nucleotide polymorphisms in three commercial chicken populations. BMC Genomics 2022;23:193. [PMID: 35264116 PMCID: PMC8908679 DOI: 10.1186/s12864-022-08418-7] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Received: 08/31/2021] [Accepted: 02/24/2022] [Indexed: 12/29/2022] [Open Access]

Abstract

BACKGROUND

Structural variants (SVs) are causative for some prominent phenotypic traits of livestock, such as different comb types in chickens or color patterns in pigs. Their effects on production traits are also increasingly studied. Nevertheless, accurately calling SVs remains challenging. It is therefore of interest whether nearby single nucleotide polymorphisms (SNPs) are in strong linkage disequilibrium (LD) with SVs and can serve as markers. The literature comes to different conclusions on whether SVs are in LD with SNPs at the same level as SNPs are with other SNPs. The present study aimed to generate a precise SV callset from whole-genome short-read sequencing (WGS) data for three commercial chicken populations and to evaluate LD patterns between the called SVs and surrounding SNPs. It is thereby the first study to assess LD between SVs and SNPs in chickens.

RESULTS

The final callset consisted of 12,294,329 bivariate SNPs, 4,301 deletions (DEL), 224 duplications (DUP), 218 inversions (INV) and 117 translocation breakpoints (BND). While average LD between DELs and SNPs was at the same level as between SNPs and SNPs, LD between other SVs and SNPs was strongly reduced (DUP: 40%, INV: 27%, BND: 19% of between-SNP LD). A main factor for the reduced LD was the presence of local minor allele frequency differences, which accounted for 50% of the difference between SNP-SNP and DUP-SNP LD. This was potentially accompanied by lower genotyping accuracies for DUP, INV and BND compared with SNPs and DELs. An evaluation of the presence of tag SNPs (the SNP in highest LD with the variant of interest) further revealed DELs to be slightly less well tagged by WGS SNPs than WGS SNPs are by other SNPs. This difference, however, was no longer present when the pool of potential tag SNPs was reduced to SNPs located on four different chicken genotyping arrays.

CONCLUSIONS

The results imply that genomic variance due to DELs in the chicken populations studied can be captured by different SNP marker sets as well as variance from WGS SNPs, whereas separate SV calling might be advisable for DUP, INV, and BND effects.
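
As an illustration of the LD and tag-SNP concepts used in this abstract, the following is a minimal sketch, not the authors' pipeline. It assumes LD is approximated by the squared Pearson correlation (r²) of genotype dosages (0, 1, or 2 copies of the alternate allele), which is one common genotype-based estimate. The function names, the sample data, and the choice to return 0.0 for monomorphic variants are all hypothetical.

```python
from statistics import mean


def genotype_r2(x, y):
    """Squared Pearson correlation between two genotype dosage vectors (0/1/2)."""
    if len(x) != len(y) or not x:
        raise ValueError("dosage vectors must be non-empty and of equal length")
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        # Assumption: a monomorphic variant carries no LD information.
        return 0.0
    return cov * cov / (var_x * var_y)


def best_tag_snp(sv_dosage, snp_dosages):
    """Return (snp_id, r2) for the SNP in highest LD with the given SV."""
    scored = ((snp_id, genotype_r2(sv_dosage, g)) for snp_id, g in snp_dosages.items())
    return max(scored, key=lambda pair: pair[1])


# Hypothetical dosages for six individuals.
sv_dosage = [0, 1, 2, 1, 0, 2]
snp_dosages = {
    "snp_a": [0, 1, 2, 1, 0, 2],     # identical to the SV
    "snp_c": [0, 0, 1, 2, 1, 0],     # uncorrelated with the SV
    "snp_mono": [1, 1, 1, 1, 1, 1],  # monomorphic
}

tag_id, tag_r2 = best_tag_snp(sv_dosage, snp_dosages)
print(tag_id, tag_r2)
```

Restricting `snp_dosages` to the SNPs present on a genotyping array would correspond to the reduced tag-SNP pool described in the results.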

104

Liu Z, Roberts R, Mercer TR, Xu J, Sedlazeck FJ, Tong W. Towards accurate and reliable resolution of structural variants for clinical diagnosis. Genome Biol 2022;23:68. [PMID: 35241127 PMCID: PMC8892125 DOI: 10.1186/s13059-022-02636-8] [Citation(s) in RCA: 28] [Impact Index Per Article: 14.0] [Received: 08/27/2021] [Accepted: 02/15/2022] [Indexed: 12/17/2022] [Open Access]

105

Long-read sequencing on the SMRT platform enables efficient haplotype linkage analysis in preimplantation genetic testing for β-thalassemia. J Assist Reprod Genet 2022;39:739-746. [PMID: 35141813 PMCID: PMC8995213 DOI: 10.1007/s10815-022-02415-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 11/18/2021] [Accepted: 01/26/2022] [Indexed: 10/19/2022] [Open Access]

106

Marwaha S, Knowles JW, Ashley EA. A guide for the diagnosis of rare and undiagnosed disease: beyond the exome. Genome Med 2022;14:23. [PMID: 35220969 PMCID: PMC8883622 DOI: 10.1186/s13073-022-01026-w] [Citation(s) in RCA: 105] [Impact Index Per Article: 52.5] [Received: 06/09/2021] [Accepted: 02/10/2022] [Indexed: 02/07/2023] [Open Access]

107

Lemay MA, Sibbesen JA, Torkamaneh D, Hamel J, Levesque RC, Belzile F. Combined use of Oxford Nanopore and Illumina sequencing yields insights into soybean structural variation biology. BMC Biol 2022;20:53. [PMID: 35197050 PMCID: PMC8867729 DOI: 10.1186/s12915-022-01255-w] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Received: 10/25/2021] [Accepted: 02/16/2022] [Indexed: 12/31/2022] [Open Access]

Abstract

BACKGROUND

Structural variants (SVs), including deletions, insertions, duplications, and inversions, are relatively long genomic variations implicated in a diverse range of processes from human disease to ecology and evolution. Given their complex signatures, tendency to occur in repeated regions, and large size, discovering SVs based on short reads is challenging compared to single-nucleotide variants. The increasing availability of long-read technologies has greatly facilitated SV discovery; however, these technologies remain too costly to apply routinely to population-level studies. Here, we combined short-read and long-read sequencing technologies to provide a comprehensive population-scale assessment of structural variation in a panel of Canadian soybean cultivars.

RESULTS

We used Oxford Nanopore long-read sequencing data (~12× mean coverage) for 17 samples to both benchmark SV calls made from Illumina short-read data and predict SVs that were subsequently genotyped in a population of 102 samples using Illumina data. Benchmarking results show that variants discovered using Oxford Nanopore can be accurately genotyped from the Illumina data. We first use the genotyped deletions and insertions for population genetics analyses and show that results are comparable to those based on single-nucleotide variants. We observe that the population frequency and distribution within the genome of deletions and insertions are constrained by the location of genes. Gene Ontology and PFAM domain enrichment analyses also confirm previous reports that genes harboring high-frequency deletions and insertions are enriched for functions in defense response. Finally, we discover polymorphic transposable elements from the deletions and insertions and report evidence of the recent activity of a Stowaway MITE.

CONCLUSIONS

We show that structural variants discovered using Oxford Nanopore data can be genotyped with high accuracy from Illumina data. Our results demonstrate that long-read and short-read sequencing technologies can be efficiently combined to enhance SV analysis in large populations, providing a reusable framework for their study in a wider range of samples and non-model species.

108

Menon VK, Okhuysen PC, Chappell CL, Mahmoud M, Meng Q, Doddapaneni H, Vee V, Han Y, Salvi S, Bhamidipati S, Kottapalli K, Weissenberger G, Shen H, Ross MC, Hoffman KL, Cregeen SJ, Muzny DM, Metcalf GA, Gibbs RA, Petrosino JF, Sedlazeck FJ. Fully resolved assembly of Cryptosporidium parvum. Gigascience 2022;11:giac010. [PMID: 35166336 PMCID: PMC8848321 DOI: 10.1093/gigascience/giac010] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Received: 10/08/2021] [Revised: 12/07/2021] [Accepted: 01/20/2022] [Indexed: 11/17/2022] [Open Access]

Affiliation(s)

Vipin K Menon Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA
Pablo C Okhuysen Department of Infectious Diseases, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
Cynthia L Chappell Center for Infectious Diseases, The University of Texas School of Public Health, Houston, TX 77030, USA
Medhat Mahmoud Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA
Qingchang Meng Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA
Harsha Doddapaneni Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA
Vanesa Vee Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA
Yi Han Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA
Sejal Salvi Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA
Sravya Bhamidipati Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA
Kavya Kottapalli Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA
George Weissenberger Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA
Hua Shen Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA
Matthew C Ross Alkek Center for Metagenomics and Microbiome Research, Department of Molecular Virology and Microbiology, Baylor College of Medicine, Houston, TX, USA
Kristi L Hoffman Alkek Center for Metagenomics and Microbiome Research, Department of Molecular Virology and Microbiology, Baylor College of Medicine, Houston, TX, USA
Sara Javornik Cregeen Alkek Center for Metagenomics and Microbiome Research, Department of Molecular Virology and Microbiology, Baylor College of Medicine, Houston, TX, USA
Donna M Muzny Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA
Ginger A Metcalf Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA
Richard A Gibbs Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA
Joseph F Petrosino Alkek Center for Metagenomics and Microbiome Research, Department of Molecular Virology and Microbiology, Baylor College of Medicine, Houston, TX, USA
Fritz J Sedlazeck Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA

109

Methods to Improve Molecular Diagnosis in Genomic Cold Cases in Pediatric Neurology. Genes (Basel) 2022;13:genes13020333. [PMID: 35205378 PMCID: PMC8871714 DOI: 10.3390/genes13020333] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 01/03/2022] [Revised: 02/06/2022] [Accepted: 02/07/2022] [Indexed: 02/04/2023] [Open Access]

110

Murdock DR, Rosenfeld JA, Lee B. What Has the Undiagnosed Diseases Network Taught Us About the Clinical Applications of Genomic Testing? Annu Rev Med 2022;73:575-585. [PMID: 35084988 PMCID: PMC10874501 DOI: 10.1146/annurev-med-042120-014904] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Indexed: 11/09/2022]

111

Kaur P, Zhang B. New vision on the new era of genome study. Funct Integr Genomics 2022;22:1-2. [PMID: 35038070 PMCID: PMC8761871 DOI: 10.1007/s10142-022-00826-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Indexed: 01/09/2023]

112

Wierzbicki F, Schwarz F, Cannalonga O, Kofler R. Novel quality metrics allow identifying and generating high-quality assemblies of piRNA clusters. Mol Ecol Resour 2022;22:102-121. [PMID: 34181811 DOI: 10.1111/1755-0998.13455] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Received: 12/15/2020] [Revised: 04/30/2021] [Accepted: 06/14/2021] [Indexed: 12/30/2022]

113

Jiang T, Liu S, Cao S, Wang Y. Structural Variant Detection from Long-Read Sequencing Data with cuteSV. Methods Mol Biol 2022;2493:137-151. [PMID: 35751813 DOI: 10.1007/978-1-0716-2293-3_9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Indexed: 06/15/2023]

114

Lemay MA, Malle S. A Practical Guide to Using Structural Variants for Genome-Wide Association Studies. Methods Mol Biol 2022;2481:161-172. [PMID: 35641764 DOI: 10.1007/978-1-0716-2237-7_10] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Indexed: 06/15/2023]

115

Khayat MM, Sahraeian SME, Zarate S, Carroll A, Hong H, Pan B, Shi L, Gibbs RA, Mohiyuddin M, Zheng Y, Sedlazeck FJ. Hidden biases in germline structural variant detection. Genome Biol 2021;22:347. [PMID: 34930391 PMCID: PMC8686633 DOI: 10.1186/s13059-021-02558-x] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Received: 08/14/2020] [Accepted: 11/24/2021] [Indexed: 01/23/2023] [Open Access]

116

Baslan T, Kovaka S, Sedlazeck FJ, Zhang Y, Wappel R, Tian S, Lowe SW, Goodwin S, Schatz MC. High resolution copy number inference in cancer using short-molecule nanopore sequencing. Nucleic Acids Res 2021;49:e124. [PMID: 34551429 PMCID: PMC8643650 DOI: 10.1093/nar/gkab812] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Received: 02/02/2021] [Revised: 07/19/2021] [Accepted: 09/09/2021] [Indexed: 01/23/2023] [Open Access]

117

Chen Z, He X. Application of third-generation sequencing in cancer research. Medical Review (Berlin, Germany) 2021;1:150-171. [PMID: 37724303 PMCID: PMC10388785 DOI: 10.1515/mr-2021-0013] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 05/14/2021] [Accepted: 08/09/2021] [Indexed: 09/20/2023]

118

Hall CL, Kesharwani RK, Phillips NR, Planz JV, Sedlazeck FJ, Zascavage RR. Accurate profiling of forensic autosomal STRs using the Oxford Nanopore Technologies MinION device. Forensic Sci Int Genet 2021;56:102629. [PMID: 34837788 DOI: 10.1016/j.fsigen.2021.102629] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 07/01/2021] [Revised: 09/28/2021] [Accepted: 11/01/2021] [Indexed: 01/23/2023]

Abstract

The high variability characteristic of short tandem repeat (STR) markers is harnessed for human identification in forensic genetic analyses. Despite the power and reliability of current typing techniques, sequence-level information both within and around STRs is masked in the length-based profiles generated. Forensic STR typing using next generation sequencing (NGS) has therefore gained attention as an alternative to traditional capillary electrophoresis (CE) approaches. In this proof-of-principle study, we evaluate the forensic applicability of the newest and smallest NGS platform available, the Oxford Nanopore Technologies (ONT) MinION device. Although nanopore sequencing on the handheld MinION offers numerous advantages, including low startup cost and on-site sample processing, the relatively high error rate and lack of forensic-specific analysis software have prevented accurate profiling across STR panels in previous studies. Here we present STRspy, a streamlined method capable of producing length- and sequence-based STR allele designations from noisy, error-prone third generation sequencing reads. To assess the capabilities of STRspy, seven reference samples (female: n = 2; male: n = 5) were amplified at 15 and 30 PCR cycles using the Promega PowerSeq 46GY System and sequenced on the ONT MinION device in triplicate. Basecalled reads were then processed with STRspy using a custom database containing alleles reported in the STRSeq BioProject NIST 1036 dataset. Resultant STR allele designations and flanking region single nucleotide polymorphism (SNP) calls were compared to the manufacturer-validated genotypes for each sample. STRspy generated robust and reliable genotypes across all autosomal STR loci amplified with 30 PCR cycles, achieving 100% concordance based on both length and sequence. Furthermore, we were able to identify flanking region SNPs in the 15-cycle dataset with > 90% accuracy. These results demonstrate that, when analyzed with STRspy, ONT reads can reveal additional variation in and around STR loci depending on read coverage. As the first and only third generation sequencing platform-specific method to successfully profile the entire panel of autosomal STRs amplified by a commercially available multiplex, STRspy significantly increases the feasibility of nanopore sequencing in forensic applications.

119

Liu H, Yan XM, Wang XR, Zhang DX, Zhou Q, Shi TL, Jia KH, Tian XC, Zhou SS, Zhang RG, Yun QZ, Wang Q, Xiang Q, Mannapperuma C, Van Zalen E, Street NR, Porth I, El-Kassaby YA, Zhao W, Wang XR, Guan W, Mao JF. Centromere-Specific Retrotransposons and Very-Long-Chain Fatty Acid Biosynthesis in the Genome of Yellowhorn (Xanthoceras sorbifolium, Sapindaceae), an Oil-Producing Tree With Significant Drought Resistance. Frontiers in Plant Science 2021;12:766389. [PMID: 34880890 PMCID: PMC8647845 DOI: 10.3389/fpls.2021.766389] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Received: 09/02/2021] [Accepted: 10/18/2021] [Indexed: 05/17/2023]

Affiliation(s)

Hui Liu National Engineering Laboratory for Tree Breeding, Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, School of Ecology and Nature Conservation, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, China
Xue-Mei Yan National Engineering Laboratory for Tree Breeding, Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, School of Ecology and Nature Conservation, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, China
Xin-rui Wang National Engineering Laboratory for Tree Breeding, Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, School of Ecology and Nature Conservation, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, China
Dong-Xu Zhang Protected Agricultural Technology, R&D Center, Shanxi Datong University, Datong, China
Qingyuan Zhou Key Laboratory of Plant Resources, Institute of Botany, Chinese Academy of Sciences, Beijing, China
Tian-Le Shi National Engineering Laboratory for Tree Breeding, Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, School of Ecology and Nature Conservation, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, China
Kai-Hua Jia National Engineering Laboratory for Tree Breeding, Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, School of Ecology and Nature Conservation, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, China
Xue-Chan Tian National Engineering Laboratory for Tree Breeding, Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, School of Ecology and Nature Conservation, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, China
Shan-Shan Zhou National Engineering Laboratory for Tree Breeding, Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, School of Ecology and Nature Conservation, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, China
Ren-Gang Zhang Department of Bioinformatics, Ori (Shandong) Gene Science and Technology Co., Ltd., Weifang, China
Quan-Zheng Yun Department of Bioinformatics, Ori (Shandong) Gene Science and Technology Co., Ltd., Weifang, China
Qing Wang Key Laboratory of Forest Ecology and Environment of the National Forestry and Grassland Administration, Research Institute of Forest Ecology, Environment and Protection, Chinese Academy of Forestry, Beijing, China
Qiuhong Xiang National Engineering Laboratory for Tree Breeding, Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, School of Ecology and Nature Conservation, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, China
Chanaka Mannapperuma Umeå Plant Science Centre, Department of Plant Physiology, Umeå University, Umeå, Sweden
Elena Van Zalen Umeå Plant Science Centre, Department of Plant Physiology, Umeå University, Umeå, Sweden
Nathaniel R. Street Umeå Plant Science Centre, Department of Plant Physiology, Umeå University, Umeå, Sweden
Ilga Porth Département des Sciences du Bois et de la Forêt, Faculté de Foresterie, de Géographie et de Géomatique, Université Laval Québec, Quebec City, QC, Canada
Yousry A. El-Kassaby Department of Forest and Conservation Sciences, Faculty of Forestry, University of British Columbia, Vancouver, BC, Canada
Wei Zhao National Engineering Laboratory for Tree Breeding, Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, School of Ecology and Nature Conservation, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, China; Department of Ecology and Environmental Science, Umeå Plant Science Centre, Umeå University, Umeå, Sweden
Xiao-Ru Wang National Engineering Laboratory for Tree Breeding, Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, School of Ecology and Nature Conservation, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, China; Department of Ecology and Environmental Science, Umeå Plant Science Centre, Umeå University, Umeå, Sweden
Wenbin Guan National Engineering Laboratory for Tree Breeding, Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, School of Ecology and Nature Conservation, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, China
Jian-Feng Mao National Engineering Laboratory for Tree Breeding, Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, School of Ecology and Nature Conservation, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, China

120

Comprehensive characterization of copy number variation (CNV) called from array, long- and short-read data. BMC Genomics 2021;22:826. [PMID: 34789167 PMCID: PMC8596897 DOI: 10.1186/s12864-021-08082-3] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Received: 06/09/2021] [Accepted: 10/13/2021] [Indexed: 11/10/2022] [Open Access]

121

Schielzeth H, Wolf JBW. Community genomics: a community-wide perspective on within-species genetic diversity. American Journal of Botany 2021;108:2108-2111. [PMID: 34767249 DOI: 10.1002/ajb2.1796] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 03/02/2021] [Accepted: 09/07/2021] [Indexed: 06/13/2023]

122

Coombe L, Li JX, Lo T, Wong J, Nikolic V, Warren RL, Birol I. LongStitch: high-quality genome assembly correction and scaffolding using long reads. BMC Bioinformatics 2021;22:534. [PMID: 34717540 PMCID: PMC8557608 DOI: 10.1186/s12859-021-04451-7] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Received: 06/17/2021] [Accepted: 10/19/2021] [Indexed: 12/12/2022] [Open Access]

Abstract

BACKGROUND

Generating high-quality de novo genome assemblies is foundational to the genomics study of model and non-model organisms. In recent years, long-read sequencing has greatly benefited genome assembly and scaffolding, a process by which assembled sequences are ordered and oriented through the use of long-range information. Long reads are better able to span repetitive genomic regions compared to short reads, and thus have tremendous utility for resolving problematic regions and helping generate more complete draft assemblies. Here, we present LongStitch, a scalable pipeline that corrects and scaffolds draft genome assemblies exclusively using long reads.

RESULTS

LongStitch incorporates multiple tools developed by our group and runs in up to three stages, which include initial assembly correction (Tigmint-long), followed by two incremental scaffolding stages (ntLink and ARKS-long). Tigmint-long and ARKS-long are misassembly correction and scaffolding utilities, respectively, previously developed for linked reads, that we adapted for long reads. Here, we describe the LongStitch pipeline and introduce our new long-read scaffolder, ntLink, which utilizes lightweight minimizer mappings to join contigs. LongStitch was tested on short- and long-read assemblies of Caenorhabditis elegans, Oryza sativa, and three different human individuals using corresponding nanopore long-read data, and improves the contiguity of each assembly from 1.2-fold up to 304.6-fold (as measured by NGA50 length). Furthermore, LongStitch generates more contiguous and correct assemblies compared to the state-of-the-art long-read scaffolder LRScaf in most tests, and consistently improves upon human assemblies in under five hours using less than 23 GB of RAM.

CONCLUSIONS

Due to its effectiveness and efficiency in improving draft assemblies using long reads, we expect LongStitch to benefit a wide variety of de novo genome assembly projects. The LongStitch pipeline is freely available at https://github.com/bcgsc/longstitch.
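The abstract above says that ntLink joins contigs using lightweight minimizer mappings. The following is a minimal sketch of the general minimizer idea only, not ntLink's implementation: the function names, the lexicographic ordering of k-mers (real tools typically order k-mers by a hash), and the example sequences and parameters are all assumptions made for illustration.

```python
def minimizers(seq: str, k: int, w: int) -> list[tuple[int, str]]:
    """Return (position, k-mer) pairs: the smallest k-mer in every window
    of w consecutive k-mers, with consecutive duplicates collapsed.

    Assumption: k-mers are compared lexicographically for simplicity.
    """
    kmers = [seq[i:i + k] for i in range(len(seq) - k + 1)]
    picked: list[tuple[int, str]] = []
    for start in range(len(kmers) - w + 1):
        window = kmers[start:start + w]
        # min() returns the leftmost index when several k-mers tie
        offset = min(range(w), key=lambda j: window[j])
        hit = (start + offset, window[offset])
        if not picked or picked[-1] != hit:
            picked.append(hit)
    return picked


def shared_minimizers(a: str, b: str, k: int, w: int) -> set[str]:
    """K-mers selected as minimizers in both sequences (hypothetical helper)."""
    return {m for _, m in minimizers(a, k, w)} & {m for _, m in minimizers(b, k, w)}


if __name__ == "__main__":
    # Toy sequences, invented for this sketch: a contig end and a read overlapping it.
    contig_end = "TTGACCGATAGGCTA"
    read = "CCGATAGGCTAAGTC"
    print(sorted(shared_minimizers(contig_end, read, k=5, w=3)))
```

Because only one k-mer per window is kept, two sequences that overlap tend to select the same minimizers in the shared region, so an overlap can be detected by comparing small sketches rather than full sequences.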

123

Democratizing long-read genome assembly. Cell Syst 2021;12:945-947. [PMID: 34672955 DOI: 10.1016/j.cels.2021.09.010] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Indexed: 01/30/2023]

124

Luo X, Cui K, Wang Z, Li Z, Wu Z, Huang W, Zhu XQ, Ruan J, Zhang W, Liu Q. High-quality reference genome of Fasciola gigantica: Insights into the genomic signatures of transposon-mediated evolution and specific parasitic adaption in tropical regions. PLoS Negl Trop Dis 2021;15:e0009750. [PMID: 34610021 PMCID: PMC8519440 DOI: 10.1371/journal.pntd.0009750] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Received: 04/12/2021] [Revised: 10/15/2021] [Accepted: 08/23/2021] [Indexed: 12/31/2022] Open

Abstract

Fasciola gigantica and Fasciola hepatica are causative pathogens of fascioliasis, with the widest latitudinal, longitudinal, and altitudinal distribution; however, among parasites, they have the largest sequenced genomes, hindering genomic research. In the present study, we used various sequencing and assembly technologies to generate a new high-quality Fasciola gigantica reference genome. We improved the integration of gene structure prediction, and identified two independent transposable element expansion events contributing to (1) the speciation between Fasciola and Fasciolopsis during the Cretaceous-Paleogene boundary mass extinction, and (2) the habitat switch to the liver during the Paleocene-Eocene Thermal Maximum, accompanied by gene length increment. Long interspersed element (LINE) duplication contributed to the second transposon-mediated alteration, showing an obvious trend of insertion into gene regions, regardless of strong purifying effect. Gene ontology analysis of genes with long LINE insertions identified membrane-associated and vesicle secretion process proteins, further implicating the functional alteration of the gene network. We identified 852 predicted excretory/secretory proteins and 3300 protein-protein interactions between Fasciola gigantica and its host. Among them, copper/zinc superoxide dismutase genes, with specific gene copy number variations, might play a central role in the phase I detoxification process. Analysis of 559 single-copy orthologs suggested that Fasciola gigantica and Fasciola hepatica diverged at 11.8 Ma near the Middle and Late Miocene Epoch boundary. We identified 98 rapidly evolving gene families, including actin and aquaporin, which might explain the large body size and the parasitic adaptive character resulting in these liver flukes becoming epidemic in tropical and subtropical regions.

Affiliation(s)

Xier Luo Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China; State Key Laboratory for Conservation and Utilization of Subtropical Agro-bioresources, Guangxi University, Nanning, China
Kuiqing Cui State Key Laboratory for Conservation and Utilization of Subtropical Agro-bioresources, Guangxi University, Nanning, China
Zhiqiang Wang State Key Laboratory for Conservation and Utilization of Subtropical Agro-bioresources, Guangxi University, Nanning, China
Zhipeng Li State Key Laboratory for Conservation and Utilization of Subtropical Agro-bioresources, Guangxi University, Nanning, China
Zhengjiao Wu State Key Laboratory for Conservation and Utilization of Subtropical Agro-bioresources, Guangxi University, Nanning, China
Weiyi Huang State Key Laboratory for Conservation and Utilization of Subtropical Agro-bioresources, Guangxi University, Nanning, China
Xing-Quan Zhu College of Veterinary Medicine, Shanxi Agricultural University, Taigu, China
Jue Ruan Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China; State Key Laboratory for Conservation and Utilization of Subtropical Agro-bioresources, Guangxi University, Nanning, China
Weiyu Zhang State Key Laboratory for Conservation and Utilization of Subtropical Agro-bioresources, Guangxi University, Nanning, China
Qingyou Liu Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China; State Key Laboratory for Conservation and Utilization of Subtropical Agro-bioresources, Guangxi University, Nanning, China

125

Revollo JR, Miranda JA, Dobrovolsky VN. PacBio sequencing detects genome-wide ultra-low-frequency substitution mutations resulting from exposure to chemical mutagens. Environ Mol Mutagen 2021;62:438-445. [PMID: 34424574 DOI: 10.1002/em.22462] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/09/2021] [Revised: 08/18/2021] [Accepted: 08/20/2021] [Indexed: 06/13/2023]

126

Fu Y, Mahmoud M, Muraliraman VV, Sedlazeck FJ, Treangen TJ. Vulcan: Improved long-read mapping and structural variant calling via dual-mode alignment. Gigascience 2021;10:6375129. [PMID: 34561697 PMCID: PMC8463296 DOI: 10.1093/gigascience/giab063] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Received: 06/02/2021] [Revised: 07/22/2021] [Accepted: 08/29/2021] [Indexed: 01/23/2023] Open

127

Lima L, Marchet C, Caboche S, Da Silva C, Istace B, Aury JM, Touzet H, Chikhi R. Comparative assessment of long-read error correction software applied to Nanopore RNA-sequencing data. Brief Bioinform 2020;21:1164-1181. [PMID: 31232449 DOI: 10.1093/bib/bbz058] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Received: 12/28/2018] [Revised: 04/05/2019] [Accepted: 04/22/2019] [Indexed: 12/13/2022] Open

128

Genomic and transcriptomic analyses reveal a tandem amplification unit of 11 genes and mutations in mismatch repair genes in methotrexate-resistant HT-29 cells. Exp Mol Med 2021;53:1344-1355. [PMID: 34521988 PMCID: PMC8492700 DOI: 10.1038/s12276-021-00668-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 01/22/2021] [Revised: 06/04/2021] [Accepted: 06/21/2021] [Indexed: 12/16/2022] Open

Abstract

DHFR gene amplification is commonly present in methotrexate (MTX)-resistant colon cancer cells and acute lymphoblastic leukemia. In this study, we proposed an integrative framework to characterize the amplified region by using a combination of single-molecule real-time sequencing, next-generation optical mapping, and chromosome conformation capture (Hi-C). We identified an amplification unit spanning 11 genes, from the DHFR gene to the ATP6AP1L gene position, with high adjusted interaction frequencies on chromosome 5 (~2.2 Mbp) and a twenty-fold tandemly amplified region. Novel inversions at the start and end positions of the amplified region, as well as frameshift insertions in most of the MSH and MLH genes, were also detected. These mutations might stimulate chromosomal breakage and cause the dysregulation of mismatch repair. Characterizing the tandem gene-amplified unit may be critical for identifying the mechanisms that trigger genomic rearrangements. These findings may provide new insight into the mechanisms underlying the amplification process and the evolution of drug resistance.

Sequencing a large region of DNA containing many surplus copies of genes linked to drug resistance in colon cancer cells may illuminate how these genomic rearrangements arise. Such regions of gene amplification are highly repetitive, making them impossible to sequence using ordinary methods, and little is known about how they are generated. Using advanced methods, Jeong-Sun Seo at Seoul National University Bundang Hospital in South Korea and co-workers sequenced a region of gene amplification in colon cancer cells. The amplified region was approximately 20 times the length of that in healthy cells and contained many copies of an eleven-gene segment, including a gene implicated in drug resistance. The region also contained mutations in chromosomal repair genes which would disrupt repair pathways. These results illuminate the genetic changes that lead to gene amplification and drug resistance in cancer cells.

129

Yan SM, Sherman RM, Taylor DJ, Nair DR, Bortvin AN, Schatz MC, McCoy RC. Local adaptation and archaic introgression shape global diversity at human structural variant loci. eLife 2021;10:e67615. [PMID: 34528508 PMCID: PMC8492059 DOI: 10.7554/elife.67615] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Received: 02/18/2021] [Accepted: 09/14/2021] [Indexed: 12/13/2022] Open

130

Mahmoud M, Doddapaneni H, Timp W, Sedlazeck FJ. PRINCESS: comprehensive detection of haplotype resolved SNVs, SVs, and methylation. Genome Biol 2021;22:268. [PMID: 34521442 PMCID: PMC8442460 DOI: 10.1186/s13059-021-02486-w] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Received: 05/01/2021] [Accepted: 09/02/2021] [Indexed: 12/11/2022] Open

131

Délot EC, Vilain E. Towards improved genetic diagnosis of human differences of sex development. Nat Rev Genet 2021;22:588-602. [PMID: 34083777 PMCID: PMC10598994 DOI: 10.1038/s41576-021-00365-5] [Citation(s) in RCA: 27] [Impact Index Per Article: 9.0] [Accepted: 04/14/2021] [Indexed: 02/05/2023]

132

De Coster W, Weissensteiner MH, Sedlazeck FJ. Towards population-scale long-read sequencing. Nat Rev Genet 2021;22:572-587. [PMID: 34050336 PMCID: PMC8161719 DOI: 10.1038/s41576-021-00367-3] [Citation(s) in RCA: 123] [Impact Index Per Article: 41.0] [Accepted: 04/20/2021] [Indexed: 02/05/2023]

133

Hu D, Jing J, Snowdon RJ, Mason AS, Shen J, Meng J, Zou J. Exploring the gene pool of Brassica napus by genomics-based approaches. Plant Biotechnol J 2021;19:1693-1712. [PMID: 34031989 PMCID: PMC8428838 DOI: 10.1111/pbi.13636] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Received: 12/09/2020] [Revised: 05/13/2021] [Accepted: 05/14/2021] [Indexed: 05/08/2023]

134

Miga KH, Wang T. The Need for a Human Pangenome Reference Sequence. Annu Rev Genomics Hum Genet 2021;22:81-102. [PMID: 33929893 PMCID: PMC8410644 DOI: 10.1146/annurev-genom-120120-081921] [Citation(s) in RCA: 51] [Impact Index Per Article: 17.0] [Indexed: 12/15/2022]

135

Alser M, Rotman J, Deshpande D, Taraszka K, Shi H, Baykal PI, Yang HT, Xue V, Knyazev S, Singer BD, Balliu B, Koslicki D, Skums P, Zelikovsky A, Alkan C, Mutlu O, Mangul S. Technology dictates algorithms: recent developments in read alignment. Genome Biol 2021;22:249. [PMID: 34446078 PMCID: PMC8390189 DOI: 10.1186/s13059-021-02443-7] [Citation(s) in RCA: 37] [Impact Index Per Article: 12.3] [Received: 10/15/2020] [Accepted: 07/28/2021] [Indexed: 01/08/2023] Open

Affiliation(s)

Mohammed Alser Computer Science Department, ETH Zürich, 8092, Zürich, Switzerland; Computer Engineering Department, Bilkent University, 06800 Bilkent, Ankara, Turkey; Information Technology and Electrical Engineering Department, ETH Zürich, Zürich, 8092, Switzerland
Jeremy Rotman Department of Computer Science, University of California Los Angeles, Los Angeles, CA, 90095, USA
Dhrithi Deshpande Department of Clinical Pharmacy, School of Pharmacy, University of Southern California, Los Angeles, CA, 90089, USA
Kodi Taraszka Department of Computer Science, University of California Los Angeles, Los Angeles, CA, 90095, USA
Huwenbo Shi Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, 02115, USA; Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
Pelin Icer Baykal Department of Computer Science, Georgia State University, Atlanta, GA, 30302, USA
Harry Taegyun Yang Department of Computer Science, University of California Los Angeles, Los Angeles, CA, 90095, USA; Bioinformatics Interdepartmental Ph.D. Program, University of California Los Angeles, Los Angeles, CA, 90095, USA
Victor Xue Department of Computer Science, University of California Los Angeles, Los Angeles, CA, 90095, USA
Sergey Knyazev Department of Computer Science, Georgia State University, Atlanta, GA, 30302, USA
Benjamin D Singer Division of Pulmonary and Critical Care Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, 60611, USA; Department of Biochemistry & Molecular Genetics, Northwestern University Feinberg School of Medicine, Chicago, USA; Simpson Querrey Institute for Epigenetics, Northwestern University Feinberg School of Medicine, Chicago, IL, 60611, USA
Brunilda Balliu Department of Computational Medicine, University of California Los Angeles, Los Angeles, CA, 90095, USA
David Koslicki Computer Science and Engineering, Pennsylvania State University, University Park, PA, 16801, USA; Biology Department, Pennsylvania State University, University Park, PA, 16801, USA; The Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, PA, 16801, USA
Pavel Skums Department of Computer Science, Georgia State University, Atlanta, GA, 30302, USA
Alex Zelikovsky Department of Computer Science, Georgia State University, Atlanta, GA, 30302, USA; The Laboratory of Bioinformatics, I.M. Sechenov First Moscow State Medical University, Moscow, 119991, Russia
Can Alkan Computer Engineering Department, Bilkent University, 06800 Bilkent, Ankara, Turkey; Bilkent-Hacettepe Health Sciences and Technologies Program, Ankara, Turkey
Onur Mutlu Computer Science Department, ETH Zürich, 8092, Zürich, Switzerland; Computer Engineering Department, Bilkent University, 06800 Bilkent, Ankara, Turkey; Information Technology and Electrical Engineering Department, ETH Zürich, Zürich, 8092, Switzerland
Serghei Mangul Department of Clinical Pharmacy, School of Pharmacy, University of Southern California, Los Angeles, CA, 90089, USA

136

Locke RK, Greig DR, Jenkins C, Dallman TJ, Cowley LA. Acquisition and loss of CTX-M plasmids in Shigella species associated with MSM transmission in the UK. Microb Genom 2021;7. [PMID: 34427554 PMCID: PMC8549364 DOI: 10.1099/mgen.0.000644] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Indexed: 02/03/2023] Open

137

Reddy S, Hung LH, Sala-Torra O, Radich JP, Yeung CC, Yeung KY. A graphical, interactive and GPU-enabled workflow to process long-read sequencing data. BMC Genomics 2021;22:626. [PMID: 34425749 PMCID: PMC8381503 DOI: 10.1186/s12864-021-07927-1] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Received: 05/27/2021] [Accepted: 08/10/2021] [Indexed: 12/18/2022] Open

Abstract

Background

Long-read sequencing has great promise in enabling portable, rapid molecular-assisted cancer diagnoses. A key challenge in democratizing long-read sequencing technology in the biomedical and clinical community is the lack of graphical bioinformatics software tools that can efficiently process raw nanopore reads and support graphical output and interactive visualizations for interpreting results. Another obstacle is that high-performance software tools for long-read sequencing data analyses often leverage graphics processing units (GPUs), which are challenging and time-consuming to configure, especially on the cloud.

Results

We present a graphical cloud-enabled workflow for fast, interactive analysis of nanopore sequencing data using GPUs. Users customize parameters, monitor execution and visualize results through an accessible graphical interface. The workflow and its components are completely containerized to ensure reproducibility and facilitate installation of the GPU-enabled software. We also provide an Amazon Machine Image (AMI) with all software and drivers pre-installed for GPU computing on the cloud. Most importantly, we demonstrate the potential of applying our software tools to reduce the turnaround time of cancer diagnostics by generating blood cancer (NB4, K562, ME1, MV4;11) cell line Nanopore data using the Flongle adapter. We observe a 29x speedup and a 93x reduction in costs for the rate-limiting basecalling step in the analysis of blood cancer cell line data.

Conclusions

Our interactive and efficient software tools will make analyses of Nanopore data using GPU and cloud computing accessible to biomedical and clinical scientists, thus facilitating the adoption of cost effective, fast, portable and real-time long-read sequencing.

Supplementary Information

The online version contains supplementary material available at 10.1186/s12864-021-07927-1.

138

Wold J, Koepfli KP, Galla SJ, Eccles D, Hogg CJ, Le Lec MF, Guhlin J, Santure AW, Steeves TE. Expanding the conservation genomics toolbox: Incorporating structural variants to enhance genomic studies for species of conservation concern. Mol Ecol 2021;30:5949-5965. [PMID: 34424587 PMCID: PMC9290615 DOI: 10.1111/mec.16141] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Received: 01/20/2021] [Revised: 07/28/2021] [Accepted: 08/18/2021] [Indexed: 12/28/2022]

139

Zhang S, Liu W, Liu X, Du X, Zhang K, Zhang Y, Song Y, Zi Y, Qiu Q, Lenstra JA, Liu J. Structural Variants Selected during Yak Domestication Inferred from Long-Read Whole-Genome Sequencing. Mol Biol Evol 2021;38:3676-3680. [PMID: 33944937 PMCID: PMC8382902 DOI: 10.1093/molbev/msab134] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Indexed: 12/12/2022] Open

140

Hirakawa H, Toyoda A, Itoh T, Suzuki Y, Nagano AJ, Sugiyama S, Onodera Y. A spinach genome assembly with remarkable completeness, and its use for rapid identification of candidate genes for agronomic traits. DNA Res 2021;28:6303609. [PMID: 34142133 PMCID: PMC8231376 DOI: 10.1093/dnares/dsab004] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Received: 01/05/2021] [Indexed: 01/23/2023] Open

141

Karousis ED, Gypas F, Zavolan M, Mühlemann O. Nanopore sequencing reveals endogenous NMD-targeted isoforms in human cells. Genome Biol 2021;22:223. [PMID: 34389041 PMCID: PMC8361881 DOI: 10.1186/s13059-021-02439-3] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Received: 04/30/2021] [Accepted: 07/26/2021] [Indexed: 12/13/2022] Open

142

Sakamoto Y, Zaha S, Suzuki Y, Seki M, Suzuki A. Application of long-read sequencing to the detection of structural variants in human cancer genomes. Comput Struct Biotechnol J 2021;19:4207-4216. [PMID: 34527193 PMCID: PMC8350331 DOI: 10.1016/j.csbj.2021.07.030] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Received: 04/06/2021] [Revised: 07/20/2021] [Accepted: 07/25/2021] [Indexed: 01/02/2023] Open

143

Tan KT, Kim H, Carrot-Zhang J, Zhang Y, Kim WJ, Kugener G, Wala JA, Howard TP, Chi YY, Beroukhim R, Li H, Ha G, Alper SL, Perlman EJ, Mullen EA, Hahn WC, Meyerson M, Hong AL. Haplotype-resolved germline and somatic alterations in renal medullary carcinomas. Genome Med 2021;13:114. [PMID: 34261517 PMCID: PMC8281718 DOI: 10.1186/s13073-021-00929-4] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Received: 10/13/2020] [Accepted: 06/25/2021] [Indexed: 11/10/2022] Open

Affiliation(s)

Kar-Tong Tan Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA; Broad Institute of MIT and Harvard, Cambridge, MA, USA; Department of Genetics, Harvard Medical School, Boston, MA, USA
Hyunji Kim Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA; Broad Institute of MIT and Harvard, Cambridge, MA, USA; Department of Genetics, Harvard Medical School, Boston, MA, USA
Jian Carrot-Zhang Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA; Broad Institute of MIT and Harvard, Cambridge, MA, USA; Department of Genetics, Harvard Medical School, Boston, MA, USA
Yuxiang Zhang Department of Genetics, Harvard Medical School, Boston, MA, USA
Won Jun Kim Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA; Broad Institute of MIT and Harvard, Cambridge, MA, USA
Guillaume Kugener Broad Institute of MIT and Harvard, Cambridge, MA, USA
Jeremiah A Wala Department of Medicine, University of California San Francisco, San Francisco, CA, USA
Thomas P Howard Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA; Broad Institute of MIT and Harvard, Cambridge, MA, USA
Yueh-Yun Chi Department of Pediatrics, University of Southern California, Los Angeles, CA, USA
Rameen Beroukhim Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA; Broad Institute of MIT and Harvard, Cambridge, MA, USA
Heng Li Biostatistics and Computational Biology, Dana-Farber Cancer Institute, Boston, MA, USA
Gavin Ha Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
Seth L Alper Department of Medicine, Beth Israel Deaconess Medical Center, Boston, MA, USA
Elizabeth J Perlman Department of Pathology, Northwestern University, Chicago, IL, USA
Elizabeth A Mullen Department of Hematology and Oncology, Boston Children's Hospital, Boston, MA, USA; Department of Pediatric Oncology, Dana-Farber Cancer Institute, Boston, MA, USA
William C Hahn Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA; Broad Institute of MIT and Harvard, Cambridge, MA, USA
Matthew Meyerson Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA; Broad Institute of MIT and Harvard, Cambridge, MA, USA; Department of Genetics, Harvard Medical School, Boston, MA, USA
Andrew L Hong Department of Pediatrics, Emory University, Atlanta, GA, USA; Aflac Center for Cancer and Blood Disorders, Children's Healthcare of Atlanta, Atlanta, GA, USA

144

Kamil G, Yoon JY, Yoo S, Cheon CK. Clinical relevance of targeted exome sequencing in patients with rare syndromic short stature. Orphanet J Rare Dis 2021;16:297. [PMID: 34217350 PMCID: PMC8254301 DOI: 10.1186/s13023-021-01937-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/16/2021] [Accepted: 06/27/2021] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

Large-scale genomic analyses have provided insight into the genetic complexity of short stature (SS); however, only a portion of genetic causes have been identified. In this study, we identified disease-causing mutations in a cohort of Korean patients with suspected syndromic SS by targeted exome sequencing (TES).

METHODS

Thirty-four patients in South Korea with suspected syndromic disorders based on abnormal growth and dysmorphic facial features, developmental delay, or accompanying anomalies were enrolled in 2018-2020 and evaluated by TES.

RESULTS

For 17 of 34 patients with suspected syndromic SS, a genetic diagnosis was obtained by TES. The mean SDS values for height, IGF-1, and IGFBP-3 for these 17 patients were -3.27 ± 1.25, -0.42 ± 1.15, and 0.36 ± 1.31, respectively. Most patients displayed distinct facial features (16/17) and developmental delay or intellectual disability (12/17). In 17 patients, 19 genetic variants were identified, including 13 novel heterozygous variants, associated with 15 different genetic diseases, including many inherited rare skeletal disorders and connective tissue diseases (e.g., cleidocranial dysplasia, Hajdu-Cheney syndrome, Sheldon-Hall syndrome, acromesomelic dysplasia Maroteaux type, and microcephalic osteodysplastic primordial dwarfism type II). After re-classification by clinical reassessment, including family member testing and segregation studies, 42.1% of variants were pathogenic, 42.1% were likely pathogenic, and 15.7% were variants of uncertain significance. Ultra-rare diseases accounted for 12 out of 15 genetic diseases (80%).

CONCLUSIONS

The high rate of positive results from genetic testing suggests that TES may be an effective diagnostic approach for patients with syndromic SS, with implications for genetic counseling. These results expand the mutation spectrum for rare genetic diseases related to SS in Korea.
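The proportions quoted in the abstract above can be checked with a few lines of arithmetic. This is only an illustrative sketch: the per-class variant counts (8, 8 and 3) are not stated in the abstract and are assumed here because they are the only whole-number split of 19 variants that matches the reported percentages. The helper name pct is likewise hypothetical.

```python
def pct(part: int, whole: int) -> float:
    """Percentage of `whole` represented by `part`."""
    return 100.0 * part / whole


# Figures stated in the abstract
diagnosed, enrolled = 17, 34
variants = 19
ultra_rare, diseases = 12, 15

# ASSUMED counts per class (inferred from the percentages, not reported)
counts = {"pathogenic": 8, "likely pathogenic": 8, "uncertain significance": 3}
assert sum(counts.values()) == variants

diagnostic_yield = pct(diagnosed, enrolled)
shares = {label: pct(n, variants) for label, n in counts.items()}
ultra_rare_share = pct(ultra_rare, diseases)

if __name__ == "__main__":
    print(f"diagnostic yield: {diagnostic_yield:.1f}%")
    for label, share in shares.items():
        print(f"{label}: {share:.3f}%")
    print(f"ultra-rare share of diseases: {ultra_rare_share:.1f}%")
```

Under these assumed counts, 3 of 19 is about 15.79%, so the reported 15.7% appears to be truncated rather than rounded, which also explains why the three reported shares sum to 99.9%.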

145

Methods to Study Translated Pseudogenes: Recombinant Expression and Complementation, Targeted Proteomics, and RNA Profiling. Methods Mol Biol 2021. [PMID: 34165719 DOI: 10.1007/978-1-0716-1503-4_15] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Indexed: 08/19/2024]

146

Tunjić-Cvitanić M, Pasantes JJ, García-Souto D, Cvitanić T, Plohl M, Šatović-Vukšić E. Satellitome Analysis of the Pacific Oyster Crassostrea gigas Reveals New Pattern of Satellite DNA Organization, Highly Scattered across the Genome. Int J Mol Sci 2021;22:ijms22136798. [PMID: 34202698 PMCID: PMC8268682 DOI: 10.3390/ijms22136798] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 06/09/2021] [Revised: 06/18/2021] [Accepted: 06/19/2021] [Indexed: 12/22/2022] Open

147

Tvedte ES, Gasser M, Sparklin BC, Michalski J, Hjelmen CE, Johnston JS, Zhao X, Bromley R, Tallon LJ, Sadzewicz L, Rasko DA, Dunning Hotopp JC. Comparison of long-read sequencing technologies in interrogating bacteria and fly genomes. G3 (Bethesda) 2021;11:jkab083. [PMID: 33768248 PMCID: PMC8495745 DOI: 10.1093/g3journal/jkab083] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Received: 02/09/2021] [Accepted: 03/07/2021] [Indexed: 12/14/2022]

148

Suh A, Dion-Côté AM. New Perspectives on the Evolution of Within-Individual Genome Variation and Germline/Soma Distinction. Genome Biol Evol 2021;13:evab095. [PMID: 33963843 PMCID: PMC8245192 DOI: 10.1093/gbe/evab095] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Accepted: 05/07/2021] [Indexed: 12/19/2022] Open

149

Guiglielmoni N, Houtain A, Derzelle A, Van Doninck K, Flot JF. Overcoming uncollapsed haplotypes in long-read assemblies of non-model organisms. BMC Bioinformatics 2021;22:303. [PMID: 34090340 PMCID: PMC8178825 DOI: 10.1186/s12859-021-04118-3] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Received: 01/18/2021] [Accepted: 04/02/2021] [Indexed: 12/21/2022] Open

Abstract

Background

Long-read sequencing is revolutionizing genome assembly: as PacBio and Nanopore technologies become more accessible both technically and financially, long-read assemblers flourish and are starting to deliver chromosome-level assemblies. However, these long reads are usually error-prone, making the generation of a haploid reference out of a diploid genome a difficult enterprise. Failure to properly collapse haplotypes results in fragmented and structurally incorrect assemblies and wreaks havoc on orthology inference pipelines, yet this serious issue is rarely acknowledged and dealt with in genomic projects, and an independent, comparative benchmark of the capacity of assemblers and post-processing tools to properly collapse or purge haplotypes is still lacking.

Results

We tested different assembly strategies on the genome of the rotifer Adineta vaga, a non-model organism for which high coverages of both PacBio and Nanopore reads were available. The assemblers we tested (Canu, Flye, NextDenovo, Ra, Raven, Shasta and wtdbg2) exhibited strikingly different behaviors when dealing with highly heterozygous regions, resulting in variable amounts of uncollapsed haplotypes. Filtering reads generally improved haploid assemblies, and we also benchmarked three post-processing tools aimed at detecting and purging uncollapsed haplotypes in long-read assemblies: HaploMerger2, purge_haplotigs and purge_dups.

Conclusions

We provide a thorough evaluation of popular assemblers on a non-model eukaryote genome with variable levels of heterozygosity. Our study highlights several strategies using pre- and post-processing approaches to generate haploid assemblies with high continuity and completeness. This benchmark will help users to improve haploid assemblies of non-model organisms and to evaluate the quality of their own assemblies.

Supplementary Information

The online version contains supplementary material available at 10.1186/s12859-021-04118-3.

150

Ono Y, Asai K, Hamada M. PBSIM2: a simulator for long-read sequencers with a novel generative model of quality scores. Bioinformatics 2021;37:589-595. [PMID: 32976553 PMCID: PMC8097687 DOI: 10.1093/bioinformatics/btaa835] [Citation(s) in RCA: 46] [Impact Index Per Article: 15.3] [Received: 06/22/2020] [Revised: 08/20/2020] [Accepted: 09/11/2020] [Indexed: 12/21/2022] Open