1
|
Kolesnikov A, Cook D, Nattestad M, Brambrink L, McNulty B, Gorzynski J, Goenka S, Ashley EA, Jain M, Miga KH, Paten B, Chang PC, Carroll A, Shafin K. Local read haplotagging enables accurate long-read small variant calling. Nat Commun 2024; 15:5907. [PMID: 39003259 PMCID: PMC11246426 DOI: 10.1038/s41467-024-50079-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Accepted: 06/28/2024] [Indexed: 07/15/2024] Open
Abstract
Long-read sequencing technology has enabled variant detection in difficult-to-map regions of the genome and enabled rapid genetic diagnosis in clinical settings. Rapidly evolving third-generation sequencing platforms like Pacific Biosciences (PacBio) and Oxford Nanopore Technologies (ONT) are introducing newer platforms and data types. It has been demonstrated that variant calling methods based on deep neural networks can use local haplotyping information with long-reads to improve the genotyping accuracy. However, using local haplotype information creates an overhead as variant calling needs to be performed multiple times which ultimately makes it difficult to extend to new data types and platforms as they get introduced. In this work, we have developed a local haplotype approximate method that enables state-of-the-art variant calling performance with multiple sequencing platforms including PacBio Revio system, ONT R10.4 simplex and duplex data. This addition of local haplotype approximation simplifies long-read variant calling with DeepVariant.
Collapse
Affiliation(s)
| | - Daniel Cook
- Google Inc, 1600 Amphitheatre Pkwy, Mountain View, CA, USA
| | | | | | - Brandy McNulty
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, CA, USA
| | | | | | | | - Miten Jain
- Northeastern university, Boston, MA, USA
| | - Karen H Miga
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, CA, USA
| | - Benedict Paten
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, CA, USA
| | - Pi-Chuan Chang
- Google Inc, 1600 Amphitheatre Pkwy, Mountain View, CA, USA
| | - Andrew Carroll
- Google Inc, 1600 Amphitheatre Pkwy, Mountain View, CA, USA.
| | - Kishwar Shafin
- Google Inc, 1600 Amphitheatre Pkwy, Mountain View, CA, USA.
| |
Collapse
|
2
|
Overgaard CK, Jamy M, Radutoiu S, Burki F, Dueholm MKD. Benchmarking long-read sequencing strategies for obtaining ASV-resolved rRNA operons from environmental microeukaryotes. Mol Ecol Resour 2024:e13991. [PMID: 38979877 DOI: 10.1111/1755-0998.13991] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2023] [Revised: 05/31/2024] [Accepted: 06/26/2024] [Indexed: 07/10/2024]
Abstract
The use of short-read metabarcoding for classifying microeukaryotes is challenged by the lack of comprehensive 18S rRNA reference databases. While recent advances in high-throughput long-read sequencing provide the potential to greatly increase the phylogenetic coverage of these databases, the performance of different sequencing technologies and subsequent bioinformatics processing remain to be evaluated, primarily because of the absence of well-defined eukaryotic mock communities. To address this challenge, we created a eukaryotic rRNA operon clone-library and turned it into a precisely defined synthetic eukaryotic mock community. This mock community was then used to evaluate the performance of three long-read sequencing strategies (PacBio circular consensus sequencing and two Nanopore approaches using unique molecular identifiers) and three tools for resolving amplicons sequence variants (ASVs) (USEARCH, VSEARCH, and DADA2). We investigated the sensitivity of the sequencing techniques based on the number of detected mock taxa, and the accuracy of the different ASV-calling tools with a specific focus on the presence of chimera among the final rRNA operon ASVs. Based on our findings, we provide recommendations and best practice protocols for how to cost-effectively obtain essentially error-free rRNA operons in high-throughput. An agricultural soil sample was used to demonstrate that the sequencing and bioinformatic results from the mock community also translates to highly diverse natural samples, which enables us to identify previously undescribed microeukaryotic lineages.
Collapse
Affiliation(s)
| | - Mahwash Jamy
- Department of Organismal Biology (Systematic Biology), Uppsala University, Uppsala, Sweden
- Department of Aquatic Sciences and Assessment; Division of Microbial Ecology, Uppsala University, Uppsala, Sweden
| | - Simona Radutoiu
- Department of Molecular Biology and Genetics, Aarhus University, Aarhus, Denmark
| | - Fabien Burki
- Department of Organismal Biology (Systematic Biology), Uppsala University, Uppsala, Sweden
| | - Morten Kam Dahl Dueholm
- Center for Microbial Communities, Department of Chemistry and Bioscience, Aalborg University, Aalborg, Denmark
| |
Collapse
|
3
|
Luan T, Commichaux S, Hoffmann M, Jayeola V, Jang JH, Pop M, Rand H, Luo Y. Benchmarking short and long read polishing tools for nanopore assemblies: achieving near-perfect genomes for outbreak isolates. BMC Genomics 2024; 25:679. [PMID: 38978005 PMCID: PMC11232133 DOI: 10.1186/s12864-024-10582-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2024] [Accepted: 07/01/2024] [Indexed: 07/10/2024] Open
Abstract
BACKGROUND Oxford Nanopore provides high throughput sequencing platforms able to reconstruct complete bacterial genomes with 99.95% accuracy. However, even small levels of error can obscure the phylogenetic relationships between closely related isolates. Polishing tools have been developed to correct these errors, but it is uncertain if they obtain the accuracy needed for the high-resolution source tracking of foodborne illness outbreaks. RESULTS We tested 132 combinations of assembly and short- and long-read polishing tools to assess their accuracy for reconstructing the genome sequences of 15 highly similar Salmonella enterica serovar Newport isolates from a 2020 onion outbreak. While long-read polishing alone improved accuracy, near perfect accuracy (99.9999% accuracy or ~ 5 nucleotide errors across the 4.8 Mbp genome, excluding low confidence regions) was only obtained by pipelines that combined both long- and short-read polishing tools. Notably, medaka was a more accurate and efficient long-read polisher than Racon. Among short-read polishers, NextPolish showed the highest accuracy, but Pilon, Polypolish, and POLCA performed similarly. Among the 5 best performing pipelines, polishing with medaka followed by NextPolish was the most common combination. Importantly, the order of polishing tools mattered i.e., using less accurate tools after more accurate ones introduced errors. Indels in homopolymers and repetitive regions, where the short reads could not be uniquely mapped, remained the most challenging errors to correct. CONCLUSIONS Short reads are still needed to correct errors in nanopore sequenced assemblies to obtain the accuracy required for source tracking investigations. Our granular assessment of the performance of the polishing pipelines allowed us to suggest best practices for tool users and areas for improvement for tool developers.
Collapse
Affiliation(s)
- Tu Luan
- Department of Computer Science, University of Maryland, College Park, MD, 20742, USA
| | - Seth Commichaux
- Center for Food Safety and Applied Nutrition, Food and Drug Administration, Laurel, MD, 20708, USA.
| | - Maria Hoffmann
- Center for Food Safety and Applied Nutrition, Food and Drug Administration, College Park, MD, 20740, USA
| | - Victor Jayeola
- Center for Food Safety and Applied Nutrition, Food and Drug Administration, College Park, MD, 20740, USA
| | - Jae Hee Jang
- Center for Food Safety and Applied Nutrition, Food and Drug Administration, College Park, MD, 20740, USA
| | - Mihai Pop
- Department of Computer Science, University of Maryland, College Park, MD, 20742, USA
| | - Hugh Rand
- Center for Food Safety and Applied Nutrition, Food and Drug Administration, College Park, MD, 20740, USA
| | - Yan Luo
- Center for Food Safety and Applied Nutrition, Food and Drug Administration, College Park, MD, 20740, USA
| |
Collapse
|
4
|
Hong YP, Chen BH, Wang YW, Teng RH, Wei HL, Chiou CS. The usefulness of nanopore sequencing in whole-genome sequencing-based genotyping of Listeria monocytogenes and Salmonella enterica serovar Enteritidis. Microbiol Spectr 2024; 12:e0050924. [PMID: 38809017 PMCID: PMC11218467 DOI: 10.1128/spectrum.00509-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2024] [Accepted: 05/01/2024] [Indexed: 05/30/2024] Open
Abstract
Bacterial genotyping through whole-genome sequencing plays a crucial role in disease surveillance and outbreak investigations in public health laboratories. This study assessed the effectiveness of Oxford Nanopore Technologies (ONT) sequencing in the genotyping of Listeria monocytogenes and Salmonella enterica serovar Enteritidis. Our results indicated that ONT sequences, generated with the R10.4.1 flow cell and basecalled using the Dorado 0.5.0 Super Accurate 4.3 model, exhibited comparable accuracy to Illumina sequences, effectively discriminating among bacterial strains from outbreaks. These findings suggest that ONT sequencing has the potential to be a promising tool for rapid whole-genome sequencing of bacterial pathogens in public health laboratories for epidemiological investigations. IMPORTANCE This study unveils that Oxford Nanopore Technologies sequencing, by itself, holds the potential to serve as a whole-genome sequencing-based genotyping tool in public health laboratories, enabling routine subtyping of bacterial isolates for disease surveillance and outbreak investigations.
Collapse
Affiliation(s)
- Yu-Ping Hong
- Central Region Laboratory, Center for Diagnostics and Vaccine Development, Centers for Disease Control, Taichung, Taiwan
| | - Bo-Han Chen
- Central Region Laboratory, Center for Diagnostics and Vaccine Development, Centers for Disease Control, Taichung, Taiwan
| | - You-Wun Wang
- Central Region Laboratory, Center for Diagnostics and Vaccine Development, Centers for Disease Control, Taichung, Taiwan
| | - Ru-Hsiou Teng
- Central Region Laboratory, Center for Diagnostics and Vaccine Development, Centers for Disease Control, Taichung, Taiwan
| | - Hsiao-Lun Wei
- Central Region Laboratory, Center for Diagnostics and Vaccine Development, Centers for Disease Control, Taichung, Taiwan
| | - Chien-Shun Chiou
- Central Region Laboratory, Center for Diagnostics and Vaccine Development, Centers for Disease Control, Taichung, Taiwan
| |
Collapse
|
5
|
Hejazi S, Ahsan A, Kashani M, Reuel NF. Amplified DNA Heterogeneity Assessment with Oxford Nanopore Sequencing Applied to Cell Free Expression Templates. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.02.597048. [PMID: 38895213 PMCID: PMC11185537 DOI: 10.1101/2024.06.02.597048] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/21/2024]
Abstract
In this work, Oxford Nanopore sequencing is tested as an accessible method for quantifying heterogeneity of amplified DNA. This method enables rapid quantification of deletions, insertions, and substitutions, the probability of each mutation error, and their locations in the replicated sequences. Amplification techniques tested were conventional polymerase chain reaction (PCR) with varying levels of polymerase fidelity (OneTaq, Phusion, and Q5) as well as rolling circle amplification (RCA) with Phi29 polymerase. Plasmid amplification using bacteria was also assessed. By analyzing the distribution of errors in a large set of sequences for each sample, we examined the heterogeneity and mode of errors in each sample. This analysis revealed that Q5 and Phusion polymerases exhibited the lowest error rates observed in the amplified DNA. As a secondary validation, we analyzed the emission spectra of sfGFP fluorescent proteins synthesized with amplified DNA using cell free expression. Error-prone polymerase chain reactions confirmed the dependency of reporter protein emission spectra peak broadness to DNA error rates. The presented nanopore sequencing methods serve as a roadmap to quantify the accuracy of other gene amplification techniques, as they are discovered, enabling more homogenous cell-free expression of desired proteins.
Collapse
Affiliation(s)
- Sepehr Hejazi
- Chemical and Biological Engineering - Iowa State University, 618 Bissell Rd, Ames, IA 50011
| | - Afrin Ahsan
- Chemical and Biological Engineering - Iowa State University, 618 Bissell Rd, Ames, IA 50011
| | - Mohammad Kashani
- Electrical and Computer Engineering - Iowa State University, 618 Bissell Rd, Ames, IA 50011
| | - Nigel F Reuel
- Chemical and Biological Engineering - Iowa State University, 618 Bissell Rd, Ames, IA 50011
| |
Collapse
|
6
|
Chen S, Wang H, Zhang D, Chen R, Luo J. Readon: a novel algorithm to identify read-through transcripts with long-read sequencing data. Bioinformatics 2024; 40:btae336. [PMID: 38808568 PMCID: PMC11162696 DOI: 10.1093/bioinformatics/btae336] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2024] [Revised: 04/30/2024] [Accepted: 05/26/2024] [Indexed: 05/30/2024] Open
Abstract
MOTIVATION There are many clustered transcriptionally active regions in the human genome, in which the transcription complex cannot immediately terminate transcription at the upstream gene termination site, but instead continues to transcribe intergenic regions and downstream genes, resulting in read-through transcripts. Several studies have demonstrated the regulatory roles of read-through transcripts in tumorigenesis and development. However, limited by the read length of next-generation sequencing, discovery of read-through transcripts has been slow. For long but also erroneous third-generation sequencing data, this study developed a novel minimizer sketch algorithm to accurately and quickly identify read-through transcripts. RESULTS Readon initially splits the reference sequence into distinct active regions. It employs a sliding window approach within each region, calculates minimizers, and constructs the specialized structured arrays for query indexing. Following initial alignment anchor screening of candidate read-through transcripts, further confirmation steps are executed. Comparative assessments against existing software reveal Readon's superior performance on both simulated and validated real data. Additionally, two downstream tools are provided: one for predicting whether a read-through transcript is likely to undergo nonsense-mediated decay or encodes a protein, and another for visualizing splicing patterns. AVAILABILITY AND IMPLEMENTATION Readon is freely available on GitHub (https://github.com/Bulabula45/Readon).
Collapse
Affiliation(s)
- Siang Chen
- Key Laboratory of Epigenetic Regulation and Intervention, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Hao Wang
- Key Laboratory of Epigenetic Regulation and Intervention, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Dongdong Zhang
- Key Laboratory of Epigenetic Regulation and Intervention, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
| | - Runsheng Chen
- Key Laboratory of Epigenetic Regulation and Intervention, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Jianjun Luo
- Key Laboratory of Epigenetic Regulation and Intervention, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| |
Collapse
|
7
|
Hansen MH, Maagaard M, Cédile O, Nyvold CG. SWIGH-SCORE: A translational light-weight approach in computational detection of rearranged immunoglobulin heavy chain to be used in monoclonal lymphoproliferative disorders. MethodsX 2024; 12:102741. [PMID: 38846434 PMCID: PMC11154698 DOI: 10.1016/j.mex.2024.102741] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2024] [Revised: 04/23/2024] [Accepted: 05/02/2024] [Indexed: 06/09/2024] Open
Abstract
We present a lightweight tool for clonotyping and measurable residual disease (MRD) assessment in monoclonal lymphoproliferative disorders. It is a translational method that enables computational detection of rearranged immunoglobulin heavy chain gene sequences.•The swigh-score clonotyping tool emphasizes parallelization and applicability across sequencing platforms.•The algorithm is based on an adaptation of the Smith-Waterman algorithm for local alignment of reads generated by 2nd and 3rd generation of sequencers.For method validation, we demonstrate the targeted sequences of immunoglobulin heavy chain genes from diagnostic bone marrow using serial dilutions of CD138+ plasma cells from a patient with multiple myeloma. Sequencing libraries from diagnostic samples were prepared for the three sequencing platforms, Ion S5 (Thermo Fisher Scientific), MiSeq (Illumina), and MinION (Oxford Nanopore), using the LymphoTrack assay. Basic quality filtering was performed, and a Smith-Waterman-based swigh-score algorithm was developed in shell and C for clonotyping and MRD assessment using FASTQ data files. Performance is demonstrated across the three different sequencing platforms.
Collapse
Affiliation(s)
- Marcus Høy Hansen
- Haematology-Pathology Research Laboratory, Research Unit of Haematology, Department of Hematology, and Research Unit of Pathology, Department of Pathology, University of Southern Denmark and Odense University Hospital, Odense, Denmark
| | - Markus Maagaard
- Haematology-Pathology Research Laboratory, Research Unit of Haematology, Department of Hematology, and Research Unit of Pathology, Department of Pathology, University of Southern Denmark and Odense University Hospital, Odense, Denmark
| | - Oriane Cédile
- Haematology-Pathology Research Laboratory, Research Unit of Haematology, Department of Hematology, and Research Unit of Pathology, Department of Pathology, University of Southern Denmark and Odense University Hospital, Odense, Denmark
- OPEN, Odense Patient data Explorative Network, Haematology-Pathology Research Laboratory, Odense University Hospital, Odense, Denmark
| | - Charlotte Guldborg Nyvold
- Haematology-Pathology Research Laboratory, Research Unit of Haematology, Department of Hematology, and Research Unit of Pathology, Department of Pathology, University of Southern Denmark and Odense University Hospital, Odense, Denmark
- OPEN, Odense Patient data Explorative Network, Haematology-Pathology Research Laboratory, Odense University Hospital, Odense, Denmark
| |
Collapse
|
8
|
Gribling-Burrer AS, Bohn P, Smyth RP. Isoform-specific RNA structure determination using Nano-DMS-MaP. Nat Protoc 2024; 19:1835-1865. [PMID: 38347203 DOI: 10.1038/s41596-024-00959-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2023] [Accepted: 12/12/2023] [Indexed: 06/12/2024]
Abstract
RNA structure determination is essential to understand how RNA carries out its diverse biological functions. In cells, RNA isoforms are readily expressed with partial variations within their sequences due, for example, to alternative splicing, heterogeneity in the transcription start site, RNA processing or differential termination/polyadenylation. Nanopore dimethyl sulfate mutational profiling (Nano-DMS-MaP) is a method for in situ isoform-specific RNA structure determination. Unlike similar methods that rely on short sequencing reads, Nano-DMS-MaP employs nanopore sequencing to resolve the structures of long and highly similar RNA molecules to reveal their previously hidden structural differences. This Protocol describes the development and applications of Nano-DMS-MaP and outlines the main considerations for designing and implementing a successful experiment: from bench to data analysis. In cell probing experiments can be carried out by an experienced molecular biologist in 3-4 d. Data analysis requires good knowledge of command line tools and Python scripts and requires a further 3-5 d.
Collapse
Affiliation(s)
- Anne-Sophie Gribling-Burrer
- Helmholtz Institute for RNA-based Infection Research, Helmholtz Centre for Infection Research, Würzburg, Germany.
| | - Patrick Bohn
- Helmholtz Institute for RNA-based Infection Research, Helmholtz Centre for Infection Research, Würzburg, Germany.
| | - Redmond P Smyth
- Helmholtz Institute for RNA-based Infection Research, Helmholtz Centre for Infection Research, Würzburg, Germany.
- Faculty of Medicine, University of Würzburg, Würzburg, Germany.
| |
Collapse
|
9
|
Agustinho DP, Fu Y, Menon VK, Metcalf GA, Treangen TJ, Sedlazeck FJ. Unveiling microbial diversity: harnessing long-read sequencing technology. Nat Methods 2024; 21:954-966. [PMID: 38689099 DOI: 10.1038/s41592-024-02262-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2022] [Accepted: 03/29/2024] [Indexed: 05/02/2024]
Abstract
Long-read sequencing has recently transformed metagenomics, enhancing strain-level pathogen characterization, enabling accurate and complete metagenome-assembled genomes, and improving microbiome taxonomic classification and profiling. These advancements are not only due to improvements in sequencing accuracy, but also happening across rapidly changing analysis methods. In this Review, we explore long-read sequencing's profound impact on metagenomics, focusing on computational pipelines for genome assembly, taxonomic characterization and variant detection, to summarize recent advancements in the field and provide an overview of available analytical methods to fully leverage long reads. We provide insights into the advantages and disadvantages of long reads over short reads and their evolution from the early days of long-read sequencing to their recent impact on metagenomics and clinical diagnostics. We further point out remaining challenges for the field such as the integration of methylation signals in sub-strain analysis and the lack of benchmarks.
Collapse
Affiliation(s)
- Daniel P Agustinho
- Human Genome Sequencing center, Baylor College of Medicine, Houston, TX, USA
| | - Yilei Fu
- Department of Computer Science, Rice University, Houston, TX, USA
| | - Vipin K Menon
- Human Genome Sequencing center, Baylor College of Medicine, Houston, TX, USA
- Senior research project manager, Human Genetics, Genentech, South San Francisco, CA, USA
| | - Ginger A Metcalf
- Human Genome Sequencing center, Baylor College of Medicine, Houston, TX, USA
| | - Todd J Treangen
- Department of Computer Science, Rice University, Houston, TX, USA
- Department of Bioengineering, Rice University, Houston, TX, USA
| | - Fritz J Sedlazeck
- Human Genome Sequencing center, Baylor College of Medicine, Houston, TX, USA.
- Department of Computer Science, Rice University, Houston, TX, USA.
| |
Collapse
|
10
|
Ye L, Liu X, Ni Y, Xu Y, Zheng Z, Chen K, Hu Q, Tan L, Guo Z, Wai CK, Chan EWC, Li R, Chen S. Comprehensive genomic and plasmid characterization of multidrug-resistant bacterial strains by R10.4.1 nanopore sequencing. Microbiol Res 2024; 283:127666. [PMID: 38460283 DOI: 10.1016/j.micres.2024.127666] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2023] [Revised: 01/17/2024] [Accepted: 02/25/2024] [Indexed: 03/11/2024]
Abstract
The escalating prevalence of multidrug-resistant (MDR) bacteria pose a significant public health threat. Understanding the genomic features and deciphering the antibiotic resistance profiles of these pathogens is crucial for development of effective surveillance and treatment strategies. In this study, we employed the R10.4.1 nanopore sequencing technology, specifically through the use of the MinION platform, to analyze eight MDR bacterial strains originating from clinical, ecological and food sources. A single 72-hour sequencing run could yield approximately 12 million reads which covered a total of 34 gigabases (Gbp). The nanopore R10.4.1 data was processed using the Flye assembler, successfully assembling the genomes of eight bacterial strains and their 18 plasmids. Notably, the assemblies generated solely from R10.4.1 nanopore data closely matched those from next-generation sequencing data. Diverse antibiotic resistance patterns and specific resistance genes in the test strains were identified. Hospital strains that exhibited multidrug resistance were found to harbor various resistance genes that encode efflux pumps and extended-spectrum β-lactamases. Environmental and food sources were found to display resistance profiles in a species-specific manner. The composition of structurally complex plasmids in the test strains could also be revealed by analysis of nanopore long reads, which also suggested evidence of horizontal transfer of plasmids between different bacterial species. These findings provide valuable insights into the genetic characteristics of MDR bacteria and demonstrating the practicality of nanopore sequencing technology for detecting of resistance elements in bacterial pathogens.
Collapse
Affiliation(s)
- Lianwei Ye
- Department of Infectious Diseases and Public Health, Jockey Club College of Veterinary Medicine and Life Sciences, City University of Hong Kong, Kowloon, Hong Kong, SAR China; State Key Lab of Chemical Biology and Drug Discovery and the Department of Food Science and Nutrition, The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong, SAR China
| | - Xudong Liu
- Department of Infectious Diseases and Public Health, Jockey Club College of Veterinary Medicine and Life Sciences, City University of Hong Kong, Kowloon, Hong Kong, SAR China
| | - Ying Ni
- Department of Biomedical Sciences and Tung Biomedical Sciences Centre, City University of Hong Kong, Hong Kong, Hong Kong, SAR China; Department of Precision Diagnostic and Therapeutic Technology, City University of Hong Kong Shenzhen Futian Research Institute, Shenzhen, Guangdong, China
| | - Yating Xu
- Department of Infectious Diseases and Public Health, Jockey Club College of Veterinary Medicine and Life Sciences, City University of Hong Kong, Kowloon, Hong Kong, SAR China; State Key Lab of Chemical Biology and Drug Discovery and the Department of Food Science and Nutrition, The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong, SAR China
| | - Zhiwei Zheng
- State Key Lab of Chemical Biology and Drug Discovery and the Department of Food Science and Nutrition, The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong, SAR China; Shenzhen Key Lab for Biological Safety Control, The Hong Kong Polytechnic University Shenzhen Research Institute, Shenzhen, China
| | - Kaichao Chen
- State Key Lab of Chemical Biology and Drug Discovery and the Department of Food Science and Nutrition, The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong, SAR China
| | - Qiao Hu
- Department of Infectious Diseases and Public Health, Jockey Club College of Veterinary Medicine and Life Sciences, City University of Hong Kong, Kowloon, Hong Kong, SAR China; State Key Lab of Chemical Biology and Drug Discovery and the Department of Food Science and Nutrition, The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong, SAR China
| | - Lu Tan
- Department of Infectious Diseases and Public Health, Jockey Club College of Veterinary Medicine and Life Sciences, City University of Hong Kong, Kowloon, Hong Kong, SAR China
| | - Zhihao Guo
- Department of Infectious Diseases and Public Health, Jockey Club College of Veterinary Medicine and Life Sciences, City University of Hong Kong, Kowloon, Hong Kong, SAR China
| | - Chan Kwan Wai
- State Key Lab of Chemical Biology and Drug Discovery and the Department of Food Science and Nutrition, The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong, SAR China
| | - Edward Wai Chi Chan
- State Key Lab of Chemical Biology and Drug Discovery and the Department of Food Science and Nutrition, The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong, SAR China
| | - Runsheng Li
- Department of Infectious Diseases and Public Health, Jockey Club College of Veterinary Medicine and Life Sciences, City University of Hong Kong, Kowloon, Hong Kong, SAR China; Department of Precision Diagnostic and Therapeutic Technology, City University of Hong Kong Shenzhen Futian Research Institute, Shenzhen, Guangdong, China.
| | - Sheng Chen
- State Key Lab of Chemical Biology and Drug Discovery and the Department of Food Science and Nutrition, The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong, SAR China; Shenzhen Key Lab for Biological Safety Control, The Hong Kong Polytechnic University Shenzhen Research Institute, Shenzhen, China; Animal Health Research Center, City University of Hong Kong Chengdu Research Institute, Chengdu, China.
| |
Collapse
|
11
|
Guo Z, Ni Y, Tan L, Shao Y, Ye L, Chen S, Li R. Nanopore Current Events Magnifier (nanoCEM): a novel tool for visualizing current events at modification sites of nanopore sequencing. NAR Genom Bioinform 2024; 6:lqae052. [PMID: 38774513 PMCID: PMC11106030 DOI: 10.1093/nargab/lqae052] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2023] [Revised: 04/23/2024] [Accepted: 05/05/2024] [Indexed: 05/24/2024] Open
Abstract
Nanopore sequencing technologies have enabled the direct detection of base modifications in DNA or RNA molecules. Despite these advancements, the tools for visualizing electrical current, essential for analyzing base modifications, are often lacking in clarity and compatibility with diverse nanopore pipelines. Here, we present Nanopore Current Events Magnifier (nanoCEM, https://github.com/lrslab/nanoCEM), a Python command-line tool designed to facilitate the identification of DNA/RNA modification sites through enhanced visualization and statistical analysis. Compatible with the four preprocessing methods including 'f5c resquiggle', 'f5c eventalign', 'Tombo' and 'move table', nanoCEM is applicable to RNA and DNA analysis across multiple flow cell types. By utilizing rescaling techniques and calculating various statistical features, nanoCEM provides more accurate and comparable visualization of current events, allowing researchers to effectively observe differences between samples and showcase the modified sites.
Collapse
Affiliation(s)
- Zhihao Guo
- Department of Infectious Diseases and Public Health, Jockey Club College of Veterinary Medicine and Life Sciences, City University of Hong Kong, Hong Kong, China
| | - Ying Ni
- Tung Biomedical Sciences Centre, City University of Hong Kong, Hong Kong, China
| | - Lu Tan
- Department of Infectious Diseases and Public Health, Jockey Club College of Veterinary Medicine and Life Sciences, City University of Hong Kong, Hong Kong, China
| | - Yanwen Shao
- Department of Infectious Diseases and Public Health, Jockey Club College of Veterinary Medicine and Life Sciences, City University of Hong Kong, Hong Kong, China
| | - Lianwei Ye
- Department of Infectious Diseases and Public Health, Jockey Club College of Veterinary Medicine and Life Sciences, City University of Hong Kong, Hong Kong, China
| | - Sheng Chen
- State Key Lab of Chemical Biology and Drug Discovery and the Department of Food Science and Nutrition, The Hong Kong Polytechnic University, Kowloon, Hong Kong SAR
| | - Runsheng Li
- Department of Infectious Diseases and Public Health, Jockey Club College of Veterinary Medicine and Life Sciences, City University of Hong Kong, Hong Kong, China
- Tung Biomedical Sciences Centre, City University of Hong Kong, Hong Kong, China
- Department of Precision Diagnostic and Therapeutic Technology, City University of Hong Kong Shenzhen Futian Research Institute, Shenzhen, Guangdong, China
| |
Collapse
|
12
|
Wattanasombat S, Tongjai S. Easing genomic surveillance: A comprehensive performance evaluation of long-read assemblers across multi-strain mixture data of HIV-1 and Other pathogenic viruses for constructing a user-friendly bioinformatic pipeline. F1000Res 2024; 13:556. [PMID: 38984017 PMCID: PMC11231628 DOI: 10.12688/f1000research.149577.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 05/14/2024] [Indexed: 07/11/2024] Open
Abstract
Background Determining the appropriate computational requirements and software performance is essential for efficient genomic surveillance. The lack of standardized benchmarking complicates software selection, especially with limited resources. Methods We developed a containerized benchmarking pipeline to evaluate seven long-read assemblers-Canu, GoldRush, MetaFlye, Strainline, HaploDMF, iGDA, and RVHaplo-for viral haplotype reconstruction, using both simulated and experimental Oxford Nanopore sequencing data of HIV-1 and other viruses. Benchmarking was conducted on three computational systems to assess each assembler's performance, utilizing QUAST and BLASTN for quality assessment. Results Our findings show that assembler choice significantly impacts assembly time, with CPU and memory usage having minimal effect. Assembler selection also influences the size of the contigs, with a minimum read length of 2,000 nucleotides required for quality assembly. A 4,000-nucleotide read length improves quality further. Canu was efficient among de novo assemblers but not suitable for multi-strain mixtures, while GoldRush produced only consensus assemblies. Strainline and MetaFlye were suitable for metagenomic sequencing data, with Strainline requiring high memory and MetaFlye operable on low-specification machines. Among reference-based assemblers, iGDA had high error rates, RVHaplo showed the best runtime and accuracy but became ineffective with similar sequences, and HaploDMF, utilizing machine learning, had fewer errors with a slightly longer runtime. Conclusions The HIV-64148 pipeline, containerized using Docker, facilitates easy deployment and offers flexibility to select from a range of assemblers to match computational systems or study requirements. This tool aids in genome assembly and provides valuable information on HIV-1 sequences, enhancing viral evolution monitoring and understanding.
Collapse
Affiliation(s)
- Sara Wattanasombat
- Department of Microbiology, Faculty of Medicine, Chiang Mai University, Chiang Mai, 50200, Thailand
| | - Siripong Tongjai
- Department of Microbiology, Faculty of Medicine, Chiang Mai University, Chiang Mai, 50200, Thailand
| |
Collapse
|
13
|
Sapan V, Simsek SZ, Filoğlu G, Bulbul O. Forensic DNA phenotyping using Oxford Nanopore Sequencing system. Electrophoresis 2024. [PMID: 38794987 DOI: 10.1002/elps.202300252] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2023] [Revised: 03/27/2024] [Accepted: 05/08/2024] [Indexed: 05/27/2024]
Abstract
In forensic science, the demand for precision, consistency, and cost-effectiveness has driven the exploration of next-generation sequencing technologies. This study investigates the potential of Oxford Nanopore Sequencing (ONT) Technology for analyzing the HIrisPlex-S panel, a set of 41 single nucleotide polymorphism (SNP) markers used to predict eye, hair, and skin color. Using ONT sequencing, we assessed the accuracy and reliability of ONT-generated data by comparing it with conventional capillary electrophoresis (CE) in 18 samples. The Guppy v6.1 was used as a basecaller, and sample profiles were obtained using Burrows-Wheeler Aligner, Samtools, BCFtools, and Python. Comparing accuracy with CE, we found that 62% of SNPs in ONT-unligated samples were correctly genotyped, with 36% showing allele dropout, and 2% being incorrectly genotyped. In the ONT-ligated samples, 85% of SNPs were correctly genotyped, with 10% showing allele dropout, and 5% being incorrectly genotyped. Our findings indicate that ONT, particularly when combined with ligation, enhances genotyping accuracy and coverage, thereby reducing allele dropouts. However, challenges associated with the technology's error rates and the impact on genotyping accuracy are recognized. Phenotype predictions based on ONT data demonstrate varying degrees of success, with the technology showing high accuracy in several cases. Although ONT technology holds promise in forensic genetics, further optimization and quality control measures are essential to harness its full potential. This study contributes to the ongoing efforts to refine sequence read tuning and improve correction tools in the context of ONT technology's application in forensic genetics.
Collapse
Affiliation(s)
- Veysel Sapan
- Institute of Forensic Sciences and Legal Medicine, Istanbul University-Cerrahpasa, Istanbul, Turkey
| | - Sumeyye Zulal Simsek
- Institute of Forensic Sciences and Legal Medicine, Istanbul University-Cerrahpasa, Istanbul, Turkey
| | - Gonul Filoğlu
- Institute of Forensic Sciences and Legal Medicine, Istanbul University-Cerrahpasa, Istanbul, Turkey
| | - Ozlem Bulbul
- Institute of Forensic Sciences and Legal Medicine, Istanbul University-Cerrahpasa, Istanbul, Turkey
| |
Collapse
|
14
|
Szakállas N, Barták BK, Valcz G, Nagy ZB, Takács I, Molnár B. Can long-read sequencing tackle the barriers, which the next-generation could not? A review. Pathol Oncol Res 2024; 30:1611676. [PMID: 38818014 PMCID: PMC11137202 DOI: 10.3389/pore.2024.1611676] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/10/2024] [Accepted: 04/30/2024] [Indexed: 06/01/2024]
Abstract
The large-scale heterogeneity of genetic diseases necessitated the deeper examination of nucleotide sequence alterations enhancing the discovery of new targeted drug attack points. The appearance of new sequencing techniques was essential to get more interpretable genomic data. In contrast to the previous short-reads, longer lengths can provide a better insight into the potential health threatening genetic abnormalities. Long-reads offer more accurate variant identification and genome assembly methods, indicating advances in nucleotide deflect-related studies. In this review, we introduce the historical background of sequencing technologies and show their benefits and limits, as well. Furthermore, we highlight the differences between short- and long-read approaches, including their unique advances and difficulties in methodologies and evaluation. Additionally, we provide a detailed description of the corresponding bioinformatics and the current applications.
Collapse
Affiliation(s)
- Nikolett Szakállas
- Department of Biological Physics, Faculty of Science, Eötvös Loránd University, Budapest, Hungary
| | - Barbara K. Barták
- Department of Internal Medicine and Oncology, Faculty of Medicine, Semmelweis University, Budapest, Hungary
| | - Gábor Valcz
- Department of Internal Medicine and Oncology, Faculty of Medicine, Semmelweis University, Budapest, Hungary
- HUN-REN-SU Translational Extracellular Vesicle Research Group, Budapest, Hungary
| | - Zsófia B. Nagy
- Department of Internal Medicine and Oncology, Faculty of Medicine, Semmelweis University, Budapest, Hungary
| | - István Takács
- Department of Internal Medicine and Oncology, Faculty of Medicine, Semmelweis University, Budapest, Hungary
| | - Béla Molnár
- Department of Internal Medicine and Oncology, Faculty of Medicine, Semmelweis University, Budapest, Hungary
| |
Collapse
|
15
|
Su C, Chandradoss KR, Malachowski T, Boya R, Ryu HS, Brennand KJ, Phillips-Cremins JE. MASTR-seq: Multiplexed Analysis of Short Tandem Repeats with sequencing. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.29.591790. [PMID: 38746155 PMCID: PMC11092654 DOI: 10.1101/2024.04.29.591790] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/16/2024]
Abstract
More than 60 human disorders have been linked to unstable expansion of short tandem repeat (STR) tracts. STR length and the extent of DNA methylation is linked to disease pathology and can be mosaic in a cell type-specific manner in several repeat expansion disorders. Mosaic phenomenon have been difficult to study to date due to technical bias intrinsic to repeat sequences and the need for multi-modal measurements at single-allele resolution. Nanopore long-read sequencing accurately measures STR length and DNA methylation in the same single molecule but is cost prohibitive for studies assessing a target locus across multiple experimental conditions or patient samples. Here, we describe MASTR-seq, M ultiplexed A nalysis of S hort T andem R epeats, for cost-effective, high-throughput, accurate, multi-modal measurements of DNA methylation and STR genotype at single-allele resolution. MASTR-seq couples long-read sequencing, Cas9-mediated target enrichment, and PCR-free multiplexed barcoding to achieve a >ten-fold increase in on-target read mapping for 8-12 pooled samples in a single MinION flow cell. We provide a detailed experimental protocol and computational tools and present evidence that MASTR-seq quantifies tract length and DNA methylation status for CGG and CAG STR loci in normal-length and mutation-length human cell lines. The MASTR-seq protocol takes approximately eight days for experiments and one additional day for data processing and analyses. Key points We provide a protocol for MASTR-seq: M ultiplexed A nalysis of S hort T andem R epeats using Cas9-mediated target enrichment and PCR-free, multiplexed nanopore sequencing. MASTR-seq achieves a >10-fold increase in on-target read proportion for highly repetitive, technically inaccessible regions of the genome relevant for human health and disease.MASTR-seq allows for high-throughput, efficient, accurate, and cost-effective measurement of STR length and DNA methylation in the same single allele for up to 8-12 samples in parallel in one Nanopore MinION flow cell.
Collapse
|
16
|
Xu Z, Qu HQ, Chan J, Kao C, Hakonarson H, Wang K. Single-Cell Omics for Transcriptome CHaracterization (SCOTCH): isoform-level characterization of gene expression through long-read single-cell RNA sequencing. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.29.590597. [PMID: 38746128 PMCID: PMC11092450 DOI: 10.1101/2024.04.29.590597] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/16/2024]
Abstract
The advent of long-read single-cell transcriptome sequencing (lr-scRNA-Seq) represents a significant leap forward in single-cell genomics. With the recent introduction of R10 flowcells by Oxford Nanopore, we propose that previous computational methods designed to handle high sequencing error rates are no longer relevant, and that the prevailing approach using short reads to compile "barcode space" (candidate barcode list) to de-multiplex long reads are no longer necessary. Instead, computational methods should now shift focus on harnessing the unique benefits of long reads to analyze transcriptome complexity. In this context, we introduce a comprehensive suite of computational methods named Single-Cell Omics for Transcriptome CHaracterization (SCOTCH). Our method is compatible with the single-cell library preparation platform from both 10X Genomics and Parse Biosciences, facilitating the analysis of special cell populations, such as neurons, hepatocytes and developing cardiomyocytes. We specifically re-formulated the transcript mapping problem with a compatibility matrix and addressed the multiple-mapping issue using probabilistic inference, which allows the discovery of novel isoforms as well as the detection of differential isoform usage between cell populations. We evaluated SCOTCH through analysis of real data across different combinations of single-cell libraries and sequencing technologies (10X + Illumina, Parse + Illumina, 10X + Nanopore_R9, 10X + Nanopore_R10, Parse + Nanopore_R10), and showed its ability to infer novel biological insights on cell type-specific isoform expression. These datasets enhance the availability of publicly available data for continued development of computational approaches. In summary, SCOTCH allows extraction of more biological insights from the new advancements in single-cell library construction and sequencing technologies, facilitating the examination of transcriptome complexity at the single-cell level.
Collapse
Affiliation(s)
- Zhuoran Xu
- Graduate Group in Genomics and Computational Biology, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, 19104, USA
- Raymond G. Perelman Center for Cellular and Molecular Therapeutics, Children’s Hospital of Philadelphia, Philadelphia, PA, 19104, USA
| | - Hui-Qi Qu
- The Center for Applied Genomics, Children’s Hospital of Philadelphia, Philadelphia, Pennsylvania, 19104, USA
| | - Joe Chan
- Raymond G. Perelman Center for Cellular and Molecular Therapeutics, Children’s Hospital of Philadelphia, Philadelphia, PA, 19104, USA
| | - Charlly Kao
- The Center for Applied Genomics, Children’s Hospital of Philadelphia, Philadelphia, Pennsylvania, 19104, USA
| | - Hakon Hakonarson
- The Center for Applied Genomics, Children’s Hospital of Philadelphia, Philadelphia, Pennsylvania, 19104, USA
- Department of Pediatrics, The Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, 19104, USA
| | - Kai Wang
- Raymond G. Perelman Center for Cellular and Molecular Therapeutics, Children’s Hospital of Philadelphia, Philadelphia, PA, 19104, USA
- Department of Pathology and Laboratory Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA
| |
Collapse
|
17
|
Yuan CU, Quah FX, Hemberg M. Single-cell and spatial transcriptomics: Bridging current technologies with long-read sequencing. Mol Aspects Med 2024; 96:101255. [PMID: 38368637 DOI: 10.1016/j.mam.2024.101255] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2023] [Revised: 01/30/2024] [Accepted: 02/07/2024] [Indexed: 02/20/2024]
Abstract
Single-cell technologies have transformed biomedical research over the last decade, opening up new possibilities for understanding cellular heterogeneity, both at the genomic and transcriptomic level. In addition, more recent developments of spatial transcriptomics technologies have made it possible to profile cells in their tissue context. In parallel, there have been substantial advances in sequencing technologies, and the third generation of methods are able to produce reads that are tens of kilobases long, with error rates matching the second generation short reads. Long reads technologies make it possible to better map large genome rearrangements and quantify isoform specific abundances. This further improves our ability to characterize functionally relevant heterogeneity. Here, we show how researchers have begun to combine single-cell, spatial transcriptomics, and long-read technologies, and how this is resulting in powerful new approaches to profiling both the genome and the transcriptome. We discuss the achievements so far, and we highlight remaining challenges and opportunities.
Collapse
Affiliation(s)
- Chengwei Ulrika Yuan
- Department of Biochemistry, University of Cambridge, Cambridge, UK; Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK
| | - Fu Xiang Quah
- Department of Biochemistry, University of Cambridge, Cambridge, UK
| | - Martin Hemberg
- Gene Lay Institute, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA.
| |
Collapse
|
18
|
Olivucci G, Iovino E, Innella G, Turchetti D, Pippucci T, Magini P. Long read sequencing on its way to the routine diagnostics of genetic diseases. Front Genet 2024; 15:1374860. [PMID: 38510277 PMCID: PMC10951082 DOI: 10.3389/fgene.2024.1374860] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2024] [Accepted: 02/26/2024] [Indexed: 03/22/2024] Open
Abstract
The clinical application of technological progress in the identification of DNA alterations has always led to improvements of diagnostic yields in genetic medicine. At chromosome side, from cytogenetic techniques evaluating number and gross structural defects to genomic microarrays detecting cryptic copy number variants, and at molecular level, from Sanger method studying the nucleotide sequence of single genes to the high-throughput next-generation sequencing (NGS) technologies, resolution and sensitivity progressively increased expanding considerably the range of detectable DNA anomalies and alongside of Mendelian disorders with known genetic causes. However, particular genomic regions (i.e., repetitive and GC-rich sequences) are inefficiently analyzed by standard genetic tests, still relying on laborious, time-consuming and low-sensitive approaches (i.e., southern-blot for repeat expansion or long-PCR for genes with highly homologous pseudogenes), accounting for at least part of the patients with undiagnosed genetic disorders. Third generation sequencing, generating long reads with improved mappability, is more suitable for the detection of structural alterations and defects in hardly accessible genomic regions. Although recently implemented and not yet clinically available, long read sequencing (LRS) technologies have already shown their potential in genetic medicine research that might greatly impact on diagnostic yield and reporting times, through their translation to clinical settings. The main investigated LRS application concerns the identification of structural variants and repeat expansions, probably because techniques for their detection have not evolved as rapidly as those dedicated to single nucleotide variants (SNV) identification: gold standard analyses are karyotyping and microarrays for balanced and unbalanced chromosome rearrangements, respectively, and southern blot and repeat-primed PCR for the amplification and sizing of expanded alleles, impaired by limited resolution and sensitivity that have not been significantly improved by the advent of NGS. Nevertheless, more recently, with the increased accuracy provided by the latest product releases, LRS has been tested also for SNV detection, especially in genes with highly homologous pseudogenes and for haplotype reconstruction to assess the parental origin of alleles with de novo pathogenic variants. We provide a review of relevant recent scientific papers exploring LRS potential in the diagnosis of genetic diseases and its potential future applications in routine genetic testing.
Collapse
Affiliation(s)
- Giulia Olivucci
- IRCCS Azienda Ospedaliero-Universitaria di Bologna, Bologna, Italy
- Department of Surgical and Oncological Sciences, University of Palermo, Palermo, Italy
| | - Emanuela Iovino
- IRCCS Azienda Ospedaliero-Universitaria di Bologna, Bologna, Italy
| | - Giovanni Innella
- Department of Medical and Surgical Sciences (DIMEC), University of Bologna, Bologna, Italy
- Medical Genetics Unit, IRCCS Azienda Ospedaliero-Universitaria di Bologna, Bologna, Italy
| | - Daniela Turchetti
- Department of Medical and Surgical Sciences (DIMEC), University of Bologna, Bologna, Italy
- Medical Genetics Unit, IRCCS Azienda Ospedaliero-Universitaria di Bologna, Bologna, Italy
| | - Tommaso Pippucci
- IRCCS Azienda Ospedaliero-Universitaria di Bologna, Bologna, Italy
| | - Pamela Magini
- Medical Genetics Unit, IRCCS Azienda Ospedaliero-Universitaria di Bologna, Bologna, Italy
| |
Collapse
|
19
|
Street TL, Sanderson ND, Barker L, Kavanagh J, Cole K, Llewelyn M, Eyre DW. Target enrichment improves culture-independent detection of Neisseria gonorrhoeae and antimicrobial resistance determinants direct from clinical samples with Nanopore sequencing. Microb Genom 2024; 10:001208. [PMID: 38529900 PMCID: PMC10995632 DOI: 10.1099/mgen.0.001208] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2023] [Accepted: 02/10/2024] [Indexed: 03/27/2024] Open
Abstract
Multi-drug-resistant Neisseria gonorrhoeae infection is a significant public health risk. Rapidly detecting N. gonorrhoeae and antimicrobial-resistant (AMR) determinants by metagenomic sequencing of urine is possible, although high levels of host DNA and overgrowth of contaminating species hamper sequencing and limit N. gonorrhoeae genome coverage. We performed Nanopore sequencing of nucleic acid amplification test-positive urine samples and culture-positive urethral swabs with and without probe-based target enrichment, using a custom SureSelect panel, to investigate whether selective enrichment of N. gonorrhoeae DNA improves detection of both species and AMR determinants. Probes were designed to cover the entire N. gonorrhoeae genome, with tenfold enrichment of probes covering selected AMR determinants. Multiplexing was tested in a subset of samples. The proportion of sequence bases classified as N. gonorrhoeae increased in all samples after enrichment, from a median (IQR) of 0.05 % (0.01-0.1 %) to 76 % (42-82 %), giving a corresponding median improvement in fold genome coverage of 365 times (112-720). Over 20-fold coverage, required for robust AMR determinant detection, was achieved in 13/15(87 %) samples, compared to 2/15(13 %) without enrichment. The four samples multiplexed together also achieved >20-fold genome coverage. Coverage of AMR determinants was sufficient to predict resistance conferred by changes in chromosomal genes, where present, and genome coverage also enabled phylogenetic relationships to be reconstructed. Probe-based target enrichment can improve N. gonorrhoeae genome coverage when sequencing DNA extracts directly from urine or urethral swabs, allowing for detection of AMR determinants. Additionally, multiplexing prior to enrichment provided enough genome coverage for AMR detection and reduces the costs associated with this method.
Collapse
Affiliation(s)
- Teresa L. Street
- Nuffield Department of Medicine, University of Oxford, John Radcliffe Hospital, Oxford, UK
- National Institute for Health Research Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford, UK
| | - Nicholas D. Sanderson
- Nuffield Department of Medicine, University of Oxford, John Radcliffe Hospital, Oxford, UK
- National Institute for Health Research Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford, UK
| | - Leanne Barker
- Nuffield Department of Medicine, University of Oxford, John Radcliffe Hospital, Oxford, UK
| | - James Kavanagh
- Nuffield Department of Medicine, University of Oxford, John Radcliffe Hospital, Oxford, UK
| | - Kevin Cole
- Department of Microbiology and Infection, University Hospitals Sussex NHS Trust, Brighton, UK
| | - The GonFast Investigators Group
- Nuffield Department of Medicine, University of Oxford, John Radcliffe Hospital, Oxford, UK
- National Institute for Health Research Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford, UK
- Department of Microbiology and Infection, University Hospitals Sussex NHS Trust, Brighton, UK
- Big Data Institute, Nuffield Department of Population Health, University of Oxford, Oxford, UK
| | - Martin Llewelyn
- Department of Microbiology and Infection, University Hospitals Sussex NHS Trust, Brighton, UK
| | - David W. Eyre
- National Institute for Health Research Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford, UK
- Big Data Institute, Nuffield Department of Population Health, University of Oxford, Oxford, UK
| |
Collapse
|
20
|
Johnston CM, Olesen AS, Lohse L, le Maire Madsen A, Bøtner A, Belsham GJ, Rasmussen TB. A Deep Sequencing Strategy for Investigation of Virus Variants within African Swine Fever Virus-Infected Pigs. Pathogens 2024; 13:154. [PMID: 38392892 PMCID: PMC10893071 DOI: 10.3390/pathogens13020154] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2023] [Revised: 01/26/2024] [Accepted: 02/03/2024] [Indexed: 02/25/2024] Open
Abstract
African swine fever virus (ASFV) is the causative agent of African swine fever, an economically important disease of pigs, often with a high case fatality rate. ASFV has demonstrated low genetic diversity among isolates collected within Eurasia. To explore the influence of viral variants on clinical outcomes and infection dynamics in pigs experimentally infected with ASFV, we have designed a deep sequencing strategy. The variant analysis revealed unique SNPs at <10% frequency in several infected pigs as well as some SNPs that were found in more than one pig. In addition, a deletion of 10,487 bp (resulting in the complete loss of 21 genes) was present at a nearly 100% frequency in the ASFV DNA from one pig at position 6362-16849. This deletion was also found to be present at low levels in the virus inoculum and in two other infected pigs. The current methodology can be used for the currently circulating Eurasian ASFVs and also adapted to other ASFV strains and genotypes. Comprehensive deep sequencing is critical for following ASFV molecular evolution, especially for the identification of modifications that affect virus virulence.
Collapse
Affiliation(s)
- Camille Melissa Johnston
- Section for Veterinary Virology, Department of Virus & Microbiological Special Diagnostics, Statens Serum Institute, Artillerivej 5, DK-2300 Copenhagen, Denmark; (C.M.J.); (A.S.O.); (L.L.); (A.l.M.M.)
| | - Ann Sofie Olesen
- Section for Veterinary Virology, Department of Virus & Microbiological Special Diagnostics, Statens Serum Institute, Artillerivej 5, DK-2300 Copenhagen, Denmark; (C.M.J.); (A.S.O.); (L.L.); (A.l.M.M.)
| | - Louise Lohse
- Section for Veterinary Virology, Department of Virus & Microbiological Special Diagnostics, Statens Serum Institute, Artillerivej 5, DK-2300 Copenhagen, Denmark; (C.M.J.); (A.S.O.); (L.L.); (A.l.M.M.)
| | - Agnete le Maire Madsen
- Section for Veterinary Virology, Department of Virus & Microbiological Special Diagnostics, Statens Serum Institute, Artillerivej 5, DK-2300 Copenhagen, Denmark; (C.M.J.); (A.S.O.); (L.L.); (A.l.M.M.)
- Section for Molecular Ecology and Evolution, Globe Institute, University of Copenhagen, Øster Farimagsgade 5, DK-1353 København, Denmark
| | - Anette Bøtner
- Section for Veterinary Clinical Microbiology, Department of Veterinary and Animal Sciences, University of Copenhagen, Stigbøjlen 4, DK-1870 Frederiksberg, Denmark; (A.B.); (G.J.B.)
| | - Graham J. Belsham
- Section for Veterinary Clinical Microbiology, Department of Veterinary and Animal Sciences, University of Copenhagen, Stigbøjlen 4, DK-1870 Frederiksberg, Denmark; (A.B.); (G.J.B.)
| | - Thomas Bruun Rasmussen
- Section for Veterinary Virology, Department of Virus & Microbiological Special Diagnostics, Statens Serum Institute, Artillerivej 5, DK-2300 Copenhagen, Denmark; (C.M.J.); (A.S.O.); (L.L.); (A.l.M.M.)
| |
Collapse
|
21
|
Phillips AL, Ferguson S, Burton RA, Watson-Haigh NS. CLAW: An automated Snakemake workflow for the assembly of chloroplast genomes from long-read data. PLoS Comput Biol 2024; 20:e1011870. [PMID: 38335225 PMCID: PMC10883564 DOI: 10.1371/journal.pcbi.1011870] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2023] [Revised: 02/22/2024] [Accepted: 01/29/2024] [Indexed: 02/12/2024] Open
Abstract
Chloroplasts are photosynthetic organelles in algal and plant cells that contain their own genome. Chloroplast genomes are commonly used in evolutionary studies and taxonomic identification and are increasingly becoming a target for crop improvement studies. As DNA sequencing becomes more affordable, researchers are collecting vast swathes of high-quality whole-genome sequence data from laboratory and field settings alike. Whole tissue read libraries sequenced with the primary goal of understanding the nuclear genome will inadvertently contain many reads derived from the chloroplast genome. These whole-genome, whole-tissue read libraries can additionally be used to assemble chloroplast genomes with little to no extra cost. While several tools exist that make use of short-read second generation and third-generation long-read sequencing data for chloroplast genome assembly, these tools may have complex installation steps, inadequate error reporting, poor expandability, and/or lack scalability. Here, we present CLAW (Chloroplast Long-read Assembly Workflow), an easy to install, customise, and use Snakemake tool to assemble chloroplast genomes from chloroplast long-reads found in whole-genome read libraries (https://github.com/aaronphillips7493/CLAW). Using 19 publicly available reference chloroplast genome assemblies and long-read libraries from algal, monocot and eudicot species, we show that CLAW can rapidly produce chloroplast genome assemblies with high similarity to the reference assemblies. CLAW was designed such that users have complete control over parameterisation, allowing individuals to optimise CLAW to their specific use cases. We expect that CLAW will provide researchers (with varying levels of bioinformatics expertise) with an additional resource useful for contributing to the growing number of publicly available chloroplast genome assemblies.
Collapse
Affiliation(s)
- Aaron L Phillips
- Department of Food Science, University of Adelaide, Adelaide, South Australia, Australia
| | - Scott Ferguson
- Research School of Biology, Australian National University, Canberra, Australian Capital Territory, Australia
| | - Rachel A Burton
- Department of Food Science, University of Adelaide, Adelaide, South Australia, Australia
| | - Nathan S Watson-Haigh
- South Australian Genomics Centre (SAGC), SAHMRI, Adelaide, South Australia, Australia
- Australian Genome Research Facility, Victorian Comprehensive Cancer Centre, Melbourne, Victoria, Australia
- Alkahest Inc., San Carlos, California, United States of America
| |
Collapse
|
22
|
Safar HA, Alatar F, Mustafa AS. Three Rounds of Read Correction Significantly Improve Eukaryotic Protein Detection in ONT Reads. Microorganisms 2024; 12:247. [PMID: 38399651 PMCID: PMC10893331 DOI: 10.3390/microorganisms12020247] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2023] [Revised: 01/19/2024] [Accepted: 01/23/2024] [Indexed: 02/25/2024] Open
Abstract
BACKGROUND Eukaryotes' whole-genome sequencing is crucial for species identification, gene detection, and protein annotation. Oxford Nanopore Technology (ONT) is an affordable and rapid platform for sequencing eukaryotes; however, the relatively higher error rates require computational and bioinformatic efforts to produce more accurate genome assemblies. Here, we evaluated the effect of read correction tools on eukaryote genome completeness, gene detection and protein annotation. METHODS Reads generated by ONT of four eukaryotes, C. albicans, C. gattii, S. cerevisiae, and P. falciparum, were assembled using minimap2 and underwent three rounds of read correction using flye, medaka and racon. The generates consensus FASTA files were compared for total length (bp), genome completeness, gene detection, and protein-annotation by QUAST, BUSCO, BRAKER1 and InterProScan, respectively. RESULTS Genome completeness was dependent on the assembly method rather than on the read correction tool; however, medaka performed better than flye and racon. Racon significantly performed better than flye and medaka in gene detection, while both racon and medaka significantly performed better than flye in protein-annotation. CONCLUSION We show that three rounds of read correction significantly affect gene detection and protein annotation, which are dependent on assembly quality in preference to assembly completeness.
Collapse
Affiliation(s)
- Hussain A. Safar
- OMICS Research Unit, Health Science Centre, Kuwait University, Kuwait City 13110, Kuwait;
| | - Fatemah Alatar
- Serology and Molecular Microbiology Reference Laboratory, Mubarak Al-Kabeer Hospital, Ministry of Health, Kuwait City 13110, Kuwait;
| | - Abu Salim Mustafa
- Department of Microbiology, Faculty of Medicine, Kuwait University, Kuwait City 13110, Kuwait
| |
Collapse
|
23
|
Lao HY, Wong LLY, Hui Y, Ng TTL, Chan CTM, Lo HWH, Yau MCY, Leung ECM, Wong RCW, Ho AYM, Yip KT, Lam JYW, Chow VCY, Luk KS, Que TL, Chow FWN, Siu GKH. The clinical utility of Nanopore 16S rRNA gene sequencing for direct bacterial identification in normally sterile body fluids. Front Microbiol 2024; 14:1324494. [PMID: 38264489 PMCID: PMC10803466 DOI: 10.3389/fmicb.2023.1324494] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2023] [Accepted: 12/19/2023] [Indexed: 01/25/2024] Open
Abstract
The prolonged incubation period of traditional culture methods leads to a delay in diagnosing invasive infections. Nanopore 16S rRNA gene sequencing (Nanopore 16S) offers a potential rapid diagnostic approach for directly identifying bacteria in infected body fluids. To evaluate the clinical utility of Nanopore 16S, we conducted a study involving the collection and sequencing of 128 monomicrobial samples, 65 polymicrobial samples, and 20 culture-negative body fluids. To minimize classification bias, taxonomic classification was performed using 3 analysis pipelines: Epi2me, Emu, and NanoCLUST. The result was compared to the culture references. The limit of detection of Nanopore 16S was also determined using simulated bacteremic blood samples. Among the three classifiers, Emu demonstrated the highest concordance with the culture results. It correctly identified the taxon of 125 (97.7%) of the 128 monomicrobial samples, compared to 109 (85.2%) for Epi2me and 102 (79.7%) for NanoCLUST. For the 230 cultured species in the 65 polymicrobial samples, Emu correctly identified 188 (81.7%) cultured species, compared to 174 (75.7%) for Epi2me and 125 (54.3%) for NanoCLUST. Through ROC analysis on the monomicrobial samples, we determined a threshold of relative abundance at 0.058 for distinguishing potential pathogens from background in Nanopore 16S. Applying this threshold resulted in the identification of 107 (83.6%), 117 (91.4%), and 114 (91.2%) correctly detected samples for Epi2me, Emu, and NanoCLUST, respectively, in the monomicrobial samples. Nanopore 16S coupled with Epi2me could provide preliminary results within 6 h. However, the ROC analysis of polymicrobial samples exhibited a random-like performance, making it difficult to establish a threshold. The overall limit of detection for Nanopore 16S was found to be about 90 CFU/ml.
Collapse
Affiliation(s)
- Hiu-Yin Lao
- Department of Health Technology and Informatics, The Hong Kong Polytechnic University, Kowloon, Hong Kong SAR, China
| | - Lily Lok-Yee Wong
- Department of Health Technology and Informatics, The Hong Kong Polytechnic University, Kowloon, Hong Kong SAR, China
| | - Yan Hui
- Department of Health Technology and Informatics, The Hong Kong Polytechnic University, Kowloon, Hong Kong SAR, China
| | - Timothy Ting-Leung Ng
- Department of Health Technology and Informatics, The Hong Kong Polytechnic University, Kowloon, Hong Kong SAR, China
| | - Chloe Toi-Mei Chan
- Department of Health Technology and Informatics, The Hong Kong Polytechnic University, Kowloon, Hong Kong SAR, China
| | - Hazel Wing-Hei Lo
- Department of Health Technology and Informatics, The Hong Kong Polytechnic University, Kowloon, Hong Kong SAR, China
| | - Miranda Chong-Yee Yau
- Department of Clinical Pathology, Pamela Youde Nethersole Eastern Hospital, Chai Wan, Hong Kong SAR, China
| | - Eddie Chi-Man Leung
- Department of Clinical Pathology, Pamela Youde Nethersole Eastern Hospital, Chai Wan, Hong Kong SAR, China
| | - River Chun-Wai Wong
- Department of Microbiology, Prince of Wales Hospital, Shatin, Hong Kong SAR, China
| | - Alex Yat-Man Ho
- Department of Pathology, Princess Margaret Hospital, Kowloon, Hong Kong SAR, China
| | - Kam-Tong Yip
- Department of Clinical Pathology, Tuen Mun Hospital, Tuen Mun, Hong Kong SAR, China
| | - Jimmy Yiu-Wing Lam
- Department of Clinical Pathology, Pamela Youde Nethersole Eastern Hospital, Chai Wan, Hong Kong SAR, China
| | - Viola Chi-Ying Chow
- Department of Microbiology, Prince of Wales Hospital, Shatin, Hong Kong SAR, China
| | - Kristine Shik Luk
- Department of Pathology, Princess Margaret Hospital, Kowloon, Hong Kong SAR, China
| | - Tak-Lun Que
- Department of Clinical Pathology, Tuen Mun Hospital, Tuen Mun, Hong Kong SAR, China
| | - Franklin Wang Ngai Chow
- Department of Health Technology and Informatics, The Hong Kong Polytechnic University, Kowloon, Hong Kong SAR, China
| | - Gilman Kit-Hang Siu
- Department of Health Technology and Informatics, The Hong Kong Polytechnic University, Kowloon, Hong Kong SAR, China
| |
Collapse
|
24
|
Penter L, Borji M, Nagler A, Lyu H, Lu WS, Cieri N, Maurer K, Oliveira G, Al'Khafaji AM, Garimella KV, Li S, Neuberg DS, Ritz J, Soiffer RJ, Garcia JS, Livak KJ, Wu CJ. Integrative genotyping of cancer and immune phenotypes by long-read sequencing. Nat Commun 2024; 15:32. [PMID: 38167262 PMCID: PMC10762175 DOI: 10.1038/s41467-023-44137-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2023] [Accepted: 12/01/2023] [Indexed: 01/05/2024] Open
Abstract
Single-cell transcriptomics has become the definitive method for classifying cell types and states, and can be augmented with genotype information to improve cell lineage identification. Due to constraints of short-read sequencing, current methods to detect natural genetic barcodes often require cumbersome primer panels and early commitment to targets. Here we devise a flexible long-read sequencing workflow and analysis pipeline, termed nanoranger, that starts from intermediate single-cell cDNA libraries to detect cell lineage-defining features, including single-nucleotide variants, fusion genes, isoforms, sequences of chimeric antigen and TCRs. Through systematic analysis of these classes of natural 'barcodes', we define the optimal targets for nanoranger, namely those loci close to the 5' end of highly expressed genes with transcript lengths shorter than 4 kB. As proof-of-concept, we apply nanoranger to longitudinal tracking of subclones of acute myeloid leukemia (AML) and describe the heterogeneous isoform landscape of thousands of marrow-infiltrating immune cells. We propose that enhanced cellular genotyping using nanoranger can improve the tracking of single-cell tumor and immune cell co-evolution.
Collapse
Grants
- P01 CA229092 NCI NIH HHS
- P50 CA101942 NCI NIH HHS
- UM1 CA186709 NCI NIH HHS
- U24 CA224316 NCI NIH HHS
- U24 CA224331 NCI NIH HHS
- U24 CA224285 NCI NIH HHS
- R50 CA251956 NCI NIH HHS
- U24 CA224309 NCI NIH HHS
- U24 CA224319 NCI NIH HHS
- K08 CA245209 NCI NIH HHS
- This work was supported by National Institutes of Health, National Cancer Institute grant P01CA229092 (CJW), UM1CA186709 (Principal Investigator: Geoffrey Shapiro), National Cancer Institute Cancer Therapy Evaluation Program, Bristol-Myers Squibb, and LLS Therapy Accelerator Program. L.P. was supported by a research fellowship from the German Research Foundation (DFG, PE 3127/1-1) and is a Scholar of the American Society of Hematology, participant in the BIH Charité Digital Clinician Scientist Program funded by the DFG, the Charité – Universitätsmedizin Berlin, and the Berlin Institute of Health at Charité (BIH) and is supported by the Max-Eder program of the German Cancer Aid. A.A. is supported by the Broad Institute IGNITE award. K.M. is suppored by the ASCO YIA award. G.O. was supported by the Claudia Adams Barr Program for Innovative Cancer Research and by DF/HCC Kidney Cancer SPORE P50 CA101942. S.L. is supported by the National Institutes of Health, National Cancer Institute Research Specialist Award (R50CA251956). JSG is supported by the Conquer Cancer Foundation Career Development Award, Leukemia and Lymphoma Society Translational Research Program Award, and NIH K08CA245209. NCI CTEP provided study drug (Ipilimumab) support. This work was further supported by the CIMAC-CIDC Network. Scientific and financial support for the CIMAC-CIDC Network is provided through National Institutes of Health, National Cancer Institute Cooperative Agreements U24CA224319 (to the Icahn School of Medicine at Mount Sinai CIMAC), U24CA224331 (to the Dana-Farber Cancer Institute CIMAC), U24CA224285 (to the MD Anderson Cancer Center CIMAC), U24CA224309 (to the Stanford University CIMAC), and U24CA224316 (to the CIDC at Dana-Farber Cancer Institute). The CIMAC-CIDC website is found at https://cimac-network.org/.
Collapse
Affiliation(s)
- Livius Penter
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA
- Broad Institute of Massachusetts Institute of Technology and Harvard University, Cambridge, MA, USA
- Harvard Medical School, Boston, MA, USA
- Department of Hematology, Oncology, and Tumorimmunology, Campus Virchow Klinikum, Berlin, Charité - Universitätsmedizin Berlin, corporate member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Berlin, Germany
- Berlin Institute of Health at Charité - Universitätsmedizin Berlin, BIH Biomedical Innovation Academy, BIH Charité Digital Clinician Scientist Program, Charitéplatz 1, 10117, Berlin, Germany
| | - Mehdi Borji
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA
- Broad Institute of Massachusetts Institute of Technology and Harvard University, Cambridge, MA, USA
- Translational Immunogenomics Lab, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Adi Nagler
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA
- Broad Institute of Massachusetts Institute of Technology and Harvard University, Cambridge, MA, USA
| | - Haoxiang Lyu
- Translational Immunogenomics Lab, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Wesley S Lu
- Translational Immunogenomics Lab, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Nicoletta Cieri
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA
- Broad Institute of Massachusetts Institute of Technology and Harvard University, Cambridge, MA, USA
- Harvard Medical School, Boston, MA, USA
| | - Katie Maurer
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA
- Broad Institute of Massachusetts Institute of Technology and Harvard University, Cambridge, MA, USA
- Harvard Medical School, Boston, MA, USA
| | - Giacomo Oliveira
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA
- Broad Institute of Massachusetts Institute of Technology and Harvard University, Cambridge, MA, USA
- Harvard Medical School, Boston, MA, USA
| | - Aziz M Al'Khafaji
- Broad Institute of Massachusetts Institute of Technology and Harvard University, Cambridge, MA, USA
| | - Kiran V Garimella
- Broad Institute of Massachusetts Institute of Technology and Harvard University, Cambridge, MA, USA
| | - Shuqiang Li
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA
- Broad Institute of Massachusetts Institute of Technology and Harvard University, Cambridge, MA, USA
- Translational Immunogenomics Lab, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Donna S Neuberg
- Department of Data Science, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Jerome Ritz
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA
- Harvard Medical School, Boston, MA, USA
- Department of Medicine, Brigham and Women's Hospital, Boston, MA, USA
| | - Robert J Soiffer
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA
- Harvard Medical School, Boston, MA, USA
- Department of Medicine, Brigham and Women's Hospital, Boston, MA, USA
| | - Jacqueline S Garcia
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA
- Harvard Medical School, Boston, MA, USA
| | - Kenneth J Livak
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA
- Translational Immunogenomics Lab, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Catherine J Wu
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA.
- Broad Institute of Massachusetts Institute of Technology and Harvard University, Cambridge, MA, USA.
- Harvard Medical School, Boston, MA, USA.
- Department of Medicine, Brigham and Women's Hospital, Boston, MA, USA.
| |
Collapse
|
25
|
Srivastava SK, Parker C, O'Brien CN, Tucker MS, Thompson PC, Rosenthal BM, Dubey JP, Khan A, Jenkins MC. Chromosomal scale assembly reveals localized structural variants in avian caecal coccidian parasite Eimeria tenella. Sci Rep 2023; 13:22802. [PMID: 38129566 PMCID: PMC10739835 DOI: 10.1038/s41598-023-50117-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2023] [Accepted: 12/15/2023] [Indexed: 12/23/2023] Open
Abstract
Eimeria tenella is a major cause of caecal coccidiosis in commercial poultry chickens worldwide. Here, we report chromosomal scale assembly of Eimeria tenella strain APU2, a strain isolated from commercial broiler chickens in the U.S. We obtained 100× sequencing Oxford Nanopore Technology (ONT) and more than 800× Coverage of Illumina Next-Seq. We created the assembly using the hybrid approach implemented in MaSuRCA, achieving a contiguous 51.34 Mb chromosomal-scale scaffolding enabling identification of structural variations. The AUGUSTUS pipeline predicted 8060 genes, and BUSCO deemed the genomes 99% complete; 6278 (78%) genes were annotated with Pfam domains, and 1395 genes were assigned GO-terms. Comparing E. tenella strains (APU2, US isolate and Houghton, UK isolate) derived Houghton strain of E. tenella revealed 62,905 high stringency differences, of which 45,322 are single nucleotide polymorphisms (SNPs) (0.088%). The rate of transitions/transversions among the SNPs are 1.63 ts/tv. The strains possess conserved gene order but have profound sequence heterogeneity in a several chromosomal segments (chr 2, 11 and 15). Genic and intergenic variation in defined gene families was evaluated between the two strains to possibly identify sequences under selection. The average genic nucleotide diversity of 2.8 with average 2 kb gene length (0.145%) at genic level. We examined population structure using available E. tenella sequences in NCBI, revealing that the two E. tenella isolates from the U.S. (E. tenella APU2 and Wisconsin, "ERR296879") share a common maternal inheritance with the E. tenella Houghton. Our chromosomal level assembly promotes insight into Eimeria biology and evolution, hastening drug discovery and vaccine development.
Collapse
Affiliation(s)
- Subodh K Srivastava
- USDA-ARS Animal Parasitic Diseases Laboratory, Beltsville Agricultural Research Center, BARC-East Building 1040, 10300 Baltimore Ave., Beltsville, MD, 20705, USA.
| | - Carolyn Parker
- USDA-ARS Animal Parasitic Diseases Laboratory, Beltsville Agricultural Research Center, BARC-East Building 1040, 10300 Baltimore Ave., Beltsville, MD, 20705, USA
| | - Celia N O'Brien
- USDA-ARS Animal Parasitic Diseases Laboratory, Beltsville Agricultural Research Center, BARC-East Building 1040, 10300 Baltimore Ave., Beltsville, MD, 20705, USA
| | - Matthew S Tucker
- USDA-ARS Animal Parasitic Diseases Laboratory, Beltsville Agricultural Research Center, BARC-East Building 1040, 10300 Baltimore Ave., Beltsville, MD, 20705, USA
| | - Peter C Thompson
- USDA-ARS Animal Parasitic Diseases Laboratory, Beltsville Agricultural Research Center, BARC-East Building 1040, 10300 Baltimore Ave., Beltsville, MD, 20705, USA
| | - Benjamin M Rosenthal
- USDA-ARS Animal Parasitic Diseases Laboratory, Beltsville Agricultural Research Center, BARC-East Building 1040, 10300 Baltimore Ave., Beltsville, MD, 20705, USA
| | - Jitender P Dubey
- USDA-ARS Animal Parasitic Diseases Laboratory, Beltsville Agricultural Research Center, BARC-East Building 1040, 10300 Baltimore Ave., Beltsville, MD, 20705, USA
| | - Asis Khan
- USDA-ARS Animal Parasitic Diseases Laboratory, Beltsville Agricultural Research Center, BARC-East Building 1040, 10300 Baltimore Ave., Beltsville, MD, 20705, USA
| | - Mark C Jenkins
- USDA-ARS Animal Parasitic Diseases Laboratory, Beltsville Agricultural Research Center, BARC-East Building 1040, 10300 Baltimore Ave., Beltsville, MD, 20705, USA.
| |
Collapse
|
26
|
Chiou CS, Chen BH, Wang YW, Kuo NT, Chang CH, Huang YT. Correcting modification-mediated errors in nanopore sequencing by nucleotide demodification and reference-based correction. Commun Biol 2023; 6:1215. [PMID: 38030695 PMCID: PMC10687267 DOI: 10.1038/s42003-023-05605-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2023] [Accepted: 11/17/2023] [Indexed: 12/01/2023] Open
Abstract
The accuracy of Oxford Nanopore Technology (ONT) sequencing has significantly improved thanks to new flowcells, sequencing kits, and basecalling algorithms. However, novel modification types untrained in the basecalling models can seriously reduce the quality. Here we reports a set of ONT-sequenced genomes with unexpected low quality due to novel modification types. Demodification by whole-genome amplification significantly improved the quality but lost the epigenome. We also developed a reference-based method, Modpolish, for correcting modification-mediated errors while retaining the epigenome when a sufficient number of closely-related genomes is publicly available (default: top 20 genomes with at least 95% identity). Modpolish not only significantly improved the quality of in-house sequenced genomes but also public datasets sequenced by R9.4 and R10.4 (simplex). Our results suggested that novel modifications are prone to ONT systematic errors. Nevertheless, these errors are correctable by nucleotide demodification or Modpolish without prior knowledge of modifications.
Collapse
Affiliation(s)
| | - Bo-Han Chen
- Centers for Disease Control, Taichung, Taiwan
| | | | - Nang-Ting Kuo
- Department of Computer Science and Information Engineering, National Chung Cheng University, Chiayi, Taiwan
| | - Chih-Hsiang Chang
- Department of Computer Science and Information Engineering, National Chung Cheng University, Chiayi, Taiwan
| | - Yao-Ting Huang
- Department of Computer Science and Information Engineering, National Chung Cheng University, Chiayi, Taiwan.
| |
Collapse
|
27
|
Theologidis I, Karamitros T, Vichou AE, Kizis D. Nanopore-Sequencing Metabarcoding for Identification of Phytopathogenic and Endophytic Fungi in Olive ( Olea europaea) Twigs. J Fungi (Basel) 2023; 9:1119. [PMID: 37998924 PMCID: PMC10672464 DOI: 10.3390/jof9111119] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2023] [Revised: 11/13/2023] [Accepted: 11/16/2023] [Indexed: 11/25/2023] Open
Abstract
Metabarcoding approaches for the identification of plant disease pathogens and characterization of plant microbial populations constitute a rapidly evolving research field. Fungal plant diseases are of major phytopathological concern; thus, the development of metabarcoding approaches for the detection of phytopathogenic fungi is becoming increasingly imperative in the context of plant disease prognosis. We developed a multiplex metabarcoding method for the identification of fungal phytopathogens and endophytes in olive young shoots, using the MinION sequencing platform (Oxford Nanopore Technologies). Selected fungal-specific primers were used to amplify three different genomic DNA loci (ITS, beta-tubulin, and 28S LSU) originating from olive twigs. A multiplex metabarcoding approach was initially evaluated using healthy olive twigs, and further assessed with naturally infected olive twig samples. Bioinformatic analysis of basecalled reads was carried out using MinKNOW, BLAST+ and R programming, and results were also evaluated using the BugSeq cloud platform. Data analysis highlighted the approaches based on ITS and their combination with beta-tubulin as the most informative ones according to diversity estimations. Subsequent implementation of the method on symptomatic samples identified major olive pathogens and endophytes including genera such as Cladosporium, Didymosphaeria, Paraconiothyrium, Penicillium, Phoma, Verticillium, and others.
Collapse
Affiliation(s)
- Ioannis Theologidis
- Laboratory of Toxicological Control of Pesticides, Scientific Directorate of Pesticides' Control & Phytopharmacy, Benaki Phytopathological Institute, 8 St. Delta Street, 14561 Athens, Attica, Greece
| | - Timokratis Karamitros
- Bioinformatics and Applied Genomics Unit, Department of Microbiology, Hellenic Pasteur Institute, 127 Vasilissis Sofias Avenue, 11521 Athens, Attica, Greece
| | - Aikaterini-Eleni Vichou
- Laboratory of Mycology, Scientific Directorate of Phytopathology, Benaki Phytopathological Institute, 8 St. Delta Street, 14561 Athens, Attica, Greece
| | - Dimosthenis Kizis
- Laboratory of Mycology, Scientific Directorate of Phytopathology, Benaki Phytopathological Institute, 8 St. Delta Street, 14561 Athens, Attica, Greece
| |
Collapse
|
28
|
Casanova-Adán L, Mosquera-Miguel A, González-Bao J, Ambroa-Conde A, Ruiz-Ramírez J, Cabrejas-Olalla A, González-Martín E, Freire-Aradas A, Rodríguez-López A, Phillips C, Lareu MV, de la Puente M. Adapting an established Ampliseq microhaplotype panel to nanopore sequencing through direct PCR. Forensic Sci Int Genet 2023; 67:102937. [PMID: 37812882 DOI: 10.1016/j.fsigen.2023.102937] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2023] [Revised: 09/21/2023] [Accepted: 09/27/2023] [Indexed: 10/11/2023]
Abstract
We have adapted an established Ampliseq microhaplotype panel for nanopore sequencing with the Oxford Nanopore Technologies (ONT) system, as a cost-effective and highly scalable solution for forensic genetics applications. For this purpose, we designed a protocol combining direct PCR amplification from unextracted DNA with ONT library construction and sequencing using the MinION device and workflow. The analysis of reference samples at input amounts of 5-10 ng of DNA demonstrates stable coverage patterns, allele balance, and strand bias, reaching profile completeness and concordance rates of ∼95%. Similar levels were achieved when using direct-PCR from blood, buccal and semen swabs. Dilution series results indicate sensitivity is maintained down to 250 pg of input DNA, and informative profiles are produced down to 62.5 pg. Finally, we demonstrated the forensic utility of the nanopore workflow by analyzing two third degree pedigrees that showed low likelihood ratio values after the analysis of an extended panel of 38 STRs, achieving likelihood ratios 2-3 orders of magnitude higher when testing with the MinION-based haplotype data.
Collapse
Affiliation(s)
- L Casanova-Adán
- Forensic Genetics Unit, Institute of Forensic Sciences, Universidade de Santiago de Compostela, Spain
| | - A Mosquera-Miguel
- Forensic Genetics Unit, Institute of Forensic Sciences, Universidade de Santiago de Compostela, Spain
| | - J González-Bao
- Forensic Genetics Unit, Institute of Forensic Sciences, Universidade de Santiago de Compostela, Spain
| | - A Ambroa-Conde
- Forensic Genetics Unit, Institute of Forensic Sciences, Universidade de Santiago de Compostela, Spain
| | - J Ruiz-Ramírez
- Forensic Genetics Unit, Institute of Forensic Sciences, Universidade de Santiago de Compostela, Spain
| | - A Cabrejas-Olalla
- Forensic Genetics Unit, Institute of Forensic Sciences, Universidade de Santiago de Compostela, Spain
| | - E González-Martín
- Forensic Genetics Unit, Institute of Forensic Sciences, Universidade de Santiago de Compostela, Spain
| | - A Freire-Aradas
- Forensic Genetics Unit, Institute of Forensic Sciences, Universidade de Santiago de Compostela, Spain
| | - A Rodríguez-López
- Forensic Genetics Unit, Institute of Forensic Sciences, Universidade de Santiago de Compostela, Spain
| | - C Phillips
- Forensic Genetics Unit, Institute of Forensic Sciences, Universidade de Santiago de Compostela, Spain
| | - M V Lareu
- Forensic Genetics Unit, Institute of Forensic Sciences, Universidade de Santiago de Compostela, Spain
| | - M de la Puente
- Forensic Genetics Unit, Institute of Forensic Sciences, Universidade de Santiago de Compostela, Spain.
| |
Collapse
|
29
|
Kolesnikov A, Cook D, Nattestad M, McNulty B, Gorzynski J, Goenka S, Ashley EA, Jain M, Miga KH, Paten B, Chang PC, Carroll A, Shafin K. Local read haplotagging enables accurate long-read small variant calling. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.09.07.556731. [PMID: 37745389 PMCID: PMC10515762 DOI: 10.1101/2023.09.07.556731] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/26/2023]
Abstract
Long-read sequencing technology has enabled variant detection in difficult-to-map regions of the genome and enabled rapid genetic diagnosis in clinical settings. Rapidly evolving third-generation sequencing platforms like Pacific Biosciences (PacBio) and Oxford nanopore technologies (ONT) are introducing newer platforms and data types. It has been demonstrated that variant calling methods based on deep neural networks can use local haplotyping information with long-reads to improve the genotyping accuracy. However, using local haplotype information creates an overhead as variant calling needs to be performed multiple times which ultimately makes it difficult to extend to new data types and platforms as they get introduced. In this work, we have developed a local haplotype approximate method that enables state-of-the-art variant calling performance with multiple sequencing platforms including PacBio Revio system, ONT R10.4 simplex and duplex data. This addition of local haplotype approximation makes DeepVariant a universal variant calling solution for long-read sequencing platforms.
Collapse
Affiliation(s)
| | - Daniel Cook
- Google Inc, 1600 Amphitheatre Pkwy, Mountain View, CA
| | | | - Brandy McNulty
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, California, USA
| | | | | | | | - Miten Jain
- Northeastern university, Boston, MA, USA
| | - Karen H Miga
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, California, USA
| | - Benedict Paten
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, California, USA
| | | | | | | |
Collapse
|
30
|
Liu X, Ni Y, Wang D, Ye S, Yang M, Sun X, Leung AYH, Li R. Unraveling the whole genome DNA methylation profile of zebrafish kidney marrow by Oxford Nanopore sequencing. Sci Data 2023; 10:532. [PMID: 37563176 PMCID: PMC10415270 DOI: 10.1038/s41597-023-02431-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2023] [Accepted: 07/31/2023] [Indexed: 08/12/2023] Open
Abstract
Zebrafish is a widely used model organism for investigating human diseases, including hematopoietic disorders. However, a comprehensive methylation baseline for zebrafish primary hematopoietic organ, the kidney marrow (KM), is still lacking. We employed Oxford Nanopore Technologies (ONT) sequencing to profile DNA methylation in zebrafish KM by generating four KM datasets, with two groups based on the presence or absence of red blood cells. Our findings revealed that blood contamination in the KM samples reduced read quality and altered methylation patterns. Compared with whole-genome bisulfite sequencing (WGBS), the ONT-based methylation profiling can cover more CpG sites (92.4% vs 70%-80%), and exhibit less GC bias with more even genomic coverage. And the ONT methylation calling results showed a high correlation with WGBS results when using shared sites. This study establishes a comprehensive methylation profile for zebrafish KM, paving the way for further investigations into epigenetic regulation and the development of targeted therapies for hematopoietic disorders.
Collapse
Affiliation(s)
- Xudong Liu
- Department of Infectious Diseases and Public Health, Jockey Club College of Veterinary Medicine and Life Sciences, City University of Hong Kong, Hong Kong, China
| | - Ying Ni
- Department of Precision Diagnostic and Therapeutic Technology, City University of Hong Kong Shenzhen Futian Research Institute, Shenzhen, China
- Department of Biomedical Sciences and Tung Biomedical Sciences Centre, City University of Hong Kong, Hong Kong, China
| | - Dandan Wang
- Division of Haematology, Department of Medicine, LKS Faculty of Medicine, The University of Hong Kong, Pokfulam, Hong Kong SAR, China
| | - Silin Ye
- Department of Infectious Diseases and Public Health, Jockey Club College of Veterinary Medicine and Life Sciences, City University of Hong Kong, Hong Kong, China
| | - Mengsu Yang
- Department of Precision Diagnostic and Therapeutic Technology, City University of Hong Kong Shenzhen Futian Research Institute, Shenzhen, China
- Department of Biomedical Sciences and Tung Biomedical Sciences Centre, City University of Hong Kong, Hong Kong, China
- Key Laboratory of Biochip Technology, Biotech and Health Centre, Shenzhen Research Institute of City University of Hong Kong, Shenzhen, China
| | - Xuan Sun
- Division of Haematology, Department of Medicine, LKS Faculty of Medicine, The University of Hong Kong, Pokfulam, Hong Kong SAR, China
- ZeBlast Technology Limited, Hong Kong Science Park, Hong Kong, China
| | - Anskar Yu Hung Leung
- Division of Haematology, Department of Medicine, LKS Faculty of Medicine, The University of Hong Kong, Pokfulam, Hong Kong SAR, China.
- ZeBlast Technology Limited, Hong Kong Science Park, Hong Kong, China.
| | - Runsheng Li
- Department of Infectious Diseases and Public Health, Jockey Club College of Veterinary Medicine and Life Sciences, City University of Hong Kong, Hong Kong, China.
- Southern Marine Science and Engineering Guangdong Laboratory (Guangzhou), Guangzhou, China.
| |
Collapse
|
31
|
Oehler JB, Wright H, Stark Z, Mallett AJ, Schmitz U. The application of long-read sequencing in clinical settings. Hum Genomics 2023; 17:73. [PMID: 37553611 PMCID: PMC10410870 DOI: 10.1186/s40246-023-00522-3] [Citation(s) in RCA: 18] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Accepted: 08/01/2023] [Indexed: 08/10/2023] Open
Abstract
Long-read DNA sequencing technologies have been rapidly evolving in recent years, and their ability to assess large and complex regions of the genome makes them ideal for clinical applications in molecular diagnosis and therapy selection, thereby providing a valuable tool for precision medicine. In the third-generation sequencing duopoly, Oxford Nanopore Technologies and Pacific Biosciences work towards increasing the accuracy, throughput, and portability of long-read sequencing methods while trying to keep costs low. These trades have made long-read sequencing an attractive tool for use in research and clinical settings. This article provides an overview of current clinical applications and limitations of long-read sequencing and explores its potential for point-of-care testing and health care in remote settings.
Collapse
Affiliation(s)
- Josephine B Oehler
- Biomedical Sciences and Molecular Biology, College of Public Health, Medical & Vet Sciences, James Cook University, Townsville, Australia
- College of Medicine and Dentistry, James Cook University, Townsville, Australia
| | - Helen Wright
- Nursing and Midwifery, College of Healthcare Sciences, James Cook University, Townsville, Australia
| | - Zornitza Stark
- Victorian Clinical Genetics Services, Murdoch Children's Research Institute, Melbourne, Australia
- University of Melbourne, Melbourne, Australia
- Australian Genomics, Melbourne, Australia
| | - Andrew J Mallett
- College of Medicine and Dentistry, James Cook University, Townsville, Australia
- Department of Renal Medicine, Townsville University Hospital, Townsville, Australia
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, Australia
- Faculty of Medicine, The University of Queensland, Brisbane, Australia
| | - Ulf Schmitz
- Biomedical Sciences and Molecular Biology, College of Public Health, Medical & Vet Sciences, James Cook University, Townsville, Australia.
- Centre for Tropical Bioinformatics and Molecular Biology, Australian Institute of Tropical Health and Medicine, James Cook University, Cairns, Australia.
- Computational BioMedicine Lab Centenary Institute, The University of Sydney, Camperdown, Australia.
- Faculty of Medicine & Health, The University of Sydney, Camperdown, Australia.
| |
Collapse
|