1
|
Wennmann JT, Lim FS, Senger S, Gani M, Jehle JA, Keilwagen J. Haplotype determination of the Bombyx mori nucleopolyhedrovirus by Nanopore sequencing and linkage of single nucleotide variants. J Gen Virol 2024; 105. [PMID: 38767624 DOI: 10.1099/jgv.0.001983] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/22/2024] Open
Abstract
Naturally occurring isolates of baculoviruses, such as the Bombyx mori nucleopolyhedrovirus (BmNPV), usually consist of numerous genetically different haplotypes. Deciphering the different haplotypes of such isolates is hampered by the large size of the dsDNA genome, as well as the short read length of next generation sequencing (NGS) techniques that are widely applied for baculovirus isolate characterization. In this study, we addressed this challenge by combining the accuracy of NGS to determine single nucleotide variants (SNVs) as genetic markers with the long read length of Nanopore sequencing technique. This hybrid approach allowed the comprehensive analysis of genetically homogeneous and heterogeneous isolates of BmNPV. Specifically, this allowed the identification of two putative major haplotypes in the heterogeneous isolate BmNPV-Ja by SNV position linkage. SNV positions, which were determined based on NGS data, were linked by the long Nanopore reads in a Position Weight Matrix. Using a modified Expectation-Maximization algorithm, the Nanopore reads were assigned according to the occurrence of variable SNV positions by machine learning. The cohorts of reads were de novo assembled, which led to the identification of BmNPV haplotypes. The method demonstrated the strength of the combined approach of short- and long-read sequencing techniques to decipher the genetic diversity of baculovirus isolates.
Collapse
Affiliation(s)
- Jörg T Wennmann
- Julius Kühn Institute (JKI) - Federal Research Centre for Cultivated Plants, Institute for Biological Control, Schwabenheimer Str. 101, 69221 Dossenheim, Germany
| | - Fang-Shiang Lim
- Julius Kühn Institute (JKI) - Federal Research Centre for Cultivated Plants, Institute for Biological Control, Schwabenheimer Str. 101, 69221 Dossenheim, Germany
| | - Sergei Senger
- Julius Kühn Institute (JKI) - Federal Research Centre for Cultivated Plants, Institute for Biological Control, Schwabenheimer Str. 101, 69221 Dossenheim, Germany
| | - Mudasir Gani
- Division of Entomology, Faculty of Agriculture, Sher-e-Kashmir University of Agricultural Sciences & Technology, Kashmir 193 201, J&K, India
| | - Johannes A Jehle
- Julius Kühn Institute (JKI) - Federal Research Centre for Cultivated Plants, Institute for Biological Control, Schwabenheimer Str. 101, 69221 Dossenheim, Germany
| | - Jens Keilwagen
- Julius Kühn Institute (JKI) - Federal Research Centre for Cultivated Plants, Institute for Biosafety in Plant Biotechnology, Ernst-Baur-Str. 27, 06484 Quedlinburg, Germany
| |
Collapse
|
2
|
Cornman RS. Data mining reveals tissue-specific expression and host lineage-associated forms of Apis mellifera filamentous virus. PeerJ 2023; 11:e16455. [PMID: 38025724 PMCID: PMC10655722 DOI: 10.7717/peerj.16455] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2023] [Accepted: 10/23/2023] [Indexed: 12/01/2023] Open
Abstract
Background Apis mellifera filamentous virus (AmFV) is a large double-stranded DNA virus of uncertain phylogenetic position that infects honey bees (Apis mellifera). Little is known about AmFV evolution or molecular aspects of infection. Accurate annotation of open-reading frames (ORFs) is challenged by weak homology to other known viruses. This study was undertaken to evaluate ORFs (including coding-frame conservation, codon bias, and purifying selection), quantify genetic variation within AmFV, identify host characteristics that covary with infection rate, and examine viral expression patterns in different tissues. Methods Short-read data were accessed from the Sequence Read Archive (SRA) of the National Center for Biotechnology Information (NCBI). Sequence reads were downloaded from accessions meeting search criteria and scanned for kmers representative of AmFV genomic sequence. Samples with kmer counts above specified thresholds were downloaded in full for mapping to reference sequences and de novo assembly. Results At least three distinct evolutionary lineages of AmFV exist. Clade 1 predominates in Europe but in the Americas and Africa it is replaced by the other clades as infection level increases in hosts. Only clade 3 was found at high relative abundance in hosts with African ancestry, whereas all clades achieved high relative abundance in bees of non-African ancestry. In Europe and Africa, clade 2 was generally detected only in low-level infections but was locally dominant in some North American samples. The geographic distribution of clade 3 was consistent with an introduction to the Americas with 'Africanized' honey bees in the 1950s. Localized genomic regions of very high nucleotide divergence in individual isolates suggest recombination with additional, as-yet unidentified AmFV lineages. A set of 155 high-confidence ORFs was annotated based on evolutionary conservation in six AmFV genome sequences representative of the three clades. Pairwise protein-level identity averaged 94.6% across ORFs (range 77.1-100%), which generally exhibited low evolutionary rates and moderate to strong codon bias. However, no robust example of positive diversifying selection on coding sequence was found in these alignments. Most of the genome was detected in RNA short-read alignments. Transcriptome assembly often yielded contigs in excess of 50 kb and containing ORFs in both orientations, and the termini of long transcripts were associated with tandem repeats. Lower levels of AmFV RNA were detected in brain tissue compared to abdominal tissue, and a distinct set of ORFs had minimal to no detectable expression in brain tissue. A scan of DNA accessions from the parasitic mite Varroa destructor was inconclusive with respect to replication in that species. Discussion Collectively, these results expand our understanding of this enigmatic virus, revealing transcriptional complexity and co-evolutionary associations with host lineage.
Collapse
|
3
|
Tombácz D, Torma G, Gulyás G, Fülöp Á, Dörmő Á, Prazsák I, Csabai Z, Mizik M, Hornyák Á, Zádori Z, Kakuk B, Boldogkői Z. Hybrid sequencing discloses unique aspects of the transcriptomic architecture in equid alphaherpesvirus 1. Heliyon 2023; 9:e17716. [PMID: 37449092 PMCID: PMC10336594 DOI: 10.1016/j.heliyon.2023.e17716] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2022] [Revised: 06/05/2023] [Accepted: 06/26/2023] [Indexed: 07/18/2023] Open
Abstract
This study employed both short-read sequencing (SRS, Illumina) and long-read sequencing (LRS Oxford Nanopore Technologies) platforms to conduct a comprehensive analysis of the equid alphaherpesvirus 1 (EHV-1) transcriptome. The study involved the annotation of canonical mRNAs and their transcript variants, encompassing transcription start site (TSS) and transcription end site (TES) isoforms, in addition to alternative splicing forms. Furthermore, the study revealed the presence of numerous non-coding RNA (ncRNA) molecules, including intergenic and antisense transcripts, produced by EHV-1. An intriguing finding was the abundant production of chimeric transcripts, some of which potentially encode fusion polypeptides. Moreover, EHV-1 exhibited a greater incidence of transcriptional overlaps and splicing compared to related viruses. It is noteworthy that many genes have their unique TESs along with the co-terminal transcription ends, a characteristic scarcely seen in other alphaherpesviruses. The study also identified transcripts that overlap the replication origins of the virus. Moreover, a novel ncRNA, referred to as NOIR, was found to intersect with the 5'-ends of longer transcript isoform specified by the major transactivator genes ORF64 and ORF65, surrounding the OriL. These findings together imply the existence of a key regulatory mechanism that governs both transcription and replication through, among others, a process that involves interference between the DNA and RNA synthesis machineries.
Collapse
Affiliation(s)
- Dóra Tombácz
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Szeged, Hungary
| | - Gábor Torma
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Szeged, Hungary
| | - Gábor Gulyás
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Szeged, Hungary
| | - Ádám Fülöp
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Szeged, Hungary
| | - Ákos Dörmő
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Szeged, Hungary
| | - István Prazsák
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Szeged, Hungary
| | - Zsolt Csabai
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Szeged, Hungary
| | - Máté Mizik
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Szeged, Hungary
| | - Ákos Hornyák
- Institute for Veterinary Medical Research, Centre for Agricultural Research, Budapest, Hungary
| | - Zoltán Zádori
- Institute for Veterinary Medical Research, Centre for Agricultural Research, Budapest, Hungary
| | - Balázs Kakuk
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Szeged, Hungary
| | - Zsolt Boldogkői
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Szeged, Hungary
| |
Collapse
|
4
|
Hamim I, Sekine KT, Komatsu K. How do emerging long-read sequencing technologies function in transforming the plant pathology research landscape? PLANT MOLECULAR BIOLOGY 2022; 110:469-484. [PMID: 35962900 DOI: 10.1007/s11103-022-01305-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/03/2022] [Accepted: 07/26/2022] [Indexed: 06/15/2023]
Abstract
Long-read sequencing technologies are revolutionizing the sequencing and analysis of plant and pathogen genomes and transcriptomes, as well as contributing to emerging areas of interest in plant-pathogen interactions, disease management techniques, and the introduction of new plant varieties or cultivars. Long-read sequencing (LRS) technologies are progressively being implemented to study plants and pathogens of agricultural importance, which have substantial economic effects. The variability and complexity of the genome and transcriptome affect plant growth, development and pathogen responses. Overcoming the limitations of second-generation sequencing, LRS technology has significantly increased the length of a single contiguous read from a few hundred to millions of base pairs. Because of the longer read lengths, new analysis methods and tools have been developed for plant and pathogen genomics and transcriptomics. LRS technologies enable faster, more efficient, and high-throughput ultralong reads, allowing direct sequencing of genomes that would be impossible or difficult to investigate using short-read sequencing approaches. These benefits include genome assembly in repetitive areas, creating more comprehensive and exact genome determinations, assembling full-length transcripts, and detecting DNA and RNA alterations. Furthermore, these technologies allow for the identification of transcriptome diversity, significant structural variation analysis, and direct epigenetic mark detection in plant and pathogen genomic regions. LRS in plant pathology is found efficient for identifying and characterization of effectors in plants as well as known and unknown plant pathogens. In this review, we investigate how these technologies are transforming the landscape of determination and characterization of plant and pathogen genomes and transcriptomes efficiently and accurately. Moreover, we highlight potential areas of interest offered by LRS technologies for future study into plant-pathogen interactions, disease control strategies, and the development of new plant varieties or cultivars.
Collapse
Affiliation(s)
- Islam Hamim
- Laboratory of Plant Pathology, Graduate School of Agriculture, Tokyo University of Agriculture and Technology, Fuchu, Japan
- International Research Fellow of Japan Society for the Promotion of Science, Tokyo, Japan
- Department of Plant Pathology, Bangladesh Agricultural University, Mymensingh, Bangladesh
| | - Ken-Taro Sekine
- Faculty of Agriculture, University of the Ryukyus, Okinawa, Japan
| | - Ken Komatsu
- Laboratory of Plant Pathology, Graduate School of Agriculture, Tokyo University of Agriculture and Technology, Fuchu, Japan.
| |
Collapse
|
5
|
Bruder MR, Aucoin MG. Utility of Alternative Promoters for Foreign Gene Expression Using the Baculovirus Expression Vector System. Viruses 2022; 14:v14122670. [PMID: 36560674 PMCID: PMC9786725 DOI: 10.3390/v14122670] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2022] [Revised: 10/27/2022] [Accepted: 10/31/2022] [Indexed: 12/03/2022] Open
Abstract
The baculovirus expression vector system (BEVS) is a widely used platform for recombinant protein production for use in a wide variety of applications. Of particular interest is production of virus-like particles (VLPs), which consist of multiple viral proteins that self-assemble in strict stoichiometric ratios to mimic the structure of a virus but lacks its genetic material, while a significant amount of effort has been spent on optimizing expression ratios by co-infecting cells with multiple recombinant BEVs and modulating different process parameters, co-expressing multiple foreign genes from a single rBEV may offer more promise. However, there is currently a lack of promoters available with which to optimize co-expression of each foreign gene. To address this, previously published transcriptome data was used to identify promoters that have incrementally lower expression profiles and compared by expressing model cytoplasmic and secreted proteins. Bioinformatics was also used to identify sequence determinants that may be important for late gene transcription regulation, and translation initiation. The identified promoters and bioinformatics analyses may be useful for optimizing expression of foreign genes in the BEVS.
Collapse
|
6
|
Tombácz D, Kakuk B, Torma G, Csabai Z, Gulyás G, Tamás V, Zádori Z, Jefferson VA, Meyer F, Boldogkői Z. In-Depth Temporal Transcriptome Profiling of an Alphaherpesvirus Using Nanopore Sequencing. Viruses 2022; 14:v14061289. [PMID: 35746760 PMCID: PMC9229804 DOI: 10.3390/v14061289] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2022] [Revised: 06/05/2022] [Accepted: 06/08/2022] [Indexed: 12/10/2022] Open
Abstract
In this work, a long-read sequencing (LRS) technique based on the Oxford Nanopore Technology MinION platform was used for quantifying and kinetic characterization of the poly(A) fraction of bovine alphaherpesvirus type 1 (BoHV-1) lytic transcriptome across a 12-h infection period. Amplification-based LRS techniques frequently generate artefactual transcription reads and are biased towards the production of shorter amplicons. To avoid these undesired effects, we applied direct cDNA sequencing, an amplification-free technique. Here, we show that a single promoter can produce multiple transcription start sites whose distribution patterns differ among the viral genes but are similar in the same gene at different timepoints. Our investigations revealed that the circ gene is expressed with immediate–early (IE) kinetics by utilizing a special mechanism based on the use of the promoter of another IE gene (bicp4) for the transcriptional control. Furthermore, we detected an overlap between the initiation of DNA replication and the transcription from the bicp22 gene, which suggests an interaction between the two molecular machineries. This study developed a generally applicable LRS-based method for the time-course characterization of transcriptomes of any organism.
Collapse
Affiliation(s)
- Dóra Tombácz
- Department of Medical Biology, Albert Szent-Györgyi Medical School, University of Szeged, Somogyi u. 4, 6720 Szeged, Hungary; (D.T.); (B.K.); (G.T.); (Z.C.); (G.G.)
| | - Balázs Kakuk
- Department of Medical Biology, Albert Szent-Györgyi Medical School, University of Szeged, Somogyi u. 4, 6720 Szeged, Hungary; (D.T.); (B.K.); (G.T.); (Z.C.); (G.G.)
| | - Gábor Torma
- Department of Medical Biology, Albert Szent-Györgyi Medical School, University of Szeged, Somogyi u. 4, 6720 Szeged, Hungary; (D.T.); (B.K.); (G.T.); (Z.C.); (G.G.)
| | - Zsolt Csabai
- Department of Medical Biology, Albert Szent-Györgyi Medical School, University of Szeged, Somogyi u. 4, 6720 Szeged, Hungary; (D.T.); (B.K.); (G.T.); (Z.C.); (G.G.)
| | - Gábor Gulyás
- Department of Medical Biology, Albert Szent-Györgyi Medical School, University of Szeged, Somogyi u. 4, 6720 Szeged, Hungary; (D.T.); (B.K.); (G.T.); (Z.C.); (G.G.)
| | - Vivien Tamás
- Institute for Veterinary Medical Research, Centre for Agricultural Research, Hungária krt. 21, 1143 Budapest, Hungary; (V.T.); (Z.Z.)
| | - Zoltán Zádori
- Institute for Veterinary Medical Research, Centre for Agricultural Research, Hungária krt. 21, 1143 Budapest, Hungary; (V.T.); (Z.Z.)
| | - Victoria A. Jefferson
- Department of Biochemistry & Molecular Biology, Entomology & Plant Pathology, Mississippi State University, 408 Dorman P.O. Box 9655, 32 Creelman St., Starkville, MS 39762, USA; (V.A.J.); (F.M.)
| | - Florencia Meyer
- Department of Biochemistry & Molecular Biology, Entomology & Plant Pathology, Mississippi State University, 408 Dorman P.O. Box 9655, 32 Creelman St., Starkville, MS 39762, USA; (V.A.J.); (F.M.)
| | - Zsolt Boldogkői
- Department of Medical Biology, Albert Szent-Györgyi Medical School, University of Szeged, Somogyi u. 4, 6720 Szeged, Hungary; (D.T.); (B.K.); (G.T.); (Z.C.); (G.G.)
- Correspondence:
| |
Collapse
|
7
|
Conserved Viral Transcription Plays a Key Role in Virus-Like Particle Production of the Parasitoid Wasp Venturia canescens. J Virol 2022; 96:e0052422. [PMID: 35678601 DOI: 10.1128/jvi.00524-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Nudiviruses are large double-stranded DNA viruses related to baculoviruses known to be endogenized in the genomes of certain parasitic wasp species. These wasp-virus associations allow the production of viral particles or virus-like particles that ensure wasp parasitism success within lepidopteran hosts. Venturia canescens is an ichneumonid wasp belonging to the Campopleginae subfamily that has endogenized nudivirus genes belonging to the Alphanudivirus genus to produce "virus-like particles" (Venturia canescens virus-like particles [VcVLPs]), which package proteic virulence factors. The main aim of this study was to determine whether alphanudivirus gene functions have been conserved following endogenization. The expression dynamics of alphanudivirus genes was monitored by a high throughput transcriptional approach, and the functional role of lef-4 and lef-8 genes predicted to encode viral RNA polymerase components was investigated by RNA interference. As described for baculovirus infections and for endogenized nudivirus genes in braconid wasp species producing bracoviruses, a transcriptional cascade involving early and late expressed alphanudivirus genes could be observed. The expression of lef-4 and lef-8 was also shown to be required for the expression of alphanudivirus late genes allowing correct particle formation. Together with previous literature, the results show that endogenization of nudiviruses in parasitoid wasps has repeatedly led to the conservation of the viral RNA polymerase function, allowing the production of viruses or viral-like particles that differ in composition but enable wasp parasitic success. IMPORTANCE This study shows that endogenization of a nudivirus genome in a Campopleginae parasitoid wasp has led to the conservation, as for endogenized nudiviruses in braconid parasitoid wasps, of the viral RNA polymerase function, required for the transcription of genes encoding viral particles involved in wasp parasitism success. We also showed for the first time that RNA interference (RNAi) can be successfully used to downregulate gene expression in this species, a model in behavioral ecology. This opens the opportunity to investigate the function of genes involved in other traits important for parasitism success, such as reproductive strategies and host choice. Fundamental data acquired on gene function in Venturia canescens are likely to be transferable to other parasitoid wasp species used in biological control programs. This study also renders possible the investigation of other nudivirus gene functions, for which little data are available.
Collapse
|
8
|
Dual isoform sequencing reveals complex transcriptomic and epitranscriptomic landscapes of a prototype baculovirus. Sci Rep 2022; 12:1291. [PMID: 35079129 PMCID: PMC8789824 DOI: 10.1038/s41598-022-05457-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2021] [Accepted: 01/05/2022] [Indexed: 12/23/2022] Open
Abstract
In this study, two long-read sequencing (LRS) techniques, MinION from Oxford Nanopore Technologies and Sequel from the Pacific Biosciences, were used for the transcriptional characterization of a prototype baculovirus, Autographa californica multiple nucleopolyhedrovirus. LRS is able to read full-length RNA molecules, and thereby distinguish between transcript isoforms, mono- and polycistronic RNAs, and overlapping transcripts. Altogether, we detected 875 transcript species, of which 759 were novel and 116 were annotated previously. These RNA molecules include 41 novel putative protein coding transcripts [each containing 5'-truncated in-frame open reading frames (ORFs), 14 monocistronic transcripts, 99 polygenic RNAs, 101 non-coding RNAs, and 504 untranslated region isoforms. This work also identified novel replication origin-associated transcripts, upstream ORFs, cis-regulatory sequences and poly(A) sites. We also detected RNA methylation in 99 viral genes and RNA hyper-editing in the longer 5'-UTR transcript isoform of the canonical ORF 19 transcript.
Collapse
|
9
|
Fülöp Á, Torma G, Moldován N, Szenthe K, Bánáti F, Almsarrhad IAA, Csabai Z, Tombácz D, Minárovits J, Boldogkői Z. Integrative profiling of Epstein-Barr virus transcriptome using a multiplatform approach. Virol J 2022; 19:7. [PMID: 34991630 PMCID: PMC8740505 DOI: 10.1186/s12985-021-01734-6] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2021] [Accepted: 12/20/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Epstein-Barr virus (EBV) is an important human pathogenic gammaherpesvirus with carcinogenic potential. The EBV transcriptome has previously been analyzed using both Illumina-based short read-sequencing and Pacific Biosciences RS II-based long-read sequencing technologies. Since the various sequencing methods have distinct strengths and limitations, the use of multiplatform approaches have proven to be valuable. The aim of this study is to provide a more complete picture on the transcriptomic architecture of EBV. METHODS In this work, we apply the Oxford Nanopore Technologies MinION (long-read sequencing) platform for the generation of novel transcriptomic data, and integrate these with other's data generated by another LRS approach, Pacific BioSciences RSII sequencing and Illumina CAGE-Seq and Poly(A)-Seq approaches. Both amplified and non-amplified cDNA sequencings were applied for the generation of sequencing reads, including both oligo-d(T) and random oligonucleotide-primed reverse transcription. EBV transcripts are identified and annotated using the LoRTIA software suite developed in our laboratory. RESULTS This study detected novel genes embedded into longer host genes containing 5'-truncated in-frame open reading frames, which potentially encode N-terminally truncated proteins. We also detected a number of novel non-coding RNAs and transcript length isoforms encoded by the same genes but differing in their start and/or end sites. This study also reports the discovery of novel splice isoforms, many of which may represent altered coding potential, and of novel replication-origin-associated transcripts. Additionally, novel mono- and multigenic transcripts were identified. An intricate meshwork of transcriptional overlaps was revealed. CONCLUSIONS An integrative approach applying multi-technique sequencing technologies is suitable for reliable identification of complex transcriptomes because each techniques has different advantages and limitations, and the they can be used for the validation of the results obtained by a particular approach.
Collapse
Affiliation(s)
- Ádám Fülöp
- Department of Medical Biology, Albert Szent-Györgyi Medical School, University of Szeged, Somogyi B. u. 4., Szeged, 6720 Hungary
| | - Gábor Torma
- Department of Medical Biology, Albert Szent-Györgyi Medical School, University of Szeged, Somogyi B. u. 4., Szeged, 6720 Hungary
| | - Norbert Moldován
- Department of Medical Biology, Albert Szent-Györgyi Medical School, University of Szeged, Somogyi B. u. 4., Szeged, 6720 Hungary
| | - Kálmán Szenthe
- Carlsbad Research Organization Ltd., Szabadság u. 2., Újrónafő, 9244 Hungary
| | - Ferenc Bánáti
- RT-Europe Research Center, Vár tér 2., Mosonmagyaróvár, 9200 Hungary
| | - Islam A. A. Almsarrhad
- Department of Medical Biology, Albert Szent-Györgyi Medical School, University of Szeged, Somogyi B. u. 4., Szeged, 6720 Hungary
| | - Zsolt Csabai
- Department of Medical Biology, Albert Szent-Györgyi Medical School, University of Szeged, Somogyi B. u. 4., Szeged, 6720 Hungary
| | - Dóra Tombácz
- Department of Medical Biology, Albert Szent-Györgyi Medical School, University of Szeged, Somogyi B. u. 4., Szeged, 6720 Hungary
| | - János Minárovits
- Department of Oral Biology and Experimental Dental Research, University of Szeged, Tisza Lajos krt. 64, Szeged, 6720 Hungary
| | - Zsolt Boldogkői
- Department of Medical Biology, Albert Szent-Györgyi Medical School, University of Szeged, Somogyi B. u. 4., Szeged, 6720 Hungary
| |
Collapse
|
10
|
Vacca D, Fiannaca A, Tramuto F, Cancila V, La Paglia L, Mazzucco W, Gulino A, La Rosa M, Maida CM, Morello G, Belmonte B, Casuccio A, Maugeri R, Iacopino G, Balistreri CR, Vitale F, Tripodo C, Urso A. Direct RNA Nanopore Sequencing of SARS-CoV-2 Extracted from Critical Material from Swabs. Life (Basel) 2022; 12:69. [PMID: 35054462 PMCID: PMC8778588 DOI: 10.3390/life12010069] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2021] [Revised: 12/24/2021] [Accepted: 12/28/2021] [Indexed: 12/30/2022] Open
Abstract
In consideration of the increasing prevalence of COVID-19 cases in several countries and the resulting demand for unbiased sequencing approaches, we performed a direct RNA sequencing (direct RNA seq.) experiment using critical oropharyngeal swab samples collected from Italian patients infected with SARS-CoV-2 from the Palermo region in Sicily. Here, we identified the sequences SARS-CoV-2 directly in RNA extracted from critical samples using the Oxford Nanopore MinION technology without prior cDNA retrotranscription. Using an appropriate bioinformatics pipeline, we could identify mutations in the nucleocapsid (N) gene, which have been reported previously in studies conducted in other countries. In conclusion, to the best of our knowledge, the technique used in this study has not been used for SARS-CoV-2 detection previously owing to the difficulties in the extraction of RNA of sufficient quantity and quality from routine oropharyngeal swabs. Despite these limitations, this approach provides the advantages of true native RNA sequencing and does not include amplification steps that could introduce systematic errors. This study can provide novel information relevant to the current strategies adopted in SARS-CoV-2 next-generation sequencing.
Collapse
Affiliation(s)
- Davide Vacca
- Department of Surgical, Oncological and Oral Sciences, University of Palermo, 90127 Palermo, Italy
| | - Antonino Fiannaca
- CNR-ICAR, National Research Council of Italy, Via Ugo La Malfa, a5c, 90146 Palermo, Italy; (A.F.); (L.L.P.); (M.L.R.); (A.U.)
| | - Fabio Tramuto
- Department of Health Promotion, Mother and Child Care, Internal Medicine and Medical Specialties “G. D’Alessandro”, Hygiene Section, University of Palermo, 90127 Palermo, Italy; (F.T.); (W.M.); (C.M.M.); (A.C.); (F.V.)
| | - Valeria Cancila
- Tumor Immunology Unit, Department of Health Promotion, Mother and Child Care, Internal Medicine and Medical Specialties “G. D’Alessandro”, University of Palermo, 90127 Palermo, Italy; (V.C.); (G.M.); (B.B.); (C.T.)
| | - Laura La Paglia
- CNR-ICAR, National Research Council of Italy, Via Ugo La Malfa, a5c, 90146 Palermo, Italy; (A.F.); (L.L.P.); (M.L.R.); (A.U.)
| | - Walter Mazzucco
- Department of Health Promotion, Mother and Child Care, Internal Medicine and Medical Specialties “G. D’Alessandro”, Hygiene Section, University of Palermo, 90127 Palermo, Italy; (F.T.); (W.M.); (C.M.M.); (A.C.); (F.V.)
| | - Alessandro Gulino
- Cogentech srl Società Benefit, FIRC Institute of Molecular Oncology (IFOM), Via Adamello 16, 20139 Milan, Italy;
| | - Massimo La Rosa
- CNR-ICAR, National Research Council of Italy, Via Ugo La Malfa, a5c, 90146 Palermo, Italy; (A.F.); (L.L.P.); (M.L.R.); (A.U.)
| | - Carmelo Massimo Maida
- Department of Health Promotion, Mother and Child Care, Internal Medicine and Medical Specialties “G. D’Alessandro”, Hygiene Section, University of Palermo, 90127 Palermo, Italy; (F.T.); (W.M.); (C.M.M.); (A.C.); (F.V.)
| | - Gaia Morello
- Tumor Immunology Unit, Department of Health Promotion, Mother and Child Care, Internal Medicine and Medical Specialties “G. D’Alessandro”, University of Palermo, 90127 Palermo, Italy; (V.C.); (G.M.); (B.B.); (C.T.)
| | - Beatrice Belmonte
- Tumor Immunology Unit, Department of Health Promotion, Mother and Child Care, Internal Medicine and Medical Specialties “G. D’Alessandro”, University of Palermo, 90127 Palermo, Italy; (V.C.); (G.M.); (B.B.); (C.T.)
| | - Alessandra Casuccio
- Department of Health Promotion, Mother and Child Care, Internal Medicine and Medical Specialties “G. D’Alessandro”, Hygiene Section, University of Palermo, 90127 Palermo, Italy; (F.T.); (W.M.); (C.M.M.); (A.C.); (F.V.)
| | - Rosario Maugeri
- Department of Experimental Biomedicine and Clinical Neurosciences, School of Medicine, Neurosurgical Clinic, University of Palermo, 90127 Palermo, Italy; (R.M.); (G.I.)
| | - Gerardo Iacopino
- Department of Experimental Biomedicine and Clinical Neurosciences, School of Medicine, Neurosurgical Clinic, University of Palermo, 90127 Palermo, Italy; (R.M.); (G.I.)
| | - Carmela Rita Balistreri
- Department of Biomedicine, Neuroscience and Advanced Diagnostics (Bi.N.D.), University of Palermo, 90134 Palermo, Italy;
| | - Francesco Vitale
- Department of Health Promotion, Mother and Child Care, Internal Medicine and Medical Specialties “G. D’Alessandro”, Hygiene Section, University of Palermo, 90127 Palermo, Italy; (F.T.); (W.M.); (C.M.M.); (A.C.); (F.V.)
| | - Claudio Tripodo
- Tumor Immunology Unit, Department of Health Promotion, Mother and Child Care, Internal Medicine and Medical Specialties “G. D’Alessandro”, University of Palermo, 90127 Palermo, Italy; (V.C.); (G.M.); (B.B.); (C.T.)
| | - Alfonso Urso
- CNR-ICAR, National Research Council of Italy, Via Ugo La Malfa, a5c, 90146 Palermo, Italy; (A.F.); (L.L.P.); (M.L.R.); (A.U.)
| |
Collapse
|
11
|
Javaran VJ, Moffett P, Lemoyne P, Xu D, Adkar-Purushothama CR, Fall ML. Grapevine Virology in the Third-Generation Sequencing Era: From Virus Detection to Viral Epitranscriptomics. PLANTS (BASEL, SWITZERLAND) 2021; 10:plants10112355. [PMID: 34834718 PMCID: PMC8623739 DOI: 10.3390/plants10112355] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/16/2021] [Revised: 10/16/2021] [Accepted: 10/29/2021] [Indexed: 05/30/2023]
Abstract
Among all economically important plant species in the world, grapevine (Vitis vinifera L.) is the most cultivated fruit plant. It has a significant impact on the economies of many countries through wine and fresh and dried fruit production. In recent years, the grape and wine industry has been facing outbreaks of known and emerging viral diseases across the world. Although high-throughput sequencing (HTS) has been used extensively in grapevine virology, the application and potential of third-generation sequencing have not been explored in understanding grapevine viruses and their impact on the grapevine. Nanopore sequencing, a third-generation technology, can be used for the direct sequencing of both RNA and DNA with minimal infrastructure. Compared to other HTS methods, the MinION nanopore platform is faster and more cost-effective and allows for long-read sequencing. Due to the size of the MinION device, it can be easily carried for field viral disease surveillance. This review article discusses grapevine viruses, the principle of third-generation sequencing platforms, and the application of nanopore sequencing technology in grapevine virus detection, virus-plant interactions, as well as the characterization of viral RNA modifications.
Collapse
Affiliation(s)
- Vahid Jalali Javaran
- Saint-Jean-sur-Richelieu Research and Development Centre, Agriculture and Agri-Food Canada, Saint-Jean-sur-Richelieu, QC J3B 3E6, Canada; (V.J.J.); (P.L.); (D.X.)
- Département de Biologie, Centre SÈVE, Université de Sherbrooke, Sherbrooke, QC J1K 2R1, Canada;
| | - Peter Moffett
- Département de Biologie, Centre SÈVE, Université de Sherbrooke, Sherbrooke, QC J1K 2R1, Canada;
| | - Pierre Lemoyne
- Saint-Jean-sur-Richelieu Research and Development Centre, Agriculture and Agri-Food Canada, Saint-Jean-sur-Richelieu, QC J3B 3E6, Canada; (V.J.J.); (P.L.); (D.X.)
| | - Dong Xu
- Saint-Jean-sur-Richelieu Research and Development Centre, Agriculture and Agri-Food Canada, Saint-Jean-sur-Richelieu, QC J3B 3E6, Canada; (V.J.J.); (P.L.); (D.X.)
| | - Charith Raj Adkar-Purushothama
- Département de Biochimie, Faculté de Médecine des Sciences de la Santé, 3201 rue Jean-Mignault, Sherbrooke, QC J1E 4K8, Canada;
| | - Mamadou Lamine Fall
- Saint-Jean-sur-Richelieu Research and Development Centre, Agriculture and Agri-Food Canada, Saint-Jean-sur-Richelieu, QC J3B 3E6, Canada; (V.J.J.); (P.L.); (D.X.)
| |
Collapse
|
12
|
Kakuk B, Kiss AA, Torma G, Csabai Z, Prazsák I, Mizik M, Megyeri K, Tombácz D, Boldogkői Z. Nanopore Assay Reveals Cell-Type-Dependent Gene Expression of Vesicular Stomatitis Indiana Virus and Differential Host Cell Response. Pathogens 2021; 10:pathogens10091196. [PMID: 34578228 PMCID: PMC8468008 DOI: 10.3390/pathogens10091196] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2021] [Revised: 09/09/2021] [Accepted: 09/10/2021] [Indexed: 11/16/2022] Open
Abstract
Vesicular stomatitis Indiana virus (VSIV) of genus Vesiculovirus, species IndianaVesiculovirus (formerly as Vesicular stomatitis virus, VSV) causes a disease in livestock that is very similar to the foot and mouth disease, thereby an outbreak may lead to significant economic loss. Long-read sequencing (LRS) -based approaches already reveal a hidden complexity of the transcriptomes in several viruses. This technique has been utilized for the sequencing of the VSIV genome, but our study is the first for the application of this technique for the profiling of the VSIV transcriptome. Since LRS is able to sequence full-length RNA molecules, it thereby provides more accurate annotation of the transcriptomes than the traditional short-read sequencing methods. The objectives of this study were to assemble the complete transcriptome of using nanopore sequencing, to ascertain cell-type specificity and dynamics of viral gene expression, and to evaluate host gene expression changes induced by the viral infection. We carried out a time-course analysis of VSIV gene expression in human glioblastoma and primate fibroblast cell lines using a nanopore-based LRS approach and applied both amplified and direct cDNA sequencing (as well as cap-selection) for a fraction of samples. Our investigations revealed that, although the VSIV genome is simple, it generates a relatively complex transcriptomic architecture. In this study, we also demonstrated that VSIV transcripts vary in structure and exhibit differential gene expression patterns in the two examined cell types.
Collapse
Affiliation(s)
- Balázs Kakuk
- Department of Medical Biology, Faculty of Medicine, University of Szeged, 6720 Szeged, Hungary; (B.K.); (A.A.K.); (G.T.); (Z.C.); (I.P.); (M.M.); (D.T.)
| | - András Attila Kiss
- Department of Medical Biology, Faculty of Medicine, University of Szeged, 6720 Szeged, Hungary; (B.K.); (A.A.K.); (G.T.); (Z.C.); (I.P.); (M.M.); (D.T.)
| | - Gábor Torma
- Department of Medical Biology, Faculty of Medicine, University of Szeged, 6720 Szeged, Hungary; (B.K.); (A.A.K.); (G.T.); (Z.C.); (I.P.); (M.M.); (D.T.)
| | - Zsolt Csabai
- Department of Medical Biology, Faculty of Medicine, University of Szeged, 6720 Szeged, Hungary; (B.K.); (A.A.K.); (G.T.); (Z.C.); (I.P.); (M.M.); (D.T.)
| | - István Prazsák
- Department of Medical Biology, Faculty of Medicine, University of Szeged, 6720 Szeged, Hungary; (B.K.); (A.A.K.); (G.T.); (Z.C.); (I.P.); (M.M.); (D.T.)
| | - Máté Mizik
- Department of Medical Biology, Faculty of Medicine, University of Szeged, 6720 Szeged, Hungary; (B.K.); (A.A.K.); (G.T.); (Z.C.); (I.P.); (M.M.); (D.T.)
| | - Klára Megyeri
- Department of Medical Microbiology and Immunobiology, Faculty of Medicine, University of Szeged, 6720 Szeged, Hungary;
| | - Dóra Tombácz
- Department of Medical Biology, Faculty of Medicine, University of Szeged, 6720 Szeged, Hungary; (B.K.); (A.A.K.); (G.T.); (Z.C.); (I.P.); (M.M.); (D.T.)
| | - Zsolt Boldogkői
- Department of Medical Biology, Faculty of Medicine, University of Szeged, 6720 Szeged, Hungary; (B.K.); (A.A.K.); (G.T.); (Z.C.); (I.P.); (M.M.); (D.T.)
- Correspondence:
| |
Collapse
|
13
|
Brancaccio RN, Robitaille A, Dutta S, Rollison DE, Tommasino M, Gheit T. MinION nanopore sequencing and assembly of a complete human papillomavirus genome. J Virol Methods 2021; 294:114180. [PMID: 33965458 PMCID: PMC8223502 DOI: 10.1016/j.jviromet.2021.114180] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2020] [Revised: 03/26/2021] [Accepted: 05/03/2021] [Indexed: 01/01/2023]
Abstract
BACKGROUND The MinION sequencer belongs to the third generation of sequencing technology that allows for the generation of ultra-long reads, representing a potentially more effective approach to characterize entire viral genome sequences than other time-consuming and low-throughput methodologies. METHODS We report the use of the MinION nanopore sequencer to sequence the full-length genome of human papillomavirus (HPV)-ICB2 (7441 bp), which was previously characterized in our laboratory. Three independent MinION libraries were prepared and sequenced using either three consecutive 12 -h runs (Protocol A) or a single run of 48 h starting from a pool of three barcoded DNA libraries (Protocol B). A fully automated bioinformatics pipeline was developed for the reconstruction of the viral genome. RESULTS Protocols A and B generated 9,354,933 and 3,255,879 reads, respectively. Read length N50 values ranged between 6976 and 7360 nucleotides over the four sequencing runs. Bioinformatics analysis showed that both protocols allowed for the reconstruction of the whole viral genome, with pairwise percentages of identity to HPV-ICB2 of 100 % for protocol A and 99.98 % for protocol B. CONCLUSION Our results show that the use of the MinION nanopore sequencer represents an effective strategy for whole-genome sequencing of HPVs with a minimal error rate.
Collapse
Affiliation(s)
- Rosario N Brancaccio
- Early Detection, Prevention and Infections Branch, International Agency for Research on Cancer, Lyon, France
| | - Alexis Robitaille
- Early Detection, Prevention and Infections Branch, International Agency for Research on Cancer, Lyon, France
| | - Sankhadeep Dutta
- Chittaranjan National Cancer Institute, Department of Viral Associated Human Cancer, Kolkata, India
| | - Dana E Rollison
- Department of Cancer Epidemiology, Moffitt Cancer Center, Tampa, FL, USA
| | - Massimo Tommasino
- Early Detection, Prevention and Infections Branch, International Agency for Research on Cancer, Lyon, France
| | - Tarik Gheit
- Early Detection, Prevention and Infections Branch, International Agency for Research on Cancer, Lyon, France.
| |
Collapse
|
14
|
Time-Course Transcriptome Profiling of a Poxvirus Using Long-Read Full-Length Assay. Pathogens 2021; 10:pathogens10080919. [PMID: 34451383 PMCID: PMC8398953 DOI: 10.3390/pathogens10080919] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2021] [Revised: 06/29/2021] [Accepted: 07/12/2021] [Indexed: 11/16/2022] Open
Abstract
Viral transcriptomes that are determined using first- and second-generation sequencing techniques are incomplete. Due to the short read length, these methods are inefficient or fail to distinguish between transcript isoforms, polycistronic RNAs, and transcriptional overlaps and readthroughs. Additionally, these approaches are insensitive for the identification of splice and transcriptional start sites (TSSs) and, in most cases, transcriptional end sites (TESs), especially in transcript isoforms with varying transcript ends, and in multi-spliced transcripts. Long-read sequencing is able to read full-length nucleic acids and can therefore be used to assemble complete transcriptome atlases. Although vaccinia virus (VACV) does not produce spliced RNAs, its transcriptome has a high diversity of TSSs and TESs, and a high degree of polycistronism that leads to enormous complexity. We applied single-molecule, real-time, and nanopore-based sequencing methods to investigate the time-lapse transcriptome patterns of VACV gene expression.
Collapse
|
15
|
Kakuk B, Tombácz D, Balázs Z, Moldován N, Csabai Z, Torma G, Megyeri K, Snyder M, Boldogkői Z. Combined nanopore and single-molecule real-time sequencing survey of human betaherpesvirus 5 transcriptome. Sci Rep 2021; 11:14487. [PMID: 34262076 PMCID: PMC8280142 DOI: 10.1038/s41598-021-93593-y] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2021] [Accepted: 06/28/2021] [Indexed: 02/08/2023] Open
Abstract
Long-read sequencing (LRS), a powerful novel approach, is able to read full-length transcripts and confers a major advantage over the earlier gold standard short-read sequencing in the efficiency of identifying for example polycistronic transcripts and transcript isoforms, including transcript length- and splice variants. In this work, we profile the human cytomegalovirus transcriptome using two third-generation LRS platforms: the Sequel from Pacific BioSciences, and MinION from Oxford Nanopore Technologies. We carried out both cDNA and direct RNA sequencing, and applied the LoRTIA software, developed in our laboratory, for the transcript annotations. This study identified a large number of novel transcript variants, including splice isoforms and transcript start and end site isoforms, as well as putative mRNAs with truncated in-frame ORFs (located within the larger ORFs of the canonical mRNAs), which potentially encode N-terminally truncated polypeptides. Our work also disclosed a highly complex meshwork of transcriptional read-throughs and overlaps.
Collapse
Affiliation(s)
- Balázs Kakuk
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Somogyi B. u. 4, 6720, Szeged, Hungary
| | - Dóra Tombácz
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Somogyi B. u. 4, 6720, Szeged, Hungary
- MTA-SZTE Momentum GeMiNI Research Group, University of Szeged, Somogyi B. u. 4, 6720, Szeged, Hungary
- Department of Genetics, School of Medicine, Stanford University, 300 Pasteur Dr, Stanford, CA, USA
| | - Zsolt Balázs
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Somogyi B. u. 4, 6720, Szeged, Hungary
| | - Norbert Moldován
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Somogyi B. u. 4, 6720, Szeged, Hungary
| | - Zsolt Csabai
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Somogyi B. u. 4, 6720, Szeged, Hungary
| | - Gábor Torma
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Somogyi B. u. 4, 6720, Szeged, Hungary
| | - Klára Megyeri
- Department of Medical Microbiology and Immunobiology, Faculty of Medicine, University of Szeged, Szeged, 6720, Hungary
| | - Michael Snyder
- Department of Genetics, School of Medicine, Stanford University, 300 Pasteur Dr, Stanford, CA, USA
| | - Zsolt Boldogkői
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Somogyi B. u. 4, 6720, Szeged, Hungary.
| |
Collapse
|
16
|
Maróti Z, Tombácz D, Prazsák I, Moldován N, Csabai Z, Torma G, Balázs Z, Kalmár T, Dénes B, Snyder M, Boldogkői Z. Time-course transcriptome analysis of host cell response to poxvirus infection using a dual long-read sequencing approach. BMC Res Notes 2021; 14:239. [PMID: 34167576 PMCID: PMC8223271 DOI: 10.1186/s13104-021-05657-x] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2021] [Accepted: 06/15/2021] [Indexed: 11/30/2022] Open
Abstract
OBJECTIVE In this study, we applied two long-read sequencing (LRS) approaches, including single-molecule real-time and nanopore-based sequencing methods to investigate the time-lapse transcriptome patterns of host gene expression as a response to Vaccinia virus infection. Transcriptomes determined using short-read sequencing approaches are incomplete because these platforms are inefficient or fail to distinguish between polycistronic RNAs, transcript isoforms, transcriptional start sites, as well as transcriptional readthroughs and overlaps. Long-read sequencing is able to read full-length nucleic acids and can therefore be used to assemble complete transcriptome atlases. RESULTS In this work, we identified a number of novel transcripts and transcript isoforms of Chlorocebus sabaeus. Additionally, analysis of the most abundant 768 host transcripts revealed a significant overrepresentation of the class of genes in the "regulation of signaling receptor activity" Gene Ontology annotation as a result of viral infection.
Collapse
Affiliation(s)
- Zoltán Maróti
- Department of Pediatrics, Faculty of Medicine, University of Szeged, Szeged, Hungary
| | - Dóra Tombácz
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Szeged, Hungary
- Department of Genetics, School of Medicine, Stanford University, Stanford, CA, USA
| | - István Prazsák
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Szeged, Hungary
| | - Norbert Moldován
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Szeged, Hungary
| | - Zsolt Csabai
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Szeged, Hungary
| | - Gábor Torma
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Szeged, Hungary
| | - Zsolt Balázs
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Szeged, Hungary
| | - Tibor Kalmár
- Department of Pediatrics, Faculty of Medicine, University of Szeged, Szeged, Hungary
| | - Béla Dénes
- Veterinary Diagnostic Directorate of the National Food Chain Safety Office, Budapest, Hungary
| | - Michael Snyder
- Department of Genetics, School of Medicine, Stanford University, Stanford, CA, USA
| | - Zsolt Boldogkői
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Szeged, Hungary.
| |
Collapse
|
17
|
Tombácz D, Moldován N, Torma G, Nagy T, Hornyák Á, Csabai Z, Gulyás G, Boldogkői M, Jefferson VA, Zádori Z, Meyer F, Boldogkői Z. Dynamic Transcriptome Sequencing of Bovine Alphaherpesvirus Type 1 and Host Cells Carried Out by a Multi-Technique Approach. Front Genet 2021; 12:619056. [PMID: 33897757 PMCID: PMC8059770 DOI: 10.3389/fgene.2021.619056] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2020] [Accepted: 03/16/2021] [Indexed: 12/12/2022] Open
Affiliation(s)
- Dóra Tombácz
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Szeged, Hungary
| | - Norbert Moldován
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Szeged, Hungary
| | - Gábor Torma
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Szeged, Hungary
| | - Tibor Nagy
- Department of Biochemistry and Molecular Biology, Faculty of Medicine, University of Debrecen, Debrecen, Hungary
| | - Ákos Hornyák
- Institute for Veterinary Medical Research, Centre for Agricultural Research, Budapest, Hungary
| | - Zsolt Csabai
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Szeged, Hungary
| | - Gábor Gulyás
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Szeged, Hungary
| | - Miklós Boldogkői
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Szeged, Hungary
| | - Victoria A Jefferson
- Department of Biochemistry & Molecular Biology, Entomology & Plant Pathology, Mississippi State University, Starkville, MS, United States
| | - Zoltán Zádori
- Institute for Veterinary Medical Research, Centre for Agricultural Research, Budapest, Hungary
| | - Florencia Meyer
- Department of Biochemistry & Molecular Biology, Entomology & Plant Pathology, Mississippi State University, Starkville, MS, United States
| | - Zsolt Boldogkői
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Szeged, Hungary
| |
Collapse
|
18
|
Torma G, Tombácz D, Csabai Z, Moldován N, Mészáros I, Zádori Z, Boldogkői Z. Combined Short and Long-Read Sequencing Reveals a Complex Transcriptomic Architecture of African Swine Fever Virus. Viruses 2021; 13:v13040579. [PMID: 33808073 PMCID: PMC8103240 DOI: 10.3390/v13040579] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2021] [Revised: 03/17/2021] [Accepted: 03/28/2021] [Indexed: 11/16/2022] Open
Abstract
African swine fever virus (ASFV) is a large DNA virus belonging to the Asfarviridae family. Despite its agricultural importance, little is known about the fundamental molecular mechanisms of this pathogen. Short-read sequencing (SRS) can produce a huge amount of high-precision sequencing reads for transcriptomic profiling, but it is inefficient for comprehensively annotating transcriptomes. Long-read sequencing (LRS) can overcome some of SRS's limitations, but it also has drawbacks, such as low-coverage and high error rate. The limitations of the two approaches can be surmounted by the combined use of these techniques. In this study, we used Illumina SRS and Oxford Nanopore Technologies LRS platforms with multiple library preparation methods (amplified and direct cDNA sequencings and native RNA sequencing) for constructing the ASFV transcriptomic atlas. This work identified many novel transcripts and transcript isoforms and annotated the precise termini of previously described RNAs. This study identified a novel species of ASFV transcripts, the replication origin-associated RNAs. Additionally, we discovered several nested genes embedded into larger canonical genes. In contrast to the current view that the ASFV transcripts are monocistronic, we detected a significant extent of polycistronism. A multifaceted meshwork of transcriptional overlaps was also discovered.
Collapse
Affiliation(s)
- Gábor Torma
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Somogyi B. u. 4., 6720 Szeged, Hungary; (G.T.); (D.T.); (Z.C.); (N.M.)
| | - Dóra Tombácz
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Somogyi B. u. 4., 6720 Szeged, Hungary; (G.T.); (D.T.); (Z.C.); (N.M.)
| | - Zsolt Csabai
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Somogyi B. u. 4., 6720 Szeged, Hungary; (G.T.); (D.T.); (Z.C.); (N.M.)
| | - Norbert Moldován
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Somogyi B. u. 4., 6720 Szeged, Hungary; (G.T.); (D.T.); (Z.C.); (N.M.)
| | - István Mészáros
- Institute for Veterinary Medical Research, Centre for Agricultural Research, Hungária krt. 21, H-1143 Budapest, Hungary; (I.M.); (Z.Z.)
| | - Zoltán Zádori
- Institute for Veterinary Medical Research, Centre for Agricultural Research, Hungária krt. 21, H-1143 Budapest, Hungary; (I.M.); (Z.Z.)
| | - Zsolt Boldogkői
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Somogyi B. u. 4., 6720 Szeged, Hungary; (G.T.); (D.T.); (Z.C.); (N.M.)
- Correspondence:
| |
Collapse
|
19
|
Direct Nanopore Sequencing of mRNA Reveals Landscape of Transcript Isoforms in Apicomplexan Parasites. mSystems 2021; 6:6/2/e01081-20. [PMID: 33688018 PMCID: PMC8561664 DOI: 10.1128/msystems.01081-20] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
Alternative splicing is a widespread phenomenon in metazoans by which single genes are able to produce multiple isoforms of the gene product. However, this has been poorly characterized in apicomplexans, a major phylum of some of the most important global parasites. Efforts have been hampered by atypical transcriptomic features, such as the high AU content of Plasmodium RNA, but also the limitations of short-read sequencing in deciphering complex splicing events. In this study, we utilized the long read direct RNA sequencing platform developed by Oxford Nanopore Technologies to survey the alternative splicing landscape of Toxoplasma gondii and Plasmodium falciparum. We find that while native RNA sequencing has a reduced throughput, it allows us to obtain full-length or nearly full-length transcripts with comparable quantification to Illumina sequencing. By comparing these data with available gene models, we find widespread alternative splicing, particularly intron retention, in these parasites. Most of these transcripts contain premature stop codons, suggesting that in these parasites, alternative splicing represents a pathway to transcriptomic diversity, rather than expanding proteomic diversity. Moreover, alternative splicing rates are comparable between parasites, suggesting a shared splicing machinery, despite notable transcriptomic differences between the parasites. This study highlights a strategy in using long-read sequencing to understand splicing events at the whole-transcript level and has implications in the future interpretation of transcriptome sequencing studies. IMPORTANCE We have used a novel nanopore sequencing technology to directly analyze parasite transcriptomes. The very long reads of this technology reveal the full-length genes of the parasites that cause malaria and toxoplasmosis. Gene transcripts must be processed in a process called splicing before they can be translated to protein. Our analysis reveals that these parasites very frequently only partially process their gene products, in a manner that departs dramatically from their human hosts.
Collapse
|
20
|
Advances in the Bioinformatics Knowledge of mRNA Polyadenylation in Baculovirus Genes. Viruses 2020; 12:v12121395. [PMID: 33291215 PMCID: PMC7762203 DOI: 10.3390/v12121395] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2020] [Revised: 11/19/2020] [Accepted: 11/30/2020] [Indexed: 11/17/2022] Open
Abstract
Baculoviruses are a group of insect viruses with large circular dsDNA genomes exploited in numerous biotechnological applications, such as the biological control of agricultural pests, the expression of recombinant proteins or the gene delivery of therapeutic sequences in mammals, among others. Their genomes encode between 80 and 200 proteins, of which 38 are shared by all reported species. Thanks to multi-omic studies, there is remarkable information about the baculoviral proteome and the temporality in the virus gene expression. This allows some functional elements of the genome to be very well described, such as promoters and open reading frames. However, less information is available about the transcription termination signals and, consequently, there are still imprecisions about what are the limits of the transcriptional units present in the baculovirus genomes and how is the processing of the 3′ end of viral mRNA. Regarding to this, in this review we provide an update about the characteristics of DNA signals involved in this process and we contribute to their correct prediction through an exhaustive analysis that involves bibliography information, data mining, RNA structure and a comprehensive study of the core gene 3′ ends from 180 baculovirus genomes.
Collapse
|
21
|
Time-course profiling of bovine alphaherpesvirus 1.1 transcriptome using multiplatform sequencing. Sci Rep 2020; 10:20496. [PMID: 33235226 PMCID: PMC7686369 DOI: 10.1038/s41598-020-77520-1] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2020] [Accepted: 11/04/2020] [Indexed: 12/11/2022] Open
Abstract
Long-read sequencing (LRS) has become a standard approach for transcriptome analysis in recent years. Bovine alphaherpesvirus 1 (BoHV-1) is an important pathogen of cattle worldwide. This study reports the profiling of the dynamic lytic transcriptome of BoHV-1 using two long-read sequencing (LRS) techniques, the Oxford Nanopore Technologies MinION, and the LoopSeq synthetic LRS methods, using multiple library preparation protocols. In this work, we annotated viral mRNAs and non-coding transcripts, and a large number of transcript isoforms, including transcription start and end sites, as well as splice variants of BoHV-1. Our analysis demonstrated an extremely complex pattern of transcriptional overlaps.
Collapse
|
22
|
Long-read assays shed new light on the transcriptome complexity of a viral pathogen. Sci Rep 2020; 10:13822. [PMID: 32796917 PMCID: PMC7427789 DOI: 10.1038/s41598-020-70794-5] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2020] [Accepted: 08/03/2020] [Indexed: 12/21/2022] Open
Abstract
Characterization of global transcriptomes using conventional short-read sequencing is challenging due to the insensitivity of these platforms to transcripts isoforms, multigenic RNA molecules, and transcriptional overlaps. Long-read sequencing (LRS) can overcome these limitations by reading full-length transcripts. Employment of these technologies has led to the redefinition of transcriptional complexities in reported organisms. In this study, we applied LRS platforms from Pacific Biosciences and Oxford Nanopore Technologies to profile the vaccinia virus (VACV) transcriptome. We performed cDNA and direct RNA sequencing analyses and revealed an extremely complex transcriptional landscape of this virus. In particular, VACV genes produce large numbers of transcript isoforms that vary in their start and termination sites. A significant fraction of VACV transcripts start or end within coding regions of neighbouring genes. This study provides new insights into the transcriptomic profile of this viral pathogen.
Collapse
|
23
|
Meta-analytic approach for transcriptome profiling of herpes simplex virus type 1. Sci Data 2020; 7:223. [PMID: 32647284 PMCID: PMC7347551 DOI: 10.1038/s41597-020-0558-8] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2020] [Accepted: 06/09/2020] [Indexed: 11/09/2022] Open
Abstract
In this meta-analysis, we re-analysed and compared herpes simplex virus type 1 transcriptomic data generated by eight studies using various short- and long-read sequencing techniques and different library preparation methods. We identified a large number of novel mRNAs, non-coding RNAs and transcript isoforms, and validated many previously published transcripts. Here, we present the most complete HSV-1 transcriptome to date. Furthermore, we also demonstrate that various sequencing techniques, including both cDNA and direct RNA sequencing approaches, are error-prone, which can be circumvented by using integrated approaches. This work draws attention to the need for using multiple sequencing approaches and meta-analyses in transcriptome profiling studies to obtain reliable results.
Collapse
|
24
|
Oikonomopoulos S, Bayega A, Fahiminiya S, Djambazian H, Berube P, Ragoussis J. Methodologies for Transcript Profiling Using Long-Read Technologies. Front Genet 2020; 11:606. [PMID: 32733532 PMCID: PMC7358353 DOI: 10.3389/fgene.2020.00606] [Citation(s) in RCA: 53] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2020] [Accepted: 05/19/2020] [Indexed: 12/28/2022] Open
Abstract
RNA sequencing using next-generation sequencing technologies (NGS) is currently the standard approach for gene expression profiling, particularly for large-scale high-throughput studies. NGS technologies comprise high throughput, cost efficient short-read RNA-Seq, while emerging single molecule, long-read RNA-Seq technologies have enabled new approaches to study the transcriptome and its function. The emerging single molecule, long-read technologies are currently commercially available by Pacific Biosciences (PacBio) and Oxford Nanopore Technologies (ONT), while new methodologies based on short-read sequencing approaches are also being developed in order to provide long range single molecule level information-for example, the ones represented by the 10x Genomics linked read methodology. The shift toward long-read sequencing technologies for transcriptome characterization is based on current increases in throughput and decreases in cost, making these attractive for de novo transcriptome assembly, isoform expression quantification, and in-depth RNA species analysis. These types of analyses were challenging with standard short sequencing approaches, due to the complex nature of the transcriptome, which consists of variable lengths of transcripts and multiple alternatively spliced isoforms for most genes, as well as the high sequence similarity of highly abundant species of RNA, such as rRNAs. Here we aim to focus on single molecule level sequencing technologies and single-cell technologies that, combined with perturbation tools, allow the analysis of complete RNA species, whether short or long, at high resolution. In parallel, these tools have opened new ways in understanding gene functions at the tissue, network, and pathway levels, as well as their detailed functional characterization. Analysis of the epi-transcriptome, including RNA methylation and modification and the effects of such modifications on biological systems is now enabled through direct RNA sequencing instead of classical indirect approaches. However, many difficulties and challenges remain, such as methodologies to generate full-length RNA or cDNA libraries from all different species of RNAs, not only poly-A containing transcripts, and the identification of allele-specific transcripts due to current error rates of single molecule technologies, while the bioinformatics analysis on long-read data for accurate identification of 5' and 3' UTRs is still in development.
Collapse
Affiliation(s)
- Spyros Oikonomopoulos
- McGill Genome Centre, Department of Human Genetics, McGill University, Montréal, QC, Canada
| | - Anthony Bayega
- McGill Genome Centre, Department of Human Genetics, McGill University, Montréal, QC, Canada
| | - Somayyeh Fahiminiya
- McGill Genome Centre, Department of Human Genetics, McGill University, Montréal, QC, Canada
| | - Haig Djambazian
- McGill Genome Centre, Department of Human Genetics, McGill University, Montréal, QC, Canada
| | - Pierre Berube
- McGill Genome Centre, Department of Human Genetics, McGill University, Montréal, QC, Canada
| | - Jiannis Ragoussis
- McGill Genome Centre, Department of Human Genetics, McGill University, Montréal, QC, Canada
- Department of Bioengineering, McGill University, Montréal, QC, Canada
| |
Collapse
|
25
|
Sosa-Gómez DR, Morgado FS, Corrêa RFT, Silva LA, Ardisson-Araújo DMP, Rodrigues BMP, Oliveira EE, Aguiar RWS, Ribeiro BM. Entomopathogenic Viruses in the Neotropics: Current Status and Recently Discovered Species. NEOTROPICAL ENTOMOLOGY 2020; 49:315-331. [PMID: 32358711 DOI: 10.1007/s13744-020-00770-1] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/14/2019] [Accepted: 03/03/2020] [Indexed: 06/11/2023]
Abstract
The market for biological control of insect pests in the world and in Brazil has grown in recent years due to the unwanted ecological and human health impacts of chemical insecticides. Therefore, research on biological control agents for pest management has also increased. For instance, insect viruses have been used to protect crops and forests around the world for decades. Among insect viruses, the baculoviruses are the most studied and used viral biocontrol agent. More than 700 species of insects have been found to be naturally infected by baculoviruses, with 90% isolated from lepidopteran insects. In this review, some basic aspects of baculovirus infection in vivo and in vitro infection, gene content, viral replication will be discussed. Furthermore, we provide examples of the use of insect viruses for biological pest control and recently characterized baculoviruses in Brazil.
Collapse
Affiliation(s)
- D R Sosa-Gómez
- Empresa Brasileira de Pesquisa Agropecuária, Embrapa Soja, Londrina, PR, Brasil
| | - F S Morgado
- Depto de Biologia Celular, Univ of Brasília, Brasília, DF, Brasil
| | - R F T Corrêa
- Depto de Biotecnologia, Univ Federal de Tocantins, Gurupi, TO, Brasil
| | - L A Silva
- Depto de Biologia Celular, Univ of Brasília, Brasília, DF, Brasil
| | - D M P Ardisson-Araújo
- Depto de Bioquímica e Biologia Molecular, Univ Federal de Santa Maria, Santa Maria, RS, Brasil
| | - B M P Rodrigues
- Depto de Biologia Celular, Univ of Brasília, Brasília, DF, Brasil
| | - E E Oliveira
- Depto de Entomologia, Univ Federal de Viçosa, Viçosa, MG, Brasil
| | - R W S Aguiar
- Depto de Biotecnologia, Univ Federal de Tocantins, Gurupi, TO, Brasil
| | - B M Ribeiro
- Depto de Biologia Celular, Univ of Brasília, Brasília, DF, Brasil.
| |
Collapse
|
26
|
Pitt ME, Nguyen SH, Duarte TPS, Teng H, Blaskovich MAT, Cooper MA, Coin LJM. Evaluating the genome and resistome of extensively drug-resistant Klebsiella pneumoniae using native DNA and RNA Nanopore sequencing. Gigascience 2020; 9:giaa002. [PMID: 32016399 PMCID: PMC6998412 DOI: 10.1093/gigascience/giaa002] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2019] [Revised: 10/17/2019] [Accepted: 01/10/2020] [Indexed: 01/06/2023] Open
Abstract
BACKGROUND Klebsiella pneumoniae frequently harbours multidrug resistance, and current diagnostics struggle to rapidly identify appropriate antibiotics to treat these bacterial infections. The MinION device can sequence native DNA and RNA in real time, providing an opportunity to compare the utility of DNA and RNA for prediction of antibiotic susceptibility. However, the effectiveness of bacterial direct RNA sequencing and base-calling has not previously been investigated. This study interrogated the genome and transcriptome of 4 extensively drug-resistant (XDR) K. pneumoniae clinical isolates; however, further antimicrobial susceptibility testing identified 3 isolates as pandrug-resistant (PDR). RESULTS The majority of acquired resistance (≥75%) resided on plasmids including several megaplasmids (≥100 kb). DNA sequencing detected most resistance genes (≥70%) within 2 hours of sequencing. Neural network-based base-calling of direct RNA achieved up to 86% identity rate, although ≤23% of reads could be aligned. Direct RNA sequencing (with ∼6 times slower pore translocation) was able to identify (within 10 hours) ≥35% of resistance genes, including those associated with resistance to aminoglycosides, β-lactams, trimethoprim, and sulphonamide and also quinolones, rifampicin, fosfomycin, and phenicol in some isolates. Direct RNA sequencing also identified the presence of operons containing up to 3 resistance genes. Polymyxin-resistant isolates showed a heightened transcription of phoPQ (≥2-fold) and the pmrHFIJKLM operon (≥8-fold). Expression levels estimated from direct RNA sequencing displayed strong correlation (Pearson: 0.86) compared to quantitative real-time PCR across 11 resistance genes. CONCLUSION Overall, MinION sequencing rapidly detected the XDR/PDR K. pneumoniae resistome, and direct RNA sequencing provided accurate estimation of expression levels of these genes.
Collapse
Affiliation(s)
- Miranda E Pitt
- Institute for Molecular Bioscience, The University of Queensland, 306 Carmody Road, Brisbane, Queensland, 4072, Australia
- The Peter Doherty Institute for Infection and Immunity, The University of Melbourne, 792 Elizabeth Street, Melbourne, Victoria, 3000, Australia
| | - Son H Nguyen
- Institute for Molecular Bioscience, The University of Queensland, 306 Carmody Road, Brisbane, Queensland, 4072, Australia
| | - Tânia P S Duarte
- Institute for Molecular Bioscience, The University of Queensland, 306 Carmody Road, Brisbane, Queensland, 4072, Australia
| | - Haotian Teng
- Institute for Molecular Bioscience, The University of Queensland, 306 Carmody Road, Brisbane, Queensland, 4072, Australia
| | - Mark A T Blaskovich
- Institute for Molecular Bioscience, The University of Queensland, 306 Carmody Road, Brisbane, Queensland, 4072, Australia
| | - Matthew A Cooper
- Institute for Molecular Bioscience, The University of Queensland, 306 Carmody Road, Brisbane, Queensland, 4072, Australia
| | - Lachlan J M Coin
- Institute for Molecular Bioscience, The University of Queensland, 306 Carmody Road, Brisbane, Queensland, 4072, Australia
- The Peter Doherty Institute for Infection and Immunity, The University of Melbourne, 792 Elizabeth Street, Melbourne, Victoria, 3000, Australia
| |
Collapse
|
27
|
Whole-genome sequencing and comparative transcriptome analysis of Bombyx mori nucleopolyhedrovirus La strain. Virus Genes 2020; 56:249-259. [PMID: 31912283 DOI: 10.1007/s11262-019-01727-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2019] [Accepted: 12/27/2019] [Indexed: 10/25/2022]
Abstract
The Bombyx mori nucleopolyhedrovirus (BmNPV) La is a variant BmNPV strain isolated in Laos. La has different features from BmNPV type strain T3 in virulence, production of the polyhedrin protein, and the formation of multicapsid occlusion-derived viruses. Here, the whole-genome sequence of La was compared to the sequences of nine BmNPV and two Bombyx mandarina nucleopolyhedrovirus strains. The complete La genome consisted of 127,618 base pairs with a G + C content of 40.3% and contained putative 136 open reading frames encoding more than 60 amino acids. The La genome lacked the bro-b gene and had the highest identity with that of the T3 strain. A comparison of the transcriptomes of La- and T3-infected cells showed that the expression levels of the polyhedrin and cathepsin genes were greater in cells infected with La as compared to those infected with T3. Interestingly, the virus genes with different RNA levels between the two BmNPV strains were assembled into five clusters in the genome of La. Also, the RNA levels of host ribosomal protein genes were significantly decreased in cells infected with La as compared to those infected with T3.
Collapse
|
28
|
Tombácz D, Moldován N, Balázs Z, Gulyás G, Csabai Z, Boldogkői M, Snyder M, Boldogkői Z. Multiple Long-Read Sequencing Survey of Herpes Simplex Virus Dynamic Transcriptome. Front Genet 2019; 10:834. [PMID: 31608102 PMCID: PMC6769088 DOI: 10.3389/fgene.2019.00834] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2019] [Accepted: 08/13/2019] [Indexed: 12/12/2022] Open
Abstract
Long-read sequencing (LRS) has become increasingly important in RNA research due to its strength in resolving complex transcriptomic architectures. In this regard, currently two LRS platforms have demonstrated adequate performance: the Single Molecule Real-Time Sequencing by Pacific Biosciences (PacBio) and the nanopore sequencing by Oxford Nanopore Technologies (ONT). Even though these techniques produce lower coverage and are more error prone than short-read sequencing, they continue to be more successful in identifying polycistronic RNAs, transcript isoforms including splice and transcript end variants, as well as transcript overlaps. Recent reports have successfully applied LRS for the investigation of the transcriptome of viruses belonging to various families. These studies have substantially increased the number of previously known viral RNA molecules. In this work, we used the Sequel and MinION technique from PacBio and ONT, respectively, to characterize the lytic transcriptome of the herpes simplex virus type 1 (HSV-1). In most samples, we analyzed the poly(A) fraction of the transcriptome, but we also performed random oligonucleotide-based sequencing. Besides cDNA sequencing, we also carried out native RNA sequencing. Our investigations identified more than 2,300 previously undetected transcripts, including coding, and non-coding RNAs, multi-splice transcripts, as well as polycistronic and complex transcripts. Furthermore, we found previously unsubstantiated transcriptional start sites, polyadenylation sites, and splice sites. A large number of novel transcriptional overlaps were also detected. Random-primed sequencing revealed that each convergent gene pair produces non-polyadenylated read-through RNAs overlapping the partner genes. Furthermore, we identified novel replication-associated transcripts overlapping the HSV-1 replication origins, and novel LAT variants with very long 5' regions, which are co-terminal with the LAT-0.7kb transcript. Overall, our results demonstrated that the HSV-1 transcripts form an extremely complex pattern of overlaps, and that entire viral genome is transcriptionally active. In most viral genes, if not in all, both DNA strands are expressed.
Collapse
Affiliation(s)
- Dóra Tombácz
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Szeged, Hungary
| | - Norbert Moldován
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Szeged, Hungary
| | - Zsolt Balázs
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Szeged, Hungary
| | - Gábor Gulyás
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Szeged, Hungary
| | - Zsolt Csabai
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Szeged, Hungary
| | - Miklós Boldogkői
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Szeged, Hungary
| | - Michael Snyder
- Department of Genetics, School of Medicine, Stanford University, Stanford, CA, United States
| | - Zsolt Boldogkői
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Szeged, Hungary
| |
Collapse
|
29
|
Viehweger A, Krautwurst S, Lamkiewicz K, Madhugiri R, Ziebuhr J, Hölzer M, Marz M. Direct RNA nanopore sequencing of full-length coronavirus genomes provides novel insights into structural variants and enables modification analysis. Genome Res 2019; 29:1545-1554. [PMID: 31439691 DOI: 10.1101/483693] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2018] [Accepted: 08/05/2019] [Indexed: 05/24/2023]
Abstract
Sequence analyses of RNA virus genomes remain challenging owing to the exceptional genetic plasticity of these viruses. Because of high mutation and recombination rates, genome replication by viral RNA-dependent RNA polymerases leads to populations of closely related viruses, so-called "quasispecies." Standard (short-read) sequencing technologies are ill-suited to reconstruct large numbers of full-length haplotypes of (1) RNA virus genomes and (2) subgenome-length (sg) RNAs composed of noncontiguous genome regions. Here, we used a full-length, direct RNA sequencing (DRS) approach based on nanopores to characterize viral RNAs produced in cells infected with a human coronavirus. By using DRS, we were able to map the longest (∼26-kb) contiguous read to the viral reference genome. By combining Illumina and Oxford Nanopore sequencing, we reconstructed a highly accurate consensus sequence of the human coronavirus (HCoV)-229E genome (27.3 kb). Furthermore, by using long reads that did not require an assembly step, we were able to identify, in infected cells, diverse and novel HCoV-229E sg RNAs that remain to be characterized. Also, the DRS approach, which circumvents reverse transcription and amplification of RNA, allowed us to detect methylation sites in viral RNAs. Our work paves the way for haplotype-based analyses of viral quasispecies by showing the feasibility of intra-sample haplotype separation. Even though several technical challenges remain to be addressed to exploit the potential of the nanopore technology fully, our work illustrates that DRS may significantly advance genomic studies of complex virus populations, including predictions on long-range interactions in individual full-length viral RNA haplotypes.
Collapse
Affiliation(s)
- Adrian Viehweger
- RNA Bioinformatics and High-Throughput Analysis, Friedrich Schiller University Jena, 07743 Jena, Germany
- European Virus Bioinformatics Center, Friedrich Schiller University Jena, 07743 Jena, Germany
| | - Sebastian Krautwurst
- RNA Bioinformatics and High-Throughput Analysis, Friedrich Schiller University Jena, 07743 Jena, Germany
- European Virus Bioinformatics Center, Friedrich Schiller University Jena, 07743 Jena, Germany
| | - Kevin Lamkiewicz
- RNA Bioinformatics and High-Throughput Analysis, Friedrich Schiller University Jena, 07743 Jena, Germany
- European Virus Bioinformatics Center, Friedrich Schiller University Jena, 07743 Jena, Germany
| | - Ramakanth Madhugiri
- Institute of Medical Virology, Justus Liebig University Gießen, 35390 Gießen, Germany
| | - John Ziebuhr
- European Virus Bioinformatics Center, Friedrich Schiller University Jena, 07743 Jena, Germany
- Institute of Medical Virology, Justus Liebig University Gießen, 35390 Gießen, Germany
| | - Martin Hölzer
- RNA Bioinformatics and High-Throughput Analysis, Friedrich Schiller University Jena, 07743 Jena, Germany
- European Virus Bioinformatics Center, Friedrich Schiller University Jena, 07743 Jena, Germany
| | - Manja Marz
- RNA Bioinformatics and High-Throughput Analysis, Friedrich Schiller University Jena, 07743 Jena, Germany
- European Virus Bioinformatics Center, Friedrich Schiller University Jena, 07743 Jena, Germany
- Leibniz Institute on Aging-Fritz Lipmann Institute, 07743 Jena, Germany
| |
Collapse
|
30
|
Viehweger A, Krautwurst S, Lamkiewicz K, Madhugiri R, Ziebuhr J, Hölzer M, Marz M. Direct RNA nanopore sequencing of full-length coronavirus genomes provides novel insights into structural variants and enables modification analysis. Genome Res 2019; 29:1545-1554. [PMID: 31439691 PMCID: PMC6724671 DOI: 10.1101/gr.247064.118] [Citation(s) in RCA: 133] [Impact Index Per Article: 26.6] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2018] [Accepted: 08/05/2019] [Indexed: 01/09/2023]
Abstract
Sequence analyses of RNA virus genomes remain challenging owing to the exceptional genetic plasticity of these viruses. Because of high mutation and recombination rates, genome replication by viral RNA-dependent RNA polymerases leads to populations of closely related viruses, so-called “quasispecies.” Standard (short-read) sequencing technologies are ill-suited to reconstruct large numbers of full-length haplotypes of (1) RNA virus genomes and (2) subgenome-length (sg) RNAs composed of noncontiguous genome regions. Here, we used a full-length, direct RNA sequencing (DRS) approach based on nanopores to characterize viral RNAs produced in cells infected with a human coronavirus. By using DRS, we were able to map the longest (∼26-kb) contiguous read to the viral reference genome. By combining Illumina and Oxford Nanopore sequencing, we reconstructed a highly accurate consensus sequence of the human coronavirus (HCoV)-229E genome (27.3 kb). Furthermore, by using long reads that did not require an assembly step, we were able to identify, in infected cells, diverse and novel HCoV-229E sg RNAs that remain to be characterized. Also, the DRS approach, which circumvents reverse transcription and amplification of RNA, allowed us to detect methylation sites in viral RNAs. Our work paves the way for haplotype-based analyses of viral quasispecies by showing the feasibility of intra-sample haplotype separation. Even though several technical challenges remain to be addressed to exploit the potential of the nanopore technology fully, our work illustrates that DRS may significantly advance genomic studies of complex virus populations, including predictions on long-range interactions in individual full-length viral RNA haplotypes.
Collapse
Affiliation(s)
- Adrian Viehweger
- RNA Bioinformatics and High-Throughput Analysis, Friedrich Schiller University Jena, 07743 Jena, Germany.,European Virus Bioinformatics Center, Friedrich Schiller University Jena, 07743 Jena, Germany
| | - Sebastian Krautwurst
- RNA Bioinformatics and High-Throughput Analysis, Friedrich Schiller University Jena, 07743 Jena, Germany.,European Virus Bioinformatics Center, Friedrich Schiller University Jena, 07743 Jena, Germany
| | - Kevin Lamkiewicz
- RNA Bioinformatics and High-Throughput Analysis, Friedrich Schiller University Jena, 07743 Jena, Germany.,European Virus Bioinformatics Center, Friedrich Schiller University Jena, 07743 Jena, Germany
| | - Ramakanth Madhugiri
- Institute of Medical Virology, Justus Liebig University Gießen, 35390 Gießen, Germany
| | - John Ziebuhr
- European Virus Bioinformatics Center, Friedrich Schiller University Jena, 07743 Jena, Germany.,Institute of Medical Virology, Justus Liebig University Gießen, 35390 Gießen, Germany
| | - Martin Hölzer
- RNA Bioinformatics and High-Throughput Analysis, Friedrich Schiller University Jena, 07743 Jena, Germany.,European Virus Bioinformatics Center, Friedrich Schiller University Jena, 07743 Jena, Germany
| | - Manja Marz
- RNA Bioinformatics and High-Throughput Analysis, Friedrich Schiller University Jena, 07743 Jena, Germany.,European Virus Bioinformatics Center, Friedrich Schiller University Jena, 07743 Jena, Germany.,Leibniz Institute on Aging-Fritz Lipmann Institute, 07743 Jena, Germany
| |
Collapse
|
31
|
Boldogkői Z, Moldován N, Balázs Z, Snyder M, Tombácz D. Long-Read Sequencing – A Powerful Tool in Viral Transcriptome Research. Trends Microbiol 2019; 27:578-592. [DOI: 10.1016/j.tim.2019.01.010] [Citation(s) in RCA: 49] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2018] [Revised: 01/21/2019] [Accepted: 01/30/2019] [Indexed: 12/16/2022]
|
32
|
Zhao L, Zhang H, Kohnen MV, Prasad KVSK, Gu L, Reddy ASN. Analysis of Transcriptome and Epitranscriptome in Plants Using PacBio Iso-Seq and Nanopore-Based Direct RNA Sequencing. Front Genet 2019; 10:253. [PMID: 30949200 PMCID: PMC6438080 DOI: 10.3389/fgene.2019.00253] [Citation(s) in RCA: 80] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2018] [Accepted: 03/06/2019] [Indexed: 12/18/2022] Open
Abstract
Nanopore sequencing from Oxford Nanopore Technologies (ONT) and Pacific BioSciences (PacBio) single-molecule real-time (SMRT) long-read isoform sequencing (Iso-Seq) are revolutionizing the way transcriptomes are analyzed. These methods offer many advantages over most widely used high-throughput short-read RNA sequencing (RNA-Seq) approaches and allow a comprehensive analysis of transcriptomes in identifying full-length splice isoforms and several other post-transcriptional events. In addition, direct RNA-Seq provides valuable information about RNA modifications, which are lost during the PCR amplification step in other methods. Here, we present a comprehensive summary of important applications of these technologies in plants, including identification of complex alternative splicing (AS), full-length splice variants, fusion transcripts, and alternative polyadenylation (APA) events. Furthermore, we discuss the impact of the newly developed nanopore direct RNA-Seq in advancing epitranscriptome research in plants. Additionally, we summarize computational tools for identifying and quantifying full-length isoforms and other co/post-transcriptional events and discussed some of the limitations with these methods. Sequencing of transcriptomes using these new single-molecule long-read methods will unravel many aspects of transcriptome complexity in unprecedented ways as compared to previous short-read sequencing approaches. Analysis of plant transcriptomes with these new powerful methods that require minimum sample processing is likely to become the norm and is expected to uncover novel co/post-transcriptional gene regulatory mechanisms that control biological outcomes during plant development and in response to various stresses.
Collapse
Affiliation(s)
- Liangzhen Zhao
- Basic Forestry and Proteomics Research Center, College of Forestry, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Hangxiao Zhang
- Basic Forestry and Proteomics Research Center, College of Forestry, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Markus V. Kohnen
- Basic Forestry and Proteomics Research Center, College of Forestry, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Kasavajhala V. S. K. Prasad
- Program in Cell and Molecular Biology, Department of Biology, Colorado State University, Fort Collins, CO, United States
| | - Lianfeng Gu
- Basic Forestry and Proteomics Research Center, College of Forestry, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Anireddy S. N. Reddy
- Program in Cell and Molecular Biology, Department of Biology, Colorado State University, Fort Collins, CO, United States
| |
Collapse
|
33
|
Masson T, Fabre ML, Ferrelli ML, Pidre ML, Romanowski V. Protein composition of the occlusion bodies of Epinotia aporema granulovirus. PLoS One 2019; 14:e0207735. [PMID: 30753194 PMCID: PMC6372164 DOI: 10.1371/journal.pone.0207735] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2018] [Accepted: 01/21/2019] [Indexed: 12/28/2022] Open
Abstract
Within family Baculoviridae, members of the Betabaculovirus genus are employed as biocontrol agents against lepidopteran pests, either alone or in combination with selected members of the Alphabaculovirus genus. Epinotia aporema granulovirus (EpapGV) is a fast killing betabaculovirus that infects the bean shoot borer (E. aporema) and is a promising biopesticide. Because occlusion bodies (OBs) play a key role in baculovirus horizontal transmission, we investigated the composition of EpapGV OBs. Using mass spectrometry-based proteomics we could identify 56 proteins that are included in the OBs during the final stages of larval infection. Our data provides experimental validation of several annotated hypothetical coding sequences. Proteogenomic mapping against genomic sequence detected a previously unannotated ac110-like core gene and a putative translation fusion product of ORFs epap48 and epap49. Comparative studies of the proteomes available for the family Baculoviridae highlight the conservation of core gene products as parts of the occluded virion. Two proteins specific for betabaculoviruses (Epap48 and Epap95) are incorporated into OBs. Moreover, quantification based on emPAI values showed that Epap95 is one of the most abundant components of EpapGV OBs.
Collapse
Affiliation(s)
- Tomás Masson
- Instituto de Biotecnología y Biología Molecular (IBBM, UNLP-CONICET), Facultad de Ciencias Exactas, Universidad Nacional de La Plata, La Plata, Buenos Aires, Argentina
| | - María Laura Fabre
- Instituto de Biotecnología y Biología Molecular (IBBM, UNLP-CONICET), Facultad de Ciencias Exactas, Universidad Nacional de La Plata, La Plata, Buenos Aires, Argentina
| | - María Leticia Ferrelli
- Instituto de Biotecnología y Biología Molecular (IBBM, UNLP-CONICET), Facultad de Ciencias Exactas, Universidad Nacional de La Plata, La Plata, Buenos Aires, Argentina
| | - Matías Luis Pidre
- Instituto de Biotecnología y Biología Molecular (IBBM, UNLP-CONICET), Facultad de Ciencias Exactas, Universidad Nacional de La Plata, La Plata, Buenos Aires, Argentina
| | - Víctor Romanowski
- Instituto de Biotecnología y Biología Molecular (IBBM, UNLP-CONICET), Facultad de Ciencias Exactas, Universidad Nacional de La Plata, La Plata, Buenos Aires, Argentina
- * E-mail:
| |
Collapse
|
34
|
Boldogkői Z, Balázs Z, Moldován N, Prazsák I, Tombácz D. Novel classes of replication-associated transcripts discovered in viruses. RNA Biol 2019; 16:166-175. [PMID: 30608222 PMCID: PMC6380287 DOI: 10.1080/15476286.2018.1564468] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
The role of RNA molecules in the priming of DNA replication and in providing a template for telomerase extension has been known for decades. Since then, several transcripts have been discovered, which play diverse roles in governing replication, including regulation of RNA primer formation, the recruitment of replication origin (Ori) recognition complex, and the assembly of replication fork. Recent studies on viral transcriptomes have revealed novel classes of replication-associated (ra)RNAs, which are expressed from the genomic locations in close vicinity to the Ori. Many of them overlap the Ori, whereas others are terminated close to the replication origin. These novel transcripts can be both protein-coding and non-coding RNAs. The Ori-overlapping part of the mRNAs is generally either the 5ʹ-untranslated regions (UTRs), or the 3ʹ-UTRs of the longer isoforms. Several raRNAs have been identified in various viral families using primarily third-generation long-read sequencing. Hyper-editing of these transcripts has also been described.
Collapse
Affiliation(s)
- Zsolt Boldogkői
- a Department of Medical Biology, Faculty of Medicine , University of Szeged , Szeged , Hungary
| | - Zsolt Balázs
- a Department of Medical Biology, Faculty of Medicine , University of Szeged , Szeged , Hungary
| | - Norbert Moldován
- a Department of Medical Biology, Faculty of Medicine , University of Szeged , Szeged , Hungary
| | - István Prazsák
- a Department of Medical Biology, Faculty of Medicine , University of Szeged , Szeged , Hungary
| | - Dóra Tombácz
- a Department of Medical Biology, Faculty of Medicine , University of Szeged , Szeged , Hungary
| |
Collapse
|
35
|
Transcriptome-wide analysis of a baculovirus using nanopore sequencing. Sci Data 2018; 5:180276. [PMID: 30512018 PMCID: PMC6278695 DOI: 10.1038/sdata.2018.276] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2018] [Accepted: 10/23/2018] [Indexed: 02/06/2023] Open
Abstract
Autographa californica multiple nucleopolyhedrovirus (AcMNPV) is a prototypic baculovirus infecting specific insects. AcMNPV contains a large double-stranded DNA genome encoding a complex transcriptome. This virus has a widespread application as a vector for the expression of heterologous proteins. Here, we present a dataset, derived from Oxford Nanopore Technologies (ONT) long-read sequencing platform. We used both cDNA and direct RNA sequencing techniques. The dataset contains 520,310 AcMNPV and 1,309,481 host cell reads using the regular cDNA-sequencing method of ONT technique, whereas altogether 6,456 reads were produced by using direct RNA-sequencing. We also used a Cap-selection protocol for certain ONT samples, and obtained 2,568,669 reads by using this method. The raw reads were aligned to the AcMNPV reference genome (KM667940.1). Here, we openly released the ‘static’ and the dynamic transcript catalogue of AcMNPV. This dataset can be used for deep analyses of the transcriptomic and epitranscriptomic patterns of the AcMNPV and the host cell. The data can be also useful for the validation of different bioinformatics software packages and analysis tools.
Collapse
|
36
|
Long-read sequencing uncovers a complex transcriptome topology in varicella zoster virus. BMC Genomics 2018; 19:873. [PMID: 30514211 DOI: 10.1186/s12864-018-5267-8] [Citation(s) in RCA: 43] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2018] [Accepted: 11/19/2018] [Indexed: 12/21/2022] Open
Abstract
BACKGROUND Varicella zoster virus (VZV) is a human pathogenic alphaherpesvirus harboring a relatively large DNA molecule. The VZV transcriptome has already been analyzed by microarray and short-read sequencing analyses. However, both approaches have substantial limitations when used for structural characterization of transcript isoforms, even if supplemented with primer extension or other techniques. Among others, they are inefficient in distinguishing between embedded RNA molecules, transcript isoforms, including splice and length variants, as well as between alternative polycistronic transcripts. It has been demonstrated in several studies that long-read sequencing is able to circumvent these problems. RESULTS In this work, we report the analysis of the VZV lytic transcriptome using the Oxford Nanopore Technologies sequencing platform. These investigations have led to the identification of 114 novel transcripts, including mRNAs, non-coding RNAs, polycistronic RNAs and complex transcripts, as well as 10 novel spliced transcripts and 25 novel transcription start site isoforms and transcription end site isoforms. A novel class of transcripts, the nroRNAs are described in this study. These transcripts are encoded by the genomic region located in close vicinity to the viral replication origin. We also show that the ORF63 exhibits a complex structural variation encompassing the splice sites of VZV latency transcripts. Additionally, we have detected RNA editing in a novel non-coding RNA molecule. CONCLUSIONS Our investigations disclosed a composite transcriptomic architecture of VZV, including the discovery of novel RNA molecules and transcript isoforms, as well as a complex meshwork of transcriptional read-throughs and overlaps. The results represent a substantial advance in the annotation of the VZV transcriptome and in understanding the molecular biology of the herpesviruses in general.
Collapse
|
37
|
Tombácz D, Prazsák I, Szucs A, Dénes B, Snyder M, Boldogkoi Z. Dynamic transcriptome profiling dataset of vaccinia virus obtained from long-read sequencing techniques. Gigascience 2018; 7:5202462. [PMID: 30476066 PMCID: PMC6290886 DOI: 10.1093/gigascience/giy139] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2018] [Accepted: 11/12/2018] [Indexed: 12/21/2022] Open
Abstract
Background Poxviruses are large DNA viruses that infect humans and animals. Vaccinia virus (VACV) has been applied as a live vaccine for immunization against smallpox, which was eradicated by 1980 as a result of worldwide vaccination. VACV is the prototype of poxviruses in the investigation of the molecular pathogenesis of the virus. Short-read sequencing methods have revolutionized transcriptomics; however, they are not efficient in distinguishing between the RNA isoforms and transcript overlaps. Long-read sequencing (LRS) is much better suited to solve these problems and also allow direct RNA sequencing. Despite the scientific relevance of VACV, no LRS data have been generated for the viral transcriptome to date. Findings For the deep characterization of the VACV RNA profile, various LRS platforms and library preparation approaches were applied. The raw reads were mapped to the VACV reference genome and also to the host (Chlorocebus sabaeus) genome. In this study, we applied the Pacific Biosciences RSII and Sequel platforms, which altogether resulted in 937,531 mapped reads of inserts (1.42 Gb), while we obtained 2,160,348 aligned reads (1.75 Gb) from the different library preparation methods using the MinION device from Oxford Nanopore Technologies. Conclusions By applying cutting-edge technologies, we were able to generate a large dataset that can serve as a valuable resource for the investigation of the dynamic VACV transcriptome, the virus-host interactions, and RNA base modifications. These data can provide useful information for novel gene annotations in the VACV genome. Our dataset can also be used to analyze the currently available LRS platforms, library preparation methods, and bioinformatics pipelines.
Collapse
Affiliation(s)
- Dóra Tombácz
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Somogyi B. u. 4., 6720 Szeged, Hungary
| | - István Prazsák
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Somogyi B. u. 4., 6720 Szeged, Hungary
| | - Attila Szucs
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Somogyi B. u. 4., 6720 Szeged, Hungary
| | - Béla Dénes
- Veterinary Diagnostic Directorate of the National Food Chain Safety Office, Tábornok u. 2., 1143 Budapest, Hungary
| | - Michael Snyder
- Department of Genetics, School of Medicine, Stanford University, 300 Pasteur Dr, Stanford, California, USA
| | - Zsolt Boldogkoi
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Somogyi B. u. 4., 6720 Szeged, Hungary
| |
Collapse
|
38
|
Transcriptomic study of Herpes simplex virus type-1 using full-length sequencing techniques. Sci Data 2018; 5:180266. [PMID: 30480662 PMCID: PMC6257044 DOI: 10.1038/sdata.2018.266] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2018] [Accepted: 10/05/2018] [Indexed: 02/07/2023] Open
Abstract
Herpes simplex virus type-1 (HSV-1) is a human pathogenic member of the Alphaherpesvirinae subfamily of herpesviruses. The HSV-1 genome is a large double-stranded DNA specifying about 85 protein coding genes. The latest surveys have demonstrated that the HSV-1 transcriptome is much more complex than it had been thought before. Here, we provide a long-read sequencing dataset, which was generated by using the RSII and Sequel systems from Pacific Biosciences (PacBio), as well as MinION sequencing system from Oxford Nanopore Technologies (ONT). This dataset contains 39,096 reads of inserts (ROIs) mapped to the HSV-1 genome (X14112) in RSII sequencing, while Sequel sequencing yielded 77,851 ROIs. The MinION cDNA sequencing altogether resulted in 158,653 reads, while the direct RNA-seq produced 16,516 reads. This dataset can be utilized for the identification of novel HSV RNAs and transcripts isoforms, as well as for the comparison of the quality and length of the sequencing reads derived from the currently available long-read sequencing platforms. The various library preparation approaches can also be compared with each other.
Collapse
|