2
|
Shigematsu M, Honda S, Loher P, Telonis AG, Rigoutsos I, Kirino Y. YAMAT-seq: an efficient method for high-throughput sequencing of mature transfer RNAs. Nucleic Acids Res 2017; 45:e70. [PMID: 28108659 PMCID: PMC5605243 DOI: 10.1093/nar/gkx005] [Citation(s) in RCA: 58] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2016] [Accepted: 01/03/2017] [Indexed: 11/18/2022] Open
Abstract
Besides translation, transfer RNAs (tRNAs) play many non-canonical roles in various biological pathways and exhibit highly variable expression profiles. To unravel the emerging complexities of tRNA biology and molecular mechanisms underlying them, an efficient tRNA sequencing method is required. However, the rigid structure of tRNA has been presenting a challenge to the development of such methods. We report the development of Y-shaped Adapter-ligated MAture TRNA sequencing (YAMAT-seq), an efficient and convenient method for high-throughput sequencing of mature tRNAs. YAMAT-seq circumvents the issue of inefficient adapter ligation, a characteristic of conventional RNA sequencing methods for mature tRNAs, by employing the efficient and specific ligation of Y-shaped adapter to mature tRNAs using T4 RNA Ligase 2. Subsequent cDNA amplification and next-generation sequencing successfully yield numerous mature tRNA sequences. YAMAT-seq has high specificity for mature tRNAs and high sensitivity to detect most isoacceptors from minute amount of total RNA. Moreover, YAMAT-seq shows quantitative capability to estimate expression levels of mature tRNAs, and has high reproducibility and broad applicability for various cell lines. YAMAT-seq thus provides high-throughput technique for identifying tRNA profiles and their regulations in various transcriptomes, which could play important regulatory roles in translation and other biological processes.
Collapse
Affiliation(s)
- Megumi Shigematsu
- Computational Medicine Center, Sidney Kimmel Medical College, Thomas Jefferson University, Philadelphia, PA 19107, USA
| | - Shozo Honda
- Computational Medicine Center, Sidney Kimmel Medical College, Thomas Jefferson University, Philadelphia, PA 19107, USA
| | - Phillipe Loher
- Computational Medicine Center, Sidney Kimmel Medical College, Thomas Jefferson University, Philadelphia, PA 19107, USA
| | - Aristeidis G Telonis
- Computational Medicine Center, Sidney Kimmel Medical College, Thomas Jefferson University, Philadelphia, PA 19107, USA
| | - Isidore Rigoutsos
- Computational Medicine Center, Sidney Kimmel Medical College, Thomas Jefferson University, Philadelphia, PA 19107, USA
| | - Yohei Kirino
- Computational Medicine Center, Sidney Kimmel Medical College, Thomas Jefferson University, Philadelphia, PA 19107, USA
| |
Collapse
|
4
|
Honda S, Kirino Y. Dumbbell-PCR: a method to quantify specific small RNA variants with a single nucleotide resolution at terminal sequences. Nucleic Acids Res 2015; 43:e77. [PMID: 25779041 PMCID: PMC4499115 DOI: 10.1093/nar/gkv218] [Citation(s) in RCA: 47] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2014] [Accepted: 03/04/2015] [Indexed: 12/15/2022] Open
Abstract
Recent advances in next-generation sequencing technologies have revealed that cellular functional RNAs are not always expressed as single entities with fixed terminal sequences but as multiple isoforms bearing complex heterogeneity in both length and terminal sequences, such as isomiRs, the isoforms of microRNAs. Unraveling the biogenesis and biological significance of heterogenetic RNA expression requires distinctive analysis of each RNA variant. Here, we report the development of dumbbell PCR (Db-PCR), an efficient and convenient method to distinctively quantify a specific individual small RNA variant. In Db-PCR, 5′- and 3′-stem–loop adapters are specifically hybridized and ligated to the 5′- and 3′-ends of target RNAs, respectively, by T4 RNA ligase 2 (Rnl2). The resultant ligation products with ‘dumbbell-like’ structures are subsequently quantified by TaqMan RT-PCR. We confirmed that high specificity of Rnl2 ligation and TaqMan RT-PCR toward target RNAs assured both 5′- and 3′-terminal sequences of target RNAs with single nucleotide resolution so that Db-PCR specifically detected target RNAs but not their corresponding terminal variants. Db-PCR had broad applicability for the quantification of various small RNAs in different cell types, and the results were consistent with those from other quantification method. Therefore, Db-PCR provides a much-needed simple method for analyzing RNA terminal heterogeneity.
Collapse
Affiliation(s)
- Shozo Honda
- Computational Medicine Center, Sidney Kimmel Medical College, Thomas Jefferson University, 1020 Locust Street, Philadelphia, PA 19107, USA
| | - Yohei Kirino
- Computational Medicine Center, Sidney Kimmel Medical College, Thomas Jefferson University, 1020 Locust Street, Philadelphia, PA 19107, USA
| |
Collapse
|
5
|
Ogino T. Capping of vesicular stomatitis virus pre-mRNA is required for accurate selection of transcription stop-start sites and virus propagation. Nucleic Acids Res 2014; 42:12112-25. [PMID: 25274740 PMCID: PMC4231761 DOI: 10.1093/nar/gku901] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023] Open
Abstract
The multifunctional RNA-dependent RNA polymerase L protein of vesicular stomatitis virus catalyzes unconventional pre-mRNA capping via the covalent enzyme-pRNA intermediate formation, which requires the histidine–arginine (HR) motif in the polyribonucleotidyltransferase domain. Here, the effects of cap-defective mutations in the HR motif on transcription were analyzed using an in vitro reconstituted transcription system. The wild-type L protein synthesized the leader RNA from the 3′-end of the genome followed by 5′-capped and 3′-polyadenylated mRNAs from internal genes by a stop–start transcription mechanism. Cap-defective mutants efficiently produced the leader RNA, but displayed aberrant stop–start transcription using cryptic termination and initiation signals within the first gene, resulting in sequential generation of ∼40-nucleotide transcripts with 5′-ATP from a correct mRNA-start site followed by a 28-nucleotide transcript and long 3′-polyadenylated transcript initiated with non-canonical GTP from atypical start sites. Frequent transcription termination and re-initiation within the first gene significantly attenuated the production of downstream mRNAs. Consistent with the inability of these mutants in in vitro mRNA synthesis and capping, these mutations were lethal to virus replication in cultured cells. These findings indicate that viral mRNA capping is required for accurate stop–start transcription as well as mRNA stability and translation and, therefore, for virus replication in host cells.
Collapse
Affiliation(s)
- Tomoaki Ogino
- Department of Molecular Biology and Microbiology, Case Western Reserve University School of Medicine, Cleveland, OH 44106, USA
| |
Collapse
|
6
|
High-throughput sequencing and de novo assembly of Brassica oleracea var. Capitata L. for transcriptome analysis. PLoS One 2014; 9:e92087. [PMID: 24682075 PMCID: PMC3969326 DOI: 10.1371/journal.pone.0092087] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2013] [Accepted: 02/18/2014] [Indexed: 12/28/2022] Open
Abstract
Background The cabbage, Brassica oleracea var. capitata L., has a distinguishable phenotype within the genus Brassica. Despite the economic and genetic importance of cabbage, there is little genomic data for cabbage, and most studies of Brassica are focused on other species or other B. oleracea subspecies. The lack of genomic data for cabbage, a non-model organism, hinders research on its molecular biology. Hence, the construction of reliable transcriptomic data based on high-throughput sequencing technologies is needed to enhance our understanding of cabbage and provide genomic information for future work. Methodology/Principal Findings We constructed cDNAs from total RNA isolated from the roots, leaves, flowers, seedlings, and calcium-limited seedling tissues of two cabbage genotypes: 102043 and 107140. We sequenced a total of six different samples using the Illumina HiSeq platform, producing 40.5 Gbp of sequence data comprising 401,454,986 short reads. We assembled 205,046 transcripts (≥ 200 bp) using the Velvet and Oases assembler and predicted 53,562 loci from the transcripts. We annotated 35,274 of the loci with 55,916 plant peptides in the Phytozome database. The average length of the annotated loci was 1,419 bp. We confirmed the reliability of the sequencing assembly using reverse-transcriptase PCR to identify tissue-specific gene candidates among the annotated loci. Conclusion Our study provides valuable transcriptome sequence data for B. oleracea var. capitata L., offering a new resource for studying B. oleracea and closely related species. Our transcriptomic sequences will enhance the quality of gene annotation and functional analysis of the cabbage genome and serve as a material basis for future genomic research on cabbage. The sequencing data from this study can be used to develop molecular markers and to identify the extreme differences among the phenotypes of different species in the genus Brassica.
Collapse
|
7
|
Structure-function analysis of Methanobacterium thermoautotrophicum RNA ligase - engineering a thermostable ATP independent enzyme. BMC Mol Biol 2012; 13:24. [PMID: 22809063 PMCID: PMC3514331 DOI: 10.1186/1471-2199-13-24] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2012] [Accepted: 07/03/2012] [Indexed: 01/09/2023] Open
Abstract
Background RNA ligases are essential reagents for many methods in molecular biology including NextGen RNA sequencing. To prevent ligation of RNA to itself, ATP independent mutant ligases, defective in self-adenylation, are often used in combination with activated pre-adenylated linkers. It is important that these ligases not have de-adenylation activity, which can result in activation of RNA and formation of background ligation products. An additional useful feature is for the ligase to be active at elevated temperatures. This has the advantage or reducing preferences caused by structures of single-stranded substrates and linkers. Results To create an RNA ligase with these desirable properties we performed mutational analysis of the archaeal thermophilic RNA ligase from Methanobacterium thermoautotrophicum. We identified amino acids essential for ATP binding and reactivity but dispensable for phosphodiester bond formation with 5’ pre-adenylated donor substrate. The motif V lysine mutant (K246A) showed reduced activity in the first two steps of ligation reaction. The mutant has full ligation activity with pre-adenylated substrates but retained the undesirable activity of deadenylation, which is the reverse of step 2 adenylation. A second mutant, an alanine substitution for the catalytic lysine in motif I (K97A) abolished activity in the first two steps of the ligation reaction, but preserved wild type ligation activity in step 3. The activity of the K97A mutant is similar with either pre-adenylated RNA or single-stranded DNA (ssDNA) as donor substrates but we observed two-fold preference for RNA as an acceptor substrate compared to ssDNA with an identical sequence. In contrast, truncated T4 RNA ligase 2, the commercial enzyme used in these applications, is significantly more active using pre-adenylated RNA as a donor compared to pre-adenylated ssDNA. However, the T4 RNA ligases are ineffective in ligating ssDNA acceptors. Conclusions Mutational analysis of the heat stable RNA ligase from Methanobacterium thermoautotrophicum resulted in the creation of an ATP independent ligase. The K97A mutant is defective in the first two steps of ligation but retains full activity in ligation of either RNA or ssDNA to a pre-adenylated linker. The ability of the ligase to function at 65°C should reduce the constraints of RNA secondary structure in RNA ligation experiments.
Collapse
|
8
|
Clepet C, Joobeur T, Zheng Y, Jublot D, Huang M, Truniger V, Boualem A, Hernandez-Gonzalez ME, Dolcet-Sanjuan R, Portnoy V, Mascarell-Creus A, Caño-Delgado AI, Katzir N, Bendahmane A, Giovannoni JJ, Aranda MA, Garcia-Mas J, Fei Z. Analysis of expressed sequence tags generated from full-length enriched cDNA libraries of melon. BMC Genomics 2011; 12:252. [PMID: 21599934 PMCID: PMC3118787 DOI: 10.1186/1471-2164-12-252] [Citation(s) in RCA: 48] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2011] [Accepted: 05/20/2011] [Indexed: 01/09/2023] Open
Abstract
BACKGROUND Melon (Cucumis melo), an economically important vegetable crop, belongs to the Cucurbitaceae family which includes several other important crops such as watermelon, cucumber, and pumpkin. It has served as a model system for sex determination and vascular biology studies. However, genomic resources currently available for melon are limited. RESULT We constructed eleven full-length enriched and four standard cDNA libraries from fruits, flowers, leaves, roots, cotyledons, and calluses of four different melon genotypes, and generated 71,577 and 22,179 ESTs from full-length enriched and standard cDNA libraries, respectively. These ESTs, together with ~35,000 ESTs available in public domains, were assembled into 24,444 unigenes, which were extensively annotated by comparing their sequences to different protein and functional domain databases, assigning them Gene Ontology (GO) terms, and mapping them onto metabolic pathways. Comparative analysis of melon unigenes and other plant genomes revealed that 75% to 85% of melon unigenes had homologs in other dicot plants, while approximately 70% had homologs in monocot plants. The analysis also identified 6,972 gene families that were conserved across dicot and monocot plants, and 181, 1,192, and 220 gene families specific to fleshy fruit-bearing plants, the Cucurbitaceae family, and melon, respectively. Digital expression analysis identified a total of 175 tissue-specific genes, which provides a valuable gene sequence resource for future genomics and functional studies. Furthermore, we identified 4,068 simple sequence repeats (SSRs) and 3,073 single nucleotide polymorphisms (SNPs) in the melon EST collection. Finally, we obtained a total of 1,382 melon full-length transcripts through the analysis of full-length enriched cDNA clones that were sequenced from both ends. Analysis of these full-length transcripts indicated that sizes of melon 5' and 3' UTRs were similar to those of tomato, but longer than many other dicot plants. Codon usages of melon full-length transcripts were largely similar to those of Arabidopsis coding sequences. CONCLUSION The collection of melon ESTs generated from full-length enriched and standard cDNA libraries is expected to play significant roles in annotating the melon genome. The ESTs and associated analysis results will be useful resources for gene discovery, functional analysis, marker-assisted breeding of melon and closely related species, comparative genomic studies and for gaining insights into gene expression patterns.
Collapse
Affiliation(s)
- Christian Clepet
- URGV Plant Genomics, Unité de Recherche en Génomique Végétale, UMR1165 ERL8196 INRA-UEVE-CNRS. 2, Rue Gaston Crémieux, 91057 Evry, France
| | - Tarek Joobeur
- Molecular and Cellular Imaging Center, The Ohio State University, OARDC, 1680 Madison Ave, Wooster, OH 44691, USA
- Seminis Vegetable Seeds, 37437 State Highway 16 Woodland, CA 95695, USA
| | - Yi Zheng
- Boyce Thompson Institute, Cornell University, Ithaca, NY 14853, USA
| | - Delphine Jublot
- URGV Plant Genomics, Unité de Recherche en Génomique Végétale, UMR1165 ERL8196 INRA-UEVE-CNRS. 2, Rue Gaston Crémieux, 91057 Evry, France
| | - Mingyun Huang
- Boyce Thompson Institute, Cornell University, Ithaca, NY 14853, USA
| | - Veronica Truniger
- Centro de Edafología y Biología Aplicada del Segura (CEBAS), Consejo Superior de Investigaciones Científicas (CSIC), Apdo. Correos 164, 30100 Espinardo, Murcia, Spain
| | - Adnane Boualem
- URGV Plant Genomics, Unité de Recherche en Génomique Végétale, UMR1165 ERL8196 INRA-UEVE-CNRS. 2, Rue Gaston Crémieux, 91057 Evry, France
| | | | - Ramon Dolcet-Sanjuan
- IRTA, Center for Research in Agricultural Genomics CSIC-IRTA-UAB, Campus UAB, Edifici CRAG, 08193 Bellaterra (Barcelona), Spain
| | - Vitaly Portnoy
- Department of Vegetable Research, Agricultural Research Organization, Newe Ya'ar Research Center, P.O. Box 1021, Ramat Yishay, 30095, Israel
| | - Albert Mascarell-Creus
- Department de Genètica Molecular, Center for Research in Agricultural Genomics CSIC-IRTA-UAB, Campus UAB, Edifici CRAG, 08193 Bellaterra (Barcelona), Spain
| | - Ana I Caño-Delgado
- Department de Genètica Molecular, Center for Research in Agricultural Genomics CSIC-IRTA-UAB, Campus UAB, Edifici CRAG, 08193 Bellaterra (Barcelona), Spain
| | - Nurit Katzir
- Department of Vegetable Research, Agricultural Research Organization, Newe Ya'ar Research Center, P.O. Box 1021, Ramat Yishay, 30095, Israel
| | - Abdelhafid Bendahmane
- URGV Plant Genomics, Unité de Recherche en Génomique Végétale, UMR1165 ERL8196 INRA-UEVE-CNRS. 2, Rue Gaston Crémieux, 91057 Evry, France
- Department of Plant Production, College of Food and Agricultural Sciences, King Saud University, Riyadh Saudi Arabia
| | - James J Giovannoni
- Boyce Thompson Institute, Cornell University, Ithaca, NY 14853, USA
- USDA Robert W. Holley Center for Agriculture and Health, Tower Road, Ithaca, NY 14853, USA
| | - Miguel A Aranda
- Centro de Edafología y Biología Aplicada del Segura (CEBAS), Consejo Superior de Investigaciones Científicas (CSIC), Apdo. Correos 164, 30100 Espinardo, Murcia, Spain
| | - Jordi Garcia-Mas
- IRTA, Center for Research in Agricultural Genomics CSIC-IRTA-UAB, Campus UAB, Edifici CRAG, 08193 Bellaterra (Barcelona), Spain
| | - Zhangjun Fei
- Boyce Thompson Institute, Cornell University, Ithaca, NY 14853, USA
- USDA Robert W. Holley Center for Agriculture and Health, Tower Road, Ithaca, NY 14853, USA
| |
Collapse
|