1
|
Shaw PJ, Piriyapongsa J, Kaewprommal P, Wongsombat C, Chaosrikul C, Teeravajanadet K, Boonbangyang M, Uthaipibull C, Kamchonwongpaisan S, Tongsima S. Identifying transcript 5' capped ends in Plasmodium falciparum. PeerJ 2021; 9:e11983. [PMID: 34527439 PMCID: PMC8401752 DOI: 10.7717/peerj.11983] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2021] [Accepted: 07/26/2021] [Indexed: 12/15/2022] Open
Abstract
Background The genome of the human malaria parasite Plasmodium falciparum is poorly annotated, in particular, the 5' capped ends of its mRNA transcripts. New approaches are needed to fully catalog P. falciparum transcripts for understanding gene function and regulation in this organism. Methods We developed a transcriptomic method based on next-generation sequencing of complementary DNA (cDNA) enriched for full-length fragments using eIF4E, a 5' cap-binding protein, and an unenriched control. DNA sequencing adapter was added after enrichment of full-length cDNA using two different ligation protocols. From the mapped sequence reads, enrichment scores were calculated for all transcribed nucleotides and used to calculate P-values of 5' capped nucleotide enrichment. Sensitivity and accuracy were increased by combining P-values from replicate experiments. Data were obtained for P. falciparum ring, trophozoite and schizont stages of intra-erythrocytic development. Results 5' capped nucleotide signals were mapped to 17,961 non-overlapping P. falciparum genomic intervals. Analysis of the dominant 5' capped nucleotide in these genomic intervals revealed the presence of two groups with distinctive epigenetic features and sequence patterns. A total of 4,512 transcripts were annotated as 5' capped based on the correspondence of 5' end with 5' capped nucleotide annotated from full-length cDNA data. Discussion The presence of two groups of 5' capped nucleotides suggests that alternative mechanisms may exist for producing 5' capped transcript ends in P. falciparum. The 5' capped transcripts that are antisense, outside of, or partially overlapping coding regions may be important regulators of gene function in P. falciparum.
Collapse
Affiliation(s)
- Philip J Shaw
- National Center for Genetic Engineering and Biotechnology (BIOTEC), National Science and Technology Development Agency (NSTDA), Pathum Thani, Thailand
| | - Jittima Piriyapongsa
- National Biobank of Thailand (NBT), National Science and Technology Development Agency (NSTDA), Pathum Thani, Thailand
| | - Pavita Kaewprommal
- National Biobank of Thailand (NBT), National Science and Technology Development Agency (NSTDA), Pathum Thani, Thailand
| | - Chayaphat Wongsombat
- National Center for Genetic Engineering and Biotechnology (BIOTEC), National Science and Technology Development Agency (NSTDA), Pathum Thani, Thailand
| | - Chadapohn Chaosrikul
- National Biobank of Thailand (NBT), National Science and Technology Development Agency (NSTDA), Pathum Thani, Thailand
| | - Krirkwit Teeravajanadet
- National Center for Genetic Engineering and Biotechnology (BIOTEC), National Science and Technology Development Agency (NSTDA), Pathum Thani, Thailand
| | - Manon Boonbangyang
- National Biobank of Thailand (NBT), National Science and Technology Development Agency (NSTDA), Pathum Thani, Thailand
| | - Chairat Uthaipibull
- National Center for Genetic Engineering and Biotechnology (BIOTEC), National Science and Technology Development Agency (NSTDA), Pathum Thani, Thailand
| | - Sumalee Kamchonwongpaisan
- National Center for Genetic Engineering and Biotechnology (BIOTEC), National Science and Technology Development Agency (NSTDA), Pathum Thani, Thailand
| | - Sissades Tongsima
- National Biobank of Thailand (NBT), National Science and Technology Development Agency (NSTDA), Pathum Thani, Thailand
| |
Collapse
|
2
|
Guerrini MM, Oguchi A, Suzuki A, Murakawa Y. Cap analysis of gene expression (CAGE) and noncoding regulatory elements. Semin Immunopathol 2021; 44:127-136. [PMID: 34468849 DOI: 10.1007/s00281-021-00886-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2021] [Accepted: 08/13/2021] [Indexed: 01/06/2023]
Abstract
Cap analysis of gene expression (CAGE) was developed to detect the 5' end of RNA. Trapping of the RNA 5'-cap structure enables the enrichment and selective sequencing of complete transcripts. Upscaled high-throughput versions of CAGE have enabled the genome-wide identification of transcription start sites, including transcriptionally active promoters and enhancers. CAGE sequencing can be exploited to draw comprehensive maps of active genomic regulatory elements in a cell type- and activation-specific manner. The cells of the immune system are among the best candidates to be analyzed in humans, since they are easily accessible. In this review, we discuss how CAGE data are instrumental for integrative analyses with quantitative trait loci and omics data, and their usefulness in the mechanistic interpretation of the effects of genetic variations over the entire human genome. Integrating CAGE data with the currently available omics information will contribute to better understanding of the genome-wide association study variants that lie outside of annotated genes, deepening our knowledge on human diseases, and enabling the targeted design of more specific therapeutic interventions.
Collapse
Affiliation(s)
- Matteo Maurizio Guerrini
- Laboratory for Autoimmune Diseases, RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa, Japan.
| | - Akiko Oguchi
- RIKEN-IFOM Joint Laboratory for Cancer Genomics, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | - Akari Suzuki
- Laboratory for Autoimmune Diseases, RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa, Japan
| | - Yasuhiro Murakawa
- RIKEN-IFOM Joint Laboratory for Cancer Genomics, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
- IFOM-the FIRC Institute of Molecular Oncology, Milan, Italy
| |
Collapse
|
3
|
Lin L, Cai W, Du Z, Zhang W, Xu Q, Sun W, Chen M. Establishing a System for Functional Characterization of Full-Length cDNAs of Camellia sinensis. Int J Mol Sci 2019; 20:ijms20235929. [PMID: 31775391 PMCID: PMC6929147 DOI: 10.3390/ijms20235929] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2019] [Revised: 11/15/2019] [Accepted: 11/21/2019] [Indexed: 11/17/2022] Open
Abstract
Tea (Camellia sinensis) is enriched with bioactive secondary metabolites, and is one of the most popular nonalcoholic beverages globally. Two tea reference genomes have been reported; however, the functional analysis of tea genes has lagged, mainly due to tea’s recalcitrance to genetic transformation and the absence of alternative high throughput heterologous expression systems. A full-length cDNA collection with a streamlined cloning system is needed in this economically important woody crop species. RNAs were isolated from nine different vegetative tea tissues, pooled, then used to construct a normalized full-length cDNA library. The titer of unamplified and amplified cDNA library was 6.89 × 106 and 1.8 × 1010 cfu/mL, respectively; the library recombinant rate was 87.2%. Preliminary characterization demonstrated that this collection can complement existing tea reference genomes and facilitate rare gene discovery. In addition, to streamline tea cDNA cloning and functional analysis, a binary vector (pBIG2113SF) was reengineered, seven tea cDNAs isolated from this library were successfully cloned into this vector, then transformed into Arabidopsis. One FL-cDNA, which encodes a putative P1B-type ATPase 5 (CsHMA5), was characterized further as a proof of concept. We demonstrated that overexpression of CsHMA5 in Arabidopsis resulted in copper hyposensitivity. Thus, our data demonstrated that this represents an efficient system for rare gene discovery and functional characterization of tea genes. The integration of a tea FL-cDNA collection with efficient cloning and a heterologous expression system would facilitate functional annotation and characterization of tea genes.
Collapse
Affiliation(s)
- Lin Lin
- Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Horticultural Plant Biology and Metabolomics Center, Haixia Institute of Science and Technology, Fujian Agriculture and Forestry University, Fuzhou 350002, China; (L.L.); (W.C.); (Z.D.); (Q.X.)
| | - Weiwei Cai
- Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Horticultural Plant Biology and Metabolomics Center, Haixia Institute of Science and Technology, Fujian Agriculture and Forestry University, Fuzhou 350002, China; (L.L.); (W.C.); (Z.D.); (Q.X.)
| | - Zhenghua Du
- Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Horticultural Plant Biology and Metabolomics Center, Haixia Institute of Science and Technology, Fujian Agriculture and Forestry University, Fuzhou 350002, China; (L.L.); (W.C.); (Z.D.); (Q.X.)
| | - Wenjing Zhang
- Anxi College of Tea Science, Fujian Agriculture and Forestry University, Fuzhou 350002, China
| | - Quanming Xu
- Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Horticultural Plant Biology and Metabolomics Center, Haixia Institute of Science and Technology, Fujian Agriculture and Forestry University, Fuzhou 350002, China; (L.L.); (W.C.); (Z.D.); (Q.X.)
| | - Weijiang Sun
- Anxi College of Tea Science, Fujian Agriculture and Forestry University, Fuzhou 350002, China
- Correspondence: (W.S.); (M.C.); Tel.: +86-13705067139 (W.S.); +86-18860109236 (M.C.)
| | - Mingjie Chen
- Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Horticultural Plant Biology and Metabolomics Center, Haixia Institute of Science and Technology, Fujian Agriculture and Forestry University, Fuzhou 350002, China; (L.L.); (W.C.); (Z.D.); (Q.X.)
- Correspondence: (W.S.); (M.C.); Tel.: +86-13705067139 (W.S.); +86-18860109236 (M.C.)
| |
Collapse
|
4
|
Boldogkői Z, Moldován N, Balázs Z, Snyder M, Tombácz D. Long-Read Sequencing – A Powerful Tool in Viral Transcriptome Research. Trends Microbiol 2019; 27:578-592. [DOI: 10.1016/j.tim.2019.01.010] [Citation(s) in RCA: 49] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2018] [Revised: 01/21/2019] [Accepted: 01/30/2019] [Indexed: 12/16/2022]
|
5
|
Archer SK, Shirokikh NE, Preiss T. Probe-Directed Degradation (PDD) for Flexible Removal of Unwanted cDNA Sequences from RNA-Seq Libraries. ACTA ACUST UNITED AC 2015; 85:11.15.1-11.15.36. [PMID: 25827346 DOI: 10.1002/0471142905.hg1115s85] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
Most applications for RNA-seq require the depletion of abundant transcripts to gain greater coverage of the underlying transcriptome. The sequences to be targeted for depletion depend on application and species and in many cases may not be supported by commercial depletion kits. This unit describes a method for generating RNA-seq libraries that incorporates probe-directed degradation (PDD), which can deplete any unwanted sequence set, with the low-bias split-adapter method of library generation (although many other library generation methods are in principle compatible). The overall strategy is suitable for applications requiring customized sequence depletion or where faithful representation of fragment ends and lack of sequence bias is paramount. We provide guidelines to rapidly design specific probes against the target sequence, and a detailed protocol for library generation using the split-adapter method including several strategies for streamlining the technique and reducing adapter dimer content.
Collapse
Affiliation(s)
- Stuart K Archer
- Genome Biology Department, The John Curtin School of Medical Research (JCSMR), The Australian National University, Acton, Canberra, Australian Capital Territory, Australia.,Present address: Monash Bioinformatics Platform, Monash University, Clayton, Victoria, Australia.,These authors contributed equally to this work
| | - Nikolay E Shirokikh
- Genome Biology Department, The John Curtin School of Medical Research (JCSMR), The Australian National University, Acton, Canberra, Australian Capital Territory, Australia.,Present address: Moscow Regional State Institute of Humanities and Social Studies, Ministry of Education of Moscow Region, Kolomna, Moscow Region, Russia.,These authors contributed equally to this work
| | - Thomas Preiss
- Genome Biology Department, The John Curtin School of Medical Research (JCSMR), The Australian National University, Acton, Canberra, Australian Capital Territory, Australia.,Victor Chang Cardiac Research Institute, Darlinghurst (Sydney), New South Wales, Australia
| |
Collapse
|
6
|
Yadav RK, Barbi F, Ziller A, Luis P, Marmeisse R, Reddy MS, Fraissinet-Tachet L. Construction of sized eukaryotic cDNA libraries using low input of total environmental metatranscriptomic RNA. BMC Biotechnol 2014; 14:80. [PMID: 25183040 PMCID: PMC4170940 DOI: 10.1186/1472-6750-14-80] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2014] [Accepted: 08/21/2014] [Indexed: 11/25/2022] Open
Abstract
Background Construction of high quality cDNA libraries from the usually low amounts of eukaryotic mRNA extracted from environmental samples is essential in functional metatranscriptomics for the selection of functional, full-length genes encoding proteins of interest. Many of the inserts in libraries constructed by standard methods are represented by truncated cDNAs due to premature stoppage of reverse transcriptase activity and preferential cloning of short cDNAs. Results We report here a simple and cost effective technique for preparation of sized eukaryotic cDNA libraries from as low as three microgram of total soil RNA dominated by ribosomal and bacterial RNA. cDNAs synthesized by a template switching approach were size-fractionated by two dimensional agarose gel electrophoresis prior to PCR amplification and cloning. Effective size selection was demonstrated by PCR amplification of conserved gene families specific of each size class. Libraries of more than one million independent inserts whose sizes ranged between one and four kb were thus produced. Up to 80% of the insert sequences were homologous to eukaryotic gene sequences present in public databases. Conclusions A simple and cost effective technique has been developed to construct sized eukaryotic cDNA libraries from environmental samples. This technique will facilitate expression cloning of environmental eukaryotic genes and contribute to a better understanding of basic biological and/or ecological processes carried out by eukaryotic microbial communities.
Collapse
Affiliation(s)
- Rajiv Kumar Yadav
- Ecologie Microbienne, UMR CNRS 5557, USC INRA 1364, Université Lyon 1, Université de Lyon, Villeurbanne, France.
| | | | | | | | | | | | | |
Collapse
|
7
|
High-throughput sequencing and de novo assembly of Brassica oleracea var. Capitata L. for transcriptome analysis. PLoS One 2014; 9:e92087. [PMID: 24682075 PMCID: PMC3969326 DOI: 10.1371/journal.pone.0092087] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2013] [Accepted: 02/18/2014] [Indexed: 12/28/2022] Open
Abstract
Background The cabbage, Brassica oleracea var. capitata L., has a distinguishable phenotype within the genus Brassica. Despite the economic and genetic importance of cabbage, there is little genomic data for cabbage, and most studies of Brassica are focused on other species or other B. oleracea subspecies. The lack of genomic data for cabbage, a non-model organism, hinders research on its molecular biology. Hence, the construction of reliable transcriptomic data based on high-throughput sequencing technologies is needed to enhance our understanding of cabbage and provide genomic information for future work. Methodology/Principal Findings We constructed cDNAs from total RNA isolated from the roots, leaves, flowers, seedlings, and calcium-limited seedling tissues of two cabbage genotypes: 102043 and 107140. We sequenced a total of six different samples using the Illumina HiSeq platform, producing 40.5 Gbp of sequence data comprising 401,454,986 short reads. We assembled 205,046 transcripts (≥ 200 bp) using the Velvet and Oases assembler and predicted 53,562 loci from the transcripts. We annotated 35,274 of the loci with 55,916 plant peptides in the Phytozome database. The average length of the annotated loci was 1,419 bp. We confirmed the reliability of the sequencing assembly using reverse-transcriptase PCR to identify tissue-specific gene candidates among the annotated loci. Conclusion Our study provides valuable transcriptome sequence data for B. oleracea var. capitata L., offering a new resource for studying B. oleracea and closely related species. Our transcriptomic sequences will enhance the quality of gene annotation and functional analysis of the cabbage genome and serve as a material basis for future genomic research on cabbage. The sequencing data from this study can be used to develop molecular markers and to identify the extreme differences among the phenotypes of different species in the genus Brassica.
Collapse
|
8
|
Al-Balool HH, Weber D, Liu Y, Wade M, Guleria K, Nam PLP, Clayton J, Rowe W, Coxhead J, Irving J, Elliott DJ, Hall AG, Santibanez-Koref M, Jackson MS. Post-transcriptional exon shuffling events in humans can be evolutionarily conserved and abundant. Genome Res 2011; 21:1788-99. [PMID: 21948523 PMCID: PMC3205564 DOI: 10.1101/gr.116442.110] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2010] [Accepted: 07/28/2011] [Indexed: 12/31/2022]
Abstract
In silico analyses have established that transcripts from some genes can be processed into RNAs with rearranged exon order relative to genomic structure (post-transcriptional exon shuffling, or PTES). Although known to contribute to transcriptome diversity in some species, to date the structure, distribution, abundance, and functional significance of human PTES transcripts remains largely unknown. Here, using high-throughput transcriptome sequencing, we identify 205 putative human PTES products from 176 genes. We validate 72 out of 112 products analyzed using RT-PCR, and identify additional PTES products structurally related to 61% of validated targets. Sequencing of these additional products reveals GT-AG dinucleotides at >95% of the splice junctions, confirming that they are processed by the spliceosome. We show that most PTES transcripts are expressed in a wide variety of human tissues, that they can be polyadenylated, and that some are conserved in mouse. We also show that they can extend into 5' and 3' UTRs, consistent with formation via trans-splicing of independent pre-mRNA molecules. Finally, we use real-time PCR to compare the abundance of PTES exon junctions relative to canonical exon junctions within the transcripts from seven genes. PTES exon junctions are present at <0.01% to >90% of the levels of canonical junctions, with transcripts from MAN1A2, PHC3, TLE4, and CDK13 exhibiting the highest levels. This is the first systematic experimental analysis of PTES in human, and it suggests both that the phenomenon is much more widespread than previously thought and that some PTES transcripts could be functional.
Collapse
Affiliation(s)
- Haya H. Al-Balool
- Institute of Genetic Medicine, Newcastle University, Newcastle NE1 3BZ, United Kingdom
| | - David Weber
- Institute of Genetic Medicine, Newcastle University, Newcastle NE1 3BZ, United Kingdom
| | - Yilei Liu
- Institute of Genetic Medicine, Newcastle University, Newcastle NE1 3BZ, United Kingdom
| | - Mark Wade
- Institute of Genetic Medicine, Newcastle University, Newcastle NE1 3BZ, United Kingdom
| | - Kamlesh Guleria
- Institute of Genetic Medicine, Newcastle University, Newcastle NE1 3BZ, United Kingdom
| | - Pitsien Lang Ping Nam
- Institute of Genetic Medicine, Newcastle University, Newcastle NE1 3BZ, United Kingdom
| | - Jake Clayton
- Institute of Genetic Medicine, Newcastle University, Newcastle NE1 3BZ, United Kingdom
| | - William Rowe
- Institute of Genetic Medicine, Newcastle University, Newcastle NE1 3BZ, United Kingdom
| | - Jonathan Coxhead
- NewGene Limited, Bioscience Building, International Centre for Life, Newcastle upon Tyne NE1 4EP, United Kingdom
| | - Julie Irving
- NewGene Limited, Bioscience Building, International Centre for Life, Newcastle upon Tyne NE1 4EP, United Kingdom
| | - David J. Elliott
- Institute of Genetic Medicine, Newcastle University, Newcastle NE1 3BZ, United Kingdom
| | - Andrew G. Hall
- Northern Institute for Cancer Research, Paul O'Gorman Building, Newcastle University, Newcastle upon Tyne NE2 4HH, United Kingdom
| | | | - Michael S. Jackson
- Institute of Genetic Medicine, Newcastle University, Newcastle NE1 3BZ, United Kingdom
| |
Collapse
|
9
|
Abstract
Background In the genome era, characterizing the structure and the function of RNA molecules remains a major challenge. Alternative transcripts and non-protein-coding genes are poorly recognized by the current genome-annotation algorithms and efficient tools are needed to isolate the less-abundant or stable RNAs. Results A universal RNA-tagging method using the T4 RNA ligase 2 and special adapters is reported. Based on this system, protocols for RACE PCR and full-length cDNA library construction have been developed. The RNA tagging conditions were thoroughly optimized and compared to previous methods by using a biochemical oligonucleotide tagging assay and RACE PCRs on a range of transcripts. In addition, two large-scale full-length cDNA inventories relying on this method are presented. Conclusion The RNA Captor is a straightforward and accessible protocol. The sensitivity of this approach was shown to be higher compared to previous methods, and applicable on messenger RNAs, non-protein-coding RNAs, transcription-start sites and microRNA-directed cleavage sites of transcripts. This strategy could also be used to study other classes of RNA and in deep sequencing experiments.
Collapse
Affiliation(s)
- Christian Clepet
- URGV Plant Genomics, INRA UMR1165 UEVE/CNRS ERL 8196, Evry, France.
| |
Collapse
|
10
|
Construction and characterization of a cDNA library from wheat infected with Fusarium graminearum Fg 2. Int J Mol Sci 2011; 12:613-26. [PMID: 21340003 PMCID: PMC3039969 DOI: 10.3390/ijms12010613] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2010] [Revised: 01/14/2011] [Accepted: 01/17/2011] [Indexed: 11/16/2022] Open
Abstract
Total RNA from wheat spikes infected with F. graminearum Fg2 was extracted and the mRNA was purified. Switching Mechanism at 5′ end of the RNA Transcript (SMART) technique and CDS Ill/3′ primer were used for first-strand cDNA synthesis using reverse transcriptase by RT-PCR. Primer extension polymerase chain reaction was used to construct the double-strand cDNA that was digested by proteinase K, then by Sfi I and fractionated. cDNAs longer than 0.5 kb were collected and ligated to λTriplEx2 vector followed λ phage packaging reaction and library amplification. The qualities of both unamplified and amplified cDNA libraries were strictly checked by conventional titer determination. One hundred and sixty five plaques were randomly picked and tested using PCR with universal primers derived from the sequence flanking the vector. A high quality cDNA library from wheat spikes that have been infected by F. graminearum was successfully constructed.
Collapse
|
11
|
Kapteyn J, He R, McDowell ET, Gang DR. Incorporation of non-natural nucleotides into template-switching oligonucleotides reduces background and improves cDNA synthesis from very small RNA samples. BMC Genomics 2010; 11:413. [PMID: 20598146 PMCID: PMC2996941 DOI: 10.1186/1471-2164-11-413] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2009] [Accepted: 07/02/2010] [Indexed: 01/15/2023] Open
Abstract
Background The template switching PCR (TS-PCR) method of cDNA synthesis represents one of the most straightforward approaches to generating full length cDNA for sequencing efforts. However, when applied to very small RNA samples, such as those obtained from tens or hundreds of cells, this approach leads to high background and low cDNA yield due to concatamerization of the TS oligo. Results In this study, we describe the application of nucleotide isomers that form non-standard base pairs in the template switching oligo to prevent background cDNA synthesis. When such bases are added to the 5' end of the template switching (TS) oligo, they inhibit MMLV-RT from extending the cDNA beyond the TS oligo, thus increasing cDNA yield by reducing formation of concatamers of the TS oligo that are the source of significant background. Conclusions Our results demonstrate that this novel approach for cDNA synthesis has valuable utility for application of ultra-high throughput technologies, such as whole transcriptome sequencing using 454 technology, to very small biological samples comprised of tens of cells as might be obtained via approaches like laser microdissection.
Collapse
Affiliation(s)
- Jeremy Kapteyn
- School of Plant Sciences and BIO5 Institute, The University of Arizona, 1657 E Helen Street, Tucson, AZ 85721, USA
| | | | | | | |
Collapse
|
12
|
Liu F, Wang D, Fu J, Sun G, Shen Y, Dong L, Zhang B, Hu S, Li J. Identification of immune-relevant genes by expressed sequence tag analysis of head kidney from grass carp (Ctenopharyngodon idella). COMPARATIVE BIOCHEMISTRY AND PHYSIOLOGY D-GENOMICS & PROTEOMICS 2010; 5:116-23. [DOI: 10.1016/j.cbd.2010.02.003] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/20/2009] [Revised: 02/11/2010] [Accepted: 02/15/2010] [Indexed: 01/17/2023]
|
13
|
Marques MC, Alonso-Cantabrana H, Forment J, Arribas R, Alamar S, Conejero V, Perez-Amador MA. A new set of ESTs and cDNA clones from full-length and normalized libraries for gene discovery and functional characterization in citrus. BMC Genomics 2009; 10:428. [PMID: 19747386 PMCID: PMC2754500 DOI: 10.1186/1471-2164-10-428] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2009] [Accepted: 09/11/2009] [Indexed: 01/02/2023] Open
Abstract
Background Interpretation of ever-increasing raw sequence information generated by modern genome sequencing technologies faces multiple challenges, such as gene function analysis and genome annotation. Indeed, nearly 40% of genes in plants encode proteins of unknown function. Functional characterization of these genes is one of the main challenges in modern biology. In this regard, the availability of full-length cDNA clones may fill in the gap created between sequence information and biological knowledge. Full-length cDNA clones facilitate functional analysis of the corresponding genes enabling manipulation of their expression in heterologous systems and the generation of a variety of tagged versions of the native protein. In addition, the development of full-length cDNA sequences has the power to improve the quality of genome annotation. Results We developed an integrated method to generate a new normalized EST collection enriched in full-length and rare transcripts of different citrus species from multiple tissues and developmental stages. We constructed a total of 15 cDNA libraries, from which we isolated 10,898 high-quality ESTs representing 6142 different genes. Percentages of redundancy and proportion of full-length clones range from 8 to 33, and 67 to 85, respectively, indicating good efficiency of the approach employed. The new EST collection adds 2113 new citrus ESTs, representing 1831 unigenes, to the collection of citrus genes available in the public databases. To facilitate functional analysis, cDNAs were introduced in a Gateway-based cloning vector for high-throughput functional analysis of genes in planta. Herein, we describe the technical methods used in the library construction, sequence analysis of clones and the overexpression of CitrSEP, a citrus homolog to the Arabidopsis SEP3 gene, in Arabidopsis as an example of a practical application of the engineered Gateway vector for functional analysis. Conclusion The new EST collection denotes an important step towards the identification of all genes in the citrus genome. Furthermore, public availability of the cDNA clones generated in this study, and not only their sequence, enables testing of the biological function of the genes represented in the collection. Expression of the citrus SEP3 homologue, CitrSEP, in Arabidopsis results in early flowering, along with other phenotypes resembling the over-expression of the Arabidopsis SEPALLATA genes. Our findings suggest that the members of the SEP gene family play similar roles in these quite distant plant species.
Collapse
Affiliation(s)
- M Carmen Marques
- Instituto de Biología Molecular y Celular de Plantas, Universidad Politécnica de Valencia and Consejo Superior de Investigaciones Científicas, Avenida de los Naranjos s/n, Valencia 46022, Spain.
| | | | | | | | | | | | | |
Collapse
|
14
|
Tsuchihara K, Suzuki Y, Wakaguri H, Irie T, Tanimoto K, Hashimoto SI, Matsushima K, Mizushima-Sugano J, Yamashita R, Nakai K, Bentley D, Esumi H, Sugano S. Massive transcriptional start site analysis of human genes in hypoxia cells. Nucleic Acids Res 2009; 37:2249-63. [PMID: 19237398 PMCID: PMC2673422 DOI: 10.1093/nar/gkp066] [Citation(s) in RCA: 88] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Combining our full-length cDNA method and the massively parallel sequencing technology, we developed a simple method to collect precise positional information of transcriptional start sites (TSSs) together with digital information of the gene-expression levels in a high throughput manner. We applied this method to observe gene-expression changes in a colon cancer cell line cultured in normoxic and hypoxic conditions. We generated more than 100 million 36-base TSS-tag sequences and revealed comprehensive features of hypoxia responsive alterations in the transcriptional landscape of the human genome. The features include presence of inducible 'hot regions' in 54 genomic regions, 220 novel hypoxia inducible promoters that may drive non-protein-coding transcripts, 191 hypoxia responsive alternative promoters and detailed views of 120 novel as well as known hypoxia responsive genes. We further analyzed hypoxic response of different cells using additional 60 million TSS-tags and found that the degree of the gene-expression changes were different among cell lines, possibly reflecting cellular robustness against hypoxia. The novel dynamic figure of the human gene transcriptome will deepen our understanding of the transcriptional program of the human genome as well as bringing new insights into the biology of cancer cells in hypoxia.
Collapse
Affiliation(s)
- Katsuya Tsuchihara
- Cancer Physiology Project, Research Center for Innovative Oncology, National Cancer Center Hospital East, Kashiwa, Chiba, Japan
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
15
|
Natalizio BJ, Robson-Dixon ND, Garcia-Blanco MA. The Carboxyl-terminal Domain of RNA Polymerase II Is Not Sufficient to Enhance the Efficiency of Pre-mRNA Capping or Splicing in the Context of a Different Polymerase. J Biol Chem 2009; 284:8692-702. [PMID: 19176527 DOI: 10.1074/jbc.m806919200] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Eukaryotic messenger RNA precursors (pre-mRNAs) synthesized by RNA polymerase II (RNAP II) are processed co-transcriptionally. The carboxyl-terminal domain (CTD) of the largest subunit of RNA polymerase II is thought to mediate the coupling of transcription with pre-mRNA processing by coordinating the recruitment of processing factors during synthesis of nascent transcripts. Previous studies have demonstrated that the phosphorylated CTD is required for efficient co-transcriptional processing. In the study presented here we investigated whether the CTD is sufficient to coordinate transcription with pre-mRNA capping and splicing in the context of two other DNA-dependent RNA polymerases, mammalian RNAP III and bacteriophage T7 RNAP. Our results indicate that the CTD fused to the largest subunit of RNAP III (POLR3A) is not sufficient to enhance co-transcriptional pre-mRNA splicing or capping in vivo. Additionally, we analyzed a T7 RNAP-CTD fusion protein and examined its ability to enhance pre-mRNA splicing and capping of both constitutively and alternatively spliced substrates. We observed that the CTD in the context of T7 RNAP was not sufficient to enhance pre-mRNA splicing or capping either in vitro or in vivo. We propose that the efficient coupling of transcription to pre-mRNA processing requires not only the phosphorylated CTD but also other RNAP II specific subunits or associated factors.
Collapse
Affiliation(s)
- Barbara J Natalizio
- Department of Molecular Genetics and Microbiology, Center for RNA Biology, and Department of Medicine, Duke University Medical Center, Durham, North Carolina 27710, USA
| | | | | |
Collapse
|
16
|
Ghosh P, Cheng J, Chou TF, Jia Y, Avdulov S, Bitterman PB, Polunovsky VA, Wagner CR. Expression, purification and characterization of recombinant mouse translation initiation factor eIF4E as a dihydrofolate reductase (DHFR) fusion protein. Protein Expr Purif 2008; 60:132-9. [PMID: 18479935 PMCID: PMC2617730 DOI: 10.1016/j.pep.2008.03.024] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2007] [Revised: 03/14/2008] [Accepted: 03/19/2008] [Indexed: 11/23/2022]
Abstract
One of the earliest steps in translation initiation is recognition of the mRNA cap structure (m7GpppX) by the initiation factor eIF4E. Studies of interactions between purified eIF4E and its binding partners provide important information for understanding mechanisms underlying translational control in normal and cancer cells. Numerous impediments of the available methods used for eIF4E purification led us to develop a novel methodology for obtaining fractions of eIF4E free from undesired by-products. Herein we report methods for bacterial expression of eIF4E tagged with mutant dihydrofolate reductase (DHFR) followed by isolation and purification of the DHFR-eIF4E protein by using affinity and anion exchange chromatography. Fluorescence quenching experiments indicated the cap-analog, 7MeGTP, bound to DHFR-eIF4E and eIF4E with a dissociation constant (K(d)) of 6+/-5 and 10+/-3 nM, respectively. Recombinant eIF4E and DHFR-eIF4E were both shown to significantly enhance in vitro translation in dose dependent manner by 75% at 0.5 microM. Nevertheless increased concentrations of eIF4E and DHFR-eIF4E significantly inhibited translation in a dose dependent manner by a maximum at 2 microM of 60% and 90%, respectively. Thus, we have demonstrated that we have developed an expression system for fully functional recombinant eIF4E. We have also shown that the fusion protein DHFR-eIF4E is functional and thus may be useful for cell based affinity tag studies with fluorescently labeled trimethoprim analogs.
Collapse
Affiliation(s)
- Phalguni Ghosh
- Department of Medicinal Chemistry, University of Minnesota, Minneapolis, MN 55455, USA
| | - Jilin Cheng
- Department of Medicinal Chemistry, University of Minnesota, Minneapolis, MN 55455, USA
| | - Tsui-Fen Chou
- Department of Medicinal Chemistry, University of Minnesota, Minneapolis, MN 55455, USA
| | - Yan Jia
- Department of Medicinal Chemistry, University of Minnesota, Minneapolis, MN 55455, USA
| | - Svetlana Avdulov
- Department of Pulmonary Medicine, University of Minnesota, Minneapolis, MN 55455, USA
| | - Peter B. Bitterman
- Department of Pulmonary Medicine, University of Minnesota, Minneapolis, MN 55455, USA
| | - Vitaly A. Polunovsky
- Department of Pulmonary Medicine, University of Minnesota, Minneapolis, MN 55455, USA
| | - Carston R. Wagner
- Department of Medicinal Chemistry, University of Minnesota, Minneapolis, MN 55455, USA
| |
Collapse
|
17
|
3G vector-primer plasmid for constructing full-length-enriched cDNA libraries. Anal Biochem 2008; 380:149-51. [PMID: 18544335 DOI: 10.1016/j.ab.2008.05.022] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2008] [Revised: 05/18/2008] [Accepted: 05/19/2008] [Indexed: 11/21/2022]
Abstract
We designed a 3G vector-primer plasmid for the generation of full-length-enriched complementary DNA (cDNA) libraries. By employing the terminal transferase activity of reverse transcriptase and the modified strand replacement method, this plasmid (assembled with a polydT end and a deoxyguanosine [dG] end) combines priming full-length cDNA strand synthesis and directional cDNA cloning. As a result, the number of steps involved in cDNA library preparation is decreased while simplifying downstream gene manipulation, sequencing, and subcloning. The 3G vector-primer plasmid method yields fully represented plasmid primed libraries that are equivalent to those made by the SMART (switching mechanism at 5' end of RNA transcript) approach.
Collapse
|
18
|
Abstract
Recent progress in the analyses of the mouse transcriptome leads to unexpected discoveries. The mouse genomic sequences read by RNA polymerase II may be six times more than previously expected for human chromosomes. The transcript-abundant regions (named "transcription forests") occupy more than half of the genomic sequence and are divided by transcript-scarce regions (transcription deserts). Many of the coding mRNAs may have partially overlapping antisense RNAs. There are transcripts bridging several adjacent genes that were previously regarded as distinct ones. The transcription start sites appearing as cap analysis of gene expression (CAGE) tags are mapped on the mouse genomic sequences. Distributions of CAGE tags show that the shapes of mammalian gene promoters can be classified into four major categories. These shapes were conserved between mouse and human. Most of the gene has exonic transcription start sites, especially in the 3' untranslated region (3' UTR) sequences. The term "RNA continent" has been invented to express this unexpectedly complex and prodigious mouse transcriptome. More than a half of the RNA polymerase II transcripts are regarded as noncoding RNAs (ncRNAs). The great variety of ncRNAs in mammalian transcriptome implies that there are many functional ncRNAs in the cells. Especially, the evolutionarily conserved microRNAs play critical roles in mammalian development and other biological functions. Moreover, many other ncRNAs have also been shown to have biological significant functions, mainly in the regulation of gene expression. The functional survey of the RNA continent has just started. We will describe the state of the art of the RNA continent and its impact on the modern molecular biology, especially on the cancer research.
Collapse
Affiliation(s)
- Jun Yasuda
- Functional RNA Research Program, Frontier Research System, RIKEN Yokohama Institute, 1-7-22, Suehiro-cho, Tsurumi-ku, Yokohama 230-0045, Japan
| | | |
Collapse
|
19
|
Harbers M. The current status of cDNA cloning. Genomics 2008; 91:232-42. [PMID: 18222633 DOI: 10.1016/j.ygeno.2007.11.004] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2007] [Revised: 11/10/2007] [Accepted: 11/17/2007] [Indexed: 11/19/2022]
Abstract
The cloning of cDNAs, copies of cellular RNA, is one of the classical technologies in molecular biology. Over the past 30 years cDNA cloning technologies have been improved to enable the cloning of large cDNA collections, which are fundamental to today's understanding of the utilization of genetic information. With the discovery of noncoding RNAs, additional new approaches to the cloning of short RNAs have been developed. However, with the realization that much larger portions of genomes are transcribed than anticipated from genome annotations, cDNA cloning faces new challenges to uncover rare transcripts and to make the corresponding cDNAs available for functional studies. This review provides an overview on the current status of cDNA cloning and possibilities for the discovery and characterization of new RNA families.
Collapse
Affiliation(s)
- Matthias Harbers
- DNAFORM, Inc., Leading Venture Plaza 2, 75-1 Ono-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0046, Japan.
| |
Collapse
|
20
|
Abstract
The principal route to understanding the biological significance of the genome sequence comes from discovery and characterization of that portion of the genome that is transcribed into RNA products. We now know that this ;transcriptome' is unexpectedly complex and its precise definition in any one species requires multiple technical approaches and an ability to work on a very large scale. A key step is the development of technologies able to capture snapshots of the complexity of the various kinds of RNA generated by the genome. As the human, mouse and other model genome sequencing projects approach completion, considerable effort has been focused on identifying and annotating the protein-coding genes as the principal output of the genome. In pursuing this aim, several key technologies have been developed to generate large numbers and highly diverse sets of full-length cDNAs and their variants. However, the search has identified another hidden transcriptional universe comprising a wide variety of non-protein coding RNA transcripts. Despite initial scepticism, various experiments and complementary technologies have demonstrated that these RNAs are dynamically transcribed and a subset of them can act as sense-antisense RNAs, which influence the transcriptional output of the genome. Recent experimental evidence suggests that the list of non-protein coding RNAs is still largely incomplete and that transcription is substantially more complex even than currently thought.
Collapse
Affiliation(s)
- Piero Carninci
- Genome Science Laboratory, Discovery and Research Institute, RIKEN Wako Institute, Wako, Saitama, Japan.
| |
Collapse
|
21
|
Bajak EZ, Hagedorn CH. Efficient 5' cap-dependent RNA purification : use in identifying and studying subsets of RNA. Methods Mol Biol 2008; 419:147-60. [PMID: 18369981 DOI: 10.1007/978-1-59745-033-1_10] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Microarray-based screening technologies have revealed a larger than expected diversity of gene expression profiles for many cells, tissues, and organisms. The complexity of RNA species, defined by their molecular structure, represents a major new development in biology. RNA not only carries genetic information in the form of templates and components of the translational machinery for protein synthesis but also directly regulates gene expression as exemplified by micro-RNAs (miRNAs). Recent evidence has demonstrated that 5' capped and 3' polyadenylated ends are not restricted to mRNAs, but that they are also present in precursors of both miRNAs and some antisense RNA transcripts. In addition, as many as 40% of transcribed RNAs may lack 3' poly(A) ends. In concert with the presence of a 5' cap (m7 GpppN), the length of the 3' poly(A) end plays a critical role in determining the translational efficiency, stability, and the cellular distribution of a specific mRNA. RNAs with short or lacking 3' poly(A) ends, that escape isolation and amplification with oligo(dT)-based methods, provide a challenge in RNA biology and gene expression studies. To circumvent the limitations of 3' poly(A)-dependent RNA isolation methods, we developed an efficient RNA purification system that binds the 5' cap of RNA with a high-affinity variant of the cap-binding protein eIF4E. This system can be used in differential selection approaches to isolate subsets of RNAs, including those with short 3' poly(A) ends that are likely targets of post-transcriptional regulation of gene expression. The length of the 3' poly(A) ends can be defined using a rapid polymerase chain reaction (PCR)- based approach.
Collapse
Affiliation(s)
- Edyta Z Bajak
- Department of Medicine and Pharmacology, University of Kansas Medical Center, Kansas City, KS, USA
| | | |
Collapse
|
22
|
Shaw PJ, Ponmee N, Karoonuthaisiri N, Kamchonwongpaisan S, Yuthavong Y. Characterization of human malaria parasite Plasmodium falciparum eIF4E homologue and mRNA 5' cap status. Mol Biochem Parasitol 2007; 155:146-55. [PMID: 17692399 DOI: 10.1016/j.molbiopara.2007.07.003] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2006] [Revised: 06/01/2007] [Accepted: 07/05/2007] [Indexed: 11/30/2022]
Abstract
The mRNA 5' cap is an essential structural feature for translation of eukaryotic mRNA. Translation is initiated by recognition of the cap by the translation initiation factor eIF4E. To further our understanding of mRNA translation in the human malaria parasite Plasmodium falciparum, we have investigated the parasite eIF4E and its interaction with capped mRNA. We have purified P. falciparum eIF4E as a recombinant protein and demonstrated that it has canonical mRNA cap binding activity. We used this protein to purify P. falciparum capped mRNAs from total parasite RNA. Microarray analysis comparing total and eIF4E-purified capped mRNAs shows that 34 features were more than twofold under-represented in the purified RNA sample, including 19 features representative of nuclear transcripts. The putatively uncapped nuclear transcripts may represent a class of mRNAs targeted for storage and cap removal.
Collapse
Affiliation(s)
- Philip J Shaw
- National Center for Genetic Engineering and Biotechnology (BIOTEC), National Science and Technology Development Agency, 113 Pahonyothin Road, Klong 1, Klong Luang, Pathumthani 12120, Thailand
| | | | | | | | | |
Collapse
|
23
|
Construction and characterization of a full-length cDNA library for the wheat stripe rust pathogen (Puccinia striiformis f. sp. tritici). BMC Genomics 2007; 8:145. [PMID: 17547766 PMCID: PMC1903366 DOI: 10.1186/1471-2164-8-145] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2006] [Accepted: 06/04/2007] [Indexed: 11/17/2022] Open
Abstract
Background Puccinia striiformis is a plant pathogenic fungus causing stripe rust, one of the most important diseases on cereal crops and grasses worldwide. However, little is know about its genome and genes involved in the biology and pathogenicity of the pathogen. We initiated the functional genomic research of the fungus by constructing a full-length cDNA and determined functions of the first group of genes by sequence comparison of cDNA clones to genes reported in other fungi. Results A full-length cDNA library, consisting of 42,240 clones with an average cDNA insert of 1.9 kb, was constructed using urediniospores of race PST-78 of P. striiformis f. sp. tritici. From 196 sequenced cDNA clones, we determined functions of 73 clones (37.2%). In addition, 36 clones (18.4%) had significant homology to hypothetical proteins, 37 clones (18.9%) had some homology to genes in other fungi, and the remaining 50 clones (25.5%) did not produce any hits. From the 73 clones with functions, we identified 51 different genes encoding protein products that are involved in amino acid metabolism, cell defense, cell cycle, cell signaling, cell structure and growth, energy cycle, lipid and nucleotide metabolism, protein modification, ribosomal protein complex, sugar metabolism, transcription factor, transport metabolism, and virulence/infection. Conclusion The full-length cDNA library is useful in identifying functional genes of P. striiformis.
Collapse
|
24
|
Kim TH, Kim NS, Lim D, Lee KT, Oh JH, Park HS, Jang GW, Kim HY, Jeon M, Choi BH, Lee HY, Chung HY, Kim H. Generation and analysis of large-scale expressed sequence tags (ESTs) from a full-length enriched cDNA library of porcine backfat tissue. BMC Genomics 2006; 7:36. [PMID: 16504160 PMCID: PMC1444929 DOI: 10.1186/1471-2164-7-36] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2005] [Accepted: 02/27/2006] [Indexed: 11/13/2022] Open
Abstract
Background Genome research in farm animals will expand our basic knowledge of the genetic control of complex traits, and the results will be applied in the livestock industry to improve meat quality and productivity, as well as to reduce the incidence of disease. A combination of quantitative trait locus mapping and microarray analysis is a useful approach to reduce the overall effort needed to identify genes associated with quantitative traits of interest. Results We constructed a full-length enriched cDNA library from porcine backfat tissue. The estimated average size of the cDNA inserts was 1.7 kb, and the cDNA fullness ratio was 70%. In total, we deposited 16,110 high-quality sequences in the dbEST division of GenBank (accession numbers: DT319652-DT335761). For all the expressed sequence tags (ESTs), approximately 10.9 Mb of porcine sequence were generated with an average length of 674 bp per EST (range: 200–952 bp). Clustering and assembly of these ESTs resulted in a total of 5,008 unique sequences with 1,776 contigs (35.46%) and 3,232 singleton (65.54%) ESTs. From a total of 5,008 unique sequences, 3,154 (62.98%) were similar to other sequences, and 1,854 (37.02%) were identified as having no hit or low identity (<95%) and 60% coverage in The Institute for Genomic Research (TIGR) gene index of Sus scrofa. Gene ontology (GO) annotation of unique sequences showed that approximately 31.7, 32.3, and 30.8% were assigned molecular function, biological process, and cellular component GO terms, respectively. A total of 1,854 putative novel transcripts resulted after comparison and filtering with the TIGR SsGI; these included a large percentage of singletons (80.64%) and a small proportion of contigs (13.36%). Conclusion The sequence data generated in this study will provide valuable information for studying expression profiles using EST-based microarrays and assist in the condensation of current pig TCs into clusters representing longer stretches of cDNA sequences. The isolation of genes expressed in backfat tissue is the first step toward a better understanding of backfat tissue on a genomic basis.
Collapse
Affiliation(s)
- Tae-Hun Kim
- Division of Animal Genomics & Bioinformatics, National LivestockResearch Institute, Rural Development Administration, Omokchun-dong 564, Kwonsun-gu, Suwon, Korea
| | - Nam-Soon Kim
- Laboratory of Human Genomics, Genome Research Center, Korea Research Institute of Bioscience & Biotechnology, Daejeon, Korea
| | - Dajeong Lim
- School of Agricultural Biotechnology, Seoul National University San 56-1, Sillim-dong, Gwanak-gu, Seoul 151-742, Korea
| | - Kyung-Tai Lee
- Division of Animal Genomics & Bioinformatics, National LivestockResearch Institute, Rural Development Administration, Omokchun-dong 564, Kwonsun-gu, Suwon, Korea
| | - Jung-Hwa Oh
- Laboratory of Human Genomics, Genome Research Center, Korea Research Institute of Bioscience & Biotechnology, Daejeon, Korea
| | - Hye-Sook Park
- Division of Animal Genomics & Bioinformatics, National LivestockResearch Institute, Rural Development Administration, Omokchun-dong 564, Kwonsun-gu, Suwon, Korea
| | - Gil-Won Jang
- Division of Animal Genomics & Bioinformatics, National LivestockResearch Institute, Rural Development Administration, Omokchun-dong 564, Kwonsun-gu, Suwon, Korea
| | - Hyung-Yong Kim
- Division of Animal Genomics & Bioinformatics, National LivestockResearch Institute, Rural Development Administration, Omokchun-dong 564, Kwonsun-gu, Suwon, Korea
| | - Mina Jeon
- School of Agricultural Biotechnology, Seoul National University San 56-1, Sillim-dong, Gwanak-gu, Seoul 151-742, Korea
| | - Bong-Hwan Choi
- Division of Animal Genomics & Bioinformatics, National LivestockResearch Institute, Rural Development Administration, Omokchun-dong 564, Kwonsun-gu, Suwon, Korea
| | - Hae-Young Lee
- Division of Animal Genomics & Bioinformatics, National LivestockResearch Institute, Rural Development Administration, Omokchun-dong 564, Kwonsun-gu, Suwon, Korea
| | - HY Chung
- Division of Animal Genomics & Bioinformatics, National LivestockResearch Institute, Rural Development Administration, Omokchun-dong 564, Kwonsun-gu, Suwon, Korea
| | - Heebal Kim
- School of Agricultural Biotechnology, Seoul National University San 56-1, Sillim-dong, Gwanak-gu, Seoul 151-742, Korea
| |
Collapse
|
25
|
Harbers M, Carninci P. Tag-based approaches for transcriptome research and genome annotation. Nat Methods 2005; 2:495-502. [PMID: 15973418 DOI: 10.1038/nmeth768] [Citation(s) in RCA: 141] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Abstract
With the increasing number of whole genome sequences available, genomic research has shifted toward the annotation of functional elements and transcribed regions. Thus, the related field of transcriptome research requires accurate methods for the profiling of genes that are not biased by known sequence information, and that also allow for the identification of promoter regions. Starting with serial analysis of gene expression (SAGE), methods making use of short sequencing tags have greatly contributed to transcriptome studies. Here we review recent developments in the use of short sequencing tags in expression profiling, gene discovery and genome annotation. These tags are obtained from the 5' end of mRNAs, both terminal ends of mRNAs, or genomic regions. The 5' end-specific tags, with their ability to identify transcripts along with their transcriptional start sites, will be of particular interest for gene network studies and may become one of the most important approaches in systems biology.
Collapse
Affiliation(s)
- Matthias Harbers
- K.K. Dnaform, Tsukuba Branch, 3-1 Chuo 8-chome, Ami Machi, Inashiki Gun, Ibaraki, 300-0332, Japan.
| | | |
Collapse
|
26
|
Cowan D, Meyer Q, Stafford W, Muyanga S, Cameron R, Wittwer P. Metagenomic gene discovery: past, present and future. Trends Biotechnol 2005; 23:321-9. [PMID: 15922085 DOI: 10.1016/j.tibtech.2005.04.001] [Citation(s) in RCA: 148] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2004] [Revised: 02/22/2005] [Accepted: 04/04/2005] [Indexed: 11/29/2022]
Abstract
It is now widely accepted that the application of standard microbiological methods for the recovery of microorganisms from the environment has had limited success in providing access to the true extent of microbial biodiversity. It follows that much of the extant microbial genetic diversity (collectively termed the metagenome) remains unexploited, an issue of considerable relevance to a wider understanding of microbial communities and of considerable importance to the biotechnology industry. The recent development of technologies designed to access this wealth of genetic information through environmental nucleic acid extraction has provided a means of avoiding the limitations of culture-dependent genetic exploitation.
Collapse
Affiliation(s)
- Don Cowan
- Advanced Research Centre for Applied Microbiology, Department of Biotechnology, University of the Western Cape, Bellville 7535, Cape Town, South Africa.
| | | | | | | | | | | |
Collapse
|
27
|
Yan Y, Svitkin Y, Lee JM, Bisaillon M, Pelletier J. Ribavirin is not a functional mimic of the 7-methyl guanosine mRNA cap. RNA (NEW YORK, N.Y.) 2005; 11:1238-44. [PMID: 16043507 PMCID: PMC1370807 DOI: 10.1261/rna.2930805] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
Ribavirin is a guanosine ribonucleoside analog that displays broad-spectrum anti-viral activity and is currently used for the treatment of some viral infections. Ribavirin has recently been proposed to also be a mimic of the 7-methyl guanosine cap found at the 5' end of mRNAs. To obtain supporting functional data for this hypothesis, we assessed the ability of ribavirin triphosphate to interfere with the interaction between eIF4E and 7-methyl guanosine capped mRNA. In chemical cross-linking assays, cap-affinity chromatography, and cap-dependent translation assays, ribavirin was unable to function as a cap analog.
Collapse
Affiliation(s)
- Yifei Yan
- Department of Biochemistry, McIntyre Medical Sciences Building, McGill University, Montreal, Québec H3G 1Y6, Canada
| | | | | | | | | |
Collapse
|
28
|
Oh JH, Sohn HY, Kim JM, Kim YS, Kim NS. Construction of multi-purpose vectors, pCNS and pCNS-D2, are suitable for collection and functional study of large-scale cDNAs. Plasmid 2004; 51:217-26. [PMID: 15109828 DOI: 10.1016/j.plasmid.2004.01.001] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2003] [Indexed: 10/26/2022]
Abstract
To efficiently perform collection and functional studies of large-scale cDNAs, we constructed multi-functional cDNA vectors for efficient full-length cDNA cloning, direct sequencing, easy screening, and the expression of cDNA in vitro and in vivo without subcloning the cDNA into other vectors. The constructed vectors, pCNS and pCNS-D2, contain a multi-cloning site for uni-directional full-length cDNA cloning, T7 and Sp6 RNA polymerase promoters for in vitro transcription and translation, and hCMV immediate early promoter and BGH poly(A) to allow expression in mammalian cells. Using these vectors, we constructed full-length enriched cDNA libraries containing 60-75% of the full-length cDNAs using two different oligo-capping methods. The subtracted cDNA libraries could also be constructed by removing of EF1-alpha cDNA, a highly expressed cDNA. In addition, we confirmed the translation of EF1-alpha cDNA in vitro and the expression of luciferase cDNA in mammalian cells. The expression efficiency of luciferase cDNA in different cell lines, such as HeLa, Hep3B, SNU638, and SNU668, showed that pCNS vectors can highly express target genes in different cell types. These results indicated that our multi-purpose vectors, pCNS and pCNS-D2, are useful tools for the construction of full-length cDNA libraries and high-throughput based functional study of cDNAs.
Collapse
Affiliation(s)
- Jung-Hwa Oh
- Laboratory of Human Genomics, Division of Genomics and Proteomics, Korea Research Institute of Bioscience and Biotechnology, Daejeon 305-333, Republic of Korea
| | | | | | | | | |
Collapse
|
29
|
Wellenreuther R, Schupp I, Poustka A, Wiemann S. SMART amplification combined with cDNA size fractionation in order to obtain large full-length clones. BMC Genomics 2004; 5:36. [PMID: 15198809 PMCID: PMC436056 DOI: 10.1186/1471-2164-5-36] [Citation(s) in RCA: 60] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2004] [Accepted: 06/15/2004] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND cDNA libraries are widely used to identify genes and splice variants, and as a physical resource for full-length clones. Conventionally-generated cDNA libraries contain a high percentage of 5'-truncated clones. Current library construction methods that enrich for full-length mRNA are laborious, and involve several enzymatic steps performed on mRNA, which renders them sensitive to RNA degradation. The SMART technique for full-length enrichment is robust but results in limited cDNA insert size of the library. RESULTS We describe a method to construct SMART full-length enriched cDNA libraries with large insert sizes. Sub-libraries were generated from size-fractionated cDNA with an average insert size of up to seven kb. The percentage of full-length clones was calculated for different size ranges from BLAST results of over 12,000 5'ESTs. CONCLUSIONS The presented technique is suitable to generate full-length enriched cDNA libraries with large average insert sizes in a straightforward and robust way. The representation of full-coding clones is high also for large cDNAs (70%, 4-10 kb), when high-quality starting mRNA is used.
Collapse
MESH Headings
- Cell Line, Tumor
- Cloning, Molecular/methods
- DNA, Complementary/chemistry
- DNA, Complementary/genetics
- DNA, Complementary/metabolism
- DNA, Neoplasm/chemistry
- DNA, Neoplasm/genetics
- DNA, Neoplasm/metabolism
- Electrophoresis, Agar Gel/methods
- Endometrial Neoplasms/genetics
- Endometrial Neoplasms/pathology
- Female
- Gene Library
- Humans
- Nucleic Acid Amplification Techniques/methods
- RNA, Messenger/genetics
- RNA, Neoplasm/genetics
- Reverse Transcriptase Polymerase Chain Reaction/methods
Collapse
Affiliation(s)
- Ruth Wellenreuther
- Department of Molecular Genome Analysis, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Ingo Schupp
- Department of Molecular Genome Analysis, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | | | - Annemarie Poustka
- Department of Molecular Genome Analysis, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Stefan Wiemann
- Department of Molecular Genome Analysis, German Cancer Research Center (DKFZ), Heidelberg, Germany
| |
Collapse
|
30
|
Zhulidov PA, Bogdanova EA, Shcheglov AS, Vagner LL, Khaspekov GL, Kozhemyako VB, Matz MV, Meleshkevitch E, Moroz LL, Lukyanov SA, Shagin DA. Simple cDNA normalization using kamchatka crab duplex-specific nuclease. Nucleic Acids Res 2004; 32:e37. [PMID: 14973331 PMCID: PMC373426 DOI: 10.1093/nar/gnh031] [Citation(s) in RCA: 319] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2003] [Revised: 11/25/2003] [Accepted: 01/20/2004] [Indexed: 11/14/2022] Open
Abstract
We developed a novel simple cDNA normalization method [termed duplex-specific nuclease (DSN) normalization] that may be effectively used for samples enriched with full-length cDNA sequences. DSN normalization involves the denaturation-reassociation of cDNA, degradation of the double-stranded (ds) fraction formed by abundant transcripts and PCR amplification of the equalized single-stranded (ss) DNA fraction. The key element of this method is the degradation of the ds fraction formed during reassociation of cDNA using the kamchatka crab DSN, as described recently. This thermostable enzyme displays a strong preference for cleaving ds DNA and DNA in DNA-RNA hybrid duplexes compared with ss DNA and RNA, irrespective of sequence length. We developed normalization protocols for both first-strand cDNA [when poly(A)+ RNA is available] and amplified cDNA (when only total RNA can be obtained). Both protocols were evaluated in model experiments using human skeletal muscle cDNA. We also employed DSN normalization to normalize cDNA from nervous tissues of the marine mollusc Aplysia californica (a popular model organism in neuroscience) to illustrate further the efficiency of the normalization technique.
Collapse
Affiliation(s)
- Pavel A Zhulidov
- Shemiakin and Ovchinnikov Institute of Bioorganic Chemistry RAS, Miklukho-Maklaya 16/10, 117871 Moscow, Russia
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
31
|
Clepet C, Le Clainche I, Caboche M. Improved full-length cDNA production based on RNA tagging by T4 DNA ligase. Nucleic Acids Res 2004; 32:e6. [PMID: 14704363 PMCID: PMC373303 DOI: 10.1093/nar/gng158] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Second-strand cDNA priming is a central problem for full-length characterization of transcripts. A new strategy using bacteriophage T4 DNA ligase and partially degenerate adapters is proposed for grafting a sequence tag to the end of polyribonucleotides. Based on this RNA tagging system and previously described protocols, a new method for full-length cDNA production has been implemented. Validation of the method is shown in Arabidopsis thaliana by the construction of a full-length cDNA library and the analysis of 154 clones and by 5'-RACE-PCR run on a documented experimental system.
Collapse
Affiliation(s)
- Christian Clepet
- Unité de Recherches en Génomique Végétale, INRA/CNRS, 2 Rue Gaston-Crémieux, F-91057 Evry Cedex, France.
| | | | | |
Collapse
|
32
|
Jackson A, Jiao PE, Ni I, Fu GK. Agarose gel size fractionation of RNA for the cloning of full-length cDNAs. Anal Biochem 2003; 323:252-5. [PMID: 14656534 DOI: 10.1016/j.ab.2003.10.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Affiliation(s)
- Alan Jackson
- Incyte Corp, 3160 Porter Dr, Palo Alto, CA 94304, USA
| | | | | | | |
Collapse
|
33
|
Yamashita R, Suzuki Y, Nakai K, Sugano S. Small open reading frames in 5′ untranslated regions of mRNAs. C R Biol 2003; 326:987-91. [PMID: 14744105 DOI: 10.1016/j.crvi.2003.09.028] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Using the 5'-end sequence data from 'oligo-capped' cDNAs, we generated a representative full-length cDNA dataset for 4870 RefSeq entries, and analyzed the 5' untranslated region (UTR) of these genes. To our surprise, about half of the 4870 genes had an upstream ATG before the ATG that starts the longest open reading frame (ORF), suggesting that about half of them have small ORFs in their 5' UTR of average length of 31 amino acids. They require attention for further analysis to identify their biological role.
Collapse
Affiliation(s)
- Riu Yamashita
- Laboratory of Genome Database, Human Genome Center, The Institute of Medical Science, The University of Tokyo, 4-6-1, Shirokane-dai, Minato-ku, Tokyo 108-8639, Japan
| | | | | | | |
Collapse
|
34
|
Choi YH, Hagedorn CH. Purifying mRNAs with a high-affinity eIF4E mutant identifies the short 3' poly(A) end phenotype. Proc Natl Acad Sci U S A 2003; 100:7033-8. [PMID: 12777618 PMCID: PMC165825 DOI: 10.1073/pnas.1232347100] [Citation(s) in RCA: 54] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2003] [Indexed: 11/18/2022] Open
Abstract
The use of DNA microarrays has revolutionized the manner in which mRNA populations are analyzed. One limitation of the current technology is that mRNAs are often purified on the basis of their 3' poly(A) ends, which can be extremely short or absent in some mRNAs. To circumvent this limitation, we have developed a procedure for the purification of eukaryotic mRNAs using a mutant version of the mRNA 5' cap-binding protein (eIF4E) with increased affinity for the m7GTP moiety of the cap. By using this procedure, we have compared the populations of mammalian mRNAs purified by oligo(dT) and 5' cap selection with oligonucleotide microarrays. This analysis has identified a subpopulation of mRNAs that are present with short 3' poly(A) ends at steady state and are missed or underrepresented after purification by oligo(dT). These mRNAs may respond to specific posttranscriptional control mechanisms such as cytoplasmic polyadenylation.
Collapse
Affiliation(s)
- Youkyung Hwang Choi
- Department of Medicine, Genetics Program, Winship Cancer Institute, Emory University School of Medicine, Atlanta, GA 30322, USA
| | | |
Collapse
|
35
|
Liu Y, Kuersten S, Huang T, Larsen A, MacMorris M, Blumenthal T. An uncapped RNA suggests a model for Caenorhabditis elegans polycistronic pre-mRNA processing. RNA (NEW YORK, N.Y.) 2003; 9:677-87. [PMID: 12756326 PMCID: PMC1370435 DOI: 10.1261/rna.2128903] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/29/2002] [Accepted: 02/27/2003] [Indexed: 05/24/2023]
Abstract
Polycistronic pre-mRNAs from Caenohabditis elegans operons are processed by internal cleavage and polyadenylation to create 3' ends of mature mRNAs. This is accompanied by trans-splicing with SL2 approximately 100 nucleotides downstream of the 3' end formation sites to create the 5' ends of downstream mRNAs. SL2 trans-splicing depends on a U-rich element (Ur), located approximately 70 nucleotides upstream of the trans-splice site in the intercistronic region (ICR), as well as a functional 3' end formation signal. Here we report the existence of a novel gene-length RNA, the Ur-RNA, starting just upstream of the Ur element. The expression of Ur-RNA is dependent on 3' end formation as well as on the presence of the Ur element, but does not require a trans-splice site. The Ur-RNA is not capped, and alteration of the location of the Ur element in either the 5' or 3' direction alters the location of the 5' end of the Ur-RNA. We propose that a 5' to 3' exonuclease degrades the precursor RNA following cleavage at the poly(A) site, stopping when it reaches the Ur element, presumably attributable to a bound protein. Part of the function of this protein can be performed by the MS2 coat protein. Recruitment of coat protein to the ICR in the absence of the Ur element results in accumulation of an RNA equivalent to Ur-RNA, and restores trans-splicing. Only SL1, however, is used. Therefore, coat protein is sufficient for blocking the exonuclease and thereby allowing formation of a substrate for trans-splicing, but it lacks the ability to recruit the SL2 snRNP. Our results also demonstrate that MS2 coat protein can be used as an in vivo block to an exonuclease, which should have utility in mRNA stability studies.
Collapse
Affiliation(s)
- Yingmiao Liu
- Department of Biochemistry and Molecular Genetics, University of Colorado School of Medicine, Denver, Colorado 80262, USA
| | | | | | | | | | | |
Collapse
|
36
|
Abstract
Here we investigated how capping and methylation of HIV pre-mRNAs are coupled to Pol II elongation. Stable binding of the capping enzyme (Mce1) and cap methyltransferase (Hcm1) to template-engaged Pol II depends on CTD phosphorylation, but not on nascent RNA. Both Mce1 and Hcm1 travel with Pol II during elongation. The capping and methylation reactions cannot occur until the nascent pre-mRNA has attained a chain length of 19-22 nucleotides. HIV pre-mRNAs are capped quantitatively when elongation complexes are halted at promoter-proximal positions, but capping is much less efficient during unimpeded Pol II elongation. Cotranscriptional capping of HIV mRNA is strongly stimulated by Tat, and this stimulation requires the C-terminal segment of Tat that mediates its direct binding to Mce1. Our findings implicate capping in an elongation checkpoint critical to HIV gene expression.
Collapse
Affiliation(s)
- Ya-Lin Chiu
- Chemical Biology Program, Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, MA 01605, USA
| | | | | | | | | | | |
Collapse
|
37
|
Affiliation(s)
- Troels Z Kristiansen
- Center for Experimental Bioinformatics, University of Southern Denmark, Odense, Denmark
| | | |
Collapse
|
38
|
Spivak-Kroizman T, Friedland DE, De Staercke C, Gernert KM, Goss DJ, Hagedorn CH. Mutations in the S4-H2 loop of eIF4E which increase the affinity for m7GTP. FEBS Lett 2002; 516:9-14. [PMID: 11959093 DOI: 10.1016/s0014-5793(02)02445-6] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Eukaryotic initiation factor 4E (eIF4E) binds the 5'-cap of eukaryotic mRNAs and overexpression of eIF4E in epithelial cell cancers correlates with the metastases/tissue invasion phenotype. Photolabeling of eIF4E with [gamma-32P]8-azidoguanosine 5'-triphosphate (8-N3GTP) demonstrated cross-linking at Lys-119 in the S4-H2 loop which is distant from the m7GTP binding site [Marcotrigiano et al. (1997) Cell 89, 951-961; Friedland et al. (1997) Protein Sci. 6, 125-131]. Modeling studies indicate that 8-N3GTP cross-linked with Lys-119 because it binds a site that is occupied by the second nucleotide of a bound mRNA. Mutagenesis of the S4-H2 loop produced proteins with a 5-10-fold higher affinity for m7GTP than wild-type eIF4E. These mutants of eIF4E may have uses in selectively purifying mRNAs with intact 5'-ends or in determining how the promyelocytic leukemia protein decreases the affinity of eIF4E for mRNA caps.
Collapse
Affiliation(s)
- Taly Spivak-Kroizman
- Department of Medicine, Genetics Program of the Winship Cancer Center, Emory University School of Medicine, 165 Michael Street, Room 201, Atlanta, GA 30322, USA
| | | | | | | | | | | |
Collapse
|
39
|
Draper MP, August PR, Connolly T, Packard B, Call KM. Efficient cloning of full-length cDNAs based on cDNA size fractionation. Genomics 2002; 79:603-7. [PMID: 11944994 DOI: 10.1006/geno.2002.6738] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
The ability to generate and obtain full-length (FL) cDNAs is of critical importance to the field of genomics. Most cDNAs in a traditional cDNA library lack the initiating 5' ATG, making it difficult to obtain a FL clone. We report here on an improved protocol for the preparation of FL enriched cDNA libraries. We demonstrate that if good quality RNA is used in the cDNA synthesis, high-quality, FL cDNA can be generated for messages upward of 7 kb. In addition, we demonstrate the utility of size fractionation as a means to produce libraries containing a high percentage of initiating 5' ATG containing clones with insert sizes greater than 4 kb. The method is simple, cost efficient, and can be performed in most laboratories equipped to perform molecular biology. Lastly, the novel methodologies used in the analysis of the cDNA and library should prove useful to others working to create high-quality cDNA libraries.
Collapse
Affiliation(s)
- Michael P Draper
- Paratek Pharmaceuticals, 75 Kneeland Street, Boston, Massachusetts 02111, USA.
| | | | | | | | | |
Collapse
|
40
|
Efimov VA, Chakhmakhcheva OG, Archdeacon J, Fernandez JM, Fedorkin ON, Dorokhov YL, Atabekov JG. Detection of the 5'-cap structure of messenger RNAs with the use of the cap-jumping approach. Nucleic Acids Res 2001; 29:4751-9. [PMID: 11713326 PMCID: PMC92527 DOI: 10.1093/nar/29.22.4751] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2001] [Revised: 08/28/2001] [Accepted: 09/20/2001] [Indexed: 11/13/2022] Open
Abstract
An effective procedure for specific determination of the cap structure at the 5'-terminus of mRNA and for isolation of the corresponding full-length cDNA has been developed. The procedure involves covalent attachment of an oligonucleotide template extender to the 5'-cap structure of mRNA followed by RT-PCR using M-MLV SuperScript II reverse transcriptase. In the course of reverse transcription, the enzyme 'jumps over' the cap structure and includes the sequence complementary to the oligonucleotide template extender into the 3'-end of the first cDNA strand. The cap-jumping method was successfully tested using some mammalian cellular mRNAs, genomic RNAs of tobacco mosaic virus (TMV) U1 and the recently isolated crucifer-infecting tobamovirus. Moreover, cDNA products corresponding to the genomic tobamovirus RNA were obtained from total RNA extracted from tobacco plants infected by crucifer-infecting tobamovirus or tobacco mosaic virus. Using the cap-jumping method, we have shown for the first time that genomic crucifer-infecting tobamovirus (crTMV) RNA contains a 5'-cap structure. This improved method can be recommended for the construction of full-length and 5'-end enriched cDNA libraries, identification of capped RNAs and determination of their 5'-terminal sequences.
Collapse
Affiliation(s)
- V A Efimov
- Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry, ul. Miklukho-Maklaya 16/10, Moscow 117997, Russia.
| | | | | | | | | | | | | |
Collapse
|
41
|
Piao Y, Ko NT, Lim MK, Ko MS. Construction of long-transcript enriched cDNA libraries from submicrogram amounts of total RNAs by a universal PCR amplification method. Genome Res 2001; 11:1553-8. [PMID: 11544199 PMCID: PMC311119 DOI: 10.1101/gr.185501] [Citation(s) in RCA: 20] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Here we report a novel design of linker primer that allows one to differentially amplify long tracts (average 3.0 kb with size ranges of 1-7 kb) or short DNAs (average 1.5 kb with size ranges of 0.5-3 kb) from a complex mixture. The method allows one to generate cDNA libraries enriched for long transcripts without size selection of insert DNAs. One representative library from newborn kidney includes 70% of clones bearing ATG start codons. A comparable library has been generated from 20 mouse blastocysts, containing only approximately 40 ng of total RNA. This universal PCR amplification scheme can provide a route to isolate very large cDNAs, even if they are expressed at very low levels.
Collapse
Affiliation(s)
- Y Piao
- Developmental Genomics and Aging Section, Laboratory of Genetics, National Institute on Aging, National Institutes of Health, Baltimore, Maryland 21224, USA
| | | | | | | |
Collapse
|
42
|
Abstract
The development of functional genomic resources is essential to understand and utilize information generated from genome sequencing projects. Central to the development of this technology is the creation of high-quality cDNA resources and improved technologies for analyzing coding and noncoding mRNA sequences. The isolation and mapping of cDNAs is an entrée to characterizing the information that is of significant biological relevance in the genome of an organism. However, a bottleneck is often encountered when attempting to bring to full-length (or at least full-coding) a number of incomplete cDNAs in parallel, since this involves the nonsystematic, time consuming, and labor-intensive iterative screening of a number of cDNA libraries of variable quality and/or directed strategies to process individual clones (e.g., 5' rapid amplification of cDNA ends). Here, we review the current state of the art in cDNA library generation, as well as present an analysis of the different steps involved in cDNA library generation.
Collapse
Affiliation(s)
- M Das
- Department of Biochemistry, McGill Cancer Center, McGill University, Montreal, Quebec, Canada H3G 1Y6
| | | | | | | | | |
Collapse
|
43
|
Xiong W, Hsieh CC, Kurtz AJ, Rabek JP, Papaconstantinou J. Regulation of CCAAT/enhancer-binding protein-beta isoform synthesis by alternative translational initiation at multiple AUG start sites. Nucleic Acids Res 2001; 29:3087-98. [PMID: 11452034 PMCID: PMC55812 DOI: 10.1093/nar/29.14.3087] [Citation(s) in RCA: 57] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2001] [Revised: 06/04/2001] [Accepted: 06/04/2001] [Indexed: 11/14/2022] Open
Abstract
The mRNA of the intronless, single-copy CCAAT/enhancer-binding protein-beta (C/EBPbeta) gene encodes several isoforms that have truncated transcription activation domains. This occurs by the alternative translational initiation (ATI) at multiple AUG start sites. The C/EBPbeta mRNA has four in-frame AUGs and an internal out-of-frame AUG associated with a small open reading frame (sORF). Initiation of translation at the in-frame AUGs forms 40-kDa (AUG-1), 35-kDa (AUG-2), 20-kDa (AUG-3) and 8.5-kDa (AUG-4) isoforms. We show that in COS-1 cells the 20-kDa isoform is not a product of proteolysis of the higher molecular weight isoforms. The sORF contains an AUG and termination signal that may produce the oligopeptide MPPAAARRL. Our studies suggest that ATI involves three mRNA structural features: (i) the cap structure, (ii) the context of the Kozak sequences that flank the AUG and (iii) the integrity of the sORF. We propose that formation of C/EBPbeta isoforms is accomplished by a leaky ribosomal scanning mechanism that facilitates ATI of multiple internal AUGs.
Collapse
Affiliation(s)
- W Xiong
- Department of Human Biological Chemistry and Genetics, University of Texas Medical Branch, Galveston, TX 77555-0643, USA
| | | | | | | | | |
Collapse
|
44
|
Abstract
In the continuing search for a full-length cDNA cloning method, there is no clear winner. Perfecting these techniques may require the re-engineering of reverse transcriptase. There now exist two reasonably linear methods for deriving expression signatures from small amounts of biological material, but advances in serial analysis of gene expression provide a quantitative, if expensive, alternative to these methods.
Collapse
Affiliation(s)
- S Bashiardes
- Department of Genetics, Washington University School of Medicine, Campus Box 8232, 4566 Scott Avenue, St Louis, Missouri 63110-1093, USA
| | | |
Collapse
|
45
|
Sugahara Y, Carninci P, Itoh M, Shibata K, Konno H, Endo T, Muramatsu M, Hayashizaki Y. Comparative evaluation of 5'-end-sequence quality of clones in CAP trapper and other full-length-cDNA libraries. Gene 2001; 263:93-102. [PMID: 11223247 DOI: 10.1016/s0378-1119(00)00557-6] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
To enhance the usefulness of the laboratory mouse and to facilitate the rapid assay of gene functions we have been collecting the entire set of mouse full-length cDNA by one-pass sequencing. To collect full-length cDNA clones efficiently, it is critical to construct high-quality cDNA libraries. In recent years, we have been developing a way to construct full-length cDNA libraries by using biotinylation of the cap structure (the 'CAP-trapper' method) coupled with treatment to increase reverse transcriptase efficiency at high temperature by the addition of trehalose. In this paper we report our evaluation of the quality of CAP trapper and a number of other full-length cDNA libraries, including the results of 5' end analysis of clones in CAP trapper and the other libraries. We used a procedure that compared the 5'-ends of cDNA clones with those of genes in the public databases. Our analysis showed that 63% of cDNA clones in CAP trapper libraries had sequences that were either the same length as those of equivalent genes in the public database or 5'-extended, and that 90% of these clones maintained their coding sequences. These results indicate that the CAP trapper library is a promising tool for collecting full-length cDNA in large-scale projects. Comparison of the quality of CAP trapper with that of other full-length-cDNA libraries confirmed the value of these libraries.
Collapse
Affiliation(s)
- Y Sugahara
- Laboratory for Genome Exploration Research Project, Genomic Sciences Center and Genome Science Laboratory, RIKEN Tsukuba Institute, 3-1-1 Koyadai, Tsukuba, Ibaraki 305-0074, Japan
| | | | | | | | | | | | | | | |
Collapse
|
46
|
Abstract
Current strategies for cDNA cloning are based on construction of cDNA libraries and colony screening. The process of obtaining a full-length cDNA clone can be highly time and labor intensive. Using the human actin gene as a model target cDNA, we have developed an RNA-capture method for rapid cloning of full-length cDNAs. The approach involves the capture of mRNA with expressed sequence tag (EST)-derived, biotin labeled antisense "capture" primers and streptavidin-coated magnetic beads. Full-length cDNA is then synthesized from purified EST-specific mRNA and cloned directly into plasmid vectors. The results of using beta-actin-based capture primers on cytoplasmic RNA were the isolation of both beta- and gamma-actin cDNA clones. Of the 16 actin-specific cDNA clones analyzed, 15 (93%) were full-length. This approach for cloning full-length cDNAs from available ESTs or partial cDNA sequences will facilitate a more rapid and efficient characterization of gene structure and function.
Collapse
Affiliation(s)
- Z Xu
- Department of Surgery and Mount Zion Cancer Center, University of California, 2340 Sutter Street, P.O. Box 1674, San Francisco, CA 94115, USA.
| | | | | |
Collapse
|
47
|
Fuchs B, Zhang K, Bolander ME, Sarkar G. Differential mRNA fingerprinting by preferential amplification of coding sequences. Gene 2000; 258:155-63. [PMID: 11111053 DOI: 10.1016/s0378-1119(00)00393-0] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
Abstract
The need for rapid identification of differentially expressed genes will persist even after the complete human genomic sequence becomes available. The most popular method for identifying differentially expressed genes acquires expressed sequence tags (ESTs) from the extreme 3' non-coding end of mRNAs. Such ESTs have limitations for downstream applications. We have developed a method, termed preferential amplification of coding sequences (PACS), that was applied to identify differentially expressed coding sequence tags (dCSTs) between osteoblasts and osteosarcoma cells. PACS was achieved by PCR with a set of primers to anchor at sequences complementary to AUG sequences in mRNAs and another set of primers to anchor at a PCR-amplifiable distance from AUG sequences. An initial screen identified 103 candidate dCSTs after screening approximately 15% of the expressed genes between the two cell types. Of these sequences, 27 represent CSTs of known genes and two are from 3'-ESTs of known mRNAs. Thus, PACS identified CSTs approximately 13.5 times more often than it identified 3' ESTs, attesting to the objective of the method. Since many of the dCSTs represent known genes, their identity and potential relevance to osteosarcoma could be immediately hypothesized. Differential expression of many of the dCSTs was further demonstrated by northern blotting or RT-PCR. Since PACS is not dependent on the existence of a poly A tail on an mRNA, it should have application to identify dCSTs for both prokaryotic and eukaryotic organisms. Additionally, PACS should aid in the identification of cell-specific or tissue-specific genes and bidirectional acquisition of cDNA sequence enabling rapid retrieval of full-length cDNA sequence of novel genes.
Collapse
Affiliation(s)
- B Fuchs
- Mayo Clinic, Medical Sciences 3-69, 200 First Street SW, Rochester, MN 55905, USA
| | | | | | | |
Collapse
|
48
|
Carninci P, Shibata Y, Hayatsu N, Sugahara Y, Shibata K, Itoh M, Konno H, Okazaki Y, Muramatsu M, Hayashizaki Y. Normalization and subtraction of cap-trapper-selected cDNAs to prepare full-length cDNA libraries for rapid discovery of new genes. Genome Res 2000; 10:1617-30. [PMID: 11042159 PMCID: PMC310980 DOI: 10.1101/gr.145100] [Citation(s) in RCA: 210] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
In the effort to prepare the mouse full-length cDNA encyclopedia, we previously developed several techniques to prepare and select full-length cDNAs. To increase the number of different cDNAs, we introduce here a strategy to prepare normalized and subtracted cDNA libraries in a single step. The method is based on hybridization of the first-strand, full-length cDNA with several RNA drivers, including starting mRNA as the normalizing driver and run-off transcripts from minilibraries containing highly expressed genes, rearrayed clones, and previously sequenced cDNAs as subtracting drivers. Our method keeps the proportion of full-length cDNAs in the subtracted/normalized library high. Moreover, our method dramatically enhances the discovery of new genes as compared to results obtained by using standard, full-length cDNA libraries. This procedure can be extended to the preparation of full-length cDNA encyclopedias from other organisms.
Collapse
Affiliation(s)
- P Carninci
- Laboratory for Genome Exploration Research Group, RIKEN Genomic Sciences Center, Tsukuba, Japan.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
49
|
Schmidt WM, Mueller MW. CapSelect: A highly sensitive method for 5' CAP-dependent enrichment of full-length cDNA in PCR-mediated analysis of mRNAs. Nucleic Acids Res 1999. [DOI: 10.1093/nar/27.21.e31-i] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
|
50
|
Abstract
The Mammalian Gene Collection (MGC) project is a new effort by the NIH to generate full-length complementary DNA (cDNA) resources. This project will provide publicly accessible resources to the full research community. The MGC project entails the production of libraries, sequencing, and database and repository development, as well as the support of library construction, sequencing, and analytic technologies dedicated to the goal of obtaining a full set of human and other mammalian full-length (open reading frame) sequences and clones of expressed genes.
Collapse
Affiliation(s)
- R L Strausberg
- National Cancer Institute, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | | | | | | |
Collapse
|