1
|
Gao S, Ruan Q, Ma Y, Li M, Wang L, Zheng B, Qi Y, Ji Y, Sun Z, Huang Y. Validation of three splice donor and three splice acceptor sites for regulating four novel low-abundance spliced transcripts of human cytomegalovirus UL21.5 gene locus. Int J Mol Med 2014; 35:253-62. [PMID: 25370414 DOI: 10.3892/ijmm.2014.1987] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2014] [Accepted: 10/23/2014] [Indexed: 11/06/2022] Open
Abstract
In a previous study, one spliced transcript of human cytomegalovirus (HCMV), named UL21.5 was identified. UL21.5 has been found to be one of the viral transcripts packaged within HCMV particles. The UL21.5 mRNA is translated into a secreted glycoprotein, which is a viral chemokine decoy receptor specifically interacting with regulated upon activation normal T cell expressed and secreted (RANTES). In the present study, four novel low-abundance 3'-coterminal spliced transcripts were identified to be transcribed from the UL21.5 gene region of a low-passage HCMV strain during the late infection phase by cDNA library screening, northern blot hybridization, reverse transcription polymerase chain reaction (RT-PCR) and rapid amplification of cDNA ends (RACE)-PCR. Three splicing donor and three splicing acceptor sites found in the UL21.5 gene region were validated to be functional in an in vitro expression system. In addition, the determinant regulatory region that is necessary for the splice donor site at nucleotide (nt) 25533 was located in a 9-bp sequence around the site; the regulatory regions for the splice acceptor sites at nt 26597 and nt 26633 were located in a 20-bp sequence upstream of the site at nt 26597 and in a 10-bp sequence from nt 26641 to nt 26650 downstream of the site at nt 26633, respectively.
Collapse
Affiliation(s)
- Shuang Gao
- Virus Laboratory, Affiliated Shengjing Hospital, China Medical University, Shenyang, Liaoning 110004, P.R. China
| | - Qiang Ruan
- Virus Laboratory, Affiliated Shengjing Hospital, China Medical University, Shenyang, Liaoning 110004, P.R. China
| | - Yanping Ma
- Virus Laboratory, Affiliated Shengjing Hospital, China Medical University, Shenyang, Liaoning 110004, P.R. China
| | - Mali Li
- Virus Laboratory, Affiliated Shengjing Hospital, China Medical University, Shenyang, Liaoning 110004, P.R. China
| | - Lin Wang
- Virus Laboratory, Affiliated Shengjing Hospital, China Medical University, Shenyang, Liaoning 110004, P.R. China
| | - Bo Zheng
- Virus Laboratory, Affiliated Shengjing Hospital, China Medical University, Shenyang, Liaoning 110004, P.R. China
| | - Ying Qi
- Virus Laboratory, Affiliated Shengjing Hospital, China Medical University, Shenyang, Liaoning 110004, P.R. China
| | - Yaohua Ji
- Virus Laboratory, Affiliated Shengjing Hospital, China Medical University, Shenyang, Liaoning 110004, P.R. China
| | - Zhengrong Sun
- Virus Laboratory, Affiliated Shengjing Hospital, China Medical University, Shenyang, Liaoning 110004, P.R. China
| | - Yujing Huang
- Virus Laboratory, Affiliated Shengjing Hospital, China Medical University, Shenyang, Liaoning 110004, P.R. China
| |
Collapse
|
2
|
Islam MA, Sturrock RN, Ekramoddoullah AKM. Gene expression profiling of a compatible interaction between Douglas-Fir and the root rot fungal pathogen Phellinus sulphurascens. PHYTOPATHOLOGY 2013; 103:583-593. [PMID: 23676089 DOI: 10.1094/phyto-04-12-0086-r] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/02/2023]
Abstract
Douglas-fir (DF) (Pseudotsuga menziesii) is one of the largest and most economically important coniferous species in western North America. Its productivity is greatly affected by the root rot fungus Phellinus sulphurascens Pilát. Evidence of resistance by DF to fungal root pathogens such as P. sulphurascens has been reported but mechanisms of resistance in this compatible pathosystem are not yet known. To better understand the DF-P. sulphurascens interaction, especially at the molecular level, we selected 12 diverse plant genes already identified as defense-related from a cDNA library constructed using root tissues from P. sulphurascens-infected DF seedlings. Using quantitative reverse-transcriptase polymerase chain reaction on infected DF root samples collected at five different time points after inoculation, we found that P. sulphurascens infection significantly elevated expression of the 12 selected genes. In most cases the highest expression level was recorded within 2 to 3 days after inoculation. The constructed cDNA library, which is enriched with defense-related host genes and a number of fungal genes, will continue to serve as a useful resource for future larger-scale gene discovery and functional research on the P. sulphurascens and DF pathosystem.
Collapse
Affiliation(s)
- M A Islam
- Natural Resources Canada, Canadian Forest Service, Pacific Forestry Centre, 506 West Burnside Road, Victoria, BC, V8Z 1M5, Canada
| | | | | |
Collapse
|
3
|
Adomas AB, Lopez-Giraldez F, Clark TA, Wang Z, Townsend JP. Multi-targeted priming for genome-wide gene expression assays. BMC Genomics 2010; 11:477. [PMID: 20716356 PMCID: PMC3091673 DOI: 10.1186/1471-2164-11-477] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2010] [Accepted: 08/17/2010] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Complementary approaches to assaying global gene expression are needed to assess gene expression in regions that are poorly assayed by current methodologies. A key component of nearly all gene expression assays is the reverse transcription of transcribed sequences that has traditionally been performed by priming the poly-A tails on many of the transcribed genes in eukaryotes with oligo-dT, or by priming RNA indiscriminately with random hexamers. We designed an algorithm to find common sequence motifs that were present within most protein-coding genes of Saccharomyces cerevisiae and of Neurospora crassa, but that were not present within their ribosomal RNA or transfer RNA genes. We then experimentally tested whether degenerately priming these motifs with multi-targeted primers improved the accuracy and completeness of transcriptomic assays. RESULTS We discovered two multi-targeted primers that would prime a preponderance of genes in the genomes of Saccharomyces cerevisiae and Neurospora crassa while avoiding priming ribosomal RNA or transfer RNA. Examining the response of Saccharomyces cerevisiae to nitrogen deficiency and profiling Neurospora crassa early sexual development, we demonstrated that using multi-targeted primers in reverse transcription led to superior performance of microarray profiling and next-generation RNA tag sequencing. Priming with multi-targeted primers in addition to oligo-dT resulted in higher sensitivity, a larger number of well-measured genes and greater power to detect differences in gene expression. CONCLUSIONS Our results provide the most complete and detailed expression profiles of the yeast nitrogen starvation response and N. crassa early sexual development to date. Furthermore, our multi-targeting priming methodology for genome-wide gene expression assays provides selective targeting of multiple sequences and counter-selection against undesirable sequences, facilitating a more complete and precise assay of the transcribed sequences within the genome.
Collapse
Affiliation(s)
- Aleksandra B Adomas
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT 06511, USA
| | | | | | | | | |
Collapse
|
4
|
Hu G, Linning R, McCallum B, Banks T, Cloutier S, Butterfield Y, Liu J, Kirkpatrick R, Stott J, Yang G, Smailus D, Jones S, Marra M, Schein J, Bakkeren G. Generation of a wheat leaf rust, Puccinia triticina, EST database from stage-specific cDNA libraries. MOLECULAR PLANT PATHOLOGY 2007; 8:451-67. [PMID: 20507513 DOI: 10.1111/j.1364-3703.2007.00406.x] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/09/2023]
Abstract
Thirteen cDNA libraries constructed from small amounts of leaf rust mRNA using optimized methods served as the source for the generation of 25 558 high-quality DNA sequence reads. Five life-cycle stages were sampled: resting urediniospores, urediniospores germinated over water or plant extract, compatible, interactive stages during appressorium or haustorium formation just before sporulation, and an incompatible interaction. mRNA populations were subjected to treatments such as full-length cDNA production, subtractive and normalizing hybridizations, and size selection methods combined with PCR amplification. Pathogen and host sequences from interactive libraries were differentiated in silico using cereal and fungal sequences, codon usage analyses, and by means of a partial prototype cDNA microarray hybridized with genomic DNAs. This yielded a non-redundant unigene set of 9760 putative fungal sequences consisting of 6616 singlets and 3144 contigs, representing 4.7 Mbp. At an E-value 10(-5), 3670 unigenes (38%) matched sequences in various databases and collections but only 694 unigenes (7%) were similar to genes with known functions. In total, 296 unigenes were identified as most probably wheat and ten as rRNA sequences. Annotation rates were low for germinated urediniospores (4%) and appressoria (2%). Gene sets obtained from the various life-cycle stages appear to be remarkably different, suggesting drastic reprogramming of the transcriptome during these major differentiation processes. Redundancy within contigs yielded information about possible expression levels of certain genes among stages. Many sequences were similar to genes from other rusts such as Uromyces and Melampsora species; some of these genes have been implicated in pathogenicity and virulence.
Collapse
Affiliation(s)
- Guanggan Hu
- Pacific Agri-Food Research Centre, Agriculture and Agri-Food Canada, Highway 97, Summerland, BC V0H 1Z0, Canada
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
5
|
Katzenellenbogen M, Mizrahi L, Pappo O, Klopstock N, Olam D, Barash H, Domany E, Galun E, Goldenberg D. Molecular mechanisms of the chemopreventive effect on hepatocellular carcinoma development in Mdr2 knockout mice. Mol Cancer Ther 2007; 6:1283-91. [PMID: 17431106 DOI: 10.1158/1535-7163.mct-06-0420] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Dietary antioxidants and selenium compounds were shown to have a therapeutic effect against hepatocellular carcinoma in several mouse models. We tested the effects of tannic acid and selenomethionine on hepatocellular carcinoma development in Mdr2 knockout (Mdr2-KO) mice. Mdr2-KO and age-matched Mdr2 heterozygous control mice were fed with tannic acid or selenomethionine during the first 3 months of life. Then, several mice from each group were sacrificed, and liver tissue samples were removed for analysis. The remaining mice were fed a regular diet until the age of 16 months, at which time the number and size of liver tumors were determined. Liver tissue samples of 3-month-old mice were subjected to gene expression profiling analysis using cDNA macroarrays containing probes for 240 genes that regulate responses to oxidative stress and inflammation or lipid metabolism. Both tannic acid and selenomethionine had partial chemopreventive effect on development of hepatocellular carcinoma in Mdr2-KO mice: they reduced the incidence of large tumor nodules (diameter >1 cm) at age 16 months. Both agents inhibited gene expression and reversed up-regulation of many genes that control inflammation or response to oxidative stress in Mdr2-KO livers at age 3 months. This inhibitory effect on gene expression correlated with the ability of agents to reduce incidence of large tumors: selenomethionine was more active than tannic acid in both aspects. Understanding the molecular mechanism of chemoprevention effect could improve our therapeutic modalities while using these agents.
Collapse
Affiliation(s)
- Mark Katzenellenbogen
- Goldyne Savad Institute of Gene Therapy, Hadassah-Hebrew University Medical Center, Jerusalem, Israel.
| | | | | | | | | | | | | | | | | |
Collapse
|
6
|
Kim YC, Jung YC, Xuan Z, Dong H, Zhang MQ, Wang SM. Pan-genome isolation of low abundance transcripts using SAGE tag. FEBS Lett 2006; 580:6721-9. [PMID: 17113583 PMCID: PMC1791009 DOI: 10.1016/j.febslet.2006.11.013] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2006] [Revised: 10/31/2006] [Accepted: 11/03/2006] [Indexed: 11/24/2022]
Abstract
The SAGE (serial analysis of gene expression) method is sensitive at detecting the lower abundance transcripts. More than a third of human SAGE tags identified are novel representing the low abundance unknown transcripts. Using the GLGI method (generation of longer 3' EST from SAGE tag for gene identification), we converted 1009 low-copy, human X chromosome-specific SAGE tags into 10210 3' ESTs. We identified 3418 unique 3' ESTs, 46% of which are novel and originated from the lower abundance transcripts. However, nearly all 3' ESTs were mapped to various regions across the genome but not X chromosome. Detailed analysis indicates that those 3' ESTs were isolated by SAGE tag mis-priming to the non-parent transcripts. Replacing SAGE tags with non-transcribed genomic DNA tags resulted in poor amplification, indicating that the sequence similarity between different transcripts contributed to the amplification. Our study shows the prevalence of novel low abundance transcripts that can be isolated efficiently through SAGE tags mis-priming.
Collapse
Affiliation(s)
- Yeong Cheol Kim
- Center for Functional Genomics, Division of Medical Genetics, Department of Medicine, ENH Research Institute, Northwestern University, Evanston, IL 60201, USA
| | | | | | | | | | | |
Collapse
|
7
|
Wang SM. Applying the SAGE technique to study the effects of electromagnetic field on biological systems. Proteomics 2006; 6:4765-8. [PMID: 16897688 DOI: 10.1002/pmic.200500881] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
Identification of genes alternatively expressed in electromagnetic field (EMF)-exposed cells could provide direct evidence for biological effects of EMF. As there are a few indications so far for certain genes to be influenced by EMF, genome-wide scans of the transcriptome appear to be necessary. Among the several technologies used for genome-wide gene expression analysis, serial analysis of gene expression (SAGE) is one promising method, which seems particularly applicable for EMF research. This review provides a brief description of the features of gene expression, illustrates the basic principle of SAGE, and discusses the advantages and limitations of SAGE as well as examples of application. This information should help investigators determine if the SAGE technique is an optimal method for evaluating the biological effects of EMF.
Collapse
Affiliation(s)
- San Ming Wang
- Center for Functional Genomics, ENH Research Institute, Department of Medicine, Northwestern University, Evanston, IL 60201, USA.
| |
Collapse
|
8
|
Lee S, Bao J, Zhou G, Shapiro J, Xu J, Shi RZ, Lu X, Clark T, Johnson D, Kim YC, Wing C, Tseng C, Sun M, Lin W, Wang J, Yang H, Wang J, Du W, Wu CI, Zhang X, Wang SM. Detecting novel low-abundant transcripts in Drosophila. RNA (NEW YORK, N.Y.) 2005; 11:939-46. [PMID: 15923377 PMCID: PMC1370778 DOI: 10.1261/rna.7239605] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/02/2023]
Abstract
Increasing evidence suggests that low-abundant transcripts may play fundamental roles in biological processes. In an attempt to estimate the prevalence of low-abundant transcripts in eukaryotic genomes, we performed a transcriptome analysis in Drosophila using the SAGE technique. We collected 244,313 SAGE tags from transcripts expressed in Drosophila embryonic, larval, pupae, adult, and testicular tissue. From these SAGE tags, we identified 40,823 unique SAGE tags. Our analysis showed that 55% of the 40,823 unique SAGE tags are novel without matches in currently known Drosophila transcripts, and most of the novel SAGE tags have low copy numbers. Further analysis indicated that these novel SAGE tags represent novel low-abundant transcripts expressed from loci outside of currently annotated exons including the intergenic and intronic regions, and antisense of the currently annotated exons in the Drosophila genome. Our study reveals the presence of a significant number of novel low-abundant transcripts in Drosophila, and highlights the need to isolate these novel low-abundant transcripts for further biological studies.
Collapse
Affiliation(s)
- Sanggyu Lee
- Department of Medicine, University of Chicago, IL 60637, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
9
|
Xiao NZ, Ba L, Holm PB, Wang XZ, Bowra S. Quantitative transcript analysis in plants: improved first-strand cDNA synthesis. Acta Biochim Biophys Sin (Shanghai) 2005; 37:429-34. [PMID: 15944759 DOI: 10.1111/j.1745-7270.2005.00052.x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022] Open
Abstract
The quantity and quality of first-strand cDNA directly influence the accuracy of transcriptional analysis and quantification. Using a plant-derived alpha-tubulin as a model system, the effect of oligo sequence and DTT on the quality and quantity of first-strand cDNA synthesis was assessed via a combination of semi-quantitative PCR and real-time PCR. The results indicated that anchored oligo dT significantly improved the quantity and quality of alpha-tubulin cDNA compared to the conventional oligo dT. Similarly, omitting DTT from the first-strand cDNA synthesis also enhanced the levels of transcript. This is the first time that a comparative analysis has been undertaken for a plant system and it shows conclusively that small changes to current protocols can have very significant impact on transcript analysis.
Collapse
Affiliation(s)
- Nai-Zhong Xiao
- School of Life Sciences, Northeast Normal University, Changchun 130024, China
| | | | | | | | | |
Collapse
|
10
|
Halum SL, Popper P, Cioffi JA, Wackym PA. Serial Analysis of Gene Expression in Neurofibromatosis Type 2–Associated Vestibular Schwannoma. Otol Neurotol 2004; 25:587-93. [PMID: 15241239 DOI: 10.1097/00129492-200407000-00028] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
HYPOTHESIS The genesis, morphology, and growth characteristics of vestibular schwannomas are determined by genetic alterations which vary gene transcript expression and this transcript expression can be qualitatively and quantitatively evaluated using the SAGE technique. By use of such technique, gene products with tumorigenic potential may be identified, providing insight and targets for future study. BACKGROUND Serial analysis of gene expression (SAGE) is a powerful new technique that allows detailed qualitative and quantitative evaluation of cellular gene transcript expression. Tissue in limited quantity (5 x 10 to 2 x 10 cells) may be analyzed by a modified version of SAGE called microSAGE. Application of SAGE or microSAGE to study vestibular schwannoma gene expression has not been previously reported. METHODS Fresh, vestibular schwannoma specimen from an individual with the diagnosis of neurofibromatosis type 2 was attained intraoperatively and maintained in a sealed container at -80degreesC until the time of analysis. The tissue was processed according to the microSAGE protocol, using 180 mg of vestibular schwannoma as starting material. RESULTS The protocol resulted in the generation and sequencing of a tag library involving 458 tags representing 277 different gene products, including many transcripts known to be expressed in vestibular schwannomas. Several gene products with tumorigenic potential were identified. CONCLUSIONS These data demonstrate that microSAGE is a useful technique to study vestibular schwannoma gene expression. Future studies will include building more comprehensive libraries and comparing libraries from various vestibular schwannoma phenotypes to identify useful diagnostic or prognostic markers, and targets for therapeutic intervention.
Collapse
Affiliation(s)
- Stacey L Halum
- Department of Otolaryngology and Communication Sciences, Medical College of Wisconsin, Milwaukee 53226, USA
| | | | | | | |
Collapse
|
11
|
Sun M, Zhou G, Lee S, Chen J, Shi RZ, Wang SM. SAGE is far more sensitive than EST for detecting low-abundance transcripts. BMC Genomics 2004; 5:1. [PMID: 14704093 PMCID: PMC317289 DOI: 10.1186/1471-2164-5-1] [Citation(s) in RCA: 58] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2003] [Accepted: 01/05/2004] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Isolation of low-abundance transcripts expressed in a genome remains a serious challenge in transcriptome studies. The sensitivity of the methods used for analysis has a direct impact on the efficiency of the detection. We compared the EST method and the SAGE method to determine which one is more sensitive and to what extent the sensitivity is great for the detection of low-abundance transcripts. RESULTS Using the same low-abundance transcripts detected by both methods as the targeted sequences, we observed that the SAGE method is 26 times more sensitive than the EST method for the detection of low-abundance transcripts. CONCLUSIONS The SAGE method is more efficient than the EST method in detecting the low-abundance transcripts.
Collapse
Affiliation(s)
- Miao Sun
- Department of Medicine, University of Chicago, 5841 S. Maryland Avenue, MC2115, Chicago, Illinois 60637, USA
| | - Guolin Zhou
- Department of Medicine, University of Chicago, 5841 S. Maryland Avenue, MC2115, Chicago, Illinois 60637, USA
| | - Sanggyu Lee
- Department of Medicine, University of Chicago, 5841 S. Maryland Avenue, MC2115, Chicago, Illinois 60637, USA
| | - Jianjun Chen
- Department of Medicine, University of Chicago, 5841 S. Maryland Avenue, MC2115, Chicago, Illinois 60637, USA
| | - Run Zhang Shi
- Department of Medicine, University of Chicago, 5841 S. Maryland Avenue, MC2115, Chicago, Illinois 60637, USA
| | - San Ming Wang
- Department of Medicine, University of Chicago, 5841 S. Maryland Avenue, MC2115, Chicago, Illinois 60637, USA
- ENH Research Institute, Northwestern University, 1001 University Place, Evanston, IL 60201
| |
Collapse
|
12
|
Carninci P, Waki K, Shiraki T, Konno H, Shibata K, Itoh M, Aizawa K, Arakawa T, Ishii Y, Sasaki D, Bono H, Kondo S, Sugahara Y, Saito R, Osato N, Fukuda S, Sato K, Watahiki A, Hirozane-Kishikawa T, Nakamura M, Shibata Y, Yasunishi A, Kikuchi N, Yoshiki A, Kusakabe M, Gustincich S, Beisel K, Pavan W, Aidinis V, Nakagawara A, Held WA, Iwata H, Kono T, Nakauchi H, Lyons P, Wells C, Hume DA, Fagiolini M, Hensch TK, Brinkmeier M, Camper S, Hirota J, Mombaerts P, Muramatsu M, Okazaki Y, Kawai J, Hayashizaki Y. Targeting a complex transcriptome: the construction of the mouse full-length cDNA encyclopedia. Genome Res 2003; 13:1273-89. [PMID: 12819125 PMCID: PMC403712 DOI: 10.1101/gr.1119703] [Citation(s) in RCA: 142] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
We report the construction of the mouse full-length cDNA encyclopedia,the most extensive view of a complex transcriptome,on the basis of preparing and sequencing 246 libraries. Before cloning,cDNAs were enriched in full-length by Cap-Trapper,and in most cases,aggressively subtracted/normalized. We have produced 1,442,236 successful 3'-end sequences clustered into 171,144 groups, from which 60,770 clones were fully sequenced cDNAs annotated in the FANTOM-2 annotation. We have also produced 547,149 5' end reads,which clustered into 124,258 groups. Altogether, these cDNAs were further grouped in 70,000 transcriptional units (TU),which represent the best coverage of a transcriptome so far. By monitoring the extent of normalization/subtraction, we define the tentative equivalent coverage (TEC),which was estimated to be equivalent to >12,000,000 ESTs derived from standard libraries. High coverage explains discrepancies between the very large numbers of clusters (and TUs) of this project,which also include non-protein-coding RNAs,and the lower gene number estimation of genome annotations. Altogether,5'-end clusters identify regions that are potential promoters for 8637 known genes and 5'-end clusters suggest the presence of almost 63,000 transcriptional starting points. An estimate of the frequency of polyadenylation signals suggests that at least half of the singletons in the EST set represent real mRNAs. Clones accounting for about half of the predicted TUs await further sequencing. The continued high-discovery rate suggests that the task of transcriptome discovery is not yet complete.
Collapse
Affiliation(s)
- Piero Carninci
- Laboratory for Genome Exploration Research Group, RIKEN Genomic Sciences Center (GSC), RIKEN Yokohama Institute, Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
13
|
Clark T, Lee S, Ridgway Scott L, Wang SM. Computational Analysis of Gene Identification with SAGE. J Comput Biol 2003; 9:513-26. [PMID: 12162890 DOI: 10.1089/106652702760138600] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
SAGE is one of the few techniques capable of uniformly probing gene expression at a genome level irrespective of mRNA abundance and without a priori knowledge of the transcripts present. However, individual SAGE tags can match many sequences in the reference database, complicating gene identification. We perform a baseline evaluation of gene identification with SAGE using UniGene Human as the reference database by analyzing 1) the distributions of tags for various length tag sets formed for UniGene Human and 2) the tag-to-sequence mapping using a SAGE tag set consisting of 37,522 tags derived from human myeloid cells. The extensive multiplicity of the dbEST component of UniGene significantly detracts from gains that might be expected by extending tags within the scope of the SAGE protocol. In order to achieve reasonable sequence specificity for gene identification with the content of the commonly used UniGene sequence collection, tags on the order of hundreds of bases in length are required. One way to produce tags of such lengths is with GLGI, which extends SAGE tags to the 3' end of cDNA. We show that the longer sequences produced by GLGI relieve significantly the multiple match condition. In the myeloid sample, we also found a correlation between multiple match severity and high copy number. We extrapolate these findings, providing insights into the use of UniGene Human as a reference for gene identification.
Collapse
Affiliation(s)
- Terry Clark
- Department of Computer Science, The University of Chicago, Chicago, IL 60637, USA.
| | | | | | | |
Collapse
|
14
|
Chen J, Sun M, Lee S, Zhou G, Rowley JD, Wang SM. Identifying novel transcripts and novel genes in the human genome by using novel SAGE tags. Proc Natl Acad Sci U S A 2002; 99:12257-62. [PMID: 12213963 PMCID: PMC129432 DOI: 10.1073/pnas.192436499] [Citation(s) in RCA: 123] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/23/2002] [Indexed: 11/18/2022] Open
Abstract
The number of genes in the human genome is still a controversial issue. Whereas most of the genes in the human genome are said to have been physically or computationally identified, many short cDNA sequences identified as tags by use of serial analysis of gene expression (SAGE) do not match these genes. By performing experimental verification of more than 1,000 SAGE tags and analyzing 4,285,923 SAGE tags of human origin in the current SAGE database, we examined the nature of the unmatched SAGE tags. Our study shows that most of the unmatched SAGE tags are truly novel SAGE tags that originated from novel transcripts not yet identified in the human genome, including alternatively spliced transcripts from known genes and potential novel genes. Our study indicates that by using novel SAGE tags as probes, we should be able to identify efficiently many novel transcripts/novel genes in the human genome that are difficult to identify by conventional methods.
Collapse
Affiliation(s)
- Jianjun Chen
- Department of Medicine, University of Chicago, 5841 South Maryland, MC2115, Chicago, IL 60637, USA
| | | | | | | | | | | |
Collapse
|
15
|
Laveder P, De Pittà C, Toppo S, Valle G, Lanfranchi G. A two-step strategy for constructing specifically self-subtracted cDNA libraries. Nucleic Acids Res 2002; 30:e38. [PMID: 11972353 PMCID: PMC113861 DOI: 10.1093/nar/30.9.e38] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
We have developed a new strategy for producing subtracted cDNA libraries that is optimized for connective and epithelial tissues, where a few exceptionally abundant (super-prevalent) RNA species account for a large fraction of the total mRNA mass. Our method consists of a two-step subtraction of the most abundant mRNAs: the first step involves a novel use of oligo-directed RNase H digestion to lower the concentration of tissue-specific, super-prevalent RNAs. In the second step, a highly specific subtraction is achieved through hybridization with probes from a 3'-end ESTs collection. By applying this technique in skeletal muscle, we have constructed subtracted cDNA libraries that are effectively enriched for genes expressed at low levels. We further report on frequent premature termination of transcription in human muscle mitochondria and discuss the importance of this phenomenon in designing subtractive approaches. The tissue-specific collections of cDNA clones generated by our method are particularly well suited for expression profiling.
Collapse
Affiliation(s)
- Paolo Laveder
- CRIBI Biotechnology Center, Università degli Studi di Padova, via Ugo Bassi 58b, Padua I-35121, Italy
| | | | | | | | | |
Collapse
|
16
|
Nam DK, Lee S, Zhou G, Cao X, Wang C, Clark T, Chen J, Rowley JD, Wang SM. Oligo(dT) primer generates a high frequency of truncated cDNAs through internal poly(A) priming during reverse transcription. Proc Natl Acad Sci U S A 2002; 99:6152-6. [PMID: 11972056 PMCID: PMC122918 DOI: 10.1073/pnas.092140899] [Citation(s) in RCA: 137] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/11/2002] [Indexed: 11/18/2022] Open
Abstract
We have analyzed a systematic flaw in the current system of gene identification: the oligo(dT) primer widely used for cDNA synthesis generates a high frequency of truncated cDNAs through internal poly(A) priming. Such truncated cDNAs may contribute to 12% of the expressed sequence tags in the current dbEST database. By using a synthetic transcript and real mRNA templates as models, we characterized the patterns of internal poly(A) priming by oligo(dT) primer. We further demonstrated that the internal poly(A) priming can be effectively diminished by replacing the oligo(dT) primer with a set of anchored oligo(dT) primers for reverse transcription. Our study indicates that cDNAs designed for genomewide gene identification should be synthesized by use of the anchored oligo(dT) primers, rather than the oligo(dT) primers, to diminish the generation of truncated cDNAs caused by internal poly(A) priming.
Collapse
Affiliation(s)
- Douglas Kyung Nam
- Department of Medicine, Center for Functional Genomics, University of Chicago, 5841 South Maryland Avenue, MC2115, Chicago, IL 60637, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
17
|
Chen J, Lee S, Zhou G, Wang SM. High-throughput GLGI procedure for converting a large number of serial analysis of gene expression tag sequences into 3' complementary DNAs. Genes Chromosomes Cancer 2002; 33:252-61. [PMID: 11807982 DOI: 10.1002/gcc.10017] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Serial analysis of gene expression (SAGE) is a powerful technique for genome-wide analysis of gene expression. However, two-thirds of SAGE tags cannot be used directly for gene identification for two reasons. First, many SAGE tags match several known expressed sequences, owing to the short length of SAGE tag sequences. Second, many SAGE tags do not match any known expressed sequences, presumably because the sequences corresponding to these SAGE tags have not been identified. These two problems can be solved by extension of the SAGE tags into 3' complementary DNAs (cDNAs) by use of the GLGI technique (generation of longer cDNA fragments from SAGE tags for gene identification). We have improved the original GLGI technique into a high-throughput procedure for simultaneous conversion of a large number of SAGE tags into corresponding 3' cDNAs. The whole process is simple, rapid, low-cost, and highly efficient, as shown by our use of this procedure for analyzing hundreds of SAGE tags. In addition to identifying the correct gene for SAGE tags with multiple matches, GLGI can be used for large-scale identification of novel genes by converting novel SAGE tags into 3' cDNAs. Applying this high-throughput procedure should accelerate the rate of gene identification significantly in the human and other eukaryotic genomes.
Collapse
Affiliation(s)
- Jianjun Chen
- Department of Medicine, University of Chicago, Chicago, IL 60637, USA
| | | | | | | |
Collapse
|
18
|
Chen J, Rowley DA, Clark T, Lee S, Zhou G, Beck C, Rowley JD, Wang SM. The pattern of gene expression in mouse Gr-1(+) myeloid progenitor cells. Genomics 2001; 77:149-62. [PMID: 11597140 DOI: 10.1006/geno.2001.6633] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
To understand the pattern of gene expression in mouse myeloid progenitor cells, we carried out a genome-wide analysis of gene expression in mouse bone marrow Gr-1(+) cells using SAGE and GLGI techniques. We identified 22,033 unique SAGE tags with quantitative information from 73,869 collected SAGE tags. Among these unique tags, 64% match known sequences, including many genes important for myeloid differentiation, and 36% have no matches to known sequences and are likely to represent novel genes. We compared the expression of mouse Gr-1(+) and human CD15(+) myeloid progenitor cells and showed that the pattern of gene expression of these two cell populations had some similarities. We also compared the expression of mouse Gr-1(+) myeloid progenitor cells with that of mouse brain tissue and found a highly tissue-specific manner of gene expression in these two samples. Our data provide a basis for studying altered gene expression in myeloid disorders using mouse models.
Collapse
Affiliation(s)
- J Chen
- Department of Medicine, University of Chicago, 5841 S. Maryland, MC2115, Chicago, Illinois 60637, USA
| | | | | | | | | | | | | | | |
Collapse
|
19
|
Araki T, Nagarajan R, Milbrandt J. Identification of genes induced in peripheral nerve after injury. Expression profiling and novel gene discovery. J Biol Chem 2001; 276:34131-41. [PMID: 11427537 DOI: 10.1074/jbc.m104271200] [Citation(s) in RCA: 60] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Peripheral nerve injury results in axonal degeneration and in phenotypic changes of the surrounding Schwann cells, whose presence is critical for nerve regeneration. To identify genes induced after nerve injury in Schwann cells, we developed a strategy that included differential screening of a subtractive library enriched for cDNAs expressed in injured nerve, sequence analysis, and expression profiling. By using real time quantitative reverse transcriptase-polymerase chain reaction, we found that injury-induced genes could be categorized into four temporal expression patterns. Among the clones we identified were a number that were homologous only to expressed sequence tags in the data base. These were stratified based on their expression profile, presence of identifiable sequence motifs, homologies to other proteins, and evolutionary conservation. We chose one representative gene, nin283, to analyze in detail. The nin283 gene encodes a 227-residue protein containing both a zinc finger and a RING finger motif. nin283 is highly expressed in the central nervous system, particularly in the developing cortical plate in embryos. It is also expressed in peripheral ganglia and is induced by nerve growth factor in PC12 cells. Subcellular localization analysis demonstrated that Nin283 is located in the endosome/lysosome compartment, suggesting that it may participate in ubiquitin-mediated protein modification.
Collapse
Affiliation(s)
- T Araki
- Department of Pathology and Immunology and Internal Medicine, Washington University School of Medicine, St. Louis, Missouri 63110, USA
| | | | | |
Collapse
|
20
|
Das M, Burge CB, Park E, Colinas J, Pelletier J. Assessment of the total number of human transcription units. Genomics 2001; 77:71-8. [PMID: 11543635 DOI: 10.1006/geno.2001.6620] [Citation(s) in RCA: 44] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Variation in the estimates of the number of genes encoded by the human genome (28,000-120,000) attests to the difficulty of systematically identifying human genes. Sequencing of human chromosome 22 (Chr22) provided the first comprehensive, unbiased view of an entire human chromosome, and intensive analysis of this sequence identified 545 genes and 134 pseudogenes that had similarity or identity to known proteins and/or ESTs and which were listed in the gene annotation (http://www.sanger.ac.uk/HGP/Chr22). This analysis yielded an estimate of approximately 36,000 functional expressed genes in the human genome (and 9000 pseudogenes). However, a key uncertainty in this estimate was that hundreds of additional genes beyond those annotated in the Chr22 sequence are predicted by the gene prediction program Genscan, an unknown number of which might represent additional expressed genes. To determine what fraction of these "predicted novel genes" (PNGs) represents expressed human genes, we used a sensitive RT-PCR assay to detect predicted transcripts in 17 tissues and one cell line. Our results indicate that at least 5000-9000 additional human genes which lack similarity to known genes or proteins exist in the human genome, increasing baseline gene estimates to approximately 41,000-45,000.
Collapse
Affiliation(s)
- M Das
- Department of Biochemistry, McGill University, Rm 810, 3655 Drummond St., Montreal, Quebec, H3G 1Y6, Canada
| | | | | | | | | |
Collapse
|
21
|
Abstract
The development of functional genomic resources is essential to understand and utilize information generated from genome sequencing projects. Central to the development of this technology is the creation of high-quality cDNA resources and improved technologies for analyzing coding and noncoding mRNA sequences. The isolation and mapping of cDNAs is an entrée to characterizing the information that is of significant biological relevance in the genome of an organism. However, a bottleneck is often encountered when attempting to bring to full-length (or at least full-coding) a number of incomplete cDNAs in parallel, since this involves the nonsystematic, time consuming, and labor-intensive iterative screening of a number of cDNA libraries of variable quality and/or directed strategies to process individual clones (e.g., 5' rapid amplification of cDNA ends). Here, we review the current state of the art in cDNA library generation, as well as present an analysis of the different steps involved in cDNA library generation.
Collapse
Affiliation(s)
- M Das
- Department of Biochemistry, McGill Cancer Center, McGill University, Montreal, Quebec, Canada H3G 1Y6
| | | | | | | | | |
Collapse
|
22
|
Lee S, Zhou G, Clark T, Chen J, Rowley JD, Wang SM. The pattern of gene expression in human CD15+ myeloid progenitor cells. Proc Natl Acad Sci U S A 2001; 98:3340-5. [PMID: 11248080 PMCID: PMC30655 DOI: 10.1073/pnas.051013798] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
We performed a genome-wide analysis of gene expression in primary human CD15(+) myeloid progenitor cells. By using the serial analysis of gene expression (SAGE) technique, we obtained quantitative information for the expression of 37,519 unique SAGE-tag sequences. Of these unique tags, (i) 25% were detected at high and intermediate levels, whereas 75% were present as single copies, (ii) 53% of the tags matched known expressed sequences, 34% of which were matched to more than one known expressed sequence, and (iii) 47% of the tags had no matches and represent potentially novel genes. The correct genes were confirmed by application of the generation of longer cDNA fragments from SAGE tags for gene identification (GLGI) technique for high-copy tags with multiple matches. A set of genes known to be important in myeloid differentiation were expressed at various levels and used different spliced forms. This study provides a normal baseline for comparison of gene expression in myeloid diseases. The strategy of using SAGE and GLGI techniques in this study has broad applications to the genome-wide identification of expressed genes.
Collapse
Affiliation(s)
- S Lee
- Department of Medicine, University of Chicago Medical Center, 5841 South Maryland, MC2115, Chicago, IL 60637, USA
| | | | | | | | | | | |
Collapse
|
23
|
Posey KL, Jones LB, Cerda R, Bajaj M, Huynh T, Hardin PE, Hardin SH. Survey of transcripts in the adult Drosophila brain. Genome Biol 2001; 2:RESEARCH0008. [PMID: 11276425 PMCID: PMC30707 DOI: 10.1186/gb-2001-2-3-research0008] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2000] [Revised: 01/22/2001] [Accepted: 01/24/2001] [Indexed: 11/16/2022] Open
Abstract
BACKGROUND Classic methods of identifying genes involved in neural function include the laborious process of behavioral screening of mutagenized flies and then rescreening candidate lines for pleiotropic effects due to developmental defects. To accelerate the molecular analysis of brain function in Drosophila we constructed a cDNA library exclusively from adult brains. Our goal was to begin to develop a catalog of transcripts expressed in the brain. These transcripts are expected to contain a higher proportion of clones that are involved in neuronal function. RESULTS The library contains approximately 6.75 million independent clones. From our initial characterization of 271 randomly chosen clones, we expect that approximately 11% of the clones in this library will identify transcribed sequences not found in expressed sequence tag databases. Furthermore, 15% of these 271 clones are not among the 13,601 predicted Drosophila genes. CONCLUSIONS Our analysis of this unique Drosophila brain library suggests that the number of genes may be underestimated in this organism. This work complements the Drosophila genome project by providing information that facilitates more complete annotation of the genomic sequence. This library should be a useful resource that will help in determining how basic brain functions operate at the molecular level.
Collapse
Affiliation(s)
- Karen L Posey
- Department of Biology and Biochemistry, Institute of Molecular Biology, University of Houston, Houston, TX 77204-5513, USA
| | - Leslie B Jones
- Department of Biology and Biochemistry, Institute of Molecular Biology, University of Houston, Houston, TX 77204-5513, USA
| | - Rosalinda Cerda
- Department of Biology and Biochemistry, Institute of Molecular Biology, University of Houston, Houston, TX 77204-5513, USA
| | - Monica Bajaj
- Department of Biology and Biochemistry, Institute of Molecular Biology, University of Houston, Houston, TX 77204-5513, USA
| | - Thao Huynh
- Department of Biology and Biochemistry, Institute of Molecular Biology, University of Houston, Houston, TX 77204-5513, USA
| | - Paul E Hardin
- Department of Biology and Biochemistry, Institute of Molecular Biology, University of Houston, Houston, TX 77204-5513, USA
| | - Susan H Hardin
- Department of Biology and Biochemistry, Institute of Molecular Biology, University of Houston, Houston, TX 77204-5513, USA
| |
Collapse
|
24
|
Andrews J, Bouffard GG, Cheadle C, Lü J, Becker KG, Oliver B. Gene Discovery Using Computational and Microarray Analysis of Transcription in the Drosophila melanogaster Testis. Genome Res 2000. [DOI: 10.1101/gr.159800] [Citation(s) in RCA: 19] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Identification and annotation of all the genes in the sequencedDrosophila genome is a work in progress. Wild-type testis function requires many genes and is thus of potentially high value for the identification of transcription units. We therefore undertook a survey of the repertoire of genes expressed in the Drosophilatestis by computational and microarray analysis. We generated 3141 high-quality testis expressed sequence tags (ESTs). Testis ESTs computationally collapsed into 1560 cDNA set used for further analysis. Of those, 11% correspond to named genes, and 33% provide biological evidence for a predicted gene. A surprising 47% fail to align with existing ESTs and 16% with predicted genes in the current genome release. EST frequency and microarray expression profiles indicate that the testis mRNA population is highly complex and shows an extended range of transcript abundance. Furthermore, >80% of the genes expressed in the testis showed onefold overexpression relative to ovaries, or gonadectomized flies. Additionally, >3% showed more than threefold overexpression at p <0.05. Surprisingly, 22% of the genes most highly overexpressed in testis matchDrosophila genomic sequence, but not predicted genes. These data strongly support the idea that sequencing additional cDNA libraries from defined tissues, such as testis, will be important tools for refined annotation of the Drosophila genome. Additionally, these data suggest that the number of genes in Drosophila will significantly exceed the conservative estimate of 13,601.[The sequence data described in this paper have been submitted to the dbEST data library under accession nos.AI944400–AI947263 and BE661985–BE662262.][The microarray data described in this paper have been submitted to the GEO data library under accession nos. GPLS, GSM3–GSM10.]
Collapse
|
25
|
Andrews J, Bouffard GG, Cheadle C, Lü J, Becker KG, Oliver B. Gene discovery using computational and microarray analysis of transcription in the Drosophila melanogaster testis. Genome Res 2000; 10:2030-43. [PMID: 11116097 PMCID: PMC313064 DOI: 10.1101/gr.10.12.2030] [Citation(s) in RCA: 164] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Identification and annotation of all the genes in the sequenced Drosophila genome is a work in progress. Wild-type testis function requires many genes and is thus of potentially high value for the identification of transcription units. We therefore undertook a survey of the repertoire of genes expressed in the Drosophila testis by computational and microarray analysis. We generated 3141 high-quality testis expressed sequence tags (ESTs). Testis ESTs computationally collapsed into 1560 cDNA set used for further analysis. Of those, 11% correspond to named genes, and 33% provide biological evidence for a predicted gene. A surprising 47% fail to align with existing ESTs and 16% with predicted genes in the current genome release. EST frequency and microarray expression profiles indicate that the testis mRNA population is highly complex and shows an extended range of transcript abundance. Furthermore, >80% of the genes expressed in the testis showed onefold overexpression relative to ovaries, or gonadectomized flies. Additionally, >3% showed more than threefold overexpression at p <0.05. Surprisingly, 22% of the genes most highly overexpressed in testis match Drosophila genomic sequence, but not predicted genes. These data strongly support the idea that sequencing additional cDNA libraries from defined tissues, such as testis, will be important tools for refined annotation of the Drosophila genome. Additionally, these data suggest that the number of genes in Drosophila will significantly exceed the conservative estimate of 13,601.
Collapse
Affiliation(s)
- J Andrews
- Laboratory of Cellular and Developmental Biology, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland 20892, USA
| | | | | | | | | | | |
Collapse
|
26
|
Wenzl P, Mancilla LI, Rao IM, Mayer JE. Isolation of rare cDNAs by asymmetric self-hybridization. Anal Biochem 2000; 286:303-5. [PMID: 11067756 DOI: 10.1006/abio.2000.4752] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Affiliation(s)
- P Wenzl
- Centro Internacional de Agricultura Tropical (CIAT), Cali, A.A. 6713, Colombia
| | | | | | | |
Collapse
|
27
|
Fuchs B, Zhang K, Bolander ME, Sarkar G. Identification of differentially expressed genes by mutually subtracted RNA fingerprinting. Anal Biochem 2000; 286:91-8. [PMID: 11038279 DOI: 10.1006/abio.2000.4792] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
A mutually subtracted RNA fingerprinting (SuRF) method has been developed that allows efficient identification of differentially expressed sequence tags between two samples. Mutual subtractions of two RNA samples are achieved by first synthesizing cDNAs using oligo(dT) coupled with magnetic beads which are then reciprocally hybridized to starting RNA samples to remove common mRNAs between them. The second step involves differential fingerprinting of the subtracted RNA samples by polymerase chain reaction with specially designed degenerate primers. SuRF was applied to identify alteration in gene expression pertinent to osteogenic sarcoma which was achieved by employing the method between FOB (an immortalized fetal osteoblast) and MG63 (an osteosarcoma) cell lines. An estimated 10% of the total expressed genes in these two cell types were screened by the method. This analysis identified 96 differentially expressed sequences, none of which was identified repeatedly. A subset of these sequences was subsequently confirmed to have differential expression between the two cell types. Removal of common mRNAs prior to differential display should diminish redundant identification of abundant genes and increase the chance of identifying rare differentially expressed genes.
Collapse
Affiliation(s)
- B Fuchs
- Department of Orthopedics, Mayo Clinic and Foundation, Rochester, Minnesota 55905, USA
| | | | | | | |
Collapse
|
28
|
Dimopoulos G, Casavant TL, Chang S, Scheetz T, Roberts C, Donohue M, Schultz J, Benes V, Bork P, Ansorge W, Soares MB, Kafatos FC. Anopheles gambiae pilot gene discovery project: identification of mosquito innate immunity genes from expressed sequence tags generated from immune-competent cell lines. Proc Natl Acad Sci U S A 2000; 97:6619-24. [PMID: 10841561 PMCID: PMC18678 DOI: 10.1073/pnas.97.12.6619] [Citation(s) in RCA: 127] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
Together with AIDS and tuberculosis, malaria is at the top of the list of devastating infectious diseases. However, molecular genetic studies of its major vector, Anopheles gambiae, are still quite limited. We have conducted a pilot gene discovery project to accelerate progress in the molecular analysis of vector biology, with emphasis on the mosquito's antimalarial immune defense. A total of 5,925 expressed sequence tags were determined from normalized cDNA libraries derived from immune-responsive hemocyte-like cell lines. The 3,242 expressed sequence tag-containing cDNA clones were grouped into 2,380 clone clusters, potentially representing unique genes. Of these, 1,118 showed similarities to known genes from other organisms, but only 27 were identical to previously known mosquito genes. We identified 38 candidate genes, based on sequence similarity, that may be implicated in immune reactions including antimalarial defense; 19 of these were shown experimentally to be inducible by bacterial challenge, lending support to their proposed involvement in mosquito immunity.
Collapse
Affiliation(s)
- G Dimopoulos
- European Molecular Biology Laboratory, Meyerhofstrasse 1, 69117 Heidelberg, Germany
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
29
|
Affiliation(s)
- K J Martin
- Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA 02115, USA
| | | |
Collapse
|