1
|
De la Rosa MVG, Patel D, McCann MR, Stringer KA, Rosania GR. Database screening as a strategy to identify endogenous candidate metabolites to probe and assess mitochondrial drug toxicity. Sci Rep 2023; 13:22013. [PMID: 38086883 PMCID: PMC10716408 DOI: 10.1038/s41598-023-49443-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2023] [Accepted: 12/08/2023] [Indexed: 12/18/2023] Open
Abstract
Adverse drug reactions (ADRs) are considered an inherent risk of medication use, and some ADRs have been associated with off-target drug interactions with mitochondria. Metabolites that reflect mitochondrial function may help identify patients at risk of mitochondrial toxicity. We employed a database strategy to identify candidate mitochondrial metabolites that could be clinically useful to identify individuals at increased risk of mitochondrial-related ADRs. This led to L-carnitine being identified as the candidate mitochondrial metabolite. L-carnitine, its acetylated metabolite, acetylcarnitine and other acylcarnitines are mitochondrial biomarkers used to detect inborn errors of metabolism. We hypothesized that changes in L-carnitine disposition, induced by a "challenge test" of intravenous L-carnitine, could identify mitochondrial-related ADRs by provoking variation in L-carnitine and/or acetylcarnitine blood levels. To test this hypothesis, we induced mitochondrial drug toxicity with clofazimine (CFZ) in a mouse model. Following CFZ treatment, mice received an L-carnitine "challenge test". CFZ-induced changes in weight were consistent with previous work and reflect CFZ-induced catabolism. L-carnitine induced differences in whole blood acetylcarnitine concentrations in a manner that was dependent on CFZ treatment. This supports the usefulness of a database strategy for the discovery of candidate metabolite biomarkers of drug toxicity and substantiates the potential of the L-carnitine "challenge test" as a "probe" to identify drug-related toxicological manifestations.
Collapse
Affiliation(s)
- Mery Vet George De la Rosa
- Department of Pharmaceutical Sciences, College of Pharmacy, University of Michigan, Ann Arbor, MI, 48104, USA
| | - Dipali Patel
- Department of Pharmaceutical Sciences, College of Pharmacy, University of Michigan, Ann Arbor, MI, 48104, USA
| | - Marc R McCann
- The NMR Metabolomics Laboratory, Department of Clinical Pharmacy, College of Pharmacy, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Kathleen A Stringer
- The NMR Metabolomics Laboratory, Department of Clinical Pharmacy, College of Pharmacy, University of Michigan, Ann Arbor, MI, 48109, USA
- Division of Pulmonary and Critical Care Medicine, Department of Medicine, School of Medicine, University of Michigan, Ann Arbor, MI, 48109, USA
- Weil Institute for Critical Care Research and Innovation, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Gus R Rosania
- Department of Pharmaceutical Sciences, College of Pharmacy, University of Michigan, Ann Arbor, MI, 48104, USA.
| |
Collapse
|
2
|
Flentje A, Kober KM, Carrico AW, Neilands TB, Flowers E, Heck NC, Aouizerat BE. Minority stress and leukocyte gene expression in sexual minority men living with treated HIV infection. Brain Behav Immun 2018; 70:335-345. [PMID: 29548994 PMCID: PMC5953835 DOI: 10.1016/j.bbi.2018.03.016] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/20/2017] [Revised: 02/20/2018] [Accepted: 03/12/2018] [Indexed: 12/17/2022] Open
Abstract
Sexual minority (i.e., non-heterosexual) individuals experience poorer mental and physical health, accounted for in part by the additional burden of sexual minority stress occurring from being situated in a culture favoring heteronormativity. Informed by previous research, the purpose of this study was to identify the relationship between sexual minority stress and leukocyte gene expression related to inflammation, cancer, immune function, and cardiovascular function. Sexual minority men living with HIV who were on anti-retroviral medication, had viral load < 200 copies/mL, and had biologically confirmed, recent methamphetamine use completed minority stress measures and submitted blood samples for RNA sequencing on leukocytes. Differential gene expression and pathway analyses were conducted comparing those with clinically elevated minority stress (n = 18) and those who did not meet the clinical cutoff (n = 20), covarying reactive urine toxicology results for very recent stimulant use. In total, 90 differentially expressed genes and 138 gene set pathways evidencing 2-directional perturbation were observed at false discovery rate (FDR) < 0.10. Of these, 41 of the differentially expressed genes and 35 of the 2-directionally perturbed pathways were identified as functionally related to hypothesized mechanisms of inflammation, cancer, immune function, and cardiovascular function. The neuroactive-ligand receptor pathway (implicated in cancer development) was identified using signaling pathway impact analysis. Our results suggest several potential biological pathways for future work investigating the relationship between sexual minority stress and health.
Collapse
Affiliation(s)
- Annesa Flentje
- Community Health Systems, School of Nursing, University of California, San Francisco, United States.
| | - Kord M Kober
- Department of Physiological Nursing, School of Nursing, University of California, San Francisco, United States; Institute for Computational Health Sciences, University of California, San Francisco, United States
| | | | - Torsten B Neilands
- Center for AIDS Prevention Studies, Department of Medicine, University of California, San Francisco, United States
| | - Elena Flowers
- Department of Physiological Nursing, School of Nursing, University of California, San Francisco, United States; Institute for Human Genetics, University of California, San Francisco, United States
| | - Nicholas C Heck
- Department of Psychology, Marquette University, United States
| | - Bradley E Aouizerat
- Bluestone Center for Clinical Research, College of Dentistry, New York University, United States
| |
Collapse
|
3
|
Melouane A, Ghanemi A, Aubé S, Yoshioka M, St-Amand J. Differential gene expression analysis in ageing muscle and drug discovery perspectives. Ageing Res Rev 2018; 41:53-63. [PMID: 29102726 DOI: 10.1016/j.arr.2017.10.006] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2017] [Revised: 10/31/2017] [Accepted: 10/31/2017] [Indexed: 12/12/2022]
Abstract
Identifying therapeutic target genes represents the key step in functional genomics-based therapies. Within this context, the disease heterogeneity, the exogenous factors and the complexity of genomic structure and function represent important challenges. The functional genomics aims to overcome such obstacles via identifying the gene functions and therefore highlight disease-causing genes as therapeutic targets. Genomic technologies promise to reshape the research on ageing muscle, exercise response and drug discovery. Herein, we describe the functional genomics strategies, mainly differential gene expression methods microarray, serial analysis of gene expression (SAGE), massively parallel signature sequence (MPSS), RNA sequencing (RNA seq), representational difference analysis (RDA), and suppression subtractive hybridization (SSH). Furthermore, we review these illustrative approaches that have been used to discover new therapeutic targets for some complex diseases along with the application of these tools to study the modulation of the skeletal muscle transcriptome.
Collapse
|
4
|
Guo Y. Towards systems biological understanding of leaf senescence. PLANT MOLECULAR BIOLOGY 2013; 82:519-28. [PMID: 23065109 DOI: 10.1007/s11103-012-9974-2] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/29/2012] [Accepted: 09/20/2012] [Indexed: 05/22/2023]
Abstract
The application of systems biology approaches has greatly facilitated the process of deciphering the molecular mechanisms underlying leaf senescence. Analyses of the leaf senescence transcriptome have identified some of the major biochemical events during senescence including protein degradation and nutrient remobilization. Proteomic studies have confirmed these findings and have suggested up-regulated energy metabolism during leaf senescence which might be important for cell viability maintenance. As a critical part of systems biology, studies involving transcription regulation networking and senescence-inducing signaling have deepened our understanding on the molecular regulation of leaf senescence. The important next steps towards a systems biological understanding of leaf senescence will be discussed.
Collapse
Affiliation(s)
- Yongfeng Guo
- Tobacco Research Institute, Chinese Academy of Agricultural Sciences, Qingdao, 266101, China.
| |
Collapse
|
5
|
Yamamoto N, Daito M, Hiyama K, Ding J, Nakabayashi K, Otomo Y, Tsujimoto M, Matsuura N, Kato Y. An optimal mRNA marker for OSNA (One-step nucleic acid amplification) based lymph node metastasis detection in colorectal cancer patients. Jpn J Clin Oncol 2013; 43:264-70. [PMID: 23293371 DOI: 10.1093/jjco/hys227] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023] Open
Abstract
BACKGROUND We previously reported that the one-step nucleic acid amplification assay is effective for lymph node metastasis detection in breast cancer patients. This paper describes the identification of CK19 mRNA as an optimal marker and its cut-off value for use in the detection of one-step nucleic acid amplification-based lymph node metastasis in colorectal cancer patients. METHODS Candidate mRNA markers selected from the genome-wide expressed sequence tag database were evaluated by quantitative RT-PCR using a mixture of metastasis-positive and another mixture of metastasis-negative lymph nodes (n = 5 each), followed by quantitative RT-PCR using metastasis-positive and -negative lymph nodes (n = 10 each) from 20 patients. The one-step nucleic acid amplification assay for mRNA markers selected above was examined using 28 positive lymph nodes from 19 patients and 38 negative lymph nodes from the 11 pN0 patients. RESULTS Quantitative RT-PCR analyses of the 98 mRNAs selected from the genome-wide expressed sequence tag database and the subsequent quantitative RT-PCR analyses of the nine mRNAs selected above indicated that CK19 and CEA mRNAs have the highest capability for distinguishing between positive and negative lymph nodes. CK19, CEA and CK20 mRNAs were evaluated by the one-step nucleic acid amplification assay. An area under a receiver-operating-characteristic curve for CK19 mRNA (0.999) was slightly larger than that for CEA mRNA (0.946; P = 0.062) and significantly larger that than for CK20 mRNA (0.875; P = 0.006). CONCLUSION We found that CK19 mRNA has the best diagnostic performance and its cut-off value for discriminating positive from negative lymph nodes can be set in the range of 75-500 copies/µl with 96.4% sensitivity and 100% specificity.
Collapse
Affiliation(s)
- Noriko Yamamoto
- Central Research Laboratories, Sysmex Corporation, 4-4-4 Takatsukadai, Kobe, Hyougo, Japan.
| | | | | | | | | | | | | | | | | |
Collapse
|
6
|
Wu TH, Chu LJ, Wang JC, Chen TW, Tien YJ, Lin WC, Ng WV. Meta-analytical biomarker search of EST expression data reveals three differentially expressed candidates. BMC Genomics 2012; 13 Suppl 7:S12. [PMID: 23282184 PMCID: PMC3521215 DOI: 10.1186/1471-2164-13-s7-s12] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Background Researches have been conducted for the identification of differentially expressed genes (DEGs) by generating and mining of cDNA expressed sequence tags (ESTs) for more than a decade. Although the availability of public databases make possible the comprehensive mining of DEGs among the ESTs from multiple tissue types, existing studies usually employed statistics suitable only for two categories. Multi-class test has been developed to enable the finding of tissue specific genes, but subsequent search for cancer genes involves separate two-category test only on the ESTs of the tissue of interest. This constricts the amount of data used. On the other hand, simple pooling of cancer and normal genes from multiple tissue types runs the risk of Simpson's paradox. Here we presented a different approach which searched for multi-cancer DEG candidates by analyzing all pertinent ESTs in all categories and narrowing down the cancer biomarker candidates via integrative analysis with microarray data and selection of secretory and membrane protein genes as well as incorporation of network analysis. Finally, the differential expression patterns of three selected cancer biomarker candidates were confirmed by real-time qPCR analysis. Results Seven hundred and twenty three primary DEG candidates (p-value < 0.05 and lower bound of confidence interval of odds ratio ≧ 1.65) were selected from a curated EST database with the application of Cochran-Mantel-Haenszel statistic (CMH). GeneGO analysis results indicated this set as neoplasm enriched. Cross-examination with microarray data further narrowed the list down to 235 genes, among which 96 had membrane or secretory annotations. After examined the candidates in protein interaction network, public tissue expression databases, and literatures, we selected three genes for further evaluation by real-time qPCR with eight major normal and cancer tissues. The higher-than-normal tissue expression of COL3A1, DLG3, and RNF43 in some of the cancer tissues is in agreement with our in silico predictions. Conclusions Searching digitized transcriptome using CMH enabled us to identify multi-cancer differentially expressed gene candidates. Our methodology demonstrated simultaneously analysis for cancer biomarkers of multiple tissue types with the EST data. With the revived interest in digitizing the transcriptomes by NGS, cancer biomarkers could be more precisely detected from the ESTs. The three candidates identified in this study, COL3A1, DLG3, and RNF43, are valuable targets for further evaluation with a larger sample size of normal and cancer tissue or serum samples.
Collapse
Affiliation(s)
- Timothy H Wu
- Institute of Biomedical Informatics, National Yang Ming University, Taipei, Taiwan, ROC
| | | | | | | | | | | | | |
Collapse
|
7
|
Lee YJ, Hsieh WY, Chen LY, Li C. Protein arginine methylation of SERBP1 by protein arginine methyltransferase 1 affects cytoplasmic/nuclear distribution. J Cell Biochem 2012; 113:2721-8. [PMID: 22442049 DOI: 10.1002/jcb.24151] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023]
Abstract
Protein arginine methylation regulates a broad array of cellular processes. SERBP1 implicated in tumor progression through its putative involvement in the plaminogen activator protease cascade, is an RNA-binding protein containing an RG-rich domain and an RGG box domain that might be methylated by protein arginine N-methyltransferases (PRMTs). Asymmetric dimethylarginine (aDMA) was detected in SERBP1 and an indirect methyltransferase inhibitor adenosine dialdehyde (AdOx) significantly reduced the methylation signals. Arginines in the middle RG and C-terminal RGG region of SERBP1 are methylated based on the analyses of different deletion constructs. The predominant type I protein arginine methyltransferase PRMT1 co-immunoprecipitated with SERBP1 and the level of bound PRMT1 decreased upon the addition of AdOx. Recombinant PRMT1 methylated SERBP1 and knockdown of PRMT1 significantly reduced the aDMA level of SERBP1, indicating that SERBP1 is specifically methylated by PRMT1. Immunofluorescent analyses of endogenous SERBP1 showed predominant cytoplasmic localization of SERBP1. Treatment of AdOx or PRMT1 siRNA increased the nuclear localization of SERBP1. Analyses of different deletions indicated that the middle RG region is important for the nuclear localization while both N- and C- terminus are required for nuclear export. Low methylation of the C-terminal RGG region also favors nuclear localization. In conclusion, the RG-rich and RGG box of SERBP1 is asymmetrically dimethylated by PRMT1 and the modification affects protein interaction and intracellular localization of the protein. These findings provide the basis for dissecting the roles of SERBP1.
Collapse
Affiliation(s)
- Yu-Jen Lee
- Institute of Biochemistry and Biotechnology, Chung Shan Medical University, Taichung, Taiwan
| | | | | | | |
Collapse
|
8
|
Phylogenetic and functional analyses of the cytochrome P450 family 4. Mol Phylogenet Evol 2012; 62:458-71. [DOI: 10.1016/j.ympev.2011.10.016] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2011] [Revised: 10/11/2011] [Accepted: 10/22/2011] [Indexed: 01/08/2023]
|
9
|
He T, Wang Q, Feng G, Hu Y, Wang L, Wang Y. Computational detection and functional analysis of human tissue-specific A-to-I RNA editing. PLoS One 2011; 6:e18129. [PMID: 21448465 PMCID: PMC3063316 DOI: 10.1371/journal.pone.0018129] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2010] [Accepted: 02/25/2011] [Indexed: 12/13/2022] Open
Abstract
A-to-I RNA editing is a widespread post-transcriptional modification event in vertebrates. It could increase transcriptome and proteome diversity through recoding the genomic information and cross-linking other regulatory events, such as those mediated by alternative splicing, RNAi and microRNA (miRNA). Previous studies indicated that RNA editing can occur in a tissue-specific manner in response to the requirements of the local environment. We set out to systematically detect tissue-specific A-to-I RNA editing sites in 43 human tissues using bioinformatics approaches based on the Fisher's exact test and the Benjamini & Hochberg false discovery rate (FDR) multiple testing correction. Twenty-three sites in total were identified to be tissue-specific. One of them resulted in an altered amino acid residue which may prevent the phosphorylation of PARP-10 and affect its activity. Eight and two tissue-specific A-to-I RNA editing sites were predicted to destroy putative exonic splicing enhancers (ESEs) and exonic splicing silencers (ESSs), respectively. Brain-specific and ovary-specific A-to-I RNA editing sites were further verified by comparing the cDNA sequences with their corresponding genomic templates in multiple cell lines from brain, colon, breast, bone marrow, lymph, liver, ovary and kidney tissue. Our findings help to elucidate the role of A-to-I RNA editing in the regulation of tissue-specific development and function, and the approach utilized here can be broadened to study other types of tissue-specific substitution editing.
Collapse
Affiliation(s)
- Tao He
- Beijing Institute of Biotechnology, Beijing, China
| | - Qiong Wang
- Beijing Institute of Radiation Medicine, Beijing, China
| | - Guihai Feng
- Beijing Institute of Biotechnology, Beijing, China
| | - Yaou Hu
- Beijing Institute of Biotechnology, Beijing, China
| | - Li Wang
- Beijing Institute of Biotechnology, Beijing, China
- * E-mail: (LW); (YW)
| | - Yumin Wang
- Beijing Institute of Biotechnology, Beijing, China
- * E-mail: (LW); (YW)
| |
Collapse
|
10
|
Yaguchi Y, Sugasawa H, Tsujimoto H, Takata H, Nakabayashi K, Ichikura T, Ono S, Hiraki S, Sakamoto N, Horio T, Kumano I, Otomo Y, Mochizuki H, Yamamoto J, Hase K. One-step nucleic acid amplification (OSNA) for the application of sentinel node concept in gastric cancer. Ann Surg Oncol 2011; 18:2289-96. [PMID: 21301968 DOI: 10.1245/s10434-011-1591-9] [Citation(s) in RCA: 68] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2010] [Indexed: 12/19/2022]
Abstract
PURPOSE This study was designed to apply safely the sentinel node navigation surgery (SNNS) to the malignancies, an accurate and prompt intraoperative diagnosis of SN is essential, and micrometastasis has been frequently missed by conventional frozen sections. Recently, a novel molecular-based rapid diagnosis for the lymph node (LN) metastases has been developed using (OSNA) in breast cancer, which takes approximately 30 min to obtain a final result. We evaluated the efficacy of OSNA in terms of the intraoperative diagnosis of LN metastasis in patients with gastric cancer. METHODS A total of 162 LNs dissected from 32 patients with gastric cancer was included in this study; 45 LNs were pathologically diagnosed as metastatic LNs and 117 LNs were negative. The LNs were bisected; halves were examined with H&E stain, and the opposite halves were subjected to OSNA analyses of CK19 mRNA. The CK19 mRNA expression was examined in the positive or negative metastatic LNs, and the correlation between the tumor volume and CK19 mRNA expression in the metastatic LNs was examined. RESULTS The CK19 mRNA expressions in the positive metastatic LNs were significantly higher than those of negative LNs. When 250 copies/μl was set as a cutoff value, the concordance rate was 94.4%, the sensitivity was 88.9%, and the specificity was 96.6%. The OSNA expression was significantly correlated with the estimated tumor volumes in the metastatic LNs. CONCLUSIONS The OSNA method is feasible and acceptable for detecting LN metastases in patients with gastric cancer. This should be applied for the intraoperative diagnosis in the SN-navigation surgery in gastric cancer.
Collapse
Affiliation(s)
- Yoshihisa Yaguchi
- Department of Surgery, National Defense Medical College, Tokorozawa, Saitama, Japan
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
11
|
Generation and Analysis of Expressed Sequence Tags from Olea europaea L. Comp Funct Genomics 2010; 2010:757512. [PMID: 21197085 PMCID: PMC3004401 DOI: 10.1155/2010/757512] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2010] [Accepted: 10/13/2010] [Indexed: 11/18/2022] Open
Abstract
Olive (Olea europaea L.) is an important source of edible oil which was originated in Near-East region. In this study, two cDNA libraries were constructed from young olive leaves and immature olive fruits for generation of ESTs to discover the novel genes and search the function of unknown genes of olive. The randomly selected 3840 colonies were sequenced for EST collection from both libraries. Readable 2228 sequences for olive leaf and 1506 sequences for olive fruit were assembled into 205 and 69 contigs, respectively, whereas 2478 were singletons. Putative functions of all 2752 differentially expressed unique sequences were designated by gene homology based on BLAST and annotated using BLAST2GO. While 1339 ESTs show no homology to the database, 2024 ESTs have homology (under 80%) with hypothetical proteins, putative proteins, expressed proteins, and unknown proteins in NCBI-GenBank. 635 EST's unique genes sequence have been identified by over 80% homology to known function in other species which were not previously described in Olea family. Only 3.1% of total EST's was shown similarity with olive database existing in NCBI. This generated EST's data and consensus sequences were submitted to NCBI as valuable source for functional genome studies of olive.
Collapse
|
12
|
Valletti A, Anselmo A, Mangiulli M, Boria I, Mignone F, Merla G, D'Angelo V, Tullo A, Sbisà E, D'Erchia AM, Pesole G. Identification of tumor-associated cassette exons in human cancer through EST-based computational prediction and experimental validation. Mol Cancer 2010; 9:230. [PMID: 20813049 PMCID: PMC2941758 DOI: 10.1186/1476-4598-9-230] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2010] [Accepted: 09/02/2010] [Indexed: 12/22/2022] Open
Abstract
BACKGROUND Many evidences report that alternative splicing, the mechanism which produces mRNAs and proteins with different structures and functions from the same gene, is altered in cancer cells. Thus, the identification and characterization of cancer-specific splice variants may give large impulse to the discovery of novel diagnostic and prognostic tumour biomarkers, as well as of new targets for more selective and effective therapies. RESULTS We present here a genome-wide analysis of the alternative splicing pattern of human genes through a computational analysis of normal and cancer-specific ESTs from seventeen anatomical groups, using data available in AspicDB, a database resource for the analysis of alternative splicing in human. By using a statistical methodology, normal and cancer-specific genes, splice sites and cassette exons were predicted in silico. The condition association of some of the novel normal/tumoral cassette exons was experimentally verified by RT-qPCR assays in the same anatomical system where they were predicted. Remarkably, the presence in vivo of the predicted alternative transcripts, specific for the nervous system, was confirmed in patients affected by glioblastoma. CONCLUSION This study presents a novel computational methodology for the identification of tumor-associated transcript variants to be used as cancer molecular biomarkers, provides its experimental validation, and reports specific biomarkers for glioblastoma.
Collapse
Affiliation(s)
- Alessio Valletti
- Dipartimento di Biochimica e Biologia Molecolare, University of Bari, Quagliariello, via Orabona 4, Bari 70126, Italy
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
13
|
Yu F, Chen MH, Kuo L, Huang P, Yang W. Bayesian hierarchical modeling and selection of differentially expressed genes for the EST data. Biometrics 2010; 67:142-50. [PMID: 20560937 DOI: 10.1111/j.1541-0420.2010.01447.x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Expressed sequence tag (EST) sequencing is a one-pass sequencing reading of cloned cDNAs derived from a certain tissue. The frequency of unique tags among different unbiased cDNA libraries is used to infer the relative expression level of each tag. In this article, we propose a hierarchical multinomial model with a nonlinear Dirichlet prior for the EST data with multiple libraries and multiple types of tissues. A novel hierarchical prior is developed and the properties of the proposed prior are examined. An efficient Markov chain Monte Carlo algorithm is developed for carrying out the posterior computation. We also propose a new selection criterion for detecting which genes are differentially expressed between two tissue types. Our new method with the new gene selection criterion is demonstrated via several simulations to have low false negative and false positive rates. A real EST data set is used to motivate and illustrate the proposed method.
Collapse
Affiliation(s)
- Fang Yu
- Department of Biostatistics, University of Nebraska Medical Center, Omaha, Nebraska 68198-4350,
| | | | | | | | | |
Collapse
|
14
|
Lim D, Lee SH, Cho YM, Yoon D, Shin Y, Kim KW, Park HS, Kim H. Transcript profiling of expressed sequence tags from intramuscular fat, longissimus dorsi muscle and liver in Korean cattle (Hanwoo). BMB Rep 2010; 43:115-21. [PMID: 20193130 DOI: 10.5483/bmbrep.2010.43.2.115] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
A large data set of Hanwoo (Korean cattle) ESTs was analyzed to obtain differential gene expression results for the following three libraries: intramuscular fat, longissimus dorsi muscle and liver. To better understand the gene expression profiles, we identified differentially expressed genes (DEGs) via digital gene expression analysis. Hierarchical clustering of genes was performed according to their relative abundance within the six separate groups (Hanwoo fat versus non-Hanwoo fat, Hanwoo muscle versus non-Hanwoo muscle and Hanwoo liver versus non-Hanwoo liver), producing detailed patterns of gene expression. We determined the quantitative traits associated with the highly expressed genes. We also provide the first list of putative regulatory elements associated with differential tissue expression in Hanwoo cattle. In addition, we conducted evolutionary analysis that suggests a subset of genes accelerated in the bovine lineage are strongly correlated with their expression in Hanwoo muscle.
Collapse
Affiliation(s)
- Dajeong Lim
- Division of Animal Genomics and Bioinformatics, National Institute of Animal Science, Rural Development Administration, Suwon, Korea
| | | | | | | | | | | | | | | |
Collapse
|
15
|
Liu Y, Guo F, Dai M, Wang D, Tong Y, Huang J, Hu J, Li G. Gammaaminobutyric acid A receptor alpha 3 subunit is overexpressed in lung cancer. Pathol Oncol Res 2010; 15:351-8. [PMID: 19048400 DOI: 10.1007/s12253-008-9128-7] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/07/2008] [Accepted: 11/10/2008] [Indexed: 10/21/2022]
Abstract
The identification of tumor-associated antigens, which are specifically expressed in cancer tissues, is very important for immunotherapy of lung cancer. We have combined the in silico screening and experimental verifying to identify genes that are differently expressed in cancers compared with their corresponding normal tissues. Using these methods, we have identified that GABRA3 gene was overexpressed in lung cancer and rarely expressed in other cancers. Furthermore, GABRA3 protein expression was significantly higher in the lower grade of lung cancer. It may compose functional GABA-gated channel with other subunits. This study demonstrated GABRA3 could be a potential biomarker for diagnosis of lung cancer, and GABAA receptors may play an important role in cancer differentiation.
Collapse
Affiliation(s)
- Yan Liu
- Tumor Immunobiology Laboratory of Cancer Research Institute, Central South University, Changsha, Hunan Province 410078, China
| | | | | | | | | | | | | | | |
Collapse
|
16
|
Liu Y, Li Y, Guo F, Dai M, Wang J, Hu J, Li G. Identification of HTA as a novel-specific marker for human hepatocellular carcinoma. J Cancer Res Clin Oncol 2010; 136:1187-92. [PMID: 20130911 DOI: 10.1007/s00432-010-0767-1] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2008] [Accepted: 01/13/2010] [Indexed: 01/17/2023]
Abstract
BACKGROUND Hepatocellular carcinoma (HCC) is the most common malignancy in the world, especially in China. Early diagnosis of new and recurrent hepatocellular carcinoma, followed by timely treatment, will help decrease mortality. Currently biomarkers are not satisfactory. Better diagnostic methods are highly demanded. METHODS In this study, we have used in silico identification and RT-PCR test and discovered a hepatoma associated gene (HTA). Knockdown of endogenous HTA expression was performed by small interfering RNA in malignant hepatocyte HepG2. Then we tested the cell proliferative ability of these cells in vitro and in vivo. RESULTS HTA was expressed specifically in some kinds of tumors, but not detected in any normal tissues. It was expressed especially high in hepatocellular carcinoma. Knockdown of endogenous HTA expression in HepG2 by small interfering RNA attenuated HCC cell growth. CONCLUSIONS HCA is a very good marker for tumors, especially for HCC. It could play important roles in HCC development and progression and can be a promising molecular target for the development of new diagnostic and therapeutic strategies for HCC.
Collapse
Affiliation(s)
- Yan Liu
- Tumor Immunobiology Laboratory of Cancer Research Institute, Central South University, 410078, Changsha, Hunan, China
| | | | | | | | | | | | | |
Collapse
|
17
|
Abstract
Originally established in the beginning of the 1990s as a direct route to gene finding, expressed sequence tags (ESTs) still lend themselves as a means to analyze gene expression in almost all human tissues. The type of questions that can be addressed using public EST libraries ranges from tissue-specific gene profiling to the comparison between tissues in diseased and healthy states. Thanks to a multitude of web-based online bioinformatics resources, mining in EST libraries is not restricted to experts in the field of data analysis, but can readily be performed by the medical or life scientist. In this chapter, a couple of cases studies are presented that guide the scientist to the most useful online resources so that they can conduct their own research.
Collapse
Affiliation(s)
- Armin O Schmitt
- Institute for Animal Sciences, Humboldt-Universität zu Berlin, Berlin, Germany
| |
Collapse
|
18
|
[Novel prognostic marker in invasive breast cancer. ITIH5 expression is abrogated by aberrant promoter methylation]. DER PATHOLOGE 2009; 29 Suppl 2:338-46. [PMID: 18810445 DOI: 10.1007/s00292-008-1044-9] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Abstract
We have recently characterized ITIH5 as a new extracellular matrix protein that exhibits clear expression loss in a variety of human tumour entities, including breast cancer. The aim of the present study was to decipher the molecular cause of ITIH5 expression loss in breast cancer and to learn more about the possible role of this molecule in cancer diseases. ITIH5 protein expression was found to be strongly reduced in 42% of invasive breast carcinomas-interestingly, with significant association with poor patient outcome. ITIH5 promoter methylation was frequently detected in breast cell lines and in primary carcinomas (40%), and it was functionally correlated with loss of ITIH5 mRNA expression. Moreover, ITIH5 promoter methylation was also significantly associated with poor clinical patient outcome and also with the occurrence of lymph node and distant metastases. In conclusion, we propose that ITIH5 may represent a novel metastasis repressor in human breast cancer. Both ITIH5 protein expression and ITIH5 promoter methylation may serve as prognostic biomarkers, thereby helping improve clinical patient outcome.
Collapse
|
19
|
A novel approach to detect differentially expressed genes from count-based digital databases by normalizing with housekeeping genes. Genomics 2009; 94:211-6. [PMID: 19446020 DOI: 10.1016/j.ygeno.2009.05.003] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2009] [Revised: 05/04/2009] [Accepted: 05/06/2009] [Indexed: 01/26/2023]
Abstract
Sequence tag count-based gene expression analysis is potent for the identification of candidate genes relevant to the cancerous phenotype. With the public availability of count-based data, the computational approaches for differentially expressed genes, which are mainly based on Binomial or beta-Binomial distribution, become practical and important in cancer biology. It remains a permanent need to select a proper statistical model for these methods. In this study, we developed a novel Bayesian algorithm-based method, Electronic Differential Gene Expression Screener (EDGES), in which a statistical model was determined by geometric averaging of 12 common housekeeping genes. EDGES identified a set of differentially expressed genes in lung, breast and colorectal cancers by using publically available Serial Analysis of Gene Expression (SAGE) and Expressed Sequence Tag (EST data). Gene expression microarray analysis and quantitative reverse transcription real-time PCR demonstrated the effectiveness of this procedure. We conclude that current normalization of calibrators provides a new insight into count-based digital subtraction in cancer research.
Collapse
|
20
|
de Souza SJ. Exploiting ESTs in human health. Methods Mol Biol 2009; 533:311-324. [PMID: 19277565 DOI: 10.1007/978-1-60327-136-3_15] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]
Abstract
Expressed Sequence Tags (ESTs) are fragments of cDNA clones. They correspond to the most abundant type of cDNA information available in the public databases. ESTs have been used for expression profiling, gene identification, characterization of differentially expressed genes, and identification of transcript variants among other utilities. In this review I will discuss the major features of the collection of ESTs available in the public domain giving a special emphasis on how this dataset has been used in studies about human diseases.
Collapse
|
21
|
Zafrakas M, Losen I, Knüchel R, Dahl E. Enhancer of the rudimentary gene homologue (ERH) expression pattern in sporadic human breast cancer and normal breast tissue. BMC Cancer 2008; 8:145. [PMID: 18500978 PMCID: PMC2426700 DOI: 10.1186/1471-2407-8-145] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2008] [Accepted: 05/23/2008] [Indexed: 11/23/2022] Open
Abstract
Background The human gene ERH (Enhancer of the Rudimentary gene Homologue) has previously been identified by in silico analysis of four million ESTs as a gene differentially expressed in breast cancer. The biological function of ERH protein has not been fully elucidated, however functions in cell cycle progression, pyrimidine metabolism a possible interaction with p21(Cip1/Waf1) via the Ciz1 zinc finger protein have been suggested. The aim of the present study was a systematic characterization of ERH expression in human breast cancer in order to evaluate possible clinical applications of this molecule. Methods The expression pattern of ERH was analyzed using multiple tissue northern blots (MTN) on a panel of 16 normal human tissues and two sets of malignant/normal breast and ovarian tissue samples. ERH expression was further analyzed in breast cancer and normal breast tissues and in tumorigenic as well as non-tumorigenic breast cancer cell lines, using quantitative RT-PCR and non-radioisotopic in situ hybridization (ISH). Results Among normal human tissues, ERH expression was most abundant in testis, heart, ovary, prostate, and liver. In the two MTN sets of malignant/normal breast and ovarian tissue,ERH was clearly more abundantly expressed in all tumours than in normal tissue samples. Quantitative RT-PCR analyses showed that ERH expression was significantly more abundant in tumorigenic than in non-tumorigenic breast cancer cell lines (4.5-fold; p = 0.05, two-tailed Mann-Whitney U-test); the same trend was noted in a set of 25 primary invasive breast cancers and 16 normal breast tissue samples (2.5-fold; p = 0.1). These findings were further confirmed by non-radioisotopic ISH in human breast cancer and normal breast tissue. Conclusion ERH expression is clearly up-regulated in malignant as compared with benign breast cells both in primary human breast cancer and in cell models of breast cancer. Since similar results were obtained for ovarian cancer, ERH overexpression may be implicated in the initiation and/or progression of certain human malignancies. Further studies on large breast cancer tissue cohorts should determine whether ERH could function as a prognostic factor or even a drug target in the treatment of human breast cancer.
Collapse
Affiliation(s)
- Menelaos Zafrakas
- Molecular Oncology Group, Institute of Pathology, University Hospital of RWTH Aachen, Aachen, Germany.
| | | | | | | |
Collapse
|
22
|
Helftenbein G, Koslowski M, Dhaene K, Seitz G, Sahin U, Türeci O. In silico strategy for detection of target candidates for antibody therapy of solid tumors. Gene 2008; 414:76-84. [PMID: 18358640 DOI: 10.1016/j.gene.2008.02.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2007] [Revised: 02/05/2008] [Accepted: 02/13/2008] [Indexed: 10/22/2022]
Abstract
In contrast to earlier attempts for the identification of target candidates suitable for monoclonal antibody (mAb) based cancer therapies we concentrated on highly selective lineage-specific genes additionally preserved or even overexpressed in orthotopic cancers. In a script aided workflow we reduced all human entries of the RefSeq mRNA database to those encoding transmembrane domain bearing gene products and subjected them to BLAST analysis against the human EST database. All BLAST results were validated in a gene centric way allowing two types of data curation prior to expression profiling of matching ESTs in selected healthy tissues: (i) exclusion of questionable ESTs arising e.g. from genomic contamination and (ii) elimination of erroneously predicted mRNAs as well as transcripts with only weak EST coverage. The impact of such stringent input control on accuracy of prediction is underlined by RT-PCR confirmation of predicted tissue distribution patterns for a number of selected candidates.
Collapse
|
23
|
Lindlöf A, Bräutigam M, Chawade A, Olsson O, Olsson B. Evaluation of combining several statistical methods with a flexible cutoff for identifying differentially expressed genes in pairwise comparison of EST sets. Bioinform Biol Insights 2008; 2:215-37. [PMID: 19812778 PMCID: PMC2735943 DOI: 10.4137/bbi.s431] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open
Abstract
The detection of differentially expressed genes from EST data is of importance for the discovery of potential biological or pharmaceutical targets, especially when studying biological processes in less characterized organisms and where large-scale microarrays are not an option. We present a comparison of five different statistical methods for identifying up-regulated genes through pairwise comparison of EST sets, where one of the sets is generated from a treatment and the other one serves as a control. In addition, we specifically address situations where the sets are relatively small (~2,000–10,000 ESTs) and may differ in size. The methods were tested on both simulated and experimentally derived data, and compared to a collection of cold stress induced genes identified by microarrays. We found that combining the method proposed by Audic and Claverie with Fisher’s exact test and a method based on calculating the difference in relative frequency was the best combination for maximizing the detection of up-regulated genes. We also introduced the use of a flexible cutoff, which takes the size of the EST sets into consideration. This could be considered as an alternative to a static cutoff. Finally, the detected genes showed a low overlap with those identified by microarrays, which indicates, as in previous studies, low overall concordance between the two platforms.
Collapse
Affiliation(s)
- Angelica Lindlöf
- School of Humanities and Informatics, University of Skövde, Box 408, 541 28 Skövde, Sweden.
| | | | | | | | | |
Collapse
|
24
|
Salzburger W, Renn SCP, Steinke D, Braasch I, Hofmann HA, Meyer A. Annotation of expressed sequence tags for the East African cichlid fish Astatotilapia burtoni and evolutionary analyses of cichlid ORFs. BMC Genomics 2008; 9:96. [PMID: 18298844 PMCID: PMC2279125 DOI: 10.1186/1471-2164-9-96] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2007] [Accepted: 02/25/2008] [Indexed: 11/13/2022] Open
Abstract
Background The cichlid fishes in general, and the exceptionally diverse East African haplochromine cichlids in particular, are famous examples of adaptive radiation and explosive speciation. Here we report the collection and annotation of more than 12,000 expressed sequence tags (ESTs) generated from three different cDNA libraries obtained from the East African haplochromine cichlid species Astatotilapia burtoni and Metriaclima zebra. Results We first annotated more than 12,000 newly generated cichlid ESTs using the Gene Ontology classification system. For evolutionary analyses, we combined these ESTs with all available sequence data for haplochromine cichlids, which resulted in a total of more than 45,000 ESTs. The ESTs represent a broad range of molecular functions and biological processes. We compared the haplochromine ESTs to sequence data from those available for other fish model systems such as pufferfish (Takifugu rubripes and Tetraodon nigroviridis), trout, and zebrafish. We characterized genes that show a faster or slower rate of base substitutions in haplochromine cichlids compared to other fish species, as this is indicative of a relaxed or reinforced selection regime. Four of these genes showed the signature of positive selection as revealed by calculating Ka/Ks ratios. Conclusion About 22% of the surveyed ESTs were found to have cichlid specific rate differences suggesting that these genes might play a role in lineage specific characteristics of cichlids. We also conclude that the four genes with a Ka/Ks ratio greater than one appear as good candidate genes for further work on the genetic basis of evolutionary success of haplochromine cichlid fishes.
Collapse
Affiliation(s)
- Walter Salzburger
- Lehrstuhl für Zoologie und Evolutionsbiologie, Department of Biology, University of Konstanz, 78467 Konstanz, Germany.
| | | | | | | | | | | |
Collapse
|
25
|
Dahl E, Kristiansen G, Gottlob K, Klaman I, Ebner E, Hinzmann B, Hermann K, Pilarsky C, Dürst M, Klinkhammer-Schalke M, Blaszyk H, Knuechel R, Hartmann A, Rosenthal A, Wild PJ. Molecular profiling of laser-microdissected matched tumor and normal breast tissue identifies karyopherin alpha2 as a potential novel prognostic marker in breast cancer. Clin Cancer Res 2007; 12:3950-60. [PMID: 16818692 DOI: 10.1158/1078-0432.ccr-05-2090] [Citation(s) in RCA: 130] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
PURPOSE The aim of the present study was to identify human genes that might prove useful in the diagnosis and therapy of primary breast cancer. EXPERIMENTAL DESIGN Twenty-four matched pairs of invasive ductal breast cancer and corresponding benign breast tissue were investigated by a combination of laser microdissection and gene expression profiling. Differential expression of candidate genes was validated by dot blot analysis of cDNA in 50 pairs of matching benign and malignant breast tissue. Cellular expression of candidate genes was further validated by RNA in situ hybridization, quantitative reverse transcription-PCR, and immunohistochemistry using tissue microarray analysis of 272 nonselected breast cancers. Multivariate analysis of factors on overall survival and recurrence-free survival was done. RESULTS Fifty-four genes were found to be up-regulated and 78 genes were found to be down-regulated. Dot blot analysis reduced the number of up-regulated genes to 15 candidate genes that showed at least a 2-fold overexpression in >15 of 50 (30%) tumor/normal pairs. We selected phosphatidic acid phosphatase type 2 domain containing 1A (PPAPDC1A) and karyopherin alpha2 (KPNA2) for further validation. PPAPDC1A and KPNA2 RNA was up-regulated (fold change >2) in 84% and 32% of analyzed tumor/normal pairs, respectively. Nuclear protein expression of KPNA2 was significantly associated with shorter overall survival and recurrence-free survival. Testing various multivariate Cox regression models, KPNA2 expression remained a highly significant, independent and adverse risk factor for overall survival. CONCLUSIONS Gene expression profiling of laser-microdissected breast cancer tissue revealed novel genes that may represent potential molecular targets for breast cancer therapy and prediction of outcome.
Collapse
Affiliation(s)
- Edgar Dahl
- Institute of Pathology, University Hospital of the RWTH Aachen, Aachen, Germany.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
26
|
Kegel J, Allen MJ, Metfies K, Wilson WH, Wolf-Gladrow D, Valentin K. Pilot study of an EST approach of the coccolithophorid Emiliania huxleyi during a virus infection. Gene 2007; 406:209-16. [PMID: 18035506 DOI: 10.1016/j.gene.2007.10.007] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2007] [Revised: 09/20/2007] [Accepted: 10/04/2007] [Indexed: 10/22/2022]
Abstract
Blooms of the coccolithophorid Emiliania huxleyi can be infected by viruses, which can lead to bloom-termination. This pilot study used an expressed sequence tag (EST) approach to get a first view of gene-expression changes that occur during viral infection of E. huxleyi. cDNA libraries were constructed from uninfected cultures and 6, 12, and 24 h after infection with E. huxleyi-specific virus 86 (EhV-86). From each library 60-90 ESTs were randomly selected and annotated manually with PhyloGena. Viral genes were identified using BLAST-Search of the known viral genome. The data of this study show, that 6 h after viral infection the algal transcriptome changed significantly although few viral transcripts were present. At this point, changes mainly concerned transcripts related to photosynthesis and protein metabolism. However, after 24 h viral transcripts were most abundant. Viral transcripts found at this stage of viral infection encode proteins involved in protein degradation, nucleic acid degradation, transcription and replication.
Collapse
Affiliation(s)
- Jessica Kegel
- Alfred Wegener Institute for Polar and Marine Research, Am Handelshafen 12, D-27570 Bremerhaven, Germany
| | | | | | | | | | | |
Collapse
|
27
|
Tsujimoto M, Nakabayashi K, Yoshidome K, Kaneko T, Iwase T, Akiyama F, Kato Y, Tsuda H, Ueda S, Sato K, Tamaki Y, Noguchi S, Kataoka TR, Nakajima H, Komoike Y, Inaji H, Tsugawa K, Suzuki K, Nakamura S, Daitoh M, Otomo Y, Matsuura N. One-step nucleic acid amplification for intraoperative detection of lymph node metastasis in breast cancer patients. Clin Cancer Res 2007; 13:4807-16. [PMID: 17699859 DOI: 10.1158/1078-0432.ccr-06-2512] [Citation(s) in RCA: 327] [Impact Index Per Article: 19.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
PURPOSE Detection of sentinel lymph node (SLN) metastasis in breast cancer patients has conventionally been determined by intraoperative histopathologic examination of frozen sections followed by definitive postoperative examination of permanent sections. The purpose of this study is to develop a more efficient method for intraoperative detection of lymph node metastasis. EXPERIMENTAL DESIGN Cutoff values to distinguish macrometastasis, micrometastasis, and nonmetastasis were determined by measuring cytokeratin 19 (CK19) mRNA in histopathologically positive and negative lymph nodes using one-step nucleic acid amplification (OSNA). In an intraoperative clinical study involving six facilities, 325 lymph nodes (101 patients), including 81 SLNs, were divided into four blocks. Alternate blocks were used for the OSNA assay with CK19 mRNA, and the remaining blocks were used for H&E and CK19 immunohistochemistry-based three-level histopathologic examination. The results from the two methods were then compared. RESULTS We established CK19 mRNA cutoff values of 2.5 x 10(2) and 5 x 10(3) copies/muL. In the clinical study, an overall concordance rate between the OSNA assay and the three-level histopathology was 98.2%. Similar results were obtained with 81 SLNs. The OSNA assay discriminated macrometastasis from micrometastasis. No false positive was observed in the OSNA assay of 144 histopathologically negative lymph nodes from pN0 patients, indicating an extremely low false positive for the OSNA assay. CONCLUSION The OSNA assay of half of a lymph node provided results similar to those of three-level histopathology. Clinical results indicate that the OSNA assay provides a useful intraoperative detection method of lymph node metastasis in breast cancer patients.
Collapse
|
28
|
Murray D, Doran P, MacMathuna P, Moss AC. In silico gene expression analysis--an overview. Mol Cancer 2007; 6:50. [PMID: 17683638 PMCID: PMC1964762 DOI: 10.1186/1476-4598-6-50] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2007] [Accepted: 08/07/2007] [Indexed: 12/18/2022] Open
Abstract
Efforts aimed at deciphering the molecular basis of complex disease are underpinned by the availability of high throughput strategies for the identification of biomolecules that drive the disease process. The completion of the human genome-sequencing project, coupled to major technological developments, has afforded investigators myriad opportunities for multidimensional analysis of biological systems. Nowhere has this research explosion been more evident than in the field of transcriptomics. Affordable access and availability to the technology that supports such investigations has led to a significant increase in the amount of data generated. As most biological distinctions are now observed at a genomic level, a large amount of expression information is now openly available via public databases. Furthermore, numerous computational based methods have been developed to harness the power of these data. In this review we provide a brief overview of in silico methodologies for the analysis of differential gene expression such as Serial Analysis of Gene Expression and Digital Differential Display. The performance of these strategies, at both an operational and result/output level is assessed and compared. The key considerations that must be made when completing an in silico expression analysis are also presented as a roadmap to facilitate biologists. Furthermore, to highlight the importance of these in silico methodologies in contemporary biomedical research, examples of current studies using these approaches are discussed. The overriding goal of this review is to present the scientific community with a critical overview of these strategies, so that they can be effectively added to the tool box of biomedical researchers focused on identifying the molecular mechanisms of disease.
Collapse
Affiliation(s)
- David Murray
- General Clinical Research Unit, UCD School of Medicine and Medical Sciences, Mater Misericordiae University Hospital, Dublin 7, Ireland
| | - Peter Doran
- General Clinical Research Unit, UCD School of Medicine and Medical Sciences, Mater Misericordiae University Hospital, Dublin 7, Ireland
| | - Padraic MacMathuna
- Gastrointestinal Unit, Mater Misericordiae University Hospital, Dublin 7, Ireland
| | - Alan C Moss
- Division of Gastroenterology, Beth Israel Deaconess Medical Center, 330 Brookline Ave, Boston, MA 02215, USA
| |
Collapse
|
29
|
Zafrakas M, Chorovicer M, Klaman I, Kristiansen G, Wild PJ, Heindrichs U, Knüchel R, Dahl E. Systematic characterisation of GABRP expression in sporadic breast cancer and normal breast tissue. Int J Cancer 2007; 118:1453-9. [PMID: 16187283 DOI: 10.1002/ijc.21517] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
The GABRP gene has been previously identified by in silico analysis of four million ESTs as a candidate gene differentially expressed in breast cancer. GABRP is located on chromosome 5q34 and it encodes the pi-subunit of the gamma-aminobutyric acid (GABA) receptor, a transmembrane protein expressed in the brain and several nonneuronal tissues. Using cDNA dot blot hybridisation (cancer profiling array), quantitative RT-PCR and non-radioisotopic in situ hybridisation (ISH), we have analysed GABRP expression in breast cancer and normal breast tissues as well as in nontumorigenic and tumorigenic breast cell lines. Analysis of the cancer profiling array revealed a more than 2-fold downregulation of GABRP (p < 0.001) in 76% of primary breast carcinomas (n = 50) compared to corresponding normal tissues. Quantitative RT-PCR in a panel of 23 normal human tissues showed that the GABRP expression level was most abundant in the normal breast tissues compared to other human tissues. GABRP downregulation in breast cancer was confirmed by quantitative RT-PCR in cryopreserved breast tumour and normal breast tissue specimens (n = 22), in archival formalin-fixed, paraffin-embedded tissue specimens (n = 32), as well as in breast cancer cell lines (n = 8). Furthermore, a significant downregulation of GABRP was noted in large (pT3-pT4) (p = 0.044) primary breast tumours. Non-radioisotopic ISH showed strong GABRP expression in normal epithelial and benign papilloma breast cells, but no signal could be detected in invasive ductal carcinoma. Altogether, these data suggest that GABRP is progressively down-regulated with tumour-progression, and that it may be useful as a prognostic marker in breast cancer.
Collapse
Affiliation(s)
- Menelaos Zafrakas
- Institute of Pathology, University Hospital of the RWTH, Aachen, Germany
| | | | | | | | | | | | | | | |
Collapse
|
30
|
Comparative analysis of differentially expressed genes in normal and white spot syndrome virus infected Penaeus monodon. BMC Genomics 2007; 8:120. [PMID: 17506900 PMCID: PMC1888707 DOI: 10.1186/1471-2164-8-120] [Citation(s) in RCA: 99] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2007] [Accepted: 05/16/2007] [Indexed: 11/21/2022] Open
Abstract
Background White spot syndrome (WSS) is a viral disease that affects most of the commercially important shrimps and causes serious economic losses to the shrimp farming industry worldwide. However, little information is available in terms of the molecular mechanisms of the host-virus interaction. In this study, we used an expressed sequence tag (EST) approach to observe global gene expression changes in white spot syndrome virus (WSSV)-infected postlarvae of Penaeus monodon. Results Sequencing of the complementary DNA clones of two libraries constructed from normal and WSSV-infected postlarvae produced a total of 15,981 high-quality ESTs. Of these ESTs, 46% were successfully matched against annotated genes in National Center of Biotechnology Information (NCBI) non-redundant (nr) database and 44% were functionally classified using the Gene Ontology (GO) scheme. Comparative EST analyses suggested that, in postlarval shrimp, WSSV infection strongly modulates the gene expression patterns in several organs or tissues, including the hepatopancreas, muscle, eyestalk and cuticle. Our data suggest that several basic cellular metabolic processes are likely to be affected, including oxidative phosphorylation, protein synthesis, the glycolytic pathway, and calcium ion balance. A group of immune-related chitin-binding protein genes is also likely to be strongly up regulated after WSSV infection. A database containing all the sequence data and analysis results is accessible at . Conclusion This study suggests that WSSV infection modulates expression of various kinds of genes. The predicted gene expression pattern changes not only reflect the possible responses of shrimp to the virus infection but also suggest how WSSV subverts cellular functions for virus multiplication. In addition, the ESTs reported in this study provide a rich source for identification of novel genes in shrimp.
Collapse
|
31
|
Transcriptome analysis of the venom gland of the Mexican scorpion Hadrurus gertschi (Arachnida: Scorpiones). BMC Genomics 2007; 8:119. [PMID: 17506894 PMCID: PMC1904202 DOI: 10.1186/1471-2164-8-119] [Citation(s) in RCA: 108] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2007] [Accepted: 05/16/2007] [Indexed: 11/19/2022] Open
Abstract
Background Scorpions like other venomous animals posses a highly specialized organ that produces, secretes and disposes the venom components. In these animals, the last postabdominal segment, named telson, contains a pair of venomous glands connected to the stinger. The isolation of numerous scorpion toxins, along with cDNA-based gene cloning and, more recently, proteomic analyses have provided us with a large collection of venom components sequences. However, all of them are secreted, or at least are predicted to be secretable gene products. Therefore very little is known about the cellular processes that normally take place inside the glands for production of the venom mixture. To gain insights into the scorpion venom gland biology, we have decided to perform a transcriptomic analysis by constructing a cDNA library and conducting a random sequencing screening of the transcripts. Results From the cDNA library prepared from a single venom gland of the scorpion Hadrurus gertschi, 160 expressed sequence tags (ESTs) were analyzed. These transcripts were further clustered into 68 unique sequences (20 contigs and 48 singlets), with an average length of 919 bp. Half of the ESTs can be confidentially assigned as homologues of annotated gene products. Annotation of these ESTs, with the aid of Gene Ontology terms and homology to eukaryotic orthologous groups, reveals some cellular processes important for venom gland function; including high protein synthesis, tuned posttranslational processing and trafficking. Nonetheless, the main group of the identified gene products includes ESTs similar to known scorpion toxins or other previously characterized scorpion venom components, which account for nearly 60% of the identified proteins. Conclusion To the best of our knowledge this report contains the first transcriptome analysis of genes transcribed by the venomous gland of a scorpion. The data were obtained for the species Hadrurus gertschi, belonging to the family Caraboctonidae. One hundred and sixty ESTs were analyzed, showing enrichment in genes that encode for products similar to known venom components, but also provides the first sketch of cellular components, molecular functions, biological processes and some unique sequences of the scorpion venom gland.
Collapse
|
32
|
Lee S, Jo M, Lee J, Koh SS, Kim S. Identification of novel universal housekeeping genes by statistical analysis of microarray data. BMB Rep 2007; 40:226-31. [PMID: 17394773 DOI: 10.5483/bmbrep.2007.40.2.226] [Citation(s) in RCA: 63] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Housekeeping genes are widely used as internal controls in a variety of study types, including real time RT-PCR, microarrays, Northern analysis and RNase protection assays. However, even commonly used housekeeping genes may vary in stability depending on the cell type or disease being studied. Thus, it is necessary to identify additional housekeeping-type genes that show sample-independent stability. Here, we used statistical analysis to examine a large human microarray database, seeking genes that were stably expressed in various tissues, disease states and cell lines. We further selected genes that were expressed at different levels, because reference and target genes should be present in similar copy numbers to achieve reliable quantitative results. Real time RT-PCR amplification of three newly identified reference genes, CGI-119, CTBP1 and GOLGAl, alongside three well-known housekeeping genes, B2M, GAPD, and TUBB, confirmed that the newly identified genes were more stably expressed in individual samples with similar ranges. These results collectively suggest that statistical analysis of microarray data can be used to identify new candidate housekeeping genes showing consistent expression across tissues and diseases. Our analysis identified three novel candidate housekeeping genes (CGI-119, GOLGA1, and CTBP1) that could prove useful for normalization across a variety of RNA-based techniques.
Collapse
Affiliation(s)
- Seram Lee
- Department of Chemistry, Dongguk University, Seoul 100-715, Korea.
| | | | | | | | | |
Collapse
|
33
|
Georg RC, Gomes SL. Transcriptome analysis in response to heat shock and cadmium in the aquatic fungus Blastocladiella emersonii. EUKARYOTIC CELL 2007; 6:1053-62. [PMID: 17449658 PMCID: PMC1951522 DOI: 10.1128/ec.00053-07] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
The global transcriptional response of the chytridiomycete Blastocladiella emersonii to environmental stress conditions was explored by sequencing a large number of expressed sequence tags (ESTs) from three distinct cDNA libraries, constructed with mRNA extracted from cells exposed to heat shock and different concentrations of cadmium chloride. A total of 6,350 high-quality EST sequences were obtained and assembled into 2,326 putative unigenes, 51% of them not previously described in B. emersonii. To approximately 59% of the unigenes it was possible to assign an orthologue in another organism, whereas 41% of them remained without a putative identification, with transcripts related to protein folding and antioxidant activity being highly enriched in the stress libraries. A microarray chip was constructed encompassing 3,773 distinct ESTs from the B. emersonii transcriptome presently available, which correspond to a wide range of biological processes. Global gene expression analysis of B. emersonii cells exposed to stress conditions revealed a large number of differentially expressed genes: 122 up- and 60 downregulated genes during heat shock and 189 up- and 110 downregulated genes during exposure to cadmium. The main functional categories represented among the upregulated genes were protein folding and proteolysis, proteins with antioxidant properties, and cellular transport. Interestingly, in response to cadmium stress, B. emersonii cells induced genes encoding six different glutathione S-transferases and six distinct metacaspases, as well as genes coding for several proteins of sulfur amino acid metabolism, indicating that cadmium causes oxidative stress and apoptosis in this fungus. All sequences described in this study have been submitted to the GenBank EST section with the accession numbers EE 730389 to EE 736848.
Collapse
Affiliation(s)
- Raphaela C Georg
- Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, Av. Prof. Lineu Prestes 748, 05508-000 São Paulo, Brazil
| | | |
Collapse
|
34
|
In silico whole-genome screening for cancer-related single-nucleotide polymorphisms located in human mRNA untranslated regions. BMC Genomics 2007; 8:2. [PMID: 17201911 PMCID: PMC1774567 DOI: 10.1186/1471-2164-8-2] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2006] [Accepted: 01/03/2007] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND A promising application of the huge amounts of genetic data currently available lies in developing a better understanding of complex diseases, such as cancer. Analysis of publicly available databases can help identify potential candidates for genes or mutations specifically related to the cancer phenotype. In spite of their huge potential to affect gene function, no systematic attention has been paid so far to the changes that occur in untranslated regions of mRNA. RESULTS In this study, we used Expressed Sequence Tag (EST) databases as a source for cancer-related sequence polymorphism discovery at the whole-genome level. Using a novel computational procedure, we focused on the identification of untranslated region (UTR)-localized non-coding Single Nucleotide Polymorphisms (UTR-SNPs) significantly associated with the tumoral state. To explore possible relationships between genetic mutation and phenotypic variation, bioinformatic tools were used to predict the potential impact of cancer-associated UTR-SNPs on mRNA secondary structure and UTR regulatory elements. We provide a comprehensive and unbiased description of cancer-associated UTR-SNPs that may be useful to define genotypic markers or to propose polymorphisms that can act to alter gene expression levels. Our results suggest that a fraction of cancer-associated UTR-SNPs may have functional consequences on mRNA stability and/or expression. CONCLUSION We have undertaken a comprehensive effort to identify cancer-associated polymorphisms in untranslated regions of mRNA and to characterize putative functional UTR-SNPs. Alteration of translational control can change the expression of genes in tumor cells, causing an increase or decrease in the concentration of specific proteins. Through the description of testable candidates and the experimental validation of a number of UTR-SNPs discovered on the secreted protein acidic and rich in cysteine (SPARC) gene, this report illustrates the utility of a cross-talk between in silico transcriptomics and cancer genetics.
Collapse
|
35
|
Arhondakis S, Clay O, Bernardi G. Compositional properties of human cDNA libraries: practical implications. FEBS Lett 2006; 580:5772-8. [PMID: 17022979 DOI: 10.1016/j.febslet.2006.09.034] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2006] [Revised: 09/12/2006] [Accepted: 09/19/2006] [Indexed: 01/28/2023]
Abstract
The strikingly wide and bimodal gene distribution exhibited by the human genome has prompted us to study the correlations between EST-counts (expression levels) and base composition of genes, especially since existing data are contradictory. Here we investigate how cDNA library preparation affects the GC distributions of ESTs and/or genes found in the library, and address consequences for expression studies. We observe that strongly anomalous GC distributions often indicate experimental biases or deficits during their preparation. We propose the use of compositional distributions of raw ESTs from a cDNA library, and/or of the genes they represent, as a simple and effective tool for quality control.
Collapse
Affiliation(s)
- Stilianos Arhondakis
- Laboratory of Molecular Evolution, Stazione Zoologica Anton Dohrn, 80121 Naples, Italy
| | | | | |
Collapse
|
36
|
Hecht J, Kuhl H, Haas SA, Bauer S, Poustka AJ, Lienau J, Schell H, Stiege AC, Seitz V, Reinhardt R, Duda GN, Mundlos S, Robinson PN. Gene identification and analysis of transcripts differentially regulated in fracture healing by EST sequencing in the domestic sheep. BMC Genomics 2006; 7:172. [PMID: 16822315 PMCID: PMC1578570 DOI: 10.1186/1471-2164-7-172] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2006] [Accepted: 07/05/2006] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The sheep is an important model animal for testing novel fracture treatments and other medical applications. Despite these medical uses and the well known economic and cultural importance of the sheep, relatively little research has been performed into sheep genetics, and DNA sequences are available for only a small number of sheep genes. RESULTS In this work we have sequenced over 47 thousand expressed sequence tags (ESTs) from libraries developed from healing bone in a sheep model of fracture healing. These ESTs were clustered with the previously available 10 thousand sheep ESTs to a total of 19087 contigs with an average length of 603 nucleotides. We used the newly identified sequences to develop RT-PCR assays for 78 sheep genes and measured differential expression during the course of fracture healing between days 7 and 42 postfracture. All genes showed significant shifts at one or more time points. 23 of the genes were differentially expressed between postfracture days 7 and 10, which could reflect an important role for these genes for the initiation of osteogenesis. CONCLUSION The sequences we have identified in this work are a valuable resource for future studies on musculoskeletal healing and regeneration using sheep and represent an important head-start for genomic sequencing projects for Ovis aries, with partial or complete sequences being made available for over 5,800 previously unsequenced sheep genes.
Collapse
Affiliation(s)
- Jochen Hecht
- Max Planck Institute for Molecular Genetics, Berlin, Germany.
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
37
|
Aouacheria A, Navratil V, Barthelaix A, Mouchiroud D, Gautier C. Bioinformatic screening of human ESTs for differentially expressed genes in normal and tumor tissues. BMC Genomics 2006; 7:94. [PMID: 16640784 PMCID: PMC1459866 DOI: 10.1186/1471-2164-7-94] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2005] [Accepted: 04/26/2006] [Indexed: 11/24/2022] Open
Abstract
Background Owing to the explosion of information generated by human genomics, analysis of publicly available databases can help identify potential candidate genes relevant to the cancerous phenotype. The aim of this study was to scan for such genes by whole-genome in silico subtraction using Expressed Sequence Tag (EST) data. Methods Genes differentially expressed in normal versus tumor tissues were identified using a computer-based differential display strategy. Bcl-xL, an anti-apoptotic member of the Bcl-2 family, was selected for confirmation by western blot analysis. Results Our genome-wide expression analysis identified a set of genes whose differential expression may be attributed to the genetic alterations associated with tumor formation and malignant growth. We propose complete lists of genes that may serve as targets for projects seeking novel candidates for cancer diagnosis and therapy. Our validation result showed increased protein levels of Bcl-xL in two different liver cancer specimens compared to normal liver. Notably, our EST-based data mining procedure indicated that most of the changes in gene expression observed in cancer cells corresponded to gene inactivation patterns. Chromosomes and chromosomal regions most frequently associated with aberrant expression changes in cancer libraries were also determined. Conclusion Through the description of several candidates (including genes encoding extracellular matrix and ribosomal components, cytoskeletal proteins, apoptotic regulators, and novel tissue-specific biomarkers), our study illustrates the utility of in silico transcriptomics to identify tumor cell signatures, tumor-related genes and chromosomal regions frequently associated with aberrant expression in cancer.
Collapse
Affiliation(s)
- Abdel Aouacheria
- Laboratoire de Biométrie et Biologie Evolutive, CNRS UMR 5558, Université Claude Bernard Lyon 1, 69622 Villeurbanne Cedex, France
- Current address: Apoptosis and Oncogenesis Laboratory, IBCP, UMR 5086 CNRS-UCBL, IFR 128, Lyon, France
| | - Vincent Navratil
- Laboratoire de Biométrie et Biologie Evolutive, CNRS UMR 5558, Université Claude Bernard Lyon 1, 69622 Villeurbanne Cedex, France
| | | | - Dominique Mouchiroud
- Laboratoire de Biométrie et Biologie Evolutive, CNRS UMR 5558, Université Claude Bernard Lyon 1, 69622 Villeurbanne Cedex, France
| | - Christian Gautier
- Laboratoire de Biométrie et Biologie Evolutive, CNRS UMR 5558, Université Claude Bernard Lyon 1, 69622 Villeurbanne Cedex, France
| |
Collapse
|
38
|
Nie D, Xiang Y. Molecular cloning and characterization of a novel human testis-specific gene by use of digital differential display. J Genet 2006; 85:57-62. [PMID: 16809841 DOI: 10.1007/bf02728971] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Dongsong Nie
- Department of Chemistry and Chemical Engineering, Hunan Institute of Science and Technology, Hunan, YueYang 414000, China.
| | | |
Collapse
|
39
|
Lü B, Xu J, Lai M, Zhang H, Chen J. A transcriptome anatomy of human colorectal cancers. BMC Cancer 2006; 6:40. [PMID: 16504081 PMCID: PMC1402307 DOI: 10.1186/1471-2407-6-40] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2005] [Accepted: 02/23/2006] [Indexed: 11/24/2022] Open
Abstract
Background Accumulating databases in human genome research have enabled integrated genome-wide study on complicated diseases such as cancers. A practical approach is to mine a global transcriptome profile of disease from public database. New concepts of these diseases might emerge by landscaping this profile. Methods In this study, we clustered human colorectal normal mucosa (N), inflammatory bowel disease (IBD), adenoma (A) and cancer (T) related expression sequence tags (EST) into UniGenes via an in-house GetUni software package and analyzed the transcriptome overview of these libraries by GOTree Machine (GOTM). Additionally, we downloaded UniGene based cDNA libraries of colon and analyzed them by Xprofiler to cross validate the efficiency of GetUni. Semi-quantitative RT-PCR was used to validate the expression of β-catenin and. 7 novel genes in colorectal cancers. Results The efficiency of GetUni was successfully validated by Xprofiler and RT-PCR. Genes in library N, IBD and A were all found in library T. A total of 14,879 genes were identified with 2,355 of them having at least 2 transcripts. Differences in gene enrichment among these libraries were statistically significant in 50 signal transduction pathways and Pfam protein domains by GOTM analysis P < 0.01 Hypergeometric Test). Genes in two metabolic pathways, ribosome and glycolysis, were more enriched in the expression profiles of A and IBD than in N and T. Seven transmembrane receptor superfamily genes were typically abundant in cancers. Conclusion Colorectal cancers are genetically heterogeneous. Transcription variants are common in them. Aberrations of ribosome and glycolysis pathway might be early indicators of precursor lesions in colon cancers. The electronic gene expression profile could be used to highlight the integral molecular events in colorectal cancers.
Collapse
Affiliation(s)
- Bingjian Lü
- Department of Pathology & Pathophysiology, School of Medicine, Zhejiang University, PR , 310031, China
| | - Jing Xu
- Department of Pathology & Pathophysiology, School of Medicine, Zhejiang University, PR , 310031, China
| | - Maode Lai
- Department of Pathology & Pathophysiology, School of Medicine, Zhejiang University, PR , 310031, China
| | - Hao Zhang
- Department of Computer Science, School of Computer Science & Technology, Zhejiang University, PR, 310023, China
| | - Jian Chen
- Department of Pathology & Pathophysiology, School of Medicine, Zhejiang University, PR , 310031, China
| |
Collapse
|
40
|
Liu D, Graber JH. Quantitative comparison of EST libraries requires compensation for systematic biases in cDNA generation. BMC Bioinformatics 2006; 7:77. [PMID: 16503995 PMCID: PMC1431573 DOI: 10.1186/1471-2105-7-77] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2005] [Accepted: 02/17/2006] [Indexed: 12/28/2022] Open
Abstract
Background Publicly accessible EST libraries contain valuable information that can be utilized for studies of tissue-specific gene expression and processing of individual genes. This information is, however, confounded by multiple systematic effects arising from the procedures used to generate these libraries. Results We used alignment of ESTs against a reference set of transcripts to estimate the size distributions of the cDNA inserts and sampled mRNA transcripts in individual EST libraries and show how these measurements can be used to inform quantitative comparisons of libraries. While significant attention has been paid to the effects of normalization and substraction, we also find significant biases in transcript sampling introduced by the combined procedures of reverse transcription and selection of cDNA clones for sequencing. Using examples drawn from studies of mRNA 3'-processing (cleavage and polyadenylation), we demonstrate effects of the transcript sampling bias, and provide a method for identifying libraries that can be safely compared without bias. All data sets, supplemental data, and software are available at our supplemental web site [1]. Conclusion The biases we characterize in the transcript sampling of EST libraries represent a significant and heretofore under-appreciated source of false positive candidates for tissue-, cell type-, or developmental stage-specific activity or processing of genes. Uncorrected, quantitative comparison of dissimilar EST libraries will likely result in the identification of statistically significant, but biologically meaningless changes.
Collapse
Affiliation(s)
- Donglin Liu
- The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, USA
| | - Joel H Graber
- The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, USA
| |
Collapse
|
41
|
Burkhardt M, Mayordomo E, Winzer KJ, Fritzsche F, Gansukh T, Pahl S, Weichert W, Denkert C, Guski H, Dietel M, Kristiansen G. Cytoplasmic overexpression of ALCAM is prognostic of disease progression in breast cancer. J Clin Pathol 2006; 59:403-9. [PMID: 16484444 PMCID: PMC1860378 DOI: 10.1136/jcp.2005.028209] [Citation(s) in RCA: 92] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
Abstract
BACKGROUND Activated leucocyte cell adhesion molecule (ALCAM, CD166) is a cell surface member of the immunoglobulin superfamily. ALCAM expression has prognostic relevance in prostate and colon cancer. OBJECTIVE To evaluate ALCAM protein expression in breast cancer by immunohistochemistry and to correlate expression levels with clinicopathological data. METHODS 162 primary breast carcinomas with a mean clinical follow up time of 53 months were immunostained using a monoclonal ALCAM antibody. The staining was evaluated as an immunoreactive score (IRS) and grouped into low v high for both membranous and cytoplasmic staining. RESULTS Intraductal and invasive carcinomas showed a higher ALCAM expression (median IRS 4 and 6 respectively) than normal breast tissue (IRS 2). In univariate survival analyses a significant association of high cytoplasmic ALCAM expression with shortened patient disease-free survival (mean (SD) five year non-progression rate, 69.4 (4.6)% v 49.4 (11.1)%, p = 0.0142) was found. In multivariate analyses of disease-free survival times, high cytoplasmic ALCAM expression (relative risk (RR) = 2.086, p = 0.026) and nodal status (RR = 2.246, p = 0.035) were significantly associated with earlier disease progression, whereas tumour grading (RR = 1.6, p = 0.052) was of borderline significance. CONCLUSIONS The data suggest that strong cytoplasmic ALCAM expression in primary breast cancer, as detected by immunohistochemistry, might be a new marker for a more aggressive breast cancer biology.
Collapse
Affiliation(s)
- M Burkhardt
- Institute of Pathology, Charité-Universitätsmedizin, Berlin, Germany
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
42
|
Wild PJ, Krieg RC, Seidl J, Stoehr R, Reher K, Hofmann C, Louhelainen J, Rosenthal A, Hartmann A, Pilarsky C, Bosserhoff AK, Knuechel R. RNA expression profiling of normal and tumor cells following photodynamic therapy with 5-aminolevulinic acid-induced protoporphyrin IX in vitro. Mol Cancer Ther 2005; 4:516-28. [PMID: 15827324 DOI: 10.1158/1535-7163.mct-04-0141] [Citation(s) in RCA: 36] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Photodynamic therapy using 5-aminolevulinic acid-induced protoporphyrin IX synthesis as a photosensitizing reagent is an encouraging modality for cancer treatment. Understanding the mechanism of tumor phototoxicity is important to provide a basis for combinatory therapy regimens. A normal cell line (UROtsa, urothelial) and two tumor cell lines (RT4, urothelial; HT29, colonic) were treated with cell line-specific LD50 doses of light after exposure to 5-aminolevulinic acid (100 microg/mL), and harvested for RNA extraction 0, 10, and 30 minutes after irradiation. The RNA was hybridized to the metg001A Affymetrix GeneChip containing 2,800 genes, focusing on cancer-related and growth regulatory targets. Comparing the gene expression profiles between the different samples, 40 genes (e.g., SOD2, LUC7A, CASP8, and DUSP1) were identified as significantly altered in comparison with the control samples, and grouped according to their gene ontology. We selected caspase-8 (CASP8) and dual specificity phosphatase 1 (DUSP1) for further validation of the array findings, and compared their expression with the expression of the immediate early gene FOS by quantitative reverse transcription-PCR. RNA expression of CASP8 stayed unchanged whereas DUSP1 RNA was up-regulated in normal and tumor cells starting 30 minutes after irradiation. In contrast, FOS RNA was found continuously up-regulated over time in all three cell lines. Induction of DUSP1 protein expression was clearly shown after 1 hour using Western blot analysis. Interestingly, no changes of caspase-8 protein expression but activation of catalytic activity was detected only in UROtsa cells starting 1 hour after photodynamic therapy, whereas no changes were seen in both tumor cell lines. According to caspase-8, the active caspase 3 fragment was found only in the normal urothelial cell line (UROtsa) 1 hour after photodynamic therapy. Combined data analysis suggests that photodynamic therapy in vitro (LD50) leads to apoptosis in UROtsa and to necrosis in the tumor cell lines, respectively. RNA expression profiling of normal and tumor cell lines following photodynamic therapy with 5-aminolevulinic acid gave insight into the major molecular mechanisms induced by photodynamic therapy.
Collapse
Affiliation(s)
- Peter J Wild
- Institute of Pathology, University of Regensburg, Regensburg, Germany
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
43
|
Jiang Z, Wu XL, Garcia MD, Griffin KB, Michal JJ, Ott TL, Gaskins CT, Wright RW. Comparative gene-based in silico analysis of transcriptomes in different bovine tissues and (or) organs. Genome 2005; 47:1164-72. [PMID: 15644975 DOI: 10.1139/g04-084] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
A gene-based approach was used to annotate 322,168 cattle expressed sequence tags (ESTs) based on human genes in order to census the transcriptomes, analyze their expression similarities, and identify genes preferentially expressed in different bovine tissues and (or) organs. Of the 34,157 human coding genes used in a standalone BLAST search, 14,928 could be matched with provisional orthologous sequences in a total of 230,135 bovine ESTs. The remaining 92,033 bovine ESTs were estimated to represent an additional 5970 genes in cattle. On average, approximately 8600 genes were estimated to be expressed in a single tissue and (or) organ and 13,000 in a pooled tissue library. On the basis of the estimated numbers of genes, no more than 3% of genes would be missed when approximately 34,000 ESTs were sequenced from a single tissue and (or) organ library and approximately 40,000 ESTs from a pooled source, respectively. Cluster analyses of the gene expression patterns among 12 single tissues and (or) organs in cattle revealed that their expression similarities would depend on physiological functions. In addition, a total of 1502 genes were identified as preferentially expressed genes in these 12 single tissues and (or) organs with LOD (logarithm of the odds, base 10) > or = 3.0. Therefore, our study provides some insights for further investigating the developmental and functional relations of various tissues and organs in mammals.
Collapse
Affiliation(s)
- Zhihua Jiang
- Department of Animal Sciences, Washington State University, Pullman, WA 99164, USA.
| | | | | | | | | | | | | | | |
Collapse
|
44
|
Alldinger I, Dittert D, Peiper M, Fusco A, Chiappetta G, Staub E, Lohr M, Jesnowski R, Baretton G, Ockert D, Saeger HD, Grützmann R, Pilarsky C. Gene expression analysis of pancreatic cell lines reveals genes overexpressed in pancreatic cancer. Pancreatology 2005; 5:370-9. [PMID: 15983444 DOI: 10.1159/000086537] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/05/2004] [Accepted: 09/16/2004] [Indexed: 12/11/2022]
Abstract
BACKGROUND Pancreatic cancer is one of the leading causes of cancer-related death. Using DNA gene expression analysis based on a custom made Affymetrix cancer array, we investigated the expression pattern of both primary and established pancreatic carcinoma cell lines. METHODS We analyzed the gene expression of 5 established pancreatic cancer cell lines (AsPC-1, BxPC-3, Capan-1, Capan-2 and HPAF II) and 5 primary isolates, 1 of them derived from benign pancreatic duct cells. RESULTS Out of 1,540 genes which were expressed in at least 3 experiments, we found 122 genes upregulated and 18 downregulated in tumor cell lines compared to benign cells with a fold change >3. Several of the upregulated genes (like Prefoldin 5, ADAM9 and E-cadherin) have been associated with pancreatic cancer before. The other differentially regulated genes, however, play a so far unknown role in the course of human pancreatic carcinoma. By means of immunohistochemistry we could show that thymosin beta-10 (TMSB10), upregulated in tumor cell lines, is expressed in human pancreatic carcinoma, but not in non-neoplastic pancreatic tissue, suggesting a role for TMSB10 in the carcinogenesis of pancreatic carcinoma. CONCLUSION Using gene expression profiling of pancreatic cell lines we were able to identify genes differentially expressed in pancreatic adenocarcinoma, which might contribute to pancreatic cancer development.
Collapse
Affiliation(s)
- Ingo Alldinger
- Department of Visceral-, Thoracic- and Vascular Surgery, University Hospital, Dresden, Germany
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
45
|
Wild PJ, Herr A, Wissmann C, Stoehr R, Rosenthal A, Zaak D, Simon R, Knuechel R, Pilarsky C, Hartmann A. Gene Expression Profiling of Progressive Papillary Noninvasive Carcinomas of the Urinary Bladder. Clin Cancer Res 2005; 11:4415-29. [PMID: 15958626 DOI: 10.1158/1078-0432.ccr-05-0259] [Citation(s) in RCA: 87] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
PURPOSE The aim of the present study was to define gene expression profiles of noninvasive and invasive bladder cancer, to identify potential therapeutic or screening targets in bladder cancer, and to define genetic changes relevant for tumor progression of recurrent papillary bladder cancer (pTa). EXPERIMENTAL DESIGN Overall, 67 bladder neoplasms (46 pTa, 3 pTis, 10 pT1, and 8 pT2) and eight normal bladder specimens were investigated by a combination of laser microdissection and gene expression profiling. Eight of 16 patients with recurrent noninvasive papillary bladder tumors developed carcinoma in situ (pTis) or invasive bladder cancer (> or = pT1G2) in the course of time. RNA expression results of the putative progression marker cathepsin E (CTSE) were confirmed by immunohistochemistry using high-throughput tissue microarray analysis (n = 776). Univariate analysis of factors regarding overall survival, progression-free survival, and recurrence-free survival in patients with urothelial bladder cancer was done. RESULTS Hierarchical cluster analyses revealed no differences between pTaG1 and pTaG2 tumors. However, distinct groups of invasive cancers with different gene expression profiles in papillary and solid tumors were found. Progression-associated gene profiles could be defined (e.g., FABP4 and CTSE) and were already present in the preceding noninvasive papillary tumors. CTSE expression (P = 0.003) and a high Ki-67 labeling index of at least 5% (P = 0.01) were the only factors that correlated significantly with progression-free survival of pTa tumors in our gene expression approach. CONCLUSIONS Gene expression profiling revealed novel genes with potential clinical utility to select patients that are more likely to develop aggressive disease.
Collapse
Affiliation(s)
- Peter J Wild
- Institute of Pathology, University of Regensburg, Regensburg, Germany
| | | | | | | | | | | | | | | | | | | |
Collapse
|
46
|
Ferguson DA, Chiang JTA, Richardson JA, Graff J. eXPRESSION: an in silico tool to predict patterns of gene expression. Gene Expr Patterns 2005; 5:619-28. [PMID: 15939374 DOI: 10.1016/j.modgep.2005.03.003] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2004] [Revised: 01/17/2005] [Accepted: 03/09/2005] [Indexed: 11/22/2022]
Abstract
In embryological studies, expression pattern analyses are of special importance since genes that have temporally and spatially restricted expression are not only essential as lineage markers but are often causative in formation of specific fates. Further, where a molecule is expressed can be quite revealing in regard to its endogenous function. We present a gene discovery tool, termed eXPRESSION, that utilizes the public EST databases to identify genes matching desired transcriptional profiles. We first tested and validated the ability of eXPRESSION to discover tissue-specific genes in the adult mouse; empirically as well as with DNA microarrays and RT-PCRs. These studies showed that eXPRESSION predictions could identify genes that are specifically expressed in adult mouse tissues. Next, we developed a novel search strategy to find genes that are expressed in specific regions or tissues of the developing mouse embryo. With these tools, we identified several novel genes that exhibited a neural-specific or neural-enriched expression pattern during murine development. The data show that eXPRESSION is widely applicable and may be used to identify both adult and embryonic tissue- or organ-specific genes with minimal cost and effort.
Collapse
Affiliation(s)
- Deborah A Ferguson
- Center for Developmental Biology, UT Southwestern Medical Center, Dallas, TX 75390-9133, USA
| | | | | | | |
Collapse
|
47
|
Kristiansen G, Pilarsky C, Wissmann C, Kaiser S, Bruemmendorf T, Roepcke S, Dahl E, Hinzmann B, Specht T, Pervan J, Stephan C, Loening S, Dietel M, Rosenthal A. Expression profiling of microdissected matched prostate cancer samples reveals CD166/MEMD and CD24 as new prognostic markers for patient survival. J Pathol 2005; 205:359-76. [PMID: 15532095 DOI: 10.1002/path.1676] [Citation(s) in RCA: 133] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
In order to screen for differentially expressed genes that might be useful in diagnosis or therapy of prostate cancer we have used a custom made Affymetrix GeneChip containing 3950 cDNA fragments. Expression profiles were obtained from 42 matched pairs of mRNAs isolated from microdissected malignant and benign prostate tissues. Applying three different bioinformatic approaches to define differential gene expression, we found 277 differentially expressed genes, of which 98 were identified by all three methods. Fourteen per cent of these genes were not found in other expression studies, which were based on bulk tissue. Resultant candidate genes were further validated by quantitative RT-PCR, mRNA in situ hybridization and immunohistochemistry. AGR2 was over-expressed in 89% of prostate carcinomas, but did not have prognostic significance. Immunohistologically detected over-expression of MEMD and CD24 was identified in 86% and 38.5% of prostate carcinomas respectively, and both were predictive of PSA relapse. Combined marker analysis using MEMD and CD24 expression proved to be an independent prognostic factor (RR = 4.7, p = 0.006) in a Cox regression model, and was also superior to conventional markers. This combination of molecular markers thus appears to allow improved prediction of patient prognosis, but should be validated in larger studies.
Collapse
Affiliation(s)
- Glen Kristiansen
- Institute of Pathology, Charité University Hospital, Berlin, Germany.
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
48
|
Dahl E, Sadr-Nabavi A, Klopocki E, Betz B, Grube S, Kreutzfeld R, Himmelfarb M, An HX, Gelling S, Klaman I, Hinzmann B, Kristiansen G, Grützmann R, Kuner R, Petschke B, Rhiem K, Wiechen K, Sers C, Wiestler O, Schneider A, Höfler H, Nährig J, Dietel M, Schäfer R, Rosenthal A, Schmutzler R, Dürst M, Meindl A, Niederacher D. Systematic identification and molecular characterization of genes differentially expressed in breast and ovarian cancer. J Pathol 2004; 205:21-8. [PMID: 15586368 DOI: 10.1002/path.1687] [Citation(s) in RCA: 48] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
The identification of novel disease-associated genes in gynaecological tumours has important implications for understanding the process of tumourigenesis and the development of novel treatment regimens. cDNA libraries from disease tissues may represent a valuable source to identify such genes. Recently, a bio-informatic procedure based on an 'electronic Northern' approach was established to screen expressed sequence tag (EST) libraries for genes differentially expressed in tumour and normal tissues, and identified 450 candidate genes differentially expressed in breast and ovarian cancer. In this report, the validation of an initial set of 40 candidate genes, which were selected due to their localization in chromosomal regions frequently altered in gynaecological tumours, is described. Differential expression of 29 of these genes, including three uncharacterized novel genes, was confirmed by applying cancer profiling arrays with 106 matched pairs of tumour/normal cDNAs and quantitative reverse transcription-polymerase chain reaction (RT-PCR) on 60 clinical specimens. The majority of these differentially expressed genes have not been described previously in the context of breast and ovarian cancer, and may constitute novel diagnostic markers for these tumour entities.
Collapse
Affiliation(s)
- Edgar Dahl
- Institute of Pathology, RWTH Aachen, Germany.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
49
|
Fei Z, Tang X, Alba RM, White JA, Ronning CM, Martin GB, Tanksley SD, Giovannoni JJ. Comprehensive EST analysis of tomato and comparative genomics of fruit ripening. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2004; 40:47-59. [PMID: 15361140 DOI: 10.1111/j.1365-313x.2004.02188.x] [Citation(s) in RCA: 100] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/20/2023]
Abstract
A large tomato expressed sequence tag (EST) dataset (152 635 total) was analyzed to gain insights into differential gene expression among diverse plant tissues representing a range of developmental programs and biological responses. These ESTs were clustered and assembled to a total of 31 012 unique gene sequences. To better understand tomato gene expression at a plant system level and to identify differentially expressed and tissue-specific genes, we developed and implemented a digital expression analysis protocol. By clustering genes according to their relative abundance in the various EST libraries, expression patterns of genes across various tissues were generated and genes with similar patterns were grouped. In addition, tissues themselves were clustered for relatedness based on relative gene expression as a means of validating the integrity of the EST data as representative of relative gene expression. Arabidopsis and grape EST collections were also characterized to facilitate cross-species comparisons where possible. Tomato fruit digital expression data was specifically compared with publicly available grape EST data to gain insight into molecular manifestation of ripening processes across diverse taxa and resulted in identification of common transcription factors not previously associated with ripening.
Collapse
Affiliation(s)
- Zhangjun Fei
- Boyce Thompson Institute for Plant Research, Cornell University, Ithaca, NY 14853, USA
| | | | | | | | | | | | | | | |
Collapse
|
50
|
Gupta S, Zink D, Korn B, Vingron M, Haas SA. Strengths and weaknesses of EST-based prediction of tissue-specific alternative splicing. BMC Genomics 2004; 5:72. [PMID: 15453915 PMCID: PMC521684 DOI: 10.1186/1471-2164-5-72] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2004] [Accepted: 09/28/2004] [Indexed: 12/15/2022] Open
Abstract
Background Alternative splicing contributes significantly to the complexity of the human transcriptome and proteome. Computational prediction of alternative splice isoforms are usually based on EST sequences that also allow to approximate the expression pattern of the related transcripts. However, the limited number of tissues represented in the EST data as well as the different cDNA construction protocols may influence the predictive capacity of ESTs to unravel tissue-specifically expressed transcripts. Methods We predict tissue and tumor specific splice isoforms based on the genomic mapping (SpliceNest) of the EST consensus sequences and library annotation provided in the GeneNest database. We further ascertain the potentially rare tissue specific transcripts as the ones represented only by ESTs derived from normalized libraries. A subset of the predicted tissue and tumor specific isoforms are then validated via RT-PCR experiments over a spectrum of 40 tissue types. Results Our strategy revealed 427 genes with at least one tissue specific transcript as well as 1120 genes showing tumor specific isoforms. While our experimental evaluation of computationally predicted tissue-specific isoforms revealed a high success rate in confirming the expression of these isoforms in the respective tissue, the strategy frequently failed to detect the expected restricted expression pattern. The analysis of putative lowly expressed transcripts using normalized cDNA libraries suggests that our ability to detect tissue-specific isoforms strongly depends on the expression level of the respective transcript as well as on the sensitivity of the experimental methods. Especially splice isoforms predicted to be disease-specific tend to represent transcripts that are expressed in a set of healthy tissues rather than novel isoforms. Conclusions We propose to combine the computational prediction of alternative splice isoforms with experimental validation for efficient delineation of an accurate set of tissue-specific transcripts.
Collapse
Affiliation(s)
- Shobhit Gupta
- Computational Molecular Biology, Max Planck Institute for Molecular Genetics, Ihnestr. 73, D-14195 Berlin – Germany
| | - Dorothea Zink
- German Resource Center for Genome Research, INF 580, 69120 Heidelberg – Germany
| | - Bernhard Korn
- German Resource Center for Genome Research, INF 580, 69120 Heidelberg – Germany
| | - Martin Vingron
- Computational Molecular Biology, Max Planck Institute for Molecular Genetics, Ihnestr. 73, D-14195 Berlin – Germany
| | - Stefan A Haas
- Computational Molecular Biology, Max Planck Institute for Molecular Genetics, Ihnestr. 73, D-14195 Berlin – Germany
| |
Collapse
|