151
|
Ergun S, Oztuzcu S. Oncocers: ceRNA-mediated cross-talk by sponging miRNAs in oncogenic pathways. Tumour Biol 2015; 36:3129-36. [PMID: 25809705 DOI: 10.1007/s13277-015-3346-x] [Citation(s) in RCA: 149] [Impact Index Per Article: 16.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2015] [Accepted: 03/15/2015] [Indexed: 12/12/2022] Open
Abstract
Competing endogenous RNAs (ceRNAs) are RNA transcripts which can communicate with each other by decreasing targeting concentration of micro-RNA (miRNA) with the derepression of other messenger RNAs (mRNAs) having the common miRNA response elements (MREs). Oncocers are ceRNAs taking crucial roles in oncogenic pathways processed in many types of cancer, and this study analyzes oncocer-mediated cross-talk by sponging microRNAs (miRNAs) in these pathways. While doing this, breast, liver, colon, prostate, gastric, lung, endometrium, thyroid and epithelial cancers and melanoma, rhabdomyosarcoma, glioblastoma, acute promyelocytic leukemia, retinoblastoma, and neuroblastoma were analyzed with respect to ceRNA-based carcinogenesis. This study defines, firstly, oncocers in the literature and contains all oncocer-related findings found up to now. Therefore, it will help to increase our comprehension about oncocer-mediated mechanisms. Via this study, a novel perspective would be produced to make clear cancer mechanisms and suggest novel approaches to regulate ceRNA networks via miRNA competition for cancer therapeutics. Graphical Abstract Multiple RNA transcripts have common MREs for the similar miRNA in their 3'-untranslated regions (3'-UTRs). Upregulation of ceRNAs rises the abundance of specific MREs and shifts the miRNA pool distribution, as a result, leading to the increased expression of target mRNA. The depot of genomic mutations and epigenetic alterations changing gene function and expression causes cancers. Herewith, genome-based somatic base-pair mutations, DNA copy number alterations, chromosomal translocation, also transcript fusions, alternative splicing are usually seen in cancer situations. Consequently, such cases causing changed UTR expression in transcripts influence the levels of MRE or present new MREs into the cells. Alterations in MREs of ceRNAs affect the capability of a specific mRNA transcript to attach or titrate miRNAs. As a result, the disturbed ceRNA network can lead to diseases and cancers. As a new term in RNA world, oncocers-the name for ceRNAs taking crucial roles in oncogenic pathways-are processed in many types of cancer, and oncocer-mediated cross-talk are analyzed by sponging miRNAs in these pathways.
Collapse
Affiliation(s)
- Sercan Ergun
- Ulubey Vocational Higher School, Ordu University, Ordu, Turkey,
| | | |
Collapse
|
152
|
Yu L, Tang W, He W, Ma X, Vasseur L, Baxter SW, Yang G, Huang S, Song F, You M. Characterization and expression of the cytochrome P450 gene family in diamondback moth, Plutella xylostella (L.). Sci Rep 2015; 5:8952. [PMID: 25752830 PMCID: PMC5155450 DOI: 10.1038/srep08952] [Citation(s) in RCA: 57] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2014] [Accepted: 02/04/2015] [Indexed: 12/30/2022] Open
Abstract
Cytochrome P450 monooxygenases are present in almost all organisms and can play vital roles in hormone regulation, metabolism of xenobiotics and in biosynthesis or inactivation of endogenous compounds. In the present study, a genome-wide approach was used to identify and analyze the P450 gene family of diamondback moth, Plutella xylostella, a destructive worldwide pest of cruciferous crops. We identified 85 putative cytochrome P450 genes from the P. xylostella genome, including 84 functional genes and 1 pseudogene. These genes were classified into 26 families and 52 subfamilies. A phylogenetic tree constructed with three additional insect species shows extensive gene expansions of P. xylostella P450 genes from clans 3 and 4. Gene expression of cytochrome P450s was quantified across multiple developmental stages (egg, larva, pupa and adult) and tissues (head and midgut) using P. xylostella strains susceptible or resistant to insecticides chlorpyrifos and fiprinol. Expression of the lepidopteran specific CYP367s predominantly occurred in head tissue suggesting a role in either olfaction or detoxification. CYP340s with abundant transposable elements and relatively high expression in the midgut probably contribute to the detoxification of insecticides or plant toxins in P. xylostella. This study will facilitate future functional studies of the P. xylostella P450s in detoxification.
Collapse
Affiliation(s)
- Liying Yu
- Institute of Applied Ecology, Fujian Agriculture and Forestry University, Fuzhou 350002, China
- Faculty of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou 350002, China
- Key Laboratory of Integrated Pest Management for Fujian-Taiwan Crops, Ministry of Agriculture, Fuzhou 350002, China
| | - Weiqi Tang
- Institute of Applied Ecology, Fujian Agriculture and Forestry University, Fuzhou 350002, China
- Faculty of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou 350002, China
| | - Weiyi He
- Institute of Applied Ecology, Fujian Agriculture and Forestry University, Fuzhou 350002, China
- Key Laboratory of Integrated Pest Management for Fujian-Taiwan Crops, Ministry of Agriculture, Fuzhou 350002, China
| | - Xiaoli Ma
- Institute of Applied Ecology, Fujian Agriculture and Forestry University, Fuzhou 350002, China
- Key Laboratory of Integrated Pest Management for Fujian-Taiwan Crops, Ministry of Agriculture, Fuzhou 350002, China
| | - Liette Vasseur
- Institute of Applied Ecology, Fujian Agriculture and Forestry University, Fuzhou 350002, China
- Department of Biological Sciences, Brock University, St. Catharines, Ontario, Canada
| | - Simon W. Baxter
- Institute of Applied Ecology, Fujian Agriculture and Forestry University, Fuzhou 350002, China
- School of Biological Sciences, The University of Adelaide, Adelaide, South Australia, Australia
| | - Guang Yang
- Institute of Applied Ecology, Fujian Agriculture and Forestry University, Fuzhou 350002, China
- Key Laboratory of Integrated Pest Management for Fujian-Taiwan Crops, Ministry of Agriculture, Fuzhou 350002, China
| | - Shiguo Huang
- Institute of Applied Ecology, Fujian Agriculture and Forestry University, Fuzhou 350002, China
- Key Laboratory of Integrated Pest Management for Fujian-Taiwan Crops, Ministry of Agriculture, Fuzhou 350002, China
| | - Fengqin Song
- Institute of Applied Ecology, Fujian Agriculture and Forestry University, Fuzhou 350002, China
- Faculty of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou 350002, China
- Key Laboratory of Integrated Pest Management for Fujian-Taiwan Crops, Ministry of Agriculture, Fuzhou 350002, China
| | - Minsheng You
- Institute of Applied Ecology, Fujian Agriculture and Forestry University, Fuzhou 350002, China
- Key Laboratory of Integrated Pest Management for Fujian-Taiwan Crops, Ministry of Agriculture, Fuzhou 350002, China
| |
Collapse
|
153
|
Welch JD, Baran-Gale J, Perou CM, Sethupathy P, Prins JF. Pseudogenes transcribed in breast invasive carcinoma show subtype-specific expression and ceRNA potential. BMC Genomics 2015; 16:113. [PMID: 25765044 PMCID: PMC4344757 DOI: 10.1186/s12864-015-1227-8] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2014] [Accepted: 01/08/2015] [Indexed: 12/14/2022] Open
Abstract
Background Recent studies have shown that some pseudogenes are transcribed and contribute to cancer when dysregulated. In particular, pseudogene transcripts can function as competing endogenous RNAs (ceRNAs). The high similarity of gene and pseudogene nucleotide sequence has hindered experimental investigation of these mechanisms using RNA-seq. Furthermore, previous studies of pseudogenes in breast cancer have not integrated miRNA expression data in order to perform large-scale analysis of ceRNA potential. Thus, knowledge of both pseudogene ceRNA function and the role of pseudogene expression in cancer are restricted to isolated examples. Results To investigate whether transcribed pseudogenes play a pervasive regulatory role in cancer, we developed a novel bioinformatic method for measuring pseudogene transcription from RNA-seq data. We applied this method to 819 breast cancer samples from The Cancer Genome Atlas (TCGA) project. We then clustered the samples using pseudogene expression levels and integrated sample-paired pseudogene, gene and miRNA expression data with miRNA target prediction to determine whether more pseudogenes have ceRNA potential than expected by chance. Conclusions Our analysis identifies with high confidence a set of 440 pseudogenes that are transcribed in breast cancer tissue. Of this set, 309 pseudogenes exhibit significant differential expression among breast cancer subtypes. Hierarchical clustering using only pseudogene expression levels accurately separates tumor samples from normal samples and discriminates the Basal subtype from the Luminal and Her2 subtypes. Correlation analysis shows more positively correlated pseudogene-parent gene pairs and negatively correlated pseudogene-miRNA pairs than expected by chance. Furthermore, 177 transcribed pseudogenes possess binding sites for co-expressed miRNAs that are also predicted to target their parent genes. Taken together, these results increase the catalog of putative pseudogene ceRNAs and suggest that pseudogene transcription in breast cancer may play a larger role than previously appreciated. Electronic supplementary material The online version of this article (doi:10.1186/s12864-015-1227-8) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Joshua D Welch
- Curriculum in Bioinformatics and Computational Biology, The University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA. .,Department of Computer Science, The University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA.
| | - Jeanette Baran-Gale
- Curriculum in Bioinformatics and Computational Biology, The University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA. .,Department of Genetics, The University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA.
| | - Charles M Perou
- Curriculum in Bioinformatics and Computational Biology, The University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA. .,Department of Genetics, The University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA. .,Lineberger Comprehensive Cancer Center, The University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA.
| | - Praveen Sethupathy
- Curriculum in Bioinformatics and Computational Biology, The University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA. .,Department of Genetics, The University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA. .,Lineberger Comprehensive Cancer Center, The University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA.
| | - Jan F Prins
- Curriculum in Bioinformatics and Computational Biology, The University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA. .,Department of Computer Science, The University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA.
| |
Collapse
|
154
|
|
155
|
Tew SR, McDermott BT, Fentem RB, Peffers MJ, Clegg PD. Transcriptome-wide analysis of messenger RNA decay in normal and osteoarthritic human articular chondrocytes. Arthritis Rheumatol 2015; 66:3052-61. [PMID: 25155964 PMCID: PMC4233952 DOI: 10.1002/art.38849] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2013] [Accepted: 08/12/2014] [Indexed: 01/03/2023]
Abstract
Objective Messenger RNA (mRNA) decay rates control not only gene expression levels, but also responsiveness to altered transcriptional input. We undertook this study to examine transcriptome-wide posttranscriptional regulation in both normal and osteoarthritic (OA) human articular chondrocytes. Methods Human articular chondrocytes were isolated from normal or OA tissue. Equine articular chondrocytes were isolated from young or old horses at a commercial abattoir. RNA decay was measured across the transcriptome in human cells by microarray analysis following an actinomycin D chase. Messenger RNA levels in samples were confirmed using quantitative reverse transcription–polymerase chain reaction. Results Examination of total mRNA expression levels demonstrated significant differences in the expression of transcripts between normal and OA chondrocytes. Interestingly, almost no difference was observed in total mRNA expression between chondrocytes from intact OA cartilage and those from fibrillated OA cartilage. Decay analysis revealed a set of rapidly turned over transcripts associated with transcriptional control and programmed cell death that were common to all chondrocytes and contained binding sites for abundant cartilage microRNAs. Many transcripts exhibited altered mRNA half-lives in human OA chondrocytes compared to normal cells. Specific transcripts whose decay rates were altered were generally less stable in these pathologic cells. Examination of selected genes in chondrocytes from young and old healthy horses did not identify any change in mRNA turnover. Conclusion This is the first investigation into the “posttranscriptome” of the chondrocyte. It identifies a set of short-lived chondrocyte mRNAs likely to be highly responsive to altered transcriptional input as well as mRNAs whose decay rates are affected in OA chondrocytes.
Collapse
Affiliation(s)
- Simon R Tew
- University of Liverpool, Leahurst Campus, Neston, Cheshire, UK
| | | | | | | | | |
Collapse
|
156
|
Milligan MJ, Lipovich L. Pseudogene-derived lncRNAs: emerging regulators of gene expression. Front Genet 2015; 5:476. [PMID: 25699073 PMCID: PMC4316772 DOI: 10.3389/fgene.2014.00476] [Citation(s) in RCA: 83] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2014] [Accepted: 12/25/2014] [Indexed: 01/11/2023] Open
Abstract
In the more than one decade since the completion of the Human Genome Project, the prevalence of non-protein-coding functional elements in the human genome has emerged as a key revelation in post-genomic biology. Highlighted by the ENCODE (Encyclopedia of DNA Elements) and FANTOM (Functional Annotation of Mammals) consortia, these elements include tens of thousands of pseudogenes, as well as comparably numerous long non-coding RNA (lncRNA) genes. Pseudogene transcription and function remain insufficiently understood. However, the field is of great importance for human disease due to the high sequence similarity between pseudogenes and their parental protein-coding genes, which generates the potential for sequence-specific regulation. Recent case studies have established essential and coordinated roles of both pseudogenes and lncRNAs in development and disease in metazoan systems, including functional impacts of lncRNA transcription at pseudogene loci on the regulation of the pseudogenes’ parental genes. This review synthesizes the nascent evidence for regulatory modalities jointly exerted by lncRNAs and pseudogenes in human disease, and for recent evolutionary origins of these systems.
Collapse
Affiliation(s)
- Michael J Milligan
- Center for Molecular Medicine and Genetics, Wayne State University School of Medicine , Detroit, MI, USA
| | - Leonard Lipovich
- Center for Molecular Medicine and Genetics, Wayne State University School of Medicine , Detroit, MI, USA
| |
Collapse
|
157
|
Zheng W, Sai W, Yao M, Gu H, Yao Y, Qian Q, Yao D. Silencing clusterin gene transcription on effects of multidrug resistance reversing of human hepatoma HepG2/ADM cells. Tumour Biol 2015. [PMID: 25600802 DOI: 10.1007/s13277-015] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Abnormal clusterin (CLU) expression is associated with multidrug resistance (MDR) of hepatocellular carcinoma (HCC). In the present study, the CLU expression was analyzed in human hepatoma cells and chemoresistant counterpart HepG2/ADM cells. Compared with L02 cells, the overexpression of cellular CLU was identified in HepG2, HepG2/ADM, SMMC7721, Hep3B ,and PLC cells and relatively lower expression in Bel-7404, SNU-739, and MHCC97H cells. Specific short hairpin RNAs (shRNAs) to silence CLU gene transcription were designed, and the most effective sequences were screened. After the HepG2/ADM cells transfected with shRNA-1, the inhibition of CLU expression was 73.68 % at messenger RNA (mRNA) level by real-time quantitative RT-PCR with obvious enhancement in cell chemosensitivity, increasing apoptosis induced by doxorubicin using fluorescence kit, and Rh-123 retention qualified with flow cytometry. Knockdown CLU also significantly decreased the drug efflux pump activity through the depression of MDR1/P-glycoprotein (q = 11.739, P < 0.001). Moreover, silencing CLU led to downregulation of β-catenin (q = 13.544, P = 0.001), suggesting that downregulation of CLU might be a key point to reverse multidrug resistance of HepG2/ADM cells.
Collapse
Affiliation(s)
- Wenjie Zheng
- Research Center of Clinical Medicine, Affiliated Hospital of Nantong University, 20 West Temple Road, Nantong, 226001, Jiangsu Province, China
| | | | | | | | | | | | | |
Collapse
|
158
|
Guo Y, Zhao S, Bjoring M, Han L. Advanced Datamining Using RNAseq Data. BIG DATA ANALYTICS IN BIOINFORMATICS AND HEALTHCARE 2015. [DOI: 10.4018/978-1-4666-6611-5.ch001] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
In recent years, RNA sequencing (RNAseq) technology has experienced a rapid rise in popularity. Often seen as a competitor of and the ultimate successor to microarray technology given its more accurate and quantitative gene expression measurement, RNAseq also offers a wealth of additional information that is often overlooked, and given the massive accumulation of RNAseq data available in public data repositories over the past few years, these data are ripe for discovery. Abundant opportunities exist for researchers to conduct in-depth, non-traditional analyses that take advantage of these secondary uses and for bioinformaticians to develop tools to make these data more accessible. This is discussed in this chapter.
Collapse
|
159
|
Grandér D, Johnsson P. Pseudogene-Expressed RNAs: Emerging Roles in Gene Regulation and Disease. Curr Top Microbiol Immunol 2015; 394:111-26. [DOI: 10.1007/82_2015_442] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022]
|
160
|
Zhu L, Hu Z, Liu J, Gao J, Lin B. Gene expression profile analysis identifies metastasis and chemoresistance-associated genes in epithelial ovarian carcinoma cells. Med Oncol 2014; 32:426. [PMID: 25502083 PMCID: PMC4262766 DOI: 10.1007/s12032-014-0426-5] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2014] [Accepted: 12/02/2014] [Indexed: 12/12/2022]
Abstract
The purpose of this study was to identify genes that associated with higher ability of metastasis and chemotherapic resistance in epithelial ovarian carcinoma (EOC) cells. An oligonucleotide microarray with probe sets complementary to 41,000+ unique human genes and transcripts was used to determine whether gene expression profile may differentiate three epithelial ovarian cell lines (RMG-I-C, COC1 and HO8910) from their sub-lines (RMG-I-H, COCI/DDP and HO8910/PM) with higher ability of metastasis and chemotherapic resistance. Quantitative real-time PCR and immunohistochemical staining validated the microarray results. Hierarchic cluster analysis of gene expression identified 49 genes that exhibited ≥2.0-fold change and P value ≤0.05. Highly differential expression of GCET2, NLRP4, FOXP1 and SNX29 genes was validated by quantitative PCR in all cell line samples. Finally, FOXP1 was validated at the protein level by immunohistochemistry in paraffin embedded ovarian tissues (i.e., for metastasis, 15 primary EOC and 10 omental metastasis [OM]; for chemoresistance, 13 sensitive and 13 resistant EOC). The identification of higher ability of metastasis and chemotherapic resistance-associated genes may provide a foundation for the development of new type-specific diagnostic strategies and treatment for metastasis and chemotherapic resistance in epithelial ovarian cancer.
Collapse
Affiliation(s)
- Liancheng Zhu
- Department of Obstetrics and Gynecology, Shengjing Hospital Affiliated to China Medical University, Shenyang, 110004, Liaoning Province, China,
| | | | | | | | | |
Collapse
|
161
|
Watanabe T, Cheng EC, Zhong M, Lin H. Retrotransposons and pseudogenes regulate mRNAs and lncRNAs via the piRNA pathway in the germline. Genome Res 2014; 25:368-80. [PMID: 25480952 PMCID: PMC4352877 DOI: 10.1101/gr.180802.114] [Citation(s) in RCA: 175] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
The eukaryotic genome has vast intergenic regions containing transposons, pseudogenes, and other repetitive sequences. They produce numerous long noncoding RNAs (lncRNAs) and Piwi-interacting RNAs (piRNAs), yet the functions of the vast intergenic regions remain largely unknown. Mammalian piRNAs are abundantly expressed from the spermatocyte to round spermatid stage, coinciding with the widespread expression of lncRNAs in these cells. Here, we show that piRNAs derived from transposons and pseudogenes mediate the degradation of a large number of mRNAs and lncRNAs in mouse late spermatocytes. In particular, they have a large impact on the lncRNA transcriptome, as a quarter of lncRNAs expressed in late spermatocytes are up-regulated in mice deficient in the piRNA pathway. Furthermore, our genomic and in vivo functional analyses reveal that retrotransposon sequences in the 3′ UTR of mRNAs are targeted by piRNAs for degradation. Similarly, the degradation of spermatogenic cell-specific lncRNAs by piRNAs is mediated by retrotransposon sequences. Moreover, we show that pseudogenes regulate mRNA stability via the piRNA pathway. The degradation of mRNAs and lncRNAs by piRNAs requires PIWIL1 (also known as MIWI) and, at least in part, depends on its slicer activity. Together, these findings reveal the presence of a highly complex and global RNA regulatory network mediated by piRNAs with retrotransposons and pseudogenes as regulatory sequences.
Collapse
Affiliation(s)
- Toshiaki Watanabe
- Yale Stem Cell Center and Department of Cell Biology, Yale University School of Medicine, New Haven, Connecticut 06519, USA
| | - Ee-chun Cheng
- Yale Stem Cell Center and Department of Cell Biology, Yale University School of Medicine, New Haven, Connecticut 06519, USA
| | - Mei Zhong
- Yale Stem Cell Center and Department of Cell Biology, Yale University School of Medicine, New Haven, Connecticut 06519, USA
| | - Haifan Lin
- Yale Stem Cell Center and Department of Cell Biology, Yale University School of Medicine, New Haven, Connecticut 06519, USA
| |
Collapse
|
162
|
Radenbaugh AJ, Ma S, Ewing A, Stuart JM, Collisson EA, Zhu J, Haussler D. RADIA: RNA and DNA integrated analysis for somatic mutation detection. PLoS One 2014; 9:e111516. [PMID: 25405470 PMCID: PMC4236012 DOI: 10.1371/journal.pone.0111516] [Citation(s) in RCA: 63] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2014] [Accepted: 09/30/2014] [Indexed: 01/30/2023] Open
Abstract
The detection of somatic single nucleotide variants is a crucial component to the characterization of the cancer genome. Mutation calling algorithms thus far have focused on comparing the normal and tumor genomes from the same individual. In recent years, it has become routine for projects like The Cancer Genome Atlas (TCGA) to also sequence the tumor RNA. Here we present RADIA (RNA and DNA Integrated Analysis), a novel computational method combining the patient-matched normal and tumor DNA with the tumor RNA to detect somatic mutations. The inclusion of the RNA increases the power to detect somatic mutations, especially at low DNA allelic frequencies. By integrating an individual's DNA and RNA, we are able to detect mutations that would otherwise be missed by traditional algorithms that examine only the DNA. We demonstrate high sensitivity (84%) and very high precision (98% and 99%) for RADIA in patient data from endometrial carcinoma and lung adenocarcinoma from TCGA. Mutations with both high DNA and RNA read support have the highest validation rate of over 99%. We also introduce a simulation package that spikes in artificial mutations to patient data, rather than simulating sequencing data from a reference genome. We evaluate sensitivity on the simulation data and demonstrate our ability to rescue back mutations at low DNA allelic frequencies by including the RNA. Finally, we highlight mutations in important cancer genes that were rescued due to the incorporation of the RNA.
Collapse
Affiliation(s)
- Amie J. Radenbaugh
- University of California Santa Cruz Genomics Institute, Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, California, United States of America
| | - Singer Ma
- University of California Santa Cruz Genomics Institute, Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, California, United States of America
| | - Adam Ewing
- University of California Santa Cruz Genomics Institute, Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, California, United States of America
| | - Joshua M. Stuart
- University of California Santa Cruz Genomics Institute, Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, California, United States of America
| | - Eric A. Collisson
- Division of Hematology/Oncology, University of California San Francisco, San Francisco, California, United States of America
| | - Jingchun Zhu
- University of California Santa Cruz Genomics Institute, Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, California, United States of America
| | - David Haussler
- University of California Santa Cruz Genomics Institute, Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, California, United States of America
- Howard Hughes Medical Institute, Chevy Chase, Maryland, United States of America
| |
Collapse
|
163
|
Prensner JR, Zhao S, Erho N, Schipper M, Iyer MK, Dhanasekaran SM, Magi-Galluzzi C, Mehra R, Sahu A, Siddiqui J, Davicioni E, Den RB, Dicker AP, Karnes RJ, Wei JT, Klein EA, Jenkins RB, Chinnaiyan AM, Feng FY. RNA biomarkers associated with metastatic progression in prostate cancer: a multi-institutional high-throughput analysis of SChLAP1. Lancet Oncol 2014; 15:1469-1480. [PMID: 25456366 DOI: 10.1016/s1470-2045(14)71113-1] [Citation(s) in RCA: 169] [Impact Index Per Article: 16.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
BACKGROUND Improved clinical predictors for disease progression are needed for localised prostate cancer, since only a subset of patients develop recurrent or refractory disease after first-line treatment. Therefore, we undertook an unbiased analysis to identify RNA biomarkers associated with metastatic progression after prostatectomy. METHODS Prostate cancer samples from patients treated with radical prostatectomy at three academic institutions were analysed for gene expression by a high-density Affymetrix GeneChip platform, encompassing more than 1 million genomic loci. In a discovery cohort, all protein-coding genes and known long non-coding RNAs were ranked by fold change in expression between tumours that subsequently metastasised versus those that did not. The top ranked gene was then validated for its prognostic value for metastatic progression in three additional independent cohorts. 95% of the gene expression assays were done in a Clinical Laboratory Improvements Amendments certified laboratory facility. All genes were assessed for their ability to predict metastatic progression by receiver-operating-curve area-under-the-curve analyses. Multivariate analyses were done for the primary endpoint of metastatic progression, with variables including Gleason score, preoperative prostate-specific antigen concentration, seminal vesicle invasion, surgical margin status, extracapsular extension, lymph node invasion, and expression of the highest ranked gene. FINDINGS 1008 patients were included in the study: 545 in the discovery cohort and 463 in the validation cohorts. The long non-coding RNA SChLAP1 was identified as the highest-ranked overexpressed gene in cancers with metastatic progression. Validation in three independent cohorts confirmed the prognostic value of SChLAP1 for metastatic progression. On multivariate modelling, SChLAP1 expression (high vs low) independently predicted metastasis within 10 years (odds ratio [OR] 2·45, 95% CI 1·70-3·53; p<0·0001). The only other variable that independently predicted metastasis within 10 years was Gleason score (8-10 vs 5-7; OR 2·14, 95% CI 1·77-2·58; p<0·0001). INTERPRETATION We identified and validated high SChLAP1 expression as significantly prognostic for metastatic disease progression of prostate cancer. Our findings suggest that further development of SChLAP1 as a potential biomarker, for treatment intensification in aggressive prostate cancer, warrants future study. FUNDING Prostate Cancer Foundation, National Institutes of Health, Department of Defense, Early Detection Research Network, Doris Duke Charitable Foundation, and Howard Hughes Medical Institute.
Collapse
Affiliation(s)
- John R Prensner
- Michigan Center for Translational Pathology, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Shuang Zhao
- Department of Radiation Oncology, University of Michigan Medical School, Ann Arbor, MI, USA
| | | | - Matthew Schipper
- Department of Radiation Oncology, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Matthew K Iyer
- Michigan Center for Translational Pathology, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Saravana M Dhanasekaran
- Michigan Center for Translational Pathology, University of Michigan Medical School, Ann Arbor, MI, USA; Department of Pathology, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Cristina Magi-Galluzzi
- Anatomic Pathology, Robert J Tomsich Pathology and Laboratory Medicine Institute, Cleveland Clinic, Cleveland, OH, USA
| | - Rohit Mehra
- Michigan Center for Translational Pathology, University of Michigan Medical School, Ann Arbor, MI, USA; Department of Pathology, University of Michigan Medical School, Ann Arbor, MI, USA; Comprehensive Cancer Center, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Anirban Sahu
- Michigan Center for Translational Pathology, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Javed Siddiqui
- Michigan Center for Translational Pathology, University of Michigan Medical School, Ann Arbor, MI, USA; Department of Pathology, University of Michigan Medical School, Ann Arbor, MI, USA
| | | | - Robert B Den
- Department of Radiation Oncology, Thomas Jefferson University, Philadelphia, PA, USA
| | - Adam P Dicker
- Department of Radiation Oncology, Thomas Jefferson University, Philadelphia, PA, USA
| | | | - John T Wei
- Department of Urology, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Eric A Klein
- Glickman Urological and Kidney Institute, Cleveland Clinic, Cleveland, OH, USA
| | - Robert B Jenkins
- Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN, USA
| | - Arul M Chinnaiyan
- Michigan Center for Translational Pathology, University of Michigan Medical School, Ann Arbor, MI, USA; Department of Pathology, University of Michigan Medical School, Ann Arbor, MI, USA; Department of Urology, University of Michigan Medical School, Ann Arbor, MI, USA; Comprehensive Cancer Center, University of Michigan Medical School, Ann Arbor, MI, USA; Howard Hughes Medical Institute, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Felix Y Feng
- Michigan Center for Translational Pathology, University of Michigan Medical School, Ann Arbor, MI, USA; Department of Radiation Oncology, University of Michigan Medical School, Ann Arbor, MI, USA; Comprehensive Cancer Center, University of Michigan Medical School, Ann Arbor, MI, USA.
| |
Collapse
|
164
|
Xiao-Jie L, Ai-Mei G, Li-Juan J, Jiang X. Pseudogene in cancer: real functions and promising signature. J Med Genet 2014; 52:17-24. [DOI: 10.1136/jmedgenet-2014-102785] [Citation(s) in RCA: 81] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
|
165
|
Siggens L, Ekwall K. Epigenetics, chromatin and genome organization: recent advances from the ENCODE project. J Intern Med 2014; 276:201-14. [PMID: 24605849 DOI: 10.1111/joim.12231] [Citation(s) in RCA: 76] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
Abstract
The organization of the genome into functional units, such as enhancers and active or repressed promoters, is associated with distinct patterns of DNA and histone modifications. The Encyclopedia of DNA Elements (ENCODE) project has advanced our understanding of the principles of genome, epigenome and chromatin organization, identifying hundreds of thousands of potential regulatory regions and transcription factor binding sites. Part of the ENCODE consortium, GENCODE, has annotated the human genome with novel transcripts including new noncoding RNAs and pseudogenes, highlighting transcriptional complexity. Many disease variants identified in genome-wide association studies are located within putative enhancer regions defined by the ENCODE project. Understanding the principles of chromatin and epigenome organization will help to identify new disease mechanisms, biomarkers and drug targets, particularly as ongoing epigenome mapping projects generate data for primary human cell types that play important roles in disease.
Collapse
Affiliation(s)
- L Siggens
- Department of Biosciences and Nutrition, Karolinska Institutet, Huddinge, Sweden
| | | |
Collapse
|
166
|
Vitiello M, Tuccoli A, Poliseno L. Long non-coding RNAs in cancer: implications for personalized therapy. Cell Oncol (Dordr) 2014; 38:17-28. [PMID: 25113790 DOI: 10.1007/s13402-014-0180-x] [Citation(s) in RCA: 79] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/07/2014] [Indexed: 02/06/2023] Open
Abstract
Long non-coding RNAs (lncRNAs, pseudogenes and circRNAs) have recently come into light as powerful players in cancer pathogenesis and it is becoming increasingly clear that they have the potential of greatly contributing to the spread and success of personalized cancer medicine. In this concise review, we briefly introduce these three classes of long non-coding RNAs. We then discuss their applications as diagnostic and prognostic biomarkers. Finally, we describe their appeal as targets and as drugs, while pointing out the limitations that still lie ahead of their definitive entry into clinical practice.
Collapse
Affiliation(s)
- Marianna Vitiello
- Oncogenomics Unit, Core Research Laboratory, Istituto Toscano Tumori c/o IFC-CNR, via Moruzzi 1, 56124, Pisa, Italy
| | | | | |
Collapse
|
167
|
Rijlaarsdam MA, Looijenga LHJ. An oncofetal and developmental perspective on testicular germ cell cancer. Semin Cancer Biol 2014; 29:59-74. [PMID: 25066859 DOI: 10.1016/j.semcancer.2014.07.003] [Citation(s) in RCA: 41] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2014] [Accepted: 07/17/2014] [Indexed: 12/19/2022]
Abstract
Germ cell tumors (GCTs) represent a diverse group of tumors presumably originating from (early fetal) developing germ cells. Most frequent are the testicular germ cell cancers (TGCC). Overall, TGCC is the most frequent malignancy in Caucasian males (20-40 years) and remains an important cause of (treatment related) mortality in these young men. The strong association between the phenotype of TGCC stem cell components and their totipotent ancestor (fetal primordial germ cell or gonocyte) makes these tumors highly relevant from an onco-fetal point of view. This review subsequently discusses the evidence for the early embryonic origin of TGCCs, followed by an overview of the crucial association between TGCC pathogenesis, genetics, environmental exposure and the (fetal) testicular micro-environment (genvironment). This culminates in an evaluation of three genvironmentally modulated hallmarks of TGCC directly related to the oncofetal pathogenesis of TGCC: (1) maintenance of pluripotency, (2) cell cycle control/cisplatin sensitivity and (3) regulation of proliferation/migration/apoptosis by KIT-KITL mediated receptor tyrosine kinase signaling. Briefly, TGCC exhibit identifiable stem cell components (seminoma and embryonal carcinoma) and progenitors that show large and consistent similarities to primordial/embryonic germ cells, their presumed totipotent cells of origin. TGCC pathogenesis depends crucially on a complex interaction of genetic and (micro-)environmental, i.e. genvironmental risk factors that have only been partly elucidated despite significant effort. TGCC stem cell components also show a high degree of similarity with embryonic stem/germ cells (ES) in the regulation of pluripotency and cell cycle control, directly related to their exquisite sensitivity to DNA damaging agents (e.g. cisplatin). Of note, (ES specific) micro-RNAs play a pivotal role in the crossover between cell cycle control, pluripotency and chemosensitivity. Moreover, multiple consistent observations reported TGCC to be associated with KIT-KITL mediated receptor tyrosine kinase signaling, a pathway crucially implicated in proliferation, migration and survival during embryogenesis including germ cell development. In conclusion, TGCCs are a fascinating model for onco-fetal developmental processes especially with regard to studying cell cycle control, pluripotency maintenance and KIT-KITL signaling. The knowledge presented here contributes to better understanding of the molecular characteristics of TGCC pathogenesis, translating to identification of at risk individuals and enhanced quality of care for TGCC patients (diagnosis, treatment and follow-up).
Collapse
Affiliation(s)
- Martin A Rijlaarsdam
- Department of Pathology, Erasmus MC - University Medical Center Rotterdam, Rotterdam, The Netherlands
| | - Leendert H J Looijenga
- Department of Pathology, Erasmus MC - University Medical Center Rotterdam, Rotterdam, The Netherlands.
| |
Collapse
|
168
|
Han L, Yuan Y, Zheng S, Yang Y, Li J, Edgerton ME, Diao L, Xu Y, Verhaak RGW, Liang H. The Pan-Cancer analysis of pseudogene expression reveals biologically and clinically relevant tumour subtypes. Nat Commun 2014; 5:3963. [PMID: 24999802 PMCID: PMC4339277 DOI: 10.1038/ncomms4963] [Citation(s) in RCA: 109] [Impact Index Per Article: 10.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2014] [Accepted: 06/13/2014] [Indexed: 12/30/2022] Open
Abstract
Although individual pseudogenes have been implicated in tumor biology, the biomedical significance and clinical relevance of pseudogene expression have not been assessed in a systematic way. Here we generate pseudogene expression profiles in 2,808 patient samples of seven cancer types from The Cancer Genome Atlas RNA-seq data using a newly developed computational pipeline. Supervised analysis reveals a significant number of pseudogenes differentially expressed among established tumor subtypes; and pseudogene expression alone can accurately classify the major histological subtypes of endometrial cancer. Across cancer types, the tumor subtypes revealed by pseudogene expression show extensive and strong concordance with the subtypes defined by other molecular data. Strikingly, in kidney cancer, the pseudogene-expression subtypes not only significantly correlate with patient survival, but also help stratify patients in combination with clinical variables. Our study highlights the potential of pseudogene expression analysis as a new paradigm for investigating cancer mechanisms and discovering prognostic biomarkers.
Collapse
Affiliation(s)
- Leng Han
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, 1400 Pressler Street, Houston, TX 77030, USA
| | - Yuan Yuan
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, 1400 Pressler Street, Houston, TX 77030, USA.,Graduate Program in Structural and Computational Biology and Molecular Biophysics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Siyuan Zheng
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, 1400 Pressler Street, Houston, TX 77030, USA
| | - Yang Yang
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, 1400 Pressler Street, Houston, TX 77030, USA.,Division of Biostatistics, The University of Texas Health Science Center at Houston, School of Public Health, Houston, TX 77030, USA
| | - Jun Li
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, 1400 Pressler Street, Houston, TX 77030, USA
| | - Mary E Edgerton
- Department of Pathology, The University of Texas MD Anderson Cancer Center, 1515 Holcombe Blvd, Houston, TX 77030, USA
| | - Lixia Diao
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, 1400 Pressler Street, Houston, TX 77030, USA
| | - Yanxun Xu
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, 1400 Pressler Street, Houston, TX 77030, USA
| | - Roeland G W Verhaak
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, 1400 Pressler Street, Houston, TX 77030, USA
| | - Han Liang
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, 1400 Pressler Street, Houston, TX 77030, USA.,Graduate Program in Structural and Computational Biology and Molecular Biophysics, Baylor College of Medicine, Houston, TX 77030, USA
| |
Collapse
|
169
|
Roberts TC, Morris KV. Not so pseudo anymore: pseudogenes as therapeutic targets. Pharmacogenomics 2014; 14:2023-34. [PMID: 24279857 DOI: 10.2217/pgs.13.172] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
Pseudogenes are junk DNA gene remnants generated by inactivating mutations or the loss of regulatory sequences, often following gene duplication or retrotransposition events. These pseudogenes have previously been considered to be molecular fossils derived from once-coding genes. In many cases, pseudogenes confer no observable selective advantage to the host organism and may be on a path towards removal from the genome. However, pseudogenes can also serve as raw material for the exaptation of novel functions, particularly in relation to the regulation of gene expression. Many pseudogenes are resurrected as noncoding RNA genes, which function in RNA-based gene regulatory circuits. As such, functional pseudogenes might simply be considered as 'genes'. Here, we discuss the role of these pseudogene-derived RNAs as regulators of gene expression in the context of human disease. In particular, we consider the manipulation of pseudogene transcripts through the use of antisense oligonucleotides, siRNAs, aptamers or classical gene therapy approaches as novel pharmacological strategies.
Collapse
Affiliation(s)
- Thomas C Roberts
- Department of Molecular & Experimental Medicine, The Scripps Research Institute, 10550 N Torrey Pines Road, La Jolla, CA 92037, USA
| | | |
Collapse
|
170
|
Abstract
The availability of human genome sequence has transformed biomedical research over the past decade. However, an equivalent map for the human proteome with direct measurements of proteins and peptides does not exist yet. Here, we present a draft map of the human proteome using high resolution Fourier transform mass spectrometry. In-depth proteomic profiling of 30 histologically normal human samples including 17 adult tissues, 7 fetal tissues and 6 purified primary hematopoietic cells resulted in identification of proteins encoded by 17,294 genes accounting for ~84% of the total annotated protein-coding genes in humans. A unique and comprehensive strategy for proteogenomic analysis enabled us to discover a number of novel protein-coding regions, which includes translated pseudogenes, non-coding RNAs and upstream ORFs. This large human proteome catalog (available as an interactive web-based resource at http://www.humanproteomemap.org) will complement available human genome and transcriptome data to accelerate biomedical research in health and disease.
Collapse
|
171
|
Ebert G, Steininger A, Weißmann R, Boldt V, Lind-Thomsen A, Grune J, Badelt S, Heßler M, Peiser M, Hitzler M, Jensen LR, Müller I, Hu H, Arndt PF, Kuss AW, Tebel K, Ullmann R. Distribution of segmental duplications in the context of higher order chromatin organisation of human chromosome 7. BMC Genomics 2014; 15:537. [PMID: 24973960 PMCID: PMC4092221 DOI: 10.1186/1471-2164-15-537] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2013] [Accepted: 06/17/2014] [Indexed: 12/21/2022] Open
Abstract
BACKGROUND Segmental duplications (SDs) are not evenly distributed along chromosomes. The reasons for this biased susceptibility to SD insertion are poorly understood. Accumulation of SDs is associated with increased genomic instability, which can lead to structural variants and genomic disorders such as the Williams-Beuren syndrome. Despite these adverse effects, SDs have become fixed in the human genome. Focusing on chromosome 7, which is particularly rich in interstitial SDs, we have investigated the distribution of SDs in the context of evolution and the three dimensional organisation of the chromosome in order to gain insights into the mutual relationship of SDs and chromatin topology. RESULTS Intrachromosomal SDs preferentially accumulate in those segments of chromosome 7 that are homologous to marmoset chromosome 2. Although this formerly compact segment has been re-distributed to three different sites during primate evolution, we can show by means of public data on long distance chromatin interactions that these three intervals, and consequently the paralogous SDs mapping to them, have retained their spatial proximity in the nucleus. Focusing on SD clusters implicated in the aetiology of the Williams-Beuren syndrome locus we demonstrate by cross-species comparison that these SDs have inserted at the borders of a topological domain and that they flank regions with distinct DNA conformation. CONCLUSIONS Our study suggests a link of nuclear architecture and the propagation of SDs across chromosome 7, either by promoting regional SD insertion or by contributing to the establishment of higher order chromatin organisation themselves. The latter could compensate for the high risk of structural rearrangements and thus may have contributed to their evolutionary fixation in the human genome.
Collapse
Affiliation(s)
- Grit Ebert
- />Max Planck Institute for Molecular Genetics, Ihnestraße 63-73, 14195 Berlin, Germany
- />Department of Biology, Chemistry and Pharmacy, Free University Berlin, 14195 Berlin, Germany
| | - Anne Steininger
- />Max Planck Institute for Molecular Genetics, Ihnestraße 63-73, 14195 Berlin, Germany
- />Department of Biology, Chemistry and Pharmacy, Free University Berlin, 14195 Berlin, Germany
| | - Robert Weißmann
- />Department of Human Genetics, University Medicine Greifswald, and Interfaculty Institute of Genetics and Functional Genomics, University of Greifswald, Fleischmannstraße 42-44, 17475 Greifswald, Germany
| | - Vivien Boldt
- />Max Planck Institute for Molecular Genetics, Ihnestraße 63-73, 14195 Berlin, Germany
- />Department of Biology, Chemistry and Pharmacy, Free University Berlin, 14195 Berlin, Germany
| | - Allan Lind-Thomsen
- />Wilhelm Johannsen Centre for Functional Genome Research, Department of Cellular and Molecular Medicine, University of Copenhagen, Blegdamsvej 3, DK-2200 Copenhagen, Denmark
| | - Jana Grune
- />Max Planck Institute for Molecular Genetics, Ihnestraße 63-73, 14195 Berlin, Germany
| | - Stefan Badelt
- />Max Planck Institute for Molecular Genetics, Ihnestraße 63-73, 14195 Berlin, Germany
- />Institute for Theoretical Chemistry, University of Vienna, Waehringer Straße 17, A-1090 Vienna, Austria
| | - Melanie Heßler
- />Max Planck Institute for Molecular Genetics, Ihnestraße 63-73, 14195 Berlin, Germany
| | - Matthias Peiser
- />Unit Experimental Research, Department of Product Safety, Federal Institute for Bundeswehr Institute of Radiobiology affiliated, the University of Ulm, Neuherbergstraße 11, 80937 Munich, Germany
| | - Manuel Hitzler
- />Unit Experimental Research, Department of Product Safety, Federal Institute for Bundeswehr Institute of Radiobiology affiliated, the University of Ulm, Neuherbergstraße 11, 80937 Munich, Germany
| | - Lars R Jensen
- />Department of Human Genetics, University Medicine Greifswald, and Interfaculty Institute of Genetics and Functional Genomics, University of Greifswald, Fleischmannstraße 42-44, 17475 Greifswald, Germany
| | - Ines Müller
- />Max Planck Institute for Molecular Genetics, Ihnestraße 63-73, 14195 Berlin, Germany
| | - Hao Hu
- />Max Planck Institute for Molecular Genetics, Ihnestraße 63-73, 14195 Berlin, Germany
| | - Peter F Arndt
- />Max Planck Institute for Molecular Genetics, Ihnestraße 63-73, 14195 Berlin, Germany
| | - Andreas W Kuss
- />Department of Human Genetics, University Medicine Greifswald, and Interfaculty Institute of Genetics and Functional Genomics, University of Greifswald, Fleischmannstraße 42-44, 17475 Greifswald, Germany
| | - Katrin Tebel
- />Max Planck Institute for Molecular Genetics, Ihnestraße 63-73, 14195 Berlin, Germany
| | - Reinhard Ullmann
- />Max Planck Institute for Molecular Genetics, Ihnestraße 63-73, 14195 Berlin, Germany
| |
Collapse
|
172
|
Manda SS, Nirujogi RS, Pinto SM, Kim MS, Datta KK, Sirdeshmukh R, Prasad TSK, Thongboonkerd V, Pandey A, Gowda H. Identification and Characterization of Proteins Encoded by Chromosome 12 as Part of Chromosome-centric Human Proteome Project. J Proteome Res 2014; 13:3166-77. [DOI: 10.1021/pr401123v] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Affiliation(s)
- Srikanth Srinivas Manda
- Institute
of Bioinformatics, International Technology Park, Bangalore 560066, India
- Centre
of Excellence in Bioinformatics, Bioinformatics Centre, School of
Life Sciences, Pondicherry University, Puducherry 605014, India
| | - Raja Sekhar Nirujogi
- Institute
of Bioinformatics, International Technology Park, Bangalore 560066, India
- Centre
of Excellence in Bioinformatics, Bioinformatics Centre, School of
Life Sciences, Pondicherry University, Puducherry 605014, India
| | - Sneha Maria Pinto
- Institute
of Bioinformatics, International Technology Park, Bangalore 560066, India
- Manipal University, Madhav Nagar, Manipal 576104, India
| | | | - Keshava K. Datta
- Institute
of Bioinformatics, International Technology Park, Bangalore 560066, India
- School of
Biotechnology, KIIT University, Bhubaneswar, Odisha 751024, India
| | - Ravi Sirdeshmukh
- Institute
of Bioinformatics, International Technology Park, Bangalore 560066, India
| | - T. S. Keshava Prasad
- Institute
of Bioinformatics, International Technology Park, Bangalore 560066, India
| | - Visith Thongboonkerd
- Medical
Proteomics Unit, Office for Research and Development, Faculty of Medicine
Siriraj Hospital, and Center for Research in Complex Systems Science, Mahidol University, Bangkok 10700, Thailand
| | - Akhilesh Pandey
- Institute
of Bioinformatics, International Technology Park, Bangalore 560066, India
| | - Harsha Gowda
- Institute
of Bioinformatics, International Technology Park, Bangalore 560066, India
| |
Collapse
|
173
|
Sokol M, Wabl M, Ruiz IR, Pedersen FS. Novel principles of gamma-retroviral insertional transcription activation in murine leukemia virus-induced end-stage tumors. Retrovirology 2014; 11:36. [PMID: 24886479 PMCID: PMC4098794 DOI: 10.1186/1742-4690-11-36] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2013] [Accepted: 04/28/2014] [Indexed: 12/15/2022] Open
Abstract
BACKGROUND Insertional mutagenesis screens of retrovirus-induced mouse tumors have proven valuable in human cancer research and for understanding adverse effects of retroviral-based gene therapies. In previous studies, the assignment of mouse genes to individual retroviral integration sites has been based on close proximity and expression patterns of annotated genes at target positions in the genome. We here employed next-generation RNA sequencing to map retroviral-mouse chimeric junctions genome-wide, and to identify local patterns of transcription activation in T-lymphomas induced by the murine leukemia gamma-retrovirus SL3-3. Moreover, to determine epigenetic integration preferences underlying long-range gene activation by retroviruses, the colocalization propensity with common epigenetic enhancer markers (H3K4Me1 and H3K27Ac) of 6,117 integrations derived from end-stage tumors of more than 2,000 mice was examined. RESULTS We detected several novel mechanisms of retroviral insertional mutagenesis: bidirectional activation of mouse transcripts on opposite sides of a provirus including transcription of unannotated mouse sequence; sense/antisense-type activation of genes located on opposite DNA strands; tandem-type activation of distal genes that are positioned adjacently on the same DNA strand; activation of genes that are not the direct integration targets; combination-type insertional mutagenesis, in which enhancer activation, alternative chimeric splicing and retroviral promoter insertion are induced by a single retrovirus. We also show that irrespective of the distance to transcription start sites, the far majority of retroviruses in end-stage tumors colocalize with H3K4Me1 and H3K27Ac-enriched regions in murine lymphoid tissues. CONCLUSIONS We expose novel retrovirus-induced host transcription activation patterns that reach beyond a single and nearest annotated gene target. Awareness of this previously undescribed layer of complexity may prove important for elucidation of adverse effects in retroviral-based gene therapies. We also show that wild-type gamma-retroviruses are frequently positioned at enhancers, suggesting that integration into regulatory regions is specific and also subject to positive selection for sustaining long-range gene activation in end-stage tumors. Altogether, this study should prove useful for extrapolating adverse outcomes of retroviral vector therapies, and for understanding fundamental cellular regulatory principles and retroviral biology.
Collapse
Affiliation(s)
- Martin Sokol
- Department of Molecular Biology and Genetics, Aarhus University, DK-8000 Aarhus, Denmark
| | - Matthias Wabl
- Department of Microbiology and Immunology, University of California, San Francisco, CA 94143, USA
| | - Irene Rius Ruiz
- Department of Molecular Biology and Genetics, Aarhus University, DK-8000 Aarhus, Denmark
| | - Finn Skou Pedersen
- Department of Molecular Biology and Genetics, Aarhus University, DK-8000 Aarhus, Denmark
| |
Collapse
|
174
|
Ding W, Dai J. Methods for detecting transcribed pseudogenes: PCR on regions of high sequence similarity followed by cloning and sequencing. Methods Mol Biol 2014; 1167:103-15. [PMID: 24823774 DOI: 10.1007/978-1-4939-0835-6_8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/21/2023]
Abstract
Identifying pseudogene transcription is problematic in many cases due to the high sequence similarity between pseudogenes and their parental genes. In this chapter, we detail the procedure for the detection of pseudogene transcription using the reverse transcription polymerase chain reaction (RT-PCR) method. The protocol comprises (1) extraction of total RNA, (2) first-strand cDNA synthesis from total RNA, (3) amplification of the cDNA by PCR, and (4) cloning and sequencing of the PCR products. Technical and practical guidance is provided, and the critical points during each of the steps are discussed. In particular, the importance of designing high specific PCR primers and thoroughly eliminating genomic DNA contamination from RNA preparation is emphasized.
Collapse
Affiliation(s)
- Wenyong Ding
- Department of Biochemistry, Dalian Medical University, Dalian, China
| | | |
Collapse
|
175
|
Shi L, Lin YH, Sierant MC, Zhu F, Cui S, Guan Y, Sartor MA, Tanabe O, Lim KC, Engel JD. Developmental transcriptome analysis of human erythropoiesis. Hum Mol Genet 2014; 23:4528-42. [PMID: 24781209 DOI: 10.1093/hmg/ddu167] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
To globally survey the changes in transcriptional landscape during terminal erythroid differentiation, we performed RNA sequencing (RNA-seq) on primary human CD34(+) cells after ex vivo differentiation from the earliest into the most mature erythroid cell stages. This analysis identified thousands of novel intergenic and intronic transcripts as well as novel alternative transcript isoforms. After rigorous data filtering, 51 (presumptive) novel protein-coding transcripts, 5326 long and 679 small non-coding RNA candidates remained. The analysis also revealed two clear transcriptional trends during terminal erythroid differentiation: first, the complexity of transcript diversity was predominantly achieved by alternative splicing, and second, splicing junctional diversity diminished during erythroid differentiation. Finally, 404 genes that were not known previously to be differentially expressed in erythroid cells were annotated. Analysis of the most extremely differentially expressed transcripts revealed that these gene products were all closely associated with hematopoietic lineage differentiation. Taken together, this study will serve as a comprehensive platform for future in-depth investigation of human erythroid development that, in turn, may reveal new insights into multiple layers of the transcriptional regulatory hierarchy that controls erythropoiesis.
Collapse
Affiliation(s)
- Lihong Shi
- Department of Cell and Developmental Biology and
| | - Yu-Hsuan Lin
- Department of Cell and Developmental Biology and Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI 48109, USA
| | - M C Sierant
- Department of Cell and Developmental Biology and
| | - Fan Zhu
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI 48109, USA
| | | | - Yuanfang Guan
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI 48109, USA
| | - Maureen A Sartor
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI 48109, USA
| | - Osamu Tanabe
- Department of Cell and Developmental Biology and Department of Integrative Genomics, Tohoku Medical Megabank, Tohoku University, 2-1 Seiryo-machi, Sendai 980-8573, Japan
| | - Kim-Chew Lim
- Department of Cell and Developmental Biology and
| | | |
Collapse
|
176
|
Pan Y, Sun C, Huang M, Liu Y, Qi F, Liu L, Wen J, Liu J, Xie K, Ma H, Hu Z, Shen H. A genetic variant in pseudogene E2F3P1 contributes to prognosis of hepatocellular carcinoma. J Biomed Res 2014; 28:194-200. [PMID: 25013402 PMCID: PMC4085556 DOI: 10.7555/jbr.28.20140052] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2014] [Accepted: 03/31/2014] [Indexed: 01/14/2023] Open
Abstract
Certain pseudogenes may regulate their protein-coding cousins by competing for miRNAs and play an active biological role in cancer. However, few studies have focused on the association of genetic variations in pseudogenes with cancer prognosis. We selected six potentially functional single nucleotide polymorphisms (SNPs) in cancer-related pseudogenes, and performed a case-only study to assess the association between those SNPs and the prognosis of hepatocellular carcinoma (HCC) in 331 HBV-positive HCC patients without surgical treatment. Log-rank test and Cox proportional hazard models were used for survival analysis. We found that the A allele of rs9909601 in E2F3P1 was significantly associated with a better prognosis compared with the G allele [adjusted hazard ratio (HR) = 0.69, 95% confidence interval (CI) = 0.56–0.86, P = 0.001]. Additionally, this protective effect was more predominant for patients without chemotherapy and transcatheter hepatic arterial chemoembolization (TACE) treatment. Interestingly, we also detected a statistically significant multiplicative interaction between genotypes of rs9909601 and chemotherapy or TACE status on HCC survival (P for multiplicative interaction < 0.001). These findings indicate that rs9909601 in the pseudogene E2F3P1 may be a genetic marker for HCC prognosis in Chinese.
Collapse
Affiliation(s)
- Yun Pan
- Department of Epidemiology and Biostatistics, Jiangsu Key Laboratory of Cancer Biomarkers, Prevention and Treatment, Cancer Center, School of Public Health, Nanjing Medical University, Nanjing, Jiangsu 211166, China
| | - Chongqi Sun
- Department of Epidemiology and Biostatistics, Jiangsu Key Laboratory of Cancer Biomarkers, Prevention and Treatment, Cancer Center, School of Public Health, Nanjing Medical University, Nanjing, Jiangsu 211166, China
| | - Mingde Huang
- Department of Oncology, Huai'an First People's Hospital, Nanjing Medical University, Huai'an, Jiangsu 223300, China
| | - Yao Liu
- Pathology Center and Department of Pathology, Soochow University, Suzhou, Jiangsu 215123, China
| | - Fuzhen Qi
- Department of Hepatopancreatobiliary Surgery, Huai'an First People's Hospital, Nanjing Medical University, Huai'an, Jiangsu 223300, China
| | - Li Liu
- Digestive Endoscopy Center, the First Affiliated Hospital of Nanjing Medical University, Nanjing, Jiangsu 210029, China
| | - Juan Wen
- Department of Epidemiology and Biostatistics, Jiangsu Key Laboratory of Cancer Biomarkers, Prevention and Treatment, Cancer Center, School of Public Health, Nanjing Medical University, Nanjing, Jiangsu 211166, China
| | - Jibin Liu
- Department of Hepatobiliary Surgery, Nantong Tumor Hospital, Nantong, Jiangsu 226361, China
| | - Kaipeng Xie
- Department of Epidemiology and Biostatistics, Jiangsu Key Laboratory of Cancer Biomarkers, Prevention and Treatment, Cancer Center, School of Public Health, Nanjing Medical University, Nanjing, Jiangsu 211166, China
| | - Hongxia Ma
- Department of Epidemiology and Biostatistics, Jiangsu Key Laboratory of Cancer Biomarkers, Prevention and Treatment, Cancer Center, School of Public Health, Nanjing Medical University, Nanjing, Jiangsu 211166, China
| | - Zhibin Hu
- Department of Epidemiology and Biostatistics, Jiangsu Key Laboratory of Cancer Biomarkers, Prevention and Treatment, Cancer Center, School of Public Health, Nanjing Medical University, Nanjing, Jiangsu 211166, China
| | - Hongbing Shen
- Department of Epidemiology and Biostatistics, Jiangsu Key Laboratory of Cancer Biomarkers, Prevention and Treatment, Cancer Center, School of Public Health, Nanjing Medical University, Nanjing, Jiangsu 211166, China
| |
Collapse
|
177
|
Abstract
A pseudogene arises when a gene loses the ability to produce a protein, which can be due to mutation or inaccurate duplication. Previous dogma has dictated that because the pseudogene no longer produces a protein it becomes functionless and evolutionarily inert, being neither conserved nor removed. However, recent evidence has forced a re-evaluation of this view. Some pseudogenes, although not translated into protein, are at least transcribed into RNA. In some cases, these pseudogene transcripts are capable of influencing the activity of other genes that code for proteins, thereby altering expression and in turn affecting the phenotype of the organism. In the present chapter, we will define pseudogenes, describe the evidence that they are transcribed into non-coding RNAs and outline the mechanisms by which they are able to influence the machinery of the eukaryotic cell.
Collapse
|
178
|
Characterization of human pseudogene-derived non-coding RNAs for functional potential. PLoS One 2014; 9:e93972. [PMID: 24699680 PMCID: PMC3974860 DOI: 10.1371/journal.pone.0093972] [Citation(s) in RCA: 46] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2014] [Accepted: 03/10/2014] [Indexed: 11/19/2022] Open
Abstract
Thousands of pseudogenes exist in the human genome and many are transcribed, but their functional potential remains elusive and understudied. To explore these issues systematically, we first developed a computational pipeline to identify transcribed pseudogenes from RNA-Seq data. Applying the pipeline to datasets from 16 distinct normal human tissues identified ∼ 3,000 pseudogenes that could produce non-coding RNAs in a manner of low abundance but high tissue specificity under normal physiological conditions. Cross-tissue comparison revealed that the transcriptional profiles of pseudogenes and their parent genes showed mostly positive correlations, suggesting that pseudogene transcription could have a positive effect on the expression of their parent genes, perhaps by functioning as competing endogenous RNAs (ceRNAs), as previously suggested and demonstrated with the PTEN pseudogene, PTENP1. Our analysis of the ENCODE project data also found many transcriptionally active pseudogenes in the GM12878 and K562 cell lines; moreover, it showed that many human pseudogenes produced small RNAs (sRNAs) and some pseudogene-derived sRNAs, especially those from antisense strands, exhibited evidence of interfering with gene expression. Further integrated analysis of transcriptomics and epigenomics data, however, demonstrated that trimethylation of histone 3 at lysine 9 (H3K9me3), a posttranslational modification typically associated with gene repression and heterochromatin, was enriched at many transcribed pseudogenes in a transcription-level dependent manner in the two cell lines. The H3K9me3 enrichment was more prominent in pseudogenes that produced sRNAs at pseudogene loci and their adjacent regions, an observation further supported by the co-enrichment of SETDB1 (a H3K9 methyltransferase), suggesting that pseudogene sRNAs may have a role in regional chromatin repression. Taken together, our comprehensive and systematic characterization of pseudogene transcription uncovers a complex picture of how pseudogene ncRNAs could influence gene and pseudogene expression, at both epigenetic and post-transcriptional levels.
Collapse
|
179
|
Shore AN, Rosen JM. Regulation of mammary epithelial cell homeostasis by lncRNAs. Int J Biochem Cell Biol 2014; 54:318-30. [PMID: 24680897 DOI: 10.1016/j.biocel.2014.03.012] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2013] [Revised: 03/14/2014] [Accepted: 03/18/2014] [Indexed: 01/02/2023]
Abstract
The epithelial cells of the mammary gland develop primarily after birth and undergo surges of hormonally regulated proliferation, differentiation, and apoptosis during both puberty and pregnancy. Thus, the mammary gland is a useful model to study fundamental processes of development and adult tissue homeostasis, such as stem and progenitor cell regulation, cell fate commitment, and differentiation. Long noncoding RNAs (lncRNAs) are emerging as prominent regulators of these essential processes, as their extraordinary versatility allows them to modulate gene expression via diverse mechanisms at both transcriptional and post-transcriptional levels. Not surprisingly, lncRNAs are also aberrantly expressed in cancer and promote tumorigenesis by disrupting vital cellular functions, such as cell cycle, survival, and migration. In this review, we first broadly summarize the functions of lncRNAs in mammalian development and cancer. Then we focus on what is currently known about the role of lncRNAs in mammary gland development and breast cancer. This article is part of a Directed Issue entitled: The Non-coding RNA Revolution.
Collapse
Affiliation(s)
- Amy N Shore
- Department of Molecular and Cellular Biology, Baylor College of Medicine, Houston, TX, United States.
| | - Jeffrey M Rosen
- Department of Molecular and Cellular Biology, Baylor College of Medicine, Houston, TX, United States
| |
Collapse
|
180
|
Human transporter database: comprehensive knowledge and discovery tools in the human transporter genes. PLoS One 2014; 9:e88883. [PMID: 24558441 PMCID: PMC3928311 DOI: 10.1371/journal.pone.0088883] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2013] [Accepted: 01/12/2014] [Indexed: 11/25/2022] Open
Abstract
Transporters are essential in homeostatic exchange of endogenous and exogenous substances at the systematic, organic, cellular, and subcellular levels. Gene mutations of transporters are often related to pharmacogenetics traits. Recent developments in high throughput technologies on genomics, transcriptomics and proteomics allow in depth studies of transporter genes in normal cellular processes and diverse disease conditions. The flood of high throughput data have resulted in urgent need for an updated knowledgebase with curated, organized, and annotated human transporters in an easily accessible way. Using a pipeline with the combination of automated keywords query, sequence similarity search and manual curation on transporters, we collected 1,555 human non-redundant transporter genes to develop the Human Transporter Database (HTD) (http://htd.cbi.pku.edu.cn). Based on the extensive annotations, global properties of the transporter genes were illustrated, such as expression patterns and polymorphisms in relationships with their ligands. We noted that the human transporters were enriched in many fundamental biological processes such as oxidative phosphorylation and cardiac muscle contraction, and significantly associated with Mendelian and complex diseases such as epilepsy and sudden infant death syndrome. Overall, HTD provides a well-organized interface to facilitate research communities to search detailed molecular and genetic information of transporters for development of personalized medicine.
Collapse
|
181
|
Sui Y, Li B, Shi J, Chen M. Genomic, regulatory and epigenetic mechanisms underlying duplicated gene evolution in the natural allotetraploid Oryza minuta. BMC Genomics 2014; 15:11. [PMID: 24393121 PMCID: PMC3890553 DOI: 10.1186/1471-2164-15-11] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2013] [Accepted: 12/30/2013] [Indexed: 11/27/2022] Open
Abstract
Background Polyploid species contribute to Oryza diversity. However, the mechanisms underlying gene and genome evolution in Oryza polyploids remain largely unknown. The allotetraploid Oryza minuta, which is estimated to have formed less than one million years ago, along with its putative diploid progenitors (O. punctata and O. officinalis), are quite suitable for the study of polyploid genome evolution using a comparative genomics approach. Results Here, we performed a comparative study of a large genomic region surrounding the Shattering4 locus in O. minuta, as well as in O. punctata and O. officinalis. Duplicated genomes in O. minuta have maintained the diploid genome organization, except for several structural variations mediated by transposon movement. Tandem duplicated gene clusters are prevalent in the Sh4 region, and segmental duplication followed by random deletion is illustrated to explain the gene gain-and-loss process. Both copies of most duplicated genes still persist in O. minuta. Molecular evolution analysis suggested that these duplicated genes are equally evolved and mostly manipulated by purifying selection. However, cDNA-SSCP analysis revealed that the expression patterns were dramatically altered between duplicated genes: nine of 29 duplicated genes exhibited expression divergence in O. minuta. We further detected one gene silencing event that was attributed to gene structural variation, but most gene silencing could not be related to sequence changes. We identified one case in which DNA methylation differences within promoter regions that were associated with the insertion of one hAT element were probably responsible for gene silencing, suggesting a potential epigenetic gene silencing pathway triggered by TE movement. Conclusions Our study revealed both genetic and epigenetic mechanisms involved in duplicated gene silencing in the allotetraploid O. minuta.
Collapse
Affiliation(s)
| | | | | | - Mingsheng Chen
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, China.
| |
Collapse
|
182
|
Hesson LB, Ward RL. Discrimination of pseudogene and parental gene DNA methylation using allelic bisulfite sequencing. Methods Mol Biol 2014; 1167:265-274. [PMID: 24823784 DOI: 10.1007/978-1-4939-0835-6_18] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
Determining the methylation status of genes with pseudogenes can be technically challenging due to sequence homology. High sequence homology can result in the amplification of both pseudogene and parental gene alleles, potentially leading to data misinterpretation. Allelic bisulfite sequencing allows for detection of the methylation status of individual alleles at nucleotide resolution and represents the most reliable method for discriminating pseudogene and parental gene sequences. Here, we discuss important points that should be considered when investigating pseudogene and parental gene methylation status and we describe the method of allelic bisulfite sequencing, including assay design.
Collapse
Affiliation(s)
- Luke B Hesson
- Adult Cancer Program, Lowy Cancer Research Centre and Prince of Wales Clinical School, University of New South Wales, Kensington, Sydney, NSW, 2227, Australia,
| | | |
Collapse
|
183
|
The miRNA-mediated cross-talk between transcripts provides a novel layer of posttranscriptional regulation. ADVANCES IN GENETICS 2014; 85:149-99. [PMID: 24880735 DOI: 10.1016/b978-0-12-800271-1.00003-2] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Endogenously expressed transcripts that are posttranscriptionally regulated by the same microRNAs (miRNAs) will, in principle, compete for the binding of their shared small noncoding RNA regulators and modulate each other's abundance. Recently, the levels of some coding as well as noncoding transcripts have indeed been found to be regulated in this way. Transcripts that engage in such regulatory interactions are referred to as competitive endogenous RNAs (ceRNAs). This novel layer of posttranscriptional regulation has been shown to contribute to diverse aspects of organismal and cellular biology, despite the number of functionally characterized ceRNAs being as yet relatively low. Importantly, increasing evidence suggests that the dysregulation of some ceRNA interactions is associated with disease etiology, most preeminently with cancer. Here we review how posttranscriptional regulation by miRNAs contributes to the cross-talk between transcripts and review examples of known ceRNAs by highlighting the features underlying their interactions and what might be their biological relevance.
Collapse
|
184
|
Abstract
The study of pseudogenes, originally dismissed as genomic relics of evolutionary selection, has seen a resurgence in scientific literature, in addition to being a peculiar topic of discussion in theological debates. For a long time, pseudogenes have been touted as a beacon of natural selection and a definitive proof of evolution due to the slow mutation rate that differentiated them from their parental genes and ultimately caused their genetic demise as functional genes. It now seems that "creationists" have co-opted some recent reports identifying unheralded biological functions to pseudogens and other noncoding RNAs as evidence to undermine the existence of evolution and supporting intelligent design. This issue of Methods in Molecular Biology focused on pseudogenes will certainly not end, nor enter this debate; however, scientists who are also genomics and pseudogene enthusiasts will certainly appreciate that many scientists are thinking about these particular genetic elements in new and interesting ways. With this new interest in a biological significance and "non-junk" role for pseudogenes and other noncoding RNAs, new methods and approaches are being developed to unlock the mystery of these ancient artifacts we know as pseudogenes. In this brief introductory chapter we highlight the renewed interest in pseudogenes and review a rationale for intensification of pseudogene-related research.
Collapse
|
185
|
Valdes C, Capobianco E. Methods to detect transcribed pseudogenes: RNA-Seq discovery allows learning through features. Methods Mol Biol 2014; 1167:157-83. [PMID: 24823777 DOI: 10.1007/978-1-4939-0835-6_11] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/04/2022]
Abstract
The detection of transcripts and the measurement of their associated activity at the pseudogene scale have recently become important topics of research. Being integral part of many recent studies aimed at establishing a role for a variety of noncoding RNA structures, pseudogenes' popularity has substantially increased due to the discovery of regulatory properties and complex mechanisms of action that, while requiring further investigation, analysis, and validation, promise as well to have a broad impact on human disease. Currently, there are relatively few methodologies specifically designed to accomplish the detection of pseudogene transcripts and tools that either replace or integrate manual annotation procedures are very much needed. In particular, it seems to us justified that we engage in advancing the computational treatment of pseudogenes at the whole transcriptome level. Catalogs of human pseudogenes have started to be delivered, through RNA-Seq technologies. However, just a certain number of transcriptomes has been covered. Furthermore, while most proposals have led to the production of a targeted algorithm, especially used for detection, few computational pipelines were designed following a comprehensive approach addressing identification and quantification of transcriptional activity within a unifying methodological frame. Given the currently incomplete evidence, the limitations of the impacts due to the lack of extensive testing, and the presence of unsolved uncertainties affecting the reproducibility of results, our motivation for the proposal of a new computational approach is high and timely. We have considered a hybrid approach, based on the assembly of a variety of computational tools, including RNA-Seq methods and machine learning applications, all applied to transcriptome data of various complexities. Our initial strategy is to provide lists of pseudogenes to be validated against the currently known examples, in order to extend our knowledge further. An ultimate goal that is naturally linked to this work is to provide an automatic approach that analyzes transcriptomes with the goal of detecting candidate pseudogenes through characteristic features and that allows efficient and reproducible pseudogene classification models.
Collapse
Affiliation(s)
- Camilo Valdes
- Center for Computational Science, University of Miami, Miami, FL, 33146, USA
| | | |
Collapse
|
186
|
The OCT4 pseudogene POU5F1B is amplified and promotes an aggressive phenotype in gastric cancer. Oncogene 2013; 34:199-208. [PMID: 24362523 DOI: 10.1038/onc.2013.547] [Citation(s) in RCA: 97] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2013] [Revised: 11/12/2013] [Accepted: 11/15/2013] [Indexed: 12/15/2022]
Abstract
POU5F1B (POU domain class 5 transcription factor 1B), a processed pseudogene that is highly homologous to OCT4, was recently shown to be transcribed in cancer cells, but its clinical relevance and biological function have remained unclear. We now show that POU5F1B, which is located adjacent to MYC on human chromosome 8q24, is frequently amplified in gastric cancer (GC) cell lines. POU5F1B, but not OCT4, was also found to be expressed at a high level in GC cell lines and clinical specimens. In addition, the DNA copy number and mRNA abundance for POU5F1B showed a positive correlation in both cancer cell lines and GC specimens. Overexpression of POU5F1B in GC cells promoted colony formation in vitro as well as both tumorigenicity and tumor growth in vivo, and these effects were enhanced in the additional presence of MYC overexpression. Furthermore, knockdown of POU5F1B expression with a short hairpin RNA confirmed a role for the endogenous pseudogene in the promotion of cancer cell growth in vitro and tumor growth in vivo. POU5F1B overexpression induced upregulation of various growth factors in GC cells as well as exhibited mitogenic, angiogenic and antiapoptotic effects in GC xenografts. Finally, amplification of POU5F1B was detected in 17 (12%) of 145 cases of GC and was a significant predictor of poor prognosis in patients with stage IV disease. In conclusion, we found that the POU5F1B pseudogene is amplified and expressed at a high level in, as well as confers an aggressive phenotype on, GC, and that POU5F1B amplification is associated with a poor prognosis in GC patients.
Collapse
|
187
|
Branca RMM, Orre LM, Johansson HJ, Granholm V, Huss M, Pérez-Bercoff Å, Forshed J, Käll L, Lehtiö J. HiRIEF LC-MS enables deep proteome coverage and unbiased proteogenomics. Nat Methods 2013; 11:59-62. [PMID: 24240322 DOI: 10.1038/nmeth.2732] [Citation(s) in RCA: 182] [Impact Index Per Article: 16.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2013] [Accepted: 10/08/2013] [Indexed: 11/09/2022]
Abstract
We present a liquid chromatography-mass spectrometry (LC-MS)-based method permitting unbiased (gene prediction-independent) genome-wide discovery of protein-coding loci in higher eukaryotes. Using high-resolution isoelectric focusing (HiRIEF) at the peptide level in the 3.7-5.0 pH range and accurate peptide isoelectric point (pI) prediction, we probed the six-reading-frame translation of the human and mouse genomes and identified 98 and 52 previously undiscovered protein-coding loci, respectively. The method also enabled deep proteome coverage, identifying 13,078 human and 10,637 mouse proteins.
Collapse
Affiliation(s)
- Rui M M Branca
- 1] Department of Oncology-Pathology, Science for Life Laboratory, Karolinska Institutet, Stockholm, Sweden. [2]
| | - Lukas M Orre
- 1] Department of Oncology-Pathology, Science for Life Laboratory, Karolinska Institutet, Stockholm, Sweden. [2]
| | - Henrik J Johansson
- 1] Department of Oncology-Pathology, Science for Life Laboratory, Karolinska Institutet, Stockholm, Sweden. [2]
| | - Viktor Granholm
- Department of Biochemistry and Biophysics, Science for Life Laboratory, Stockholm University, Stockholm, Sweden
| | - Mikael Huss
- Department of Biochemistry and Biophysics, The Arrhenius Laboratories for Natural Sciences, Science for Life Laboratory, Stockholm University, Stockholm, Sweden
| | - Åsa Pérez-Bercoff
- Department of Oncology-Pathology, Science for Life Laboratory, Karolinska Institutet, Stockholm, Sweden
| | - Jenny Forshed
- Department of Oncology-Pathology, Science for Life Laboratory, Karolinska Institutet, Stockholm, Sweden
| | - Lukas Käll
- 1] School of Biotechnology, Science for Life Laboratory, KTH Royal Institute of Technology, Stockholm, Sweden. [2] Swedish e-Science Resource Center, KTH Royal Institute of Technology, Stockholm, Sweden
| | - Janne Lehtiö
- Department of Oncology-Pathology, Science for Life Laboratory, Karolinska Institutet, Stockholm, Sweden
| |
Collapse
|
188
|
Jerabek S, Merino F, Schöler HR, Cojocaru V. OCT4: dynamic DNA binding pioneers stem cell pluripotency. BIOCHIMICA ET BIOPHYSICA ACTA-GENE REGULATORY MECHANISMS 2013; 1839:138-54. [PMID: 24145198 DOI: 10.1016/j.bbagrm.2013.10.001] [Citation(s) in RCA: 108] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/14/2013] [Revised: 10/02/2013] [Accepted: 10/06/2013] [Indexed: 12/12/2022]
Abstract
OCT4 was discovered more than two decades ago as a transcription factor specific to early embryonic development. Early studies with OCT4 were descriptive and looked at determining the functional roles of OCT4 in the embryo as well as in pluripotent cell lines derived from embryos. Later studies showed that OCT4 was one of the transcription factors in the four-factor cocktail required for reprogramming somatic cells into induced pluripotent stem cells (iPSCs) and that it is the only factor that cannot be substituted in this process by other members of the same protein family. In recent years, OCT4 has emerged as a master regulator of the induction and maintenance of cellular pluripotency, with crucial roles in the early stages of differentiation. Currently, mechanistic studies look at elucidating the molecular details of how OCT4 contributes to establishing selective gene expression programs that define different developmental stages of pluripotent cells. OCT4 belongs to the POU family of proteins, which have two conserved DNA-binding domains connected by a variable linker region. The functions of OCT4 depend on its ability to recognize and bind to DNA regulatory regions alone or in cooperation with other transcription factors and on its capacity to recruit other factors required to regulate the expression of specific sets of genes. Undoubtedly, future iPSC-based applications in regenerative medicine will benefit from understanding how OCT4 functions. Here we provide an integrated view of OCT4 research conducted to date by reviewing the different functional roles for OCT4 and discussing the current progress in understanding their underlying molecular mechanisms. This article is part of a Special Issue entitled: Chromatin and epigenetic regulation of animal development.
Collapse
Affiliation(s)
- Stepan Jerabek
- Department of Cell and Developmental Biology, Max Planck Institute for Molecular Biomedicine, Röntgenstrasse 20, 48149 Münster, Germany
| | - Felipe Merino
- Department of Cell and Developmental Biology, Max Planck Institute for Molecular Biomedicine, Röntgenstrasse 20, 48149 Münster, Germany
| | - Hans Robert Schöler
- Department of Cell and Developmental Biology, Max Planck Institute for Molecular Biomedicine, Röntgenstrasse 20, 48149 Münster, Germany.
| | - Vlad Cojocaru
- Department of Cell and Developmental Biology, Max Planck Institute for Molecular Biomedicine, Röntgenstrasse 20, 48149 Münster, Germany.
| |
Collapse
|
189
|
Abstract
UNLABELLED The cancer transcriptome is characterized by aberrant expression of both protein-coding and noncoding transcripts. Similar to mRNAs, a significant portion of the noncoding transcriptome, including long noncoding RNAs and pseudogenes, harbors microRNA (miRNA)-response elements (MRE). The recent discovery of competitive endogenous RNAs (ceRNA), natural decoys that compete for a common pool of miRNAs, provides a framework to systematically functionalize MRE-harboring noncoding RNAs and integrate them with the protein-coding RNA dimension in complex ceRNA networks. Functional interactions in ceRNA networks aid in coordinating a number of biologic processes and, when perturbed, contribute to disease pathogenesis. In this review, we discuss recent discoveries that implicate natural miRNA decoys in the development of cancer. SIGNIFICANCE Cross-talk between ceRNAs through shared miRNAs represents a novel layer of gene regulation that plays important roles in the physiology and development of diseases such as cancer. As cross-talk can be predicted on the basis of the overlap of miRNA-binding sites, we are now one step closer to a complete functionalization of the human transcriptome, especially the noncoding space.
Collapse
Affiliation(s)
- Florian A Karreth
- Cancer Genetics Program, Beth Israel Deaconess Cancer Center, Department of Medicine and Pathology, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, Massachusetts
| | | |
Collapse
|
190
|
Therapeutic implications of activation of the host gene (Dleu2) promoter for miR-15a/16-1 in chronic lymphocytic leukemia. Oncogene 2013; 33:3307-15. [PMID: 23995789 PMCID: PMC4508006 DOI: 10.1038/onc.2013.291] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2012] [Revised: 04/25/2013] [Accepted: 05/09/2013] [Indexed: 12/30/2022]
Abstract
Genetic lesions and other regulatory events lead to silencing of the 13q14 locus in a majority of chronic lymphocytic leukemia (CLL) patients. This locus encodes a pair of critical pro-apoptotic microRNAs, miR-15a/16-1. Decreased levels of miR-15a/16-1 are critical for the increased survival exhibited by CLL cells. Similarly, in a de novo murine model of CLL, the NZB strain, germline-encoded regulation of the syntenic region resulted in decreased miR-15a/16-1. In this paper we have identified additional molecular mechanisms regulating miR-15a/16-1 levels and shown that the transcription factor BSAP (B cell Specific Activator Protein) directly interacts with Dleu2, the host gene containing the mir-15a/16-1 loci and via negative regulation of the Dleu2 promoter results in repression of mir-15a/16 expression. CLL patient B cell expression levels of BSAP were increased compared to control sources of B cells. With the use of siRNA mediated repression, the levels of BSAP were decreased in vitro in the NZB derived malignant B1 cell line, LNC, and in ex vivo CLL patient PBMC. BSAP knockdown led to an increase in the expression of miR-15a/16-1 and an increase in apoptosis and a cell cycle arrest in both the cell line and patient PBMC. Moreover, using Dleu2 promoter analysis by chromatin immunoprecipitation (ChIP) assay we have shown that BSAP directly interacts with the Dleu2 promoter. Derepression of the Dleu2 promoter via inhibition of histone deacetylation combined with BSAP knockdown increased miR-15a/16 expression and increased malignant B cell death. In summary, therapy targeting enhanced host gene Dleu2 transcription may augment CLL therapy.
Collapse
|
191
|
Li CH, Chen Y. Targeting long non-coding RNAs in cancers: Progress and prospects. Int J Biochem Cell Biol 2013; 45:1895-910. [DOI: 10.1016/j.biocel.2013.05.030] [Citation(s) in RCA: 344] [Impact Index Per Article: 31.3] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2013] [Revised: 05/21/2013] [Accepted: 05/23/2013] [Indexed: 02/07/2023]
|
192
|
Discovery of structural alterations in solid tumor oligodendroglioma by single molecule analysis. BMC Genomics 2013; 14:505. [PMID: 23885787 PMCID: PMC3727977 DOI: 10.1186/1471-2164-14-505] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2013] [Accepted: 07/23/2013] [Indexed: 12/31/2022] Open
Abstract
Background Solid tumors present a panoply of genomic alterations, from single base changes to the gain or loss of entire chromosomes. Although aberrations at the two extremes of this spectrum are readily defined, comprehensive discernment of the complex and disperse mutational spectrum of cancer genomes remains a significant challenge for current genome analysis platforms. In this context, high throughput, single molecule platforms like Optical Mapping offer a unique perspective. Results Using measurements from large ensembles of individual DNA molecules, we have discovered genomic structural alterations in the solid tumor oligodendroglioma. Over a thousand structural variants were identified in each tumor sample, without any prior hypotheses, and often in genomic regions deemed intractable by other technologies. These findings were then validated by comprehensive comparisons to variants reported in external and internal databases, and by selected experimental corroborations. Alterations range in size from under 5 kb to hundreds of kilobases, and comprise insertions, deletions, inversions and compound events. Candidate mutations were scored at sub-genic resolution and unambiguously reveal structural details at aberrant loci. Conclusions The Optical Mapping system provides a rich description of the complex genomes of solid tumors, including sequence level aberrations, structural alterations and copy number variants that power generation of functional hypotheses for oligodendroglioma genetics.
Collapse
|
193
|
Rapicavoli NA, Qu K, Zhang J, Mikhail M, Laberge RM, Chang HY. A mammalian pseudogene lncRNA at the interface of inflammation and anti-inflammatory therapeutics. eLife 2013; 2:e00762. [PMID: 23898399 PMCID: PMC3721235 DOI: 10.7554/elife.00762] [Citation(s) in RCA: 364] [Impact Index Per Article: 33.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2013] [Accepted: 06/18/2013] [Indexed: 12/30/2022] Open
Abstract
Pseudogenes are thought to be inactive gene sequences, but recent evidence of extensive pseudogene transcription raised the question of potential function. Here we discover and characterize the sets of mouse lncRNAs induced by inflammatory signaling via TNFα. TNFα regulates hundreds of lncRNAs, including 54 pseudogene lncRNAs, several of which show exquisitely selective expression in response to specific cytokines and microbial components in a NF-κB-dependent manner. Lethe, a pseudogene lncRNA, is selectively induced by proinflammatory cytokines via NF-κB or glucocorticoid receptor agonist, and functions in negative feedback signaling to NF-κB. Lethe interacts with NF-κB subunit RelA to inhibit RelA DNA binding and target gene activation. Lethe level decreases with organismal age, a physiological state associated with increased NF-κB activity. These findings suggest that expression of pseudogenes lncRNAs are actively regulated and constitute functional regulators of inflammatory signaling. DOI:http://dx.doi.org/10.7554/eLife.00762.001 The simplest account of gene expression is that DNA is transcribed into messenger RNA, which is then translated into a protein. However, not all genes encode proteins; for some it is the RNA molecule itself that is the end product. Many of these ‘non-coding RNAs’ are thought to be involved in regulating the expression of other genes, but their exact functions are unknown. Pseudogenes are genes that have lost their protein-coding abilities as a result of mutations they have accumulated mutations over the course of evolution. They were previously referred to as ‘junk DNA’ or ‘dead genes’ because they were thought to be completely non-functional, lacking even the ability to encode RNA. However, recent work has shown that pseudogenes are in fact transcribed into long non-coding RNAs, and these are now the focus of much research. Here, Rapicavoli et al. report that certain pseudogenes and long non-coding RNAs are involved in regulating the immune response. Specific and distinct pseudogene-derived long RNAs are made when cells are exposed to different kinds of infections. Immune cells such as macrophages and lymphocytes produce a protein called tumor necrosis factor alpha (TNFα), which is involved in triggering fever and inflammation. TNFα exerts these effects by binding to and activating a transcription factor called NF-κB, which then moves to the nucleus and binds to DNA, regulating the expression of genes that encode immune proteins. Rapicavoli et al. found that the production of a long non-coding RNA called Lethe (after the ‘river of forgetfulness’ in Greek mythology) increases when TNFα activates NF-κB. Surprisingly, however, Lethe then binds to NF-κB and prevents it from interacting with DNA, thereby reducing the production of various inflammatory proteins. This is the first time that a pseudogene has been shown to have an active role in regulating signaling pathways involved in inflammation, and raises the possibility that other pseudogenes may also influence distinct feedback loops and signaling networks. It suggests that many novel functions for pseudogenes and long non-coding RNAs remain to be discovered. DOI:http://dx.doi.org/10.7554/eLife.00762.002
Collapse
Affiliation(s)
- Nicole A Rapicavoli
- Program in Epithelial Biology , Howard Hughes Medical Institute, Stanford University School of Medicine , Stanford , United States
| | | | | | | | | | | |
Collapse
|
194
|
Geahlen JH, Lapid C, Thorell K, Nikolskiy I, Huh WJ, Oates EL, Lennerz JKM, Tian X, Weis VG, Khurana SS, Lundin SB, Templeton AR, Mills JC. Evolution of the human gastrokine locus and confounding factors regarding the pseudogenicity of GKN3. Physiol Genomics 2013; 45:667-83. [PMID: 23715263 DOI: 10.1152/physiolgenomics.00169.2012] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023] Open
Abstract
In a screen for genes expressed specifically in gastric mucous neck cells, we identified GKN3, the recently discovered third member of the gastrokine family. We present confirmatory mouse data and novel porcine data showing that mouse GKN3 expression is confined to mucous cells of the corpus neck and antrum base and is prominently expressed in metaplastic lesions. GKN3 was proposed originally to be expressed in some human populations and a pseudogene in others. To investigate that hypothesis, we studied human GKN3 evolution in the context of its paralogous genomic neighbors, GKN1 and GKN2. Haplotype analysis revealed that GKN3 mimics GKN2 in patterns of exonic SNP allocation, whereas GKN1 appeared to be more stringently selected. GKN3 showed signatures of both directional selection and population based selective sweeps in humans. One such selective sweep includes SNP rs10187256, originally identified as an ancestral tryptophan to premature STOP codon mutation. The derived (nonancestral) allele went to fixation in Asia. We show that another SNP, rs75578132, identified 5 bp downstream of rs10187256, exhibits a second selective sweep in almost all Europeans, some Latinos, and some Africans, possibly resulting from a reintroduction of European genes during African colonization. Finally, we identify a mutation that would destroy the splice donor site in the putative exon3-intron3 boundary, which occurs in all human genomes examined to date. Our results highlight a stomach-specific human genetic locus, which has undergone various selective sweeps across European, Asian, and African populations and thus reflects geographic and ethnic patterns in genome evolution.
Collapse
Affiliation(s)
- Jessica H Geahlen
- Division of Gastroenterology, Department of Medicine, School of Medicine, Washington University, St. Louis, Missouri 63110, USA
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
195
|
Mandal PK, Ewing AD, Hancks DC, Kazazian HH. Enrichment of processed pseudogene transcripts in L1-ribonucleoprotein particles. Hum Mol Genet 2013; 22:3730-48. [PMID: 23696454 DOI: 10.1093/hmg/ddt225] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Long INterspersed Elements (LINE-1s, L1s) are responsible for over one million retrotransposon insertions and 8000 processed pseudogenes (PPs) in the human genome. An active L1 encodes two proteins (ORF1p and ORF2p) that bind with L1 RNA and form L1-ribonucleoprotein particles (RNPs). Although it is believed that the RNA-binding property of ORF1p is critical to recruit other mobile RNAs to the RNP, the identity of recruited RNAs is largely unknown. Here, we used crosslinking and immunoprecipitation followed by deep sequencing to identify RNA components of L1-RNPs. Our results show that in addition to retrotransposed RNAs [L1, Alu and SINE-VNTR-Alu (SVA)], L1-RNPs are enriched with cellular mRNAs, which have PPs in the human genome. Using purified L1-RNPs, we show that PP-source RNAs preferentially serve as ORF2p templates in a reverse transcriptase assay. In addition, we find that exogenous ORF2p binds endogenous ORF1p, allowing reverse transcription of the same PP-source RNAs. These data demonstrate that interaction of a cellular RNA with the L1-RNP is an inside track to PP formation.
Collapse
Affiliation(s)
- Prabhat K Mandal
- McKusick-Nathans Institute of Genetic Medicine and Department of Pediatrics, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | | | | | | |
Collapse
|
196
|
Kim D, Pertea G, Trapnell C, Pimentel H, Kelley R, Salzberg SL. TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions. Genome Biol 2013. [PMID: 23618408 DOI: 10.1101/000851] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/16/2023] Open
Abstract
TopHat is a popular spliced aligner for RNA-sequence (RNA-seq) experiments. In this paper, we describe TopHat2, which incorporates many significant enhancements to TopHat. TopHat2 can align reads of various lengths produced by the latest sequencing technologies, while allowing for variable-length indels with respect to the reference genome. In addition to de novo spliced alignment, TopHat2 can align reads across fusion breaks, which can occur after genomic translocations. TopHat2 combines the ability to identify novel splice sites with direct mapping to known transcripts, producing sensitive and accurate alignments, even for highly repetitive genomes or in the presence of pseudogenes. TopHat2 is available at http://ccb.jhu.edu/software/tophat.
Collapse
|
197
|
Kim D, Pertea G, Trapnell C, Pimentel H, Kelley R, Salzberg SL. TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions. Genome Biol 2013. [PMID: 23618408 DOI: 10.1186/gb‐2013‐14‐4‐r36] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
TopHat is a popular spliced aligner for RNA-sequence (RNA-seq) experiments. In this paper, we describe TopHat2, which incorporates many significant enhancements to TopHat. TopHat2 can align reads of various lengths produced by the latest sequencing technologies, while allowing for variable-length indels with respect to the reference genome. In addition to de novo spliced alignment, TopHat2 can align reads across fusion breaks, which can occur after genomic translocations. TopHat2 combines the ability to identify novel splice sites with direct mapping to known transcripts, producing sensitive and accurate alignments, even for highly repetitive genomes or in the presence of pseudogenes. TopHat2 is available at http://ccb.jhu.edu/software/tophat.
Collapse
|
198
|
Kim D, Pertea G, Trapnell C, Pimentel H, Kelley R, Salzberg SL. TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions. Genome Biol 2013. [PMID: 23618408 DOI: 10.1186/gb-2013-14-4-r361186/gb-2013-14-4-r36] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/11/2023] Open
Abstract
TopHat is a popular spliced aligner for RNA-sequence (RNA-seq) experiments. In this paper, we describe TopHat2, which incorporates many significant enhancements to TopHat. TopHat2 can align reads of various lengths produced by the latest sequencing technologies, while allowing for variable-length indels with respect to the reference genome. In addition to de novo spliced alignment, TopHat2 can align reads across fusion breaks, which can occur after genomic translocations. TopHat2 combines the ability to identify novel splice sites with direct mapping to known transcripts, producing sensitive and accurate alignments, even for highly repetitive genomes or in the presence of pseudogenes. TopHat2 is available at http://ccb.jhu.edu/software/tophat.
Collapse
|
199
|
Kim D, Pertea G, Trapnell C, Pimentel H, Kelley R, Salzberg SL. TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions. Genome Biol 2013; 14:R36. [PMID: 23618408 PMCID: PMC4053844 DOI: 10.1186/gb-2013-14-4-r36] [Citation(s) in RCA: 9256] [Impact Index Per Article: 841.5] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2012] [Accepted: 04/25/2013] [Indexed: 11/10/2022] Open
Abstract
TopHat is a popular spliced aligner for RNA-sequence (RNA-seq) experiments. In this paper, we describe TopHat2, which incorporates many significant enhancements to TopHat. TopHat2 can align reads of various lengths produced by the latest sequencing technologies, while allowing for variable-length indels with respect to the reference genome. In addition to de novo spliced alignment, TopHat2 can align reads across fusion breaks, which can occur after genomic translocations. TopHat2 combines the ability to identify novel splice sites with direct mapping to known transcripts, producing sensitive and accurate alignments, even for highly repetitive genomes or in the presence of pseudogenes. TopHat2 is available at http://ccb.jhu.edu/software/tophat.
Collapse
|
200
|
Wang L, Guo ZY, Zhang R, Xin B, Chen R, Zhao J, Wang T, Wen WH, Jia LT, Yao LB, Yang AG. Pseudogene OCT4-pg4 functions as a natural micro RNA sponge to regulate OCT4 expression by competing for miR-145 in hepatocellular carcinoma. Carcinogenesis 2013; 34:1773-81. [PMID: 23615404 DOI: 10.1093/carcin/bgt139] [Citation(s) in RCA: 124] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
The POU transcription factor OCT4 is a pleiotropic regulator of gene expression in embryonic stem cells. Recent studies demonstrated that OCT4 is aberrantly expressed in multiple types of human cancer; however, the underlying molecular mechanism remains largely unknown. In this study, we report that OCT4-pg4, a pseudogene of OCT4, is abnormally activated in hepatocellular carcinoma (HCC). The expression level of OCT4-pg4 is positively correlated with that of OCT4, and both gene transcripts can be directly targeted by a tumor-suppressive micro RNA miR-145. We find that the non-coding RNA OCT4-pg4 is biologically active, as it can upregulate OCT4 protein level in HCC. Mechanistic analysis revealed that OCT4-pg4 functions as a natural micro RNA sponge to protect OCT4 transcript from being inhibited by miR-145. In addition, our study also showed that OCT4-pg4 can promote growth and tumorigenicity of HCC cells, thus exerting an oncogenic role in hepatocarcinogenesis. Furthermore, survival analysis suggests that high OCT4-pg4 level is significantly correlated with poor prognosis of HCC patients. Taken together, our finding adds a new layer of post-transcriptional regulation of OCT4 and sheds new light on the treatment of human HCC.
Collapse
Affiliation(s)
- Lei Wang
- State Key Laboratory of Cancer Biology, Department of Biochemistry and Molecular Biology, Fourth Military Medical University, Xi’an, China.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|