1
|
Nakamura-García AK, Espinal-Enríquez J. Pseudogenes in Cancer: State of the Art. Cancers (Basel) 2023; 15:4024. [PMID: 37627052 PMCID: PMC10452131 DOI: 10.3390/cancers15164024] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2023] [Revised: 07/24/2023] [Accepted: 07/26/2023] [Indexed: 08/27/2023] Open
Abstract
Pseudogenes are duplicates of protein-coding genes that have accumulated multiple detrimental alterations, rendering them unable to produce the protein they encode. Initially disregarded as "junk DNA" due to their perceived lack of functionality, research on their biological roles has been hindered by this assumption. Nevertheless, recent focus has shifted towards these molecules due to their abnormal expression in cancer phenotypes. In this review, our objective is to provide a thorough overview of the current understanding of pseudogene formation, the mechanisms governing their expression, and the roles they may play in promoting tumorigenesis.
Collapse
|
2
|
Huang M, Ye X, Imakura A, Sakurai T. Sequential reinforcement active feature learning for gene signature identification in renal cell carcinoma. J Biomed Inform 2022; 128:104049. [DOI: 10.1016/j.jbi.2022.104049] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2021] [Revised: 02/03/2022] [Accepted: 03/06/2022] [Indexed: 10/18/2022]
|
3
|
Troskie RL, Faulkner GJ, Cheetham SW. Processed pseudogenes: A substrate for evolutionary innovation: Retrotransposition contributes to genome evolution by propagating pseudogene sequences with rich regulatory potential throughout the genome. Bioessays 2021; 43:e2100186. [PMID: 34569081 DOI: 10.1002/bies.202100186] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2021] [Revised: 09/09/2021] [Accepted: 09/13/2021] [Indexed: 11/08/2022]
Abstract
Processed pseudogenes may serve as a genetic reservoir for evolutionary innovation. Here, we argue that through the activity of long interspersed element-1 retrotransposons, processed pseudogenes disperse coding and noncoding sequences rich with regulatory potential throughout the human genome. While these sequences may appear to be non-functional, a lack of contemporary function does not prohibit future development of biological activity. Here, we discuss the dynamic evolution of certain processed pseudogenes into coding and noncoding genes and regulatory elements, and their implication in wide-ranging biological and pathological processes. Also see the video abstract here: https://youtu.be/iUY_mteVoPI.
Collapse
Affiliation(s)
- Robin-Lee Troskie
- Mater Research Institute, University of Queensland, Woolloongabba, Australia
| | - Geoffrey J Faulkner
- Mater Research Institute, University of Queensland, Woolloongabba, Australia.,Queensland Brain Institute, University of Queensland, Brisbane, Australia
| | - Seth W Cheetham
- Mater Research Institute, University of Queensland, Woolloongabba, Australia
| |
Collapse
|
4
|
Pseudogenes: Four Decades of Discovery. Methods Mol Biol 2021. [PMID: 34165705 DOI: 10.1007/978-1-0716-1503-4_1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/10/2023]
Abstract
A pseudogene is defined as a genomic DNA sequence that looks like a mutated or truncated version of a known functional gene. Nearly four decades since their first discovery it has been estimated that between ~12,000 and ~20,000 pseudogenes exist in the human genome. Early efforts to characterize functions for pseudogenes were unsuccessful, thus they were considered functionless relics of evolutionary selection, junk DNA or genetic fossils. Remarkably, an increasing number of pseudogenes have been reported to be expressed as RNA transcripts above and beyond levels considered accidental or spurious transcription. There is emerging evidence that some expressed pseudogene transcripts have biological functions and should be defined as a subclass of functional long noncoding RNAs (lncRNA). In this introductory chapter, I briefly summarize the history and the current knowledge of pseudogenes, and highlight the emerging functions of some pseudogenes in human biology and disease. This second iteration of Pseudogenes in Methods in Molecular Biology highlights new methodological approaches to investigate this intriguing family of lncRNAs and the extent of their biological function.
Collapse
|
5
|
Tan L, Cheng W, Liu F, Wang DO, Wu L, Cao N, Wang J. Positive natural selection of N6-methyladenosine on the RNAs of processed pseudogenes. Genome Biol 2021; 22:180. [PMID: 34120636 PMCID: PMC8201931 DOI: 10.1186/s13059-021-02402-2] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2020] [Accepted: 06/04/2021] [Indexed: 01/01/2023] Open
Abstract
BACKGROUND Canonical nonsense-mediated decay (NMD) is an important splicing-dependent process for mRNA surveillance in mammals. However, processed pseudogenes are not able to trigger NMD due to their lack of introns. It is largely unknown whether they have evolved other surveillance mechanisms. RESULTS Here, we find that the RNAs of pseudogenes, especially processed pseudogenes, have dramatically higher m6A levels than their cognate protein-coding genes, associated with de novo m6A peaks and motifs in human cells. Furthermore, pseudogenes have rapidly accumulated m6A motifs during evolution. The m6A sites of pseudogenes are evolutionarily younger than neutral sites and their m6A levels are increasing, supporting the idea that m6A on the RNAs of pseudogenes is under positive selection. We then find that the m6A RNA modification of processed, rather than unprocessed, pseudogenes promotes cytosolic RNA degradation and attenuates interference with the RNAs of their cognate protein-coding genes. We experimentally validate the m6A RNA modification of two processed pseudogenes, DSTNP2 and NAP1L4P1, which promotes the RNA degradation of both pseudogenes and their cognate protein-coding genes DSTN and NAP1L4. In addition, the m6A of DSTNP2 regulation of DSTN is partially dependent on the miRNA miR-362-5p. CONCLUSIONS Our discovery reveals a novel evolutionary role of m6A RNA modification in cleaning up the unnecessary processed pseudogene transcripts to attenuate their interference with the regulatory network of protein-coding genes.
Collapse
Affiliation(s)
- Liqiang Tan
- Department of Medical Informatics, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, China
- Center for Stem Cell Biology and Tissue Engineering, Key Laboratory for Stem Cells and Tissue Engineering, Ministry of Education, Sun Yat-sen University, Guangzhou, 510080, China
| | - Weisheng Cheng
- Department of Medical Informatics, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, China
- Center for Stem Cell Biology and Tissue Engineering, Key Laboratory for Stem Cells and Tissue Engineering, Ministry of Education, Sun Yat-sen University, Guangzhou, 510080, China
| | - Fang Liu
- Center for Stem Cell Biology and Tissue Engineering, Key Laboratory for Stem Cells and Tissue Engineering, Ministry of Education, Sun Yat-sen University, Guangzhou, 510080, China
| | - Dan Ohtan Wang
- Center for Biosystems Dynamics Research, RIKEN, 2-2-3 Minatojima-minamimachi, Chuo-ku, Kobe, Hyogo, 650-0047, Japan
- Wuya College of Innovation, Shenyang Pharmaceutical University, Shenyang, 110016, China
| | - Linwei Wu
- Center for Stem Cell Biology and Tissue Engineering, Key Laboratory for Stem Cells and Tissue Engineering, Ministry of Education, Sun Yat-sen University, Guangzhou, 510080, China
| | - Nan Cao
- Center for Stem Cell Biology and Tissue Engineering, Key Laboratory for Stem Cells and Tissue Engineering, Ministry of Education, Sun Yat-sen University, Guangzhou, 510080, China
| | - Jinkai Wang
- Department of Medical Informatics, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, China.
- Center for Stem Cell Biology and Tissue Engineering, Key Laboratory for Stem Cells and Tissue Engineering, Ministry of Education, Sun Yat-sen University, Guangzhou, 510080, China.
- RNA Biomedical Institute, Sun Yat-sen Memorial Hospital, Sun Yat-sen University, Guangzhou, 510120, China.
| |
Collapse
|
6
|
Ni H, Zhi R, Zuo J, Liu W, Xie P, Zhi Z. Pseudogene ANXA2P2 knockdown shows tumor-suppressive function by inhibition of the PI3K/PKB pathway in glioblastoma cells. J Biochem Mol Toxicol 2021; 35:e22824. [PMID: 34047431 DOI: 10.1002/jbt.22824] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2020] [Revised: 02/04/2021] [Accepted: 05/18/2021] [Indexed: 01/15/2023]
Abstract
The pseudogene annexin A2 pseudogene 2 (ANXA2P2) is highly expressed in glioblastoma (GBM). However, its role and mechanism involved in the progression of GBM remain poorly understood. ANXA2P2 messenger RNA expression was measured by quantitative reverse transcription-polymerase chain reaction. The protein levels were detected by Western blot. Cell viability was evaluated by the 3-(4,5-dimethylthiazol-2-yl)-2,5-diphenyltetrazolium bromide and lactate dehydrogenase (LDH) release assays. Cell invasive ability was investigated by the transwell assay and by epithelial-mesenchymal transition (EMT). Cell apoptosis was examined by flow cytometry. The results showed that ANXA2P2 expression was increased in GBM tissues and cells. Silencing of ANXA2P2 inhibited the activation of the phosphoinositide 3-kinase (PI3K)/protein kinase B (PKB) pathway in GBM cells. Knockdown of ANXA2P2 decreased cell viability, promoted LDH release, suppressed cell invasive ability, and EMT, and induced cell apoptosis in GBM cells. The addition of the PI3K/PKB activator 740Y-P abrogated the effects of ANXA2P2 knockdown on cell viability, LDH release, invasive ability, and apoptosis. In conclusion, knockdown of ANXA2P2 inhibited cell viability and invasion but promoted the apoptotic rate by suppressing the PI3K/PKB pathway in GBM cells. ANXA2P2 may represent a new target for the treatment of GBM.
Collapse
Affiliation(s)
- Hongzao Ni
- Department of Neurosurgery, The Second People's Hospital of Huai'an, The Affiliated Huai'an Hospital of Xuzhou Medical University, Huai'an, China
| | - Rongrong Zhi
- Department of Gastroenterology, Lianshui County People's Hospital Affiliated to Kangda College of Nanjing Medical University, Huai'an, China
| | - Jiandong Zuo
- Department of Neurosurgery, The Second People's Hospital of Huai'an, The Affiliated Huai'an Hospital of Xuzhou Medical University, Huai'an, China
| | - Wenguang Liu
- Department of Neurosurgery, The Second People's Hospital of Huai'an, The Affiliated Huai'an Hospital of Xuzhou Medical University, Huai'an, China
| | - Peng Xie
- Department of Neurosurgery, The Second People's Hospital of Huai'an, The Affiliated Huai'an Hospital of Xuzhou Medical University, Huai'an, China
| | - Zhongwen Zhi
- Department of Neurosurgery, The Second People's Hospital of Huai'an, The Affiliated Huai'an Hospital of Xuzhou Medical University, Huai'an, China
| |
Collapse
|
7
|
Abstract
The number of complete genome sequences explodes more and more with each passing year. Thus, methods for genome annotation need to be honed constantly to handle the deluge of information. Annotation of pseudogenes (i.e., gene copies that appear not to make a functional protein) in genomes is a persistent problem; here, we overview pseudogene annotation methods that are based on the detection of sequence homology in genomic DNA.
Collapse
Affiliation(s)
- Paul M Harrison
- Department of Biology, McGill University, Montreal, QC, Canada.
| |
Collapse
|
8
|
Lister NC, Johnsson P, Waters PD, Morris KV. Pseudogenes: A Novel Source of Trans-Acting Antisense RNAs. Methods Mol Biol 2021; 2324:219-236. [PMID: 34165718 DOI: 10.1007/978-1-0716-1503-4_14] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
Several recent studies support a functional role for pseudogenes, a copy of a parent gene that has lost protein-coding potential, which was for a long time thought to represent only "junk" DNA. Several hundreds of pseudogenes have now been reported as transcribed RNAs in a large variety of tissues and tumor types. Most studies have focused on pseudogenes expressed in sense direction, relative to their protein-coding gene counterpart, but some reports suggest that pseudogenes can be also transcribed as antisense RNAs (asRNAs). Key regulatory genes, such as PTEN and OCT4, have in fact been reported to be under the regulation of pseudogene-expressed asRNAs. Here, we review what is known about pseudogene-expressed asRNAs, we discuss the functional role that these transcripts may have in gene regulation and we summarize the techniques that are available to study them.
Collapse
Affiliation(s)
- Nicholas C Lister
- School of Biotechnology and Biomedical Sciences, The University of New South Wales, Sydney, NSW, Australia.
| | - Per Johnsson
- Department of Cell and Molecular Biology, Karolinska Institutet, Stockholm, Sweden
| | - Paul D Waters
- School of Biotechnology and Biomedical Sciences, The University of New South Wales, Sydney, NSW, Australia
| | - Kevin V Morris
- Menzies Health Institute and School of Pharmacology and Medical Sciences, Griffith University, Southport, QLD, Australia
| |
Collapse
|
9
|
Molecular fossils “pseudogenes” as functional signature in biological system. Genes Genomics 2020; 42:619-630. [DOI: 10.1007/s13258-020-00935-7] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2019] [Accepted: 04/03/2020] [Indexed: 12/11/2022]
|
10
|
Machado JP, Antunes A. The genomic context of retrocopies increases their chance of functional relevancy in mammals. Genomics 2020; 112:2410-2417. [PMID: 31981699 DOI: 10.1016/j.ygeno.2020.01.013] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2018] [Revised: 01/03/2020] [Accepted: 01/21/2020] [Indexed: 11/30/2022]
Abstract
Described as "junk" DNA, pseudogenes are dead structures of previously active genes present in genomes. Pseudogenes are categorized into two main classes: processed pseudogenes, formed through retrotransposition, and non-processed pseudogenes, typically originated from gene decay following duplication events. The term "processed pseudogene" has changed to "retrocopy" since they are likely to evolve new functional roles and became a retrogene. Here, we surveyed 38,080 retrocopies from chimpanzee, dog, human, mouse, and rat genomes to assess their potential adaptive value. The retrocopies inserted in the same chromosome of the parental gene have higher chances of remain potentially "active" (absence of premature stop codons and frameshifts) (~26.1%), while those placed into a different chromosome have a twofold decrease chance of continuing potentially "active" (~7.52%). The genomic context of their placement seems associated with their expression. Retrocopies placed in intragenic regions and the same sense of the "host" gene have higher chances of being expressed relative to other genomic contexts. The proximity of retrocopies to their parental gene is associated with a lower decay rate, and their location likely influence their expression. Thus, despite their unclear role, retrocopies are probably involved in adaptive processes. Our results evidence natural selection acting in retrocopies.
Collapse
Affiliation(s)
- João Paulo Machado
- CIIMAR/CIMAR, Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Terminal de Cruzeiros do Porto de Leixões, Av. General Norton de Matos, s/n, 4450-208 Porto, Portugal; Abel Salazar Biomedical Sciences Institute (ICBAS), University of Porto, Porto, Portugal
| | - Agostinho Antunes
- CIIMAR/CIMAR, Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Terminal de Cruzeiros do Porto de Leixões, Av. General Norton de Matos, s/n, 4450-208 Porto, Portugal; Department of Biology, Faculty of Sciences, University of Porto, 4169 007 Porto, Portugal.
| |
Collapse
|
11
|
Chen X, Wan L, Wang W, Xi WJ, Yang AG, Wang T. Re-recognition of pseudogenes: From molecular to clinical applications. Theranostics 2020; 10:1479-1499. [PMID: 32042317 PMCID: PMC6993246 DOI: 10.7150/thno.40659] [Citation(s) in RCA: 48] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2019] [Accepted: 11/12/2019] [Indexed: 12/14/2022] Open
Abstract
Pseudogenes were initially regarded as "nonfunctional" genomic elements that did not have protein-coding abilities due to several endogenous inactivating mutations. Although pseudogenes are widely expressed in prokaryotes and eukaryotes, for decades, they have been largely ignored and classified as gene "junk" or "relics". With the widespread availability of high-throughput sequencing analysis, especially omics technologies, knowledge concerning pseudogenes has substantially increased. Pseudogenes are evolutionarily conserved and derive primarily from a mutation or retrotransposon, conferring the pseudogene with a "gene repository" role to store and expand genetic information. In contrast to previous notions, pseudogenes have a variety of functions at the DNA, RNA and protein levels for broadly participating in gene regulation to influence the development and progression of certain diseases, especially cancer. Indeed, some pseudogenes have been proven to encode proteins, strongly contradicting their "trash" identification, and have been confirmed to have tissue-specific and disease subtype-specific expression, indicating their own value in disease diagnosis. Moreover, pseudogenes have been correlated with the life expectancy of patients and exhibit great potential for future use in disease treatment, suggesting that they are promising biomarkers and therapeutic targets for clinical applications. In this review, we summarize the natural properties, functions, disease involvement and clinical value of pseudogenes. Although our knowledge of pseudogenes remains nascent, this field deserves more attention and deeper exploration.
Collapse
|
12
|
Stewart GL, Enfield KSS, Sage AP, Martinez VD, Minatel BC, Pewarchuk ME, Marshall EA, Lam WL. Aberrant Expression of Pseudogene-Derived lncRNAs as an Alternative Mechanism of Cancer Gene Regulation in Lung Adenocarcinoma. Front Genet 2019; 10:138. [PMID: 30894871 PMCID: PMC6414417 DOI: 10.3389/fgene.2019.00138] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2018] [Accepted: 02/11/2019] [Indexed: 12/27/2022] Open
Abstract
Transcriptome sequencing has led to the widespread identification of long non-coding RNAs (lncRNAs). Subsequently, these genes have been shown to hold functional importance in human cellular biology, which can be exploited by tumors to drive the hallmarks of cancer. Due to the complex tertiary structure and unknown binding motifs of lncRNAs, there is a growing disparity between the number of lncRNAs identified and those that have been functionally characterized. As such, lncRNAs deregulated in cancer may represent critical components of cancer pathways that could serve as novel therapeutic intervention points. Pseudogenes are non-coding DNA sequences that are defunct relatives of their protein-coding parent genes but retain high sequence similarity. Interestingly, certain lncRNAs expressed from pseudogene loci have been shown to regulate the protein-coding parent genes of these pseudogenes in trans particularly because of this sequence complementarity. We hypothesize that this phenomenon occurs more broadly than previously realized, and that aberrant expression of lncRNAs overlapping pseudogene loci provides an alternative mechanism of cancer gene deregulation. Using RNA-sequencing data from two cohorts of lung adenocarcinoma, each paired with patient-matched non-malignant lung samples, we discovered 104 deregulated pseudogene-derived lncRNAs. Remarkably, many of these deregulated lncRNAs (i) were expressed from the loci of pseudogenes related to known cancer genes, (ii) had expression that significantly correlated with protein-coding parent gene expression, and (iii) had lncRNA protein-coding parent gene expression that was significantly associated with survival. Here, we uncover evidence to suggest the lncRNA-pseudogene-protein-coding gene axis as a prominent mechanism of cancer gene regulation in lung adenocarcinoma, and highlights the clinical utility of exploring the non-coding regions of the cancer transcriptome.
Collapse
Affiliation(s)
| | - Katey S S Enfield
- BC Cancer Research Centre, Vancouver, BC, Canada.,The Francis Crick Institute, London, United Kingdom
| | - Adam P Sage
- BC Cancer Research Centre, Vancouver, BC, Canada
| | | | | | | | | | - Wan L Lam
- BC Cancer Research Centre, Vancouver, BC, Canada
| |
Collapse
|
13
|
Emadi-Baygi M, Sedighi R, Nourbakhsh N, Nikpour P. Pseudogenes in gastric cancer pathogenesis: a review article. Brief Funct Genomics 2018; 16:348-360. [PMID: 28459995 DOI: 10.1093/bfgp/elx004] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
Cancer burden rises globally at an alarming pace. According to GLOBOCAN 2012, gastric cancer (GC) is regarded as the fifth most common malignancy in the world. Being twice as high in men as in women, GC is the third leading cause of cancer mortality in both sexes globally. Being labeled as 'junk DNA', pseudogenes were considered as nonfunctional 'trash', which contribute nothing to survival of the organism; therefore, a number of strategies have been developed to circumvent their accidental detection. Recent progresses have confirmed that pseudogenes can have broad and multifaceted spectrum of activities in human cancers in general and GC in particular. Furthermore, the mentioned functions are parental gene-dependent and/or -independent. Therefore, pseudogenes can be regarded as the emerging class of elaborate modulators of gene expression involved in pathogenesis of human cancers including gastric adenocarcinoma.
Collapse
|
14
|
Haddadi N, Lin Y, Travis G, Simpson AM, McGowan EM, Nassif NT. PTEN/PTENP1: 'Regulating the regulator of RTK-dependent PI3K/Akt signalling', new targets for cancer therapy. Mol Cancer 2018; 17:37. [PMID: 29455665 PMCID: PMC5817727 DOI: 10.1186/s12943-018-0803-3] [Citation(s) in RCA: 179] [Impact Index Per Article: 29.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2017] [Accepted: 02/01/2018] [Indexed: 12/14/2022] Open
Abstract
Regulation of the PI-3 kinase (PI3K)/Akt signalling pathway is essential for maintaining the integrity of fundamental cellular processes, cell growth, survival, death and metabolism, and dysregulation of this pathway is implicated in the development and progression of cancers. Receptor tyrosine kinases (RTKs) are major upstream regulators of PI3K/Akt signalling. The phosphatase and tensin homologue (PTEN), a well characterised tumour suppressor, is a prime antagonist of PI3K and therefore a negative regulator of this pathway. Loss or inactivation of PTEN, which occurs in many tumour types, leads to overactivation of RTK/PI3K/Akt signalling driving tumourigenesis. Cellular PTEN levels are tightly regulated by a number of transcriptional, post-transcriptional and post-translational regulatory mechanisms. Of particular interest, transcription of the PTEN pseudogene, PTENP1, produces sense and antisense transcripts that exhibit post-transcriptional and transcriptional modulation of PTEN expression respectively. These additional levels of regulatory complexity governing PTEN expression add to the overall intricacies of the regulation of RTK/PI-3 K/Akt signalling. This review will discuss the regulation of oncogenic PI3K signalling by PTEN (the regulator) with a focus on the modulatory effects of the sense and antisense transcripts of PTENP1 on PTEN expression, and will further explore the potential for new therapeutic opportunities in cancer treatment.
Collapse
Affiliation(s)
- Nahal Haddadi
- School of Life Sciences, Faculty of Science, University of Technology Sydney, 15 Broadway, Ultimo, Sydney, NSW 2007 Australia
| | - Yiguang Lin
- School of Life Sciences, Faculty of Science, University of Technology Sydney, 15 Broadway, Ultimo, Sydney, NSW 2007 Australia
| | - Glena Travis
- School of Life Sciences, Faculty of Science, University of Technology Sydney, 15 Broadway, Ultimo, Sydney, NSW 2007 Australia
| | - Ann M. Simpson
- School of Life Sciences, Faculty of Science, University of Technology Sydney, 15 Broadway, Ultimo, Sydney, NSW 2007 Australia
| | - Eileen M. McGowan
- School of Life Sciences, Faculty of Science, University of Technology Sydney, 15 Broadway, Ultimo, Sydney, NSW 2007 Australia
- Central Laboratory, The First Affiliated Hospital of Guangdong Pharmaceutical University, Guangzhou, 510080 China
| | - Najah T. Nassif
- School of Life Sciences, Faculty of Science, University of Technology Sydney, 15 Broadway, Ultimo, Sydney, NSW 2007 Australia
| |
Collapse
|
15
|
Yang JR, Maclean CJ, Park C, Zhao H, Zhang J. Intra and Interspecific Variations of Gene Expression Levels in Yeast Are Largely Neutral: (Nei Lecture, SMBE 2016, Gold Coast). Mol Biol Evol 2017; 34:2125-2139. [PMID: 28575451 PMCID: PMC5850415 DOI: 10.1093/molbev/msx171] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
It is commonly, although not universally, accepted that most intra and interspecific genome sequence variations are more or less neutral, whereas a large fraction of organism-level phenotypic variations are adaptive. Gene expression levels are molecular phenotypes that bridge the gap between genotypes and corresponding organism-level phenotypes. Yet, it is unknown whether natural variations in gene expression levels are mostly neutral or adaptive. Here we address this fundamental question by genome-wide profiling and comparison of gene expression levels in nine yeast strains belonging to three closely related Saccharomyces species and originating from five different ecological environments. We find that the transcriptome-based clustering of the nine strains approximates the genome sequence-based phylogeny irrespective of their ecological environments. Remarkably, only ∼0.5% of genes exhibit similar expression levels among strains from a common ecological environment, no greater than that among strains with comparable phylogenetic relationships but different environments. These and other observations strongly suggest that most intra and interspecific variations in yeast gene expression levels result from the accumulation of random mutations rather than environmental adaptations. This finding has profound implications for understanding the driving force of gene expression evolution, genetic basis of phenotypic adaptation, and general role of stochasticity in evolution.
Collapse
Affiliation(s)
- Jian-Rong Yang
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI
| | - Calum J. Maclean
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI
| | - Chungoo Park
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI
| | - Huabin Zhao
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI
| | - Jianzhi Zhang
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI
| |
Collapse
|
16
|
Bailey J. Monkey-based research on human disease: the implications of genetic differences. Altern Lab Anim 2016; 42:287-317. [PMID: 25413291 DOI: 10.1177/026119291404200504] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Assertions that the use of monkeys to investigate human diseases is valid scientifically are frequently based on a reported 90-93% genetic similarity between the species. Critical analyses of the relevance of monkey studies to human biology, however, indicate that this genetic similarity does not result in sufficient physiological similarity for monkeys to constitute good models for research, and that monkey data do not translate well to progress in clinical practice for humans. Salient examples include the failure of new drugs in clinical trials, the highly different infectivity and pathology of SIV/HIV, and poor extrapolation of research on Alzheimer's disease, Parkinson's disease and stroke. The major molecular differences underlying these inter-species phenotypic disparities have been revealed by comparative genomics and molecular biology - there are key differences in all aspects of gene expression and protein function, from chromosome and chromatin structure to post-translational modification. The collective effects of these differences are striking, extensive and widespread, and they show that the superficial similarity between human and monkey genetic sequences is of little benefit for biomedical research. The extrapolation of biomedical data from monkeys to humans is therefore highly unreliable, and the use of monkeys must be considered of questionable value, particularly given the breadth and potential of alternative methods of enquiry that are currently available to scientists.
Collapse
Affiliation(s)
- Jarrod Bailey
- New England Anti-Vivisection Society (NEAVS), Boston, MA, USA
| |
Collapse
|
17
|
Abstract
By definition, pseudogenes are relics of former genes that no longer possess biological functions. Operationally, they are identified based on disruptions of open reading frames (ORFs) or presumed losses of promoter activities. Intriguingly, a recent human proteomic study reported peptides encoded by 107 pseudogenes. These peptides may play currently unrecognized physiological roles. Alternatively, they may have resulted from accidental translations of pseudogene transcripts and possess no function. Comparing between human and macaque orthologs, we show that the nonsynonymous to synonymous substitution rate ratio (ω) is significantly smaller for translated pseudogenes than other pseudogenes. In particular, five of 34 translated pseudogenes amenable to evolutionary analysis have ω values significantly lower than 1, indicative of the action of purifying selection. This and other findings demonstrate that some but not all translated pseudogenes have selected functions at the protein level. Hence, neither ORF disruption nor presence of protein product disproves or proves gene functionality at the protein level.
Collapse
Affiliation(s)
- Jinrui Xu
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor
| | - Jianzhi Zhang
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor
| |
Collapse
|
18
|
Koufariotis LT, Chen YPP, Chamberlain A, Vander Jagt C, Hayes BJ. A catalogue of novel bovine long noncoding RNA across 18 tissues. PLoS One 2015; 10:e0141225. [PMID: 26496443 PMCID: PMC4619662 DOI: 10.1371/journal.pone.0141225] [Citation(s) in RCA: 70] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2015] [Accepted: 10/05/2015] [Indexed: 11/19/2022] Open
Abstract
Long non-coding RNA (lncRNA) have been implicated in diverse biological roles including gene regulation and genomic imprinting. Identifying lncRNA in bovine across many differing tissue would contribute to the current repertoire of bovine lncRNA, and help further improve our understanding of the evolutionary importance and constraints of these transcripts. Additionally, it could aid in identifying sites in the genome outside of protein coding genes where mutations could contribute to variation in complex traits. This is particularly important in bovine as genomic predictions are increasingly used in genetic improvement for milk and meat production. Our aim was to identify and annotate novel long non coding RNA transcripts in the bovine genome captured from RNA Sequencing (RNA-Seq) data across 18 tissues, sampled in triplicate from a single cow. To address the main challenge in identifying lncRNA, namely distinguishing lncRNA transcripts from unannotated genes and protein coding genes, a lncRNA identification pipeline with a number of filtering steps was developed. A total of 9,778 transcripts passed the filtering pipeline. The bovine lncRNA catalogue includes MALAT1 and HOTAIR, both of which have been well described in human and mouse genomes. We attempted to validate the lncRNA in libraries from three additional cows. 726 (87.47%) liver and 1,668 (55.27%) blood class 3 lncRNA were validated with stranded liver and blood libraries respectively. Additionally, this study identified a large number of novel unknown transcripts in the bovine genome with high protein coding potential, illustrating a clear need for better annotations of protein coding genes.
Collapse
Affiliation(s)
- Lambros T. Koufariotis
- College of Science, Health and Engineering, La Trobe University Bundoora, Melbourne, Victoria, Australia
- Department of Environment and Primary Industries, AgriBio Bundoora, Melbourne, Victoria, Australia
- Dairy Futures Co-operative Research Centre, Melbourne, Victoria, Australia
- * E-mail:
| | - Yi-Ping Phoebe Chen
- College of Science, Health and Engineering, La Trobe University Bundoora, Melbourne, Victoria, Australia
| | - Amanda Chamberlain
- Department of Environment and Primary Industries, AgriBio Bundoora, Melbourne, Victoria, Australia
- Dairy Futures Co-operative Research Centre, Melbourne, Victoria, Australia
| | - Christy Vander Jagt
- Department of Environment and Primary Industries, AgriBio Bundoora, Melbourne, Victoria, Australia
- Dairy Futures Co-operative Research Centre, Melbourne, Victoria, Australia
| | - Ben J. Hayes
- College of Science, Health and Engineering, La Trobe University Bundoora, Melbourne, Victoria, Australia
- Department of Environment and Primary Industries, AgriBio Bundoora, Melbourne, Victoria, Australia
- Dairy Futures Co-operative Research Centre, Melbourne, Victoria, Australia
| |
Collapse
|
19
|
Mahmudi O, Sennblad B, Arvestad L, Nowick K, Lagergren J. Gene-pseudogene evolution: a probabilistic approach. BMC Genomics 2015; 16 Suppl 10:S12. [PMID: 26449131 PMCID: PMC4602177 DOI: 10.1186/1471-2164-16-s10-s12] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Over the last decade, methods have been developed for the reconstruction of gene trees that take into account the species tree. Many of these methods have been based on the probabilistic duplication-loss model, which describes how a gene-tree evolves over a species-tree with respect to duplication and losses, as well as extension of this model, e.g., the DLRS (Duplication, Loss, Rate and Sequence evolution) model that also includes sequence evolution under relaxed molecular clock. A disjoint, almost as recent, and very important line of research has been focused on non protein-coding, but yet, functional DNA. For instance, DNA sequences being pseudogenes in the sense that they are not translated, may still be transcribed and the thereby produced RNA may be functional. We extend the DLRS model by including pseudogenization events and devise an MCMC framework for analyzing extended gene families consisting of genes and pseudogenes with respect to this model, i.e., reconstructing gene-trees and identifying pseudogenization events in the reconstructed gene-trees. By applying the MCMC framework to biologically realistic synthetic data, we show that gene-trees as well as pseudogenization points can be inferred well. We also apply our MCMC framework to extended gene families belonging to the Olfactory Receptor and Zinc Finger superfamilies. The analysis indicate that both these super families contains very old pseudogenes, perhaps so old that it is reasonable to suspect that some are functional. In our analysis, the sub families of the Olfactory Receptors contains only lineage specific pseudogenes, while the sub families of the Zinc Fingers contains pseudogene lineages common to several species.
Collapse
|
20
|
Roberts TC, Morris KV, Wood MJA. The role of long non-coding RNAs in neurodevelopment, brain function and neurological disease. Philos Trans R Soc Lond B Biol Sci 2015; 369:rstb.2013.0507. [PMID: 25135968 DOI: 10.1098/rstb.2013.0507] [Citation(s) in RCA: 140] [Impact Index Per Article: 15.6] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Long non-coding RNAs (lncRNAs) are transcripts with low protein-coding potential that represent a large proportion of the transcriptional output of the cell. Many lncRNAs exhibit features indicative of functionality including tissue-restricted expression, localization to distinct subcellular structures, regulated expression and evolutionary conservation. Some lncRNAs have been shown to associate with chromatin-modifying activities and transcription factors, suggesting that a common mode of action may be to guide protein complexes to target genomic loci. However, the functions (if any) of the vast majority of lncRNA transcripts are currently unknown, and the subject of investigation. Here, we consider the putative role(s) of lncRNAs in neurodevelopment and brain function with an emphasis on the epigenetic regulation of gene expression. Associations of lncRNAs with neurodevelopmental/neuropsychiatric disorders, neurodegeneration and brain cancers are also discussed.
Collapse
Affiliation(s)
- Thomas C Roberts
- Department of Physiology, Anatomy and Genetics, University of Oxford, South Parks Road, Oxford OX1 3QX, UK Department of Molecular and Experimental Medicine, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, CA 92037, USA
| | - Kevin V Morris
- Department of Molecular and Experimental Medicine, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, CA 92037, USA School of Biotechnology and Biomedical Sciences, University of New South Wales, Sydney, New South Wales 2052, Australia
| | - Matthew J A Wood
- Department of Physiology, Anatomy and Genetics, University of Oxford, South Parks Road, Oxford OX1 3QX, UK
| |
Collapse
|
21
|
Hezroni H, Koppstein D, Schwartz MG, Avrutin A, Bartel DP, Ulitsky I. Principles of long noncoding RNA evolution derived from direct comparison of transcriptomes in 17 species. Cell Rep 2015; 11:1110-22. [PMID: 25959816 DOI: 10.1016/j.celrep.2015.04.023] [Citation(s) in RCA: 448] [Impact Index Per Article: 49.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2014] [Revised: 03/02/2015] [Accepted: 04/09/2015] [Indexed: 12/15/2022] Open
Abstract
The inability to predict long noncoding RNAs from genomic sequence has impeded the use of comparative genomics for studying their biology. Here, we develop methods that use RNA sequencing (RNA-seq) data to annotate the transcriptomes of 16 vertebrates and the echinoid sea urchin, uncovering thousands of previously unannotated genes, most of which produce long intervening noncoding RNAs (lincRNAs). Although in each species, >70% of lincRNAs cannot be traced to homologs in species that diverged >50 million years ago, thousands of human lincRNAs have homologs with similar expression patterns in other species. These homologs share short, 5'-biased patches of sequence conservation nested in exonic architectures that have been extensively rewired, in part by transposable element exonization. Thus, over a thousand human lincRNAs are likely to have conserved functions in mammals, and hundreds beyond mammals, but those functions require only short patches of specific sequences and can tolerate major changes in gene architecture.
Collapse
Affiliation(s)
- Hadas Hezroni
- Department of Biological Regulation, Weizmann Institute of Science, Rehovot 76100, Israel
| | - David Koppstein
- Whitehead Institute for Biomedical Research, Cambridge, MA 02142, USA; Howard Hughes Medical Institute and Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | | | - Alexandra Avrutin
- Department of Biological Regulation, Weizmann Institute of Science, Rehovot 76100, Israel
| | - David P Bartel
- Whitehead Institute for Biomedical Research, Cambridge, MA 02142, USA; Howard Hughes Medical Institute and Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Igor Ulitsky
- Department of Biological Regulation, Weizmann Institute of Science, Rehovot 76100, Israel.
| |
Collapse
|
22
|
Abstract
Historically pseudogenes were believed to represent nonfunctional genomic fossils; however, there is emerging evidence that many of them could be biologically active. This possibility has ignited interest in pseudogene loci and made the need for their high-quality annotation more pressing as an accurate knowledge of all pseudogenes in the human reference genome sequence facilitates confident functional analysis. GENCODE have undertaken the first genome-wide pseudogene assignment for protein-coding genes combining both large-scale manual annotation and computational pseudogene prediction pipelines. Multiple computational predictions provide an unbiased set of hints for manual annotators to investigate, both during first-pass annotation and as part of QC to identify any potential missing pseudogene loci. Where a pseudogene is identified, the extent of its homology to the parent locus is fully investigated by a manual annotator; a pseudogene model is built and assigned to one of eight pseudogene biotypes depending on the mechanism of creation and on the presence of locus-specific transcriptional or proteomic data. The high-quality, information-rich set of pseudogenes created has been integrated with ENCODE functional genomics data, specifically expression level, transcription factor and RNA polymerase II binding, and chromatin marks. In this way we have been able to identify some pseudogenes that possess conventional characteristics of functionality as well as others with interesting patterns of partial activity, which might suggest that putatively inactive loci could be gaining a novel function, for example as long noncoding RNAs. The activity data associated with every pseudogene is stored in the psiDR resource.
Collapse
|
23
|
Xiao-Jie L, Ai-Mei G, Li-Juan J, Jiang X. Pseudogene in cancer: real functions and promising signature. J Med Genet 2014; 52:17-24. [DOI: 10.1136/jmedgenet-2014-102785] [Citation(s) in RCA: 81] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
|
24
|
Dharia AP, Obla A, Gajdosik MD, Simon A, Nelson CE. Tempo and mode of gene duplication in mammalian ribosomal protein evolution. PLoS One 2014; 9:e111721. [PMID: 25369106 PMCID: PMC4219774 DOI: 10.1371/journal.pone.0111721] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2014] [Accepted: 10/06/2014] [Indexed: 12/17/2022] Open
Abstract
Gene duplication has been widely recognized as a major driver of evolutionary change and organismal complexity through the generation of multi-gene families. Therefore, understanding the forces that govern the evolution of gene families through the retention or loss of duplicated genes is fundamentally important in our efforts to study genome evolution. Previous work from our lab has shown that ribosomal protein (RP) genes constitute one of the largest classes of conserved duplicated genes in mammals. This result was surprising due to the fact that ribosomal protein genes evolve slowly and transcript levels are very tightly regulated. In our present study, we identified and characterized all RP duplicates in eight mammalian genomes in order to investigate the tempo and mode of ribosomal protein family evolution. We show that a sizable number of duplicates are transcriptionally active and are very highly conserved. Furthermore, we conclude that existing gene duplication models do not readily account for the preservation of a very large number of intact retroduplicated ribosomal protein (RT-RP) genes observed in mammalian genomes. We suggest that selection against dominant-negative mutations may underlie the unexpected retention and conservation of duplicated RP genes, and may shape the fate of newly duplicated genes, regardless of duplication mechanism.
Collapse
Affiliation(s)
- Asav P. Dharia
- University of Connecticut Department of Molecular and Cell Biology, Storrs, Connecticut, United States of America
| | - Ajay Obla
- University of Connecticut Department of Molecular and Cell Biology, Storrs, Connecticut, United States of America
| | - Matthew D. Gajdosik
- University of Connecticut Department of Molecular and Cell Biology, Storrs, Connecticut, United States of America
| | - Amanda Simon
- University of Connecticut Department of Molecular and Cell Biology, Storrs, Connecticut, United States of America
| | - Craig E. Nelson
- University of Connecticut Department of Molecular and Cell Biology, Storrs, Connecticut, United States of America
- * E-mail:
| |
Collapse
|
25
|
Roberts TC, Morris KV. Not so pseudo anymore: pseudogenes as therapeutic targets. Pharmacogenomics 2014; 14:2023-34. [PMID: 24279857 DOI: 10.2217/pgs.13.172] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
Pseudogenes are junk DNA gene remnants generated by inactivating mutations or the loss of regulatory sequences, often following gene duplication or retrotransposition events. These pseudogenes have previously been considered to be molecular fossils derived from once-coding genes. In many cases, pseudogenes confer no observable selective advantage to the host organism and may be on a path towards removal from the genome. However, pseudogenes can also serve as raw material for the exaptation of novel functions, particularly in relation to the regulation of gene expression. Many pseudogenes are resurrected as noncoding RNA genes, which function in RNA-based gene regulatory circuits. As such, functional pseudogenes might simply be considered as 'genes'. Here, we discuss the role of these pseudogene-derived RNAs as regulators of gene expression in the context of human disease. In particular, we consider the manipulation of pseudogene transcripts through the use of antisense oligonucleotides, siRNAs, aptamers or classical gene therapy approaches as novel pharmacological strategies.
Collapse
Affiliation(s)
- Thomas C Roberts
- Department of Molecular & Experimental Medicine, The Scripps Research Institute, 10550 N Torrey Pines Road, La Jolla, CA 92037, USA
| | | |
Collapse
|
26
|
Abstract
A pseudogene arises when a gene loses the ability to produce a protein, which can be due to mutation or inaccurate duplication. Previous dogma has dictated that because the pseudogene no longer produces a protein it becomes functionless and evolutionarily inert, being neither conserved nor removed. However, recent evidence has forced a re-evaluation of this view. Some pseudogenes, although not translated into protein, are at least transcribed into RNA. In some cases, these pseudogene transcripts are capable of influencing the activity of other genes that code for proteins, thereby altering expression and in turn affecting the phenotype of the organism. In the present chapter, we will define pseudogenes, describe the evidence that they are transcribed into non-coding RNAs and outline the mechanisms by which they are able to influence the machinery of the eukaryotic cell.
Collapse
|
27
|
Characterization of human pseudogene-derived non-coding RNAs for functional potential. PLoS One 2014; 9:e93972. [PMID: 24699680 PMCID: PMC3974860 DOI: 10.1371/journal.pone.0093972] [Citation(s) in RCA: 46] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2014] [Accepted: 03/10/2014] [Indexed: 11/19/2022] Open
Abstract
Thousands of pseudogenes exist in the human genome and many are transcribed, but their functional potential remains elusive and understudied. To explore these issues systematically, we first developed a computational pipeline to identify transcribed pseudogenes from RNA-Seq data. Applying the pipeline to datasets from 16 distinct normal human tissues identified ∼ 3,000 pseudogenes that could produce non-coding RNAs in a manner of low abundance but high tissue specificity under normal physiological conditions. Cross-tissue comparison revealed that the transcriptional profiles of pseudogenes and their parent genes showed mostly positive correlations, suggesting that pseudogene transcription could have a positive effect on the expression of their parent genes, perhaps by functioning as competing endogenous RNAs (ceRNAs), as previously suggested and demonstrated with the PTEN pseudogene, PTENP1. Our analysis of the ENCODE project data also found many transcriptionally active pseudogenes in the GM12878 and K562 cell lines; moreover, it showed that many human pseudogenes produced small RNAs (sRNAs) and some pseudogene-derived sRNAs, especially those from antisense strands, exhibited evidence of interfering with gene expression. Further integrated analysis of transcriptomics and epigenomics data, however, demonstrated that trimethylation of histone 3 at lysine 9 (H3K9me3), a posttranslational modification typically associated with gene repression and heterochromatin, was enriched at many transcribed pseudogenes in a transcription-level dependent manner in the two cell lines. The H3K9me3 enrichment was more prominent in pseudogenes that produced sRNAs at pseudogene loci and their adjacent regions, an observation further supported by the co-enrichment of SETDB1 (a H3K9 methyltransferase), suggesting that pseudogene sRNAs may have a role in regional chromatin repression. Taken together, our comprehensive and systematic characterization of pseudogene transcription uncovers a complex picture of how pseudogene ncRNAs could influence gene and pseudogene expression, at both epigenetic and post-transcriptional levels.
Collapse
|
28
|
Abstract
The number of complete genome sequences explodes more and more with each passing year. Thus, methods for genome annotation need to be honed constantly to handle the deluge of information. Annotation of pseudogenes (i.e., gene copies that appear not to make a functional protein) in genomes is a persistent problem; here, we overview pseudogene annotation methods that are based on the detection of sequence homology in genomic DNA.
Collapse
Affiliation(s)
- Paul M Harrison
- Department of Biology, McGill University, Stewart Biology Building, 1205 Doctor Penfield Avenue, Montreal, QC, Canada, H3A 1B1,
| |
Collapse
|
29
|
Korrodi-Gregório L, Abrantes J, Muller T, Melo-Ferreira J, Marcus K, da Cruz e Silva OAB, Fardilha M, Esteves PJ. Not so pseudo: the evolutionary history of protein phosphatase 1 regulatory subunit 2 and related pseudogenes. BMC Evol Biol 2013; 13:242. [PMID: 24195737 PMCID: PMC3840573 DOI: 10.1186/1471-2148-13-242] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2013] [Accepted: 10/29/2013] [Indexed: 01/23/2023] Open
Abstract
Background Pseudogenes are traditionally considered “dead” genes, therefore lacking biological functions. This view has however been challenged during the last decade. This is the case of the Protein phosphatase 1 regulatory subunit 2 (PPP1R2) or inhibitor-2 gene family, for which several incomplete copies exist scattered throughout the genome. Results In this study, the pseudogenization process of PPP1R2 was analyzed. Ten PPP1R2-related pseudogenes (PPP1R2P1-P10), highly similar to PPP1R2, were retrieved from the human genome assembly present in the databases. The phylogenetic analysis of mammalian PPP1R2 and related pseudogenes suggested that PPP1R2P7 and PPP1R2P9 retroposons appeared before the great mammalian radiation, while the remaining pseudogenes are primate-specific and retroposed at different times during Primate evolution. Although considered inactive, four of these pseudogenes seem to be transcribed and possibly possess biological functions. Given the role of PPP1R2 in sperm motility, the presence of these proteins was assessed in human sperm, and two PPP1R2-related proteins were detected, PPP1R2P3 and PPP1R2P9. Signatures of negative and positive selection were also detected in PPP1R2P9, further suggesting a role as a functional protein. Conclusions The results show that contrary to initial observations PPP1R2-related pseudogenes are not simple bystanders of the evolutionary process but may rather be at the origin of genes with novel functions.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | - Pedro J Esteves
- CIBIO-UP, Centro de Investigação em Biodiversidade e Recursos Genéticos, Universidade do Porto, InBIO, Laboratório Associado, Campus Agrário de Vairão, Vairão, Portugal.
| |
Collapse
|
30
|
Abstract
Long intervening noncoding RNAs (lincRNAs) are transcribed from thousands of loci in mammalian genomes and might play widespread roles in gene regulation and other cellular processes. This Review outlines the emerging understanding of lincRNAs in vertebrate animals, with emphases on how they are being identified and current conclusions and questions regarding their genomics, evolution and mechanisms of action.
Collapse
Affiliation(s)
- Igor Ulitsky
- Whitehead Institute for Biomedical Research, Cambridge, MA 02142, USA
| | | |
Collapse
|
31
|
Sen K, Ghosh TC. Pseudogenes and their composers: delving in the 'debris' of human genome. Brief Funct Genomics 2013; 12:536-47. [PMID: 23900003 DOI: 10.1093/bfgp/elt026] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023] Open
Abstract
Pseudogenes, the nonfunctional homologs of functional genes and thus exemplified as 'genomic fossils' provide intriguing snapshots of the evolutionary history of human genome. These defunct copies generally arise by retrotransposition or duplication followed by various genetic disablements. In this study, focusing on human pseudogenes and their functional homologues we describe their characteristic features and relevance to protein sequence evolution. We recapitulate that pseudogenes harbor disease-causing degenerative sequence variations in conjunction with the immense disease gene association of their progenitors. Furthermore, we also discuss the issue of functional resurrection and the potentiality observed in some pseudogenes to regulate their functional counterparts.
Collapse
Affiliation(s)
- Kamalika Sen
- Bioinformatics Centre, Bose Institute, P 1/12, C.I.T. Scheme VII M, Kolkata 700 054, India. Tel.: +91 33 2355 6626; Fax: +91 33 2355 3886;
| | | |
Collapse
|
32
|
Antonov I, Coakley A, Atkins JF, Baranov PV, Borodovsky M. Identification of the nature of reading frame transitions observed in prokaryotic genomes. Nucleic Acids Res 2013; 41:6514-30. [PMID: 23649834 PMCID: PMC3711429 DOI: 10.1093/nar/gkt274] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Our goal was to identify evolutionary conserved frame transitions in protein coding regions and to uncover an underlying functional role of these structural aberrations. We used the ab initio frameshift prediction program, GeneTack, to detect reading frame transitions in 206 991 genes (fs-genes) from 1106 complete prokaryotic genomes. We grouped 102 731 fs-genes into 19 430 clusters based on sequence similarity between protein products (fs-proteins) as well as conservation of predicted position of the frameshift and its direction. We identified 4010 pseudogene clusters and 146 clusters of fs-genes apparently using recoding (local deviation from using standard genetic code) due to possessing specific sequence motifs near frameshift positions. Particularly interesting was finding of a novel type of organization of the dnaX gene, where recoding is required for synthesis of the longer subunit, τ. We selected 20 clusters of predicted recoding candidates and designed a series of genetic constructs with a reporter gene or affinity tag whose expression would require a frameshift event. Expression of the constructs in Escherichia coli demonstrated enrichment of the set of candidates with sequences that trigger genuine programmed ribosomal frameshifting; we have experimentally confirmed four new families of programmed frameshifts.
Collapse
Affiliation(s)
- Ivan Antonov
- School of Computational Science and Engineering at Georgia Tech, Atlanta, GA 30332, USA
| | | | | | | | | |
Collapse
|
33
|
Katju V. To the beat of a different drum: determinants implicated in the asymmetric sequence divergence of Caenorhabditis elegans paralogs. BMC Evol Biol 2013; 13:73. [PMID: 23530733 PMCID: PMC3637608 DOI: 10.1186/1471-2148-13-73] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2012] [Accepted: 03/20/2013] [Indexed: 12/18/2022] Open
Abstract
Background Gene duplicates often exhibit asymmetric rates of molecular evolution in their early evolutionary existence. This asymmetry in rates is thought to signify the maintenance of the ancestral function by one copy and the removal of functional constraint on the other copy, enabling it to embark on a novel evolutionary trajectory. Here I focused on a large population of evolutionarily young gene duplicates (KS ≤ 0.14) in the Caenorhabditis elegans genome in order to conduct the first combined analysis of four predictors (evolutionary age, chromosomal location, structural resemblance between duplicates, and duplication span) which may be implicated in the asymmetric sequence divergence of paralogs at the nucleotide and amino acid level. In addition, I investigate if either paralog is equally likely to embark on a trajectory of accelerated sequence evolution or whether the derived paralog is more likely to exhibit faster sequence evolution. Results Three predictors (evolutionary age of duplicates, chromosomal location and duplication span) serve as major determinants of sequence asymmetry between C. elegans paralogs. Paralogs diverge asymmetrically in sequence with increasing evolutionary age, the relocation of one copy to a different chromosome and attenuated duplication spans that likely fail to capture the entire ancestral repertoire of coding sequence and regulatory elements. Furthermore, for paralogs residing on the same chromosome, opposite transcriptional orientation and increased genomic distance do not increase sequence asymmetry between paralogs. For a subset of duplicate pairs wherein the ancestral versus derived paralog could be distinguished, the derived paralogs are more likely to evolve at accelerated rates. Conclusions This genome-wide study of evolutionarily young duplicates stemming primarily from DNA-mediated small-scale duplication events demonstrates that genomic relocation to a new chromosome has important consequences for asymmetric divergence of paralogs, akin to paralogs arising from RNA-mediated duplication events. Additionally, the duplication span is negatively correlated with sequence rate asymmetry among paralogs, suggesting that attenuated duplication spans stemming from incomplete duplication of the ORF and/or ancestral regulatory elements further accelerate sequence divergence between paralogs. Cumulatively, derived copies exhibit accelerated rates of sequence evolution suggesting that they are primed for a divergent evolutionary trajectory by changes in structure and genomic context at inception.
Collapse
Affiliation(s)
- Vaishali Katju
- Department of Biology, University of New Mexico, Albuquerque, NM 87131, USA.
| |
Collapse
|
34
|
Li W, Yang W, Wang XJ. Pseudogenes: pseudo or real functional elements? J Genet Genomics 2013; 40:171-7. [PMID: 23618400 DOI: 10.1016/j.jgg.2013.03.003] [Citation(s) in RCA: 53] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2013] [Revised: 03/04/2013] [Accepted: 03/04/2013] [Indexed: 11/24/2022]
Abstract
Pseudogenes are genomic remnants of ancient protein-coding genes which have lost their coding potentials through evolution. Although broadly existed, pseudogenes used to be considered as junk or relics of genomes which have not drawn enough attentions of biologists until recent years. With the broad applications of high-throughput experimental techniques, growing lines of evidence have strongly suggested that some pseudogenes possess special functions, including regulating parental gene expression and participating in the regulation of many biological processes. In this review, we summarize some basic features of pseudogenes and their functions in regulating development and diseases. All of these observations indicate that pseudogenes are not purely dead fossils of genomes, but warrant further exploration in their distribution, expression regulation and functions. A new nomenclature is desirable for the currently called 'pseudogenes' to better describe their functions.
Collapse
Affiliation(s)
- Wen Li
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China
| | | | | |
Collapse
|
35
|
Johnsson P, Ackley A, Vidarsdottir L, Lui WO, Corcoran M, Grandér D, Morris KV. A pseudogene long-noncoding-RNA network regulates PTEN transcription and translation in human cells. Nat Struct Mol Biol 2013; 20:440-6. [PMID: 23435381 PMCID: PMC3618526 DOI: 10.1038/nsmb.2516] [Citation(s) in RCA: 348] [Impact Index Per Article: 31.6] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2012] [Accepted: 01/04/2013] [Indexed: 11/09/2022]
Abstract
PTEN is a tumor suppressor gene that has been shown to be under the regulatory control of a PTEN pseudogene expressed noncoding RNA, PTENpg1. Here, we characterize a previously unidentified PTENpg1 encoded antisense RNA (asRNA), which regulates PTEN transcription and PTEN mRNA stability. We find two PTENpg1 asRNA isoforms, alpha and beta. The alpha isoform functions in trans, localizes to the PTEN promoter, and epigenetically modulates PTEN transcription by the recruitment of DNMT3a and EZH2. In contrast, the beta isoform interacts with PTENpg1 through an RNA:RNA pairing interaction, which affects PTEN protein output via changes of PTENpg1 stability and microRNA sponge activity. Disruption of this asRNA-regulated network induces cell cycle arrest and sensitizes cells to doxorubicin, suggesting a biological function for the respective PTENpg1 expressed asRNAs.
Collapse
Affiliation(s)
- Per Johnsson
- Department of Oncology and Pathology, Karolinska Institutet, Stockholm, Sweden
| | | | | | | | | | | | | |
Collapse
|
36
|
Chan WL, Yang WK, Huang HD, Chang JG. pseudoMap: an innovative and comprehensive resource for identification of siRNA-mediated mechanisms in human transcribed pseudogenes. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2013; 2013:bat001. [PMID: 23396300 PMCID: PMC3567485 DOI: 10.1093/database/bat001] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
RNA interference (RNAi) is a gene silencing process within living cells, which is controlled by the RNA-induced silencing complex with a sequence-specific manner. In flies and mice, the pseudogene transcripts can be processed into short interfering RNAs (siRNAs) that regulate protein-coding genes through the RNAi pathway. Following these findings, we construct an innovative and comprehensive database to elucidate siRNA-mediated mechanism in human transcribed pseudogenes (TPGs). To investigate TPG producing siRNAs that regulate protein-coding genes, we mapped the TPGs to small RNAs (sRNAs) that were supported by publicly deep sequencing data from various sRNA libraries and constructed the TPG-derived siRNA-target interactions. In addition, we also presented that TPGs can act as a target for miRNAs that actually regulate the parental gene. To enable the systematic compilation and updating of these results and additional information, we have developed a database, pseudoMap, capturing various types of information, including sequence data, TPG and cognate annotation, deep sequencing data, RNA-folding structure, gene expression profiles, miRNA annotation and target prediction. As our knowledge, pseudoMap is the first database to demonstrate two mechanisms of human TPGs: encoding siRNAs and decoying miRNAs that target the parental gene. pseudoMap is freely accessible at http://pseudomap.mbc.nctu.edu.tw/. Database URL:http://pseudomap.mbc.nctu.edu.tw/
Collapse
Affiliation(s)
- Wen-Ling Chan
- Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsin-Chu, Taiwan
| | | | | | | |
Collapse
|
37
|
Chan WL, Yuo CY, Yang WK, Hung SY, Chang YS, Chiu CC, Yeh KT, Huang HD, Chang JG. Transcribed pseudogene ψPPM1K generates endogenous siRNA to suppress oncogenic cell growth in hepatocellular carcinoma. Nucleic Acids Res 2013; 41:3734-47. [PMID: 23376929 PMCID: PMC3616710 DOI: 10.1093/nar/gkt047] [Citation(s) in RCA: 50] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Pseudogenes, especially those that are transcribed, may not be mere genomic fossils, but their biological significance remains unclear. Postulating that in the human genome, as in animal models, pseudogenes may function as gene regulators through generation of endo-siRNAs (esiRNAs), antisense RNAs or RNA decoys, we performed bioinformatic and subsequent experimental tests to explore esiRNA-mediated mechanisms of pseudogene involvement in oncogenesis. A genome-wide survey revealed a partial retrotranscript pseudogene ψPPM1K containing inverted repeats capable of folding into hairpin structures that can be processed into two esiRNAs; these esiRNAs potentially target many cellular genes, including NEK8. In 41 paired surgical specimens, we found significantly reduced expression of two predicted ψPPM1K-specific esiRNAs, and the cognate gene PPM1K, in hepatocellular carcinoma compared with matched non-tumour tissues, whereas the expression of target gene NEK8 was increased in tumours. Additionally, NEK8 and PPM1K were downregulated in stably transfected ψPPM1K-overexpressing cells, but not in cells transfected with an esiRNA1-deletion mutant of ψPPM1K. Furthermore, expression of NEK8 in ψPPM1K-transfected cells demonstrated that NEK8 can counteract the growth inhibitory effects of ψPPM1K. These findings indicate that a transcribed pseudogene can exert tumour-suppressor activity independent of its parental gene by generation of esiRNAs that regulate human cell growth.
Collapse
Affiliation(s)
- Wen-Ling Chan
- Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsin-Chu 300, Taiwan
| | | | | | | | | | | | | | | | | |
Collapse
|
38
|
Losses of functional opsin genes, short-wavelength cone photopigments, and color vision--a significant trend in the evolution of mammalian vision. Vis Neurosci 2013; 30:39-53. [PMID: 23286388 DOI: 10.1017/s0952523812000429] [Citation(s) in RCA: 69] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Abstract
All mammalian cone photopigments are derived from the operation of representatives from two opsin gene families (SWS1 and LWS in marsupial and eutherian mammals; SWS2 and LWS in monotremes), a process that produces cone pigments with respective peak sensitivities in the short and middle-to-long wavelengths. With the exception of a number of primate taxa, the modal pattern for mammals is to have two types of cone photopigment, one drawn from each of the gene families. In recent years, it has been discovered that the SWS1 opsin genes of a widely divergent collection of eutherian mammals have accumulated mutational changes that render them nonfunctional. This alteration reduces the retinal complements of these species to a single cone type, thus rendering ordinary color vision impossible. At present, several dozen species from five mammalian orders have been identified as falling into this category, but the total number of mammalian species that have lost short-wavelength cones in this way is certain to be much larger, perhaps reaching as high as 10% of all species. A number of circumstances that might be used to explain this widespread cone loss can be identified. Among these, the single consistent fact is that the species so affected are nocturnal or, if they are not technically nocturnal, they at least feature retinal organizations that are typically associated with that lifestyle. At the same time, however, there are many nocturnal mammals that retain functional short-wavelength cones. Nocturnality thus appears to set the stage for loss of functional SWS1 opsin genes in mammals, but it cannot be the sole circumstance.
Collapse
|
39
|
Pante E, Rohfritsch A, Becquet V, Belkhir K, Bierne N, Garcia P. SNP detection from de novo transcriptome sequencing in the bivalve Macoma balthica: marker development for evolutionary studies. PLoS One 2012; 7:e52302. [PMID: 23300636 PMCID: PMC3530552 DOI: 10.1371/journal.pone.0052302] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2012] [Accepted: 11/16/2012] [Indexed: 11/28/2022] Open
Abstract
Hybrid zones are noteworthy systems for the study of environmental adaptation to fast-changing environments, as they constitute reservoirs of polymorphism and are key to the maintenance of biodiversity. They can move in relation to climate fluctuations, as temperature can affect both selection and migration, or remain trapped by environmental and physical barriers. There is therefore a very strong incentive to study the dynamics of hybrid zones subjected to climate variations. The infaunal bivalve Macoma balthica emerges as a noteworthy model species, as divergent lineages hybridize, and its native NE Atlantic range is currently contracting to the North. To investigate the dynamics and functioning of hybrid zones in M. balthica, we developed new molecular markers by sequencing the collective transcriptome of 30 individuals. Ten individuals were pooled for each of the three populations sampled at the margins of two hybrid zones. A single 454 run generated 277 Mb from which 17K SNPs were detected. SNP density averaged 1 polymorphic site every 14 to 19 bases, for mitochondrial and nuclear loci, respectively. An scan detected high genetic divergence among several hundred SNPs, some of them involved in energetic metabolism, cellular respiration and physiological stress. The high population differentiation, recorded for nuclear-encoded ATP synthase and NADH dehydrogenase as well as most mitochondrial loci, suggests cytonuclear genetic incompatibilities. Results from this study will help pave the way to a high-resolution study of hybrid zone dynamics in M. balthica, and the relative importance of endogenous and exogenous barriers to gene flow in this system.
Collapse
Affiliation(s)
- Eric Pante
- Littoral, Environnement et Sociétés Joint Research Unit 7266 Centre national de la recherche scientifique, Université de La Rochelle, La Rochelle, France.
| | | | | | | | | | | |
Collapse
|
40
|
Antonov I, Baranov P, Borodovsky M. GeneTack database: genes with frameshifts in prokaryotic genomes and eukaryotic mRNA sequences. Nucleic Acids Res 2012; 41:D152-6. [PMID: 23161689 PMCID: PMC3531167 DOI: 10.1093/nar/gks1062] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Database annotations of prokaryotic genomes and eukaryotic mRNA sequences pay relatively low attention to frame transitions that disrupt protein-coding genes. Frame transitions (frameshifts) could be caused by sequencing errors or indel mutations inside protein-coding regions. Other observed frameshifts are related to recoding events (that evolved to control expression of some genes). Earlier, we have developed an algorithm and software program GeneTack for ab initio frameshift finding in intronless genes. Here, we describe a database (freely available at http://topaz.gatech.edu/GeneTack/db.html) containing genes with frameshifts (fs-genes) predicted by GeneTack. The database includes 206 991 fs-genes from 1106 complete prokaryotic genomes and 45 295 frameshifts predicted in mRNA sequences from 100 eukaryotic genomes. The whole set of fs-genes was grouped into clusters based on sequence similarity between fs-proteins (conceptually translated fs-genes), conservation of the frameshift position and frameshift direction (−1, +1). The fs-genes can be retrieved by similarity search to a given query sequence via a web interface, by fs-gene cluster browsing, etc. Clusters of fs-genes are characterized with respect to their likely origin, such as pseudogenization, phase variation, etc. The largest clusters contain fs-genes with programed frameshifts (related to recoding events).
Collapse
Affiliation(s)
- Ivan Antonov
- School of Computational Science and Engineering, Georgia Institute of Technology, Atlanta, GA 30332, USA
| | | | | |
Collapse
|
41
|
McManus MT, Joshi S, Searle B, Pither-Joyce M, Shaw M, Leung S, Albert N, Shigyo M, Jakse J, Havey MJ, McCallum J. Genotypic variation in sulfur assimilation and metabolism of onion (Allium cepa L.) III. Characterization of sulfite reductase. PHYTOCHEMISTRY 2012; 83:34-42. [PMID: 22944351 DOI: 10.1016/j.phytochem.2012.07.028] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/09/2012] [Revised: 07/26/2012] [Accepted: 07/30/2012] [Indexed: 06/01/2023]
Abstract
Genomic and cDNA sequences corresponding to a ferredoxin-sulfite reductase (SiR) have been cloned from bulb onion (Allium cepa L.) and the expression of the gene and activity of the enzyme characterized with respect to sulfur (S) supply. Cloning, mapping and expression studies revealed that onion has a single functional SiR gene and also expresses an unprocessed pseudogene (φ-SiR). Northern and qPCR analysis revealed differences in expression pattern between the SiR gene and the pseudogene. Western analysis using antibodies raised to a recombinant SiR revealed that the enzyme is present in chloroplasts and phylogenetic analysis has shown that the onion protein groups with lower eudicots. In hydroponically-grown plants, levels of SiR transcripts were significantly higher in the roots of S-sufficient when compared with S-deficient plants of the pungent cultivar 'W202A' but not the less pungent cultivar 'Texas Grano'. In these same treatments, a higher level of enzyme activity was observed in the S-sufficient treatment in leaves of both cultivars before and after bulbing. In a factorial field trial with and without sulfur fertilization, a statistically significant increase in SiR activity was observed in the leaves of the pungent cultivar 'Kojak' in response to added S but not in the less pungent cultivar 'Encore'.
Collapse
Affiliation(s)
- Michael T McManus
- Institute of Molecular BioSciences, Massey University, Private Bag 11222, Palmerston North, New Zealand.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
42
|
Abstract
Because they are generally noncoding and thus considered nonfunctional and unimportant, pseudogenes have long been neglected. Recent advances have established that the DNA of a pseudogene, the RNA transcribed from a pseudogene, or the protein translated from a pseudogene can have multiple, diverse functions and that these functions can affect not only their parental genes but also unrelated genes. Therefore, pseudogenes have emerged as a previously unappreciated class of sophisticated modulators of gene expression, with a multifaceted involvement in the pathogenesis of human cancer.
Collapse
Affiliation(s)
- Laura Poliseno
- Oncogenomics Unit, Core Research Laboratory, Istituto Toscano Tumori (CRL-ITT), c/o IFC-CNR Via Moruzzi 1, 56124 Pisa, Italy.
| |
Collapse
|
43
|
Naidoo N, Pawitan Y, Soong R, Cooper DN, Ku CS. Human genetics and genomics a decade after the release of the draft sequence of the human genome. Hum Genomics 2012; 5:577-622. [PMID: 22155605 PMCID: PMC3525251 DOI: 10.1186/1479-7364-5-6-577] [Citation(s) in RCA: 77] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
Substantial progress has been made in human genetics and genomics research over the past ten years since the publication of the draft sequence of the human genome in 2001. Findings emanating directly from the Human Genome Project, together with those from follow-on studies, have had an enormous impact on our understanding of the architecture and function of the human genome. Major developments have been made in cataloguing genetic variation, the International HapMap Project, and with respect to advances in genotyping technologies. These developments are vital for the emergence of genome-wide association studies in the investigation of complex diseases and traits. In parallel, the advent of high-throughput sequencing technologies has ushered in the 'personal genome sequencing' era for both normal and cancer genomes, and made possible large-scale genome sequencing studies such as the 1000 Genomes Project and the International Cancer Genome Consortium. The high-throughput sequencing and sequence-capture technologies are also providing new opportunities to study Mendelian disorders through exome sequencing and whole-genome sequencing. This paper reviews these major developments in human genetics and genomics over the past decade.
Collapse
Affiliation(s)
- Nasheen Naidoo
- Centre for Molecular Epidemiology, Department of Epidemiology and Public Health, Yong Loo Lin School of Medicine, National University of Singapore, Singapore
| | | | | | | | | |
Collapse
|
44
|
Yan B, Wang ZH, Guo JT. The research strategies for probing the function of long noncoding RNAs. Genomics 2011; 99:76-80. [PMID: 22210346 DOI: 10.1016/j.ygeno.2011.12.002] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2011] [Revised: 12/08/2011] [Accepted: 12/09/2011] [Indexed: 10/14/2022]
Abstract
Long noncoding RNAs (lncRNAs) represent a new frontier in molecular genetics and molecular biology. They have a tremendous potential for advancing our comprehensive understanding of biological processes in huma n health and disease. The transcripts of lncRNAs are easy to find, but sorting out what they do remains the biggest challenge in lncRNAs' research field. In the paper, we highlight recent progress regarding the methods to explore the roles of lncRNAs.
Collapse
Affiliation(s)
- Biao Yan
- Key laboratory of Freshwater Aquatic Genetic Resources, Shanghai Ocean University, Ministry of Agriculture, Shanghai Ocean University, Shanghai 201306, China.
| | | | | |
Collapse
|
45
|
Singh D, Kashyap A, Pandey RV, Saini KS. Novel advances in cytochrome P450 research. Drug Discov Today 2011; 16:793-9. [DOI: 10.1016/j.drudis.2011.08.003] [Citation(s) in RCA: 56] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2010] [Revised: 05/19/2011] [Accepted: 08/09/2011] [Indexed: 11/28/2022]
|
46
|
Muro EM, Mah N, Andrade-Navarro MA. Functional evidence of post-transcriptional regulation by pseudogenes. Biochimie 2011; 93:1916-21. [PMID: 21816204 DOI: 10.1016/j.biochi.2011.07.024] [Citation(s) in RCA: 63] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2011] [Accepted: 07/19/2011] [Indexed: 11/26/2022]
Abstract
Pseudogenes have been mainly considered as functionless evolutionary relics since their discovery in 1977. However, multiple mechanisms of pseudogene functionality have been proposed both at the transcriptional and post-transcriptional level. This review focuses on the role of pseudogenes as post-transcriptional regulators. Two lines of research have recently presented strong evidence of their potential function as post-transcriptional regulators of the corresponding parental genes from which they originate. First, pseudogene genomic sequences can encode siRNAs. Second, pseudogene transcripts can act as indirect post-transcriptional regulators decoying ncRNA, in particular miRNAs that target the parental gene. This has been demonstrated for PTEN and KRAS, two genes involved in tumorigenesis. The role of pseudogenes in disease has not been proven and seems to be the next research landmark. In this review, we chronicle the events following the initial discovery of the 'useless' pseudogene to its breakthrough as a functional molecule with hitherto unbeknownst potential to influence human disease.
Collapse
Affiliation(s)
- Enrique M Muro
- Max-Delbrück Center for Molecular Medicine, Robert Rössle Str. 10, 13125 Berlin, Germany.
| | | | | |
Collapse
|
47
|
Pink RC, Wicks K, Caley DP, Punch EK, Jacobs L, Carter DRF. Pseudogenes: pseudo-functional or key regulators in health and disease? RNA (NEW YORK, N.Y.) 2011; 17:792-8. [PMID: 21398401 PMCID: PMC3078729 DOI: 10.1261/rna.2658311] [Citation(s) in RCA: 293] [Impact Index Per Article: 22.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/06/2023]
Abstract
Pseudogenes have long been labeled as "junk" DNA, failed copies of genes that arise during the evolution of genomes. However, recent results are challenging this moniker; indeed, some pseudogenes appear to harbor the potential to regulate their protein-coding cousins. Far from being silent relics, many pseudogenes are transcribed into RNA, some exhibiting a tissue-specific pattern of activation. Pseudogene transcripts can be processed into short interfering RNAs that regulate coding genes through the RNAi pathway. In another remarkable discovery, it has been shown that pseudogenes are capable of regulating tumor suppressors and oncogenes by acting as microRNA decoys. The finding that pseudogenes are often deregulated during cancer progression warrants further investigation into the true extent of pseudogene function. In this review, we describe the ways in which pseudogenes exert their effect on coding genes and explore the role of pseudogenes in the increasingly complex web of noncoding RNA that contributes to normal cellular regulation.
Collapse
Affiliation(s)
- Ryan Charles Pink
- School of Life Sciences, Oxford Brookes University, Headington, Oxford, United Kingdom
| | | | | | | | | | | |
Collapse
|
48
|
Brosch M, Saunders GI, Frankish A, Collins MO, Yu L, Wright J, Verstraten R, Adams DJ, Harrow J, Choudhary JS, Hubbard T. Shotgun proteomics aids discovery of novel protein-coding genes, alternative splicing, and "resurrected" pseudogenes in the mouse genome. Genome Res 2011; 21:756-67. [PMID: 21460061 DOI: 10.1101/gr.114272.110] [Citation(s) in RCA: 99] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Recent advances in proteomic mass spectrometry (MS) offer the chance to marry high-throughput peptide sequencing to transcript models, allowing the validation, refinement, and identification of new protein-coding loci. We present a novel pipeline that integrates highly sensitive and statistically robust peptide spectrum matching with genome-wide protein-coding predictions to perform large-scale gene validation and discovery in the mouse genome for the first time. In searching an excess of 10 million spectra, we have been able to validate 32%, 17%, and 7% of all protein-coding genes, exons, and splice boundaries, respectively. Moreover, we present strong evidence for the identification of multiple alternatively spliced translations from 53 genes and have uncovered 10 entirely novel protein-coding genes, which are not covered in any mouse annotation data sources. One such novel protein-coding gene is a fusion protein that spans the Ins2 and Igf2 loci to produce a transcript encoding the insulin II and the insulin-like growth factor 2-derived peptides. We also report nine processed pseudogenes that have unique peptide hits, demonstrating, for the first time, that they are not just transcribed but are translated and are therefore resurrected into new coding loci. This work not only highlights an important utility for MS data in genome annotation but also provides unique insights into the gene structure and propagation in the mouse genome. All these data have been subsequently used to improve the publicly available mouse annotation available in both the Vega and Ensembl genome browsers (http://vega.sanger.ac.uk).
Collapse
Affiliation(s)
- Markus Brosch
- The Wellcome Trust Sanger Institute, The Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, United Kingdom
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
49
|
Shrestha B, Reed JM, Starks PT, Kaufman GE, Goldstone JV, Roelke ME, O'Brien SJ, Koepfli KP, Frank LG, Court MH. Evolution of a major drug metabolizing enzyme defect in the domestic cat and other felidae: phylogenetic timing and the role of hypercarnivory. PLoS One 2011; 6:e18046. [PMID: 21464924 PMCID: PMC3065456 DOI: 10.1371/journal.pone.0018046] [Citation(s) in RCA: 66] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2010] [Accepted: 02/18/2011] [Indexed: 11/18/2022] Open
Abstract
The domestic cat (Felis catus) shows remarkable sensitivity to
the adverse effects of phenolic drugs, including acetaminophen and aspirin, as
well as structurally-related toxicants found in the diet and environment. This
idiosyncrasy results from pseudogenization of the gene encoding
UDP-glucuronosyltransferase (UGT) 1A6, the major species-conserved phenol
detoxification enzyme. Here, we established the phylogenetic timing of
disruptive UGT1A6 mutations and explored the hypothesis that
gene inactivation in cats was enabled by minimal exposure to plant-derived
toxicants. Fixation of the UGT1A6 pseudogene was estimated to
have occurred between 35 and 11 million years ago with all extant Felidae having
dysfunctional UGT1A6. Out of 22 additional taxa sampled,
representative of most Carnivora families, only brown hyena (Parahyaena
brunnea) and northern elephant seal (Mirounga
angustirostris) showed inactivating UGT1A6
mutations. A comprehensive literature review of the natural diet of the sampled
taxa indicated that all species with defective UGT1A6 were
hypercarnivores (>70% dietary animal matter). Furthermore those
species with UGT1A6 defects showed evidence for reduced amino
acid constraint (increased dN/dS ratios approaching the neutral
selection value of 1.0) as compared with species with intact
UGT1A6. In contrast, there was no evidence for reduced
amino acid constraint for these same species within UGT1A1, the
gene encoding the enzyme responsible for detoxification of endogenously
generated bilirubin. Our results provide the first evidence suggesting that diet
may have played a permissive role in the devolution of a mammalian drug
metabolizing enzyme. Further work is needed to establish whether these
preliminary findings can be generalized to all Carnivora.
Collapse
Affiliation(s)
- Binu Shrestha
- Comparative and Molecular Pharmacogenomics Laboratory, Department of
Molecular Physiology and Pharmacology, Tufts University School of Medicine,
Boston, Massachusetts, United States of America
- Department of Biology, Tufts University, Medford, Massachusetts, United
States of America
| | - J. Michael Reed
- Department of Biology, Tufts University, Medford, Massachusetts, United
States of America
| | - Philip T. Starks
- Department of Biology, Tufts University, Medford, Massachusetts, United
States of America
| | - Gretchen E. Kaufman
- Department of Environmental and Population Health, Tufts Cummings School
of Veterinary Medicine, North Grafton, Massachusetts, United States of
America
| | - Jared V. Goldstone
- Biology Department, Woods Hole Oceanographic Institution, Woods Hole,
Massachusetts, United States of America
| | - Melody E. Roelke
- Laboratory of Genomic Diversity, SAIC-Frederick Incorporated, National
Cancer Institute at Frederick, Frederick, Maryland, United States of
America
| | - Stephen J. O'Brien
- Laboratory of Genomic Diversity, National Cancer Institute at Frederick,
Frederick, Maryland, United States of America
| | - Klaus-Peter Koepfli
- Laboratory of Genomic Diversity, National Cancer Institute at Frederick,
Frederick, Maryland, United States of America
| | - Laurence G. Frank
- Living with Lions Project (Kenya), Museum of Vertebrate Zoology,
University of California, Berkeley, California, United States of
America
| | - Michael H. Court
- Comparative and Molecular Pharmacogenomics Laboratory, Department of
Molecular Physiology and Pharmacology, Tufts University School of Medicine,
Boston, Massachusetts, United States of America
- * E-mail:
| |
Collapse
|
50
|
Advances in Research on Pseudogenes. PROG BIOCHEM BIOPHYS 2011. [DOI: 10.3724/sp.j.1206.2010.00215] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|