1
|
Yuanyuan J, Xinqiang Y. Micropeptides Identified from Human Genomes. J Proteome Res 2022; 21:865-873. [DOI: 10.1021/acs.jproteome.1c00889] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Jing Yuanyuan
- School of Public Health, North Sichuan Medical College, Nanchong 637000, China
| | - Yin Xinqiang
- School of Basic Medicine and Forensics, North Sichuan Medical College, Nanchong 637000, China
| |
Collapse
|
2
|
Kiniry SJ, Judge CE, Michel AM, Baranov PV. Trips-Viz: an environment for the analysis of public and user-generated ribosome profiling data. Nucleic Acids Res 2021; 49:W662-W670. [PMID: 33950201 PMCID: PMC8262740 DOI: 10.1093/nar/gkab323] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2021] [Revised: 04/11/2021] [Accepted: 04/20/2021] [Indexed: 02/07/2023] Open
Abstract
Trips-Viz (https://trips.ucc.ie/) is an interactive platform for the analysis and visualization of ribosome profiling (Ribo-Seq) and shotgun RNA sequencing (RNA-seq) data. This includes publicly available and user generated data, hence Trips-Viz can be classified as a database and as a server. As a database it provides access to many processed Ribo-Seq and RNA-seq data aligned to reference transcriptomes which has been expanded considerably since its inception. Here, we focus on the server functionality of Trips-viz which also has been greatly improved. Trips-viz now enables visualisation of proteomics data from a large number of processed mass spectrometry datasets. It can be used to support translation inferred from Ribo-Seq data. Users are now able to upload a custom reference transcriptome as well as data types other than Ribo-Seq/RNA-Seq. Incorporating custom data has been streamlined with RiboGalaxy (https://ribogalaxy.ucc.ie/) integration. The other new functionality is the rapid detection of translated open reading frames (ORFs) through a simple easy to use interface. The analysis of differential expression has been also improved via integration of DESeq2 and Anota2seq in addition to a number of other improvements of existing Trips-viz features.
Collapse
Affiliation(s)
- Stephen J Kiniry
- School of Biochemistry and Cell Biology, University College Cork, Cork, Ireland
| | - Ciara E Judge
- School of Biochemistry and Cell Biology, University College Cork, Cork, Ireland
| | - Audrey M Michel
- School of Biochemistry and Cell Biology, University College Cork, Cork, Ireland
- Ribomaps Ltd, Western Gateway Bld, Western Rd, Cork, Ireland
| | - Pavel V Baranov
- School of Biochemistry and Cell Biology, University College Cork, Cork, Ireland
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, RAS, Moscow, Russia
| |
Collapse
|
3
|
Manfrini N, Ricciardi S, Alfieri R, Ventura G, Calamita P, Favalli A, Biffo S. Ribosome profiling unveils translational regulation of metabolic enzymes in primary CD4 + Th1 cells. DEVELOPMENTAL AND COMPARATIVE IMMUNOLOGY 2020; 109:103697. [PMID: 32330465 DOI: 10.1016/j.dci.2020.103697] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/29/2020] [Revised: 04/06/2020] [Accepted: 04/06/2020] [Indexed: 05/22/2023]
Abstract
The transition from a naïve to an effector T cell is an essential event that requires metabolic reprogramming. We have recently demonstrated that the rapid metabolic changes that occur following stimulation of naïve T cells require the translation of preexisting mRNAs. Here, we provide evidence that translation regulates the metabolic asset of effector T cells. By performing ribosome profiling in human CD4+ Th1 cells, we show that the metabolism of glucose, fatty acids and pentose phosphates is regulated at the translational level. In Th1 cells, each pathway has at least one enzyme regulated at the translational level and selected enzymes have high translational efficiencies. mRNA expression does not predict protein expression. For instance, PKM2 mRNA is equally present in naïve T and Th1 cells, but the protein is abundant only in Th1. 5'-untranslated regions (UTRs) may partly account for this regulation. Overall we suggest that immunometabolism is controlled by translation.
Collapse
Affiliation(s)
- Nicola Manfrini
- INGM, National Institute of Molecular Genetics, "Fondazione Romeo ed Enrica Invernizzi", Milano, Italy; Department of Biological Sciences, University of Milan, Milan, Italy
| | - Sara Ricciardi
- INGM, National Institute of Molecular Genetics, "Fondazione Romeo ed Enrica Invernizzi", Milano, Italy; Department of Biological Sciences, University of Milan, Milan, Italy
| | - Roberta Alfieri
- INGM, National Institute of Molecular Genetics, "Fondazione Romeo ed Enrica Invernizzi", Milano, Italy
| | - Gabriele Ventura
- Department of Biological Sciences, University of Milan, Milan, Italy
| | - Piera Calamita
- INGM, National Institute of Molecular Genetics, "Fondazione Romeo ed Enrica Invernizzi", Milano, Italy; Department of Biological Sciences, University of Milan, Milan, Italy
| | - Andrea Favalli
- Department of Biological Sciences, University of Milan, Milan, Italy
| | - Stefano Biffo
- INGM, National Institute of Molecular Genetics, "Fondazione Romeo ed Enrica Invernizzi", Milano, Italy; Department of Biological Sciences, University of Milan, Milan, Italy.
| |
Collapse
|
4
|
Kiniry SJ, Michel AM, Baranov PV. Computational methods for ribosome profiling data analysis. WILEY INTERDISCIPLINARY REVIEWS. RNA 2020; 11:e1577. [PMID: 31760685 DOI: 10.1002/wrna.1577] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/30/2019] [Revised: 10/12/2019] [Accepted: 10/16/2019] [Indexed: 12/15/2022]
Abstract
Since the introduction of the ribosome profiling technique in 2009 its popularity has greatly increased. It is widely used for the comprehensive assessment of gene expression and for studying the mechanisms of regulation at the translational level. As the number of ribosome profiling datasets being produced continues to grow, so too does the need for reliable software that can provide answers to the biological questions it can address. This review describes the computational methods and tools that have been developed to analyze ribosome profiling data at the different stages of the process. It starts with initial routine processing of raw data and follows with more specific tasks such as the identification of translated open reading frames, differential gene expression analysis, or evaluation of local or global codon decoding rates. The review pinpoints challenges associated with each step and explains the ways in which they are currently addressed. In addition it provides a comprehensive, albeit incomplete, list of publicly available software applicable to each step, which may be a beneficial starting point to those unexposed to ribosome profiling analysis. The outline of current challenges in ribosome profiling data analysis may inspire computational biologists to search for novel, potentially superior, solutions that will improve and expand the bioinformatician's toolbox for ribosome profiling data analysis. This article is characterized under: Translation > Ribosome Structure/Function RNA Evolution and Genomics > Computational Analyses of RNA Translation > Translation Mechanisms Translation > Translation Regulation.
Collapse
Affiliation(s)
- Stephen J Kiniry
- School of Biochemistry and Cell Biology, University College Cork, Cork, Ireland
| | - Audrey M Michel
- School of Biochemistry and Cell Biology, University College Cork, Cork, Ireland
| | - Pavel V Baranov
- School of Biochemistry and Cell Biology, University College Cork, Cork, Ireland
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, RAS, Moscow, Russia
| |
Collapse
|
5
|
Alexaki A, Kames J, Hettiarachchi GK, Athey JC, Katneni UK, Hunt RC, Hamasaki-Katagiri N, Holcomb DD, DiCuccio M, Bar H, Komar AA, Kimchi-Sarfaty C. Ribosome profiling of HEK293T cells overexpressing codon optimized coagulation factor IX. F1000Res 2020; 9:174. [PMID: 33014344 PMCID: PMC7509596 DOI: 10.12688/f1000research.22400.2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 09/09/2020] [Indexed: 12/30/2022] Open
Abstract
Ribosome profiling provides the opportunity to evaluate translation kinetics at codon level resolution. Here, we describe ribosome profiling data, generated from two HEK293T cell lines. The ribosome profiling data are composed of Ribo-seq (mRNA sequencing data from ribosome protected fragments) and RNA-seq data (total RNA sequencing). The two HEK293T cell lines each express a version of the
F9 gene, both of which are translated into identical proteins in terms of their amino acid sequences. However, these
F9 genes vary drastically in their codon usage and predicted mRNA structure. We also provide the pipeline that we used to analyze the data. Further analyzing this dataset holds great potential as it can be used i) to unveil insights into the composition and regulation of the transcriptome, ii) for comparison with other ribosome profiling datasets, iii) to measure the rate of protein synthesis across the proteome and identify differences in elongation rates, iv) to discover previously unidentified translation of peptides, v) to explore the effects of codon usage or codon context in translational kinetics and vi) to investigate cotranslational folding. Importantly, a unique feature of this dataset, compared to other available ribosome profiling data, is the presence of the
F9 gene in two very distinct coding sequences.
Collapse
Affiliation(s)
- Aikaterini Alexaki
- Center for Biologics Evaluation and Research, Food and Drug Administration, USA, Silver Spring, MD, 20993, USA
| | - Jacob Kames
- Center for Biologics Evaluation and Research, Food and Drug Administration, USA, Silver Spring, MD, 20993, USA
| | - Gaya K Hettiarachchi
- Center for Biologics Evaluation and Research, Food and Drug Administration, USA, Silver Spring, MD, 20993, USA
| | - John C Athey
- Center for Biologics Evaluation and Research, Food and Drug Administration, USA, Silver Spring, MD, 20993, USA
| | - Upendra K Katneni
- Center for Biologics Evaluation and Research, Food and Drug Administration, USA, Silver Spring, MD, 20993, USA
| | - Ryan C Hunt
- Center for Biologics Evaluation and Research, Food and Drug Administration, USA, Silver Spring, MD, 20993, USA
| | - Nobuko Hamasaki-Katagiri
- Center for Biologics Evaluation and Research, Food and Drug Administration, USA, Silver Spring, MD, 20993, USA
| | - David D Holcomb
- Center for Biologics Evaluation and Research, Food and Drug Administration, USA, Silver Spring, MD, 20993, USA
| | - Michael DiCuccio
- National Center of Biotechnology Information, National Institutes of Health, USA, Bethesda, MD, 20892, USA
| | - Haim Bar
- Department of Statistics, University of Connecticut, Storrs, CT, 06269, USA
| | - Anton A Komar
- Center for Gene Regulation in Health and Disease, Cleveland State University, Cleveland, OH, 44115, USA
| | - Chava Kimchi-Sarfaty
- Center for Biologics Evaluation and Research, Food and Drug Administration, USA, Silver Spring, MD, 20993, USA
| |
Collapse
|
6
|
Khan YA, Jungreis I, Wright JC, Mudge JM, Choudhary JS, Firth AE, Kellis M. Evidence for a novel overlapping coding sequence in POLG initiated at a CUG start codon. BMC Genet 2020; 21:25. [PMID: 32138667 PMCID: PMC7059407 DOI: 10.1186/s12863-020-0828-7] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2019] [Accepted: 02/19/2020] [Indexed: 11/14/2022] Open
Abstract
BACKGROUND POLG, located on nuclear chromosome 15, encodes the DNA polymerase γ(Pol γ). Pol γ is responsible for the replication and repair of mitochondrial DNA (mtDNA). Pol γ is the only DNA polymerase found in mitochondria for most animal cells. Mutations in POLG are the most common single-gene cause of diseases of mitochondria and have been mapped over the coding region of the POLG ORF. RESULTS Using PhyloCSF to survey alternative reading frames, we found a conserved coding signature in an alternative frame in exons 2 and 3 of POLG, herein referred to as ORF-Y that arose de novo in placental mammals. Using the synplot2 program, synonymous site conservation was found among mammals in the region of the POLG ORF that is overlapped by ORF-Y. Ribosome profiling data revealed that ORF-Y is translated and that initiation likely occurs at a CUG codon. Inspection of an alignment of mammalian sequences containing ORF-Y revealed that the CUG codon has a strong initiation context and that a well-conserved predicted RNA stem-loop begins 14 nucleotides downstream. Such features are associated with enhanced initiation at near-cognate non-AUG codons. Reanalysis of the Kim et al. (2014) draft human proteome dataset yielded two unique peptides that map unambiguously to ORF-Y. An additional conserved uORF, herein referred to as ORF-Z, was also found in exon 2 of POLG. Lastly, we surveyed Clinvar variants that are synonymous with respect to the POLG ORF and found that most of these variants cause amino acid changes in ORF-Y or ORF-Z. CONCLUSIONS We provide evidence for a novel coding sequence, ORF-Y, that overlaps the POLG ORF. Ribosome profiling and mass spectrometry data show that ORF-Y is expressed. PhyloCSF and synplot2 analysis show that ORF-Y is subject to strong purifying selection. An abundance of disease-correlated mutations that map to exons 2 and 3 of POLG but also affect ORF-Y provides potential clinical significance to this finding.
Collapse
Affiliation(s)
- Yousuf A Khan
- Department of Molecular and Cellular Physiology, Stanford University School of Medicine, Stanford, CA, 94305, USA.
- Division of Virology, Department of Pathology, University of Cambridge, Tennis Court Road, Cambridge, CB2 1QP, UK.
| | - Irwin Jungreis
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA.
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA.
| | - James C Wright
- Functional Proteomics, Division of Cancer Biology, Institute of Cancer Research, 123 Old Brompton Road, London, SW7 3RP, UK
| | - Jonathan M Mudge
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | | | - Andrew E Firth
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
| | - Manolis Kellis
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
| |
Collapse
|
7
|
Michel AM, Kiniry SJ, O'Connor PBF, Mullan JP, Baranov PV. GWIPS-viz: 2018 update. Nucleic Acids Res 2019; 46:D823-D830. [PMID: 28977460 PMCID: PMC5753223 DOI: 10.1093/nar/gkx790] [Citation(s) in RCA: 39] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2017] [Accepted: 08/29/2017] [Indexed: 12/15/2022] Open
Abstract
The GWIPS-viz browser (http://gwips.ucc.ie/) is an on-line genome browser which is tailored for exploring ribosome profiling (Ribo-seq) data. Since its publication in 2014, GWIPS-viz provides Ribo-seq data for an additional 14 genomes bringing the current total to 23. The integration of new Ribo-seq data has been automated thereby increasing the number of available tracks to 1792, a 10-fold increase in the last three years. The increase is particularly substantial for data derived from human sources. Following user requests, we added the functionality to download these tracks in bigWig format. We also incorporated new types of data (e.g. TCP-seq) as well as auxiliary tracks from other sources that help with the interpretation of Ribo-seq data. Improvements in the visualization of the data have been carried out particularly for bacterial genomes where the Ribo-seq data are now shown in a strand specific manner. For higher eukaryotic datasets, we provide characteristics of individual datasets using the RUST program which includes the triplet periodicity, sequencing biases and relative inferred A-site dwell times. This information can be used for assessing the quality of Ribo-seq datasets. To improve the power of the signal, we aggregate Ribo-seq data from several studies into Global aggregate tracks for each genome.
Collapse
Affiliation(s)
- Audrey M Michel
- School of Biochemistry and Cell Biology, University College Cork, Cork, Ireland
| | - Stephen J Kiniry
- School of Biochemistry and Cell Biology, University College Cork, Cork, Ireland
| | | | - James P Mullan
- School of Biochemistry and Cell Biology, University College Cork, Cork, Ireland
| | - Pavel V Baranov
- School of Biochemistry and Cell Biology, University College Cork, Cork, Ireland
| |
Collapse
|
8
|
Shao Y, Chen C, Shen H, He BZ, Yu D, Jiang S, Zhao S, Gao Z, Zhu Z, Chen X, Fu Y, Chen H, Gao G, Long M, Zhang YE. GenTree, an integrated resource for analyzing the evolution and function of primate-specific coding genes. Genome Res 2019; 29:682-696. [PMID: 30862647 PMCID: PMC6442393 DOI: 10.1101/gr.238733.118] [Citation(s) in RCA: 58] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2018] [Accepted: 01/29/2019] [Indexed: 12/13/2022]
Abstract
The origination of new genes contributes to phenotypic evolution in humans. Two major challenges in the study of new genes are the inference of gene ages and annotation of their protein-coding potential. To tackle these challenges, we created GenTree, an integrated online database that compiles age inferences from three major methods together with functional genomic data for new genes. Genome-wide comparison of the age inference methods revealed that the synteny-based pipeline (SBP) is most suited for recently duplicated genes, whereas the protein-family–based methods are useful for ancient genes. For SBP-dated primate-specific protein-coding genes (PSGs), we performed manual evaluation based on published PSG lists and showed that SBP generated a conservative data set of PSGs by masking less reliable syntenic regions. After assessing the coding potential based on evolutionary constraint and peptide evidence from proteomic data, we curated a list of 254 PSGs with different levels of protein evidence. This list also includes 41 candidate misannotated pseudogenes that encode primate-specific short proteins. Coexpression analysis showed that PSGs are preferentially recruited into organs with rapidly evolving pathways such as spermatogenesis, immune response, mother–fetus interaction, and brain development. For brain development, primate-specific KRAB zinc-finger proteins (KZNFs) are specifically up-regulated in the mid-fetal stage, which may have contributed to the evolution of this critical stage. Altogether, hundreds of PSGs are either recruited to processes under strong selection pressure or to processes supporting an evolving novel organ.
Collapse
Affiliation(s)
- Yi Shao
- Key Laboratory of Zoological Systematics and Evolution and State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China.,University of Chinese Academy of Sciences, Beijing 100049, China
| | - Chunyan Chen
- Key Laboratory of Zoological Systematics and Evolution and State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China.,University of Chinese Academy of Sciences, Beijing 100049, China
| | - Hao Shen
- College of Computers, Hunan University of Technology, Zhuzhou Hunan 412007, China
| | - Bin Z He
- FAS Center for Systems Biology and Howard Hughes Medical Institute, Harvard University, Cambridge, Massachusetts 02138, USA
| | - Daqi Yu
- Key Laboratory of Zoological Systematics and Evolution and State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China.,University of Chinese Academy of Sciences, Beijing 100049, China
| | - Shuai Jiang
- State Key Laboratory of Protein and Plant Gene Research, School of Life Sciences, Center for Bioinformatics, Peking University, Beijing 100871, China.,Beijing Advanced Innovation Center for Genomics (ICG), Biomedical Pioneering Innovation Center (BIOPIC), Peking University, Beijing 100871, China
| | - Shilei Zhao
- University of Chinese Academy of Sciences, Beijing 100049, China.,CAS Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
| | - Zhiqiang Gao
- University of Chinese Academy of Sciences, Beijing 100049, China.,National Center for Mathematics and Interdisciplinary Sciences, Key Laboratory of Random Complex Structures and Data Science, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, 100190, China
| | - Zhenglin Zhu
- School of Life Sciences, Chongqing University, Chongqing 400044, China
| | - Xi Chen
- Wuhan Institute of Biotechnology, Wuhan 430072, China.,Medical Research Institute, Wuhan University, Wuhan 430072, China
| | - Yan Fu
- University of Chinese Academy of Sciences, Beijing 100049, China.,National Center for Mathematics and Interdisciplinary Sciences, Key Laboratory of Random Complex Structures and Data Science, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, 100190, China
| | - Hua Chen
- University of Chinese Academy of Sciences, Beijing 100049, China.,CAS Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,CAS Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming 650223, China
| | - Ge Gao
- State Key Laboratory of Protein and Plant Gene Research, School of Life Sciences, Center for Bioinformatics, Peking University, Beijing 100871, China.,Beijing Advanced Innovation Center for Genomics (ICG), Biomedical Pioneering Innovation Center (BIOPIC), Peking University, Beijing 100871, China
| | - Manyuan Long
- Department of Ecology and Evolution, The University of Chicago, Chicago, Illinois 60637, USA
| | - Yong E Zhang
- Key Laboratory of Zoological Systematics and Evolution and State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China.,University of Chinese Academy of Sciences, Beijing 100049, China.,CAS Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming 650223, China
| |
Collapse
|
9
|
Al-Raawi D, Jones R, Wijesinghe S, Halsall J, Petric M, Roberts S, Hotchin NA, Kanhere A. A novel form of JARID2 is required for differentiation in lineage-committed cells. EMBO J 2018; 38:embj.201798449. [PMID: 30573669 PMCID: PMC6356158 DOI: 10.15252/embj.201798449] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2017] [Revised: 10/15/2018] [Accepted: 10/25/2018] [Indexed: 12/11/2022] Open
Abstract
Polycomb repressive complex‐2 (PRC2) is a group of proteins that play an important role during development and in cell differentiation. PRC2 is a histone‐modifying complex that catalyses methylation of lysine 27 of histone H3 (H3K27me3) at differentiation genes leading to their transcriptional repression. JARID2 is a co‐factor of PRC2 and is important for targeting PRC2 to chromatin. Here, we show that, unlike in embryonic stem cells, in lineage‐committed human cells, including human epidermal keratinocytes, JARID2 predominantly exists as a novel low molecular weight form, which lacks the N‐terminal PRC2‐interacting domain (ΔN‐JARID2). We show that ΔN‐JARID2 is a cleaved product of full‐length JARID2 spanning the C‐terminal conserved jumonji domains. JARID2 knockout in keratinocytes results in up‐regulation of cell cycle genes and repression of many epidermal differentiation genes. Surprisingly, repression of epidermal differentiation genes in JARID2‐null keratinocytes can be rescued by expression of ΔN‐JARID2 suggesting that, in contrast to PRC2, ΔN‐JARID2 promotes activation of differentiation genes. We propose that a switch from expression of full‐length JARID2 to ΔN‐JARID2 is important for the up‐regulation differentiation genes.
Collapse
Affiliation(s)
- Diaa Al-Raawi
- School of Biosciences, University of Birmingham, Birmingham, UK
| | - Rhian Jones
- School of Biosciences, University of Birmingham, Birmingham, UK
| | | | - John Halsall
- Institute of Cancer and Genomic Sciences, University of Birmingham, Birmingham, UK
| | - Marija Petric
- School of Biosciences, University of Birmingham, Birmingham, UK
| | - Sally Roberts
- Institute of Cancer and Genomic Sciences, University of Birmingham, Birmingham, UK
| | - Neil A Hotchin
- School of Biosciences, University of Birmingham, Birmingham, UK
| | - Aditi Kanhere
- School of Biosciences, University of Birmingham, Birmingham, UK
| |
Collapse
|
10
|
Kumari R, Michel AM, Baranov PV. PausePred and Rfeet: webtools for inferring ribosome pauses and visualizing footprint density from ribosome profiling data. RNA (NEW YORK, N.Y.) 2018; 24:1297-1304. [PMID: 30049792 PMCID: PMC6140459 DOI: 10.1261/rna.065235.117] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/07/2017] [Accepted: 07/23/2018] [Indexed: 05/25/2023]
Abstract
The process of translation is characterized by irregularities in the local decoding rates of specific mRNA codons. This includes the occurrences of long pauses that can take place when ribosomes decode certain peptide sequences, encounter strong RNA secondary structures, or decode "hungry" codons. Examples are known where such pausing or stalling is used for regulating protein synthesis. This can be achieved at the level of translation via direct alteration of ribosome progression through mRNA or by altering mRNA stability via NoGo decay. Ribosome pausing has also been implicated in the cotranslational folding of proteins. Ribosome profiling data often are used for inferring the locations of ribosome pauses. However, no dedicated online software is available for this purpose. Here we present PausePred (https://pausepred.ucc.ie/), which can be used to infer ribosome pauses from ribosome profiling (Ribo-seq) data. Peaks of ribosome footprint density are scored based on their magnitude relative to the background density within the surrounding area. The scoring allows the comparison of peaks across the transcriptome or genome. In addition to the score, PausePred reports the coordinates of the pause, the footprint density at the pause site, and the surrounding nucleotide sequence. The pauses can be visualized in the context of Ribo-seq and RNA-seq density plots generated for specific transcripts or genomic regions with the Rfeet tool. PausePred does not require input on the location of protein coding ORFs (although gene annotations can be optionally supplied). As a result, it can be used universally and its output does not depend on ever evolving annotations.
Collapse
Affiliation(s)
- Romika Kumari
- School of Biochemistry and Cell Biology, Western Gateway Building, University College Cork, Cork, Ireland
| | - Audrey M Michel
- School of Biochemistry and Cell Biology, Western Gateway Building, University College Cork, Cork, Ireland
| | - Pavel V Baranov
- School of Biochemistry and Cell Biology, Western Gateway Building, University College Cork, Cork, Ireland
| |
Collapse
|
11
|
Rogozin IB, Gertz EM, Baranov PV, Poliakov E, Schaffer AA. Genome-Wide Changes in Protein Translation Efficiency Are Associated with Autism. Genome Biol Evol 2018; 10:1902-1919. [PMID: 29986017 PMCID: PMC6086092 DOI: 10.1093/gbe/evy146] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/06/2018] [Indexed: 01/05/2023] Open
Abstract
We previously proposed that changes in the efficiency of protein translation are associated with autism spectrum disorders (ASDs). This hypothesis connects environmental factors and genetic factors because each can alter translation efficiency. For genetic factors, we previously tested our hypothesis using a small set of ASD-associated genes, a small set of ASD-associated variants, and a statistic to quantify by how much a single nucleotide variant (SNV) in a protein coding region changes translation speed. In this study, we confirm and extend our hypothesis using a published set of 1,800 autism quartets (parents, one affected child and one unaffected child) and genome-wide variants. Then, we extend the test statistic to combine translation efficiency with other possibly relevant variables: ribosome profiling data, presence/absence of CpG dinucleotides, and phylogenetic conservation. The inclusion of ribosome profiling abundances strengthens our results for male-male sibling pairs. The inclusion of CpG information strengthens our results for female-female pairs, giving an insight into the significant gender differences in autism incidence. By combining the single-variant test statistic for all variants in a gene, we obtain a single gene score to evaluate how well a gene distinguishes between affected and unaffected siblings. Using statistical methods, we compute gene sets that have some power to distinguish between affected and unaffected siblings by translation efficiency of gene variants. Pathway and enrichment analysis of those gene sets suggest the importance of Wnt signaling pathways, some other pathways related to cancer, ATP binding, and ATP-ase pathways in the etiology of ASDs.
Collapse
Affiliation(s)
- Igor B Rogozin
- National Center for Biotechnology Information, National Library of Medicine, NIH, Bethesda, Maryland
| | - E Michael Gertz
- National Center for Biotechnology Information, National Library of Medicine, NIH, Bethesda, Maryland
| | - Pasha V Baranov
- School of Biochemistry and Cell Biology, University College Cork, Cork, Ireland
| | - Eugenia Poliakov
- National Eye Institute, NIH, Laboratory of Retinal Cell and Molecular Biology, Bethesda, Maryland
| | - Alejandro A Schaffer
- National Center for Biotechnology Information, National Library of Medicine, NIH, Bethesda, Maryland
| |
Collapse
|
12
|
Abstract
Protein folding begins co-translationally within the restricted space of the peptide exit tunnel of the ribosome. We have already shown that the N-terminal α-helical domain of the universally conserved N5-glutamine methyltransferase HemK is compacted within the exit tunnel and rearranges into the native fold upon emerging from the ribosome. However, the exact folding pathway of the domain remained unclear. Here we analyzed the rapid kinetics of translation and folding monitored by fluorescence resonance energy transfer and photoinduced electron transfer using global fitting to a model for synthesis of the 112-amino acid HemK fragment. Our results suggest that the co-translational folding trajectory of HemK starts within the tunnel and passes through four kinetically distinct folding intermediates that may represent sequential docking of helices to a growing compact core. The kinetics of the process is defined entirely by translation. The results show how analysis of ensemble kinetic data can be used to dissect complex trajectories of rapid conformational rearrangements in multicomponent systems.
Collapse
Affiliation(s)
- Evan Mercier
- Department of Physical Biochemistry , Max Planck Institute for Biophysical Chemistry , Am Fassberg 11 , D-37077 Goettingen , Germany
| | - Marina V Rodnina
- Department of Physical Biochemistry , Max Planck Institute for Biophysical Chemistry , Am Fassberg 11 , D-37077 Goettingen , Germany
| |
Collapse
|
13
|
Abstract
Peptides encoded by short open reading frames (sORFs) are usually defined as peptides ≤100 aa long. Usually sORFs were ignored by automatic genome annotation programs due to the high probability of false discovery. However, improved computational tools along with a high-throughput RIBO-seq approach identified a myriad of translated sORFs. Their importance becomes evident as we are gaining experimental validation of their diverse cellular functions. This Review examines various computational and experimental approaches of sORFs identification as well as provides the summary of our current knowledge of their functional roles in cells.
Collapse
Affiliation(s)
- Anastasia Chugunova
- Lomonosov Moscow State University , Department of Chemistry and A.N. Belozersky Institute of Physico-Chemical Biology, Moscow 119992, Russia.,Skolkovo Institute of Science and Technology , Skolkovo, Moscow Region 143025, Russia
| | - Tsimafei Navalayeu
- Lomonosov Moscow State University , Department of Chemistry and A.N. Belozersky Institute of Physico-Chemical Biology, Moscow 119992, Russia
| | - Olga Dontsova
- Lomonosov Moscow State University , Department of Chemistry and A.N. Belozersky Institute of Physico-Chemical Biology, Moscow 119992, Russia.,Skolkovo Institute of Science and Technology , Skolkovo, Moscow Region 143025, Russia
| | - Petr Sergiev
- Lomonosov Moscow State University , Department of Chemistry and A.N. Belozersky Institute of Physico-Chemical Biology, Moscow 119992, Russia.,Skolkovo Institute of Science and Technology , Skolkovo, Moscow Region 143025, Russia
| |
Collapse
|
14
|
RNA editing by ADAR1 leads to context-dependent transcriptome-wide changes in RNA secondary structure. Nat Commun 2017; 8:1440. [PMID: 29129909 PMCID: PMC5682290 DOI: 10.1038/s41467-017-01458-8] [Citation(s) in RCA: 65] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2016] [Accepted: 09/19/2017] [Indexed: 11/09/2022] Open
Abstract
Adenosine deaminase acting on RNA 1 (ADAR1) is the master RNA editor, catalyzing the deamination of adenosine to inosine. RNA editing is vital for preventing abnormal activation of cytosolic nucleic acid sensing pathways by self-double-stranded RNAs. Here we determine, by parallel analysis of RNA secondary structure sequencing (PARS-seq), the global RNA secondary structure changes in ADAR1 deficient cells. Surprisingly, ADAR1 silencing resulted in a lower global double-stranded to single-stranded RNA ratio, suggesting that A-to-I editing can stabilize a large subset of imperfect RNA duplexes. The duplexes destabilized by editing are composed of vastly complementary inverted Alus found in untranslated regions of genes performing vital biological processes, including housekeeping functions and type-I interferon responses. They are predominantly cytoplasmic and generally demonstrate higher ribosomal occupancy. Our findings imply that the editing effect on RNA secondary structure is context dependent and underline the intricate regulatory role of ADAR1 on global RNA secondary structure.
Collapse
|
15
|
Malaney P, Uversky VN, Davé V. PTEN proteoforms in biology and disease. Cell Mol Life Sci 2017; 74:2783-2794. [PMID: 28289760 PMCID: PMC11107534 DOI: 10.1007/s00018-017-2500-6] [Citation(s) in RCA: 54] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2016] [Revised: 02/23/2017] [Accepted: 03/02/2017] [Indexed: 01/30/2023]
Abstract
Proteoforms are specific molecular forms of protein products arising from a single gene that possess different structures and different functions. Therefore, a single gene can produce a large repertoire of proteoforms by means of allelic variations (mutations, indels, SNPs), alternative splicing and other pre-translational mechanisms, post-translational modifications (PTMs), conformational dynamics, and functioning. Resulting proteoforms that have different sizes, alternative splicing patterns, sets of post-translational modifications, protein-protein interactions, and protein-ligand interactions, might dramatically increase the functionality of the encoded protein. Herein, we have interrogated the tumor suppressor PTEN for its proteoforms and find that this protein exists in multiple forms with distinct functions and sub-cellular localizations. Furthermore, the levels of each PTEN proteoform in a given cell may affect its biological function. Indeed, the paradigm of the continuum model of tumor suppression by PTEN can be better explained by the presence of a continuum of PTEN proteoforms, diversity, and levels of which are associated with pathological outcomes than simply by the different roles of mutations in the PTEN gene. Consequently, understanding the mechanisms underlying the dysregulation of PTEN proteoforms by several genomic and non-genomic mechanisms in cancer and other diseases is imperative. We have identified different PTEN proteoforms, which control various aspects of cellular function and grouped them into three categories of intrinsic, function-induced, and inducible proteoforms. A special emphasis is given to the inducible PTEN proteoforms that are produced due to alternative translational initiation. The novel finding that PTEN forms dimers with biological implications supports the notion that PTEN proteoform-proteoform interactions may play hitherto unknown roles in cellular homeostasis and in pathogenic settings, including cancer. These PTEN proteoforms with unique properties and functionalities offer potential novel therapeutic opportunities in the treatment of various cancers and other diseases.
Collapse
Affiliation(s)
- Prerna Malaney
- Department of Pathology and Cell Biology, Morsani College of Medicine, MDC 64, University of South Florida, 12901 Bruce B Downs Blvd, Tampa, FL, 33612, USA
| | - Vladimir N Uversky
- Department of Molecular Medicine, Morsani College of Medicine, Tampa, FL, 33612, USA
- Laboratory of Structural Dynamics, Stability and Folding of Proteins, Institute of Cytology, Russian Academy of Sciences, Tikhoretsky Ave., Saint Petersburg, Russia, 194064
| | - Vrushank Davé
- Department of Pathology and Cell Biology, Morsani College of Medicine, MDC 64, University of South Florida, 12901 Bruce B Downs Blvd, Tampa, FL, 33612, USA.
- Department of Oncological Sciences, Morsani College of Medicine, University of South Florida, Bruce B. Downs Blvd, Tampa, FL, 33612, USA.
| |
Collapse
|
16
|
Translation complex profile sequencing to study the in vivo dynamics of mRNA–ribosome interactions during translation initiation, elongation and termination. Nat Protoc 2017; 12:697-731. [DOI: 10.1038/nprot.2016.189] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
|
17
|
Calvo SE, Julien O, Clauser KR, Shen H, Kamer KJ, Wells JA, Mootha VK. Comparative Analysis of Mitochondrial N-Termini from Mouse, Human, and Yeast. Mol Cell Proteomics 2017; 16:512-523. [PMID: 28122942 DOI: 10.1074/mcp.m116.063818] [Citation(s) in RCA: 56] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2016] [Revised: 01/06/2017] [Indexed: 01/08/2023] Open
Abstract
The majority of mitochondrial proteins are encoded in the nuclear genome, translated in the cytoplasm, and directed to the mitochondria by an N-terminal presequence that is cleaved upon import. Recently, N-proteome catalogs have been generated for mitochondria from yeast and from human U937 cells. Here, we applied the subtiligase method to determine N-termini for 327 proteins in mitochondria isolated from mouse liver and kidney. Comparative analysis between mitochondrial N-termini from mouse, human, and yeast proteins shows that whereas presequences are poorly conserved at the sequence level, other presequence properties are extremely conserved, including a length of ∼20-60 amino acids, a net charge between +3 to +6, and the presence of stabilizing amino acids at the N-terminus of mature proteins that follow the N-end rule from bacteria. As in yeast, ∼80% of mouse presequence cleavage sites match canonical motifs for three mitochondrial peptidases (MPP, Icp55, and Oct1), whereas the remainder do not match any known peptidase motifs. We show that mature mitochondrial proteins often exist with a spectrum of N-termini, consistent with a model of multiple cleavage events by MPP and Icp55. In addition to analysis of canonical targeting presequences, our N-terminal dataset allows the exploration of other cleavage events and provides support for polypeptide cleavage into two distinct enzymes (Hsd17b4), protein cleavages key for signaling (Oma1, Opa1, Htra2, Mavs, and Bcs2l13), and in several cases suggests novel protein isoforms (Scp2, Acadm, Adck3, Hsdl2, Dlst, and Ogdh). We present an integrated catalog of mammalian mitochondrial N-termini that can be used as a community resource to investigate individual proteins, to elucidate mechanisms of mammalian mitochondrial processing, and to allow researchers to engineer tags distally to the presequence cleavage.
Collapse
Affiliation(s)
- Sarah E Calvo
- From the ‡Howard Hughes Medical Institute, Department of Molecular Biology, Massachusetts General Hospital, Boston, Massachusetts 02114; .,§Department of Systems Biology, Harvard Medical School, Boston, Massachusetts 02115.,¶Broad Institute, Cambridge, Massachusetts 02141
| | | | | | - Hongying Shen
- From the ‡Howard Hughes Medical Institute, Department of Molecular Biology, Massachusetts General Hospital, Boston, Massachusetts 02114.,§Department of Systems Biology, Harvard Medical School, Boston, Massachusetts 02115
| | - Kimberli J Kamer
- From the ‡Howard Hughes Medical Institute, Department of Molecular Biology, Massachusetts General Hospital, Boston, Massachusetts 02114.,§Department of Systems Biology, Harvard Medical School, Boston, Massachusetts 02115
| | - James A Wells
- **Departments of Pharmaceutical Chemistry and.,§§Cellular and Molecular Pharmacology, University of California, San Francisco, California 94143
| | - Vamsi K Mootha
- From the ‡Howard Hughes Medical Institute, Department of Molecular Biology, Massachusetts General Hospital, Boston, Massachusetts 02114.,§Department of Systems Biology, Harvard Medical School, Boston, Massachusetts 02115
| |
Collapse
|
18
|
Andreev DE, O'Connor PBF, Loughran G, Dmitriev SE, Baranov PV, Shatsky IN. Insights into the mechanisms of eukaryotic translation gained with ribosome profiling. Nucleic Acids Res 2016; 45:513-526. [PMID: 27923997 PMCID: PMC5314775 DOI: 10.1093/nar/gkw1190] [Citation(s) in RCA: 105] [Impact Index Per Article: 13.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2016] [Revised: 10/31/2016] [Accepted: 11/18/2016] [Indexed: 12/29/2022] Open
Abstract
The development of Ribosome Profiling (RiboSeq) has revolutionized functional genomics. RiboSeq is based on capturing and sequencing of the mRNA fragments enclosed within the translating ribosome and it thereby provides a ‘snapshot’ of ribosome positions at the transcriptome wide level. Although the method is predominantly used for analysis of differential gene expression and discovery of novel translated ORFs, the RiboSeq data can also be a rich source of information about molecular mechanisms of polypeptide synthesis and translational control. This review will focus on how recent findings made with RiboSeq have revealed important details of the molecular mechanisms of translation in eukaryotes. These include mRNA translation sensitivity to drugs affecting translation initiation and elongation, the roles of upstream ORFs in response to stress, the dynamics of elongation and termination as well as details of intrinsic ribosome behavior on the mRNA after translation termination. As the RiboSeq method is still at a relatively early stage we will also discuss the implications of RiboSeq artifacts on data interpretation.
Collapse
Affiliation(s)
- Dmitry E Andreev
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Moscow 119234, Russia
| | | | - Gary Loughran
- School of Biochemistry and Cell Biology, University College Cork, Cork, Ireland
| | - Sergey E Dmitriev
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Moscow 119234, Russia
| | - Pavel V Baranov
- School of Biochemistry and Cell Biology, University College Cork, Cork, Ireland
| | - Ivan N Shatsky
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Moscow 119234, Russia
| |
Collapse
|