1
|
Yang H, Li Q, Stroup EK, Wang S, Ji Z. Widespread stable noncanonical peptides identified by integrated analyses of ribosome profiling and ORF features. Nat Commun 2024; 15:1932. [PMID: 38431639 PMCID: PMC10908861 DOI: 10.1038/s41467-024-46240-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2023] [Accepted: 02/18/2024] [Indexed: 03/05/2024] Open
Abstract
Studies have revealed dozens of functional peptides in putative 'noncoding' regions and raised the question of how many proteins are encoded by noncanonical open reading frames (ORFs). Here, we comprehensively annotate genome-wide translated ORFs across five eukaryotes (human, mouse, zebrafish, worm, and yeast) by analyzing ribosome profiling data. We develop a logistic regression model named PepScore based on ORF features (expected length, encoded domain, and conservation) to calculate the probability that the encoded peptide is stable in humans. Systematic ectopic expression validates PepScore and shows that stable complex-associating microproteins can be encoded in 5'/3' untranslated regions and overlapping coding regions of mRNAs besides annotated noncoding RNAs. Stable noncanonical proteins follow conventional rules and localize to different subcellular compartments. Inhibition of proteasomal/lysosomal degradation pathways can stabilize some peptides especially those with moderate PepScores, but cannot rescue the expression of short ones with low PepScores suggesting they are directly degraded by cellular proteases. The majority of human noncanonical peptides with high PepScores show longer lengths but low conservation across species/mammals, and hundreds contain trait-associated genetic variants. Our study presents a statistical framework to identify stable noncanonical peptides in the genome and provides a valuable resource for functional characterization of noncanonical translation during development and disease.
Collapse
Affiliation(s)
- Haiwang Yang
- Department of Pharmacology, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA
| | - Qianru Li
- Department of Pharmacology, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA
| | - Emily K Stroup
- Department of Pharmacology, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA
| | - Sheng Wang
- Department of Biomedical Engineering, McCormick School of Engineering, Northwestern University, Evanston, IL, 60628, USA
| | - Zhe Ji
- Department of Pharmacology, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA.
- Department of Biomedical Engineering, McCormick School of Engineering, Northwestern University, Evanston, IL, 60628, USA.
| |
Collapse
|
2
|
Lu Y, Ran Y, Li H, Wen J, Cui X, Zhang X, Guan X, Cheng M. Micropeptides: origins, identification, and potential role in metabolism-related diseases. J Zhejiang Univ Sci B 2023; 24:1106-1122. [PMID: 38057268 PMCID: PMC10710913 DOI: 10.1631/jzus.b2300128] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2023] [Accepted: 06/06/2023] [Indexed: 12/08/2023]
Abstract
With the development of modern sequencing techniques and bioinformatics, genomes that were once thought to be noncoding have been found to encode abundant functional micropeptides (miPs), a kind of small polypeptides. Although miPs are difficult to analyze and identify, a number of studies have begun to focus on them. More and more miPs have been revealed as essential for energy metabolism homeostasis, immune regulation, and tumor growth and development. Many reports have shown that miPs are especially essential for regulating glucose and lipid metabolism and regulating mitochondrial function. MiPs are also involved in the progression of related diseases. This paper reviews the sources and identification of miPs, as well as the functional significance of miPs for metabolism-related diseases, with the aim of revealing their potential clinical applications.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | - Min Cheng
- School of Basic Medicine Sciences, Weifang Medical University, Weifang 261053, China.
| |
Collapse
|
3
|
Jang C, Blume SW, Choi HS. Novel protein products encoded by upstream open reading frames of the MYCN gene in pediatric embryonal tumors. J Cell Biochem 2023; 124:1615-1627. [PMID: 37682868 DOI: 10.1002/jcb.30470] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2023] [Revised: 08/12/2023] [Accepted: 08/28/2023] [Indexed: 09/10/2023]
Abstract
The MYCC and MYCN loci are each associated with two upstream open reading frames (uORFs) potentially encoding small proteins (9-21 kDa). We previously demonstrated that uORFs mrtl and MYCHEX1 of MYCC are translated, and their protein products may function to regulate the expression of the "parent" oncogene. We hypothesized that a similar relationship might exist between MYCN and its two uORFs: MYCNOT and MNOP, and investigated the uORF-encoded proteins associated with MYCN to confirm their expression and intracellular location in neuroblastoma and medulloblastoma cells and tissues. MNOP, MYCNOT, mrtl, and MYCHEX1 were readily detected via reverse transcription polymerase chain reaction and Western blot analysis in tumor cell lines. In tumor tissue, MNOP protein expression was confirmed; however, MCYNOT generated from alternative splicing MYCNΔ1b mRNA was not detected. Immunofluorescence staining of MYCNOT displayed multiple bright foci in the nucleus and diffuse staining in the cytoplasm, suggesting that this small protein may function in both the nucleus and cytoplasm. Upon JQ1 treatment, MYCN, MYCNOT, and mrtl decreased substantially or disappeared completely in three different tumor cell lines. Significant levels of apoptosis were observed in each pediatric embryonal tumor cell line but not T47D breast carcinoma cells, suggesting that response to JQ1 transcriptional inhibition is greatest in tumor cells, which depend on MYC to maintain an undifferentiated phenotype. In conclusion, both MYCN uORF-encoded proteins MNOP and MYCNOT, together with the two MYCC uORF-encoded proteins mrtl and MYCHEX1 were detected simultaneously in tumor cell lines and tumor tissues. These four distinct proteins are translated from the "5'-untranslated region" of MYCN or MYCC mRNA and display consistent distribution patterns within the cell. Additional studies to further elucidate the physiological and pathological roles of these uORF-encoded proteins are warranted, as insights gained could inform new strategies for modulating MYC-family oncogenes by targeting their uORFs.
Collapse
Affiliation(s)
- Chorong Jang
- Department of Pediatrics, Seoul National University College of Medicine, Seoul National University Bundang Hospital, Seongnam, South Korea
| | - Scott W Blume
- Department of Medicine and Comprehensive Cancer Center, University of Alabama at Birmingham, Birmingham, Alabama, USA
| | - Hyoung Soo Choi
- Department of Pediatrics, Seoul National University College of Medicine, Seoul National University Bundang Hospital, Seongnam, South Korea
| |
Collapse
|
4
|
Jürgens L, Wethmar K. The Emerging Role of uORF-Encoded uPeptides and HLA uLigands in Cellular and Tumor Biology. Cancers (Basel) 2022; 14:cancers14246031. [PMID: 36551517 PMCID: PMC9776223 DOI: 10.3390/cancers14246031] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2022] [Revised: 11/29/2022] [Accepted: 11/30/2022] [Indexed: 12/13/2022] Open
Abstract
Recent technological advances have facilitated the detection of numerous non-canonical human peptides derived from regulatory regions of mRNAs, long non-coding RNAs, and other cryptic transcripts. In this review, we first give an overview of the classification of these novel peptides and summarize recent improvements in their annotation and detection by ribosome profiling, mass spectrometry, and individual experimental analysis. A large fraction of the novel peptides originates from translation at upstream open reading frames (uORFs) that are located within the transcript leader sequence of regular mRNA. In humans, uORF-encoded peptides (uPeptides) have been detected in both healthy and malignantly transformed cells and emerge as important regulators in cellular and immunological pathways. In the second part of the review, we focus on various functional implications of uPeptides. As uPeptides frequently act at the transition of translational regulation and individual peptide function, we describe the mechanistic modes of translational regulation through ribosome stalling, the involvement in cellular programs through protein interaction and complex formation, and their role within the human leukocyte antigen (HLA)-associated immunopeptidome as HLA uLigands. We delineate how malignant transformation may lead to the formation of novel uORFs, uPeptides, or HLA uLigands and explain their potential implication in tumor biology. Ultimately, we speculate on a potential use of uPeptides as peptide drugs and discuss how uPeptides and HLA uLigands may facilitate translational inhibition of oncogenic protein messages and immunotherapeutic approaches in cancer therapy.
Collapse
|
5
|
Mohaupt P, Roucou X, Delaby C, Vialaret J, Lehmann S, Hirtz C. The alternative proteome in neurobiology. Front Cell Neurosci 2022; 16:1019680. [PMID: 36467612 PMCID: PMC9712206 DOI: 10.3389/fncel.2022.1019680] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2022] [Accepted: 11/02/2022] [Indexed: 10/13/2023] Open
Abstract
Translation involves the biosynthesis of a protein sequence following the decoding of the genetic information embedded in a messenger RNA (mRNA). Typically, the eukaryotic mRNA was considered to be inherently monocistronic, but this paradigm is not in agreement with the translational landscape of cells, tissues, and organs. Recent ribosome sequencing (Ribo-seq) and proteomics studies show that, in addition to currently annotated reference proteins (RefProt), other proteins termed alternative proteins (AltProts), and microproteins are encoded in regions of mRNAs thought to be untranslated or in transcripts annotated as non-coding. This experimental evidence expands the repertoire of functional proteins within a cell and potentially provides important information on biological processes. This review explores the hitherto overlooked alternative proteome in neurobiology and considers the role of AltProts in pathological and healthy neuromolecular processes.
Collapse
Affiliation(s)
- Pablo Mohaupt
- LBPC-PPC, Université de Montpellier, IRMB CHU de Montpellier, INM INSERM, Montpellier, France
| | - Xavier Roucou
- Department of Biochemistry and Functional Genomics, Université de Sherbrooke, Sherbrooke, QC, Canada
| | - Constance Delaby
- LBPC-PPC, Université de Montpellier, IRMB CHU de Montpellier, INM INSERM, Montpellier, France
| | - Jérôme Vialaret
- LBPC-PPC, Université de Montpellier, IRMB CHU de Montpellier, INM INSERM, Montpellier, France
| | - Sylvain Lehmann
- LBPC-PPC, Université de Montpellier, IRMB CHU de Montpellier, INM INSERM, Montpellier, France
| | - Christophe Hirtz
- LBPC-PPC, Université de Montpellier, IRMB CHU de Montpellier, INM INSERM, Montpellier, France
| |
Collapse
|
6
|
Zheng X, Xiang M. Mitochondrion-located peptides and their pleiotropic physiological functions. FEBS J 2022; 289:6919-6935. [PMID: 35599630 DOI: 10.1111/febs.16532] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2021] [Revised: 05/12/2022] [Accepted: 05/20/2022] [Indexed: 01/13/2023]
Abstract
With the development of advanced technologies, many small open reading frames (sORFs) have been found to be translated into micropeptides. Interestingly, a considerable proportion of micropeptides are located in mitochondria, which are designated here as mitochondrion-located peptides (MLPs). These MLPs often contain a transmembrane domain and show a high degree of conservation across species. They usually act as co-factors of large proteins and play regulatory roles in mitochondria such as electron transport in the respiratory chain, reactive oxygen species (ROS) production, metabolic homeostasis, and so on. Deficiency of MLPs disturbs diverse physiological processes including immunity, differentiation, and metabolism both in vivo and in vitro. These findings reveal crucial functions for MLPs and provide fresh insights into diverse mitochondrion-associated biological processes and diseases.
Collapse
Affiliation(s)
- Xintong Zheng
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Sun Yat-sen University, Guangzhou, China
| | - Mengqing Xiang
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Sun Yat-sen University, Guangzhou, China.,Guangdong Provincial Key Laboratory of Brain Function and Disease, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China
| |
Collapse
|
7
|
Andreev DE, Loughran G, Fedorova AD, Mikhaylova MS, Shatsky IN, Baranov PV. Non-AUG translation initiation in mammals. Genome Biol 2022; 23:111. [PMID: 35534899 PMCID: PMC9082881 DOI: 10.1186/s13059-022-02674-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2021] [Accepted: 04/14/2022] [Indexed: 12/12/2022] Open
Abstract
Recent proteogenomic studies revealed extensive translation outside of annotated protein coding regions, such as non-coding RNAs and untranslated regions of mRNAs. This non-canonical translation is largely due to start codon plurality within the same RNA. This plurality is often due to the failure of some scanning ribosomes to recognize potential start codons leading to initiation downstream—a process termed leaky scanning. Codons other than AUG (non-AUG) are particularly leaky due to their inefficiency. Here we discuss our current understanding of non-AUG initiation. We argue for a near-ubiquitous role of non-AUG initiation in shaping the dynamic composition of mammalian proteomes.
Collapse
|
8
|
Della Bella E, Koch J, Baerenfaller K. Translation and emerging functions of non-coding RNAs in inflammation and immunity. Allergy 2022; 77:2025-2037. [PMID: 35094406 PMCID: PMC9302665 DOI: 10.1111/all.15234] [Citation(s) in RCA: 24] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2021] [Revised: 01/20/2022] [Accepted: 01/24/2022] [Indexed: 12/17/2022]
Abstract
Regulatory non‐coding RNAs (ncRNAs) including small non‐coding RNAs (sRNAs), long non‐coding RNAs (lncRNAs), and circular RNAs (circRNAs) have gained considerable attention in the last few years. This is mainly due to their condition‐ and tissue‐specific expression and their various modes of action, which suggests them as promising biomarkers and therapeutic targets. One important mechanism of ncRNAs to regulate gene expression is through translation of short open reading frames (sORFs). These sORFs can be located in lncRNAs, in non‐translated regions of mRNAs where upstream ORFs (uORFs) represent the majority, or in circRNAs. Regulation of their translation can function as a quick way to adapt protein production to changing cellular or environmental cues, and can either depend solely on the initiation and elongation of translation, or on the roles of the produced functional peptides. Due to the experimental challenges to pinpoint translation events and to detect the produced peptides, translational regulation through regulatory RNAs is not well studied yet. In the case of circRNAs, they have only recently started to be recognized as regulatory molecules instead of mere artifacts of RNA biosynthesis. Of the many roles described for regulatory ncRNAs, we will focus here on their regulation during inflammation and in immunity.
Collapse
Affiliation(s)
| | - Jana Koch
- Swiss Institute of Allergy and Asthma Research (SIAF) University of Zurich Swiss Institute of Bioinformatics (SIB) Davos Switzerland
| | - Katja Baerenfaller
- Swiss Institute of Allergy and Asthma Research (SIAF) University of Zurich Swiss Institute of Bioinformatics (SIB) Davos Switzerland
| |
Collapse
|
9
|
Nomura Y, Dohmae N. Discovery of a small protein-encoding cis-regulatory overlapping gene of the tumor suppressor gene Scribble in humans. Commun Biol 2021; 4:1098. [PMID: 34535749 PMCID: PMC8448870 DOI: 10.1038/s42003-021-02619-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2020] [Accepted: 08/30/2021] [Indexed: 12/26/2022] Open
Abstract
Intensive gene annotation has revealed many functional and regulatory elements in the human genome. Although eukaryotic protein-coding genes are generally transcribed into monocistronic mRNAs, recent studies have discovered additional short open reading frames (sORFs) in mRNAs. Here, we performed proteogenomic data mining for hidden proteins categorized into sORF-encoded polypeptides (SEPs) in human cancers. We identified a new SEP-encoding overlapping sORF (oORF) on the cell polarity determinant Scribble (SCRIB) that is considered a proto-oncogene with tumor suppressor function in Hippo-YAP/TAZ, MAPK/ERK, and PI3K/Akt/mTOR signaling. Reanalysis of clinical human proteomic data revealed translational dysregulation of both SCRIB and its oORF, oSCRIB, during carcinogenesis. Biochemical analyses suggested that the translatable oSCRIB constitutively limits the capacity of eukaryotic ribosomes to translate the downstream SCRIB. These findings provide a new example of cis-regulatory oORFs that function as a ribosomal roadblock and potentially serve as a fail-safe mechanism to normal cells for non-excessive downstream gene expression, which is hijacked in cancer. Yuhta Nomura and Naoshi Dohmae report the discovery of a small protein-coding gene that overlaps the tumor suppressor gene Scribble. Their data suggest that the overlapping gene, oSCRIB, limits the translation of downstream Scribble and may have important implications in cancer.
Collapse
Affiliation(s)
- Yuhta Nomura
- Biomolecular Characterization Unit, RIKEN Center for Sustainable Resource Science, 2-1 Hirosawa, Wako, Saitama, 351-0198, Japan.
| | - Naoshi Dohmae
- Biomolecular Characterization Unit, RIKEN Center for Sustainable Resource Science, 2-1 Hirosawa, Wako, Saitama, 351-0198, Japan.
| |
Collapse
|
10
|
Guerra-Almeida D, Tschoeke DA, da-Fonseca RN. Understanding small ORF diversity through a comprehensive transcription feature classification. DNA Res 2021; 28:6317669. [PMID: 34240112 PMCID: PMC8435553 DOI: 10.1093/dnares/dsab007] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2020] [Indexed: 11/13/2022] Open
Abstract
Small open reading frames (small ORFs/sORFs/smORFs) are potentially coding sequences smaller than 100 codons that have historically been considered junk DNA by gene prediction software and in annotation screening; however, the advent of next-generation sequencing has contributed to the deeper investigation of junk DNA regions and their transcription products, resulting in the emergence of smORFs as a new focus of interest in systems biology. Several smORF peptides were recently reported in noncanonical mRNAs as new players in numerous biological contexts; however, their relevance is still overlooked in coding potential analysis. Hence, this review proposes a smORF classification based on transcriptional features, discussing the most promising approaches to investigate smORFs based on their different characteristics. First, smORFs were divided into nonexpressed (intergenic) and expressed (genic) smORFs. Second, genic smORFs were classified as smORFs located in noncoding RNAs (ncRNAs) or canonical mRNAs. Finally, smORFs in ncRNAs were further subdivided into sequences located in small or long RNAs, whereas smORFs located in canonical mRNAs were subdivided into several specific classes depending on their localization along the gene. We hope that this review provides new insights into large-scale annotations and reinforces the role of smORFs as essential components of a hidden coding DNA world.
Collapse
Affiliation(s)
- Diego Guerra-Almeida
- Institute of Biodiversity and Sustainability, Federal University of Rio de Janeiro, Rio de Janeiro, Brazil
| | - Diogo Antonio Tschoeke
- Alberto Luiz Coimbra Institute of Graduate Studies and Engineering Research (COPPE), Biomedical Engineering Program, Federal University of Rio de Janeiro, Rio de Janeiro, Brazil
| | - Rodrigo Nunes- da-Fonseca
- Institute of Biodiversity and Sustainability, Federal University of Rio de Janeiro, Rio de Janeiro, Brazil.,National Institute of Science and Technology in Molecular Entomology, Rio de Janeiro, Brazil
| |
Collapse
|
11
|
Gagnon M, Savard M, Jacques JF, Bkaily G, Geha S, Roucou X, Gobeil F. Potentiation of B2 receptor signaling by AltB2R, a newly identified alternative protein encoded in the human bradykinin B2 receptor gene. J Biol Chem 2021; 296:100329. [PMID: 33497625 PMCID: PMC7949122 DOI: 10.1016/j.jbc.2021.100329] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2020] [Revised: 01/12/2021] [Accepted: 01/21/2021] [Indexed: 12/27/2022] Open
Abstract
Recent functional and proteomic studies in eukaryotes (www.openprot.org) predict the translation of alternative open reading frames (AltORFs) in mature G-protein-coupled receptor (GPCR) mRNAs, including that of bradykinin B2 receptor (B2R). Our main objective was to determine the implication of a newly discovered AltORF resulting protein, termed AltB2R, in the known signaling properties of B2R using complementary methodological approaches. When ectopically expressed in HeLa cells, AltB2R presented predominant punctate cytoplasmic/perinuclear distribution and apparent cointeraction with B2R at plasma and endosomal/vesicular membranes. The presence of AltB2R increases intracellular [Ca2+] and ERK1/2-MAPK activation (via phosphorylation) following B2R stimulation. Moreover, HEK293A cells expressing mutant B2R lacking concomitant expression of AltB2R displayed significantly decreased maximal responses in agonist-stimulated Gαq-Gαi2/3-protein coupling, IP3 generation, and ERK1/2-MAPK activation as compared with wild-type controls. Conversely, there was no difference in cell-surface density as well as ligand-binding properties of B2R and in efficiencies of cognate agonists at promoting B2R internalization and β-arrestin 2 recruitment. Importantly, both AltB2R and B2R proteins were overexpressed in prostate and breast cancers, compared with their normal counterparts suggesting new associative roles of AltB2R in these diseases. Our study shows that BDKRB2 is a dual-coding gene and identifies AltB2R as a novel positive modulator of some B2R signaling pathways. More broadly, it also supports a new, unexpected alternative proteome for GPCRs, which opens new frontiers in fields of GPCR biology, diseases, and drug discovery.
Collapse
Affiliation(s)
- Maxime Gagnon
- Department of Biochemistry, Université de Sherbrooke, Sherbrooke, Québec, Canada; Institute of Pharmacology, Université de Sherbrooke, Sherbrooke, Québec, Canada
| | - Martin Savard
- Department of Pharmacology & Physiology, Université de Sherbrooke, Sherbrooke, Québec, Canada; Institute of Pharmacology, Université de Sherbrooke, Sherbrooke, Québec, Canada
| | - Jean-François Jacques
- Department of Pharmacology & Physiology, Université de Sherbrooke, Sherbrooke, Québec, Canada
| | - Ghassan Bkaily
- Department of Immunology & Cellular Biology, Université de Sherbrooke, Sherbrooke, Québec, Canada
| | - Sameh Geha
- Department of Pathology, Centre Hospitalier Universitaire de Sherbrooke, Sherbrooke, Québec, Canada
| | - Xavier Roucou
- Department of Biochemistry, Université de Sherbrooke, Sherbrooke, Québec, Canada; Institute of Pharmacology, Université de Sherbrooke, Sherbrooke, Québec, Canada.
| | - Fernand Gobeil
- Department of Pharmacology & Physiology, Université de Sherbrooke, Sherbrooke, Québec, Canada; Institute of Pharmacology, Université de Sherbrooke, Sherbrooke, Québec, Canada.
| |
Collapse
|
12
|
Takahashi H, Miyaki S, Onouchi H, Motomura T, Idesako N, Takahashi A, Murase M, Fukuyoshi S, Endo T, Satou K, Naito S, Itoh M. Exhaustive identification of conserved upstream open reading frames with potential translational regulatory functions from animal genomes. Sci Rep 2020; 10:16289. [PMID: 33004976 PMCID: PMC7530721 DOI: 10.1038/s41598-020-73307-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2020] [Accepted: 09/15/2020] [Indexed: 11/17/2022] Open
Abstract
Upstream open reading frames (uORFs) are present in the 5′-untranslated regions of many eukaryotic mRNAs, and some peptides encoded by these regions play important regulatory roles in controlling main ORF (mORF) translation. We previously developed a novel pipeline, ESUCA, to comprehensively identify plant uORFs encoding functional peptides, based on genome-wide identification of uORFs with conserved peptide sequences (CPuORFs). Here, we applied ESUCA to diverse animal genomes, because animal CPuORFs have been identified only by comparing uORF sequences between a limited number of species, and how many previously identified CPuORFs encode regulatory peptides is unclear. By using ESUCA, 1517 (1373 novel and 144 known) CPuORFs were extracted from four evolutionarily divergent animal genomes. We examined the effects of 17 human CPuORFs on mORF translation using transient expression assays. Through these analyses, we identified seven novel regulatory CPuORFs that repressed mORF translation in a sequence-dependent manner, including one conserved only among Eutheria. We discovered a much higher number of animal CPuORFs than previously identified. Since most human CPuORFs identified in this study are conserved across a wide range of Eutheria or a wider taxonomic range, many CPuORFs encoding regulatory peptides are expected to be found in the identified CPuORFs.
Collapse
Affiliation(s)
- Hiro Takahashi
- Graduate School of Medical Sciences, Kanazawa University, Kanazawa, 920-1192, Japan. .,Graduate School of Horticulture, Chiba University, Matsudo, 271-8510, Japan. .,Fundamental Innovative Oncology Core Center, National Cancer Center, Tokyo, 104-0045, Japan.
| | - Shido Miyaki
- Graduate School of Horticulture, Chiba University, Matsudo, 271-8510, Japan
| | - Hitoshi Onouchi
- Graduate School of Agriculture, Hokkaido University, Sapporo, 060-8589, Japan
| | - Taichiro Motomura
- Graduate School of Medical Sciences, Kanazawa University, Kanazawa, 920-1192, Japan
| | - Nobuo Idesako
- Graduate School of Horticulture, Chiba University, Matsudo, 271-8510, Japan
| | - Anna Takahashi
- Faculty of Information Technologies and Control, Belarusian State University of Informatics and Radio Electronics, 220013, Minsk, Belarus.,College of Bioscience and Biotechnology, Chubu University, Kasugai, 487-8501, Japan
| | - Masataka Murase
- Graduate School of Medical Sciences, Kanazawa University, Kanazawa, 920-1192, Japan
| | - Shuichi Fukuyoshi
- Institute of Medical, Pharmaceutical and Health Sciences, Kanazawa University, Kanazawa, 920-1192, Japan
| | - Toshinori Endo
- Graduate School of Information Science and Technology, Hokkaido University, Sapporo, 060-0814, Japan
| | - Kenji Satou
- Faculty of Biological Science and Technology, Institute of Science and Engineering, Kanazawa University, Kanazawa, 920-1192, Japan
| | - Satoshi Naito
- Graduate School of Agriculture, Hokkaido University, Sapporo, 060-8589, Japan.,Graduate School of Life Science, Hokkaido University, Sapporo, 060-0810, Japan
| | - Motoyuki Itoh
- Graduate School of Pharmaceutical Science, Chiba University, Chiba, 260-8675, Japan.
| |
Collapse
|
13
|
Dever TE, Ivanov IP, Sachs MS. Conserved Upstream Open Reading Frame Nascent Peptides That Control Translation. Annu Rev Genet 2020; 54:237-264. [PMID: 32870728 DOI: 10.1146/annurev-genet-112618-043822] [Citation(s) in RCA: 37] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Cells utilize transcriptional and posttranscriptional mechanisms to alter gene expression in response to environmental cues. Gene-specific controls, including changing the translation of specific messenger RNAs (mRNAs), provide a rapid means to respond precisely to different conditions. Upstream open reading frames (uORFs) are known to control the translation of mRNAs. Recent studies in bacteria and eukaryotes have revealed the functions of evolutionarily conserved uORF-encoded peptides. Some of these uORF-encoded nascent peptides enable responses to specific metabolites to modulate the translation of their mRNAs by stalling ribosomes and through ribosome stalling may also modulate the level of their mRNAs. In this review, we highlight several examples of conserved uORF nascent peptides that stall ribosomes to regulate gene expression in response to specific metabolites in bacteria, fungi, mammals, and plants.
Collapse
Affiliation(s)
- Thomas E Dever
- Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, Maryland 20892, USA; ,
| | - Ivaylo P Ivanov
- Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, Maryland 20892, USA; ,
| | - Matthew S Sachs
- Department of Biology, Texas A&M University, College Station, Texas 77843, USA;
| |
Collapse
|
14
|
Renz PF, Valdivia-Francia F, Sendoel A. Some like it translated: small ORFs in the 5'UTR. Exp Cell Res 2020; 396:112229. [PMID: 32818479 DOI: 10.1016/j.yexcr.2020.112229] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2020] [Revised: 07/28/2020] [Accepted: 08/07/2020] [Indexed: 01/06/2023]
Abstract
The 5' untranslated region (5'UTR) is critical in determining post-transcriptional control, which is partly mediated by short upstream open reading frames (uORFs) present in half of mammalian transcripts. uORFs are generally considered to provide functionally important repression of the main-ORF by engaging initiating ribosomes, but under specific environmental conditions such as cellular stress, uORFs can become essential to activate the translation of the main coding sequence. In addition, a growing number of uORF-encoded bioactive microproteins have been described, which have the potential to significantly increase cellular protein diversity. Here we review the diverse cellular contexts in which uORFs play a critical role and discuss the molecular mechanisms underlying their function and regulation. The progress over the last decades in dissecting uORF function suggests that the 5'UTR remains an exciting frontier towards understanding how the cellular proteome is shaped in health and disease.
Collapse
Affiliation(s)
- Peter F Renz
- Institute for Regenerative Medicine (IREM), University of Zurich, Wagistrasse 12, CH-8952 Schlieren, Switzerland
| | - Fabiola Valdivia-Francia
- Institute for Regenerative Medicine (IREM), University of Zurich, Wagistrasse 12, CH-8952 Schlieren, Switzerland; Life Science Zurich Graduate School, Molecular Life Science Program, University of Zurich/ ETH Zurich, Switzerland
| | - Ataman Sendoel
- Institute for Regenerative Medicine (IREM), University of Zurich, Wagistrasse 12, CH-8952 Schlieren, Switzerland.
| |
Collapse
|
15
|
Jadiya P, Tomar D. Mitochondrial Protein Quality Control Mechanisms. Genes (Basel) 2020; 11:genes11050563. [PMID: 32443488 PMCID: PMC7290828 DOI: 10.3390/genes11050563] [Citation(s) in RCA: 44] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2020] [Revised: 05/14/2020] [Accepted: 05/15/2020] [Indexed: 02/08/2023] Open
Abstract
Mitochondria serve as a hub for many cellular processes, including bioenergetics, metabolism, cellular signaling, redox balance, calcium homeostasis, and cell death. The mitochondrial proteome includes over a thousand proteins, encoded by both the mitochondrial and nuclear genomes. The majority (~99%) of proteins are nuclear encoded that are synthesized in the cytosol and subsequently imported into the mitochondria. Within the mitochondria, polypeptides fold and assemble into their native functional form. Mitochondria health and integrity depend on correct protein import, folding, and regulated turnover termed as mitochondrial protein quality control (MPQC). Failure to maintain these processes can cause mitochondrial dysfunction that leads to various pathophysiological outcomes and the commencement of diseases. Here, we summarize the current knowledge about the role of different MPQC regulatory systems such as mitochondrial chaperones, proteases, the ubiquitin-proteasome system, mitochondrial unfolded protein response, mitophagy, and mitochondria-derived vesicles in the maintenance of mitochondrial proteome and health. The proper understanding of mitochondrial protein quality control mechanisms will provide relevant insights to treat multiple human diseases.
Collapse
Affiliation(s)
- Pooja Jadiya
- Correspondence: (P.J.); (D.T.); Tel.: +1-215-707-9144 (D.T.)
| | - Dhanendra Tomar
- Correspondence: (P.J.); (D.T.); Tel.: +1-215-707-9144 (D.T.)
| |
Collapse
|
16
|
Mitochondrial peptide BRAWNIN is essential for vertebrate respiratory complex III assembly. Nat Commun 2020; 11:1312. [PMID: 32161263 PMCID: PMC7066179 DOI: 10.1038/s41467-020-14999-2] [Citation(s) in RCA: 60] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2019] [Accepted: 02/14/2020] [Indexed: 11/08/2022] Open
Abstract
The emergence of small open reading frame (sORF)-encoded peptides (SEPs) is rapidly expanding the known proteome at the lower end of the size distribution. Here, we show that the mitochondrial proteome, particularly the respiratory chain, is enriched for small proteins. Using a prediction and validation pipeline for SEPs, we report the discovery of 16 endogenous nuclear encoded, mitochondrial-localized SEPs (mito-SEPs). Through functional prediction, proteomics, metabolomics and metabolic flux modeling, we demonstrate that BRAWNIN, a 71 a.a. peptide encoded by C12orf73, is essential for respiratory chain complex III (CIII) assembly. In human cells, BRAWNIN is induced by the energy-sensing AMPK pathway, and its depletion impairs mitochondrial ATP production. In zebrafish, Brawnin deletion causes complete CIII loss, resulting in severe growth retardation, lactic acidosis and early death. Our findings demonstrate that BRAWNIN is essential for vertebrate oxidative phosphorylation. We propose that mito-SEPs are an untapped resource for essential regulators of oxidative metabolism.
Collapse
|
17
|
Mudge JM, Jungreis I, Hunt T, Gonzalez JM, Wright JC, Kay M, Davidson C, Fitzgerald S, Seal R, Tweedie S, He L, Waterhouse RM, Li Y, Bruford E, Choudhary JS, Frankish A, Kellis M. Discovery of high-confidence human protein-coding genes and exons by whole-genome PhyloCSF helps elucidate 118 GWAS loci. Genome Res 2019; 29:2073-2087. [PMID: 31537640 PMCID: PMC6886504 DOI: 10.1101/gr.246462.118] [Citation(s) in RCA: 44] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2018] [Accepted: 09/09/2019] [Indexed: 12/15/2022]
Abstract
The most widely appreciated role of DNA is to encode protein, yet the exact portion of the human genome that is translated remains to be ascertained. We previously developed PhyloCSF, a widely used tool to identify evolutionary signatures of protein-coding regions using multispecies genome alignments. Here, we present the first whole-genome PhyloCSF prediction tracks for human, mouse, chicken, fly, worm, and mosquito. We develop a workflow that uses machine learning to predict novel conserved protein-coding regions and efficiently guide their manual curation. We analyze more than 1000 high-scoring human PhyloCSF regions and confidently add 144 conserved protein-coding genes to the GENCODE gene set, as well as additional coding regions within 236 previously annotated protein-coding genes, and 169 pseudogenes, most of them disabled after primates diverged. The majority of these represent new discoveries, including 70 previously undetected protein-coding genes. The novel coding genes are additionally supported by single-nucleotide variant evidence indicative of continued purifying selection in the human lineage, coding-exon splicing evidence from new GENCODE transcripts using next-generation transcriptomic data sets, and mass spectrometry evidence of translation for several new genes. Our discoveries required simultaneous comparative annotation of other vertebrate genomes, which we show is essential to remove spurious ORFs and to distinguish coding from pseudogene regions. Our new coding regions help elucidate disease-associated regions by revealing that 118 GWAS variants previously thought to be noncoding are in fact protein altering. Altogether, our PhyloCSF data sets and algorithms will help researchers seeking to interpret these genomes, while our new annotations present exciting loci for further experimental characterization.
Collapse
Affiliation(s)
- Jonathan M Mudge
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Irwin Jungreis
- MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, Massachusetts 02139, USA.,Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
| | - Toby Hunt
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Jose Manuel Gonzalez
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - James C Wright
- Functional Proteomics, Division of Cancer Biology, Institute of Cancer Research, London SW7 3RP, United Kingdom
| | - Mike Kay
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Claire Davidson
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Stephen Fitzgerald
- Wellcome Trust Sanger Institute, Hinxton, Cambridge CB10 1SA, United Kingdom
| | - Ruth Seal
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom.,Department of Haematology, University of Cambridge, Cambridge CB2 0PT, United Kingdom
| | - Susan Tweedie
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Liang He
- MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, Massachusetts 02139, USA.,Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
| | - Robert M Waterhouse
- Department of Ecology and Evolution, University of Lausanne, Lausanne 1015, Switzerland.,Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland
| | - Yue Li
- MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, Massachusetts 02139, USA.,Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
| | - Elspeth Bruford
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom.,Department of Haematology, University of Cambridge, Cambridge CB2 0PT, United Kingdom
| | - Jyoti S Choudhary
- Functional Proteomics, Division of Cancer Biology, Institute of Cancer Research, London SW7 3RP, United Kingdom
| | - Adam Frankish
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Manolis Kellis
- MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, Massachusetts 02139, USA.,Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
| |
Collapse
|
18
|
Silva J, Fernandes R, Romão L. Translational Regulation by Upstream Open Reading Frames and Human Diseases. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2019; 1157:99-116. [DOI: 10.1007/978-3-030-19966-1_5] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
|
19
|
|
20
|
Shatsky IN, Terenin IM, Smirnova VV, Andreev DE. Cap-Independent Translation: What's in a Name? Trends Biochem Sci 2018; 43:882-895. [PMID: 29789219 DOI: 10.1016/j.tibs.2018.04.011] [Citation(s) in RCA: 68] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2018] [Revised: 04/15/2018] [Accepted: 04/22/2018] [Indexed: 02/05/2023]
Abstract
Eukaryotic translation initiation relies on the m7G cap present at the 5' end of all mRNAs. Some viral mRNAs employ alternative mechanisms of initiation based on internal ribosome entry. The 'IRES ideology' was adopted by researchers to explain the differential translation of cellular mRNAs when the cap recognition is suppressed. However, some cellular IRESs have already been challenged and others are awaiting their validation. As an alternative cap-independent mechanism, we propose adopting the concept of cap-independent translation enhancers (CITEs) for mammalian mRNAs. Unlike IRESs, CITEs can be located both within 5' and 3' UTRs and bind mRNA-recruiting translational components. The respective 5' UTRs are then inspected by the scanning machinery essentially in the same way as under cap-dependent translation.
Collapse
Affiliation(s)
- Ivan N Shatsky
- Lomonosov Moscow State University, Belozersky Institute of Physico-Chemical Biology, Leninskie Gory 1, Bldg. 40, Moscow 119992, Russia.
| | - Ilya M Terenin
- Lomonosov Moscow State University, Belozersky Institute of Physico-Chemical Biology, Leninskie Gory 1, Bldg. 40, Moscow 119992, Russia; Sechenov First Moscow State Medical University, Institute of Molecular Medicine, Trubetskaya Str. 8-2, 119991, Moscow, Russia
| | - Victoria V Smirnova
- Lomonosov Moscow State University, Belozersky Institute of Physico-Chemical Biology, Leninskie Gory 1, Bldg. 40, Moscow 119992, Russia
| | - Dmitri E Andreev
- Lomonosov Moscow State University, Belozersky Institute of Physico-Chemical Biology, Leninskie Gory 1, Bldg. 40, Moscow 119992, Russia
| |
Collapse
|
21
|
Li H, Xiao L, Zhang L, Wu J, Wei B, Sun N, Zhao Y. FSPP: A Tool for Genome-Wide Prediction of smORF-Encoded Peptides and Their Functions. Front Genet 2018; 9:96. [PMID: 29675032 PMCID: PMC5896265 DOI: 10.3389/fgene.2018.00096] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2018] [Accepted: 03/08/2018] [Indexed: 02/01/2023] Open
Abstract
smORFs are small open reading frames of less than 100 codons. Recent low throughput experiments showed a lot of smORF-encoded peptides (SEPs) played crucial rule in processes such as regulation of transcription or translation, transportation through membranes and the antimicrobial activity. In order to gather more functional SEPs, it is necessary to have access to genome-wide prediction tools to give profound directions for low throughput experiments. In this study, we put forward a functional smORF-encoded peptides predictor (FSPP) which tended to predict authentic SEPs and their functions in a high throughput method. FSPP used the overlap of detected SEPs from Ribo-seq and mass spectrometry as target objects. With the expression data on transcription and translation levels, FSPP built two co-expression networks. Combing co-location relations, FSPP constructed a compound network and then annotated SEPs with functions of adjacent nodes. Tested on 38 sequenced samples of 5 human cell lines, FSPP successfully predicted 856 out of 960 annotated proteins. Interestingly, FSPP also highlighted 568 functional SEPs from these samples. After comparison, the roles predicted by FSPP were consistent with known functions. These results suggest that FSPP is a reliable tool for the identification of functional small peptides. FSPP source code can be acquired at https://www.bioinfo.org/FSPP.
Collapse
Affiliation(s)
- Hui Li
- Key Laboratory of Intelligent Information Processing, Advanced Computer Research Center, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China.,School of Computer and Control Engineering, University of Chinese Academy of Sciences (UCAS), Beijing, China
| | - Li Xiao
- Key Laboratory of Intelligent Information Processing, Advanced Computer Research Center, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
| | - Lili Zhang
- School of Computer and Control Engineering, University of Chinese Academy of Sciences (UCAS), Beijing, China.,CAS Key Laboratory of RNA Biology, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China
| | - Jiarui Wu
- Department of Clinical Pharmacology of Traditional Chinese Medicine, School of Chinese Materia Medica, Beijing University of Chinese Medicine, Beijing, China
| | - Bin Wei
- Key Laboratory of Intelligent Information Processing, Advanced Computer Research Center, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
| | - Ninghui Sun
- State Key Laboratory of Computer Architecture, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
| | - Yi Zhao
- Key Laboratory of Intelligent Information Processing, Advanced Computer Research Center, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
| |
Collapse
|
22
|
Abstract
Peptides encoded by short open reading frames (sORFs) are usually defined as peptides ≤100 aa long. Usually sORFs were ignored by automatic genome annotation programs due to the high probability of false discovery. However, improved computational tools along with a high-throughput RIBO-seq approach identified a myriad of translated sORFs. Their importance becomes evident as we are gaining experimental validation of their diverse cellular functions. This Review examines various computational and experimental approaches of sORFs identification as well as provides the summary of our current knowledge of their functional roles in cells.
Collapse
Affiliation(s)
- Anastasia Chugunova
- Lomonosov Moscow State University , Department of Chemistry and A.N. Belozersky Institute of Physico-Chemical Biology, Moscow 119992, Russia.,Skolkovo Institute of Science and Technology , Skolkovo, Moscow Region 143025, Russia
| | - Tsimafei Navalayeu
- Lomonosov Moscow State University , Department of Chemistry and A.N. Belozersky Institute of Physico-Chemical Biology, Moscow 119992, Russia
| | - Olga Dontsova
- Lomonosov Moscow State University , Department of Chemistry and A.N. Belozersky Institute of Physico-Chemical Biology, Moscow 119992, Russia.,Skolkovo Institute of Science and Technology , Skolkovo, Moscow Region 143025, Russia
| | - Petr Sergiev
- Lomonosov Moscow State University , Department of Chemistry and A.N. Belozersky Institute of Physico-Chemical Biology, Moscow 119992, Russia.,Skolkovo Institute of Science and Technology , Skolkovo, Moscow Region 143025, Russia
| |
Collapse
|
23
|
Jones L, Goode L, Davila E, Brown A, McCarthy DM, Sharma N, Bhide PG, Armata IA. Translational effects and coding potential of an upstream open reading frame associated with DOPA Responsive Dystonia. Biochim Biophys Acta Mol Basis Dis 2017; 1863:1171-1182. [PMID: 28366877 DOI: 10.1016/j.bbadis.2017.03.024] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2016] [Revised: 03/17/2017] [Accepted: 03/29/2017] [Indexed: 01/08/2023]
Abstract
Upstream open reading frames (uORFs) have emerged as major post-transcriptional regulatory elements in eukaryotic species. In general, uORFs are initiated by a translation start codon within the 5' untranslated region of a gene (upstream ATG; uATG), and they are negatively correlated with translational efficiency. In addition to their translational regulatory role, some uORFs can code for biologically active short peptides. The importance of uATGs/uORFs is further underscored by human diseases associated with single nucleotide polymorphisms (SNPs), which disrupt existing uORFs or introduce novel uORFs. Although several functional proteins translated from naturally occurring uORFs have been described, the coding potential of uORFs created by SNPs has been ignored because of the a priori assumption that these proteins are short-lived with no likely impact on protein homeostasis. Thus, studies on SNP-created uORFs are limited to their translational effects, leaving unexplored the potential cellular consequences of a SNP/uORF-encoded protein. Here, we investigate functionality of a uATG/uORF introduced by a +142C>T SNP within the GCH1 gene and associated with a familial form of DOPA Responsive Dystonia. We report that the +142C>T SNP represses GCH1 translation, and introduces a short, frame shifted uORF that encodes a 73-amino acid peptide. This peptide is localized within the nucleus and compromises cell viability upon proteasome inhibition. Our work extends the list of uATG/uORF associated diseases and advances research on peptides translated from SNP-introduced uORFs, a neglected component of the proteome.
Collapse
Affiliation(s)
- Lataisia Jones
- Center for Brain Repair and Department of Biomedical Sciences, College of Medicine, Florida State University, Tallahassee, Florida 32306, USA
| | - Lacy Goode
- Center for Brain Repair and Department of Biomedical Sciences, College of Medicine, Florida State University, Tallahassee, Florida 32306, USA
| | - Eduardo Davila
- Center for Brain Repair and Department of Biomedical Sciences, College of Medicine, Florida State University, Tallahassee, Florida 32306, USA
| | - Amber Brown
- Center for Brain Repair and Department of Biomedical Sciences, College of Medicine, Florida State University, Tallahassee, Florida 32306, USA
| | - Deirdre M McCarthy
- Center for Brain Repair and Department of Biomedical Sciences, College of Medicine, Florida State University, Tallahassee, Florida 32306, USA
| | - Nutan Sharma
- Department of Neurology, Massachusetts General Hospital, Harvard Medical School, Boston, Massachusetts 02114, USA
| | - Pradeep G Bhide
- Center for Brain Repair and Department of Biomedical Sciences, College of Medicine, Florida State University, Tallahassee, Florida 32306, USA.
| | - Ioanna A Armata
- Center for Brain Repair and Department of Biomedical Sciences, College of Medicine, Florida State University, Tallahassee, Florida 32306, USA.
| |
Collapse
|
24
|
New Peptides Under the s(ORF)ace of the Genome. Trends Biochem Sci 2016; 41:665-678. [DOI: 10.1016/j.tibs.2016.05.003] [Citation(s) in RCA: 69] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2016] [Revised: 04/28/2016] [Accepted: 05/03/2016] [Indexed: 01/30/2023]
|
25
|
Mouilleron H, Delcourt V, Roucou X. Death of a dogma: eukaryotic mRNAs can code for more than one protein. Nucleic Acids Res 2016; 44:14-23. [PMID: 26578573 PMCID: PMC4705651 DOI: 10.1093/nar/gkv1218] [Citation(s) in RCA: 63] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2015] [Revised: 10/26/2015] [Accepted: 10/28/2015] [Indexed: 12/13/2022] Open
Abstract
mRNAs carry the genetic information that is translated by ribosomes. The traditional view of a mature eukaryotic mRNA is a molecule with three main regions, the 5' UTR, the protein coding open reading frame (ORF) or coding sequence (CDS), and the 3' UTR. This concept assumes that ribosomes translate one ORF only, generally the longest one, and produce one protein. As a result, in the early days of genomics and bioinformatics, one CDS was associated with each protein-coding gene. This fundamental concept of a single CDS is being challenged by increasing experimental evidence indicating that annotated proteins are not the only proteins translated from mRNAs. In particular, mass spectrometry (MS)-based proteomics and ribosome profiling have detected productive translation of alternative open reading frames. In several cases, the alternative and annotated proteins interact. Thus, the expression of two or more proteins translated from the same mRNA may offer a mechanism to ensure the co-expression of proteins which have functional interactions. Translational mechanisms already described in eukaryotic cells indicate that the cellular machinery is able to translate different CDSs from a single viral or cellular mRNA. In addition to summarizing data showing that the protein coding potential of eukaryotic mRNAs has been underestimated, this review aims to challenge the single translated CDS dogma.
Collapse
Affiliation(s)
- Hélène Mouilleron
- Department of biochemistry, Université de Sherbrooke, Sherbrooke, Quebec J1E 4K8, Canada PROTEO, Quebec Network for Research on Protein Function, Structure, and Engineering, Quebec, Canada
| | - Vivian Delcourt
- Department of biochemistry, Université de Sherbrooke, Sherbrooke, Quebec J1E 4K8, Canada PROTEO, Quebec Network for Research on Protein Function, Structure, and Engineering, Quebec, Canada Inserm U-1192, Laboratoire de Protéomique, Réponse Inflammatoire, Spectrométrie de Masse (PRISM), Université de Lille 1, Cité Scientifique, 59655 Villeneuve D'Ascq, France
| | - Xavier Roucou
- Department of biochemistry, Université de Sherbrooke, Sherbrooke, Quebec J1E 4K8, Canada PROTEO, Quebec Network for Research on Protein Function, Structure, and Engineering, Quebec, Canada
| |
Collapse
|
26
|
Olexiouk V, Menschaert G. Identification of Small Novel Coding Sequences, a Proteogenomics Endeavor. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2016; 926:49-64. [PMID: 27686805 DOI: 10.1007/978-3-319-42316-6_4] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
The identification of small proteins and peptides has consistently proven to be challenging. However, technological advances as well as multi-omics endeavors facilitate the identification of novel small coding sequences, leading to new insights. Specifically, the application of next generation sequencing technologies (NGS), providing accurate and sample specific transcriptome / translatome information, into the proteomics field led to more comprehensive results and new discoveries. This book chapter focuses on the inclusion of RNA-Seq and RIBO-Seq also known as ribosome profiling, an RNA-Seq based technique sequencing the +/- 30 bp long fragments captured by translating ribosomes. We emphasize the identification of micropeptides and neo-antigens, two distinct classes of small translation products, triggering our current understanding of biology. RNA-Seq is capable of capturing sample specific genomic variations, enabling focused neo-antigen identification. RIBO-Seq can identify translation events in small open reading frames which are considered to be non-coding, leading to the discovery of micropeptides. The identification of small translation products requires the integration of multi-omics data, stressing the importance of proteogenomics in this novel research area.
Collapse
Affiliation(s)
- Volodimir Olexiouk
- Lab of Bioinformatics and Computational Genomics (BioBix), Faculty of Bioscience Engineering, Department of Mathematical Modelling, Statistics and Bioinformatics, Ghent University, Coupure Links 653, Building A, Ghent, 9000, Belgium.
| | - Gerben Menschaert
- Lab of Bioinformatics and Computational Genomics (BioBix), Faculty of Bioscience Engineering, Department of Mathematical Modelling, Statistics and Bioinformatics, Ghent University, Coupure Links 653, Building A, Ghent, 9000, Belgium
| |
Collapse
|
27
|
Ji MH, Kim SK, Kim CY, Phi JH, Jun HJ, Blume SW, Choi HS. Physiological Expression and Accumulation of the Products of Two Upstream Open Reading Frames mrtl and MycHex1 Along With p64 and p67 Myc From the Human c-myc Locus. J Cell Biochem 2015; 117:1407-18. [PMID: 26552949 DOI: 10.1002/jcb.25431] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2015] [Accepted: 11/09/2015] [Indexed: 11/05/2022]
Abstract
In addition to the canonical c-Myc p64 and p67 proteins, the human c-myc locus encodes two distinct proteins, mrtl (myc-related translation/localization regulatory factor) and MycHex1 (Myc Human Exon 1), from the upstream open reading frames within the 5'-untranslated region of the c-myc P0 mRNA. The aim of this study is to examine simultaneously, for the first time, mrtl, MycHex1, c-Myc p64, and p67 in human tumor cell lines and pediatric brain tumor tissues. Western blot analysis demonstrated endogenous mrtl, MycHex1, c-Myc p64, and p67 simultaneously. The relative abundance of mrtl and MycHex1 were consistent among a variety of human tumor cell lines, and the relative intensities of mrtl and MycHex1 correlated positively. Confocal imaging revealed mrtl predominantly localized to the nuclear envelope, along with prominent reticular pattern in the cytoplasm. MycHex1 was observed as a series of bright foci located within the nucleus, a subset of which colocalized with fibrillarin. mrtl and MycHex1 co-immunoprecipitated with RACK1, c-Myc, fibrillarin, coilin, and with each other. These findings suggest that mrtl and MycHex1 have multiple interaction partners in both the nucleus and cytoplasm. Sequence analyses confirmed a known polymorphism of mrtl at base 1965 (G>T) and new mutations at bases 1900 (C>G) and 1798 (C>G). Evidence is presented for expression and stable accumulation of all four proteins encoded by three distinct non-overlapping open reading frames within the human c-myc locus. Additional work is warranted to further elucidate the functional or regulatory roles of these molecules in regulation of c-Myc and in oncogenesis.
Collapse
Affiliation(s)
- Mi Hong Ji
- Department of Pediatrics, Seoul National University Bundang Hospital, Seongnam, Republic of Korea
| | - Seung-Ki Kim
- Department of Neurosurgery, Seoul National University Hospital, Seoul National University College of Medicine, Seoul, Republic of Korea
| | - Chae-Yong Kim
- Department of Neurosurgery, Seoul National University Bundang Hospital, Seoul National University College of Medicine, Seoul, Republic of Korea
| | - Ji Hoon Phi
- Department of Neurosurgery, Seoul National University Hospital, Seoul National University College of Medicine, Seoul, Republic of Korea
| | - Hyun Jin Jun
- Department of Pediatrics, Seoul National University Bundang Hospital, Seongnam, Republic of Korea
| | - Scott W Blume
- Department of Medicine and Comprehensive Cancer Center, University of Alabama at Birmingham, Birmingham, Alabama 35294
| | - Hyoung Soo Choi
- Department of Pediatrics, Seoul National University Bundang Hospital, Seongnam, Republic of Korea
| |
Collapse
|
28
|
Mackowiak SD, Zauber H, Bielow C, Thiel D, Kutz K, Calviello L, Mastrobuoni G, Rajewsky N, Kempa S, Selbach M, Obermayer B. Extensive identification and analysis of conserved small ORFs in animals. Genome Biol 2015; 16:179. [PMID: 26364619 PMCID: PMC4568590 DOI: 10.1186/s13059-015-0742-x] [Citation(s) in RCA: 140] [Impact Index Per Article: 15.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2015] [Accepted: 08/05/2015] [Indexed: 02/06/2023] Open
Abstract
Background There is increasing evidence that transcripts or transcript regions annotated as non-coding can harbor functional short open reading frames (sORFs). Loss-of-function experiments have identified essential developmental or physiological roles for a few of the encoded peptides (micropeptides), but genome-wide experimental or computational identification of functional sORFs remains challenging. Results Here, we expand our previously developed method and present results of an integrated computational pipeline for the identification of conserved sORFs in human, mouse, zebrafish, fruit fly, and the nematode C. elegans. Isolating specific conservation signatures indicative of purifying selection on amino acid (rather than nucleotide) sequence, we identify about 2,000 novel small ORFs located in the untranslated regions of canonical mRNAs or on transcripts annotated as non-coding. Predicted sORFs show stronger conservation signatures than those identified in previous studies and are sometimes conserved over large evolutionary distances. The encoded peptides have little homology to known proteins and are enriched in disordered regions and short linear interaction motifs. Published ribosome profiling data indicate translation of more than 100 novel sORFs, and mass spectrometry data provide evidence for more than 70 novel candidates. Conclusions Taken together, we identify hundreds of previously unknown conserved sORFs in major model organisms. Our computational analyses and integration with experimental data show that these sORFs are expressed, often translated, and sometimes widely conserved, in some cases even between vertebrates and invertebrates. We thus provide an integrated resource of putatively functional micropeptides for functional validation in vivo. Electronic supplementary material The online version of this article (doi:10.1186/s13059-015-0742-x) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Sebastian D Mackowiak
- Berlin Institute for Medical Systems Biology, Max-Delbrück-Center for Molecular Medicine, Robert-Rössle-Str. 10, 13125, Berlin, Germany.
| | - Henrik Zauber
- Berlin Institute for Medical Systems Biology, Max-Delbrück-Center for Molecular Medicine, Robert-Rössle-Str. 10, 13125, Berlin, Germany.
| | - Chris Bielow
- Berlin Institute for Medical Systems Biology, Max-Delbrück-Center for Molecular Medicine, Robert-Rössle-Str. 10, 13125, Berlin, Germany. .,Berlin Institute of Health, Kapelle-Ufer 2, 10117, Berlin, Germany.
| | - Denise Thiel
- Berlin Institute for Medical Systems Biology, Max-Delbrück-Center for Molecular Medicine, Robert-Rössle-Str. 10, 13125, Berlin, Germany.
| | - Kamila Kutz
- Berlin Institute for Medical Systems Biology, Max-Delbrück-Center for Molecular Medicine, Robert-Rössle-Str. 10, 13125, Berlin, Germany.
| | - Lorenzo Calviello
- Berlin Institute for Medical Systems Biology, Max-Delbrück-Center for Molecular Medicine, Robert-Rössle-Str. 10, 13125, Berlin, Germany.
| | - Guido Mastrobuoni
- Berlin Institute for Medical Systems Biology, Max-Delbrück-Center for Molecular Medicine, Robert-Rössle-Str. 10, 13125, Berlin, Germany.
| | - Nikolaus Rajewsky
- Berlin Institute for Medical Systems Biology, Max-Delbrück-Center for Molecular Medicine, Robert-Rössle-Str. 10, 13125, Berlin, Germany.
| | - Stefan Kempa
- Berlin Institute for Medical Systems Biology, Max-Delbrück-Center for Molecular Medicine, Robert-Rössle-Str. 10, 13125, Berlin, Germany.
| | - Matthias Selbach
- Berlin Institute for Medical Systems Biology, Max-Delbrück-Center for Molecular Medicine, Robert-Rössle-Str. 10, 13125, Berlin, Germany.
| | - Benedikt Obermayer
- Berlin Institute for Medical Systems Biology, Max-Delbrück-Center for Molecular Medicine, Robert-Rössle-Str. 10, 13125, Berlin, Germany.
| |
Collapse
|
29
|
Emerging evidence for functional peptides encoded by short open reading frames. Nat Rev Genet 2014; 15:193-204. [PMID: 24514441 DOI: 10.1038/nrg3520] [Citation(s) in RCA: 382] [Impact Index Per Article: 38.2] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
Short open reading frames (sORFs) are a common feature of all genomes, but their coding potential has mostly been disregarded, partly because of the difficulty in determining whether these sequences are translated. Recent innovations in computing, proteomics and high-throughput analyses of translation start sites have begun to address this challenge and have identified hundreds of putative coding sORFs. The translation of some of these has been confirmed, although the contribution of their peptide products to cellular functions remains largely unknown. This Review examines this hitherto overlooked component of the proteome and considers potential roles for sORF-encoded peptides.
Collapse
|
30
|
Vanderperre B, Lucier JF, Bissonnette C, Motard J, Tremblay G, Vanderperre S, Wisztorski M, Salzet M, Boisvert FM, Roucou X. Direct detection of alternative open reading frames translation products in human significantly expands the proteome. PLoS One 2013; 8:e70698. [PMID: 23950983 PMCID: PMC3741303 DOI: 10.1371/journal.pone.0070698] [Citation(s) in RCA: 147] [Impact Index Per Article: 13.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2013] [Accepted: 06/20/2013] [Indexed: 01/04/2023] Open
Abstract
A fully mature mRNA is usually associated to a reference open reading frame encoding a single protein. Yet, mature mRNAs contain unconventional alternative open reading frames (AltORFs) located in untranslated regions (UTRs) or overlapping the reference ORFs (RefORFs) in non-canonical +2 and +3 reading frames. Although recent ribosome profiling and footprinting approaches have suggested the significant use of unconventional translation initiation sites in mammals, direct evidence of large-scale alternative protein expression at the proteome level is still lacking. To determine the contribution of alternative proteins to the human proteome, we generated a database of predicted human AltORFs revealing a new proteome mainly composed of small proteins with a median length of 57 amino acids, compared to 344 amino acids for the reference proteome. We experimentally detected a total of 1,259 alternative proteins by mass spectrometry analyses of human cell lines, tissues and fluids. In plasma and serum, alternative proteins represent up to 55% of the proteome and may be a potential unsuspected new source for biomarkers. We observed constitutive co-expression of RefORFs and AltORFs from endogenous genes and from transfected cDNAs, including tumor suppressor p53, and provide evidence that out-of-frame clones representing AltORFs are mistakenly rejected as false positive in cDNAs screening assays. Functional importance of alternative proteins is strongly supported by significant evolutionary conservation in vertebrates, invertebrates, and yeast. Our results imply that coding of multiple proteins in a single gene by the use of AltORFs may be a common feature in eukaryotes, and confirm that translation of unconventional ORFs generates an as yet unexplored proteome.
Collapse
Affiliation(s)
- Benoît Vanderperre
- Département de biochimie, Faculté de Médecine et des Sciences de la Santé, Université de Sherbrooke, Québec, Canada
| | - Jean-François Lucier
- Département de microbiologie, Faculté de Médecine et des Sciences de la Santé, Université de Sherbrooke, Québec, Canada
| | - Cyntia Bissonnette
- Département de biochimie, Faculté de Médecine et des Sciences de la Santé, Université de Sherbrooke, Québec, Canada
| | - Julie Motard
- Département de biochimie, Faculté de Médecine et des Sciences de la Santé, Université de Sherbrooke, Québec, Canada
| | - Guillaume Tremblay
- Département de biochimie, Faculté de Médecine et des Sciences de la Santé, Université de Sherbrooke, Québec, Canada
| | - Solène Vanderperre
- Département de biochimie, Faculté de Médecine et des Sciences de la Santé, Université de Sherbrooke, Québec, Canada
| | - Maxence Wisztorski
- PRISM, Laboratoire de Protéomique, Réponse Inflammatoire, Spectrométrie de Masse, EA 4550, SN3, Université Lille 1, Villeneuve d'Ascq, France
| | - Michel Salzet
- PRISM, Laboratoire de Protéomique, Réponse Inflammatoire, Spectrométrie de Masse, EA 4550, SN3, Université Lille 1, Villeneuve d'Ascq, France
| | - François-Michel Boisvert
- Département d'anatomie et de biologie cellulaire, Faculté de Médecine et des Sciences de la Santé, Université de Sherbrooke, Québec, Canada
| | - Xavier Roucou
- Département de biochimie, Faculté de Médecine et des Sciences de la Santé, Université de Sherbrooke, Québec, Canada
- * E-mail:
| |
Collapse
|
31
|
A perspective on mammalian upstream open reading frame function. Int J Biochem Cell Biol 2013; 45:1690-700. [PMID: 23624144 PMCID: PMC7172355 DOI: 10.1016/j.biocel.2013.04.020] [Citation(s) in RCA: 146] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2013] [Revised: 04/16/2013] [Accepted: 04/17/2013] [Indexed: 12/11/2022]
Abstract
Post-transcriptional control makes a major contribution to the overall regulation of gene expression pathway. Within the cytoplasm this is mediated by a combination of regulatory RNA motifs within the 5′ and 3′ untranslated regions of mRNAs and their interacting protein/RNA partners. One of the most common regulatory RNA elements in mammalian transcripts (present in approximately 40% of all mRNAs) are upstream open reading frames (uORFs). However, despite the prevalence of these RNA elements how they function is not well understood. In general, they act to repress translation of the physiological ORF under control conditions, and under certain pathophysiological stresses this repression can be alleviated. It is known that re-initiation following the translation of an uORF is utilised in some situations however there are numerous alternative mechanisms that control the synthesis of a protein whose mRNA contains uORFs. Moreover, the trans-acting factors that are also involved in this process are not well defined. In this review we summarise our current understanding of this area and highlight some common features of these RNA motifs that have been discovered to date.
Collapse
|