1
|
Rich A, Acar O, Carvunis AR. Massively integrated coexpression analysis reveals transcriptional regulation, evolution and cellular implications of the yeast noncanonical translatome. Genome Biol 2024; 25:183. [PMID: 38978079 PMCID: PMC11232214 DOI: 10.1186/s13059-024-03287-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2023] [Accepted: 05/20/2024] [Indexed: 07/10/2024] Open
Abstract
BACKGROUND Recent studies uncovered pervasive transcription and translation of thousands of noncanonical open reading frames (nORFs) outside of annotated genes. The contribution of nORFs to cellular phenotypes is difficult to infer using conventional approaches because nORFs tend to be short, of recent de novo origins, and lowly expressed. Here we develop a dedicated coexpression analysis framework that accounts for low expression to investigate the transcriptional regulation, evolution, and potential cellular roles of nORFs in Saccharomyces cerevisiae. RESULTS Our results reveal that nORFs tend to be preferentially coexpressed with genes involved in cellular transport or homeostasis but rarely with genes involved in RNA processing. Mechanistically, we discover that young de novo nORFs located downstream of conserved genes tend to leverage their neighbors' promoters through transcription readthrough, resulting in high coexpression and high expression levels. Transcriptional piggybacking also influences the coexpression profiles of young de novo nORFs located upstream of genes, but to a lesser extent and without detectable impact on expression levels. Transcriptional piggybacking influences, but does not determine, the transcription profiles of de novo nORFs emerging nearby genes. About 40% of nORFs are not strongly coexpressed with any gene but are transcriptionally regulated nonetheless and tend to form entirely new transcription modules. We offer a web browser interface ( https://carvunislab.csb.pitt.edu/shiny/coexpression/ ) to efficiently query, visualize, and download our coexpression inferences. CONCLUSIONS Our results suggest that nORF transcription is highly regulated. Our coexpression dataset serves as an unprecedented resource for unraveling how nORFs integrate into cellular networks, contribute to cellular phenotypes, and evolve.
Collapse
Affiliation(s)
- April Rich
- Joint Carnegie Mellon University-University of Pittsburgh, University of Pittsburgh Computational Biology PhD Program, University of Pittsburgh, Pittsburgh, PA, USA
- Department of Computational and Systems Biology, University of Pittsburgh School of Medicine, Pittsburgh, PA, USA
- Pittsburgh Center for Evolutionary Biology and Medicine (CEBaM), University of Pittsburgh, Pittsburgh, PA, USA
| | - Omer Acar
- Joint Carnegie Mellon University-University of Pittsburgh, University of Pittsburgh Computational Biology PhD Program, University of Pittsburgh, Pittsburgh, PA, USA
- Department of Computational and Systems Biology, University of Pittsburgh School of Medicine, Pittsburgh, PA, USA
- Pittsburgh Center for Evolutionary Biology and Medicine (CEBaM), University of Pittsburgh, Pittsburgh, PA, USA
| | - Anne-Ruxandra Carvunis
- Department of Computational and Systems Biology, University of Pittsburgh School of Medicine, Pittsburgh, PA, USA.
- Pittsburgh Center for Evolutionary Biology and Medicine (CEBaM), University of Pittsburgh, Pittsburgh, PA, USA.
| |
Collapse
|
2
|
Bonnet C, Dian AL, Espie-Caullet T, Fabbri L, Lagadec L, Pivron T, Dutertre M, Luco R, Navickas A, Vagner S, Verga D, Uguen P. Post-transcriptional gene regulation: From mechanisms to RNA chemistry and therapeutics. Bull Cancer 2024; 111:782-790. [PMID: 38824069 DOI: 10.1016/j.bulcan.2024.04.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2024] [Revised: 03/22/2024] [Accepted: 04/03/2024] [Indexed: 06/03/2024]
Abstract
A better understanding of the RNA biology and chemistry is necessary to then develop new RNA therapeutic strategies. This review is the synthesis of a series of conferences that took place during the 6th international course on post-transcriptional gene regulation at Institut Curie. This year, the course made a special focus on RNA chemistry.
Collapse
Affiliation(s)
- Clara Bonnet
- CNRS UMR3348 Genome integrity, RNA and Cancer, Institut Curie, University Paris-Saclay, 91401 Orsay, France
| | - Ana Luisa Dian
- CNRS UMR3348 Genome integrity, RNA and Cancer, Institut Curie, University Paris-Saclay, 91401 Orsay, France
| | - Tristan Espie-Caullet
- CNRS UMR3348 Genome integrity, RNA and Cancer, Institut Curie, University Paris-Saclay, 91401 Orsay, France
| | - Lucilla Fabbri
- CNRS UMR3348 Genome integrity, RNA and Cancer, Institut Curie, University Paris-Saclay, 91401 Orsay, France
| | - Lucie Lagadec
- CNRS UMR3348 Genome integrity, RNA and Cancer, Institut Curie, University Paris-Saclay, 91401 Orsay, France
| | - Thibaud Pivron
- CNRS UMR3348 Genome integrity, RNA and Cancer, Institut Curie, University Paris-Saclay, 91401 Orsay, France
| | - Martin Dutertre
- CNRS UMR3348 Genome integrity, RNA and Cancer, Institut Curie, University Paris-Saclay, 91401 Orsay, France
| | - Reini Luco
- CNRS UMR3348 Genome integrity, RNA and Cancer, Institut Curie, University Paris-Saclay, 91401 Orsay, France
| | - Albertas Navickas
- CNRS UMR3348 Genome integrity, RNA and Cancer, Institut Curie, University Paris-Saclay, 91401 Orsay, France
| | - Stephan Vagner
- CNRS UMR3348 Genome integrity, RNA and Cancer, Institut Curie, University Paris-Saclay, 91401 Orsay, France
| | - Daniela Verga
- CNRS UMR9187, Inserm U1196, Chemistry and Modelling for the Biology of Cancer, Institut Curie, université Paris-Saclay, 91405 Orsay, France
| | - Patricia Uguen
- CNRS UMR3348 Genome integrity, RNA and Cancer, Institut Curie, University Paris-Saclay, 91401 Orsay, France.
| |
Collapse
|
3
|
Malekos E, Montano C, Carpenter S. CRISPRware: an efficient method for contextual gRNA library design. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.18.599405. [PMID: 38948878 PMCID: PMC11213142 DOI: 10.1101/2024.06.18.599405] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/02/2024]
Abstract
We present CRISPRware, an efficient method for generating guide RNA (gRNA) libraries against transcribed, translated, and noncoding regions. CRISPRware leverages next-generation sequencing data to design context-specific gRNAs and accounts for genetic variation, which allows allele-specific guide design on a genome-wide scale. The latter ability holds promise for the development of gene therapy in the context of gene dosing and dominant negative mutations.
Collapse
Affiliation(s)
- Eric Malekos
- Department of Biomolecular Engineering, University of California Santa Cruz, California, USA
| | - Christy Montano
- Department of Molecular, Cell, and Developmental Biology, University of California Santa Cruz, California, USA
| | - Susan Carpenter
- Department of Molecular, Cell, and Developmental Biology, University of California Santa Cruz, California, USA
| |
Collapse
|
4
|
Wen K, Chen X, Gu J, Chen Z, Wang Z. Beyond traditional translation: ncRNA derived peptides as modulators of tumor behaviors. J Biomed Sci 2024; 31:63. [PMID: 38877495 PMCID: PMC11177406 DOI: 10.1186/s12929-024-01047-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2023] [Accepted: 05/24/2024] [Indexed: 06/16/2024] Open
Abstract
Within the intricate tapestry of molecular research, noncoding RNAs (ncRNAs) were historically overshadowed by a pervasive presumption of their inability to encode proteins or peptides. However, groundbreaking revelations have challenged this notion, unveiling select ncRNAs that surprisingly encode peptides specifically those nearing a succinct 100 amino acids. At the forefront of this epiphany stand lncRNAs and circRNAs, distinctively characterized by their embedded small open reading frames (sORFs). Increasing evidence has revealed different functions and mechanisms of peptides/proteins encoded by ncRNAs in cancer, including promotion or inhibition of cancer cell proliferation, cellular metabolism (glucose metabolism and lipid metabolism), and promotion or concerted metastasis of cancer cells. The discoveries not only accentuate the depth of ncRNA functionality but also open novel avenues for oncological research and therapeutic innovations. The main difficulties in the study of these ncRNA-derived peptides hinge crucially on precise peptide detection and sORFs identification. Here, we illuminate cutting-edge methodologies, essential instrumentation, and dedicated databases tailored for unearthing sORFs and peptides. In addition, we also conclude the potential of clinical applications in cancer therapy.
Collapse
Affiliation(s)
- Kang Wen
- Cancer Medical Center, The Second Affiliated Hospital of Nanjing Medical University, Nanjing, Jiangsu, 210011, P.R. China
| | - Xin Chen
- Cancer Medical Center, The Second Affiliated Hospital of Nanjing Medical University, Nanjing, Jiangsu, 210011, P.R. China
| | - Jingyao Gu
- Cancer Medical Center, The Second Affiliated Hospital of Nanjing Medical University, Nanjing, Jiangsu, 210011, P.R. China
| | - Zhenyao Chen
- Department of Respiratory Endoscopy, Shanghai Chest Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200030, P.R. China.
- Department of Oncology, Shanghai Medical College, Fudan University, Shanghai, 200032, China.
| | - Zhaoxia Wang
- Cancer Medical Center, The Second Affiliated Hospital of Nanjing Medical University, Nanjing, Jiangsu, 210011, P.R. China.
| |
Collapse
|
5
|
Liu T, Qiao H, Wang Z, Yang X, Pan X, Yang Y, Ye X, Sakurai T, Lin H, Zhang Y. CodLncScape Provides a Self-Enriching Framework for the Systematic Collection and Exploration of Coding LncRNAs. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2024; 11:e2400009. [PMID: 38602457 PMCID: PMC11165466 DOI: 10.1002/advs.202400009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/01/2024] [Revised: 03/19/2024] [Indexed: 04/12/2024]
Abstract
Recent studies have revealed that numerous lncRNAs can translate proteins under specific conditions, performing diverse biological functions, thus termed coding lncRNAs. Their comprehensive landscape, however, remains elusive due to this field's preliminary and dispersed nature. This study introduces codLncScape, a framework for coding lncRNA exploration consisting of codLncDB, codLncFlow, codLncWeb, and codLncNLP. Specifically, it contains a manually compiled knowledge base, codLncDB, encompassing 353 coding lncRNA entries validated by experiments. Building upon codLncDB, codLncFlow investigates the expression characteristics of these lncRNAs and their diagnostic potential in the pan-cancer context, alongside their association with spermatogenesis. Furthermore, codLncWeb emerges as a platform for storing, browsing, and accessing knowledge concerning coding lncRNAs within various programming environments. Finally, codLncNLP serves as a knowledge-mining tool to enhance the timely content inclusion and updates within codLncDB. In summary, this study offers a well-functioning, content-rich ecosystem for coding lncRNA research, aiming to accelerate systematic studies in this field.
Collapse
Affiliation(s)
- Tianyuan Liu
- Tsukuba Life Science Innovation ProgramUniversity of TsukubaTsukuba3058577Japan
| | - Huiyuan Qiao
- Innovative Institute of Chinese Medicine and PharmacyAcademy for InterdisciplineChengdu University of Traditional Chinese MedicineChengdu611137China
| | - Zixu Wang
- Department of Computer ScienceUniversity of TsukubaTsukuba3058577Japan
| | - Xinyan Yang
- Department of Developmental BiologySchool of Basic Medical SciencesSouthern Medical UniversityGuangzhou510515China
| | - Xianrun Pan
- Innovative Institute of Chinese Medicine and PharmacyAcademy for InterdisciplineChengdu University of Traditional Chinese MedicineChengdu611137China
| | - Yu Yang
- School of Healthcare TechnologyChengdu Neusoft UniversityChengdu611844China
| | - Xiucai Ye
- Tsukuba Life Science Innovation ProgramUniversity of TsukubaTsukuba3058577Japan
- Department of Computer ScienceUniversity of TsukubaTsukuba3058577Japan
| | - Tetsuya Sakurai
- Tsukuba Life Science Innovation ProgramUniversity of TsukubaTsukuba3058577Japan
- Department of Computer ScienceUniversity of TsukubaTsukuba3058577Japan
| | - Hao Lin
- School of Life Science and TechnologyUniversity of Electronic Science and Technology of ChinaChengdu611731China
| | - Yang Zhang
- Innovative Institute of Chinese Medicine and PharmacyAcademy for InterdisciplineChengdu University of Traditional Chinese MedicineChengdu611137China
| |
Collapse
|
6
|
Dasgupta A, Prensner JR. Upstream open reading frames: new players in the landscape of cancer gene regulation. NAR Cancer 2024; 6:zcae023. [PMID: 38774471 PMCID: PMC11106035 DOI: 10.1093/narcan/zcae023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2023] [Revised: 04/29/2024] [Accepted: 05/07/2024] [Indexed: 05/24/2024] Open
Abstract
The translation of RNA by ribosomes represents a central biological process and one of the most dysregulated processes in cancer. While translation is traditionally thought to occur exclusively in the protein-coding regions of messenger RNAs (mRNAs), recent transcriptome-wide approaches have shown abundant ribosome activity across diverse stretches of RNA transcripts. The most common type of this kind of ribosome activity occurs in gene leader sequences, also known as 5' untranslated regions (UTRs) of the mRNA, that precede the main coding sequence. Translation of these upstream open reading frames (uORFs) is now known to occur in upwards of 25% of all protein-coding genes. With diverse functions from RNA regulation to microprotein generation, uORFs are rapidly igniting a new arena of cancer biology, where they are linked to cancer genetics, cancer signaling, and tumor-immune interactions. This review focuses on the contributions of uORFs and their associated 5'UTR sequences to cancer biology.
Collapse
Affiliation(s)
- Anwesha Dasgupta
- Chad Carr Pediatric Brain Tumor Center, University of Michigan Medical School, Ann Arbor, MI 48109, USA
- Department of Pediatrics, Division of Pediatric Hematology/Oncology, University of Michigan Medical School, Ann Arbor, MI 48109, USA
- Department of Biological Chemistry, University of Michigan Medical School, Ann Arbor, MI 48109, USA
| | - John R Prensner
- Chad Carr Pediatric Brain Tumor Center, University of Michigan Medical School, Ann Arbor, MI 48109, USA
- Department of Pediatrics, Division of Pediatric Hematology/Oncology, University of Michigan Medical School, Ann Arbor, MI 48109, USA
- Department of Biological Chemistry, University of Michigan Medical School, Ann Arbor, MI 48109, USA
| |
Collapse
|
7
|
Kato A, Iwasaki R, Takeshima K, Maruzuru Y, Koyanagi N, Natsume T, Kusano H, Adachi S, Kawano S, Kawaguchi Y. Identification of a novel neurovirulence factor encoded by the cryptic orphan gene UL31.6 of herpes simplex virus 1. J Virol 2024:e0074724. [PMID: 38819171 DOI: 10.1128/jvi.00747-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2024] [Accepted: 05/03/2024] [Indexed: 06/01/2024] Open
Abstract
Although the herpes simplex virus type 1 (HSV-1) genome was thought to contain approximately 80 different protein coding sequences (CDSs), recent multi-omics analyses reported HSV-1 encodes more than 200 potential CDSs. However, few of the newly identified CDSs were confirmed to be expressed at the peptide or protein level in HSV-1-infected cells. Furthermore, the impact of the proteins they encode on HSV-1 infection is largely unknown. This study focused on a newly identified CDS, UL31.6. Re-analyzation of our previous chemical proteomics data verified that UL31.6 was expressed at the peptide level in HSV-1-infected cells. Antisera raised against a viral protein encoded by UL31.6 (pUL31.6) reacted with a protein with an approximate molecular mass of 37 kDa in lysates of Vero cells infected with each of three HSV-1 strains. pUL31.6 was efficiently dissociated from virions in high-salt solution. A UL31.6-null mutation had a minimal effect on HSV-1 gene expression, replication, cell-to-cell spread, and morphogenesis in Vero cells; in contrast, it significantly reduced HSV-1 cell-to-cell spread in three neural cells but not in four non-neural cells including Vero cells. The UL31.6-null mutation also significantly reduced the mortality and viral replication in the brains of mice after intracranial infection, but had minimal effects on pathogenic manifestations in and around the eyes, and viral replication detected in the tear films of mice after ocular infection. These results indicated that pUL31.6 was a tegument protein and specifically acted as a neurovirulence factor by potentially promoting viral transmission between neuronal cells in the central nervous system.IMPORTANCERecent multi-omics analyses reported the herpes simplex virus type 1 (HSV-1) genome encodes an additional number of potential coding sequences (CDSs). However, the expressions of these CDSs at the peptide or protein levels and the biological effects of these CDSs on HSV-1 infection remain largely unknown. This study annotated a cryptic orphan CDS, termed UL31.6, an HSV-1 gene that encodes a tegument protein with an approximate molecular mass of 37 kDa, which specifically acts as a neurovirulence factor. Our study indicates that HSV-1 proteins important for viral pathogenesis remain to be identified and a comprehensive understanding of the pathogenesis of HSV-1 will require not only the identification of cryptic orphan CDSs using emerging technologies but also step-by-step and in-depth analyses of each of the cryptic orphan CDSs.
Collapse
Affiliation(s)
- Akihisa Kato
- Division of Molecular Virology, Department of Microbiology and Immunology, The Institute of Medical Science, The University of Tokyo, Tokyo, Japan
- Department of Infectious Disease Control, International Research Center for Infectious Diseases, The Institute of Medical Science, The University of Tokyo, Tokyo, Japan
- Research Center for Asian Infectious Diseases, The Institute of Medical Science, The University of Tokyo, Tokyo, Japan
- PRESTO, Japan Science and Technology Agency (JST), Kawaguchi, Japan
| | - Ryoji Iwasaki
- Division of Molecular Virology, Department of Microbiology and Immunology, The Institute of Medical Science, The University of Tokyo, Tokyo, Japan
- Department of Infectious Disease Control, International Research Center for Infectious Diseases, The Institute of Medical Science, The University of Tokyo, Tokyo, Japan
| | - Kousuke Takeshima
- Division of Molecular Virology, Department of Microbiology and Immunology, The Institute of Medical Science, The University of Tokyo, Tokyo, Japan
- Department of Infectious Disease Control, International Research Center for Infectious Diseases, The Institute of Medical Science, The University of Tokyo, Tokyo, Japan
| | - Yuhei Maruzuru
- Division of Molecular Virology, Department of Microbiology and Immunology, The Institute of Medical Science, The University of Tokyo, Tokyo, Japan
- Department of Infectious Disease Control, International Research Center for Infectious Diseases, The Institute of Medical Science, The University of Tokyo, Tokyo, Japan
| | - Naoto Koyanagi
- Division of Molecular Virology, Department of Microbiology and Immunology, The Institute of Medical Science, The University of Tokyo, Tokyo, Japan
- Department of Infectious Disease Control, International Research Center for Infectious Diseases, The Institute of Medical Science, The University of Tokyo, Tokyo, Japan
| | - Tohru Natsume
- Molecular Profiling Research Center for Drug Discovery (molprof), National Institute of Advanced Industrial Science and Technology (AIST), Tokyo, Japan
| | - Hideo Kusano
- Molecular Profiling Research Center for Drug Discovery (molprof), National Institute of Advanced Industrial Science and Technology (AIST), Tokyo, Japan
- Department of Proteomics, National Cancer Center Research institute, Tokyo, Japan
| | - Shungo Adachi
- Molecular Profiling Research Center for Drug Discovery (molprof), National Institute of Advanced Industrial Science and Technology (AIST), Tokyo, Japan
- Department of Proteomics, National Cancer Center Research institute, Tokyo, Japan
| | - Shuichi Kawano
- Faculty of Mathematics, Kyushu University, Fukuoka, Japan
| | - Yasushi Kawaguchi
- Division of Molecular Virology, Department of Microbiology and Immunology, The Institute of Medical Science, The University of Tokyo, Tokyo, Japan
- Department of Infectious Disease Control, International Research Center for Infectious Diseases, The Institute of Medical Science, The University of Tokyo, Tokyo, Japan
- Research Center for Asian Infectious Diseases, The Institute of Medical Science, The University of Tokyo, Tokyo, Japan
- The University of Tokyo, Pandemic Preparedness, Infection and Advanced Research Center, Tokyo, Japan
| |
Collapse
|
8
|
Inamo J, Suzuki A, Ueda MT, Yamaguchi K, Nishida H, Suzuki K, Kaneko Y, Takeuchi T, Hatano H, Ishigaki K, Ishihama Y, Yamamoto K, Kochi Y. Long-read sequencing for 29 immune cell subsets reveals disease-linked isoforms. Nat Commun 2024; 15:4285. [PMID: 38806455 PMCID: PMC11133395 DOI: 10.1038/s41467-024-48615-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2022] [Accepted: 05/02/2024] [Indexed: 05/30/2024] Open
Abstract
Alternative splicing events are a major causal mechanism for complex traits, but they have been understudied due to the limitation of short-read sequencing. Here, we generate a full-length isoform annotation of human immune cells from an individual by long-read sequencing for 29 cell subsets. This contains a number of unannotated transcripts and isoforms such as a read-through transcript of TOMM40-APOE in the Alzheimer's disease locus. We profile characteristics of isoforms and show that repetitive elements significantly explain the diversity of unannotated isoforms, providing insight into the human genome evolution. In addition, some of the isoforms are expressed in a cell-type specific manner, whose alternative 3'-UTRs usage contributes to their specificity. Further, we identify disease-associated isoforms by isoform switch analysis and by integration of several quantitative trait loci analyses with genome-wide association study data. Our findings will promote the elucidation of the mechanism of complex diseases via alternative splicing.
Collapse
Affiliation(s)
- Jun Inamo
- Department of Genomic Function and Diversity, Medical Research Institute, Tokyo Medical and Dental University (TMDU), Tokyo, 113-8510, Japan
- Division of Rheumatology, Department of Internal Medicine, Keio University School of Medicine, Tokyo, 160-8582, Japan
| | - Akari Suzuki
- Laboratory for Autoimmune Diseases, RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa, 230-0045, Japan
| | - Mahoko Takahashi Ueda
- Department of Genomic Function and Diversity, Medical Research Institute, Tokyo Medical and Dental University (TMDU), Tokyo, 113-8510, Japan
| | - Kensuke Yamaguchi
- Department of Genomic Function and Diversity, Medical Research Institute, Tokyo Medical and Dental University (TMDU), Tokyo, 113-8510, Japan
- Laboratory for Autoimmune Diseases, RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa, 230-0045, Japan
- Biomedical Engineering Research Innovation Center, Institute of Biomaterials and Bioengineering, Tokyo Medical and Dental University (TMDU), Tokyo, 113-8510, Japan
| | - Hiroshi Nishida
- Department of Molecular Systems Bioanalysis, Graduate School of Pharmaceutical Sciences, Kyoto University, Kyoto, 606-8501, Japan
| | - Katsuya Suzuki
- Division of Rheumatology, Department of Internal Medicine, Keio University School of Medicine, Tokyo, 160-8582, Japan
| | - Yuko Kaneko
- Division of Rheumatology, Department of Internal Medicine, Keio University School of Medicine, Tokyo, 160-8582, Japan
| | - Tsutomu Takeuchi
- Division of Rheumatology, Department of Internal Medicine, Keio University School of Medicine, Tokyo, 160-8582, Japan
- Saitama Medical University, 38 Morohongo, Moroyama, Iruma, Saitama, 350-0495, Japan
| | - Hiroaki Hatano
- Laboratory for Human Immunogenetics, RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa, 230-0045, Japan
| | - Kazuyoshi Ishigaki
- Laboratory for Human Immunogenetics, RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa, 230-0045, Japan
| | - Yasushi Ishihama
- Department of Molecular Systems Bioanalysis, Graduate School of Pharmaceutical Sciences, Kyoto University, Kyoto, 606-8501, Japan
- Laboratory of Proteomics for Drug Discovery, National Institute of Biomedical Innovation, Health and Nutrition, Ibaraki, Osaka, 567-0085, Japan
| | - Kazuhiko Yamamoto
- Laboratory for Autoimmune Diseases, RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa, 230-0045, Japan
| | - Yuta Kochi
- Department of Genomic Function and Diversity, Medical Research Institute, Tokyo Medical and Dental University (TMDU), Tokyo, 113-8510, Japan.
- Laboratory for Autoimmune Diseases, RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa, 230-0045, Japan.
| |
Collapse
|
9
|
Rocha AL, Pai V, Perkins G, Chang T, Ma J, De Souza EV, Chu Q, Vaughan JM, Diedrich JK, Ellisman MH, Saghatelian A. An Inner Mitochondrial Membrane Microprotein from the SLC35A4 Upstream ORF Regulates Cellular Metabolism. J Mol Biol 2024; 436:168559. [PMID: 38580077 DOI: 10.1016/j.jmb.2024.168559] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2024] [Revised: 03/29/2024] [Accepted: 03/31/2024] [Indexed: 04/07/2024]
Abstract
Upstream open reading frames (uORFs) are cis-acting elements that can dynamically regulate the translation of downstream ORFs by suppressing downstream translation under basal conditions and, in some cases, increasing downstream translation under stress conditions. Computational and empirical methods have identified uORFs in the 5'-UTRs of approximately half of all mouse and human transcripts, making uORFs one of the largest regulatory elements known. Because the prevailing dogma was that eukaryotic mRNAs produce a single functional protein, the peptides and small proteins, or microproteins, encoded by uORFs were rarely studied. We hypothesized that a uORF in the SLC35A4 mRNA is producing a functional microprotein (SLC35A4-MP) because of its conserved amino acid sequence. Through a series of biochemical and cellular experiments, we find that the 103-amino acid SLC35A4-MP is a single-pass transmembrane inner mitochondrial membrane (IMM) microprotein. The IMM contains the protein machinery crucial for cellular respiration and ATP generation, and loss of function studies with SLC35A4-MP significantly diminish maximal cellular respiration, indicating a vital role for this microprotein in cellular metabolism. The findings add SLC35A4-MP to the growing list of functional microproteins and, more generally, indicate that uORFs that encode conserved microproteins are an untapped reservoir of functional microproteins.
Collapse
Affiliation(s)
- Andréa L Rocha
- Clayton Foundation Laboratories for Peptide Biology, Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Victor Pai
- Clayton Foundation Laboratories for Peptide Biology, Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Guy Perkins
- National Center for Microscopy and Imaging Research, Center for Research in Biological Systems, Department of Neurosciences, School of Medicine, University of California San Diego, La Jolla, CA, USA
| | - Tina Chang
- Clayton Foundation Laboratories for Peptide Biology, Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Jiao Ma
- Clayton Foundation Laboratories for Peptide Biology, Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Eduardo V De Souza
- Clayton Foundation Laboratories for Peptide Biology, Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Qian Chu
- Clayton Foundation Laboratories for Peptide Biology, Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Joan M Vaughan
- Clayton Foundation Laboratories for Peptide Biology, Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Jolene K Diedrich
- Mass Spectrometry Core for Proteomics and Metabolomics, The Salk Institute for Biological Studies, 10010 North Torrey Pines Road, La Jolla, CA, USA
| | - Mark H Ellisman
- National Center for Microscopy and Imaging Research, Center for Research in Biological Systems, Department of Neurosciences, School of Medicine, University of California San Diego, La Jolla, CA, USA.
| | - Alan Saghatelian
- Clayton Foundation Laboratories for Peptide Biology, Salk Institute for Biological Studies, La Jolla, CA, USA.
| |
Collapse
|
10
|
Nichols C, Do-Thi VA, Peltier DC. Noncanonical microprotein regulation of immunity. Mol Ther 2024:S1525-0016(24)00324-1. [PMID: 38734902 DOI: 10.1016/j.ymthe.2024.05.021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2024] [Revised: 04/19/2024] [Accepted: 05/09/2024] [Indexed: 05/13/2024] Open
Abstract
The immune system is highly regulated but, when dysregulated, suboptimal protective or overly robust immune responses can lead to immune-mediated disorders. The genetic and molecular mechanisms of immune regulation are incompletely understood, impeding the development of more precise diagnostics and therapeutics for immune-mediated disorders. Recently, thousands of previously unrecognized noncanonical microprotein genes encoded by small open reading frames have been identified. Many of these microproteins perform critical functions, often in a cell- and context-specific manner. Several microproteins are now known to regulate immunity; however, the vast majority are uncharacterized. Therefore, illuminating what is often referred to as the "dark proteome," may present opportunities to tune immune responses more precisely. Here, we review noncanonical microprotein biology, highlight recently discovered examples regulating immunity, and discuss the potential and challenges of modulating dysregulated immune responses by targeting microproteins.
Collapse
Affiliation(s)
- Cydney Nichols
- Morris Green Scholars Program, Department of Pediatrics, Riley Hospital for Children, Indiana University School of Medicine, Indianapolis, IN 46202, USA
| | - Van Anh Do-Thi
- Division of Pediatric Hematology and Oncology, Department of Pediatrics, Herman B. Wells Center for Pediatric Research, Indiana University School of Medicine, Indianapolis, IN 46202, USA
| | - Daniel C Peltier
- Division of Pediatric Hematology and Oncology, Department of Pediatrics, Herman B. Wells Center for Pediatric Research, Indiana University School of Medicine, Indianapolis, IN 46202, USA; Simon Cancer Center, Indiana University School of Medicine, Indianapolis, IN 46202, USA.
| |
Collapse
|
11
|
Yang Q, Wang J, Chen Z. Conditional splicing system for tight control of viral overlapping genes. J Virol 2024; 98:e0024224. [PMID: 38446633 PMCID: PMC11019872 DOI: 10.1128/jvi.00242-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2024] [Accepted: 02/11/2024] [Indexed: 03/08/2024] Open
Abstract
Viral genomes frequently harbor overlapping genes, complicating the development of virus-vectored vaccines and gene therapies. This study introduces a novel conditional splicing system to precisely control the expression of such overlapping genes through recombinase-mediated conditional splicing. We refined site-specific recombinase (SSR) conditional splicing systems and explored their mechanisms. The systems demonstrated exceptional inducibility (116,700-fold increase) with negligible background expression, facilitating the conditional expression of overlapping genes in adenovirus-associated virus (AAV) and human immunodeficiency virus type 1. Notably, this approach enabled the establishment of stable AAV producer cell lines, encapsulating all necessary packaging genes. Our findings underscore the potential of the SSR-conditional splicing system to significantly advance vector engineering, enhancing the efficacy and scalability of viral-vector-based therapies and vaccines. IMPORTANCE Regulating overlapping genes is vital for gene therapy and vaccine development using viral vectors. The regulation of overlapping genes presents challenges, including cytotoxicity and impacts on vector capacity and genome stability, which restrict stable packaging cell line development and broad application. To address these challenges, we present a "loxp-splice-loxp"-based conditional splicing system, offering a novel solution for conditional expression of overlapping genes and stable cell line establishment. This system may also regulate other cytotoxic genes, representing a significant advancement in cell engineering and gene therapy as well as biomass production.
Collapse
Affiliation(s)
- Qing Yang
- AIDS Institute and Department of Microbiology, School of Clinical Medicine, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong Special Administrative Region, People's Republic of China
| | - Jinlin Wang
- AIDS Institute and Department of Microbiology, School of Clinical Medicine, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong Special Administrative Region, People's Republic of China
| | - Zhiwei Chen
- AIDS Institute and Department of Microbiology, School of Clinical Medicine, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong Special Administrative Region, People's Republic of China
- State Key Laboratory of Emerging Infectious Diseases, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong Special Administrative Region, People's Republic of China
| |
Collapse
|
12
|
Qin Z, Yang J, Zhang K, Gao X, Ran Q, Xu Y, Wang Z, Lou D, Huang C, Zellmer L, Meng G, Chen N, Ma H, Wang Z, Liao DJ. Updating mRNA variants of the human RSK4 gene and their expression in different stressed situations. Heliyon 2024; 10:e27475. [PMID: 38560189 PMCID: PMC10980951 DOI: 10.1016/j.heliyon.2024.e27475] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2024] [Revised: 02/11/2024] [Accepted: 02/29/2024] [Indexed: 04/04/2024] Open
Abstract
We determined RNA spectrum of the human RSK4 (hRSK4) gene (also called RPS6KA6) and identified 29 novel mRNA variants derived from alternative splicing, which, plus the NCBI-documented ones and the five we reported previously, totaled 50 hRSK4 RNAs that, by our bioinformatics analyses, encode 35 hRSK4 protein isoforms of 35-762 amino acids. Many of the mRNAs are bicistronic or tricistronic for hRSK4. The NCBI-normalized NM_014496.5 and the protein it encodes are designated herein as the Wt-1 mRNA and protein, respectively, whereas the NM_001330512.1 and the long protein it encodes are designated as the Wt-2 mRNA and protein, respectively. Many of the mRNA variants responded differently to different situations of stress, including serum starvation, a febrile temperature, treatment with ethanol or ethanol-extracted clove buds (an herbal medicine), whereas the same stressed situation often caused quite different alterations among different mRNA variants in different cell lines. Mosifloxacin, an antibiotics and also a functional inhibitor of hRSK4, could inhibit the expression of certain hRSK4 mRNA variants. The hRSK4 gene likely uses alternative splicing as a handy tool to adapt to different stressed situations, and the mRNA and protein multiplicities may partly explain the incongruous literature on its expression and comports.
Collapse
Affiliation(s)
- Zhenwei Qin
- Section of Forensic Science and Pathology, School of Basic Medical Sciences, Guizhou University of Traditional Chinese Medicine, Dong-Qing-Nan Road, Guiyang, 550025, Guizhou Province, China
| | - Jianglin Yang
- Center for Clinical Laboratories, The Affiliated Hospital of Guizhou Medical University, 4 Beijing Rd, Guiyang, 550004, Guizhou Province, China
- Key Lab of Endemic and Ethnic Diseases of the Ministry of Education of China in Guizhou Medical University, Guiyang, 550004, Guizhou Province, China
| | - Keyin Zhang
- Department of Pathology, The Affiliated Hospital of Guizhou Medical University, 4 Beijing Road, Guiyang, 550004, Guizhou Province, China
| | - Xia Gao
- Department of Pathology, The Affiliated Hospital of Guizhou Medical University, 4 Beijing Road, Guiyang, 550004, Guizhou Province, China
| | - Qianchuan Ran
- Section of Forensic Science and Pathology, School of Basic Medical Sciences, Guizhou University of Traditional Chinese Medicine, Dong-Qing-Nan Road, Guiyang, 550025, Guizhou Province, China
| | - Yuanhong Xu
- Section of Forensic Science and Pathology, School of Basic Medical Sciences, Guizhou University of Traditional Chinese Medicine, Dong-Qing-Nan Road, Guiyang, 550025, Guizhou Province, China
| | - Zhi Wang
- Department of Pathology, The Affiliated Hospital of Guizhou Medical University, 4 Beijing Road, Guiyang, 550004, Guizhou Province, China
| | - Didong Lou
- Section of Forensic Science and Pathology, School of Basic Medical Sciences, Guizhou University of Traditional Chinese Medicine, Dong-Qing-Nan Road, Guiyang, 550025, Guizhou Province, China
| | - Chunhua Huang
- Section of Forensic Science and Pathology, School of Basic Medical Sciences, Guizhou University of Traditional Chinese Medicine, Dong-Qing-Nan Road, Guiyang, 550025, Guizhou Province, China
| | - Lucas Zellmer
- Department of Medicine, Hennepin County Medical Center, 730 South 8th St., Minneapolis, MN, 55415, USA
| | - Guangxue Meng
- Department of Oral and Maxillofacial Surgery, School of Stomatology, Guizhou Medical University, 9 Beijing Road, Guiyang, 550004, Guizhou Province, China
| | - Na Chen
- Department of Oral and Maxillofacial Surgery, School of Stomatology, Guizhou Medical University, 9 Beijing Road, Guiyang, 550004, Guizhou Province, China
| | - Hong Ma
- Department of Oral and Maxillofacial Surgery, School of Stomatology, Guizhou Medical University, 9 Beijing Road, Guiyang, 550004, Guizhou Province, China
| | - Zhe Wang
- State Key Laboratory of Cancer Biology, Department of Pathology, Xijing Hospital, Air Force Medical University, 169 Changle West Road, Xi'an, 710032, China
| | - Dezhong Joshua Liao
- Center for Clinical Laboratories, The Affiliated Hospital of Guizhou Medical University, 4 Beijing Rd, Guiyang, 550004, Guizhou Province, China
- Key Lab of Endemic and Ethnic Diseases of the Ministry of Education of China in Guizhou Medical University, Guiyang, 550004, Guizhou Province, China
| |
Collapse
|
13
|
Ren G, Gu X, Zhang L, Gong S, Song S, Chen S, Chen Z, Wang X, Li Z, Zhou Y, Li L, Yang J, Lai F, Dang Y. Ribosomal frameshifting at normal codon repeats recodes functional chimeric proteins in human. Nucleic Acids Res 2024; 52:2463-2479. [PMID: 38281188 PMCID: PMC10954444 DOI: 10.1093/nar/gkae035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2023] [Revised: 01/04/2024] [Accepted: 01/10/2024] [Indexed: 01/30/2024] Open
Abstract
Ribosomal frameshifting refers to the process that ribosomes slip into +1 or -1 reading frame, thus produce chimeric trans-frame proteins. In viruses and bacteria, programmed ribosomal frameshifting can produce essential trans-frame proteins for viral replication or regulation of other biological processes. In humans, however, functional trans-frame protein derived from ribosomal frameshifting is scarcely documented. Combining multiple assays, we show that short codon repeats could act as cis-acting elements that stimulate ribosomal frameshifting in humans, abbreviated as CRFS hereafter. Using proteomic analyses, we identified many putative CRFS events from 32 normal human tissues supported by trans-frame peptides positioned at codon repeats. Finally, we show a CRFS-derived trans-frame protein (HDAC1-FS) functions by antagonizing the activities of HDAC1, thus affecting cell migration and apoptosis. These data suggest a novel type of translational recoding associated with codon repeats, which may expand the coding capacity of mRNA and diversify the regulation in human.
Collapse
Affiliation(s)
- Guiping Ren
- State Key Laboratory for Conservation and Utilization of Bio-Resource in Yunnan, Key Laboratory for Southwest Microbial Diversity of the Ministry of Education, Yunnan Key Laboratory of Cell Metabolism and Diseases, Center for Life Science, School of Life Sciences, Yunnan University, Kunming 650021, China
- Southwest United Graduate School, Kunming650092, China
| | - Xiaoqian Gu
- State Key Laboratory for Conservation and Utilization of Bio-Resource in Yunnan, Key Laboratory for Southwest Microbial Diversity of the Ministry of Education, Yunnan Key Laboratory of Cell Metabolism and Diseases, Center for Life Science, School of Life Sciences, Yunnan University, Kunming 650021, China
- Southwest United Graduate School, Kunming650092, China
| | - Lu Zhang
- State Key Laboratory for Conservation and Utilization of Bio-Resource in Yunnan, Key Laboratory for Southwest Microbial Diversity of the Ministry of Education, Yunnan Key Laboratory of Cell Metabolism and Diseases, Center for Life Science, School of Life Sciences, Yunnan University, Kunming 650021, China
- Southwest United Graduate School, Kunming650092, China
| | - Shimin Gong
- State Key Laboratory for Conservation and Utilization of Bio-Resource in Yunnan, Key Laboratory for Southwest Microbial Diversity of the Ministry of Education, Yunnan Key Laboratory of Cell Metabolism and Diseases, Center for Life Science, School of Life Sciences, Yunnan University, Kunming 650021, China
- Southwest United Graduate School, Kunming650092, China
| | - Shuang Song
- State Key Laboratory for Conservation and Utilization of Bio-Resource in Yunnan, Key Laboratory for Southwest Microbial Diversity of the Ministry of Education, Yunnan Key Laboratory of Cell Metabolism and Diseases, Center for Life Science, School of Life Sciences, Yunnan University, Kunming 650021, China
- Southwest United Graduate School, Kunming650092, China
| | - Shunkai Chen
- State Key Laboratory for Conservation and Utilization of Bio-Resource in Yunnan, Key Laboratory for Southwest Microbial Diversity of the Ministry of Education, Yunnan Key Laboratory of Cell Metabolism and Diseases, Center for Life Science, School of Life Sciences, Yunnan University, Kunming 650021, China
- Southwest United Graduate School, Kunming650092, China
| | - Zhenjing Chen
- State Key Laboratory for Conservation and Utilization of Bio-Resource in Yunnan, Key Laboratory for Southwest Microbial Diversity of the Ministry of Education, Yunnan Key Laboratory of Cell Metabolism and Diseases, Center for Life Science, School of Life Sciences, Yunnan University, Kunming 650021, China
- Southwest United Graduate School, Kunming650092, China
| | - Xiaoyan Wang
- State Key Laboratory for Conservation and Utilization of Bio-Resource in Yunnan, Key Laboratory for Southwest Microbial Diversity of the Ministry of Education, Yunnan Key Laboratory of Cell Metabolism and Diseases, Center for Life Science, School of Life Sciences, Yunnan University, Kunming 650021, China
- Southwest United Graduate School, Kunming650092, China
| | - Zhanbiao Li
- State Key Laboratory for Conservation and Utilization of Bio-Resource in Yunnan, Key Laboratory for Southwest Microbial Diversity of the Ministry of Education, Yunnan Key Laboratory of Cell Metabolism and Diseases, Center for Life Science, School of Life Sciences, Yunnan University, Kunming 650021, China
- Southwest United Graduate School, Kunming650092, China
| | - Yingshui Zhou
- State Key Laboratory for Conservation and Utilization of Bio-Resource in Yunnan, Key Laboratory for Southwest Microbial Diversity of the Ministry of Education, Yunnan Key Laboratory of Cell Metabolism and Diseases, Center for Life Science, School of Life Sciences, Yunnan University, Kunming 650021, China
- Southwest United Graduate School, Kunming650092, China
| | - Longxi Li
- State Key Laboratory for Conservation and Utilization of Bio-Resource in Yunnan, Key Laboratory for Southwest Microbial Diversity of the Ministry of Education, Yunnan Key Laboratory of Cell Metabolism and Diseases, Center for Life Science, School of Life Sciences, Yunnan University, Kunming 650021, China
- Southwest United Graduate School, Kunming650092, China
| | - Jiao Yang
- State Key Laboratory for Conservation and Utilization of Bio-Resource in Yunnan, Key Laboratory for Southwest Microbial Diversity of the Ministry of Education, Yunnan Key Laboratory of Cell Metabolism and Diseases, Center for Life Science, School of Life Sciences, Yunnan University, Kunming 650021, China
- Southwest United Graduate School, Kunming650092, China
| | - Fan Lai
- State Key Laboratory for Conservation and Utilization of Bio-Resource in Yunnan, Key Laboratory for Southwest Microbial Diversity of the Ministry of Education, Yunnan Key Laboratory of Cell Metabolism and Diseases, Center for Life Science, School of Life Sciences, Yunnan University, Kunming 650021, China
- Southwest United Graduate School, Kunming650092, China
| | - Yunkun Dang
- State Key Laboratory for Conservation and Utilization of Bio-Resource in Yunnan, Key Laboratory for Southwest Microbial Diversity of the Ministry of Education, Yunnan Key Laboratory of Cell Metabolism and Diseases, Center for Life Science, School of Life Sciences, Yunnan University, Kunming 650021, China
- Southwest United Graduate School, Kunming650092, China
| |
Collapse
|
14
|
Fleck K, Luria V, Garag N, Karger A, Hunter T, Marten D, Phu W, Nam KM, Sestan N, O’Donnell-Luria AH, Erceg J. Functional associations of evolutionarily recent human genes exhibit sensitivity to the 3D genome landscape and disease. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.03.17.585403. [PMID: 38559085 PMCID: PMC10980080 DOI: 10.1101/2024.03.17.585403] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 04/04/2024]
Abstract
Genome organization is intricately tied to regulating genes and associated cell fate decisions. In this study, we examine the positioning and functional significance of human genes, grouped by their evolutionary age, within the 3D organization of the genome. We reveal that genes of different evolutionary origin have distinct positioning relationships with both domains and loop anchors, and remarkably consistent relationships with boundaries across cell types. While the functional associations of each group of genes are primarily cell type-specific, such associations of conserved genes maintain greater stability across 3D genomic features and disease than recently evolved genes. Furthermore, the expression of these genes across various tissues follows an evolutionary progression, such that RNA levels increase from young genes to ancient genes. Thus, the distinct relationships of gene evolutionary age, function, and positioning within 3D genomic features contribute to tissue-specific gene regulation in development and disease.
Collapse
Affiliation(s)
- Katherine Fleck
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT 06269
- Institute for Systems Genomics, University of Connecticut, Storrs, CT 06269
| | - Victor Luria
- Department of Neuroscience, Yale School of Medicine, New Haven, CT 06510
- Division of Genetics and Genomics, Boston Children’s Hospital, Boston, MA 02115
- Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA 02142
- Department of Systems Biology, Harvard Medical School, Boston, MA 02115
| | - Nitanta Garag
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT 06269
| | - Amir Karger
- IT-Research Computing, Harvard Medical School, Boston, MA 02115
| | - Trevor Hunter
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT 06269
| | - Daniel Marten
- Division of Genetics and Genomics, Boston Children’s Hospital, Boston, MA 02115
- Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA 02142
| | - William Phu
- Division of Genetics and Genomics, Boston Children’s Hospital, Boston, MA 02115
- Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA 02142
| | - Kee-Myoung Nam
- Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, CT 06510
| | - Nenad Sestan
- Department of Neuroscience, Yale School of Medicine, New Haven, CT 06510
| | - Anne H. O’Donnell-Luria
- Division of Genetics and Genomics, Boston Children’s Hospital, Boston, MA 02115
- Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA 02142
- Department of Pediatrics, Harvard Medical School, Boston, MA 02115
| | - Jelena Erceg
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT 06269
- Institute for Systems Genomics, University of Connecticut, Storrs, CT 06269
- Department of Genetics and Genome Sciences, University of Connecticut Health Center, Farmington, CT 06030
| |
Collapse
|
15
|
Yang H, Li Q, Stroup EK, Wang S, Ji Z. Widespread stable noncanonical peptides identified by integrated analyses of ribosome profiling and ORF features. Nat Commun 2024; 15:1932. [PMID: 38431639 PMCID: PMC10908861 DOI: 10.1038/s41467-024-46240-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2023] [Accepted: 02/18/2024] [Indexed: 03/05/2024] Open
Abstract
Studies have revealed dozens of functional peptides in putative 'noncoding' regions and raised the question of how many proteins are encoded by noncanonical open reading frames (ORFs). Here, we comprehensively annotate genome-wide translated ORFs across five eukaryotes (human, mouse, zebrafish, worm, and yeast) by analyzing ribosome profiling data. We develop a logistic regression model named PepScore based on ORF features (expected length, encoded domain, and conservation) to calculate the probability that the encoded peptide is stable in humans. Systematic ectopic expression validates PepScore and shows that stable complex-associating microproteins can be encoded in 5'/3' untranslated regions and overlapping coding regions of mRNAs besides annotated noncoding RNAs. Stable noncanonical proteins follow conventional rules and localize to different subcellular compartments. Inhibition of proteasomal/lysosomal degradation pathways can stabilize some peptides especially those with moderate PepScores, but cannot rescue the expression of short ones with low PepScores suggesting they are directly degraded by cellular proteases. The majority of human noncanonical peptides with high PepScores show longer lengths but low conservation across species/mammals, and hundreds contain trait-associated genetic variants. Our study presents a statistical framework to identify stable noncanonical peptides in the genome and provides a valuable resource for functional characterization of noncanonical translation during development and disease.
Collapse
Affiliation(s)
- Haiwang Yang
- Department of Pharmacology, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA
| | - Qianru Li
- Department of Pharmacology, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA
| | - Emily K Stroup
- Department of Pharmacology, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA
| | - Sheng Wang
- Department of Biomedical Engineering, McCormick School of Engineering, Northwestern University, Evanston, IL, 60628, USA
| | - Zhe Ji
- Department of Pharmacology, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA.
- Department of Biomedical Engineering, McCormick School of Engineering, Northwestern University, Evanston, IL, 60628, USA.
| |
Collapse
|
16
|
Valdivia-Francia F, Sendoel A. No country for old methods: New tools for studying microproteins. iScience 2024; 27:108972. [PMID: 38333695 PMCID: PMC10850755 DOI: 10.1016/j.isci.2024.108972] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/10/2024] Open
Abstract
Microproteins encoded by small open reading frames (sORFs) have emerged as a fascinating frontier in genomics. Traditionally overlooked due to their small size, recent technological advancements such as ribosome profiling, mass spectrometry-based strategies and advanced computational approaches have led to the annotation of more than 7000 sORFs in the human genome. Despite the vast progress, only a tiny portion of these microproteins have been characterized and an important challenge in the field lies in identifying functionally relevant microproteins and understanding their role in different cellular contexts. In this review, we explore the recent advancements in sORF research, focusing on the new methodologies and computational approaches that have facilitated their identification and functional characterization. Leveraging these new tools hold great promise for dissecting the diverse cellular roles of microproteins and will ultimately pave the way for understanding their role in the pathogenesis of diseases and identifying new therapeutic targets.
Collapse
Affiliation(s)
- Fabiola Valdivia-Francia
- University of Zurich, Institute for Regenerative Medicine (IREM), Wagistrasse 12, 8952 Schlieren-Zurich, Switzerland
- Life Science Zurich Graduate School, Molecular Life Science Program, University of Zurich/ ETH Zurich, Schlieren-Zurich, Switzerland
| | - Ataman Sendoel
- University of Zurich, Institute for Regenerative Medicine (IREM), Wagistrasse 12, 8952 Schlieren-Zurich, Switzerland
| |
Collapse
|
17
|
Mao Y, Qian SB. Making sense of mRNA translational "noise". Semin Cell Dev Biol 2024; 154:114-122. [PMID: 36925447 PMCID: PMC10500040 DOI: 10.1016/j.semcdb.2023.03.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2022] [Revised: 03/09/2023] [Accepted: 03/09/2023] [Indexed: 03/15/2023]
Abstract
The importance of translation fidelity has been apparent since the discovery of genetic code. It is commonly believed that translation deviating from the main coding region is to be avoided at all times inside cells. However, ribosome profiling and mass spectrometry have revealed pervasive noncanonical translation. Both the scope and origin of translational "noise" are just beginning to be appreciated. Although largely overlooked, those translational "noises" are associated with a wide range of cellular functions, such as producing unannotated protein products. Furthermore, the dynamic nature of translational "noise" is responsive to stress conditions, highlighting the beneficial effect of translational "noise" in stress adaptation. Mechanistic investigation of translational "noise" will provide better insight into the mechanisms of translational regulation. Ultimately, they are not "noise" at all but represent a signature of cellular activities under pathophysiological conditions. Deciphering translational "noise" holds the therapeutic and diagnostic potential in a wide spectrum of human diseases.
Collapse
Affiliation(s)
- Yuanhui Mao
- Division of Nutritional Sciences, Cornell University, Ithaca, NY 14853, USA
| | - Shu-Bing Qian
- Division of Nutritional Sciences, Cornell University, Ithaca, NY 14853, USA.
| |
Collapse
|
18
|
Zhang D, Gao Y, Zhu L, Wang Y, Li P. Advances and opportunities in methods to study protein translation - A review. Int J Biol Macromol 2024; 259:129150. [PMID: 38171441 DOI: 10.1016/j.ijbiomac.2023.129150] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2023] [Revised: 12/27/2023] [Accepted: 12/28/2023] [Indexed: 01/05/2024]
Abstract
It is generally believed that the regulation of gene expression involves protein translation occurring before RNA transcription. Therefore, it is crucial to investigate protein translation and its regulation. Recent advancements in biological sciences, particularly in the field of omics, have revolutionized protein translation research. These studies not only help characterize changes in protein translation during specific biological or pathological processes but also have significant implications in disease prevention and treatment. In this review, we summarize the latest methods in ribosome-based translation omics. We specifically focus on the application of fluorescence imaging technology and omics technology in studying overall protein translation. Additionally, we analyze the advantages, disadvantages, and application of these experimental methods, aiming to provide valuable insights and references to researchers studying translation.
Collapse
Affiliation(s)
- Dejiu Zhang
- Institute for Translational Medicine, The Affiliated Hospital of Qingdao University, College of Medicine, Qingdao University, Qingdao, China
| | - Yanyan Gao
- Institute for Translational Medicine, The Affiliated Hospital of Qingdao University, College of Medicine, Qingdao University, Qingdao, China
| | - Lei Zhu
- College of Basic Medical, Qingdao Binhai University, Qingdao, China
| | - Yin Wang
- Institute for Translational Medicine, The Affiliated Hospital of Qingdao University, College of Medicine, Qingdao University, Qingdao, China.
| | - Peifeng Li
- Institute for Translational Medicine, The Affiliated Hospital of Qingdao University, College of Medicine, Qingdao University, Qingdao, China.
| |
Collapse
|
19
|
Hofman DA, Ruiz-Orera J, Yannuzzi I, Murugesan R, Brown A, Clauser KR, Condurat AL, van Dinter JT, Engels SAG, Goodale A, van der Lugt J, Abid T, Wang L, Zhou KN, Vogelzang J, Ligon KL, Phoenix TN, Roth JA, Root DE, Hubner N, Golub TR, Bandopadhayay P, van Heesch S, Prensner JR. Translation of non-canonical open reading frames as a cancer cell survival mechanism in childhood medulloblastoma. Mol Cell 2024; 84:261-276.e18. [PMID: 38176414 PMCID: PMC10872554 DOI: 10.1016/j.molcel.2023.12.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2023] [Revised: 08/30/2023] [Accepted: 12/01/2023] [Indexed: 01/06/2024]
Abstract
A hallmark of high-risk childhood medulloblastoma is the dysregulation of RNA translation. Currently, it is unknown whether medulloblastoma dysregulates the translation of putatively oncogenic non-canonical open reading frames (ORFs). To address this question, we performed ribosome profiling of 32 medulloblastoma tissues and cell lines and observed widespread non-canonical ORF translation. We then developed a stepwise approach using multiple CRISPR-Cas9 screens to elucidate non-canonical ORFs and putative microproteins implicated in medulloblastoma cell survival. We determined that multiple lncRNA-ORFs and upstream ORFs (uORFs) exhibited selective functionality independent of main coding sequences. A microprotein encoded by one of these ORFs, ASNSD1-uORF or ASDURF, was upregulated, associated with MYC-family oncogenes, and promoted medulloblastoma cell survival through engagement with the prefoldin-like chaperone complex. Our findings underscore the fundamental importance of non-canonical ORF translation in medulloblastoma and provide a rationale to include these ORFs in future studies seeking to define new cancer targets.
Collapse
Affiliation(s)
- Damon A Hofman
- Princess Máxima Center for Pediatric Oncology, Heidelberglaan 25, 3584 CS Utrecht, the Netherlands
| | - Jorge Ruiz-Orera
- Cardiovascular and Metabolic Sciences, Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany
| | - Ian Yannuzzi
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | | | - Adam Brown
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Karl R Clauser
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Alexandra L Condurat
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA; Department of Pediatric Oncology, Dana-Farber Cancer Institute, Boston, MA 02215, USA
| | - Jip T van Dinter
- Princess Máxima Center for Pediatric Oncology, Heidelberglaan 25, 3584 CS Utrecht, the Netherlands
| | - Sem A G Engels
- Princess Máxima Center for Pediatric Oncology, Heidelberglaan 25, 3584 CS Utrecht, the Netherlands
| | - Amy Goodale
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Jasper van der Lugt
- Princess Máxima Center for Pediatric Oncology, Heidelberglaan 25, 3584 CS Utrecht, the Netherlands
| | - Tanaz Abid
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Li Wang
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Kevin N Zhou
- Department of Pediatric Oncology, Dana-Farber Cancer Institute, Boston, MA 02215, USA
| | - Jayne Vogelzang
- Department of Pathology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA 02215, USA; Department of Pathology, Brigham and Women's Hospital, Boston, MA 02215, USA
| | - Keith L Ligon
- Department of Pathology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA 02215, USA; Department of Pathology, Brigham and Women's Hospital, Boston, MA 02215, USA; Department of Pathology, Boston Children's Hospital, Boston MA 02115, USA
| | - Timothy N Phoenix
- Division of Pharmaceutical Sciences, James L. Winkle College of Pharmacy, University of Cincinnati, Cincinnati, OH 45229, USA
| | - Jennifer A Roth
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - David E Root
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Norbert Hubner
- Cardiovascular and Metabolic Sciences, Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany; Charité-Universitätsmedizin, 10117 Berlin, Germany; German Centre for Cardiovascular Research, Partner Site Berlin, 13347 Berlin, Germany
| | - Todd R Golub
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA; Department of Pediatric Oncology, Dana-Farber Cancer Institute, Boston, MA 02215, USA; Division of Pediatric Hematology/Oncology, Boston Children's Hospital, Boston, MA 02115, USA
| | - Pratiti Bandopadhayay
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA; Department of Pediatric Oncology, Dana-Farber Cancer Institute, Boston, MA 02215, USA; Division of Pediatric Hematology/Oncology, Boston Children's Hospital, Boston, MA 02115, USA
| | - Sebastiaan van Heesch
- Princess Máxima Center for Pediatric Oncology, Heidelberglaan 25, 3584 CS Utrecht, the Netherlands.
| | - John R Prensner
- Department of Pediatrics, Division of Pediatric Hematology/Oncology and Biological Chemistry, University of Michigan Medical School, Ann Arbor, MI 48109, USA.
| |
Collapse
|
20
|
Huang Y, Huo Y, Huang L, Zhang L, Zheng Y, Zhang N, Yang M. Super-enhancers: Implications in gastric cancer. MUTATION RESEARCH. REVIEWS IN MUTATION RESEARCH 2024; 793:108489. [PMID: 38355091 DOI: 10.1016/j.mrrev.2024.108489] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Revised: 02/01/2024] [Accepted: 02/05/2024] [Indexed: 02/16/2024]
Abstract
Gastric cancer (GC) is the fifth most prevalent malignancy and the third leading cause of cancer-related mortality globally. Despite intensive efforts to enhance the efficiencies of various therapeutics (chemotherapy, surgical interventions, molecular-targeted therapies, immunotherapies), the prognosis for patients with GC remains poor. This might be predominantly due to the limited understanding of the complicated etiology of GC. Importantly, epigenetic modifications and alterations are crucial during GC development. Super-enhancers (SEs) are a large cluster of adjacent enhancers that greatly activate transcription. SEs sustain cell-specific identity by enhancing the transcription of specific oncogenes. In this review, we systematically summarize how SEs are involved in GC development, including the SE landscape in GC, the SE target genes in GC, and the interventions related to SE functions for treating GC.
Collapse
Affiliation(s)
- Yizhou Huang
- Shandong Provincial Key Laboratory of Radiation Oncology, Cancer Research Center, Shandong Cancer Hospital and Institute, Shandong First Medical University and Shandong Academy of Medical Sciences, Jinan 250117, Shandong Province, China
| | - Yanfei Huo
- Shandong Provincial Key Laboratory of Radiation Oncology, Cancer Research Center, Shandong Cancer Hospital and Institute, Shandong First Medical University and Shandong Academy of Medical Sciences, Jinan 250117, Shandong Province, China
| | - Linying Huang
- Shandong Provincial Key Laboratory of Radiation Oncology, Cancer Research Center, Shandong Cancer Hospital and Institute, Shandong First Medical University and Shandong Academy of Medical Sciences, Jinan 250117, Shandong Province, China
| | - Long Zhang
- Shandong Provincial Key Laboratory of Radiation Oncology, Cancer Research Center, Shandong Cancer Hospital and Institute, Shandong First Medical University and Shandong Academy of Medical Sciences, Jinan 250117, Shandong Province, China
| | - Yanxiu Zheng
- Shandong Provincial Key Laboratory of Radiation Oncology, Cancer Research Center, Shandong Cancer Hospital and Institute, Shandong First Medical University and Shandong Academy of Medical Sciences, Jinan 250117, Shandong Province, China
| | - Nasha Zhang
- Department of Radiation Oncology, Shandong Cancer Hospital and Institute, Shandong First Medical University and Shandong Academy of Medical Sciences, Jinan, Shandong Province, China; Jiangsu Key Lab of Cancer Biomarkers, Prevention and Treatment, Collaborative Innovation Center for Cancer Personalized Medicine, Nanjing Medical University, Nanjing, Jiangsu Province, China.
| | - Ming Yang
- Shandong Provincial Key Laboratory of Radiation Oncology, Cancer Research Center, Shandong Cancer Hospital and Institute, Shandong First Medical University and Shandong Academy of Medical Sciences, Jinan 250117, Shandong Province, China; Jiangsu Key Lab of Cancer Biomarkers, Prevention and Treatment, Collaborative Innovation Center for Cancer Personalized Medicine, Nanjing Medical University, Nanjing, Jiangsu Province, China.
| |
Collapse
|
21
|
Kore H, Datta KK, Nagaraj SH, Gowda H. Protein-coding potential of non-canonical open reading frames in human transcriptome. Biochem Biophys Res Commun 2023; 684:149040. [PMID: 37897910 DOI: 10.1016/j.bbrc.2023.09.068] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2023] [Revised: 09/09/2023] [Accepted: 09/23/2023] [Indexed: 10/30/2023]
Abstract
In recent years, proteogenomics and ribosome profiling studies have identified a large number of proteins encoded by noncoding regions in the human genome. They are encoded by small open reading frames (sORFs) in the untranslated regions (UTRs) of mRNAs and long non-coding RNAs (lncRNAs). These sORF encoded proteins (SEPs) are often <150AA and show poor evolutionary conservation. A subset of them have been functionally characterized and shown to play an important role in fundamental biological processes including cardiac and muscle function, DNA repair, embryonic development and various human diseases. How many novel protein-coding regions exist in the human genome and what fraction of them are functionally important remains a mystery. In this review, we discuss current progress in unraveling SEPs, approaches used for their identification, their limitations and reliability of these identifications. We also discuss functionally characterized SEPs and their involvement in various biological processes and diseases. Lastly, we provide insights into their distinctive features compared to canonical proteins and challenges associated with annotating these in protein reference databases.
Collapse
Affiliation(s)
- Hitesh Kore
- Centre for Genomics and Personalised Health, Queensland University of Technology, Brisbane, Queensland, 4059, Australia; Cancer Precision Medicine Group, QIMR Berghofer Medical Research Institute, 300 Herston Road, Herston, Queensland, 4006, Australia; Faculty of Health, Queensland University of Technology, Brisbane, Queensland, 4059, Australia.
| | - Keshava K Datta
- Proteomics and Metabolomics Platform, La Trobe University, Melbourne, VIC, 3083, Australia
| | - Shivashankar H Nagaraj
- Centre for Genomics and Personalised Health, Queensland University of Technology, Brisbane, Queensland, 4059, Australia; Faculty of Health, Queensland University of Technology, Brisbane, Queensland, 4059, Australia
| | - Harsha Gowda
- Centre for Genomics and Personalised Health, Queensland University of Technology, Brisbane, Queensland, 4059, Australia; Cancer Precision Medicine Group, QIMR Berghofer Medical Research Institute, 300 Herston Road, Herston, Queensland, 4006, Australia; Faculty of Health, Queensland University of Technology, Brisbane, Queensland, 4059, Australia; Faculty of Medicine, The University of Queensland, Queensland, 4072, Australia.
| |
Collapse
|
22
|
Zheng C, Wei Y, Zhang P, Lin K, He D, Teng H, Manyam G, Zhang Z, Liu W, Lee HRL, Tang X, He W, Islam N, Jain A, Chiu Y, Cao S, Diao Y, Meyer-Gauen S, Höök M, Malovannaya A, Li W, Hu M, Wang W, Xu H, Kopetz S, Chen Y. CRISPR-Cas9-based functional interrogation of unconventional translatome reveals human cancer dependency on cryptic non-canonical open reading frames. Nat Struct Mol Biol 2023; 30:1878-1892. [PMID: 37932451 PMCID: PMC10716047 DOI: 10.1038/s41594-023-01117-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2021] [Accepted: 09/06/2023] [Indexed: 11/08/2023]
Abstract
Emerging evidence suggests that cryptic translation beyond the annotated translatome produces proteins with developmental or physiological functions. However, functions of cryptic non-canonical open reading frames (ORFs) in cancer remain largely unknown. To fill this gap and systematically identify colorectal cancer (CRC) dependency on non-canonical ORFs, we apply an integrative multiomic strategy, combining ribosome profiling and a CRISPR-Cas9 knockout screen with large-scale analysis of molecular and clinical data. Many such ORFs are upregulated in CRC compared to normal tissues and are associated with clinically relevant molecular subtypes. We confirm the in vivo tumor-promoting function of the microprotein SMIMP, encoded by a primate-specific, long noncoding RNA, the expression of which is associated with poor prognosis in CRC, is low in normal tissues and is specifically elevated in CRC and several other cancer types. Mechanistically, SMIMP interacts with the ATPase-forming domains of SMC1A, the core subunit of the cohesin complex, and facilitates SMC1A binding to cis-regulatory elements to promote epigenetic repression of the tumor-suppressive cell cycle regulators encoded by CDKN1A and CDKN2B. Thus, our study reveals a cryptic microprotein as an important component of cohesin-mediated gene regulation and suggests that the 'dark' proteome, encoded by cryptic non-canonical ORFs, may contain potential therapeutic or diagnostic targets.
Collapse
Affiliation(s)
- Caishang Zheng
- Department of Bioinformatics and Computational Biology, the University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Yanjun Wei
- Department of Bioinformatics and Computational Biology, the University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Peng Zhang
- Department of Bioinformatics and Computational Biology, the University of Texas MD Anderson Cancer Center, Houston, TX, USA
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China
| | - Kangyu Lin
- Department of Bioinformatics and Computational Biology, the University of Texas MD Anderson Cancer Center, Houston, TX, USA
- Department of Gastrointestinal Medical Oncology, University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Dandan He
- Department of Bioinformatics and Computational Biology, the University of Texas MD Anderson Cancer Center, Houston, TX, USA
- Sema4, Inc., Stamford, CT, USA
| | - Hongqi Teng
- Department of Experimental Radiation Oncology, the University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Ganiraju Manyam
- Department of Bioinformatics and Computational Biology, the University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Zhao Zhang
- Department of Biochemistry and Molecular Biology, McGovern Medical School, the University of Texas Health Science Center at Houston, Houston, TX, USA
- MOE Key Laboratory of Metabolism and Molecular Medicine, Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Fudan University, Shanghai, China
| | - Wen Liu
- Center for Infectious and Inflammatory Diseases, Texas A&M Health Science Center, Institute of Biosciences of Technology, Houston, TX, USA
| | - Hye Rin Lindsay Lee
- Department of Quantitative Health Sciences, Lerner Research Institute, Cleveland Clinic Foundation, Cleveland, OH, USA
| | - Ximing Tang
- Department of Translational Molecular Pathology, the University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Wei He
- Department of Epigenetics and Molecular Carcinogenesis, the University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Nelufa Islam
- Mass Spectrometry Proteomics Core, Baylor College of Medicine, Houston, TX, USA
| | - Antrix Jain
- Mass Spectrometry Proteomics Core, Baylor College of Medicine, Houston, TX, USA
| | - Yulun Chiu
- Department of Melanoma Medical Oncology, the University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Shaolong Cao
- Department of Bioinformatics and Computational Biology, the University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Yarui Diao
- Department of Cell Biology, Duke University Medical Center, Durham, NC, USA
- Duke Regeneration Center, Duke University Medical Center, Durham, NC, USA
- Department of Orthopedic Surgery, Duke University Medical Center, Durham, NC, USA
| | - Sherita Meyer-Gauen
- Department of Translational Molecular Pathology, the University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Magnus Höök
- Center for Infectious and Inflammatory Diseases, Texas A&M Health Science Center, Institute of Biosciences of Technology, Houston, TX, USA
| | - Anna Malovannaya
- Mass Spectrometry Proteomics Core, Baylor College of Medicine, Houston, TX, USA
- Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, Houston, TX, USA
- Department of Molecular and Cellular Biology, Baylor College of Medicine, Houston, TX, USA
- Dan L. Duncan Cancer Center, Baylor College of Medicine, Houston, TX, USA
| | - Wenbo Li
- Department of Biochemistry and Molecular Biology, McGovern Medical School, the University of Texas Health Science Center at Houston, Houston, TX, USA
- Graduate School of Biomedical Sciences, University of Texas MD Anderson Cancer Center and UTHealth, Houston, TX, USA
| | - Ming Hu
- Department of Quantitative Health Sciences, Lerner Research Institute, Cleveland Clinic Foundation, Cleveland, OH, USA
| | - Wenyi Wang
- Department of Bioinformatics and Computational Biology, the University of Texas MD Anderson Cancer Center, Houston, TX, USA
- Quantitative Sciences Program, MD Anderson Cancer Center UTHealth Graduate School of Biomedical Sciences, Houston, TX, USA
| | - Han Xu
- Department of Bioinformatics and Computational Biology, the University of Texas MD Anderson Cancer Center, Houston, TX, USA
- Department of Epigenetics and Molecular Carcinogenesis, the University of Texas MD Anderson Cancer Center, Houston, TX, USA
- Quantitative Sciences Program, MD Anderson Cancer Center UTHealth Graduate School of Biomedical Sciences, Houston, TX, USA
- Genetics and Epigenetics Program, MD Anderson Cancer Center UTHealth Graduate School of Biomedical Sciences, Houston, TX, USA
| | - Scott Kopetz
- Department of Gastrointestinal Medical Oncology, University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Yiwen Chen
- Department of Bioinformatics and Computational Biology, the University of Texas MD Anderson Cancer Center, Houston, TX, USA.
- Quantitative Sciences Program, MD Anderson Cancer Center UTHealth Graduate School of Biomedical Sciences, Houston, TX, USA.
| |
Collapse
|
23
|
Mohsen JJ, Martel AA, Slavoff SA. Microproteins-Discovery, structure, and function. Proteomics 2023; 23:e2100211. [PMID: 37603371 PMCID: PMC10841188 DOI: 10.1002/pmic.202100211] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2023] [Revised: 08/03/2023] [Accepted: 08/10/2023] [Indexed: 08/22/2023]
Abstract
Advances in proteogenomic technologies have revealed hundreds to thousands of translated small open reading frames (sORFs) that encode microproteins in genomes across evolutionary space. While many microproteins have now been shown to play critical roles in biology and human disease, a majority of recently identified microproteins have little or no experimental evidence regarding their functionality. Computational tools have some limitations for analysis of short, poorly conserved microprotein sequences, so additional approaches are needed to determine the role of each member of this recently discovered polypeptide class. A currently underexplored avenue in the study of microproteins is structure prediction and determination, which delivers a depth of functional information. In this review, we provide a brief overview of microprotein discovery methods, then examine examples of microprotein structures (and, conversely, intrinsic disorder) that have been experimentally determined using crystallography, cryo-electron microscopy, and NMR, which provide insight into their molecular functions and mechanisms. Additionally, we discuss examples of predicted microprotein structures that have provided insight or context regarding their function. Analysis of microprotein structure at the angstrom level, and confirmation of predicted structures, therefore, has potential to identify translated microproteins that are of biological importance and to provide molecular mechanism for their in vivo roles.
Collapse
Affiliation(s)
- Jessica J. Mohsen
- Department of Chemistry, Yale University, New Haven, CT, USA
- Institute of Biomolecular Design and Discovery, Yale University, West Haven, CT, USA
| | - Alina A. Martel
- Institute of Biomolecular Design and Discovery, Yale University, West Haven, CT, USA
| | - Sarah A. Slavoff
- Department of Chemistry, Yale University, New Haven, CT, USA
- Institute of Biomolecular Design and Discovery, Yale University, West Haven, CT, USA
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| |
Collapse
|
24
|
Wacholder A, Carvunis AR. Biological factors and statistical limitations prevent detection of most noncanonical proteins by mass spectrometry. PLoS Biol 2023; 21:e3002409. [PMID: 38048358 PMCID: PMC10721188 DOI: 10.1371/journal.pbio.3002409] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2023] [Revised: 12/14/2023] [Accepted: 10/30/2023] [Indexed: 12/06/2023] Open
Abstract
Ribosome profiling experiments indicate pervasive translation of short open reading frames (ORFs) outside of annotated protein-coding genes. However, shotgun mass spectrometry (MS) experiments typically detect only a small fraction of the predicted protein products of this noncanonical translation. The rarity of detection could indicate that most predicted noncanonical proteins are rapidly degraded and not present in the cell; alternatively, it could reflect technical limitations. Here, we leveraged recent advances in ribosome profiling and MS to investigate the factors limiting detection of noncanonical proteins in yeast. We show that the low detection rate of noncanonical ORF products can largely be explained by small size and low translation levels and does not indicate that they are unstable or biologically insignificant. In particular, proteins encoded by evolutionarily young genes, including those with well-characterized biological roles, are too short and too lowly expressed to be detected by shotgun MS at current detection sensitivities. Additionally, we find that decoy biases can give misleading estimates of noncanonical protein false discovery rates, potentially leading to false detections. After accounting for these issues, we found strong evidence for 4 noncanonical proteins in MS data, which were also supported by evolution and translation data. These results illustrate the power of MS to validate unannotated genes predicted by ribosome profiling, but also its substantial limitations in finding many biologically relevant lowly expressed proteins.
Collapse
Affiliation(s)
- Aaron Wacholder
- Department of Computational and Systems Biology, School of Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
- Pittsburgh Center for Evolutionary Biology and Medicine, School of Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| | - Anne-Ruxandra Carvunis
- Department of Computational and Systems Biology, School of Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
- Pittsburgh Center for Evolutionary Biology and Medicine, School of Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| |
Collapse
|
25
|
Wacholder A, Carvunis AR. Biological Factors and Statistical Limitations Prevent Detection of Most Noncanonical Proteins by Mass Spectrometry. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.03.09.531963. [PMID: 36945638 PMCID: PMC10028962 DOI: 10.1101/2023.03.09.531963] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/14/2023]
Abstract
Ribosome profiling experiments indicate pervasive translation of short open reading frames (ORFs) outside of annotated protein-coding genes. However, shotgun mass spectrometry experiments typically detect only a small fraction of the predicted protein products of this noncanonical translation. The rarity of detection could indicate that most predicted noncanonical proteins are rapidly degraded and not present in the cell; alternatively, it could reflect technical limitations. Here we leveraged recent advances in ribosome profiling and mass spectrometry to investigate the factors limiting detection of noncanonical proteins in yeast. We show that the low detection rate of noncanonical ORF products can largely be explained by small size and low translation levels and does not indicate that they are unstable or biologically insignificant. In particular, proteins encoded by evolutionarily young genes, including those with well-characterized biological roles, are too short and too lowly-expressed to be detected by shotgun mass spectrometry at current detection sensitivities. Additionally, we find that decoy biases can give misleading estimates of noncanonical protein false discovery rates, potentially leading to false detections. After accounting for these issues, we found strong evidence for four noncanonical proteins in mass spectrometry data, which were also supported by evolution and translation data. These results illustrate the power of mass spectrometry to validate unannotated genes predicted by ribosome profiling, but also its substantial limitations in finding many biologically relevant lowly-expressed proteins.
Collapse
|
26
|
Fang JC, Liu MJ. Translation initiation at AUG and non-AUG triplets in plants. PLANT SCIENCE : AN INTERNATIONAL JOURNAL OF EXPERIMENTAL PLANT BIOLOGY 2023; 335:111822. [PMID: 37574140 DOI: 10.1016/j.plantsci.2023.111822] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/15/2022] [Revised: 07/22/2023] [Accepted: 08/07/2023] [Indexed: 08/15/2023]
Abstract
In plants and other eukaryotes, precise selection of translation initiation site (TIS) on mRNAs shapes the proteome in response to cellular events or environmental cues. The canonical translation of mRNAs initiates at a 5' proximal AUG codon in a favorable context. However, the coding and non-coding regions of plant genomes contain numerous unannotated alternative AUG and non-AUG TISs. Determining how and why these unexpected and prevalent TISs are activated in plants has emerged as an exciting research area. In this review, we focus on the selection of plant TISs and highlight studies that revealed previously unannotated TISs used in vivo via comparative genomics and genome-wide profiling of ribosome positioning and protein N-terminal ends. The biological signatures of non-AUG TIS-initiated open reading frames (ORFs) in plants are also discussed. We describe what is understood about cis-regulatory RNA elements and trans-acting eukaryotic initiation factors (eIFs) in the site selection for translation initiation by featuring the findings in plants along with supporting findings in non-plant species. The prevalent, unannotated TISs provide a hidden reservoir of ORFs that likely help reshape plant proteomes in response to developmental or environmental cues. These findings underscore the importance of understanding the mechanistic basis of TIS selection to functionally annotate plant genomes, especially for crops with large genomes.
Collapse
Affiliation(s)
- Jhen-Cheng Fang
- Biotechnology Center in Southern Taiwan, Academia Sinica, Tainan 711, Taiwan
| | - Ming-Jung Liu
- Biotechnology Center in Southern Taiwan, Academia Sinica, Tainan 711, Taiwan; Agricultural Biotechnology Research Center, Academia Sinica, Taipei 115, Taiwan.
| |
Collapse
|
27
|
Chen Y, Su H, Zhao J, Na Z, Jiang K, Bacchiocchi A, Loh KH, Halaban R, Wang Z, Cao X, Slavoff SA. Unannotated microprotein EMBOW regulates the interactome and chromatin and mitotic functions of WDR5. Cell Rep 2023; 42:113145. [PMID: 37725512 PMCID: PMC10629662 DOI: 10.1016/j.celrep.2023.113145] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2022] [Revised: 07/20/2023] [Accepted: 08/31/2023] [Indexed: 09/21/2023] Open
Abstract
The conserved WD40-repeat protein WDR5 interacts with multiple proteins both inside and outside the nucleus. However, it is currently unclear whether and how the distribution of WDR5 between complexes is regulated. Here, we show that an unannotated microprotein EMBOW (endogenous microprotein binder of WDR5) dually encoded in the human SCRIB gene interacts with WDR5 and regulates its binding to multiple interaction partners, including KMT2A and KIF2A. EMBOW is cell cycle regulated, with two expression maxima at late G1 phase and G2/M phase. Loss of EMBOW decreases WDR5 interaction with KIF2A, aberrantly shortens mitotic spindle length, prolongs G2/M phase, and delays cell proliferation. In contrast, loss of EMBOW increases WDR5 interaction with KMT2A, leading to WDR5 binding to off-target genes, erroneously increasing H3K4me3 levels, and activating transcription of these genes. Together, these results implicate EMBOW as a regulator of WDR5 that regulates its interactions and prevents its off-target binding in multiple contexts.
Collapse
Affiliation(s)
- Yanran Chen
- Department of Chemistry, Yale University, New Haven, CT 06520, USA; Institute for Biomolecular Design and Discovery, Yale University, West Haven, CT 06516, USA; Shanghai Key Laboratory of Regulatory Biology, Institute of Biomedical Sciences, School of Life Sciences, East China Normal University, Shanghai 200241, China; Key Laboratory of Brain Functional Genomics, Ministry of Education and Shanghai, School of Life Sciences, East China Normal University, Shanghai 200062, China
| | - Haomiao Su
- Department of Chemistry, Yale University, New Haven, CT 06520, USA; Institute for Biomolecular Design and Discovery, Yale University, West Haven, CT 06516, USA
| | - Jianing Zhao
- Frontier Innovation Center, Department of Systems Biology for Medicine, School of Basic Medical Sciences, Fudan University, Shanghai 200433, China; Shanghai Fifth People's Hospital, Fudan University, Shanghai 200433, China
| | - Zhenkun Na
- Department of Chemistry, Yale University, New Haven, CT 06520, USA; Institute for Biomolecular Design and Discovery, Yale University, West Haven, CT 06516, USA
| | - Kevin Jiang
- Department of Chemistry, Yale University, New Haven, CT 06520, USA; Institute for Biomolecular Design and Discovery, Yale University, West Haven, CT 06516, USA
| | - Antonella Bacchiocchi
- Department of Dermatology, Yale University School of Medicine, New Haven, CT 06520, USA
| | - Ken H Loh
- Institute for Biomolecular Design and Discovery, Yale University, West Haven, CT 06516, USA; Department of Comparative Medicine, Yale University School of Medicine, New Haven, CT 06520, USA
| | - Ruth Halaban
- Department of Dermatology, Yale University School of Medicine, New Haven, CT 06520, USA
| | - Zhentian Wang
- Frontier Innovation Center, Department of Systems Biology for Medicine, School of Basic Medical Sciences, Fudan University, Shanghai 200433, China; Shanghai Fifth People's Hospital, Fudan University, Shanghai 200433, China
| | - Xiongwen Cao
- Department of Chemistry, Yale University, New Haven, CT 06520, USA; Institute for Biomolecular Design and Discovery, Yale University, West Haven, CT 06516, USA; Department of Comparative Medicine, Yale University School of Medicine, New Haven, CT 06520, USA; Shanghai Key Laboratory of Regulatory Biology, Institute of Biomedical Sciences, School of Life Sciences, East China Normal University, Shanghai 200241, China; Key Laboratory of Brain Functional Genomics, Ministry of Education and Shanghai, School of Life Sciences, East China Normal University, Shanghai 200062, China.
| | - Sarah A Slavoff
- Department of Chemistry, Yale University, New Haven, CT 06520, USA; Institute for Biomolecular Design and Discovery, Yale University, West Haven, CT 06516, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06529, USA.
| |
Collapse
|
28
|
Prensner JR, Abelin JG, Kok LW, Clauser KR, Mudge JM, Ruiz-Orera J, Bassani-Sternberg M, Moritz RL, Deutsch EW, van Heesch S. What Can Ribo-Seq, Immunopeptidomics, and Proteomics Tell Us About the Noncanonical Proteome? Mol Cell Proteomics 2023; 22:100631. [PMID: 37572790 PMCID: PMC10506109 DOI: 10.1016/j.mcpro.2023.100631] [Citation(s) in RCA: 13] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2023] [Revised: 07/21/2023] [Accepted: 08/08/2023] [Indexed: 08/14/2023] Open
Abstract
Ribosome profiling (Ribo-Seq) has proven transformative for our understanding of the human genome and proteome by illuminating thousands of noncanonical sites of ribosome translation outside the currently annotated coding sequences (CDSs). A conservative estimate suggests that at least 7000 noncanonical ORFs are translated, which, at first glance, has the potential to expand the number of human protein CDSs by 30%, from ∼19,500 annotated CDSs to over 26,000 annotated CDSs. Yet, additional scrutiny of these ORFs has raised numerous questions about what fraction of them truly produce a protein product and what fraction of those can be understood as proteins according to conventional understanding of the term. Adding further complication is the fact that published estimates of noncanonical ORFs vary widely by around 30-fold, from several thousand to several hundred thousand. The summation of this research has left the genomics and proteomics communities both excited by the prospect of new coding regions in the human genome but searching for guidance on how to proceed. Here, we discuss the current state of noncanonical ORF research, databases, and interpretation, focusing on how to assess whether a given ORF can be said to be "protein coding."
Collapse
Affiliation(s)
- John R Prensner
- Division of Pediatric Hematology/Oncology, Department of Pediatrics, University of Michigan Medical School, Ann Arbor, Michigan, USA; Department of Biological Chemistry, University of Michigan Medical School, Ann Arbor, Michigan, USA.
| | | | - Leron W Kok
- Princess Máxima Center for Pediatric Oncology, Utrecht, The Netherlands
| | - Karl R Clauser
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
| | - Jonathan M Mudge
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Cambridge, UK
| | - Jorge Ruiz-Orera
- Cardiovascular and Metabolic Sciences, Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin, Germany
| | - Michal Bassani-Sternberg
- Ludwig Institute for Cancer Research, Agora Center Bugnon 25A, University of Lausanne, Lausanne, Switzerland; Department of Oncology, Centre Hospitalier Universitaire Vaudois (CHUV), Lausanne, Switzerland; Agora Cancer Research Centre, Lausanne, Switzerland
| | - Robert L Moritz
- Institute for Systems Biology (ISB), Seattle, Washington, USA
| | - Eric W Deutsch
- Institute for Systems Biology (ISB), Seattle, Washington, USA
| | | |
Collapse
|
29
|
Abstract
Within the next decade, the genomes of 1.8 million eukaryotic species will be sequenced. Identifying genes in these sequences is essential to understand the biology of the species. This is challenging due to the transcriptional complexity of eukaryotic genomes, which encode hundreds of thousands of transcripts of multiple types. Among these, a small set of protein-coding mRNAs play a disproportionately large role in defining phenotypes. Due to their sequence conservation, orthology can be established, making it possible to define the universal catalog of eukaryotic protein-coding genes. This catalog should substantially contribute to uncovering the genomic events underlying the emergence of eukaryotic phenotypes. This piece briefly reviews the basics of protein-coding gene prediction, discusses challenges in finalizing annotation of the human genome, and proposes strategies for producing annotations across the eukaryotic Tree of Life. This lays the groundwork for obtaining the catalog of all genes-the Earth's code of life.
Collapse
Affiliation(s)
- Roderic Guigó
- Bioinformatics and Genomics, Center for Genomic Regulation (CRG), The Barcelona Institute for Science and Technology (BIST), Dr. Aiguader 88, 08003 Barcelona, Catalonia
- Universitat Pompeu Fabra (UPF), Barcelona, Catalonia
| |
Collapse
|
30
|
Jaiswal M, Kumar S. smAMPsTK: a toolkit to unravel the smORFome encoding AMPs of plant species. J Biomol Struct Dyn 2023:1-13. [PMID: 37464885 DOI: 10.1080/07391102.2023.2235605] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2023] [Accepted: 07/06/2023] [Indexed: 07/20/2023]
Abstract
The pervasive repertoire of plant molecules with the potential to serve as a substitute for conventional antibiotics has led to obtaining better insights into plant-derived antimicrobial peptides (AMPs). The massive distribution of Small Open Reading Frames (smORFs) throughout eukaryotic genomes with proven extensive biological functions reflects their practicality as antimicrobials. Here, we have developed a pipeline named smAMPsTK to unveil the underlying hidden smORFs encoding AMPs for plant species. By applying this pipeline, we have elicited AMPs of various functional activity of lengths ranging from 5 to 100 aa by employing publicly available transcriptome data of five different angiosperms. Later, we studied the coding potential of AMPs-smORFs, the inclusion of diverse translation initiation start codons, and amino acid frequency. Codon usage study signifies no such codon usage biases for smORFs encoding AMPs. Majorly three start codons are prominent in generating AMPs. The evolutionary and conservational study proclaimed the widespread distribution of AMPs encoding genes throughout the plant kingdom. Domain analysis revealed that nearly all AMPs have chitin-binding ability, establishing their role as antifungal agents. The current study includes a developed methodology to characterize smORFs encoding AMPs, and their implications as antimicrobial, antibacterial, antifungal, or antiviral provided by SVM score and prediction status calculated by machine learning-based prediction models. The pipeline, complete package, and the results derived for five angiosperms are freely available at https://github.com/skbinfo/smAMPsTK.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Mohini Jaiswal
- Bioinformatics Laboratory, National Institute of Plant Genome Research (NIPGR), Aruna Asaf Ali Marg, New Delhi, India
| | - Shailesh Kumar
- Bioinformatics Laboratory, National Institute of Plant Genome Research (NIPGR), Aruna Asaf Ali Marg, New Delhi, India
| |
Collapse
|
31
|
Chen Y, Cao X, Loh KH, Slavoff SA. Chemical labeling and proteomics for characterization of unannotated small and alternative open reading frame-encoded polypeptides. Biochem Soc Trans 2023; 51:1071-1082. [PMID: 37171061 PMCID: PMC10317152 DOI: 10.1042/bst20221074] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2023] [Revised: 03/27/2023] [Accepted: 04/13/2023] [Indexed: 05/13/2023]
Abstract
Thousands of unannotated small and alternative open reading frames (smORFs and alt-ORFs, respectively) have recently been revealed in mammalian genomes. While hundreds of mammalian smORF- and alt-ORF-encoded proteins (SEPs and alt-proteins, respectively) affect cell proliferation, the overwhelming majority of smORFs and alt-ORFs remain uncharacterized at the molecular level. Complicating the task of identifying the biological roles of smORFs and alt-ORFs, the SEPs and alt-proteins that they encode exhibit limited sequence homology to protein domains of known function. Experimental techniques for the functionalization of these gene classes are therefore required. Approaches combining chemical labeling and quantitative proteomics have greatly advanced our ability to identify and characterize functional SEPs and alt-proteins in high throughput. In this review, we briefly describe the principles of proteomic discovery of SEPs and alt-proteins, then summarize how these technologies interface with chemical labeling for identification of SEPs and alt-proteins with specific properties, as well as in defining the interactome of SEPs and alt-proteins.
Collapse
Affiliation(s)
- Yanran Chen
- Department of Chemistry, Yale University, New Haven, CT, U.S.A
- Institute for Biomolecular Design and Discovery, Yale University, West Haven, CT, U.S.A
| | - Xiongwen Cao
- Department of Chemistry, Yale University, New Haven, CT, U.S.A
- Institute for Biomolecular Design and Discovery, Yale University, West Haven, CT, U.S.A
- Department of Comparative Medicine, Yale University School of Medicine, New Haven, CT, U.S.A
- Shanghai Key Laboratory of Regulatory Biology, Institute of Biomedical Sciences and School of Life Sciences, East China Normal University, Shanghai, China
| | - Ken H. Loh
- Institute for Biomolecular Design and Discovery, Yale University, West Haven, CT, U.S.A
- Department of Comparative Medicine, Yale University School of Medicine, New Haven, CT, U.S.A
| | - Sarah A. Slavoff
- Department of Chemistry, Yale University, New Haven, CT, U.S.A
- Institute for Biomolecular Design and Discovery, Yale University, West Haven, CT, U.S.A
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, U.S.A
| |
Collapse
|
32
|
Mohsen JJ, Slavoff SA. Noncoding translation: Quality control in the BAG. Mol Cell 2023; 83:1967-1969. [PMID: 37327774 DOI: 10.1016/j.molcel.2023.05.033] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2023] [Revised: 05/25/2023] [Accepted: 05/25/2023] [Indexed: 06/18/2023]
Abstract
Translation of noncoding regions is ubiquitous and upregulated in disease. Kesner et al.1 elucidate the mechanism by which the BAG6 complex exerts quality control over noncoding translation while targeting stable, noncanonical polypeptides to cellular membranes.
Collapse
Affiliation(s)
- Jessica J Mohsen
- Department of Chemistry, Yale University, New Haven, CT, USA; Institute for Biomolecular Design and Discovery, Yale University, West Haven, CT, USA
| | - Sarah A Slavoff
- Department of Chemistry, Yale University, New Haven, CT, USA; Institute for Biomolecular Design and Discovery, Yale University, West Haven, CT, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA.
| |
Collapse
|
33
|
Salz R, Saraiva-Agostinho N, Vorsteveld E, van der Made CI, Kersten S, Stemerdink M, Allen J, Volders PJ, Hunt SE, Hoischen A, 't Hoen PAC. SUsPECT: a pipeline for variant effect prediction based on custom long-read transcriptomes for improved clinical variant annotation. BMC Genomics 2023; 24:305. [PMID: 37280537 PMCID: PMC10245480 DOI: 10.1186/s12864-023-09391-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2023] [Accepted: 05/19/2023] [Indexed: 06/08/2023] Open
Abstract
Our incomplete knowledge of the human transcriptome impairs the detection of disease-causing variants, in particular if they affect transcripts only expressed under certain conditions. These transcripts are often lacking from reference transcript sets, such as Ensembl/GENCODE and RefSeq, and could be relevant for establishing genetic diagnoses. We present SUsPECT (Solving Unsolved Patient Exomes/gEnomes using Custom Transcriptomes), a pipeline based on the Ensembl Variant Effect Predictor (VEP) to predict variant impact on custom transcript sets, such as those generated by long-read RNA-sequencing, for downstream prioritization. Our pipeline predicts the functional consequence and likely deleteriousness scores for missense variants in the context of novel open reading frames predicted from any transcriptome. We demonstrate the utility of SUsPECT by uncovering potential mutational mechanisms of pathogenic variants in ClinVar that are not predicted to be pathogenic using the reference transcript annotation. In further support of SUsPECT's utility, we identified an enrichment of immune-related variants predicted to have a more severe molecular consequence when annotating with a newly generated transcriptome from stimulated immune cells instead of the reference transcriptome. Our pipeline outputs crucial information for further prioritization of potentially disease-causing variants for any disease and will become increasingly useful as more long-read RNA sequencing datasets become available.
Collapse
Affiliation(s)
- Renee Salz
- Department of Medical BioSciences, Radboud University Medical Center, Nijmegen, 6525 GA, the Netherlands
| | - Nuno Saraiva-Agostinho
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Emil Vorsteveld
- Department of Human Genetics, Radboud University Medical Center, Nijmegen, 6525 GA, the Netherlands
| | - Caspar I van der Made
- Department of Human Genetics, Radboud University Medical Center, Nijmegen, 6525 GA, the Netherlands
- Department of Internal Medicine, Radboud Institute for Molecular Life Sciences, and Radboud Expertise Center for Immunodeficiency and Autoinflammation, Radboud University Medical Center for Infectious Diseases (RCI), Radboud University Medical Center, Nijmegen, the Netherlands
| | - Simone Kersten
- Department of Human Genetics, Radboud University Medical Center, Nijmegen, 6525 GA, the Netherlands
| | - Merel Stemerdink
- Department of Otorhinolaryngology, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, 6525 GA, The Netherlands
| | - Jamie Allen
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Pieter-Jan Volders
- Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
- Laboratory of Molecular Diagnostics, Department of Clinical Biology, Jessa Hospital, Hasselt, 3500, Belgium
| | - Sarah E Hunt
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Alexander Hoischen
- Department of Human Genetics, Radboud University Medical Center, Nijmegen, 6525 GA, the Netherlands
- Department of Internal Medicine, Radboud Institute for Molecular Life Sciences, and Radboud Expertise Center for Immunodeficiency and Autoinflammation, Radboud University Medical Center for Infectious Diseases (RCI), Radboud University Medical Center, Nijmegen, the Netherlands
| | - Peter A C 't Hoen
- Department of Medical BioSciences, Radboud University Medical Center, Nijmegen, 6525 GA, the Netherlands.
| |
Collapse
|
34
|
Prensner JR, Abelin JG, Kok LW, Clauser KR, Mudge JM, Ruiz-Orera J, Bassani-Sternberg M, Deutsch EW, van Heesch S. What can Ribo-seq and proteomics tell us about the non-canonical proteome? BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.16.541049. [PMID: 37292611 PMCID: PMC10245706 DOI: 10.1101/2023.05.16.541049] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Ribosome profiling (Ribo-seq) has proven transformative for our understanding of the human genome and proteome by illuminating thousands of non-canonical sites of ribosome translation outside of the currently annotated coding sequences (CDSs). A conservative estimate suggests that at least 7,000 non-canonical open reading frames (ORFs) are translated, which, at first glance, has the potential to expand the number of human protein-coding sequences by 30%, from ∼19,500 annotated CDSs to over 26,000. Yet, additional scrutiny of these ORFs has raised numerous questions about what fraction of them truly produce a protein product and what fraction of those can be understood as proteins according to conventional understanding of the term. Adding further complication is the fact that published estimates of non-canonical ORFs vary widely by around 30-fold, from several thousand to several hundred thousand. The summation of this research has left the genomics and proteomics communities both excited by the prospect of new coding regions in the human genome, but searching for guidance on how to proceed. Here, we discuss the current state of non-canonical ORF research, databases, and interpretation, focusing on how to assess whether a given ORF can be said to be "protein-coding". In brief The human genome encodes thousands of non-canonical open reading frames (ORFs) in addition to protein-coding genes. As a nascent field, many questions remain regarding non-canonical ORFs. How many exist? Do they encode proteins? What level of evidence is needed for their verification? Central to these debates has been the advent of ribosome profiling (Ribo-seq) as a method to discern genome-wide ribosome occupancy, and immunopeptidomics as a method to detect peptides that are processed and presented by MHC molecules and not observed in traditional proteomics experiments. This article provides a synthesis of the current state of non-canonical ORF research and proposes standards for their future investigation and reporting. Highlights Combined use of Ribo-seq and proteomics-based methods enables optimal confidence in detecting non-canonical ORFs and their protein products.Ribo-seq can provide more sensitive detection of non-canonical ORFs, but data quality and analytical pipelines will impact results.Non-canonical ORF catalogs are diverse and span both high-stringency and low-stringency ORF nominations.A framework for standardized non-canonical ORF evidence will advance the research field.
Collapse
Affiliation(s)
- John R. Prensner
- Department of Pediatrics, Division of Pediatric Hematology/Oncology, University of Michigan Medical School, Ann Arbor, MI 48109, USA
| | | | - Leron W. Kok
- Princess Máxima Center for Pediatric Oncology, Heidelberglaan 25, 3584 CS, Utrecht, the Netherlands
| | - Karl R. Clauser
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
| | - Jonathan M. Mudge
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Jorge Ruiz-Orera
- Cardiovascular and Metabolic Sciences, Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany
| | - Michal Bassani-Sternberg
- Ludwig Institute for Cancer Research, University of Lausanne, Agora Center Bugnon 25A, 1005 Lausanne, Switzerland
- Department of Oncology, Centre hospitalier universitaire vaudois (CHUV), Rue du Bugnon 46, 1005 Lausanne, Switzerland
- Agora Cancer Research Centre, 1011 Lausanne, Switzerland
| | - Eric W. Deutsch
- Institute for Systems Biology (ISB), Seattle, Washington 98109, USA
| | - Sebastiaan van Heesch
- Princess Máxima Center for Pediatric Oncology, Heidelberglaan 25, 3584 CS, Utrecht, the Netherlands
| |
Collapse
|
35
|
Wacholder A, Parikh SB, Coelho NC, Acar O, Houghton C, Chou L, Carvunis AR. A vast evolutionarily transient translatome contributes to phenotype and fitness. Cell Syst 2023; 14:363-381.e8. [PMID: 37164009 PMCID: PMC10348077 DOI: 10.1016/j.cels.2023.04.002] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2022] [Revised: 01/30/2023] [Accepted: 04/06/2023] [Indexed: 05/12/2023]
Abstract
Translation is the process by which ribosomes synthesize proteins. Ribosome profiling recently revealed that many short sequences previously thought to be noncoding are pervasively translated. To identify protein-coding genes in this noncanonical translatome, we combine an integrative framework for extremely sensitive ribosome profiling analysis, iRibo, with high-powered selection inferences tailored for short sequences. We construct a reference translatome for Saccharomyces cerevisiae comprising 5,400 canonical and almost 19,000 noncanonical translated elements. Only 14 noncanonical elements were evolving under detectable purifying selection. A representative subset of translated elements lacking signatures of selection demonstrated involvement in processes including DNA repair, stress response, and post-transcriptional regulation. Our results suggest that most translated elements are not conserved protein-coding genes and contribute to genotype-phenotype relationships through fast-evolving molecular mechanisms.
Collapse
Affiliation(s)
- Aaron Wacholder
- Department of Computational and Systems Biology, School of Medicine, University of Pittsburgh, Pittsburgh, PA 15213, USA; Pittsburgh Center for Evolutionary Biology and Medicine, School of Medicine, University of Pittsburgh, Pittsburgh, PA 15213, USA
| | - Saurin Bipin Parikh
- Department of Computational and Systems Biology, School of Medicine, University of Pittsburgh, Pittsburgh, PA 15213, USA; Pittsburgh Center for Evolutionary Biology and Medicine, School of Medicine, University of Pittsburgh, Pittsburgh, PA 15213, USA; Integrative Systems Biology Program, School of Medicine, University of Pittsburgh, Pittsburgh, PA 15213, USA
| | - Nelson Castilho Coelho
- Department of Computational and Systems Biology, School of Medicine, University of Pittsburgh, Pittsburgh, PA 15213, USA; Pittsburgh Center for Evolutionary Biology and Medicine, School of Medicine, University of Pittsburgh, Pittsburgh, PA 15213, USA
| | - Omer Acar
- Department of Computational and Systems Biology, School of Medicine, University of Pittsburgh, Pittsburgh, PA 15213, USA; Pittsburgh Center for Evolutionary Biology and Medicine, School of Medicine, University of Pittsburgh, Pittsburgh, PA 15213, USA; Joint CMU-Pitt PhD Program in Computational Biology, University of Pittsburgh, Pittsburgh, PA 15213, USA
| | - Carly Houghton
- Department of Computational and Systems Biology, School of Medicine, University of Pittsburgh, Pittsburgh, PA 15213, USA; Pittsburgh Center for Evolutionary Biology and Medicine, School of Medicine, University of Pittsburgh, Pittsburgh, PA 15213, USA; Joint CMU-Pitt PhD Program in Computational Biology, University of Pittsburgh, Pittsburgh, PA 15213, USA
| | - Lin Chou
- Department of Computational and Systems Biology, School of Medicine, University of Pittsburgh, Pittsburgh, PA 15213, USA; Pittsburgh Center for Evolutionary Biology and Medicine, School of Medicine, University of Pittsburgh, Pittsburgh, PA 15213, USA; Integrative Systems Biology Program, School of Medicine, University of Pittsburgh, Pittsburgh, PA 15213, USA
| | - Anne-Ruxandra Carvunis
- Department of Computational and Systems Biology, School of Medicine, University of Pittsburgh, Pittsburgh, PA 15213, USA; Pittsburgh Center for Evolutionary Biology and Medicine, School of Medicine, University of Pittsburgh, Pittsburgh, PA 15213, USA.
| |
Collapse
|
36
|
Hofman DA, Ruiz-Orera J, Yannuzzi I, Murugesan R, Brown A, Clauser KR, Condurat AL, van Dinter JT, Engels SA, Goodale A, van der Lugt J, Abid T, Wang L, Zhou KN, Vogelzang J, Ligon KL, Phoenix TN, Roth JA, Root DE, Hubner N, Golub TR, Bandopadhayay P, van Heesch S, Prensner JR. Translation of non-canonical open reading frames as a cancer cell survival mechanism in childhood medulloblastoma. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.04.539399. [PMID: 37205492 PMCID: PMC10187264 DOI: 10.1101/2023.05.04.539399] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/21/2023]
Abstract
A hallmark of high-risk childhood medulloblastoma is the dysregulation of RNA translation. Currently, it is unknown whether medulloblastoma dysregulates the translation of putatively oncogenic non-canonical open reading frames. To address this question, we performed ribosome profiling of 32 medulloblastoma tissues and cell lines and observed widespread non-canonical ORF translation. We then developed a step-wise approach to employ multiple CRISPR-Cas9 screens to elucidate functional non-canonical ORFs implicated in medulloblastoma cell survival. We determined that multiple lncRNA-ORFs and upstream open reading frames (uORFs) exhibited selective functionality independent of the main coding sequence. One of these, ASNSD1-uORF or ASDURF, was upregulated, associated with the MYC family oncogenes, and was required for medulloblastoma cell survival through engagement with the prefoldin-like chaperone complex. Our findings underscore the fundamental importance of non-canonical ORF translation in medulloblastoma and provide a rationale to include these ORFs in future cancer genomics studies seeking to define new cancer targets.
Collapse
Affiliation(s)
- Damon A. Hofman
- Princess Máxima Center for Pediatric Oncology, Heidelberglaan 25, 3584 CS, Utrecht, the Netherlands
- These authors contributed equally
| | - Jorge Ruiz-Orera
- Cardiovascular and Metabolic Sciences, Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany
- These authors contributed equally
| | - Ian Yannuzzi
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
| | | | - Adam Brown
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
- Current address: Arbor Biotechnologies, Cambridge, MA, 02140, USA
| | - Karl R. Clauser
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
| | - Alexandra L. Condurat
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
- Department of Pediatric Oncology, Dana-Farber Cancer Institute, Boston, MA 02215, USA
| | - Jip T. van Dinter
- Princess Máxima Center for Pediatric Oncology, Heidelberglaan 25, 3584 CS, Utrecht, the Netherlands
| | - Sem A.G. Engels
- Princess Máxima Center for Pediatric Oncology, Heidelberglaan 25, 3584 CS, Utrecht, the Netherlands
| | - Amy Goodale
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
| | - Jasper van der Lugt
- Princess Máxima Center for Pediatric Oncology, Heidelberglaan 25, 3584 CS, Utrecht, the Netherlands
| | - Tanaz Abid
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
| | - Li Wang
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
| | - Kevin N. Zhou
- Department of Pediatric Oncology, Dana-Farber Cancer Institute, Boston, MA 02215, USA
- Current address: Kaiser Permanente Bernard J. Tyson School of Medicine, Pasadena, CA, 91101, USA
| | - Jayne Vogelzang
- Department of Pathology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, 02215, USA
- Department of Pathology, Brigham and Women’s Hospital, Boston, MA, 02215, USA
| | - Keith L. Ligon
- Department of Pathology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, 02215, USA
- Department of Pathology, Brigham and Women’s Hospital, Boston, MA, 02215, USA
- Department of Pathology, Boston Children’s Hospital, Boston MA 02115
| | - Timothy N. Phoenix
- Division of Pharmaceutical Sciences, James L. Winkle College of Pharmacy, University of Cincinnati, Cincinnati, OH, 45229, USA
| | | | - David E. Root
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
| | - Norbert Hubner
- Cardiovascular and Metabolic Sciences, Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany
- Charité-Universitätsmedizin, 10117 Berlin, Germany
- German Centre for Cardiovascular Research, Partner Site Berlin, 13347 Berlin, Germany
| | - Todd R. Golub
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
- Department of Pediatric Oncology, Dana-Farber Cancer Institute, Boston, MA 02215, USA
- Division of Pediatric Hematology/Oncology, Boston Children’s Hospital, Boston, MA, 02115, USA
| | - Pratiti Bandopadhayay
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
- Department of Pediatric Oncology, Dana-Farber Cancer Institute, Boston, MA 02215, USA
- Division of Pediatric Hematology/Oncology, Boston Children’s Hospital, Boston, MA, 02115, USA
| | - Sebastiaan van Heesch
- Princess Máxima Center for Pediatric Oncology, Heidelberglaan 25, 3584 CS, Utrecht, the Netherlands
| | - John R. Prensner
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
- Department of Pediatric Oncology, Dana-Farber Cancer Institute, Boston, MA 02215, USA
- Division of Pediatric Hematology/Oncology, Boston Children’s Hospital, Boston, MA, 02115, USA
- Current address: Department of Pediatrics, Division of Pediatric Hematology/Oncology, University of Michigan Medical School, Ann Arbor, MI 48109, USA
| |
Collapse
|
37
|
Li L, Shu XS, Geng H, Ying J, Guo L, Luo J, Xiang T, Wu L, Ma BBY, Chan ATC, Zhu X, Ambinder RF, Tao Q. A novel tumor suppressor encoded by a 1p36.3 lncRNA functions as a phosphoinositide-binding protein repressing AKT phosphorylation/activation and promoting autophagy. Cell Death Differ 2023; 30:1166-1183. [PMID: 36813924 PMCID: PMC10154315 DOI: 10.1038/s41418-023-01129-w] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2022] [Revised: 01/30/2023] [Accepted: 02/02/2023] [Indexed: 02/24/2023] Open
Abstract
Peptides/small proteins, encoded by noncanonical open reading frames (ORF) of previously claimed non-coding RNAs, have recently been recognized possessing important biological functions, but largely uncharacterized. 1p36 is an important tumor suppressor gene (TSG) locus frequently deleted in multiple cancers, with critical TSGs like TP73, PRDM16, and CHD5 already validated. Our CpG methylome analysis identified a silenced 1p36.3 gene KIAA0495, previously thought coding long non-coding RNA. We found that the open reading frame 2 of KIAA0495 is actually protein-coding and translating, encoding a small protein SP0495. KIAA0495 transcript is broadly expressed in multiple normal tissues, but frequently silenced by promoter CpG methylation in multiple tumor cell lines and primary tumors including colorectal, esophageal and breast cancers. Its downregulation/methylation is associated with poor survival of cancer patients. SP0495 induces tumor cell apoptosis, cell cycle arrest, senescence and autophagy, and inhibits tumor cell growth in vitro and in vivo. Mechanistically, SP0495 binds to phosphoinositides (PtdIns(3)P, PtdIns(3,5)P2) as a lipid-binding protein, inhibits AKT phosphorylation and its downstream signaling, and further represses oncogenic AKT/mTOR, NF-κB, and Wnt/β-catenin signaling. SP0495 also regulates the stability of autophagy regulators BECN1 and SQSTM1/p62 through modulating phosphoinositides turnover and autophagic/proteasomal degradation. Thus, we discovered and validated a 1p36.3 small protein SP0495, functioning as a novel tumor suppressor regulating AKT signaling activation and autophagy as a phosphoinositide-binding protein, being frequently inactivated by promoter methylation in multiple tumors as a potential biomarker.
Collapse
Affiliation(s)
- Lili Li
- Cancer Epigenetics Laboratory, Department of Clinical Oncology, State Key Laboratory of Translational Oncology, Sir YK Pao Center for Cancer and Li Ka Shing Institute of Health Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong.
| | - Xing-Sheng Shu
- Cancer Epigenetics Laboratory, Department of Clinical Oncology, State Key Laboratory of Translational Oncology, Sir YK Pao Center for Cancer and Li Ka Shing Institute of Health Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong
- School of Medicine, Shenzhen University Health Science Center, Shenzhen, China
| | - Hua Geng
- Cancer Epigenetics Laboratory, Department of Clinical Oncology, State Key Laboratory of Translational Oncology, Sir YK Pao Center for Cancer and Li Ka Shing Institute of Health Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong
| | - Jianming Ying
- Cancer Epigenetics Laboratory, Department of Clinical Oncology, State Key Laboratory of Translational Oncology, Sir YK Pao Center for Cancer and Li Ka Shing Institute of Health Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong
- Department of Pathology, Cancer Hospital, Peking Union Medical College & Chinese Academy of Medical Sciences, Beijing, China
| | - Lei Guo
- Department of Pathology, Cancer Hospital, Peking Union Medical College & Chinese Academy of Medical Sciences, Beijing, China
| | - Jie Luo
- Cancer Epigenetics Laboratory, Department of Clinical Oncology, State Key Laboratory of Translational Oncology, Sir YK Pao Center for Cancer and Li Ka Shing Institute of Health Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong
| | - Tingxiu Xiang
- The First Affiliated Hospital of Chongqing Medical University, Chongqing, China
- Chongqing University Cancer Hospital, Chongqing, China
| | - Longtao Wu
- Cancer Epigenetics Laboratory, Department of Clinical Oncology, State Key Laboratory of Translational Oncology, Sir YK Pao Center for Cancer and Li Ka Shing Institute of Health Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong
| | - Brigette B Y Ma
- Cancer Epigenetics Laboratory, Department of Clinical Oncology, State Key Laboratory of Translational Oncology, Sir YK Pao Center for Cancer and Li Ka Shing Institute of Health Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong
| | - Anthony T C Chan
- Cancer Epigenetics Laboratory, Department of Clinical Oncology, State Key Laboratory of Translational Oncology, Sir YK Pao Center for Cancer and Li Ka Shing Institute of Health Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong
| | - Xiaofeng Zhu
- State Key Laboratory of Oncology in South China, Sun Yat-sen University Cancer Center, Guangzhou, China
| | - Richard F Ambinder
- Johns Hopkins Singapore and Sydney Kimmel Comprehensive Cancer Center, Johns Hopkins School of Medicine, Baltimore, MD, USA
| | - Qian Tao
- Cancer Epigenetics Laboratory, Department of Clinical Oncology, State Key Laboratory of Translational Oncology, Sir YK Pao Center for Cancer and Li Ka Shing Institute of Health Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong.
- Johns Hopkins Singapore and Sydney Kimmel Comprehensive Cancer Center, Johns Hopkins School of Medicine, Baltimore, MD, USA.
| |
Collapse
|
38
|
Oreper D, Klaeger S, Jhunjhunwala S, Delamarre L. The peptide woods are lovely, dark and deep: Hunting for novel cancer antigens. Semin Immunol 2023; 67:101758. [PMID: 37027981 DOI: 10.1016/j.smim.2023.101758] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/31/2022] [Revised: 03/22/2023] [Accepted: 03/22/2023] [Indexed: 04/08/2023]
Abstract
Harnessing the patient's immune system to control a tumor is a proven avenue for cancer therapy. T cell therapies as well as therapeutic vaccines, which target specific antigens of interest, are being explored as treatments in conjunction with immune checkpoint blockade. For these therapies, selecting the best suited antigens is crucial. Most of the focus has thus far been on neoantigens that arise from tumor-specific somatic mutations. Although there is clear evidence that T-cell responses against mutated neoantigens are protective, the large majority of these mutations are not immunogenic. In addition, most somatic mutations are unique to each individual patient and their targeting requires the development of individualized approaches. Therefore, novel antigen types are needed to broaden the scope of such treatments. We review high throughput approaches for discovering novel tumor antigens and some of the key challenges associated with their detection, and discuss considerations when selecting tumor antigens to target in the clinic.
Collapse
Affiliation(s)
- Daniel Oreper
- Genentech, 1 DNA way, South San Francisco, 94080 CA, USA.
| | - Susan Klaeger
- Genentech, 1 DNA way, South San Francisco, 94080 CA, USA.
| | | | | |
Collapse
|
39
|
Seal RL, Tweedie S, Bruford EA. A standardised nomenclature for long non-coding RNAs. IUBMB Life 2023; 75:380-389. [PMID: 35880706 PMCID: PMC9877250 DOI: 10.1002/iub.2663] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2022] [Accepted: 07/08/2022] [Indexed: 01/29/2023]
Abstract
The HUGO Gene Nomenclature Committee (HGNC) is the sole group with the authority to approve symbols for human genes, including long non-coding RNA (lncRNA) genes. Use of approved symbols ensures that publications and biomedical databases are easily searchable and reduces the risks of confusion that can be caused by using the same symbol to refer to different genes or using many different symbols for the same gene. Here, we describe how the HGNC names lncRNA genes and review the nomenclature of the seven lncRNA genes most mentioned in the scientific literature.
Collapse
Affiliation(s)
- Ruth L. Seal
- HUGO Gene Nomenclature Committee, European Molecular Biology LaboratoryEuropean Bioinformatics Institute, Wellcome Genome CampusHinxtonUK
- Department of HaematologyUniversity of Cambridge School of Clinical MedicineCambridgeUK
| | - Susan Tweedie
- HUGO Gene Nomenclature Committee, European Molecular Biology LaboratoryEuropean Bioinformatics Institute, Wellcome Genome CampusHinxtonUK
| | - Elspeth A. Bruford
- HUGO Gene Nomenclature Committee, European Molecular Biology LaboratoryEuropean Bioinformatics Institute, Wellcome Genome CampusHinxtonUK
- Department of HaematologyUniversity of Cambridge School of Clinical MedicineCambridgeUK
| |
Collapse
|
40
|
Jain N, Richter F, Adzhubei I, Sharp AJ, Gelb BD. Small open reading frames: a comparative genetics approach to validation. BMC Genomics 2023; 24:226. [PMID: 37127568 PMCID: PMC10152738 DOI: 10.1186/s12864-023-09311-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2022] [Accepted: 04/13/2023] [Indexed: 05/03/2023] Open
Abstract
Open reading frames (ORFs) with fewer than 100 codons are generally not annotated in genomes, although bona fide genes of that size are known. Newer biochemical studies have suggested that thousands of small protein-coding ORFs (smORFs) may exist in the human genome, but the true number and the biological significance of the micropeptides they encode remain uncertain. Here, we used a comparative genomics approach to identify high-confidence smORFs that are likely protein-coding. We identified 3,326 high-confidence smORFs using constraint within human populations and evolutionary conservation as additional lines of evidence. Next, we validated that, as a group, our high-confidence smORFs are conserved at the amino-acid level rather than merely residing in highly conserved non-coding regions. Finally, we found that high-confidence smORFs are enriched among disease-associated variants from GWAS. Overall, our results highlight that smORF-encoded peptides likely have important functional roles in human disease.
Collapse
Affiliation(s)
- Niyati Jain
- Department of Genetics and Genomic Sciences and Mindich Child Health and Development Institute, Icahn School of Medicine at Mount, Hess Center for Science and Medicine, 1470 Madison Avenue, New York, NY, 10029, USA
- Present Address: Committee On Genetics, Genomics, and Systems Biology, The University of Chicago, Chicago, IL, USA
| | - Felix Richter
- Department of Pediatrics, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Ivan Adzhubei
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
- Division of Genetics, Brigham and Women's Hospital, Boston, MA, USA
| | - Andrew J Sharp
- Department of Genetics and Genomic Sciences and Mindich Child Health and Development Institute, Icahn School of Medicine at Mount, Hess Center for Science and Medicine, 1470 Madison Avenue, New York, NY, 10029, USA
| | - Bruce D Gelb
- Department of Genetics and Genomic Sciences and Mindich Child Health and Development Institute, Icahn School of Medicine at Mount, Hess Center for Science and Medicine, 1470 Madison Avenue, New York, NY, 10029, USA.
- Department of Pediatrics, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
| |
Collapse
|
41
|
Sandmann CL, Schulz JF, Ruiz-Orera J, Kirchner M, Ziehm M, Adami E, Marczenke M, Christ A, Liebe N, Greiner J, Schoenenberger A, Muecke MB, Liang N, Moritz RL, Sun Z, Deutsch EW, Gotthardt M, Mudge JM, Prensner JR, Willnow TE, Mertins P, van Heesch S, Hubner N. Evolutionary origins and interactomes of human, young microproteins and small peptides translated from short open reading frames. Mol Cell 2023; 83:994-1011.e18. [PMID: 36806354 PMCID: PMC10032668 DOI: 10.1016/j.molcel.2023.01.023] [Citation(s) in RCA: 30] [Impact Index Per Article: 30.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2022] [Revised: 12/12/2022] [Accepted: 01/25/2023] [Indexed: 02/19/2023]
Abstract
All species continuously evolve short open reading frames (sORFs) that can be templated for protein synthesis and may provide raw materials for evolutionary adaptation. We analyzed the evolutionary origins of 7,264 recently cataloged human sORFs and found that most were evolutionarily young and had emerged de novo. We additionally identified 221 previously missed sORFs potentially translated into peptides of up to 15 amino acids-all of which are smaller than the smallest human microprotein annotated to date. To investigate the bioactivity of sORF-encoded small peptides and young microproteins, we subjected 266 candidates to a mass-spectrometry-based interactome screen with motif resolution. Based on these interactomes and additional cellular assays, we can associate several candidates with mRNA splicing, translational regulation, and endocytosis. Our work provides insights into the evolutionary origins and interaction potential of young and small proteins, thereby helping to elucidate this underexplored territory of the human proteome.
Collapse
Affiliation(s)
- Clara-L Sandmann
- Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany; DZHK (German Centre for Cardiovascular Research), Partner Site Berlin, 13347 Berlin, Germany
| | - Jana F Schulz
- Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany; DZHK (German Centre for Cardiovascular Research), Partner Site Berlin, 13347 Berlin, Germany
| | - Jorge Ruiz-Orera
- Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany
| | - Marieluise Kirchner
- Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany; Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Core Facility Proteomics, 10117 Berlin, Germany
| | - Matthias Ziehm
- Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany; Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Core Facility Proteomics, 10117 Berlin, Germany
| | - Eleonora Adami
- Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany
| | - Maike Marczenke
- Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany
| | - Annabel Christ
- Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany
| | - Nina Liebe
- Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany
| | - Johannes Greiner
- Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany
| | - Aaron Schoenenberger
- Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany
| | - Michael B Muecke
- Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany; DZHK (German Centre for Cardiovascular Research), Partner Site Berlin, 13347 Berlin, Germany; Charité-Universitätsmedizin, 10117 Berlin, Germany
| | - Ning Liang
- Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany
| | | | - Zhi Sun
- Institute for Systems Biology, Seattle, WA 98109, USA
| | | | - Michael Gotthardt
- Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany; DZHK (German Centre for Cardiovascular Research), Partner Site Berlin, 13347 Berlin, Germany; Charité-Universitätsmedizin, 10117 Berlin, Germany
| | - Jonathan M Mudge
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - John R Prensner
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA; Department of Pediatric Oncology, Dana-Farber Cancer Institute, Boston, MA 02215, USA; Division of Pediatric Hematology/Oncology, Boston Children's Hospital, Boston, MA 02115, USA
| | - Thomas E Willnow
- Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany; Department of Biomedicine, Aarhus University, 8000 Aarhus, Denmark
| | - Philipp Mertins
- Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany; Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Core Facility Proteomics, 10117 Berlin, Germany
| | | | - Norbert Hubner
- Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany; DZHK (German Centre for Cardiovascular Research), Partner Site Berlin, 13347 Berlin, Germany; Charité-Universitätsmedizin, 10117 Berlin, Germany.
| |
Collapse
|
42
|
Evolution and implications of de novo genes in humans. Nat Ecol Evol 2023:10.1038/s41559-023-02014-y. [PMID: 36928843 DOI: 10.1038/s41559-023-02014-y] [Citation(s) in RCA: 17] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2022] [Accepted: 02/06/2023] [Indexed: 03/18/2023]
Abstract
Genes and translated open reading frames (ORFs) that emerged de novo from previously non-coding sequences provide species with opportunities for adaptation. When aberrantly activated, some human-specific de novo genes and ORFs have disease-promoting properties-for instance, driving tumour growth. Thousands of putative de novo coding sequences have been described in humans, but we still do not know what fraction of those ORFs has readily acquired a function. Here, we discuss the challenges and controversies surrounding the detection, mechanisms of origin, annotation, validation and characterization of de novo genes and ORFs. Through manual curation of literature and databases, we provide a thorough table with most de novo genes reported for humans to date. We re-evaluate each locus by tracing the enabling mutations and list proposed disease associations, protein characteristics and supporting evidence for translation and protein detection. This work will support future explorations of de novo genes and ORFs in humans.
Collapse
|
43
|
Luria V, Ma S, Shibata M, Pattabiraman K, Sestan N. Molecular and cellular mechanisms of human cortical connectivity. Curr Opin Neurobiol 2023; 80:102699. [PMID: 36921362 DOI: 10.1016/j.conb.2023.102699] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2022] [Accepted: 02/05/2023] [Indexed: 03/18/2023]
Abstract
Comparative studies of the cerebral cortex have identified various human and primate-specific changes in both local and long-range connectivity, which are thought to underlie our advanced cognitive capabilities. These changes are likely mediated by the divergence of spatiotemporal regulation of gene expression, which is particularly prominent in the prenatal and early postnatal human and non-human primate cerebral cortex. In this review, we describe recent advances in characterizing human and primate genetic and cellular innovations including identification of novel species-specific, especially human-specific, genes, gene expression patterns, and cell types. Finally, we highlight three recent studies linking these molecular changes to reorganization of cortical connectivity.
Collapse
Affiliation(s)
- Victor Luria
- Department of Neuroscience, Yale School of Medicine, New Haven, CT, 06510, USA
| | - Shaojie Ma
- Department of Neuroscience, Yale School of Medicine, New Haven, CT, 06510, USA
| | - Mikihito Shibata
- Department of Neuroscience, Yale School of Medicine, New Haven, CT, 06510, USA
| | - Kartik Pattabiraman
- Yale Child Study Center, Yale School of Medicine, New Haven, CT, 06510, USA.
| | - Nenad Sestan
- Department of Neuroscience, Yale School of Medicine, New Haven, CT, 06510, USA; Yale Child Study Center, Yale School of Medicine, New Haven, CT, 06510, USA; Departments of Psychiatry, Genetics and Comparative Medicine, Program in Cellular Neuroscience, Neurodegeneration and Repair, and Kavli Institute for Neuroscience, Yale School of Medicine, New Haven, CT, 06510, USA.
| |
Collapse
|
44
|
Pueyo JI, Salazar J, Grincho C, Berni J, Towler BP, Newbury SF. Purriato is a conserved small open reading frame gene that interacts with the CASA pathway to regulate muscle homeostasis and epithelial tissue growth in Drosophila. Front Cell Dev Biol 2023; 11:1117454. [PMID: 36968202 PMCID: PMC10036370 DOI: 10.3389/fcell.2023.1117454] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2022] [Accepted: 02/24/2023] [Indexed: 03/12/2023] Open
Abstract
Recent advances in proteogenomic techniques and bioinformatic pipelines have permitted the detection of thousands of translated small Open Reading Frames (smORFs), which contain less than 100 codons, in eukaryotic genomes. Hundreds of these actively translated smORFs display conserved sequence, structure and evolutionary signatures indicating that the translated peptides could fulfil important biological roles. Despite their abundance, only tens of smORF genes have been fully characterised; these act mainly as regulators of canonical proteins involved in essential cellular processes. Importantly, some of these smORFs display conserved functions with their mutations being associated with pathogenesis. Thus, investigating smORF roles in Drosophila will not only expand our understanding of their functions but it may have an impact in human health. Here we describe the function of a novel and essential Drosophila smORF gene named purriato (prto). prto belongs to an ancient gene family whose members have expanded throughout the Protostomia clade. prto encodes a transmembrane peptide which is localized in endo-lysosomes and perinuclear and plasma membranes. prto is dynamically expressed in mesodermal tissues and imaginal discs. Targeted prto knockdown (KD) in these organs results in changes in nuclear morphology and endo-lysosomal distributions correlating with the loss of sarcomeric homeostasis in muscles and reduction of mitosis in wing discs. Consequently, prto KD mutants display severe reduction of motility, and shorter wings. Finally, our genetic interaction experiments show that prto function is closely associated to the CASA pathway, a conserved mechanism involved in turnover of mis-folded proteins and linked to muscle dystrophies and neurodegenerative diseases. Thus, this study shows the relevance of smORFs in regulating important cellular functions and supports the systematic characterisation of this class of genes to understand their functions and evolution.
Collapse
Affiliation(s)
- Jose I. Pueyo
- Brighton and Sussex Medical School, University of Sussex, Brighton, United Kingdom
| | - Jorge Salazar
- Brighton and Sussex Medical School, University of Sussex, Brighton, United Kingdom
| | - Carolina Grincho
- Brighton and Sussex Medical School, University of Sussex, Brighton, United Kingdom
| | - Jimena Berni
- Brighton and Sussex Medical School, University of Sussex, Brighton, United Kingdom
| | - Benjamin P. Towler
- Brighton and Sussex Medical School, University of Sussex, Brighton, United Kingdom
- Department of Biochemistry and Biomedicine, School of Life Sciences, University of Sussex, Brighton, United Kingdom
| | - Sarah F. Newbury
- Brighton and Sussex Medical School, University of Sussex, Brighton, United Kingdom
| |
Collapse
|
45
|
Zheng C, Wei Y, Zhang P, Xu L, Zhang Z, Lin K, Hou J, Lv X, Ding Y, Chiu Y, Jain A, Islam N, Malovannaya A, Wu Y, Ding F, Xu H, Sun M, Chen X, Chen Y. CRISPR/Cas9 screen uncovers functional translation of cryptic lncRNA-encoded open reading frames in human cancer. J Clin Invest 2023; 133:e159940. [PMID: 36856111 PMCID: PMC9974104 DOI: 10.1172/jci159940] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2022] [Accepted: 01/19/2023] [Indexed: 03/02/2023] Open
Abstract
Emerging evidence suggests that cryptic translation within long noncoding RNAs (lncRNAs) may produce novel proteins with important developmental/physiological functions. However, the role of this cryptic translation in complex diseases (e.g., cancer) remains elusive. Here, we applied an integrative strategy combining ribosome profiling and CRISPR/Cas9 screening with large-scale analysis of molecular/clinical data for breast cancer (BC) and identified estrogen receptor α-positive (ER+) BC dependency on the cryptic ORFs encoded by lncRNA genes that were upregulated in luminal tumors. We confirmed the in vivo tumor-promoting function of an unannotated protein, GATA3-interacting cryptic protein (GT3-INCP) encoded by LINC00992, the expression of which was associated with poor prognosis in luminal tumors. GTE-INCP was upregulated by estrogen/ER and regulated estrogen-dependent cell growth. Mechanistically, GT3-INCP interacted with GATA3, a master transcription factor key to mammary gland development/BC cell proliferation, and coregulated a gene expression program that involved many BC susceptibility/risk genes and impacted estrogen response/cell proliferation. GT3-INCP/GATA3 bound to common cis regulatory elements and upregulated the expression of the tumor-promoting and estrogen-regulated BC susceptibility/risk genes MYB and PDZK1. Our study indicates that cryptic lncRNA-encoded proteins can be an important integrated component of the master transcriptional regulatory network driving aberrant transcription in cancer, and suggests that the "hidden" lncRNA-encoded proteome might be a new space for therapeutic target discovery.
Collapse
Affiliation(s)
- Caishang Zheng
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, Texas, USA
| | - Yanjun Wei
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, Texas, USA
| | - Peng Zhang
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, Texas, USA
| | - Longyong Xu
- Department of Molecular and Cellular Biology
- Lester and Sue Smith Breast Center, and
- Dan L. Duncan Comprehensive Cancer Center, Baylor College of Medicine, Houston, Texas, USA
| | - Zhenzhen Zhang
- Department of Physics and Astronomy, Clemson University, Clemson, South Carolina, USA
| | - Kangyu Lin
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, Texas, USA
| | - Jiakai Hou
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, Texas, USA
| | - Xiangdong Lv
- Department of Molecular and Cellular Biology
- Lester and Sue Smith Breast Center, and
- Dan L. Duncan Comprehensive Cancer Center, Baylor College of Medicine, Houston, Texas, USA
| | - Yao Ding
- Department of Molecular and Cellular Biology
- Lester and Sue Smith Breast Center, and
- Dan L. Duncan Comprehensive Cancer Center, Baylor College of Medicine, Houston, Texas, USA
| | - Yulun Chiu
- Department of Melanoma Medical Oncology, The University of Texas MD Anderson Cancer Center, Houston, Texas, USA
| | | | | | - Anna Malovannaya
- Dan L. Duncan Comprehensive Cancer Center, Baylor College of Medicine, Houston, Texas, USA
- Mass Spectrometry Proteomics Core and
- Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, Houston, Texas, USA
| | - Yun Wu
- Department of Pathology, The University of Texas MD Anderson Cancer Center, Houston, Texas, USA
| | - Feng Ding
- Department of Physics and Astronomy, Clemson University, Clemson, South Carolina, USA
| | - Han Xu
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, Texas, USA
- Department of Epigenetics and Molecular Carcinogenesis, The University of Texas MD Anderson Cancer Center
- Genetics and Epigenetics Program, and
- Quantitative Sciences Program, MD Anderson Cancer Center UTHealth Graduate School of Biomedical Sciences, Houston, Texas, USA
| | - Ming Sun
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, Texas, USA
| | - Xi Chen
- Department of Molecular and Cellular Biology
- Lester and Sue Smith Breast Center, and
- Dan L. Duncan Comprehensive Cancer Center, Baylor College of Medicine, Houston, Texas, USA
| | - Yiwen Chen
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, Texas, USA
- Quantitative Sciences Program, MD Anderson Cancer Center UTHealth Graduate School of Biomedical Sciences, Houston, Texas, USA
| |
Collapse
|
46
|
Kidwell A, Yadav SPS, Maier B, Zollman A, Ni K, Halim A, Janosevic D, Myslinski J, Syed F, Zeng L, Waffo AB, Banno K, Xuei X, Doud EH, Dagher PC, Hato T. Translation Rescue by Targeting Ppp1r15a through Its Upstream Open Reading Frame in Sepsis-Induced Acute Kidney Injury in a Murine Model. J Am Soc Nephrol 2023; 34:220-240. [PMID: 36283811 PMCID: PMC10103092 DOI: 10.1681/asn.2022060644] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2022] [Accepted: 09/23/2022] [Indexed: 01/21/2023] Open
Abstract
BACKGROUND Translation shutdown is a hallmark of late-phase, sepsis-induced kidney injury. Methods for controlling protein synthesis in the kidney are limited. Reversing translation shutdown requires dephosphorylation of the eukaryotic initiation factor 2 (eIF2) subunit eIF2 α ; this is mediated by a key regulatory molecule, protein phosphatase 1 regulatory subunit 15A (Ppp1r15a), also known as GADD34. METHODS To study protein synthesis in the kidney in a murine endotoxemia model and investigate the feasibility of translation control in vivo by boosting the protein expression of Ppp1r15a, we combined multiple tools, including ribosome profiling (Ribo-seq), proteomics, polyribosome profiling, and antisense oligonucleotides, and a newly generated Ppp1r15a knock-in mouse model and multiple mutant cell lines. RESULTS We report that translation shutdown in established sepsis-induced kidney injury is brought about by excessive eIF2 α phosphorylation and sustained by blunted expression of the counter-regulatory phosphatase Ppp1r15a. We determined the blunted Ppp1r15a expression persists because of the presence of an upstream open reading frame (uORF). Overcoming this barrier with genetic and antisense oligonucleotide approaches enabled the overexpression of Ppp1r15a, which salvaged translation and improved kidney function in an endotoxemia model. Loss of this uORF also had broad effects on the composition and phosphorylation status of the immunopeptidome-peptides associated with the MHC-that extended beyond the eIF2 α axis. CONCLUSIONS We found Ppp1r15a is translationally repressed during late-phase sepsis because of the existence of an uORF, which is a prime therapeutic candidate for this strategic rescue of translation in late-phase sepsis. The ability to accurately control translation dynamics during sepsis may offer new paths for the development of therapies at codon-level precision. PODCAST This article contains a podcast at.
Collapse
Affiliation(s)
- Ashley Kidwell
- Department of Medicine, Indiana University School of Medicine, Indianapolis, Indiana
| | | | - Bernhard Maier
- Department of Medicine, Indiana University School of Medicine, Indianapolis, Indiana
| | - Amy Zollman
- Department of Medicine, Indiana University School of Medicine, Indianapolis, Indiana
| | - Kevin Ni
- Department of Medicine, Indiana University School of Medicine, Indianapolis, Indiana
| | - Arvin Halim
- Department of Medicine, Indiana University School of Medicine, Indianapolis, Indiana
| | - Danielle Janosevic
- Department of Medicine, Indiana University School of Medicine, Indianapolis, Indiana
| | - Jered Myslinski
- Department of Medicine, Indiana University School of Medicine, Indianapolis, Indiana
| | - Farooq Syed
- Department of Pediatrics and the Herman B. Wells Center, Indiana University School of Medicine, Indianapolis, Indiana
| | - Lifan Zeng
- Department of Biochemistry and Molecular Biology, Indiana University School of Medicine, Indianapolis, Indiana
| | - Alain Bopda Waffo
- Department of Biochemistry and Molecular Biology, Indiana University School of Medicine, Indianapolis, Indiana
| | - Kimihiko Banno
- Department of Physiology, Nara Medical University, Kashihara, Japan
| | - Xiaoling Xuei
- Department of Medical and Molecular Genetics, Indiana University School of Medicine, Indianapolis, Indiana
| | - Emma H. Doud
- Department of Biochemistry and Molecular Biology, Indiana University School of Medicine, Indianapolis, Indiana
| | - Pierre C. Dagher
- Department of Medicine, Indiana University School of Medicine, Indianapolis, Indiana
| | - Takashi Hato
- Department of Medicine, Indiana University School of Medicine, Indianapolis, Indiana
- Richard L. Roudebush Veterans Affairs Medical Center, Indianapolis, Indiana
| |
Collapse
|
47
|
Liu T, Zou B, He M, Hu Y, Dou Y, Cui T, Tan P, Li S, Rao S, Huang Y, Liu S, Cai K, Wang D. LncReader: identification of dual functional long noncoding RNAs using a multi-head self-attention mechanism. Brief Bioinform 2023; 24:6961607. [PMID: 36575567 DOI: 10.1093/bib/bbac579] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2022] [Revised: 11/11/2022] [Accepted: 11/28/2022] [Indexed: 12/29/2022] Open
Abstract
Long noncoding ribonucleic acids (RNAs; LncRNAs) endowed with both protein-coding and noncoding functions are referred to as 'dual functional lncRNAs'. Recently, dual functional lncRNAs have been intensively studied and identified as involved in various fundamental cellular processes. However, apart from time-consuming and cell-type-specific experiments, there is virtually no in silico method for predicting the identity of dual functional lncRNAs. Here, we developed a deep-learning model with a multi-head self-attention mechanism, LncReader, to identify dual functional lncRNAs. Our data demonstrated that LncReader showed multiple advantages compared to various classical machine learning methods using benchmark datasets from our previously reported cncRNAdb project. Moreover, to obtain independent in-house datasets for robust testing, mass spectrometry proteomics combined with RNA-seq and Ribo-seq were applied in four leukaemia cell lines, which further confirmed that LncReader achieved the best performance compared to other tools. Therefore, LncReader provides an accurate and practical tool that enables fast dual functional lncRNA identification.
Collapse
Affiliation(s)
- Tianyuan Liu
- Department of Hematology and Oncology, Shenzhen Children's Hospital, Shenzhen 518038, China.,Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, Guangzhou 510515, China
| | - Bohao Zou
- Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, Guangzhou 510515, China.,Department of Statistics, University of California Davis, Davis, California, USA
| | - Manman He
- State Key Laboratory of Medical Molecular Biology, Key Laboratorytar of RNA Regulation and Hematopoiesis, Department of Biochemistry and Molecular Biology, Institute of Basic Medical Sciences, School of Basic Medicine, CAMS and Peking Union Medical College, Beijing 100005, China
| | - Yongfei Hu
- Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, Guangzhou 510515, China.,Dermatology Hospital, Southern Medical University, Guangzhou, 510091, China
| | - Yiying Dou
- Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, Guangzhou 510515, China
| | - Tianyu Cui
- Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, Guangzhou 510515, China
| | - Puwen Tan
- Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, Guangzhou 510515, China
| | - Shaobin Li
- Department of Thoracic Surgery, Nanfang Hospital, Southern Medical University, Guangzhou 510515, China
| | - Shuan Rao
- Department of Thoracic Surgery, Nanfang Hospital, Southern Medical University, Guangzhou 510515, China
| | - Yan Huang
- Cancer Research Institute, School of Basic Medical Sciences, Southern Medical University, Guangzhou 510515, China
| | - Sixi Liu
- Department of Hematology and Oncology, Shenzhen Children's Hospital, Shenzhen 518038, China
| | - Kaican Cai
- Department of Thoracic Surgery, Nanfang Hospital, Southern Medical University, Guangzhou 510515, China
| | - Dong Wang
- Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, Guangzhou 510515, China.,Dermatology Hospital, Southern Medical University, Guangzhou, 510091, China.,Department of Bioinformatics, Fujian Key Laboratory of Medical Bioinformatics, School of Medical Technology and Engineering, Fujian Medical University, Fuzhou, 350122, China
| |
Collapse
|
48
|
Martinez TF, Lyons-Abbott S, Bookout AL, De Souza EV, Donaldson C, Vaughan JM, Lau C, Abramov A, Baquero AF, Baquero K, Friedrich D, Huard J, Davis R, Kim B, Koch T, Mercer AJ, Misquith A, Murray SA, Perry S, Pino LK, Sanford C, Simon A, Zhang Y, Zipp G, Bizarro CV, Shokhirev MN, Whittle AJ, Searle BC, MacCoss MJ, Saghatelian A, Barnes CA. Profiling mouse brown and white adipocytes to identify metabolically relevant small ORFs and functional microproteins. Cell Metab 2023; 35:166-183.e11. [PMID: 36599300 PMCID: PMC9889109 DOI: 10.1016/j.cmet.2022.12.004] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/15/2022] [Revised: 09/19/2022] [Accepted: 12/06/2022] [Indexed: 01/05/2023]
Abstract
Microproteins (MPs) are a potentially rich source of uncharacterized metabolic regulators. Here, we use ribosome profiling (Ribo-seq) to curate 3,877 unannotated MP-encoding small ORFs (smORFs) in primary brown, white, and beige mouse adipocytes. Of these, we validated 85 MPs by proteomics, including 33 circulating MPs in mouse plasma. Analyses of MP-encoding mRNAs under different physiological conditions (high-fat diet) revealed that numerous MPs are regulated in adipose tissue in vivo and are co-expressed with established metabolic genes. Furthermore, Ribo-seq provided evidence for the translation of Gm8773, which encodes a secreted MP that is homologous to human and chicken FAM237B. Gm8773 is highly expressed in the arcuate nucleus of the hypothalamus, and intracerebroventricular administration of recombinant mFAM237B showed orexigenic activity in obese mice. Together, these data highlight the value of this adipocyte MP database in identifying MPs with roles in fundamental metabolic and physiological processes such as feeding.
Collapse
Affiliation(s)
- Thomas F Martinez
- Department of Pharmaceutical Sciences, Department of Biological Chemistry, Chao Family Comprehensive Cancer Center, University of California, Irvine, Irvine, CA, USA
| | | | - Angie L Bookout
- Novo Nordisk Research Center Seattle, Inc., Seattle, WA, USA
| | - Eduardo V De Souza
- Centro de Pesquisas em Biologia Molecular e Funcional (CPBMF) and Instituto Nacional de Ciência e Tecnologia em Tuberculose (INCT-TB), Pontifícia Universidade Católica do Rio Grande do Sul (PUCRS), Porto Alegre, Brazil; Programa de Pós-Graduação em Biologia Celular e Molecular, Pontifícia Universidade Católica do Rio Grande do Sul, Porto Alegre, Rio Grande do Sul 90616-900, Brazil; Clayton Foundation Laboratories for Peptide Biology, Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Cynthia Donaldson
- Clayton Foundation Laboratories for Peptide Biology, Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Joan M Vaughan
- Clayton Foundation Laboratories for Peptide Biology, Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Calvin Lau
- Clayton Foundation Laboratories for Peptide Biology, Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Ariel Abramov
- Novo Nordisk Research Center Seattle, Inc., Seattle, WA, USA
| | - Arian F Baquero
- Novo Nordisk Research Center Seattle, Inc., Seattle, WA, USA
| | - Karalee Baquero
- Novo Nordisk Research Center Seattle, Inc., Seattle, WA, USA
| | - Dave Friedrich
- Novo Nordisk Research Center Seattle, Inc., Seattle, WA, USA
| | - Justin Huard
- Novo Nordisk Research Center Seattle, Inc., Seattle, WA, USA
| | - Ray Davis
- Novo Nordisk Research Center Seattle, Inc., Seattle, WA, USA
| | - Bong Kim
- Novo Nordisk Research Center Seattle, Inc., Seattle, WA, USA
| | - Ty Koch
- Novo Nordisk Research Center Seattle, Inc., Seattle, WA, USA
| | - Aaron J Mercer
- Novo Nordisk Research Center Seattle, Inc., Seattle, WA, USA
| | - Ayesha Misquith
- Novo Nordisk Research Center Seattle, Inc., Seattle, WA, USA
| | - Sara A Murray
- Novo Nordisk Research Center Seattle, Inc., Seattle, WA, USA
| | - Sakara Perry
- Novo Nordisk Research Center Seattle, Inc., Seattle, WA, USA
| | - Lindsay K Pino
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | | | - Alex Simon
- Novo Nordisk Research Center Seattle, Inc., Seattle, WA, USA
| | - Yu Zhang
- Novo Nordisk Research Center Seattle, Inc., Seattle, WA, USA
| | - Garrett Zipp
- Novo Nordisk Research Center Seattle, Inc., Seattle, WA, USA
| | - Cristiano V Bizarro
- Centro de Pesquisas em Biologia Molecular e Funcional (CPBMF) and Instituto Nacional de Ciência e Tecnologia em Tuberculose (INCT-TB), Pontifícia Universidade Católica do Rio Grande do Sul (PUCRS), Porto Alegre, Brazil; Programa de Pós-Graduação em Biologia Celular e Molecular, Pontifícia Universidade Católica do Rio Grande do Sul, Porto Alegre, Rio Grande do Sul 90616-900, Brazil
| | - Maxim N Shokhirev
- Razavi Newman Integrative Genomics and Bioinformatics Core, Salk Institute for Biological Studies, La Jolla, CA, USA
| | | | - Brian C Searle
- Department of Biomedical Informatics, The Ohio State University, Columbus, OH, USA
| | - Michael J MacCoss
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Alan Saghatelian
- Clayton Foundation Laboratories for Peptide Biology, Salk Institute for Biological Studies, La Jolla, CA, USA.
| | - Christopher A Barnes
- Novo Nordisk Research Center Seattle, Inc., Seattle, WA, USA; Velia Therapeutics, Inc., San Diego, CA, USA.
| |
Collapse
|
49
|
Álvarez-Urdiola R, Borràs E, Valverde F, Matus JT, Sabidó E, Riechmann JL. Peptidomics Methods Applied to the Study of Flower Development. Methods Mol Biol 2023; 2686:509-536. [PMID: 37540375 DOI: 10.1007/978-1-0716-3299-4_24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/05/2023]
Abstract
Understanding the global and dynamic nature of plant developmental processes requires not only the study of the transcriptome, but also of the proteome, including its largely uncharacterized peptidome fraction. Recent advances in proteomics and high-throughput analyses of translating RNAs (ribosome profiling) have begun to address this issue, evidencing the existence of novel, uncharacterized, and possibly functional peptides. To validate the accumulation in tissues of sORF-encoded polypeptides (SEPs), the basic setup of proteomic analyses (i.e., LC-MS/MS) can be followed. However, the detection of peptides that are small (up to ~100 aa, 6-7 kDa) and novel (i.e., not annotated in reference databases) presents specific challenges that need to be addressed both experimentally and with computational biology resources. Several methods have been developed in recent years to isolate and identify peptides from plant tissues. In this chapter, we outline two different peptide extraction protocols and the subsequent peptide identification by mass spectrometry using the database search or the de novo identification methods.
Collapse
Affiliation(s)
- Raquel Álvarez-Urdiola
- Centre for Research in Agricultural Genomics (CRAG) CSIC-IRTA-UAB-UB, Edifici CRAG, Campus UAB, Cerdanyola del Vallès, Barcelona, Spain
| | - Eva Borràs
- Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology, Barcelona, Spain
- Universitat Pompeu Fabra, Barcelona, Spain
| | - Federico Valverde
- Institute for Plant Biochemistry and Photosynthesis CSIC - University of Seville, Seville, Spain
| | - José Tomás Matus
- Centre for Research in Agricultural Genomics (CRAG) CSIC-IRTA-UAB-UB, Edifici CRAG, Campus UAB, Cerdanyola del Vallès, Barcelona, Spain
- Institute for Integrative Systems Biology (I2SysBio), Universitat de València-CSIC, Paterna, Valencia, Spain
| | - Eduard Sabidó
- Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology, Barcelona, Spain
- Universitat Pompeu Fabra, Barcelona, Spain
| | - José Luis Riechmann
- Centre for Research in Agricultural Genomics (CRAG) CSIC-IRTA-UAB-UB, Edifici CRAG, Campus UAB, Cerdanyola del Vallès, Barcelona, Spain.
- Institució Catalana de Recerca i Estudis Avançats (ICREA), Barcelona, Spain.
| |
Collapse
|
50
|
Chothani S, Ho L, Schafer S, Rackham O. Discovering microproteins: making the most of ribosome profiling data. RNA Biol 2023; 20:943-954. [PMID: 38013207 PMCID: PMC10730196 DOI: 10.1080/15476286.2023.2279845] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/30/2023] [Indexed: 11/29/2023] Open
Abstract
Building a reference set of protein-coding open reading frames (ORFs) has revolutionized biological process discovery and understanding. Traditionally, gene models have been confirmed using cDNA sequencing and encoded translated regions inferred using sequence-based detection of start and stop combinations longer than 100 amino-acids to prevent false positives. This has led to small ORFs (smORFs) and their encoded proteins left un-annotated. Ribo-seq allows deciphering translated regions from untranslated irrespective of the length. In this review, we describe the power of Ribo-seq data in detection of smORFs while discussing the major challenge posed by data-quality, -depth and -sparseness in identifying the start and end of smORF translation. In particular, we outline smORF cataloguing efforts in humans and the large differences that have arisen due to variation in data, methods and assumptions. Although current versions of smORF reference sets can already be used as a powerful tool for hypothesis generation, we recommend that future editions should consider these data limitations and adopt unified processing for the community to establish a canonical catalogue of translated smORFs.
Collapse
Affiliation(s)
- Sonia Chothani
- Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore, Singapore
| | - Lena Ho
- Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore, Singapore
| | - Sebastian Schafer
- Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore, Singapore
| | - Owen Rackham
- Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore, Singapore
- School of Biological Sciences, University of Southampton, Southampton, UK
- The Alan Turing Institute, The British Library, London, UK
| |
Collapse
|