1
Tufail MA, Jordan B, Hadjeras L, Gelhausen R, Cassidy L, Habenicht T, Gutt M, Hellwig L, Backofen R, Tholey A, Sharma CM, Schmitz RA. Uncovering the small proteome of Methanosarcina mazei using Ribo-seq and peptidomics under different nitrogen conditions. Nat Commun 2024; 15:8659. [PMID: 39370430 DOI: 10.1038/s41467-024-53008-8] [Received: 10/08/2023] [Accepted: 09/25/2024] [Indexed: 10/08/2024]
Abstract
The mesophilic methanogenic archaeal model organism Methanosarcina mazei strain Gö1 is crucial for climate and environmental research due to its ability to produce methane. Here, we establish a Ribo-seq protocol for M. mazei strain Gö1 under two growth conditions (nitrogen sufficiency and limitation). The translation of 93 previously annotated and 314 unannotated small ORFs, coding for proteins ≤ 70 amino acids, is predicted with high confidence based on Ribo-seq data. LC-MS analysis validates the translation for 62 annotated small ORFs and 26 unannotated small ORFs. Epitope tagging followed by immunoblotting analysis confirms the translation of 13 out of 16 selected unannotated small ORFs. A comprehensive differential transcription and translation analysis reveals that 29 of 314 unannotated small ORFs are differentially regulated in response to nitrogen availability at the transcriptional and 49 at the translational level. A high number of reported small RNAs are emerging as dual-function RNAs, including sRNA154, the central regulatory small RNA of nitrogen metabolism. Several unannotated small ORFs are conserved in Methanosarcina species and overproducing several (small ORF encoded) small proteins suggests key physiological functions. Overall, the comprehensive analysis opens an avenue to elucidate the function(s) of multitudinous small proteins and dual-function RNAs in M. mazei.
Affiliation(s)
- Britta Jordan
- Institute for General Microbiology, Kiel University, 24118, Kiel, Germany
- Lydia Hadjeras
- Institute of Molecular Infection Biology, University of Würzburg, 97080, Würzburg, Germany
- Rick Gelhausen
- Bioinformatics Group, Department of Computer Science, University of Freiburg, 79110, Freiburg, Germany
- Liam Cassidy
- Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Kiel University, 24105, Kiel, Germany
- Tim Habenicht
- Institute for General Microbiology, Kiel University, 24118, Kiel, Germany
- Miriam Gutt
- Institute for General Microbiology, Kiel University, 24118, Kiel, Germany
- Lisa Hellwig
- Institute for General Microbiology, Kiel University, 24118, Kiel, Germany
- Rolf Backofen
- Bioinformatics Group, Department of Computer Science, University of Freiburg, 79110, Freiburg, Germany
- Andreas Tholey
- Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Kiel University, 24105, Kiel, Germany
- Cynthia M Sharma
- Institute of Molecular Infection Biology, University of Würzburg, 97080, Würzburg, Germany
- Ruth A Schmitz
- Institute for General Microbiology, Kiel University, 24118, Kiel, Germany.
2
Wang Z, Jia X, Sun W, Wang M, Yuan Q, Xu T, Liu Y, Chen Z, Huang M, Ji N, Zhang M. A micropeptide TREMP encoded by lincR-PPP2R5C promotes Th2 cell differentiation by interacting with PYCR1 in allergic airway inflammation. Allergol Int 2024; 73:587-602. [PMID: 39025723 DOI: 10.1016/j.alit.2024.04.004] [Received: 12/13/2023] [Revised: 03/22/2024] [Accepted: 04/03/2024] [Indexed: 07/20/2024]
Abstract
BACKGROUND Allergic asthma is largely dominated by Th2 lymphocytes. The roles of micropeptides in Th2 cells and asthma remain uncharacterized. Here, we aimed to demonstrate the role of a micropeptide, T-cell regulatory micropeptide (TREMP), in Th2 cell differentiation in asthma.
METHODS TREMP translated from lincR-PPP2R5C was validated using Western blotting and mass spectrometry. TREMP knockout mice were generated using CRISPR/Cas9. Coimmunoprecipitation revealed that TREMP targeted pyrroline-5-carboxylate reductase 1 (PYCR1), which was further explored in vitro and in vivo. The levels of TREMP and PYCR1 in Th2 cells from clinical samples were determined by flow cytometry.
RESULTS TREMP, encoded by lincR-PPP2R5C, localized to the mitochondrion. The lentivirus encoding TREMP promoted Th2 cell differentiation. In contrast, Th2 differentiation was suppressed in TREMP-/- CD4+ T cells. In the HDM-induced model of allergic airway inflammation, TREMP was increased in pulmonary tissues. Allergic airway inflammation was relieved in TREMP-/- mice treated with HDM. Mechanistically, TREMP interacted with PYCR1, which regulated Th2 differentiation via glycolysis. Glycolysis was decreased in Th2 cells from TREMP-/- mice and PYCR1-/- mice. Similar to TREMP-/- mice, allergic airway inflammation was mitigated in HDM-challenged PYCR1-/- mice. Moreover, the levels of TREMP and PYCR1 in Th2 cells were significantly higher in asthmatic patients than in healthy controls.
CONCLUSIONS The micropeptide TREMP encoded by lincR-PPP2R5C promoted Th2 differentiation in allergic airway inflammation by interacting with PYCR1 and enhancing glycolysis. Our findings highlight the importance of neglected micropeptides from noncoding RNAs in allergic diseases.
Affiliation(s)
- Zhengxia Wang
- Department of Respiratory and Critical Care Medicine, The First Affiliated Hospital of Nanjing Medical University, Nanjing, China
- Xinyu Jia
- Department of Respiratory and Critical Care Medicine, The First Affiliated Hospital of Nanjing Medical University, Nanjing, China
- Wei Sun
- Department of Respiratory and Critical Care Medicine, Xishan People's Hospital of Wuxi City, Wuxi Branch of Zhongda Hospital Affiliate to Southeast University, Wuxi, China
- Min Wang
- Department of Respiratory and Critical Care Medicine, The First Affiliated Hospital of Nanjing Medical University, Nanjing, China
- Qi Yuan
- Department of Respiratory and Critical Care Medicine, The First Affiliated Hospital of Nanjing Medical University, Nanjing, China
- Tingting Xu
- Department of Respiratory and Critical Care Medicine, The First Affiliated Hospital of Nanjing Medical University, Nanjing, China
- Yanan Liu
- Department of Respiratory and Critical Care Medicine, The First Affiliated Hospital of Nanjing Medical University, Nanjing, China; Department of Respiratory and Critical Care Medicine, The Affiliated Hospital of Xuzhou Medical University, Xuzhou, China
- Zhongqi Chen
- Department of Respiratory and Critical Care Medicine, The First Affiliated Hospital of Nanjing Medical University, Nanjing, China
- Mao Huang
- Department of Respiratory and Critical Care Medicine, The First Affiliated Hospital of Nanjing Medical University, Nanjing, China.
- Ningfei Ji
- Department of Respiratory and Critical Care Medicine, The First Affiliated Hospital of Nanjing Medical University, Nanjing, China.
- Mingshun Zhang
- NHC Key Laboratory of Antibody Technique, Department of Immunology, Nanjing Medical University, Nanjing, China.
3
Das D, Podder S. Microscale marvels: unveiling the macroscopic significance of micropeptides in human health. Brief Funct Genomics 2024; 23:624-638. [PMID: 38706311 DOI: 10.1093/bfgp/elae018] [Received: 01/04/2024] [Revised: 04/07/2024] [Accepted: 04/15/2024] [Indexed: 05/07/2024]
Abstract
Non-coding RNAs encode micropeptides from small open reading frames located within the transcript. These micropeptides are involved in a variety of functions within the body and are emerging as a missing piece of the puzzle for complex biomolecular signaling pathways. Recent studies highlight the pivotal role of small peptides in regulating important biological processes such as DNA repair, gene expression, muscle regeneration and immune responses. Conversely, altered expression of micropeptides contributes to the progression of various diseases, such as cardiovascular diseases, neurological disorders and several types of cancer, including colorectal, hepatocellular and lung cancer. This review delves into the dual impact of micropeptides on health and pathology, exploring their role in preserving normal physiological homeostasis and probing their involvement in the triggering and progression of diseases.
Affiliation(s)
- Deepyaman Das
- Computational and Systems Biology Laboratory, Department of Microbiology, Raiganj University, Raiganj, Uttar Dinajpur, West Bengal-733134, India
- Soumita Podder
- Computational and Systems Biology Laboratory, Department of Microbiology, Raiganj University, Raiganj, Uttar Dinajpur, West Bengal-733134, India
4
Whited AM, Jungreis I, Allen J, Cleveland CL, Mudge JM, Kellis M, Rinn JL, Hough LE. Biophysical characterization of high-confidence, small human proteins. Biophys Rep 2024; 4:100167. [PMID: 38909903 PMCID: PMC11305224 DOI: 10.1016/j.bpr.2024.100167] [Received: 01/29/2024] [Revised: 04/09/2024] [Accepted: 06/20/2024] [Indexed: 06/25/2024]
Abstract
Significant efforts have been made to characterize the biophysical properties of proteins. Small proteins have received less attention because their annotation has historically been less reliable. However, recent improvements in sequencing, proteomics, and bioinformatics techniques have led to the high-confidence annotation of small open reading frames (smORFs) that encode for functional proteins, producing smORF-encoded proteins (SEPs). SEPs have been found to perform critical functions in several species, including humans. While significant efforts have been made to annotate SEPs, less attention has been given to the biophysical properties of these proteins. We characterized the distributions of predicted and curated biophysical properties, including sequence composition, structure, localization, function, and disease association of a conservative list of previously identified human SEPs. We found significant differences between SEPs and both larger proteins and control sets. In addition, we provide an example of how our characterization of biophysical properties can contribute to distinguishing protein-coding smORFs from noncoding ones in otherwise ambiguous cases.
Affiliation(s)
- A M Whited
- BioFrontiers Institute, University of Colorado, Boulder, Colorado
- Irwin Jungreis
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts; MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, Massachusetts
- Jeffre Allen
- BioFrontiers Institute, University of Colorado, Boulder, Colorado; Department of Biochemistry, University of Colorado Boulder, Boulder, Colorado
- Jonathan M Mudge
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, United Kingdom
- Manolis Kellis
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts; MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, Massachusetts
- John L Rinn
- BioFrontiers Institute, University of Colorado, Boulder, Colorado; Department of Biochemistry, University of Colorado Boulder, Boulder, Colorado
- Loren E Hough
- BioFrontiers Institute, University of Colorado, Boulder, Colorado; Department of Physics, University of Colorado Boulder, Boulder, Colorado.
5
Duan Y, Santos-Júnior CD, Schmidt TS, Fullam A, de Almeida BLS, Zhu C, Kuhn M, Zhao XM, Bork P, Coelho LP. A catalog of small proteins from the global microbiome. Nat Commun 2024; 15:7563. [PMID: 39214983 PMCID: PMC11364881 DOI: 10.1038/s41467-024-51894-6] [Received: 01/15/2024] [Accepted: 08/19/2024] [Indexed: 09/04/2024]
Abstract
Small open reading frames (smORFs) shorter than 100 codons are widespread and perform essential roles in microorganisms, where they encode proteins active in several cell functions, including signal pathways, stress response, and antibacterial activities. However, the ecology, distribution and role of small proteins in the global microbiome remain unknown. Here, we construct a global microbial smORFs catalog (GMSC) derived from 63,410 publicly available metagenomes across 75 distinct habitats and 87,920 high-quality isolate genomes. GMSC contains 965 million non-redundant smORFs with comprehensive annotations. We find that archaea harbor more smORFs proportionally than bacteria. We moreover provide a tool called GMSC-mapper to identify and annotate small proteins from microbial (meta)genomes. Overall, this publicly-available resource demonstrates the immense and underexplored diversity of small proteins.
Affiliation(s)
- Yiqian Duan
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, China
- Célio Dias Santos-Júnior
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, China
- Laboratory of Microbial Processes & Biodiversity - LMPB; Department of Hydrobiology, Universidade Federal de São Carlos - UFSCar, São Carlos, São Paulo, Brazil
- Thomas Sebastian Schmidt
- Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany
- APC Microbiome and School of Medicine, University College Cork, Cork, Ireland
- Anthony Fullam
- Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany
- Breno L S de Almeida
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, China
- Chengkai Zhu
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, China
- Michael Kuhn
- Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany
- Xing-Ming Zhao
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, China.
- Department of Neurology, Zhongshan Hospital, Fudan University, Shanghai, China.
- Lingang Laboratory, Shanghai, 200031, China.
- State Key Laboratory of Medical Neurobiology, Institutes of Brain Science, Fudan University, Shanghai, China.
- MOE Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence, and MOE Frontiers Center for Brain Science, Fudan University, Shanghai, China.
- Peer Bork
- Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany
- Max Delbrück Centre for Molecular Medicine, Berlin, Germany
- Department of Bioinformatics, Biocenter, University of Würzburg, Würzburg, Germany
- Luis Pedro Coelho
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, China.
- Centre for Microbiome Research, School of Biomedical Sciences, Queensland University of Technology, Translational Research Institute, Woolloongabba, QLD, Australia.
- Centre for Data Science, Queensland University of Technology, Brisbane, QLD, Australia.
6
Kafida M, Karela M, Giakountis A. RNA-Independent Regulatory Functions of lncRNA in Complex Disease. Cancers (Basel) 2024; 16:2728. [PMID: 39123456 PMCID: PMC11311644 DOI: 10.3390/cancers16152728] [Received: 07/06/2024] [Revised: 07/28/2024] [Accepted: 07/30/2024] [Indexed: 08/12/2024]
Abstract
During the metagenomics era, high-throughput sequencing efforts both in mice and humans indicate that non-coding RNAs (ncRNAs) constitute a significant fraction of the transcribed genome. During the past decades, the regulatory role of these non-coding transcripts along with their interactions with other molecules have been extensively characterized. However, the study of long non-coding RNAs (lncRNAs), an ncRNA regulatory class with transcript lengths that exceed 200 nucleotides, revealed that certain non-coding transcripts are transcriptional "by-products", while their loci exert their downstream regulatory functions through RNA-independent mechanisms. Such mechanisms include, but are not limited to, chromatin interactions and complex promoter-enhancer competition schemes that involve the underlying ncRNA locus with or without its nascent transcription, mediating significant or even exclusive roles in the regulation of downstream target genes in mammals. Interestingly, such RNA-independent mechanisms often drive pathological manifestations, including oncogenesis. In this review, we summarize selective examples of lncRNAs that regulate target genes independently of their produced transcripts.
Affiliation(s)
- Antonis Giakountis
- Department of Biochemistry and Biotechnology, University of Thessaly, Biopolis, Mezourlo, 41500 Larissa, Greece
7
Stachowiak L, Kraczkowska W, Świercz A, Jagodziński PP. Circulating non-coding RNA in type 1 diabetes mellitus as a source of potential biomarkers - An emerging role of sex difference. Biochem Biophys Res Commun 2024; 736:150482. [PMID: 39121670 DOI: 10.1016/j.bbrc.2024.150482] [Received: 06/19/2024] [Revised: 07/30/2024] [Accepted: 07/30/2024] [Indexed: 08/12/2024]
Abstract
Non-coding RNAs (ncRNAs), such as microRNA, long non-coding RNA, and circular RNA, are considered essential regulatory molecules mediating many cellular processes. Moreover, an increasing number of studies have investigated the role of ncRNAs in cancers and various metabolic disorders, including diabetes mellitus. Interestingly, some circulating ncRNA detected in body fluids may serve as novel biomarkers. There is still a lack of conventional biomarkers that detect the early stage of type 1 diabetes mellitus. Many circulating microRNA, long non-coding RNA, and circular RNA show aberrant expression in type 1 diabetes patients compared to healthy individuals. However, most studies have focused on circulating microRNA rather than long non-coding RNA or circular RNA. In addition, a few studies have evaluated sex differences in ncRNA biomarkers. Therefore, this article summarises current knowledge about circulating ncRNAs as potential biomarkers for type 1 diabetes and explores the effects of sex on such biomarkers.
Affiliation(s)
- Lucyna Stachowiak
- Department of Biochemistry and Molecular Biology, Poznań University of Medical Sciences, Święcickiego 6 street, 60-781, Poznań, Poland.
- Weronika Kraczkowska
- Department of Biochemistry and Molecular Biology, Poznań University of Medical Sciences, Święcickiego 6 street, 60-781, Poznań, Poland.
- Aleksandra Świercz
- Institute of Computing Science, Poznan University of Technology, Piotrowo 2 street, 60-965, Poznań, Poland; Institute of Bioorganic Chemistry, Polish Academy of Sciences, Noskowskiego 12/14 street, 61-704, Poznań, Poland.
- Paweł Piotr Jagodziński
- Department of Biochemistry and Molecular Biology, Poznań University of Medical Sciences, Święcickiego 6 street, 60-781, Poznań, Poland.
8
Pereira IT, Gomes-Júnior R, Hansel-Frose A, França RSV, Liu M, Soliman HAN, Chan SSK, Dudley SC, Kyba M, Dallagiovanna B. Cardiac Development Long Non-Coding RNA (CARDEL) Is Activated during Human Heart Development and Contributes to Cardiac Specification and Homeostasis. Cells 2024; 13:1050. [PMID: 38920678 PMCID: PMC11201801 DOI: 10.3390/cells13121050] [Received: 03/15/2024] [Revised: 06/04/2024] [Accepted: 06/06/2024] [Indexed: 06/27/2024]
Abstract
Successful heart development depends on the careful orchestration of a network of transcription factors and signaling pathways. In recent years, in vitro cardiac differentiation using human pluripotent stem cells (hPSCs) has been used to uncover the intricate gene-network regulation involved in the proper formation and function of the human heart. Here, we searched for uncharacterized cardiac-development genes by combining a temporal evaluation of human cardiac specification in vitro with an analysis of gene expression in fetal and adult heart tissue. We discovered that CARDEL (CARdiac DEvelopment Long non-coding RNA; LINC00890; SERTM2) expression coincides with the commitment to the cardiac lineage. CARDEL knockout hPSCs differentiated poorly into cardiac cells, and hPSC-derived cardiomyocytes showed faster beating rates after controlled overexpression of CARDEL during differentiation. Altogether, we provide physiological and molecular evidence that CARDEL expression contributes to sculpting the cardiac program during cell-fate commitment.
Affiliation(s)
- Isabela T. Pereira
- Basic Stem Cell Biology Laboratory, Instituto Carlos Chagas-FIOCRUZ-PR, Curitiba 81350-010, PR, Brazil
- Rubens Gomes-Júnior
- Basic Stem Cell Biology Laboratory, Instituto Carlos Chagas-FIOCRUZ-PR, Curitiba 81350-010, PR, Brazil
- Aruana Hansel-Frose
- Basic Stem Cell Biology Laboratory, Instituto Carlos Chagas-FIOCRUZ-PR, Curitiba 81350-010, PR, Brazil
- Rhaíza S. V. França
- Basic Stem Cell Biology Laboratory, Instituto Carlos Chagas-FIOCRUZ-PR, Curitiba 81350-010, PR, Brazil
- Man Liu
- Department of Medicine, Division of Cardiology, University of Minnesota, Minneapolis, MN 55455, USA
- Lillehei Heart Institute, University of Minnesota, Minneapolis, MN 55455, USA
- Hossam A. N. Soliman
- Lillehei Heart Institute, University of Minnesota, Minneapolis, MN 55455, USA
- Department of Pediatrics, University of Minnesota, Minneapolis, MN 55455, USA
- Sunny S. K. Chan
- Lillehei Heart Institute, University of Minnesota, Minneapolis, MN 55455, USA
- Department of Pediatrics, University of Minnesota, Minneapolis, MN 55455, USA
- Stem Cell Institute, University of Minnesota, Minneapolis, MN 55455, USA
- Samuel C. Dudley
- Department of Medicine, Division of Cardiology, University of Minnesota, Minneapolis, MN 55455, USA
- Lillehei Heart Institute, University of Minnesota, Minneapolis, MN 55455, USA
- Michael Kyba
- Lillehei Heart Institute, University of Minnesota, Minneapolis, MN 55455, USA
- Department of Pediatrics, University of Minnesota, Minneapolis, MN 55455, USA
- Bruno Dallagiovanna
- Basic Stem Cell Biology Laboratory, Instituto Carlos Chagas-FIOCRUZ-PR, Curitiba 81350-010, PR, Brazil
9
Duffy EE, Assad EG, Kalish BT, Greenberg ME. Small but mighty: the rise of microprotein biology in neuroscience. Front Mol Neurosci 2024; 17:1386219. [PMID: 38807924 PMCID: PMC11130481 DOI: 10.3389/fnmol.2024.1386219] [Received: 02/14/2024] [Accepted: 04/30/2024] [Indexed: 05/30/2024]
Abstract
The mammalian central nervous system coordinates a network of signaling pathways and cellular interactions, which enable a myriad of complex cognitive and physiological functions. While traditional efforts to understand the molecular basis of brain function have focused on well-characterized proteins, recent advances in high-throughput translatome profiling have revealed a staggering number of proteins translated from non-canonical open reading frames (ncORFs) such as 5' and 3' untranslated regions of annotated proteins, out-of-frame internal ORFs, and previously annotated non-coding RNAs. Of note, microproteins < 100 amino acids (AA) that are translated from such ncORFs have often been neglected due to computational and biochemical challenges. Thousands of putative microproteins have been identified in cell lines and tissues including the brain, with some serving critical biological functions. In this perspective, we highlight the recent discovery of microproteins in the brain and describe several hypotheses that have emerged concerning microprotein function in the developing and mature nervous system.
Affiliation(s)
- Erin E. Duffy
- Department of Neurobiology, Harvard Medical School, Boston, MA, United States
- Elena G. Assad
- Department of Neurobiology, Harvard Medical School, Boston, MA, United States
- Brian T. Kalish
- Program in Neuroscience and Mental Health, SickKids Research Institute, Toronto, ON, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
- Division of Neonatology, Department of Paediatrics, Hospital for Sick Children, Toronto, ON, Canada
10
Mably JD, Wang DZ. Long non-coding RNAs in cardiac hypertrophy and heart failure: functions, mechanisms and clinical prospects. Nat Rev Cardiol 2024; 21:326-345. [PMID: 37985696 PMCID: PMC11031336 DOI: 10.1038/s41569-023-00952-5] [Accepted: 10/16/2023] [Indexed: 11/22/2023]
Abstract
The surge in reports describing non-coding RNAs (ncRNAs) has focused attention on their possible biological roles and effects on development and disease. ncRNAs have been touted as previously uncharacterized regulators of gene expression and cellular processes, possibly working to fine-tune these functions. The sheer number of ncRNAs identified has outpaced the capacity to characterize each molecule thoroughly and to reliably establish its clinical relevance; it has, nonetheless, created excitement about their potential as molecular targets for novel therapeutic approaches to treat human disease. In this Review, we focus on one category of ncRNAs - long non-coding RNAs - and their expression, functions and molecular mechanisms in cardiac hypertrophy and heart failure. We further discuss the prospects for this specific class of ncRNAs as novel targets for the diagnosis and treatment of these conditions.
Affiliation(s)
- John D Mably
- Center for Regenerative Medicine, Morsani College of Medicine, University of South Florida, Tampa, FL, USA
- USF Health Heart Institute, Morsani College of Medicine, University of South Florida, Tampa, FL, USA
- Department of Internal Medicine, Morsani College of Medicine, University of South Florida, Tampa, FL, USA
- Da-Zhi Wang
- Center for Regenerative Medicine, Morsani College of Medicine, University of South Florida, Tampa, FL, USA.
- USF Health Heart Institute, Morsani College of Medicine, University of South Florida, Tampa, FL, USA.
- Department of Internal Medicine, Morsani College of Medicine, University of South Florida, Tampa, FL, USA.
- Department of Molecular Pharmacology and Physiology, Morsani College of Medicine, University of South Florida, Tampa, FL, USA.
11
Whited AM, Jungreis I, Allen J, Cleveland CL, Mudge JM, Kellis M, Rinn JL, Hough LE. Biophysical characterization of high-confidence, small human proteins. bioRxiv 2024:2024.04.12.589296. [PMID: 38659920 PMCID: PMC11042228 DOI: 10.1101/2024.04.12.589296] [Indexed: 04/26/2024]
Abstract
Significant efforts have been made to characterize the biophysical properties of proteins. Small proteins have received less attention because their annotation has historically been less reliable. However, recent improvements in sequencing, proteomics, and bioinformatics techniques have led to the high-confidence annotation of small open reading frames (smORFs) that encode for functional proteins, producing smORF-encoded proteins (SEPs). SEPs have been found to perform critical functions in several species, including humans. While significant efforts have been made to annotate SEPs, less attention has been given to the biophysical properties of these proteins. We characterized the distributions of predicted and curated biophysical properties, including sequence composition, structure, localization, function, and disease association of a conservative list of previously identified human SEPs. We found significant differences between SEPs and both larger proteins and control sets. Additionally, we provide an example of how our characterization of biophysical properties can contribute to distinguishing protein-coding smORFs from non-coding ones in otherwise ambiguous cases.
Affiliation(s)
- A M Whited
- BioFrontiers Institute, University of Colorado, Boulder, CO, USA
- Irwin Jungreis
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, MA, USA
- Jeffre Allen
- BioFrontiers Institute, University of Colorado, Boulder, CO, USA
- Department of Biochemistry, University of Colorado Boulder, CO, USA
- Jonathan M Mudge
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
- Manolis Kellis
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, MA, USA
- John L Rinn
- BioFrontiers Institute, University of Colorado, Boulder, CO, USA
- Department of Biochemistry, University of Colorado Boulder, CO, USA
- Loren E Hough
- BioFrontiers Institute, University of Colorado, Boulder, CO, USA
- Department of Physics, University of Colorado Boulder, CO, USA
12
Valdivia-Francia F, Sendoel A. No country for old methods: New tools for studying microproteins. iScience 2024; 27:108972. [PMID: 38333695 PMCID: PMC10850755 DOI: 10.1016/j.isci.2024.108972] [Indexed: 02/10/2024]
Abstract
Microproteins encoded by small open reading frames (sORFs) have emerged as a fascinating frontier in genomics. Although sORFs were traditionally overlooked due to their small size, recent technological advancements such as ribosome profiling, mass spectrometry-based strategies and advanced computational approaches have led to the annotation of more than 7000 sORFs in the human genome. Despite this progress, only a tiny portion of these microproteins has been characterized, and an important challenge in the field lies in identifying functionally relevant microproteins and understanding their role in different cellular contexts. In this review, we explore the recent advancements in sORF research, focusing on the new methodologies and computational approaches that have facilitated their identification and functional characterization. Leveraging these new tools holds great promise for dissecting the diverse cellular roles of microproteins and will ultimately pave the way for understanding their role in the pathogenesis of diseases and identifying new therapeutic targets.
Affiliation(s)
- Fabiola Valdivia-Francia
  - University of Zurich, Institute for Regenerative Medicine (IREM), Wagistrasse 12, 8952 Schlieren-Zurich, Switzerland
  - Life Science Zurich Graduate School, Molecular Life Science Program, University of Zurich/ETH Zurich, Schlieren-Zurich, Switzerland
- Ataman Sendoel
  - University of Zurich, Institute for Regenerative Medicine (IREM), Wagistrasse 12, 8952 Schlieren-Zurich, Switzerland
13
Beiki H, Murdoch BM, Park CA, Kern C, Kontechy D, Becker G, Rincon G, Jiang H, Zhou H, Thorne J, Koltes JE, Michal JJ, Davenport K, Rijnkels M, Ross PJ, Hu R, Corum S, McKay S, Smith TPL, Liu W, Ma W, Zhang X, Xu X, Han X, Jiang Z, Hu ZL, Reecy JM. Enhanced bovine genome annotation through integration of transcriptomics and epi-transcriptomics datasets facilitates genomic biology. Gigascience 2024; 13:giae019. [PMID: 38626724 PMCID: PMC11020238 DOI: 10.1093/gigascience/giae019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/11/2023] [Revised: 07/29/2023] [Accepted: 03/27/2024] [Indexed: 04/18/2024] Open
Abstract
BACKGROUND: The accurate identification of the functional elements in the bovine genome is a fundamental requirement for high-quality analysis of data informing both genome biology and genomic selection. Functional annotation of the bovine genome was performed to identify a more complete catalog of transcript isoforms across bovine tissues.
RESULTS: A total of 160,820 unique transcripts (50% protein coding) representing 34,882 unique genes (60% protein coding) were identified across tissues. Among them, 118,563 transcripts (73% of the total) were structurally validated by independent datasets (PacBio isoform sequencing data, Oxford Nanopore Technologies sequencing data, de novo assembled transcripts from RNA sequencing data) and comparison with Ensembl and NCBI gene sets. In addition, all transcripts were supported by extensive data from different technologies such as whole transcriptome termini site sequencing, RNA Annotation and Mapping of Promoters for the Analysis of Gene Expression, chromatin immunoprecipitation sequencing, and assay for transposase-accessible chromatin using sequencing. A large proportion of identified transcripts (69%) were unannotated, of which 86% were produced by annotated genes and 14% by unannotated genes. A median of two 5' untranslated regions were expressed per gene. Around 50% of protein-coding genes in each tissue were bifunctional and transcribed both coding and noncoding isoforms. Furthermore, we identified 3,744 genes that functioned as noncoding genes in fetal tissues but as protein-coding genes in adult tissues. Our new bovine genome annotation extended more than 11,000 annotated gene borders compared to Ensembl or NCBI annotations. The resulting bovine transcriptome was integrated with publicly available quantitative trait loci data to study tissue-tissue interconnection involved in different traits and construct the first bovine trait similarity network.
CONCLUSIONS: These validated results show significant improvement over current bovine genome annotations.
Affiliation(s)
- Hamid Beiki
  - Department of Animal Science, Iowa State University, Ames, IA 50011, USA
- Brenda M Murdoch
  - Department of Animal and Veterinary and Food Science, University of Idaho, ID 83844, USA
- Carissa A Park
  - Department of Animal Science, Iowa State University, Ames, IA 50011, USA
- Chandlar Kern
  - Department of Animal Science, Pennsylvania State University, PA 16802, USA
- Denise Kontechy
  - Department of Animal and Veterinary and Food Science, University of Idaho, ID 83844, USA
- Gabrielle Becker
  - Department of Animal and Veterinary and Food Science, University of Idaho, ID 83844, USA
- Honglin Jiang
  - Department of Animal and Poultry Sciences, Virginia Tech, VA 24060, USA
- Huaijun Zhou
  - Department of Animal Science, University of California, Davis, CA 95616, USA
- Jacob Thorne
  - Department of Animal and Veterinary and Food Science, University of Idaho, ID 83844, USA
- James E Koltes
  - Department of Animal Science, Iowa State University, Ames, IA 50011, USA
- Jennifer J Michal
  - Department of Animal Science, Washington State University, WA 99164, USA
- Kimberly Davenport
  - Department of Animal and Veterinary and Food Science, University of Idaho, ID 83844, USA
- Monique Rijnkels
  - Department of Veterinary Integrative Biosciences, Texas A&M University, TX 77843, USA
- Pablo J Ross
  - Department of Animal Science, University of California, Davis, CA 95616, USA
- Rui Hu
  - Department of Animal and Poultry Sciences, Virginia Tech, VA 24060, USA
- Sarah Corum
  - Zoetis, Parsippany-Troy Hills, NJ 07054, USA
- Wansheng Liu
  - Department of Animal Science, Pennsylvania State University, PA 16802, USA
- Wenzhi Ma
  - Department of Animal Science, Pennsylvania State University, PA 16802, USA
- Xiaohui Zhang
  - Department of Animal Science, Washington State University, WA 99164, USA
- Xiaoqing Xu
  - Department of Animal Science, University of California, Davis, CA 95616, USA
- Xuelei Han
  - Department of Animal Science, Washington State University, WA 99164, USA
- Zhihua Jiang
  - Department of Animal Science, Washington State University, WA 99164, USA
- Zhi-Liang Hu
  - Department of Animal Science, Iowa State University, Ames, IA 50011, USA
- James M Reecy
  - Department of Animal Science, Iowa State University, Ames, IA 50011, USA
14
Lu Y, Ran Y, Li H, Wen J, Cui X, Zhang X, Guan X, Cheng M. Micropeptides: origins, identification, and potential role in metabolism-related diseases. J Zhejiang Univ Sci B 2023; 24:1106-1122. [PMID: 38057268 PMCID: PMC10710913 DOI: 10.1631/jzus.b2300128] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/24/2023] [Accepted: 06/06/2023] [Indexed: 12/08/2023]
Abstract
With the development of modern sequencing techniques and bioinformatics, genomes that were once thought to be noncoding have been found to encode abundant functional micropeptides (miPs), a class of small polypeptides. Although miPs are difficult to analyze and identify, a number of studies have begun to focus on them. A growing number of miPs have been shown to be essential for energy metabolism homeostasis, immune regulation, and tumor growth and development. Many reports have shown that miPs are especially important for regulating glucose and lipid metabolism and mitochondrial function. MiPs are also involved in the progression of related diseases. This paper reviews the sources and identification of miPs, as well as the functional significance of miPs for metabolism-related diseases, with the aim of revealing their potential clinical applications.
Affiliation(s)
- Min Cheng
  - School of Basic Medicine Sciences, Weifang Medical University, Weifang 261053, China.
15
Kore H, Datta KK, Nagaraj SH, Gowda H. Protein-coding potential of non-canonical open reading frames in human transcriptome. Biochem Biophys Res Commun 2023; 684:149040. [PMID: 37897910 DOI: 10.1016/j.bbrc.2023.09.068] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 07/13/2023] [Revised: 09/09/2023] [Accepted: 09/23/2023] [Indexed: 10/30/2023]
Abstract
In recent years, proteogenomics and ribosome profiling studies have identified a large number of proteins encoded by noncoding regions in the human genome. They are encoded by small open reading frames (sORFs) in the untranslated regions (UTRs) of mRNAs and long non-coding RNAs (lncRNAs). These sORF-encoded proteins (SEPs) are often shorter than 150 amino acids and show poor evolutionary conservation. A subset of them have been functionally characterized and shown to play an important role in fundamental biological processes including cardiac and muscle function, DNA repair, embryonic development and various human diseases. How many novel protein-coding regions exist in the human genome and what fraction of them are functionally important remains a mystery. In this review, we discuss current progress in unraveling SEPs, the approaches used for their identification, their limitations, and the reliability of these identifications. We also discuss functionally characterized SEPs and their involvement in various biological processes and diseases. Lastly, we provide insights into their distinctive features compared to canonical proteins and the challenges associated with annotating them in protein reference databases.
Affiliation(s)
- Hitesh Kore
  - Centre for Genomics and Personalised Health, Queensland University of Technology, Brisbane, Queensland, 4059, Australia; Cancer Precision Medicine Group, QIMR Berghofer Medical Research Institute, 300 Herston Road, Herston, Queensland, 4006, Australia; Faculty of Health, Queensland University of Technology, Brisbane, Queensland, 4059, Australia.
- Keshava K Datta
  - Proteomics and Metabolomics Platform, La Trobe University, Melbourne, VIC, 3083, Australia
- Shivashankar H Nagaraj
  - Centre for Genomics and Personalised Health, Queensland University of Technology, Brisbane, Queensland, 4059, Australia; Faculty of Health, Queensland University of Technology, Brisbane, Queensland, 4059, Australia
- Harsha Gowda
  - Centre for Genomics and Personalised Health, Queensland University of Technology, Brisbane, Queensland, 4059, Australia; Cancer Precision Medicine Group, QIMR Berghofer Medical Research Institute, 300 Herston Road, Herston, Queensland, 4006, Australia; Faculty of Health, Queensland University of Technology, Brisbane, Queensland, 4059, Australia; Faculty of Medicine, The University of Queensland, Queensland, 4072, Australia.
16
Wang J, Wang W, Ma F, Qian H. A hidden translatome in tumors-the coding lncRNAs. Sci China Life Sci 2023; 66:2755-2772. [PMID: 37154857 DOI: 10.1007/s11427-022-2289-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/08/2022] [Accepted: 12/29/2022] [Indexed: 05/10/2023]
Abstract
Long noncoding RNAs (lncRNAs) have been extensively identified in eukaryotic genomes and have been shown to play critical roles in the development of multiple cancers. Through the application and development of ribosome analysis and sequencing technologies, recent studies have discovered the translation of lncRNAs. Although lncRNAs were originally defined as noncoding RNAs, many lncRNAs actually contain small open reading frames that are translated into peptides. This opens a broad area for the functional investigation of lncRNAs. Here, we introduce prospective methods and databases for screening lncRNAs that encode functional polypeptides. We also summarize specific lncRNA-encoded proteins and the molecular mechanisms by which they promote or inhibit cancer. Importantly, lncRNA-encoded peptides/proteins hold promise in cancer research, but some challenges remain unresolved. This review covers reports on lncRNA-encoded peptides or proteins in cancer, aiming to provide a theoretical basis and related references to facilitate the discovery of more functional peptides encoded by lncRNAs, and to further develop new anti-cancer therapeutic targets as well as clinical biomarkers for diagnosis and prognosis.
Affiliation(s)
- Jinsong Wang
  - State Key Laboratory of Molecular Oncology, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, 100021, China
- Wenna Wang
  - State Key Laboratory of Molecular Oncology, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, 100021, China
  - Department of Medical Oncology, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, 100021, China
- Fei Ma
  - State Key Laboratory of Molecular Oncology, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, 100021, China.
  - Department of Medical Oncology, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, 100021, China.
- Haili Qian
  - State Key Laboratory of Molecular Oncology, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, 100021, China.
17
Mohsen JJ, Martel AA, Slavoff SA. Microproteins-Discovery, structure, and function. Proteomics 2023; 23:e2100211. [PMID: 37603371 PMCID: PMC10841188 DOI: 10.1002/pmic.202100211] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Received: 07/04/2023] [Revised: 08/03/2023] [Accepted: 08/10/2023] [Indexed: 08/22/2023]
Abstract
Advances in proteogenomic technologies have revealed hundreds to thousands of translated small open reading frames (sORFs) that encode microproteins in genomes across evolutionary space. While many microproteins have now been shown to play critical roles in biology and human disease, a majority of recently identified microproteins have little or no experimental evidence regarding their functionality. Computational tools have some limitations for analysis of short, poorly conserved microprotein sequences, so additional approaches are needed to determine the role of each member of this recently discovered polypeptide class. A currently underexplored avenue in the study of microproteins is structure prediction and determination, which delivers a depth of functional information. In this review, we provide a brief overview of microprotein discovery methods, then examine examples of microprotein structures (and, conversely, intrinsic disorder) that have been experimentally determined using crystallography, cryo-electron microscopy, and NMR, which provide insight into their molecular functions and mechanisms. Additionally, we discuss examples of predicted microprotein structures that have provided insight or context regarding their function. Analysis of microprotein structure at the angstrom level, and confirmation of predicted structures, therefore, has potential to identify translated microproteins that are of biological importance and to provide molecular mechanism for their in vivo roles.
Affiliation(s)
- Jessica J. Mohsen
  - Department of Chemistry, Yale University, New Haven, CT, USA
  - Institute of Biomolecular Design and Discovery, Yale University, West Haven, CT, USA
- Alina A. Martel
  - Institute of Biomolecular Design and Discovery, Yale University, West Haven, CT, USA
- Sarah A. Slavoff
  - Department of Chemistry, Yale University, New Haven, CT, USA
  - Institute of Biomolecular Design and Discovery, Yale University, West Haven, CT, USA
  - Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
18
Bosch JA, Keith N, Escobedo F, Fisher WW, LaGraff JT, Rabasco J, Wan KH, Weiszmann R, Hu Y, Kondo S, Brown JB, Perrimon N, Celniker SE. Molecular and functional characterization of the Drosophila melanogaster conserved smORFome. Cell Rep 2023; 42:113311. [PMID: 37889754 PMCID: PMC10843857 DOI: 10.1016/j.celrep.2023.113311] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/02/2022] [Revised: 08/24/2023] [Accepted: 10/04/2023] [Indexed: 10/29/2023] Open
Abstract
Short polypeptides encoded by small open reading frames (smORFs) are ubiquitously found in eukaryotic genomes and are important regulators of physiology, development, and mitochondrial processes. Here, we focus on a subset of 298 smORFs that are evolutionarily conserved between Drosophila melanogaster and humans. Many of these smORFs are conserved broadly in the bilaterian lineage, and ∼182 are conserved in plants. We observe remarkably heterogeneous spatial and temporal expression patterns of smORF transcripts, indicating widespread tissue-specific and stage-specific mitochondrial architectures. In addition, an analysis of annotated functional domains reveals a predicted enrichment of smORF polypeptides localizing to mitochondria. We conduct an embryonic ribosome profiling experiment and find support for translation of 137 of these smORFs during embryogenesis. We further embark on functional characterization using CRISPR knockout/activation, RNAi knockdown, and cDNA overexpression, revealing diverse phenotypes. This study underscores the importance of identifying smORF function in disease and phenotypic diversity.
Affiliation(s)
- Justin A Bosch
  - Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA 02115, USA
- Nathan Keith
  - Division of Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA; Division of Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
- Felipe Escobedo
  - Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA 02115, USA
- William W Fisher
  - Division of Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
- James Thai LaGraff
  - Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA 02115, USA
- Jorden Rabasco
  - Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA 02115, USA
- Kenneth H Wan
  - Division of Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
- Richard Weiszmann
  - Division of Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
- Yanhui Hu
  - Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA 02115, USA
- Shu Kondo
  - Laboratory of Invertebrate Genetics, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan
- James B Brown
  - Division of Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA; Division of Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA.
- Norbert Perrimon
  - Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA 02115, USA; Howard Hughes Medical Institute, Harvard Medical School, Boston, MA 02115, USA.
- Susan E Celniker
  - Division of Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA.
19
Zhang L, Tang M, Diao H, Xiong L, Yang X, Xing S. LncRNA-encoded peptides: unveiling their significance in cardiovascular physiology and pathology-current research insights. Cardiovasc Res 2023; 119:2165-2178. [PMID: 37517040 DOI: 10.1093/cvr/cvad112] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/15/2023] [Revised: 06/17/2023] [Accepted: 06/30/2023] [Indexed: 08/01/2023] Open
Abstract
Long non-coding RNAs (lncRNAs), which are RNA transcripts exceeding 200 nucleotides, were believed to lack any protein-coding capacity. However, advancements in -omics technology have revealed that some lncRNAs have small open reading frames (sORFs) that can be translated by ribosomes to encode peptides, some of which have important biological functions. These peptides act by interacting with their targets to modulate transcriptional or signalling axes, thereby enhancing or suppressing cardiovascular disease (CVD) occurrence and progression. In this review, we summarize what is known about research strategies for lncRNA-encoded peptides, mainly comprising predictive websites/tools and experimental methods that have been widely used for prediction, identification, and validation. More importantly, we have compiled a list of lncRNA-encoded peptides, with a focus on those that play significant roles in cardiovascular physiology and pathology, including ENSRNOT (RNO)-sORF6/RNO-sORF7/RNO-sORF8, dwarf open reading frame (DWORF), myoregulin (MLN), etc. Additionally, we have outlined the functions and mechanisms of these peptides in cardiovascular physiology and pathology, such as cardiomyocyte hypertrophy, myocardial contraction, myocardial infarction, and vascular remodelling. Finally, an overview of the existing challenges and potential future developments in the realm of lncRNA-encoded peptides is provided, with consideration given to prospective avenues for further research. Given that many lncRNA-encoded peptides have not been functionally annotated yet, their application in CVD diagnosis and treatment still requires further research.
Affiliation(s)
- Li Zhang
  - Chengdu Women's and Children's Central Hospital, School of Medicine, University of Electronic Science and Technology of China, 1617 Riyue Street, Qingyang District, Chengdu 611731, China
  - Hongqiao International Institute of Medicine, Tongren Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200336, China
- Mi Tang
  - Chengdu Women's and Children's Central Hospital, School of Medicine, University of Electronic Science and Technology of China, 1617 Riyue Street, Qingyang District, Chengdu 611731, China
- Haoyang Diao
  - Chengdu Women's and Children's Central Hospital, School of Medicine, University of Electronic Science and Technology of China, 1617 Riyue Street, Qingyang District, Chengdu 611731, China
- Liling Xiong
  - Chengdu Women's and Children's Central Hospital, School of Medicine, University of Electronic Science and Technology of China, 1617 Riyue Street, Qingyang District, Chengdu 611731, China
- Xiao Yang
  - Chengdu Women's and Children's Central Hospital, School of Medicine, University of Electronic Science and Technology of China, 1617 Riyue Street, Qingyang District, Chengdu 611731, China
- Shasha Xing
  - Chengdu Women's and Children's Central Hospital, School of Medicine, University of Electronic Science and Technology of China, 1617 Riyue Street, Qingyang District, Chengdu 611731, China
20
Markus D, Pelletier A, Boube M, Port F, Boutros M, Payre F, Obermayer B, Zanet J. The pleiotropic functions of Pri smORF peptides synchronize leg development regulators. PLoS Genet 2023; 19:e1011004. [PMID: 37903161 PMCID: PMC10635573 DOI: 10.1371/journal.pgen.1011004] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 03/14/2023] [Revised: 11/09/2023] [Accepted: 10/03/2023] [Indexed: 11/01/2023] Open
Abstract
The last decade has witnessed the emergence of the abundant family of smORF peptides, encoded by small ORFs (<100 codons), whose biological functions remain largely unexplored. Bioinformatic analyses here identify hundreds of putative smORF peptides expressed in Drosophila imaginal leg discs. Through a functional screen in the leg, we found smORF peptides involved in morphogenesis, including the pioneer smORF peptides Pri. Since we had identified the Pri target Ubr3 in the epidermis, and pri was known to control leg development through poorly understood mechanisms, we investigated the role of Ubr3 in mediating pri function in the leg. We found that pri plays several roles during leg development, both in patterning and in cell survival. During the larval stage, pri activates tarsal transcriptional programs and the Notch and EGFR signaling pathways independently of Ubr3, whereas at the larval-pupal transition, Pri peptides cooperate with Ubr3 to ensure cell survival and leg morphogenesis. Our results highlight Ubr3-dependent and Ubr3-independent functions of Pri peptides and their pleiotropy. Moreover, we reveal that the smORF peptide family is a reservoir of overlooked developmental regulators, displaying distinct molecular functions and orchestrating leg development.
Affiliation(s)
- Damien Markus
  - Molecular, Cellular and Developmental Biology Department (MCD), Centre de Biologie Intégrative (CBI), CNRS, UPS, University of Toulouse, Toulouse, France
- Aurore Pelletier
  - Molecular, Cellular and Developmental Biology Department (MCD), Centre de Biologie Intégrative (CBI), CNRS, UPS, University of Toulouse, Toulouse, France
- Muriel Boube
  - Molecular, Cellular and Developmental Biology Department (MCD), Centre de Biologie Intégrative (CBI), CNRS, UPS, University of Toulouse, Toulouse, France
- Fillip Port
  - Division Signaling and Functional Genomics, German Cancer Research Center (DKFZ) and Heidelberg University, Heidelberg, Germany
- Michael Boutros
  - Division Signaling and Functional Genomics, German Cancer Research Center (DKFZ) and Heidelberg University, Heidelberg, Germany
- François Payre
  - Molecular, Cellular and Developmental Biology Department (MCD), Centre de Biologie Intégrative (CBI), CNRS, UPS, University of Toulouse, Toulouse, France
- Benedikt Obermayer
  - Core Unit Bioinformatics (CUBI), Berlin Institute of Health at Charité Universitätsmedizin-Berlin, Berlin, Germany
- Jennifer Zanet
  - Molecular, Cellular and Developmental Biology Department (MCD), Centre de Biologie Intégrative (CBI), CNRS, UPS, University of Toulouse, Toulouse, France
21
Prensner JR, Abelin JG, Kok LW, Clauser KR, Mudge JM, Ruiz-Orera J, Bassani-Sternberg M, Moritz RL, Deutsch EW, van Heesch S. What Can Ribo-Seq, Immunopeptidomics, and Proteomics Tell Us About the Noncanonical Proteome? Mol Cell Proteomics 2023; 22:100631. [PMID: 37572790 PMCID: PMC10506109 DOI: 10.1016/j.mcpro.2023.100631] [Citation(s) in RCA: 13] [Impact Index Per Article: 13.0] [Received: 05/14/2023] [Revised: 07/21/2023] [Accepted: 08/08/2023] [Indexed: 08/14/2023] Open
Abstract
Ribosome profiling (Ribo-Seq) has proven transformative for our understanding of the human genome and proteome by illuminating thousands of noncanonical sites of ribosome translation outside the currently annotated coding sequences (CDSs). A conservative estimate suggests that at least 7000 noncanonical ORFs are translated, which, at first glance, has the potential to expand the number of human protein CDSs by 30%, from ∼19,500 annotated CDSs to over 26,000 annotated CDSs. Yet, additional scrutiny of these ORFs has raised numerous questions about what fraction of them truly produce a protein product and what fraction of those can be understood as proteins according to conventional understanding of the term. Adding further complication is the fact that published estimates of noncanonical ORFs vary widely by around 30-fold, from several thousand to several hundred thousand. The summation of this research has left the genomics and proteomics communities both excited by the prospect of new coding regions in the human genome but searching for guidance on how to proceed. Here, we discuss the current state of noncanonical ORF research, databases, and interpretation, focusing on how to assess whether a given ORF can be said to be "protein coding."
Affiliation(s)
- John R Prensner
  - Division of Pediatric Hematology/Oncology, Department of Pediatrics, University of Michigan Medical School, Ann Arbor, Michigan, USA; Department of Biological Chemistry, University of Michigan Medical School, Ann Arbor, Michigan, USA.
- Leron W Kok
  - Princess Máxima Center for Pediatric Oncology, Utrecht, The Netherlands
- Karl R Clauser
  - Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
- Jonathan M Mudge
  - European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Cambridge, UK
- Jorge Ruiz-Orera
  - Cardiovascular and Metabolic Sciences, Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin, Germany
- Michal Bassani-Sternberg
  - Ludwig Institute for Cancer Research, Agora Center Bugnon 25A, University of Lausanne, Lausanne, Switzerland; Department of Oncology, Centre Hospitalier Universitaire Vaudois (CHUV), Lausanne, Switzerland; Agora Cancer Research Centre, Lausanne, Switzerland
- Robert L Moritz
  - Institute for Systems Biology (ISB), Seattle, Washington, USA
- Eric W Deutsch
  - Institute for Systems Biology (ISB), Seattle, Washington, USA
22
Hassel KR, Brito-Estrada O, Makarewich CA. Microproteins: Overlooked regulators of physiology and disease. iScience 2023; 26:106781. [PMID: 37213226 PMCID: PMC10199267 DOI: 10.1016/j.isci.2023.106781] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Indexed: 05/23/2023] Open
Abstract
Ongoing efforts to generate a complete and accurate annotation of the genome have revealed a significant blind spot for small proteins (<100 amino acids) originating from short open reading frames (sORFs). The recent discovery of numerous sORF-encoded proteins, termed microproteins, that play diverse roles in critical cellular processes has ignited the field of microprotein biology. Large-scale efforts are currently underway to identify sORF-encoded microproteins in diverse cell-types and tissues and specialized methods and tools have been developed to aid in their discovery, validation, and functional characterization. Microproteins that have been identified thus far play important roles in fundamental processes including ion transport, oxidative phosphorylation, and stress signaling. In this review, we discuss the optimized tools available for microprotein discovery and validation, summarize the biological functions of numerous microproteins, outline the promise for developing microproteins as therapeutic targets, and look forward to the future of the field of microprotein biology.
Affiliation(s)
- Keira R. Hassel
  - The Heart Institute, Division of Molecular Cardiovascular Biology, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH 45229, USA
  - University of Cincinnati College of Medicine, Cincinnati, OH 45229, USA
- Omar Brito-Estrada
  - The Heart Institute, Division of Molecular Cardiovascular Biology, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH 45229, USA
  - University of Cincinnati College of Medicine, Cincinnati, OH 45229, USA
- Catherine A. Makarewich
  - The Heart Institute, Division of Molecular Cardiovascular Biology, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH 45229, USA
  - Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH 45229, USA
23
An intelligent based prediction of microbial behaviour in beef. Food Control 2023. [DOI: 10.1016/j.foodcont.2023.109665] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 01/27/2023]
24
|
Prensner JR, Abelin JG, Kok LW, Clauser KR, Mudge JM, Ruiz-Orera J, Bassani-Sternberg M, Deutsch EW, van Heesch S. What can Ribo-seq and proteomics tell us about the non-canonical proteome? bioRxiv 2023:2023.05.16.541049. [PMID: 37292611 PMCID: PMC10245706 DOI: 10.1101/2023.05.16.541049] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/10/2023]
Abstract
Ribosome profiling (Ribo-seq) has proven transformative for our understanding of the human genome and proteome by illuminating thousands of non-canonical sites of ribosome translation outside of the currently annotated coding sequences (CDSs). A conservative estimate suggests that at least 7,000 non-canonical open reading frames (ORFs) are translated, which, at first glance, has the potential to expand the number of human protein-coding sequences by 30%, from ∼19,500 annotated CDSs to over 26,000. Yet, additional scrutiny of these ORFs has raised numerous questions about what fraction of them truly produce a protein product and what fraction of those can be understood as proteins according to conventional understanding of the term. Adding further complication is the fact that published estimates of non-canonical ORFs vary widely by around 30-fold, from several thousand to several hundred thousand. The summation of this research has left the genomics and proteomics communities both excited by the prospect of new coding regions in the human genome, but searching for guidance on how to proceed. Here, we discuss the current state of non-canonical ORF research, databases, and interpretation, focusing on how to assess whether a given ORF can be said to be "protein-coding".

In brief
The human genome encodes thousands of non-canonical open reading frames (ORFs) in addition to protein-coding genes. As a nascent field, many questions remain regarding non-canonical ORFs. How many exist? Do they encode proteins? What level of evidence is needed for their verification? Central to these debates has been the advent of ribosome profiling (Ribo-seq) as a method to discern genome-wide ribosome occupancy, and immunopeptidomics as a method to detect peptides that are processed and presented by MHC molecules and not observed in traditional proteomics experiments. This article provides a synthesis of the current state of non-canonical ORF research and proposes standards for their future investigation and reporting.

Highlights
- Combined use of Ribo-seq and proteomics-based methods enables optimal confidence in detecting non-canonical ORFs and their protein products.
- Ribo-seq can provide more sensitive detection of non-canonical ORFs, but data quality and analytical pipelines will impact results.
- Non-canonical ORF catalogs are diverse and span both high-stringency and low-stringency ORF nominations.
- A framework for standardized non-canonical ORF evidence will advance the research field.
Affiliation(s)
- John R. Prensner
  - Department of Pediatrics, Division of Pediatric Hematology/Oncology, University of Michigan Medical School, Ann Arbor, MI 48109, USA
- Leron W. Kok
  - Princess Máxima Center for Pediatric Oncology, Heidelberglaan 25, 3584 CS, Utrecht, the Netherlands
- Karl R. Clauser
  - Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
- Jonathan M. Mudge
  - European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
- Jorge Ruiz-Orera
  - Cardiovascular and Metabolic Sciences, Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany
- Michal Bassani-Sternberg
  - Ludwig Institute for Cancer Research, University of Lausanne, Agora Center Bugnon 25A, 1005 Lausanne, Switzerland
  - Department of Oncology, Centre hospitalier universitaire vaudois (CHUV), Rue du Bugnon 46, 1005 Lausanne, Switzerland
  - Agora Cancer Research Centre, 1011 Lausanne, Switzerland
- Eric W. Deutsch
  - Institute for Systems Biology (ISB), Seattle, Washington 98109, USA
- Sebastiaan van Heesch
  - Princess Máxima Center for Pediatric Oncology, Heidelberglaan 25, 3584 CS, Utrecht, the Netherlands
|
25
|
Pei H, Dai Y, Yu Y, Tang J, Cao Z, Zhang Y, Li B, Nie J, Hei TK, Zhou G. The Tumorigenic Effect of lncRNA AFAP1-AS1 is Mediated by Translated Peptide ATMLP Under the Control of m6A Methylation. Adv Sci (Weinh) 2023; 10:e2300314. [PMID: 36871154 PMCID: PMC10161021 DOI: 10.1002/advs.202300314] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Received: 01/13/2023] [Indexed: 05/06/2023]
Abstract
Long noncoding RNAs (lncRNAs) in eukaryotic transcripts have long been believed to regulate various aspects of cellular processes, including carcinogenesis. Herein, it is found that lncRNA AFAP1-AS1 encodes a conserved 90-amino acid peptide located on mitochondria, named lncRNA AFAP1-AS1 translated mitochondrial-localized peptide (ATMLP), and it is not the lncRNA but the peptide that promotes the malignancy of non-small cell lung cancer (NSCLC). As the tumor progresses, the serum level of ATMLP increases. NSCLC patients with high levels of ATMLP display poorer prognosis. Translation of ATMLP is controlled by m6A methylation at the 1313 adenine locus of AFAP1-AS1. Mechanistically, ATMLP binds to the 4-nitrophenylphosphatase domain and non-neuronal SNAP25-like protein homolog 1 (NIPSNAP1) and inhibits its transport from the inner to the outer mitochondrial membrane, which antagonizes the NIPSNAP1-mediated regulation of cell autolysosome formation. The findings uncover a complex regulatory mechanism of NSCLC malignancy orchestrated by a peptide encoded by a lncRNA. A comprehensive judgment of the application prospects of ATMLP as an early diagnostic biomarker for NSCLC is also made.
Affiliation(s)
- Hailong Pei
  - State Key Laboratory of Radiation Medicine and Protection, School of Radiation Medicine and Protection, Suzhou Medical College of Soochow University, Jiangsu, Suzhou, 215123, P. R. China
- Yingchu Dai
  - State Key Laboratory of Radiation Medicine and Protection, School of Radiation Medicine and Protection, Suzhou Medical College of Soochow University, Jiangsu, Suzhou, 215123, P. R. China
- Yongduo Yu
  - State Key Laboratory of Radiation Medicine and Protection, School of Radiation Medicine and Protection, Suzhou Medical College of Soochow University, Jiangsu, Suzhou, 215123, P. R. China
- Jiaxin Tang
  - State Key Laboratory of Radiation Medicine and Protection, School of Radiation Medicine and Protection, Suzhou Medical College of Soochow University, Jiangsu, Suzhou, 215123, P. R. China
- Zhifei Cao
  - Department of Pathology, The Second Affiliated Hospital of Soochow University, Jiangsu, Suzhou, 215004, P. R. China
- Yongsheng Zhang
  - Department of Pathology, The Second Affiliated Hospital of Soochow University, Jiangsu, Suzhou, 215004, P. R. China
- Bingyan Li
  - State Key Laboratory of Radiation Medicine and Protection, School of Radiation Medicine and Protection, Suzhou Medical College of Soochow University, Jiangsu, Suzhou, 215123, P. R. China
- Jing Nie
  - State Key Laboratory of Radiation Medicine and Protection, School of Radiation Medicine and Protection, Suzhou Medical College of Soochow University, Jiangsu, Suzhou, 215123, P. R. China
- Tom K Hei
  - Center for Radiological Research, College of Physician and Surgeons, Columbia University, New York, NY, 10032, USA
- Guangming Zhou
  - State Key Laboratory of Radiation Medicine and Protection, School of Radiation Medicine and Protection, Suzhou Medical College of Soochow University, Jiangsu, Suzhou, 215123, P. R. China
|
26
|
Sandmann CL, Schulz JF, Ruiz-Orera J, Kirchner M, Ziehm M, Adami E, Marczenke M, Christ A, Liebe N, Greiner J, Schoenenberger A, Muecke MB, Liang N, Moritz RL, Sun Z, Deutsch EW, Gotthardt M, Mudge JM, Prensner JR, Willnow TE, Mertins P, van Heesch S, Hubner N. Evolutionary origins and interactomes of human, young microproteins and small peptides translated from short open reading frames. Mol Cell 2023; 83:994-1011.e18. [PMID: 36806354 PMCID: PMC10032668 DOI: 10.1016/j.molcel.2023.01.023] [Citation(s) in RCA: 35] [Impact Index Per Article: 35.0] [Received: 09/08/2022] [Revised: 12/12/2022] [Accepted: 01/25/2023] [Indexed: 02/19/2023]
Abstract
All species continuously evolve short open reading frames (sORFs) that can be templated for protein synthesis and may provide raw materials for evolutionary adaptation. We analyzed the evolutionary origins of 7,264 recently cataloged human sORFs and found that most were evolutionarily young and had emerged de novo. We additionally identified 221 previously missed sORFs potentially translated into peptides of up to 15 amino acids, all of which are smaller than the smallest human microprotein annotated to date. To investigate the bioactivity of sORF-encoded small peptides and young microproteins, we subjected 266 candidates to a mass-spectrometry-based interactome screen with motif resolution. Based on these interactomes and additional cellular assays, we can associate several candidates with mRNA splicing, translational regulation, and endocytosis. Our work provides insights into the evolutionary origins and interaction potential of young and small proteins, thereby helping to elucidate this underexplored territory of the human proteome.
Affiliation(s)
- Clara-L Sandmann
  - Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany; DZHK (German Centre for Cardiovascular Research), Partner Site Berlin, 13347 Berlin, Germany
- Jana F Schulz
  - Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany; DZHK (German Centre for Cardiovascular Research), Partner Site Berlin, 13347 Berlin, Germany
- Jorge Ruiz-Orera
  - Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany
- Marieluise Kirchner
  - Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany; Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Core Facility Proteomics, 10117 Berlin, Germany
- Matthias Ziehm
  - Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany; Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Core Facility Proteomics, 10117 Berlin, Germany
- Eleonora Adami
  - Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany
- Maike Marczenke
  - Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany
- Annabel Christ
  - Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany
- Nina Liebe
  - Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany
- Johannes Greiner
  - Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany
- Aaron Schoenenberger
  - Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany
- Michael B Muecke
  - Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany; DZHK (German Centre for Cardiovascular Research), Partner Site Berlin, 13347 Berlin, Germany; Charité-Universitätsmedizin, 10117 Berlin, Germany
- Ning Liang
  - Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany
- Zhi Sun
  - Institute for Systems Biology, Seattle, WA 98109, USA
- Michael Gotthardt
  - Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany; DZHK (German Centre for Cardiovascular Research), Partner Site Berlin, 13347 Berlin, Germany; Charité-Universitätsmedizin, 10117 Berlin, Germany
- Jonathan M Mudge
  - European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
- John R Prensner
  - Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA; Department of Pediatric Oncology, Dana-Farber Cancer Institute, Boston, MA 02215, USA; Division of Pediatric Hematology/Oncology, Boston Children's Hospital, Boston, MA 02115, USA
- Thomas E Willnow
  - Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany; Department of Biomedicine, Aarhus University, 8000 Aarhus, Denmark
- Philipp Mertins
  - Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany; Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Core Facility Proteomics, 10117 Berlin, Germany
- Norbert Hubner
  - Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany; DZHK (German Centre for Cardiovascular Research), Partner Site Berlin, 13347 Berlin, Germany; Charité-Universitätsmedizin, 10117 Berlin, Germany
|
27
|
Yan C, Meng Y, Yang J, Chen J, Jiang W. Translational landscape in human early neural fate determination. Development 2023; 150:297188. [PMID: 36846898 DOI: 10.1242/dev.201177] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 07/31/2022] [Accepted: 02/19/2023] [Indexed: 03/01/2023]
Abstract
Gene expression regulation in eukaryotes is a multi-level process, including transcription, mRNA translation and protein turnover. Many studies have reported sophisticated transcriptional regulation during neural development, but the global translational dynamics are still ambiguous. Here, we differentiate human embryonic stem cells (ESCs) into neural progenitor cells (NPCs) with high efficiency and perform ribosome sequencing and RNA sequencing on both ESCs and NPCs. Data analysis reveals that translational controls engage in many crucial pathways and contribute significantly to regulation of neural fate determination. Furthermore, we show that the sequence characteristics of the untranslated region (UTR) might regulate translation efficiency. Specifically, genes with short 5'UTR and intense Kozak sequence are associated with high translation efficiency in human ESCs, whereas genes with long 3'UTR are related to high translation efficiency in NPCs. In addition, we have identified four biasedly used codons (GAC, GAT, AGA and AGG) and dozens of short open reading frames during neural progenitor differentiation. Thus, our study reveals the translational landscape during early human neural differentiation and provides insights into the regulation of cell fate determination at the translational level.
Affiliation(s)
- Chenchao Yan
  - Department of Biological Repositories, Frontier Science Center for Immunology and Metabolism, Medical Research Institute, Zhongnan Hospital of Wuhan University, Wuhan 430071, China
- Yajing Meng
  - Department of Biological Repositories, Frontier Science Center for Immunology and Metabolism, Medical Research Institute, Zhongnan Hospital of Wuhan University, Wuhan 430071, China
- Jie Yang
  - Department of Biological Repositories, Frontier Science Center for Immunology and Metabolism, Medical Research Institute, Zhongnan Hospital of Wuhan University, Wuhan 430071, China
- Jian Chen
  - Chinese Institute for Brain Research (Beijing), Research Unit of Medical Neurobiology, Chinese Academy of Medical Sciences, Beijing 102206, China
- Wei Jiang
  - Department of Biological Repositories, Frontier Science Center for Immunology and Metabolism, Medical Research Institute, Zhongnan Hospital of Wuhan University, Wuhan 430071, China
  - Human Genetics Resource Preservation Center of Wuhan University, Wuhan 430071, China
|
28
|
Pueyo JI, Salazar J, Grincho C, Berni J, Towler BP, Newbury SF. Purriato is a conserved small open reading frame gene that interacts with the CASA pathway to regulate muscle homeostasis and epithelial tissue growth in Drosophila. Front Cell Dev Biol 2023; 11:1117454. [PMID: 36968202 PMCID: PMC10036370 DOI: 10.3389/fcell.2023.1117454] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/06/2022] [Accepted: 02/24/2023] [Indexed: 03/12/2023] Open
Abstract
Recent advances in proteogenomic techniques and bioinformatic pipelines have permitted the detection of thousands of translated small Open Reading Frames (smORFs), which contain fewer than 100 codons, in eukaryotic genomes. Hundreds of these actively translated smORFs display conserved sequence, structure and evolutionary signatures indicating that the translated peptides could fulfil important biological roles. Despite their abundance, only tens of smORF genes have been fully characterised; these act mainly as regulators of canonical proteins involved in essential cellular processes. Importantly, some of these smORFs display conserved functions, with their mutations being associated with pathogenesis. Thus, investigating smORF roles in Drosophila will not only expand our understanding of their functions but may also have an impact on human health. Here we describe the function of a novel and essential Drosophila smORF gene named purriato (prto). prto belongs to an ancient gene family whose members have expanded throughout the Protostomia clade. prto encodes a transmembrane peptide which is localized in endo-lysosomes and perinuclear and plasma membranes. prto is dynamically expressed in mesodermal tissues and imaginal discs. Targeted prto knockdown (KD) in these organs results in changes in nuclear morphology and endo-lysosomal distributions correlating with the loss of sarcomeric homeostasis in muscles and reduction of mitosis in wing discs. Consequently, prto KD mutants display severe reduction of motility and shorter wings. Finally, our genetic interaction experiments show that prto function is closely associated with the CASA pathway, a conserved mechanism involved in turnover of misfolded proteins and linked to muscle dystrophies and neurodegenerative diseases. Thus, this study shows the relevance of smORFs in regulating important cellular functions and supports the systematic characterisation of this class of genes to understand their functions and evolution.
Affiliation(s)
- Jose I. Pueyo
  - Brighton and Sussex Medical School, University of Sussex, Brighton, United Kingdom
- Jorge Salazar
  - Brighton and Sussex Medical School, University of Sussex, Brighton, United Kingdom
- Carolina Grincho
  - Brighton and Sussex Medical School, University of Sussex, Brighton, United Kingdom
- Jimena Berni
  - Brighton and Sussex Medical School, University of Sussex, Brighton, United Kingdom
- Benjamin P. Towler
  - Brighton and Sussex Medical School, University of Sussex, Brighton, United Kingdom
  - Department of Biochemistry and Biomedicine, School of Life Sciences, University of Sussex, Brighton, United Kingdom
- Sarah F. Newbury
  - Brighton and Sussex Medical School, University of Sussex, Brighton, United Kingdom
|
29
|
Imami K, Selbach M, Ishihama Y. Monitoring mitochondrial translation by pulse SILAC. J Biol Chem 2023; 299:102865. [PMID: 36603763 PMCID: PMC9922817 DOI: 10.1016/j.jbc.2022.102865] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/08/2022] [Revised: 12/27/2022] [Accepted: 12/27/2022] [Indexed: 01/04/2023] Open
Abstract
Mitochondrial ribosomes are specialized to translate the 13 membrane proteins encoded in the mitochondrial genome, which shapes the oxidative phosphorylation complexes essential for cellular energy metabolism. Despite the importance of mitochondrial translation (MT) control, it is challenging to identify and quantify the mitochondrial-encoded proteins because of their hydrophobic nature and low abundance. Here, we introduce a mass spectrometry-based proteomic method that combines biochemical isolation of mitochondria with pulse stable isotope labeling by amino acids in cell culture. Our method provides the highest protein identification rate with the shortest measurement time among currently available methods, enabling us to quantify 12 of the 13 mitochondrial-encoded proteins. We applied this method to uncover the global picture of (post-)translational regulation of both mitochondrial- and nuclear-encoded subunits of oxidative phosphorylation complexes. We found that inhibition of MT led to degradation of orphan nuclear-encoded subunits that are considered to form subcomplexes with the mitochondrial-encoded subunits. This method should be readily applicable to study MT programs in many contexts, including oxidative stress and mitochondrial disease.
Affiliation(s)
- Koshi Imami
  - Graduate School of Pharmaceutical Sciences, Kyoto University, Kyoto, Japan; RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
- Matthias Selbach
  - Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin, Germany; Charité-Universitätsmedizin Berlin, Berlin, Germany
- Yasushi Ishihama
  - Graduate School of Pharmaceutical Sciences, Kyoto University, Kyoto, Japan; Laboratory of Clinical and Analytical Chemistry, National Institute of Biomedical Innovation, Health and Nutrition, Osaka, Japan
|
30
|
Chothani S, Ho L, Schafer S, Rackham O. Discovering microproteins: making the most of ribosome profiling data. RNA Biol 2023; 20:943-954. [PMID: 38013207 PMCID: PMC10730196 DOI: 10.1080/15476286.2023.2279845] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 10/30/2023] [Indexed: 11/29/2023] Open
Abstract
Building a reference set of protein-coding open reading frames (ORFs) has revolutionized biological process discovery and understanding. Traditionally, gene models have been confirmed using cDNA sequencing, and encoded translated regions have been inferred using sequence-based detection of start and stop combinations longer than 100 amino acids to prevent false positives. This has led to small ORFs (smORFs) and their encoded proteins being left unannotated. Ribo-seq allows translated regions to be distinguished from untranslated ones irrespective of length. In this review, we describe the power of Ribo-seq data in detection of smORFs while discussing the major challenge posed by data quality, depth and sparseness in identifying the start and end of smORF translation. In particular, we outline smORF cataloguing efforts in humans and the large differences that have arisen due to variation in data, methods and assumptions. Although current versions of smORF reference sets can already be used as a powerful tool for hypothesis generation, we recommend that future editions should consider these data limitations and adopt unified processing for the community to establish a canonical catalogue of translated smORFs.
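The length-based annotation rule mentioned in this abstract can be illustrated with a small sketch. It is a simplified illustration under stated assumptions, not the procedure of any annotation project: it scans only the forward strand, accepts only ATG starts and the three standard stop codons, and the names `find_orfs` and `split_by_length` are hypothetical.

```python
# Illustrative sketch only: naive ATG-to-stop ORF scan on the forward strand.
# Function names and simplifications are assumptions, not a published method.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_orfs(seq):
    """Return (start, end, n_codons) tuples for ATG...stop ORFs in frames 0-2.

    n_codons counts the start codon and all sense codons, excluding the stop.
    """
    seq = seq.upper()
    orfs = []
    for frame in range(3):
        start = None  # position of the first ATG seen since the last stop
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i
            elif start is not None and codon in STOP_CODONS:
                orfs.append((start, i + 3, (i - start) // 3))
                start = None
    return orfs


def split_by_length(orfs, cutoff=100):
    """Separate ORFs longer than `cutoff` codons from the shorter ones."""
    long_orfs = [o for o in orfs if o[2] > cutoff]
    small_orfs = [o for o in orfs if o[2] <= cutoff]
    return long_orfs, small_orfs


# Two toy sequences: one above and one below the 100-codon cutoff.
long_gene = "ATG" + "GCT" * 120 + "TAA"
small_gene = "ATG" + "AAA" * 10 + "TAG"
long_orfs, small_orfs = split_by_length(find_orfs(long_gene) + find_orfs(small_gene))
print(len(long_orfs), len(small_orfs))
```

Under a 100-codon cutoff, the second toy ORF would be discarded by a purely sequence-based annotation, which is the kind of omission the abstract attributes to the traditional approach.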
Affiliation(s)
- Sonia Chothani
  - Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore, Singapore
- Lena Ho
  - Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore, Singapore
- Sebastian Schafer
  - Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore, Singapore
- Owen Rackham
  - Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore, Singapore
  - School of Biological Sciences, University of Southampton, Southampton, UK
  - The Alan Turing Institute, The British Library, London, UK
|
31
|
Yang J, Liu M, Fang X, Zhang H, Ren Q, Zheng Y, Wang Y, Zhou Y. Advances in peptides encoded by non-coding RNAs: A cargo in exosome. Front Oncol 2022; 12:1081997. [PMID: 36620552 PMCID: PMC9822543 DOI: 10.3389/fonc.2022.1081997] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/27/2022] [Accepted: 11/28/2022] [Indexed: 12/24/2022] Open
Abstract
The metastasis of malignant tumors determines patient prognosis. It is the main reason for the poor prognosis of patients with cancer and the most challenging aspect of treating malignant tumors. Therefore, it is important to identify early tumor markers and molecules that can predict patient prognosis. However, there are currently no molecular markers with good clinical accuracy and specificity. Many non-coding RNAs (ncRNAs) have been identified that can regulate the process of tumor development at multiple levels. Interestingly, some ncRNAs are translated to produce functional peptides. Exosomes act as signal carriers, encapsulate nucleic acids and proteins, and play a messenger role in cell-to-cell communication. Recent studies have identified exosome peptides with potential diagnostic roles. This review aims to provide a theoretical basis for ncRNA-encoded peptides or proteins transported by exosomes and ultimately to provide ideas for further development of new diagnostic and prognostic cancer markers.
Affiliation(s)
- Jing Yang
  - The First Clinical Medical College, Lanzhou University, Lanzhou, China; Department of Gastroenterology, The First Hospital of Lanzhou University, Lanzhou, China; Key Laboratory for Gastrointestinal Diseases of Gansu Province, The First Hospital of Lanzhou University, Lanzhou, China
- Mengxiao Liu
  - The First Clinical Medical College, Lanzhou University, Lanzhou, China; Department of Gastroenterology, The First Hospital of Lanzhou University, Lanzhou, China; Key Laboratory for Gastrointestinal Diseases of Gansu Province, The First Hospital of Lanzhou University, Lanzhou, China
- Xidong Fang
  - The First Clinical Medical College, Lanzhou University, Lanzhou, China; Department of Gastroenterology, The First Hospital of Lanzhou University, Lanzhou, China; Key Laboratory for Gastrointestinal Diseases of Gansu Province, The First Hospital of Lanzhou University, Lanzhou, China
- Huiyun Zhang
  - The First Clinical Medical College, Lanzhou University, Lanzhou, China; Department of Gastroenterology, The First Hospital of Lanzhou University, Lanzhou, China; Key Laboratory for Gastrointestinal Diseases of Gansu Province, The First Hospital of Lanzhou University, Lanzhou, China
- Qian Ren
  - Department of Gastroenterology, The First Hospital of Lanzhou University, Lanzhou, China; Key Laboratory for Gastrointestinal Diseases of Gansu Province, The First Hospital of Lanzhou University, Lanzhou, China
- Ya Zheng
  - Department of Gastroenterology, The First Hospital of Lanzhou University, Lanzhou, China; Key Laboratory for Gastrointestinal Diseases of Gansu Province, The First Hospital of Lanzhou University, Lanzhou, China
- Yuping Wang
  - Department of Gastroenterology, The First Hospital of Lanzhou University, Lanzhou, China; Key Laboratory for Gastrointestinal Diseases of Gansu Province, The First Hospital of Lanzhou University, Lanzhou, China
  - *Correspondence: Yongning Zhou; Yuping Wang
- Yongning Zhou
  - Department of Gastroenterology, The First Hospital of Lanzhou University, Lanzhou, China; Key Laboratory for Gastrointestinal Diseases of Gansu Province, The First Hospital of Lanzhou University, Lanzhou, China
  - *Correspondence: Yongning Zhou; Yuping Wang
|
32
|
Vakirlis N, Vance Z, Duggan KM, McLysaght A. De novo birth of functional microproteins in the human lineage. Cell Rep 2022; 41:111808. [PMID: 36543139 PMCID: PMC10073203 DOI: 10.1016/j.celrep.2022.111808] [Citation(s) in RCA: 27] [Impact Index Per Article: 13.5] [Received: 10/18/2021] [Revised: 06/21/2022] [Accepted: 11/18/2022] [Indexed: 12/24/2022] Open
Abstract
Small open reading frames (sORFs) can encode functional "microproteins" that perform crucial biological tasks. However, their size makes them less amenable to genomic analysis, and their origins and conservation are poorly understood. Given their short length, it is plausible that some of these functional microproteins have recently originated entirely de novo from noncoding sequences. Here we sought to identify such cases in the human lineage by reconstructing the evolutionary origins of human microproteins previously found to have measurable, statistically significant fitness effects. By tracing the formation of each ORF and its transcriptional activation, we show that novel microproteins with significant phenotypic effects have emerged de novo throughout animal evolution, including two after the human-chimpanzee split. Notably, traditional methods for assessing coding potential would miss most of these cases. This evidence demonstrates that the functional potential intrinsic to sORFs can be relatively rapidly and frequently realized through de novo gene emergence.
Affiliation(s)
- Nikolaos Vakirlis
  - Institute for Fundamental Biomedical Research, Biomedical Sciences Research Center "Alexander Fleming", Vari, Greece
- Zoe Vance
  - Smurfit Institute of Genetics, Trinity College Dublin, University of Dublin, Dublin, Ireland
- Kate M Duggan
  - Smurfit Institute of Genetics, Trinity College Dublin, University of Dublin, Dublin, Ireland
- Aoife McLysaght
  - Smurfit Institute of Genetics, Trinity College Dublin, University of Dublin, Dublin, Ireland
|
33
|
Zhang M, Zhao J, Li C, Ge F, Wu J, Jiang B, Song J, Song X. csORF-finder: an effective ensemble learning framework for accurate identification of multi-species coding short open reading frames. Brief Bioinform 2022; 23:bbac392. [PMID: 36094083 PMCID: PMC9677467 DOI: 10.1093/bib/bbac392] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Received: 07/02/2022] [Revised: 08/03/2022] [Accepted: 08/11/2022] [Indexed: 12/14/2022] Open
Abstract
Short open reading frames (sORFs) are small nucleic acid fragments no longer than 303 nt that probably encode small peptides. To date, translatable sORFs have been found in both untranslated regions of messenger ribonucleic acids (RNAs; mRNAs) and long non-coding RNAs (lncRNAs), playing vital roles in a myriad of biological processes. As not all sORFs are translated or essentially translatable, it is important to develop a highly accurate computational tool for characterizing the coding potential of sORFs, thereby facilitating discovery of novel functional peptides. In light of this, we designed a series of ensemble models by integrating Efficient-CapsNet and LightGBM, collectively termed csORF-finder, to differentiate the coding sORFs (csORFs) from non-coding sORFs in Homo sapiens, Mus musculus and Drosophila melanogaster, respectively. To improve the performance of csORF-finder, we introduced a novel feature encoding scheme named trinucleotide deviation from expected mean (TDE) and computed all types of in-frame sequence-based features, such as i-framed-3mer, i-framed-CKSNAP and i-framed-TDE. Benchmarking results showed that these features could significantly boost the performance compared to the original 3-mer, CKSNAP and TDE features. Our performance comparisons showed that csORF-finder achieved superior performance to the state-of-the-art methods for csORF prediction on multi-species and non-ATG initiation independent test datasets. Furthermore, we applied csORF-finder to screen the lncRNA datasets for identifying potential csORFs. The resulting data serve as an important computational repository for further experimental validation. We hope that csORF-finder can be exploited as a powerful platform for high-throughput identification of csORFs and functional characterization of the peptides these csORFs encode.
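The difference between an ordinary 3-mer feature and an in-frame one can be sketched as follows. The abstract does not define i-framed-3mer precisely, so this sketch assumes it means 3-mers read codon by codon from the first nucleotide of the sORF; the function `kmer_frequencies` and its `in_frame` flag are hypothetical and are not taken from the csORF-finder code.

```python
from collections import Counter


def kmer_frequencies(seq, k=3, in_frame=False):
    """Relative k-mer frequencies of a sequence.

    With in_frame=True the window advances k bases at a time from position 0
    (codon by codon when k=3); otherwise it slides one base at a time.
    This framing rule is an assumption made for illustration.
    """
    seq = seq.upper()
    step = k if in_frame else 1
    counts = Counter(seq[i:i + k] for i in range(0, len(seq) - k + 1, step))
    total = sum(counts.values())
    return {kmer: n / total for kmer, n in counts.items()} if total else {}


sorf = "ATGGCTGCTTAA"  # toy sORF: ATG GCT GCT TAA
sliding = kmer_frequencies(sorf)
framed = kmer_frequencies(sorf, in_frame=True)
print(sorted(framed))
```

In the toy sequence, the in-frame variant sees only the four codons, whereas the sliding variant also counts out-of-frame triplets such as TGG and GGC, which dilutes codon-level signal.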
Affiliation(s)
- Meng Zhang
- Department of Biomedical Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China
- Jian Zhao
- Department of Biomedical Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China
- Chen Li
- Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia
- Fang Ge
- School of Computer Science and Engineering, Nanjing University of Science and Technology, 200 Xiaolingwei, Nanjing 210094, China
- Jing Wu
- School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing 211166, China
- Bin Jiang
- College of Automation Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China
- Jiangning Song
- Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia
- Monash Data Futures Institute, Monash University, Melbourne, VIC 3800, Australia
- Xiaofeng Song
- Department of Biomedical Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China
|
34
|
Translation and natural selection of micropeptides from long non-canonical RNAs. Nat Commun 2022; 13:6515. [PMID: 36316320 PMCID: PMC9622821 DOI: 10.1038/s41467-022-34094-y] [Citation(s) in RCA: 29] [Impact Index Per Article: 14.5] [Received: 04/26/2022] [Accepted: 10/13/2022] [Indexed: 12/25/2022] Open
Abstract
Long noncoding RNAs (lncRNAs) are transcripts longer than 200 nucleotides but lacking canonical coding sequences. Apparently unable to produce peptides, lncRNA function seems to rely only on RNA expression, sequence and structure. Here, we exhaustively detect in-vivo translation of small open reading frames (small ORFs) within lncRNAs using Ribosomal profiling during Drosophila melanogaster embryogenesis. We show that around 30% of lncRNAs contain small ORFs engaged by ribosomes, leading to regulated translation of 100 to 300 micropeptides. We identify lncRNA features that favour translation, such as cistronicity, Kozak sequences, and conservation. For the latter, we develop a bioinformatics pipeline to detect small ORF homologues, and reveal evidence of natural selection favouring the conservation of micropeptide sequence and function across evolution. Our results expand the repertoire of lncRNA biochemical functions, and suggest that lncRNAs give rise to novel coding genes throughout evolution. Since most lncRNAs contain small ORFs with as yet unknown translation potential, we propose to rename them "long non-canonical RNAs".
|
35
|
Identification and analysis of smORFs in Chlamydomonas reinhardtii. Genomics 2022; 114:110444. [PMID: 35933072 DOI: 10.1016/j.ygeno.2022.110444] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 04/22/2022] [Revised: 07/06/2022] [Accepted: 07/31/2022] [Indexed: 11/24/2022]
Abstract
Small open reading frames (smORFs) have been acknowledged as important contributors to organismal function, from bacteria to higher eukaryotes. However, smORFs in green algae remain largely uninvestigated, despite their importance in ecology and evolution. We applied bioinformatic analysis, ribosome profiling, and small peptide proteomics to provide a genome-wide, high-confidence smORF database for the model green alga Chlamydomonas reinhardtii. The whole genome was first screened to mine potential coding smORFs. Then conservation analysis, ribosome profiling, and proteomics data were processed to identify conserved smORFs and generate translation evidence. The combination of procedures resulted in 2014 smORFs that might exist in the C. reinhardtii genome. The expression of smORFs under Cd treatment suggested that two smORFs might participate in redox reactions, three in inorganic phosphate transport, and one in DNA repair under stress. Our study built a genome-wide smORF database for C. reinhardtii, providing target smORFs for further research.
|
36
|
Brito-Estrada O, Hassel KR, Makarewich CA. An Integrated Approach for Microprotein Identification and Sequence Analysis. J Vis Exp 2022:10.3791/63841. [PMID: 35913170 PMCID: PMC9521633 DOI: 10.3791/63841] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Indexed: 09/06/2024] Open
Abstract
Next-generation sequencing (NGS) has propelled the field of genomics forward and produced whole genome sequences for numerous animal species and model organisms. However, despite this wealth of sequence information, comprehensive gene annotation efforts have proven challenging, especially for small proteins. Notably, conventional protein annotation methods were designed to intentionally exclude putative proteins encoded by short open reading frames (sORFs) less than 300 nucleotides in length to filter out the exponentially higher number of spurious noncoding sORFs throughout the genome. As a result, hundreds of functional small proteins called microproteins (<100 amino acids in length) have been incorrectly classified as noncoding RNAs or overlooked entirely. Here we provide a detailed protocol to leverage free, publicly available bioinformatic tools to query genomic regions for microprotein-coding potential based on evolutionary conservation. Specifically, we provide step-by-step instructions on how to examine sequence conservation and coding potential using Phylogenetic Codon Substitution Frequencies (PhyloCSF) on the user-friendly University of California Santa Cruz (UCSC) Genome Browser. Additionally, we detail steps to efficiently generate multiple species alignments of identified microprotein sequences to visualize amino acid sequence conservation and recommend resources to analyze microprotein characteristics, including predicted domain structures. These powerful tools can be used to help identify putative microprotein-coding sequences in noncanonical genomic regions or to rule out the presence of a conserved coding sequence with translational potential in a noncoding transcript of interest.
Affiliation(s)
- Omar Brito-Estrada
- The Heart Institute, Division of Molecular Cardiovascular Biology, Cincinnati Children's Hospital Medical Center
- Keira R Hassel
- The Heart Institute, Division of Molecular Cardiovascular Biology, Cincinnati Children's Hospital Medical Center
- Catherine A Makarewich
- The Heart Institute, Division of Molecular Cardiovascular Biology, Cincinnati Children's Hospital Medical Center; Department of Pediatrics, University of Cincinnati College of Medicine
|
37
|
Kragness S, Clark Z, Mullin A, Guidry J, Earls LR. An Rtn4/Nogo-A-interacting micropeptide modulates synaptic plasticity with age. PLoS One 2022; 17:e0269404. [PMID: 35771867 PMCID: PMC9246188 DOI: 10.1371/journal.pone.0269404] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 01/06/2022] [Accepted: 05/18/2022] [Indexed: 11/18/2022] Open
Abstract
Micropeptides, encoded from small open reading frames of 300 nucleotides or less, are hidden throughout mammalian genomes, though few functional studies of micropeptides in the brain are published. Here, we describe a micropeptide known as the Plasticity–Associated Neural Transcript Short (Pants), located in the 22q11.2 region of the human genome, the microdeletion of which conveys a high risk for schizophrenia. Our data show that Pants is upregulated in early adulthood in the mossy fiber circuit of the hippocampus, where it exerts a powerful negative effect on long-term potentiation (LTP). Further, we find that Pants is secreted from neurons, where it associates with synapses but is rapidly degraded with stimulation. Pants dynamically interacts with Rtn4/Nogo-A, a well-studied regulator of adult plasticity. Pants interaction with Nogo-A augments its influence over postsynaptic AMPA receptor clustering, thus gating plasticity at adult synapses. This work shows that neural micropeptides can act as architectural modules that increase the functional diversity of the known proteome.
Affiliation(s)
- S. Kragness
- Department of Cell and Molecular Biology, Tulane University, New Orleans, LA, United States of America
- Z. Clark
- Department of Cell and Molecular Biology, Tulane University, New Orleans, LA, United States of America
- A. Mullin
- Department of Cell and Molecular Biology, Tulane University, New Orleans, LA, United States of America
- Tulane University Transgenic Core Facility, New Orleans, LA, United States of America
- J. Guidry
- Department of Biochemistry and Molecular Biology, LSU School of Medicine and Health Sciences Center, New Orleans, LA, United States of America
- The Proteomics Core Facility, LSUHSC, New Orleans, LA, United States of America
- L. R. Earls
- Department of Cell and Molecular Biology, Tulane University, New Orleans, LA, United States of America
|
38
|
Liu Y, Zeng S, Wu M. Novel insights into noncanonical open reading frames in cancer. Biochim Biophys Acta Rev Cancer 2022; 1877:188755. [PMID: 35777601 DOI: 10.1016/j.bbcan.2022.188755] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 03/21/2022] [Revised: 06/11/2022] [Accepted: 06/23/2022] [Indexed: 12/12/2022]
Abstract
With technological advances, previously neglected noncanonical open reading frames (nORFs) are drawing ever-increasing attention. However, the translation potential of numerous putative nORFs remains elusive, and the functions of noncanonical peptides have not been systemically summarized. Moreover, the relationship between noncanonical peptides and their counterpart protein or RNA products remains elusive and the clinical implementation of noncanonical peptides has not been explored. In this review, we highlight how recent technological advances such as ribosome profiling, bioinformatics approaches and CRISPR/Cas9 facilitate the research of noncanonical peptides. We delineate the features of each nORF category and the evolutionary process underneath the nORFs. Most importantly, we summarize the diversified functions of noncanonical peptides in cancer based on their subcellular location, which reflect their extensive participation in key pathways and essential cellular activities in cancer cells. Meanwhile, the equilibrium between noncanonical peptides and their corresponding transcripts or counterpart products may be dysregulated under pathological states, which is essential for their roles in cancer. Lastly, we explore their underestimated potential in clinical application as diagnostic biomarkers and treatment targets against cancer.
Affiliation(s)
- Yihan Liu
- Hunan Cancer Hospital and the Affiliated Cancer Hospital of Xiangya School of Medicine, Central South University, Changsha 410013, Hunan, China; The Key Laboratory of Carcinogenesis of the Chinese Ministry of Health, The Key Laboratory of Carcinogenesis and Cancer Invasion of the Chinese Ministry of Education, Cancer Research Institute, Central South University, Changsha, Hunan 410008, China; Department of Oncology, Xiangya Hospital, Central South University, Changsha, Hunan 410008, China; Key Laboratory for Molecular Radiation Oncology of Hunan Province, Xiangya Hospital, Central South University, Changsha, Hunan 410008, China; National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, Changsha, Hunan 410008, China
- Shan Zeng
- Department of Oncology, Xiangya Hospital, Central South University, Changsha, Hunan 410008, China; Key Laboratory for Molecular Radiation Oncology of Hunan Province, Xiangya Hospital, Central South University, Changsha, Hunan 410008, China; National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, Changsha, Hunan 410008, China.
- Minghua Wu
- Hunan Cancer Hospital and the Affiliated Cancer Hospital of Xiangya School of Medicine, Central South University, Changsha 410013, Hunan, China; The Key Laboratory of Carcinogenesis of the Chinese Ministry of Health, The Key Laboratory of Carcinogenesis and Cancer Invasion of the Chinese Ministry of Education, Cancer Research Institute, Central South University, Changsha, Hunan 410008, China.
|
39
|
Pei MS, Liu HN, Wei TL, Yu YH, Guo DL. Large-scale discovery of non-conventional peptides in grape (Vitis vinifera L.) through peptidogenomics. Hortic Res 2022; 9:uhac023. [PMID: 35531313 PMCID: PMC9070638 DOI: 10.1093/hr/uhac023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/22/2021] [Accepted: 01/24/2022] [Indexed: 06/14/2023]
Abstract
Non-conventional peptides (NCPs), which are peptides derived from previously unannotated coding sequences, play important biological roles in plants. In this study, we used peptidogenomic methods that integrated mass spectrometry (MS) peptidomics and a six-frame translation database to extensively identify NCPs in grape. In total, 188 and 2021 non-redundant peptides from the Arabidopsis thaliana and Vitis vinifera L. protein database at Ensembl/URGI and an individualized peptidogenomic database were identified. Unlike conventional peptides, these NCPs derived mainly from intergenic, intronic, upstream ORF, 5'UTR, 3'UTR, and downstream ORF regions. These results show that unannotated regions are translated more broadly than we thought. We also found that most NCPs were derived from regions related to phenotypic variations, LTR retrotransposons, and domestication selection, indicating that the NCPs have an important function in complex biological processes. We also found that the NCPs were developmentally specific and had transient and specific functions in grape berry development. In summary, our study is the first to extensively identify NCPs in grape. It demonstrated that there was a large amount of translation in the genome. These results lay a foundation for studying the functions of NCPs and also provide a reference for the discovery of new functional genes in grape.
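The six-frame translation step that underlies a peptidogenomic search database can be sketched in Python as follows. This is a minimal illustration, not the authors' pipeline: it assumes the standard genetic code, marks unrecognised codons with X, and uses hypothetical function names (`reverse_complement`, `translate`, `six_frame_translation`).

```python
from itertools import product

# Standard genetic code, listed in the conventional TCAG codon order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def translate(seq):
    """Translate complete codons only; unknown codons become 'X'."""
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq) - 2, 3)
    )


def six_frame_translation(seq):
    """Translate both strands in all three reading frames.

    Keys are '+1', '+2', '+3' for the given strand and '-1', '-2', '-3'
    for the reverse complement; '*' marks a stop codon.
    """
    seq = seq.upper()
    frames = {}
    for strand, template in (("+", seq), ("-", reverse_complement(seq))):
        for offset in range(3):
            frames[f"{strand}{offset + 1}"] = translate(template[offset:])
    return frames


if __name__ == "__main__":
    for name, protein in six_frame_translation("ATGGCCTAA").items():
        print(name, protein)
```

In practice, each translated frame would typically be split at stop codons and the resulting segments added to the database against which MS spectra are searched; that downstream step is not shown here.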
Affiliation(s)
- Mao-Song Pei
- College of Horticulture and Plant Protection, Henan University of Science and Technology, Luoyang, 471023, Henan Province, China
- Henan Engineering Technology Research Center of Quality Regulation and Controlling of Horticultural Plants, Luoyang 471023, China
- Hai-Nan Liu
- College of Horticulture and Plant Protection, Henan University of Science and Technology, Luoyang, 471023, Henan Province, China
- Henan Engineering Technology Research Center of Quality Regulation and Controlling of Horticultural Plants, Luoyang 471023, China
- Tong-Lu Wei
- College of Horticulture and Plant Protection, Henan University of Science and Technology, Luoyang, 471023, Henan Province, China
- Henan Engineering Technology Research Center of Quality Regulation and Controlling of Horticultural Plants, Luoyang 471023, China
- Yi-He Yu
- College of Horticulture and Plant Protection, Henan University of Science and Technology, Luoyang, 471023, Henan Province, China
- Henan Engineering Technology Research Center of Quality Regulation and Controlling of Horticultural Plants, Luoyang 471023, China
|
40
|
Cancer-related micropeptides encoded by ncRNAs: Promising drug targets and prognostic biomarkers. Cancer Lett 2022; 547:215723. [DOI: 10.1016/j.canlet.2022.215723] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 02/17/2022] [Revised: 04/14/2022] [Accepted: 05/01/2022] [Indexed: 02/07/2023]
|
41
|
Lee HC, Hsieh CC, Tsai HJ. KEPI plays a negative role in the repression that accompanies translational inhibition guided by the uORF element of human CHOP transcript during stress response. Gene X 2022; 817:146160. [PMID: 35031423 DOI: 10.1016/j.gene.2021.146160] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 05/11/2021] [Revised: 10/28/2021] [Accepted: 12/10/2021] [Indexed: 11/04/2022] Open
Abstract
Translation of the downstream coding sequence of some mRNAs may be repressed by the upstream open reading frame (uORF) at their 5'-end. The mechanism underlying this uORF-mediated translational inhibition (uORF-MTI) is not fully understood in vivo. Recently, it was found that zebrafish Endouc or its human orthologue ENDOU (Endouc/ENDOU) plays a positive role in repressing the uORF-MTI of human CHOP (uORFchop-MTI) during stress by blocking its activity. However, the repression of uORFchop-MTI assisted by an as-yet unidentified negative effector remains to be elucidated. Compared to the upregulated CHOP transcript, we herein report that the kepi (kinase-enhanced PP1 inhibitor) transcript was downregulated in zebrafish embryos treated with both heat shock and hypoxia. Quantitative RT-PCR also revealed that the level of kepi mRNA was noticeably decreased in both heat-shock-treated and hypoxia-exposed embryos. When kepi mRNA was microinjected into one-cell embryos from the transgenic line huORFZ, the translation of the downstream GFP reporter controlled by the uORFchop-MTI was reduced in the hypoxia-exposed embryos. In contrast, when kepi was knocked down by injection of antisense Morpholino oligonucleotide, the translation of the downstream GFP reporter was induced and expressed in the brain and spinal cord of injected embryos in the absence of stress. Under normal conditions, overexpression of KEPI increased eIF2α phosphorylation, thereby inducing the translation of uORF-tag mRNAs, such as ATF4 and CHOP mRNAs. However, under stress conditions, overexpression of KEPI decreased eIF2α phosphorylation, thereby reducing the levels of GFP reporter and CHOP proteins. This is the first report to demonstrate that KEPI plays a negative role in uORFchop-mediated translation during ER stress.
Affiliation(s)
- Hung-Chieh Lee
- Institute of Biomedical Sciences, Mackay Medical College, New Taipei City, Taiwan
- Chi-Cheng Hsieh
- The Liver Disease Prevention and Treatment Research Foundation, Taipei, Taiwan
- Huai-Jen Tsai
- Department of Life Science, Fu-Jen Catholic University, New Taipei City, Taiwan; School of Medicine, Fu-Jen Catholic University, New Taipei City, Taiwan.
|
42
|
Leong AZX, Lee PY, Mohtar MA, Syafruddin SE, Pung YF, Low TY. Short open reading frames (sORFs) and microproteins: an update on their identification and validation measures. J Biomed Sci 2022; 29:19. [PMID: 35300685 PMCID: PMC8928697 DOI: 10.1186/s12929-022-00802-5] [Citation(s) in RCA: 21] [Impact Index Per Article: 10.5] [Received: 12/31/2021] [Accepted: 03/09/2022] [Indexed: 12/17/2022] Open
Abstract
A short open reading frame (sORF) comprises ≤ 300 bases and encodes a microprotein or sORF-encoded protein (SEP) of ≤ 100 amino acids. Traditionally dismissed by genome annotation pipelines as meaningless noise, sORFs were found to possess coding potential with ribosome profiling (RIBO-Seq), which unveiled sORF-based transcripts at various genome locations. Nonetheless, the existence of corresponding microproteins that are stable and functional was initially supported by little experimental evidence. With recent advancements in multi-omics, the identification, validation, and functional characterisation of sORFs and microproteins have become feasible. In this review, we discuss the history and development of an emerging research field of sORFs and microproteins. In particular, we focus on an array of bioinformatics and OMICS approaches used for predicting, sequencing, validating, and characterizing these recently discovered entities. These strategies include RIBO-Seq, which detects sORF transcripts via ribosome footprints, and mass spectrometry (MS)-based proteomics for sequencing the resultant microproteins. Subsequently, our discussion extends to the functional characterisation of microproteins by incorporating CRISPR/Cas9 screens and protein–protein interaction (PPI) studies. Our review not only discusses detection methodologies but also highlights the challenges and potential solutions in identifying and validating sORFs and their microproteins. The novelty of this review lies in its validation of the functional role of microproteins, which could contribute towards the future landscape of microproteomics.
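The length threshold quoted above (≤ 300 bases, ≤ 100 amino acids) translates directly into a simple candidate scan. The Python sketch below is only an illustration under stated assumptions: it considers ATG start codons on the forward strand, measures length as coding nucleotides excluding the stop codon, and applies a lower bound `min_nt=30` that is a hypothetical default, not a value from the review. The function name `find_sorfs` is likewise hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_sorfs(seq, max_nt=300, min_nt=30):
    """List ATG-initiated ORFs whose coding length is within bounds.

    Returns (start, end, frame) tuples with 0-based start and exclusive
    end; end includes the stop codon. Only the forward strand is
    scanned, and each ORF begins at the first ATG after the previous
    stop in its frame.
    """
    seq = seq.upper()
    hits = []
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i
            elif start is not None and codon in STOP_CODONS:
                coding_nt = i - start  # stop codon excluded (assumption)
                if min_nt <= coding_nt <= max_nt:
                    hits.append((start, i + 3, frame))
                start = None
    return hits


if __name__ == "__main__":
    short_orf = "CC" + "ATG" + "GCT" * 10 + "TAA" + "GG"
    print(find_sorfs(short_orf))
```

A scan of this kind only nominates candidates. As the abstract stresses, evidence of translation (for example RIBO-Seq footprints or MS-detected peptides) is still needed before a candidate is treated as a coding sORF, and non-ATG initiation would require a broader start-codon set.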
Affiliation(s)
- Alyssa Zi-Xin Leong
- UKM Medical Molecular Biology Institute (UMBI), Universiti Kebangsaan Malaysia, 56000, Kuala Lumpur, Malaysia
- Pey Yee Lee
- UKM Medical Molecular Biology Institute (UMBI), Universiti Kebangsaan Malaysia, 56000, Kuala Lumpur, Malaysia
- M Aiman Mohtar
- UKM Medical Molecular Biology Institute (UMBI), Universiti Kebangsaan Malaysia, 56000, Kuala Lumpur, Malaysia
- Saiful Effendi Syafruddin
- UKM Medical Molecular Biology Institute (UMBI), Universiti Kebangsaan Malaysia, 56000, Kuala Lumpur, Malaysia
- Yuh-Fen Pung
- Division of Biomedical Science, School of Pharmacy, University of Nottingham Malaysia, Semenyih, 43500, Selangor, Malaysia
- Teck Yew Low
- UKM Medical Molecular Biology Institute (UMBI), Universiti Kebangsaan Malaysia, 56000, Kuala Lumpur, Malaysia.
|
43
|
Small open reading frames in plant research: from prediction to functional characterization. 3 Biotech 2022; 12:76. [PMID: 35251879 PMCID: PMC8873315 DOI: 10.1007/s13205-022-03147-w] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Received: 05/28/2021] [Accepted: 02/11/2022] [Indexed: 11/01/2022] Open
Abstract
Gene prediction is a laborious and time-consuming task. The advancement of sequencing technologies and bioinformatics tools, coupled with the accelerated development of ribosome profiling and mass spectrometry, has made identification of small open reading frames (sORFs) (< 100 codons) in various plant genomes possible. The past 50 years have seen sORFs being isolated from many organisms. However, a comprehensive sORF annotation pipeline is not yet available, a gap that we address in this review. Here, we also provide current information on classification and functions of plant sORFs and their potential applications in crop improvement programs.
|
44
|
Bonilauri B, Dallagiovanna B. Microproteins in skeletal muscle: hidden keys in muscle physiology. J Cachexia Sarcopenia Muscle 2022; 13:100-113. [PMID: 34850602 PMCID: PMC8818594 DOI: 10.1002/jcsm.12866] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Received: 07/06/2021] [Revised: 10/01/2021] [Accepted: 10/12/2021] [Indexed: 11/10/2022] Open
Abstract
Recent advances in the transcriptomics, translatomics, and proteomics have led us to the exciting new world of functional endogenous microproteins. These microproteins have a small size and are derived from small open reading frames (smORFs) of RNAs previously annotated as non-coding (e.g. lncRNAs and circRNAs) as well as from untranslated regions and canonical mRNAs. The presence of these microproteins reveals a much larger translatable portion of the genome, shifting previously defined dogmas and paradigms. These findings affect our view of organisms as a whole, including skeletal muscle tissue. Emerging evidence demonstrates that several smORF-derived microproteins play crucial roles during muscle development (myogenesis), maintenance, and regeneration, as well as lipid and glucose metabolism and skeletal muscle bioenergetics. These microproteins are also involved in processes including physical activity capacity, cellular stress, and muscular-related diseases (i.e. myopathy, cachexia, atrophy, and muscle wasting). Given the role of these small proteins as important key regulators of several skeletal muscle processes, there are rich prospects for the discovery of new microproteins and possible therapies using synthetic microproteins.
Affiliation(s)
- Bernardo Bonilauri
- Laboratory of Basic Biology of Stem Cells (LABCET), Carlos Chagas Institute - Fiocruz-PR, Curitiba, Paraná, Brazil
- Bruno Dallagiovanna
- Laboratory of Basic Biology of Stem Cells (LABCET), Carlos Chagas Institute - Fiocruz-PR, Curitiba, Paraná, Brazil
|
45
|
DeVito LM, Barzilai N, Cuervo AM, Niedernhofer LJ, Milman S, Levine M, Promislow D, Ferrucci L, Kuchel GA, Mannick J, Justice J, Gonzales MM, Kirkland JL, Cohen P, Campisi J. Extending human healthspan and longevity: a symposium report. Ann N Y Acad Sci 2022; 1507:70-83. [PMID: 34498278 PMCID: PMC10231756 DOI: 10.1111/nyas.14681] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Received: 08/09/2021] [Accepted: 08/09/2021] [Indexed: 12/13/2022]
Abstract
For many years, it was believed that the aging process was inevitable and that age-related diseases could not be prevented or reversed. The geroscience hypothesis, however, posits that aging is, in fact, malleable and, by targeting the hallmarks of biological aging, it is indeed possible to alleviate age-related diseases and dysfunction and extend longevity. This field of geroscience thus aims to prevent the development of multiple disorders with age, thereby extending healthspan, with the reduction of morbidity toward the end of life. Experts in the field have made remarkable advancements in understanding the mechanisms underlying biological aging and identified ways to target aging pathways using both novel agents and repurposed therapies. While geroscience researchers currently face significant barriers in bringing therapies through clinical development, proof-of-concept studies, as well as early-stage clinical trials, are underway to assess the feasibility of drug evaluation and lay a regulatory foundation for future FDA approvals.
Affiliation(s)
- Nir Barzilai
- Albert Einstein College of Medicine, Bronx, New York
- Sofiya Milman
- Albert Einstein College of Medicine, Bronx, New York
- Luigi Ferrucci
- National Institute on Aging, National Institutes of Health, Bethesda, Maryland
- George A Kuchel
- University of Connecticut School of Medicine, Farmington, Connecticut
- Jamie Justice
- Wake Forest School of Medicine, Winston-Salem, North Carolina
- Mitzi M Gonzales
- University of Texas Health Sciences Center San Antonio, San Antonio, Texas
- Pinchas Cohen
- USC Leonard Davis School of Gerontology, Los Angeles, California
- Judith Campisi
- The Buck Institute for Research on Aging, Novato, California
- Lawrence Berkeley National Laboratory, Berkeley, California
|
46
|
Chen Y, Long W, Yang L, Zhao Y, Wu X, Li M, Du F, Chen Y, Yang Z, Wen Q, Yi T, Xiao Z, Shen J. Functional Peptides Encoded by Long Non-Coding RNAs in Gastrointestinal Cancer. Front Oncol 2021; 11:777374. [PMID: 34888249 PMCID: PMC8649637 DOI: 10.3389/fonc.2021.777374] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Received: 09/15/2021] [Accepted: 10/28/2021] [Indexed: 12/11/2022] Open
Abstract
Gastrointestinal cancer is by far the most common malignancy and the most common cause of cancer-related deaths worldwide. Recent studies have shown that long non-coding RNAs (lncRNAs) play an important role in the epigenetic regulation of cancer cells and regulate tumor progression by affecting chromatin modifications, gene transcription, and translation, and by acting as miRNA sponges. In particular, lncRNAs have recently been found to contain open reading frames (ORFs) that can encode functional small peptides or proteins. These peptides interact with their targets to regulate transcription or signaling axes, thus promoting or inhibiting the occurrence and development of tumors. In this review, we summarize the involvement of lncRNAs and the functions of lncRNA-encoded small peptides in gastrointestinal cancer.
Collapse
Affiliation(s)
- Yao Chen
- Laboratory of Molecular Pharmacology, Department of Pharmacology, School of Pharmacy, Southwest Medical University, Luzhou, China.,South Sichuan Institute of Translational Medicine, Luzhou, China.,Laboratory of Personalised Cell Therapy & Cell Medicines, School of Pharmacy, Southwest Medical University, Luzhou, China
| | - Weili Long
- School of Basic Medicine, Southwest Medical University, Luzhou, China
| | - Liqiong Yang
- Laboratory of Molecular Pharmacology, Department of Pharmacology, School of Pharmacy, Southwest Medical University, Luzhou, China.,South Sichuan Institute of Translational Medicine, Luzhou, China.,Laboratory of Personalised Cell Therapy & Cell Medicines, School of Pharmacy, Southwest Medical University, Luzhou, China
| | - Yueshui Zhao
- Laboratory of Molecular Pharmacology, Department of Pharmacology, School of Pharmacy, Southwest Medical University, Luzhou, China.,South Sichuan Institute of Translational Medicine, Luzhou, China.,Laboratory of Personalised Cell Therapy & Cell Medicines, School of Pharmacy, Southwest Medical University, Luzhou, China
| | - Xu Wu
- Laboratory of Molecular Pharmacology, Department of Pharmacology, School of Pharmacy, Southwest Medical University, Luzhou, China.,South Sichuan Institute of Translational Medicine, Luzhou, China.,Laboratory of Personalised Cell Therapy & Cell Medicines, School of Pharmacy, Southwest Medical University, Luzhou, China
| | - Mingxing Li
- Laboratory of Molecular Pharmacology, Department of Pharmacology, School of Pharmacy, Southwest Medical University, Luzhou, China.,South Sichuan Institute of Translational Medicine, Luzhou, China.,Laboratory of Personalised Cell Therapy & Cell Medicines, School of Pharmacy, Southwest Medical University, Luzhou, China
| | - Fukuan Du
- Laboratory of Molecular Pharmacology, Department of Pharmacology, School of Pharmacy, Southwest Medical University, Luzhou, China.,South Sichuan Institute of Translational Medicine, Luzhou, China.,Laboratory of Personalised Cell Therapy & Cell Medicines, School of Pharmacy, Southwest Medical University, Luzhou, China
| | - Yu Chen
- Laboratory of Molecular Pharmacology, Department of Pharmacology, School of Pharmacy, Southwest Medical University, Luzhou, China.,South Sichuan Institute of Translational Medicine, Luzhou, China.,Laboratory of Personalised Cell Therapy & Cell Medicines, School of Pharmacy, Southwest Medical University, Luzhou, China
| | - Zhihui Yang
- Department of Pathology, The Affiliated Hospital of Southwest Medical University, Luzhou, China
| | - Qinglian Wen
- Department of Oncology, The Affiliated Hospital of Southwest Medical University, Luzhou, China
| | - Tao Yi
- School of Chinese Medicine, Hong Kong Baptist University, Hong Kong, Hong Kong SAR, China
| | - Zhangang Xiao
- Laboratory of Molecular Pharmacology, Department of Pharmacology, School of Pharmacy, Southwest Medical University, Luzhou, China.,South Sichuan Institute of Translational Medicine, Luzhou, China.,Laboratory of Personalised Cell Therapy & Cell Medicines, School of Pharmacy, Southwest Medical University, Luzhou, China
| | - Jing Shen
- Laboratory of Molecular Pharmacology, Department of Pharmacology, School of Pharmacy, Southwest Medical University, Luzhou, China.,South Sichuan Institute of Translational Medicine, Luzhou, China.,Laboratory of Personalised Cell Therapy & Cell Medicines, School of Pharmacy, Southwest Medical University, Luzhou, China
| |
|
47
|
Chen L, Yang Y, Zhang Y, Li K, Cai H, Wang H, Zhao Q. The Small Open Reading Frame-Encoded Peptides: Advances in Methodologies and Functional Studies. Chembiochem 2021; 23:e202100534. [PMID: 34862721 DOI: 10.1002/cbic.202100534] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/06/2021] [Revised: 11/15/2021] [Indexed: 11/07/2022]
Abstract
Small open reading frames (sORFs) are an important class of genes with less than 100 codons. They were historically annotated as noncoding or even junk sequences. In recent years, accumulating evidence suggests that sORFs could encode a considerable number of polypeptides, many of which play important roles in both physiology and disease pathology. However, it has been technically challenging to directly detect sORF-encoded peptides (SEPs). Here, we discuss the latest advances in methodologies for identifying SEPs with mass spectrometry, as well as the progress on functional studies of SEPs.
Affiliation(s)
- Lei Chen
- State Key Laboratory of Chemical Biology and Drug Discovery, Department of Applied Biology and Chemical Technology, Hong Kong Polytechnic University, Hung Hom, Hong Kong SAR, 999077, P. R. China.,Laboratory for Synthetic Chemistry and Chemical Biology Limited, Hong Kong Science and Technology Park, New Territories, Hong Kong SAR, 999077, P. R. China
| | - Ying Yang
- State Key Laboratory of Chemical Biology and Drug Discovery, Department of Applied Biology and Chemical Technology, Hong Kong Polytechnic University, Hung Hom, Hong Kong SAR, 999077, P. R. China
| | - Yuanliang Zhang
- State Key Laboratory of Chemical Biology and Drug Discovery, Department of Applied Biology and Chemical Technology, Hong Kong Polytechnic University, Hung Hom, Hong Kong SAR, 999077, P. R. China
| | - Kecheng Li
- State Key Laboratory of Chemical Biology and Drug Discovery, Department of Applied Biology and Chemical Technology, Hong Kong Polytechnic University, Hung Hom, Hong Kong SAR, 999077, P. R. China
| | - Hongmin Cai
- School of Computer Science and Engineering, South China University of Technology, Guangzhou, 510623, P. R. China
| | - Hongwei Wang
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Guangzhou, 510623, P. R. China
| | - Qian Zhao
- State Key Laboratory of Chemical Biology and Drug Discovery, Department of Applied Biology and Chemical Technology, Hong Kong Polytechnic University, Hung Hom, Hong Kong SAR, 999077, P. R. China
| |
|
48
|
Chatterjee O, Gopalakrishnan L, Mol P, Advani J, Nair B, Shankar SK, Mahadevan A, Prasad TSK. The Normal Human Adult Hypothalamus Proteomic Landscape: Rise of Neuroproteomics in Biological Psychiatry and Systems Biology. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY 2021; 25:693-710. [PMID: 34714154 DOI: 10.1089/omi.2021.0158] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/12/2022]
Abstract
The human hypothalamus is central to the regulation of neuroendocrine and neurovegetative systems, as well as modulation of chronobiology and behavioral aspects in human health and disease. Surprisingly, a deep proteomic analysis of the normal human hypothalamic proteome has been missing for such an important organ so far. In this study, we delineated the human hypothalamus proteome using a high-resolution mass spectrometry approach which resulted in the identification of 5349 proteins, while a multiple post-translational modification (PTM) search identified 191 additional proteins, which were missed in the first search. A proteogenomic analysis resulted in the discovery of multiple novel protein-coding regions as we identified proteins from noncoding regions (pseudogenes) and proteins translated from short open reading frames that can be missed using the traditional pipeline of prediction of protein-coding genes as a part of genome annotation. We also identified several PTMs of hypothalamic proteins that may be required for normal hypothalamic functions. Moreover, we observed an enrichment of proteins pertaining to autophagy and adult neurogenesis in the proteome data. We believe that the hypothalamic proteome reported herein would help to decipher the molecular basis for the diverse range of physiological functions attributed to it, as well as its role in neurological and psychiatric diseases. Extensive proteomic profiling of the hypothalamic nuclei would further elaborate on the role and functional characterization of several hypothalamus-specific proteins and pathways to inform future research and clinical discoveries in biological psychiatry, neurology, and system biology.
Affiliation(s)
- Oishi Chatterjee
- Institute of Bioinformatics, Bangalore, India.,Amrita School of Biotechnology, Amrita University, Kollam, India.,Center for Systems Biology and Molecular Medicine, Yenepoya Research Center, Yenepoya (Deemed to be University), Mangalore, India
| | - Lathika Gopalakrishnan
- Institute of Bioinformatics, Bangalore, India.,Center for Systems Biology and Molecular Medicine, Yenepoya Research Center, Yenepoya (Deemed to be University), Mangalore, India.,Manipal Academy of Higher Education, Manipal, India
| | - Praseeda Mol
- Institute of Bioinformatics, Bangalore, India.,Amrita School of Biotechnology, Amrita University, Kollam, India
| | | | - Bipin Nair
- Amrita School of Biotechnology, Amrita University, Kollam, India
| | - Susarla Krishna Shankar
- Department of Neuropathology, National Institute of Mental Health and Neurosciences, Bangalore, India.,Human Brain Tissue Repository, National Institute of Mental Health and Neurosciences, Bangalore, India
| | - Anita Mahadevan
- Department of Neuropathology, National Institute of Mental Health and Neurosciences, Bangalore, India.,Human Brain Tissue Repository, National Institute of Mental Health and Neurosciences, Bangalore, India
| | | |
|
49
|
Parmar BS, Peeters MKR, Boonen K, Clark EC, Baggerman G, Menschaert G, Temmerman L. Identification of Non-Canonical Translation Products in C. elegans Using Tandem Mass Spectrometry. Front Genet 2021; 12:728900. [PMID: 34759956 PMCID: PMC8575065 DOI: 10.3389/fgene.2021.728900] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/22/2021] [Accepted: 09/16/2021] [Indexed: 11/22/2022] Open
Abstract
Transcriptome and ribosome sequencing have revealed the existence of many non-canonical transcripts, mainly containing splice variants, ncRNA, sORFs and altORFs. However, identification and characterization of products that may be translated out of these remains a challenge. Addressing this, we here report on 552 non-canonical proteins and splice variants in the model organism C. elegans using tandem mass spectrometry. Aided by sequencing-based prediction, we generated a custom proteome database tailored to search for non-canonical translation products of C. elegans. Using this database, we mined available mass spectrometric resources of C. elegans, from which 51 novel, non-canonical proteins could be identified. Furthermore, we utilized diverse proteomic and peptidomic strategies to detect 40 novel non-canonical proteins in C. elegans by LC-TIMS-MS/MS, of which 6 were common with our meta-analysis of existing resources. Together, this permits us to provide a resource with detailed annotation of 467 splice variants and 85 novel proteins mapped onto UTRs, non-coding regions and alternative open reading frames of the C. elegans genome.
Affiliation(s)
- Bhavesh S. Parmar
- Animal Physiology and Neurobiology, University of Leuven (KU Leuven), Leuven, Belgium
| | - Marlies K. R. Peeters
- Laboratory of Bioinformatics and Computational Genomics (BioBix), Department of Mathematical Modelling, Ghent University, Ghent, Belgium
| | - Kurt Boonen
- Centre for Proteomics (CFP), University of Antwerp, Antwerp, Belgium
| | - Ellie C. Clark
- Animal Physiology and Neurobiology, University of Leuven (KU Leuven), Leuven, Belgium
| | - Geert Baggerman
- Centre for Proteomics (CFP), University of Antwerp, Antwerp, Belgium
| | - Gerben Menschaert
- Laboratory of Bioinformatics and Computational Genomics (BioBix), Department of Mathematical Modelling, Ghent University, Ghent, Belgium
| | - Liesbet Temmerman
- Animal Physiology and Neurobiology, University of Leuven (KU Leuven), Leuven, Belgium
| |
|
50
|
Fesenko I, Shabalina SA, Mamaeva A, Knyazev A, Glushkevich A, Lyapina I, Ziganshin R, Kovalchuk S, Kharlampieva D, Lazarev V, Taliansky M, Koonin EV. A vast pool of lineage-specific microproteins encoded by long non-coding RNAs in plants. Nucleic Acids Res 2021; 49:10328-10346. [PMID: 34570232 DOI: 10.1093/nar/gkab816] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Received: 06/03/2021] [Revised: 08/17/2021] [Accepted: 09/17/2021] [Indexed: 12/17/2022] Open
Abstract
Pervasive transcription of eukaryotic genomes results in expression of long non-coding RNAs (lncRNAs) most of which are poorly conserved in evolution and appear to be non-functional. However, some lncRNAs have been shown to perform specific functions, in particular, transcription regulation. Thousands of small open reading frames (smORFs, <100 codons) located on lncRNAs potentially might be translated into peptides or microproteins. We report a comprehensive analysis of the conservation and evolutionary trajectories of lncRNAs-smORFs from the moss Physcomitrium patens across transcriptomes of 479 plant species. Although thousands of smORFs are subject to substantial purifying selection, the majority of the smORFs appear to be evolutionary young and could represent a major pool for functional innovation. Using nanopore RNA sequencing, we show that, on average, the transcriptional level of conserved smORFs is higher than that of non-conserved smORFs. Proteomic analysis confirmed translation of 82 novel species-specific smORFs. Numerous conserved smORFs containing low complexity regions (LCRs) or transmembrane domains were identified, the biological functions of a selected LCR-smORF were demonstrated experimentally. Thus, microproteins encoded by smORFs are a major, functionally diverse component of the plant proteome.
Affiliation(s)
- Igor Fesenko
- Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry of the Russian Academy of Sciences, Moscow 117997, Russian Federation
| | - Svetlana A Shabalina
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Anna Mamaeva
- Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry of the Russian Academy of Sciences, Moscow 117997, Russian Federation
| | - Andrey Knyazev
- Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry of the Russian Academy of Sciences, Moscow 117997, Russian Federation
| | - Anna Glushkevich
- Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry of the Russian Academy of Sciences, Moscow 117997, Russian Federation
| | - Irina Lyapina
- Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry of the Russian Academy of Sciences, Moscow 117997, Russian Federation
| | - Rustam Ziganshin
- Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry of the Russian Academy of Sciences, Moscow 117997, Russian Federation
| | - Sergey Kovalchuk
- Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry of the Russian Academy of Sciences, Moscow 117997, Russian Federation
| | - Daria Kharlampieva
- Department of Cell Biology, Federal Research and Clinical Center of Physical-Chemical Medicine of Federal Medical Biological Agency, Moscow 119435, Russian Federation
| | - Vassili Lazarev
- Department of Cell Biology, Federal Research and Clinical Center of Physical-Chemical Medicine of Federal Medical Biological Agency, Moscow 119435, Russian Federation.,Moscow Institute of Physics and Technology (National Research University), Dolgoprudny, Moscow region, 141701, Russian Federation
| | - Michael Taliansky
- Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry of the Russian Academy of Sciences, Moscow 117997, Russian Federation.,The James Hutton Institute, Invergowrie, Dundee DD2 5DA, UK
| | - Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| |
|