1
|
Silva CAO, Alves SDS, Rodrigues BDC, Fraga Egidio JA, Ribeiro L, Logullo C, Mury FB, Santos DDG, Portal T, Monteiro-de-Barros C, Roberto da Silva J, Nepomuceno-Silva JL, Nunes-da-Fonseca R. The mlpt smORF gene is essential for digestive physiology and molting during nymphal stages in the kissing bug Rhodnius prolixus. INSECT BIOCHEMISTRY AND MOLECULAR BIOLOGY 2024; 172:104154. [PMID: 38972513 DOI: 10.1016/j.ibmb.2024.104154] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/14/2023] [Revised: 07/04/2024] [Accepted: 07/04/2024] [Indexed: 07/09/2024]
Abstract
Chagas disease affects around 8 million people globally, with Latin America bearing approximately 10,000 deaths each year. Combatting the disease relies heavily on vector control methods, necessitating the identification of new targets. Within insect genomes, genes harboring small open reading frames (smORFs - < 100 amino acids) present numerous potential candidates. In our investigation, we elucidate the pivotal role of the archetypal smORF-containing gene, mille-pattes/polished-rice/tarsalless (mlpt/pri/tal), in the post-embryonic development of the kissing bug Rhodnius prolixus. Injection of double-stranded RNA targeting mlpt (dsmlpt) during nymphal stages yields a spectrum of phenotypes hindering post-embryonic growth. Notably, fourth or fifth stage nymphs subjected to dsmlpt do not undergo molting. These dsmlpt nymphs display heightened mRNA levels of JHAMT-like and EPOX-like, enzymes putatively involved in the juvenile hormone (JH) pathway, alongside increased expression of the transcription factor Kr-h1, indicating changes in the hormonal control. Histological examination reveals structural alterations in the hindgut and external cuticle of dsmlpt nymphs compared to control (dsGFP) counterparts. Furthermore, significant changes in the vector's digestive physiology were observed, with elevated hemozoin and glucose levels in the posterior midgut of dsmlpt nymphs. Importantly, dsmlpt nymphs exhibit impaired metacyclogenesis of Trypanosoma cruzi, the causative agent of Chagas disease, underscoring the crucial role of proper gut organization in parasite differentiation. Thus, our findings constitute the first evidence of a smORF-containing gene's regulatory influence on vector physiology, parasitic cycle, and disease transmission.
Collapse
Affiliation(s)
- Carina Azevedo Oliveira Silva
- Laboratório Integrado de Bioquímica Hatisaburo Masuda (LIBHM), Instituto de Biodiversidade e Sustentabilidade (NUPEM/UFRJ), Universidade Federal do Rio de Janeiro, Macaé, RJ, Brazil; Laboratório Integrado de Ciências Morfofuncionais (LICM), Instituto de Biodiversidade e Sustentabilidade (NUPEM/UFRJ), Universidade Federal do Rio de Janeiro, Macaé, RJ, Brazil
| | - Sandy da Silveira Alves
- Laboratório Integrado de Biociências Translacionais (LIBT), Instituto de Biodiversidade e Sustentabilidade (NUPEM/UFRJ), Universidade Federal do Rio de Janeiro, Macaé, RJ, Brazil
| | - Bruno da Costa Rodrigues
- Laboratório Integrado de Ciências Morfofuncionais (LICM), Instituto de Biodiversidade e Sustentabilidade (NUPEM/UFRJ), Universidade Federal do Rio de Janeiro, Macaé, RJ, Brazil
| | - Jonatha Anderson Fraga Egidio
- Laboratório Integrado de Ciências Morfofuncionais (LICM), Instituto de Biodiversidade e Sustentabilidade (NUPEM/UFRJ), Universidade Federal do Rio de Janeiro, Macaé, RJ, Brazil
| | - Lupis Ribeiro
- Laboratório Integrado de Ciências Morfofuncionais (LICM), Instituto de Biodiversidade e Sustentabilidade (NUPEM/UFRJ), Universidade Federal do Rio de Janeiro, Macaé, RJ, Brazil
| | - Carlos Logullo
- Laboratório Integrado de Bioquímica Hatisaburo Masuda (LIBHM), Instituto de Biodiversidade e Sustentabilidade (NUPEM/UFRJ), Universidade Federal do Rio de Janeiro, Macaé, RJ, Brazil; Instituto Nacional de Ciência e Tecnologia em Entomologia Molecular - INCT-EM, Brazil
| | - Flavia Borges Mury
- Laboratório Integrado de Biociências Translacionais (LIBT), Instituto de Biodiversidade e Sustentabilidade (NUPEM/UFRJ), Universidade Federal do Rio de Janeiro, Macaé, RJ, Brazil; Instituto Nacional de Ciência e Tecnologia em Entomologia Molecular - INCT-EM, Brazil
| | - Daniele das Graças Santos
- Laboratório Integrado de Ciências Morfofuncionais (LICM), Instituto de Biodiversidade e Sustentabilidade (NUPEM/UFRJ), Universidade Federal do Rio de Janeiro, Macaé, RJ, Brazil
| | - Taynan Portal
- Laboratório Integrado de Biociências Translacionais (LIBT), Instituto de Biodiversidade e Sustentabilidade (NUPEM/UFRJ), Universidade Federal do Rio de Janeiro, Macaé, RJ, Brazil
| | - Cintia Monteiro-de-Barros
- Laboratório Integrado de Biociências Translacionais (LIBT), Instituto de Biodiversidade e Sustentabilidade (NUPEM/UFRJ), Universidade Federal do Rio de Janeiro, Macaé, RJ, Brazil
| | - José Roberto da Silva
- Laboratório Integrado de Bioquímica Hatisaburo Masuda (LIBHM), Instituto de Biodiversidade e Sustentabilidade (NUPEM/UFRJ), Universidade Federal do Rio de Janeiro, Macaé, RJ, Brazil; Instituto Nacional de Ciência e Tecnologia em Entomologia Molecular - INCT-EM, Brazil
| | - José Luciano Nepomuceno-Silva
- Laboratório Integrado de Bioquímica Hatisaburo Masuda (LIBHM), Instituto de Biodiversidade e Sustentabilidade (NUPEM/UFRJ), Universidade Federal do Rio de Janeiro, Macaé, RJ, Brazil
| | - Rodrigo Nunes-da-Fonseca
- Laboratório Integrado de Ciências Morfofuncionais (LICM), Instituto de Biodiversidade e Sustentabilidade (NUPEM/UFRJ), Universidade Federal do Rio de Janeiro, Macaé, RJ, Brazil; Instituto Nacional de Ciência e Tecnologia em Entomologia Molecular - INCT-EM, Brazil.
| |
Collapse
|
2
|
Jaiswal M, Kumar S. smAMPsTK: a toolkit to unravel the smORFome encoding AMPs of plant species. J Biomol Struct Dyn 2024; 42:6600-6612. [PMID: 37464885 DOI: 10.1080/07391102.2023.2235605] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2023] [Accepted: 07/06/2023] [Indexed: 07/20/2023]
Abstract
The pervasive repertoire of plant molecules with the potential to serve as a substitute for conventional antibiotics has led to obtaining better insights into plant-derived antimicrobial peptides (AMPs). The massive distribution of Small Open Reading Frames (smORFs) throughout eukaryotic genomes with proven extensive biological functions reflects their practicality as antimicrobials. Here, we have developed a pipeline named smAMPsTK to unveil the underlying hidden smORFs encoding AMPs for plant species. By applying this pipeline, we have elicited AMPs of various functional activity of lengths ranging from 5 to 100 aa by employing publicly available transcriptome data of five different angiosperms. Later, we studied the coding potential of AMPs-smORFs, the inclusion of diverse translation initiation start codons, and amino acid frequency. Codon usage study signifies no such codon usage biases for smORFs encoding AMPs. Majorly three start codons are prominent in generating AMPs. The evolutionary and conservational study proclaimed the widespread distribution of AMPs encoding genes throughout the plant kingdom. Domain analysis revealed that nearly all AMPs have chitin-binding ability, establishing their role as antifungal agents. The current study includes a developed methodology to characterize smORFs encoding AMPs, and their implications as antimicrobial, antibacterial, antifungal, or antiviral provided by SVM score and prediction status calculated by machine learning-based prediction models. The pipeline, complete package, and the results derived for five angiosperms are freely available at https://github.com/skbinfo/smAMPsTK.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Mohini Jaiswal
- Bioinformatics Laboratory, National Institute of Plant Genome Research (NIPGR), Aruna Asaf Ali Marg, New Delhi, India
| | - Shailesh Kumar
- Bioinformatics Laboratory, National Institute of Plant Genome Research (NIPGR), Aruna Asaf Ali Marg, New Delhi, India
| |
Collapse
|
3
|
Rich A, Acar O, Carvunis AR. Massively integrated coexpression analysis reveals transcriptional regulation, evolution and cellular implications of the yeast noncanonical translatome. Genome Biol 2024; 25:183. [PMID: 38978079 PMCID: PMC11232214 DOI: 10.1186/s13059-024-03287-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2023] [Accepted: 05/20/2024] [Indexed: 07/10/2024] Open
Abstract
BACKGROUND Recent studies uncovered pervasive transcription and translation of thousands of noncanonical open reading frames (nORFs) outside of annotated genes. The contribution of nORFs to cellular phenotypes is difficult to infer using conventional approaches because nORFs tend to be short, of recent de novo origins, and lowly expressed. Here we develop a dedicated coexpression analysis framework that accounts for low expression to investigate the transcriptional regulation, evolution, and potential cellular roles of nORFs in Saccharomyces cerevisiae. RESULTS Our results reveal that nORFs tend to be preferentially coexpressed with genes involved in cellular transport or homeostasis but rarely with genes involved in RNA processing. Mechanistically, we discover that young de novo nORFs located downstream of conserved genes tend to leverage their neighbors' promoters through transcription readthrough, resulting in high coexpression and high expression levels. Transcriptional piggybacking also influences the coexpression profiles of young de novo nORFs located upstream of genes, but to a lesser extent and without detectable impact on expression levels. Transcriptional piggybacking influences, but does not determine, the transcription profiles of de novo nORFs emerging nearby genes. About 40% of nORFs are not strongly coexpressed with any gene but are transcriptionally regulated nonetheless and tend to form entirely new transcription modules. We offer a web browser interface ( https://carvunislab.csb.pitt.edu/shiny/coexpression/ ) to efficiently query, visualize, and download our coexpression inferences. CONCLUSIONS Our results suggest that nORF transcription is highly regulated. Our coexpression dataset serves as an unprecedented resource for unraveling how nORFs integrate into cellular networks, contribute to cellular phenotypes, and evolve.
Collapse
Affiliation(s)
- April Rich
- Joint Carnegie Mellon University-University of Pittsburgh, University of Pittsburgh Computational Biology PhD Program, University of Pittsburgh, Pittsburgh, PA, USA
- Department of Computational and Systems Biology, University of Pittsburgh School of Medicine, Pittsburgh, PA, USA
- Pittsburgh Center for Evolutionary Biology and Medicine (CEBaM), University of Pittsburgh, Pittsburgh, PA, USA
| | - Omer Acar
- Joint Carnegie Mellon University-University of Pittsburgh, University of Pittsburgh Computational Biology PhD Program, University of Pittsburgh, Pittsburgh, PA, USA
- Department of Computational and Systems Biology, University of Pittsburgh School of Medicine, Pittsburgh, PA, USA
- Pittsburgh Center for Evolutionary Biology and Medicine (CEBaM), University of Pittsburgh, Pittsburgh, PA, USA
| | - Anne-Ruxandra Carvunis
- Department of Computational and Systems Biology, University of Pittsburgh School of Medicine, Pittsburgh, PA, USA.
- Pittsburgh Center for Evolutionary Biology and Medicine (CEBaM), University of Pittsburgh, Pittsburgh, PA, USA.
| |
Collapse
|
4
|
Whited AM, Jungreis I, Allen J, Cleveland CL, Mudge JM, Kellis M, Rinn JL, Hough LE. Biophysical characterization of high-confidence, small human proteins. BIOPHYSICAL REPORTS 2024; 4:100167. [PMID: 38909903 PMCID: PMC11305224 DOI: 10.1016/j.bpr.2024.100167] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/29/2024] [Revised: 04/09/2024] [Accepted: 06/20/2024] [Indexed: 06/25/2024]
Abstract
Significant efforts have been made to characterize the biophysical properties of proteins. Small proteins have received less attention because their annotation has historically been less reliable. However, recent improvements in sequencing, proteomics, and bioinformatics techniques have led to the high-confidence annotation of small open reading frames (smORFs) that encode for functional proteins, producing smORF-encoded proteins (SEPs). SEPs have been found to perform critical functions in several species, including humans. While significant efforts have been made to annotate SEPs, less attention has been given to the biophysical properties of these proteins. We characterized the distributions of predicted and curated biophysical properties, including sequence composition, structure, localization, function, and disease association of a conservative list of previously identified human SEPs. We found significant differences between SEPs and both larger proteins and control sets. In addition, we provide an example of how our characterization of biophysical properties can contribute to distinguishing protein-coding smORFs from noncoding ones in otherwise ambiguous cases.
Collapse
Affiliation(s)
- A M Whited
- BioFrontiers Institute, University of Colorado, Boulder, Colorado
| | - Irwin Jungreis
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts; MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, Massachusetts
| | - Jeffre Allen
- BioFrontiers Institute, University of Colorado, Boulder, Colorado; Department of Biochemistry, University of Colorado Boulder, Boulder, Colorado
| | | | - Jonathan M Mudge
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, United Kingdom
| | - Manolis Kellis
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts; MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, Massachusetts
| | - John L Rinn
- BioFrontiers Institute, University of Colorado, Boulder, Colorado; Department of Biochemistry, University of Colorado Boulder, Boulder, Colorado
| | - Loren E Hough
- BioFrontiers Institute, University of Colorado, Boulder, Colorado; Department of Physics, University of Colorado Boulder, Boulder, Colorado.
| |
Collapse
|
5
|
Chen J, Li Q, Xia S, Arsala D, Sosa D, Wang D, Long M. The Rapid Evolution of De Novo Proteins in Structure and Complex. Genome Biol Evol 2024; 16:evae107. [PMID: 38753069 PMCID: PMC11149777 DOI: 10.1093/gbe/evae107] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/10/2024] [Indexed: 06/06/2024] Open
Abstract
Recent studies in the rice genome-wide have established that de novo genes, evolving from noncoding sequences, enhance protein diversity through a stepwise process. However, the pattern and rate of their evolution in protein structure over time remain unclear. Here, we addressed these issues within a surprisingly short evolutionary timescale (<1 million years for 97% of Oryza de novo genes) with comparative approaches to gene duplicates. We found that de novo genes evolve faster than gene duplicates in the intrinsically disordered regions (such as random coils), secondary structure elements (such as α helix and β strand), hydrophobicity, and molecular recognition features. In de novo proteins, specifically, we observed an 8% to 14% decay in random coils and intrinsically disordered region lengths and a 2.3% to 6.5% increase in structured elements, hydrophobicity, and molecular recognition features, per million years on average. These patterns of structural evolution align with changes in amino acid composition over time as well. We also revealed higher positive charges but smaller molecular weights for de novo proteins than duplicates. Tertiary structure predictions showed that most de novo proteins, though not typically well folded on their own, readily form low-energy and compact complexes with other proteins facilitated by extensive residue contacts and conformational flexibility, suggesting a faster-binding scenario in de novo proteins to promote interaction. These analyses illuminate a rapid evolution of protein structure in de novo genes in rice genomes, originating from noncoding sequences, highlighting their quick transformation into active, protein complex-forming components within a remarkably short evolutionary timeframe.
Collapse
Affiliation(s)
- Jianhai Chen
- Department of Ecology and Evolution, The University of Chicago, Chicago, IL 60637, USA
| | - Qingrong Li
- Division of Pharmaceutical Sciences, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA 92093, USA
- Department of Cellular & Molecular Medicine, School of Medicine, University of California San Diego, La Jolla, CA 92093, USA
| | - Shengqian Xia
- Department of Ecology and Evolution, The University of Chicago, Chicago, IL 60637, USA
| | - Deanna Arsala
- Department of Ecology and Evolution, The University of Chicago, Chicago, IL 60637, USA
| | - Dylan Sosa
- Department of Ecology and Evolution, The University of Chicago, Chicago, IL 60637, USA
| | - Dong Wang
- Division of Pharmaceutical Sciences, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA 92093, USA
- Department of Cellular & Molecular Medicine, School of Medicine, University of California San Diego, La Jolla, CA 92093, USA
| | - Manyuan Long
- Department of Ecology and Evolution, The University of Chicago, Chicago, IL 60637, USA
| |
Collapse
|
6
|
Salgado JCS, Alnoch RC, Polizeli MDLTDM, Ward RJ. Microenzymes: Is There Anybody Out There? Protein J 2024; 43:393-404. [PMID: 38507106 DOI: 10.1007/s10930-024-10193-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/08/2024] [Indexed: 03/22/2024]
Abstract
Biological macromolecules are found in different shapes and sizes. Among these, enzymes catalyze biochemical reactions and are essential in all organisms, but is there a limit size for them to function properly? Large enzymes such as catalases have hundreds of kDa and are formed by multiple subunits, whereas most enzymes are smaller, with molecular weights of 20-60 kDa. Enzymes smaller than 10 kDa could be called microenzymes and the present literature review brings together evidence of their occurrence in nature. Additionally, bioactive peptides could be a natural source for novel microenzymes hidden in larger peptides and molecular downsizing could be useful to engineer artificial enzymes with low molecular weight improving their stability and heterologous expression. An integrative approach is crucial to discover and determine the amino acid sequences of novel microenzymes, together with their genomic identification and their biochemical biological and evolutionary functions.
Collapse
Affiliation(s)
- Jose Carlos Santos Salgado
- Department of Chemistry, Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto (FFCLRP), University of São Paulo, Ribeirão Preto, 14040-900, São Paulo, Brazil.
- Department of Biology, Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto (FFCLRP), University of São Paulo, Ribeirão Preto, 14040-901, São Paulo, Brazil.
| | - Robson Carlos Alnoch
- Department of Biology, Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto (FFCLRP), University of São Paulo, Ribeirão Preto, 14040-901, São Paulo, Brazil
- Department of Biochemistry and Immunology, Faculdade de Medicina de Ribeirão Preto (FMRP), University of São Paulo, Ribeirão Preto, 14049-900, São Paulo, Brazil
| | - Maria de Lourdes Teixeira de Moraes Polizeli
- Department of Biology, Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto (FFCLRP), University of São Paulo, Ribeirão Preto, 14040-901, São Paulo, Brazil
- Department of Biochemistry and Immunology, Faculdade de Medicina de Ribeirão Preto (FMRP), University of São Paulo, Ribeirão Preto, 14049-900, São Paulo, Brazil
| | - Richard John Ward
- Department of Chemistry, Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto (FFCLRP), University of São Paulo, Ribeirão Preto, 14040-900, São Paulo, Brazil
- Department of Biochemistry and Immunology, Faculdade de Medicina de Ribeirão Preto (FMRP), University of São Paulo, Ribeirão Preto, 14049-900, São Paulo, Brazil
| |
Collapse
|
7
|
Nichols C, Do-Thi VA, Peltier DC. Noncanonical microprotein regulation of immunity. Mol Ther 2024:S1525-0016(24)00324-1. [PMID: 38734902 DOI: 10.1016/j.ymthe.2024.05.021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2024] [Revised: 04/19/2024] [Accepted: 05/09/2024] [Indexed: 05/13/2024] Open
Abstract
The immune system is highly regulated but, when dysregulated, suboptimal protective or overly robust immune responses can lead to immune-mediated disorders. The genetic and molecular mechanisms of immune regulation are incompletely understood, impeding the development of more precise diagnostics and therapeutics for immune-mediated disorders. Recently, thousands of previously unrecognized noncanonical microprotein genes encoded by small open reading frames have been identified. Many of these microproteins perform critical functions, often in a cell- and context-specific manner. Several microproteins are now known to regulate immunity; however, the vast majority are uncharacterized. Therefore, illuminating what is often referred to as the "dark proteome," may present opportunities to tune immune responses more precisely. Here, we review noncanonical microprotein biology, highlight recently discovered examples regulating immunity, and discuss the potential and challenges of modulating dysregulated immune responses by targeting microproteins.
Collapse
Affiliation(s)
- Cydney Nichols
- Morris Green Scholars Program, Department of Pediatrics, Riley Hospital for Children, Indiana University School of Medicine, Indianapolis, IN 46202, USA
| | - Van Anh Do-Thi
- Division of Pediatric Hematology and Oncology, Department of Pediatrics, Herman B. Wells Center for Pediatric Research, Indiana University School of Medicine, Indianapolis, IN 46202, USA
| | - Daniel C Peltier
- Division of Pediatric Hematology and Oncology, Department of Pediatrics, Herman B. Wells Center for Pediatric Research, Indiana University School of Medicine, Indianapolis, IN 46202, USA; Simon Cancer Center, Indiana University School of Medicine, Indianapolis, IN 46202, USA.
| |
Collapse
|
8
|
Peng Z, Li J, Jiang X, Wan C. sOCP: a framework predicting smORF coding potential based on TIS and in-frame features and effectively applied in the human genome. Brief Bioinform 2024; 25:bbae147. [PMID: 38600664 PMCID: PMC11006793 DOI: 10.1093/bib/bbae147] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2023] [Revised: 02/25/2024] [Accepted: 03/19/2024] [Indexed: 04/12/2024] Open
Abstract
Small open reading frames (smORFs) have been acknowledged to play various roles on essential biological pathways and affect human beings from diabetes to tumorigenesis. Predicting smORFs in silico is quite a prerequisite for processing the omics data. Here, we proposed the smORF-coding-potential-predicting framework, sOCP, which provides functions to construct a model for predicting novel smORFs in some species. The sOCP model constructed in human was based on in-frame features and the nucleotide bias around the start codon, and the small feature subset was proved to be competent enough and avoid overfitting problems for complicated models. It showed more advanced prediction metrics than previous methods and could correlate closely with experimental evidence in a heterogeneous dataset. The model was applied to Rattus norvegicus and exhibited satisfactory performance. We then scanned smORFs with ATG and non-ATG start codons from the human genome and generated a database containing about a million novel smORFs with coding potential. Around 72 000 smORFs are located on the lncRNA regions of the genome. The smORF-encoded peptides may be involved in biological pathways rare for canonical proteins, including glucocorticoid catabolic process and the prokaryotic defense system. Our work provides a model and database for human smORF investigation and a convenient tool for further smORF prediction in other species.
Collapse
Affiliation(s)
- Zhao Peng
- School of Life Sciences, and Hubei Key Laboratory of Genetic Regulation and Integrative Biology, Central China Normal University, Wuhan 430079, Hubei, People’s Republic of China
| | - Jiaqiang Li
- School of Computer Science, and Hubei Provincial Key Laboratory of Artificial Intelligence and Smart Learning, Central China Normal University, Wuhan 430079, Hubei, People’s Republic of China
| | - Xingpeng Jiang
- School of Computer Science, and Hubei Provincial Key Laboratory of Artificial Intelligence and Smart Learning, Central China Normal University, Wuhan 430079, Hubei, People’s Republic of China
| | - Cuihong Wan
- School of Life Sciences, and Hubei Key Laboratory of Genetic Regulation and Integrative Biology, Central China Normal University, Wuhan 430079, Hubei, People’s Republic of China
| |
Collapse
|
9
|
Ren G, Gu X, Zhang L, Gong S, Song S, Chen S, Chen Z, Wang X, Li Z, Zhou Y, Li L, Yang J, Lai F, Dang Y. Ribosomal frameshifting at normal codon repeats recodes functional chimeric proteins in human. Nucleic Acids Res 2024; 52:2463-2479. [PMID: 38281188 PMCID: PMC10954444 DOI: 10.1093/nar/gkae035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2023] [Revised: 01/04/2024] [Accepted: 01/10/2024] [Indexed: 01/30/2024] Open
Abstract
Ribosomal frameshifting refers to the process that ribosomes slip into +1 or -1 reading frame, thus produce chimeric trans-frame proteins. In viruses and bacteria, programmed ribosomal frameshifting can produce essential trans-frame proteins for viral replication or regulation of other biological processes. In humans, however, functional trans-frame protein derived from ribosomal frameshifting is scarcely documented. Combining multiple assays, we show that short codon repeats could act as cis-acting elements that stimulate ribosomal frameshifting in humans, abbreviated as CRFS hereafter. Using proteomic analyses, we identified many putative CRFS events from 32 normal human tissues supported by trans-frame peptides positioned at codon repeats. Finally, we show a CRFS-derived trans-frame protein (HDAC1-FS) functions by antagonizing the activities of HDAC1, thus affecting cell migration and apoptosis. These data suggest a novel type of translational recoding associated with codon repeats, which may expand the coding capacity of mRNA and diversify the regulation in human.
Collapse
Affiliation(s)
- Guiping Ren
- State Key Laboratory for Conservation and Utilization of Bio-Resource in Yunnan, Key Laboratory for Southwest Microbial Diversity of the Ministry of Education, Yunnan Key Laboratory of Cell Metabolism and Diseases, Center for Life Science, School of Life Sciences, Yunnan University, Kunming 650021, China
- Southwest United Graduate School, Kunming650092, China
| | - Xiaoqian Gu
- State Key Laboratory for Conservation and Utilization of Bio-Resource in Yunnan, Key Laboratory for Southwest Microbial Diversity of the Ministry of Education, Yunnan Key Laboratory of Cell Metabolism and Diseases, Center for Life Science, School of Life Sciences, Yunnan University, Kunming 650021, China
- Southwest United Graduate School, Kunming650092, China
| | - Lu Zhang
- State Key Laboratory for Conservation and Utilization of Bio-Resource in Yunnan, Key Laboratory for Southwest Microbial Diversity of the Ministry of Education, Yunnan Key Laboratory of Cell Metabolism and Diseases, Center for Life Science, School of Life Sciences, Yunnan University, Kunming 650021, China
- Southwest United Graduate School, Kunming650092, China
| | - Shimin Gong
- State Key Laboratory for Conservation and Utilization of Bio-Resource in Yunnan, Key Laboratory for Southwest Microbial Diversity of the Ministry of Education, Yunnan Key Laboratory of Cell Metabolism and Diseases, Center for Life Science, School of Life Sciences, Yunnan University, Kunming 650021, China
- Southwest United Graduate School, Kunming650092, China
| | - Shuang Song
- State Key Laboratory for Conservation and Utilization of Bio-Resource in Yunnan, Key Laboratory for Southwest Microbial Diversity of the Ministry of Education, Yunnan Key Laboratory of Cell Metabolism and Diseases, Center for Life Science, School of Life Sciences, Yunnan University, Kunming 650021, China
- Southwest United Graduate School, Kunming650092, China
| | - Shunkai Chen
- State Key Laboratory for Conservation and Utilization of Bio-Resource in Yunnan, Key Laboratory for Southwest Microbial Diversity of the Ministry of Education, Yunnan Key Laboratory of Cell Metabolism and Diseases, Center for Life Science, School of Life Sciences, Yunnan University, Kunming 650021, China
- Southwest United Graduate School, Kunming650092, China
| | - Zhenjing Chen
- State Key Laboratory for Conservation and Utilization of Bio-Resource in Yunnan, Key Laboratory for Southwest Microbial Diversity of the Ministry of Education, Yunnan Key Laboratory of Cell Metabolism and Diseases, Center for Life Science, School of Life Sciences, Yunnan University, Kunming 650021, China
- Southwest United Graduate School, Kunming650092, China
| | - Xiaoyan Wang
- State Key Laboratory for Conservation and Utilization of Bio-Resource in Yunnan, Key Laboratory for Southwest Microbial Diversity of the Ministry of Education, Yunnan Key Laboratory of Cell Metabolism and Diseases, Center for Life Science, School of Life Sciences, Yunnan University, Kunming 650021, China
- Southwest United Graduate School, Kunming650092, China
| | - Zhanbiao Li
- State Key Laboratory for Conservation and Utilization of Bio-Resource in Yunnan, Key Laboratory for Southwest Microbial Diversity of the Ministry of Education, Yunnan Key Laboratory of Cell Metabolism and Diseases, Center for Life Science, School of Life Sciences, Yunnan University, Kunming 650021, China
- Southwest United Graduate School, Kunming650092, China
| | - Yingshui Zhou
- State Key Laboratory for Conservation and Utilization of Bio-Resource in Yunnan, Key Laboratory for Southwest Microbial Diversity of the Ministry of Education, Yunnan Key Laboratory of Cell Metabolism and Diseases, Center for Life Science, School of Life Sciences, Yunnan University, Kunming 650021, China
- Southwest United Graduate School, Kunming650092, China
| | - Longxi Li
- State Key Laboratory for Conservation and Utilization of Bio-Resource in Yunnan, Key Laboratory for Southwest Microbial Diversity of the Ministry of Education, Yunnan Key Laboratory of Cell Metabolism and Diseases, Center for Life Science, School of Life Sciences, Yunnan University, Kunming 650021, China
- Southwest United Graduate School, Kunming650092, China
| | - Jiao Yang
- State Key Laboratory for Conservation and Utilization of Bio-Resource in Yunnan, Key Laboratory for Southwest Microbial Diversity of the Ministry of Education, Yunnan Key Laboratory of Cell Metabolism and Diseases, Center for Life Science, School of Life Sciences, Yunnan University, Kunming 650021, China
- Southwest United Graduate School, Kunming650092, China
| | - Fan Lai
- State Key Laboratory for Conservation and Utilization of Bio-Resource in Yunnan, Key Laboratory for Southwest Microbial Diversity of the Ministry of Education, Yunnan Key Laboratory of Cell Metabolism and Diseases, Center for Life Science, School of Life Sciences, Yunnan University, Kunming 650021, China
- Southwest United Graduate School, Kunming650092, China
| | - Yunkun Dang
- State Key Laboratory for Conservation and Utilization of Bio-Resource in Yunnan, Key Laboratory for Southwest Microbial Diversity of the Ministry of Education, Yunnan Key Laboratory of Cell Metabolism and Diseases, Center for Life Science, School of Life Sciences, Yunnan University, Kunming 650021, China
- Southwest United Graduate School, Kunming650092, China
| |
Collapse
|
10
|
Zandanell J, Wießner M, Bauer JW, Wagner RN. Stop codon readthrough as a treatment option for epidermolysis bullosa-Where we are and where we are going. Exp Dermatol 2024; 33:e15042. [PMID: 38459626 DOI: 10.1111/exd.15042] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Revised: 01/24/2024] [Accepted: 02/17/2024] [Indexed: 03/10/2024]
Abstract
In the context of rare genetic diseases caused by nonsense mutations, the concept of induced stop codon readthrough (SCR) represents an attractive avenue in the ongoing search for improved treatment options. Epidermolysis bullosa (EB)-exemplary for this group of diseases-describes a diverse group of rare, blistering genodermatoses. Characterized by extreme skin fragility upon minor mechanical trauma, the most severe forms often result from nonsense mutations that lead to premature translation termination and loss of function of essential proteins at the dermo-epidermal junction. Since no curative interventions are currently available, medical care is mainly limited to alleviating symptoms and preventing complications. Complementary to attempts of gene, cell and protein therapy in EB, SCR represents a promising medical alternative. While gentamicin has already been examined in several clinical trials involving EB, other potent SCR inducers, such as ataluren, may also show promise in treating the hitherto non-curative disease. In addition to the extensively studied aminoglycosides and their derivatives, several other substance classes-non-aminoglycoside antibiotics and non-aminoglycoside compounds-are currently under investigation. The extensive data gathered in numerous in vitro experiments and the perspectives they reveal in the clinical setting will be discussed in this review.
Collapse
Affiliation(s)
- Johanna Zandanell
- Department of Dermatology and Allergology, University Hospital of the Paracelsus Medical University, Salzburg, Austria
| | - Michael Wießner
- Department of Dermatology and Allergology, University Hospital of the Paracelsus Medical University, Salzburg, Austria
| | - Johann W Bauer
- Department of Dermatology and Allergology, University Hospital of the Paracelsus Medical University, Salzburg, Austria
| | - Roland N Wagner
- Department of Dermatology and Allergology, University Hospital of the Paracelsus Medical University, Salzburg, Austria
| |
Collapse
|
11
|
Valdivia-Francia F, Sendoel A. No country for old methods: New tools for studying microproteins. iScience 2024; 27:108972. [PMID: 38333695 PMCID: PMC10850755 DOI: 10.1016/j.isci.2024.108972] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/10/2024] Open
Abstract
Microproteins encoded by small open reading frames (sORFs) have emerged as a fascinating frontier in genomics. Traditionally overlooked due to their small size, recent technological advancements such as ribosome profiling, mass spectrometry-based strategies and advanced computational approaches have led to the annotation of more than 7000 sORFs in the human genome. Despite the vast progress, only a tiny portion of these microproteins have been characterized and an important challenge in the field lies in identifying functionally relevant microproteins and understanding their role in different cellular contexts. In this review, we explore the recent advancements in sORF research, focusing on the new methodologies and computational approaches that have facilitated their identification and functional characterization. Leveraging these new tools hold great promise for dissecting the diverse cellular roles of microproteins and will ultimately pave the way for understanding their role in the pathogenesis of diseases and identifying new therapeutic targets.
Collapse
Affiliation(s)
- Fabiola Valdivia-Francia
- University of Zurich, Institute for Regenerative Medicine (IREM), Wagistrasse 12, 8952 Schlieren-Zurich, Switzerland
- Life Science Zurich Graduate School, Molecular Life Science Program, University of Zurich/ ETH Zurich, Schlieren-Zurich, Switzerland
| | - Ataman Sendoel
- University of Zurich, Institute for Regenerative Medicine (IREM), Wagistrasse 12, 8952 Schlieren-Zurich, Switzerland
| |
Collapse
|
12
|
Mao Y, Qian SB. Making sense of mRNA translational "noise". Semin Cell Dev Biol 2024; 154:114-122. [PMID: 36925447 PMCID: PMC10500040 DOI: 10.1016/j.semcdb.2023.03.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2022] [Revised: 03/09/2023] [Accepted: 03/09/2023] [Indexed: 03/15/2023]
Abstract
The importance of translation fidelity has been apparent since the discovery of genetic code. It is commonly believed that translation deviating from the main coding region is to be avoided at all times inside cells. However, ribosome profiling and mass spectrometry have revealed pervasive noncanonical translation. Both the scope and origin of translational "noise" are just beginning to be appreciated. Although largely overlooked, those translational "noises" are associated with a wide range of cellular functions, such as producing unannotated protein products. Furthermore, the dynamic nature of translational "noise" is responsive to stress conditions, highlighting the beneficial effect of translational "noise" in stress adaptation. Mechanistic investigation of translational "noise" will provide better insight into the mechanisms of translational regulation. Ultimately, they are not "noise" at all but represent a signature of cellular activities under pathophysiological conditions. Deciphering translational "noise" holds the therapeutic and diagnostic potential in a wide spectrum of human diseases.
Collapse
Affiliation(s)
- Yuanhui Mao
- Division of Nutritional Sciences, Cornell University, Ithaca, NY 14853, USA
| | - Shu-Bing Qian
- Division of Nutritional Sciences, Cornell University, Ithaca, NY 14853, USA.
| |
Collapse
|
13
|
Atakan MM, Türkel İ, Özerkliğ B, Koşar ŞN, Taylor DF, Yan X, Bishop DJ. Small peptides: could they have a big role in metabolism and the response to exercise? J Physiol 2024; 602:545-568. [PMID: 38196325 DOI: 10.1113/jp283214] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2023] [Accepted: 12/14/2023] [Indexed: 01/11/2024] Open
Abstract
Exercise is a powerful non-pharmacological intervention for the treatment and prevention of numerous chronic diseases. Contracting skeletal muscles provoke widespread perturbations in numerous cells, tissues and organs, which stimulate multiple integrated adaptations that ultimately contribute to the many health benefits associated with regular exercise. Despite much research, the molecular mechanisms driving such changes are not completely resolved. Technological advancements beginning in the early 1960s have opened new avenues to explore the mechanisms responsible for the many beneficial adaptations to exercise. This has led to increased research into the role of small peptides (<100 amino acids) and mitochondrially derived peptides in metabolism and disease, including those coded within small open reading frames (sORFs; coding sequences that encode small peptides). Recently, it has been hypothesized that sORF-encoded mitochondrially derived peptides and other small peptides play significant roles as exercise-sensitive peptides in exercise-induced physiological adaptation. In this review, we highlight the discovery of mitochondrially derived peptides and newly discovered small peptides involved in metabolism, with a specific emphasis on their functions in exercise-induced adaptations and the prevention of metabolic diseases. In light of the few studies available, we also present data on how both single exercise sessions and exercise training affect expression of sORF-encoded mitochondrially derived peptides. Finally, we outline numerous research questions that await investigation regarding the roles of mitochondrially derived peptides in metabolism and prevention of various diseases, in addition to their roles in exercise-induced physiological adaptations, for future studies.
Collapse
Affiliation(s)
- Muhammed M Atakan
- Division of Exercise Nutrition and Metabolism, Faculty of Sport Sciences, Hacettepe University, Ankara, Turkey
- Institute for Health and Sport (iHeS), Victoria University, Melbourne, Victoria, Australia
| | - İbrahim Türkel
- Department of Exercise and Sport Sciences, Faculty of Sport Sciences, Hacettepe University, Ankara, Turkey
| | - Berkay Özerkliğ
- Department of Exercise and Sport Sciences, Faculty of Sport Sciences, Hacettepe University, Ankara, Turkey
| | - Şükran N Koşar
- Division of Exercise Nutrition and Metabolism, Faculty of Sport Sciences, Hacettepe University, Ankara, Turkey
| | - Dale F Taylor
- Institute for Health and Sport (iHeS), Victoria University, Melbourne, Victoria, Australia
| | - Xu Yan
- Institute for Health and Sport (iHeS), Victoria University, Melbourne, Victoria, Australia
- Sarcopenia Research Program, Australia Institute for Musculoskeletal Sciences (AIMSS), Melbourne, Victoria, Australia
| | - David J Bishop
- Institute for Health and Sport (iHeS), Victoria University, Melbourne, Victoria, Australia
| |
Collapse
|
14
|
Cai X, Yao Y, Ren F, Zhang S. circTldc1 increases Tldc1 expression by targeting miR-485-5p to promote fibroblast-like synoviocytes proliferation in collagen-induced arthritis. Exp Cell Res 2024; 435:113928. [PMID: 38190869 DOI: 10.1016/j.yexcr.2024.113928] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2023] [Revised: 01/04/2024] [Accepted: 01/05/2024] [Indexed: 01/10/2024]
Abstract
Abnormalities in the function of fibroblast-like synoviocytes (FLSs) are crucial factors leading to joint damage of rheumatoid arthritis. In recent years, the role of circular RNA (circRNA) in RA has gradually been revealed. However, the functional regulation of FLSs mediated by circRNA and its potential mechanisms remain unclear. In this study, we elucidated the expression profile of circRNA in FLSs, as well as the role and molecular mechanisms of circTldc1. Through sequencing and validation experiments on primary FLSs derived from collagen-induced arthritis (CIA) rats, we found that circTldc1 can promote FLSs proliferation and exacerbate CIA-induced joint damage. The data revealed that circTldc1's parent gene, Tldc1, is homologous to human Tldc1, and circTldc1 is located in the cytoplasm of FLSs, belonging to the exonic circRNA category. The results from bioinformatics analysis, molecular experiments on FLSs (manipulating circTldc1 expression in vitro), and animal experiments (local regulation of circTldc1 expression in vivo) collectively confirmed that circTldc1 promotes Tldc1 expression by targeting miR-485-5p. High expression of Tldc1 further enhances FLSs proliferation and inflammatory responses, thereby worsening joint damage in CIA rats. High expression of circTldc1 and its parent gene Tldc1 may serve as biomarkers for RA. Local regulation of circTldc1 and Tldc1 gene levels in the joint cavity may represent a potential strategy to improve joint damage and inflammation in RA.
Collapse
Affiliation(s)
- Xiaoyu Cai
- Department of Pharmacy, Hangzhou First People's Hospital, Hangzhou, 310006, China.
| | - Yao Yao
- Department of Pharmacy, Women's Hospital, Zhejiang University School of Medicine, Hangzhou, 310006, China
| | - Fujia Ren
- Department of Pharmacy, Hangzhou Women's Hospital, Hangzhou, China
| | - Shiwei Zhang
- Department of Anesthesiology, The First Affiliated Hospital of Zhejiang Chinese Medical University, Hangzhou, China
| |
Collapse
|
15
|
Ding S, Liao H, Huang F, Chen L, Guo W, Feng K, Huang T, Cai YD. Analyzing domain features of small proteins using a machine-learning method. Proteomics 2024:e2300302. [PMID: 38258387 DOI: 10.1002/pmic.202300302] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2023] [Revised: 01/14/2024] [Accepted: 01/15/2024] [Indexed: 01/24/2024]
Abstract
Small proteins (SPs) are a unique group of proteins that play crucial roles in many important biological processes. Exploring the biological function of SPs is necessary. In this study, the InterPro tool and the maximum correlation method were utilized to analyze functional domains of SPs. The purpose was to identify important functional domains that can indicate the essential differences between small and large protein sequences. First, the small and large proteins were represented by their functional domains via a one-hot scheme. Then, the MaxRel method was adopted to evaluate the relationships between each domain and the target variable, indicating small or large protein. The top 36 domain features were selected for further investigation. Among them, 14 were deemed to be highly related to SPs because they were annotated to SPs more frequently than large proteins. We found the involvement of functional domains, such as ubiquitin-conjugating enzyme/RWD-like, nuclear transport factor 2 domain, and alpha subunit of guanine nucleotide-binding protein (G-protein) in regulating the biological function of SPs. The involvement of these domains has been confirmed by other recent studies. Our findings indicate that protein functional domains may regulate small protein-related functions and predict their biological activity.
Collapse
Affiliation(s)
- ShiJian Ding
- School of Life Sciences, Shanghai University, Shanghai, China
| | | | - FeiMing Huang
- School of Life Sciences, Shanghai University, Shanghai, China
| | - Lei Chen
- College of Information Engineering, Shanghai Maritime University, Shanghai, China
| | - Wei Guo
- Key Laboratory of Stem Cell Biology, Shanghai Jiao Tong University School of Medicine (SJTUSM) & Shanghai Institutes for Biological Sciences (SIBS), Chinese Academy of Sciences (CAS), Shanghai, China
| | - KaiYan Feng
- Department of Computer Science, Guangdong AIB Polytechnic College, Guangzhou, China
| | - Tao Huang
- Bio-Med Big Data Center, CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China
- CAS Key Laboratory of Tissue Microenvironment and Tumor, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China
| | - Yu-Dong Cai
- School of Life Sciences, Shanghai University, Shanghai, China
| |
Collapse
|
16
|
Grandchamp A, Czuppon P, Bornberg-Bauer E. Quantification and modeling of turnover dynamics of de novo transcripts in Drosophila melanogaster. Nucleic Acids Res 2024; 52:274-287. [PMID: 38000384 PMCID: PMC10783523 DOI: 10.1093/nar/gkad1079] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2023] [Revised: 10/13/2023] [Accepted: 10/28/2023] [Indexed: 11/26/2023] Open
Abstract
Most of the transcribed eukaryotic genomes are composed of non-coding transcripts. Among these transcripts, some are newly transcribed when compared to outgroups and are referred to as de novo transcripts. De novo transcripts have been shown to play a major role in genomic innovations. However, little is known about the rates at which de novo transcripts are gained and lost in individuals of the same species. Here, we address this gap and estimate the de novo transcript turnover rate with an evolutionary model. We use DNA long reads and RNA short reads from seven geographically remote samples of inbred individuals of Drosophila melanogaster to detect de novo transcripts that are gained on a short evolutionary time scale. Overall, each sampled individual contains around 2500 unspliced de novo transcripts, with most of them being sample specific. We estimate that around 0.15 transcripts are gained per year, and that each gained transcript is lost at a rate around 5× 10-5 per year. This high turnover of transcripts suggests frequent exploration of new genomic sequences within species. These rate estimates are essential to comprehend the process and timescale of de novo gene birth.
Collapse
Affiliation(s)
- Anna Grandchamp
- Institute for Evolution and Biodiversity, University of Münster, Münster, Germany
| | - Peter Czuppon
- Institute for Evolution and Biodiversity, University of Münster, Münster, Germany
| | - Erich Bornberg-Bauer
- Institute for Evolution and Biodiversity, University of Münster, Münster, Germany
- Department of Protein Evolution, Max Planck Institute for Biology, Tübingen, Germany
| |
Collapse
|
17
|
Lyapina I, Fesenko I. Intracellular and Extracellular Peptidomes of the Model Plant, Physcomitrium patens. Methods Mol Biol 2024; 2758:375-385. [PMID: 38549025 DOI: 10.1007/978-1-0716-3646-6_20] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/02/2024]
Abstract
Here, we report our approach to peptidomic analysis of the plant model Physcomitrium patens. Intracellular and extracellular peptides were extracted under conditions preventing proteolytic digestion by endogenous proteases. The extracts were fractionated on size exclusion columns to isolate intracellular peptides and on reversed-phase cartridges to isolate extracellular peptides, with the isolated peptides subjected to LC-MS/MS analysis. Mass spectrometry data were analyzed for the presence of peptides derived from the known proteins or microproteins encoded by small open reading frames (<100 aa, smORFs) predicted in the moss genome. Experimental details are provided for each step.
Collapse
Affiliation(s)
- Irina Lyapina
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry RAS, Moscow, Russia
| | - Igor Fesenko
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry RAS, Moscow, Russia
| |
Collapse
|
18
|
Wright Z, Seymour M, Paszczak K, Truttmann T, Senn K, Stilp S, Jansen N, Gosz M, Goeden L, Anantharaman V, Aravind L, Waters LS. The small protein MntS evolved from a signal peptide and acquired a novel function regulating manganese homeostasis in Escherichia coli. Mol Microbiol 2024; 121:152-166. [PMID: 38104967 PMCID: PMC10842292 DOI: 10.1111/mmi.15206] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Revised: 11/17/2023] [Accepted: 11/24/2023] [Indexed: 12/19/2023]
Abstract
Small proteins (<50 amino acids) are emerging as ubiquitous and important regulators in organisms ranging from bacteria to humans, where they commonly bind to and regulate larger proteins during stress responses. However, fundamental aspects of small proteins, such as their molecular mechanism of action, downregulation after they are no longer needed, and their evolutionary provenance, are poorly understood. Here, we show that the MntS small protein involved in manganese (Mn) homeostasis binds and inhibits the MntP Mn transporter. Mn is crucial for bacterial survival in stressful environments but is toxic in excess. Thus, Mn transport is tightly controlled at multiple levels to maintain optimal Mn levels. The small protein MntS adds a new level of regulation for Mn transporters, beyond the known transcriptional and post-transcriptional control. We also found that MntS binds to itself in the presence of Mn, providing a possible mechanism of downregulating MntS activity to terminate its inhibition of MntP Mn export. MntS is homologous to the signal peptide of SitA, the periplasmic metal-binding subunit of a Mn importer. Remarkably, the homologous signal peptide regions can substitute for MntS, demonstrating a functional relationship between MntS and these signal peptides. Conserved gene neighborhoods support that MntS evolved from the signal peptide of an ancestral SitA protein, acquiring a life of its own with a distinct function in Mn homeostasis.
Collapse
Affiliation(s)
- Zachary Wright
- Department of Chemistry, 800 Algoma Blvd, University of Wisconsin, Oshkosh, WI 54901, USA
| | - Mackenzie Seymour
- Department of Chemistry, 800 Algoma Blvd, University of Wisconsin, Oshkosh, WI 54901, USA
| | - Kalista Paszczak
- Department of Chemistry, 800 Algoma Blvd, University of Wisconsin, Oshkosh, WI 54901, USA
| | - Taylor Truttmann
- Department of Chemistry, 800 Algoma Blvd, University of Wisconsin, Oshkosh, WI 54901, USA
| | - Katherine Senn
- Department of Chemistry, 800 Algoma Blvd, University of Wisconsin, Oshkosh, WI 54901, USA
| | - Samuel Stilp
- Department of Chemistry, 800 Algoma Blvd, University of Wisconsin, Oshkosh, WI 54901, USA
| | - Nickolas Jansen
- Department of Chemistry, 800 Algoma Blvd, University of Wisconsin, Oshkosh, WI 54901, USA
| | - Magdalyn Gosz
- Department of Chemistry, 800 Algoma Blvd, University of Wisconsin, Oshkosh, WI 54901, USA
| | - Lindsay Goeden
- Department of Chemistry, 800 Algoma Blvd, University of Wisconsin, Oshkosh, WI 54901, USA
| | - Vivek Anantharaman
- National Center for Biotechnology Information, National Library of Medicine, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - L. Aravind
- National Center for Biotechnology Information, National Library of Medicine, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Lauren S. Waters
- Department of Chemistry, 800 Algoma Blvd, University of Wisconsin, Oshkosh, WI 54901, USA
| |
Collapse
|
19
|
Yao Y, Cai X, Zhang M, Zheng Y, Fei W, Zhao M, Zheng C. Circ-Plod2 destabilizes Mpo mRNA by binding to IGF2BP2 to promote osteogenic differentiation of bone marrow mesenchymal stem cells. Eur J Pharmacol 2023; 961:176192. [PMID: 37981258 DOI: 10.1016/j.ejphar.2023.176192] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2023] [Revised: 10/11/2023] [Accepted: 11/07/2023] [Indexed: 11/21/2023]
Abstract
Osteogenic differentiation, proliferation, and/or apoptosis of bone marrow mesenchymal stem cells (BMSCs) are involved in the progression of postmenopausal osteoporosis (PMO). However, circular RNA (circRNA)-mediated changes in the cellular function of BMSCs in PMO are still unclear. This study revealed the excellent ability of circ-Plod2 to promote osteogenic differentiation of BMSCs and its molecular mechanisms. In this study, ovariectomized (OVX) rats and control (Sham) rats were used to simulate PMO. Initially, we found that the expression of circ-Plod2 in OVX BMSCs is down-regulated and the expression of the Mpo gene is up-regulated by sequencing and verification. Further, we confirmed that circ-Plod2 is located in the cytoplasm and belongs to exon-type circRNA. Interestingly, circ-Plod2 promotes Mpo-dependent osteogenic differentiation of BMSCs without affecting proliferation, apoptosis, adipogenic differentiation, or chondrogenic differentiation of BMSCs. Mechanistically, we demonstrated that circ-Plod2 specifically binds IGF2BP2 to form an RNA-protein complex that destabilizes Mpo mRNA. Overexpression of circ-Plod2 in the bone marrow cavity effectively alleviated osteoporosis in OVX rats and inhibited the expression of MPO in BMSCs. Together, this study reveals that circ-Plod2 destabilizes Mpo mRNA by binding to IGF2BP2 to promote osteogenic differentiation of BMSCs to alleviate osteoporosis. The findings of this study may provide biomarkers for the diagnosis of PMO, and may also provide potential strategies for the clinical treatment of PMO.
Collapse
Affiliation(s)
- Yao Yao
- Department of Pharmacy, Women's Hospital, Zhejiang University School of Medicine, Hangzhou, 310006, China
| | - Xiaoyu Cai
- Department of Clinical Pharmacology, Key Laboratory of Clinical Cancer Pharmacology and Toxicology Research of Zhejiang Province, Affiliated Hangzhou First People's Hospital, Cancer Center, Zhejiang University School of Medicine, Hangzhou, 310006, China
| | - Meng Zhang
- Department of Pharmacy, Women's Hospital, Zhejiang University School of Medicine, Hangzhou, 310006, China
| | - Yongquan Zheng
- Department of Pharmacy, Women's Hospital, Zhejiang University School of Medicine, Hangzhou, 310006, China
| | - Weidong Fei
- Department of Pharmacy, Women's Hospital, Zhejiang University School of Medicine, Hangzhou, 310006, China
| | - Mengdan Zhao
- Department of Pharmacy, Women's Hospital, Zhejiang University School of Medicine, Hangzhou, 310006, China.
| | - Caihong Zheng
- Department of Pharmacy, Women's Hospital, Zhejiang University School of Medicine, Hangzhou, 310006, China.
| |
Collapse
|
20
|
Frumkin I, Laub MT. Selection of a de novo gene that can promote survival of Escherichia coli by modulating protein homeostasis pathways. Nat Ecol Evol 2023; 7:2067-2079. [PMID: 37945946 PMCID: PMC10697842 DOI: 10.1038/s41559-023-02224-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2023] [Accepted: 09/12/2023] [Indexed: 11/12/2023]
Abstract
Cellular novelty can emerge when non-functional loci become functional genes in a process termed de novo gene birth. But how proteins with random amino acid sequences beneficially integrate into existing cellular pathways remains poorly understood. We screened ~108 genes, generated from random nucleotide sequences and devoid of homology to natural genes, for their ability to rescue growth arrest of Escherichia coli cells producing the ribonuclease toxin MazF. We identified ~2,000 genes that could promote growth, probably by reducing transcription from the promoter driving toxin expression. Additionally, one random protein, named Random antitoxin of MazF (RamF), modulated protein homeostasis by interacting with chaperones, leading to MazF proteolysis and a consequent loss of its toxicity. Finally, we demonstrate that random proteins can improve during evolution by identifying beneficial mutations that turned RamF into a more efficient inhibitor. Our work provides a mechanistic basis for how de novo gene birth can produce functional proteins that effectively benefit cells evolving under stress.
Collapse
Affiliation(s)
- Idan Frumkin
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Michael T Laub
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA.
- Howard Hughes Medical Institute, Cambridge, MA, USA.
| |
Collapse
|
21
|
Bosch JA, Keith N, Escobedo F, Fisher WW, LaGraff JT, Rabasco J, Wan KH, Weiszmann R, Hu Y, Kondo S, Brown JB, Perrimon N, Celniker SE. Molecular and functional characterization of the Drosophila melanogaster conserved smORFome. Cell Rep 2023; 42:113311. [PMID: 37889754 PMCID: PMC10843857 DOI: 10.1016/j.celrep.2023.113311] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2022] [Revised: 08/24/2023] [Accepted: 10/04/2023] [Indexed: 10/29/2023] Open
Abstract
Short polypeptides encoded by small open reading frames (smORFs) are ubiquitously found in eukaryotic genomes and are important regulators of physiology, development, and mitochondrial processes. Here, we focus on a subset of 298 smORFs that are evolutionarily conserved between Drosophila melanogaster and humans. Many of these smORFs are conserved broadly in the bilaterian lineage, and ∼182 are conserved in plants. We observe remarkably heterogeneous spatial and temporal expression patterns of smORF transcripts-indicating wide-spread tissue-specific and stage-specific mitochondrial architectures. In addition, an analysis of annotated functional domains reveals a predicted enrichment of smORF polypeptides localizing to mitochondria. We conduct an embryonic ribosome profiling experiment and find support for translation of 137 of these smORFs during embryogenesis. We further embark on functional characterization using CRISPR knockout/activation, RNAi knockdown, and cDNA overexpression, revealing diverse phenotypes. This study underscores the importance of identifying smORF function in disease and phenotypic diversity.
Collapse
Affiliation(s)
- Justin A Bosch
- Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA 02115, USA
| | - Nathan Keith
- Division of Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA; Division of Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Felipe Escobedo
- Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA 02115, USA
| | - William W Fisher
- Division of Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - James Thai LaGraff
- Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA 02115, USA
| | - Jorden Rabasco
- Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA 02115, USA
| | - Kenneth H Wan
- Division of Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Richard Weiszmann
- Division of Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Yanhui Hu
- Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA 02115, USA
| | - Shu Kondo
- Laboratory of Invertebrate Genetics, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan
| | - James B Brown
- Division of Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA; Division of Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA.
| | - Norbert Perrimon
- Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA 02115, USA; Howard Hughes Medical Institute, Harvard Medical School, Boston, MA 02115, USA.
| | - Susan E Celniker
- Division of Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA.
| |
Collapse
|
22
|
Zhao B, Zhao J, Wang M, Guo Y, Mehmood A, Wang W, Xiong Y, Luo S, Wei DQ, Zhao XQ, Wang Y. Exploring microproteins from various model organisms using the mip-mining database. BMC Genomics 2023; 24:661. [PMID: 37919660 PMCID: PMC10623795 DOI: 10.1186/s12864-023-09735-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2023] [Accepted: 10/12/2023] [Indexed: 11/04/2023] Open
Abstract
Microproteins, prevalent across all kingdoms of life, play a crucial role in cell physiology and human health. Although global gene transcription is widely explored and abundantly available, our understanding of microprotein functions using transcriptome data is still limited. To mitigate this problem, we present a database, Mip-mining ( https://weilab.sjtu.edu.cn/mipmining/ ), underpinned by high-quality RNA-sequencing data exclusively aimed at analyzing microprotein functions. The Mip-mining hosts 336 sets of high-quality transcriptome data from 8626 samples and nine representative living organisms, including microorganisms, plants, animals, and humans, in our Mip-mining database. Our database specifically provides a focus on a range of diseases and environmental stress conditions, taking into account chemical, physical, biological, and diseases-related stresses. Comparatively, our platform enables customized analysis by inputting desired data sets with self-determined cutoff values. The practicality of Mip-mining is demonstrated by identifying essential microproteins in different species and revealing the importance of ATP15 in the acetic acid stress tolerance of budding yeast. We believe that Mip-mining will facilitate a greater understanding and application of microproteins in biotechnology. Moreover, it will be beneficial for designing therapeutic strategies under various biological conditions.
Collapse
Affiliation(s)
- Bowen Zhao
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Jing Zhao
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Muyao Wang
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Yangfan Guo
- Central Laboratory of Yan'an Hospital Affiliated to Kunming Medical University, Kunming, 650051, China
| | - Aamir Mehmood
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Weibin Wang
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Yi Xiong
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China
- Shanghai Artificial Intelligence Laboratory, Shanghai, 200232, China
| | - Shenggan Luo
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Dong-Qing Wei
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China.
- Zhongjing Research and Industrialization Institute of Chinese Medicine, Zhongguancun Scientific Park, Meixi, Nayang, Henan, 473006, China.
- Peng Cheng Laboratory, Vanke Cloud City Phase I Building 8, Xili Street, Nanshan District, Shenzhen, 518055, Guangdong, China.
| | - Xin-Qing Zhao
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China.
| | - Yanjing Wang
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China.
- Engineering Research Center of Cell & Therapeutic Antibody, School of Pharmacy, Shanghai Jiao Tong University, Shanghai, 200240, China.
| |
Collapse
|
23
|
Fan KT, Hsu CW, Chen YR. Mass spectrometry in the discovery of peptides involved in intercellular communication: From targeted to untargeted peptidomics approaches. MASS SPECTROMETRY REVIEWS 2023; 42:2404-2425. [PMID: 35765846 DOI: 10.1002/mas.21789] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/23/2021] [Revised: 03/17/2022] [Accepted: 04/08/2022] [Indexed: 06/15/2023]
Abstract
Endogenous peptide hormones represent an essential class of biomolecules, which regulate cell-cell communications in diverse physiological processes of organisms. Mass spectrometry (MS) has been developed to be a powerful technology for identifying and quantifying peptides in a highly efficient manner. However, it is difficult to directly identify these peptide hormones due to their diverse characteristics, dynamic regulations, low abundance, and existence in a complicated biological matrix. Here, we summarize and discuss the roles of targeted and untargeted MS in discovering peptide hormones using bioassay-guided purification, bioinformatics screening, or the peptidomics-based approach. Although the peptidomics approach is expected to discover novel peptide hormones unbiasedly, only a limited number of successful cases have been reported. The critical challenges and corresponding measures for peptidomics from the steps of sample preparation, peptide extraction, and separation to the MS data acquisition and analysis are also discussed. We also identify emerging technologies and methods that can be integrated into the discovery platform toward the comprehensive study of endogenous peptide hormones.
Collapse
Affiliation(s)
- Kai-Ting Fan
- Agricultural Biotechnology Research Center, Academia Sinica, Taipei, Taiwan
| | - Chia-Wei Hsu
- Agricultural Biotechnology Research Center, Academia Sinica, Taipei, Taiwan
| | - Yet-Ran Chen
- Agricultural Biotechnology Research Center, Academia Sinica, Taipei, Taiwan
| |
Collapse
|
24
|
Fan SM, Li ZQ, Zhang SZ, Chen LY, Wei XY, Liang J, Zhao XQ, Su C. Multi-integrated approach for unraveling small open reading frames potentially associated with secondary metabolism in Streptomyces. mSystems 2023; 8:e0024523. [PMID: 37712700 PMCID: PMC10654065 DOI: 10.1128/msystems.00245-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2023] [Accepted: 07/20/2023] [Indexed: 09/16/2023] Open
Abstract
IMPORTANCE Due to their small size and special chemical features, small open reading frame (smORF)-encoding peptides (SEPs) are often neglected. However, they may play critical roles in regulating gene expression, enzyme activity, and metabolite production. Studies on bacterial microproteins have mainly focused on pathogenic bacteria, which are importance to systematically investigate SEPs in streptomycetes and are rich sources of bioactive secondary metabolites. Our study is the first to perform a global identification of smORFs in streptomycetes. We established a peptidogenomic workflow for non-model microbial strains and identified multiple novel smORFs that are potentially linked to secondary metabolism in streptomycetes. Our multi-integrated approach in this study is meaningful to improve the quality and quantity of the detected smORFs. Ultimately, the workflow we established could be extended to other organisms and would benefit the genome mining of microproteins with critical functions for regulation and engineering useful microorganisms.
Collapse
Affiliation(s)
- Si-Min Fan
- National Engineering Laboratory for Resource Developing of Endangered Chinese Crude Drugs in Northwest China, College of Life Sciences, Shaanxi Normal University, Shaanxi, China
| | - Ze-Qi Li
- National Engineering Laboratory for Resource Developing of Endangered Chinese Crude Drugs in Northwest China, College of Life Sciences, Shaanxi Normal University, Shaanxi, China
| | - Shi-Zhe Zhang
- National Engineering Laboratory for Resource Developing of Endangered Chinese Crude Drugs in Northwest China, College of Life Sciences, Shaanxi Normal University, Shaanxi, China
| | - Liang-Yu Chen
- ProteinT (Tianjin) biotechnology Co. Ltd., Tianjin, China
| | - Xi-Ying Wei
- National Engineering Laboratory for Resource Developing of Endangered Chinese Crude Drugs in Northwest China, College of Life Sciences, Shaanxi Normal University, Shaanxi, China
| | - Jian Liang
- National Engineering Laboratory for Resource Developing of Endangered Chinese Crude Drugs in Northwest China, College of Life Sciences, Shaanxi Normal University, Shaanxi, China
- College of Biology and Geography, Yili Normal University, Yining, China
| | - Xin-Qing Zhao
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai Jiao, China
| | - Chun Su
- National Engineering Laboratory for Resource Developing of Endangered Chinese Crude Drugs in Northwest China, College of Life Sciences, Shaanxi Normal University, Shaanxi, China
| |
Collapse
|
25
|
Srikant S, Gaudet R, Murray AW. Extending the reach of homology by using successive computational filters to find yeast pheromone genes. Curr Biol 2023; 33:4098-4110.e3. [PMID: 37699395 PMCID: PMC10592104 DOI: 10.1016/j.cub.2023.08.039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2022] [Revised: 07/04/2023] [Accepted: 08/14/2023] [Indexed: 09/14/2023]
Abstract
The mating of fungi depends on pheromones that mediate communication between two mating types. Most species use short peptides as pheromones, which are either unmodified (e.g., α-factor in Saccharomyces cerevisiae) or C-terminally farnesylated (e.g., a-factor in S. cerevisiae). Peptide pheromones have been found by genetics or biochemistry in a small number of fungi, but their short sequences and modest conservation make it impossible to detect homologous sequences in most species. To overcome this problem, we used a four-step computational pipeline to identify candidate a-factor genes in sequenced genomes of the Saccharomycotina, the fungal clade that contains most of the yeasts: we require that candidate genes have a C-terminal prenylation motif, are shorter than 100 amino acids long, and contain a proteolytic-processing motif upstream of the potential mature pheromone sequence and that closely related species contain highly conserved homologs of the potential mature pheromone sequence. Additional manual curation exploits the observation that many species carry more than one a-factor gene, encoding identical or nearly identical pheromones. From 332 Saccharomycotina genomes, we identified strong candidate pheromone genes in 241 genomes, covering 13 clades that are each separated from each other by at least 100 million years, the time required for evolution to remove detectable sequence homology among small pheromone genes. For one small clade, the Yarrowia, we demonstrated that our algorithm found the a-factor genes: deleting all four related genes in the a-mating type of Yarrowia lipolytica prevents mating.
Collapse
Affiliation(s)
- Sriram Srikant
- Department of Molecular and Cellular Biology, Harvard University, Oxford Street, Cambridge, MA 02138, USA
| | - Rachelle Gaudet
- Department of Molecular and Cellular Biology, Harvard University, Oxford Street, Cambridge, MA 02138, USA
| | - Andrew W Murray
- Department of Molecular and Cellular Biology, Harvard University, Oxford Street, Cambridge, MA 02138, USA.
| |
Collapse
|
26
|
Markus D, Pelletier A, Boube M, Port F, Boutros M, Payre F, Obermayer B, Zanet J. The pleiotropic functions of Pri smORF peptides synchronize leg development regulators. PLoS Genet 2023; 19:e1011004. [PMID: 37903161 PMCID: PMC10635573 DOI: 10.1371/journal.pgen.1011004] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2023] [Revised: 11/09/2023] [Accepted: 10/03/2023] [Indexed: 11/01/2023] Open
Abstract
The last decade witnesses the emergence of the abundant family of smORF peptides, encoded by small ORF (<100 codons), whose biological functions remain largely unexplored. Bioinformatic analyses here identify hundreds of putative smORF peptides expressed in Drosophila imaginal leg discs. Thanks to a functional screen in leg, we found smORF peptides involved in morphogenesis, including the pioneer smORF peptides Pri. Since we identified its target Ubr3 in the epidermis and pri was known to control leg development through poorly understood mechanisms, we investigated the role of Ubr3 in mediating pri function in leg. We found that pri plays several roles during leg development both in patterning and in cell survival. During larval stage, pri activates independently of Ubr3 tarsal transcriptional programs and Notch and EGFR signaling pathways, whereas at larval pupal transition, Pri peptides cooperate with Ubr3 to insure cell survival and leg morphogenesis. Our results highlight Ubr3 dependent and independent functions of Pri peptides and their pleiotropy. Moreover, we reveal that the smORF peptide family is a reservoir of overlooked developmental regulators, displaying distinct molecular functions and orchestrating leg development.
Collapse
Affiliation(s)
- Damien Markus
- Molecular, Cellular and Developmental Biology Department (MCD), Centre de Biologie Intégrative (CBI), CNRS, UPS, University of Toulouse, Toulouse, France
| | - Aurore Pelletier
- Molecular, Cellular and Developmental Biology Department (MCD), Centre de Biologie Intégrative (CBI), CNRS, UPS, University of Toulouse, Toulouse, France
| | - Muriel Boube
- Molecular, Cellular and Developmental Biology Department (MCD), Centre de Biologie Intégrative (CBI), CNRS, UPS, University of Toulouse, Toulouse, France
| | - Fillip Port
- Division Signaling and Functional Genomics, German Cancer Research Center (DKFZ) and Heidelberg University, Heidelberg, Germany
| | - Michael Boutros
- Division Signaling and Functional Genomics, German Cancer Research Center (DKFZ) and Heidelberg University, Heidelberg, Germany
| | - François Payre
- Molecular, Cellular and Developmental Biology Department (MCD), Centre de Biologie Intégrative (CBI), CNRS, UPS, University of Toulouse, Toulouse, France
| | - Benedikt Obermayer
- Core Unit Bioinformatics (CUBI), Berlin Institute of Health at Charité Universitätsmedizin-Berlin, Berlin, Germany
| | - Jennifer Zanet
- Molecular, Cellular and Developmental Biology Department (MCD), Centre de Biologie Intégrative (CBI), CNRS, UPS, University of Toulouse, Toulouse, France
| |
Collapse
|
27
|
Zhang M, Zhao J, Wu J, Wang Y, Zhuang M, Zou L, Mao R, Jiang B, Liu J, Song X. In-depth characterization and identification of translatable lncRNAs. Comput Biol Med 2023; 164:107243. [PMID: 37453378 DOI: 10.1016/j.compbiomed.2023.107243] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2023] [Revised: 06/16/2023] [Accepted: 07/07/2023] [Indexed: 07/18/2023]
Abstract
Long non-coding RNAs (LncRNAs) are non-protein coding transcripts more than 200 nucleotides in length. Deep sequencing technologies have unveiled lncRNAs can harbor translatable short open reading frames (sORFs). Yet the regulatory mechanisms governing lncRNA translation events remain poorly understood. Here, we exhaustively detected the sequence, functional element, and structure features relevant to lncRNA translation in human. Extensive identification and analysis reveal that translatable lncRNAs contain richer protein-coding related sequence features, cap-dependent and cap-independent translation initiation mechanisms, and more stable secondary structures, as compared to untranslatable lncRNAs. These findings strongly support lncRNAs serve as a repository for the production of new small peptides. Based on the feature fusion affecting translation and the extreme gradient boosting (XGBoost) algorithm, we developed the first computational tool that dedicated for predicting translatable lncRNAs, named TransLncPred. Benchmark experimental results show that our method outperforms several state-of-the-art RNA coding potential prediction tools on the same training and testing datasets. The 100-time 10-fold cross-validation tests also demonstrate that regulatory element-derived features, especially N7-methylguanosine (m7G) and internal ribosome entry site (IRES), contribute to the improvement in predictive performance.
Collapse
Affiliation(s)
- Meng Zhang
- Department of Biomedical Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing, 211106, China
| | - Jian Zhao
- Department of Biomedical Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing, 211106, China.
| | - Jing Wu
- School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, 211166, China
| | - Yulan Wang
- Department of Biomedical Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing, 211106, China
| | - Minhui Zhuang
- Department of Biomedical Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing, 211106, China
| | - Lingxiao Zou
- Department of Biomedical Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing, 211106, China
| | - Renlong Mao
- Department of Biomedical Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing, 211106, China
| | - Bin Jiang
- College of Automation Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing, 211106, China
| | - Jingjing Liu
- Department of Biomedical Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing, 211106, China
| | - Xiaofeng Song
- Department of Biomedical Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing, 211106, China.
| |
Collapse
|
28
|
Anders J, Stadler PF. RNAcode_Web - Convenient identification of evolutionary conserved protein coding regions. J Integr Bioinform 2023; 20:jib-2022-0046. [PMID: 37615674 PMCID: PMC10757073 DOI: 10.1515/jib-2022-0046] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2022] [Accepted: 02/15/2023] [Indexed: 08/25/2023] Open
Abstract
The differentiation of regions with coding potential from non-coding regions remains a key task in computational biology. Methods such as RNAcode that exploit patterns of sequence conservation for this task have a substantial advantage in classification accuracy in particular for short coding sequences, compared to methods that rely on a single input sequence. However, they require sequence alignments as input. Frequently, suitable multiple sequence alignments are not readily available and are tedious, and sometimes difficult to construct. We therefore introduce here a new web service that provides access to the well-known coding sequence detector RNAcode with minimal user overhead. It requires as input only a single target nucleotide sequence. The service automates the collection, selection, and preparation of homologous sequences from the NCBI database, as well as the construction of the multiple sequence alignment that are needed as input for RNAcode. The service automatizes the entire pre- and postprocessing and thus makes the investigation of specific genomic regions for previously unannotated coding regions, such as small peptides or additional introns, a simple task that is easily accessible to non-expert users. RNAcode_Web is accessible online at rnacode.bioinf.uni-leipzig.de.
Collapse
Affiliation(s)
- John Anders
- Bioinformatics Group, Department of Computer Science, and Interdisciplinary Center for Bioinformatics, Universität Leipzig, Härtelstraße 16–18, D-04107Leipzig, Germany
| | - Peter F. Stadler
- Bioinformatics Group, Department of Computer Science, and Interdisciplinary Center for Bioinformatics, Universität Leipzig, Härtelstraße 16–18, D-04107Leipzig, Germany
- Max-Planck-Institute for Mathematics in the Sciences, Inselstraße 22, D-04103Leipzig, Germany
- Institute for Theoretical Chemistry, University of Vienna, Währingerstraße 17, A-1090Wien, Austria
- Facultad de Ciencias, Universidad National de Colombia, Sede Bogotá, Colombia
- Santa Fe Institute, 1399 Hyde Park Rd., Santa Fe, NM87501, USA
| |
Collapse
|
29
|
Sánchez-Salvador A, González-de la Fuente S, Aguado B, Yates PA, Requena JM. Refinement of Leishmania donovani Genome Annotations in the Light of Ribosome-Protected mRNAs Fragments (Ribo-Seq Data). Genes (Basel) 2023; 14:1637. [PMID: 37628688 PMCID: PMC10454037 DOI: 10.3390/genes14081637] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2023] [Revised: 08/11/2023] [Accepted: 08/15/2023] [Indexed: 08/27/2023] Open
Abstract
Advances in next-generation sequencing methodologies have facilitated the assembly of an ever-increasing number of genomes. Gene annotations are typically conducted via specialized software, but the most accurate results require additional manual curation that incorporates insights derived from functional and bioinformatic analyses (e.g., transcriptomics, proteomics, and phylogenetics). In this study, we improved the annotation of the Leishmania donovani (strain HU3) genome using publicly available data from the deep sequencing of ribosome-protected mRNA fragments (Ribo-Seq). As a result of this analysis, we uncovered 70 previously non-annotated protein-coding genes and improved the annotation of around 600 genes. Additionally, we present evidence for small upstream open reading frames (uORFs) in a significant number of transcripts, indicating their potential role in the translational regulation of gene expression. The bioinformatics pipelines developed for these analyses can be used to improve the genome annotations of other organisms for which Ribo-Seq data are available. The improvements provided by these studies will bring us closer to the ultimate goal of a complete and accurately annotated L. donovani genome and will enhance future transcriptomics, proteomics, and genetics studies.
Collapse
Affiliation(s)
- Alejandro Sánchez-Salvador
- Centro de Biología Molecular Severo Ochoa (CSIC-UAM), Departamento de Biología Molecular, Instituto Universitario de Biología Molecular (IUBM), Universidad Autónoma de Madrid, 28049 Madrid, Spain;
| | - Sandra González-de la Fuente
- Centro de Biología Molecular Severo Ochoa (CSIC-UAM), Genomic and NGS Facility (GENGS), 28049 Madrid, Spain; (S.G.-d.l.F.); (B.A.)
| | - Begoña Aguado
- Centro de Biología Molecular Severo Ochoa (CSIC-UAM), Genomic and NGS Facility (GENGS), 28049 Madrid, Spain; (S.G.-d.l.F.); (B.A.)
| | - Phillip A. Yates
- Department of Chemical Physiology & Biochemistry, Oregon Health & Science University, Portland, OR 97239, USA
| | - Jose M. Requena
- Centro de Biología Molecular Severo Ochoa (CSIC-UAM), Departamento de Biología Molecular, Instituto Universitario de Biología Molecular (IUBM), Universidad Autónoma de Madrid, 28049 Madrid, Spain;
- Centro de Investigación Biomédica en Red (CIBERINFEC), Instituto de Salud Carlos III, 28029 Madrid, Spain
| |
Collapse
|
30
|
Ravillah D, Kieber-Emmons AL, Singh S, Keerthy K, Blum AE, Guda K. Discovery and Initial Characterization of Long Intergenic Noncoding RNAs Associated With Esophageal Adenocarcinoma. Gastroenterology 2023; 165:505-508.e7. [PMID: 37182784 PMCID: PMC10524377 DOI: 10.1053/j.gastro.2023.04.037] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/01/2022] [Revised: 04/13/2023] [Accepted: 04/26/2023] [Indexed: 05/16/2023]
Affiliation(s)
- Durgadevi Ravillah
- Division of General Medical Sciences-Oncology, Case Western Reserve University School of Medicine, Cleveland, Ohio
| | | | - Salendra Singh
- Precision Immuno-Oncology, Lerner Research Institute, Cleveland Clinic Foundation, Cleveland, Ohio
| | - Komal Keerthy
- Digestive Health Research Institute, Case Western Reserve University School of Medicine, Cleveland, Ohio
| | - Andrew E Blum
- Division of General Medical Sciences-Oncology, Case Western Reserve University School of Medicine, Cleveland, Ohio; Digestive Health Research Institute, Case Western Reserve University School of Medicine, Cleveland, Ohio; Division of Gastroenterology, Northeast Ohio Veteran Affairs Healthcare System, Cleveland, Ohio
| | - Kishore Guda
- Division of General Medical Sciences-Oncology, Case Western Reserve University School of Medicine, Cleveland, Ohio; Digestive Health Research Institute, Case Western Reserve University School of Medicine, Cleveland, Ohio; Department of Pathology, Case Western Reserve University School of Medicine, Cleveland, Ohio; Case Comprehensive Cancer Center, Case Western Reserve University School of Medicine, Cleveland, Ohio.
| |
Collapse
|
31
|
Chen Y, Cao X, Loh KH, Slavoff SA. Chemical labeling and proteomics for characterization of unannotated small and alternative open reading frame-encoded polypeptides. Biochem Soc Trans 2023; 51:1071-1082. [PMID: 37171061 PMCID: PMC10317152 DOI: 10.1042/bst20221074] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2023] [Revised: 03/27/2023] [Accepted: 04/13/2023] [Indexed: 05/13/2023]
Abstract
Thousands of unannotated small and alternative open reading frames (smORFs and alt-ORFs, respectively) have recently been revealed in mammalian genomes. While hundreds of mammalian smORF- and alt-ORF-encoded proteins (SEPs and alt-proteins, respectively) affect cell proliferation, the overwhelming majority of smORFs and alt-ORFs remain uncharacterized at the molecular level. Complicating the task of identifying the biological roles of smORFs and alt-ORFs, the SEPs and alt-proteins that they encode exhibit limited sequence homology to protein domains of known function. Experimental techniques for the functionalization of these gene classes are therefore required. Approaches combining chemical labeling and quantitative proteomics have greatly advanced our ability to identify and characterize functional SEPs and alt-proteins in high throughput. In this review, we briefly describe the principles of proteomic discovery of SEPs and alt-proteins, then summarize how these technologies interface with chemical labeling for identification of SEPs and alt-proteins with specific properties, as well as in defining the interactome of SEPs and alt-proteins.
Collapse
Affiliation(s)
- Yanran Chen
- Department of Chemistry, Yale University, New Haven, CT, U.S.A
- Institute for Biomolecular Design and Discovery, Yale University, West Haven, CT, U.S.A
| | - Xiongwen Cao
- Department of Chemistry, Yale University, New Haven, CT, U.S.A
- Institute for Biomolecular Design and Discovery, Yale University, West Haven, CT, U.S.A
- Department of Comparative Medicine, Yale University School of Medicine, New Haven, CT, U.S.A
- Shanghai Key Laboratory of Regulatory Biology, Institute of Biomedical Sciences and School of Life Sciences, East China Normal University, Shanghai, China
| | - Ken H. Loh
- Institute for Biomolecular Design and Discovery, Yale University, West Haven, CT, U.S.A
- Department of Comparative Medicine, Yale University School of Medicine, New Haven, CT, U.S.A
| | - Sarah A. Slavoff
- Department of Chemistry, Yale University, New Haven, CT, U.S.A
- Institute for Biomolecular Design and Discovery, Yale University, West Haven, CT, U.S.A
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, U.S.A
| |
Collapse
|
32
|
Dong X, Zhang K, Xun C, Chu T, Liang S, Zeng Y, Liu Z. Small Open Reading Frame-Encoded Micro-Peptides: An Emerging Protein World. Int J Mol Sci 2023; 24:10562. [PMID: 37445739 DOI: 10.3390/ijms241310562] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2023] [Revised: 06/20/2023] [Accepted: 06/21/2023] [Indexed: 07/15/2023] Open
Abstract
Small open reading frames (sORFs) are often overlooked features in genomes. In the past, they were labeled as noncoding or "transcriptional noise". However, accumulating evidence from recent years suggests that sORFs may be transcribed and translated to produce sORF-encoded polypeptides (SEPs) with less than 100 amino acids. The vigorous development of computational algorithms, ribosome profiling, and peptidome has facilitated the prediction and identification of many new SEPs. These SEPs were revealed to be involved in a wide range of basic biological processes, such as gene expression regulation, embryonic development, cellular metabolism, inflammation, and even carcinogenesis. To effectively understand the potential biological functions of SEPs, we discuss the history and development of the newly emerging research on sORFs and SEPs. In particular, we review a range of recently discovered bioinformatics tools for identifying, predicting, and validating SEPs as well as a variety of biochemical experiments for characterizing SEP functions. Lastly, this review underlines the challenges and future directions in identifying and validating sORFs and their encoded micropeptides, providing a significant reference for upcoming research on sORF-encoded peptides.
Collapse
Affiliation(s)
- Xiaoping Dong
- National & Local Joint Engineering Laboratory of Animal Peptide Drug Development, College of Life Sciences, Hunan Normal University, Changsha 410081, China
- Peptide and Small Molecule Drug R&D Platform, Furong Laboratory, Hunan Normal University, Changsha 410081, China
| | - Kun Zhang
- The State Key Laboratory of Developmental Biology of Freshwater Fish, College of Life Science, Hunan Normal University, Changsha 410081, China
| | - Chengfeng Xun
- National & Local Joint Engineering Laboratory of Animal Peptide Drug Development, College of Life Sciences, Hunan Normal University, Changsha 410081, China
- Peptide and Small Molecule Drug R&D Platform, Furong Laboratory, Hunan Normal University, Changsha 410081, China
| | - Tianqi Chu
- National & Local Joint Engineering Laboratory of Animal Peptide Drug Development, College of Life Sciences, Hunan Normal University, Changsha 410081, China
- Peptide and Small Molecule Drug R&D Platform, Furong Laboratory, Hunan Normal University, Changsha 410081, China
| | - Songping Liang
- National & Local Joint Engineering Laboratory of Animal Peptide Drug Development, College of Life Sciences, Hunan Normal University, Changsha 410081, China
- Peptide and Small Molecule Drug R&D Platform, Furong Laboratory, Hunan Normal University, Changsha 410081, China
| | - Yong Zeng
- National & Local Joint Engineering Laboratory of Animal Peptide Drug Development, College of Life Sciences, Hunan Normal University, Changsha 410081, China
- Peptide and Small Molecule Drug R&D Platform, Furong Laboratory, Hunan Normal University, Changsha 410081, China
- The State Key Laboratory of Developmental Biology of Freshwater Fish, College of Life Science, Hunan Normal University, Changsha 410081, China
| | - Zhonghua Liu
- National & Local Joint Engineering Laboratory of Animal Peptide Drug Development, College of Life Sciences, Hunan Normal University, Changsha 410081, China
- Peptide and Small Molecule Drug R&D Platform, Furong Laboratory, Hunan Normal University, Changsha 410081, China
| |
Collapse
|
33
|
Wright Z, Seymour M, Paszczak K, Truttmann T, Senn K, Stilp S, Jansen N, Gosz M, Goeden L, Anantharaman V, Aravind L, Waters LS. The small protein MntS evolved from a signal peptide and acquired a novel function regulating manganese homeostasis in Escherichia coli. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.06.02.543501. [PMID: 37398132 PMCID: PMC10312517 DOI: 10.1101/2023.06.02.543501] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/04/2023]
Abstract
Small proteins (< 50 amino acids) are emerging as ubiquitous and important regulators in organisms ranging from bacteria to humans, where they commonly bind to and regulate larger proteins during stress responses. However, fundamental aspects of small proteins, such as their molecular mechanism of action, downregulation after they are no longer needed, and their evolutionary provenance are poorly understood. Here we show that the MntS small protein involved in manganese (Mn) homeostasis binds and inhibits the MntP Mn transporter. Mn is crucial for bacterial survival in stressful environments, but is toxic in excess. Thus, Mn transport is tightly controlled at multiple levels to maintain optimal Mn levels. The small protein MntS adds a new level of regulation for Mn transporters, beyond the known transcriptional and post-transcriptional control. We also found that MntS binds to itself in the presence of Mn, providing a possible mechanism of downregulating MntS activity to terminate its inhibition of MntP Mn export. MntS is homologous to the signal peptide of SitA, the periplasmic metal-binding subunit of a Mn importer. Remarkably, the homologous signal peptide regions can substitute for MntS, demonstrating a functional relationship between MntS and these signal peptides. Conserved gene-neighborhoods support that MntS evolved from an ancestral SitA, acquiring a life of its own with a distinct function in Mn homeostasis. Significance This study demonstrates that the MntS small protein binds and inhibits the MntP Mn exporter, adding another layer to the complex regulation of Mn homeostasis. MntS also interacts with itself in cells with Mn, which could prevent it from regulating MntP. We propose that MntS and other small proteins might sense environmental signals and shut off their own regulation via binding to ligands (e.g., metals) or other proteins. We also provide evidence that MntS evolved from the signal peptide region of the Mn importer, SitA. Homologous SitA signal peptides can recapitulate MntS activities, showing that they have a second function beyond protein secretion. Overall, we establish that small proteins can emerge and develop novel functionalities from gene remnants.
Collapse
|
34
|
Grandchamp A, Kühl L, Lebherz M, Brüggemann K, Parsch J, Bornberg-Bauer E. Population genomics reveals mechanisms and dynamics of de novo expressed open reading frame emergence in Drosophila melanogaster. Genome Res 2023; 33:872-890. [PMID: 37442576 PMCID: PMC10519401 DOI: 10.1101/gr.277482.122] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2022] [Accepted: 06/06/2023] [Indexed: 07/15/2023]
Abstract
Novel genes are essential for evolutionary innovations and differ substantially even between closely related species. Recently, multiple studies across many taxa showed that some novel genes arise de novo, that is, from previously noncoding DNA. To characterize the underlying mutations that allowed de novo gene emergence and their order of occurrence, homologous regions must be detected within noncoding sequences in closely related sister genomes. So far, most studies do not detect noncoding homologs of de novo genes because of incomplete assemblies and annotations, and long evolutionary distances separating genomes. Here, we overcome these issues by searching for de novo expressed open reading frames (neORFs), the not-yet fixed precursors of de novo genes that emerged within a single species. We sequenced and assembled genomes with long-read technology and the corresponding transcriptomes from inbred lines of Drosophila melanogaster, derived from seven geographically diverse populations. We found line-specific neORFs in abundance but few neORFs shared by lines, suggesting a rapid turnover. Gain and loss of transcription is more frequent than the creation of ORFs, for example, by forming new start and stop codons. Consequently, the gain of ORFs becomes rate limiting and is frequently the initial step in neORFs emergence. Furthermore, transposable elements (TEs) are major drivers for intragenomic duplications of neORFs, yet TE insertions are less important for the emergence of neORFs. However, highly mutable genomic regions around TEs provide new features that enable gene birth. In conclusion, neORFs have a high birth-death rate, are rapidly purged, but surviving neORFs spread neutrally through populations and within genomes.
Collapse
Affiliation(s)
- Anna Grandchamp
- Institute for Evolution and Biodiversity, University of Münster, 48149 Münster, Germany;
| | - Lucas Kühl
- Institute for Evolution and Biodiversity, University of Münster, 48149 Münster, Germany
| | - Marie Lebherz
- Institute for Evolution and Biodiversity, University of Münster, 48149 Münster, Germany
| | - Kathrin Brüggemann
- Institute for Evolution and Biodiversity, University of Münster, 48149 Münster, Germany
| | - John Parsch
- Division of Evolutionary Biology, Faculty of Biology, Ludwig-Maximilians-Universität München, 82152 Munich, Germany
| | - Erich Bornberg-Bauer
- Institute for Evolution and Biodiversity, University of Münster, 48149 Münster, Germany
- Max Planck Institute for Biology Tübingen, Department of Protein Evolution, 72076 Tübingen, Germany
| |
Collapse
|
35
|
Averina OA, Permyakov OA, Emelianova MA, Guseva EA, Grigoryeva OO, Lovat ML, Egorova AE, Grinchenko AV, Kumeiko VV, Marey MV, Manskikh VN, Dontsova OA, Vyssokikh MY, Sergiev PV. Kidney-Related Function of Mitochondrial Protein Mitoregulin. Int J Mol Sci 2023; 24:ijms24109106. [PMID: 37240452 DOI: 10.3390/ijms24109106] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2023] [Revised: 05/15/2023] [Accepted: 05/18/2023] [Indexed: 05/28/2023] Open
Abstract
A small protein, Mitoregulin (Mtln), localizes in mitochondria and contributes to oxidative phosphorylation and fatty acid metabolism. Mtln knockout mice develop obesity on a high-fat diet, demonstrating elevated cardiolipin damage and suboptimal creatine kinase oligomerization in muscle tissue. Kidneys heavily depend on the oxidative phosphorylation in mitochondria. Here we report kidney-related phenotypes in aged Mtln knockout mice. Similar to Mtln knockout mice muscle mitochondria, those of the kidney demonstrate a decreased respiratory complex I activity and excessive cardiolipin damage. Aged male mice carrying Mtln knockout demonstrated an increased frequency of renal proximal tubules' degeneration. At the same time, a decreased glomerular filtration rate has been more frequently detected in aged female mice devoid of Mtln. An amount of Mtln partner protein, Cyb5r3, is drastically decreased in the kidneys of Mtln knockout mice.
Collapse
Affiliation(s)
- Olga A Averina
- Institute of Functional Genomics, Lomonosov Moscow State University, 119992 Moscow, Russia
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, 119992 Moscow, Russia
| | - Oleg A Permyakov
- Institute of Functional Genomics, Lomonosov Moscow State University, 119992 Moscow, Russia
| | - Mariia A Emelianova
- Center for Life Sciences, Skolkovo Institute of Science and Technology, 143025 Moscow, Russia
| | - Ekaterina A Guseva
- Center for Life Sciences, Skolkovo Institute of Science and Technology, 143025 Moscow, Russia
| | - Olga O Grigoryeva
- Institute of Functional Genomics, Lomonosov Moscow State University, 119992 Moscow, Russia
| | - Maxim L Lovat
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, 119992 Moscow, Russia
- Institute of Mitoengineering MSU, 119992 Moscow, Russia
| | - Anna E Egorova
- Institute of Life Sciences and Biomedicine, Far Eastern Federal University, 690922 Vladivostok, Russia
| | - Andrei V Grinchenko
- A.V. Zhirmunsky National Scientific Center of Marine Biology, 690041 Vladivostok, Russia
| | - Vadim V Kumeiko
- Institute of Life Sciences and Biomedicine, Far Eastern Federal University, 690922 Vladivostok, Russia
- A.V. Zhirmunsky National Scientific Center of Marine Biology, 690041 Vladivostok, Russia
| | - Maria V Marey
- National Medical Research Center for Obstetrics, Gynecology and Perinatology Named after Academician V.I.Kulakov, 117198 Moscow, Russia
| | - Vasily N Manskikh
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, 119992 Moscow, Russia
- Institute of Mitoengineering MSU, 119992 Moscow, Russia
| | - Olga A Dontsova
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, 119992 Moscow, Russia
- Center for Life Sciences, Skolkovo Institute of Science and Technology, 143025 Moscow, Russia
- Department of Chemistry, Lomonosov Moscow State University, 119991 Moscow, Russia
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, 119992 Moscow, Russia
| | - Mikhail Y Vyssokikh
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, 119992 Moscow, Russia
- National Medical Research Center for Obstetrics, Gynecology and Perinatology Named after Academician V.I.Kulakov, 117198 Moscow, Russia
| | - Petr V Sergiev
- Institute of Functional Genomics, Lomonosov Moscow State University, 119992 Moscow, Russia
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, 119992 Moscow, Russia
- Center for Life Sciences, Skolkovo Institute of Science and Technology, 143025 Moscow, Russia
- Department of Chemistry, Lomonosov Moscow State University, 119991 Moscow, Russia
| |
Collapse
|
36
|
Inchingolo MA, Diman A, Adamczewski M, Humphreys T, Jaquier-Gubler P, Curran JA. TP53BP1, a dual-coding gene, uses promoter switching and translational reinitiation to express a smORF protein. iScience 2023; 26:106757. [PMID: 37216125 PMCID: PMC10193022 DOI: 10.1016/j.isci.2023.106757] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2022] [Revised: 03/07/2023] [Accepted: 04/24/2023] [Indexed: 05/24/2023] Open
Abstract
The complexity of the metazoan proteome is significantly increased by the expression of small proteins (<100 aa) derived from smORFs within lncRNAs, uORFs, 3' UTRs and, reading frames overlapping the CDS. These smORF encoded proteins (SEPs) have diverse roles, ranging from the regulation of cellular physiological to essential developmental functions. We report the characterization of a new member of this protein family, SEP53BP1, derived from a small internal ORF that overlaps the CDS encoding 53BP1. Its expression is coupled to the utilization of an alternative, cell-type specific promoter coupled to translational reinitiation events mediated by a uORF in the alternative 5' TL of the mRNA. This uORF-mediated reinitiation at an internal ORF is also observed in zebrafish. Interactome studies indicate that the human SEP53BP1 associates with components of the protein turnover pathway including the proteasome, and the TRiC/CCT chaperonin complex, suggesting that it may play a role in cellular proteostasis.
Collapse
Affiliation(s)
- Marta A. Inchingolo
- Department of Microbiology and Molecular Medicine, Faculty of Medicine, University of Geneva, Geneva, Switzerland
| | - Aurélie Diman
- Department of Microbiology and Molecular Medicine, Faculty of Medicine, University of Geneva, Geneva, Switzerland
| | - Maxime Adamczewski
- Department of Microbiology and Molecular Medicine, Faculty of Medicine, University of Geneva, Geneva, Switzerland
- Faculté de Médecine et Pharmacie, Université Grenoble Alpes, Grenoble, France
| | - Tom Humphreys
- Department of Microbiology and Molecular Medicine, Faculty of Medicine, University of Geneva, Geneva, Switzerland
- Faculty of Biology, Medicine and Health, University of Manchester, Manchester, UK
| | - Pascale Jaquier-Gubler
- Department of Microbiology and Molecular Medicine, Faculty of Medicine, University of Geneva, Geneva, Switzerland
| | - Joseph A. Curran
- Department of Microbiology and Molecular Medicine, Faculty of Medicine, University of Geneva, Geneva, Switzerland
- Institute of Genetics and Genomics of Geneva (iGE3), University of Geneva, Geneva, Switzerland
| |
Collapse
|
37
|
Tornini VA, Miao L, Lee HJ, Gerson T, Dube SE, Schmidt V, Kroll F, Tang Y, Du K, Kuchroo M, Vejnar CE, Bazzini AA, Krishnaswamy S, Rihel J, Giraldez AJ. linc-mipep and linc-wrb encode micropeptides that regulate chromatin accessibility in vertebrate-specific neural cells. eLife 2023; 12:e82249. [PMID: 37191016 PMCID: PMC10188112 DOI: 10.7554/elife.82249] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2022] [Accepted: 04/14/2023] [Indexed: 05/17/2023] Open
Abstract
Thousands of long intergenic non-coding RNAs (lincRNAs) are transcribed throughout the vertebrate genome. A subset of lincRNAs enriched in developing brains have recently been found to contain cryptic open-reading frames and are speculated to encode micropeptides. However, systematic identification and functional assessment of these transcripts have been hindered by technical challenges caused by their small size. Here, we show that two putative lincRNAs (linc-mipep, also called lnc-rps25, and linc-wrb) encode micropeptides with homology to the vertebrate-specific chromatin architectural protein, Hmgn1, and demonstrate that they are required for development of vertebrate-specific brain cell types. Specifically, we show that NMDA receptor-mediated pathways are dysregulated in zebrafish lacking these micropeptides and that their loss preferentially alters the gene regulatory networks that establish cerebellar cells and oligodendrocytes - evolutionarily newer cell types that develop postnatally in humans. These findings reveal a key missing link in the evolution of vertebrate brain cell development and illustrate a genetic basis for how some neural cell types are more susceptible to chromatin disruptions, with implications for neurodevelopmental disorders and disease.
Collapse
Affiliation(s)
| | - Liyun Miao
- Department of Genetics, Yale UniversityNew HavenUnited States
| | - Ho-Joon Lee
- Department of Genetics, Yale UniversityNew HavenUnited States
- Yale Center for Genome Analysis, Yale UniversityNew HavenUnited States
| | - Timothy Gerson
- Department of Genetics, Yale UniversityNew HavenUnited States
| | - Sarah E Dube
- Department of Genetics, Yale UniversityNew HavenUnited States
| | - Valeria Schmidt
- Department of Genetics, Yale UniversityNew HavenUnited States
| | - François Kroll
- Department of Cell and Developmental Biology, University College LondonLondonUnited Kingdom
| | - Yin Tang
- Department of Genetics, Yale UniversityNew HavenUnited States
| | - Katherine Du
- Department of Genetics, Yale UniversityNew HavenUnited States
- Department of Computer Science, Yale UniversityNew HavenUnited States
| | - Manik Kuchroo
- Department of Genetics, Yale UniversityNew HavenUnited States
- Department of Computer Science, Yale UniversityNew HavenUnited States
| | | | - Ariel Alejandro Bazzini
- Stowers Institute for Medical ResearchKansas CityUnited States
- Department of Molecular & Integrative Physiology, University of Kansas School of MedicineKansas CityUnited States
| | - Smita Krishnaswamy
- Department of Genetics, Yale UniversityNew HavenUnited States
- Department of Computer Science, Yale UniversityNew HavenUnited States
| | - Jason Rihel
- Department of Cell and Developmental Biology, University College LondonLondonUnited Kingdom
| | - Antonio J Giraldez
- Department of Genetics, Yale UniversityNew HavenUnited States
- Yale Stem Cell Center, Yale University School of MedicineNew HavenUnited States
- Yale Cancer Center, Yale University School of MedicineNew HavenUnited States
| |
Collapse
|
38
|
Averina OA, Permyakov OA, Emelianova MA, Grigoryeva OO, Lovat ML, Egorova AE, Grinchenko AV, Kumeiko VV, Marey MV, Manskikh VN, Dontsova OA, Vysokikh MY, Sergiev PV. Mitoregulin Contributes to Creatine Shuttling and Cardiolipin Protection in Mice Muscle. Int J Mol Sci 2023; 24:ijms24087589. [PMID: 37108753 PMCID: PMC10143810 DOI: 10.3390/ijms24087589] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2023] [Revised: 04/06/2023] [Accepted: 04/15/2023] [Indexed: 04/29/2023] Open
Abstract
Small peptides compose a large share of the mitochondrial proteome. Mitoregulin (Mtln) is a mitochondrial peptide known to contribute to the respiratory complex I functioning and other processes in mitochondria. In our previous studies, we demonstrated that Mtln knockout mice develop obesity and accumulate triglycerides and other oxidation substrates in serum, concomitant with an exhaustion of tricarboxylic acids cycle intermediates. Here we examined the functional role of Mtln in skeletal muscles, one of the major energy consuming tissues. We observed reduced muscle strength for Mtln knockout mice. Decrease of the mitochondrial cardiolipin and concomitant increase in monolysocardiolipin concentration upon Mtln inactivation is likely to be a consequence of imbalance between oxidative damage and remodeling of cardiolipin. It is accompanied by the mitochondrial creatine kinase octamer dissociation and suboptimal respiratory chain performance in Mtln knockout mice.
Collapse
Affiliation(s)
- Olga A Averina
- Institute of Functional Genomics, Lomonosov Moscow State University, 119992 Moscow, Russia
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, 119992 Moscow, Russia
| | - Oleg A Permyakov
- Institute of Functional Genomics, Lomonosov Moscow State University, 119992 Moscow, Russia
| | - Mariia A Emelianova
- Center for Life Sciences, Skolkovo Institute of Science and Technology, 143025 Moscow, Russia
| | - Olga O Grigoryeva
- Institute of Functional Genomics, Lomonosov Moscow State University, 119992 Moscow, Russia
| | - Maxim L Lovat
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, 119992 Moscow, Russia
- Institute of Mitoengineering MSU, 119992 Moscow, Russia
| | - Anna E Egorova
- Institute of Life Sciences and Biomedicine, Far Eastern Federal University, 690922 Vladivostok, Russia
| | - Andrei V Grinchenko
- A.V. Zhirmunsky National Scientific Center of Marine Biology, 690041 Vladivostok, Russia
| | - Vadim V Kumeiko
- Institute of Life Sciences and Biomedicine, Far Eastern Federal University, 690922 Vladivostok, Russia
- A.V. Zhirmunsky National Scientific Center of Marine Biology, 690041 Vladivostok, Russia
| | - Maria V Marey
- Research Center for Obstetrics, Gynecology and Perinatology, 117198 Moscow, Russia
| | - Vasily N Manskikh
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, 119992 Moscow, Russia
- Institute of Mitoengineering MSU, 119992 Moscow, Russia
| | - Olga A Dontsova
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, 119992 Moscow, Russia
- Center for Life Sciences, Skolkovo Institute of Science and Technology, 143025 Moscow, Russia
- Department of Chemistry, Lomonosov Moscow State University, 119991 Moscow, Russia
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, 119992 Moscow, Russia
| | - Mikhail Yu Vysokikh
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, 119992 Moscow, Russia
- Research Center for Obstetrics, Gynecology and Perinatology, 117198 Moscow, Russia
| | - Petr V Sergiev
- Institute of Functional Genomics, Lomonosov Moscow State University, 119992 Moscow, Russia
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, 119992 Moscow, Russia
- Center for Life Sciences, Skolkovo Institute of Science and Technology, 143025 Moscow, Russia
- Department of Chemistry, Lomonosov Moscow State University, 119991 Moscow, Russia
| |
Collapse
|
39
|
Pueyo JI, Salazar J, Grincho C, Berni J, Towler BP, Newbury SF. Purriato is a conserved small open reading frame gene that interacts with the CASA pathway to regulate muscle homeostasis and epithelial tissue growth in Drosophila. Front Cell Dev Biol 2023; 11:1117454. [PMID: 36968202 PMCID: PMC10036370 DOI: 10.3389/fcell.2023.1117454] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2022] [Accepted: 02/24/2023] [Indexed: 03/12/2023] Open
Abstract
Recent advances in proteogenomic techniques and bioinformatic pipelines have permitted the detection of thousands of translated small Open Reading Frames (smORFs), which contain less than 100 codons, in eukaryotic genomes. Hundreds of these actively translated smORFs display conserved sequence, structure and evolutionary signatures indicating that the translated peptides could fulfil important biological roles. Despite their abundance, only tens of smORF genes have been fully characterised; these act mainly as regulators of canonical proteins involved in essential cellular processes. Importantly, some of these smORFs display conserved functions with their mutations being associated with pathogenesis. Thus, investigating smORF roles in Drosophila will not only expand our understanding of their functions but it may have an impact in human health. Here we describe the function of a novel and essential Drosophila smORF gene named purriato (prto). prto belongs to an ancient gene family whose members have expanded throughout the Protostomia clade. prto encodes a transmembrane peptide which is localized in endo-lysosomes and perinuclear and plasma membranes. prto is dynamically expressed in mesodermal tissues and imaginal discs. Targeted prto knockdown (KD) in these organs results in changes in nuclear morphology and endo-lysosomal distributions correlating with the loss of sarcomeric homeostasis in muscles and reduction of mitosis in wing discs. Consequently, prto KD mutants display severe reduction of motility, and shorter wings. Finally, our genetic interaction experiments show that prto function is closely associated to the CASA pathway, a conserved mechanism involved in turnover of mis-folded proteins and linked to muscle dystrophies and neurodegenerative diseases. Thus, this study shows the relevance of smORFs in regulating important cellular functions and supports the systematic characterisation of this class of genes to understand their functions and evolution.
Collapse
Affiliation(s)
- Jose I. Pueyo
- Brighton and Sussex Medical School, University of Sussex, Brighton, United Kingdom
| | - Jorge Salazar
- Brighton and Sussex Medical School, University of Sussex, Brighton, United Kingdom
| | - Carolina Grincho
- Brighton and Sussex Medical School, University of Sussex, Brighton, United Kingdom
| | - Jimena Berni
- Brighton and Sussex Medical School, University of Sussex, Brighton, United Kingdom
| | - Benjamin P. Towler
- Brighton and Sussex Medical School, University of Sussex, Brighton, United Kingdom
- Department of Biochemistry and Biomedicine, School of Life Sciences, University of Sussex, Brighton, United Kingdom
| | - Sarah F. Newbury
- Brighton and Sussex Medical School, University of Sussex, Brighton, United Kingdom
| |
Collapse
|
40
|
Quaife NM, Chothani S, Schulz JF, Lindberg EL, Vanezis K, Adami E, O'Fee K, Greiner J, Litviňuková M, van Heesch S, Whiffin N, Hubner N, Schafer S, Rackham O, Cook SA, Barton PJR. LINC01013 Is a Determinant of Fibroblast Activation and Encodes a Novel Fibroblast-Activating Micropeptide. J Cardiovasc Transl Res 2023; 16:77-85. [PMID: 35759180 PMCID: PMC9944705 DOI: 10.1007/s12265-022-10288-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/15/2021] [Accepted: 06/09/2022] [Indexed: 10/17/2022]
Abstract
Myocardial fibrosis confers an almost threefold mortality risk in heart disease. There are no prognostic therapies and novel therapeutic targets are needed. Many thousands of unannotated small open reading frames (smORFs) have been identified across the genome with potential to produce micropeptides (< 100 amino acids). We sought to investigate the role of smORFs in myocardial fibroblast activation.Analysis of human cardiac atrial fibroblasts (HCFs) stimulated with profibrotic TGFβ1 using RNA sequencing (RNA-Seq) and ribosome profiling (Ribo-Seq) identified long intergenic non-coding RNA LINC01013 as TGFβ1 responsive and containing an actively translated smORF. Knockdown of LINC01013 using siRNA reduced expression of profibrotic markers at baseline and blunted their response to TGFβ1. In contrast, overexpression of a codon-optimised smORF invoked a profibrotic response comparable to that seen with TGFβ1 treatment, whilst FLAG-tagged peptide associated with the mitochondria.Together, these data support a novel LINC01013 smORF micropeptide-mediated mechanism of fibroblast activation. TGFβ1 stimulation of atrial fibroblasts induces expression of LINC01013, whose knockdown reduces fibroblast activation. Overexpression of a smORF contained within LINC01013 localises to mitochondria and activates fibroblasts.
Collapse
Affiliation(s)
- N M Quaife
- National Heart and Lung Institute, Imperial College London, London, UK
- MRC London Institute of Medical Sciences, London, UK
| | - S Chothani
- Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore, Singapore, 169857, Singapore
| | - J F Schulz
- Max-Delbrück-Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin, Germany
- DZHK (German Center for Cardiovascular Research), partner site Berlin, Berlin, Germany
| | - E L Lindberg
- Max-Delbrück-Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin, Germany
| | - K Vanezis
- National Heart and Lung Institute, Imperial College London, London, UK
- MRC London Institute of Medical Sciences, London, UK
| | - E Adami
- Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore, Singapore, 169857, Singapore
- Max-Delbrück-Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin, Germany
| | - K O'Fee
- MRC London Institute of Medical Sciences, London, UK
| | - J Greiner
- Max-Delbrück-Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin, Germany
| | - M Litviňuková
- Max-Delbrück-Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin, Germany
| | - S van Heesch
- Princess Máxima Center for Pediatric Oncology, Utrecht, The Netherlands
| | - N Whiffin
- National Heart and Lung Institute, Imperial College London, London, UK
- Cardiovascular Research Centre, Royal Brompton and Harefield Hospitals, Guy's and St Thomas NHS Foundation Trust, London, UK
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
| | - N Hubner
- Max-Delbrück-Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin, Germany
- DZHK (German Center for Cardiovascular Research), partner site Berlin, Berlin, Germany
- Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Berlin, Germany
| | - S Schafer
- Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore, Singapore, 169857, Singapore
| | - O Rackham
- Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore, Singapore, 169857, Singapore
| | - S A Cook
- MRC London Institute of Medical Sciences, London, UK
- Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore, Singapore, 169857, Singapore
- National Heart Centre Singapore, Singapore, Singapore
| | - P J R Barton
- National Heart and Lung Institute, Imperial College London, London, UK.
- MRC London Institute of Medical Sciences, London, UK.
- Cardiovascular Research Centre, Royal Brompton and Harefield Hospitals, Guy's and St Thomas NHS Foundation Trust, London, UK.
| |
Collapse
|
41
|
|
42
|
The Essentials on microRNA-Encoded Peptides from Plants to Animals. Biomolecules 2023; 13:biom13020206. [PMID: 36830576 PMCID: PMC9953219 DOI: 10.3390/biom13020206] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2022] [Revised: 01/06/2023] [Accepted: 01/08/2023] [Indexed: 01/22/2023] Open
Abstract
Primary transcripts of microRNAs (pri-miRNAs) were initially defined as long non-coding RNAs that host miRNAs further processed by the microRNA processor complex. A few years ago, however, it was discovered in plants that pri-miRNAs actually contain functional open reading frames (sORFs) that translate into small peptides called miPEPs, for microRNA-encoded peptides. Initially detected in Arabidopsis thaliana and Medicago truncatula, recent studies have revealed the presence of miPEPs in other pri-miRNAs as well as in other species ranging from various plant species to animals. This suggests that miPEP numbers remain largely underestimated and that they could be a common signature of pri-miRNAs. Here we present the most recent advances in miPEPs research and discuss how their discovery has broadened our vision of the regulation of gene expression by miRNAs, and how miPEPs could be interesting tools in sustainable agriculture or the treatment of certain human diseases.
Collapse
|
43
|
Wan L, Xiao W, Huang Z, Zhou A, Jiang Y, Zou B, Liu B, Deng C, Zhang Y. Systematic identification of smORFs in domestic silkworm ( Bombyx mori). PeerJ 2023; 11:e14682. [PMID: 36655040 PMCID: PMC9841908 DOI: 10.7717/peerj.14682] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2022] [Accepted: 12/13/2022] [Indexed: 01/15/2023] Open
Abstract
The silkworm (Bombyx mori) is not only an excellent model species, but also an important agricultural economic insect. Taking it as the research object, its advantages of low maintenance cost and no biohazard risks are considered. Small open reading frames (smORFs) are an important class of genomic elements that can produce bioactive peptides. However, the smORFs in silkworm had been poorly identified and studied. To further study the smORFs in silkworm, systematic genome-wide identification is essential. Here, we identified and analyzed smORFs in the silkworm using comprehensive methods. Our results showed that at least 738 highly reliable smORFs were found in B. mori and that 34,401 possible smORFs were partially supported. We also identified some differentially expressed and tissue-specific-expressed smORFs, which may be closely related to the characteristics and functions of the tissues. This article provides a basis for subsequent research on smORFs in silkworm, and also hopes to provide a reference point for future research methods for smORFs in other species.
Collapse
Affiliation(s)
- Linrong Wan
- Sericultural Research Institute,Sichuan Academy of Agricultural Sciences, Nanchong, Sichuan, China,College of Agronomy, Sichuan Agricultural University, Chengdu, Sichuan, China
| | - Wenfu Xiao
- Sericultural Research Institute,Sichuan Academy of Agricultural Sciences, Nanchong, Sichuan, China
| | - Ziyan Huang
- Research and Development Center, LyuKang, Chengdu, Sichuan, China,Departments of Bioinformatics, DNA Stories Bioinformatics Center, Chengdu, Sichuan, China
| | - Anlian Zhou
- Sericultural Research Institute,Sichuan Academy of Agricultural Sciences, Nanchong, Sichuan, China
| | - Yaming Jiang
- Sericultural Research Institute,Sichuan Academy of Agricultural Sciences, Nanchong, Sichuan, China
| | - Bangxing Zou
- Sericultural Research Institute,Sichuan Academy of Agricultural Sciences, Nanchong, Sichuan, China
| | - Binbin Liu
- Sericultural Research Institute,Sichuan Academy of Agricultural Sciences, Nanchong, Sichuan, China
| | - Cao Deng
- Research and Development Center, LyuKang, Chengdu, Sichuan, China,Departments of Bioinformatics, DNA Stories Bioinformatics Center, Chengdu, Sichuan, China
| | - Youhong Zhang
- Sericultural Research Institute,Sichuan Academy of Agricultural Sciences, Nanchong, Sichuan, China
| |
Collapse
|
44
|
Álvarez-Urdiola R, Borràs E, Valverde F, Matus JT, Sabidó E, Riechmann JL. Peptidomics Methods Applied to the Study of Flower Development. Methods Mol Biol 2023; 2686:509-536. [PMID: 37540375 DOI: 10.1007/978-1-0716-3299-4_24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/05/2023]
Abstract
Understanding the global and dynamic nature of plant developmental processes requires not only the study of the transcriptome, but also of the proteome, including its largely uncharacterized peptidome fraction. Recent advances in proteomics and high-throughput analyses of translating RNAs (ribosome profiling) have begun to address this issue, evidencing the existence of novel, uncharacterized, and possibly functional peptides. To validate the accumulation in tissues of sORF-encoded polypeptides (SEPs), the basic setup of proteomic analyses (i.e., LC-MS/MS) can be followed. However, the detection of peptides that are small (up to ~100 aa, 6-7 kDa) and novel (i.e., not annotated in reference databases) presents specific challenges that need to be addressed both experimentally and with computational biology resources. Several methods have been developed in recent years to isolate and identify peptides from plant tissues. In this chapter, we outline two different peptide extraction protocols and the subsequent peptide identification by mass spectrometry using the database search or the de novo identification methods.
Collapse
Affiliation(s)
- Raquel Álvarez-Urdiola
- Centre for Research in Agricultural Genomics (CRAG) CSIC-IRTA-UAB-UB, Edifici CRAG, Campus UAB, Cerdanyola del Vallès, Barcelona, Spain
| | - Eva Borràs
- Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology, Barcelona, Spain
- Universitat Pompeu Fabra, Barcelona, Spain
| | - Federico Valverde
- Institute for Plant Biochemistry and Photosynthesis CSIC - University of Seville, Seville, Spain
| | - José Tomás Matus
- Centre for Research in Agricultural Genomics (CRAG) CSIC-IRTA-UAB-UB, Edifici CRAG, Campus UAB, Cerdanyola del Vallès, Barcelona, Spain
- Institute for Integrative Systems Biology (I2SysBio), Universitat de València-CSIC, Paterna, Valencia, Spain
| | - Eduard Sabidó
- Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology, Barcelona, Spain
- Universitat Pompeu Fabra, Barcelona, Spain
| | - José Luis Riechmann
- Centre for Research in Agricultural Genomics (CRAG) CSIC-IRTA-UAB-UB, Edifici CRAG, Campus UAB, Cerdanyola del Vallès, Barcelona, Spain.
- Institució Catalana de Recerca i Estudis Avançats (ICREA), Barcelona, Spain.
| |
Collapse
|
45
|
Averina OA, Permyakov OA, Emelianova MA, Grigoryeva OO, Gulyaev MV, Pavlova OS, Mariasina SS, Frolova OY, Kurkina MV, Baydakova GV, Zakharova EY, Marey MV, Tsarev DA, Tashlitsky VN, Popov VS, Lovat ML, Polshakov VI, Vyssokikh MY, Sergiev PV. Mitochondrial peptide Mtln contributes to oxidative metabolism in mice. Biochimie 2023; 204:136-139. [PMID: 36174793 DOI: 10.1016/j.biochi.2022.09.009] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2022] [Revised: 09/10/2022] [Accepted: 09/12/2022] [Indexed: 01/12/2023]
Abstract
Mitoregulin (Mtln) is a recently identified 56 amino acid long mitochondrial peptide conserved in vertebrates. Mtln is known to enhance function of respiratory complex I, which is likely mediated by modulation of lipid composition. To address an influence of Mtln gene on the metabolism we created knockout mice deficient in Mtln gene. In line with accumulation of triglycerides observed earlier on a model of Mtln knockout cell lines, we observed Mtln KO mice to develop obesity on a high fat diet. An increased weight gain could be attributed to enhanced fat accumulation according to the magnetic resonance live imaging. In addition, Mtln KO mice demonstrate elevated serum triglycerides and other oxidation substrates accompanied by an exhaustion of tricarboxylic acids cycle intermediates, suggesting suboptimal oxidation of respiration substrates by mitochondria lacking Mtln.
Collapse
Affiliation(s)
- Olga A Averina
- Institute of Functional Genomics, Lomonosov Moscow State University, 119992, Moscow, Russia; Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Moscow, 119992, Russia
| | - Oleg A Permyakov
- Institute of Functional Genomics, Lomonosov Moscow State University, 119992, Moscow, Russia
| | - Mariia A Emelianova
- Center for Life Sciences, Skolkovo Institute of Science and Technology, Skolkovo, Moscow Region, 143025, Russia
| | - Olga O Grigoryeva
- Institute of Functional Genomics, Lomonosov Moscow State University, 119992, Moscow, Russia
| | - Mikhail V Gulyaev
- Faculty of Fundamental Medicine, Lomonosov Moscow State University, Moscow, 119991, Russia
| | - Olga S Pavlova
- Faculty of Fundamental Medicine, Lomonosov Moscow State University, Moscow, 119991, Russia
| | - Sofia S Mariasina
- Institute of Functional Genomics, Lomonosov Moscow State University, 119992, Moscow, Russia
| | | | - Marina V Kurkina
- Research Centre for Medical Genetics (FSBI, RCMG), 115522, Moscow, Russia
| | - Galina V Baydakova
- Research Centre for Medical Genetics (FSBI, RCMG), 115522, Moscow, Russia
| | | | - Maria V Marey
- Research Center for Obstetrics, Gynecology and Perinatology, 117198, Moscow, Russia
| | | | - Vadim N Tashlitsky
- Department of Chemistry, Lomonosov Moscow State University, Moscow, 119991, Russia
| | - Vladimir S Popov
- Faculty of Fundamental Medicine, Lomonosov Moscow State University, Moscow, 119991, Russia
| | - Maxim L Lovat
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Moscow, 119992, Russia; Faculty of Fundamental Medicine, Lomonosov Moscow State University, Moscow, 119991, Russia; Institute of Mitoengineering MSU, 119992, Moscow, Russia
| | - Vladimir I Polshakov
- Faculty of Fundamental Medicine, Lomonosov Moscow State University, Moscow, 119991, Russia
| | - Mikhail Yu Vyssokikh
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Moscow, 119992, Russia; Research Center for Obstetrics, Gynecology and Perinatology, 117198, Moscow, Russia.
| | - Petr V Sergiev
- Institute of Functional Genomics, Lomonosov Moscow State University, 119992, Moscow, Russia; Center for Life Sciences, Skolkovo Institute of Science and Technology, Skolkovo, Moscow Region, 143025, Russia; Department of Chemistry, Lomonosov Moscow State University, Moscow, 119991, Russia.
| |
Collapse
|
46
|
Chothani S, Ho L, Schafer S, Rackham O. Discovering microproteins: making the most of ribosome profiling data. RNA Biol 2023; 20:943-954. [PMID: 38013207 PMCID: PMC10730196 DOI: 10.1080/15476286.2023.2279845] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/30/2023] [Indexed: 11/29/2023] Open
Abstract
Building a reference set of protein-coding open reading frames (ORFs) has revolutionized biological process discovery and understanding. Traditionally, gene models have been confirmed using cDNA sequencing and encoded translated regions inferred using sequence-based detection of start and stop combinations longer than 100 amino-acids to prevent false positives. This has led to small ORFs (smORFs) and their encoded proteins left un-annotated. Ribo-seq allows deciphering translated regions from untranslated irrespective of the length. In this review, we describe the power of Ribo-seq data in detection of smORFs while discussing the major challenge posed by data-quality, -depth and -sparseness in identifying the start and end of smORF translation. In particular, we outline smORF cataloguing efforts in humans and the large differences that have arisen due to variation in data, methods and assumptions. Although current versions of smORF reference sets can already be used as a powerful tool for hypothesis generation, we recommend that future editions should consider these data limitations and adopt unified processing for the community to establish a canonical catalogue of translated smORFs.
Collapse
Affiliation(s)
- Sonia Chothani
- Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore, Singapore
| | - Lena Ho
- Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore, Singapore
| | - Sebastian Schafer
- Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore, Singapore
| | - Owen Rackham
- Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore, Singapore
- School of Biological Sciences, University of Southampton, Southampton, UK
- The Alan Turing Institute, The British Library, London, UK
| |
Collapse
|
47
|
Nabi A, Dilekoglu B, Adebali O, Tastan O. Discovering misannotated lncRNAs using deep learning training dynamics. Bioinformatics 2023; 39:6960922. [PMID: 36571493 PMCID: PMC9825752 DOI: 10.1093/bioinformatics/btac821] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2022] [Revised: 10/05/2022] [Accepted: 12/23/2022] [Indexed: 12/27/2022] Open
Abstract
MOTIVATION Recent experimental evidence has shown that some long non-coding RNAs (lncRNAs) contain small open reading frames (sORFs) that are translated into functional micropeptides, suggesting that these lncRNAs are misannotated as non-coding. Current methods to detect misannotated lncRNAs rely on ribosome-profiling (Ribo-Seq) and mass-spectrometry experiments, which are cell-type dependent and expensive. RESULTS Here, we propose a computational method to identify possible misannotated lncRNAs from sequence information alone. Our approach first builds deep learning models to discriminate coding and non-coding transcripts and leverages these models' training dynamics to identify misannotated lncRNAs-i.e. lncRNAs with coding potential. The set of misannotated lncRNAs we identified significantly overlap with experimentally validated ones and closely resemble coding protein sequences as evidenced by significant BLAST hits. Our analysis on a subset of misannotated lncRNA candidates also shows that some ORFs they contain yield high confidence folded structures as predicted by AlphaFold2. This methodology offers promising potential for assisting experimental efforts in characterizing the hidden proteome encoded by misannotated lncRNAs and for curating better datasets for building coding potential predictors. AVAILABILITY AND IMPLEMENTATION Source code is available at https://github.com/nabiafshan/DetectingMisannotatedLncRNAs. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Afshan Nabi
- Faculty of Engineering and Natural Sciences, Sabanci University, Istanbul 34956, Turkey
| | - Berke Dilekoglu
- Faculty of Engineering and Natural Sciences, Sabanci University, Istanbul 34956, Turkey
| | - Ogun Adebali
- Faculty of Engineering and Natural Sciences, Sabanci University, Istanbul 34956, Turkey
| | | |
Collapse
|
48
|
Vakirlis N, Vance Z, Duggan KM, McLysaght A. De novo birth of functional microproteins in the human lineage. Cell Rep 2022; 41:111808. [PMID: 36543139 PMCID: PMC10073203 DOI: 10.1016/j.celrep.2022.111808] [Citation(s) in RCA: 27] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2021] [Revised: 06/21/2022] [Accepted: 11/18/2022] [Indexed: 12/24/2022] Open
Abstract
Small open reading frames (sORFs) can encode functional "microproteins" that perform crucial biological tasks. However, their size makes them less amenable to genomic analysis, and their origins and conservation are poorly understood. Given their short length, it is plausible that some of these functional microproteins have recently originated entirely de novo from noncoding sequences. Here we sought to identify such cases in the human lineage by reconstructing the evolutionary origins of human microproteins previously found to have measurable, statistically significant fitness effects. By tracing the formation of each ORF and its transcriptional activation, we show that novel microproteins with significant phenotypic effects have emerged de novo throughout animal evolution, including two after the human-chimpanzee split. Notably, traditional methods for assessing coding potential would miss most of these cases. This evidence demonstrates that the functional potential intrinsic to sORFs can be relatively rapidly and frequently realized through de novo gene emergence.
Collapse
Affiliation(s)
- Nikolaos Vakirlis
- Institute for Fundamental Biomedical Research, Biomedical Sciences Research Center "Alexander Fleming", Vari, Greece.
| | - Zoe Vance
- Smurfit Institute of Genetics, Trinity College Dublin, University of Dublin, Dublin, Ireland
| | - Kate M Duggan
- Smurfit Institute of Genetics, Trinity College Dublin, University of Dublin, Dublin, Ireland
| | - Aoife McLysaght
- Smurfit Institute of Genetics, Trinity College Dublin, University of Dublin, Dublin, Ireland.
| |
Collapse
|
49
|
Singh D, Roy J. A large-scale benchmark study of tools for the classification of protein-coding and non-coding RNAs. Nucleic Acids Res 2022; 50:12094-12111. [PMID: 36420898 PMCID: PMC9757047 DOI: 10.1093/nar/gkac1092] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2022] [Revised: 10/22/2022] [Accepted: 10/28/2022] [Indexed: 11/27/2022] Open
Abstract
Identification of protein-coding and non-coding transcripts is paramount for understanding their biological roles. Computational approaches have been addressing this task for over a decade; however, generalized and high-performance models are still unreliable. This benchmark study assessed the performance of 24 tools producing >55 models on the datasets covering a wide range of species. We have collected 135 small and large transcriptomic datasets from existing studies for comparison and identified the potential bottlenecks hampering the performance of current tools. The key insights of this study include lack of standardized training sets, reliance on homogeneous training data, gradual changes in annotated data, lack of augmentation with homology searches, the presence of false positives and negatives in datasets and the lower performance of end-to-end deep learning models. We also derived a new dataset, RNAChallenge, from the benchmark considering hard instances that may include potential false alarms. The best and least well performing models under- and overfit the dataset, respectively, thereby serving a dual purpose. For computational approaches, it will be valuable to develop accurate and unbiased models. The identification of false alarms will be of interest for genome annotators, and experimental study of hard RNAs will help to untangle the complexity of the RNA world.
Collapse
Affiliation(s)
- Dalwinder Singh
- To whom correspondence should be addressed. Tel: +91 172 5221206;
| | - Joy Roy
- Correspondence may also be addressed to Joy Roy.
| |
Collapse
|
50
|
Zhang M, Zhao J, Li C, Ge F, Wu J, Jiang B, Song J, Song X. csORF-finder: an effective ensemble learning framework for accurate identification of multi-species coding short open reading frames. Brief Bioinform 2022; 23:bbac392. [PMID: 36094083 PMCID: PMC9677467 DOI: 10.1093/bib/bbac392] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2022] [Revised: 08/03/2022] [Accepted: 08/11/2022] [Indexed: 12/14/2022] Open
Abstract
Short open reading frames (sORFs) refer to the small nucleic fragments no longer than 303 nt in length that probably encode small peptides. To date, translatable sORFs have been found in both untranslated regions of messenger ribonucleic acids (RNAs; mRNAs) and long non-coding RNAs (lncRNAs), playing vital roles in a myriad of biological processes. As not all sORFs are translated or essentially translatable, it is important to develop a highly accurate computational tool for characterizing the coding potential of sORFs, thereby facilitating discovery of novel functional peptides. In light of this, we designed a series of ensemble models by integrating Efficient-CapsNet and LightGBM, collectively termed csORF-finder, to differentiate the coding sORFs (csORFs) from non-coding sORFs in Homo sapiens, Mus musculus and Drosophila melanogaster, respectively. To improve the performance of csORF-finder, we introduced a novel feature encoding scheme named trinucleotide deviation from expected mean (TDE) and computed all types of in-frame sequence-based features, such as i-framed-3mer, i-framed-CKSNAP and i-framed-TDE. Benchmarking results showed that these features could significantly boost the performance compared to the original 3-mer, CKSNAP and TDE features. Our performance comparisons showed that csORF-finder achieved a superior performance than the state-of-the-art methods for csORF prediction on multi-species and non-ATG initiation independent test datasets. Furthermore, we applied csORF-finder to screen the lncRNA datasets for identifying potential csORFs. The resulting data serve as an important computational repository for further experimental validation. We hope that csORF-finder can be exploited as a powerful platform for high-throughput identification of csORFs and functional characterization of these csORFs encoded peptides.
Collapse
Affiliation(s)
- Meng Zhang
- Department of Biomedical Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China
| | - Jian Zhao
- Department of Biomedical Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China
| | - Chen Li
- Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia
| | - Fang Ge
- School of Computer Science and Engineering, Nanjing University of Science and Technology, 200 Xiaolingwei, Nanjing 210094, China
| | - Jing Wu
- School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing 211166, China
| | - Bin Jiang
- College of Automation Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China
| | - Jiangning Song
- Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia
- Monash Data Futures Institute, Monash University, Melbourne, VIC 3800, Australia
| | - Xiaofeng Song
- Department of Biomedical Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China
| |
Collapse
|